<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
		<id>http://muscle.biouml.org/index.php?action=history&amp;feed=atom&amp;title=Methods</id>
		<title>Methods - Revision history</title>
		<link rel="self" type="application/atom+xml" href="http://muscle.biouml.org/index.php?action=history&amp;feed=atom&amp;title=Methods"/>
		<link rel="alternate" type="text/html" href="http://muscle.biouml.org/index.php?title=Methods&amp;action=history"/>
		<updated>2026-05-16T05:15:53Z</updated>
		<subtitle>Revision history for this page on the wiki</subtitle>
		<generator>MediaWiki 1.29.2</generator>

	<entry>
		<id>http://muscle.biouml.org/index.php?title=Methods&amp;diff=424&amp;oldid=prev</id>
		<title>Sspintus@dote.ru: /* Gene set enrichment analysis (GSEA) */</title>
		<link rel="alternate" type="text/html" href="http://muscle.biouml.org/index.php?title=Methods&amp;diff=424&amp;oldid=prev"/>
				<updated>2021-03-03T07:18:43Z</updated>
		
		<summary type="html">&lt;p&gt;‎&lt;span dir=&quot;auto&quot;&gt;&lt;span class=&quot;autocomment&quot;&gt;Gene set enrichment analysis (GSEA)&lt;/span&gt;&lt;/span&gt;&lt;/p&gt;
&lt;table class=&quot;diff diff-contentalign-left&quot; data-mw=&quot;interface&quot;&gt;
				&lt;col class='diff-marker' /&gt;
				&lt;col class='diff-content' /&gt;
				&lt;col class='diff-marker' /&gt;
				&lt;col class='diff-content' /&gt;
				&lt;tr style='vertical-align: top;' lang='en'&gt;
				&lt;td colspan='2' style=&quot;background-color: white; color:black; text-align: center;&quot;&gt;← Older revision&lt;/td&gt;
				&lt;td colspan='2' style=&quot;background-color: white; color:black; text-align: center;&quot;&gt;Revision as of 07:18, 3 March 2021&lt;/td&gt;
				&lt;/tr&gt;&lt;tr&gt;&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot; id=&quot;mw-diff-left-l18&quot; &gt;Line 18:&lt;/td&gt;
&lt;td colspan=&quot;2&quot; class=&quot;diff-lineno&quot;&gt;Line 18:&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;We used DeSEQ2 software to estimate log2 fold change (LFC) of expression of each peak between control and case samples as well as P value as probability of rejecting the null hypothesis which stated that mean expression of a peak is equal between the case and control samples. The P values were adjusted using Bonferroni-Hochberg false discovery rate method. The set of differentially expressed peaks for each case was identified by applying the cutoffs of |LFC| &amp;gt; 1.25 and Padj &amp;lt; 0.05.&amp;#160; We considered an enhancer or a gene differentially expressed if at least one of the peaks within the region of interest was identified as differentially expressed.&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;We used DeSEQ2 software to estimate log2 fold change (LFC) of expression of each peak between control and case samples as well as P value as probability of rejecting the null hypothesis which stated that mean expression of a peak is equal between the case and control samples. The P values were adjusted using Bonferroni-Hochberg false discovery rate method. The set of differentially expressed peaks for each case was identified by applying the cutoffs of |LFC| &amp;gt; 1.25 and Padj &amp;lt; 0.05.&amp;#160; We considered an enhancer or a gene differentially expressed if at least one of the peaks within the region of interest was identified as differentially expressed.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==Gene set enrichment analysis (GSEA)==&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;&amp;#160;&lt;/td&gt;&lt;td style=&quot;background-color: #f9f9f9; color: #333333; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #e6e6e6; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;==Gene set enrichment analysis (GSEA)==&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;
&lt;tr&gt;&lt;td class='diff-marker'&gt;−&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #ffe49c; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;We performed gene set enrichment analysis using PANTHER Slim GO biological process database and webserver using Fisher exact test and Bonferroni P-value adjustment. For observatory GSEA analysis (ST11) we also used REVIGO web service with allowed similarity 0.5 (small set of results) against R. norvegicus GO database.&lt;/div&gt;&lt;/td&gt;&lt;td class='diff-marker'&gt;+&lt;/td&gt;&lt;td style=&quot;color:black; font-size: 88%; border-style: solid; border-width: 1px 1px 1px 4px; border-radius: 0.33em; border-color: #a3d3ff; vertical-align: top; white-space: pre-wrap;&quot;&gt;&lt;div&gt;We performed gene set enrichment analysis using PANTHER Slim GO biological process database and webserver using Fisher exact test and Bonferroni P-value adjustment. For observatory GSEA analysis (&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;[https://docs.google.com/spreadsheets/d/1IeCjpGl6ovKECCGxl1oqOitnFTV7m8Ui_09llmK_GcE/edit#gid=1668182601 &lt;/ins&gt;ST11&lt;ins class=&quot;diffchange diffchange-inline&quot;&gt;]&lt;/ins&gt;) we also used REVIGO web service with allowed similarity 0.5 (small set of results) against R. norvegicus GO database.&lt;/div&gt;&lt;/td&gt;&lt;/tr&gt;

&lt;!-- diff cache key my_wiki:diff:version:1.11a:oldid:423:newid:424 --&gt;
&lt;/table&gt;</summary>
		<author><name>Sspintus@dote.ru</name></author>	</entry>

	<entry>
		<id>http://muscle.biouml.org/index.php?title=Methods&amp;diff=423&amp;oldid=prev</id>
		<title>Sspintus@dote.ru: Created page with &quot; Single-read sequences were analyzed for quality and overrepresented adapter sequences with theFastQC tool. Quality filtering and adapter  trimming were performed with the Tri...&quot;</title>
		<link rel="alternate" type="text/html" href="http://muscle.biouml.org/index.php?title=Methods&amp;diff=423&amp;oldid=prev"/>
				<updated>2021-03-03T07:17:49Z</updated>
		
		<summary type="html">&lt;p&gt;Created page with &amp;quot; Single-read sequences were analyzed for quality and overrepresented adapter sequences with theFastQC tool. Quality filtering and adapter  trimming were performed with the Tri...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;&lt;br /&gt;
Single-read sequences were analyzed for quality and overrepresented adapter sequences with theFastQC tool. Quality filtering and adapter  trimming were performed with the Trimmomatic tool v0.39. Read mapping on rat genome Rnor6 was performed with  the STAR local alignment tool  version 2.6.1. CAGE tag start sites (CTSS) for each sample were imputed using the PromoterPipeline aggregation script from C1 CAGE protocol. Peak clustering was performed with Decomposition-based Peak Identification (DPI1) tool.  Bidirectional enhancers were identified using the pipeline by Andersson et al, 2014.  Statistical significance of differential expression of TSS and CTSS peaks was calculated using the DeSeq2 tool.&lt;br /&gt;
==Quality Control==&lt;br /&gt;
For adapter clipping we used maximum mismatch count of 2, palindrome threshold of 30 and simple threshold of 10. For sliding window quality trimming we used  window size of 4 nucleotides and average PHRED score over the window was required not less than 15. Leading and trailing nucleotides of each read were removed if their quality was less than 3. All reads with length less than 36 were discarded.&lt;br /&gt;
==Genome, Annotation and Indexing==&lt;br /&gt;
As a reference genome we used Rattus norvegicus Rnor_6.0 (build NCBI:GCA_000001895.4) “top level” assembly acquired from Ensembl v93. As a source of annotated spice junctions for genome indexing we used  Ensembl v93 R. norvegicus genome annotation.&lt;br /&gt;
==Read Mapping==&lt;br /&gt;
For local alignment of the reads onto Rattus norvegicus genome Rnor_6.0 (build NCBI:GCA_000001895.4) “top level” assembly  we initially allowed for multimap reads to map up to 20 loci. We also required minimum splice alignment overhang of 8 nt for de novo splice junctions (--alignSJoverhangMin 8) and 1 nt for splice junctions used from genome annotation (--alignSJDBoverhangMin 1), spliced alignments were filtered using output splice junctions (--outFilterType BySJout), for maximum number of mismatches per pair relative to read length default value of 0.04 was used (--outFilterMismatchNoverReadLmax 0.04) and minimum intron length was set to 20 nt (--alignIntronMin 20). [34]&lt;br /&gt;
==CTSS Aggregation==&lt;br /&gt;
All alignments were filtered for absence of “read unmapped” flag (flag value 4). Alignments that passed filtering were aggregated with PromoterPipeline tool version 2015.05.16, and resulting CTSS were sorted by chromosome and coordinate.&lt;br /&gt;
==TSS Peaks==&lt;br /&gt;
We clustered CTSS peaks with DPI1 in SPI mode in all samples of SOL and EDL muscles, respectively. Output peaks were filtered by maximum CTSS counts over the peak and maximum TPM normalized CTSS counts per peak. Thus, three sets of output peaks were obtained: unfiltered, ‘permissive’ (ctssMaxCounts3) and ‘robust’ (ctssMaxCounts11, ctssMaxTpm1).&lt;br /&gt;
==Enhancers==&lt;br /&gt;
We identified putative enhancers as bidirectional promoters using the Enhancers software [Andersson et al, 2014; doi:10.1038/nature12787]. To avoid false identification of intergenic enhancers we masked all  loci proximate or including known TSS (+- 500 nt) and exons (+- 200 nt), according to genome  annotation. Enhancers were searched among ‘permissive’ sets of TSS peaks of fast (EDL) and slow (soleus) muscle CTSS.&lt;br /&gt;
==Annotation of TSS peaks to genes and enhancers==&lt;br /&gt;
We annotated all TSS peaks to corresponding enhancers by the criteria of stradless coordinate intersection. To keep consistency with enhancers, which are extended by 200nt by the enhancers pipeline, we extended gene regions by 200nt from 3’ and 5’ ends before intersecting them strand-wise with the permissive set of TSS peaks. Intersected peaks were annotated to corresponding genes.&lt;br /&gt;
==Differentially expressed peaks, genes and enhancers==&lt;br /&gt;
We used DeSEQ2 software to estimate log2 fold change (LFC) of expression of each peak between control and case samples as well as P value as probability of rejecting the null hypothesis which stated that mean expression of a peak is equal between the case and control samples. The P values were adjusted using Bonferroni-Hochberg false discovery rate method. The set of differentially expressed peaks for each case was identified by applying the cutoffs of |LFC| &amp;gt; 1.25 and Padj &amp;lt; 0.05.  We considered an enhancer or a gene differentially expressed if at least one of the peaks within the region of interest was identified as differentially expressed.&lt;br /&gt;
==Gene set enrichment analysis (GSEA)==&lt;br /&gt;
We performed gene set enrichment analysis using PANTHER Slim GO biological process database and webserver using Fisher exact test and Bonferroni P-value adjustment. For observatory GSEA analysis (ST11) we also used REVIGO web service with allowed similarity 0.5 (small set of results) against R. norvegicus GO database.&lt;/div&gt;</summary>
		<author><name>Sspintus@dote.ru</name></author>	</entry>

	</feed>