Skip to main content
Howard Hughes Medical Institute Author Manuscripts logoLink to Howard Hughes Medical Institute Author Manuscripts
. Author manuscript; available in PMC: 2014 Aug 13.
Published in final edited form as: Cell Rep. 2014 Jan 30;6(3):455–466. doi: 10.1016/j.celrep.2014.01.008

Pol II docking and pausing at growth and stress genes in C. elegans

Colin S Maxwell 1, William S Kruesi 2, Leighton J Core 4, Nicole Kurhanewicz 1,3, Colin T Waters 4, Caitlin L Lewarch 1,5, Igor Antoshechkin 6, John T Lis 4, Barbara J Meyer 2, L Ryan Baugh 1,*
PMCID: PMC4026043  NIHMSID: NIHMS579102  PMID: 24485661

Summary

Fluctuations in nutrient availability profoundly impact gene expression. Previous work revealed post-recruitment regulation of RNA Polymerase II (Pol II) during starvation and recovery in Caenorhabitis elegans, suggesting promoter- proximal pausing promotes rapid response to feeding. To test this hypothesis, we measured Pol II elongation genome-wide by two complementary approaches and analyzed elongation in conjunction with Pol II binding and expression. We confirmed bona fide pausing during starvation and also discovered Pol II docking. Pausing occurs at active stress-response genes that become down-regulated in response to feeding. In contrast “docked” Pol II accumulates without initiating upstream of inactive growth genes that become rapidly up-regulated upon feeding. Beyond differences in function and expression, these two sets of genes have different core promoter motifs, suggesting alternative transcriptional machinery. Our work suggests that growth and stress genes are both regulated post- recruitment during starvation, but at initiation and elongation, respectively, coordinating gene expression with nutrient availability.

Keywords: Pol II pausing, Pol II docking, transcription initiation, transcription elongation, starvation, quiescence, L1 arrest, L1 diapause, Caenorhabditis elegans

Introduction

All organisms must cope with fluctuations in environmental conditions. There is a pervasive difference in the genes expressed during stressful conditions and those that support growth, demonstrating a fundamental role of transcriptional regulation (Gasch et al., 2000). Rapid and coordinated responses to changes in environmental conditions are essential, but the mechanisms responsible are not well understood.

For the nematode C. elegans, life in the wild is feast or famine, making it an ideal metazoan model to investigate transcriptional responses to nutrient availability. Larvae that hatch without food arrest development in the first larval stage (L1 arrest or L1 diapause) and become resistant to stress (Baugh, 2013). Arrested larvae respond rapidly to feeding, dramatically altering gene expression and initiating growth (Baugh et al., 2009; Maxwell et al., 2012). During L1 arrest RNA Polymerase II (Pol II) accumulates at the 5′ end of genes that are up-regulated during recovery (Baugh et al., 2009), suggesting that post-recruitment regulation of Pol II contributes to nutritional control of transcription.

It has become clear in recent years that post-recruitment regulation of early elongation (pausing) is widespread in the animals where it has been investigated (Core et al., 2008; Kim et al., 2005; Muse et al., 2007; Rahl et al., 2010; Zeitlinger et al., 2007). Pausing has been suggested to promote rapid response to changes in environmental conditions and during development, as in the heat shock response where it was first discovered (Muse et al., 2007; Rougvie and Lis, 1988; Zeitlinger et al., 2007). However, pausing does not always predict up-regulation in models of inducible gene expression (Gilchrist et al., 2012; Hah et al., 2011; Lin et al., 2011).

We previously used Pol II chromatin immunoprecipitation and sequencing (ChIP-seq) to show that the polymerase accumulates at the 5′ end of many genes during L1 arrest (Baugh et al., 2009). We hypothesized that this accumulation reflects Pol II pausing. However, a “paused” polymerase is defined as having inititated elongation but transiently halted (Adelman and Lis, 2012), and ChIP cannot distinguish between elongating and non-elongating Pol II. In addition, the multimeric Negative Elongation Factor (NELF) contributes to pausing in other systems (Nechaev and Adelman, 2011; Renner et al., 2001; Wu et al., 2003), but none of its subunits have homologs in the C. elegans genome (Narita et al., 2003). Furthermore, trans-splicing obscures the transcription start site (TSS) of most genes in C. elegans (Allen et al., 2011), making interpretation of Pol II accumulation difficult.

Our results here suggest that two independent forms of post-recruitment regulation occur during starvation in C. elegans, docking and pausing, affecting growth and stress genes, respectively. Integrated analysis of Pol II binding, nascent transcript production and elongation confirm that Pol II pausing occurs in starved C. elegans L1 stage larvae, where it is associated with active stress-response genes. Furthermore, a TFIIS mutant suggests backtracking of paused polymerase as in other systems. Surprisingly, this analysis also revealed that “docked” Pol II accumulates without initiating transcription just upstream of TSSs of growth genes. In addition to encoding proteins with distinct functions, genes associated with docking and pausing respond in opposite ways to feeding and are enriched for different core promoter motifs. Our results reveal a fundamental distinction between growth and stress genes and suggest that this difference extends to mechanisms of post-recruitment transcriptional regulation.

Results

Nascent RNA sequencing by scRNA-seq and GRO-seq

Our published Pol II ChIP-seq analysis could not distinguish between inactive and elongating polymerase (Baugh et al., 2009). To address this, we sequenced short, capped RNAs (scRNA-seq) to measure elongation activity genome-wide. Nascent RNAs are capped on their 5′ end in Drosophila (Rasmussen and Lis, 1993), providing a strategy to clone and sequence them as scRNA (Nechaev et al., 2010). In addition, sequencing the 3′ end of scRNA reveals the location of promoter-proximal Pol II with nucleotide resolution (Nechaev et al., 2010). We performed a variety of control experiments that demonstrate the sensitivity, specificity, reproducibility and fidelity of our scRNA-seq procedure (Fig S1, S2; Sup. Info.). These control experiments show that we are able to specifically detect nascent elongation products as scRNAs in C. elegans.

We also analyzed nascent RNAs using global nuclear run-on (GRO-seq) data as an independent measurement of elongating Pol II (Kruesi et al., 2013). Unlike scRNA-seq, which only reports on elongation activity near the TSS, GRO-seq reports on elongation throughout the gene. scRNA-seq also cannot distinguish between RNA species that remain associated with paused Pol II and those that have been released through termination of transcription, but GRO-seq can (Adelman and Lis, 2012). However, GRO-seq does not locate the position of paused polymerase with the single-nucleotide resolution of 3′ scRNA-seq. For these reasons scRNA-seq and GRO-seq provide complementary ways to interrogate elongation.

scRNA-seq reveals Pol II pausing in C. elegans

scRNA detection coincides with Pol II accumulation consistent with promoter-proximal pausing. We used scRNA-seq on a pair of biological replicates starved during the L1 larval stage. We identified 789 contiguous regions of 5′ scRNA-seq coverage (scRNA contigs) that are within 100 bp of the annotated TSS of protein coding genes. We found that these scRNA contigs are associated with accumulation of Pol II detected by ChIP-seq and GRO-seq (Fig 1), suggesting pausing of the polymerase during early elongation. Unless stated otherwise, the Pol II ChIP-seq data presented is from Zhong and co-workers (Table S1; Zhong et al., 2010). The 5′ end of scRNA contigs and GRO-seq signal align precisely (Fig 1E), demonstrating the precision and consistency of these two data sets. These results also suggest that scRNA synthesis initiates at the same position as mRNA synthesis, consistent with scRNAs being nascent transcription products.

Figure 1.

Figure 1

scRNAs coincide with accumulation of elongating RNA Pol II. Coverage of A) the 5′ end of scRNAs, B) the 3′ end of scRNAs, C) GRO-seq reads and D) Pol II ChIP-seq reads is plotted relative to the beginning of 789 contiguous regions (contigs) of scRNA coverage. E) Coverage of 5′ end of scRNA reads (black) and GRO-seq reads (red) in the immediate proximity of the contig start is plotted. Only scRNA contigs within 100 bp of annotated TSSs for protein coding genes are included (WS220). Coverages are the median boot-strap estimates of the mean. See also Fig S1, S2, and S3.

Pol II pauses in approximately the same location relative to the TSS in C. elegans as other animals. Sequencing the 5′ and 3′ ends of scRNAs allows us to determine their size distribution, revealing the distance traveled by Pol II prior to pausing. Consistent with Drosophila (Nechaev et al., 2010), 75% of scRNAs are 25–65 nt long (Fig 1B). There is also good agreement between the position of the 3′ ends of scRNA and the peak of Pol II accumulation based on ChIP-seq, as expected for pausing (Fig 1). The amount of scRNA correlates with Pol II ChIP-seq and GRO-seq coverage over scRNA contig coordinates (Spearman’s r = 0.37 and 0.30 respectively; Spearman’s rank correlation test, p < 2e-16 for both comparisons), suggesting that these assays detect the same population of paused Pol II molecules. Together, these data provide evidence that Pol II is paused in the promoter-proximal region of many genes during L1 arrest (see below for estimation of the number of paused genes).

TFIIS and backtracking of paused Pol II

We found that paused Pol II is prone to backtracking in C. elegans. When Pol II pauses in other organisms it can backtrack a few base pairs, and the general transcription factor TFIIS helps it resume elongation (Adelman et al., 2005; Kettenberger et al., 2003). Depleting TFIIS in yeast or Drosophila S2 cells results in net elongation of nascent RNAs near pause sites, reflecting backtracking without cleavage (Churchman and Weissman, 2011; Nechaev et al., 2010). To examine the function of TFIIS in C. elegans, we sequenced the 3′ end of scRNAs in a TFIIS mutant (T24H10.1(ok2749)) during L1 arrest. The mutant has a significantly different scRNA size distribution (Kolmogorov-Smirnov test, p < 2.2e-16). In particular, fewer of the shortest scRNAs and more of the moderately sized scRNAs are detected in the TFIIS mutant (Fig 2A). This increase in scRNA length is consistent with TFIIS relieving backtracking. Furthermore, these results provide strong evidence that the scRNAs we detect are nascent transcription products as opposed to degradation products.

Figure 2.

Figure 2

TFIIS mutation alters the size distribution of scRNAs. A) The difference in relative coverage between wild type and the TFIIS mutant is plotted relative to the beginning of 789 scRNA contigs within 100 bp of annotated protein coding gene TSSs. Each bin shows the mean change in coverage over a 5 bp window. Coverage is the median boot-strap estimate of the mean. B) A boxplot comparing the coefficient of variation (CV) for the distance between the 3′ ends of scRNAs and the beginning of the contig is plotted. The CV was calculated for each contig separately. In order to address possible effects resulting from the smaller TFIIS mutant library sizes, the wild-type data was resampled to calculate a boot-strap estimate of the CV. C) Three examples of the distribution of the 3′ end of scRNAs are plotted for wild-type and the TFIIS mutant. All genes are plotted with their 5′ end to the left regardless of strand.

TFIIS also affects the dispersion of pause sites in individual genes. We found that the longest scRNAs are actually less abundant in the TFIIS mutant than wild-type (Fig 2A). This result suggests that TFIIS is required for Pol II to escape relatively proximal pause sites and reach secondary pause sites where it is associated with longer scRNAs. To address this hypothesis, we examined the dispersion of pause sites in individual genes. We calculated the coefficient of variation (CV) for the distance between the 3′ ends of scRNAs and the start of the contig within individual contigs. This analysis revealed that there is a smaller CV in the TFIIS mutant than in wild-type (Fig 2B; Mann-Whitney test, p = 5.2e-13). This observation is consistent with the mutant having a relatively narrow scRNA size distribution genome-wide (Fig 2A), but it shows that the effect occurs at individual loci. Examination of 3′ scRNA ends at individual genes supports this interpretation (Fig 2C). These results suggest that Pol II pauses in a more focused region in the TFIIS mutant than wild-type, as if TFIIS helps Pol II escape proximal pause sites in order to pause at more distal sites.

TSS identification and the frequency of pausing

The majority of mRNA transcripts in C. elegans have a 22 nt leader sequence added to their 5′ end in a co-transcriptional trans-splicing reaction (Allen et al., 2011). As a result, current genome annotation of TSSs actually corresponds to trans-splice sites in the majority of cases. Since GRO-seq detects nascent RNAs it provides an opportunity to discover true TSSs. To increase coverage of GRO-seq signal at 5′ ends, we used data from a variant of GRO-seq called GRO-cap to sequence only the capped 5′ end of nascent transcripts. Development of this technique, validation, results and criteria used to identify TSSs are presented elsewhere (Kruesi et al., 2013). We pooled TSS calls generated from embryos, L1 arrest, and fed L3 stage larvae to generate 5,192 high-confidence true TSSs for protein coding genes, excluding TSSs found inside operons. We performed scRNA-seq only in L1 arrest, and we detect scRNAs (with a false discovery rate of 1%) at 29% of true TSSs. The 5′ end of scRNAs agrees extremely well with true TSSs (Fig 3A). The precise registration of these two datasets reflects the reliability of the TSS calls.

Figure 3.

Figure 3

Pol II accumulates upstream of true TSSs at genes with relatively little elongation. A) Coverage of the 5′ end of scRNAs, 3′ end of scRNAs, GRO-seq reads and Pol II ChIP-seq reads is plotted relative to 5,192 true TSSs. B) Coverage of GRO- seq and Pol II ChIP-seq reads is plotted relative to 590 empirically identified SL1 trans-splice sites. Coverages for A and B are the median boot-strap estimates of the mean. C) Mean Pol II ChIP-seq coverage around 5,192 true TSSs is plotted for deciles of scRNA abundance mapping within 100 bp downstream of the TSS. The bottom five deciles of scRNA are each made up of loci with no scRNAs mapping to them and are merged. Dotted, dashed, and solid black lines show the 90%, 80%, and 60% bootstrap confidence intervals of the mean, respectively, based on computing the mean of a sample of 10% of the data 1000 times. F44E5.4 and F44E5.5 were omitted (see Supplemental Experimental Procedures). D) A heatmap of Pol II ChIP-seq coverage is plotted for the same genes as in C. Genes are sorted by the number of scRNA reads mapping within 100 bp downstream of the TSS. See also Fig S4 and S5.

We verified that scRNAs are generated primarily from paused Pol II. Pol II does not appear to pause for the trans-splicing reaction based on Pol II ChIP-seq and GRO-seq (Fig 3B). We therefore used the frequency of scRNA reads that begin with the 22 nt SL1 splice leader and extend into the trans-spliced gene to assess the relative contribution of elongating Pol II to scRNA abundance. We assume that scRNAs present at the trans-splice site are produced exclusively by read-through of elongating Pol II (since there is no pausing) and that scRNAs at TSSs are produced by a combination of read-through and pausing. The average ratio of scRNAs at TSSs vs. trans-splice sites is 4.3, suggesting that on average about 75% of the scRNAs present at TSSs are due to pausing (Sup. Info.).

scRNA-seq reveals paused Pol II immediately downstream of true TSSs. We applied a cutoff of 20 reads of scRNA at the TSS and found 8.2% of genes with evidence of Pol II pausing by this criterion (Table S2). This frequency is consistent with an independent analysis of the GRO-seq pausing index (Kruesi et al., 2013). However, caveats apply to the interpretation of these results. For example, whole animals were used for all of our measurements, presumably affecting sensitivity, and an arbitrary cut-off is used in both cases. Furthermore, paused Pol II is not the only source of scRNA (see above and Sup. Info.), and the GRO-seq pause index is very sensitive to weak signal in the body of the gene (the denominator of the index). Caveats aside, it is noteworthy that pausing in starved C. elegans larvae appears less common than in other systems where it has been investigated. Nevertheless, this analysis suggests that Pol II pauses downstream of true TSSs during starvation in C. elegans (for examples, see Fig S3).

Docked Pol II accumulates upstream of TSSs without initiating

Meta-gene analysis reveals that Pol II accumulation is bimodal near true TSSs. Pol II accumulation is evident just downstream of the true TSSs based on both GRO-seq and Pol II ChIP-seq (Fig 3A), as expected given our analysis of scRNA contigs. Surprisingly, and in contrast to the scRNA contigs (Fig 1D), Pol II ChIP-seq coverage around true TSSs is relatively broad, with a mode of the distribution upstream of the TSS (Fig 3A), which is inconsistent with pausing. Examination of individual genes reveals some with Pol II accumulation downstream of the TSS (Fig S3), some with it upstream (Fig S4) and some with it in both places (Fig S2).

Pol II accumulates upstream of TSSs prior to initiation. Because the set of true TSSs we are using were identified from a combination of embryos as well as starved and fed larvae, not all of them are active during L1 arrest. We hypothesized that inclusion of inactive TSSs in meta-gene analysis resulted in detection of the additional upstream mode of Pol II binding. Consistent with this hypothesis, dividing genes into deciles of scRNA abundance and plotting Pol II ChIP-seq coverage reveals that genes with more scRNA have more Pol II accumulation downstream of the TSS, reflecting paused elongation complexes (Fig 3C). Conversely, genes with less scRNA tend to have accumulation of Pol II upstream of the TSS. A heat map of individual genes shows this pattern as well (Fig 3D). The same pattern also results when Pol II ChIP-seq coverage is plotted for deciles of GRO-seq signal (Fig S5A) or for deciles of Pol II ChIP-seq coverage in the body of the gene, a proxy for elongation activity (Fig S5B). These results show that accumulation of Pol II upstream of the TSS occurs at genes with relatively little elongation activity. We refer to Pol II accumulated upstream of TSSs as “docked” to indicate that it is recruited to the DNA but has not initiated transcription and that its position is inconsistent with a typical pre-initiation complex.

Upstream accumulation of Pol II is not due to divergent transcription

We used unsupervised clustering to identify docked genes based on accumulation of Pol II upstream of the TSS. Genes were assigned to one of three clusters based on Pol II ChIP-seq coverage from 200 bp upstream to 200 bp downstream of the TSS. The three clusters include genes with Pol II accumulation upstream of the TSS (docked cluster; 15%), with Pol II downstream of the TSS (active cluster; 14%), and with relatively low amounts of Pol II binding (no signal cluster; 71%) (Fig 4A). Genes in the docked cluster have relatively little Pol II in the body of the gene and very little elongation activity based on scRNA abundance and GRO-seq. Individual examples of genes from the docked cluster clearly show this pattern as well (Fig 4B, S4). Genes in the active cluster have significantly more Pol II in the body of the gene and much more elongation activity based on scRNA abundance and GRO-seq (Fig 4A,C). The active cluster is enriched for genes that appear paused based on scRNA abundance (at least 20 reads) and GRO-seq (pause index greater than 2) (Fisher’s exact test; p<2e-16 and =3.4e-5, respectively; Table S2). Indeed, the active cluster appears paused on average, though not all genes in this cluster show evidence of pausing.

Figure 4.

Figure 4

Clustering genes based on patterns of Pol II ChIP-seq coverage around true TSSs identifies genes with “docked” and “active” Pol II. A) Average coverage of Pol II ChIP-seq, GRO-seq and 3′ scRNA-seq is plotted for each of the three clusters around true TSSs. Coverages are the median boot-strap estimates of the mean. Browser shots of representative genes from the B) docked and C) active clusters are shown. All genes are plotted with their 5′ end to the left regardless of strand.

Pol II accumulation upstream of TSSs (docking) is not due to antisense or divergent transcription. We used our GRO-seq data to identify genes that are divergently transcribed during L1 arrest and to split them into groups based on the amount of divergent signal. The frequency of divergent transcription has been analyzed elsewhere (Kruesi et al., 2013; Chen et al., 2013). For genes in the docked and active clusters, the Pol II peak position is unaffected by divergent transcription (Fig 5A). We used a ChIP-seq normalization method that improves peak resolution (Enroth et al., 2012), and the offset between the Pol II peak in each cluster is clear (Fig 5A). We also examined the average Pol II position relative to sense and antisense elongation based on GRO-seq and scRNA-seq. For docked genes, Pol II ChIP-seq signal peaks between regions of active transcription, though for active genes the peak coincides with sense transcription (Fig 5B,C). These results are inconsistent with divergent transcription causing Pol II accumumulation upstream of the TSS. The relative magnitude of Pol II ChIP-seq signal compared to elongation in either direction is also much larger in the docked cluster (Fig 5B,C), providing additional evidence that Pol II accumulation upstream of the TSS is not associated with sense or antisense transcription.

Figure 5.

Figure 5

Divergent transcription does not account for accumulation of Pol II upstream of TSSs. A) Pol II ChIP-seq coverage around 5,192 true TSSs is plotted. Genes are first divided by whether they have antisense GRO-seq reads 200 bp upstream of the TSS, and those that do are grouped by quartiles of antisense read count. B, C) Coverage of Pol II ChIP-seq (black), B) 3′ scRNA-seq and C) GRO-seq is plotted around true TSSs of genes in the docked (left) and active (right) cluster. Antisense (red) and sense (blue) coverage are plotted separately. Coverages for A, B and C are the median boot-strap estimates of the mean.

Docked and paused genes have distinct function and expression

Genes with docked and active Pol II encode functionally distinct proteins. Genes in the docked cluster are enriched for Gene Ontology (GO) terms associated with growth and development such as ‘growth,’ ‘larval development’ and ‘translation’ (Fig 6A; Table S3). In contrast, genes in the active cluster, many of which are paused, are enriched for GO terms associated with the starvation response such as ‘response to stress’ and ‘response to unfolded protein’ (a term that includes multiple chaperone proteins). This analysis supports the view that docked and paused genes encode functionally distinct products.

Figure 6.

Figure 6

Docked and active genes have different functions and nutrient-dependent regulation. A) Functional enrichments are plotted for the active and docked cluster using the online service ‘Revigo’, which arranges Gene Ontology (GO) terms using multi-dimensional scaling based on their position in the GO graph. Points are colored by whether they are enriched in the ‘docked’ or the ‘active’ cluster (corrected Hypergeometric p-value < 0.01). The size of the point is scaled according to how many genes are annotated with that functional term in the cluster. B) Gene expression during early L1 development (left) and L1 arrest and 3 hr recovery (right) is plotted for the docked (orange) and active (blue) clusters, as well as for all genes (black). Vertical bars on the ‘all genes’ line show the 95% confidence interval of the mean constructed by subsampling 10% of the data 1000 times. C) Venn diagrams showing the numbers of genes in the docked and active clusters whose expression increases, and increases significantly during the first 6 hours of recovery expressed’ at a FDR of 1%. We tested for differential expression in 3,093 genes that had detectable mRNA reads (FDR 1%) and also had true TSS calls. D) Coverage of Pol II ChIP-seq data around docked genes is plotted during L1 arrest and after 1 hr recovery. In contrast to other figures, Pol II ChIP-seq data is from Baugh and coworkers (Baugh et al., 2009). All coverages are the median boot-strap estimates of the mean. See also Fig S6 and Table S3.

Since docked and active genes are transcribed at different levels during starvation and have different functions, we hypothesized that their transcriptional response to feeding differs. We used previously analyzed microarray data to address this possibility (Baugh et al., 2009). These data were collected from precisely staged animals that hatch in the presence or absence of food, so they either initiate L1 development or enter L1 arrest. We also analyzed expression during recovery from L1 arrest by feeding after 12 hr starvation. Genes docked during starvation are up- regulated relative to other genes as development is initiated after hatching with food, consistent with their GO term enrichments (Fig 6B). In contrast, genes in the active cluster are down-regulated during development, consistent with them comprising the starvation response. Likewise, docked genes are up-regulated and active genes are down-regulated at 3 hr of recovery after 12 hr of starvation (Fig 6B). Notably, paused genes in the active cluster show the same pattern of up-regulation during starvation and down-regulation during recovery (data not shown). This indicates that paused genes do not have a different pattern of expression from the rest of the active cluster. Taken together, these results show that docked and paused genes respond on average in opposite ways to feeding.

mRNA-seq analysis of L1 arrest and recovery confirms that docked and active genes have opposite transcriptional responses to feeding. These data were collected after ~12 hr of L1 arrest (0 hr recovery) and at 1, 3 and 6 hr of recovery by feeding (Maxwell et al., 2012). Docked genes significantly increase transcript abundance during recovery from L1 arrest, and active genes significantly decrease abundance (data not shown), similar to the results of microarray analysis. Yet average expression does not reveal what fraction of docked and active genes are up- and down-regulated, respectively. Therefore, we determined which genes increased or decreased expression during recovery and which of those are statistically significant (Q < 0.05). We then intersected these gene sets with the docked and active genes identified by cluster analysis (Fig 6C). 39% of docked genes are significantly up-regulated during recovery from L1 arrest (Fisher’s exact test; p = 2.4e-7), representing 73% of differentially expressed docked genes. Conversely, 45% of active genes are significantly down-regulated (Fisher’s exact test; p = 2.2e-7), representing 63% of differentially expressed active genes. These results show that the average up-regulation and down-regulation of docked and active genes, respectively, during recovery from L1 arrest reflects the major trend in each case.

Docking decreases during recovery but is not specific to starvation

Docking decreases in response to feeding. To examine whether docking is restricted to starvation, we analyzed a different Pol II ChIP-seq data set generated from L1 arrest and 1 hr recovery (Baugh et al., 2009). We found a similar number of docked genes during L1 arrest in this data as in the data from Zhong and co-workers that we analyze elsewhere (Fig S6A; Sup. Info; Zhong et al., 2010). This result shows that docked Pol II is detected in two independent data sets and at a set of common genes (Fishers’ exact test p < 2.2e-16). However, there are eight-times fewer docked genes after one hour of feeding. Likewise, the total amount of Pol II upstream of docked genes decreases significantly after one hour of feeding compared to L1 arrest (Paired Wilcox test, p = 9.2e-9). This difference is readily apparent in the meta-gene analysis of docked genes during L1 arrest and 1 hr recovery (Fig 6D). These results show that docking is uncommon during early L1 development compared to L1 arrest, demonstrating that it is influenced by nutrient availability.

Although most genes that are docked during L1 arrest are no longer docked after 1 hour feeding, some genes remain docked (Fig S6A). Interestingly, transcript abundance increases most for these genes during early L1 development (Fig S6B). These genes are also significantly more up-regulated after one and six hours of recovery from L1 arrest (Wilcox test, p = 0.042, p = 0.001, respectively). This further illustrates a correlation between docking and up-regulation in response to feeding, and it shows that docking can occur outside of starvation.

Independent of starvation, there are other periods in the lifecycle associated with lack of or reduced growth. For example, there is no growth during embryogenesis, and the relative growth rate decreases in the latter portion of each larval stage (Knight et al., 2002). Genes associated with growth are not abundantly expressed during embryogenesis or L1 arrest (Zaslaver et al., 2011), and they are down-regulated during late L1 development as larvae prepare to molt (Baugh et al., 2009). Likewise, although docked genes are up-regulated during early L1 development (0–6 hr; Fig 6B), their expression decreases during later L1 development (6–16 hr; Fig S6B). We examined published Pol II ChIP-seq data prepared from mixed-stage embryos and fed L3 larvae (Gerstein et al., 2010; Zhong et al., 2010). These data were prepared with transgenic animals expressing a GFP fusion to the largest Pol II subunit (AMA-1) and an anti-GFP antibody. This result therefore provides an important control for our results. Docking is nearly as common in embryos as in starved L1s, whereas it is less common in fed L3 larvae (Fig S6C; Sup. Info.). An overlapping set of genes is also docked in each stage (Fig S6D; Fisher’s exact test p < 2e-16 for all pair-wise comparisons). It is unclear whether the fed L3 larvae were collected early or late in the larval stage, and we may detect docking at least in part due to the staging of these animals. These data further show that docking can occur outside of L1 arrest, perhaps in conjunction with reduced growth.

Docked and paused genes have distinct promoter architectures

The fact that docked and active genes have opposite transcriptional responses to feeding suggests a fundamental difference in their regulation. We hypothesized that this difference in regulation is reflected in the promoter architecture of these two gene sets. We used the software FIMO from the MEME suite to look in promoters (200 bp upstream and 100 bp downstream of TSSs) for occurrences of known core motifs defined in the JASPAR database (Bryne et al., 2007; Grant et al., 2010). Most core motifs (nine of 13) are enriched in the active cluster with a FDR cut-off of 5%, including TATA, GC-box, and Initiator (Inr) (Fisher’s exact test; Q = 1e-16, 4e-10, and 0.009, respectively; Fig 7A; Table S4). Only MTE-1 is enriched in the docked cluster (Fisher’s exact test Q = 0.003), and both Inr and TATA are significantly depleted from this cluster (Fisher’s exact test; Q = 3.4e-6, and 5e-8, respectively). Within the active cluster, paused genes are significantly more likely to have a TATA motif in their promoter than genes that are not paused (45% vs. 30%, respectively; Fishers’ exact test q = 0.009). These results show that known core promoter motifs are associated with paused but not docked genes.

Figure 7.

Figure 7

Docked and active genes have distinct sets of core promoter motifs. A) The positional frequency of the Inr and TATA motifs is plotted relative to true TSSs for each of the three clusters. B) The coverage of Pol II ChIP-seq, the 3′ ends of scRNAs, and GRO-seq around true TSSs for genes in each cluster is plotted. Genes are split by whether or not they have a canonical TATA motif. C) Pol II initiation and elongation are differentially regulated for growth and starvation genes. Upstream accumulation of uninitiated Pol II (docked) is associated with growth and development genes not expressed during starvation but up-regulated by feeding. In contrast, promoter-proximal pausing of early elongation is associated with genes expressed during starvation and down-regulated during growth, which includes stress-response genes. Starvation genes are much more likely than growth genes to have a TATA box, suggesting alternative core transcriptional machinery in the pre-initiation complex of these two sets of genes. We propose that upstream accumulation of docked Pol II involves at least one unknown factor that docks Pol II, represented by a pentagon. See also Tables S4 and S5.

Docked genes are associated with novel promoter motifs. We used the motif identification software DREME from the MEME suite to find motifs differentially enriched among genes in the docked cluster compared to the active cluster, and vice versa (Baily, 2011). DREME identified nine motifs enriched in the docked cluster and 11 in the active cluster (Table S5). Consistent with our analysis of known core motifs, this unbiased approach identified the canonical TATA motif TATAWAAG as enriched in the active cluster compared to the docked cluster (DREME; E = 1.2e-14). These results provide additional evidence that genes in the docked and active clusters have distinct sets of core promoter motifs.

Presence of a TATA box has functional consequences at active but not docked genes. The TATA motif is depleted from the docked cluster, but it does occur at some genes in the cluster. However, presence of TATA does not appear to affect recruitment or elongation at these genes as it does for active genes. That is, TATA is associated with greater Pol II occupancy at active genes based on Pol II ChIP-seq, scRNA-seq and GRO-seq coverage, but it does not have this effect on docked genes in the rare cases it is present (Fig 7B). Since active genes do not appear to be regulated at the level of initiation, this result is consistent with TATA promoting recruitment of Pol II to active but not docked genes.

Discussion

We present an integrated genome-wide analysis of Pol II binding, elongation activity, mRNA abundance and core promoter motifs that reveals distinct forms of post-recruitment regulation of growth and stress genes (Fig 7C). We confirm that promoter-proximal pausing occurs during starvation in C. elegans, but it is associated with active stress-response genes that are down-regulated in response to feeding. Our results also suggest that initiation is regulated post-recruitment during starvation, resulting in accumulation of docked Pol II upstream of TSSs. In contrast to paused Pol II, docked Pol II is associated with growth and development genes that are rapidly up-regulated in response to feeding. We propose that post-recruitment regulation of initation and elongation coordinates gene expression with nutrient availability and growth.

Promoter-proximal pausing in C. elegans L1 arrest

Like S. cerevisiae, the C. elegans genome does not encode homologs for any NELF subunits (Narita et al., 2003), an important regulator of pausing in Drosophila and mammals (Nechaev et al., 2010). Nevertheless, multiple lines of evidence suggest that promoter-proximal pausing occurs during starvation in C. elegans. We show that scRNAs are produced and that their 3′ ends coincide with Pol II accumulation in promoter-proximal regions, consistent with pausing. Pausing has been reported in L1 arrest based on GRO-seq (Kruesi et al., 2013), and we corroborate and expand on this result using scRNA-seq as an independent approach to detect Pol II elongation. scRNA-seq also allows us to show that Pol II typically pauses 30–65 bp downstream of TSSs, similar to Drosophila (Nechaev et al., 2010). We also show that the general transcription factor TFIIS has conserved function in C. elegans, alleviating backtracking of paused polymerase (Adelman et al., 2005; Kettenberger et al., 2003; Nechaev et al., 2010). Furthermore, the core promoter motifs TATA and Inr are associated with pausing, consistent with Drosophila and the complex interaction model which posits a role of these core promoter elements in regulation of early elongation (Kwak et al., 2013). We considered the possibility that our results could reflect premature termination rather than pausing. However, our 3′ scRNA-seq reads (including those that did not map) show no evidence of poly-adenylation (data not shown). Given this negative result and the strong similarities to pausing in other systems, we conclude that pausing occurs during L1 arrest in C. elegans. Given the apparent lack of NELF, this conclusion implies a NELF-independent pausing mechanism, perhaps involving some combination of the conserved DSIF complex, an unidentified GAGA factor or M1BP homolog and the core promoter factors (Li and Gilmour, 2013; Missra and Gilmour, 2010; Wada et al., 1998; Kwak et al., 2013).

Pausing is less common in C. elegans than in other systems. Our results suggest that ~8% of the 5,192 C. elegans genes we examined are paused during L1 arrest. Pausing is even less common in embryos and fed L3 stage larvae (Kruesi et al., 2013). Investigation of Drosophila and mammals suggest that pausing is substantially more widespread (Core et al., 2008; 2012; Min et al., 2011; Muse et al., 2007; Zeitlinger et al., 2007). Despite the relatively low frequency of genes with strong evidence for pausing, 5′ accumulation of active Pol II just downstream of TSSs is a pervasive pattern in the data we present. We show that this pattern is not an effect of outliers, suggesting that regulation of elongation may be widespread but with only a modest effect at most genes. It would be valuable to inhibit P-TEFb to determine if in fact many more genes are regulated during early elongation than our studies have revealed (Rahl et al., 2010).

Pausing has been suggested to facilitate a rapid response to stimulus (Lis, 1998; Muse et al., 2007; Zeitlinger et al., 2007), but this is not always the case. Pausing was discovered in the context of the heat-shock response, a classic example of rapid stimulus-response dynamics (O’Brien and Lis, 1991; Rougvie and Lis, 1988). However, several studies of stimulus-response systems suggest that paused genes are not necessarily induced in response to stimuli (Gilchrist et al., 2012; Hah et al., 2011; Lin et al., 2011). In mammalian cells pausing appears to regulate the expression of rapidly induced targets of TNF-α signaling but not targets of E2 signaling (Danko et al., 2013). These studies and others suggest that the physiological role of pausing depends on context (Adelman and Lis, 2012).

We show that Pol II pauses at actively transcribed genes during starvation in C. elegans, and that these genes are down-regulated in response to feeding. Genes with the most elongation activity, as assessed by scRNA-seq, GRO-seq, and Pol II binding to the gene body, show a pervasive pattern of Pol II accumulation indicative of pausing. This suggests that pausing does not repress transcription during starvation. Furthermore, rather than providing a mechanism to anticipate future activation, genes associated with pausing are down-regulated relative to other genes during recovery from starvation. Consistent with this expression pattern, genes associated with pausing are enriched for stress response genes. Pausing is much less common in embryos or fed larvae (Kruesi et al., 2013), consistent with it reflecting a stress response. These observations suggest that the physiological function of pausing in C. elegans is to promote the expression of genes needed during starvation, not to prime genes for induction in response to feeding.

Docking represents post-recruitment regulation of initiation

Surprisingly, inactive Pol II associates with DNA upstream of TSSs. The amount of upstream Pol II is inversely proportional to the elongation activity at that gene as measured by scRNA-seq, GRO-seq or Pol II binding to the gene body. We hypothesize that Pol II accumulation upstream of these genes represents nutrient-dependent regulation of transcription initiation. We found the canonical TATA motif in C. elegans ~30 bp upstream of the TSS, but Pol II accumulates further upstream (~60 bp). It is unlikely that this accumulation corresponds to a fully assembled preinitiation complex. However, partially assembled pre-initiation complexes have been reported (Esnault et al., 2008). We suggest that “docking” should be used to specify recruitment of Pol II upstream of TSSs without initiation.

Three alternative hypotheses could explain upstream accumulation of Pol II: 1) antisense or divergent transcription, 2) abortive initation and 3) un-regulated transient interaction of Pol II with DNA. The position of docked Pol II is unaffected by and inconsistent with divergent transcription. In addition, the position of docked Pol II upstream of the TSS suggests that it is not undergoing abortive initiation. Multiple lines of evidence argue against a transient interaction model. In this model Pol II is transiently associating with the relatively weak core promoters of “TATA-less” genes during starvation, with insufficient ATP to promote initiation. However, significant amounts of Pol II accumulate at docked genes compared to paused genes, which is inconsistent with transient interaction. Furthermore, not all genes actively transcribed during L1 arrest have a TATA element in their promoters. This observation shows that TATA is not required for transcription during starvation, suggesting that relatively weak core promoters can initiate despite starvation. Finally, docking is not confined to starvation, as it is present in both embryos and fed L3 larvae. Taken together, our results suggest that upstream accumulation of Pol II is due to it stably associating with DNA but not initiating transcription.

Other examples of post-recruitment, pre-initiation regulation of Pol II have been reported. Notably, lymphocyte activation, which involves transcriptome amplification, was recently shown to involve widespread, TFIIH-dependent promoter melting (Kouzine et al., 2013). Pol II also accumulates upstream of many S. cerevisiae genes during stationary phase, where it anticipates future induction upon addition of fresh media (Radonjic et al., 2005). Pol II co-localizes with Mediator subunits upstream of the TSS during stationary phase, which may provide a binding platform for docked Pol II during quiescence (Andrau et al., 2006). It is tempting to speculate that a similar mechanism operates in C. elegans, presumably also involving TFIIH and promoter melting.

Docking is associated with genes induced by feeding

We previously reported 5′ accumulation of Pol II at growth and development genes during starvation in C. elegans, but we could not distinguish between docking and pausing (Baugh et al., 2009). Pausing was the only form of promoter-proximal post-recruitment regulation known in metazoans, and we speculated that genes with 5′ accumulation of Pol II were paused. However, most of the genes identified were actually docked. Here we used unsupervised clustering to identify genes with docked Pol II during L1 arrest, and we found that about 15% of genes have docked Pol II. Although docked genes have little transcriptional activity during starvation, a significant fraction of Pol II near TSSs is docked compared to paused, suggesting physiological significance. GO term enrichments suggest that genes with docked Pol II function in growth and development. Consistent with this interpretation, these genes are up-regulated in response to feeding after hatching with food and during recovery from L1 arrest. Our results show that Pol II accumulation upstream or downstream of TSSs marks two very different sets of genes.

Docking is influenced by nutrient availability but occurs outside of L1 arrest. Based on the available data, docking is most common in L1 arrest and embryos, and growth gene expression is relatively low in both stages (Zaslaver et al., 2011). Docking also decreases dramatically during immediate recovery from L1 arrest in conjunction with up-regulation of docked genes. We propose that docking is associated with periods of no growth (eg, embryogenesis and L1 arrest) or relatively reduced growth (eg, the latter portion of each larval stage) but not growth-intensive periods (eg, the beginning of each larval stage). Such association suggests that docking plays a pervasive role in regulating growth gene expression. We speculate that docking maintains an open chromatin state permissive to regulation, analogous to what has been proposed for pausing (Gilchrist et al., 2012).

Fundamentally distinct regulation of growth and stress genes

Stress resistance and growth reflect distinct, often exclusive priorities. There is a clear distinction in S. cerevisiae between the genes expressed during stress and growth (Gasch et al., 2000). Furthermore, in S. cerevisiae stress-response genes have canonical TATA motifs and are regulated by SAGA, whereas housekeeping genes tend to be “TATA-less” and regulated by TFIID (Huisinga and Pugh, 2004). Interestingly, TATA-containing genes tend to have a focused transcriptional initiation pattern in C. elegans (Chen et al., 2013), suggesting that this difference is conserved. Starvation causes developmental arrest and confers stress resistance in C. elegans (Baugh, 2013), and distinct sets of genes are expressed during arrest and development (Baugh et al., 2009; Maxwell et al., 2012). Our results suggest that these two very different types of genes are transcriptionally regulated by distinct post-recruitment mechanisms and that the mode of regulation is correlated with differences in core promoter architecture. In particular, paused genes are enriched for the TATA motif, and docked genes are depleted for this core motif and others. More experiments are needed to determine whether all genes are prone to docking, as demonstrated for pausing in mammals (Rahl et al., 2010). However, we speculate that growth and stress genes in C. elegans tend to employ alternative pathways of pre-initiation complex formation, perhaps differentially utilizing TFIID and SAGA, affecting their point of post-recruitment regulation and expression.

Growth rate and stress resistance must be balanced to ensure survival and optimize fitness. Mechanisms that control gene expression in response to fluctuating environmental conditions are critical to environmental adaptation, and their disruption can cause cancer, diabetes and other diseases. Our results suggest that post-recruitment regulation of initation and elongation affect growth and stress genes reciprocally to coordinate gene expression with nutrient availability and growth. We anticipate that the putative regulatory mechanisms we describe will be conserved with implications for environmental adaptation as well as human health and disease.

Experimental Procedures

Nematodes were cultured and arrested as previously described (Baugh et al., 2009). scRNA-seq libraries were prepared by size selecting total RNA between 30 and 100 nt, treating sequentially with RNA 5′ Polyphosphatase (Epicentre), Terminator 5′-Phosphate Dependent Exonuclease (Epicentre) and Tobacco Acid Pyrophosphatase (Epicentre), and then following the SOLiD RNA-seq protocol (Applied Biosystems) with appropriate modifications to accommodate irregular insert size. Reads were mapped to the C. elegans genome (WS210) in color space using Bowtie v. 0.12.7 (Langmead et al., 2009). Additional information on analysis procedures can be found in Supplemental Experimental Procedures.

Supplementary Material

01
02
03
04

Highlights.

  • Pol II pausing and backtracking occur during starvation in C. elegans

  • Pol II also binds upstream of TSSs without initiatiating transcription (“docking”)

  • Docking and pausing occur at growth and stress genes, respectively

  • Docked genes are TATA-less and are up-regulated in response to feeding

Acknowledgments

We thank Sergei Nechaev for providing protocols and advice for scRNA-seq. Some strains were provided by the CGC, which is funded by NIH Office of Research Infrastructure Programs (P40 OD010440). The Ellison Medical Foundation and the National Science Foundation (IOS-1120206) supported this work.

Footnotes

The authors have no financial conflicts of interest.

Accession Numbers

The Gene Expression Omnibus (GEO) accession number for the scRNA-seq data reported in this paper is GSE40161. Accession numbers of other data sets analyzed are in Table S1.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  1. Adelman K, Lis JT. Promoter-proximal pausing of RNA polymerase II: emerging roles in metazoans. Nat Rev Genet. 2012;13:720–731. doi: 10.1038/nrg3293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Adelman K, Marr MT, Werner J, Saunders A, Ni Z, Andrulis ED, Lis JT. Efficient Release from Promoter-Proximal Stall Sites Requires Transcript Cleavage Factor TFIIS. Mol Cell. 2005;17:103–112. doi: 10.1016/j.molcel.2004.11.028. [DOI] [PubMed] [Google Scholar]
  3. Allen MA, Hillier LW, Waterston RH, Blumenthal T. A global analysis of C. elegans trans-splicing. Genome Research. 2011;21:255–264. doi: 10.1101/gr.113811.110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Andrau JC, van de Pasch L, Lijnzaad P, Bijma T, Koerkamp MG, van de Peppel J, Werner M, Holstege FCP. Genome-Wide Location of the Coactivator Mediator: Binding without Activation and Transient Cdk8 Interaction on DNA. Molecular Cell. 2006;22:179–192. doi: 10.1016/j.molcel.2006.03.023. [DOI] [PubMed] [Google Scholar]
  5. Baily TL. DREME: Motif discovery in transcription factor ChIP-seq data. Bioinformatics. 2011;27:1653–1659. doi: 10.1093/bioinformatics/btr261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Baugh LR. To Grow or Not to Grow: Nutritional Control of Development During Caenorhabditis elegans L1 Arrest. Genetics. 2013;194:539–555. doi: 10.1534/genetics.113.150847. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Baugh LR, DeModena J, Sternberg PW. RNA Pol II Accumulates at Promoters of Growth Genes During Developmental Arrest. Science. 2009;324:92–94. doi: 10.1126/science.1169628. [DOI] [PubMed] [Google Scholar]
  8. Bryne JC, Valen E, Tang MH, Marstrand T, Winther O, da Piedade I, Krogh A, Lenhard B, Sandelin A. JASPAR, the open access database of transcription factor-binding profiles: new content and tools in the 2008 update. Nucleic Acids Res. 2007;36:D91–D94. doi: 10.1093/nar/gkm955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Chen RAJ, Down TA, Stempor P, Chen QB, Egelhofer TA, Hillier LW, Jeffers TE, Ahringer J. The landscape of RNA polymerase II transcription initiation in C. elegans reveals promoter and enhancer architectures. Genome Research. 2013;23:1339–1347. doi: 10.1101/gr.153668.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Churchman LS, Weissman JS. Nascent transcript sequencing visualizes transcription at nucleotide resolution. Nature. 2011;469:368–373. doi: 10.1038/nature09652. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Core LJ, Waterfall JJ, Lis JT. Nascent RNA Sequencing Reveals Widespread Pausing and Divergent Initiation at Human Promoters. Science. 2008;322:1845–1848. doi: 10.1126/science.1162228. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Core LJ, Waterfall JJ, Gilchrist DA, Fargo DC, Kwak H, Adelman K, Lis JT. Defining the Status of RNA Polymerase at Promoters. Cell Reports. 2012;2:1025–1035. doi: 10.1016/j.celrep.2012.08.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Danko CG, Hah N, Luo X, Martins AL, Core L, Lis JT, Siepel A, Kraus WL. Signaling pathways differentially affect RNA polymerase II initiation, pausing, and elongation rate in cells. Molecular Cell. 2013;50:212–222. doi: 10.1016/j.molcel.2013.02.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Enroth S, Andersson C, Andersson R, Wadelius C, Gustafsson M, Komorowski J. A strand specific high resolution normalization method for chip-sequencing data employing multiple experimental control measurements. Algorithms for Molecular Biology. 2012;7:2. doi: 10.1186/1748-7188-7-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Esnault C, Ghavi-Helm Y, Brun S, Soutourina J, Van Berkum N, Boschiero C, Holstege F, Werner M. Mediator-Dependent Recruitment of TFIIH Modules in Preinitiation Complex. Molecular Cell. 2008;31:337–346. doi: 10.1016/j.molcel.2008.06.021. [DOI] [PubMed] [Google Scholar]
  16. Gasch AP, Spellman PT, Kao CM, Carmel-Harel O, Eisen MB, Storz G, Botstein D, Brown PO. Genomic Expression Programs in the Response of Yeast Cells to Environmental Changes. Molecular Biology of the Cell. 2000;11:4241–4257. doi: 10.1091/mbc.11.12.4241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Gerstein MB, Lu ZJ, Van Nostrand EL, Cheng C, Arshinoff BI, Liu T, Yip KY, Robilotto R, Rechtsteiner A, Ikegami K, et al. Integrative analysis of the Caenorhabditis elegans genome by the modENCODE project. Science. 2010;330:1775–1787. doi: 10.1126/science.1196914. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Gilchrist DA, Fromm G, dos Santos G, Pham LN, McDaniel IE, Burkholder A, Fargo DC, Adelman K. Regulating the regulators: the pervasive effects of Pol II pausing on stimulus-responsive gene networks. Genes & Development. 2012;26:933–944. doi: 10.1101/gad.187781.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Gilchrist DA, Nechaev S, Lee C, Ghosh SKB, Collins JB, Li L, Gilmour DS, Adelman K. NELF-mediated stalling of Pol II can enhance gene expression by blocking promoter-proximal nucleosome assembly. Genes & Development. 2008;22:1921–1933. doi: 10.1101/gad.1643208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Gilchrist DA, dos Santos G, Fargo DC, Xie B, Gao Y, Li L, Adelman K. Pausing of RNA Polymerase II Disrupts DNA-Specified Nucleosome Organization to Enable Precise Gene Regulation. Cell. 2010;143:540–551. doi: 10.1016/j.cell.2010.10.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Grant CE, Baily TL, Noble WS. FIMO: Scanning for occurrences of a given motif. Bioinformatics. 2010;27:860–866. doi: 10.1093/bioinformatics/btr064. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Hah N, Danko CG, Core L, Waterfall JJ, Siepel A, Lis JT, Kraus WL. A Rapid, Extensive, and Transient Transcriptional Response to Estrogen Signaling in Breast Cancer Cells. Cell. 2011;145:622–634. doi: 10.1016/j.cell.2011.03.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Huisinga KL, Pugh BF. A Genome-Wide Housekeeping Role for TFIID and a Highly Regulated Stress-Related Role for SAGA in Saccharomyces cerevisiae. Mol Cell. 2004;13:573–585. doi: 10.1016/s1097-2765(04)00087-5. [DOI] [PubMed] [Google Scholar]
  24. Kettenberger H, Armache KJ, Cramer P. Architecture of the RNA Polymerase II-TFIIS Complex and Implications for mRNA Cleavage. Cell. 2003;114:347–357. doi: 10.1016/s0092-8674(03)00598-1. [DOI] [PubMed] [Google Scholar]
  25. Kim TH, Barrera LO, Zheng M, Qu C, Singer MA, Richmond TA, Wu Y, Green RD, Ren B. A high-resolution map of active promoters in the human genome. Nature. 2005;436:876–880. doi: 10.1038/nature03877. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Knight CG, Patel MN, Azevedo RBR, Leroi AM. A novel mode of ecdysozoan growth in Caenorhabditis elegans. Evol Dev. 2002;4:16–27. doi: 10.1046/j.1525-142x.2002.01058.x. [DOI] [PubMed] [Google Scholar]
  27. Kouzine F, Wojtowicz D, Yamane A, Resch W, Kieffer-Kwon KR, Bandle R, Nelson S, Nakahashi H, Awasthi P, Feigenbaum L, et al. Global regulation of promoter melting in naive lymphocytes. Cell. 2013;153:988–999. doi: 10.1016/j.cell.2013.04.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Kruesi WS, Core LJ, Waters CT, Lis JT, Meyer BJ. Condensin controls recruitment of RNA polymerase II to achieve nematode X-chromosome dosage compensation. eLife. 2013;2:e00808. doi: 10.7554/eLife.00808. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Kwak H, Fuda NJ, Core LJ, Lis JT. Precise Maps of RNA Polymerase Reveal How Promoters Direct Initiation and Pausing. Science. 2013;339:950–953. doi: 10.1126/science.1229386. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Langmead B, Trapnell C, Pop M, Salzberg S. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biology. 2009;10:R25. doi: 10.1186/gb-2009-10-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Li J, Gilmour DS. Distinct mechanisms of transcriptional pausing orchestrated by GAGA factor and M1BP, a novel transcription factor. Embo J. 2013;13:1829–1841. doi: 10.1038/emboj.2013.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Lin C, Garrett AS, De Kumar B, Smith ER, Gogol M, Seidel C, Krumlauf R, Shilatifard A. Dynamic transcriptional events in embryonic stem cells mediated by the super elongation complex (SEC) Genes & Development. 2011;25:1486–1498. doi: 10.1101/gad.2059211. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Lis J. Promoter-associated Pausing in Promoter Architecture and Postinitiation Transcriptional Regulation. Cold Spring Harbor Symposia on Quantitative Biology. 1998;63:347–356. doi: 10.1101/sqb.1998.63.347. [DOI] [PubMed] [Google Scholar]
  34. Maxwell CS, Antoshechkin I, Kurhanewicz N, Belsky JA, Baugh LR. Nutritional control of mRNA isoform expression during developmental arrest and recovery in C. elegans. Genome Research. 2012;22:1920–1929. doi: 10.1101/gr.133587.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Min IM, Waterfall JJ, Core LJ, Munroe RJ, Schimenti J, Lis JT. Regulating RNA polymerase pausing and transcription elongation in embryonic stem cells. Genes & Development. 2011;25:742–754. doi: 10.1101/gad.2005511. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Missra A, Gilmour DS. Interactions between DSIF (DRB sensitivity inducing factor), NELF (negative elongation factor), and the Drosophila RNA polymerase II transcription elongation complex. Proceedings of the National Academy of Sciences. 2010;107:11301–11306. doi: 10.1073/pnas.1000681107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Muse GW, Gilchrist DA, Nechaev S, Shah R, Parker JS, Grissom SF, Zeitlinger J, Adelman K. RNA polymerase is poised for activation across the genome. Nat Genet. 2007;39:1507–1511. doi: 10.1038/ng.2007.21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Narita T, Yamaguchi Y, Yano K, Sugimoto S, Chanarat S, Wada T, Kim DK, Hasegawa J, Omori M, Inukai N, et al. Human Transcription Elongation Factor NELF: Identification of Novel Subunits and Reconstitution of the Functionally Active Complex. Molecular and Cellular Biology. 2003;23:1863–1873. doi: 10.1128/MCB.23.6.1863-1873.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Nechaev S, Adelman K. Pol II waiting in the starting gates: Regulating the transition from transcription initiation into productive elongation. Biochimica Et Biophysica Acta (BBA) - Gene Regulatory Mechanisms VL - 2011;1809:34–45. doi: 10.1016/j.bbagrm.2010.11.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Nechaev S, Fargo DC, dos Santos G, Liu L, Gao Y, Adelman K. Global Analysis of Short RNAs Reveals Widespread Promoter-Proximal Stalling and Arrest of Pol II in Drosophila. Science. 2010;327:335–338. doi: 10.1126/science.1181421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. O’Brien T, Lis JT. RNA polymerase II pauses at the 5′ end of the transcriptionally induced Drosophila hsp70 gene. Molecular and Cellular Biology. 1991;11:5285–5290. doi: 10.1128/mcb.11.10.5285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Radonjic M, Andrau JC, Lijnzaad P, Kemmeren P, Kockelkorn TTJP, van Leenen D, van Berkum NL, Holstege FCP. Genome-Wide Analyses Reveal RNA Polymerase II Located Upstream of Genes Poised for Rapid Response upon S. cerevisiae Stationary Phase Exit. Molecular Cell. 2005;18:171–183. doi: 10.1016/j.molcel.2005.03.010. [DOI] [PubMed] [Google Scholar]
  43. Rahl PB, Lin CY, Seila AC, Flynn RA, McCuine S, Burge CB, Sharp PA, Young RA. c-Myc Regulates Transcriptional Pause Release. Cell. 2010;141:432–445. doi: 10.1016/j.cell.2010.03.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Rasmussen EB, Lis JT. In vivo transcriptional pausing and cap formation on three Drosophila heat shock genes. Proceedings of the National Academy of Sciences. 1993;90:7923–7927. doi: 10.1073/pnas.90.17.7923. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Renner DB, Yamaguchi Y, Wada T, Handa H, Price DH. A Highly Purified RNA Polymerase II Elongation Control System. Journal of Biological Chemistry. 2001;276:42601–42609. doi: 10.1074/jbc.M104967200. [DOI] [PubMed] [Google Scholar]
  46. Rougvie AE, Lis JT. The RNA polymerase II molecule at the 52 end of the uninduced hsp70 gene of D. melanogaster is transcriptionally engaged. Cell. 1988;54:795–804. doi: 10.1016/s0092-8674(88)91087-2. [DOI] [PubMed] [Google Scholar]
  47. Wada T, Takagi T, Yamaguchi Y, Ferdous A, Imai T, Hirose S, Sugimoto S, Yano K, Hartzog GA, Winston F, et al. DSIF, a novel transcription elongation factor that regulates RNA polymerase II processivity, is composed of human Spt4 and Spt5 homologs. Genes & Development. 1998;12:343–356. doi: 10.1101/gad.12.3.343. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Wu CH, Yamaguchi Y, Benjamin LR. NELF and DSIF cause promoter proximal pausing on the hsp70 promoter in Drosophila. Genes & Development. 2003;17:1402–1414. doi: 10.1101/gad.1091403. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Zaslaver A, Baugh LR, Sternberg PW. Metazoan Operons Accelerate Recovery from Growth-Arrested States. Cell. 2011;145:981–992. doi: 10.1016/j.cell.2011.05.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Zeitlinger J, Stark A, Kellis M, Hong JW, Nechaev S, Adelman K, Levine M, Young RA. RNA polymerase stalling at developmental control genes in the Drosophila melanogaster embryo. Nat Genet. 2007;39:1512–1516. doi: 10.1038/ng.2007.26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Zhong M, Niu W, Lu ZJ, Sarov M, Murray JI, Janette J, Raha D, Sheaffer KL, Lam HYK, Preston E, et al. Genome-Wide Identification of Binding Sites Defines Distinct Functions for Caenorhabditis elegans PHA-4/FOXA in Development and Environmental Response. PLoS Genet. 2010;6:e1000848. doi: 10.1371/journal.pgen.1000848. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

01
02
03
04

RESOURCES