Abstract
Mouse embryonic stem cells (mESCs) can adopt naïve, ground, and paused pluripotent states that give rise to unique transcriptomes. Here, we use transient transcriptome sequencing (TT‐seq) to define both coding and non‐coding transcription units (TUs) in these three pluripotent states and combine TT‐seq with RNA polymerase II occupancy profiling to unravel the kinetics of RNA metabolism genome‐wide. Compared to the naïve state (serum), RNA synthesis and turnover rates are globally reduced in the ground state (2i) and the paused state (mTORi). The global reduction in RNA synthesis goes along with a genome‐wide decrease of polymerase elongation velocity, which is related to epigenomic features and alterations in the Pol II termination window. Our data suggest that transcription activity is the main determinant of steady state mRNA levels in the naïve state and that genome‐wide changes in transcription kinetics invoke ground and paused pluripotent states.
Keywords: mouse pluripotent stem cells, transcription termination, transcription unit annotation, transcription velocity, transient transcriptome sequencing
Subject Categories: Chromatin, Transcription & Genomics; Stem Cells & Regenerative Medicine
Genome‐wide analyses of RNA metabolism kinetics in naïve, ground and paused pluripotent mouse embryonic stem cells (mESCs) indicate that the three states show markedly different global transcriptional kinetics.

Introduction
Pluripotency in the pre‐implantation embryo is of transient nature in vivo, but embryonic stem cells can be cultured long term in stable and interconvertible pluripotent states in vitro: in serum/leukemia inhibitory factor (LIF) media (SL, serum‐naïve state), serum‐free media containing LIF, Mek1/2 and GSK3β inhibitors (2i, naïve ground state), or serum/LIF media with mTOR inhibitor (mTORi, paused state). In vitro cultures of mESC provide facile model systems for understanding the molecular underpinnings of pluripotency and self‐renewal. The 2i‐induced ground state resembles naïve pre‐implantation E4.5 epiblast cells (Ying et al, 2008; Ghimire et al, 2018). Rewiring of signaling, metabolism, and epigenome have been observed in the SL–2i transition. CpG methylation is dramatically decreased genome‐wide, concomitant with a broad increase in H3K27me3 (Walter et al, 2016; Kumar & Elsässer, 2019). Further, a reduction of global H3K4me3 levels results in diminished promoter bivalency of developmentally regulated genes, where H3K27me3 and H3K4me3 are thought to set up a poised state (Marks et al, 2012; Sachs et al, 2013; Atlasi & Stunnenberg, 2017; Kumar & Elsässer, 2019). Enhancer activity is rewired during the SL–2i transition via Esrrb binding and H3K27ac activation (Atlasi et al, 2019). In contrast, mTOR inhibition suppresses cell growth and division while retaining pluripotency, resembling the diapaused blastocysts in vivo (Bulut‐Karslioglu et al, 2016).
Although most studies have focused on rewiring of regulatory circuits, activating, or disengaging individual enhancers and transcripts, globally reduced transcription activity in 2i and mTORi states was suggested by cell‐level 5‐Ethynyl Uridine (EU) incorporation (Bulut‐Karslioglu et al, 2016). Given that transcript levels are balanced by rates of RNA synthesis and degradation (Schwalb et al, 2016; Herzog et al, 2017), to which extent transcription activity itself determines transcript abundance in the pluripotent states remains to be addressed. Recent progress of quantitative RNA labeling techniques enable the absolute measurements of the rates of RNA synthesis and degradation (Rabani et al, 2011; Schwanhäusser et al, 2011; Schwalb et al, 2016; Herzog et al, 2017; Muhar et al, 2018; Schofield et al, 2018). TT‐seq combines a short 4‐thiouridine (4sU) RNA labeling pulse with an RNA fragmentation step to capture newly synthesized RNA, and thereby detects also short‐lived transcripts and allows estimation of the kinetics of RNA metabolism (Schwalb et al, 2016; Michel et al, 2017; Choi et al, 2021; Lidschreiber et al, 2021). To study transcriptional regulation in SL‐2i and SL–mTORi transitions, we measured total RNA and newly synthesized RNA with TT‐seq, annotated transcription units (TUs) de novo, and examined how transcription of different TU classes responds to state transitions. We also combined TT‐seq with quantitative RNA polymerase (Pol) II ChIP to estimate elongation velocities at the level of TUs. Our results led to a description of RNA metabolism in three states of pluripotent cells and to several unexpected findings.
Results
Transcription unit annotation in mESC pluripotent states
To capture transcriptional responses to different pluripotent state transitions, we switched ES cells from SL medium to 2i medium for 2 or 7 days, mTORi medium for 1 or 2 days, and under each condition pulsed 4‐thiouridine (4sU) for 5 min and performed the TT‐seq method as previously described (Schwalb et al, 2016) (Fig 1A). TT‐seq data were highly reproducible between biological replicates (Fig EV1A–C), and recapitulated the reported expression changes in the early state transitions (Fig EV1D–H); TT‐seq data of cells cultured in 2i medium for 2 or 7 days correlated well (Fig EV1A) and we therefore focused on the primary responses after 2 days in all further analyses. In order to verify our labeled RNA results, we compared TT‐seq gene coverage with GRO‐seq (global run‐on sequencing) (Wang et al, 2015; Flynn et al, 2016), PRO‐seq (precision run‐on sequencing) (Engreitz et al, 2016; Lloret‐Llinares et al, 2018), NET‐seq (native elongating transcript sequencing) (Mylonas & Tessarz, 2018; Tuck et al, 2018), 4sU‐seq (Brown et al, 2017; Benabdallah et al, 2019), and Bru‐seq (bromouridine sequencing) (Ardehali et al, 2017). TT‐seq 4sU labeling was performed within a short time window (5 min) under native cell culture condition, and total RNA was fragmented before biotinylation and purification steps (Fig EV2A). Therefore, TT‐seq‐labeled RNA showed a steady coverage over the gene body, more balanced signal between the first and the last exons, as well as the highest intron/exon coverage ratio as compared to NET‐seq, PRO‐seq, and GRO‐seq (Figs 1B and, EV2B and C). The stranded TT‐seq signal, thus, provides an excellent demarcation of the transcription unit and polymerase‐independent measure of transcriptional activity.
Figure 1. Mouse ES cells transcript annotation with TT‐seq.

-
AAn experimental scheme for transcription kinetics measurement in mESC samples under serum/LIF (SL), 2i/LIF (2i), serum/LIF/mTORi (mTORi) pluripotent states. Image scale bar is 100 μm.
-
BTT‐seq‐labeled RNA and total RNA spike‐in normalized profiles (in log‐scale) on Tbx3 gene, with annotated TUs (red), and Ensembl gene references (blue) in mm10 genome.
-
CLabeled RNA reads distribution by TU types in percentage (y axis breaks from 2 to 40%).
-
DLabeled RNA TPM (transcript per million) by TU types of each sample and replicate. Boxplots are with central bands at the median, 0.25 and 0.75 quartiles box area, 1.5× interquartile range (IQR) whiskers; outliers are hidden.
-
EPrincipal component analysis of labeled RNA (left) and total RNA (right) log RPK by sample replicates. SL‐naïve, 2i‐ground, and mTORi‐paused states are indicated. Serum LIF medium with 2i supplement was used to confirm the majority of 2i‐induced expression changes (Fig EV2G).
-
F, GSpike‐ins scale estimated mRNA half‐life and copy number per cell (Two‐tailed unpaired Student's t‐test ****P < 0.0001, in comparison with SL state). Boxplots are with central bands at the median, 0.25 and 0.75 quartiles box area, 1.5× interquartile range (IQR) whiskers; outliers are hidden.
-
HLabeled and total RNA sample sizes distribution of the mESC pluripotent states. Relative sample sizes are represented by the log‐transformed DESeq's size factors after spike‐in normalization and replicates combined (left). Cell‐level RNA turnover approximation by the contrast of labeled and total size factors, which equals the projection on the green diagonal dashed line.
-
IFrequency enrichment ratio of T‐>C conversions in labeled (LRNA) versus total fragmented RNA (FRNA) reads in the different cell culture conditions. Two‐tailed unpaired Student's t‐test was performed, in comparison with SL state. Boxplots are with central bands at the median, 0.25 and 0.75 quartiles box area, 1.5× interquartile range (IQR) whiskers; outliers are hidden.
-
JDistribution of spliced labeled RNA reads percentages (CIGAR ‘N’) on Ensembl protein‐coding genes. The error bar shows the (0.25, 0.75) quantile of spliced ratio, and the bar height represents the median spliced rate. Two‐tailed unpaired Student's t‐test *P < 0.05, in comparison with SL state.
Figure EV1. Reproducibility evaluation of TT‐seq in the pluripotent states.

-
ATT‐seq mRNA log RPK correlation between replicates. Spearman's rank correlation coefficient was performed, the same as below.
-
BTT‐seq annotated ncRNA log RPK correlation between replicates.
-
CTT‐seq‐labeled RNA gene RPK correlation among wild‐type mESCs in SL state. RW4 E14 is the strain used in this study (male, 129X1/SvJ), compares with TX1072 (female, C57BL/6) (Żylicz et al, 2019), WT26 (E14, male, C57Bl/6J) (Elsässer et al, 2015), and H33WT (E14, male, 129×C57Bl/6J) (Elsässer et al, 2015). Spearman's correlation coefficient is shown.
-
D–FTT‐seq fragmented total RNA (FRNA) log2FC of 2i 2d, SL2i 2d, and mTORi 1d correlations with published 2i and mTORi changes (Bulut‐Karslioglu et al, 2016). Significantly changed genes (P.adj < 0.05) in any of our state transitions are compared (n = 852), the same for H.
-
GComparison of DESeq2's log2FC correlation between total RNA 2i 2 days changes with public RNA‐seq data (Galonska et al, 2015; Finley et al, 2018). Both our data and the public data were counted with Kallisto and tested with DESeq2, and Pearson's correlation was performed at each degree of average P.adjust values in the five samples.
- H
Figure EV2. Nascent RNA‐seq methods comparison.

-
AA brief scheme of GRO‐seq, PRO‐seq, NET‐seq, and TT‐seq workflows of RNA labeling and extraction.
-
BRead coverages on the first and the last exons with 2‐kb flanking regions and the adjacent introns (arrows). Exon and intron coverages are resized to the same dimension and plotted by log mean.
-
CThe ratios of intron versus exon with labeled RNA reads density on Ensembl protein‐coding genes across the nascent RNA‐seq methods. Boxplots are with central bands at the median, 0.25 and 0.75 quartiles box area, 1.5× interquartile range (IQR) whiskers; outliers are hidden. (TT‐seq1, this study; TT‐seq2, Żylicz et al, 2019); (GRO‐seq1, Flynn et al, 2016; GRO‐seq2, Wang et al, 2015); (PRO‐seq1, Engreitz et al, 2016; PRO‐seq2, Lloret‐Llinares et al, 2018); (NET‐seq1, Mylonas & Tessarz, 2018; NET‐seq2, Tuck et al, 2018); (4sU‐seq1, Benabdallah et al, 2019; 4sU‐seq2, Brown et al, 2017); (Bru‐seq, Ardehali et al, 2017).
-
DTU annotation relative total length recovery test by TT‐seq bam file random subsetting under (0.1, 0.2, 0.4, 0.6, 0.8, 1) for different TU types.
- E, F
-
GVenn diagram of GRO‐seq, PRO‐seq, and TT‐seq intergenic RNAs.
-
HRNA Pol I, II, and III occupancy (Jiang et al, 2020) on the total intergenic TUs. Boxplots are with central bands at the median, 0.25 and 0.75 quartiles box area, 1.5× interquartile range (IQR) whiskers; outliers are hidden. Two‐tailed unpaired Student's t‐test is performed with the common TUs against the method‐specific TUs (****P < 0.0001).
-
IA ternary plot of Pol I, II, and III enrichment on the GRO‐seq, PRO‐seq, and TT‐seq combined intergenic TU annotations. Pol III cofactors binding sites (Carrière et al, 2012) are pinpointed for Pol III class assignment cross‐validation.
- J
The pluripotent ES cell genome is pervasively transcribed (Efroni et al, 2008). We used labeled TT‐seq signals to de novo define coding and non‐coding TUs in the pluripotent genome separately for each condition and replicate. An R shiny application (TU filter) was developed offering previously described algorithms for TU discovery (Schwalb et al, 2016) within a simple and reproducible visual workflow (Materials and Methods). Overall, 96% of the uniquely mapped TT‐seq‐labeled RNA reads could be assigned to 11,743 GENCODE protein‐coding genes, while approximately 1.5% were assigned to 20,437 new intergenic RNAs called from TT‐seq (Fig 1C). We automatically annotated TUs by their location and direction relative to transcription start site (TSS) or transcription termination site (TTS) of protein‐coding TUs: upstream antisense (uaRNA), convergent (conRNA), cis‐antisense (asRNA), downstream sense (dsRNA), and downstream antisense (daRNA) TUs (Fig EV3A). An example of the TU annotation is shown for the Tbx3 gene neighborhood (Fig 1B). The sensitivity of our TU discovery pipeline was majorly determined by sequencing depth (Fig EV2D), since accuracy of TU calling depends on continuous read coverage. Transcription of intergenic TUs was generally lower than that of protein‐coding genes, suggesting that their cryptic/non‐canonical promoters are generally weaker than divergent gene promoters (Fig 1C). Moreover, asRNA TUs were generally most lowly transcribed (Fig 1D) and exhibited higher interval inconsistency between replicates (Fig EV3B and C). uaRNAs, commonly arising from divergent transcription initiation at gene promoters, were most robustly called and relatively abundant amongst the non‐coding transcripts (Figs 1C–D, EV2D and EV3C).
Figure EV3. TU annotation and transcription variation of mESC pluripotent states.

-
AGene‐associated ncRNA types are classified by their TSS locations relative to the gene TSS: upstream antisense RNA (−1 kb, 0), convergent RNA (0, +1 kb), cis‐antisense RNA (within gene body), downstream antisense RNA (TES, TES + 1 kb), and intergenic RNA.
-
BIllustration of Jaccard index for annotated TUs comparison.
-
CJaccard indexes of mRNA, uaRNA, asRNA, and intergenic RNA intervals similarity in pairwise comparison between each sample replicate.
-
DLabeled RNA purification cross‐contamination rates calculated as the percentage of unlabeled spike‐in reads compared to all spike‐in reads in respective labeled libraries.
-
EPearson's correlations between mRNAs' total and labeled log2FC in 2i 2 days and mTORi 1‐day transition.
-
FPearson's correlations between labeled RNA and total RNA log RPK by each TU location and pluripotent state with merged replicates and log RPK on the combined TUs, the same for H.
-
GCorrelation between 2i and SL2i ground states changes by total (FRNA) and labeled (LRNA) mRNA log2FCs.
-
HPearson's correlations between the pluripotent states by labeled and total RNA log RPK of averaged replicates.
-
IThe coefficients of ChromHMM states are in response to the internal normalized intergenic RNA log2FC differential transcription in 2i 2‐day transition. Two separate logistic regression models are trained for the unidirectional and bidirectional intergenic TUs to predict their log2FC with ChromHMM states. Each states' coefficients are plotted with the respective logistic regression confidence intervals.
-
JIntergenic TU promoter (−500, 200 bp) evolution conservation scores from phastCons 60way (Siepel et al, 2005) with the same groups as in Fig 2C (two‐tailed unpaired Student's t‐test *P < 2.2e‐16). Boxplots are with central bands at the median, 0.25 and 0.75 quartiles box area, 1.5× interquartile range (IQR) whiskers; outliers are hidden.
-
KIntergenic TU intervals stacked total coverage in the ± 100 kb gene neighborhoods by relative strandedness.
-
LIntergenic TU intervals stacked coverage by enhancer and other states in the ± 100 kb gene neighborhoods (n = 11,684).
-
MSingle‐cell gene expression (Buettner et al, 2015) covariance distribution by the gene pairs of random background (n = 4,000), within a topological associated domain (TAD; n = 3,250), consecutive downstream sense (n = 2,058), 3′ ends convergence (n = 3,206), and promoter divergence (n = 3,036). Wilcoxon test is performed against the background covariance. Median covariances of each gene positioning type are indicated for both SL and 2i states.
We also applied the same annotation process to published GRO‐seq and PRO‐seq data to test if lowly expressed TUs were reproducible. GRO‐seq and PRO‐seq identified a larger number of non‐annotated and method‐specific short ncRNAs compared to TT‐seq, reflecting the different labeling preferences of metabolic and run‐on labeling approaches (Fig EV2E–G). 8,001 intergenic TUs commonly called with all three methods showed the highest proportion of FANTOM5 enhancers (Andersson et al, 2014), ATAC‐seq peaks (Atlasi et al, 2019), STARR‐seq enhancers (Peng et al, 2020), but did not show discernable ChIP‐seq signal enrichment for any of the three RNA polymerases (Fig EV2H–J). Hence, the integration of nascent RNA mapping techniques would allow the most robust selection of transcribed TUs with regulatory potential.
Decreased RNA synthesis and transcript copy in 2i and mTORi conditions
Next, we assessed RNA synthesis and turnover changes associated with ground state and paused state transitions using TT‐seq total and labeled RNA from the biological replicates (Fig 1E). We used a set of labeled and unlabeled spike‐in RNAs to scale labeled and total RNA, and predicted RNA turnover and copy number (Materials and Methods). 2i and mTORi treatment slowed down RNA turnover and decreased total RNA abundance per cell (Bulut‐Karslioglu et al, 2016) (Fig 1F and G). Alternatively, contrasting labeled and total transcriptome size factors also demonstrated that 2i and mTORi states had a smaller labeled‐to‐total RNA ratio, hence, slower cell‐level turnover (Fig 1H). Dilution of RNA by cell division and RNA degradation jointly determine cell‐level RNA turnover, and the observed reduction of turnover in 2i and mTORi is in line with a slowdown of cell cycle progression in the respective states (Bulut‐Karslioglu et al, 2016). To exclude the possibility that the different states would have intrinsically different labeling efficiencies, we first confirmed the purity of labeled RNA by assessing cross‐contamination using the spike‐in RNAs (Fig EV3D). Further, we exploited the known tendency of 4sU to mispair with C (leading to a ~10% conversion rate in reverse transcription reactions) (Herzog et al, 2017), to compare the frequency of 4sU incorporation across the pluripotent states. Stable conversion rates suggested a similar incorporation efficiency in all states (Fig 1I). Moreover, analysis of spliced read fraction across the different states showed only minor differences (Fig 1J). In summary, these quality controls assured us that newly synthesized transcripts were labeled consistently and isolated with high purity, therefore, measuring the unbiased RNA synthesis in the pluripotent states.
Genome neighborhood modulates local transcription variation
Next, we evaluated the manner of transcription variation in the state transitions. To assess the contribution of RNA synthesis to changes in total RNA abundance, we compared the pairwise correlation of the new states' labeled RNA and total RNA log2FC (log2 fold change), which suggests that mRNA abundance change is dominantly regulated via RNA synthesis. Further with length‐normalized read counts (log‐RPK), mRNA, intergenic RNA, and asRNA‐labeled RNA levels correlated well with their total RNA levels (Fig EV3E and F), indicating a lesser impact of degradation. On the other hand, ncRNAs, in particular uaRNAs, showed little correlation between labeled and total RNA changes, suggesting that their steady‐state levels are specifically regulated by degradation (Fig EV3F).
Next, we performed differential expression analysis by counting the labeled RNA reads on mRNA, intergenic RNA, uaRNA and asRNA TU intervals, and compared 2i and mTORi to SL conditions after spike‐in normalization (Fig 2A). The early transitioning from SL to mTORi was associated with a homogeneous down‐regulation of both mRNA and ncRNA synthesis (Fig 2A, right), while individual mRNAs and intergenic RNAs showed a highly variable response with both up‐ and down‐regulation in 2i 2‐day adaptation (Fig 2A, left). mRNA synthesis remained largely correlated between SL in mTORi condition, irrespective of initial expression level, while non‐coding transcripts were more variable (Fig 2B, right). In contrast, 2i induced a greater extent of rewiring for both mRNAs and ncRNAs (Fig 2B, left). We wondered if the changes amongst non‐coding TUs followed a dicernable pattern. By large, intergenic TUs transcription log2FC followed mRNA TUs; however, bidirectional TUs, lacking clear enhancer status, appeared to be able to “escape” the global down‐regulation in the 2i transition (Fig 2A and C). We identified the absence of active chromatin states and low evolutionary conservation as a unique characteristic of these refractory bidirectional TUs (Fig EV3I and J).
Figure 2. Pluripotent states transcription rewiring.

- Spike‐in normalized MA plots contrasting 2i (2 days, yellow) and mTORi (1 day, green) to SL state. Local regression lines are in red. Bidirectional non‐enhancer intergenic TUs from (C) are highlighted in pink.
- Pearson's correlation of log‐labeled RNA RPK by mean transcription‐level bins in the same contrasts.
- DESeq2 internal normalized differential expression tests of intergenic RNA labeled RNA read counts. Major and minor TU directions are assigned by labeled RNA RPKs. FANTOM5 (Andersson et al, 2014) mouse enhancers are overlapped with the intergenic TUs. Boxplots are with central bands at the median, 0.25 and 0.75 quartiles box area, 1.5× interquartile range (IQR) whiskers. Statistical test is performed with two‐tailed unpaired Student's t‐test of log2 fold changes (*P < 0.00001).
- Spike‐in‐normalized labeled and total RNA profiles of Myc gene neighborhood (in log‐scale).
- Log2FC correlation of neighboring intergenic TUs (n = 14,978) and genes (n = 7,087) by genomic distance bins, separately for the relative strandedness to genes and the combined enhancer annotations (FANTOM5 (Andersson et al, 2014), ChromHMM (Pintacuda et al, 2017), and STARR‐seq (Peng et al, 2020)).
Because we observed a more facetted transcriptional response to 2i transition than mTORi (Fig EV3H), we wondered whether transcriptional changes of intergenic TUs followed a discernable mechanism. Myc gene expression is known to be strongly attenuated under 2i treatment (Marks et al, 2012; Galonska et al, 2015) but is maintained in mTORi (Bulut‐Karslioglu et al, 2016). Interestingly, we found the same trends on neighboring intergenic TUs (Fig 2D). Globally, intergenic TU transcription changes correlated well with transcription changes of neighboring genes in a direction‐ and proximity‐dependent manner (Figs 2E and EV3K and L). From the ± 100 kb gene neighborhood, the highest correlation was observed for transcription changes within the 20‐kb region downstream antisense, indicating that convergent transcripts to protein‐coding genes experienced stronger co‐regulation (Fig 2E, left). With gene expression covariance analysis of a single‐cell dataset (Kolodziejczyk et al, 2015), we confirmed the transcription positioning influence to be preserved in the SL–2i transition, and also observed correlation of expression between downstream convergent gene pairs (Fig EV3M). In the 2i transition, transcriptional changes at protein‐coding genes correlated slightly better with transcriptional changes at intergenic TUs overlapping an enhancer state than at other intergenic TUs (Fig 2E, top‐right). This is in line with recent studies demonstrating that hard wiring of promoter–enhancer interactions, thus, coordinated changes in transcriptional activity, explains changes in RNA‐seq abundance between SL and 2i states for many differentially regulated genes (Joshi et al, 2015; Novo et al, 2018; Atlasi et al, 2019). In contrast, mTORi induced less locally coordinated changes (Fig 2E, bottom). In summary, our analysis shows that ncTU‐gene position dependent co‐regulation is an important contribution of intergenic transcription variation in pluripotent state transitions.
Estimating elongation velocity with TT‐seq and Pol II coverages
TT‐seq‐labeled RNA coverage allows estimation of productive transcription initiation frequencies (Gressel et al, 2017), which is defined as the number of Pol II molecules that initiate and release into productive elongation, per unit of time. Meanwhile, Pol II ChIP‐seq captures the Pol II occupancy (average density in a cell population) on the DNA template, which is a function of the number of polymerases and their elongation velocity (Ehrensberger et al, 2013). Thus, by combining Pol II occupancy measurements with TT‐seq derived initiation frequencies, Pol II elongation velocities can be estimated (Caizzi et al, 2021). During transcription initiation, TFIIH complex catalyzes Pol II S5p (CTD Serine‐5 phosphorylation) to recruit RNA 5′ end capping enzyme (Søgaard & Svejstrup, 2007; Glover‐Cutter et al, 2009; Hsin & Manley, 2012); therefore, Pol II S5p ChIP signal corresponds to the fraction of productive Pol II amongst the total chromatin‐engaged Pol II molecules (Steurer et al, 2018). Therefore, we used the ratio of TT‐seq density over Pol II S5p density from native MINUTE‐ChIP collected under the same three conditions, SL, 2i, and mTORi as a proxy for relative transcription elongation velocity (Materials and Methods). We resorted to MINUTE‐ChIP for a quantitative profiling of Pol II S5p densities across the three conditions. MINUTE‐ChIP is a multiplexed method in which chromatin from multiple samples is barcoded, pooled, and immunoprecipitated in a single workflow, allowing direct quantitative comparisons (Kumar & Elsässer, 2019). We barcoded triplicate samples of each condition, yielding highly reproducible genome‐wide profiles (Fig EV4A). Integrating TT‐seq and MINUTE‐ChIP profiles, we estimated elongation velocities of 10,386 protein‐coding genes (Materials and Methods, Table EV1). The estimated elongation velocities agreed well when alternative measurements of productive Pol II occupancy (Pol II S2p MINUTE‐ChIP and total Pol II NET‐seq) were used (Fig EV4B and C). In the TT‐seq‐labeled RNA/Pol II S5p profile, a deep minimum after the TSS indicated promoter‐proximal pausing (Muse et al, 2007; Steurer et al, 2018; Bartman et al, 2019); a flat gene body ratio demonstrated the steady elongation velocity; and our transcription velocity estimation also recapitulated the “getting up to speed” model toward elongation termination (Fig 3A–C) (Jonkers et al, 2014; Jonkers & Lis, 2015).
Figure EV4. Pluripotent states transcription dynamics profiles.

- MINUTE‐ChIP Pol II S5p gene occupancy correlation by replicates. Spearman's rank correlation coefficients are indicated.
- Pearson's correlation between Pol II S5p estimated gene velocity and Pol II S2p estimated velocity.
- Pearson's correlation between Pol II S5p estimated gene velocity and NET‐seq estimated velocity. NET‐seq signal density was calculated from gene exons.
- Comparison of the published elongation velocities at the specified time points (Jonkers et al, 2014) and recalculated multi‐time‐point velocity estimates by linear regression (Materials and Methods). Pearson's correlation is shown.
- K‐means grouped estimated elongation velocity gene coverages are plotted in log scale. Upstream 2 kb and downstream 4 kb are extended from Ensembl protein‐coding gene intervals.
- Scatter plots of Pol II S5p gene body occupancy and TT‐seq‐labeled RNA reads density. Pearson's correlation is performed on the log scale.
- Pausing indexes by Pol II S5p compared to RNA synthesis rates for each pluripotent condition. The medians of pausing indexes and Pearson's correlation coefficients are indicated.
- Mean ratio between normalized TT‐seq‐labeled RNA and Pol II S5p coverages of SL and SL2i cells with 10,674 genes. A separate batch of Pol II S5p MINUTE‐ChIP samples is used.
Figure 3. Elongation velocity estimation demarcates transcription dynamics in the pluripotent states.

-
A–CQuantitative MINUTE‐ChIP mean coverage profiles of Pol II S5p, spike‐in‐normalized TT‐seq‐labeled RNA, and their ratio as the estimated elongation velocity on 10,447 genes.
-
DK‐means grouped average profiles of estimated velocity in the SL condition.
-
EIllustration of pausing index calculation (Materials and Methods).
-
FPausing indices of Pol II S5p by different elongation velocity groups in the SL condition (two‐tailed unpaired Student's t‐test *P < 2.2e‐16). Boxplots are with central bands at the median, 0.25 and 0.75 quartiles box area, 1.5× interquartile range (IQR) whiskers; outliers are hidden.
-
GCorrelation of GRO‐seq measured elongation velocity (Materials and Methods) and Pol II S5p gene body occupancy.
-
HCorrelation of GRO‐seq measured velocity and estimated velocity by TT‐seq‐labeled RNA and Pol II S5p MINUTE‐ChIP ratio.
-
IRidge distribution of estimated elongation velocity of the pluripotent states. Medians are shown as vertical dashed lines.
Next, we closely examined the estimated velocity in SL, and used k‐means to classify the estimated elongation velocity profiles of 10,386 protein‐coding genes into three groups with slow, medium, and fast elongation velocity (Figs 3D and EV4E). Acceleration of elongation velocity toward the TES (transcript end site) was only observed for the slow and medium velocity groups (Fig 3D). Beyond the TES, the fast elongation group showed a steeper decline of transcription velocity (Figs 3D and EV4E). We wondered if the velocity groups were indicative of Pol II promoter‐proximal pausing. With the derived pausing intervals from Start‐seq (Dorighi et al, 2017), we calculated the pausing index (PI) as a ratio of Pol II S5p density within the pausing interval versus the gene body (Fig 3E, Materials and Methods). Comparing PIs of slow and medium velocity genes, we confirmed that promoter‐proximal pausing was less prevalent in the fast elongating gene group (Fig 3F).
Next, we sought to validate and calibrate our elongation velocity estimates with direct measurements. Absolute elongation velocities have previously been determined in mESC by inhibiting pause‐release using a Cdk9 inhibition (flavopiridol) followed by a GRO‐seq time course (Jonkers et al, 2014). In this study, performed under SL condition, velocities for 1,197 transcripts were reported; and we were able to extract velocities for a total of 1,944 transcripts by reanalyzing the dataset using our annotation and model fitting with at least two available time points (Materials and Methods). Our recalculated velocities agreed with the published velocities with minor discrepancies due to different linear models applied (Fig EV4D, Materials and Methods), confirmed a reverse relation with our Pol II S5p density (Fig 3G), and agreed with our estimated elongation velocities to a reasonable degree of correlation (Fig 3H). Using the absolute velocity measurements as reference, we were also able to scale our estimated velocities to the reference unit (kb/min). Thus, the labeled RNA/Pol II S5p ratio provides a transcriptome‐wide elongation velocity estimate. These results also highlight the close connectivity of productive Pol II occupancy with its determinants, initiation frequency, and elongation velocity (Figs 3G and EV4F).
Interpretation of elongation velocity in mouse ES cells
To understand the control of elongation velocity, we first sought to correlate elongation velocity in SL condition with other epigenomic features. Many active histone modifications have been found to be associated with elongation velocity (Jonkers et al, 2014; Veloso et al, 2014), for instance, H3K36me3 and H3K79me2. Using our estimation, we confirmed that these two active marks positively correlated with elongation velocity, and in addition identified a positive correlation with several chromatin remodelers (Chd1, Chd2, and Chd9; Fig 4A). Notably, a recent Cryo‐EM structure implicates Chd1 in clearing the nucleosome barrier in front of the RNA polymerase traversing (Farnung et al, 2021). Moreover, we observed anti‐correlation with polycomb repressive marks (H3K27me3, H2Aub, Ezh2, and Ring1b), and also histone variant H2A.Z, which has been shown to be negatively associated with pause‐release (Mylonas et al, 2021) (Fig 4A).
Figure 4. Transcription elongation velocity interpretation.

-
APearson's correlation of estimated gene elongation velocity (n = 10,611) with sequence and chromatin features (Materials and Methods).
-
BEstimated velocity scaled by GRO‐seq measured velocity plotted across the culture conditions and TU types. Boxplots are with central bands at the median, 0.25 and 0.75 quartiles box area, 1.5× interquartile range (IQR) whiskers; outliers are hidden. P‐values are tested with two‐tailed unpaired Student's t‐test in log scale.
-
CEstimated velocity scaled by GRO‐seq measured velocity plotted across the culture conditions and TU types and estimated elongation time from the respective TU lengths. Boxplots are with central bands at the median, 0.25 and 0.75 quartiles box area, 1.5× interquartile range (IQR) whiskers; outliers are hidden. P‐values are from a two‐tailed unpaired Student's t‐test in log scale.
-
D–FEstimated velocity correlation between mRNA, paired uaRNA, and mRNA TSS pausing window (by Start‐seq peaks). Estimated elongation dynamic parameters are in SL condition, the same as below.
-
G, HCorrelation of estimated pausing time (on Start‐seq peaks) with gene body elongation time and pausing index.
-
ICorrelation between estimated mRNA gene body elongation time and pausing index.
We observed Pol II to travel faster through mRNA and intergenic TUs than uaRNA TUs (Fig 4B). Deriving TU elongation time estimates as elongation velocity divided by TU interval length, we found that Pol II spends the longest time traversing through mRNAs, followed by uaRNAs and intergenic TUs (Fig 4C). When looking at pairs of uaRNA and mRNA arising from a shared promoter region, we found a weak correlation in estimated elongation velocity and time (Fig 4D and E).
Next, we wondered if Pol II pause‐release dynamics were connected to elongation velocity or RNA synthesis rate. Transcription initiation frequency is known to weakly anti‐correlate with promoter‐proximal pausing (Sanchez et al, 2018; Gressel et al, 2019). Accordingly, we observed this trend consistently in all the three conditions (Fig EV4G). Intriguingly, pausing time, which corresponds to the time Pol II needs to travel through the promoter‐proximal pausing interval, also showed good correlation with gene elongation time in SL (Fig 4F and G). Further, we also confirmed that the pausing index moderately correlated with pausing time and mRNA elongation time (Fig 4H and I).
Transcription velocity changes in mESC pluripotent states
We expanded our elongation velocity estimation to 2i and mTORi states, and found 2.2 and 3.7 folds decrease in median elongation velocities relative to SL cells (Figs 3I and EV5A). With 2i supplement in serum‐LIF medium, SL2i cells confirmed that the decrease in elongation velocity was an effect of MEK/GSK3β inhibition (Figs EV4H and EV5B). Moreover, 2i cells showed a preferential reduction in fast‐elongated genes, while mTORi cells slowed down more homogeneously (Fig EV5C). For the small fraction of genes with increased velocity in 2i cells, we observed gene ontology (GO) enrichment in negative regulation of cell development (Fig EV5D). In line with the global transcription velocity reduction in 2i and mTORi states, the average pausing times increased from 21 (SL) to 37 (2i) and 61 (mTORi) seconds in the pausing interval of a median length 52 bp (Fig EV5B).
Figure EV5. Transcription dynamics changes in aspect of elongation velocity.

- Distribution of estimated elongation velocity of the pluripotent states after scaling with experimentally measured velocity, the same for (B). Boxplots are with central bands at the median, 0.25 and 0.75 quartiles box area, 1.5× interquartile range (IQR) whiskers; outliers are hidden. Two‐tailed unpaired Student's t‐test is applied (****P < 0.0001).
- Estimation of pausing time in STAR‐seq TSS intervals by the pluripotent states. Boxplots are with central bands at the median, 0.25 and 0.75 quartiles box area, 1.5× interquartile range (IQR) whiskers; outliers are hidden.
- MA plots of elongation velocity changes in 2i and mTORi conditions against estimated velocity in SL condition. Local regression lines were appended to illustrate the trend of changes. The Kit gene, known to be involved in proliferation and self‐renewal, is highlighted as an example of velocity change relative to global trends after state transitions.
- Gene ontology biological processes of top 579 genes with increased elongation velocity in 2i transition.
- Comparisons of the changes of RNA synthesis, elongation velocity, and Pol II pausing index in 2i and mTORi transition.
Transcription termination is associated with elongation velocity
TT‐seq detects transient RNA downstream of the polyadenylation (pA) site/TES, allowing identification of transcription termination sites (TTSs) (Schwalb et al, 2016). To define the genome‐wide main termination sites from our TT‐Seq data, we used a one‐step global profile detection to locate the position that had the greatest difference in the density of labeled RNA before and after this site within a 15‐kb potential termination window downstream of the TES, which allows a rapid scan irrespective of the local fluctuation of read sparsity (Fig 5A, Materials and Methods). Next, we applied the same algorithm to other transcription readout methods (PRO‐seq, GRO‐seq, and Pol II S5p MINUTE‐ChIP), and found that these termination sites agreed with TT‐seq results (Fig EV6A). The observed median termination distance, defined as the interval from the TES to the determined termination site, was 5.1 kb in SL condition.
Figure 5. Transcription termination extension with elongation velocity.

-
AIllustration of termination site detection algorithm on gene Hsp90ab1, by maximizing the read density contrast as shown in the sliding curves (bottom; Materials and Methods). The main termination sites are indicated by dashed lines and solid points for the three cell states.
-
BGene and chromatin feature occupancy in the called termination windows which are binned by termination distances. Density values are log‐normal transformed and standardized to z‐scores.
-
CCorrelation of termination distances with individual chromatin features (upper); multivariate linear regression decomposed R‐squared values for the termination distance explanation (lower; Materials and Methods).
-
DLabeled RNA median coverage in the potential termination window ordered by externally measured elongation velocity (Jonkers et al, 2014).
-
ETermination distances by GRO‐seq measured elongation velocity (Jonkers et al, 2014), with the median sizes from slow to fast are 5.9, 5.8, 6.1, 6.3, and 6.5 kb in SL condition. Boxplots are with central bands at the median, 0.25 and 0.75 quartiles box area, 1.5× interquartile range (IQR) whiskers; outliers are hidden. Global ANOVA test P‐value is indicated, and two‐tailed unpaired Student's t‐test was performed against the “very slow” group (*P < 0.05, **P < 0.01, ****P < 0.0001).
-
FEstimated elongation velocity median coverage in the potential termination window by the mESC states.
-
GTermination sites distribution (above) and estimated elongation velocity coverage heatmaps (below) by the order of termination distance in each condition.
-
HScatter plots of termination distance changes in 2i and mTORi states with mean termination distance.
-
I–KClasses of termination distance changes in 2i and mTORi state. Top, Venn diagrams with Jaccard overlapping index; middle, labeled RNA reads coverage from the last exon (blue box) over the potential termination window (grey box); bottom, GC versus AT nucleotide mean frequency contrast in the 100 bp flanking region around the termination sites.
Figure EV6. Transcription termination sites comparison and explanation.

-
ATermination distances called by TT‐seq, PRO‐seq, and Pol II S5p coverages in the potential termination window were compared by scatter plots.
-
BAverage coverage ± 1 kb around termination sites of TT‐seq‐labeled RNA, Pol II S5p, estimated elongation velocity, FAIRE‐seq, and daRNA TSS occurrence with 10,447 protein‐coding genes.
-
CGradient boosting machine (gbm) model's feature importance of predicting termination distance. Forty‐one genomic features are used, the same as below.
-
DComparison of the actual termination distances and the predicted distances on the hold‐out test set by the gbm model.
-
EA receiver‐operating characteristic curve (ROC) of showing termination distance groups prediction performance with features as described above.
-
F, GScatter plots of elongation velocity changes and termination distance changes of 2i and mTORi transitions.
Analysis of Pol II mutants revealed that mutants with slower elongation velocity exhibited a shorter termination distance (Hazelbaker et al, 2013; Sheridan et al, 2019). To which extent the natural elongation velocity explains transcriptome‐wide termination distance has not been determined yet. Therefore, we analyzed TT‐seq‐labeled RNA coverage in the 15‐kb potential termination window downstream of the TES across groups of genes with increasing elongation velocity, and found that indeed fast elongation genes appeared to have longer labeled RNA coverages (Fig 5D), albeit genes with similar elongation velocity exhibited a considerable variation of termination distances (Fig 5E).
Hence, we wondered if elongation velocity together with other features associated with termination distance. To this end, we calculated the average velocity in the termination window of each gene and evaluated the extent of which elongation velocity versus a panel of epigenomic features can explain the termination distance. We first assumed linear responses and ranked with Pearson's correlation coefficients. Long termination distances (> 9 kb) were associated with faster estimated termination/gene body velocity, higher histone H3K36me3 occupancy, and longer gene lengths. In contrast, short termination distances (< 3 kb) were associated with histone H3.3, Chd8, Chd1 enrichment, and GC content in the called termination window (Fig 5B). A total of 25% termination distance variance could be explained by a multivariate linear model, in which the estimated termination velocity explained the largest variance of termination distance (Fig 5C). We further confirmed the first place of importance of termination velocity via a gradient boosting machine (gbm) non‐linear model, which explained 56% variance with acceptable prediction accuracy of the termination distances (Fig EV6C–E). In addition, at the termination sites (± 1 kb), we also found localization of chromatin accessibility and daRNA initiation, which suggests that antisense transcription collision may participate in the termination process for a small number of cases (Fig EV6B).
Shorter termination distance associates with attenuated elongation velocity
To understand if altered elongation velocity in 2i and mTORi conditions also affected termination distance, we estimated termination sites in 2i and mTORi condition analogous to above, and found that slower elongation in 2i/mTORi coincided with shorter median termination distances: SL 5.1 kb, 2i 4.6 kb, and mTORi 3.9 kb. Termination sites in 2i and mTORi cells were mainly located within 5‐kb downstream of the TES (Fig 5G). Shorter termination distances matched with a higher degree of global velocity decrease in 2i/mTORi cells (Fig 5H). In line with this, the estimated elongation velocity rapidly declined in 2i/mTORi cells within 5 kb after the TES, while in SL condition, median estimated elongation velocity decreased more gradually (Fig 5F). We grouped genes according to their termination distance changes: shortening (29%; 45.2%), extending (14.4%; 9.4%), or remaining unchanged (52.9%; 41.6%), within a ± 500 bp threshold in 2i or mTORi relative to SL (Fig 5I–K). A small fraction of genes for which an extended termination distance in 2i or mTORi was called using the maximum contrast method did not show an extension of the average TT‐seq signal (Fig 5I), comparable to the unchanged termination sites in 2i/mTORi that largely overlapped (Fig 5K). Thus, this result suggests experimental noise in the extended group. On the other hand, labeled RNA coverage for genes with a shortened termination distance in 2i or mTORi dropped to background levels several kb earlier than in SL (Fig 5J).
Regardless of whether or not the termination sites shifted, a sharp local maximum in G/C richness was observed around both the old and new termination sites (Fig 5I and J). This result recapitulates the G/C‐rich motif mapped at the termination sites in K562 cells (Schwalb et al, 2016), and suggests a collection of termination sites to exist and respond to elongation velocity changes in a gene‐specific manner (Fig EV6F and G). Therefore, within a certain range reached by the Pol II termination velocity, sequence context was a consistent determinant for the local termination site choice.
Discussion
With the assistance of quantitative techniques, we find that related pluripotent states of mouse embryonic stem cells exhibit markedly different global transcriptional kinetics. A global reduction of transcription serves to attenuate cell growth in the paused state (Bulut‐Karslioglu et al, 2016), but may also directly antagonize differentiation tendency of paused cells by limiting the accumulation of critical differentiation signaling (Cherepkova et al, 2016) and in a similar fashion promote mESC self‐renewal in the ground state (Stavridis et al, 2007; Ying et al, 2008; Wray et al, 2011). Furthermore, the correlations between total RNA and labeled RNA (Fig EV3E and F) indicate the decisive role of RNA synthesis in regulating the cellular abundance of coding RNAs and certain ncRNA types in mES cells.
Individual genes/TUs may defy the genome‐wide trend of transcriptional changes: pluripotency‐associated T‐box transcription factor Tbx3 mRNA levels and transcription is maintained stably across the three states (Fig 1A); the Kit gene (encoding a receptor tyrosine kinase) transcription accelerates and expression increases in 2i state in line with its known function promoting mESC proliferation and self‐renewal (Fraser et al, 2013; Todaro et al, 2019) (Fig EV5C); and the transcription of non‐enhancer bidirectional intergenic ncTUs shows an increase in 2i cells against the global trend (Fig 2A and C). The overall transcriptional response of 2i ground state cells was mirrored when the 2i inhibitors were added to a SL culture (Fig EV3G), highlighting the important role of MEK/GSK3b signaling in antagonizing the ground state. Intriguingly, 2i‐induced transcriptional changes have been shown to occur with hardwired enhancer–promoter interactions (Atlasi et al, 2019; McLaughlin et al, 2019; Schoenfelder & Fraser, 2019), and 2i‐specific enhancer activation occurs via the loading of Esrrb upon the pre‐existing chromatin interactions (Atlasi et al, 2019), resulting in H3K27ac enrichment (Barakat et al, 2018; Schoenfelder & Fraser, 2019). However, H3K27ac per se appears characteristic not necessary for enhancer function (Sanchez et al, 2018; Zhang et al, 2020). In our data, intergenic ncRNAs, with or without enhancer annotations, showed a strand and distance dependent co‐regulation with neighboring genes (Fig 2E). The global rewiring of cell signaling and metabolism in the ground state, therefore, reveals the large extent of established co‐regulation of non‐coding RNAs with their adjacent genes (Figs 2C and E, and EV3K–M).
Previous estimates of transcription elongation kinetics rely on chemical inhibition and time‐series experiments, either by transcription release after the inhibitor clearance (Danko et al, 2013; Fuchs et al, 2014; Veloso et al, 2014), Pol II initiation/pause‐release repression (Jonkers et al, 2014) or RNA reporter elongation efficacy (Fukaya et al, 2017). Such velocity measurements still leave details of global and local elongation velocity unaddressed (Jonkers & Lis, 2015). Here, our approximation of elongation velocity provides a transcriptome‐wide measurement of several key parameters. First, we were able to estimate the unperturbed elongation velocity coverage continuously from initiation to termination, due to TT‐seq/Pol II flat gene body coverage/occupancy (Fig EV4E). Second, we were able to compare global elongation velocity across different cell states, due to the simplicity of sample normalization compared to the other velocity estimation methods (inhibitor+GRO‐seq or NET‐seq; Fig 3I). Third, we mapped the local transcription velocity in the intervals of specific transcription stages. For average expressed genes, a continuous velocity estimation might not be possible due to the relatively low coverages of TT‐seq‐labeled RNA and Pol II ChIP, but velocity estimation on the entire TU proved reliable across methods (Fig 3H). Beyond the observed association between transcription velocity and chromatin features, our quantitative strategy of transcription kinetics estimation could facilitate the future study via single‐molecule perturbation for dissecting the enigma of local transcription velocity control.
Finally, elongation velocity influences termination site choice to a greater extent than other chromatin features (Fig 5B and C). Through studying conditions with slower global elongation velocities, we reveal that the optimum termination site usage is responsive to elongation velocity in some genes but unresponsive in others (Fig 5J and K). Hence, termination site usage is determined by multiple factors and the exact changes of termination site usage are gene specific and may depend on GC‐rich motif occurrence in the potential termination window (Fig EV6F and G). Our observations are in line with a model where RNA Pol II traverse encounters termination roadblocks imposed by DNA sequence or antisense transcription collision, and the remaining velocity at the encounter stochastically determines if the polymerase comes to a complete stop (Figs 5J and K, and EV6B). Our model is also compatible with the “torpedo” model in which the 5′–3′ exonuclease XRN2 degrades the cleaved nascent transcript and leads to eviction of Pol II once it caught up to Pol II (West et al, 2004; Brannan et al, 2012; Nojima et al, 2015; Baejen et al, 2017). Termination site choice may underlie some stochasticity, and multiple termination sites may be used in a given instance within a population of cells (Fig EV7A–C).
Figure EV7. Pluripotent genes estimated transcription kinetics.

-
A–CExample genes (Tfcp2l1, Tead4, and Etv5) with the kinetic changes of RNA synthesis, elongation velocity, pausing time, and termination distance in the pluripotent states, showing with TT‐seq‐labeled RNA and Pol II‐S5p coverages in mm9 genome (in linear scale).
In sum, our findings reveal transcription kinetics changes in the inhibitor‐induced pluripotent state transitions. Our data support a model in which inhibition of MEK/GSK3β or mTOR signaling pathways by 2i (CHIR99021/PD0325901) or mTORi (INK128) decreases global transcriptional output, total RNA abundance, RNA turnover, RNA polymerase II elongation velocity, and termination distance.
Materials and Methods
Reagents and Tools table
| Reagent/Resource | Reference or Source | Identifier or Catalog Number |
|---|---|---|
| Experimental Models | ||
| 129X1/SvJ (Mus musculus) | Karolinska Center for Transgene Technologies | RW4 |
| Antibodies | ||
| Rabbit anti‐RNA Pol II S5p | Santa Cruz | sc‐47701 |
| Chemicals, Enzymes and other reagents | ||
| Knockout™ DMEM | ThermoFisher | 10829018 |
| FBS | Sigma | F7524 |
| ESGRO LIF | Sigma | ESG1107 |
| GlutaMAX | ThermoFisher | 10565018 |
| Non‐Essential Amino Acid | Sigma | M7145 |
| β‐mercaptoethanol | Sigma | M3148 |
| ESGRO Complete Basal Medium | Millipore | SF002 |
| CHIR99021 | Sigma | SML1046 |
| PD0325901 | Sigma | PZ0162 |
| INK128 | Cayman | CAYM11811‐1 |
| 4‐thiouridine | Sigma‐Aldrich | T4509 |
| TRIzol | ThermoFisher | 15596018 |
| HPDP‐Biotin | ThermoFisher | 21341 |
| Dimethylformamide | VWR | 1.02937.0500 |
| µMACS streptavidin beads | Miltenyi Biotec | 130‐074‐101 |
| DNase set | Qiagen | 79254 |
| Ovation Universal RNA‐Seq kit | NuGEN | 0348 |
| Ampure XP beads | Beckman Coulter | A63881 |
| NextSeq® 500/550 High Output Kit v2 | Illumina | FC‐404‐2005 |
| PhiX Control v3 | Illumina | FC‐110‐3001 |
| 4‐Thio‐UTP | Jena Bioscience, Saveen&Werner | NU‐1156S |
| Micrococcal nuclease | New England BioLabs | M0247S |
| Protein G magnetic beads | BioRad | 161‐4023 |
| Qubit RNA HS Assay Kit | ThermoFisher | Q32852 |
| Qiagen miRNeasy Micro Kit | Qiagen | 217084 |
| DL‐DTT | Sigma | 43815 |
| Agilent RNA 6000 Pico Kit | Agilent | 5067‐1513 |
| Agilent RNA 6000 Nano Kit | Agilent | 5067‐1511 |
| Ethanol absolute | VWR | 20821.296 |
| 2‐propanol | Sigma | I9516‐4X25ML |
| Software | ||
| R | https://www.r‐project.org/ | 3.6.3 |
| Ubuntu | https://ubuntu.com/ | 18.04.5 LTS |
| STAR | Dobin et al (2013) | 2.7.3a |
| DESeq2 | Love et al (2014) | 1.24.0 |
| Rsubread | Liao et al (2019) | 1.34.7 |
| bowtie2 | http://bowtie‐bio.sourceforge.net/bowtie2/index.shtml | 2.3.5.1 |
| HOMER | http://homer.ucsd.edu/homer/ | 4.11 |
| rtracklayer | Bioconductor | 1.46.0 |
| bamsignals | Bioconductor | 1.18.0 |
| Rsamtools | Bioconductor | 2.2.3 |
| STAN | Bioconductor | 2.14.0 |
| caret | CRAN | 6.0‐88 |
| gbm | CRAN | 2.1.8 |
| Other | ||
| Illumina NextSeq 500 | Illumina | |
| Bioruptor | Diagenode | |
| Bioanalyzer | Agilent | |
| TC20™ Cell Counter | Biorad | |
| Countess® Cell Counter | ThermoFisher | |
| Qubit | ThermoFisher | |
| Simpliamp thermal cycler | ThermoFisher | |
Methods and Protocols
Cell culture
Mouse embryonic stem cell RW4 (male, 129X1/SvJ) were cultured in 0.1% gelatin‐coated dishes with serum medium: Knockout DMEM medium with 15% FBS (Sigma, F7524), 0.1 mM ESGRO LIF (Sigma, ESG1107), 2 mM GlutaMAX (ThermoFisher, 10565018), 0.1 mM Non‐Essential Amino Acid (Sigma, M7145), 0.1 mM β‐mercaptoethanol (Sigma, M3148); 2i medium: ESGRO Complete Basal Medium (Millipore, SF002), 3 μM GSK3β inhibitor CHIR99021 (Sigma, SML1046), 1 μM Mek 1/2 inhibitor PD0325901 (Sigma, PZ0162), 0.1 mM LIF. Inhibition of mTOR was in serum‐LIF (SL) medium supplemented with 200 nM INK128 (CAYM11811‐1).
TT‐seq extraction protocol
TT‐seq labeling steps were performed as described before (Schwalb et al, 2016; Gressel et al, 2019) with minor modifications. In short, cells in the different pluripotent media were cultured for 1–2 days in four 15‐cm dishes. One dish was used for cell number counting, and the remaining were supplemented with 500 µM of 4‐thiouridine (4sU; Sigma‐Aldrich, T4509) for 5 min at 37°C and 5% CO2, then immediately quenched by adding TRIzol (ThermoFisher, 15596018) for RNA extraction after mixing with cell number proportional spike‐in RNAs (0.4 ng/million cells). Total RNA was fragmented to an average of 1,000 nt with Bioruptor (Diagenode), then coupled with HPDP‐Biotin (ThermoFisher, 21341) dissolved in dimethylformamide (VWR, 1.02937.0500). A small aliquot was saved as fragmented total RNA (FRNA) and the rest was purified with µMACS streptavidin beads (Miltenyi Biotec, 130‐074‐101). Purified labeled RNA (LRNA) was subjected to DNase digestion (Qiagen, 79254). Total fragmented RNA and labeled RNA libraries were prepared using Ovation Universal RNA‐Seq kit (NuGEN, 0348). The pooled library was size‐selected by Ampure XP beads (Beckman Coulter, A63881) before sequencing with NextSeq® 500/550 High Output Kit v2 (Illumina, FC‐404‐2005, 75 cycles).
Reads alignment and TU annotation
Paired‐end short reads were aligned to mm9 and mm10 genome references (GENCODE) by STAR 2.7.3a with setting:
‐‐outFilterMismatchNoverReadLmax 0.02 ‐‐outFilterMultimapScoreRange 0 ‐‐alignEndsType EndToEnd.
Mapped reads were subjected to transcription unit annotation as described before (Schwalb et al, 2016), processed in parallel by “TU filter” (R shiny app). Briefly, paired‐end reads midpoints were binned into 200‐bp genome coverage matrices by both strands (adding up the matrices if the sample had multiple replicates), then subjected to the binary hidden Markov state calling with R package “GenoSTAN” with “PoissonLog” method. Next, the active state intervals were aggregate as the raw TUs and joined by exons per gene. Non‐coding TU locations were named by their relative position to the nearby coding TUs (Fig EV3A).
TU differential expression analysis was performed by DESeq2 (1.24.0) (Love et al, 2014) with read counts collected from the annotated TU intervals by featureCounts (Rsubread 1.34.7) (Liao et al, 2019). TU correlation tests were performed with kallisto estimated tpm (transcript per million) by indexed transcript references of non‐coding TU (ncTU) genomic sequences together with GENCODE transcriptome vM21 and ERCC spike‐in sequences.
Spike‐in RNA design
ERCC synthetic spike‐in RNAs were used as external references for total RNA and labeled RNA sample size normalization as described before (Schwalb et al, 2016; Gressel et al, 2019), with minor modifications. Briefly, 6 pairs of spike‐in RNAs with 4sU labeled/unlabeled mixture were prepared as below:
| ERCC spike‐in RNA | Concentration (ng/µl) | Labeled rate (%) |
|---|---|---|
| Sp2 (ERCC‐00043) | 1 | 100 |
| Sp4 (ERCC‐00136) | 0.1 | 100 |
| Sp5 (ERCC‐00145) | 1 | 10 |
| Sp8 (ERCC‐00092) | 0.1 | 10 |
| Sp9 (ERCC‐00002) | 1 | 0 |
| Sp12 (ERCC‐00170) | 0.1 | 0 |
For every million cells, 0.4‐ng spike‐in mix was added into the TRIzol (ThermoFisher, 15596018) cell lysis to eliminate technical errors retained during the steps of biotinylating, RNA purification, and library preparation.
Sample size estimation
We quantified GENCODE transcripts, de novo annotated ncTUs and spike‐in transcripts using the alignment‐free mapper (kallisto 0.46.2). For normalization, we calculated the size factors of all spike‐ins in the total transcriptome and labeled spike‐ins (2, 4, 5, 8) in the labeled transcriptome according to DESeq's method (Love et al, 2014). The normalized transcriptomes were subjected to size factor calculation to be presented as the relative sample sizes of both total and labeled RNA abundance (Fig 1H).
RNA synthesis rate estimation
The estimated labeled/total RNA read counts of GENCODE transcripts and spike‐ins from kallisto were first normalized by respective spike‐ins sizes from each sample. The gene‐level estimated counts took the sum of transcript variants. A linear model was trained only with the normalized labeled spike‐ins (Sp2, Sp4, Sp5, and Sp8) log2 labeled (X L) and total (X F) read counts in response to the respective label rates r: log2(r) ~ X F + X L. The evaluation was performed with five‐fold cross‐validation with 10 times subsampling, and the predictions on the hold‐on set were examined. Then, a final model with spike‐in counts of all samples was trained, which resulted in an adjusted R‐squared 0.9917. The labeling rate was predicted from transcript's labeled/total read counts. RNA half‐life was extrapolated from the predicted labeled rate in 5 min to replacing half of the current total RNA amount. In the same way, a second model was trained to predict all spike‐in weights per cell w: log2(w) ~ X F + X L, which resulted in an adjusted R‐squared 0.991. Then, the copy number per cell was transformed from weight with transcript effective length from kallisto. Finally, RNA synthesis rate (cell−1 min−1, or copy/min per cell) was calculated by multiplying labeled rate and transcript copy number.
MINUTE ChIP library preparation
For each growth condition (SL, 2i, and SL + mTOR inhibitor), three pellets of 1 × 106 cells were collected and samples were processed according to the MINUTE‐ChIP workflow (Kumar & Elsässer, 2019). Briefly, native cell pellets were lysed and DNA was digested to mono‐ to tri‐nucleosome fragments using micrococcal nuclease (New England BioLabs,M0247S) in Lysis buffer (100 mM Tris‐HCL [pH 8.0], 0.2% Triton X‐100, 0.1% sodium deoxycholate, 10 mM CaCl2, and 1× PIC). Subsequently, dsDNA adaptors (containing T7 promoter, 8bp sample barcode, and a 6 bp unique molecular identifier [UMI]) were ligated onto the fragmented chromatin in the same pot supplemented with blunting and ligation reagents. Barcoded samples were then pooled and aliquoted for ChIP (saving 5% as Input) for 4 h at 4°C with Protein G magnetic beads (BioRad; 161‐4023) pre‐coupled with 5 µg of RNA Pol II S5p antibody (Santa Cruz; sc‐47701). Next, ChIP and input DNA were purified and final libraries were generated through sequential steps of in vitro transcription, RNA 3′ adapter ligation, reverse transcription, and PCR amplification. Post library cleanup and quality assessment, libraries were combined at 4‐nM concentration and sequenced on the Illumina NextSeq500 platform.
Read coverage
Short reads genome coverage was processed with R/Bioconductor packages “rtracklayer”, “bamsignals”, and “rsamtools” by piling up only the uniquely mapped and paired‐end reads with insertion size < 2 kb. Sample‐wise coverage of each genomic interval was normalized by the respective size factors. For TT‐seq sample size factors were generated from spike‐in RNA read counts; for MINUTE‐ChIP samples, scaling factors were from the respective ChIP and input sizes after combining sample replicates. In the heatmaps and coverage profile plots, each coverage vector of different lengths was resized by “spline” function to the same number of positions.
Pol I, Pol II, and Pol III TU classification
RNA polymerases ChIP‐seq datasets ((Jiang et al, 2020), GSE145791) were aligned to mm10 genome by bowtie2 v2.3.5 with “‐‐local” setting. Each TU's Pol I, Pol II, and Pol III density were divided by the sum to obtain relative occupancies, and a threshold of 90% enrichment was used for the classification indicated in the ternary plot (Fig EV2I). For validation, ChIP‐seq data of Pol III subunits PRC1, PRC4, and the cofactors BRF1, TFIII ((Carrière et al, 2012), E‐MTAB‐767) were aligned and subjected to peak calling with MACS2 default setting at “−q 0.01” cutoff, and overlapped on the ternary plot of the combined intergenic TUs (GRO‐seq, PRO‐seq, and TT‐seq).
Epigenome feature extraction
Promoter DNA sequences were extracted around the TSS (−1,000 to +50 nt) on the sense strand. CpG number was by CG dinucleotide, TATA number was by the “TATA” pattern. For counting ChIP‐seq signal density, TSS intervals (−500 to 500 bp) and Ensembl (GRCm38.79) gene body intervals were used, with the samples as below:
| Features | Types | Gene parts | Catalogue | Reference |
|---|---|---|---|---|
| TSS DNAme. | DNA | TSS | GSM1127953 | Galonska et al (2015) |
| GB DNAme. | DNA | Gene body | GSM1127953 | Galonska et al (2015) |
| DHS | DNA | TSS | GSM1014154 | Vierstra et al (2014) |
| E2f1 | General TF | TSS | GSM288349 | Chen et al (2008) |
| TBP | General TF | TSS | GSM1816104 | Langer et al (2016) |
| TFIID | General TF | TSS | GSM958503 | Ku et al (2012) |
| Sp1 | General TF | TSS | GSM3258754 | Hartl et al (2019) |
| Aff4 | General TF | Gene body | GSM749810 | Lin et al (2011) |
| cMyc | Pluripotent TF | TSS | GSM2417145 | Chronis et al (2017) |
| Esrrb | Pluripotent TF | TSS | GSM2417188 | Chronis et al (2017) |
| Klf4 | Pluripotent TF | TSS | GSM2417144 | Chronis et al (2017) |
| Nanog | Pluripotent TF | TSS | GSM2417187 | Chronis et al (2017) |
| Oct4 | Pluripotent TF | TSS | GSM2417142 | Chronis et al (2017) |
| Sox2 | Pluripotent TF | TSS | GSM2417143 | Chronis et al (2017) |
| CTCF | Enhancer Activity | TSS | GSM3615255 | Atlasi et al (2019) |
| p300 | Enhancer Activity | TSS | GSM2417169 | Chronis et al (2017) |
| Hdac1 | Enhancer Activity | TSS | GSM2417173 | Chronis et al (2017) |
| Med1 | Enhancer Activity | TSS | GSM3084070 | Sabari et al (2018) |
| Brd4 | Enhancer Activity | TSS | GSM3084073 | Sabari et al (2018) |
| Yy1 | Enhancer Activity | Gene body | GSM2645362 | Weintraub et al (2017) |
| Chd1 | Remodeler | Gene body | GSM1581288 | de Dieuleveult et al (2016) |
| Chd2 | Remodeler | Gene body | GSM1581290 | de Dieuleveult et al (2016) |
| Chd9 | Remodeler | Gene body | GSM1581298 | de Dieuleveult et al (2016) |
| HP1a | Domain | Gene body | GSM2582363 | Ostapcuk et al (2018) |
| LaminB | Domain | Gene body | GSM2579539 | Poleshko et al (2017) |
| Ezh2 | Heterochromatin | TSS | GSM2805185 | Ardehali et al (2017) |
| Ring1b | Heterochromatin | Gene body | GSM2393579 | Kundu et al (2018) |
| H2AZ | Histone | Gene body | GSM1287699 | Surface et al (2016) |
| H2AX | Histone | Gene body | GSM1847704 | Wu et al (2016) |
| H33 YFP | Histone | Gene body | GSM2582412 | Chen et al (2018) |
| H2BK20ac | Acetylation | Gene body | GSM1874093 | Kumar et al (2016) |
| H3K9ac | Acetylation | Gene body | GSM2417092 | Chronis et al (2017) |
| H3K27ac | Acetylation | Gene body | GSM2417096 | Chronis et al (2017) |
| H3K56ac | Acetylation | Gene body | GSM3747805 | Etchegaray et al (2019) |
| H3K64ac | Acetylation | Gene body | GSM3143869 | Martire et al (2019) |
| H3K122ac | Acetylation | Gene body | GSM3143871 | Martire et al (2019) |
| H3K4me1 | Methylation | TSS | GSM2417088 | Chronis et al (2017) |
| H3K4me3 | Methylation | TSS | GSM2417080 | Chronis et al (2017) |
| H3K9me3 | Methylation | Gene body | GSM2417112 | Chronis et al (2017) |
| H3K27me3 | Methylation | Gene body | GSM2417100 | Chronis et al (2017) |
| H3K36me2 | Methylation | Gene body | GSM3772688 | Weinberg et al (2019) |
| H3K36me3 | Methylation | Gene body | GSM2417108 | Chronis et al (2017) |
| H3K79me2 | Methylation | Gene body | GSM2417104 | Chronis et al (2017) |
| H2AUb | Ubiquitination | Gene body | GSM2393583 | Kundu et al (2017) |
| H2AK119ub1 | Ubiquitination | Gene body | GSM2865672 | Yao et al (2018) |
Log1p‐transformed ChIP read densities were trimmed at 99.5% quantile to remove technical outliers, and standardized to normal distribution N(0, 1). Each feature explaining a particular response (e.g., termination distance, Fig 5C) was decomposed for R 2, by a multivariate linear regression through the origin.
Pol II pausing index
Pol II TSS pausing intervals were generated from the closest 5′ capped RNA peaks (Start‐seq (Dorighi et al, 2017), GSM2586572). Start‐seq reads were aligned by STAR 2.7.3a and called peaks by HOMER 4.11 (Heinz et al, 2010) with the following setting: findPeaks ‐style groseq ‐size 20 ‐fragLength 20 ‐inputFragLength 40 ‐tssSize 5 ‐minBodySize 30 ‐pseudoCount 1. Pausing intervals of the active genes were assigned by the closest non‐redundant capped RNA intervals, which were used for TSS peak density calculation. Pol II S5p MINUTE‐ChIP bam coverages, processed as described before (Kumar & Elsässer, 2019), were subjected to pausing interval density and gene body density extraction from (+500, +1,500 bp) interval. The resulting ratios were used as the pausing index (Fig 3F).
Transcription elongation velocity estimation
The time course Cdk9 inhibition GRO‐seq samples (Jonkers et al, 2014) were annotated by TU filter to capture ongoing transcription events. The current travel distance from gene TSS was retrieved by the nearest TU fragment annotation, and was subjected to a linear regression model in response to Cdk9 inhibition times where the slope coefficient represented reversed elongation velocity and the intersection term adjusted for response time delay. The resulting elongation velocities for 1,944 genes were used as the “measured elongation velocity” for cross‐validation. Elongation velocities v can also be estimated from the ratio of the number of polymerases released into elongation, as measured by TT‐seq, over the Pol II occupancy (Gressel et al, 2019; Caizzi et al, 2021). Thus, to derive “estimated elongation velocity” from our multi‐omics data, we combined TT‐seq LRNA coverage with Pol II S5p MINUTE ChIP‐seq coverage as follows:
This approximation allows relative comparison between different conditions as long as the numerator and denominator terms, TT‐Seq and Pol II S5p signals, are quantitative. To this end, TT‐Seq normalization by external spike‐ins is required, and MINUTE‐ChIP quantitative scaling is carried out as described before (Kumar & Elsässer, 2019). MINUTE‐ChIP is a multiplexed ChIP method in which input chromatin is fragmented using MNase and then barcoded and pooled from the different conditions. The native ChIP is performed on an aliquot of the pooled chromatin input, and the resulting Illumina library is demultiplexed by the sample barcode, enabling quantitative comparison amongst all samples in the pool. One important advantage of MINUTE‐ChIP over a standard ChIP‐seq approach with added spike‐in for normalization (e.g., ChIP‐Rx) is that the pooled samples experience the identical IP and wash conditions, removing sample‐to‐sample and batch variation that can confound even spike‐in ChIP. A table of all estimated velocities is provided as Table EV1.
Calling termination sites
The potential termination windows were scanned in the 15‐kb genome interval extending from the last exon ends of each gene, before the TSS of downstream consecutive genes. TT‐seq LRNA reads coverages were piled up with only the first read in pair, scaled with respective spike‐ins, and averaged by sample replicates. To capture the tipping point of RNA synthesis as transcription passes the termination site, we determined the position that gave the maximum contrast of the log‐transformed labeled RNA densities before and after this site. In practice, TT‐seq‐labeled RNA read coverage in the potential termination window was scaled (mean = 0) and their cumulative sums from the beginning to the end of the potential termination window were calculated. The maximum position was defined as the termination site (Fig 5A).
Non‐linear model prediction of termination distance
We used a tree‐based gradient boosting model (gbm, R package) to evaluate the non‐linear response of termination distance with 41 chromatin features. The model training control was 10‐fold cross‐validation, and with the tuned depth 40 and 1,000 trees under 0.1 shrinkage. Test set was split by the ratio 0.2 of total cases (n = 8,348). The trained model of predicting numerical termination distance was applied to the test set, and compared to the actual distances (Fig EV6D). Feature importance was extracted from this model as complement to the R‐squared linear explanation (Fig 5C). We evaluated gbm performance with binned termination distance groups (3, 5, 8, 15 kb) and ROC (receiver‐operating characteristic) curves were then plotted for evaluation (Fig EV6E).
Author contributions
Rui Shao: Conceptualization; Resources; Data curation; Software; Formal analysis; Validation; Investigation; Visualization; Methodology; Writing – original draft; Writing – review & editing. Banushree Kumar: Investigation; Methodology; Writing – original draft; Writing – review & editing. Katja Liedschreiber: Supervision; Investigation; Methodology. Michael Lidschreiber: Conceptualization; Data curation; Formal analysis; Supervision; Validation; Methodology; Writing – original draft; Writing – review & editing. Patrick Cramer: Conceptualization; Supervision; Funding acquisition; Validation; Project administration; Writing – review & editing. Simon J Elsässer: Conceptualization; Data curation; Supervision; Funding acquisition; Investigation; Methodology; Writing – original draft; Project administration; Writing – review & editing.
In addition to the CRediT author contributions listed above, the contributions in detail are:
RS and SJE conceived study. RS performed TT‐seq and BK performed MINUTE‐ChIP experiments. RS analyzed all data. KL, ML, and PC advised on TT‐seq. RS generated figures. RS and SJE wrote the manuscript with input from all authors.
Disclosure statement and competing interests
The author declares that he has no conflict of interest.
Supporting information
Expanded View Figures PDF
Table EV1
Table EV2
Acknowledgements
Bioinformatics analyses were performed on resources provided by the Swedish National Infrastructure for Computing (SNIC) at Uppmax server (projects SNIC 2020/15‐9, SNIC 2020/6‐3, uppstore2018208, SNIC 2018/3‐669, sllstore2017057, SNIC 2017/1‐508). We thank members of the Elsässer laboratory for comments and help with experiments and analysis. K.L., M.L., and P.C. were funded by CIMED and SciLifeLab. S.J.E. acknowledges funding by the Karolinska Institutet SFO for Molecular Biosciences, Vetenskapsrådet (2015‐04815, 2020‐04313), H2020 ERC Starting Grant (715024 RAPID), Åke Wibergs Stiftelse (M15‐0275), Cancerfonden (2015/430). R.S. acknowledges funding from the Chinese Scholarship Council.
Mol Syst Biol. (2022) 18: e10407
Data availability
Primary and processed data generated for this study have been submitted to the Gene Expression Omnibus under GSE168378 (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE168378; TT‐seq) and GSE126252 (http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE126252; MINUTE‐ChIP).
Transcription unit annotation R shiny app local workflow can be found on GitHub https://github.com/shaorray/TU_filter.
Data analysis steps are also available on Github: https://github.com/shaorray/TT‐seq_mESC_pluripotency.
References
- Andersson R, Gebhard C, Miguel‐Escalada I, Hoof I, Bornholdt J, Boyd M, Chen Y, Zhao X, Schmidl C, Suzuki T et al (2014) An atlas of active enhancers across human cell types and tissues. Nature 507: 455–461 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ardehali MB, Anselmo A, Cochrane JC, Kundu S, Sadreyev RI, Kingston RE (2017) Polycomb repressive complex 2 methylates elongin A to regulate transcription. Mol Cell 68: 872–884.e6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Atlasi Y, Stunnenberg HG (2017) The interplay of epigenetic marks during stem cell differentiation and development. Nat Rev Genet 18: 643–658 [DOI] [PubMed] [Google Scholar]
- Atlasi Y, Megchelenbrink W, Peng T, Habibi E, Joshi O, Wang S‐Y, Wang C, Logie C, Poser I, Marks H et al (2019) Epigenetic modulation of a hardwired 3D chromatin landscape in two naive states of pluripotency. Nat Cell Biol 21: 568–578 [DOI] [PubMed] [Google Scholar]
- Baejen C, Andreani J, Torkler P, Battaglia S, Schwalb B, Lidschreiber M, Maier KC, Boltendahl A, Rus P, Esslinger S et al (2017) Genome‐wide analysis of RNA polymerase II termination at protein‐coding genes. Mol Cell 66: 38–49.e6 [DOI] [PubMed] [Google Scholar]
- Barakat TS, Halbritter F, Zhang M, Rendeiro AF, Perenthaler E, Bock C, Chambers I (2018) Functional dissection of the enhancer repertoire in human embryonic stem cells. Cell Stem Cell 23: 276–288.e8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bartman CR, Hamagami N, Keller CA, Giardine B, Hardison RC, Blobel GA, Raj A (2019) Transcriptional burst initiation and polymerase pause release are key control points of transcriptional regulation. Mol Cell 73: 519–532.e4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Benabdallah NS, Williamson I, Illingworth RS, Kane L, Boyle S, Sengupta D, Grimes GR, Therizols P, Bickmore WA (2019) Decreased enhancer‐promoter proximity accompanying enhancer activation. Mol Cell 76: 473–484.e7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brannan K, Kim H, Erickson B, Glover‐Cutter K, Kim S, Fong N, Kiemele L, Hansen K, Davis R, Lykke‐Andersen J et al (2012) mRNA decapping factors and the exonuclease Xrn2 function in widespread premature termination of RNA polymerase II transcription. Mol Cell 46: 311–324 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brown DA, Di Cerbo V, Feldmann A, Ahn J, Ito S, Blackledge NP, Nakayama M, McClellan M, Dimitrova E, Turberfield AH et al (2017) The SET1 complex selects actively transcribed target genes via multivalent interaction with CpG island chromatin. Cell Rep 20: 2313–2327 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Buettner F, Natarajan KN, Casale FP, Proserpio V, Scialdone A, Theis FJ, Teichmann SA, Marioni JC, Stegle O (2015) Computational analysis of cell‐to‐cell heterogeneity in single‐cell RNA‐sequencing data reveals hidden subpopulations of cells. Nat Biotechnol 33: 155–160 [DOI] [PubMed] [Google Scholar]
- Bulut‐Karslioglu A, Biechele S, Jin H, Macrae TA, Hejna M, Gertsenstein M, Song JS, Ramalho‐Santos M (2016) Inhibition of mTOR induces a paused pluripotent state. Nature 540: 119–123 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Caizzi L, Monteiro‐Martins S, Schwalb B, Lysakovskaia K, Schmitzova J, Sawicka A, Chen Y, Lidschreiber M, Cramer P (2021) Efficient RNA polymerase II pause release requires U2 snRNP function. Mol Cell 81: 1920–1934.e9 [DOI] [PubMed] [Google Scholar]
- Carrière L, Graziani S, Alibert O, Ghavi‐Helm Y, Boussouar F, Humbertclaude H, Jounier S, Aude J‐C, Keime C, Murvai J et al (2012) Genomic binding of Pol III transcription machinery and relationship with TFIIS transcription factor distribution in mouse embryonic stem cells. Nucleic Acids Res 40: 270–283 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen XI, Xu H, Yuan P, Fang F, Huss M, Vega VB, Wong E, Orlov YL, Zhang W, Jiang J et al (2008) Integration of external signaling pathways with the core transcriptional network in embryonic stem cells. Cell 133: 1106–1117 [DOI] [PubMed] [Google Scholar]
- Chen CCL, Goyal P, Karimi MM, Abildgaard MH, Kimura H, Lorincz MC (2018) H3S10ph broadly marks early‐replicating domains in interphase ESCs and shows reciprocal antagonism with H3K9me2. Genome Res 28: 37–51 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cherepkova MY, Sineva GS, Pospelov VA (2016) Leukemia inhibitory factor (LIF) withdrawal activates mTOR signaling pathway in mouse embryonic stem cells through the MEK/ERK/TSC2 pathway. Cell Death Dis 7: e2050 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Choi J, Lysakovskaia K, Stik G, Demel C, Söding J, Tian TV, Graf T, Cramer P (2021) Evidence for additive and synergistic action of mammalian enhancers during cell fate determination. eLife 10: e65381 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chronis C, Fiziev P, Papp B, Butz S, Bonora G, Sabri S, Ernst J, Plath K (2017) Cooperative binding of transcription factors orchestrates reprogramming. Cell 168: 442–459.e20 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Danko CG, Hah N, Luo X, Martins AL, Core L, Lis JT, Siepel A, Kraus WL (2013) Signaling pathways differentially affect RNA polymerase II initiation, pausing, and elongation rate in cells. Mol Cell 50: 212–222 [DOI] [PMC free article] [PubMed] [Google Scholar]
- de Dieuleveult M, Yen K, Hmitou I, Depaux A, Boussouar F, Dargham DB, Jounier S, Humbertclaude H, Ribierre F, Baulard C et al (2016) Genome‐wide nucleosome specificity and function of chromatin remodellers in ES cells. Nature 530: 113–116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dobin A, Davis CA, Schlesinger F, Drenkow J, Zaleski C, Jha S, Batut P, Chaisson M, Gingeras TR (2013) STAR: ultrafast universal RNA‐seq aligner. Bioinformatics 29: 15–21 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dorighi KM, Swigut T, Henriques T, Bhanu NV, Scruggs BS, Nady N, Still CD, Garcia BA, Adelman K, Wysocka J (2017) Mll3 and Mll4 facilitate enhancer RNA synthesis and transcription from promoters independently of H3K4 monomethylation. Mol Cell 66: 568–576.e4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Efroni S, Duttagupta R, Cheng J, Dehghani H, Hoeppner DJ, Dash C, Bazett‐Jones DP, Le Grice S, McKay RDG, Buetow KH et al (2008) Global transcription in pluripotent embryonic stem cells. Cell Stem Cell 2: 437–447 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ehrensberger AH, Kelly GP, Svejstrup JQ (2013) Mechanistic interpretation of promoter‐proximal peaks and RNAPII density maps. Cell 154: 713–715 [DOI] [PubMed] [Google Scholar]
- Elsässer SJ, Noh K‐M, Diaz N, Allis CD, Banaszynski LA (2015) Histone H3.3 is required for endogenous retroviral element silencing in embryonic stem cells. Nature 522: 240–244 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Engreitz JM, Haines JE, Perez EM, Munson G, Chen J, Kane M, McDonel PE, Guttman M, Lander ES (2016) Local regulation of gene expression by lncRNA promoters, transcription and splicing. Nature 539: 452–455 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Etchegaray J‐P, Zhong L, Li C, Henriques T, Ablondi E, Nakadai T, Van Rechem C, Ferrer C, Ross KN, Choi J‐E et al (2019) The histone deacetylase SIRT6 restrains transcription elongation via promoter‐proximal pausing. Mol Cell 75: 683–699.e7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Farnung L, Ochmann M, Engeholm M, Cramer P (2021) Structural basis of nucleosome transcription mediated by Chd1 and FACT. Nat Struct Mol Biol 28: 382–387 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Finley LWS, Vardhana SA, Carey BW, Alonso‐Curbelo D, Koche R, Chen Y, Wen D, King B, Radler MR, Rafii S et al (2018) Pluripotency transcription factors and Tet1/2 maintain Brd4‐independent stem cell identity. Nat Cell Biol 20: 565–574 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Flynn RA, Do BT, Rubin AJ, Calo E, Lee B, Kuchelmeister H, Rale M, Chu CI, Kool ET, Wysocka J et al (2016) 7SK‐BAF axis controls pervasive transcription at enhancers. Nat Struct Mol Biol 23: 231–238 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Frankish A, Diekhans M, Jungreis I, Lagarde J, Loveland J, Mudge JM, Sisu C, Wright JC, Armstrong J, Barnes IF et al (2021) GENCODE 2021. Nucleic Acids Res 49: D916–D923 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fraser L, Taylor AH, Forrester LM (2013) SCF/KIT inhibition has a cumulative but reversible effect on the self‐renewal of embryonic stem cells and on the survival of differentiating cells. Cell Reprogram 15: 259–268 [DOI] [PubMed] [Google Scholar]
- Fuchs G, Voichek Y, Benjamin S, Gilad S, Amit I, Oren M (2014) 4sUDRB‐seq: measuring genomewide transcriptional elongation rates and initiation frequencies within cells. Genome Biol 15: R69 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fukaya T, Lim B, Levine M (2017) Rapid rates of pol II elongation in the drosophila embryo. Curr Biol 27: 1387–1391 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Galonska C, Ziller MJ, Karnik R, Meissner A (2015) Ground state conditions induce rapid reorganization of core pluripotency factor binding before global epigenetic reprogramming. Cell Stem Cell 17: 462–470 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ghimire S, Van der Jeught M, Neupane J, Roost MS, Anckaert J, Popovic M, Van Nieuwerburgh F, Mestdagh P, Vandesompele JO, Deforce D et al (2018) Comparative analysis of naive, primed and ground state pluripotency in mouse embryonic stem cells originating from the same genetic background. Sci Rep 8: 5884 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Glover‐Cutter K, Larochelle S, Erickson B, Zhang C, Shokat K, Fisher RP, Bentley DL (2009) TFIIH‐associated Cdk7 kinase functions in phosphorylation of C‐terminal domain Ser7 residues, promoter‐proximal pausing, and termination by RNA polymerase II. Mol Cell Biol 29: 5455–5464 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gressel S, Schwalb B, Decker TM, Qin W, Leonhardt H, Eick D, Cramer P (2017) CDK9‐dependent RNA polymerase II pausing controls transcription initiation. eLife 6: e29736 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gressel S, Schwalb B, Cramer P (2019) The pause‐initiation limit restricts transcription activation in human cells. Nat Commun 10: 3603 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hartl D, Krebs AR, Grand RS, Baubec T, Isbel L, Wirbelauer C, Burger L, Schübeler D (2019) CG dinucleotides enhance promoter activity independent of DNA methylation. Genome Res 29: 554–563 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hazelbaker DZ, Marquardt S, Wlotzka W, Buratowski S (2013) Kinetic competition between RNA Polymerase II and Sen1‐dependent transcription termination. Mol Cell 49: 55–66 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heinz S, Benner C, Spann N, Bertolino E, Lin YC, Laslo P, Cheng JX, Murre C, Singh H, Glass CK (2010) Simple combinations of lineage‐determining transcription factors prime cis‐regulatory elements required for macrophage and B cell identities. Mol Cell 38: 576–589 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Herzog VA, Reichholf B, Neumann T, Rescheneder P, Bhat P, Burkard TR, Wlotzka W, von Haeseler A, Zuber J, Ameres SL (2017) Thiol‐linked alkylation of RNA to assess expression dynamics. Nat Methods 14: 1198–1204 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hsin J‐P, Manley JL (2012) The RNA polymerase II CTD coordinates transcription and RNA processing. Genes Dev 26: 2119–2137 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jiang Y, Huang J, Lun K, Li B, Zheng H, Li Y, Zhou R, Duan W, Wang C, Feng Y et al (2020) Genome‐wide analyses of chromatin interactions after the loss of Pol I, Pol II, and Pol III. Genome Biol 21: 158 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jonkers I, Kwak H, Lis JT (2014) Genome‐wide dynamics of pol II elongation and its interplay with promoter proximal pausing, chromatin, and exons. eLife 3: e02407 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jonkers I, Lis JT (2015) Getting up to speed with transcription elongation by RNA polymerase II. Nat Rev Mol Cell Biol 16: 167–177 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Joshi O, Wang S‐Y, Kuznetsova T, Atlasi Y, Peng T, Fabre PJ, Habibi E, Shaik J, Saeed S, Handoko L et al (2015) Dynamic reorganization of extremely long‐range promoter‐promoter interactions between two states of pluripotency. Cell Stem Cell 17: 748–757 [DOI] [PubMed] [Google Scholar]
- Kolodziejczyk A, Kim J, Tsang J, Ilicic T, Henriksson J, Natarajan K, Tuck A, Gao X, Bühler M, Liu P et al (2015) Single cell RNA‐sequencing of pluripotent states unlocks modular transcriptional variation. Cell Stem Cell 17: 471–485 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ku M, Jaffe JD, Koche RP, Rheinbay E, Endoh M, Koseki H, Carr SA, Bernstein BE (2012) H2A.Z landscapes and dual modifications in pluripotent and multipotent stem cells underlie complex genome regulatory functions. Genome Biol 13: R85 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kumar V, Rayan NA, Muratani M, Lim S, Elanggovan B, Xin L, Lu T, Makhija H, Poschmann J, Lufkin T et al (2016) Comprehensive benchmarking reveals H2BK20 acetylation as a distinctive signature of cell‐state‐specific enhancers and promoters. Genome Res 26: 612–623 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kumar B, Elsässer SJ (2019) Quantitative multiplexed ChIP reveals global alterations that shape promoter bivalency in ground state embryonic stem cells. Cell Rep 28: 3274–3284.e5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kundu S, Ji F, Sunwoo H, Jain G, Lee JT, Sadreyev RI, Dekker J, Kingston RE (2017) Polycomb repressive complex 1 generates discrete compacted domains that change during differentiation. Mol Cell 65: 432–446.e5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kundu S, Ji F, Sunwoo H, Jain G, Lee JT, Sadreyev RI, Dekker J, Kingston RE (2018) Polycomb repressive complex 1 generates discrete compacted domains that change during differentiation. Mol Cell 71: 191 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Langer D, Martianov I, Alpern D, Rhinn M, Keime C, Dollé P, Mengus G, Davidson I (2016) Essential role of the TFIID subunit TAF4 in murine embryogenesis and embryonic stem cell differentiation. Nat Commun 7: 11063 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liao Y, Smyth GK, Shi W (2019) The R package Rsubread is easier, faster, cheaper and better for alignment and quantification of RNA sequencing reads. Nucleic Acids Res 47: e47 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lidschreiber K, Jung LA, von der Emde H, Dave K, Taipale J, Cramer P, Lidschreiber M (2021) Transcriptionally active enhancers in human cancer cells. Mol Syst Biol 17: e9873 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin C, Garrett AS, De Kumar B, Smith ER, Gogol M, Seidel C, Krumlauf R, Shilatifard A (2011) Dynamic transcriptional events in embryonic stem cells mediated by the super elongation complex (SEC). Genes Dev 25: 1486–1498 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lloret‐Llinares M, Karadoulama E, Chen Y, Wojenski LA, Villafano GJ, Bornholdt J, Andersson R, Core L, Sandelin A, Jensen TH (2018) The RNA exosome contributes to gene expression regulation during stem cell differentiation. Nucleic Acids Res 46: 11502–11513 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Love MI, Huber W, Anders S (2014) Moderated estimation of fold change and dispersion for RNA‐seq data with DESeq2. Genome Biol 15: 550 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marks H, Kalkan T, Menafra R, Denissov S, Jones K, Hofemeister H, Nichols J, Kranz A, Francis Stewart A, Smith A et al (2012) The transcriptional and epigenomic foundations of ground state pluripotency. Cell 149: 590–604 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martire S, Gogate AA, Whitmill A, Tafessu A, Nguyen J, Teng Y‐C, Tastemel M, Banaszynski LA (2019) Phosphorylation of histone H3.3 at serine 31 promotes p300 activity and enhancer acetylation. Nat Genet 51: 941–946 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Max T, Søgaard M, Svejstrup JQ (2007) Hyperphosphorylation of the C‐terminal repeat domain of RNA polymerase II facilitates dissociation of its complex with mediator. J Biol Chem 282: 14113–14120 [DOI] [PubMed] [Google Scholar]
- McLaughlin K, Flyamer IM, Thomson JP, Mjoseng HK, Shukla R, Williamson I, Grimes GR, Illingworth RS, Adams IR, Pennings S et al (2019) DNA methylation directs polycomb‐dependent 3D genome re‐organization in naive pluripotency. Cell Rep 29: 1974–1985.e6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Michel M, Demel C, Zacher B, Schwalb B, Krebs S, Blum H, Gagneur J, Cramer P (2017) TT‐seq captures enhancer landscapes immediately after T‐cell stimulation. Mol Syst Biol 13: 920 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Muhar M, Ebert A, Neumann T, Umkehrer C, Jude J, Wieshofer C, Rescheneder P, Lipp JJ, Herzog VA, Reichholf B et al (2018) SLAM‐seq defines direct gene‐regulatory functions of the BRD4‐MYC axis. Science 360: 800–805 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Muse GW, Gilchrist DA, Nechaev S, Shah R, Parker JS, Grissom SF, Zeitlinger J, Adelman K (2007) RNA polymerase is poised for activation across the genome. Nat Genet 39: 1507–1511 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mylonas C, Tessarz P (2018) Transcriptional repression by FACT is linked to regulation of chromatin accessibility at the promoter of ES cells. Life Sci Alliance 1: e201800085 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mylonas C, Lee C, Auld AL, Cisse II, Boyer LA (2021) A dual role for H2A.Z.1 in modulating the dynamics of RNA polymerase II initiation and elongation. Nat Struct Mol Biol 28: 435–442 [DOI] [PubMed] [Google Scholar]
- Nojima T, Gomes T, Grosso ARF, Kimura H, Dye MJ, Dhir S, Carmo‐Fonseca M, Proudfoot NJ (2015) Mammalian NET‐Seq reveals genome‐wide nascent transcription coupled to RNA processing. Cell 161: 526–540 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Novo CL, Javierre B‐M, Cairns J, Segonds‐Pichon A, Wingett SW, Freire‐Pritchett P, Furlan‐Magaril M, Schoenfelder S, Fraser P, Rugg‐Gunn PJ (2018) Long‐Range enhancer interactions are prevalent in mouse embryonic stem cells and are reorganized upon pluripotent state transition. Cell Rep 22: 2615–2627 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ostapcuk V, Mohn F, Carl SH, Basters A, Hess D, Iesmantavicius V, Lampersberger L, Flemr M, Pandey A, Thomä NH et al (2018) Activity‐dependent neuroprotective protein recruits HP1 and CHD4 to control lineage‐specifying genes. Nature 557: 739–743 [DOI] [PubMed] [Google Scholar]
- Peng T, Zhai Y, Atlasi Y, Ter Huurne M, Marks H, Stunnenberg HG, Megchelenbrink W (2020) STARR‐seq identifies active, chromatin‐masked, and dormant enhancers in pluripotent mouse embryonic stem cells. Genome Biol 21: 243 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pintacuda G, Wei G, Roustan C, Kirmizitas BA, Solcan N, Cerase A, Castello A, Mohammed S, Moindrot B, Nesterova TB et al (2017) hnRNPK recruits PCGF3/5‐PRC1 to the Xist RNA B‐repeat to establish polycomb‐mediated chromosomal silencing. Mol Cell 68: 955–969.e10 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Poleshko A, Shah PP, Gupta M, Babu A, Morley MP, Manderfield LJ, Ifkovits JL, Calderon D, Aghajanian H, Sierra‐Pagán JE et al (2017) Genome‐nuclear lamina interactions regulate cardiac stem cell lineage restriction. Cell 171: 573–587.e14 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rabani M, Levin JZ, Fan L, Adiconis X, Raychowdhury R, Garber M, Gnirke A, Nusbaum C, Hacohen N, Friedman N et al (2011) Metabolic labeling of RNA uncovers principles of RNA production and degradation dynamics in mammalian cells. Nat Biotechnol 29: 436–442 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sabari BR, Dall’Agnese A, Boija A, Klein IA, Coffey EL, Shrinivas K, Abraham BJ, Hannett NM, Zamudio AV, Manteiga JC et al (2018) Coactivator condensation at super‐enhancers links phase separation and gene control. Science 361: eaar3958 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sachs M, Onodera C, Blaschke K, Ebata KT, Song JS, Ramalho‐Santos M (2013) Bivalent chromatin marks developmental regulatory genes in the mouse embryonic germline in vivo . Cell Rep 3: 1777–1784 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sanchez GJ, Richmond PA, Bunker EN, Karman SS, Azofeifa J, Garnett AT, Xu Q, Wheeler GE, Toomey CM, Zhang Q et al (2018) Genome‐wide dose‐dependent inhibition of histone deacetylases studies reveal their roles in enhancer remodeling and suppression of oncogenic super‐enhancers. Nucleic Acids Res 46: 1756–1776 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schoenfelder S, Fraser P (2019) Long‐range enhancer‐promoter contacts in gene expression control. Nat Rev Genet 20: 437–455 [DOI] [PubMed] [Google Scholar]
- Schofield JA, Duffy EE, Kiefer L, Sullivan MC, Simon MD (2018) TimeLapse‐seq: adding a temporal dimension to RNA sequencing through nucleoside recoding. Nat Methods 15: 221–225 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schwalb B, Michel M, Zacher B, Frühauf K, Demel C, Tresch A, Gagneur J, Cramer P (2016) TT‐seq maps the human transient transcriptome. Science 352: 1225–1228 [DOI] [PubMed] [Google Scholar]
- Schwanhäusser B, Busse D, Li N, Dittmar G, Schuchhardt J, Wolf J, Chen W, Selbach M (2011) Global quantification of mammalian gene expression control. Nature 473: 337–342 [DOI] [PubMed] [Google Scholar]
- Sheridan RM, Fong N, D’Alessandro A, Bentley DL (2019) Widespread backtracking by RNA pol II is a major effector of gene activation, 5’ pause release, termination, and transcription elongation rate. Mol Cell 73: 107–118.e4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Siepel A, Bejerano G, Pedersen JS, Hinrichs AS, Hou M, Rosenbloom K, Clawson H, Spieth J, Hillier LW, Richards S et al (2005) Evolutionarily conserved elements in vertebrate, insect, worm, and yeast genomes. Genome Res 15: 1034–1050 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stavridis MP, Lunn JS, Collins BJ, Storey KG (2007) A discrete period of FGF‐induced Erk1/2 signalling is required for vertebrate neural specification. Development 134: 2889–2894 [DOI] [PubMed] [Google Scholar]
- Steurer B, Janssens RC, Geverts B, Geijer ME, Wienholz F, Theil AF, Chang J, Dealy S, Pothof J, van Cappellen WA et al (2018) Live‐cell analysis of endogenous GFP‐RPB1 uncovers rapid turnover of initiating and promoter‐paused RNA Polymerase II. Proc Natl Acad Sci USA 115: E4368–E4376 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Surface LE, Fields PA, Subramanian V, Behmer R, Udeshi N, Peach SE, Carr SA , Jaffe JD, Boyer LA (2016) H2A.Z.1 monoubiquitylation antagonizes BRD2 to maintain poised chromatin in escs. Cell Rep 14: 1142–1155 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Todaro F, Campolo F, Barrios F, Pellegrini M, Di Cesare S, Tessarollo L, Rossi P, Jannini EA, Dolci S (2019) Regulation of kit expression in early mouse embryos and ES cells. Stem Cells 37: 332–344 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tuck AC, Natarajan KN, Rice GM, Borawski J, Mohn F, Rankova A, Flemr M, Wenger A, Nutiu R, Teichmann S et al (2018) Distinctive features of lincRNA gene expression suggest widespread RNA‐independent functions. Life Sci Alliance 1: e201800124 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Veloso A, Kirkconnell KS, Magnuson B, Biewen B, Paulsen MT, Wilson TE, Ljungman M (2014) Rate of elongation by RNA polymerase II is associated with specific gene features and epigenetic modifications. Genome Res 24: 896–905 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vierstra J, Rynes E, Sandstrom R, Zhang M, Canfield T, Hansen RS, Stehling‐Sun S, Sabo PJ, Byron R, Humbert R et al (2014) Mouse regulatory DNA landscapes reveal global principles of cis‐regulatory evolution. Science 346: 1007–1012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Walter M, Teissandier A, Pérez‐Palacios R, Bourc'his D (2016) An epigenetic switch ensures transposon repression upon dynamic loss of DNA methylation in embryonic stem cells. eLife 5: e11418 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang L, Zhou YU, Xu L, Xiao R, Lu X, Chen L, Chong J, Li H, He C, Fu X‐D et al (2015) Molecular basis for 5‐carboxycytosine recognition by RNA polymerase II elongation complex. Nature 523: 621–625 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weinberg DN, Papillon‐Cavanagh S, Chen H, Yue Y, Chen X, Rajagopalan KN, Horth C, McGuire JT, Xu X, Nikbakht H et al (2019) The histone mark H3K36me2 recruits DNMT3A and shapes the intergenic DNA methylation landscape. Nature 573: 281–286 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weintraub AS, Li CH, Zamudio AV, Sigova AA, Hannett NM, Day DS, Abraham BJ, Cohen MA, Nabet B, Buckley DL et al (2017) YY1 is a structural regulator of enhancer‐promoter loops. Cell 171: 1573–1588.e28 [DOI] [PMC free article] [PubMed] [Google Scholar]
- West S, Gromak N, Proudfoot NJ (2004) Human 5’ –> 3’ exonuclease Xrn2 promotes transcription termination at co‐transcriptional cleavage sites. Nature 432: 522–525 [DOI] [PubMed] [Google Scholar]
- Wray J, Kalkan T, Gomez‐Lopez S, Eckardt D, Cook A, Kemler R, Smith A (2011) Inhibition of glycogen synthase kinase‐3 alleviates Tcf3 repression of the pluripotency network and increases embryonic stem cell resistance to differentiation. Nat Cell Biol 13: 838–845 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu TP, Wang T, Seetin MG, Lai Y, Zhu S, Lin K, Liu Y, Byrum SD, Mackintosh SG, Zhong M et al (2016) DNA methylation on N(6)‐adenine in mammalian embryonic stem cells. Nature 532: 329–333 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yao M, Zhou X, Zhou J, Gong S, Hu G, Li J, Huang K, Lai P, Shi G, Hutchins AP et al (2018) PCGF5 is required for neural differentiation of embryonic stem cells. Nat Commun 9: 1463 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ying Q‐L, Wray J, Nichols J, Batlle‐Morera L, Doble B, Woodgett J, Cohen P, Smith A (2008) The ground state of embryonic stem cell self‐renewal. Nature 453: 519–523 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang T, Zhang Z, Dong Q, Xiong J, Zhu B (2020) Histone H3K27 acetylation is dispensable for enhancer activity in mouse embryonic stem cells. Genome Biol 21: 45 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhao YI, Li H, Fang S, Kang Y, Wu W, Hao Y, Li Z, Bu D, Sun N, Zhang MQ et al (2016) NONCODE 2016: an informative and valuable data source of long non‐coding RNAs. Nucleic Acids Res 44: D203–D208 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Żylicz JJ, Bousard A, Žumer K, Dossin F, Mohammad E, da Rocha ST, Schwalb B, Syx L, Dingli F, Loew D et al (2019) The implication of early chromatin changes in X chromosome inactivation. Cell 176: 182–197.e23 [DOI] [PMC free article] [PubMed] [Google Scholar]
