Abstract
RNA-binding proteins play a key role in shaping gene expression profiles during stress, however, little is known about the dynamic nature of these interactions and how this influences the kinetics of gene expression. To address this, we developed kinetic cross-linking and analysis of cDNAs (χCRAC), an ultraviolet cross-linking method that enabled us to quantitatively measure the dynamics of protein–RNA interactions in vivo on a minute time-scale. Here, using χCRAC we measure the global RNA-binding dynamics of the yeast transcription termination factor Nab3 in response to glucose starvation. These measurements reveal rapid changes in protein–RNA interactions within 1 min following stress imposition. Changes in Nab3 binding are largely independent of alterations in transcription rate during the early stages of stress response, indicating orthogonal transcriptional control mechanisms. We also uncover a function for Nab3 in dampening expression of stress-responsive genes. χCRAC has the potential to greatly enhance our understanding of in vivo dynamics of protein–RNA interactions.
Protein RNA interactions are dynamic and regulated in response to environmental changes. Here the authors describe ‘kinetic CRAC’, an approach that allows time resolved analyses of protein RNA interactions with minute time point resolution and apply it to gain insight into the function of the RNA-binding protein Nab3.
Introduction
RNA-binding proteins (RBPs) control almost all aspects of gene expression, including the stability of the RNA, its structure, the rate at which the RNA is processed, how efficiently it is translated and its subcellular localization. Not surprisingly, because of these important functions, RBPs are often found associated with many diverse genetic and somatic diseases, including muscular disorders, autoimmune diseases, and cancer1. RBPs also play a very important role in adapting to dynamic environments, such as those encountered by microbes when exposed to stress. Survival under stress is contingent on the ability to rapidly reprogram gene expression and, while this ability has been largely attributed to the activity of transcription factors, it is becoming increasingly clear that RBPs also play a primary role in shaping gene expression response profiles by modulating RNA processing and decay2. RBPs involved in RNA decay are believed to play an important role during the first few minutes of the adaptation response during which major transcriptional reprogramming events happen3, 4. However, direct measurement of protein–RNA interactions during these early stages has so far proved elusive. Consequently, little is known about the contribution of individual RNA decay factors during rapid rewiring of the gene expression program in response to environmental changes.
In recent years, ultraviolet (UV) cross-linking and immunoprecipitation (CLIP) followed by deep sequencing has emerged as the main technology to map protein–RNA interactions in vivo5. UV-irradiation is used to forge covalent bonds (cross-links) between proteins and directly bound RNAs. Proteins of interest are then purified under stringent conditions and high-throughput sequencing of the cross-linked RNA enables mapping of the interaction sites. A number of CLIP-related techniques have been developed over the years, such as CRAC (cross-linking and analysis of cDNAs), iCLIP, and PAR-CLIP (photoactivatable ribonucleoside-enhanced crosslinking and immunoprecipitation)6–8. For CRAC the protein of interest is fused to a tandem affinity purification tag (HTP; His6-TEV-ProtA) to enable purification of cross-linked RNAs under completely denaturing conditions6. Recent advancements have enhanced the efficiency of the library preparation, increased the data complexity and improved the resolution of RNA-binding site detection7–11. Despite such advances, current protocols are ill-suited to quantitatively measure dynamic changes in protein–RNA interactions. Using current commercially available UV-irradiation equipment, the cross-linking step can take up to 30 min to reach the desired dose (depending on organism and wavelength)12–14. This limitation rules out measurements of the early stress responses, which can happen on the minute time scale3, 4. In addition, due to prolonged UV-irradiation, cells are exposed to major additional stresses, such as DNA damage, which can confound the results and insert a bias toward RNA transcripts that are specific for the irradiation conditions.
To tackle these problems, we have improved the original CRAC protocol and developed a UV-irradiation device that cross-links proteins to RNA in vivo in seconds. These advancements enabled us to perform quantitative time-resolved in vivo measurements of direct protein–RNA interactions at 1-min time-point resolution. We refer to this method as kinetic CRAC (χCRAC).
We have applied χCRAC to glucose-deprived Saccharomyces cerevisiae to investigate the dynamic interactions of the RBP Nab3 during the adaptation process. Nab3 is a component of the Nrd1-Nab3-Sen1 (NNS) transcription termination complex that is involved in degradation of diverse classes of lnc-RNAs, such as cryptic unstable transcripts (CUTs), Nrd1 unterminated transcripts, and various messenger RNAs (mRNAs) and in the maturation of snoRNAs15, 16. Depriving yeast of glucose results in significant changes in mRNA levels and a transcriptome-wide redistribution of NNS components17–20. χCRAC accurately detected these widespread changes in Nab3 binding; importantly, the high temporal resolution enabled us to document transient changes in cross-linking of Nab3 to many transcripts, indicating a potentially pervasive importance of termination factors in the early stages of stress response. We also uncover a role for Nab3 in regulating the expression kinetics of stress-responsive genes and found that Nab3 is required for suppression of retrotransposon transcription during late stages of the glucose deprivation response. This suggests that Nab3 could play an important role in maintaining genome integrity during stress.
Results
Very fast protein–RNA cross-linking in vivo
To establish χCRAC, we developed a UV-irradiation apparatus (Vari-X-linker) to improve the in vivo protein–RNA cross-linking efficiency (Supplementary Fig. 1; see Methods for a more detailed description of the apparatus). To test the effectiveness of the Vari-X-linker, we performed CRAC experiments on yeast strains expressing HTP-tagged (His6-TEV-ProtA) Nab3 that were UV-irradiated in the Vari-X-linker or in the Megatron, which (to the best of our knowledge) is currently the most efficient UV cross-linker on the market for cross-linking cell cultures12. Our tests with the Vari-X-linker’s 254 nm lamps showed that it can cross-link yeast proteins to RNA in seconds, up to tenfold more efficiently than the Megatron (Fig. 1a). Combined with a cell filtration device that we developed, it is possible to cross-link cells and harvest 1 L of cells in ~1 min. We also tested the Vari-X-linker standard lamps on an E. coli strain expressing a His6-TEV-FLAG-tagged Hfq protein21, which showed a sevenfold improvement in cross-linking time (Fig. 1b). The Vari-X-linker can also be used with 365 nm lamps (350 W) to perform PAR-CLIP experiments, providing a high cross-linking efficiency after 2 min of UV-irradiation at considerably lower 4-thio-Uracil concentrations and shorter labeling times (see Methods for more details) (Fig. 1c).
Thus, we can perform time-resolved (PAR-)CLIP/CRAC experiments on very short time-scales, enabling the measurement of dynamic protein–RNA interactions in living cells with high temporal resolution.
Differential expression analysis of Nab3 CRAC data generated using the Megatron and the Vari-X-linker revealed significant differences between the two UV-irradiation conditions (Fig. 2a, DESeq222; adjusted p-values < = 0.05). The Vari-X-linker data were more highly enriched for short-lived lncRNA species (Stable Uncharacterized Transcripts (SUTs), Xrn1 Unstable Transcripts (XUTs), Cryptic Unstable Transcripts (CUTs), and anti-sense transcripts) (Fig. 2b). This suggests that very short UV-irradiation times significantly improve the recovery of these unstable lncRNAs, as shown for two known CUTs that originate from the Nrd1 and Pho84 genes (Fig. 2c,d)23, 24. The ~300 protein-coding genes enriched in the Megatron data (Fig. 2b) were highly enriched for genes that are upregulated during DNA damage (Fig. 2e,f; FDR < 0.01). Although the steady state levels of retrotransposons did not significantly change during the 100 s UV-irradiation in the Megatron (Supplementary Fig. 2), the DESeq2 analyses revealed significantly higher cross-linking of Nab3 to these transcripts, suggesting that Nab3 actively targets these transcripts during long UV-irradiation times (Fig. 2e,f). Additionally, we also detected a significant enrichment of almost all transfer RNAs (tRNAs) in the Megatron data, which we believe reflects Nab3-dependent degradation of tRNAs that accumulate in the nucleus during the DNA damage response25. While these data suggest a role for the NNS complex in regulating DNA damage response and suppressing retrotransposon transcription (see below), it also illustrates that long UV-irradiation times increase the likelihood of detecting alterations in transcription that are the result of the activation of the DNA damage response.
Monitoring in vivo dynamics of protein–RNA interactions
Yeast cells deprived of glucose redistribute NNS components over the transcriptome17. Therefore, to test the feasibility of our χCRAC method, we measured changes in Nab3 cross-linking during glucose deprivation. Because Nab3 co-transcriptionally binds RNA, we also performed χCRAC on RNA polymerase II using a strain expressing an HTP-tagged Rpo21 subunit26. This enabled us to determine how well Nab3 binding correlates with changes in Pol II transcription. To measure changes in steady-state RNA levels, we performed RNA-Seq on ribosomal RNA-depleted total RNA. We devised a simple experimental set-up that would enable us to rapidly shift cells to a new medium (Fig. 3a). Cells were grown to exponential phase in glucose medium after which a fraction of the cells were harvested (t = 0 time-point). The rest was rapidly harvested by filtration and transferred to a flask with medium lacking glucose. After the shift, cells were cross-linked at various time-points. To control for changes in gene expression caused by the filtration process, we performed experiments where filtered cells were transferred back to glucose containing medium. To accurately quantify differences in cross-linking between time-points, we made several improvements to the original CRAC protocol6 to reduce sequence representation biases (Supplementary Fig. 3) and to improve the preparation of complementary DNA (cDNA) libraries (see Methods). After resolving the purified protein–RNA complexes on SDS-polyacrylamide gel electrophoresis (PAGE) gels, they were transferred to nitrocellulose (Fig. 3b). Western blotting was performed to assess the efficiency of protein recovery after the nickel purification steps (Fig. 3b). RNA from each sample was ligated to 5′ adapters with unique barcodes (Supplementary Table 2). To reduce technical noise, cross-linked RNAs from all the time-points were pooled by extracting RNA from a single membrane slice containing the radioactive signal just above the main bands (Fig. 3b, red dashed rectangle) from which a single cDNA library was generated.
Statistical analyses of biological replicates revealed that χCRAC generates highly reproducible results (Fig. 3c, Supplementary Fig. 4). The early (1–4) min time-points were also highly correlated (Fig. 3c), followed by a sharp drop in correlation coefficients, suggesting that major changes in cross-linking profiles take place shortly after the first 4 min of glucose deprivation.
To identify transcripts that showed significant differential cross-linking profiles between the control (glucose to glucose) and treated (glucose to no glucose), we fitted a Gaussian process (GP) regression model to both time series27, 28 to compute the likelihood that the control and treated originated from different profiles (see Methods; Fig. 3d).
Finally, to validate our findings we used the anchor-away system29. By tagging Nab3 with the FKBP12-rapamycin-binding (FRB) domain in the anchor-away strain, we were able to rapidly and effectively deplete Nab3 from the nucleus by adding rapamycin to the glucose medium 1 h before shifting the cells to medium lacking glucose (see Methods, Supplementary note 1, Supplementary Figs. 5 and 6).
χCRAC provides insights into transcription kinetics
GP analyses identified 2431 Pol II transcripts that showed significant changes in Pol II cross-linking profiles after the shift to medium lacking glucose (Bayes Factor > 10 supporting different response dynamics (see Methods)). The largest changes were observed in snoRNAs, protein-coding genes and anti-sense RNAs (Fig. 4a). To determine how well our data agrees with previous transcriptome-wide studies, we analyzed the Pol II cross-linking profiles for protein-coding genes in more detail. For the majority of protein-coding genes the changes in profiles could be summarized into 4 K-means clusters (Fig. 4b). Genes in all four clusters were enriched for distinct GO-terms. Cluster 0 is highly enriched for genes from the Ribiregulon (ribosomal proteins (r-proteins) and ribosome assembly factors) that are downregulated during glucose deprivation. Cluster 1 is enriched for genes involved in DNA metabolism and transposition. Clusters 2 and 3 contain many stress responsive genes and genes involved in respiration that are known to be upregulated during glucose starvation18, 20. These results are in excellent agreement with previous studies18, 20, demonstrating that Pol II χCRAC can accurately measure changes in gene expression at high temporal resolution.
Another reason for performing Pol II χCRAC studies, was to determine whether the data could potentially be used to develop statistical models for estimating RNA half-lives or to generate mechanistic models for RNA transcription and processing during stress. As a first step in this direction, we asked how well the Pol II mRNA χCRAC data (Fig. 4c, y-axis) correlated with changes in the levels of total mRNA as measured by RNASeq at each time point (Fig. 4c, x-axis). Only at the 0 (glucose) and late 30 and 40-min (no glucose) time-points a highly positive correlation between changes in total RNA levels and changes in Pol II transcription (Pearson’s R = 0.63 to 0.68; p-values < 0.01) was observed. These results suggest that it takes about 30–40 min to adjust total RNA levels to mirror Pol II transcription levels. After about 14 min of glucose deprivation ~60% of the transcriptionally upregulated genes also showed a comparable increase in total RNA levels. This percentage only marginally increased at later time-points, (Fig. 4d, red line). This indicates that transcription regulation plays a dominant role during the first 8 min of the glucose deprivation response. In contrast, many genes with decreasing transcription levels only showed a similar decrease in total RNA levels during late stages of the adaptation response, suggesting that the adjustment of steady-state RNA levels for these genes is relatively slow (Fig. 4c and blue line in Fig. 4d). This especially was the case for r-protein coding genes: We observed that transcription of most r-proteins was reduced to basal level already 8 min after the shift, whereas total RNA levels of most r-protein transcripts decreased more slowly (Fig. 4e). The average mRNA half-life of r-protein coding transcripts during rapid glucose removal was estimated to be around 16 min30, which is consistent with the slow decrease in total mRNA levels that we observed during the adaptation response. Interestingly, although both RPL41A and RPL41B were downregulated on the transcriptional level, total mRNA level of these transcripts increased during glucose starvation (Fig. 4e).
Collectively, our data indicate that during glucose deprivation the bulk of the transcriptional changes take place within the first 8 min and that degradation of transcripts from downregulated genes could be a rate-limiting step during the adaptation process.
Nab3-RNA interaction dynamics during glucose starvation
We detected differential cross-linking of Nab3 to over 4100 transcripts (~37% of all features) during glucose deprivation. Using K-means clustering, we divided the Nab3 cross-linking profiles of the differentially bound transcripts into four clusters (Fig. 5a). Interestingly, both clusters 2 and 3 show a very rapid increase in Nab3 cross-linking during the first few minutes of the adaptation response. Clusters 1 and 2 also indicated transient changes in Nab3 cross-linking during the first 8 min of glucose deprivation. These data suggest that Nab3 binding very rapidly changes during glucose deprivation and is dynamic. Cluster 0 contains transcripts that generally show a decrease in Nab3 binding. About three quarters of the transcripts in this cluster are ncRNAs (XUTs, CUTs, SUTs, anti-sense RNAs, and snoRNAs), suggesting that Nab3 binding to this class of transcripts decreases during glucose starvation (Fig. 5b). Cluster 3 genes showed an increase in Nab3 binding during the time-course. These generally were underrepresented in ncRNAs, but contained the largest group of protein-coding genes and a number of tRNAs. The vast majority of the reads mapping to the 3′ end of these tRNAs contained CCA trinucleotides, suggesting that they are mature transcripts.
We next asked how well these changes in Nab3 cross-linking correlated with changes in Pol II transcription (Fig. 5c). The Nab3 and Pol II glucose χCRAC data (t = 0) were highly positively correlated, which is consistent with the co-transcriptional binding of Nab3 to the nascent transcript31, 32. However, 1 min after the shift to medium lacking glucose, the Nab3 and Pol II data decoupled, suggesting rapid changes in Nab3 binding that were independent of alterations in Pol II transcription. Thus, during early stages of glucose deprivation, binding of Nab3 to nascent transcripts might be regulated by additional factors, providing an additional level of control, which is largely orthogonal to transcriptional regulation. At later time-points, however, the correlation between the data sets improved, indicating that the cells have started to adjust to their new environment. Consistent with this, comparison of Nab3 and Pol II χCRAC profiles revealed that at later time-points Nab3-binding profiles generally followed Pol II transcription (Fig. 5d, groups 2–3). Notably, group 1 contained many genes involved in the heat-shock response, transmembrane sugar transporters and glycolysis/gluconeogenesis. Group 2 is enriched for genes involved in the oxidative stress response and starch/sucrose metabolism, hinting at a role for Nab3 in regulating the oxidative stress response.
As many yeast promoters are intrinsically bi-directional24, 33, the induction of the group 1 and 2 genes during glucose starvation frequently resulted in the appearance of divergent anti-sense transcripts (Fig. 5e), which are also bound by Nab3. Two examples are CUT075 (Fig. 5d, group 2) and Unit247/CUT246 (Fig. 5d, group 1) that are readily detected upon the induction of the heat-shock proteins SSA2 and HSP78 (both group 3 genes; Supplementary Fig. 7). Unit247/CUT246 and CUT075 are anti-sense to POM33 and YAP6, respectively. Interestingly, our Nab3-depletion data suggests that Nab3 prematurely terminates these transcripts, preventing the polymerases from reaching the 5′ ends of POM33 and YAP6, which could result in silencing of these genes15.
For a number of the downregulated genes we observed a transient increase in Nab3 cross-linking and a reduction in Pol II transcription (Fig. 5d, group 3). Two examples are shown in Fig. 5f,g. Transcription of the ILV5 and RPP0 genes decreased almost linearly during glucose starvation (Fig. 5f,g, top two graphs), however, Nab3 cross-linking reproducibly increased about 2–3-fold during the first 4 to 8 min (Fig. 5f,g, top two graphs). These examples demonstrate that χCRAC can detect rapid changes in protein–RNA interactions at very high temporal resolution.
In cis changes in Nab3 binding during glucose deprivation
Nab3-binding profiles within transcripts also changed for many genes during glucose starvation (Fig. 6). As anticipated, the majority of the Nab3 cross-linking peaks identified in the glucose data (t = 0) clustered near the 5′end of protein-coding genes where Nab3 is known to act31, 32, 34 (Fig. 6a). However, 14 min after the shift to medium lacking glucose, the binding pattern of Nab3 appeared to spread more into the coding sequence (Fig. 6b). The Nab3 peak distribution plot in Fig. 6c confirmed that the Nab3-binding site distribution in the no-glucose data was significantly different from the glucose data (two sample Kolmogorov–Smirnov test; p-value < 10−5). A striking example was the enolase (ENO1) gene, an enzyme involved in gluconeogenesis and glycolysis (Fig. 6d), which is strongly upregulated during glucose deprivation (Fig. 6d, top panel). In the glucose to glucose Nab3 control data (Fig. 6d, Nab3 (glu to glu)) we mainly observed three Nab3 peaks in the 5′UTR of ENO1 that overlapped with two CUTs. In the no glucose data (Fig. 6d, Nab3 glu to noglu) the main Nab3 peaks were located further downstream in the coding sequence. This transition happens very quickly: already after the first few minutes of glucose starvation we see a change in the intensity of Nab3 binding at various sites in ENO1 (Fig. 6d).
These results demonstrate that χCRAC is also capable of detecting rapid in cis changes in protein–RNA interactions.
We hypothesized that this redistribution of Nab3 in ENO1 could be linked to the use of alternative TSSs when cells are grown in glucose. Such a mechanism is sometimes employed to regulate expression of genes encoding metabolic proteins, such as IMD2 35, 36, under specific conditions. To test whether expression of ENO1 is controlled by a similar mechanism, we analyzed Cap-binding protein (Cbp1) CRAC37, ChIP-Seq38 and TIF-Seq39 data to identify transcription start sites and transcript isoforms, respectively. All data sets show that transcription can initiate upstream of the ENO1 TATA box (Supplementary Fig. 8a–c).The high ChIP signal near the CUT TSS indicates that CUT transcription is regulated by TFIID. All the available data indicate that CUT transcription is driven by a different promoter. Relative to the orthologous ENO2, formation of transcription initiation complexes at the ENO1 promoter appears to be inefficient, and we speculate that this is partly the result of transcriptional interference from the upstream CUT (Supplementary Fig. 8c). Transcription of this CUT probably terminates between the ENO1 TATA box and TSS as high levels of Nab3 cross-linking was detected in this region (Supplementary Fig. 8a). Indeed, analysis of reads containing non-encoded oligo-A tails, which are a hallmark for NNS-exosome degradation40, revealed many degradation intermediates that overlapped with the Nab3 cross-linking sites (Fig. 6d panel Oligo-A tails), but mainly in cells grown in glucose. Nab3 depletion resulted in a ~8-fold increase in the CUT levels, confirming that Nab3 binding triggers the degradation of the CUT. However, overall the expression levels did not change during the time-course (Fig. 6e). Relative to ACT1, ENO1 mRNA levels in glucose were low and Nab3-depleted cells showed a modest increase in ENO1 quantitative reverse transcription-PCR (qRT-PCR) signal (Fig. 6f), possibly because CUT transcripts no longer terminate at the Nab3-binding sites, and less termination in the ENO1 coding sequence (Fig. 6f, left plot). However, Nab3 depletion did not significantly affect gene expression levels of ENO1 during glucose starvation (Fig. 6f).
We speculate that the upstream CUT helps to suppress transcription initiation at the ENO1 transcription start site (TSS) when cells are grown in glucose (also see Discussion and Supplementary note 2).
Nab3 dampens the expression of stress-responsive genes
Our data analyses (Fig. 5b,d) revealed that almost a quarter of the protein-coding transcripts are differentially bound by Nab3 during glucose starvation. We hypothesized that Nab3 could play a role in regulating the kinetics of these genes during stress. To test this model, we again employed the anchor-away method to deplete Nab3 from the nucleus and asked how this globally affected Pol II transcription. To identify Nab3-regulated protein-coding genes we calculated Pol II escape indices (EI41; Fig. 7a) that are a measure of changes in Pol II distribution over the gene upon rapamycin treatment. We assumed that genes that are tightly controlled by Nab3 would show high read densities near Nab3-binding sites in the promoter proximal region as a result of Pol II pausing. Upon Nab3 depletion, we expected that these “pileups” would largely dissolve, leading to an increased Pol II density over the body of the gene (see example in Supplementary Fig. 6b). Thus, genes with an EI > 1 are potentially regulated by Nab3. To reduce noise, we only considered highly expressed genes with an EI > = 2 that showed at least a 1.5 increase in Pol II transcription upon rapamycin treatment (see Methods for more details; Fig. 7b, top-right red quadrant).
Five of the 14 genes selected from the glucose data (Supplementary Table 4) were previously shown to be regulated by NNS, including SER3, IMD3, NRD1, and URA8 and four others (FLX1, HEM4, TRS31, and SEN2) were picked up as a result of snoRNA read-through. We identified a number of new genes that are regulated by Nab3-dependent attenuation during glucose deprivation (Fig. 7b; middle and right plot). These include the aldolase GRE3, the mitochondrial copper transporter PIC2 (Fig. 5 group 2), the maltose fermentation regulatory protein MAL33 and the small GTPase RHO5, all of which were upregulated during glucose starvation. Comparison of the Pol II χCRAC profiles of the Nab3-depleted and control (ethanol treated) data for these genes showed a clear accumulation of Pol II around the Nab3 cross-linking sites in the ethanol-treated cells and little transcription downstream, indicative of Pol II pausing and termination (Supplementary Fig. 9b–e). In the rapamycin-treated cells more Pol II could be detected in the body of these genes. Importantly, rapamycin treatment of the anchor-away strain expressing Nab3 without the FRB domain did not noticeably affect the Pol II transcription profiles of these genes (Supplementary Figs 6b,10), demonstrating that the observed changes in Pol II distribution is a direct result of Nab3-FRB depletion from the nucleus.
To substantiate those results, we performed qRT-PCRs on total RNA isolated from cells treated with rapamycin or the solvent (ethanol), focusing on MAL33 and PIC2 (Fig. 5d; group 2) (Fig. 7c). As positive controls we analyzed the levels of IMD3 (Fig. 5d; group 3) and NRD1, two genes known to be regulated by Nab3-dependent attenuation16, 23. As negative controls we selected three genes (LST8, GLK1, and YBR085C-A; blue dots in Fig. 7b) that showed both a high increase in Nab3 binding and Pol II transcription in glucose-deprived cells (Fig. 5d groups 1 and 2), however, based on the calculated EIs were less likely to be affected by Nab3 binding. Although nuclear depletion of Nab3 resulted in only a modest increase in Pol II cross-linking of IMD3, NRD1, PIC2. and MAL33 (generally less than twofold), total RNA levels increased quite dramatically (Fig. 7c). This suggests that in the absence of Nab3 a substantially higher number of polymerases reach the 3′ end of these genes and are terminated by the canonical cleavage and polyadenylation machinery. After 1 h of rapamycin-treatment total mRNA levels of IMD3 and NRD1 increased about fivefold in glucose, consistent with a role for Nab3 in terminating transcription of these genes16, 23. In Nab3-depleted cells, IMD3 transcription and total RNA levels cells were generally higher throughout the time-course, suggesting that Nab3 is important for repressing IMD3 expression in glucose and during glucose starvation (Supplementary Fig. 9b; Fig. 7c). In contrast, PIC2 and MAL33 mRNA levels only increased in glucose-deprived cells treated with rapamycin (Fig. 7c). Thus, PIC2 and MAL33 are clear examples of stress-specific Nab3 targets. Except for LST8 (Welch’s t-test; p-value < 0.01), nuclear depletion of Nab3 did not significantly alter total mRNA levels of the control group genes under normal or stress-conditions (Fig. 7d, Supplementary Fig. 9f).
The observation that Nab3-depletion did not affect YBR085C-A gene expression levels was surprising given that we detected a strong increase in Nab3 cross-linking near the 5′ end of the transcript (Fig. 8a,b) and identified hundreds of oligo-A-tailed reads in the sequencing data (Fig. 8a), strongly suggesting that the NNS terminates YBR085C-A transcription in glucose-deprived cells. We, therefore, engineered a strain in which the Nab3 and Nrd1 motifs in the 5′ region of YBR085C-A were mutated (without affecting the amino-acid sequence) (Fig. 8c). Quantitative RT-PCR analyses revealed that although the difference in mRNA levels between the mutant and the wild-type gene was always less than twofold during the 20-min time-course, the mutant was upregulated faster than the wild-type gene during glucose starvation, demonstrating a role for the NNS in regulating the kinetics of YBR085C-A expression (Fig. 8d). Thus, we predict that changes in transcription kinetics induced by NNS-dependent termination is more widespread than the Nab3 anchor away depletion data would suggest.
We conclude that Nab3 controls the induction kinetics as well as the maximum mRNA expression levels of stress-responsive genes during glucose deprivation.
Nab3 suppresses retrotransposon transcription during stress
We showed that prolonged UV-exposure substantially increased Nab3 cross-linking to Ty retrotransposon transcripts (Fig. 2b,e, and f). To investigate whether Nab3 regulates Ty retrotransposon expression, we measured their transcription and total RNA levels in the Nab3 anchor-away strains treated with ethanol or rapamycin. Yeast expresses five different classes of Ty retrotransposons (Ty1 to Ty5)42. In line with transposon-abundance, very few reads mapped to Ty5, which was, therefore, not further considered. Consistent with our initial results (Fig. 4b), in the ethanol-treated cells we observed a transient increase in Pol II cross-linking to the highly abundant Ty1 and Ty2 retrotransposons (Fig. 9a,b). Rapamycin treatment did not affect Ty1 transcription kinetics during the first 8 min, however, at later time-points Pol II transcription was significantly higher (Fig. 9a; Welch’s t-test; p-value < 1.0×10−6). This suggests that Nab3 activity is required to suppress transcription of Ty1 transposable elements primarily during late stages of the glucose adaptation response. Consistent with this idea, Nab3 cross-linking to Ty1 was highest at the late time-points (Fig. 9c). Remarkably, Nab3 appears to control transcription of Ty2 retrotransposons more tightly; In Nab3-depleted cells Ty2 transcription was significantly higher in glucose medium (Welch’s t-test; p-value < 1.0×10−7) and continued to increase during the glucose deprivation response (Fig. 9b). Nab3 cross-linking to Ty2 transcripts was dynamic, peaking at 14 min after the medium shift (Fig. 9d). These data demonstrate how χCRAC can be used to measure alterations in Pol II transcription kinetics during changes in the environment or in mutant strains.
For the less-abundant Ty3 and Ty4 retrotransposons the pattern was noisy, however, we could detect an increase in Ty3 Pol II transcription during the last three time-points, indicating that Nab3 is also involved in regulating Ty3 expression (Supplementary Fig. 11). Quantitative RT-PCR analyses confirmed that Ty1 and Ty2 total RNA levels increased during the time-course (Fig. 9e,f), most significantly (Welch’s t-test; p-values < 0.01) at late stages of the adaptation response.
To ascertain why Nab3 appears to more tightly regulate Ty2 transcription, we next compared the Nab3 and Pol II cross-linking profiles over Ty1 and Ty2 retrotransposon genes in the Nab3-FRB anchor-away strain grown in glucose (t = 0) or deprived of glucose for 20 min, with and without rapamycin treatment (Fig. 9g,h). To normalize for the differences in Ty transcript lengths, we divided the reads over an equal number of bins (Fig. 9g,h, x-axis). Strikingly, Nab3 cross-linked primarily to a single region in Ty2 retrotransposons (Fig. 9h, panel II), whereas Nab3 cross-linking over Ty1 transcripts was more diffuse (Fig. 9g, panel II). Although Nab3 cross-linking to Ty1 and Ty2 transcripts increased over time during glucose deprivation (Fig. 9c,d), the cross-linking pattern did not dramatically change (Fig. 9g,h, compare panels II and III). Ty1 and Ty2 Pol II cross-linking profiles in cells grown in glucose were very similar, with the read densities roughly evenly distributed over the genes. Rapamycin treatment of cells in glucose only modestly increased the read density downstream of the main Nab3 peaks (Ty1 EI = 0.15; Ty2 EI = 0.37; Fig. 9g,h, compare panels IV and V). These data suggest that only a small fraction of Ty1 and Ty2 transcripts is terminated by Nab3 in glucose. However, 20 min after the shift to medium lacking glucose, Pol II cross-linking downstream of the Nab3 sites in both Ty1 and (in particular) Ty2 transcripts was substantially reduced (Fig. 9g,h, panel VI). Nab3 depletion by rapamycin treatment did not completely resolve the Pol II pileups near the Nab3 cross-linking sites (Fig. 9h, compare panels VI with VII), however, still a much higher fraction of reads was detected in downstream regions (Ty1 EI = 0.47, Ty2 EI = 1.0). These data support the notion that Nab3-dependent transcription termination is mostly active on retrotransposons during glucose starvation.
Collectively, these results demonstrate that Nab3 plays an important role in regulating the kinetics of retrotransposon gene expression during glucose deprivation.
Discussion
The methodological advances underpinning the development of χCRAC enabled us to glimpse the highly dynamic reprogramming of RBP–RNA interactions in response to stress. Our data reproducibly show widespread relocation of the yeast transcriptional termination factor Nab3 within a minute of the imposition of stress. Given the central role of RBPs in all aspects of RNA life, it is plausible that other RBPs show similar dynamic behaviors. Because χCRAC is generally applicable we anticipate that its use will enable future studies to unearth novel mechanistic insights into the function of RBPs during stress conditions.
Our results provide evidence that Nab3 “dampens” the induction of several stress-responsive genes during stress. What could be the benefit of this? One possibility is that Nab3 functions as part of a negative feedback loop that reduces noise in gene expression by preventing transcription levels from overshooting (Fig. 10a). Nab3 could also regulate the timing of expression of these stress-responsive genes, although both models are not mutually exclusive. Many of these stress-responsive genes are controlled by the same group of transcription factors. Although the benefits of expressing many stress-responsive genes simultaneously undoubtedly has advantages, in some cases it might be required that expression of certain genes is delayed (Fig. 10b) or only strongly induced when the stress signal has reached a certain amplitude. Such a dampening system would also reduce activation of gene expression due to false or noisy signals. In this respect, the role of Nab3 might be similar to that of the Set3c histone deacetylase, although the mechanism is different43. Deletion of Set3c induces the expression of certain genes much faster when cells are subjected to changes in carbon sources and, therefore, it was proposed that Set3c dictates the expression timing of these genes43.
We also uncovered a role of Nab3 in regulating the expression kinetics of Ty retrotransposons, which are upregulated during a variety of stress conditions44. Their expression needs to be carefully controlled as recombination between Ty elements can lead to chromosomal rearrangements, which are detrimental for gene expression and genome stability45. In Nab3-depleted cells, transcription of Ty1 and Ty2, in particular, is significantly upregulated at later stages of the glucose deprivation response and we observed a transient increase of Nab3 binding to Ty2 retrotransposons at late stages of the adaptation response. We propose that Nab3 activity contributes to stress adaptation by rapidly shutting down transcription of retrotransposons (Fig. 10c).
In many fungi, as in mammals, retrotransposon transcription is regulated by the RNAi machinery46, 47. S. cerevisiae, however, does not have RNAi components, and it is tempting to speculate that, in view of its poor conservation, the NNS complex, together with other factors, may have taken up the function of controlling the expression of retrotransposons.
We show that χCRAC can also measure rapid in cis changes in protein–RNA interactions in vivo. We demonstrate that Nab3 binding to the ENO1 transcript changes within the first few minutes of the glucose starvation response (Fig. 6). Follow-up analyses revealed that Nab3 binds a CUT that initiates upstream of the ENO1 TATA box and we predict that transcription of this CUT upstream of the ENO1 promoter helps to suppress ENO1 expression when cells are grown in glucose. Deciphering the mechanism of regulation of ENO1 expression is not trivial as this gene is controlled by many different transcription factors (See Supplementary note 2 and Supplementary Fig. 12). However, it is worth mentioning here that the upstream CUT initiates from an element referred to as the upstream repressor element (URS), which is a binding site for many transcription factors, such as Reb1 and Tye7 (Sgc1), and mediates ~20-fold repression of ENO1 in glucose48. Interestingly, this URS has directionality as reverting this element relieves inhibition of ENO1 expression when cells are grown on glucose49 (Supplementary Fig. 12). This begs the question whether transcription of the CUT is also reversed in this mutant.
There are many biological scenarios where measurements of in cis changes, as observed in ENO1, could shed light on RBP–RNA interaction dynamics. A major area of interest is the assembly of large macromolecular RNP complexes, such as the ribosome and the spliceosome, which involves dynamic interactions between many proteins and RNAs. It is likely that some assembly factors contact different sites on their RNA substrates or may not occupy all of their binding sites simultaneously during the assembly process, as is the case for ribosomal proteins during ribosome assembly50. We envision that χCRAC could be used to perform high-resolution time-resolved analyses of dynamic changes of protein–RNA interactions in RNP particles during their assembly in vivo. Such studies would require the development of protocols to synchronize the cells in a way that the assembly could be monitored from start to finish.
Another major potential area of application is the study of the kinetics of RNA expression, which is a balance of RNA transcription and degradation. Most studies use indirect methods to estimate RNA decay rates, which can rely on Pol II mutants, metabolic labeling or drugs to inactivate transcription51, 52. Although these studies have provided a wealth of interesting results, the data generated by these indirect approaches are not always highly correlated53. In general, model-based studies assume that RNA decay can be summarized by a single mRNA half-life for each transcript, corresponding to a simple exponential decay process52, 54, despite the complexity of RNA degradation pathways55. The highly dynamic behavior of the termination factor Nab3 indeed challenges this assumption. Since RNA degradation involves the activity of many nucleases, dissecting the dynamics of individual proteins is likely to be crucial for understanding how the rate of RNA decay is determined. We envision that χCRAC analyses on individual nucleases would enable us to directly measure such interactions, providing invaluable data to constrain and refine our understanding of the kinetics of gene expression.
Methods
The Vari-X-linker
The Vari-X-linker incorporates a number of new features that enhance the effectiveness of UV cross-linking. The sample is presented in a controlled 1 cm thick layer contained in a specially constructed UV transparent bag and flanked by two beds of powerful 254 nm (400–550 W) or 365 nm lamps (350 W) that were assembled on trays for easy exchange of the lamps. As far as we are aware, the Megatron is currently the fastest cross-linker available on the market for cross-linking proteins to RNA in actively growing cells12. Despite this, it still requires about 100 s to get good cross-linking yields with this machine (or more depending on the protein), which is not fast enough to do time-resolved analyses with minute time-point resolution. Another problem we faced with the Megatron system is that it was not trivial to cross-link cells when the lamps were at full output. As a consequence, cells would not always receive the same level of 254 nm UV intensity during the 100 s of UV-irradiation, resulting in variation in cross-linking efficiencies between samples and noise in the data. It was also not possible to control the temperature inside of the unit. Although the Megatron works well for normal cross-linking studies, these issues made it very difficult to perform time-resolved analyses during short periods. To overcome this, we incorporated a shutter system to allow the lamps to be at full power with stable, repeatable output before exposing the cells. A fan cooling system was installed to minimize thermal shock to the sample. With the Vari-X-linker, the lamps can be left on throughout the experiment and cells will only be exposed when the shutters are opened (Supplementary Fig. 1; shutter release). Using a vacuum pump the cells can be quickly extracted from the UV chamber. The bag in the UV-chamber can be exchanged for a tray that allows for the cross-linking of small volumes or adherent cells in petri-dishes. Cross-linking of adherent cells could be improved by growing the cells on UV-transparent plastic. Using a vacuum pump the cells can be quickly extracted from the UV chamber. We also developed a new filtration device that enables harvesting of 1 L of cells in ~30 s. The Vari-X-linker and the filtration device can be purchased from UVO3 (www.vari-x-link.com; sales@uvo3.co.uk).
Kinetic CRAC (χCRAC)
For an eight time-points time-course, 8 L of cells were grown in synthetic medium with glucose (SD-TRP) to exponential phase (OD600~0.5) at 30 °C. For time-point zero, 1 L of cells were cross-linked in the Vari-X-linker using the high-output 254 nm lamps for 12 s and then harvested by rapidly passing the cells through a 0.8 µm filter (Millipore) using a new vacuum filtration device (see above). The remaining 7 L of cells were harvested on filters and quickly resuspended in S-TRP (no glucose samples) or SD-TRP (glucose control samples) and maintained at 30 °C. For each time-point 1 L of cells were cross-linked in the Vari-X-linker and harvested by filtration as above. This yielded ~1 g of cells for each time-point.
For the PAR-CLIP experiments (Fig. 1c), for each condition 1 L of cells expressing Nab3-HTP were grown to exponential phase in SD-URA-TRP and incubated with 4-thio-Uracil for 5 min (final concentration = 20 µM). After labeling, the cells were rapidly harvested by filtration onto 0.8 µm membranes to remove the free 4-thio-Uracil and resuspended in SD-TRP before UV-irradiation at 365 nm. Removing the free 4-thio-Uracil greatly enhanced the cross-linking efficiency (data not shown).
Cells were lysed in 1 V/w of TN150 (50 mM Tris pH 8.0, 150 mM NaCl, 0.1% NP-40, 5 mM β-mercaptoethanol) and 3 V of Zirkonia beads (0.5 mm; Thistle Scientific) by vortexing the cells five times for 1 min, with a 1-min incubation on ice between each step. Three milliliter of lysis buffer was added and extracts were clarified by centrifugation (20 min at 4500×g and 20 min at 20,000×g at 4 °C). Extracts were incubated with 250 µl of equilibrated IgG Sepharose beads (GE Healthcare) for 2 h at 4 °C. Beads were washed three times 5 min with 10 ml of TN1000 (50 mM Tris pH 7.5, 0.1% Nonidet P-40, 5 mM β-mercaptoethanol, 1 M NaCl) and three times 5 min with TN150 (50 mM Tris pH 7.5, 0.1% Nonidet P-40, 5 mM β-mercaptoethanol, 150 mMNaCl).
For the Nab3 anchor-away nuclear depletion experiments29, cells were incubated with 1 µg/ml rapamycin (Sigma) for 1 h before the cells were shifted to medium lacking glucose (but supplemented with 1 µg/µl rapamycin).
For the Pol II (Rpo21-HTP) χCRAC experiments, cells were lysed in 1 V/w TMn150 (50 mM Tris-HCl pH 8.0, 10 mM MnCl2, 0.1% NP-40, 5 mM β-mercaptoethanol, 150 mM NaCl, Roche Midi protease inhibitors; 1 ml per gram of cells). Subsequently, 1 V/w of TMn150 was added containing 1U/ml of RQ1 RNase-free DNAse (Promega) and the suspension was incubated for half an hour on ice to degrade the chromatin.
For the Hfq cross-linking test (Fig. 1b), 0.5 L of bacteria were grown in Luria-Bertani medium (LB) to an OD600 of 0.4 and cross-linked in the Megatron and Vari-X-linker for the indicated times. To purify the E. coli Hfq-HTF, one gram of cells was lysed as described above. Extracts were incubated with 35 µl of anti-Flag magnetic beads (Sigma) for 2 h at 4 °C. Beads were washed three times 10 min with TN1000 and rinsed three times with TN150.
For the TEV cleavage step, beads (IgG or Flag) were resuspended in 600 µl of TN150 and incubated for 2 h with 10 µg of home-made GST-TEV protease at 18 °C. The TEV eluates were subsequently incubated with 0.1 unit of RNace-IT (Agilent) for 5 min at 37 °C after which 0.4 g of guanidine HCl (Sigma) was added to the TEV eluates to inactivate the RNAses. NaCl and Imidazole was added to a final concentration of 300 and 10 mM, respectively and the samples were incubated overnight with 50 µl of Nickel agarose beads (Qiagen) at 4 °C. Beads were transferred to a Snap Cap columns (Pierce), washed three times with 500 µl wash buffer I (50 mM Tris-pH 7.5, 6 M guanidium-HCl, 0.1% Nonidet P-40, 5 mM β-mercaptoethanol, 300 mM NaCl, 10 mM Imidazole) and three times with 1×PNK buffer (50 mM Tris pH 7.5, 0.1% Nonidet P-40, 5 mM β-mercaptoethanol, 10 mM MgCl2). Beads were subsequently incubated with 80 µl of 1xPNK buffer containing eight units of TSAP alkaline phosphatase (Promega) and 80 units of recombinant RNasin (Promega) for 1 h at 37 °C. After one 500 µl wash with wash buffer I and three 500 µl washes with 1xPNK buffer, the beads were resuspended in 80 µl of 3′ linker ligation mix (1xPNK buffer, App-PE 3′ adapter (see Supplementary Table 2; 0.6 µM final concentration), 10% PEG8000, 30 units of T4 RNA ligase 2 truncated K227Q (NEB), 60 units RNAsin (Promega)). The samples were incubated at 25 °C for 4–6 h. Following one 500 µl wash buffer I wash and three 1xPNK buffer washes, the beads were incubated with 60 µl of 5′end labeling mix (1xPNK buffer, 30µCi 32P-γATP (Perkin Elmer) and 30 units of T4 polynucleotide kinase (NEB)) for 40 min at 37 °C. ATP (Roche) was added to 1 mM final concentration, followed by another 20-min incubation at 37 °C. Beads were subsequently washed three times with 500 µl of wash buffer I and three times with 500 µl of 1xPNK buffer and incubated with 80 µl of 5′ linker ligation mix (1xPNK buffer, 10 mM ATP, 80 units RNAsin (Promega) 40 units of T4 RNA ligase 1 (NEB) and 5′ adapter (1.25 µM final concentration; see Supplementary Table 2) overnight at 16 °C. Beads were subsequently washed three times with 500 µl wash buffer I and three times with 500 µl of wash buffer II (50 mM Tris pH 7.5, 0.1% Nonidet P-40, 5 mM β-mercaptoethanol, 50 mM NaCl, 10 mM Imidazole). Proteins were eluted from the nickel beads using wash buffer II containing 250 mM Imidazole, TCA precipitated (20% final concentration) and resolved on 1 mm thick 4–12% NuPAGE gels (Thermo Fisher Scientific), transferred to nitrocellulose membranes and visualized by autoradiography. Bands corresponding to the size of the protein of interest, including a region ~ 1 cm above the band, were cut from the nitrocellulose membrane and pooled in a single 2 ml tube. Radiolabeled RNA was extracted by incubating the membrane slices with 200 µg of proteinase K in 800 µl of wash buffer II containing 1% SDS and 5 mM EDTA. The solution was transferred to a new tube and the RNA was subsequently phenol-chloroform-extracted and ethanol-precipitated. Reverse transcription with SuperScript III was performed as per the manufacturer’s procedures (Thermo Fisher Scientific) using the reverse transcription primer listed in Supplementary Table 2. The cDNAs were purified using the Zymo DNA Clean & Concentrator 5 kit and eluted into a final volume of 10 µl. Five microliter of cDNA was PCR amplified using Pfu polymerase (Promega) for 20–24 cycles (95 °C 30 s, 52 °C 30 s and 72 °C 1 min) using PCR oligonucleotides listed in Supplementary Table 2. PCR products were resolved on 2% Metaphor agarose gels (Lonza) and 160–300 bp fragments were gel purified using the miniElute kit (Qiagen) according to the manufacturer’s procedures. Paired-end sequencing (50 bp) was performed by Edinburgh Genomics using the IlluminaHiSeq 2500 and 4000 platforms. This improved the detection of high-confidence cross-linking induced mutations6. Following sequencing, samples were demultiplexed using the 5′ adapter barcode sequences and collapsed reads were mapped to the yeast genome. To improve T4 RNA ligation efficiencies, we added random nucleotides to adapter termini that ligate to the RNA (Supplementary Table 2). We found that there is a significant preference for specific donor–acceptor nucleotide combinations for both 5′ and 3′ T4 RNA ligase reactions (Supplementary Fig. 3).
The uncropped images for Figs. 1 and 3b are provided in Supplementary Figs. 13 and 14.
Procedures used for western, northern, qRT-PCR and a description of the yeast strains and media can be found in the Supplementary Methods.
Processing of raw sequencing data
Sequencing was performed on IlluminaHiSeq 2500 and HiSeq 4000 machine by our Edinburgh Genomics facility. The complete pipeline for the processing of paired-end kinetic CRAC data is available on https://bitbucket.org/sgrann/kinetic_crac_pipeline. The entire pipeline can be run using a single script (CRAC_pipeline_PE.py) that divides the tasks over multiple processors. The pipeline performs the following steps: demultiplexing of raw fastq files by pyBarcodeFilter.py version 2.3.3 from the pyCRAC tool suite56 (version 1.2.2.6). Flexbar57 then trims the reads and removes 3′ adapters sequences (Supplementary Table 2). Reads are then collapsed using random barcode information provided in the in-read barcodes using the pyCRAC tool pyFastqDupicateRemover.py (see Supplementary Table 2 for adapter sequences). Reads are then aligned to the reference sequence (yeast genome R64 in our case) using novoalign (www.novocraft.com) version 2.0.7 and those that mapped to multiple genomic regions were randomly distributed over each possible location. PyReadCounters then makes read count and fragments per kilobase transcript per million reads (FPKM) tables for each annotated genomic features. Only genes for which cross-linking could be detected in all time-points were considered. Genomic feature files were obtained from ENSEMBL (version R64-1-1.75). Coordinates for anti-sense transcripts, CUTs, XUTs, SUTs, and retrotransposons24, 33, 58, 59 were obtained from the Saccharomyces Genome Database (sgd; yeastgenome.org).
Identification of Nab3-binding sites and oligo-A reads
PyCalculateFDRs.py was used to find significantly enriched Nab3-binding peaks using default settings. Only peaks with at least five reads were considered and the minimum width of the peak interval was set to 20 nucleotides. Oligo-A reads were identified using blast and in-house perl and python scripts. PyBinCollector.py was used to generated the Nab3 and Pol II cross-linking distribution figures.
K-means clustering
K-means clustering of cross-linking profiles was performed using the STEM clustering program60. Only profiles were selected that showed a fold-change in FPKM of at least 1.5 and had a mean pairwise correlation over two biological replicates of 0.7.
Escape indices and selection of Nab3 attenuated genes
To calculate the Pol II transcription EI41, we first selected protein-coding genes that had a minimum coverage of 10 FPKM at the indicated time-points (0, 4 and 18 min after the induction of the glucose deprivation response). The EI was calculated by summing the nucleotide density in the promoter proximal region (PPR; −100 to + 250 from the 5′UTR) and dividing this number by the nucleotide density of the body region (+251 to 3′ end). Subsequently, we divided the values for the 4 and 18-min time-point by the values for the 0 time-point to calculate the EI. These data were then compared to changes in Pol II transcription, which was calculated by dividing the total normalized nucleotide density of the whole gene from the 18-min sample by the total normalized nucleotide density of the 0-minute sample. We then only selected those genes that showed an increase in transcription of at least 1.5, a coverage of at least 10 FPKM and EI of at least 2. From the resulting list of genes, only genes were selected that (a) had Nab3-binding sites near the 5′ end of the gene and (b) showed reproducible profiles in a replicate experiment.
Data normalization
We scaled the FPKM values of all transcripts within each time point by a constant factor such that the sum of FPKMs for all time points and all experimental replicates is the same. This was done for all data sets that were analyzed simultaneously, e.g., Nab3 and Pol II data sets (see below). For all data sets, the same time points were used (on a few occasions, temporally close time points were deemed identical for experimental purposes). Finally, for all analyzed data sets, we divided each time series of each experimental replicate by its steady state value before the imposition of stress (at 0 min after the nutrient shift). This way, all time series start at the same normalized binding value of 1 a.u. before the nutrient shift and the other values for later time points are relative to the background binding signal. We only keep those transcripts for the analysis that have real values for all time points in all experimental replicates after all steps of the normalization procedure.
Differential gene expression analyses
For the differential expression analyses we used DESeq222 in which two Megatron data sets were compared to four Vari-X-linker Nab3 glucose data sets. Only differentially expressed genes were selected that had an adjusted p-value of 0.05 or lower.
Testing for differential dynamic response
To determine whether the imposition of stress results in differential dynamics of RBP binding, we used a Bayesian non-parametric regression approach. Let f j(t) represent the binding response in condition j (stress or control) at time t, relative to time 0. Our main assumption is that this response, averaged over a population of cells, can be well modeled as a smooth function of time. To capture this assumption, we formulate a probabilistic model for the response function in terms of GPs (see e.g.27). A GP is an infinite-dimensional generalization of the multivariate Gaussian distribution, which provides a suitable prior distribution over a space of functions. Here we enforce the smoothness assumption by modeling correlations between function values at times s and t using a squared-exponential covariance function:
This covariance depends on two hyperparameters α, λ that are fitted to the data as described below.
We assume that observations y j(t) of binding to a specific transcript in condition j at time t (i.e., FPKM from the CRAC experiment at time t in condition j) are obtained from the unobserved function f j(t) by addition of zero-mean Gaussian noise with standard deviation σ. These assumptions enable us to marginalize exactly the unobserved function values to obtain an estimate of the data evidence (or marginal likelihood).
We then reformulate the testing question as a model selection problem. We consider two competing models:
H0, all the time series (control and stress) can be explained as noise corrupted observations of a single underlying function f(t) describing the dynamics of the system (null hypothesis).
H1, control and stress time series result from two distinct underlying dynamics, i.e., there are two functions f c(t) and f t(t), which generate the observations (alternative).
The ratio of the evidence under the two hypotheses (Bayes factor) quantifies the ratio of posterior probabilities of each model being correct, and hence provides a criterion for selecting one hypothesis over the other. We follow Kass and Raftery28 in adopting a Bayes factor greater or equal to 10 as strong evidence of one hypothesis over the other. The Bayes factor computation is performed independently for every transcript; it should be noted that, as this is a Bayesian method, no issues of multiple testing arise, since sampling variability is already accounted for in the marginalization process.
It should be noted that the evidence calculation can only be performed exactly when the covariance hyperparameters α, λ as well as the observation noise with standard deviation σ are known. Such parameters can also be assigned a prior distribution and marginalized for a fully Bayesian treatment; however, this greatly complicates the computational task of computing the evidence as exact marginalization is not possible. To avoid these additional overheads, we adopted an empirical Bayesian strategy and fixed the hyperparameters in a data-driven fashion. The scaling hyperparameterα 2 was defined as 50% of the variance of all binding time series, in control and treatment experimental replicates, for a given transcript. The length scale hyperparameter λ, which determines the number of units the data can be extrapolated away from, was globally set as 1 min. The variance of the observation noise σ 2 was chosen to be the variance of the binding time series of a given transcript in control conditions, i.e., under the glucose-to-glucose shift. Notice that the GP models under both hypotheses were provided with the same hyperparameter values, to avoid over fitting to the data.
Code availability
The pyCRAC package56 used for the data analyses is available on https://bitbucket.org/sgrann/pycrac. The complete data analysis pipeline is available on https://bitbucket.org/sgrann/kinetic_crac_pipeline/. Other python and perl scripts used for isolating oligo-A tailed reads are available upon request.
Data availability
Fastq and processed sequencing data that support the findings of this study have been deposited in the Gene Expression Omnibus (GEO) under the accession code GSE85545 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?&acc=GSE85545). The data that support the findings of this study are available from the corresponding author upon request.
Electronic supplementary material
Acknowledgements
We would like to thank Lidia Vasiljeva and members of the Granneman lab for critically reading the manuscript. We thank Frank Pugh for helpful suggestions. We are grateful to Ola Helwak for testing the Vari-X-linker with mammalian cells and Francesca Storici for providing the Delitto perfetto plasmids. This work was supported by grants from the Wellcome Trust (to S.G.; 091549) and a BBSRC Sparking award (to S.G.; S.I.2013.0202). G.S. and G.Sch. acknowledge support from the European Research Council under grant MLCS306999. A.S. is supported by the EPSRC. Next generation sequencing was carried out by Edinburgh Genomics, The University of Edinburgh. Edinburgh Genomics is partly supported through core grants from NERC (R8/H10/56), MRC (MR/K001744/1) and BBSRC (BB/J004243/1).
Author contributions
S.G. conceived the χCRAC method. S.G. and R.v.N. designed the experiments. R.v.N., E.d.L., and S.G. performed the experiments. P.W., S.G., and R.v.N. conceived the filtration unit. A.S., G.Sch., S.G., and G.San. designed and performed the computational analyses. All authors contributed to writing the paper and read and approved the manuscript.
Competing interests
P.W. was the director and owner of UVO3 that sells equipment for water sterilization. A.L. and R.F. are employees of UVO3. P.W., A.L., and R.F. have been involved in the development of the Vari-X-linker and the filtration unit. All remaining authors declare no competing financial interests.
Footnotes
Gabriele Schweikert, Erica de Leau and Alina Selega contributed equally to this work.
Electronic supplementary material
Supplementary Information accompanies this paper at doi:10.1038/s41467-017-00025-5.
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Castello A, Fischer B, Hentze MW, Preiss T. RNA-binding proteins in Mendelian disease. Trends Genet. 2013;29:318–327. doi: 10.1016/j.tig.2013.01.004. [DOI] [PubMed] [Google Scholar]
- 2.Pérez-Ortín JE, de Miguel-Jiménez L, Chávez S. Genome-wide studies of mRNA synthesis and degradation in eukaryotes. Biochim. Biophys. Acta. 2012;1819:604–615. doi: 10.1016/j.bbagrm.2011.12.002. [DOI] [PubMed] [Google Scholar]
- 3.Marguerat S, Lawler K, Brazma A, Bähler J. Contributions of transcription and mRNA decay to gene expression dynamics of fission yeast in response to oxidative stress. RNA Biol. 2014;11:702–714. doi: 10.4161/rna.29196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Kresnowati MTAP, et al. When transcriptome meets metabolome: fast cellular responses of yeast to sudden relief of glucose limitation. Mol. Syst. Biol. 2006;2:49. doi: 10.1038/msb4100083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Ule J, et al. CLIP identifies Nova-regulated RNA networks in the brain. Science. 2003;302:1212–1215. doi: 10.1126/science.1090095. [DOI] [PubMed] [Google Scholar]
- 6.Granneman S, Kudla G, Petfalski E, Tollervey D. Identification of protein binding sites on U3 snoRNA and pre-rRNA by UV cross-linking and high-throughput analysis of cDNAs. Proc. Natl. Acad. Sci. U.S.A. 2009;106:9613–9618. doi: 10.1073/pnas.0901997106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Hafner M, et al. Transcriptome-wide identification of RNA-binding protein and microRNA target sites by PAR-CLIP. Cell. 2010;141:129–141. doi: 10.1016/j.cell.2010.03.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.König J, et al. iCLIP reveals the function of hnRNP particles in splicing at individual nucleotide resolution. Nat. Struct. Mol. Biol. 2010;17:909–915. doi: 10.1038/nsmb.1838. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Van Nostrand EL, et al. Robust transcriptome-wide discovery of RNA-binding protein binding sites with enhanced CLIP (eCLIP) Nat. Methods. 2016;13:508–514. doi: 10.1038/nmeth.3810. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Zarnegar BJ, et al. irCLIP platform for efficient characterization of protein–RNA interactions. Nat. Methods. 2016;13:489–492. doi: 10.1038/nmeth.3840. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Flynn RA, et al. Dissecting noncoding and pathogen RNA-protein interactomes. RNA. 2015;21:135–143. doi: 10.1261/rna.047803.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Granneman S, Petfalski E, Tollervey D. A cluster of ribosome synthesis factors regulate pre-rRNA folding and 5.8S rRNA maturation by the Rat1 exonuclease. EMBO. J. 2011;30:4006–4019. doi: 10.1038/emboj.2011.256. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Schaughency P, Merran J, Corden JL. Genome-wide mapping of yeast RNA polymerase II termination. PLoS Genet. 2014;10:e1004632. doi: 10.1371/journal.pgen.1004632. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Beckmann, B. M. RNA interactome capture in yeast. Methods doi:10.1016/j.ymeth.2016.12.008 (in the press; 2016) [DOI] [PMC free article] [PubMed]
- 15.Porrua O, Libri D. Transcription termination and the control of the transcriptome: why, where and how to stop. Nat. Rev. Mol. Cell Biol. 2015;16:190–202. doi: 10.1038/nrm3943. [DOI] [PubMed] [Google Scholar]
- 16.Schulz D, et al. Transcriptome surveillance by selective termination of noncoding RNA synthesis. Cell. 2013;155:1075–1087. doi: 10.1016/j.cell.2013.10.024. [DOI] [PubMed] [Google Scholar]
- 17.Darby MM, Serebreni L, Pan X, Boeke JD, Corden JL. The S. cerevisiae Nrd1-Nab3 transcription termination pathway acts in opposition to ras signaling and mediates response to nutrient depletion. Mol. Cell. Biol. 2012;32:1762–1775. doi: 10.1128/MCB.00050-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Gasch AP, et al. Genomic expression programs in the response of yeast cells to environmental changes. Mol. Biol. Cell. 2000;11:4241–4257. doi: 10.1091/mbc.11.12.4241. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Jamonnak N, et al. Yeast Nrd1, Nab3, and Sen1 transcriptome-wide binding maps suggest multiple roles in post-transcriptional RNA processing. RNA. 2011;17:2011–2025. doi: 10.1261/rna.2840711. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Wu J, Zhang N, Hayes A, Panoutsopoulou K, Oliver SG. Global analysis of nutrient control of gene expression in Saccharomyces cerevisiae during growth and starvation. Proc. Natl Acad. Sci. U.S.A. 2004;101:3148–3153. doi: 10.1073/pnas.0308321100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Tree JJ, Granneman S, McAteer SP, Tollervey D, Gally DL. Identification of bacteriophage-encoded anti-sRNAs in pathogenic Escherichia coli. Mol. Cell. 2014;55:199–213. doi: 10.1016/j.molcel.2014.05.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Love MI, Huber W, Anders S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 2014;15:550. doi: 10.1186/s13059-014-0550-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Arigo JT, Carroll KL, Ames JM, Corden JL. Regulation of yeast NRD1 expression by premature transcription termination. Mol. Cell. 2006;21:641–651. doi: 10.1016/j.molcel.2006.02.005. [DOI] [PubMed] [Google Scholar]
- 24.Neil H, et al. Widespread bidirectional promoters are the major source of cryptic transcripts in yeast. Nature. 2009;457:1038–1042. doi: 10.1038/nature07747. [DOI] [PubMed] [Google Scholar]
- 25.Ghavidel A, et al. Impaired tRNA nuclear export links DNA damage and cell-cycle checkpoint. Cell. 2007;131:915–926. doi: 10.1016/j.cell.2007.09.042. [DOI] [PubMed] [Google Scholar]
- 26.Milligan L, et al. Strand-specific, high-resolution mapping of modified RNA polymerase II. Mol. Syst. Biol. 2016;12:874. doi: 10.15252/msb.20166869. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Rasmussen, C. E. & Williams, C. K. I. in Gaussian Processes for Machine Learning (Mit Press, 2006).
- 28.Kass RE, Raftery AE. Bayes factors. J. Am. Stat. Assoc. 2012;90:773–795. doi: 10.1080/01621459.1995.10476572. [DOI] [Google Scholar]
- 29.Haruki H, Nishikawa J, Laemmli UK. The anchor-away technique: rapid, conditional establishment of yeast mutant phenotypes. Mol. Cell. 2008;31:925–932. doi: 10.1016/j.molcel.2008.07.020. [DOI] [PubMed] [Google Scholar]
- 30.Munchel SE, Shultzaberger RK, Takizawa N, Weis K. Dynamic profiling of mRNA turnover reveals gene-specific and system-wide regulation of mRNA decay. Mol. Biol. Cell. 2011;22:2787–2795. doi: 10.1091/mbc.E11-01-0028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Vasiljeva L, Kim M, Mutschler H, Buratowski S, Meinhart A. The Nrd1-Nab3-Sen1 termination complex interacts with the Ser5-phosphorylated RNA polymerase II C-terminal domain. Nat. Struct. Mol. Biol. 2008;15:795–804. doi: 10.1038/nsmb.1468. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Gudipati RK, Villa T, Boulay J, Libri D. Phosphorylation of the RNA polymerase II C-terminal domain dictates transcription termination choice. Nat. Struct. Mol. Biol. 2008;15:786–794. doi: 10.1038/nsmb.1460. [DOI] [PubMed] [Google Scholar]
- 33.Xu Z, et al. Bidirectional promoters generate pervasive transcription in yeast. Nature. 2009;457:1033–1037. doi: 10.1038/nature07728. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Kim H, et al. Gene-specific RNA polymerase II phosphorylation and the CTD code. Nat. Struct. Mol. Biol. 2010;17:1279–1286. doi: 10.1038/nsmb.1913. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Kopcewicz KA, O’Rourke TW, Reines D. Metabolic regulation of IMD2 transcription and an unusual DNA element that generates short transcripts. Mol. Cell. Biol. 2007;27:2821–2829. doi: 10.1128/MCB.02159-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Kuehner JN, Brow DA. Regulation of a eukaryotic gene by GTP-dependent start site selection and transcription attenuation. Mol. Cell. 2008;31:201–211. doi: 10.1016/j.molcel.2008.05.018. [DOI] [PubMed] [Google Scholar]
- 37.Tuck AC, Tollervey D. A transcriptome-wide atlas of RNP composition reveals diverse classes of mRNAs and lncRNAs. Cell. 2013;154:996–1009. doi: 10.1016/j.cell.2013.07.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Rhee HS, Pugh BF. Genome-wide structure and organization of eukaryotic pre-initiation complexes. Nature. 2012;483:295–301. doi: 10.1038/nature10799. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Pelechano V, Wei W, Steinmetz LM. Extensive transcriptional heterogeneity revealed by isoform profiling. Nature. 2013;497:127–131. doi: 10.1038/nature12121. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Wlotzka W, Kudla G, Granneman S, Tollervey D. The nuclear RNA polymerase II surveillance system targets polymerase III transcripts. EMBO. J. 2011;30:1790–1803. doi: 10.1038/emboj.2011.97. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Brannan K, et al. mRNA decapping factors and the exonuclease Xrn2 function in widespread premature termination of RNA polymerase II transcription. Mol. Cell. 2012;46:311–324. doi: 10.1016/j.molcel.2012.03.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Kim JM, Vanguri S, Boeke JD, Gabriel A, Voytas DF. Transposable elements and genome organization: a comprehensive survey of retrotransposons revealed by the complete Saccharomyces cerevisiae genome sequence. Genome Res. 1998;8:464–478. doi: 10.1101/gr.8.5.464. [DOI] [PubMed] [Google Scholar]
- 43.Kim T, Xu Z, Clauder-Münster S, Steinmetz LM, Buratowski S. Set3 HDAC mediates effects of overlapping noncoding transcription on gene induction kinetics. Cell. 2012;150:1158–1169. doi: 10.1016/j.cell.2012.08.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Oliver KR, Greene WK. Transposable elements: powerful facilitators of evolution. Bioessays. 2009;31:703–714. doi: 10.1002/bies.200800219. [DOI] [PubMed] [Google Scholar]
- 45.Lesage P, Todeschini AL. Happy together: the life and times of Ty retrotransposons and their hosts. Cytogenet. Genome Res. 2005;110:70–90. doi: 10.1159/000084940. [DOI] [PubMed] [Google Scholar]
- 46.Billmyre RB, Calo S, Feretzaki M, Wang X, Heitman J. RNAi function, diversity, and loss in the fungal kingdom. Chromosome Res. 2013;21:561–572. doi: 10.1007/s10577-013-9388-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Heras SR, Macias S, Cáceres JF, Garcia-Perez JL. Control of mammalian retrotransposons by cellular RNA processing activities. Mob. Genet. Elements. 2014;4:e28439. doi: 10.4161/mge.28439. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Carmen AA, Holland MJ. The upstream repression sequence from the yeast enolase gene ENO1 is a complex regulatory element that binds multiple trans-acting factors including REB1. J. Biol. Chem. 1994;269:9790–9797. [PubMed] [Google Scholar]
- 49.Carmen AA, Brindle PK, Park CS, Holland MJ. Transcriptional regulation by an upstream repression sequence from the yeast enolase gene ENO1. Yeast. 1995;11:1031–1043. doi: 10.1002/yea.320111105. [DOI] [PubMed] [Google Scholar]
- 50.dela Cruz J, Karbstein K, Woolford JL. Functions of ribosomal proteins in assembly of eukaryotic ribosomes in vivo. Annu. Rev. Biochem. 2015;84:93–129. doi: 10.1146/annurev-biochem-060614-033917. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Coller J. Methods to determine mRNA half-life in Saccharomyces cerevisiae. Methods Enzymol. 2008;448:267–284. doi: 10.1016/S0076-6879(08)02614-1. [DOI] [PubMed] [Google Scholar]
- 52.Rabani M, et al. Metabolic labeling of RNA uncovers principles of RNA production and degradation dynamics in mammalian cells. Nat. Biotechnol. 2011;29:436–442. doi: 10.1038/nbt.1861. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Sun M, et al. Comparative dynamic transcriptome analysis (cDTA) reveals mutual feedback between mRNA synthesis and degradation. Genome Res. 2012;22:1350–1359. doi: 10.1101/gr.130161.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Honkela A, et al. Genome-wide modeling of transcription kinetics reveals patterns of RNA production delays. Proc. Natl Acad. Sci. U.S.A. 2015;112:13115–13120. doi: 10.1073/pnas.1420404112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Deneke C, Lipowsky R, Valleriani A. Complex degradation processes lead to non-exponential decay patterns and age-dependent decay rates of messenger RNA. PLoS ONE. 2013;8:e55442. doi: 10.1371/journal.pone.0055442. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Webb S, Hector RD, Kudla G, Granneman S. PAR-CLIP data indicate that Nrd1-Nab3-dependent transcription termination regulates expression of hundreds of protein coding genes in yeast. Genome Biol. 2014;15:R8. doi: 10.1186/gb-2014-15-1-r8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Dodt M, Roehr JT, Ahmed R, Dieterich C. FLEXBAR-Flexible barcode and adapter processing for next-generation sequencing platforms. Biology (Basel) 2012;1:895–905. doi: 10.3390/biology1030895. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.van Dijk EL, et al. XUTs are a class of Xrn1-sensitive antisense regulatory non-coding RNA in yeast. Nature. 2011;475:114–117. doi: 10.1038/nature10118. [DOI] [PubMed] [Google Scholar]
- 59.Yassour M, et al. Strand-specific RNA sequencing reveals extensive regulated long antisense transcripts that are conserved across yeast species. Genome Biol. 2010;11:1. doi: 10.1186/gb-2010-11-8-r87. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Ernst J, Bar-Joseph Z. STEM: a tool for the analysis of short time series gene expression data. BMC Bioinformatics. 2006;7:191. doi: 10.1186/1471-2105-7-191. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Fastq and processed sequencing data that support the findings of this study have been deposited in the Gene Expression Omnibus (GEO) under the accession code GSE85545 (https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?&acc=GSE85545). The data that support the findings of this study are available from the corresponding author upon request.