The transcriptional elongation rate regulates alternative polyadenylation in yeast

Joseph V Geisberg; Zarmik Moqtaderi; Kevin Struhl

doi:10.7554/eLife.59810

. 2020 Aug 26;9:e59810. doi: 10.7554/eLife.59810

The transcriptional elongation rate regulates alternative polyadenylation in yeast

Joseph V Geisberg ^1,^†, Zarmik Moqtaderi ^1,^†, Kevin Struhl ^1,^✉

Editors: Eric J Wagner², James L Manley³

PMCID: PMC7532003 PMID: 32845240

Abstract

Yeast cells undergoing the diauxic response show a striking upstream shift in poly(A) site utilization, with increased use of ORF-proximal poly(A) sites resulting in shorter 3’ mRNA isoforms for most genes. This altered poly(A) pattern is extremely similar to that observed in cells containing Pol II derivatives with slow elongation rates. Conversely, cells containing derivatives with fast elongation rates show a subtle downstream shift in poly(A) sites. Polyadenylation patterns of many genes are sensitive to both fast and slow elongation rates, and a global shift of poly(A) utilization is strongly linked to increased purine content of sequences flanking poly(A) sites. Pol II processivity is impaired in diauxic cells, but strains with reduced processivity and normal Pol II elongation rates have normal polyadenylation profiles. Thus, Pol II elongation speed is important for poly(A) site selection and for regulating poly(A) patterns in response to environmental conditions.

Research organism: S. cerevisiae

Introduction

In eukaryotes, transcription by RNA polymerase II (Pol II) and subsequent RNA processing steps give rise to numerous same-gene mRNA isoforms. These isoforms can exhibit substantial differences in sequence due to alternative splicing, differential 5’ and/or 3’ end utilization, and other co- and post-transcriptional processes (Geisberg et al., 2014; Berkovits and Mayr, 2015; Floor and Doudna, 2016). This broad mRNA isoform repertoire is important for proper cellular regulation of protein isoform composition, synthesis rate, and localization (Mayr, 2016).

Alternative cleavage/polyadenylation in 3’ untranslated regions (3’UTRs) is an important mechanism for generating same-gene mRNA isoforms. Most eukaryotic mRNAs are cleaved and polyadenylated at multiple locations within 3’UTRs to generate same-gene isoforms that can be separated by as little as a single nt or by as much as several kb (Ozsolak et al., 2010; Sherstnev et al., 2012; Moqtaderi et al., 2013; Pelechano et al., 2013). In S. cerevisiae and related yeast species, a typical gene possesses ~60 mRNA 3’ isoforms, the vast majority of which are found within the first ~300 nt of the 3’UTR (Moqtaderi et al., 2013).

3’UTR regions contain binding sites for proteins and (in many eukaryotes) microRNAs that affect the function of the bound mRNAs (Bartel, 2009; Baltz et al., 2012; Freeberg et al., 2013). Thus, same-gene isoforms that contain or lack particular 3’UTR sequences can differ in their protein and microRNA binding sites, leading to differences in translation efficiency, intracellular localization, and mRNA stability (Mayr, 2016). In yeast, same-gene 3’ mRNA isoforms, even those that differ by 1–3 nt, can possess different half-lives, in vivo structures (based on DMS profiling), and poly(A)-binding protein (Pab1) binding levels (Moqtaderi et al., 2018). Sequences responsible for isoform-specific structures, differential Pab1 binding, and mRNA stability are evolutionarily conserved, indicating biological function (Moqtaderi et al., 2018).

Alternative polyadenylation is regulated on a transcriptome scale by environmental or developmental conditions. For example, cancer cells and pluripotent stem cells preferentially express shorter 3’ mRNA isoforms, whereas differentiated cells preferentially express longer 3’ mRNA isoforms (Mayr and Bartel, 2009; Weill et al., 2012; Elkon et al., 2013; Li and Lu, 2013; Tian and Manley, 2013; Masamha et al., 2014). The mechanism by which thousands of genes undergo regulated polyadenylation is poorly understood but is thought to involve mis-regulation of cleavage/polyadenylation factors. Upregulation of CSTF64, FIP1, NUDT21, and PCF11 favors more proximal isoforms, while numerous factors, including CFI25/NUDT21, CPSF6, CFI68, and ELAVL2/3, enhance the usage of more distal poly(A) sites (Ogorodnikov et al., 2018; Kamieniarz-Gdula et al., 2019). In yeast, alternative polyadenylation in response to environmental conditions has been observed (Sparks et al., 1997; Sparks and Dieckmann, 1998; Graber et al., 2013). Pcf11 and the CPF complex are important contributors to cleavage/polyadenylation site selection, as their inactivation or down-regulation causes a downstream shift in poly(A) sites (Graber et al., 2013; Liu et al., 2017b; Gruber and Zavolan, 2019). In addition, numerous other RNA binding proteins can affect the distribution of 3’ mRNA isoforms in vivo in a more limited fashion (Tian and Manley, 2017).

Transcriptional elongation is mechanistically linked to post-transcriptional processes such as splicing, polyadenylation, nuclear export, and histone modification (Lei et al., 2001; Strässer et al., 2002; Krogan et al., 2003; Ng et al., 2003; Bentley, 2014; Wallace and Beggs, 2017). In yeast cells, Pol II derivatives with slow elongation rates have defects in processivity, the ability of Pol II to travel completely down the gene (Mason and Struhl, 2005). Similarly, mammalian cells harboring a Pol II derivative with a slow elongation rate exhibit reduced Pol II density in 3’ UTRs, whereas cells with a fast Pol II mutant show increased Pol II at more distal sequences (Fong et al., 2015). In Drosophila, a slow Pol II mutant affects poly(A) site selection in 3–5% of genes, with an equal number showing increased upstream or downstream utilization of poly(A) sites (Liu et al., 2017a). Pol II elongation rates of individual mammalian genes can vary by more than an order of magnitude and are conserved across cell types (Veloso et al., 2014). Kinetic competition between elongating Pol II and the Xrn2 exonuclease, which degrades mRNA after cleavage in the 3’ UTR, affects transcriptional termination a few hundred nucleotides downstream (Kim et al., 2004; Fong et al., 2015; Baejen et al., 2017). Similar competition between elongating Pol II and the cleavage/polyadenylation machinery might also affect the choice of poly(A) sites.

Here we show that cells undergoing the diauxic shift, a metabolic shift preceding stationary phase, exhibit a transcriptome-wide 3’ upstream shift in poly(A) site use, leading to shorter 3’ mRNA isoforms. This upstream shift is strikingly mimicked in strains harboring Pol II derivatives with reduced elongation rates. Conversely, albeit to a lesser extent, strains having Pol II derivatives with increased elongation rates show a downstream shift in poly(A) sites. Like yeast Pol II derivatives with slow elongation rates under standard growth conditions (Mason and Struhl, 2005), wild-type Pol II shows a processivity defect in diauxic conditions; this defect is strongly correlated with the magnitude of upstream shift. In contrast, mutant strains defective in Pol II processivity but with normal elongation rates display normal patterns of polyadenylation. Thus, Pol II speed influences alternative polyadenylation, and it likely explains the poly(A) pattern changes that occur in yeast cells undergoing the diauxic shift. We suggest that regulation of the Pol II elongation rate in response to environmental or developmental changes represents a novel mechanism to reprogram the 3’ mRNA isoform repertoire that is distinct from changes in the cleavage/polyadenylation machinery.

Results

Poly(A) sites are shifted upstream under diauxic conditions, favoring shorter 3’ isoforms

To examine mRNA 3’ isoform distribution as a function of growth condition, we grew duplicate S. cerevisiae cultures to mid-log phase in glucose-containing rich medium (YPD), galactose-containing rich medium (YPGal), nutrient-poor minimal medium (MM), and YPD containing 1M sorbitol (Sorbitol), an inducer of osmotic stress. In addition, we examined the early stages of diauxic shift, a condition in which the primary carbon source (glucose) and other metabolites are depleted, by growing the cells for 3 days in YPD (Vivier et al., 1997; Galdieri et al., 2010). We used 3’ READS to map 3’ mRNA isoforms at single nucleotide resolution on a transcriptome scale (~20 million reads/sample). Biological replicates exhibit very high correlation to one another on a combined gene expression basis (R = 0.91 to > 0.99; Figure 1—figure supplement 1A) as well as at the individual isoform level (R = 0.88 to > 0.99; Figure 1—figure supplement 1B,C).

As expected, the total number of sequence reads within a given 3’ UTR can vary under different conditions, reflecting regulated expression of many genes under one or more conditions. However, this work focuses on poly(A) profiles, not overall expression levels, of individual yeast genes. Thus, for each gene, we define the level of the most highly expressed 3’ mRNA isoform to be 100 and use this to normalize the levels of all other isoforms of the same gene.

The span of genomic sequence encompassing all of a given gene’s poly(A) sites is termed the gene’s ‘end zone,’ and a yeast end zone may contain upwards of 60 3’ isoform endpoints (Moqtaderi et al., 2013; Pelechano et al., 2013) across conditions (Figure 1A). To avoid problems related to sequencing depth, we limited our analyses to genes with >1000 sequencing reads. Within this subset, we generally focused on each gene’s major isoforms, which we define as being at least 5% as abundant as that gene’s most highly expressed isoform. On average, a yeast gene gives rise to eight major isoforms (representing ~30% of all 3’UTR polyadenylation sites) under a variety of growth conditions. The section of the end zone occurring between the most proximal and most distal major isoform endpoints is termed the ‘major end zone’, and the length between these boundaries is the ‘major end zone span’ (Figure 1A). For simplicity, we will sometimes summarize the characteristics of an end zone by referring to the major end zone span and boundaries, the maximally expressed isoform endpoint, and the weighted average isoform endpoint (the arithmetic mean of all isoform endpoints in the end zone).

Figure 1—figure supplement 1. — (A) Representative end zone profile (histogram of isoform frequencies) with key landmarks indicated. (B) End zone profiles for four genes under five growth conditions. (C) Major end zone under five growth conditions. Boundaries represent median values genome-wide for 5’-most and 3’-most major isoforms, and the vertical line within the major end zone represents the genome-wide median of the weighted average isoform position. (D) Table of statistics for landmark positions under five growth conditions. Numbers are the median values across genes with a combined read count of at least 1000 in both replicates in every condition. Numbers in bold red are shifted upstream from WT in a statistically meaningful way (p < 0.01).

The isoform distributions of individual gene end zone profiles are very similar in YPD, YPGal, MM, and Sorbitol (Figure 1B). Meta-gene plots are nearly superimposable across conditions (Figure 1C and Figure 1—figure supplement 1D), and end zone parameters such as maximally expressed isoform position (max position), weighted average coordinate, major end zone boundary coordinates, and major end zone span are nearly identical (Figure 1D). Thus, growth conditions that greatly affect expression levels of numerous genes do not alter the overall patterns of steady-state isoforms.

In contrast, the end zone patterns of genes in diauxic conditions display a very significant shift in the 5’ direction (Figure 1B). The maximally expressed isoform position, major end zone boundary coordinates and the weighted average poly(A) position are all shifted 10–25 nt upstream (Figure 1D), while the average end zone pattern is clearly different from all the other conditions (Figure 1C and Figure 1—figure supplement 1D). More than 80% of all genes assayed display an obvious 5’ end zone shift (see below), indicating that the altered poly(A) isoform pattern seen in diauxic conditions is a general, genome-wide phenomenon that is independent of overall expression level.

Yeast cells containing Pol II derivatives with slow elongation rates show a poly(A) pattern that strikingly resembles the pattern in diauxic shift

Pol II derivatives with slow elongation rates have reduced processivity, such that some Pol II molecules dissociate prematurely from the DNA template (Mason and Struhl, 2005; Fong et al., 2015). As elongating Pol II complexes are exceptionally stable and hence unlikely to simply dissociate, premature Pol II dissociation is likely to be an active process such as cleavage/polyadenylation followed by exonucleolytic RNA degradation, as described in the torpedo model (Kim et al., 2004; Baejen et al., 2017). In this view, a slow Pol II would give extra time for the cleavage/polyadenylation machinery to function, leading to a 5’ bias in poly(A) sites. By analogy, Pol II derivatives with a slow elongation rate show a 5’ bias in alternative splicing (de la Mata et al., 2003; Dujardin et al., 2014). We therefore considered whether the upstream shift of poly(A) sites under diauxic conditions might be a consequence of reduced Pol II processivity and/or slower speed.

To examine the influence of Pol II elongation rate on isoform distribution, we generated yeast strains with mutations (either H1085Q or F1086S) in the largest Pol II subunit (Rpb1) that decrease the elongation rate (Braberg et al., 2013). These mutations lie within the ‘trigger loop’ region of Rpb1, a region that physically interacts with the non-templated strand and is important for ribonucleotide selection, catalysis, and Pol translocation II along the template strand (Wang et al., 2006; Kaplan, 2013; Barnes et al., 2015). We refer to these alleles by their relative speeds with respect to wild-type Pol II: ‘slow’ (F1086S; 2.5-fold slower than wild-type) or ‘slower’ (H1085Q; 5-fold slower than wild-type) (Kaplan et al., 2012). mRNA 3’ isoform profiling of these strains reveals a substantial upstream end zone shift that is nearly as dramatic as that observed in diauxic conditions (Figure 2). Globally, most end zone parameters are shifted ~10–20 nt upstream relative to those of the wild-type isogenic strain, and the averaged meta-gene end zone profile most closely resembles that of the wild-type strain in diauxic conditions (Figure 2B and C, and Figure 2—figure supplement 1). Furthermore, 90% of individual genes with an upstream shift in diauxic conditions also show upstream shifts in both slow-Pol II strains (Figure 2C), an outcome that is extremely unlikely to occur by chance (p<10⁻¹⁰⁰). These results suggest a mechanistic relationship between diauxic conditions and slow Pol II elongation in establishing the pattern of polyadenylation.

Figure 2. — (A) End zone profiles for *PDB1* and *BET4* in strains harboring wild-type Rpb1 (in exponential and diauxic growth conditions), Rbp1 H1085Q (‘slower’), and Rpb1 F1086S (‘slow’). (B) Major end zones of these strains. Boundaries represent median values genome-wide for 5’-most and 3’-most major isoforms, and the vertical line within the major end zone represents the genome-wide median of the weighted average isoform position. (C) Table of statistics for landmark positions. Numbers are the median values across genes with a total of at least 1000 sequence reads in both replicates in every condition. Bold red numbers are shifted upstream vs WT in a statistically meaningful way (p < 0.01). (D) Bar graph representation of each gene’s net shift in weighted average isoform position in strains with slow vs wild-type Rpb1. Each horizontal line represents one gene, ordered by shift values in the ‘slow’ strain; the graph includes 3497 genes with a combined read count of at least 1000 for both replicates in all three strains. Yellow bars represent the 'slow' strain, and blue is for the 'slower' strain; overlapping bars appear green. To obtain net shift values for every gene in each mutant strain, the average shift vs WT in two replicates was diminished by the absolute value of the average shift of the WT and mutant biological replicates. The net shift was set to zero if the absolute value of the shift vs WT was less than the absolute value of the shift between biological replicates. (E) Venn diagram overlap of genes categorized as upshifted in the diauxic condition, slower Rpb1 (H1085Q), or slow Rpb1 (F1086S) strains. (F) Correlation of end zone shifts in diauxic and slow Pol II strains. The average net overall end zone shift in slow Pol II strains (x-axis; see Materials and methods) is plotted against the net overall end zone shift in diauxic cells (y-axis). Negative values represent upstream shifts, and positive values indicate downstream end zone shifts.

Figure 2—figure supplement 1. — (A) End zone profiles for *PDB1* and *BET4* in strains harboring wild-type Rpb1 (in exponential and diauxic growth conditions), Rbp1 H1085Q (‘slower’), and Rpb1 F1086S (‘slow’). (B) Major end zones of these strains. Boundaries represent median values genome-wide for 5’-most and 3’-most major isoforms, and the vertical line within the major end zone represents the genome-wide median of the weighted average isoform position. (C) Table of statistics for landmark positions. Numbers are the median values across genes with a total of at least 1000 sequence reads in both replicates in every condition. Bold red numbers are shifted upstream vs WT in a statistically meaningful way (p < 0.01). (D) Bar graph representation of each gene’s net shift in weighted average isoform position in strains with slow vs wild-type Rpb1. Each horizontal line represents one gene, ordered by shift values in the ‘slow’ strain; the graph includes 3497 genes with a combined read count of at least 1000 for both replicates in all three strains. Yellow bars represent the 'slow' strain, and blue is for the 'slower' strain; overlapping bars appear green. To obtain net shift values for every gene in each mutant strain, the average shift vs WT in two replicates was diminished by the absolute value of the average shift of the WT and mutant biological replicates. The net shift was set to zero if the absolute value of the shift vs WT was less than the absolute value of the shift between biological replicates. (E) Venn diagram overlap of genes categorized as upshifted in the diauxic condition, slower Rpb1 (H1085Q), or slow Rpb1 (F1086S) strains. (F) Correlation of end zone shifts in diauxic and slow Pol II strains. The average net overall end zone shift in slow Pol II strains (x-axis; see Materials and methods) is plotted against the net overall end zone shift in diauxic cells (y-axis). Negative values represent upstream shifts, and positive values indicate downstream end zone shifts.

The upstream shift in the slow Pol II strains and in diauxic conditions reflects altered poly(A) site utlization per se, because the formal possibility that it is due to preferential degradation of longer mRNA isoforms is highly unlikely. mRNA stability in yeast involves many hundreds of stabilizing and destabilizing elements that are located anywhere within 3’UTRs (Geisberg et al., 2014; Gupta et al., 2014). As such, longer isoforms within a gene can be either more or less stable than shorter isoforms. Furthermore, the same poly(A) sites are used in normal and diauxic conditions (Figures 1B and 2A, and see below), and it is extremely unlikely that Pol II speed affects the stability of an mRNA isoform, because Pol II must proceed at least 10 nt beyond the poly(A) site in order for this site within the mRNA to become accessible to the cleavage/polyadenylation machinery.

Upstream shifts involve differential utilization of pre-existing poly(A) sites

We examined whether the upstream-shifted isoforms that predominate in diauxic conditions occur at new polyadenylation positions or represent increased utilization of ORF-proximal sites observed in other conditions. We developed a mathematical model to quantify the likelihood that any overlap in poly(A) positions between exponential growth and diauxic or slow-polymerase conditions is due to chance. Assuming that cleavage/polyadenylation can occur at any position within the first 400 nt of the 3’ UTR, the probability that the observed positional overlap occurs by chance is infinitesimal (median R = 1.42×10⁻¹⁰) (Figure 3—figure supplement 1). If we are more conservative and assume instead that the universe of possible cleavage/polyadenylation sites is restricted to sites actually observed in at least one of our growth conditions, then the probability of positional overlap by chance is still vanishingly small (median R = 7.63×10⁻⁷; Figure 3 and Figure 3—figure supplement 1). Thus, the end zone shift generally represents a rebalancing of poly(A) site use rather than the creation of new sites. In accord with these results, the nucleotide frequencies surrounding the major end zone boundary positions and the maximally expressed isoform endpoint are nearly identical under diauxic and slow Pol II conditions (see below). The striking similarity of major isoform positions and nucleotide compositions across conditions indicates that local poly(A) site specificity is mechanistically defined and determined by the basic properties of Pol II and the cleavage/polyadenylation machinery.

Figure 3. — (A) Probability of overlap in isoform distribution by chance as a function of combined end zone length in strains with very slow (H1085Q) or wild-type Rpb1. (B) Probability of overlap in isoform distribution by chance as a function of combined end zone length in exponential growth and diauxic conditions.

Figure 3—figure supplement 1. — (A) Probability of overlap in isoform distribution by chance as a function of combined end zone length in strains with very slow (H1085Q) or wild-type Rpb1. (B) Probability of overlap in isoform distribution by chance as a function of combined end zone length in exponential growth and diauxic conditions.

Pol II derivatives with fast elongation rates show modest downstream shifts in poly(A) patterns

As a complement to the above experiments, we also performed mRNA isoform profiling in strains harboring Rpb1 derivatives (L1101S ‘fast,’ or E1103G, ‘faster’) whose Pol II elongation rates are 2- to 2.5-fold faster than wild-type (Kaplan et al., 2012; Braberg et al., 2013). L1101 and E1103 lie within an α-helical region adjoining the trigger loop that is thought to contact the non-templated DNA strand and be important for Pol II translocation. In comparison to the slow Pol II strains, these fast Pol II strains show more subtle changes to 3’ isoform distributions, with small but significant downstream shifts in end zone parameters such as the maximal position, major end zone span, and weighted average coordinate (Figure 4). The Rpb1-E1103G strain exhibits a slightly greater overall downstream shift than the Rpb1-L1101S strain, consistent with its faster elongation rate (Kaplan et al., 2012; Braberg et al., 2013) Similarly, the upstream shifts of the slow Pol II strains parallel their elongation rates determined in vitro, with the slower mutant shifted farther upstream. The relationship of the elongation rates determined in vitro to the poly(A) pattern shift in vivo strongly suggests that the effects on polyadenylation are due to the elongation rate.

Figure 4. — (A) End zone profiles for *MRM1* and *OPI3* in strains with wild-type, L1101S (‘fast’), and E1103G (‘faster’) Rpb1. (B) Major end zones of these strains. Boundaries represent median values genome-wide for 5’-most and 3’-most major isoforms, and the vertical line within the major end zone represents the genome-wide median of the weighted average isoform position. (C) Table of statistics for landmark positions. Numbers aremedian values across genes with a total of at least 1000 sequence reads in both replicates in every condition. Numbers in bold green are significantly shifted downstream from WT (p < 0.01). (D) Bar graph representation of each gene’s net shift in weighted average isoform position in strains with fast vs wild-type Rpb1. Each horizontal line represents one gene, ordered by shift values in the ‘fast’ strain; the graph includes 3627 genes with a combined read count of at least 1000 for both replicates in all three strains. Yellow represents the 'fast' strain and blue the 'faster' strain, with the overlap appearing green. To obtain net shift values for every gene in each mutant strain, the average shift vs WT in two replicates was diminished by the absolute value of the average shift of the WT and mutant biological replicates. The net shift was set to zero if the absolute value of the shift vs WT was less than the absolute value of the shift between biological replicates. (E) 2790 genes are plotted as a function of the average overall net end zone shift (see Materials and methods) in either catalytically fast (x-axis) or slow (y-axis) Pol II mutants. Genes were classified into Upstream (red), Downstream (green), Both (blue), Neutral (black) and Other (orange) on the basis of each gene’s net end zone shift (see text). The upper right-hand quadrant comprises genes shifted upstream in slow Pol II mutants and downstream in fast Pol II mutants, while genes in the upper left-hand quadrant are shifted upstream in both fast and slow Pol II mutant strains. The bottom right quadrant contains genes that are shifted downstream in both slow and fast Pol II mutants, while the few genes whose end zones are shifted downstream in slow Pol II and upstream in fast Pol II strains are found in the bottom left quadrant. (F) Left: Classification of genes by category. The categories are: ‘Upstream,’ genes whose poly(A) sites were upshifted in both slow-Pol II strains; ‘Downstream,’ genes whose end zone profiles were downshifted in both fast-Pol II strains; ‘Neutral,’ genes with no end zone shift in any slow or fast Pol II-containing strain; and ‘Other,’ genes with any other combination of properties (see Materials and methods). Right: Venn diagram illustrating the 'Both' sub-category of genes (see Materials and Methods), i.e. the intersection of the set of genes shifted upstream in slow Pol II (Upstream category) with the set of genes shifted downstream in the presence of fast Pol II (Downstream category).

Figure 4—figure supplement 1. — (A) End zone profiles for *MRM1* and *OPI3* in strains with wild-type, L1101S (‘fast’), and E1103G (‘faster’) Rpb1. (B) Major end zones of these strains. Boundaries represent median values genome-wide for 5’-most and 3’-most major isoforms, and the vertical line within the major end zone represents the genome-wide median of the weighted average isoform position. (C) Table of statistics for landmark positions. Numbers aremedian values across genes with a total of at least 1000 sequence reads in both replicates in every condition. Numbers in bold green are significantly shifted downstream from WT (p < 0.01). (D) Bar graph representation of each gene’s net shift in weighted average isoform position in strains with fast vs wild-type Rpb1. Each horizontal line represents one gene, ordered by shift values in the ‘fast’ strain; the graph includes 3627 genes with a combined read count of at least 1000 for both replicates in all three strains. Yellow represents the 'fast' strain and blue the 'faster' strain, with the overlap appearing green. To obtain net shift values for every gene in each mutant strain, the average shift vs WT in two replicates was diminished by the absolute value of the average shift of the WT and mutant biological replicates. The net shift was set to zero if the absolute value of the shift vs WT was less than the absolute value of the shift between biological replicates. (E) 2790 genes are plotted as a function of the average overall net end zone shift (see Materials and methods) in either catalytically fast (x-axis) or slow (y-axis) Pol II mutants. Genes were classified into Upstream (red), Downstream (green), Both (blue), Neutral (black) and Other (orange) on the basis of each gene’s net end zone shift (see text). The upper right-hand quadrant comprises genes shifted upstream in slow Pol II mutants and downstream in fast Pol II mutants, while genes in the upper left-hand quadrant are shifted upstream in both fast and slow Pol II mutant strains. The bottom right quadrant contains genes that are shifted downstream in both slow and fast Pol II mutants, while the few genes whose end zones are shifted downstream in slow Pol II and upstream in fast Pol II strains are found in the bottom left quadrant. (F) Left: Classification of genes by category. The categories are: ‘Upstream,’ genes whose poly(A) sites were upshifted in both slow-Pol II strains; ‘Downstream,’ genes whose end zone profiles were downshifted in both fast-Pol II strains; ‘Neutral,’ genes with no end zone shift in any slow or fast Pol II-containing strain; and ‘Other,’ genes with any other combination of properties (see Materials and methods). Right: Venn diagram illustrating the 'Both' sub-category of genes (see Materials and Methods), i.e. the intersection of the set of genes shifted upstream in slow Pol II (Upstream category) with the set of genes shifted downstream in the presence of fast Pol II (Downstream category).

Poly(A) patterns of individual genes vary in their sensitivity to Pol II elongation rate

To address the relationship between poly(A) patterns in the slow and fast Pol II derivatives, we constructed a mathematical error model to determine whether the poly(A) pattern of an individual gene is significantly shifted in either the upstream or downstream direction. In the slow Pol II mutants, 2083 (Rpb1-H1085Q ‘slower’) and 1947 (Rpb1-F1086S ‘slow’) genes (out of a total of 2,790) show significant upstream shifts (Figure 4—figure supplement 1), with more than 97% of the upstream shifts occurring in both strains (1898 out of 1947 genes, p<10⁻¹⁰⁰, hypergeometric test). In the fast Pol II strains, a smaller proportion of genes (23% for Rpb1-L1101S ‘fast’ and 32% for Rpb1-E1103G ‘faster’) show downstream shifts, with the vast majority of these shifts (95%; p<10⁻¹⁰⁰) occurring in both strains (Figure 4—figure supplement 1). Interestingly, 76% (462 out of 605) of genes showing downstream shifts in both fast Pol II strains also exhibit upstream shifts in both slow Pol II strains, an overlap that is highly significant (Figure 4E and F, and Figure 4—figure supplement 1; p=2.68×10⁻⁷). Thus, 17% of yeast genes tested are especially sensitive to perturbations in Pol II elongation rate, both fast and slow. The striking similarities in polyadenylation profiles between the two slow and the two fast Pol II derivatives indicate that these patterns depend on Pol II elongation rate and not on other properties of the Pol II derivatives.

Unexpectedly, a minority class of genes behave in the opposite manner. In the slower Pol II strain, a small number of genes (46; Figures 2D, 4E and F, and Figure 4—figure supplement 1) exhibit an atypical downstream shift, 34 of which also show a downstream shift in strains containing the slow Pol II derivative (p=6.0×10⁻⁵², hypergeometric test). Conversely, in the fast Pol II strains, a small minority of genes show atypical upstream shifts in (196 for Rpb1-L1101S and 228 for Rpb1-E1103G), with ~72% of these showing upstream shifts in both strains (p<10⁻¹⁰⁰) (Figure 4D and E and Figure 4—figure supplement 1). The fact that these opposite patterns are observed in two different strains with the same catalytic properties (either fast or slow Pol II) suggest that a minority of genes have Pol II elongation properties in vivo that are different from one would expect from the Pol II elongation rate determined in vitro on a specific DNA template.

Sequences around cleavage sites of Pol II speed-sensitive genes are enriched for purines

Although overall nucleotide frequencies at sequences located ±10 nt from poly(A) sites are virtually identical (Figure 5—figure supplement 1A), we examined whether such sequences surrounding max isoform endpoints differ between Pol II speed-sensitive genes (‘Both’ category) and genes unaffected by Pol II elongation rate. Interestingly, speed-sensitive genes have reduced frequencies in U and (to a lesser extent) C residues, and a greater incidence of A and G residues relative to speed-unaffected genes (Figure 5). This skewed frequency of purines to pyrimidines is observed in all conditions tested, and nucleotide distributions in speed-sensitive genes bear a striking semblance to one another irrespective of condition or speed category (Figure 5—figure supplement 1C). This is noteworthy because max isoform positions (and hence adjacent sequences) vary greatly among conditions and gene categories (Figure 5—figure supplement 1B and D). These results strongly suggest that localized sequence composition, not location within the 3’UTR, is the primary determinant of susceptibility to cleavage/polyadenylation changes in strains with altered Pol II elongation rates and in diauxic conditions.

Evidence that Pol II elongation rate is decreased in diauxic conditions

The striking similarity of poly(A) profiles in diauxic conditions and in strains with slow-elongating Pol II derivatives suggests that the Pol II elongation rate is slower in diauxic conditions than it is in exponentially growing cells. Because diauxic cells are carbon source starved, it is impossible to directly measure the Pol II elongation rate using a conventional assay that involves rapid glucose shutoff of a long gene (Mason and Struhl, 2005). Instead, we used Pol II processivity as a proxy for elongation rate, based on the observation that a slow elongation rate is associated with decreased Pol II processivity and disproportionate accumulation at promoter regions in vivo (Mason and Struhl, 2005; Fong et al., 2017).

We compared Pol II occupancy at the coding sequences and promoter regions of 14 genes in diauxic versus exponentially growing cells. The resulting promoter:ORF occupancy ratios under each condition were combined to generate a diauxic:exponential processivity ratio (Figure 6A). Some genes display diauxic:exponential ratios of ~1, indicating that the Pol II distributions are similar in both conditions. However, most of the genes tested have diauxic:exponential ratios ranging from 2 to 6, indicating disproportionate accumulation of Pol II at promoter regions in diauxic conditions. Importantly, the extent of the upstream shift in poly(A) site selection is strongly correlated (Figure 6B; R = 0.63) with the diauxic:exponential processivity ratio, suggesting that Pol II elongates slowly under diauxic conditions.

Figure 6. — (A) Pol II occupancy (background-subtracted ChIP signal) at promoters and ORFs of select genes in logarithmic growth and diauxic conditions. For every gene, the promoter/ORF occupancy ratio is determined for each condition, and the ratio of these ratios (diauxic/log phase), termed the processivity ratio, is given under the locus name. (B) Scatter plot of the diauxic/log phase processivity ratio vs upstream shift (see Materials and methods) in nt observed in diauxic conditions. (C) End zone profiles of *NTH1* and *YMC2* in wild-type, *spt4∆*, and *hpr1∆* strains. (D) Plot of genome-wide median major end zones in wild-type (log phase and diauxic), slower-Pol II (Rpb1 H1085Q), *hpr1∆*, and *spt4∆* strains. (E) Landmark statistics table in these strains. (All genes with >1000 reads/condition). Bold red numbers represent statistically meaningful upstream shifts vs WT (p < 0.01).

Figure 6—figure supplement 1. — (A) Pol II occupancy (background-subtracted ChIP signal) at promoters and ORFs of select genes in logarithmic growth and diauxic conditions. For every gene, the promoter/ORF occupancy ratio is determined for each condition, and the ratio of these ratios (diauxic/log phase), termed the processivity ratio, is given under the locus name. (B) Scatter plot of the diauxic/log phase processivity ratio vs upstream shift (see Materials and methods) in nt observed in diauxic conditions. (C) End zone profiles of *NTH1* and *YMC2* in wild-type, *spt4∆*, and *hpr1∆* strains. (D) Plot of genome-wide median major end zones in wild-type (log phase and diauxic), slower-Pol II (Rpb1 H1085Q), *hpr1∆*, and *spt4∆* strains. (E) Landmark statistics table in these strains. (All genes with >1000 reads/condition). Bold red numbers represent statistically meaningful upstream shifts vs WT (p < 0.01).

Pol II elongation rate, not processivity, is important for polyadenylation patterns

The above analysis of Pol II processivity under diauxic and exponential growth conditions cannot distinguish whether the polyadenylation pattern is due to a reduced elongation rate or to a decrease in Pol II processivity. To address the role of Pol II processivity more specifically, we examined the poly(A) profiles of cells that lack either Spt4 or Hpr1, two proteins that travel with elongating Pol II. Spt4 and Hpr1 deletion strains exhibit Pol II processivity defects, but they do not affect the Pol II elongation rate (Mason and Struhl, 2005). The poly(A) profiles of spt4Δ and hpr1Δ strains are very similar to those of the wild-type strain at the individual gene level (Figure 6C). In fact, meta-gene profiles and various end zone parameters indicate a very modest downstream shift in poly(A) site utilization (Figure 6D and Figure 6—figure supplement 1). Thus, Pol II processivity per se does not influence poly(A) profiles, arguing that a decrease in the elongation rate is the cause of the upstream shift observed under diauxic growth conditions.

Discussion

Pol II elongation rate, not Pol II processivity, affects poly(A) site selection

Pol II elongation is mechanistically linked to post-transcriptional processes such as splicing, polyadenylation, chromatin modification, and mRNA localization. Moreover, yeast and metazoan cells containing Pol II derivatives with slow elongation rates show altered patterns of mRNA splicing and histone modifications throughout the transcribed regions. Here we use multiple Pol II derivatives with slow or fast elongation rates and nucleotide-level analysis to show that the rate of Pol II elongation has a dramatic influence on the pattern of polyadenylation (Figure 7).

Figure 7. — The 3’UTRs of speed-sensitive genes contain purine-rich elements (red line segments) and pyrimidine-rich elements (blue line segments) of varying strengths (small, medium or large scissors). Under normal conditions (exponentially-growing wild-type cells), cleavage and polyadenylation takes place predominantly at pyrimidine-rich elements. In diauxic conditions and in cells harboring slow Pol II, purine-rich elements drive an upstream shift in polyadenylation patterns, likely due to increased Pol II dwell time at those sequences. Conversely, fast Pol II shifts the poly(A) patterns to more distal purine rich sites.

Two different slow Pol II derivatives cause a near-identical, transcriptome-wide, upstream shift in the relative use of known poly(A) sites, but they do not typically result in multiple novel poly(A) sites. The upstream shifts are observed in the majority of yeast genes, but some genes are unaffected by these Pol II derivatives. In contrast to the slow Pol II derivatives, two fast Pol II derivatives confer a downstream shift in poly(A) site preference. These downstream shifts seen with the fast Pol II derivatives are much subtler than the upstream shifts observed with the slow Pol II derivatives, in both the number of genes affected and the magnitude of the shifts. Nevertheless, the overlap between upstream- and downstream-shifted genes is far beyond what would be expected by chance, indicating that the poly(A) patterns of many genes are sensitive to both slow and fast Pol II elongation rates. Interestingly, purine-rich sequences flanking cleavage/polyadenylation sites are associated with Pol II genes that are sensitive to fast and/or slow Pol II elongation speed.

Pol II derivatives with slow elongation rates also have defects in Pol II processivity. Strains with hpr1 or spt4 deletions exhibit defects in Pol II processivity comparable to those in strains with slow Pol II derivatives, but they have normal elongation rates (Mason and Struhl, 2005). In these strains, poly(A) patterns are unaffected, indicating that Pol II elongation rate, not Pol II processivity, is the major determinant of poly(A) patterns in yeast.

Evidence that regulated polyadenylation during the diauxic shift is due to decreased elongation rate

Yeast cells undergoing the diauxic shift display a transcriptome-wide, upstream-shifted poly(A) pattern that is remarkably similar (though not identical) to the poly(A) patterns conferred by the two Pol II derivatives with slow elongation rates. Although we cannot directly measure the Pol II elongation rate under diauxic conditions, Pol II under these conditions is disproportionately found in promoter regions, a property linked to slow Pol II elongation rate (Mason and Struhl, 2005; Fong et al., 2017). Importantly, the degree of promoter bias in the Pol II distribution is strongly correlated with the magnitude of the upstream shift. Taken together, our results suggest that upstream-shifted polyadenylation during the diauxic shift is due to a decreased Pol II elongation rate under these conditions.

It is formally possible that the upstream shift in diauxic conditions is due to changes in the biological activity or expression level of a 3’ mRNA processing factor. However, this is highly unlikely, because any diauxic-shift-induced alteration in a 3’ processing factor would have to result in a poly(A) pattern virtually identical to those of two different slow Pol II mutants over thousands of genes. Furthermore, this explanation does not account for why there is such a pronounced Pol II processivity defect, a hallmark of reduced Pol II speed.

The presumed decrease in Pol II elongation rate under diauxic conditions could be due to the physiological state of the cells and/or modification of Pol II (or an associated elongation factor). Cellular stress or slow growth alone would be unlikely to cause the upstream shift, because other stressful conditions, including those that reduce the growth rate, do not affect the poly(A) pattern (Figure 1). However, limitation of a specific nutrient(s) or oxygen could affect the Pol II elongation rate. In this regard, the Pol II elongation rate is reduced in cells treated with 6-azauracil or mycophenolic acid (Mason and Struhl, 2005), conditions that reduce intracellular levels of GTP and UTP and hence substrates for transcription.

Mechanistic implications about regulation of alternative polyadenylation

There are many examples of alternative polyadenylation regulation in response to environmental stress or developmental conditions (Flavell et al., 2008; Sandberg et al., 2008; Ji et al., 2009; Mayr and Bartel, 2009). A variety of experiments suggest that such regulation of mRNA 3’ end formation involves components of the cleavage/polyadenylation machinery and RNA-binding proteins (Elkon et al., 2013; Tian and Manley, 2017). This regulation could occur either by altered expression of such components and/or modification that changes their activity. Our work demonstrates that control of the Pol II elongation rate is an alternative mechanism for regulating alternative polyadenylation in response to physiological conditions. Regulation by Pol II elongation rate is not mutually exclusive with regulation of the cleavage/polyadenylation machinery, and indeed both mechanisms could operate under a given physiological condition. It is currently unknown, and hence would be interesting, to examine Pol II elongation rates under situations in which alternative polyadenylation is regulated.

Materials and methods

Strains

Mutations in RPO21 and precise ORF deletions of HPR1 and SPT4 were introduced into the JGY2000 strain (MATa, his3∆0, leu2∆0, met15∆0, ura3∆0, rpb1::RPB1–FRB, rpl13::RPL13–FK512) (Geisberg et al., 2014) by CRISPR, using derivatives of pML104 (Laughery et al., 2015) to supply Cas9 and guide RNA. All strains were confirmed by PCR and Sanger sequencing.

Strain	RPO21 Allele	Other
JGY2000	RPO21-FRB
JZY5	RPO21-H1085Q-FRB
JZY6	RPO21-F1086S-FRB
JZY14	RPO21-L1101S-FRB
JZY15	RPO21-E1103G-FRB
JZY27	RPO21-FRB	spt4∆
JZY33	RPO21-FRB	hpr1∆

Open in a new tab

RNA analysis

Except for the diauxic condition JGY2000, all strains were grown in 50 ml of media (see below) to OD₆₀₀ = 0.3–0.4 at 30° C. JGY2000, JZY5, JZY6, JZY14, JZY15, JZY27, and JZY33 were grown in YPD. JGY2000 was also grown in YP medium containing 2% Galactose (‘YPGal’), osmotic stress conditions (YPD + 1M sorbitol;'Sorbitol’) and nutrient poor minimal medium (2% dextrose, yeast nitrogen base with ammonium sulfate and without amino acids supplemented with uracil and essential amino acids; 'MM'). Diauxic conditions were achieved by first growing JGY2000 at 30°C in 50 ml YPD to an OD₆₀₀ = 0.3–0.4 (~24 hr) and then allowing the cells to grow in the same medium for an additional 48 hr (~72 hr total growth time and a final OD₆₀₀ = 3.0). Total RNA was isolated and purified from 15 to 20 ml of cells (10 ml for the diauxic condition) using the hot acid phenol method followed by QIAGEN RNeasy as described (Moqtaderi et al., 2018). 3’ READS was performed with 25 ug of purified total RNA with 17 cycles of amplification (Jin et al., 2015). Barcoded libraries were quantified on an Agilent Bioanalyzer 2100, pooled, and sequenced on the Illumina NextSeq 500 platform.

Chromatin immunoprecipitation

Whole-cell lysates from 30 ml of formaldehyde-treated cells were prepared as described (Geisberg et al., 2014). 150 µl of extracts were diluted to a total volume of 950 µl with FA lysis buffer (Aparicio et al., 2004) and immunoprecipitated with 10 µl of 8WG16 antibody (Biolegend #664912) for 2 hr at room temperature. Protein-DNA complexes were then incubated for an additional 2 hr with 50 µl of 50% (v:v in FA lysis buffer) protein A-Sepharose. Beads bound with Pol II-DNA were washed and eluted as described (Aparicio et al., 2004). Pol II binding occupancy was assayed by real-time qPCR (Geisberg et al., 2014) with oligonucleotides specific to either promoter regions or coding sequences of selected genes (see oligo table below).

Gene	Location	Position relative to ATG	Sequence
HSP82	Promoter	−202	5'-TGGTTTTATGAGCGGTTAATTC-3'
HSP82	Promoter	−79	5'-GGGAAGAAATGAGGAGGTC-3'
HSP82	ORF	2022	5'-GGGTTTGAACATTGATGAGG -3'
HSP82	ORF	2146	5'-GGCCATGATGTTCTACCTAA-3'
HSC82	Promoter	−120	5'-GAACTGCCTACCGTAAGTG-3'
HSC82	Promoter	−27	5'-GGTTCTGTAGCGTTTCAAGA-3'
HSC82	ORF	1931	5'-AGACCGCTTTGTTGACTTC-3'
HSC82	ORF	2048	5'-GCGGTTTCTGTTTCTTCATC-3'
URA2	Promoter	−177	5'-ATAGAGATCTTCATGGCACG-3'
URA2	Promoter	−53	5'-AGTTATGGATTTCTATCGTCGT-3'
URA2	ORF	2029	5'-GTAGCCCCATCTCAAACTTT-3'
URA2	ORF	2124	5'-ACATTCACCAACAACACCTA-3'
ADE3	Promoter	−141	5'-CATTATATACGCGCTCTCCA-3'
ADE3	Promoter	−20	5'-AAGTTGTGTTCGTCTCGTTA-3'
ADE3	ORF	1951	5'-GCCTCTTCTGTTATTGCTGA-3'
ADE3	ORF	2075	5'-AATCTTTCACCACCCATAGT-3'
FKS1	Promoter	−131	5'-TGTAGTTTGTGAGAAGGAGAAA-3'
FKS1	Promoter	–7	5'-CCGTTGTATGAAAGACTTGATT-3'
FKS1	ORF	1939	5'-CCAATTAGAATTTTGTCCACCA-3'
FKS1	ORF	2047	5'-TAGCGATAACCAAACCTAAGAC-3'
QDR3	Promoter	−111	5'-TAATAGCTGTGTCCTTGTATCC-3'
QDR3	Promoter	3	5'-CATGTTTATCGCTTTCTGACTT-3'
QDR3	ORF	1920	5'-CATGTTAAACGGTATGGGAAC-3'
QDR3	ORF	2044	5'-GTAAATCGTAGTTCTCTCTCCA-3'
SEC15	Promoter	−75	5'-AATTAATACCTTTAACGAGCGT-3'
SEC15	Promoter	47	5'-ACCTGCTGAAAATCTTTTGAAA-3'
SEC15	ORF	1950	5'-GGAAATACGGTTATCCTCGATA-3'
SEC15	ORF	2072	5'-TGCCAGTCAATTTCAATAGTTT-3'
NAB6	Promoter	−145	5'-CATCCAGAGAAGATATCCCAAA-3'
NAB6	Promoter	−31	5'-GGATTCTTGCGAGTCTTGTT-3'
NAB6	ORF	1942	5'-TCAGACATAGGCAATAGAACAA-3'
NAB6	ORF	2051	5'-ATGTACTTAATGCTCTGAAGGA-3'
CHS5	Promoter	−113	5'-CCCTTCAAGTTCTCCTTTCTAA-3'
CHS5	Promoter	11	5'-ACTGAAGACATTATTCGCTACT-3'
CHS5	ORF	1921	5'-GTTTTGTCCACTAAAGAAGCTA-3'
CHS5	ORF	2035	5'-CATTGAAGGCATCCATTAATCA-3'
KAR2	Promoter	−80	5'-TCTAAAGATTAACGTGTTACTGT-3'
KAR2	Promoter	3	5'-CATGGTATGTTTGATACGCTTT-3'
KAR2	ORF	1930	5'-AAGGTCGCTTATCCAATTACTT-3'
KAR2	ORF	2027	5'-TAATCACCATCGTCATCTTCAT-3'
USA1	Promoter	−83	5'-TGACGTACTTCAGATAAACACT-3'
USA1	Promoter	17	5'-GCTAGATATTCAGACATGTTGC-3'
USA1	ORF	1930	5'-CAAAGGCTATCGGTCTATTCTA-3'
USA1	ORF	2025	5'-CGATAGCACCTTGATAAATAGC-3'
SLA1	Promoter	−112	5'-CAGAACGAATATTTAGCGCATA-3'
SLA1	Promoter	9	5'-CACAGTCATACTCTAGCTCTTT-3'
SLA1	ORF	1998	5'-TGATGTAAGCAATTGTCAAAGA-3'
SLA1	ORF	2085	5'-CATTGAGTTATTGATGTCAGGC-3'
SET1	Promoter	−94	5'-CTGTTAGCAACCCTCAACTTA-3'
SET1	Promoter	9	5'-ATTTGACATTCTCTAAACGCAG-3'
SET1	ORF	1938	5'-ACATTTACTGAACGAAGAAACC-3'
SET1	ORF	2035	5'-TTTCGTCTTCTTCATCATGTTC-3'
HSF1	Promoter	−91	5'-ATAAAGGCAAAGAGTTAGAGGT-3'
HSF1	Promoter	33	5'-ATTGGTCGTCCCTGTATTTG-3'
HSF1	ORF	1908	5'-TATAGACGAACAAGATGCAAGA-3'
HSF1	ORF	2021	5'-GAATTAGTGTTTGTCGAGGAAG-3'

Open in a new tab

Data analysis

We processed sequencing data essentially as described previously (Moqtaderi et al., 2018), mostly using Python 3 (www.python.org). After separating sequence reads from multiplexed libraries by barcode into output from individual samples, we removed adapter sequences from read ends and discarded reads with ambiguous bases and reads not starting with a T (corresponding to an A at the mRNA 3’ end, potentially from polyadenylation). We counted and deleted consecutive Ts at the beginning of each read, saving the number of initial Ts for reference by appending it to the read ID. We mapped the first 17 nt of remaining sequence for each read to the Saccharomyces cerevisiae genome (version Sac cer3) using Bowtie [Langmead et al., 2009], allowing no mismatches and excluding non-unique matches. To ensure that we were working with post-transcriptionally adenylated RNA, we examined the genomic sequence immediately downstream of each mapped read, keeping only those reads for which the initial T count exceeded the number of consecutive As in the adjacent genomic sequence. Lastly, we scaled the remaining mapped reads for each replicate to a total of 25 million.

End zone profiles, important parameters, and definitions

We assigned reads to a gene if they mapped within the 400 nt 3’ UTR window downstream of its ORF. For each sample, we tabulated mRNA 3’ isoform endpoint frequencies for all non-A positions within the first 400 nt of each 3’UTR. These isoform endpoint positions are numbered relative to the end of the associated ORF; for example, position 100 refers to the position 100 nt after the stop codon. We limited most of our analyses to the 2790 genes with ≥1000 normalized reads (combined from both biological replicates) in each of the 11 conditions/strains described in this work.

We constructed end zone profiles by setting the maximally expressed isoform (max isoform) in each gene’s 3’UTR to 100 percent and linearly scaling expression levels of all other isoforms for that gene relative to this maximal value. The overall pattern of isoform expression over the 3’UTR constitutes a gene’s end zone profile. ‘Major isoforms’ are any isoforms with expression levels equaling or exceeding 5% relative to the max isoform. The ‘major end zone’ comprises the region between the 5’-most and 3’-most major isoforms; the ‘major end zone span’ is its length. The ‘weighted average isoform endpoint’ for a gene is computed by adding up the endpoint positions (relative to the ORF end) of all reads mapping to its 3’ UTR and dividing the result by the total number of reads summed.

Percent coordinate usage analysis

For the 2790 genes analyzed in Figure 1—figure supplement 1D, Figure 2—figure supplement 1, Figure 4—figure supplement 1A, and Figure 6—figure supplement 1, we first calculated the total number of all non-A positions at each of the 400 locations (+1 to +400) of the 3’ UTR. We then tabulated the total number of genes that had non-zero reads at each position within the 3’ UTR. Finally, at each location in the 3’UTR, we divided the total number of genes with non-zero reads at that location by the total number of all non-A positions at the same coordinate and multiplied the resulting fraction by 100 to obtain the percentage of genes with reads at each coordinate.

Correlations of biological replicates

We assessed the reproducibility of biological replicates in several ways. First, we compared the total expression by gene between replicates. For each gene, we obtained the total expression level by summing all reads mapping anywhere within the first 400 nt after the ORF. For each of our 11 experimental conditions/strains, we computed the Pearson correlation of total gene expression at a minimum of 5000 genes across two biological replicates (Figure 1—figure supplement 1A, panel 1). Second, we compared the expression of 25,000–75,000 individual 3’ isoforms genome-wide across replicates of the same 11 conditions/strains (Figure 1—figure supplement 1A, panel 2), omitting isoforms with fewer than 10 reads. Third, we compared end zone profiles of individual genes between biological replicates. For this, we analyzed the 2790 genes whose combined expression in both biological replicates was ≥1000 normalized reads in all 11 conditions/strains. For each gene, we computed the Pearson R coefficient across biological replicates by correlating scaled read counts by position for the entire 400 nt 3’UTR (Figure 1—figure supplement 1, panel 3). Combined 3’ READS data from both biological replicates of exponentially growing JGY2000 compare favorably to our previously published no-DMS control dataset for DREADS, a closely-related assay that captures structural information on individual mRNA 3’UTR isoforms (Moqtaderi et al., 2018; data not shown).

Classification of genes by sensitivity to pol II elongation rate perturbations

For each of the slow or fast Pol II strains (JZY5, JZY6, JZY14, and JZY15), we constructed an error model to identify and classify genes whose end zone profiles are significantly shifted (either upstream or downstream) due to changes in the Pol II elongation rate. First, we computed individual percentile coordinates (10%, 25%, 50%, 75% and 90%) for all 2790 genes in each biological replicate for all the Pol II mutant strains and the exponentially growing JGY2000. These coordinates represent 3’UTR locations at which the indicated percentage of total reads occurs upstream of (and including) the calculated coordinate.

For every gene in each strain, we then subtracted the individual percentile coordinates in each biological replicate from each other to obtain raw error values at all five percentile coordinates. Individual raw error values from the four Pol II elongation rate strains were then separately averaged with the corresponding raw error values from the exponentially growing JGY2000 dataset, and the rounded absolute values of those measurements were termed either the 10^th-, 25^th-, 50^th-, 75^th-, or 90^th-percentile errors. The frequency distributions of the 10^th-, 25^th-, 50^th-, 75^th-, and 90^th-percentile errors for JZY4, JZY5, JZY14, and JZY15 (2790 values/percentile for each strain) were tabulated after dividing all non-zero errors by two in order to account for the fact that the error could be either positive or negative.

Using the 10^th percentile parameter as an example, cumulative probabilities at each error value x were calculated by the following equation,

P (x) = \frac{\sum_{i = x}^{M a x (x)} f (i)}{2,790}

where Max(x) represents the maximum observed 10^th-percentile error value and f(i) is the frequency of the error i in the distribution. Therefore, the probability that an experimentally observed net shift of magnitude |k| in the 10^th percentile coordinate is due to pure chance is given by P(|k|). In cases where |k| > Max(x), P(|k|) was assigned the lowest non-zero probability of 3.58 × 10⁻⁴, which equals to 1/2,790. Cumulative probabilities were calculated for the remaining (25^th, 50^th, 75^th, and 90^th) percentile categories as described above.

Experimental net shift values (k) were calculated as follows. Using JZY5 as an example (the same methodology applies to JZY6, JZY14, and JZY15), we (1) subtracted the individual percentile coordinates (see above) at every gene in each biological replicate of JZY5 from their corresponding values in the biological replicates of exponentially growing JGY2000, (2) divided the values by two and (3) rounded the difference to the nearest whole number to obtain the raw shift values. From raw shift values we then subtracted the corresponding raw error values (see above) to obtain net shift values (k). k was set to zero for all cases where the raw error value was greater than the corresponding raw shift value.

The five net shift values (per gene) represent error-subtracted measures of poly(A) position shifts at the indicated percentiles. Negative k values represent upstream shifts in poly(A) usage of JZY5 relative to exponentially-growing JGY2000. Conversely, positive k values reflect greater downstream poly(A) utilization at the indicated percentile categories in JZY5 versus the strain with the normal Pol II elongation rate (exponentially-growing JGY2000).

For each |k|, a probability P(k) was computed based on the error model (see above). A single-gene probability value P(g) was computed by multiplying all the individual P(k) probabilities at the five percentile categories by one another and then by five in order to correct for multiple hypotheses (Dunn, 1961).

For each gene, we calculated two additional parameters: a cumulative net shift and the net number of positions shifted. The former parameter represents the sum of all net shifts (Σk) at the five percentile coordinates. A positive cumulative net shift represents an overall downstream shift in JZY5 poly(A) sites, while a negative number implies a greater prevalence of shorter 3’UTR isoforms in JZY5 relative to exponentially growing JGY2000. The net number of positions shifted is a measure of the total number (as well as the direction of the shift; see below) of the five percentile coordinates that had a non-zero net shift. It was computed by assigning each of the five gene-specific percentile coordinates a value of either −1 (representing a net upstream shift, or negative k value, at that position), +1 (implying a net downstream shift, or a positive k value, at that position), or 0 (no net shift). The sum of the five values for each gene is the net number of shifted positions. A negative net number of positions shifted means that the overall shift in JZY5 poly(A) sites is more likely to be upstream, while a positive net number of positions shifted indicates a distal poly(A) shift in JZY5 relative to exponentially growing JGY2000. This parameter, along with the cumulative overall shift, is especially helpful in assigning genes to specific categories in more complex cases where some of the k values point to shifts in opposite directions.

Initial classification of poly(A) shift direction by gene

For each of the four Pol II strains (JZY5-6, JZY14-15), we first assigned the 2790 genes into one of three categories based on the comparison of each gene’s end zone profile in the Pol II mutant strain to its profile in the exponentially growing JGY2000. The three categories consisted of (1) genes whose poly(A) sites were shifted upstream in a given Pol II mutant strain relative to exponentially growing JGY2000 (‘upshifted’), (2) genes whose end zone profiles in the given mutant Pol II strain were shifted downstream relative to exponentially growing JGY2000 (‘downshifted’), and (3) genes whose end zone profiles were not classified as either upstream or downstream (‘other’). Genes were categorized as upshifted if they met the all of following criteria: (a) a negative cumulative net shift, (b) a negative net number of positions shifted and (c) a P(g) value <0.01. In order to be categorized as downshifted, a gene had to possess (a) a positive cumulative net shift, (b) a positive net number of positions shifted and (c) a P(g) value <0.01.

Combined classification of poly(A) shift behavior by gene across multiple strains

We classified each of the 2790 genes by combining data from the individual, strain-specific categorization (see above) into one of four groups: ‘Upstream’, ‘Downstream’, ‘Neutral’ and ‘Other’. In the combined classification, a gene was classified as Upstream if it was upshifted in both Pol II elongation rate-defective strains (JZY5 and JZY6; see above), irrespective of its behavior in the two fast Pol II strains (JZY14 and JZY15). Similarly, a gene was classified as Downstream if it was downshifted in the each of the two Pol II strains with the fast elongation rate, without regard for its behavior in the slow Pol II strains. A gene was called Neutral if it was classified as ‘other’ in all four of the Pol II elongation rate mutant strains. All genes that didn’t fit any of the criteria above (304 out of 2,790; for example, genes which were upshifted or downshifted in only one strain, etc.) were classified as Other. Finally, we noticed that a large proportion of Downstream genes were also classified as Upstream (Figure 4), and we named this sub-category (which represents the intersection of the Upstream and Downstream groups) ‘Both’.

Conservation of endpoints

Calculations of probabilities that major isoform positions in JZY5, JZY6, JZY14, JZY15 and diauxic JGY2000 overlap with those in exponentially grown JGY2000 were conducted in identical, pairwise fashion. First, for each of the 2790 genes, we identified the portion of the 3’ UTR in which meaningful polyadenylation was observed in any of our 11 strains/conditions. This ‘combined major end zone’ is the union of the gene’s major end zones in every condition tested. Its 5’ boundary is the most ORF-proximal of all major isoforms observed in any of the 11 conditions. Similarly, the 3’ boundary is the most ORF-distal of all major isoforms found in the 11 conditions/strains.

For each gene (using JZY5 as an example), the cumulative probability P(q) for major isoform overlap between JZY5 and exponentially growing JGY2000 is given by the hypergeometric distribution.

I F α \geq β : P (q) = \sum_{i = c}^{β} \frac{(\begin{matrix} β \\ i \end{matrix}) (\begin{matrix} N - β \\ α - i \end{matrix})}{(\begin{matrix} N \\ α \end{matrix})} I F β \geq α : P (q) = \sum_{i = c}^{α} \frac{(\begin{matrix} α \\ i \end{matrix}) (\begin{matrix} N - α \\ β - i \end{matrix})}{(\begin{matrix} N \\ β \end{matrix})}

where N is the number of non-A positions in the combined major end zone, $α$ is the number of major isoforms in exponentially growing JGY2000, $β$ is the number of major isoforms in JZY5, and $c$ is the number of major isoform positions in common between the two strains. In calculations where the entire 3’UTR is assumed to be permissive for polyadenylation (i.e. major poly(A) isoforms are not limited to the combined major end zone window; Figure 3—figure supplement 1A), N is replaced by the number of non-A positions within each gene’s 400-nt 3’UTR. Probability calculations for all other strains listed above were performed exactly as described for JZY5.

Nucleotide frequency composition analysis

Nucleotide frequencies in exponentially-growing JGY2000, diauxic JGY2000, JZY5, and JZY14 were tabulated for max isoform positions. The −1 position refers to the last genomically-encoded nucleotide (i.e., the base immediately upstream of the cleavage/polyadenylation site) within each isoform. Therefore, all positive positions (i.e. positions to the right of the cleavage/polyadenylation site) are not encoded in the actual isoforms. Nucleotide frequencies were computed by summing up the number of A’s, C’s, G’s and U’s at each position within a category, dividing these numbers by the total number of genes within the category and multiplying the resulting fraction by 100. The ‘overall’ category consisted of 2790 genes, while the Upstream and Downstream categories contained 1898 and 605 genes, respectively. The ‘Both’ sub-category, consisting of genes whose end zones are shifted upstream in both slow Pol II mutant strains and shifted downstream in both fast Pol II strains, contained a total of 462 genes. Finally, the ‘Neutral’ category comprised 445 genes.

Acknowledgements

We thank Catherine Maddox for excellent technical assistance and Craig Kaplan for helpful advice on constructing the Pol II mutant strains. This work was supported by grants to KS from the National Institutes of Health (GM30186 and GM131801).

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Kevin Struhl, Email: kevin@hms.harvard.edu.

Eric J Wagner, University of Texas Medical Branch at Galveston, United States.

James L Manley, Columbia University, United States.

Funding Information

This paper was supported by the following grants:

National Institutes of Health GM 30186 to Joseph V Geisberg, Zarmik Moqtaderi, Kevin Struhl.
National Institutes of Health GM 131801 to Joseph V Geisberg, Zarmik Moqtaderi, Kevin Struhl.

Additional information

Competing interests

Senior editor, eLife.

No competing interests declared.

Author contributions

Conceptualization, Data curation, Software, Formal analysis, Validation, Investigation, Visualization, Methodology, Writing - original draft, Writing - review and editing.

Conceptualization, Formal analysis, Supervision, Funding acquisition, Writing - original draft, Project administration, Writing - review and editing.

Additional files

Transparent reporting form

elife-59810-transrepform.docx^{(245.6KB, docx)}

Data availability

Sequencing data has been deposited in GEO under accession code GSE151196.

The following dataset was generated:

Geisberg JV, Moqtaderi Z, Struhl K. 2020. The transcriptional elongation rate regulates alternative polyadenylation in yeast. NCBI Gene Expression Omnibus. GSE151196

References

Aparicio OM, Geisberg JV, Struhl K. Chromatin immunoprecipitation for determining the association of proteins with specific genomic sequences in vivo. Current Protocols in Molecular Biology. 2004;17:23. doi: 10.1002/0471143030.cb1707s23. [DOI] [PubMed] [Google Scholar]
Baejen C, Andreani J, Torkler P, Battaglia S, Schwalb B, Lidschreiber M, Maier KC, Boltendahl A, Rus P, Esslinger S, Söding J, Cramer P. Genome-wide analysis of RNA polymerase II termination at Protein-Coding genes. Molecular Cell. 2017;66:38–49. doi: 10.1016/j.molcel.2017.02.009. [DOI] [PubMed] [Google Scholar]
Baltz AG, Munschauer M, Schwanhäusser B, Vasile A, Murakawa Y, Schueler M, Youngs N, Penfold-Brown D, Drew K, Milek M, Wyler E, Bonneau R, Selbach M, Dieterich C, Landthaler M. The mRNA-bound proteome and its global occupancy profile on protein-coding transcripts. Molecular Cell. 2012;46:674–690. doi: 10.1016/j.molcel.2012.05.021. [DOI] [PubMed] [Google Scholar]
Barnes CO, Calero M, Malik I, Graham BW, Spahr H, Lin G, Cohen AE, Brown IS, Zhang Q, Pullara F, Trakselis MA, Kaplan CD, Calero G. Crystal structure of a transcribing RNA polymerase II complex reveals a complete transcription bubble. Molecular Cell. 2015;59:258–269. doi: 10.1016/j.molcel.2015.06.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bartel DP. MicroRNAs: target recognition and regulatory functions. Cell. 2009;136:215–233. doi: 10.1016/j.cell.2009.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bentley DL. Coupling mRNA processing with transcription in time and space. Nature Reviews Genetics. 2014;15:163–175. doi: 10.1038/nrg3662. [DOI] [PMC free article] [PubMed] [Google Scholar]
Berkovits BD, Mayr C. Alternative 3' UTRs act as scaffolds to regulate membrane protein localization. Nature. 2015;522:363–367. doi: 10.1038/nature14321. [DOI] [PMC free article] [PubMed] [Google Scholar]
Braberg H, Jin H, Moehle EA, Chan YA, Wang S, Shales M, Benschop JJ, Morris JH, Qiu C, Hu F, Tang LK, Fraser JS, Holstege FC, Hieter P, Guthrie C, Kaplan CD, Krogan NJ. From structure to systems: high-resolution, quantitative genetic analysis of RNA polymerase II. Cell. 2013;154:775–788. doi: 10.1016/j.cell.2013.07.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
de la Mata M, Alonso CR, Kadener S, Fededa JP, Blaustein M, Pelisch F, Cramer P, Bentley D, Kornblihtt AR. A slow RNA polymerase II affects alternative splicing in vivo. Molecular Cell. 2003;12:525–532. doi: 10.1016/j.molcel.2003.08.001. [DOI] [PubMed] [Google Scholar]
Dujardin G, Lafaille C, de la Mata M, Marasco LE, Muñoz MJ, Le Jossic-Corcos C, Corcos L, Kornblihtt AR. How slow RNA polymerase II elongation favors alternative exon skipping. Molecular Cell. 2014;54:683–690. doi: 10.1016/j.molcel.2014.03.044. [DOI] [PubMed] [Google Scholar]
Dunn OJ. Multiple comparisons among means. Journal of the American Statistical Association. 1961;56:52–64. doi: 10.1080/01621459.1961.10482090. [DOI] [Google Scholar]
Elkon R, Ugalde AP, Agami R. Alternative cleavage and polyadenylation: extent, regulation and function. Nature Reviews Genetics. 2013;14:496–506. doi: 10.1038/nrg3482. [DOI] [PubMed] [Google Scholar]
Flavell SW, Kim TK, Gray JM, Harmin DA, Hemberg M, Hong EJ, Markenscoff-Papadimitriou E, Bear DM, Greenberg ME. Genome-wide analysis of MEF2 transcriptional program reveals synaptic target genes and neuronal activity-dependent polyadenylation site selection. Neuron. 2008;60:1022–1038. doi: 10.1016/j.neuron.2008.11.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
Floor SN, Doudna JA. Tunable protein synthesis by transcript isoforms in human cells. eLife. 2016;5:e10921. doi: 10.7554/eLife.10921. [DOI] [PMC free article] [PubMed] [Google Scholar]
Fong N, Brannan K, Erickson B, Kim H, Cortazar MA, Sheridan RM, Nguyen T, Karp S, Bentley DL. Effects of transcription elongation rate and Xrn2 exonuclease activity on RNA polymerase II termination suggest widespread kinetic competition. Molecular Cell. 2015;60:256–267. doi: 10.1016/j.molcel.2015.09.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
Fong N, Saldi T, Sheridan RM, Cortazar MA, Bentley DL. RNA pol II dynamics modulate Co-transcriptional chromatin modification, CTD phosphorylation, and transcriptional direction. Molecular Cell. 2017;66:546–557. doi: 10.1016/j.molcel.2017.04.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
Freeberg MA, Han T, Moresco JJ, Kong A, Yang YC, Lu ZJ, Yates JR, Kim JK. Pervasive and dynamic protein binding sites of the mRNA transcriptome in Saccharomyces cerevisiae. Genome Biology. 2013;14:R13. doi: 10.1186/gb-2013-14-2-r13. [DOI] [PMC free article] [PubMed] [Google Scholar]
Galdieri L, Mehrotra S, Yu S, Vancura A. Transcriptional regulation in yeast during diauxic shift and stationary phase. OMICS. 2010;14:629–638. doi: 10.1089/omi.2010.0069. [DOI] [PMC free article] [PubMed] [Google Scholar]
Geisberg JV, Moqtaderi Z, Fan X, Ozsolak F, Struhl K. Global analysis of mRNA isoform half-lives reveals stabilizing and destabilizing elements in yeast. Cell. 2014;156:812–824. doi: 10.1016/j.cell.2013.12.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
Graber JH, Nazeer FI, Yeh PC, Kuehner JN, Borikar S, Hoskinson D, Moore CL. DNA damage induces targeted, genome-wide variation of poly(A) sites in budding yeast. Genome Research. 2013;23:1690–1703. doi: 10.1101/gr.144964.112. [DOI] [PMC free article] [PubMed] [Google Scholar]
Gruber AJ, Zavolan M. Alternative cleavage and polyadenylation in health and disease. Nature Reviews Genetics. 2019;20:599–614. doi: 10.1038/s41576-019-0145-z. [DOI] [PubMed] [Google Scholar]
Gupta I, Clauder-Münster S, Klaus B, Järvelin AI, Aiyar RS, Benes V, Wilkening S, Huber W, Pelechano V, Steinmetz LM. Alternative Polyadenylation diversifies post-transcriptional regulation by selective RNA-protein interactions. Molecular Systems Biology. 2014;10:719. doi: 10.1002/msb.135068. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ji Z, Lee JY, Pan Z, Jiang B, Tian B. Progressive lengthening of 3' untranslated regions of mRNAs by alternative polyadenylation during mouse embryonic development. PNAS. 2009;106:7028–7033. doi: 10.1073/pnas.0900028106. [DOI] [PMC free article] [PubMed] [Google Scholar]
Jin Y, Geisberg JV, Moqtaderi Z, Ji Z, Hoque M, Tian B, Struhl K. Mapping 3′ mRNA Isoforms on a Genomic Scale. Current Protocols in Molecular Biology. 2015;110:1–17. doi: 10.1002/0471142727.mb0423s110. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kamieniarz-Gdula K, Gdula MR, Panser K, Nojima T, Monks J, Wiśniewski JR, Riepsaame J, Brockdorff N, Pauli A, Proudfoot NJ. Selective roles of vertebrate PCF11 in premature and Full-Length transcript termination. Molecular Cell. 2019;74:158–172. doi: 10.1016/j.molcel.2019.01.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kaplan CD, Jin H, Zhang IL, Belyanin A. Dissection of pol II trigger loop function and pol II activity-dependent control of start site selection in vivo. PLOS Genetics. 2012;8:e1002627. doi: 10.1371/journal.pgen.1002627. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kaplan CD. Basic mechanisms of RNA polymerase II activity and alteration of gene expression in Saccharomyces cerevisiae. Biochimica Et Biophysica Acta (BBA) - Gene Regulatory Mechanisms. 2013;1829:39–54. doi: 10.1016/j.bbagrm.2012.09.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kim M, Krogan NJ, Vasiljeva L, Rando OJ, Nedea E, Greenblatt JF, Buratowski S. The yeast Rat1 exonuclease promotes transcription termination by RNA polymerase II. Nature. 2004;432:517–522. doi: 10.1038/nature03041. [DOI] [PubMed] [Google Scholar]
Krogan NJ, Dover J, Wood A, Schneider J, Heidt J, Boateng MA, Dean K, Ryan OW, Golshani A, Johnston M, Greenblatt JF, Shilatifard A. The Paf1 complex is required for histone H3 methylation by COMPASS and Dot1p: linking transcriptional elongation to histone methylation. Molecular Cell. 2003;11:721–729. doi: 10.1016/S1097-2765(03)00091-1. [DOI] [PubMed] [Google Scholar]
Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biology. 2009;10:R25. doi: 10.1186/gb-2009-10-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]
Laughery MF, Hunter T, Brown A, Hoopes J, Ostbye T, Shumaker T, Wyrick JJ. New vectors for simple and streamlined CRISPR-Cas9 genome editing in Saccharomyces cerevisiae. Yeast. 2015;32:711–720. doi: 10.1002/yea.3098. [DOI] [PMC free article] [PubMed] [Google Scholar]
Lei EP, Krebber H, Silver PA. Messenger RNAs are recruited for nuclear export during transcription. Genes & Development. 2001;15:1771–1782. doi: 10.1101/gad.892401. [DOI] [PMC free article] [PubMed] [Google Scholar]
Li J, Lu X. The emerging roles of 3' untranslated regions in Cancer. Cancer Letters. 2013;337:22–25. doi: 10.1016/j.canlet.2013.05.034. [DOI] [PubMed] [Google Scholar]
Liu X, Freitas J, Zheng D, Oliveira MS, Hoque M, Martins T, Henriques T, Tian B, Moreira A. Transcription elongation rate has a tissue-specific impact on alternative cleavage and polyadenylation in Drosophila melanogaster. RNA. 2017a;23:1807–1816. doi: 10.1261/rna.062661.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
Liu X, Hoque M, Larochelle M, Lemay JF, Yurko N, Manley JL, Bachand F, Tian B. Comparative analysis of alternative polyadenylation in S. cerevisiae and S. pombe. Genome Research. 2017b;27:1685–1695. doi: 10.1101/gr.222331.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
Masamha CP, Xia Z, Yang J, Albrecht TR, Li M, Shyu AB, Li W, Wagner EJ. CFIm25 links alternative polyadenylation to glioblastoma tumour suppression. Nature. 2014;510:412–416. doi: 10.1038/nature13261. [DOI] [PMC free article] [PubMed] [Google Scholar]
Mason PB, Struhl K. Distinction and relationship between elongation rate and processivity of RNA polymerase II in vivo. Molecular Cell. 2005;17:831–840. doi: 10.1016/j.molcel.2005.02.017. [DOI] [PubMed] [Google Scholar]
Mayr C. Evolution and biological roles of alternative 3'UTRs. Trends in Cell Biology. 2016;26:227–237. doi: 10.1016/j.tcb.2015.10.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
Mayr C, Bartel DP. Widespread shortening of 3'UTRs by alternative cleavage and polyadenylation activates oncogenes in cancer cells. Cell. 2009;138:673–684. doi: 10.1016/j.cell.2009.06.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
Moqtaderi Z, Geisberg JV, Jin Y, Fan X, Struhl K. Species-specific factors mediate extensive heterogeneity of mRNA 3' ends in yeasts. PNAS. 2013;110:11073–11078. doi: 10.1073/pnas.1309384110. [DOI] [PMC free article] [PubMed] [Google Scholar]
Moqtaderi Z, Geisberg JV, Struhl K. Extensive structural differences of closely related 3' mRNA isoforms: links to Pab1 binding and mRNA stability. Molecular Cell. 2018;72:849–861. doi: 10.1016/j.molcel.2018.08.044. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ng HH, Robert F, Young RA, Struhl K. Targeted recruitment of Set1 histone methylase by elongating pol II provides a localized mark and memory of recent transcriptional activity. Molecular Cell. 2003;11:709–719. doi: 10.1016/s1097-2765(03)00092-3. [DOI] [PubMed] [Google Scholar]
Ogorodnikov A, Levin M, Tattikota S, Tokalov S, Hoque M, Scherzinger D, Marini F, Poetsch A, Binder H, Macher-Göppinger S, Probst HC, Tian B, Schaefer M, Lackner KJ, Westermann F, Danckwardt S. Transcriptome 3'end organization by PCF11 links alternative polyadenylation to formation and neuronal differentiation of neuroblastoma. Nature Communications. 2018;9:5331. doi: 10.1038/s41467-018-07580-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ozsolak F, Kapranov P, Foissac S, Kim SW, Fishilevich E, Monaghan AP, John B, Milos PM. Comprehensive polyadenylation site maps in yeast and human reveal pervasive alternative polyadenylation. Cell. 2010;143:1018–1029. doi: 10.1016/j.cell.2010.11.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
Pelechano V, Wei W, Steinmetz LM. Extensive transcriptional heterogeneity revealed by isoform profiling. Nature. 2013;497:127–131. doi: 10.1038/nature12121. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sandberg R, Neilson JR, Sarma A, Sharp PA, Burge CB. Proliferating cells express mRNAs with shortened 3' untranslated regions and fewer microRNA target sites. Science. 2008;320:1643–1647. doi: 10.1126/science.1155390. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sherstnev A, Duc C, Cole C, Zacharaki V, Hornyik C, Ozsolak F, Milos PM, Barton GJ, Simpson GG. Direct sequencing of Arabidopsis thaliana RNA reveals patterns of cleavage and polyadenylation. Nature Structural & Molecular Biology. 2012;19:845–852. doi: 10.1038/nsmb.2345. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sparks KA, Mayer SA, Dieckmann CL. Premature 3'-end formation of CBP1 mRNA results in the downregulation of cytochrome b mRNA during the induction of respiration in Saccharomyces cerevisiae. Molecular and Cellular Biology. 1997;17:4199–4207. doi: 10.1128/MCB.17.8.4199. [DOI] [PMC free article] [PubMed] [Google Scholar]
Sparks KA, Dieckmann CL. Regulation of poly(A) site choice of several yeast mRNAs. Nucleic Acids Research. 1998;26:4676–4687. doi: 10.1093/nar/26.20.4676. [DOI] [PMC free article] [PubMed] [Google Scholar]
Strässer K, Masuda S, Mason P, Pfannstiel J, Oppizzi M, Rodriguez-Navarro S, Rondón AG, Aguilera A, Struhl K, Reed R, Hurt E. TREX is a conserved complex coupling transcription with messenger RNA export. Nature. 2002;417:304–308. doi: 10.1038/nature746. [DOI] [PubMed] [Google Scholar]
Tian B, Manley JL. Alternative cleavage and polyadenylation: the long and short of it. Trends in Biochemical Sciences. 2013;38:312–320. doi: 10.1016/j.tibs.2013.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
Tian B, Manley JL. Alternative polyadenylation of mRNA precursors. Nature Reviews Molecular Cell Biology. 2017;18:18–30. doi: 10.1038/nrm.2016.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
Veloso A, Kirkconnell KS, Magnuson B, Biewen B, Paulsen MT, Wilson TE, Ljungman M. Rate of elongation by RNA polymerase II is associated with specific gene features and epigenetic modifications. Genome Research. 2014;24:896–905. doi: 10.1101/gr.171405.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
Vivier MA, Lambrechts MG, Pretorius IS. Coregulation of starch degradation and dimorphism in the yeast Saccharomyces cerevisiae. Critical Reviews in Biochemistry and Molecular Biology. 1997;32:405–435. doi: 10.3109/10409239709082675. [DOI] [PubMed] [Google Scholar]
Wallace EWJ, Beggs JD. Extremely fast and incredibly close: cotranscriptional splicing in budding yeast. RNA. 2017;23:601–610. doi: 10.1261/rna.060830.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
Wang D, Bushnell DA, Westover KD, Kaplan CD, Kornberg RD. Structural basis of transcription: role of the trigger loop in substrate specificity and catalysis. Cell. 2006;127:941–954. doi: 10.1016/j.cell.2006.11.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
Weill L, Belloc E, Bava FA, Méndez R. Translational control by changes in poly(A) tail length: recycling mRNAs. Nature Structural & Molecular Biology. 2012;19:577–585. doi: 10.1038/nsmb.2311. [DOI] [PubMed] [Google Scholar]

eLife. doi: 10.7554/eLife.59810.sa1

Decision letter

Editor: Eric J Wagner¹

Reviewed by: Nick J Proudfoot²

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Acceptance summary:

The authors have explored how a metabolic state change in budding yeast, known as the diauxic shift, impacts global cleavage and polyadenylation. Overall, this work reveals that environmental cues can broadly impact alternative polyadenylation that is likely manifested through alterations in Pol II elongation rates. This work is important because it reveals a previously unsuspected interplay between Pol II function and RNA processing.

Decision letter after peer review:

Thank you for submitting your article "The transcriptional elongation rate regulates alternative polyadenylation in yeast" for consideration by eLife. Your article has been reviewed by James Manley as the Senior Editor, a Reviewing Editor, and three reviewers. The following individuals involved in review of your submission have agreed to reveal their identity: Nick J Proudfoot (Reviewer #1).

The reviewers have discussed the reviews with one another and the Reviewing Editor has drafted this decision to help you prepare a revised submission.

Summary:

The authors have explored how a metabolic state change in budding yeast, known as the diauxic shift, impacts global cleavage and polyadenylation. In this research article, they determine that upon diauxic shift there is a broad change in poly(A) site usage to favor cleavage and polyadenylation sites more proximal to stop codons. The upstream shift of poly(A) site usage in the diauxic condition, interestingly, matches well with that observed in yeast expressing Pol II slow mutants. Because Hpr1 and Spt4 mutants, which have processivity issues, do not display such an upstream shift, the authors conclude that it is elongation rate, rather than processivity, that leads to the upstream shift in diauxic condition. Overall, this work reveals that environmental cues can broadly impact alternative polyadenylation that is likely manifested through alterations in Pol II elongation rates underscoring the interplay between Pol II function and RNA processing.

Below is a relatively short list of essential revisions that are believed to be addressable in a contained amount of time. The most prominent of those concerns is the need to provide stronger data indicating that Pol II elongation rate is indeed reduced during diauxic shift – a point central to the overall manuscript conclusion.

Essential revisions:

Experimental:

1) The Pol II fast mutants do not seem to lend much support to their model. Is this because of mitigation of alternative polyadenylation profile changes by RNA degradation? For example, if long isoforms are generated but are also preferentially degraded, the downstream shift may not be obvious. Also, the authors need to examine this possibility. The authors need to check Pol II fast mutants in diauxic condition to see if the upstream shift could be inhibited. This would greatly strengthen their conclusion. (This point is assuming that research has resumed.)

2) Figure 6 is a clear-cut data set showing that diauxic shift does indeed correlate with TSS proximal enhanced Pol II density over genes and that this is unaffected by loss of factors previously ascribed to processivity effects, rather than simply Pol II speed. However, some simple Pol II ChIPs here as well as bioinformatic Pol II occupancy measures would be helpful (Figure 6A and B).

Textual:

1) Figure 1 shows that diauxic shift as compared to simply different growth medium, switches many Pol II transcribed genes into usage of 5' located PAS. However, do these PAS switches affect mRNA levels of affected genes, i.e. does their gene expression change. Also does this correlate with gene ontology for changing the metabolic needs of the cell?

2) The bioinformatic data shown in Figure 3 that preferred PAS sites are non-random (presumably sequence specific) is somewhat over the top and that really these bioinformatic data are of less interest (maybe good for supplementary data?).

3) Improve the readability of Figure 5 as multiple reviewers found it hard to follow. A simple table of base composition would suffice rather than the hard-to-see graphical data across PAS sequences. Ideally the significance of a reduction in U (and C) richness correlating with PAS speed sensitivity needs to be tested by direct mutation? Also, they need to compare the same poly(A) sites that are upregulated or downregulated in diauxic condition vs. other conditions.

[Editors' note: further revisions were suggested prior to acceptance, as described below.]

Thank you for re-submitting your article "The transcriptional elongation rate regulates alternative polyadenylation in yeast" for consideration by eLife. Your revision has been re-reviewed by James Manley as the Senior Editor, a Reviewing Editor, and two reviewers. The reviewers have opted to remain anonymous.

There was a mixed response by the reviewers upon inspection of your revision. Ultimately, through discussion, a consensus was reached to request a revision that addresses their concerns. We stress that this revision need only be textual and agree with the reviewers that some aspects of the manuscript need additional clarification. Provided reasonable modifications to the text are made, we anticipate not sending this revision back to them. Rather than summarize their reviews, which are short, they are appended below.

Reviewer #2:

Geisberg et al. provided a forceful response to the previous review. Overall, while I am not fully convinced by the arguments presented, I am not in favor of prolonging the review, especially in the current difficult research situation caused by the pandemic. After all, the authors have done a solid work to generate substantial data that are useful to the community. I therefore suggest that the authors incorporate some of their response in the paper so that readers would not miss the true significance and novelty of this work. Specifically, they may want to emphasize (1) the true novelty of this work is the second theme, and (2) the features of speed-sensitive 3'UTRs. Regarding other potential explanations for poly(A) site usage in diauxic conditions, they may want to mention that alternation of some key 3' end processing factors may lead to similar changes.

Reviewer #3:

This is a revised manuscript. The editor and reviewers previously concluded that the authors needed "to provide stronger data indicating that Pol II elongation rate is indeed reduced during diauxic shift" to support the authors' conclusions, which the authors have not done. Nor have they adjusted the text of the manuscript to reflect this gap. Instead the authors argue in the rebuttal that the reviewers and editor confused two separate themes with theme one being that slow polymerase causes an upstream shift in poly(A) sites and theme 2 being that poly(A) sites shift upstream during diauxic shift. However, the manuscript continues to prominently suggest a mechanistic link between the two. The abstract literally ends with the bottom line that "Pol II elongation speed is important.…. for regulating poly(A) patterns in response to environmental conditions." This is not the only prominent occurrence where the authors draw very strong causal relationship (e.g. in subsection “Yeast cells containing Pol II derivatives with slow elongation rates show a poly(A) pattern that strikingly resembles the pattern in diauxic shift” where the authors say "strongly suggests a mechanistic relationship"; in subsections "Evidence that pol II elongation rate is decreased in diauxic conditions", “Pol II elongation rate, not processivity, is important for polyadenylation patterns” where the authors say "is the cause"; and in subsection “Evidence that regulated polyadenylation during the diauxic shift is due to decreased elongation rate” where the authors say "due to"). While a manuscript presenting two separate, but related, themes would be acceptable, that is not what the current manuscript does.

Most other comments have been successfully addressed.

eLife. 2020 Aug 26;9:e59810. doi: 10.7554/eLife.59810.sa2

Author response

Summary:

The authors have explored how a metabolic state change in budding yeast, known as the diauxic shift, impacts global cleavage and polyadenylation. In this research article, they determine that upon diauxic shift there is a broad change in poly(A) site usage to favor cleavage and polyadenylation sites more proximal to stop codons. The upstream shift of poly(A) site usage in the diauxic condition, interestingly, matches well with that observed in yeast expressing Pol II slow mutants. Because Hpr1 and Spt4 mutants, which have processivity issues, do not display such an upstream shift, the authors conclude that it is elongation rate, rather than processivity, that leads to the upstream shift in diauxic condition. Overall, this work reveals that environmental cues can broadly impact alternative polyadenylation that is likely manifested through alterations in Pol II elongation rates underscoring the interplay between Pol II function and RNA processing.

Below is a relatively short list of essential revisions that are believed to be addressable in a contained amount of time. The most prominent of those concerns is the need to provide stronger data indicating that Pol II elongation rate is indeed reduced during diauxic shift – a point central to the overall manuscript conclusion.

1) The paper has two major themes, but the reviewers have focused on the first theme, namely regulated polyadenylation in response to an environmental condition via Pol II speed. They largely overlooked the second theme, namely a detailed analysis of how Pol II speed affects polyadenylation. The few papers that investigate this second theme are much less advanced, typically using only a single slow Pol II derivative, not done at the nucleotide level (i.e. mRNA isoforms), and providing no information on the difference between speed-sensitive vs. speed-insensitive 3’ UTRs. As such, some of the results that the reviewers considered to be peripheral (which they are for theme 1) are critical for theme 2 and represent a significant advance over current knowledge. We previously considered dividing the paper into 2 back-to-back short papers, one for each theme. However, the two themes are both important and clearly related, which is why we presented them together.

2) Regarding the request to “provide stronger data indicating that Pol II elongation rate is indeed reduced during the diauxic shift,” we believe that our results and conclusions are compelling, even if below the level of formal proof (which we do not claim and which rarely happens in any paper). The poly(A) patterns under diauxic conditions and in two slow Pol II mutants are remarkably similar over thousands of genes. Diauxic conditions clearly show decreased Pol II processivity, a known feature of slow Pol II. Processivity per se is ruled out by the spt4 and hpr1 deletion experiments. I can’t think of anything plausible that could explain all these data other than our conclusion that Pol II elongation rate is reduced in diauxic conditions. If the reviewers can come up with a plausible alternative, we would be happy to mention it.

Furthermore, there are technical and conceptual problems in trying to “provide stronger data.” Technically, I don’t know how we can measure Pol II speed in diauxic conditions. We can’t use our glucose-shutoff method. Conceptually, even if we could measure speed, it would be impossible to disentangle a true effect on Pol II speed vs. a general slowdown of molecular processes in diauxic conditions, in which cells are barely growing. Trying to correct/normalize the effects of cell growth on molecular processes is a quagmire many have encountered and few have solved. The processivity experiments in the paper get around this, because it is the pattern that is affected, not the absolute rate.

Essential revisions:

Experimental:

1) The Pol II fast mutants do not seem to lend much support to their model. Is this because of mitigation of alternative polyadenylation profile changes by RNA degradation? For example, if long isoforms are generated but are also preferentially degraded, the downstream shift may not be obvious. Also, the authors need to examine this possibility. The authors need to check Pol II fast mutants in diauxic condition to see if the upstream shift could be inhibited. This would greatly strengthen their conclusion. (This point is assuming that research has resumed.)

The reviewers are correct that the fast Pol II mutants are largely irrelevant to the regulation in response to the diauxic shift (i.e. theme 1), and we did not use them at all for this purpose. As such, the other aspects of this comment are also irrelevant for theme 1. However, they are critical to theme 2, which was largely overlooked by the reviewers.

The suggestion that long isoforms in the fast Pol II mutants might be preferentially degraded is extremely unlikely based on what we and others have already published about mRNA decay in yeast (Geisberg et al., 2014; Gupta et al., 2014). Our previous work identified many hundreds of stabilizing and destabilizing elements within 3’UTRs. These elements can occur anywhere within a given 3’UTR, meaning that longer isoforms arising from a given gene can be either more or less stable than shorter isoforms. Furthermore, a completely independent series of experiments clearly demonstrated there was no correlation between 3’UTR length and stability (Gupta et al., 2014). While longer mRNA isoform destabilization had been reported (and varies in its extent) in cancer cells and other cell lines (Mayr and Bartel, 2009; Lin et al., 2012; Spies et al., 2013), the results from both studies described above are completely inconsistent with a general destabilization of longer 3’ isoforms. Furthermore, the key point of Figure 3 is that the same poly(A) sites are used in all Pol II derivatives, so mRNA stability of these isoforms will be the same in all strains. Nevertheless, in response to this comment, the revised paper now discusses issues related to mRNA stability.

The suggested experiment to check fast Pol II mutants under diauxic conditions would yield uninterpretable results. Pitting an upstream-shifting condition against a downstream-shifting condition merely asks which is more potent and provides no mechanistic information. Typically, such experiments give intermediate effects, which in the case here would be an upstream shift that is somewhat less pronounced than a wt strain under diauxic conditions (especially because the upstream shift in diauxic conditions is much stronger than the downstream shift caused by the fast Pol II mutant). However, the precise answer could be anything, and hence it would provide no information to strengthen any conclusion in the paper.

2) Figure 6 is a clear-cut data set showing that diauxic shift does indeed correlate with TSS proximal enhanced Pol II density over genes and that this is unaffected by loss of factors previously ascribed to processivity effects, rather than simply Pol II speed. However, some simple Pol II ChIPs here as well as bioinformatic Pol II occupancy measures would be helpful (Figure 6A and B).

We don’t understand this comment about doing Pol II ChIP experiments. Figure 6A is a Pol II ChIP experiment, with the results plotted with respect to the upstream shift in Figure 6B. Processivity and speed experiments for the Spt4 and Hpr1 mutants were published 15 years ago (Mason and Struhl, 2005), and they involved Pol II ChIP.

Textual:

1) Figure 1 shows that diauxic shift as compared to simply different growth medium, switches many Pol II transcribed genes into usage of 5' located PAS. However, do these PAS switches affect mRNA levels of affected genes, i.e. does their gene expression change. Also does this correlate with gene ontology for changing the metabolic needs of the cell?

Because the paper is concerned with poly(A) profiles, the results are normalized for each gene with the maximal isoform given a value of 100. The reviewers are correct that this presentation ignores the changes in expression level under the various conditions. As such, we now add a paragraph about this (the expression values are present in the Excel files in our GEO submission). As expected, whether or not a gene is regulated under a given conditions does not affect its poly(A) profile. Under all conditions except diauxic, the poly(A) profiles are the same even though expression levels of many genes are regulated. Under diauxic conditions, the vast majority of genes shift in the same manner, even though only a subset is regulated at the expression level.

2) The bioinformatic data shown in Figure 3 that preferred PAS sites are non-random (presumably sequence specific) is somewhat over the top and that really these bioinformatic data are of less interest (maybe good for supplementary data?).

Figure 3 makes the important point that upstream and downstream shifts involve the same poly(A) sites, just a rebalancing of how much they are used. While this might (or might not) have been expected, it is important to demonstrate, especially as it has mechanistic implications (e.g. see above response about suggested experiments). Moreover, it is a useful piece of information for theme 2, and in this regard, we are unaware of anyone looking at this issue at the nucleotide level on a transcriptome scale. As such, we believe that it merits a proper figure.

3) Improve the readability of Figure 5 as multiple reviewers found it hard to follow. A simple table of base composition would suffice rather than the hard-to-see graphical data across PAS sequences. Ideally the significance of a reduction in U (and C) richness correlating with PAS speed sensitivity needs to be tested by direct mutation? Also, they need to compare the same poly(A) sites that are upregulated or downregulated in diauxic condition vs. other conditions.

As requested, we simplified Figure 5 to show how speed-sensitive 3’UTRs differ in base composition from neutral 3’UTRs. Specifically, we showed this only for the wild-type strain, meaning only 2 lines for each nucleotide. This result is identical for all other comparisons including slow only and fast only Pol II, as well as the diauxic condition, and the data for all these other comparisons is now part of the supplemental figure associated with Figure 5. We think it important to keep the form of the original figure because the level of nucleotide preferences varies over the range examined, and the readers should see this instead of a Table that is a simple summary of preferences at an arbitrary location(s).

[Editors' note: further revisions were suggested prior to acceptance, as described below.]

(…)

Reviewer #2:

Geisberg et al. provided a forceful response to the previous review. Overall, while I am not fully convinced by the arguments presented, I am not in favor of prolonging the review, especially in the current difficult research situation caused by the pandemic. After all, the authors have done a solid work to generate substantial data that are useful to the community. I therefore suggest that the authors incorporate some of their response in the paper so that readers would not miss the true significance and novelty of this work. Specifically, they may want to emphasize on (1) the true novelty of this work is the second theme, and (2) the features of speed-sensitive 3'UTRs. Regarding other potential explanations for poly(A) site usage in diauxic conditions, they may want to mention that alternation of some key 3' end processing factors may lead to similar changes.

1) As requested, we now consider the suggestion that alteration of some 3’ processing factor might account for the poly(A) shift in diauxic conditions. We thank the reviewer for bringing up an alternative explanation to our conclusion, which is very helpful for readers of the paper. However, as discussed in a new paragraph, this suggested model is extremely unlikely for two reasons. First, this suggested model would require such a altered 3’ processing factor to cause a virtually identical poly(A) pattern to those of two different slow Pol II mutants over thousands of genes. Second, if the shift in diauxic conditions is due to a 3’ processing factor, why is there such a pronounced Pol II processivity defect, a hallmark of reduced Pol II speed? In addition and unbeknownst to the reviewers, we actually have looked at poly(A) patterns in numerous 3’ processing and transcriptional elongation factors, and we have never seen an upstream shift like those in diauxic and slow Pol II conditions. These data are unpublished and not fully analyzed, but this result is clear.

2) As requested, we have added text to the subsection “Pol II elongation rate, not Pol II processivity, affects poly(A) site selection” to further emphasize theme 2.

Reviewer #3:

This is a revised manuscript. The editor and reviewers previously concluded that the authors needed "to provide stronger data indicating that Pol II elongation rate is indeed reduced during diauxic shift" to support the authors' conclusions, which the authors have not done. Nor have they adjusted the text of the manuscript to reflect this gap. Instead the authors argue in the rebuttal that the reviewers and editor confused two separate themes with theme one being that slow polymerase causes an upstream shift in poly(A) sites and theme 2 being that poly(A) sites shift upstream during diauxic shift. However, the manuscript continues to prominently suggest a mechanistic link between the two. The abstract literally ends with the bottom line that "Pol II elongation speed is important.…. for regulating poly(A) patterns in response to environmental conditions." This is not the only prominent occurrence where the authors draw very strong causal relationship (e.g. in subsection “Yeast cells containing Pol II derivatives with slow elongation rates show a poly(A) pattern that strikingly resembles the pattern in diauxic shift” where the authors say "strongly suggests a mechanistic relationship"; in subsections "Evidence that pol II elongation rate is decreased in diauxic conditions", “Pol II elongation rate, not processivity, is important for polyadenylation patterns” where the authors say "is the cause"; and in subsection “Evidence that regulated polyadenylation during the diauxic shift is due to decreased elongation rate” where the authors say "due to"). While a manuscript presenting two separate, but related, themes would be acceptable, that is not what the current manuscript does.

As requested, we have softened the statements in a few places and also explicitly discussed an alternative explanation of the data (see point 1 of reviewer 2). However, we find it difficult to respond to this re-review. While there was a general request “to provide stronger data indicating that Pol II elongation rate is indeed induced during diauxic shift,” no specific experiments were suggested in the “essential revisions” of the first review and none are suggested in the re-review. Furthermore, in our response to the original reviews, we provided detailed arguments about (1) why our conclusion is compelling (even if not formally proven, which we never claim), (2) why a direct measurement of Pol II speed in diauxic conditions is technically impossible, and (3) why any such measurement could not be interpreted due to complications related to slow growth. Reviewer 3 did not challenge or address these arguments.

I think we accurately stated the strength of our conclusions and have never described them with terms such as “proven”, “conclusive” or “definitive”. “Evidence that” is clearly a qualified statement. The examples of “is the cause” and “due to” are misleading because in all of these examples were qualified by strongly suggest (which we have now softened to suggest) in the same sentence.

There appears to be some confusion about what themes 1 and 2 are. Theme 1 is about regulated polyadenylation and includes upstream shifts in both diauxic and slow Pol II mutants and the processivity experiments. Theme 2 concerns mechanistic information about the relationship of Pol II speed to polyadenylation and includes the fast Pol II data, gene specificity and sequence determinants of speed dependence, and rebalancing of sites upon the shift and the absence of new sites. We mentioned the two themes in our initial response because theme 2 was largely overlooked in the original reviews even though we (and reviewer 2) thought it an important aspect of the paper. It had nothing to do with regulated polyadenylation (theme 1).

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

Geisberg JV, Moqtaderi Z, Struhl K. 2020. The transcriptional elongation rate regulates alternative polyadenylation in yeast. NCBI Gene Expression Omnibus. GSE151196 [DOI] [PMC free article] [PubMed]

Supplementary Materials

Transparent reporting form

elife-59810-transrepform.docx^{(245.6KB, docx)}

Data Availability Statement

Sequencing data has been deposited in GEO under accession code GSE151196.

The following dataset was generated:

Geisberg JV, Moqtaderi Z, Struhl K. 2020. The transcriptional elongation rate regulates alternative polyadenylation in yeast. NCBI Gene Expression Omnibus. GSE151196

[bib1] Aparicio OM, Geisberg JV, Struhl K. Chromatin immunoprecipitation for determining the association of proteins with specific genomic sequences in vivo. Current Protocols in Molecular Biology. 2004;17:23. doi: 10.1002/0471143030.cb1707s23. [DOI] [PubMed] [Google Scholar]

[bib2] Baejen C, Andreani J, Torkler P, Battaglia S, Schwalb B, Lidschreiber M, Maier KC, Boltendahl A, Rus P, Esslinger S, Söding J, Cramer P. Genome-wide analysis of RNA polymerase II termination at Protein-Coding genes. Molecular Cell. 2017;66:38–49. doi: 10.1016/j.molcel.2017.02.009. [DOI] [PubMed] [Google Scholar]

[bib3] Baltz AG, Munschauer M, Schwanhäusser B, Vasile A, Murakawa Y, Schueler M, Youngs N, Penfold-Brown D, Drew K, Milek M, Wyler E, Bonneau R, Selbach M, Dieterich C, Landthaler M. The mRNA-bound proteome and its global occupancy profile on protein-coding transcripts. Molecular Cell. 2012;46:674–690. doi: 10.1016/j.molcel.2012.05.021. [DOI] [PubMed] [Google Scholar]

[bib4] Barnes CO, Calero M, Malik I, Graham BW, Spahr H, Lin G, Cohen AE, Brown IS, Zhang Q, Pullara F, Trakselis MA, Kaplan CD, Calero G. Crystal structure of a transcribing RNA polymerase II complex reveals a complete transcription bubble. Molecular Cell. 2015;59:258–269. doi: 10.1016/j.molcel.2015.06.034. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib5] Bartel DP. MicroRNAs: target recognition and regulatory functions. Cell. 2009;136:215–233. doi: 10.1016/j.cell.2009.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib6] Bentley DL. Coupling mRNA processing with transcription in time and space. Nature Reviews Genetics. 2014;15:163–175. doi: 10.1038/nrg3662. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib7] Berkovits BD, Mayr C. Alternative 3' UTRs act as scaffolds to regulate membrane protein localization. Nature. 2015;522:363–367. doi: 10.1038/nature14321. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib8] Braberg H, Jin H, Moehle EA, Chan YA, Wang S, Shales M, Benschop JJ, Morris JH, Qiu C, Hu F, Tang LK, Fraser JS, Holstege FC, Hieter P, Guthrie C, Kaplan CD, Krogan NJ. From structure to systems: high-resolution, quantitative genetic analysis of RNA polymerase II. Cell. 2013;154:775–788. doi: 10.1016/j.cell.2013.07.033. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib9] de la Mata M, Alonso CR, Kadener S, Fededa JP, Blaustein M, Pelisch F, Cramer P, Bentley D, Kornblihtt AR. A slow RNA polymerase II affects alternative splicing in vivo. Molecular Cell. 2003;12:525–532. doi: 10.1016/j.molcel.2003.08.001. [DOI] [PubMed] [Google Scholar]

[bib10] Dujardin G, Lafaille C, de la Mata M, Marasco LE, Muñoz MJ, Le Jossic-Corcos C, Corcos L, Kornblihtt AR. How slow RNA polymerase II elongation favors alternative exon skipping. Molecular Cell. 2014;54:683–690. doi: 10.1016/j.molcel.2014.03.044. [DOI] [PubMed] [Google Scholar]

[bib11] Dunn OJ. Multiple comparisons among means. Journal of the American Statistical Association. 1961;56:52–64. doi: 10.1080/01621459.1961.10482090. [DOI] [Google Scholar]

[bib12] Elkon R, Ugalde AP, Agami R. Alternative cleavage and polyadenylation: extent, regulation and function. Nature Reviews Genetics. 2013;14:496–506. doi: 10.1038/nrg3482. [DOI] [PubMed] [Google Scholar]

[bib13] Flavell SW, Kim TK, Gray JM, Harmin DA, Hemberg M, Hong EJ, Markenscoff-Papadimitriou E, Bear DM, Greenberg ME. Genome-wide analysis of MEF2 transcriptional program reveals synaptic target genes and neuronal activity-dependent polyadenylation site selection. Neuron. 2008;60:1022–1038. doi: 10.1016/j.neuron.2008.11.029. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib14] Floor SN, Doudna JA. Tunable protein synthesis by transcript isoforms in human cells. eLife. 2016;5:e10921. doi: 10.7554/eLife.10921. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib15] Fong N, Brannan K, Erickson B, Kim H, Cortazar MA, Sheridan RM, Nguyen T, Karp S, Bentley DL. Effects of transcription elongation rate and Xrn2 exonuclease activity on RNA polymerase II termination suggest widespread kinetic competition. Molecular Cell. 2015;60:256–267. doi: 10.1016/j.molcel.2015.09.026. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib16] Fong N, Saldi T, Sheridan RM, Cortazar MA, Bentley DL. RNA pol II dynamics modulate Co-transcriptional chromatin modification, CTD phosphorylation, and transcriptional direction. Molecular Cell. 2017;66:546–557. doi: 10.1016/j.molcel.2017.04.016. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib17] Freeberg MA, Han T, Moresco JJ, Kong A, Yang YC, Lu ZJ, Yates JR, Kim JK. Pervasive and dynamic protein binding sites of the mRNA transcriptome in Saccharomyces cerevisiae. Genome Biology. 2013;14:R13. doi: 10.1186/gb-2013-14-2-r13. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib18] Galdieri L, Mehrotra S, Yu S, Vancura A. Transcriptional regulation in yeast during diauxic shift and stationary phase. OMICS. 2010;14:629–638. doi: 10.1089/omi.2010.0069. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib19] Geisberg JV, Moqtaderi Z, Fan X, Ozsolak F, Struhl K. Global analysis of mRNA isoform half-lives reveals stabilizing and destabilizing elements in yeast. Cell. 2014;156:812–824. doi: 10.1016/j.cell.2013.12.026. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib20] Graber JH, Nazeer FI, Yeh PC, Kuehner JN, Borikar S, Hoskinson D, Moore CL. DNA damage induces targeted, genome-wide variation of poly(A) sites in budding yeast. Genome Research. 2013;23:1690–1703. doi: 10.1101/gr.144964.112. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib21] Gruber AJ, Zavolan M. Alternative cleavage and polyadenylation in health and disease. Nature Reviews Genetics. 2019;20:599–614. doi: 10.1038/s41576-019-0145-z. [DOI] [PubMed] [Google Scholar]

[bib22] Gupta I, Clauder-Münster S, Klaus B, Järvelin AI, Aiyar RS, Benes V, Wilkening S, Huber W, Pelechano V, Steinmetz LM. Alternative Polyadenylation diversifies post-transcriptional regulation by selective RNA-protein interactions. Molecular Systems Biology. 2014;10:719. doi: 10.1002/msb.135068. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib23] Ji Z, Lee JY, Pan Z, Jiang B, Tian B. Progressive lengthening of 3' untranslated regions of mRNAs by alternative polyadenylation during mouse embryonic development. PNAS. 2009;106:7028–7033. doi: 10.1073/pnas.0900028106. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib24] Jin Y, Geisberg JV, Moqtaderi Z, Ji Z, Hoque M, Tian B, Struhl K. Mapping 3′ mRNA Isoforms on a Genomic Scale. Current Protocols in Molecular Biology. 2015;110:1–17. doi: 10.1002/0471142727.mb0423s110. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib25] Kamieniarz-Gdula K, Gdula MR, Panser K, Nojima T, Monks J, Wiśniewski JR, Riepsaame J, Brockdorff N, Pauli A, Proudfoot NJ. Selective roles of vertebrate PCF11 in premature and Full-Length transcript termination. Molecular Cell. 2019;74:158–172. doi: 10.1016/j.molcel.2019.01.027. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib26] Kaplan CD, Jin H, Zhang IL, Belyanin A. Dissection of pol II trigger loop function and pol II activity-dependent control of start site selection in vivo. PLOS Genetics. 2012;8:e1002627. doi: 10.1371/journal.pgen.1002627. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib27] Kaplan CD. Basic mechanisms of RNA polymerase II activity and alteration of gene expression in Saccharomyces cerevisiae. Biochimica Et Biophysica Acta (BBA) - Gene Regulatory Mechanisms. 2013;1829:39–54. doi: 10.1016/j.bbagrm.2012.09.007. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib28] Kim M, Krogan NJ, Vasiljeva L, Rando OJ, Nedea E, Greenblatt JF, Buratowski S. The yeast Rat1 exonuclease promotes transcription termination by RNA polymerase II. Nature. 2004;432:517–522. doi: 10.1038/nature03041. [DOI] [PubMed] [Google Scholar]

[bib29] Krogan NJ, Dover J, Wood A, Schneider J, Heidt J, Boateng MA, Dean K, Ryan OW, Golshani A, Johnston M, Greenblatt JF, Shilatifard A. The Paf1 complex is required for histone H3 methylation by COMPASS and Dot1p: linking transcriptional elongation to histone methylation. Molecular Cell. 2003;11:721–729. doi: 10.1016/S1097-2765(03)00091-1. [DOI] [PubMed] [Google Scholar]

[bib30] Langmead B, Trapnell C, Pop M, Salzberg SL. Ultrafast and memory-efficient alignment of short DNA sequences to the human genome. Genome Biology. 2009;10:R25. doi: 10.1186/gb-2009-10-3-r25. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib31] Laughery MF, Hunter T, Brown A, Hoopes J, Ostbye T, Shumaker T, Wyrick JJ. New vectors for simple and streamlined CRISPR-Cas9 genome editing in Saccharomyces cerevisiae. Yeast. 2015;32:711–720. doi: 10.1002/yea.3098. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib32] Lei EP, Krebber H, Silver PA. Messenger RNAs are recruited for nuclear export during transcription. Genes & Development. 2001;15:1771–1782. doi: 10.1101/gad.892401. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib33] Li J, Lu X. The emerging roles of 3' untranslated regions in Cancer. Cancer Letters. 2013;337:22–25. doi: 10.1016/j.canlet.2013.05.034. [DOI] [PubMed] [Google Scholar]

[bib34] Liu X, Freitas J, Zheng D, Oliveira MS, Hoque M, Martins T, Henriques T, Tian B, Moreira A. Transcription elongation rate has a tissue-specific impact on alternative cleavage and polyadenylation in Drosophila melanogaster. RNA. 2017a;23:1807–1816. doi: 10.1261/rna.062661.117. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib35] Liu X, Hoque M, Larochelle M, Lemay JF, Yurko N, Manley JL, Bachand F, Tian B. Comparative analysis of alternative polyadenylation in S. cerevisiae and S. pombe. Genome Research. 2017b;27:1685–1695. doi: 10.1101/gr.222331.117. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib36] Masamha CP, Xia Z, Yang J, Albrecht TR, Li M, Shyu AB, Li W, Wagner EJ. CFIm25 links alternative polyadenylation to glioblastoma tumour suppression. Nature. 2014;510:412–416. doi: 10.1038/nature13261. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib37] Mason PB, Struhl K. Distinction and relationship between elongation rate and processivity of RNA polymerase II in vivo. Molecular Cell. 2005;17:831–840. doi: 10.1016/j.molcel.2005.02.017. [DOI] [PubMed] [Google Scholar]

[bib38] Mayr C. Evolution and biological roles of alternative 3'UTRs. Trends in Cell Biology. 2016;26:227–237. doi: 10.1016/j.tcb.2015.10.012. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib39] Mayr C, Bartel DP. Widespread shortening of 3'UTRs by alternative cleavage and polyadenylation activates oncogenes in cancer cells. Cell. 2009;138:673–684. doi: 10.1016/j.cell.2009.06.016. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib40] Moqtaderi Z, Geisberg JV, Jin Y, Fan X, Struhl K. Species-specific factors mediate extensive heterogeneity of mRNA 3' ends in yeasts. PNAS. 2013;110:11073–11078. doi: 10.1073/pnas.1309384110. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib41] Moqtaderi Z, Geisberg JV, Struhl K. Extensive structural differences of closely related 3' mRNA isoforms: links to Pab1 binding and mRNA stability. Molecular Cell. 2018;72:849–861. doi: 10.1016/j.molcel.2018.08.044. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib42] Ng HH, Robert F, Young RA, Struhl K. Targeted recruitment of Set1 histone methylase by elongating pol II provides a localized mark and memory of recent transcriptional activity. Molecular Cell. 2003;11:709–719. doi: 10.1016/s1097-2765(03)00092-3. [DOI] [PubMed] [Google Scholar]

[bib43] Ogorodnikov A, Levin M, Tattikota S, Tokalov S, Hoque M, Scherzinger D, Marini F, Poetsch A, Binder H, Macher-Göppinger S, Probst HC, Tian B, Schaefer M, Lackner KJ, Westermann F, Danckwardt S. Transcriptome 3'end organization by PCF11 links alternative polyadenylation to formation and neuronal differentiation of neuroblastoma. Nature Communications. 2018;9:5331. doi: 10.1038/s41467-018-07580-5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib44] Ozsolak F, Kapranov P, Foissac S, Kim SW, Fishilevich E, Monaghan AP, John B, Milos PM. Comprehensive polyadenylation site maps in yeast and human reveal pervasive alternative polyadenylation. Cell. 2010;143:1018–1029. doi: 10.1016/j.cell.2010.11.020. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib45] Pelechano V, Wei W, Steinmetz LM. Extensive transcriptional heterogeneity revealed by isoform profiling. Nature. 2013;497:127–131. doi: 10.1038/nature12121. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib46] Sandberg R, Neilson JR, Sarma A, Sharp PA, Burge CB. Proliferating cells express mRNAs with shortened 3' untranslated regions and fewer microRNA target sites. Science. 2008;320:1643–1647. doi: 10.1126/science.1155390. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib47] Sherstnev A, Duc C, Cole C, Zacharaki V, Hornyik C, Ozsolak F, Milos PM, Barton GJ, Simpson GG. Direct sequencing of Arabidopsis thaliana RNA reveals patterns of cleavage and polyadenylation. Nature Structural & Molecular Biology. 2012;19:845–852. doi: 10.1038/nsmb.2345. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib48] Sparks KA, Mayer SA, Dieckmann CL. Premature 3'-end formation of CBP1 mRNA results in the downregulation of cytochrome b mRNA during the induction of respiration in Saccharomyces cerevisiae. Molecular and Cellular Biology. 1997;17:4199–4207. doi: 10.1128/MCB.17.8.4199. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib49] Sparks KA, Dieckmann CL. Regulation of poly(A) site choice of several yeast mRNAs. Nucleic Acids Research. 1998;26:4676–4687. doi: 10.1093/nar/26.20.4676. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib50] Strässer K, Masuda S, Mason P, Pfannstiel J, Oppizzi M, Rodriguez-Navarro S, Rondón AG, Aguilera A, Struhl K, Reed R, Hurt E. TREX is a conserved complex coupling transcription with messenger RNA export. Nature. 2002;417:304–308. doi: 10.1038/nature746. [DOI] [PubMed] [Google Scholar]

[bib51] Tian B, Manley JL. Alternative cleavage and polyadenylation: the long and short of it. Trends in Biochemical Sciences. 2013;38:312–320. doi: 10.1016/j.tibs.2013.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib52] Tian B, Manley JL. Alternative polyadenylation of mRNA precursors. Nature Reviews Molecular Cell Biology. 2017;18:18–30. doi: 10.1038/nrm.2016.116. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib53] Veloso A, Kirkconnell KS, Magnuson B, Biewen B, Paulsen MT, Wilson TE, Ljungman M. Rate of elongation by RNA polymerase II is associated with specific gene features and epigenetic modifications. Genome Research. 2014;24:896–905. doi: 10.1101/gr.171405.113. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib54] Vivier MA, Lambrechts MG, Pretorius IS. Coregulation of starch degradation and dimorphism in the yeast Saccharomyces cerevisiae. Critical Reviews in Biochemistry and Molecular Biology. 1997;32:405–435. doi: 10.3109/10409239709082675. [DOI] [PubMed] [Google Scholar]

[bib55] Wallace EWJ, Beggs JD. Extremely fast and incredibly close: cotranscriptional splicing in budding yeast. RNA. 2017;23:601–610. doi: 10.1261/rna.060830.117. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib56] Wang D, Bushnell DA, Westover KD, Kaplan CD, Kornberg RD. Structural basis of transcription: role of the trigger loop in substrate specificity and catalysis. Cell. 2006;127:941–954. doi: 10.1016/j.cell.2006.11.023. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib57] Weill L, Belloc E, Bava FA, Méndez R. Translational control by changes in poly(A) tail length: recycling mRNAs. Nature Structural & Molecular Biology. 2012;19:577–585. doi: 10.1038/nsmb.2311. [DOI] [PubMed] [Google Scholar]

PERMALINK

The transcriptional elongation rate regulates alternative polyadenylation in yeast

Joseph V Geisberg

Zarmik Moqtaderi

Kevin Struhl

Roles

Abstract

Introduction

Results

Poly(A) sites are shifted upstream under diauxic conditions, favoring shorter 3’ isoforms

Figure 1. Poly(A) sites are shifted upstream in diauxic cells.

Figure 1—figure supplement 1. Correlation of biological replicates.

Yeast cells containing Pol II derivatives with slow elongation rates show a poly(A) pattern that strikingly resembles the pattern in diauxic shift

Figure 2. Slow Pol II and diauxic end zones are highly similar.

Figure 2—figure supplement 1. Heat map of percent coordinate utilization in 3’UTRs.

Upstream shifts involve differential utilization of pre-existing poly(A) sites

Figure 3. High overlap in poly(A) sites used in diauxic and slow-Pol II strains.

Figure 3—figure supplement 1. High poly(A) site overlap across strains/conditions despite differences in relative levels.

Pol II derivatives with fast elongation rates show modest downstream shifts in poly(A) patterns

Figure 4. Increased usage of downstream poly(A) sites in fast Pol II strains.

Figure 4—figure supplement 1. Downstream end zone shift in fast Pol II strains.

Poly(A) patterns of individual genes vary in their sensitivity to Pol II elongation rate

Sequences around cleavage sites of Pol II speed-sensitive genes are enriched for purines

Figure 5. Increased purine content in sequences flanking poly(A) sites of genes sensitive to Pol II speed.

Figure 5—figure supplement 1. Percent identity of max isoform positions by condition, strain, and category.

Evidence that Pol II elongation rate is decreased in diauxic conditions

Figure 6. Pol II elongation rate is linked to shifted end zone profiles in diauxic conditions.

Figure 6—figure supplement 1. 3’UTR percent coordinate utilization for several strains/conditions.

Pol II elongation rate, not processivity, is important for polyadenylation patterns

Discussion

Pol II elongation rate, not Pol II processivity, affects poly(A) site selection

Figure 7. Model of poly(A) site shift in Pol II speed-sensitive genes.

Evidence that regulated polyadenylation during the diauxic shift is due to decreased elongation rate

Mechanistic implications about regulation of alternative polyadenylation

Materials and methods

Strains

RNA analysis

Chromatin immunoprecipitation

Data analysis

End zone profiles, important parameters, and definitions

Percent coordinate usage analysis

Correlations of biological replicates

Classification of genes by sensitivity to pol II elongation rate perturbations

Initial classification of poly(A) shift direction by gene

Combined classification of poly(A) shift behavior by gene across multiple strains

Conservation of endpoints

Nucleotide frequency composition analysis

Acknowledgements

Funding Statement

Contributor Information

Funding Information

Additional information

Competing interests

Author contributions

Additional files

Data availability

References

Decision letter

Roles

Author response

Associated Data

Data Citations

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases