Skip to main content
Elsevier Sponsored Documents logoLink to Elsevier Sponsored Documents
. 2019 Aug;35(8):553–564. doi: 10.1016/j.tig.2019.05.005

Transcriptional Control by Premature Termination: A Forgotten Mechanism

Kinga Kamieniarz-Gdula 1,2,3,⁎,@, Nick J Proudfoot 1
PMCID: PMC7471841  PMID: 31213387

Abstract

The concept of early termination as an important means of transcriptional control has long been established. Even so, its role in metazoan gene expression is underappreciated. Recent technological advances provide novel insights into premature transcription termination (PTT). This process is frequent, widespread, and can occur close to the transcription start site (TSS), or within the gene body. Stable prematurely terminated transcripts contribute to the transcriptome as instances of alternative polyadenylation (APA). Independently of transcript stability and function, premature termination opposes the formation of full-length transcripts, thereby negatively regulating gene expression, especially of transcriptional regulators. Premature termination can be beneficial or harmful, depending on its context. As a result, multiple factors have evolved to control this process.

Keywords: premature transcription termination, transcription attenuation, alternative polyadenylation (APA), intronic polyadenylation (IPA), alternative last exon (ALE)

Highlights

  • PTT is widespread in metazoans. It can occur close to the TSS or further downstream in the gene body.

  • PTT generates transcripts that, depending on the circumstances, are either rapidly degraded, or are stabilised by polyadenylation, thus contributing to transcriptome diversification.

  • Stable premature transcripts can have independent functions as noncoding (nc)RNA or mRNA encoding proteins with different properties compared with those generated by the full-length transcript.

  • PTT can negatively regulate expression of the full-length transcript and especially controls genes encoding transcriptional regulators.

  • Factors triggering PTT include not only canonical RNA 3′ processing and termination factors, but also other players. Many metazoan factors oppose PTT, thus limiting its damaging potential.

Transcriptional Control by Premature Termination: Revisited

It is well established that early termination can serve as an important mechanism for transcriptional control. PTT (see Glossary), or ‘attenuation’, was demonstrated during the mid-1970s to be a key regulatory event for the synthesis of bacterial enzymes that make amino acids 1., 2., and first reported in 1979 to occur for RNA polymerase II (Pol II) transcription of a viral gene in mammalian cells [3]. Many more cases of eukaryotic PTT have been identified, even though their analysis was hampered by technical limitations and the highly unstable nature of prematurely terminated RNA. The recent development of next-generation sequencing technologies combined with novel methods to measure nascent transcription, single-molecule footprints, and advanced live-imaging makes it possible to revisit this paradigm. In this review, we present recent findings on metazoan PTT, revealing its widespread nature and role in the regulation of protein-coding genes. While we focus on metazoans, a broader perspective of PTT in other kingdoms of life is summarised in Box 1. Our definition of PTT is the release of Pol II from the gene template between the TSS and 3′-untranslated region (UTR; Figure 1, Key Figure) of the gene. We note that transcription termination is tightly linked with RNA 3′ cleavage and polyadenylation (CPA). Consequently, these two terms are often used ambiguously or even confused in the literature. Multiple recent reviews provide a general background to RNA 3′ processing and transcription termination 4., 5., 6., 7., 8., 9., 10., 11. as well as APA 12., 13., 14., 15., 16..

Box 1. PTT in Bacteria, Yeast and Plants.

PTT has been long known to be a key regulatory event in bacteria, referred to as attenuation. Classically, attenuation was shown to control the expression of enzymes involved in amino acid biosynthesis, such as the tryptophan and histidine operons 1., 2.. Bacterial terminators can be intrinsic, associated with a hairpin RNA structure, or factor dependent, usually involving the RNA helicase Rho. Attenuation occurs when an antiterminator hairpin RNA forms ahead of an intrinsic terminator positioned near the 5′ end of an operon. Formation of the antiterminator hairpin precludes the formation of the intrinsic terminator hairpin and so allows transcription to read into the operon and express its protein-coding regions. Switching between the antiterminator and terminator hairpins is controlled by diverse regulators 100., 101.. Given that translation occurs co-transcriptionally, PTT is closely coupled to translation regulation. This differentiates it from eukaryotic regulation.

PTT is also a well-recognised regulatory mechanism in Saccharomyces cerevisiae, mediated by the Nrd1–Nab3–Sen1 (NNS) complex. The first example of attenuation by NNS was demonstrated for the NRD1 gene, which is autoregulated by PTT in response to Nrd1 activity [98]. NNS-mediated PTT further regulates genes involved in nucleotide and amino acid biosynthesis, as well as nitrogen metabolism, and is physiologically relevant upon nutritional shift 102., 103., 104., 105.. The prematurely terminated transcripts sometimes initiate at a TSS upstream of the protein-coding gene [102]. Interestingly, it was recently shown that the DNA repair gene DEF1 is attenuated by Sen1 and CPA factors, without Nrd1 and Nab3 involvement [106]; therefore, PTT in S. cerevisiae might not be limited to the NNS pathway.

There are no Nrd1/Nab3 homologues known in plants. However, PTT has an elaborate role in the control of flowering time in Arabidopsis thaliana. FLC is a transcription factor that acts as a master regulator of flowering. It is carefully titrated: small changes in FLC transcript levels significantly affect flowering. The accumulation of FLC mRNA is prevented by FCA and FPA, two RNA-binding proteins associated with RNA 3′-processing factors. FCA and FPA autoregulate their own levels by premature polyadenylation and termination, independently of each other 107., 108.. Interestingly, they also promote early termination of the lncRNA COOLAIR 108., 109.. COOLAIR is an antisense transcript to FLC, and functions in early cold-induced silencing of FLC transcription [110]. As a result, several layers of premature termination of coding and noncoding transcripts act to control the timing of plant flowering. It is likely that other examples of PTT will be established in plants.

Alt-text: Box 1

Figure 1.

Figure 1

Key Figure. Localisation of Premature and Full-Length Transcription Termination.

We classify any transcription termination events occurring within the 3′-untranslated region (UTR) of the full-length transcript or downstream of the annotated 3′-UTR, as full-length, whereas those occurring before as premature. Premature transcription termination (PTT) can occur in the vicinity of the transcription start site (TSS), where we call it TSS-linked PTT, or further downstream in the gene body, described here as intragenic PTT. As a rule of thumb, we annotate TSS-linked PTT as corresponding to termination events within the first exon; however, please note that this is not an absolute distinction and is likely to vary on a gene-by-gene basis. PTT can generate unstable (dotted wavy line) or stable (solid wavy line), polyadenylated transcripts. If PTT generates a stable, polyadenylated transcript, it can be classified as alternative polyadenylation (APA) or intronic polyadenylation (IPA). Abbreviation: (A)n, polyadenylation.

TSS-Linked Premature Transcription Termination

PTT of a protein-coding gene can be divided into termination events occurring close to the TSS or within the gene body (Figure 1). We predict that PTTs at these two locations are likely to be functionally and mechanistically different, since they occur at different stages of the transcription cycle.

For many genes, their TSS is characterised by high accumulation of Pol II, as measured by chromatin immunoprecipitation (ChIP). This depends on the action of negative elongation factor (NELF) and DRB sensitivity-inducing factor (DSIF), typically occurring 30–50 base pairs downstream of the TSS [17]. Such a Pol II ‘pileup’ is usually interpreted as stable pausing of engaged Pol II. However, it may be also due to PTT with concomitant Pol II turnover [18]. Several lines of evidence support the latter: termination and RNA 3′-processing factors have been observed to accumulate at the 5′ ends of genes 19., 20., short nuclear capped transcripts have been detected [21], and RNA cleavage sites near the TSS were identified [22]. Over the past 2 years, more direct experiments have demonstrated that a high percentage of TSS-bound Pol II molecules terminate prematurely.

Pol II binding to the genome was recently measured at a single-molecule resolution in Drosophila, with the aid of a novel single-molecule footprinting method [23]. This revealed unexpectedly high levels of Pol II turnover at the promoters of paused genes. In particular, the measured Pol II half-life at promoters of model paused genes was comparable to ‘nonpaused’, normally elongating genes. Therefore, Pol II accumulation at these promoters appears to be largely due to PTT, rather than to stable pausing of transcription-competent polymerases. This interpretation is further supported by an independent study that analysed the real-time dynamics of Pol II in live human cells using fluorescence recovery after photobleaching (FRAP) [24]. Computational modelling of Pol II kinetics showed that initiating Pol II remains chromatin bound for only 2.4 s and promoter-paused Pol II for 42 s, in contrast to elongating Pol II, which remained chromatin bound on average for 23 min. These big differences in Pol II residence times suggest that only a small fraction of initiating and pausing Pol II proceeds through a complete transcription cycle, whereas most Pol II is released from chromatin at the promoter. Indeed, the determined rate constants showed that only ~10% of Pol II molecules that initiate transcription will go on to promoter pausing and, of those, only ~10% continue into productive elongation. Thus, this study indicates that 99% of transcription initiation events result in PTT at the promoter, with only 1% giving rise to mRNA [24]. This surprisingly inefficient transcription initiation process is consistent with previous Pol II measurements on a lacO array [25]. Furthermore, the inhibition of PTT is the most plausible explanation for the dramatic increase in promoter-associated Pol II within 2–3 min after H2O2 addition to U2OS cells [26]. As a further clever way to investigate promoter-associated Pol II, the differential sensitivity of transcription initiation and elongation to high ionic strength has been used [27]. This showed that blocking recruitment of Pol II to promoters (but not elongation) by high salt treatment affected its binding in ChIP followed by next-generation sequencing (ChIP-seq), and revealed an almost complete loss of Pol II from promoter-proximal pause sites within 2–5 min. This loss was rapidly reversible and unaffected by transcriptional inhibitors. Therefore, Pol II removal from pause sites appears not to require elongation. Instead, a high rate of assembly and eviction of pre-initiated Pol II complexes at TSS is predicted. Although the above-mentioned studies used different methodologies, they all describe high turnover rates of Pol II at Drosophila or human promoters in various cell types 23., 24., 25., 26., 27.. Therefore, it is unlikely that the observed turnover is an artefact from any one procedure or an unusual cell type. In conclusion, most initiating Pol II molecules appear to terminate prematurely. It follows that the release of Pol II into productive elongation may be regulated by inhibition of this promoter proximal Pol II termination.

While promoter-proximal PTT has been largely overlooked, Pol II pausing in this location is well established in metazoans, and is tightly regulated by negative and positive elongation factors, such as P-TEFb 17., 28., 29.. Several previous studies described longer median half-lives of paused Pol II 30., 31., 32., 33.. One possible explanation for this discrepancy is the use of triptolide to block initiation. This blocks open complex formation by inhibition of TFIIH-associated XPB 34., 35. and was assumed to prevent recruitment of stable Pol II complexes at promoters. However, triptolide also disturbs transcriptional regulation and Pol II stability. Additionally, there is a lag in the onset of XBP inhibition, which may prevent accurate half-life determination 24., 27.. Further studies using different drugs and methods are required to resolve this discrepancy and so determine the relative contribution of promoter-proximal Pol II pausing versus PTT, to Pol II occupancy at promoters and to the control of productive elongation. Although technically challenging, it will be important to more directly determine the percentage of RNA molecules associated with paused Pol II, which, in physiological conditions, lead to mRNA production; that is, to demonstrate their assumed precursor–product relationship. Notably, paused Pol II blocks transcription initiation of additional polymerases 33., 36.. As Pol II residence times are variable on different genes 23., 33. the relevance of PTT and Pol II pausing is likely to be gene specific. Importantly, the two models explaining promoter-associated Pol II accumulation are not mutually exclusive; Pol II pausing may even have a direct role in early termination. Currently the mechanism of TSS-associated PTT remains unclear. For example, does Pol II disengagement from DNA in the vicinity of the TSS require a prior RNA cleavage step?

Intragenic Premature Termination of Transcription

Once Pol II overcomes the TSS-associated elongation checkpoint, subsequent PTT likely requires a co-transcriptional cleavage reaction at a cryptic polyadenylation site (PAS), mainly located in introns. PAS-mediated cleavage within internal exons appears to be a small fraction compared with introns, and leads to transcripts without a stop codon, which typically get rapidly degraded through the nonstop decay pathway [37], although, in rare cases, they can result in truncated proteins [38]. PTT in the beginning of the first intron is frequent in conditions were U1 small nuclear ribonucleoprotein complex (U1 snRNP) or the cap-binding complex (CBC) are depleted 39., 40., 41., and partially coincides with a second Pol II stalling event at stable nucleosomes downstream of CpG island promoters [42]. Such PTT leads to unstable transcripts that undergo rapid degradation (see next section). Most current knowledge of intragenic PTT comes from the analysis of stable premature polyadenylated transcripts, which can be readily detected by various RNA-sequencing methodologies. These stable transcripts are classified as instances of APA, as reviewed in [14]. APA within the gene body, upstream of the 3′-UTR, has been referred to as coding sequence APA (CDS-APA), upstream region APA (UR-APA), intronic polyadenylation (IPA/IpA) or alternative last exon (ALE). We use the term ‘IPA’ here. IPA was first described for specific genes in 1980 43., 44., 45., but only recently has its widespread nature and significance been appreciated, due to the development of different PAS-sequencing methods. Thus, ~40% of murine genes [46] and 16% of genes in human immune cells [47] have been reported to express IPA isoforms. In particular, IPA has been shown to be a frequent genome-wide event, with diverse roles in immune cells [48], inactivation of tumour suppressor genes [49] ,and regulation of DNA repair genes [50]. Notably, a systematic analysis of IPA in normal human tissues, primary immune cells, and multiple myeloma samples has been used to create an atlas of 4927 high-confidence IPA events in these cell types [48]. IPA isoforms were shown to yield stable transcripts, which, in the case of 5′-proximal IPA, tend to produce noncoding RNA (ncRNA). By contrast, more 3′-proximal IPA events tend to produce truncated proteins [48].

Fate and Function of Prematurely Terminated Transcripts

PTT may lead to three outcomes: rapid transcript degradation; formation of a more stable ncRNA; or production of a protein-coding, polyadenylated mRNA isoform (Figure 2). The more 5′ proximal a transcript is terminated, the more likely it is to be degraded. TSS-linked PTT might not require cleavage and could lead to unprocessed, unprotected, nonpolyadenylated RNA. Further downstream, transcripts may be cleaved and at least partially polyadenylated. Short transcripts appear to generally undergo quick degradation, similar to promoter upstream transcripts (PROMPT)/upstream antisense RNA, which also often yield exosome-sensitive RNA 22., 51., 52., 53.. In the case of short sense transcripts, while PAS usage allows for transcript cleavage, this might not lead to efficient polyadenylation. Another explanation for the instability of short transcripts is that the 3′ entry site for the nuclear exosome in such transcripts is physically close to the 5′-capped end of the transcript, which is bound by the CBC, known to contribute to exosomal decay, especially of some classes of ncRNA 41., 54., 55.. Overall, different surveillance pathways may operate at different parts of the gene. It is also plausible that productive 3′-end processing and RNA stability are transcript dependent and influenced by the new coding potential of the shorter isoform, its new 3′-UTR regulatory elements, and so on.

Figure 2.

Figure 2

Fate and Function of Prematurely Terminated Transcripts.

Most transcription start site (TSS)-linked, exonic, and some intronic prematurely terminated transcripts are unstable, and likely lack cellular function. Stable prematurely terminated transcripts generated as a result of intronic polyadenylation (IPA) form either a noncoding (nc) RNA or protein-coding mRNA. Contrary to their name, ncRNA sometimes contain small open reading frames (ORFs), which may be translated into micropeptides. Other ncRNA can serve cellular functions, for example as scaffolds for RNA-binding proteins (RBP) [48]. Many ncRNA have no clear function determined to date. Protein-coding mRNA isoforms generated by PTT diversify the proteome. They lack the C-terminal domain(s) present in the full-length protein and may have different properties. These include membrane binding versus soluble 43., 44., 45., 48., 57., altered specificity or affinity for binding to nucleic acid or protein partners [48], and, in some cases, dominant negative functions 59., 60., 61., 62., 63.. Abbreviation: (A)n, polyadenylation.

Stable prematurely terminated RNAs are mainly polyadenylated and contribute to the IPA transcriptome. They can be further subdivided into ncRNA and protein-coding mRNA isoforms (Figure 2). Some ncRNA contain an open reading frame (ORF) and, therefore, might produce a micropeptide (<100 amino acids). Other ncRNA could serve specific functions. For example, PTT of ASCC3 leads to the formation of a stable ncRNA, which is critical for the recovery of transcription following ultraviolet (UV) damage [56]. IPA-generated ncRNA also interact with RNA-binding proteins, which are normally enriched in the 3′-UTRs of coding transcripts, such as FUS, ELAVL1, PUM2, TAF15, and TIAL1 [48]. Such ncRNAs could act as scaffolds for RBPs and so regulate other RNA in trans.

In the third scenario, protein-coding IPA isoforms (Figure 2) contribute to the diversity of the proteome, for example by generating proteins with physiologically distinct functions, or truncated dominant negative proteins. In the classic case of B cell expressed immunoglobulin M heavy chain mRNA, cellular activation causes a switch from full-length to IPA mRNA isoforms, resulting in a change from membrane-bound to secreted forms of the antibody 43., 44., 45.. IPA appears to regulate membrane-anchoring properties of many other proteins. Computational analysis revealed that 376 mouse genes are likely to use the IPA mechanism to generate proteins with changes to their transmembrane domains (TMD) [57]. In human cells, although TMD-containing proteins are significantly depleted among IPA genes, 499 genes encoding transmembrane proteins undergo IPA and, in 152 cases, these lead to loss of the TMD [48]. For example, IPA generates various mRNA isoforms of the transmembrane T cell co-stimulator CD46, which are predicted to form soluble CD46 [58]. Similarly, PTT variants encoding proteins with dominant negative functions, such as retinoblastoma-binding protein 6 (RBBP6) 59., 60., MAGI3 61., 62. or platelet-derived growth factor receptor α (PDGFRα), have been also described [63]. Interestingly, the site of premature polyadenylation is frequently located within a domain mediating either protein–protein interactions or DNA or RNA binding, such as zinc finger arrays [48]. The partial loss of such interaction surfaces may lead to altered binding affinities for protein interaction partners and altered nucleic acid-binding specificity, respectively. Therefore, IPA may be physiologically relevant by allowing a diversification of protein function. By contrast, the widespread pathological use of premature polyadenylation has been uncovered in chronic lymphocytic leukaemia [49]. Here, mRNA truncations by IPA are recurrent and predominantly affect genes with tumour-suppressive functions. This leads to either their inactivation or transformation into oncogenes. In conclusion, proteome alteration occurring as a result of IPA can be either beneficial or harmful.

Gene Regulation by Premature Transcription Termination

Regardless of whether the prematurely terminated transcript is stable or unstable, and independently of its ability to produce a functional ncRNA or protein, a potential outcome of early termination is the repression of the corresponding full-length mRNA (Figure 3). Such negative regulation has been demonstrated for the CSTF3 gene (known also as CSTF77), originally in Drosophila and then in humans 64., 65., 66.. CSTF3 is a CPA complex subunit, stimulating cleavage. Premature CPA of CSTF3 is induced by high levels of CSTF3 protein, thus forming a negative feedback loop controlling its own activity, which is important for cell cycle control [66]. The CSTF3 example suggested that PTT could be a more general mechanism to repress gene expression in metazoans.

Figure 3.

Figure 3

Transcriptional Control by Premature Termination.

Premature transcription termination (PTT) is mutually exclusive with full-length transcription. Independent of the fate or function of prematurely terminated transcripts, PTT negatively regulates the expression of the full-length transcripts. Genes controlled by PTT include transcriptional regulators 48., 68., DNA repair genes [50], and tumour suppressor genes [49], and tend to be larger than average genes 40., 69..

Supporting this possibility, the abundant U1 snRNP has been shown to function not only in splicing, but also by blocking widespread PTT in thousands of vertebrate genes 39., 40.. Vertebrate genes often contain large introns, with numerous cryptic poly(A) signals. Therefore, they are inherently susceptible to PTT, although this is suppressed by U1. Interestingly, PTT is strongly activated upon UV treatment of cells, and correlates with decreased U1 snRNA levels [67] and a slowdown in transcription elongation [56]. These studies demonstrated the widespread predisposition of vertebrate genes to IPA. However, it was unclear whether PTT serves a regulatory role genome wide, or is a harmful genomic accident, induced by DNA damage and suppressed by U1. Physiological regulation is usually achieved by limiting factors, whereas U1 snRNP is extremely abundant, at ~1 million copies per human cell.

Finally, it was recently shown that a subset of protein-coding genes is downregulated by PTT under physiological conditions, both in cultured human cells and during zebrafish embryogenesis. This is triggered by an RNA 3′-processing and termination factor called PCF11 [68]. Similar to CSTF3, PCF11 uses PTT as an autoregulatory mechanism 68., 69.. However, 218 other human genes were additionally identified as downregulated by PCF11-mediated PTT. This is likely an underestimate, because many genes undergoing PCF11-mediated PTT might also be dependent on PCF11 for efficient full-length transcript expression. Interestingly, half of the PCF11-attenuated genes show PTT without detectable IPA products [68]. This suggests that many transcripts generated by PTT under physiological conditions are unstable and that IPA corresponds to only a subset of intragenic PTT. Therefore, it will be important that future PTT studies also assay the nascent transcriptome. Notably, genes undergoing PCF11-mediated PTT are enriched for transcriptional regulators, both in humans and in zebrafish [68]. Similarly, in primary human tissues, IPA has been shown to occur preferentially on genes encoding transcriptional regulators [48]. PCF11 levels are an order of magnitude lower compared with other 3′-processing factors, but vary between tissues, making PCF11 a likely regulatory factor [68]. Overall, those data indicate that PTT is a naturally occurring, widespread, and controlled phenomenon in vertebrates.

The Role of CPA and Termination Factors in PTT

Since PCF11 acts at both gene ends and throughout the gene body, it appears likely that other canonical CPA and termination factors are involved in PTT (Figure 4). Notably, XRN2, the nuclear 5′–3′ exonuclease ‘torpedo’ that facilitates transcription termination at the 3′ ends of genes, has been shown by ChIP-seq to also localise near TSS, and interact with decapping factors [19]. Coupled decapping of nascent transcripts and PTT has been suggested to limit bidirectional Pol II elongation. PCF11 as well as CPSF73 is enriched at TSS in ChIP-seq [68]. Additionally, in vivo crosslinking and immunoprecipitation (CLIP) revealed that all analysed CPA factors (CPSF73, CstF64, CstF64t, CPSF160, CPSF30, and CFIm25) were significantly detected on both strands in the vicinity of the TSS 70., 71.. Finally, depletion of CPA and termination factors (CPSF73, CSTF64, XRN2, and PCF11) resulted in increased mammalian native elongating transcript sequencing (mNET-seq) signal specifically at the TSS in both sense and antisense direction 68., 71.. This suggests that TSS-linked nonproductive RNA synthesis may be terminated by the same factors that are responsible for 3′ processing and transcription termination at gene ends.

Figure 4.

Figure 4

Factors Triggering and Suppressing Premature Transcription Termination (PTT).

The molecular mechanism of PTT is unclear. However, multiple factors may trigger or prevent PTT. PTT-triggering factors include not only canonical proteins involved in RNA 3′-processing and transcription termination at gene ends [PCF11, XRN2, and the cleavage and polyadenylation (CPA) complex], but also other factors, mainly involved in RNA processing [nuclear exosome, cap-binding complex (CBC), and the Integrator complex]. Intragenic PTT is mutually exclusive with splicing; therefore, the process of splicing and splicing-promoting factors are major factors suppressing PTT. Further players preventing spurious PTT are also shown, and discussed in the main text. We note that many of the listed factors affect PTT indirectly, by affecting RNA polymerase II (Pol II) dynamics, co-transcriptional processing, or RNA stability. Further mechanistic studies are needed to provide proof of which factors have a direct role in PTT. Abbreviations: CFIm, mammalian cleavage factor I; m6A, modification of RNA by N6-methyladenosine.

Apart from PCF11, the involvement of the termination machinery in premature cleavage within the gene body is less well defined. However, IPA globally correlates with the preferential use of proximal PASs in 3′-UTRs 46., 48., 72., 73., which are both more prevalent in proliferating cells but less so during cell differentiation. Consequently, IPA and 3′-UTR APA are likely to be coregulated. In addition to PCF11, FIP1 is a likely candidate for PTT stimulation, because it is the only other CPA factor known to promote early PAS usage [74].

Noncanonical Factors Involved in Premature Termination

Besides the factors that are known to mediate 3′-processing and transcription termination at gene ends, others have emerged that stimulate PTT (Figure 4). In budding yeast, premature termination is tightly linked with RNA degradation by the nuclear exosome [55]. Similarly, mutation of the catalytic subunit of the human nuclear exosome complex, DIS3, resulted in accumulation of truncated RNA, likely to be PTT products [75]. Furthermore, the mammalian nuclear exosome interacts with the CBC. Two CBC-associated proteins, ZC3H18 and SRRT/2, have also been connected with PTT events in the first introns of protein-coding genes [41].

Another candidate factor for PTT is the Integrator complex. This multiprotein complex interacts with Pol II and was initially identified as a RNA 3′-processing complex for snRNA 76., 77.. Two Integrator subunits, IntS9 and IntS11, are homologous to the CPA factors CPSF100 and CPSF73, respectively [78]. More recently, experimental evidence emerged that suggested that Integrator also has a role in the activation of protein-coding genes, particularly in Pol II pause–release and elongation 79., 80., 81., 82.. The subunit containing endonuclease activity, Ints11, was found to bind around TSS and to be required for TSS–proximal Pol II pause–release [80]. This function was dependent on its endonucleolytic activity. Thus, it is conceivable that the Integrator complex is involved in TSS-associated PTT.

In terms of the genetic elements that promote PTT, cryptic poly(A) signals within introns are likely candidates. Interestingly, intragenic enhancers also appear to stimulate PTT, because transcription at these enhancers interferes with, and attenuates, host gene transcription during productive elongation [83]. Further genetic elements might also exist that can trigger PTT.

Factors Opposing Premature Termination

Since PTT is disruptive to full-length transcription, many mechanisms and factors must exist to suppress this process (Figure 4). A major factor in preventing PTT is U1 snRNP, as described earlier. Furthermore, DNA sequences likely evolved to prevent harmful, yet reinforce beneficial, termination events. Around the TSS, poly(A) signals are depleted in the sense direction relative to the upstream antisense direction, while U1 snRNP recognition sites show the opposite pattern. This sequence asymmetry has been proposed to control promoter directionality 22., 53.. It has also been reported that codon usage biases coevolve with transcription termination machinery to suppress PTT and allow optimal gene expression [84]. Termination inhibition can also occur in a chromatin-dependent, and sequence-independent manner, as shown in Drosophila for the piwi-interacting RNA (piRNA) transcription regulator Cutoff. Cutoff prevents cleavage of nascent RNA at PAS by interfering with recruitment of the CPA complex and also protects processed transcripts from degradation [85]. In addition, RNA modifications might prevent PTT, because a negative correlation between N6-methyladenosine (m6A) modification of mRNA and PTT has been reported [62]. Other factors that could protect against PTT are RNA-binding proteins. For example, in neural progenitor cells, Sam68 binding to an intronic PAS in Aldh1a3 prevented its recognition and consequent PTT, promoting cell self-renewal [86]. It was further demonstrated in male germ cells that Sam68 interacts with U1 snRNP and is required for U1 snRNP recruitment to Sam68-regulated intronic PAS [87]. In addition, the FET family of proteins (FUS, EWSR1, and TAF15), mutations in which cause amyotrophic lateral sclerosis, has been demonstrated to interact with U1 snRNP 88., 89. and might participate in U1 snRNP-mediated PTT prevention.

Modifying Pol II activity may be another way to oppose PTT. When Pol II misincorporates a base or runs into a ‘roadblock’ such as a nucleosome, it arrests and backtracks. Pol II elongation is rescued by TFIIS stimulating the RNA endonuclease activity of Pol II, which produces a new 3′ end in the active site. It was recently shown that TFIIS stimulation of Pol II cleavage activity antagonises both premature TSS-proximal and gene end termination [90]. Pol II is further regulated by phosphorylation of the CTD of its largest subunit. Depletion of the CTD kinase CDK12 results in genome-wide increased IPA, indicating that CDK12 suppresses PTT [50]. Interestingly, many DNA repair genes harbour more IPA sites than other expressed genes, and are particularly sensitive to the loss of CDK12. IPA is also opposed by CPA factors that favour distal APA events, especially the CFIm complex subunits, PABPN1 and PABPC1 74., 91.. While this article was in review, a new study described a role for previously uncharacterised Pol II CTD-binding proteins SCAF4 and SCAF8 in binding to RNA upstream of early PAS and suppressing PTT in ~1300 human genes. Interestingly, both factors interact with the CPSF subcomplex of the CPA machinery; therefore, the mechanism of SCAF4 and SCAF8-mediated PTT prevention might involve an effect on the CPA complex [92].

Since premature cleavage and polyadenylation are mutually exclusive with splicing of the intron in which the PAS is located, competition between splicing and PTT can be anticipated. Indeed, knockdown of various splicing factors as well as inhibition of 5′ splice site recognition by antisense nucleotide consistently results in IPA 49., 74.. Therefore, antagonism between splicing and PTT goes beyond U1 snRNP-mediated PAS blocking, as described earlier. Further to splicing, Pol II elongation rates have been shown to influence PTT, because IPA events increase under conditions where Pol II elongation is slowed 93., 94., 95.. Slower Pol II provides a longer window of opportunity for CPA before the 3′ splice site is reached. Hence, changes in the balance between splicing and PTT, together with Pol II pausing, predispose it to termination.

Concluding Remarks and Future Perspectives

Here, we have outlined recent evidence that demonstrates the widespread occurrence of PTT in protein-coding genes, both TSS proximal and intragenic. PTT limits and opposes full-length transcription. Consequently, it may contribute to pathological processes, such as host gene downregulation during viral infection [96] and to carcinogenesis by inactivation of tumour suppressor genes [49]. PTT is globally suppressed by genic DNA sequence biases, and various cellular factors, such as U1 snRNP. By contrast, PTT can be also beneficial: it diversifies the transcriptome and proteome, and contributes to gene regulation. Gene regulation by PTT occurs in at least four kingdoms of life: Eubacteria, Fungi, Plantae, and Animalia (Box 1). Yet, the regulatory aspect of PTT has been largely overlooked in metazoan research. This is partially because the machinery triggering PTT varies in different organisms. The well-researched Nrd1–Nab3–Sen1 (NNS) complex regulating PTT in Saccharomyces cerevisiae is absent not only in plants and animals, but also in the less evolutionarily distant Schizosaccharomyces pombe. Even so, the mechanisms modulating PTT appear conserved. In budding yeast, NNS cooperates with PCF11 [97], and both NNS and PCF11 levels are controlled by PTT 98., 99.. In vertebrates, PCF11 uses PTT to regulate own levels, as well as the levels of other 3′-processing factors and transcriptional regulators [68]. Therefore, vertebrate PCF11 may have at least partially coopted the function of the yeast NNS complex. Currently, many questions about the regulation and function of PTT remain (see Outstanding Questions). In particular, we do not understand the mechanisms that lead to, or prevent, PTT. It is also unclear which of the factors enhancing and preventing PTT are able to do so directly. Despite these current gaps in our knowledge, we anticipate that PTT in metazoans is a critical feature of gene regulation. It is also possible that this process can be manipulated to achieve clinical and biotechnological benefits (Box 2).

Box 2. Implications of PTT in Medicine and Biotechnology.

Aberrant PTT is associated with several pathological processes: inactivation of tumour suppressor genes in chronic lymphocytic leukaemia [49]; downregulation of the expression of Kv11.1 potassium channel leading to a heart rhythm condition [111]; and loss of neuronal growth-associated factor stathmin-2 in neurodegeneration [112]. Blocking pathological PTT by antisense morpholinos has been suggested as a potential therapeutic strategy [113]. Conversely, induction of PTT to prevent formation of disease-inducing transcripts has also been proposed as a therapeutic approach. In particular, the insertion of intronic poly(A) signals upstream of toxic expanded CTG repeats in the DMPK gene as associated with myotonic dystrophy type 1 was demonstrated to revert the pathological phenotype of patient-derived induced pluripotent stem cells [114]. Similar approaches could also be used in biotechnology. Of note, two small-molecule modulators of APA have been discovered that promote distal-to-proximal APA usage [115]. These could prove a useful tool for APA and PTT manipulations.

Alt-text: Box 2

Outstanding Questions

  • What is the relative contribution of Pol II pausing and transcription termination to Pol II accumulation at promoters and to the control of transcription elongation?

  • What is the molecular mechanism of PTT and which pathways are involved? Is PTT always preceded by RNA cleavage?

  • What is the role of canonical RNA 3′-processing and termination factors in PTT? Are early and normal termination processes at gene ends co-regulated?

  • How are the prematurely terminated RNAs processed and what determines their stability?

  • Which biological processes depend on early termination? Which PTT events are beneficial and which are harmful? How widespread is transcriptional regulation by PTT in physiological conditions?

  • In Saccharomyces cerevisiae, many prematurely terminated transcripts use different TSS to the full-length transcript. Does this also occur in metazoans?

  • Which factors control PTT directly and which influence it indirectly? What further factors trigger or prevent PTT?

Acknowledgements

We are grateful to Sebastian Marquardt, Arnaud Krebs, and N.J.P. group members for helpful discussions, and to the reviewers for insightful comments. This work was supported by Wellcome Trust Investigator Award (107928/Z/15/Z), ERC Advanced Grant (339170) to N.J.P., and Marie Curie fellowship from EU FP7 (327985) to K.K.-G.

Glossary

Alternative polyadenylation (APA)

generation of alternative transcripts differing at their 3′ ends, occurring through usage of multiple poly(A) signals in one gene.

Cap-binding complex (CBC)

a protein complex binding the 5′ cap structure of eukaryotic mRNA and protecting it from decapping and degradation.

Chromatin immunoprecipitation followed by next-generation sequencing (ChIP-seq)

a method to assess the genome wide localisation of a protein on chromatin.

Cleavage and polyadenylation (CPA)

RNA-processing events occurring at the 3′ ends of pre-mRNA, leading to the formation of a mature mRNA.

Decapping

removal of the 5′ cap structure from RNA, predisposing it to degradation.

Exosome

a multisubunit protein complex harbouring 3′−5′ exoribonucleolytic and endoribonucleolytic activities that functions in RNA degradation.

Intronic polyadenylation (IPA)

APA occurring within an intron of a protein-coding gene, leading to a truncated transcript; also known as coding sequence APA (CDS-APA), upstream region APA (UR-APA), or alternative last exon (ALE); a subtype of PTT.

Mammalian native elongating transcript sequencing (mNET-seq)

a technique that maps nascent transcripts associated with specific Pol II isoforms in mammalian cells.

Noncoding RNA (ncRNA)

nonprotein-coding transcripts (note that some ncRNA might encode small peptides).

Open reading frame (ORF)

part of a transcript that can be translated, comprising a continuous stretch of codons beginning with a start codon and ending with a stop codon.

Polyadenylation signal [Poly(A) signal]

signal in the nascent RNA (typically including the hexamer AAUAAA) promoting RNA cleavage and polyadenylation.

Polyadenylation site (PAS)

RNA cleavage site to which poly(A) is added, typically located 10–30 nucleotides downstream of the poly(A) signal.

Premature transcription termination (PTT)

early release of Pol II from the gene template between the TSS and 3′-UTR of the gene; sometimes also referred to as transcription attenuation.

Promoter upstream transcripts (PROMPT)

known also as upstream antisense RNA, a class of human transcripts that are produced upstream of the promoters of active genes in a divergent orientation to the main (coding) transcript.

RNA Polymerase II (Pol II)

enzyme responsible for transcribing all protein-coding mRNA in eukaryotic cells, along with many ncRNA.

Transcription start site (TSS)

the first nucleotide in the DNA sequence that undergoes transcription into RNA.

Transcription termination

the process whereby an RNA polymerase stops transcribing and dissociates from its DNA template.

U1 small nuclear ribonucleoprotein complex (U1 snRNP)

comprises the small nuclear RNA (snRNA) U1 and its associated proteins; crucial for splicing.

Untranslated region (UTR)

5′ and 3′ parts of mRNA surrounding the ORF that are not translated into protein and often serve regulatory functions.

References

  • 1.Bertrand K. New features of the regulation of the tryptophan operon. Science. 1975;189:22–26. doi: 10.1126/science.1094538. [DOI] [PubMed] [Google Scholar]
  • 2.Artz S.W., Broach J.R. Histidine regulation in Salmonella typhimurium: an activator attenuator model of gene regulation. Proc. Natl. Acad. Sci. U. S. A. 1975;72:3453–3457. doi: 10.1073/pnas.72.9.3453. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Evans R. Premature termination during adenovirus transcription. Nature. 1979;278:367–370. doi: 10.1038/278367a0. [DOI] [PubMed] [Google Scholar]
  • 4.Di Giammartino D.C., Manley J.L. New links between mRNA polyadenylation and diverse nuclear pathways. Mol. Cell. 2014;37:644–649. doi: 10.14348/molcells.2014.0177. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Lemay J-F., Bachand F. Fail-safe transcription termination: because one is never enough. RNA Biol. 2015;12:927–932. doi: 10.1080/15476286.2015.1073433. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Libri D. Endless quarrels at the end of genes. Mol. Cell. 2015;60:192–194. doi: 10.1016/j.molcel.2015.10.004. [DOI] [PubMed] [Google Scholar]
  • 7.Loya T.J., Reines D. Recent advances in understanding transcription termination by RNA polymerase II. F1000Res. 2016;5:1478. doi: 10.12688/f1000research.8455.1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Porrua O., Libri D. Transcription termination and the control of the transcriptome: why, where and how to stop. Nat. Rev. Mol. Cell Biol. 2015;16:190–202. doi: 10.1038/nrm3943. [DOI] [PubMed] [Google Scholar]
  • 9.Porrua O. Transcription termination: variations on common themes. Trends Genet. 2016;32:508–522. doi: 10.1016/j.tig.2016.05.007. [DOI] [PubMed] [Google Scholar]
  • 10.Proudfoot N.J. Transcriptional termination in mammals: stopping the RNA polymerase II juggernaut. Science. 2016;352:aad9926. doi: 10.1126/science.aad9926. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Shi Y., Manley J.L. The end of the message: multiple protein-RNA interactions define the mRNA polyadenylation site. Genes Dev. 2015;29:889–897. doi: 10.1101/gad.261974.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Mayr C. Evolution and biological roles of alternative 3′UTRs. Trends Cell Biol. 2016;26:227–237. doi: 10.1016/j.tcb.2015.10.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Neve J., Furger A. Alternative polyadenylation: less than meets the eye? Biochem. Soc. Trans. 2014;42:1190–1195. doi: 10.1042/BST20140054. [DOI] [PubMed] [Google Scholar]
  • 14.Tian B., Manley J.L. Alternative polyadenylation of mRNA precursors. Nat. Rev. Mol. Cell Biol. 2017;18:18–30. doi: 10.1038/nrm.2016.116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Turner R.E. Alternative polyadenylation in the regulation and dysregulation of gene expression. Semin. Cell Dev. Biol. 2018;75:61–69. doi: 10.1016/j.semcdb.2017.08.056. [DOI] [PubMed] [Google Scholar]
  • 16.Zheng D., Tian B. RNA-binding proteins in regulation of alternative cleavage and polyadenylation. Adv. Exp. Med. Biol. 2014;825:97–127. doi: 10.1007/978-1-4939-1221-6_3. [DOI] [PubMed] [Google Scholar]
  • 17.Jonkers I., Lis J.T. Getting up to speed with transcription elongation by RNA polymerase II. Nat. Rev. Mol. Cell Biol. 2015;16:167–177. doi: 10.1038/nrm3953. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Ehrensberger A.H. Mechanistic interpretation of promoter-proximal peaks and RNAPII density maps. Cell. 2013;154:713–715. doi: 10.1016/j.cell.2013.07.032. [DOI] [PubMed] [Google Scholar]
  • 19.Brannan K. mRNA decapping factors and the exonuclease Xrn2 function in widespread premature termination of RNA polymerase II transcription. Mol. Cell. 2012;46:311–324. doi: 10.1016/j.molcel.2012.03.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Wagschal A. Microprocessor, Setx, Xrn2, and Rrp6 co-operate to induce premature termination of transcription by RNAPII. Cell. 2012;150:1147–1157. doi: 10.1016/j.cell.2012.08.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Nechaev S. Global analysis of short RNAs reveals widespread promoter-proximal stalling and arrest of Pol II in Drosophila. Science. 2010;327:335–338. doi: 10.1126/science.1181421. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Almada A.E. Promoter directionality is controlled by U1 snRNP and polyadenylation signals. Nature. 2013;499:360–363. doi: 10.1038/nature12349. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Krebs A.R. Genome-wide single-molecule footprinting reveals high RNA polymerase II turnover at paused promoters. Mol. Cell. 2017;67:411–422. doi: 10.1016/j.molcel.2017.06.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Steurer B. Live-cell analysis of endogenous GFP-RPB1 uncovers rapid turnover of initiating and promoter-paused RNA Polymerase II. Proc. Natl. Acad. Sci. U. S. A. 2018;115:E4368–E4376. doi: 10.1073/pnas.1717920115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Darzacq X. In vivo dynamics of RNA polymerase II transcription. Nat. Struct. Mol. Biol. 2007;14:796–806. doi: 10.1038/nsmb1280. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Nilson K.A. Oxidative stress rapidly stabilizes promoter-proximal paused Pol II across the human genome. Nucleic Acids Res. 2017;45:11088–11105. doi: 10.1093/nar/gkx724. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Erickson B. Dynamic turnover of paused Pol II complexes at human promoters. Genes Dev. 2018;32:1215–1225. doi: 10.1101/gad.316810.118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Adelman K., Lis J.T. Promoter-proximal pausing of RNA polymerase II: emerging roles in metazoans. Nat. Rev. Genet. 2012;13:720–731. doi: 10.1038/nrg3293. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Chen F.X. Born to run: control of transcription elongation by RNA polymerase II. Nat. Rev. Mol. Cell Biol. 2018;19:464–478. doi: 10.1038/s41580-018-0010-5. [DOI] [PubMed] [Google Scholar]
  • 30.Henriques T. Stable pausing by RNA polymerase II provides an opportunity to target and integrate regulatory signals. Mol. Cell. 2013;52:517–528. doi: 10.1016/j.molcel.2013.10.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Jonkers I. Genome-wide dynamics of Pol II elongation and its interplay with promoter proximal pausing, chromatin, and exons. Elife. 2014;3 doi: 10.7554/eLife.02407. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Chen F. Stably paused genes revealed through inhibition of transcription initiation by the TFIIH inhibitor triptolide. Genes Dev. 2015;29:39–47. doi: 10.1101/gad.246173.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Shao W., Zeitlinger J. Paused RNA polymerase II inhibits new transcriptional initiation. Nat. Genet. 2017;49:1045–1051. doi: 10.1038/ng.3867. [DOI] [PubMed] [Google Scholar]
  • 34.Titov D.V. XPB, a subunit of TFIIH, is a target of the natural product triptolide. Nat. Chem. Biol. 2011;7:182–188. doi: 10.1038/nchembio.522. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Alekseev S. Transcription without XPB establishes a unified helicase-independent mechanism of promoter opening in eukaryotic gene expression. Mol. Cell. 2017;65:504–514. doi: 10.1016/j.molcel.2017.01.012. [DOI] [PubMed] [Google Scholar]
  • 36.Gressel S. CDK9-dependent RNA polymerase II pausing controls transcription initiation. Elife. 2017;6 doi: 10.7554/eLife.29736. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Vasudevan S. Non-stop decay—a new mRNA surveillance pathway. BioEssays. 2002;24:785–788. doi: 10.1002/bies.10153. [DOI] [PubMed] [Google Scholar]
  • 38.Yao P. Coding region polyadenylation generates a truncated tRNA synthetase that counters translation repression. Cell. 2012;149:88–100. doi: 10.1016/j.cell.2012.02.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Kaida D. U1 snRNP protects pre-mRNAs from premature cleavage and polyadenylation. Nature. 2010;468:664–668. doi: 10.1038/nature09479. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Venters C.C. U1 snRNP telescripting: suppression of premature transcription termination in introns as a new layer of gene regulation. Cold Spring Harb. Perspect. Biol. 2019;11 doi: 10.1101/cshperspect.a032235. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Iasillo C. ARS2 is a general suppressor of pervasive transcription. Nucleic Acids Res. 2017;45:10229–10241. doi: 10.1093/nar/gkx647. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Chiu A.C. Transcriptional pause sites delineate stable nucleosome-associated premature polyadenylation suppressed by U1 snRNP. Mol. Cell. 2018;69:648–663. doi: 10.1016/j.molcel.2018.01.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Alt F.W. Synthesis of secreted and membrane-bound immunoglobulin mu heavy chains is directed by mRNAs that differ at their 3′ ends. Cell. 1980;20:293–301. doi: 10.1016/0092-8674(80)90615-7. [DOI] [PubMed] [Google Scholar]
  • 44.Early P. Two mRNAs can be produced from a single immunoglobulin mu gene by alternative RNA processing pathways. Cell. 1980;20:313–319. doi: 10.1016/0092-8674(80)90617-0. [DOI] [PubMed] [Google Scholar]
  • 45.Rogers J. Two mRNAs with different 3′ ends encode membrane-bound and secreted forms of immunoglobulin mu chain. Cell. 1980;20:303–312. doi: 10.1016/0092-8674(80)90616-9. [DOI] [PubMed] [Google Scholar]
  • 46.Hoque M. Analysis of alternative cleavage and polyadenylation by 3′ region extraction and deep sequencing. Nat. Meth. 2013;10:133–139. doi: 10.1038/nmeth.2288. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Lianoglou S. Ubiquitously transcribed genes use alternative polyadenylation to achieve tissue-specific expression. Genes Dev. 2013;27:2380–2396. doi: 10.1101/gad.229328.113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Singh I. Widespread intronic polyadenylation diversifies immune cell transcriptomes. Nat. Commun. 2018;9:1716. doi: 10.1038/s41467-018-04112-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Lee S.-H. Widespread intronic polyadenylation inactivates tumour suppressor genes in leukaemia. Nature. 2018;561:127. doi: 10.1038/s41586-018-0465-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Dubbury S.J. CDK12 regulates DNA repair genes by suppressing intronic polyadenylation. Nature. 2018;564:141–145. doi: 10.1038/s41586-018-0758-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Preker P. RNA exosome depletion reveals transcription upstream of active human promoters. Science. 2008;322:1851–1854. doi: 10.1126/science.1164096. [DOI] [PubMed] [Google Scholar]
  • 52.Seila A.C. Divergent transcription from active promoters. Science. 2008;322:1849–1851. doi: 10.1126/science.1162253. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Ntini E. Polyadenylation site-induced decay of upstream transcripts enforces promoter directionality. Nat. Struct. Mol. Biol. 2013;20:923–928. doi: 10.1038/nsmb.2640. [DOI] [PubMed] [Google Scholar]
  • 54.Andersen P.R. The human cap-binding complex is functionally connected to the nuclear RNA exosome. Nat. Struct. Mol. Biol. 2013;20:1367–1376. doi: 10.1038/nsmb.2703. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Schmid M., Jensen T.H. Controlling nuclear RNA levels. Nat. Rev. Genet. 2018;19:518. doi: 10.1038/s41576-018-0013-2. [DOI] [PubMed] [Google Scholar]
  • 56.Williamson L. UV irradiation induces a non-coding RNA that functionally opposes the protein encoded by the same gene. Cell. 2017;168:843–855. doi: 10.1016/j.cell.2017.01.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Davis M.J. Differential use of signal peptides and membrane domains is a common occurrence in the protein output of transcriptional units. PLoS Genet. 2006;2:e46. doi: 10.1371/journal.pgen.0020046. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Ly P.T. Alternative polyadenylation expands the mRNA isoform repertoire of human CD46. Gene. 2017;625:21–30. doi: 10.1016/j.gene.2017.05.004. [DOI] [PubMed] [Google Scholar]
  • 59.Mbita Z. De-regulation of the RBBP6 isoform 3/DWNN in human cancers. Mol. Cell. Biochem. 2012;362:249–262. doi: 10.1007/s11010-011-1150-5. [DOI] [PubMed] [Google Scholar]
  • 60.Giammartino D.C.D. RBBP6 isoforms regulate the human polyadenylation machinery and modulate expression of mRNAs with AU-rich 3′ UTRs. Genes Dev. 2014;28:2248–2260. doi: 10.1101/gad.245787.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Ni T.K., Kuperwasser C. Premature polyadenylation of MAGI3 produces a dominantly-acting oncogene in human breast cancer. Elife. 2016;5 doi: 10.7554/eLife.14730. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Ni T.K. Premature polyadenylation of MAGI3 is associated with diminished N6-methyladenosine in its large internal exon. Sci. Rep. 2018;8:1415. doi: 10.1038/s41598-018-19916-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Mueller A.A. Intronic polyadenylation of PDGFRα in resident stem cells attenuates muscle fibrosis. Nature. 2016;540:276–279. doi: 10.1038/nature20160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Audibert A., Simonelig M. Autoregulation at the level of mRNA 3′ end formation of the suppressor of forked gene of Drosophila melanogaster is conserved in Drosophila virilis. Proc. Natl. Acad. Sci. U. S. A. 1998;95:14302–14307. doi: 10.1073/pnas.95.24.14302. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Pan Z. An intronic polyadenylation site in human and mouse CstF-77 genes suggests an evolutionarily conserved regulatory mechanism. Gene. 2006;366:325–334. doi: 10.1016/j.gene.2005.09.024. [DOI] [PubMed] [Google Scholar]
  • 66.Luo W. The conserved intronic cleavage and polyadenylation site of CstF-77 Gene imparts control of 3′ end processing activity through feedback autoregulation and by U1 snRNP. PLoS Genet. 2013;9 doi: 10.1371/journal.pgen.1003613. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Devany E. Intronic cleavage and polyadenylation regulates gene expression during DNA damage response through U1 snRNA. Cell Discov. 2016;2 doi: 10.1038/celldisc.2016.13. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Kamieniarz-Gdula K. Selective roles of vertebrate PCF11 in premature and full-length transcript termination. Mol. Cell. 2019;74:158–172. doi: 10.1016/j.molcel.2019.01.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Wang R. Regulation of intronic polyadenylation by PCF11 impacts mRNA expression of long genes. Cell Rep. 2019;26:2766–2778. doi: 10.1016/j.celrep.2019.02.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Martin G. Genome-wide analysis of Pre-mRNA 3′ end processing reveals a decisive role of human cleavage factor I in the regulation of 3′ UTR length. Cell Rep. 2012;1:753–763. doi: 10.1016/j.celrep.2012.05.003. [DOI] [PubMed] [Google Scholar]
  • 71.Nojima T. Mammalian NET-seq reveals genome-wide nascent transcription coupled to RNA processing. Cell. 2015;161:526–540. doi: 10.1016/j.cell.2015.03.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Elkon R. E2F mediates enhanced alternative polyadenylation in proliferation. Genome Biol. 2012;13:R59. doi: 10.1186/gb-2012-13-7-r59. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Taliaferro J.M. Distal alternative last exons localize mRNAs to neural projections. Mol. Cell. 2016;61:821–833. doi: 10.1016/j.molcel.2016.01.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Li W. Systematic profiling of poly(A)+ transcripts modulated by core 3′ end processing and splicing factors reveals regulatory rules of alternative cleavage and polyadenylation. PLoS Genet. 2015;11 doi: 10.1371/journal.pgen.1005166. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Szczepińska T. DIS3 shapes the RNA polymerase II transcriptome in humans by degrading a variety of unwanted transcripts. Genome Res. 2015;25:1622–1633. doi: 10.1101/gr.189597.115. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Baillat D. Integrator, a multiprotein mediator of small nuclear RNA processing, associates with the C-Terminal repeat of RNA polymerase II. Cell. 2005;123:265–276. doi: 10.1016/j.cell.2005.08.019. [DOI] [PubMed] [Google Scholar]
  • 77.Rienzo M., Casamassimi A. Integrator complex and transcription regulation: recent findings and pathophysiology. Biochim. Biophys. Acta Gene Regul. Mech. 2016;1859:1269–1280. doi: 10.1016/j.bbagrm.2016.07.008. [DOI] [PubMed] [Google Scholar]
  • 78.Dominski Z. A CPSF-73 homologue is required for cell cycle progression but not cell growth and interacts with a protein having features of CPSF-100. Mol. Cell. Biol. 2005;25:1489–1500. doi: 10.1128/MCB.25.4.1489-1500.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Gardini A. Integrator regulates transcriptional initiation and pause release following activation. Mol. Cell. 2014;56:128–139. doi: 10.1016/j.molcel.2014.08.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Stadelmayer B. Integrator complex regulates NELF-mediated RNA polymerase II pause/release and processivity at coding genes. Nat. Commun. 2014;5:5531. doi: 10.1038/ncomms6531. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Skaar J.R. The Integrator complex controls the termination of transcription at diverse classes of gene targets. Cell Res. 2015;25:288–305. doi: 10.1038/cr.2015.19. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Baillat D., Wagner E.J. Integrator: surprisingly diverse functions in gene expression. Trends Biochem. Sci. 2015;40:257–264. doi: 10.1016/j.tibs.2015.03.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Cinghu S. Intragenic enhancers attenuate host gene expression. Mol. Cell. 2017;68:104–117.e6. doi: 10.1016/j.molcel.2017.09.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Zhou Z. Codon usage biases co-evolve with transcription termination machinery to suppress premature cleavage and polyadenylation. Elife. 2018;7 doi: 10.7554/eLife.33569. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Chen Y.-C.A. Cutoff suppresses RNA polymerase II termination to ensure expression of piRNA precursors. Mol. Cell. 2016;63:97–109. doi: 10.1016/j.molcel.2016.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.La Rosa P. Sam68 promotes self-renewal and glycolytic metabolism in mouse neural progenitor cells by modulating Aldh1a3 pre-mRNA 3′-end processing. Elife. 2016;5 doi: 10.7554/eLife.20750. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Naro C. Functional interaction between U1snRNP and Sam68 insures proper 3′ end pre-mRNA processing during germ cell differentiation. Cell Rep. 2019;26:2929–2941. doi: 10.1016/j.celrep.2019.02.058. [DOI] [PubMed] [Google Scholar]
  • 88.Jobert L. Human U1 snRNA forms a new chromatin-associated snRNP with TAF15. EMBO Rep. 2009;10:494–500. doi: 10.1038/embor.2009.24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Chi B. Interactome analyses revealed that the U1 snRNP machinery overlaps extensively with the RNAP II machinery and contains multiple ALS/SMA-causative proteins. Sci. Rep. 2018;8:8755. doi: 10.1038/s41598-018-27136-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Sheridan R.M. Widespread backtracking by RNA Pol II is a major effector of gene activation, 5′ pause release, termination, and transcription elongation rate. Mol. Cell. 2018;73:107–118. doi: 10.1016/j.molcel.2018.10.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Zhu Y. Molecular mechanisms for CFIm-mediated regulation of mRNA alternative polyadenylation. Mol. Cell. 2018;69:62–74. doi: 10.1016/j.molcel.2017.11.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Gregersen L.H. SCAF4 and SCAF8, mRNA anti-terminator proteins. Cell. 2019 doi: 10.1016/j.cell.2019.04.038. Published online May 10, 2019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Cui Y., Denis C.L. In vivo evidence that defects in the transcriptional elongation factors RPB2, TFIIS, and SPT5 enhance upstream poly(A) site utilization. Mol. Cell. Biol. 2003;23:7887–7901. doi: 10.1128/MCB.23.21.7887-7901.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Yang Y. PAF complex plays novel subunit-specific roles in alternative cleavage and polyadenylation. PLoS Genet. 2016;12 doi: 10.1371/journal.pgen.1005794. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Liu X. Transcription elongation rate has a tissue-specific impact on alternative cleavage and polyadenylation in Drosophila melanogaster. RNA. 2017;23:1807–1816. doi: 10.1261/rna.062661.117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Tang S. Herpes simplex virus ICP27 regulates alternative pre-mRNA polyadenylation and splicing in a sequence-dependent manner. Proc. Natl. Acad. Sci. U. S. A. 2016;113:12256–12261. doi: 10.1073/pnas.1609695113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Grzechnik P. Pcf11 orchestrates transcription termination pathways in yeast. Genes Dev. 2015;29:849–861. doi: 10.1101/gad.251470.114. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Arigo J.T. Regulation of yeast NRD1 expression by premature transcription termination. Mol. Cell. 2006;21:641–651. doi: 10.1016/j.molcel.2006.02.005. [DOI] [PubMed] [Google Scholar]
  • 99.Creamer T.J. Transcriptome-wide binding sites for components of the Saccharomyces cerevisiae non-poly(A) termination pathway: Nrd1, Nab3, and Sen1. PLoS Genet. 2011;7 doi: 10.1371/journal.pgen.1002329. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Bastet L. Maestro of regulation: riboswitches orchestrate gene expression at the levels of translation, transcription and mRNA decay. RNA Biol. 2018;15:679–682. doi: 10.1080/15476286.2018.1451721. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 101.Sherwood A.V., Henkin T.M. Riboswitch-mediated gene regulation: novel RNA architectures dictate gene expression responses. Annu. Rev. Microbiol. 2016;70:361–374. doi: 10.1146/annurev-micro-091014-104306. [DOI] [PubMed] [Google Scholar]
  • 102.Colin J. Cryptic transcription and early termination in the control of gene expression. Genet. Res. Int. 2011;2011 doi: 10.4061/2011/653494. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Arndt K.M., Reines D. Termination of transcription of short noncoding RNAs by RNA Polymerase II. Annu. Rev. Biochem. 2015;84:381–404. doi: 10.1146/annurev-biochem-060614-034457. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Merran J., Corden J.L. Yeast RNA-binding protein Nab3 regulates genes involved in nitrogen metabolism. Mol. Cell. Biol. 2017;37 doi: 10.1128/MCB.00154-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Bresson S. Nuclear RNA decay pathways aid rapid remodeling of gene expression in yeast. Mol. Cell. 2017;65:787–800. doi: 10.1016/j.molcel.2017.01.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 106.Whalen C. RNA polymerase II transcription attenuation at the yeast DNA repair gene, DEF1, involves Sen1-dependent and polyadenylation site-dependent termination. G3 (Bethesda) 2018;8:2043–2058. doi: 10.1534/g3.118.200072. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Quesada V. Autoregulation of FCA pre-mRNA processing controls Arabidopsis flowering time. EMBO J. 2003;22:3142–3152. doi: 10.1093/emboj/cdg305. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 108.Hornyik C. The spen family protein FPA controls alternative cleavage and polyadenylation of RNA. Dev. Cell. 2010;18:203–213. doi: 10.1016/j.devcel.2009.12.009. [DOI] [PubMed] [Google Scholar]
  • 109.Liu F. Targeted 3′ processing of antisense transcripts triggers Arabidopsis FLC chromatin silencing. Science. 2010;327:94–97. doi: 10.1126/science.1180278. [DOI] [PubMed] [Google Scholar]
  • 110.Swiezewski S. Cold-induced silencing by long antisense transcripts of an Arabidopsis Polycomb target. Nature. 2009;462:799–802. doi: 10.1038/nature08618. [DOI] [PubMed] [Google Scholar]
  • 111.Gong Q. Upregulation of functional Kv11.1 isoform expression by inhibition of intronic polyadenylation with antisense morpholino oligonucleotides. J. Mol. Cell. Cardiol. 2014;76:26–32. doi: 10.1016/j.yjmcc.2014.08.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Melamed Z. Premature polyadenylation-mediated loss of stathmin-2 is a hallmark of TDP-43-dependent neurodegeneration. Nat. Neurosci. 2019;22:180–190. doi: 10.1038/s41593-018-0293-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Gong Q., Zhou Z. Regulation of isoform expression by blocking polyadenylation signal sequences with morpholinos. Methods Mol. Biol. 2017;1565:141–150. doi: 10.1007/978-1-4939-6817-6_12. [DOI] [PubMed] [Google Scholar]
  • 114.Gao Y. Genome therapy of myotonic dystrophy type 1 iPS cells for development of autologous stem cell therapy. Mol. Ther. 2016;24:1378–1387. doi: 10.1038/mt.2016.97. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115.Araki S. Decoding transcriptome dynamics of genome-encoded polyadenylation and autoregulation with small-molecule modulators of alternative polyadenylation. Cell Chem. Biol. 2018;25:1470–1484. doi: 10.1016/j.chembiol.2018.09.006. [DOI] [PubMed] [Google Scholar]

RESOURCES