Abstract
RNA transcription errors are transient, yet frequent, events that do have consequences for the cell. However, until recently we lacked the tools to empirically measure and study these errors. Advances in RNA library preparation and next generation sequencing (NGS) have allowed the spectrum of transcription errors to be empirically measured over the entire transcriptome and in nascent transcripts. Combining these powerful methods with forward and reverse genetic strategies has refined our understanding of transcription factors known to enhance RNA accuracy and will enable the discovery of new candidates. Furthermore, these approaches will shed additional light on the complex interplay between transcription fidelity and other DNA transactions, such as replication and repair, and explore a role for transcription errors in cellular evolution and disease.
Keywords: Transcription fidelity DNA break repair TFIIS GreA GreB DksA Epimutation Transcription error Genomic integrity Backtracked RNA polymerase DNA resection Transcription factor Genetic noise Non-genetic phenotypic heterogeneity
1. Introduction
DNA→RNA→protein has been the mantra of biology for over half a century [1]; but while this tenet relies on RNA as the messenger moderating the relationship between genotype and phenotype, little is appreciated of the consequences of errors in the RNA message. And yet far more variability exists between cells than can be explained by DNA variants and a spontaneous mutation rate of ~10−10 [2], begging the question: What is our mantra missing? To bridge this gap, biologists have turned to epigenetics. While the word has been co-opted in recent years for chromatin modification and histone markers, in the purest sense, “epigenetic” describes gene expression alterations that cannot be accounted for by the DNA sequence [3,4]. As gene expression is regulated by complex networks, there exists a dizzying number of epigenetic modifiers that exert their effects through chromatin accessibility, phase separation [5], transcription, post-transcriptional modifications, translation and post-translational modifications. Dysregulation or errors in any of these processes can affect gene expression and phenotype. Thereby, on a larger scale epigenetic errors can promote heterogeneity between cells and thereby evolution. Ribosomes make translation errors at rates estimated between 10−2 and 10−6 per codon and are a source of non-genetic phenotypic variation (reviewed in [6]). However, transcription as a source of phenotypically relevant errors has been largely overlooked. If we narrow our definition of epigenetics to include specifically those gene expression changes that are heritable, any purported epigenetic mechanism must produce heritable change. One goal of this review is to not only illustrate the importance of transcription errors, but also elucidate their potential to generate heritable change.
Compared to the well-studied mechanisms of DNA mutation, less is known about the biology of RNA errors or their role in health and disease. Broadly defined, transcription errors, also known as “epimutations”, are sequence deviations from the DNA template that occur in nascent transcripts. They are distinct from downstream RNA modifications and editing but can exert similar, and potentially functional, consequences on the heterogeneity of the protein pool. Transcription errors stem from the inherently error-prone activity of DNA-dependent RNA polymerases (RNAP) with error rates reported between 10−5 and 10−6 events per base, >5,000-fold higher than DNA replication errors [7]. Transcription factors, DNA damage, certain sequence motifs and the general genomic environment are all believed to modulate the frequency of transcription errors, in addition to external conditions (e.g. oxidative stress [8]). A class of proteins exists to enhance RNAP proofreading - so-called “RNA fidelity factors” - that together share a fascinating link to genomic stability, epigenetics, phenotypic heterogeneity, and cellular evolution through mechanisms under current investigation. This review intends to present what is known about transcription errors, highlight essential, unanswered questions in the field, and predict that transcription fidelity will define a new paradigm in evolution and disease over the coming years.
2. Mechanism of transcription errors
2.1. General Overview
Transcription errors encompass both misincorporations and frameshifts, giving rise to point epimutations and indels, respectively. Frameshifting errors tend to be more destabilizing to the RNA:DNA hybrid in the catalytic center of RNAP and thus are much less frequent (~10−7 per base), with insertions outnumbering deletions [7]. Error rates vary greatly between species, polymerases, genes and even when comparing nascent to total RNA, the latter highlighting the importance of proofreading (Table 1). In fact, one study of nascent transcripts reported that a staggering 1–3% of elongation complexes captured by this method were misincorporated, far in excess of any transcriptome-wide error rate [9]. In eukaryotes RNAPII commits the fewest errors and RNAPIII the most, but both appear more accurate than their bacterial counterpart [7]. The cause and consequences of these differences remain to be discovered.
Table 1:
Cumulative transcription error rate per transcribed nucleotide
| Organism | Error rate | Method |
|---|---|---|
| C. elegans [15] | 5.3 × 10−6 | Rep-seq |
| Buchnera aphidicola [18,102] | 7.7 × 10−5 | CirSeq |
| E. coli MG1655 [18,102] | 9.9 × 10−5 | CirSeq |
| S. cerevisiae [8] | 5.6 × 10−5 | ARC-seq |
| S. cerevisiae [7] | 4.9 × 10−6 | modCirSeq |
The cumulative wild-type transcription error rate per transcribed nucleotide for each organism in vivo includes all base substitution and indel events. All methods sequence multiple repeats of an RNA fragment to generate a consensus sequence revealing true epimutations and to negate errors inherent to reverse transcription, PCR and Illumina sequencing. Buchnera aphidicola is a low G+C content (26%) bacterial endosymbiot whose genome is enriched with long tracts of adenosine and thymine; E. coli has a G+C content of 50.8%. The E. coli error rate is the average transcription error rate over all conditions tested (tryptic soy broth complex media and M9 glucose minimal media each sampled at midlog and stationary phase), since there were no significant differences observed in overall transcription error rates between any pair of individual conditions tested. The base substitution errors rates between E. coli and Buchnera, 8.23 × 10−5 and 4.67 × 10−5, respectively, were found to be significantly different [18]. Moreover, a 10-fold increase in transcription insertions was found in Buchnera compared to E. coli that can be ascribed to the incidence of homopolymeric runs in the two genomes [102]. The two S. cerevisiae error rates were determined from log-phase yeast. For comparison with the cumulative transcription error rate for E. coli MG1655, the cumulative in vivo mutation rate for MG1655 is 2.2 × 10−10 mutations per nucleotide per generation as determined by whole-genome sequencing [103]. (Rep-seq, replicated sequencing; CirSeq, circle sequencing; ARC-seq, accurate RNA consensus sequencing; modCirSeq, modified circle sequencing).
2.2. Spontaneous Errors
RNAPs are error-prone and regularly make spontaneous errors, which are defined as those errors arising solely due to intrinsic properties of the polymerase. An important factor regulating spontaneous errors is the thermodynamic stability of the RNA:DNA hybrid within the transcription bubble. Screens for RNAP fidelity mutants along with biochemical approaches have revealed key domains in the Rpb1 and Rpb9 subunits in yeast [10,11] or in the β and β’ subunits in E. coli [12–14] that modulate spontaneous errors.
There exists no clear epimutational signature for the detection of spontaneous transcription errors. However, globally G>A errors are most common prior to proofreading, after which C>U transitions predominate in RNAP I and II transcribed genes [7,9,15–18]. Cytosine deamination likely contributes to C>U bias as this pattern is absent from nascent transcripts [9]. Curiously, in RNAPIII transcribed genes G>A errors remain common in the presence of general fidelity factors, supporting the premise that RNAPIII is naturally more error-prone or resists proofreading.
Once thought to be purely stochastic, new insights have emerged from patterns in which spontaneous errors occur. For example, in addition to their role in transcription termination, poly-T tracts trigger RNAP backtracking with the potential to induce indels through template slippage, and poly-A tracts have a similar effect [19]. The anti-termination λ N factor inhibits transcription slippage at poly-T sequences by stabilizing the RNA:DNA hybrid at the 5’ end, and thus preventing loss of register between transcript and template [20].
Less characterized is the overlap between polymerase pausing and spontaneous transcription errors. While errors are known to trigger pausing, the details of where and when remain obscure in vivo. Using new NGS technologies (e.g. NET-seq and RNET-seq), routine RNAPII pausing in gene bodies has been observed [21]. Errors are enriched at paused/backtracked transcription complexes, as expected [16]. But interestingly, pause sites are not randomly distributed and instead correlate with specific sequence motifs, suggesting that errors also may not occur randomly [9,21]. Whether purported pause site motifs directly induce RNA errors is unknown. Perhaps stalled or backtracked RNAP complexes can themselves trigger errors through a low-fidelity conformation, harkening back to the chicken-or-egg conundrum: Which came first – the stall or the error? Importantly, this logic does not address other gene regulatory mechanisms for RNAP pausing within gene bodies, which likely factor in. Ultimately, these patterns, while interesting leads, have no power to finely predict where an error will occur and cannot supplant stochasticity as the chief mechanism for spontaneous transcription errors.
2.3. Induced Errors
DNA damage is a potent source of transcription errors, but not all damage is created equal. Bulky lesions such as DNA adducts inhibit elongation and require repair to avoid premature transcription termination [22]. Translesion RNA synthesis can occur across deaminated (uracil), oxidized (8-oxoguanine) and methylated (O6-methylguanine) bases as well as abasic sites to ensure full-length transcripts, but at the cost of accuracy [23,24]. In one example, transcription across UV-induced pyrimidine dimers induced misincorporation events and multiple nucleotide deletions [25]. This process by which DNA damage generates RNA errors with potential phenotypic consequences is referred to as transcriptional mutagenesis [26,27].
Many essential questions regarding the mechanisms of transcription errors remain. Future studies should aim to address the relationship between RNA errors and gene expression, DNA replication, GC content, poly-A:T tracts, telomeric/centromeric location, CpG islands, methylation state, chromatin remodelers, R-loops, common fragile sites and evolvability.
3. Detection of transcription errors
Perhaps the earliest error rate for RNAP was reported in vitro in 1975 using holoenzyme purified from E. coli [28]; however, studying these errors in vivo remained an intractable problem for decades as the field awaited new technologies. Transcription fidelity research was limited to cleverly engineered reporter systems and complementation screening in single-cell model organisms like E. coli and yeast. Some systems leveraged gene networks with positive feedback loops, like the lac operon in E. coli, to capture transient transcription errors through epigenetic phenotypic switching [29]. Another approach incorporated a Cre/Lox gal reporter system that rendered E. coli Gal+ following a site-specific transcription error [30]. While these approaches are efficient methods to screen for transcription fidelity modifiers, epigenetic heterogeneity and heritability, they provide only indirect evidence for transcription errors and limited information on the type of errors involved.
Following the advent of NGS platforms, transcription errors remained technically challenging to sequence as individual events are rare, transient, and RNA requires reverse transcription (RT) into cDNA – itself an error-prone process – for analysis. Early attempts did not control for RT errors [17], which was solved by barcoding individual RNA fragments prior to multiple rounds of cDNA synthesis and termed replicated sequencing (Rep-seq) [15]. By aligning families of cDNAs sharing the same barcode, true RNA errors present in all copies could be resolved from artifact. However, independent rounds of reverse transcription led to inconsistent cDNA synthesis and low yields [15]. These technical hurdles were addressed with the development of circle sequencing (CirSeq), a technique that generates tandem cDNA repeats from a single mRNA fragment to overcome RT and DNA amplification artifact [31,32]. (Reviewed in [33].)
The CirSeq library preparation protocol was recently modified (modCirSeq) to replace the metal ion-based RNA fragmentation strategy, which artificially increased the detected error rate in previous attempts, with an enzymatic reaction [7]. Independently, CirSeq was also adapted for easy discrimination between PCR duplicates, RNA fragments and independent samples via barcoding and named Accurate RNA Consensus sequencing (ARC-seq) (Fig. 1) [8]. Both library preparation methods were successfully applied in yeast, yielding the most accurate transcription error rates reported in eukaryotes to-date [7,8]. For wild-type S. cerevisiae in log phase, ARC-seq provided an RNA epimutation rate of 5.6 × 10−5, and modCirSeq provided a rate of 4.9 × 10−6 (Table 1). It will be interesting to see how this 10-fold difference in the cumulative RNA error rates for yeast is resolved; differences in RNA fragmentation (zinc-based vs enzymatic approaches) and different custom-built bioinformatic pipelines are possible contributing factors. In sum, these exciting new technologies have rendered a prohibitively challenging field accessible to any lab with access to NGS. We predict ARC-seq and modCirSeq will be leveraged to compare transcription error rates between species, individuals, cell types, genomic loci, and states of health and disease, yielding discoveries that could revolutionize the field of epigenetics.
Fig. 1.
Accurate RNA Consensus sequencing (ARC-seq). (1) Barcoded RNA adaptors are ligated to a pool of fragmented RNA molecules to uniquely identify individual RNA fragments. Ligated RNAs are then circularized and subjected to rolling-circle reverse transcription generating a cDNA containing tandem multiple copies of each RNA molecule (each copy is denoted by green, red and blue lines in the cDNA). (2) AscI primers are annealed to the cDNA multimer that is restricted into monomers that are copies of the original RNA fragment. (3) Each monomer is tagged with a unique index to identify the different monomer copies and then amplified and Illumina sequenced. Red circles denote the epimutation event; black and green circles are monomer-specific reverse transcription errors; blue circles are amplification/NGS errors. (4) By aligning the cDNA tags and RNA barcodes a consensus sequence is generated from the cDNA copies and the original epimutation is revealed [8]. Whole-transcriptome data is obtained using a primer against the RNA adaptor during rolling-circle reverse transcription; specific locus data is obtained using a transcript-specific primer at this step. Rep-seq and CirSeq have been recently reviewed [33].
While tandem repeats perform well at differentiating RNA errors from sequencing artifact, these rolling circle-based technologies do have limitations. Firstly, they cannot distinguish transient RNA errors from rare de novo DNA mutation. In sequencing DNA from pooled cells, discovery of a rare variant is unlikely, rendering DNA validation of an RNA error exceedingly difficult. However, mathematically, the odds of encountering a genuine RNA error (~10−5/10−6 per base) are many logs greater than encountering a rare de novo DNA variant. Another caveat of modCirSeq/ARC-seq is the inability to differentiate errors from RNA editing, as occurs when APOBECs and ADARs catalyze nucleotide changes within RNA [34]. However, many edits follow recognizable patterns that can be excluded from ARC-seq analysis; for example, >99% of edits occur in Alu repeats and the majority are A-to-I, which appear as A-to-G transitions by RNA-seq [34]. To work around issues of post-transcriptional modifications, new sequencing technologies that select for nascent transcripts may be utilized.
Native elongating transcript sequencing (NET-seq) pairs RNAPII immunoprecipitation, allowing for 3’ end capture of nascent RNAs, with NGS [21]. With NET-seq, genome-wide patterns for RNAP pause sites have been described and linked to misincorporation events that are remarkably common, occurring in 1–3% of elongation complexes – much higher than the rate of errors in the total RNA pool [9,21]. NET-seq can be optimized to differentiate paused from backtracked RNAPs with the addition of an RNase footprinting step (RNET-seq) as each conformation “protects” nascent RNAs of different sizes [16,35]. However, the sensitivity of RNET-seq relies on sequencing small RNAs 14–16 bp in length, which tend to map poorly on analysis, limiting its application. Ultimately, the significance of paused versus backtracked RNAP in epimutation and transcription conflicts is not well understood. New scalable in vivo methods of detection are needed to distinguish the various inactive states of RNAP to facilitate further study.
4. Proofreading of transcription errors
Elongation occurs through a translocating transcription bubble containing a critical RNA:DNA hybrid of 8–9 bp in length on the antisense DNA strand [36]. Importantly, processive elongation requires an active complex and does not occur when RNAP is paused, backtracked or arrested – all alternative, but distinct, inactive conformations of RNAP [37,38]. Spontaneous errors during elongation such as base mispairings, misalignments and indels destabilize the RNA:DNA hybrid [36]. This disruption impedes elongation and energetically favors inactive conformations such as backtracked RNAP in which the complex slides backwards along the DNA, causing the 3’ end of the nascent transcript to disengage from the catalytic site and thread through the secondary channel [39]. Backtracked RNAP alone is stable with limited proofreading of misincorporated nucleotides through low-level intrinsic cleavage of the 3’ end of nascent transcripts [40]. However, addition of transcription fidelity factors GreA/B (E. coli) or TFIIS (eukaryotes) greatly increases the rate of RNA cleavage, allowing the 3’ end to efficiently re-engage the catalytic site and elongation to continue [41–44]. Besides Gre and TFIIS, Gre-homologue DksA also enhances transcription accuracy; dksA deletion confers a 7-fold increase in errors as measured by a lac reporter system in E. coli [45,46]. These transcription fidelity factors are highly conserved across species, demonstrating their functional importance.
Transcription errors can also occur secondary to DNA damage such as pyrimidine dimers and DNA adducts [25,47]. In this case, the faulty template may cause RNAP to pause, backtrack or arrest, serving as a damage signal to repair mechanisms [48]. Anti-backtracking factors can restart elongation and promote error-prone translesion synthesis (transcriptional mutagenesis) [23], but some lesions, especially bulky ones, cannot be bypassed and require repair to avoid premature transcription termination [49]. Complex damage-sensing and repair pathways have evolved with transcription – collectively called transcription coupled repair (TCR) – that have been thoroughly reviewed elsewhere [22,50].
Regardless of where or why a stalled RNAP complex occurs, a dynamic equilibrium is thought to ensue between pro- and anti-backtracking forces [51]. Promoting backtracking, the repair helicase UvrD works to pull the polymerase backwards [52] while secondary channel interactors DksA and ppGpp may destabilize active RNAPs [53–55], together disrupting elongation. Conversely, Mfd helicase pushes RNAP forward [56,57] while GreA/B oppose backtracking through their cleavage function at the secondary channel where they compete with DksA [58]. Given their opposing functions, these backtracking factors likely work through independent pathways to promote fidelity. A growing body of literature suggests that anti-backtracking factors evolved to mitigate spontaneous errors and promote elongation to enhance processivity [9,16] but at sites of DNA damage these factors impede repair [22,59]. On the other hand, pro-backtracking factors appear important for DNA repair as a ppGpp null mutation renders cells sensitive to genotoxic agents [53] and uvrD is induced by the SOS response to DNA damage in E. coli [60]. How transcription fidelity factors coordinate at different types and sources of inactive RNAP complexes will likely be a hot topic for future research into the mechanisms of transcription:DNA repair conflicts.
4.1. GreA/B
In E. coli, GreA and GreB proteins interact with stalled RNAPs through the secondary channel to induce RNA cleavage and relieve backtracking, with cleavage fragments reportedly ranging in size from 2 to 18 nucleotides [61]. At misincorporated complexes, particularly G>A errors [9,17,62], Gre-mediated cleavage removes the offending base that triggered the stall and repositions the new 3’ RNA end at the active site, thus Gre proteins play dual roles as proofreading and pro-elongation factors. Loss of GreA alone confers no significant phenotype under normal growth conditions, however, when combined with a greB null mutation strains become temperature and starvation sensitive with pronounced replication blockage [63]. Confirming their in vivo role in fidelity, Gre knockouts have increased rates of transcription errors (2.4 to 100-fold) as measured by reporter screens [30,64] and NGS [9,17,62]. These studies depict a role for GreA/B in maintaining high-quality, full-length transcripts, especially under harsh conditions with strong selective pressures for survival.
4.2. TFIIS
Originally described as an activator of RNAPII, TFIIS was discovered to promote transcription fidelity in vitro [42] six years after the TFIIS analogue GreA was identified in E. coli [65]. Most recently, the fidelity function of TFIIS has been validated in yeast by modCirSeq as loss of TFIIS increases transcription errors 10-fold (~4 × 10−5 per bp) with no confounding effect on the DNA mutation rate [7]. Errors in ΔTFIIS strains showed strong bias for G>A transitions, but rates of other misincorporations, insertions and deletions are affected as well [7]. TFIIS is not essential in yeast [66] or C. elegans [67], whereas in mice TFIIS null mutants are embryonic lethal [68]. In humans, TFIIS has three homologues – TCEA1, 2, and 3 – which differ in tissue distribution but share functionality [69].
Functionally, TFIIS behaves like GreA, inducing nascent transcript cleavage to relieve RNAP backtracking and restart elongation across the eukaryotic domain [42,43]. However, many studies support a role for TFIIS in gene regulation and promoter-proximal pausing as well [67,70,71]. Using a dominant-negative TFIIS mutant that inhibits RNA cleavage and stabilizes backtracked complexes, it has been shown that rescue of backtracked RNAPII by TFIIS is a major stimulus of elongation under normal conditions [72]. Additionally, loss of TFIIS in yeast effected a global 5–18 bp shift in pauses sites observed by NET-seq [21]. Important questions remain regarding which function of TFIIS – pro-elongation or proofreading – contributes most to the phenotypes observed in eukaryotes.
5. Consequences of transcription errors
Transcription fidelity is hypothesized to affect cells through two primary mechanisms: genetic and epigenetic. The relative contributions of each to the phenotypes observed in fidelity mutants in vivo remains elusive.
5.1. Genetic consequences of transcription fidelity
There is extensive evidence that conflicts between transcription and DNA replication and repair are under negative selection and therefore detrimental to cells. From bacteria to humans, highly expressed genes have evolved to be co-oriented with the direction of replication to avoid the mutagenic effects of head-on conflicts between RNAP and the replisome [73,74]. The timing of replication in eukaryotes is tightly regulated through the actions of cyclins, checkpoints and transcription factors to minimize local transcription in replicating regions [75]. In mitochondria transcription and replication appear to be mutually exclusive processes [76]. Additionally, transcriptional repression has been widely observed following DNA double-strand breaks (DSBs) or UV-induced DNA damage [77,78]. (For reviews of transcription:replication conflicts see [79–81].)
As fidelity factors regulate both RNAP elongation and TCR of polymerase-stalling lesions, they influence the global timing of transcription and thus affect the likelihood of genome-destabilizing conflicts with the replisome. Significant evidence supports roles for GreA/B and TFIIS, among others, in DNA damage and repair independent of their effects on transcript quality [82]. Increased RNAP backtracking secondary to loss of GreA may even promote DNA damage [83]. However, recent evidence suggests that Gre proteins antagonize DNA repair. A greA null mutant was shown to enhance survival by 10-fold following restriction enzyme-induced DSBs and by 500-fold after phleomycin treatment [59]. Additionally, loss of GreA/B rescues ppGpp and uvrD null mutants from DNA damage susceptibility, highlighting the opposing forces of pro- and anti-backtracking factors in genome stability [52,53]. Increased survival in ΔgreA E. coli is dependent on the homologous recombination enzymes RecA and RecBCD [59]. and on the Holliday junction resolvase RuvABC [84], implicating DNA repair in the mechanism of rescue. Interestingly, expression of a backtracking-prone RNAP mutant confers the same rescue as the greA null [59,84]. Taken together, these studies depict GreA as a negative regulator of TCR and break repair at bulky DNA lesions. Perhaps at minor roadblocks the anti-backtracking activity of GreA instead promotes translesion synthesis, enabling production of full-length transcripts despite damage. Such duality in GreA function at different types of DNA damage has not been investigated but may be of future interest.
While Gre factors are the subject of an extensive body of research, their human counterparts remain relatively understudied, likely due to both inadequate tools and the lethal phenotype in knockouts [85,86]. In contrast to GreA, the role of TFIIS in DNA repair remains elusive. TFIIS is recruited by the TCR complex and restores transcription following UV damage, but TFIIS knockdown has no effect on UV sensitivity or the p53 damage response in human cells [87,88]. The advent of new sequencing and gene knockdown technologies will enable further study of this essential transcription factor, promising to yield novel insights in gene regulation, aging and human disease.
The profound implication that RNA errors may pose a threat comparable to DNA damage in cells suggests an evolutionary tradeoff between transcription fidelity and DNA repair. Given that transcript quality regulates cell state through proteins and noncoding RNAs, perhaps the repercussions of erroneous transcripts on survival can at times outweigh the deleterious effects of DNA damage. For instance, the DNA damage response (DDR) itself is a transcriptional program. To enable global repair and maximize the chances of survival following damage, DDR genes must be transcribed even at the cost of impeding repair at local lesions.
Given the role of DNA damage in disease, the competing priorities of repair and transcription lend credence to a new paradigm that transcription errors might also play a significant role in disease.
5.2. Non-genetic consequences of transcription fidelity
At ~10−5/10−6 per base, the rare and transient nature of transcription errors has led them to be overlooked by many. To fully appreciate their consequences, the combined impact of errors in a cell must be abstracted, but first we will consider the effects of a single, individual error.
As with DNA mutation, most RNA errors are silent, but errors can still impact cells in many ways. First, errors can affect critical steps in post-transcriptional processing like splicing [89]. At the protein level, misincorporation errors within key catalytic, regulatory or binding domains can completely abrogate normal protein function, with misalignment errors conferring even more dramatic effects. Additionally, a single transcript is translated multiple times amplifying a single error. For low copy number transcripts and long-lived proteins, the effects of errors are particularly amplified and may persist long after the error occurred. What’s more, transcription errors can exert dominant effects on the protein pool by titrating away binding partners from WT copies or poisoning multimeric complexes, disrupting entire functional networks, which has been directly observed in vivo. Using an engineered TP53 construct with a site-specific 06-methylguanine that could induce a dominant negative epimutation through transcriptional mutagenesis, a staggering 15% of cells displayed impaired cell cycle arrest and apoptosis, illustrating the power of a single error to derail normal functions with potential long-term consequences [90]. However, unlike transcriptional mutagenesis where DNA lesions cause predictable changes [23], spontaneous transcription errors are more random and – to our knowledge – recurrence of a specific error is rare. So, the question stands – How can a rare, transient, one-time transcription error impact cell fate?
There is evidence in bacteria, yeast and human cells for the regulation of critical gene networks, from metabolism to cell cycle control, by bistable transcription switches [91,92]. Switches operate as gene expression in one state – ON or OFF – with a defined threshold to switch states greater than the threshold to maintain the current state. If an inducer, e.g. lactose in the E. coli lac operon, exceeds the threshold the gene is switched ON, and once ON positive feedback loops stabilize the induced gene until a counteracting repressor exceeds threshold and returns the cell to the OFF state. Despite their simplicity, bistable switches are reminiscent of Waddington’s famed epigenetic landscape of development and comfortably fit contemporary models of cellular plasticity. Switches can be designed with increasing complexity, but the essential principle remains that genes in bi- or multi-stable switches require only fleeting signals to stably and heritably alter cellular phenotype (Fig. 2) [64]. In bistable models of gene regulation, transient transcription errors have the potential to dramatically alter cell fate through stochastic phenotypic switching. In support of this model, loss of fidelity factors GreA/B increases the rate of heritable phenotype switching as measured from a lac reporter [29].
Fig. 2.
Positive feedback and all-or-none phenomena. (A) Left panel: The lac operon comprises an autocatalytic positive feedback loop (the presence of LacY permease will produce more permease) allowing a heritable epigenetic switch at a maintenance concentration of inducer. LacY is presented in green because the lacA gene in our system is replaced by gfp, and the lac operon is now lacZYA::gfp, so when permease is made GFP will also be made and the cell will fluoresce green. Stochastic events that lead to a transient depletion of repressor (in red) within a cell will result in a burst of LacY permease expression and the presence of permease will activate the positive feedback loop, so that the new induced state will be heritably maintained, mimicking mutation [29,33]. Right panel: E. coli microcolony growing in a microfluidics chamber. Lac operon Off cells were seeded and grown in minimal media plus maintenance concentration of inducer. A stochastic phenotypic switch from Lac OFF to ON occurred in a single cell and the phenotype was heritably maintained forming an ON lineage of cells. The ON lineage was torn apart due to the growth dynamics of the microcolony. The brightfield image viewed in the blue channel was merged with the GFP fluorescence image viewed in the yellow channel. (B) Left panel: A bistable RB-EF2 switch underlies the restriction point, the critical event when a mammalian cell commits to proliferation and becomes independent of growth stimulation [92]. In OFF, quiescent, cells EF2 (in green) is repressed by RB (in red). With sufficient growth stimulation by Myc, Rb repression is removed via CycD/Cdk4,6 activation; Myc also induces EF2 which activates CycE/Cdk2 to further block Rb. Since EF2 activates itself, two positive feedback loops are involved to shut OFF Rb and turn ON EF2 resulting in a pathway that converts graded serum inputs into an all-or-none-response. Right panel: Cellular heterogeneity in a tissue. The variation in blue represents underlying sub-threshold expression levels of a gene network in individual cells; full-blown yellow represents the activation of a network and subsequent heritable epigenetic phenotypic switch in a cell lineage. The cellular pattern is based on an M.C. Escher Alhambra tessellation sketch.
Transcription occurs on enormous scales and is constantly fueling new errors, rendering a rare event frequent enough to arise many times over the lifetime of a cell. The odds of a phenotypically relevant error occurring are further inflated in long-lived, quiescent cells such as neurons, mirroring a cumulative risk model akin to other disease models for neurodegeneration and cancer. Together, RNA errors contribute to molecular noise [29] that can impact proteostasis and induce heterogeneity between cells [7]. Non-genetic heterogeneity plays a central role in normal development, disease and evolution, but the precise contribution of transcription errors to these essential processes is still unknown [93]. With such broad connections across biological phenomena, transcription errors are now more relevant than ever, and further study promises to inform on the most fundamental processes within living systems, including epigenetic regulation, genome maintenance, disease and evolution.
6. Perspectives in evolution and disease
There are many possible mechanisms by which transcription errors may contribute to disease, especially diseases driven by non-genetic heterogeneity and genomic instability (Fig. 3). For example, it was recently demonstrated that stochastic expression of the DNA repair protein Ada generates mutation rate heterogeneity in a monoclonal population of E. coli [94]. Transcription errors could potentiate stochastic Ada activity and thereby drive bacterial evolution and antibiotic resistance. In another example, RNA errors were found to increase with age, induce proteotoxic stress, and shorten the lifespan of yeast, linking transcription infidelity to aging and neurodegeneration [95]. In fact, neurobiologists have studied the effects of translation errors (“molecular misreading”) on protein heterogeneity, proteotoxic stress and disease for years with little recognition of parallel mechanisms for transcription errors [96,97]. Despite these interesting associations, few disease models have been rigorously tested and most hypotheses rely primarily on indirect evidence. However, no disease has yielded more mechanistic clues regarding the role of transcription errors than cancer [98].
Fig. 3.
Transient and permanent phenotypic consequences of an RNA transcription error. Green circle is RNAP; black lines are DNA; blue line is nascent RNA transcript; red explosion is an RNA transcription error.
From its origins to later stage events like drug resistance and metastasis, transcription fidelity may propel cancer development. Promoting oncogenesis, transcription errors in p53 or the Rb/E2F bistable switch can transiently dysregulate DNA repair or induce cell cycle progression, propagating mutations before they can be fixed [90,92]. Moreover, RNA errors could contribute to observed fluctuations in p53 signaling, which has been shown to facilitate cell cycle escape [99]. Following malignant transformation, transcription errors may promote phenotypic heterogeneity among cancer cells and fuel tumor evolution.
However, low-fidelity transcription likely exerts negative consequences in cancer cells as well. For example, erroneous transcripts generate molecular noise that adversely affects cellular fitness [95]. The fitness cost of transcription errors may explain why TFIIS knockdown inhibits proliferation and induces apoptosis in a breast cancer cell line [85]. Given their elevated levels of replication and transcription, cancers could also be particularly sensitive to the genetic consequences of transcription errors and RNAP backtracking as increased traffic on the DNA likely promotes genome-destabilizing collisions [100,101]. In summary, transcription errors are highly relevant to the dysregulated gene networks, molecular noise, and genomic instability observed in cancers and thus warrant further study into their contributions to disease.
7. Conclusion
The transcription fidelity field, yet in its infancy, is flush with creative ideas and primed to emerge on the research scene as a new paradigm in disease and evolution. Clearly, much remains to be learned about the basic biology of transcription errors, including their role in phenotypic heterogeneity, adaptability and cellular evolution. By applying the cutting-edge concepts and tools outlined in this review, researchers can begin to tackle critical questions in the field and explore mechanisms for transcription fidelity in antibiotic resistance, aging, neurodegeneration, cancer and beyond.
Acknowledgements
This work was supported by National Institute of Health USA grant R01-GM088653 (C.H.), and by the Robert and Janice McNair Foundation/ McNair Medical Institute M.D./Ph.D. Scholars Program (C.C.B).
Abbreviations:
- ARC-seq
accurate RNA consensus sequencing
- CirSeq
circle sequencing
- DDR
DNA damage response
- DSB
DNA double-strand break
- modCirSeq
modified circle sequencing
- NET-seq
native elongating transcript sequencing
- NGS
next-generation sequencing
- Rep-seq
replicated sequencing
- RNAP
RNA polymerase
- RNET-seq
NET-seq with RNase footprinting
- RT
reverse transcription
- TCR
transcription-coupled repair
- TFIIS
transcription factor S-II
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
Conflict of interest
The authors declare that there are no conflicts of interest.
References
- [1].Nirenberg MW, Matthaei JH, The dependence of cell-free protein synthesis in E. coli upon naturally occurring or synthetic polyribonucleotides, Proc. Natl. Acad. Sci. U.S.A 47 (1961) 1588–1602. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [2].Drake JW, Charlesworth B, Charlesworth D, Crow JF, Rates of spontaneous mutation, Genetics 148 (1998) 1667–1686. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [3].Waddington CH, The epigenotype. 1942, Int J Epidemiol 41 (2012) 10–13. doi: 10.1093/ije/dyr184. [DOI] [PubMed] [Google Scholar]
- [4].Wu Ct JR Morris, Genes, genetics, and epigenetics: a correspondence, Science 293 (2001) 1103–1105. doi: 10.1126/science.293.5532.1103. [DOI] [PubMed] [Google Scholar]
- [5].Hnisz D, Shrinivas K, Young RA, Chakraborty AK, Sharp PA, A Phase Separation Model for Transcriptional Control, Cell 169 (2017) 13–23. doi: 10.1016/j.cell.2017.02.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [6].Evans CR, Fan Y, Weiss K, Ling J, Errors during Gene Expression: Single-Cell Heterogeneity, Stress Resistance, and Microbe-Host Interactions, MBio 9 (2018). doi: 10.1128/mBio.01018-18. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [7].Gout J-F, Li W, Fritsch C, Li A, Haroon S, Singh L, Hua D, Fazelinia H, Smith Z, Seeholzer S, Thomas K, Lynch M, Vermulst M, The landscape of transcription errors in eukaryotic cells, Sci Adv 3 (2017) e1701484. doi: 10.1126/sciadv.1701484. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [8].Reid-Bayliss KS, Loeb LA, Accurate RNA consensus sequencing for high-fidelity detection of transcriptional mutagenesis-induced epimutations, Proc. Natl. Acad. Sci. U.S.A 114 (2017) 9415–9420. doi: 10.1073/pnas.1709166114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [9].James K, Gamba P, Cockell SJ, Zenkin N, Misincorporation by RNA polymerase is a major source of transcription pausing in vivo, Nucleic Acids Res 45 (2017) 1105–1113. doi: 10.1093/nar/gkw969. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [10].Kireeva ML, Nedialkov YA, Cremona GH, Purtov YA, Lubkowska L, Malagon F, Burton ZF, Strathern JN, Kashlev M, Transient reversal of RNA polymerase II active site closing controls fidelity of transcription elongation, Mol. Cell 30 (2008) 557–566. doi: 10.1016/j.molcel.2008.04.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [11].Walmacq C, Kireeva ML, Irvin J, Nedialkov Y, Lubkowska L, Malagon F, Strathern JN, Kashlev M, Rpb9 subunit controls transcription fidelity by delaying NTP sequestration in RNA polymerase II, J. Biol. Chem 284 (2009) 19601–19612. doi: 10.1074/jbc.M109.006908. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [12].Blank A, Gallant JA, Burgess RR, Loeb LA, An RNA polymerase mutant with reduced accuracy of chain elongation, Biochemistry 25 (1986) 5920–5928. [DOI] [PubMed] [Google Scholar]
- [13].Bar-Nahum G, Epshtein V, Ruckenstein AE, Rafikov R, Mustaev A, Nudler E, A ratchet mechanism of transcription elongation and its control, Cell 120 (2005) 183–193. doi: 10.1016/j.cell.2004.11.045. [DOI] [PubMed] [Google Scholar]
- [14].Zhou YN, Lubkowska L, Hui M, Court C, Chen S, Court DL, Strathern J, Jin DJ, Kashlev M, Isolation and characterization of RNA polymerase rpoB mutations that alter transcription slippage during elongation in Escherichia coli, J. Biol. Chem 288 (2013) 2700–2710. doi: 10.1074/jbc.M112.429464. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [15].Gout J-F, Thomas WK, Smith Z, Okamoto K, Lynch M, Large-scale detection of in vivo transcription errors, Proc. Natl. Acad. Sci. U.S.A 110 (2013) 18584–18589. doi: 10.1073/pnas.1309843110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [16].Imashimizu M, Takahashi H, Oshima T, McIntosh C, Bubunenko M, Court DL, Kashlev M, Visualizing translocation dynamics and nascent transcript errors in paused RNA polymerases in vivo, Genome Biol 16 (2015) 98. doi: 10.1186/s13059-015-0666-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [17].Imashimizu M, Oshima T, Lubkowska L, Kashlev M, Direct assessment of transcription fidelity by high-resolution RNA sequencing, Nucleic Acids Res 41 (2013) 9090–9104. doi: 10.1093/nar/gkt698. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [18].Traverse CC, Ochman H, Conserved rates and patterns of transcription errors across bacterial growth states and lifestyles, Proc. Natl. Acad. Sci. U.S.A 113 (2016) 3311–3316. doi: 10.1073/pnas.1525329113. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [19].Wagner LA, Weiss RB, Driscoll R, Dunn DS, Gesteland RF, Transcriptional slippage occurs during elongation at runs of adenine or thymine in Escherichia coli, Nucleic Acids Res 18 (1990) 3529–3535. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [20].Parks AR, Court C, Lubkowska L, Jin DJ, Kashlev M, Court DL, Bacteriophage λ N protein inhibits transcription slippage by Escherichia coli RNA polymerase, Nucleic Acids Res 42 (2014) 5823–5829. doi: 10.1093/nar/gku203. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [21].Churchman LS, Weissman JS, Nascent transcript sequencing visualizes transcription at nucleotide resolution, Nature 469 (2011) 368–373. doi: 10.1038/nature09652. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [22].Pani B, Nudler E, Mechanistic insights into transcription coupled DNA repair, DNA Repair (Amst.) 56 (2017) 42–50. doi: 10.1016/j.dnarep.2017.06.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [23].Doetsch PW, Translesion synthesis by RNA polymerases: occurrence and biological implications for transcriptional mutagenesis, Mutat. Res 510 (2002) 131–140. [DOI] [PubMed] [Google Scholar]
- [24].Wang W, Walmacq C, Chong J, Kashlev M, Wang D, Structural basis of transcriptional stalling and bypass of abasic DNA lesion by RNA polymerase II, Proc. Natl. Acad. Sci. U.S.A 115 (2018) E2538–E2545. doi: 10.1073/pnas.1722050115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [25].Marietta C, Brooks PJ, Transcriptional bypass of bulky DNA lesions causes new mutant RNA transcripts in human cells, EMBO Rep 8 (2007) 388–393. doi: 10.1038/sj.embor.7400932. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [26].Liu J, Zhou W, Doetsch PW, RNA polymerase bypass at sites of dihydrouracil: implications for transcriptional mutagenesis., Mol Cell Biol 15 (1995) 6729–6735. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [27].Viswanathan A, You HJ, Doetsch PW, Phenotypic change caused by transcriptional bypass of uracil in nondividing cells, Science 284 (1999) 159–162. [DOI] [PubMed] [Google Scholar]
- [28].Springgate CF, Loeb LA, On the fidelity of transcription by Escherichia coli ribonucleic acid polymerase, J. Mol. Biol 97 (1975) 577–591. [DOI] [PubMed] [Google Scholar]
- [29].Gordon AJE, Halliday JA, Blankschien MD, Burns PA, Yatagai F, Herman C, Transcriptional infidelity promotes heritable phenotypic change in a bistable gene network, PLoS Biol 7 (2009) e44. doi: 10.1371/journal.pbio.1000044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [30].Bubunenko MG, Court CB, Rattray AJ, Gotte DR, Kireeva ML, Irizarry-Caro JA, Li X, Jin DJ, Court DL, Strathern JN, Kashlev M, Cre A Transcription Fidelity Reporter Identifies GreA as a Major RNA Proofreading Factor in Escherichia coli, Genetics 206 (2017) 179–187. doi: 10.1534/genetics.116.198960. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [31].Acevedo A, Brodsky L, Andino R, Mutational and fitness landscapes of an RNA virus revealed through population sequencing, Nature 505 (2014) 686–690. doi: 10.1038/nature12861. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [32].Lou DI, Hussmann JA, McBee RM, Acevedo A, Andino R, Press WH, Sawyer SL, High-throughput DNA sequencing errors are reduced by orders of magnitude using circle sequencing, Proc. Natl. Acad. Sci. U.S.A 110 (2013) 19872–19877. doi: 10.1073/pnas.1319590110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [33].Gordon AJE, Satory D, Halliday JA, Herman C, Lost in transcription: transient errors in information transfer, Curr. Opin. Microbiol 24 (2015) 80–87. doi: 10.1016/j.mib.2015.01.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [34].Eisenberg E, Levanon EY, A-to-I RNA editing - immune protector and transcriptome diversifier, Nat. Rev. Genet 19 (2018) 473–490. doi: 10.1038/s41576-018-0006-1. [DOI] [PubMed] [Google Scholar]
- [35].Komissarova N, Kashlev M, Functional topography of nascent RNA in elongation intermediates of RNA polymerase, Proc Natl Acad Sci U S A 95 (1998) 14699–14704. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [36].Nudler E, Mustaev A, Lukhtanov E, Goldfarb A, The RNA-DNA hybrid maintains the register of transcription by preventing backtracking of RNA polymerase, Cell 89 (1997) 33–41. [DOI] [PubMed] [Google Scholar]
- [37].Erie DA, The many conformational states of RNA polymerase elongation complexes and their roles in the regulation of transcription, Biochimica et Biophysica Acta (BBA) - Gene Structure and Expression 1577 (2002) 224–239. doi: 10.1016/S0167-4781(02)00454-2. [DOI] [PubMed] [Google Scholar]
- [38].Neuman KC, Abbondanzieri EA, Landick R, Gelles J, Block SM, Ubiquitous transcriptional pausing is independent of RNA polymerase backtracking, Cell 115 (2003) 437–447. [DOI] [PubMed] [Google Scholar]
- [39].Cramer P, Bushnell DA, Fu J, Gnatt AL, Maier-Davis B, Thompson NE, Burgess RR, Edwards AM, David PR, Kornberg RD, Architecture of RNA polymerase II and implications for the transcription mechanism, Science 288 (2000) 640–649. [DOI] [PubMed] [Google Scholar]
- [40].Zenkin N, Yuzenkova Y, Severinov K, Transcript-assisted transcriptional proofreading, Science 313 (2006) 518–520. doi: 10.1126/science.1127422. [DOI] [PubMed] [Google Scholar]
- [41].Erie DA, Hajiseyedjavadi O, Young MC, von Hippel PH, Multiple RNA polymerase conformations and GreA: control of the fidelity of transcription, Science 262 (1993) 867–873. [DOI] [PubMed] [Google Scholar]
- [42].Jeon C, Agarwal K, Fidelity of RNA polymerase II transcription controlled by elongation factor TFIIS, Proc. Natl. Acad. Sci. U.S.A 93 (1996) 13677–13682. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [43].Kettenberger H, Armache K-J, Cramer P, Architecture of the RNA polymerase II-TFIIS complex and implications for mRNA cleavage, Cell 114 (2003) 347–357. [DOI] [PubMed] [Google Scholar]
- [44].Thomas MJ, Platas AA, Hawley DK, Transcriptional fidelity and proofreading by RNA polymerase II, Cell 93 (1998) 627–637. [DOI] [PubMed] [Google Scholar]
- [45].Roghanian M, Zenkin N, Yuzenkova Y, Bacterial global regulators DksA/ppGpp increase fidelity of transcription, Nucleic Acids Res 43 (2015) 1529–1536. doi: 10.1093/nar/gkv003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [46].Satory D, Gordon AJE, Wang M, Halliday JA, Golding I, Herman C, DksA involvement in transcription fidelity buffers stochastic epigenetic change, Nucleic Acids Res 43 (2015) 10190–10199. doi: 10.1093/nar/gkv839. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [47].Steurer B, Marteijn JA, Traveling Rocky Roads: The Consequences of Transcription-Blocking DNA Lesions on RNA Polymerase II, J. Mol. Biol 429 (2017) 3146–3155. doi: 10.1016/j.jmb.2016.11.006. [DOI] [PubMed] [Google Scholar]
- [48].Lainé J-P, Egly J-M, Initiation of DNA repair mediated by a stalled RNA polymerase IIO, EMBO J 25 (2006) 387–397. doi: 10.1038/sj.emboj.7600933. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [49].Tornaletti S, Transcription arrest at DNA damage sites, Mutat. Res 577 (2005) 131–145. doi: 10.1016/j.mrfmmm.2005.03.014. [DOI] [PubMed] [Google Scholar]
- [50].Hanawalt PC, Spivak G, Transcription-coupled DNA repair: two decades of progress and surprises, Nat. Rev. Mol. Cell Biol 9 (2008) 958–970. doi: 10.1038/nrm2549. [DOI] [PubMed] [Google Scholar]
- [51].Gabizon R, Lee A, Vahedian-Movahed H, Ebright RH, Bustamante CJ, Pause sequences facilitate entry into long-lived paused states by reducing RNA polymerase transcription rates, Nat Commun 9 (2018). doi: 10.1038/s41467-018-05344-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [52].Epshtein V, Kamarthapu V, McGary K, Svetlov V, Ueberheide B, Proshkin S, Mironov A, Nudler E, UvrD facilitates DNA repair by pulling RNA polymerase backwards, Nature 505 (2014) 372–377. doi: 10.1038/nature12928. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [53].Kamarthapu V, Epshtein V, Benjamin B, Proshkin S, Mironov A, Cashel M, Nudler E, ppGpp Couples Transcription to DNA Repair in E. coli, Science 352 (2016) 993–996. doi: 10.1126/science.aad6945. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [54].Paul BJ, Barker MM, Ross W, Schneider DA, Webb C, Foster JW, Gourse RL, DksA: A Critical Component of the Transcription Initiation Machinery that Potentiates the Regulation of rRNA Promoters by ppGpp and the Initiating NTP, Cell 118 (2004) 311–322. doi: 10.1016/j.cell.2004.07.009. [DOI] [PubMed] [Google Scholar]
- [55].Zuo Y, Wang Y, Steitz TA, The mechanism of E coli RNA polymerase regulation by ppGpp is suggested by the structure of their complex, Mol. Cell 50 (2013) 430–436. doi: 10.1016/j.molcel.2013.03.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [56].Howan K, Smith AJ, Westblade LF, Joly N, Grange W, Zorman S, Darst SA, Savery NJ, Strick TR, Initiation of transcription-coupled repair characterized at single-molecule resolution, Nature 490 (2012) 431–434. doi: 10.1038/nature11430. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [57].Park J-S, Marr MT, Roberts JW, coli E Transcription repair coupling factor (Mfd protein) rescues arrested complexes by promoting forward translocation, Cell 109 (2002) 757–767. [DOI] [PubMed] [Google Scholar]
- [58].Vinella D, Potrykus K, Murphy H, Cashel M, Effects on growth by changes of the balance between GreA, GreB, and DksA suggest mutual competition and functional redundancy in Escherichia coli, J. Bacteriol 194 (2012) 261–273. doi: 10.1128/JB.06238-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [59].Sivaramakrishnan P, Sepúlveda LA, Halliday JA, Liu J, Núñez MAB, Golding I, Rosenberg SM, Herman C, The transcription fidelity factor GreA impedes DNA break repair, Nature 550 (2017) 214–218. doi: 10.1038/nature23907. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [60].Siegel EC, The Escherichia coli uvrD gene is inducible by DNA damage, Mol. Gen. Genet 191 (1983) 397–400. [DOI] [PubMed] [Google Scholar]
- [61].Borukhov S, Sagitov V, Goldfarb A, Transcript cleavage factors from E. coli, Cell 72 (1993) 459–466. doi: 10.1016/0092-8674(93)90121-6. [DOI] [PubMed] [Google Scholar]
- [62].Traverse CC, Ochman H, A Genome-Wide Assay Specifies Only GreA as a Transcription Fidelity Factor in Escherichia coli, G3 (Bethesda) 8 (2018) 2257–2264. doi: 10.1534/g3.118.200209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [63].Tehranchi AK, Blankschien MD, Zhang Y, Halliday JA, Srivatsan A, Peng J, Herman C, Wang JD, The transcription factor DksA prevents conflicts between DNA replication and transcription machinery, Cell 141 (2010) 595–605. doi: 10.1016/j.cell.2010.03.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [64].Gordon AJE, Satory D, Halliday JA, Herman C, Heritable change caused by transient transcription errors, PLoS Genet 9 (2013) e1003595. doi: 10.1371/journal.pgen.1003595. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [65].Sparkowski J, Das A, The nucleotide sequence of greA, a suppressor gene that restores growth of an Escherichia coli RNA polymerase mutant at high temperature, Nucleic Acids Res 18 (1990) 6443. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [66].Malagon F, Tong AH, Shafer BK, Strathern JN, Genetic interactions of DST1 in Saccharomyces cerevisiae suggest a role of TFIIS in the initiation-elongation transition, Genetics 166 (2004) 1215–1227. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [67].Maxwell CS, Kruesi WS, Core LJ, Kurhanewicz N, Waters CT, Lewarch CL, Antoshechkin I, Lis JT, Meyer BJ, Baugh LR, Pol II docking and pausing at growth and stress genes in C. elegans, Cell Rep 6 (2014) 455–466. doi: 10.1016/j.celrep.2014.01.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [68].Ito T, Arimitsu N, Takeuchi M, Kawamura N, Nagata M, Saso K, Akimitsu N, Hamamoto H, Natori S, Miyajima A, Sekimizu K, Transcription Elongation Factor S-II Is Required for Definitive Hematopoiesis, Mol Cell Biol 26 (2006) 3194–3203. doi: 10.1128/MCB.26.8.3194-3203.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [69].Labhart P, Morgan GT, Identification of novel genes encoding transcription elongation factor TFIIS (TCEA) in vertebrates: conservation of three distinct TFIIS isoforms in frog, mouse, and human, Genomics 52 (1998) 278–288. doi: 10.1006/geno.1998.5449. [DOI] [PubMed] [Google Scholar]
- [70].Adelman K, Marr MT, Werner J, Saunders A, Ni Z, Andrulis ED, Lis JT, Efficient release from promoter-proximal stall sites requires transcript cleavage factor TFIIS, Mol. Cell 17 (2005) 103–112. doi: 10.1016/j.molcel.2004.11.028. [DOI] [PubMed] [Google Scholar]
- [71].Nechaev S, Fargo DC, dos Santos G, Liu L, Gao Y, Adelman K, Global analysis of short RNAs reveals widespread promoter-proximal stalling and arrest of Pol II in Drosophila, Science 327 (2010) 335–338. doi: 10.1126/science.1181421. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [72].Sheridan RM, Fong N, D’Alessandro A, Bentley DL, Widespread Backtracking by RNA Pol II Is a Major Effector of Gene Activation, 5’ Pause Release, Termination, and Transcription Elongation Rate, Mol. Cell 73 (2019) 107–118.e4. doi: 10.1016/j.molcel.2018.10.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [73].Petryk N, Kahli M, d’Aubenton-Carafa Y, Jaszczyszyn Y, Shen Y, Silvain M, Thermes C, Chen C-L, Hyrien O, Replication landscape of the human genome, Nat Commun 7 (2016) 10208. doi: 10.1038/ncomms10208. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [74].Srivatsan A, Tehranchi A, MacAlpine DM, Wang JD, Co-Orientation of Replication and Transcription Preserves Genome Integrity, PLOS Genetics 6 (2010) e1000810. doi: 10.1371/journal.pgen.1000810. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [75].Meryet-Figuiere M, Alaei-Mahabadi B, Ali MM, Mitra S, Subhash S, Pandey GK, Larsson E, Kanduri C, Temporal separation of replication and transcription during S-phase progression, Cell Cycle 13 (2014) 3241–3248. doi: 10.4161/15384101.2014.953876. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [76].Agaronyan K, Morozov YI, Anikin M, Temiakov D, Mitochondrial biology. Replication-transcription switch in human mitochondria, Science 347 (2015) 548–551. doi: 10.1126/science.aaa0986. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [77].Awwad SW, Abu-Zhayia ER, Guttmann-Raviv N, Ayoub N, NELF-E is recruited to DNA double-strand break sites to promote transcriptional repression and repair, EMBO Rep 18 (2017) 745–764. doi: 10.15252/embr.201643191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [78].Moné MJ, Volker M, Nikaido O, Mullenders LH, van Zeeland AA, Verschure PJ, Manders EM, van Driel R, Local UV-induced DNA damage in cell nuclei results in local transcription inhibition, EMBO Rep 2 (2001) 1013–1017. doi: 10.1093/embo-reports/kve224. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [79].Bermejo R, Lai MS, Foiani M, Preventing replication stress to maintain genome stability: resolving conflicts between replication and transcription, Mol. Cell 45 (2012) 710–718. doi: 10.1016/j.molcel.2012.03.001. [DOI] [PubMed] [Google Scholar]
- [80].García-Muse T, Aguilera A, Transcription-replication conflicts: how they occur and how they are resolved, Nat. Rev. Mol. Cell Biol 17 (2016) 553–563. doi: 10.1038/nrm.2016.88. [DOI] [PubMed] [Google Scholar]
- [81].Hamperl S, Cimprich KA, Conflict Resolution in the Genome: How Transcription and Replication Make It Work, Cell 167 (2016) 1455–1467. doi: 10.1016/j.cell.2016.09.053. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [82].Nudler E, RNA Polymerase Backtracking in Gene Regulation and Genome Instability, Cell 149 (2012) 1438–1445. doi: 10.1016/j.cell.2012.06.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [83].Dutta D, Shatalin K, Epshtein V, Gottesman ME, Nudler E, Linking RNA polymerase backtracking to genome instability in E. coli, Cell 146 (2011) 533–543. doi: 10.1016/j.cell.2011.07.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [84].Trautinger BW, Jaktaji RP, Rusakova E, Lloyd RG, RNA polymerase modulators and DNA repair activities resolve conflicts between DNA replication and transcription, Mol. Cell 19 (2005) 247–258 doi: 10.1016/j.molcel.2005.06.004. [DOI] [PubMed] [Google Scholar]
- [85].Hubbard K, Catalano J, Puri RK, Gnatt A, Knockdown of TFIIS by RNA silencing inhibits cancer cell proliferation and induces apoptosis, BMC Cancer 8 (2008) 133. doi: 10.1186/1471-2407-8-133. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [86].Nagata M, Ito T, Arimitsu N, Koyama H, Sekimizu K, Transcription arrest relief by S-II/TFIIS during gene expression in erythroblast differentiation, Genes Cells 14 (2009) 371–380. doi: 10.1111/j.1365-2443.2008.01277.x. [DOI] [PubMed] [Google Scholar]
- [87].Fousteri M, Vermeulen W, van Zeeland AA, Mullenders LHF, Cockayne syndrome A and B proteins differentially regulate recruitment of chromatin remodeling and repair factors to stalled RNA polymerase II in vivo, Mol. Cell 23 (2006) 471–482. doi: 10.1016/j.molcel.2006.06.029. [DOI] [PubMed] [Google Scholar]
- [88].Jensen A, Mullenders LHF, Transcription factor IIS impacts UV-inhibited transcription, DNA Repair 9 (2010) 1142–1150. doi: 10.1016/j.dnarep.2010.08.002. [DOI] [PubMed] [Google Scholar]
- [89].Carey LB, RNA polymerase errors cause splicing defects and can be regulated by differential expression of RNA polymerase subunits, Elife 4 (2015). doi: 10.7554/eLife.09945. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [90].Ezerskyte M, Paredes JA, Malvezzi S, Burns JA, Margison GP, Olsson M, Scicchitano DA, Dreij K, O6-methylguanine-induced transcriptional mutagenesis reduces p53 tumor-suppressor function, Proc. Natl. Acad. Sci. U.S.A 115 (2018) 4731–4736. doi: 10.1073/pnas.1721764115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [91].Choi PJ, Cai L, Frieda K, Xie XS, A stochastic single-molecule event triggers phenotype switching of a bacterial cell, Science 322 (2008) 442–446. doi: 10.1126/science.1161427. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [92].Yao G, Lee TJ, Mori S, Nevins JR, You L, A bistable Rb-E2F switch underlies the restriction point, Nat. Cell Biol 10 (2008) 476–482. doi: 10.1038/ncb1711. [DOI] [PubMed] [Google Scholar]
- [93].Brock A, Chang H, Huang S, Non-genetic heterogeneity--a mutation-independent driving force for the somatic evolution of tumours, Nat. Rev. Genet 10 (2009) 336–342. doi: 10.1038/nrg2556. [DOI] [PubMed] [Google Scholar]
- [94].Uphoff S, Lord ND, Okumus B, Potvin-Trottier L, Sherratt DJ, Paulsson J, Stochastic activation of a DNA damage response causes cell-to-cell mutation rate variation, Science (New York, N.Y.) 351 (2016) 1094. doi: 10.1126/science.aac9786. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [95].Vermulst M, Denney AS, Lang MJ, Hung C-W, Moore S, Moseley MA, Mosely AM, Thompson JW, Thompson WJ, Madden V, Gauer J, Wolfe KJ, Summers DW, Schleit J, Sutphin GL, Haroon S, Holczbauer A, Caine J, Jorgenson J, Cyr D, Kaeberlein M, Strathern JN, Duncan MC, Erie DA, Transcription errors induce proteotoxic stress and shorten cellular lifespan, Nat Commun 6 (2015) 8065. doi: 10.1038/ncomms9065. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [96].Martin GM, Bressler SL, Transcriptional infidelity in aging cells and its relevance for the Orgel hypothesis, Neurobiol. Aging 21 (2000) 897–900; discussion 903–904. [DOI] [PubMed] [Google Scholar]
- [97].Orgel LE, The maintenance of the accuracy of protein synthesis and its relevance to ageing, Proc. Natl. Acad. Sci. U.S.A 49 (1963) 517–521. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [98].Brégeon D, Doetsch PW, Transcriptional mutagenesis: causes and involvement in tumour development, Nat. Rev. Cancer 11 (2011) 218–227. doi: 10.1038/nrc3006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [99].Reyes J, Chen J-Y, Stewart-Ornstein J, Karhohs KW, Mock CS, Lahav G, Fluctuations in p53 Signaling Allow Escape from Cell-Cycle Arrest, Mol. Cell 71 (2018) 581–591.e5. doi: 10.1016/j.molcel.2018.06.031. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [100].Kotsantis P, Silva LM, Irmscher S, Jones RM, Folkes L, Gromak N, Petermann E, Increased global transcription activity as a mechanism of replication stress in cancer, Nature Communications 7 (2016) 13087. doi: 10.1038/ncomms13087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [101].Gaillard H, García-Muse T, Aguilera A, Replication stress and cancer, Nat. Rev. Cancer 15 (2015) 276–289. doi: 10.1038/nrc3916. [DOI] [PubMed] [Google Scholar]
- [102].Traverse CC, Ochman H, Genome-Wide Spectra of Transcription Insertions and Deletions Reveal That Slippage Depends on RNA:DNA Hybrid Complementarity, MBio 8 (2017). doi: 10.1128/mBio.01230-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [103].Lee H, Popodi E, Tang H, Foster PL, Rate and molecular spectrum of spontaneous mutations in the bacterium Escherichia coli as determined by whole-genome sequencing, Proc. Natl. Acad. Sci. U.S.A 109 (2012) E2774–2783. doi: 10.1073/pnas.1210309109. [DOI] [PMC free article] [PubMed] [Google Scholar]



