Abstract
The current SARS-CoV-2 pandemic underscores the importance of understanding the evolution of RNA genomes. While RNA is subject to the formation of similar lesions as DNA, the evolutionary and physiological impacts RNA lesions have on viral genomes are yet to be characterized. Lesions that may drive the evolution of RNA genomes can induce breaks that are repaired by recombination or can cause base substitution mutagenesis, also known as base editing. Over the past decade or so, base editing mutagenesis of DNA genomes has been subject to many studies, revealing that exposure of ssDNA is subject to hypermutation that is involved in the etiology of cancer. However, base editing of RNA genomes has not been studied to the same extent. Recently hypermutation of single-stranded RNA viral genomes have also been documented though its role in evolution and population dynamics. Here, we will summarize the current knowledge of key mechanisms and causes of RNA genome instability covering areas from the RNA world theory to the SARS-CoV-2 pandemic of today. We will also highlight the key questions that remain as it pertains to RNA genome instability, mutations accumulation, and experimental strategies for addressing these questions.
Keywords: RNA world theory, messenger RNA, viral RNA, genome stability, viral evolution, hypermutation, APOBEC, ADAR, RNA editing
1. Introduction
One of the favorite aphorisms in modern biology is the title of the Theodosius Dobzhansky’s essay “Nothing in Biology Makes Sense Except in the Light of Evolution” [1]. In his Synthetic Theory of Evolution, Dobzhansky defined two major factors—genetic variation (i.e., mutation and other types of genome instability) and natural selection [2]—that interplay in generating new species. Remarkably high instability levels of RNA genomes accelerate speciation to the levels that often allow documenting evolution in real time. Besides the unique opportunity for researchers, this, at times, represents a considerable threat to the species hosting RNA viral genomes, including ongoing pandemic of SARS-CoV-2. The latter resulted in recent boom of attention to mechanisms of genome instability in RNA viruses.
In today’s biological systems, the genetic material making up the genome is primarily DNA. In contrast, a plethora of viruses that infect cellular hosts throughout all kingdoms rely upon RNA as their primary genetic material. Whichever the genetic material of the virus genome, there is the requirement that the genome remains stable to allow for the transmission of viable genetic material to progeny, and to prevent the extinction of species [3,4,5,6]. However, non-catastrophic levels of genome instabilities are instrumental for accumulating beneficial variants to prepare a species to meet the challenges of ever-changing environments and allow for downstream evolution [7,8,9,10]. Therefore, a balance between a stable genome and instances of genome instability must be met.
To date, there has been numerous studies into the stability/instabilities of DNA genomes, but the same level of research has not been performed for RNA. This disparity is important to note because RNA genomes are predicted to be a vital key to evolution, as prior to the last universal common ancestor (LUCA) the RNA world theory predicts the existence of protocells that used RNA as a genetic material. Further, RNA was also proposed to be used as an enzyme to mediate all metabolic functions (as proteins had not yet evolved) [11,12,13,14,15,16,17,18]. With the reliance upon RNA for the genetic material, as well as for cellular function, there must have been efficient replication of RNA within the cell, but the modes of replication of these protocells are not known. To get a better picture for how these protocells replicated, along with their likely sources of genome instability, remnants of the mechanisms of protocell replication may remain within the genomes of modern RNA viruses.
The modes of replication of modern viruses are known (discussed below in Section 2) and have been found to have the highest mutation rates per nucleotide among all biological species [19]. Viral RNA genomes are not as stable as DNA genomes, and this could be due to multiple factors including; special features of RNA genomes, RNA virus replication machinery, high selection pressure, and the susceptibility of viral RNA to environmental and/or endogenous lesions [20,21]. Thus, these instabilities of RNA virus genomes in turn should speed up their evolution. These evolutionary insights are especially important in the light of the current (at time of publishing) COVID-19 pandemic caused by the RNA Coronavirus SARS- CoV-2. Just how SARS-CoV-2 evolved to become transmissible between humans is not yet known but would require the introduction of variants into the genome through non-catastrophic events of genome instability to gain an evolutionary advantage [22,23]. How these key variants were introduced remains unclear, but in this review, we discuss possible and likely sources of genome instability that introduced these variants as well as highlight key remaining questions from the RNA world to SARS-CoV-2 pandemic.
2. Replication of RNA Genomes
2.1. RNA Protocell Replication
The first protocells of the RNA world theory had to replicate their genome to pass the genetic material on to progeny, but the mechanism of how they replicated remains unclear. At that time, there was an inability to transcribe proteins, so the current mechanisms that organisms use to replicate their genome would not be available. Alternatively, replication of protocell’s genome potentially could proceed via a ribozyme, but this would require the presence of two RNA molecules per cell, which is unlikely in the very first protocells, as having two copies of RNA -one for the genome and one for the actual replicase ribozyme- arising independently in the same early protocell is low [13,24], so, prior to ribozyme evolution, there must have been a mechanism for replicating RNA independent of ribozymes. However, the development of a model for non-enzymatic replication of RNA genomes is at best in its infancy. Specifically, the problem is that, to date, a long-tract non-enzymatic RNA replication mechanism in nature has yet to be found (reviewed in [11]), which is further compounded by the difficulty to separate long stretches of replicated RNA strands, should long tract RNA be synthesized. This inability lends to the likelihood of a hypothesis that, instead of a long tract synthesis, a shorter type of synthesis is utilized [13,25]. This idea has its own problems based on the ends of the replicated genome. A non-enzymatic replication would be required to begin at one end and continue through the other end, which would require an improbable standard oligo for all replication events to act as a primer at the very beginning of the genome. While the other problem arises at the terminal end where the last base is added at a low efficiency in what is called the “last base addition problem” [26] due to the dinucleotide intermediates typically requiring two bases to extend. These two problems can be resolved if the RNA genome template was instead a circle, as there would be no beginning and have no end. Nevertheless, replication of a ssRNA circle to form a dsRNA circle would require efficient long tract RNA synthesis that, should it be successful, would cause the circle to become highly strained due to the high bending energy of dsRNA. Therefore, the early protocells likely would not have a circular RNA genome.
Even with the stated problems above, Szostak and colleagues proposed a new model for replicating “primordial RNA genomes” through what they call a virtual circular genome [11]. This virtual circular genome contains multiple oligos that cover the entire circular RNA genome. Replication of the virtual circular genome will come after the annealing of the oligos and allow for templated addition of activated monomers, dimers, or trimers to allow for extension of the oligo. These oligos can then switch templates to allow for a continued elongation of the oligos to slowly replicate the entirety of the genome, allowing newly synthesized genetic material to pass on to progeny. Further, this mode of replication would also offer the ability for more than one copy of the RNA genome to be present in the same protocell, opening the door to the evolution of ribozymes. This evolution of RNA to form a ribozyme was studied in an in vitro assay [27] of a two domain RNA polymerase (B6.61) with the introduction of sequence variances at three distinct sites that then underwent selection. After 25 rounds of selection, what resulted were five families of polymerases able to bind a specific promotor and synthesize RNA. Together, this supports that the introduction of sequence variants into RNA can evolve to gain a function and that these RNA molecules can act as a ribozyme polymerase, and likely underlies the mechanism for RNA genome replication in the later protocells.
These replicating protocells set the track for evolution that ultimately (after many evolutionary steps) arrived at where we are today. Though, we do not know the intermediate steps between the protocells to today, it is still possible that remnants of these protocells remain as viral pathogens.
2.2. Single-Strand and Double-Strand RNA Virus Genomes
Generally, there are three classes of RNA viruses, and the key feature for separating the classes is based on the state the viral RNA genome (i.e., RNA packaged into the virion) is present [28,29]. The first class of RNA viruses maintains their genomes as double stranded (ds) RNA, examples include rotaviruses and reoviruses, while the other two classes are single stranded (ss). The single-stranded RNA viruses are further separated by the polarity of the genomic strand to be either negative (−) or positive (+), examples include for negative strand ssRNA viruses influenza, measles, mumps, rubella, Ebola, and Zika virus, and for positive stranded viruses include poliovirus, MERS-CoV. A further example of a positive-strand virus of particular interest is SARS-CoV-2.
With these viruses being maintained differently requires different modes of replication (reviewed in [30]). Specifically, in positive-strand ssRNA viruses, RNA dependent RNA polymerase (RdRp) uses the positive genomic strand as a template to create a new negative strand copy (the anti-genome) that is subsequently used as a template to create large numbers of positive-strand viral RNA genomes (Figure 1). Alternatively, in negative-strand RNA viruses RdRp uses the genomic negative-strand as a template to create positive-strand antigenome, that also serve as mRNAs. RdRp subsequently uses the positive-strand anti-genome as a template to create large numbers of negative-strand viral RNA genomes (Figure 1). dsRNA viruses replicate their genome differently by generating positive-strand mRNAs (templated by the dsRNA genome) that are also used by RdRp as a template to create dsRNA genomes packaged into new virus particles (Figure 1). It should be noted that each virus type (even dsRNA viruses) relies upon multiple copies of ssRNA as intermediates of replication. This is important because, unlike dsRNA, the bases in ssRNA are not paired leaving the base exposed, therefore these ssRNA may be more prone to lesions such as base alkylation or chemical base deamination [31,32]. These lesions can stem from multiple sources (discussed at length in Section 5), but the major causes of lesions addressed in this review are the cytosine deaminases APOBECs and the adenine deaminases, ADARs.
Both APOBECs and ADARs are part of the innate immunity and act as an antiviral by inducing base substitutions in viral genomes [33,34,35,36]. The polarity of the induced lesions in the resulting genomes depends on the virus class and their mode of replication. (Figure 1). In positive-strand ssRNA and in dsRNA viruses the predominant ssRNA species is the positive-strand, which is also an mRNA translated into viral proteins (Figure 1A,C). In negative-strand ssRNA viruses (Figure 1B), the negative RNA strand is more abundant. This bias in strand abundance can affect mutation accumulation bias, which may then be detected as mutation spectra strand bias.
3. Viral RNA Genome Rearrangements
Replication of RNA genomes in viruses is rarely perfect, often there is the introduction of errors [37] (discussed in Section 4) as well as the incomplete replication of the genome. When a genome is incompletely replicated it is either left unrepaired and degraded, or it is repaired by recombination with another RNA molecule, (reviewed in [38]). Viral recombination occurs at high rates between 2–20% recombination events per 100 nucleotides [39,40,41,42,43,44], and the rate of recombination is dependent on the fidelity of the RdRp [45,46,47,48]. Specifically, RdRps with a high fidelity are associated with a low recombination rate, while RdRps with low fidelity are associated with high levels of recombination [45,46,47,48]. This is due to the major RNA viral recombination mechanism being initiated by faulty, or incomplete, viral replication.
Replicative RNA recombination begins when the viral RdRp stalls during replication of the viral genome followed by the dissociation of the newly synthesized RNA molecule from the template that subsequently binds to another template where it is used as a primer to begin synthesis (Figure 2A), reviewed in [38]. The synthesis continues through the end of the template to complete the RNA molecule (Figure 2A) [41,42,49,50,51,52]. After RdRp stalling and dissociation, should the RNA find its homologous sequence in another identical RNA genome (Figure 2(Ai)), then the resulting genome will be identical to the previous template [41,42,49,50,51,52,53]. However, if the molecule that is utilized as the template is a completely different RNA molecule, which is possible due to the high number of mRNAs in the cell along with the possibility of infection by other RNA viruses, it will result in a chimeric RNA molecule containing parts of both RNA sequences (Figure 2(Aii)). This creates a novel viral genome, and should the recombination event include new beneficial genes, there may be the increased fitness of the virus. Such increase in fitness can be a major driver of evolution as well as may create new viral disease.
Non-replicative RNA recombination is a much rarer form of RNA recombination, occurring independent of RdRp, where two molecules are joined at their ends to create a chimeric molecule (Figure 2B) [54,55,56,57,58,59,60,61,62,63]. These events have been documented by modern sequencing approaches in viruses incapable of replication, but the mechanism for their formation is not yet known. Even so, a possible avenue for research may come from the many cell types that are able to recombine mRNA molecules through RNA splicing or RNA self-splicing to excise introns using small nuclear ribonucleoproteins (snRNPs) or through ribozymes [64,65,66]. Potentially, non-replicative recombination could use these mechanisms to join two different RNA molecules, instead of its more traditional function of excising introns. Regardless of the mechanism of recombination, what results is the formation of chimeric molecules that can become a novel viral RNA genome that may combine beneficial traits that helps with the virus’s overall fitness.
4. RNA Replication Errors
4.1. The RdRp’s Sequence Variation Effect on Replication Fidelity
RNA viral evolution has resulted in a diverse population of virus types that inevitably contain different combinations of genes within each virus, but present in all classes of viruses is RdRp. [67,68]. Though RdRp is conserved throughout RNA viruses, the overall RdRp sequence is highly variable, with some sequence conservation as low as 10% [69,70]. However, within this variable sequence of the RdRp, there are seven domains that contain key conserved residues. Specifically, the domains that are directly involved in ribonucleotide selection or catalysis contain conserved aspartate or lysine residues that when disrupted, greatly alters the RdRp activity [69,70]. The remaining sequence in the seven domains are not conserved to the same extent, but each specific sequence variation can modify RdRp function. Together, these sequence variations observed results in the variable RdRp replication fidelity, reviewed in [68], which is already orders of magnitude less accurate as compared to DNA replication.
4.2. RdRp’s Replication Fidelity Impacts Viral Evolution
It was argued that the low fidelity of the RdRp drives the evolution of RNA viruses [71,72]. This idea was supported in multiples studies where viruses (influenza A virus, papilloma virus, foot-and-mouth disease viruses, Chikungunya virus, and human enterovirus 71) were exposed to nucleotide analogs (that increases the mutation rate, typically used as an antiviral strategy [73,74,75,76]) and after a few passages, a subpopulation emerged that became resistant through the acquisition of a mutant RdRp that has a higher replication fidelity [77,78,79,80,81]. Together, this suggests that a high mutation rate can mediate the formation of advantageous mutations that can drive evolution, but it also suggests that a higher replication fidelity can result in a more stable virus propagation. The latter notion is supported by viral strains that contained high fidelity RdRp variants continued stable propagation, however, they ultimately become attenuated [82,83].
With such a high mutation rate observed in viruses, there is a selection for smaller genomes because a larger genome would have more opportunities for the acquisition of a deleterious mutation resulting in “error catastrophe” [84,85,86]. Consequently, there is a balance of mutagenesis to be high enough to allow for adaptation, but low enough to be able to maintain a complex genome and prevent error catastrophe. This is believed to be a selection factor causing a tendency to limit of the size of the genome—for most RNA viruses to be around 15 kb in length [85,86]. Nevertheless, the Nidovirales family of viruses have RNA genomes upwards of 30 kb (maximum of 41 kb) [87,88] which is twice as large as a majority of viruses. A reason for the large viral genome size in Nidovirales remained unclear until the Gorbelyna group identified a sequence encoding a 3′ to 5′ exoribonuclease inside the SARS-CoV nsp14 subunit (called ExoN and referred to as so from here on), and speculated this is what could allow for the increase in genome size by proofreading RdRp errors and thereby reducing a chance of error catastrophe [89,90]. Subsequently, the 3′ to 5′ exoribonuclease function of ExoN was found in vitro and ExoN was also found to be essential for the viability of the alphacoronavirus HCoV-229E [91]. Similar experiments were conducted in ExoN-knockout mutants of two betacoronaviruses, MHV and SARS-CoV viruses [92,93], and found that the viruses were still viable, but to a much lower extent as compared to wild type viruses. Also, the ExoN-knockout mutants were deemed to have a “mutator phenotype” as they had a 15 to 21x increase, respectively, in mutations, as compared to wild type ExoN strains, approximately reaching the mutation frequency of other “non-nidovirales” viruses [92,93]. Together, this indicated that ExoN may act by proofreading RNA synthesis, which was later supported by the findings of that ExoN can excise mismatched nucleotides from a double-stranded RNA substrate [94,95].
5. Lesion-Induced Mutagenesis in Viral RNA Genomes
5.1. Environmental and Endogenous RNA Lesions and Modifications
Viral genomic RNAs as well as cellular RNAs are the subject to environmental and endogenous lesions. These lesions can result in RNA breakage or can block RNA, where the broken RNA genomes would be either lost or participate in recombination like events which can in turn produce rearranged genomes (see Section 3 and Figure 2). Like DNA, RNA base lesions and modifications can be caused by a variety of endogenous and exogenous agents (Table 1 and [31,32,96,97,98,99,100,101]). However, unlike for DNA, most base lesions in RNA cannot be repaired. The only known exception is for some alkylation products of cytosine and adenine bases which can be reversed to normal bases by a special class of oxidative demethylases—AlkB in bacteria or ALKBH family in humans [100,101,102]. Whether viruses utilize AlkB (or other similar proteins) to reverse alkylated bases remains unclear. The Flexiviridae plant viruses do encode for an AlkB sequence, gained through horizontal gene transfer, that when transcribed reverses alkylated bases within RNA and DNA, with a preference for RNA [103,104]. This suggests that AlkB does play a role in viruses, but a majority of viruses do not encode for an AlkB. It is possible, however, that viruses utilize the host’s AlkB instead of encoding for their own, but more work will need to be done to understand the role of AlkB to reverse alkylated bases in viruses.
Table 1.
Canonical Base | Modified Base | Possibility of Non-Enzymatic Generation | Enzymatic Generation | Enzymatic Reversal | Found in VIRUSES |
---|---|---|---|---|---|
Adenine | N1-methyladenosine (m1A) | SN2-alkylation | Methyltransferases | AlkB/ALKBH | Yes |
Adenine | N3-methyladenosine (m3A) | SN2-alkylation | Not known | AlkB/ALKBH | TBD |
Adenine | N6-methyladenosine (m6A) | Isomerization of m1A | Methyltransferases | AlkB/ALKBH | Yes |
Adenine | Hypoxanthine base (Inosine ribonucleotide, I) | Low rate spontaneous deamination | Adenine Deaminases Acting on RNA (ADAR) | None | Yes |
Uracil | Pseudouridine (Ψ) | Not known | Pseudouridine synthase | None | Yes |
Guanine | N1-methylguanosine (m1G) | SN2-alkylation | Methyltransferases | AlkB/ALKBH | Yes |
Guanine | N3-methylguanosine (m3G) | SN2-alkylation | Not known | Not known | TBD |
Guanine | N7-methylguanosine (m7G) | SN2-alkylation | Methyltransferases | Not known | Yes |
Guanine | O6-methylguanosine (O6mG) | SN1-alkylation | Not known | Not known | TBD |
Cytosine | N1-methylcytosine (m1C) | SN2-alkylation | Not known | Not known | TBD |
Cytosine | N3-methylcytosine (m3C) | SN2-alkylation | Not known | AlkB/ALKBH | TBD |
Cytosine | N5-methylcytosine (m5C) | Not known | Methyltransferases | Not known | Yes |
Cytosine | Uracil | Spontaneous deamination | APOBEC/AID | None | Yes |
There have been 170 different RNA modifications identified in RNA across all species, with 60 of those being identified in eukaryotes [107]. Many of these RNA modifications have multiple functional consequences [99,108,109] and altogether are referred as the RNA-editome, or as the epitranscriptome. Investigations into the RNA modifications in viruses are still in its infancy, but even so, several enzymatic modifications of viral RNA bases have been reported (Table 1 [31,32,96,97,98,99,100,101]). Moreover, it is likely that there are additional modifications within RNA viruses are yet to be identified. Further, some of the identified RNA modifications have been reported to have physiological functions in RNA viruses, reviewed in [97]. Specifically, when uridines are replaced with pseudouridine in the Hepatitis C virus there is the disruption of the host interferon-β immune response, allowing for easier viral propagation [110]. Further, pseudouridine synthase is essential for the life cycle of Flaviviridae family of viruses indicating an essential role for pseudouridine in viruses [111]. The physiological functions of the N6-methyladenosine can affect viral replication where a low level of N6-methyladenosine leads to an increase in viral replication [97,105], while the increase of N6-methyladenosine results in less viral replication [112]. N5-methylcytosine, has been shown to be a positive regulator of viral replication [113] along with affecting the host immune response by binding to a recognition receptor but failing to initiate the antiviral cascade [114]. Lastly, hypoxanthine occurs in virus genomes after ADARs deaminate adenosine to inosine (hypoxanthine containing nucleoside) and impacts virus’ genomic sequence as inosine is copied as a guanine [115]. Though, these base modifications have been identified, their full mutagenic potential is yet to be determined. It is even not clear, if they are present in the full scale replicating viral genome or only in non-replicating viral mRNAs [105].
So far, only two kinds of enzymatic RNA edits—cytidine to uridine (C to U) by APOBEC cytidine deaminases (Figure 3) and adenosine to inosine (A to I) by ADAR adenosine deaminases (Figure 3) are known to be carried into copies of viral genomes resulting in C to U and A to G mutations respectively [116,117]. Henceforth, the next section will discuss APOBEC cytidine deaminases and ADAR adenosine deaminases as they may have the greatest impact on mutation accumulation in several human RNA viruses.
5.2. Base Substitution Mutagenesis in RNA Viruses
The introduction of base substitutions by APOBEC or ADAR is a common anti-viral strategy as excessive numbers of mutations is likely to result in inviable viral genomes [33,34,35,36]. However, base substitutions to a non-catastrophic level is an important source for viral evolution and population dynamics by introducing novel sequence variants [4,19,21]. These base substitutions are also a common avenue for viruses to escape the host’s adaptive immune system [33,34,35,36,118]. Thus, it is important to identify mechanisms underlying the generation of base substitutions in viral populations. Usual approaches to understanding the mutagenic mechanisms underlying genome mutations comes from a combination of knowledge accumulated in model studies as well as from agnostic documenting features of mutational spectrum that deviate from the spectrum expected if mutagenesis would be completely random [119,120,121]. Such “non-random” features of mutational spectra are also called mutational motifs (by analogy with musical motifs, which combine notes according to the rules of harmony) or mutational signatures (multiple set of features defining uniqueness of an object). This synthetic approach turned to be productive in revealing the mutagenic mechanisms in human cancers [119,120,121] through the use of cancer mutation catalogs—complete lists of de novo mutations in genomes of individual human tumors.
A similar strategy that was applied to human cancers can now be applied to several RNA viruses, especially because of the accumulation of extensive sequencing data for multiple viruses, most notably including SARS-CoV-2, which was a subject of a gigantic genome resequencing effort across the world [122] (see also URL https://www.gisaid.org/, accessed on 15 April 2021). However, since viral genomes are small, each genome would contain a small number, or even no mutations, so a mutation catalogue representative of a population (or a species) could only be built from sequences of multiple genomes. Building such a catalog must be done under the consideration of important factors to allow meaningful statistical evaluation of mutational spectra and signatures [116,117]. Firstly, an important point is that the mutations in the catalog must represent independent changes rather than a small number of events that are amplified through the development of a population, or through species evolution. A simplified example is shown in (Figure 4) where a viral population starting from a single genome accumulates mutations over nine rounds of copying (Figure 4) shows the importance of independent changes as many mutations in these genomes are identical, not because they occurred multiple times, but instead because they stem from a single genome that is then propagated.
Another important factor for the development of a mutational catalog is the knowledge of the original genome sequence from which all other genomes stem. This is required to identify the mutant alleles in every sequenced viral genome as well as to identify the directionality of the mutation events. The latter is especially important for defining mutational events across long term population dynamics or during species evolution. In the cases of identifying mutational signatures in mammalian RNA viruses, a single reference sequence for the entire dataset is not available, so the reference roles are assigned to the sequences in the nodes of phylogenetic trees built for genomes of isolates from a population [123] distant isolates of a single virus species [117,124,125] or from several related quasi-species [126].
Each node is taken as a surrogate of the reference sequence for the genomes in the same clade of a phylogenetic tree. With the surrogate reference sequence established, it can be utilized to develop a mutational catalog. This approach allowed for the detection of C to U changes as a prominent or even the major component of mutagenesis in a wide range of mammalian RNA viruses [127].
One more factor to account for in the analysis of mutagenesis results is strand bias of a particular change [116]. In viruses this bias will depend on a preference of a base modifying factor to single-stranded (ss) or to double-stranded (ds) RNA. It will also be affected by the kind of RNA forming genome of a virus: positive-strand ssRNA, negative-stranded ssRNA, or dsRNA (Figure 1A–C). The base changes shown on all panels of Figure 1 are the same as C to U changes expected from ssRNA-specific cytidine deaminases APOBEC and from A to I dsRNA-specific adenine deaminases ADAR (specific modification to the bases shown in Figure 4). Interestingly, the spectra and strand preference of the two most prevailing kinds of changes in hypermutated isolates of human vaccine-derived rubella virus corresponded to the prevailing C to U, and U to C, changes in genomic strand of this positive-strand ssRNA virus shown on Figure 1A [117]. These hypermutated viruses (up to 300 base substitutions in a 9 kb genome) were extracted from granulomas of different children with primary immunodeficiency. Each independent virus isolate stemmed from the attenuated rubella vaccine virus; whose known original sequence was used as a reference to build a mutation catalog from six isolates that contained 993 mutations. C to U, or A to G, changes were the major mutations in the catalog. Such a pattern of mutations in the rubella vaccine virus mutation catalog matches the signature of APOBEC cytidine deaminases and supports the idea that APOBEC enzymes are the major mutator. Recently, it has been established that APOBEC3A (A3A) enzyme has a preference to the unpaired parts (loops) of folded RNA structures in mRNAs of human tumors [128]. Importantly, C to U changes in the positive-strand in hypermutated rubella genomes were the only type of base substitution that showed a statistically significant density increase in predicted RNA-loops over stems—sequences with a potential for self-pairing [116]. This lends support for APOBEC mutation being the source for C to U changes, however both strand bias and unpaired loop preference could be the feature of any agent causing chemical deamination of cytidines in RNA.
5.3. Mutation Signature Analysis in RNA Viruses
Deamination of cytosines not only occurs in RNA, cytidine deamination in DNA is one of the most frequent spontaneous changes and also has a preference to ssDNA [32], and all APOBECs with cytidine deaminase activity show clear preference to immediate nucleotide context surrounding deaminated cytidines in DNA. Specifically, APOBEC3G has a preference to cCn context (mutated nucleotide capitalized; n—any nucleotide), while APOBEC1 and all other members of APOBEC3 gene cluster prefer tCn deamination motif [129,130,131,132]. The preferred DNA deamination motif for APOBEC3A and APOBEC3B was even narrowed to tCa [133]. Unlike for DNA, detailed editing signatures in RNA are yet to be established. We therefore used APOBEC signature motifs established for DNA to evaluate the mutation spectra in a catalog compiled of hypermutated rubella genomes (Figure 5 shows example for the uCa→uUa motif).
This method was initially developed for evaluating APOBEC mutagenesis in human cancers [134], however it allows statistical estimate of over-representation with any oligonucleotide motif in mutation datasets [133,135,136]. A fraction of mutations in an oligonucleotide motif among mutations of a given nucleotide is compared with the presence of the same oligonucleotide in the genomic context surrounding mutated bases (see also Figure 5 and legend). We found a high level of enrichment with APOBEC motif uCn and even greater enrichment with the narrower uCa motif which is also the most preferred DNA editing motif for APOBEC3A and APOBEC3B [117]. Unlike for APOBEC editing, there is only a multi-motif ADAR editing web-based Inosine-Predict score tool, which takes into account immediate nucleotide context for every guanosine position as well as the potential to form a secondary structure. This tool was developed for ADAR editing in mRNAs [137]. There was slight, albeit statistically significant increase in Inosine-Predict score for adenine positions involved in A to G mutations (U to C in complementary strand) mutations as compared to non-mutated positions of As (or Us) [117].
Altogether, APOBEC-like and ADAR-like changes represented 86% of the 993 mutations in the catalog from six hypermutated genomes of vaccine derived rubella virus. We then applied similar, but yet extended analytical and statistical evaluation approaches to evaluate mutation load and spectrum accumulated from over 30,000 SARS-CoV-2 genome sequences accumulated during first several months of pandemic [116]. In this analysis, we compared the spectrum and signatures with hypermutated isolates of rubella virus. The unique feature of this dataset is that the starting reference sequence is well defined [138], so each difference from the reference is a direct trace of a mutation event. However, some mutations were found in several thousands of isolates. We therefore used a set of non-duplicated mutations to represent the summary of mutational processes operating in pandemic SARS-CoV-2 population. We found that mutational processes with the same signatures that were revealed in hypermutated rubella isolates also may operate in the SARS-CoV-2 pandemic population. The main similarities between SARS-CoV-2 and rubella were: (i) the presence of APOBEC-like signature uCn to uUn in positive strand; (ii) frequent presence of ADAR like A to G and U to C (shown as in positive strand, will correspond to A to G in the negative-strand); (iii) preference of loops vs. stems for C to U mutations. Also specific to SARS-CoV-2 were the statistically significant enrichment with the two additional trinucleotide-centered signatures. Firstly, there was enrichment with mutations in cGn to cAn (reverse complement for nCg to nUg) which could reflect increased frequency of cytosine deamination in CpG motifs and C to U changes in the RNA negative-strand of dsRNA intermediate producing multiple copies of the positive-strand with the complementary G to A change (see example of negative strand mutagenesis in Figure 1A). Secondly, there was increased presence of G to U changes in the positive-strand, which could reflect C to A changes in the negative-strand. These changes could be caused by the increased formation of ROS-induced 8-oxoG within cells or during library preparation [139,140].
Recently, Adebali and colleagues performed an alternative approach to identify the mutational signature of SARS-CoV-2. They analyzed a large number of SARS-CoV-2 sequenced genomes organized in a phylogenetic tree [123]. Unlike in [116], this study used the nodes of the phylogenetic tree as a reference sequence to allow the identification of the mutations that dictate each node. Despite the two studies having used different strategies and approaches for creating datasets of independent mutation events in a large collection of SARS-CoV-2 genomes, the main categories of mutation preferences APOBEC-like, ADAR-like, CpG-like and apparent ROS induced mutations were similar. Overall similar conclusions about prevailing mutagenic sources and signatures were in works addressing intra-host variations of SARS-CoV-2 [141,142]. In summary, resequencing of RNA virus genomes suggested major mutational processes generating diversity that can lead to development of new virus forms. These studies also defined several questions and technical developments that should be addressed in near future.
6. Concluding Remarks and Future Questions
Emergence of new RNA viral quasispecies pathogenic for humans, especially the SARS-CoV-2 pandemic, triggered massive research efforts to all aspects of RNA virus mechanistic studies. Mechanisms underlying instability of RNA virus genomes are important for better prediction of their evolution, new pathogen emergence, and the development of antiviral drugs. Besides that, understanding biological and molecular mechanics that allows this group to flourish rather than be washed away with catastrophic error rates represents a fundamental question related to general mechanisms of evolution. Below are questions and technological applications that we anticipate being addressed soon.
6.1. Single-Molecule Sequencing Applied to RNA Virome
The sequence of a natural individual viral isolate is usually generated from a reference based or de novo alignment of multiple small Illumina reads, thus it does not reflect the variations of individual viral RNAs but instead is an average of the total population [143]. However, the recent combination of deep Illumina sequencing, and advanced bioinformatics, allows intra-host variations to be addressed in genomes of a single viral species during an acute infection period [141,142,144,145], so some interhost variation can be revealed. Even so, the combination of short reads with metagenomics does have its limitations as it relies on the building arbitrary contigs from short reads, so the entire genome cannot be assessed [146]. To overcome these short-read sequencing issues, there are two technologies carrying promise for studies of viral genome instability by generating long reads from individual polynucleotides, Oxford Nanopore Technology and Pacific Biosciences [147]. Each of the two platforms is plagued by a rather high sequencing error rate, but even with the current level of accuracy Oxford Nanopore was used for characterizing viromes [148,149,150]. Any further increase in sequencing accuracy may cause a revolution with viral genome instability research. While the field awaits this increase of accuracy of the core technologies, there is a way to reduce false positive mutation calls by adding unique molecule identifier (UMI) barcodes added by either limited number of PCR cycles or by ligation to increase the accuracy in both platforms [151].
6.2. Impact of RdRp Misincorporation and Proofreading onto Viral Mutation Rates
Low accuracy of RdRp, as compared with replicative DNA polymerases, led to the proposal that the major source of viral genome mutations is connected with replication errors [3,152]. Since many viruses have an exonuclease (ExoN or its homologs; see special section above) appearing to proofread RdRp misincorporation, it is important to collect more information about the impact of RdRp proofreading into prevention of hypermutation in RNA viruses. This would be approached by the modification of either, RdRp accuracy or ExoN capability by mutations and/or by endogenous or environmental factors. It is quite possible that the combination of such functional defects can lead to ultra-mutation phenotypes that would function similar to the synergistic hypermutation observed in cellular organisms when mismatch repair and DNA polymerase proofreading defects are combined [153,154,155]. Further, many antiviral drugs are chain terminating NTP analogs designed to preferentially affect chain extension by RdRp [73]. However, this chain termination can be counteracted by ExoN [90,156], so search for inhibitors of this enzyme is important for practical applications.
6.3. Are There RNA Repair Mechanisms besides AlkB Direct Reversal?
It is long known that RNA is more vulnerable to breakage as compared to DNA [157], and is at least just as susceptible as DNA to base lesions (Table 1 and references therein). However, unlike DNA, there is only one well established mechanism to repair RNA base lesions—direct reversal of alkylation. Currently there are no direct indications for the existence of other RNA repair mechanisms. Speculations can still be made based on structural similarity of RNA and DNA resulting in RNA being a substrate or a ligand for common DNA repair enzyme, e.g., RPA [158], but more research will be needed to reveal RNA repair mechanisms of they do indeed exist.
6.4. Impact of Environmental RNA Lesions onto Viral Genome Instability
RNA, the same as DNA, is the subject for the lesions caused by environmental factors. DNA base lesions caused by environmental insults are well studied. They are often an impediment to replicative DNA polymerases and require specialized trans-lesion synthesis (TLS) DNA polymerases to successfully accomplish genome duplication. TLS polymerases are often error-prone and results in mutagenesis, while the lack of TLS can lead to genome rearrangements or to replication failure [159,160]. However, the same cannot be said about RNA genomes as, per current knowledge, there have been no model studies addressing the impact of environmental RNA damage on structural or sequence integrity of RNA genomes. This leaves a major gap in knowledge in how environmental RNA base damage is repaired, and whether it affects the stability of RNA genomes. This information is paramount for the understanding the dynamic world of RNA viruses.
6.5. Individual Host Impact of RNA Lesions onto Viral Genome Instability
Recent studies indicated that adenosine deaminases ADAR and cytidine deaminases APOBEC are the prevailing sources of base substitutions in several human RNA viruses including SARS-CoV-2, summarized in Section 5. Both of these enzyme types are the part of the innate immunity, which raises a question about RNA virus hypermutation within a single individual. Interestingly, in individuals with primary immunodeficiency in the adaptive immune system hypermutation of the vaccine-derived rubella virus was reported, which could be the reason of excessive activation of innate immunity and, consequently, resulting in the excessive activation of APOBECs [117]. Another important question is about the level of endogenous hypermutation of RNA viruses in species that may serve reservoirs for the occurrence of new quasispecies. Specifically, bats have been a known coronavirus reservoir that have multiple (4-7) APOBEC3 homologs while most of other mammalian orders have only one or two versions of APOBEC3 [161,162]. Therefore, studies into APOBECs within these organisms may reveal insights into the formation of novel viruses.
6.6. Experimental Models to Define Signatures of Environmental and Endogenous Mutagenesis in RNA
Defining diagnostic mutational signatures can develop into a multiprong scalable approach to understanding sources and mechanism of mutagenesis in RNA viruses. Mutational signatures turned to be a productive approach for another set of unstable genomes—human cancer. This could be a pure agnostic analysis of large datasets of genome instability catalogs [119,121,163], which can be also combined with prior mechanistic knowledge about different types of mutagenesis [120]. Another approach is to collect knowledge about mutational signatures in defined systems—mammalian [164,165] or microbial [133,135], and then utilize the information to build a specific statistical hypothesis for interrogating datasets of natural variants. High rates of mutation and relative ease of RNA virus genome sequencing can certainly make these approaches productive and scalable.
Acknowledgments
We thank Hamed Bostan and Marcos Morgan for critical reading of the manuscript.
Author Contributions
Both listed authors (Z.W.K. and D.A.G.), contributed to the visualization, writing, and editing of the manuscript. All authors have read and agreed to the published version of the manuscript.
Funding
This research was supported by the US National Institute of Health Intramural Research Program Project Z1AES103266 to D.A.G.
Conflicts of Interest
Both listed Authors have declared no competing interests.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Dobzhansky T. Nothing in Biology Makes Sense except in the Light of Evolution. Am. Biol. Teach. 1973;35:125–129. doi: 10.2307/4444260. [DOI] [Google Scholar]
- 2.Dobzhansky T. Genetics and the Origin of Species/Theodosius Dobzhansky. 3rd ed. Columbia University Press; New York, NY, USA: 1951. [Google Scholar]
- 3.Domingo E. Virus as Populations. Academic Press; Boston, MA, USA: 2016. Molecular Basis of Genetic Variation of Viruses; pp. 35–71. [DOI] [Google Scholar]
- 4.Domingo E., Perales C. Viral quasispecies. PLoS Genet. 2019;15:e1008271. doi: 10.1371/journal.pgen.1008271. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Domingo E., Sabo D., Taniguchi T., Weissmann C. Nucleotide sequence heterogeneity of an RNA phage population. Cell. 1978;13:735–744. doi: 10.1016/0092-8674(78)90223-4. [DOI] [PubMed] [Google Scholar]
- 6.Domingo E., Sheldon J., Perales C. Viral quasispecies evolution. Microbiol. Mol. Biol Rev. 2012;76:159–216. doi: 10.1128/MMBR.05023-11. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Eigen M., Schuster P. The hypercycle. A principle of natural self-organization. Part A: Emergence of the hypercycle. Naturwissenschaften. 1977;64:541–565. doi: 10.1007/BF00450633. [DOI] [PubMed] [Google Scholar]
- 8.Fornes J., Tomas Lazaro J., Alarcon T., Elena S.F., Sardanyes J. Viral replication modes in single-peak fitness landscapes: A dynamical systems analysis. J. Theor. Biol. 2019;460:170–183. doi: 10.1016/j.jtbi.2018.10.007. [DOI] [PubMed] [Google Scholar]
- 9.Schuster P. Quasispecies on Fitness Landscapes. Curr. Top. Microbiol. Immunol. 2016;392:61–120. doi: 10.1007/82_2015_469. [DOI] [PubMed] [Google Scholar]
- 10.Swetina J., Schuster P. Self-replication with errors. A model for polynucleotide replication. Biophys. Chem. 1982;16:329–345. doi: 10.1016/0301-4622(82)87037-3. [DOI] [PubMed] [Google Scholar]
- 11.Zhou L., Ding D., Szostak J.W. The virtual circular genome model for primordial RNA replication. RNA. 2021;27:1–11. doi: 10.1261/rna.077693.120. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Joyce G.F., Szostak J.W. Protocells and RNA Self-Replication. Cold Spring Harb. Perspect. Biol. 2018;10 doi: 10.1101/cshperspect.a034801. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Szostak J.W., Bartel D.P., Luisi P.L. Synthesizing life. Nature. 2001;409:387–390. doi: 10.1038/35053176. [DOI] [PubMed] [Google Scholar]
- 14.Jheeta S. The Routes of Emergence of Life from LUCA during the RNA and Viral World: A Conspectus. Life. 2015;5:1445–1453. doi: 10.3390/life5021445. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Robertson M.P., Joyce G.F. The origins of the RNA world. Cold Spring Harb. Perspect. Biol. 2012;4 doi: 10.1101/cshperspect.a003608. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Dworkin J.P., Lazcano A., Miller S.L. The roads to and from the RNA world. J. Theor. Biol. 2003;222:127–134. doi: 10.1016/S0022-5193(03)00020-1. [DOI] [PubMed] [Google Scholar]
- 17.Gilbert W. Origin of Life-the Rna World. Nature. 1986;319:618. doi: 10.1038/319618a0. [DOI] [Google Scholar]
- 18.Sankaran N. The RNA World at Thirty: A Look Back with its Author. J. Mol. Evol. 2016;83:169–175. doi: 10.1007/s00239-016-9767-3. [DOI] [PubMed] [Google Scholar]
- 19.Moya A., Holmes E.C., Gonzalez-Candelas F. The population genetics and evolutionary epidemiology of RNA viruses. Nat. Rev. Microbiol. 2004;2:279–288. doi: 10.1038/nrmicro863. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Gago S., Elena S.F., Flores R., Sanjuan R. Extremely high mutation rate of a hammerhead viroid. Science. 2009;323:1308. doi: 10.1126/science.1169202. [DOI] [PubMed] [Google Scholar]
- 21.Duffy S., Shackelton L.A., Holmes E.C. Rates of evolutionary change in viruses: Patterns and determinants. Nat. Rev. Genet. 2008;9:267–276. doi: 10.1038/nrg2323. [DOI] [PubMed] [Google Scholar]
- 22.Abdel-Moneim A.S., Abdelwhab E.M. Evidence for SARS-CoV-2 Infection of Animal Hosts. Pathogens. 2020;9:529. doi: 10.3390/pathogens9070529. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Andersen K.G., Rambaut A., Lipkin W.I., Holmes E.C., Garry R.F. The proximal origin of SARS-CoV-2. Nat. Med. 2020;26:450–452. doi: 10.1038/s41591-020-0820-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Orgel L.E. The origin of life--a review of facts and speculations. Trends Biochem. Sci. 1998;23:491–495. doi: 10.1016/S0968-0004(98)01300-0. [DOI] [PubMed] [Google Scholar]
- 25.Szostak J.W. An optimal degree of physical and chemical heterogeneity for the origin of life? Philos. Trans. R. Soc. B Biol. Sci. 2011;366:2894–2901. doi: 10.1098/rstb.2011.0140. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Wu T., Orgel L.E. Nonenzymatic template-directed synthesis on oligodeoxycytidylate sequences in hairpin oligonucleotides. J. Am. Chem. Soc. 1992;114:317–322. doi: 10.1021/ja00027a040. [DOI] [PubMed] [Google Scholar]
- 27.Cojocaru R., Unrau P.J. Processive RNA polymerization and promoter recognition in an RNA World. Science. 2021;371:1225–1232. doi: 10.1126/science.abd9191. [DOI] [PubMed] [Google Scholar]
- 28.Koonin E.V., Senkevich T.G., Dolja V.V. The ancient Virus World and evolution of cells. Biol. Direct. 2006;1:29. doi: 10.1186/1745-6150-1-29. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Baltimore D. Expression of animal virus genomes. Bacteriol. Rev. 1971;35:235–241. doi: 10.1128/br.35.3.235-241.1971. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Ortin J., Parra F. Structure and function of RNA replication. Annu. Rev. Microbiol. 2006;60:305–326. doi: 10.1146/annurev.micro.60.080805.142248. [DOI] [PubMed] [Google Scholar]
- 31.Beranek D.T. Distribution of methyl and ethyl adducts following alkylation with monofunctional alkylating agents. Mutat. Res. 1990;231:11–30. doi: 10.1016/0027-5107(90)90173-2. [DOI] [PubMed] [Google Scholar]
- 32.Lindahl T. Instability and decay of the primary structure of DNA. Nature. 1993;362:709–715. doi: 10.1038/362709a0. [DOI] [PubMed] [Google Scholar]
- 33.Sobieszczyk M.E., Lingappa J.R., McElrath M.J. Host genetic polymorphisms associated with innate immune factors and HIV-1. Curr. Opin. HIV AIDS. 2011;6:427–434. doi: 10.1097/COH.0b013e3283497155. [DOI] [PubMed] [Google Scholar]
- 34.Ross S.R. Are viruses inhibited by APOBEC3 molecules from their host species? PLoS Pathog. 2009;5:e1000347. doi: 10.1371/journal.ppat.1000347. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Emerman M., Malik H.S. Paleovirology--modern consequences of ancient viruses. PLoS Biol. 2010;8:e1000301. doi: 10.1371/journal.pbio.1000301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Daugherty M.D., Malik H.S. Rules of engagement: Molecular insights from host-virus arms races. Annu. Rev. Genet. 2012;46:677–700. doi: 10.1146/annurev-genet-110711-155522. [DOI] [PubMed] [Google Scholar]
- 37.Domingo E. Virus as Populations: Composition, Complexity, Quasispecies, Dynamics and Biological Implications. 2nd ed. Academic Press; Cambridge, MA, USA: 2016. Molecular basis of genetic variation of viruses: Error-prone replication; pp. 35–71. [DOI] [Google Scholar]
- 38.Agol V.I., Gmyl A.P. Emergency Services of Viral RNAs: Repair and Remodeling. Microbiol. Mol. Biol. Rev. 2018;82 doi: 10.1128/MMBR.00067-17. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Lai M.M.C. Genetic-Recombination in Rna Viruses. Curr. Top. Microbiol. Immunol. 1992;176:21–32. doi: 10.1007/978-3-642-77011-1_2. [DOI] [PubMed] [Google Scholar]
- 40.Levy D.N., Aldrovandi G.M., Kutsch O., Shaw G.M. Dynamics of HIV-1 recombination in its natural target cells. Proc. Natl. Acad. Sci. USA. 2004;101:4204–4209. doi: 10.1073/pnas.0306764101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Nagy P.D., Simon A.E. New insights into the mechanisms of RNA recombination. Virology. 1997;235:1–9. doi: 10.1006/viro.1997.8681. [DOI] [PubMed] [Google Scholar]
- 42.Sztuba-Solinska J., Urbanowicz A., Figlerowicz M., Bujarski J.J. RNA-RNA Recombination in Plant Virus Replication and Evolution. Annu. Rev. Phytopathol. 2011;49:415–443. doi: 10.1146/annurev-phyto-072910-095351. [DOI] [PubMed] [Google Scholar]
- 43.Urbanowicz A., Alejska M., Formanowicz P., Błażewicz J., Figlerowicz M., Bujarski J.J. Homologous crossovers among molecules of brome mosaic bromovirus RNA1 or RNA2 segments in vivo. J. Virol. 2005;79:5732–5742. doi: 10.1128/JVI.79.9.5732-5742.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Liao C.L., Lai M.M.C. Rna Recombination in a Coronavirus-Recombination between Viral Genomic Rna and Transfected Rna Fragments. J. Virol. 1992;66:6117–6124. doi: 10.1128/jvi.66.10.6117-6124.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Kempf B.J., Peersen O.B., Barton D.J. Poliovirus Polymerase Leu420 Facilitates RNA Recombination and Ribavirin Resistance. J. Virol. 2016;90:8410–8421. doi: 10.1128/JVI.00078-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Peersen O.B. Picornaviral polymerase structure, function, and fidelity modulation. Virus Res. 2017;234:4–20. doi: 10.1016/j.virusres.2017.01.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Woodman A., Arnold J.J., Cameron C.E., Evans D.J. Biochemical and genetic analysis of the role of the viral polymerase in enterovirus recombination. Nucleic Acids Res. 2016;44:6883–6895. doi: 10.1093/nar/gkw567. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Xiao Y., Rouzine I.M., Bianco S., Acevedo A., Goldstein E.F., Farkov M., Brodsky L., Andino R. RNA Recombination Enhances Adaptability and Is Required for Virus Spread and Virulence. Cell Host Microbe. 2016;19:493–503. doi: 10.1016/j.chom.2016.03.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Arnold J.J., Ghosh S.K., Cameron C.E. Poliovirus RNA-dependent RNA polymerase (3D(pol)). Divalent cation modulation of primer, template, and nucleotide selection. J. Biol. Chem. 1999;274:37060–37069. doi: 10.1074/jbc.274.52.37060. [DOI] [PubMed] [Google Scholar]
- 50.Kirkegaard K., Baltimore D. The mechanism of RNA recombination in poliovirus. Cell. 1986;47:433–443. doi: 10.1016/0092-8674(86)90600-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Romanova L.I., Blinov V.M., Tolskaya E.A., Viktorova E.G., Kolesnikova M.S., Guseva E.A., Agol V.I. The primary structure of crossover regions of intertypic poliovirus recombinants: A model of recombination between RNA genomes. Virology. 1986;155:202–213. doi: 10.1016/0042-6822(86)90180-7. [DOI] [PubMed] [Google Scholar]
- 52.Simon-Loriere E., Holmes E.C. Why do RNA viruses recombine? Nat. Rev. Microbiol. 2011;9:617–626. doi: 10.1038/nrmicro2614. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Arnold J.J., Gohara D.W., Cameron C.E. Poliovirus RNA-dependent RNA polymerase (3Dpol): Pre-steady-state kinetic analysis of ribonucleotide incorporation in the presence of Mn2+ Biochemistry. 2004;43:5138–5148. doi: 10.1021/bi035213q. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Adams S.D., Tzeng W.P., Chen M.H., Frey T.K. Analysis of intermolecular RNA-RNA recombination by rubella virus. Virology. 2003;309:258–271. doi: 10.1016/S0042-6822(03)00064-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Austermann-Busch S., Becher P. RNA structural elements determine frequency and sites of nonhomologous recombination in an animal plus-strand RNA virus. J. Virol. 2012;86:7393–7402. doi: 10.1128/JVI.00864-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Gmyl A.P., Agol V.I. Diverse Mechanisms of RNA Recombination. Mol. Biol. 2005;39:529–542. doi: 10.1007/s11008-005-0069-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Gmyl A.P., Belousov E.V., Maslova S.V., Khitrina E.V., Chetverin A.B., Agol V.I. Nonreplicative RNA recombination in poliovirus. J. Virol. 1999;73:8958–8965. doi: 10.1128/JVI.73.11.8958-8965.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Gmyl A.P., Korshenko S.A., Belousov E.V., Khitrina E.V., Agol V.I. Nonreplicative homologous RNA recombination: Promiscuous joining of RNA pieces? RNA. 2003;9:1221–1231. doi: 10.1261/rna.5111803. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Holmblat B., Jégouic S., Muslin C., Blondel B., Joffret M.-L., Delpeyroux F. Nonhomologous recombination between defective poliovirus and coxsackievirus genomes suggests a new model of genetic plasticity for picornaviruses. mBio. 2014;5:e01119-14. doi: 10.1128/mBio.01119-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Büning M.K., Meyer D., Austermann-Busch S., Roman-Sosa G., Rümenapf T., Becher P. Nonreplicative RNA Recombination of an Animal Plus-Strand RNA Virus in the Absence of Efficient Translation of Viral Proteins. Genome Biol. Evol. 2017;9:817–829. doi: 10.1093/gbe/evx046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Raju R., Subramaniam S.V., Hajjou M. Genesis of Sindbis virus by in vivo recombination of nonreplicative RNA precursors. J. Virol. 1995;69:7391–7401. doi: 10.1128/jvi.69.12.7391-7401.1995. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Scheel T., Galli A., Li Y.-P., Mikkelsen L.S., Gottwein J., Bukh J. Productive homologous and non-homologous recombination of hepatitis C virus in cell culture. PLoS Pathog. 2013;9:e1003228. doi: 10.1371/journal.ppat.1003228. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Schibler M., Piuz I., Hao W., Tapparel C. Chimeric rhinoviruses obtained via genetic engineering or artificially induced recombination are viable only if the polyprotein coding sequence derives from the same species. J. Virol. 2015;89:4470–4480. doi: 10.1128/JVI.03668-14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Wilkinson M.E., Charenton C., Nagai K. RNA Splicing by the Spliceosome. Annu. Rev. Biochem. 2020;89:359–388. doi: 10.1146/annurev-biochem-091719-064225. [DOI] [PubMed] [Google Scholar]
- 65.Zhang L., Vielle A., Espinosa S., Zhao R. RNAs in the spliceosome: Insight from cryoEM structures. Wiley Interdiscip. Rev. RNA. 2019;10:e1523. doi: 10.1002/wrna.1523. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Cech T.R. Self-splicing RNA: Implications for evolution. Int. Rev. Cytol. 1985;93:3–22. doi: 10.1016/s0074-7696(08)61370-4. [DOI] [PubMed] [Google Scholar]
- 67.Dolja V.V., Koonin E.V. Metagenomics reshapes the concepts of RNA virus evolution by revealing extensive horizontal virus transfer. Virus Res. 2018;244:36–52. doi: 10.1016/j.virusres.2017.10.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.te Velthuis A.J.W. Common and unique features of viral RNA-dependent polymerases. Cell Mol. Life Sci. 2014;71:4403–4420. doi: 10.1007/s00018-014-1695-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Bruenn J.A. Relationships among the Positive Strand and Double-Strand Rna Viruses as Viewed through Their Rna-Dependent Rna-Polymerases. Nucleic Acids Res. 1991;19:217–226. doi: 10.1093/nar/19.2.217. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Gorbalenya A.E., Pringle F.M., Zeddam J.-L., Luke B.T., Cameron C.E., Kalmakoff J., Hanzlik T.N., Gordon K., Ward V.K. The palm subdomain-based active site is internally permuted in viral RNA-dependent RNA polymerases of an ancient lineage. J. Mol. Biol. 2002;324:47–62. doi: 10.1016/S0022-2836(02)01033-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Holland J., Spindler K., Horodyski F., Grabau E., Nichol S., Vandepol S. Rapid evolution of RNA genomes. Science. 1982;215:1577–1585. doi: 10.1126/science.7041255. [DOI] [PubMed] [Google Scholar]
- 72.Novella I.S., Duarte E.A., Elena S.F., Moya A., Domingo E., Holland J.J. Exponential increases of RNA virus fitness during large population transmissions. Proc. Natl. Acad. Sci. USA. 1995;92:5841–5844. doi: 10.1073/pnas.92.13.5841. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.Seley-Radtke K.L., Yates M.K. The evolution of nucleoside analogue antivirals: A review for chemists and non-chemists. Part 1: Early structural modifications to the nucleoside scaffold. Antivir. Res. 2018;154:66–86. doi: 10.1016/j.antiviral.2018.04.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.De Clercq E. Antivirals and antiviral strategies. Nat. Rev. Microbiol. 2004;2:704–720. doi: 10.1038/nrmicro975. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 75.De Clercq E., Li G. Approved Antiviral Drugs over the Past 50 Years. Clin. Microbiol. Rev. 2016;29:695–747. doi: 10.1128/CMR.00102-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Jordheim L.P., Durantel D., Zoulim F., Dumontet C. Advances in the development of nucleoside and nucleotide analogues for cancer and viral diseases. Nat. Rev. Drug Discov. 2013;12:447–464. doi: 10.1038/nrd4010. [DOI] [PubMed] [Google Scholar]
- 77.Pfeiffer J.K., Kirkegaard K. A single mutation in poliovirus RNA-dependent RNA polymerase confers resistance to mutagenic nucleotide analogs via increased fidelity. Proc. Natl. Acad. Sci. USA. 2003;100:7289–7294. doi: 10.1073/pnas.1232294100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Sadeghipour S., Bek E.J., McMinn P.C. Ribavirin-resistant mutants of human enterovirus 71 express a high replication fidelity phenotype during growth in cell culture. J. Virol. 2013;87:1759–1769. doi: 10.1128/JVI.02139-12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Coffey L.L., Beeharry Y., Borderia A.V., Blanc H., Vignuzzi M. Arbovirus high fidelity variant loses fitness in mosquitoes and mice. Proc. Natl. Acad. Sci. USA. 2011;108:16038–16043. doi: 10.1073/pnas.1111650108. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Sierra M., Airaksinen A., González-López C., Agudo R., Arias A., Domingo E. Foot-and-mouth disease virus mutant with decreased sensitivity to ribavirin: Implications for error catastrophe. J. Virol. 2007;81:2012–2024. doi: 10.1128/JVI.01606-06. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Binh N.T., Wakai C., Kawaguchi A., Nagata K. Involvement of the N-terminal portion of influenza virus RNA polymerase subunit PB1 in nucleotide recognition. Biochem. Biophys. Res. Commun. 2014;443:975–979. doi: 10.1016/j.bbrc.2013.12.071. [DOI] [PubMed] [Google Scholar]
- 82.Liu X., Yang X., Lee C.A., Moustafa I.M., Smidansky E.D., Lum D., Arnold J.J., Cameron C.E., Boehr D.D. Vaccine-derived mutation in motif D of poliovirus RNA-dependent RNA polymerase lowers nucleotide incorporation fidelity. J. Biol. Chem. 2013;288:32753–32765. doi: 10.1074/jbc.M113.484428. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 83.Vignuzzi M., Wendt E., Andino R. Engineering attenuated virus vaccines by controlling replication fidelity. Nat. Med. 2008;14:154–161. doi: 10.1038/nm1726. [DOI] [PubMed] [Google Scholar]
- 84.Steinhauer D.A., Domingo E., Holland J.J. Lack of evidence for proofreading mechanisms associated with an RNA virus polymerase. Gene. 1992;122:281–288. doi: 10.1016/0378-1119(92)90216-C. [DOI] [PubMed] [Google Scholar]
- 85.Drake J.W., Holland J.J. Mutation rates among RNA viruses. Proc. Natl. Acad. Sci. USA. 1999;96:13910–13913. doi: 10.1073/pnas.96.24.13910. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Eigen M. Error catastrophe and antiviral strategy. Proc. Natl. Acad. Sci. USA. 2002;99:13374–13376. doi: 10.1073/pnas.212514799. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Bukhari K., Mulley G., Gulyaeva A.A., Zhao L., Shu G., Jiang J., Neuman B.W. Description and initial characterization of metatranscriptomic nidovirus-like genomes from the proposed new family Abyssoviridae, and from a sister group to the Coronavirinae, the proposed genus Alphaletovirus. Virology. 2018;524:160–171. doi: 10.1016/j.virol.2018.08.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 88.Saberi A., Gulyaeva A.A., Brubacher J.L., Newmark P.A., Gorbalenya A.E. A planarian nidovirus expands the limits of RNA genome size. PLoS Pathog. 2018;14:e1007314. doi: 10.1371/journal.ppat.1007314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 89.Snijder E.J., Bredenbeek P.J., Dobbe J.C., Thiel V., Ziebuhr J., Poon L.L., Guan Y., Rozanov M., Spaan W.J., Gorbalenya A.E. Unique and conserved features of genome and proteome of SARS-coronavirus, an early split-off from the coronavirus group 2 lineage. J. Mol. Biol. 2003;331:991–1004. doi: 10.1016/S0022-2836(03)00865-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 90.Ogando N.S., Ferron F., Decroly E., Canard B., Posthuma C.C., Snijder E.J. The Curious Case of the Nidovirus Exoribonuclease: Its Role in RNA Synthesis and Replication Fidelity. Front. Microbiol. 2019;10:1813. doi: 10.3389/fmicb.2019.01813. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Minskaia E., Hertzig T., Gorbalenya A., Campanacci V., Cambillau C., Canard B., Ziebuhr J. Discovery of an RNA virus 3 ’-> 5 ’ exoribonuclease that is critically involved in coronavirus RNA synthesis. Proc. Natl. Acad. Sci. USA. 2006;103:5108–5113. doi: 10.1073/pnas.0508200103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Eckerle L.D., Becker M.M., Halpin R.A., Li K., Venter E., Lu X., Scherbakova S., Graham R.L., Baric R.S., Stockwell T.B., et al. Infidelity of SARS-CoV Nsp14-Exonuclease Mutant Virus Replication Is Revealed by Complete Genome Sequencing. PLoS Pathog. 2010;6:e1000896. doi: 10.1371/journal.ppat.1000896. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Eckerle L.D., Lu X., Sperry S.M., Choi L., Denison M.R. High fidelity of murine hepatitis virus replication is decreased in nsp14 exoribonuclease mutants. J. Virol. 2007;81:12135–12144. doi: 10.1128/JVI.01296-07. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Bouvet M., Debarnot C., Imbert I., Selisko B., Snijder E., Canard B., Decroly E. In vitro reconstitution of SARS-coronavirus mRNA cap methylation. PLoS Pathog. 2010;6:e1000863. doi: 10.1371/annotation/a0dde376-2eb1-4ce3-8887-d29f5ba6f162. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 95.Bouvet M., Imbert I., Subissi L., Gluais L., Canard B., Decroly E. RNA 3’-end mismatch excision by the severe acute respiratory syndrome coronavirus nonstructural protein nsp10/nsp14 exoribonuclease complex. Proc. Natl. Acad. Sci. USA. 2012;109:9372–9377. doi: 10.1073/pnas.1201130109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 96.Pogolotti A.L., Jr., Ono A., Subramaniam R., Santi D.V. On the mechanism of DNA-adenine methylase. J. Biol. Chem. 1988;263:7461–7464. doi: 10.1016/S0021-9258(18)68520-5. [DOI] [PubMed] [Google Scholar]
- 97.Potuznik J.F., Cahova H. It’s the Little Things (in Viral RNA) mBio. 2020;11 doi: 10.1128/mBio.02131-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 98.Ge J., Yu Y.T. RNA pseudouridylation: New insights into an old modification. Trends Biochem. Sci. 2013;38:210–218. doi: 10.1016/j.tibs.2013.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 99.Schaefer M.R. The Regulation of RNA Modification Systems: The Next Frontier in Epitranscriptomics? Genes. 2021;12:345. doi: 10.3390/genes12030345. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Drabløs F., Feyzi E., Aas P.A., Vaagbø C.B., Kavli B., Bratlie M.S., Peña-Diaz J., Otterlei M., Slupphaug G., Krokan H.E. Alkylation damage in DNA and RNA--repair mechanisms and medical significance. DNA Repair. 2004;3:1389–1407. doi: 10.1016/j.dnarep.2004.05.004. [DOI] [PubMed] [Google Scholar]
- 101.Feyzi E., Sundheim O., Westbye M., Aas P., Vågbø C.B., Otterlei M., Slupphaug G., Krokan H. RNA base damage and repair. Curr. Pharm. Biotechnol. 2007;8:326–331. doi: 10.2174/138920107783018363. [DOI] [PubMed] [Google Scholar]
- 102.Thapar R., Bacolla A., Oyeniran C., Brickner J.R., Chinnam N.B., Mosammaparast N., Tainer J.A. RNA Modifications: Reversal Mechanisms and Cancer. Biochemistry. 2019;58:312–329. doi: 10.1021/acs.biochem.8b00949. [DOI] [PubMed] [Google Scholar]
- 103.Born E.V.D., Omelchenko M.V., Bekkelund A., Leihne V., Koonin E.V., Dolja V.V., Falnes P.Ø. Viral AlkB proteins repair RNA damage by oxidative demethylation. Nucleic Acids Res. 2008;36:5451–5461. doi: 10.1093/nar/gkn519. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 104.Bratlie M.S., Drablos F. Bioinformatic mapping of AlkB homology domains in viruses. BMC Genom. 2005;6 doi: 10.1186/1471-2164-6-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 105.Courtney D.G. Post-Transcriptional Regulation of Viral RNA through Epitranscriptional Modification. Cells. 2021;10:1129. doi: 10.3390/cells10051129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 106.Delaunay S., Frye M. RNA modifications regulating cell fate in cancer. Nat. Cell Biol. 2019;21:552–559. doi: 10.1038/s41556-019-0319-0. [DOI] [PubMed] [Google Scholar]
- 107.Esteve-Puig R., Bueno-Costa A., Esteller M. Writers, readers and erasers of RNA modifications in cancer. Cancer Lett. 2020;474:127–137. doi: 10.1016/j.canlet.2020.01.021. [DOI] [PubMed] [Google Scholar]
- 108.Frye M., Jaffrey S.R., Pan T., Rechavi G., Suzuki T. RNA modifications: What have we learned and where are we headed? Nat. Rev. Genet. 2016;17:365–372. doi: 10.1038/nrg.2016.47. [DOI] [PubMed] [Google Scholar]
- 109.Ontiveros R.J., Stoute J., Liu K.F. The chemical diversity of RNA modifications. Biochem. J. 2019;476:1227–1245. doi: 10.1042/BCJ20180445. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 110.Saito T., Owen D.M., Jiang F., Marcotrigiano J., Gale M., Jr. Innate immunity induced by composition-dependent RIG-I recognition of hepatitis C virus RNA. Nature. 2008;454:523–527. doi: 10.1038/nature07106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 111.Marceau C.D., Puschnik A.S., Majzoub K., Ooi Y.S., Brewer S.M., Fuchs G., Swaminathan K., Mata M.A., Elias K.S.J.E., Sarnow P., et al. Genetic dissection of Flaviviridae host factors through genome-scale CRISPR screens. Nature. 2016;535:159–163. doi: 10.1038/nature18631. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 112.Bradrick S.S. Causes and Consequences of Flavivirus RNA Methylation. Front. Microbiol. 2017;8:2374. doi: 10.3389/fmicb.2017.02374. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 113.Courtney D.G., Chalem A., Bogerd H.P., Law B.A., Kennedy E.M., Holley C.L., Cullen B.R. Extensive Epitranscriptomic Methylation of A and C Residues on Murine Leukemia Virus Transcripts Enhances Viral Gene Expression. mBio. 2019;10 doi: 10.1128/mBio.01209-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 114.Durbin A.F., Wang C., Marcotrigiano J., Gehrke L. RNAs Containing Modified Nucleotides Fail To Trigger RIG-I Conformational Changes for Innate Immune Signaling. mBio. 2016;7:e00833-16. doi: 10.1128/mBio.00833-16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 115.Tomaselli S., Galeano F., Locatelli F., Gallo A. ADARs and the Balance Game between Virus Infection and Innate Immune Cell Response. Curr. Issues Mol. Biol. 2015;17:37–52. [PubMed] [Google Scholar]
- 116.Klimczak L.J., Randall T.A., Saini N., Li J.L., Gordenin D.A. Similarity between mutation spectra in hypermutated genomes of rubella virus and in SARS-CoV-2 genomes accumulated during the COVID-19 pandemic. PLoS ONE. 2020;15:e0237689. doi: 10.1371/journal.pone.0237689. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 117.Perelygina L., Chen M.-H., Suppiah S., Adebayo A., Abernathy E., Dorsey M., Bercovitch L., Paris K., White K.P., Krol A., et al. Infectious vaccine-derived rubella viruses emerge, persist, and evolve in cutaneous granulomas of children with primary immunodeficiencies. PLoS Pathog. 2019;15:e1008080. doi: 10.1371/journal.ppat.1008080. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 118.Smyth R.P., Negroni M., Lever A.M., Mak J., Kenyon J.C. RNA Structure-A Neglected Puppet Master for the Evolution of Virus and Host Immunity. Front. Immunol. 2018;9:2097. doi: 10.3389/fimmu.2018.02097. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 119.Alexandrov L.B., Nik-Zainal S., Wedge D.C., Aparicio S.A.J.R., Behjati S., Biankin A.V., Bignell G.R., Bolli N., Borg A., Børresen-Dale A.-L., et al. Signatures of mutational processes in human cancer. Nature. 2013;500:415–421. doi: 10.1038/nature12477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 120.Roberts S.A., Gordenin D.A. Hypermutation in human cancer genomes: Footprints and mechanisms. Nat. Rev. Cancer. 2014;14:786–800. doi: 10.1038/nrc3816. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 121.Alexandrov L.B., Kim J., Haradhvala N.J., Huang M.N., Ng A.W.T., Wu Y., Boot A., Covington K.R., Gordenin D.A., Bergstrom E.N., et al. The repertoire of mutational signatures in human cancer. Nature. 2020;578:94–101. doi: 10.1038/s41586-020-1943-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 122.Shu Y., McCauley J. GISAID: Global initiative on sharing all influenza data-from vision to reality. Eurosurveillance. 2017;22 doi: 10.2807/1560-7917.ES.2017.22.13.30494. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 123.Azgari C., Kilinc Z., Turhan B., Circi D., Adebali O. The Mutation Profile of SARS-CoV-2 Is Primarily Shaped by the Host Antiviral Defense. bioRxiv. 2021;13:394. doi: 10.3390/v13030394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 124.Khrustalev V.V., Barkovsky E.V. Unusual nucleotide content of Rubella virus genome as a consequence of biased RNA-editing: Comparison with Alphaviruses. Int. J. Bioinform. Res. Appl. 2011;7:82–100. doi: 10.1504/IJBRA.2011.039171. [DOI] [PubMed] [Google Scholar]
- 125.Khrustalev V.V., Khrustaleva T.A., Sharma N., Giri R. Mutational Pressure in Zika Virus: Local ADAR-Editing Areas Associated with Pauses in Translation and Replication. Front. Cell Infect. Microbiol. 2017;7:44. doi: 10.3389/fcimb.2017.00044. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 126.Simmonds P. Rampant C-->U Hypermutation in the Genomes of SARS-CoV-2 and Other Coronaviruses: Causes and Consequences for Their Short- and Long-Term Evolutionary Trajectories. mSphere. 2020;5 doi: 10.1128/mSphere.00408-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 127.Simmonds P., Ansari M.A. Mutation bias implicates RNA editing in a wide range of mammalian RNA viruses. bioRxiv. 2021 doi: 10.1101/2021.02.09.430395. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 128.Jalili P., Bowen D., Langenbucher A., Park S., Aguirre K., Corcoran R.B., Fleischman A.G., Lawrence M.S., Zou L., Buisson R. Quantification of ongoing APOBEC3A activity in tumor cells by monitoring RNA editing at hotspots. Nat. Commun. 2020;11:2971. doi: 10.1038/s41467-020-16802-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 129.Conticello S.G. Creative deaminases, self-inflicted damage, and genome evolution. Ann. N. Y. Acad. Sci. 2012;1267:79–85. doi: 10.1111/j.1749-6632.2012.06614.x. [DOI] [PubMed] [Google Scholar]
- 130.Green A.M., Weitzman M.D. The spectrum of APOBEC3 activity: From anti-viral agents to anti-cancer opportunities. DNA Repair. 2019;83:102700. doi: 10.1016/j.dnarep.2019.102700. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 131.Harris R.S., Dudley J.P. APOBECs and virus restriction. Virology. 2015;479–480:131–145. doi: 10.1016/j.virol.2015.03.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 132.Refsland E.W., Harris R.S. The APOBEC3 family of retroelement restriction factors. Curr. Top. Microbiol. Immunol. 2013;371:1–27. doi: 10.1007/978-3-642-37765-5_1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 133.Chan K., Roberts S.A., Klimczak L.J., Sterling J.F., Saini N., Malc E., Kim J., Kwiatkowski D.J., Fargo D.C., Mieczkowski P., et al. An APOBEC3A hypermutation signature is distinguishable from the signature of background mutagenesis by APOBEC3B in human cancers. Nat. Genet. 2015;47:1067–1072. doi: 10.1038/ng.3378. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 134.Roberts S.A., Lawrence M.S., Klimczak L.J., Grimm S.A., Fargo D., Stojanov P., Kiezun A., Kryukov G., Carter S.L., Saksena G., et al. An APOBEC cytidine deaminase mutagenesis pattern is widespread in human cancers. Nat. Genet. 2013;45:970–976. doi: 10.1038/ng.2702. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 135.Saini N., Sterling J.F., Sakofsky C.J., Giacobone C.K., Klimczak L.J., Burkholder A.B., Malc E.P., Mieczkowski P.A., Gordenin D.A. Mutation signatures specific to DNA alkylating agents in yeast and cancers. Nucleic Acids Res. 2020;48:3692–3707. doi: 10.1093/nar/gkaa150. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 136.Chan K., Roberts S.A., Klimczak L.J., Sterling J.F., Saini N., Malc E., Kim J., Kwiatkowski D.J., Fargo D.C., Mieczkowski P., et al. UV-exposure, endogenous DNA damage, and DNA replication errors shape the spectra of genome changes in human skin. PLoS Genet. 2021;17:e1009302. doi: 10.1371/journal.pgen.1009302. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 137.Eggington J.M., Greene T., Bass B.L. Predicting sites of ADAR editing in double-stranded RNA. Nat. Commun. 2011;2:319. doi: 10.1038/ncomms1324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 138.Wu F., Zhao S., Yu B., Chen Y.-M., Wang W., Song Z.-G., Hu Y., Tao Z.-W., Tian J.-H., Pei Y.-Y., et al. A new coronavirus associated with human respiratory disease in China. Nature. 2020;579:265–269. doi: 10.1038/s41586-020-2008-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 139.Boo S.H., Kim Y.K. The emerging role of RNA modifications in the regulation of mRNA stability. Exp. Mol. Med. 2020;52:400–408. doi: 10.1038/s12276-020-0407-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 140.Costello M., Pugh T.J., Fennell T.J., Stewart C., Lichtenstein L., Meldrim J.C., Fostel J.L., Friedrich D.C., Perrin D., Dionne D., et al. Discovery and characterization of artifactual mutations in deep coverage targeted capture sequencing data due to oxidative DNA damage during sample preparation. Nucleic Acids Res. 2013;41:e67. doi: 10.1093/nar/gks1443. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 141.Di Giorgio S., Martignano F., Torcia M.G., Mattiuz G., Conticello S.G. Evidence for host-dependent RNA editing in the transcriptome of SARS-CoV-2. Sci. Adv. 2020;6:eabb5813. doi: 10.1126/sciadv.abb5813. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 142.Graudenzi A., Maspero D., Angaroni F., Piazza R., Ramazzotti D. Mutational signatures and heterogeneous host response revealed via large-scale characterization of SARS-CoV-2 genomic diversity. iScience. 2021;24:102116. doi: 10.1016/j.isci.2021.102116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 143.Chiara M., D’Erchia A.M., Gissi C., Manzari C., Parisi A., Resta N., Zambelli F., Picardi E., Pavesi G., Horner D.S., et al. Next generation sequencing of SARS-CoV-2 genomes: Challenges, applications and opportunities. Brief. Bioinform. 2020;22:616–630. doi: 10.1093/bib/bbaa297. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 144.Wang Y., Wang D., Zhang L., Sun W., Zhang Z., Chen W., Zhu A., Huang Y., Xiao F., Yao J., et al. Intra-host variation and evolutionary dynamics of SARS-CoV-2 populations in COVID-19 patients. Genome Med. 2021;13:30. doi: 10.1186/s13073-021-00847-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 145.Lythgoe K.A., Hall M., Ferretti L., de Cesare M., MacIntyre-Cockett G., Trebes A., Andersson M., Otecko N., Wise E.L., Moore N., et al. SARS-CoV-2 within-host diversity and transmission. Science. 2021;372:eabg0821. doi: 10.1126/science.abg0821. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 146.Simmonds P. Methods for virus classification and the challenge of incorporating metagenomic sequence data. J. Gen. Virol. 2015;96:1193–1206. doi: 10.1099/vir.0.000016. [DOI] [PubMed] [Google Scholar]
- 147.Boldogkoi Z., Moldovan N., Balazs Z., Snyder M., Tombacz D. Long-Read Sequencing—A Powerful Tool in Viral Transcriptome Research. Trends Microbiol. 2019;27:578–592. doi: 10.1016/j.tim.2019.01.010. [DOI] [PubMed] [Google Scholar]
- 148.Lovestad A.H., Jorgensen S.B., Handal N., Ambur O.H., Aamot H.V. Investigation of intra-hospital SARS-CoV-2 transmission using nanopore whole genome sequencing. J. Hosp. Infect. 2021;111:107–116. doi: 10.1016/j.jhin.2021.02.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 149.Charre C., Ginevra C., Sabatier M., Regue H., Destras G., Brun S., Burfin G., Scholtes C., Morfin F., Valette M., et al. Evaluation of NGS-based approaches for SARS-CoV-2 whole genome characterisation. Virus Evol. 2020;6:veaa075. doi: 10.1093/ve/veaa075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 150.Li J., Wang H., Mao L., Yu H., Yu X., Sun Z., Qian X., Cheng S., Chen S., Chen J., et al. Rapid genomic characterization of SARS-CoV-2 viruses from clinical specimens using nanopore sequencing. Sci. Rep. 2020;10:17492. doi: 10.1038/s41598-020-74656-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 151.Karst S.M., Ziels R.M., Kirkegaard R.H., Sørensen E.A., McDonald D., Zhu Q., Knight R., Albertsen M. High-accuracy long-read amplicon sequences using unique molecular identifiers with Nanopore or PacBio sequencing. Nat. Methods. 2021;18:165–169. doi: 10.1038/s41592-020-01041-y. [DOI] [PubMed] [Google Scholar]
- 152.Fitzsimmons W., Woods R.J., McCrone J.T., Woodman A., Arnold J.J., Yennawar M., Evans R., Cameron C.E., Lauring A.S. A speed-fidelity trade-off determines the mutation rate and virulence of an RNA virus. PLoS Biol. 2018;16:e2006459. doi: 10.1371/journal.pbio.2006459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 153.Schaaper R.M. Mechanisms of mutagenesis in the Escherichia coli mutator mutD5: Role of DNA mismatch repair. Proc. Natl. Acad. Sci. USA. 1988;85:8126–8130. doi: 10.1073/pnas.85.21.8126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 154.Schaaper R.M. Base selection, proofreading, and mismatch repair during DNA replication in Escherichia coli. J. Biol. Chem. 1993;268:23762–23765. doi: 10.1016/S0021-9258(20)80446-3. [DOI] [PubMed] [Google Scholar]
- 155.Morrison A., Johnson A.L., Johnston L.H., Sugino A. Pathway correcting DNA replication errors in Saccharomyces cerevisiae. EMBO J. 1993;12:1467–1473. doi: 10.1002/j.1460-2075.1993.tb05790.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 156.Robson F., Khan K.S., Le T.K., Paris C., Demirbag S., Barfuss P., Rocchi P., Ng W.-L. Coronavirus RNA Proofreading: Molecular Basis and Therapeutic Targeting. Mol. Cell. 2020;79:710–727. doi: 10.1016/j.molcel.2020.07.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 157.Yan L.W.L., Zaher H.S. How do cells cope with RNA damage and its consequences? J. Biol. Chem. 2019;294:15158–15171. doi: 10.1074/jbc.REV119.006513. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 158.Mazina O.M., Somarowthu S., Kadyrova L.Y., Baranovskiy A.G., Tahirov T.H., Kadyrov F.A., Mazin A.V. Replication protein A binds RNA and promotes R-loop formation. J. Biol. Chem. 2020;295:14203–14213. doi: 10.1074/jbc.RA120.013812. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 159.Chatterjee N., Walker G.C. Mechanisms of DNA damage, repair, and mutagenesis. Environ. Mol. Mutagen. 2017;58:235–263. doi: 10.1002/em.22087. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 160.Vaisman A., Woodgate R. Translesion DNA polymerases in eukaryotes: What makes them tick? Crit. Rev. Biochem. Mol. Biol. 2017;52:274–303. doi: 10.1080/10409238.2017.1291576. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 161.Jebb D., Huang Z., Pippel M., Hughes G.M., Lavrichenko K., Devanna P., Winkler S., Jermiin L.S., Skirmuntt E.C., Katzourakis A., et al. Six reference-quality genomes reveal evolution of bat adaptations. Nature. 2020;583:578–584. doi: 10.1038/s41586-020-2486-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 162.Uriu K., Kosugi Y., Ito J., Sato K. The Battle between Retroviruses and APOBEC3 Genes: Its Past and Present. Viruses. 2021;13:124. doi: 10.3390/v13010124. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 163.Nik-Zainal S., Kucab J.E., Morganella S., Glodzik D., Alexandrov L.B., Arlt V.M., Weninger A., Hollstein M., Stratton M.R., Phillips D.H. The genome as a record of environmental exposure. Mutagenesis. 2015;30:763–770. doi: 10.1093/mutage/gev073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 164.Riva L., Pandiri A.R., Li Y.R., Droop A., Hewinson J., Quail M.A., Iyer V., Shepherd R., Herbert R.A., Campbell P.J., et al. The mutational signature profile of known and suspected human carcinogens in mice. Nat. Genet. 2020;52:1189–1197. doi: 10.1038/s41588-020-0692-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 165.Kucab J.E., Zou X., Morganella S., Joel M., Nanda S., Nagy E., Gomez C., Degasperi A., Harris R., Jackson S.P., et al. A Compendium of Mutational Signatures of Environmental Agents. Cell. 2019;177:821–836.e816. doi: 10.1016/j.cell.2019.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]