Skip to main content
Genetics and Molecular Biology logoLink to Genetics and Molecular Biology
. 2023 Jun 19;46(2):e20230005. doi: 10.1590/1678-4685-GMB-2023-0005

The SARS-CoV-2 mutation landscape is shaped before replication starts

Diego Masone 1,2, Maria Soledad Alvarez 3, Luis Mariano Polo 1
PMCID: PMC10280802  PMID: 37338301

Abstract

Mutation landscapes and signatures have been thoroughly studied in SARS-CoV-2. Here, we analyse those patterns and link their changes to the viral replication tissue in the respiratory tract. Surprisingly, a substantial difference in those patterns is observed in samples from vaccinated patients. Hence, we propose a model to explain where those mutations could originate during the replication cycle.

Keywords: SBS spectra, viral replication niche, COVID19 vaccination status


Modifications in the mutation landscape of a genomic sequence can result through several mechanisms (Kucab et al., 2019), such as error-prone polymerases, metabolism, and damaging agents, as an unbalanced redox environment. The comprehensive analysis of the SARS-CoV-2 interhost single base substitution (SBS) showed a mutational spectrum dominated by C>U and, surprisingly, G>U substitutions (Di Giorgio et al., 2020; Panchin and Panchin, 2020; Popa et al., 2020; De Maio et al., 2021). Here we extend those studies to elucidate the impact of the replication niche and vaccination status on that pattern.

The SBS spectrum of SARS-CoV-2 from patients infected with alpha and delta variants was calculated (see methods in S1), confirming that it is dominated by C>U and G>U substitutions, followed by G>A and A>G (Figure 1A). Transition-type SBS -the interchanges between purines (C>U and U>C) or pyrimidines (G>A and A>G)- were expected to be the most frequent, as they can result from the activity of antiviral enzymes such as APOBEC and ADAR deaminases in host cells (Di Giorgio et al., 2020; Liu et al., 2021; Li et al., 2022). In contrast, G>U transversions, particularly prevalent in SARS-CoV-2 (Forni et al., 2022), can result from stochastic processes, such as the misincorporation of nucleotides by an error-prone polymerase with a specific bias or the chemical modification of RNA. Those hypotheses have been discussed previously (Panchin and Panchin, 2020; De Maio et al., 2021; Mourier et al., 2021; Rice et al., 2021), and it is widely agreed that G>U transversion is caused by mutagen exposure, like oxidation due to reactive oxygen species (ROS). The process begins with the oxidation of a guanine base to produce 8-oxoguanine (8-oxoG). Like guanine, 8-oxoG can pair with cytosine; however, it can also pair with adenine (Figure 1B). Exceptionally, if 8-oxoG pairs with adenine during the first cycle of viral RNA replication, it can be substituted by uracil in the second replication cycle (Graudenzi et al., 2021).

Figure 1 - . The environment of infected cells alters the G>U substitution incidence in SARS-CoV-2. (A) Analysis on SBS proportions across alpha and delta variants from unvaccinated (samples collected worldwide before 15th of January 2021, when less than 0.5% of the world population was vaccinated) and vaccinated patients. (B) Diagrammatic representation of standard (Watson and Crick) pairing of guanine and cytosine (top panel) and Hoogsteen base pairing between 8-oxoguanine and adenine (bottom panel). (C) Comparison of SBS spectra of different variants from vaccinated patients. Error bars denote confidence intervals (CI). (D) Proportion of G>U and C>A within the variants, coloured as in (C). The dotted line denotes no asymmetry (ratio=1).

Figure 1 -

Here, we considered how that misincorporation could occur during the intracellular life cycle of SARS-CoV-2. Various external mechanisms can explain modifications in the redox balance in infected cells, with the immune system as the prime suspect (Laforge et al., 2020). Therefore, if the immune system were indeed responsible for the changes in the oxidative environment of the infected cells (Laforge et al., 2020; Paludan and Mogensen, 2022), differences would be expected between SBS spectra from unvaccinated and vaccinated patients (Collier et al., 2021; Szczepanek et al., 2022). Thus, we analysed samples from patients infected with alpha and delta variants divided into unvaccinated and vaccinated groups. Remarkably, G>U transversion was significantly altered (Figure 1A), sustaining a possible role of the immunological responses on the oxidative nature of those mutations. Moreover, the immune cells and those regulating their functions vary through the respiratory tract (Boers et al., 1998; Boers et al., 1999). Thus, SBS patterns from those lineages infecting only part of the respiratory tract should differ from those that can infect the whole tract. For example, omicron subvariants (BA.1 and BA.2) mainly replicate in the upper respiratory tract (Meng et al., 2022), which reflects in a significant decrease in the G>U/C>A ratio when compared to alpha and delta, which can also replicate in the lower respiratory tract (Figure 1C,D).

We hypothesised two scenarios where the nucleotide mispairing could occur when viral RNA (vRNA) is outside or inside double-membrane vesicles (DMVs), leading to different substitution patterns (Figure 2A,B). SARS-CoV-2 contains a positive non-segmented RNA genome [(+)vRNA]. Its replication comprises the early translation of a large polypeptide, then cleaved to produce the RNA-dependent RNA polymerase (RdRp). Both (+)vRNA and RdRp are compartmentalised into DMVs (Klein et al., 2020), avoiding the action of nucleases during vRNA replication (Mendonça et al., 2021). vRNA is then processed through double-stranded RNA intermediates in a sophisticated manner involving (+)vRNA and (-)vRNA. Nevertheless, some vRNA molecules generated inside DMVs are transported to the cytoplasm to produce viral structural proteins. In this scenario, G>U and C>A should have similar magnitudes if the mispairing occurs during replication inside DMVs (Figure 2A). However, G>U substitutions prevail over C>A (Figure 1A,C,D), favouring the theory where the mispairing happens before the vRNA is enclosed into a DMV (Figure 2B). Subsequently, the asymmetry between G>U and C>A transversions can be explained by inferring that guanine oxidation occurs mainly outside DMVs (Figure 2A), so compartmentalisation can play a role in decreasing the exposure of vRNA to the oxidative environment, protecting it from ROS action. Other two pairs of substitutions show asymmetry in their patterns, G>C/C>G and G>U/C>A (Figure 2C). The first of those pairs can be the product of ROS effect over guanine yielding imidazolone (Kino and Sugiyama, 2001), and the second one (G>U/G>A) could be mainly produced by the enzymatic activity of antiviral systems, as uracil is the outcome of cytidine deamination. Therefore, both asymmetries are explained by the protective role of compartmentalisation of the replicative machinery into DMVs. Remarkably, other coronaviruses shield their replication processes and machinery using DMVs (Miller and Krijnse-Locker, 2008). Consequently, it is unsurprising that the unbalance between those pairs of substitutions was also observed in MERS-CoV (Di Giorgio et al., 2020).

Figure 2 - . The influence of compartmentalisation on SBS patterns generated by oxidation. Differential pattern caused by mutagen exposure of vRNA guanines inside (A) or outside (B) double-membrane vesicles (DMVs), where vRNA replicates. Nucleotides circled in orange denote mutations that will occur in that scenario, while those in black mark the final product of the process. DMVs are delimited by double green lines. G* indicates 8-oxoguanine. (C) Proportion of C>U/G>A and G>C/C>G within the variants, coloured as in Figure 1C.

Figure 2 -

Additional studies are needed to elucidate in detail the mechanisms driving viral mutation patterns and how that drives the evolution of new SARS-CoV-2 strains. Particularly, if vaccines could cause novel strains appearance or to affect viral fitness through those mutations, further investigations are warranted to uncover how to manipulate that effect favouring their efficacy.

Acknowledgements

The authors thank Dr Stuart Rulten, Prof Luis Mayorga and Prof Claudia Tomes for their critical and useful comments while conducting this study, and Dr Christopher Ruis (MRC-LMB, Cambridge, UK) and Dr Angie S. Hinrichs (University of California Santa Cruz, Santa Cruz, CA, USA), for their technical support. We gratefully acknowledge all data contributors, i.e., the Authors and their Originating laboratories responsible for obtaining the specimens, and their Submitting laboratories for generating the genetic sequence and metadata and sharing via the GISAID Initiative, on which this research is based. Funding: M.S.A. is a postdoctoral fellow of CONICET. This work was supported by grants from ANPCyT PICT2019-01889 and CONICET-PIP3195 to L.M.P.; ANPCyT PICT2020-01897 and CONICET-PIP0409 to D.M.

Supplementary material.

The following online material is available for this article:

Data S1 -. Material and methods.

References

  1. Boers JE, Ambergen AW, Thunnissen FB. Number and proliferation of basal and parabasal cells in normal human airway epithelium. Am J Respir Crit Care Med. 1998;157:2000–2006. doi: 10.1164/ajrccm.157.6.9707011. [DOI] [PubMed] [Google Scholar]
  2. Boers JE, Ambergen AW, Thunnissen FB. Number and proliferation of clara cells in normal human airway epithelium. Am J Respir Crit Care Med. 1999;159:1585–1591. doi: 10.1164/ajrccm.159.5.9806044. [DOI] [PubMed] [Google Scholar]
  3. Collier AY, Yu J, McMahan K, Liu J, Chandrashekar A, Maron JS, Atyeo C, Martinez DR, Ansel JL, Aguayo R, et al. Differential kinetics of immune responses elicited by Covid-19 Vaccines. N Engl J Med. 2021;385:2010–2012. doi: 10.1056/NEJMc2115596. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. De Maio N, Walker CR, Turakhia Y, Lanfear R, Corbett-Detig R, Goldman N. Mutation rates and selection on synonymous mutations in SARS-CoV-2. Genome Biol Evol. 2021;13:evab087. doi: 10.1093/gbe/evab087. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Di Giorgio S, Martignano F, Torcia MG, Mattiuz G, Conticello SG. Evidence for host-dependent RNA editing in the transcriptome of SARS-CoV-2. Sci Adv. 2020;6:eabb5813. doi: 10.1126/sciadv.abb5813. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Forni D, Cagliani R, Pontremoli C, Clerici M, Sironi M. The substitution spectra of coronavirus genomes. Brief Bioinform. 2022;23:bbab382. doi: 10.1093/bib/bbab382. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Graudenzi A, Maspero D, Angaroni F, Piazza R, Ramazzotti D. Mutational signatures and heterogeneous host response revealed via large-scale characterization of SARS-CoV-2 genomic diversity. iScience. 2021;24:102116. doi: 10.1016/j.isci.2021.102116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Kino K, Sugiyama H. Possible cause of G-C-->C-G transversion mutation by guanine oxidation product, imidazolone. Chem Biol. 2001;8:369–378. doi: 10.1016/s1074-5521(01)00019-9. [DOI] [PubMed] [Google Scholar]
  9. Klein S, Cortese M, Winter SL, Wachsmuth-Melm M, Neufeldt CJ, Cerikan B, Stanifer ML, Boulant S, Bartenschlager R, Chlanda P. SARS-CoV-2 structure and replication characterized by in situ cryo-electron tomography. Nat Commun. 2020;11:5885. doi: 10.1038/s41467-020-19619-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Kucab JE, Zou X, Morganella S, Joel M, Nanda AS, Nagy E, Gomez C, Degasperi A, Harris R, Jackson SP, et al. A compendium of mutational signatures of environmental agents. Cell. 2019;177:821–836.:e816. doi: 10.1016/j.cell.2019.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Laforge M, Elbim C, Frere C, Hemadi M, Massaad C, Nuss P, Benoliel JJ, Becker C. Tissue damage from neutrophil-induced oxidative stress in COVID-19. Nat Rev Immunol. 2020;20:515–516. doi: 10.1038/s41577-020-0407-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Li J, Du P, Yang L, Zhang J, Song C, Chen D, Song Y, Ding N, Hua M, Han K, et al. Two-step fitness selection for intra-host variations in SARS-CoV-2. Cell Rep. 2022;38:110205. doi: 10.1016/j.celrep.2021.110205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Liu R, Wu P, Ogrodzki P, Mahmoud S, Liang K, Liu P, Francis SS, Khalak H, Liu D, Li J, et al. Genomic epidemiology of SARS-CoV-2 in the UAE reveals novel virus mutation, patterns of co-infection and tissue specific host immune response. Sci Rep. 2021;11:13971. doi: 10.1038/s41598-021-92851-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Mendonça L, Howe A, Gilchrist JB, Sheng Y, Sun D, Knight ML, Zanetti-Domingues LC, Bateman B, Krebs AS, Chen L, et al. Correlative multi-scale cryo-imaging unveils SARS-CoV-2 assembly and egress. Nat Commun. 2021;12:4629. doi: 10.1038/s41467-021-24887-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Meng B, Abdullahi A, Ferreira I, Goonawardane N, Saito A, Kimura I, Yamasoba D, Gerber PP, Fatihi S, Rathore S, et al. Altered TMPRSS2 usage by SARS-CoV-2 Omicron impacts infectivity and fusogenicity. Nature. 2022;603:706–714. doi: 10.1038/s41586-022-04474-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Miller S, Krijnse-Locker J. Modification of intracellular membrane structures for virus replication. Nat Rev Microbiol. 2008;6:363–374. doi: 10.1038/nrmicro1890. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Mourier T, Sadykov M, Carr MJ, Gonzalez G, Hall WW, Pain A. Host-directed editing of the SARS-CoV-2 genome. Biochem Biophys. Res Commun. 2021;538:35–39. doi: 10.1016/j.bbrc.2020.10.092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Paludan SR, Mogensen TH. Innate immunological pathways in COVID-19 pathogenesis. Sci Immunol. 2022;7:eabm5505. doi: 10.1126/sciimmunol.abm5505. [DOI] [PubMed] [Google Scholar]
  19. Panchin AY, Panchin YV. Excessive G-U transversions in novel allele variants in SARS-CoV-2 genomes. PeerJ. 2020;8:e9648. doi: 10.7717/peerj.9648. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Popa A, Genger JW, Nicholson MD, Penz T, Schmid D, Aberle SW, Agerer B, Lercher A, Endler L, Colaco H, et al. Genomic epidemiology of superspreading events in Austria reveals mutational dynamics and transmission properties of SARS-CoV-2. Sci Transl Med. 2020;12:eabe2555. doi: 10.1126/scitranslmed.abe2555. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Rice AM, Castillo Morales A, Ho AT, Mordstein C, Muhlhausen S, Watson S, Cano L, Young B, Kudla G, Hurst LD. Evidence for strong mutation bias toward, and selection against, U content in SARS-CoV-2: Implications for vaccine design. Mol Biol Evol. 2021;38:67–83. doi: 10.1093/molbev/msaa188. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Szczepanek J, Skorupa M, Goroncy A, Jarkiewicz-Tretyn J, Wypych A, Sandomierz D, Jarkiewicz-Tretyn A, Dejewska J, Ciechanowska K, Palgan K, et al. Anti-SARS-CoV-2 IgG against the S Protein: A Comparison of BNT162b2, mRNA-1273, ChAdOx1 nCoV-2019 and Ad26.COV2.S Vaccines. Vaccines (Basel) 2022;10:99. doi: 10.3390/vaccines10010099. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Data S1 -. Material and methods.

Articles from Genetics and Molecular Biology are provided here courtesy of Sociedade Brasileira de Genética

RESOURCES