Abstract
Wild mammalian species, including bats, constitute the natural reservoir of betacoronavirus (including SARS, MERS, and the deadly SARS-CoV-2). Different hosts or host tissues provide different cellular environments, especially different antiviral and RNA modification activities that can alter RNA modification signatures observed in the viral RNA genome. The zinc finger antiviral protein (ZAP) binds specifically to CpG dinucleotides and recruits other proteins to degrade a variety of viral RNA genomes. Many mammalian RNA viruses have evolved CpG deficiency. Increasing CpG dinucleotides in these low-CpG viral genomes in the presence of ZAP consistently leads to decreased viral replication and virulence. Because ZAP exhibits tissue-specific expression, viruses infecting different tissues are expected to have different CpG signatures, suggesting a means to identify viral tissue-switching events. The author shows that SARS-CoV-2 has the most extreme CpG deficiency in all known betacoronavirus genomes. This suggests that SARS-CoV-2 may have evolved in a new host (or new host tissue) with high ZAP expression. A survey of CpG deficiency in viral genomes identified a virulent canine coronavirus (alphacoronavirus) as possessing the most extreme CpG deficiency, comparable with that observed in SARS-CoV-2. This suggests that the canine tissue infected by the canine coronavirus may provide a cellular environment strongly selecting against CpG. Thus, viral surveys focused on decreasing CpG in viral RNA genomes may provide important clues about the selective environments and viral defenses in the original hosts.
Keywords: SARS-CoV-2, viral evolution, canine intestine, zinc finger antiviral protein, COVID-19
Coronaviruses (CoVs) evolve in mammalian hosts and carry genomic signatures of their host-specific environment, especially the host-specific antiviral and RNA modification activities. Many pathogenic single-stranded RNA viruses, including CoVs, exhibit strong CpG deficiency (Yap et al. 2003; Greenbaum et al. 2008, 2009; Atkinson et al. 2014; Takata et al. 2017). Two mammalian enzymes are inferred to contribute to the observed CpG deficiency. The zinc finger antiviral protein (ZAP, known as ZC3HAV1 in mammals or hZAP in human), a key component in mammalian interferon-mediated immune response, binds specifically to CpG dinucleotides in viral RNA genomes via its RNA-binding domain (Meagher et al. 2019). ZAP inhibits viral replication and mediates viral genome degradation (Takata et al. 2017; Ficarelli et al. 2019, 2020; Meagher et al. 2019). ZAP has two isoforms (ZAP-L and ZAP-S); both participate in initiating antiviral activities but only ZAP-S mediates the return to homeostasis after the antiviral response (Schwerk et al. 2019). ZAP acts against not only retroviruses, such as HIV-1 (Ficarelli et al. 2019, 2020), but also Echovirus 7 (Odon et al. 2019) and Zika virus (Trus et al. 2020), both being positive-sense single-stranded RNA viruses like CoVs. In particular, selection against CpG in viral RNA disappears in ZAP-deficient cells (Takata et al. 2017), suggesting that ZAP may be the only cellular agent targeting CpG in viral RNA genomes.
Experimental evidence is consistent with the interpretation that CpG deficiency in RNA viruses has evolved in response to these cytoplasmic CpG-specific antiviral activities. During natural evolution of HIV-1 within individual patients, viral fitness decreased with increasing CpG dinucleotides (Theys et al. 2018). Experimental increase of CpG dinucleotides in CpG-deficient viral genomes consistently leads to strong decrease in viral replication and virulence (Burns et al. 2009; Tulloch et al. 2014; Antzin-Anduetza et al. 2017; Fros et al. 2017; Wasson et al. 2017; Trus et al. 2020), prompting the proposal of vaccine-development strategies involving increasing CpG to attenuate pathogenic RNA viruses (Burns et al. 2009; Tulloch et al. 2014; Trus et al. 2020; Ficarelli et al. 2020).
Another antiviral enzyme is APOBEC3G, found in innate immune cells. APOBEC3G was originally thought specific to single-stranded DNA, such as reverse-transcribed HIV-1, but is now known to modify a variety of RNA viruses, deaminating C to U (Sharma et al. 2015, 2016, 2019). This would be effective against RNA viruses if the deaminated sites are functionally important. APOBEC3G copurifies with highly edited mRNA substrates (Sharma et al. 2016) and therefore could act on CoV genomes which are positive-sense single-stranded RNA. Although APOBEC3G is not strongly CpG-specific, it could contribute to CpG deficiency when coupled with ZAP-mediated antiviral activities targeting CpG. Modification of CpG to UpG in nonfunctional regions could reduce viral susceptibility to CpG-mediated attack by ZAP relative to viruses with unmodified CpG dinucleotides.
Both ZAP and APOBEC3G exhibit tissue-specific expression patterns in human (Fagerberg et al. 2014). Both are expressed in lungs, but ZAP is the most highly expressed where lymphocytes are the most abundant (bone marrow, lymph node, appendix, and spleen), whereas APOBEC3G is the most highly expressed in lymph node, spleen, and testis (Fagerberg et al. 2014). A severely CpG deficient virus may indicate an evolutionary history in ZAP-abundant tissues, such as strongly CpG-deficient HIV-1, infecting host T cells in lymph organs where ZAP is abundant (Fagerberg et al. 2014). The presence of such viruses indicates that they have found ways to evade ZAP-mediated cellular antiviral defense.
The differential expression of ZAP and APOBEC3G in different host or host tissues is expected to leave different genomic signatures on viral RNA genomes. We may use the conventional index of CpG deficiency (Cardon et al. 1994; Karlin et al. 1997) implemented in DAMBE (Xia 2018):
(1) |
The index is expected to be 1, with no deficiency or excess; <1, if deficient; and >1, if excess. The 1252 betacoronavirus (BetaCoV) full-length genomes deposited in GenBank (of which 1,127 are unique) have mean±SE value of 0.516 ± 0.0017 for ICpG, which is significantly (P < 0.0001) smaller than their null expectation of 1.
If a CoV infects a different host tissue with different ZAP abundance, then its RNA genome will experience different selection pressure against its CpG. This difference in cellular antiviral activity would result in differences in ICpG during viral genomic evolution. In contrast, a CoV infecting a specific host tissue for a long time would experience the same cellular antiviral and RNA modification environment and is consequently expected to have similar and stable ICpG.
Results
SARS-CoV-2 and its most closely related known relative (BatCoV RaTG13) have the lowest ICpG among its close relatives, both being outliers in a plot of viral genomic ICpG vs. GC% (fig. 1). Three groups of BetaCoV most closely related to SARS-CoV-2 are represented in figure 1. Group 1 consists of genomes of human CoV-HKU1 which is found only in human (Dominguez et al. 2012) and circulates among human populations without any dependence on other mammalian species as intermediate or reservoir species. Group 2 includes BetaCoV 1 genomes with two types of hosts: 1) ungulates (with bovine and equine CoV as well as porcine hemagglutinating encephalomyelitis virus); and 2) human, with CoV-OC43 being a recent derivative of bovine CoVs (Hulswit et al. 2019). Group 3 are all SARS-related CoVs from three types of hosts: 1) Rhinolophus bats which serve as a natural reservoir of SARS-related CoVs (Li et al. 2005; Wu, Yang, Ren, He, et al. 2016; Wu, Yang, Ren, Zhang, et al. 2016) and the new SARS-CoV-2 (Zhou et al. 2020), 2) civets (from which CoV genomes with 99.6% identity to SARS virus genomes were identified; Shi and Hu, 2008), and 3) human patients infected by SARS-CoV-2. Figure 1 shows that genomic GC% and ICpG can differ among different viral lineages in the same host or among different hosts for the same viral lineage.
The most striking pattern in figure 1 is an isolated but dramatic shift in the lineage leading to BatCoV RaTG13 which was reported (Zhou et al. 2020) to be sampled from a bat (Rhinolophus affinis) in Yunnan Province in 2013 but only sequenced by Wuhan Institute of Virology after the outbreak of SARS-CoV-2 infection in late 2019. This bat CoV genome is the closest phylogenetic relative of SARS-CoV-2 (Zhou et al. 2020), sharing 96% sequence similarity. Many studies have shown an association between decreased CpG (low ICpG) in viral RNA genomes and increased virulence, not only in HIV evolving within individual patients (Theys et al. 2018) but also in experimentally CpG dinucleotide-enriched viral genomes (Burns et al. 2009; Tulloch et al. 2014; Antzin-Anduetza et al. 2017; Fros et al. 2017; Wasson et al. 2017; Trus et al. 2020). The association between decreased CpG and increased virulence in RNA viruses is mainly due to interferon-induced ZAP protein which binds to CpG dinucleotides in viral RNA genomes by its RNA-binding domain (Meagher et al. 2019), inhibits viral replication, and facilitates viral genome degradation (Takata et al. 2017; Ficarelli et al. 2019, 2020; Meagher et al. 2019). Thus, a decreased ICpG in a viral pathogen suggests an increased threat to public health, but an increased ICpG decreases the threat because such viral pathogens, with increased ICpG and reduced virulence, would be akin to natural vaccines. Many viral researchers have in fact proposed vaccine development by increasing CpG in viral RNA genomes (Burns et al. 2009; Tulloch et al. 2014; Trus et al. 2020; Ficarelli et al. 2020).
In this context, it is unfortunate that BatCoV RaTG13 was not sequenced in 2013; otherwise, the downshifting in ICpG might have served as a warning due to two highly significant implications. First, the virus likely evolved in a tissue with high ZAP expression which favors viral genomes with a low ICpG. Second and more importantly, survival of the virus indicates that it has successfully evaded ZAP-mediated antiviral defense. In other words, the virus has become stealthy and dangerous to humans.
The ICpG value for BatCoV RaTG13 genome is 0.40875, much lower than ICpG values observed in all other BetaCoV genomes sampled from bat species in the genus Rhinolophus. There are 56 BetaCoV genomes sampled from Rhinolophus bats inhabiting south and southeastern Asia (but mostly from central and southern China). Nature had essentially inoculated BetaCoVs into various Rhinolophus lineages and allowed genomic evolution to happen (supplementary fig. 1, Supplementary Material online). Although genomic ICpG values have fluctuated in different viral lineages, only BatCoV RaTG13 has been observed to possess an extraordinarily low ICpG (supplementary fig. 1, Supplementary Material online). This suggests that the ancestor of BatCoV RaTG13 and SARS-CoV-2 may have evolved in a mammalian tissue with high expression of ZAP and emerged with an unusually low ICpG. This mammalian tissue likely is not in Rhinolophus bats because low ICpG has not been observed in other BatCoV lineages (supplementary fig. 1, Supplementary Material online). Identifying a virus with comparably low ICpG would suggest candidate host species possessing tissues with cellular environments that select strongly against CpG in viral genomes.
Among all BetaCoVs available in GenBank on February 3, 2020, there are 1,127 unique genomes of which 927 genomes have explicit host designations (supplementary file Betacoronavirus_CpG.xlsx, Supplementary Material online). Surprisingly, no available BetaCoV genome from diverse natural hosts has a genomic ICpG and GC% combination close to that observed in SARS-CoV-2 and BatCoV RaTG13 (fig. 2). BetaCoV lineages parasitizing Rhinolophus bats overall have relatively low ICpG values (fig. 2).
BetaCoV infecting dromedary camels offers a weak hint that camel digestive system may select more strongly against CpG in viral genomes than camel respiratory system. Camel CoVs form two clusters. One cluster overlaps completely with MERS viruses (fig. 2) that infect mammalian respiratory system (Fehr and Perlman 2015; Li 2016). The other cluster includes camel CoV HKU23 strains positioned close to bovine CoV (grouped under “Ungulate_CoV” in figs. 1 and 2), both belonging to Embecovirus and infecting mainly mammalian digestive system but also respiratory systems (Athanassious et al. 1994; Fulton et al. 2015; Ribeiro et al. 2016; Symes et al. 2018; Chae et al. 2019). Those viruses infecting camel digestive system have lower genomic ICpG and GC% than those infecting camel respiratory system (fig. 2).
To search for a mammalian host with the potential to select viral lineages with low ICpG values, I expanded the search to include all complete alphacoronavirus (AlphaCoV) genomes (supplementary file alphacoronavirus_CpG.xlsx, Supplementary Material online). All complete AlphaCoV genomes (>27,000 nt) with explicit host information are plotted in ICpG and GC content in figure 3. Five points are worth highlighting. First, only genomes from canine coronaviruses (CCoVs), which had caused a highly contagious intestinal disease worldwide in dogs (Pratelli 2006; Le Poder 2011), have genomic ICpG and GC% values similar to those observed in SARS-CoV-2 and BatCoV RaTG13 (fig. 3A). The genome (accession no. KP981644) is from the most virulent pantropic CCoV invading multiple canine organs (Buonavoglia et al. 2006; Decaro et al. 2007; Zappulli et al. 2008). It belongs to a clade with the lowest observed ICpG values (fig. 3B).
Second, canids, like camels, also have CoVs infecting their respiratory system (canine respiratory CoV or CRCoV belonging to BetaCoV). There are two genomes sequenced for CRCoV (accession nos. JX860640 and KX432213). Their genomic ICpG values are 0.4756 and 0.4684, respectively, substantially higher than those for CCoVs infecting the digestive system (fig. 3A). Thus, similar to the pattern observed in CoVs infecting camels, CCoVs infecting canine digestive system have ICpG much lower than CRCoVs infecting canine respiratory system.
Third, none of the available AlphaCoV genomes from bats or other mammalian host species possess genomic ICpG and GC% values similar to those observed in SARS-CoV-2 and BatCoV RaTG13 (fig. 3). Thus, although AlphaCoV infects a diverse array of bat lineages, these bat tissues do not seem to generate AlphaCoV strains with low ICpG values comparable with SARS-CoV-2 and BatCoV RaTG13.
Fourth, I want to highlight one data point involving a CCoV genome represented as a green dot in figure 3 (highlighted by a green arrow in fig. 3A, genome accession KC175339). The CCoV has a genomic GC% of 38.17% and ICpG of 0.4986, much higher than the rest. The virus was originally isolated from a dog but had been propagated extensively in cell culture before being sequenced (Whittaker GR, personal communication). Viruses are propagated in cells that expresses the right cellular receptor for viral entry, but do not mount an immune response to kill the virus or get killed by the virus (Benfield and Saif 1990; Banerjee et al. 2019). The consequent relaxation of selection against the virus (and against CpG in the CCoV genome) in cell culture would allow CpG in the viral RNA genome to rebound through mutation, which would explain the increased ICpG (KC175339 in the phylogeny in fig. 3B). This process of regaining CpG is reminiscent of CpG-specific methylation in Mycoplasma species where CpG was regained when some lineages lost CpG-specific methyltransferases, with a fast-evolving lineage (M. pneumoniae) regaining CpG faster than a slow-evolving lineage (M. genitalium; Xia 2003). This rapid change in ICpG with environmental change as shown in figure 3B has two important implications. First, it suggests the feasibility of tracking certain host-switching or tissue-switching events (which would be impossible if it takes hundreds of years for a virus to change ICpG). Second, many experimental studies (Burns et al. 2009; Tulloch et al. 2014; Odon et al. 2019; Trus et al. 2020; Ficarelli et al. 2020) have demonstrated attenuated virulence in RNA viruses with increasing CpG in the viral RNA genome and suggested this as an efficient means of vaccine development. The observed increase in ICpG in the CCoV genome through cell culture propagation shows a simple way of increasing CpG by simply propagating the virus without selection against CpG dinucleotides in the viral genome.
Fifth, the cellular receptor for SARS-CoV-2 entry into the cell is angiotensin I converting enzyme 2 (ACE2; Zhou et al. 2020). ACE2 is pervasively expressed in human digestive system, at the highest levels in small intestine and duodenum (fig. 3C), with relatively low expression in lung (Fagerberg et al. 2014). This suggests that mammalian digestive system is likely to be infected by CoVs. This is consistent with the interpretation that the low ICpG in SARS-CoV-2 was acquired by the ancestor of SARS-CoV-2 evolving in mammalian digestive system. The interpretation is further corroborated by a recent report that a high proportion of COVID-19 patients also suffer from digestive discomfort (Pan et al. 2020). In fact, 48.5% presented with digestive symptoms as their chief complaint.
Humans are the only other host species observed to produce CoV genomes with low-genomic ICpG values, as shown by the cluster of human AlphaCoV NL63 genomes (fig. 3). This virus mainly not only infects the respiratory system but also causes digestive problems in 33% of the patients reporting respiratory problems (Vabret et al. 2005). In a comprehensive study of the first 12 COVID-19 patients in the United States (Midgley and The COVID-19 Investigation Team, 2020), one patient reported diarrhea as the initial symptom before developing fever and cough (Midgley and The COVID-19 Investigation Team, 2020). Stool samples from 7 out of 10 patients tested positive for SARS-CoV-2, including 3 patients with diarrhea (Midgley and The COVID-19 Investigation Team, 2020), corroborating a previous report of SARS-CoV-2 detection in stool (Holshue et al. 2020). In particular, live SARS-CoV-2 virus was isolated from stool of a COVID-19 patient (Zhang et al. 2020). In this context, it is significant that BatCoV RaTG13, as documented in its genomic sequence in GenBank (MN996532), was isolated from a fecal swab. These observations are consistent with the hypothesis that SARS-CoV-2 has evolved in mammalian intestine or tissues associated with intestine.
Figures 1 and 2 do not include all BetaCoVs from all hosts, so BetaCoVs from other mammalian species may also possess low ICpG values. One example is viruses isolated from pangolins. Nine SARS-CoV-2-like genomes have recently been isolated and sequenced from pangolin and deposited in GISAID database (gisaid.org). The one with the highest sequence coverage (GISAID ID: EPI_ISL_410721) has an ICpG value of 0.3929, close to the extreme low end of ICpG values observed among available SARS-CoV-2 genomes. Thus, SARS-CoV-2, BatCoV RaTG13, and those from pangolin may either have a common ancestor with a low ICpG or have convergently evolved low ICpG values.
Other than ZAP and ABOBEC3G, the enigmatic DNA methyltransferase2 (Dnmt2; Okano et al. 1998a, 1998b; Dong et al. 2001), originally thought to be a Dnmt, may also contribute to viral RNA modification. However, Dnmt2 appears to methylate only small RNA (Jeltsch et al. 2017). For this reason, it may not be important in shaping ICpG in large CoV RNA genomes, although it has been observed to relocate from the nucleus to cytoplasmic stress granules (Schaefer and Lyko 2010a, 2010b; Dev et al. 2017), where it may participate in the methylation of mRNA (Dev et al. 2017).
These observations allow formation of a hypothesis for the origin and initial transmission of SARS-CoV-2. First, the ancestor of SARS-CoV-2 and BatCoV RaTG13 infected the intestine of a mammalian species (e.g., canids or human ingesting bat meat). Second, the presumably strong selection against CpG in the viral RNA genome in canid intestine resulted in rapid evolution of the virus, with many CpG → UpG mutations leading to reduced genomic ICpG and GC%. The licking of anal regions in canids during mating and other circumstances facilitate viral transmission from the digestive system to the respiratory system. Finally, the reduced viral genomic ICpG allowed the virus to evade human ZAP-mediated immune response and became a severe human pathogen. Because SARS-CoV-2, TG13 and the related pangolin-derived coronaviruses all have a low-GC genome, the simplest hypothesis is that the low-GC genome was gained in their common ancestor. However, it is also possible that the viral lineage gained the low-GC genome only recently in the digestive system of a canid and spread to other species. This suggests the importance of monitoring SARS-like CoVs in feral dogs in the fight against SARS-CoV-2.
Although the specific origins of SARS-CoV-2 are of vital interest in the current world health environment, this study more broadly suggests that important evidence of viral evolution can be revealed by consideration of the interaction of host defense with viral genomes, including selective pressure exerted by host tissues on viral genome composition.
Supplementary Material
Supplementary data are available at Molecular Biology and Evolution online.
Supplementary Material
Acknowledgments
This study was supported by Discovery Grant from Natural Science and Engineering Research Council (Grant No. RGPIN/2018-03878) of Canada. I am particularly grateful to K. Katoh and his colleagues for excellent comments and critical references, and to G.R. Whittacker for information on canine CoV (Grant No. ACCN KC175339). I have also benefitted from discussion with D. Gray, X. Jiang, J. Mennigen, G. Wang, C.-I. Wu, J. Yang, and W. Zhai. Five anonymous reviewers contributed significantly to the improvement of the manuscript. Heather Rowe corrected many grammatical errors.
References
- Antzin-Anduetza I, Mahiet C, Granger LA, Odendall C, Swanson CM.. 2017. Increasing the CpG dinucleotide abundance in the HIV-1 genomic RNA inhibits viral replication. Retrovirology 14(1):49. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Athanassious R, Marsolais G, Assaf R, Dea S, Descôteaux JP, Dulude S, Montpetit C.. 1994. Detection of bovine coronavirus and type A rotavirus in neonatal calf diarrhea and winter dysentery of cattle in Quebec: evaluation of three diagnostic methods. Can Vet J. 35(3):163–169. [PMC free article] [PubMed] [Google Scholar]
- Atkinson NJ, Witteveldt J, Evans DJ, Simmonds P.. 2014. The influence of CpG and UpA dinucleotide frequencies on RNA virus replication and characterization of the innate cellular pathways underlying virus attenuation and enhanced replication. Nucleic Acids Res. 42(7):4527–4545. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Banerjee A, Falzarano D, Rapin N, Lew J, Misra V.. 2019. Interferon regulatory factor 3-mediated signaling limits middle-east respiratory syndrome (MERS) coronavirus propagation in cells from an insectivorous bat. Viruses 11(2):152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Benfield DA, Saif LJ.. 1990. Cell culture propagation of a coronavirus isolated from cows with winter dysentery. J Clin Microbiol. 28(6):1454–1457. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Buonavoglia C, Decaro N, Martella V, Elia G, Campolo M, Desario C, Castagnaro M, Tempesta M.. 2006. Canine coronavirus highly pathogenic for dogs. Emerging Infect Dis. 12(3):492–494. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burns CC, Campagnoli R, Shaw J, Vincent A, Jorba J, Kew O.. 2009. Genetic inactivation of poliovirus infectivity by increasing the frequencies of CpG and UpA dinucleotides within and across synonymous capsid region codons. JVI 83(19):9957–9969. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cardon LR, Burge C, Clayton DA, Karlin S.. 1994. Pervasive CpG suppression in animal mitochondrial genomes. Proc Natl Acad Sci U S A. 91(9):3799–3803. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chae J-B, Park J, Jung S-H, Kang J-H, Chae J-S, Choi K-S.. 2019. Acute phase response in bovine coronavirus positive post-weaned calves with diarrhea. Acta Vet Scand. 61(1):36. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Decaro N, Martella V, Elia G, Campolo M, Desario C, Cirone F, Tempesta M, Buonavoglia C.. 2007. Molecular characterisation of the virulent canine coronavirus CB/05 strain. Virus Res. 125(1):54–60. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dev RR, Ganji R, Singh SP, Mahalingam S, Banerjee S, Khosla S.. 2017. Cytosine methylation by DNMT2 facilitates stability and survival of HIV-1 RNA in the host cell during infection. Biochem J. 474(12):2009–2026. [DOI] [PubMed] [Google Scholar]
- Dominguez SR, Sims GE, Wentworth DE, Halpin RA, Robinson CC, Town CD, Holmes KV.. 2012. Genomic analysis of 16 Colorado human NL63 coronaviruses identifies a new genotype, high sequence diversity in the N-terminal domain of the spike gene and evidence of recombination. J Gen Virol. 93(11):2387–2398. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dong A. Yoder JA, Zhang X, Zhou L, Bestor TH, Cheng X. 2001. Structure of human DNMT2, an enigmatic DNA methyltransferase homolog that displays denaturant-resistant binding to DNA. Nucleic Acids Res. 29(2):439–448. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fagerberg L, Hallström BM, Oksvold P, Kampf C, Djureinovic D, Odeberg J, Habuka M, Tahmasebpoor S, Danielsson A, Edlund K, et al. 2014. Analysis of the human tissue-specific expression by genome-wide integration of transcriptomics and antibody-based proteomics. Mol Cell Proteomics. 13(2):397–406. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fehr AR, Perlman S.. 2015. Coronaviruses: an overview of their replication and pathogenesis. Methods Mol Biol. 1282:1–23. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ficarelli M, Antzin-Anduetza I, Hugh-White R, Firth AE, Sertkaya H, Wilson H, Neil SJD, Schulz R, Swanson CM.. 2020. CpG dinucleotides inhibit HIV-1 replication through zinc finger antiviral protein (ZAP)-dependent and independent mechanisms. J Virol. 94(6):pii e01337-19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ficarelli M, Wilson H, Pedro Galao R, Mazzon M, Antzin-Anduetza I, Marsh M, Neil SJ, Swanson CM.. 2019. KHNYN is essential for the zinc finger antiviral protein (ZAP) to restrict HIV-1 containing clustered CpG dinucleotides. eLife 8:pii: e46767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fros JJ, Dietrich I, Alshaikhahmed K, Passchier TC, Evans DJ, Simmonds P.. 2017. CpG and UpA dinucleotides in both coding and non-coding regions of echovirus 7 inhibit replication initiation post-entry. eLife 6:pii: e29112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fulton RW, Herd HR, Sorensen NJ, Confer AW, Ritchey JW, Ridpath JF, Burge LJ.. 2015. Enteric disease in postweaned beef calves associated with Bovine coronavirus clade 2. J Vet Diagn Invest. 27(1):97–101. [DOI] [PubMed] [Google Scholar]
- Greenbaum BD, Levine AJ, Bhanot G, Rabadan R.. 2008. Patterns of evolution and host gene mimicry in influenza and other RNA viruses. PLoS Pathog. 4(6):e1000079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Greenbaum BD, Rabadan R, Levine AJ.. 2009. Patterns of oligonucleotide sequences in viral and host cell RNA identify mediators of the host innate immune system. PLoS One 4(6):e5969. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Guindon S, Gascuel O.. 2003. A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood. Syst Biol. 52(5):696–704. [DOI] [PubMed] [Google Scholar]
- Holshue ML, DeBolt C, Lindquist S, Lofy KH, Wiesman J, Bruce H, Spitters C, Ericson K, Wilkerson S, Tural A, et al. 2020. First case of 2019 novel coronavirus in the United States. N Engl J Med. 382(10):929–936. 10.1056/NEJMoa2001191. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hulswit RJG, Lang Y, Bakkers MJG, Li W, Li Z, Schouten A, Ophorst B, van Kuppeveld FJM, Boons G-J, Bosch B-J, et al. 2019. Human coronaviruses OC43 and HKU1 bind to 9-O-acetylated sialic acids via a conserved receptor-binding site in spike protein domain A. Proc Natl Acad Sci U S A. 116(7):2681–2690. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jeltsch A, Ehrenhofer-Murray A, Jurkowski TP, Lyko F, Reuter G, Ankri S, Nellen W, Schaefer M, Helm M.. 2017. Mechanism and biological role of Dnmt2 in nucleic acid methylation. RNA Biol. 14(9):1108–1123. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karlin S, Mrazek J, Campbell AM.. 1997. Compositional biases of bacterial genomes and evolutionary implications. J Bacteriol. 179(12):3899–3913. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Katoh K, Asimenos G, Toh H.. 2009. Multiple alignment of DNA sequences with MAFFT. Methods Mol Biol. 537:39–64. [DOI] [PubMed] [Google Scholar]
- Le Poder S. 2011. 2011. Feline and canine coronaviruses: common genetic and pathobiological features. Adv Virol. 2011:1–609465. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li F. 2016. Structure, function, and evolution of coronavirus spike proteins. Annu Rev Virol. 3(1):237–261. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li W, Shi Z, Yu M, Ren W, Smith C, Epstein JH, Wang H, Crameri G, Hu Z, Zhang H, et al. 2005. Bats are natural reservoirs of SARS-like coronaviruses. Science 310(5748):676–679. [DOI] [PubMed] [Google Scholar]
- Meagher JL, Takata M, Gonçalves-Carneiro D, Keane SC, Rebendenne A, Ong H, Orr VK, MacDonald MR, Stuckey JA, Bieniasz PD, et al. 2019. Structure of the zinc-finger antiviral protein in complex with RNA reveals a mechanism for selective targeting of CG-rich viral sequences. Proc Natl Acad Sci U S A. 116(48):24303–24309. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Midgley CM, The COVID-19 Investigation Team. 2020. First 12 patients with coronavirus disease 2019 (COVID-19) in the United States. medRxiv. doi: 10.1101/2020.03.09.20032896. Accessed April 16, 2020. [DOI] [PubMed] [Google Scholar]
- Odon V, Fros JJ, Goonawardane N, Dietrich I, Ibrahim A, Alshaikhahmed K, Nguyen D, Simmonds P.. 2019. The role of ZAP and OAS3/RNAseL pathways in the attenuation of an RNA virus with elevated frequencies of CpG and UpA dinucleotides. Nucleic Acids Res. 47(15):8061–8083. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Okano M, Xie S, Li E.. 1998. a. Cloning and characterization of a family of novel mammalian DNA (cytosine-5) methyltransferases. Nat Genet. 19(3):219–220. [DOI] [PubMed] [Google Scholar]
- Okano, M.Xie S, Li E. 1998. b. Dnmt2 is not required for de novo and maintenance methylation of viral DNA in embryonic stem cells. Nucleic Acids Res 26(11):2536–2540. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pan L, Mu M, Ren HG, Yang P, Sun Y, Wang R, Yan J, Li P, Hu B, Jin Y, et al. 2020. Clinical characteristics of COVID-19 patients with digestive symptoms in Hubei, China: a descriptive, cross-sectional, multicenter study. Am J Gastroenterol. Advance Access published April 14, 2020, doi: 10.14309/ajg.0000000000000620. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pratelli A. 2006. Genetic evolution of canine coronavirus and recent advances in prophylaxis. Vet Res. 37(2):191–200. [DOI] [PubMed] [Google Scholar]
- Ribeiro J, Lorenzetti E, Alfieri AF, Alfieri AA.. 2016. Molecular detection of bovine coronavirus in a diarrhea outbreak in pasture-feeding Nellore steers in southern Brazil. Trop Anim Health Prod. 48(3):649–653. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schaefer M, Lyko F.. 2010. a. Lack of evidence for DNA methylation of Invader4 retroelements in Drosophila and implications for Dnmt2-mediated epigenetic regulation. Nat Genet. 42(11):920–921. [DOI] [PubMed] [Google Scholar]
- Scha, efer M, Lyko F.. 2010. b. Solving the Dnmt2 enigma. Chromosoma. 119(1):35–40. [DOI] [PubMed] [Google Scholar]
- Schwerk J, Soveg FW, Ryan AP, Thomas KR, Hatfield LD, Ozarkar S, Forero A, Kell AM, Roby JA, So L, et al. 2019. RNA-binding protein isoforms ZAP-S and ZAP-L have distinct antiviral and immune resolution functions. Nat Immunol. 20(12):1610–1620. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sharma S, Patnaik SK, Taggart RT, Baysal BE.. 2016. The double-domain cytidine deaminase APOBEC3G is a cellular site-specific RNA editing enzyme. Sci Rep. 6(1):39100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sharma S, Patnaik SK, Taggart RT, Kannisto ED, Enriquez SM, Gollnick P, Baysal BE.. 2015. APOBEC3A cytidine deaminase induces RNA editing in monocytes and macrophages. Nat Commun. 6(1):6881. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sharma S, Wang J, Alqassim E, Portwood S, Cortes Gomez E, Maguire O, Basse PH, Wang ES, Segal BH, Baysal BE.. 2019. Mitochondrial hypoxic stress induces widespread RNA editing by APOBEC3G in natural killer cells. Genome Biol. 20(1):37. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shi Z, Hu Z.. 2008. A review of studies on animal reservoirs of the SARS coronavirus. Virus Res. 133(1):74–87. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Symes SJ, Allen JL, Mansell PD, Woodward KL, Bailey KE, Gilkerson JR, Browning GF.. 2018. First detection of bovine noroviruses and detection of bovine coronavirus in Australian dairy cattle. Aust Vet J. 96(6):203–208. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Takata MA, Gonçalves-Carneiro D, Zang TM, Soll SJ, York A, Blanco-Melo D, Bieniasz PD.. 2017. CG dinucleotide suppression enables antiviral defence targeting non-self RNA. Nature 550(7674):124–127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Theys K, Feder AF, Gelbart M, Hartl M, Stern A, Pennings PS.. 2018. Within-patient mutation frequencies reveal fitness costs of CpG dinucleotides and drastic amino acid changes in HIV. PLoS Genet. 14(6):e1007420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Trus I, Udenze D, Berube N, Wheler C, Martel MJ, Gerdts V, Karniychuk U.. 2020. CpG-recoding in zika virus genome causes host-age-dependent attenuation of infection with protection against lethal heterologous challenge in mice. Front Immunol. 10:3077. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tulloch F, Atkinson NJ, Evans DJ, Ryan MD, Simmonds P.. 2014. RNA virus attenuation by codon pair deoptimisation is an artefact of increases in CpG/UpA dinucleotide frequencies. eLife 3:e04531. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vabret A, Mourez T, Dina J, van der Hoek L, Gouarin S, Petitjean J, Brouard J, Freymuth F.. 2005. Human coronavirus NL63, France. Emerging Infect Dis. 11(8):1225–1229. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wasson MK, Borkakoti J, Kumar A, Biswas B, Vivekanandan P.. 2017. The CpG dinucleotide content of the HIV-1 envelope gene may predict disease progression. Sci Rep. 7(1):8162. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu Z, Yang L, Ren X, He G, Zhang J, Yang J, Qian Z, Dong J, Sun L, Zhu Y, et al. 2016. Deciphering the bat virome catalog to better understand the ecological diversity of bat viruses and the bat origin of emerging infectious diseases. ISME J. 10(3):609–620. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu Z, Yang L, Ren X, Zhang J, Yang F, Zhang S, Jin Q.. 2016. ORF8-related genetic evidence for Chinese horseshoe bats as the source of Human Severe Acute Respiratory Syndrome Coronavirus. J Infect Dis. 213(4):579–583. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xia X. 2003. DNA methylation and mycoplasma genomes. J Mol Evol. 57(0):S21–S28. [DOI] [PubMed] [Google Scholar]
- Xia X. 2018. DAMBE7: new and improved tools for data analysis in molecular biology and evolution. Mol Biol Evol. 35(6):1550–1552. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yap YL, Zhang XW, Danchin A.. 2003. Relationship of SARS-CoV to other pathogenic RNA viruses explored by tetranucleotide usage profiling. BMC Bioinformatics 4(1):43. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zappulli V, Caliari D, Cavicchioli L, Tinelli A, Castagnaro M.. 2008. Systemic fatal type II coronavirus infection in a dog: pathological findings and immunohistochemistry. Res Vet Sci. 84(2):278–282. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang Y, Chen C, Zhu S, Shu C, Wang D, Song J.. 2020. Isolation of 2019-nCoV from a Stool Specimen for use under a CC0 license. China CDC Wkly [Internet]. 2(8):123–124. [PMC free article] [PubMed] [Google Scholar]
- Zhou P, Yang X-L, Wang X-G, Hu B, Zhang L, Zhang W, Si H-R, Zhu Y, Li B, Huang C-L, et al. 2020. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature 579(7798):270–273. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.