Abstract
Background
Coronaviruses have the potential to cross species barriers. To learn the molecular intersections among the most common coronaviruses of domestic and close-contact animals, we analyzed representative coronavirus genera infecting mouse, rat, rabbit, dog, cat, cattle, white-tailed deer, swine, ferret, mink, alpaca, Rhinolophus bat, dolphin, whale, chicken, duck and turkey hosts; reference or complete genome sequences were available for most of these coronavirus genera. Protein sequence alignments and phylogenetic trees were built for the spike (S), envelope (E), membrane (M) and nucleocapsid (N) proteins. The host receptors and enzymes aminopeptidase N (APN), angiotensin converting enzyme 2 (ACE2), sialic acid synthase (SAS), transmembrane serine protease 2 (TMPRSS2), dipeptidyl peptidase 4 (DPP4), cathepsin L (and its analogs) and furin were also compared.
Results
Overall, the S, E, M, and N proteins segregated according to their viral genera (α, β, or γ), but the S proteins of alphacoronaviruses lacked conservation of phylogeny. Interestingly, the unique polybasic furin cleavage motif found in severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) but not in severe acute respiratory syndrome coronavirus (SARS-CoV) or Middle East respiratory syndrome coronavirus (MERS-CoV) exists in several β-coronaviruses and a few α- or γ-coronaviruses. Receptors and enzymes retained host species-dependent relationships with one another. Among the hosts, critical ACE2 residues essential for SARS-CoV-2 spike protein binding were most conserved in white-tailed deer and cattle.
Conclusion
The polybasic furin cleavage motif found in several β- and other coronaviruses of animals points to the existence of an intermediate host for SARS-CoV-2, and it also offers a counternarrative to the theory of a laboratory-engineered virus. Generally, the S proteins of coronaviruses show crossovers of phylogenies indicative of recombination events. Additionally, the consistency in the segregation of viral proteins of the MERS-like coronavirus (NC_034440.1) from pipistrelle bat supports its classification as a β-coronavirus. Finally, similarities in host enzymes and receptors did not always explain natural cross-infections. More studies are therefore needed to identify factors that determine the cross-species infectivity of coronaviruses.
Keywords: Coronavirus, Host receptors, Comparative phylogeny, Furin cleavage, Spike protein
Background
The ongoing pandemic caused by severe acute respiratory syndrome coronavirus-2 (SARS-CoV-2) has led to unprecedented interest in the study of coronaviruses, although the first coronavirus was identified in the 1930s [1]. Given the most recent outbreaks by SARS-CoV-2, severe acute respiratory syndrome coronavirus (SARS-CoV) and Middle East respiratory syndrome coronavirus (MERS-CoV), the potential for zoonoses and reverse zoonoses, in conjunction with viral, host and environmental factors, have elevated the need to identify the origin, transmission, pathogenesis and control of and novel therapeutic and preventive strategies against these viruses.
The first two-thirds of the coronavirus genome encodes proteins needed for replication, and the remaining one-third encodes accessory and structural proteins, which include hemagglutinin esterase (HE) (present in only Group 2 coronaviruses), spike (S), envelope (E), membrane (M) and nucleocapsid (N) [2, 3]. Previously grouped based on serology, coronaviruses are now categorized into α (alpha), β (beta), γ (gamma) and δ (delta) groups using genetics. Coronaviruses affect many host species ranging from mammals to birds [3, 4].
Cross-species coronavirus infections have been reported among humans and animals. The SARS-CoV epidemic in 2002, for example, appears to have jumped from bats to infect palm civets, then to raccoon dogs, Chinese ferret badgers and finally into the human population [5]. In domestic animals, the most likely recent interspecies transmission leading to an outbreak may be that of canine respiratory coronavirus discovered in 2003 [6]. In Africa, a recent study by Burimuah et al. also showed that the close association of ruminants can lead to a spillover of coronaviruses from cattle to other small ruminants [7]. A high degree of sequence identity among some canine, bovine and human β-coronaviruses [2, 5, 8] strongly suggests that coronaviruses in close-contact environments could recombine or jump the species barrier to initiate emerging infections with sustained transmissions in new hosts. In the last two decades, SARS-CoV-2 has been the third zoonotic coronavirus to originate from animals and cause a pandemic in humans. Although the intermediate host for SARS-CoV-2 has not yet been identified [9, 10], recent comparative molecular analysis indicates that the pangolin and bats harbor the closest relative viruses [11–14].
With the current public availability of whole-genome sequences and additional tools for the analysis of both proteome and genome data, it is now possible to examine viruses and other microbes in great detail even before experimental validations. In this study, we analyzed the phylogenetic relationships among selected coronaviruses that infect domestic and close-contact animals to sketch the subgenomic relationships among the viruses. The molecular phylogeny of coronavirus proteins (spike, envelope, membrane and nucleocapsid), putative host-cell receptors and key functional domains of host enzymes were compared.
Results
Viruses of the same genera may form variable clades at subgenomic levels
Overall, viral S, E, M, and N proteins clustered together according to the viral genera groups (Fig. 1 A-D), although intragroup discordances were evident. For alpaca respiratory α-coronavirus, the S protein shared the same origin as that of the porcine epidemic diarrhea (PED) virus, but distinct from its clade, the E protein shared a distant origin with β-coronaviruses (Fig. 1B). Among β-coronaviruses, proteins from canine respiratory, bovine, rabbit, and white-tailed deer coronaviruses were closely related. The E, M, and N proteins from transmissible gastroenteritis (TGE) virus, canine coronavirus (also referred to as canine enteric coronavirus) and PED virus showed close phylogeny with each other, while the S protein showed discordance. Feline coronavirus (strain UU11) and feline infectious peritonitis (FIP) virus E, M, and N proteins were closely related (Fig. 1 B-D). However, the phylogeny of the S protein from FIP virus was distant from that of feline coronavirus (strain UU11) (Fig. 1 A). To test for possible past recombination events that may have led to the crossover phylogeny between viruses of unrelated host species, we performed a SimPlot analysis of the entire genomes of canine coronavirus, PED virus, TGE virus and alpaca respiratory coronavirus, holding the canine coronavirus sequence as a query against the other three. The results showed that both TGE- and PED-coronaviruses maintained 70-98% similarity with the canine coronavirus along the genome region (approximately 20,000 bases) proximal to the segment coding for the S protein. However, in the S gene segment, the PED virus showed no similarity to either canine coronavirus or TGE virus. TGE virus regained 90-96% similarity to canine coronavirus after a drop of the curve in the proximal region of the S gene (Fig. 1E). The alpaca respiratory coronavirus shared very limited similarity with the canine coronavirus between the proximal 10,000 and 20,000 bases but was similar to the PED virus between the 23,000 and 25,000 base marks corresponding to the spike gene segment. This suggests that the S gene segments of some coronaviruses may originate from other viruses via recombination, leading to divergent subgenome phylogeny among proteins of the same virus.
The polybasic furin cleavage motif in SARS-CoV-2 is present in several β-coronaviruses
In SARS-CoV-2, the S protein contains a polybasic furin cleavage site with the motif RRAR (where R and A are arginine and alanine residues, respectively). This motif is present in the murine β-coronavirus but absent in SARS-CoV, while MERS-CoV has an RSVR sequence at approximately the same location. This cleavage site, located at the junction of the S1/S2 subdomains, specifically has an arginine residue at the second position (RRAR), which is essential for efficient cleavage of the S protein [15]. We show that this furin cleavage motif RRXR (where X is an alanine residue or another amino acid) is present in several β-coronaviruses and the avian infectious bronchitis virus (a γ-coronavirus) as shown in Fig. 2. Notably, a variation in the sequence is also evident in other S proteins, such as RVGR in a β-coronavirus of bat, RSRR in an α-coronavirus of cat, and RKRR in a γ-coronavirus of turkey. Therefore, this motif or its mutant variants exist in natural coronaviruses belonging to different groups.
The TMPRSS2 cleavage site on the viral S protein and the catalytic residues of the host TMPRSS2 enzyme are conserved
The TMPRSS2 cleavage site of the SARS-CoV-2 S protein is well conserved among all viruses studied, although the residues flanking the cleavage site vary widely (Fig. 3A). TMPRSS2 is expressed in tissues of the aerodigestive tract of humans [16], and it is important for initiating SARS-CoV-2 infection by processing the S protein to release fusion peptides for membrane fusion [17]. On the host TMPRSS2 enzyme, we found both the substrate binding site and the triad of catalytic residues (histidine (H) 296, aspartic acid (D) 345 and serine (S) 441) located in the active site to be well conserved among all host species as shown in Fig. 3B and C, respectively.
Comparison of host ACE2, TMPRSS2, APN SAS, DPP4, cathepsin L and furin proteins
Phylogenetic analysis of host enzymes and receptors was performed. TMPRSS2, SAS, APN, DPP4, cathepsin L (and its analogs) and furin showed high similarity among mammals, with distinct separation from those of birds. An overall species-based segregation pattern was observed for the various host enzymes and receptors, except for the ACE2 enzyme, where bat, rabbit and beluga whale ACE2 proteins were distantly related to proteins of other mammals and birds (Fig. 4A-G). Unlike the phylogenies of host species cathepsin and furin, which generally followed species relationship patterns, the phylogeny of DPP4 was rather peculiar in that proteins of cattle and white-tailed deer or beluga whale and bottlenose dolphin were very distantly related. On the other hand, cathepsins of dog and cattle were in the same clade (Fig. 4 E-G). Upon analyzing both the required and critical residues of ACE2 needed for SARS-CoV-2 S protein binding, the cattle and white-tailed deer proteins shared the most similarity scores with humans, followed closely by the dolphin and pig proteins and then by the cat, alpaca and dog proteins (Fig. 4H).
Discussion
We compared the protein-level phylogenetic relationships among common coronaviruses of domestic and close-contact animals and key infection-associated proteins in the hosts. In this report, we highlight that the polybasic furin cleavage site found in SARS-CoV-2, but not in SARS-CoV or MERS-CoV [15], exists in several β-coronaviruses included in this report, although the configuration varied in some of those. For example, in the murine coronavirus (accession no. ACN89705.1), the exact RRAR motif is observed, but in others, it is either reversed or palindromic (Fig. 2). The polybasic furin cleavage site in the SARS-CoV-2 S protein is considered unique to the novel virus that causes COVID-19 [18]. This has led to speculations about the possibility of a laboratory-engineered virus. However, the presence of the same and similar motifs in other β-coronaviruses and even other coronaviruses eliminates the theory of an artificially engineered virus. Although the current search for the intermediate host of SARS-CoV-2 suggests several potential candidates [19], possible recombination events of viruses in the same or different host species, which can give rise to a hybrid or a variant species, should not be ruled out. RpYN09 bat coronavirus, the most recently found closest relative of SARS-CoV-2, lacks the RRAR motif [14], suggesting that the parent virus containing this motif has yet to be discovered.
Although the identities of all host receptors for coronaviruses are not fully known, ACE2 is documented as a receptor mediating both infection and transmission of SARS-CoV-2 [20, 21]. ACE2 is a key regulator of the angiotensin system and is well expressed in the vascular endothelium, smooth muscle cells of the intestines, kidneys and heart muscle cells [22–24]. During infection, twenty key amino acid residues in the ACE2 enzyme interact with the S protein of SARS-CoV-2 [25]. Five of these residues, lysine (K), glutamic acid (E), aspartic acid (D), methionine (M) and lysine (K), at positions 31, 35, 38, 82 and 353, respectively, are critical for S protein binding [26]. Sequence alignment analysis showed that cattle and white-tailed deer share the most similarity scores to humans for both required and critical amino acid residues. Interestingly, a recent USDA/APHIS study showed that approximately 40% of the wild white-tailed deer population in four states in the USA tested positive for anti-SARS-CoV-2 immunoglobulins (https://www.aphis.usda.gov/animal_health/one_health/downloads/qa-covid-white-tailed-deer-study.pdf). This finding indicates the possibility that SARS-CoV-2 may establish and spread in hosts with high affinity receptors. However, the similarity of a single receptor may not explain or predict cross-species infections. For example, while the bovine-canine cross-species jump of a β-coronavirus is obvious from the phylogeny, the comparison of their ACE2 does not indicate a unique relationship of this receptor between cow and dog, although their cathepsins belong to the same clade. Therefore, ACE2 may not necessarily be a receptor for bovine-canine interspecies viral infection. Similarly, ACE2 of bottlenose dolphin and pig shared more common residues with those of human ACE2 than other hosts. Specifically, the Rhinolophus bat reference sequence has one of the least phylogenetic and key amino acid commonalities with human ACE2 (Fig. 4H). This brings into question the relevance of the ACE2 receptor in this bat species.
The TMPRSS2 cleavage site in SARS-CoV-2 was conserved among all viruses we included (Fig. 3A). In the host species, both the substrate binding and active sites of the enzyme were well conserved. Notably, the triad of catalytic residues histidine (H) 296, aspartic acid (D) 345 and serine (S) 441 [27], which interact with the SARS-CoV-2 S protein, are conserved among all species included in this study (Fig. 3B). The overall phylogenetic segregation pattern of host enzymes and receptors was species-related except for the ACE2 enzyme, where bat, rabbit and whale proteins were distantly related to those of other mammals.
Among the viruses we analyzed, consistent patterns with little variation were observed among γ-coronaviruses. The canine respiratory, rabbit, bovine respiratory, and white-tailed deer coronaviruses also consistently clustered together. However, phylogenetic crossovers were evident among canine coronavirus, TGE virus, PED virus, feline coronavirus, and FIP virus. The S protein of the feline coronavirus included in this report was distant from that of the FIP virus, displaying discordant relationships that indicate past recombination events [28]. An interesting phylogenetic common origin was also noted for the S proteins of PED virus and the distantly related alpaca respiratory coronavirus. These crossovers are most likely from past recombination and mutation events. Coronaviruses are noted for their high frequency of recombination [29], and the patterns of recombination among coronaviruses of domestic animals deserve a closer look for us to recognize the origins of cross-species infections and to design effective disease control strategies.
Finally, we also noted that the MERS-like bat coronavirus (accession number YP_009361857.1; https://www.ncbi.nlm.nih.gov/protein/1189488876) is annotated as unclassified in the National Center for Biotechnology Information (NCBI) database. Considering the phylogenetic clustering of the S, M, E, and N proteins, they most likely belong to the β-coronavirus group.
Limitations of our study include the unknown nature of the complete receptor repertoire for coronaviruses among domestic and close-contact animals. Although we based our comparative phylogeny primarily on the reported or putative human receptor or coronavirus infection-associated proteins, coronaviruses may not use the same receptors in different species. It is possible that a virus crossing to another species will adapt to using different receptors or associated proteins.
Conclusion
Recombination or a few critical mutations in viral receptor binding domains or receptor protein key residues may contribute to cross-species infections [30]. We highlighted key molecular relationships that exist among common coronaviruses and their host receptors. Notably, the existence of similar polybasic residues within the S proteins of several coronaviruses suggests that the RRXR sequence motif is not unique to SARS-CoV-2. Unlike genome-level phylogenies, subgenomic-level comparisons provide stronger insights into functional ontogeny and the identification of consequential recombination events. However, not all factors that determine species-specific or cross-species infections are currently known. Therefore, additional studies of host determinants for virus attachment and productive infectivity are urgently needed to understand the role different hosts play in coronavirus infections, maintenance, and host adaptations. Such studies will be critical to identify molecules that can be targeted for the prevention and control of diseases caused by these viruses, especially in hosts of economic or public health importance.
Methods
All protein sequences included in our analysis were collected from the National Center for Biotechnology Information (NCBI) database (https://www.ncbi.nlm.nih.gov/nuccore/), stored in FASTA formats and analyzed. In total, 32 coronaviruses, comprising eight α-coronaviruses, 19 β-coronaviruses and five γ-coronaviruses, were included in our comparative study as shown in Table 1. For each of the viruses, we assembled protein sequence data on the spike (S), membrane (M), envelope (E) and nucleocapsid (N) structural proteins. Furthermore, protein sequence data on key receptors and enzymes reported to be involved in viral infection, namely, angiotensin converting enzyme 2 (ACE2), transmembrane serine protease 2 (TMPRSS2), aminopeptidase N (APN), sialic acid synthase (SAS), dipeptidyl peptidase 4 (DPP4), cathepsin L and furin, of seventeen hosts were also assembled, and their details are shown in Table 2. The FASTA format of all sequences was organized in Text Editor for analyses. Using MEGA X software version 11, sequence alignments and phylogenetic trees were developed. Alignments were generated using the Muscle Tool with neighbor-joining as the cluster method, and the degree of consensus among conserved residues was visualized using Snap Gene software. The SimPlot program [31] was used to graphically depict similarities among selected sequences. Phylogenetic trees were constructed using the neighbor-joining method with a Poisson model of substitution at 1000 bootstrap replications, and all the results were validated with the maximum likelihood method. All other default settings of the bioinformatics programs used were applied.
Table 1.
Virus Group | Name of Virus | Accession ID | Host Species |
---|---|---|---|
Alphacoronavirus (α) | Feline Infectious Peritonitis (FIP) | NC_002306.3 | Cat |
Feline Coronavirus (strain UU11) | FJ938052.1 | Cat | |
Canine Coronavirus | KC175340.1 | Dog | |
Porcine Epidemic Diarrhea (PED) virus | NC_028806.1 | Pig | |
Transmissible Gastroenteritis (TGE) virus | NC_038861.1 | Pig | |
Ferret Coronavirus | NC_030292.1 | Ferret | |
Mink Coronavirus | NC_023760.1 | Mink | |
Alpaca Respiratory Coronavirus | JQ410000.1 | Alpaca | |
Betacoronavirus (β) | Canine Respiratory Coronavirus | KX432213.1 | Dog |
Bovine Coronavirus | NC_003045.1 | Cattle | |
White-tailed Deer Coronavirus | FJ425187.1 | Whitetail Deer | |
Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) | NC_045512.2 | Human | |
Severe Acute Respiratory Syndrome Coronavirus (SARS-CoV) | YP_009825051.1 | Human | |
Middle East Respiratory Syndrome Coronavirus MERS-CoV | YP_009047204.1 | Human | |
MERS-like Coronavirus (PREDICT/PDF-2180) | NC_034440.1 | Pipistrelle Bat | |
Mouse Coronavirus | FJ647220.1 | Mouse | |
Rat Coronavirus | KF294371.1 | Rat | |
Rabbit Coronavirus | JN874562.1 | Rabbit | |
SARS-CoV-2 | QLC48407.1 | Tiger | |
Murine Coronavirus | ACN89705.1 | Mouse | |
Sable Antelope Coronavirus | ABP38306.1 | Sable Antelope | |
Giraffe Coronavirus beta | ABP38334.1 | Giraffe | |
Human Coronavirus OC43 beta | AMK59677.1 | Human | |
Human Coronavirus OC43 | AWW13551.1 | Chimpanzee | |
Camel Coronavirus HKU23 beta | ALA50080.1 | Camel | |
Buffalo Coronavirus | ANJ04717.1 | Buffalo | |
Equine Coronavirus | BAS18866.1 | Horse | |
Gammacoronavirus (γ) | Avian Infectious Bronchitis | NC_001451.1 | Chicken |
Turkey Coronavirus | NC_010800.1. | Turkey | |
Duck Coronavirus | NC_048214.1 | Duck | |
Bottlenose Dolphin Coronavirus | MN690608.1 | Bottlenose Dolphin | |
Beluga Whale Coronavirus | NC_010646.1 | Beluga Whale |
Table 2.
Acknowledgements
We thank Dr. Ruby Perry, Dean, Tuskegee University College of Veterinary Medicine, for support to veterinary students’ summer research, through which this project was initiated.
Authors’ information (optional)
None Declared.
Abbreviations
- α
Alpha
- β
Beta
- γ
Gamma
- S
Spike
- E
Envelope
- M
Membrane
- N
Nucleocapsid
- APN
Aminopeptidase-N
- ACE2
Angiotensin Converting Enzyme 2
- SAS
Sialic Acid Synthase
- TMPRSS2
Transmembrane Serine Protease 2
- SARS-CoV-2
Acute Respiratory Syndrome Coronavirus-2
- SARS-CoV
Acute Respiratory Syndrome Coronavirus
- MERS-CoV
Middle East Respiratory Syndrome Coronavirus
- TGE
Transmissible gastroenteritis
- PED
Porcine Epidemic Diarrhea PED
- FIP
Feline Infectious Peritonitis
- NCBI
National Center for Biotechnology Information
Authors’ contributions
TS and PM: conceptualization, investigation, methodology, review and editing, data curation, validation, supervision. KB: data curation, validation, comparative analysis, methodology, writing manuscript draft, review and editing. SS and CW: data curation and organization, methodology, comparative analysis. GR, WA and RF: scientific and conceptual input, review and editing. The author(s) read and approved the final manuscript.
Funding
This study was supported through grants from Health and Human Services HHS/HRSA #D34HP00001, and National Institutes of Health grants (#U54CA118623, SRE #T35OD010432, TU-CBR/RCMI #U54MD007585 shared resources core facility).
Availability of data and materials
All data used in this study are publicly available in the National Center for Biotechnology Information (NCBI, https://www.ncbi.nlm.nih.gov/nuccore/), and the accession numbers for both virus and host sequences are provided in Tables 1 and 2.
Declarations
Ethics approval and consent to participate
Not Applicable.
Consent for publication
Not Applicable.
Competing interests
Authors declare no competing personal relationships or financial interests that could have appeared to influence the findings reported in this paper.
Footnotes
Publisher’s Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Hudson CB, Beaudette FR. Infection of the cloaca with the virus of infectious bronchitis. Science. 1932;76:34. doi: 10.1126/science.76.1958.34-a. [DOI] [PubMed] [Google Scholar]
- 2.Vijgen L, Keyaerts E, Moës E, Thoelen I, Wollants E, Lemey P, et al. Complete genomic sequence of human coronavirus OC43: molecular clock analysis suggests a relatively recent zoonotic coronavirus transmission event. J Virol. 2005;79:1595–1604. doi: 10.1128/jvi.79.3.1595-1604.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Belouzard S, Millet JK, Licitra BN, Whittaker GR. Mechanisms of coronavirus cell entry mediated by the viral spike protein. Viruses. 2012;4:1011–1033. doi: 10.3390/v4061011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Wang L, Byrum B, Zhang Y. Detection and genetic characterization of deltacoronavirus in pigs, Ohio, USA, 2014. Emerg Infect Dis. 2014;20:1227–1230. doi: 10.3201/eid2007.140296. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Perlman S, Netland J. Coronaviruses post-SARS: update on replication and pathogenesis. Nat Rev Microbiol. 2009;7:439–450. doi: 10.1038/nrmicro2147. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Erles K, Toomey C, Brooks HW, Brownlie J. Detection of a group 2 coronavirus in dogs with canine infectious respiratory disease. Virology. 2003;310:216–223. doi: 10.1016/S0042-6822(03)00160-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Burimuah V, Sylverken A, Owusu M, El-Duah P, Yeboah R, Lamptey J, et al. Molecular-based cross-species evaluation of bovine coronavirus infection in cattle, sheep and goats in Ghana. BMC Vet Res. 2020;16:405. doi: 10.1186/s12917-020-02606-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Lu S, Wang Y, Chen Y, Wu B, Qin K, Zhao J, et al. Discovery of a novel canine respiratory coronavirus support genetic recombination among betacoronavirus1. Virus Res. 2017;237:7–13. doi: 10.1016/j.virusres.2017.05.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Goldstein SA, Brown J, Pedersen BS, Quinlan AR, Elde NC. Extensive recombination-driven coronavirus diversification expands the pool of potential pandemic pathogens. bioRxiv. 2021:2021.02.03.429646. 10.1101/2021.02.03.429646. [DOI] [PMC free article] [PubMed]
- 10.Gabutti G, d’Anchera E, Sandri F, Savio M, Stefanati A. Coronavirus: update related to the current outbreak of COVID-19. Infect Dis Ther. 2020;9:241–253. doi: 10.1007/s40121-020-00295-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Boni MF, Lemey P, Jiang X, Lam TT-Y, Perry BW, Castoe TA, et al. Evolutionary origins of the SARS-CoV-2 sarbecovirus lineage responsible for the COVID-19 pandemic. Nat Microbiol. 2020;511. 2020;5:1408–17. 10.1038/s41564-020-0771-4. [DOI] [PubMed]
- 12.Wrobel AG, Benton DJ, Xu P, Calder LJ, Borg A, Roustan C, et al. Structure and binding properties of pangolin-CoV spike glycoprotein inform the evolution of SARS-CoV-2. Nat Commun. 2021;12:837. doi: 10.1038/s41467-021-21006-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Zhang T, Wu Q, Zhang Z. Probable pangolin origin of SARS-CoV-2 associated with the COVID-19 outbreak. Curr Biol. 2020;30:1346–1351.e2. doi: 10.1016/j.cub.2020.03.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Zhou H, Ji J, Chen X, Bi Y, Li J, Wang Q, et al. Identification of novel bat coronaviruses sheds light on the evolutionary origins of SARS-CoV-2 and related viruses. Cell. 2021;184:4380–4391.e14. doi: 10.1016/j.cell.2021.06.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Örd M, Faustova I, Loog M. The sequence at spike S1/S2 site enables cleavage by furin and phospho-regulation in SARS-CoV2 but not in SARS-CoV1 or MERS-CoV. Sci Rep. 2020;101. 2020;10:1–10. 10.1038/s41598-020-74101-0. [DOI] [PMC free article] [PubMed]
- 16.Russo R, Andolfo I, Lasorsa VA, Iolascon A, Capasso M. Genetic analysis of the coronavirus SARS-CoV-2 host protease TMPRSS2 in different populations. Front Genet. 2020;11:872. doi: 10.3389/fgene.2020.00872. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Papa G, Mallery DL, Albecka A, Welch LG, Cattin-Ortolá J, Luptak J, et al. Furin cleavage of SARS-CoV-2 spike promotes but is not essential for infection and cell-cell fusion. PLoS Pathog. 2021;17:e1009246. doi: 10.1371/journal.ppat.1009246. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Li W. Delving deep into the structural aspects of a furin cleavage site inserted into the spike protein of SARS-CoV-2: a structural biophysical perspective. Biophys Chem. 2020;264:106420. doi: 10.1016/j.bpc.2020.106420. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Zhao J, Cui W, Tian B. The potential intermediate hosts for SARS-CoV-2. Front Microbiol. 2020;0:2400. doi: 10.3389/fmicb.2020.580137. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Ni W, Yang X, Yang D, Bao J, Li R, Xiao Y, et al. Role of angiotensin-converting enzyme 2 (ACE2) in COVID-19. Crit Care. 2020;24:422. doi: 10.1186/s13054-020-03120-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Millet JK, Jaimes JA, Whittaker GR. Molecular diversity of coronavirus host cell entry receptors. FEMS Microbiol Rev. 2021;45:1–16. doi: 10.1093/FEMSRE/FUAA057. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Bornstein SR, Dalan R, Hopkins D, Mingrone G, Boehm BO. Endocrine and metabolic link to coronavirus infection. Nat Rev Endocrinol. 2020;16:297–298. doi: 10.1038/s41574-020-0353-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Bavishi C, Maddox TM, Messerli FH. Coronavirus disease 2019 (COVID-19) infection and renin angiotensin system blockers. JAMA Cardiol. 2020;5:745–747. doi: 10.1001/jamacardio.2020.1282. [DOI] [PubMed] [Google Scholar]
- 24.Bai F, Pang XF, Zhang LH, Wang NP, McKallip RJ, Garner RE, et al. Angiotensin II AT1 receptor alters ACE2 activity, eNOS expression and CD44-hyaluronan interaction in rats with hypertension and myocardial fibrosis. Life Sci. 2016;153:141–152. doi: 10.1016/j.lfs.2016.04.013. [DOI] [PubMed] [Google Scholar]
- 25.Wu L, Chen Q, Liu K, Wang J, Han P, Zhang Y, et al. Broad host range of SARS-CoV-2 and the molecular basis for SARS-CoV-2 binding to cat ACE2. Cell Discov. 2020;61. 2020;6:1–12. 10.1038/s41421-020-00210-9. [DOI] [PMC free article] [PubMed]
- 26.Luan J, Lu Y, Jin X, Zhang L. Spike protein recognition of mammalian ACE2 predicts the host range and an optimized ACE2 for SARS-CoV-2 infection. Biochem Biophys Res Commun. 2020;526:165–169. doi: 10.1016/j.bbrc.2020.03.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Hussain M, Jabeen N, Amanullah A, Baig AA, Aziz B, Shabbir S, et al. Molecular docking between human tmprss2 and sars-cov-2 spike protein: conformation and intermolecular interactions. AIMS Microbiol. 2020;6:350–360. doi: 10.3934/microbiol.2020021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Brown MA. Genetic determinants of pathogenesis by feline infectious peritonitis virus. Vet Immunol Immunopathol. 2011;143:265–268. doi: 10.1016/j.vetimm.2011.06.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Lai MM. Recombination in large RNA viruses: coronaviruses. Semin Virol. 1996;7:381–388. doi: 10.1006/smvy.1996.0046. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Li F. Structure, function, and evolution of coronavirus spike proteins. Annu Rev Virol. 2016;3:237–261. doi: 10.1146/annurev-virology-110615-042301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Lole KS, Bollinger RC, Paranjape RS, Gadkari D, Kulkarni SS, Novak NG, et al. Full-length human immunodeficiency virus type 1 genomes from subtype C-infected Seroconverters in India, with evidence of Intersubtype recombination. J Virol. 1999;73:152–160. doi: 10.1128/JVI.73.1.152-160.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
All data used in this study are publicly available in the National Center for Biotechnology Information (NCBI, https://www.ncbi.nlm.nih.gov/nuccore/), and the accession numbers for both virus and host sequences are provided in Tables 1 and 2.