Abstract
Spike (S) glycoproteins mediate the coronavirus entry into the host cell. The S1 subunit of S-proteins contains the receptor-binding domain (RBD) that is able to recognize different host receptors, highlighting its remarkable capacity to adapt to their hosts along the viral evolution. While RBD in spike proteins is determinant for the virus–receptor interaction, the active residues lie at the receptor-binding motif (RBM), a region located in RBD that plays a fundamental role binding the outer surface of their receptors. Here, we address the hypothesis that SARS-CoV and SARS-CoV-2 strains able to use angiotensin-converting enzyme 2 (ACE2) proteins have adapted their RBM along the viral evolution to explore specific conformational topology driven by the residues YGF to infect host cells. We also speculate that this YGF-based mechanism can act as a protein signature located at the RBM to distinguish coronaviruses able to use ACE2 as a cell entry receptor.
Communicated by Ramaswamy H. Sarma
Keywords: SARS-CoV, SARS-CoV-2, ACE2, coronaviruses, receptor-binding domain, receptor-binding motif, evolutionary pattern
1. Introduction
Viruses are the most numerous type of biological entity on Earth and the identification of novel viruses continues to enlarge the known viral biosphere (Pan et al., 2017; Shi et al., 2018). This collection of all viruses presents enormous morphological and genomic diversity as a result of continuous exchange of genetic material with the host cells (Koonin et al., 2020; Nasir et al., 2014). Moreover, this well succeeded long-term virus–host interaction indicates that viruses are more than simple genomic parasites in all cellular life forms (Claverie, 2006). A number of evidences has led to the proposal that viruses play an astonishing role as agents of evolution because of their capacity in propagating between biomes (Sano et al., 2004) and in gene transfer between species (Enard et al., 2016; Filée et al., 2003; Koonin & Dolja, 2013; Van Blerkom, 2003). For this purpose, viruses have developed large number of genome replication and protein expression strategies to benefit from the host translational machinery over time (Baranowski et al., 2003).
Despite all of such enormous diversity in gene sequence, it is not possible to achieve huge number of highly distinct protein structures mainly because of stereochemical constraints on the possible protein folds (Ng et al., 2020). In fact, it has been observed common secondary structures throughout different virus families while the sequences are not fully conserved (Ahola & Karlin, 2015; Ng et al., 2020). This may result in evolutionary efficiency once viruses can exploit already well-designed motifs from similar cellular functions (Baranowski et al., 2003).
Currently, the world population is confronting a new coronavirus disease (COVID-19), a highly infectious disease to humans. This disease is caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) and is affecting human health worldwide. Coronaviruses (CoVs) belong to the large and diverse family Coronaviridae, within the order Nidovirales and suborder Cornidovirineae (Gorbalenya et al., 2020). Their subfamily Orthocoronavirinae contains four genera based on phylogeny and termed as α, β, γ, and δ-coronavirus.
SARS-CoV-2 belongs to the β-coronavirus genus as well as SARS-CoV, middle east respiratory syndrome coronavirus (MERS-CoV), and hCoV-HKU1, to cite a few (Jaimes et al., 2020). Other important representative human viruses as hCoV-NL63 and hCoV-229E belong to α-coronavirus. Phylogenetic relationships among the known members of this subfamily indicate that α and β-coronavirus infect mammals, while γ and δ-coronavirus infect both mammalians and avians.
Members of Coronaviridae family are enveloped, positive single-stranded RNA (+ssRNA) viruses and render the largest genomes among all known RNA viruses (Cheng et al., 2007; Li, 2016; Masters, 2006; Su et al., 2016). The + ssRNA genomes undergo rapid mutational changes (Sanjuán & Domingo-Calap, 2016), leading to faster adaptation to new hosts, though they also contain conserved sequence motifs as observed, for example, in multiple alignments do CoV strains (Ahola & Karlin, 2015; Lau et al., 2007; Woo et al., 2012).
Coronaviruses attach to host cell surface receptors via their spike (S) glycoproteins, located on the viral envelope, to mediate the entry into the host cell. Each monomer of trimeric S-protein comprises two subunits S1 and S2, responsible for the viral attachment and for the membrane fusion, respectively (Beniac et al., 2006; Li, 2012; Song et al., 2018). The S1 coronavirus subunit contains the receptor-binding domain (RBD) that is able to recognize different host receptors, highlighting its remarkable capacity to adapt to their hosts along the viral evolution. Thus, it is not unexpected to observe in this domain high sequence divergence even for the same coronavirus identified in different host species. In contrast, the S2 subunit presents the most conserved region in the S-protein.
The binding of RBD spike proteins to the receptor on the host cell is the first step in virus infection. This initial step is followed by an entry mechanism of enveloped viruses into target cells. Usually, most viruses enter cells through endocytotic pathways with the fusion occurring in the endosomes, although a direct entry into cells can occur by fusion of their envelopes with the cell membrane (Belouzard et al., 2012).
A number of CoVs utilizes angiotensin-converting enzyme 2 (ACE2) as the entry receptor into cells, exemplified by β-genus human respiratory SARS-CoV, SARS-CoV-2, and α-genus hCoV-NL63 (Hoffmann et al., 2020; Jaimes et al., 2020; Letko et al., 2020; Walls et al., 2020). In particular, SARS-CoV, as well as SARS-CoV-2, enter the cell via endocytosis induced by RBD complexed with human ACE2 (hACE2) receptor (Milewska et al., 2018; Ou et al., 2020; Wang et al., 2008, 2008; Yuan et al., 2017). In contrast, the β-genus MERS-CoV and its genetically related bat CoV-HKU4 utilize dipeptidyl peptidase 4 (DPP4) as the viral receptor (Yang et al., 2014). Other viral receptor is aminopeptidase N (APN), recognized for example by the α-genus hCoV-229E (Li, 2015).
The human coronaviruses hCoV-HKU1, hCoV-229E, hCoV-NL63, and hCoV-OC43, cause mild to moderate upper respiratory tract infections (Weiss & Navas-Martin, 2005), while SARS-CoV and SARS-CoV-2 cause severe respiratory diseases, with SARS-CoV-2 being far more lethal than SARS-CoV. SARS-CoV strains vary enormously in infectivity, which can be connected to their binding affinities to hACE2 (Cui et al., 2019). This binding affinity, in turn, can be correlated with disease severity in humans (Li, Zhang, et al., 2005).
While RBD in spike proteins is determinant for the virus–receptor interaction, the active residues lie at the receptor-binding motif (RBM), which is part of RBD and plays a fundamental role binding the outer surface of their receptors (Cui et al., 2019; Hoffmann et al., 2020; Letko et al., 2020; Li, Farzan, et al., 2005; Wan et al., 2020). The importance of the RBM is further explored here in relation to its structural topology. Thus, instead of only analyzing specific residues that make contacts with ACE2 after binding, we go a step further and track the molecular origin that drives the viral attachment to this cell receptor. This investigation has revealed a highly conserved amino acid residue sequence Tyr-Gly-Phe (YGF) in coronavirus variants that employ this receptor. Consequently, we hypothesize that the short sequence YGF is vital for RBD–ACE2 interaction because of the formation of a hydrophobic pocket proper to the receptor specificity (He et al., 2020; Lan et al., 2020; Wan et al., 2020; Wu et al., 2012). Moreover, we recognize that a similar binding mechanism is characteristic of the interaction between ubiquitin-associated (UBA) domain proteins and ubiquitin. In this vein, we conclude that is plausible that SARS-CoV and SARS-CoV-2 strains able to use ACE2 proteins have adapted their RBM along the viral evolution to explore such a mechanism to infect host cells.
1.1. The conserved XGF loop in UBA–ubiquitin interaction
Amino acid sequences of type XGF, where the residue X is frequently the residue Met, form a highly conserved loop characteristic of ubiquitin-associated (UBA) domain that occurs in a variety of proteins. The UBA domain is a conserved motif through eukaryotic evolution and is found in many proteins related to the ubiquitin metabolism and in particular, associated with ubiquitin-mediated proteolysis (Finley, 2009; Pickart & Cohen, 2004). The MGF loop in the UBA domain is typical of a hydrophobic pocket that is critical for recognition and binding affinity to ubiquitin through a hydrophobic surface patch located in the vicinity of this loop (Bertolaet et al., 2001; Cabe et al., 2018; Dieckmann et al., 1998; Long et al., 2008; Madura, 2002; Raasi et al., 2005; Tse et al., 2011; Wilkinson et al., 2001). UBA domains are ubiquitin receptors whose binding is a fundamental step for diverse regulatory functions.
NMR analysis of UBA–ubiquitin interactions identify hydrophobic surface patches formed by the conserved MGF sequence as the main determinants for the protein–protein interaction. A number of alignments of UBA domains has revealed mutations in the MGF sequence, mainly M → L, M → Q, and F → Y, but these mutations still maintain the overall hydrophobic characteristic for the main set of residues located at the protein-interacting surface in these UBA domains (Geetha & Wooten, 2002; Mueller & Feigon, 2002).
An illustrative example of this protein–protein interaction mechanism is presented in Figure S1, supplemental material. This figure displays the UBA domain of Dsk2p protein complexed with ubiquitin (PDB ID: 1WR1) where the residues M342, G343, and F344 form the core of the hydrophobic surface patch for interacting with ubiquitin. The importance of this hydrophobic surface patch has been strengthened by mutagenesis experiments (Ohno et al., 2005) of single substitutions G343A, F344A, and L369A in Dsk2p–UBA domain, producing a large decrease of its binding affinity for ubiquitin.
Figure 1.
Detailed surface view of SARS-CoV and SARS-CoV-2 RBD. (a) Residues YGFY at the interface of SARS-CoV complexed with hACE2 (PDB ID: 2AJF). These residues are in yellow color and form a hydrophobic pocket located in the RBM (magenta color). (b) Residues YGF and EGF (yellow color) at the interface of SARS-CoV-2 complexed with hACE2 (PDB ID: 6LZG). The first sequence is located in a hydrophobic pocket, while the second sequence EGF is on the RBM surface (magenta color). Ribbon representation of ACE2 is in blue color.
2. Results and discussion
2.1. Spike receptor-binding motifs in human CoVs
Here, we investigate the occurrence and importance of the specific amino acid residue sequence YGF for SARS-CoV and SARS-CoV-2 strains able to use ACE2 proteins as receptors. It is displayed in Figure 1a the interface of SARS-CoV RBD spike-protein (magenta and green color) complexed with hACE2 (blue color) to gain insight about the importance of this type of conformational mechanism in creating a shape complementarity between receptor and ligand. The RBM is in magenta color, with the yellow color displaying the YGFY sequence in that pocket, which establishes the proper relative position for favorable binding to surface-exposed hACE2 residues. The YGFY sequence seems strongly conserved in SARS-CoV spike RBD, more precisely located at residues 481–484 in the receptor binding motif. Noteworthy, this sequence seems to be unique because even the shorter YGF sequence does not occur in this region, neither in the RBD. As a consequence of this hydrophobic pocket, amino acid residues responsible for binding interaction are located close to this conformational structure as, for example, the residues N479 and T487 (Figure 2a). These residues have been identified to be essential for SARS-CoV spike RBD/ACE2 binding (Li et al., 2005, 2005; Qu et al., 2005). The residue N479 in SARS-CoV is located near K31 of hACE2 which in turn makes a salt bridge with E35, a residue buried in that hydrophobic environment. The residue T487 is located close to K353 on hACE2, and in turn makes a salt bridge with D38, also buried in that pocket. Other important residues for this attachment are Y442, L472, and D480 (Wan et al., 2020).
Figure 2.
SARS-CoV and SARS-CoV-2/hACE2 RBD interfaces. Ribbon diagrams of SARS-CoV RBD (a) and SARS-CoV-2 RBD (b) complexed with hACE2 (blue color), where the RBM is highlighted in magenta color. The main residues responsible for the structural binding are displayed in the stick representation.
Figure 3a displays the residues of SARS-CoV RBM in direct contact with hACE2 as determined by the hydrophobic–hydrophilic properties of the interface residues as predicted by the CSU program (Sobolev et al., 1999). This bipartite network of contacts highlights the importance of residues that are located near the YGF sequence and contributes to the stabilization of SARS-CoV complexed with hACE2. For example, Y475 makes hydrogen bond (H–B) contacts with Q24, F28, and Y83; N479 with K31, and H34; Y486 with Y41, N330, and R357; and T487 with Y41.
Figure 3.
Contact networks between (a) SARS-CoV residues, and (b) SARS-CoV-2 residues located in the RBM regions with ACE2. SARS-CoV and SARS-CoV-2 residues are in magenta color while human ACE2 residues are in blue color.
Figure 1b displays the interface of SARS-CoV-2 RBD spike protein complexed with hACE2 (blue color). Now, the sequence YGFY observed in SARS-CoV is replaced by YGFQ as a result of sequence alignments shown in Figure 4. The single-point mutation Y484 → Q498 replaces a hydrophobic residue in SARS-CoV by a hydrophilic one in SARS-CoV-2.
Figure 4.
Sequence alignments of human CoVs restricted to RBM residues. The medium purple color highlights the YGFY pattern followed by the mutation Y498Q in the RBM of SARS-CoV-2 strains.
Figure 4 compares residue sequences of human SARS-CoV and SARS-CoV-2 strains aligned with RBM of SARS-CoV Tor2, an epidemic strain isolated from humans during the SARS epidemic in 2002–2003. The human Tor2 strain has high affinity for hACE2 (Cui et al., 2019). We highlight in this figure in medium purple color the hydrophobic sequence YGFY typical of SARS-CoV, occurring at positions 481–484 in the spike protein. The corresponding mutated sequence occurs now at positions 495–498 in SARS-CoV-2 spike protein.
The important residues for the interface interaction found in SARS-CoV are mutated in SARS-CoV-2. The sequence alignments show the mapping, Y442 → L455, L472 → F486, N479 → Q493, D480 → S494, and T487 → N501. These mutations do not present a drastic change in their hydrophobic character (Wimley & White, 1996), thus preserving the overall receptor-binding topological structure for these viruses. In particular, residues L455 and Q493 in SARS-CoV-2 preserve the noted favorable interactions with the residues E35 and K31 in hACE2 (Yi et al., 2020) (Figure 2b). Interestingly, a new GF sequence appears in the RBM of SARS-CoV-2 strains as a consequence of the mutation L472 → F486, producing a small hydrophobic surface, but does not seem to disrupt the proposed topological formation mechanism for ACE2 binding. No other GF sequence appears in their RBD.
Figure 3b displays the SARS-CoV-2 RBM residues in direct contact with hACE2 as predicted by the CSU program, showing again the importance of residues close to the hydrophobic pocket. Details of protein–protein binding interfaces can be quite different among strains, likely related to their infectivity degree. It has been noted that mutations in RBM residue T487 in SARS-CoV have an important role in the human-to-human and animal-to-human transmission of SARS-CoV (Cui et al., 2019; Li, 2015; Qu et al., 2005).
Now, we investigate the relevance of the hydrophobic pocket driven by the YGFY sequence in promoting the stability of SARS-CoV spike receptor binding domain complexed with hACE2. To this end, we conducted a series of mutations to estimate the change in binding affinity using the MutaBind2 method (Zhang et al., 2020).
Initially, we investigate the influence of N479 mutation by the residues E, K, Q, R, and S on the complexation. The mutations N479E, N479K, N479Q, N479R, and N479S have been observed, respectively in pangolin strains (see Figure 5), bat and palm civet strains (see Figures 5 and 6), human SARS-CoV-2 (see Figure 4), bat and palm civet strains (see Figures 5 and 6), and bat strains (see Figure 6). The calculation of for these mutations does not indicate any appreciable effect on SARS-CoV spike RBD/hACE2 binding affinity due to its small variation, as displayed in Table 1. Therefore, we may conclude that N479 does not enhance the binding affinity of spike RBD to hACE2, and could as well be replaced by any of the above residues, preserving the hydrophilic character (Wimley & White, 1996) and conformational stability.
Figure 5.
Sequence alignment of bat CoVs restricted to RBM residues of SARS-CoV Tor2. The residues of YGFY pattern are in medium purple color. Last three alignments are placed together for direct amino acid sequence comparison.
Figure 6.
Sequence alignment of palm civet CoVs and pangolin PCoVs restricted to RBM residues of SARS-CoV Tor2. The residues of YGFY pattern are in medium purple color. Last line includes the SARS-CoV-2 sequence for comparison.
Table 1.
Changes in binding affinity of human SARS-CoV and human SARS-CoV-2 spike RBD complexed with hACE2 upon mutation as predicted by MutaBind2 method. Prediction effects are classified as low-confidence prediction: (1), or high-confidence prediction: (2). Here we adopt the classification presented in MutaBind method (Minghui et al., 2016) because of the high similarity between the ROC (receiver operating characteristic) curves in both methods.
| Effect on the | ||
|---|---|---|
| SARS-CoV | (kcal/mol) | complexation |
| N479E | 0.76 | neutral |
| N479K | 0.66 | neutral |
| N479Q | 0.40 | neutral |
| N479R | 0.08 | neutral |
| N479S | 0.63 | neutral |
| G482A | 1.07 | neutral |
| G482D | 1.85 | highly destabilizing |
| G482V | 1.78 | highly destabilizing |
| G482D, Y484F | 2.40 | highly destabilizing |
| G482D, Y484H | 2.68 | highly destabilizing |
| G482D, Y484N | 3.24 | highly destabilizing |
| G482D, Y484Q | 2.47 | highly destabilizing |
| G482D, Y484T | 3.26 | highly destabilizing |
| Y484F | 0.64 | neutral |
| Y484H | 1.57 | highly destabilizing |
| Y484N | 2.19 | highly destabilizing |
| Y484Q | 1.44 | neutral |
| Y484T | 1.92 | highly destabilizing |
| N479E, G482D, Y484F | 4.58 | highly destabilizing |
| N479E, G482D, Y484H | 4.90 | highly destabilizing |
| N479E, G482D, Y484N | 4.69 | highly destabilizing |
| N479E, G482D, Y484Q | 4.58 | highly destabilizing |
| N479E, G482D, Y484T | 4.67 | highly destabilizing |
| SARS-CoV-2 | ||
| G496D | 0.96 | neutral |
| G496A | 1.10 | neutral |
| G496V | 1.69 | highly destabilizing |
| Q498Y | 0.16 | neutral |
| G496D, Q498F | 2.75 | highly destabilizing |
| G496D, Q498H | 2.52 | highly destabilizing |
| G496D, Q498N | 1.72 | highly destabilizing |
| G496D, Q498T | 1.68 | highly destabilizing |
| Q493S | 0.89 | neutral |
| Q493S, G496D, Q498F | 3.63 | highly destabilizing |
| Q493S, G496D, Q498H | 4.33 | highly destabilizing |
| Q493S, G496D, Q498N | 2.96 | highly destabilizing |
| Q493S, G496D, Q498T | 2.85 | highly destabilizing |
Specific mutations were also performed to investigate the importance of the residues forming the hydrophobic surface patch for the complexation stability. Residues Y481 and F483 are conserved through all SARS-CoV strains we have analyzed in this work, while G482 and Y484 are mutated in some strains (bats and pangolins). Therefore, we mutated G482 and Y484 by plausible residues, i.e. the ones that occur in other strains to evaluated the changes in the binding affinity. To perform this double mutation, we fixed G482D, a mutation observed in bat strains, followed by Y484F, Y484N, and Y484T, observed in bats; Y484H, observed in pangolins; and Y484Q, observed in human SARS-CoV-2. The free-energy changes for these double mutations strongly indicate the desestabilization of human SARS-CoV complexed with hACE2 (Table 1). It is interesting to note that G482D together with Y484Q decreases the binding affinity in human SARS-CoV because they decrease the hydrophobicity of the initial YGFY pocket. This conclusion is supported by the mutant G482D-Y484Q structure obtained by the bioinformatic tool Mutabind2, after a 100 step energy minimization, which does not display steric hindrances between these residues and the ones at the interface with ACE2. Table 1 also displays the changes in binding affinity for single mutations of Y484, a highly connected residue with hACE2 (see Figure 3).
We have also applied the predicting tool Mutabind2 to the single mutations G482A, and G482V to evaluate under which conditions we may obtain disruptive S1–protein–virion interaction. This tool yielded 1.07, and 1.78 kcal/mol, respectively for G482A, and G482V. V842 produces a single steric hindrance at Y436:CZ from spike protein, highlighting the importance of hydrophobicity against steric effects.
Next, we mutated N479 followed by mutations at sites 482 and 484 to analyze the consequences on the binding affinity of this triple mutation by disrupting the hydrophobic surface patch. Again, we fixed, for example, the mutations N479E and G482D. The impacts of this set of mutations can be seen in Table 1. What was considered to be a neutral mutation, N479E shows high destabilizing effect in the new conformational environment. Similar destabilizing effects on the human SARS-CoV spike RBD complexed with hACE2 are obtained for the important residue T487 when one mutates the residues forming the hydrophobic pocket (data not shown).
Now, we repeat the above procedure to investigate the role of residues YGFQ for human SARS-CoV-2 RBD/hACE2 binding affinity. The mutation Q498Y does not alter the binding affinity for the complexation because the predicted is about 0.16 kcal/mol. We also analyzed the impact of double mutations in the YGFQ sequence on the stability of the complexation, see Table 1. To this end, we fixed, for example G496D and replaced Q498 by the residues F, H, N, and T that appear in strains of other species. The predicted changes in binding affinity by mutations indicate destabilization of new complexations.
2.2. Spike receptor-binding motifs in bats
It is known that not all SARS-CoV strains isolated from bat hosts have exploited ACE2 as a cellular attachment. Therefore, the set of amino acid sequences displayed in Figure 5 may exemplify the successful relation between virus evolution and the binding mechanism. This set highlights in medium purple color the preserved amino acid residues in the sequence YGFY, characteristics of human SARS-CoV. For comparison, we also display CoV strains with mutations in that SARS-CoV pattern to explore the relation between the hypothesized mechanism and the cell receptor recognition.
It has been demonstrated that LYRa11 (Letko et al., 2020), Rs3367 (Ge et al., 2013), Rs4874 (Hu et al., 2017), WIV1, and WIV16 (Letko et al., 2020; Yang et al., 2015), have the capacity to use ACE2 for cell entry as well RaTG13, in line with our hypothesis. Also, the near single-point mutation Y → F in the next six strains Rs7327, Rs9401, YN2018B, RsSHC014, Rs4084, and Rs4231, does not interfere, as expected, in the attachment mechanism. This conclusion is supported by cell entry studies for Rs7327 (Hu et al., 2017; Letko et al., 2020), Rs9401, RsSHC014, Rs4084, and Rs4231 (Hu et al., 2017), because they are in a group that is likely to use the ACE2 receptor. This mutation replaces a hydrophobic residue by another one with higher hydrophobicity, reinforcing the conformational topology for binding with the receptor. This single-point mutation Y→F in human SARS-CoV produced a neutral effect in the binding hACE2 (Table 1).
We identified in the next group constituted by BtKY72, BB9904/BGR/2008, and BM48-31/BGR/2008, respectively the mutations Y487T, Y488T, and Y485T, decreasing the initial hydrophobicity of the expected pocket in these strains. It seems unlikely that this mutation and amino acid residue deletions associated to Tor2 RBM sequence affect the YGF-based attachment mechanism for BtKY72 and BB9904/BGR/2008. Unfortunately, there is no available experimental data concerning their receptors. It is important to remark that the residue F492 in BM48-31/BGR/2008 produces another hydrophobic sequence IGF at residues 490–492 (Figure 5). We speculate that this double occurrence may disrupt the aforementioned mechanism because of indications that BM48-31/BGR/2008 does not interact, at least with human ACE2 (Letko et al., 2020). No other GF sequence occurs in the RBD of these strains.
Next CoV strains in Figure 5 do not contain such specific YGF sequences of residues in the RBM neither in their RBD. We find the two-letter sequence GF in Rf1/2004, but it is located in RBD and with GF surrounded by hydrophilic residues. Although we have considered only part of the sequences that better align with RBM of Tor2, it has been demonstrated that the spikes of HuB2013, HKU3, CoVZC45, CoVZXC21, Rf1, Rf4092, and Shaanxi2011 do not use hACE2, a result that is not just a consequence of deletions at the RBD (Letko et al., 2020). Further support has been presented against HKU3 in using hACE2 (Gralinski & Menachery, 2020). It seems unlikely that Rm1/2004 infects hACE2 because its unfavorable binding free energy (Armijos-Jaramillo et al., 2020). Another result concludes that Rp3 is unable of infect hACE2 or even bat ACE2 (Hoffmann et al., 2013).
We have placed together the alignments involving Tor2, RaTG13, and SARS-CoV-2 at the end of Figure 5 for further comparison. The whole genome of RaTG13 shares 96% amino acid sequence identity with SARS-CoV-2, and it is considered the most closely related genome to this CoV (Zhang et al., 2020). Considering its spike protein, and RBM, RaTG13 shares respectively 97% and 76% amino acid identity with SARS-CoV-2. For comparison, RaTG13 shares 79%, 77%, and 53% identity, respectively, for the whole genome, spike protein, and RBM with SARS-CoV Tor2. Therefore, SARS-CoV-2 is mostly similar to RaTG13 than SARS-CoV strains in all regions.
2.3. Spike receptor-binding motifs in palm civets and pangolins
To explore further the role of YGF-based attachment mechanism, we exhibit comparative residue sequences for civet and pangolins, again aligned with RBM of SARS-CoV Tor2 (Figure 6). This figure shows that the pattern YGFY characteristic of human SARS-CoV is maintained for the collected data, but with a single-point mutation Y → H for pangolin hosts PCoV. It is worth to observe that even the shorter two-letter GF sequence is not found in the RBD of these strains, which could promote another hydrophobic pocket.
We have included SARS-CoV-2 on the last line of Figure 6 for a direct comparison. PCoV GX-P2V shares 79%, 77%, and 50% amino acid identity with Tor2, respectively for whole genome, spike protein, and RBM aligned with Tor2. In relation to SARS-CoV-2, PCoV GX-P2V shares 85%, 92%, and 75% amino acid identity, respectively for whole genome, spike protein, and RBM. It is believed that human SARS-CoV passed from palm civets to humans in the 2002–2003 epidemic because their genome sequences are highly similar (Cui et al., 2019; Li, 2008; Qu et al., 2005). The amino acid alignments show an almost identical RBM between human SARS-CoV, represented by Tor2 strain, and collected data from palm civet strains. This identification also includes the YGF-based mechanism able to use ACE2 proteins. Nevertheless, these alignments display high similarity between pangolins and SARS-CoV-2, which also support previous conclusions on pangolins being the probable origin of SARS-CoV-2 (Lam et al., 2020; Zhang et al., 2020). However, based on our data related to host receptor binding and their RBM and S-protein alignments, we cannot discard bat RaTG13-like strain as also the possible origin of SARS-CoV-2.
2.4. SARS-CoV and hCoV-NL63: only functionally related
Although there are no many available experimental data identifying the viral receptor-binding protein for CoVs, it is well established that human SARS-CoV and hCoV-NL63 both employ ACE2 as the cell receptor to infect host cells (Hofmann et al., 2005, 2006). Interestingly, SARS-CoV and hCoV-NL63 domains do not present high sequence similarity. For example, their spike-S1 subunities share only 10% in similarity. Other features separate SARS-CoV and hCoV-NL63 (Milewska et al., 2014). SARS-CoVs are classified as β- coronavirus with subgenus sarbecovirus, while hCoV-NL63 is in genus α-coronavirus and subgenus setracovirus. Although hCoV-NL63 also enters the cell via endocytosis, its functional receptor requires heparan sulfate proteoglycans for the initial attachment, representing an important extra factor for ACE2 to act as a functional receptor (Milewska et al., 2014, 2018). Moreover, the spike-S1 glycoprotein of SARS-CoV binds more efficiently ACE2 than the corresponding spike-S1 of NL63 (NL63-S) (Glowacka et al., 2010). This may be linked to the fact that SARS-CoV and NL63-S contact ACE2 differently, a conclusion based upon the experimental results that NL63-S does not bind to ACE2 through a single and large domain (Hofmann et al., 2006; Wu et al., 2009). Actually, different RBD regions have been identified within NL63-S. One of these regions was positioned at residues 476–616 and comprising three discontinuous RBM regions, RBM1 (residues 497–501), RBM2 (residues 530–540), and RBM3 (residues 575–594) (Li et al., 2007; Lin et al., 2008, 2011). A slightly different RBD has been identified for this CoV (Wu et al., 2009). It would be located at residues 482–602, also with three discontinuous RBM regions, which surround a shallow cavity at hCoV–NL63–ACE2 binding interface. Curiously, its spike protein alignment with Tor2 does not show the expected residue pattern in the corresponding RBM of Tor2 nor in the aforementioned RBD regions of NL63-S. This may help to explain the unusual pathway of binding to ACE2 for this CoV.
3. Methods
The analysis of inter-molecular contacts in the experimentally determined high resolution crystal structures complexed with hACE2 (PDB ID: 2AJF (Li, Farzan, et al., 2005) and PDB ID: 6LZG (Wang et al., 2020)) was performed with the bioinformatic tool Contacts of Structural Units (CSU) (Sobolev et al., 1999).
Single and multiple residue mutations were introduced at positions located on the hydrophobic surface patch to evaluate the importance of these residues in establishing specific interactions with hACE2. Mutations may affect the spike receptor-binding complexed with hACE2 either leading to higher, lower or even neutral binding affinity. We apply the fast and accurate MutaBind2 method (Zhang et al., 2020) to estimate the binding free-energy change upon mutation to predict its functional effects. This method compares free-energy changes between mutated and wild-type three-dimensional conformations. The binding free-energy change upon single mutation was also evaluated with the predictor BeAtMuSiC (Dehouck et al., 2013) based on a set of statistical potentials extracted from experimental mutational data. This computational method predicted very similar effects on the complexation (data not shown) as described in Table 1.
3.1. Sequence alignment
We have also used the bioinformatic tools BLAST and ClustalW for sequence alignment and analysis of CoV strains, and Jalview to examine and edit various sequence alignments. Figures showing the conformational complexations were prepared using PyMol. The list of GenBank accession codes for the spike proteins analyzed in this work is available in Supplementary Table S1.
4. Conclusion
We have analyzed a number of CoV strains to support the hypothesis that SARS-CoV and SARS-CoV-2 strains share a common evolutionary mechanism for the initial attachment to ACE2. Moreover, we speculate that the YGF-based mechanism can act as a protein signature to distinguish CoVs able to use ACE2 as a cell entry receptor whenever this residue sequence is located at the CoV RBM region. For example, bat-SL-CoV ZC45 and ZXC21 are closely related sequences to SARS-CoV-2 with overall genome identity of ∼89% and can be promptly put under suspicious in their ACE2 binding affinity because the lack of such signature. Of course, as exemplified by hCoV-NL63, we cannot discard that another mechanism can act helping such ACE2 binding. It must be accentuated that the occurrence of other XGF sequences, mainly with X being a hydrophobic residue, in the RBM, or even in the RBD region, can disrupt the proposed topological mechanism for ACE2 binding. This because it might introduce hydrophobic loops promoting a new ligand–substrate recognition.
Supplementary Material
Funding Statement
P.P.D.C. and N.A.A. gratefully acknowledge financial supports from the Brazilian agencies CAPES, and FAPESP, process 2015/16116-3, respectively.
Disclosure statement
The authors declare no competing financial interests.
References
- Ahola, T., & Karlin, D. G. (2015). Sequence analysis reveals a conserved extension in the capping enzyme of the alphavirus supergroup, and a homologous domain in nodaviruses. Biology Direct, 10, 16. 10.1186/s13062-015-0050-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Armijos-Jaramillo, V., Yeager, J., Muslin, C., & Perez-Castillo, Y. (2020). SARS-CoV-2, an evolutionary perspective of interaction with human ACE2 reveals undiscovered amino acids necessary for complex stability. Evolutionary Applications, 13, 2168–2178. 10.1111/eva.12980 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baranowski, E., Ruiz-Jarabo, C. M., Pariente, N., Verdaguer, N., & Domingo, E. (2003). Evolution of cell recognition by viruses: A source of biological novelty with medical implications. Advances Virus Research, 62, 19–111. 10.1016/S0065-3527(03)62002-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Belouzard, S., Millet, J. K., Licitra, B. N., & Whittaker, G. R. (2012). Mechanisms of coronavirus cell entry mediated by the viral spike protein. Viruses, 4(6), 1011–1033. 10.3390/v4061011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beniac, D. R., Andonov, A., Grudeski, E., & Booth, T. F. (2006). Architecture of the SARS coronavirus prefusion spike. Nature Structural & Molecular Biology, 13(8), 751–752. 10.1038/nsmb1123 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bertolaet, B. L., Clarke, D. J., Wolff, M., Watson, M. H., Henze, M., Divita, G., & Reed, S. I. (2001). UBA domains mediate protein–protein interactions between two DNA damage-inducible proteins. Journal of Molecular Biology, 313(5), 955–963. 10.1006/jmbi.2001.5105 [DOI] [PubMed] [Google Scholar]
- Cabe, M., Rademacher, D. J., Karlsson, A. B., Cherukuri, S., & Bakowska, J. C. (2018). PB1 and UBA domains of p62 are essential for aggresome-like induced structure formation. Biochemical Biophysics Research & Communication, 503(4), 2306–2311. 10.1016/j.bbrc.2018.06.153 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheng, V. C. C., Lau, S. K. P., Woo, P. C. Y., & Yuen, K. Y. (2007). Severe acute respiratory syndrome coronavirus as an agent of emerging and reemerging infection. Clinical Microbiology Reviews, 20(4), 660–694. 10.1128/CMR.00023-07 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Claverie, J. M. (2006). Viruses take center stage in cellular evolution. Genome Biology, 7(6), 110. 10.1186/gb-2006-7-6-110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cui, J., Li, F., & Shi, Z.-L. (2019). Origin and evolution of pathogenic coronaviruses. Nature Reviews: Microbiology, 17(3), 181–192. 10.1038/s41579-018-0118-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dehouck, Y., Kwasigroch, J. M., Rooman, M., & Gilis, D. (2013). BeAtMuSiC: Prediction of changes in protein–protein binding affinity on mutations. Nucleic Acids Research, 41(Web Server issue), W333–W339. 10.1093/nar/gkt450 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dieckmann, T., Withers-Ward, E. S., Jarosinski, M. A., Liu, C.-F., Chen, I. S. Y., & Feigon, J. (1998). Structure of a human DNA repair protein UBA domain that interacts with HIV-1 Vpr. Nature Structural Biology, 5(12), 1042–1047. 10.1038/4220 [DOI] [PubMed] [Google Scholar]
- Enard, D., Cai, L., Gwennap, C., & Petrov, D. A. (2016). Viruses are a dominant driver of protein adaptation in mammals. eLife, 5, e12469. 10.7554/eLife.12469.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Filée, J., Forterre, P., & Laurent, J. (2003). The role played by viruses in the evolution of their hosts: A view based on informational protein phylogenies. Research in Microbiology, 154(4), 237–243. 10.1016/S0923-2508(03)00066-4 [DOI] [PubMed] [Google Scholar]
- Finley, D. (2009). Recognition and processing of ubiquitin–protein conjugates by the proteasome. Annual Review of Biochemistry, 78, 477–513. 10.1146/annurev.biochem.78.081507.101607 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ge, X.-Y., Li, J.-L., Yang, X.-L., Chmura, A. A., Zhu, G., Epstein, J. H., Mazet, J. K., Hu, B., Zhang, W., Peng, C., Zhang, Y.-J., Luo, C.-M., Tan, B., Wang, N., Zhu, Y., Crameri, G., Zhang, S.-Y., Wang, L.-F., Daszak, P., & Shi, Z.-L. (2013). Isolation and characterization of a bat SARS-like coronavirus that uses the ACE2 receptor. Nature, 503(7477), 535–538. https://doi:10.1038/nature12711 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Geetha, T., & Wooten, M. W. (2002). Structure and functional properties of the ubiquitin binding protein p62. FEBS Letters, 512(1–3), 19–24. 10.1016/S0014-5793(02)02286-X [DOI] [PubMed] [Google Scholar]
- Glowacka, I., Bertram, S., Herzog, P., Pfefferle, S., Steffen, I., Muench, M. O., Simmons, G., Hofmann, H., Kuri, T., Weber, F., Eichler, J., Drosten, C., & Pöhlmann, S. (2010). Differential downregulation of ACE2 by the spike proteins of severe acute respiratory syndrome coronavirus and human coronavirus NL63. Journal of Virology, 84(2), 1198–1205. https://doi:10.1128/JVI.01248-09 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gorbalenya, A. E., Baker, S. C., Baric, R. S., de Groot, R. J., Drosten, C., & Gulyaeva, A. A. (2020). The species Severe acute respiratory syndrome-related coronavirus: Classifying 2019-nCoV and naming it SARS-CoV-2. Nature Microbiology, 5, 536–544. 10.1038/s41564-020-0695-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gralinski, L. E., & Menachery, V. D. (2020). Return of the Coronavirus: 2019-nCoV. Viruses, 12, 135. https://doi:10.3390/v12020135 [DOI] [PMC free article] [PubMed] [Google Scholar]
- He, J., Tao, H., Yan, Y., Huang, S.-Y., & Xiao, Y. (2020). Molecular mechanism of evolution and human infection with SARS-CoV-2. Viruses, 12, 428. https:// 10.3390/v12040428 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoffmann, M., Kleine-Weber, H., Schroeder, S., Krüger, N., Herrler, T., Erichsen, S., Schiergens, T. S., Herrler, G., Wu, N.-H., Nitsche, A., Müller, M. A., Drosten, C., & Pöhlmann, S. (2020). SARS-CoV-2 cell entry depends on ACE2 and TMPRSS2 and is blocked by a clinically proven protease inhibitor. Cell, 181(2), 271–280. 10.1016/j.cell.2020.02.052 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoffmann, M., Müller, M. A., Drexler, J. F., Glende, J., Erdt, M., Gützkow, T., Losemann, C., Binger, T., Deng, H., Schwegmann-Weßels, C., Esser, K.-H., Drosten, C., & Herrler, G. (2013). Differential sensitivity of bat cells to infection by enveloped RNA viruses: Coronaviruses, paramyxoviruses, filoviruses, and influenza viruses. PLoS ONE, 8(8), e72942. https://doi:10.1371/journal.pone.0072942 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hofmann, H., Pyrc, K., van der Hoek, L., Geier, M., Berkhout, B., & Pöhlmann, S. (2005). Human coronavirus NL63 employs the severe acute respiratory syndrome coronavirus receptor for cellular entry. Proceedings of the National Academy of Sciences of the United States of America, 102(22), 7988–7993. https://doi:10.1073/pnas.0409465102 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hofmann, H., Simmons, G., Rennekamp, A. J., Chaipan, C., Gramberg, T., Heck, E., Geier, M., Wegele, A., Marzi, A., Bates, P., & Pöhlmann, S. (2006). Highly conserved regions within the spike proteins of human coronaviruses 229E and NL63 determine recognition of their respective cellular receptors. Journal of Virology, 80(17), 8639–8652. https://doi:10.1128/JVI.00560-06 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hu, B., Zeng, L.-P., Yang, X.-L., Ge, X.-Y., Zhang, W., Li, B., Xie, J.-Z., Shen, X.-R., Zhang, Y.-Z., Wang, N., Luo, D.-S., Zheng, X.-S., Wang, M.-N., Daszak, P., Wang, L.-F., Cui, J., & Shi, Z.-L. (2017). Discovery of a rich gene pool of bat SARS-related coronaviruses provides new insights into the origin of SARS coronavirus. PLoS Pathogens, 13(11), e1006698. 10.1371/journal.ppat.1006698 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jaimes, J. A., André, N. M., Chappie, J. S., Millet, J. K., & Whittaker, G. R. (2020). Phylogenetic analysis and structural modeling of SARS-CoV-2 spike protein reveals an evolutionary distinct and proteolytically sensitive activation loop. Journal of Molecular Biology, 432(10), 3309–3325. 10.1016/j.jmb.2020.04.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Koonin, E. V., & Dolja, V. V. (2013). A virocentric perspective on the evolution of life. Current Opinion in Virology, 3(5), 546–557. 10.1016/j.coviro.2013.06.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Koonin, E. V., Dolja, V. V., Krupovic, M., Varsani, A., Wolf, Y. I., Yutin, N., Zerbini, F. M., & Kuhn, J. H. (2020). Global organization and proposed megataxonomy of the virus world. Microbiology & Molecular Biology Reviews: MMBR, 84(2), e00061–19. 10.1128/MMBR.00061-19 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lam, T. T.-Y., Jia, N., Zhang, Y.-W., Shum, M. H.-H., Jiang, J.-F., Zhu, H.-C., Tong, Y.-G., Shi, Y.-X., Ni, X.-B., Liao, Y.-S., Li, W.-J., Jiang, B.-G., Wei, W., Yuan, T.-T., Zheng, K., Cui, X.-M., Li, J., Pei, G.-Q., Qiang, X., … Cao, W.-C. (2020). Identifying SARS-CoV-2-related coronaviruses in Malayan pangolins. Nature, 583(7815), 282–285. https://doi:10.1038/s41586-020-2169-0 [DOI] [PubMed] [Google Scholar]
- Lan, J., Ge, J., Yu, J., Shan, S., Zhou, H., Fan, S., Zhang, Q., Shi, X., Wang, Q., Zhang, L., & Wang, X. (2020). Structure of the SARS-CoV-2 spike receptor-binding domain bound to the ACE2 receptor. Nature, 581(7807), 215–228. 10.1038/s41586-020-2180-5 [DOI] [PubMed] [Google Scholar]
- Lau, S. K. P., Woo, P. C. Y., Li, K. S. M., Huang, Y., Wang, M., Lam, C. S. F., Xu, H., Guo, R., Chan, K.-H., Zheng, B.-J., & Yuen, K.-Y. (2007). Complete genome sequence of bat coronavirus HKU2 from Chinese horseshoe bats revealed a much smaller spike gene with a different evolutionary lineage from the rest of the genome. Virology, 367(2), 428–439. 10.1016/j.virol.2007.06.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Letko, M., Marzi, A., & Munster, V. (2020). Functional assessment of cell entry and receptor usage for SARS-CoV-2 and other lineage B betacoronaviruses. Nature Microbiology, 5(4), 562–569. 10.1038/s41564-020-0688-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li, F. (2008). Structural analysis of major species barriers between humans and palm civets for severe acute respiratory syndrome coronavirus infections. Journal of Virology, 82(14), 6984–6991. https:// 10.1128/JVI.00442-08 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li, F. (2012). Evidence for a common evolutionary origin of coronavirus spike protein receptor-binding subunits. Journal of Virology, 86(5), 2856–2858. 10.1128/JVI.06882-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li, F. (2015). Receptor recognition mechanisms of coronaviruses: A decade of structural studies. Journal of Virology, 89(4), 1954–1964. 10.1128/JVI.02615-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li, F. (2016). Structure, function, and evolution of coronavirus spike proteins. Annual Review of Virology, 3(1), 237–261. 10.1146/annurev-virology-110615-042301 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li, F., Li, W., Farzan, M., & Harrison, S. C. (2005). Structure of SARS coronavirus spike receptor-binding domain complexed with receptor. Science (New York, NY), 309(5742), 1864–1868. 10.1126/science.1113611 [DOI] [PubMed] [Google Scholar]
- Li, W., Sui, J., Huang, I.-C., Kuhn, J. H., Radoshitzky, S. R., Marasco, W. A., Choe, H., & Farzan, M. (2007). The S proteins of human coronavirus NL63 and severe acute respiratory syndrome coronavirus bind overlapping regions of ACE2. Virology, 367(2), 367–374. https://doi:10.1016/j.virol.2007.04.035 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li, W., Zhang, C., Sui, J., Kuhn, J. H., Moore, M. J., Luo, S., Wong, S.-K., Huang, I.-C., Xu, K., Vasilieva, N., Murakami, A., He, Y., Marasco, W. A., Guan, Y., Choe, H., & Farzan, M. (2005). Receptor and viral determinants of SARS-coronavirus adaptation to human ACE2. The EMBO Journal, 24(8), 1634–1643. 10.1038/sj.emboj.7600640 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin, H.-X., Feng, Y., Tu, X., Zhao, X., Hsieh, C.-H., Griffin, L., Junop, M., & Zhang, C. (2011). Characterization of the spike protein of human coronavirus NL63 in receptor binding and pseudotype virus entry. Virus Research, 160(1–2), 283–293. https://doi:10.1016/j.virusres.2011.06.029 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin, H.-X., Feng, Y., Wong, G., Wang, L., Li, B., Zhao, X., Li, Y., Smaill, F., & Zhang, C. (2008). Identification of residues in the receptor-binding domain (RBD) of the spike protein of human coronavirus NL63 that are critical for the RBD–ACE2 receptor interaction. The Journal of General Virology, 89(Pt 4), 1015–1024. https://doi:10.1099/vir.0.83331-0 [DOI] [PubMed] [Google Scholar]
- Long, J., Gallagher, T. R. A., Cavey, J. R., Sheppard, P. W., Ralston, S. H., Layfield, R., & Searle, M. S. (2008). Ubiquitin recognition by the ubiquitin-associated domain of p62 involves a novel conformational switch. The Journal of Biological Chemistry, 283(9), 5427–5440. 10.1074/jbc.M704973200 [DOI] [PubMed] [Google Scholar]
- Madura, K. (2002). The ubiquitin-associated (UBA) domain: On the path from prudence to prurience. Cell Cycle (Georgetown, Tex.), 1(4), 235–244. 10.4161/cc.1.4.130 [DOI] [PubMed] [Google Scholar]
- Masters, P. S. (2006). The molecular biology of coronaviruses. Advances in Virus Research, 66, 193–292. 10.1016/S0065-3527(06)66005-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Milewska, A., Nowak, P., Owczarek, K., Szczepanski, A., Zarebski, M., Hoang, A., Berniak, K., Wojarski, J., Zeglen, S., Baster, Z., Rajfur, Z., & Pyrc, K. (2018). Entry of human coronavirus NL63 into the cell. Journal of Virology, 92(3), e01933–17. 10.1128/JVI.01933-17 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Milewska, A., Zarebski, M., Nowak, P., Stozek, K., Potempa, J., & Pyrc, K. (2014). Human coronavirus NL63 utilizes heparan sulfate proteoglycans for attachment to target cells. Journal of Virology, 88(22), 13221–13230. 10.1128/JVI.02078-14 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Minghui, L., Simonetti, F. L., Goncearenco, A., & Panchenko, A. R. (2016). MutaBind estimates and interprets theeffects of sequence variants on protein–protein interactions. Nucleic Acids Research, 44(W1), W494–W501. 10.1093/nar/gkw374 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mueller, T. D., & Feigon, J. (2002). Solution structures of UBA domains reveal a conserved hydrophobic surface for protein–protein interactions. Journal of Molecular Biology, 319(5), 1243–1255. 10.1016/S0022-2836(02)00302-9 [DOI] [PubMed] [Google Scholar]
- Nasir, A., Forterre, P., Kim, K. M., & Caetano-Anollés, G. (2014). The distribution and impact of viral lineages in domains of life. Frontiers in Microbiology, 5, 194. 10.3389/fmicb.2014.00194 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ng, W. M., Stelfox, A. J., & Bowden, T. A. (2020). Unraveling virus relationships by structure-based phylogenetic classification. Virus Evolution, 6, veaa003. 10.1093/ve/veaa003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ohno, A., Jee, J. G., Fujiwara, K., Tenno, T., Goda, N., Tochio, H., Kobayashi, H., Hiroaki, H., & Shirakawa, M. (2005). Structure of the UBA domain of Dsk2p in complex with ubiquitin molecular determinants for ubiquitin recognition. Structure (London, England: 1993), 13(4), 521–532. 10.1016/j.str.2005.01.011 [DOI] [PubMed] [Google Scholar]
- Ou, X., Liu, Y., Lei, X., Li, P., Mi, D., & Ren, L. (2020). Characterization of spike glycoprotein of SARS-CoV-2 on virus entry and its immune cross-reactivity with SARS-CoV. Nature Communications, 11, 2020. 10.1038/s41467-020-15562-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pan, D., Nolan, J., Williams, K. H., Robbins, M. J., & Weber, K. A. (2017). Abundance and distribution of microbial cells and viruses in an alluvial aquifer. Frontiers in Microbiology, 8, 1199. 10.3389/fmicb.2017.01199 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pickart, C. M., & Cohen, R. E. (2004). Proteasomes and their kin: Proteases in the machine age. Nature Reviews. Molecular Cell Biology, 5(3), 177–187. 10.1038/nrm1336 [DOI] [PubMed] [Google Scholar]
- Qu, X.-X., Hao, P., Song, X.-J., Jiang, S.-M., Liu, Y.-X., Wang, P.-G., Rao, X., Song, H.-D., Wang, S.-Y., Zuo, Y., Zheng, A.-H., Luo, M., Wang, H.-L., Deng, F., Wang, H.-Z., Hu, Z.-H., Ding, M.-X., Zhao, G.-P., & Deng, H.-K. (2005). Identification of two critical amino acid residues of the severe acute respiratory syndrome coronavirus spike protein for its variation in zoonotic tropism transition via a double substitution strategy. The Journal of Biological Chemistry, 280(33), 29588–29595. https://doi:10.1074/jbc.M500662200 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Raasi, S., Varadan, R., Fushman, D., & Pickart, C. M. (2005). Diverse polyubiquitin interaction properties of ubiquitin-associated domains. Nature Structural & Molecular Biology, 12(8), 708–714. 10.1038/nsmb962 [DOI] [PubMed] [Google Scholar]
- Sanjuán, R., & Domingo-Calap, P. (2016). Mechanisms of viral mutation. Cellular & Molecular Life Sciences: CMLS, 73(23), 4433–4448. 10.1007/s00018-016-2299-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sano, E., Carlson, S., Wegley, L., & Rohwer, F. (2004). Movement of viruses between biomes. Applied Environmental Microbiology, 70(10), 5842–5846. 10.1128/AEM.70.10.5842-5846.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shi, M., Lin, X.-D., Chen, X., Tian, J.-H., Chen, L.-J., Li, K., Wang, W., Eden, J.-S., Shen, J.-J., Liu, L., Holmes, E. C., & Zhang, Y.-Z. (2018). The evolutionary history of vertebrate RNA viruses. Nature, 556(7700), 197–202. 10.1038/s41586-018-0012-7 [DOI] [PubMed] [Google Scholar]
- Sobolev, V., Sorokine, A., Prilusky, J., Abola, E. E., & Edelman, M. (1999). Automated analysis of interatomic contacts in proteins. Bioinformatics (Oxford, England), 15(4), 327–332. 10.1093/bioinformatics/15.4.327 [DOI] [PubMed] [Google Scholar]
- Song, W., Gui, M., Wang, X., & Xiang, Y. (2018). Cryo-EM structure of the SARS coronavirus spike glycoprotein in complex with its host cell receptor ACE2. PLoS Pathogens, 14(8), e1007236. 10.1371/journal.ppat.1007236 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Su, S., Wong, G., Shi, W., Liu, J., Lai, A. C. K., Zhou, J., Liu, W., Bi, Y., & Gao, G. F. (2016). Epidemiology, genetic recombination, and pathogenesis of coronaviruses. Trends in Microbiology, 24(6), 490–502. 10.1016/j.tim.2016.03.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tse, M. K., Hui, S. K., Yang, Y., Yin, S.-T., Hu, H.-Y., Zou, B., Wong, B. C. Y., & Sze, K. H. (2011). Structural analysis of the UBA domain of X-linked inhibitor of apoptosis protein reveals different surfaces for ubiquitin-binding and self-association. PLoS ONE, 6(12), e28511. 10.1371/journal.pone.0028511 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Van Blerkom, L. M. (2003). Role of viruses in human evolution. American Journal of Physical Anthropology, Suppl 37, 14–46. 10.1002/ajpa.10384 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Walls, A. C., Park, Y. J., Tortorici, M. A., Wall, A., McGuire, A. T., & Veesler, D. (2020). Structure, function, and antigenicity of the SARS-CoV-2 spike glycoprotein. Cell, 181(2), 281–292. 10.1016/j.cell.2020.02.058 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wan, Y., Shang, J., Graham, R., Baric, R. S., & Li, F. (2020). Receptor recognition by the novel coronavirus from Wuhan: An analysis based on decade-long structural studies of SARS coronavirus. Journal of Virology, 94, e00127–20. 10.1128/JVI.00127-20 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang, H., Yang, P., Liu, K., Guo, F., Zhang, Y., Zhang, G., & Jiang, C. (2008). SARS coronavirus entry into host cells through a novel clathrin- and caveolae-independent endocytic pathway. Cell Research, 18(2), 290–301. 10.1038/cr.2008.15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang, Q., Zhang, Y., Wu, L., Niu, S., Song, C., & Zhang, Z. (2020). Structural and functional basis of SARS-CoV-2 entry by using human ACE2. Cell, 181, 1â11. 10.1016/j.cell.2020.03.045 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang, S., Guo, F., Liu, K., Wang, H., Rao, S., Yang, P., & Jiang, C. (2008). Endocytosis of the receptor-binding domain of SARS-CoV spike protein together with virus receptor ACE2. Virus Research, 136(1–2), 8–15. 10.1016/j.virusres.2008.03.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weiss, S. R., & Navas-Martin, S. (2005). Coronavirus pathogenesis and the emerging pathogen severe acute respiratory syndrome coronavirus. Microbiology & Molecular Biology Reviews: MMBR, 69(4), 635–664. 10.1128/MMBR.69.4.635-664.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wilkinson, C. R., Seeger, M., Hartmann-Petersen, R., Stone, M., Wallace, M., Semple, C., & Gordon, C. (2001). Proteins containing the UBA domain are able to bind to multi-ubiquitin chains. Nature Cell Biology, 3(10), 939–943. 10.1038/ncb1001-939 [DOI] [PubMed] [Google Scholar]
- Wimley, W. C., & White, S. H. (1996). Experimentally determined hydrophobicity scale for proteins at membrane interfaces. Nature Structural Biology, 3(10), 842–848. 10.1038/nsb1096-842 [DOI] [PubMed] [Google Scholar]
- Woo, P. C. Y., Lau, S. K. P., Lam, C. S. F., Lau, C. C. Y., Tsang, A. K. L., Lau, J. H. N., Bai, R., Teng, J. L. L., Tsang, C. C. C., Wang, M., Zheng, B.-J., Chan, K.-H., & Yuen, K.-Y. (2012). Discovery of seven novel mammalian and avian coronaviruses in the genus deltacoronavirus supports bat coronaviruses as the gene source of alphacoronavirus and betacoronavirus and avian coronaviruses as the gene source of gammacoronavirus and deltacoronavirus. Journal of Virology, 86(7), 3995–4008. 10.1128/JVI.06540-11 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu, K., Li, W., Peng, G., & Li, F. (2009). Crystal structure of NL63 respiratory coronavirus receptor-binding domain complexed with its human receptor. Proceedings of the National Academy of Sciences of the United States of America, 106(47), 19970–19974. 10.1073/pnas.0908837106 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu, K., Peng, G., Wilken, M., Geraghty, R. J., & Li, F. (2012). Mechanisms of host receptor adaptation by severe acute respiratory syndrome coronavirus. The Journal of Biological Chemistry, 287(12), 8904–8911. 10.1074/jbc.M111.325803 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang, X.-L., Hu, B., Wang, B., Wang, M.-N., Zhang, Q., Zhang, W., Wu, L.-J., Ge, X.-Y., Zhang, Y.-Z., Daszak, P., Wang, L.-F., & Shi, Z.-L. (2015). Isolation and characterization of a novel bat coronavirus closely related to the direct progenitor of severe acute respiratory syndrome coronavirus. Journal of Virology, 90(6), 3253–3256. https://doi:10.1128/JVI.02582-15 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yang, Y., Du, L., Liu, C., Wang, L., Ma, C., Tang, J., Baric, R. S., Jiang, S., & Li, F. (2014). Receptor usage and cell entry of bat coronavirus HKU4 provide insight into bat-to-human transmission of MERS coronavirus. Proceedings of the National Academy of Sciences of the United States of America, 111(34), 12516–12521. http://www.pnas.org/cgi/doi/10.1073/pnas.1405889111 10.1073/pnas.1405889111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yi, C., Sun, X., Ye, J., Ding, L., Liu, M., Yang, Z., Lu, X., Zhang, Y., Ma, L., Gu, W., Qu, A., Xu, J., Shi, Z., Ling, Z., & Sun, B. (2020). Key residues of the receptor binding motif in the spike protein of SARS-CoV-2 that interact with ACE2 and neutralizing antibodies. Cellular & Molecular Immunology, 17(6), 621–630. 10.1038/s41423-020-0458-z [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yuan, Y., Cao, D., Zhang, Y., Ma, J., Qi, J., & Wang, Q. (2017). Cryo-EM structures of MERS-CoV and SARS-CoV spike glycoproteins reveal the dynamic receptor binding domains. Nature Communications, 8, 15092. 10.1038/ncomms15092 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang, N., Chen, Y., Lu, H., Zhao, F., Alvarez, R. V., Goncearenco, A., Panchenko, A. R., & Li, M. (2020). MutaBind2: Predicting the impacts of single and multiple mutations on protein–protein interactions. iScience, 23(3), 100939. 10.1016/j.isci.2020.100939 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang, T., Wu, Q., & Zhang, Z. (2020). Probable pangolin origin of SARS-CoV-2 associated with the COVID-19 outbreak. Currents Biology, 30(7), 1346–1351. 10.1016/j.cub.2020.03.022 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.






