Abstract
The recent emergence of the pathogen severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), the etiological agent for the coronavirus disease 2019 (COVID-19), is causing a global pandemic that poses enormous challenges to global public health and economies. SARS-CoV-2 host cell entry is mediated by the interaction of the viral transmembrane spike glycoprotein (S-protein) with the angiotensin-converting enzyme 2 gene (ACE2), an essential counter-regulatory carboxypeptidase of the renin-angiotensin hormone system that is a critical regulator of blood volume, systemic vascular resistance, and thus cardiovascular homeostasis. Accordingly, this work reports an atomistic-based, reliable in silico structural and energetic framework of the interactions between the receptor-binding domain of the SARS-CoV-2 S-protein and its host cellular receptor ACE2 that provides qualitative and quantitative insights into the main molecular determinants in virus/receptor recognition. In particular, residues D38, K31, E37, K353, and Y41 on ACE2 and Q498, T500, and R403 on the SARS-CoV-2 S-protein receptor-binding domain are determined as true hot spots, contributing to shaping and determining the stability of the relevant protein–protein interface. Overall, these results could be used to estimate the binding affinity of the viral protein to different allelic variants of ACE2 receptors discovered in COVID-19 patients and for the effective structure-based design and development of neutralizing antibodies, vaccines, and protein/protein inhibitors against this terrible new coronavirus.
Keywords: SARS-CoV-2 spike protein, ACE2, receptor-binding domain, molecular dynamics, computational alanine-scanning mutagenesis, molecular mechanics/Poisson−Boltzmann surface area, free energy of binding
In December 2019, a previously unidentified severe acute respiratory syndrome (SARS) coronavirus (CoV)—named SARS-CoV-2—was discovered and was isolated and sequenced by January 2020 in Wuhan, Hubei province of China.1,2 This virus is currently associated with an ongoing epidemic of atypical pneumonia (Coronavirus disease 19, or COVID-19) that, as of June 2, 2020, has affected almost 6,300,000 people and claimed more than 380,000 lives around the world.3 On March 11, 2020, the World Health Organization (WHO) declared the SARS-CoV-2 pandemic a public health emergency of international concern; as a consequence, at the time of writing well over 100 countries worldwide have just started to lift full or partial lockdowns, economically and personally affecting billions of people. Italy, home of the present authors, was one of the most affected countries in the world, with more than 233,000 cases and 33,600 deaths.4
CoVs are a group of large and enveloped viruses with a positive-sense, single-strand RNA genome classified into four genera (α, β, γ, and δ).5 Before SARS-CoV-2, only two other members of this pathogen family—also belonging to the beta genus—have crossed the species barrier to cause lethal pneumonia in humans: the severe acute respiratory syndrome coronavirus (SARS-CoV) and the Middle East respiratory syndrome coronavirus (MERS-CoV).6 SARS-CoV also initially emerged in China (Guangdong province) in 2002–2003 and swiftly spread via air-travel routes over five continents, globally infecting more than 8000 people, 10% of which with fatal exitus. Nearly 10 years later, MERS-CoV emerged in the Arabian Peninsula, infecting a substantially smaller number of individuals (∼2500) yet claiming 858 lives.6
Both SARS-CoV and MERS-CoV are zoonotic pathogens originating from animals.7 Whereas these two viruses were suggested to originate from bats, the reservoir host fueling their spillovers to humans was determined to be palm civets and dromedary camels, respectively. On the other hand, the source of SARS-CoV-2 is currently an issue of active debate, with bats and/or Malayan pangolins possibly serving as reservoir hosts for this new CoV.8 Independently of their original source, the recurrent human infections by coronavirus pathogens, including the four low pathogenicity, human-endemic HCoV-OC43, HCoV-HKU1, HCoV-NL63, and HCoV-229E,9 strongly suggest that future zoonotic transmission events of these large-genome viruses may continue.10 Unfortunately, notwithstanding this gloomy perspective, no therapeutic option or vaccines have been approved against any human-infecting coronaviruses to date.
The transmembrane spike glycoprotein (S-protein) that forms homotrimers protruding from the viral surface plays the fundamental role of assisting CoV pathogen entry into host cells. This complex process initially involves S-protein/host receptor binding followed by proteolytic processing of the viral glycoprotein to promote virus–cell membrane fusion.11 For SARS-CoV and SARS-CoV-2, a well-characterized S-protein region—the receptor-binding domain (S-RBD)—specifically recognizes angiotensin-converting enzyme 2 (ACE2) as its cellular receptor,12−14 and it is now well-established that host susceptibility to SARS-CoV/CoV-2 is primarily determined by the binding affinity of the viral S-RBD for ACE2 during the initial viral attachment step.14−19
To date, computer modeling of the interaction between the S-RBD of SARS-CoV-2 (S-RBDCoV-2) and ACE2 has identified some residues potentially involved in the interaction; however, the actual interactions have not been investigated in detail.20 In particular, although the structure of the S-RBDCoV-2 in complex with ACE2 has been repeatedly solved by different groups using both X-ray diffraction and cryo-TEM techniques,14−16,18,19 thus providing a comprehensive view of the interaction between the viral spike protein binding domain and its receptor, many questions still await an answer; for example, what is the relative energetic importance of the S-RBDCoV-2/ACE2 contacts? Are there a few hot spot residues on the viral protein or its receptor for these interactions? In this context, for instance, while we were preparing this work, Han and Kral published in this same journal an interesting in silico work in which they proposed some peptide inhibitors whose sequences were directly extracted from the ACE2 α-helical domain involved in S-RBDCoV-2 binding.21 These authors predicted these molecules could be efficient blockers of the viral protein–receptor interaction based on the average interaction energy between each inhibitor and the S-RBDCoV-2. Based on our own long-standing experience in the field of computational protein/protein and protein/ligand interactions,22−42 we believe that detailed structural/energetic information at each single residue would greatly improve our understanding of the binding between the spike of SARS-CoV-2 and its cellular receptor. Accordingly, here, we report the results obtained from a combined computational alanine-scanning interaction entropy method43−46 to compute residue-specific ACE and S-RBDCoV-2 binding free energy at their protein/protein interface. The data thus obtained allowed for a complete structural characterization of both intermolecular and intramolecular interaction networks contributing to S-RBDCoV-2/ACE2 binding under physiological solution-mimicking conditions and the identification of hot spot residues playing a major role in shaping and stabilizing the SARS-CoV-2 S-protein/receptor interface, as discussed below.
Results and Discussion
According to the recent X-ray/cryo-TEM evidence, the superposition of ACE2 alone47 and in complex with S-RBDCoV-2 clearly shows that binding of the S-protein does not induce any conformational change in the relevant receptor-binding site.14,18,19 Moreover, the ACE2-bound S-RBDCoV-2 also preserves the same conformation it adopts when the full spike protein assembles into its native trimeric form.16,17 Briefly, the core structure of S-RBDCoV-2 is composed of twisted five-stranded antiparallel β-sheets (β1, β2, β3, β4, and β7), connected by short helices and flexible loops. Strands β4 and β7 are spaced by α-helices (α4 and α5), short strands (β5 and β6), and coils. This particular sequence of S-RBDCoV-2 domain features most of the SARS-CoV-2 residues contacting ACE2 for binding (Figure 1). On the receptor side, the N-terminal domain of ACE2 presents two lobes, the S-RBDCoV-2 contacting the bottom side of the smaller lobe, with a concave outer surface accommodating the N-terminal helix of the receptor (Figure 1).
Within a distance cutoff of 4.0 Å, the analysis of the equilibrated molecular dynamics (MD) trajectory (Figure S1) of ACE2 in complex with the S-protein RBD of SARS-CoV-2 shows that a total of 14 residues of S-RBDCoV-2 stably and effectively contact 19 residues of the receptor (Figure 1 and Tables S1 and S2). One important feature at the S-RBDCoV-2/ACE2 interface is the number of hydrophilic interactions detected in the relevant crystal structures,14,18 which are conserved in the corresponding solution simulation. Indeed, according to the present study, 14 hydrogen bonds (HBs) and two salt bridges (SBs) stably populate the S-RBDCoV-2/ACE2 interface (Tables S1 and S2).
As atomistic structural and energetic information greatly improve our understanding of the interaction between the viral S-protein RBD and its cellular ACE2 receptor, providing fundamental indications about important targets for the design of neutralizing antibodies and/or structure-based vaccine design is urgently needed in the open fight against this viral spread, with a combined description of all topical protein/protein interactions along with the corresponding energetic quantification (Figure 2 and Tables S3 and S4) being reported and discussed in detail.
Analysis of the ACE2 Residues at the Binding Interface with the S-RBD SARS-CoV-2
Q24
ACE2 residue Q24 locates at one of the extremes of the binding interface between the receptor and the SARS-CoV-2 S-RBD; as such, any intermolecular interaction involving Q24 and the viral S-protein could be important in eventually anchoring the entire superstructure. The MD trajectory of the S-RBDCoV-2/ACE2 complex shows that ACE2 Q24 H-bonds the viral protein residue N487 (3.03 ± 0.18 Å) while involving two other residues G476 and Y489 in weaker contact interactions (CIs) (Figure 3, top). When ACE2 Q24 is replaced with alanine, these interface-stabilizing interactions—along with the slightly beneficial contribution from the intramolecular van der Waals contact with Y83—are no longer made, reflecting a loss of the corresponding binding free energy of ΔΔGACE2(Q24A) = −2.61 ± 0.17 kcal/mol (Figure 2A).
T27
At the interface between ACE2 and S-RBDCoV-2, the receptor residue T27 stabilizes the protein–protein complex via an internal HB with E23 (3.05 ± 0.16 Å) and through hydrophobic/van der Waals interactions with the side chains of F456, Y473, and Y489 of S-RBDCoV-2 (Figure 3, bottom). Replacing this receptor residue with alanine is unfavorable and produces a negative variation of the binding free energy (Figure 2A) equal to ΔΔGACE2(T27A) = −2.23 ± 0.19 kcal/mol.
Y83, M82, L79, and F28
Residue Y83 of ACE2 is part of a large hydrophobic pocket surrounded by the side chains of M82, L79, L97, F28, Q76, and L29, with which it makes a number of stabilizing intramolecular CIs in the viral S-protein/receptor complex along the entire relevant MD trajectory (Figure 4, top). Moreover, Y83 exchanges a strong intermolecular HB with N487 (2.88 ± 0.17 Å), further supported by polar and dispersive CIs with Y489 and F486 (Figure 4, top). In line with these interactions, the calculated variation in binding free energy for mutating Y into A at position 83 of ACE2 is equal to ΔΔGACE2(Y83A) = −3.18 ± 0.20 kcal/mol (Figure 2A). In the same context, the fact that ACE2 residues F28, M82, and L79 afford only a weak network of stabilizing intra/intermolecular CIs to this interface region (Figure 4, top) is confirmed by the calculated ΔΔG values obtained by changing each of these amino acids into alanine in the receptor/S-RBDCoV-2 complex, that is, ΔΔGACE2(M82A) = −0.76 ± 0.12 kcal/mol, ΔΔGACE2(L79A) = −1.04 ± 0.16 kcal/mol, and ΔΔGACE2(F28A) = −0.98 ± 0.08 kcal/mol (Figure 2A).
K31 and E35
In the solved crystal structure of the S-RBDCoV-2/ACE2 complex, K31 is seen in a strain-free, extended conformation at the protein–protein interface where it forms a stable, strong, and charge-neutralizing internal SB with E35. The MD trajectory of the corresponding complex further reveals that the K31–E35 salt bridge equilibrates at a length of 3.94 ± 0.42 Å, allowing for further stabilization via the formation of a strong bifurcate HB of S-RBDCoV-2 Q493 with both residues (3.04 ± 0.25 and 2.94 ± 0.19 Å, respectively). Also, the long side chain of K31 is involved in an extensive network of hydrophobic and van der Waals interactions, which includes the side chains of S-RBDCoV-2 residues L455, F456, and Y489 (Figure 4, bottom). In line, the predicted changes in binding free energy for replacing ACE2 K31 and E35 with alanine in this complex—ΔΔGACE2(K31A) = −4.85 ± 0.14 kcal/mol, ΔΔGACE2(E35A) = −2.89 ± 0.15 kcal/mol (Figure 2A)—support the prominent contribution afforded by these two residues and by K31, in particular, at the receptor/viral protein binding interface.
D30 and H34
ACE2 residue D30 plays an important role in shaping the relevant S-RBDCoV-2/receptor interface as, during the entire MD simulation, it forms an important SB with the viral protein residue K417 (3.85 ± 0.41 Å), intermolecular van der Waals interactions with F456 and L455, and a permanent intramolecular HB with ACE2 H34 (3.31 ± 0.18 Å) (Figure 4, bottom). This explains the considerable loss of binding free energy predicted upon mutating D30 into alanine within the S-RBDCoV-2/ACE2 complex (Figure 2A), that is, ΔΔGACE2(D30A) = 3.89 ± 0.12 kcal/mol. Interestingly, although ACE2 residue H34 is not directly involved in any intermolecular SB/HB with the viral protein, it nonetheless provides favorable polar/dispersive contacts with the side chains of Y453 and L455 of S-RBDCoV-2 (Figure 4, bottom). The absence of these CIs when H34 is mutated into alanine reflects the moderate variations of the corresponding free energy of binding (Figure 2A), that is, ΔΔGACE2(H34A) = −1.77 ± 0.09 kcal/mol.
K353, D38, and Q42
Residue K353 is an indisputable hot spot for the binding of the viral S-proteins to their human receptor. For K31, the side chain of this residue protrudes into the protein–protein interface where, by adopting a strain-free, energetically favorable conformation stabilized by an important charge-neutralizing intramolecular SB with D38 (3.38 ± 0.29 and 3.66 ± 0.39 Å, Figure 5, top), it exchanges a plethora of topical intermolecular interactions persisting through the entire time span (1 μs) of the MD simulation. In particular, the extended conformation of the K353 side chain is anchored in place by three effective intermolecular HBs—two exchanged with the backbone of G502 (2.92 ± 0.13 Å) and G496 (2.95 ± 0.21 Å) and the last with the side chain of Q498 (2.87 ± 0.13 Å)—whereas polar and van der Waals CIs with N501, Y505, and Y41 yield additional intermolecular stabilizing contacts (Figure 5, top). Removing all of these interactions by replacing the side chain of K353 with alanine results in a dramatic variation of the corresponding ΔΔG value (ΔΔGACE2(K353A) = −7.19 ± 0.74 kcal/mol, Figure 2A), confirming the major role played by this receptor residue in binding the viral S-RBD. D38, the intramolecular SB partner of K353, is also a hot spot in the S-RBDCoV-2/ACE2 interaction. Indeed, as seen from the top panel of Figure 5, D38 is stably engaged in two symmetrical HBs with the side chains of the S-protein residues Q498 (2.92 ± 0.19 Å) and Y449 (2.92 ± 0.20 Å), while performing CIs with the same residue (Y449) and G496 (Figure 5, top). Additionally, D38 intramolecularly H-bonds Q42 (3.04 ± 0.18 Å), thereby favoring the formation of a stable Q42–Y449 intermolecular HB (3.03 ± 0.20 Å) and a Q42–Q498 stabilizing CI (Figure 5, top). Based on such MD-derived structural evidence, mutating D into A at position 38 of ACE2 results in the abrogation of most of these inter/intramolecular interactions; this, in turn, properly reflects the substantial variation of the corresponding binding free energy value, so that ΔΔGACE2(D38A) = −5.11 ± 0.21 kcal/mol (Figure 2A). Contextually, a significantly smaller ΔΔG variation is predicted when replacing the side chain of Q42 with alanine—ΔΔGACE2(Q42A) = −2.19 ± 0.11 kcal/mol (Figure 2A)—in accord with the lesser role played by this residue at the protein–protein interface.
Y41, D355, and R357
According to our simulations, Y41, located on ACE2 α-helix 1, is another key residue in the interaction between the viral S-protein and its human cellular receptor. From the bottom panel of Figure 5, it can be seen that, in addition to the CI with the side chain of K353 already reported above, Y41 H-bonds D355 (2.78 ± 0.15 Å) and establishes favorable van der Waals contacts with the side chains of L45 and L351. From the intermolecular perspective, in the S-RBDCoV-2/ACE2 assembly, Y41 exchanges two HBs with the viral protein residues T500 (3.08 ± 0.23 Å) and N501 (3.23 ± 0.22 Å) and a CI with the side chain of Q498 (Figure 5, bottom). The corresponding value of ΔΔGACE2(Y41A) = −4.43 ± 0.33 kcal/mol (Figure 2A) is in line with the important contribution this residue provides to the formation of the viral protein/receptor interface. D355 is another residue playing an important role in shaping this protein/protein interface region. In addition to the main intermolecular HB with Y41 discussed above, D355 also shares the same type of interaction across the interface with T500 of SARS-CoV-2 S-RBD (2.77 ± 0.16 Å, Figure 5, bottom). Moreover, the negative charge on D355 is aptly neutralized at the complex interface via the formation of an internal SB with R357 (3.68 ± 0.19 Å, Figure 6, top) which, in turn, is held in place by weak CIs of polar and dispersive nature with the side chains of N330, W48, and L351 of ACE2 and of T500 of S-RBDCoV-2 (Figure 6, top). The relevant results of the computational alanine-scanning mutagenesis reveal a significant loss in the binding free energy when D355 is mutated to alanine—ΔΔGACE2(D355A) = −3.18 ± 0.20 kcal/mol, Figure 2A—as expected from the abrogation of important interactions such as HBs and SBs exquisitely involving the side chain of D355. Similarly, the replacement of R with A at ACE2 position 357 reflects a decrement of the relevant ΔΔG value: ΔΔGACE2(R357A) = −3.32 ± 0.25 kcal/mol, Figure 2A. Finally, mutating into alanine, those other receptor residues involved in the weak interaction network just described have very little effect on protein/protein binding (e.g., ΔΔGACE2(N330A) = −0.68 ± 0.05 kcal/mol, Figure 2A).
E37 and R393
According to the results of the present study, in the S-RBDCoV-2/ACE2 complex E37 is engaged—beside an internal SB with R393 (3.93 ± 0.38 Å)—in two important binding contacts: a HB with the side chain of Y505 (3.15 ± 0.24 Å) and a strong SB with the side chain of R403 (3.62 ± 0.39 Å, Figure 6, bottom). Thus, the E37A mutation actually shows a considerable variation in the corresponding ΔΔG value—ΔΔGACE2(E37A) = −5.12 ± 0.22 kcal/mol, Figure 2A—making E37 another protein/protein hot spot residue. Similarly, the calculated value of ΔΔG for the alanine variant of ACE2 R393 properly reflects the relevant roles played by this residue at the protein/protein interface (ΔΔGACE2(R393A) = −2.33 ± 0.19 kcal/mol, Figure 2A).
Analysis of the SARS-CoV-2 S-RBD Residues at the Binding Interface with ACE2
Y449, Y453, and T500
According to the present MD simulation results, S-RBDCoV-2 residue Y449 establishes one intermolecular HB with ACE2 Q42 and one intramolecular HB with Q498 (3.04 ± 0.18 Å); also, Y449 exchanges van der Waals contacts with D38 and Q42 (Figure 5, top). The ΔΔG value predicted for the corresponding Y449A replacement is therefore proportional to loss of the relevant interactions, that is, ΔΔGCoV-2(Y449A) = −3.21 ± 0.31 kcal/mol (Figure 2B). Likewise, T500 of S-RBDCoV-2 establishes an extensive network of intermolecular interactions with the ACE2 receptor, which includes H-bonding with Y41 and D355, and polar/van der Waals CIs with the side chains of R357 and N330 (Figure 5, bottom, and Figure 6, top). As reported above, these interactions are instrumental in properly shaping the relevant protein/protein interface regions so that, once removed by replacing T with A on the S-RBDCoV-2, a significant loss in binding free energy is predicted as ΔΔGCoV-2(T500A) = −4.17 ± 0.36 kcal/mol (Figure 2B). On the contrary, Y453 of SARS-CoV-2 plays only a minor role in the corresponding protein/protein interface. For this residue, along with the polar intermolecular interaction with ACE2 H34, only one intramolecular CI with Q493 is detected (Figure 4, bottom). Consequently, ΔΔGCoV-2(Y453A) = −0.79 ± 0.30 kcal/mol (Figure 2B).
N487, Y489, and Y505
S-RBDCoV-2 N487 contributes to ACE2 binding via two HBs with Q24 and Y83 (Figures 3 and 4, top panels). In line, the alanine substitution at this S-protein position displays a variation in binding free energy of ΔΔGCoV-2(N487A) = −2.25 ± 0.35 kcal/mol (Figure 2B). Similarly, mutating the side chain of Y489, for which only CIs with the side chains of the ACE2 residues Q24, Y83, T27, and K31 are detected along the corresponding MD trajectory (Figures 3 and 4), into alanine on the same viral protein domain results in a modest energetic variation (ΔΔGCoV-2(Y489A) = −2.25 ± 0.35 kcal/mol, Figure 2B). Y505 of S-RBDCoV-2, on the other hand, participates in protein/protein hydrogen bonding to the side chain of ACE2 E37 and in CIs with R393 and K353 (Figures 5, top, and 6, bottom). Y505 also establishes a persistent internal π–cation involving the aromatic ring of this residue and the guanidinium group of R403 (Figure 6, bottom). In agreement with this interaction pattern, the Y505A mutation reduces the binding affinity of S-RBDCoV-2 for ACE2 by more than 3 kcal/mol (ΔΔGCoV-2(Y505A) = −3.27 ± 0.31 kcal/mol, Figure 2B).
L455, F456, and F486
Mutating S-RBDCoV-2 L455 into alanine in silico does not reveal any significant change in the affinity of the relevant viral protein for the ACE2 receptor. Indeed, the side chain of this residue points to a charged pocket sealed by the side chains of ACE2 D30, K31, and H34, to which L455 provides moderately stabilizing van der Waals (L455) interactions (Figure 4, bottom). The resultant value of ΔΔG is thus limited to ΔΔGCoV-2(L455A) = −1.21 ± 0.32 kcal/mol (Figure 2B). F456 of S-RBDCoV-2 provides three intermolecular CIs with ACE T27, D30, and K31, an important stabilizing intramolecular π–cation interaction with the side chain of K417 (topical in assisting this lysine in salt-bridging D30), and an internal CI with Y473 (Figures 3 and 4, bottom panels). Similarly, S-RBDCoV-2 F486 appears to stabilize the ACE2 hydrophobic patch around Y83 by exchanging three intermolecular CIs with the receptor side chains of L79, M82, and Y83 (Figure 4, top). When all of these residues are mutated into alanine, the related values of ΔΔG nicely reflect these similarities, as ΔΔGCoV-2(F456A) = −1.99 ± 0.28 kcal/mol and ΔΔGCoV-2(F486A) = −2.13 ± 0.32 kcal/mol (Figure 2B).
Q493 and Q498
At the 493 position of the SARS-CoV-2 S-protein, Q493 forms two interface-anchoring HBs with ACE2 K31 and E35 and one internal HB with the side chain of S494 (3.24 ± 0.21 Å, Figure 4, bottom). S-RBDCoV-2 Q498, however, affords a substantially greater number of favorable interactions to viral protein binding. As seen from Figure 5 (top panel), Q498 indeed establishes two fundamental HBs across the binding interface with ACE2 D38 and K353, along with further stabilizing CIs with the side chains of Q42 and Y41 on the receptor. Also, it exchanges an internal HB with the S-RBDCoV-2 N501 (3.02 ± 0.18 Å, Figure 5, bottom), in addition to the same type of interaction with Y449 discussed above; all of these contacts clearly concur in making this region one of the most structured and energetically important of the whole ACE2/S-RBDCoV-2 binding interface. Thus, the substitution of Q498 with alanine is accompanied by a ∼5.5 kcal/mol loss in binding free energy (ΔΔGCoV-2(Q498A) = −5.36 ± 0.37 kcal/mol, Figure 2B), making this S-RBDCoV-2 residue a viral protein/receptor binding hot spot with respect to the less effective Q493, for which ΔΔGCoV-2(Q493A) = −3.15 ± 0.29 kcal/mol (Figure 2B), in keeping with the differential contribution of these two residues to protein/protein binding.
N501, R403, and K417
N501 on S-RBDCoV-2 H-bonds the side chain of ACE2 Y41 while exchanging a polar CI with K353 and the internal HB with Q498 discussed above (Figure 5, bottom). On the other hand, R403 and K417 are the S-RBDCoV-2 residues making the two topical interface SBs with ACE2 E37 and D30, respectively (Figures 6 and 4, bottom panels). However, at variance with K417, R403 further establishes an internal SB with the side chain of S-RBDCoV-2 D405 (3.02 ± 0.18 Å, Figure 6, bottom) and two other intramolecular CIs with Y495 and Y505. Therefore, the values of the total free energy change for mutating N501, K417, and R403 in alanine—ΔΔGCoV-2(N501A) = −2.40 ± 0.28 kcal/mol, ΔΔGCoV-2(K417A) = −2.72 ± 0.34 kcal/mol, and ΔΔGCoV-2(R403A) = −4.25 ± 0.39 kcal/mol (Figure 2B)—properly rank the relative importance of these residues at the protein/protein interface and flag R403 as another viral protein hot spot for receptor binding.
Conclusions
One of the major goals of this work was to provide an atomistic-based, reliable in silico structural and energetic framework of the interactions between S-RBDCoV-2 and its host cellular receptor ACE2 that may suggest precise targets for the structure-based design and development of neutralizing antibodies, vaccines, and protein/protein inhibitors so urgently needed in the current fight against this terrible new pandemic. Accordingly, we have simulated single alanine substitutions at all different residues of ACE2 and S-RBDCoV-2 that form most of the protein–protein interface and estimated the variation in the corresponding free energy of binding. These mutagenesis studies provide a clear picture of the main molecular determinants in ACE2/S-RBDCoV-2 recognition and highlight residues D38, K31, E37, K353, and Y41 on ACE2 and Q498, T500, and R403 on the SARS-CoV-2 S-protein receptor-binding domain as true hot spots contributing to shaping and determining the stability of the relevant protein–protein interface. In addition, the results and methodologies presented and discussed above are currently being extended by our group to the estimation of the binding affinity of the viral protein to different allelic variants (AVs) of ACE2 receptors discovered in COVID-19 patients, with the ultimate goal of verifying if any of such AVs could eventually associate with different degrees of clinically observed viral pathogenicity.
Methods
All calculations reported in this work were performed in AMBER1948 starting from the recent crystal structure of the ACE2/S-RBDCoV-2 complex (PDB ID 6M0J).18 The role of the protein/protein interface key residues was studied by performing a combination of molecular mechanics/Poisson–Boltzmann surface area,49 computational alanine scanning mutagenesis,50 and interaction entropy methods.44 All details are reported in the extended Methods section of the Supporting Information.
Acknowledgments
We acknowledge CINECA Supercomputing Center for awarding us access to Marconi100 based at CINECA (Bologna, Italy) (COVID19 Computational Alanine Scanning to Discover Genetic Susceptibility (CovAdis), HPC access Grant No. HP10C9HMOT).
Supporting Information Available
The Supporting Information is available free of charge at https://pubs.acs.org/doi/10.1021/acsnano.0c04674.
Author Contributions
§ E.L. and D.M. contributed equally.
The authors declare no competing financial interest.
Supplementary Material
References
- Zhou P.; Yang X. L.; Wang X. G.; Hu B.; Zhang L.; Zhang W.; Si H. R.; Zhu Y.; Li B.; Huang C. L.; Chen H. D.; Chen J.; Luo Y.; Guo H.; Jiang R. D.; Liu M. Q.; Chen Y.; Shen X. R.; Wang X.; Zheng X. S.; et al. A Pneumonia Outbreak Associated with a New Coronavirus of Probable Bat Origin. Nature 2020, 579, 270–273. 10.1038/s41586-020-2012-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhu N.; Zhang D.; Wang W.; Li X.; Yang B.; Song J.; Zhao X.; Huang B.; Shi W.; Lu R.; Niu P.; Zhan F.; Ma X.; Wang D.; Xu W.; Wu G.; Gao G. F.; Tan W. A Novel Coronavirus from Patients with Pneumonia in China. N. Engl. J. Med. 2020, 382, 727–733. 10.1056/NEJMoa2001017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- https://www.who.int/emergencies/diseases/novel-coronavirus-2019/situation-reports (accessed 2020-06-04).
- http://www.salute.gov.it/portale/nuovocoronavirus/dettaglioNotizieNuovoCoronavirus.jsp?lingua=english&menu=notizie&p=dalministero&id=4854 (accessed 2020-06-04).
- Cavanagh D.Coronaviridae: A Review of Coronaviruses and Toroviruses. In Coronaviruses with Special Emphasis on First Insights Concerning SARS; Schmidt A., Weber O., Wolff M. H., Eds.; Birkhäuser Basel: Basel, 2005; pp 1–54. [Google Scholar]
- Graham R. L.; Donaldson E. F.; Baric R. S. A Decade After SARS: Strategies for Controlling Emerging Coronaviruses. Nat. Rev. Microbiol. 2013, 11, 836–848. 10.1038/nrmicro3143. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li F. Receptor Recognition and Cross-Species Infections of SARS Coronavirus. Antiviral Res. 2013, 100, 246–254. 10.1016/j.antiviral.2013.08.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Andersen K. G.; Rambaut A.; Lipkin W. I.; Holmes E. C.; Garry R. F. The Proximal Origin of SARS-CoV-2. Nat. Med. 2020, 26, 450–452. 10.1038/s41591-020-0820-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Corman V. M.; Muth D.; Niemeyer D.; Drosten C. Hosts and Sources of Endemic Human Coronaviruses. Adv. Virus Res. 2018, 100, 163–188. 10.1016/bs.aivir.2018.01.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Greger M. The Human/Animal Interface: Emergence and Resurgence of Zoonotic Infectious Diseases. Crit. Rev. Microbiol. 2007, 33, 243–299. 10.1080/10408410701647594. [DOI] [PubMed] [Google Scholar]
- Li F. Structure, Function, and Evolution of Coronavirus Spike Proteins. Annu. Rev. Virol. 2016, 3, 237–261. 10.1146/annurev-virology-110615-042301. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li W.; Moore M. J.; Vasilieva N.; Sui J.; Wong S. K.; Berne M. A.; Somasundaran M.; Sullivan J. L.; Luzuriaga K.; Greenough T. C.; Choe H.; Farzan M. Angiotensin-Converting Enzyme 2 Is a Functional Receptor for the SARS Coronavirus. Nature 2003, 426, 450–454. 10.1038/nature02145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hoffmann M.; Kleine-Weber H.; Schroeder S.; Krüger N.; Herrler T.; Erichsen S.; Schiergens T. S.; Herrler G.; Wu N. H.; Nitsche A.; Müller M. A.; Drosten C.; Pöhlmann S. SARS-CoV-2 Cell Entry Depends on ACE2 and TMPRSS2 and Is Blocked by a Clinically Proven Protease Inhibitor. Cell 2020, 181, 271–280. 10.1016/j.cell.2020.02.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shang J.; Ye G.; Shi K.; Wan Y.; Luo C.; Aihara H.; Geng Q.; Auerbach A.; Li F. Structural Basis of Receptor Recognition by SARS-CoV-2. Nature 2020, 581, 221–224. 10.1038/s41586-020-2179-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yan R.; Zhang Y.; Li Y.; Xia L.; Guo Y.; Zhou Q. Structural Basis for the Recognition of SARS-CoV-2 by Full-Length human ACE2. Science 2020, 367, 1444–1448. 10.1126/science.abb2762. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wrapp D.; Wang N.; Corbett K. S.; Goldsmith J. A.; Hsieh C. L.; Abiona O.; Graham B. S.; McLellan J. S. Cryo-EM Structure of the 2019-nCoV Spike in the Prefusion Conformation. Science 2020, 367, 1260–1263. 10.1126/science.abb2507. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Walls A. C.; Park Y. J.; Tortorici M. A.; Wall A.; McGuire A. T.; Veesler D. Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein. Cell 2020, 181, 281–292. 10.1016/j.cell.2020.02.058. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lan J.; Ge J.; Yu J.; Shan S.; Zhou H.; Fan S.; Zhang Q.; Shi X.; Wang Q.; Zhang L.; Wang X. Structure of the SARS-CoV-2 Spike Receptor-Binding Domain Bound to the ACE2 Receptor. Nature 2020, 581, 215–220. 10.1038/s41586-020-2180-5. [DOI] [PubMed] [Google Scholar]
- Wang Q.; Zhang Y.; Wu L.; Niu S.; Song C.; Zhang Z.; Lu G.; Qiao C.; Hu Y.; Yuen K. Y.; Wang Q.; Zhou H.; Yan J.; Qi J. Structural and Functional Basis of SARS-CoV-2 Entry by Using Human ACE2. Cell 2020, 181, 894–904. 10.1016/j.cell.2020.03.045. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wan Y.; Shang J.; Graham R.; Baric R. S.; Li F. Receptor Recognition by the Novel Coronavirus from Wuhan: An Analysis Based on Decade-Long Structural Studies of SARS Coronavirus. J. Virol. 2020, 94, e00127-20. 10.1128/JVI.00127-20. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Han Y.; Král P. Computational Design of ACE2-Based Peptide Inhibitors of SARS-CoV-2. ACS Nano 2020, 14, 5143–5147. 10.1021/acsnano.0c02857. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ziouziou H.; Andrieu C.; Laurini E.; Karaki S.; Fermeglia M.; Oueslati R.; Taieb D.; Camplo M.; Siri O.; Pricl S.; Katsogiannou M.; Rocchi P. Targeting Hsp27/eIF4E Interaction with Phenazine Compound: A Promising Alternative for Castration-Resistant Prostate Cancer Treatment. Oncotarget 2017, 8, 77317–77329. 10.18632/oncotarget.20469. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Perfetti V.; Laurini E.; Aulić S.; Fermeglia M.; Riboni R.; Lucioni M.; Dallera E.; Delfanti S.; Pugliese L.; Latteri F. S.; Pietrabissa A.; Pricl S. Molecular and Functional Characterization of a New 3′ End KIT Juxtamembrane Deletion in a Duodenal GIST Treated with Neoadjuvant Imatinib. Oncotarget 2017, 8, 56158–56167. 10.18632/oncotarget.19341. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Colombo C.; Belfiore A.; Paielli N.; De Cecco L.; Canevari S.; Laurini E.; Fermeglia M.; Pricl S.; Verderio P.; Bottelli S.; Fiore M.; Stacchiotti S.; Palassini E.; Gronchi A.; Pilotti S.; Perrone F. β-Catenin in Desmoid-Type Fibromatosis: Deep Insights into the Role of T41A and S45F Mutations on Protein Structure and Gene Expression. Mol. Oncol. 2017, 11, 1495–1507. 10.1002/1878-0261.12101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Genini D.; Brambilla L.; Laurini E.; Merulla J.; Civenni G.; Pandit S.; D’Antuono R.; Perez L.; Levy D. E.; Pricl S.; Carbone G. M.; Catapano C. V. Mitochondrial Dysfunction Induced by a SH2 Domain-Targeting STAT3 Inhibitor Leads to Metabolic Synthetic Lethality in Cancer Cells. Proc. Natl. Acad. Sci. U. S. A. 2017, 114, e4924–e4933. 10.1073/pnas.1615730114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Morgan A.; Gandin I.; Belcaro C.; Palumbo P.; Palumbo O.; Biamino E.; Dal Col V.; Laurini E.; Pricl S.; Bosco P.; Carella M.; Ferrero G. B.; Romano C.; d’Adamo A. P.; Faletra F.; Vozzi D. Target Sequencing Approach Intended to Discover New Mutations in Non-Syndromic Intellectual Disability. Mutat. Res., Fundam. Mol. Mech. Mutagen. 2015, 781, 32–36. 10.1016/j.mrfmmm.2015.09.002. [DOI] [PubMed] [Google Scholar]
- Brambilla L.; Genini D.; Laurini E.; Merulla J.; Perez L.; Fermeglia M.; Carbone G. M.; Pricl S.; Catapano C. V. Hitting the Right Spot: Mechanism of Action of OPB-31121, a Novel and Potent Inhibitor of the Signal Transducer and Activator of Transcription 3 (STAT3). Mol. Oncol. 2015, 9, 1194–1206. 10.1016/j.molonc.2015.02.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pricl S.; Cortelazzi B.; Dal Col V.; Marson D.; Laurini E.; Fermeglia M.; Licitra L.; Pilotti S.; Bossi P.; Perrone F. Smoothened (SMO) Receptor Mutations Dictate Resistance to Vismodegib in Basal Cell Carcinoma. Mol. Oncol. 2015, 9, 389–397. 10.1016/j.molonc.2014.09.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brune S.; Schepmann D.; Klempnauer K. H.; Marson D.; Dal Col V.; Laurini E.; Fermeglia M.; Wünsch B.; Pricl S. The Sigma Enigma: In Vitro/In Silico Site-Directed Mutagenesis Studies Unveil σ1 Receptor Ligand Binding. Biochemistry 2014, 53, 2993–3003. 10.1021/bi401575g. [DOI] [PubMed] [Google Scholar]
- Gibbons D. L.; Pricl S.; Posocco P.; Laurini E.; Fermeglia M.; Sun H.; Talpaz M.; Donato N.; Quintás-Cardama A. Molecular Dynamics Reveal BCR-ABL1 Polymutants as a Unique Mechanism of Resistance to PAN-BCR-ABL1 Kinase Inhibitor Therapy. Proc. Natl. Acad. Sci. U. S. A. 2014, 111, 3550–3555. 10.1073/pnas.1321173111. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bozzi F.; Conca E.; Laurini E.; Posocco P.; Lo Sardo A.; Jocollè G.; Sanfilippo R.; Gronchi A.; Perrone F.; Tamborini E.; Pelosi G.; Pierotti M. A.; Maestro R.; Pricl S.; Pilotti S. In Vitro and In Silico Studies of MDM2/MDMX Isoforms Predict Nutlin-3A Sensitivity in Well/De-Differentiated Liposarcomas. Lab. Invest. 2013, 93, 1232–1240. 10.1038/labinvest.2013.107. [DOI] [PubMed] [Google Scholar]
- Laurini E.; Posocco P.; Fermeglia M.; Gibbons D. L.; Quintás-Cardama A.; Pricl S. Through the Open Door: Preferential Binding of Dasatinib to the Active Form of BCR-ABL Unveiled by In Silico Experiments. Mol. Oncol. 2013, 7, 968–975. 10.1016/j.molonc.2013.06.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Conca E.; Miranda C.; Dal Col V.; Fumagalli E.; Pelosi G.; Mazzoni M.; Fermeglia M.; Laurini E.; Pierotti M. A.; Pilotti S.; Greco A.; Pricl S.; Tamborini E. Are Two Better Than One? A Novel Double-Mutant KIT in GIST that Responds to Imatinib. Mol. Oncol. 2013, 7, 756–762. 10.1016/j.molonc.2013.02.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pierotti M. A.; Tamborini E.; Negri T.; Pricl S.; Pilotti S. Targeted Therapy in GIST: In Silico Modeling for Prediction of Resistance. Nat. Rev. Clin. Oncol. 2011, 8, 161–170. 10.1038/nrclinonc.2011.3. [DOI] [PubMed] [Google Scholar]
- Dileo P.; Pricl S.; Tamborini E.; Negri T.; Stacchiotti S.; Gronchi A.; Posocco P.; Laurini E.; Coco P.; Fumagalli E.; Casali P. G.; Pilotti S. Imatinib Response in Two GIST Patients Carrying Two Hitherto Functionally Uncharacterized PDGFRA Mutations: An Imaging, Biochemical and Molecular Modeling Study. Int. J. Cancer 2011, 128, 983–990. 10.1002/ijc.25418. [DOI] [PubMed] [Google Scholar]
- Conca E.; Negri T.; Gronchi A.; Fumagalli E.; Tamborini E.; Pavan G. M.; Fermeglia M.; Pierotti M. A.; Pricl S.; Pilotti S. Activate and Resist: L576P-KIT in GIST. Mol. Cancer Ther. 2009, 8, 2491–2495. 10.1158/1535-7163.MCT-09-0662. [DOI] [PubMed] [Google Scholar]
- Woodman S. E.; Trent J. C.; Stemke-Hale K.; Lazar A. J.; Pricl S.; Pavan G. M.; Fermeglia M.; Gopal Y. N.; Yang D.; Podoloff D. A.; Ivan D.; Kim K. B.; Papadopoulos N.; Hwu P.; Mills G. B.; Davies M. A. Activity of Dasatinib Against L576P KIT Mutant Melanoma: Molecular, Cellular, and Clinical Correlates. Mol. Cancer Ther. 2009, 8, 2079–2085. 10.1158/1535-7163.MCT-09-0459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McAuliffe J. C.; Wang W. L.; Pavan G. M.; Pricl S.; Yang D.; Chen S. S.; Lazar A. J.; Pollock R. E.; Trent J. C. Unlucky Number 13? Differential Effects of KIT Exon 13 Mutation in Gastrointestinal Stromal Tumors. Mol. Oncol. 2008, 2, 161–163. 10.1016/j.molonc.2008.05.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Negri T.; Pavan G. M.; Virdis E.; Greco A.; Fermeglia M.; Sandri M.; Pricl S.; Pierotti M. A.; Pilotti S.; Tamborini E. T670X KIT Mutations in Gastrointestinal Stromal Tumors: Making Sense of Missense. J. Natl. Cancer Inst. 2009, 101, 194–204. 10.1093/jnci/djn477. [DOI] [PubMed] [Google Scholar]
- Ferrone M.; Perrone F.; Tamborini E.; Paneni M. S.; Fermeglia M.; Suardi S.; Pastore E.; Delia D.; Pierotti M. A.; Pricl S.; Pilotti S. Functional Analysis and Molecular Modeling Show a Preserved Wild-Type Activity of P53(C238Y). Mol. Cancer Ther. 2006, 5, 1467–1473. 10.1158/1535-7163.MCT-06-0012. [DOI] [PubMed] [Google Scholar]
- Tamborini E.; Pricl S.; Negri T.; Lagonigro M. S.; Miselli F.; Greco A.; Gronchi A.; Casali P. G.; Ferrone M.; Fermeglia M.; Carbone A.; Pierotti M. A.; Pilotti S. Functional Analyses and Molecular Modeling of Two C-Kit Mutations Responsible for Imatinib Secondary Resistance in GIST Patients. Oncogene 2006, 25, 6140–6146. 10.1038/sj.onc.1209639. [DOI] [PubMed] [Google Scholar]
- Pricl S.; Fermeglia M.; Ferrone M.; Tamborini E. T315I-Mutated Bcr-Abl in Chronic Myeloid Leukemia and Imatinib: Insights from a Computational Study. Mol. Cancer Ther. 2005, 4, 1167–1174. 10.1158/1535-7163.MCT-05-0101. [DOI] [PubMed] [Google Scholar]
- Liu X.; Peng L.; Zhou Y.; Zhang Y.; Zhang J. Z. H. Computational Alanine Scanning with Interaction Entropy for Protein–Ligand Binding Free Energies. J. Chem. Theory Comput. 2018, 14, 1772–1780. 10.1021/acs.jctc.7b01295. [DOI] [PubMed] [Google Scholar]
- Sun Z.; Yan Y. N.; Yang M.; Zhang J. Z. Interaction Entropy for Protein-Protein Binding. J. Chem. Phys. 2017, 146, 124124. 10.1063/1.4978893. [DOI] [PubMed] [Google Scholar]
- Yan Y.; Yang M.; Ji C. G.; Zhang J. Z. H. Interaction Entropy for Computational Alanine Scanning. J. Chem. Inf. Model. 2017, 57, 1112–1122. 10.1021/acs.jcim.6b00734. [DOI] [PubMed] [Google Scholar]
- Qiu L.; Yan Y.; Sun Z.; Song J.; Zhang J. Z. H. Interaction Entropy for Computational Alanine Scanning in Protein–Protein Binding. Wiley Interdiscip. Rev.: Comput. Mol. Sci. 2018, 8, e1342 10.1002/wcms.1342. [DOI] [Google Scholar]
- Towler P.; Staker B.; Prasad S. G.; Menon S.; Tang J.; Parsons T.; Ryan D.; Fisher M.; Williams D.; Dales N. A.; Patane M. A.; Pantoliano M. W. ACE2 X-Ray Structures Reveal a Large Hinge-Bending Motion Important for Inhibitor Binding and Catalysis. J. Biol. Chem. 2004, 279, 17996–8007. 10.1074/jbc.M311191200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Case D. A.; Ben-Shalom I. Y.; Brozell S. R.; Cerutti D. S.; Cheatham T. E. III; Cruzeiro V. W. D.; Darden T. A.; Duke R. E.; Ghoreishi D.; Giambasu G.; Giese T.; Gilson M. K.; Gohlke H.; Goetz A. W.; Greene D.; Harris R.; Homeyer N.; Huang Y.; Izadi S.; Kovalenko A.; et al. AMBER 2019; University of California, San Francisco, 2019.
- Wang E.; Sun H.; Wang J.; Wang Z.; Liu H.; Zhang J. Z. H.; Hou T. End-Point Binding Free Energy Calculation with MM/PBSA and MM/GBSA: Strategies and Applications in Drug Design. Chem. Rev. 2019, 119, 9478–9508. 10.1021/acs.chemrev.9b00055. [DOI] [PubMed] [Google Scholar]
- Simões I. C.; Costa I. P.; Coimbra J. T.; Ramos M. J.; Fernandes P. A. New Parameters for Higher Accuracy in the Computation of Binding Free Energy Differences Upon Alanine Scanning Mutagenesis on Protein-Protein Interfaces. J. Chem. Inf. Model. 2017, 57, 60–72. 10.1021/acs.jcim.6b00378. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.