Abstract
SARS-CoV-2 coronavirus uses for entry to human host cells a SARS-CoV receptor of the angiotensin-converting enzyme (ACE2) that catalyzes the conversion of angiotensin II into angiotensin (1-7). To understand the effect of ACE2 missense variants on protein structure, stability, and function, various bioinformatics tools were used including SIFT, PANTHER, PROVEAN, PolyPhen2.0, I. Mutant Suite, MUpro, SWISS-MODEL, Project HOPE, ModPred, QMEAN, ConSurf, and STRING. All twelve ACE2 nsSNPs were analyzed. Six ACE2 high-risk pathogenic nsSNPs (D427Y, R514G, R708W, R710C, R716C, and R768W) were found to be the most damaging by at least six software tools (cumulative score between 6 and 7) and exert deleterious effect on the ACE2 protein structure and likely function. Additionally, they revealed high conservation, less stability, and having a role in posttranslation modifications such a proteolytic cleavage or ADP-ribosylation. This in silico analysis provides information about functional nucleotide variants that have an impact on the ACE2 protein structure and function and therefore susceptibility to SARS-CoV-2.
1. Introduction
During December 2019 in Wuhan, China, a novel infectious disease called the coronavirus disease 2019 or COVID-19 was detected and found to be caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2). Later on, it was declared as a pandemic by the World Health Organization (WHO) on March 2020 [1, 2]. The most common symptoms were fever, cough, shortness of breath, sputum production, and fatigue [3]. SARS-CoV-2 infects the respiratory tract and engenders acute respiratory disease [3]. SARS-CoV-2 utilizes angiotensin-converting enzyme 2 (ACE2) as an entry receptor and the cellular serine protease TMPRSS2 for SARS-CoV-2 spike (S) protein priming [4].
The ACE2 gene is located on the chromosome X, specifically at the position Xp22. It contains 18 exons that show a striking resemblance in exons size and organization to those of the ACE gene [5, 6]. The complete cDNA sequence of ACE2 encodes an 805 amino acid protein, which is composed mainly of an N-terminal signal peptide (17 amino acid residues), a peptidase domain (positioning from 19-615 amino acids), and a C-terminal collectrin, which plays a significant role as a regulator of renal amino acid transportation, insulin exocytosis, and β-cell proliferation [5, 7].
ACE2 is expressed mostly in the vascular endothelial cells of the kidney and constitutes a vital site of blood pressure regulation in the renin-angiotensin system (RAS) and heart function. In the heart, ACE2 constitutes the first pathway for the metabolism of angiotensin II and is the most essential factor for progressive cardiac disease [8–10]. Interestingly, it seems that ACE2 has diverse biological functions, such as regulation of blood pressure by the renin-angiotensin aldosterone system (RAAS), and metabolizing angiotensin II (Ang II) to Ang (1-7), a biological active peptide that binds to the Mas receptor to exert many beneficial vasodilatory, antifibrotic, antithrombotic, and antiproliferative actions [11, 12]. Very recently, it remains possible that ACE2 gene polymorphism may influence both the susceptibility to SARS-CoV-2 infection and COVID-19 disease outcome [5]. There are a few reports related to computational analysis of missense variants on the ACE2 gene. Hence, the present study is aimed at identifying deleterious variants in the ACE2 gene by using various bioinformatics tools to extrapolate the possible associations of ACE2 polymorphisms with COVID-19 susceptibility.
2. Materials and Methods
2.1. Retrieval of Variant Datasets
The data related to ACE2 SNPs (rsIDs) was retrieved from genome aggregation database (gnomAD A3) (https://gnomad.broadinstitute.org/gene/ENSG00000130234?dataset=gnomad_r3) and Ensembl database (https://www.ensembl.org/Homo_sapiens/Gene/Variation_Gene/Table?db=core;g=ENSG00000130234;r=X:15561033-15602148). The amino acid (AA) sequence was retrieved from UniProt (https://www.uniprot.org/uniprot/Q9BYF1.fasta). Information on human ACE2 protein and gene was collected from the Online Mendelian Inheritance in Man (OMIM) database (https://www.omim.org/entry/300335).
2.2. Prediction of Deleterious nsSNPs of ACE2 Gene
To predict the deleterious effect of nonsynonymous SNPs (nsSNPs) of the human ACE2 gene on protein function, four tools were used, namely, SIFT, PolyPhen 2.0, PANTHER, and PROVEAN. SIFT (Sorting Intolerant from Tolerant) (https://sift.bii.a-star.edu.sg/) is a program that employs sequence homology to predict the impact of an AA substitution on protein function. It classifies AA substitutions with a score less than 0.05 as deleterious [13]. PolyPhen 2.0 (Polymorphism Phenotyping v2), (http://genetics.bwh.harvard.edu/pph2/) evaluates the impact of an AA substitution on the protein function and structure. It classifies the substitution as probably damaging (score = 1.0), and possibly damaging or benign (score = 0.0) [14]. PANTHER cSNP (Protein ANalysis THrough Evolutionary Relationship_coding_SNP) (http://pantherdb.org/tools/csnpScoreForm.jsp) calculates the likelihood of a single AA change on protein function, and it is based on the PANTHER-PSEP (Position_Specific Evolutionary Preservation) method [15]. PROVEAN (Protein Variation Effect Analyzer) (http://provean.jcvi.org/index.php) is a web server that was used to predict the impact of AA substitutions on the biological function of a protein [16].
2.3. Effect on the Stability of Protein
The stability of the protein was checked using the MUpro bioinformatics tool and I. Mutant Suite. MUpro (http://mupro.proteomics.ics.uci.edu/) predicts if a mutation increases or decreases the stability of protein structure [17]. I. Mutant Suite, a support vector machine-based algorithm, is available at (http://gpcr2.biocomp.unibo.it/cgi/predictors/I-Mutant3.0/I-Mutant3.0.cgi). It predicts the mutant protein stability starting from protein sequence alone [18].
2.4. Phylogenetic Conservational Analysis of ACE2
Conservation prediction of the ACE2 protein sequence was analyzed with ConSurf (https://consurf.tau.ac.il/), a web server used for identifying functional regions in proteins by analysing the evolutionary dynamics of AA substitutions among homologous sequences. The conservation score of 1-3 is variable, 5-6 as an intermediate scale, and 7-9 as a highly conserved AA positions [19].
2.5. Prediction of Posttranslational Modification Sites for ACE2
ACE2 protein posttranslational modification sites were predicted using ModPred (http://www.modpred.org/), a sequence-based predictor of 23 types of posttranslational modification (PTM) sites on proteins [20].
2.6. Statistical Analysis and Cumulative Score Calculation of Pathogenic nsSNPs
In order to predict the high-risk pathogenic nsSNPs of human ACE2, the cumulative score for all software tools (SIFT, PolyPhen2.0, PROVEAN, PANTHER, MUpro, I. Mutant, and ModPred) was calculated by using the Sum function in Excel [21]. Then, we set a restricted cumulative score value; when the result of the seven software tools were combined, the amino acid substitution that was evaluated to be deleterious by at least 6 tools would be identified as ACE2 high-risk pathogenic nsSNPs. For the final, correlation analysis was performed using SPSS v19 software. ANOVA and Student t-tests were applied to compare the predictions of the different tools. A p value less than 0.05 was significant [22].
2.7. Modelling
The 3D structure of full-length ACE2 protein was retrieved from The Research Collaboratory for Structural Bioinformatics Protein Data Bank (RCSB-PDB, http://rcsb.org), (ID 6M17) with a resolution of 2.9 Å. The 3D structure of the wild-type and mutant protein was generated using SWISS-MODEL (https://swissmodel.expasy.org/), and the quality of the model was checked using the Qualitative Model Energy Analysis (QMEAN) server (https://swissmodel.expasy.org/qmean/). SWISS-MODEL is a fully automated program that was used to predict the 3D structure of proteins. It generates 3D models by using homology modelling techniques [23]. The FASTA AA sequence of ACE2 protein was an input for the SWISS-MODEL. The predicted model of ACE2 from SWISS-MODEL was an input for the QMEAN analysis. QMEAN is a server that provides access to three scoring functions (QMEAN [24], QMEANBrane [25], and QMEANDisco) [26]. It estimates the quality of protein structure models in protein structure prediction [27]. PyMol is an open source program used for the three-dimensional visualization of macromolecules including proteins, nucleic acids, and small molecules [28]. The align command in PyMol is used to superpose two or more protein structures, and the superposition was evaluated based on RMSD (root mean square deviation) calculation [29].
2.8. Prediction of Structural Effect of Point Mutation on ACE2
Project HOPE (Have (y) Our Protein Explained) is a next-generation web server available at “https://www3.cmbi.umcn.nl/hope/” that is used to analyze the single point mutations. HOPE collects information from data sources and produces a mutation report enriched with figures that illustrates the effects of the mutation [30].
2.9. Prediction of Protein Interactions
STRING (Search Tool for Recurring Instances of Neighbouring Genes) is a web server available at “https://string-db.org/” that provides a platform for searching functional associations between proteins [31].
3. Results
3.1. SNP Dataset
In this study, 242 missense variants were collected from Ensembl and gnomAD A3 databases. SIFT, PolyPhen 2.0, PROVEAN, and PANTHER algorithms were used to predict the functional effects of mutation on the protein. MUpro and I. Mutant Suite tools were used to identify the mutation effects on protein stability. HOPE, ConSurf, ModPred, SWISS-MODEL, and STRING were used to predict the mutation effects on protein structure, function, and protein-protein interactions. In total, 13 different bioinformatics programs and web servers were used to assess the mutation effects on the variants in this investigation, because the prediction of the effect of a mutation using one algorithm is not sufficient for assessing that mutation effect.
3.2. Prediction of Functional nsSNPs in ACE2
The twelve missense SNPs were predicted to be deleterious using SIFT (where tolerance index score was ranged from 0 to 0.02) (Table 1). The nsSNPs predicted by SIFT were validated by PolyPhen 2.0, PANTHER, and PROVEAN. In particular, PolyPhen 2.0 results showed that twelve AA switches (R219C, R219H, M383T, P389H, D427Y, R514G, R708W, R710H, R710C, R716C, L731F, and R768W) were predicted probably damaging (score > 0.96) (Table 1). The PROVEAN analysis identified seven AA substitutions (M383T, P389H, D427Y, R514G, R708W, R710C, and R768W) were scored as deleterious (score below -2.5), and the rest were noted as neutral (score above -2.5) (Table 1). PANTHER predicted that nine nsSNPs were probably damaging (R219C, R219H, M383T, P389H, R514G, R710H, R710C, L731F, and R768W), two were noted as possibly damaging (R708W, and R716C), and one was scored as probably benign (D427Y) (Table 1).
Table 1.
SIFT | PolyPhen 2.0 | PROVEAN | PANTHER | ||||||
---|---|---|---|---|---|---|---|---|---|
SNPs ID | AA variant | Score | Prediction | Score | Prediction | Score | Prediction | PSPE | Prediction |
rs372272603 | R219C | 0 | Del | 1.000 | Pro Dam | -2.483 | Not Del | 750 | Pro Dam |
rs759590772 | R219H | 0 | Del | 0.988 | Pro Dam | -1.520 | Not Del | 750 | Pro Dam |
rs1396769231 | M383T | 0 | Del | 1.000 | Pro Dam | -4.797 | Del | 911 | Pro Dam |
rs762890235 | P389H | 0 | Del | 0.993 | Pro Dam | -7.862 | Del | 1037 | Pro Dam |
rs1316056737 | D427Y | 0 | Del | 0.970 | Pro Dam | -3.390 | Del | 176 | Not Del |
rs1352194082 | R514G | 0.02 | Del | 0.989 | Pro Dam | -6.483 | Del | 1037 | Pro Dam |
rs776995986 | R708W | 0.01 | Del | 1.000 | Pro Dam | -3.105 | Del | 361 | Pos Del |
rs901495523 | R710C | 0 | Del | 1.000 | Pro Dam | -2.936 | Del | 750 | Pro Dam |
rs370187012 | R710H | 0 | Del | 1.000 | Pro Dam | -1.788 | Not Del | 750 | Pro Dam |
rs144869363 | R716C | 0.01 | Del | 0.975 | Pro Dam | -1.638 | Not Del | 220 | Pos Del |
rs759590772 | L731F | 0 | Del | 0.995 | Pro Dam | -1.124 | Not Del | 750 | Pro Dam |
rs372272603 | R768W | 0 | Del | 1.000 | Pro Dam | -2.822 | Del | 750 | Pro Dam |
AA variant: amino acid variant; Del: deleterious; Not Del: not deleterious; Pro Dam: probably damaging; Pos Dam: possibly damaging; PSPE: position-specific evolutionary preservation.
3.3. Effect on the Stability of Protein
MUpro and I. Mutant Suite were used to predict change in protein stability. The result of I. Mutant Suite showed that eight amino acid substitutions (R219H, D427Y, R514G, R708W, R710C, R710H, R716C, and R768W) were recorded as decreasing the stability of the ACE2 protein, with a reliability index value ranged between 0 and 9, while the mutations (R219C, M383T, P389H, and L731F) were predicted to increase the stability of the ACE2 protein (Table 2). The result of stability predicted by MUpro showed that all the twelve nsSNPs (R219C, R219H, M383T, P389H, D427Y, R514G, R708W, R710H, R710C, R716C, L731F, and R768W) have decreased the stability of the ACE2 protein (Table 3).
Table 2.
SNPs ID | Amino acid substitution | Stability | RI | DDG value prediction (kcal/mol) |
---|---|---|---|---|
rs372272603 | R219C | Increase | 0 | -0.50 |
rs759590772 | R219H | Decrease | 3 | -0.58 |
rs1396769231 | M383T | Increase | 2 | -0.32 |
rs762890235 | P389H | Increase | 2 | 0.20 |
rs1316056737 | D427Y | Decrease | 3 | -0.56 |
rs1352194082 | R514G | Decrease | 8 | -1.84 |
rs776995986 | R708W | Decrease | 4 | -0.68 |
rs901495523 | R710C | Decrease | 8 | -1.72 |
rs370187012 | R710H | Decrease | 9 | -1.91 |
rs144869363 | R716C | Decrease | 7 | -2.20 |
rs759590772 | L731F | Increase | 4 | 0.01 |
rs372272603 | R768W | Decrease | 0 | -0.55 |
RI: reliability index; DDG: the free energy change value.
Table 3.
Method 1: SVM (support vector machine) | Method 2: neural network | |||||
---|---|---|---|---|---|---|
Mutation | Delta G | Prediction | Confidence score | Effect | Confidence score | Effect |
R219C | -0.89 | Decrease | -0.082 | Decrease | -0.57 | Decrease |
R219H | -1.16 | Decrease | -0.089 | Decrease | -0.51 | Decrease |
M383T | -1.56 | Decrease | -0.94 | Decrease | -0.70 | Decrease |
P389H | -0.45 | Decrease | -0.55 | Decrease | -0.85 | Decrease |
D427Y | -0.79 | Decrease | 0.16 | Increase | -0.67 | Decrease |
R514G | -1.15 | Decrease | -0.76 | Decrease | -0.98 | Decrease |
R708W | -0.99 | Decrease | -0.88 | Decrease | -0.98 | Decrease |
R710C | -1.17 | Decrease | -0.83 | Decrease | -0.87 | Decrease |
R710H | -1.66 | Decrease | -0.07 | Decrease | -0.98 | Decrease |
R716C | -1.34 | Decrease | -0.10 | Decrease | -0.82 | Decrease |
L731F | -0.87 | Decrease | -0.04 | Decrease | -0.66 | Decrease |
R768W | -1.89 | Decrease | -1 | Decrease | -0.99 | Decrease |
Delta G: energy change value; Confidence score: (>0: increase the stability, <0: decrease the stability).
3.4. Phylogenetic Conservation
According to ConSurf analysis, R514, R708, R710, and R768 are highly conserved residues with conservation score equal to 9 and also predicted to be exposed and functional. M389 and L731 are slightly conserved residues and buried. R219, P389, and D427 are variable residues and exposed, and finally one residue (R716) is average (conservation score equal to 5). Results of ConSurf prediction of ACE2 SNPs are summarized in Figure 1.
3.5. Prediction of Posttranslational Modification (PTM) Sites
ModPred was used to predict the effect of nsSNPs on posttranslational modification (PTM) process of the human ACE2 protein. ModPred identified sites for methylation (R219), ADP-ribosylation (R219, R708, and R768), and Proteolytic cleavage (P389, D427, R708, R710, R716, and R768). The results of PTM sites prediction are shown in Table 4.
Table 4.
Residue | Modification | Score | Confidence |
---|---|---|---|
R219 | Proteolytic cleavage | 0.70 | Medium |
ADP-ribosylation | 0.67 | Medium | |
Methylation | 0.76 | Medium | |
P389 | Proteolytic cleavage | 0.52 | Low |
D427 | Proteolytic cleavage | 0.73 | Medium |
R708 | Proteolytic cleavage | 0.76 | Medium |
ADP-ribosylation | 0.67 | Medium | |
R710 | Proteolytic cleavage | 0.54 | Low |
R716 | Proteolytic cleavage | 0.77 | Medium |
R768 | Proteolytic cleavage | 0.84 | Medium |
ADP-ribosylation | 0.66 | Low |
3.6. Statistical Analysis and Cumulative Score Calculation of Pathogenic nsSNPs
As a result of combining seven algorithms, the amino acid substitution, namely, R708W, was scored as the most deleterious nonsynonymous SNP with a cumulative score of 7 by all the seven tools, while 6 (R219H, D427Y, R514G, R710C, R716C, and R768W), 4 (R219C, M383T, P389H, and R710H), and 1 (L731F) variant got a cumulative score of 6, 5, and 4, respectively (Table 5). Among these variants, six nsSNPs (D427Y, R514G, R708W, R710C, and R768W) are newly evaluated as ACE2 high-risk pathogenic nsSNPs and were selected for further investigation. The prediction from SIFT, PROVEAN, PolyPhen 2.0, PANTHER, ModPred, I. Mutant, and MUpro were shown to be significant with a p value equal to 6.1308E-29 of ANOVA test and correlated (Supplementary figure 1). Student t-test results between the software tools were significant with a p value less than 0.0001, suggesting that the selected algorithms are accurate enough to evaluate the pathogenicity of these nsSNPs.
Table 5.
Mutation | SIFT | PolyPhen | PROVEAN | PANTHER | I.Mutant | MUpro | ModPred | Cumulative score |
---|---|---|---|---|---|---|---|---|
R219C | 1 | 1 | 0 | 1 | 0 | 1 | 1 | 5 |
R219H | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 6 |
M383T | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 5 |
P389H | 1 | 1 | 1 | 1 | 0 | 1 | 0 | 5 |
D427Y∗∗ | 1 | 1 | 1 | 0 | 1 | 1 | 1 | 6 |
R514G∗∗ | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 6 |
R708W∗ | 1 | 1 | 1 | 1 | 1 | 1 | 1 | 7 |
R710C∗∗ | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 6 |
R710H | 1 | 1 | 0 | 1 | 1 | 1 | 0 | 5 |
R716C∗∗ | 1 | 1 | 0 | 1 | 1 | 1 | 1 | 6 |
L731F | 1 | 1 | 0 | 1 | 0 | 1 | 0 | 4 |
R768W∗∗ | 1 | 1 | 1 | 1 | 1 | 1 | 0 | 6 |
∗The highly pathogenic nsSNPs by all the seven tools, ∗∗the highly pathogenic nsSNPs by at least 6 tools, 0 indicates neutral prediction, 1 indicates deleterious prediction.
3.7. 3D Modelling and Biophysical Validation of ACE2
The crystal structure of the ACE2 protein was retrieved from the Protein Data Bank (PDB ID: 6 M17, resolution at 2.9 Å) in complex with SARS-CoV-2 receptor-binding domain and sodium-dependent neutral amino acid transporter B(0)AT1. The structure was used as a template for the comparative modelling of mutant structures through SWISS-MODEL by submitting the FASTA AA sequence of ACE2 (percent identity equal to 100). The QMEAN server was used to predict the quality of the models, where the global QMEAN scores were 0.83 ± 0.05. This indicated that the predicted models were of good quality (Table 6). The six tested mutants, namely, D427Y, R514G, R708W, R710C, R716C, and R768W, were subjected to PyMol using “Align command” and compared to the native structure with regard to conformational variations by calculating the RMSD (root mean square deviation) (Table 7 and Figure 2). Project HOPE was used to identify the structural effects of mutations of interest. Results of project-HOPE of ACE2 SNPs were displayed in Table 8.
Table 6.
SNPs | QMEAN | CBeta interaction energy | All-atom pairwise energy | Solvation energy | Torsion energy |
---|---|---|---|---|---|
D427Y | -1.09 | 1.08 | 1.05 | 0.76 | -1.52 |
R514G | -1.27 | 0.39 | 0.78 | 0.52 | -1.50 |
R708W | -1.28 | 0.33 | 0.70 | 0.52 | -1.50 |
R710C | -1.37 | 0.34 | 0.74 | 0.42 | -1.55 |
R716C | -1.33 | 0.42 | 0.76 | 0.45 | -1.54 |
R768W | -1.09 | 1.01 | 1.05 | 0.74 | -1.50 |
CBeta interaction energy: distance-dependent potential using CBeta atoms as interaction center; All-atom pairwise energy: assessment of long-range interactions; Solvation energy: description of the burial status of the residues; Torsion energy: analysis of the local backbone geometry [32].
Table 7.
Position | Variant | Align | RMSD (Å) |
---|---|---|---|
427 | D427Y | 6 m17.1. E | 0.010 |
514 | R514G | 6 m17.1. E | 0.112 |
708 | R708W | 6 m17.1. E | 0.112 |
710 | R710C | 6 m17.1. E | 0.112 |
716 | R716C | 6 m17.1. E | 0.112 |
768 | R768W | 6 m17.1. E | 0.010 |
Table 8.
Residue | Structure | Properties |
---|---|---|
D427Y | (i) The WT is predicted to be located in its preferred secondary structure, a turn the mutant prefers to be in another secondary structure, therefore the local conformation will be slightly destabilized (ii) Mutation of the WT into none has the following effect SPD-PSN: slightly inhibits interaction with SARS-CoV spike gp (iii) The mutation is possibly damaging to the protein |
|
R514G | (i) The mutation introduces a glycine at this position. Glycine is very flexible and can disturb the required rigidity of the protein (ii) Residues in the vicinity of the mutated residue are annotated in the UniProt as being a binding site (iii) The mutation could affect the local structure and as a consequence affect the binding site (iv) The mutation is possibly not damaging to the protein (v) The mutant is smaller than the WT; this might lead to loss of interaction |
|
R708W | (i) The mutant is more hydrophobic than the WT (ii) The WT charge was positive, and the mutant charge is neutral (iii) The mutant is bigger; this might lead to bumps |
|
R710C | (i) The mutant is more hydrophobic than the WT (ii) The WT is very conserved (iii) The mutant is smaller; this might lead to loss of interaction |
|
R716C | (i) The mutant is smaller than the WT (ii) The mutation is located within a stretch of residues annotated in UniProt as a special region: essential of cleavage by TMPRSS11D and TMPRSS2 (iii) The difference in amino acid properties can disturb this region and its function |
|
R768W | (i) The WT is very conserved. In some rare cases, the mutation might occur without damaging the protein (ii) The mutant is bigger than the WT; this might lead to bumps (iii) The mutation introduces a more hydrophobic residue at this position; this can result in loss of hydrogen bonds and or disturb correct folding |
3.8. Analysis of Protein-Protein Interaction
STRING prediction showed that ACE2 interacts with angiotensin II receptor type 1 (AGTR1), angiotensin II receptor type 2 (AGTR2), prolyl carboxypeptidase (PRCP), renin (REN), angiotensinogen (AGT), membrane metalloendopeptidae (MME), dipeptidyl peptidase 4 (DPP4), meprin A subunit alpha (MEP1A), meprin A subunit beta (MEP1B), and X-prolyl aminopeptidase 2 (XPNPEP2) (Figure 3).
4. Discussion
Angiotensin-converting enzyme 2 (ACE2) plays an important role in the renin-angiotensin aldosterone system by metabolizing angiotensin II to angiotensin (1-7) [11]. This important enzyme was identified as a functional receptor for the severe acute respiratory syndrome coronavirus (SARS-CoV) and the novel coronavirus (SARS-CoV-2) [11]. Some studies suggest that the receptor-binding domains of SARS-CoV-2-S and SARS-CoV-S bind with identical affinities to ACE2 [25]. It has been shown that ACE2 genomic variants may play a key role in susceptibilities to COVID-19 [33].
The SARS-CoV-2 starts its infection by binding to ACE2 via its receptor-binding domain (RDB) [34]. Recently, it has been reported that the SARS-CoV-2 uses two mechanisms of host cell entry: the first mechanism on the ACE2 mediated virus endocytosis by using the clathrin- and caveolae-dependent pathways. The second one is dependent on the transmembrane serine protease 2- (TMPRSS2-) mediated membrane fusion [34].
The study by Shang et al. 2020 [35], Walls et al. [36], Wrapp et al. 2020, and Yan et al. 2020 have demonstrated that there are important ACE2 residues, namely, S19, Q24, T27, F28, D30, K31, H34, E35, E37, R357, E329, N330, K353, G354, D355, R357, P389, and R393, that play crucial roles in mediating the SARS-CoV-2 spike protein-ACE2 interaction. Other results were broadly in line with ACE2 variants that are identified to alter the virus host interaction and consequently alter susceptibility to SARS-CoV-2 [37]. Some of them, like S19P, I21V, E23K, K26R, T27A, N64K, T92I, Q102P, and H378R, are predicted to increase susceptibility and therefore render individuals more susceptible to the SARS-CoV-2 [37]. Other ACE2 variants such K31R, N35I, H34R, E35K, E37K, D38V, Y50F, N51S, M62V, K68E, F72V, Y83H, G326H, G352V, D355N, Q388L, and D509Y are predicted to decrease susceptibility and render individuals more resistant to the SARS-CoV-2 [37]. A number of 61 deleterious variants in the ACE2 gene including R219C, R219H, M383T, P389H, D427Y, R514G, R708W, R710H, R710C, R716C, L731F, and R768W, have been reported to influence susceptibility to COVID-19 (Hou et al. 2020). To confirm these results, the twelve variants, namely, R219C, R219H, M383T, P389H, D427Y, R514G, R708W, R710H, R710C, R716C, L731F, and R768W, were tested. This in silico analysis might be helpful in understanding the effect of missense variants on protein structure, function, and stability of ACE2 in relation with COVID-19.
A 2020 study by Fahd Al-Mulla et al. has shown that the arginine residues at 708 and 716 positions play an important role in ACE2 cleavage by TMPRSS2 and TMPRSS11D, whereas mutations on R708 and R716 seem to reduce directly ACE2 cleavage by TMPRSS2 [32]. Another line of research (Hossein Lanjanian et al. 2021) suggests that the arginine residue at 710 position of the ACE2 receptor plays a crucial role in mediating ACE2-TMPRSS2 interaction by involving hydrogens bonds [38]. A second study achieved by Behrooz Darbani (2020) has identified the rs1316056737 (R514G) as an interaction inhibitor variant that might impact the interaction between the ACE2 receptor and the viral spike protein [39].
In the current study, various algorithms, namely, SIFT, PolyPhen 2.0, PROVEAN, PANTHER, I. Mutant Suite, MUpro, and ConSurf, were used to identify the most deleterious nsSNPs of the ACE2 gene. In our in silico analysis, we identified six nsSNPs (D427Y, R514G, R708W, R710C, R716C, and R768W) from twelve nsSNPs. The six nsSNPs were predicted to be deleterious by at least six algorithms, decreased the stability of ACE2 by both MUpro and I. Mutant, and located in conserved regions with a score conservation of 9, expected D427Y, which may affect the structure and function of ACE2 protein.
ModPred was used to determine the posttranslational modification (PTM) sites of ACE2 identified R710 and R716 as PTM sites for proteolytic cleavage and R708 for ADP-ribosylation and proteolytic cleavage. Consequently, mutations at R708, R710, and R716 might affect PTM of the ACE2 gene.
The structural deviations between the native and modelled mutants were analyzed using the RMSD by measuring the average distance between the atoms of the superimposed proteins [40]. It has also been shown that RMSD values greater than 0.15 were evaluated to be significant and would have an impact on protein function and structure [40]. Therefore, the six models R427Y, R514, R708W, R710C, R716C, and R768W showed lower RMSD values, i.e., 0.010, 0.112, 0.112, 0.112, 0.112, and 0.010, respectively, which indicate minimal structural dissimilarity between the native and mutant models of ACE2.
According to HOPE, the AA substitution (R514G) introduces a glycine (flexible) and could disturb the required rigidity of the ACE2 protein and could affect the binding site where the mutation being located, while the mutation R716C is located in a region essential for cleavage by TMPRSS11D and TMPRSS2 proteases, and the difference in AA properties can disturb this region and its function.
Furthermore, by using the STRING server, ACE2 protein interacts with ten various proteins, namely, AGTR1, AGTR2, PRCP, REN, AGT, MME, DPP4, MEP1A, MEP1B, and XPNPEP2. The majority of those proteins play a crucial role in the regulation of blood pressure and kidney pathways, suggesting the association of ACE2 in kidney diseases. All the ten various proteins had a score ranging from 0.991 to 0.858, suggesting that proteins are partially biologically connected with an PPI enrichment p value equal to 1.29E-09. The proteins AGT, REN, AGTR1, and AGTR2 were found to have an important role in RAS signalling pathways such as regulation of blood pressure, body fluid and electrolyte homeostasis, and sodium retention by the kidney. The interaction analysis using the STRING server suggested that the ACE2 protein not only participate in hypertension and cardiovascular diseases but also a candidate in COVID-19 infection.
However, these findings showed that D427Y, R514G, R708W, R710C, R716C, and R768W, are the structurally and functionally most significant variants in the human ACE2 gene. (Table 7). It has been found that the residues ARG 708, 710, and 716, are located in the dimeric interface of human ACE2, and they are found to be important for its cleavage by the protease TMPRSS2, and this processing is indispensable for augmentation of SARS-S-driven entry into host cells [9]. The limitation of this in silico analysis is that the six deleterious variants should be confirmed with future extensive experiments and clinical wet lab approaches to figure out the mechanism of these mutations in susceptibility to COVID-19.
5. Conclusion
Our present in silico analysis of nsSNPs of the human ACE2 gene concluded that the mutations D427Y, R514G, R708W, R710C, R710H, and R716C are the most deleterious nsSNPs among the reported ACE2 gene variants. All six nsSNPs were predicted to be damaging and also affecting conserved positions, protein stability, and the posttranslational modification sites. Therefore, these mutations would likely affect the function on ACE2 with regard to host susceptibility to SARS-CoV-2 infection. Hence, the results of this study confirm previous findings and may be helpful for further understanding the role of these six ACE2 nsSNPs in susceptibility to COVID-19.
Acknowledgments
The authors are thankful to the Pasteur Institute of Morocco for providing encouragement and facilities.
Data Availability
All data used in this study are included within the article and supplementary information file.
Conflicts of Interest
The authors declare that they have no conflicts of interest.
Authors' Contributions
Anass Kettani and Lahcen Wakrim contributed equally to this work.
Supplementary Materials
References
- 1.Astuti I., Ysrafil Severe Aacute Rrespiratory Ssyndrome Ccoronavirus 2 (SARS-CoV-2): Aan overview of viral structure and host response. Diabetes & Metabolic Syndrome. 2020;14(4):407–412. doi: 10.1016/j.dsx.2020.04.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Zhang H., Penninger J. M., Li Y., Zhong N., Slutsky A. S. Angiotensin-converting enzyme 2 (ACE2) as a SARS-CoV-2 receptor: molecular mechanisms and potential therapeutic target. Intensive Care Medicine. 2020;46(4):586–590. doi: 10.1007/s00134-020-05985-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Li M., Li L., Zhang Y., Wang X. Expression of the SARS-CoV-2 cell receptor gene ACE2 in a wide variety of human tissues. Infectious Diseases of Poverty. 2020;9(1):p. 45. doi: 10.1186/s40249-020-00662-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Hoffmann M., Kleine-Weber H., Schroeder S., et al. SARS-CoV-2 Ccell Eentry Ddepends on ACE2 and TMPRSS2 and Iis Bblocked by a Cclinically Pproven Pprotease Iinhibitor. Cell. 2020;181(2):271–280.e8. doi: 10.1016/j.cell.2020.02.052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Devaux C. A., Rolain J., Raoult D. ACE2 receptor polymorphism: Susceptibility to SARS-CoV-2, hypertension, multi- organ failure, and COVID-19 disease outcome. Journal of Microbiology, Immunology and Infection. 2020;53(3):425–435. doi: 10.1016/j.jmii.2020.04.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Itoyama S., Keicho N., Hijikata M., et al. Identification of an alternative 5’-untranslated exon and new polymorphisms of angiotensin-converting enzyme 2 gene: Lack of association with SARS in the Vietnamese population. American Journal of Medical Genetics Part A. 2005;136A(1):52–57. doi: 10.1002/ajmg.a.30779. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Clarke N. E., Hooper N. M., Turner A. J. Angiotensin-Converting Enzyme-2. Handbook of Proteolytic Enzymes. 2012;2013:499–504. doi: 10.1016/B978-0-12-382219-2.00100-9. [DOI] [Google Scholar]
- 8.Boehm M., Nabel E. G. Angiotensin-Converting Enzyme 2 — A New Cardiac Regulator. The New England Journal of Medicine. 2002;347(22):1795–1797. doi: 10.1056/NEJMcibr022472. [DOI] [PubMed] [Google Scholar]
- 9.Crackower M. A., Sarao R., Oudit G. Y., et al. Angiotensin-converting enzyme 2 is an essential regulator of heart function. Nature. 2002;417(6891):822–828. doi: 10.1038/nature00786. [DOI] [PubMed] [Google Scholar]
- 10.Tikellis C., Thomas M. C. Angiotensin-converting enzyme 2 (ACE2) is a key modulator of the renin angiotensin system in health and disease. International Journal of Peptides. 2012;2012:8. doi: 10.1155/2012/256294.256294 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Shenoy V., Ferreira A. J., Katovich M., Raizada M. K. Angiotensin-Converting Enzyme 2/Angiotensin-(1-7)/Mas Receptor Axis: Emerging Pharmacological Target for Pulmonary Diseases. The Protective Arm of the Renin Angiotensin System (RAS) 2015;2015:269–274. doi: 10.1016/B978-0-12-801364-9.00038-9. [DOI] [Google Scholar]
- 12.Silva A. C. S. E., Silveira K. D., Ferreira A. J., Teixeira M. M. ACE2, angiotensin-(1-7) and Mas receptor axis in inflammation and fibrosis. British Journal of Pharmacology. 2013;169(3):477–492. doi: 10.1111/bph.12159. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ng P. C., Henikoff S. SIFT: predicting amino acid changes that affect protein function. Nucleic Acids Research. 2003;31(13):3812–3814. doi: 10.1093/nar/gkg509. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Adzhubei I., Jordan D. M., Sunyaev S. R. Predicting functional effect of human missense mutations using PolyPhen-2. Current Protocols in Human Genetics. 2013;76(1) doi: 10.1002/0471142905.hg0720s76. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Tang H., Thomas P. D. PANTHER-PSEP: predicting disease-causing genetic variants using position-specific evolutionary preservation. Bioinformatics. 2016;32(14):2230–2232. doi: 10.1093/bioinformatics/btw222. [DOI] [PubMed] [Google Scholar]
- 16.Choi Y., Chan A. P. PROVEAN web server: a tool to predict the functional effect of amino acid substitutions and indels. Bioinformatics. 2015;31(16):2745–2747. doi: 10.1093/bioinformatics/btv195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Cheng J., Randall A., Baldi P. Prediction of protein stability changes for single-site mutations using support vector machines. Proteins: Structure, Function, and Bioinformatics. 2006;62(4):1125–1132. doi: 10.1002/prot.20810. [DOI] [PubMed] [Google Scholar]
- 18.Capriotti E., Fariselli P., Casadio R. I-Mutant2.0: predicting stability changes upon mutation from the protein sequence or structure. Nucleic Acids Research. 2005;33(Web Server):W306–W310. doi: 10.1093/nar/gki375. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Ashkenazy H., Abadi S., Martz E., et al. ConSurf 2016: an improved methodology to estimate and visualize evolutionary conservation in macromolecules. Nucleic Acids Research. 2016;44(W1):W344–W350. doi: 10.1093/nar/gkw408. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Pejaver V., Hsu W.-L., Xin F., Dunker A. K., Uversky V. N., Radivojac P. The structural and functional signatures of proteins that undergo multiple events of post-translational modification. Protein Science. 2014;23(8):1077–1093. doi: 10.1002/pro.2494. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.AlGhamdi N. A., Alsuwat H. S., Borgio J. F., AbdulAzeez S. Emerging of composition variations of SARS-CoV-2 spike protein and human ACE2 contribute to the level of infection:in silicoapproaches. Journal of Biomolecular Structure and Dynamics. 2020;2020:1–12. doi: 10.1080/07391102.2020.1841032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.AbdulAzeez S., Sultana S., Almandil N. B., Almohazey D., Bency B. J., Borgio J. F. The rs61742690 (S783N) single nucleotide polymorphism is a suitable target for disrupting BCL11A-mediated foetal-to-adult globin switching. PLoS One. 2019;14(2):p. e0212492. doi: 10.1371/journal.pone.0212492. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Biasini M., Bienert S., Waterhouse A., et al. SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information. Nucleic Acids Research. 2014;42(W1):W252–W258. doi: 10.1093/nar/gku340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Benkert P., Biasini M., Schwede T. Toward the estimation of the absolute quality of individual protein structure models. Bioinformatics. 2011;27(3):343–350. doi: 10.1093/bioinformatics/btq662. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Studer G., Biasini M., Schwede T. Assessing the local structural quality of transmembrane protein models using statistical potentials (QMEANBrane) Bioinformatics. 2014;30(17):i505–i511. doi: 10.1093/bioinformatics/btu457. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Studer G., Rempfer C., Waterhouse A. M., Gumienny R., Haas J., Schwede T. QMEANDisCo—distance constraints applied on model quality estimation. Bioinformatics. 2020;36(6):1765–1771. doi: 10.1093/bioinformatics/btz828. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Benkert P., Künzli M., Schwede T. QMEAN Server for protein model quality estimation. Nucleic acids research. 2009;37(suppl_2):W510–W514. doi: 10.1093/nar/gkp322. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Yuan S., Chan H. C. S., Hu Z. UsingPyMOLas a platform for computational drug design. Wiley Interdisciplinary Reviews-Computational Molecular Science. 2017;7(2) doi: 10.1002/wcms.1298. [DOI] [Google Scholar]
- 29.Meng E. C., Pettersen E. F., Couch G. S., Huang C. C., Ferrin T. E. Tools for integrated sequence-structure analysis with UCSF Chimera. BMC Bioinformatics. 2006;7(1):p. 339. doi: 10.1186/1471-2105-7-339. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Venselaar H., Beek T. A. H. T., Kuipers R. K. P., Hekkelman M. L., Vriend G. Protein structure analysis of mutations causing inheritable diseases. An e-Science approach with life scientist friendly interfaces. BMC Bioinformatics. 2010;11(1) doi: 10.1186/1471-2105-11-548. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Szklarczyk D., Franceschini A., Wyder S., et al. STRING v10: protein–protein interaction networks, integrated over the tree of life. Nucleic Acids Research. 2015;43(D1):D447–D452. doi: 10.1093/nar/gku1003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Al-Mulla F., Mohammad A., Al Madhoun A., et al. A comprehensive germline variant and expression analyses of ACE2, TMPRSS2 and SARS-CoV-2 activator FURIN genes from the Middle East: cCombating SARS-CoV-2 with precision medicine. BioRxiv. 2020;99176:1–28. [Google Scholar]
- 33.Hou Y., Zhao J., Martin W., et al. New insights into genetic susceptibility of COVID-19: an ACE2 and TMPRSS2 polymorphism analysis. BMC Medicine. 2020;18(1):216–216. doi: 10.1186/s12916-020-01673-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Bian J., Li Z. Angiotensin-converting enzyme 2 (ACE2): SARS-CoV-2 receptor and RAS modulator. Acta Pharmaceutica Sinica B. 2021;11(1) doi: 10.1016/j.apsb.2020.10.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Shang J., Ye G., Shi K., et al. Structural basis of receptor recognition by SARS-CoV-2. Nature. 2020;581(7807):221–224. doi: 10.1038/s41586-020-2179-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Walls A. C., Park Y., Tortorici M. A., Wall A., McGuire A. T., Veesler D. Structure, Function, and Antigenicity of the SARS-CoV-2 Spike Glycoprotein. Cell. 2020;181(2):281–292.e6. doi: 10.1016/j.cell.2020.02.058. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Stawiski E. W., Diwanji D., Suryamohan K., et al. Human ACE2 receptor polymorphisms predict SARS-CoV-2 susceptibility. bioRxiv; 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Lanjanian H., Moazzam-Jazi M., Hedayati M., et al. SARS-CoV-2 infection susceptibility influenced by _ACE2_ genetic polymorphisms: insights from Tehran Cardio-Metabolic Genetic Study. Scientific Reports. 2021;11(1):p. 1529. doi: 10.1038/s41598-020-80325-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Darbani B. The expression and polymorphism of entry machinery for covid-19 in human: juxtaposing population groups, gender, and different tissues. International Journal of Environmental Research and Public Health. 2020;17(10):p. 3433. doi: 10.3390/ijerph17103433. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Solayman M., Saleh M. A., Paul S., Khalil M. I., Gan S. H. In silico analysis of nonsynonymous single nucleotide polymorphisms of the human adiponectin receptor 2 ( ADIPOR2 ) gene. Computational Biology and Chemistry. 2017;68:175–185. doi: 10.1016/j.compbiolchem.2017.03.005. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All data used in this study are included within the article and supplementary information file.