Version Changes
Revised. Amendments from Version 1
The reviewers' suggestions were accepted and added to the current version of the manuscript. New results, such as the summary of diseased SNPs predicted from Meta-SNP in SARS-CoV-2 mutations, Lipinski’s rule of five, and the ADMET profile of the compounds analyzed in the current study, were included. The inclusion of this new analysis drove changes mainly in the results and discussion section of the manuscript. Also, new supplementary data is shared with the readers. In the discussion section, new insights into the literature and previous results from our research groups allowed us to highlight the novelty and importance of our results.
Abstract
Background
Although the COVID-19 pandemic has diminished in intensity, the virus continues to circulate globally. The SARS-CoV-2 main protease (Mpro) is a key enzyme in the life cycle of the virus, making it important for the development of treatments against future variants of the virus. In this work, Peruvian natural compounds were evaluated against different mutations of the SARS-CoV-2 Mpro.
Methods
In silico techniques such as virtual screening, all-atom molecular dynamics simulations, and energy estimation analysis were applied.
Results
Of the tested compounds by virtual screening, rutin was identified as the best binding agent against the different proposed Mpro mutations. In addition, computational simulations and energy estimation analysis demonstrated the high structural and energetic stability between the Mpro-rutin systems.
Conclusions
Overall, our study identified rutin as the most promising compound with a strong affinity for various Mpro mutations, potentially playing a key role in the development of new treatments for emerging viral variants.
Keywords: Main protease, mutations, SARS-CoV-2, Peruvian sources, rutin
Introduction
By the end of 2019, the world experienced the outbreak of the COVID-19 pandemic, which swiftly spread across communities and healthcare systems, causing widespread infections. 1 The pandemic, caused by SARS-CoV-2 (Severe Acute Respiratory Syndrome Coronavirus 2), marked a significant global health crisis. 2 , 3 Even in 2024, the effects of the virus persist, with ongoing concerns about public health and economic recovery. 4 , 5 Despite vaccination efforts and advances in treatments, COVID-19 continues to affect vulnerable populations, and the emergence of new variants remains a challenge for health systems worldwide. By late 2020, multiple variants of SARS-CoV-2 had emerged and spread rapidly, 6 , 7 and new variants have continued to evolve, further complicating global response efforts through 2024.
The SARS-CoV-2 main protease (Mpro) is a critical enzyme that plays a pivotal role in viral replication and transcription. 8 Upon entering the host cell, the viral RNA is translated into large polyproteins, which Mpro cleaves at specific sites to release non-structural proteins (nsps) essential for the virus’s replication. 9 , 10 Mpro specifically processes polyprotein 1ab at multiple cleavage sites and hydrolyzes the Gln-Ser peptide bond within the Leu-Gln-Ser-Ala-Gly recognition sequence. This cleavage site is unique compared to those recognized by other human cysteine proteases known to date. 11 As a result, Mpro has become a prime therapeutic target, with its inhibition being a promising approach to halting viral translation and replication. 12 Structurally, Mpro consists of three domains: domains I (residues 8-101), II (residues 102-184), and III (residues 201-306). 8
Likewise, several researchers have highlighted the importance of studying the stability of the Mpro structure while taking mutations into account, as this can complicate the identification of specific inhibitors. 13 These variants are characterized by changes in the amino acid sequence of the virus compared to the first sequenced strain, Wuhan-Hu-1 (GenBank accession: NC_045512.2). The variants may contain one or more mutations that distinguish them from the wild type. 14 Tracking and evaluating the spread of SARS-CoV-2 genetic variations in different countries is crucial.
It is important to note that registered mutations may alter the binding mechanisms of potential inhibitors, leading to possible resistance. 15 Therefore, it is essential to anticipate the effects of these mutations and identify new inhibitors to counteract them.
In the absence of a specific drug and with the emergence of new mutations, various studies are evaluating the potency of numerous phytochemicals in restricting the replication of SARS-CoV-2 and other viral infections. 16 Phytocompounds are considered promising drug candidates due to their high bioavailability and low toxicity. 17 Similarly, in silico studies have demonstrated the potent inhibitory action of taraxerol, found in Clerodendrum spp., a plant used in traditional medicine in tropical regions of Asia, against SARS-CoV-2 Mpro. 18 Additionally, β-amyrin and stigmasta-5,22-dien-3-ol, present in Cyperus rotundus L., a plant commonly used in traditional Indian medicine, have also shown inhibitory potential. 19
Peru is one of the 12 nations with the highest levels of biodiversity, which has allowed a rich tradition of medicinal practices to flourish and endure over time. 20 The Vavilov Institute recognizes this region as a global center for plant biodiversity. 21 The 20,000 to 30,000 plant species found across its diverse ecosystems account for approximately 10% of all plants used in medicine worldwide. 22
This study makes significant contributions to the ongoing research on novel SARS-CoV-2 Mpro inhibitors. First, it highlights natural compounds derived from Peru’s rich biodiversity, an underexplored resource in prior studies. Furthermore, we have assessed the efficacy of these compounds against eight specific Mpro mutations (Y54C, 23 N142S, 23 T190I, 23 A191V, 23 S139A, 24 R298A, 24 R60C, 15 and G11A 25 ), offering a comprehensive analysis of their interactions with various mutant variants of SARS-CoV-2 Mpro. Using advanced computational approaches—such as virtual screening, molecular dynamics simulations, and binding free energy estimation via the Molecular Mechanics/Generalized Born Surface Area (MM/GBSA) method—we conducted an in-depth evaluation of the structural stability and inhibitory potential of the identified compounds. Notably, our research highlights the high structural stability and potent inhibitory effects of rutin in the Mpro-rutin system. Computational simulations revealed that rutin forms stable, long-lasting interactions with the Mpro active site, underscoring its promise as a potential therapeutic candidate for COVID-19 treatment.
Computational details
Proteins preparation
In this study, we analyzed eight critical mutations of the SARS-CoV-2 Mpro protein, namely Y54C, N142S, T190I, A191V, S139A, G11A, R298A, and R60C. These mutations were chosen for their potential impact on the core protease’s structure and function. The Y54C mutation, identified in Malaysia, and N142S, reported in various countries, were selected because they could potentially alter the N-terminal domain’s stability and the catalytic loop’s flexibility, respectively. The T190I mutation affects the interactions within the substrate binding site, while A191V influences the dimerization dynamics, a crucial process for Mpro’s functionality. The S139A, G11A, and R298A mutations, which result in the complete loss of dimerization, are essential for Mpro’s proteolytic activity. Lastly, the R60C mutation, found in Brazil and Vietnam, affects the protein’s dynamics and the inhibitor’s ability to bind to the active site. The crystal structure of SARS-CoV-2 (PDB ID: 5RE4) reported in the Protein Data Bank ( https://www.rcsb.org/ pdb/) was used. Subsequently, mutated protein sequences were prepared by replacing the amino acids at positions R298A, N142S, A191V, R60C, G11A, Y54C, T190I, and S139A. These sequences were generated by homology modelling on the SWISS-MODEL server ( https://swissmodel.expasy.org) using the crystal structure of SARS-CoV-2 Mpro (PDB ID: 5RE4) as a template.
Preparation of the virtual database and screening
The search for natural products was performed at the Peruvian Natural Products Database (PeruNPDB) 26 online web server (first version) ( https://perunpdb.com.pe/, accessed on 23 January 2022) whereas the simplified molecular-input line-entry system (SMILE) of each compound of was the upload into OpenBabel within the Python Prescription Virtual Screening Tool (PyRx) 27 and the subjection to energy minimization; whereas PyRx performs structure-based virtual screening by applying docking simulations using the AutoDock Vina tool. 28 Likewise, the FASTA sequence of the Crystal Structure of SARS-CoV-2 main protease (Mpro) (PDB: 5RE4) was subjected to a BLAST 29 search (accessed on 16 April 2022) whereas all the mutants were selected and subjected to automated modeling in SWISS-MODEL 30 server (accessed on 17 April 2022). For the analysis, the search space encompassed the whole of the modeled 3D models; and the docking simulation was then run at an exhaustiveness of eight and set to only output the lowest energy pose. Multiple sequence alignments of the Mpro and mutant sequences were visualized using the msa package (version 1.22.0) 31 in the R programming environment (version 4.0.3). The heatmap plot was generated using GraphPad Prism version 9.4.0 for Windows, GraphPad Software, San Diego, California USA ( www.graphpad.com).
Molecular dynamics simulation and Molecular Mechanics/Generalized Born Surface Area (MM/GBSA) calculation
The simulation of the motion is realized by the numerical solution of the classical Newtonian dynamic equations. We used Gromacs v. 2020 32 to calculate the molecular dynamics (MD) simulation and the AMBER-99SB-ILDN force field. The topologies for the Amber force field were determined on the ACPYPE server ( https://www.bio2byte.be/acpype/) for the best metabolite against mutates Mpro. Each system was included in the centre of a cube box of 10 on each side. Likewise, water molecules were added (water model TIP4P). The energy minimization was carried out with the steep-descendent integrator with 200000 calculation steps. Herein, the MD simulation in the canonical ensemble NVT was done for a time of 1ns. Finally, the production of MD continued 100 ns in the isobaric-isothermal ensemble considering the Parrinello-Rahman barostat (1 bar) and V-rescale thermostat (309.65 K). The binding free energy estimation by MM/GBSA (Molecular Mechanics/Generalized Born Surface Area) was calculated with the suit mmpbsa.py 33 from AmberTools20 34 and gmx MMPBSA v1.4.1. 35 The equations related to calculations of binding free energies are the following:
(1) |
(2) |
(3) |
The equation that determines the electrostatic solvation energy ( ∆G GB ) considers ( ∆E MM ) which is the variation be- tween the minimized energy of the protein-ligand complexes of the study which includes the van der Waals ( ∆E vdw ) and electrostatic ( ∆E ele ) contributions, while ( ∆G SA ) is the difference in surface area energies for protein and ligand and ( −T∆S) refers to the contribution of entropy at temperature T.
Finally, the graphical visualizations were made with Visual Molecular Dynamics (VMD), 36 allowing interactive visualization with an easy-to-use interface. The interpretation of the molecular interactions was recreated with Maestro (Schrodinger) 2D interactions diagram. Likewise, the Molecular dynamics simulation results were performed by the Gromacs tools, and the values were processed by Gnuplot 5.2 ( http://gnuplot.info/) command-driven interactive function plotting program.
Results
Mutant SARS-CoV-2 Mpro description
SARS-CoV-2 Mpro is a cysteine protease of 67.6 kDa, and its structure possesses a catalytic dyad (Cys145 and His41) with a substrate-binding pocket located in a cleft between domains I and II. The secondary structure of Mpro has 10 alpha helixes, 13 beta sheets, 8 beta protrusions, 7 beta hairpins, 22 beta turns, 5 gamma turns, and 9 helix-helix interactions. In this work, we used the access code PDB ID: 5RE4 which was downloaded from the Protein Data Bank. This crystal structure was determined by the X-ray diffraction method with a resolution of 1.88 Å.
Besides, we focused on the analysis of eight mutations registered in different parts of the world. First is the Y54C mutation reported in Malaysia, and the N142S mutation was reported 17 times in 5 different countries. T190I is a mutation identified in 15 countries, such as South Africa and the USA. The mutation A191V is characterized by having an occurrence rate of 0.30% and is present in more than 34 countries. Besides, the S139A, G11A, and R298A mutation results provided a better understanding of the dimerization and catalytic mechanism of the Mpro. 24 , 37 In Brazil and Vietnam, the R60C mutation was reported, affecting the protein dynamics and the inhibitor’s binding within its active site. 15 The R298A leads to the interruption of the dimeric conformation and irreversible inhibition of the enzyme’s catalytic activity, 24 and the G11A mutation avoids the insertion of the N finger region (residues 1-9) and therefore wholly declines its activity. 38 The location of the eight mutations is shown in Figure 1.
Figure 2 shows the sequence alignment of Mpro mutations. The black square selects the variation of residues by mutant Mpro. The G11A, Y54C, and R60C mutations are located close to the His41 residue and in Domain I from Mpro. Two mutations (S139A and N142S) are present in Domain II and close to Cys145, and it is expected that these protein structures could show different behaviour than Mpro without mutations. On the other hand, it was also observed that mutations in T190I and A191V are in the connection of Domain II and Domain III. For the case of R298A mutation, it can be observed near Domain III.
The in silico prediction of disease-causing variants was conducted using the publicly accessible Meta-SNP tool ( https: //snps.biofold.org/meta-snp/). 39 This approach is distinguished by its integration of four established methods: PANTHER, PhD-SNP, SIFT, and SNAP each with predefined default threshold parameters. The prediction criteria are as follows: PANTHER, PhD-SNP, and Meta-SNP yield values between 0 and 1 (mutations with values >0.5 are predicted as disease-causing); SIFT produces a positive value (mutations with values >0.05 are predicted as neutral); and SNAP normalizes its output between 0 and 1 (mutations with values >0.5 are predicted as disease-causing). The predictions from Meta-SNP and its integrated tools provide a valuable initial assessment of the potential impact of nsSNVs (non-synonymous Single Nucleotide Variants). However, the variability in predictions underscores the importance of complementing these bioinformatic tools with experimental studies to achieve conclusive validation of each variant’s pathogenicity. Of particular interest are the Y54C and S139A mutations, which showed strong predictions towards pathogenicity and, therefore, warrant further in-depth analysis in future research (See Table 1).
Table 1. Summary of diseased SNPs predicted from Meta-SNP in SARS-CoV-2 mutations.
Mutation | PANTHER | PhD-SNP | SIFT | SNAP | Meta-SNP | RI | Profile |
---|---|---|---|---|---|---|---|
G11A | NA | Neutral | Disease | Disease | Neutral | F [G]=55% | |
- | 0.312 | 0.03 | 0.655 | 0.256 | 5 | F [A]=0% | |
Nali=46 | |||||||
Y54C | NA | Neutral | Disease | Disease | Disease | F [Y]=64% | |
- | 0.499 | 0 | 0.745 | 0.675 | 4 | F [C]=0% | |
Nali=46 | |||||||
R60C | NA | Neutral | Neutral | Disease | Neutral | F [R]=56% | |
- | 0.435 | 0.16 | 0.630 | 0.323 | 4 | F [C]=0% | |
Nali=47 | |||||||
S139A | NA | Disease | Disease | Disease | Disease | F [S]=100% | |
- | 0.535 | 0 | 0.79 | 0.73 | 5 | F [A]=0% | |
Nali=47 | |||||||
N142S | NA | Neutral | Neutral | Neutral | Neutral | F [N]=33% | |
- | 0.189 | 0.7 | 0.47 | 0.193 | 6 | F [S]=4% | |
Nali=47 | |||||||
T190I | NA | Neutral | Neutral | Neutral | Neutral | F [T]=17% | |
- | 0.214 | 0.34 | 0.39 | 0.197 | 6 | F [I]=4% | |
Nali=46 | |||||||
A191V | NA | Neutral | Neutral | Neutral | Neutral | F [A]=32% | |
- | 0.066 | 1 | 0.3 | 0.064 | 9 | F [V]=26% | |
Nali=46 | |||||||
R298A | NA | Neutral | Neutral | Disease | Neutral | F [R]=30% | |
- | 0.159 | 0.06 | 0.53 | 0.183 | 6 | F [A]=0% | |
Nali=46 |
RI=Reliability Index between 0 and 10; F [X]=Frequency of residue X in the sequence profile; Nali: Number of aligned sequences in the mutated site.
Virtual screening analysis
The Virtual Screening technique, widely used for drug discovery, seeks to identify potential compounds for a particular therapeutic target. This approach allowed us to find new possible candidates within the PeruNPDB dataset against one of the therapeutic targets from mutant Mpro of SARS-CoV-2.
The 84 substances included in the study are taken from the original PeruNPDB collection; the most recent dataset consists of 280 substances. Figure 3 shows the gradient palette, the violet color indicated strong binding ( ∆G<-12 kcal/mol), while the yellow color indicated weak binding ( ∆G>-2 kcal/mol). In this heat map, rutin is shown as the best compounds. However, for the T190I and Y54C mutations, the color intensity is lower compared to the other mutations.
In Table S1 of the Supplementary Material, the values of coupling energies are reported. The values for Mpro wild, A191V, G11A, N142S, R60C, R289C, S139A, T190I, and Y54C were -10.7 kcal/mol, -10.4 kcal/mol, -10.7 kcal/mol, -10.4 kcal/mol, - 10.7 kcal/mol, -10.7 kcal/mol, -10.7 kcal/mol, -9.4 kcal/mol, and -9.1 kcal/mol, respectively. Additionally, the results of Lipinski’s rule of five 40 and ADMET (Absorption, Distribution, Metabolism, Excretion and Toxicity) prediction obtained from http://www.scfbio-iitd.res.in/ ADMETlab v 3.0 41 of the 84 compounds are shown in Table S2 and Table S3 from the Supplementary Material. The Lipinski’s rule of five analysis for the majority of the compounds revealed no significant violations in terms of molecular weight, hydrogen bond donors, and acceptors. However, some compounds did not comply with the logP values. Further experimental determination of logP values may provide a better understanding of these discrepancies. The data obtained from the ADMET analysis were found to be comprehensive and informative, allowing for a better understanding of the properties to be considered in the present study.
Molecular dynamics simulations and estimation of binding free energy
The results obtained from virtual screening helped us to consider rutin as a ligand against the different Mpro mutations. Molecular dynamics simulations allow us to understand the behavior of different mutated Mpro at an atomistic level. After analyzing 100 ns of production dynamics, the convergence of each protein is observed by Root-mean squared deviation (RMSD) analysis (See Figure 2A). This result shows us that the different types of mutations achieved equilibrium; likewise, an average RMSD between 0.1 and 0.2 nm is appreciated, an acceptable value in this structural model. The Root-mean squared fluctuation (RMSF) calculates the flexibility of individual residues that make up the Mpro protein during a simulation trajectory. The RMSF per residue diagram structurally indicates which amino acids in a protein contribute the most to a molecular motion. Figure 4B highlights the area of His41 and Cys145 amino acids where the most significant fluctuation in the His41 area occurs with the R298A mutation, while the most significant fluctuation in the Cys145 area occurred in the R60C, Y54C, R298A, and N142S mutation.
On the other hand, Table 2 shows us the quantitative values of the RMSD, where the G11A and R60C mutations showed the lowest average RMSD value. In contrast, the average value for the Y54C mutation was higher than the others. Regarding the average RMSF values of the last 5 ns, for the 5RE4, A191V, G11A, N142S, R60C, S139A, and T190I systems, the average RMSF of the Mpro structures oscillated by 0.8 nm, while for R298A the RMSF average results in 0.09 nm and Y54C it was 0.07 nm.
Table 2. RMSD and RMSF average values of SARS-CoV-2 Mpro wild and mutated.
System | RMSD (nm) | RMSF (nm) |
---|---|---|
5RE4 | 0.15 ±0.02 | 0.08 ±0.03 |
A191V | 0.16 ±0.02 | 0.08 ±0.03 |
G11A | 0.14 ±0.02 | 0.08 ±0.04 |
N142S | 0.16 ±0.02 | 0.08 ±0.03 |
R60C | 0.14 ±0.01 | 0.08 ±0.03 |
R298A | 0.17 ±0.03 | 0.09 ±0.03 |
S139A | 0.15 ±0.02 | 0.08 ±0.03 |
T190I | 0.17 ±0.02 | 0.08 ±0.03 |
Y54C | 0.18 ±0.03 | 0.07 ±0.03 |
Besides virtual screening and molecular dynamics simulation studies, molecular mechanics/generalized Born surface area (MM/GBSA) was performed with all frames of the MD. Table 3 indicates the average free energy values for each system. The values show a high coupling energy estimate, indicating that the interaction was carried out correctly.
Table 3. Calculated Molecular Mechanics/Generalized Born Surface Area (MM/GBSA) binding free energy of the systems.
System | ∆ TOTAL | VDWAALS | EEL | EGB | ∆G gas | ∆G solv |
---|---|---|---|---|---|---|
5RE4 | -33.49±7.32 | -44.09±6.48 | -33.36±16.91 | 50.02±9.38 | -77.45±14.88 | 43.96±9.18 |
A191V | -40.71±6.07 | -47.19±4.43 | -43.39±10.75 | 55.74±7.21 | -90.58±12.04 | 49.88±7.03 |
G11A | -41.17±4.48 | -46.51±3.29 | -42.57±10.84 | 53.40±7.96 | -89.08±10.55 | 47.91±7.78 |
N142S | -36.65±3.82 | -54.18±3.16 | -26.84±6.44 | 51.02±5.15 | -81.02±6.77 | 44.37±5.10 |
R60C | -45.09±7.29 | -48.90±5.88 | -49.45±12.23 | 59.24±7.93 | -98.35±13.21 | 53.26±7.10 |
R298A | -24.11±8.41 | -36.30±8.91 | -21.09±12.85 | 37.81±12.91 | -57.39±19.44 | 33.28±11.74 |
S139A | -25.84±2.95 | -45.56±2.63 | -18.41±7.38 | 43.84±5.14 | -63.97±6.74 | 38.13±5.11 |
T190I | -35.87±5.80 | -48.17±5.82 | -30.99±8.97 | 49.32±5.99 | -79.15±9.71 | 43.29±5.81 |
Y54C | -34.53±5.62 | -28.04±5.52 | -66.78±10.26 | 65.39±6.59 | -94.82±10.55 | 60.29±6.36 |
The mutation R60C showed the best interaction energy (-45.09 kcal/mol) against the different systems studied. The energy values for G11A and A191V were -41.17 kcal/mol and -40.71 kcal/mol, respectively. While the systems that showed low binding energy were mutations R298A and S139A, with average values of -24.11 kcal/mol and -25.84 kcal/mol, respectively.
Likewise, the most significant energy contribution was given by the Van der Waals energies ( ∆E V DWAALS ) in the wild system, A191V, G11A, N142S, R298A, S139A, and T190I. These types of energy are weak and have short-range inter- actions; in biological systems, they play a significant role in stabilizing protein-small molecules. On the other hand, in the R60C and Y54C systems, the energy contribution is given by electrostatic energies ( ∆E ELE ). Electrostatic energy considers into account the charges of each atom in the system, which depend on the medium in which they are found; these have greater scope, and the force of interaction it possesses is linked to the relative orientations it accepts.
Figure 5 shows the last frame of each simulation of the Mpro-rutin complex. In general, it was observed that the interactions in the active site are due to the formation of hydrogen bonds. However, we observed some changes in the region around the active site for the mutations occurring in N142S and Y54C. The residues around N142S are mostly hydrophobic (green contour), hence N142S exhibits a greater energy contribution from hydrophobic interactions ( ∆E V DWAALS = -54.18 kcal/mol, higher than the other mutations). In Y54C, the residues around rutin are polar (sky blue contour), demonstrating its high energetic contribution by electrostatic interactions ( ∆E ELE = -66.78 kcal/mol more elevated than the other mutations).
Discussion
To date, research teams worldwide have been collecting data on SARS-CoV-2 strains, some of which exhibit numerous mutations in various structural proteins. The main protease (Mpro) plays a crucial role in the SARS-CoV-2 life cycle by mediating viral replication and transcription. Mpro functions by cleaving the viral polyproteins pp1a and pp1ab, which are synthesized from the viral RNA once SARS-CoV-2 enters the host cell. It recognizes specific sequences within these polyproteins and cleaves them at approximately 11 conserved sites, releasing nonstructural proteins (nsps) essential for forming the replication-transcription complex (RTC). The active site of Mpro, consisting of cysteine and histidine, catalyzes the cleavage of peptide bonds, thereby enabling the release of nsps required for viral replication. This makes Mpro a key target for therapeutic interventions.
However, mutations in the main protease (Mpro) of SARS-CoV-2 are significant because they can influence the virus’s ability to replicate and its susceptibility to antiviral treatments. Mpro is a critical enzyme in processing viral polyproteins, and any alterations in its structure due to mutations can affect its catalytic efficiency, protein stability, and interaction with inhibitors. 42
In this context, numerous studies have focused on identifying new inhibitors derived from natural compounds, as they offer a rich and promising source for drug discovery against SARS-CoV-2. 43– 50 These compounds provide significant advantages in terms of chemical diversity, safety, sustainability, and therapeutic efficacy. For example, several phytochemical molecules, such as kaempferol, quercetin, luteolin-7-glucoside, demethoxycurcumin, naringenin, apigenin-7-glucoside, oleuropein, curcumin, catechin, and epicatechin gallate, have been reported as promising antiviral agents against SARS-CoV-2. 51 Additionally, Parvez et al. identified azobechalcone, rifampin, isolophirachalcone, tetrandrine, and fangchinoline as potential Mpro inhibitors, 52 while Padhi et al. demonstrated that putaminoxin B, putaminoxin D, jasmonic acid, and jasmonic methyl ester possess good pharmacokinetic properties against Mpro. 53 In 2020, more than a thousand FDA-approved drugs were virtually screened using molecular docking and binding free energy calculations, with nelfinavir emerging as a potential inhibitor of SARS-CoV-2. Goyzueta et al. studied rutin as a promising Mpro inhibitor using in silico techniques. 54 Similarly, reused drugs and phytochemical compounds have shown binding affinities to various Mpro mutants. For instance, salvianolic acid A, extracted from Salvia miltiorrhiza, demonstrated inhibition effects against the N142S and T190I mutations. 23
We analyzed eight specific Mpro mutations (Y54C, N142S, T190I, A191V, S139A, G11A, R298A, and R60C), which have been identified in various parts of the world and have significant implications for the protease’s structure and function. These mutations are critical for developing effective inhibitors for antiviral treatments. The Y54C mutation, found in Malaysia, may affect the stability of Mpro, potentially altering its overall structure and interaction with inhibitors. 55 This could compromise the three-dimensional conformation of Mpro, which is essential for its active site function. The N142S mutation, reported in five countries, could impact the flexibility of the catalytic loop, a key element in the active site’s efficiency. 56 Alterations in this region may significantly affect the protease’s functionality. The T190I mutation, identified in 15 countries, including South Africa and the United States, may alter interactions at the substrate binding site by changing the orientation of catalytic residues, which could impair Mpro’s ability to interact with substrates. 57 , 58 The A191V mutation, with a 0.30% occurrence rate, has been reported in over 34 countries and may influence dimerization dynamics, which are essential for enzymatic function, thereby affecting the stability and activity of Mpro. 38 , 59 The S139A mutation could modify the catalytic environment, affecting Mpro’s interaction with inhibitors. 60 The R298A mutation disrupts dimeric conformation and leads to the irreversible inhibition of catalytic activity, destabilizing Mpro’s structure and impairing its ability to maintain an active conformation. 24 The G11A mutation eliminates the N-finger region (residues 1-9), reducing Mpro’s enzymatic activity and preventing proper dimer formation, which is crucial for proteolytic function. 38 The R60C mutation, identified in Brazil and Vietnam, affects the protein’s dynamics and the inhibitor binding within its active site, thus compromising Mpro inhibition. 15 This mutation particularly impacts Mpro’s three-dimensional structure and its interaction with potential therapeutic inhibitors, thereby reducing the effectiveness of protease inhibitor-based treatments.
The novelty of this study lies in the use of 84 substances taken from the original PeruNPDB ( https://perunpdb.com.pe) collection; the most recent dataset consists of 280 substances. Our results reveal that the rutin metabolite, found in Smallanthus sonchifolius (yacón) and Lepidium meyenii (maca), exhibited the strongest binding affinity with all the proposed Mpro mutations. Rutin, also known as rutoside, is a natural phenolic compound that plays a key role in maintaining the oxidant-antioxidant balance associated with certain diseases. 61 , 62 The MD simulation results indicate that rutin interacted with the active site residues of Mpro mutations and remained stabilized in the active site region with minimal fluctuation. These results are in perfect agreement with the MM/GBSA obtained after the docking calculation and the RMSD analysis. Where the stable RMSD ensures that the MM/GBSA calculation is based on a structurally consistent system, making the free energy predictions more reliable between Mpro mutations and rutin.
Conclusions
The main protease (Mpro) is crucial for SARS-CoV-2 replication and represents a promising drug target. In this study, we analyzed eight Mpro mutations of SARS-CoV-2 (Y54C, N142S, T190I, A191V, S139A, R298A, R60C, and G11A), each located in different regions of the protease. Among these, S139A demonstrated strong pathogenicity predictions through Meta-SNP calculations, warranting further investigation in future research. Additionally, virtual screening identified rutin, from the PeruNPDB database, as the most promising candidate for binding to Mpro. Molecular dynamics simulations and energy estimation analyses confirmed that rutin forms a highly stable complex with Mpro. We believe that computer-assisted drug design and molecular dynamics simulations offer a powerful complementary approach to screening potential Mpro inhibitors, providing an attractive strategy to combat SARS-CoV-2 and its variants.
Ethics and consent
Ethical approval and consent were not required.
Author contributions
Conceptualization: H.L.B.-C. and L.D.G.-M.; data curation: H.L.B.-C., L.D.G.-M., E.G.C.-R., A.E.C.-L., M.P.-C., and M.A.C.-F; formal analysis: H.L.B.-C. and M.A.C.-F.; funding acquisition: H.L.B.-C. and M.A.C.-F.; investigation: H.L.B.-C., M.P.-C., L.D.G.-M., E.G.C.-R., A.E.C.-L., and M.A.C.-F; methodology: H.L.B.-C. and M.A.C.-F.; writing—review and editing: H.L.B.-C., and M.A.C.-F. All authors have read and agreed to the published version of the manuscript.
Data availability
Underlying data
Figshare: Dataset of results from virtual screening and molecular dynamics simulations. https://doi.org/10.6084/m9.figshare.27156276.v1. 63
This project contains the following underlying data:
-
•
Table 1. Summary of diseased SNPs predicted from Meta-SNP in SARS-CoV-2 mutations.
-
•
Table 2. RMSD and RMSF average values of SARS-CoV-2 Mpro wild and mutated.
-
•
Table 3. Calculated MM/GBSA binding free energy of the systems.
-
•
Table S1. Values of the coupling energies obtained by virtual screening.
-
•
Table S2. Lipinski’s “rule of five” analysis of the 84 phytocompounds determined their solubility, permeability, and efficacy for drug discovery.
-
•
Table S3. In silico ADMET properties of 84 phytocompounds.
Data are available under the terms of the Creative Commons Attribution 4.0 International license (CC-BY 4.0).
Funding Statement
Universidad Católica de Santa María funded this research with the following grants: 27499-R-2020, 27574-R-2020, and 28048-R-2021.
The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
[version 2; peer review: 1 approved
References
- 1. Yu X, Li N: Understanding the beginning of a pandemic: China’s response to the emergence of COVID-19. J Infect Public Health. 2021;14(3):347–352. 10.1016/j.jiph.2020.12.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Hu B, Guo H, Zhou P, et al. : Characteristics of SARS-CoV-2 and COVID-19. Nat Rev Microbiol. 2021;19(3):141–154. 10.1038/s41579-020-00459-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Huang C, Wang Y, Li X, et al. : Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. lancet. 2020;395(10223):497–506. 10.1016/S0140-6736(20)30183-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Oldekop JA, Horner R, Hulme D, et al. : COVID-19 and the case for global development. World Dev. 2020;134: 105044. 10.1016/j.worlddev.2020.105044 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Nicola M, Alsafi Z, Sohrabi C, et al. : The socio-economic implications of the coronavirus pandemic (COVID-19): A review. Int J Surg. 2020;78:185–193. 10.1016/j.ijsu.2020.04.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Volz E, Hill V, McCrone JT, et al. : Evaluating the effects of SARS-CoV-2 spike mutation D614G on transmissibility and pathogenicity. Cell. 2021;184(1):64–75.e11. 10.1016/j.cell.2020.11.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Rees-Spear C, Muir L, Griffith SA, et al. : The effect of spike mutations on SARS-CoV-2 neutralization. Cell Rep. 2021;34(12): 108890. 10.1016/j.celrep.2021.108890 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Ullrich S, Nitsche C: The SARS-CoV-2 main protease as drug target. Bioorg Med Chem Lett. 2020;30(17): 127377. 10.1016/j.bmcl.2020.127377 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Brewitz L, Ibbotson L, Salah E, et al. : Thiophene-fused γ-lactams inhibit the SARS-CoV-2 main protease via reversible covalent acylation. Chem Sci. 2024;15(20):7667–7678. 10.1039/D4SC01027B [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Johansen-Leete J, Ullrich S, Fry SE, et al. : Antiviral cyclic peptides targeting the main protease of SARS-CoV-2. Chem Sci. 2022;13(13):3826–3836. 10.1039/D1SC06750H [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Ghahremanpour MM, Tirado-Rives J, Deshmukh M, et al. : Identification of 14 known drugs as inhibitors of the main protease of SARS-CoV-2. ACS Med Chem Lett. 2020;11(12):2526–2533. 10.1021/acsmedchemlett.0c00521 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Dai W, Zhang B, Jiang XM, Su H, Li J, Zhao Y, et al. : Structure-based design of antiviral drug candidates targeting the SARS-CoV-2 main protease. Science 2020;368(6497):1331–1335. 10.1126/science.abb4489 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Bzówka M, Mitusińska K, Raczyńska A, et al. : Structural and evolutionary analysis indicate that the SARS-CoV-2 Mpro is a challenging target for small-molecule inhibitor design. Int J Mol Sci. 2020;21(9):3099. 10.3390/ijms21093099 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Tao K, Tzou PL, Nouhin J, et al. : The biological and clinical significance of emerging SARS-CoV-2 variants. Nat Rev Genet. 2021;22(12):757–773. 10.1038/s41576-021-00408-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Khan MI, Khan ZA, Baig MH, et al. : Comparative genome analysis of novel coronavirus (SARS-CoV-2) from different geographical locations and the effect of mutations on major target proteins: An in silico insight. PLoS One. 2020;15(9): e0238344. 10.1371/journal.pone.0238344 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Narkhede RR, Cheke RS, Ambhore JP, et al. : The molecular docking study of potential drug candidates showing anti-COVID-19 activity by exploring of therapeutic targets of SARS-CoV-2. Eurasian J Med Oncol. 2020;4(3):185–195. [Google Scholar]
- 17. Rameshkumar MR, Indu P, Arunagirinathan N, et al. : Computational selection of flavonoid compounds as inhibitors against SARS-CoV-2 main protease, RNA-dependent RNA polymerase and spike proteins: A molecular docking study. Saudi J Biol Sci. 2021;28(1):448–458. 10.1016/j.sjbs.2020.10.028 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18. Kar P, Sharma NR, Singh B, et al. : Natural compounds from Clerodendrum spp. as possible therapeutic candidates against SARS-CoV-2: An in silico investigation. J Biomol Struct Dyn. 2021;39(13):4774–4785. 10.1080/07391102.2020.1780947 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19. Kumar SB, Krishna S, Pradeep S, et al. : Screening of natural compounds from Cyperus rotundus Linn against SARS-CoV-2 main protease (Mpro): An integrated computational approach. Comput Biol Med. 2021;134: 104524. 10.1016/j.compbiomed.2021.104524 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Castro JC, Maddox JD, Cobos M, et al. : Medicinal Plants of the Peruvian Amazon: Bioactive Phytochemicals, Mechanisms of Action, and Biosynthetic Pathways. Pharmacognosy-Medicinal Plants. 2018. [Google Scholar]
- 21. Hummer KE, Hancock JF: Vavilovian centers of plant diversity: Implications and impacts. HortScience. 2015;50(6):780–783. 10.21273/HORTSCI.50.6.780 [DOI] [Google Scholar]
- 22. Bussmann RW, Sharon D: Medicinal plants of the Andes and the Amazon-The magic and medicinal flora of Northern Peru. Ethnobot Res Appl. 2016;15:1–295. [Google Scholar]
- 23. Sharma T, Abohashrh M, Baig MH, et al. : Screening of drug databank against WT and mutant main protease of SARS-CoV-2: Towards finding potential compound for repurposing against COVID-19. Saudi J Biol Sci. 2021;28(5):3152–3159. 10.1016/j.sjbs.2021.02.059 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24. Goyal B, Goyal D: Targeting the dimerization of the main protease of coronaviruses: a potential broad-spectrum therapeutic strategy. ACS Comb Sci. 2020;22(6):297–305. 10.1021/acscombsci.0c00058 [DOI] [PubMed] [Google Scholar]
- 25. Suárez D, Díaz N: SARS-CoV-2 main protease: A molecular dynamics study. J Chem Inf Model. 2020;60(12):5815–31. 10.1021/acs.jcim.0c00575 [DOI] [PubMed] [Google Scholar]
- 26. Barazorda-Ccahuana HL, Ranilla LG, Candia-Puma MA, et al. : PeruNPDB: The Peruvian natural products database for in silico drug screening. Sci Rep. 2023;13(1):7577. 10.1038/s41598-023-34729-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Dallakyan S, Olson AJ. Small-molecule library screening by docking with PyRx. Chemical biology. Springer;2015; pp.243–250. 10.1007/978-1-4939-2269-7_19 [DOI] [PubMed] [Google Scholar]
- 28. Trott O, Olson AJ: AutoDock Vina: improving the speed and accuracy of docking with a new scoring function, efficient optimization, and multithreading. J Comput Chem. 2010;31(2):455–461. 10.1002/jcc.21334 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29. Boratyn GM, Camacho C, Cooper PS, et al. : BLAST: a more efficient report with usability improvements. Nucleic Acids Res. 2013;41(W1):W29–W33. 10.1093/nar/gkt282 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30. Biasini M, Bienert S, Waterhouse A, et al. : SWISS-MODEL: modelling protein tertiary and quaternary structure using evolutionary information. Nucleic Acids Res. 2014;42(W1):W252–W258. 10.1093/nar/gku340 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31. Bodenhofer U, Bonatesta E, Horejš-Kainrath C, et al. : msa: an R package for multiple sequence alignment. Bioinformatics. 2015;31(24):3997–3999. 10.1093/bioinformatics/btv494 [DOI] [PubMed] [Google Scholar]
- 32. Van Der Spoel D, Lindahl E, Hess B, et al. : GROMACS: fast, flexible, and free. J Comput Chem. 2005;26(16):1701–1718. 10.1002/jcc.20291 [DOI] [PubMed] [Google Scholar]
- 33. Miller BR, III, McGee TD, Jr, Swails JM, et al. : MMPBSA. py: an efficient program for end-state free energy calculations. J Chem Theory Comput. 2012;8(9):3314–3321. 10.1021/ct300418h [DOI] [PubMed] [Google Scholar]
- 34. Case DA, Belfon K, Ben-Shalom I, et al. : Amber 2020:2020. [Google Scholar]
- 35. Valdés-Tresanco MS, Valdés-Tresanco ME, Valiente PA, et al. : gmx_MMPBSA: a new tool to perform end-state free energy calculations with GROMACS. J Chem Theory Comput. 2021;17(10):6281–6291. 10.1021/acs.jctc.1c00645 [DOI] [PubMed] [Google Scholar]
- 36. Humphrey W, Dalke A, Schulten K: VMD: visual molecular dynamics. J Mol Graph. 1996;14(1):33–38. 10.1016/0263-7855(96)00018-5 [DOI] [PubMed] [Google Scholar]
- 37. Hu T, Zhang Y, Li L, et al. : Two adjacent mutations on the dimer interface of SARS coronavirus 3C-like protease cause different conformational changes in crystal structure. Virology. 2009;388(2):324–334. 10.1016/j.virol.2009.03.034 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38. Sheik Amamuddy O, Verkhivker GM, Tastan BÖ: Impact of early pandemic stage mutations on molecular dynamics of SARS-CoV-2 Mpro. J Chem Inf Model. 2020;60(10):5080–5102. 10.1021/acs.jcim.0c00634 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39. Capriotti E, Altman RB, Bromberg Y: Collective judgment predicts disease-associated single nucleotide variants. BMC Genomics. 2013;14:S2–S9. 10.1186/1471-2164-14-S3-S2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Lipinski CA: Lead-and drug-like compounds: the rule-of-five revolution. Drug Discov Today Technol. 2004;1(4):337–341. 10.1016/j.ddtec.2004.11.007 [DOI] [PubMed] [Google Scholar]
- 41. Fu L, Shi S, Yi J, et al. : ADMETlab 3.0: an updated comprehensive online ADMET prediction platform enhanced with broader coverage, improved performance, API functionality and decision support. Nucleic Acids Res. 2024;52:W422–W431. 10.1093/nar/gkae236 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. Wolfe G, Belhoussine O, Dawson A, et al. : Impactful mutations in Mpro of the SARS-CoV-2 proteome. Proceedings of the 11th ACM International Conference on Bioinformatics, Computational Biology and Health Informatics. 2020: pp.1–3. [Google Scholar]
- 43. Wu C, Liu Y, Yang Y, et al. : Analysis of therapeutic targets for SARS-CoV-2 and discovery of potential drugs by computational methods. Acta Pharm Sin B. 2020;10(5):766–788. 10.1016/j.apsb.2020.02.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44. Belhassan A, Chtita S, Zaki H, et al. : In silico detection of potential inhibitors from vitamins and their derivatives compounds against SARS-CoV-2 main protease by using molecular docking, molecular dynamic simulation and ADMET profiling. J Mol Struct. 2022;1258: 132652. 10.1016/j.molstruc.2022.132652 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Aanouz I, Belhassan A, El-Khatabi K, et al. : Moroccan Medicinal plants as inhibitors against SARS-CoV-2 main protease: Computational investigations. J Biomol Struct Dyn. 2021;39(8):2971–2979. 10.1080/07391102.2020.1758790 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Kandeel M, Al-Nazawi M: Virtual screening and repurposing of FDA approved drugs against COVID-19 main protease. Life Sci. 2020;251: 117627. 10.1016/j.lfs.2020.117627 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Chtita S, Belhassan A, Aouidate A, et al. : Discovery of potent SARS-CoV-2 inhibitors from approved antiviral drugs via docking and virtual screening. Comb Chem High Throughput Screen. 2021;24(3):441–454. 10.2174/1386207323999200730205447 [DOI] [PubMed] [Google Scholar]
- 48. Aouidate A, Ghaleb A, Chtita S, et al. : Identification of a novel dual-target scaffold for 3CLpro and RdRp proteins of SARS-CoV-2 using 3D-similarity search, molecular docking, molecular dynamics and ADMET evaluation. J Biomol Struct Dyn. 2021;39(12):4522–4535. 10.1080/07391102.2020.1779130 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Belhassan A, Zaki H, Chtita S, et al. : Camphor, Artemisinin and Sumac Phytochemicals as inhibitors against COVID-19: Computational approach. Comput Biol Med. 2021;136: 104758. 10.1016/j.compbiomed.2021.104758 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50. El Aissouq A, Chedadi O, Bouachrine M, et al. : Identification of Novel SARS-CoV-2 Inhibitors: A Structure-Based Virtual Screening Approach. J Chem. 2021;2021(1): 1901484.. 10.1155/2021/1901484 [DOI] [Google Scholar]
- 51. Puttaswamy H, Gowtham HG, Ojha MD, et al. : In silico studies evidenced the role of structurally diverse plant secondary metabolites in reducing SARS-CoV-2 pathogenesis. Sci Rep. 2020;10(1):1–24. 10.1038/s41598-020-77602-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. MSA P, Azim KF, Imran AS, et al. : Virtual screening of plant metabolites against main protease, RNA-dependent RNA polymerase and spike protein of SARS-CoV-2: Therapeutics option of COVID-19. arXiv preprint arXiv:200511254 2020. [Google Scholar]
- 53. Padhi S, Masi M, Chourasia R, et al. : ADMET profile and virtual screening of plant and microbial natural metabolites as SARS-CoV-2 S1 glycoprotein receptor binding domain and main protease inhibitors. Eur J Pharmacol. 2021;890: 173648. 10.1016/j.ejphar.2020.173648 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54. Goyzueta-Mamani LD, Barazorda-Ccahuana HL, Mena-Ulecia K, et al. : Antiviral activity of metabolites from peruvian plants against SARS-CoV-2: an in silico approach. Molecules. 2021;26(13):3882. 10.3390/molecules26133882 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55. Zhou X, Lu X, Lin C, et al. : Structural basis for the inhibition of coronaviral main proteases by PF-00835231. bioRxiv. 2024; 2024. [DOI] [PubMed] [Google Scholar]
- 56. Mótyán JA, Mahdi M, Hoffka G, et al. : Potential resistance of SARS-CoV-2 main protease (Mpro) against protease inhibitors: lessons learned from HIV-1 protease. Int J Mol Sci. 2022;23(7):3507. 10.3390/ijms23073507 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Thakur S, Sasi S, Pillai SG, et al. : SARS-CoV-2 mutations and their impact on diagnostics, therapeutics and vaccines. Front Med (Lausanne). 2022;9: 815389. 10.3389/fmed.2022.815389 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Mohammad T, Choudhury A, Habib I, et al. : Genomic variations in the structural proteins of SARS-CoV-2 and their deleterious impact on pathogenesis: a comparative genomics approach. Front Cell Infect Microbiol. 2021;11: 765039. 10.3389/fcimb.2021.765039 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Cross TJ, Takahashi GR, Diessner EM, et al. : Sequence characterization and molecular modeling of clinically relevant variants of the SARS-CoV-2 main protease. Biochemistry. 2020;59(39):3741–3756. 10.1021/acs.biochem.0c00462 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Ferreira JC, Fadl S, Rabeh WM: Key dimer interface residues impact the catalytic activity of 3CLpro, the main protease of SARS-CoV-2. J Biol Chem. 2022;298(6):102023. 10.1016/j.jbc.2022.102023 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61. Imani A, Maleki N, Bohlouli S, et al. : Molecular mechanisms of anticancer effect of rutin. Phytother Res. 2021;35(5):2500–2513. 10.1002/ptr.6977 [DOI] [PubMed] [Google Scholar]
- 62. Cristiano MC, Barone A, Mancuso A, et al. : Rutin-Loaded Nanovesicles for Improved Stability and Enhanced Topical Efficacy of Natural Compound. J Funct Biomater. 2021;12(4):74. 10.3390/jfb12040074 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63. Chavez-Fumagalli MA: Barazorda-Cchauana et al 2023b_new.xlsx.[Dataset]. figshare. 2024. 10.6084/m9.figshare.27156276.v1 [DOI]