Abstract
The main protease (Mpro) of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) plays a vital role in viral replication. To study the function of Mpro and screen inhibitors targeting Mpro, it is necessary to prepare high-purity and high-activity Mpro. In this study, four types of SARS-CoV-2 Mpros containing different termini were prepared, and their activities were determined successfully. The results showed that the activity of wild-type (WT) Mpro was the highest, and the additional residues at the N-terminus but not at the C-terminus had a major effect on the enzyme activity. To explain this, the alignment of structures of different forms of Mpro was determined, and the additional residues at the N-terminus were found to interfere with the formation of the substrate binding pocket. This study confirms the importance of the natural N-terminus to the activity of Mpro and suggests that WT-GPH6 (Mpro with eight additional residues at the C-terminus) can be used as a substitute for authentic Mpro to screen inhibitors. In short, this study provides a reference for the expression and purification of new coronaviruses confronted in the future.
Keywords: SARS-CoV-2, Main protease, Enzyme activity, Additional residues, Kinetic parameters
Abbreviations
- Mpro
main protease
- SARS-CoV-2
severe acute respiratory syndrome coronavirus 2
- COVID-19
coronavirus disease 2019
- SARS
severe acute respiratory syndrome
- MERS
Middle East respiratory syndrome
- ORFs
open reading frames
- PLpro
papain-like protease
- WT-GPH6
SARS-CoV-2 Mpro with eight additional residues (GPHHHHHH) at the C-terminus
- WT
wild-type SARS-CoV-2 Mpro with authentic termini
- H6-WT
SARS-CoV-2 Mpro with twenty-one additional residues (MGSSHHHHHHSSGRENLYFQG) at the N-terminus
- G-WT
SARS-CoV-2 Mpro with one additional residue (G) at the N-terminus
- PSP
PreScission protease
- TEV
tobacco etch virus
1. Introduction
In 2019, coronavirus disease 2019 (COVID-19) caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) appeared in Wuhan, China [1,2]. The main clinical manifestations of mild patients were fever, cough, and dyspnea. A few patients had diarrhea, nausea and other gastrointestinal symptoms [3,4]. Severe patients presented with acute respiratory dysfunction syndrome, multiple organ failure, and even death [5]. The worldwide prevalence of SARS-CoV-2 poses a serious threat to human health and life because of its strong infectivity.
Similar to the two coronaviruses that caused severe acute respiratory syndrome (SARS) in 2003 [6] and Middle East respiratory syndrome (MERS) in 2012 [7], SARS-CoV-2 causing COVID-19 belongs to the genus Betacoronavirus, including SARS-CoV and MERS-CoV [1,8]. Furthermore, SARS-CoV-2 has 79.6% genome sequence identity with SARS-CoV [9] and nearly 50% genome sequence identity with MERS-CoV [10].
SARS-CoV-2 is a single-stranded RNA virus. Its genome is composed of 14 functional open reading frames (ORFs), which can encode at least 29 kinds of proteins. Four structural proteins are included: spike protein, nucleocapsid protein, membrane glycoprotein, and envelope protein [11,12]. SARS-CoV-2 ORF1a and ORF1b encode two different polyproteins, pp1a and pp1ab. These polyproteins are cleaved to generate the nonstructural protein nsp1-16 by pivotal enzymes, including papain-like protease (PLpro) and main protease (Mpro) [13]. Mpro can cleave polyproteins at 11 sites and initially release itself by autocleavage [14,15]. Mpro can recognize substrates as long as 10 residues but generally have specific selectivity for four subsites. Interestingly, substrate recognition pockets of Mpro from different coronaviruses are highly conserved due to sequence homology [16,17]. There is up to a remarkable 96% sequence identity of Mpro derived from SARS-CoV-2 and SARS-CoV [18]. In summary, highly conserved Mpro plays a pivotal role in virus life and has no homologous enzyme in humans, so it can be regarded as a key target for screening inhibitors [[19], [20], [21]].
Mpro, with an approximate molecular weight of 33 kDa, is demonstrated to have a dimeric structure, and each protomer has three domains. The catalytic active sites and active pockets are situated in domain I and domain II, and domain III mainly participates in the formation of dimer structures. The catalytic activity center is a catalytic dimer composed of cysteine (Cys145) and histidine (His41) [17,[22], [23], [24], [25]].
Although SARS-CoV-2 Mpro has been characterized sufficiently, one particular aspect draws our attention. The kinetic parameters of SARS-CoV-2 Mpro were obviously inconsistent from different studies, in which kcat/Km ranged from 370.4 to 28500 M−1 s−1. This is difficult to explain by the differences in experimental procedures and reagents [20,23,[26], [27], [28]]. However, studies on SARS-CoV Mpro showed that additional residues at the terminus could lead to the depression of enzyme activity to varying degrees [29]. Considering the high homology of Mpro from SARS-CoV and SARS-CoV-2, we speculated that additional amino acids would affect the activity of SARS-CoV-2 Mpro analogously.
In this study, we expressed four types of SARS-CoV-2 Mpros with different termini and measured their activity. Similar to previous studies on SARS-CoV Mpro, additional residues at the N-terminus significantly reduced the activity, but additional residues at the C-terminus had little effect on the activity. These results indicated that Mpro in the WT-GPH6 form can be used as a substitute for authentic Mpro and provided references for the expression and purification of coronavirus Mpro in the future.
2. Materials and methods
2.1. Construction of recombinant plasmids
The full-length Mpro gene 10055–10972 of SARS-CoV-2 (GenBank accession no. NC_045512.2) was optimized and synthesized for Escherichia coli (E. coli) expression. Then, the synthesized gene was cloned into the pGEX-6p-1 vector via the restriction sites BamHI and XhoI to obtain the recombinant plasmid, which was designated pGEX-Mpro (Genewiz). To obtain the authentic N-terminus, 12 nucleotides and 24 nucleotides corresponding to AVLQ and GPHHHHHH at N- and C-termini, respectively, were added to the Mpro gene, which was performed by PCR via primers P1–P2 (Table 1 ). Subsequently, the amplified gene product was ligated into the pGEX-6p-1 vector via the restriction sites BamHI and XhoI, and the constructed plasmid was designated pGEX-Mpro-GPH6. Similarly, to obtain the authentic C-terminus, the Mpro gene was amplified with primers P3–P4 (Table 1) and ligated into the modified pET21a vector, which was digested with the restriction enzymes BamHI and XhoI. The modified vector contained the His tag and tobacco etch virus (TEV) protease cleavage site upstream of restriction site BamHI [30]. The constructed recombinant plasmid containing the pET21a and Mpro genes was designated pET-H6-Mpro. The gene sequences of the constructed plasmids were verified by sequencing (Genewiz).
Table 1.
Primers used in this study.
| Primer | Sequence |
|---|---|
| P1 | TTCCAGGGGCCCCTGGGATCCGCGGTACTGCAGAGCGGCTTTCG |
| P2 | GTCACGATGCGGCCGCTCGAGTTAGTGGTGGTGGTGGTGGTGGGGTCCCTGAAAGGTCA |
| P3 | GAAAACTTGTATTTCCAGGGCAGCGGCTTTCGCAAAATGG |
| P4 | GTGGTGGTGGTGGTGCTCGAGTTACTGAAAGGTCACGCCGC |
2.2. Expression and purification of four types of SARS-CoV-2 Mpros
The plasmid pGEX-Mpro-GPH6 was expressed to obtain Mpro with eight additional residues (GPHHHHHH) at the C-terminus, and the generated Mpro was designated WT-GPH6. Purified WT-GPH6 was cleaved with PreScission Protease (PSP) to produce authentic Mpro, which was named WT. Similarly, plasmid pET-H6-Mpro was expressed to generate Mpro with twenty-one additional residues (MGSSHHHHHHSSGRENLYFQG) at the N-terminus, and the resulting Mpro was designated H6-WT. Subsequently, extra residues of H6-WT were removed by using TEV protease to obtain Mpro designated G-WT, which included an extra glycine residue at the N-terminus. The expression and purification of four types of SARS-CoV-2 Mpros (WT-GPH6, WT, H6-WT, G-WT) are described in detail below.
For the expression and purification of WT-GPH6, the plasmid pGEX-Mpro-GPH6 was transformed into E. coli BL21 (DE3). Positive clones harboring the recombinant plasmid were cultured in LB medium containing 100 μg/mL ampicillin at 37 °C. When the OD600 reached 0.6–0.8, 0.5 mM isopropyl-D-thiogalactoside (IPTG) was added, and Mpro was overexpressed at 16 °C for 14–16 h. The cells were harvested by centrifugation at 6000 rpm for 5 min, and then pellets were resuspended in buffer A (20 mM Tris-HCl, 150 mM NaCl, pH 7.5) and lysed by sonication on ice. The lysate was centrifuged at 10000 rpm for 60 min, and the precipitate was discarded. The supernatant was loaded onto a HisTrap column, which was subsequently successively washed with buffer A containing imidazole at different concentrations (20 mM, 50 mM, 100 mM, 300 mM, 500 mM, and 1000 mM). The fractions containing proteins of interest were concentrated and then dialyzed in buffer B (50 mM Tris-HCl, 1 mM EDTA, pH 7.3). The purified WT-GPH6 was analyzed by SDS‒PAGE.
For the purification of WT, PSP was added to purified WT-GPH6 to cleave additional residues at a mass ratio of 1:30, resulting in the target protein WT with an authentic C-terminus. The digestion mixture was loaded onto a GSTtrap column, and the fraction that did not bind to the column was collected to remove PSP. The protein obtained in the previous step was subjected to a HisTrap column, and then the target protein WT was collected by eluting with buffer A to remove WT-GPH6, which had not been digested with PSP. The purified WT was analyzed by SDS‒PAGE.
For the expression and purification of H6-WT, the plasmid pET-H6-Mpro was transformed into E. coli BL21 (DE3). The following procedure was similar to that used for the expression and purification of WT-GPH6. The purified H6-WT was analyzed by SDS‒PAGE.
For the purification of G-WT, TEV protease was added to purified H6-WT remove the His tag at a mass ratio of 1:30, resulting in the target protein G-WT. A mixture of digestion was applied to a HisTrap column, and then the target protein G-WT was fragmented by washing with buffer A to exclude TEV protease and H6-WT, which had not been digested with TEV protease. The purified G-WT was analyzed by SDS‒PAGE.
2.3. Activity detection of four types of SARS-CoV-2 Mpros
Enzyme activity assays of four types of Mpros (WT-GPH6, WT, H6-WT, and G-WT) were performed based on fluorescence resonance energy transfer, and the substrate was MCA-AVLQSGFR-Lys(Dnp)-Lys-NH2 (P9731-5 mg, Beyotime Biotechnology, China) according to previous research [20]. The fluorescence was measured at excitation and emission wavelengths of 320 nm and 395 nm, respectively, and the reaction was carried out in buffer B with a total volume of 500 μL. Initially, 440 μL buffer B and 50 μL Mpro (final concentration of 0.2 μM for WT-GPH6 and WT, 5 μM for H6-WT, and 2.5 μM for G-WT) were added to the reaction system. As soon as 10 μL of substrate (final concentrations of 2.5 μM, 3.3 μM, 5 μM, 10 μM, 20 μM, and 40 μM) was added to the reaction system, the relative fluorescence unit (RFU) value was immediately monitored for 1200 s (F2700, Hitachi, Japan). Similarly, the RFU values of a series of different concentrations of MCA (0.625 μM, 1.25 μM, 2.5 μM, 5 μM, 10 μM, and 20 μM, M185662-1 g, Aladdin, China) were measured in the absence of Mpro to calculate the amount of the cleaved substrate. The initial rate depended on the change in RFUs in the first 90 s. The kinetic constants Km and kcat were obtained from a double reciprocal plot, which was drawn by GraphPad Prism 8.0 software. Experiments were performed in triplicate, and the values were presented as the mean ± standard deviation (SD).
3. Results and discussion
3.1. Construction of the recombinant plasmids
To shed light on the effect of additional residues on the activity of SARS-CoV-2 Mpro, two recombinant plasmids were constructed to produce four types of Mpros with different termini, which included WT-GPH6, WT, H6-WT, and G-WT (Fig. 1 ). Two types of Mpros can be obtained when one recombinant plasmid is expressed based on the hydrolysis strategy of protease by introducing a cleavage site. According to a previous study of SARS-CoV Mpro [29], the plasmid pGEX-Mpro-GPH6 was constructed to produce WT-GPH6 (with eight additional residues at the C-terminus) and WT (with authentic termini). When constructing pGEX-Mpro-GPH6, 12 nucleotides encoding AVLQ were added upstream of the Mpro gene to generate the cleavage site of Mpro, which led to the self-cleavage of the recombinant protein GST-WT-GPH6 to produce WT-GPH6. Meanwhile, 24 nucleotides encoding GPHHHHHH were added downstream of the Mpro gene to create the cleavage site of PSP, which digested WT-GPH6 to WT. To obtain H6-WT and G-WT, we constructed the recombinant plasmid pET-H6-Mpro, which contained a His tag sequence and a TEV protease cleavage site upstream of the Mpro gene.
Fig. 1.
Schematic diagram of four types of SARS-CoV-2 Mpros. The additional residues are shown in blue. The colored arrows represent the protease cleavage site. Red, Mpro; green, PSP; yellow, TEV protease.
3.2. Preparations of WT-GPH6 and WT
When the plasmid pGEX-Mpro-GPH6 was expressed in E. coli, the GST tag-fused protein GST-WT-GPH6 was initially generated. Since four residues (AVLQ) were introduced into the N-terminus of the first residue serine of Mpro, the recognition site of Mpro was formed. In this case, GST-WT-GPH6 was cleaved by itself to produce WT-GPH6. This phenomenon was probably similar to the idea that Mpro released itself from polyproteins in viruses [14]. After purification of the expression product by Ni-affinity chromatography, purified protein was analyzed by SDS‒PAGE, which showed that a single band (Fig. 2 , Lane 1) appeared near 33 kDa. This result was in accord with the predicted molecular mass of 34.77366 kDa, which revealed that SARS-CoV-2 Mpro in the WT-GPH6 form was obtained.
Fig. 2.
Expression and purification of Mpro. Lane M, protein molecular mass marker; Lane 1, WT-GPH6; Lane 2, WT; Lane 3, H6-WT; Lane 4, G- WT.
It was necessary to obtain authentic Mpro to clarify the influence of additional residues. To remove the redundant residues, the cleavage site of PSP was generated by adding two residues, glycine and proline, at the C-terminus of Mpro. In this case, the six amino acids (SGVTFQ) at the C-terminus of Mpro and the two added amino acids (GP) corresponded to the recognition sites P6–P1 and P1′-P2′ of PSP, respectively. When the mixture of WT-GPH6 processed by PSP was purified by affinity chromatography, a slightly lower band appeared in Lane 2 than in Lane 1 (Fig. 2). The slight difference between Lane 1 and Lane 2 can be explained by the difference in molecular mass between WT-GPH6 (34.77366 kDa) and WT (33.79664 kDa), which indicated that the excess eight amino acids at the C-terminus were successfully removed and Mpro in the WT form was obtained.
Although two types of Mpros (WT-GPH6 and WT) were obtained in our study, a special circumstance attracted our attention. In recent research, a series of structures of complexes of Mpro binding with natural substrates were resolved. In the structure of the Mpro H41A mutant containing the nsp5|6 substrate (PDB ID: 7VAH), the side chain of alanine at P2' was exposed to solvent, which indicated that the P2' site had less specificity for substrate recognition [14]. To isolate natural Mpro, a His tag and two excess residues of GP were added. After affinity chromatography, WT-GPH6 was successfully obtained, which indicated that the His tag was rarely removed. Interestingly, the His tag was cleaved by PSP, which revealed that the C-terminal residues (SGVTFQGP) of WT-GPH6 could be recognized by PSP but not by Mpro. Comparing the sequence (SGVTFQGP) with that of the natural nsp5|6 substrate (SGVTFQSAV), it was found that Ser at the P1' site and Ala at the P2' site were replaced by Gly and Pro, respectively. The S1' subsite was a shallow pocket and could accommodate small residues. Therefore, the reason that the sequence (SGVTFQGP) was not identified by Mpro was because Pro was at the P2' site, not Gly at the P1' site, which was consistent with profiling of the substrate specificity of SARS-CoV Mpro [31]. This might be due to the change in peptide chain structure caused by the special structure of proline, which resulted in the inactivation of the substrate.
3.3. Preparations of H6-WT and G-WT
In a previous study, when there were two or five residues in the N-terminus of SARS-CoV Mpro, the activity was only 4% or 1% of that of authentic Mpro, respectively [29]. Considering that the sequences of Mpro from SARS-CoV-2 and SARS-CoV are highly homologous and that their structures are highly similar [20], we speculated that the excess residues at the N-terminus would also significantly affect the activity of Mpro derived from SARS-CoV-2. Therefore, Mpro in the H6-WT form with 21 additional residues (MGSSHHHHHHSSGRENLYFQG) at the N-terminus was generated when the plasmid pET-H6-Mpro was expressed in E. coli. Similarly, the expression product was purified by Ni-affinity chromatography, and purified protein was detected by SDS‒PAGE. The result showed that there was a single band (Fig. 2, Lane 3) at the upper position of the band at Lane 2, which was in accord with the theoretical molecular mass of H6-Mpro (36.22120 kDa).
H6-Mpro is a fusion protein composed of Mpro, the N-terminal His tag and the TEV protease cleavage site. TEV protease can recognize the sequence ENLYFQG and cleave between glutamine and glycine. Therefore, when H6-Mpro was digested by TEV protease, one glycine was left on Mpro. As expected, after purification by affinity chromatography, it was observed that the band at Lane 4 was slightly lower than that at Lane 3 (Fig. 2). The small gap of the bands may exactly correspond to the molecular mass (2.36751 kDa) of the cleaved residues, which demonstrated that the His tag at the N-terminus was successfully released and Mpro in the G-WT form was obtained.
3.4. Activity determination of four types of Mpros
To ascertain whether the additional residues would affect the catalytic activity of SARS-CoV-2 Mpro, the activities of four types of Mpros (WT-GPH6, WT, H6-WT, G-WT) were determined by measuring the kinetic parameters. When four types of Mpros reacted with substrates of different concentrations, the changes in RFUs over 1200 s were continuously monitored (Fig. 3 ). The data showed that the RFU value increased rapidly and that the reaction conformed to the first-order reaction in the first 90 s. This phenomenon indicated that Mpro could recognize and cleave the fluorescent substrates, which suggested that the four types of Mpros were indeed active.
Fig. 3.
The activities of four types of Mpros. A, WT-GPH6; B, WT; C, H6-WT; D, G-WT.
To calculate the kinetic parameters of Mpros, double reciprocal plots were performed according to the initial rate and substrate concentration of the reaction (Table 2 , Fig. S1, S2). The catalytic efficiency of an enzyme is best defined by kcat/Km [32]. Obviously, WT with a kcat/Km of 24485 M−1s−1 had the highest catalytic efficiency, while H6-WT with a kcat/Km of 139 M−1s−1 had the lowest catalytic efficiency. The activity of WT was more than 170 times greater than that of H6-WT and 60 times greater than that of G-WT. The results showed that additional residues at the N-terminus significantly led to a decrease in Mpro activity. Moreover, the greater the number of additional residues at the N-terminus, the greater the decrease in Mpro activity. Fortunately, the activity of WT-GPH6 (kcat/Km = 15736 M−1s−1) was approximately two-thirds of that of WT, which suggested that additional residues at the C-terminus had little effect on Mpro activity.
Table 2.
Comparison of enzyme activities of four types of SARS-CoV-2 Mpros.
| SARS-CoV-2 Mpro | Km (μM) | kcat (s−1) | kcat/Km (M−1s−1) |
|---|---|---|---|
| H6-WT | 41.81 ± 9.79 | 0.0058 ± 0.0013 | 139 |
| G-WT | 28.01 ± 4.77 | 0.0110 ± 0.0016 | 393 |
| WT-GPH6 | 26.48 ± 5.67 | 0.4167 ± 0.0807 | 15736 |
| WT | 13.31 ± 1.33 | 0.3259 ± 0.0277 | 24485 |
In a previous study, SARS-CoV Mpro in the WT form had the highest cleavage efficiency compared to that in other forms, and additional residues at the N-terminus, but not at the C-terminus, were detrimental to enzyme activity [29]. This was consistent with our results, likely because the Mpros from the two viruses were not only highly homologous in sequence but also highly consistent in structure. This confirmed that Mpro could be used as a screening target for broad-spectrum inhibitors.
To date, a series of structures of SARS-CoV-2 Mpro in different forms have been resolved. To determine how the additional residues affected the activity of Mpro, we compared the structures of Mpro in the WT form (PDB ID: 6M03) and G-WT form (PDB ID: 7BRO) (Fig. 4 ). In the WT structure, the first residue serine interacted with Glu166, stabilizing the conformation of the substrate binding pocket. In contrast, the N-terminus of G-WT slightly deviated from the orientation to the active site due to excess glycine [33]. Similarly, the N-terminus could not participate in the formation of a substrate binding pocket for SARS-CoV Mpro in the GPGLS-WT form with five additional residues at the N-terminus, which likely resulted in a reduction in enzyme activity [29]. The structural difference would account for the higher activity of the WT protease compared with other proteases. Furthermore, our results also showed that the activity of H6-WT was lower than that of G-WT. This might be due to the additional residues at the N-terminus, which were adjacent to the substrate binding pocket and would interfere with substrate recognition. In summary, our study once again emphasized the importance of authentic N-terminus for the activity of Mpro.
Fig. 4.
Superposition of WT and G-WT. A, The overall superposition of WT and G-WT; B, the zoom-in view of the termini. Protomer A and B in the WT form are colored green and yellow, respectively. Protomer A and B in the G-WT are colored cyan and magenta, respectively. The key amino acids are presented by stick and labeled in black. “N” and “C” represent the N-terminus and C-terminus, respectively.
In contrast to the N-terminus, the C-terminus of Mpro in the WT form was oriented away from the substrate binding pocket. Therefore, the C-terminal extra residues had less effect on enzyme activity, which indicated that WT-GPH6 can be used as a substitute for authentic Mpro to screen inhibitors targeting the active site.
4. Conclusions
In this study, we described an ingenious strategy for producing four types of SARS-CoV-2 Mpros by expressing the two types of recombinant plasmids in E. coli. Activity assays showed that the WT had the highest cleavage efficiency, and the effect of the additional residues at the N-terminus on the enzyme activity was much greater than that at the C-terminus. This study confirms the importance of the natural N-terminus of Mpro and provides a reference for the expression of Mpro of the new coronavirus in the future.
Acknowledgements
This work was supported by the Fundamental Research Program of Shanxi Province, China (grant 20210302124076, 202103021224067, 202103021224070) and the Shanxi Scholarship Council of China, China (grant 2021–036).
Footnotes
Supplementary data related to this article can be found at https://doi.org/10.1016/j.pep.2022.106208.
Appendix A. Supplementary data
The following is the supplementary data related to this article:
Data availability
Data will be made available on request.
References
- 1.Wu F., Zhao S., Yu B., Chen Y.M., Wang W., Song Z.G., Hu Y., Tao Z.W., Tian J.H., Pei Y.Y., Yuan M.L., Zhang Y.L., Dai F.H., Liu Y., Wang Q.M., Zheng J.J., Xu L., Holmes E.C., Zhang Y.Z. A new coronavirus associated with human respiratory disease in China. Nature. 2020;579:265–269. doi: 10.1038/s41586-020-2008-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Zhu N., Zhang D., Wang W., Li X., Yang B., Song J., Zhao X., Huang B., Shi W., Lu R., Niu P., Zhan F., Ma X., Wang D., Xu W., Wu G., Gao G.F., Tan W., China Novel Coronavirus I., Research T. A novel coronavirus from patients with pneumonia in China,2019. N. Engl. J. Med. 2020;382:727–733. doi: 10.1056/NEJMoa2001017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Jiang F., Deng L., Zhang L., Cai Y., Cheung C.W., Xia Z. Review of the clinical characteristics of coronavirus disease 2019 (COVID-19) J. Gen. Intern. Med. 2020;35:1545–1549. doi: 10.1007/s11606-020-05762-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Guan W.J., Ni Z.Y., Hu Y., Liang W.H., Ou C.Q., He J.X., Liu L., Shan H., Lei C.L., Hui D.S.C., Du B., Li L.J., Zeng G., Yuen K.Y., Chen R.C., Tang C.L., Wang T., Chen P.Y., Xiang J., Li S.Y., Wang J.L., Liang Z.J., Peng Y.X., Wei L., Liu Y., Hu Y.H., Peng P., Wang J.M., Liu J.Y., Chen Z., Li G., Zheng Z.J., Qiu S.Q., Luo J., Ye C.J., Zhu S.Y., Zhong N.S., China Medical Treatment C. Expert group for, clinical characteristics of coronavirus disease 2019 in China. N. Engl. J. Med. 2020;382:1708–1720. doi: 10.1056/NEJMoa2002032. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Du Y., Tu L., Zhu P., Mu M., Wang R., Yang P., Wang X., Hu C., Ping R., Hu P., Li T., Cao F., Chang C., Hu Q., Jin Y., Xu G. Clinical features of 85 fatal cases of COVID-19 from wuhan. A retrospective observational study. Am. J. Respir. Crit. Care Med. 2020;201:1372–1379. doi: 10.1164/rccm.202003-0543OC. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Drosten C., Günther S., Preiser W., Van Der Werf S., Brodt H.-R., Becker S., Rabenau H., Panning M., Kolesnikova L., Fouchier R.A. Identification of a novel coronavirus in patients with severe acute respiratory syndrome. N. Engl. J. Med. 2003;348:1967–1976. doi: 10.1056/nejmoa030747. [DOI] [PubMed] [Google Scholar]
- 7.Zaki A.M., van Boheemen S., Bestebroer T.M., Osterhaus A.D., Fouchier R.A. Isolation of a novel coronavirus from a man with pneumonia in Saudi Arabia. N. Engl. J. Med. 2012;367:1814–1820. doi: 10.1056/NEJMoa1211721. [DOI] [PubMed] [Google Scholar]
- 8.Cui J., Li F., Shi Z.L. Origin and evolution of pathogenic coronaviruses. Nat. Rev. Microbiol. 2019;17:181–192. doi: 10.1038/s41579-018-0118-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Zhou P., Yang X.L., Wang X.G., Hu B., Zhang L., Zhang W., Si H.R., Zhu Y., Li B., Huang C.L., Chen H.D., Chen J., Luo Y., Guo H., Jiang R.D., Liu M.Q., Chen Y., Shen X.R., Wang X., Zheng X.S., Zhao K., Chen Q.J., Deng F., Liu L.L., Yan B., Zhan F.X., Wang Y.Y., Xiao G.F., Shi Z.L. A pneumonia outbreak associated with a new coronavirus of probable bat origin. Nature. 2020;579:270–273. doi: 10.1038/s41586-020-2012-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Lu R., Zhao X., Li J., Niu P., Yang B., Wu H., Wang W., Song H., Huang B., Zhu N., Bi Y., Ma X., Zhan F., Wang L., Hu T., Zhou H., Hu Z., Zhou W., Zhao L., Chen J., Meng Y., Wang J., Lin Y., Yuan J., Xie Z., Ma J., Liu W.J., Wang D., Xu W., Holmes E.C., Gao G.F., Wu G., Chen W., Shi W., Tan W. Genomic characterisation and epidemiology of 2019 novel coronavirus: implications for virus origins and receptor binding. Lancet. 2020;395:565–574. doi: 10.1016/s0140-6736(20)30251-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Karpinski T.M., Ozarowski M., Seremak-Mrozikiewicz A., Wolski H., Wlodkowic D. The 2020 race towards SARS-CoV-2 specific vaccines. Theranostics. 2021;11:1690–1702. doi: 10.7150/thno.53691. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Yao H., Song Y., Chen Y., Wu N., Xu J., Sun C., Zhang J., Weng T., Zhang Z., Wu Z., Cheng L., Shi D., Lu X., Lei J., Crispin M., Shi Y., Li L., Li S. Molecular architecture of the SARS-CoV-2 virus. Cell. 2020;183:730–738. doi: 10.1016/j.cell.2020.09.018. e713. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Bai C., Zhong Q., Gao G.F. Overview of SARS-CoV-2 genome-encoded proteins. Sci. China Life Sci. 2022;65:280–294. doi: 10.1007/s11427-021-1964-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Zhao Y., Zhu Y., Liu X., Jin Z., Duan Y., Zhang Q., Wu C., Feng L., Du X., Zhao J., Shao M., Zhang B., Yang X., Wu L., Ji X., Guddat L.W., Yang K., Rao Z., Yang H. Structural basis for replicase polyprotein cleavage and substrate specificity of main protease from SARS-CoV-2. Proc. Natl. Acad. Sci. U. S. A. 2022;119 doi: 10.1073/pnas.2117142119. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hegyi A., Ziebuhr J. Conservation of substrate specificities among coronavirus main proteases. J. Gen. Virol. 2002;83:595–599. doi: 10.1099/0022-1317-83-3-595. [DOI] [PubMed] [Google Scholar]
- 16.Yang H., Xie W., Xue X., Yang K., Ma J., Liang W., Zhao Q., Zhou Z., Pei D., Ziebuhr J., Hilgenfeld R., Yuen K.Y., Wong L., Gao G., Chen S., Chen Z., Ma D., Bartlam M., Rao Z. Design of wide-spectrum inhibitors targeting coronavirus main proteases. PLoS Biol. 2005;3:e324. doi: 10.1371/journal.pbio.0030324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Anand K., Palm G.J., Mesters J.R., Siddell S. Structure of coronavirus main proteinase reveals combination of a chymotrypsin fold with an extra alpha-helical domain. EMBO J. 2002;21:3213–3224. doi: 10.1093/emboj/cdf327. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Morse J.S., Lalonde T., Xu S., Liu W. Learning from the past: possible urgent prevention and treatment options for severe acute respiratory infections caused by 2019-nCoV. Chembiochem. 2020;21:730–738. doi: 10.26434/chemrxiv.11728983. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Pillaiyar T., Manickam M., Namasivayam V., Hayashi Y., Jung S.H. An overview of severe acute respiratory syndrome-coronavirus (SARS-CoV) 3CL protease inhibitors: peptidomimetics and small molecule chemotherapy. J. Med. Chem. 2016;59:6595–6628. doi: 10.1021/acs.jmedchem.5b01461. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Jin Z., Du X., Xu Y., Deng Y., Liu M., Zhao Y., Zhang B., Li X., Zhang L., Peng C., Duan Y., Yu J., Wang L., Yang K., Liu F., Jiang R., Yang X., You T., Liu X., Yang X., Bai F., Liu H., Liu X., Guddat L.W., Xu W., Xiao G., Qin C., Shi Z., Jiang H., Rao Z., Yang H. Structure of M(pro) from SARS-CoV-2 and discovery of its inhibitors. Nature. 2020;582:289–293. doi: 10.1038/s41586-020-2223-y. [DOI] [PubMed] [Google Scholar]
- 21.Arya R., Kumari S., Pandey B., Mistry H., Bihani S.C., Das A., Prashar V., Gupta G.D., Panicker L., Kumar M. Structural insights into SARS-CoV-2 proteins. J. Mol. Biol. 2021;433 doi: 10.1016/j.jmb.2020.11.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Cui W., Yang K., Yang H. Recent progress in the drug development targeting SARS-CoV-2 main protease as treatment for COVID-19. Front. Mol. Biosci. 2020;7 doi: 10.3389/fmolb.2020.616341. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Zhang L., Lin D., Sun X., Curth U., Drosten C., Sauerhering L., Becker S., Rox K., Hilgenfeld R. Crystal structure of SARS-CoV-2 main protease provides a basis for design of improved α-ketoamide inhibitors. Science. 2020;368:409–412. doi: 10.1126/science.abb3405. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Lu L., Su S., Yang H., Jiang S. Antivirals with common targets against highly pathogenic viruses. Cell. 2021;184:1604–1620. doi: 10.1016/j.cell.2021.02.013. [DOI] [PubMed] [Google Scholar]
- 25.Xiong M., Su H., Zhao W., Xie H., Shao Q., Xu Y. What coronavirus 3C-like protease tells us: from structure, substrate selectivity, to inhibitor design. Med. Res. Rev. 2021;41:1965–1998. doi: 10.1002/med.21783. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Nguyen T.T.H., Jung J.H., Kim M.K., Lim S., Choi J.M., Chung B., Kim D.W., Kim D. The inhibitory effects of plant derivate polyphenols on the main protease of SARS coronavirus 2 and their structure-activity relationship. Molecules. 2021:26. doi: 10.3390/molecules26071924. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Drazic T., Kuhl N., Leuthold M.M., Behnam M.A.M., Klein C.D. Efficiency improvements and discovery of new substrates for a SARS-CoV-2 main protease FRET assay. SLAS Discov. 2021;26:1189–1199. doi: 10.1177/24725552211020681. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Ferreira J.C., Fadl S., Rabeh W.M. Key dimer interface residues impact the catalytic activity of 3CLpro, the main protease of SARS-CoV-2. J. Biol. Chem. 2022;298 doi: 10.1016/j.jbc.2022.102023. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Xue X., Yang H., Shen W., Zhao Q., Li J., Yang K., Chen C., Jin Y., Bartlam M., Rao Z. Production of authentic SARS-CoV M(pro) with enhanced activity: application as a novel tag-cleavage endopeptidase for protein overproduction. J. Mol. Biol. 2007;366:965–975. doi: 10.1016/j.jmb.2006.11.073. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Peng R., Li Z., Xu Y., He S., Peng Q., Wu L.A., Wu Y., Qi J., Wang P., Shi Y., Gao G.F. Structural insight into multistage inhibition of CRISPR-Cas12a by AcrVA4. Proc. Natl. Acad. Sci. U. S. A. 2019;116:18928–18936. doi: 10.1073/pnas.1909400116. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Chuck C.P., Chong L.T., Chen C., Chow H.F., Wan D.C., Wong K.B. Profiling of substrate specificity of SARS-CoV 3CL. PLoS One. 2010;5 doi: 10.1371/journal.pone.0013197. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Copeland R.A., Enzymes . second ed. Wiley VCH; New York: 2000. A Practical Introduction to Structure, Mechanism, and Data Analysis. [Google Scholar]
- 33.Fu L., Ye F., Feng Y., Yu F., Wang Q., Wu Y., Zhao C., Sun H., Huang B., Niu P., Song H., Shi Y., Li X., Tan W., Qi J., Gao G.F. Both Boceprevir and GC376 efficaciously inhibit SARS-CoV-2 by targeting its main protease. Nat. Commun. 2020;11:4417. doi: 10.1038/s41467-020-18233-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Data will be made available on request.




