Abstract
The 68‐kDa homodimeric 3C‐like protease of SARS‐CoV‐2, Mpro (3CLpro/Nsp5), is a key antiviral drug target. NMR spectroscopy of this large system proved challenging and resonance assignments have remained incomplete. Here we present the near‐complete (>97 %) backbone assignments of a C145A variant of Mpro (Mpro C145A) both with, and without, the N‐terminal auto‐cleavage substrate sequence, in its native homodimeric state. We also present SILLY (Selective Inversion of thioL and Ligand for NOESY), a simple yet effective pseudo‐3D NMR experiment that utilizes NOEs to identify interactions between Cys‐thiol or aliphatic protons, and their spatially proximate backbone amides in a perdeuterated protein background. High protection against hydrogen exchange is observed for 10 of the 11 thiol groups in Mpro C145A, even those that are partially accessible to solvent. A combination of SILLY methods and high‐resolution triple‐resonance NMR experiments reveals site‐specific interactions between Mpro, its substrate peptides, and other ligands, which present opportunities for competitive binding studies in future drug design efforts.
Keywords: AlphaFold-Multimers, ligand binding, NMR spectroscopy, Mpro/3CLpro, SARS-CoV-2
Sulfhydryl signals were shown to be readily observable in nuclear Overhauser enhancement spectra of perdeuterated proteins, even for partially solvent‐accessible thiol groups. For dimeric SARS‐CoV‐2 main protease, these signals proved invaluable both for achieving nearly complete NMR backbone assignment in both apo‐ and substrate‐bound states and for validating structural features.
Introduction
Thiol (‐SH) groups of Cys residues in proteins can play important structural and functional roles but are often omitted from NMR studies because the proton lacks large J couplings to stable spin‐ isotopes that are required to record triple‐resonance NMR spectra. We demonstrate that the uniquely upfield thiol chemical shifts, combined with considerable protection from solvent exchange, make these signals readily observable in a perdeuterated protein, where they provide valuable information for validating both assignments and structural models. The approach is illustrated for the main protease of the SARS‐CoV‐2 virus, termed Mpro, 3CLpro or Nsp5, a chymotrypsin‐like cysteine protease that plays an essential role in the viral replication process by excising 11 non‐structural proteins (Nsp's), including itself, from the virus’ two polyprotein chains. The H41‐C145 catalytic dyad cleaves at the recognition sequence LQ↓S, and following the success of protease inhibitors to fight other viral diseases such as AIDS and hepatitis‐C, the first FDA‐approved specific Mpro inhibitor is also showing high clinical effectiveness. [1] Mpro is only catalytically active as a homodimer, [2] with one solvent exposed active site (an oxyanion hole) per monomer. Inactivation of the monomeric enzyme is mediated by the conversion of a loop (S138‐C145) to a short 310‐helix, which collapses the oxyanion hole, [3] a feature that was shown to be reversible on re‐binding of substrate peptide with implications for Mpro liberation from the polypeptide chain. [4] The recognition sequence of Mpro is nominally five residues, including four residues prior to the cleavage site (occupying subsites S4…S1) and the first residue past the cleavage site (P1’ in subsite S1’). [5] The sequence of the SARS‐CoV‐2 variant of Mpro is 96 % identical to that of SARS‐CoV, with a high fitness cost to viral particles that develop Mpro inhibitor‐resistance. [6] The recent myriad of structure determinations by X‐ray crystallography [7] and corroboration by NMR [8] indicate that none of the 12 conservative sequence differences significantly impact the backbone structure of Mpro.
In a search for leads in Mpro‐targeted drug development, a recent NMR study of Mpro identified 38 hits. [8a] Resonance assignment of large homodimeric proteins such as Mpro proved challenging and prior backbone amide assignments were limited to ∼63 %. [8a] To prevent auto‐proteolysis of the enzyme during prolonged exposure to the high concentrations (0.2–1.1 mM) and temperature (25–35 °C) needed for NMR measurements, we focus on two Mpro active site mutants, H41Q (Mpro H41Q) and C145 A (Mpro C145A). Furthermore, NMR study of this 68‐kDa homodimeric protein requires deuteration of the non‐exchangeable hydrogens in combination with TROSY‐based observation of the back‐exchanged amide protons. [9] Here, we take advantage of the 11 Mpro C145A Cys sulfhydryl protons (1Hγ) that also exchange with solvent, as well as Mpro‐substrate interactions, to validate and extend the Mpro assignments to essential completeness (>97 %) and demonstrate that the interaction with its N‐terminal autocleavage substrate extends well beyond the canonical five substrate‐binding subsites S4–S1’.
Results and Discussion
The Mpro NMR assignment process proved considerably more challenging than expected (Figure 1; Supporting Information Experimental Section). Not only was a precarious partial‐refolding step necessary to fully re‐protonate the backbone amide groups after expression in D2O (Figure S1, see Supporting Information text), but resonance broadening from conformational and solvent exchange (particularly for residues near the active site; Figure S2,S3), as well as conformational heterogeneity involving the presence of both cis and trans isomers of P184 (Figure S4) posed major challenges to software [10] that usually makes the resonance assignment step straightforward. Nevertheless, nearly complete backbone assignments of Mpro C145A, Mpro H41Q, and the complex with its N‐terminal autocleavage substrate analogue, Mpro C145A:SAVLQSGFRK, were achieved (BMRB: 51455, 51456, Tables S1–S4, Supporting Information Figures S5–S7).
For dimethylsulfoxide (DMSO) and weakly binding ligands such as boceprevir [11] (Figures S8 and S9), [12] the site of interaction on the protein is commonly identified by observing chemical shift perturbations (CSPs) of protein resonances, [13] which can be correlated with ligand observations. [14] The absence of CSPs is commonly used to demonstrate the lack of interaction ‐ as applies for ivermectin, which was proposed to act as a Mpro inhibitor based on computational modeling [15] (Figure S10). For ligands where the exchange rate between free and bound species is comparable to (or slower than) the difference in protein chemical shift between the apo and ligated states, CSPs can be difficult to ascertain, particularly for larger proteins with crowded spectra. In this case, Nuclear Overhauser effects (NOEs) between ligand and protein can be used instead. For perdeuterated proteins, the sensitivity of this NOE‐based approach can be strongly enhanced by using a band‐selective pulse to simultaneously invert all upfield resonances (+3.5 to −0.7 ppm), representing most of the ligand's aliphatic hydrogens, and monitoring NOE transfer of their magnetization to protein backbone amide protons. [16] Recovery of longitudinal 1H magnetization in large, perdeuterated proteins such as Mpro is dominated by efficient 1H−1H cross relaxation with hydroxyl and amide protons that rapidly exchange with the water signal, which is kept along the z‐axis by the TROSY pulse scheme, [9b] permitting short delays between scans (Table S6). The band‐selective pulse also inverts protein sulfhydryl resonances (random coil chemical shift ca 1.9 ppm), prior to readout by the highly sensitive 2D 1H15N‐TROSY‐HSQC pulse sequence, and therefore include NOE interactions to Cys‐1Hγ resonances. [9b] We refer to this method as “Selective Inversion of thioL and Ligand for NOESY” (SILLY), and the difference of spectra collected with‐ and without‐ the band‐selective inversion pulse yields the final SILLY‐TROSY spectrum. When applying the SlLLY method to Mpro C145A in the presence of a 10‐fold excess of the substrate analogue tetrapeptide N‐acetyl‐Val‐DTyr‐Leu‐Gln (VyLQ), the SILLY‐TROSY spectrum shows many amides whose CSPs are challenging to resolve without NOE‐editing (Figure 2). Because VyLQ binds the protein with moderate affinity (Kd ≈80 μM; Figure S11, Table S5) and the exchange between free and bound ligand is on the intermediate NMR time scale, an excess of ligand and a relatively long mixing time of 500 ms were used to maximize transfer of ligand magnetization to the protein, analogous to transferred NOE experiments. [17] In contrast, magnetization from thiol protons is not replenished by exchange with free ligand and decays rapidly with increasing mixing times due to cross relaxation. NOEs from thiol to amide protons are therefore more intense at shorter mixing times such as 70 ms, where interactions are observed for the majority of Cys residues and their immediate neighbors in Mpro C145A (Figure S12). Such signals potentially provide valuable “anchors” during the early stage of sequential assignment and validation markers at a later stage, in particular for regions where sequential connectivities are sparse. Recording of the spectrum in the absence of ligand is essential to distinguish SILLY‐TROSY resonances originating from thiol and unusually upfield shifted hydroxyl protons from ligand protons. In cases where perdeuteration is less complete than achieved in our work (ca 98 %), weak signals resulting from NOEs to residual aliphatic sidechains may also appear in the SILLY‐TROSY spectrum.
A more detailed view of the NOE interactions with the thiol signals can be obtained from three‐dimensional (3D) 15N‐separated NOE spectra, recorded for an Mpro C145A sample that contained the peptide N‐acetyl‐SAVLQSGFRK (SAVLQSGFRK). This auto‐substrate peptide contains the Mpro N‐terminal cleavage site required to excise itself from the SARS‐CoV‐2 precursor pp1a, where S6peptide corresponds to the N‐terminal residue of mature Mpro. Affinity of Mpro for this peptide is high (Kd≤4±2 μM) and NOE interactions extend from the peptide's N‐terminus to R9peptide (Figure S13) validating both a PDB X‐ray structure, [4] and AlphaFold‐Multimer (AF−M) [18] models of this complex (Figures S14, S15). Strips taken through such spectra at the 15N/1H frequencies of selected amide groups then reveal the thiol‐1Hγ chemical shifts (Figure 3). Such resonances are valuable markers for validating resonance assignments, in particular for proteins such as Mpro for which good structural models are available. [19] For example, C265−1Hγ shows NOEs to residues A260‐K269, spanning nearly its entire α‐helix (Figure 3A). Whereas most thiol NOEs are observed to amides proximate in sequence to each Cys, long‐range NOEs are also observed. For example, NOEs from C300‐1Hγ in the protein's C‐terminal α‐helix show proximity to N‐terminal residues G2 and F3 that are essential to Mpro dimerization and catalytic activity (Figure 3B). Other examples include L87, which pairs with C38 in an antiparallel β‐sheet and shows a strong NOE to the C38 thiol, and F112 which shows a strong NOE to the thiol of its β‐sheet partner C128 (Figure 3C, 3D). Many more such interactions can be seen in the full set of strips (Figures S16, S17) and are invaluable to validate the backbone resonance assignments, typically based on triple‐resonance 3D experiments such as HNCA (Figure S18).
The observation of sharp resonances for 10 of the 11 Cys sulfhydryl resonances in Mpro C145A is perhaps surprising considering that thiol protons exchange with solvent at rates that are catalyzed by both OH− and phosphate and the intrinsic hydrogen exchange (HX) rate under the conditions of our sample is larger than 104 s−1. [21] Many of the thiols are buried in the protein interior and appear engaged in structure‐stabilizing hydrogen bonds. With an S−H bond length of 1.34 Å and an Cβ−Sγ−Hγ bond angle of ∼95°–100°, [22] thiols can act as donor for one H‐bond and acceptor for two. The sulfhydryl can also engage in interactions between an aromatic π donor‐orbital and the S−H σ* acceptor‐orbital, [22] as appears to apply for C128‐Hγ interacting with F112‐Cζ (Figure 3D). Because the proton in such interactions is situated above the ring carbon, the ring current contribution to its chemical shift (ca −1.3 ppm) is smaller than if positioned directly above the center of the aromatic ring, and very sensitive to its precise location. For several of the other Mpro thiols, the pattern of thiol‐stabilizing interactions is less clear cut. For example, with a gauche + χ1 rotamer, the C160 thiol appears to donate a third H‐bond to G149‐O, which already accepts two backbone H‐bonds from the C160 and Y161 amide groups. However, a strong NOE interaction from C160‐Hγ to Y161‐HN in the 70‐ms mixing time SILLY‐TROSY spectrum (Figure S12) is consistent with this geometry.
The observation of strong cross peaks to thiol resonances in the 3D NOESY spectrum (Figure 3A, S17), which employed a 200 ms NOE mixing time, indicates that solvent exchange of these labile protons cannot be much faster than 5 s−1. The only thiol resonance missing from the 3D spectrum (Figure S17) is that of partially solvent‐exposed C22, whose intensity may also be adversely impacted by its trans χ1 rotamer, resulting in a larger intraresidue HN‐Hγ distance; rapid exchange with solvent of G23‐HN, which is close to C22‐Sγ in the X‐ray structure, [19] prevents observation of this NOE. Other surface‐exposed sulfhydryls include C85, C156, and C160 which show strong NOE interactions to one or more backbone amides, indicating that intramolecular H‐bonds strongly protect these thiol protons from exchange with solvent.
The Hγ chemical shifts, seen in the 3D NOESY spectrum, allow multiple amides to be linked to individual thiol resonances, providing information not available from the SILLY‐TROSY 2D spectrum. However, the sensitivity of the 2D spectrum is considerably higher, as reflected in a more than 20‐fold shorter measurement time.
Conclusion
The importance of the local structure around Cys thiol groups, which due to their reactive chemistry can play important functional roles, is well recognized. [23] The positions of hydrogen atoms are typically inaccessible in X‐ray diffraction data and instead their presence is commonly modeled based on nearby H‐bond acceptor atoms, which can often be ambiguous. The clear appearance of sulfhydryl signals in NOE spectra of perdeuterated proteins provides a straightforward avenue for studying these reactive groups in more detail, while simultaneously aiding protein resonance assignment. For Mpro, many of the thiols are close to aromatic moieties, leading to upfield ring current shifts (e. g. C128‐F112; C160‐F150; C265‐F230) or, when located near the plane of such rings, to decreased chemical shielding (C85‐F181; C300‐F3), so that chemical shifts provide another sensitive tool for probing local structure, in addition to isotope effects. [23] The nearly complete backbone assignment of Mpro together with the introduction of two active site binding peptides with affinities of ca 4 and 80 μM can facilitate competitive binding NMR assays in further drug development.
Experimental Section
The materials and methods used in our study can be found in the associated Supporting Information.
Conflict of interest
The authors declare no conflict of interest.
1.
Supporting information
Acknowledgements
The authors thank Drs. Joseph M. Courtney, Yang Shen, James Baber and Dennis A. Torchia for helpful discussions. This work was supported by the Intramural Research Program of the NIDDK of the USA National Institutes of Health.
A. J. Robertson, J. Ying, A. Bax, ChemBioChem 2022, e202200471.
Data Availability Statement
The backbone chemical shifts for Mpro C145A (BMRB: 51455) and for Mpro C145A:SAVLQSGFRK (BMRB: 51456) complexes have been deposited in the BioMagResBank (http://www.bmrb.wisc.edu/), and additional assignments are presented in the Supporting Information.
References
- 1. Günther S., et al., Science 2021, 372, 642–646. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.
- 2a. Yang H., Yang M., Ding Y., Liu Y., Lou Z., Zhou Z., Sun L., Mo L., Ye S., Pang H., Gao G. F., Anand K., Bartlam M., Hilgenfeld R., Rao Z., Proc. Natl. Acad. Sci. USA 2003, 100, 13190–13195; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2b. Chou C.-Y., Chang H.-C., Hsu W.-C., Lin T.-Z., Lin C.-H., Chang G.-G., Biochemistry 2004, 43, 14958–14970. [DOI] [PubMed] [Google Scholar]
- 3.
- 3a. Shi J., Sivaraman J., Song J., J. Virol. 2008, 82, 4620–4629; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3b. Hu T., Zhang Y., Li L., Wang K., Chen S., Chen J., Ding J., Jiang H., Shen X., Virology 2009, 388, 324–334. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4. Chen S., Jonas F., Shen C., Hilgenfeld R., Protein Cell 2010, 1, 59–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.
- 5a. Anand K., Ziebuhr J., Wadhwani P., Mesters J. R., Hilgenfeld R., Science 2003, 300, 1763–1767; [DOI] [PubMed] [Google Scholar]
- 5b. Muramatsu T., Takemoto C., Kim Y. T., Wang H., Nishii W., Terada T., Shirouzu M., Yokoyama S., Proc. Natl. Acad. Sci. USA 2016, 113, 12997–13002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Deng X., StJohn S. E., Osswald H. L., O′Brien A., Banach B. S., Sleeman K., Ghosh A. K., Mesecar A. D., Baker S. C., J. Virol. 2014, 88, 11886–11898. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Jin Z., Du X., Xu Y., Deng Y., Liu M., Zhao Y., Zhang B., Li X., Zhang L., Peng C., Duan Y., Yu J., Wang L., Yang K., Liu F., Jiang R., Yang X., You T., Liu X., Yang X., Bai F., Liu H., Liu X., Guddat L. W., Xu W., Xiao G., Qin C., Shi Z., Jiang H., Rao Z., Yang H., Nature 2020, 582, 289–293. [DOI] [PubMed] [Google Scholar]
- 8.
- 8a. Cantrelle F. X., Boll E., Brier L., Moschidi D., Belouzard S., Landry V., Leroux F., Dewitte F., Landrieu I., Dubuisson J., Deprez B., Charton J., Hanoulle X., Angew. Chem. Int. Ed. 2021, 60, 25428–25435; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8b. Robertson A. J., Courtney J. M., Shen Y., Ying J., Bax A., J. Am. Chem. Soc. 2021, 143, 19306–19310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.
- 9a. Pervushin K., Riek R., Wider G., Wüthrich K., Proc. Natl. Acad. Sci. USA 1997, 94, 12366–12371; [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9b. Pervushin K. V., Wider G., Wuthrich K., J. Biomol. NMR 1998, 12, 345–348. [DOI] [PubMed] [Google Scholar]
- 10. Schmidt E., Guntert P., J. Am. Chem. Soc. 2012, 134, 12817–12829. [DOI] [PubMed] [Google Scholar]
- 11. Fu L., Ye F., Feng Y., Yu F., Wang Q., Wu Y., Zhao C., Sun H., Huang B., Niu P., Song H., Shi Y., Li X., Tan W., Qi J., Gao G. F., Nat. Commun. 2020, 11, 4417. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Keiffer S., Carneiro M. G., Hollander J., Kobayashi M., Pogoryelev D., Ab E., Theisgen S., Muller G., Siegal G., J. Biomol. NMR 2020, 74, 521–529. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13. Schoenle M. V., Li Y., Yuan M., Clarkson M. W., Wilson I. A., Peti W., Page R., J. Am. Chem. Soc. 2021, 143, 7930–7934. [DOI] [PubMed] [Google Scholar]
- 14. Sreeramulu S., Richter C., Berg H., Wirtz Martin M. A., Ceylan B., Matzel T., Adam J., Altincekic N., Azzaoui K., Bains J. K., Blommers M. J. J., Ferner J., Furtig B., Gobel M., Grun J. T., Hengesbach M., Hohmann K. F., Hymon D., Knezic B., Martins J. N., Mertinkus K. R., Niesteruk A., Peter S. A., Pyper D. J., Qureshi N. S., Scheffer U., Schlundt A., Schnieders R., Stirnal E., Sudakov A., Troster A., Vogele J., Wacker A., Weigand J. E., Wirmer-Bartoschek J., Wohnert J., Schwalbe H., Angew. Chem. Int. Ed. 2021, 60, 19191–19200. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Eweas A. F., Alhossary A. A., Abdel-Moneim A. S., Front. Microbiol. 2020, 11, 592908. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Weinhäupl K., Lindau C., Hessel A., Wang Y., Schütze C., Jores T., Melchionda L., Schönfisch B., Kalbacher H., Bersch B., Rapaport D., Brennich M., Lindorff-Larsen K., Wiedemann N., Schanda P., Cell 2018, 175, 1365–1379.e1325. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17. Anglister J., Srivastava G., Naider F., Prog. Nucl. Magn. Reson. Spectrosc. 2016, 97, 40–56. [DOI] [PubMed] [Google Scholar]
- 18. Evans R., O'Neill M., Pritzel A., Antropova N., Senior A., Green T., Žídek A., Bates R., Blackwell S., Yim J., Ronneberger O., Bodenstein S., Zielinski M., Bridgland A., Potapenko A., Cowie A., Tunyasuvunakool K., Jain R., Clancy E., Kohli P., Jumper J., Hassabis D., bioRxiv 2021, 10.1101/2021.10.04.463034. [DOI] [Google Scholar]
- 19. Douangamath A., Fearon D., Gehrtz P., Krojer T., Lukacik P., Owen C. D., Resnick E., Strain-Damerell C., Aimon A., Abranyi-Balogh P., Brandao-Neto J., Carbery A., Davison G., Dias A., Downes T. D., Dunnett L., Fairhead M., Firth J. D., Jones S. P., Keeley A., Keseru G. M., Klein H. F., Martin M. P., Noble M. E. M., O′Brien P., Powell A., Reddi R. N., Skyner R., Snee M., Waring M. J., Wild C., London N., von Delft F., Walsh M. A., Nat. Commun. 2020, 11, 5047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20. Geen H., Freeman R., J. Magn. Reson. 1991, 93, 93–141. [Google Scholar]
- 21. Chen J., Yadav N. N., Stait-Gardner T., Gupta A., Price W. S., Zheng G., NMR Biomed. 2020, 33, e4188. [DOI] [PubMed] [Google Scholar]
- 22. Forbes C. R., Sinha S. K., Ganguly H. K., Bai S., Yap G. P. A., Patel S., Zondlo N. J., J. Am. Chem. Soc. 2017, 139, 1842–1855. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23. Takeda M., Jee J., Terauchi T., Kainosho M., J. Am. Chem. Soc. 2010, 132, 6254–6260. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
The backbone chemical shifts for Mpro C145A (BMRB: 51455) and for Mpro C145A:SAVLQSGFRK (BMRB: 51456) complexes have been deposited in the BioMagResBank (http://www.bmrb.wisc.edu/), and additional assignments are presented in the Supporting Information.