Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Jan 1.
Published in final edited form as: Bioorg Med Chem Lett. 2018 Nov 9;29(1):36–39. doi: 10.1016/j.bmcl.2018.11.019

Discovery of Covalent Enzyme Inhibitors Using Virtual Docking of Covalent Fragments

Sandipan Roy Chowdhury a, Steven Kennedy a, Kai Zhu b, Rama Mishra c, Patrick Chuong a, Alyssa-uyen Nguyen a, Stefan G Kathman d, Alexander V Statsyuk a
PMCID: PMC6319256  NIHMSID: NIHMS1512958  PMID: 30455147

Abstract

Here we present a virtual docking screen of 1648 commercially available covalent fragments, and identified covalent inhibitors of cysteine protease cathepsin L. These inhibitors did not inhibit closely related protease cathepsin B. Thus, we have established virtual docking of covalent fragments as an approach to discover covalent enzyme inhibitors.

Graphical Abstract

graphic file with name nihms-1512958-f0001.jpg


Covalent fragments is a technology useful to discover covalent probes for proteins in vitro and in cellulo.1,2 There are some advantages of covalent fragments when compared to reversible fragments. First, it is easier to obtain a crystal structure of the protein with covalently bound fragment for structural optimization.3 Second, intracellular targets of covalent fragments can be easily identified using modern click chemistry methods.2 In recent years computational algorithms have been used as screening methods in drug discovery, but they face unique challenges when applied to fragments.4 Reversible fragments are often promiscuous due to their small size and weak binding affinity to their protein target (Ki μM-mM range) and display dynamic binding modes to the protein targets. This makes it challenging to screen fragments using computational approaches and to accurately predict fragment binding modes. Thus, ligand docking algorithms that have been developed to describe the binding of drug-like compounds to protein targets, may not perform well for fragments, and have to be tested separately.4 Covalent fragments on the other hand have to adopt a specific geometric orientation at the ligand binding site for the covalent reaction, which leads to reduced promiscuity and more defined binding modes. Additional n→π* interactions between the covalent fragment and the nucleophile at the protein surface may further enhance binding interactions between the covalent fragment and the protein.5 However it is currently not clear if covalent docking algorithms designed for covalent drug-like inhibitors can be used to predict the binding mode of covalent fragments and their growth vectors. Attempts to address this challenge have been done for covalent reversible cyanoacrylamide fragments, but not for irreversible covalent fragments that require stringent design rules to avoid non-specific covalent labeling.6 This paper addresses this challenge by showing that covalent docking algorithms can be used to identify covalent fragments that inhibit drug target of interest. In doing so we designed a small 1648-member library of vinylsulfones, conducted virtual docking, and discovered covalent fragments that inhibit cysteine protease cathepsin L but not related protease cathepsin B, calpain I, or deubiquitinating enzyme USP8.

First, we asked if the virtual docking of covalent fragments can correlate with the experiment. In our earlier studies we tested a 100-member library of methyl acrylates with balanced reactivity against cysteine protease papain and identified three covalent fragments that inhibited papain.1 Here we used Shrodinger CovDock program to dock a library of the same 100 fragments to the papain active site cysteine (PDB:1KHQ). In addition, we also screened three previously reported peptide papain inhibitors (Figure 1A-B).1 The docking procedure used CovDoc (Schrodinger 2018–2 version) with the distance cutoffs (8Å Cβ to the ligand, 5Å sulfur to the ligand bonding carbon) to decide if the covalent reaction can happen or a candidate pose should be kept for further consideration. From six known papain inhibitors 1–6, four papain inhibitors 1,2,4, and 6 scored favorably (Figure 1A, top 20 hits). Overall, we observed a correlation between ranking and reported kinact/Ki values: two most potent inhibitors 1 and 4 (highest kinact/Ki values) received highest rankings in CovDock. Docking poses of compounds 1, 2, 4 and 6 showed that they form a hydrogen bond with the backbone amide carbonyl of Asp158. Similar hydrogen bond between Asp158 and the glycine of ZLFG-DAM (DAM – diazomethyl) is observed in the X-Ray crystal structure of the covalent complex of papain and its covalent inhibitor.7 Compounds 1, 4, and 6 also formed hydrogen bonds with Gly66, however these hydrogen bonds seem to be not essential in the context of CovDock. This is because compound 2 did not form hydrogen bonds with Gly66 (because it is an enantiomer of 1), yet received high score and inhibited papain under experimental conditions. Nevertheless, a hydrogen bond between Gly66 and bound covalent papain inhibitors has been observed in X-ray crystal structures, and perhaps could be exploited in virtual docking. Two other compounds 3 and 5 did not receive favorable docking scores and were ranked 96th and 48th respectively (Figure S1).

Figure 1.

Figure 1.

Covalent docking of known inhibitors of papain using Shrodinger CovDock at the active site cysteine (Cys25) of papain (1KHQ:1.6 Å). A) Known inhibitors of papain with reported kinact/Ki values and their rankings in CovDoc. B) Docking poses of compounds 1, 2, 4, and 6 at the papain active site.

Next, we performed covalent docking of 1648 vinylsulfone-based covalent fragments against cysteine protease cathepsin L. Cathepsin L is well validated drug target to treat cancer, osteoporosis and autoimmune disorders, and selective covalent inhibitors of cathepsin L are known.8 Initially we faced standard challenges of a) selecting electrophile which is not hyper reactive, and b) figuring out how to link fragments and electrophile in the library. To solve these challenges, we used the design rules (or guidelines) for covalent fragment libraries that we outlined earlier.9

We turned our attention to vinylsulfone class of covalent cysteine protease inhibitors (Figure 2A). Vinyl sulfones are frequently used to inhibit cysteine proteases, they are ten-fold more reactive toward thiols than methyl acrylates, and have different geometry when compared to acrylates (tetragonal sulfur atom vs sp2-hybridized carbon).1 Thus, by conducting virtual docking on vinylsulfones we could explore different covalent fragment chemotypes, and reactivity. Specifically, we analyzed previously reported vinyl sulfones 7 and 8, which inhibit cathepsin B and L.10 These compounds have a general formula R1-C(=O)-NH-CH2-CH=CH-SO2-Ph, and would be considered prototypes for covalent fragments of this type. Compound 7 is more selective at inhibiting cathepsin L as judged by Ki values. Structural modification of R1 in 7, led to compound 8, which showed reversed selectivity (Ki values) and increased potency (kinact/Ki values) for cathepsin B. Importantly, for both compounds screened against cathepsin B and L kinact stayed within 0.17–0.36 s−1 range, suggesting that structural modifications of R1 group do not significantly increase vinylsulfone electrophile reactivity. This indicates that structural modifications of R1 group can change inhibitor selectivity and potency. Therefore, it is rational to assume that a library of covalent fragments of general formula R1-C(=O)-NH-CH2-CH=CH-SO2-Ph, where R1- is a variable fragment, will display similar variability in selectivity and potency at least when screened against cathepsins or related proteases. Given that the active site cysteine in cathepsin L is highly reactive (pKa ~3.5) most likely covalent fragment libraries such as R1-C(=O)-NH-CH2-CH=CH-SO2-Ph will display similar variations in potency and selectivity when screened against less reactive cysteines with pKa >3.5. Our own experience with acrylate based covalent fragment libraries confirm these predictions.3

One of the key features of fragments is their low molecular weight. Covalent fragments R1-C(=O)-NH-CH2-CH=CH-SO2-Ph contain phenyl group attached to sulfone, and this group adds extra molecular weight. We decided to reduce the molecular weight by replacing the phenyl group with the methyl group, which led to the general formula R1-C(=O)-NH-CH2-CH=CH-SO2-CH3. Inhibitors of this type display different selectivity/potency toward cathepsins, and therefore it is safe to design the library of covalent fragments of this type.10 We also expected that the intrinsic reactivity of covalent fragments R1-C(=O)-NH-CH2-CH=CH-SO2-CH3 will not be significantly affected by R1 group based on our previous kinetic studies (Figure 2B).1 All these considerations suggest that a library of covalent fragments R1-C(=O)-NH-CH2-CH=CH-SO2-CH3 will not have hyper reactive fragments, and R1- group will be determining the potency and selectivity of these fragments against drug targets. Because every member of this library can be rapidly synthesized from the carboxylic acid R1-COOH and corresponding amine NH2-CH2-CH=CH-SO2-CH3, we constructed a small 1648-member virtual library of covalent fragments for which carboxylic acids were commercially available.

Figure 2.

Figure 2.

Selection of electrophile for covalent fragment library. A) kinact and Ki values of previously reported inhibitors of cathepsin B and L, suggesting that vinyl sulfones are safe electrophiles. B) Pseudo-first-order reaction rates of vinylsulfones 9a-c with N-acetylcysteine methylester at pD 8.0 as measured by NMR spectroscopy.

To dock 1648 vinylsulfone ligands, we used human cathepsin L apo form crystal structure (4AXL) with 1.92Å resolution.11 Cys25 is the active site residue of cathepsin L. We first carried the non-covalent docking flowed by minimizing the covalent complex in Optimized Potential for Liquid Simulation (OPLS) force field. The complex was minimized and the average RMSD between the minimized and non-minimized ligand was found to be in between 1.5 to 2Å. This step was required to rank ligands and prioritize the compounds for synthesis. Ligand docked poses and their energetics were analyzed and out of 1648 ligands, 33 were showed good docking poses, lower complex energies, and which had the highest docking scores ≥ 6.0. Out of 33 ligands, we selected five compounds for synthesis based on pricing and structural diversity. Upon further experimental testing compound 11 (racemic mixture) showed time and dose dependent inhibition of cathepsin L (kinact = 0.0006 s−1, Ki = 146 ± 17 μM) (Figure S2). Other compounds had solubility issues. Thus, we proceeded to investigate compound 11 and its analogues.

The docking pose of compound 11 resembled those of many other covalent inhibitors of cysteine proteases (Figure 3A-B). Methylvinyl sulfone moiety forms covalent bond with the active site cysteine of cathepsin L at the S1 pocket, and sulfone moiety interacts with Q19, H163, and W189 of cathepsin L. The amide bond functionality in 11 formed two hydrogen bonds with D162 and G68 respectively (shown with dashed lines). Glycolic acid moiety and dibromophenyl group extended in the direction of S2 pocket of cathepsin L. There 2,4-dibromophenyl group formed hydrophobic contacts with L69 and M70. The key feature of fragments is the ability to have growth vectors. We overlaid crystal structures of known cathepsin L inhibitor 12 (IC50 22 nM) and compound 11 bound to cathepsin L.12 Compound 12 is a reversible covalent nitrile-inhibitor with two chlorophenyl moieties projecting toward S2 and S3 pockets respectively, where they form a combination of hydrophobic and halogen bond contacts. This structural analysis suggests that compound 11 may have potentially two growth vectors: one extending from dibromophenyl moiety toward S2 pocket, and one from the methyl group extending toward S3 pocket of cathepsin. Both of these vectors are relatively far from the vinylsulfone electrophile and chemical modifications at those sites should not inhibit the covalent labeling of the catalytic cysteine.

Figure 3.

Figure 3.

Compound 11 is a covalent inhibitor of cathepsin L. A) Docked pose of compound 11. B) Structural overlay of compound 11 (yellow) and nitrile-based inhibitor 12 (green) bound to cathepsin L active site cysteine (Cys25, PDB-2xu1). C) Analysis of compound 11 and nitrile 12, showing potential growth vectors for covalent fragment 11.

Compound 11 has a methyl group at the chiral center that we suggested can be used as a potential growth vector. To test if this methyl group provides critical contribution to binding and covalent labeling of cathepsin L, we prepared compound 13, which lacks this methyl group. This step also allowed us to eliminate the chiral center, since the corresponding acid to make 11 is available only in its racemic form. Compound 13 inhibited cathepsin L with kinact/Ki values of 5 M−1s−1 (kinact = 0.0003 s−1, Ki =59 ± 10 μM, Figure 4). Thus, the methyl group is not critical for fragment binding, and chemical modification at that position may be used to grow the fragment. Importantly, compound 13 shifts the Km of the cathepsin L substrate (Figure 4A) indicating that compound 13 is a substrate competitive inhibitor under these reaction conditions.13 Compound 13 did not inhibit other classes of cysteine proteases such as Cathepsin B, Calpain I, and USP8 with and without preincubation of the inhibitor (Figure S3). Finally, gel filtration experiments have shown that compound 13 inhibits cathepsin L irreversibly (Figure 4D). Taken together, our experiments show that virtual docking of covalent fragment libraries can be used to discover covalent fragments that can act as selective enzyme inhibitors. Judicious choice of electrophiles and their proper connection to fragments is needed to avoid hyper reactive hits.

Figure 4.

Figure 4.

Determination of Ki, kinact and reversibility studies. A) Km determination for Cathepsin L with its substrate in the presence of varying concentrations of compound 13 shows dose dependent inhibition of the enzyme and is used to determine a Ki value of 59 ± 10 μM. (B) Pseudo-first-order inhibition plots of compound 13 at varying concentrations for 15, 30, or 60 minute preincubation time. (C) The Structure of Compound 13 (D) Irreversibility studies. Compound 13 (50 μM) was preincubated with Cathepsin L (10 nM), followed by zeba spin column. Flow through aliquots were tested for enzyme activity recovery at different time points. ● DMSO controls, □ compound 13.

Our initial proof of concept studies validates virtual docking methods for covalent fragment screening, outlines the path forward to construct larger libraries of covalent fragments and to discover covalent fragment leads for a wider range of nucleophilic drug targets. The discovered covalent inhibitor 13 of Cathepsin L have weak binding to Cathepsin L (Ki =59 ± 10 μM) and slow kinact (0.0003 s−1), kinact/Ki 5 M−1s-1. How far are these values from those of pharmacologically useful probes? It is already encouraging that compound 13 displays selectivity and does not inhibit related cysteine proteases Cathepsin B, Calpain I, and USP8. Recently covalent K-RasG12C inhibitor ARS-853 has been discovered that has weak binding affinity to K-RasG12C (Ki 200 ± 90 μM) but fast kinact 0.05 s−1 (kinact/Ki 250 ± 20 M−1s−1) due to the electrophile activation by K-Ras.14 When profiled against 2,740 cellular cysteines this inhibitor displayed remarkable selectivity and covalently modified Cys12 of K-Ras and only two other proteins: FAM213A and Reticulon-4.15 Further improvement of ARS-853 has led to ARS-1620 (kinact/Ki 1100 ± 300 M−1s−1) active in vivo.16 Thus, it is rational to propose that improving kinact/Ki values of compound 13 to ~250 M−1s−1 may yield useful covalent probes with cellular activity. This can be achieved by improving the Ki of the fragment and/or by improving the kinact. Further use of virtual docking of covalent fragments to discover covalent enzyme inhibitors will be reported in the future.

Supplementary Material

1

Highlights.

  • Virtual docking of 1648 covalent fragments is performed.

  • Covalent inhibitor of Cathepsin L is identified.

  • Cathepsin L inhibitor did not inhibit USP8, Calpain I, Cathepsin B.

ACKNOWLEDGMENT

This work was supported in part by National Institutes of Health Awards R01GM115632 (to A.V.S.) and T32GM105538 (to S.G.K.), William & Ella Owens Research Foundation, and the University of Houston. A.V.S. is a Pew Scholar in the Biomedical Sciences, supported by the Pew Charitable Trusts. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health. The authors declare that they have no conflicts of interest with the contents of this article.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

REFERENCES

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1

RESOURCES