Abstract
Acylhydrazone‐based dynamic combinatorial chemistry (DCC) is a powerful strategy for the rapid identification of novel hits. Even though acylhydrazones are important structural motifs in medicinal chemistry, their further progression in development may be hampered by major instability and potential toxicity under physiological conditions. It is therefore of paramount importance to identify stable replacements for acylhydrazone linkers. Herein, we present the first report on the design and synthesis of stable bioisosteres of acylhydrazone‐based inhibitors of the aspartic protease endothiapepsin as a follow‐up to a DCC study. The most successful bioisostere is equipotent, bears an amide linker, and we confirmed its binding mode by X‐ray crystallography. Having some validated bioisosteres of acylhydrazones readily available might accelerate hit‐to‐lead optimization in future acylhydrazone‐based DCC projects.
Keywords: acylhydrazones, aspartic proteases, bioisosteres, drug design, dynamic combinatorial chemistry
Dynamic combinatorial chemistry (DCC) enables rapid screening of functionally diverse compounds against a target, circumventing the need for individual synthesis, purification and characterization.1, 2, 3, 4, 5, 6, 7 Among many other prominent examples of DCC, reversible disulfide‐bond formation was first introduced in DCC by the groups of Still,8 Sanders,5 and Lehn9 in the late 1990s. Later on, in 1997, the group of Lehn first applied DCC to a protein target using imine formation/exchange.1 Since then, its scope and wider applicability were demonstrated on a range of biological targets. Replacement of the reversible disulfide bond with thioether (‐S‐CH2‐)10 or all‐carbon (olefin, ‐CH2‐CH2‐)11, 12, 13, 14, 15 and of the imine moiety with amines,1 an ethyl linker16 or with an amide linker17 provides stable bioisosteres with potentially preserved binding mode, making DCC an enabling tool for medicinal chemistry and drug discovery (Figure 1 a).
We chose the target protein endothiapepsin, belonging to the family of pepsin‐like aspartic proteases, which play a causative role in numerous diseases such as malaria, Alzheimer's disease, hypertension, and HIV‐1.18 Endothiapepsin is used as a representative enzyme due to its robustness, immense stability and similarity to the drug targets of the class of aspartic proteases. Moreover, it has been used as a model enzyme for mechanistic studies,19, 20, 21 as it is readily available in large quantity and crystallizes easily and importantly remains active at room temperature for more than 20 days.
We previously discovered acylhydrazone‐based inhibitors of endothiapepsin using DCC in combination with de novo structure‐based drug design, which display a promising inhibitory profile (IC50=12.8±0.4 μm).22 Acylhydrazones are considered to be important structural motifs in medicinal chemistry, as they hold the potential to interact with a range of biological targets in antiviral, anticancer and antibacterial drug discovery.
Nevertheless, there are problems associated with acylhydrazones. This class of compounds is considered by some as a member of the pan‐assay interference compounds (PAINS).23 They undergo photoinduced E/Z isomerization.24 In addition, it is important to consider also the behavior of acylhydrazones in vivo. The major setback of acylhydrazones is their lack of stability due to hydrolysis into an aldehyde and a hydrazide under acidic pH. In spite of that, hydrazone and acylhydrazone linkages are used to develop pH‐degradable drug‐delivery systems for site‐specific targeting.25 Furthermore, some acylhydrazones, like PAC‐1, are in clinical trials as a treatment for cancer.26, 27 Nevertheless, it is highly desirable to replace the labile acylhydrazone linker with stable and chemically benign analogues while maintaining the key interactions in the active site of the protein without significant changes in chemical structure.
Surprisingly, to the best of our knowledge, there are only few examples of bioisosteres of acylhydrazones,16 but no report as a direct follow‐up of a DCC experiment. In most cases, the binding mode of the bioisostere is not confirmed experimentally. Having suitable bioisosteres in hand, will establish ‘acylhydrazone‐based DCC′ as a powerful hit/lead‐identification strategy with the potential for further optimization.
Bioisosteres have been introduced as a fundamental strategy to improve the biocompatibility of the parent hit or lead compounds. As such, bioisosteres contribute to the field of medicinal chemistry, in terms of improving potency, enhancing selectivity, altering physicochemical properties, reducing or redirecting metabolism, eliminating or modifying toxicophores and acquiring novel intellectual property.28 Herein, we describe the design, synthesis, and biochemical activity of three bioisosteres of the acylhydrazone (S)‐1, the first acylhydrazone inhibitor of endothiapepsin. Importantly, unlike the parent acylhydrazone, bioisosteres (S)‐2 and (S)‐4 are not prone to hydrolysis, and all three do not liberate potentially toxic hydrazides.
We chose the X‐ray crystal structure of endothiapepsin in complex with acylhydrazone (S)‐1 (PDB ID: https://www.rcsb.org/structure/4KUP)22 as a starting point for the design of stable bioisosteres of the labile acylhydrazone moiety. Hit (S)‐1 displays an IC50 value of 12.8 μm and a ligand efficiency (LE) of 0.27. It interacts with the catalytic dyad using H‐bonding interactions (Asp35 (2.8 Å, 3.2 Å) and Asp219 (2.9 Å)) through its α‐amino group.
We designed bioisosteres using two different design approaches, namely Recore in the LeadIT suite29 and the molecular modeling software Moloc30 for molecular modeling and computation of the dipole moments. In Recore, a defined moiety of a molecule (the core) is replaced by fragments from a 3D database whilst keeping the rest of the molecule intact. To restrict the number of solutions, defined ligand‐based pharmacophore constraints can be assigned. This modeling and docking resulted in various compounds displaying heterocyclic, ester or amide linkages. Among the various heterocycles (e.g., triazole, tetrazole, oxazole (Supporting Information Figure S8)), we chose the best three compounds (Figure 1 b) based on their dipole moments, their calculated ΔG, and predicted binding modes, which are similar to those of the parent acylhydrazone (S)‐1 and synthesized them as a proof‐of‐concept study. The predicted binding modes of three representative bioisosteres in the active pocket of endothiapepsin are shown in Figure S4 (Supporting Information).
Inspection of the soaked crystal structure of endothiapepsin with acylhydrazone (S)‐1 in the active site shows that the aromatic parts of the compound such as indolyl and/or mesityl moieties are able to form π–π‐stacking interactions with the amino acid residues of the protein backbone. In all of the structures (Figure 1 b), the binding modes of the indolyl and mesityl moieties are preserved. It was computationally observed that the α‐amino groups of all bioisosteres (S)‐2‐(S)‐4 form charge‐assisted H bonds to the catalytic dyad (Asp35 and Asp 219) as well as additional H‐bonding interactions with Asp81, and Gly221. The indolyl NH forms H bonds either with Asp81 or Asp33, the NH group of the amide donates an H bond to Gly221 in (S)‐2. In addition to these, the thiazolyl ring of (S)‐4 is involved in several hydrophobic interactions with the protein backbone. The main building blocks required for the synthesis of bioisosteres (S)‐2‐(S)‐4, are N‐α‐Boc‐l‐tryptophan (5) and the 2‐mesitylene‐derived compounds (S)‐5, 8 and 11 (see Schemes S1–S4 in the Supporting Information and Scheme 1).
Very mild peptide‐coupling conditions afforded the bioisostere (S)‐2 with the amide linker, followed by deprotection of the Boc group. Starting from N‐Boc‐l‐tryptophan (5) and 2‐mesitylethanamine hydrochloride (10) in presence of the weak base carbonyldiimidazole, furnished the corresponding amide (S)‐14 in 80 % yield, and after deprotection with TFA, the test compound (S)‐2 in quantitative yield. The ester (S)‐3 was accessible through the Steglich esterification.31 We synthesized the bioisostere (S)‐4 from the building blocks thioamide (S)‐7 and ketobromide 9, which can be both accessed in two steps from N‐α‐Boc‐l‐tryptophan (5) and mesitylacetic acid (8),32, 33 respectively.
Subsequent deprotection of the Boc group of compound (S)‐12 afforded bioisostere (S)‐4 in quantitative yield. The first step to obtain thioamide (S)‐7 consists of the synthesis of amide (S)‐6 followed by thionation using Lawesson's reagent. On the other hand, using modified Arndt–Eistert reaction conditions, starting from mesitylacetic acid (8), afforded intermediate 9. To investigate the biochemical activity of the designed bioisosteres (S)‐2 to (S)‐4, we performed a fluorescence‐based assay adapted from the HIV‐protease assay (see Figures S1–S3 for the IC50 curves, Supporting Information).34
The three designed bioisosteres inhibit the activity of endothiapepsin to a different extent. The most potent inhibitor, the amide bioisostere (S)‐2, displays a K i value of 6.1 μm, very similar to the parent acylhydrazone (S)‐1 (K i=6.0 μm, Table 1). We calculated the K i values from experimental IC50 values using the Cheng–Prusoff equation.35 To verify the predicted binding mode of the bioisosteres, we soaked crystals of endothiapepsin with the most potent bioisostere (S)‐2 and determined the crystal structure of (S)‐2 in complex with endothiapepsin at 1.58 Å resolution (PDB ID: https://www.rcsb.org/structure/5OJE). The structure features clear electron density for the ligand, as shown in Figure 2 b.
Table 1.
Inhibitor | IC50 [μm][a] | K i [μm][b] | ΔG EXPT [kJ mol−1][b] | ΔG HYDE [kJ mol−1][c] |
---|---|---|---|---|
(S)‐1 | 12.8±0.4 | 6.0±0.2 | −30 | −32 |
(S)‐2 | 12.9±0.7 | 6.1±0.4 | −30 | −27 |
(S)‐3 | 28.7±4.1 | 13.5±1.9 | −28 | −28 |
(S)‐4 | 193.7±11.4 | 91.2±5.4 | −23 | −31 |
[a] Eleven different concentrations of inhibitor were used; errors are given in standard deviations (SD). [b] Values indicate the inhibition constant (K i) and the Gibbs free energy of binding (ΔG) derived from IC50 values using the Cheng–Prusoff equation.35 [c] Values indicate the calculated Gibbs free energy of binding (ΔG HYDE; calculated by the HYDE scoring function in the LeadIT suite).
Upon closer examination, the location of the ligand is similar to the docked pose shown in Figure S4 (See Supporting Information). The amino group of the ligand forms two H bonds with Asp35 (2.9 Å) and Asp219 (3.0 Å). The indolyl nitrogen atom forms an H bond with Asp81 (3.2 Å). The hydrophobic part of the indolyl moiety is engaged in hydrophobic interactions with Phe116, Leu125, Tyr79 and Gly221. The mesityl substituent is involved in hydrophobic interactions with Ile300, Ile304, Tyr226, Gly80 and Asp81. The oxygen atom of the amide linkage forms water‐mediated H bonds to the carbonyl oxygen of Gly37 and the amide nitrogen of Gly80. The mediating water molecules are conserved between the crystal structures in complex with (S)‐1 and (S)‐2 (PDB IDs: https://www.rcsb.org/structure/4KUP and https://www.rcsb.org/structure/5OJE, respectively, Supporting Information Figure S7).
The only difference compared to the docked pose is at the amide linkage. In contradiction to the computational modeling, the nitrogen atom of the amide does not form an H bond with the oxygen atom of Gly221, the distance is 4.2 Å. Instead, the hydroxy group of Thr222 acts as an H‐bond acceptor and forms an H bond (2.9 Å) with the amide nitrogen atom of the ligand, which is also shown in Figure 3.
Due to the slightly bent shape of the coordinated ligand, both aromatic groups are able to form hydrophobic interactions with one DMSO molecule, shown in Figure 2. This DMSO molecule is well‐coordinated and seems to displace several water molecules. This may be important for the stabilization of the ligand bound to the protein. A similar DMSO molecule can be observed in previous crystal structures (e.g., PDB ID: https://www.rcsb.org/structure/4KUP).22
The single bond connecting the mesityl unit to the rest of the acylhydrazone (S)‐1 is part of a conjugated system and prefers a planar orientation. It is twisted out of planarity to an unfavorable angle of 34.4° compared to the more favored angle of 107.0° as in bioisostere (S)‐2 (Supporting Information Figure S6).
The bioisostere (S)‐2, however, contains a peptidic bond in the linker, which also prefers planarity. This forces the C−N bond, its third bond, counting from the mesityl substituent, into an unfavorable torsional angle of 122° compared to the preferred 170° of the acylhydrazone (Figure S6). In conclusion, both ligands have to adopt a slightly unfavorable conformation to bind in the pocket of the enzyme, which is reflected in their binding affinities. Based on our observations, it might be difficult to design a linker with improved binding affinity, which would need to be more flexible with respect to the torsional angles, while the H‐bond donor and –acceptor functions of the peptidic nitrogen and oxygen atoms should ideally be preserved.
We report the successful replacement of the acid‐sensitive and hydrolyzable acylhydrazone linker of parent hit (S)‐1, affording stable and equipotent inhibitors of endothiapepsin. We designed and synthesized three bioisosteres and evaluated them for their inhibitory potency against endothiapepsin. Compounds (S)‐2 and (S)‐3, possessing amide and ester linkers, respectively display similar K i values as the parent hit (S)‐1, while compound (S)‐4 is an order of magnitude weaker than the parent hit. The crystal structure of amide (S)‐2 (K i=6.1 μm) in complex with endothiapepsin validates the predicted binding mode. In this proof‐of‐concept study, we identified molecular interactions that should be taken into consideration if further modifications are done to achieve a more druglike replacement for the acylhydrazone linker. Taken together, we demonstrate that acylhydrazones can be replaced without affecting the binding mode and whilst preserving the activity, demonstrating that acylhydrazone‐based DCC is a powerful tool to identify hits, which can then be optimized to stable lead compounds in a straightforward manner.
Experimental Section
Full experimental details are provided in the Supporting Information.
Conflict of interest
The authors declare no conflict of interest.
Supporting information
Acknowledgements
A.K.H.H. gratefully acknowledges funding from the Netherlands Organisation for Scientific Research (VIDI and LIFT grants), the Dutch Ministry of Education, Culture and Science (Gravitation Program 024.001.035), the European Research Council (ERC starting grant 757913), and the Helmholtz Association's Initiative and Networking Fund. Molecular graphics and analyses were performed with the UCSF Chimera package and ccp4mg.[36, 37]
V. R. Jumde, M. Mondal, R. M. Gierse, M. Y. Unver, F. Magari, R. C. W. van Lier, A. Heine, G. Klebe, A. K. H. Hirsch, ChemMedChem 2018, 13, 2266.
References
- 1. Huc I., Lehn J.-M., Proc. Natl. Acad. Sci. USA 1997, 94, 2106–2110. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Herrmann A., Chem. Soc. Rev. 2014, 43, 1899–1933. [DOI] [PubMed] [Google Scholar]
- 3. Karan C., Miller B. L., Drug Discovery Today 2000, 5, 67–75. [DOI] [PubMed] [Google Scholar]
- 4. Ramström O., Lehn J.-M., Nat. Rev. Drug Discovery 2002, 1, 26–36. [DOI] [PubMed] [Google Scholar]
- 5. Otto S., Furlan R. L. E., Sanders J. K. M., J. Am. Chem. Soc. 2000, 122, 12063–12064. [Google Scholar]
- 6. Ladame S., Org. Biomol. Chem. 2008, 6, 219–226. [DOI] [PubMed] [Google Scholar]
- 7. Mondal M., Hirsch A. K. H., Chem. Soc. Rev. 2015, 44, 2455–2488. [DOI] [PubMed] [Google Scholar]
- 8. Hioki H., Still W. C., J. Org. Chem. 1998, 63, 904–905. [Google Scholar]
- 9. Ramström O., Lehn J.-M., ChemBioChem 2000, 1, 41–48. [DOI] [PubMed] [Google Scholar]
- 10. Fotouhi N., Joshi P., Tilley J. W., Rowan K., Schwinge V., Wolitzky B., Bioorg. Med. Chem. Lett. 2000, 10, 1167–1169. [DOI] [PubMed] [Google Scholar]
- 11. Palde P. B., Ofori L. O., Gareiss P. C., Lerea J., Miller B. L., J. Med. Chem. 2010, 53, 6018–6027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Stymiest J. L., Mitchell B. F., Wong S., Vederas J. C., Org. Lett. 2003, 5, 47–49. [DOI] [PubMed] [Google Scholar]
- 13. Berezowska I., Chung N. N., Lemieux C., Wilkes B. C., Schiller P. W., J. Med. Chem. 2007, 50, 1414–1417. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14. Mollica A., Guardiani G., Davis P., Ma S. W., Porreca F., Lai J., Mannina L., Sobolev A. P., Hruby V. J., J. Med. Chem. 2007, 50, 3138–3142. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Nicolaou K. C., Hughes R., Cho S. Y., Winssinger N., Smethurst C., Labischinski H., Endermann R., Angew. Chem. Int. Ed. 2000, 39, 3823–3828; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2000, 112, 3981–3986. [Google Scholar]
- 16. Catrow J. L., Zhang Y., Zhang M., Ji H., J. Med. Chem. 2015, 58, 4678–4692. [DOI] [PubMed] [Google Scholar]
- 17. Valade A., Urban D., Beau J. M., ChemBioChem 2006, 7, 1023–1027. [DOI] [PubMed] [Google Scholar]
- 18. Cooper J. B., Curr. Drug Targets 2002, 3, 155–173. [DOI] [PubMed] [Google Scholar]
- 19. Coates L., Erskine P. T., Mall S., Gill R., Wood S. P., Myles D. A. A., Cooper J. B., Eur. Biophys. J. 2006, 35, 559–566. [DOI] [PubMed] [Google Scholar]
- 20. Coates L., Erskine P. T., Wood S. P., Myles D. A. A., Cooper J. B., Biochemistry 2001, 40, 13149–13157. [DOI] [PubMed] [Google Scholar]
- 21. Coates L., Tuan H.-F., Tomanicek S., Kovalevsky A., Mustyakimov M., Erskine P., Cooper J., J. Am. Chem. Soc. 2008, 130, 7235–7237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22. Mondal M., Radeva N., Köster H., Park A., Potamitis C., Zervou M., Klebe G., Hirsch A. K. H., Angew. Chem. Int. Ed. 2014, 53, 3259–3263; [DOI] [PubMed] [Google Scholar]; Angew. Chem. 2014, 126, 3324–3328. [Google Scholar]
- 23. Baell J. B., Ferrins L., Falk H., Nikolakopoulos G., Aust. J. Chem. 2013, 66, 1483–1494. [Google Scholar]
- 24. Van Dijken D. J., Kovaříček P., Ihrig S. P., Hecht S., J. Am. Chem. Soc. 2015, 137, 14982–14991. [DOI] [PubMed] [Google Scholar]
- 25. Sonawane S. J., Kalhapure R. S., Govender T., Eur. J. Pharm. Sci. 2017, 99, 45–65. [DOI] [PubMed] [Google Scholar]
- 26. Botham R. C., Fan T. M., Im I., Borst L. B., Dirikolu L., Hergenrother P. J., J. Am. Chem. Soc. 2014, 136, 1312–1319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27. Roth H. S., Botham R. C., Schmid S. C., Fan T. M., Dirikolu L., Hergenrother P. J., J. Med. Chem. 2015, 58, 4046–4065. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28. Meanwell N. A., J. Med. Chem. 2011, 54, 2529–2591. [DOI] [PubMed] [Google Scholar]
- 29.BioSolveIT GmbH, Sankt Augustin. http://www.biosolveit.de, LeadIT, version 2.1.3.
- 30. Gerber P. R., Müller K., J. Comput.-Aided Mol. Des. 1995, 9, 251–268. [DOI] [PubMed] [Google Scholar]
- 31. Neises B., Steglich W., Angew. Chem. Int. Ed. Engl. 1978, 17, 522–524; [Google Scholar]; Angew. Chem. 1978, 90, 556–557. [Google Scholar]
- 32. Gangjee A., Yang J., Ihnat M. A., Kamat S., Bioorg. Med. Chem. 2003, 11, 5155–5170. [DOI] [PubMed] [Google Scholar]
- 33. Kavanagh M. E., Gray J. L., Gilbert S. H., Coyne A. G., McLean K. J., Davis H. J., Munro A. W., Abell C., ChemMedChem 2016, 11, 1924–1935. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34. Toth M. V., Marshall G. R., Int. J. Pept. Protein Res. 1990, 36, 544–550. [DOI] [PubMed] [Google Scholar]
- 35. Cheng H. C., J. Pharmacol. Toxicol. Methods 2001, 46, 61–71. [DOI] [PubMed] [Google Scholar]
- 36. Pettersen E. F., Goddard T. D., Huang C. C., Couch G. S., Greenblatt D. M., Meng E. C., Ferrin T. E., J. Comput. Chem. 2004, 25, 1605–1612. [DOI] [PubMed] [Google Scholar]
- 37. McNicholas S., Potterton E., Wilson K. S., Noble M. E. M., Acta Crystallogr. Sect. D 2011, 67, 386–394. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.