A fast, accurate, and cost-effective design of small-molecule inhibitors that target RNA using a combined NMR-fragment-based and virtual screening.
Abstract
M. tuberculosis (Mtb) is a pathogenic bacterium that causes tuberculosis, which kills more than 1.5 million people worldwide every year. Strains resistant to available antibiotics pose a significant healthcare problem. The enormous complexity of the ribosome poses a barrier for drug discovery. We have overcome this in a tractable way by using an RNA segment that represents the peptidyl transferase center as a target. By using a novel combination of NMR transverse relaxation times (T2) and computational chemistry approaches, we have obtained improved inhibitors of the Mtb ribosomal PTC. Two phenylthiazole derivatives were predicted by machine learning models as effective inhibitors, and this was confirmed by their IC50 values, which were significantly improved over standard antibiotic drugs.
A recent U.K. government review (https://amr-review.org/) on the global threat of anti-microbial resistance predicts that by 2050 2.5 million people will be at risk each year of dying from drug-resistant strains of Mtb, the causative agent of tuberculosis (TB). Two crucial stages in the ongoing effort of drug discovery – both of which are addressed in this study – are the selection of the drug target for the specific pathogen and the discovery of lead compounds. One of the major obstacles in pursuing this goal is the complexity of the bacterial ribosome as a target. We have overcome this barrier by targeting an RNA segment of the ribosomal PTC of Mtb.
The PTC is a universally conserved ribonucleotide chain that catalyzes the formation of peptide bonds during peptide chain synthesis and constitutes a highly selective drug target site within the ribosome.5 Indeed, being the heart of the gene translation machinery, most of the available antibacterial compounds that target the large (50S) ribosome subunit of bacteria act on the PTC.6,7
We have chosen a fragment-based screening (FBS)-virtual screening tandem (Fig. 1A) by virtue of the two major advantages of FBS over high throughput screening: FBS requires a method such as NMR, to detect binding,8 and fragment-based libraries cover a larger chemical space.9
We have previously used saturation transfer difference (STD) spectra to identify fragments that bind to DNA primase.15 The current study addresses the problem inherent in NMR-saturated difference spectroscopy that the low distribution of protons in nucleic acids (compared to proteins) makes it difficult to excite only the macromolecule and hence to obtain an effective transfer of magnetization to the bound fragment molecule. Here, we used T2 relaxation NMR spectroscopy for the initial screening of fragment molecules (Fig. 1A). Using the Carr-Purcell-Meiboom-Gill (CPMG) sequence, we have utilized a property that small molecules bound to larger RNA molecules adopt the relaxation time of the complex.10,11 Obtaining the transverse relaxation rates in the absence and presence of RNA allows determining which fragments bind to RNA. As a target for screening, we used the 29 mer RNA sequence that corresponds to hairpin 91 in the PTC of Mtb (Fig. 1B).
After an initial feasibility screen, we performed a full screen with 1000 compounds from the Maybridge Ro3 1000 collection using pools of 9 to 11 compounds per sample. The fragment exhibiting the largest change in T2 upon RNA binding is [2-(3-chlorophenyl)-1,3-thiazol-4-yl]methanamine (Fig. 2, Tables S1 and S2† present the summary of NMR screening results and values for the transverse relaxation time of the best hit, respectively). Out of the nine fragments with the largest changes in T2, four contain a phenylthiazole moiety. This moiety was therefore used as the scaffold for the subsequent optimization steps in the FBVS workflow (Fig. 1A).
The next step in the FBVS workflow was to select a subset of phenyl thiazol-containing molecules from a library of 230M compounds in the available ZINC database.1 This step yielded 919 molecules which were docked against the available crystal structure of the ribosomal PTC and were ranked based on their binding energy (large –ΔG, Table S3†). We exploited the available structure for the large ribosomal subunit of S. aureus3 (Fig. 1B) to extract the PTC for docking; the high sequence conservation between the PTCs of Mtb and S. aureus (>85% for the PTC and 90% for hairpin 91), allows the use of the ribosomal PTC of S. aureus as a proxy for the target in Mtb.
The inhibition effect of ten compounds with the highest BOND values (1–10, Fig. 3A) on M. smegmatis ribosomes was tested in a bacterial coupled transcription-translation assay (see Methods12) in which the expression of luciferase gene was measured. M. smegmatis ribosomes are often used to test antitubercular agent, as for the ribosomal PTC used in this study (U2673-C2836), there is 100% identity between Mtb and M. smegmatis (the identity of the whole 23S rRNA is 2873/3150, 91%). A reconstituted mycobacterial translation apparatus was obtained as described previously.13 The results indicated effective inhibition of M. smegmatis ribosomes by molecules 2 and 8, both containing piperazine in their R1 position (Fig. 3B). Notably, the IC50 values for molecules 2 and 8 were 9.1 and 2.8 μM, respectively. These values are superior to the IC50 for chloramphenicol (7.1 μM), a broad-spectrum antibiotic that binds the PTC, thereby inhibiting ribosome activity (Fig. 3B and Table S4†).
To show specific binding of molecules 2 and 8 in hairpin 91 of the ribosomal PTC an in-line probing assay was used. In-line probing assay has been used to elucidate the binding properties of a variety of metabolites and small molecules to RNA molecules.14 This assay utilizes a slow “in-line” nucleophilic attack on a phosphodiester bond in RNA by the 2′ oxygen atom of a spatially adjacent nucleoside. This selective cleavage reveals secondary structural features of the backbone RNA when the products of the cleavage are examined using gel electrophoresis. Binding of a small molecule to the RNA “locks” the RNA in an in-line rigid conformation and promotes cleavage of the linkage involved. The outcome is elevated intensities of the gel bands, indicating hotspot linkage in the RNA target. The rigidity of some linkages along the RNA makes others less likely to adopt an in-line conformation, which is indicated by reduced gel bands attributed to the cleavage sites. Such a change in the pattern of the RNA cleavage is observed in an in-line probing gel upon titration of the small molecule inhibitors, 2 and 8 (Fig. 4A and B), thus confirming the binding site of the molecules.
The docking data was subsequently used for establishing machine learning design principles for drug-sized molecules with increased binding efficiency (BOND) to the ribosomal PTC. Currently, there are no guidelines as to which modifications will be more advantageous. Instead, this step is done by an exhaustive brute-force search (medicinal chemistry). Such an approach is not scalable in the face of big-data availability. Machine learning models have been used for a variety of applications in drug discovery.15–18 To establish the design principles for drug sized molecules, we trained machine learning models on the docking data. Specifically, a feature engineering step yielded three properties for each of the 11 atom positions in the scaffold (33 features overall). The three properties MODi, DISTi, and VARi represent various geometrical features of the phenylthiazole scaffold. We trained two statistical regression models for BOND as a function of MODi, DISTi, and VARi – linear regression and random forest.19,20 The most critical features, out of the total of 33 features, for predicting BOND were extracted from each machine learning model. Specifically, locations 6 and 8 on the scaffold were found to be significant, and relevant guidelines for the structure to be attached were formed (see Fig. S4†). We ran the machine learning models on the 10 molecules, and additional antibiotics. The predicted BOND value for molecule 2 (1-[(2-methylthiazol-4-yl)methyl]-4-[(2-phenylthiazol-4-yl)methyl]piperazine) and molecule 8, (1-ethyl-4-[(2-phenylthiazol-4-yl)methyl]piperazine) as well as for six oxazolidinone antibiotics that target the PTC21 (Table S5†) were remarkably above the 85th percentile. In contrast, molecules 1, 3, and 4 present low inhibition value for translation that could be explained by weak geometrical features found by the model (see ESI Experimental†).
Overall, this study shows that T2 relaxation is an extremely efficient and sensitive screening technique for small molecules that bind RNA. The advantage of the NMR-fragment screening approach for finding inhibitors that target the PTC is that it relies on a binding assay rather than a functional assay. It is therefore not necessary to retain the peptidyl transfer activity in the PTC fragment used for the binding assay: the effect of the binding on the peptidyl transfer activity is measured at later stages of the development when drug-sized molecules have been identified.
The in silico FBVS approach enables us to reduce the number of steps in lead compound optimization.9,22 Our approach for computational optimization included virtual filtration to yield drug-sized molecules containing the scaffold found by NMR, automated docking and a machine learning prediction model, which provided a fast, economic, and efficient route to two drug-sized molecules that inhibit M. smegmatis ribosomes.
Conclusions
This study provides the basis for a complete design workflow that can be used for different RNA targets. By developing molecules that target the translation of mRNA by bacterial ribosomes to create proteins, we aim to inhibit the bacterial translational machinery and hence to eliminate the pathogenic bacteria. Our method, which combines experimental and computational techniques, provides proof of concept for the development of inhibitors that target the ribosomes of mycobacterial cells. Fragment-based screening complemented with virtual screening may yield false positives. However, this may be compensated by the relatively large number of hits that can be screened using more conventional in vitro translation assays. Machine learning can circumvent this challenge in the future, as it can be trained directly to predict functional activity (rather than binding energy), once enough data is collected.
Machine learning provided us with an efficient prediction tool for binding energy (taking milliseconds per molecule rather than minutes on the docking simulator), allowing scanning of increasingly larger datasets of potential molecules.
The two lead compounds that we discovered could potentially be passed on to the pharmaceutical industry for testing and development into new drugs for treating active TB.
Conflicts of interest
There are no conflicts to declare.
Supplementary Material
Acknowledgments
This research was supported by Grant No. 2016142 from the United States-Israel Binational Science Foundation (BSF) and the IMTI (TAMAT)/Israel Ministry of Industry–KAMIN Program, Grant No. 59081. The authors would like to thank Profs. Jack Cohen and Ira Weinstock (BGU) for helpful discussions.
Footnotes
†Electronic supplementary information (ESI) available. See DOI: 10.1039/c9sc02520k
References
- Irwin J. J., Shoichet B. K. J. Chem. Inf. Model. 2005;45:177–182. doi: 10.1021/ci049714. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goodsell D. S., Morris G. M., Olson A. J. J. Mol. Recognit. 1996;9:1–5. doi: 10.1002/(sici)1099-1352(199601)9:1<1::aid-jmr241>3.0.co;2-6. [DOI] [PubMed] [Google Scholar]
- Eyal Z., Matzov D., Krupkin M., Wekselman I., Paukner S., Zimmerman E., Rozenberg H., Bashan A., Yonath A. Proc. Natl. Acad. Sci. U. S. A. 2015;112:E5805–E5814. doi: 10.1073/pnas.1517952112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mitra K., Schaffitzel C., Fabiola F., Chapman M. S., Ban N., Frank J. Mol. Cell. 2006;22:533–543. doi: 10.1016/j.molcel.2006.05.003. [DOI] [PubMed] [Google Scholar]
- Beringer M., Rodnina M. V. Mol. Cell. 2007;26:311–321. doi: 10.1016/j.molcel.2007.03.015. [DOI] [PubMed] [Google Scholar]
- Polacek N., Mankin A. S. Crit. Rev. Biochem. Mol. Biol. 2005;40:285–311. doi: 10.1080/10409230500326334. [DOI] [PubMed] [Google Scholar]
- Agmon I. FEBS Lett. 2017;591:3252–3258. doi: 10.1002/1873-3468.12781. [DOI] [PubMed] [Google Scholar]
- Baker M. Nat. Rev. Drug Discovery. 2013;12:5–7. doi: 10.1038/nrd3926. [DOI] [PubMed] [Google Scholar]
- Singh M., Tam B., Akabayov B. Molecules. 2018;23:233. doi: 10.3390/molecules23020233. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pervushin K., Riek R., Wider G., Wuthrich K. Proc. Natl. Acad. Sci. U. S. A. 1997;94:12366–12371. doi: 10.1073/pnas.94.23.12366. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu B., Barile E., De S. K., Wei J., Purves A., Pellecchia M. Curr. Top. Med. Chem. 2015;15:2032–2042. doi: 10.2174/1568026615666150519102459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Matzov D., Eyal Z., Benhamou R. I., Shalev-Benami M., Halfon Y., Krupkin M., Zimmerman E., Rozenberg H., Bashan A., Fridman M., Yonath A. Nucleic Acids Res. 2017;45:10284–10292. doi: 10.1093/nar/gkx658. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Regulski E. E., Breaker R. R. Methods Mol. Biol. 2008;419:53–67. doi: 10.1007/978-1-59745-033-1_4. [DOI] [PubMed] [Google Scholar]
- Soukup G. A., Breaker R. R. RNA. 1999;5:1308–1325. doi: 10.1017/s1355838299990891. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vamathevan J., Clark D., Czodrowski P., Dunham I., Ferran E., Lee G., Li B., Madabhushi A., Shah P., Spitzer M., Zhao S. Nat. Rev. Drug Discovery. 2019;18:463–477. doi: 10.1038/s41573-019-0024-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Melville J. L., Burke E. K., Hirst J. D. Comb. Chem. High Throughput Screening. 2009;12:332–343. doi: 10.2174/138620709788167980. [DOI] [PubMed] [Google Scholar]
- Gonczarek A., Tomczak J. M., Zareba S., Kaczmar J., Dabrowski P., Walczak M. J. Comput. Biol. Med. 2018;100:253–258. doi: 10.1016/j.compbiomed.2017.09.007. [DOI] [PubMed] [Google Scholar]
- Carpenter K. A., Huang X. Curr. Pharm. Des. 2018;24:3347–3358. doi: 10.2174/1381612824666180607124038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kam T., Presented in part at the Proceedings of the 3rd International Conference on Document Analysis and Recognition, 1995, Montreal, QC, 14–16 August 1995. [Google Scholar]
- Ho T. K. IEEE Trans. Pattern Anal. Mach. Intell. 1998;20:832–844. [Google Scholar]
- Swaney S. M., Aoki H., Ganoza M. C., Shinabarger D. L. Antimicrob. Agents Chemother. 1998;42:3251–3255. doi: 10.1128/aac.42.12.3251. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ilic S., Akabayov S. R., Arthanari H., Wagner G., Richardson C. C., Akabayov B. Sci. Rep. 2016;6:36322. doi: 10.1038/srep36322. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.