Mol Divers. 2009 Aug 15;14(2):401–408. doi: 10.1007/s11030-009-9187-z

Design of chemical libraries with potentially bioactive molecules applying a maximum common substructure concept

Michael Lisurek 1, Bernd Rupp 1, Jörg Wichard 1, Martin Neuenschwander 1, Jens Peter von Kries 1, Ronald Frank 2, Jörg Rademann 1,3, Ronald Kühne 1
PMCID: PMC7089384  PMID: 19685275

Abstract

Success in small molecule screening relies heavily on the preselection of compounds. Here, we present a strategy for the enrichment of chemical libraries with potentially bioactive compounds integrating the collected knowledge of medicinal chemistry. Employing a genetic algorithm, substructures typically occurring in bioactive compounds were identified using the World Drug Index. Availability of compounds containing the selected substructures was analysed in vendor libraries, and the substructure-specific sublibraries were assembled. Compounds containing reactive, undesired functional groups were omitted. The compounds of all sublibraries were then ranked using a diversity filter for both physico-chemical properties and substructure composition. Accordingly, a screening collection of 16,671 compounds was selected. Diversity and chemical space coverage of the collection indicate that it is highly diverse and well-placed in the chemical space spanned by bioactive compounds. Furthermore, secondary assay-validated hits presented in this study show the practical relevance of our library design strategy.
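To make the filtering and ranking steps more concrete, the following is a minimal sketch, not the procedure used in this study. It assumes each compound is represented only by a set of substructure keys, drops compounds carrying an undesired key, and orders the remainder with a greedy MaxMin rule based on Tanimoto similarity. The compound names, the keys, and the functions `tanimoto` and `rank_by_diversity` are hypothetical; the diversity filter described above also considers physico-chemical properties, which this sketch omits.

```python
from typing import Dict, FrozenSet, List

# Hypothetical representation: a compound is the set of substructure keys it contains.
Compound = FrozenSet[str]


def tanimoto(a: Compound, b: Compound) -> float:
    """Tanimoto (Jaccard) similarity between two key sets."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def rank_by_diversity(library: Dict[str, Compound],
                      undesired: FrozenSet[str]) -> List[str]:
    """Drop compounds carrying undesired keys, then order the rest greedily so
    that each pick is as dissimilar as possible to those already picked."""
    pool = {name: keys for name, keys in library.items()
            if not (keys & undesired)}
    if not pool:
        return []
    # Assumption: seed with the alphabetically first name for determinism.
    ranked = [min(pool)]
    remaining = set(pool) - {ranked[0]}
    # For each candidate, track its highest similarity to any picked compound.
    nearest = {n: tanimoto(pool[n], pool[ranked[0]]) for n in remaining}
    while remaining:
        # Choose the candidate whose closest picked neighbour is least similar;
        # ties are broken by name.
        best = min(remaining, key=lambda n: (nearest[n], n))
        ranked.append(best)
        remaining.remove(best)
        for n in remaining:
            nearest[n] = max(nearest[n], tanimoto(pool[n], pool[best]))
    return ranked


# Illustrative toy library; keys and names are made up.
library = {
    "cpd_a": frozenset({"indole", "amide"}),
    "cpd_b": frozenset({"indole", "amide", "ether"}),
    "cpd_c": frozenset({"piperazine", "sulfonamide"}),
    "cpd_d": frozenset({"indole", "acyl_halide"}),
}
ranking = rank_by_diversity(library, undesired=frozenset({"acyl_halide"}))
print(ranking)
```

In this toy example the compound with the undesired key is excluded, and the structurally unrelated compound is ranked ahead of the close analogue of the seed. A fixed-size collection could then be taken from the top of such a ranking.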

Electronic supplementary material

The online version of this article (doi:10.1007/s11030-009-9187-z) contains supplementary material, which is available to authorized users.

Keywords: Bioinformatics, Drug design, High throughput screening, Library design, Molecular diversity

Electronic Supplementary Material

Below is the Electronic Supplementary Material.

DOC 1 (DOC, 422 KB)

Acknowledgements

We thank Hans-Dieter Höltje and Victoria Higman for critical reading of the manuscript. The screening data were provided by Jörn Saupe, Samuel Beligny, Svantje Behnken and Susann Matthes. Three institutes, namely the Helmholtz Centre for Infection Research (HZI), the Max Delbrück Centrum (MDC), and the Leibniz Institut für Molekulare Pharmakologie (FMP), co-financed the described screening library, which is now available to support screening projects. Extensions to this library are currently being made by the MPI in Dortmund, the University of Oslo and the University of Konstanz.

Open Access

This article is distributed under the terms of the Creative Commons Attribution Noncommercial License, which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Contributor Information

Jörg Rademann, Email: rademann@fmp-berlin.de.

Ronald Kühne, Email: kuehne@fmp-berlin.de.


Articles from Molecular Diversity are provided here courtesy of Nature Publishing Group
