Skip to main content
Elsevier - PMC COVID-19 Collection logoLink to Elsevier - PMC COVID-19 Collection
. 2005 Oct 5;1:155–168. doi: 10.1016/S1574-1400(05)01011-X

Capter 11 Filtering in Drug Discovery

Christopher A Lipinski 1
PMCID: PMC7128845  PMID: 32288696

Publisher Summary

This chapter discusses the concept of filtering in drug discovery. Multiple filters may be incorporated into a definition of drug-likeness and this leads to tradeoffs among compound properties in compounds intended for screening. The optimization of compound properties may require some type of multiparameter optimization scheme in library design. Fingerprint algorithms can be used to guide diversity. Filters also need to be employed in the chemistry synthesis planning process so that good quality compounds are made. Differences in property ranges between oral and injectable drugs are summarized in the chapter. Oral drugs are lower in MWT and have fewer H-bond donors, acceptors, and rotatable bonds. A scheme for separating central nervous system (CNS)- from non-CNS-active drugs in the WDI allowed the discovery of simple parameters relating to passive blood brain barrier (BBB) permeability and the prediction of p-glycoprotein (PGP) affinity. The PGP transporter is a major barrier to the entry of compounds to the CNS. Appropriately determined PGP efflux ratios can be used as a measure of compound affinity to PGP.


Recent advances in filtering in drug discovery are reviewed. Filtering is used in a global broad sense to include exclusionary as well as inclusionary criteria and encompasses the following topics related to the discovery of drugs: (1) computational definitions of drug-likeness; (2) positive filters for drug-like activity; (3) lead-likeness as a concept; (4) oral activity filters; (5) CNS drug filters; (6) intestinal permeability filters; (7) drug metabolism parameter filters; (8) promiscuous compound filters; and (9) agrochemical filters. This review does not cover what could be termed as local parameters such as pharmacophore models, docking and scoring, etc.

1. Drug-likeness

To this author there seems to be much more agreement as to what is drug-like than there is as to what is diverse. Defining what is drug-like and non-drug-like requires some type of reference point. The property distributions of commercially available databases have been examined. The available chemicals directory (ACD) seems to be the most common non-drug-like database. The pesticide manual has been used as an alternate standard for non-drug-likeness because it is composed primarily of compounds designed to cause fatality to the primary organism. The Comprehensive Medicinal Chemistry (CMC), Derwent Word Drug Index (WDI) and Modern Drug Data Report (MDDR) are among the more commonly used drug-like databases [1], [2]. In addition, the 10,000 or so Phase II compounds are used to define drug-like compounds [3]. A compound's drug-like index has been calculated based upon the knowledge derived from known drugs selected from the CMC database [4]. The property distributions in combinatorial compounds compared to drugs or natural products largely reflect combinatorial chemistry synthesis constraints such that there are fewer chiral centers and complex ring systems [5]. The distribution of ring systems across multiple databases has been described [6] and a program was written and tested on the MDDR database [7] to identify candidate chemical ring replacements (bioisosteres). From the study of a database of commercially available drugs it is clear that the diversity of molecular framework (ring) shapes is extremely low. The shapes of half of the drugs in the database are described by the 32 most frequently occurring frameworks [8]. The diversity that side chains provide to drug molecules is quite low since only 20 side chains account for over 70% of the side chains [9]. Defining drug-like by what exists in databases leads to the criticism that most of chemistry space will be undefined and that discovery opportunities in unexplored chemistry space will be limited. A solution is to populate chemistry space with non-drug-like markers akin to the way point in a GPS navigation system [10].

Multiple filters (properties) may be incorporated into a definition of drug-likeness and this leads to trade-offs among compound properties in compounds intended for screening [11]. Optimization of compound properties may require some type of multi-parameter optimization scheme in library design [12]. Fingerprint algorithms can be used to guide diversity [13]. Filters also need to be employed in the chemistry synthesis planning process so that good quality compounds are made [14]. Differences in property ranges between oral and injectable drugs have been summarized [15]. Oral drugs are lower in MWT and have fewer H-bond donors, acceptors and rotatable bonds. Property profiles of oral drugs are independent of the year in which the drug was approved to market and to some degree independent of target. Polar surface area (PSA) in one definition is the solvent accessible surface covered by oxygen, nitrogen and the hydrogens attached to oxygen and nitrogen. As a compound progresses through clinical trials there is a steady change in properties, e.g., MWT, Log  P and PSA all decline with a MWT of about 340 found for marketed drugs [16], [17]. The reason for this pattern is unclear since properties related to oral absorption would be expected to have reached a plateau by Phase II and hence selection pressure for properties related to oral absorption should have disappeared by then [18]. Pulmonary drugs tend to have higher PSA because pulmonary permeability is less sensitive to polar hydrogen-bonding functionality [19]. Anatomically this makes sense since lungs are a closed compartment and any accumulating fluid and compounds in terminal alveoli must be cleared. Discrimination between antibacterial and non-antibacterial activity has been achieved based on 3D molecular descriptors. The overall classification rate was around 90% on a data set of 661 compounds using 2–3 variables selected from log  P, charged-weighted negative surface area, positive surface area of heavy atoms and maximum donor delocalizability. Three-dimensional geometry variations had little impact on the discriminatory performance [20].

2. Drug-likeness and chemistry quality

Descriptors for drug-like are most effective if they have physical meaning so as to facilitate chemists designing in drug-likeness [21]. Drug-likeness in the design of combinatorial libraries [22], [23] involves the use of rule-based filters like the rule of 5 [24], the use of exclusionary filters to remove undesired chemistry functionality [25] and the capture of privileged structure information, e.g., from natural product collections [26] or from retro synthetic analysis of collections of bioactive molecules [27]. Natural product structural features are particularly well represented in the cancer chemotherapy and infectious disease areas [28]. Exclusionary filters have been described that remove reactive chemical functionality based on the premise that compounds having covalent chemistry possibilities have no place in drug discovery [29]. Filters are also necessary to remove cross reactivity in pooled compounds [30]. Pooling is a procedure in which single well-characterized compounds are deliberately mixed to speed screening. Components of the mixture must neither contain structural features causing assay false positives nor must they contain common substructural elements that would confuse the deconvolution of activities of the individual components. The magnitude of the number of poor quality screening compounds is emphasized by the report that only 37% of 1.6 million unique commercially available compounds are drug-like [31]. A very similar result was found in a virtual screen for SARS-CoV protease against commercially available and academic compounds. Of the 0.07% virtual hits against 3.6 million compounds, 47% failed three or more of 13 druggability criteria [32]. The criteria were based on physical, chemical and structural properties. Providing high-quality chemistry subject matter is now supported under the NIH molecular library small molecule repository initiative which aims to collect one million drug-like molecules from commercial, industry and government sources [33]. Emphasizing the point that drugs must contain adequate functionality to achieve acceptable receptor interactions, a single filter separates drug-like from non-drug-like compounds based on the observation that non-drugs are often under-functionalized [34].

3. Positive desirable chemistry filters

Privileged structures, e.g., benzodiazepines are recurring structures active against targets unrelated by target family. They can be viewed as molecular filters selecting for desirable chemistry subject matter. As such they are rich sources for screening libraries and have recently been reviewed [35]. Privileged structure features have been employed in the combinatorial design of GPCR libraries [36], in the combinatorial synthesis of privileged bicyclic structures [37] and in the combinatorial synthesis of cyclic peptides [38]. Homology modeling suggests a parallelism between common privileged GPCR ligand features and complementary deeply buried protein features in class A GPCRs [39]. Grouping by target family is also another method helping focus on particular target-directed privileged structures [40]. The idea is that structurally similar target family members will bind structurally similar small molecule ligands [41]. NMR screening helps identify privileged protein binding elements albeit of smaller size [42]. Although not strictly speaking a privileged structure, privileged structural elements such as the hydroxamate moiety found in many metalloprotease inhibitors can be identified [43]. Discernment of privileged structures has historically largely been a data mining exercise. However, very similar recurring structural motifs, so-called ‘molecular anchors’ have been described based on structure-based ligand binding considerations [44], [45]. Rigid small molecule ligands (the molecular anchors) are incapable of hydrophobic collapse and a single non-collapsible ligand conformer binds at a protein cavity site which is also often incapable of hydrophobic collapse. This concept explains the frequent occurrence of non-collapsible spiro structures in privileged structures/molecular anchors. Chemistry design principles directed to the very difficult goal of small molecule interference with protein–protein interactions via an allosteric interaction have been described [46]. An intriguing aspect is the hypothesis that chemistry emphasis should be placed on compound cores capable of interacting with relatively fixed protein hinge regions rather than on elaboration of lipophilic side chains attached to the core. The thermodynamic penalty attendant to ligand binding to a non-lowest energy protein conformer suggests that screening should allow for slow binding with adequate assay equilibration time. An implication is that for this type of target it is better to make larger numbers of smaller libraries than fewer numbers of large libraries. This trend to smaller libraries is now well documented [47]. Taken to its extreme this approach takes the typical dense chemistry space coverage of the traditional combinatorial library (target-oriented synthesis) towards the direction of the diversity-oriented synthesis approach to chemistry lead generation which populates diverse single molecules broadly through chemistry space [48]. This direction is of course in the direction of less efficient, more difficult chemistry. The focus on biological information content richness suggests natural products as combinatorial library starting points [49]. Chemical content richness is found in compounds produced by multicomponent reactions (MCR) which are chemical transformations in which as many as four components form a new compound in a single chemistry reaction step. The Ugi reaction of a carboxylic acid, amine, aldehyde and isonitrile is a classic example. In theory while offering an efficient approach to synthesis of diverse compounds, MCR in high-throughput mode currently suffers from significant chemistry limitations [50].

4. Lead-likeness

The difference between drug-like and lead-like has been described [51]. Leads are less complex in most parameters than drugs, which is understandable in that medicinal chemistry optimization almost invariably increases MWT and Log  P [52]. However, the structural resemblance between a starting lead and a drug is marked [53]. The implication is that a quality lead as opposed to a flawed lead is far likely to lead to a real drug [54]. Lead-like discovery also refers to the screening of small molecule libraries with detection of weak affinities in the high micromolar to millimolar range. The process usually by itself does not lead to an acceptable chemistry starting point. Something else has to be added after the primary screen. Generally, multiple small molecules do not bind to non-adjacent target sites [55], so the screening is that of small MWT singletons. However, binding of two components to the same receptor site is possible as attested by the discovery of sub-nanomolar ligands in what is termed click chemistry [56]. In this process an acetylene and azide terminus from two receptor site independently bound molecules cyclize to a single compound with the two components linked via a 1,2,3 triazole ring. Filtering in the context of lead-like small molecule screening implies control of the properties of drug starting points that eventually result from this process. A rule of three [57] has been coined for small molecule fragment screening libraries; MWT<300; Log  P<3; H-bond donors and acceptors<3 and rotatable bonds<3. Small fragment screening can be by NMR [58], [59], [60], by X-ray [61], [62], or in theory by any method capable of detecting weak interactions.

5. Oral drug activity

The topic of filtering in human therapeutic drug discovery has received numerous frequent reviews [23], [63], [64], [65] as well as criticism if fundamental medicinal chemistry principles are neglected [66]. The ‘rule of 5’ describes four simple parameters associated with improved prospects for oral activity. Poor solubility or poor permeability are more likely if there are >5 H-bond donors (expressed as sum of OH and NH); >10 H-bond acceptors (expressed as sum of O+N); MWT>500 and Log  P>5. There are only four rules. The 5 in rule of 5 arises from the frequent appearance of a 5 in the cutoff parameters. Compounds classes such as natural products, infectious disease drugs, etc. where transporter affinity is prevalent are exceptions [24]. Rotatable bond count is now a widely used filter following the finding that greater than 10 rotatable bonds correlates with decreased rat oral bioavailability [67]. The mechanistic basis for the rotatable bond filter is unclear since the rotatable bond count does not correlate with in vivo clearance rate in the rat but the filter is reasonable from an in vitro screening viewpoint since ligand affinity on average decreases 0.5 kcal for each two rotatable bonds [68]. Compounds indexed in medicinal chemistry journals show the recent trend towards poor properties. Over 50% of medicinal chemistry compounds with activities above 1 nM have MW>425, Log  P>4.25 and Log  Sw<−4.75, indicating that these compounds are larger, more hydrophobic and less soluble when compared to time-tested quality leads [52]. The concept of the importance of compound properties (e.g., rule of 5 compliance) beyond potency is widely accepted [69]. although there are notable occasional exceptions where an orally bioavailable compound is found that lies well outside the rule of 5 limits [70]. Can the rule of 5 be bypassed by delivering drug by a non-oral route, e.g., pulmonary, intra-nasal or dermal? The answer depends very much on the dose. If the total dose is 20 mg or less then alternative delivery routes begin to be feasible. However, a limitation is that only about 10% of current clinical candidates have sufficient potency in the 0.1 mg/kg range to result in such a low dose and finding such very potent compounds seems to be mostly a matter of luck [71]. Beyond chemistry-based features, oral drugs can also be defined by their biological target. It is striking that the 100 best selling (mostly oral) drugs are ligands for proteins encoded by only a very small subset of genes and that a very considerable portion of the targets for orally active drugs may have already been discovered [72]. The term ‘druggable genome’ has been coined to describe the severe restriction that chemistry considerations related to oral activity superimpose on possible biology target space [73].

6. CNS drugs

A scheme for separating CNS from non-CNS active drugs in the WDI allowed discovery of simple parameters relating to passive blood brain barrier (BBB) permeability and prediction of p-glycoprotein (PGP) affinity [74]. The PGP transporter is a major barrier to the entry of compounds to the CNS [75]. Appropriately determined PGP efflux ratios can be used as a measure of compound affinity to PGP. However, the value of filters based on PGP efflux ratios from the commonly used high-throughput mode Caco-2 colonic cell permeability cell culture assay have been questioned as efflux ratios do not correlate with in vivo rat brain penetration [76]. A PSA value of less than 60–70 Å2 tends to identify CNS active compounds [77]. A very simple set of two rules predicts CNS activity: If N+O (the number of nitrogen and oxygen atoms) in a molecule is less than or equal to 5, it has a high chance of entering the brain. The second rule predicts that if log  P−(N+O) is positive then the compound is CNS active [78]. More complex commercially available software programs have been compared as to their ability to predict CNS log BBB ratio [79]. Experimental and theoretical reasons support the belief that surface tension measurements can be predictors for blood brain permeability [80]. Predictors for absorption, distribution, metabolism and excretion (ADME) currently appear most useful in global models. Limitations in local models likely reflect a lack of quality experimental data sets [81] and user dissatisfaction may result from unrealistic expectations given the magnitude of experimental ADME errors [82]. An additional limitation to schemes for separating CNS from non-CNS compounds is the complexity of the BBB. Compounds with affinity to transporters are exceptions to physicochemically based filters like the rule of 5. This is a problem for the CNS since it is estimated that about 15% of all genes selectively expressed at the BBB encode for transporter proteins and that only about 50% of BBB transporters are currently known [83].

7. Intestinal permeability

PSA in rather simple models is a commonly used parameter to predict intestinal permeability [84]. Its rule-based calculation (TPSA) is very fast and does not require 3D structure [85]. A better prediction of intestinal permeability has been reported when PSA is partitioned into smaller molecularly based components [86]. Using molecular surface properties compounds selected from the World Health Organization's (WHO) list of essential drugs could be classified with 87% accuracy as to permeability and solubility using a six bin scheme similar to that in the FDA biopharmaceutical classification system [87]. Pharmacokinetic parameters including permeability can also be generated for filtering or ligand affinity prediction through the Volsurf software [88]. An analysis of small drug-like molecules suggests a filter of log  D>0 and <3 enhances the probability of good permeability [89]. A collection of 222 commercially available drugs was used to determine the exclusion criteria that differentiate poorly absorbed drugs from well-absorbed drugs. Similar to the rule of 5, MWT<500 and log  P<5 were associated with better absorbed compounds. Exceptions to the MWT criteria were compounds with a sugar moiety, high atomic weight and large cyclic structure [90] suggesting the involvement of absorptive biological transporter systems. Based on the intestinal absorption of 158 drug and drug-like compounds in rats there is a significant relationship between rat intestinal absorption, and by extrapolation human absorption, to drug hydrogen-bond acidity and basicity [91].

8. Aqueous solubility

Poor aqueous solubility is a wide spread problem in combinatorial libraries as opposed to poor intestinal permeability which is much less of a problem. About one-half of poor solubility is due to large size/lipophilicity. Log  P>5 identifies 75% of these compounds. The other 50% of poor aqueous solubility is due to crystal packing considerations for which there is no computational filter [92]. Melting point is an experimental indicator of crystal packing. Aqueous solubility decreases about 10× for each 100 °C rise in melting point and so melting point, if available, is a valuable parameter in solubility prediction [93]. Progress toward a computational melting point is suggested by the 63% success in qualitative ranking of compounds into low-medium and high-solubility bins. Descriptors for hydrophilicity, polarity, partial atom charge and molecular rigidity were found to be positively correlated with melting point whereas non-polar atoms and high flexibility within the molecule were negatively correlated [94].

9. Drug metabolism

Volume of distribution (VD) is a key pharmacokinetic parameter. A low VD of less than 1 l/kg identifies drugs residing in the plasma compartment. A VD greater than 1 l/kg identifies compounds accessing tissue compartments outside the plasma compartment, e.g., many CNS drugs have VD values in the tens or higher. A recently developed computational approach to predict VD for neutral and basic drugs works as well as the in vivo experimental measurement provided that accurate experimental compound log  D and pK a are available. Predictivity is retained if computed log  D and pK a are used but accuracy declines somewhat [95], [96]. Approximately 80% of drugs are oxidized by the cytochrome P450 (CYP) family of enzymes; hence a decision tree for CYP substrate affinity is important. This has been described in that characteristics of CYP substrates, such as lipophilicity, MWT and hydrogen-bonding potential, govern selectivity towards individual CYPs [97].

10. Promiscuous compounds

Compounds with a marked propensity to bind to multiple targets, so-called nuisance compounds, are of little value in drug discovery. Such compounds can be experimentally identified by their binding to fetal calf serum [98]. It has long been known that compounds could be identified as reproducible actives in HTS screens that could not be optimized in chemistry. Such compounds often appear active in multiple screens that have no biological relationship. An analysis of such promiscuous compounds from HTS hits led to the conclusion that colloidal aggregates in the 50–1000 nm size range were responsible. The apparent HTS screen activity was due to a biophysical effect rather than due to a normal ligand receptor affinity and hence the hits were unoptimizable in chemistry [99]. This promiscuous aggregation effect was found among 8 of 15 kinase inhibitors widely used in biology screening [100] emphasizing the importance of exclusionary filters to prevent wasting of biology research time by testing compounds with flawed properties. The aggregation phenomenon has been found among known drugs, albeit only when tested at high non-physiological concentrations and a predictive model was developed [101].

11. Agrochemicals

Filtering has also been applied to agrochemicals. Compared to drugs intended for human use, agrochemicals tend to have fewer hydrogen-bond donors [102]. For agrochemical screening computationally intensive surface area parameters offered no advantage over the rule of 5 [103]. Analogous to drug-likeness, agrochem-likeness for large compound collections has been explored using support vector machines (SVM). In this study SVM performed better than neural networks [104].

References

  • 1.Oprea T.I. Property distribution of drug-related chemical databases. J. Comput.-Aided Mol. Des. 2000;14:251–264. doi: 10.1023/a:1008130001697. [DOI] [PubMed] [Google Scholar]
  • 2.Bradley M.P. An overview of the diversity represented in commercially-available databases. Mol. Divers. 2002;5:175–183. doi: 10.1023/a:1021363906868. [DOI] [PubMed] [Google Scholar]
  • 3.Lipinski C.A. Drug-like properties and the causes of poor solubility and poor permeability. J. Pharmacol. Toxicol. Methods. 2001;44:235–249. doi: 10.1016/s1056-8719(00)00107-6. [DOI] [PubMed] [Google Scholar]
  • 4.Xu J., Stevenson J. Drug-like index: a new approach to measure drug-like compounds and their diversity. J. Chem. Inf. Comput. Sci. 2000;40:1177–1187. doi: 10.1021/ci000026+. [DOI] [PubMed] [Google Scholar]
  • 5.Feher M., Schmidt J.M. Property distributions: differences between drugs natural products, and molecules from combinatorial chemistry. J. Chem. Inf. Comput. Sci. 2003;43:218–227. doi: 10.1021/ci0200467. [DOI] [PubMed] [Google Scholar]
  • 6.Lewell X.Q., Jones A.C., Bruce C.L., Harper G., Jones M.M., McLay I.M., Bradshaw J. Drug rings database with web interface. A tool for identifying alternative chemical rings in lead discovery programs. J. Med. Chem. 2003;46:3257–3274. doi: 10.1021/jm0300429. [DOI] [PubMed] [Google Scholar]
  • 7.Sheridan R.P. The most common chemical replacements in drug-like compounds. J. Chem. Inf. Comput. Sci. 2002;42:103–108. doi: 10.1021/ci0100806. [DOI] [PubMed] [Google Scholar]
  • 8.Bemis G.W., Murcko M.A. The properties of known drugs. 1. Molecular frameworks. J. Med. Chem. 1996;39:2887–2893. doi: 10.1021/jm9602928. [DOI] [PubMed] [Google Scholar]
  • 9.Bemis G.W., Murcko M.A. Properties of known drugs. 2. Side chains. J. Med. Chem. 1999;42:5095–5099. doi: 10.1021/jm9903996. [DOI] [PubMed] [Google Scholar]
  • 10.Oprea T.I., Zamora I., Ungell A.-L. Pharmacokinetically based mapping device for chemical space navigation. J. Comb. Chem. 2002;4:258–266. doi: 10.1021/cc010093w. [DOI] [PubMed] [Google Scholar]
  • 11.Wright T., Gillet V.J., Green D.V.S., Pickett S.D. Optimizing the size and configuration of combinatorial libraries. J. Chem. Inf. Comput. Sci. 2003;43:381–390. doi: 10.1021/ci0255836. [DOI] [PubMed] [Google Scholar]
  • 12.Agrafiotis D.K. Multiobjective optimization of combinatorial libraries. J. Comput.-Aided Mol. Des. 2002;16:335–356. doi: 10.1023/a:1020837112154. [DOI] [PubMed] [Google Scholar]
  • 13.Stahura F.L., Xue L., Godden J.W., Bajorath J. Methods for compound selection focused on hits and application in drug discovery. J. Mol. Graph. Model. 2002;20:439–446. doi: 10.1016/s1093-3263(01)00145-0. [DOI] [PubMed] [Google Scholar]
  • 14.Oprea T.I., Gottfries J., Sherbukhin V., Svensson P., Kuhler T.C. Chemical information management in drug discovery: optimizing the computational and combinatorial chemistry interfaces. J. Mol. Graph. Model. 2000;18:512–524. doi: 10.1016/s1093-3263(00)00066-8. [DOI] [PubMed] [Google Scholar]
  • 15.Vieth M., Siegel M.G., Higgs R.E., Watson I.A., Robertson D.H., Savin K.A., Durst G.L., Hipskind P.A. Characteristic physical properties and structural fragments of marketed oral drugs. J. Med. Chem. 2004;47:224–232. doi: 10.1021/jm030267j. [DOI] [PubMed] [Google Scholar]
  • 16.Wenlock M.C., Austin R.P., Barton P., Davis A.M., Leeson P.D. A comparison of physiochemical property profiles of development and marketed oral drugs. J. Med. Chem. 2003;46:1250–1256. doi: 10.1021/jm021053p. [DOI] [PubMed] [Google Scholar]
  • 17.Blake J.F. Examination of the computed molecular properties of compounds selected for clinical development. BioTechniques. 2003;Suppl.:16–20. [PubMed] [Google Scholar]
  • 18.Askenash J. The rule of five revisited. Preclinica. 2004;2:92. [Google Scholar]
  • 19.Tronde A., Norden B., Marchner H., Wendel A.-K., Lennernaes H., Bengtsson U.H. Pulmonary absorption rate and bioavailability of drugs in vivo in rats: structure-absorption relationships and physicochemical profiling of inhaled drugs. J. Pharm. Sci. 2003;92:1216–1233. doi: 10.1002/jps.10386. [DOI] [PubMed] [Google Scholar]
  • 20.Aptula A.O., Kuehne R., Ebert R., Cronin M.T.D., Netzeva T.I., Schueuermann G. Modeling discrimination between antibacterial and non-antibacterial activity based on 3D molecular descriptors. QSAR Comb. Sci. 2003;22:113–128. [Google Scholar]
  • 21.Mitchell T., Showell G.A. Design strategies for building drug-like chemical libraries. Curr. Opin. Drug Discov. Dev. 2001;4:314–318. [PubMed] [Google Scholar]
  • 22.Polinsky A., Shaw G.B. High-speed chemistry libraries: assessment of drug-likeness. In: Wermuth C.G., editor. Practice of Medicinal Chemistry. 2nd edition. Elsevier; London, UK: 2003. pp. 147–157. [Google Scholar]
  • 23.Matter H., Baringhaus K.-H., Naumann T., Klabunde T., Pirard B. Computational approaches towards the rational design of drug-like compound libraries. Comb. Chem. High Throughput Screening. 2001;4:453–475. doi: 10.2174/1386207013330896. [DOI] [PubMed] [Google Scholar]
  • 24.Lipinski C.A., Lombardo F., Dominy B.W., Feeney P.J. Experimental and computational approaches to estimate solubility and permeability in drug discovery and development settings. Adv. Drug Deliv. Rev. 1997;23:3–25. doi: 10.1016/s0169-409x(00)00129-0. [DOI] [PubMed] [Google Scholar]
  • 25.Walters W.P., Ajay, Murcko M.A. Recognizing molecules with drug-like properties. Curr. Opin. Chem. Biol. 1999;3:384–387. doi: 10.1016/s1367-5931(99)80058-1. [DOI] [PubMed] [Google Scholar]
  • 26.Muller G., Giera H. Protein secondary structure templates derived from bioactive natural products. J. Comput.-Aided Mol. Des. 1998;12:1–6. doi: 10.1023/a:1007954801605. [DOI] [PubMed] [Google Scholar]
  • 27.Lewell X.Q., Judd D., Watson S., Hann M. RECAP – Retrosynthetic combinatorial analysis procedure: a powerful new technique for identifying privileged molecular fragments with useful applications in combinatorial chemistry. J. Chem. Inf. Comput. Sci. 1998;38:511–522. doi: 10.1021/ci970429i. [DOI] [PubMed] [Google Scholar]
  • 28.Newman D.J., Cragg G.M., Snader K.M. Natural products as sources of new drugs over the period 1981–2002. J. Nat. Prod. 2003;66:1022–1037. doi: 10.1021/np030096l. [DOI] [PubMed] [Google Scholar]
  • 29.Rishton G.M. Reactive compounds and in vitro false positives in HTS. Drug Discov. Today. 1997;2:382–384. [Google Scholar]
  • 30.Hann M., Hudson B., Lewell X., Lifely R., Miller L., Ramsden N. Strategic pooling of compounds for high-throughput screening. J. Chem. Inf. Comput. Sci. 1999;39:897–902. doi: 10.1021/ci990423o. [DOI] [PubMed] [Google Scholar]
  • 31.Baurin N., Baker R., Richardson C., Chen I., Foloppe N., Potter A., Jordan A., Roughley S., Parratt M., Greaney P., Morley D., Hubbard R.E. Drug-like annotation and duplicate analysis of a 23-supplier chemical database totaling 2.7 million compounds. J. Chem. Inf. Comput. Sci. 2004;44:643–651. doi: 10.1021/ci034260m. [DOI] [PubMed] [Google Scholar]
  • 32.S. Sirois, D.-Q. Wei, Q. Du and K.-C. Chou, Virtual screening for SARS-CoV protease based on KZ7088 pharmacophore points, J. Chem. Inf. Comput. Sci., 2004, 44, 1111–1122. [DOI] [PubMed]
  • 33.http://nihroadmap.nih.gov/molecularlibraries/index.asp.
  • 34.Muegge I., Heald S.L., Brittelli D. Simple selection criteria for drug-like chemical matter. J. Med. Chem. 2001;44:1841–1846. doi: 10.1021/jm015507e. [DOI] [PubMed] [Google Scholar]
  • 35.Patchett A.A., Nargund R.P. Chapter 26. Privileged structures – an update. Annu. Rep. Med. Chem. 2000;35:289–298. [Google Scholar]
  • 36.Guo T., Hobbs D.W. Privileged structure-based combinatorial libraries targeting G protein-coupled receptors. Assay Drug Dev. Technol. 2003;1:579–592. doi: 10.1089/154065803322302835. [DOI] [PubMed] [Google Scholar]
  • 37.Horton D.A., Bourne G.T., Smythe M.L. The combinatorial synthesis of bicyclic privileged structures or privileged substructures. Chem. Rev. 2003;103:893–930. doi: 10.1021/cr020033s. [DOI] [PubMed] [Google Scholar]
  • 38.Horton D.A., Bourne G.T., Smythe M.L. Exploring privileged structures: the combinatorial synthesis of cyclic peptides. Mol. Divers. 2002;5:289–304. doi: 10.1023/a:1021365402751. [DOI] [PubMed] [Google Scholar]
  • 39.Bondensgaard K., Ankersen M., Thogersen H., Hansen B.S., Wulff B.S., Bywater R.P. Recognition of privileged structures by G-protein coupled receptors. J. Med. Chem. 2004;47:888–899. doi: 10.1021/jm0309452. [DOI] [PubMed] [Google Scholar]
  • 40.Roberts B.R. Target information in lead discovery. Targets. 2003;2:14–18. [Google Scholar]
  • 41.Schuffenhauer A., Floersheim P., Acklin P., Jacoby E. Similarity metrics for ligands reflecting the similarity of the target proteins. J. Chem. Inf. Comput. Sci. 2003;43:391–405. doi: 10.1021/ci025569t. [DOI] [PubMed] [Google Scholar]
  • 42.Hajduk P.J., Bures M., Praestgaard J., Fesik S.W. Privileged molecules for protein binding identified from NMR-based screening. J. Med. Chem. 2000;43:3443–3447. doi: 10.1021/jm000164q. [DOI] [PubMed] [Google Scholar]
  • 43.Muller G. Medicinal chemistry of target family-directed masterkeys. Drug Discov. Today. 2003;8:681–691. doi: 10.1016/s1359-6446(03)02781-8. [DOI] [PubMed] [Google Scholar]
  • 44.Rejto P.A., Verkhivker G.M. Unraveling principles of lead discovery: from unfrustrated energy landscapes to novel molecular anchors. Proc. Natl Acad. Sci. USA. 1996;93:8945–8950. doi: 10.1073/pnas.93.17.8945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.P. A. Rejto and G. M. Verkhivker, Molecular anchors with large stability gaps ensure linear binding free energy relationships for hydrophobic substituents. In Pacific Symposium on Biocomputing'98 (ed. R. B. Altman), Maui, Hawaii, January 4–9, 1998, pp. 362–373. [PubMed]
  • 46.Teague S.J. Implications of protein flexibility for drug discovery. Nat. Rev. Drug Discov. 2003;2:527–541. doi: 10.1038/nrd1129. [DOI] [PubMed] [Google Scholar]
  • 47.Goodnow R. Small molecule lead generation processes for drug discovery. Drug Future. 2002;27:1165–1180. [Google Scholar]
  • 48.Burke M.D., Schreiber S.L. A planning strategy for diversity-oriented synthesis. Angew. Chem. Int. Ed. 2004;43:46–58. doi: 10.1002/anie.200300626. [DOI] [PubMed] [Google Scholar]
  • 49.Breinbauer R., Vetter I.R., Waldmann H. From protein domains to drug candidates: natural products as guiding principles in the design and synthesis of compound libraries. Angew. Chem. Int. Ed. 2002;41:2878–2890. doi: 10.1002/1521-3773(20020816)41:16<2878::AID-ANIE2878>3.0.CO;2-B. [DOI] [PubMed] [Google Scholar]
  • 50.Hulme C., Gore V. Multi-component reactions: emerging chemistry in drug discovery from xylocain to crixivan. Curr. Med. Chem. 2003;10:51–80. doi: 10.2174/0929867033368600. [DOI] [PubMed] [Google Scholar]
  • 51.Oprea T.I., Davis A.M., Teague S.J., Leeson P.D. Is there a difference between leads and drugs? A historical perspective. J. Chem. Inf. Comput. Sci. 2001;41:1308–1315. doi: 10.1021/ci010366a. [DOI] [PubMed] [Google Scholar]
  • 52.Oprea T.I. Current trends in lead discovery: are we looking for the appropriate properties? Mol. Divers. 2002;5:199–208. doi: 10.1023/a:1021368007777. [DOI] [PubMed] [Google Scholar]
  • 53.Proudfoot J.R. Drugs, leads, and drug-likeness: an analysis of some recently launched drugs. Bioorg. Med. Chem. Lett. 2002;12:1647–1650. doi: 10.1016/s0960-894x(02)00244-5. [DOI] [PubMed] [Google Scholar]
  • 54.Lipinski C.A. Compound properties and drug quality. In: Wermuth C.G., editor. Practice of Medicinal Chemistry. 2nd edition. 2003. pp. 341–349. [Google Scholar]
  • 55.Murray C.W., Verdonk M.L. The consequences of translational and rotational entropy lost by small molecules on binding to proteins. J. Comput.-Aided Mol. Des. 2002;16:741–753. doi: 10.1023/a:1022446720849. [DOI] [PubMed] [Google Scholar]
  • 56.Kolb H.C., Sharpless K.B. The growing impact of click chemistry on drug discovery. Drug Discov. Today. 2003;8:1128–1137. doi: 10.1016/s1359-6446(03)02933-7. [DOI] [PubMed] [Google Scholar]
  • 57.Congreve M., Carr R., Murray C., Jhoti H. A ’rule of three’ for fragment-based lead discovery? Drug Discov. Today. 2003;8:876–877. doi: 10.1016/s1359-6446(03)02831-9. [DOI] [PubMed] [Google Scholar]
  • 58.Jacoby E., Davies J., Blommers M.J.J. Design of small molecule libraries for NMR screening and other applications in drug discovery. Curr. Top. Med. Chem. 2003;3:11–23. doi: 10.2174/1568026033392606. [DOI] [PubMed] [Google Scholar]
  • 59.Lepre C.A., Peng J., Fejzo J., Abdul-Manan N., Pocas J., Jacobs M., Xie X., Moore J.M. Applications of SHAPES screening in drug discovery. Comb. Chem. High Throughput Screen. 2002;5:583–590. doi: 10.2174/1386207023329950. [DOI] [PubMed] [Google Scholar]
  • 60.Lepre C.A. Strategies for NMR Screening and Library Design. Methods and Principles in Medicinal Chemistry. Vol. 16. 2003. (BioNMR in Drug Research). pp. 391–415. [Google Scholar]
  • 61.Carr R., Jhoti H. Structure-based screening of low-affinity compounds. Drug Discov. Today. 2002;7:522–527. doi: 10.1016/s1359-6446(02)02245-6. [DOI] [PubMed] [Google Scholar]
  • 62.Davis A.M., Teague S.J., Kleywegt G.J. Application and limitations of X-ray crystallographic data in structure-based ligand and drug design. Angew. Chem. Int. Ed. 2003;42:2718–2736. doi: 10.1002/anie.200200539. [DOI] [PubMed] [Google Scholar]
  • 63.Fattori D. Molecular recognition: the fragment approach in lead generation. Drug Discov. Today. 2004;9:229–238. doi: 10.1016/S1359-6446(03)03007-1. [DOI] [PubMed] [Google Scholar]
  • 64.Charifson P.S., Walters W.P. Filtering databases and chemical libraries. Mol. Divers. 2002;5:185–197. doi: 10.1023/a:1021316023707. [DOI] [PubMed] [Google Scholar]
  • 65.Gillet V.J., Willett P. Computational Methods for the Analysis of Molecular Diversity. Pharmacochemistry Library. Vol. 32. 2002. (Trends in Drug Research III). pp. 125–133. [Google Scholar]
  • 66.Kubinyi H. Opinion: drug research: myths, hype and reality. Nat. Rev. Drug Discov. 2003;2:665–668. doi: 10.1038/nrd1156. [DOI] [PubMed] [Google Scholar]
  • 67.Veber D.F., Johnson S.R., Cheng H.-Y., Smith B.R., Ward K.W., Kopple K.D. Molecular properties that influence the oral bioavailability of drug candidates. J. Med. Chem. 2002;45:2615–2623. doi: 10.1021/jm020017n. [DOI] [PubMed] [Google Scholar]
  • 68.Andrews P.R., Craik D.J., Martin J.L. Functional group contributions to drug-receptor interactions. J. Med. Chem. 1984;27:1648–1657. doi: 10.1021/jm00378a021. [DOI] [PubMed] [Google Scholar]
  • 69.Oprea T.I. Virtual screening in lead discovery: a viewpoint. Molecules. 2002;7:51–62. [Google Scholar]
  • 70.Sauerberg P., Bury P.S., Mogensen J.P., Deussen H.-J., Pettersson I., Fleckner J., Nehlin J., Frederiksen K.S., Albrektsen T., Din N., Svensson L.A., Ynddal L., Wulff E.M., Jeppesen L. Large dimeric ligands with favorable pharmacokinetic properties and peroxisome proliferator-activated receptor agonist activity in vitro and in vivo. J. Med. Chem. 2003;46:4883–4894. doi: 10.1021/jm0309046. [DOI] [PubMed] [Google Scholar]
  • 71.Horspool K.R., Lipinski C.A. Advancing new drug delivery concepts to gain the lead. Drug Deliv. Technol. 2003;3:34–46. [Google Scholar]
  • 72.Zambrowicz B.P., Sands A.T. Knockouts model the 100 best-selling drugs – will they model the next 100? Nat. Rev. Drug Discov. 2003;2:38–51. doi: 10.1038/nrd987. [DOI] [PubMed] [Google Scholar]
  • 73.Hopkins A.L., Groom C.R. Opinion: the druggable genome. Nat. Rev. Drug Discov. 2002;1:727–730. doi: 10.1038/nrd892. [DOI] [PubMed] [Google Scholar]
  • 74.Adenot M., Lahana R.S., Nimes F. Blood–brain barrier permeation models: discriminating between potential CNS and non-CNS drugs including P-glycoprotein substrates. J. Chem. Inf. Comput. Sci. 2004;44:239–248. doi: 10.1021/ci034205d. [DOI] [PubMed] [Google Scholar]
  • 75.Doan K.M.M., Humphreys J.E., Webster L.O., Wring S.A., Shampine L.J., Serabjit-Singh C.J., Adkison K.K., Polli J.W. Passive permeability and P-glycoprotein-mediated efflux differentiate central nervous system (CNS) and non-CNS marketed drugs. J. Pharmacol. Exp. Ther. 2002;303:1029–1037. doi: 10.1124/jpet.102.039255. [DOI] [PubMed] [Google Scholar]
  • 76.Faassen F., Vogel G., Spanings H., Vromans H. Caco-2 permeability, P-glycoprotein transport ratios and brain penetration of heterocyclic drugs. Int. J. Pharm. 2003;263:113–122. doi: 10.1016/s0378-5173(03)00372-7. [DOI] [PubMed] [Google Scholar]
  • 77.Kelder J., Grootenhuis P.D.J., Bayada D.M., Delbressine L.P.C., Ploemen J.-P. Polar molecular surface as a dominating determinant for oral absorption and brain penetration of drugs. Pharm. Res. 1999;16:1514–1519. doi: 10.1023/a:1015040217741. [DOI] [PubMed] [Google Scholar]
  • 78.Norinder U., Haeberlein M. Computational approaches to the prediction of the blood–brain distribution. Adv. Drug Deliv. Rev. 2002;54:291–313. doi: 10.1016/s0169-409x(02)00005-4. [DOI] [PubMed] [Google Scholar]
  • 79.Lobell M., Molnar L., Keseru G.M. Recent advances in the prediction of blood–brain partitioning from molecular structure. J. Pharm. Sci. 2003;92:360–370. doi: 10.1002/jps.10282. [DOI] [PubMed] [Google Scholar]
  • 80.Suomalainen P., Johans C., Soederlund T., Kinnunen P.K.J. Surface activity profiling of drugs applied to the prediction of blood–brain barrier permeability. J. Med. Chem. 2004;47:1783–1788. doi: 10.1021/jm0309001. [DOI] [PubMed] [Google Scholar]
  • 81.Lombardo F., Gifford E., Shalaeva M.Y. In silico ADME prediction: data, models, facts and myths. Mini-Rev. Med. Chem. 2003;3:861–875. doi: 10.2174/1389557033487629. [DOI] [PubMed] [Google Scholar]
  • 82.Stouch T.R., Kenyon J.R., Johnson S.R., Chen X.-Q., Doweyko A., Li Y. In silico ADME/Tox: why models fail. J. Comput.-Aided Mol. Des. 2003;17:83–92. doi: 10.1023/a:1025358319677. [DOI] [PubMed] [Google Scholar]
  • 83.Pardridge W.M. Blood–brain barrier genomics and the use of endogenous transporters to cause drug penetration into the brain. Curr. Opin. Drug Discov. Dev. 2003;6:683–691. [PubMed] [Google Scholar]
  • 84.Stenberg P., Norinder U., Luthman K., Artursson P. Experimental and computational screening models for the prediction of intestinal drug absorption. J. Med. Chem. 2001;44:1927–1937. doi: 10.1021/jm001101a. [DOI] [PubMed] [Google Scholar]
  • 85.Ertl P., Rohde B., Selzer P. Fast calculation of molecular polar surface area as a sum of fragment-based contributions and its application to the prediction of drug transport properties. J. Med. Chem. 2000;43:3714–3717. doi: 10.1021/jm000942e. [DOI] [PubMed] [Google Scholar]
  • 86.Artursson P., Bergstroem C.A.S. Intestinal Absorption: the Role of Polar Surface Area. Methods and Principles in Medicinal Chemistry. Vol. 18. 2003. (Drug Bioavailabilty). pp. 341–357. [Google Scholar]
  • 87.Bergstroem C.A.S., Strafford M., Lazorova L., Avdeef A., Luthman K., Artursson P. Absorption classification of oral drugs based on molecular surface properties. J. Med. Chem. 2003;46:558–570. doi: 10.1021/jm020986i. [DOI] [PubMed] [Google Scholar]
  • 88.Zamora I., Oprea T., Cruciani G., Pastor M., Ungell A.-L. Surface descriptors for protein–ligand affinity prediction. J. Med. Chem. 2003;46:25–33. doi: 10.1021/jm011051p. [DOI] [PubMed] [Google Scholar]
  • 89.Fichert T., Yazdanian M., Proudfoot J.R. A structure-permeability study of small drug-like molecules. Bioorg. Med. Chem. Lett. 2003;13:719–722. doi: 10.1016/s0960-894x(02)01035-1. [DOI] [PubMed] [Google Scholar]
  • 90.Sakaeda T., Okamura N., Nagata S., Yagami T., Horinouchi M., Okumura K., Yamashita F., Hashida M. Molecular and pharmacokinetic properties of 222 commercially available oral drugs in humans. Biol. Pharm. Bull. 2001;24:935–940. doi: 10.1248/bpb.24.935. [DOI] [PubMed] [Google Scholar]
  • 91.Zhao Y.H., Abraham M.H., Hersey A., Luscombe C.N. Quantitative relationship between rat intestinal absorption and Abraham descriptors. Eur. J. Med. Chem. 2003;38:939–947. doi: 10.1016/j.ejmech.2003.07.005. [DOI] [PubMed] [Google Scholar]
  • 92.Lipinski C.A. Integration of physicochemical property considerations into the design of combinatorial libraries. Pharm. News. 2002;9:195–202. [Google Scholar]
  • 93.Ran Y., Yalkowsky S.H. Prediction of drug solubility by the general solubility equation (GSE) J. Chem. Inf. Comput. Sci. 2001;41:354–357. doi: 10.1021/ci000338c. [DOI] [PubMed] [Google Scholar]
  • 94.Bergstroem C.A.S., Norinder U., Luthman K., Artursson P. Molecular descriptors influencing melting point and their role in classification of solid drugs. J. Chem. Inf. Comput. Sci. 2003;43:1177–1185. doi: 10.1021/ci020280x. [DOI] [PubMed] [Google Scholar]
  • 95.Lombardo F., Obach R.S., Shalaeva M.Y., Gao F. Prediction of human volume of distribution values for neutral and basic drugs. 2. Extended data set and leave-class-out statistics. J. Med. Chem. 2004;47:1242–1250. doi: 10.1021/jm030408h. [DOI] [PubMed] [Google Scholar]
  • 96.Lombardo F., Obach R.S., Shalaeva M.Y., Gao F. Prediction of volume of distribution values in humans for neutral and basic drugs using physicochemical measurements and plasma protein binding data. J. Med. Chem. 2002;45:2867–2876. doi: 10.1021/jm0200409. [DOI] [PubMed] [Google Scholar]
  • 97.Lewis D.F.V., Dickins M. Substrate SARs in human P450s. Drug Discov. Today. 2002;7:918–925. doi: 10.1016/s1359-6446(02)02412-1. [DOI] [PubMed] [Google Scholar]
  • 98.K. M. Comess, M. J. Voorbach, M. L. Coen, H. Tang, L. Gao, X. Cheng, M. E. Schurdak, B. A. Beutel and D. J. Burns, Affinity-based high-throughput screening of orphan targets: practical solutions for removing promiscuous binders, Abstracts of Papers, 224th ACS National Meeting, Boston, MA, United States, August 18–22, 2002, COMP-220.
  • 99.McGovern S.L., Helfand B.T., Feng B., Shoichet B.K. A specific mechanism of nonspecific inhibition. J. Med. Chem. 2003;46:4265–4272. doi: 10.1021/jm030266r. [DOI] [PubMed] [Google Scholar]
  • 100.McGovern S.L., Shoichet B.K. Kinase inhibitors: not just for kinases anymore. J. Med. Chem. 2003;46:1478–1483. doi: 10.1021/jm020427b. [DOI] [PubMed] [Google Scholar]
  • 101.Seidler J., McGovern S.L., Doman T.N., Shoichet B.K. Identification and prediction of promiscuous aggregating inhibitors among known drugs. J. Med. Chem. 2003;46:4477–4486. doi: 10.1021/jm030191r. [DOI] [PubMed] [Google Scholar]
  • 102.Tice C.M. Selecting the right compounds for screening: does Lipinski's rule of 5 for pharmaceuticals apply to agrochemicals? Pest Manage. Sci. 2001;57:3–16. doi: 10.1002/1526-4998(200101)57:1<3::AID-PS269>3.0.CO;2-6. [DOI] [PubMed] [Google Scholar]
  • 103.Tice C.M. Selecting the right compounds for screening: use of surface-area parameters. Pest Manage. Sci. 2002;58:219–233. doi: 10.1002/ps.441. [DOI] [PubMed] [Google Scholar]
  • 104.Zernov V.V., Balakin K.V., Ivaschenko A.A., Savchuk N.P., Pletnev I.V. Drug discovery using support vector machines. The case studies of drug-likeness, agrochemical-likeness, and enzyme inhibition predictions. J. Chem. Inf. Comput. Sci. 2003;43:2048–2056. doi: 10.1021/ci0340916. [DOI] [PubMed] [Google Scholar]

Articles from Annual Reports in Computational Chemistry are provided here courtesy of Elsevier

RESOURCES