Abstract
Fragment-based drug design (FBDD) has become an effective methodology for drug development for decades. Successful applications of this strategy brought both opportunities and challenges to the field of Pharmaceutical Science. Recent progress in the computational fragment-based drug design provide an additional approach for future research in a time- and labor-efficient manner. Combining multiple in silico methodologies, computational FBDD possesses flexibilities on fragment library selection, protein model generation, and fragments/compounds docking mode prediction. These characteristics provide computational FBDD superiority in designing novel and potential compounds for a certain target. The purpose of this review is to discuss the latest advances, ranging from commonly used strategies to novel concepts and technologies in computational fragment-based drug design. Particularly, in this review, specifications and advantages are compared between experimental and computational FBDD, and additionally, limitations and future prospective are discussed and emphasized.
Keywords: fragment-based drug design, fragment database, drug discovery, fragment docking, virtual screening
INTRODUCTION
The drug discovery processes currently rely heavily on large-scale screening between compound libraries and different biological targets (1). The high-throughput screening (HTS) technology underwent dramatic improvements within recent years. Besides successful applications of HTS for drug development, limitations do exist with this approach. The HTS can suffer from (1) a low hit rate and (2) a limited coverage of drug-like chemical space. Experimental fragment-based drug design and fragment-based screening (FBS) become complementary choices for lead identification and drug development (2). It is generally believed that a chemical space of 109 diversified molecules can be sampled with 103 fragments (3). The coverage of chemical space can be improved with fragment-based drug design. Meanwhile, it is easier for conducting chemical modification with fragment hits. The selected fragments in a library usually follow the criteria of “rule of three” (4) (M.W. ≤300, hydrogen bond donor and acceptor ≤ 3, CLogP ≤3, NROT ≤3, and PSA ≤60). Questions commonly asked towards the application of rule of three include (1) whether we should use three-dimensional fragments instead of flat structures, (2) whether or not there are significant differences in fragment-binding sites among different protein-binding sites, (3) what are the specific criteria for determining the fragment hits given the low affinity of fragment-protein interaction, and (4) what are strategies and approaches to select and develop fragment hits fulfilled the rule of three into leads. Although discussions and argues do exist about the application of these rules (5,6), “rule of three” remains the preferred reference for fragments selection. Rule of three assists the selection of hits with the suitable size and physical-chemical properties and limits the complexity of molecules in the fragment libraries.
Experimental fragment-based drug design and FBS provided fruitful results in the past decades. Abbott, one example of a pharmaceutical company, and SGX Pharmaceuticals, one example of a biotechnology company, provided successful practices in developing fragments into clinical candidates (7). Other examples include fragment-based screening by X-ray crystallography to β secretase (8), development of FtsZ inhibitor with anti-staphylococcal activity (9), and the discovery of phosphodiester-ase inhibitors through crystallography and scaffold-based drug design (10). However, challenges and limitations for current experimental fragment-based drug design can inevitably restrict the application of this approach (3,11). First, large-scale pure, high-quality, and stable target proteins are required for the screening. The crystallization of certain kinds of proteins, like GPCRs, is challenging. The processes can be expensive and time-consuming. Also, the approach can be difficult when applied to targets without a known structure. Second, given the relatively small sizes of fragments, limited receptor-ligand interactions can be formed between an individual fragment and the surrounding residues and only a part of them is strong enough for detection. Only soluble fragments with relatively high solubility can be suitable for fragments screening. This requirement significantly narrows down appropriate fragments.
Given the restrictions of experimental fragment-based drug design, computational approaches can provide prospective solutions to the drug discovery processes in the future. With the dramatic progress in the field of computation technologies, calculation power has undergone an exponential increase during the last decades (12). Computational fragment-based drug design becomes an efficient and time- money-, and labor-saving approach to facilitate the traditional fragment-based drug design processes. Computational methods can easily establish an integrated fragment library either by filtering existing libraries with certain criteria or by breaking down a variety of compounds (13,14). With attention to the physical-chemical properties and chemical synthetic feasibility, only appropriate and desirable fragments can be included. The diversified fragment libraries, with an emphasis on both quantity and quality generated through computational methods, have significant advantages when compared with traditional fragment libraries. Computational fragment-based drug design can use molecular docking technology for performing the virtual screening. Molecular docking puts a small molecule into a binding pocket of a macromolecule, and the corresponding algorithm will evaluate the receptor-ligand affinities and interactions (15). There is a range of docking algorithms currently available for different types of works (15–18). Examples of current widely used docking programs and scoring functions are Glide (19), GOLD (20,21), Surflex-Dock (22) etc. The fragment-involved docking studies draw increasing attention since the continued developments in ligand efficiency matrices become available. The virtual screening can evaluate a massive amount of fragment-receptor interactions in a time-efficient manner, regardless of the solubility of an individual fragment. In case no crystal structure for the target is available, homology modeling can be used for the model generation (23). Homology modeling can build a target macromolecule based on the structure of the template protein. The template can be determined by the sequence and biological homology (24). Examples of commonly used programs are Modeler (25) and SWISS-MODEL (26). Moreover, computational fragment-based drug design can flexibly adopt different strategies (commonly used strategies will be discussed in the following sections) and introduce various linkers with self-defined criteria and rules, which is critical for achieving diversified results (27). As a fast-growing area of research, increasing attention is focused on the fragment-based drug design. There is a clear trend that the number of publications with this concept has increased dramatically during the last several decades when compared to the total number of publications on general drug design (Fig. 1). The number of publications on FBDD increased roughly 8 times from ~ 35 in 2001 to ~ 280 in 2017, while the total number of publications on general drug design increased around 2.3 times from ~ 3500 in 2001 to ~ 8000 in 2017.
Fig. 1.

The number of publications each year from 2001 to 2017 that include the concepts of “fragment-based drug design” (blue bars) and simply “drug design” (orange dots and line) through the literature search on SciFinder (scifinder.cas.org)
COMMON STRATEGIES IN FRAGMENT-BASED DRUG DESIGN
Fragment binding can reveal hot spots on proteins, which can result in high-affinity receptor-ligand interactions. The fragment-based approach is especially suitable for detecting the interaction spots among the protein-ligand binding, as fragments can interact with a certain region of the target protein. Once the identification of fragments inside the binding pocket is done, these fragments can be grown, linked, or merged to develop the potential ligands (Fig. 2).
Fig. 2.

Commonly used strategies for fragment-based drug design. a Growing. After identification of one suitable fragment inside the binding pocket, substitutions can be added to the identified fragment. The growing process will increase the lead likability of the original fragment to enhance the receptor-ligand interactions. b Linking. Multiple fragments can be identified for one binding pocket simultaneously targeting at different regions. Linkers will be introduced to connect separated fragments to increase the lead likability and to create a novel compound with the potential affinity towards the pocket. c Merging. A known lead can partially occupy the binding pocket. One or more fragment(s) can be identified to be suitable for the remaining space. One or more linker(s) can be introduced to connect the known lead and the fragment(s) to increase the strength of receptor-ligand interactions
Fragment Growing
With a given binding pocket, one suitable fragment can be identified to have preferable interactions with a certain sub-region inside the binding pocket. Multiple substitutions can be added to the identified fragment to grow it up. The combination of substitutions and the identified fragment usually follows the chemical combinatorial rules to guarantee the generation of synthetic feasible compounds. The determination of candidate substitutions can be adjusted according to (1) the shape, size, and physical-chemical properties of the remaining part of the binding pocket or (2) the pharmacophore features revealed from reported/known active compounds. The growing process will generally increase the druglikeness of the original fragment. The newly generated compounds are supposed to have enhanced receptor-ligand interactions.
Fragment Linking
Multiple fragments can be identified to occupy different sub-regions simultaneously inside a given binding pocket for a certain target. Different sub-regions may have distinctive structural features. The arrangements of several residues can create hydrophobic or hydrophilic environments among different sub-regions. Favored fragments among these sub-regions can have various physical-chemical properties. Multiple fragments can be identified to interact with the given binding pocket, with each fragment occupying a certain sub-region. Fragment linkers can be introduced to connect separate fragments to increase the druglikeness. The connection of multiple fragments usually follows the synthetic rules to generate synthetically reasonable novel compounds. The newly generated compounds are supposed to have both good affinity and selectivity to the binding pocket.
Fragment Merging
Usually, there are reported/known lead compounds for a given target. The known compounds may function as probes, which are sufficient for the fundamental studies with different purposes. The known lead may occupy a big portion of the binding pocket, and the remaining space can be suitable for fragment binding. Similar to fragment linking, linkers could be introduced to connect the known lead and the suitable fragment(s). The suitable fragments are selected with a consideration of (1) achieving preferable physical-chemical properties of the final product, (2) forming relatively strong receptor-ligand interactions, and (3) being synthetically reasonable for adding the linker(s). Fragment merging is not only used in novel compound design but also remains a tool for chemical modification and derivative generation.
Overview of Computational Fragment-Based Drug Design
The computational fragment-based drug design is a fast-growing area of research with new concepts and ideas coming through. The detailed discussion on the latest technologies and approaches will be focused on the following sections. The basic ideas remain the same. The flowchart for a typical computational fragment-based drug design involves five major steps (Fig. 3). First, the establishment of the diversified fragment library. Considering the advancement in computational power, a very large number of fragments can be screened in each experiment. This guarantees a good degree of fragment diversity and increases the statistical probability of finding suitable fragments for a given pocket. Second, virtual screening of the fragment library in order to find suitable fragments. Fragments are relatively small compounds, which will not likely give appealing binding energies or docking scores during the docking process. However, by analysis of the potential receptor-ligand interactions, like H-bond or hydrophobic interactions, suitable fragments can be identified. Third, design the lead compounds based on identified fragments. As mentioned above, fragment growing, fragment linking, and lead-fragments merging are strategies that can be used in this step. Fourth, the verification of generated novel leads with corresponding biological assays. Computational validation methods will be applied at the beginning to filter out false positives or false negatives. Corresponding biological assays will be followed to formally verify the effectiveness of these lead compounds. Fifth, the confirmation of binding. The last step mainly contributes to the mechanism understanding. After the growing, linking, or merging steps, fragments inside a compound may not recur at the identical places from the virtual screening. Confirming the binding mode of the compounds with X-ray crystallography or NMR will reveal receptor-ligand interactions and provide insight into the mechanism of receptor activation or inhibition. The recent advancement in FBDD approaches is reviewed below.
Fig. 3.

Flowchart of computational fragment-based drug design. Five major steps are considered: First, the establishment of the diversified fragment library; second, the virtual screening for the given target; third, the fragments processing for the lead generation; fourth, the hits validation with biological assays; and fifth, the confirmation of binding modes
Fragment-Based Molecular Evolutionary Approach
Kentaro Kawai, Naoya Nagata, and Yoshimasa Takahashi described a fragment-based similarity-driven molecular evolutionary approach for designing candidates of new drugs in 2013 (28). This approach emphasizes the generation of candidates with differences on both side chains and the scaffolds when compared with the reference molecule.
The approach can be generally summarized in six steps:
An active molecule will be inputted as a reference structure.
Seed fragments will be created through the fragmentation of the reference molecule.
Initial structures will be generated through the combination of seed fragments and fragments in the library.
Offspring for the next generation will be constructed by mutation and crossover.
Fitness will be evaluated through taminoto coefficient, and the survival will be selected through a tournament method.
Steps 4 and 5 will be repeated to allow the alternative of generation reaching a specified number.
All the fragments in the fragment library are classified into three groups: rings, linkers, and side chains. Connection rules are applied for the fragment combination. Ring-ring connections, ring-linker connections, and ring-side chain connections are three allowable connections. While linker-liner connections, side chain-side chain connections, and linker-side chain connections are three unallowable connections. In mutation, a random selection will be performed to choose a parent molecule. One of the three operations: (1) add a fragment, (2) remove a fragment, or (3) replace a fragment will be performed with a selected parent molecule. For crossover, two offspring molecules will be created through the exchange of the fragment sets from the parent molecules. The crossover point is randomly selected from each parent molecule. In this approach, chemical structures are described through the topological-fragment-descriptor (TFD). The TFD method describes each chemical structure as a multidimensional, numerical pattern vector. The Tanimoto coefficient will be used to determine the similarity between the results and the reference molecule as a fitness function. The tournament method will compare the fitness value between two individuals from the parent set and offspring set and will select the individual with the higher fitness value. Half of the population will be selected eventually.
Molecular evolution experiments on two different targets, hAA2A and r5HT1A, were conducted by Kawai and the group to verify the feasibility of the described approach (28). For ligand design on hAA2A (Fig. 4a), seed fragments were created from the fragmentation of reference molecule 1. One hundred molecules were designed in silico using the seed fragments following the evolutionary approach. Many of these compounds remained the same non-terminal vertex graph (a graph that has no terminal vertex and no isolated vertex) as 1, while molecule 5 did not. Notably, 5 had the same non-terminal vertex graph scaffold with a known hAA2A ligand 7. The same process was followed for ligand design on r5HT1A. There was a comparison between the reference molecule, designed molecule, and active molecule (Fig. 4b). Molecules 8 and 10 were designed when 2 was used as the reference. Known r5HT1A active compounds 9 and 11 were identified to have the same non-terminal vertex graph scaffolds as 8 and 10, respectively. Molecule 13 was designed using 12 as the reference. 13 exactly matched the known r5HT1A active compound 14. In all, new potential molecules can be designed and developed to have similar but different scaffolds towards the references.
Fig. 4.

Application of fragment-based molecular evolutionary approach on hAA2A and r5HT1A. a Molecule 5 was designed using 1 as the reference. 7 was identified as a known active hAA2A compound to share the same scaffold with 5. b Comparisons of scaffolds and structures between designed compounds, reference molecules, and identified known active molecules
SILCS Computational Functional Group Mapping Approach
Olgun Guvench reported a computational functional group mapping (cFGM) technology with all-atom explicit-solvent molecular dynamic simulation (MD) for drug discovery in 2016 (29). SILCS represents Site-Identification by Ligand Competitive Saturation and is included in cFGM involving all-atom explicit-solvent MD. This approach is supposed to provide the information about the selection of functional groups for affinity, specificity, and pharmacokinetic properties, which are critical for not only optimizing known candidate molecules but also for designing novel compounds.
Experimental FGM involves synthesis and testing of a series of small, simple fragments to the given binding pocket. Fragments can be screened with high-concentration biological assays and validated with nuclear magnetic resonance (NMR)-based confirmation of binding. The binding poses of multiple fragments can be functioned as a map for guiding the compound design for medicinal chemists. Fragments can be linked, merged, or grown in a synthetically reasonable manner to result in drug-like molecules.
The computational approaches benefit from (1) time-saving, (2) labor-saving, and (3) material-saving. Their major liabilities remain (1) the accuracy of the model and (2) sufficient conformational sampling. SILCS cFGM brings three scientific advantages to experimental methods. First, the MD trajectory could capture both long-lasting and fleeting interactions, which makes it possible for detecting low-affinity binding regions. Second, as an outcome from the first advantage, a functional group map can include all regions inside the entire target structure. Not only the target surface as revealed from the crystal structure is considered but also the inducible pockets resulting from the fragment binding are included. Third, the aggregation of hydrophobic fragments and the denaturation of the targets can be prevented. Meanwhile, SILCS cFGM can also have quantitative applications. 3D pharmacophore models can be prepared through cFGM. Models can function as screening tools for compound selection. Maps can also be used to construct scoring functions for molecular docking and pose refinement.
Deconstruction-Reconstruction Approach
Haijun Chen et al. gave a comprehensive summary on improvements in the deconstruction-reconstruction approach for recent decades in 2014 (30). Instead of using established fragment libraries for the screening towards the given binding pocket and selecting preferable fragments, the deconstruction-reconstruction approach uses known ligands as the source for fragment generation. A relatively small and specific fragment library can be generated through the deconstruction of known ligands. The following reconstruction step is supposed to combine fragments in different regions and create novel compounds with new scaffolds.
The process, which is constituted with two steps, for the deconstruction-reconstruction approach is straightforward. The first step is the deconstruction of known compounds. Fragments will be created. The virtual screening between the given target and the fragments will reveal the potential interactions. The awareness of suitable fragments for different subdomains will benefit the drug design using growing, linking, and merging in the reconstruction step. Usually, obstacles exist with the reconstruction procedure. The newly designed molecules can have distorted binding modes of their individual fragments. The deconstruction-reconstruction approach is an efficient methodology for compound design and novel scaffold generation.
Multitasking Computational Model Approach
Alejandro Speck-Planche and M. Natalia D. S. Cordeiro reported the first multitasking (mtk) computational model for in silico fragment-based compound design of multiple-target inhibitors (31). One case study of designing molecules with good inhibition activities on multiple breast cancer-related proteins was presented in their paper to illustrate the effectiveness of the proposed approach. Instead of using traditional high-throughput screening or virtual screening for selecting active candidates through a huge database, the multitasking computational model likes to utilize the accumulated high-quality experimental data. Quantitative contributions of inhibition activities for each fragment will be calculated. And the physicochemical interpretations of descriptors in the mtk computational model will be considered. Both the fragment quantitative contribution and the descriptor physicochemical interpretation will provide the guidance for the multi-target compound design.
The establishment of the multitasking computational model involves three steps. The first step is the extraction of biological data for molecules through databases like CHEMBL. Restrictions can be applied towards the selection of molecules. Examples of restrictions are the number of reported measurements on activities, the type of measurements, and the accuracy and reproducibility of the assays. The second step is the calculation of molecular descriptors for selected molecules. Topological descriptors can be calculated by QUBILs-MIDAS (32), which is based on multilinear algebraic maps and discrete mathematics. The third step is the generation of the mtk computational model. Statistical cases will be split into training and test sets. The optimized model will be searched and determined by the training set. The test set will be used to show that the model is equipped with desirable prediction power. Several statistic indices will be utilized for the assessment of the model quality.
The established multitasking computational model will be further used to determine the quantitative contribution of each fragment. Fragments with positive contributions to the biological activities will be selected to connect with each other to generate new molecules. The druglikeness of the newly generated molecules will be analyzed, and the virtual screening between the target(s) and the newly generated molecule(s) will be followed to verify their effectiveness (Table I).
Table I.
Summary of the Latest and Widely Used Computational Programs and Algorithms for Fragment-Based Drug Design
| Program name | Features | Universal resource locator |
|---|---|---|
| ACFIS server (33) | Core fragment can be generated from active molecules. Fragments can be grown to the junction site. Energy calculation will be used to determine the fragment fit. | http://chemyang.ccnu.edu.cn/ccb/server/ACFIS/ |
| AutoT&T v.2 (34) | The multi-round optimization jobs are improved to give a higher speed. Structural crossover can be performed between several leads. Online server is available. | http://www.sioc-ccbg.ac.cn/software/att2/ |
| CHARMMing Web User (35) | Fragment-based docking protocol is included. Commonly used computational chemistry methodologies are incorporated. Virtual screening can be directly performed through the online server. | https://www.charmming.org/charmming/ |
| e-LEA3D (36) | Fragments or combination of fragments can be selected to fit a QSAR model or the binding pocket of a given protein. Virtual screening can be performed with user-specified criteria. Online server is available. | http://chemoinfo.ipmc.cnrs.fr |
| eMolFrag (37) | Non-redundant libraries of fragments could be prepared from a given set of molecules. The algorithm is available as both stand-alone software and a web server. | www.brylinski.org/emolfrag |
| eSynth (38) | Chemically feasible molecules could be reconstructed from molecular fragments. Diverse collections of molecules with the desired activity profiles could be generated. | www.brylinski.org/content/molecular-synthesis |
| FluX(39) | Druglike molecules will be deconstructed with pseudo-retrosynthesis rules. New molecules will be generated with stochastic search algorithm and ligand-based similarity scoring. | http://pubs.acs.org/doi/abs/10.1021/ci0503560 |
| FTMAP (40) | Billions of positions of small organic molecules are sampled as probes. Probe poses are scored using a detailed energy expression. Binding hot spots are identified by clusters of multiple probe types. | http://ftmap.bu.edu/publications.php |
| iScreen (41) | Cloud computing web server. Protein preparation tool is implemented for estimating the size of the ligand binding pocket. The included docking modes are standard, in-water, pH environment, and flexible. | http://iscreen.cmu.edu.tw/intro.php |
| LigBuilder 2 (42) | Designed compounds will be analyzed for synthesis accessibility. The drugability of binding pockets on the protein surface will be quantitatively assessed. Fragments meeting certain drug-like criteria are selected for constructing novel compounds. | http://pubs.acs.org/doi/abs/10.1021/ci100350u The package can be acquired by contacting the authors. |
| Novoflap (43) | The building blocks are 1300 fragments in a fragment library. An evolutionary algorithm and a ligand-based scoring function were combined for this approach. | http://pubs.acs.org/doi/abs/10.1021/ci100080r |
| MED-SuMo (44) | The MED-SuMo fragment database can be used for populating the binding sites. MED-Portion chemical moieties hopping is the major strategy for this approach. | http://medit-pharma.com/index.php?page=fragment-based-approach |
| molBLOCKS (45) | molBLOCKS is a suite of programs to (1) break down sets of molecules, (2) cluster the resulting fragments, and (3) uncover enriched objectives. | http://compbio.cs.princeton.edu/molblocks |
| S4MPLE(46) | Several entities can be docked simultaneously. Sampling is based on the hybrid genetic algorithm. AMBER force field was used for energy calculation. | http://pubs.acs.org/doi/abs/10.1021/ci300495r |
| SHAFTS (47) | 3D molecular similarity calculation uses a hybrid similarity metric and labeled chemistry groups. Experimentally determined conformations are not required for templates in ligand-based virtual screening. | http://pubs.acs.org/doi/abs/10.1021/ci200060s |
SUCCESSFUL APPLICATIONS OF COMPUTATIONAL FRAGMENT-BASED DRUG DESIGN AND DISCOVERY
Case Study 1: Discovery of Small-Molecule STAT3 Inhibitor
Li, Yu, and their groups reported the application of in silico site-directed FBDD for designing the novel inhibitors for STAT3 (48). STAT3 is an attractive target for therapeutic uses among different carcinomas, including breast cancer and sarcomas. The phosphorylation and dimerization of STAT3 are closely related to the oncology of the above-mentioned carcinomas (49,50). STAT3 inhibitors should bind to the SH2 domain on the protein in order to block both the phosphor-ylation and dimerization processes through competing with the native phosphotyrosine loop (48). A flowchart (Fig. 5) was summarized to demonstrate the workflow. In their study, they first did the molecular docking studies between the SH2 domain and the known STAT3 dimerization inhibitors. The fragment library was established based on the known inhibitors, and the sublibraries, which include fragments belonging to the site pTyr705 and the side pocket, were determined according to the binding modes revealed through the docking studies. Then, selected fragments from one sublibrary were linked with fragments in the other sublibrary to build a new library for novel lead compounds. Finally, the molecular docking studies between the novel leads and the SH2 domain were performed to determine the final leads for chemistry synthesis and biological evaluation. Four out of five synthesized compounds have their IC50 less than 5 micromolar against the cancer cell line U2OS. Especially, compound 8 has a great potency among a variety of cancer cell lines. The fluorescence polarization assay verified the binding of 8 with the STAT3 SH2 domain.
Fig. 5.

An illustrated flowchart of small-molecule inhibitor design on STAT3 using “fragment linking.” Fragments belonging to different sites were colored in blue and orange. The structure of compound 8 was specified. The two colors in the compound indicated the interaction sites
Case Study 2: In Silico Design of Allosteric Modulators on mGluR5
Xie and his group reported their practice of using computational FBDD for finding novel allosteric modulators on metabotropic glutamate receptor 5 (51). mGluR5 is an appealing target for drug development since it is associated with multiple diseases in the central nervous system. Furthermore, designing GPCR allosteric modulators is a promising strategy for drug development, given that allosteric binding pockets allow additional receptor-ligand interactions (51,52). The process of their application on compound design was summarized into a flowchart (Fig. 6). In the study, 27,262 known GPCR allosteric modulators were broken into fragments using RECAP analysis first. Rule of three was used here for guiding the selection of preferred fragments. The molecular docking studies between the fragment library and the mGluR5 allosteric binding pocket were followed. The upper and bottom regions inside the pocket were revealed through the docking studies. Then, the fragments were categorized into two groups according to the interactions and affinities between the fragments and the residues of a certain region. RECAP synthesis was adopted for generating novel leads by connecting fragments in these two regions. The molecular docking studies between newly generated leads and the mGluR5 allosteric binding pocket were conducted for selecting the potential modulators. Multiple docking algorithms were used for evaluation. The enrichment test with challenging decoys and the QSAR simulation were continued for validation. Among the top 20 in silico hits, structural diversified compounds with reported or unreported scaffolds could be observed. A similarity search revealed that a series of patented scaffolds with reported mGluR allosteric activities could recur through the in silico design.
Fig. 6.

An illustrated flowchart of allosteric modulator design on mGluR5 using fragment linking. Upper and bottom regions, as well as the corresponding fragments, were colored in blue and orange
Case Study 3: Fragment-Based Inhibitor Discovery on CTX-M β-Lactamase
Chen and Shoichet reported their research on using the in silico fragment-based drug design to facilitate the compounds development processes. Their efforts resulted in the first micromolar-range class A β-lactamase, CTX-M, non-covalent inhibitor (11). β-lactamase CTX-M turns out to be one of the most important mechanisms for the bacteria resistance towards third-generation cephalosporins (53). With a large and open active site, CTX-M has been considered a difficult target for developing non-covalent inhibitors. In their study, molecular docking studies between the active site and a fragment subset (67,489 compounds) or a lead-like subset (1,147,326 compounds) were performed. Compounds were selected among the top-ranking list. Ten out of 69 investigated fragment inhibitors had their IC50 values in millimolar level. Given the complementarity between the active site and the tetrazole group, which was discovered through the fragment docking study, they progressed to look for larger analogs. Compound 12 was subsequently identified with Ki value of 21 micromolar. 12 was ranked 982 among the docking screening for lead-like compounds, which is low enough for researchers to ignore it. Co-crystallization of the protein and inhibitors was performed for confirming the binding and comparing the docking modes. Competitive binding assays on CTX-M and AmpC were continued to evaluate the specificity of inhibitors’ binding. The key points of their approach were summarized and a flowchart was created to illustrate the workflow (Fig. 7). Dr. Yu Chen and Dr. Brian K Shoichet discovered the first micromolar-range CTX-M non-covalent inhibitor with computational fragment-based drug design. Also, they stated that “the high hit rates among fragments partly arise from their initial low specificity, and that specificity can be designed into these molecules as they are advanced”.
Fig. 7.

An illustrated flowchart of inhibitor discovery on CTX-M β-lactamase using fragment growing. Molecular docking studies using lead-like subset were followed after identifying critical fragment hit through fragment-based screening. The structure of compound 12 was specified. The tetrazole group was colored in blue
LIMITATIONS AND FUTURE PROSPECTIVE
Fragments are relatively smaller molecules compared with lead-like and drug-like compounds, which makes the molecular docking processes much more difficult (54). The difficulty mainly occurs because of two aspects. (1) Fragments usually cannot form sufficient interactions with the surrounding residues. The limited amount of functional groups results in the low-affinity binding. Investigators may miss valuable and potential fragment hits because these fragments have a tiny size and weak interactions towards the protein. (2) Due to the relatively small size, the docking modes for fragments can be promiscuous. Fragments can be favored by multiple sub-regions, in case they share similar chemical-physical properties, inside a binding pocket. The prediction is hard to perform, and the subsequent analysis processes can be time-consuming.
Molecular docking technology underwent progress in the last several decades. Researchers can choose from systematic or stochastic methods for conformational search and force-field-based, empirical, or knowledge-based functions for scoring (55). However, the accuracy for predicting the receptor-ligand binding modes still cannot be guaranteed. Considering that parameterized lead-like or drug-like molecules can often have the inaccurate prediction, it is even more challenging to give reliable predictions for small fragments.
Reliable protein models and precise sites of the binding pockets are required for computational fragment-based drug design. The majority of the proteins, especially for membrane proteins like GPCRs, currently do not have the crystallized structure information, which challenges the application of this drug development strategy. Homology modeling relieves this problem to some extent if the target protein shares high sequence similarity with the crystallized template. But the overall situation is far from optimistic. Also, static models may not be able to represent the proteins in the dynamic biological environments. The rotation of one or more critical residues may cause the dramatic conformational change to the shape of the binding sites, or even the whole protein structure. The binding modes revealed through static models may not recur the receptor-ligand interactions in reality.
The development and improvement of novel and current methodologies will be the focus in the future in order to consummate the computational FBDD. New methodologies should improve the accuracy of the fragment docking and make the docking poses predictable. The combination of experimental and computational approaches should also be emphasized. In silico approaches will reduce the workload, save resources, and provide guidance for chemical synthesis and biological tests to some extent. With more practice and continuing improvements in computational FBDD, challenging barriers can be overcome, and future drug development processes can benefit.
Acknowledgments
The authors would like to acknowledge the funding supports to the Xie laboratory from the NIH NIDA (P30 DA035778A1) and DOD (W81XWH-16-1-0490). The first author would like to thank Jie in particular, for the consistant support, love, and the memorable and solemn wedding. How lucky the first author is to have Jie as his bride!
Footnotes
Guest Editor: Xiang-Qun Xie
COMPLIANCE WITH ETHICAL STANDARDS
Conflict of Interest The authors declare that they have no competing interest.
References
- 1.Mayr LM, Bojanic D. Novel trends in high-throughput screening. Curr Opin Pharmacol. 2009;9(5):580–8. doi: 10.1016/j.coph.2009.08.004. [DOI] [PubMed] [Google Scholar]
- 2.Hajduk PJ, Greer J. A decade of fragment-based drug design: strategic advances and lessons learned. Nat Rev Drug Discov. 2007;6(3):211–9. doi: 10.1038/nrd2220. [DOI] [PubMed] [Google Scholar]
- 3.Sheng C, Zhang W. Fragment informatics and computational fragment-based drug design: an overview and update. Med Res Rev. 2013;33(3):554–98. doi: 10.1002/med.21255. [DOI] [PubMed] [Google Scholar]
- 4.Congreve M, Carr R, Murray C, Jhoti H. A ‘rule of three’ for fragment-based lead discovery? Drug Discov Today. 2003;8(19):876–7. doi: 10.1016/s1359-6446(03)02831-9. [DOI] [PubMed] [Google Scholar]
- 5.Jhoti H, Williams G, Rees DC, Murray CW. The ‘rule of three’ for fragment-based drug discovery: where are we now? Nat Rev Drug Discov. 2013;12(8):644–5. doi: 10.1038/nrd3926-c1. [DOI] [PubMed] [Google Scholar]
- 6.Koster H, Craan T, Brass S, Herhaus C, Zentgraf M, Neumann L, et al. A small nonrule of 3 compatible fragment library provides high hit rate of endothiapepsin crystal structures with various fragment chemotypes. J Med Chem. 2011;54(22):7784–96. doi: 10.1021/jm200642w. [DOI] [PubMed] [Google Scholar]
- 7.Congreve M, Chessari G, Tisi D, Woodhead AJ. Recent developments in fragment-based drug discovery. J Med Chem. 2008;51(13):3661–80. doi: 10.1021/jm8000373. [DOI] [PubMed] [Google Scholar]
- 8.Murray CW, Callaghan O, Chessari G, Cleasby A, Congreve M, Frederickson M, et al. Application of fragment screening by X-ray crystallography to beta-secretase. J Med Chem. 2007;50(6):1116–23. doi: 10.1021/jm0611962. [DOI] [PubMed] [Google Scholar]
- 9.Haydon DJ, Stokes NR, Ure R, Galbraith G, Bennett JM, Brown DR, et al. An inhibitor of FtsZ with potent and selective anti-staphylococcal activity. Science. 2008;321(5896):1673–5. doi: 10.1126/science.1159961. [DOI] [PubMed] [Google Scholar]
- 10.Card GL, Blasdel L, England BP, Zhang C, Suzuki Y, Gillette S, et al. A family of phosphodiesterase inhibitors discovered by cocrystallography and scaffold-based drug design. Nat Biotechnol. 2005;23(2):201–7. doi: 10.1038/nbt1059. [DOI] [PubMed] [Google Scholar]
- 11.Chen Y, Shoichet BK. Molecular docking and ligand specificity in fragment-based inhibitor discovery. Nat Chem Biol. 2009;5(5):358–64. doi: 10.1038/nchembio.155. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Hashem IAT, Yaqoob I, Anuar NB, Mokhtar S, Gani A, Khan SU. The rise of “big data” on cloud computing: review and open research issues. Inf Syst. 2015;47:98–115. [Google Scholar]
- 13.Ray PC, Kiczun M, Huggett M, Lim A, Prati F, Gilbert IH, et al. Fragment library design, synthesis and expansion: nurturing a synthesis and training platform. Drug Discov Today. 2017;22(1):43–56. doi: 10.1016/j.drudis.2016.10.005. [DOI] [PubMed] [Google Scholar]
- 14.Keserű GM, Erlanson DA, Ferenczy GG, Hann MM, Murray CW, Pickett SD. Design principles for fragment libraries: maximizing the value of learnings from pharma fragment-based drug discovery (FBDD) programs for use in academia. J Med Chem. 2016;59(18):8189–206. doi: 10.1021/acs.jmedchem.6b00197. [DOI] [PubMed] [Google Scholar]
- 15.Yuriev E, Holien J, Ramsland PA. Improvements, trends, and new ideas in molecular docking: 2012–2013 in review. J Mol Recognit. 2015;28(10):581–604. doi: 10.1002/jmr.2471. [DOI] [PubMed] [Google Scholar]
- 16.Yuriev E, Ramsland PA. Latest developments in molecular docking: 2010–2011 in review. J Mol Recognit. 2013;26(5):215–39. doi: 10.1002/jmr.2266. [DOI] [PubMed] [Google Scholar]
- 17.Wang L, Xie Z, Wipf P, Xie X-Q. Residue preference mapping of ligand fragments in the protein data bank. J Chem Inf Model. 2011;51(4):807–15. doi: 10.1021/ci100386y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Myint K, Ma C, Wang L, Xie X-Q. Fragment-similarity-based QSAR (FS-QSAR) algorithm for ligand biological activity predictions. SAR QSAR Environ Res. 2011;22(3–4):385–410. doi: 10.1080/1062936X.2011.569943. [DOI] [PubMed] [Google Scholar]
- 19.Friesner RA, Banks JL, Murphy RB, Halgren TA, Klicic JJ, Mainz DT, et al. Glide: a new approach for rapid, accurate docking and scoring. 1. Method and assessment of docking accuracy. J Med Chem. 2004;47(7):1739–49. doi: 10.1021/jm0306430. [DOI] [PubMed] [Google Scholar]
- 20.Jones G, Willett P, Glen RC. Molecular recognition of receptor sites using a genetic algorithm with a description of desolvation. J Mol Biol. 1995;245(1):43–53. doi: 10.1016/s0022-2836(95)80037-9. [DOI] [PubMed] [Google Scholar]
- 21.Jones G, Willett P, Glen RC, Leach AR, Taylor R. Development and validation of a genetic algorithm for flexible docking. J Mol Biol. 1997;267(3):727–48. doi: 10.1006/jmbi.1996.0897. [DOI] [PubMed] [Google Scholar]
- 22.Jain AN. Surflex-dock 2.1: robust performance from ligand energetic modeling, ring flexibility, and knowledge-based search. J Comput Aided Mol Des. 2007;21(5):281–306. doi: 10.1007/s10822-007-9114-2. [DOI] [PubMed] [Google Scholar]
- 23.Bohnuud T, Luo L, Wodak SJ, Bonvin AM, Weng Z, Vajda S, et al. A benchmark testing ground for integrating homology modeling and protein docking. Proteins: Structure, Function, and Bioinformatics. 2017;85(1):10–6. doi: 10.1002/prot.25063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Cavasotto CN, Phatak SS. Homology modeling in drug discovery: current trends and applications. Drug Discov Today. 2009;14(13):676–83. doi: 10.1016/j.drudis.2009.04.006. [DOI] [PubMed] [Google Scholar]
- 25.Webb B, Sali A. Protein structure modeling with MODELLER. Protein Structure Prediction. 2014:1–15. doi: 10.1007/978-1-4939-0366-5_1. [DOI] [PubMed] [Google Scholar]
- 26.Schwede T, Kopp J, Guex N, Peitsch MC. SWISS-MODEL: an automated protein homology-modeling server. Nucleic Acids Res. 2003;31(13):3381–5. doi: 10.1093/nar/gkg520. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Chung S, Parker JB, Bianchet M, Amzel LM, Stivers JT. Impact of linker strain and flexibility in the design of a fragment-based inhibitor. Nat Chem Biol. 2009;5(6):407–13. doi: 10.1038/nchembio.163. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Kawai K, Nagata N, Takahashi Y. De novo design of drug-like molecules by a fragment-based molecular evolutionary approach. J Chem Inf Model. 2014;54(1):49–56. doi: 10.1021/ci400418c. [DOI] [PubMed] [Google Scholar]
- 29.Guvench O. Computational functional group mapping for drug discovery. Drug Discov Today. 2016;21(12):1928–31. doi: 10.1016/j.drudis.2016.06.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Chen H, Zhou X, Wang A, Zheng Y, Gao Y, Zhou J. Evolutions in fragment-based drug design: the deconstruction-reconstruction approach. Drug Discov Today. 2015;20(1):105–13. doi: 10.1016/j.drudis.2014.09.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Speck-Planche A, Cordeiro MND. Fragment-based in silico modeling of multi-target inhibitors against breast cancer-related proteins. Mol Divers. 2017:1–13. doi: 10.1007/s11030-017-9731-1. [DOI] [PubMed] [Google Scholar]
- 32.Garcia-Jacas CR, Marrero-Ponce Y, Acevedo-Martinez L, Barigye SJ, Valdes-Martini JR, Contreras-Torres E. QuBiLS-MIDAS: a parallel free-software for molecular descriptors computation based on multilinear algebraic maps. J Comput Chem. 2014;35(18):1395–409. doi: 10.1002/jcc.23640. [DOI] [PubMed] [Google Scholar]
- 33.Hao GF, Jiang W, Ye YN, Wu FX, Zhu XL, Guo FB, et al. ACFIS: a web server for fragment-based drug discovery. Nucleic Acids Res. 2016;44(W1):W550–6. doi: 10.1093/nar/gkw393. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Li Y, Zhao Z, Liu Z, Su M, Wang R. AutoT&T v.2: an efficient and versatile tool for lead structure generation and optimization. J Chem Inf Model. 2016;56(2):435–53. doi: 10.1021/acs.jcim.5b00691. [DOI] [PubMed] [Google Scholar]
- 35.Pevzner Y, Frugier E, Schalk V, Caflisch A, Woodcock HL. Fragment-based docking: development of the CHARMMing Web user interface as a platform for computer-aided drug design. J Chem Inf Model. 2014;54(9):2612–20. doi: 10.1021/ci500322k. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Douguet D. e-LEA3D: a computational-aided drug design web server. Nucleic Acids Res. 2010;38:W615–21. doi: 10.1093/nar/gkq322. Web Server. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Liu T, Naderi M, Alvin C, Mukhopadhyay S, Brylinski M. Break down in order to build up: decomposing small molecules for fragment-based drug design with e MolFrag. J Chem Inf Model. 2017;57(4):627–31. doi: 10.1021/acs.jcim.6b00596. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Naderi M, Alvin C, Ding Y, Mukhopadhyay S, Brylinski M. A graph-based approach to construct target-focused libraries for virtual screening. J Cheminformatics. 2016;8(1):14. doi: 10.1186/s13321-016-0126-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Fechner U, Schneider G. Flux (1): a virtual synthesis scheme for fragment-based de novo design. J Chem Inf Model. 2006;46(2):699–707. doi: 10.1021/ci0503560. [DOI] [PubMed] [Google Scholar]
- 40.Kozakov D, Grove LE, Hall DR, Bohnuud T, Mottarella SE, Luo L, et al. The FTMap family of web servers for determining and characterizing ligand-binding hot spots of proteins. Nat Protoc. 2015;10(5):733–55. doi: 10.1038/nprot.2015.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Tsai TY, Chang KW, Chen CY. iScreen: world’s first cloud-computing web server for virtual screening and de novo drug design based on TCM database@Taiwan. J Comput Aided Mol Des. 2011;25(6):525–31. doi: 10.1007/s10822-011-9438-9. [DOI] [PubMed] [Google Scholar]
- 42.Yuan Y, Pei J, Lai L. LigBuilder 2: a practical de novo drug design approach. J Chem Inf Model. 2011;51(5):1083–91. doi: 10.1021/ci100350u. [DOI] [PubMed] [Google Scholar]
- 43.Damewood JR, Jr, Lerman CL, Masek BB. NovoFLAP: a ligand-based de novo design approach for the generation of medicinally relevant ideas. J Chem Inf Model. 2010;50(7):1296–1303. doi: 10.1021/ci100080r. [DOI] [PubMed] [Google Scholar]
- 44.Moriaud F, Richard SB, Adcock SA, Chanas-Martin L, Surgand JS, Ben Jelloul M, et al. Identify drug repurposing candidates by mining the protein data bank. Brief Bioinform. 2011;12(4):336–40. doi: 10.1093/bib/bbr017. [DOI] [PubMed] [Google Scholar]
- 45.Ghersi D, Singh M. molBLOCKS: decomposing small molecule sets and uncovering enriched fragments. Bioinformatics. 2014;30(14):2081–3. doi: 10.1093/bioinformatics/btu173. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Hoffer L, Chira C, Marcou G, Varnek A, Horvath D. S4MPLE—sampler for multiple protein-ligand entities: methodology and rigid-site docking benchmarking. Molecules. 2015;20(5):8997–9028. doi: 10.3390/molecules20058997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Liu X, Jiang H, Li H. SHAFTS: a hybrid approach for 3D molecular similarity calculation. 1. Method and assessment of virtual screening. J Chem Inf Model. 2011;51(9):2372–85. doi: 10.1021/ci200060s. [DOI] [PubMed] [Google Scholar]
- 48.Yu W, Xiao H, Lin J, Li C. Discovery of novel STAT3 small molecule inhibitors via in silico site-directed fragment-based drug design. J Med Chem. 2013;56(11):4402–12. doi: 10.1021/jm400080c. [DOI] [PubMed] [Google Scholar]
- 49.Wu P, Wu D, Zhao L, Huang L, Shen G, Huang J, et al. Prognostic role of STAT3 in solid tumors: a systematic review and meta-analysis. Oncotarget. 2016;7(15):19863–83. doi: 10.18632/oncotarget.7887. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Banerjee K, Resat H. Constitutive activation of STAT3 in breast cancer cells: a review. Int J Cancer. 2016;138(11):2570–8. doi: 10.1002/ijc.29923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Bian Y, Feng Z, Yang P, Xie XQ. Integrated in silico fragment-based drug design: case study with allosteric modulators on metabotropic glutamate receptor 5. AAPS J. 2017;19(4):1235–48. doi: 10.1208/s12248-017-0093-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Wenthur CJ, Gentry PR, Mathews TP, Lindsley CW. Drugs for allosteric sites on receptors. Annu Rev Pharmacol Toxicol. 2014;54:165–84. doi: 10.1146/annurev-pharmtox-010611-134525. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Bonnet R. Growing group of extended-spectrum beta-lactamases: the CTX-M enzymes. Antimicrob Agents Chemother. 2004;48(1):1–14. doi: 10.1128/AAC.48.1.1-14.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Hubbard RE, Chen I, Davis B. Informatics and modeling challenges in fragment-based drug discovery. Curr Opin Drug Discov Devel. 2007;10(3):289–97. [PubMed] [Google Scholar]
- 55.Ferreira LG, Dos Santos RN, Oliva G, Andricopulo AD. Molecular docking and structure-based drug design strategies. Molecules. 2015;20(7):13384–421. doi: 10.3390/molecules200713384. [DOI] [PMC free article] [PubMed] [Google Scholar]
