Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Mar 1.
Published in final edited form as: Methods Mol Biol. 2017;1529:439–446. doi: 10.1007/978-1-4939-6637-0_23

Computational Tools for Allosteric Drug Discovery: Site Identification and Focus Library Design

Wenkang Huang 1, Ruth Nussinov 2,3, Jian Zhang 4
PMCID: PMC7920515  NIHMSID: NIHMS1665809  PMID: 27914066

Abstract

Allostery is an intrinsic phenomenon of biological macromolecules involving regulation and/or signal transduction induced by a ligand binding to an allosteric site distinct from a molecule’s active site. Allosteric drugs are currently receiving increased attention in drug discovery because drugs that target allosteric sites can provide important advantages over the corresponding orthosteric drugs including specific subtype selectivity within receptor families. Consequently, targeting allosteric sites, instead of orthosteric sites, can reduce drug-related side effects and toxicity. On the down side, allosteric drug discovery can be more challenging than traditional orthosteric drug discovery due to difficulties associated with determining the locations of allosteric sites and designing drugs based on these sites and the need for the allosteric effects to propagate through the structure, reach the ligand binding site and elicit a conformational change. In this study, we present computational tools ranging from the identification of potential allosteric sites to the design of “allosteric-like” modulator libraries. These tools may be particularly useful for allosteric drug discovery.

Keywords: Allosteric site, Allosteric modulator, Allosteric drug discovery, Allostery, Allosteric drug design

1. Introduction

Allostery, which is also known as allosteric regulation, is an essential biological phenomenon that plays significant roles in signal transduction pathways, metabolic processes, and genomic transcription [1, 2]. Perturbation at an allosteric site can rapidly shift the equilibrium of a protein conformational ensemble towards another state, thereby inducing local conformation change at an active site [35]. Potential perturbations include the binding of small molecules/ions and local chemical modifications [68]. Thus, allostery is the most direct mechanism for regulating the function of biological macromolecules. Insight into allostery can lead to new ideas for method development in allosteric drug discovery [9, 10].

Unlike orthosteric drugs, which compete with the substrates of target proteins at the active sites, allosteric drugs bind at a location other than an active site and influence the affinity or catalytic efficiency of biological macromolecules through the propagation of a perturbation signal [1114]. Allosteric drugs have several advantages relative to orthosteric drugs. First, according to sequence conservation analyses [15, 16], allosteric sites are significantly less conserved than orthosteric sites; this phenomenon allows allosteric modulators to selectively target specific subtypes within receptor families [17, 18], resulting in higher selectivity and fewer side effects than orthosteric drugs. Second, allosteric drugs do not block substrate-protein interactions, and there is an upper bound to allosteric regulation. In addition, allosteric modulators can enhance the efficiency of orthosteric drugs [19]. For instance, the allosteric modulator GNF-2 binds to the myristate-binding site of T315I human Bcr-Abl. GNF-2 and the substrate-competitive inhibitor imatinib exhibit additive inhibitory activity against this mutated Bcr-Abl; as a result, a combination of these two drugs can be used to overcome drug resistance in cases of chronic myelogenous leukemia (CML) [20]. Thus, the identification of modulators targeting allosteric sites receives increasing attention in the field of drug discovery, and several allosteric drugs have been approved by the US FDA [21, 22]. For example, Genzyme’s plerixafor is an allosteric antagonist of the C-X-C chemokine receptor type 4 (CXCR4) that enhances the mobilization of hematopoietic stem cells (HSCs).

Allosteric drug discovery also presents new challenges relative to traditional drug discovery approaches. The identification and characterization of drug binding sites is the first step of structure-based drug discovery. However, the locations of allosteric sites remain unclear for most drug targets [23]. Moreover, the discovery of allosteric modulators is hampered by several obstacles, such as the low affinities and unknown structural features of potential small allosteric molecules. In our prior work, we summarized the properties of allosteric sites [24] and allosteric modulators [25] and developed an allosteric site identification method named Allosite [26]. In addition, a preliminary filter for allosteric modulator discovery was also established. In this protocol, we introduce practical guidelines describing how to obtain predictions for allosteric sites and build a focused library of “allosteric-like” molecules.

1.1. Theory

The fundamental strategy for Allosite is to use the topology and physiochemical properties of protein pockets to build a classification model relating allosteric sites to other sites. We have extracted 90 nonredundant allosteric protein–allosteric modulator co-crystals from the AlloSteric Database [27, 28]. After feature selection, 21 pocket descriptors were characterized for each pocket identified by FPocket [29]. The classification model for allosteric site identification was then trained and tested using a support vector machine [30]. In a cross-validation test, the success metrics for the Support Vector Machine (SVM) model were a sensitivity of ~83 % and a specificity of ~96 %. We have made the final model available on the Allosite Web server.

To reveal the structural specificity of allosteric modulators, 3916 known structurally diverse allosteric modulators in the Allosteric Database were compared with compounds from other databases (the Accelrys Available Chemicals Directory, the Accelrys Comprehensive Medicinal Chemistry database, the Chinese Natural Product Database, DrugBank, the MDDR database, and the NCI Open Database). Interestingly, relative to other modulators, allosteric modulators exhibit higher structure rigidity, with less rotatable bonds and more rings from ring systems. In addition, higher hydrophobicity is also observed for allosteric modulators; this finding is consistent with the hydrophobic characteristics of allosteric sites [25]. In summary, we established the following rule for differentiating allosteric modulators from other modulators: (1) molecular weight (MW) ≤ 600; (2) number of rotatable bonds (nRB) ≤ 6; (3) 2 ≤ number of rings (nR) ≤ 5; (4) number of rings in the largest ring Systems (nRIS) = 1 or 2; and (5) 3 ≤ SlogP ≤ 7.

2. Materials

2.1. Software for Visualizing Protein Structures

The PyMOL molecular graphics system is required for visualizing PDB files and allosteric sites. This system, which is a open-source software, is available at http://www.pymol.org.

2.2. Browser

The Allosite server requires a Web browser with JavaScript and cookies enabled. A recommendation to ensure that protein structures can be visualized correctly is to use the latest version of Firefox or Chrome to access Allosite.

3. Methods

In the following subsections, we first describe the individual steps that the Allosite server uses to identify allosteric sites and then describe how to construct focused libraries for the screening of allosteric modulators with a preliminary filter.

3.1. Input File Preparation

Allosite utilizes a method based on the proteins’ three-dimensional structures, which can be obtained from the Protein Data Bank database (see Note 1). If there is no crystal structure for the query protein, homology modeling methods will be helpful for building the protein’s 3D structure. The following considerations should be taken into account to ensure the quality of the prediction. (1) We recommend using an X-ray structure with a resolution <2.5 Å. (2) There should be no missing loops in the main chain of the protein. (3) Small molecules, ions and solvents within the PDB structure will automatically be removed.

3.2. Job Submission

The Allosite Web server is freely available for use at http://mdl.shsmu.edu.cn/AST. Jobs can be submitted either by “PDB ID” or by “PDB File” (see Fig. 1). In “PDB ID” mode, users can specify their input by simply entering the 4-character PDB ID of their query protein. Users with their own experimental/model-based structures can choose the “PDB File” mode to browse their local hard drives and provide a protein structure file. A submitted query protein structure should be in standard PDB format. Another parameter, “Job Name”, must be set before running the job; this parameter can then be used to check the status of the job and retrieve calculated results for the job at any time. After “Job Name” has been specified, users can click “Run” and select PDB chain(s) to submit the job (see Note 2).

Fig. 1.

Fig. 1

The Web interface and workflow of Allosite

3.3. Retrieving the Results

Once the job has been submitted, detailed job information, including a unique Job ID, will appear on the “Select PDB chains” page. Users can also track the progress of a job or access the results page from the “Job Queue” page by searching for their Job ID. The status of a job is refreshed in the “Job List” every 10 s until the “Finished” button appears. The “Finished” button indicates that the job has finished, and results can be viewed by clicking this button. The Allosite approach features rapid calculation times that depend on the size of the query protein. A typical Allosite job for a 400-residue protein will require ~15 s.

3.4. Analyzing the Results

The job will redirect to the calculation result page after the “Finished” button has been clicked. The GLmol applet will load automatically and provide a default color-coded representation of the query protein. The predicted allosteric site can be viewed in the GLmol applet by clicking the “Show Pocket” button. The predicted allosteric site is displayed as white spheres, and allosteric site residues are represented using a stick model. The result page also contains the following pocket properties for the predicted allosteric site: “Pocket Volume”, “Pocket Total SASA”, “Pocket Polar SASA”, and “Pocket Druggability Score”. A representative run of an Allosite job provides 0–4 potential allosteric sites.

3.5. Analyzing the Results Using PyMOL

Result files can be downloaded for offline analysis by clicking the “Download Report” link. After tar archives have been extracted, three files are obtained: a structure file for the query protein, site information for the predicted results, and a .pml PyMOL script for visualization. Users can then analyze the predicted allosteric site in PyMOL (see Fig. 2).

Fig. 2.

Fig. 2

An analysis of Allosite results using PyMOL. The allosteric pocket is represented by white points. Red lines are used to highlight residues in this site

3.6. Designing Focus Libraries of Allosteric Modulators

Based on our prior work, molecules that satisfy the following criteria are more likely to be allosteric modulators: (1) MW ≤ 600; (2) RBN ≤ 6; (3) 2 ≤ nR ≤ 5; (4) nRIS = 1 or 2; and (5) 3 ≤ SlogP ≤ 7. To fetch potential allosteric modulators from a database of chemical molecules, we developed a Web server that can be accessed at http://mdl.shsmu.edu.cn/ASD/. For each job, users can upload their molecular database of interest with either 2D or 3D structures. Three file types are acceptable for uploading: MOL, SDF, and SMILES. When our Web server completes a job, users are redirected to a results page where they can click “Download” to download molecules that have passed the “allosteric-like” filter. Histograms indicate the distribution of five calculated molecular properties. The filter’s local script runs quickly and is recommended for users who intend to filter large molecular databases.

Acknowledgments

This project has been funded in whole or in part with Federal funds from the Frederick National Laboratory for Cancer Research, National Institutes of Health, under contract HHSN261200800001E. This research was supported [in part] by the Intramural Research Program of NIH, Frederick National Lab, Center for Cancer Research. The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products, or organizations imply endorsement by the US Government. This research was supported in part by Natural Science Foundation of China (81322046, 81302698, 81473137) and Shanghai Rising-Star Program (13QA1402300).

Footnotes

1.

Many proteins have multiple structures in the PDB database that have been generated in different crystal environments. If a protein has multiple conformation states (such as a protein kinase with active and inactive states [13]), diverse conformations can be separately submitted to the Allosite server to obtain robust results. Moreover, these crystal conformations only represent small proportions of the conformational ensembles of allosteric proteins. Therefore, it is difficult to identify cryptic allosteric sites in proteins because these sites are transient during conformational changes and invisible to conventional X-ray crystal structures [31]. Molecular dynamics (MD) simulations are widely used for conformational ensemble sampling and for generating representative structures from diverse conformations [32]. Thus, users can predict cryptic allosteric sites with Allosite if representative conformations are chosen as inputs.

2.

One class of allosteric modulators binds to a pocket emerging from multimerization or protein–protein interactions, but these modulators do not directly inhibit protein interaction [33]. To identify interfacial allosteric sites, multiple chains should be chosen at the “PDB chain selection” step.

References

  • 1.Changeux JP (2013) The concept of allosteric modulation: an overview. Drug Discov Today Technol 10:e223–e228 [DOI] [PubMed] [Google Scholar]
  • 2.Eisenberg D, Marcotte EM, Xenarios I, Yeates TO (2000) Protein function in the post-genomic era. Nature 405:823–826 [DOI] [PubMed] [Google Scholar]
  • 3.Kar G, Keskin O, Gursoy A, Nussinov R (2010) Allostery and population shift in drug discovery. Curr Opin Pharmacol 10:715–722 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Boehr DD, Nussinov R, Wright PE (2009) The role of dynamic conformational ensembles in biomolecular recognition. Nat Chem Biol 5:789–796 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Motlagh HN, Wrabl JO, Li J, Hilser VJ (2014) The ensemble nature of allostery. Nature 508:331–339 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Csermely P, Palotai R, Nussinov R (2010) Induced fit, conformational selection and independent dynamic segments: an extended view of binding events. Trends Biochem Sci 35:539–546 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Sinha N, Nussinov R (2001) Point mutations and sequence variability in proteins: redistributions of preexisting populations. Proc Natl Acad Sci U S A 98:3139–3144 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Goodey NM, Benkovic SJ (2008) Allosteric regulation and catalysis emerge via a common route. Nat Chem Biol 4:474–482 [DOI] [PubMed] [Google Scholar]
  • 9.Nussinov R, Tsai CJ (2013) Allostery in disease and in drug discovery. Cell 153:293–305 [DOI] [PubMed] [Google Scholar]
  • 10.Nussinov R, Tsai CJ (2014) Unraveling structural mechanisms of allosteric drug action. Trends Pharmacol Sci 35:256–264 [DOI] [PubMed] [Google Scholar]
  • 11.Christopoulos A (2002) Allosteric binding sites on cell-surface receptors: novel targets for drug discovery. Nat Rev Drug Discov 1:198–210 [DOI] [PubMed] [Google Scholar]
  • 12.Szilágyi A, Nussinov R, Csermely P (2013) Allo-network drugs: extension of the allosteric drug concept to protein- protein interaction and signaling networks. Curr Top Med Chem 13:64–77 [DOI] [PubMed] [Google Scholar]
  • 13.Cowan-Jacob SW, Jahnke W, Knapp S (2014) Novel approaches for targeting kinases: allosteric inhibition, allosteric activation and pseudo-kinases. Future Med Chem 6:541–561 [DOI] [PubMed] [Google Scholar]
  • 14.Nussinov R, Tsai C-J (2014) The design of covalent allosteric drugs. Annu Rev Pharmacol Toxicol 55:249–267 [DOI] [PubMed] [Google Scholar]
  • 15.Yang J-S, Seo SW, Jang S et al. (2012) Rational engineering of enzyme allosteric regulation through sequence evolution analysis. PLoS Comput Biol 8, e1002612. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Ma B, Elkayam T, Wolfson H, Nussinov R (2003) Protein-protein interactions: structurally conserved residues distinguish between binding sites and exposed protein surfaces. Proc Natl Acad Sci U S A 100:5772–5777 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Fang Z, Grütter C, Rauh D (2013) Strategies for the selective regulation of kinases with allosteric modulators: exploiting exclusive structural features. ACS Chem Biol 8:58–70 [DOI] [PubMed] [Google Scholar]
  • 18.Kenakin T, Miller LJ (2010) Seven transmembrane receptors as shapeshifting proteins: the impact of allosteric modulation and functional selectivity on new drug discovery. Pharmacol Rev 62:265–304 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Nussinov R, Tsai C (2012) The different ways through which specificity works in orthosteric and allosteric drugs. Curr Pharm Des 18:1311–1316 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Zhang J, Adrián FJ, Jahnke W et al. (2010) Targeting Bcr-Abl by combining allosteric with ATP-binding-site inhibitors. Nature 463:501–506 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Müller CE, Schiedel AC, Baqi Y (2012) Allosteric modulators of rhodopsin-like G protein-coupled receptors: opportunities in drug development. Pharmacol Ther 135:292–315 [DOI] [PubMed] [Google Scholar]
  • 22.Lu S, Li S, Zhang J (2014) Harnessing allostery: a novel approach to drug discovery. Med Res Rev 34:1242–1285 [DOI] [PubMed] [Google Scholar]
  • 23.Lu S, Huang W, Zhang J (2014) Recent computational advances in the identification of allosteric sites in proteins. Drug Discov Today 19:1595–1600 [DOI] [PubMed] [Google Scholar]
  • 24.Li X, Chen Y, Lu S et al. (2013) Toward an understanding of the sequence and structural basis of allosteric proteins. J Mol Graph Model 40:30–39 [DOI] [PubMed] [Google Scholar]
  • 25.Wang Q, Zheng M, Huang Z et al. (2012) Toward understanding the molecular basis for chemical allosteric modulator design. J Mol Graph Model 38:324–333 [DOI] [PubMed] [Google Scholar]
  • 26.Huang W, Lu S, Huang Z et al. (2013) Allosite: a method for predicting allosteric sites. Bioinformatics 29:2357–2359 [DOI] [PubMed] [Google Scholar]
  • 27.Huang Z, Zhu L, Cao Y et al. (2011) ASD: a comprehensive database of allosteric proteins and modulators. Nucleic Acids Res 39: D663–D669 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Huang Z, Mou L, Shen Q et al. (2014) ASD v2. 0: updated content and novel features focusing on allosteric regulation. Nucleic Acids Res 42: D510–D516 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Le Guilloux V, Schmidtke P, Tuffery P (2009) Fpocket: an open source platform for ligand pocket detection. BMC Bioinformatics 10:168. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Chang C-C, Lin C-J (2011) LIBSVM: a library for support vector machines. ACM Trans Intell Syst Technol 2:27 [Google Scholar]
  • 31.Bowman GR, Geissler PL (2012) Equilibrium fluctuations of a single folded protein reveal a multitude of potential cryptic allosteric sites. Proc Natl Acad Sci U S A 109:11681–11686 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Jacobs DJ, Livesay DR, Mottonen JM et al. (2012) Ensemble properties of network rigidity reveal allosteric mechanisms. Methods Mol Biol 796:279–304 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Villoutreix BO, Kuenemann MA, Poyet J-L et al. (2014) Drug-like protein-protein interaction modulators: challenges and opportunities for drug discovery and chemical biology. Mol Inform 33:414–437 [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES