Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Feb 1.
Published in final edited form as: Curr Opin Struct Biol. 2022 May 27;75:102396. doi: 10.1016/j.sbi.2022.102396

Mapping the binding sites of challenging drug targets

Amanda E Wakefield 1,2, Dima Kozakov 3,4, Sandor Vajda 1,2
PMCID: PMC9790766  NIHMSID: NIHMS1856581  PMID: 35636004

Abstract

An increasing number of medically important proteins are challenging drug targets because their binding sites are too shallow or too polar, are cryptic and thus not detectable without a bound ligand or located in a protein-protein interface. While such proteins may not bind druglike small molecules with sufficiently high affinity, they are frequently druggable using novel therapeutic modalities. The need for such modalities can be determined by experimental or computational fragment based methods. Computational mapping by mixed solvent molecular dynamics simulations or the FTMap server can be used to determine binding hot spots. The strength and location of the hot spots provide very useful information for selecting potentially successful approaches to drug discovery.

Keywords: Beyond rule of five compounds, inhibitors of protein-protein interactions, allosteric sites, cryptic sites, mixed solvent molecular dynamics

Introduction

Genome-scale CRISPR knockout screens can discover many novel and medically important drug targets [1], but it is predicted that traditional small-molecule drugs may not be used to modulate about half of these proteins [2], because they have binding sites that are either too large or too small, are highly lipophilic or highly polar, or are simply featureless. Given the properties of the binding site, one could frequently predict that the standard methods of drug discovery by experimental or computational high throughput screening of libraries of druglike small molecules are unlikely to work. Nevertheless, in the recent past, substantial efforts have been devoted to large-scale screenings for some targets with at most moderate success. Examples include targeting ZipA pockets in the interface with FtsZ [3] and the SI/II pocket between switch I and switch II of KRAS in the interface with SOS [4,5]. Although such targets are frequently considered undruggable [5,6], in some cases they can be successfully modulated by new chemical modalities including larger (beyond-the-rule-of-five, bRo5) compounds [7,8], macrocycles [9,10], cyclic or stapled peptides, or peptoid macrocycles [11,12]. Other possible approaches are finding allosteric sites [13], covalent inhibitors [14], or combinations of the two [14].

To determine whether a particular target needs a non-druglike chemical modality and if it does, which one, it is generally useful to determine the binding properties of the protein, particularly the geometry and chemistry of its binding sites. It is now well established that the binding sites of proteins include binding hot spots, defined as small regions where binding of ligands makes major contributions to the binding free energy [15,16]. As we argue in this review, the main value of understanding the hot spot structure of a target protein is that it yields information on the methods that are reasonable choices to target the site [17]. Hot spots can be determined by screening sets of small organic probe molecules for binding to the target protein by X-ray crystallography [15] or NMR [16]. The Multiple Solvent Crystal Structures (MSCS) method, involves determining the X-ray structure of the protein in aqueous solutions of various probe compounds [15]. The protein structures with bound organic molecules are then superimposed to derive a consensus X-ray structure. It was shown that the consensus clusters formed by overlapping probe clusters define consensus sites that are the binding hot spots. Similar results can be obtained by NMR based screening of small organic molecules against the 15N-labeled target protein is screened [16]. It was shown that the consensus clusters formed by multiple probe molecules indicate binding hot spots, and that the number of different probes in the consensus cluster predicts the importance of the site.

Computational mapping of protein binding sites

Since using experimental techniques for determining binding hot spots is generally costly and can even be limited by physical constraints such as limits on the solubility of probe molecules, several computational methods have also been developed. The FTMap algorithm [17] and the mixed solvent molecular dynamics (MSMD) approach [18-21] are both computational analogs of the MSCS or NMR based fragment screening experiments. FTMap exhaustively docks the molecular probes to the protein exploring billions of positions for each probe, selects favorable positions using empirical energy functions, and refines the selected poses by minimizing a more accurate energy function that includes molecular mechanics and structure-based terms. The energy landscape is efficiently sampled using a fast Fourier transform (FFT) based algorithm. The selected probe positions are refined by accounting for probe and limited protein flexibility. To determine the hot spots FTMap finds the consensus sites and ranks the strength of these sites in terms of the number of overlapping probe clusters [17]. The strength and arrangement of hot spots show whether the protein is suitable for binding small druglike ligands, or it is a challenging target and hence needs other type of modalities [17].

MSMD is an alternative hot spot mapping technique based on molecular dynamics (MD) simulations of proteins in binary solvent mixtures. Similar to FTMap, the technique can capture preferred binding sites of fragment-sized organic compounds. The best known methods are MixMD [19] and SILCS (Site-Identification by Ligand Competitive Saturation) [22]. Advantages are that MSMD allows for protein flexibility and accounts for the competition between the probe molecules and water. In contrast, apart from minor side chain motion, FTMap assumes a rigid protein and uses continuum solvation models, thereby missing specific protein-water and probe-water interactions. However, the advantage of FTMap is that the method is much faster than mixed MD, and therefore can be used with a much larger variety of molecular probes and can be applied to large sets of proteins. In particular, it is frequently useful to consider all X-ray structures available for a protein to explore the impact of large conformational changes that would be difficult to model using MD.

Detecting the need for beyond rule of five (bRo5) compounds

Lipinski’s rule of five (Ro5) was developed to define the chemical space of orally bioavailable compounds. However, the concept is too restrictive [23], as over 30% of approved kinase inhibitors and around 50% of protein-protein inhibitors discussed in the scientific literature are beyond the rule of five (bRo5) compounds [23]. The need for using a bRo5 compound to target a protein can be effectively determined by mapping of the binding hot spots [8]. Targets can benefit from bRo5 drugs if they have complex hot spot structures with four or more binding hots spots, including some strong ones. Although such targets are conventionally druggable using molecules that are bRo5 compliant, reaching additional hot spots improves binding affinity, which creates options for improving pharmaceutical properties by adding or replacing some functional groups that otherwise would be detrimental to binding. For example, the only FDA approved nonpeptidic direct thrombin inhibitor Argatroban extends to all five hot spots in the binding site and has a molecular weight of just over 500 Da [8]. Although some lower molecular weight thrombin inhibitors also have high affinity, they turned out to have problems, including but not limited to poor selectivity, weak oral bioavailability, poor metabolic stability, innate liver toxicity, rapid elimination from the blood, high-plasma protein binding, and low anticoagulant activity. Therefore, it may be reasonable to consider as many hot spots as possible in drug design, despite the increase in molecular weight. Many protein kinases also have multiple strong hot spots, but bRo5 inhibitors were generally designed to improve selectivity rather than affinity [8]. Interestingly, targets that have simple hot spot structures with less than four hot spots that are too weak to provide conventional druggability also must use larger compounds that can form interactions with surfaces outside the hot spot region to reach acceptable affinity [8]. More recent studies focus on the pharmaceutical properties of novel bRo5 modalities such as peptidomimetics, with particular emphasis on membrane permeability [24].

Identification of protein-protein inhibitor binding sites

Intercellular protein-protein interaction (PPI) interfaces are challenging targets because the cavities available for binding druglike molecules are generally less defined than the pockets of traditional drug target proteins [25]. In addition, ligand binding may depend on the flexibility of the pocket, therefore potential conformational changes must be considered. Fragment based methods have been important for developing PPI inhibitors [26]. Computational fragment screening by FTMap [27] and SILCS [22] was shown to identify binding hot spots amenable to inhibitor binding based on mapping the structures of the interacting proteins. A more complex method involving molecular dynamics simulations and protein docking gave similar results [28]. Frequently, the hot spot residues in protein-protein interfaces, identified by alanine scanning, extend into binding hot spots of the partner protein, thus the two hot spot concepts are related [29]. Many recent studies search for hot spots residues to find targetable sites [30], primarily with application to cancer [31]. Based on the outcome of drug discovery campaigns, it appears that high affinity inhibitors bind to pockets that are at least partially formed in the protein-protein complex [27], and the tractability of such sites can be reliably determined by mapping either unbound or protein-bound structures [27,28]. In many cases the protein interacting with the target can be reduced to a peptide, most frequently an alpha-helix [32], but beta-turn structures also occur [33]. These secondary structures can be then stabilized to form cyclic peptides or peptidomimetic inhibitors [32].

Searching for allosteric sites

In some medically important proteins targeting the main orthosteric site is challenging, because the site may not provide sufficient selectivity and the inhibitor must compete with the endogenous ligands. For example, high affinity active site inhibitors of tyrosine phosphatases would need to emulate the charged nature of the phosphorylated substrate and achieving selectivity may require fairly large compounds due to the similarity of residues directly surrounding the site [34]. For such targets, allosteric drugs may provide a critical advantage due to non-competitive and highly specific regulation [13]. Identification of allosteric sites involves two aspects, first finding an appropriate site, and second showing allosteric communication to the orthosteric site. We restrict consideration to the first aspect as it appears to be critical, and don’t discuss specialized algorithms such as Allosite [35] and AlloFinder [36].

The mapping methods SILCS [21], MixMD [18,37], and FTMap [38,39] were all used to identify allosteric binding sites. SILCS was shown to detect more potential sites than FTMap, but several such sites appear to be false positives [21]. Most applications focused on kinases and GPCRs, two important target families whose allosteric sites have been extensively studied. Although the majority of currently approved kinase inhibitors target the ATP binding site, there is substantial interest in allosteric sites and allosteric drugs [40]. While type II and type III allosteric inhibitors bind at or near the ATP binding site, the literature identifies ten regions that have been reported as regulatory hot spots and are therefore potential target sites for type IV inhibitors. Kinase Atlas, a collection of binding hot spots located at each of the ten allosteric sites was constructed using the FTMap results for all kinase structures in the PDB. Kinase Atlas https://kinase-atlas.bu.edu) displays summarized results including the presence of binding sites and their druggability for all structures of a particular kinase. Additionally, users may view hot spot information for individual kinase structures [38].

Allosteric modulators represent a very important strategy against GPCR targets. Despite the growing number of GPCR structures, only 39 have been co-crystallized with allosteric inhibitors, and thus identification of allosteric sites is important. FTMap has been applied to several GPCRs by the McCammon group [41,42]. More recently the method was shown to successfully identify allosteric sites within the seven-membrane region of GPCRs [39]. However, FTMap is parameterized for analysis of soluble proteins and may fail to identify allosteric sites in receptor-lipid interfaces. A recent probe confined dynamic mapping protocol developed for GPCRs predicts the location of allosteric sites at both intracellular and extracellular regions and within the receptor-lipid interface [43]. The method enhances sampling of probe molecules within a defined region of a GPCR and prevents membrane distortion during molecular dynamics simulations by applying a harmonic wall potential. In addition, the method uses a set of probes derived from structures of GPCR allosteric ligands [43]. Another recent study used exhaustive docking of small molecular probes, considering the different electrostatics of the transmembrane and solvent-exposed parts of the receptors, resulting in the “pocketome” of G protein-coupled receptors [44].

How useful are cryptic sites for drug discovery?

Some proteins have binding sites that are difficult to detect in ligand-free structures and only become apparent after ligand binding [45]. This is frequently the case for allosteric sites. Attempts to find alternative ways to drug challenging targets lead to the development of computational methods for the identification and analysis of such cryptic sites [45,46]. Cimermancic et al. [46] created a benchmark set of 93 ligand-free and ligand-bound pairs of proteins from the PDB with cryptic binding sites which they also used to build a machine learning model (CryptoSite) to predict cryptic sites in the apo structures. The original CryptoSite data set was expanded by Beglov et al. [47] by adding all ligand-free structures in the PDB for each of the 93 proteins. Mapping of apo structures by FTMap revealed that cryptic binding sites are generally located near a strong binding hot spot and that the sites exhibit above-average flexibility [47]. While the FTMap results were in good agreement with those of CryptoSite, both methods account only for the limited flexibility of the proteins. There is no question that more realistic simulations that reveal the multiplicity of potential conformational states help to identify cryptic sites. MixMD was able to identify cryptic and allosteric sites in ligand-free structures of some proteins that were not found by FTMap [19,37].

Markov state models, MSMD simulations, and collective variable enhanced sampling methods were shown to open transitional pockets [48,49]. The problem is that the number of pockets that are open in more than 10% of the simulation time can be very high [50]. However, based on FTMap results, proteins generally have only three different sites with substantial ligand binding capability [47], and hence most of the newly created pockets are too weak for drug discovery. Results supporting this observation were reported by Bowman et al. [51], who identified multiple hidden allosteric sites in TEM-β lactamase using Markov state models. Although small compounds covalently bound at some of these sites were shown to have an allosteric effect [51], non-covalent modulators designed for the site had only moderate impact [52]. In particular, we have observed that pockets that open solely by the movement of some side chains can bind ligands with at most high micromolar affinity [47], most likely because the side chains protruding into the site compete with the ligands for binding. Since the side chains are always at the site, their local concentration is very high, resulting in substantial competition.

An example: Mapping of KRAS

We show that mapping of KRAS structures provides valuable information on the relative tractability of the binding sites. Figure 1a shows the results of mapping a KRAS structure (PDB ID 6MBT [53]), which has no bound ligand apart from ADP and Mg2+ that are removed before the mapping. Three ligands superimposed from bound structures are added in Figure 1b. The strongest consensus site that includes 36 probe clusters binds the GDP molecule. The second strongest consensus site is located close to residue 12, in a pocket that accommodates the covalent G12C inhibitor AMG 510 in the structure 6OIM [54]. The site binds 17 probe clusters suggesting limited druggability and the need for a covalent drug [55]. Finally, the third consensus site is in the extensively targeted shallow polar pocket between switch I and switch II (SI/II pocket) in the KRAS-SOS interface. This consensus site includes only 9 probe clusters, which suggests that the site is too weak to bind druglike molecules with high affinity [55]. In fact, the site binds the small inhibitor, developed by the Fesik group in 2012 with only Kd = 420 μM [4]. This inhibitor was co-crystallized with KRAS, and the resulting structure 4EPW was also mapped by FTMap after removing the ligands. While the strongest consensus site is still at the GDP binding pocket (Figure 1c), the binding of the inhibitor slightly expands the SI/II pocket, which now binds 16 probe clusters. In addition, ligand binding induces a second hot spot with 10 probe clusters in the SI/II pocket (Figure 1c), and the inhibitor binds to both hot spots (Figure 1d). In collaboration with Boehringer Ingelheim, the Fesik group recently developed a larger inhibitor (MW = 512 g/mol) that binds to the SI/II pocket with Kd = 750 nM [5]. Given the limited druggability of the SI/II site [55] this is an extraordinary achievement, and the compound, shown in Figure 1b can be used as a chemical probe. However, based on the mapping results it is unlikely that any small druglike compound binding at the SI/II site can be developed into a drug, emphasizing the importance of the covalent allosteric inhibitors binding at the G12C site [54,56]. Interestingly, the binding of an inhibitor at the SI/II pocket weakens the hot spot at the allosteric site that binds the covalent inhibitor, suggesting bidirectional allosteric communication between the SI/II pocket and the G12C site.

Figure 1.

Figure 1.

Mapping of KRAS. (a) Hot spots on the wild-type KRAS bound to GDP and Mg2+ (PDB ID 6MBT). All ligands are removed prior to mapping. Only probes at cluster centers are shown, represented as lines. The consensus sites define the binding hot spots. The most important consensus site (cyan) includes 36 probe clusters. The second site (blue) and the third site (magenta) bind 17 and 9 probe clusters, respectively. (b) Same as (a) with ligand superimposed from bound structures. The ligands are shown as sticks. The GDP molecule (orange) binds at the strongest consensus site. The covalent inhibitor AMG 510 bound to G12C from PDB ID 6OIM (blue) binds at the second strongest site [54]. The third site (with 9 probe clusters) is in the SI/II pocket and binds both the inhibitor developed by the Fesik group in 2012 (Kd = 420 μM from PDB ID 4EPW, yellow) [4] and the more recent direct inhibitor in the same pocket (Kd = 750 nM from PDB ID 6GJ7, green) [5]. (c) Mapping the KRAS structure co-crystallized with the low affinity inhibitor (PDB ID 4EPW) following the removal of the inhibitor. The most important consensus site (cyan) now includes only 22 probe clusters. The second consensus site (orange) and the third one (magenta) bind 16 and 10 probe clusters, respectively. The fourth site (blue) also has 10 probe clusters. (d) The top site (cyan) still binds the GDP as in (b). However, the SI/II pocket now includes the second and third consensus sites, both interacting with the inhibitor from 4EPW, shown as yellow sticks. The fourth consensus site is located at the binding site of the covalent inhibitor at KRAS G12C.

We also mapped the 282 KRAS structures in the PDB with < 90% sequence identity to 4EPW and determined the consensus clusters formed by all mapping results. Interestingly the large-scale mapping provided the same top sites obtained by mapping only the unbound structure 6MBT and the inhibitor bound structure 4EPW. The strongest consensus site, located at the GDP binding site, on average binds 17.4 ± 6.1 probe clusters. The second consensus site is in the SI/II pocket, with 8.8 ± 6.8 probe clusters, and the third consensus site is at the pocket that binds the G12C inhibitors and includes 8.6 ± 6.1 probe clusters. These results reveal that the SI/II pockets and the G12C site have almost the same strength, but both have limited druggability [55]. Therefore. the ability of AMG 510 to covalently bind to the cystine residue at position 12 in the G12C mutant is crucial. Unfortunately, the weakness of the site implies that extending the approach to other G12 oncogenic mutants will be very challenging.

Acknowledgements.

This investigation was supported by grants R35GM118078, R21GM127952, RM1135136 and R01GM102864 from the National Institute of General Medical Sciences.

Footnotes

Conflict of interest statement

The authors declare no conflict of interest.

References and recommended reading

Papers of particular interest, published within the period of review, have been highlighted as:

* of special interest

** of outstanding interest

  • 1.**. Behan FM, Iorio F, Picco G, Gon calves E, Beaver CM, Migliardi G, Santos R, Rao Y, Sassi F, Pinnelli M, et al. : Prioritization of cancer therapeutic targets using CRISPR-Cas9 screens. Nature 2019, 568:511–516. CRISPR knockout screens were applied to 324 human cancer cell lines representing 30 different cancer types and identified 628 priority targets that were assigned to one of three tractability groups. Group 1 contained genes that had already been targeted by clinically approved or preclinical compounds. Group 2 encompassed genes with some features that supported their tractability. Almost half (311 genes) were placed in Group 3 without strong evidence of tractability.
  • 2.**. Lou K, Gilbert LA, Shokat KM: A bounty of new challenging targets in oncology for chemical discovery. Biochemistry 2019, 58:3328–3330. In this viewpoint note, the authors argue that the vast majority of targets in Groups 2 and 3 defined by Behan et al. (see above) will likely require novel pharmacological approaches to even consider initiating drug discovery efforts.
  • 3.Rush TS 3rd, Grant JA, Mosyak L, Nicholls A: A shape-based 3-D scaffold hopping method and its application to a bacterial protein-protein interaction. J Med Chem 2005, 48:1489–1495. [DOI] [PubMed] [Google Scholar]
  • 4.Sun Q, Burke JP, Phan J, Burns MC, Olejniczak ET, Waterson AG, Lee T, Rossanese OW, Fesik SW: Discovery of small molecules that bind to K-Ras and inhibit SOS-mediated activation. Angew Chem Int Ed 2012, 51:6140–6143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.**. Kessler D, Gmachl M, Mantoulidis A, Martin LJ, Zoephel A, Mayer M, Gollner A, Covini D, Fischer S, Gerstberger T, et al. : Drugging an undruggable pocket on KRAS. Proc Natl Acad Sci U S A 2019, 116:15823–15829. The authors target the very challenging SI/II pocket of KRAS that has been extensively targeted during the last decade, resulting only in high micromolar inhibitors. Using fragment based screens, the recent work leads to a high nanomolar inhibitor binding at the same site. This result demonstrates that the SI/II-pocket is druggable, but due its limited binding affinity the compound can only be used as a chemical probe. Although the authors argue that it can provide a starting point for the design of more potent and selective RAS inhibitors, the mapping of KRAS suggests that the site is very weak, and hence finding higher affinity inhibitors will be a major challenge.
  • 6.Atangcho L, Navaratna T, Thurber GM: Hitting undruggable targets: Viewing stabilized peptide development through the lens of quantitative systems pharmacology. Trends Biochem Sci 2019, 44:241–257. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Doak BC, Kihlberg J: Drug discovery beyond the rule of 5 - Opportunities and challenges. Expert Opin Drug Discov 2017, 12:115–119. [DOI] [PubMed] [Google Scholar]
  • 8.**. Egbert M, Whitty A, Keseru GM, Vajda S: Why some targets benefit from beyond rule of five drugs. J Med Chem 2019, 62:10005–10025. The authors developed a classification method to predict which protein targets may benefit from beyond Rule of 5 compounds. By analyzing the FTMap results of unliganded protein structures, they were able to identify patterns in hot spot structures that correlate to the druggability of a site with a conventionally druglike molecule. Notably, target sites with “complex” hot spot structures, i.e. four or more hots spots including at least one strong one, and targets with “simple” hot spot structures, i.e., three or fewer weak hot spots, are good candidates for bRo5 compounds.
  • 9.Begnini F, Poongavanam V, Over B, Castaldo M, Geschwindner S, Johansson P, Tyagi M, Tyrchan C, Wissler L, Sjo P, et al. : Mining natural products for macrocycles to drug difficult targets. J Med Chem 2021, 64:1054–1072. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Viarengo-Baker LA, Brown LE, Rzepiela AA, Whitty A: Defining and navigating macrocycle chemical space. Chem Sci 2021, 12:4309–4328. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Webster AM, Cobb SL: Recent advances in the synthesis of peptoid macrocycles. Chemistry 2018, 24:7560–7573. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Ali AM, Atmaj J, Van Oosterwijk N, Groves MR, Domling A: Stapled peptides inhibitors: A new window for target drug discovery. Comput Struct Biotechnol J 2019, 17:263–281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Guarnera E, Berezovsky IN: Allosteric drugs and mutations: chances, challenges, and necessity. Curr Opin Struct Biol 2020, 62:149–157. [DOI] [PubMed] [Google Scholar]
  • 14.Moore AR, Rosenberg SC, McCormick F, Malek S: RAS-targeted therapies: is the undruggable drugged? Nat Rev Drug Discov 2020, 19:533–552. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Mattos C, Ringe D: Locating and characterizing binding sites on proteins. Nat Biotech 1996, 14:595–599. [DOI] [PubMed] [Google Scholar]
  • 16.Hajduk PJ, Huth JR, Fesik SW: Druggability indices for protein targets derived from NMR-based screening data. J Med Chem 2005, 48:2518–2525. [DOI] [PubMed] [Google Scholar]
  • 17.Kozakov D, Grove LE, Hall DR, Bohnuud T, Mottarella SE, Luo L, Xia B, Beglov D, Vajda S: The FTMap family of web servers for determining and characterizing ligand-binding hot spots of proteins. Nat Protoc 2015, 10:733–755. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.*. Chan WKB, DasGupta D, Carlson HA, Traynor JR: Mixed-solvent molecular dynamics simulation-based discovery of a putative allosteric site on regulator of G protein signaling 4. J Comput Chem 2021, 42:2170–2180. Regulator of G protein signaling 4 (RGS4) is an intracellular protein that aids in terminating G potein coupled receptor signaling and hence is an important drug target. Current inhibitors of RGS4 bind covalently to cysteine residues on the protein. MixMD simulations were used to identify alternative druggable sites. Pseudo-ligands were placed in consensus hot spots, and perturbation with normal mode analysis led to the identification and characterization of a putative allosteric site. However, the strength of the site remains questionable and requires further investigation.
  • 19.*. Smith RD, Carlson HA: Identification of cryptic binding sites using MixMD with standard and accelerated molecular dynamics. J Chem Inf Model 2021, 61:1287–1299. MixMD was used to identify known cryptic sites of 12 proteins characterized by a wide range of conformational changes, including reorganization of side chains, loop movements, or movement of more significant structures such as whole helices. In five cases standard MixMD simulations were able to map the cryptic binding sites with at least one probe type, and in some cases, there was a need for accelerated MD, which increases sampling of torsional angles. For more complex systems where the movement of a helix or domain is necessary, MixMD was unable to map the binding site even with accelerated dynamics. It was also observed that the probes do not induce the opening of a site but instead, the probes utilize spontaneously opening sites, as part of their inherent conformational behavior.
  • 20.Yu W, Lakkaraju SK, Raman EP, Mackerell AD Jr.: Site-Identification by Ligand Competitive Saturation (SILCS) assisted pharmacophore modeling. J Comput Aided Mol Des 2014, 10.1007/s10822-014-9728-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.*. MacKerell AD Jr., Jo S, Lakkaraju SK, Lind C, Yu W: Identification and characterization of fragment binding sites for allosteric ligand design using the site identification by ligand competitive saturation hotspots approach (SILCS-Hotspots). Biochim Biophys Acta Gen Subj 2020, 1864:129519. The paper employs the SILCS-Hotspots approach, which consists of the mixed solvent MD method SILCS followed by spatial clustering to identify the hot spots. It results in a large number of predicted binding sites and does not accurately account for sites that are inaccessible due to being located within the protein interior or due to low binding affinities. The detected sites recapitulate the known locations of drug-like molecules in both allosteric and orthosteric binding sites. This approach identifies a larger number of known binding sites of drug-like molecules than the commonly used FTMap and Fpocket methods. It appears that the known binding sites are not highly ranked, and the method generates many sites that are likely false positives.
  • 22.Yu W, Jo S, Lakkaraju SK, Weber DJ, MacKerell AD Jr.: Exploring protein-protein interactions using the site-identification by ligand competitive saturation methodology. Proteins 2019, 87:289–301. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Doak BC, Zheng J, Dobritzsch D, Kihlberg J: How beyond rule of 5 drugs and clinical \ candidates bind to their targets. J Med Chem 2016, 59:2312–2327. [DOI] [PubMed] [Google Scholar]
  • 24.Barlow N, Chalmers DK, Williams-Noonan BJ, Thompson PE, Norton RS: Improving membrane permeation in the beyond rule-of-five space by using prodrugs to mask hydrogen bond donors. ACS Chem Biol 2020, 15:2070–2078. [DOI] [PubMed] [Google Scholar]
  • 25.Shin WH, Kumazawa K, Imai K, Hirokawa T, Kihara D: Current challenges and opportunities in designing protein-protein interaction targeted drugs. Adv Appl Bioinform Chem 2020, 13:11–25. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Valenti D, Hristeva S, Tzalis D, Ottmann C: Clinical candidates modulating protein-protein interactions: The fragment-based experience. Eur J Med Chem 2019, 167:76–95. [DOI] [PubMed] [Google Scholar]
  • 27.Kozakov D, Hall DR, Chuang GY, Cencic R, Brenke R, Grove LE, Beglov D, Pelletier J, Whitty A, Vajda S: Structural conservation of druggable hot spots in protein-protein interfaces. Proc Natl Acad Sci U S A 2011, 108:13528–13533. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.*. Rosell M, Fernandez-Recio J: Docking-based identification of small-molecule binding sites at protein-protein interfaces. Comput Struct Biotechnol J 2020, 18:3750–3761. The authors have developed a multi-step protocol for the identification of ligand binding sites in protein-protein interfaces. It starts with short MD simulations of the target protein to open transient pockets. The pockets are identified and ranked using the Fpocket program. This is followed by docking the two proteins to select the pockets that are most likely in the interface. Finally, the putative pockets are confirmed by docking the native ligands. The protocol was applied to the few proteins that have known protein-protein inhibitors and were previously mapped by both FTMap and MSMD methods. It appears that the results are fairly similar.
  • 29.Zerbe BS, Hall DR, Vajda S, Whitty A, Kozakov D: Relationship between hot spot residues and ligand binding hot spots in protein-protein interfaces. J Chem Inf Mod 2012, 52:2236–2244. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Ibarra AA, Bartlett GJ, Hegedus Z, Dutt S, Hobor F, Horner KA, Hetherington K, Spence K, Nelson A, Edwards TA, et al. : Predicting and experimentally validating hot-spot residues at protein-protein interfaces. ACS Chem Biol 2019, 14:2252–2263. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Ozdemir ES, Halakou F, Nussinov R, Gursoy A, Keskin O: Methods for discovering and Targeting Druggable Protein-Protein Interfaces and Their Application to Repurposing. Methods Mol Biol 2019, 1903:1–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.*. Wang H, Dawber RS, Zhang P, Walko M, Wilson AJ, Wang X: Peptide-based inhibitors of protein-protein interactions: biophysical, structural and cellular consequences of introducing a constraint. Chem Sci 2021, 12:5977–5993. The paper describes applications of constrained peptide PPI inhibitors, typically mediated by α-helices. The strength of the method is that in many cases a helical segment of the partner protein already binds to the target. The resulting approach offers significant advantages including enhanced stability, affinity, and cellular penetration. Selecting a staple to make productive interactions with target protein and shielding the hydrophilic amides of the peptide from the hydrophobic membrane is the most critical step in the design.
  • 33.*. Zhong M, Lynch A, Muellers SN, Jehle S, Luo L, Hall DR, Iwase R, Carolan JP, Egbert M, Wakefield A, et al. : Interaction energetics and druggability of the protein-protein interaction between Kelch-like ECH-associated protein 1 (KEAP1) and nuclear factor erythroid 2 like 2 (Nrf2). Biochemistry 2020, 59:563–581. The authors combine mapping with FTMap, fragment screening, and alanine-scanning mutagenesis to evaluate the energetics and druggability of the highly charged protein-protein interaction interface between KEAP1 and Nrf2. FTMap identifies four binding energy hot spots within the active site. However, only two of these hot spots are exploited by Nrf2, which binds to KEAP1 primarily via a loop forming a β-turn conformation. KEAP1-bound X-ray structures are obtained for three fragments. One strength of the paper is discussing the complementary information provided by computational hot spot mapping, fragment screening, and alanine-scanning mutagenesis.
  • 34.Lazo JS, McQueeney KE, Sharlow ER: New approaches to difficult drug targets: The phosphatase story. SLAS Discov 2017, 22:1071–1083. [DOI] [PubMed] [Google Scholar]
  • 35.Huang W, Lu S, Huang Z, Liu X, Mou L, Luo Y, Zhao Y, Liu Y, Chen Z, Hou T, et al. : Allosite: a method for predicting allosteric sites. Bioinformatics 2013, 29:2357–2359. [DOI] [PubMed] [Google Scholar]
  • 36.Huang M, Song K, Liu X, Lu S, Shen Q, Wang R, Gao J, Hong Y, Li Q, Ni D, et al. : AlloFinder: a strategy for allosteric modulator discovery and allosterome analyses. Nucleic Acids Res 2018, 46:W451–W458. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Ghanakota P, Carlson HA: Moving beyond active-site detection: MixMD applied to allosteric systems. J Phys Chem B 2016, 120:8685–8695. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.**. Yueh C, Rettenmaier J, Xia B, Hall DR, Alekseenko A, Porter KA, Barkovich K, Keseru G, Whitty A, Wells JA, et al. : Kinase Atlas: Druggability Analysis of potential allosteric sites in kinases. J Med Chem 2019, 62:6512–6524. The Kinase Atlas is intended to provide information on whether a kinase could potentially be targeted by a type II, III, or IV allosteric inhibitor. To determine potential regions for non-type I inhibitors to bind, mapping by FTMap explores ten sites on the kinase catalytic domain that have been described in the literature as allosteric. Kinase Atlas is a systematic collection of the binding hot spots, categorized by allosteric site, from 376 distinct kinases represented by 4910 structures from the Protein Data Bank. Site druggability is indicated by the presence of a strong binding hot spot in one of the 10 potential allosteric sites. Kinase atlas users can access mapping results for individual structures or a summary of mapping results for kinases with multiple structures.
  • 39.*. Wakefield AE, Mason JS, Vajda S, Keseru GM: Analysis of tractable allosteric sites in G protein-coupled receptors. Sci Rep 2019, 9:6180. GPCRs that have X-ray structures co-crystallized with allosteric inhibitors are mapped using FTMap. The mapping identifies over 80% of experimentally confirmed allosteric sites within the three strongest sites found. The method was also able to find partially hidden allosteric sites that were not fully formed in X-ray structures crystallized in the absence of allosteric ligands. These results confirm that some of the strongest ligand binding sites in GPCRs are the intrahelical sites that are capable of binding druglike allosteric modulators. However, due to its parameterization for the analysis of soluble proteins, FTMap may not detect allosteric sites in the receptor-lipid interface.
  • 40.Lu X, Smaill JB, Ding K: New promise and opportunities for allosteric kinase inhibitors. Angew Chem Int Ed Engl 2020, 59:13764–13776. [DOI] [PubMed] [Google Scholar]
  • 41.Miao Y, Nichols SE, McCammon JA: Mapping of allosteric druggable sites in activation-associated conformers of the M2 muscarinic receptor. Chem Biol Drug Des 2014, 83:237–246. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Caliman AD, Miao Y, McCammon JA: Mapping the allosteric sites of the A2A adenosine receptor. Chem Biol Drug Des 2018, 91:5–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.**. Ciancetta A, Gill AK, Ding T, Karlov DS, Chalhoub G, McCormick PJ, Tikhonova IG: Probe confined dynamic mapping for G protein-coupled receptor allosteric site prediction. ACS Cent Sci 2021, 7:1847–1862. The paper describes a novel MSMD protocol specifically developed to identify allosteric sites of GPCRs. The method applies a harmonic wall potential enhanced sampling of probe molecules in a selected area of a GPCR while preventing membrane distortion in molecular dynamics simulations. In addition, it uses specific probes derived from GPCR allosteric ligand structures. The resulting method allows the prediction of allosteric sites at both the GPCR intracellular and extracellular sides, as well as at the protein–lipid interface. The method was validated retrospectively on GPCRs with known allosteric sites, and prospectively to locate the binding site of a specific ligand on the D2 dopamine receptor. The prediction was confirmed by mutagenesis.
  • 44.*. Kolb P, Hedderich J, Persechino M, Becker K, Heydenreich F, Gutermuth T, Bouvier M, Bünemann M: The pocketome of G protein-coupled receptors reveals previously untargeted allosteric sites. Res Square, 2021. The binding pockets of 557 GPCR structures were mapped by exhaustively docking small molecular probes in silico and converting the ensemble of binding locations to pocket-defining volumes. The analysis confirmed all previously identified pockets and revealed nine novel untargeted sites. The latter sites were validated on two proteins by mutagenesis. In addition, the correlation of inter-residue contacts with the activation states of receptors was examined, and in these sites, contact patterns closely correlated with activation.
  • 45.Vajda S, Beglov D, Wakefield AE, Egbert M, Whitty A: Cryptic binding sites on proteins: definition, detection, and druggability. Curr Opin Chem Biol 2018, 44:1–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Cimermancic P, Weinkam P, Rettenmaier TJ, Bichmann L, Keedy DA, Woldeyes RA, Schneidman-Duhovny D, Demerdash ON, Mitchell JC, Wells JA, et al. : CryptoSite: Expanding the druggable proteome by characterization and prediction of cryptic binding sites. J Mol Biol 2016, 428:709–719. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Beglov D, Hall DR, Wakefield AE, Luo L, Allen KN, Kozakov D, Whitty A, Vajda S: Exploring the structural origins of cryptic sites on proteins. Proc Natl Acad Sci U S A 2018, 115:E3416–Ε3425. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.*. Kuzmanic A, Bowman GR, Juarez-Jimenez J, Michel J, Gervasio FL: Investigating cryptic binding sites by molecular dynamics simulations. Acc Chem Res 2020, 53:654–661. This is a review of recent contributions of mixed-solvent simulations, metadynamics, Markov state models, and other enhanced sampling methods to the field of cryptic site identification and characterization. These methods are shown to provide information on the nature of the site opening mechanisms, to predict previously unknown sites which were used to design new ligands, and to compute the free energy landscapes and kinetics associated with the opening of sites and subsequent binding of ligands. The impact of the predictions with regards to drug discovery, and particularly towards difficult targets, is discussed. The paper concludes by mentioning the uncertainty of leveraging the predicted cryptic sites for the discovery of small molecule drugs.
  • 49.Oleinikovas V, Saladino G, Cossins BP, Gervasio FL: Understanding Cryptic pocket formation in protein targets by enhanced sampling simulations. J Am Chem Soc 2016, 138:14257–14263. [DOI] [PubMed] [Google Scholar]
  • 50.Bowman GR, Geissler PL: Equilibrium fluctuations of a single folded protein reveal a multitude of potential cryptic allosteric sites. Proc Natl Acad Sci U S A 2012, 109:11681–11686. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Bowman GR, Bolin ER, Hart KM, Maguire BC, Marqusee S: Discovery of multiple hidden allosteric sites by combining Markov state models and experiments. Proc Natl Acad Sci U S A 2015, 112:2734–2739. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Hart KM, Moeder KE, Ho CMW, Zimmerman MI, Frederick TE, Bowman GR: Designing small molecules to target cryptic pockets yields both positive and negative allosteric modulators. PLoS One 2017, 12:e0178678. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Dharmaiah S, Tran TH, Messing S, Agamasu C, Gillette WK, Yan W, Waybright T, Alexander P, Esposito D, Nissley DV, et al. : Structures of N-terminally processed KRAS provide insight into the role of N-acetylation. Sci Rep 2019, 9:10512. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.* *. Canon J, Rex K, Saiki AY, Mohr C, Cooke K, Bagal D, Gaida K, Holt T, Knutson CG, Koppada N, et al. : The clinical KRAS(G12C) inhibitor AMG 510 drives anti-tumour immunity. Nature 2019, 575:217–223. Covalent inhibitors binding to the cysteine residue in the KRAS(G12C) have promising preclinical activity and thus provide the long-awaited success in targeting KRAS. The paper describes optimizing a series of inhibitors using novel binding interactions to markedly enhance their potency and selectivity, resulting in AMG 510, which is the first KRAS(G12C) inhibitor in clinical development. In preclinical analyses, treatment with AMG 510 improved the anti-tumor efficacy of chemotherapy and targeted agents and led to the regression of KRASG12C tumors.
  • 55.Kozakov D, Hall DR, Napoleon RL, Yueh C, Whitty A, Vajda S: New frontiers in druggability. J Med Chem 2015, 58:9063–9088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Ostrem JM, Peters U, Sos ML, Wells JA, Shokat KM: K-Ras(G12C) inhibitors allosterically control GTP affinity and effector interactions. Nature 2013, 503:548–551. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES