Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2020 Aug 28.
Published in final edited form as: Methods Mol Biol. 2012;819:59–74. doi: 10.1007/978-1-61779-465-0_5

Expanding the Conformational Selection Paradigm in Protein-Ligand Docking

Guray Kuzu 1, Ozlem Keskin 1, Attila Gursoy 1, Ruth Nussinov 2,3
PMCID: PMC7455014  NIHMSID: NIHMS1613104  PMID: 22183530

Abstract

Conformational selection emerges as a theme in macromolecular interactions. Data validate it as a prevailing mechanism in protein-protein, protein-DNA, protein-RNA, and protein-small molecule drug recognition. This raises the question of whether this fundamental biomolecular binding mechanism can be used to improve drug docking and discovery. Actually, in practice this has already been taking place for some years in increasing numbers. Essentially, it argues for using not a single conformer, but an ensemble. The paradigm of conformational selection holds that because the ensemble is heterogeneous, within it there will be states whose conformation matches that of the ligand. Even if the population of this state is low, since it is favorable for binding the ligand, it will bind to it with a subsequent population shift toward this conformer. Here we suggest expanding it by first modeling all protein interactions in the cell by using Prism, an efficient motif-based protein-protein interaction modeling strategy, followed by ensemble generation. Such a strategy could be particularly useful for signaling proteins, which are major targets in drug discovery and bind multiple partners through a shared binding site, each with some—minor or major—conformational change.

Keywords: Protein-ligand interaction, Hotspots, Drug discovery, Conformational ensemble, Protein interaction prediction, Protein interface, Prism

1. Introduction

Proteins are involved in all molecular processes in living cells including metabolic, signaling, catalysis, viral entry, and regulation; cellular dysfunction due to inhibition, or to nonnative interactions of proteins can cause diseases (1, 2). Understanding the molecular and cellular activities in vivo and controlling their functions in disease requires analyzing the proteins, investigating their interactions, and elucidating their functions. Identifying protein interactions is important not only to understand how cells work, but also to elucidate disease mechanisms, discover effective drugs and figure out their effects on the entire cellular network (3, 4) to forecast side effects. Several experimental techniques (5), such as the yeast two-hybrid system (6), phage display (7), protein arrays (8), and affinity purification (9), generate massive amounts of protein interaction data. Yet, despite these, the complex nature of protein interactions is not entirely understood (10). As more data become available, computational methods which are able to analyze the large datasets are becoming increasingly important to make sense of experimental observations and use them to predict additional interactions, functional mechanisms, and protein and drug design.

Computational structural biology aims to introduce and apply effective methods that predict not only which proteins interact but also how they interact. Predictions of protein interaction can be carried out using docking or knowledge-based approaches. Although docking approaches are broadly used and are effective strategies, they cannot be applied on proteomic scales. The computation times are prohibitively long, and in particular, for reliable docking, additional biochemical data such as mutational information about protein interactions should be provided; in their absence, the number of false positive solutions can be astronomical and it is very difficult to distinguish between native and nonnative predictions (11). Knowledge-based approaches are faster compared to blind docking methods. Because they decrease the solution space by limiting possible orientations, the number of potential interactions is smaller which also leads to relatively shorter timescales. This enables knowledge-based methods to cope with large sets of data. In knowledge-based approaches, templates derived from known interacting proteins can be sequence-based (1214), domain-based (15) or interface-based (16, 17). It has been widely accepted that the structure of the protein is evolutionarily more conserved than the sequence (18). Thus, in principle, prediction algorithms which are purely structure-based, where the methodology is completely independent from any sequence homology, can work; and this holds even in the absence of any sequence similarity. This is all the more so for protein interfaces, which are often more conserved than the overall structure (19). Analysis of the interfaces has shown that even if the global structures and functions differ, proteins can bind through similar interface architectures (20, 21). A structurally nonredundant dataset of protein-protein interfaces can be clustered into three types of groups according to the interface and global structures of the interacting protein pairs (see Fig. 1) (20, 22, 23): in Type I the interacting proteins have similar global structures and functions. This is the most common and expected type. In Type II cluster members have similar interfaces; however, the global structures and functions are different. This type contains examples that validate the paradigm that interface motifs can be conserved even in the absence of global structural similarity (24, 25). In Type III, only one side of the interface is similar and the surfaces of the complementary partners are somewhat different. Hub proteins are mostly clustered into this type; therefore, members of this cluster may help in the characterization of hub proteins and shared binding sites (23). From an energetic point of view, a subset of interface residues can act as “hot spots” (26). These residues contribute more to the binding free energy of complexes; that is, they play a more significant role in the affinity and stability of the interaction. There is a strong correlation between hot spots and conserved residues on structurally similar interfaces (27), which points to the importance of hot spots in determining binding sites. Since hot spots contribute most of the binding energy in the interaction, discovery of molecules that bind to hot spots (1, 28), which can be small molecule drugs (2, 29, 30) or inhibitory peptides (3133), has gained importance in drug design.

Fig. 1.

Fig. 1.

Examples of Type I, II, and III interfaces. The interfaces are highlighted with boxes. (a) Members of Type I proteins use similar interfaces to bind each other. The two glutathione S-transferase complexes are homologous (PDB identifiers: 10gs and 1b48). (b) Members of Type I proteins are not related evolutionarily, but the interface structures are similar. The two complexes, cytochrome C and neuropeptide/membrane protein are examples of this type (PDB identifiers: 1bbh and 1rso). (c) In Type III, only one side of the interface has similar architectures, the complementary sides are different (dynein light chain 8, PDB identifier: 1f95AB; 4-oxalocrotonate tautomerase, PDB identifier: 1 otfAE).

Understanding the mechanism of binding is expected to help drug discovery, since it can lead to more effective methodologies. Over the years, Koshland’s “induced fit” scenario (34) has been widely accepted as the binding mechanism. According to the induced fit, binding of a protein to a ligand leads to a conformational change in the protein which is “induced” by the ligand and culminates in a favorable, tight fit. More recently, an alternative mechanism has been proposed, the so-called “conformational selection and population shift” (3539). This proposition has been based on concepts derived from the free energy landscape (40). It argued that since proteins exist in solution in broad ensembles, among the conformational states present in the ensemble there should be some with binding sites matching the shape (and chemistry) of the ligand. While the energy of these states can be high, and thus they may be only sparsely populated, the binding will stabilize them, with a subsequent “population shift” toward these conformers, which maintains the chemical equilibrium. Recently, considerable experimental and computational data have accumulated (4143) validating the conformational selection and population shift scenario for a broad range of binding events, and it has further been proposed to apply to drug discovery (44). Currently, conformational selection is believed to be the prevailing mechanism, with induced fit dominating in cases where the concentration of the ligand is extremely high (45). Of note, the timescales of induced fit are faster than those of conformational selection and population shift; this is because a shift in the population necessitates climbing barriers, and thus the times depend on the barrier heights. Following binding, there is an induced fit on a minor, local scale to optimize the interactions. The question arises in which way such a mechanistic scenario can help in drug discovery strategies. A reasonable way would be to generate an ensemble of states, and dock these separately to the small molecule drug. However this is an immensely complex task, since it critically depends on the sampling. Since high energy states also need to be considered, the sampling should not be confined to low energy conformations. Drug discovery is usually aimed at enzyme active sites; however, increasingly it also targets disruption or modulation of protein–protein binding sites. While enzyme active sites are known, this is not the case for the protein binding sites, where as we discussed above, data are available only for a (relatively small) fraction of the interactions. For these cases, combining prediction of protein-protein interactions and their binding sites as a first step, coupled with ensemble docking could be a strategy to consider.

Toward such a strategy, here we present a template-based protein-protein interaction algorithm, Prism (Protein Interactions by Structural Matching) (46, 47, 78) integrated with FiberDock (Flexible Induced-fit Backbone Refinement in Molecular Docking) (48). The Prism algorithm reveals possible interactions among a group of protein structures based on known protein–protein interfaces. Due to the existence of a limited number of distinct binding motifs in nature (49), similar interface architectures are shared among functionally and structurally different proteins (20). The method, which is independent of sequence data, utilizes structural and evolutionary similarity of a target protein with partners of an already known interaction to predict an interaction between two protein molecules. Although the structural similarity is detected via geometrical alignment of structures, evolutionary similarity is approximated by the conservation of hot spots. Besides the efficiency in prediction of protein interactions on the proteome scale, the prediction algorithm can be used to construct and analyze specific networks, such as the human cancer protein-protein interaction network (50), or to discover shared binding sites in hub proteins (51). Furthermore, increasing interest in targeting protein-protein interactions (52, 53), especially hot spots in interfaces (54), for drug discovery makes such a strategy particularly promising. Combining Prism with FiberDock is a powerful alternative to guide pharmacological research considering its ability to detect a potential interaction between a drug and its target protein or of a target protein with another protein in the network. Moreover, because the interacting residues can be sequentially discontinuous (see Fig. 2), an algorithm such as Prism which focuses on interfaces and is independent of the order of the residues on the chain is advantageous.

Fig. 2.

Fig. 2.

A two-chain interface (a) An example of a two-chain interface (PDB identifier: 1fq3; chains A and B). Black residues represent contacting residues which interact across the interface. Residues in their spatial vicinity (called nearby residues) are in whitish gray. The remaining residues in the chains A and B are shown in gray. (b) The interface consists of bits and pieces of each of the chains, and some isolated residues. The chain A side of the interface consists of five contacting and 24 nearby residues. There are nine contacting and 17 nearby residues in the chain B interface.

2. Materials and Methods

Prism attempts to predict protein-protein interactions based on structural similarity of the proteins to the complementary sides of a known interface. If it is known that there is an interaction between proteins A and B, and protein A′ is structurally similar to protein A and protein B′ is structurally similar to protein B, it is claimed that A′ and B′ may interact with each other (46). Prism considers a potential binary interaction by querying whether target interfaces structurally and evolutionarily complement each other in a way similar to template interfaces. Then, by using FiberDock, flexible refinement of docking solution candidates is performed by optimizing the side chain orientations. Binding energy is also calculated for the refined structures. To carry out such a protocol, the first step involves the availability of target structures and generation of template datasets. A flowchart summarizing the prediction algorithm is given in Fig. 3.

Fig. 3.

Fig. 3.

Flowchart summary of the prediction algorithm of Prism together with FiberDock.

2.1. Template Dataset

All interfaces of two chain protein complexes available in the Protein Data Bank (55) were extracted. Interfaces consist of interacting residues between two chains and neighboring residues. Neighboring residues are in the spatial vicinity of interacting residues and constitute the scaffold of the interface. Two residues from two different chains are considered as interacting if they are at a distance smaller than the sum of van der Waals radii plus a threshold of 0.5 Á. In addition, a noninteracting residue whose Cα is closer than 6.0 Á to the Cα of any interacting residue is marked as a neighboring residue. In order to obtain a nonredundant dataset, 49,512 two-chain interfaces (as of February 2006) extracted in the first step were clustered structurally following an iterative all-against-all structural comparison in a sequence order-independent way (20, 51). 8,205 clusters were obtained. Interface members of each cluster are structurally similar to the representative interface. A cluster should contain at least five nonhomologous sequences.

The template interface can be constructed in several ways: one can use (1) all representatives (8,205) of the interfaces, or (2) a subset of the representatives, for example, the heterodimeric protein interfaces (1,036), or the nonobligate protein interfaces (158 interfaces) (see Note 1). The type of reduction of the template set is determined with respect to characteristics of the query molecules. Computational hot spots are found by using the HotPoint web server (56) (see Note 2). Prism then searches for a potential interaction by comparing the surfaces of target proteins to the partners of known template interfaces while accounting for evolutionary conservation.

2.2. Target Dataset

Proteins in a target dataset are searched for a potential interaction (see Note 3). The data of query proteins are extracted from the PDB. Multimeric proteins are split into their monomers, and homologous chains are counted only once (see Note 4). The surfaces of the molecules are extracted by using the NACCESS program (described in Subheading 2.3).

2.3. Prediction of Protein-Protein Interaction

Prism suggests a possible interaction between two target proteins A′ and B′, if protein A′ shares structural similarity with one side of template interface I, which is extracted from a known interaction between protein A and protein B, and protein B′ is structurally similar to the other side of the interface I.

The surfaces of target proteins are extracted using the NACCESS program (57) (see Note 5). NACCESS calculates the relative surface accessibilities (RSAs) of residues, which are the percent accessibility with respect to the accessibility of the residue type X in an extended ALA-X-ALA tripeptide (58). Residues whose RSA values are greater than 15% are considered as surface residues. “Nearby” residues are then added to the surface shell as described above, but the threshold value is chosen as 5.0 Á. Structural similarity between target and template interfaces is assessed using MultiProt (59, 60). MultiProt aligns the target surface with each complementary partner of the representative template interfaces and determines the common geometrical cores between structures. MultiProt’s output is the ten best alignments for substructural matching of a target protein surface with a template interface. Target surfaces should geometrically match with 50% of the residues of the template chains if the template chains contain at most 50 residues. This matching threshold is 30% for the larger template chains. In addition, at least one conserved hot spot should be correctly matched between the template interface and the target surface (see Note 6). Moreover, at least five pairs of matched residues from each side of the template interface should be against each other in order to guarantee the correct matching for the left and right partners (see Note 7).

Target proteins which pass the alignment process and match with the partners of the same template interface are next checked if it is physically possible for them to constitute a complex. If the Cα atom of a residue from one partner is at a distance shorter than 3 Á to the Cα atom of a residue from the complementary partner, those two residues are considered as clashing. A threshold of five clashes makes the interaction physically impossible.

Finally, FiberDock (48) is used for flexible refinement of the predicted complexes and for calculation of the energy of the interaction. Steric clashes of side chains due to their orientations are solved via conformational adjustment of the side chains and the binding energy of the final transformed structures is calculated (see Note 8). FiberDock ranks the docked solutions by the calculated energies. Hence, FiberDock checks if a potential interaction estimated by Prism is favorable in terms of global energy.

4. A Drug Target: Insulin Receptor

Mutations in protein kinases contribute to diseases or pathophysiological states, including cancer, autoimmune disorders, cardiac diseases, and inflammatory conditions (61). Therefore, recent effort increasingly focuses on inhibitor and small molecule drug design to modulate these enzymes. The insulin receptor (IR) is a member of the tyrosine kinase receptors. In addition to diabetes, it appears to be related to Alzheimer’s disease and cancer (6266).

IR exists on the surfaces of cells and interacts with insulin, the hormone having a significant role in regulating the energy and glucose metabolism in the body. Insulin receptor substrate 2 (IRS2) is one of the substrates of IR. The conformational change of the insulin receptor tyrosine kinase domain (IR_TKD) through binding to IRS2 is shown in Fig. 4a. In the figure, the molecular structure of free IR_TKD is transparent black (PDB identifier: 1 irk); black and the whitish gray molecules represent the IR_TKD complex with IRS2 (PDB identifier: 3bu3; black: IR_TKD, chain A; whitish gray: IRS2, chain B). IR is inhibited by growth factor receptor-bound protein 14 (Grbl4) and the molecular structure is given in Fig. 4b (PDB identifier: 2auh; black: IR_TKD, chain A; whitish gray: Grbl4, chain B). The beads shown in Fig. 4a, b represent the computational hot spots of IR_TKD extracted by using the HotPoint web server (56). Dark gray beads (Leu1171, Val1173 and Gln1208) are the common hot spots of IR_TKD in receptor/substrate (3bu3) and receptor/inhibitor complexes (2auh). Although the interacting partners are different molecules and a different conformational change is observed following the binding, IR_TKD interacts through the same hot spot residues. Interaction of both the inhibitor and the substrate through the same hot spots indicates the importance of targeting hot spots in drug discovery (53, 67). Several studies have focused on the discovery of small molecules that bind with drug-like potencies to hot spots at the interface (6870).

Fig. 4.

Fig. 4.

Insulin receptor tyrosine kinase domain complex with its substrate and its inhibitor (a) Molecular structure of free insulin receptor tyrosine kinase domain (IR_TKD, black in transparent, PDB identifier: 1 irk) and its complex with insulin receptor substrate 2 (IR_TKD/IRS2, black/whitish gray, PDB identifier: 3bu3). Computational hotspots on IR_TKD are shown with ball representation (light gray and dark gray). Dark gray balls are common hotspots of IR_TKD in IR_TKD/IRS2 and IR_TKD/Grb14 (b). (b) Molecular structure of insulin receptor tyrosine kinase domain complex with growth factor receptor-bound protein 14 (IR_TKD/Grb14, black/whitish gray, PDB identifier: 2auh). Computational hotspots on IR_TKD are shown with ball representation (light gray and dark gray). Dark gray balls are common hotspots of IR_TKD in IR_TKD/Grb14 and IR_TKD/IRS2 (a).

5. Discussion and Conclusions

Conformational selection and population shift is currently the accepted paradigm for molecular recognition. The question arises how to use it to improve experimental and computational strategies. Here our focus is on docking. A knowledge-based docking approach such as Prism, which follows a rationale that if a binding site motif is similar between two proteins it is likely to interact with a common motif of a partner protein, implicitly follows the conformational selection concept. As such, it can also be used toward small molecule ligand and peptide docking. As targets, above we focused on protein-protein interfaces. Our approach considers two steps: in the first the pathways are modeled to obtain their protein-protein interfaces. This is because the PDB contains only a small fraction of the interactions. In the second, ensembles would be generated, and candidate drugs would be docked to representatives of the ensemble clusters. Signaling proteins are particularly good targets: they are at the crossroads of pathways and their binding sites can be shared by a large number of partners (54). Drug binding will elicit allosteric effects which not only will change the conformations of their protein-protein binding sites elsewhere, but will also propagate in the pathway.

Ensemble docking has been a strategy long in use, even if for different consideration—to overcome the technical difficulties in flexible docking. A quick literature search produces hundreds of papers devoted to the subject; among these is the work by Lorber and Shoichet (71) which to our knowledge is the first. Conformational selection has also been used directly in docking (72, 73). However, it is difficult to apply this concept on a comprehensive scale. Docking of a large ensemble is currently prohibitive, because of the timescales. Nonetheless, rapid sampling methods (74) perhaps coupled with semiatomistic approaches (75) or effective filters (76) or other useful strategies (77), hopefully will eventually help in this endeavor which mimics real life mechanisms.

Acknowledgments

This work has been supported by TUBITAK (Research Grant Numbers: 109T343 and 109E207). Guray Kuzu is supported by a TUBITAK fellowship. This project has been funded in whole or in part with Federal funds from the National Cancer Institute, National Institutes of Health, under contract number HHSN261200800001E. The content of this publication does not necessarily reflect the views or policies of the Department of Health and Human Services, nor does mention of trade names, commercial products, or organizations imply endorsement by the U.S. Government. This research was supported (in part) by the Intramural Research Program of the NIH, National Cancer Institute, Center for Cancer Research.

3. Notes

1.

The algorithm strictly depends on the template set. If there is no similar motif in the template set, the algorithm cannot find any similarity between the target protein and template structures; thus a potential interaction for target proteins cannot be predicted. Therefore, choosing the right template set for the target proteins is very important. User can also use his own template set, but the data relating to the structures in the template set should be added in PDB format. Although it may seem as a disadvantage that outcome is a function of the template set, the algorithm finds reliable results in a short computation time if such motifs are available in the template set.

2.

The HotPoint web server is used to find computational hot spots. The PDB code of the input protein should be entered or PDB files of the protein can be loaded. The interacting chains are specified and the distance threshold to extract the interface residues can be chosen as default value, which is summation of van der Waals radii of two atoms plus 0.5 Á or a value defined by the user. On the results page, contacting residues are displayed with their features (residue number, residue name, the chain that the residue belongs to, the corresponding relative accessibility surfaces area values in the monomer and complex forms, a score for its potential to be a hot spot, and the result of the prediction: hot spot or not). The interface file in PDB format and hotspot prediction result file as well as a link for visualization of the interactive 3D model are also available on the result page.

3.

Target proteins should have structural data in the PDB. However, artificial proteins can be searched for a potential interaction if their structural data are added in PDB format. The target set should not contain any DNA or RNA structures, since these kinds of structures are not computed for interaction prediction.

4.

Homolog models are also compatible as target proteins. If a protein contains homologous chains, these chains are represented by one of them in order to avoid redundancy. For example, since laxc protein contains homologous chains A, C and E, chain A is represented as laxcACE.

5.

NACCESS computes the accessible surface area by rolling a solvent probe on the given molecule. The radius of the solvent probe is chosen as 1.4 Å.

6.

If a target protein has no hot spot, the algorithm cannot find a potential interaction for this target protein. It is expected that target proteins with any interface size have at least one hot spot.

7.

If structures of two proteins are similar to each side of a template interface, that is, one target protein has a surface similar to one side of a template interface and the surface of another target protein is similar to the other side of the same template interface, it is expected that they can match with each other. There should be at least five pairs of matched residues from each side of the template interface which are in contact with each other in order to predict that the two target proteins can potentially interact.

8.

In the process of optimizing the predicted protein complex, hydrogen atoms of molecules are also considered and the orientation of the clashing interface residues is adjusted according to the repulsive van der Waals forces. Then, FiberDock calculates binding energies. However, if the solution cannot converge, the global energy cannot be computed.

References

  • 1.White AW, Westwell AD, Brahemi G (2008) Protein-protein interactions as targets for small-molecule therapeutics in cancer. Expert Rev Mol Med 10:e8. [DOI] [PubMed] [Google Scholar]
  • 2.Blazer LL, Neubig RR (2009) Small molecule protein-protein interaction inhibitors as CNS therapeutic agents: current progress and future hurdles. Neuropsychopharmacology 34:126–141 [DOI] [PubMed] [Google Scholar]
  • 3.Neuvirth H, Raz R, Schreiber G (2004) Pro-Mate: a structure based prediction program to identify the location of protein-protein binding sites. J Mol Biol 338:181–199 [DOI] [PubMed] [Google Scholar]
  • 4.Kortemme T, Baker D (2004) Computational design of protein-protein interactions. Curr Opin Chem Biol 8:91–97 [DOI] [PubMed] [Google Scholar]
  • 5.Shoemaker BA, Panchenko AR (2007) Deciphering protein-protein interactions. Part I. Experimental techniques and databases. PLoS Comput Biol 3:e42. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Uetz P, Giot L, Cagney G et al. (2000) A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisiae. Nature 403:623–627 [DOI] [PubMed] [Google Scholar]
  • 7.Landgraf C, Panni S, Montecchi-Palazzi L et al. (2004) Protein interaction networks by proteome peptide scanning. PLoS Biol 2:El4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.MacBeath G, Schreiber SL (2000) Printing proteins as microarrays for high-throughput function determination. Science 289: 1760–1763 [DOI] [PubMed] [Google Scholar]
  • 9.Bauer A, Kuster B (2003) Affinity purification-mass spectrometry. Powerful tools for the characterization of protein complexes. Eur J Biochem 270:570–578 [DOI] [PubMed] [Google Scholar]
  • 10.Keskin O, Ma B, Rogale K et al. (2005) Protein-protein interactions: organization, cooperativity and mapping in a bottom-up Systems Biology approach. Phys Biol 2: S24–35 [DOI] [PubMed] [Google Scholar]
  • 11.Ritchie DW (2008) Recent progress and future directions in protein-protein docking. Curr Protein Pept Sci 9:1–15 [DOI] [PubMed] [Google Scholar]
  • 12.Chen H, Skolnick J (2008) M-TASSER: an algorithm for protein quaternary structure prediction. Biophys J 94:918–928 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Launay G, Simonson T (2008) Homology modelling of protein-protein complexes: a simple method and its possibilities and limitations. BMC Bioinformatics 9:427. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Kundrotas PJ, Lensink MF, Alexov E (2008) Homology-based modeling of 3D structures of protein-protein complexes using alignments of modified sequence profiles. Int J Biol Macromol 43:198–208 [DOI] [PubMed] [Google Scholar]
  • 15.Davis FP, Braberg H, Shen MY et al. (2006) Protein complex compositions predicted by structural similarity. Nucleic Acids Res 34:2943–2952 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Gunther S, May P, Hoppe A et al. (2007) Docking without docking: ISEARCH–prediction of interactions using known inter-faces. Proteins 69:839–844 [DOI] [PubMed] [Google Scholar]
  • 17.Sinha R, Kundrotas PJ, Vakser IA (2010) Docking by structural similarity at protein-protein interfaces. Proteins 78:3235–3241 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.lliergard K, Ardell DH, Elofsson A (2009) Structure is three to ten times more conserved than sequence-a study of structural response in protein cores. Proteins 77:499–508 [DOI] [PubMed] [Google Scholar]
  • 19.Caffrey DR, Samaroo S, Hughes JD et al. (2004) Are protein-protein interfaces more conserved in sequence than the rest of the protein surface? Protein Sci 13:190–202 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Keskin O, Tsai CJ, Wolfson H et al. (2004) A new, structurally nonredundant, diverse data set of protein-protein interfaces and its implications. Protein Sci 13:1043–1055 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Tsai CJ, Lin SL, Wolfson HJ et al. (1996) A dataset of protein-protein interfaces generated with a sequence-order-independent comparison technique. J Mol Biol 260:604–620 [DOI] [PubMed] [Google Scholar]
  • 22.Keskin O, Gursoy A, Ma B et al. (2008) Principles of protein-protein interactions: what are the preferred ways for proteins to interact? Chem Rev 108:1225–1244 [DOI] [PubMed] [Google Scholar]
  • 23.Keskin O, Gursoy A, Nussinov R (2008) Principles of protein recognition and properties of protein-protein interfaces In: Panchenko A, Przytycka T (ed) Protein-protein interactions and networks, vol 9 Springer, London [Google Scholar]
  • 24.Martin J (2010) Beauty is in the eye of the beholder: proteins can recognize binding sites of homologous proteins in more than one way. PLoS Comput Biol 6:el000821. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Keskin O, Nussinov R(2007) Similar binding sites and different partners: implications to shared proteins in cellular pathways. Structure 15:341–354 [DOI] [PubMed] [Google Scholar]
  • 26.Keskin O, Ma B, Nussinov R (2005) Hot regions in protein-protein interactions: the organization and contribution of structurally conserved hot spot residues. J Mol Biol 345:1281–1294 [DOI] [PubMed] [Google Scholar]
  • 27.Ma B, Elkayam T, Wolfson H et al. (2003) Protein-protein interactions: structurally conserved residues distinguish between binding sites and exposed protein surfaces. Proc Natl Acad Sci US A 100:5772–5777 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Moreira IS, Fernandes PA, Ramos MJ (2007) Hot spots-a review of the protein-protein interface determinant amino-acid residues. Proteins 68:803–812 [DOI] [PubMed] [Google Scholar]
  • 29.Konstantinopoulos PA, Karamouzis MV, Papavassiliou AG (2007) Post-translational modifications and regulation of the RAS superfamily of GTPases as anticancer targets. Nat Rev Drug Discov 6:541–555 [DOI] [PubMed] [Google Scholar]
  • 30.Tesmer JJ (2006) Pharmacology. Hitting the hot spots of cell signaling cascades. Science 312:377–378 [DOI] [PubMed] [Google Scholar]
  • 31.London N, Movshovitz-Attias D, Schueler-Furman O(2010) The structural basis of peptide-protein binding strategies. Structure 18:188–199 [DOI] [PubMed] [Google Scholar]
  • 32.Stein A, Aloy P (2008) Contextual specificity in peptide-mediated protein interactions. PLoS One 3:e2524. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Austin RJ, Ja WW, Roberts RW (2008) Evolution of class-specific peptides targeting a hot spot of the Galphas subunit. J Mol Biol 377:1406–1418 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Koshland DEJ, Nemethy G, Filmer D (1966) Comparison of experimental binding data and theoretical models in proteins containing sub-units. Biochemistry 5:365–385 [DOI] [PubMed] [Google Scholar]
  • 35.Tsai CJ, Kumar S, Ma B et al. (1999) Folding funnels, binding funnels, and protein function. Protein Sci 8:1181–1190 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Ma B, Kumar S, Tsai CJ et al. (1999) Folding funnels and binding mechanisms. Protein Eng 12:713–720 [DOI] [PubMed] [Google Scholar]
  • 37.Tsai CJ, Ma B, Nussinov R (l999) Folding and binding cascades: shifts in energy landscapes. Proc Natl Acad Sci U S A 96:9970–9972 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Kumar S, Ma B, Tsai CJ et al. (2000) Folding and binding cascades: dynamic landscapes and population shifts. Protein Sci 9:10–19 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Ma B, Shatsky M, Wolfson HJ et al. (2002) Multiple diverse ligands binding at a single protein site: a matter of pre-existing populations. Protein Sci 11:184–197 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Frauenfelder H, Sligar SG, Wolynes PG (1991) The energy landscapes and motions of proteins. Science 254:1598–1603 [DOI] [PubMed] [Google Scholar]
  • 41.Boehr DD, Nussinov R, Wright PE (2009) The role of dynamic conformational ensembles in biomolecular recognition. Nat Chem Biol 5:789–796 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Masterson LR, Cheng C, Yu T et al. (2010) Dynamics connect substrate recognition to catalysis in protein kinase A. Nat Chem Biol 6:821–828 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Keskin O(2007) Binding induced conformational changes of proteins correlate with their intrinsic fluctuations: a case study of antibodies. BMC Struct Biol 7:31. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Kar G, Keskin O, Gursoy A et al(2010) Allostery and population shift in drug discovery. Curr Opin Pharmacol 10:715–722 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Weikl TR, von Deuster C (2009) Selected-fit versus induced-fit protein binding: kinetic differences and mutational analysis. Proteins 75:104–110 [DOI] [PubMed] [Google Scholar]
  • 46.Aytuna AS, Gursoy A, Keskin O (2005) Prediction of protein-protein interactions by combining structure and sequence conservation in protein interfaces. Bioinformatics 21:2850–2855 [DOI] [PubMed] [Google Scholar]
  • 47.Ogmen U, Keskin O, Aytuna AS et al. (2005) PRISM: protein interactions by structural matching. Nucleic Acids Res 33:W331–336 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Mashiach E, Nussinov R, Wolfson HJ (2010) FiberDock: Flexible induced-fit backbone refinement in molecular docking. Proteins 78:1503–1519 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Aloy P, Bottcher B, Ceulemans H et al. (2004) Structure-based assembly of protein complexes in yeast. Science 303:2026–2029 [DOI] [PubMed] [Google Scholar]
  • 50.Kar G, Gursoy A, Keskin O (2009) Human cancer protein-protein interaction network: a structural perspective. PLoS Comput Biol 5:el000601. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Tuncbag N, Kar G, Gursoy A et al. (2009) Towards inferring time dimensionality in protein-protein interaction networks by integrating structures: the p53 example. Mol Biosyst 5:1770–1778 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Keskin O, Gursoy A, Ma B et al. (2007) Towards drugs targeting multiple proteins in a systems biology approach. Curr Top Med Chem 7:943–951 [DOI] [PubMed] [Google Scholar]
  • 53.Arkin MR, Wells JA (2004) Small-molecule inhibitors of protein-protein interactions: progressing towards the dream. Nat Rev Drug Discov 3:301–317 [DOI] [PubMed] [Google Scholar]
  • 54.Ozbabacan SE, Gursoy A, Keskin O et al. (2010) Conformational ensembles, signal transduction and residue hot spots: application to drug discovery. Curr Opin Drug Discov Devel 13:527–537 [PubMed] [Google Scholar]
  • 55.Berman HM, Westbrook J, Feng Z et al. (2000) The Protein Data Bank. Nucleic Acids Res 28:235–242 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Tuncbag N, Keskin O, Gursoy A (2010) HotPoint: hot spot prediction server for protein interfaces. Nucleic Acids Res 38 Suppl: W402–406 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Hubbard SJTJ (1993) in “Department of Biochemistry and Molecular Biology”, University College, London. [Google Scholar]
  • 58.Jones S, Thornton JM (1997) Analysis of protein-protein interaction sites using surface patches. J Mol Biol 272:121–132 [DOI] [PubMed] [Google Scholar]
  • 59.Nussinov R, Wolfson HJ (1991) Efficient detection of three-dimensional structural motifs in biological macromolecules by computer vision techniques. Proc Natl Acad Sci U S A 88:10495–10499 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 60.Shatsky M, Nussinov R, Wolfson HJ (2004) A method for simultaneous alignment of multiple protein structures. Proteins 56:143–156 [DOI] [PubMed] [Google Scholar]
  • 61.Ortutay C, Valiaho J, Stenberg K et al. (2005) KinMutBase: a registry of disease-causing mutations in protein kinase domains. Hum Mutat 25:435–442 [DOI] [PubMed] [Google Scholar]
  • 62.Smith BJ, Huang K, Kong G et al. (2010) Structural resolution of a tandem hormone-binding element in the insulin receptor and its implications for design of peptide agonists. Proc Natl Acad Sci U S A 107: 6771–6776 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.de la Monte SM, Tong M, Lester-Coll N et al. (2006) Therapeutic rescue of neurodegeneration in experimental type 3 diabetes: relevance to Alzheimer’s disease. J Alzheimers Dis 10:89–109 [DOI] [PubMed] [Google Scholar]
  • 64.Zhang H, Fagan DH, Zeng X et al. (2010) Inhibition of cancer cell proliferation and metastasis by insulin receptor downregulation. Oncogene 29:2517–2527 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Ulanet DB, Ludwig DL, Kahn CR et al. (2010) Insulin receptor functionally enhances multistage tumor progression and conveys intrinsic resistance to IGF-lR targeted therapy. Proc Natl Acad Sci U S A 107:10791–10798 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Belfiore A, Frasca F (2008) IGF and insulin receptor signaling in breast cancer. J Mammary Gland Biol Neoplasia 13:381–406 [DOI] [PubMed] [Google Scholar]
  • 67.Wells JA, McClendon CL (2007) Reaching for high-hanging fruit in drug discovery at protein-protein interfaces. Nature 450: 1001–1009 [DOI] [PubMed] [Google Scholar]
  • 68.Dolgin E (2010) Targeting hotspots of transmission promises to reduce malaria. Nat Med 16:1055. [DOI] [PubMed] [Google Scholar]
  • 69.Landon MR, Amaro RE, Baron R et al. (2008) Novel druggable hot spots in avian influenza neuraminidase H5Nl revealed by computational solvent mapping of a reduced and representative receptor ensemble. Chem Biol Drug Des 71:106–116 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Busschots K, De Rijck J, Christ F et al. (2009) In search of small molecules blocking interactions between HIV proteins and intracellular cofactors. Mol Biosyst 5:21–31 [DOI] [PubMed] [Google Scholar]
  • 71.Lorber DM, Shoichet BK (1998) Flexible ligand docking using conformational ensembles. Protein Sci 7:938–950 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Chaudhury S, Gray JJ (2008) Conformer selection and induced fit in flexible backbone protein-protein docking using computational and NMR ensembles. J Mol Biol 381: 1068–1087 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Wong S, Jacobson MP (2008) Conformational selection in silica: loop latching motions and ligand binding in enzymes. Proteins 71:153–164 [DOI] [PubMed] [Google Scholar]
  • 74.Ding Y, Mamonov AB, Zuckerman DM (2010) Efficient equilibrium sampling of all-atom peptides using library-based Monte Carlo. J Phys Chem B 114:5870–5877 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Cashman DJ, Mamonov AB, Bhatt D et al. (2010) Thermal Motions of the E. Coli Glucose-Galactose Binding Protein Studied Using Well-Sampled, Semi-Atomistic Simulations. Curr Top Med Chem [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Autore F, Melchiorre S, Kleinjung J et al. (2007) Interaction of malaria parasite-inhibitory antibodies with the merozoite surface protein MSP1(19) by computational docking. Proteins 66:513–527 [DOI] [PubMed] [Google Scholar]
  • 77.Huang Z, Wong CF (2009) Conformational selection of protein kinase A revealed by flexible-ligand flexible-protein docking. J Comput Chem 30:631–644 [DOI] [PubMed] [Google Scholar]
  • 78.Tuncbag N, Gursoy A, Nussinov R et al. (2011) Predicting protein-protein interactions on a proteome scale by matching evolutionary and structural similarities at interfaces using PRISM. Nat Protoc 6:1341–1354 [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES