Author manuscript; available in PMC: 2012 Sep 14.
Published in final edited form as: J Am Chem Soc. 2011 Aug 22;133(36):14220–14223. doi: 10.1021/ja206074j

Assessing Helical Protein Interfaces for Inhibitor Design

Brooke N Bullock 1, Andrea L Jochim 1, Paramjit S Arora 1,*
PMCID: PMC3168723  NIHMSID: NIHMS320103  PMID: 21846146

Abstract

Structure-based design of synthetic inhibitors of protein-protein interactions requires adept molecular design and synthesis strategies as well as knowledge of targetable complexes. To address the significant gap between the elegant design of helix mimetics and their sporadic use in biology, we analyzed the full set of helical protein interfaces in the Protein Data Bank to obtain a snapshot of how helices that are critical for complex formation interact with the partner proteins. The results of this study are expected to guide systematic design of synthetic inhibitors of protein-protein interactions. We have experimentally evaluated new classes of protein complexes that emerged from this dataset – highlighting the significance of the results described herein.


Interactions of proteins with partner proteins control essential cellular processes, and misregulation of these interactions is often implicated in disease states.1 However, despite their fundamental role, protein-protein interactions (PPIs) are generally not considered attractive targets for drug design because of their large, and often flat, contact surfaces.2–4 A promising rational design approach for the discovery of PPI inhibitors is centered on the role of protein secondary structures at protein interfaces. Analysis suggests that although protein interfaces are large, often a small subset of the residues contributes significantly to the free energy of binding.5–8 Secondary structures are common scaffolds for the organization of these “hot spots” in proteins.4,9,10 It has been demonstrated that synthetic molecules that reproduce key elements of energetically significant protein secondary structures can inhibit chosen interfaces with high affinity and specificity.11–23

We recently analyzed the full set of helical protein interfaces in the Protein Data Bank to identify potentially suitable candidates for inhibition by small molecules or helix mimetics.24,25 We began by identifying protein complexes that feature helical segments at interfaces and computationally evaluating the energetic contribution of helices to complex formation (Figure 1). Although several examinations of protein–protein interactions have been performed, our approach is unique in its focus on interfaces involving a specific secondary structure. The key motivation behind this structure-based dissection of interfaces is to aid systematic design of synthetic inhibitors of PPIs.

Figure 1.

Evaluation of structures from the Protein Data Bank to identify and assess helical interfaces in protein-protein interactions. The helical interfaces were evaluated by computational alanine scanning mutagenesis.

In earlier reports we categorized helical protein interfaces identified with our algorithm by cellular functions24 and proposed a predictive scale for inhibition of protein-protein interactions by synthetic ligands.25 These studies focused on the disposition and energetic contributions of “hot spot” residues within interfacial helices, and provided a list of interactions that have not previously been inhibited along with candidate helices whose mimics may serve as potent inhibitors. Based on these predictions, we have designed cell-permeable synthetic α-helices that interfere with protein-protein interactions that control transcription of hypoxia inducible genes and Ras signaling.13,14 Here we examine the composition and characteristics of helical domains identified to be critical for protein complex formation. We analyzed the full set of available protein complexes in the PDB to assess amino acid propensity at helical interfaces, location and positioning of hot spot residues on helices, and contact residues on partner proteins.

Examination of entries in the PDB (version August 2009) shows that multiprotein complexes constitute roughly 15% of the databank.24,25 Of these, 62% feature a helix at the interface, highlighting the role of α-helices in protein-protein interactions. However, the presence of a helix at the interface does not imply a critical role for that helix in the interaction. To evaluate the energetic contribution of each helix to complex formation, we employed computational alanine scanning mutagenesis within Rosetta to identify residues that contribute most strongly to complex formation.26,27 Alanine scanning mutagenesis is a standard approach for identifying hot spot residues.28 The results of this analysis have been reported along with a full list of filtered PPIs.25

Three general strategies have been used to develop helix mimetics: helix stabilization, helical foldamers, and helical surface mimetics.29,30 Helix stabilizing methods based on side chain crosslinks18,31 and hydrogen-bond surrogates32 preorganize amino acid residues and initiate helix formation. Helical foldamers,11,33 such as β-peptides34–36 and peptoids,37 are composed of amino acid analogs and are capable of adopting conformations similar to those found in natural proteins. Helical surface mimetics utilize conformationally restricted scaffolds with attached functional groups that resemble the i, i + 3, i + 4, and i + 7 pattern of side chain positioning on an α-helix (Figure 2a). Surface mimetics typically impart functionality from one face of the helix,38 while stabilized peptide helices and foldamers are able to reproduce functionality present on multiple faces of the target helix. A key advantage of helix surface mimicry is that it affords low molecular weight compounds as modulators of protein interactions.39–44

Figure 2.

Energetic contributions of residues on different faces of interfacial helices. (a) Positioning of side chain residues on a canonical α-helix, (b) percent occurrence of hot spot residues on one, two or three helical faces (total number of helices in each category shown in parentheses), (c) percent occurrence of hot spot residues as a function of helix position, (d–f) examples of protein complexes with hot spot residues on one face, two faces and three faces (PDB codes: 1xl3, 1xiu, and 1or7).

A catalog of PPIs predicting energetic contributions of residues on different faces of interfacial helices should provide an invaluable starting point for design of synthetic inhibitors of protein complex formation. Such a dataset would enable design of an appropriate mimic for a particular interface of interest. Based on this hypothesis, we analyzed the occurrence of hot spot residues on different helical faces. Hot spot residues are defined as residues whose mutation to alanine is predicted to decrease the binding energy by at least a threshold value, ΔΔGbind ≥ 1.0 kcal mol−1, as measured in Rosetta energy units.5,7,8,26 We used a cut-off value of ΔΔGavg ≥ 2.0 kcal mol−1 to distinguish strongly interacting interfaces from weakly interacting ones.25 This average binding energy difference accounts for all hot spot residues at an interface. Our current dataset consists of 480 “strongly interacting” interfaces, which were closely examined. The number of such complexes will grow as new entries are deposited in the PDB.
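The two thresholds above can be illustrated with a minimal Python sketch. It assumes that per-residue ΔΔG values from a computational alanine scan are already available as a dictionary, and it reads ΔΔGavg as the mean over the hot spot residues of one interface, which is our interpretation of the text rather than a confirmed detail of the original pipeline. The function names and the example values are hypothetical.

```python
from statistics import mean

HOT_SPOT_THRESHOLD = 1.0       # per-residue ddG threshold quoted in the text (kcal/mol)
STRONG_INTERFACE_CUTOFF = 2.0  # average ddG cut-off quoted in the text (kcal/mol)


def hot_spots(ddg_by_residue):
    """Return the residues whose predicted ddG on mutation to alanine meets the threshold."""
    return {res: ddg for res, ddg in ddg_by_residue.items()
            if ddg >= HOT_SPOT_THRESHOLD}


def is_strong_interface(ddg_by_residue):
    """Classify an interface as strongly interacting.

    Assumption: the average is taken over hot spot residues only.
    An interface with no hot spots is treated as not strong.
    """
    spots = hot_spots(ddg_by_residue)
    if not spots:
        return False
    return mean(spots.values()) >= STRONG_INTERFACE_CUTOFF


# Hypothetical alanine-scan output for one interfacial helix (not data from this study).
example = {"F19": 1.5, "L22": 2.6, "W23": 3.1, "K24": 0.4}
print(sorted(hot_spots(example)))
print(is_strong_interface(example))
```

In this toy input three residues pass the per-residue threshold, and their mean exceeds the interface cut-off, so the interface would be retained for closer examination.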

Analysis reveals that roughly 60% of helical interfaces in the dataset feature helices with hot spot residues on one face of the helix (Figure 2b,d), a third of the complexes utilize helices with hot spots on two faces (Figure 2b,e), and roughly 10% require all three faces for interaction with the target protein partner (Figure 2b,f). The full list of protein-protein interactions that correspond to each category is included in the Supporting Information. Residues i, i + 1, and i + 2 reside on different faces of a single helical turn; we examined models of each interfacial helix individually because the non-integer number of residues per helical turn makes it difficult to classify locations of non-contiguous residues on helical faces. Overall percent occurrence of hot spot residues at the first twelve positions in interfacial helices is depicted in Figure 2c. Our inquiry suggests that helix surface mimetics may prove to be a highly effective class of synthetic inhibitors; however, a significant fraction of protein-protein interactions will require mimetics that array protein-like functionality on multiple faces. Figure 3 shows the targeting potential of various helix mimetics. Terphenyls, the prototypical helix surface mimetics, imitate one helical face. Side chain crosslinked helices can reproduce functionality of up to two faces, although the linker itself may interact with the protein pocket. Hydrogen bond surrogate (HBS) helices and β-peptide foldamers potentially afford complete replicas of functionality present on protein α-helices. We categorized the functions of protein-protein interactions featuring hot spots on different numbers of helical faces as defined in the PDB (Figure 4). Some interactions could fall into more than one function category. The four largest categories for each type are gene regulation, enzymatic function, cell cycle, and signaling.
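The face classification discussed above can be approximated with an idealized helical wheel. The following Python sketch is not the procedure used in this study, where each helix model was inspected individually; it assumes a canonical α-helix with 3.6 residues per turn (100° per residue) and divides the wheel into three 120° sectors, placed so that i, i + 3, i + 4, and i + 7 share one sector. The sector offset and the function names are illustrative assumptions, and residues near a sector boundary may be assigned differently than by visual inspection.

```python
DEGREES_PER_RESIDUE = 100.0  # ideal alpha-helix: 360 degrees / 3.6 residues per turn
FACE_WIDTH = 120.0           # three equal faces around the helical wheel
FACE_START = -70.0           # assumed start of the face holding i, i+3, i+4, i+7


def face_index(position, reference):
    """Return 0, 1 or 2 for the wheel sector of `position` relative to `reference`."""
    angle = ((position - reference) * DEGREES_PER_RESIDUE - FACE_START) % 360.0
    return int(angle // FACE_WIDTH)


def faces_used(hot_spot_positions):
    """Return the set of faces occupied by hot spots, using the first hot spot as i."""
    if not hot_spot_positions:
        return set()
    reference = min(hot_spot_positions)
    return {face_index(p, reference) for p in hot_spot_positions}


# Hypothetical hot spot positions along a helix (residue numbers are illustrative).
print(len(faces_used([1, 4, 5, 8])))  # i, i+3, i+4, i+7 pattern
print(len(faces_used([1, 2, 3])))     # i, i+1, i+2 pattern
```

Under these assumptions the i, i + 3, i + 4, i + 7 pattern maps to a single face, while i, i + 1, and i + 2 map to three different faces, consistent with the description in the text.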

Figure 3.

Potential of various helix mimetics to reproduce functionality of one, two or all three faces of protein α-helices.

Figure 4.

Functions associated with protein-protein interactions featuring hot spots on (a) one helical face, (b) two helical faces and (c) three helical faces.

The helical interfaces that form this dataset allow a detailed analysis of basic interactions that underlie protein complex formation. Examination of these fundamental forces will inform design of PPI inhibitors. We calculated the percentage of each helical residue that contributes strongly to binding. (Glycine and proline residues were exempted from alanine scanning since substitution of proline or glycine with alanine may cause a conformational change in the protein backbone.) Leucine dominates the interface region (Figure 5a), which is not surprising as leucine is also the most prevalent residue in proteins in general. When normalized for natural abundance,45 we find that aromatic residues and arginine, along with leucine, are overrepresented as hot spots at helical interfaces in comparison to polar residues (Figure 5c). These results correspond with previous studies of the types of amino acids appearing as hot spot residues in protein interfaces (Supporting Information, Figure S2),5,9,10,46,47 although our dataset is considerably larger than those previously examined. We expect these results to help guide design of helix mimetic libraries.40,43,44,48–50
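The normalization for natural abundance can be sketched as a simple ratio: the fraction of hot spots contributed by a residue type divided by the fraction of that residue type in proteins generally. The Python example below is a minimal illustration of that ratio under our own assumptions; the function name is hypothetical, and the counts and abundance values are toy numbers, not values from this dataset or from reference 45.

```python
def normalized_propensity(hot_spot_counts, natural_abundance):
    """Return hot spot frequency divided by natural abundance for each residue type.

    hot_spot_counts: residue type -> number of hot spots of that type
    natural_abundance: residue type -> fractional abundance in proteins (0..1)
    A value above 1 means the residue type is overrepresented among hot spots.
    """
    total = sum(hot_spot_counts.values())
    return {aa: (count / total) / natural_abundance[aa]
            for aa, count in hot_spot_counts.items()}


# Toy numbers for illustration only.
toy_counts = {"LEU": 30, "TRP": 10, "SER": 10}
toy_abundance = {"LEU": 0.10, "TRP": 0.01, "SER": 0.07}

propensity = normalized_propensity(toy_counts, toy_abundance)
for aa in sorted(propensity, key=propensity.get, reverse=True):
    print(aa, round(propensity[aa], 2))
```

In the toy input the most frequent hot spot residue is not the most enriched one after normalization, which is the distinction drawn between Figure 5a and Figure 5c.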

Figure 5.

(a) Percent occurrence of hot spot amino acids in helix-mediated protein interfaces, (b) percent occurrence of hot spot residues classified into similar groups, (c) representation of hot spot amino acids normalized to natural abundance of amino acids in proteins, and (d) average predicted decrease in binding energy of helical interfaces upon mutation of hot spot residues to alanine. Color code: aromatic (phenylalanine, tryptophan and tyrosine), white; hydrophobic (isoleucine, leucine and valine), green; negatively charged (aspartic acid and glutamic acid), blue; polar neutral (asparagine, cysteine, glutamine, serine and threonine), gray; positively charged (arginine, histidine and lysine), red.

Hydrophobic and aromatic residues constitute a majority of hot spot residues; however, polar and charged residues are also significant contributors at interfaces (Figure 5b).51 This analysis supports the common perception that protein-protein interactions are generally hydrophobic but feature key salt-bridges and other polar interactions that appreciably influence the binding energy landscape.8 This view is further supported by the evaluation of residues on the partner protein that are within 5 Å of the helical hot spot residue (Supporting Information, Figure S3). Not surprisingly, a majority of residues that are within the specified radius of a hydrophobic residue are themselves hydrophobic, which is consistent with the hypothesis that the burial of a hot spot in a hydrophobic environment is a major stabilizing influence.5 In this respect, it is interesting to note that, on average, mutations of aromatic residues to alanine are more destabilizing than substitution of other interfacial residues, with the effect being dependent on the size of the aromatic ring (Figure 5d).
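The 5 Å contact evaluation can be sketched as a distance filter over atomic coordinates. The Python example below is an illustration only: it assumes that a partner residue counts as a contact when any of its atoms lies within 5 Å of any atom of the hot spot residue, a criterion the text does not specify, and it uses hypothetical residue labels and coordinates rather than a parsed PDB entry.

```python
import math

CUTOFF_ANGSTROM = 5.0  # contact radius quoted in the text


def partner_contacts(hot_spot_atoms, partner_atoms, cutoff=CUTOFF_ANGSTROM):
    """Return labels of partner residues with any atom within `cutoff` of the hot spot.

    hot_spot_atoms: list of (x, y, z) coordinates for the hot spot residue
    partner_atoms: list of (residue_label, (x, y, z)) for partner-protein atoms
    Assumption: any-atom to any-atom distance defines a contact.
    """
    contacts = set()
    for label, coord in partner_atoms:
        if any(math.dist(coord, atom) <= cutoff for atom in hot_spot_atoms):
            contacts.add(label)
    return contacts


# Hypothetical coordinates in angstroms (not taken from any structure).
hot_spot = [(0.0, 0.0, 0.0), (1.5, 0.0, 0.0)]
partner = [
    ("LEU54", (3.0, 0.0, 0.0)),
    ("LEU54", (9.0, 0.0, 0.0)),
    ("ILE61", (0.0, 0.0, 4.9)),
    ("ASP68", (0.0, 8.0, 0.0)),
]
print(sorted(partner_contacts(hot_spot, partner)))
```

For real structures a spatial index would be preferable to this all-pairs loop, but the filter itself is the same.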

Helical protein-protein interactions have so far been successfully targeted by a diverse array of mimetics.12,14,16,18,21,23 Preliminary success in this field validates helix design concepts from multiple research groups and provides an impetus for designing inhibitors of interactions previously considered to be intractable to inhibition by synthetic ligands. A key motivation for our approach is to bridge the significant chasm between the elegant design of helix mimetics and their sporadic use in biology. This study provides a list of targets to be considered for different classes of helix mimetics based on the number of contact surfaces the target helix utilizes for interactions with partner proteins. We have successfully used this information to identify two new classes of protein-protein interactions amenable to disruption by helix mimetics,13,14 supporting the basic hypotheses and results of these computational efforts.

Acknowledgments

This work was financially supported by the National Institutes of Health (GM073943) and National Science Foundation (CHE 0848410). B.N.B. thanks New York University for a Kramer Pre-doctoral Fellowship and A.L.J. thanks NYU for the Dean’s Dissertation Fellowship.

Footnotes

Supporting Information. Lists of helical protein-protein interactions with predicted occurrences of hot spot residues on different faces of target helix, and summary of helix contact residues. This material is available free of charge via the Internet at http://pubs.acs.org.

References
