Abstract
Hubs are highly connected proteins in a protein-protein interaction network. Previous work has implicated disordered domains and high surface charge as the properties significant in the ability of hubs to bind multiple proteins. While conformational flexibility of disordered domains plays an important role in the binding ability of large hubs, high surface charge is the dominant property in small hubs. In this study, we further investigate the role of the high surface charge in the binding ability of small hubs in the absence of disordered domains. Using multipole expansion, we find that the charges are highly distributed over the hub surfaces. Residue enrichment studies show that the charged residues in hubs are more prevalent on the exposed surface, with the exception of Arg, which is predominantly found at the interface, as compared to non-hubs. This suggests that the charged residues act primarily from the exposed surface rather than the interface to affect the binding ability of small hubs. They do this through (i) enhanced intra-molecular electrostatic interactions to lower the desolvation penalty, (ii) indirect long – range intermolecular interactions with charged residues on the partner proteins for better complementarity and electrostatic steering, and (iii) increased solubility for enhanced diffusion-controlled rate of binding. Along with Arg, we also find a high prevalence of polar residues Tyr, Gln and His and the hydrophobic residue Met at the interfaces of hubs, all of which have the ability to form multiple types of interactions, indicating that the interfaces of hubs are optimized to participate in multiple interactions.
Keywords: protein-protein interaction networks, hubs, electrostatic potential, multipole expansion, surface charge
Hubs are important proteins in protein-protein interaction networks given their ability to bind multiple partners. Their binding promiscuity has been attributed to the presence of disordered domains and highly charged surfaces1. Conformational flexibility from disordered domains is seen as the major contributor to the binding ability of large hubs (average length greater than 300 residues), while high surface charge is the predominant factor affecting the binding ability in small hubs1. Examples of such small hubs include Ubiquitin and Ferredoxin, which bind several proteins and have highly charged surfaces but no known disordered domains.
Previous studies on hubs have concentrated on large hubs with disordered regions1–3. Here, we focus on the small hubs (average length of 231 residues) without known disordered regions and extend our previous work to identify the role of charged residues in their binding promiscuity. Though the presence of high surface charges is known to promote binding promiscuity in small hubs, its reason is not yet clear. For instance, it is not known whether these charges have a specific distribution on the surface of the small hubs that allows them to bind multiple proteins, or if the charges are concentrated either at the binding interfaces or on the exposed surface or both, and how different these characteristics are in hubs as compared to non-hubs.
Charged residues on protein surfaces have been known to contribute to binding through either long-range or short-range electrostatic interactions with residues on the partner proteins4. Along with polar residues, they are also known to contribute to binding specificity and complex stabilization5, and have been recognized as hotspots on protein interfaces6,7. Charged residues are also more commonly found at the binding interfaces of proteins participating in transient interactions8. Given these properties of charged residues, we attempt to find out more about their role on the surfaces of small hubs.
In this study, we first find the distribution of charges on hub surfaces by mapping the surface electrostatic potential onto a sphere and performing multipole expansion to identify the dominant multipole moments. We calculate the relative enrichment of conserved residues, at the interface and on the exposed surface, of small hubs as compared to those in the entire proteins, to find the region where the charged residues are more prevalent. We also study the hydropathy of the interfaces and exposed surfaces of small hubs. We then compare the above characteristics of hubs with those of non-hubs. Using these analyses, we try to elucidate the role of charged surface residues in the binding ability of small hubs in the absence of disordered regions. There may be other factors that differentiate hubs from non-hubs, such as, their molecular function, levels of expression and degradation, and cellular localization. However, we limit our study to the structural features of hubs and non-hubs.
Materials and methods
Data set
We used the data set of hubs and non-hubs with structures in Protein Data Bank (PDB)9 identified in Patil and Nakamura1 as our base. The interactions were taken from the protein-protein interactions database, IntAct10. Interactions from high-throughput datasets were filtered to remove false positives11. For the purposes of this study, hubs were defined as proteins with 5 or more known interactions, while non-hubs were those with only 1 known interaction. We selected 50 hubs and 131 non-hubs, which satisfy the following criteria:
protein has 10 or less disordered residues i.e. residues with missing electron density or missing structure in PDB,
surface electrostatic potential of the protein was available through eF-site12.
Of these, 35 hubs and 27 non-hubs had at least one complex structure with another protein. The selected set of 35 hubs had an average length of 231 residues and the set of 27 non-hubs had an average length of 273.
Multipole expansion of surface electrostatic potential
The multipole expansion of surface electrostatic potential was calculated for 50 hubs and 131 non-hubs. For each protein, the surface electrostatic potential was obtained from eF-site12 as the potential at several points on the surface of the protein. This potential was then mapped onto a sphere of radius equal to the radius of gyration of the protein. Thus, for every point (x, y, z) on a protein with surface potential V, we calculated the spherical coordinates (R, θ, φ). Then, for each point (R0, θ, φ), where R0 is the radius of gyration of the protein plus 1.5 Å, we set the surface potential of the point to V. This gave us a sphere of radius R0 mapped with the surface potential of the protein at all θ and φ. Using this sphere with the representative surface potential, we performed the multipole expansion and calculated the first 4 poles (monopole, dipole, quadrupole, octupole) on the surface of the sphere at 60 points which were randomly selected from all possible points on the sphere. The multipole expansion at any point (R0, θ, φ), was performed as follows:
where
V(R0) = surface potential at (R0, θ, φ),
Z = monopole,
mi= dipole moment,
Qij= quadrupole tensor,
Oijk= octupole tensor,
R0i, R0j, R0k= vectors of magnitude R0, from the centre of the sphere to the points i, j and k on the sphere.
The spherical harmonics, are given by,
where
= associated Legendre function,
Vi(R0) = surface potential at point i, (R0, θ, φ), on the sphere,
ΔSi = surface area of a triangular region on the sphere surface centered at i,
= total surface area of the sphere.
n = total number of points on the sphere as extrapolated from eF-site
We calculated the magnitude of the dipole contribution by adding the squares of the dipole moments, , calculated at 60 random points on the sphere and then taking its square root. The quadrupole and octupole tensors were calculated in a similar manner. In order to check the validity of our method, we predicted the actual potential at each point on the surface of the sphere as the sum of the monopole, dipole, quadrupole and octupole tensors at that point and manually compared it with that on the original sphere. The number of positive charges on the surfaces of hubs and non-hubs, shown in Supplementary Tables S1 and S2, were calculated as the sum of the residues, Arg and Lys. The number of negative surface charges was calculated as the sum of Asp and Glu residues.
Conserved interface and exposed surface residues
An interface residue was identified as a residue on a chain with at least one atom at a distance of 3 Å or less from an atom of another residue on another chain. Exposed surface residues were those that were not interface residues and were identified by DSSP13 as surface residues with a solvent accessible surface area greater than 60 Å2. Conserved interface and exposed surface residues were those that had a conservation score of less than −0.5 as determined by Consurf-HSSP14. The correlation of the enrichment of conserved interface residues with the experimentally determined interface hotspot residues was calculated using the Pearson’s correlation coefficient, r. A t-test with 49 degrees of freedom (for a sample size of 51 proteins) was used to determine the statistical significance, or p-value, of the correlation coefficient.
Enrichment of residues
The enrichment of conserved interface residues for 31 of 35 hubs and 20 of 27 non-hubs was calculated as:
where
Nci(int) = number of conserved residues of type i at the interfaces of all proteins,
Nc(int) = total number of conserved residues at the interfaces of all proteins,
Ni(tot) = number of residues of type i in all proteins,
N(tot) = total number of residues in all proteins.
The conserved exposed residue enrichment was calculated in a similar manner over all conserved exposed residues.
Hydropathy
The average hydropathy of the interfaces of 35 hubs and 27 non-hubs was calculated as:
where
Hi = hydropathy of residue i in the Kyte-Doolittle hydropathy index15,
Ni(int) = number of residues of type i at the interfaces of all proteins,
N(int) = total number of residues at the interfaces of all proteins.
The hydropathy of the exposed surfaces of hubs and non-hubs was calculated in a similar manner using the residue propensity over all exposed residues. Hydropathy was calculated for all residues and not just conserved ones to take into account the entire protein surface. 95% confidence intervals were calculated for the mean hydropathy values at a significance of 0.05.
Results
Higher surface charge in hubs is known to affect their binding ability resulting in promiscuity1. We study the distribution of the charged residues on the surfaces of 50 hubs and 131 non-hubs. Of these, we use 35 hubs and 27 non-hubs (Supplementary Tables S1 and S2) to identify the role of charged residues on the surfaces of hubs (see Methods). For the purposes of this study, we define a hub as a protein with 5 or more known interactions, and a non-hub as a protein with 1 known interaction.
Charge distribution on hub surfaces
In order to determine the distribution of charges on the surfaces of hubs as compared to non-hubs, we performed multipole expansion of the surface electrostatic potential on 50 hubs and 131 non-hubs. As indicated in the Methods section, we calculated the monopole, dipole, quadrupole and octupole moments of the surface electrostatic potential of the sphere using multipole expansion. To validate our calculations, we predicted the electrostatic potential on the surface of the sphere using the values of the multipole moments obtained and checked this prediction against the actual values on the protein surface. Figure 1 shows the surface electrostatic potential of the protein, the potential mapped onto a sphere, the calculated multipole moments and the predicted surface potential from the multipole moments, for two proteins — Ubiquitin and Ubiquitin-like SMT3 precursor. It can be seen that predicted surface electrostatic potential roughly follows the trends of the actual potential, though sharp changes in the potential are lost. The values for individual multipole moments, for some of the hubs and non-hubs, are shown in Supplementary Tables S1 and S2.
Figure 1.
Projection of surface electrostatic potential onto a sphere, calculation of multipole moments by multipole expansion and prediction of the surface potential for (A) Ubiquitin and (B) Ubiquitin-like SMT3 precursor. Negative potential is indicated in red, positive potential in blue and hydropathy in yellow. Electrostatic potential on the protein surface was obtained from eF-site12. The electrostatic potential on the surface of the sphere were visualized using Molscript33 and Raster3D34. Dipole, Quadrupole and Octupole values indicated are those calculated for 60 random points on the surface of the sphere.
Figure 2 shows the percentage of 50 hubs and 131 non-hubs with a dominant dipole, quadrupole and octupole moments. The combination of quadrupole and octupole moments is dominant in 64% of hubs and 63% non-hubs. Thus, on average, small hubs and non-hubs show a dominant quadrupolar or octupolar nature in their surface charge distribution. However, we do not find any significant difference in the distribution of charges on the surfaces of hubs and non-hubs. A comparison of the monopole moments of hubs and non-hubs shows that 51.43% hubs have a positive monopole moment as compared to 40.74% non-hubs, indicating a net positive charge on the hub surface. The dominance of quadrupolar or octupolar moments in hubs can be interpreted by understanding the physical manifestation of each moment in the multipole expansion. The monopole moment corresponds to the net charge on the surface of the protein while the dipole moment corresponds to the average position of the positive and negative charges along each co-ordinate axis. The quadrupole and octupole moments represent the spread of charge from the co-ordinate axes. Thus a dominant quadrupolar or octupolar nature corresponds to a greater spread of charge over the surface of the sphere, and hence the surface of the protein. This suggests that the charged residues on the surfaces of hubs are well distributed over the surface. Since the surface charges are implicated in promiscuous binding of hubs1 and they are spread out over the surface, it leads to two possible roles for the charged residues: (i) the charged residues act directly through short-range electrostatic interactions with the residues of the partner proteins through several different interfaces distributed over the surface of the hub to bind multiple partners, or (ii) the charged surface residues participate in long-range electrostatic interactions, from the exposed surface, with the residues of the partner proteins in the form of electrostatic steering4, while using only a few specific interfaces for binding. In the following sections, we examine which of these roles is predominantly observed in small hubs.
Figure 2.
Percentage of hubs and non-hubs with a dominant dipole (green), quadrupole (blue) or octupole (orange) moment.
Residue enrichment on hub surfaces
In order to determine if the charged residues on hub surfaces act, primarily, through multiple interfaces or through the exposed surface (non-interface), we studied the enrichment of conserved interface and exposed surface residues in hubs with respect to its total residue propensity. We looked for the conserved residues on the surface because we wanted to identify those that were important to protein function. We were able to identify conserved interface and exposed surface residues for 31 hubs and 20 non-hubs (Supplementary Tables S1 and S2). Table 1 and Figure 3 give the enrichment of the conserved residues at the interface and exposed surface of hubs and non-hubs. We also checked the correlation of the enrichment of conserved interface residues in all 51 proteins (31 hubs and 20 non-hubs) with the interface hotspots identified by Bogan and Thorn6. Our data correlates reasonably well with the experimentally determined hotspots at r=0.60 (t=5.27, p=0.001), when the outlier, Trp, is removed (Supplementary Table S3). The correlation coefficients for 31 hubs and 20 non-hubs with the interface hotspot data, independently, are 0.60 and 0.30 respectively.
Table 1.
Relative enrichment of conserved residues at the interface and on the exposed surface (non-interface) of hubs and non-hubs
| Residues | Hubs | Non-hubs | ||
|---|---|---|---|---|
|
|
|
|||
| Interface | Exposed surface | Interface | Exposed surface | |
| D | 1.14 | 1.74 | 1.61 | 1.24 |
| E | 1.29 | 2.08 | 0.34 | 1.41 |
| K | 0.53 | 1.90 | 1.58 | 2.02 |
| R | 3.55 | 2.43 | 2.33 | 2.07 |
| S | 0.81 | 0.41 | 1.27 | 0.64 |
| T | 1.02 | 0.58 | 1.67 | 0.77 |
| Y | 2.27 | 1.04 | 1.15 | 0.67 |
| H | 1.39 | 1.24 | 0.98 | 1.43 |
| C | 0.79 | 0.19 | 0.91 | 0.33 |
| N | 0.91 | 1.50 | 1.85 | 0.90 |
| Q | 1.67 | 0.82 | 1.42 | 0.93 |
| W | 0.93 | 2.27 | 0.61 | 1.57 |
| A | 0.22 | 0.35 | 0.58 | 0.64 |
| V | 0.30 | 0.26 | 0.23 | 0.42 |
| F | 1.14 | 0.70 | 0.29 | 0.96 |
| P | 0.64 | 1.38 | 0.00 | 1.90 |
| M | 1.52 | 1.39 | 1.00 | 0.44 |
| I | 0.43 | 0.38 | 0.39 | 0.61 |
| L | 0.57 | 0.50 | 0.65 | 0.62 |
| G | 1.09 | 0.48 | 1.42 | 0.49 |
Figure 3.
Relative enrichment of conserved interface (black) and exposed surface (white) residues in (A) hubs and (B) non-hubs.
Figure 3 shows that charged residues, Asp, Glu and Lys, are more prevalent on the exposed surface than at the interface of hubs. This suggests that the charged residues on hub surfaces are more likely to participate indirectly in binding, acting from the exposed surface. Arg appears to be a special case. It is the most abundant residue at the interfaces of both hubs and non-hubs. This is in agreement with previous studies which indicate Arg as the most common residue at protein interfaces16. However its enrichment at the interface is almost 1.5 times greater in hubs as compared to non-hubs. We also find an increased enrichment of Tyr, Gln, His and Met at the hub interface as compared to the non-hub interface. All of these have been previously identified as interface hotspot residues6,7.
Hydropathy
We calculated the average hydropathy of the interfaces and exposed surfaces of 35 hubs and 27 non-hubs, using the amino acid propensity and the Kyte-Doolittle hydropathy scale15. Table 2 shows that both the interfaces and exposed surfaces are hydrophilic in hubs and non-hubs, as indicated by negative hydropathy values. As expected, the exposed surfaces are more hydrophilic than the interfaces. But the interfaces and exposed surfaces of hubs are more hydrophilic than those of non-hubs, further supporting the interpretation that the charges on the hub surfaces, though present at the interface, are more likely to act from the exposed surface.
Table 2.
Average hydropathy of the interfaces and exposed surfaces of hubs and non-hubs. Negative values indicate hydrophilic nature, while positive values indicate hydrophobic nature. Values in brackets indicate the 95% confidence intervals at a significance of 0.05
| Surface region | Hydropathy | |
|---|---|---|
|
| ||
| Hubs | Non-hubs | |
| Interface | −1.44 (±0.31) | −1.31 (±0.45) |
| Exposed surface | −1.86 (±0.28) | −1.46 (±0.27) |
Discussion
We used multipole expansion and residue enrichment studies to identify the role of the excess charged residues on the surfaces of hubs in their binding ability, as compared to non-hubs.
Electrostatics of binding
Using multipole expansion, we find that the charges are widely distributed over the surfaces of hubs. The residue enrichment results clearly imply that more charged residues are present on the exposed surfaces of hubs as compared to the interfaces. This indicates that charged residues on hub surfaces contribute to the promiscuity of hubs while primarily acting from the exposed surface. Clearly, this does not mean that hubs do not use multiple interfaces to bind their targets. But the results indicate that irrespective of the number of interfaces used, the charged residues, except Arg, are more likely to be found on the exposed surface than at the interface. For instance, in our dataset some hubs do have more than one interface, but generally do not have a very large number of interfaces spread across their surfaces. Figure 4 shows two such hubs, Ubiquitin and Ferredoxin, that have two binding sites each, using which they bind their known partners. Until recently, Ubiquitin was known to have only one binding site that it uses to bind its targets. This site is primarily hydrophobic and centered around Ile44. Several proteins were known to bind this interface somewhat differently with varying weak binding affinities17. A new site has now been found using which Ubiquitin binds Rabex5. This interface is slightly charged and centered around Asp5818. In the case of Ferredoxin, two highly charged binding sites are used to interact with Ferredoxin-NADP+ reductase and sulfite reductase, respectively19.
Figure 4.
Binding interfaces of hubs and non-hubs. (A) Ubiquitin (PDB ID: 1WR1-A) (B) Surface potential of Ubiquitin (1wr1_1-A from eF-site12). (C) Ubiquitin (pink) bound to two molecules of Rabex5 (light blue) using distinct binding interfaces shown in blue (Lys6, Thr7, Gly10, Arg42, Ile44, Ala46, Gly47, His68, Val70) and purple (Ser20, Glu51, Arg54, Thr55, Ser57, Asp58, Tyr59, Asn60, Gln62), respectively (PDB ID: 2C7N). (D) Ferredoxin (PDB ID: 1GAQ-B) (E) Surface potential of Ferredoxin (1gaq-B from eF-site12). (F) Ferredoxin (pink) bound to Ferredoxin NADP+ reductase (FNR) (light blue) using the interface residues shown in blue (Gln61, Leu64, Asp65, Asp66, Gln68, Leu95, Thr96, Gly97, Ala98). Purple residues (Glu29, Glu30, Asp34, Glu92, Glu93, Glu94) show the binding site of Ferredoxin to Sulphite reductase (SiR) (PDB ID: 1GAQ). Red indicates negative potential, blue indicates positive potential and yellow indicates hydropathy in B and E. Figure created using jV 3.212.
The charged residues on the exposed surface of a hub may increase its binding ability by sufficiently reducing the electrostatic binding free energy to make interactions with several proteins favourable. The electrostatic binding free energy of two proteins is the sum of (i) the desolvation contributions of the two proteins arising from the loss of interactions of the charged residues with the solvent at the interface, and (ii) contributions of solvent screened electrostatic interactions in the bound state between the charged residues at the interfaces of the two proteins in the form of salt bridges and hydrogen bonds. Of these, the first component is unfavourable for binding while the second one is favourable and accounts for the effect of charged residues at the interface20. The net effect of electrostatics at the interface is generally to destabilize protein binding due to the large desolvation penalties. However, in several cases, this desolvation penalty is reduced by favourable intra-molecular interactions amongst the surrounding charged residues of one or both the proteins21. Intra-molecular interactions are electrostatic interactions, exposed during binding, that take place among functional groups within the same binding partner, as opposed to inter-molecular interactions that take place across the interfaces of two binding partners22. Thus, along with the charged residues at the interface, those that are present around the interface also play an important role in reducing the electrostatic binding free energy thus promoting binding21.
Charged residues on the exposed surface are also known to make significant contributions to binding through long-range electrostatic interactions22. These “action-at-a-distance” interactions, or electrostatic steering, act through regions of solvent and thus improve the overall complementarity of the interfaces of the interacting proteins and modulate the binding free energy to achieve an appropriate level of binding affinity23. Thus the charged residues on the exposed surface of a hub can increase its binding ability considerably, through enhanced intra-molecular interactions and long-range inter-molecular electrostatic interactions, without participating in direct contact with the interfaces of the partner proteins.
Hydropathy results show that the interfaces of hubs are more hydrophilic than those of non-hubs. Along with the results of enrichment in Figure 3, this indicates that even though the charged residues on hub surfaces predominantly act from the exposed surfaces, they are more enriched at the hub interfaces as compared to non-hubs. Thus, the interfaces of hubs have some charged residues that participate in direct binding. But the bulk of the charged residues are present on the exposed surface. An example of this is the hub Ferredoxin, which has two highly charged interfaces along with a very charged exposed surface (Figure 4D, E, F).
Interface flexibility and charge
The high prevalence of Arg at the hub interface provides a lot of binding flexibility to the hub. Arg forms multiple types of favourable interactions6. The Arg side chain is considerably flexible, forming a good anchor residue as a result of its ability to anchor its side chain into a binding groove of an interaction partner24. It also allows binding across interfaces in various orientations. The dihedral angle, CG-CD-NE-CZ, of Arg varies through a spectrum of values, from −120, −60, 60, 120, to 180 degrees, in the structures in our data set (Supplementary Table S4). For instance, a superposition of all structures of Ubiquitin with its various interaction partners binding the interface centered at Ile44, shows the various side-chain orientations for Arg42 which also participates in binding (Figures 5A, B). Along with salt bridges25 and hydrogen bonds16, Arg is also involved in cation-π interactions with aromatic residues16,26 across the interface. A cation-π interaction of Arg with Tyr in the hub, Son of Sevenless-1, is shown in Figure 5C. Arg also provides increased specificity of interaction16.
Figure 5.
Selected views of Arginine participating in various types of interactions in different binding orientations. (A) Arg42 of Ubiquitin (blue) participating in a salt bridge with Asp444 of the CUE domain of Vacuolar protein sorting-associated protein VPS9 (orange). The dihedral angle, CG-CD-NE-CZ, of Arg42 is 131.9 degrees. (PDB ID: 1P3Q). (B) Arg42 of Ubiquitin (blue) forming a hydrogen bond with Gly47 backbone (distance 2.8 Å) of Tumor susceptibility gene 101 protein TSG101 (orange). The dihedral angle, CG-CD-NE-CZ, of Arg42 is −143.4 degrees. (PDB ID: 1S1Q). (C) Arg73 of Son Of Sevenless-1 (blue) participating in a cation-π interaction with Tyr884 of H-RAS (orange). The dihedral angle, CG-CD-NE-CZ, of Arg73 is 124.2 degrees. (PDB ID: 1BKD) Figure created using jV 3.212.
Met is also found predominantly at hub interfaces. It is enriched in interaction hotspots, and contributes favourably to the binding free energy through reduced entropic penalty due to the lack of rotatable bonds and through hydrophobic effects6,7,25. Met, along with Arg, is a good anchor residue with a flexible side chain24. Tyr and His, which are enriched at hub interfaces, are known interface hotspots6,27. Tyr has the ability to form multiple interactions6, while His has the ability to form hydrogen bonds27. Thus, clearly, hub interfaces prefer residues that allow formation of multiple flexible interactions to further increase their binding ability. Gln is also enriched at hub interfaces. Though it is not an interface hotspot residue, it has been previously found to be enriched at the interfaces7,27.
Charged and polar residues at interfaces have been associated with transient or non-obligatory interactions8,28. They are also enriched in small sized interfaces25. This goes well with the functional requirements of small hubs since they need to bind multiple partners during their lifetime and hence participate predominantly in transient interactions. Due the highly charged nature of their exposed surfaces, the binding affinity of the hubs participating in these transient interactions may also be greatly affected by changes in the physiological conditions in the cellular compartment (e.g. pH, temperature, concentrations of ions, chemicals and proteins) 29.
Though we have used multiple complex structures for hubs, where available, the determination of interface and exposed surface residues is limited by the absence of several complex structures for hubs with multiple interfaces. However, there is still no consensus on whether most hubs use a single interface or multiple interfaces 30,31, though hubs with a single interface are predominant in our data set.
The correlation coefficients of the conserved interface residues of hubs and non-hubs with those of previously determined interface hotspot residues indicate that these reflect the trend at hub interfaces only. Thus, the enrichment of residues at non-hub interfaces are not adequately represented by interface hotspot residues determined by Bogan and Thorn6, Hu et al.27 and Ma et al.7.
Solubility
Solubility of proteins is known to be important for their functions32. The charged residues on the surfaces of hubs may also increase their binding ability by providing increased surface hydrophilicity and hence increased solubility. This increased solubility may significantly enhance association rates of hubs that are normally limited by rates of diffusion in the solvent5.
Conclusion
We show that hubs have more charged interfaces and exposed surfaces as compared to non-hubs. Using multipole expansion of electrostatic potential, we show that the combination of quadrupole and octupole moments is dominant in small hubs. This indicates that the charges on the surfaces of hubs are spread out over their surfaces. This distributed charge is predominantly found on the exposed surface and most likely increases the promiscuity of small hubs through short-range intra-molecular interactions and long-range action-at-a-distance interactions through the solvent. The interfaces of hubs are enriched in Arg, most probably due to its ability to form multiple interactions. The interfaces are also enriched in other polar residues, Try, His, Gln and the hydrophobic Met, which provide a hydrophobic effect or the ability to form multiple interactions. The absence or relative reduction of the characteristics discussed above, in non-hubs, further stresses their role in the promiscuity of hubs. Thus the charges on the exposed surfaces and at the interfaces of hubs are optimized for participation in the interactions with several other proteins. Undoubtedly, these factors must also affect the binding ability of large hubs even if they have the flexibility afforded by disordered regions, though we have limited the current study to small hubs.
Supplementary Information
Acknowledgements
We would like to thank Dr. Kengo Kinoshita of Tokyo University for his invaluable help with eF-site and jV 3.2, and Dr. Daron Standley from PDBj for his help with the analysis of PDB files. This study was supported by the grant-in-aid from the Institute for Bioinformatics Research and Development of the Japan Science and Technology Agency (BIRD-JST), and by the grant-in-aid for Scientific Research on priority areas No. 18054013 from the Ministry of Education, Science, Sports and Culture of Japan. AP would like to thank the Department of Biotechnology, India, for its support through the Centre Of Excellence grant.
References
- 1.Patil A, Nakamura H. Disordered domains and high surface charge confer hubs with the ability to interact with multiple proteins in interaction networks. FEBS Lett. 2006;580:2041–2045. doi: 10.1016/j.febslet.2006.03.003. [DOI] [PubMed] [Google Scholar]
- 2.Haynes C, Oldfield CJ, Ji F, Klitgord N, Cusick ME, Radivojac P, Uversky VN, Vidal M, Iakoucheva LM. IntrinsicDisorder Is a Common Feature of Hub Proteins from Four Eukaryotic Interactomes. PLoS Comput Biol. 2006;2:e100. doi: 10.1371/journal.pcbi.0020100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Dunker AK, Cortese MS, Romero P, Iakoucheva LM, Uversky VN. Flexible nets. The roles of intrinsic disorder in protein interaction networks. FEBS J. 2005;272:5129–5148. doi: 10.1111/j.1742-4658.2005.04948.x. [DOI] [PubMed] [Google Scholar]
- 4.Sinha N, Smith-Gill SJ. Electrostatics in protein binding and function. Curr Protein Pept Sci. 2002;3:601–614. doi: 10.2174/1389203023380431. [DOI] [PubMed] [Google Scholar]
- 5.Sheinerman FB, Norel R, Honig B. Electrostatic aspects of protein-protein interactions. Curr Opin Struct Biol. 2000;10:153–159. doi: 10.1016/s0959-440x(00)00065-8. [DOI] [PubMed] [Google Scholar]
- 6.Bogan AA, Thorn KS. Anatomy of hot spots in protein interfaces. J Mol Biol. 1998;280:1–9. doi: 10.1006/jmbi.1998.1843. [DOI] [PubMed] [Google Scholar]
- 7.Ma B, Elkayam T, Wolfson H, Nussinov R. Protein-protein interactions: Structurally conserved residues distinguish between binding sites and exposed protein surfaces. Proc Natl Acad Sci USA. 2003;100:5772–5777. doi: 10.1073/pnas.1030237100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Jones S, Thornton JM. Principles of protein-protein interactions. Proc Natl Acad Sci USA. 1996;93:13–20. doi: 10.1073/pnas.93.1.13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Berman HM, Westbrook J, Feng Z, Gilliland G, Bhat TN, Weissig H, Shindyalov IN, Bourne PE. TheProtein Data Bank. Nucleic Acids Res. 2000;28:235–242. doi: 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Hermjakob H, Montecchi-Palazzi L, Lewington C, Mudali S, Kerrien S, Orchard S, Vingron M, Roechert B, Roepstorff P, Valencia A, Margalit H, Armstrong J, Bairoch A, Cesareni G, Sherman D, Apweiler R. IntAct: an open source molecular interaction database. Nucleic Acids Res. 2004;32:D452–455. doi: 10.1093/nar/gkh052. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Patil A, Nakamura H. Filtering high-throughput protein-protein interaction data using a combination of genomic features. BMC Bioinformatics. 2005;6:100. doi: 10.1186/1471-2105-6-100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Kinoshita K, Nakamura H. eF-site and PDBjViewer: database and viewer for protein functional sites. Bioinformatics. 2004;20:1329–1330. doi: 10.1093/bioinformatics/bth073. [DOI] [PubMed] [Google Scholar]
- 13.Kabsch W, Sander C. Dictionary of protein secondary structure: Pattern recognition of hydrogen-bonded and geometrical features. Biopolymers. 1983;22:2577–2637. doi: 10.1002/bip.360221211. [DOI] [PubMed] [Google Scholar]
- 14.Glaser F, Rosenberg Y, Kessel A, Pupko T, Ben-Tal N. The ConSurf-HSSP database: The mapping of evolutionary conservation among homologs onto PDB structures. Proteins. 2005;58:610–617. doi: 10.1002/prot.20305. [DOI] [PubMed] [Google Scholar]
- 15.Kyte J, Doolittle RF. A simple method for displaying the hydropathic character of a protein. J Mol Biol. 1982;157:105–132. doi: 10.1016/0022-2836(82)90515-0. [DOI] [PubMed] [Google Scholar]
- 16.Crowley PB, Golovin A. Cation-π interactions in protein-protein interfaces. Proteins. 2005;59:231–239. doi: 10.1002/prot.20417. [DOI] [PubMed] [Google Scholar]
- 17.Hicke L, Schubert HL, Hill CP. Ubiquitin-BindingDomains. Nat Rev Mol Cell Biol. 2005;6:610–621. doi: 10.1038/nrm1701. [DOI] [PubMed] [Google Scholar]
- 18.Harper JW, Schulman BA. Structural complexity in ubiquitin recognition. Cell. 2006;124:1133–1136. doi: 10.1016/j.cell.2006.03.009. [DOI] [PubMed] [Google Scholar]
- 19.Saitoh T, Ikegami T, Nakayama M, Teshima K, Akutsu H, Hase T. NMR Study of the Electron Transfer Complex of Plant Ferredoxin and Sulfite Reductase: Mapping the interaction sites of Ferredoxin. J Biol Chem. 2006;281:10482–10488. doi: 10.1074/jbc.M510530200. [DOI] [PubMed] [Google Scholar]
- 20.Lee L-P, Tidor B. Optimization of electrostatic binding free energy. J Chem Phys. 1997;106:8681–8690. [Google Scholar]
- 21.Chong LT, Dempster SE, Hendsch ZS, Lee LP, Tidor B. Computation of electrostatic complements to proteins: a case of charge stabilized binding. Protein Sci. 1998;7:206–210. doi: 10.1002/pro.5560070122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Lee LP, Tidor B. Optimization of binding electrostatics: charge complementarity in the barnase-barstar protein complex. Protein Sci. 2001;10:362–377. doi: 10.1110/ps.40001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Joughin BA, Green DF, Tidor B. Action-at-a-distance interactions enhance protein binding affinity. Protein Sci. 2005;14:1363–1369. doi: 10.1110/ps.041283105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Rajamani D, Thiel S, Vajda S, Camacho CJ. Anchor residues in protein-protein interactions. Proc Natl Acad Sci USA. 2004;101:11287–11292. doi: 10.1073/pnas.0401942101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Tsai CJ, Lin SL, Wolfson HJ, Nussinov R. Studies of protein-protein interfaces: A statistical analysis of the hydrophobic effect. Protein Sci. 1997;6:53–64. doi: 10.1002/pro.5560060106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Gallivan JP, Dougherty DA. Cation-π interactions in structural biology. Proc Natl Acad Sci USA. 1999;96:9459–9464. doi: 10.1073/pnas.96.17.9459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Hu Z, Ma B, Wolfson H, Nussinov R. Conservation of polar residues as hot spots at protein interfaces. Proteins. 2000;39:331–342. [PubMed] [Google Scholar]
- 28.De S, Krishnadev O, Srinivasan N, Rekha N. Interaction preferences across protein-protein interfaces of obligatory and non-obligatory components are different. BMC Struct Biol. 2005;5:15. doi: 10.1186/1472-6807-5-15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Nooren IMA, Thornton JM. Diversity of protein-protein interactions. EMBO J. 2003;22:3486–3492. doi: 10.1093/emboj/cdg359. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Kim WK, Henschel A, Winter C, Schroeder M. The Many Faces of Protein-Protein Interactions: A Compendium of Interface Geometry. PLoS Comput Biol. 2006;2:e124. doi: 10.1371/journal.pcbi.0020124. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Keskin O, Nussinov R. Similar Binding Sites and Different Partners: Implications to Shared Proteins in Cellular Pathways. Structure. 2007;15:341–354. doi: 10.1016/j.str.2007.01.007. [DOI] [PubMed] [Google Scholar]
- 32.Shanahan HP, Thornton JM. An examination of the conservation of surface patch polarity for proteins. Bioinformatics. 2004;20:2197–2204. doi: 10.1093/bioinformatics/bth218. [DOI] [PubMed] [Google Scholar]
- 33.Kraulis PJ. MOLSCRIPT: A Program to Produce Both Detailed and Schematic Plots of Protein Structures. J Appl Crystallogr. 1991;24:946–950. [Google Scholar]
- 34.Merritt EA, Bacon DJ. Raster3D: Photorealistic Molecular Graphics. Meth Enzymol. 1997;277:505–524. doi: 10.1016/s0076-6879(97)77028-9. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.





