Abstract
The characteristic distribution of non-binding interactions in a protein is described. It establishes that hydrophobic interactions can be characterized by suitable 3D Gauss functions while electrostatic interactions generally follow a random distribution. The implementation of this observation suggests differentiated optimization procedure for these two types of interactions. The electrostatic interaction may follow traditional energy optimization while the criteria for convergence shall measure the accordance with 3-D Gauss function.
Background
The “oil drop” model proposed by Kauzmann [1] suggested the protein molecule to be treated as oil drop in hydrophobic environment directing the hydrophobic residues toward the central part of protein body with simultaneous exposure of hydrophilic residues on the protein surface [2]. The model applied here uses the 3-D Gauss function as the function expressing the distribution of hydrophobic interaction in protein body. The highest concentration of hydrophobic interaction is expected in central part of protein body, as it is for Gauss function, which decreases towards the surface reaching zero ( or close to zero) hydrophobicity level, according to its bell-shaped form. The accordance of hydrophobic interaction in protein with the idealized distribution is shown, in contrast to the electrostatic interactions distribution, which appeared to represent the random distribution.
Methodology
Data
The protein 2I5M was selected to demonstrate the example revealing the differences between distributions of electrostatic, vdW-type and hydrophobic interactions [3]. It is the protein of length 66 amino acid residues, which is participating in gene regulation processes. This protein represents bacterial cold shock proteins (Csp) which are widely used as models for the experimental and computational analysis of protein stability. The 2I5M is the mutant A46K and S48R, produced to reveal the particular role in structure stabilization of the original protein (1CSB).
Model
The tertiary structure of the protein is assumed to include a hydrophobic core and involve optimization of all other non-bonding interactions (electrostatic, van der Waals and torsion potential). In contrast to many force fields incorporating the hydrophobic interaction as the component of internal force field, the presence of an external force field is expressed via a threedimensional Gauss function [2] (see equation 1). The value of the Gauss function at any point within the protein body can be treated as the idealized hydrophobicity density, determining the structure of the protein's hydrophobic core.
According to the “fuzzy oil drop” model, theoretical hydrophobicity can be calculated with the use of the Gauss function, assuming that the molecule's geometric center coincides with the origin of the coordinate system. Empirical hydrophobicity distribution is given by Levitt's function [4] (see equation 2). Further normalization of both distributions enables quantitative comparison (observed vs. theoretical values), which in turn, allows us to analyze the emerging discrepancies.
Kullback-Leibler information entropy
The agreement between the idealized and observed hydrophobicity distribution is measured according to the Kullback-Leibler relative (divergence) entropy [6], which quantifies the distance between both distributions. The distance between the observed and theoretical distribution (O/T) was calculated in this study. Since this value can only be interpreted in comparison to other potential solutions, random distribution of hydrophobicity (O/R) was also estimated by assigning equal probability of hydrophobicity to each amino acid (Rj equal to 1/N, where N is the number of amino acids) The relation O/T < O/R was taken as evidence of non-random distribution, approximating theoretical values (see equation 3). The calculation O/T and O/R for hydrophobic, electrostatic and vdW interactions is presented in this paper.
Nonbonding interactions
In order to determine the distribution of non-hydrophobic (i.e. electrostatic and van der Waals) interactions, structures derived from PDB were processed using an energy optimization algorithm to eliminate any local collisions. Electrostatic and van der Waals forces (calculated for each atom separately) were then aggregated over whole amino acids, mimicking the distribution of hydrophobic interactions. The distribution of non-bonding interaction was calculated using Gromacs force field applying the group procedure [7, 8, 9, 10, 11]. Each amino acid on each side of the protein molecule was sequentially defined as a group, to calculate its interaction with the rest of the protein molecule.
Discussion
Distribution of nonbinding interactions in proteins
The characteristics of hydrophobic, electrostatic and vdW interaction is shown in Table 1(see Supplementary material). The results expressing the relation O/T < O/R treated as accordant with assumed “fuzzy oil drop” model are given in bold. Good accordance between the observed and theoretical distribution of hydrophobic interaction can be recognized interpreting the values shown in Table 1 (see Supplementary material). The random coil character was recognized for electrostatic interaction. The “fuzzy oil drop” structure for the vdW interactions has been found for this protein. The graphic presentation of the results is shown in Figure 1.
Figure 1.
Distribution of van der Waals, electrostatic and hydrophobic interactions (top, middle and bottom respectively) in the 2I5M molecule. Dark blue line indicates theoretical distribution (accordant with the 3D Gauss function); pink line shows actual distribution of hydrophobicity in 2I5M; yellow line corresponds to random distribution.
Conclusion
It should be noted that only one protein was presented in this study although the larger group of proteins was presented elsewhere [12]. It was shown that the accordance of vdW interaction is observed rather rarely, although no protein has been shown to represent other than random distribution for electrostatic interaction. These observations which can be extended to larger number of proteins [12, 13] suggest that the optimization procedure applied for pair-wise interaction is proper only for electrostatic interaction (and vdW interaction). The result of such optimization produces the structure of low energy (large stabilization) independently on the localization in the protein body. The optimization of hydrophobic interaction should be treated in different way. This optimization shall be oriented to fit the expected hydrophobic interaction accordant with 3-D Gauss function. The pair-wise interaction for this type of interaction produces different than expected interaction distribution. Sample folding simulations acknowledging the presence of an external field (described by a 3D Gauss function which steers the process towards the generation of a hydrophobic core) were conducted for BPTI [14], lysozyme [15] and T0215 [2]. Similar computations involving the presence of a ligand were performed for human hemoglobin [5] and ribonuclease [16]. The ligand was intended to affect the folding process in such a way, as to ensure the creation of a suitable binding pocket. The influence of the external environment (the cellular membrane) upon the dynamic properties of transmembrane proteins was verified with the use of the “fuzzy oil drop” model [17]. The applicability of “fuzzy oil drop” model to mutation influence on the structure of antifreeze proteins is presented elsewhere [18].
The “fuzzy oil drop” model applied for the identification of ligand binding sites [19] and active sites in hydrolases [20] yields insights into the properties of individual proteins. Deformations in the structure of the hydrophobic core resulting from interaction with external molecules can be used to identify binding sites (associated with areas of biological activity). Liang MP et al. (2003) further elaborated this issue in a comparative study of various software packages used for identification of ligand binding cavities, taking into account their efficiency and correctness of results [20]. A similar comparative analysis of several packages WebFEATURE [21], SuMo [22], ConSurf [23], CASTp [24, 25], PASS [26] and QSiteFinder [27] using methods based on the “fuzzy oil drop” model, can be found elsewhere [20]. The 3-D Gauss function introduced to represent the structure of hydrophobic core is able to describe only the static form of protein molecule. However the dynamics of protein structure plays substantial and critical role in respect to its biological function. This is why the dynamic form of the 3-D Gauss function of different shapes even being far in respect to the regular ordered form of 3-D Gauss function is planned to be applied for simulation of dynamic forms of protein. The structural dynamics of a protein molecule seems to be the effect of external force field deformation. The source of this deformation is the influence of molecules present in close neighbourhood of that protein molecule. Simulation of structural changes of protein molecule as the effect of external force field deformation is under consideration.
Supplementary material
Acknowledgments
The work was financially supported by Jagiellonian University − Medical College − project K/ZDS/001531. We would also like to thank Piotr Nowakowski from ACC CYFRONET AGH for proofreading the document.
Footnotes
Citation:Marchewka et al, Bioinformation 6(8): 300-302 (2011)
References
- 1.W Kauzmann. Adv Protein Chem. 1959;14:1. doi: 10.1016/s0065-3233(08)60608-7. [DOI] [PubMed] [Google Scholar]
- 2.L Konieczny, et al. In Silico Biol. 2006;6:0002. [Google Scholar]
- 3.KE Max, et al. J Mol Biol. 2007;369:1087. doi: 10.1016/j.jmb.2007.04.016. [DOI] [PubMed] [Google Scholar]
- 4.M Levitt. J Mol Biol. 1976;104:59. doi: 10.1016/0022-2836(76)90004-8. [DOI] [PubMed] [Google Scholar]
- 5.M Brylinski, et al. Int J Bioinform Res Appl. 2007;3:234. doi: 10.1504/IJBRA.2007.013605. [DOI] [PubMed] [Google Scholar]
- 6.S Kullback, RA Leibler. Annals of Mathematical Statistics. 1951;22:79. [Google Scholar]
- 7.D Van Der Spoel, et al. J Comput Chem. 2005;26:1701. doi: 10.1002/jcc.20291. [DOI] [PubMed] [Google Scholar]
- 8.D Van Der Spoel. Gromacs User Manual version 3.3. [Google Scholar]
- 9.HJC Berendsen, et al. Comput Phys Commun. 1995;91:43. [Google Scholar]
- 10.E Lindahl, et al. J Mol Model. 2001;7:306. [Google Scholar]
- 11.HJ Berendsen, et al. Ann N Y Acad Sci. 1986;482:269. [Google Scholar]
- 12.D Marchewka, I Roterman. ThinkMind BIOTECHNO 2011, The Third International Conference on Bioinformatics, Biocomputational Systems and Biotechnologies; p. 110.p. 115. [Google Scholar]
- 13.I Roterman, et al. J Theor Biol. 2011;283:60. doi: 10.1016/j.jtbi.2011.05.027. [DOI] [PubMed] [Google Scholar]
- 14.M Brylinski, et al. Biochimie. 2006;88:1229. [Google Scholar]
- 15.M Brylinski, et al. J Biomol Struct Dyn. 2006;23:519. doi: 10.1080/07391102.2006.10507076. [DOI] [PubMed] [Google Scholar]
- 16.M Brylinski, et al. Comp Biol Chem. 2006;30:255. [Google Scholar]
- 17.V Zobnina, I Roterman. Proteins. 2009;77:378. doi: 10.1002/prot.22443. [DOI] [PubMed] [Google Scholar]
- 18.M Banach, et al. J Mol Model. 2011 [Google Scholar]
- 19.M Brylinski, et al. PLoS Comput Biol. 2007;3:e94. doi: 10.1371/journal.pcbi.0030094. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.K Prymula, et al. J Comput Aided Mol Des. 2011;25:117. doi: 10.1007/s10822-010-9402-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.MP Liang, et al. Nucleic Acids Res. 2003;31:3324. doi: 10.1093/nar/gkg553. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.M Jambon, et al. Proteins. 2003;52:137. [Google Scholar]
- 23.M Landau, et al. Nucleic Acids Res. 2005;33:W299. [Google Scholar]
- 24.A Binkowski, et al. Nucleic Acids Res. 2003;31:3356. doi: 10.1093/nar/gkg587. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.J Dundas, et al. Nucleic Acids Res. 2006;34:W116. [Google Scholar]
- 26.GP Brady, PF Stouten. J Comput Aided Mol Des. 2000;14:383. doi: 10.1023/a:1008124202956. [DOI] [PubMed] [Google Scholar]
- 27.AT Laurie, RM Jackson. Bioinformatics. 2005;21:1908. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.