Abstract
Pure Gō models (where every native interaction equally stabilizes the folded state) have widely proved their convenience in the computational investigation of protein folding. However, a chemistry-based description of the real interactions also provides a desirable tune in the analysis of the folding process, and thus some hybrid Gō potentials that combine both aspects have been proposed. Among all the noncovalent interactions that contribute to protein folding, hydrogen bonds are the only ones with a partial covalent character. This feature makes them directional and, thus, more difficult to model as part of the coarse-grained descriptions that are typically employed in Gō models. Thanks to a simplified but rigorous representation of backbone hydrogen bonds that we have recently proposed, we present in this article a combined potential (Gō + backbone hydrogen bond) to study the thermodynamics of protein folding in the frame of very simple simulation models. We show that the explicit inclusion of hydrogen bonds leads to a systematic improvement in the description of protein folding. We discuss a representative set of examples (from two-state folders to downhill proteins, with different types of native structures) that reveal a relevant agreement with experimental data.
Introduction
According to the principle of minimal frustration (1,2), the folding pathway is funnel-shaped and is encoded in the native structure of the protein. As the relationship between the native structure and the kinetic and thermodynamic properties of the folding process is experimentally well established (3–5), many simulation models, known as Gō (or structure-based) potentials, have been suggested considering only native interactions (6–8). This simplified view of the system energetics is usually accompanied by intermediate resolution descriptions of the chain geometry, where each amino acid is represented by a few centers of interaction (frequently just the α-carbons). Regardless of their dual simplicity, these so-called coarse-grained Gō models (9–12) have shown a remarkable ability to describe protein folding, in addition to profiting from a manageable definition and a reduced computational cost.
In some cases, however, Gō models fail due to the oversimplification of the protein energetics (13–16), especially if there are intermediates involving nonnative interactions (17) or alternative stable structures such as aggregates (18). This highlights the importance of the chemical nature of the interactions themselves and the role of nonnative interactions in the ruggedness of the folding funnel (19). As a result, a kind of second-generation Gō models that explicitly consider individual interactions have arisen (20,21). The resulting potentials enhance the accuracy of the original Gō models and provide a more detailed tool for protein folding studies.
If we evaluate each type of interaction individually, the backbone hydrogen bonds deserve special attention. In terms of biological relevance, they have a role all their own, as they guide the secondary structure formation and are said to be determinant in the protein structure specificity (22,23). In addition, their partially covalent nature makes them directional and stronger than other interactions such as hydrophobic ones.
The combination of a reduced but accurate representation of hydrogen bonds and a Gō-type potential seems, then, especially appealing. The lack of an explicit definition of amide and carbonyl groups in α-carbon models makes single-bead hydrogen-bond representations quite elusive, so-seldom has this combination been reported (24). Instead, hydrogen bonds have sometimes been inserted as directional modifiers of the native interaction strength (25), although this alternative misdescribes the directionality of hydrogen bonds in strands (26). In other examples, more centers of interaction per amino acid for either hydrogen bond (27) or Gō interactions (28) are necessary, leading to more complex and time-consuming potentials. In any case, previous works have focused on the structural description of proteins or specific cases like protein aggregation (28), but no rigorous analysis about the effect of hydrogen bonds on protein folding has been carried out to our knowledge.
In this article, we aim to fill this gap and evaluate the general impact of a hydrogen-bond-explicit definition on Gō models, using only an α-carbon representation and paying special attention to the effect on the folding thermodynamic and structural characteristics. The strength of our work lies in a careful election of the potentials. As a Gō model, we have used a potential from Prieto et al. (12) that has been successfully applied to many cases, displaying good agreement with experimental data and a remarkable cooperativity by itself (29–32).
The election of the hydrogen-bond potential is particularly problematic. Although many reduced models have been propounded lately (33–35), they are usually forced to sacrifice an accurate geometry for a single bead representation (36). These topology distortions make them unsuitable for the combination with α-carbon Gō models, as they are somehow incompatible by definition. Rather, we have used a hydrogen-bond model recently introduced by ourselves in Enciso and Rey (37), which has been especially designed to mirror the geometric characteristics of natural hydrogen bonds and reproduces the formation of secondary structures in peptidic systems. By using this approach, the hydrogen-bond potential in this work models both native and nonnative hydrogen bonds, whereas the rest of native interactions are considered through the structure-based potential. In this way, we treat hydrogen-bond interactions in a rigorous way and introduce some frustration in the folding pathway at the same time.
If we pursue the obtention of a realistic thermodynamic view of protein folding, the combination of these two potentials raises a number of questions. First, we must resolve whether the inclusion of hydrogen bonds in this kind of simplified simulation model has any impact on the folding process characteristics. Secondly, we must determine whether this impact induces any kind of bias in the general protein behavior, or instead results in broad applicability.
In this article, we analyze the folding characteristics of a representative bunch of globular proteins. We analyze some proteins widely studied both experimentally and theoretically that, presenting different native structures, have led us to obtain a general overview of our combined potential characteristics. We then go on to focus on proteins where our previous experience shows that pure Gō models fail in their description (e.g., results depending on the experimental structure (31) or downhill folders predicted as two-state ones with Gō models, see below).
We have found that the specific consideration of hydrogen bonds leads to a better description of the folding thermodynamics than the Gō model on its own, reproducing the experimental folding characteristics more accurately, even in the absence of sequence-dependent terms.
Methods
The model
In this work, proteins are described through an off-lattice representation where each amino acid i is represented by a hard sphere centered at the α-carbon position. Each unit is linked to its neighbors by a virtual bond vector of length 3.8 Å, corresponding to a trans peptide bond. An auxiliary vector hi perpendicular to the plane defined by adjacent virtual bond vectors of each bead i is also defined (see Fig. 1 in Enciso and Rey (37)).
Interactions between bead pairs are defined in terms of a Gō potential (12) and a hydrogen-bond one (37). Because they have been thoroughly described elsewhere, only their main features will be discussed here.
We have used a Gō model, proposed by our group some years ago, that has provided excellent results in folding studies (29–32,38). It consists of a harmonic well potential centered at the native distance between α-carbons, (where i and j are the interacting beads). There are two kinds of interactions: local interactions (between residues i and i + 2 and residues i and i + 3) that are always present and preserve the intrinsic chain rigidity of a polypeptidic system and the proper chirality; and long-range interactions (between i and j, where j > i + 3) that are only considered if there is a contact in the native state (i.e., the native distance between any pair of heavy atoms from the referred amino acids is <4.5 Å, according to the experimental PDB structure (39)). One interesting characteristic of our structure-based model is that it reproduces a cooperative transition in proteins that experimentally have been shown to fold via a two-state process. Actually, we have previously checked that the lack of cooperativity that is sometimes assigned to simple models with only one bead per residue and pairwise interactions (see, e.g., Chan et al. (40) and references therein) is partially due to the mathematical form of the attraction basin (12). Our model was designed to correct, at least partially, those effects, by using a narrow attractive well. With that definition, we have been able to reproduce both folding processes with rather narrow, cooperative folding transitions, as well as downhill folding processes, in reasonable agreement with experimental results (30,32).
On the side of hydrogen-bond interactions, we have used a very recent model, also introduced by ourselves, that has proved its ability in the obtention of secondary structure elements for peptides (37). The special care in the geometric characteristics of the secondary structures (as similar as possible to those appearing in native proteins) makes it ideal for our study.
The hydrogen-bond energy between any pair of residues i and j (where j > i + 2 and j ≠ i + 4) is evaluated in a two-part process: first, we check three geometrical restrictions (i.e., length of the tentative hydrogen bond between beads, represented by the distance rij between the corresponding beads, orientation between auxiliary vectors, determined by |cos(hi · hj)|, and relative orientation between the auxiliary vectors and the tentative hydrogen bond, given by |cos(hi . rij)| and |cos(hj . rij)|). Then, if the value of each restriction falls within certain limits, a steplike potential applies. The acceptable ranges and the interaction strength differ depending on the kind of hydrogen bond (either helical or β-type). Although their numerical ranges are described in detail elsewhere (Table I in Enciso and Rey (37)), we include here the main data:
For the first restriction, 4.7 Å ≤ rij ≤ 5.6 Å.
For the second restriction, 0.74 ≤ |cos (hi · hj)| ≤ 0.93 if it is a local interaction, and 0.75 ≤ |cos (hi · hj)| ≤ 1.0 if it is not.
For the third restriction, 0.92 ≤ |cos(hi or j · rij)| ≤ 1.00 if it is a local interaction, and 0.94 ≤|cos(hi or j · rij)| ≤ 1.0 if it is not.
This hydrogen-bond potential follows the same philosophy of other models previously used in the literature (33,34,41). We have checked, however, that in a reasonable temperature range our model provides elements of secondary structure with a more natural structure than those obtained with previously reported models, when only the hydrogen-bond interactions are taken into consideration (37). We should also mention that, as described, our hydrogen-bond model is only defined as a pairwise interaction, without any additional term enforcing cooperativity, as it has been included in other implementations of coarse-grained models for hydrogen-bond interactions (41).
We remark that hydrogen bonds can be formed between every pair of residues, regardless of their relative positions in the native structure. Thus, we are not only defining hydrogen bonds in a more accurate way, but also considering the ruggedness of the folding funnel within our potential definition.
Logically, some Gō contacts also correspond to the formation of hydrogen bonds in the native state. In these cases, the Gō contact is removed from our interaction definition to avoid overstabilizations, turning the detection of native hydrogen bonds into a relevant part of this work. As the Gō potential is based on the PDB structure, we pursue a hydrogen-bond definition fully consistent with the PDB information. This information is indirectly included in the PDB file headers through the secondary structure assignments, but the precise hydrogen-bonded pairs are not stated. To obtain them, we first tried to use the usual methods for hydrogen-bond detection (e.g., DSSP (42) or STRIDE (43)), but we found that they present slight differences with these PDB headers. Being that our hydrogen-bond potential is able to detect nearly 90% of native hydrogen bonds (as defined by PDB headers) in raw (unminimized) PDB structures (37), we decided to let our system relax under the effect of both the full Gō and hydrogen-bond potentials. In this way, we are able to obtain relaxed structures that are fully compatible with these headers and show no remarkable distortions (their root mean-square deviation compared to the original PDB structure is <0.5 Å, computed from the α-carbon positions).
Once all the native hydrogen bonds are detected, they are removed from the Gō potential definition, resulting in a decrease in the number of Gō contacts (compared to the pure Gō potential) when the combined potential is built. As an example, the resulting contact map for the B domain of protein G (PDB code 2GB1 (44)) is shown in Fig. 1. The mauve squares (online color only) correspond to Gō contacts and the black ones are native interactions that have been removed from the Gō potential because they correspond to native hydrogen bonds (see legend of Fig. 1 for a further explanation).
The relative weight of Gō and hydrogen-bond interactions has been properly modulated as a model parameter (wHB = 2.0 wGO) to give optimal results. We have performed several tests, obtaining similar upshots within a 25% variation of this value.
Data generation
To study the characteristics of the whole energetic and structural landscape for our model, we have used a parallel tempering (45) Monte Carlo simulation algorithm, as previously described (12). We have carried out single-chain numerical experiments with 20–30 temperatures each, depending on the system size and complexity. Each full simulation starts from a completely extended conformation for each chain and consists of 5 × 106 Monte Carlo cycles at every temperature after 3 × 106 equilibration cycles. In each cycle, every bead of the system is subjected to a trial Monte Carlo move.
The results presented here correspond to statistical averages over the sampling at every temperature and over different independent runs. For each system, three or five independent runs have been carried out.
Trajectory analysis
Once the simulations are completed, the thermodynamic analysis of our numerical results has been performed by the weighted-histogram-analysis method (WHAM) (46). We have converted the reduced units used in the calculations to real ones by correlating the experimental denaturation temperature of each protein (taken from the literature) to our transition temperature, computed as the maximum of the calculated heat capacity curve. Once the temperature is rescaled, the energy conversion is straightforward.
Results and Discussion
Our purpose is to evaluate whether an explicit consideration of hydrogen bonds improves the thermodynamic properties of folding in combination with classical Gō models. Thanks to an accurate single-bead hydrogen bond (37), we aim to know whether this approach improves aspects like cooperativity and free energy barriers along the folding process. Furthermore, we want to estimate whether this effect is a kind of a priori bias or provides a correct modulation of the free energy landscape of the transition for a given protein. For that purpose, we have structured this section according to different questions, from the most basic to more subtle and specific ones.
Do hydrogen bonds have any effect on folding for a single coarse-grained model like ours?
To find out, we started with the study of a classical test system (12,47–49), a paradigm of a two-state folder: the immunoglobulin binding domain of streptococcal protein G, also called GB1. We used the NMR-resolved structure, with PDB code 2GB1 (44,50) (note that, from now on, we will refer to each protein by its code).
In panel I of Fig. 2, we show a cartoon representation of 2GB1, as well as its heat capacity curve versus temperature (Fig. 2, I a), and the free energy profile versus total energy at the transition temperature resulting from WHAM calculations (Fig. 2, I b). Generally speaking, sharp maxima in heat capacity curves indicate cooperative foldings, whereas free energy profiles provide the energy distributions of the native and denatured states at a given temperature (in this case, the transition temperature), as well as the related free energy barrier. Regarding the semiquantitativeness of our results, two-state folders have free energy barriers greater than 5 kJ·mol–1 (51); smaller barriers lie below or very close to thermal fluctuations and indicate a downhill folding.
In each plot of Fig. 2, the black curve represents the results computed using the pure Gō potential, used as reference, whereas the red curve comes from the combination of Gō and hydrogen-bond interactions introduced in this work. As we can see in Fig. 2 I a, both heat capacity curves present a sharp peak, typical of cooperative transitions as it has been experimentally found for GB1. Hydrogen bonds enhance this cooperativity, also increasing the free energy barrier (see Fig. 2 I b), despite the fact that this cooperativity is not explicitly included in our model definition of the hydrogen-bond interactions. By exploring the minima corresponding to the folded and unfolded states in the free energy profiles, we can connect the differences imposed by the presence of the hydrogen bonds to a better definition of the native state (mediated by the specific definition of hydrogen bonds in the native structure) merged with a destabilization of the denatured one, where the removal of the Gō contacts that form native hydrogen bonds reduces the residual native structure in the denatured state.
Gō models strongly depend on the specific experimental structure, leading to different results for the same protein depending on the specific definition of the structure (30,38). Encouraged by the good results on 2GB1, we can raise our next question.
Can hydrogen bonds correct this logical but undesirable feature of plain Gō models?
The case of GB1 is particularly representative, as it has been observed (30) that the previously studied NMR structure (2GB1) leads to different results compared with the x-ray one (1PGB (52)). The results for 1PGB are shown in Fig. 2 II. If we start focusing on the black curves, we can clearly appreciate the differences between 2GB1 and 1PGB, where the latter one presents an incorrect barrierless folding (free energy barrier at ∼3 kJ·mol–1) and a poorly defined native state at the transition temperature. The inclusion of hydrogen bonds (red curves) has a clear effect, sharpening the description of the energetics of the native state. As a result, we recover an almost identical transition to the 2GB1 case, proving that the explicit consideration of hydrogen bonds provides fully comparable results for this protein, independently of the experimental structure used to define the Gō potential.
After these results, we could think that the explicit consideration of hydrogen bonds in the model always leads to an increase in the free energy barrier, independently of the protein itself. As this has been also suggested for all-α proteins (27), this will be our following task, leading to another question.
Do hydrogen bonds insert a particular bias in the free energy surface for protein folding in our model?
To try to answer this, we have undertaken the study of the folding process for some α-proteins. As an example, we show, in Fig. 2 III, the amino terminal domain of phage 434 repressor (PDB code 1R69 (53,54)), an all-α protein that presents a cooperative folding (55). In our results, differences between the former model and the new one are rather subtle, but still meaningful. In both cases, we obtain a sharp heat capacity peak, distinctive of two-state folders. In the free energy curves, the combined potential shows a slight reduction in the barrier that refutes the hypothesis of a systematic barrier increase. For this protein, there are no changes in the native basin between both models, but the denatured basin presents the already observed shift toward higher energies, linked to the removal of some Gō contacts. This is accompanied by a widening of this basin, leading to a slight reduction in the free energy barrier as a net effect, as already mentioned. The broadening of the denatured basin has a structural source: it is due to the presence of some residual helical structure that, interestingly, has been claimed to be essential for the folding of this protein (55). In Fig. 3 a we show the frequency of hydrogen bonds that we have detected in our simulations for the denatured state of 1R69 at the transition temperature. It presents a low proportion of helical contacts that are not circumscribed to those present in the native state.
As a matter of fact, a remaining helical content has been stated to be a general feature of the denatured state of helical proteins (56). The lack of an explicit consideration of hydrogen bonds in pure Gō models may result in a vague representation of residual helicity in the denatured state. Our approach surmounts this lack of specificity, dealing with these interactions in a more rigorous way. As an additional example, we show in Fig. 2 IV the engrailed homeodomain (PDB code 1ENH (57,58)). Numerous experimental and computational studies state that 1ENH is a two-state folder that nevertheless presents a high folding rate. In this protein, the residual structure is also thought to play an important role in the folding transition (59). The pure Gō results from our reference model show a relatively low heat capacity peak and a free energy barrier of ∼5 kJ·mol–1, edging the downhill limit. Adding hydrogen bonds leads to a small increase in the folding cooperativity. In addition, the free energy barrier grows up to 7 kJ·mol–1, typical of two-state folders: the widening in the denatured state with the combined model (also observed here) is counterbalanced by a better defined native state. In conclusion, the introduction of hydrogen bonds in this kind of protein implies the retention of some residual helical structure, providing a general broadening of the denatured basin but without forcing any systematic bias on the thermodynamical folding properties, which still depend on the intrinsic characteristics of the analyzed protein.
Once the impact of hydrogen bonds in helical proteins is correctly understood, how do they affect β-proteins?
We studied a fibronectin type III domain from human tenascin (PDB code 1TEN (60,61)), a typical example of two-state folder whose folding pathway has been thoroughly studied (62,63). Our results are shown in Fig. 2 V and only reveal minor changes when hydrogen bonds are applied. The free energy profile indicates that the native state is slightly better defined (narrower), whereas the denatured one shows no broadening, evidencing that hydrogen bonds do not induce any artifactual residual structure in this case (hydrogen bond map not shown).
Within β-proteins, we have also performed numerical simulations on the SH3 domain (PDB code 1NLO (64)) due to its biological relevance in aggregation studies and its well-established properties (two-state folder with a relatively high folding rate, comparable to helical proteins (65)). Our results, plotted in Fig. 2 VI, show that the heat capacity curves are typical from two-state folders, either including or omitting hydrogen bonds. However, the free energy barrier (tightly linked to the folding rate) gets considerably lower if hydrogen bonds are considered. The origin of this change lies now in the presence of a very broad denatured state, where many configurations within this basin share some residual hydrogen-bond interactions. The corresponding hydrogen-bond frequency map of Fig. 3 b mostly locates them in the distal loop of the protein (residues 39–44). Interestingly, this region is part of the structure of the transition state of this protein and is thought to be experimentally stabilized by a hydrogen-bond interaction (66). Moreover, it has been found that the distal loop plays a determinant role in the dimerization of the SH3 domain (67), showing therefore the importance of hydrogen-bond interactions in the folding and aggregation of this protein.
Up to this point, our systematic study has focused on two-state folders, which are thought to be the most common ones within living organisms. However, some small proteins can also exhibit a downhill folding where the free energy barrier is negligible (51).
Can hydrogen bonds contribute to a better description of barrierless folders?
We started by simulating the most common example of downhill folder, the E3 binding domain of the dihydrolipoamide succinyltransferase core from the 2-oxoglutarate dehydrogenase multienzyme complex of Escherichia coli, also known as BBL protein (PDB code 1BBL (68)). We have previously studied this protein with the plain Gō model, showing that downhill folding processes can be adequately simulated with this approach (30). However, the effect of hydrogen bonds is unknown. The results shown in Fig. 2 VII are characteristic of downhill folders, with a very broad and shallow heat capacity curve and without any barrier in the free energy profile at the transition temperature. This also proves that the inclusion of hydrogen bonds in our simulations does not induce free energy barriers if they are not present in the real system.
The success in the previous cases fostered our study of the gpW protein (PDB code 1HYW (69)). This α + β protein experimentally presents a very small folding barrier, showing an overall downhill folding (70,71). Knowledge-based potentials have given quite elusive results (72), and so does our plain Gō model. Shown in the black curves on Fig. 2 VIII, it exhibits a clear two-state behavior. However, the inclusion of hydrogen bonds (red curves) flattens the heat capacity curve and considerably reduces the free energy barrier down to ∼4 kJ·mol–1, a typical result of a downhill protein with a barrier smaller than the thermal fluctuations. Moreover, we get an additional confirmation that our hydrogen-bond interactions do not systematically enhance the cooperativity of the folding transition. Instead, they seem to correct some deficiencies of our structure-based model to describe the thermodynamics of the folding transition in several proteins from different structural families and with distinct folding behaviors, making the simulation results of the new combined model closer to the experimental evidence in every case tested.
Summary and Conclusions
Structure-based potentials have traditionally proved their broad applicability in the study of protein folding when nonnative interactions may play a minor role. However, to take these effects into account, these models are recently adapting their interaction descriptions to the chemistry of the particular interactions (20,21). Among them, hydrogen bonds deserve special attention not only because of their biological relevance but also due to the partially covalent nature that makes them directional and stronger than other interactions. This directionality is responsible for the difficulty in achieving an accurate description in computational models with only one center of interaction per amino acid.
In this work we have undertaken this task thanks to a hydrogen-bond model in which this dichotomy is specifically tackled (37), mixed with a Gō potential of proven good quality (12). We have centered this study on the comparison of the resulting combined potential compared to the plain Gō model. Our aim has been to find out whether the inclusion of hydrogen bonds can improve the thermodynamic description of protein folding. Therefore, we have studied a set of representative cases pursuing two aims:1), check whether the good properties of Gō models remain, and 2), find out whether there is a perceptible improvement in the thermodynamic description of protein folding.
Regarding the first aim, we studied a well-known case where the Gō potential had previously proved its aptitude: the two-state folder 2GB1 (12). The introduction of hydrogen bonds does not imply any kind of undesirable effect, maintaining the general two-state scenario and even enhancing the free energy barrier of folding, even though our hydrogen-bond model does not explicitly include a term enhancing cooperativity.
Our model with hydrogen bonds does not only have a modulating role on systems where the Gō model already presented adequate results. We have also applied this approach to proteins where simple Gō potentials do not give a proper description of their folding process. Being the same protein as 2GB1, the example of 1PGB is particularly illustrative because the Gō model used here predicts a completely different (downhill) folding (31). In this case, our combined potential is able to correct this Gō artifact, presenting again a two-state folding identical to 2GB1.
Once our approach was found valid for general cases, we carried out a systematic analysis of folding events using proteins with different types of native structures. To illustrate this, we have shown a couple of examples of all-α (1R69 and 1ENH) and all-β (1TEN and 1NLO) proteins. The role of hydrogen bonds in all-α proteins is especially relevant, as a residual helical content has been observed in the denatured state of many of them (55,56,59). This effect, which the pure Gō model cannot properly reproduce, is correctly modeled when hydrogen bonds are explicitly considered. Interestingly, they do not force changes in the height of the free energy barriers, but correctly modulate them in agreement with experimental results.
In relation to all-β proteins, no systematic remaining structure is observed in their unfolded state, like in 1TEN. In 1NLO, however, some residual structure is observed. Interestingly, these partial interactions are not a model artifact, but they seem to have a biological meaning: they are mainly placed in regions experimentally stabilized by hydrogen bonds in the transition state and with a crucial role in the folding of this protein. Thus, the folding of this protein according to our model constitutes a valuable example of the synergistic effect of this combined potential and its applicability to protein folding studies.
Although we have mainly focused on two-state folders, the combined potential seems to work well in downhill folders too. We studied 1BBL, a typical example where our Gō potential had already given correct results (30). We checked that the introduction of hydrogen bonds does not have any undesirable effect, and downhill behavior is conserved. Our last challenge has been to determine whether the combined potential can simulate downhill folders even when the Gō potential cannot detect them. For that purpose, we studied 1HYW, a downhill protein with a small folding barrier where the Gō model predicts a two-state transition. Interestingly, the insertion of hydrogen bonds considerably reduces it, reaching the downhill range.
As a general view, we can conclude that all these profitable results are mainly based on three issues: a better definition of the native state in most cases; a certain removal of Gō-like residual structure in the denatured state; and a partial retention of helical structure in the denatured state (if this kind of structure is preponderant in the native conformation). As a result, the protein thermodynamics is not biased but adequately modulated according to the intrinsic characteristics of the protein itself: In some cases, the cooperativity of the folding transition is increased when hydrogen bonds are included. In other proteins, the folding barrier is not substantially modified, or it is even reduced. In all the cases, our hydrogen-bond model has contributed to make the simulation results of the combined model closer to the experimental evidence than the results from our Gō model alone.
In conclusion, we have successfully combined a Gō potential and a hydrogen-bond model to simulate the thermodynamic properties of the folding process for a representative set of proteins. Our results, that semiquantitatively match experimental data, show that this combination of potentials is an excellent candidate for carrying out these and other folding studies.
Acknowledgments
This work was partially supported by the Spanish Ministerio de Ciencia e Innovación (grant No. FIS2009-13364-C02-02), by Comunidad Autónoma de Madrid (grant No. S2009/PPQ-1551), and by Universidad Complutense de Madrid/Banco Santander (grant No. GR35/10-A-910068). M. E. acknowledges a Scholarship from the Spanish Ministerio de Educación (FPU program).
References
- 1.Bryngelson J.D., Wolynes P.G. Spin glasses and the statistical mechanics of protein folding. Proc. Natl. Acad. Sci. USA. 1987;84:7524–7528. doi: 10.1073/pnas.84.21.7524. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Onuchic J.N., Nymeyer H., Socci N.D. The energy landscape theory of protein folding: insights into folding mechanisms and scenarios. Adv. Protein Chem. 2000;53:87–152. doi: 10.1016/s0065-3233(00)53003-4. [DOI] [PubMed] [Google Scholar]
- 3.Riddle D.S., Grantcharova V.P., Baker D. Experiment and theory highlight role of native state topology in SH3 folding. Nat. Struct. Biol. 1999;6:1016–1024. doi: 10.1038/14901. [DOI] [PubMed] [Google Scholar]
- 4.Fersht A.R. Transition-state structure as a unifying basis in protein-folding mechanisms: contact order, chain topology, stability, and the extended nucleus mechanism. Proc. Natl. Acad. Sci. USA. 2000;97:1525–1529. doi: 10.1073/pnas.97.4.1525. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Naganathan A.N., Muñoz V. Scaling of folding times with protein size. J. Am. Chem. Soc. 2005;127:480–481. doi: 10.1021/ja044449u. [DOI] [PubMed] [Google Scholar]
- 6.Taketomi H., Ueda Y., Gō N. Studies on protein folding, unfolding and fluctuations by computer simulation. I. The effect of specific amino acid sequence represented by specific inter-unit interactions. Int. J. Pept. Protein Res. 1975;7:445–459. [PubMed] [Google Scholar]
- 7.Sułkowska J.I., Cieplak M. Selection of optimal variants of Gō-like models of proteins through studies of stretching. Biophys. J. 2008;95:3174–3191. doi: 10.1529/biophysj.107.127233. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Hills R.D., Jr., Brooks C.L., 3rd Insights from coarse-grained Gō models for protein folding and dynamics. Int. J. Mol. Sci. 2009;10:889–905. doi: 10.3390/ijms10030889. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Onuchic J.N., Wolynes P.G. Theory of protein folding. Curr. Opin. Struct. Biol. 2004;14:70–75. doi: 10.1016/j.sbi.2004.01.009. [DOI] [PubMed] [Google Scholar]
- 10.Mirny L., Shakhnovich E. Protein folding theory: from lattice to all-atom models. Annu. Rev. Biophys. Biomol. Struct. 2001;30:361–396. doi: 10.1146/annurev.biophys.30.1.361. [DOI] [PubMed] [Google Scholar]
- 11.Levy Y., Wolynes P.G., Onuchic J.N. Protein topology determines binding mechanism. Proc. Natl. Acad. Sci. USA. 2004;101:511–516. doi: 10.1073/pnas.2534828100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Prieto L., de Sancho D., Rey A. Thermodynamics of Gō-type models for protein folding. J. Chem. Phys. 2005;123:154903. doi: 10.1063/1.2064888. [DOI] [PubMed] [Google Scholar]
- 13.Gosavi S., Chavez L.L., Onuchic J.N. Topological frustration and the folding of interleukin-1 β. J. Mol. Biol. 2006;357:986–996. doi: 10.1016/j.jmb.2005.11.074. [DOI] [PubMed] [Google Scholar]
- 14.Clementi C., Nymeyer H., Onuchic J.N. Topological and energetic factors: what determines the structural details of the transition state ensemble and “en-route” intermediates for protein folding? An investigation for small globular proteins. J. Mol. Biol. 2000;298:937–953. doi: 10.1006/jmbi.2000.3693. [DOI] [PubMed] [Google Scholar]
- 15.Gsponer J., Caflisch A. Molecular dynamics simulations of protein folding from the transition state. Proc. Natl. Acad. Sci. USA. 2002;99:6719–6724. doi: 10.1073/pnas.092686399. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Karanicolas J., Brooks C.L., 3rd The importance of explicit chain representation in protein folding models: an examination of Ising-like models. Proteins. 2003;53:740–747. doi: 10.1002/prot.10459. [DOI] [PubMed] [Google Scholar]
- 17.Nabuurs S.M., Westphal A.H., van Mierlo C.P.M. Extensive formation of off-pathway species during folding of an α-β parallel protein is due to docking of (non)native structure elements in unfolded molecules. J. Am. Chem. Soc. 2008;130:16914–16920. doi: 10.1021/ja803841n. [DOI] [PubMed] [Google Scholar]
- 18.Dobson C.M. Protein folding and misfolding. Nature. 2003;426:884–890. doi: 10.1038/nature02261. [DOI] [PubMed] [Google Scholar]
- 19.Paci E., Vendruscolo M., Karplus M. Validity of Gō models: comparison with a solvent-shielded empirical energy decomposition. Biophys. J. 2002;83:3032–3038. doi: 10.1016/S0006-3495(02)75308-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Karanicolas J., Brooks C.L., 3rd Improved Gō-like models demonstrate the robustness of protein folding mechanisms towards non-native interactions. J. Mol. Biol. 2003;334:309–325. doi: 10.1016/j.jmb.2003.09.047. [DOI] [PubMed] [Google Scholar]
- 21.Zarrine-Afsar A., Wallin S., Chan H.S. Theoretical and experimental demonstration of the importance of specific nonnative interactions in protein folding. Proc. Natl. Acad. Sci. USA. 2008;105:9999–10004. doi: 10.1073/pnas.0801874105. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Ozkan S.B., Shell M.S., Weikl T.R. The protein folding problem. Annu. Rev. Biochem. 2008;37:289–316. doi: 10.1146/annurev.biophys.37.092707.153558. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Krantz B.A., Srivastava A.K., Sosnick T.R. Understanding protein hydrogen bond formation with kinetic H/D amide isotope effects. Nat. Struct. Biol. 2002;9:458–463. doi: 10.1038/nsb794. [DOI] [PubMed] [Google Scholar]
- 24.Klimov D.K., Betancourt M.R., Thirumalai D. Virtual atom representation of hydrogen bonds in minimal off-lattice models of α-helices: effect on stability, cooperativity and kinetics. Fold. Des. 1998;3:481–496. doi: 10.1016/s1359-0278(98)00065-0. [DOI] [PubMed] [Google Scholar]
- 25.Cheung M.S., Finke J.M., Onuchic J.N. Exploring the interplay between topology and secondary structural formation in the protein folding problem. J. Phys. Chem. B. 2003;107:11193–11200. [Google Scholar]
- 26.Yap E.-H., Fawzi N.L., Head-Gordon T. A coarse-grained α-carbon protein model with anisotropic hydrogen-bonding. Proteins. 2008;70:626–638. doi: 10.1002/prot.21515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Hardin C., Luthey-Schulten Z., Wolynes P.G. Backbone dynamics, fast folding, and secondary structure formation in helical proteins and peptides. Proteins. 1999;34:281–294. [PubMed] [Google Scholar]
- 28.Ding F., Dokholyan N.V., Shakhnovich E.I. Molecular dynamics simulation of the SH3 domain aggregation suggests a generic amyloidogenesis mechanism. J. Mol. Biol. 2002;324:851–857. doi: 10.1016/s0022-2836(02)01112-9. [DOI] [PubMed] [Google Scholar]
- 29.Prieto L., Rey A. Influence of the chain stiffness on the thermodynamics of a Gō-type model for protein folding. J. Chem. Phys. 2007;126:165103. doi: 10.1063/1.2727465. [DOI] [PubMed] [Google Scholar]
- 30.Prieto L., Rey A. Influence of the native topology on the folding barrier for small proteins. J. Chem. Phys. 2007;127:175101. doi: 10.1063/1.2780154. [DOI] [PubMed] [Google Scholar]
- 31.Prieto L., Rey A. Simulations of the protein folding process using topology-based models depend on the experimental structure. J. Chem. Phys. 2008;129:115101. doi: 10.1063/1.2977744. [DOI] [PubMed] [Google Scholar]
- 32.Larriva M., Prieto L., Rey A. A simple simulation model can reproduce the thermodynamic folding intermediate of apoflavodoxin. Proteins. 2010;78:73–82. doi: 10.1002/prot.22521. [DOI] [PubMed] [Google Scholar]
- 33.Kolinski A. Protein modeling and structure prediction with a reduced representation. Acta Biochim. Pol. 2004;51:349–371. [PubMed] [Google Scholar]
- 34.Hoang T.X., Trovato A., Maritan A. Geometry and symmetry presculpt the free-energy landscape of proteins. Proc. Natl. Acad. Sci. USA. 2004;101:7960–7964. doi: 10.1073/pnas.0402525101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Imamura H., Chen J.Z.Y. Minimum model for the α-helix-β-hairpin transition in proteins. Proteins. 2007;67:459–468. doi: 10.1002/prot.21216. [DOI] [PubMed] [Google Scholar]
- 36.Morozov A.V., Kortemme T. Potential functions for hydrogen bonds in protein structure prediction and design. Adv. Protein Chem. 2005;72:1–38. doi: 10.1016/S0065-3233(05)72001-5. [DOI] [PubMed] [Google Scholar]
- 37.Enciso M., Rey A. A refined hydrogen bond potential for flexible protein models. J. Chem. Phys. 2010;132:235102. doi: 10.1063/1.3436723. [DOI] [PubMed] [Google Scholar]
- 38.Rey-Stolle M.F., Enciso M., Rey A. Topology-based models and NMR structures in protein folding simulations. J. Comput. Chem. 2009;30:1212–1219. doi: 10.1002/jcc.21149. [DOI] [PubMed] [Google Scholar]
- 39.Berman H.M., Westbrook J., Bourne P.E. The Protein Data Bank. Nucleic Acids Res. 2000;28:235–242. doi: 10.1093/nar/28.1.235. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Chan H.S., Zhang Z., Liu Z. Cooperativity, local-nonlocal coupling, and nonnative interactions: principles of protein folding from coarse-grained models. Annu. Rev. Phys. Chem. 2011;62:301–326. doi: 10.1146/annurev-physchem-032210-103405. [DOI] [PubMed] [Google Scholar]
- 41.Vieth M., Kolinski A., Skolnick J. Prediction of quaternary structure of coiled coils. Application to mutants of the GCN4 leucine zipper. J. Mol. Biol. 1995;251:448–467. doi: 10.1006/jmbi.1995.0447. [DOI] [PubMed] [Google Scholar]
- 42.Kabsch W., Sander C. Dictionary of protein secondary structure: pattern recognition of hydrogen-bonded and geometrical features. Biopolymers. 1983;22:2577–2637. doi: 10.1002/bip.360221211. [DOI] [PubMed] [Google Scholar]
- 43.Frishman D., Argos P. Knowledge-based protein secondary structure assignment. Proteins. 1995;23:566–579. doi: 10.1002/prot.340230412. [DOI] [PubMed] [Google Scholar]
- 44.Gronenborn A.M., Filpula D.R., Clore G.M. A novel, highly stable fold of the immunoglobulin binding domain of streptococcal protein G. Science. 1991;253:657–661. doi: 10.1126/science.1871600. [DOI] [PubMed] [Google Scholar]
- 45.Hansmann U.H.E. Parallel tempering algorithm for conformational studies of biological molecules. Chem. Phys. Lett. 1997;281:140–150. [Google Scholar]
- 46.Kumar S., Rosenberg J.M., Kollman P.A. The weighted histogram analysis method for free-energy calculations on biomolecules. I. The method. J. Comput. Chem. 1992;13:1011–1021. [Google Scholar]
- 47.McCallister E.L., Alm E., Baker D. Critical role of β-hairpin formation in protein G folding. Nat. Struct. Biol. 2000;7:669–673. doi: 10.1038/77971. [DOI] [PubMed] [Google Scholar]
- 48.Derreumaux P. Role of supersecondary structural elements in protein G folding. J. Chem. Phys. 2003;119:4940–4944. [Google Scholar]
- 49.Shimada J., Shakhnovich E.I. The ensemble folding kinetics of protein G from an all-atom Monte Carlo simulation. Proc. Natl. Acad. Sci. USA. 2002;99:11175–11180. doi: 10.1073/pnas.162268099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Alexander P., Fahnestock S., Bryan P. Thermodynamic analysis of the folding of the streptococcal protein G IgG-binding domains B1 and B2: why small proteins tend to have high denaturation temperatures. Biochemistry. 1992;31:3597–3603. doi: 10.1021/bi00129a007. [DOI] [PubMed] [Google Scholar]
- 51.Naganathan A.N., Doshi U., Muñoz V. Protein folding kinetics: barrier effects in chemical and thermal denaturation experiments. J. Am. Chem. Soc. 2007;129:5673–5682. doi: 10.1021/ja0689740. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Gallagher T., Alexander P., Gilliland G.L. Two crystal structures of the B1 immunoglobulin-binding domain of streptococcal protein G and comparison with NMR. Biochemistry. 1994;33:4721–4729. [PubMed] [Google Scholar]
- 53.Mondragón A., Subbiah S., Harrison S.C. Structure of the amino-terminal domain of phage 434 repressor at 2.0 Å resolution. J. Mol. Biol. 1989;205:189–200. doi: 10.1016/0022-2836(89)90375-6. [DOI] [PubMed] [Google Scholar]
- 54.Laurents D.V., Corrales S., Padmanabhan S. Folding kinetics of phage 434 Cro protein. Biochemistry. 2000;39:13963–13973. doi: 10.1021/bi001388d. [DOI] [PubMed] [Google Scholar]
- 55.Neri D., Billeter M., Wüthrich K. NMR determination of residual structure in a urea-denatured protein, the 434-repressor. Science. 1992;257:1559–1563. doi: 10.1126/science.1523410. [DOI] [PubMed] [Google Scholar]
- 56.Blanco F.J., Serrano L., Forman-Kay J.D. High populations of non-native structures in the denatured state are compatible with the formation of the native folded state. J. Mol. Biol. 1998;284:1153–1164. doi: 10.1006/jmbi.1998.2229. [DOI] [PubMed] [Google Scholar]
- 57.Clarke N.D., Kissinger C.R., Pabo C.O. Structural studies of the engrailed homeodomain. Protein Sci. 1994;3:1779–1787. doi: 10.1002/pro.5560031018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Mayor U., Johnson C.M., Fersht A.R. Protein folding and unfolding in microseconds to nanoseconds by experiment and simulation. Proc. Natl. Acad. Sci. USA. 2000;97:13518–13522. doi: 10.1073/pnas.250473497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Mayor U., Guydosh N.R., Fersht A.R. The complete folding pathway of a protein from nanoseconds to microseconds. Nature. 2003;421:863–867. doi: 10.1038/nature01428. [DOI] [PubMed] [Google Scholar]
- 60.Leahy D.J., Hendrickson W.A., Erickson H.P. Structure of a fibronectin type III domain from tenascin phased by MAD analysis of the selenomethionyl protein. Science. 1992;258:987–991. doi: 10.1126/science.1279805. [DOI] [PubMed] [Google Scholar]
- 61.Clarke J., Hamill S.J., Johnson C.M. Folding and stability of a fibronectin type III domain of human tenascin. J. Mol. Biol. 1997;270:771–778. doi: 10.1006/jmbi.1997.1147. [DOI] [PubMed] [Google Scholar]
- 62.Hamill S.J., Steward A., Clarke J. The folding of an immunoglobulin-like Greek key protein is defined by a common-core nucleus and regions constrained by topology. J. Mol. Biol. 2000;297:165–178. doi: 10.1006/jmbi.2000.3517. [DOI] [PubMed] [Google Scholar]
- 63.Paci E., Clarke J., Karplus M. Self-consistent determination of the transition state for protein folding: application to a fibronectin type III domain. Proc. Natl. Acad. Sci. USA. 2003;100:394–399. doi: 10.1073/pnas.232704999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Feng S., Kapoor T.M., Schreiber S.L. Molecular basis for the binding of SH3 ligands with non-peptide elements identified by combinatorial synthesis. Chem. Biol. 1996;3:661–670. doi: 10.1016/s1074-5521(96)90134-9. [DOI] [PubMed] [Google Scholar]
- 65.Grantcharova V.P., Baker D. Folding dynamics of the Src SH3 domain. Biochemistry. 1997;36:15685–15692. doi: 10.1021/bi971786p. [DOI] [PubMed] [Google Scholar]
- 66.Grantcharova V.P., Riddle D.S., Baker D. Important role of hydrogen bonds in the structurally polarized transition state for folding of the Src SH3 domain. Nat. Struct. Biol. 1998;5:714–720. doi: 10.1038/1412. [DOI] [PubMed] [Google Scholar]
- 67.Guijarro J.I., Morton C.J., Dobson C.M. Folding kinetics of the SH3 domain of PI3 kinase by real-time NMR combined with optical spectroscopy. J. Mol. Biol. 1998;276:657–667. doi: 10.1006/jmbi.1997.1553. [DOI] [PubMed] [Google Scholar]
- 68.Robien M.A., Clore G.M., Gronenborn A.M. Three-dimensional solution structure of the E3-binding domain of the dihydrolipoamide succinyltransferase core from the 2-oxoglutarate dehydrogenase multienzyme complex of Escherichia coli. Biochemistry. 1992;31:3463–3471. doi: 10.1021/bi00128a021. [DOI] [PubMed] [Google Scholar]
- 69.Maxwell K.L., Yee A.A., Davidson A.R. The solution structure of bacteriophage λ protein W, a small morphogenetic protein possessing a novel fold. J. Mol. Biol. 2001;308:9–14. doi: 10.1006/jmbi.2001.4582. [DOI] [PubMed] [Google Scholar]
- 70.Fung A., Li P., Muñoz V. Expanding the realm of ultrafast protein folding: gpW, a midsize natural single-domain with α+β topology that folds downhill. J. Am. Chem. Soc. 2008;130:7489–7495. doi: 10.1021/ja801401a. [DOI] [PubMed] [Google Scholar]
- 71.Bruscolini P., Naganathan A.N. Quantitative prediction of protein folding behaviors from a simple statistical model. J. Am. Chem. Soc. 2011;133:5372–5379. doi: 10.1021/ja110884m. [DOI] [PubMed] [Google Scholar]
- 72.de Sancho D., Rey A. Energy minimizations with a combination of two knowledge-based potentials for protein folding. J. Comput. Chem. 2008;29:1684–1692. doi: 10.1002/jcc.20924. [DOI] [PubMed] [Google Scholar]