Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2012 Sep 12.
Published in final edited form as: Proteins. 2009 May 1;75(2):430–441. doi: 10.1002/prot.22253

An All-atom Structure-Based Potential for Proteins: Bridging Minimal Models with All-atom Empirical Forcefields

Paul C Whitford a, Jeffrey K Noel a, Shachi Gosavi a, Alexander Schug a, Kevin Y Sanbonmatsu b, José N Onuchic a,*
PMCID: PMC3439813  NIHMSID: NIHMS75684  PMID: 18837035

Abstract

Protein dynamics take place on many time and length scales. Coarse-grained structure-based (Gō) models utilize the funneled energy landscape theory of protein folding to provide an understanding of both long time and long length scale dynamics. All-atom empirical forcefields with explicit solvent can elucidate our understanding of short time dynamics with high energetic and structural resolution. Thus, structure-based models with atomic details included can be used to bridge our understanding between these two approaches. We report on the robustness of folding mechanisms in one such all-atom model. Results for the B domain of Protein A, the SH3 domain of C-Src Kinase and Chymotrypsin Inhibitor 2 are reported. The interplay between side chain packing and backbone folding is explored. We also compare this model to a Cα structure-based model and an all-atom empirical forcefield. Key findings include 1) backbone collapse is accompanied by partial side chain packing in a cooperative transition and residual side chain packing occurs gradually with decreasing temperature 2) folding mechanisms are robust to variations of the energetic parameters 3) protein folding free energy barriers can be manipulated through parametric modifications 4) the global folding mechanisms in a Cα model and the all-atom model agree, although differences can be attributed to energetic heterogeneity in the all-atom model 5) proline residues have significant effects on folding mechanisms, independent of isomerization effects. Since this structure-based model has atomic resolution, this work lays the foundation for future studies to probe the contributions of specific energetic factors on protein folding and function.

Keywords: Energy Landscape Theory, Protein Folding, Structure-based model, All-atom model, Side chain packing

Introduction

In recent years the energy landscape theory of protein folding [15] has been validated through its application to protein folding [610], oligomerization [1114], functional transitions [1520] and structure prediction [21,22]. The theory states that proteins are minimally frustrated, that their energy landscape is funnel shaped and that the folded state of the protein is at the bottom of the funnel. Because of the shape of the landscape there is a strong energetic bias towards the folded state of the protein with relatively infrequent trapping caused by non-native interactions. The resulting heterogeneity observed during folding is due to the geometric constraints of the native structure. Thus, models of proteins that have only the native structure encoded have had great success in determining folding mechanisms. Until recently, most models tended to be coarse-grained, which are very useful in understanding global folding dynamics. In commonly used structure-based (Gō) potentials [9], each residue is represented by a bead centered at the location of the Cα atom (Figure 1b) and only native interactions are stabilizing.

Fig. 1.

Fig. 1

CI2 (Protein Data Base Entry 1YPA [23]) shown in (a) cartoon representation, (b) Cα representation and (c) all-atom (AA) representation. Structures are colored Red (C-terminus) to Blue (N-terminus). The size of the atoms in the Cα and AA representations correspond to the excluded volume radii used in the Cα [9] and AA models studied in this paper. Structures visualized using VMD [24].

On the other end of the spectrum of structural and energetic details are the computationally intensive all-atom empirical forcefields [2530]. These forcefields include an atomistic representation of a protein either with an implicit or an explicit solvent. In these potentials, the parameters which determine the interaction between atoms, such as partial charges and van der Waals radii, are fit to experimental measurements and quantum mechanical calculations. With accurate calibration, a single parameter set may be applied to any protein and with sufficient computing resources, the dynamics of a protein can be calculated on a computer. The physics-based representation of atom-atom interactions automatically includes electrostatic interactions as well as any non-native interactions that may be present. In principle, these models render knowledge of a native structure unnecessary. A major limitation of these potentials is that they are often too expensive to fold all but small proteins [3139]. The timescales that can currently be calculated vary from hundreds of nanoseconds to microseconds, depending on the size of the protein. Biological timescales are usually several orders of magnitude larger and these dynamics cannot be accessed using all-atom empirical forcefields. In addition, sensitivity analysis of the dynamics to the parameters is not possible with these all-atom empirical forcefields.

In all-atom empirical forcefields an observed specificity of (i.e. preference for) native interactions is seen as a consequence of many energetic contributions. Due to the complex formulation of these potentials, it is impossible to partition geometric effects from energetic ones. There is a similar restriction in coarse-grained models due to their simplicity. Partitioning these effects is often impossible since geometry is included implicitly through energetic interactions. By studying all-atom models with structure-based potentials [40,4750], since atomic geometry is explicitly included, we can ask to what extent energetics contribute to the apparent native specificity in protein structure, folding and function. In contrast to enzyme catalysis where specific atomic interactions directly control the chemical reactions, in most cases the energetic specificity required in protein folding is less stringent.

Providing a complete picture of specificity in protein folding and function will require the study of many proteins and many parametric variations. In this manuscript, we lay the foundation for this line of investigation through systematic characterization of a completely specific (only, and all, native interactions are stabilizing) AA structure-based model. We study the effect of varying the parameters of the model on folding barriers, mechanisms, contact formation and side chain dynamics. The test proteins, B domain of Protein A, SH3 domain of C-Src Kinase and Chymotrypsin Inhibitor 2 (CI2) (Figure 2) have been experimentally [5355] and computationally [8,5658] well characterized. Additionally, they possess two-state folding dynamics and represent different secondary and tertiary structures. The present model is energetically unfrustrated, with an explicit representation of all non-hydrogen atoms and homogeneous interaction strengths. We find that folding in the model is robust to parameter changes and dynamics agrees well with both the Cα model and an all-atom empirical forcefield with explicit solvent. Further, side chain ordering can be probed explicitly and the effect of prolines can be calculated. This study and model will serve as a basis for future AA models which incorporate non-specific contributions of energetic frustration, electrostatics and hydration.

Fig. 2.

Fig. 2

Structures of (a) Protein A, (b) SH3 and (c) CI2 (PDB entries 1BDD [51], 1FMK [52] and 1YPA [23]) colored Red (C-terminus) to Blue (N-terminus). These three proteins represent differing structural content and topological complexity. Protein A is a three-helix bundle, SH3 is composed of multiple β strands and in CI2 an alpha helix flanks a β sheet. Proline residues are shown as grey spheres. In Protein A, Gln1 and Ser31 are shown as colored spheres. In SH3, Val4 and Trp35 are shown as spheres. The mini-core of CI2 is circled.

Results

Folding mechanisms are robust to parameter changes

We employ a model where the potential energy function is defined by the native state and all heavy (non-hydrogen) atoms are explicitly represented. Any two atoms that are close in the native structure are said to form a native contact. We describe the folding process by using the fraction of native residue pairs in contact QAA (see methods). Figure 3a shows QAA, QCA (fraction of Cα contacts, see methods) and radius of gyration Rg as functions of time for an AA simulation of CI2, near folding temperature. Since QAA captures the same collapse events as Rg and QCA (Figure 3b), QAA is a useful measure of backbone folding in addition to side chain packing.

Fig. 3.

Fig. 3

(a) Fraction of Cα contacts QCA(t), AA contacts QAA(t) and Radius of Gyration Rg(t) as functions of time for a representative trajectory of CI2 with the AA model. (b) Average structure formation for several reaction coordinates. A contact between residues is formed when a single atom-atom contact between them is formed. An atom-atom contact is considered formed when the pair is at a distance r < γσ where σ is the native pair distance. The fraction of native residue contacts formed QAAX is shown for γ = 1.2 (black) and γ = 1.5 (red). A Cα contact is formed when the Cα atoms are within 1.2 times their native distance (green). All three coordinates capture the same folding events. (c) Atom-atom distance for a contact in the active loop of CI2 versus time at Tf (red) and T < Tf (green). Large changes in distance (> 20Å) coincide with folding transitions. Side chain rearrangements in the folded state (R < 10Å) occur on much faster time scales than folding of the entire protein. (d) Same as Figure (c) with time scale decreased by a factor of 100. Horizontal lines correspond to σ (yellow), 1.2σ (blue) and 1.5σ (purple). As temperature is decreased, distance fluctuations and average distances decrease.

It is crucial to understand the parameter dependence of a model before it can be used to make reliable predictions of folding mechanisms. The robustness of the folding mechanism is probed here by characterizing Protein A, SH3 and CI2 for variants of the AA structure-based energy function. Due to the debate about the balance between secondary and tertiary interactions, we vary the ratio of non-local contact energy to dihedral angles RC/D and the relative strength of backbone dihedral angles to side chain dihedral angles RBB/SC (see methods).

To characterize the folding mechanism for different parameter sets we computed the probability of contacts formed as a function of the folding process P(Qi, QAA). P(Qi, QAA) is the probability that the set of contacts involving residue i, Qi, are formed as a function of QAA. P(Qi, QAA) was calculated for the three proteins for 16 different parameter sets (all combinations of RC/D = 1.0, 2.0, 3.0, 4.0 and RBB/SC = 1.0, 2.0, 3.0, 4.0). Figure 4 shows the folding mechanisms for four parameter sets. The difference in folding mechanism between parameter sets i and j can be quantified by the root mean squared deviation in P(Qi, QAA) over all QAA and Qi, (Prms=(Pi(Qi,QAA)Pj(Qi,QAA))2). The largest values of Prms for Protein A, SH3 and CI2 were 0.057, 0.097 0.077. SH3 is a complicated fold, Protein A a simple fold and CI2 an intermediate fold [59]. Thus, it is not surprising that energetic modifications have the largest effects on Protein A and the smallest effects on SH3.

Fig. 4.

Fig. 4

The left column shows the probability of contacts being formed for each residue P(Qi, QAA) as a function of QAA for RC/D = 1.0 and RBB/SC = 1.0. The three right columns show P(Qi, QAA) for different Hamiltonians relative to RC/D = 1.0 and RBB/SC = 1.0. Blue indicates a decrease in formation, relative to RC/D = 1.0 and RBB/SC = 1.0, and red an increase. Proline containing regions are often sensitive to contact energy. In Protein A, both P12 and P30 fold earlier with increased contact strength. In SH3, the increase in formation of Val4 may be attributed to interactions with Pro56, though Pro50 and Trp35 do not exhibit increased formation. In CI2, both Pro6 and Pro61 exhibit increased formation with increased contact strength. Residues that lack native contacts are shown in grey.

Figure 4 shows proline containing regions are less stable to parametric modifications. Regions with prolines, and regions interacting with prolines, form structure earlier (at lower Q) with increased contact strength. This is because contact strength is increased at the expense of dihedral strength. Prolines possess a covalent CδN bond, which limits the mobility of the ϕ dihedral. Removing energy from the dihedrals does not increase flexibility in prolines. However, adding energy to contacts increases structure formation around prolines. For this reason, increasing RC/D stabilizes and promotes earlier formation of proline containing regions.

Fully folded backbone allows for disordered side chains

While QAA and QCA capture the same cooperative folding events, at folding temperature, QCA is higher than QAA for the folded ensemble. This suggests that while the backbone structure is native (QCA ≈ 0.8), many of the native residue interactions form as temperature is decreased (Figure 3c and d). To account for this structurally and quantitatively, we calculated the difference between the probability of Cα contacts being formed P(QCαi,QCα) and AA contacts being formed P(QAAi,QCα) (Figure 5). A value of 0 indicates that, on average, the Cα atoms of a residue pair are near their native distance when the side chains are in contact. Positive values are seen when extended side chains are interacting, resulting in the Cα atoms being far from their native distance. Negative values indicate backbone folding precedes side chain ordering1. Side chains in Protein A appear to be well-packed, in that there is concomitant side chain and backbone folding. In SH3, the turns have negative values, and are thus underpacked. In CI2, underpacking is primarily found in the active site loop and the C-terminal tail. These results reveal a signature of complicated folds [58,59]: a small subset of native contacts is sufficient to constrain the backbone to its native orientation, resulting in significantly underpacked regions in the native state. This occurs in complicated folds because an individual contact can impose a high level of order on the system. In order to form contacts that are distant in sequence a large number of residues must also order. In Protein A, many contacts are local and only constrain single helical turns. In SH3 and CI2, fewer contacts are required to constrain the entire backbone (including the turns and loops).

Fig. 5.

Fig. 5

Difference in AA contact formation and Cα contact formation P(QAAi,QCα)P(QCαi,QCα) for (a) Protein A, (b) SH3 and (c) CI2. Positive values (red) indicate that residues are interacting without the Cα atoms being near. Negative values (blue) indicate the residues are “underpacked”: the Cα atoms are near each other without the side chains interacting. Residues that lack native contacts are shown in grey. (d–f) Underpacked (blue spheres) and well packed (orange spheres) residues are shown on the native structures. In Protein A, to order the backbone of a helix the side chains must be packed around it. Beta sheets are stabilized by non-local interactions. Thus, a small number of contacts can maintain the tertiary structure of SH3 without the side chains in the turn regions interacting, hence the underpacking. In CI2, the active site loop is significantly underpacked.

Figures 3c and 3d show the dynamics of a typical underpacked contact. As T is lowered below Tf the underpacked contact’s average distance and distance fluctuations smoothly decrease. This results in a gradual increase in Q without a noticeable free energy barrier (See Figure 6e). We hope that these subtle dynamics will be experimentally probed and tested in the future.

Fig. 6.

Fig. 6

Free energy barriers in the AA model for (a) Protein A, (b) SH3 and (c) CI2. Profiles in (a–c) are for RBB/SC = 2.0 with RC/D = 1.0 (black), RC/D = 2.0 (red), RC/D = 3.0 (green) and RC/D = 4.0 (blue). In SH3 and CI2, barrier height decreases and the folded basins move to lower Q with increasing RC/D and increasing RBB/SC. (d) F(QCA(t)) and F(QAA(T)) for a typical parameter set demonstrate that the folded basins in (a–c) correspond to collapsed states. (e) Two distinct folding processes observed in our model: backbone collapse and side chain packing. (f) Free energy barriers obtained from Cα structure-based simulations for Protein A, SH3 and CI2. Barrier heights in the Cα simulations are greater than in AA simulations. Both models predict the largest barriers for SH3 and smallest for Protein A.

Understanding free energy profiles through parametric variation: Free energy profiles can be altered through parametric changes

While the folding mechanisms are stable, the free energy barriers associated with folding and the locations of the folded basins vary systematically with parameters. Figure 6 shows free energy profiles for SH3, CI2 and Protein A for several values of RC/D with RBB/SC = 2.0. There are four distinct, interrelated, trends shared by all three proteins. First, there are two folding processes: backbone collapse and side chain packing. Second, the free energy minimum for the folded state moves to lower Q with increasing RC/D. Third, the free energy barrier decreases with increasing RC/D. Finally, increasing RBB/SC has similar effects as increasing RC/D (not shown).

The free energy basins for the folded states are located at QCA ≈ 0.8 and QAA ≈ 0.5 (Figure 6d), indicating that the backbone orders while many native atom-atom interactions remain extended. Thus, the entropy loss during the cooperative folding transition is likely dominated by backbone ordering. Side chain packing occurs both concomitantly with, and after, backbone ordering.

There are likely two major factors that lead to the observed trends. First, increasing RC/D increases contact strength. As seen in other simplified models [60], when each contact is stronger, a smaller number of contacts is required (lower Q) to provide an equal amount of stabilizing energy. The second contributing factor is the change in side chain entropy. While entropy loss in the backbone dominates the collapse transition, the gradual side chain packing can also lead to shifting basins. Increasing RBB/SC or RC/D reduces the strength of side chain dihedrals, resulting in more mobile unfolded side chains. Therefore, there is an increased entropy loss per side chain upon folding ΔSsc when RC/D or RBB/SC is increased. Since side chains can pack independently of the collapse transition, when ΔSsc increases, a fraction of the side chain interactions extend, while leaving the overall fold intact. Since the folded basin shifts to lower Q, the overall structure required to form a stable fold is reduced. A reduced barrier height naturally results when the folded basin is less ordered.

Free energy barriers, in conjunction with diffusion constants, provide a direct connection to experimental folding rates [6163]. We find that the relative barrier heights calculated using our AA model are similar to those from a Cα model (Fig. 6f). The relative barrier heights calculated from this model are known to correlate well with experimental rates [63]. We note in passing, that the absolute free energy barriers in the AA model can be parametrically changed by up to a factor of two for a given protein and that the relative barrier heights between proteins remain constant. Thus, while the magnitude of the rates will be determined by the diffusion constant, the correlation between experimental folding rates and theoretical barriers is independent of the choice of parameters.

All-atom structure-based simulations capture Cα folding mechanism

Next we compare the backbone folding mechanisms of our AA model and a commonly used Cα model [9]. The Cα representation has been successful at capturing experimentally determined protein folding mechanisms [8,9,11]. The first column in Figure 7 shows the differences in folding mechanisms between the AA model and an energetically homogeneous Cα model. Every contact and dihedral in the homogeneous Cα model has the same interaction strength. Since the AA model distributes contact energy inhomogeneously between residue pairs, it is not surprising that the mechanisms differ.

Fig. 7.

Fig. 7

Comparison of backbone folding in Cα and AA structure-based models. The probability of contacts being formed in a Cα model, minus the probability of Cα contacts being formed in an AA model, is shown for (a–c) Protein A, (d–f) SH3 and (g–i) CI2. (a, d, g) Comparison of AA simulation to a Cα model with homogenous contact strength. (b, e, h) Comparison between AA results to an energetically inhomogeneous Cα model. Regions of increased formation in the AA representation correspond largely to proline containing regions, or regions that interact with proline, such as the minicore in CI2 (black arrows indicate mini-core residues), the tails of SH3 and turn 2 of Protein A. Increased formation in the tails of CI2 can largely be accounted for by the large number of contacts between GLU4 and ARG62. (c, f, i) The inhomogeneous Cα model compared to the AA model with all prolines mutated to alanines. Mutating proline to alanine improved agreement between models. Residues that lack native contacts are shown in grey.

To remove differences arising from energetic homogeneity in the Cα model, we modified it such that each contact is weighted by the number of contacts between each residue pair in the AA model (Figure 7, second column). For Protein A this modification improves agreement. The remaining difference is in a single turn-to-tail contact (Gln1 with Ser31, Fig. 2a) that rarely forms in Cα simulations. In SH3, agreement improves around residues Asp34 and Asn52, while differences persist in Gln45 and the tails. The overall effect is increased formation around Gln45 at the expense of the tails. In CI2, there is significant agreement in the tails, though the mini-core still forms earlier (in the AA model), at the expense of the helix. For all three proteins, several regions of disagreement possess proline residues, whose CδN bond is not included in the Cα model.

To eliminate effects specific to proline, we repeated the AA simulations with all prolines mutated to alanines. The third column of Figure 7 shows the Pro-Ala mutants compared to the inhomogeneous Cα model. Improved agreement is observed in Pro-Ala mutants of SH3 and CI2. In both proteins Pro-Ala mutations delay folding of proline regions, in agreement with proline effects on model stability. In SH3 the tails still form slightly earlier in the AA model, at the expense of residues 35–55. In CI2, the balance between minicore and helix formation is clearly improved, highlighting the importance of prolines in the folding process. Pro-Ala mutations have almost no effect on the folding mechanism of P12 and P30 in Protein A and P25 in CI2. This is likely because these prolines are located in turn regions. In our model, turns are highly constrained by short range contacts, and the reduced dihedral constraint (imposed by a proline) acts as a small perturbation. The remaining differences between the Pro-Ala AA mutants and the inhomogeneous Cα model demonstrate, to no surprise, that the inclusion of side chains alters the relative entropy of residues.

Native basin dynamics of AA structure-based model correlate with the dynamics of an all-atom empirical forcefield with explicit solvent

Two common measures of native state dynamics are native contact formation and root mean squared deviations in structure rmsd. Figure 8 shows the average contact formation in the native ensemble for the structure-based model and an all-atom empirical forcefield with and explicit solvent. While the average contacts are not identical, no major differences in contact formation are observed. The overlaps between the AA maps and the all-atom empirical forcefield maps of Protein A, SH3 and CI2 are 0.85, 0.97 and 0.84. An overlap of 1 indicates identical maps, and 0 indicates the two maps have no contacts in common.

Fig. 8.

Fig. 8

Probability of contacts being formed P(i, j) at T ≈ 0.8Tf for the AA structure-based potential (top left) and an all-atom empirical forcefield (bottom right) for (a) Protein A, (b) SH3 and (c) CI2. Dark red indicates that residue i (x axis) and residue j (Y axis) are always in contact under native conditions. Dark blue indicates the contact is formed rarely (less than 10% of the time). White indicates P(i, j) < 0.025. In all three proteins, contacts are more broadly distributed (higher number of low probability contacts) in the structure-based simulations than in all-atom empirical forcefield simulations (fewer contacts, but with higher probabilities). There are approximately four times as many contacts with P(i, j) < 0.01 for the structure-based simulations than are seen in all-atom empirical simulations, indicating more mobile dynamics.

In a uniquely defined native state, the probability of each contact being formed is 1. Since we sample the native ensemble at finite temperatures, atom mobility leads to additional contacts being formed. In the structure-based model, these additional interactions are strictly repulsive. In an all-atom empirical forcefield these interactions can be attractive, yet they are observed more frequently in the structure-based model2. These contacts are likely due to increased mobility in the structure-based simulations. In all-atom empirical forcefields, hydration shells can result in less mobile side chains, and hence a narrower distribution of contacts.

The increased mobility is quantified by the structural rmsd. The magnitude of fluctuations in all-atom empirical simulations is much lower than in structure-based simulations (not shown). For the all-atom empirical forcefield at 300 K, the average rmsd for Protein A, SH3 and CI2 are 1.53, 1.00 and 0.97 Å. The rmsd of the Cα atoms are 1.23, 0.66 and 0.74. The same values are obtained in structure-based simulations at around T = 0.55Tf. In real temperature units, 0.55Tf corresponds to temperatures significantly less than 300 K. A likely cause for the increased structural fluctuations is hydration effects of explicit solvent molecules in the all-atom empirical forcefield. To compare the distribution of rmsd fluctuations between models, correlation coefficients (r) were computed for the rmsd by atom in the all-atom empirical forcefield and the structure-based potential. For all parameter sets of the structure-based potential, the r ≈ 0.7 for CI2 and SH3 and r ≈ 0.8 for Protein A3.

Discussion

In this manuscript, we describe a systematic analysis of an AA structure-based model which bridges the gap between coarse-grained models and all-atom empirical forcefields. We show that in our Cα and AA structure-based models the global folding mechanisms agree and the main differences are largely due to energetic heterogeneity and the explicit representation of prolines in the AA model. Also, the native basin dynamics are similar in the AA structure-based model and an all-atom empirical forcefield with explicit solvent. In agreement with previous studies, the folding mechanisms in complicated folds are stable to parametric variation. On the other hand, the free energy barriers associated with folding vary systematically with parameters. Since free energy barriers are not a robust feature of this model, understanding the interplay between barrier heights and diffusion will be important before attempting to predict folding rates [61,64,65].

Using this model we characterized two folding processes: one associated with backbone collapse and the other with side chain packing. We observed that backbone collapse is accompanied by partial side chain packing in a cooperative transition and residual side chain packing occurs as temperature is reduced below the global folding temperature. One explanation for the partial separation of backbone folding and side chain ordering may be that mobility in specific residues is necessary for the functional properties of proteins. Proteins are selected for their function. Orthogonal networks of residues responsible for stability and function have been proposed [66,67]. The observation in our model that some residues are not necessary to maintain the backbone structure is consistent with this proposal. In CI2, the backbone of the active site loop is in the native orientation, yet the side chains are not packed. In SH3, several turns are also disordered. Since binding sites are often found in loops, flexible loops may be more easily adapted to new sequences and functions.

Gradual side chain packing can also allow for proteins to functionally respond to cellular stress by affecting side chain orientations, without denaturing the entire protein. This is consistent with the prediction that localized unfolding, or cracking, is important for biological function of kinases and motor proteins [15,18,6873].

The current model explicitly includes the effects of topological contributions to protein folding, and the role of energetic contributions may now be elucidated. Our results are a significant step forward in understanding protein dynamics from the Cα to the all-atom level. In the coming years, it will be interesting to probe the effects of electrostatics, non-native interactions, water and explicit mutations in this model.

Models and Methods

Energy Function

In our AA model of the protein, only heavy (non-hydrogen) atoms are included. Each atom is represented as a single bead of unit mass. Bond lengths, bond angles, improper dihedrals and planar dihedrals are maintained by harmonic potentials. Non-bonded atom pairs that are in contact in the native state between residues i and j, where i > j+3, are given a Lennard-Jones potential, while all other non-local interactions are repulsive. All contacts identified by the Contact of Structural Units software package (CSU)[74] were included. The functional form of the potential is,

V=bondsεr(rro)2+anglesεθ(θθo)2+impropers/planarεχ(χχo)2+backboneεBBFD(ϕ)+sidechainesεSCFD(ϕ)+contactsεC[(σijr)122(σijr)6]+non-contactsεNC(σNCr)12 (1)

where,

FD(ϕ)=[1cos(ϕϕo)]+12[1cos(3(ϕϕo))] (2)

and εr = 100, εθ = 20, εχ = 10 and εNC = 0.01. ro, θo, χo, ϕo and σij are given the values found in the native state and σNC = 2.5Å. When assigning dihedral strengths, we first group dihedral angles that share the middle two atoms. For example, in a protein backbone, one can define up to four dihedral angles that possess the same CCα covalent bond as the central bond. Each dihedral is given the interaction strength of 1/ND, where ND is the number of dihedral angles in the group. εBB and εSC are then scaled so that RBB/SC=εBBεSC. Next, dihedral strengths and contact strengths are scaled such that our other system parameter, the ratio of total contact energy to total dihedral energy RC/D=εCεBB+εSC, is satisfied. The total stabilizing energy is equal for all parameter sets (i. e. ∑εC + ∑εBB + ∑εSC=Constant).

As a reaction coordinate we use QAA and QCA. QAA is the fraction of natively interacting residues that are in contact. Two residues are considered in contact if any native atom-atom interactions between the residues are within 1.2 times the native distance σij. At 1.2σij the potential energy of a native pair is approximately half of the minimum. Similarly, QCA is the fraction of natively interacting residue pairs whose Cα atoms are within 1.2 times their native distance.

Proline to Alanine mutations

To investigate the role of proline residues in the AA model, proline to alanine mutants were constructed. This was achieved by removing the Cγ and Cδ atoms of each proline. Native contacts formed with the Cγ and Cδ of a proline were included as contacts with the Cβ of the corresponding alanine. This ensured the energetics of the system were unperturbed, and only topology was modified.

Simulation Details

All-atom structure-based simulations were performed using the GROMACS software package [26]. No modifications to the source code were necessary. Reduced units were used. The timestep τ was 0.0005. The Berendsen algorithm [75] was used 4 with the coupling constant of 1. For all folding results in this paper, several constant temperature runs were performed, with temperatures that corresponded to the protein being always folded to always unfolded. The Weighted Histogram Analysis Method [76,77] was used to combine data from multiple temperatures into single free energy profiles.

All-atom Empirical Forcefield Simulations

All-atom empirical forcefield simulations were performed using GROMACS [26,78], with the OPLS-AA forcefield [79] with TIP3P water molecules [80]. Each protein was simulated for 10 ns at T=300K and a pressure of 1 atm. A timestep of 2 fs was used in conjunction with the LINCS [81,82] algorithm for constraining covalent bonds with hydrogen. Protein A, SH3 and CI2 were simulated with 2810, 3617 and 4644 water molecules in cubic boxes of initial dimensions 45.15 Å, 48.98 Å and 53.07 Å. Temperature was maintained using the Berendsen algorithm [75]. 1 ns was allowed for equilibration. For the remaining 9 ns, structures were saved at 1 ps intervals.

Comparison of contacts

In the all-atom empirical forcefield simulations contacts were determined for each saved structure using CSU [74]. The average number of contacts < Q > was calculated for each protein. The probability of individual contacts being formed was averaged over all structures with Q =< Q >. With the all-atom empirical potential < Q > was 80, 135 and 146 for Protein A, SH3 and CI2. This analysis was repeated for folded simulations with our AA structure-based simulations. For the structure-based simulations < Q > was 80, 138 and 144. To compare contact maps, the dot product of the two maps was taken.

Acknowledgements

We would like to thank Angel Garcia and Peter G. Wolynes for useful discussions regarding all-atom modeling. PCW and JKN were supported in part by the National Institutes of Health Molecular Biophysics Training Program at University of California at San Diego Grant T32 GM08326. This work was supported in part by Grants PHY-0216576 and 0225630 from the National Science Foundation (NSF)-sponsored Center for Theoretical Biological Physics, NSF Grant 0543906, the LANL LDRD program and NIH Grant R01-GM072686.

Footnotes

1

QAA is a generous definition of side chain packing, since a side chain is “packed” when one or more atom-atom contacts are formed. Thus, “underpacked” residues clearly have very little native structure.

2

In Figure 8 only interactions present more than 2.5% of the time are shown.

3

Comparison of rmsd of the Cα atoms yields similar values of r.

4

When using the Berendsen thermostat, numerical instabilities can arise when the bath-molecule coupling timescale is shorter than the timescale for internal energy diffusion. In our experience, these problems tend to surface when you simulate weakly interacting domains with implicit solvation. Since the present study investigates folding of single domain proteins under weak temperature coupling, these features are not likely a source of significant errors. Nonetheless, future work will also employ Langevin or Nose-hoover temperature coupling.

References

  • 1.Leopold PE, Montal M, Onuchic JN. Protein folding funnels - a kinetic approach to the sequence structure relationship. Proc. Nat. Acad. Sci. USA. 1992;18:8721–8725. doi: 10.1073/pnas.89.18.8721. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Bryngelson JD, Onuchic JN, Socci ND, Wolynes PG. Funnels, pathways, and the energy landscape of protein-folding - a synthesis. Proteins. 1995;21:167–195. doi: 10.1002/prot.340210302. [DOI] [PubMed] [Google Scholar]
  • 3.Bryngelson JD, Wolynes PG. Spin-glasses and the statistical mechanics of protein folding. Proc. Nat. Acad. Sci. USA. 1987;84:7524–7528. doi: 10.1073/pnas.84.21.7524. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Onuchic JN, Wolynes PG. Theory of protein folding. Curr. Opin. Struct. Biol. 2004;14:70–75. doi: 10.1016/j.sbi.2004.01.009. [DOI] [PubMed] [Google Scholar]
  • 5.Ueda Y, Taketomi H, G014D; N. Studies on protein folding, unfolding and fluctuations by computer simulation. I. The effects of specific amino acid sequence represented by specific inter-unit interactions. Int. J. Pept. Res. 1975;7:445–459. [PubMed] [Google Scholar]
  • 6.Shoemaker BA, Wang J, Wolynes PG. Structural correlations in protein folding funnels. Proc. Nat. Acad. Sci. USA. 1997;94:777–782. doi: 10.1073/pnas.94.3.777. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Nymeyer H, Garcia AE, Onuchic JN. Folding funnels and frustration in off-lattice minimalist protein landscapes. Proc. Nat. Acad. Sci. USA. 1998;95:5921–5928. doi: 10.1073/pnas.95.11.5921. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Clementi C, Jennings PA, Onuchic JN. Prediction of folding mechanism for circular-permuted proteins. J. Mol. Biol. 2001;311:879–890. doi: 10.1006/jmbi.2001.4871. [DOI] [PubMed] [Google Scholar]
  • 9.Clementi C, Nymeyer H, Onuchic JN. Topological and energetic factors: What determines the structural details of the transition state ensemble and ”en-route” intermediates for protein folding? An investigation for small globular proteins. J. Mol. Biol. 2000;298:937–953. doi: 10.1006/jmbi.2000.3693. [DOI] [PubMed] [Google Scholar]
  • 10.Gosavi S, Chavez LL, Jennings PA, Onuchic JN. Topological frustration and the folding of interleukin-1 beta. J. Mol. Biol. 2006;357:986–996. doi: 10.1016/j.jmb.2005.11.074. [DOI] [PubMed] [Google Scholar]
  • 11.Levy Y, Onuchic JN. Mechanisms of protein assembly: Lessons from minimalist models. Acc. Chem. Res. 2006;39:135–142. doi: 10.1021/ar040204a. [DOI] [PubMed] [Google Scholar]
  • 12.Levy Y, Cho SS, Shen T, Onuchic JN, Wolynes PG. Symmetry and frustration in protein energy landscapes: A near degeneracy resolves the Rop dimer-folding mystery. Proc. Nat. Acad. Sci. USA. 2005;102:2373–2378. doi: 10.1073/pnas.0409572102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Levy Y, Cho SS, Onuchic JN, Wolynes PG. A survey of flexible protein binding mechanisms and their transition states using native topology based energy landscapes. J. Mol. Biol. 2005;346:1121–1145. doi: 10.1016/j.jmb.2004.12.021. [DOI] [PubMed] [Google Scholar]
  • 14.Yang SC, Cho SS, Levy Y, Cheung MS, Levine H, Wolynes PG, Onuchic JN. Domain swapping is a consequence of minimal frustration. Proc. Nat. Acad. Sci. USA. 2004;101:13786–13791. doi: 10.1073/pnas.0403724101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Whitford PC, Miyashita O, Levy Y, Onuchic JN. Conformational transitions of adenylate kinase: Switching by cracking. J. Mol. Biol. 2007;366:1661–1671. doi: 10.1016/j.jmb.2006.11.085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Whitford PC, Gosavi S, Onuchic JN. Conformational Transitions in Adenylate Kinase: Allosteric Communication Reduces Misligation. J. Biol. Chem. 2008;283:2042–2048. doi: 10.1074/jbc.M707632200. [DOI] [PubMed] [Google Scholar]
  • 17.Schug A, Whitford PC, Levy Y, Onuchic JN. Mutations as trapdoors to two competing native conformations of the Rop-dimer. Proc. Nat. Acad. Sci. USA. 2007;104:17674–17679. doi: 10.1073/pnas.0706077104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Okazaki K, Koga N, Takada S, Onuchic JN, Wolynes PG. Multiple-basin energy landscapes for large amplitude conformational motions of proteins: Structure-based molecular dynamics simulations. Proc. Nat. Acad. Sci. USA. 2006;103:11844–11849. doi: 10.1073/pnas.0604375103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Best RB, Chen Y, Hummer G. Slow protein conformational dynamics from multiple experimental structures: The helix/sheet transition of Arc repressor. Structure. 2005;13:1755–1763. doi: 10.1016/j.str.2005.08.009. [DOI] [PubMed] [Google Scholar]
  • 20.Zuckerman DM. Simulation of an ensemble of conformational transitions in a united-residue model of calmodulin. J. Phys. Chem. B. 2004;108:5127–5137. [Google Scholar]
  • 21.Eastwood MP, Hardin C, Luthey-Schulten Z, Wolynes PG. Evaluating protein structure-prediction schemes using energy landscape theory. IBM. J. Res. Dev. 2001;5:475–497. [Google Scholar]
  • 22.Rohl CA, Strauss CEM, Misura KMS, Baker D. Protein structure prediction using rosetta. Methods Enzymol. 2004;383:66–93. doi: 10.1016/S0076-6879(04)83004-0. [DOI] [PubMed] [Google Scholar]
  • 23.Harpaz Y, Elmasry N, Fersht AR, Henrick K. Direct observation of better hydration at the N terminus of an alpha-helix with glycine rather than alanine as the N-cap residue. Proc. Nat. Acad. Sci. USA. 1994;91:311–315. doi: 10.1073/pnas.91.1.311. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Humphrey W, Dalke A, Schulten K. VMD - Visual Molecular Dynamics. J. Molec. Graphics. 1996;14:33–38. doi: 10.1016/0263-7855(96)00018-5. [DOI] [PubMed] [Google Scholar]
  • 25.Adcock SA, McCammon JA. Molecular dynamics: A survey of methods for simulating the activity of proteins. Chem. Revs. 2006;106:1589–1615. doi: 10.1021/cr040426m. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Lindahl E, Hess B, van der Spoel D. Gromacs 3.0: A package for molecular simulation and trajectory analysis. J. Mol. Mod. 2001;7:306–317. [Google Scholar]
  • 27.Ponder JW, Case DA. Force fields for protein simulations. Adv. Prot. Chem. 2003;66:27–85. doi: 10.1016/s0065-3233(03)66002-x. [DOI] [PubMed] [Google Scholar]
  • 28.Phillips JC, Braun R, Wang W, Gumbart J, Tajkhorshid E, Villa E, Chipot C, Skeel RD, Kale L, Schulten K. Scalable molecular dynamics with NAMD. J. Comp. Chem. 2005;26:1781–1802. doi: 10.1002/jcc.20289. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Brooks BR, Bruccoleri RE, Olafson BD, States DJ, Swaminathan S, Karplus M. Charmm: A program for macromolecular energy, minimization, and dynamics calculations. J. Comp. Chem. 1983;4:187–217. [Google Scholar]
  • 30.Eisenberg D, McLachlan AD. Solvation energy in protein folding and binding. Nature. 1986;319:199–203. doi: 10.1038/319199a0. [DOI] [PubMed] [Google Scholar]
  • 31.Zhou R. Trp-cage: Folding free energy landscape in explicit water. Proc. Nat. Acad. Sci. USA. 2003;100:13280–13285. doi: 10.1073/pnas.2233312100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Paschek D, Nymeyer H, Garcia A. Replica exchange simulation of reversible folding/unfolding of the Trp-cage miniprotein in explicit solvent: On the structure and possible role of internal water. J. Struct. Biol. 2007;157:524–533. doi: 10.1016/j.jsb.2006.10.031. [DOI] [PubMed] [Google Scholar]
  • 33.Duan Y, Kollman PA. Pathways to a protein folding intermediate observed in a 1-microsecond simulation in aqueous solution. Science. 1998;282:740–744. doi: 10.1126/science.282.5389.740. [DOI] [PubMed] [Google Scholar]
  • 34.Garcia A, Onuchic JN. FolDing a protein on a computer: An atomic description of the folding/unfolding of Protein A. Proc. Nat. Acad. Sci. USA. 2003;100:13898–13903. doi: 10.1073/pnas.2335541100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Schug A, Herges T, Wenzel W. Reproducible Protein Folding with the Stochastic Tunneling Method. Phys. Rev. Letters. 2003;91:158102. doi: 10.1103/PhysRevLett.91.158102. [DOI] [PubMed] [Google Scholar]
  • 36.Schug A, Verma A, Herges T, Lee KH, Wenzel W. Comparison of stochastic optimization methods for all-atom folding of the trp-cage protein. ChemPhysChem. 2005;6:2640–2646. doi: 10.1002/cphc.200500213. [DOI] [PubMed] [Google Scholar]
  • 37.Schug A, Wenzel W. An evolutionary strategy for all-atom protein folding of the sixty-amino acid bacterial ribosomal protein l20. Biophys. J. 2006;90:4273–4280. doi: 10.1529/biophysj.105.070409. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Jayachandran G, Vishal V, Pande VS. Using massively parallel simulation and Markovian models to study protein folding: Examining the dynamics of the villin headpiece. J. Chem. Phys. 2006;124:164902. doi: 10.1063/1.2186317. [DOI] [PubMed] [Google Scholar]
  • 39.Freddolino PL, Liu F, Gruebele M, Schulten K. Ten-microsecond molecular dynamics simulation of a fast-folding WW domain. Biophys. J. 2008;94:L75–L77. doi: 10.1529/biophysj.108.131565. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Clementi C, Garcia AE, Onuchic JN. Interplay among tertiary contacts, secondary structure formation and side-chain packing in the protein folding mechanism: All-atom representation study of protein L. J. Mol. Biol. 2003;326:933–954. doi: 10.1016/s0022-2836(02)01379-7. [DOI] [PubMed] [Google Scholar]
  • 41.Kusunoki H, Minasav G, Macdonald RI, Mondragon A. Independent movement, dimerization and stability of tandem repeats of chicken brain alpha-spectrin. J. Mol. Biol. 2004;344:495–511. doi: 10.1016/j.jmb.2004.09.019. [DOI] [PubMed] [Google Scholar]
  • 42.Scott KA, Batey S, Hootan KA, Clarke J. The folding of spectrin domains I: wild-type domains have the same stability but very different kinetic properties. J. Mol. Biol. 2004;344:195–205. doi: 10.1016/j.jmb.2004.09.037. [DOI] [PubMed] [Google Scholar]
  • 43.Cheung MS, Finke JM, Callahan B, Onuchic JN. Exploring the interplay between topology and secondary structural formation in the protein folding problem. J. Phys. Chem. B. 2003;107:11193–11200. [Google Scholar]
  • 44.Klimov DK, Thirumalai D. Mechanisms and kinetics of beta-hairpin formation. Proc. Nat. Acad. Sci. USA. 97:2544–2549. doi: 10.1073/pnas.97.6.2544. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Takada S, Luthey-Schulten Z, Wolynes PG. Folding dynamics with nonadditive forces: A simulation study of a designed helical protein and a random heteropolymer. J. Chem. Phys. 1999;110:11616–11629. [Google Scholar]
  • 46.Oliveira L, Schug A, Onuchic JN. Geometrical features of the protein folding mechanism are a robust property of the energy landscape- a detailed investigation for several reduced models. J. Phys. Chem. B. 2007 doi: 10.1021/jp0769835. [DOI] [PubMed] [Google Scholar]
  • 47.Shimada J, Kussell EL, Shakhnovich EI. The folding thermodynamics and kinetics of crambin using an all-atom monte carlo simulation. J. Mol. Biol. 2001;308:79–95. doi: 10.1006/jmbi.2001.4586. [DOI] [PubMed] [Google Scholar]
  • 48.Linhananta A, Zhou Y. The role of sidechain packing and native contact interactions in folding: Discontinuous molecular dynamics folding simulations of an all-atom Gō model of fragment B of Staphylococcal protein A. J. Chem. Phys. 2002:8983–8995. [Google Scholar]
  • 49.Zhou Y, Zhang C, Stell G, Wang J. Temperature dependence of the distribution of the first passage time: Results from discontinuous molecular dynamics simulations of an all-atom model of the second-hairpin fragment of protein G. J. Amer. Chem. Soc. 2003;125:6300–6305. doi: 10.1021/ja029855x. [DOI] [PubMed] [Google Scholar]
  • 50.Linhananta A, Boer J, Mackay I. The equilibrium properties and folding kinetics of an all-atom Gō model of the Trp-cage. J. Chem. Phys. 2005;122:114901. doi: 10.1063/1.1874812. [DOI] [PubMed] [Google Scholar]
  • 51.Gouda H, Torigoe H, Saito A, Sato M, Arata Y, Shimada I. Three-dimensional solution structure of the B domain of staphylococcal protein A: comparisons of the solution and crystal structures. Biochemistry. 1992;31:9665–9672. doi: 10.1021/bi00155a020. [DOI] [PubMed] [Google Scholar]
  • 52.Xu W, Harrison SC, Eck MJ. Three-dimensional structure of the tyrosine kinase c-Src. Nature. 1997;385:595–602. doi: 10.1038/385595a0. [DOI] [PubMed] [Google Scholar]
  • 53.Sato S, Religa TL, Daggett V, Fersht AR. Testing protein-folding simulations by experiment: B domain of protein A. Proc. Nat. Acad. Sci. USA. 2004;101:6952–6956. doi: 10.1073/pnas.0401396101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Viguera AR, Martinez JC, Filimonov VV, Mateo PL, Serrano L. Thermodynamic and kinetic-analysis of the sh3 domain of spectrin shows a 2-state folding transition. Biochemistry. 1994;33:2142–2150. doi: 10.1021/bi00174a022. [DOI] [PubMed] [Google Scholar]
  • 55.Jackson SE, Fersht AR. Folding of Chymotrypsin Inhibitor 2. 1. Evidence for a two-state transition. Biochemistry. 1991;30:10428–10435. doi: 10.1021/bi00107a010. [DOI] [PubMed] [Google Scholar]
  • 56.Shea JE, Onuchic JN, Brooks CL., III Probing the folding free energy landscape of the src-SH3 protein domain. Proc. Nat. Acad. Sci. USA. 2002;99:16064–16068. doi: 10.1073/pnas.242293099. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Hoang TX, Cieplek M. Sequencing of folding events in Go-type proteins. J. Chem. Phys. 2000;113:8319–8328. [Google Scholar]
  • 58.Shea JE, Onuchic JN, Brooks CL., III Exploring the origins of topological frustration: Design of minimally frustrated model of fragement B of protein A. Proc. Nat. Acad. Sci. USA. 1999;96:12512–12517. doi: 10.1073/pnas.96.22.12512. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Plaxco KW, Simonsa KT, Baker D. Contact order, transition state placement and the refolding rates of single domain proteins. J. Mol. Biol. 1998;277:985–994. doi: 10.1006/jmbi.1998.1645. [DOI] [PubMed] [Google Scholar]
  • 60.Prieto L, Rey A. Influence of the chain stiffness on the thermodynamics of a Gō-type model for protein folding. J. Chem. Phys. 2007;126:165103. doi: 10.1063/1.2727465. [DOI] [PubMed] [Google Scholar]
  • 61.Bryngelson JD, Wolynes PG. Intermediates and barrier crossing in a random energy-model (with applications to protein folding) J. Phys. Chem. 1989;93:6902–6912. [Google Scholar]
  • 62.Kramers HA. Brownian motion in a field of force and the diffusion model of chemical reactions. Physica. 1940;7:284–304. [Google Scholar]
  • 63.Chavez LL, Onuchic JN, Clementi C. Quantifying the roughness on the free energy landscape: Entropic bottlenecks and protein folding rates. J. Amer. Chem. Soc. 2004;126:8426–8432. doi: 10.1021/ja049510+. [DOI] [PubMed] [Google Scholar]
  • 64.Socci ND, Onuchic JN, Wolynes PG. Diffusive dynamics of the reaction coordinate for protien folding funnels. J. Chem. Phys. 1996;104:5860–5868. [Google Scholar]
  • 65.Chahine J, Oliveira RJ, Leite VBP, Wang J. Configurational-dependent diffusion can shift the kinetic transition state and barrier height of protein folding. Proc. Nat. Acad. Sci. 2007;104:14646–14651. doi: 10.1073/pnas.0606506104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Socolich M, Lockless SW, Russ WP, Lee H, Gardner KH, Ranganathan R. Evolutionary information for specifying a protein fold. Nature. 2005;437:512–518. doi: 10.1038/nature03991. [DOI] [PubMed] [Google Scholar]
  • 67.Gosavi S, Whitford PC, Jennings PA, Onuchic JN. Extracting function from a β-trefoil folding motif. doi: 10.1073/pnas.0801343105. Accepted PNAS. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Miyashita O, Onuchic JN, Wolynes PG. Nonlinear elasticity, proteinquakes, and the energy landscapes of functional transitions in proteins. Proc. Nat. Acad. Sci. USA. 2003;100:12570–12575. doi: 10.1073/pnas.2135471100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Miyashita O, Wolynes PG, Onuchic JN. Simple energy landscape model for kinetics of functional transitions in proteins. J. Phys. Chem. B. 2005;109:1959–1969. doi: 10.1021/jp046736q. [DOI] [PubMed] [Google Scholar]
  • 70.Whitford PC, Onuchic JN, Wolynes PG. The Energy Landscape along an Enzymatic Reaction Trajectory: Hinges or Cracks? HFSP J. 2008;2:61–64. doi: 10.2976/1.2894846. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Hyeon C, Onuchic JN. Internal strain regulates the nucleotide binding site of the kinesin leading head. Proc. Nat. Acad. Sci. USA. 2007;104:2175–2180. doi: 10.1073/pnas.0610939104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Hyeon C, Onuchic JN. Mechanical control of the directional stepping dynamics of the kinesin motor. Proc. Nat. Acad. Sci. USA. 2007;104:17382–17387. doi: 10.1073/pnas.0708828104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Wonmuk H, Lang MJ, Karplus M. Force generation in kinesin hinges on cover-neck bundle formation. Structure. 2008;16:62–71. doi: 10.1016/j.str.2007.11.008. [DOI] [PubMed] [Google Scholar]
  • 74.Sobolev V, Wade R, Vried G, Edelman M. Molecular docking using surface complementarity. Proteins: Struct. Funct. Genet. 1996;25:120–129. doi: 10.1002/(SICI)1097-0134(199605)25:1<120::AID-PROT10>3.0.CO;2-M. [DOI] [PubMed] [Google Scholar]
  • 75.Berendsen HJC, Postma JPM, VanGunsteren WF, Dinola A, Haak JR. Molecular-dynamics with coupling to an external bath. J. Chem. Phys. 1984;81:3684–3690. [Google Scholar]
  • 76.Ferrenberg AM, Swendsen RH. New Monte Carlo technique for studying phase transitions. Phys. Rev. Letters. 1988;61:2635–2638. doi: 10.1103/PhysRevLett.61.2635. [DOI] [PubMed] [Google Scholar]
  • 77.Ferrenberg AM, Swendsen RH. Optimized Monte-Carlo data analysis. Phys. Rev. Letters. 1989;63:1195–1198. doi: 10.1103/PhysRevLett.63.1195. [DOI] [PubMed] [Google Scholar]
  • 78.Berendsen HJC, van der Spoel D, van Drunen R. GROMACS: A message-passing parallel molecular dynamics implementation. Comp. Phys. Comm. 1995;91:43–56. [Google Scholar]
  • 79.Jorgensen WL, Tirado-Rives J. The OPLS potential functions for proteins. energy minimizations for crystals of cyclic peptides and crambin. J. Am. Chem. Soc. 1988;110:1657–1666. doi: 10.1021/ja00214a001. [DOI] [PubMed] [Google Scholar]
  • 80.Jorgensen WL, Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. Comparison of simple potential functions for simulating liquid water. J. Chem. Phys. 1983;79:926935. [Google Scholar]
  • 81.Miyamoto S, Kollman PA. SETTLE: An Analytical Version of the SHAKE and RATTLE Algorithms for Rigid Water Models. J. Comp. Chem. 1992;13:952–962. [Google Scholar]
  • 82.Hess B, Bekker H, Berendsen HJC, Fraaije JGEM. LINCS: A Linear Constraint Solver for molecular simulations. J. Comp. Chem. 1997;18:1463–1472. [Google Scholar]

RESOURCES