Skip to main content
Biophysical Journal logoLink to Biophysical Journal
. 2008 Sep 19;95(12):5524–5532. doi: 10.1529/biophysj.108.136697

High Resolution Approach to the Native State Ensemble Kinetics and Thermodynamics

Sangwook Wu 1, Pavel I Zhuravlev 1, Garegin A Papoian 1
PMCID: PMC2599834  PMID: 18805918

Abstract

Many biologically interesting functions such as allosteric switching or protein-ligand binding are determined by the kinetics and mechanisms of transitions between various conformational substates of the native basin of globular proteins. To advance our understanding of these processes, we constructed a two-dimensional free energy surface (FES) of the native basin of a small globular protein, Trp-cage. The corresponding order parameters were defined using two native substructures of Trp-cage. These calculations were based on extensive explicit water all-atom molecular dynamics simulations. Using the obtained two-dimensional FES, we studied the transition kinetics between two Trp-cage conformations, finding that switching process shows a borderline behavior between diffusive and weakly-activated dynamics. The transition is well-characterized kinetically as a biexponential process. We also introduced a new one-dimensional reaction coordinate for the conformational transition, finding reasonable qualitative agreement with the two-dimensional kinetics results. We investigated the distribution of all the 38 native nuclear magnetic resonance structures on the obtained FES, analyzing interactions that stabilize specific low-energy conformations. Finally, we constructed a FES for the same system but with simple dielectric model of water instead of explicit water, finding that the results were surprisingly similar in a small region centered on the native conformations. The dissimilarities between the explicit and implicit model on the larger-scale point to the important role of water in mediating interactions between amino acid residues.

INTRODUCTION

The native state of a typical globular protein is not a single static conformation but possesses rich intrinsic dynamics. In many cases, these dynamics are essential for protein function. Examples include enzymatic catalysis (1), allosteric switching (13), protein-ligand binding (Mb-CO) (4), the change of antibody-ligand binding during maturation process (5), and photorhodopsin photocycle at low temperature (6). The main objective of this article is to advance our understanding of the kinetics of conformational switching in the native basin of a globular protein. In particular, we address the question of whether native dynamics are activated, showing normal Arrhenius temperature dependence, or diffusive, similar to those of ordinary fluids. After reviewing the underlying energy landscape theory and reaction coordinates used in protein folding, we show a novel way to study the kinetics of protein conformational transitions by carrying out a Brownian dynamics simulation on our two-dimensional free energy surface (FES) with two collective coordinates defined by the initial and final conformations.

A successful statistical mechanical approach to understanding protein folding is based on the concept of there being a funneled effective energy landscape (713). The native state is thought to be a set of deep minima, in which the residual frustration between subtly different structural conformations leads to a rugged topography (7,8,1113). Residual entropy plays a similar role in spin glasses (7,1114). A protein at room temperature explores multiple conformations with different thermodynamically favorable contacts engendering this frustration (14,15). These native structural substates seem to be hierarchically organized into a tree with some ultrametric character that, in principle, can be mapped onto the funnel (Fig. 1). Kinetics of transitions between the substates is also hierarchical with timescales dictated by the details of the distribution of energy barriers (7,13,1620).

FIGURE 1.

FIGURE 1

A sketch of a hierarchical organization of native substates at the bottom of the energy funnel is shown. Native state on this schematic picture consists of two substates I and J interchanging on some timescale. On shorter timescales and higher resolution in energy, each one of them also splits into substates. The substate I splits into G and H, which split further into yet another level of substates and corresponding microbasins.

By energy, we mean the energy of contacts. That, strictly speaking, is not an energy in the thermodynamic sense, because it includes the solvent coordinates in an implicit way. For example, hydrophobic interactions, though substantially entropic in origin, may be viewed as an energetic attraction between apolar residues. Even this effective energy of a protein conformation is still a function of hundreds or thousands of protein degrees of freedom (13). Often physical considerations permit phase space reduction through integration leaving only a few degrees of freedom that define then some very low-dimensional FES. Therefore, to avoid confusion (and also for historical reasons) the former representation of the landscape (funnel picture) is phrased in terms of energy, and the latter (low-dimensional FES)—in terms of free energy. The quantity and nature of the remaining coordinates determines the resolution of the FES. In particular, one may choose these so that the hierarchical organization of the substates and transitions will still be manifest. For instance, we may still see this hierarchy in nontrivial multiple timescale non-Arrhenius dynamics with an abundance of traps and barriers on the surface. Once the surface is constructed, studying the features of complicated protein dynamics consumes considerably less effort and permits a concentration on the kinetics of conformational switching. This technique can also be used to shed light on many interesting phenomena such as protein quakes (21).

In this work, we constructed a FES, in two dimensions, for the native state of Trp-cage, a 20-residue protein that folds in 4 μs (22,23). There are 38 nuclear magnetic resonance (NMR) structures recorded for this protein in PDB code (1L2Y) (24). Using the constructed surface, we studied the transition between two NMR conformations, numbered 1 and 37, the most dissimilar of all 38 structures.

Among the plethora of possible order parameters in polymer physics and protein folding in particular, only those with a sufficiently high structural resolution can serve our purposes. One way to achieve such a high resolution is to define a coordinate with respect to a particular conformation A. A widely used coordinate is the root-mean-square distance (RMSD) between the corresponding atoms (usually, of the backbone, Cα). Another possible order parameter has evolved from spin-glass energy landscape theories and is a generalization of what is called the “overlap parameter” (25). In case of polymer chains, the latter is a fraction of contacts that the two conformations have in common (26). This parameter is often called Q, and may be defined as

graphic file with name M1.gif (1)

where rij values are the distances between ith and jth atom in the conformation of interest, Inline graphic is the same for the conformation A with respect to which the Q value is defined (usually, it is the native state), and normalization factor N is equal to number of pairs of atoms whose positions are defining the conformation (27). The similarity index Q changes from 1 (for the conformation A itself) to 0 (when there is no resemblance to A). Indeed, the Gaussian in Eq. 1 suppresses the contribution of a pair of atoms if the distance between these atoms (rij) is very different from that in conformation A (Inline graphic). If one replaces the Gaussian with a rectangular peak and, in addition, only includes pairs of proximal atoms (rij ≲ 5 Å), Q will turn into a fraction of shared contacts (fraction of native contacts when A is the native state).

Though RMSD might seem more familiar and natural, in some cases Q is preferable. If one imagines, for example, two conformations having two α-helices, that are close in one of them and apart in the other one, then RMSD between them will be very large, suggesting no structural similarity, while Q will still show the similarities of individual helices. Another nice feature of Q is that σ in its definition allows one to control the resolution of the order parameter, and one may tweak the notion of conformational similarity according to the particular questions under consideration. Yet another approach would be to use the fraction of shared dihedral angles of the backbone. Like Q, this parameter has shown considerable correlation with the strata of the folding funnel (28) when defined with respect to the native structure. Along with the native structure RMSD and fraction of native contacts or hydrogen bonds, the radius of gyration has also been used as a coordinate, since it is also correlated with folding. However, this measure lacks resolution inside the native basin, which is the focus of our work, as further elaborated below.

Q is a good coordinate for protein folding because the funnel makes it possible to stipulate that motions in directions transverse to Q will usually be fast, and therefore the motion along Q (folding) is nearly adiabatic (29). A one-dimensional profile is then quite appropriate. However, if there were dynamical variables weakly correlated with Q but evolving on similar timescales, the information yielded by a one-dimensional profile would be of very limited usefulness. In particular, studying the dynamics of allosteric switching may require higher dimensional order parameter space, which allows the possibility to see multiple transition paths separately, rather than integrated into an averaged one. A number of studies constructing 2D FESs have been reported in the context of protein folding (19,30,31).

A natural choice of two dynamical variables having sufficient structural precision to provide a surface that allows the examination of the transition between two states 1 and 37 would be Q1 and Q37, that is, the similarities to states 1 and 37, respectively. Although Trp-cage is not an allosteric protein it nevertheless serves as a good test case for a general method that we develop here. In these coordinates the whole phase space is represented by a square {0 ≤ Q1 ≤ 1; 0 ≤ Q37 ≤ 1} (Fig. 2). The Q1 = Q37 diagonal of this square can be paralleled to a Q-axis in one-dimensional free energy profile of protein folding. With our technique, we pick two extremely close points on the one-dimensional free energy profile, near the folded state (Q = 1), that correspond to models 1 and 37, and pull them apart, unfurling the profile to a surface on a square and greatly increasing the resolution in the vicinity of reference states 1 and 37. On the (Q, Rg)-plot, these two points would nearly coalesce, but we zoom in to the region between and around them, leaving the unfolded state unresolved. All the unfolded structures are found close to the origin of the square.

FIGURE 2.

FIGURE 2

Two-dimensional free energy surface (2D-FES) of Trp-cage native basin as a function of collective coordinates Q1 and Q37, indicating corresponding similarities to NMR models 1 and 37, was obtained using WHAM in 914 simulation windows (see Methods and Data S1). An all-atom MD simulation of 1.2 ns in duration was carried out in each window. Open circles, marking FES, represent the 38 NMR structures in the Q1 and Q37 coordinates. Spacing between contour lines is 0.22 kcal/mol (0.37 kT). White rectangles mark the deepest and the second deepest basins corresponding to model 1 (lower right corner) and model 16 (upper left corner), respectively. The difference of ∼0.6 kT in free energy is created by rotation of Tyr-3, cation-π interaction between Arg-16 and Trp-6 and disappearance of 3-10 α-helix.

We obtained the FES through extensive (>1 μs) all-atom explicit water simulations in CHARMM force field (32) using the two-dimensional weighted histogram analysis method (WHAM) (33). Subsequently, we ran damped Brownian dynamics starting from the basin of model 37, looking for the first passage time to the basin of model 1. Analysis of trajectories, first passage time (FPT) distributions, and temperature dependence of switching kinetics suggests a nearly free-diffusional flow, with shallow traps weakly modulating transition dynamics. Approximately half of the trajectories undergo partial unfolding down to Q ≈ 0.8 and visit a number of local traps. We explored a one-dimensional order parameter for describing the kinetics of this transition. Brownian dynamics on this one-dimensional free energy profile yields FPT distributions qualitatively similar to those obtained from the two-dimensional calculations.

We mapped the other 36 NMR structures onto the surface. This is an initial step in ranking them by free energies and organizing into an hierarchy. Having identified two deepest basins we compared structurally the corresponding conformations, highlighting stabilizing contacts and interactions. We also found that entropy and energy surfaces are much more rugged with steeper and higher rises and falls than that of free energy, which we attributed to fast solvent degrees of freedom. Lastly, we repeated the FES calculation with a simple dielectric solvent model (DSM), obtaining surprisingly similar results for the highly native part of the surface. Comparing the differences, however, showed the importance of water in stabilizing the folded state.

METHODS

The Q values among 38 NMR structures were calculated using Biochemical Algorithm Library (i.e., BALL) (34). All MD simulations were carried out using the Large-scale Atomic/Molecular Massively Parallel Simulator (i.e., LAMMPS) package (35) with the CHARMM27 protein-lipid force field (32). In DSM, the system with dielectric constant 80 was heated up to 282 K and equilibrated for 800 ps using targeted MD to maintain the NMR structures. In the explicit solvent model, the protein was solvated with 2275 TIP3P water molecules and the counterions, 5Na+ and 6Cl, in a 50 × 50 × 50 Å3 water box. The SHAKE algorithm was used to restrain the hydrogen bonds. Minimization was performed in two steps. First we minimized energy of water with protein fixed for 10,000 steps with a conjugate gradient method. Then, using the NAMD suite (36), we minimized the energy of the whole system for an additional 10,000 steps. Using the charm2lammps Perl script, the initial input file was generated for LAMMP, and subsequently, a NVT simulation was carried out for 12 ps, followed by an NPT simulation for 60 ps with targeted MD. In the final production phase, 1.2-ns-long simulations were carried out for each of the 914 WHAM windows. In subsequent analysis, the first 200 ps were discarded. The trajectories were recorded every three picoseconds in each window.

The following two-dimensional umbrella potential was used: Inline graphic The spring constants and simulation time in each window were chosen on the basis of good overlap between neighboring windows. In other words, for each window, the rate of going to the areas covered by neighboring windows multiplied by the simulation time for a window should be considerably greater than unity. It turned out that the spring constants satisfying this criterion were in the range of from 11.2 kcal/mol per Å2 to 72.8 kcal/mol per Å2.

Brownian dynamics was carried out using Heun's method for the difference scheme (37). FES was interpolated in MATLAB (The MathWorks, Natick, MA). Gradients were calculated from four nearest points in two dimensions. Forty-thousand trajectories were used to obtain each FPT-distribution.

The equation of one-dimensional collective coordinate ξ is

graphic file with name M5.gif (2)

where Q1,37 is Q between structures 1 and 37 according to Eq. 1 and μ is a parameter that controls structural resolution. This function, shown in Supplementary Material, Fig. S9 in Data S1, corresponds to placing negative and positive Gaussian peaks on top of conformations 1 and 37 in the (Q1, Q37) square. Then, elevation of the resulting surface, ξ(Q1, Q37), serves as an order parameter (Fig. S9 in Data S1). The latter has an important feature (as opposed, for example, to ΔQ = Q1Q37 as an order parameter) that the endpoints of the ξ interval describe as compact regions near structures 1 and 37.

RESULTS AND DISCUSSION

Since the configurations of side chains play a key role inside a native basin (Fig. S7 in Data S1), we included side-chain carbons atoms (78 atoms total that include Cα, Cβ, Cγ, Cδ, Cε, and Cz) in the definition Eq. 1 of Q. Fig. 2 shows the computed two-dimensional free energy surface (FES) for the native state of Trp-cage peptide in terms of two similarity indices, Q1 and Q37. These two conformations (1 and 37) were chosen on the basis of being the most structurally dissimilar among the 38 NMR structures: they belong to different classes (structural basins) according to the hierarchical clustering (38) and have the smallest Q between them. Model 1 corresponds to (Q1 = 1.0, Q37 = 0.889) and model 37 corresponds to (Q1 = 0.889, Q37 = 1.0). All the different models of Trp-cage reside within the rectangle of 0.88 < Q1 < 1.0 and 0.889 < Q37 < 1.0. The deepest microbasin on the whole FES is that of model 1 followed by model 16 as the next deepest.

Stabilizing structural features

A representative structure from the microbasin of model 1 is shown in lower-right corner of Fig. 2 (this structure is almost identical to NMR model 1). Comparing this structure to model 16, which is the second lowest in free energy, reveals structural features that stabilize model 1:

  1. The 3-10 helix is not present in model 1, which is quite consistent with previous result by Zhou (39).

  2. Arg-16 is shifted toward Trp-6. This hints at the presence of a typical cation-π interaction between Arg and Trp, which are the likeliest amino acid residues to be involved in such an interaction (40). The disappearance of 3-10 helix (residues 11–14) may result from the side-chain movement of Arg-16, which is adjacent to the helix and likely contributes to its stability.

  3. Tyr-3 is rotated in model 1, which is favorable for intramolecular interaction between Tyr-3 and Trp-6. Thus, the main factors that lower model 1's free energy are the energetic contribution from the interaction between Trp-6 and Arg-16, the intramolecular interaction due to a ring rotation of Tyr-3, and the increase of entropy due to the loss of the 3-10 helix structure. The backbone RMSD between model 1 and model 16 is ∼0.8 Å.

Transitions between microbasins

Protein dynamics have been thoroughly analyzed in many prior works (4143), often using very diverse viewpoints on the nature of dynamical transitions. Some techniques, such as normal mode analysis, treat proteins as solids possessing vibrations and phonons (4446). Another viewpoint is to imagine activated hopping between different conformations separated by energy barriers (17,47). This can be paralleled to the dynamics of a supercooled liquid. Yet another possibility is that the motion is similar to flow of a normal liquid such that the system spends most of the time in saddle points rather than in minima of the energy landscape. There have been indications that activated, or even glassy-dynamics, are more relevant to proteins, at least at low temperatures (7,9,15). In this work we investigate more deeply the nature of protein dynamics at room temperature. Knowing the dynamical regime is important when studying fluorescence intermittency (48) or allosteric regulation (1). For this purpose we considered the transition between two reference conformations, 1 and 37. If the dynamics are activated they should be dominated by a few or one pathway between the reference states. Another issue we address in this section is the determination of the ruggedness of the surface and the degree in which local traps influence transition dynamics. The influence of traps should be considerable in the activated regime and small in the diffusional flow regime.

To describe conformational transitions, we ran multiple two-dimensional Brownian dynamics simulations on the computed FES (29), starting from the basin of model 1 (Q1 = 0.97 and Q37 = 0.895) (see Methods). Two trajectories are shown in Fig. 3 top. Distribution of first passage times (FPT) for transitions from model 1 basin to that of model 37 is given in Fig. 3 bottom right (black line). Our analysis of trajectories suggested a lack of dominant pathway. Indeed, even the trajectories picked from around maximum of the FPT distribution (a typical trajectory is shown in white in Fig. 3 top) linger in different basins, despite other similarities. The trajectories taken from the shoulder of the FPT distribution (an example is shown in magenta in Fig. 3 top) differ completely. It turned out that almost one-half of the trajectories fall into the shoulder of the FPT distribution (Fig. 3 bottom right). The observed multiplicity of transition pathways may be engendered either by trapping in different local traps (and the abundance of traps on the rugged surface causing the large width of the FPT distribution), or by the thermal noise if it is large enough to easily overcome the barriers (free diffusion). Fig. 3 bottom left shows a magnified part of the surface depicting a microbasin. Even though the trajectory might seem as freely moving, it noticeably lingers in this microbasin. We find that there is almost no trapping as such, although the influence of fine features of the surface is conspicuous. This observation, along with the temperature dependence of mean FPTs discussed below, directly suggests that protein conformational dynamics at room temperature shows a borderline behavior between nearly diffusive (as in a normal liquid) and weakly activated (as in a weakly supercooled liquid).

FIGURE 3.

FIGURE 3

Brownian dynamics of the transition between states corresponding to NMR structures numbered 1 and 37. (top) Two sample trajectories: one (white) is from the peak of FPT distribution in panel bottom right, and the other one (magenta) is from the shoulder. (bottom left) Part of the surface with trajectory shown in higher resolution revealing finer features of the surface and their influence on the trajectory. (bottom right) First passage time (FPT) distributions for conformational transition between 1 and 37. FPTs were computed at two additional temperatures to obtain the corresponding coefficients of variation, to provide reference for categorizing the nature of the dynamical regime at room temperature.

To further support this conclusion, we performed analogous calculations on the same surface, but at several different temperatures, comparing the corresponding FPT distributions. In reality, the FES itself would change with temperature, and therefore, these calculations do not yield direct information on the behavior of Trp-cage at those temperatures: their sole purpose is to provide reference points for identifying the character of the dynamical regime. Higher temperatures promote the role of freely diffusive motions, while lower temperatures promote trapping. The FPT-distributions computed at 153 K, 282 K, and 600 K are given in Fig. 3 bottom right. The FPT distribution variance grows, as temperature drops, but relative variance, or coefficient of variation, decreases. The latter is a measure of distribution width; when Cv < 1, the distribution is considered low-variance, and when Cv > 1, the distribution is considered high-variance. Thus, the number of effective pathways drops with decreasing temperature, and at low temperatures a dominant pathway emerges (Cv = 0.27 at 51 K). We observe at 282 K (Cv = 1.09) a crossover between the low-variance and high-variance regime.

The Arrhenius plot for the mean FPT (〈τ*〉), which is shown in Fig. 4, provides an alternative way to analyze protein conformational dynamics. At low temperatures, we observe normal Arrhenius behavior with 〈τ*〉 ∼ exp(EA/kT). At high temperatures, dependence becomes characteristic of the diffusive regime (for diffusion in confined space), with 〈τ*〉 ∼ D−1T−1. Dashed and solid curves are the fits to these functions, respectively. Room temperature behavior is near the crossover between these two regimes, with activation energy of ∼0.7 kT, corresponding to weak local trapping.

FIGURE 4.

FIGURE 4

Arrhenius plot shows the dependence of the mean FPT with temperature for the conformational transition from 37 to 1 on fixed FES (computed at 282 K), where the FES is temperature-independent. The inset zooms into higher temperature region using a semireciprocal plot. Two regimes are clearly seen: exponential at low temperatures, and linear at high temperatures. The near room temperature (282 K) result is near the crossover.

This provides another view on the problem of single- and multiexponential behavior that has been discussed in the context of both regular (with a barrier) and downhill protein folding (49,50). If P(τ*) denotes a FPT-distribution, or probability density that the transition time is τ*, then Inline graphic is the distribution function, or the probability that the transition has occurred within time τ*. The survival curve, 1 – F(τ*), is then the probability that transition has not yet occurred after time τ*. Though this curve can be fitted by a single exponential, especially at long timescales (where such behavior might be expected, for example, for diffusion in a confined phase space), it turns out that much better fit is achieved by a double exponential (Fig. S8 in Data S1). This is reminiscent of the dynamical behaviors discussed in the context of barrierless, or downhill protein folding where the shorter timescale is called the “speed limit” (49,51).

Summarizing this section, the dynamics of switching between two conformations shows features of both flow (as in a normal liquid) and weakly activated dynamics (as in a weakly supercooled liquid). Also, the transition kinetics shows biexponential dynamical behavior.

One-dimensional reaction coordinate

Next, we investigate whether kinetics of conformational switching between 1 and 37 can be adequately described using a one-dimensional free energy profile, instead of the 2D FES described above. To address this question, we developed an order parameter which smoothly connects the transition endpoints, structures 1 and 37, by placing a positive Gaussian on top of basin 1 and a negative Gaussian on top of basin 37 (P. I. Zhuravlev, S. Wu, M. Rubinstein, and G. A. Papoian, unpublished). Then, elevation of the resulting surface, ξ(Q1, Q37), serves as an order parameter (Fig. S9 in Data S1). The latter has an important feature that the endpoints of the ξ interval describe compact regions near structures 1 and 37. We obtained the corresponding one-dimensional free energy profile by integrating

graphic file with name M7.gif

and ran subsequent one-dimensional Brownian dynamics simulations. The resulting FPT-distribution was characterized with Cv = 0.91 because of increased contribution of the trap in the middle, which includes partially unfolded states. The change, however, is not very large and still is near the crossover between activated and diffusional flow regimes. Furthermore, using this new variable, one may directly calculate a one-dimensional free energy profile instead of computing 2D FES. This requires much less computational time and may be useful, especially when one is more interested in thermodynamical questions, such as finding the free energy difference between states 1 and 37. More details on free energy calculations using this collective coordinate will be published elsewhere (P. I. Zhuravlev, S. Wu, M. Rubinstein, and G. A. Papoian, unpublished).

Explicit solvent model and dielectric solvent model

We repeated the 2D FES calculations for a system where explicit water molecules were removed, and, instead, ɛ = 80 was used for all electrostatic terms in the Hamiltonian. The FES from this simple dielectric solvent model (DSM) is given in Fig. 5. It is more symmetric along the diagonal than in case of explicit water. DSM surface has a basin in the region where both Qs are ∼0.75, while that of explicit solvent model does not; this suggests that water molecules prevent the tendency for partial unfolding. This point is also bolstered by the presence of barriers of ∼kT height scattered in the region of Qs between 0.8 and 0.9 on the explicit solvent surface and absent on the DSM surface. The presence of the basin in lower-left corner of DSM 2D FES is also a result of the lack of hydrophobic effect, which leads to opening of hydrophobic cores composed of Trp-6, Pro-12, Pro-17, Pro-18, and Pro-19 (53); the structures there (open box in Fig. 5) correspond to a loop structure L found in a work by Juraszek and Bolhuis (53) as one of the intermediates during Trp-cage folding. In the upper part, near the basins of 1 and 37, surface features are well preserved even though dielectric solvent model is extremely simplistic. The difference between the explicit solvent FES and the DSM FES given in Fig. 6 demonstrates this more clearly. As Qs decrease, more differences appear between explicit and DSM FESs, highlighting the role of the hydrophobic effect, which is completely absent in the DSM simulations. The 2D FES differences are asymmetric with respect to the diagonal line, indicating that the hydrophobic effect favors structures on one part of the surface. To explore this suggestion, we picked several structures from different sides of the diagonal line, (Q1 = 0.90, Q37 = 0.81) and (Q1 = 0.77, Q37 = 0.76), and calculated the solvent-accessible surface areas (SASAs) for each residue (see Data S1). The SASAs of the hydrophobic core for the first region turned out to be ∼10% smaller, which provides further support that the asymmetric shape of the FES is also hydrophobic in origin. It is an interesting question whether reasonable corrections for the differences between explicit and DSM FES could be provided by introducing a simple hydrophobic term into the DSM Hamiltonian. It will also be interesting to use a Generalized Born model to treat electrostatics instead of DSM. One might anticipate that a free energy surface calculated using a GBSA Hamiltonian (31) will be more similar to the explicit solvent result.

FIGURE 5.

FIGURE 5

FES was computed for the dielectric solvent model (see text). Two of the partially unfolded structures from the open square are shown in the corners. The lower basin is absent in the explicit solvent FES. On the other hand, a number of barriers have disappeared in the DSM FES. The vicinities of the reference points 1 and 37 are very similar between explicit and DSM FESs.

FIGURE 6.

FIGURE 6

The difference between explicit solvent FES and DSM FES. In the Qs ≲ 0.95 region, it mainly represents the hydrophobic effect. The structures in the corners are taken from the hydrophobically favorable and unfavorable regions. The hydrophobic core is shown in van der Waals spheres, so that its opening is visually noticeable.

Finally, we consider physical motivation for the treatment of the hydrophobic effect as an effective energetic term in implicit solvent models. The entropic and energetic landscapes in our explicit water simulations are both very rugged, which we attribute to the solvent degrees of freedom (Fig. S10 in Data S1). However, the protein chain itself is not influenced much by this solvent energy ruggedness, because of the timescale separation between fast motions of the solvent and slower motions of the chain. Instead, the protein chain moves mostly adiabatically in the averaged solvent field, where the resulting free energy ruggedness is ∼10 times smaller. Therefore, treating hydrophobic interactions as though they are energetic has reasonable physical grounds.

CONCLUSION

Two-dimensional FESs are a powerful tool in studying kinetics and thermodynamics of the native state ensemble. With proper choice of dynamical variables one may control the resolution in various regions of the phase space. For investigation of allosteric switching kinetics between two states, similarities to each of these states (generalized fraction of shared contacts) serve as useful collective order parameters. In this work we computed a two-dimensional free energy surface for Trp-cage. We mapped all 38 NMR structures onto the surface, ranking them in free energies and identifying most thermodynamically stable conformations. We pointed out the main interactions that enhance the stability of the deepest basins. Using the computed 2D FES, we studied the kinetics of transition between two Trp-cage native substates reported by a NMR study. We found that the native dynamics of Trp-cage is borderline between diffusional dynamics and weakly activated dynamics. At room temperature and higher, the corresponding survival curve is best fitted by a double exponential. Transition dynamics become clearly activated at lower temperatures, and diffusional at higher temperatures. We constructed a new one-dimensional collective coordinate to describe kinetics and thermodynamics of the native state, with promising results.

To explore the role of the solvent, we also computed 2D FES for the same system but with static dielectric medium instead of water. The explicit and DSM free energy surfaces were quite similar in the vicinity of reference conformations, where resolution of our technique is the highest. However, when protein conformations become somewhat less native, systematic differences between explicit and DSM models point to the role of the hydrophobic effect. Computing energy and entropy contributions separately showed the major role played by water and the necessity for calculation of the free energy as a whole, the latter being a result of cancellation of very large terms. Our method may facilitate rigorous construction of coarse-grained force fields based on free energies. It will be interesting to apply the technique introduced in this work to conformational transitions in larger proteins.

SUPPLEMENTARY MATERIAL

To view all of the supplemental files associated with this article, visit www.biophysj.org.

Supplementary Material

[Supplement]
108.136697_index.html (651B, html)

Acknowledgments

We thank Dr. Michael Rubinstein for stimulating discussions.

This work was supported by the Camille and Henry Dreyfus Foundation and National Science Foundation under grant No. CHE-715225.

Editor: Kathleen B. Hall.

References

  • 1.Kern, D., and E. R. P. Zuiderweg. 2003. The role of dynamics in allosteric regulation. Curr. Opin. Struct. Biol. 13:748–757. [DOI] [PubMed] [Google Scholar]
  • 2.Clarkson, M. W., S. A. Gilmore, M. H. Edgell, and A. L. Lee. 2006. Dynamic coupling and allosteric behavior in a nonallosteric protein. Biochemistry. 45:7693–7699. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Helmstaedt, K., S. Krappmann, and G. H. Braus. 2001. Allosteric regulation of catalytic activity: Escherichia coli aspartate transcarbamoylase versus yeast chorismate mutase. Microbiol. Mol. Biol. Rev. 65:404–421 (table of contents.). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Hong, M. K., D. Braunstein, B. R. Cowen, H. Frauenfelder, I. E. Iben, J. R. Mourant, P. Ormos, R. Scholl, A. Schulte, and P. J. Steinbach. 1990. Conformational substates and motions in myoglobin. External influences on structure and dynamics. Biophys. J. 58:429–436. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Thorpe, I. F., I. Brooks, and L. Charles. 2007. Molecular evolution of affinity and flexibility in the immune system. Proc. Natl. Acad. Sci. USA. 104:8821–8826. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Dioumaev, A. K., and J. K. Lanyi. 2007. Bacteriorhodopsin photocycle at cryogenic temperatures reveals distributed barriers of conformational substates. Proc. Natl. Acad. Sci. USA. 104:9621–9626. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Frauenfelder, H., S. G. Sligar, and P. G. Wolynes. 1991. The energy landscapes and motions of proteins. Science. 254:1598–1603. [DOI] [PubMed] [Google Scholar]
  • 8.Bryngelson, J. D., J. N. Onuchic, N. D. Socci, and P. G. Wolynes. 1995. Funnels, pathways, and the energy landscape of protein folding: a synthesis. Proteins. 21:167–195. [DOI] [PubMed] [Google Scholar]
  • 9.Frauenfelder, H., N. A. Alberding, A. Ansari, D. Braunstein, B. R. Cowen, M. K. Hong, I. E. T. Iben, J. B. Johnson, S. Luck, M. C. Marsden, J. R. Mourant, P. Ormos, L. Reinisch, R. Scholl, A. Schulte, E. Shyamsunder, L. B. Sorensen, P. J. Steinbach, A. Xie, R. D. Young, and K. T. Yue. 1990. Proteins and pressure. J. Phys. Chem. 94:1024–1037. [Google Scholar]
  • 10.Brooks, C. L., J. N. Onuchic, and D. J. Wales. 2001. Statistical thermodynamics: taking a walk on a landscape. Science. 293:612–613. [DOI] [PubMed] [Google Scholar]
  • 11.Frauenfelder, H., and P. G. Wolynes. 1994. Biomolecules: where the physics of complexity and simplicity meet. Phys. Today. 47:58–64. [Google Scholar]
  • 12.Onuchic, J. N., Z. Luthey-Schulten, and P. G. Wolynes. 1997. Theory of protein folding: the energy landscape perspective. Annu. Rev. Phys. Chem. 48:545–600. [DOI] [PubMed] [Google Scholar]
  • 13.Gruebele, M. 2002. Protein folding: the free energy surface. Curr. Opin. Struct. Biol. 12:161–168. [DOI] [PubMed] [Google Scholar]
  • 14.Fenimore, P. W., H. Frauenfelder, B. H. McMahon, and R. D. Young. 2004. Bulk-solvent and hydration-shell fluctuations, similar to α- and β-fluctuations in glasses, control protein motions and functions. Proc. Natl. Acad. Sci. USA. 101:14408–14413. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Frauenfelder, H., and B. McMahon. 1998. Dynamics and function of proteins: the search for general concepts. Proc. Natl. Acad. Sci. USA. 95:4795–4797. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Wales, D. J., M. A. Miller, and T. R. Walsh. 1998. Archetypal energy landscapes. Nature. 394:758–760. [Google Scholar]
  • 17.Becker, O. M., and M. Karplus. 1997. The topology of multidimensional potential energy surfaces: theory and application to peptide structure and kinetics. J. Chem. Phys. 106:1495–1517. [Google Scholar]
  • 18.Levy, Y., and O. M. Becker. 2001. Energy landscapes of conformationally constrained peptides. J. Chem. Phys. 114:993–1009. [Google Scholar]
  • 19.Krivov, S. V., and M. Karplus. 2004. Hidden complexity of free energy surfaces for peptide (protein) folding. Proc. Natl. Acad. Sci. USA. 101:14766–14770. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Metzler, R., J. Klafter, and J. Jortner. 1999. Hierarchies and logarithmic oscillations in the temporal relaxation patterns of proteins and other complex systems. Proc. Natl. Acad. Sci. USA. 96:11085–11089. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Miyashita, O., J. N. Onuchic, and P. G. Wolynes. 2003. Nonlinear elasticity, protein quakes, and the energy landscapes of functional transitions in proteins. Proc. Natl. Acad. Sci. USA. 100:12570–12575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Simmerling, C., B. Strockbine, and A. E. Roitberg. 2002. All-atom structure prediction and folding simulations of a stable protein. J. Am. Chem. Soc. 124:11258–11259. [DOI] [PubMed] [Google Scholar]
  • 23.Snow, C., B. Zagrovic, and V. Pande. 2002. The Trp cage: folding kinetics and unfolded state topology via molecular dynamics simulations. J. Am. Chem. Soc. 124:14548–14549. [DOI] [PubMed] [Google Scholar]
  • 24.Neidigh, J. W., R. M. Fesinmeyer, and N. H. Andersen. 2002. Designing a 20-residue protein. Nat. Struct. Biol. 9:425–430. [DOI] [PubMed] [Google Scholar]
  • 25.Plotkin, S. S., J. Wang, and P. G. Wolynes. 1996. Correlated energy landscape model for finite, random heteropolymers. Phys. Rev. E Stat. Phys. Plasmas Fluids Relat. Interdiscip. Topics. 53:6271–6296. [DOI] [PubMed] [Google Scholar]
  • 26.Plotkin, S. S., J. Wang, and P. G. Wolynes. 1997. Statistical mechanics of a correlated energy landscape model for protein folding funnels. J. Chem. Phys. 106:2932–2948. [Google Scholar]
  • 27.Hardin, C., M. P. Eastwood, Z. Luthey-Schulten, and P. G. Wolynes. 2000. Associative memory Hamiltonians for structure prediction without homology: α-helical proteins. Proc. Natl. Acad. Sci. USA. 97:14235–14240. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Onuchic, J. N., P. G. Wolynes, Z. Luthey-Schulten, and N. D. Socci. 1995. Toward an outline of the topography of a realistic protein-folding funnel. Proc. Natl. Acad. Sci. USA. 92:3626–3630. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Socci, N. D., J. N. Onuchic, and P. G. Wolynes. 1996. Diffusive dynamics of the reaction coordinate for protein folding funnels. J. Chem. Phys. 104:5860–5868. [Google Scholar]
  • 30.Bursulaya, B., and C. Brooks. 1999. Folding free energy surface of a three-stranded β-sheet protein. J. Am. Chem. Soc. 121:9947–9951. [Google Scholar]
  • 31.Zhou, R. 2003. Free energy landscape of protein folding in water: explicit vs. implicit solvent. Proteins. 53:148–161. [DOI] [PubMed] [Google Scholar]
  • 32.MacKerell, A. D., N. Banavali, and N. Foloppe. 2000. Development and current status of the CHARMM force field for nucleic acids. Biopolymers. 56:257–265. [DOI] [PubMed] [Google Scholar]
  • 33.Kumar, S., J. M. Rosenberg, D. Bouzida, R. H. Swendsen, and P. A. Kollman. 1992. The weighted histogram analysis method for free-energy calculations on biomolecules. J. Comput. Chem. 13:1011–1021. [Google Scholar]
  • 34.Kohlbacher, O., and H. P. Lenhof. 2000. BALL—rapid software prototyping in computational molecular biology. Biochemicals Algorithms Library. Bioinformatics. 16:815–824. [DOI] [PubMed] [Google Scholar]
  • 35.Plimpton, S. 1995. Fast parallel algorithms for short-range molecular dynamics. J. Comput. Phys. 117:1–19. [Google Scholar]
  • 36.Phillips, J. C., R. Braun, W. Wang, J. Gumbart, E. Tajkhorshid, E. Villa, C. Chipot, R. D. Skeel, L. Kal, and K. Schulten. 2005. Scalable molecular dynamics with NAMD. J. Comput. Chem. 26:1781–1802. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Garcia-Ojalvo, J., and J. M. Sancho. 1999. Noise in Spatially Extended Systems. Springer-Verlag, New York. [DOI] [PubMed]
  • 38.Ward, J. H. 1963. Hierarchical grouping to optimize an objective function. J. Am. Stat. Assoc. 58:236–244. [Google Scholar]
  • 39.Zhou, R. 2003. Trp-cage: folding free energy landscape in explicit water. Proc. Natl. Acad. Sci. USA. 100:13280–13285. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Gallivan, J. P., and D. A. Dougherty. 1999. Cation-π interactions in structural biology. Proc. Natl. Acad. Sci. USA. 96:9459–9464. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Shehu, A., L. E. Kavraki, and C. Clementi. 2007. On the characterization of protein native state ensembles. Biophys. J. 92:1503–1511. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Shenu, A., C. Clementi, and L. E. Kavraki. 2007. Sampling conformation space to model equilibrium fluctuations in proteins. Algorithmica. 48:303–327. [Google Scholar]
  • 43.Chen, J., R. I. Dima, and D. Thirumalai. 2007. Allosteric communication in dihydrofolate reductase: signaling network and pathways for closed to occluded transition and back. J. Mol. Biol. 374:250–266. [DOI] [PubMed] [Google Scholar]
  • 44.Bahar, I., and A. J. Rader. 2005. Coarse-grained normal mode analysis in structural biology. Curr. Opin. Struct. Biol. 15:586–592. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Hayward, S., and N. Go. 1995. Collective variable description of native protein dynamics. Annu. Rev. Phys. Chem. 46:223–250. [DOI] [PubMed] [Google Scholar]
  • 46.Ma, J. 2005. Usefulness and limitations of normal mode analysis in modeling dynamics of biomolecular complexes. Structure. 13:373–380. [DOI] [PubMed] [Google Scholar]
  • 47.Caflisch, A. 2006. Network and graph analyses of folding free energy surfaces. Curr. Opin. Struct. Biol. 16:71–78. [DOI] [PubMed] [Google Scholar]
  • 48.Schwille, P., S. Kummer, A. A. Heikal, W. E. Moerner, and W. W. Webb. 2000. Fluorescence correlation spectroscopy reveals fast optical excitation-driven intramolecular dynamics of yellow fluorescent proteins. Proc. Natl. Acad. Sci. USA. 97:151–156. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Yang, W. Y., and M. Gruebele. 2003. Folding at the speed limit. Nature. 423:193–197. [DOI] [PubMed] [Google Scholar]
  • 50.Chekmarev, S. F., S. V. Krivov, and M. Karplus. 2006. Folding of ubiquitin: a simple model describes the strange kinetics. J. Phys. Chem. B. 110:8865–8869. [DOI] [PubMed] [Google Scholar]
  • 51.Liu, F., D. Du, A. A. Fuller, J. E. Davoren, P. Wipf, J. W. Kelly, and M. Gruebele. 2008. An experimental survey of the transition between two-state and downhill protein folding scenarios. Proc. Natl. Acad. Sci. USA. 105:2369–2374. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Reference deleted in proof.
  • 53.Juraszek, J., and P. G. Bolhuis. 2006. Sampling the multiple folding mechanisms of Trp-cage in explicit solvent. Proc. Natl. Acad. Sci. USA. 103:15859–15864. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

[Supplement]
108.136697_index.html (651B, html)
108.136697_1.pdf (921.5KB, pdf)

Articles from Biophysical Journal are provided here courtesy of The Biophysical Society

RESOURCES