Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2024 Jul 10.
Published in final edited form as: Phys Rev E Stat Nonlin Soft Matter Phys. 2009 Mar 4;79(3 Pt 1):030902. doi: 10.1103/PhysRevE.79.030902

Relationship between protein folding thermodynamics and the energy landscape

Jaegil Kim 1,*, Thomas Keyes 1,, John E Straub 1,
PMCID: PMC11234522  NIHMSID: NIHMS2007318  PMID: 19391891

Abstract

The origin of protein folding thermodynamics is examined in terms of the energy landscape, employing an off-lattice protein model with scaled non-native attractions, which is continuously tunable between a Go-like model and a highly frustrated system. Extensive statistical temperature molecular dynamics simulations, combined with inherent structure analysis, reveal the intimate connection between the global geometric properties of the energy landscape and the statistical temperature. The basin depth of the energy landscape is shown to play a key role in the first-order-like characteristics of the statistical temperature, which are easily identified by the squared modulus of the potential energy gradient in the microcanonical ensemble.

PACS number(s): 87.15.Cc, 05.70.–a, 87.15.A–


Protein folding—i.e., how a polypeptide chain reaches a unique native state for a given sequence of amino acids [1]—is one of the fundamental problems in biological science. While significant advances have been made [29], fundamental open questions still remain. For example, one would like to understand the remarkable observation that most single-domain proteins exhibit a cooperative, two-state folding transition with no detectable intermediates [2].

Energy landscape theory [36] provides a theoretical framework for protein folding, in which the connection between a proteinlike sequence and a minimally frustrated energy landscape is rationalized via a folding funnel [4]. A good folding sequence is characterized by a smooth potential energy landscape (PEL), with an overall funnel shape biased to the native state, while a poorly designed sequence like a random heteropolymer has a rugged PEL susceptible to long-lived kinetic trapping [7]. Many simulation studies have been devoted to elucidate the determinants of a proteinlike system, such as the ratio of the folding and glass transition temperatures [3], the stability gap between a native state and low-lying excited states [8], and the collapse cooperativity [9].

Recently, Mazzoni and Casetti [10] proposed the fluctuations of the energy landscape curvature, KR=2U,U being the potential energy, as a determinant. A dramatic enhancement of the fluctuations, κ(T)=KR2-KR2/KR, in the vicinity of the folding temperature T=Tf was observed in the proteinlike folding of well-designed sequences, where ⟨⋅⟩ denotes an equilibrium average. The enhancement in κTf is attributed to effective two-state dynamics between the native and denatured states with different curvatures [10].

Two-state protein folding is analogous to a first-order phase transition in finite-size systems [1113]. Indeed, a recent study [14] reveals that the “backbending” (S bend) in the statistical temperature, or microcanonical caloric curve, characteristic of a first-order-like phase transition, fully characterizes protein folding and clearly distinguishes good folders from poor ones. The question naturally arises: how does the global geometry of the PEL translate into the behavior of the statistical temperature? This paper addresses the issue by establishing a rigorous connection between the statistical temperature and various energy landscape measures, leading to the identification of the principal landscape determinants of two-state folding thermodynamics.

The key quantity in our study is the statistical temperature in the microcanonical ensemble, TS(U)=[S(U)/U]-1,S(U) being the microcanonical entropy. It is well known that in a finite-size system such as a protein, the microcanonical ensemble is well suited to the analysis of phase transitions [1114]. Around a first-order phase transition, S(U) shows a convex dip, with 2S/U2>0, corresponding to backbending in TS(U) or a negative specific heat. Backbending is associated with an extremum condition in the Helemholtz free energy at Tf,FU,Tf=U-TfS(U), which has three roots Ui(i=1-3), satisfying TSUi=TfU1<U2<U3. The multiple roots are associated with two-state transition behavior between free-energy minima at U1 and U3, involving the crossing of a transition barrier at U2, while the transition becomes downhill-like without a barrier for the monotonic TS(U) Thus, all essential features of the transitions occurring in good and poor folders are completely characterized by the behavior of TS(U).

Our analysis linking TS(U) to the PEL is based on the thermodynamic identity between the statistical temperature and the configurational temperature TconF(U) originally introduced by Rugh [15] and later generalized by Jepps et al. [16]:

TSU=TconFU=U2U2UU=U𝒦U, (1)

where the subscript “conF” implies that the configurational temperature is evaluated as the fraction of two separate microcanonical averages, (U)=U2U and 𝒦(U)=2UU [16]. The significance of Eq. (1) is that it allows a rigorous mathematical connection between the purely statistical quantity TS(U) and the geometric structure of the energy landscape via the force, F=-U, and the curvature, KR=2U.

All-atom simulations considering the protein and explicit solvent are the most accurate, but still computationally demanding. Thus a simplified description, retaining the main features of real proteins, but reducing the number of degrees of freedom through a coarse graining, is especially appealing for the characterization of protein-folding mechanisms over long time scales. The newly suggested folding determinants [10,14] have been examined in coarse-grained models, with varied sequences exhibiting a different foldability.

Here we employ the scaled Honeycutt-Thirumalai (SHT) off-lattice model [17] to mimic the effect of sequence variations and to examine how two-state folding results from the global shape of the PEL and the emerging backbending behavior of TS(U). The original HT model [18] has a β-barrel native state and consists of 46 beads with three different types: hydrophobic (B), hydrophilc (L), and neutral (N). We modified the model to incorporate scaled, Go-like interactions by modulating the attractions between non-native BB contacts, which allows, for the original HT sequence B9N3(LB)4N3B9N3(LB)5L, a continuous tuning from a highly frustrated PEL to a weakly rugged funnel. The potential energy is a sum of bond stretching UB, bending UA, torsion UT), and nonbonded interactions (UNB,λ):

Ur;λ=UB+UA+UT+UNB,λ, (2)

where UB=Σikrri-re2,UA=Σikθθi-θe2, and UT=ΣiAi[1+cosϕi+Bi1+cos3ϕi. The stretching and bending constants are kr=200ϵ/σ2 and kθ=10ϵ/rad2, respectively, ϵ and σ being energy and length units, with re=σ and θe=105. Torsion parameters are Ai=Bi=1.2 for a quartet of successive beads with zero or one N bead, while otherwise Ai=0 and Bi=0.2.

The nonbonded term in Eq. (2) contains the scaling factor λ(0λ1) to modulate the non-native attractions as UNB,λ=Σi<j-3Uij+(λ-1)Uijδ(i,α)δ(j,α),Uij=4ϵS1σ/rij12-S2σ/rij6, where S1=S2=1 for BB pairs, S1=2/3 and S2=-1 for LL and LB pairs, and S1=1 and S2=0 for all other pairs involving N. Here δ(i,α)=1 if α=B and bead i belongs to the non-native contact set, and otherwise δ(i,α)=0. As λ approaches zero, the second term in UNB,λ introduces a strong native bias in denatured states by prohibiting close contacts between non-native BB pairs. All 57 native contact pairs are identified with the cutoff 1.3σ, based on the global minimum structure of the original HT model (U=-49.2635).

Due to the high degree of frustration in the original HT model [18,19], an enhanced sampling technique is indispensable to obtain a correct conformational sampling for both the transition and low-temperature regions, overcoming ergodicity breaking. We applied the statistical temperature molecular dynamics (STMD) algorithm [19], which generates a flat energy sampling via the dynamic refinement of the statistical temperature (for details see Ref. [19]) with four different scalings at λ=1.0,0.9,0.5, and 0. All STMD simulations have been performed to sample the temperature range from 0.07 to 1.3, with the average kinetic energy fixed at T0=1.3. A total of 8 × 104 configurations for each simulation were collected to determine the canonical and microcanonical thermodynamics, and minimized using the conjugate-gradient algorithm to map inherent structures (ISs) [20].

As summarized in Table I, the folding Tf and collapse Tθ temperatures, identified by peaks in the T-dependent heat capacities and order parameter fluctuations in Figs. 1(a) and 1(b), respectively, illustrate that the original HT model (SHT with λ=1) is a poor folder and the SHT model with λ=0.9 is an intermediate folder, while λ=0.5 and 0 correspond to good folders according to the classification based on the collapse cooperativity, σ=Tθ-Tf/Tθ [9]. The validity of this classification is also seen in the profiles of the curvature fluctuations κ(T) in Fig. 1(c). A marked peak shows up close to Tf for good folders, while the plots are featureless in poor and intermediate folders [10]. Note that the temperature Tκ, corresponding to the maximum curvature fluctuation in Fig. 1(c), coincides with Tf for good folders, implying that κ(T) is also an effective folding indicator.

TABLE I.

Characteristic temperatures of folding Tf, collapse Tθ, and curvature fluctuation Tκ and the collapse cooperativity σ at different λ=1.0,0.9,0.5, and 0.

Scaling parameter λ Tf Tθ Tκ σ=Tθ-Tf/Tθ
1.0 0.19 0.67 0.68 0.71
0.9 0.49 0.65 0.64 0.25
0.5 0.61 0.67 0.62 0.09
0.0 0.59 0.62 0.60 0.05

FIG. 1.

FIG. 1.

(Color online) (a) Specific heats Cv(T), (b) order parameter fluctuations χ(T), and (c) curvature fluctuations κ(T) at different λ=1.0,0.9,0.5, and 0 from bottom to top in (a) at T=0.6.

The statistical temperature TS(U), determined by STMD in Fig. 2, shows a dramatic change as the energy landscape changes with decreasing λ. The smooth, monotonically varying TS(U) in the original HT model (λ=1) first starts to show a plateau, or weak backbending, and a substantial backbending appears as λ falls below 0.5. Remarkably, the temperatures corresponding to the plateau and backbending regions in TS(U) with λ=0 exactly match Tf and Tθ, respectively. The alteration of the folding mechanism from second-order-like collapse (λ=1) into first-order-like folding (λ=0) is clearly seen through the behavior of TS(U).

FIG. 2.

FIG. 2.

(Color online) Statistical temperatures TS(U) (solid lines) and configurational temperatures TconF(U) (squares and circles) with different λ=1.0,0.9,0.5, and 0 from top to bottom at U=40. Note that horizontal lines at Tθ=0.62 and Tf=0.59 match the backbending and plateau regions of TS(U) at λ=0, respectively.

We find that the folding in Go-like models occurs through the transient formation of a three-strand core near Tθ and the immediate attachment of the terminal strand to attain the native state at TfTθ. A significant slowing down is observed in the folding of the original HT model with TfTθ due to trapping in misfolded states. Since U is the external control parameter in the microcanonical ensemble (playing the role of the temperature in the canonical ensemble), we also identify the characteristic energies to describe the folding transition as a function of U, in the case of λ=0. In Fig. 2, both Uf=10 and Um=20 approximately define the plateau region in TS(U), and both Um and Uθ=50 bound the collapse region associated with the backbending of TS(U), TSUθ=0.65. The good folder with λ=0 starts to collapse as U crosses Uθ, and completes its folding once U passes through Uf.

Assuming that STMD samples all configurations with a given U, the microcanonical average in Eq. (1) is directly computed by binning in energy and collecting configurations corresponding to each bin. The configurational temperature TconF(U) with unit bin size shows good agreement with TS(U) in both λ=1 and 0, illustrating that the thermodynamic identity, Eq. (1), is robust even with a smaller system size.

To illustrate how the PEL controls TS(U), and hence the folding, via Eq. (1), both 𝒦(U) and (U) are plotted in Figs. 3(a) and 3(b), respectively. For all scalings, the curvature 𝒦(U) monotonically increases with decreasing U, showing that folded (denatured) states are associated with steeper (shallow) potential wells. The primary feature differentiating good from poor folders in Fig. 3(a) is that good folders have nearly constant curvature for U>Uθ, suggesting that their denatured states are structurally homogeneous. This finding is in accordance with our expectations for Go-like interactions. The main frustration of the original HT model arises from the conformational diversity of the collapsed states [18]. The native-centric UNB,λ in Go-like models significantly suppresses the frustration from hydrophobic mismatches of β strands during the collapse by reducing non-native interstrand BB contacts and also makes the unfolded landscape less structured. With decreasing U in good folders, the flat 𝒦(U) begins to increase steeply between Uf and Uθ, the backbending and plateau regions of TS(U), contributing to the large curvature fluctuation κTf and to the bimodal energy distributions [10]. By contrast, in poor folders the slope of 𝒦(U) is nearly constant, causing no structure in TS(U) and corresponding to the fact that all systems studied have nearly identical 𝒦(U) at the highest and lowest U.

FIG. 3.

FIG. 3.

(Color online) Microcanonical average of (a) 𝒦(U) and (b) (U) with different λ=1.0,0.9,0.5, and 0 from top to bottom around U=40.

Turning to (U), the most remarkable aspect of Fig. 3(b) is that (U) retains all essential features of TS(U). Despite the discussion in the last paragraph, the dynamic range of 𝒦(U) is small and the variation of TS(U) is mostly encoded into (U). What is the underlying physics relating (U) to TS(U)? Some insight is gained by noting that (U) is a measure of the geometric distance between the sampled configurations at U and their associated ISs [20]. Indeed, the microcanonical basin depth of the PEL, quantified as 𝒟(U)=U-UIS(U),UIS(U)=UISU being the average IS energy, shows a remarkable resemblance to TS(U) in Fig. 4(a). More quantitatively, in a harmonic approximation, the canonical basin depth obeys 𝒟harT(T)=NfT/2,Nf being the number of degrees of freedom. Replacing the temperature T by TS(U) gives 𝒟harT(U)Nf/2TS(U)=Nf/2(U)𝒦(U), which unveils a direct link translating the global geometry of the PEL into the proteinlike folding markers of TS(U). We checked the validity of the harmonic approximation by comparing 𝒟(U) and its harmonic counterparts, 𝒟harT(U) and 𝒟har(U)=12UδrU [21]. As shown in Fig. 4(b), both independently derived DharT(U) and 𝒟har(U) display the essential characteristics of 𝒟(U).

FIG. 4.

FIG. 4.

(Color online) (a) Basin depth 𝒟(U), (b) harmonic basin depths 𝒟har(U) (solid circles) and DharT(U) (triangles) at λ=0, and (c) average IS energy UIS(U). λ=1.0,0.9,0.5,0 from top to bottom in (a).

The basin depth of the PEL as a function of U is most directly seen in the profile of UIS(U) in Fig. 4(c). A sharp drop in UIS(U), to a constant value, occurs for good folders between Uθ and Uf. This means that the system quickly reaches the native state basin as U crosses Uf, while UIS(U) shows a slow decay due to residual frustrations among collapsed states in poor and intermediate folders. The plateau regions in UIS(U) for U>Uθ reflect the homogeneous inherent structures in characteristic of the denatured states of good folders as observed in 𝒦(U) in Fig. 3(a). Our analysis suggests that the Go-like model has a near-Levinthal-like landscape [22], in which the denatured state landscape is gently slanted, while the folded state landscape has a steep funnel to the native state, resulting from the native-centric interactions stabilizing the native state while homogenizing the denatured states.

In summary, we have shown that a sharp variation in the basin depth, 𝒟(U), from the denatured states to the native state, gives rise to the first-order-like folding in good folders. The basin depth is reflected in the statistical temperature TS(U) through the relations TS(U)=(U)/𝒦(U)2/Nf𝒟(U). Two-state protein folding is associated with marked transitions of the potential energy gradient (U), the average IS energy UIS(U), and the curvature 𝒦(U).

Acknowledgments

We are grateful to the National Science Foundation (Grant No. CHE-0750309) and the National Institutes of Health (Grant No. RO1 GM076688) for the generous support of our research.

References

RESOURCES