Abstract
We propose a new way to characterize protein folding transition states by (1) insertion of one or more residues into an unstructured protein loop, (2) measurement of the effect on protein folding kinetics and thermodynamics, and (3) analysis of the results in terms of a rate-equilibrium free energy relationship, αLoop. αLoop reports on the fraction of molecules that form the perturbed loop in the transition state. Interpretation of the changes in equilibrium free energy using standard polymer theory can help detect residual structure in the unfolded state. We illustrate our approach with data for the model proteins CI2 and the alpha spectrin SH3 domain.
Keywords: protein folding, transition state, loop, rate-equilibrium free energy relationships, kinetics
Protein folding transition states are transient ensembles of conformations that cannot be isolated for their study. Their structure must be characterized by indirect means, commonly rate-equilibrium free energy relationships (REFERs) (Leffler 1952; Sánchez and Kiefhaber 2003c). Briefly, we apply a perturbation ∂x to a protein folding reaction and measure the changes in the activation free energy for folding, , and the equilibrium free energy, ΔG 0. In a two-state folding reaction, these changes are linearly related and a proportionality constant, , can be defined. We can obtain global information about the transition state using medium-induced REFERs and local information using structure-induced REFERs (Sánchez and Kiefhaber 2003c). Local structural perturbations used to date include mutation of residue side chains (ϕ-values) (Fersht et al. 1992), chemical modification of the protein backbone (Deechongkit et al. 2004), and engineering of metal binding sites (ψ-values) (Krantz and Sosnick 2001). Commonly, the transition state for folding of a protein is characterized at near-atomic resolution by measuring a large number of φ-, ψ-, or αBackbone-values distributed throughout the structure. The φ-, ψ-, and αBackbone-values for a given region of the protein are sometimes different (Deechongkit et al. 2004; Sosnick et al. 2004; Pandit et al. 2006), and there is currently no consensus on how to integrate them into a unified description of the transition state (Sosnick et al. 2004; Pandit et al. 2006). This puzzle prompts for the development of additional ways to characterize protein folding transition states.
Results and Discussion
Characterization of protein folding transition states
We propose a new structure-induced REFER. Let us consider the simple, two-state folding reaction of a protein that has an unstructured loop. The unstructured loop may be present in the wild-type protein or created by inserting two or more glycine residues into a structured loop. We can then insert one or more extra residues into the already unstructured loop using molecular biology techniques and measure the effect on the thermodynamics and kinetics of folding. This perturbation can be understood as a loop extension-induced REFER:
We can give a physical interpretation to αLoop using general concepts from polymer theory. The end-to-end distance of the chosen unstructured loop is strongly constrained in the native state and, in the absence of residual structure, unconstrained in the unfolded state. In this case, folding of the loop from the unfolded to the native state should have a free energy cost of purely entropic origin. Extension of the loop will increase this free energy cost because of a larger entropic penalty in the native state and an unchanged (null) entropic penalty in the unfolded state (Jacobson and Stockmeier 1950; Zhou 2004). This will lead to a change in ΔG 0. If the loop is constrained into a native-like conformation in the transition state, the change in will be the same as the change in ΔG 0, and the αLoop will be 1. If the loop is as unconstrained in the transition state as in the unfolded state, the change in will be 0 and the αLoop = 0.
Many φ-, ψ-, and αBackbone-values measured to date are fractional (Goldenberg 1999; Krantz and Sosnick 2001; Sánchez and Kiefhaber 2003d; Deechongkit et al. 2004; Krantz et al. 2004; Sosnick et al. 2004; Pandit et al. 2006). Unfortunately, their structural interpretation is ambiguous. They may arise from conformational heterogeneity in the transition state, from weakened interactions, or both (Fersht et al. 1992; Bodenreider and Kiefhaber 2005). To a first approximation, the change in free energy caused by extension of an unstructured loop that does not interact with the rest of the protein is independent of other interactions. In this case, the αLoop can be read as the fraction of molecules in which the perturbed loop is constrained in the transition state in a native-like manner. However, nonnative-like conformations of the loop in the transition state are also possible. For example, partial opening of the loop neck may change the stiffness of the loop and its response to extension. Alternatively, additional residues may become unstructured and be incorporated into the loop. We can approximate the change in free energy for the addition of n residues to an unstructured loop of N residues as ΔΔG 0 = 1.5 · RT · ln(1 + n/N) (Jacobson and Stockmeier 1950). It follows that an extended nonnative-like loop in the transition state will have smaller values of and of the αLoop than a native-like one. In this case, the αLoop gives a lower limit for the fraction of loop-forming molecules (native-like or nonnative-like) in the transition state.
Several protein folding reactions have been perturbed by the extension of an unstructured protein loop. We will use these data to illustrate the use of the αLoop. Figure 1A shows the effect of loop extension in the equilibrium and activation free energies for folding of CI2 and the S19P20 and N47D48 circular permutants of the spectrin SH3 domain (Ladurner and Fersht 1997; Viguera and Serrano 1997). The data for the three proteins are well described by a linear fit (R = 0.97 or higher), suggesting that the rate-limiting transition state is not altered by extension of the unstructured loop (Sánchez and Kiefhaber 2003b,a). The slope for each protein, i.e., the αLoop-value, varies between 0.45 and 0.64 (Table 1). The fractional αLoop-values indicate that the perturbed loops are formed in ∼45%–64% of the molecules in the transition states for folding of these proteins. The φ-values probing side chain interactions across the extended loop are significantly higher than zero in all three proteins (Itzhaki et al. 1995; Viguera et al. 1996), in agreement with our results.
Table 1.
Detection of residual structure in the unfolded state
All REFERs take the free energy (and, consequently, the structure) of the unfolded state as a reference (Fersht et al. 1992). Many small proteins show substantial residual structure in the unfolded state (Sánchez and Kiefhaber 2003b), which puts a cautionary note to the interpretation of structure-induced REFERs (Cho and Raleigh 2006). Quantification of residual structure at each site probed by a φ-, ψ-, or αBackbone-value is time consuming and rarely carried out. The equilibrium data collected in loop extension experiments can be used to detect residual structure in the unfolded state as follows: If a fraction of the protein molecules form the perturbed loop in the unfolded state, they will not pay an entropic penalty upon going from the unfolded to the native state and will not contribute to ΔΔG 0. The observed ΔΔG 0 will be reduced relative to the one predicted for a fully unfolded state in an amount proportional to the population of the loop. A plot of observed ΔΔG 0-values versus the ones predicted from polymer theory (Jacobson and Stockmeier 1950) will be linear, with a slope that can be read as the fraction of molecules that do not form a loop in the unfolded state.
Figure 1B shows the observed and predicted ΔΔG 0-values for CI2, SH3S19P20, and SH3N47D48 (Ladurner and Fersht 1997; Viguera and Serrano 1997). The data for the three proteins are well described by a linear fit (R = 0.98 or higher). The slope for CI2 is ∼1, indicating that the two halves of the protein separated by the extended loop, CI21–40 and CI241–64, are not in contact in the unfolded state. NMR experiments indeed show that residual structure in the unfolded state is confined to CI21–40 (Kazmirski et al. 2001). The slope of such a plot for a series of loop extension variants of the Rop protein (Nagi and Regan 1997) is also very close to 1 (Zhou 2004). On the other hand, the slopes for SH3S19P20 and SH3N47D48 are smaller than 1 (Table 1), suggesting the existence of a population with significant residual structure in the unfolded states of both proteins. NMR experiments found extensive residual structure in the unfolded state of the spectrin SH3 domain (Kortemme et al. 2000). This structure involves residues at both sides of the perturbed loop, in agreement with our results.
Applicability
In order to yield a meaningful αLoop-value, the following conditions must be fulfilled: (1) Creation of an unstructured loop by the insertion of several glycine residues should not change the folding mechanism and (2) should not destabilize the protein drastically, so that the extension variants can be characterized. (3) The inserted sequence should be devoid of interactions with itself or the rest of the protein in the unfolded, native, or transition states. (4) The ΔΔG 0 caused by loop extension should be of entropic origin. The available experimental evidence for creation (Sondek and Shortle 1990; Ladurner and Fersht 1997; Viguera and Serrano 1997; Gordon-Smith et al. 2001) and extension (Martinez et al. 1999) of unstructured loops in the proteins CI2, SH3S19P20, SH3N47D48, and staphylococcal nuclease suggests that these conditions are often met, and that insertion of polyglycine or polyglutamine sequences into most protein loops will yield a useful αLoop-value.
We have shown that loop extension-induced REFERs can give information about loop formation in protein folding transition states. αLoop is an approximation for the fraction of loop-forming molecules in the transition state, a piece of information that can be directly compared with simulation. Additionally, interpretation of the results using standard polymer theory can help detect residual structure in the unfolded state. Since only a handful of loops in a protein will be suitable for extension experiments, αLoop-values should be used in combination with other medium- and structure-induced REFERs in order to reach a unified description of the transition state.
Acknowledgments
I am the recipient of an A.E.C.I. fellowship (Spanish Agency for International Cooperation). I would like to acknowledge Gonzalo de Prat-Gay for his mentorship and Diana Wetzler and Mariano Dellarole for comments on the manuscript.
Footnotes
Reprint requests to: Ignacio Enrique Sánchez, Fundación Instituto Leloir, Avenida Patricias Argentinas, Buenos Aires, Argentina; e-mail: isanchez@leloir.org.ar; fax: 54-11-52387501.
Article and publication are at http://www.proteinscience.org/cgi/doi/10.1110/ps.073217708.
References
- Bodenreider, C., Kiefhaber, T. Interpretation of protein folding ψ values. J. Mol. Biol. 2005;351:393–401. doi: 10.1016/j.jmb.2005.05.062. [DOI] [PubMed] [Google Scholar]
- Cho, J.H., Raleigh, D.P. Denatured state effects and the origin of nonclassical φ values in protein folding. J. Am. Chem. Soc. 2006;128:16492–16493. doi: 10.1021/ja0669878. [DOI] [PubMed] [Google Scholar]
- Deechongkit, S., Nguyen, H., Powers, E.T., Dawson, P.E., Gruebele, M., Kelly, J.W. Context-dependent contributions of backbone hydrogen bonding to β-sheet folding energetics. Nature. 2004;430:101–105. doi: 10.1038/nature02611. [DOI] [PubMed] [Google Scholar]
- Fersht, A.R., Matouschek, A., Serrano, L. The folding of an enzyme. I. Theory of protein engineering analysis of stability and pathway of protein folding. J. Mol. Biol. 1992;224:771–782. doi: 10.1016/0022-2836(92)90561-w. [DOI] [PubMed] [Google Scholar]
- Goldenberg, D.P. Finding the right fold. Nat. Struct. Biol. 1999;6:987–990. doi: 10.1038/14866. [DOI] [PubMed] [Google Scholar]
- Gordon-Smith, D.J., Carbajo, R.J., Stott, K., Neuhaus, D. Solution studies of chymotrypsin inhibitor-2 glutamine insertion mutants show no interglutamine interactions. Biochem. Biophys. Res. Commun. 2001;280:855–860. doi: 10.1006/bbrc.2000.4196. [DOI] [PubMed] [Google Scholar]
- Itzhaki, L.S., Otzen, D.E., Fersht, A.R. The structure of the transition state for folding of chymotrypsin inhibitor 2 analysed by protein engineering methods: Evidence for a nucleation-condensation mechanism for protein folding. J. Mol. Biol. 1995;254:260–288. doi: 10.1006/jmbi.1995.0616. [DOI] [PubMed] [Google Scholar]
- Jacobson, H., Stockmeier, W.H. Intramolecular reaction in polycondensations. I. The theory of linear systems. J. Chem. Phys. 1950;18:1600–1606. [Google Scholar]
- Kazmirski, S.L., Wong, K.B., Freund, S.M., Tan, Y.J., Fersht, A.R., Daggett, V. Protein folding from a highly disordered denatured state: The folding pathway of chymotrypsin inhibitor 2 at atomic resolution. Proc. Natl. Acad. Sci. 2001;98:4349–4354. doi: 10.1073/pnas.071054398. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kortemme, T., Kelly, M.J., Kay, L.E., Forman-Kay, J., Serrano, L. Similarities between the spectrin SH3 domain denatured state and its folding transition state. J. Mol. Biol. 2000;297:1217–1229. doi: 10.1006/jmbi.2000.3618. [DOI] [PubMed] [Google Scholar]
- Krantz, B.A., Sosnick, T.R. Engineered metal binding sites map the heterogeneous folding landscape of a coiled coil. Nat. Struct. Biol. 2001;8:1042–1047. doi: 10.1038/nsb723. [DOI] [PubMed] [Google Scholar]
- Krantz, B.A., Dothager, R.S., Sosnick, T.R. Discerning the structure and energy of multiple transition states in protein folding using ψ-analysis. J. Mol. Biol. 2004;337:463–475. doi: 10.1016/j.jmb.2004.01.018. [DOI] [PubMed] [Google Scholar]
- Ladurner, A.G., Fersht, A.R. Glutamine, alanine or glycine repeats inserted into the loop of a protein have minimal effects on stability and folding rates. J. Mol. Biol. 1997;273:330–337. doi: 10.1006/jmbi.1997.1304. [DOI] [PubMed] [Google Scholar]
- Leffler, J.E. Parameters for the description of transition states. Science. 1952;117:340–341. doi: 10.1126/science.117.3039.340. [DOI] [PubMed] [Google Scholar]
- Martinez, J.C., Viguera, A.R., Berisio, R., Wilmanns, M., Mateo, P.L., Filimonov, V.V., Serrano, L. Thermodynamic analysis of α-spectrin SH3 and two of its circular permutants with different loop lengths: Discerning the reasons for rapid folding in proteins. Biochemistry. 1999;38:549–559. doi: 10.1021/bi981515u. [DOI] [PubMed] [Google Scholar]
- Nagi, A.D., Regan, L. An inverse correlation between loop length and stability in a four-helix-bundle protein. Fold. Des. 1997;2:67–75. doi: 10.1016/S1359-0278(97)00007-2. [DOI] [PubMed] [Google Scholar]
- Pandit, A.D., Jha, A., Freed, K.F., Sosnick, T.R. Small proteins fold through transition states with native-like topologies. J. Mol. Biol. 2006;361:755–770. doi: 10.1016/j.jmb.2006.06.041. [DOI] [PubMed] [Google Scholar]
- Sánchez, I.E., Kiefhaber, T. Evidence for sequential barriers and obligatory intermediates in apparent two-state protein folding. J. Mol. Biol. 2003a;325:367–376. doi: 10.1016/s0022-2836(02)01230-5. [DOI] [PubMed] [Google Scholar]
- Sánchez, I.E., Kiefhaber, T. Hammond behavior versus ground state effects in protein folding: Evidence for narrow free energy barriers and residual structure in unfolded states. J. Mol. Biol. 2003b;327:867–884. doi: 10.1016/s0022-2836(03)00171-2. [DOI] [PubMed] [Google Scholar]
- Sánchez, I.E., Kiefhaber, T. Non-linear rate-equilibrium free energy relationships and Hammond behavior in protein folding. Biophys. Chem. 2003c;100:397–407. doi: 10.1016/s0301-4622(02)00294-6. [DOI] [PubMed] [Google Scholar]
- Sánchez, I.E., Kiefhaber, T. Origin of unusual φ-values in protein folding: Evidence against specific nucleation sites. J. Mol. Biol. 2003d;334:1077–1085. doi: 10.1016/j.jmb.2003.10.016. [DOI] [PubMed] [Google Scholar]
- Sondek, J., Shortle, D. Accommodation of single amino acid insertions by the native state of staphylococcal nuclease. Proteins. 1990;7:299–305. doi: 10.1002/prot.340070402. [DOI] [PubMed] [Google Scholar]
- Sosnick, T.R., Dothager, R.S., Krantz, B.A. Differences in the folding transition state of ubiquitin indicated by φ and ψ analyses. Proc. Natl. Acad. Sci. 2004;101:17377–17382. doi: 10.1073/pnas.0407683101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Viguera, A.R., Serrano, L. Loop length, intramolecular diffusion and protein folding. Nat. Struct. Biol. 1997;4:939–946. doi: 10.1038/nsb1197-939. [DOI] [PubMed] [Google Scholar]
- Viguera, A.R., Serrano, L., Wilmanns, M. Different folding transition states may result in the same native structure. Nat. Struct. Biol. 1996;3:874–880. doi: 10.1038/nsb1096-874. [DOI] [PubMed] [Google Scholar]
- Zhou, H.X. Loops, linkages, rings, catenanes, cages, and crowders: Entropy-based strategies for stabilizing proteins. Acc. Chem. Res. 2004;37:123–130. doi: 10.1021/ar0302282. [DOI] [PubMed] [Google Scholar]