Skip to main content
Elsevier Sponsored Documents logoLink to Elsevier Sponsored Documents
. 2013 Feb;23(1):66–74. doi: 10.1016/j.sbi.2012.11.009

Take home lessons from studies of related proteins

Adrian A Nickson 1, Beth G Wensley 1,*, Jane Clarke 1
PMCID: PMC3578095  PMID: 23265640

Highlights

► We review the recent advances made from the study of related proteins. ► We relate pathway malleability to the balance between foldons and helical propensity. ► We speculate why different topologies respond differently to mutation. ► We discuss the role of kinetic intermediates in folding pathways. ► We explain why it is important to study several members from each protein fold.

Abstract

The ‘Fold Approach’ involves a detailed analysis of the folding of several topologically, structurally and/or evolutionarily related proteins. Such studies can reveal determinants of the folding mechanism beyond the gross topology, and can dissect the residues required for folding from those required for stability or function. While this approach has not yet matured to the point where we can predict the native conformation of any polypeptide chain in silico, it has been able to highlight, amongst others, the specific residues that are responsible for nucleation, pathway malleability, kinetic intermediates, chain knotting, internal friction and Paracelsus switches. Some of the most interesting discoveries have resulted from the attempt to explain differences between homologues.


Current Opinion in Structural Biology 2013, 23:66–74

This review comes from a themed issue on Folding and binding

Edited by Jayant Udgaonkar and Susan Marqusee

For a complete overview see the Issue and the Editorial

Available online 20th December 2012

0959-440X/$ – see front matter, © 2012 Elsevier Ltd. All rights reserved.

http://dx.doi.org/10.1016/j.sbi.2012.11.009

Introduction

In the fifty years since the protein-folding field was first established, there have been thousands of papers detailing the thermodynamic or kinetic characterization of hundreds of different proteins. One particularly useful approach is ‘The Fold Approach’ [1], which involves a detailed analysis of the folding of several topologically, structurally and/or evolutionarily related proteins in order to discern patterns and trends in folding (stability, pathways and mechanisms).

In this manuscript, we describe a number of studies that highlight how comparisons within and between related protein families have affected our understanding of protein folding. This article builds on our recent review [2] incorporating significant results from the last few years. Here, we focus on the folding of isolated domains and do not discuss multidomain proteins, misfolding or aggregation.

The malleability of protein folding pathways

A unifying folding mechanism

In the early days of the ‘protein-folding problem’, three competing mechanisms were proposed that described how a polypeptide chain might fold to the native state: nucleation [3], hydrophobic-collapse [4] and diffusion-collision (framework) [5]. However, an early Φ-value analysis of the small protein chymotrypsin inhibitor 2 (CI2) demonstrated that none of these mechanisms was appropriate, since secondary and tertiary structure formed concomitantly [6]. Thus the nucleation-condensation mechanism was introduced [7], in which long-range contacts set up the initial topology of the protein (incurring a substantial entropic loss with minimal enthalpic gain), followed by a rapid collapse to the native state (with minimal entropic loss but substantial enthalpic gain). Under these conditions, the transition state is usually an expanded form of the native state [8], which helps to explain the strong correlation between native topological complexity (Contact Order) and folding rates, as noted by Plaxco and Baker in the late 1990s [9].

Although the nucleation-condensation mechanism is observed to be widely applicable, several proteins have been shown to fold in a more hierarchical manner. In particular, the engrailed homeodomain (En-HD) was seen to fold via a classical framework mechanism [10]. To investigate whether this result was owing to the simple architecture of the protein, Fersht and co-workers studied four other members of the homeodomain-like superfamily: c-Myb, hRAP1, Pit1 and hTRF1. They observed a slide in mechanism a slide from hTRF1 (pure nucleation-condensation) to En-HD (pure framework) through c-Myb, hRAP1 and Pit1 (mixed mechanisms), which correlated with the innate secondary structural propensity of each domain [11,12]. The authors used this result to conclude that nucleation-condensation and diffusion-collision are thus “different manifestations of a common unifying mechanism” for protein folding. This variation is not unique, and a continuum of mechanisms has also been seen for different members of the PSBD superfamily, where it is again linked to secondary structural propensity [13].

The foldon concept

Further reconciliations between apparently different folding pathways have also been proposed using the concept of ‘foldons’. This term was initially used to describe the C-terminal domain of bacteriophage T4 fibritin [14], but was quickly adopted by Wolynes and co-workers to describe independently folding units of a protein chain [15]. Although originally referring solely to contiguous regions of polypeptide sequence, Englander [16] and Oliveberg [17,18] redefined the term ‘foldon’ to describe any kinetically competent submotif within a protein (i.e. any subset of residues that can fold cooperatively to a defined structural state).

Perhaps the most successful application of the foldon hypothesis comes from studies of the ferredoxin-like family of proteins including U1A and the small ribosomal protein S6 from Thermus thermophilus (S6T). Here, Oliveberg and co-workers observed that, while the wild-type S6T protein folded through a globally diffuse transition state that typified nucleation-condensation, a circular permutant (with conjoined wild-type termini and a different backbone cleavage site) exhibited an extremely polarized transition state [19]. Moreover, two alternate circular permutants demonstrated that entropy mutations could be used to shift the position of the nucleus within the topology of the S6T protein [20]. This finding was particularly interesting, since it reconciled the folding of S6T and U1A with that of S6A and ADA2h: two other homologous ferredoxin-like proteins that appeared to fold through a different pathway (although still by nucleation condensation). Oliveberg explained these results by suggesting that all ferredoxin-like proteins comprise two overlapping foldons, but that the specific folding pathway is determined by the primary sequence of each domain [18].

It is, perhaps, easiest to compare these foldons to tandem repeat proteins. In these proteins, each repeat is unstable in isolation – and yet each repeat has a defined native structure to which it will fold [21,22]. Interactions between these repeats can provide sufficient stabilization to produce a globally stable native state, and a cooperatively folding protein [23]. In the same way, isolated foldons are unstable – but the combination of several foldons will lead to a stable, structured protein domain. In the ankyrin repeat protein myotrophin, it is the C-terminal repeat that is most stable (least unstable) in isolation, and hence folding begins in this region of the protein. However, when this repeat is destabilized by mutation, it is now the N-terminal repeat that is most stable, and the protein will fold from the opposite end over a different pathway [24], similar to that of Internalin B [25]. A similar rerouting of the folding pathway has also been achieved by mutations in the Notch ankyrin domain [26]. In an analogous manner, the folding of the ferredoxin-like proteins is controlled by which of the two component foldons is the most stable (least unstable), hence the differences in transition state structure between U1A/S6T and S6A/ADA2h [18].

How do folding pathways respond to sequence changes?

Both experiment [27] and theory [28] suggest that the protein-folding nucleus can be subdivided into two distinct sections (Figure 1). The obligate nucleus comprises those few interactions that commit the polypeptide chain to fold to the correct native state topology. Such residues pack early, (with high Φ-values), and incur a substantial entropy cost with little enthalpic gain. They are surrounded by the critical nucleus, which is a shell of additional interactions that are necessary to turn the free-energy profile downhill (i.e. additional interactions that are accumulated up to the global transition state). These interactions are more plastic, and each folding event may use a different subset of residues within the critical nucleus to effect a barrier crossing. The foldon idea can be combined with that of the obligate and critical folding nucleus to explain the many types of pathway malleability: this is described in Figure 2, and exemplified by members of the immunoglobulin-like (Ig-like) fold.

Figure 1.

Figure 1

Folding by nucleation condensation: the key elements of the folding nucleus.

The folding nucleus can be subdivided into the obligate nucleus (dark blue) and the critical nucleus (cyan) [27,28]. The obligate nucleus brings together those elements of secondary structure that are required to set up the native protein topology. Interactions between what have been called the ‘key residues’ [88] form early, and are associated with a high entropy cost and little enthalpic gain. The critical nucleus forms a shell around the obligate nucleus, and provides sufficient extra interactions to turn the free-energy profile downhill, (lower entropic cost, larger enthalpic gain). These interactions are more plastic, and only a subset of these interactions may be required to complete the folding nucleus.

Figure 2.

Figure 2

How folding mechanisms or pathways might change when the sequence of a protein changes.

Top: Protein folding has been described as occurring by a sliding mechanism between a framework mechanism, F (5), and nucleation condensation, NC [7]. (F1) If the secondary structure (helical) propensity of the protein is high (dark grey) then secondary structure formation may precede the formation of a tertiary folding nucleus and the protein folds through the framework mechanism. If the secondary structure weakens then a nucleation-condensation mechanism may become more favourable. (F2) If the secondary structure propensity is weak (light grey), but there is no strong nucleus, the protein may still fold by a framework-like, diffusion-collision mechanisms, where folding proceeds through collision of partly formed secondary structure elements. Changes in sequence may lead to stronger, earlier formation of secondary structure, or a move to nucleation condensation. Bottom: Within nucleation condensation (NC) mechanisms there may be shifts in the folding nucleus. The malleability of a protein-folding pathway is determined by its component foldons and by redundancy in the nucleating residues. The obligate nucleus is shown in blue and the critical nucleus is shown in cyan. (a) Where a protein contains only one potential set of nucleating residues, the folding pathway is robust. Such proteins can be described as ‘ideal’ two state folders, and exhibit V-shaped chevron plots with a single free-energy barrier. Mutation of the nucleating residues will not change the structure of the transition state, but may result in a protein that cannot fold. (b and c) If the obligate nucleus is surrounded by many favourable interactions, then a detrimental mutation within the critical nucleus can lead to the recruitment of other interactions to compensate. This will result either in expansion of the critical nucleus, b, or a shift in the position of the nucleus, c. Such mutations can lead to Hammond effects. (d and e) If a protein can use degenerate residues to set up its native state topology, then mutations within the obligate nucleus can lead to minor shifts in both the obligate nucleus and the critical nucleus; however, if the topology provides alternate foldons, then disruption of the obligate nucleus may result in a complete shift in the position of the folding nucleus. These latter shifts are often linked to anti-Hammond behaviour. Alternatively, in the absence of an alternative set of nucleating residues, destruction of the folding nucleus may lead to a protein that can only fold when transient secondary structure is stabilized by long-range tertiary interactions (F2). Such a protein would be said to fold through the diffusion-collision mechanism.

When considering the folding of related proteins, perhaps the most thoroughly studied fold is that of the Ig-like domains. These all-β proteins have a complex Greek-key architecture, and are extremely common in eukaryotes with over 40 000 distinct domains identified to date [29]. They were chosen for study because, despite their complex topology, there is low sequence identity within each superfamily – and virtually no sequence identity between different superfamilies. Early studies on fibronectin type III (fnIII) domains (TNfn3 and FNfn10) revealed the presence of four key hydrophobic residues in the B, C, E and F strands that constituted the obligate nucleus: interactions of these residues was necessary, but sufficient, to set up the correct topology of the protein [30–32]. Interestingly, the size of the critical nucleus was very different in these two proteins – it is far more extensive in FNfn10 than in TNfn3 (Figure 2B). Moreover, in FNfn10, a few mutations resulted in a small change in the unfolding m-value that could indicate a shift in the critical nucleus (Figure 2C). Most importantly, the obligate nucleus of the evolutionarily unrelated Ig domain titin I27 comprised residues that were structurally equivalent to those in the fnIII domains [33]. Thus, these proteins share an obligate nucleus, which is required to set up the correct topology of these complex Greek-key domains and allow folding to proceed. Indeed, the hydrophobic residues of this obligate nucleus were so well conserved that a search of the Protein Data Bank (PDB) was undertaken to find an Ig-like domain that did not contain this nucleation motif. The resultant domain, CAfn2, was subject to a detailed Φ-value analysis that produced a gratifying result: the folding nucleus had simply ‘slipped’ down the core to use an adjacent pair of hydrophobic residues [34] – both the obligate and critical nuclei have moved in response to sequence changes (Figure 2D).

A final surprise in this analysis of pathway malleability in Ig-like domains came from a more detailed analysis of I27. This domain exhibited unusual anti-Hammond behaviour at high concentrations of denaturant and upon mutation. These data were used to infer the presence of an alternate folding pathway that nucleated at the E–F loop – both the critical and the obligate nucleus have moved entirely (Figure 2E) [35]. Thus we find that Ig-like domains contain at least two potential nucleation motifs, with one foldon comprising the B, C, E and F strands and one foldon centred on the E–F loop. Note that we are not implying that every immunoglobulin-like domain can display all types of pathway malleability, merely that the topology of the immunoglobulin fold allows for each. We speculate that this robustness to sequence changes might account for the success of this fold in Nature.

Are all protein folds as malleable? Using a stringent definition for transition state inflexibility, no shift in the position or size of the folding nucleus, the classic two state folder CI2 and the small three-helix bundle BdpA are the only domains for which no experimental perturbation has resulted in an altered transition state structure (Figure 2A). In the case of CI2, this inflexibility extends to point mutation, circularization, circular permutation [36] and even bisection [37], and it appears that this protein really does have only one energetically accessible nucleation motif. However, since no other members of this fold have been studied, it is not yet known if this is a general feature of this protein topology. The BdpA protein has been less ruthlessly perturbed and, while the transition state is not affected by point mutation or by temperature [38,39], a more serious structural perturbation may yet have an effect. An interesting case is demonstrated by the LysM domain, which shows an identical pattern of Φ-values after circularization [40], albeit with a global decrease in magnitude. A detailed Eyring analysis suggests that the lower entropy cost of transition state formation is compensated for by a lower enthalpy of contacts: the protein still folds through the same pathway, with a structurally identical but spatially expanded nucleus (Figure 2B).

The apparent malleability of the transition state ensemble can be strongly dependent on the imposed perturbation, as demonstrated by the β-sandwich domain α-spectrin SH3. The wild-type transition state is formed from the packing of two out of the three native state β-hairpins (RT loop and distal loop). A circular permutant that cleaved the RT loop resulted in an unchanged folding pathway, (Figure 2B), but an alternate permutant that cut the distal loop resulted in a completely different transition state structure involving the n-Src loop and the WT termini (Figure 2E) [41]. Other large-scale shifts in the obligate nucleus are not uncommon, especially where the folds exhibit symmetry. The symmetrical, ubiquitin-like Protein G, which comprises a central helix packing on two terminal hairpins, is a good example of such a large change. The wild type protein nucleates using the C-terminal hairpin and helix, as determined by Φ-value analysis [42]. However, a computationally redesigned version of the protein was successfully engineered to fold via the N-terminal hairpin [43], with a transition state reminiscent of the homologous Protein L [44]. In both of these cases, SH3 and ubiquitin-like domains, the protein topology provides at least two foldons, either of which is able to nucleate under the right conditions. As with the S6 proteins, these foldons are overlapping.

The role of intermediates in folding

As mentioned previously, the engrailed homeodomain has been shown to fold through a framework mechanism [11]. In fact, the secondary structural propensity of En-HD is so high that individual helices are stable in isolation (Figure 2, F1). This leads to three-state folding behaviour where kinetic intermediates accumulate. Reducing the secondary structural propensity results in a domain where no helix is stable in isolation. Now, the transiently formed helices are only stabilized once they have accumulated sufficient long-range interactions, and this interdependency results in global folding cooperativity, as seen with c-Myb. This behaviour is shown in Figure 2 as the slide from framework (F1) to nucleation condensation (NC). Nevertheless, c-Myb can be specifically mutated to increase the helical propensity, and convert the folding kinetics to three-state [45]. A similar effect is seen with the immunity proteins, Im7 and Im9 [46,47], which share a common transition state structure despite the fact that Im9 folds in a two-state manner (no independently stable submotifs) while Im7 exhibits three-state kinetics (with at least one independently stable submotif). By stabilizing the nucleating foldon, Im9 was rationally engineered to fold through a kinetic intermediate, while retaining the transition state structure of the homologous Im7 domain [48]. This switch does not always require substantial redesign, as shown by some elegant studies of RNase H, which demonstrated that a single point mutation (Ile to Asp) is sufficient to remove an on-pathway folding intermediate and thus energetically couples the two subdomains of the protein [49,50]. Transiently populated intermediates have also been introduced into, or removed from, the lipocalins [51,52], the immunoglobulin-like proteins [53] and the cytochromes [54] without altering the transition state structure. Taken together, these studies are proof that a folding pathway cannot be solely defined by its kinetic intermediates.

A slightly different result came from studies on five homologous members of the PDZ domain-like fold. In each case, the protein was shown to fold over two sequential transition states with a high-energy intermediate. As with Im9, this intermediate was deliberately stabilized, and the resulting domain did indeed fold with three-state kinetics [55]; however, the stabilized intermediate was subsequently shown to be off-pathway [56••]. Moreover, a human PDZ domain was found to fold through an intermediate that was either on- or off-pathway, depending on the solution conditions [56••]. This is an extremely interesting example where one of the component foldons has mutated so as to be the most stable species under certain solution conditions, as shown by the presence of an equilibrium intermediate. We infer that the PDZ domain contains at least two nucleation competent motifs within its structure. If the protein nucleates using the first (stable) foldon, then the second energy barrier is larger than the first and an intermediate accumulates (Figure 3A). If, however, the protein nucleates using the second (unstable) foldon, then the second energy barrier is smaller than the first and the whole folding process is cooperative. Under certain experimental conditions, it is easier for the intermediate to fully unfold and follow the alternate nucleation pathway than it is for the intermediate to progress directly to the native state (Figure 3B). In these cases, the intermediate appears to be off-pathway. The PDZ behaviour was modeled on that of lysozyme, which contains a stable α-domain, an unstable β-domain, and folds with a ‘triangular’ scheme of two parallel pathways, only one of which exhibits a kinetic intermediate [57]. Alternative folding pathways and kinetic traps have also been observed, and analysed, for homologous members of the flavodoxin-like fold [58,59], the β-trefoil family [60,61] and the caspase recruitment domains [62], amongst others.

Figure 3.

Figure 3

A protein with more than one foldon has access to multiple folding pathways and may exhibit both on-pathway and off-pathway intermediates.

Lowercase letters denote unstructured foldons (p, q) and uppercase letters denote structured foldons (P, Q). The double dagger (‡) denotes the foldon that is (un)folding at each transition state. (a) Both the PDZ domains and lysozyme have been shown to fold through a triangular folding scheme under certain experimental conditions. This can be explained by considering a protein with two component foldons (p, q) either of which can fold first. Importantly, one foldon is stable in isolation (P) but the other is unstable in isolation (q). In the blue pathway, the second energy barrier (q folding) is larger than the first energy barrier (p folding) and therefore an on-pathway intermediate accumulates. In the red pathway, the intermediate (p-Q) is unstable and folding is two-state. If the highest energy transition states on each pathway are close in energy, (here: p-q and p-q), there is significant flux over both folding routes (about 3:2 blue:red for the PDZ domains, and 4:1 for the lysozyme domain). (b) Under alternative experimental conditions, formation of one foldon may actually hinder the folding of the second foldon: the energy barrier p-q to p-q is lower than the energy barrier P-q to P-q. Although the majority of the denatured proteins (p-q) fold along the blue pathway to the intermediate (P-q), it is actually less energetically costly for this intermediate to unfold and follow the alternate red pathway than it is for the protein to fold directly from the intermediate to the native state. In this case, the intermediate would appear to be off-pathway – despite the fact that it is possible for the intermediate to fold directly to the native state. This may be the case for the PDZ domain when the temperature is dropped from 37 °C to 25 °C. The intermediate P-q is the same in both cases, but the relative heights of the four energy barriers determine whether or not it is on-pathway (a) or off-pathway (b).

Comparisons between folds

Both spectrin domains and homeodomains are three-helix bundle proteins. Three spectrin domains have been investigated in detail, (R15, R16 and R17), all from chicken brain α-spectrin. As seen for the homeodomains, there is no common folding mechanism, with R16 (and R17) folding by the collision of partly pre-formed helices [63,64], while R15 folds by classical nucleation-condensation [65]. In the spectrin case, however, it is not increased helical propensity in R16 that favours the framework-like mechanism: rather, it is the lack of a competent folding nucleus (Figure 2, F2). Addition of a nucleus results in a change in the folding mechanism from framework towards nucleation condensation, as shown in Figure 2 with a slide from F2 to NC [66••,67]. Interestingly, in contrast to the homeodomains where the framework mechanism leads to faster folding, in spectrin it is the proteins that fold by nucleation condensation that fold faster. This difference is probably related to the difference in size of these two folds. The helices in spectrin are long (8–10 turns per helix) unlike the short 2–3 turn helices in the homeodomains. We have speculated that there is a frustrated search for the correct docking of the helices in the spectrin domains, manifested as ‘internal friction’, that explains this observation [66••,68,69]. Remarkably, it has not been possible to alter the folding pathway of R15, either to move towards a framework-like mechanism, or to induce a change in the position of the nucleus: radical destabilization of the folding nucleus in R15, which causes significantly slower folding and unfolding, still results in a protein with Φ-values that are identical to the wild-type protein (unpublished data). This protein therefore shows no signs of pathway malleability (Figure 2A), unlike its homologues R16 and R17.

Combining experiment and computational studies

Knotted proteins

One of the more surprising results in recent years is the finding that knotted proteins are able to fold spontaneously, without chaperones or enzymatic help, to the native knotted state. Mallam and Jackson investigated two members of the α/β knot family and observed that both YbeA and YibK folded with similar rates and through comparable kinetic pathways, from knotted denatured states [70]. In an elegant recent follow-up study [71], the authors followed the folding of these proteins in a cell-free translation system and demonstrated that the newly synthesized proteins have to knot before they can fold – a rate limiting process that is accelerated by chaperonins. Nevertheless, this knotting process must be controlled by the primary sequence of the protein and thus it is very interesting to investigate homologous proteins where some are knotted and some are not. Faccioli and co-workers used coarse-grained protein models to study the folding of the natively-knotted N-acetylornithine carbamoyltransferase (AOTCase) and a homologous unknotted ornithine carbamoyltransferase (OTCase). They found that, when non-native interactions were ignored, neither protein was able to form a trefoil knot. By contrast, when non-native interactions were added to the model, the AOTCase was able to spontaneously knot in a substantial proportion of the simulations [72••]. This kind of study is particularly useful, since it can be used to highlight important folding contacts that cannot be deduced from the native, denatured or transition states. In this case, the simulations predict contacts that can be added/removed in vitro to make a knotted form of OTCase or a non-knotted mutant of AOTCase.

Nearly the same sequence but a different fold

As a contrast to the fold approach, several groups have been working towards designing proteins with highly similar amino acid sequences, but which cooperatively fold to different native state topologies. This quest, known as the Paracelsus Challenge, was first achieved in 1997 when Reagan and co-workers designed two proteins that were more than 50% identical yet adopted different native folds (ROP-like and ubiquitin-like) [73]. This design was surpassed in 2005, and again in 2008, when Bryan and co-workers developed two polypeptide chains that are 88% identical and yet adopt very different tertiary structures [74]. These proteins have been studied both by experiment and computationally, and the conclusion is that the final native topology is determined by the structure of the denatured state and the very earliest folding events [75,76••]. In the case of the GX88 proteins, the early development of a β-hairpin in one sequence prevents α-helical formation in that region, and leads to the ubiquitin-like fold [75,76••]. The alternate sequence retains significant helical structure in the denatured state, which leads to the all-α helical bundle. Residual structure in the denatured state has also recently been shown to be important for the folding of the ribonuclease domains [77] and the SUMO proteins [78].

In a more recent extensive study of the designed system Gianni and co-workers have shown that GA88 folds using a robust transition state to a three helical bundle, while GB88 folds over a very malleable energy landscape to a ubiquitin-like (mostly β-sheet) topology [79]. This malleability is assigned to the presence of multiple, competing foldons. In contrast to most natural proteins, where the component foldons work in unison to provide a cooperatively folded protein, the Gx88 designed proteins provide an example where two structurally overlapping foldons work in opposition. By fine-tuning the energy cost of each nucleating foldon, the overall topology of the whole protein can be adjusted. This result should be directly applicable to the study of aggregation-prone polypeptides, where minimal perturbations in structure and/or solution conditions are able to change the resulting topology of the folded state from native to the universal cross-β amyloid structure.

Summary

What is clear from many of these studies is that researchers should be wary of characterising the folding of a particular protein topology based on a single member of the fold. While it may be informative to study a wide cross-section of the proteome [80], gross comparisons between different folds are unable to inform as to how and why a polypeptide chain folds to its specific native state. These answers mostly come from more intricate studies, looking for differences in the folding of closely related proteins (the so-called ‘Fold Approach’). For example, such studies have taught us that a folding pathway should not be defined by its kinetic intermediates, since these species can easily be introduced into, or removed from, the energy landscape (e.g. En-HD/c-Myb, Im7/Im9, PDZ). In addition, while some proteins appear to be very restricted in their response to mutation (CI2, LysM), other folds exhibit a high degree of pathway malleability. This latter group includes the immunoglobulin-like domains, which are able to change their folding nucleus in response to deletions in the hydrophobic core [34], changes in solvent conditions [35], and even under mechanical stress [81,82]. This plasticity in the energy landscape may confer an evolutionary advantage over more restricted folds, and may explain why the topologically complex Ig-like domains are so prevalent when compared to more simple folds: changes in sequence that are required for functional reasons can be easily compensated for by a shift in the folding nucleus. It is also observed that symmetric proteins, such as the ubiquitin-like domains [42,44], show more pathway malleability than similarly sized asymmetric proteins – presumably owing to the comparable entropic cost of topologically symmetric foldons [83,84].

The idea that protein domains comprise several foldons (individually cooperative submotifs) is particularly appealing, since it is able to simplify the folding of complex topologies by introducing the concept of a ‘funnel of funnels’ [85]. This would also have the advantage that de novo proteins could be systematically built using a toolbox of smaller components. Indeed, Baker and co-workers recently emphasized that it is easy to rationally stabilize the native state of a protein, but it is much harder to disfavour the plethora of non-native states that are also possible. Their phenomenal success in designing five new stable, monomeric proteins from scratch was based on the structural overlap of several defined motifs with a known topological bias, specifically chosen to favour funnel-shaped energy landscapes [86••]. While it is certainly true that the ferrodoxin-like proteins comprise two overlapping foldons, whether or not this is a general feature of all complex protein folds remains to be seen. Nevertheless, one interesting observation is that the size of the dominant foldon may be related to topological complexity. The spectrin repeats [67] and homeodomain-like bundles [12] each nucleate using two of the three helices; the LysM domain [40], ferrodoxin-like proteins and ubiquitin-like domains [18] appear to use a three component foldon; finally, the complex Greek-key immunoglobulin-like domains [33] and Death Domains [87] use a four-component foldon. While this scaling is not necessary for the protein to fold correctly, (for example, the Ig-like domains can fold using the simple E–F loop motif [35]), it may be an evolutionary method to ensure that the protein folds cooperatively and avoids misfolding or aggregation.

In summary, the ‘fold approach’ has contributed significantly to our understanding of the fundamental principles underpinning the efficient folding of evolved proteins on relatively smooth, funnel-like energy landscapes. Furthermore, such studies allow insight into the design of new proteins that can fold efficiently, on funnel-shaped energy landscapes.

References and recommended reading

Papers of particular interest, published within the period of review, have been highlighted as:

  • • of special interest

  • •• of outstanding interest

Acknowledgements

This work was supported by the Wellcome Trust (grant numbers WT064417 and WT095195). J.C. is a Wellcome Trust Senior Research Fellow.

References

  • 1.Hamill S., Cota E., Chothia C., Clarke J. Conservation of folding and stability within a protein family: the tyrosine corner as an evolutionary cul-de-sac. J Mol Biol. 2000;295:641–649. doi: 10.1006/jmbi.1999.3360. [DOI] [PubMed] [Google Scholar]
  • 2•.Nickson A.A., Clarke J. What lessons can be learned from studying the folding of homologous proteins? Methods. 2010;52:38–50. doi: 10.1016/j.ymeth.2010.06.003. [DOI] [PMC free article] [PubMed] [Google Scholar]; Our recent review on the folding mechanisms of homologous proteins details several further examples of the study of related proteins, and provides more information about many of the concepts presented herein.
  • 3.Wetlaufer D.B. Nucleation, rapid folding, and globular intrachain regions in proteins. Proc Natl Acad Sci USA. 1973;70:697–701. doi: 10.1073/pnas.70.3.697. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Dolgikh D.A., Gilmanshin R.I., Brazhnikov E.V., Bychkova V.E., Semisotnov G.V., SYu V., Ptitsyn O.B. Alpha-Lactalbumin: compact state with fluctuating tertiary structure? FEBS Lett. 1981;136:311–315. doi: 10.1016/0014-5793(81)80642-4. [DOI] [PubMed] [Google Scholar]
  • 5.Karplus M., Weaver D.L. Protein-folding dynamics. Nature. 1976;260:404–406. doi: 10.1038/260404a0. [DOI] [PubMed] [Google Scholar]
  • 6.Jackson S.E., Fersht A.R. Folding of chymotrypsin inhibitor 2. 1. Evidence for a two-state transition. Biochemistry. 1991;30:10428–10435. doi: 10.1021/bi00107a010. [DOI] [PubMed] [Google Scholar]
  • 7.Itzhaki L.S., Otzen D.E., Fersht A.R. The structure of the transition state for folding of chymotrypsin inhibitor 2 analysed by protein engineering methods: evidence for a nucleation-condensation mechanism for protein folding. J Mol Biol. 1995;254:260–288. doi: 10.1006/jmbi.1995.0616. [DOI] [PubMed] [Google Scholar]
  • 8.Paci E., Lindorff-Larsen K., Dobson C.M., Karplus M., Vendruscolo M. Transition state contact orders correlate with protein folding rates. J Mol Biol. 2005;352:495–500. doi: 10.1016/j.jmb.2005.06.081. [DOI] [PubMed] [Google Scholar]
  • 9.Plaxco K.W., Simons K.T., Baker D. Contact order, transition state placement and the refolding rates of single domain proteins. J Mol Biol. 1998;277:985–994. doi: 10.1006/jmbi.1998.1645. [DOI] [PubMed] [Google Scholar]
  • 10.Mayor U., Guydosh N.R., Johnson C.M., Grossmann J.G., Sato S., Jas G.S., Freund S.M.V., Alonso D.O.V., Daggett V., Fersht A.R. The complete folding pathway of a protein from nanoseconds to microseconds. Nature. 2003;421:863–867. doi: 10.1038/nature01428. [DOI] [PubMed] [Google Scholar]
  • 11.Gianni S., Guydosh N.R., Khan F., Caldas T.D., Mayor U., White G.W.N., DeMarco M.L., Daggett V., Fersht A.R. Unifying features in protein-folding mechanisms. Proc Natl Acad Sci USA. 2003;100:13286–13291. doi: 10.1073/pnas.1835776100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12•.Banachewicz W., Religa T.L., Schaeffer R.D., Daggett V., Fersht A.R. Malleability of folding intermediates in the homeodomain superfamily. Proc Natl Acad Sci USA. 2011;108:5596–5601. doi: 10.1073/pnas.1101752108. [DOI] [PMC free article] [PubMed] [Google Scholar]; This study on the Pit1 homeodomain demonstrates the ‘missing link’ between the nucleation-condensation and framework mechanisms, and supports the hypothesis of a continuum of folding mechanisms between these two extremes.
  • 13.Neuweiler H., Sharpe T.D., Rutherford T.J., Johnson C.M., Allen M.D., Ferguson N., Fersht A.R. The folding mechanism of BBL: plasticity of transition-state structure observed within an ultrafast folding protein family. J Mol Biol. 2009;390:1060–1073. doi: 10.1016/j.jmb.2009.05.011. [DOI] [PubMed] [Google Scholar]
  • 14.Letarov A.V., Londer Y.Y., Boudko S.P., Mesyanzhinov V.V. The carboxy-terminal domain initiates trimerization of bacteriophage T4 fibritin. Biochemistry (Mosc) 1999;64:817–823. [PubMed] [Google Scholar]
  • 15.Panchenko A.R., Luthey-Schulten Z., Cole R., Wolynes P.G. The foldon universe: a survey of structural similarity and self-recognition of independently folding units. J Mol Biol. 1997;272:95–105. doi: 10.1006/jmbi.1997.1205. [DOI] [PubMed] [Google Scholar]
  • 16.Maity H., Maity M., Krishna M.M.G., Mayne L., Englander S.W. Protein folding: the stepwise assembly of foldon units. Proc Natl Acad Sci USA. 2005;102:4741–4746. doi: 10.1073/pnas.0501043102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Hedberg L., Oliveberg M. Scattered Hammond plots reveal second level of site-specific information in protein folding: phi’ (beta++) Proc Natl Acad Sci USA. 2004;101:7606–7611. doi: 10.1073/pnas.0308497101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Lindberg M.O., Oliveberg M. Malleability of protein folding pathways: a simple reason for complex behaviour. Curr Opin Struct Biol. 2007;17:21–29. doi: 10.1016/j.sbi.2007.01.008. [DOI] [PubMed] [Google Scholar]
  • 19.Lindberg M., Tångrot J., Oliveberg M. Complete change of the protein folding transition state upon circular permutation. Nat Struct Biol. 2002;9:818–822. doi: 10.1038/nsb847. [DOI] [PubMed] [Google Scholar]
  • 20.Lindberg M.O., Haglund E., Hubner I.A., Shakhnovich E.I., Oliveberg M. Identification of the minimal protein-folding nucleus through loop-entropy perturbations. Proc Natl Acad Sci USA. 2006;103:4083–4088. doi: 10.1073/pnas.0508863103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Main E.R.G., Jackson S.E., Regan L. The folding and design of repeat proteins: reaching a consensus. Curr Opin Struct Biol. 2003;13:482–489. doi: 10.1016/s0959-440x(03)00105-2. [DOI] [PubMed] [Google Scholar]
  • 22•.Aksel T., Majumdar A., Barrick D. The contribution of entropy, enthalpy, and hydrophobic desolvation to cooperativity in repeat-protein folding. Structure. 2011;19:349–360. doi: 10.1016/j.str.2010.12.018. [DOI] [PMC free article] [PubMed] [Google Scholar]; This paper presents a detailed analysis of the folding of consensus repeat proteins in which an Ising-like model is used to dissect the inherent instability of each repeat from the stabilizing interactions between adjacent repeats.
  • 23.Lowe A.R., Itzhaki L.S. Biophysical characterisation of the small ankyrin repeat protein myotrophin. J Mol Biol. 2007;365:1245–1255. doi: 10.1016/j.jmb.2006.10.060. [DOI] [PubMed] [Google Scholar]
  • 24.Lowe A.R., Itzhaki L.S. Rational redesign of the folding pathway of a modular protein. Proc Natl Acad Sci USA. 2007;104:2679–2684. doi: 10.1073/pnas.0604653104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Courtemanche N., Barrick D. The leucine-rich repeat domain of Internalin B folds along a polarized N-terminal pathway. Structure. 2008;16:705–714. doi: 10.1016/j.str.2008.02.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Tripp K.W., Barrick D. Rerouting the folding pathway of the Notch ankyrin domain by reshaping the energy landscape. J Am Chem Soc. 2008;130:5681–5688. doi: 10.1021/ja0763201. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Ternström T., Mayor U., Akke M., Oliveberg M. From snapshot to movie: phi analysis of protein folding transition states taken one step further. Proc Natl Acad Sci USA. 1999;96:14854–14859. doi: 10.1073/pnas.96.26.14854. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Shakhnovich E.I. Folding nucleus: specific or multiple? Insights from lattice models and experiments. Fold Des. 1998;3:R108–R111. doi: 10.1016/s1359-0278(98)00056-x. [DOI] [PubMed] [Google Scholar]
  • 29.Han J.-H., Batey S., Nickson A.A., Teichmann S.A., Clarke J. The folding and evolution of multidomain proteins. Nat Rev Mol Cell Biol. 2007;8:319–330. doi: 10.1038/nrm2144. [DOI] [PubMed] [Google Scholar]
  • 30.Hamill S., Steward A., Clarke J. The folding of an immunoglobulin-like Greek key protein is defined by a common-core nucleus and regions constrained by topology. J Mol Biol. 2000;297:165–178. doi: 10.1006/jmbi.2000.3517. [DOI] [PubMed] [Google Scholar]
  • 31.Cota E., Steward A., Fowler S., Clarke J. The folding nucleus of a fibronectin type III domain is composed of core residues of the immunoglobulin-like fold. J Mol Biol. 2001;305:1185–1194. doi: 10.1006/jmbi.2000.4378. [DOI] [PubMed] [Google Scholar]
  • 32.Geierhaas C.D., Paci E., Vendruscolo M., Clarke J. Comparison of the transition states for folding of two Ig-like proteins from different superfamilies. J Mol Biol. 2004;343:1111–1123. doi: 10.1016/j.jmb.2004.08.100. [DOI] [PubMed] [Google Scholar]
  • 33.Fowler S., Clarke J. Mapping the folding pathway of an immunoglobulin domain: structural detail from phi value analysis and movement of the transition state. Structure. 2001;9:355–366. doi: 10.1016/s0969-2126(01)00596-2. [DOI] [PubMed] [Google Scholar]
  • 34.Lappalainen I., Hurley M.G., Clarke J. Plasticity within the obligatory folding nucleus of an immunoglobulin-like domain. J Mol Biol. 2008;375:547–559. doi: 10.1016/j.jmb.2007.09.088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Wright C.F., Lindorff-Larsen K., Randles L.G., Clarke J. Parallel protein-unfolding pathways revealed and mapped. Nat Struct Biol. 2003;10:658–662. doi: 10.1038/nsb947. [DOI] [PubMed] [Google Scholar]
  • 36.Otzen D.E., Fersht A.R. Folding of circular and permuted chymotrypsin inhibitor 2: retention of the folding nucleus. Biochemistry. 1998;37:8139–8146. doi: 10.1021/bi980250g. [DOI] [PubMed] [Google Scholar]
  • 37.Neira J.L., Davis B., Ladurner A.G., Buckle A.M., Gay G.d.P., Fersht A.R. Towards the complete structural characterization of a protein folding pathway: the structures of the denatured, transition and native states for the association/folding of two complementary fragments of cleaved chymotrypsin inhibitor 2. Direct evidence for a nucleation-condensation mechanism. Fold Des. 1996;1:189–208. doi: 10.1016/s1359-0278(96)00031-4. [DOI] [PubMed] [Google Scholar]
  • 38.Sato S., Fersht A.R. Searching for multiple folding pathways of a nearly symmetrical protein: temperature dependent phi-value analysis of the B domain of protein A. J Mol Biol. 2007;372:254–267. doi: 10.1016/j.jmb.2007.06.043. [DOI] [PubMed] [Google Scholar]
  • 39.Baxa M.C., Freed K.F., Sosnick T.R. Quantifying the structural requirements of the folding transition state of protein A and other systems. J Mol Biol. 2008;381:1362–1381. doi: 10.1016/j.jmb.2008.06.067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Nickson A.A., Stoll K.E., Clarke J. Folding of a LysM domain: entropy-enthalpy compensation in the transition state of an ideal two-state folder. J Mol Biol. 2008;380:557–569. doi: 10.1016/j.jmb.2008.05.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Viguera A.R., Serrano L., Wilmanns M. Different folding transition states may result in the same native structure. Nat Struct Biol. 1996;3:874–880. doi: 10.1038/nsb1096-874. [DOI] [PubMed] [Google Scholar]
  • 42.McCallister E.L., Alm E., Baker D. Critical role of beta-hairpin formation in protein G folding. Nat Struct Biol. 2000;7:669–673. doi: 10.1038/77971. [DOI] [PubMed] [Google Scholar]
  • 43.Nauli S., Kuhlman B., Baker D. Computer-based redesign of a protein folding pathway. Nat Struct Biol. 2001;8:602–605. doi: 10.1038/89638. [DOI] [PubMed] [Google Scholar]
  • 44.Kim D.E., Fisher C., Baker D. A breakdown of symmetry in the folding transition state of protein L. J Mol Biol. 2000;298:971–984. doi: 10.1006/jmbi.2000.3701. [DOI] [PubMed] [Google Scholar]
  • 45.White G.W.N., Gianni S., Grossmann J.G., Jemth P., Fersht A.R., Daggett V. Simulation and experiment conspire to reveal cryptic intermediates and a slide from the nucleation-condensation to framework mechanism of folding. J Mol Biol. 2005;350:757–775. doi: 10.1016/j.jmb.2005.05.005. [DOI] [PubMed] [Google Scholar]
  • 46.Capaldi A.P., Kleanthous C., Radford S.E. Im7 folding mechanism: misfolding on a path to the native state. Nat Struct Biol. 2002;9:209–216. doi: 10.1038/nsb757. [DOI] [PubMed] [Google Scholar]
  • 47.Friel C.T., Capaldi A.P., Radford S.E. Structural analysis of the rate-limiting transition states in the folding of Im7 and Im9: similarities and differences in the folding of homologous proteins. J Mol Biol. 2003;326:293–305. doi: 10.1016/s0022-2836(02)01249-4. [DOI] [PubMed] [Google Scholar]
  • 48.Friel C.T., Beddard G.S., Radford S.E. Switching two-state to three-state kinetics in the helical protein Im9 via the optimisation of stabilising non-native interactions by design. J Mol Biol. 2004;342:261–273. doi: 10.1016/j.jmb.2004.06.076. [DOI] [PubMed] [Google Scholar]
  • 49•.Connell K.B., Miller E.J., Marqusee S. The folding trajectory of RNase H is dominated by its topology and not local stability: a protein engineering study of variants that fold via two-state and three-state mechanisms. J Mol Biol. 2009;391:450–460. doi: 10.1016/j.jmb.2009.05.085. [DOI] [PMC free article] [PubMed] [Google Scholar]; This study demonstrates that a single amino acid substitution can be sufficient to change the kinetic profile of a protein folding reaction, without altering the folding pathway. The authors suggest that folding intermediates may actually promote the efficient folding of some protein topologies.
  • 50.Spudich G.M., Miller E.J., Marqusee S. Destabilization of the Escherichia coli RNase H kinetic intermediate: switching between a two-state and three-state folding mechanism. J Mol Biol. 2004;335:609–618. doi: 10.1016/j.jmb.2003.10.052. [DOI] [PubMed] [Google Scholar]
  • 51.Dalessio P.M., Ropson I.J. beta-sheet proteins with nearly identical structures have different folding intermediates. Biochemistry. 2000;39:860–871. doi: 10.1021/bi991937j. [DOI] [PubMed] [Google Scholar]
  • 52.Dalessio P.M., Boyer J.A., McGettigan J.L., Ropson I.J. Swapping core residues in homologous proteins swaps folding mechanism. Biochemistry. 2005;44:3082–3090. doi: 10.1021/bi048125u. [DOI] [PubMed] [Google Scholar]
  • 53.Lorch M., Mason J.M., Clarke A.R., Parker M.J. Effects of core mutations on the folding of a beta-sheet protein: implications for backbone organization in the I-state. Biochemistry. 1999;38:1377–1385. doi: 10.1021/bi9817820. [DOI] [PubMed] [Google Scholar]
  • 54.Borgia A., Bonivento D., Travaglini-Allocatelli C., Di Matteo A., Brunori M. Unveiling a hidden folding intermediate in c-type cytochromes by protein engineering. J Biol Chem. 2006;281:9331–9336. doi: 10.1074/jbc.M512127200. [DOI] [PubMed] [Google Scholar]
  • 55.Ivarsson Y., Travaglini-Allocatelli C., Morea V., Brunori M., Gianni S. The folding pathway of an engineered circularly permuted PDZ domain. Prot Eng Des Sel. 2008;21:155–160. doi: 10.1093/protein/gzm077. [DOI] [PubMed] [Google Scholar]
  • 56••.Haq S.R., Jürgens M.C., Chi C.N., Koh C.-S., Elfström L., Selmer M., Gianni S., Jemth P. The plastic energy landscape of protein folding: a triangular folding mechanism with an equilibrium intermediate for a small protein domain. J Biol Chem. 2010;285:18051–18059. doi: 10.1074/jbc.M110.110833. [DOI] [PMC free article] [PubMed] [Google Scholar]; This paper details the link between various on- and off-pathway folding intermediates seen within the PDZ superfamily. A detailed kinetic and thermodynamic analysis is used to demonstrate that a particular folding intermediate can be shifted from on-pathway to off-pathway through a small change in solvent conditions.
  • 57.Wildegger G., Kiefhaber T. Three-state model for lysozyme folding: triangular folding mechanism with an energetically trapped intermediate. J Mol Biol. 1997;270:294–304. doi: 10.1006/jmbi.1997.1030. [DOI] [PubMed] [Google Scholar]
  • 58.Bollen Y.J.M., van Mierlo C.P.M. Protein topology affects the appearance of intermediates during the folding of proteins with a flavodoxin-like fold. Biophys Chem. 2005;114:181–189. doi: 10.1016/j.bpc.2004.12.005. [DOI] [PubMed] [Google Scholar]
  • 59•.Hills R.D., Kathuria S.V., Wallace L.A., Day I.J., Brooks C.L., Matthews C.R. Topological frustration in beta alpha-repeat proteins: sequence diversity modulates the conserved folding mechanisms of alpha/beta/alpha sandwich proteins. J Mol Biol. 2010;398:332–350. doi: 10.1016/j.jmb.2010.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]; A combined experimental and computational approach is used to demonstrate that the hydrophobic residues that are required for the fast folding of the flavodoxin-like proteins are also responsible for the prematurely folded unproductive intermediates.
  • 60.Liu C.S., Gaspar J.A., Wong H.J., Meiering E.M. Conserved and nonconserved features of the folding pathway of hisactophilin, a beta-trefoil protein. Protein Sci. 2002;11:669–679. doi: 10.1110/ps.31702. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Chavez L.L., Gosavi S., Jennings P.A., Onuchic J.N. Multiple routes lead to the native state in the energy landscape of the beta-trefoil family. Proc Natl Acad Sci USA. 2006;103:10254–10258. doi: 10.1073/pnas.0510110103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Chen Y.-R., Clark A.C. Substitutions of prolines examine their role in kinetic trap formation of the caspase recruitment domain (CARD) of RICK. Protein Sci. 2006;15:395–409. doi: 10.1110/ps.051943006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Scott K.A., Randles L.G., Clarke J. The folding of spectrin domains II: phi-value analysis of R16. J Mol Biol. 2004;344:207–221. doi: 10.1016/j.jmb.2004.09.023. [DOI] [PubMed] [Google Scholar]
  • 64.Scott K.A., Randles L.G., Moran S.J., Daggett V., Clarke J. The folding pathway of spectrin R17 from experiment and simulation: using experimentally validated MD simulations to characterize States hinted at by experiment. J Mol Biol. 2006;359:159–173. doi: 10.1016/j.jmb.2006.03.011. [DOI] [PubMed] [Google Scholar]
  • 65.Wensley B.G., Gärtner M., Choo W.X., Batey S., Clarke J. Different members of a simple three-helix bundle protein family have very different folding rate constants and fold by different mechanisms. J Mol Biol. 2009;390:1074–1085. doi: 10.1016/j.jmb.2009.05.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66••.Wensley B.G., Batey S., Bone F.A.C., Chan Z.M., Tumelty N.R., Steward A., Kwa L.G., Borgia A., Clarke J. Experimental evidence for a frustrated energy landscape in a three-helix-bundle protein family. Nature. 2010;463:685–688. doi: 10.1038/nature08743. [DOI] [PMC free article] [PubMed] [Google Scholar]; The authors demonstrate that the slow folding spectrin repeats, R16 and R17, exhibit substantial internal friction (and hence a reduced pre-exponential factor) when compared to the faster folding R15 homologue. This is the first time that such an effect has been demonstrated experimentally.
  • 67•.Wensley B.G., Kwa L.G., Shammas S.L., Rogers J.M., Clarke J. Protein folding: adding a nucleus to guide helix docking reduces landscape roughness. J Mol Biol. 2012;423:273–283. doi: 10.1016/j.jmb.2012.08.003. [DOI] [PMC free article] [PubMed] [Google Scholar]; The authors perform a ‘minimal core swap’ to demonstrate that the internal friction of R16 and R17 can be localized to just five residues. By adding a folding nucleus to the slow folding R16, the protein switches folding mechanism and the internal friction is eliminated.
  • 68.Borgia A., Hoffmann A., Pfeil S., Lipman E.A., Clarke J., Schuler B. Localizing internal friction along the reaction coordinate of protein folding by combining ensemble and single molecule fluorescence spectroscopy. Nat Commun. 2012;3:1195. doi: 10.1038/ncomms2204. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Wensley B.G., Kwa L.G., Shammas S.L., Rogers J.M., Browning S., Yang Z., Clarke J. Separating the effects of internal friction and transition state energy to explain the slow, frustrated folding of spectrin domains. Proc Natl Acad Sci USA. 2012;109:17795–17799. doi: 10.1073/pnas.1201793109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Mallam A.L., Jackson S.E. A comparison of the folding of two knotted proteins: YbeA and YibK. J Mol Biol. 2007;366:650–665. doi: 10.1016/j.jmb.2006.11.014. [DOI] [PubMed] [Google Scholar]
  • 71•.Mallam A.L., Jackson S.E. Knot formation in newly translated proteins is spontaneous and accelerated by chaperonins. Nat Chem Biol. 2012;8:147–153. doi: 10.1038/nchembio.742. [DOI] [PubMed] [Google Scholar]; The authors use in vitro translation experiments to demonstrate that knot formation is rate limiting and precedes folding for two homologous knotted proteins. They demonstrate that, although the knots can form spontaneously, their formation is accelerated by the presence of chaperones.
  • 72••.Skrbić T., Micheletti C., Faccioli P. The role of non-native interactions in the folding of knotted proteins. PLoS Comp Biol. 2012;8:e1002504. doi: 10.1371/journal.pcbi.1002504. [DOI] [PMC free article] [PubMed] [Google Scholar]; Computational models are used to demonstrate the importance of non-native interactions to protein folding, and that if such interactions are ignored then it is not possible to generate the correct native structure of a knotted protein in silico.
  • 73.Dalal S., Balasubramanian S., Regan L. Protein alchemy: changing beta-sheet into alpha-helix. Nat Struct Biol. 1997;4:548–552. doi: 10.1038/nsb0797-548. [DOI] [PubMed] [Google Scholar]
  • 74.He Y., Chen Y., Alexander P., Bryan P.N., Orban J. NMR structures of two designed proteins with high sequence identity but different fold and function. Proc Natl Acad Sci USA. 2008;105:14412–14417. doi: 10.1073/pnas.0805857105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Scott K.A., Daggett V. Folding mechanisms of proteins with high sequence identity but different folds. Biochemistry. 2007;46:1545–1556. doi: 10.1021/bi061904l. [DOI] [PubMed] [Google Scholar]
  • 76••.Morrone A., McCully M.E., Bryan P.N., Brunori M., Daggett V., Gianni S., Travaglini-Allocatelli C. The denatured state dictates the topology of two proteins with almost identical sequence but different native structure and function. J Biol Chem. 2011;286:3863–3872. doi: 10.1074/jbc.M110.155911. [DOI] [PMC free article] [PubMed] [Google Scholar]; The authors use biophysical and computational techniques to deduce the mechanistic basic for the Paracelsus switch between GA88 and GB88, which are 88% identical and yet fold to very distinct native state topologies.
  • 77•.Ratcliff K., Marqusee S. Identification of residual structure in the unfolded state of ribonuclease H1 from the moderately thermophilic Chlorobium tepidum: comparison with thermophilic and mesophilic homologues. Biochemistry. 2010;49:5167–5175. doi: 10.1021/bi1001097. [DOI] [PMC free article] [PubMed] [Google Scholar]; A very interesting study that investigates how residual structure is deliberately encoded into the denatured states of thermophilic proteins in order to increase their thermal stability.
  • 78.Kumar D., Chugh J., Sharma S., Hosur R.V. Conserved structural and dynamics features in the denatured states of drosophila SUMO, human SUMO and ubiquitin proteins: implications to sequence-folding paradigm. Proteins. 2009;76:387–402. doi: 10.1002/prot.22354. [DOI] [PubMed] [Google Scholar]
  • 79.Giri R., Morrone A., Travaglini-Allocatelli C., Jemth P., Brunori M., Gianni S. Folding pathways of proteins with increasing degree of sequence identities but different structure and function. Proc Natl Acad Sci USA. 2012;109:17772–17776. doi: 10.1073/pnas.1201794109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Jonsson A.L., Scott K.A., Daggett V. Dynameomics: a consensus view of the protein unfolding/folding transition state ensemble across a diverse set of protein folds. Biophys J. 2009;97:2958–2966. doi: 10.1016/j.bpj.2009.09.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Best R.B., Fowler S.B., Herrera J.L.T., Steward A., Paci E., Clarke J. Mechanical unfolding of a titin Ig domain: structure of transition state revealed by combining atomic force microscopy, protein engineering and molecular dynamics simulations. J Mol Biol. 2003;330:867–877. doi: 10.1016/s0022-2836(03)00618-1. [DOI] [PubMed] [Google Scholar]
  • 82.Forman J.R., Yew Z.T., Qamar S., Sandford R.N., Paci E., Clarke J. Non-native interactions are critical for mechanical strength in PKD domains. Structure. 2009;17:1582–1590. doi: 10.1016/j.str.2009.09.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Wolynes P.G. Latest folding game results: protein A barely frustrates computationalists. Proc Natl Acad Sci USA. 2004;101:6837–6838. doi: 10.1073/pnas.0402034101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Itoh K., Sasai M. Flexibly varying folding mechanism of a nearly symmetrical protein: B domain of protein A. Proc Natl Acad Sci USA. 2006;103:7298–7303. doi: 10.1073/pnas.0510324103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Haglund E., Lindberg M.O., Oliveberg M. Changes of protein folding pathways by circular permutation. Overlapping nuclei promote global cooperativity. J Biol Chem. 2008;283:27904–27915. doi: 10.1074/jbc.M801776200. [DOI] [PubMed] [Google Scholar]
  • 86••.Koga N., Tatsumi-Koga R., Liu G., Xiao R., Acton T.B., Montelione G.T., Baker D. Principles for designing ideal protein structures. Nature. 2012;491:222–227. doi: 10.1038/nature11600. [DOI] [PMC free article] [PubMed] [Google Scholar]; This groundbreaking paper illustrates how simple topological rules can be combined with computer-aided design to generate new proteins and topologies from scratch. The authors demonstrate this approach by experimentally characterizing five de novo proteins, and showing that each is stable, monomeric, and two-state at equilibrium.
  • 87.Steward A., McDowell G.S., Clarke J. Topology is the principal determinant in the folding of a complex all-alpha Greek key death domain from human FADD. J Mol Biol. 2009;389:425–437. doi: 10.1016/j.jmb.2009.04.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Vendruscolo M., Paci E., Dobson C.M., Karplus M. Three key residues form a critical contact network in a protein folding transition state. Nature. 2001;409:641–645. doi: 10.1038/35054591. [DOI] [PubMed] [Google Scholar]

RESOURCES