Skip to main content
The Journal of Biological Chemistry logoLink to The Journal of Biological Chemistry
. 2011 Nov 23;287(4):2731–2738. doi: 10.1074/jbc.M111.312447

Trimming Down a Protein Structure to Its Bare Foldons

SPATIAL ORGANIZATION OF THE COOPERATIVE UNIT*

Ellinor Haglund , Jens Danielsson , Saraboji Kadhirvel §, Magnus O Lindberg , Derek T Logan §, Mikael Oliveberg ‡,1
PMCID: PMC3268430  PMID: 22117065

Background: Structural cooperativity safeguards native proteins but is yet poorly understood.

Results: Guided by the folding nucleus, we reduced the size of a protein without compromising its structural integrity.

Conclusion: Folding nuclei are used in a modular fashion to extend or reduce the cooperative units of proteins.

Significance: Understanding cooperativity is crucial for understanding protein function and for rational design of structural properties.

Keywords: Protein Design, Protein Domains, Protein Evolution, Protein Folding, Protein Structure, Folding Cooperativity, Native State Integrity

Abstract

Folding of the ribosomal protein S6 is a malleable process controlled by two competing, and partly overlapping, folding nuclei. Together, these nuclei extend over most of the S6 structure, except the edge strand β2, which is consistently missing in the folding transition states; despite being part of the S6 four-stranded sheet, β2 seems not to be part of the cooperative unit of the protein. The question is then whether β2 can be removed from the S6 structure without compromising folding cooperativity or native state integrity. To investigate this, we constructed a truncated variant of S6 lacking β2, reducing the size of the protein from 96 to 76 residues (S6Δβ2). The new S6 variant expresses well in Escherichia coli and has a well dispersed heteronuclear single quantum correlation spectrum and a perfectly wild-type-like crystal structure, but with a smaller three-stranded β-sheet. Moreover, S6Δβ2 displays an archetypical v-shaped chevron plot with decreased slope of the unfolding limb, as expected from a protein with maintained folding cooperativity and reduced size. The results support the notion that foldons, as defined by the structural distribution of the folding nuclei, represent a property-based level of hierarchy in the build-up of larger protein structures and suggest that the role of β2 in S6 is mainly in intermolecular binding, consistent with the position of this strand in the ribosomal assembly.

Introduction

The biological function of proteins is not only a matter of shape and stability, but also relies on the energy landscape controlling the structural dynamics and occupancy of alternatively structured states (1). As a painful reminder of the latter, protein misfolding diseases (2, 3) arise recurrently from mutations that would appear perfectly benign if judged by native structure and activity alone (3, 4). Of particular interest here are the role and molecular origins of folding cooperativity, which safeguard the integrity of the native structure by reducing the occupancy of partly unfolded or misfolded states (5, 6). A clue to how folding cooperativity is maintained and propagated over large structural distances was uncovered by the folding behavior of the ribosomal protein S6 (7, 8). The structure of S6 seems to be composed of two cooperative subunits or “foldons,” σ1 and σ2, that also act as competing nuclei in the folding process (8) (Fig. 1). S6 can thus fold along either of two parallel, and sequentially opposed, pathways, one starting with the nucleation of σ1 and the other starting with the nucleation of σ2 (9) (Fig. 1). Even so, the nucleation events of σ1 and σ2 are not entirely independent, but are coupled by a structural overlap in the form of the shared strand β1 (10). Such overlap between competing nuclei stands out as an efficient way of linking small cooperative subunits into larger structures without compromising global cooperativity or folding kinetics (10). Folding of large proteins would, in principle, simplify folding of smaller proteins that are structurally overlapping to parallel. However, from studies of S6 where the bias between the σ1 and σ2 pathways has systematically been shifted by circular permutation, it is apparent that the edge strand β2 never participates in the nucleation process (6, 8, 10). Despite being an integral part of the native sheet, β2 appears to be outside the cooperative unit of S6 (Fig. 1). It is then expected that S6 should also be able to fold cooperatively without β2, providing that the σ1 and σ2 foldons are sufficiently stable on their own. In this study, we have tested this possibility by simply removing β2 from the S6 structure; the strand was shuffled to the end of the sequence through circular permutation and then cut off (see Fig. 2). The truncated protein (S6Δβ2) displays a highly dispersed solution NMR spectrum characteristic of rigid tertiary structures and a crystal structure that closely matches that of the wild-type protein. Moreover, S6Δβ2 retains the cooperative folding behavior of wild-type S6 and the circular permutant from which it is derived. Taken together, this shows that β2 is not required for folding or native state integrity, but rather plays another role in the S6 structure, possibly in functional shape adjustment. This division of globular domains into cooperative core and subsidiary secondary structure is not only useful for guiding protein engineering, but also sheds light on dynamic behavior and structural constraints in protein evolution.

FIGURE 1.

FIGURE 1.

The two postulated foldons in S6 structure and how they partition folding reaction into two parallel pathways. A, schematic of the S6 structure. B, schematic top view of S6 showing the structural location of the overlapping foldons σ1 (green) and σ2 (blue). These foldons have been inferred from two competing folding nuclei, i.e. regions with high φ-values (6, 7). Notably, β2 seems to be outside this cooperative unit of the S6 structure. C, as folding can nucleate in either σ1 (green) or σ2 (blue), the folding reaction of S6 partitions into two competing pathways: σ1 → σ2 and σ2 → σ1.

FIGURE 2.

FIGURE 2.

Schematic outline of S6Δβ2 construction. Red lettering denotes altered sequence positions, and strand β2 is colored gray. First, the N and C termini of S6wt were linked by the TTPG loop, and the protein was incised between Lys-54 and Asp-55 to localize β2 at the C terminus (permutant P54–55). Second, β2 was truncated from the C terminus of P54–55 to obtain S6Δβ2. The latter step also included the insertion of gatekeepers in β3 (F7T(60) and L8T(61)) and in α1 (A62Q(20) and L63N(21)) to better adjust the newly exposed protein surface to solvent and avoid unwanted aggregation. The positions of these gatekeepers in the tentative S6Δβ2 structure are shown in red.

EXPERIMENTAL PROCEDURES

Protein Expression and Purification

Mutations were performed with the QuikChange Multi site-directed mutagenesis kit (Stratagene, Agilent Technologies, Santa Clara, CA). Oligonucleotides were purchased from Eurofins MWG Operon (Ebersberg, Germany), and all mutations were confirmed by sequencing (Eurofins MWG Operon). The protein was overexpressed in Escherichia coli strain BL21 and purified as described in Ref. 11.

NMR Spectroscopy

All NMR data were obtained at 25 °C, pH 6.3, and with a protein concentration of 0.5 mm, on a Bruker 700-MHz spectrometer (Bruker Avance, Karlsruhe, Germany) equipped with a cryogenically cooled triple resonance probe. Assignment was by standard 15N-{1H} HSQC,2 HNCA, HN(CO)CA, HN(CA)CO, HNCO, 15N-edited NOESY and total correlation spectroscopy experiments. Dynamics were measured by standard T1, T2, and steady-state heteronuclear NOE experiments. Spectra were transformed using NMRPipe (12) and analyzed with the program Sparky (45). In the T1 and T2 experiments, the signal attenuation, from 10 different relaxation delays, was fitted to a single exponential decay and the relaxation rates were determined. The fitting routine was performed using MATLAB (MathWorks, Natick, MA).

X-ray Crystallography

Crystals of S6Δβ2 were grown using the hanging drop vapor diffusion method at 20 °C from drops containing equal volumes of protein (18 mg/ml) and a reservoir solution composed of 1.9 m ammonium sulfate, 0.1 m MES, pH 6.5, and 8% (v/v) 1,4-dioxane. The addition of dioxane was necessary to avoid the formation of crystal clusters due to excessive nucleation. Crystals appeared after 4 days and reached a maximum size of 0.2 × 0.2 × 0.1 mm in 10 days. The crystals were cryoprotected using 20% glycerol in the mother liquor. Crystallographic data were collected at 100 K at station I911-2 of MAX-lab, Lund, Sweden, using a 165mm charge coupled device (CCD) detector from MarResearch (Norderstedt, Germany). Images were indexed, integrated, and scaled using XDS (13). Crystals diffracted to 0.96 Å resolution, and data collection statistics are shown in Table 1.

TABLE 1.

Crystallographic data-collection and refinement statistics for S6Δβ2

Values in parentheses correspond to the highest resolution shell.

Data statistics
    X-ray wavelength (Å) 1.038
    Resolution (Å) 20.0-0.96 (0.99-0.96)
    Space group P212121
    Unit cell parameters (Å) α = 22.2, β = 40.8, γ = 68.66
    Unit cell volume (Å3) 62,189
    Solvent content (%) 28.9
    Mosaicity (°) 0.18
    No. of measured reflections 230,289
    No. of independent reflections 38,300
    Completeness (%) 99.3 (98.2)
    Redundancy 6.0 (4.9)
    Rmerge (I) (%)a 6.2 (65.5)
    <I>/<σ(I)> 16.0 (3.0)

Refinement statistics
    Restraints/parameters 8147/6866
    Rmodel (%)b 11.3
    Rfree (%)c 13.7
    Rmodel (all data) (%) 11.3
    No. of water molecules 129
    r.m.s.d deviation from ideal values
        Bond lengths (Å) 0.016
        Angle distances (Å) 0.033
        Average B-factor, protein (water) (Å2) 11.6 (28.2)
        Average anisotropy for all protein atoms 0.56
    Ramachandran plot statisticse
        Most favored regions (%) 100

a Rmerge = Σhkl Σi |I(hkl)i − 〈I(hkl)〉|/Σhkl Σi I(hkl)i, where Σi I(hkl)i are the individual observations of the reflection with indices h,k,l.

b Rmodel = Σhkl |Fo(hkl) − Fc(hkl)|/Σhkl |Fo(hkl)|, where Fo and Fc are the observed and calculated structure factors, respectively.

c 5% random test set (1932 reflections).

d r.m.s., root mean square deviation.

e Calculated using Molprobity.

The structure of S6Δβ2 was determined by the molecular replacement method using the program Phaser (14), with the structure of ribosomal protein S6 from Thermus thermophilus (Protein Data Bank (PDB) code: 1RIS) (15) as a starting model. The model was built using the automatic building procedure in ARP/wARP (16), and the model was completed by manual building using the molecular graphics program Coot (17). In the initial stages, all refinement was carried out to a resolution of 1.2 Å using phenix.refine (18), with 5% of the total reflections randomly set aside for cross-validation. Subsequently, the resolution was gradually extended to the full resolution range, and refinement was carried out using SHELXL-97 (19) in conjugate-gradient least-squares mode. Disordered side chains were modeled, and their occupancies were refined using the restraint that the occupancies should sum to unity. Hydrogen atoms were not refined, but were included in “riding” positions. Anisotropic refinement was carried out in a series of rounds with SHELXL. After each of these refinement rounds, the model was inspected manually with 2|Fo| − |Fc| and |Fo| − |Fc| Fourier maps generated from SHELX fcf files using Coot. The refinement statistics are summarized in Table 1. The stereochemistry of the model was evaluated using MolProbity (20). Structural alignments and illustrations were made using the SSM algorithm (21) and PyMOL (22), respectively.

Stopped-flow Kinetics

Stopped-flow measurements and curve fitting were performed as in Ref. 10. All kinetic parameters were derived from the linear regime near the bottom of the chevron plots, i.e. at ±2 m from the transition midpoint, to avoid contributions from the curvatures in the unfolding and refolding limbs at high and low [GdmCl]. The data were fitted according to the standard equation

graphic file with name zbc00412-9355-m01.jpg

where kfH2O and kuH2O are the extrapolated values of the refolding (kf) and unfolding (ku) rate constants at 0 m GdmCl, and mf and mu are the slopes of the refolding and unfolding limbs, respectively.

RESULTS

Design of S6Δβ2

The new construct S6Δβ2 was derived from the circular permutant P54–55, which has a linker between the wild-type N and C termini and an incision between Lys-54 and Asp-55 in the flexible loop between β2 and β3 (Fig. 2). The purpose of the permutation is to change the position of β2 to the end of the sequence where it can easily be truncated by the introduction of a stop codon at Arg-78(36) (subscripts refer to the numbering in S6wt) (Fig. 2). To prevent S6Δβ2 oligomerization and better adjust the newly exposed surface to solvent, we introduced polar side-chain gatekeepers (23) in β3 (F7T(60) and L8T(61)) and α1 (A62Q(20) and L63N(21)) (Fig. 2). These gatekeepers were grafted from the mixed α/β-protein heterogeneous nuclear ribonucleoprotein K (HNRNP K) (24) (PDB code 1KHM), which has a structure similar to that of S6Δβ2.

NMR Spectroscopy

The solution HSQC spectrum of S6Δβ2 is well dispersed and displays all the characteristics of a folded protein with rigid tertiary structure (Fig. 3). Assignment of the backbone was complete, and identification of the secondary structure elements of S6Δβ2 was by NH, HN, Hα, Cα, and C′ backbone chemical shifts. The results indicate three β-strands (residues 6–12, 33–39, and 45–51) and two α-helices (residues 15–29 and 56–76), the positions of which overlap precisely with those of the parent protein S6wt (supplemental Fig. S1). No peak splitting was found in sequential vicinity to the prolines, which indicates that S6Δβ2 resides in a single isomeric state. In contrast, P54–55 exhibits proline cis-trans isomerization near the C terminus, which is truncated in S6Δβ2, and around Pro-3 near the N terminus, which is homogeneous in S6Δβ2 (25). The dynamic motions of the S6Δβ2 structure were mapped out by NMR relaxation experiments. At all positions, except for the outermost N-terminal residues, the relaxation data indicate a well ordered protein with a low degree of dynamic motion. The heteronuclear 15N-{1H} NOE values are high and uniform along the S6Δβ2 backbone, with an average value of 0.81 ± 0.17. Although no significant dips are seen along the sequence, residues 20–25 at the center of helix 1 stand out by displaying higher NOE values (Fig. 3). This indicates that the central part of helix 1 is particularly rigid. The rotational correlation time (τc) of the S6Δβ2 molecule was determined according to the approximation (26)

graphic file with name zbc00412-9355-m02.jpg

where ωN is the Larmor frequency of nitrogen, and R1 and R2 are the longitudinal and transversal relaxation rates, respectively. This approximation yields good precision when the R1 and R2 values are homogeneous along the sequence, i.e. when conformational exchange and local dynamics are small overall. The average values of R1 and R2 were 1.8 ± 0.15 and 8.3 ± 1.4 s−1, respectively (Fig. 3). By omitting some extreme ratios of R2/R1, τc was estimated to 5.1 ± 0.4 ns (Equation 2), which corresponds to a hydrodynamic radius of 17.8 ± 0.4 Å using Stokes' law. The hydrodynamic radius expected for a globular protein of the size of S6Δβ2 is 17.4 Å. This close agreement shows that S6Δβ2 does not associate into dimers or higher-order species but remains monomeric at the high protein concentration used for the NMR experiments. The aggregation gatekeepers (23) designed into the edge-exposed β3 in S6Δβ2 seem thus to work according to prediction.

FIGURE 3.

FIGURE 3.

NMR HSQC spectrum and backbone dynamics of S6Δβ2 show folded, soluble protein. A, reference HSQC spectrum of P54–55. B, the HSQC spectrum of S6Δβ2 has sharp and well dispersed cross-peaks similar to those of P54–55, consistent with a folded, monomeric protein. C, relaxation parameters R1, R2, and NOE of S6Δβ2 indicate overall low dynamic motions of the S6Δβ2 backbone as expected for a globally ordered structure.

X-ray Structure

S6Δβ2 crystallizes in space group P212121 with one molecule in the asymmetric unit. The exceptionally high resolution of the diffraction data (0.96 Å) and low mosaicity of the crystals are consistent with a compact molecule with little flexibility (Fig. 4). The structure consists of a three-stranded β-sheet with two α-helices on the same side. The radius of gyration including hydration shell, calculated using CRYSOL (27), is 16.6 Å, in good agreement with the NMR-derived value of 17.4 Å. The S6Δβ2 structure is essentially identical to the in silico theoretical model for P54–55 except for the deletion of strand β2 and the following loop. It is also very similar to that of S6wt. Because this is the first reported crystal structure of a circular permutant of S6, this is an important confirmation that the permutation has caused little perturbation of the overall structure. No extended β-sheets are formed between molecules in the crystal. This is in contrast to another artificial variant, S6Alz, in which hydrophobic residues were engineered into β2, leading to a β-zippered tetramer in the crystal (23). The lack of oligomerization of S6Δβ2 follows from the introduction of edge-strand gatekeepers in β3 and is consistent with NMR data.

FIGURE 4.

FIGURE 4.

Structure of β2-truncated protein S6Δβ2. The crystal structure of S6Δβ2 has a resolution of 0.96Å (beige) and is overlaid with the in silico structure (designed from PDB entry 1RIS (9)) used as basis for the P54–55 design (white). This is used in the absence of a crystal structure for P54–55. Data show that the protein also folds into a native-like structure without the edge strand β2. The most notable effect of β2 truncation is that the new N-terminal edge strand in S6Δβ2 (β3) is slightly shorter than in the S6wt structure; the new N terminus folds away from β1. Also, there is a slight alteration in the position of α1. The residues that change most in conformation on the surface of the β-sheet as a result of β2 removal, namely Trp-9(62) and Asn-49(7), are highlighted.

Structural perturbations caused by the removal of β2 are minimal and are mostly limited to rearrangements of side chains in β3 (Fig. 4). The latter strand relaxes to new positions due to the removal of packing constraints from side chains in β2. For example, Trp-9(62) repacks toward the central strand β1, facilitated by a rearrangement of Asn-49(7) (Fig. 4). Removal of β2 also causes a small rigid body shift of helix α1 (residues 58–76) relative to the rest of the structure. Shifts in Cα positions in this helix between S6wt and S6Δβ2 range from 1.2 to 2.6 Å when the rest of the structure is superimposed. This indicates that β2 has a role in the exact positioning of α1 in the wild-type protein. An important interaction in this positioning could be the salt bridge between Lys-65(23) and wild-type position 42 in β2 (supplemental Fig. S2). In contrast, α2 (residues 15–28) is not shifted upon β2 removal, except for some rearrangement at its N terminus. This rearrangement seems due to interactions with the N- and C-terminal linker (residues 41–44) introduced in the circular permutation.

Folding Kinetics

S6wt displays a v-shaped chevron plot with approximately linear GdmCl dependence of the logarithmized refolding and unfolding rate constants (logkf and logku) (Fig. 5), characteristic for cooperative, two-state folding behavior according to Scheme 1

graphic file with name zbc00412-9355-m03.jpg

where D, ‡, and N are the denatured ensemble, the transition state and the native state, respectively. The protein stability for such a system is defined by

graphic file with name zbc00412-9355-m04.jpg

where KD−N = [D]/[N] = ku/kf is the first-order equilibrium constant for folding, and

graphic file with name zbc00412-9355-m05.jpg

where mD−N = mumf are the m-values. Because mf and mu are measures of changes in the solvent-exposed surface area for the transitions D → ‡ and N → ‡ (Scheme 1), they are commonly used as experimental reaction coordinates for folding (2830). For S6wt, the transition midpoint is at [GdmCl] = 3.4 m, as measured by the intersect between logku and logkf. At [GdmCl] = 0 m, the stability of S6wt is estimated to ΔGD−NH2O = 8.5 kcal/mol (Equation 3, Table 2). For the circular permutant P54–55, the transition midpoint and protein stability increase to [GdmCl] = 4.1 m and ΔGD−NH2O = 9.1 kcal/mol, respectively (Table 2). The major part of this stabilization is likely due to the linkage of the N and C termini as mutations/incisions in the dynamic loop between β2 and β3 have generally small effects on protein stability (10). Coupled to the stability gain, P54–55 displays an increased refolding rate constant, which has previously been suggested to arise from a stabilization of the σ2 pathway (10). Finally, the removal of β2 from P54–55 to obtain S6Δβ2 results in a pronounced decrease of the transition midpoint; the stability decreases by 5.1 kcal/mol to ΔGD−NH2O = 4.0 kcal/mol (Table 2). Most of this stability loss is due to an increased unfolding rate constant (Fig. 5), yielding an overall φ-value for β2 removal of approximately zero (Equation 5)

graphic file with name zbc00412-9355-m06.jpg

where Δlogkf and Δlogku are the changes in rate constants upon β2 removal (31). The result shows that β2 has only marginal impact on the folding transition state ensemble and selectively stabilizes the folded state (10). This supports the conclusion that β2 is outside the cooperative core of the S6 structure. Moreover, S6Δβ2 displays a pronounced decrease of mu (Eq. A), indicating that its folded state buries less surface area than the parent protein P54–55 (Fig. 5 and Table 2). Such a decrease in buried surface area is expected as the truncated S6Δβ2 is effectively a smaller protein than P54–55 and wild-type S6.

FIGURE 5.

FIGURE 5.

Chevron plots of S6wt (black), P54–55 (blue), and β2-truncated variant S6Δβ2 (red), showing logkobs = log(ku + kf) versus [GdmCl]. Rate constants are in units of s−1. The overall wild-type-like chevron plot of S6Δβ2 shows that β2 truncation has little effect on the S6 folding transition, which remains classically two-state. Parameters from fitting of Equation 1 are shown in Table 2.

TABLE 2.

Kinetic data derived from the chevron plots in Figure 5

logkfH2Oa mfa loguH2O mua MPb mD−Nc logKD−Nd ΔGD−Ne
m1 m1 m m1 kcal/mol
S6wt 2.64 ± 0.08 −1.26 ± 0.03 −3.63 ± 0.14 0.57 ± 0.03 3.42 ± 0.04 1.83 ± 0.06 6.27 ± 0.22 8.53 ± 0.30
P54–55 3.30 ± 0.06 −0.95 ± 0.01 −3.40 ± 0.10 0.69 ± 0.02 4.09 ± 0.03 1.64 ± 0.03 6.70 ± 0.16 9.11 ± 0.22
S6Δβ2 2.97 ± 0.03 −0.99 ± 0.03 0.01 ± 0.06 0.38 ± 0.02 2.17 ± 0.04 1.37 ± 0.05 2.97 ± 0.09 4.03 ± 0.12

a Derived from chevron data according to Eq. 1.

b Transition midpoint derived from the intersect between logkf and logku.

c mD−N = mumf according to Eq. 4.

d logKD−N = logKD−NH2O + mD−N[GdmCl] = logku + mu[GdmCl] − logkumu[GdmCl] according to Eq. 4.

e ΔGD−N = −2.3RT logKD−N = −2.3RT(logku − logkf) according to Eq. 3.

DISCUSSION

Foldons as Property-based Level in Structural Hierarchy

Comparison of structurally divergent protein domains indicates that structural evolution to some extent occurs progressively by insertions, deletions, and substitutions of smaller structural units (32). Mutation or insertion of additional residues in a flexible part of the protein can cause it to condense into an ordered secondary structure element that adds to the mother domain. When the mother domain is a well optimized two-state protein (1), the structural integrity of this newly acquired secondary structure element will not be critical for global stability or folding; it can unfold locally or be replaced with a suitable disordered sequence without rupturing the scaffold to which it docks. In sharp contrast, the corresponding alteration of a secondary structure element within the cooperative unit of the mother protein is expected to cause global unfolding because the stabilities of two-state domains are generally very sensitive to core substitutions (1, 31). To distinguish between such property-based differences within protein structures, we have employed an additional level of structural hierarchy based on foldons (7, 8, 10, 33) (Fig. 1): a minimal cooperative unit that, in principle, would be able to fold cooperatively on its own (cf. Ref. 34). This does not always mean that the foldon would be thermodynamically stable on its own, or even contiguous in sequence,3 but its folding transition, albeit to a high-energy state, would proceed over a free-energy barrier (1). The structure of S6 seems to be based on two partly overlapping foldons, σ1 and σ2, identified from the spatial boundaries of two competing folding nuclei (10) (Fig. 1). A notable feature of this arrangement is that β2, regardless of how the protein is rewired through circular permutation, falls outside the boundaries of the two folding nuclei (10); despite β2 being part of the S6 sheet, it seems not to be part of the cooperative core. The ability to remove β2 without compromising folding or stability lends further support to the notion that the properties of the various parts of a protein can be predicted from the spatial distribution of foldons. In the simplest case, the foldons are then expected to outline the inflexible parts of the protein machinery. However, there are also instances where the actual unfolding/folding transitions of foldons are used to control functional rearrangements. For example, reversible unfolding/folding of foldons and domains can lend elasticity to arrayed repeat proteins (35), and the “cracking” of strategically placed foldons can relive local stress and thereby lower the barriers for specific global motions (36, 37).

Foldon Expansion and Reduction

The solution structures and backbone dynamics of S6wt and P54–55 have earlier shown that β2 displays dynamic motions in the region connecting to the seemingly flexible β2-β3 loop (25). Together with the autonomous stability of S6Δβ2 (Fig. 5 and Table 2), this indicates that β2 is free to unfold locally in the folded ground state of the wild-type protein. Even so, it is reasonable to believe that S6 could evolve to integrate β2 with the cooperative core, e.g. to better resist aggregation (2) or proteolytic digestion (38, 39) or to allow further extension of the mother domain. Evidence that such changes of the foldon architecture indeed occur is provided by the S6-like protein muscle acylphosphatase (PDB code 1APS (40)), which comprises a C-terminal extension that folds up against β2 to form a five-stranded sheet (Fig. 6). Kinetic analysis suggests that the fifth strand is also integrated in the folding nucleus of acylphosphatase together with β2 and α1 (41) (Fig. 6). It is not yet established, however, whether this expansion of the cooperative unit leads to the introduction of a third foldon, and thus, three competing pathways in the folding energy landscape, cf. Ref. 8. Conversely, it should be possible to reduce further the S6 structure to a single foldon comprising just two strands and a helix. This minimal folding unit would then correspond to the smallest observed proteins that are able to fold into globular structures without co-factors (8). Indications that such a reduction could indeed occur are provided by ground state rupture of β4 in the pre-equilibrium of the unfolding reaction at high [GdmCl] (42). As the remaining structure still undergoes cooperative unfolding, it is expected to persist as an independent cooperative unit even in the absence of GdmCl. Consistently, the structures of α2 and β4 show relatively low protection factors as measured by hydrogen-exchange NMR (9); in contrast to the σ1 foldon (Fig. 1), which exchanges by global unfolding (EX1), α2 and β4 exchange by local motions in the native free-energy basin (EX2), suggesting that these secondary structure elements can be removed with maintained folding barrier. Although S6 can be expressed and purified with extensive side-chain truncations in the α2 and β4 region,4 we have not yet been able to isolate the σ1 foldon as an independent domain. The isolated σ1 requires further optimization of core packing and solvent interface to populate under equilibrium conditions. An example of such an optimization is provided by the sequence-divergent domain of flavoprotein dodecin from Halobacterium salinarum (PDB 2V18) (43), which displays a topology very similar to S6 as it would appear without α2 and β4 (Fig. 6).

FIGURE 6.

FIGURE 6.

Expansion/reduction of foldons and role of β2 in assembly of S6-like proteins. A, structure of S6wt. B, the C-terminal extension of acylphosphatase forms an additional fifth strand that extends the sheet by docking to β2. The high φ-value of the mutation F94L in β5 (41) suggests that this sheet extension also involves an expansion/shift of the foldons of the cooperative core. C, conversely, dodecin from H. salinarum (43) seem to represent an example of foldon reduction because the structure of this protein matches S6wt without α2 and β4. D, upon truncation of edge-strand gatekeepers in β2 (S6Alz), S6 associates by intermolecular β2 contacts (23). The figure shows the dimeric unit of the S6Alz tetramer. E, similar association patterns are observed for the S6-like monomers of monooxygenase from S. coelicolor (PDB code 1LQ9) (44) and several other viral analogues, implying that β2 plays a key role in the preferred mode of dimerization. F, in the ribosomal complex (PDB code 1G1X), β2 of S6wt provides anchoring point for the loop that coordinates S18.

Possible Roles of β2

The question then arises: if not required for folding (Fig. 5) or structural integrity (Figs. 3 and 4), what is the role of β2? One possibility is that β2 originates from a loop between α1 and β3, which serves to co-localize the N and C termini in the superfamily of ferredoxin-like folds. An advantage of such co-localization, which is common among globular proteins, could be that it facilitates structural evolution by gene shuffling; the fold can be freely evolved, inserted, or deleted from loops of other domains. A more straightforward explanation is, however, that β2 is maintained for functional reasons. In the S6 structure, β2 provides an anchor for the loop that wraps over S18 in the ribosomal assembly (Fig. 6). Furthermore, comparison with other ferredoxin-like structures shows that β2 is the preferred interface for dimerization (see e.g. ferredoxin-like folds at the SCOP: Structural Classification of Proteins database). A representative example is monooxygenase from Streptomyces coelicolor (PDB 1LQ9 and several others) (44) where β2 joins two monomers by forming an anti-parallel sheet (Fig. 6). Interestingly, S6 can be made to assemble in an analogous fashion by truncation of charged gatekeeper side chains in β2 (23) (Fig. 6). In the wild-type protein, these side chains are employed as aggregation gatekeepers that prevent promiscuous edge-to-edge interactions, and upon their removal, S6 assembles into ordered homotetramers (23) (Fig. 6). It is conceivable that the position of β2 outside the cooperative unit of the protein allows such quaternary assemblies to evolve relatively freely, with only minor penalties for the structural properties of the constituent monomers. A basic, two-component, recipe for designing proteins would then be to assemble in an overlapping fashion suitable foldons to shape the cooperative scaffold and then decorate this unit with mutationally tolerant loop and border segments for function and binding.

Supplementary Material

Supplemental Data
*

This work was supported by grants from the Swedish Research Council (VR 2009–5580 (to M. O.) and VR 2006–4387) (to D. T. L.), the Knut and Alice Wallenberg Foundation, the Bertil Hållsten Foundation, and Hjärnfonden.

Inline graphic

This article contains supplemental Figs. S1 and S2.

The atomic coordinates and structure factors (code 3ZZP) have been deposited in the Protein Data Bank, Research Collaboratory for Structural Bioinformatics, Rutgers University, New Brunswick, NJ (http://www.rcsb.org/).

3

As an extension of the original definition by Panchenko et al. 34, the foldon is here not restricted to be contiguous in sequence, but solely determined by three-dimensional packing. Studies of circular permutations show that the sequence connectivity of the foldon structure has little effect on its folding behavior (10).

4

E. Haglund, J. Danielsson, S. Kadhirvel, M. O. Lindberg, D. T. Logan, and M. Oliveberg, unpublished results.

2
The abbreviation used is:
HSQC
heteronuclear single quantum correlation.

REFERENCES

  • 1. Oliveberg M., Wolynes P. G. (2005) The experimental survey of protein folding energy landscapes. Q. Rev. Biophys. 38, 245–288 [DOI] [PubMed] [Google Scholar]
  • 2. Jahn T. R., Radford S. E. (2008) Folding versus aggregation: polypeptide conformations on competing pathways. Arch. Biochem. Biophys. 469, 100–117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Shaw B. F., Valentine J. S. (2007) How do ALS-associated mutations in superoxide dismutase 1 promote aggregation of the protein? Trends Biochem. Sci. 32, 78–85 [DOI] [PubMed] [Google Scholar]
  • 4. Byström R., Andersen P. M., Gröbner G., Oliveberg M. (2010) SOD1 mutations targeting surface hydrogen bonds promote amyotrophic lateral sclerosis without reducing apo-state stability. J. Biol. Chem. 285, 19544–19552 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Dobson C. M., Karplus M. (1999) The fundamentals of protein folding: bringing together theory and experiment. Curr. Opin Struct. Biol. 9, 92–101 [DOI] [PubMed] [Google Scholar]
  • 6. Lindberg M., Tångrot J., Oliveberg M. (2002) Complete change of the protein folding transition state upon circular permutation. Nat. Struct. Biol. 9, 818–822 [DOI] [PubMed] [Google Scholar]
  • 7. Lindberg M. O., Haglund E., Hubner I. A., Shakhnovich E. I., Oliveberg M. (2006) Identification of the minimal protein folding nucleus through loop-entropy perturbations. Proc. Natl. Acad. Sci. U.S.A. 103, 4083–4088 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Lindberg M. O., Oliveberg M. (2007) Malleability of protein folding pathways: a simple reason for complex behavior. Curr. Opin Struct. Biol. 17, 21–29 [DOI] [PubMed] [Google Scholar]
  • 9. Haglund E., Lind J., Oman T., Ohman A., Mäler L., Oliveberg M. (2009) The HD-exchange motions of ribosomal protein S6 are insensitive to reversal of the protein folding pathway. Proc. Natl. Acad. Sci. U.S.A. 106, 21619–21624 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Haglund E., Lindberg M. O., Oliveberg M. (2008) Changes of protein folding pathways by circular permutation. Overlapping nuclei promote global cooperativity. J. Biol. Chem. 283, 27904–27915 [DOI] [PubMed] [Google Scholar]
  • 11. Otzen D. E., Kristensen O., Proctor M., Oliveberg M. (1999) Structural changes in the transition state of protein folding: alternative interpretations of curved chevron plots. Biochemistry 38, 6499–6511 [DOI] [PubMed] [Google Scholar]
  • 12. Delaglio F., Grzesiek S., Vuister G. W., Zhu G., Pfeifer J., Bax A. (1995) NMRPipe: a multidimensional spectral processing system based on UNIX pipes. J. Biomol. NMR 6, 277–293 [DOI] [PubMed] [Google Scholar]
  • 13. Kabsch W. (2010) XDS. Acta Crystallogr. D. Biol. Crystallogr. 66, 125–132 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. McCoy A. J., Grosse-Kunstleve R. W., Adams P. D., Winn M. D., Storoni L. C., Read R. J. (2007) Phaser crystallographic software. J. Appl. Crystallogr. 40, 658–674 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Lindahl M., Svensson L. A., Liljas A., Sedelnikova S. E., Eliseikina I. A., Fomenkova N. P., Nevskaya N., Nikonov S. V., Garber M. B., Muranova T. A. (1994) Crystal structure of the ribosomal protein S6 from Thermus thermophilus. EMBO J. 13, 1249–1254 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Lamzin V. S., Wilson K. S. (1993) Automated refinement of protein models. Acta Crystallogr. D. Biol. Crystallogr. 49, 129–147 [DOI] [PubMed] [Google Scholar]
  • 17. Emsley P., Lohkamp B., Scott W. G., Cowtan K. (2010) Features and development of Coot. Acta Crystallogr. D. Biol. Crystallogr. 66, 486–501 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Adams P. D., Afonine P. V., Bunkóczi G., Chen V. B., Davis I. W., Echols N., Headd J. J., Hung L. W., Kapral G. J., Grosse-Kunstleve R. W., McCoy A. J., Moriarty N. W., Oeffner R., Read R. J., Richardson D. C., Richardson J. S., Terwilliger T. C., Zwart P. H. (2010) PHENIX: a comprehensive Python-based system for macromolecular structure solution. Acta Crystallogr. D. Biol. Crystallogr. 66, 213–221 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Sheldrick G. M. (2008) A short history of SHELX. Acta Crystallogr. A. 64, 112–122 [DOI] [PubMed] [Google Scholar]
  • 20. Chen V. B., Arendall W. B., 3rd, Headd J. J., Keedy D. A., Immormino R. M., Kapral G. J., Murray L. W., Richardson J. S., Richardson D. C. (2010) MolProbity: all-atom structure validation for macromolecular crystallography. Acta Crystallogr. D. Biol. Crystallogr. 66, 12–21 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Krissinel E., Henrick K. (2004) Secondary structure matching (SSM), a new tool for fast protein structure alignment in three dimensions. Acta Crystallogr. D. Biol. Crystallogr. 60, 2256–2268 [DOI] [PubMed] [Google Scholar]
  • 22. DeLano W. L. (2010) The PyMOL Molecular Graphics System, version 1.3r1, Schrödinger, LLC, New York [Google Scholar]
  • 23. Otzen D. E., Kristensen O., Oliveberg M. (2000) Designed protein tetramer zipped together with a hydrophobic Alzheimer homology: a structural clue to amyloid assembly. Proc. Natl. Acad. Sci. U.S.A. 97, 9907–9912 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Baber J. L., Libutti D., Levens D., Tjandra N. (1999) High precision solution structure of the C-terminal KH domain of heterogeneous nuclear ribonucleoprotein K, a c-myc transcription factor. J. Mol. Biol. 289, 949–962 [DOI] [PubMed] [Google Scholar]
  • 25. Ohman A., Oman T., Oliveberg M. (2010) Solution structures and backbone dynamics of the ribosomal protein S6 and its permutant P54–55. Protein Sci. 19, 183–189 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Fushman D., Ohlenschläger O., Rüterjans H. (1994) Determination of the backbone mobility of ribonuclease T1 and its 2′-GMP complex using molecular dynamics simulations and NMR relaxation data. J. Biomol. Struct. Dyn. 11, 1377–1402 [DOI] [PubMed] [Google Scholar]
  • 27. Svergun D. I., Barberato C., Koch M. H. (1995) CRYSOL: a program to evaluate x-ray solution scattering of biological macromolecules from atomic coordinates. J. Appl. Crystallogr. 28, 768–773 [Google Scholar]
  • 28. Tanford C. (1970) Protein denaturation. C. Theoretical models for the mechanism of denaturation. Adv. Protein Chem. 24, 1–95 [PubMed] [Google Scholar]
  • 29. Jackson S. E., Fersht A. R. (1991) Folding of chymotrypsin inhibitor 2. 1. Evidence for a two-state transition. Biochemistry 30, 10428–10435 [DOI] [PubMed] [Google Scholar]
  • 30. Ternström T., Mayor U., Akke M., Oliveberg M. (1999) From snapshot to movie: ϕ analysis of protein folding transition states taken one step further. Proc. Natl. Acad. Sci. U.S.A. 96, 14854–14859 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31. Fersht A. R. (1999) Structure and Mechanism in Protein Science: A Guide to Enzyme Catalysis and Protein Folding, W. H. Freeman and Co., New York [Google Scholar]
  • 32. Grishin N. V. (2001) Fold change in evolution of protein structures. J. Struct. Biol. 134, 167–185 [DOI] [PubMed] [Google Scholar]
  • 33. Olofsson M., Hansson S., Hedberg L., Logan D. T., Oliveberg M. (2007) Folding of S6 structures with divergent amino acid composition: pathway flexibility within partly overlapping foldons. J. Mol. Biol. 365, 237–248 [DOI] [PubMed] [Google Scholar]
  • 34. Panchenko A. R., Luthey-Schulten Z., Wolynes P. G. (1996) Foldons, protein structural modules, and exons. Proc. Natl. Acad. Sci. U.S.A. 93, 2008–2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Williams P. M., Fowler S. B., Best R. B., Toca-Herrera J. L., Scott K. A., Steward A., Clarke J. (2003) Hidden complexity in the mechanical properties of titin. Nature 422, 446–449 [DOI] [PubMed] [Google Scholar]
  • 36. Miyashita O., Onuchic J. N., Wolynes P. G. (2003) Nonlinear elasticity, proteinquakes, and the energy landscapes of functional transitions in proteins. Proc. Natl. Acad. Sci. U.S.A. 100, 12570–12575 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Whitford P. C., Onuchic J. N., Wolynes P. G. (2008) Energy landscape along an enzymatic reaction trajectory: hinges or cracks? HFSP J. 2, 61–64 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Park C., Marqusee S. (2004) Probing the high energy states in proteins by proteolysis. J. Mol. Biol. 343, 1467–1476 [DOI] [PubMed] [Google Scholar]
  • 39. Park C., Marqusee S. (2005) Pulse proteolysis: a simple method for quantitative determination of protein stability and ligand binding. Nat. Methods 2, 207–212 [DOI] [PubMed] [Google Scholar]
  • 40. Pastore A., Saudek V., Ramponi G., Williams R. J. (1992) Three-dimensional structure of acylphosphatase. Refinement and structure analysis. J. Mol. Biol. 224, 427–440 [DOI] [PubMed] [Google Scholar]
  • 41. Chiti F., Taddei N., White P. M., Bucciantini M., Magherini F., Stefani M., Dobson C. M. (1999) Mutational analysis of acylphosphatase suggests the importance of topology and contact order in protein folding. Nat. Struct. Biol. 6, 1005–1009 [DOI] [PubMed] [Google Scholar]
  • 42. Otzen D. E., Oliveberg M. (2002) J. Mol. Biol. 317, 613–627 [DOI] [PubMed] [Google Scholar]
  • 43. Bieger B., Essen L. O., Oesterhelt D. (2003) Crystal structure of halophilic dodecin: a novel, dodecameric flavin binding protein from Halobacterium salinarum. Structure 11, 375–385 [DOI] [PubMed] [Google Scholar]
  • 44. Sciara G., Kendrew S. G., Miele A. E., Marsh N. G., Federici L., Malatesta F., Schimperna G., Savino C., Vallone B. (2003) The structure of ActVA-Orf6, a novel type of monooxygenase involved in actinorhodin biosynthesis. EMBO J. 22, 205–215 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Goddard T. D., Kneller D. G. (2007) SPARKY 3, University of California, San Francisco [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplemental Data

Articles from The Journal of Biological Chemistry are provided here courtesy of American Society for Biochemistry and Molecular Biology

RESOURCES