Skip to main content
Protein Science : A Publication of the Protein Society logoLink to Protein Science : A Publication of the Protein Society
. 2009 Jun 29;18(8):1806–1813. doi: 10.1002/pro.172

Sequence determinants of thermodynamic stability in a WW domain—An all-β-sheet protein

Marcus Jäger 1,2, Maria Dendle 1,2, Jeffery W Kelly 1,2,*
PMCID: PMC2776968  PMID: 19565466

Abstract

The stabilities of 66 sequence variants of the human Pin1 WW domain have been determined by equilibrium thermal denaturation experiments. All 34 residues composing the hPin1 WW three-stranded β-sheet structure could be replaced one at a time with at least one different natural or non-natural amino acid residue without leading to an unfolded protein. Alanine substitutions at only four positions within the hPin1 WW domain lead to a partially or completely unfolded protein—in the absence of a physiological ligand. The side chains of these four residues form a conserved, partially solvent-inaccessible, continuous hydrophobic minicore comprising the N- and C-termini. Ala mutations at five other residues, three of which constitute the ligand binding patch on the concave side of the β-sheet, significantly destabilize the hPin1 WW domain without leading to an unfolded protein. The remaining mutations affect protein stability only slightly, suggesting that only a small subset of side chain interactions within the hPin1 WW domain are mandatory for acquiring and maintaining a stable, cooperatively folded β-sheet structure.

Keywords: WW domain, Ala scanning, Gly scanning, side chain mutagenesis, protein stability, protein folding, beta-sheet

Introduction

While it is well established that the sequences of small soluble proteins often contain all the chemical information required for their folding,1 it is not possible to identify energetically important H-bonds or side-chain-side-chain interactions simply by inspecting high resolution structures. Instead scanning backbone and side chain mutagenesis has proven to be a very effective means of identifying energetically important backbone H-bonds and side chain interactions (including hydrophobic and electrostatic interactions), respectively.29 Herein we seek to identify the side chains that are critical for acquiring and maintaining the β-sheet structure of the N-terminally truncated WW domain (residues 6–39) from the human cell cycle regulatory protein Pin110 using Ala scanning mutagenesis and related methodology. Complete Ala scans were previously reported on small proteins that are either entirely α-helical11,12 or were composed of both α-helical and β-sheet structural elements.13 Herein, we describe a similar study for the all-β sheet structure of the hPin1 WW domain (hereafter referred to as hPin1 WW), which is an ideal system to utilize traditional side chain mutagenesis to perturb its all-β-sheet structure to discern which side-chain-side-chain interactions are energetically important.6,7,14,15

The hPin1 WW is well suited for perturbation thermodynamic measurements, because it folds rapidly and spontaneously by a highly cooperative two-state mechanism7,1618 and generally affords mutants that are highly soluble and lack a proclivity for aggregation7,19,20 - distinguishing this domain from most de novo designed and natural three-stranded β-sheet proteins.2124 Moreover, the small size of hPin1 WW makes it amenable to recombinant biological synthesis as well as chemical synthesis25,26; the latter has been used to accomplish a comprehensive amide-to-ester scan to identify energetically important backbone-backbone H-bonds.6,8,27,28 In addition, the robust structure and diverse amino acid composition of hPin1 WW enables a variety of biophysical methods to be employed to rigorously study its structure-folding relationship, including NMR, fluorescence and circular dichroism.7,16,19,29 Because hPin1 WW binds peptides containing a phosphorylated serine or phosphorylated threonine residue followed by a single proline residue,10,30 thus enabling phosphorylation-dependent protein-protein interactions, the relationship between folding, structure, function, and evolutionary constraints can also be explored.20,3133

Design of variants

Alanine (Ala) mutations were constructed for 32 of the 33 non-Ala residues of hPin1 WW (residue 31 is Ala in wild type hPin1 WW). Only the C-terminal residue, Gly39, which is completely solvent exposed, was not mutated. Ala-mutagenesis eliminates the side-chain beyond the β-carbon, generally without significantly changing the φ, Ψ backbone dihedral angle preferences, thus allowing an unbiased and quantitative evaluation of the influence of each side chain to the stability of the protein. This perturbation thermodynamic analysis identifies the residues whose side chains are critical for acquiring and maintaining a three-stranded β-sheet structure, as mutation of those to Ala significantly destabilizes the structure.14,15 We also conducted a partial Gly-scan within the two solvent-exposed reverse turns, that is, in the loop 1 (Ser16-Arg21) and loop 2 (His27-Asn30) substructures that connect β-strands 1 and 2 and β-strands 2 and 3, respectively. Gly-substitutions are generally expected to be more destabilizing than Ala-mutations, because of the larger conformational freedom of Gly in the unfolded state due to its expanded permitted φ, Ψ backbone dihedral angles.

For the Ala substitutions that resulted in unfolded or substantially destabilized hPin1 WW variants, specific point mutations were prepared at those positions to learn more about the requirements for acquiring and maintaining a stabilized β-sheet structure. For example, Tyr23 was mutated to Phe and Leu, in addition to the destabilizing Ala substitution. Asn26 was mutated to the isosteric but charged residue Asp, and Pro37 was mutated to the unnatural amino acid 3,4-dehydroproline (Δ3,4P).

Characterization of variants

Equilibrium thermal denaturation curves of hPin1 WW variants were recorded utilizing far-UV CD spectroscopy. Folded wild type hPin1 WW, like other members of the WW domain family, exhibits a characteristic positive ellipticity around 226 nm [Fig. 1(A), solid black spectrum], which disappears upon thermal [Fig. 1(A), dashed black spectrum] or chaotrope denaturation and thus serves as a folding probe.7,16,18,3436 Only four (Trp11Ala, Tyr24Ala, Asn26Ala, Pro37Ala) out of the 32 Ala-mutants investigated in this study appeared to be completely [Fig. 1(A), Trp11Ala, solid green spectrum] or partially unfolded [Fig. 1(A), Pro37Ala, solid red spectrum] at physiological temperatures, as indicated by a significantly reduced positive ellipticity in the far-UV CD spectrum at 226 nm. Thermal unfolding curves of these variants were either non-cooperative {Trp11Ala [Fig. 1(B), green symbols], Tyr24Ala, Asn26Ala} or the mutant was so destabilized that the pretransition baseline was not detectable [Pro37Ala; Fig. 1(B), red symbols], even at low temperature (2°C).

Figure 1.

Figure 1

Thermodynamic analysis of wild type hPin1 WW and variants thereof. (A) Monitoring thermal denaturation of hPin1 WW by far-UV CD spectroscopy: Spectra of folded wild type hPin1 WW domain (2°C, solid black line), denatured wild type hPin1 WW domain (98°C, dashed black line), intrinsically unstructured Trp11Ala hPin1 WW domain (2°C, solid green line) and partially folded Pro37Ala hPin1 WW domain (2°C, solid red line) are shown. (B) Raw thermal denaturation data from wild type hPin1 WW domain (filled black symbols), and the intrinsically unstructured Trp11Ala (filled green symbols) and the partially folded Phe37Ala (filled red symbols) hPin1 WW variants. The hPin1 WW domain concentration in (A) and (B) was ∼15 μM. Measurements were performed in 20 mM sodium phosphate, pH 7.0 using a 2 mm quartz cuvette. The solid line connecting the black symbols is a fit of the data to a two-state model [Eqs. (1) and (2) in Materials and Methods]. The solid lines connecting the red and green symbols are simply guides to the eye and have no theoretical foundation. (C) Representative normalized equilibrium thermal denaturation curves [Eq. (3), Materials and Methods] of hPin 1 WW variants. Shown are data for Y23A-, L7A-, T29A-, P8A-, N30A-, and wild type hPin1 WW domains, in increasing order of stability. (D) Plot of the folding free energy (ΔG, in kJ/mol) at 40°C versus the midpoint of unfolding (TM) for the hPin1 WW point variants listed in Table I. For clarity, the data point obtained for wildtype hPin1 WW is color coded red.

All other mutants were fully folded at 2°C and exhibited cooperative thermal denaturation curves [representative curves are shown in Fig. 1(C)]. The midpoint of unfolding (TM, defined as ΔG(TM) = 0) and folding free energies (ΔG(T)) for each mutant were determined by fitting the thermal denaturation curves to a two-state unfolding model as described previously (Table I).7 All folded mutants exhibit similar unfolding cooperativities, as indicated by the excellent correlation between the extracted free energies of folding (ΔG(T)) and the folding midpoints (TM) estimated from a two-state analysis of the thermal denaturation curves [Fig. 1(D)].

Table I.

Summary of Thermodynamic Data for Wild Type hPin1 WW and Mutants Thereof

Mutant TM (°C) ΔG1 ΔG2 ΔG (40°C) (kJ/mol)
wt 58.6 0.403 (3) 0.00272 (19) −6.55
K6A 59.4 0.400 (3) 0.00153 (25) −7.18
L7A 37.8 0.301 (2) 0.00022 (11) 0.663
L7I 49.3 0.318 (2) −0.00046 (14) −2.99
L7V 44.0 0.321 (2) 0.00041 (13) −1.27
L7NVa 48.8 0.322 (2) 0.00167 (12) −2.79
P8A 47.4 0.361 (1) 0.00293 (8) −2.51
P8G 47.7 0.365 (2) 0.00220 (8) −2.72
P9A 56.0 0.397 (2) 0.00229 (13) −5.76
P9G 53.1 0.378 (3) 0.00282 (16) −4.49
G10A 49.0 0.348 (2) 0.00151 (12) −3.00
W11A <10.0 n.d. n.d. n.d.
W11L <10.0 n.d. n.d. n.d.
W11F 35.0 0.308 (2) −0.00048 (16) 1.52
E12A 52.6 0.373 (1) 0.00104 (10) −4.53
K13A 59.6 0.385 (1) 0.00285 (8) −6.45
R14A 39.2 0.347 (1) 0.00074 (11) 0.278
M15A 51.8 0.380 (2) 0.00289 (13) −4.08
S16A 54.0 0.398 (2) 0.00313 (9) −4.70
S16G 47.6 0.391 (2) 0.00194 (22) −2.69
R17A 58.8 0.391 (2) 0.00232 (15) −6.53
R17G 57.3 0.374 (3) 0.00277 (18) −5.64
S18A 58.4 0.398 (4) 0.00185 (31) −6.69
S18G 56.5 0.440 (4) 0.00227 (35) −6.64
S19A 57.1 0.392 (2) 0.00272 (22) −5.98
S19G 56.0 0.449 (6) 0.00234 (47) −6.58
G20A 48.9 0.355 (2) 0.00270 (11) −2.94
R21A 50.9 0.369 (2) 0.00144 (15) −3.85
R21G 51.5 0.345 (2) 0.00235 (15) −3.74
V22A 54.2 0.403 (2) 0.00116 (14) −5.48
Y23A 33.9 0.328 (2) 0.00098 (18) 2.03
Y23L 45.3 0.313 (2) 0.00153 (13) −1.61
Y23F 52.8 0.376 (2) 0.376 (2) −4.39
Y24A <10.0 n.d. n.d. n.d.
Y24L <10.0 n.d. n.d. n.d.
Y24F 51.4 0.363 (2) 0.00279 (14) −3.77
Y24W 52.9 0.357 (3) 0.00230 (19) −4.22
F25A 32.5 0.316 (2) 0.00042 (15) 2.39
F25L 42.5 0.340 (3) 0.00202 (18) −0.83
F25Y 62.0 0.401 (5) 0.003 (22) −7.45
N26A <10.0 n.d. n.d. n.d.
N26L <10.0 n.d. n.d. n.d.
N26D 36.0 0.327 (2) 0.00044 (18) 1.31
H27A 57.7 0.388 (3) 0.00262 (18) −6.04
H27G 50.5 0.367 (1) 0.00130 (10) −3.71
I28A 54.2 0.379 (2) 0.00165 (15) −5.05
I28G 47.2 0.363 (3) 0.00105 (23) −2.55
T29A 44.3 0.317 (2) 0.00100 (12) −1.34
T29G 34.4 0.316 (1) −0.00007 (8) 1.76
T29S 50.8 0.373 (2) 0.00159 (13) −3.85
T29D 42.9 0.338 (1) 0.00009 (8) −0.97
N30A 53.3 0.372 (2) 0.00208 (14) −4.57
N30G 65.0 0.386 (3) 0.00142 (21) −8.76
A31G 40.9 0.359 (2) 0.00197 (13) −0.32
S32A 56.9 0.369 (2) 0.00272 (13) −5.45
S32G 50.1 0.335 (3) 0.00200 (17) −3.18
Q33A 53.1 0.332 (2) 0.00103 (12) −4.17
W34A 52.9 0.386 (2) 0.00067 (16) −4.86
W34F 58.0 0.399 (2) 0.00326 (9) −6.12
E35A 50.3 0.369 (2) 0.00283 (12) −3.50
R36A 56.7 0.357 (3) 0.00225 (19) −5.33
P37A <25.0 n.d. n.d. n.d.
P37I <25.0 n.d. n.d. n.d.
Δ3,4P37 55.1 0.372 (2) 0.00267 (14) −5.00
S38A 58.8 0.369 (2) 0.00317 (14) −6.76
S38G 58.2 0.411 (3) 0.00382 (16) −6.21

Thermodynamic parameters for wild type (wt) and single point mutants of hPin1 WW domain were determined as described in Materials and Methods. n.d., not determined (mutant too unstable or unfolded). TM = midpoint of unfolding. K6A and S38A refer to Ala mutations at the first and penultimate residue. No measurements were made at position 39. Numbers in brackets indicate 95% confidence interval values. Δ3,4P37 refers to the mutant in which Pro37 is replaced with 3,4-dehydro-Proline, see Figure 2(D) for a line diagram of Δ3,4P37.

The effect of the various Ala- [Fig. 2(A)] and Gly-substitutions [Fig. 2(B)] on the stability of hPin1 WW, calculated at near-physiological temperature (40°C), is depicted graphically. Significantly destabilized mutants whose ΔΔG exceed 4 kJ/mol, but are in the measurable range, are depicted by blue bars. Red bars indicate highly destabilized mutants where only a lower estimate of ΔΔG can be offered, as these hPin1 WW variants were unfolded and therefore accurate free energies could not be obtained. Except for the Asn30Gly variant (loop 2), which significantly exceeds the stability of the Asn30Ala analogue, the Gly variants are either less stable than the Ala-mutants, or exhibit TM's within a couple of degrees of the corresponding Ala variants, reflecting similar stabilities. The higher stability of the N30G mutant is consistent with a preference for Gly at this position among WW domain family members (http://smart.embl-heidelberg.de/smart/do_annotation.pl?DOMAIN = SM00456), suggesting that the gain in stability results from an optimization of the local ϕ/ψ-backbone dihedral angles and improved loop formation predisposition. On average, the six-residue loop 1 substructure (residues Ser16-Arg21) is much more permissive of substitutions by Gly than the four-residue reverse turn 2 (residues His27-Asn30), consistent with loop 1's increased conformational mobility relative to reverse turn 2,7,19,31,37 and therefore, it is able to accommodate destabilizing Gly mutations without significantly compromising hPin1 WW folding or stability. Conformational disorder in the loop 1 substructure of folded hPin1 WW has been shown directly by NMR backbone relaxation studies38 and seems to be important for the biological activity of hPin1 WW.31,38

Figure 2.

Figure 2

Quantitative analysis of the complete Ala- and partial Gly-scans of the hPin WW domain. (A) Effects of the Ala mutations on the equilibrium stability of hPin1 WW. A positive value indicates that the Ala mutation is destabilizing relative to the wild type hPin1 WW domain. Mutations that are destabilizing by more than 4 kJ/mol are labeled (single letter code). Mutations of residues Leu7, Arg14, Tyr23, Phe25, and Thr29 result in destabilized variants that are only fully folded at low temperature (2°C) (blue bars). Accurate free energies cannot be reported for Ala-mutations at residues Trp11, Tyr24, Asn26, and Pro37, as the corresponding Ala-mutations resulted in partially or fully denatured WW domains, even at 2°C. The energies reported therefore are lower estimates (red bars). Residues that contribute to β-strands 1–3 are indicated. (B) Effects of Gly substitutions at 13 positions on the equilibrium stability of hPin1 WW. A positive value indicates that the Gly mutation is destabilizing relative to the wild type protein. Mutations that are destabilizing by more than 2 kJ/mol, as well as mutations that are more stable than wild type, are labeled (single letter code). Mutations of residues Thr29 and Ala31 result in destabilized variants that are only fully folded at low temperature (2°C) (blue bars). Residues that contribute to the loop 1 and loop 2 substructures are indicated. (C) Structural depiction of the hPin1 WW domain. The side chains of residues Trp11, Tyr24, and Pro37 that constitute the hydrophobic stability minicore are shown explicitly in ball-and-stick mode and are color-coded red. (Leu7, colored blue, bottom right, also is part of this stability minicore.) The side chains of residues Arg14, Tyr23, and Phe25 that contribute to the second hydrophobic core involved in ligand binding are shown explicitly in ball-and-stick mode and are color-coded blue (top left). (D) Structure of 3,4-dehydroproline (Δ3,4P).

Considering the data from the Ala-scan as a whole, the mutations can be classified into 3 groups based on their effect on the folding thermodynamics: (i) mutations that lead to unfolded or partially folded WW domains at all temperatures [Fig. 2(A), red bars], (ii) mutations that result in variants that are substantially unfolded (>40%) at physiological temperature, but fully folded at low temperature (2°C) [Fig. 2(A), blue bars], and (iii) mutations that affect hPin1 WW stability only slightly (gray bars).

Ala-mutants that remain unfolded at 2°C

Of the four Ala variants that are completely denatured at physiological temperature and are substantially denatured at low temperature (2°C) [Trp11Ala, Tyr24Ala, Asn26Ala, and Pro37Ala; Fig. 2(A), red bars], three of these side chains (Trp11, Tyr24, and Pro37) form a hydrophobic mini core that is highly conserved in all WW domain family members [Fig. 2(C), labeled red residues]. These residues are not directly involved in peptide ligand binding.39 No evidence for aggregation of these unfolded mutants is apparent, even upon long-term storage at 4°C, consistent with earlier work.40 Preliminary studies indicate that the aggregation resistance is directly related to the presence of the Pro-rich N- and C-terminal extensions, as their removal resulting in a Gly10-Trp34 hPin1 WW domain leads to measurable amyloidogenicity at physiological salt concentrations (M. J. and J. W. K., unpublished data). Pro37 is an invariant residue in WW domain family members that appears to be required for the integrity of the hydrophobic mini core.41 Only a very conservative, non-natural and near-isosteric substitution of the proline ring bearing a double bond, a 3,4-dehydroproline analogue (Δ3,4P), yields a cooperatively folded, yet destabilized protein [Fig. 2(D), Table I].

Trp11 and Tyr24 are highly conserved amongst WW domain family members.42,43 Mutagenesis studies demonstrate that only variants containing aromatic residues at position 11 and 24 are folded, consistent with positional frequencies calculated from >200 WW domain sequence entries in the data base.42,44 Trp is the most frequent amino acid at position 11, and is only replaced by Phe. Tyr is the most abundant residue at position 24, but in some instances it is substituted by Phe and less frequently by Trp. The equivalent of Tyr24, shown to be critical for folding of hPin1 WW in this study, is mutated to a Cys residue in the PQBP1 WW domain (a brain-enriched transcriptional regulator) causing Golabi-Ito-Hall syndrome associated with mental retardation.40

Asn26 serves both as a H-bond donor and acceptor, mediating side-chain-side-chain and side-chain-backbone H-bonds, all of which are disrupted in the Asn26Ala and Asn26Leu variants that are unfolded.10 A near-isosteric Asp residue can compensate for some (but not all) of the lost interactions exhibited by the hydrophobic side chain replacements, and is frequently found in place of Asn at position 26 (http://smart.embl-heidelberg.de/smart/do_annotation.pl?DOMAIN = SM00456), consistent with our data revealing that the Asn26Asp variant is folded. A reverse mutation (Asp30Asn) at the topologically equivalent position in the hYap WW domain36 stabilizes this domain, seemingly accounting almost completely for the loss in stability seen in hPin1 WW upon reverse mutation, suggesting similar side-chain-side-chain and side-chain-backbone H-bonds in both WW domains.

Ala-mutations at the Trp11, Tyr24, Asn26, and Pro37 equivalent positions in the related FBP28 and hYap65 WW domains also yield unfolded proteins,41,45 suggesting that this hydrophobic minicore is generally important for WW domain structural stability. We envision that the absence of Pro37 would render the hydrophobic side chains of Trp11 and Tyr24 accessible to solvent, decreasing the hydrophobic effect stabilizing the β-sheet structure of the WW domain. Increased solvent exposure of aromatic side chains and/or changes in β-sheet twist may explain why most de novo designed triple-stranded β-sheets that lack the equivalent of a delocalized Trp11-Tyr24-Asn26-Pro37 hydrophobic minicore are only modestly stable, and exhibit a high tendency to aggregate.

Ala-mutants that are folded at 2°C, but significantly destabilized at physiological temperature

Five Ala-variants (Leu7Ala, Arg14Ala, Tyr23Ala, Phe25Ala, and Thr29Ala) are unfolded to a substantial degree (>40%) at physiological temperature, but are fully folded at lower temperature (2°C) and exhibit cooperative unfolding transitions with defined pre- and post-transition baselines [Fig. 2(A), blue bars].

The side chain of Leu7 [colored blue in Fig. 2(C), bottom right] participates in the formation of the hydrophobic stability core (colored red) mentioned in the previous section [Fig. 2(C)]. The two methyl groups of Leu7 seal the otherwise partially solvent-exposed hydrophobic minicore in a snap-lid fashion, similar to Ile17 in the sequence-related hYap65 WW domain.30 The thermodynamic influence of Leu7 on hPin1 WW folding seems to be particularly sensitive to side chain modification, as evidenced by the increasing destabilization observed across the Leu7Ile-Leu7Norval-Leu7Val-Leu7Ala mutant series (Table I).

The side chain of Thr29 serves as an H-bond acceptor for the backbone amide of residue Ala31 at the beginning of β-strand 3. Our differential scanning mutagenesis at position 29 reveals that Ser, the only natural residue that can replace Thr29 without eliminating the aforementioned side-chain-backbone H-bond, exhibits a wild type like stability, whereas both Thr29Ala and Thr29Gly variants are considerably more destabilized.

The remaining three side chains in this category, Arg14, Tyr23, and Phe25, are all positioned on the concave side of the β-sheet and form a second hydrophobic minicore [Fig. 2(C), blue color, top left]. Together with residues Ser16 and Arg17 (loop 1), and Trp34 (β-strand 3), these three residues are important for ligand binding. The side chains of residues Trp34 and Arg17 are energetically most important for ligand binding, and Ala mutations at these positions lead to a 18- (Trp34) and 6-fold decrease (Arg17) in the binding constant.39 Since these side chains only contribute marginally to the thermodynamic stability of hPin1 WW, it must be concluded that residues Arg17 and Trp34 have evolved primarily for function. A slight gain in binding affinity (approximately twofold) is observed upon Ala-mutation of residues Arg14 and Phe25, but this gain of function appears to be outweighed by a strong decrease in thermodynamic stability, rendering Arg14Ala and Phe25Ala mutants substantially denatured (>50%) at physiological temperature. Arg14 and Phe25, therefore, appear to have evolved to enhance stability without compromising peptide binding, for example the F25Y variant exhibits a WT Kd.46

Ala-mutants with near wild type stability

This group contains the majority of mutant sequences (23 out of the 32 “to-Ala” substitutions) and includes variants with midpoints of denaturation and free energies of folding within 10°C or 4 kJ/mole of wild type hPin1 WW, respectively. This heterogeneous group includes all residues composing loops 1 and 2 (Ser16-Arg21, His27-Asn30), seven of the eight charged residues, and the two N-terminal Pro residues (Pro8, Pro9). The replaced side chains are either substantially solvent accessible in the folded, unliganded hPin1 WW domain and/or exhibit low sequence conservation among WW-domain family members. As only Glu12, Ser16, and Arg17 contribute directly to ligand binding,39 it is possible that some of the residues in this group evolved for protein solubility and/or prevention of aggregation/proteolytic degradation or coevolved with chaperones and folding enzymes to enhance folding at the expense of aggregation.47

Materials and Methods

Preparation of variants

The hPin1 WW variants Leu7Norval, Trp11Ala, Arg21Gly, Tyr24Ala, Tyr24Leu, Asp26Ala, Asn30Gly, Pro37Ala, Pro37Leu, and Pro37dehydroP were prepared by solid-phase peptide synthesis.26 All other mutants were expressed in E. coli as Glutathion-S-transferase fusion proteins and purified via Glutathion-affinity chromatography.7 Purified hPin1 WW domains were dialyzed against 20 mM sodium phosphate buffer, pH 7.0 and stored at −80°C until use. Sequence identity of all hPin1 WW variants was ascertained by MALDI-mass spectrometry.

Spectroscopic studies

CD-spectroscopic measurements were made in 10 mM sodium phosphate, pH 7.07,16 with an AVIV model 202SF circular dichroism spectrometer (Lakewood, NJ), equipped with a Peltier temperature control system. Measurements were made at a protein concentration of 15 μM in 2 mm quartz cuvettes (Hellma, Forest Hills, NY).

Thermodynamic analysis

The thermodynamic stability of hPin1 WW and variants thereof was determined from equilibrium unfolding curves. Raw equilibrium denaturation data, monitored by far-UV CD at 226 nm, were fitted to Eq. (1), which assumes the validity of a two-state reaction7,29:

graphic file with name pro0018-1806-m1.jpg (1)

In Eq. (1), Keq = exp(−ΔG(T)/RT) is the equilibrium constant for folding, θ is the experimentally measured ellipticity (in units of millidegrees (mdeg)) and θN(T) and θD(T) the slopes of the pre- and posttransitions. Folding free energies ΔG(T) in Eq. (1) were expressed as a parabolic Taylor's expansion around the midpoint of equilibrium folding (T = TM) [Eq. (2)]7,29:

graphic file with name pro0018-1806-m2.jpg (2)

Equilibrium unfolding transitions were normalized to the fraction of denatured protein (FD):

graphic file with name pro0018-1806-m3.jpg (3)

Conclusions

We have shown that the WW domain of the human Pin1 rotamase provides a platform to probe the sequence determinants of folding and stability for a protein that adopts a relatively simple antiparallel β-sheet topology. Despite consisting of only 34 residues, all but four of these can be replaced one at a time by Ala without compromising the folding of this 3-stranded β-sheet at 2°C. These stability-determining residues do not directly contribute to the biological function of the WW domain (peptide binding), and are highly conserved amongst other WW-domain family members. That the majority of the side chains (23/32) were energetically dispensable, resulting in a modest decrease in WW domain stability, suggests that a simplified WW domain could still fold into a stable structure, as shown previously for other proteins.4850 Such low complexity sequences may not be able to optimally utilize the proteostasis network47 and thus might be aggregation-prone within a cell and Ala-rich proteins would probably be susceptible to the triplet repeat expansions linked to a variety of diseases,51 and likely a multitude of other biological challenges, including problems associated with immune system differentiation between self and non-self.52

Acknowledgments

The authors wish to thank Dr. Evan Powers for his help with the figures, Dr. Colleen Fearns for her critical reading and editing of the manuscript, as well as Dr. Houbi Nguyen and Dr. Martin Gruebele (Department of Chemistry, UIUC) for helpful discussions.

References

  • 1.Anfinsen CB. Principles that govern the folding of protein chains. Science. 1973;181:223–230. doi: 10.1126/science.181.4096.223. [DOI] [PubMed] [Google Scholar]
  • 2.Heinz DW, Baase WA, Matthews BW. Folding and function of a T4 lysozyme containing 10 consecutive alanines illustrate the redundancy of information in an amino acid sequence. Proc Natl Acad Sci USA. 1992;89:3751–3755. doi: 10.1073/pnas.89.9.3751. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Cunningham BC, Wells JA. High-resolution epitope mapping of hGH-receptor interactions by alanine-scanning mutagenesis. Science. 1989;244:1081–1085. doi: 10.1126/science.2471267. [DOI] [PubMed] [Google Scholar]
  • 4.Kuroda Y, Kim PS. Folding of bovine pancreatic trypsin inhibitor (BPTI) variants in which almost half the residues are alanine. J Mol Biol. 2000;298:493–501. doi: 10.1006/jmbi.2000.3622. [DOI] [PubMed] [Google Scholar]
  • 5.Matthews BW. Studies on protein stability with T4 lysozyme. Adv Protein Chem. 1995;46:249–278. doi: 10.1016/s0065-3233(08)60337-x. [DOI] [PubMed] [Google Scholar]
  • 6.Deechongkit S, Nguyen H, Powers ET, Dawson PE, Gruebele M, Kelly JW. Context-dependent contributions of backbone hydrogen bonding to beta-sheet folding energetics. Nature. 2004;430:101–105. doi: 10.1038/nature02611. [DOI] [PubMed] [Google Scholar]
  • 7.Jager M, Nguyen H, Crane JC, Kelly JW, Gruebele M. The folding mechanism of a beta-sheet: the WW domain. J Mol Biol. 2001;311:373–393. doi: 10.1006/jmbi.2001.4873. [DOI] [PubMed] [Google Scholar]
  • 8.Powers ET, Deechongkit S, Kelly JW. Backbone-backbone H-bonds make context-dependent contributions to protein folding kinetics and thermodynamics: lessons from amide-to-ester mutations. Adv Protein Chem. 2006;72:39–78. doi: 10.1016/S0065-3233(05)72002-7. [DOI] [PubMed] [Google Scholar]
  • 9.Nguyen H, Jager M, Moretto A, Gruebele M, Kelly JW. Tuning the free-energy landscape of a WW domain by temperature, mutation, and truncation. Proc Natl Acad Sci USA. 2003;100:3948–3953. doi: 10.1073/pnas.0538054100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Ranganathan R, Lu KP, Hunter T, Noel JP. Structural and functional analysis of the mitotic rotamase Pin1 suggests substrate recognition is phosphorylation dependent. Cell. 1997;89:875–886. doi: 10.1016/s0092-8674(00)80273-1. [DOI] [PubMed] [Google Scholar]
  • 11.Sauer RT, Milla ME, Waldburger CD, Brown BM, Schildbach JF. Sequence determinants of folding and stability for the P22 Arc repressor dimer. FASEB J. 1996;10:42–48. doi: 10.1096/fasebj.10.1.8566546. [DOI] [PubMed] [Google Scholar]
  • 12.Milla ME, Brown BM, Sauer RT. Protein stability effects of a complete set of alanine substitutions in Arc repressor. Nat Struct Biol. 1994;1:518–523. doi: 10.1038/nsb0894-518. [DOI] [PubMed] [Google Scholar]
  • 13.Yu MH, Weissman JS, Kim PS. Contribution of individual side-chains to the stability of BPTI examined by alanine-scanning mutagenesis. J Mol Biol. 1995;249:388–397. doi: 10.1006/jmbi.1995.0304. [DOI] [PubMed] [Google Scholar]
  • 14.Deechongkit S, Nguyen H, Jager M, Powers ET, Gruebele M, Kelly JW. Beta-sheet folding mechanisms from perturbation energetics. Curr Opin Struct Biol. 2006;16:94–101. doi: 10.1016/j.sbi.2006.01.014. [DOI] [PubMed] [Google Scholar]
  • 15.Jager M, Deechongkit S, Koepf EK, Nguyen H, Gao J, Powers ET, Gruebele M, Kelly JW. Understanding the mechanism of beta-sheet folding from a chemical and biological perspective. Biopolymers. 2008;90:751–758. doi: 10.1002/bip.21101. [DOI] [PubMed] [Google Scholar]
  • 16.Koepf EK, Petrassi HM, Sudol M, Kelly JW. WW: an isolated three-stranded antiparallel beta-sheet domain that unfolds and refolds reversibly; evidence for a structured hydrophobic cluster in urea and GdnHCl and a disordered thermal unfolded state. Protein Sci. 1999;8:841–853. doi: 10.1110/ps.8.4.841. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Nguyen H, Jager M, Kelly JW, Gruebele M. Engineering a beta-sheet protein toward the folding speed limit. J Phys Chem B. 2005;109:15182–15186. doi: 10.1021/jp052373y. [DOI] [PubMed] [Google Scholar]
  • 18.Ferguson N, Johnson CM, Macias M, Oschkinat H, Fersht A. Ultrafast folding of WW domains without structured aromatic clusters in the denatured state. Proc Natl Acad Sci USA. 2001;98:13002–13007. doi: 10.1073/pnas.221467198. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Kowalski JA, Liu K, Kelly JW. NMR solution structure of the isolated Apo Pin1 WW domacomparison to the x-ray crystal structures of Pin1. Biopolymers. 2002;63:111–121. doi: 10.1002/bip.10020. [DOI] [PubMed] [Google Scholar]
  • 20.Liu F, Du D, Fuller AA, Davoren JE, Wipf P, Kelly JW, Gruebele M. An experimental survey of the transition between two-state and downhill protein folding scenarios. Proc Natl Acad Sci USA. 2008;105:2369–2374. doi: 10.1073/pnas.0711908105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Kuznetsov SV, Hilario J, Keiderling TA, Ansari A. Spectroscopic studies of structural changes in two beta-sheet-forming peptides show an ensemble of structures that unfold noncooperatively. Biochemistry. 2003;42:4321–4332. doi: 10.1021/bi026893k. [DOI] [PubMed] [Google Scholar]
  • 22.Syud FA, Stanger HE, Mortell HS, Espinosa JF, Fisk JD, Fry CG, Gellman SH. Influence of strand number on antiparallel beta-sheet stability in designed three- and four-stranded beta-sheets. J Mol Biol. 2003;326:553–568. doi: 10.1016/s0022-2836(02)01304-9. [DOI] [PubMed] [Google Scholar]
  • 23.Stanger HE, Syud FA, Espinosa JF, Giriat I, Muir T, Gellman SH. Length-dependent stability and strand length limits in antiparallel beta-sheet secondary structure. Proc Natl Acad Sci USA. 2001;98:12015–12020. doi: 10.1073/pnas.211536998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Kortemme T, Ramirez-Alvarado M, Serrano L. Design of a 20-amino acid, three-stranded beta-sheet protein. Science. 1998;281:253–256. doi: 10.1126/science.281.5374.253. [DOI] [PubMed] [Google Scholar]
  • 25.Deechongkit S, Kelly JW. The effect of backbone cyclization on the thermodynamics of beta-sheet unfolding: stability optimization of the PIN WW domain. J Am Chem Soc. 2002;124:4980–4986. doi: 10.1021/ja0123608. [DOI] [PubMed] [Google Scholar]
  • 26.Kaul R, Angeles AR, Jager M, Powers ET, Kelly JW. Incorporating beta-turns and a turn mimetic out of context in loop 1 of the WW domain affords cooperatively folded beta-sheets. J Am Chem Soc. 2001;123:5206–5212. doi: 10.1021/ja0102890. [DOI] [PubMed] [Google Scholar]
  • 27.Deechongkit S, Dawson PE, Kelly JW. Toward assessing the position-dependent contributions of backbone hydrogen bonding to beta-sheet folding thermodynamics employing amide-to-ester perturbations. J Am Chem Soc. 2004;126:16762–16771. doi: 10.1021/ja045934s. [DOI] [PubMed] [Google Scholar]
  • 28.Deechongkit S, You SL, Kelly JW. Synthesis of all nineteen appropriately protected chiral alpha-hydroxy acid equivalents of the alpha-amino acids for Boc solid-phase depsi-peptide synthesis. Org Lett. 2004;6:497–500. doi: 10.1021/ol036102m. [DOI] [PubMed] [Google Scholar]
  • 29.Crane JC, Koepf EK, Kelly JW, Gruebele M. Mapping the transition state of the WW domain beta-sheet. J Mol Biol. 2000;298:283–292. doi: 10.1006/jmbi.2000.3665. [DOI] [PubMed] [Google Scholar]
  • 30.Macias MJ, Hyvonen M, Baraldi E, Schultz J, Sudol M, Saraste M, Oschkinat H. Structure of the WW domain of a kinase-associated protein complexed with a proline-rich peptide. Nature. 1996;382:646–649. doi: 10.1038/382646a0. [DOI] [PubMed] [Google Scholar]
  • 31.Jager M, Zhang Y, Bieschke J, Nguyen H, Dendle M, Bowman ME, Noel JP, Gruebele M, Kelly JW. Structure-function-folding relationship in a WW domain. Proc Natl Acad Sci USA. 2006;103:10648–10653. doi: 10.1073/pnas.0600511103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Jager M, Nguyen H, Dendle M, Gruebele M, Kelly JW. Influence of hPin1 WW N-terminal domain boundaries on function, protein stability, and folding. Protein Sci. 2007;16:1495–1501. doi: 10.1110/ps.072775507. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Jager M, Dendle M, Fuller AA, Kelly JW. A cross-strand Trp Trp pair stabilizes the hPin1 WW domain at the expense of function. Protein Sci. 2007;16:2306–2313. doi: 10.1110/ps.072904107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Macias MJ, Gervais V, Civera C, Oschkinat H. Structural analysis of WW domains and design of a WW prototype. Nat Struct Biol. 2000;7:375–379. doi: 10.1038/75144. [DOI] [PubMed] [Google Scholar]
  • 35.Myers JK, Morris DP, Greenleaf AL, Oas TG. Phosphorylation of RNA polymerase II CTD fragments results in tight binding to the WW domain from the yeast prolyl isomerase Ess1. Biochemistry. 2001;40:8479–8486. doi: 10.1021/bi0027884. [DOI] [PubMed] [Google Scholar]
  • 36.Jiang X, Kowalski J, Kelly JW. Increasing protein stability using a rational approach combining sequence homology and structural alignment: stabilizing the WW domain. Protein Sci. 2001;10:1454–1465. doi: 10.1110/ps.640101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Sharpe T, Jonsson AL, Rutherford TJ, Daggett V, Fersht AR. The role of the turn in beta-hairpin formation during WW domain folding. Protein Sci. 2007;16:2233–2239. doi: 10.1110/ps.073004907. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Peng T, Zintsmaster JS, Namanja AT, Peng JW. Sequence-specific dynamics modulate recognition specificity in WW domains. Nat Struct Mol Biol. 2007;14:325–331. doi: 10.1038/nsmb1207. [DOI] [PubMed] [Google Scholar]
  • 39.Verdecia MA, Bowman ME, Lu KP, Hunter T, Noel JP. Structural basis for phosphoserine-proline recognition by group IV WW domains. Nat Struct Biol. 2000;7:639–643. doi: 10.1038/77929. [DOI] [PubMed] [Google Scholar]
  • 40.Lubs H, Abidi FE, Echeverri R, Holloway L, Meindl A, Stevenson RE, Schwartz CE. Golabi-Ito-Hall syndrome results from a missense mutation in the WW domain of the PQBP1 gene. J Med Genet. 2006;43:e30. doi: 10.1136/jmg.2005.037556. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Petrovich M, Jonsson AL, Ferguson N, Daggett V, Fersht AR. Phi-analysis at the experimental limits: mechanism of beta-hairpin formation. J Mol Biol. 2006;360:865–881. doi: 10.1016/j.jmb.2006.05.050. [DOI] [PubMed] [Google Scholar]
  • 42.Socolich M, Lockless SW, Russ WP, Lee H, Gardner KH, Ranganathan R. Evolutionary information for specifying a protein fold. Nature. 2005;437:512–518. doi: 10.1038/nature03991. [DOI] [PubMed] [Google Scholar]
  • 43.Russ WP, Lowery DM, Mishra P, Yaffe MB, Ranganathan R. Natural-like function in artificial WW domains. Nature. 2005;437:579–583. doi: 10.1038/nature03990. [DOI] [PubMed] [Google Scholar]
  • 44.Sudol M, The WW domain . In: Modular protein domains. Cesareni G, editor. Weinheim: Wiley-VCH Verlag GmbH & Co. KGaA; 2005. pp. 59–72. [Google Scholar]
  • 45.Koepf EK, Petrassi HM, Ratnaswamy G, Huff ME, Sudol M, Kelly JW. Characterization of the structure and function of W –> F WW domain variants: identification of a natively unfolded protein that folds upon ligand binding. Biochemistry. 1999;38:14338–14351. doi: 10.1021/bi991105l. [DOI] [PubMed] [Google Scholar]
  • 46.Verdecia MA, Bowman ME, Lu KP, Hunter T, Noel JP. Structural basis for phosphoserine-proline recognition by group IV WW domains. Nat Struct Biol. 2000;7:639–643. doi: 10.1038/77929. [DOI] [PubMed] [Google Scholar]
  • 47.Balch WE, Morimoto RI, Dillin A, Kelly JW. Adapting proteostasis for disease intervention. Science. 2008;319:916–919. doi: 10.1126/science.1141448. [DOI] [PubMed] [Google Scholar]
  • 48.Brown BM, Sauer RT. Tolerance of Arc repressor to multiple-alanine substitutions. Proc Natl Acad Sci USA. 1999;96:1983–1988. doi: 10.1073/pnas.96.5.1983. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Schafmeister CE, LaPorte SL, Miercke LJ, Stroud RM. A designed four helix bundle protein with native-like structure. Nat Struct Biol. 1997;4:1039–1046. doi: 10.1038/nsb1297-1039. [DOI] [PubMed] [Google Scholar]
  • 50.Riddle DS, Santiago JV, Bray-Hall ST, Doshi N, Grantcharova VP, Yi Q, Baker D. Functional rapidly folding proteins from simplified amino acid sequences. Nat Struct Biol. 1997;4:805–809. doi: 10.1038/nsb1097-805. [DOI] [PubMed] [Google Scholar]
  • 51.Gaspar C, Jannatipour M, Dion P, Laganiere J, Sequeiros J, Brais B, Rouleau GA. CAG tract of MJD-1 may be prone to frameshifts causing polyalanine accumulation. Hum Mol Genet. 2000;9:1957–1966. doi: 10.1093/hmg/9.13.1957. [DOI] [PubMed] [Google Scholar]
  • 52.Gautam AM, Pearson CI, Smilek DE, Steinman L, McDevitt HO. A polyalanine peptide with only five native myelin basic protein residues induces autoimmune encephalomyelitis. J Exp Med. 1992;176:605–609. doi: 10.1084/jem.176.2.605. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Protein Science : A Publication of the Protein Society are provided here courtesy of The Protein Society

RESOURCES