Abstract
Conformational equilibria in the protein denatured state have key roles regulating folding, stability, and function. The extent of conformational bias in the protein denatured state under folding conditions, however, has thus far proven elusive to quantify, particularly with regard to its sequence dependence and energetic character. To better understand the structural preferences of the denatured state, we analyzed both the sequence dependence to the mean hydrodynamic size of disordered proteins in water and the impact of heat on the coil dimensions, showing that the sequence dependence and thermodynamic energies associated with intrinsic biases for the α and polyproline II (PPII) backbone conformations can be obtained. Experiments that evaluate how the hydrodynamic size changes with compositional changes in the protein reveal amino acid specific preferences for PPII that are in good quantitative agreement with calorimetry-measured values from unfolded peptides and those inferred by survey of the protein coil library. At temperatures above 25 °C, the denatured state follows the predictions of a PPII-dominant ensemble. Heat effects on coil hydrodynamic size indicate the α bias is comparable to the PPII bias at cold temperatures. Though historically thought to give poor resolution to structural details, the hydrodynamic size of the unfolded state is found to be an effective reporter on the extent of the biases for the α and PPII backbone conformations.
Graphical Abstract
INTRODUCTION
Ever since it was demonstrated >80 years ago that protein macromolecules could be “denatured” with a resulting loss of function,1 the nature and energetics of the denatured state have been topics of wide study2-5 and debate.6-9 The protein denatured state has key roles regulating stability,10 activity,11 transport across membranes,12 and turnover rates.13 Moreover, proteins unfold and refold spontaneously in vivo,14 and the ability to fold with fast kinetics15 and avoid metastable kinetic traps16 is thought to be facilitated by properties of the denatured state that are not fully understood,8 such as conformational preferences that preorganize an unfolded chain.17 Despite these insights and the long-held recognition that a molecular-level understanding of folding and function requires the energy (or Boltzmann) weighted contributions of the myriad of structural states accessible to the protein, a quantitative thermodynamic description of the conformations in the denatured state has proven problematic.
Structural and energetic characterization of the denatured state, which can be troublesome because of the low solubility of unfolded protein and the difficulty in deciphering the character of individual conformations from the ensemble average, has made extensive use of short peptides as experimental models. Because they are too short to fold, peptides can provide access to unfolded states under otherwise folding conditions.18 Also, in the absence of folding, conformational selection is simplified and locally driven by factors such as hydration19 and steric hindrance.20 Peptides are found to have strong preferences for the left-handed polyproline II (PPII) backbone conformation,21-23 suggesting that PPII structures may dominate the ensemble of conformations in the denatured state. Denatured proteins, however, exhibit structural attributes that are not well-modeled by short peptides. Specifically, the rapid collapse of the protein chain in water24,25 and residual structures26-28 that maintain native topology under strongly denaturing conditions29-31 suggest collective and sequence-dispersed effects in the denatured state that are fundamental for defining its structural character.
Support for a peptide-based model of the protein denatured state is provided by the protein coil library.32-35 Coil libraries are constructed from the short segments of irregular structures found in the Protein Data Bank. Overall, coil libraries exhibit structural trends that are in good agreement with the results from peptide studies, ranging from strong preferences for PPII to similar nearest neighbor effects on backbone structure.34,38 Importantly, the chemical shifts and 3-bond J-couplings measured in peptides by NMR spectroscopy are adequately reproduced from structural models of the denatured state that are derived from the protein coil library.39-41
Intrinsically disordered proteins (IDPs) offer another experimental system from which to assess structural preferences in unfolded states under nondenaturing conditions.42 While chemically denatured proteins are known to adopt macromolecular sizes that depend weakly on sequence details other than chain length, IDPs in water exhibit strong sequencedependent influences on structural size (Figure 1A). Computer simulations show that steric effects on disordered structure cannot account for the hydrodynamic size dependence on sequence observed in IDPs.36 Also, temperature changes are found to induce large shifts in the hydrodynamic size for disordered proteins in water43-45 that can exceed the change in size associated with heat denaturation of folded protein of the same chain length.46 The implication of these findings, albeit expected, is that monomeric disordered protein structure is both under thermodynamic control and highly sensitive to the primary sequence.
Herein, we show that the sequence dependence to the mean hydrodynamic radius, Rh, as observed in IDPs provides an independent measurement of the amino acid specific bias for PPII in the denatured state. Because PPII-rich structures are extended,47 the magnitude of a PPII preference in the denatured state can affect the mean hydrodynamic size.45,48,49 Using only the assumptions that: (1) PPII effects on mean Rh follow a simple power-law scaling relationship49 and (2) protein net charge can also influence the hydrodynamic size,37 experiments that evaluate how mean Rh changes with compositional changes in the protein reveal amino acid specific backbone preferences for PPII that are in good quantitative agreement with both calorimetry-measured peptide values23 and survey of the protein coil library.34 IDPs rich in nonpolar amino acids, obtained via mutagenesis to account for the otherwise low representation of the nonpolar types,50 were included in the study and suggest that the PPII propensities are independent of compositional bias and, accordingly, are characteristic also of the denatured states of foldable protein sequences.
As the PPII bias is driven by a favorable enthalpy,51 the effect of increased temperature will be to populate nonPPII states at the expense of PPII. The enthalpy and entropy associated with this structural transition can be determined from heat effects on the mean Rh. Specifically, since the PPII bias is locally driven52 and noncooperative,51 it can be modeled in terms of individual conformational equilibria at each residue position. Thermodynamic parameterization of the PPII-to-nonPPII transition from the temperature dependence to the mean Rh indicates the PPII bias is driven by a significant and favorable enthalpy and partially offset by an unfavorable entropy that matches quantitatively the effects of heat on peptide structures as elucidated by circular dichroism (CD) and NMR spectroscopies.51
When interpreting the effects of the PPII bias on the mean hydrodynamic size, population of the α backbone conformation in the denatured state has consequences that must be considered. The α basin of the Ramachandran map of φ and Ψ dihedral angles is among the most populated regions in the coil library distribution32,35 and is shared with turn structures.34 Because of the backbone geometry of the α configuration, whereby sparse sampling at dispersed positions can produce turns and heavy sampling among contiguous positions yield helix, the effect of the PPII bias on the mean Rh can be either compaction or expansion.36 In ensembles of denatured structures in which the α bias is high, PPII can act as a helix breaker and the PPII bias will trend with reductions in the mean size. When PPII is the dominant conformation, populating PPII at the expense of nonPPII will disrupt turn and extended structures at somewhat equal rates and the net effect is an increase in the mean Rh. The direction (i.e., expansion or compaction) and the rate of change in the mean Rh that was observed with changes in IDP composition, and thus changes in the PPII bias, confirm that PPII is the dominant backbone conformation in the protein denatured state. From the analysis of heat effects on coil hydrodynamic size, the data predict that the α bias is comparable to the PPII bias only at cold temperatures. A method that can experimentally establish amino acid specific bias for the α backbone conformation in the denatured state and directly test this prediction is discussed.
METHODS
Protein Mutagenesis and Purification.
Recombinant mutant protein was expressed in bacterial cells and isolated to >95% purity from cell lysate using affinity chromatography as described elsewhere for the expression and purification of the wild type p53(1–93) fragment of the human p53 protein.45 Genes coding for mutant proteins were cloned into plasmid expression vectors by ATUM (Newark, CA).
CD Spectroscopy.
CD spectra were recorded using a Jasco J-710 spectropolarimeter equipped with a PFD-425S Peltier unit and employed a 1 mm pathlength quartz cuvette. Samples were equilibrated at each temperature for 10 min. Spectra were collected with a resolution of 0.5 nm and a scan rate of 20 nm min−1, and they were the average of eight scans. Spectra were baseline corrected for solvent and buffer contributions. Samples used 0.17 mg mL−1 of protein, 10 mM sodium phosphate, 100 mM sodium chloride (pH 7).
Dynamic Light Scattering Measurement of Mean Rh.
Dynamic light scattering (DLS) readings used noninvasive backscatter optics and were measured with a Zetasizer Nano ZS using Peltier temperature control from Malvern Instruments (Malvern, Worcestershire, UK). Protein samples were buffered at pH 7 in 10 mM sodium phosphate, 100 mM sodium chloride, and filtered using 0.2 μm polyvinylidene difluoride syringe-driven filters. The sample temperature was cycled (5–65 °C in 10 °C steps and back) to establish the reversibility of heat effects on mean Rh. The samples were equilibrated at each temperature for 15 min before measurement. Fresh samples were prepared daily from frozen stock. All measurements used 1 cm pathlength quartz cuvettes. Mean Rh was calculated from the diffusion coefficient D, solvent viscosity η, and the Stokes–Einstein relationship Rh = kT/6πηD, where k is the Boltzmann constant and T is temperature in kelvin. Solvent viscosity was calculated by the solvent builder program provided by Malvern, which uses Sednterp53 to estimate η from the solution contents. Mean Rh was measured >five times for each sample at each temperature. The standard deviation σ of the mean Rh from N number of measurements was calculated by σ = (Σ(mean Rh,i – average mean Rh)2/N)1/2.
Size Exclusion Chromatography Analysis of Hydrodynamic Size.
Size exclusion chromatography (SEC) experiments used Sephadex G-75 gel filtration media equilibrated in 10 mM sodium phosphate, 100 mM sodium chloride (pH 7), and were performed at room temperature. Elution volumes were determined from chromatograms measured using a Bio-Rad BioLogic LP system equipped with a UV absorbance monitor. Protein samples loaded onto the column had a volume of 80 μL and contained 2–3 mg mL—1 of protein in 10 mM sodium phosphate and 100 mM sodium chloride (pH 7). Indicator dyes were loaded separately from the protein samples and contained 20 μL of 0.3 mg mL−1 blue dextran and 0.03 mg mL−1 2,4-dinitrophenyl-L-aspartate to establish the void and column volumes, respectively. KD were calculated as (Ve – Vo)/(Vc – Vo), where Vo is the void volume, Vc the column volume, and Ve the elution volume of the protein, determined from volumes at maximum absorbance. Uncertainties associated with KD were calculated by the standard deviation (σ, defined above) from five or more measurements.
Random Generation of PPII Propensity Scales.
A random number generator based on Knuth’s subtractive method54 was used to generate numbers randomly that distribute uniformly in the range of 0.0–1.0. From this algorithm, 1 000 000 random scales were made using a twostep protocol. First, a random number was obtained and used to target an average scale propensity. This step ensured that scales with low, medium, or high average propensity were sampled at comparable rates. Next, a scale was generated by assigning each amino acid type a random value between 0 and 1 until a set was found whose average for the 20 common amino acids matched the target determined in the first step (±0.05). Average amino acid values among the best performing random scales did not change when the number of generated scales was increased from 500 000 to 1 000 000, indicating the random possibilities were appropriately sampled.
Simulated Ensembles of the Protein Denatured State.
Ensembles of protein structures were generated by a random search of conformational space using a hard sphere collision (HSC) model.45,49,55 Rh dependence on the PPII and α biases in Figure 5 was determined from simulation of polyalanine ensembles reported previously.36 These polyalanine simulations were repeated to obtain the data in Figure S7 whereby the PPII and α regions of the Ramachandran map were increased from (−75 ± 10°, +145 ± 10°) and (−64 ± 10°, −41 ± 10°), respectively, to (−75 ± 25°, +145 ± 25°) and (−64 ± 25°, −41 ± 25°). In the HSC model, the procedure to generate a random conformer starts with a unit peptide and all other atoms for a chain are calculated by the rotational matrix.56 Backbone atoms are generated from the dihedral angles φ, Ψ, and ω and the standard bond angles and bond lengths.57 Backbone dihedral angles are assigned randomly using a random number generator based on Knuth’s subtractive method.54,55 (φ, Ψ) is restricted to the allowed Ramachandran regions58 to sample conformational space efficiently. For peptide bonds, ω is given a Gaussian fluctuation of ±5% about the trans form. Of the two possible positions of the Cβ atom, the one corresponding to L-amino acids was used. van der Waals atomic radii59,60 are used as the only scoring function to eliminate grossly improbable conformations. States that pass the steric filter are population weighted by a calibrated and extensively tested structure-based energy function that has been parameterized to solvent-accessible surface areas.61-69 Random structures are added to the ensemble until the mean size, Rh ≈ 〈L〉/2, is stable,36,45 where L is the maximum Cα–Cα distance for a state and 〈L〉 is the population weighted value. To model conformational propensities (e.g., for PPII or α), conformational bias is achieved by biasing the random search of backbone torsion angles.36,45,48,49
RESULTS
Database of Mean Rh.
The sequence dependence to the hydrodynamic size of unfolded proteins in water was evaluated by using the mean Rh from IDPs obtained by literature survey.36,37 The experimental mean Rh and the primary sequence of each protein in this set is given in Table S1. The net charge, Qnet, is also provided and was calculated from sequence as the number of lysine and arginine residues minus the number of aspartic acid and glutamic acid.
Because amino acids with hydrophobic side chains are found less frequently in IDPs as compared to nonIDPs,50 the database was supplemented with mutants that had substitutions to the nonpolar types. Mutants were made from the sequence of the intrinsically disordered N-terminal region of the p53 protein, p53(1–93), selected because the wild type fragment has good solubility and expresses in bacterial cells at levels appropriate for structural studies.45 A total of 15 substitution sites in the p53(1–93) sequence were chosen based on the sites being dispersed, nonadjacent, minimally disruptive of potential steric effects by avoiding bulky aromatics and positions preceding proline, and nondisruptive of the net charge and the charge pattern. The positions in the p53(1–93) sequence that were substituted are shown in Figure S1. All 15 sites were first substituted to proline, then alanine, leucine, isoleucine, valine, methionine, and glycine (i.e., the nonpolar and nonaromatic amino acids).
To increase sequence diversity in the database, an IDP derived from a foldable sequence was included. Retronuclease, obtained by reversing the primary sequence of the unaltered wildtype staphylococcal nuclease, has the identical composition of L-amino acids and pattern of side chains as a foldable protein, and yet, it is intrinsically disordered.46
Structural Characterization of p53(1–93) Substitution Mutants.
The structure of each substitution mutant was evaluated using methods based on CD spectroscopy, DLS, and SEC. At 25 °C, the CD spectrum of each mutant was similar to spectra reported for IDPs,74 with molar residue ellipticity at 221 nm close to zero (Figure S2). Each spectrum exhibited a local CD maximum at ~225 nm that is seen in proteins with PPII structures.17 The local maximum at ~225 nm decreases in intensity with increasing temperature, consistent with the favorable enthalpy of the PPII bias in the denatured state.51 Deconvolution of CD spectra into specific structural elements is troublesome for disordered proteins, such as p53(1–93), because no satisfactory reference set exists owing to the spectral similarity of disordered protein structures.75 Other than the temperature-dependent local maximum at ~225 nm, signifying PPII content, strong evidence for any other backbone structure was not detected in the CD spectrum of any of the p53(1–93) mutants.
DLS, performed at 25 °C, was used to measure the mean Rh of each substitution mutant. Substitution to proline at all sites produced the mutant with the largest mean Rh of 33.6 ± 1.0 Å, whereas substitutions to glycine yielded the smallest at 30.8 ± 1.4 Å. Typical size distributions that were measured for each p53(1–93) mutant are given in Figure S3. Mean Rh values are given in Table 1. The rank order in mean Rh for the substitution mutants, using the 1-letter amino acid code, was found to be P > V ≈ I > L > A > M > G. As the mean Rh trends with the PPII bias in disordered proteins,49 it is noteworthy that calorimetry-measured amino acid specific PPII propensities in unfolded peptides give the rank order of P > V ≈ I > A > M > L > G,23 almost identical to the rank order in hydrodynamic size that was found for the p53(1–93) mutants except for the leucine rank. Based on the correlation of mean Rh and the PPII bias,49 the DLS results predict that leucine has a higher intrinsic propensity for PPII in the denatured state relative to alanine.
Table 1.
p53(1–93) protein | mean Rha | KDb | mean Rhc |
---|---|---|---|
PRO-rich mutant | 33.6 ± 1.0 | 0.186 ± 0.004 | 33.8 ± 0.2 |
ILE-rich mutant | 33.4 ± 1.2 | 0.196 ± 0.007 | 33.3 ± 0.4 |
VAL-rich mutant | 33.3 ± 1.3 | 0.191 ± 0.006 | 33.6 ± 0.3 |
LEU-rich mutant | 33.1 ± 0.9 | 0.200 ± 0.004 | 33.1 ± 0.2 |
ALA-rich mutant | 32.4 ± 0.9 | 0.209 ± 0.001 | 32.6 ± 0.1 |
MET-rich mutant | 31.8 ± 0.3 | 0.215 ± 0.001 | 32.3 ± 0.1 |
GLY-rich mutant | 30.8 ± 1.4 | 0.245 ± 0.006 | 30.6 ± 0.3 |
wild type | 32.8 ± 0.4 | 0.221 ± 0.005 | 32.0 ± 0.3 |
Mean Rh measured by DLS at 25 °C.
KD measured by SEC at room temperature (~23 °C).
SEC-estimated mean Rh determined from the linear correlation of KD and mean Rh that was obtained from globular folded proteins. KD was 0.385, 0.310, 0.407, and 0.156 for nuclease, carbonic anhydrase, myoglobin, and albumin that have DLS-measured mean Rh of 22.4, 26.8, 22.7, and 35.6 Å, respectively.36 These DLS values compare favorably to Rh estimated from crystal structures70-73 as half the maximum Cα–Cα distance, which are 21.2, 27.3, 21.8, and 35.8 A. The linear correlation of KD and DLS-measured mean Rh from the folded proteins was Rh = −54.01·KD + 43.88.
SEC chromatograms at room temperature (~23 °C) were measured for each mutant and used to further evaluate the differences in mean hydrodynamic size. Thermodynamic retention factors (KD) were calculated from elution volumes, with smaller KD signifying larger mean hydrodynamic size. SEC chromatograms for each mutant are given in Figure S4 and show no evidence of aggregation products. The rank order in mean size among the mutants from KD was again found to be P > V ≈ I> L > A > M > G. Using the linear correlation of mean Rh and KD as established from globular proteins with DLS-measured mean Rh that match their crystallographic structures,36 KD measured for each mutant were converted to SEC-estimated mean Rh. The SEC-estimated values for mean Rh show good agreement with the DLS results and are reported in Table 1.
Sequence Analysis of Mean Rh To Determine Amino Acid Specific Bias for PPII.
Simulation of the denatured state as a collection of random protein structures generated by computer algorithm shows that the population-weighted hydrodynamic radius increases with increasing backbone preference for PPII according to
(1) |
where N is the number of residues and fPPII is the fractional number of residues in the PPII conformation.45,48,49 fPPII is calculated from an ensemble of structures as the Boltzmannweighted value, 〈NPPII〉/N. Rh is 〈L〉/2, with L the maximum distance between any two Cα atoms in a structure to estimate its tumbling length in solution. The power law relationship given by eq 1 is independent of the pattern of PPII propensities in the chain.49 It also is mostly insensitive to steric effects from side chain atoms when the simulation is constrained to biological sequences.36 Unusual sequences, such as all proline or all glycine, cause deviations from eq 1.
When tested experimentally, eq 1 was found to be quantitatively accurate.36,48,49 fPPII can be estimated for any protein sequence by ΣPPPII,i/N, where PPPII,i, is the experimental PPII propensity determined for amino acid type i in unfolded peptides.23 Rh, using eq 1, can thus be predicted for IDPs from sequence and compared to the experimental mean Rh36,49 and likewise the effects on the mean Rh from amino acid substitutions can be predicted as well.48
Sequence analysis of the mean Rh using a database of IDPs shows that the experimental value trends also with the net charge,36 as established by Forman-Kay37 and others.76,77 Empirical modification of eq 1 to account for net charge effects on the mean size gives
(2) |
In the IDP database, mean Rh did not trend with κ,36 which is a measure of the mixing of positive and negative charges in the primary sequence.78 This provided justification for using the net charge to modify eq 1 and obtain eq 2 since, in the database, mean Rh was independent of sequence organization of the charged side chains. The range of κ values in the IDP database is from 0.058 for PGR to 0.423 for prothymosin-α36 and compares favorably to the range of κ found in biological IDPs across many species.79
We find that the mean hydrodynamic size of IDPs is sufficiently sensitive to backbone conformational bias to provide experimental access to amino acid specific propensities for PPII. This is demonstrated in two steps. First, in Figure 2, we show that only a small set of PPII propensity scales reproduce the sequence dependence to the experimental mean Rh. Then, in Figure 3, we show the average amino acid specific value from this set reproduces the PPII scale determined calorimetrically from peptide studies. There also was good agreement when the average amino acid specific value was compared to PPII frequencies from the protein coil library. The “peptide scale” in Figures 2 and 3 is from host-guest analysis of the binding energetics of the Sos peptide23 that adopts PPII in its bound complex with the SH3 domain of Sem-5.80 The “coil library” scale is from Sosnick and Freed.34
Figure 2 data and specifically panel D were obtained from random PPII propensity scales generated with values ranging from 0 to 1 for the 20 common amino acid types. The ability of the peptide scale to reproduce the sequence dependence to experimental mean Rh is shown in panels A–C and compared in panel D to the performance of each random scale and the coil library. Scales were used to predict mean Rh from sequence for the 34 IDPs in the database, using eq 1 as demonstrated in panel A. The correlation, R2, of experimental mean Rh with the eq 1 predicted the value for each scale is reported by the x-axis in panel D. Equation 1 error was found to trend almost exclusively with Qnet, when both error and Qnet were normalized to the protein size.36 The y-axis in panel D reports the correlation of this trend. The slope and intercept from the linear correlation of the error trend (panel B) are the coefficients preceding ∣Qnet∣ and N0.5 in eq 2, respectively. Each scale thus yields a unique empirical modification to eq 1 that corrects for net charge effects on the sequence-predicted mean Rh. Also, because p53(1–93) substitution mutants and retronuclease were added to the IDP database, changing its composition, the coefficients preceding ∣Qnet∣ and N0.5 in eq 2 have changed slightly from our prior report.36 In panel D, the color, from blue to red, is the database average error in predicting Rh from sequence after correcting for the apparent net charge effect on the hydrodynamic size.
It is clear from the data in panel D of Figure 2 that there is a set of random PPII propensity scales that are better than typical at predicting mean Rh from sequence using fPPII, Qnet, and N. Highlighted by the boxed area in the panel figure, these scales predict Rh with good correlation to the experimental Rh (R2 > 0.7; x-axis) and prediction error that also trends with the net charge (R2 > 0.4; y-axis). The distribution of chargecorrected error in the boxed area is shown in Figure 3A. Notably, the results show that the peptide and coil library scales both greatly outperform the random scales in an ability to describe sequence effects on the mean hydrodynamic size when using only conformational bias and net charge considerations.
To test if the relationship linking mean Rh to backbone conformational bias (i.e., eq 1) can discern the differences in PPII preference among the amino acids, the average scale value for each amino acid type was computed from the “best” performing random scales. The “best” scales were defined as those in the boxed area of panel 2D with the smallest error, using the distribution mode (~0.2; see Figure 3A) as the cutoff. The computed averages, unfortunately, report somewhat trivial specificity except for distinguishing proline and nonproline types (Figure S5). If the “best” scales are constrained to include only those that also maintain a PPII propensity rank order of P > V ≈ I > L > A > M > G, as established experimentally by the p53(1–93) substitution mutants, then good, but not perfect, agreement with the peptide scale and the coil library is obtained (R2 ≈ 0.5; Figure S5). If the error cutoff is reduced to match the average error obtained from the coil library (~0.165; see Figure 3A), then the good agreement is improved even further (R2 ≈ 0.6; Figure 3C). These results show that the eq 1 relationship can indeed discern the differences in PPII preference among the amino acid types.
The average scale value for each amino acid type, computed from a constrained set of “best” performing random scales, is given in Table S2 and Figure 1B. These values represent the amino acid specific PPII bias as estimated from sequence effects on IDP mean Rh. Standard deviations are also provided and, surprisingly, they are modest (±0.06) considering 20 parameters were determined from only 34 pairs of mean Rh and sequence. The overall good agreement when comparing the peptide, coil library, and IDP-determined PPII propensities, despite the very different methodologies that were used, is strong evidence supporting the theoretical relationship, as given by eq 1, that links mean Rh in the denatured state to the PPII bias. Likewise, the results indicate the denatured state bias for PPII is substantial and its magnitude at different positions in the protein chain correlates with the local amino acid identity.
The main differences in amino acid specific PPII bias when comparing the IDP scale to the peptide and coil library scales is that the IDP bias for PPII is generally strongest for the branched amino acids, albeit weaker than the proline PPII bias. Based on the IDP scale, ILE and VAL have PPII propensities that are ≥0.5, while LEU is ~0.4. Also, the IDP scale has the proline bias for PPII at less than one (0.93), as does the coil library (0.81). Since proline residues can adopt nonPPII structures34 and the PPII conformation is especially accom-modating to bulky side chains,20 these small differences that were determined from the sequence dependence to IDP hydrodynamic size are reasonably justified.
Thermodynamic Parameterization of the PPII to nonPPII Transition.
The enthalpy and entropy of the PPII to nonPPII transition have been measured in short alanine peptides by monitoring heat effects on structure over a broad temperature range.51 Results from CD spectroscopy, which monitored the change in the CD signal at 215 nm, gave ΔHPPII and ΔSPPII of ~10 kcal mol−1 and 32.7 cal mol−1 K−1, respectively, while NMR measurements using 3-bond J couplings (3JαN) gave ~13 kcal mol−1 and 40.9 cal mol−1 K-1.
Because the PPII bias is noncooperative,51 the effect at individual residue positions from temperature changes can be modeled with the integrated van’t Hoff equation
(3) |
where KPPII is the equilibrium between PPII and nonPPII states, T is the temperature, and R the gas constant. If PPII is the lone dominant conformation in the denatured state, then KPPII for each amino acid type can be estimated at 25 °C from the PPII propensities as KPPII,i = (1 – PPPII,i. The importance of eq 3 is that it provides a second check on the ability of the protein denatured state to be described from the results of peptide studies. Peptide-measured ΔHPPII predicts the temperature dependence to KPPII,i , and thus PPPII,i, which can be used to model the temperature dependence to the mean Rh. Moreover, these two values, ΔHPPII and PPPII,i, give access to the entropy from the relationship (∂G/∂T)P = 〈S. For example, using the alanine PPII propensity from the peptide scale (0.37) and 10 kcal mol−1 for ΔHPPII yields ΔSPPII,ALA = 34.6 cal mol−1 K−1. Specifically, the thermodynamics of the PPII bias in the denatured state can be described from just ΔHPPII and PPPII,i.
Figure 4 shows the results from using three PPII propensity scales (i.e., peptide, IDP, and coil library) to model the temperature dependence to mean Rh and then compared to experimental values measured for retronuclease. Equation 2 and temperature-dependent PPPII,i, calculated by eq 3, were used to predict mean Rh from the primary sequence of the protein. The structure of retronuclease has been thoroughly characterized, and its mean hydrodynamic size was monitored over a broad temperature range using DLS, SEC, and analytical ultracentrifugation methods.46 CD spectroscopy performed with retronuclease indicates a PPII bias that is dominant at temperatures >25 °C and gradually weakens in response to increasing temperature. At cold temperatures (T < 25 °C), an additional bias for the α backbone conformation is apparent in the retronuclease CD spectrum.46
We observed that using 10 or 13 kcal mol−1 for ΔHPPII and the peptide scale, IDP scale, or coil library frequencies for the PPII propensities had only minor effects on the predicted mean Rh of retronuclease at temperatures >25 °C (Figure 4). The fractional number of residues in the PPII conformation from −50 to 100 °C is shown in the top inset, whereas the predicted change in mean Rh (ΔRh) at temperatures >25 °C is shown in the bottom inset. In the temperature range that CD spectroscopy suggests PPII is the dominant conformation (i.e., T >25 °C), the measured change in mean Rh compares favorably to the sequence-predicted change when computed using either of the peptide-measured values for ΔHPPII of ~10 or ~13 kcal mol−1 (bottom inset). This shows that the large ~7 Å reduction in retronuclease mean Rh owing to the 25–65 °C temperature change is in good quantitative agreement with experimental PPII propensities, ΔHPPII, and ΔSPPII.
Directly comparing experimental and sequence-predicted mean Rh at 5, 15, 35, and 45 °C, however, gave obvious differences (Figure 4). At 35 and 45 °C, measured mean Rh were larger than the predicted values, whereas at 5 and 15 °C, they were smaller. Our analysis of heat effects on retronuclease structure assumed that PPII is the lone dominant conformation in the denatured state, which is contradicted by CD results at low temperatures revealing the presence of α preferences.46 By including the effects of the α bias in our analysis of the denatured state hydrodynamic size, both the over and under predictions of mean Rh at 5, 15, 35, and 45 °C can be explained.
For comparison, analysis of the temperature dependence to mean Rh using p53(1–93) mutants is given in Figure S6 and reports phenomenologically similar behavior to retronuclease. DLS readings for these proteins gave low count rates and unreliable data at the temperature extremes, unfortunately. Despite large uncertainties associated with low and high temperatures measurements, the data show changes in mean Rh from 25 to 65 °C that mostly match the sequence-based predictions from PPII propensities and ΔHPPII. At temperatures between 25 and 65 °C, experimental mean Rh was usually larger than the sequence-based predictions, like retronuclease. At temperatures below 25 °C, mean Rh was typically lower than the predicted values.
Bias for the α Conformation in the Denatured State.
Preferential sampling of the main chain dihedral angles φ and Φ for values associated with α-helix can cause changes in the structural dimensions of the denatured state.36 Monitored from the population-weighted mean size, Rh ≈ 〈L〉/2, computer generated ensembles of unfolded structures that sample (φ, Φ) in the α region show compaction under modest α preferences and elongated sizes at higher α sampling rates. This is demonstrated in Figures 5 and S7. Specifically, when (<p, î) sampling in the α region is weakly preferred, the probability for contiguous stretches of residues in the α state is low and turn structures are more likely than the helical segments that form when the α bias is higher. Here, a residue is considered to be in the α conformation if its (φ, Φ is in the region centered at (−64°, −41°), defined in the panel 5A Ramachandran map, regardless of whether or not a main chain hydrogen bond is present, such as i → i + 4 (α-helix) or i → i + 3 (310-helix). A residue is in the PPII conformation if its (φ, Φ) is in the region centered at (−75°, +145°). fα and fPPII presented in Figure 5 were calculated from simulated ensembles and using the smaller 20° × 20° areas to define the extent of the α and PPII regions. Figure S7 shows that increasing the α and PPII regions to larger 50° × 50° areas yields qualitatively identical conclusions.
Because the effect of the α bias on the mean Rh of the denatured state can be accentuated by the PPII bias, whereby ensembles with high PPII propensities show increased sensitivity to changes in the α bias, the consequences of both the α and PPII biases on the mean Rh must be considered. For example, the average chain propensity for PPII in the IDP database is ~0.4 when estimated from sequence using the peptide scale. Thus, the IDP trend of mean Rh with the α bias should follow the red line in panel 5B. Likewise, the effect of the PPII bias on the mean Rh is codependent on the α bias, which is demonstrated in panel 5C. When PPII is the dominant conformation, the structural dimensions of the denatured state follow the relationship given by eq 1 (black line in panel 5C). If, instead, PPII is not the dominant conformation and moderate α preferences are present, then the relationship linking the PPII bias to the mean Rh changes. Specifically, the result from increasing the chain preference for α is to suppress the effect of the PPII bias on the mean Rh. When the α bias is stronger than the PPII bias (i.e., α is the dominant conformation), then the effect of the PPII bias is compaction.
Comparison of experimental mean Rh from IDPs to the simulation-derived curves of panel 5C confirms that PPII is the dominant backbone conformation of the protein denatured state in water. A few IDPs trend with the line signifying stronger than typical α preferences. These IDPs are β-amyloid(1–40), α-synuclein, CFTR-R-region, prothymosin-α, Cad136, and sml1. Also, most IDPs are found to have experimental mean Rh that are slightly larger than expected based upon the sequence-calculated value of fPPII. This suggests that amino acid preferences for PPII may be underestimated by the peptide scale and values for fPPII in the figure are shifted to the right. A similar conclusion can be made from fPPII calculated from sequence by the IDP and coil library scales (Figure S8). Both Kallenbach21 and Creamer22 report experimental amino acid PPII propensities that are larger than the peptide, IDP, and coil library scales, as given in Table S2. The possibility of a larger PPII bias cannot be eliminated by the present study because PPII effects on the mean Rh can be suppressed by the presence of an α bias. Establishing such details from the IDP sequence dependence to mean Rh relies on obtaining the sequence dependence to the α bias in the denatured state.
The idea that the experimental PPII propensities are underestimated by the peptide, IDP, and coil library scales possibly explains some of the retronuclease data, as shown in Figure 4. Underestimated bias for PPII gives underestimated predicted mean Rh at 35 and 45 °C. At 5 and 15 °C, the disagreement between theory and experiment is likely caused by the α bias that was detected in the retronuclease CD spectrum.46
To obtain the sequence dependence to both the α and PPII biases in the denatured state and test the above assumptions, the analysis of sequence effects on IDP mean Rh could be repeated at colder and warmer temperatures. This is shown schematically in Figure 1C. Higher temperatures reduce α effects on the mean Rh and isolate the effects of the PPII bias. Colder temperatures give access to the α bias. Just as the sequence dependence to the mean Rh at T > 25 °C yields the amino acid specific bias for PPII, from comparison to simulated coil values that omit PPII effects, the sequence dependence to the mean Rh at T < 25 °C can yield the amino acid bias for the α basin, from comparison to the theoretical treatment that omits α effects.
DISCUSSION
The α and PPII basins are the two most populated regions of the Ramachandran map when constructed from the protein coil library,32,34,35 a widely used denatured state model built from the segments of irregular structure found in the Protein Data Bank.33,81 Preferences for the α basin are likely from internal hydrogen bonds that stabilize turn and helical structures, whereas preferential solvation can explain the high levels of PPII.19 The protein coil library has been shown to reproduce the intrinsic conformational preferences of the amino acids for helix, sheet, and PPII,34 as well as the effects on the conformational preferences from neighboring residues.39,41 The sequence dependence to NMR coupling constants and chemical shifts as measured in peptides are also in close agreement with structural predictions based on the coil library.39,41 The role of the temperature in describing coil structure, however, is less understood. Heat indeed modulates coil populations. This is evidenced by the large temperature-dependent changes in hydrodynamic size exhibited by IDPs.43-46 Moreover, the ability of a protein to fold, phase separate,82 or recognize its binding partner 83 is also temperature dependent. The extent to which the α and PPII populations in the denatured state change in response to temperature changes is not understood, nor is it fully understood how subtle conformational biases in the denatured state are utilized to facilitate folding and function.
Herein, it was shown that the enthalpy, entropy, and magnitude of denatured state conformational bias can be elucidated from the analysis of heat effects on the mean Rh of the unfolded protein. Changes in the mean Rh resulting from compositional changes in the protein yield amino acid preferences for PPII that recapitulate intrinsic PPII propensities measured calorimetrically in unfolded peptides23 and those inferred from survey of the protein coil library.34 The denatured state follows the predictions of a PPII-dominant ensemble at temperatures >25 °C, while cold temperatures promote bias for the α basin of the Ramachandran map. Molecular simulation of the denatured state shows the population-weighted hydrodynamic size is codependent on the biases for both PPII and a,36 predicting that intrinsic α preferences and its thermodynamic character can be determined from low-temperature studies performed on unfolded protein. Specifically, the α bias at low temperatures is sufficiently pronounced that its magnitude can be measured from its effect on the mean Rh.
The ability to discern structural features of the denatured state from its hydrodynamic size has been historically controversial. In 6 M guanidinium chloride, the dimensions of denatured proteins are successfully predicted by the random coil model3,84 irrespective of numerous experiments showing they can maintain residual structures26,28 and native-like topologies.29,31 Fitzkee and Rose demonstrated that the random coil model can be insensitive to the presence of structured segments in an otherwise flexible and unfolded chain, even when the structured regions constitute >90% of the protein.85 We conjecture that the hydrodynamic size of the denatured state in water accurately reports on amino acid preferences for PPII for multiple reasons. First, PPII is indeed the dominant conformation in the denatured state, as long argued.86 Second, the PPII bias is locally determined52 and thus mostly insensitive to organizational details of the primary sequence. Third, the mean size scales exponentially by the power of the chain bias for extended PPII structures.48,49 And lastly, the strong yet variable effects on local backbone structure owing to the influence of neighboring residues34,39 are apparently minor when projected to the mean global size of the denatured state.
With regard to charge effects on the hydrodynamic size, context did not seem to matter. An analysis of sequencemodulation of mean Rh using the database found no correlation of the mean hydrodynamic size with charge organization in the primary sequence,36 despite the somewhat strong correlation with the net charge (R2 ≈ 0.6, Figure 2 panel B). We assume the database trends are the average from many structural environments that conveniently eliminate the influence of sequence context on the observed structural preferences. Whether or not this convenient averaging also extends to the intrinsic bias for the α basin in the denatured state, which should depend on sequence context since turn and helical structures are stabilized by intrachain contacts, remains to be established.
CONCLUSIONS
Proteins under biological conditions exhibit marginal structural stability. Consequently, the biological processes that are facilitated by protein macromolecules are indirectly, if not directly, reliant on the properties and energetic character of the denatured state. To better understand the structural preferences of the denatured state, we analyzed both the sequence dependence to the mean hydrodynamic size of disordered proteins and the impact of heat on the coil dimensions, showing that the sequence dependence and thermodynamic energies associated with intrinsic biases for the α and PPII backbone conformations can be obtained. Survey of the coil library reveals that the α and PPII basins of the Ramachandran map are the dominant structural features of the protein denatured state. An experimental system has been designed that, when combined with computer simulation of the denatured state ensemble, provides access to a detailed thermodynamic description of denatured protein structures capable of reproducing heat-induced modulation of the α and PPII populations.
Supplementary Material
ACKNOWLEDGMENTS
Supported by the National Institutes of Health grants R15GM115603 (S.T.W.) and R25GM102783 (South Texas Doctoral Bridge Program; N. M. J. Blake, B. O. Oyajobi, R. Walter), and the National Science Foundation grant DMR-1205670 (Texas State University PREM; W. J. Brittain).
Footnotes
Supporting Information
The Supporting Information is available free of charge on the ACS Publications website at DOI: 10.1021/acs.jpcb.9b09088.
Experimental mean Rh, N, sequence-calculated Qnet, sequence-calculated fPPII, and the primary sequence of each protein in the IDP database; substitution sites in p53(1–93) used to generate the mutant proteins; temperature-dependent CD spectrum for each p53(1–93) mutant; size-distributions measured by DLS and SEC chromatograms for each p53(1–93) mutant; amino acid specific averages from the randomly generated PPII propensity scales; experimental intrinsic PPII propensities; temperature dependent mean Rh from DLS for p53(1–93) mutants and comparison to simulation results; simulation results showing the effects of the PPII and α biases on the mean size of the denature state; and the comparison to experimental mean Rh from IDPs (PDF)
The authors declare no competing financial interest.
REFERENCES
- (1).Mirsky AE; Pauling L On the Structure of Native, Denatured, and Coagulated Proteins. Proc. Natl. Acad. Sci. U.S.A 1936, 22, 439–447. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (2).Flory PJ The Configuration of Real Polymer Chains. J. Chem. Phys 1949, 17, 303–310. [Google Scholar]
- (3).Tanford C Protein Denaturation. Adv. Protein Chem 1968, 23, 121–282. [DOI] [PubMed] [Google Scholar]
- (4).Dill KA; Shortle D Denatured States of Proteins. Annu. Rev. Biochem 1991, 60, 795–825. [DOI] [PubMed] [Google Scholar]
- (5).Shortle D The Denatured State (the Other Half of the Folding Equation) and Its Role in Protein Stability. FASEB J. 1996, 10, 27–34. [DOI] [PubMed] [Google Scholar]
- (6).Englander SW; Mayne L The Case for Defined Protein Folding Pathways. Proc. Natl. Acad. Sci. U.S.A 2017, 114, 8253–8258. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (7).Craig PO; Latzer J; Weinkam P; Hoffman RMB; Ferreiro DU; Komives EA; Wolynes PG Prediction of Native-State Hydrogen Exchange from Perfectly Funneled Energy Landscapes. J. Am. Chem. Soc 2011, 133, 17463–17472. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (8).Baldwin RL Clash between Energy Landscape Theory and Foldon-Dependent Protein Folding. Proc. Natl. Acad. Sci. U.S.A 2017, 114, 8442–8443. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (9).Sosnick TR; Barrick D The folding of single domain proteinshave we reached a consensus? Curr. Opin. Struct. Biol. 2011, 21, 12–24. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (10).Shortle D Staphylococcal Nuclease: A Showcase of m-Value Effects In Advances in Protein Chemistry; Anfinsen CB, Richards FM, Edsall JT, Eisenberg DS, Eds.; Protein Stability; Academic Press, 1995; Vol. 46, pp 217–247. [DOI] [PubMed] [Google Scholar]
- (11).Saavedra HG; Wrabl JO; Anderson JA; Li J; Hilser VJ Dynamic Allostery Can Drive Cold Adaptation in Enzymes. Nature 2018, 558, 324–328. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (12).Huber D; Cha M-I; Debarbieux L; Planson A-G; Cruz N; Lopez G; Tasayco ML; Chaffotte A; Beckwith J A Selection for Mutants That Interfere with Folding of Escherichia Coli Thioredoxin-1 in Vivo. Proc. Natl. Acad. Sci. U.S.A 2005, 102, 18872–18877. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (13).Prakash S; Tian L; Ratliff KS; Lehotzky RE; Matouschek A An Unstructured Initiation Site Is Required for Efficient Proteasome-Mediated Degradation. Nat. Struct. Mol. Biol 2004, 11, 830–837. [DOI] [PubMed] [Google Scholar]
- (14).Guo M; Xu Y; Gruebele M Temperature Dependence of Protein Folding Kinetics in Living Cells. Proc. Natl. Acad. Sci. U.S.A 2012, 109, 17863–17867. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (15).Schonbrun J; Dill KA Fast Protein Folding Kinetics. Proc. Natl. Acad. Sci. U.S.A 2003, 100, 12678–12682. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (16).Im H; Ahn H-Y; Yu M-H Bypassing the Kinetic Trap of Serpin Protein Folding by Loop Extension. Protein Sci. 2000, 9, 1497–1502. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (17).Shi Z; Chen K; Liu Z; Kallenbach NR Conformation of the Backbone in Unfolded Proteins. Chem. Rev 2006, 106, 1877–1897. [DOI] [PubMed] [Google Scholar]
- (18).Shi Z; Olson CA; Rose GD; Baldwin RL; Kallenbach NR Polyproline II Structure in a Sequence of Seven Alanine Residues. Proc. Natl. Acad. Sci. U.S.A 2002, 99, 9190–9195. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (19).Mezei M; Fleming PJ; Srinivasan R; Rose GD Polyproline II Helix Is the Preferred Conformation for Unfolded Polyalanine in Water. Proteins: Struct., Funct., Bioinf 2004, 55, 502–507. [DOI] [PubMed] [Google Scholar]
- (20).Pappu RV; Rose GD A simple model for polyproline II structure in unfolded states of alanine-based peptides. Protein Sci. 2009, 11, 2437–2455. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (21).Shi Z; Chen K; Liu Z; Ng A; Bracken WC; Kallenbach NR Polyproline II propensities from GGXGG peptides reveal an anticorrelation with -sheet scales. Proc. Natl. Acad. Sci. U.S.A 2005, 102, 17964–17968. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (22).Rucker AL; Pager CT; Campbell MN; Qualls JE; Creamer TP Host-Guest Scale of Left-Handed Polyproline II Helix Formation. Proteins: Struct., Funct., Bioinf 2003, 53, 68–75. [DOI] [PubMed] [Google Scholar]
- (23).Austin Elam W; Schrank TP; Campagnolo AJ; Hilser VJ Evolutionary conservation of the polyproline II conformation surrounding intrinsically disordered phosphorylation sites. Protein Sci 2013, 22, 405–417. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (24).Agashe VR; Shastry MCR; Udgaonkar JB Initial Hydrophobic Collapse in the Folding of Barstar. Nature 1995, 377, 754–757. [DOI] [PubMed] [Google Scholar]
- (25).Sadqi M; Lapidus LJ; Munoz V How Fast Is Protein Hydrophobic Collapse? Proc. Natl. Acad. Sci. U.S.A 2003, 100, 12117–12122. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (26).Kazmirski SL; Wong K-B; Freund SMV; Tan Y-J; Fersht AR; Daggett V Protein Folding from a Highly Disordered Denatured State: The Folding Pathway of Chymotrypsin Inhibitor 2 at Atomic Resolution. Proc. Natl. Acad. Sci. U.S.A 2001, 98, 4349–4354. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (27).Garcia P; Serrano L; Durand D; Rico M; Bruix M NMR and SAXS characterization of the denatured state of the chemotactic protein Che Y: Implications for protein folding initiation. Protein Sci. 2001, 10, 1100–1112. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (28).McCarney ER; Kohn JE; Plaxco KW Is There or Isn’t There? The Case for (and Against) Residual Structure in Chemically Denatured Proteins. Crit. Rev. Biochem. Mol. Biol. 2005, 40, 181–189. [DOI] [PubMed] [Google Scholar]
- (29).Yi Q; Scalley-Kim ML; Alm EJ; Baker D NMR Characterization of Residual Structure in the Denatured State of Protein L. J. Mol. Biol 2000, 299, 1341–1351. [DOI] [PubMed] [Google Scholar]
- (30).Shortle D; Ackerman MS Persistence of Native-Like Topology in a Denatured Protein in 8 M Urea. Science 2001, 293, 487–489. [DOI] [PubMed] [Google Scholar]
- (31).Ohnishi S; Lee AL; Edgell MH; Shortle D Direct Demonstration of Structural Similarity between Native and Denatured Eglin C. Biochemistry 2004, 43, 4064–4070. [DOI] [PubMed] [Google Scholar]
- (32).Swindells MB; MacArthur MW; Thornton JM Intrinsic φ,Ψ propensities of amino acids, derived from the coil regions of known structures. Nat. Struct. Biol 1995, 2, 596–603. [DOI] [PubMed] [Google Scholar]
- (33).Fitzkee NC; Fleming PJ; Rose GD The Protein Coil Library: A Structural Database of Nonhelix, Nonstrand Fragments Derived from the PDB. Proteins: Struct., Funct., Bioinf 2005, 58, 852–854. [DOI] [PubMed] [Google Scholar]
- (34).Jha AK; Colubri A; Zaman MH; Koide S; Sosnick TR; Freed KF Helix, Sheet, and Polyproline II Frequencies and Strong Nearest Neighbor Effects in a Restricted Coil Library. Biochemistry 2005, 44, 9691–9702. [DOI] [PubMed] [Google Scholar]
- (35).Perskie LL; Street TO; Rose GD Structures, Basins, and Energies: A Deconstruction of the Protein Coil Library. Protein Sci. 2008, 17, 1151–1161. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (36).English LR; Tilton EC; Ricard BJ; Whitten ST Intrinsic α helix propensities compact hydrodynamic radii in intrinsically disordered proteins. Proteins: Struct., Funct., Bioinf 2017, 85, 296–311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (37).Marsh JA; Forman-Kay JD Sequence Determinants of Compaction in Intrinsically Disordered Proteins. Biophys. J 2010, 98, 2383–2390. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (38).Griffiths-Jones SR; Sharman GJ; Maynard AJ; Searle MS Modulation of intrinsic φ,Ψ propensities of amino acids by neighbouring residues in the coil regions of protein structures: NMR analysis and dissection of a β-hairpin peptide 1 1 Edited by P. E. Wright. J. Mol. Biol 1998, 284, 1597–1609. [DOI] [PubMed] [Google Scholar]
- (39).Smith LJ; Bolin KA; Schwalbe H; MacArthur MW; Thornton JM; Dobson CM Analysis of Main Chain Torsion Angles in Proteins: Prediction of NMR Coupling Constants for Native and Random Coil Conformations. J. Mol. Biol 1996, 255, 494–506. [DOI] [PubMed] [Google Scholar]
- (40).Mantsyzov AB; Shen Y; Lee JH; Hummer G; Bax A MERA: A Webserver for Evaluating Backbone Torsion Angle Distributions in Dynamic and Disordered Proteins from NMR Data. J. Biomol. NMR 2015, 63, 85–95. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (41).Shen Y; Roche J; Grishaev A; Bax A Prediction of Nearest Neighbor Effects on Backbone Torsion Angles and NMR Scalar Coupling Constants in Disordered Proteins. Protein Sci. 2018, 27, 146–158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (42).Eliezer D Biophysical Characterization of Intrinsically Disordered Proteins. Curr. Opin. Struct. Biol 2009, 19, 23–30. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (43).Kjaergaard M; Nørholm A-B; Hendus-Altenburger R; Pedersen SF; Poulsen FM; Kragelund BB Temperature-dependent structural changes in intrinsically disordered proteins: Formation of α-helices or loss of polyproline II? Protein Sci. 2010, 19, 1555–1564. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (44).Wuttke R; Hofmann H; Nettels D; Borgia MB; Mittal J; Best RB; Schuler B Temperature-Dependent Solvation Modulates the Dimensions of Disordered Proteins. Proc. Natl. Acad. Sci. U.S.A 2014, 111, 5213–5218. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (45).Langridge TD; Tarver MJ; Whitten ST Temperature Effects on the Hydrodynamic Radius of the Intrinsically Disordered N-Terminal Region of the P53 Protein. Proteins: Struct., Fund., Bioinf 2014, 82, 668–678. [DOI] [PubMed] [Google Scholar]
- (46).English LR; Tischer A; Demeler AK; Demeler B; Whitten ST Sequence Reversal Prevents Chain Collapse and Yields Heat-Sensitive Intrinsic Disorder. Biophys. J 2018, 115, 328–340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (47).Cowan PM; McGavin S Structure of Poly-L-Proline. Nature 1955, 176, 501–503. [Google Scholar]
- (48).Perez RB; Tischer A; Auton M; Whitten ST Alanine and Proline Content Modulate Global Sensitivity to Discrete Perturbations in Disordered Proteins. Proteins: Struct., Funct., Bioinf 2014, 82, 3373–3384. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (49).Tomasso ME; Tarver MJ; Devarajan D; Whitten ST Hydrodynamic Radii of Intrinsically Disordered Proteins Determined from Experimental Polyproline II Propensities. PLoS Comput. Biol 2016, 12, No. e1004686. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (50).Dunker AK; Obradovic Z; Romero P; Garner EC; Brown CJ Intrinsic protein disorder in complete genomes. Genome Informatics. International Conference on Genome Informatics, 2000; Vol. 11, pp 161–171. [PubMed] [Google Scholar]
- (51).Chen IK; Liu Z; IKallenbach NR The Polyproline II Conformation in Short Alanine Peptides Is Noncooperative. Proc. Natl. Acad. Sci. U.S.A. 2004, 101, 15352–15357. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (52).Creamer TP Left-handed polyproline II helix formation is (very) locally driven. Proteins: Struct., Funct, Bioinf 1998, 33, 218–226. [PubMed] [Google Scholar]
- (53).Laue TM; Shah BD; Ridgeway TM; Pelletier SL Computer-Aided Interpretation of Analytical Sedimentation Data for Proteins Analytical Ultracentrifugation in Biochemistry and Polymer Science; Royal Society of Chemistry, 1992, pp 90–125. [Google Scholar]
- (54).Knuth DE The Art of Computer Programming: Seminumerical Algorithms, 2nd ed.; Addison-Wesley: Reading, MA, 1981; Vol. 2. [Google Scholar]
- (55).Whitten ST; Yang H-W; Fox RO; Hilser VJ Exploring the Impact of Polyproline II (PII) Conformational Bias on the Binding of Peptides to the SEM-5 SH3 Domain. Protein Sci. 2008, 17, 1200–1211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (56).Jeffreys H; Jeffreys BS Methods of Mathematical Physics; Cambridge University Press: New York, 1950. [Google Scholar]
- (57).Momany FA; McGuire RF; Burgess AW; Scheraga HA Energy Parameters in Polypeptides. VII. Geometric Parameters, Partial Atomic Charges, Nonbonded Interactions, Hydrogen Bond Interactions, and Intrinsic Torsional Potentials for the Naturally Occurring Amino Acids. J. Phys. Chem 1975, 79, 2361–2381. [Google Scholar]
- (58).Mandel N; Mandel G; Trus BL; Rosenberg J; Carlson G; Dickerson RE Tuna cytochrome C at 2.0 A resolution. III. Coordinate optimization and comparison of structures. J. Biol. Chem 1977, 252, 4619–4636. [PubMed] [Google Scholar]
- (59).Ramachandran GN; Ramakrishnan C; Sasisekharan V Stereochemistry of Polypeptide Chain Configurations. J. Mol. Biol 1963, 7, 95–99. [DOI] [PubMed] [Google Scholar]
- (60).Iijima H; Dunbar JB; Marshall GR Calibration of Effective van der Waals Atomic Contact Radii for Proteins and Peptides. Proteins: Struct., Funct., Bioinf 1987, 2, 330–339. [DOI] [PubMed] [Google Scholar]
- (61).Baldwin RL Temperature Dependence of the Hydrophobic Interaction in Protein Folding. Proc. Natl. Acad. Sci. U.S.A 1986, 83, 8069–8072. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (62).Murphy KP; Bhakuni V; Xie D; Freire E Molecular Basis of Co-Operativity in Protein Folding: III. Structural Identification of Cooperative Folding Units and Folding Intermediates. J. Mol. Biol 1992, 227, 293–306. [DOI] [PubMed] [Google Scholar]
- (63).Murphy KP; Freire E Thermodynamics of Structural Stability and Cooperative Folding Behavior in Proteins In Advances in Protein Chemistry; Anfinsen CB, Richards FM, Edsall JT, Eisenberg DS, Eds.; Academic Press, 1992; Vol. 43, pp 313–361. [DOI] [PubMed] [Google Scholar]
- (64).Xie D; Freire E Structure Based Prediction of Protein Folding Intermediates. J. Mol. Biol 1994, 242, 62–80. [DOI] [PubMed] [Google Scholar]
- (65).Lee KH; Xie D; Freire E; Amzel LM Estimation of Changes in Side Chain Configurational Entropy in Binding and Folding: General Methods and Application to Helix Formation. Proteins: Struct., Funct., Bioinf 1994, 20, 68–84. [DOI] [PubMed] [Google Scholar]
- (66).Gómez J; Hilser VJ; Xie D; Freire E The Heat Capacity of Proteins. Proteins: Struct., Fund., Bioinf 1995, 22, 404–412. [DOI] [PubMed] [Google Scholar]
- (67).Alejandro D’Aquino J; Gomez J; Hilser VJ; Lee IK H.; Amzel, L. M.; Freire, E. The Magnitude of the Backbone Conformational Entropy Change in Protein Folding. Proteins: Struct., Funct., Genet 1996, 25, 143–156. [DOI] [PubMed] [Google Scholar]
- (68).Habermann SM; Murphy IK P. Energetics of Hydrogen Bonding in Proteins: A Model Compound Study. Protein Sci. 1996, 5, 1229–1239. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (69).Luque I; Mayorga OL; Freire E Structure-Based Thermodynamic Scale of α-Helix Propensities in Amino Acids. Biochemistry 1996, 35, 13681–13688. [DOI] [PubMed] [Google Scholar]
- (70).Hynes TR; Fox RO The crystal structure of staphylococcal nuclease refined at 1.7 Å resolution. Proteins: Struct., Funct., Genet 1991, 10, 92–105. [DOI] [PubMed] [Google Scholar]
- (71).Saito R; Sato T; Ikai A; Tanaka N Structure of Bovine Carbonic Anhydrase II at 1.95 A Resolution. Acta Crystallogr., Sect. D: Biol. Crystallogr 2004, 60, 792–795. [DOI] [PubMed] [Google Scholar]
- (72).Zahran ZN; Chooback L; Copeland DM; West AH; Richter-Addo GB Crystal Structures of Manganese- and Cobalt-Substituted Myoglobin in Complex with NO and Nitrite Reveal Unusual Ligand Conformations. J. Inorg. Biochem 2008, 102, 216–233. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (73).Stein PE; Leslie AGW; Finch JT; Turnell WG; McLaughlin PJ; Carrell RW Crystal Structure of Ovalbumin as a Model for the Reactive Centre of Serpins. Nature 1990, 347, 99–102. [DOI] [PubMed] [Google Scholar]
- (74).Uversky VN Natively Unfolded Proteins: A Point Where Biology Waits for Physics. Protein Sci. 2002, 11, 739–756. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (75).Sreerama N; Venyaminov SY; Woody RW Estimation of Protein Secondary Structure from Circular Dichroism Spectra: Inclusion of Denatured Proteins with Native Proteins in the Analysis. Anal. Biochem 2000, 287, 243–251. [DOI] [PubMed] [Google Scholar]
- (76).Muller-Spath S; Soranno A; Hirschfeld V; Hofmann H; Ruegger S; Reymond L; Nettels D; Schuler B Charge Interactions Can Dominate the Dimensions of Intrinsically Disordered Proteins. Proc. Natl. Acad. Sci. U.S.A 2010, 107, 14609–14614. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (77).Mao AH; Crick SL; Vitalis A; Chicoine CL; Pappu RV Net Charge per Residue Modulates Conformational Ensembles of Intrinsically Disordered Proteins. Proc. Natl. Acad. Sci. U.S.A 2010, 107, 8183–8188. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (78).Das RK; Pappu RV Conformations of Intrinsically Disordered Proteins Are Influenced by Linear Sequence Distributions of Oppositely Charged Residues. Proc. Natl. Acad. Sci. U.S.A 2013, 110, 13392–13397. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (79).Holehouse AS; Das RK; Ahad JN; Richardson MOG; Pappu RV CIDER: Resources to Analyze Sequence-Ensemble Relationships of Intrinsically Disordered Proteins. Biophys. J 2017, 112, 16–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (80).Lim WA; Richards FM; Fox RO Structural Determinants of Peptide-Binding Orientation and of Sequence Specificity in SH3 Domains. Nature 1994, 372, 375–379. [DOI] [PubMed] [Google Scholar]
- (81).Jha A. Ik. ; Colubri A; Freed IKF; Sosnick TR Statistical Coil Model of the Unfolded State: Resolving the Reconciliation Problem. Proc. Natl. Acad. Sci. U.S.A 2005, 102, 13099–13104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (82).Dignon GL; Zheng W; Kim YC; Mittal J Temperature-Controlled Liquid–Liquid Phase Separation of Disordered Proteins. ACS Cent. Sci. 2019, 5, 821–830. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (83).Hamburger JB; Ferreon JC; Whitten ST; Hilser VJ Thermodynamic Mechanism and Consequences of the Polyproline II (PII) Structural Bias in the Denatured States of Proteins. Biochemistry 2004, 43, 9790–9799. [DOI] [PubMed] [Google Scholar]
- (84).Kohn JE; Millett IS; Jacob J; Zagrovic B; Dillon TM; Cingel N; Dothager RS; Seifert S; Thiyagarajan P; Sosnick TR; et al. Random-Coil Behavior and the Dimensions of Chemically Unfolded Proteins. Proc. Natl. Acad. Sci. U.S.A 2004, 101, 12491–12496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (85).Fitzkee NC; Rose GD Reassessing Random-Coil Statistics in Unfolded Proteins. Proc. Natl. Acad. Sci. U.S.A 2004, 101, 12497–12502. [DOI] [PMC free article] [PubMed] [Google Scholar]
- (86).Tiffany ML; Krimm S New Chain Conformations of Poly(Glutamic Acid) and Polylysine. Biopolymers 1968, 6, 1379–1382. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.