Skip to main content
Biophysical Journal logoLink to Biophysical Journal
. 2012 Apr 18;102(8):1969–1978. doi: 10.1016/j.bpj.2012.03.049

Scaling Properties of Glycine-Rich Sequences in Guanidine Hydrochloride Solutions

Michaela L Finnegan 1, Bruce E Bowler 1,
PMCID: PMC3328712  PMID: 22768954

Abstract

The intrinsic polymer properties of glycine-rich sequences are evaluated with a set of iso-1-cytochrome c variants with N-terminal inserts of the sequence (GGGGGK)n for n = 1–5. The thermodynamics and kinetics of His-heme loop formation are measured as a function of guanidine hydrochloride (GdnHCl) concentration for loop sizes ranging from 22 to 46 residues. The scaling exponent for loop formation, ν3, evaluated using the Jacobson-Stockmayer equation is near 1.8, at 1.5 and 3.0 M GdnHCl, but it increases to 2.2 in 6.0 M GdnHCl. Previous work on a set of iso-1-cytochrome c variants with (AAAAAK)n inserts gave ν3 = 2.2 for alanine-rich sequences in both 3.0 and 6.0 M GdnHCl. Chain stiffness was evaluated from the relative magnitude of Flory's characteristic ratio, Cn, for alanine-rich versus glycine-rich sequences. In 3.0 M GdnHCl, Cn(Ala)/Cn(Gly) is 1.6, decreasing to 1.3 in 6.0 M GdnHCl. The data suggest that solvent-backbone interactions dominate polypeptide conformational properties under good solvent conditions whereas side-chain-dependent properties are more important under poor solvent conditions. The results provide a direct experimental assessment in terms of polymer properties of the distinct roles of Gly versus Ala in the folding code.

Introduction

Ever since Anfinsen (1) demonstrated that the amino-acid sequence of a protein is sufficient to define the structure of a protein, there has been considerable interest in cracking the folding code that translates the primary structure of a protein into its tertiary structure. Much progress has been made in predicting protein structure from amino-acid sequence. However, when a structure for a protein with a similar amino-acid sequence does not exist, the results of structure prediction methods remain unsatisfactory (2). Thus, further understanding of the intrinsic properties of amino acids when incorporated into polypeptides is essential to provide a basis for predicting sequence-dependent structural biases.

Measurements of the diffusion-limited rate constant for first contact, kc, between monomers at the ends of a polypeptide chain suggest that sequence effects on kc are minor, except for proline and glycine (3–7). However, the results indicated that glycine content may impact the scaling properties of loop formation in a polypeptide as given by the Jacobson-Stockmayer equation (8,9),

ΔSloop=υ3Rln(n)+Rln((32πCn2)ν3Vi), (1)

where ν3 is the scaling exponent for loop formation, R is the gas constant, n is the number of monomers in the loop (loop size), Cn is Flory's characteristic ratio for a loop containing n monomers, is the distance between monomers, and Vi is the approach volume of the atoms involved in loop formation. In water, a set of poly(GlySer) polypeptides yielded ν3 = 1.72 ± 0.08, whereas poly(Ser) polypeptides gave ν3 = 2.1 ± 0.3 (3,5). Both these values are in the range expected for a freely joint random coil, ν3 = 1.5 (9,10), or a random coil with excluded volume, ν3 = 1.8–2.4 (10–12). Unfortunately, the length of the poly(Ser) polypeptides was limited to 12 residues or less, thus the large error in ν3 for the poly(Ser) data.

Simulations of polyalanine and polyglycine conformational ensembles have been carried out using a hard-sphere potential (13). The dependence of the radius of gyration, Rg, on chain length, n, for chains from 50 to 500 residues long was evaluated from these ensembles (Rg = Ronν). The scaling exponents, ν, for polyglycine and polyalanine were identical (0.60 ± 0.01 and 0.61 ± 0.01, respectively), and consistent with the excluded volume limit of 0.6 for a random coil (10). Thus, simulations of polyalanine and polyglycine suggest that glycine does not affect the scaling properties of a polypeptide in contrast to the apparent, but uncertain, difference observed in experimental studies on the loop size dependence of kc (3,5).

Glycine is a particularly important amino acid. Its small side chain allows greater flexibility for the main chain, providing access to ϕ,ψ dihedral angles about the α-carbon not accessible with other amino-acid side chains (14). As a result, glycine is important in chain reversals, such as β-turns, that are essential for compact globular protein structures (15,16). The role of glycine in the folding code is no doubt prominent. Thus, a detailed understanding of the effects of glycine on the properties of a polypeptide chain is essential.

To address the uncertainty in the literature over whether glycine affects the scaling properties of a polypeptide chain, we have now prepared a set of iso-1-cytochrome c (iso-1-Cytc) variants with predominately glycine inserts of variable length (Gly-rich variants, see Fig. 1) inserted into the flexible N-terminus of the protein (17). We analyze the polymer properties of these Gly-rich variants with our denatured state His-heme loop formation method (18–26), which allows measurement of both the equilibrium and kinetics of loop formation as a function of loop size. From the loop size dependence of loop formation, we are able to extract a fundamental polymer property, the scaling exponent for loop formation, ν3. We also gain insight into chain stiffness. Comparison of ν3 for our Gly-rich variants with our previous work on Ala-rich variants (19) shows that ν3 is similar for the Ala-rich and Gly-rich sequences in 6.0 M guanidine hydrochloride (GdnHCl), suggesting that the conformational properties of polymers of Ala and Gly are similar in a good solvent. By contrast, ν3 diverges for the Ala-rich and Gly-rich sequences at 3.0 M GdnHCl. Thus, the conformational differences of Gly-rich and Ala-rich sequences are more pronounced in a poorer solvent. Comparison of His-heme loop formation for the Gly-rich and Ala-rich variants at the same loop size allows evaluation of the relative magnitude of Flory's characteristic ratio, Cn, for Gly and Ala, a measure of chain stiffness. We find that the relative magnitude of Cn is more similar than expected from standard polymer theories.

Figure 1.

Figure 1

Schematic representation of histidine-heme loop formation with the NH5G-X variants under denaturing conditions.

Materials and Methods

Preparation of variants

All variants contain the following background mutations: C102S to prevent intermolecular disulfide bond formation during physical studies; H26N, H33N, and H39Q to eliminate all naturally occurring histidines in yeast iso-1-Cytc besides the heme ligand, His18; and a K(−2)H mutation to introduce the unique histidine that forms the loop in the denatured state of each of the Gly-rich variants. The NH5G-1 variant was prepared as previously described in Tzul and Bowler (21) and Tzul et al. (22). The NH5G-2 through NH5G-5 variants were prepared by sequential insertion mutagenesis (QuikChange PCR-based mutagenesis; Agilent Technologies, Santa Clara, CA) of DNA sequences into the pBTR1 vector (27) corresponding to the amino-acid sequence GGGGGK (see Table S1 in the Supporting Material for oligonucleotide sequences). The presence of each insert was confirmed by sequencing the iso-1-Cytc gene (Murdock DNA Sequencing Facility, University of Montana, Missoula, MT).

Expression and purification of variants

Escherichia coli BL21-DE3 cells (Novagen, Madison, WI) were used to express all variants from the pBTR1 vector, which coexpresses iso-1-Cytc and heme lyase allowing covalent attachment of the heme-group to iso-1-Cytc within the cytoplasm (28). Protein yields ranged from 5 to 20 mg from 2 L of 2xYT broth grown for 16–18 h at 37°C. Variants that contained fewer inserts tended to give higher yields. Yields for variants with more inserts could be increased by growing cultures for 24 h at 30°C. Isolation and purification of the Gly-rich variants were carried out as described previously (21–23). Before any of the following experiments, the protein was reduced using a small amount of sodium dithionite and purified by HPLC using a UNO S6 cation exchange column (BioRad, Hercules, CA) as described previously (22,29). The Gly-rich variants were then oxidized with a small amount of potassium ferricyanide for 1 h at +4°C. The protein was then separated from the oxidant using G-25 Sephadex beads (Sigma Chemical, St Louis, MO) equilibrated to buffer appropriate to the experiment to follow.

Protein stability measurements

GdnHCl denaturation was monitored by circular dichroism using an Chirascan spectrometer (Applied Photophysics, Leatherhead, UK) linked to a MICROLAB 500 titrator (Hamilton, Reno, NV), as described previously (21). Data at 250 nm and 222 nm were measured as a function of GdnHCl concentration in the presence of 20 mM Tris, 40 mM NaCl, and 1 mM EDTA at pH 7 and 25°C. The ellipticity at 250 nm was subtracted from the ellipticity at 222 nm at each GdnHCl concentration to correct for baseline drift during the GdnHCl titration. The free energy in the absence of GdnHCl, ΔGuo′(H2O), the GdnHCl m-value, and the denaturation midpoint, Cm, were obtained by nonlinear least squares fits of the baseline-corrected ellipticity at 222 nm versus GdnHCl concentration assuming a linear free energy relationship as described previously (25). Because iso-1-Cytc with homopolymeric inserts can be susceptible to backbone cleavage (22), molecular weights of all variants were checked before and after each experiment using matrix-assisted laser desorption/ionization time-of-flight mass spectrometry. The mass spectral data showed that there was no backbone cleavage for any of the variants during experimental measurements when the proteins are handled in buffers containing EDTA (21,22).

Equilibrium measurements of His-heme loop formation in the denatured state

His-heme loop formation under denaturing conditions (1.5 M, 3.0 M, and 6.0 M GdnHCl) at 3 μM protein concentration was measured in the presence of 5 mM Na2HPO4, 15 mM NaCl, and 1.0 mM EDTA by pH titration monitored with the heme Soret absorbance band (using a DU 800 spectrophotometer; Beckman Coulter, Brea, CA), as described previously (24). Data at 398 nm were plotted versus pH and fit using a modified form of the Henderson-Hasselbalch equation, as described previously (24,30) giving the apparent pKa, pKa(obs), and the number of protons, np, needed to titrate off the histidine. Three trials were run for each variant at room temperature, 20 ± 2°C.

Kinetics of loop breakage in the denatured state

Loop breakage kinetics data were measured using stopped-flow mixing methods (using an SX-20 spectrometer; Applied Photophysics), monitored at 398 nm. Downward pH jumps were conducted under denaturing conditions at either 3.0 M or 6.0 M GdnHCl at 25°C. The starting buffer was 10 mM MES with 1.0 mM EDTA at pH 6.2 containing 6 μM protein. This buffer was mixed 1:1 with 100 mM citrate containing 1.0 mM EDTA at either pH 3.5 or pH 3.0. The ending pH was measured from the effluent. The ending pH for the pH 3.5 experiments was 3.57 ± 0.01 in 3.0 M GdnHCl and 3.5 ± 0.1 in 6.0 M GdnHCl. The ending pH for the pH 3.0 experiments was 3.24 ± 0.06 in 3.0 M GdnHCl and 3.22 ± 0.05 in 6.0 M GdnHCl. Data were fit to a single-exponential rise to maximum equation and the rate constant for loop breakage, kb,His, extracted from the fit. Double-exponential fits were attempted and residuals indicated a better fit. However, the standard deviation of the magnitude of the faster rate constant was large. The seemingly better fit of the double-exponential is probably due to fitting a pressure or flow artifact. Thus, we consider the single-exponential fit to be adequate.

Results

Design of variants

To produce a set of Gly-rich variants complementary to our previously reported Ala-rich variants (19), we have inserted (GGGGGK)n segments for n = 1–5 between an engineered histidine at sequence position −2 (i.e., K(−2)H mutation; we use the sequence numbering convention for horse cytochrome c, therefore, the first five residues of iso-1-Cytc are numbered −5 to −1) and Ala(−1) (Fig. 1). Using this set of variants, we can probe the conformational properties of Gly-rich sequences from 6 to 30 residues in length (loop sizes of 22–46 residues). A lysine is included at the end of each six-residue segment to maintain aqueous solubility. The effect of charge on loop stability is potentially a concern. However, it has long been known that GdnHCl neutralizes electrostatic effects on protein stability (31) and it has also been shown that 750 mM NaCl is sufficient to disrupt electrostatic residual structure in the denatured state (32). Thus, electrostatics should have minimal effect on His-heme loop formation with the Gly-rich variants.

The variants are named NH5G-X, where X is an integer, which denotes the number of (GGGGGK) segments following His(−2). The NH5G-X variants have had His26, His33, and His39, in wild-type iso-1-Cytc mutated to Asn, Asn, and Gln, respectively (26). Thus, His(−2) forms the His-heme loop in the denatured state of the NH5G-X variants.

For both the Gly-rich and the Ala-rich variants, the 15 amino acids at the C-terminal end of the loop (closest to the CXXCH motif that attaches the heme to the iso-1-Cytc polypeptide) come from the natural sequence of iso-1-Cytc. This sequence contains no prolines but does contain two glycines (12.5% glycine). The N-terminal helix of iso-1-Cytc is essential for stability, so this segment is necessary for expression of these variants. Thus, the equilibria and kinetics of His-heme loop formation for variants with fewer inserts might experience some perturbation due to this 15-residue segment. We discuss this issue further in the context of the data presented below.

His-heme loop formation in the denatured state

The heme group of iso-1-Cytc is covalently attached through Cys14, Cys17, and His18 (Fig. 1). When deprotonated, His(−2) binds to the heme under denaturing conditions to form a loop. Because His-heme binding is linked to deprotonation of His(−2), a simple pH titration monitored at the heme Soret band near 400 nm is used to monitor denatured state loop formation. An apparent pKa, pKa(obs), is obtained from the data. pKa(obs) can be broken down into two steps,

pKa(obs)=pKa(HisH+)+pKloop(His), (2)

where pKa(HisH+) is the intrinsic pKa of the histidine and pKloop(His) is the pK (= −logK) of His-heme loop formation for a fully deprotonated histidine under denaturing conditions. We have shown that pKa(HisH+) = 6.6 ± 0.1 for histidine, irrespective of sequence position or GdnHCl concentration (24). Thus, we treat pKa(HisH+) as a physical constant and obtain pKloop(His) by subtracting pKa(HisH+) from pKa(obs).

Assuming random coil behavior, we can evaluate the loop size dependence of pKloop(His) in terms of the expression for ΔSloop in Eq. 1, yielding

pKloop(His)=pKloop(His)ref+v3Log(n), (3)

where pKloop(His)ref is the pKloop(His) for a loop size of n = 1. Thus, for a random coil, pKloop(His) should depend linearly on Log(n) with a slope equal to the scaling exponent, ν3. Equation 3 assumes that the His-heme bond enthalpy is the same for all loop sizes. This enthalpic term is incorporated into pKloop(His)ref. Thus, the variation in pKloop(His) with loop size is assumed to result entirely from the unfavorable entropy of constraining a histidine that is n monomers distant from the heme to be near enough to the heme to form a bond.

Stability of variants

To determine the GdnHCl concentration needed to achieve the denatured state, we measured the global stability of all variants using circular-dichroism-monitored GdnHCl denaturation methods (see Fig. S1 in the Supporting Material). We find that all NH5G-X variants are marginally stable with free energies of unfolding in the absence of denaturant, ΔGo′(H2O), ranging from 1.7 kcal mol−1 to 2.1 kcal mol−1 (Table 1). The stabilities of the Gly-rich variants are similar to those of our previously reported Ala-rich variants (19).

Table 1.

Thermodynamic parameters for GdnHCl unfolding at 25°C and pH 7 for iso-1-cytochrome c variants

Variant ΔGo′(H2O) (kcal mol−1) m-value (kcal mol−1 M−1) Cm (M)
NH5G-1 1.7 ± 0.4 4.3 ± 1.0 0.40 ± 0.06
NH5G-2 2.2 ± 0.1 4.5 ± 0.1 0.48 ± 0.03
NH5G-3 2.1 ± 0.2 3.5 ± 0.2 0.60 ± 0.01
NH5G-4 1.7 ± 0.1 2.6 ± 0.2 0.67 ± 0.04
NH5G-5 2.1 ± 0.1 3.3 ± 0.1 0.65 ± 0.02

All parameters are the average and standard deviation of three trials.

Thermodynamic parameters are taken from Tzul et al. (22), in which this variant was referred to as Gly5.

The His-heme loop stabilizes the denatured state. Because the stability of the loop is expected to decrease as loop size increases, a progressive increase in global stability is expected as the length of the Gly-rich insert increases. The unfolding midpoints, Cm, show the expected trend (Table 1). However, ΔGo′(H2O) does not mirror this trend likely due to the short native baseline, which limits our ability to confidently evaluate this parameter. Most importantly, the data in Table 1 (and see Fig. S1) show that all NH5G-X variants are fully unfolded at 1.5, 3.0, and 6.0 M GdnHCl, the conditions used here for denatured state loop formation.

Equilibrium loop formation in the denatured state

His-heme loop formation was measured by pH titration under denaturing conditions (Fig. 2). Spectra as a function of pH show a single isosbestic point at 404 nm (Fig. 2, inset) consistent with the two-state process outlined in Fig. 1. Equilibrium parameters for pH titrations at 1.5, 3.0, and 6.0 M GdnHCl are reported in Table S2 and Fig. 3. The number of protons, np, needed to break the His-heme loop is near one for all variants (Table S2), as expected for the 1 H+ process in Fig. 1.

Figure 2.

Figure 2

Plot of absorbance at 398 nm, A398, versus pH for the NH5G-4 variant in 3.0 M GdnHCl at 20 ± 2°C. (Solid line) Fit of the data to a modified version of the Henderson-Hasselbalch equation as described in the Materials and Methods. (Inset) Absorbance spectra from 350 nm to 450 nm for pH values from 7 to 3 in ∼0.2 pH increments. To the left of the isosbestic point at 404 nm, absorbance increases as pH decreases.

Figure 3.

Figure 3

(A) Plots of pKloop(His) versus loop size, n (scale is logarithmic), for the Gly-rich variants at 1.5 M (▾), 3.0 M (■), and 6.0 M (●) GdnHCl and the Ala-rich variants at 6.0 M GndHCl (○). Data point for the variant with no insert (n = 16) at 6.0 M GdnHCl (Δ). (B) Comparison of pKloop(His) versus loop size, n (scale is logarithmic), for the Gly-rich (■) versus the Ala-rich (□) variants at 3.0 M GdnHCl. Data point for the variant with no insert (n = 16) at 3.0 M GdnHCl (∇). In both panels, the Gly-rich variants (solid lines) and Ala-rich variants (dashed lines) are fits of Eq. 3 to the data for n ≥ 28. The values of ν3 obtained from these fits are reported in Table 2. pKloop(His) was calculated using Eq. 2 and data from Table S2 in the Supporting Material using pKa(HisH+) = 6.6. Data for the Ala-rich and no-insert variants are taken from Tzul and Bowler (19).

The pKloop(His) becomes less negative (less favorable loop) as loop size, n, increases for the Gly-rich variants at 1.5, 3.0, and 6.0 M GdnHCl (Fig. 3), consistent with the less favorable entropy of forming larger loops. Data from our previously published work on Ala-rich variants at 3.0 and 6.0 M GdnHCl are shown in Fig. 3 for comparison (19). We have previously observed that pKloop(His) at 3.0 M GdnHCl is the same for variants with either a single (GGGGGK) insert or a single (AAAAAK) insert between His(−2) and Ala(−1) (22). The same is true at 6.0 M GdnHCl (Fig. 3). Thus, for a single insert the 15 residues of native sequence nearest the heme dominate the properties of the 22-residue loop formed. With two or more inserts, the data for Ala-rich and Gly-rich variants are clearly differentiated, with equilibrium loop formation uniformly more favorable for the Gly-rich variants than for the Ala-rich variants. Thus, in evaluating ν3 with Eq. 3, we use only data for variants with two or more (GGGGGK) or (AAAAAK) inserts (Fig. 3). Scaling exponents collected in Table 2 for both Gly-rich and Ala-rich variants are in the range expected for a random coil with excluded volume (ν3 = 1.8–2.4) (10–12). We note that at 3.0 M GdnHCl, pKloop(His) for iso-1-Cytc with no insert (n = 16) and for a single (AAAAAK) or (GGGGGK) insert is consistent with the line obtained from fitting the data for the Gly-rich variants with two or more inserts to Eq. 3 (Fig. 3 B). By contrast, at 6.0 M GdnHCl, pKloop(His) for no insert (n = 16) and for a single (AAAAAK) or (GGGGGK) insert is consistent with the line obtained from fitting the data for the Ala-rich variants with two or more inserts to Eq. 3 (Fig. 3 A).

Table 2.

Scaling exponents, ν3, as a function of GdnHCl concentration for Gly-rich versus Ala-rich variants

GdnHCl concentration Scaling exponent, ν3
Gly-rich Ala-rich
Equilibrium data
1.5 1.88 ± 0.11
3.0 1.61 ± 0.16 2.01 ± 0.10
6.0 2.28 ± 0.08 1.91 ± 0.11
Kinetic data
3.0 1.67 ± 0.18 2.17 ± 0.16
6.0 2.24 ± 0.08 2.18 ± 0.14

Reported errors are standard errors of the fits to data in Figs. 3 and 4.

Kinetics of His-heme loop formation in the denatured state

The kinetics of His-heme loop formation and breakage in the denatured state is consistent with a process involving rapid deprotonation of histidine followed by His-heme loop formation (23). This mechanism yields

kobs=kb,His+kf,His(Ka(HisH+)(Ka(HisH+)+[H+])), (4)

for the observed rate constant, kobs, for His-heme loop formation (33), where kb,His is the rate constant for His-heme loop breakage, kf,His is the rate constant for His-heme loop formation with a fully deprotonated histidine, and Ka(HisH+) is the acid dissociation constant of the histidine that forms the loop. Note that kb,His can be obtained when [H+] >> Ka(HisH+) because Eq. 4 reduces to kobs = kb,His. Thus, downward pH jump experiments were used to obtain kb,His (see Fig. S2 and Table S3).

In our previous work, we have observed that kb,His decreases at higher GdnHCl concentrations (18,19). The decrease in kb,His correlated well with the increase in solution viscosity as GdnHCl concentration increases (18,19). Thus, we have corrected kb,His at 6.0 M GdnHCl for viscosity (kb6Mcorr, see Table S4). The plots of kb,His versus loop size in Fig. 4 A show that after correction for solution viscosity, kb,His is insensitive to GdnHCl concentration for the Gly-rich variants. Our previously reported kb,His data for the Ala-rich variants are shown for comparison in Fig. 4 A. Two differences are evident. For the Gly-rich variants, kb,His is uniformly larger than for the Ala-rich variants at all loop sizes and kb,His for the Gly-rich variants is relatively insensitive to loop size, whereas kb,His for the Ala-rich variants decreases with increasing loop size before leveling out for n ≥ 34.

Figure 4.

Figure 4

Kinetics of His-heme loop breakage and formation. (A) Plots of kb,His versus loop size, n, for the Gly-rich variants at 3.0 M (■) and 6.0 M (●) GdnHCl and for the Ala-rich variants at 3.0 M (□) and 6.0 M (○) GdnHCl. All 6.0 M GdnHCl data points are viscosity-corrected (see Table S4). Data points for the variant with no insert (n = 16) at 3.0 M (∇) and 6.0 M (Δ) GdnHCl. (B) Plot of Log(kf,His) versus the loop size, n (plotted logarithmically), for the Gly-rich variants at 3.0 M (■) and 6.0 M (●) GdnHCl and for the Ala-rich variants at 3.0 M (□) and 6.0 M (○) GdnHCl. Data points for the variant with no insert (n = 16) at 3.0 M (∇) and 6.0 M (Δ) GdnHCl. The Gly-rich variants (solid lines) and Ala-rich variants (dashed lines) are fits to Eq. 5 in the text for n ≥ 28. The values of ν3 obtained from these fits are reported in Table 2. In both panels, Ala-rich and no-insert data are taken from Tzul and Bowler (19).

Because loop formation is two-state, we obtain the rate constant for His-heme loop formation, kf,His, from Kloop(His) and kb,His (Fig. 4 B and see Table S5). The kf,His data for the Gly-rich variants are compared with our previously reported kf,His data for the Ala-rich variants in Fig. 4 B (19). It is important to note that we (23) and others (34,35) have shown that His-heme loop formation is reaction-controlled, not diffusion-controlled. Thus, kf,His for the Gly-rich variants in Fig. 4 B is ∼3 orders-of-magnitude smaller than rate constants for diffusion-controlled loop formation, kc, obtained with poly(GlySer) peptides of comparable length at similar GdnHCl concentrations (3,36). Because formation of the His-heme bond is reaction-limited, not diffusion-limited, kf,His is not expected to be affected by viscosity (23,34,35). Thus, the decrease in kf,His in 6.0 M vs. 3.0 M GdnHCl is attributable to an increase in the root-mean-square end-to-end distance of the Gly-rich sequences in 6.0 M vs. 3.0 M GdnHCl. At both 3.0 M and 6.0 M GdnHCl, kf,His is smaller at all loop sizes for the Ala-rich variants than for the Gly-rich variants, consistent with Gly-rich sequences being more compact than Ala-rich sequences under the same solvent conditions.

Even though His-heme loop formation is not diffusion-controlled, kf,His still reflects the end-to-end distance distribution of a random coil polypeptide chain. Thus, the Jacobson-Stockmayer equation for loop entropy (Eq. 1) applies, and a plot of Log(kf,His) versus Log(n) should be linear with a slope equal to the scaling exponent, ν3,

Log(kf,His)=Log(kf,His_ref)v3Log(n) (5)

(see derivation in the Supporting Material). In Eq. 5, kf,His_ref corresponds to kf,His for a loop size of n = 1. As with our equilibrium data, we use only the kf,His data for Gly-rich and Ala-rich variants with two or more (GGGGGK) and (AAAAAK) inserts, respectively, in the fits to Eq. 5 shown in Fig. 4 B. Table 2 shows that ν3 obtained from kinetic and equilibrium data is the same for the Gly-rich variants. However, for the Ala-rich variants, ν3 obtained from kinetic data is somewhat larger than when it is obtained from equilibrium data. The difference results from the increase in kb,His due to steric strain at shorter loop sizes for the Ala-rich variants that perturbs pKloop(His) for smaller loop sizes introducing an enthalpy of steric strain not present for pKloop(His) at larger loop sizes. Thus, the ν3 values from the kinetic data should be better estimates of ν3 for Ala-rich sequences. Hence, our data indicate that ν3 is similar for Ala-rich and Gly-rich sequences in 6.0 M GdnHCl. However, under poorer solvent conditions (3.0 M GdnHCl), ν3 decreases for Gly-rich sequences, but not for Ala-rich sequences.

As with our equilibrium results, we note that at 3.0 M GdnHCl, the data for iso-1-Cytc with no inserts (n = 16) and for a single (AAAAAK) or (GGGGGK) insert are consistent with the line obtained from fitting the data for the Gly-rich variants (two or more inserts) to Eq. 5 (Fig. 4 B). By contrast, at 6.0 M GdnHCl, data for iso-1-Cytc with no inserts (n = 16) and for a single (AAAAAK) or (GGGGGK) insert are consistent with the line obtained from fitting the data for the Ala-rich variants (two or more inserts) to Eq. 5 (Fig. 4 B).

Discussion

Scaling properties

As discussed in the Results, the kb,His data in Fig. 4 A indicate that steric strain perturbs pKloop(His) at loop sizes <34 for the Ala-rich variants. Steric strain likely remains in the transition state for His-heme loop formation with the Ala-rich variants for loop sizes <34. However, the increase in kb,His at small loop sizes in Fig. 4 A indicates that steric strain is partially relieved in the transition state relative to the closed-loop form. Thus, we will use ν3 obtained from the kf,His data in Fig. 4 B (Table 2) to compare the Gly-rich to the Ala-rich variants. At 1.5 and 3.0 M GdnHCl concentration, ν3 is 1.7–1.9 for our Gly-rich variants. At 6.0 GdnHCl, ν3 increases to 2.2. By contrast for the Ala-rich variants, ν3 is invariant with GdnHCl concentration; it is near 2.2 at both 3.0 and 6.0 M GdnHCl. Thus, ν3 for Ala-rich and Gly-rich sequences are identical in a good solvent like 6.0 M GdnHCl. The magnitude of ν3 in a good solvent is also in the upper part of the range predicted for the excluded volume limit, ν3 = 1.8–2.4 (10–12). Monte Carlo simulations of polyglycine and polyalanine ranging from 50 to 500 monomers in length using a hard sphere model, which should model a good solvent, also yielded identical scaling exponents for these two polymers (13).

More interestingly, ν3 decreases to 1.7–1.9 under poorer solvent conditions for the Gly-rich variants, whereas ν3 stays near 2.2 for the Ala-rich variants. Our results provide additional support for the difference in ν3 observed from the loop size dependence of kc for Gly-rich poly(GlySer) (ν3 ∼ 1.7) versus poly(Ser) (ν3 ∼ 2.1) in water (3). Interestingly, ν3 is insensitive to solvent quality for poly(GlySer) (3). The insensitivity of ν3 for poly(GlySer) to solvent conditions compared to our Gly-rich variants may reflect the high content of serine with its ability to hydrogen-bond to the main chain.

The decrease in ν3 at lower GdnHCl concentrations for the Gly-rich variants is consistent with simple polymer theories that predict that the scaling exponent for a polymer should decrease as solvent conditions become poorer, reaching ν3 = 1.5 in a θ-solvent where solvent-polymer and polymer-polymer interactions are of equal magnitude (10), and decreasing further as polymer-polymer interactions become progressively more dominant as solvent conditions becomes progressively poorer (11,37). In the limit of maximally compact polymers, the cubic lattice work of Chan and Dill (11) predicts that the probability of loop formation becomes essentially independent of the distance separating monomers in the chain. The observation that ν3 obtained from our experiments on the Gly-rich and Ala-rich variants remains in the range expected for a θ-solvent or good solvent suggests that these polypeptide chains have not undergone a coil-to-globule transition (38,39) even in 1.5 M GdnHCl. Thus, whereas the decrease in ν3 for the Gly-rich sequences indicates that some compaction of the Gly-rich polypeptide occurs in 1.5 and 3.0 M GdnHCl relative to 6.0 GdnHCl, the collapse is not so much as to preclude analysis with simple polymer theories.

Chain stiffness

Polyglycine sequences are expected to be more flexible than polyalanine sequences. Fig. 4 A shows that kb,His is nearly independent of loop size for the Gly-rich variants. By contrast, kb,His drops by ∼20% as loop size increases for the Ala-rich variants. These data indicate that there is little steric strain in the loops formed by the Gly-rich variants whereas steric strain is evident in the loops formed by Ala-rich sequences out to loop sizes of 34. We note that kb,His is 30–40% larger at the longest loop sizes for the Gly-rich variants, also consistent with the greater flexibility of Gly-rich versus Ala-rich sequences. The more favorable equilibrium loop formation (more negative pKloop(His), Fig. 3) and the larger magnitude of kf,His (Fig. 4 B) under the same solvent conditions for the Gly-rich versus the Ala-rich variants of the same loop size are also consistent with the denatured states of the Gly-rich variants being more compact than those of the Ala-rich variants, due to the greater flexibility of Gly-rich sequences.

For a more quantitative estimate of relative chain stiffness, we can use the Jacobson-Stockmayer equation to calculate the relative chain stiffness of the Gly-rich versus the Ala-rich variants. Inspection of the Jacobson-Stockmayer equation (Eq. 1) shows that for loops of the same size but different sequence composition, differences in ΔSloop depend only on Flory's characteristic ratio, Cn, which is a measure of chain stiffness. As with scaling properties, our kf,His data should provide a better estimate than our pKloop(His) data for the chain stiffness of the Gly-rich versus the Ala-rich sequences. In Eq. 5, the terms from the Jacobson-Stockmayer equation not dependent on loop size are contained within the Log(kf,His_ref) term. All terms except Flory's characteristic ratio for the Ala-rich variants, Cn(Ala), and Gly-rich variants, Cn(Gly), cancel out, allowing the ratio between these two quantities to be determined from kf,His data with

Cn(Ala)Cn(Gly)=10((Logkf,His(Gly)Logkf,His(Ala))/ν¯3) (6)

(see derivation in the Supporting Material), where kf,His(Gly) and kf,His(Ala) are kf,His for Gly-rich and Ala-rich variants, respectively, with the same loop size and ν¯3=(ν3,Gly+ν3,Ala)/2 is the average value for the scaling exponent for the Gly-rich and Ala-rich variants at the GdnHCl concentration used to measure kf,His.

Table 3 shows that Cn(Ala)/Cn(Gly) initially increases as a function of loop size and then levels out for the longest loop sizes. Simple polymer theory predicts that Cn increases with polymer length reaching a limiting value for long polymers (9). Because Gly-rich sequences reach a limiting value of Cn faster than Ala-rich sequences (9,40) the ratio Cn(Ala)/Cn(Gly) is expected to grow with loop size leveling out for longer loop sizes as observed in Table 3. The observation that Cn(Ala)/Cn(Gly) is near 1 for a loop size of 22 also reflects the fact that the variants with one insert are most similar because the 15 amino acids of the loop closest to the heme are from the native sequence of iso-1-Cytc. We observe limiting values for Cn(Ala)/Cn(Gly) of ∼1.6 in 3.0 M GdnHCl and ∼1.3 in 6.0 M GdnHCl. Thus, the chain stiffness of the Ala-rich and Gly-rich sequences become more similar at high GdnHCl concentration.

Table 3.

Calculated relative magnitude of Flory's characteristic ratio, Cn, for the Gly-rich versus the Ala-rich variants

Loop size %Gly
Cn(Ala)/Cn(Gly)
Ala-rich Gly-rich 3.0 M GdnHCl 6.0 M GdnHCl
22 9.1 31.8 1.15 ± 0.07 1.02 ± 0.04
28 7.1 42.9 1.38 ± 0.11 1.27 ± 0.04
34 5.8 50.0 1.64 ± 0.20 1.32 ± 0.05
40 5.0 55.0 1.56 ± 0.14 1.32 ± 0.04
46 4.3 58.7 1.61 ± 0.16 1.25 ± 0.04

Average value of ν3 for the Ala-rich and Gly-rich variants, ν¯3 in Table 2 (kinetic data) at each GdnHCl concentration was used in Eq. 6. At 3.0 M GdnHCl, we used ν¯3=1.92±0.35. At 6.0 M GdnHCl, we used 3 = 2.21 ± 0.04. The reported error is the standard propagation of error due to the standard deviation in kf,His and in ν¯3 for the Ala-rich and Gly-rich variants. The data for the Ala-rich variants used in these calculations are from Tzul and Bowler (19).

The Cn(Ala)/Cn(Gly) ratios in Table 3 are surprisingly modest. Simple polymer theories predict Cn ∼ 9.27 for pure polyalanine and Cn ∼ 2.16 for pure polyglycine (9), yielding Cn(Ala)/Cn(Gly) of ∼4.3. However, every sixth amino acid in our constructs is a lysine. Also, our Gly-rich and Ala-rich variants contain 16 amino acids (two of which are Gly) derived from the natural sequence of iso-1-Cytc. Thus, our loops are not pure polyalanine or polyglycine sequences. For loop sizes of 34–46, the % glycine content approaches a constant value for each set of variants (Table 3). The Gly-rich variants with loop sizes of 34–46 range from 50–59% glycine. For these same loop sizes, the Ala-rich variants range from 4 to 6% glycine. Based on the work of Miller et al. (41), Cn(Ala)/Cn(Gly) should be 2.3–2.6 for loop sizes of 34–46 in our system. Thus, even after correcting for the actual sequence composition of the Ala-rich versus the Gly-rich variants, Cn(Ala)/Cn(Gly) is much smaller than predicted.

Consistent with our results, Krieger et al. (3) observed that the difference in kc for poly(Ser) versus poly(GlySer) polypeptides in water was smaller than anticipated based on the expected differences in chain stiffness. They also observed that chain stiffness for the poly(GlySer) polypeptides decreased in 8.0 M GdnHCl relative to water as a result of decreased intrachain interactions—van der Waals or hydrogen bonding—due to better solvation of the polypeptide at higher GdnHCl concentration (3,42). If good solvation of polypeptides is the dominant contribution to chain stiffness at high GdnHCl concentration, solvation effects may explain the greater similarity we observe for Cn(Ala) and Cn(Gly) in 6.0 M GdnHCl (Table 3).

Physical basis for the differences between polyglycine and polyalanine

Our loop formation data clearly show that the Gly-rich and Ala-rich variants are more similar at higher GdnHCl concentrations both with regard to scaling properties and chain stiffness, whereas their polymer properties diverge at lower denaturant concentration. To begin, we consider the possible basis for the similar behavior of these polymers at high denaturant concentrations. Recent theoretical (37) and experimental (43) studies on a homopolymer of glycine show that intrachain hydrogen bonding interactions are decreased in higher concentrations of denaturant and are replaced by hydrogen bonds to the denaturant. Experiment and theory (44–46), with some exceptions (13), indicate that both alanine and glycine have a relatively high preference for the polyproline II structure. Because high denaturant concentration is known to stabilize polyproline II structure (47,48), the conformational properties of Gly-rich and Ala-rich sequences would be expected to be more similar in 6.0 M GdnHCl. Thus, the distribution of conformations should become more similar in 6.0 M GdnHCl, leading to more similar ν3 and Cn values for Gly-rich and Ala-rich variants. In other words, solvent-backbone interactions partially neutralize the effects of side-chain sterics on polypeptide conformational properties in a good solvent.

As solvent conditions become poorer, intrachain interactions are expected to become progressively more dominant. In the extreme, a coil-to-globule transition is expected to occur (37–39,49,50). The observation that the scaling exponent remains in the range expected for a random coil at lower GdnHCl concentrations suggests that the collapse to a globule has not occurred for either the Ala-rich or Gly-rich variants at 1.5–3.0 M GdnHCl. However, the divergence of ν3 for Gly-rich versus Ala-rich sequences and the increase in Cn(Ala)/Cn(Gly) at 3.0 M GdnHCl indicates that the peculiarities of the Gly-rich versus the Ala-rich sequences are beginning to affect chain properties. Polyalanine sequences similar to the inserts used here are known to form stable helices in water (51). Simulations of a seven-residue alanine peptide show that the γ-turn basin is preferentially populated as solvent conditions become poorer (52). By contrast, simulation (37) and experiment (43) indicate that polyglycine forms a collapsed disordered structure with nonspecific hydrogen bonds under poor solvent conditions. The ordered helical structure favored by polyalanine might be expected to stiffen the polypeptide chain relative to polyglycine at lower GdnHCl concentration under poorer solvent conditions as we observe for Cn(Ala)/Cn(Gly) (Table 3). The tendency of polyglycine to undergo nonspecific compaction in a poorer solvent (37,43) would be expected to decrease ν3. It is interesting to note, for loop sizes of 16 and 22 where the residues within the loop are completely or predominately from the natural sequence of iso-1-Cytc, that the polymer properties are more similar to the Ala-rich variants in 6.0 M GdnHCl and more similar to the Gly-rich variants in 1.5 and 3.0 M GdnHCl despite the relatively low 12.5% Gly content of the natural sequence (Fig. 3 and Fig. 4 B). Cn decreases rapidly as glycine content is increased from 0% to 50%, decreasing more slowly at >50% glycine content (41). However, even with this steep initial decrease in Cn, the Ala-rich sequences would be expected to have backbone sterics more similar to the natural sequence of iso-1-Cytc than the Gly-rich sequences. Thus, even though all amino acids except Pro and Gly have backbone sterics similar to Ala, the chemical properties of the side chains of non-Gly/Pro amino acids appear to mediate collapse in a manner more similar to Gly than Ala under poorer solvent conditions.

Conclusions

Comparison of these data for Gly-rich variants of iso-1-Cytc with our previous work on Ala-rich variants shows that glycine does not increase the flexibility of a polypeptide chain as much as expected from simple polymer theories. However, relative to alanine it still promotes compact conformers more effectively. At high denaturant concentration, the scaling exponent and chain stiffness are more similar for Gly-rich and Ala-rich sequences, indicating that denaturant-polymer interactions and not side-chain sterics are the dominant factor in polypeptide conformational properties under good solvent conditions. At lower denaturant concentration, the polymer properties of the Gly-rich and Ala-rich variants diverge, presumably due to the increasing influence of the side chain under poorer solvent conditions, with polyglycine and polyalanine favoring nonspecific and ordered helical intrachain interactions, respectively. Clearly, a fuller understanding of the divergent effects of different side chains on polypeptide polymer properties will be important for discerning how these biases can be manipulated to define the fold of a protein.

Acknowledgments

This work was supported by the National Institutes of Health grants R01 GM074750, its ARRA supplement GM074750-04S1, and a pilot project from NIH CoBRE P20GM103546 to B.E.B.

Supporting Material

Document S1. Two figures, five tables, derivations of Eqs. 5 and 6, and reference (53)
mmc1.pdf (276.9KB, pdf)

References

  • 1.Haber E., Anfinsen C.B. Side-chain interactions governing the pairing of half-cystine residues in ribonuclease. J. Biol. Chem. 1962;237:1839–1844. [PubMed] [Google Scholar]
  • 2.Ben-David M., Noivirt-Brik O., Levy Y. Assessment of CASP8 structure predictions for template free targets. Proteins. 2009;77(Suppl 9):50–65. doi: 10.1002/prot.22591. [DOI] [PubMed] [Google Scholar]
  • 3.Krieger F., Fierz B., Kiefhaber T. Dynamics of unfolded polypeptide chains as model for the earliest steps in protein folding. J. Mol. Biol. 2003;332:265–274. doi: 10.1016/s0022-2836(03)00892-1. [DOI] [PubMed] [Google Scholar]
  • 4.Bieri O., Wirz J., Kiefhaber T. The speed limit for protein folding measured by triplet-triplet energy transfer. Proc. Natl. Acad. Sci. USA. 1999;96:9597–9601. doi: 10.1073/pnas.96.17.9597. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Krieger F., Möglich A., Kiefhaber T. Effect of proline and glycine residues on dynamics and barriers of loop formation in polypeptide chains. J. Am. Chem. Soc. 2005;127:3346–3352. doi: 10.1021/ja042798i. [DOI] [PubMed] [Google Scholar]
  • 6.Lapidus L.J., Steinbach P.J., Hofrichter J. Effects of chain stiffness on the dynamics of loop formation in polypeptides. Appendix: testing a 1-dimensional diffusion model for peptide dynamics. J. Phys. Chem. B. 2002;106:11628–11640. [Google Scholar]
  • 7.Lapidus L.J., Eaton W.A., Hofrichter J. Measuring the rate of intramolecular contact formation in polypeptides. Proc. Natl. Acad. Sci. USA. 2000;97:7220–7225. doi: 10.1073/pnas.97.13.7220. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Jacobson H., Stockmayer W.H. Intramolecular reaction in polycondensations. I. The theory of linear systems. J. Chem. Phys. 1950;18:1600–1606. [Google Scholar]
  • 9.Cantor C.R., Schimmel P.R. W. H. Freeman; San Francisco: 1980. Biophysical Chemistry, Part III: The Behavior of Biological Macromolecules. [Google Scholar]
  • 10.Flory P.J. Hanser Publishers; New York: 1988. Statistical Mechanics of Chain Molecules. [Google Scholar]
  • 11.Chan H.S., Dill K.A. The effect of internal constraints on the configurations of chain molecules. J. Chem. Phys. 1990;92:3118–3135. [Google Scholar]
  • 12.Redner S. Distribution functions in the interior of polymer chains. J. Phys. Math. Gen. 1980;13:3525–3541. [Google Scholar]
  • 13.Tran H.T., Wang X., Pappu R.V. Reconciling observations of sequence-specific conformational propensities with the generic polymeric behavior of denatured proteins. Biochemistry. 2005;44:11369–11380. doi: 10.1021/bi050196l. [DOI] [PubMed] [Google Scholar]
  • 14.Beck D.A.C., Alonso D.O.V., Daggett V. The intrinsic conformational propensities of the 20 naturally occurring amino acids and reflection of these propensities in proteins. Proc. Natl. Acad. Sci. USA. 2008;105:12259–12264. doi: 10.1073/pnas.0706527105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Richardson J.S. The anatomy and taxonomy of protein structure. Adv. Protein Chem. 1981;34:167–339. doi: 10.1016/s0065-3233(08)60520-3. [DOI] [PubMed] [Google Scholar]
  • 16.Marcelino A.M.C., Gierasch L.M. Roles of β-turns in protein folding: from peptide models to protein engineering. Biopolymers. 2008;89:380–391. doi: 10.1002/bip.20960. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Berghuis A.M., Brayer G.D. Oxidation state-dependent conformational changes in cytochrome c. J. Mol. Biol. 1992;223:959–976. doi: 10.1016/0022-2836(92)90255-i. [DOI] [PubMed] [Google Scholar]
  • 18.Dar T.A., Schaeffer R.D., Bowler B.E. Manifestations of native topology in the denatured state ensemble of Rhodopseudomonas palustris cytochrome c′. Biochemistry. 2011;50:1029–1041. doi: 10.1021/bi101551h. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Tzul F.O., Bowler B.E. Denatured states of low-complexity polypeptide sequences differ dramatically from those of foldable sequences. Proc. Natl. Acad. Sci. USA. 2010;107:11364–11369. doi: 10.1073/pnas.1004572107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Finnegan M.L., Bowler B.E. Propensities of aromatic amino acids versus leucine and proline to induce residual structure in the denatured-state ensemble of iso-1-cytochrome c. J. Mol. Biol. 2010;403:495–504. doi: 10.1016/j.jmb.2010.09.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Tzul F.O., Bowler B.E. Importance of contact persistence in denatured state loop formation: kinetic insights into sequence effects on nucleation early in folding. J. Mol. Biol. 2009;390:124–134. doi: 10.1016/j.jmb.2009.04.075. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Tzul F.O., Kurchan E., Bowler B.E. Sequence composition effects on denatured state loop formation in iso-1-cytochrome c variants: polyalanine versus polyglycine inserts. J. Mol. Biol. 2007;371:577–584. doi: 10.1016/j.jmb.2007.04.060. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Kurchan E., Roder H., Bowler B.E. Kinetics of loop formation and breakage in the denatured state of iso-1-cytochrome c. J. Mol. Biol. 2005;353:730–743. doi: 10.1016/j.jmb.2005.08.034. [DOI] [PubMed] [Google Scholar]
  • 24.Wandschneider E., Bowler B.E. Conformational properties of the iso-1-cytochrome c denatured state: dependence on guanidine hydrochloride concentration. J. Mol. Biol. 2004;339:185–197. doi: 10.1016/S0022-2836(04)00331-6. [DOI] [PubMed] [Google Scholar]
  • 25.Hammack B.N., Smith C.R., Bowler B.E. Denatured state thermodynamics: residual structure, chain stiffness and scaling factors. J. Mol. Biol. 2001;311:1091–1104. doi: 10.1006/jmbi.2001.4909. [DOI] [PubMed] [Google Scholar]
  • 26.Godbole S., Bowler B.E. A histidine variant of yeast iso-1-cytochrome c that strongly affects the energetics of the denatured state. J. Mol. Biol. 1997;268:816–821. doi: 10.1006/jmbi.1997.0999. [DOI] [PubMed] [Google Scholar]
  • 27.Rosell F.I., Mauk A.G. Spectroscopic properties of a mitochondrial cytochrome c with a single thioether bond to the heme prosthetic group. Biochemistry. 2002;41:7811–7818. doi: 10.1021/bi016060e. [DOI] [PubMed] [Google Scholar]
  • 28.Pollock W.B., Rosell F.I., Mauk A.G. Bacterial expression of a mitochondrial cytochrome c. Trimethylation of Lys72 in yeast iso-1-cytochrome c and the alkaline conformational transition. Biochemistry. 1998;37:6124–6131. doi: 10.1021/bi972188d. [DOI] [PubMed] [Google Scholar]
  • 29.Bowler B.E., Dong A., Caughey W.S. Characterization of the guanidine hydrochloride-denatured state of iso-1-cytochrome c by infrared spectroscopy. Biochemistry. 1994;33:2402–2408. doi: 10.1021/bi00175a008. [DOI] [PubMed] [Google Scholar]
  • 30.Rao K.S., Tzul F.O., Bowler B.E. Thermodynamics of loop formation in the denatured state of Rhodopseudomonas palustris cytochrome c′: scaling exponents and the reconciliation problem. J. Mol. Biol. 2009;392:1315–1325. doi: 10.1016/j.jmb.2009.07.074. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Monera O.D., Kay C.M., Hodges R.S. Protein denaturation with guanidine hydrochloride or urea provides a different estimate of stability depending on the contributions of electrostatic interactions. Protein Sci. 1994;3:1984–1991. doi: 10.1002/pro.5560031110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Kuhlman B., Luisi D.L., Raleigh D.P. pKa values and the pH dependent stability of the N-terminal domain of L9 as probes of electrostatic interactions in the denatured state. Differentiation between local and nonlocal interactions. Biochemistry. 1999;38:4896–4903. doi: 10.1021/bi982931h. [DOI] [PubMed] [Google Scholar]
  • 33.Davis L.A., Schejter A., Hess G.P. Alkaline isomerization of oxidized cytochrome c. Equilibrium and kinetic measurements. J. Biol. Chem. 1974;249:2624–2632. [PubMed] [Google Scholar]
  • 34.Hagen S.J., Hofrichter J., Eaton W.A. Rate of intrachain diffusion of unfolded cytochrome c. J. Phys. Chem. B. 1997;101:2352–2365. [Google Scholar]
  • 35.Hagen S.J., Hofrichter J., Eaton W.A. Diffusion-limited contact formation in unfolded cytochrome c: estimating the maximum rate of protein folding. Proc. Natl. Acad. Sci. USA. 1996;93:11615–11617. doi: 10.1073/pnas.93.21.11615. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Möglich A., Krieger F., Kiefhaber T. Molecular basis for the effect of urea and guanidinium chloride on the dynamics of unfolded polypeptide chains. J. Mol. Biol. 2005;345:153–162. doi: 10.1016/j.jmb.2004.10.036. [DOI] [PubMed] [Google Scholar]
  • 37.Tran H.T., Mao A., Pappu R.V. Role of backbone-solvent interactions in determining conformational equilibria of intrinsically disordered proteins. J. Am. Chem. Soc. 2008;130:7380–7392. doi: 10.1021/ja710446s. [DOI] [PubMed] [Google Scholar]
  • 38.Pappu R.V., Wang X., Crick S.L. A polymer physics perspective on driving forces and mechanisms for protein aggregation. Arch. Biochem. Biophys. 2008;469:132–141. doi: 10.1016/j.abb.2007.08.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Ziv G., Thirumalai D., Haran G. Collapse transition in proteins. Phys. Chem. Chem. Phys. 2009;11:83–93. doi: 10.1039/b813961j. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Schimmel P.R., Flory P.J. Conformational energy and configurational statistics of poly-L-proline. Proc. Natl. Acad. Sci. USA. 1967;58:52–59. doi: 10.1073/pnas.58.1.52. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Miller W.G., Brant D.A., Flory P.J. Random coil configuration of polypeptide copolymers. J. Mol. Biol. 1967;23:67–80. [Google Scholar]
  • 42.Möglich A., Joder K., Kiefhaber T. End-to-end distance distributions and intrachain diffusion constants in unfolded polypeptide chains indicate intramolecular hydrogen bond formation. Proc. Natl. Acad. Sci. USA. 2006;103:12394–12399. doi: 10.1073/pnas.0604748103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Teufel D.P., Johnson C.M., Neuweiler H. Backbone-driven collapse in unfolded protein chains. J. Mol. Biol. 2011;409:250–262. doi: 10.1016/j.jmb.2011.03.066. [DOI] [PubMed] [Google Scholar]
  • 44.Rucker A.L., Pager C.T., Creamer T.P. Host-guest scale of left-handed polyproline II helix formation. Proteins. 2003;53:68–75. doi: 10.1002/prot.10477. [DOI] [PubMed] [Google Scholar]
  • 45.Kelly M.A., Chellgren B.W., Creamer T.P. Host-guest study of left-handed polyproline II helix formation. Biochemistry. 2001;40:14376–14383. doi: 10.1021/bi011043a. [DOI] [PubMed] [Google Scholar]
  • 46.Fleming P.J., Fitzkee N.C., Rose G.D. A novel method reveals that solvent water favors polyproline II over β-strand conformation in peptides and unfolded proteins: conditional hydrophobic accessible surface area (CHASA) Protein Sci. 2005;14:111–118. doi: 10.1110/ps.041047005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Tiffany M.L., Krimm S. Extended conformation of polypeptides and proteins in urea and guanidine hydrochloride. Biopolymers. 1973;12:575–587. [Google Scholar]
  • 48.Whittington S.J., Chellgren B.W., Creamer T.P. Urea promotes polyproline II helix formation: implications for protein denatured states. Biochemistry. 2005;44:6269–6275. doi: 10.1021/bi050124u. [DOI] [PubMed] [Google Scholar]
  • 49.Vitalis A., Wang X., Pappu R.V. Atomistic simulations of the effects of polyglutamine chain length and solvent quality on conformational equilibria and spontaneous homodimerization. J. Mol. Biol. 2008;384:279–297. doi: 10.1016/j.jmb.2008.09.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Vitalis A., Wang X., Pappu R.V. Quantitative characterization of intrinsic disorder in polyglutamine: insights from analysis based on polymer theories. Biophys. J. 2007;93:1923–1937. doi: 10.1529/biophysj.107.110080. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Baldwin R.L. α-helix formation by peptides of defined sequence. Biophys. Chem. 1995;55:127–135. doi: 10.1016/0301-4622(94)00146-b. [DOI] [PubMed] [Google Scholar]
  • 52.Gong H., Rose G.D. Assessing the solvent-dependent surface area of unfolded proteins using an ensemble model. Proc. Natl. Acad. Sci. USA. 2008;105:3321–3326. doi: 10.1073/pnas.0712240105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Kawahara K., Tanford C. Viscosity and density of aqueous solutions of urea and guanidine hydrochloride. J. Biol. Chem. 1966;241:3228–3232. [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Two figures, five tables, derivations of Eqs. 5 and 6, and reference (53)
mmc1.pdf (276.9KB, pdf)

Articles from Biophysical Journal are provided here courtesy of The Biophysical Society

RESOURCES