Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2009 Sep 26.
Published in final edited form as: J Mol Biol. 2008 Jul 11;382(1):203–212. doi: 10.1016/j.jmb.2008.07.005

Non-random coil behavior as a consequence of extensive PPII structure in the denatured state

Aitziber L Cortajarena , Gregg Lois , Eilon Sherman §, Corey S O’Hern , Lynne Regan ¶,†,, Gilad Haran §,
PMCID: PMC2603145  NIHMSID: NIHMS69478  PMID: 18644382

Abstract

Unfolded proteins may contain native or non-native residual structure, which has important implications for the thermodynamics and kinetics of folding as well as for misfolding and aggregation diseases. However, it has been universally accepted that residual structure should not affect the global size scaling of the denatured chain, which obeys the statistics of random coil polymers. Here we use a single-molecule optical technique, fluorescence correlation spectroscopy, to probe the denatured state of set of repeat proteins containing an increasing number of identical domains, from two to twenty. The availability of this set allows us to obtain the scaling law for the unfolded state of these proteins, which turns out to be unusually compact, strongly deviating from random-coil statistics. The origin of this unexpected behavior is traced to the presence of extensive non-native polyproline II helical structure, which we localize to specific segments of the polypeptide chain. We show that the experimentally observed effects of PPII on the size scaling of the denatured state can be well-described by simple polymer models. Our findings suggest an hitherto unforeseen potential of non-native structure to induce significant compaction of denatured proteins, affecting significantly folding pathways and kinetics.

Keywords: Protein folding, fluorescence correlation spectroscopy, PPII helix, denatured state, self-avoiding random walk, hydrodynamic radius

Introduction

During the folding process, proteins reach a defined, unique, native structure from a poorly defined ensemble of unfolded conformations. Although we have an atomistic description of the structures of the native states of a multitude of proteins, much less is known about the unfolded ensemble. The unfolded state is often treated as some version of a random coil, a realistic model of which is the self-avoiding random walk (SARW), in which different chain segments cannot occupy the same volume element. For a SARW, the relationship between the radius of gyration (Rg) (or the hydrodynamic radius (Rh)), and the length of the chain is given by the expression Rg α Nν, where ν = 0.59 and N is the number of units in the chain.1; 2 For many different denatured proteins the experimentally-measured relationship between Rg or Rh and N is consistent with the behavior of a SARW.3; 4; 5; 6 This observation has been interpreted as support for an unstructured unfolded state. Conversely, there have been several reports that describe the existence of ‘residual structure’ in the unfolded state 7; 8; 9; 10 which, in general, does not change the global size scaling of the chain.11; 12 A detailed understanding of the nature of the unfolded state is essential for a complete understanding of protein folding, because any residual structure in the ‘unfolded state’ will clearly play a role in modulating both the thermodynamics and kinetics of folding, and might affect the formation of alternative structures, such as amyloid.

Repeat proteins are composed of tandem arrays of a small structural motif. 13 they are widespread in nature and in many cases function as mediators of protein-protein interactions. Their simplified modular structures make them ideal systems for understanding basic principles that drive protein folding.14; 15; 16 The tetratricopeptide repeat is a 34 amino-acid motif in which two anti-parallel α-helices stack together.17 We have previously reported the design and characterization of a consensus tetratricopeptide (CTPR) sequence, and have generated a series of proteins with different numbers of tandem repeats of this sequence.17; 18; 19 Tandem arrays of CTPR units form a regular, superhelical, extended structure (Fig. 1A).18; 19 It has been suggested that the folding mechanism of these proteins may be different than that of globular proteins.18 Here we combine spectroscopy and modeling to shed light on the structure of the denatured state of the CTPR proteins. Surprisingly, we find that extensive non-native structure leads to a compact denatured state, with size scaling strongly deviating from that expected for a random coil.

Figure 1.

Figure 1

Rh values of native and unfolded CTPR proteins measured by FCS. A. Correlation between the Rh values for CTPRs of different lengths (CTPR2, CTPR3, CTPR4, CTPT6, CTPR8 and CTPR20) measured by FCS and the Rh values calculated from the crystal structures of the CTPRs using the program Hydropro,21 (filled circles) or the Rh values calculated using a model that describes the hydrodynamic properties of cylindrical particles (empty circles)22 (Equation 5 in Materials and Methods). The figure shows the ribbon representations of the structures of two CTPR proteins (CTPR4 and CTPR20). B. FCS cross-correlation curves for the CTPR3 protein at various GuHCl concentrations, from 0 to 6M. Arrow indicates the direction of increase in denaturant concentration, which shifts the curves to the right. C. A log-log plot of Rh values of the different CTPRs under strongly denaturing conditions at 6 M GuHCl (filled dots) and the least-squares fits to the data (solid line). The red line represents the empirical relation between the measured Rh and protein length for a set of highly denatured globular proteins 4. The inset shows the dependence on the GuHCl concentration of the Rh value for CTPR proteins of different lengths in the unfolded state, CTPR2 (●), CTPR3 (○), CTPR4 (▼), CTPR6 (▽), CTPR8 (■) and CTPR20 (□), indicating that above ~4M GuHCl the size of these proteins doesn't change anymore.

Results and Discussion

The denatured state of CTPR proteins is not a random coil

Fluorescence correlation spectroscopy (FCS) probes the hydrodynamic properties of macromolecules by recording fluctuations in the fluorescence they emit 20. We employed FCS to measure the hydrodynamic radii of CTPR proteins in dilute solutions, in both their folded and unfolded states. For this purpose we labeled CTPR proteins of 2 to 20 repeats (CTPR2 to CTPR20) with the fluorescent probe Alexa Fluor 488 at a unique C-terminal cysteine residue. We first measured the values of Rh for all the different length CTPR proteins in the native state. The measured values of Rh (Table 1) are in very good agreement with values calculated either from crystal structures of CTPR proteins using the program Hydropro,21 or by treating the folded structures as rods and using an expression for Rh derived for rigid cylinders22 (Fig. 1A).

Table 1.

Rh values of native and unfolded CTPR proteins

# repeats # aa Rh (Å) Native Rh (Å) Unfolded
2 88 20.21±0.87 29.3±1.21
3 122 22.40±0.75 34.07±0.31
4 156 24.89±0.37 36.48±0.42
6 224 28.19±1.11 41.49±1.12
8 292 30.62±0.79 45.30±0.81
20 700 48.36±0.98 64.89±1.32

The hydrodynamic properties of CTPR2 to CTPR20 were then studied under strongly denaturing conditions at 6M guanidinium hydrocholorie (GuHCl, see Table 1 and Fig. 1B & 1C). We calibrated the FCS instrument and corrected the results for viscosity and refractive index effects using rhodamine 6G as a standard (See materials and methods). The availability of the full series of CTPR proteins allowed us to obtain a scaling relation for Rh within the series. Unexpectedly, we observed that for a given length of protein, the measured Rh is different from that anticipated if the polypeptide behaves as a SARW. The denatured state of many proteins of different lengths has been studied using either hydrodynamic methods,4; 5 or small angle x-ray scattering,6 to measure Rh or Rg respectively. These data can be fitted to the empirical scaling factors of Rh α N0.57 and Rg α N0.59, respectively,4; 6 where N is the number of amino acids. Clearly these data are in good agreement with the scaling law for a SARW.1; 2 For the CTPR proteins, however, the relationship between the measured hydrodynamic radii of the unfolded proteins and N, is best described by Rh α N0.37 (Rh = (5.49 ±0.25)×N0.37±0.03) (Fig. 1C) *. Similar results were obtained with urea as denaturant (not shown). Thus, under strongly denaturing conditions the unfolded state ensembles of the CTPR proteins do not behave as a SARW, but are rather more compact, suggesting the presence of significant structure in the unfolded state .

Abundance of polyproline II structure in denatured CTPRs

The properties of proteins are determined by their amino acid composition and sequence. We therefore analyzed the sequence of the CTPR proteins and found that the abundance of several amino acids in the CTPR proteins deviates significantly from that of a reference state, the codon usage in all yeast proteins. Notably, amino acids with a high propensity to form polyproline II (PPII) helical structures, such as P, A, G, and Q23 appear in the CTPR sequence on average 1.8 times more frequently than in the yeast proteome.

Several recent studies have proposed the existence of a significant population of PPII helical conformations in unfolded proteins, leading to the suggestion that the occurrence of PPII structure may be a general feature of the ‘unfolded’ state.24; 25; 26; 27 We used circular dichroism (CD) as a sensitive method by which to detect the presence of PPII helix: it is well established that this helix has a distinctive CD spectrum, with a maximum at 229 nm.24; 28 Fig. 2A shows CD spectra of CTPR8 at different concentrations of GuHCl, from 3.5 M to 6 M. Importantly, at 3.5 M GuHCl the CD signal characteristic of α-helical structure has completely disappeared (data not shown, see also reference 18). The spectra clearly show a positive peak around 229 nm. The peak intensity increases with increasing GuHCl concentration, reaching a maximum at a concentration of about 4.5–5 M. This concentration is similar to that at which the unfolding titration curves monitored by either CD signal at 222 nm or by Rh also reach a maximum value. Similar spectra were obtained with urea as denaturant (data not shown). CD spectra of PPII helices are also characterized by a negative peak at 210 nm 27, which is very difficult to observe in the presence of GuHCl. We were able to observe the negative peak by taking a CD spectrum with a short path-length cell and a high protein concentration (not shown).

Figure 2.

Figure 2

Circular dichroism spectra of CTPR8 and B-A peptide showing PPII helical structure. A. CD spectra of the unfolded state of CTPR8 at increasing GuHCl concentrations. The inset illustrates the temperature dependence of the CD signal, indicating a non-cooperative but reversible melting of the PPII structure. It shows the CD spectra of CTPR8 at 6 M GuHCl at 25°C (solid line), at 98°C (dashed and dotted line) and at 25°C after the sample was heated and cooled down (dashed line). B. CD spectrum of the B-A peptide at 2.5M GuHCl in a 0.1 mm pathlength cuvette at 25°C. C. CD spectra of the B-A peptide in a 0–6.5 M range of GuHCl concentrations.

The modular, repeated, character of the CTPR series of proteins greatly facilitates our investigations of the nature of PPII structure in the unfolded state. An important question is whether the PPII structure extends through the entire sequence, or whether is it present in discrete segments. We estimated the fraction of PPII in CTPR proteins by comparing the observed mean residue ellipticity at 229nm ([Θ]229) with [Θ]229 values reported for PPII-containing peptides.27 All the CTPR proteins, from CTPR2 to CTPR20, show a substantial PPII content, which we roughly estimate to be as much as ~50 % at 6 M GuHCl. The observed [Θ]229 for PPII is the same, regardless of the number of repeats in each protein, indicating that there is an equal amount of PPII structure formed per each repeat, and that there is no cooperativity between repeats in formation of PPII structure.

CTPR1, however, does not exhibit significant PPII structure under denaturing conditions. CTPR1 has the sequence A-B-Acap, where A and B are the two α-helices forming the basic repeat unit, and Acap is a solvating helix (for sequence information see Materials and Methods),17 whereas all the longer CTPR proteins have the sequence (A-B)n-Acap.17; 19 CTPR1 thus lacks the B-A inter-repeat sequence. Based on the method of reference 29, this sequence (but not the B-Acap sequence) can in fact be predicted to have high propensity for PPII formation. To test this hypothesis, we synthesized a peptide corresponding to the B-A sequence. This peptide does indeed adopt PPII structure, and the CD spectrum (Fig. 2B) shows a positive peak at 229 nm and negative minimum at 210 nm, characteristic of PPII structure 27. The [Θ]229 value is comparable to that of the CTPR proteins. Moreover, PPII formation by the peptide shows the same denaturant concentration dependence as that observed for the full-length proteins (Fig. 2C). This result confirms that the B-A sequence in the CTPR proteins is responsible for the PPII structure and that the PPII segments form independently. Because the B-A peptide is unstructured in the absence of denaturant (Not shown), our results clearly indicate that GuHCl not only promotes formation of PPII structure by destabilizing the native state, but also directly stabilizes this structure.

Finally, the PPII content of peptides has been shown to decrease when the temperature increases.30 We therefore tested the effect of temperature on the PPII content of highly-denatured CTPR proteins. We observed a reversible loss of PPII structure with increasing temperature, such that the signal at 98°C indicates essentially zero PPII content (Inset to Fig. 2A).

Lattice simulations of denatured CTPRs

To better understand the effect of PPII structure on the dimensions of the denatured CTPRs, we performed simulations using simple polymer models. First, we studied the behavior of a freely-jointed, self avoiding, random walking polymer on a 3-dimensional lattice for polymers with N equal-sized links of length L, where N varies between 10 and 1000, and L is the lattice constant (see Materials and Methods). To allow comparison with the experimental system, we considered any polymer of length N as containing N/34 repeats. For each of these chains, we calculated the Rg and, as expected for a random walk with excluded volume, a log-log plot of Rg vs. N yielded a straight line with slope 0.59 (Fig. 3A). To mimic the effect of different lengths of PPII structure within a repeat, we then included in the model links with two different sizes, L (the original) and NpL (to represent the number of links in the PPI segments), and allowed NL steps of size L and one step with size NpL for each repeat. We set the repeat length to be 34 (NL + Np = 34), as above. This is a reasonable way to model the PPII segments. It is accepted that at least up to a length of 12 residues, PPII peptides may behave as rigid rods,31; 32 and while isolated non-proline PPII peptides have been shown to be more flexible,33 there is no such information on non-proline PPII peptides buried within proteins. Note that this scheme creates an ordered sequence of rigid and flexible chain stretches. We calculated Rg as a function of N for all Np values from 1 to 34 and calculated the scaling exponent ν from the slope of log-log plots of Rg vs. N. In Fig. 3B we show ν vs. Np; ν = 0.59 for Np =1 (i.e. no PPII and the expected random walk with excluded volume behavior). As Np increases, ν decreases and reaches a minimum when Np/34 is approximately 1/2. The long segments bring the chain to relatively empty regions of the lattice, so that the subsequent steps with small segments are less affected by the excluded volume constraint. This mechanism causes ν to decrease toward the Gaussian coil result of 0.5. As Np increases further, ν returns to 0.59 and we again have a SARW, but with a larger link size. Note that the scaling exponent ν is smallest when the fraction of PPII segments is 50%, which agrees with experimental estimates of the fraction of PPII residues in each CTPR repeat unit. Although we cannot expect to make fully quantitative comparisons between the results of our simplified simulations and the experimentally observed data for a real polypeptide chain, these results provide a rationale for why the scaling exponent observed for denatured states containing PPII is not the same as that for a random coil, based simply on the intrinsic behavior of mixed polymers.

Figure 3.

Figure 3

A polymer model for the effect of PPII on the hydrodynamic properties of the denatured state. A. A log10-log10 plot of Rg vs N for the SARW simulation. The ensemble averaged radius of gyration Rg is plotted against the number of monomers N in a classic excluded volume random walking chain (SARW). The data is fit to a power law relation Rg α Nν, and we calculate ν = 0.587 ± 0.003. The measured exponent is consistent with the accepted value of 0.59. B. The scaling exponent ν as a function of rigid segment (polyproline) length Np. Two dimensional representations of the polymer chains are pictured for Np = 4, 17, and 30. Note that the ratio between large (red) and small segments (black) is drawn to scale for each value of Np, but the overall size of the schematic is made smaller for larger Np to fit into the figure. The value of ν exhibits a minimum at Np = 14, roughly half of a repeat unit. Error bars on the values of ν are ± 0.003. C. The scaling exponent ν as a function of polyproline length Np and several fractions f of polyproline segments that experience an effective attraction. The data for f=0 (no attraction) is identical to Fig. 3B. As f increases, the value of ν decreases and the location of the minimum moves to larger Np. Circles represent f=0, squares f=0.2, diamonds f=0.4, triangles f=0.6, and stars f=0.8. Error bars on the values of ν are ± 0.003 for f=0 and ± 0.011 all other f. D. Visualization of the polymer chain with N= 204, Np=18, and f=0.6 for a representative configuration from the SARW ensemble.

Interactions between PPII segments are likely to also contribute to the small scaling exponent ν ~ 0.37 found in our experiments.34 We therefore introduced effective attractive interactions between PPII segments in the simulations by minimizing the distance between a fraction (f; for 0 ≤ f ≤ 0.8) of the segments and all other segments in the chain. Incorporating this additional feature into the model has a dramatic effect in reducing the value of ν still further (Fig. 3C and 3D). Our results suggest that rod-like PPII segments and the interactions between these segments can significantly influence the structure of the unfolded state.

Conclusion

In conclusion, we used FCS to measure the value of Rh, under denaturing conditions, for a set of repeat proteins with 2 to 20 tandem copies of the same sequence. Our results revealed that the scaling behavior of Rh with N was not as expected for a SARW (Rh α N0.37 vs. the expected Rh α N0.59), indicating a significantly more compact structure. Further investigations showed that there was considerable PPII structure in the denatured proteins. Moreover, we identified the segment of the repeat that formed a PPII helical structure, and showed that PPII was present as regularly spaced, discrete segments throughout the protein. Using simple lattice models of polymers, we showed that chains with a regularly-ordered sequence of short and long links (the latter representing PPII segments) may attain a more compact structure than that of the standard SARW. The incorporation of interactions between PPII segments in the model leads to further compaction of the chains, with size scaling similar to that observed in the experiment. These results present the most comprehensive picture to date of the effect of non-native PPII structure on the nature of the unfolded state ensemble. The formation of non-native structure in the denatured state may have implications for the folding mechanism of CTPR proteins, which we are now exploring35. Finally, since proline-rich segments have been recently shown to be effective in counteracting amyloid fiber formation,36 we hypothesize that PPII structure may in general serve to modulate the aggregation of unfolded proteins.

Materials and Methods

Proteins

CTPR proteins are constructed as a tandem array of different numbers of a 34 amino acid consensus CTPR sequence (sequence of the A-B helices: AEAWYNLGNAYYKQGDYDEAIEYYQKALELDPRS). An additional solvating helix is added to the C-terminus of each protein. This solvating helix (Acap, AKQNLGNAKQKQG) is similar to the A helix but with some hydrophobic residues substituted by hydrophilic residues. The B-A peptide sequence corresponds to a 34 amino acid consensus TPR in which the B helix precedes the A helix (Ac-YDEAIEYYQKALELDPRSAEAWYNLGNAYYKQGD-NH2).

Preparation of labeled proteins

The cysteine-reactive maleimide form of Alexa-488 (Molecular Probes, Inc.) was used to label a unique cysteine inserted at the C-terminus of the proteins CTPR2, CTPR3, CTPR4, CTPR6, CTPR8 and CTPR20. The proteins (100 µM) were reduced by incubation during 15 minutes with 10 mM fresh 1,4-dithio-DL-treitol (DTT). DDT was removed by buffer exchange over a PD-10 column (Amersham Bioscience, Uppsala, Sweden) against 500 mM NaCl, 50 mM phosphate, pH 7.0. A solution of Alexa 488 maleimide in water was added drop-wise into the freshly reduced proteins in presence of 1 mM TCEP and with constant stirring to a final concentration of dye equal to 0.5 mM. The reactions were allowed to proceed for 4 hours at room temperature under continuous stirring, under a non-reducing atmosphere of N2 in the dark. After 4 h an excess of β-mercaptoethanol (5% v/v) was added to consume the remaining of thiol reactive reagent during for 15 minutes at 4°C.

The free dye was removed from the reaction mixtures by two consecutive buffer exchange steps over PD-10 column (Amersham Biosciences) followed by concentration and extensive washing on a Centriprep YM-3 filtration unit (Millipore), using a pH 6.8 buffer with 150 mM NaCl, and 50 mM phosphate (PBS).

The extent of labeling was estimated from the absorption spectra of the proteins using the extinction coefficient for Alexa 488 ε493=72,000 M−1 cm−1 from the Molecular Probes Handbook. The protein concentration was determined using an extinction coefficient calculated from the amino acid composition.37 The absorbance at 280 nm was corrected for the contribution of the dye.

The protein samples were analyzed by mass spectrometry to confirm the labeling of the protein. Matrix-assisted laser desorption ionization-time of flight (MALDI-TOF) analysis was performed on a Voyager-DE PRO Biospectrometry workstation (AB Applied biosystems, Foster City, CA). The protein samples were mixed with nine volumes of the matrix solution (sinapinic acid 10 mg/ml, 50% acetonitrile, 0.1% trifluoro-acetic acid) and spotted onto the MALDI plate. The labeled proteins were frozen and stored in the dark at −80°C.

FCS measurements

FCS measurements were carried out using a laser beam at 488 nm from an Argon-ion laser (Spectra Physics Lasers, Mountain View, CA) as excitation source. The collimated laser beam was focused into the sample, which was mounted on a home-built microscope, by a 60x, 1.2 NA water immersion objective (Olympus, Tokyo, Japan). The laser power intensity was adjusted to 70 µW before the microscope. The beam was directed into the objective by a dichroic mirror (500 DCLP, Chroma, Rockingham, VT) and focused 20 µm into the sample. The fluorescence signal was collected through the same objective and transmitted by the dichroic mirror onto a long-pass filter to eliminate any reflected excitation light. It was then focused onto a 50 µm pin-hole to eliminate out-of-focus photons and divided onto two single-photon avalanche photodiode units (APDs) (Perkin-Elmer Photoelectronics, Fremont,CA) by a 50/50 non-polarizing beam splitter (Unice E-O Services, Chung Li, Taiwan). The signals of the APDs were recorded in the cross-correlation mode for one hour using a Flex02-12D digital real time correlator (Correlator.com, Bridgewater, NJ).

All measurements were carried out on protein samples diluted in PBS buffer to a final concentration of 10 nM, which were transferred into cells constructed from two cover slides and sealed.

Cross-correlation functions were calculated from the signals of the two detectors. The correlation functions presented a fast exponential decay component in the nanosecond to microsecond time scale. In order to calculate the diffusion time of the proteins the correlation curves were fitted using the data from 20 microseconds on, where there is no contribution of photophysical processes, using the following equation:20

G(τ)=1N(1+tτD)1(1+tτD(z0/r0)2)12 (1)

where N is the number of molecules in the focal volume, τD is the diffusion time in the radial direction and r0 and z0 are the radial and axial dimensions of the Gaussian observation volume, respectively. The r0/z0 ratio was obtained for each experimental condition from a measurement of the dye rhodamine 6G (Rh6G), whose diffusion coefficient is known (DRh6G= 280 µm2 s−1).38; Thus, FCS curves were fitted with two free parameters for the diffusion term (τd and N). The diffusion coefficient for each sample was calculated from the τd using the following equation:

τD=r024Dt (2)

using r0 values obtained from the Rh6G measurement. The Stokes-Einstein (SE) relation was used to calculate the hydrodynamic radius (Rh) of the molecules from their diffusion coefficient (Dt) at a given viscosity (η):

Dt=kBT6πηRh (3)

where kB is the Boltzmann constant and T is the absolute temperature.

We also fitted some of the cross-correlation functions over the full time scale including two independent exponentials decays in the equation to consider the triplet state of the dye and the fast process observed in the nanoseconds to microsecond time scale.

G(τ)=1N(1+A1exp(t/τ1)+A2exp(t/τ2))(1+tτD)1+tτD(z0/r0)2 (4)

where τ1 and τ2 are the relaxation times of the two process and A1 and A2 their amplitudes. In this case the FCS curves were fitted with two free parameters for the diffusion term (τd and N) and four parameters for the two exponential decays (A1, A2, τ1 and τ2). The diffusion time results obtained from the fit of the complete correlation function were in agreement with the values obtained from the fit of only the diffusion component of the function.

In order to obtain a well-behaved standard for the assessment of the effect of refractive index changes on apparent diffusion coefficients, we studied the diffusion of Rh6G molecules in all GuHCl solutions. In a simple liquid, the diffusion coefficient is inversely proportional to the viscosity of the medium as described by SE relation (Equation 3). The SE relation implies that the diffusion time relative to that in water, τ/τw should be proportional to the viscosity relative to water viscosity, η/ηw, with a proportionality constant of 1. Fig. 5 shows experimental results from a series of measurements in GuHCl solutions. The full line in this figure is a fit to y=ax which gives a slope of 0.98, in close agreement with the theoretical SE prediction with a slope of 1. Clearly the SE relation holds rather well in all GuHCl solutions, showing that refractive index changes or other optical effects do not significantly affect the measured diffusion times. We also controlled for possible effects of laser power on measured diffusion times42; 43 by repeating some of our measurements at a series of lower powers down to 5 µW. We found slight changes in diffusion time values, which are reflected in error values reported in the paper (see Results). Such changes did not affect to any significance the scaling of Rh with size, which is the main observation in this paper.

Calculation of Rh from crystal structures

The theoretical diffusion coefficients and the hydrodynamic radius for the different length CTPR proteins were calculated from the x-ray crystal structure coordinates using the program Hydropro version 7c21 (http://leonardo.fcu.um.es/macromol/programs/hydropro/hydropro.htm). We used the structures of CTPR8 (PDB_ID: 2AVP)18 and CTPR20 (Kajander et al. submitted paper) previously solved by our group. From the CTPR8 structure we built atomic models for the smaller CTPRs.

Hydropro calculations were performed using the following values in the input file: 3.1 Å for the radius of atomic elements (AER) and a minibead radius from 1 to 5 Å (Sigmin and Sigmax).

Calculation of Rh with rod-like model

Because the crystal structures of long CTPR proteins have a extended rod-like shape, we also calculated their hydrodynamic radii using an expression derived by Ortega and Garcia de la Torre for cylindrical particles.22

Rh=Rh0(1.009+1.395x102(lnp)+7.8880x102(lnp)2+6.040x103(lnp)3) (5)

In this expression Rh0 = L(3/16 p2)1/3 is the hydrodynamic radius of a sphere having the same volume as the cylinder and p = L/d is the aspect ratio between the cylinder length (L) and diameter (d). Using the dimensions of the TPR superhelix obtained from the crystal structure of CTPR8 (L=72 Å/8 repeats and d=38 Å), we calculated the hydrodynamic properties of the CTPRs of different length.

CD spectroscopy

All CD experiments were performed using an AVIV Model 215 CD spectophotometer (AVIV Instruments, Lakewood, NJ) in 150 mM NaCl, 50 mM phosphate buffer pH 6.8 at 25°C. CD spectra of CTPR proteins were acquired at 12.5 µM protein concentration and at 30 µM for the B-A peptide in a 0.2 cm path length cuvette. To enable us to record the CD data at wevelengths below 215 nm shown in Fig. 2B we used higher peptide concentration (760 µM) and shorter path-length cuvettes (0.1 mm). The dynode voltage was monitored at the different wavelengths. From 260 to 203 nm the dynode voltage values were in the optimal range to obtain reliable data (180–500 volts). Only at wavelengths below 203 nm the dynode voltage exceeded 500 volts and the signal-to-noise ratio became poor. The CD spectra were recorded with a band width of 1 nm at 1nm increments and 10 second average time. Molar ellipticity ([Θ]) was calculated using the following equation:

[Θ]=θ10cnl (6)

where θ is the ellipticity measured in millidegrees, c is the molar concentration of the sample (M), n is the number of amino acids in the protein and l is the pathlength in cm.

The estimation of the polyproline II helix (PPII) content from the ellipticity at 228 nm was based on proposed ellipticity values for peptides presenting 100% and 0% PPII helix content.27

Thermal denaturation was monitored at a protein concentration of 12.5 µM by following the ellipticity signal of the PPII at 228 nm from 10°C to 98°C and in the reverse direction from 98°C to 10°C in a 0.2 cm path-length cuvette. The temperature ramp was performed in 1°C steps with an equilibration time at each temperature of 1 min.

Simulations of self avoiding random walking chains on a 3-dimensional lattice

Simulations of a random walking chain with excluded volume (SARW) were conducted on a three dimensional cubic lattice with N monomers. To construct a SARW polymer, each monomer was positioned on a lattice site adjacent to the previous monomer with the constraint that no two monomers can occupy the same lattice site. A polymer chain is complete when N monomers have been placed on the lattice in this self-avoiding fashion. The radius of gyration Rg was calculated as an ensemble average over 106 SARW realizations for each N 44, and this data was used to compute the scaling exponent ν defined through the relation Rg α Nν.

The effect of internal structure on the scaling exponent ν was investigated by including the statistical effects of rigid polyproline segments. Motivated by experimental findings, Np consecutive monomers in the middle of each repeat unit were assigned to represent the polyproline segment, with 1 ≤ Np ≤ 34 (where 34 is the number of monomers in each repeat unit). Monomers that are not in the polyproline segment were placed randomly, as in the SARW. The Np monomers in the polyproline segment were placed such that they all lie in a straight line in one of the six lattice directions and occupy open lattice positions. The orientation of the segment was chosen randomly from the set of all self-avoiding orientations. This model is an extension of the SARW and each repeat unit starts with (34-Np)/2 small steps, followed by one large step of size Np, and ending with (34-Np)/2 small steps. This pattern was continued in the simulations over the entire experimental range of 2–20 repeat units. For each value of Np, the radius of gyration Rg was averaged over an ensemble of 106 realizations and a power law relation was observed between Rg and the number of repeat units. The scaling exponent ν was calculated for each Np from this data.

The effect of interactions between polyproline segments on the exponent ν was modeled by randomly choosing a fraction f of the polyproline segments to have an effective attraction with other polyproline segments. In our model, if a segment is chosen, its orientation is not assigned randomly. Rather, for each possible self-avoiding orientation, the distance ri between the center of the chosen segment and the center of all other existing polyproline segments is calculated. The orientation of the chosen segment is then assigned to be the one that minimizes the sum ∑i(ri2). The radius of gyration Rg was measured and averaged over an ensemble of 105 realizations. A power law relation between Rg and number of repeat units was observed and ν was calculated for 0 ≤ f ≤ 0.8.

Figure 4.

Figure 4

Dependence of the relative translational correlation time of Rh6G molecules on relative viscosity in 0 to 6 M GuHCl solutions (filled circles). The line corresponds to the fit of the data to the equation y=ax, which gives a slope of 0.98, in agreement with the Stokes-Einstein prediction.

Acknowledgements

We thank Professors A. Horovitz, A.D. Miranker and A. Minsky, as well as members of the Regan and Haran Labs for comments and suggestions on the manuscript. ALC was a recipient of an EMBO short-term fellowship. This research is made possible in part by the historic generosity of the Harold Perlman Family, as well as by partial financial support of the US-Israel binational science foundation (grant no. 2002371 to GH), the NIH (grant no. 1R01GM080515-01 to GH) and the Human Frontier Science Program (to GH and LR). Financial support for GL and CSO from NSF grant number DMR-0448838 is gratefully acknowledged.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

*

The series of Rh values of the native proteins could also be fitted to a power law, giving the relation Rh = 3.12×N0.41. Note that the pre-exponent is much smaller than obtained for the denatured protein series. The rather high exponent is due to the elongated structure of the native CTPRs.

Interestingly, the m-values for folding of the CPTRs, especially the larger ones, are smaller than expected based on calculations of accessible surface areas of random coiled denatured proteins of corresponding sizes (data not shown), also indicating compact structures.

A recent measurement of the diffusion coefficient of R6G 39 gave a value which is larger by 30% then the above-mentioned number. However, a survey of literature values of protein hydrodynamic radii (e.g. in ref. 40) and our own measurements 41 both suggest that using the new value will lead to hydrodynamic radii which are consistently smaller not only from calculations directly based on PDB structures, but also from measurements with other methods, such as dynamic light scattering.

Contributor Information

Lynne Regan, Email: lynne.regan@yale.edu.

Gilad Haran, Email: Gilad.Haran@weizmann.ac.il.

References

  • 1.Grosberg AY, Kokhlov AR. Statistical Physics of Macromolecules. New York: AIP Press; 1994. [Google Scholar]
  • 2.Flory PJ. Principles of polymer chemistry. New York: Cornell University Press; 1953. [Google Scholar]
  • 3.Tanford C, Kawahare K, Lapanje S. Proteins in 6M guanidine hydrochloride demonstration of random coil behavior. J. Biol. Chem. 1966;241:1921–1923. [PubMed] [Google Scholar]
  • 4.Wilkins DK, Grimshaw SB, Receveur V, Dobson CM, Jones JA, Smith LJ. Hydrodynamic radii of native and denatured proteins measured by pulse field gradient NMR techniques. Biochemistry. 1999;38:16424–16431. doi: 10.1021/bi991765q. [DOI] [PubMed] [Google Scholar]
  • 5.Uversky VN. Natively unfolded proteins: A point where biology waits for physics. Prot. Sci. 2002;11:739–756. doi: 10.1110/ps.4210102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Kohn JE, Millett IS, Jacob J, Zagrovic B, Dillon TM, Cingel N, Dothager RS, Seifert S, Thiyagarajan P, Sosnick TR, Hasan MZ, Pande VS, Ruczinski I, Doniach S, Plaxco KW. Random-coil behavior and the dimensions of chemically unfolded proteins. Proc. Natl. Acad. Sci. USA. 2005;102:12491–12496. doi: 10.1073/pnas.0403643101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Kazmirski SL, Wong K-B, Freund SMV, Tan Y-J, Fersht AR, Daggett V. Protein folding from a highly disordered denatured state: The folding pathway of chymotrypsin inhibitor 2 at atomic resolution. Proc. Natl. Acad. Sci. USA. 2001;98:4349–4354. doi: 10.1073/pnas.071054398. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Shortle D, Ackerman MS. Persistence of native-like topology in a denatured protein in 8 M urea. Science. 2001;293:487–489. doi: 10.1126/science.1060438. [DOI] [PubMed] [Google Scholar]
  • 9.Klein-Seetharaman J, Oikawa M, Grimshaw SB, Wirmer J, Duchardt E, Ueda T, Imoto T, Smith LJ, Dobson CM, Schwalbe H. Long-Range Interactions Within a Nonnative Protein. Science. 2002;295:1719–1722. doi: 10.1126/science.1067680. [DOI] [PubMed] [Google Scholar]
  • 10.Religa L, Markson JS, Mayor U, Freund SMV, Fersht AR. Solution structure of a protein denatured state and folding intermediate. Nature. 2005;437:1053–1056. doi: 10.1038/nature04054. [DOI] [PubMed] [Google Scholar]
  • 11.Fitzkee NC, Rose GD. Reassesing random-coil statistics in unfolded proteins. Proc. Natl. Acad. Sci. USA. 2004;101:12497–12502. doi: 10.1073/pnas.0404236101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Wang Z, Plaxco KW, Makarov DE. Influence of local, residual structure on the scaling behavior and dimensions of unfolded proteins. Biopolymers. 2007;86:321–328. doi: 10.1002/bip.20747. [DOI] [PubMed] [Google Scholar]
  • 13.D'Andrea L, Regan L. TPR proteins: the versatile helix. Trends Biochem. Sci. 2003;28:655–662. doi: 10.1016/j.tibs.2003.10.007. [DOI] [PubMed] [Google Scholar]
  • 14.Lowe AR, Itzhaki LS. Rational redesign of the folding pathway of a modular protein. Proc. Natl. Acad. Sci. USA. 2007;104:2679–2684. doi: 10.1073/pnas.0604653104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Mello CC, Barrick D. An experimentally determined protein folding energy landscape. Proc. Natl. Acad. Sci. USA. 2004;101:14102–14107. doi: 10.1073/pnas.0403386101. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Main ER, Stott K, SE J, Regan L. Local and long-range stability in tandemly arrayed tetratricopeptide repeats. Proc. Natl. Acad. Sci. USA. 2005;102:5721–5726. doi: 10.1073/pnas.0404530102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Main ERG, Xiong Y, Cocco MJ, D'Andrea L, Regan L. Design of stable alpha-helical arrays from an idealized TPR motif. Structure. 2003;11:497–508. doi: 10.1016/s0969-2126(03)00076-5. [DOI] [PubMed] [Google Scholar]
  • 18.Kajander T, Cortajarena AL, Main ER, Mochrie SG, Regan L. A new folding paradigm for repeat proteins. J. Am. Chem. Soc. 2005;127:10188–10190. doi: 10.1021/ja0524494. [DOI] [PubMed] [Google Scholar]
  • 19.Kajander T, Cortajarena AL, Mochrie SG, Regan L. Structure and stability of a consensus TPR superhelix. Acta Crystallographica. 2007;D63:800–811. doi: 10.1107/S0907444907024353. [DOI] [PubMed] [Google Scholar]
  • 20.Krichevsky O, Bonnet G. Fluorescence correlation spectroscopy: the technique and its applications. Reports on Progress in Physics. 2002;65:251–297. [Google Scholar]
  • 21.Garcia de la Torre ML, Huertas ML, Carrasco B. Calculation of hydrodynamic properties of globular proteins from their atomic-level structure. Biophys. J. 2000;78:719–730. doi: 10.1016/S0006-3495(00)76630-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Ortega A, Garcia de la Torre J. Hydrodynamic properties of rodlike and dislike particles in dilute solution. J. Chem. Phys. 2003;119:9914–9919. [Google Scholar]
  • 23.Rucker AL, Pager CT, Campbell MN, Qualls JE, Creamer TP. Host-Guest scale of left-handed polyproline II helix formation. Proteins: Struct. Funct. Bioinform. 2003;53:68–75. doi: 10.1002/prot.10477. [DOI] [PubMed] [Google Scholar]
  • 24.Woody RW. Circular dichroism and conformation of unordered polypeptides. Adv. Biophys. Chem. 1992;2:37–79. [Google Scholar]
  • 25.Shi Z, Olson CA, Rose GD, Baldwin RL, Kallenbach NR. Polyproline II structure in a sequence of seven alanine residues. Proc. Natl. Acad. Sci. USA. 2002;99:9190–9195. doi: 10.1073/pnas.112193999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Kentsis A, Mezei M, Gindin T, Osman R. Unfolded state of polyalanine is a segmented polyproline II helix. Proteins: Struct. Funct. Bioinform. 2004;55:493–501. doi: 10.1002/prot.20051. [DOI] [PubMed] [Google Scholar]
  • 27.Kelly MA, Chellgren BW, Rucker AL, Trotman JM, Fried MG, Miller A-F, Creamer TP. Host-guest study of left-handed polyprolyne II helix formation. Biochemistry. 2001;40:14376–14383. doi: 10.1021/bi011043a. [DOI] [PubMed] [Google Scholar]
  • 28.Tiffany ML, Krimm S. Extended conformations of polypeptides and proteins in urea and guanidine hydrochloride. Biopolymers. 1973;12:575–587. [Google Scholar]
  • 29.Vlasov PK, Vlasova AV, Tumanyan VG, Esipova NG. A Tetrapeptide-Based Method for Polyproline II-Type Secondary Structure Prediction. Proteins. 2005;61:763–768. doi: 10.1002/prot.20670. [DOI] [PubMed] [Google Scholar]
  • 30.Tiffany ML, Krimm S. Effect of temperature on the circular dichroism spectra of polypeptides in the extended state. Biopolymers. 1972;11:2309–2316. doi: 10.1002/bip.1972.360111109. [DOI] [PubMed] [Google Scholar]
  • 31.Schuler B, Lipman EA, Steinbach PJ, Kumke M, Eaton WA. Polyproline and the "spectroscopic ruler" revisited with single-molecule fluorescence. Proc. Natl. Acad. Sci. USA. 2005;102:2754–2759. doi: 10.1073/pnas.0408164102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Stryer L, Haugland RP. Energy transfer: a spectroscopic ruler. Proc. Natl. Acad. Sci. USA. 1967;58:719–726. doi: 10.1073/pnas.58.2.719. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Makowska J, Rodziewicz-Motowidlo S, Baginska K, Vila JA, Liwo A, Chmurzynski L, Scheraga HA. Polyproline II conformation is one of many local conformational states and is not an overall conformation of unfolded peptides and proteins. Proc. Natl. Acad. Sci. U S A. 2006;103:1744–1749. doi: 10.1073/pnas.0510549103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Camacho CJ, Schanke T. From collapse to freezing in random heteropolymers. Europhys. Lett. 1997;37:603–608. [Google Scholar]
  • 35.Cortajarena AL, Mochrie SG, Regan L. Mapping the energy landscape of repeat proteins using NMR-detected hydrogen exchange. J. Mol. Biol. 2008 doi: 10.1016/j.jmb.2008.02.046. In press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Faber PW, Voisine C, King DC, Bates EA, Hart AC. Glutamine/proline-rich PQE-1 proteins protect Caenorhabditis elegans neurons from huntingtin polyglutamine neurotoxicity. Proc. Natl. Acad. Sci. USA. 2002;99:17131–17136. doi: 10.1073/pnas.262544899. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Pace CN, Vajdos F, Fee L, Grimsley G, Gray T. How to measure and predict the molar absorption coefficient of a protein. Prot. Sci. 1995;4:2411–2423. doi: 10.1002/pro.5560041120. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Magde D, Elson E, Webb WW. Fluorescence correlation spectroscopy. II. An experimental realization. Biopolymers. 1974;13:29–61. doi: 10.1002/bip.1974.360130103. [DOI] [PubMed] [Google Scholar]
  • 39.Culbertson CT, Jacobson SC, Ramsey MJ. Diffusion coefficient measurements in microfluidic devices. Talanta. 2002;56:365–379. doi: 10.1016/s0039-9140(01)00602-6. [DOI] [PubMed] [Google Scholar]
  • 40.Krouglova T, Vercammen J, Engelborghs Y. Correct diffusion coefficients of proteins in fluorescence correlation spectroscopy. Application to tubulin oligomers induced by Mg2+ and Paclitaxel. Biophys. J. 2004;87:2635–2646. doi: 10.1529/biophysj.104.040717. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Sherman E, Itkin A, Kuttner YY, Rhoades E, Amir D, Haas E, Haran G. Using fluorescence correlation spectroscopy to study conformational changes in denatured proteins. Biophys. J. 2008 doi: 10.1529/biophysj.107.120220. In press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Eggeling C, Widengren J, Rigler R, Seidel CAM. Photobleaching of Fluorescent Dyes under Conditions Used for Single-Molecule Detection: Evidence of Two-Step Photolysis. Anal. Chem. 1998;70:2651–2659. doi: 10.1021/ac980027p. [DOI] [PubMed] [Google Scholar]
  • 43.Gregor I, Patra D, Enderlein J. Optical Saturation in Fluorescence Correlation Spectroscopy under Continuous-Wave and Pulsed Excitation. ChemPhysChem. 2005;6:164–170. doi: 10.1002/cphc.200400319. [DOI] [PubMed] [Google Scholar]
  • 44.Wall FT, Erpenbeck JJ. New Method for the Statistical Computation of Polymer Dimensions. J. Chem. Phys. 1959;30:634. [Google Scholar]

RESOURCES