Abstract
Published data on the characterization of unfolded proteins in dilute solutions in aqueous guanidine hydrochloride are analyzed to show that the data are not fit by either the random flight or wormlike chain models for linear chains. The analysis includes data on the intrinsic viscosity, root-mean-square radius of gyration, from small-angle X-ray scattering, and hydrodynamic radius, from the translational diffusion coefficient. It is concluded that residual structure consistent with that deduced from nuclear magnetic resonance on these solutions can explain the dilute solution results in a consistent manner through the presence of ring structures, which otherwise have an essentially flexible coil conformation. The ring structures could be in a state of continual flux and rearrangement. Calculation of the radius of gyration for the random-flight model gives a similar reduction of this measure for chains joined at their endpoints, or those containing loop with two dangling ends, each one-fourth the total length of the chain. This relative insensitivity to the details of the ring structure is taken to support the behavior observed across a range of proteins.
Keywords: protein, radius of gyration, intrinsic viscosity, loop formation
Introduction
In a 2005 article devoted to the discussion of “the case for (and against) residual structure in chemically denatured proteins” the authors cited a number of articles to that date suggesting that “nuclear magnetic resonance (NMR) studies suggest that significant secondary structure and long-range hydrophobic clusters persist in unfolded proteins even at high concentrations of urea or guanidine hydrochloride (GuHCl)”, whereas “intrinsic viscosity, hydrodynamic radii, and small-angle scattering experiments, some of which date back almost four decades, have been taken as evidence that the unfolded state is an effectively random coil ensemble.”1 Since that publication, studies have continued to reveal secondary structure in strongly denatured proteins and polypeptides in dilute solution, mostly based on NMR analysis, with such studies often motivated by interest in the folding of the denatured chain. The aforementioned use of the random flight model, which has dominated most of the interpretations of hydrodynamic behavior of proteins and polypeptides in dilute solution, starting with the seminal studies of Tanford et al.,2 led to the conclusion that the chains may be modeled by an ensemble of random flight chains with excluded volume effects.1,3 One study treated the same data to conclude that the data on the intrinsic viscosity could be represented by a wormlike chain model with the chain having an appreciable persistence length, but negligible excluded volume.4 In this study, we suggest that taken together the experimental data on intrinsic viscosity, hydrodynamic radii, and the radius of gyration suggest a somewhat compact structure of the denatured proteins and polypeptides studied above, possibly owing to intramolecular association to form transient loops among chain elements widely separated along the chain backbone.
Results and Discussion
The published data
The addition of data on the root-mean-square radius of gyration RG for a number of denatured proteins and polypeptides in aqueous (2–6)M GuHCl or (4–8) urea,5 motivates this reconsideration of the previously available data on the intrinsic viscosity [η], second virial coefficient A2, and hydrodynamic radius RH as functions of the molecular weight M in (5–6)M GuHCl. The data on RH = Ξ/6πηsolv, with the molecular friction coefficient Ξ determined from the sedimentation coefficient in the original study,2 have been augmented in recent study by estimates based on NMR studies on denatured proteins in a gradient field and by dynamic light scattering, discussed in detail in Ref. 6; here, ηsolv is the solvent viscosity. The experimental data on these parameters are reproduced in Figure 1, where M is replaced by the number N of residues in the backbone, to more nearly represent the behavior in terms of the chain contour length L.
As noted above, prior treatments based on the behavior of [η], A2, and RH as functions of M have utilized a random flight flexible model for linear chains or a wormlike chain model in an attempt to elucidate the dependence of RG on M, and the value (RG/√M)θ of RG/√M under Flory theta conditions, for which A2 = 0. As the GuHCl solvent system is found to be a thermodynamically good solvent for the proteins studied, with MA2 ≈ [η], as often observed for solutions of flexible or wormlike chains,7 the evaluation of (RG/√M)θ from the dependence of [η] on M requires the use of models and extrapolations to zero M, both of which introduce opportunities for error, especially in view of the relatively narrow range of M for which data are available. For example, the analyses with the random flight and wormlike chain models, though quite different in conformational interpretations, provide reasonable fits to the data on [η] as a function of M in the aqueous GuHCl solutions.2,4 However, both treatments provide estimates of (RG/√M)θ that are too large in comparison with the recent experimental data on RG/√M,5 unavailable to those earlier studies.
As may be seen in Figure 1, although the data are somewhat scattered, it appears that ∂ln [η]/∂ln M = 0.640, ∂ln RG/∂ln M = 0.597, and ∂ln RH/∂ln M = 0.516. If interpreted alone, the value of ∂ln RG/∂ln M would indicate a very strong intramolecular excluded volume effect, and that has been the usual interpretation.1,5 Similarly, if taken alone the values of ∂ln [η]/∂ln M and ∂ln RH/∂ln M would each represent a more modest excluded volume effect. As mentioned above the data in the intermolecular thermodynamics given by the dependence of A2 versus M would suggest a so-called good solvent, consistent with a substantial intramolecular excluded volume for a flexible-coil chain, but irrelevant to such behavior for rodlike or even wormlike chains with a large persistence length â.7 The significance of these possibly disparate tendencies is considered next.
An analysis involving [η]
In a previous study, theoretical expressions for the parameter K were presented for a number of models, including linear and branched random flight chains, linear rodlike, and wormlike chains and ellipsoids of revolution (including spheres), where8
(1) |
with NAV the Avogadro number. Of course, K is trivially related to the more familiar Flory-Fox parameter φ′ given by9
(2) |
so that φ′ = πNAV(RH/RG)K includes the dependence of RH on M encoded into the definition for K such that for linear flexible or wormlike chain models K spans a much smaller range of values and is much more weakly dependent on M than is φ′. For example, calculations with the random flight or wormlike chain models without intramolecular excluded volume show that K0 ≤ K ≤ K∞, with K0 and K∞ the limiting values of K as M tends to zero and infinity, respectively; for example, K0 = 1 and K∞ ≈ 3 for linear chains, depending on the details of the model used.8 Values of K∞ for branched chains tend to exceed that for linear molecules, as does K∞ for the random flight or wormlike chain models for ring-shaped molecules without intramolecular excluded volume, for example, for a ring-shaped molecule, K∞,RING = 1.519K∞,LIN, with K∞,LIN the value for the same model for a linear chain.10
Values of K calculated as a function of Mw for the data on proteins and polypeptides for the data in Figure 1 are shown in Figure 2, along with data on synthetically prepared well-defined model linear and ring-shaped macromolecules. In fact, there are not many such studies, owing principally to the difficulty of preparing well-defined models. Two studies on ring-shaped polymers prepared by the anionic polymerization of styrene, followed by ring closure give comparable results.11–13 Thus, for linear polystyrene, the experimental K ≈ K∞ is constant for 104 < M < 106 (K ≈ 2.6) under Flory theta conditions, and in a good solvent (toluene) it decreases from this value as K ∝ M−0.04 with increasing M, reflecting the differing effects of intramolecular excluded volume on [η] and .14–16 Neither of these studies provided the behavior in good solvents needed to compute K. The data on the ring-shaped polystyrene determined under Flory Theta conditions depart from expectations in two ways: (i) KRING is not expected to depend on M under these conditions and (ii) KRING is smaller than the value 3.95 expected by comparison with KLIN for the same polystyrene. The unexpected behavior may reflect the presence of linear chains remaining from the difficult synthesis of the ring-shaped polymers, as observed in some chromatography studies.17 The data on proteins and polypeptides are seen to be scattered, but with a tendency to decrease from about 3.4 to 2.7 with increasing M over the range studied, with K ∝ 10−0.07. As the protein and polypeptide samples used in the various studies differ, resort is made to the smoothed correlations shown in Figure 1 to compute K. It is seen that the resulting K for the large M limit is comparable to K∞ for a linear random-flight or wormlike chain for large M under Flory theta conditions, but increases to larger values with decreasing M, so that the overall pattern is similar to that seen in Figure 1 for ring-shaped polystyrenes. The effects on K or related to those note on the Flory-Fox parameter φ′ in recent considerations of size exclusion chromatography on denatured proteins.18,19 It may also be noted in Figure 1 that RH ≈ RG for the lower values of M. Although one can conclude that the protein conformation appears to be more compact than a linear random flight or wormlike chain at the lower range of N, and that such behavior is not an indication of free draining hydrodynamics, which would decrease K, it is not possible to interpret these data in terms of a definitive model. Possibilities are discussed in the following.
The effects of association among chain residues
Intermolecular association would lead to branch-shaped aggregates, and as mentioned above, K will tend to be increased above that for the linear counterpart, offering one possible interpretation. However, this possibility is discounted as the existence of such intermolecular association would be expected to have characteristic features in the dependence of the viscosity and light (or X-ray) scattering behavior, not reported for the studies cited.
Studies by NMR and MCD over the past decade have revealed that the so-called denatured state of proteins and polypeptides contain locally ordered sections of a helical nature, even in solvents that strongly suppress hydrogen bonding.1,20–22 This includes evidence that these sections may be involved in intramolecular association among units far removed along the chain backbone, forming loops that flicker in and out of existence. As observed above, the result that K for the lower molecular weight proteins in aqueous GuHCl solutions is comparable to the behavior seen for the ring-shaped polystyrene, suggesting the presence of ring-shaped molecules or chains with containing loops, at least in that range of M. As with the data on polystyrene, the modestly decreasing value of K noted with the data on the protein solutions in aqueous GuHCl solution may reflect a modest intramolecular excluded volume effect, with the higher value obtained at low M closer to the limit that would be observed under Flory theta solvent conditions.
The data on RG/N1/2 and RH/N1/2 for the data in Figure 1 are shown in Figure 3, along with data on (RG/N1/2)STAT computed via statistical coil model for the unfolded state of a range of proteins as a function of N, including excluded volume effects, and taking account of conformational constraints consistent with NMR analysis revealing (non-H-bond stabilized) helical sequences in aqueous GuHCl solution,23 and a constant value of (RG/N1/2)RIS calculated for the rotational-isomeric-state model in the limit of large N.24 It may be seen that the experimental values of RG/N1/2 are about equal to √0.7(RG/N1/2)STAT. Thus, if (RG/N1/2)STAT is presumed to be an accurate measure of RG/N1/2 for the chain without any intramolecular association producing ring structures, , larger than the value g = 1/2 expected for a pure ring-shaped polymer. Although not impossible and perhaps possible for some unfolded proteins,25 it would perhaps be unlikely for transient intramolecular hydrophobic clusters to form rings by association of their endgroups, so it is of interest to consider values of g for a model with a loop formed by association of more generally placed repeating units.
A simplified model with the results shown in Figure 4 comprises a chain with an association of residues each a distance xL from the ends of a chain of length L, resulting in a loop with (1 − 2x)N residues, or length (1 − 2x)L, and two “branches,” each with xL residues, emanating from the crosslink locus. Use of the random flight model to compute g including the constraint imposed by the effective intrachain crosslink site according to the usual expression for a chain with n statistical segments of length b = L/n:26
(3) |
where the sums are over all possible segments i and j, consistent with the crosslink constraint. The calculation may be conveniently accomplished by summing the contributions segment pairs confined to several components of the total structure: (1) a loop with n – 2xn segments, (2) a linear chain with 2xn segments made from the two branches, (3) four linear chains, each with n/2 segments, comprising a 1/2 loop + one branch, and (4) less the over-count from four linear chains, each with (n – 2xn)/2 segments from four 1/2 loops and four linear chains, each with xn segments from four branches. The calculation gives g as a function of the fraction 1 – 2x of the chain in the loop as shown in Figure 4, with
(4) |
As required, g is unity or one-half for fractions of residues in the loop equal to zero (x = 1/2) or unity (x = 0), respectively. Although this model is certainly too simplified to permit a definitive quantitative conclusion, it may be noted that the observed g ≈ 0.7 would correspond to about a fraction 0.25–0.3 of the residues in the loop. However, even if the conclusion cannot be considered to be quantitatively accurate, the existence of such transient structures with a life-time long enough to be seen in the parameters of interest here would seem to be reasonable in view of the properties found in the NMR studies and could be relevant to the refolding process of the denatured protein.
Conclusions
Based on the preceding analysis of published data on the characterization of proteins in dilute solutions in aqueous GuHCl, it is concluded that the data are not fit by either the random flight or wormlike chain models for linear chains, but can be rationalized if it is assumed that transient intramolecular association form temporary ring conformers among the chain units. The analysis includes data on the intrinsic viscosity, root-mean-square radius of gyration, from small-angle X-ray scattering, and hydrodynamic radius, from the translational diffusion coefficient. Such residual ring structures in a state of continual flux and rearrangement in otherwise unfolded proteins are consistent with behavior deduced from NMR on these solutions. Calculation of the radius of gyration for the random flight model gives a similar reduction of this measure for chains joined at their endpoints, or those containing loop with two dangling ends, each one-fourth the total length of the chain. This relative insensitivity to the details of the ring structure is taken to support the behavior observed across a range of proteins.
Glossary
Abbreviations:
- NMR
nuclear magnetic resonance;
References
- 1.McCarney ER, Kohn JE, Plaxco KW. Is there or isn't there? The case for (and against) residual structure in chemically denatured proteins. Crit Rev Biochem Mol Biol. 2005;40:181–189. doi: 10.1080/10409230591008143. [DOI] [PubMed] [Google Scholar]
- 2.Tanford C, Kawahara K, Lapanje S. Proteins as random coils. I. Intrinsic viscosities and sedimentation coefficients in concentrated guanidine hydrochloride. J Am Chem Soc. 1967;89:729–736. [Google Scholar]
- 3.Tanford C. Protein denaturation. C. Theoretical models for the mechanism of denaturation. Adv Protein Chem. 1970;24:1–95. [PubMed] [Google Scholar]
- 4.Bohdanecky M, Petrus V. Analysis of hydrodynamic data for denatured globular proteins in terms of the wormlike cylinder model. Int J Biol Macromol. 1991;13:231–234. doi: 10.1016/0141-8130(91)90078-9. [DOI] [PubMed] [Google Scholar]
- 5.Kohn JE, Millett IS, Jacob J, Zagrovic B, Dillon TM, Cingel N, Dothager RS, Seifert S, Thiyagarajan P, Sosnick TR, Hasan MZ, Pande VS, Ruczinski I, Doniach S, Plaxco KW. Random-coil behavior and the dimensions of chemically unfolded proteins. Proc Natl Acad Sci USA. 2004;101:12491–12496. doi: 10.1073/pnas.0403643101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Zhou H-X. Dimensions of denatured protein chains from hydrodynamic data. J Phys Chem B. 2002;106:5769–5775. [Google Scholar]
- 7.Casassa EF, Berry GC. Polymer solutions. In: Allen G, editor. Comprehensive polymer science. Vol. 2. New York: Pergamon Press; 1988. Ch 3. [Google Scholar]
- 8.Berry GC. Remarks on a relation among the intrinsic viscosity, the radius of gyration, and the translational friction coefficient. J Polym Sci Part B: Polym Phys. 1988;26:1137–1142. [Google Scholar]
- 9.Fox TG, Jr, Flory PJ. Intrinsic viscosity relations for polystyrene. J Am Chem Soc. 1951;73:1915–1920. [Google Scholar]
- 10.Fukatsu M, Kurata M. Hydrodynamic properties of flexible-ring macromolecules. J Chem Phys. 1966;44:4539–4545. [Google Scholar]
- 11.Roovers J. Dilute-solution properties of ring polystyrenes. J Polym Sci Polym Phys Ed. 1985;23:1117–1126. [Google Scholar]
- 12.Hadziioannou G, Cotts PM, ten Brinke G, Han CC, Lutz P, Strazielle C, Rempp P, Kovacs AJ. Thermodynamic and hydrodynamic properties of dilute solutions of cyclic and linear polystyrenes. Macromolecules. 1987;20:493–497. [Google Scholar]
- 13.McKenna GB, Hadziioannou G, Lutz P, Hild G, Strazielle C, Straupe C, Rempp P, Kovacs AJ. Dilute solution characterization of cyclic polystyrene molecules and their zero-shear viscosity in the melt. Macromolecules. 1987;20:498–512. [Google Scholar]
- 14.Einaga Y, Koyama H, Konishi T, Yamakawa H. Intrinsic viscosity of oligo- and polystyrenes. Macromolecules. 1989;22:3419–3424. [Google Scholar]
- 15.Dehara K, Yoshizaki T, Yamakawa H. Translational diffusion coefficient of oligo- and poly(methyl methacrylates) in dilute solutions. Macromolecules. 1993;26:5137–5142. [Google Scholar]
- 16.Abe F, Einaga Y, Yoshizaki T, Yamakawa H. Excluded-volume effects on the mean-square radius of gyration of oligo- and polystyrenes in dilute solutions. Macromolecules. 1993;26:1884–1890. [Google Scholar]
- 17.Cho D, Park S, Kwon K, Chang T, Roovers J. Structural characterization of ring polystyrene by liquid chromatography at the critical condition and MALDI-TOF mass spectrometry. Macromolecules. 2001;34:7570–7572. [Google Scholar]
- 18.Dondos A. Applicability of the modified universal calibration of gel permeation chromatography on proteins. J Chromatogr. 2006;1127:183–186. doi: 10.1016/j.chroma.2006.06.012. [DOI] [PubMed] [Google Scholar]
- 19.Dondos A. Determination of Flory's parameter F for proteins based on the modified universal calibration of the gel permeation chromatography. Biomacromolecules. 2007;8:2979–2981. doi: 10.1021/bm700508y. [DOI] [PubMed] [Google Scholar]
- 20.Eliezer D. Characterizing residual structure in disordered protein states using nuclear magnetic resonance. Methods Mol Biol (Clifton, NJ) 2007;350:49–67. doi: 10.1385/1-59745-189-4:49. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Mittag T, Forman-Kay Julie D. Atomic-level characterization of disordered protein ensembles. Curr Opin Struct Biol. 2007;17:3–14. doi: 10.1016/j.sbi.2007.01.009. [DOI] [PubMed] [Google Scholar]
- 22.Chugh J, Sharma S, Hosur RV. Pockets of short-range transient order and restricted topological heterogeneity in the guanidine-denatured state ensemble of GED of dynamin. Biochemistry. 2007;46:11819–11832. doi: 10.1021/bi701280p. [DOI] [PubMed] [Google Scholar]
- 23.Jha AK, Colubri A, Freed KF, Sosnick TR. Statistical coil model of the unfolded state: resolving the reconciliation problem. Proc Natl Acad Sci USA. 2005;102:13099–13104. doi: 10.1073/pnas.0506078102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Miller WG, Brant DA, Flory PJ. Random coil configurations of polypeptide copolymers. J Mol Biol. 1967;23:67–80. [Google Scholar]
- 25.Bernado P, Bertoncini CW, Griesinger C, Zweckstetter M, Blackledge M. Defining long-range order and local disorder in native alpha -synuclein using residual dipolar couplings. J Am Chem Soc. 2005;127:17968–17969. doi: 10.1021/ja055538p. [DOI] [PubMed] [Google Scholar]
- 26.Zimm BH, Stockmayer WH. The dimensions of chain molecules containing branches and rings. J Chem Phys. 1949;17:1301–1314. [Google Scholar]