Skip to main content
Protein Science : A Publication of the Protein Society logoLink to Protein Science : A Publication of the Protein Society
. 2016 Jan 5;25(3):734–747. doi: 10.1002/pro.2865

Dimensions, energetics, and denaturant effects of the protein unstructured state

Maodong Li 1, Zhirong Liu 1,
PMCID: PMC4815411  PMID: 26683260

Abstract

Determining the energetics of the unfolded state of a protein is essential for understanding the folding mechanics of ordered proteins and the structure–function relation of intrinsically disordered proteins. Here, we adopt a coil‐globule transition theory to develop a general scheme to extract interaction and free energy information from single‐molecule fluorescence resonance energy transfer spectroscopy. By combining protein stability data, we have determined the free energy difference between the native state and the maximally collapsed denatured state in a number of systems, providing insight on the specific/nonspecific interactions in protein folding. Both the transfer and binding models of the denaturant effects are demonstrated to account for the revealed linear dependence of inter‐residue interactions on the denaturant concentration, and are thus compatible under the coil‐globule transition theory to further determine the dimension and free energy of the conformational ensemble of the unfolded state. The scaling behaviors and the effective θ‐state are also discussed.

Keywords: protein denaturation, single‐molecule FRET, transfer model, binding model, coil‐globule transition


Abbreviation

C state

maximally collapsed denatured state

GdmCl

guanidinium chloride

m‐value

the slope of the empirical linear relation between the free energy of unfolding and the denaturant concentration

N state

native or folded state

Rg

radius of gyration

SASA

solvent accessible surface area

SAXS

small‐angle X‐ray scattering

smFRET

single‐molecule fluorescence resonance energy transfer

U state

unfolded or denatured state

Introduction

Most proteins can spontaneously fold into a specific conformation, called the native (folded) state (N), as a prerequisite to perform their biological functions.1, 2, 3 The knowledge of how the native state is stabilized over the unfolded (denatured) state (U) is important in both understanding the molecular mechanism of biological systems and in developing therapies against protein‐misfolding diseases such as Alzheimer's, Parkinson's, Huntington's, and amyotrophic lateral sclerosis.4, 5, 6 Traditionally, chemical denaturants such as guanidinium chloride (GdmCl) and urea were used widely as probes to study protein stability. The free energy of unfolding ( ΔGNU) has been well documented to depend linearly on denaturant concentration (D):7

ΔGNU(D)=ΔGNU(0)mD, (1)

where the intercept ΔGNU(0) represents the free energy in the absence of denaturant, and the slope m reflects the extent of the influence of the denaturant on the specific protein. However, the exact microscopic mechanism is still in debate.8, 9, 10 The most fundamental disagreement lies in whether denaturant molecules interact directly with proteins or not. In the binding model, for example, it was assumed that denaturant molecules bind directly to the protein at specific sites with certain association constants,11 which results in a logarithmic dependence between the free energy and the denaturant activity (a):

ΔGNU(a)=ΔGNU(0)RTiΔniln(1+Kia), (2)

where K i is the effective association constant of the type‐i binding site, Δn i is the effective number of the binding sites, R is the gas constant, and T is the temperature. In the indirect interacting approach, for example, the transfer model, it is assumed that the denaturant molecules change the water structure (aqueous environment) and the effective properties of the solvent linearly depend on D, which immediately gives Eq. (1) based on the proposition of group additivity.12, 13, 14 In practice, the debate is difficult to adjudicate because the denaturation ability of usual denaturants is weak and their working concentration is so high that denaturant molecules will certainly find themselves in close proximity to the protein chain.8

The recent development of single‐molecule fluorescence resonance energy transfer (smFRET) makes it possible to determine the property of the denatured state under equilibrium conditions of coexistence with the native state, and thus provides a unique opportunity to reexamine the molecular mechanism of the denaturant effect in protein folding.8, 15, 16 In 2009, Nettels et al. showed that the radius of gyration (R g) of the denatured state as a function of the denaturant activity can be well fitted with a binding‐model equation and a sole effective binding constant K.17 If such an empirical fitting reflects the microscopic process, the obtained K can be combined with the unfolding free energy from conventional experiments to determine the binding‐site number Δn in Eq. (2), which is difficult, if not impossible, to determine previously. On the other hand, in the same year, Ziv and Haran proposed that the denatured state can be described by a coil‐globule transition theory of a polymer and suggested that the expansion of the denatured state should be incorporated into the transfer model.18 Remarkably, their analysis on smFRET data of the denatured state showed that the free energy of the coil‐to‐globule collapse depends linearly on the denaturant concentration and the resulting slope agrees very well with the m‐value in Eq. (1) of protein folding. This is quite astonishing when taking into consideration that m is expected to be related to both native and denatured states while only the denatured state was measured in smFRET.

Although there have been many studies on the interplay between smFRET and the denaturant effect,14, 15, 16, 17, 18, 19, 20, 21, 22, 23 some basic questions remain to be answered. For example, are the binding model and the transfer model compatible with each other in interpreting experimental smFRET data? What is the crucial information that can be extracted from smFRET of the denatured state? In addition, there is partial contradiction in the studies from different groups. For example, the results of Ziv and Haran suggest that the θ‐state (where intrachain and chain‐solvent interactions balance such that the polymer appears as an ideal chain) occurs at a denaturant concentration of D = 2∼6M,18 while Hofmann et al. claim that this value occurs at D0 M.20 To clarify these questions, more elaborate studies are necessary.

In this article, we adopt the coil‐globule transition theory to develop a general scheme to extract interaction and free energy information from smFRET data. Experimental data of a series of proteins are analyzed in detail. The free energy difference between specific and nonspecific interactions reveals the driving force of protein folding. Both the transfer model and the binding model are found to explain the extracted intrachain interactions, and are therefore compatible under the coil‐globule transition theory in describing protein behaviors in the denatured state.

Results

m‐Value can be extracted from smFRET data with a coil‐globule transition theory

The behaviors of the denatured state are described by a modified coil‐globule transition theory. Details were provided in Models and Methods section, and a flow chart of data analysis is summarized in Figure 1. With a set of smFRET data on different denaturant concentration D, both the dimensions and energetics can be readily extracted as explained in the follows.

Figure 1.

Figure 1

The flow chart of data analysis under a coil‐globule transition theory. Measured smFRET data ( E¯D), total and labeled lengths of protein (N and n) and generic model parameters (ρ 0 and 3) are included in dashed frames as known quantities. Equations used are shown in rectangles. Input or obtained relations between various quantities are shown in ellipses. Core results are highlighted with shadows. See Models and Methods section for details.

In our model, the protein is regarded as a polymer chain with excluded volume ( 3) and effective short‐range interactions (ε, positive for attraction) among residues (monomers). When the effects of intrachain attraction and solvent–chain interaction cancel each other completely, the system reaches the θ‐state as polymers and the resulting ideal chain is specified by a parameter, ρ 0, in its scaling law Rg,θ=ρ0N1/2, where R g,θ is the root mean squared radius of gyration and N is the residue number of the protein. ρ 0 and 3 are fixed parameters for various proteins while ε depends on both the protein sequence and environment conditions such as denaturant concentration and temperature. If ρ 0 and 3 are known, ε can be extracted from smFRET data and be further used to calculate the free energy of the system (see Models and Methods section).

Using six smFRET datasets of proteins (F1−F5 and G1, see Table 1) in the literature, we optimized the parameters ρ 0 and 3 in the coil‐globule transition theory by making the resulting free energy to be consistent as possible with the m‐value of proteins from usual thermodynamic measurements (see Fig. S1, Supporting Information). As a result, we obtain ρ 0 = 0.34 nm and 3 = 0.030 nm3. Ziv and Haran derived 3 from R g of the native (collapsed) state recorded in the Protein Data Bank (PDB) by Rg,native=Nbσ3 and thus 3 varied with protein.18 In our approach, we assume ρ 0 and 3 to be constant and calculated the R g of the collapse state and θ‐state from the scaling law, which is convenient for the analysis of proteins whose PDB structure is not available, for example, intrinsically disordered proteins (IDPs). For proteins discussed in the work by Ziv and Haran, our parameters are close to their derived values (see Figs. S2−S3, Supporting Information). The results of our analysis are summarized in Figure 2. A linear dependence between ε and D is found, confirming the observation of Ziv and Haran.18 Similar linearity is also found in other datasets we studied. The slope and intercept of εD for all 37 datasets are listed in Table 1. Although we assumed ρ 0 and 3 to be constant among different proteins, the resulting free energy based on smFRET data (scattering symbols in the right columns in Fig. 2) is linear over a very broad range of D, and the slope agrees well with the m‐value from conventional thermodynamic measurements (solid lines in the right columns in Fig. 2) for all six proteins, supporting the validity of our approach. Examination on the parameter sensitivity indicates that the calculated slope decreases with increasing ρ 0 and 3, but the dependence on 3 is relatively weaker (see Fig. S5, Supporting Information). Hofmann et al. also adopted the scaling law in estimating R g of the collapse state and θ‐state, but with their corresponding parameters, the slope of the resulting free energy based on smFRET data deviates from the m‐value (see Fig. S4, Supporting Information). In addition, a protein can be labeled with a FRET donor–acceptor pair at different positions. We find that different labels give consistent ε values when the sequence length between the labeled positions is used for N to fit smFRET data (see Figs. S7–S8).

Table 1.

List of the Datasets and the Determined Properties

Number Protein N n Q + Q m ref Ref. kε ε 0 m cal R g (D ≈ 6 M)
A1 IN 60 57 6 14 22 −0.0727 1.40 1.76 2.88
A2 ProTαN 129 55 5 23 22 −0.0452 1.19 2.46 3.21
A3 ProTαC 129 55 5 36 22 −0.0366 1.21 2.03 3.25
A4 CspTm 67 54 8 14 22 −0.101 1.48 3.04 2.87
B1 hCyp 167 164 20 22 20 −0.0431 1.36 3.26 4.31
B2 hCyp 167 153 18 21 20 −0.0434 1.36 3.43 4.18
B3 hCyp 167 123 12 18 20 −0.0469 1.38 4.46 3.83
B4 hCyp 167 112 12 16 20 −0.0478 1.37 4.22 3.82
B5 hCyp 167 97 10 13 20 −0.0425 1.39 3.53 3.44
C1 IN 60 57 6 14 20 −0.0655 1.39 1.58 2.76
C2 ProTαN 129 55 5 23 20 −0.0364 1.20 1.89 3.14
C3 ProTαC 129 55 5 36 20 −0.0327 1.22 1.82 3.22
D1 CspTm 71 67 11 15 20 −0.0622 1.51 1.63 2.82
D2 CspTm 70 59 8 14 20 −0.0755 1.52 1.85 2.65
D3 CspTm 70 46 5 12 20 −0.0690 1.54 1.72 2.48
D4 CspTm 72 34 5 10 20 −0.0571 1.59 1.61 2.14
D5 CspTm 34 34 5 10 20 −0.0794 1.59 1.00 2.08
E1 R17‐93 116 94 11 23 20 −0.0515 1.35 2.39 3.51
E2 R15‐93 114 94 13 20 20 −0.0787 1.44 3.29 3.30
E3 R17‐60 116 61 10 14 20 −0.0510 1.42 2.33 2.93
E4 R15‐60 114 61 10 13 20 −0.0534 1.47 2.10 2.86
F1 Barstar 90 78 6 18 3.13 24 −0.116 1.82 2.72 2.66
F2 Protein L 65 65 6 14 3.22 25 −0.117 1.57 3.34 3.09
F3 CspTm 66 66 11 15 2.93 25 −0.109 1.54 3.06 3.08
F4 Protein L 64 64 6 14 2.50 23 −0.0777 1.73 1.52 2.54
F5 RNaseH 155 133 15 23 5.61 21 −0.103 1.46 6.17 4.09
G1 CspTm 67 66 11 15 2.43 19 −0.124 1.42 3.68 3.26
G2 CspTm 67 58 8 14 19 −0.123 1.65 3.33 2.92
G3 CspTm 67 47 5 12 19 −0.113 1.54 3.17 2.67
G4 CspTm 67 46 5 12 19 −0.106 1.59 2.89 2.59
G5 CspTm 67 34 5 10 19 −0.084 1.52 2.28 2.39
a1 IN 60 57 6 14 22 −0.062 1.27 1.64 2.90
a2 ProTαN 129 55 5 23 22 −0.060 0.96 3.71 3.60
a3 ProTαC 129 55 5 36 22 −0.160 0.22 10.1 4.51
a4 CspTm 67 54 8 14 22 −0.083 1.79 1.43 2.37
b1 Barstar 90 78 7 18 1.90 24 −0.060 1.60 1.51 2.63
b2 Im9 86 59 5 16 1.69 26 −0.072 1.50 2.64 2.88

Number: the index number for datasets. Datasets using the denaturant GdmCl are labeled in capital letters, while those using urea are labeled in lowercase letters. Protein: the abbreviated name of the protein as in the original references. N: the total length of the protein in number of residues. n: the sequence length of the protein between the labeled dyes. Q +: the positive charge of the sequence between the labeled dyes. Q : the negative charge of the sequence between the labeled dyes. The charges of the dyes are also included in Q + and Q . m ref: the m‐values determined by usual thermodynamic measurements as reported in original references (refer to Ziv and Haran18). k ε and ε 0: the slope and intercept of the εD linear fit, that is, ε = ε 0 + k ε D. m cal: the m‐values determined by the coil‐globule transition theory. ε and ε 0 are measured in units of k B T, while m ref, m cal, and m cal are measured in units of RT/M. R g (D ≈ 6M): R g (in units of nm) for the sequence between dyes obtained from smFRET experiments under a denaturant concentration of D ≈ 6M.

Figure 2.

Figure 2

Coil‐globule transition properties of six protein datasets: (from top to bottom) F1 (Barstar), F2 (Protein L), F3 (CspTm), F4 (Protein L), F5 (RNaseH), and G1 (CspTm). References for the datasets are given in Table I. The properties are extracted from smFRET data of the denatured state by the coil‐globule transition method with the optimized parameters of ρ 0 = 0.34 nm and 3 = 0.030 nm3. The properties are given as a function of the GdmCl concentration D (in units of M). The left panels give the root mean squared R g, that is, Rg21/2, in units of nm, where the value of R g,θ is indicated by solid lines. Middle panels give the mean interaction energy ε in k B T units, where linear fits to scattering data points are shown as solid lines. The right panels give the protein stability ΔGNU/RT, where ΔGNU is calculated by Eqs. (15–19) with the intercept being adjusted to match the conventional thermodynamic measurements, and the linear behaviors determined by the conventional thermodynamic measurements are shown in solid lines for comparison.

Expansion of the denatured state has small contribution to the m‐value

Now we examine the origin of the m‐value. The linear dependence between ε and D may be regarded as a direct consequence of the transfer model. In most applications of the transfer model, the solvent accessible surface area (SASA) of the denatured state is assumed to be independent of D. In the language of the coil‐globule transition theory we studied, the average value of volume occupation fraction, ϕ¯, would remain approximately constant with changing D (see Fig. S9, Supporting Information). Under such an approximation, the differentiation of the unfolding free energy [refer to Eqs. ((15), (16), (17), (18), (19)) in Models and Methods section] with respect to D immediately gives

ΔGNU(ε)D=12N(1ϕ¯)εD, (3)

which is obviously independent of D and equal to −m. However, as Ziv and Haran have indicated,18 the denatured state exhibits a continuous expansion (which reflects in the Rg21/2 curve in the left panels of Fig. 2) with increasing D. Therefore, one can define the D‐dependent m from Eqs. ((15), (16), (17), (18), (19)) as:

m(D)=ΔGNU(ε)D=12N(1ϕ¯)εD+12Nεϕ¯DD[NkBT1ϕ¯ϕ¯ln(1ϕ¯)]mε+mϕ¯,H+mϕ¯,S, (4)

where mε denotes the contribution from the variation of ε with D when the variation of ϕ¯ is ignored, while mϕ¯,H and mϕ¯,S denote the enthalpic and entropic contributions from the variation of ϕ¯ with D. Taking the protein CspTm as an example and adopting the linear εD relation previously obtained in Figure 2, different contributions to m (D) are analyzed in Figure 3. The magnitude of mϕ¯,H and mϕ¯,S changes significantly with D; that is, it will cause a nonlinear dependence of ΔGNU on D if mϕ¯,H or mϕ¯,S acts alone [Fig. 3(a)]. However, they adopt negative and positive signs, respectively, and thus compensate each other to leave a small contribution to m. In contrast, mε changes negligibly with D [owing to the small ϕ¯ within the considered D range, refer to Fig. 3(b) and the expression for mε in Eq. (4)], and forms the main contribution to m and the linear dependence of ΔGNU on D. [It is also noted that m decreases at small D in Fig. 3(a), which actually results from both the variation of mε due to the variation of ϕ¯ and the incomplete compensation between mϕ¯,H and mϕ¯,S.] An examination of the correlation between mε and m (Fig. 4) further confirms that m is dominated exclusively by the contribution of mε. Therefore, although the denatured state exhibits an expansion ( ϕ¯ decreases with D), its enthalpic and entropic contributions to the folding free energy compensate each other and the overall effect can be neglected. That is the reason why the transfer model can be safely applied under a fixed denatured state in usual cases.

Figure 3.

Figure 3

Origin of the m‐value. The protein CspTm is examined and the linear εD relation obtained in the dataset F3 is adopted in the calculation. (a) Different contributions to m (D) (in units of RT), which are calculated by Eq. (4). (b) The average value of the volume occupation fraction, ϕ¯, as a function of D. This property can be widely observed in different proteins (Fig. S8, Supporting Information).

Figure 4.

Figure 4

Agreement between the calculated m and mε. Six systems in Fig. 1 are analyzed with the resulting data shown in squares. mε is calculated with representative ϕ¯ for each system. The solid line is the ideal case of mε=m.

Difference between specific and nonspecific interactions can be extracted by combining smFRET and conventional equilibrium experiments

The coil‐globule transition is regarded as an important process during the initial stage of protein folding.27, 28, 29 In relating the coil‐globule transition to the folding stability, the maximally collapsed denatured state (C) with ϕ¯=1 is introduced as a thermodynamic reference state and its free energy difference with U can be readily calculated by Eqs. ((15), (16), (17)). Although C has similar compactness with N, it is not a unique conformation, but is composed of an ensemble of difference conformations all with ϕ¯=1, and is dominated by nonspecific interactions. On the other hand, N is a unique conformation and is stabilized by specific interactions as hinted by the success of the Gö‐like model.30, 31 In conventional equilibrium methods, the free‐energy difference between C and N, ΔGCN, is impossible to measure because C is usually inaccessible in experiments. Since the ΔGCU can now be extracted from smFRET data with the help of the coil‐globule transition theory, ΔGCN can be determined via ΔGCN=ΔGCUΔGNU, where ΔGNU is the protein stability determined in conventional equilibrium methods. This may provide valuable information about interplay between specific and nonspecific interactions in protein folding, which is crucial in incorporating non‐native interactions into native‐centric chain models.30, 32, 33

The determined free energy difference between C and N is given in Figure 5. The G C state is found to have similar variation with G N as a function of D [Fig. 5(a)]. As a result, the free energy difference ΔGCN, is nearly independent of D, as expected, because C and N have similar SASA. For limited systems with available data in our analysis, ΔGCN is approximately proportional to the chain length [Fig. 5(b)], with an expression of ΔGCN/RT=0.36N+1.58. In all cases, G C is much higher than G N and G U, so the maximally collapsed state cannot exist in equilibrium systems. The properties of G C can only be determined via approximate extrapolation as used here.

Figure 5.

Figure 5

Free energy difference between C and N. (A) The free energy of C, N, and U states for protein CspTm as a function of D. G U and G C are calculated by Eqs. (15, 16) with smFRET dataset F3. G N is calculated as GN=GUΔGNU where ΔGNU is expressed as Eq. (1) whose parameters are determined from equilibrium experiments. (B) ΔGCN of the six proteins correlate with the chain length N. The proteins are the same as Fig. 1. ΔGCN is nearly independent of D, so each protein is presented by a data point. A linear fit is shown as a solid line.

θ‐State and scaling behaviors

The θ‐state is important for both the coil‐globule transition and protein folding.20, 34, 35, 36 Hofmann et al. suggested that the θ‐state is achieved for the denatured proteins at around the aqueous cellular milieu, i.e., D0.20 By contrast, it was suggested from the results of Ziv and Haran that the θ‐state occurs within D=26 M.18 Under the framework of the coil‐globule transition theory, the θ‐state occurs at ε1 since the excess free energy with respect to that of the ideal chain can be expanded up to the linear term of ϕ as (see Fig. S10, Supporting Information):

g(ϕ;ε)12(ε1)ϕ, (5)

which becomes zero at ε=1, resulting in a scaling law of the θ‐state of Rg=ρ0N1/2, as mentioned above. (When higher order terms are taken into account, the θ‐state occurs at a ε value slightly larger than 1.) To simplify the analysis, we use an approximated scaling law of

Rg=ρ0Nν, (6)

to extract the scaling exponent ν from R g with the optimized parameter of ρ 0 = 0.34 nm as explained above. Here ν varies with solvent conditions. Three critical points of ν are highlighted: 3/5 for the expanded coil state,37 1/2 for the θ‐state and 1/3 for the most compact globule state. The determined results for all 31 datasets with GdmCl as denaturant are summarized in Figure 6. ν increases with D in a similar manner for different systems (except a few systems with large net charge at small D). Most datasets end at ν = 0.5 − 0.55, and none can reach the good solvent point of ν = 0.6 at the highest denaturant concentration. Therefore, GdmCl is not sufficiently strong as a denaturant for most proteins even at the highest concentration. The θ‐state, defined as ν = 0.5, occurs at a range of D = 2∼6M for most systems. This is fully consistent with Ziv and Haran,18 while this range is markedly different from the result obtained by Hofmann et al.20 The reason for such a discrepancy lies in the different parameter ρ 0 adopted. Hofmann et al. used an effective ρ 0 value of 0.22 nm, smaller than our optimized value of 0.34 nm as explained above. A direct fit to the data of Ziv and Haran (see Fig. S2, Supporting Information) gives ρ 0 = 0.33 nm. As we have discussed above, with the parameters of Hofmann et al., one cannot reproduce the experimental m‐values (Fig. S4, Supporting Information). Therefore, ρ 0 = 0.34 or 0.33 nm may be more reasonable in interpreting smFRET data.

Figure 6.

Figure 6

Scaling exponents ν as a function of the GdmCl concentration D for 31 protein datasets (A1−G5). ν is calculated from R g via Rg=ρ0Nν with ρ 0 = 0.34 nm, i.e., from each R g datapoint, we obtained a ν as v=ln(Rg/ρ0)/lnN.

To gain more insight about the prefactor ρ 0 of the scaling law, we have calculated R g of the systems A1−G5 at a high GdmCl concentration of D ≈ 6M, and plotted the R g values as a function of N in Figure 7. Data from small‐angle X‐ray scattering (SAXS) studies38 are also plotted for comparison. It is clearly shown that R g values obtained from smFRET are larger than that from SAXS under similar N values [Fig. 7(a)]. As indicated by Kohn et al.,38 the SAXS data can be well described by a scaling law in good solvent conditions, that is, ν0.6 [see Fig. 7(b)]. For smFRET data, on the other hand, we find that a direct fit gives ν0.4. If we fix ν to be the value of the θ‐state (i.e., 0.5), we find that the fitting result [solid line in Fig. 7(c)] can also describe the data quite well. The prefactor ρ 0 obtained in this way, ρ 0 = 0.37 nm, is close to the value we used (0.34). A smaller value of ρ = 0.22 nm [dotted line in Fig. 7(c)] is inconsistent with the data points.

Figure 7.

Figure 7

Scaling properties of R g as a function of N. (a) R g of different proteins measured from SAXS (adopted from Kohn et al.38) and smFRET (datasets A1−G5, D ≈ 6 M, see Table I) experiments. (b) Data from SAXS shown in logarithmic scale and the solid line represents the scaling law fit of Rg=0.193N0.598 nm.38 (c) Data from smFRET and the solid line represent the fitting scaling law of Rg=0.37N0.5 nm by fixing ν=0.5, while the dotted line represents the scaling law of Rg=0.22N0.5 nm.

It is noted that there is long‐standing discrepancy between smFRET and SAXS data as demonstrated in Figure 7(a). The key may lie in the fact that smFRET does not measure R g, not even R ee, but just measures E¯ which is thus translated into R ee and R g with the help of certain theoretical models. Some theoretical approaches adopted in data analysis, for example, the relation of Rg2=Ree2/6, may cause systematic discrepancy. Molecular simulations may be helpful in clarifying the problem, which were not pursued in this study.

Screened charge effect can be incorporated into the coil‐globule transition theory

Most proteins are electriferous. GdmCl is an electrolyte that would eliminate charge effects in the denatured state when D is not too small. However, for urea which is a nonelectrolyte, or for GdmCl with too small a D value to eliminate the charge effects, the electrostatic interactions would cause the denatured protein to expand in size, and such effects should be taken into account in the theoretical description. Here, we adopt an approach previously used by Müller‐Späth et al.,22 where the effect of screened electrostatic interactions is described in terms of the effective excluded volume (see Models and Methods section). It is noted that the approach assumes a random distribution of charges, while the distribution of charges in a protein is fixed by the sequence and might be very inhomogeneous along the chain. Therefore, it should be regarded as a first approximation.

Taking this effect into consideration, we determine the parameter σ in the previous 3 = 0.030 nm3 to be σ = 0.8 nm based on all 31 GdmCl datasets (A1−G5, see Fig. S6, Supporting Information). The results for four example systems are shown in Figure 8. ProTαC and ProTαN possess significant net charges, and their R g increases rapidly at low D [Fig. 8(a,b)]. With the corrected charge effect in the coil‐globule transition theory, a linear εD relation is sufficient to produce results (blue lines) consistent with the experiments. If the charge effect is not incorporated into the theory, a linear εD relation produces results (red lines) that seriously deviate from the experiments at low D. For Protein L with minimal charge, whether to include the charge effect or not does not result in any discernible difference [Fig. 8(c)]. For CspTm with significant opposite charges, the attraction between opposite charges causes R g to decrease at low D [Fig. 8(d)], which is partially captured by the theory that incorporates the charge effect (blue lines).

Figure 8.

Figure 8

Charge effects on R g for four proteins (datasets): (a) ProTαC (C3), (b) ProTαN (A2), (c) Protein L (F2), and (d) CspTm (A4). Results calculated from the experiment smFRET data are shown as circles. The determined ε data points are fitted with a linear εD relation, which are further used to recalculate R g with (blue line) and without (red line) charge effects. With the charge terms, the model fits better with GdmCl dimension data, mainly at very low concentrations.

Linearity versus nonlinearity: lessons from the binding model

The most straightforward explanation for the linear dependence between ε and D is provided by the transfer model where the denaturant changes the solvent environment in a linear‐dependent way with D. However, the binding model can also account for the εD linearity even if it defines a logarithmic dependence on the denaturant activity a. The key for the transition from linear to nonlinear lies in the fact that a of GdmCl depends on D in a highly nonlinear manner:11

log10a=0.5191+1.4839log10D0.2562(log10D)2+0.5884(log10D)3. (7)

Under such a nonlinear aD, the experimental transfer free energies (the solvation free energy difference with denaturant added) δgi of the side chain and backbone group of amino acids39 can be satisfactorily fitted by a linear dependence on D as δgi=miD+bi,14 or by a binding formula on a as δgi=RTniln(1+Kia).20 In a similar spirit, we find that the ε data obtained in Figure 2 can be nicely refitted with a binding model as ε(a)=ε0+Δεln(1+Ka) (Fig. 9). It should be noted that although ε can be fitted with the binding model, the obtained parameter K is quite arbitrary. For example, for Protein L (dataset F2), different K values (0.3, 0.59, 1.0) describe the ε data to nearly the same accuracy (see solid lines with different colors in the panel for F2 in Fig. 9). Such parameter uncertainty is also consistent with the fact that the number of independent parameters in the binding‐model formula [Eq. (2)] is more than those in the linear formula [Eq. (1)] by one. Therefore, the transfer model and the binding model are both effective in interpreting the coil‐globule transition revealed by smFRET data. It is impossible to discriminate these two model based on smFRET results.

Figure 9.

Figure 9

Fitting of ε data with the binding model. The scattered symbols are ε data extracted from smFRET data with the coil‐globule transition theory. Blue solid lines are the fitting to the scatterings with a formula of ε=ε0+Δεln(1+Ka). For Protein L (F2), except for the fitting results with optimized K = 0.59 (blue line), we also draw the results with deviating values of K = 0.3 (red line) and 1.0 (green line) for comparison.

Sometimes, R g data are fitted with a “binding‐model‐like” formula as:17, 20, 22

Rg(a)=Rg,0+ΔRgKa1+Ka (8)

where quotation marks are used to emphasize the fact that a binding model does not necessarily result in an expression of R g as seen in Eq. (8); for example, a binding model to describe ε as described above will not result in Eq. (8) for a coil‐globule transition. The phenomenological applications may be misleading. For example, the fitting K values using Eq. (8) on smFRET data are smaller than 1,17, 22 and inconsistent with any K values fitted in the transfer free energies of amino acids (K ≈ 3 − 7). Actually, Eq. (8) can be used to fit various data with diverse mechanisms. For example, an improper binding model can be tried as:

Rg(D)=Rg,0+ΔRgKD1+KD, (9)

which is not expected to reflect the nature of smFRET data with denaturant GdmCl because a should be used instead of D. The phenomenological fitting with Eq. (9) to R g data works remarkably well (Fig. S11, Supporting Information). In another example, we theoretically calculate R g as a function of ε using the coil‐globule transition theory and found that the resulting data for ε < 1.5 (which covers most data in the smFRET experiments) can also be well fitted with an improper binding model formula (see Fig. S12, Supporting Information). Therefore, the application of Eq. (8) in analyzing smFRET data should be applied cautiously and the meaning of the resulting parameter K is highly suspicious.

It is noted that we use the transfer model and the binding model to account for the linearity of ε on D, but not ΔG. The characteristics of ΔG and R g are explained with the coil‐globule transition theory based on the εD linearity. Therefore, the transfer model and the binding model become compatible under the coil‐globule transition theory in describing the denaturant effects of proteins.

Discussion

In contrast to protein folding that is usually highly cooperative, the coil‐globule transition is noncooperative. This offers significant advantages to smFRET studies, because the properties of the system vary gradually with conditions (such as D) and abundant valuable information (such as ε and G) can be extracted under a wide range of conditions. Therefore, the smFRET technique combined with the coil‐globule transition theory has numerous applications. For example, ε and m can be obtained at different temperatures and at high temperatures when the native state is unstable. The enthalpy and entropy components can then be reliably determined to provide clues about protein folding and the denaturant effect. In analyzing the data under different temperatures, it is noted that parameters such as the Förster radius as R 0 in Eq. (20) may depend on the temperature.40 Another interesting topic is the urea effect. Urea is not an electrolyte and does not separate into ions in solutions. Unlike the case in GdmCl, a depends on D in a roughly linear manner in urea. Thus, urea is a useful system to test the validity of the binding model. At present, smFRET data with urea are limited.22, 24, 26 In addition, urea is not capable of screening electrostatic interactions, which hinders the direct application of our theoretical scheme when the charge effect cannot be ignored. Therefore, the analysis on the urea effect is not pursued in this article. For future smFRET experiments with urea, adding ionic salts such as KCl to screen the charge effect would be helpful.

The interpretation of smFRET data requires the use of theoretical models. The coil‐globule transition theory adopted here is based on a mean field approximation. The validity of the model is supported by the consistent m‐value obtained by various approaches; however, support from other approaches, for example, molecular dynamics (MD) simulations,41, 42, 43 would be helpful. MD simulations are also helpful in clarifying some other questions regarding smFRET results, for example, why the R g measured from smFRET is usually larger than the value obtained from SAXS (Fig. 7).

The smFRET technique is also useful in studying the thermodynamics and energetics of IDPs. IDPs are abundant in all species and possess some advantages in playing essential functions.44, 45, 46 Intriguingly, IDPs do not have ordered native structures, so standard techniques used to characterize conventional proteins may be not applicable for IDPs. SmFRET, with its power in probing the denatured state, becomes an ideal technique in studying IDPs. With the absence of the native state, the application of smFRET in IDPs is even simpler than that in ordered proteins because it is not necessary to remove the signal disturbance from the native state and thus even conventional FRET on the ensemble level is also applicable.

Conclusions

In this work, we have applied a coil‐globule transition theory to smFRET data of the denatured state of a series of proteins to investigate the denaturant effects in proteins. A modified theory with universal parameters for a GdmCl solution was generated to extract interaction and free energy information from smFRET data with no requirement for PDB structural data. By combining the protein stability data, we have determined the free energy difference between the native state and the maximally collapsed denatured state, providing clues on the specific/nonspecific interactions in protein folding. It was demonstrated that most proteins reach the θ‐state within D=26 M, and barely reach the good‐solvent point with a scaling exponent of ν = 0.6. With respect to the mechanism of the denaturant effects, both the transfer model and the binding model are able to account for the observed linear εD relation, and thus are compatible under the coil‐globule transition theory.

Models and Methods

Coil‐globule transition theory

The conformational ensemble of the denatured state of a protein is described by the coil‐globule transition theory by Ziv and Haran,18 which was modified from the mean‐field Sanchez theory of polymers.47 In the spirit of Sanchez's pristine derivation, we also introduce a few modifications to produce a more applicable scheme, as will be explained below.

The denatured protein is regarded as a polymer chain with excluded volume and short‐range attractive interactions among residues (monomers). The probability distribution function of R g is given as:

P(Rg)=P0(Rg)exp(Ng(ϕ;ε)kBT), (10)

where N is the number of residues of the protein. P0(Rg) is the distribution when the excluded volume and attractive interactions are absent, that is, an ideal chain. It is approximated to be the Flory–Fisk empirical distribution:37

P0(Rg)Rg6exp(7Rg22Rg,θ2), (11)

where R g,θ is the root mean‐squared R g of the θ‐state, which depends on the chain length in a scaling law of

Rg,θ=ρ0n1/2, (12)

where ρ 0 is a universal parameter for various proteins. g(ϕ;ε) in Eq. (10) is the excess free energy per monomer of the conformations of R g with respect to those in the ideal chain, and is given as:

g(ϕ;ε)=12ϕε+kBT[1ϕϕln(1ϕ)+1], (13)

where ε is the effective attraction (ε > 0) or repulsion (ε < 0) interactions among residues. ε varies with the denaturant concentration for a protein. ϕ in Eqs. (10) and (13) is the effective volume fraction occupied by the chain, which is defined in terms of R g as:

ϕ=Nbσ3Rg3, (14)

where σ is the average excluded volume of a residue, and b is a unit‐less proportional coefficient between ϕ and Nσ3/Rg3. Nbσ 3 is related to the volume of the native state. In our study, σ and b are assumed to be universal constants for various proteins. It is noted that according to Sanchez's pristine derivation,47 ϕ is proportional to Nσ3/Rg3, but the proportional coefficient b is not necessary equal to 1. Although one can define an effective volume as σ3=bσ3 to remove b and simplify the symbol σ to σ (which is actually adopted in most studies without any explicit statement), we keep the pristine form because σ will also appear in the charge effect as will be discussed below. It is also noted that the relation between R g of the native state and the θ‐state required by Landau's theory of phase transition18, 20 does not apply here because N is not necessarily large and the system is not necessarily at the phase transition state. Therefore, ρ 0 and bσ3 are independent parameters in our model.

Following Ziv and Haran,18 the excess free energy of the denatured state with respect to that of an ideal chain is calculated as:

GU(ε)=Ng(ϕ¯;ε)=12Nεϕ¯+NkBT[1ϕ¯ϕ¯ln(1ϕ¯)+1], (15)

where ϕ¯=ϕ is the average value of ϕ over the distribution at ε. It is noted that the last term (which contributes a constant) is needed to ensure that G=0 when ε=1 and σ0, i.e., the excess free energy of the ideal chain is zero [see also Eq. (5) for linear expansion version]. The maximally collapsed denatured state (C) is introduced as a thermodynamic reference state. By definition, this state has ϕ¯=1 and

GC(ε)=Ng(ϕ=1;ε)=12Nε+NkBT (16)

Therefore, the free energy of collapse is given as:

ΔGCU(ε)=12Nε(1ϕ¯)+NkBT1ϕ¯ϕ¯ln(1ϕ¯). (17)

The free energy of unfolding is related to that of the collapse state as:

ΔGNU(ε)=ΔGNC(ε)+ΔGCU(ε). (18)

Since the solvent accessible surface area (SASA) of C and N is similar, their interaction with solvent is also similar and thus their free energy difference ΔGNC(ε) is approximately independent of the denaturant concentration D and ε, which is a function of D for a protein. Therefore, Eq. (18) is rewritten as:

ΔGNU(ε)=ΔGCU(ε)+ΔGNC0, (19)

where the superscript is used to indicate that ΔGNC0 is a constant independent of ε (D). Eqs. ((15), (16), (17), (18), (19)) can be used to investigate the dependence of protein‐folding free energy on D (except a constant ΔGNC0, which is inessential in determining the m‐value) if ε can be extracted from smFRET experimental data.

In smFRET experiments, the measured average FRET efficiency ( E¯) of the denatured state is ascribed to the following average over the end‐to‐end (donor‐to‐acceptor) distance distribution Pee(Ree), as:

E¯=R06Ree6+R06Pee(Ree)dRee, (20)

where R 0 is the Förster radius of the FRET pair (5.4 nm when AlexaFluor 488 is used as a donor and AlexaFluor 594 as an acceptor chromophore20). To relate E¯ to R g, the relation between Pee(Ree) and P(Rg) should be considered. Different schemes have been proposed for this relationship, e.g., a conditional‐probability approach.18, 22 The simplest approach is to assume Pee(Ree) obeys a Gaussian distribution and requires Rg2=Ree2/6.22 It has been shown that the results from different schemes are similar.22 Therefore, ε can be readily calculated from E¯ at each D using Eqs. ((10)−20) (i.e., via comparing Rg2E¯D and Rg2ε; see also the flow chart in Fig. 1), which can be further used to calculate ΔGCU using Eq. (17). If the protein stability ΔGNU is known from independent conventional methods, ΔGNC can be determined as ΔGNC=ΔGNUΔGCU.

Charge Effects on Coil‐Globule Transition

To account for the long‐range electrostatic interactions between the charges in the chain and the screening of charges by the ionic denaturant GdmCl or ionic salts such as KCl, following Müller‐Späth et al.,22 we have adopted an approach that was originally developed by Higgs and Joanny to extract Flory‐like dimensions for polyampholytes. In this approach, the effect of electrostatic interactions is described in terms of the effective excluded volume as:

(σ*)3=σ3+4πlB(fg)2κ2πlB2(f+g)2κ, (21)

where σ and σ* are the excluded volume without and with electrostatic interactions, respectively. f and g are the probabilities for the occurrence of a positive charge and a negative charge in a monomer, respectively, which are determined from the amino acid sequence. The second term on the right‐hand‐side of Eq. (21) represents repulsive interactions due to the net charge of a protein, which results in an increase of the excluded volume, whereas the third term leads to a reduction in the excluded volume through attractive interactions between opposite charges. l B is the Bjerrum length given in:

lB=e24πε0εrkBT, (22)

where e is the elementary charge, ε 0 is the permittivity of vacuum, and ε r is the dielectric constant. κ is the reciprocal of the Debye length adopted from the Debye–Hückel theory:

κ=8πlBI, (23)

where I is the ionic strength of the solution. By replacing σ in Eq. (14) with σ* defined in Eq. (21), the screening electrostatic interactions can be incorporated into the coil‐globule transition theory above. Unless the concentration of GdmCl and ionic salts is small, σ*σ and the electrostatic interactions can be ignored.

Experimental Data from the Literature

Experimental smFRET data were collected from the literature. Thirty‐seven datasets were measured on 10 different proteins (see Table 1 and Table SI, Supporting Information), including eight foldable proteins (cold shock protein, CspTm; cyclophilin A, hCyp; spectrin domains R15 and R17; Ribonuclease inhibitor protein, Barstar; immunity protein colicin E9; IgG binding domain B1 of protein L, Protein L; Ribonuclease HI, RnaseH) and two more highly charged IDPs (prothymosin α, ProTα; the N‐terminal domain of HIV Integrase, IN). Some references have provided suitable Rg data directly.20, 22, 23 In some other references,19, 21, 24 26 raw E¯ data are available, so we have transformed them into Rg data through Eq. (20) (the Förster radius of the FRET pair R 0 is 5.4 nm here when AlexaFluor 488 is used as a donor and AlexaFluor 594 as an acceptor chromophore20, see also the flow chart in Fig. 1) and a Gaussian distribution with Rg2=Ree2/6. Dye linkers were not taken into account. Most datasets were obtained using GdmCl as denaturant and their numbers are labeled in capital letters in Table 1. Only six out of 37 datasets were obtained using denaturant urea whose number in Table 1 are labeled in lowercase letters instead. In this study, we have focused on the properties under GdmCl because the electrostatic effect in the presence of urea is more complicated.

Supporting information

Supporting Information

Acknowledgment

The authors thank Huaiqing Cao, Haifeng Lang, and Tanlin Sun for helpful discussions.

Short statement: In our manuscript, we adopt a coil‐globule transition theory to develop a general scheme to extract the dimensions and energetics properties of the protein unfolded state from single‐molecule fluorescence resonance energy transfer spectroscopy (smFRET) data. New insights on the specific/nonspecific interactions in protein folding are provided. More importantly, the scaling behaviors and the effective θ‐state are clarified.

References

  • 1. Dill KA, MacCallum JL (2012) The protein‐folding problem, 50 years on. Science 338:1042–1046. [DOI] [PubMed] [Google Scholar]
  • 2. Anfinsen C, Haber E (1961) Studies on reduction and re‐formation of protein disulfide bonds. J Biol Chem 236:1361–1363. [PubMed] [Google Scholar]
  • 3. Bryngelson JD, Onuchic JN, Socci ND, Wolynes PG (1995) Funnels, pathways, and the energy landscape of protein‐folding—a synthesis. Proteins 21:167–195. [DOI] [PubMed] [Google Scholar]
  • 4. Zhang Y, Cao H, Liu Z (2015) Binding cavities and druggability of intrinsically disordered proteins. Protein Sci 24:688–705. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5. Dobson CM (2003) Protein folding and misfolding. Nature 426:884–890. [DOI] [PubMed] [Google Scholar]
  • 6. Dill KA, Shortle D (1991) Denatured states of proteins. Annu Rev Biochem 60:795–825. [DOI] [PubMed] [Google Scholar]
  • 7. Greene RF, Pace CN (1974) Urea and guanidine‐hydrochloride denaturation of ribonuclease, lysozyme, alpha‐chymotrypsin, and beta‐lactoglobulin. J Biol Chem 249:5388–5393. [PubMed] [Google Scholar]
  • 8. England JL, Haran G (2011) Role of solvation effects in protein denaturation: From thermodynamics to single molecules and back. Annu Rev Phys Chem 62:257–277. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Moglich A, Krieger F, Kiefhaber T (2005) Molecular basis for the effect of urea and guanidinium chloride on the dynamics of unfolded polypeptide chains. J Mol Biol 345:153–162. [DOI] [PubMed] [Google Scholar]
  • 10. Chen T, Chan HS (2014) Effects of desolvation barriers and sidechains on local‐nonlocal coupling and chevron behaviors in coarse‐grained models of protein folding. Phys Chem Chem Phys 16:6460–6479. [DOI] [PubMed] [Google Scholar]
  • 11. Aune KC, Tanford C (1969) Thermodynamics of denaturation of lysozyme by guanidine hydrochloride. 2. Dependence on denaturant concentration at 25 degrees. Biochemistry 8:4586–4590. [DOI] [PubMed] [Google Scholar]
  • 12. Tanford C (1964) Isothermal unfolding of globular proteins in aqueous urea solutions. J Am Chem Soc 86:2050–2059. [Google Scholar]
  • 13. Auton M, Holthauzen LMF, Bolen DW (2007) Anatomy of energetic changes accompanying urea‐induced protein denaturation. Proc Natl Acad Sci USA 104:15317–15322. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. O'Brien EP, Ziv G, Haran G, Brooks BR, Thirumalai D (2008) Effects of denaturants and osmolytes on proteins are accurately predicted by the molecular transfer model. Proc Natl Acad Sci USA 105:13403–13408. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Schuler B, Hofmann H (2013) Single‐molecule spectroscopy of protein folding dynamics‐expanding scope and timescales. Curr Opin Struc Biol 23:36–47. [DOI] [PubMed] [Google Scholar]
  • 16. Liu B, Chia D, Csizmok V, Farber P, Forman‐Kay JD, Gradinaru CC (2014) The effect of intrachain electrostatic repulsion on conformational disorder and dynamics of the sic1 protein. J Phys Chem B 118:4088–4097. [DOI] [PubMed] [Google Scholar]
  • 17. Nettels D, Mueller‐Spaeth S, Kuester F, Hofmann H, Haenni D, Rueegger S, et al. (2009) Single‐molecule spectroscopy of the temperature‐induced collapse of unfolded proteins. Proc Natl Acad Sci USA 106:20740–20745. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Ziv G, Haran G (2009) Protein folding, protein collapse, and tanford's transfer model: Lessons from single‐molecule FRET. J Am Chem Soc 131:2942–2947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Hoffmann A, Kane A, Nettels D, Hertzog DE, Baumgaertel P, Lengefeld J, G Reichardt, DA Horsley, R Seckler, O Bakajin, B Schuler (2007) Mapping protein collapse with single‐molecule fluorescence and kinetic synchrotron radiation circular dichroism spectroscopy. Proc Natl Acad Sci USA 104:105–110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Hofmann H, Soranno A, Borgia A, Gast K, Nettels D, Schuler B (2012) Polymer scaling laws of unfolded and intrinsically disordered proteins quantified with single‐molecule spectroscopy. Proc Natl Acad Sci USA 109:16155–16160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Kuzmenkina EV, Heyes CD, Nienhaus GU (2006) Single‐molecule FRET study of denaturant induced unfolding of RNase H. J Mol Biol 357:313–324. [DOI] [PubMed] [Google Scholar]
  • 22. Mueller‐Spaeth S, Soranno A, Hirschfeld V, Hofmann H, Rueegger S, Reymond L, et al. (2010) Charge interactions can dominate the dimensions of intrinsically disordered proteins. Proc Natl Acad Sci USA 107:14609–14614. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Sherman E, Haran G (2006) Coil‐globule transition in the denatured state of a small protein. Proc Natl Acad Sci USA 103:11539–11543. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Hofmann H, Golbik RP, Ott M, Huebner CG, Ulbrich‐Hofmann R (2008) Coulomb forces control the density of the collapsed unfolded state of barstar. J Mol Biol 376:597–605. [DOI] [PubMed] [Google Scholar]
  • 25. Merchant KA, Best RB, Louis JM, Gopich IV, Eaton WA (2007) Characterizing the unfolded states of proteins using single‐molecule FRET spectroscopy and molecular simulations. Proc Natl Acad Sci USA 104:1528–1533. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Tezuka‐Kawakami T, Gell C, Brockwell DJ, Radford SE, Smith DA (2006) Urea‐induced unfolding of the immunity protein IM9 monitored by spFRET. Biophys J 91:L42–L44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Ziv G, Thirumalai D, Haran G (2009) Collapse transition in proteins. Phys Chem Chem Phys 11:83–93. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Chan HS, Bromberg S, Dill KA (1995) Models of cooperativity in protein‐folding. Phil Trans Royal Soc Lond Series B‐Biol Sci 348:61–70. [DOI] [PubMed] [Google Scholar]
  • 29. Nakagawa K, Yamada Y, Matsumura Y, Tsukamoto S, Yamamoto‐Ohtomo M, Ohtomo H, et al. (2014) Relationship between chain collapse and secondary structure formation in a partially folded protein. Biopolymers 101:651–658. [DOI] [PubMed] [Google Scholar]
  • 30. Chan HS, Zhang Z, Wallin S, Liu Z (2011) Cooperativity, local‐nonlocal coupling, and nonnative interactions: Principles of protein folding from coarse‐grained models. Annu Rev Phys Chem 62:301–326. [DOI] [PubMed] [Google Scholar]
  • 31. Cheung MS, Garcia AE, Onuchic JN (2002) Protein folding mediated by solvation: Water expulsion and formation of the hydrophobic core occur after the structural collapse. Proc Natl Acad Sci USA 99:685–690. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Huang Y, Liu Z (2010) Nonnative interactions in coupled folding and binding processes of intrinsically disordered proteins. Plos One 5:e15375 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Zhang Z, Chan HS (2010) Competition between native topology and nonnative interactions in simple and complex folding kinetics of natural and designed proteins. Proc Natl Acad Sci USA 107:2920–2925. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Camacho CJ, Thirumalai D (1993) Kinetics and thermodynamics of folding in model proteins. Proc Natl Acad Sci USA 90:6369–6372. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Crick SL, Jayaraman M, Frieden C, Wetzel R, Pappu RV (2006) Fluorescence correlation spectroscopy shows that monomeric polyglutamine molecules form collapsed structures in aqueous solutions. Proc Natl Acad Sci USA 103:16764–16769. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Badasyan A, Mamasakhlisov YS, Podgornik R, Parsegian VA (2015) Solvent effects in the helix‐coil transition model can explain the unusual biophysics of intrinsically disordered proteins. J Chem Phys 143:014102 [DOI] [PubMed] [Google Scholar]
  • 37. Flory PJ (1949) The configuration of real polymer chains. J Chem Phys 17:303–310. [Google Scholar]
  • 38. Kohn JE, Millett IS, Jacob J, Zagrovic B, Dillon TM, Cingel N, et al. (2004) Random‐coil behavior and the dimensions of chemically unfolded proteins. Proc Natl Acad Sci USA 101:12491–12496. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Pace CN (1986) Determination and analysis of urea and guanidine hydrochloride denaturation curves. Method Enzymol 131:266–280. [DOI] [PubMed] [Google Scholar]
  • 40. Aznauryan M, Nettels D, Holla A, Hofmann H, Schuler B (2013) Single‐molecule spectroscopy of cold denaturation and the temperature‐induced collapse of unfolded proteins. J Am Chem Soc 135:14040–14043. [DOI] [PubMed] [Google Scholar]
  • 41. Jin F, Liu Z (2013) Inherent relationships among different biophysical prediction methods for intrinsically disordered proteins. Biophys J 104:488–495. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Knott M, Best RB (2012) A preformed binding interface in the unbound ensemble of an intrinsically disordered protein: Evidence from molecular simulations. Plos Comput Biol 8:e1002605 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Holzgrafe C, Wallin S (2014) Smooth functional transition along a mutational pathway with an abrupt protein fold switch. Biophys J 107:1217–1225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Liu Z, Huang Y (2014) Advantages of proteins being disordered. Protein Sci 23:539–550. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Wright PE, Dyson HJ (1999) Intrinsically unstructured proteins: Re‐assessing the protein structure‐function paradigm. J Mol Biol 293:321–331. [DOI] [PubMed] [Google Scholar]
  • 46. Uversky VN (2013) A decade and a half of protein intrinsic disorder: Biology still waits for physics. Protein Sci 22:693–724. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47. Sanchez IC (1979) Phase‐transition behavior of the isolated polymer‐chain. Macromolecules 12:980–988. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information


Articles from Protein Science : A Publication of the Protein Society are provided here courtesy of The Protein Society

RESOURCES