Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2021 Feb 1.
Published in final edited form as: Curr Opin Struct Biol. 2019 Dec 2;60:27–38. doi: 10.1016/j.sbi.2019.10.009

Emerging Consensus on the Collapse of Unfolded and Intrinsically Disordered Proteins in Water

Robert B Best 1
PMCID: PMC7472963  NIHMSID: NIHMS1542589  PMID: 31805437

Abstract

Establishing the degree of collapse of unfolded or disordered proteins is a fundamental problem in biophysics, because of its relation to protein folding and to the function of intrinsically disordered proteins. However, until recently, different experiments gave qualitatively different results on collapse and there were large discrepancies between experiments and all-atom simulations. New methodology introduced in the past three years has helped to resolve the differences between experiments, and improvements in simulations have closed the gap between experiment and simulation. These advances have led to an emerging consensus on the collapse of disordered proteins in water.

Keywords: Theta temperature, Self-Avoiding Walk, Gaussian Chain, Bayesian Ensemble Refinement

1. Introduction

Since unfolded or intrinsically disordered proteins lack an ordered structure, they are often well approximated as disordered polymers. In general, polymers will undergo a collapse transition as the interactions with the solvent are varied, as is well known in the case of homopolymers [1]. That is, sufficiently weak interactions with the solvent (poor solvent) will lead to a maximally compact, collapsed chain. Collapse can be quantified in terms of the radius of gyration Rg, which is defined as Rg2=12|rirj|2i,j, where the average runs over all intramolecular atom pairs (i, j) with positions ri, rj. The degree of collapse is usually characterized in terms of a length scaling exponent ν, describing the relation between average Rg and the number of residues in the protein N, as RgNν, or similarly between the average end-end distance R and N, RNν. For example, it is obvious that ν = 1/3 for chains which are maximally compact collapsed globules (such as folded proteins) (Fig. 1 (a),(d)). On the other hand, strong interactions with the solvent (good solvent), or high temperature, will lead to a very expanded chain where ν can be shown to be ~ 3/5 (more precisely, 0.588) for homopolymers, the behaviour expected for purely repulsive chains (self avoiding walks). At an intermediate solvent quality, where the interactions with the solvent perfectly balance the interactions of the chain with itself, ν = 1/2, the same scaling as for a ghost chain (i.e. without self-interactions), a condition known as the Θ-solvent, or Θ-state (Fig. 1(a)). Thus a fundamental property of any disordered protein is its degree of collapse, which varies with solution conditions and temperature. Proteins, of course, are heteropolymers, hence their collapse is more likely to involve interactions of some residues (e.g. hydrophobic ones) than others, [24] and collapse is expected to depend on both the average properties of the sequence (such as hydrophobicity or charge [58]) and on sequence order [9, 10]. Nonetheless, the theory of homopolymer collapse provides a useful starting point for analyzing the properties of unfolded proteins, especially when the sequence is well mixed [11]. The scaling exponent ν is often used to characterize the extent of collapse: for very long chains ν tends towards only three possible limits (1/3, 1/2, 0.588), but as proteins are usually relatively short, ν is more properly referred to as an apparent scaling exponent and can also take on intermediate values (Fig. 1(b)).[12]

Figure 1:

Figure 1:

Background on unfolded protein collapse. (a) Polymer collapse illustrated from simulations of Val-100 with the ABSINTH model[67] (data from Zheng et al.[69]). Variation of radius of gyration Rg, end-end distance R and FRET efficiency E with temperature, all quantities being normalized by their respective values at the Θ temperature, Tθ. Here, increasing temperature is analogous to adding chemical denaturant, as both lessen the importance of intramolecular interactions. The transfer efficiency E was computed with a Förster radius of R0Rθ. (b) Corresponding variation of length scaling exponent ν with temperature. Here ν was approximated from the scaling of internal distances for residues separated by a number of peptide bonds Np, i.e. |rirj|2|ij|=NpNpv. Shaded region respresents collapse not usually observed in unfolded or disordered proteins. Schematic figures illustrating degree of collapse are shown at top. (c) Discrepancy between FRET and SAXS experiments prior to recent methodological advances [33]: Rg for protein L and CspTm from FRET, and for protein L from SAXS as a function of denaturant concentration. Right axis shows equilibrium unfolded population of Protein L (magenta curve). (d) Comparison of Rg of unfolded proteins from simulations with the Rg of their respective folded states. [49] Length scaling laws with power law exponents of 1/3, 1/2 and 3/5 are shown.

Chain collapse is clearly significant for protein folding, since a protein folding into a globular structure implies collapse – but is there a non-specific collapse (i.e. involving primarily non-native interactions, and without formation of specific meta-stable intermediates) which occurs before folding, or is collapse concomitant with folding? [13] And how does the degree of collapse prior to folding depend on sequence, or native structure [1416]? Answers to these questions depend on methods to quantify the dimensions of disordered proteins under different conditions. The degree of collapse is also highly relevant to the function of intrinsically disordered proteins (IDPs). Most obviously, a number of IDPs, or proteins which contain intrinsically disordered regions (IDRs), are associated with the formation of so-called “membraneless organelles” driven by liquid-liquid phase transitions [17]. In the case of homopolymers, the Θ conditions for a single chain are correlated with the critical conditions for phase separation [1], because the underlying interactions driving collapse and phase separation are the same. Indeed it has recently been demonstrated numerically that this holds approximately for proteins also [18]. Therefore the extent of collapse of a monomeric protein is a general indication of its propensity to phase separate [19]. To give a second example, there are a number of IDPs which have been shown to bind other biomolecules in a state which is still largely disordered [2022]. Thus modulation of collapse could influence binding energetics, kinetics and mechanism [22]. Despite the evident importance of quantifying the degree of chain collapse in unfolded proteins, until the last few years, different experiments yielded results that conflicted with each other, as well as with most molecular simulations.

2. Discrepancies amongst experiments and simulations

2.1. Differences amongst experiments

Chemical denaturants such as urea or guanidinium chloride (GdmCl) have long been used in protein folding studies to destabilize proteins, which is generally believed to occur via weak association of denaturant molecules with the unfolded protein [2325]. A landmark collaborative study of a large number of proteins in high concentrations of chemical denaturant established a scaling exponent of ν = 0.598(0.028) [26], within error of the self-avoiding walk scaling exponent of 0.588. This would suggest that all sequences are close to the “good solvent” limit at high denaturant concentrations, presumably because of sufficiently favourable molecular interactions with denaturants. Other studies reached a similar conclusion using other techniques, notably establishing the length scaling via several different labelling pairs in single molecule FRET spectroscopy [8].

However, until recently, there was some controversy over the properties of different protein sequences under more biologically relevant denaturant-free conditions. Many studies had shown a reduction of unfolded state Rg as denaturant was diluted, using single-molecule FRET [8, 11, 2734]. FRET measures the distance between a pair of fluorophores (or dyes) attached to the protein at specific positions, with the efficiency of energy transfer between excited states of the fluorophores being dependent on the distance between them via the Förster relation [35]. From the average distance measured, an average radius of gyration can be inferred. A reduction in Rg with denaturant was also observed by contact quenching experiments[36], while nuclear magnetic resonance (NMR) [37], dynamic light scattering (DLS)[38] and two-focus fluorescence correlation spectroscopy [8] indicated a consistent reduction of the hydrodynamic radius Rh with lowered denaturant concentration. In Fig. 1(c) [33] is shown an example of the unfolded-state collapse inferred from a FRET study of protein L and the cold-shock protein CspTm as guanidinium chloride concentration is reduced. The length scaling exponent in water determined by FRET experiments using multiple labelling sites was approximately 1/2, suggesting that water is close to Θ-solvent conditions for many unfolded proteins (a notable exception being sequences with a high net charge)[8]. It is important to note that this implies a small compaction relative to high concentrations of denaturant, but the Rg is still much larger than for a fully collapsed globule.

The scattering function I(q) measured in small-angle X-ray scattering experiments is related to the distribution of intramolecular pair distances, and hence is a sensitive measure of the Rg. In contrast to the results from FRET and other methods, most studies using small-angle X-ray scattering had concluded that Rg was approximately independent of denaturant concentration, and hence that water was close to the “good solvent” limit [3944], although there were exceptions [37]. For example, the SAXS data for protein L in Fig. 1(c) show a clear contrast with the FRET-derived Rg (the first example where the same protein was studied by both methods, eliminating protein-specific differences as the origin of the disagreement) [29, 33, 39]. This discrepancy regarding the properties of proteins in water firstly raised the technical question of which of the SAXS or FRET results was more accurate. Secondly, the differences regarding the compaction of proteins in water raised questions over the quality of water as a solvent for unfolded proteins, as well as over the mechanism of chemical denaturation: the accepted mechanism of denaturation is that it occurs via weak association of denaturant molecules with the unfolded chain, which should cause a swelling of the chain at higher denaturant concentrations[23].

In spite of the abovementioned issues, SAXS and FRET experiments and some simulations were in at least qualitative agreement in some cases, for example the chain collapse brought about by raising the temperature [45,46], due to a strengthening of the hydrophobic effect, but also the solvation properties of charged and hydrophilic residues [19,47]. This suggested that the apparent differences amongst experiments and simulations could eventually be reconciled.

2.2. Differences between experiments and all-atom simulations

A second and unrelated discrepancy was evident when comparing the properties of atomistic simulations in explicit solvent to experiment: the Rg obtained with most force fields was much below experimental estimates [46, 48] (much more than experimental estimates of Rg differed from each other). In many cases, unfolded states in simulations had Rg barely larger than the folded state (Fig. 1(d)), with a larger amount of secondary structure formation than is inferred from experiment [46, 49, 50]. Thus, it was essentially impossible to make quantitative comparisons between experiment and simulation.

3. Resolving the discrepancy between SAXS and FRET experiments

It has recently been possible to obtain quantitatively consistent results for Rg from both FRET and SAXS experiments [5155], as well as when comparing experiments to unbiased molecular simulations [25,56,57]. What were the key issues in resolving the earlier discrepancies?

3.1. Choice of experimental systems

For a foldable protein sequence, the folded state becomes close to 100% populated at low denaturant concentration, which makes it challenging to study the unfolded state by ensemble methods, because the signal is averaged over both folded and unfolded populations. For example, the unfolded population of protein L is plotted in Fig. 1(c). This limits the range of equilibrium SAXS experiments to high denaturant concentrations. In the case of protein L, time-resolved SAXS was used to estimate Rg at low denaturant before folding occurs [39] (green square in Fig. 1(c)), but such experiments are inherently noisy and require large amounts of material. Single molecule FRET can resolve the unfolded population down to much lower denaturant concentrations and hence study collapse over a wider range of conditions, as seen in Fig. 1(c). Since, as seen in Fig. 1(c), most of the collapse observed by FRET occurs at the lowest denaturant concentrations, it will be missed if the unfolded state cannot be observed under these conditions [51, 53].

A key aspect of recent studies has therefore been the careful choice of proteins which do not fold, or fold only at very low denaturant concentrations, allowing equilibrium SAXS to be used. A second requirement is that the proteins should be highly soluble even in the absence of denaturant, in order that SAXS experiments can be performed without the complications of transient aggregation. The first such study was that by Borgia et al. [51], in which the intrinsically disordered protein ACTR and a destabilized variant of the spectrin R17 domain were studied over a range of urea and guanidinium chloride concentrations by four different techniques, including SAXS and FRET. Both of these proteins were shown by dynamic light scattering to remain highly soluble at low denaturant concentrations. As in earlier studies, the FRET data showed a collapse with denaturant dilution (Fig. 2(d)). An analysis of the SAXS data using Guinier analysis, the simplest method for determining Rg, also showed a contraction with decreasing denaturant concentration. Guinier analysis relates Rg to the slope of a plot of the log-SAXS intensity ln I(q) vs q2, where q is the scattering vector (ln I(q)13Rg2q2) [58]. As evident in Fig. 2(a), there is a clear change of slope with decreasing denaturant concentration, indicating compaction. Thus, SAXS and FRET results were in at least qualitative agreement.

Figure 2:

Figure 2:

Qualitative consistency of SAXS and FRET. (a) Guinier plots for R17 at different concentrations of GdmCl and urea. Note that the primary data and fits are offset on the y-axis for clariy; broken lines show the fitted curves without offset to compare slopes. (b) Guinier fits carried out for R17 in 4.96 M GdmCl with different fitting ranges [0, qmax], where qmaxRg is the number shown in the legend in (c). (c) Rg inferred from Guinier fits in (b), for different qmaxRg. (d) Primary FRET data for unfolded R17 in GdmCl and urea. (e) End-end distance R inferred from R17 FRET in GdmCl, assuming Gaussian chain or SAW polymer models as labelled. (f) Rg inferred from R using fixed conversion factor for Gaussian chain or SAW.

3.2. Identifying shortcomings of standard SAXS analysis methods

Choosing a protein which can be studied down to a low denaturant concentration, however, does not completely resolve all discrepancies. For example, past studies on reduced RNase A, which can be unfolded by reducing the disulfide bonds, showed no change of Rg with denaturant concentration [41], provided the disulfides were fully reduced to avoid artifacts [38, 59]. In this case, Rg could be studied down to zero denaturant concentration without folding. The lack of change in Rg in this case might be partially explained by the fact that the approximation used in Guinier analysis is valid only at small q. This was tested in a systematic Guinier analysis of SAXS data for R17 [51], varying the q range fitted to get Rg. Indeed, it was found that using too large a range of q suppressed variations in Rg (red curves in Fig. 2(b),(c)) with denaturant concentration. This would lead to an underestimate of the change of Rg with denaturant. While using a smaller q range may reduce this systematic error, it results in unacceptably large statistical error (blue curves in Fig. 2(b),(c)), because of the larger experimental error on data points close to q = 0. The smaller range of validity of the Guinier approximation relative to folded proteins is essentially due to the greater asphericity of expanded states relative to globular states [60]. Overall, these results confirm that Guinier analysis is a relatively insensitive method for inferring Rg from unfolded proteins, as has long been known for random coils [61]. We note that the alternative Debye expression for the scattering from a Gaussian chain [62] in principle allows more data to be used in the fit, but it tends to overestimate Rg [60].

3.3. Identifying shortcomings of standard FRET analysis methods

There were also shortcomings in the analysis of FRET, because the observed mean FRET efficiency is an average over a very broad distribution of inter-fluorophore distances P(r), that is E=0E(r)P(r)dr, where E(r) = (1 + (r/R0)6)−1 and R0 is the Förster radius. In order to infer an average distance R=0rP(r)dr from a single FRET measurement 〈E〉, a polymer model distribution for P(r) with a single adjustable parameter was usually used, for example a Gaussian chain or self-avoiding walk (SAW). However, the inferred R was quite sensitive to polymer model chosen [63, 64]. The dependence of the inferred R on polymer model is illustrated in Fig. 2(e) for R17 in GdmCl, and similarly for the derived Rg (Fig. 2(f)) [51]. A further problem with using a single polymer model to interpret all data is that the ratio λ=R2/Rg2 used to convert R to Rg is fixed in each case, which we will see below is not valid.

3.4. New analysis methods: Bayesian ensemble refinement

As indicated above, better methods of analysis were needed for both SAXS and FRET data. A central problem of interpreting both SAXS and FRET data for unfolded proteins is that they are averaged over a broad distribution of structures. Thus, any model for the data needs to account for this distribution, a challenging inference problem, given limited data. An alternative to using simplified analytical models such as a Gaussian chain for FRET or the corresponding Debye model for SAXS is to use an explicit molecular model, such as that which can be generated by molecular simulations. An atomistic simulation model can realistically represent the protein chain and the specific interactions within it, and allows any ensemble-averaged static property to be computed. However, it usually will not exactly reproduce the experimental data due to small inaccuracies in the energy functions used in the simulations. These inaccuracies can be overcome by reweighting the simulation data to match experiment – this must be done with care not to overfit, which can be achieved by including appropriate regularization [51, 52, 65, 66]; in addition it is important that the initial ensemble be sufficiently close to the true ensemble that reweighting is possible [66]. If performed carefully, this procedure should in principle provide the most realistic representation of the disordered ensemble. From the perspective of Bayesian statistics, such reweighting corresponds to using the simulation model as a prior, which is then augmented by the experimental data to give the final model. The procedure is thus referred to as Bayesian Ensemble Refinement [65].

Fig. 3 shows an application of ensemble refinement to reconciling SAXS and FRET data from unfolded R17 [51]. SAXS and FRET measurements were made on R17 in a number of denaturant concentrations in exactly matched buffers; the only difference between the samples apart from protein concentration was that the protein was labelled with fluorophores for FRET, but not SAXS. The two sets of data were used jointly to reweight an ensemble of structures for R17 generated from simulations with the ABSINTH model [67] to obtain a good match with experiment. The good fit of the ensemble to both sets of data suggests that the SAXS and FRET data sets are self-consistent; moreover, ensembles derived from fitting to only SAXS or only FRET data had similar properties to those from a combined fit. The ensembles thus derived can then be used to compute molecular properties and distributions directly. Note the absence in the distributions of distance r and rg (Fig. 3) of any sharp features that would indicate overfitting. The average properties R and Rg show that indeed the protein collapses as denaturant is diluted. Moreover, the Rg inferred from Guinier analysis with a very small q range (green symbols) agrees with that from the ensemble reweighting, but with much larger error bars.

Figure 3:

Figure 3:

Ensemble refinement to interpret SAXS and FRET data. (a) SAXS and FRET data are recorded for the same unfolded protein (here: R17) under identical conditions as a function of denaturation concentration. (b) Unbiased simulations are performed using an implicit solvent model to generate an initial realistic molecular ensemble. (c) Bayesian ensemble refinement seeks to make the minimal perturbation to the simulation ensemble (by reweighting) in order to match the experimental data – ensemble averages after reweighting are overlaid on experimental data in (a) as thin curves. (d) Distance and radius of gyration distributions computed from the reweighted ensemble show an expansion with denaturant concentration. (e) Ensemble average R and Rg from ensemble refinement versus GdmCl concentration (red symbols) are compared with estimates from Gaussian chain (cyan) and SAW (navy) models from FRET and from Guinier analysis using the smallest practical range of q (green symbols).

The comparison between the average distance R and Rg inferred from the ensemble refinement and those inferred from FRET using either a Gaussian chain or SAW model is very revealing (Fig. 3). The ensemble estimates are close to those from the Gaussian chain at low denaturant concentration, but more similar to the SAW at high denaturant concentration. This is presumably because the improvement in solvent quality from water to high denaturant causes the chain to transition from being close to Θ-solvent conditions to closer to the self-avoiding walk limit. As can be seen, this means that using a single polymer model (either Gaussian chain or SAW) to analyze all the data would in general give an overestimate of the change in R with denaturant concentration. This problem is compounded when converting from R to Rg using a fixed conversion factor λ=R2/Rg2, because the relative change in R is larger than that in Rg, hence λ should get larger with increasing denaturant. This decoupling between R and Rg is true even for homopolymers (see Fig. 1(a)), although it may be further enhanced in heteropolymeric sequences [52]. All of these factors conspire to yield overestimates for the change in Rg with denaturant when a single polymer model is applied to FRET data.

3.5. Other new analysis methods

We have seen that modelling the unfolded state using Bayesian ensemble refinement helps to overcome the shortcomings of conventional analyses of SAXS data (by Guinier analysis) and of FRET data (by polymer models), which respectively tend to under- and overestimate changes of Rg with denaturant concentration, contributing to the earlier controversy. A similar algorithm formalized in terms of maximum entropy [68] was applied to SAXS and FRET data for a set of intrinsically disordered proteins [52], also finding that the molecular ensemble could provide a consistent explanation for both experiments.

Another new approach that is closely related to ensemble refinement is the recently introduced “molecular form factor” (MFF) method [53]. In this approach, ensembles of structures are pre-generated by simulations with different solvent quality or temperature, which are each characterized by their scaling exponent ν. The form factor I(q; ν) is calculated as an average over the ensemble at each value of ν, and converted to a dimensionless form M(qRg; ν) = I(qRg)/I0. Then, any new experimental data set I(q) is fitted with I0M(qRg; ν) via I0, Rg for each value of ν using the precalculated M(qRg; ν), generating a set of coupled (ν, Rg) pairs. The optimal (ν, Rg) pair is that which minimizes the deviation from the experimental data as measured by the χ2 function. The MFF method allows more of the experimental scattering data to be used for fitting, overcoming the limited range of validity of the Guinier approximation. When applied to a number of destabilized proteins, it revealed a collapse at lower denaturant concentrations – including for cases where it could not be detected before, such as RNase A[41, 53]. As mentioned above, the MFF approach has a lot in common with ensemble refinement, the main difference being that the ensembles are precomputed. This offers a great advantage in efficiency and easy of use over ensemble refinement. However, if the protein under consideration is not well described as a random coil (if it includes sufficient residual structure, or at the extreme, a folded domain), the MFF approach is not applicable. The ensemble refinement method is thus more general, if more demanding to use.

A couple of other methods have also recently been developed which, similar in spirit to the MFF approach, can easily be applied to extract accurate Rg and ν without the need to run simulations or perform ensemble refinement. The first is the “extended Guinier” model, which adds an additional term to the standard Guinier model and can therefore be fitted analytically to a broader range of q[60]. Like the MFF, it yields coupled estimates for both Rg and ν. The second is the SAW-ν model for interpreting FRET experiments in terms of a distance distribution P(r) [69]. It uses a ν-dependent P(r; ν) based on the P(r) expression for a self-avoiding walk, thus effectively interpolating between different solvent conditions. For example, this would avoid the overestimation of changes in R and Rg encountered when using a single polymer model to try and fit FRET data at all denaturant concentrations.

3.6. Potential influence of FRET fluorophores on collapse

An alternative explanation which has been advanced to explain some of the discrepancy between FRET and SAXS results is the hydrophobicity of the fluorophores [43, 53, 70], which might then cause additional collapse in FRET versus SAXS. It is worth noting that while the fluorophores are large and have extended conjugated aromatic surfaces, they also usually carry a net charge (e.g. −2 in the case of the commonly used AlexaFlour 488 and AlexaFlour 594). The potential influence of the fluorophores has been investigated in several studies. As mentioned above, consistent results could be obtained from ensemble refinement for Rg with a standard AlexaFlour 488/594 dye pair, whether the refinement was done using FRET, SAXS without dyes, or a combination of FRET and SAXS. However, for an alternative more hydrophobic dye pair, the Rg from an independent refinement using just the FRET data was significantly smaller than estimated from SAXS[51], indicating that the fluorophores did perturb the protein. Thus, selection of suitable fluorophores for FRET studies is critical in order to obtain reliable results, but the results with a well chosen set of fluorophores were completely consistent with SAXS [51].

In a second tour de force study of dye effects, SAXS experiments were done on samples labelled with FRET fluorophores as well as samples without labels, for a series of intrinsically disordered proteins. [52] This is challenging because of the large amount of labelled material that must be prepared, but allows the effect of the labels to be addressed directly. The comparison of labelled and unlabelled proteins showed that the addition of the fluorophores sometimes caused a small change in Rg outside the error bars, however the shift was as often an increase in Rg as a decrease, indicating that the effect is not systematic. [52, 54, 71, 72] More recently, an N-terminal disordered fragment of pertactin (the full length protein is folded in water) was studied by SAXS without labels, or with one or two AlexaFlour 488 dyes attached to it. [70] The results indicated a statistically insignificant change in Rg for addition of a single label, relative to unlabelled protein. When two labels were attached, there was a significant additional reduction in the radius of gyration in water relative to the unlabelled protein, but with large errors arising from the noisy SAXS data for the labelled samples.

In summary, the fluorophores used to probe distance changes in FRET experiments can of course slightly perturb the protein properties and should be chosen with care. Ideally, results from multiple dye pairs should be compared [21, 51]. However, from the accumulated evidence so far, they do not systematically favor collapse.

4. Resolving the discrepancy between experiments and all-atom simulations

Some simple changes to simulation force fields in recent years have helped to correct the excessive collapse of disordered proteins in water seen earlier. These modifications included a global scaling of protein water interactions [73, 74], and the introduction of a new water model with stronger dispersion interactions [48, 75], and have been recently reviewed.[50] Evidence of the progress which has been made is the impressive agreement with SAXS data obtained with unbiased simulations [56, 57]. From the perspective of simulating protein collapse in denaturant, an accurate model for the denaturant is also required. Older models were able to capture the relative effects of denaturant on the solubility of different amino acids, but the denaturants in most force fields bound the protein too strongly [76]. A careful reparameterization of the denaturant model to capture experimental solubility data for model peptides[77] allowed for predictive accuracy in all-atom simulations of an IDP [25]. These simulations were helpful in discounting several alternative explanations for earlier SAXS/FRET discrepancies, such as preferential partitioning of the denaturant molecules toward the center of the chain, or systematic differences between labelled and unlabelled molecules[25].

5. Emerging consensus on properties of unfolded proteins and IDPs in water

The results coming from carefully chosen experimental systems as well as novel analysis methods have shown that FRET and SAXS experiments yield a generally consistent picture of the dimensions of unfolded proteins at different denaturant concentrations. In order to make quantitative comparisons, however, it is essential that all data be analyzed with state of the art methods. This means, in the case of FRET, using either ensemble refinement [51, 52] or the simple SAW-ν model[69] to infer changes of R and Rg with denaturant. For SAXS, it means using ensemble refinement [51, 52], the molecular form factor method [53], or the extended Guinier method [60]. Comparing results using new analysis methods with those obtained with old methods [71] is misleading, considering all the drawbacks of older methods pointed out above.

For this reason, a recent study reanalyzed FRET and SAXS data from earlier papers (as well as newer ones) using the same set of state of the art, consistent analysis procedures [54]. To be able to compare data for multiple proteins of different sequence length and composition, only the derived scaling exponents ν are plotted in Fig. 4. These show firstly that the most recent methods such as the MFF approach and Bayesian ensemble refinement give very consistent results when applied to the same data sets (e.g. compare results for PNt in Fig. 4(a)), thus analysis methods are no longer a source of discrepancies. Secondly, all proteins considered have a scaling exponent ~ 3/5 at high denaturant concentration. Thirdly, as denaturant concentration is reduced to zero, all proteins shown undergo a compaction and corresponding reduction in ν to a value in the range 0.45–0.55, close to Θ-solvent conditions (highly charged proteins still remain close to the SAW limit in water [8]); a majority of the proteins have ν slightly greater than 1/2, but still clearly below the value estimated at high denaturant. Like ν, the reduction in radius of gyration between high and low denaturant is sequence-specific, but generally in the range of 10–20 %. Notably, however, none of the proteins considered even come close to collapsing to a globular state (with ν ≈ 1/3) in water.

Figure 4:

Figure 4:

Consistent analysis of multiple SAXS, FRET data sets. (a) Dependence of scaling exponent ν on denaturant concentration from SAXS data from Riback et al. [53] (red, blue symbols), and FRET data from Hofmann et al. [8] (grey symbols) and Borgia et al. [51] (green symbols). Analysis methods – Bayesian refinement or MFF – are given in legend in each case. (b) Dependence of scaling exponent on mean Kyte-Doolittle hydrophobicity for proteins studied by Borgia et al. [51], Fuertes et al. [52], Hofmann et al. [8] and Riback et al. [53]. Experimental methods and analysis methods are indicated in the legend.

Further evidence for the consistency between SAXS and FRET experiments comes from a recent study on the unfolded state of NTL9[55]. Instead of using single molecule experiments to separate folded and unfolded states (requiring large fluorophores), time resolved experiments were performed using a continuous-flow microfluidic mixing device to capture the unfolded state before folding occurs. This allowed the use of small fluorophores, p-cyanophenylalanine and tryptophan as FRET labels, which ought to be even less perturbing to the unfolded state than single molecule probes. The results confirmed the picture that FRET and SAXS reflect a compatible reduction of the Rg in water, with the statistical properties of the chain in water being close to Θ-solvent conditions. Interestingly, in this case the collapse in water was also accompanied by adoption of some specific residual structure.

6. Implications for Protein Folding and Intrinsically Disordered Proteins

What are the implications of these findings for protein folding? It appears that for the relatively small, mostly two-state folding proteins considered here, there is a modest reduction in Rg from high to low denaturant. However, nanosecond fluorescence correlation spectroscopy has shown reconfiguration times in the unfolded state are on the order of 100 ns [78, 79], and so the relaxation time after dilution from high to low denaturant is expected to be on a similar time scale, i.e. much faster than any mixing method for reducing denaturant concentration, and also much faster than even the fastest folding proteins, which fold in ~ 1μs [80] (typical proteins are much slower). This initial collapse is a rapid re-equilibration of the unfolded protein once it is transferred to folding conditions (low denaturant), but is unlikely related to folding mechanism. In addition, the absence of any fully collapsed population other than the folded state suggests that, for these proteins, collapse to globule-like dimensions only occurs during folding transitions. There are of course numerous examples from the literature of significant collapse occurring prior to folding, but these mostly involve larger proteins which populate folding intermediates, and collapse tends to be associated with formation of stable structure in these intermediates [81]. A second implication for protein folding is that the collapse as denaturant is diluted supports a denaturation model in which denaturant preferentially associates with unfolded states, as elegantly illustrated by Ziv and Haran[32].

Although the degree of collapse of intrinsically disordered proteins varies with their sequence, the fact that many IDP sequences are close to Θ-solvent conditions in water may be significant for their function. Being near the midpoint of the collapse transition, is exactly where the chain properties would be most susceptible to post-translational modifications such as phosphorylation, or to changes in environmental conditions [22]. Furthermore the correlation between the Θ-conditions for a single chain and the critical conditions for phase separation [18] suggests that estimates of the scaling exponent from FRET or SAXS measurements, now an integral part of new analysis methods, should help to predict which disordered proteins are prone to phase separation. Being close to Θ-solvent conditions may also have advantages for protein folding. On the one hand, being closer to the collapse transition will require a smaller energy to collapse in order to fold. On the other hand, not being too far below the collapse transition helps to avoid non-native collapse and trapping before folding, which may slow folding rates [82].

7. Conclusions

Through careful experimental work and analysis, it has been possible to resolve apparent contradictions between the results of different experimental measurements of Rg for unfolded and intrinsically disordered proteins. This resolution rested on a three advances: (i) Careful choice of proteins, using either intrinsically disordered proteins or destabilized variants of folded proteins that can be studied down to very low denaturant concentration without populating the folded state. (ii) Development of better methods than the Guinier approximation for analyzing SAXS data of unfolded proteins, such as Bayesian ensemble refinement, the MFF method or the extended Guinier approach. (iii) Development of improved methods for interpreting FRET measurements, such as Bayesian ensemble refinement, or the SAW-ν method. These allow the form of the distance distribution to vary with solvent conditions, and also explicitly account for the “decoupling” between R and Ra that occurs as solvent conditions are varied (such that R usually changes by more than Rg). These developments were all essential to the resolution of the earlier controversy.

Although studying protein collapse as a function of denaturant concentration may seem a fairly abstract problem, the efforts to resolve it have yielded a rich toolbox of new methods that will allow SAXS and FRET measurements (as well as related methods) to be more quantitatively applied to studying protein folding and the properties and function of intrinsically disordered proteins.

Acknowledgements.

I thank Wenwei Zheng, Ben Schuler, Alessandro Borgia and Rohit Pappu for helpful comments on the manuscript. This work was supported by the Intramural Research Program of the National Institute of Diabetes and Digestive and Kidney Diseases.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Declaration of Interest. The author declares no conflict of interest.

References

  • [1].Rubinstein M, Colby RH. Polymer Physics. Oxford University Press, Oxford, 2003. [Google Scholar]
  • [2].Alonso DOV, Dill KA. Solvent denaturation and stabilization of globular proteins. Biochemistry 1991, 30: 5974–5985. [DOI] [PubMed] [Google Scholar]
  • [3].Zerze Gül H and Best Robert B and Mittal Jeetain. Sequence and temperature-dependent properties of unfolded and disordered proteins from atomistic simulations. J Phys Chem B 2015,119: 14622–14630. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [4].Holehouse AS, Pappu RV. Collapse transitions of proteins and the interplay among backbone, sidechain and solvent interactions. Ann Rev Biophys 2018, 47: 19–39. [DOI] [PMC free article] [PubMed] [Google Scholar]; ** Recent review of collapse in unfolded and disordered proteins, broken down into contributions from the backbone and different types of side-chain.
  • [5].Uversky VN, Gillespie JR, Fink AL. Why are “natively unfolded” proteins unstructured under physiologic conditions? Proteins 2000, 41: 415–427. [DOI] [PubMed] [Google Scholar]
  • [6].Mao AH, Crick SL, Vitalis A, Chicoine C, Pappu RV. Net charge per residue modulates conformational ensembles of intriniscally disordered proteins. Proc Natl Acad Sci U S A 2010, 107: 8183–8188. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [7].Müller-Späth S, Sorrano A, Hirschfeld V, Hofmann H, Rüegger S, Reymond L, Nettels D, Schuler B. Charge interactions can dominate the dimensions of intrinsically disordered proteins. Proc Natl Acad Sci USA 2010, 107: 14609–14614. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [8].Hofmann H, Soranno A, Borgia A, Gast K, Nettels D, Schuler B. Polymer scaling laws of unfolded and intrinsically disordered proteins quantified with single-molecule spectroscopy. Proc Natl Acad Sci USA 2012, 109: 16155–16160. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [9].Das RK, Pappu RV. Conformations of intrinsically disordered proteins are influenced by linear sequence distributions of oppositely charged residues. Proc Natl Acad Sei U S A 2013,110: 13392–13397. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [10].Sawle L, Ghosh K. A theoretical method to compute sequence-dependent configurational properties in charged polymers and proteins. J Chem Phys 2015, 143: 085101. [DOI] [PubMed] [Google Scholar]
  • [11].Ziv G, Thirumalai D, Haran G. Collapse transition in proteins. Phys Chem Chem Phys 2009, 11: 83–93. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [12].Rapaport DC. On three-dimensional self-avoiding walks. J Phys A: Math Gen 1985, 18: 113–126. [Google Scholar]
  • [13].Chahine J, Nymeyer H, Leite VBP, Socci ND, Onuchic JN. Specific and nonspecific collapse in protein folding funnels. Phys Rev Lett 2002, 88: 168101. [DOI] [PubMed] [Google Scholar]
  • [14].Samanta HS, Zhuravlev PI, Hinczewski M, Hori N, Chakrabarti S, Thirumalai D. Protein collapse is encoded in the folded state architecture. Soft Matter 2017, 13: 3622–3638. [DOI] [PubMed] [Google Scholar]
  • [15].Kluber A, Burt TA, Clementi C. Size and topology modulate the effects of frustration in protein folding. Proc Natl Acad Sci USA 2018, 115: 92349239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [16].Thirumalai D, Samanta HS, Maity H, Reddy G. Universal nature of collapsibility in the context of protein folding and evolution. Trends Biochem Sci 2019, 44: 675–687. [DOI] [PubMed] [Google Scholar]; ** Recent summary of theory of collapse transitions in unfolded proteins, emphasizing the connection to folding.
  • [17].Alberti S, Gladfelter A, Mittag T. Considerations and challenges in studying liquid-liquid phase separation and biomolecular condensates. Cell 2019, 176: 419–434. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [18].Dignon G, Zheng W, Best RB, Kim YC, Mittal J. Relation between single-molecule properties and phase behaviour of intrinsically disordered proteins. Proc Natl Acad Sci U S A 2018, 115: 9929–9934. [DOI] [PMC free article] [PubMed] [Google Scholar]; ** Demonstration of correlation between critical conditions for phase separation and Θ-conditions for single chain for protein sequences.
  • [19].Dignon G, Zheng W, Kim YC, Mittal J. Temperature-controlled liquid-liquid phase separation of disordered proteins. ACS Cent Sci 2019, 5: 821830. [DOI] [PMC free article] [PubMed] [Google Scholar]; ** Introduction of temperature-dependent potential in coarse-grained model for LLPS captures both upper and lower critical solution temperatures.
  • [20].Mittag T, Orlicky S, Choy WY, Tang X, Lin H, Sicheri F, Kay LE, Tyers M, Forman-Kay JD. Dynamic equilibrium engagement of a polyvalent ligand with a single-site receptor. Proc Natl Acad Sci USA 2008, 105: 17772–17777. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [21].Borgia A, Borgia MB, Bugge K, Kissling VM, Heidarsson PO, Fernandes CB, Sottini A, Buholzer KJ, Nettels D, Kragelund BB, et al. Extreme disorder in an ultra-high-affinity protein complex. Nature 2018, 555: 61–66. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [22].Vancraenenbroeck R, Harel YS, Zheng W, Hofmann H. Polymer effects modulate binding affinities in disordered proteins. Proc Natl Acad Sci U S A 2019, 116: 19506–19512. [DOI] [PMC free article] [PubMed] [Google Scholar]; ** Demonstration of effect of solution conditions on binding kinetics for charged intrinsically disordered proteins.
  • [23].Nozaki Y, Tanford C. The solubility of amino acids and related compounds in aqueous urea solutions. J Biol Chem 1963, 238: 4074–4081. [PubMed] [Google Scholar]
  • [24].Schellman JA. Solvent denaturation. Biopolymers 1978,17: 1305–1322. [DOI] [PubMed] [Google Scholar]
  • [25].Zheng W, Borgia A, Buholzer K, Grishaev A, Schuler B, Best RB. Probing the action of chemical denaturant on an intrinsically disordered protein by simulation and experiment. J Am Chem Soc 2016, 138: 11702–11713. [DOI] [PMC free article] [PubMed] [Google Scholar]; ** All-atom simulations of an IDP with a denaturant force field tuned to match experimental solubility data reproduce collapse seen in experiment.
  • [26].Kohn JE, Millett IS, Jacob J, Zagrovic B, Dillon TM, Cingel N, Dothager RS, Seifert S, Thiyagarajan P, Sosnick TR, et al. Random-coil behavior and the dimensions of chemically unfolded proteins. Proc Natl Acad Sci USA 2004, 101:12491–12496. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [27].Deniz AA, Laurence TA, Beligere GS, Dahan M, Martin AB, Chemla DS, Dawson PE, Shultz PG, Weiss S. Single-molecule protein folding: diffusion fluorescence energy transfer studies of the denaturation of chymotrypsin inhibitor 2. Proc Natl Acad Sci USA 2000, 97: 5179–5184. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [28].Schuler B, Lipman EA, Eaton WA. Probing the free-energy surface for protein folding with single-molecule fluorescence spectroscopy. Nature 2002, 419: 743–747. [DOI] [PubMed] [Google Scholar]
  • [29].Sherman E, Haran G. Coil-globule transition in the denatured state of a small protein. Proc Natl Acad Sci USA 2006, 103: 11539–11543. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [30].Möglich A, Joder K, Kiefhaber T. End-to-end distance distributions and intrachain diffusion constants in unfolded polypeptide chains indicate intramolecular hydrogen bond formation. Proc Natl Acad Sci USA 2006, 103: 12394–12399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [31].Hofmann A, Kane A, Nettels D, Hertzog DE, Baumgartel P, Lengefeld J, Reichardt G, Horsley DA, Seckler R, Bakajin O, et al. Mapping protein collapse with single-molecule fluorescence and kinetic synchrotron radiation circular dichroism spectroscopy. Proc Natl Acad Sci USA 2007, 104:105–110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [32].Ziv G, Haran G. Protein folding, protein collapse, and Tanford’s transfer model: lessons from single-molecule FRET. J Am Chem Soc 2009, 131: 2942–2947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [33].Merchant KA, Best RB, Louis JM, Gopich IV, Eaton WA. Characterizing the unfolded states of proteins using single molecule FRET spectroscopy and molecular simulations. Proc Natl Acad Sci USA 2007, 104: 1528–1533. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [34].Soranno A, Longhi R, Bellini T, Buscaglia M. Kinetics of contact formation and end-to-end distance distributions of swollen disordered peptides. Biophys J 2009, 96: 1515–1528. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [35].Förster T. Zwischenmolekulare energiewanderung und fluoreszenz. Annal Phys 1948, 2:55–75. [Google Scholar]
  • [36].Buscaglia M, Lapidus LJ, Eaton WA, Hofrichter J. Effects of denaturants on the dynamics of loop formation in polypeptides. Biophys J 2006, 91: 276–288. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [37].Choy WY, Mulder FAA, Crowhurst KA, Muhandiram DR, Millet IS, Doniach S, Forman-Kay JD, Kay LE. Distribution of molecular size within an unfolded state ensemble using small-angle X-ray scattering and pulse field gradient NMR techniques. J Mol Biol 2002, 316: 101–112. [DOI] [PubMed] [Google Scholar]
  • [38].Nöppert A, Gast K, Muller-Frohne M, Zirwer D, Damaschun G. Reduced-denatured ribonuclease a is not in a compact state. FEBS Lett 1996, 380:179–182. [DOI] [PubMed] [Google Scholar]
  • [39].Plaxco KW, Millett IS, Segel DJ, Doniach S, Baker D. Chain collapse can occur concomitantly with the rate-limiting step in protein folding. Nature Struct Biol 1999, 6: 554–556. [DOI] [PubMed] [Google Scholar]
  • [40].Jacob J, Krantz B, Dothager RS, Thiyagarajan P, Sosnick TR. Early collapse is not an obligatory step in protein folding. J Mol Biol 2004, 338: 369–382. [DOI] [PubMed] [Google Scholar]
  • [41].Jacob J, Dothager RS, Thiyagarajan P, Sosnick TR. Fully reduced ribonuclease A does not expand at high denaturant concentration or temperature. J Mol Biol 2007, 367: 609–615. [DOI] [PubMed] [Google Scholar]
  • [42].Johansen D, Trewhella J, Goldenberg DP. Fractal dimension of an intrinsically disordered protein: small-angle X-ray scattering and computational study of the bacteriophage λ n protein. Protein Sci 2011, 20: 1955–1970. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [43].Yoo TY, Meisburger SP, Hinshaw J, Pollack L, Haran G, Sosnick TR, Plaxco K. Small-angle X-ray scattering and single-molecule FRET spectroscopy produce highly divergent views of the low-denaturant unfolded state. J Mol Biol 2012, 418: 226–236. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [44].Watkins HM, Simon AJ, Sosnick TR, Lipman EA, Hjelm RP, Plaxco KW. Random coil negative control reproduces the discrepancy between scattering and fret measurements of denatured protein dimensions. Proc Natl Acad Sci USA 2015, 112: 6631–6636. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [45].Kjaergaard M, Norholm AB, Hendus-Altenburger R, Pedersen SF, Poulsen FM, Kragelund BB. Temperature-dependent structural changes in intrinsically-disordered proteins: formation of α-helices or loss of polyproline II? Protein Sci 2010, 19: 1555–1564. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [46].Nettels D, Muller-Spath S, Küster F, Hofmann H, Haenni D, Rüegger S, Reymond L, Hoffmann A, Kubelka J, Heinz B, et al. Single-molecule spectroscopy of the temperature-induced collapse of unfolded proteins. Proc Natl Acad Sci USA 2009, 106: 20740–20745. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [47].Wuttke R, Hofmann H, Nettels D, Borgia MB, Mittal J, Best RB, Schuler B. Temperature-dependent solvation modulates the dimensions of disordered proteins. Proc Natl Acad Sci U S A 2014, 111: 5213–5218. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [48].Piana S, Donchev AG, Robustelli P, Shaw DE. Water dispersion interactions strongly influence simulated structural properties of disordered protein states. J Phys Chem B 2015,119: 5113–5123. [DOI] [PubMed] [Google Scholar]
  • [49].Piana S, Klepeis JL, Shaw DE. Assessing the accuracy of physical models used in protein folding simulations: quantitative evidence from long molecular dynamics simulations. Curr Opin Struct Biol 2014, 24: 98–105. [DOI] [PubMed] [Google Scholar]
  • [50].Best RB. Computational and theoretical advances in studies of intrinsically disordered proteins. Curr Opin Struct Biol 2017, 42: 147–154. [DOI] [PubMed] [Google Scholar]; ** Review of simulations on IDPs including discussing of recent force field improvements to avoid excessively collapsed unfolded states.
  • [51].Borgia A, Zheng W, Buholzer K, Borgia MB, Schüler A, Hofmann H, Soranno A, Nettels D, Gast K, Grishaev A, et al. Consistent view of polypeptide chain expansion in chemical denaturants from multiple experimental methods. J Am Chem Soc 2016, 138: 11714–11726. [DOI] [PMC free article] [PubMed] [Google Scholar]; ** First study to show that SAXS and FRET data for the same protein are consistent. Protein collapse was studied using two proteins, two different denaturants and four different experimental methods. All data, including those from SAXS and FRET, can be consistently explained by the same ensemble of structures, and both proteins collapse as denaturant concentration is lowered.
  • [52].Fuertes G, Banterle N, Ruff KM, Chowdhury A, Mercadante D, Koehler C, Kachala M, Girona GE, Milles S, Mishra A, et al. Decoupling of size and shape fluctuations in heteropolymeric sequences reconciles discrepancies in saxs vs. fret measurements. Proc Natl Acad Sci USA 2017, 114: E6342–E6351. [DOI] [PMC free article] [PubMed] [Google Scholar]; ** First study to compare FRET with SAXS of proteins labelled with fluorophores. Both FRET and SAXS show collapse at low denaturant concentrations and the results are broadly consistent.
  • [53].Riback JA, Bowman MA, Zmyslowski AM, Knoverek CR, Jumper JM, Hinshaw JR, Kaye EB, Freed KF, Clark PL, Sosnick TR. Innovative scattering analysis shows that hydrophobic proteins are expanded in water. Science 2017, 358: 238–241. [DOI] [PMC free article] [PubMed] [Google Scholar]; ** Introduction of a new “molecular form factor” method for analyzing SAXS data shows collapse consistent with FRET experiments, including for proteins where no collapse was previously observed.
  • [54].Best RB, Zheng W, Borgia A, Buholzer K, Borgia MB, Hofmann H, Soranno A, Nettels D, Gast K, Grishaev A, et al. Comment on “Innovative scattering analysis shows that hydrophobic disordered proteins are expanded in water”. Science 2018, 361: eaar7101. [DOI] [PMC free article] [PubMed] [Google Scholar]; ** New methods for analyzing both SAXS and FRET data were applied to both older and newer experimental data sets to permit a fair comparison. When this was done, a similar degree of collapse was inferred from both SAXS and FRET experiments.
  • [55].Peran I, Holehouse AS, Carrico IS, Pappu RV, Bilsel O, Raleigh DP. Unfolded states under folding conditions accommodate sequence-specific conformational preferences with random coil-like dimensions. Proc Natl Acad Sci USA 2019, 116: 12301–12310. [DOI] [PMC free article] [PubMed] [Google Scholar]; ** Recent study in which time-resolved measurements with microfluidic mixing were used to study unfolded state of NTL9 by SAXS and FRET in low denaturant concentration. The two experiments were consistent indicated a collapse to near Θ-solvent conditions, with formation of some residual structure.
  • [56].Henriques J, Cragnell C, Skepö M. Molecular dynamics simulations of intrinsically disordered proteins: force field evaluation and comparison with experiment. J Chem Theor Comput 2015, 11: 3420–3431. [DOI] [PubMed] [Google Scholar]
  • [57].Shrestha UR, Juneja P, Zhang Q, Gurumoorthy V, Borreguero JM, Urban V, Cheng X, Pingali SV, Smith JC, O’Neill HM, et al. Generation of the configurational ensemble of an intrinsically disordered protein from unbiased molecular dynamics simulations. Proc Natl Acad Sci USA 2019, 116: 20446–20452. [DOI] [PMC free article] [PubMed] [Google Scholar]; ** Unbiased simulations with enhanced sampling and state of the art force fields now quantitatively reproduce SAXS and SANS data for an IDP and yield a length scaling exponent close to 1/2 in water.
  • [58].Guinier A. La diffraction des rayons x aux très petits angles: application à l’étude de phénomènes ultramicroscopiques. Ann Phys 1939, 11: 161–237. [Google Scholar]
  • [59].Sosnick TR, Trewhella J. Denatured states of ribonuclease A have compact dimensions and residual secondary structure. Biochemistry 1992, 31: 8329–8335. [DOI] [PubMed] [Google Scholar]
  • [60].Zheng W, Best RB. An extended Guinier analysis for intrinsically disordered proteins. J Mol Biol 2018, 430: 2540–2553. [DOI] [PMC free article] [PubMed] [Google Scholar]; ** An empirical additional term is added to the Guinier equation, allowing a wider range of SAXS data to be fitted than in Guinier analysis, and estimates for both Rg and ν to be made.
  • [61].Calmettes P, Durand D, Smith J, Desmadril M, Minard P, Douillard R. Structure of proteins unfolded by guinidinium chloride. J de Phys IV 1993, 3: 253–256. [Google Scholar]
  • [62].Debye P. Molecular-weight determination by light scattering. J Phys Chem 1947, 51: 18–32. [DOI] [PubMed] [Google Scholar]
  • [63].O’Brien EP, Morrison G, Brooks BR, Thirumalai D. How accurate are polymer models in the analysis of Förster resonance energy transfer experiments on proteins? J Chem Phys 2012, 130: 124903. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [64].Song J, Gomes GN, Gradinaru CC, Chan HS. An adequate account of excluded volume is necessary to infer compactness and asphericity of disordered proteins by Förster resonance energy transfer. J Phys Chem B 2015, 119:15191–15202. [DOI] [PubMed] [Google Scholar]
  • [65].Köfinger J, Hummer G. Bayesian ensemble refinement by replica simulations and reweighting. J Chem Phys 2015, 143: 243150. [DOI] [PubMed] [Google Scholar]
  • [66].Holmstrom ED, Holla A, Zheng W, Nettels D, Best RB, Schuler B. Accurate transfer efficiencies, distance distributions, and ensembles of unfolded and intrinsically disordered proteins from single-molecule fret. Meth Enzym 2018, 611: 287–325. [DOI] [PMC free article] [PubMed] [Google Scholar]; ** Recent review explaining the issues in interpreting experimental data averaged over a broad ensemble of structures, and how simulations can be used to interpret the data – either via Bayesian ensemble refinement, or by adjusting parameters in simplified coarse-grained models.
  • [67].Vitalis A, Pappu RV. ABSINTH: A new continuum solvent model for simulations of polypeptides in aqueous solutions. J Comput Chem 2008, 30: 673–699. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [68].Leung HTA, Bignucolo O, Aregger R, Dames SA, Mazur A, Berneche S, Grze-siek S. A rigorous and efficient method to reweight very large conformational ensembles using average experimental data and to determine their relative information content. J Chem Theor Comput 2016, 12: 383394. [DOI] [PubMed] [Google Scholar]; ** Method for ensemble refinement based on the principle of maximum entropy.
  • [69].Zheng W, Zerze GH, Borgia A, Mittal J, Schuler B, Best RB. Inferring properties of disordered chains from FRET transfer efficiencies. J Chem Phys 2018, 148:123329. [DOI] [PMC free article] [PubMed] [Google Scholar]; ** New method for analyzing FRET data which allows the form of the distance distribution to vary with solvent conditions.
  • [70].Riback JA, Bowman MA, Zmyslowski AM, Plaxco KW, Clark PL, Sosnick TR. Commonly used FRET fluorophores promote collapse of an otherwise disordered protein. Proc Natl Acad Sci U S A 2019, 116: 8889–8894. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [71].Riback JA, Bowman MA, Zmyslowski AM, Knoverek CR, Jumper JM, Kaye EB, Freed KF, Clark PL, Sosnick TR. Response to comment on “Innovative scattering analysis shows that hydrophobic proteins are expanded in water”. Science 2018, 361: eaar7949. [DOI] [PubMed] [Google Scholar]
  • [72].Fuertes G, Banterle N, Ruff KM, Chowdhury A, Pappu RV, Svergun DI, Lemke EA. Comment on “innovative scattering analysis shows that hydrophobic disordered proteins are expanded in water”. Science 2018, 361: eaau8230. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [73].Best RB, Zheng W, Mittal J. Balanced protein-water interactions improve properties of disordered proteins and non-specific protein association. J Chem Theor Comput 2014, 10: 5113–5124. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [74].Huang J, Rauscher S, Nawrocki G, Ran T, Feig M, Groot BLD, Grubmüller H, MacKerell AD. Charmm36m: an improved force field for folded and intrinsically disordered proteins. Nat Methods 2017, 14: 71–73. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [75].Robustelli P, Piana S, Shaw DE. Developing a molecular dynamics force field for both folded and disordered protein states. Proc Natl Acad Sci U S A 2018, 115: E4758–E4766. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [76].Horinek D, Netz RR. Can simulations quantitatively predict peptide transfer free energies to urea solutions? thermodynamic concepts and force field limitations. J Phys Chem A 2011,115: 6125–6136. [DOI] [PubMed] [Google Scholar]
  • [77].Zheng W, Borgia A, Borgia MB, Schuler B, Best RB. Empirical optimization of interactions between proteins and chemical denaturants in molecular simulations. J Chem Theor Comput 2015, 11: 5543–5553. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [78].Nettels D, Gopich IV, Hoffmann A, Schuler B. Ultrafast dynamics of protein collapse from single molecule photon statistics. Proc Natl Acad Sci U S A 2007, 104: 2655–2660. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • [79].Schuler B. Perspective: Chain dynamics of unfolded and intrinsically disordered proteins from nanosecond fluorescence correlation spectroscopy combined with single-molecule FRET. J Chem Phys 2018, 149: 010910. [DOI] [PubMed] [Google Scholar]; ** Review on the use of nanosecond fluorescence correlation spectroscopy and single-molecule FRET to characterize unfolded protein dynamics.
  • [80].Kubelka J, Hofrichter J, Eaton WA. The protein folding “speed limit”. Curr Opin Struct Biol 2004, 14: 76–88. [DOI] [PubMed] [Google Scholar]
  • [81].Udgaonkar JB. Polypeptide chain collapse and protein folding. Arch Biochem Biophys 2013, 531: 24–33. [DOI] [PubMed] [Google Scholar]
  • [82].Camacho CJ, Thirumalai D. Kinetics and thermodynamics of folding in model proteins. Proc Natl Acad Sci USA 1993, 90: 6369–6372. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES