Abstract
We present a coarse-grained approach for modeling the thermodynamic stability of single-domain globular proteins in concentrated aqueous solutions. Our treatment derives effective protein-protein interactions from basic structural and energetic characteristics of the native and denatured states. These characteristics, along with the intrinsic (i.e., infinite dilution) thermodynamics of folding, are calculated from elementary sequence information using a heteropolymer collapse theory. We integrate this information into Reactive Canonical Monte Carlo simulations to investigate the connections between protein sequence hydrophobicity, protein-protein interactions, protein concentration, and the thermodynamic stability of the native state. The model predicts that sequence hydrophobicity can affect how protein concentration impacts native-state stability in solution. In particular, low hydrophobicity proteins are primarily stabilized by increases in protein concentration, whereas high hydrophobicity proteins exhibit richer nonmonotonic behavior. These trends appear qualitatively consistent with the available experimental data. Although factors such as pH, salt concentration, and protein charge are also important for protein stability, our analysis suggests that some of the nontrivial experimental trends may be driven by a competition between destabilizing hydrophobic protein-protein attractions and entropic crowding effects.
INTRODUCTION
Proteins in their native states play an important role in many biological processes and pharmaceutical applications. They participate in almost every aspect of the biochemical transport and regulation required for living organisms. They also serve as therapeutic drugs for targeting infectious diseases and cancer. However, most proteins, under most conditions, exhibit only marginal thermodynamic stability. As a result, minor sequence mutations or even small perturbations to solution parameters (e.g., pH, temperature, concentration, etc.) can result in protein denaturation (1).
This has important practical consequences. Unfolded or misfolded proteins look and behave differently than they do in their native states (2). They lack the highly specific molecular structure necessary for biological activity. Moreover, since unfolded proteins generally expose a significant number of hydrophobic core residues to the aqueous solvent, they have a tendency to associate and form non-native aggregates in solution. Unwanted protein aggregation and subsequent precipitation pose enormous problems in biological and pharmaceutical contexts (3,4). These processes are connected, although in a manner still imperfectly understood, to a number of debilitating diseases such as Parkinson's, Alzheimer's, Huntington's, and Down's syndrome (5–9). They are also known to cause rapid degradation of pharmaceutical formulations, reducing the shelf life of promising new drugs and restricting the strategies available for purification, handling, and delivery of therapeutics (10–15).
Given the technological and practical importance of protein stability, there is an urgent need to develop a generic understanding of the thermodynamic driving forces for protein unfolding, aggregation, crystallization, and phase separation. One way to facilitate this understanding is to build models that can account for, at various levels of sophistication, three vital aspects of stability for the native and denatured states: the relationship between protein sequence and structure, protein-protein interactions, and the global phase behavior of protein solutions. This is a formidable challenge because proteins are inherently large and complex molecules. Moreover, proteins encounter a wide variety of solution environments in biological and pharmaceutical processes, exposing them to thermal, mechanical, osmotic, and chemical stresses. To describe protein behavior under these conditions, one must also have a reliable method for accounting for hydrophobic interactions (16–23), which are a dominant force (24,25) in biomolecular folding and assembly events. These interactions have been particularly challenging to model because they exhibit subtle dependencies on both the state of the solution and the size and shape of the participating solutes.
Although each of the aforementioned aspects of protein stability have been long appreciated, theoretical investigations have focused more on their independent study than on devising strategies for integrating them into a single model. For example, models developed to investigate the single-molecule protein folding problem (26–30) are almost exclusively too complicated to be extended, either theoretically or via computer simulation, to investigate the collective behavior of thousands of proteins and millions of water molecules in solution. On the other hand, many recent theoretical models introduced to study the thermodynamics of crystallization or liquid-liquid phase separation in protein solutions (31–40), although insightful in many respects, do not consider sequence information, the polymeric character of the individual proteins, or the possibility of protein unfolding. Despite the development of some powerful coarse-grained models that address some of these issues (41–60), theoretical methods that can provide a comprehensive understanding of the protein stability problem are still lacking.
One goal of this article is to introduce a new theoretical strategy for treating, in a simple integrated way, basic aspects of the single-molecule protein folding problem, protein-protein interactions, and the global thermodynamics of protein solutions. A second goal is to use this approach to study the thermal stability of protein molecules in concentrated aqueous environments. In particular, we would like to gain some physical insights into why the native-state stabilities of three commonly studied single-domain globular proteins—ribonuclease A, lysozyme, and metmyoglobin—display different experimental dependencies on protein concentration. Calorimetric studies indicate that, for the conditions investigated, ribonuclease A shows slightly increasing native-state stability with increasing protein concentration (61), whereas lysozyme exhibits decreasing stability (62). Moreover, metmyoglobin shows nonmonotonic behavior: increasing protein concentration decreases native-state stability in dilute solutions but increases stability at high concentrations (63). The nonmonotonic trend for metmyoglobin persists over a wide range of thermodynamic conditions, with the concentration of minimum stability falling in the range 10–50 mg/ml depending on the pH and salt concentration of the solution.
To our knowledge, there is not yet a conceptual framework for understanding these experimental findings or relating them to the biophysical characteristics of the individual protein molecules. However, since hydrophobic interactions are a central component for both protein stability and protein-protein interactions, it is natural to ask whether one should also expect sequence hydrophobicity to be an important factor here. If one designates amino acids Ala, Gly, Ile, Leu, Met, Phe, Pro, Trp, and Val as hydrophobic (see, e.g., Shen et al. (64)), then protein sequences for ribonuclease A, lysozyme, and metmyoglobin are 33%, 44%, and 51% hydrophobic, respectively. We will use this hydrophobicity scale throughout the present article, but other commonly used hydrophobicity scales (see, e.g., Tanford (65) and Dill et al. (66)) show the same trend. Given these differences and the experimental observations discussed above, it is reasonable to hypothesize that the hydrophobic content of a protein's sequence may help to determine the qualitative concentration dependence of its native-state stability in solution. In this article, we use our coarse-grained strategy to explore this idea, and whether predicted trends are consistent with the available experimental data for single-domain globular proteins.
In short, we find that calculations based on our model suggest a simple physical picture for understanding the concentration dependencies of native-state stability that is consistent with the experiments. Specifically, they predict that globular proteins with lower sequence hydrophobicity, such as ribonuclease A, will tend to be stabilized by increases in protein concentration due to entropic crowding effects. They also predict that globular proteins with higher sequence hydrophobicity, such as metmyoglobin, will display concentration destabilization at lower protein concentrations due to protein-protein interactions and crowding-induced restabilization at higher protein concentrations. Of course, there are still important open questions about the generality of these conclusions that need to be systematically explored, including quantifying the precise role of solution pH, salt concentration, and protein charge. A more comprehensive set of experimental data for the concentration-dependencies of globular protein stabilities in solution also seems necessary to fully decouple and analyze the roles of these various factors. Nonetheless, we believe that this study provides a promising starting point for predicting, and ultimately understanding, the thermodynamic consequences of protein concentration on native-state stability in solution.
GENERAL COARSE-GRAINED MODELING STRATEGY
Our general strategy for modeling the various aspects of native-state protein stability in solution is displayed in boldface type below, followed by a brief description of how we implement this approach to develop a specific model.
Employ a heteropolymer collapse (HPC) model to determine thermodynamic and structural characteristics of native and denatured proteins. Here, we utilize the HPC theory introduced by Dill and co-workers (66,67) to predict the temperature-dependent free energy, radius of gyration, and number of hydrophobic surface residues of both the native and denatured states of a single protein molecule in aqueous solution.
Use the molecular characteristics determined from the HPC model to derive approximate center-to-center interprotein potentials. In this step, we determine the effective diameters and energies associated with native-native (NN), native-denatured (ND), and denatured-denatured (DD) contacts. We then integrate this information into a state-dependent protein-protein interaction potential.
Predict how environmental conditions and sequence information affect the native-state stability of proteins in solution. Specifically, we utilize the information from steps 1 and 2 to perform Reactive Canonical Monte Carlo (RCMC) simulations, which sample both protein molecule translations and folding/unfolding events. These simulations permit us to explore how sequence hydrophobicity, temperature, and protein concentration impact native-state stability in solution.
Single protein properties from the HPC model
We derive the single-protein properties required for our approach from the HPC theory developed by Dill and co-workers (66,67). Since the details of the HPC theory are explained in the original publications, we primarily discuss the inputs, outputs, and the essential physics of the approach here. Basically, proteins are modeled as heteropolymers of hydrophobic and polar monomers in an infinitely dilute aqueous solution. Depending on the size of the protein, the number of hydrophobic residues, and the temperature, either a collapsed (native-like) or expanded (denatured-like) state of the protein is predicted to be thermodynamically stable. The corresponding polymer collapse transition (i.e., protein folding) is thus an equilibrium process driven by the formation of favorable hydrophobic contacts and opposed by the loss of chain conformational entropy. Fig. 1 illustrates the collapse process for the heteropolymer chain. Note that the native state exposes fewer hydrophobic residues to the solvent than the denatured state, in accord with Kauzmann's original hydrophobic-core picture of protein folding (24).
FIGURE 1.
Schematic of the protein folding process predicted by HPC theory. Hydrophobic residues are light, and polar residues are dark. The denatured state is more expanded than the native state and exposes more hydrophobic amino acid residues to the solvent. Adapted from Dill et al. (66).
To make predictions about a particular protein, HPC theory requires the following information: the total number of amino acid residues in the protein sequence Nr, the fraction of those residues considered to be hydrophobic Φ (e.g., based on thermodynamic data from oil-water partitioning experiments (65)), and the temperature T. The interactions between hydrophobic residues and solvent, or between hydrophobic and polar residues, are less favorable than those between polar residues and solvent, or between two hydrophobic or two polar residues. The effective energetic difference is assumed to scale with χ(T)kBT, a term parameterized by Dill et al. (66) to approximate the experimental free energy associated with the transfer of a typical hydrophobic amino acid from its own pure phase into water. The conformational entropic contributions to the free energy of the protein are treated using standard statistical mechanical arguments from polymer physics (67).
HPC theory predicts the following coarse structural information about the native (N) and denatured (D) states of the protein, respectively: the radii of gyration RN and RD, the related fractions of residues in contact with the solvent fe(RN) and fe(RD), and the fractions of these partially solvated residues that are considered hydrophobic Θ and Φ. Based on these parameters, the theory predicts the intrinsic, i.e., infinite dilution, free energy of folding 
Note that HPC theory captures the fact that the structure and thermodynamics of proteins can be affected both by protein sequence (e.g., sequence hydrophobicity Φ) and by solution conditions (e.g., temperature). Since these properties will impact protein-protein interactions, they will prove vital for understanding how protein concentration modifies native-state stability. Much of the relevant physics can be understood in terms of the quality or compatibility of the aqueous solvent for the protein. For instance, all other factors being equal, an increase in the number of hydrophobic residues in a protein sequence can lead to a more compact denatured state (68). This behavior is in accord with basic notions of polymer physics. Since water is a poor solvent for apolar molecules, proteins with higher hydrophobic sequence content tend to adopt more compact and structured conformations in their denatured states, increasing the number of hydrophobic-hydrophobic residue contacts. This trend has been studied indirectly in protein systems by examining how mutations affect the solvent exposure and the free energy of denatured proteins using both experiments (69) and statistical mechanical models (70,71). On the other hand, increases in temperature or denaturant concentration tend to produce more expanded denatured states since they tip the thermodynamic balance in favor of chain conformational entropy over weakened hydrophobic contacts (66).
Though the amino acid interactions and the protein sequence information are treated at a rudimentary level in HPC theory, it still gives a reasonable overall accounting of the protein folding process. For example, Fig. 2 shows that if one chooses sequence characteristics (Nr = 154, Φ = 0.5) typical of single-domain globular proteins (see, e.g., Shen et al. (64)), HPC theory predicts free energies of folding in good qualitative agreement with the experiments. HPC theory also qualitatively reproduces many other experimental aspects of the thermodynamics of protein folding, which are discussed in detail elsewhere (66,72). However, since HPC theory assumes an infinitely dilute protein solution, one must keep in mind that the predictions only pertain to protein stability behavior in the absence of any protein-protein interactions.
FIGURE 2.
Free energy of folding for a model protein (Nr = 154, Φ = 0.5 dotted line) from HPC theory compared to that of real proteins (Mb, metmyoglobin; PTI2, dimer of pancreatic trypsin inhibitor; Cyt, cytochrome c; Ct, α-chymotrypsin; Lys, lysozyme; Tr, β-trypsin; and RNA, ribonuclease A). Reproduced with permission from Dill et al. (66). Copyright 1989 American Chemical Society.
Protein-protein interactions
The single-protein information provided by the HPC theory allows us to derive approximate protein-protein interaction potentials. We assume that the attractive part of the protein-protein interaction is primarily due to the driving force of proteins to desolvate their hydrophobic surface residues by burying them into a hydrophobic patch on a neighboring protein (see Fig. 3). In contrast, the repulsive part of the interaction accounts for the volume that each individual protein excludes to the centers of mass of other protein molecules in the solution.
FIGURE 3.
Schematic of a native-denatured protein interaction. Hydrophobic residues are light, and polar residues are dark. The denatured and native states have radii of gyration RD and RN, respectively. The fraction of surface residues that are hydrophobic on the denatured protein is assumed to be equal to the hydrophobicity of the protein sequence Φ. The analogous quantity for the native state, Θ (determined from HPC theory) is generally smaller (i.e., Θ < Φ). Derivation of the various types of protein-protein contact attractions are presented in Appendix A.
HPC theory correctly predicts that denatured protein molecules exclude more volume (RD ≥ RN) to other proteins and display higher surface hydrophobicity (Φ ≥ Θ) than their native-state counterparts (66). As a result, the effective exclusion diameters and the strength of attractions between proteins should be quantitatively different for each of the various types of protein-protein interactions (i.e., NN, ND, DD). In fact, a simple mean-field argument suggests that the magnitudes of those various contact attractions should scale in the following way:
![]() |
(1) |
![]() |
(2) |
![]() |
(3) |
The proportionality constants for Eqs. 1–3 are derived in Appendix A.
We incorporate the state-dependent interaction strengths presented above and the protein sizes derived from the HPC theory into a protein-protein potential Vij (34) that has been used to qualitatively capture many aspects of protein solution thermodynamics (see, e.g., Petsev et al. (73)),
![]() |
![]() |
(4) |
Here, α is a potential range parameter; ɛij is the potential well-depth; σij is the protein-protein diameter; and ij ∈ (NN, ND, DD). If one chooses α = 50, the potential mimics, in an average sense, the type of relatively short-range attractions exhibited by globular proteins (34). We calculate the temperature-dependent parameters ɛij from Eqs. 7–9 found in Appendix A, whereas the ratios σDD/σNN = RD/RN and σND/σNN = (RN + RD)/2RN are direct outputs from HPC theory (66).
By assuming that the above potential can describe NN, ND, and DD interactions, we are adopting the simplistic view that denatured proteins, like their native counterparts, are on average spherical in shape with effective sizes and attractive interactions that can be derived from the structural and energetic predictions of HPC theory. Clearly, a refined picture of the denatured state will be needed to develop a more comprehensive description of these non-native interactions. However, we note that the general strategy presented at the beginning of this section is flexible enough to accommodate the more accurate interprotein potential forms that are likely to emerge from continuing experimental studies (74,75). Thus, although our preliminary model potential is necessarily a simplistic one, future implementations will be able to readily incorporate state-of-the-art ideas about the effective interactions between protein molecules in their various states.
Fig. 4 shows the three interaction potentials for a model protein (Nr = 154, Φ = 0.5) at 362 K, and the inset displays the temperature dependence of the corresponding potential well depths. Note that, as expected, DD protein contacts are more favorable than ND and NN contacts, since denatured proteins have a greater number of solvent-exposed hydrophobic residues.
FIGURE 4.
Effective protein-protein potentials, calculated using Eq. 4 for a model protein (Nr = 154, Φ = 0.5) at T = 362 K. There are three different types of potentials (NN, ND, and DD) because proteins can exist in either native (N) or denatured (D) states. Note that the DD interactions are stronger and have larger exclusion volumes than the NN or ND interactions, since denatured proteins are more expanded and tend to bury more hydrophobic amino acid surface residues upon contact. (Inset plot) Potential well depths for ND, DD, and NN interactions as a function of temperature, calculated using Eqs. 7–9.
Using the above potential, one can also calculate a dimensionless second virial coefficient B22,ij* (76):
![]() |
(5) |
The second virial coefficient is an experimentally measurable quantity (see, e.g., Tessier et al. (77)) that has been linked to protein phase behavior (76,78). Vliegenthart and Lekkerkerker (76) noticed that liquid-liquid phase separation generally occurs in protein solutions when B22,ij* < −6. Fig. 5 shows B22,ij* as a function of temperature for a model protein (Nr = 154, Φ = 0.5), obtained by applying HPC theory, Eq. 4, and Eq. 5. As expected, the B22,ij* values are consistently more negative for interactions involving denatured proteins since they tend to bury more hydrophobic residues when making protein-protein contacts (see Appendix A). Moreover, since B22,DD* < −6 for this model protein and B22,NN* > −6, one might expect significant differences in the solubilities (79,80), and perhaps the aggregation behavior, of the native and denatured states. In the Results and Discussion sections of this article, we explore how these pronounced differences in protein interactions can affect native-state stability in concentrated solutions. We are also currently investigating how these differences can impact the global phase diagram of protein solutions (see Conclusions and Future Work).
FIGURE 5.
calculated using Eq. 5 for the model protein (Nr = 154, Φ = 0.5). Since DD attractions are stronger, the corresponding
is more negative, which generally indicates different solubility behaviors for the denatured and native states.
Simulation details
Our ultimate aim is to understand how protein sequence information, protein-protein interactions, and protein solution variables (e.g., temperature and protein concentration) combine to determine native-state stability. To accomplish this within the context of our model, we need to integrate the single-protein thermodynamics from HPC theory and our derived protein-protein interactions into a simulation technique that can generate representative equilibrium states of solutions at finite protein concentration. However, this task is perfectly suited for the Reactive Canonical Monte Carlo (RCMC) method developed by Johnson, Panagiotopoulos, and Gubbins (81,82). This algorithm allows one to readily compute the equilibrium properties of chemically reacting systems by taking advantage of both molecular translational moves and forward/reverse reaction steps. In our case, the reaction of interest is simply the unimolecular folding/unfolding of the coarse-grained protein molecules in their finite-concentration solution environment.
The inputs required for the canonical-ensemble implementation of the RCMC simulations are the total number of protein molecules N, the dimensionless protein concentration
(where V is the volume), the temperature T, the effective protein-protein potentials from Eq. 4, and the intrinsic free energy of folding
from HPC theory. Given this information, the algorithm samples the phase space relevant to the equilibrium reacting system, converging relatively quickly to the equilibrium fraction fN of native-state proteins. Since earlier articles (81–83) provide the detailed expressions for the reaction move probabilities along with strategies for efficient implementation of the RCMC algorithm, we do not elaborate on those issues here.
For each state point studied in this work, N = 256 proteins were simulated in a cubic box with periodic boundary conditions. To ensure that significant system-size effects were not present, we reproduced our simulations results for a number of state points using both smaller (N = 128) and larger (N = 500) systems. We have also reproduced the results for a number of state points with a grand canonical ensemble version of the algorithm. Fifteen percent of attempted moves in our simulations were denatured to native reactions (i.e., protein folding), and 15% were attempted native to denatured reactions (i.e., protein unfolding). The remaining 70% of attempted moves were protein displacements. Initially, the proteins were placed in an open FCC lattice configuration with either all native (fN = 1) or denatured (fN = 0) states. They were then equilibrated for the number of MC cycles required to displace each protein at least a distance of 5σNN from its original position. The potential energy, the fraction of native-state proteins fN, and the osmotic pressure Π were tracked closely to make sure that equilibration was completed before subsequent production runs were initiated.
To verify that stable, or at least metastable, homogeneous fluid phases were being sampled by our simulations, standard translational and bond-orientational order parameters for detecting crystallization (84,85) were monitored during the equilibration and production stages. We found that in all but a few higher-concentration state points (see Appendix B), convergence to the same fluid state could be reproducibly achieved by initiating from either all-denatured (fN = 0) or all-native (fN = 1) protein lattices. Of course, without knowledge of the complete phase diagram of a system, one cannot unambiguously determine, either from simulations or experiments, whether a given state is thermodynamically stable or metastable with respect to a phase change (e.g., crystallization). In this initial study, we have focused on understanding native-state stability of homogeneous fluid states of model protein solutions, and we have not attempted the larger problem of calculating rigorous thermodynamic phase boundaries for the solution. However, we are currently studying the global phase diagram of our model protein systems, and we will report those results in a future publication (see Conclusions and Future Work).
To explore how sequence hydrophobicity might qualitatively impact the concentration dependence of native-state stability using our approach, we have studied two different model proteins with the same number of amino acid residues (Nr = 154) but different hydrophobic contents (Φ = 0.4 and Φ = 0.5). As was illustrated recently (64), these hydrophobicities span the typical range of those seen in single-domain globular proteins with sizes between 125 and 175 residues, and so they reasonably bracket the types of behaviors that one might expect to see in experiments. As will be shown in Results, the difference in midpoint folding temperatures at infinite dilution for these two model proteins is ∼26 K, with the higher hydrophobicity protein showing more thermal stability. As a rough comparison, it can be seen in Fig. 2 that the folding temperatures of ribonuclease A and metmyoglobin also differ by ∼20 K (66). Although the model proteins (Φ = 0.4 and Φ = 0.5) exhibit both cold and warm denaturation, we focused exclusively on destabilization by the latter mechanism in the present work.
RESULTS
In this section, we briefly present the main results of our study. We divide our data into two main parts that pertain to the two model proteins that were explored. In the Discussion, we explore the physical insights that can be gained from our results.
High hydrophobicity protein (Nr = 154, Φ = 0.5)
In Fig. 6, we plot the equilibrium fraction of folded (i.e., native-state) proteins fN as a function of temperature T for a series of protein concentrations. As is expected, for low concentrations, our simulations are extremely close to the predictions from HPC theory (66). The midpoint folding temperature Tm, i.e., the temperature that yields fN = 0.5, is ∼364 K for this protein at infinite dilution. By fitting the simulation points obtained at each protein concentration to sigmoidal curves, we can deduce the concentration dependence of Tm for our model protein. The main trends are as follows. If one starts with a protein solution at low concentrations, increasing protein concentration results in protein destabilization (i.e., Tm decreases as concentration increases). However, as one continues to increase protein concentration, the protein is ultimately restabilized (i.e., Tm rebounds to higher values). At this point, we simply assert that this nonmonotonic stability behavior is due to a competition between destabilizing attractive protein-protein interactions that dominate at low protein concentration and stabilizing entropic crowding effects (see, e.g., Minton (86)) that prevail at high protein concentration. We will provide quantitative justification for this statement in the Discussion.
FIGURE 6.
RCMC simulation results showing the fraction of folded proteins fN as a function of temperature T for the high hydrophobicity model protein (Nr = 154, Φ = 0.5). At low concentrations, increasing concentration destabilizes native-state proteins, whereas the opposite trend occurs at high protein concentration. The protein concentrations
are shown on the graph nearest to their curve. The lines are present as a guide to the eye.
In Fig. 7, Tm is plotted as a function of protein concentration, yielding a protein stability “phase diagram” for the high hydrophobicity protein. The white (nonshaded) area comprises state points for which the native state is thermodynamically favored (fN > 0.5), whereas the shaded area above the curve indicates the temperature-concentration coordinates where the denatured state prevails (fN < 0.5). As we demonstrate later, very high protein concentrations cannot be attained in this model if a majority of proteins are to remain in the denatured state, simply due to the fact that denatured proteins exclude more volume than proteins in the more compact native state. However, attractive protein-protein interactions also play an important role at lower protein concentrations, where they induce significant concentration destabilization. For example, Tm at
is nearly 10 K below the infinite dilution value.
FIGURE 7.
Protein stability phase diagram for the high hydrophobicity model protein (Nr = 154, Φ = 0.5) in the temperature T-protein concentration
plane. The shaded region indicates the temperatures and concentrations that favor the denatured state (fN < 0.5). Circles represent Tm from the RCMC protein simulations, the squares show Tm from the nonattracting high hydrophobicity protein simulations, and the dashed line indicates the estimated Tm from Eq. 6.
Fig. 8 provides a clearer picture for how increasing protein concentration destabilizes the native state at low concentrations and restabilizes it at high concentrations. To quantify this effect, one can define a midpoint concentration for unfolding at low concentrations, and a midpoint concentration for refolding at high concentrations. Since the intrinsic stability of a protein molecule decreases upon heating (i.e.,
becomes less negative), proteins become more vulnerable to destabilizing effects like attractive protein-protein interactions, and thus the midpoint concentration for unfolding decreases with increasing temperature. Similarly, the midpoint concentration for refolding increases with increasing temperature, indicating that more crowding is required to refold proteins with lower intrinsic thermal stability. A figure containing all of the simulated state points for this protein can be found in Appendix B.
FIGURE 8.
Fraction of folded proteins fN as a function of protein concentration
for several isotherms of the high hydrophobicity model protein (Nr = 154, Φ = 0.5). At temperatures far below the intrinsic folding temperature (∼364 K), concentration effects are almost negligible. As we approach the intrinsic folding temperature at low concentration, increasing concentration shows a pronounced destabilizing effect. At higher concentrations, the native state is restabilized due to crowding effects. The solid lines are present as a guide to the eye.
Low hydrophobicity protein (Nr = 154, Φ = 0.4)
Here we explore protein concentration effects on the thermodynamic stability of a low hydrophobicity protein. In Fig. 9, we plot the fraction of native-state proteins as a function of temperature for a series of different protein concentrations. Again, as expected, the low concentration results closely agree with HPC theory. Moreover, Tm is ∼338 K at infinite dilution, which is logically lower in value than that of the high hydrophobicity protein (∼364 K). The most striking feature of this data set is that the concentration destabilization effects seen for the high hydrophobicity protein are essentially negligible here. In short, for this model protein we find that increasing protein concentration shifts the folding equilibrium to the native state for essentially all finite protein concentrations. Fig. 10 shows the corresponding protein stability phase diagram, which highlights the stabilizing role that protein concentration plays for the low hydrophobicity protein. A figure displaying all of the simulated state points for this protein can also be found in Appendix B.
FIGURE 9.
RCMC simulation results showing the fraction of folded proteins fN as a function of temperature T for the low hydrophobicity model protein (Nr = 154, Φ = 0.4). The general trend is increasing native-state stability with increasing protein concentration. The protein concentrations
are shown on the graph nearest to their curve. The lines are present as a guide to the eye. (Inset) The inset plot is the reduced second virial coefficient, calculated using Eq. 5, as a function of temperature. The lower hydrophobicity of this model protein leads to weaker protein-protein interactions.
FIGURE 10.
Protein stability phase diagram for the low hydrophobicity model protein (Nr = 154, Φ = 0.4) in the temperature T-protein concentration
plane. The shaded region indicates the temperatures and concentrations that favor the denatured state (fN < 0.5). Circles represent Tm from the RCMC protein simulations, the squares show Tm from the nonattracting low hydrophobicity protein simulations, and the dashed line indicates the estimated Tm from Eq. 6. Because of weaker intraprotein attractions, the denatured state is more expanded than the high hydrophobicity protein (see Fig. 11). Thus, concentration effects are mostly controlled by crowding-induced stabilization of the native-state.
In the next section, we discuss the physical origins of the relationship between sequence hydrophobicity, protein interactions, and the different stability behaviors for the high- and low-hydrophobicity proteins.
DISCUSSION
The simulation results suggest that the thermodynamic stability of a high hydrophobicity protein can exhibit a nonmonotonic dependence on concentration. At low total volume fractions, increasing protein concentration has a destabilizing effect for the native state. This finding is consistent with the low-concentration destabilization that is observed experimentally for the high hydrophobicity protein metmyoglobin (Φ = 0.51) (63), and to a lesser extent with the medium hydrophobicity protein lysozyme (Φ = 0.44) (62). Here, as discussed in the Introduction, we have adopted Φ values calculated from a hydrophobicity scale where Ala, Gly, Ile, Leu, Met, Phe, Pro, Trp, and Val are considered hydrophobic (64). Protein concentration destabilization of the native state is also consistent with the increase in non-native aggregation rate that has been experimentally observed with increasing protein concentration for both reversible (87,88) and irreversible protein aggregation (see, e.g., Cueto et al. (62) and Krishnamurthy and Manning (89)), although for irreversible processes it is generally difficult to determine whether a thermodynamic driving force actually underlies the kinetically controlled laboratory phenomenon.
At higher volume fractions, however, our model predicts that increasing protein concentration shifts the folding equilibrium back toward the native state. This second trend is also in agreement with the so-called entropic crowding of proteins in solution that has been explained theoretically and observed experimentally (51,52,54,86). To understand which of these two concentration effects will dominate for different proteins under various solution conditions, we require some additional insight into the molecular mechanisms.
One physical picture that is consistent with the above observations is the following. Native proteins may unfold to the denatured state if by doing so 1), they can form enough favorable attractions with neighboring proteins to outweigh their intrinsic thermal stability (i.e., their negative
); and 2), they do not simultaneously create new highly unfavorable repulsive interactions with neighboring proteins due to their expanded denatured configuration. This type of picture is consistent with experimental results that indicate that reversible formation of non-native oligomers can induce unfolding (90). On the other hand, even intrinsically unstable proteins (i.e., exhibiting positive
) may refold due to crowding if the protein concentration is so high that there is not enough free volume to accommodate them in their denatured states. This is quite similar in nature to the experimental protein stabilization induced by confinement (91). Below, we use our data to subject this physical picture to some basic quantitative tests.
Our strategy in this regard is to devise simple and transparent means for separately calculating the contributions of destabilizing protein-protein attractions and stabilizing crowding effects to native-state stability. Armed with the information that we obtain from these calculations, we will be in a better position to interpret the RCMC results presented in the previous section for our two model proteins. We begin by considering the destabilizing protein-protein interactions for both model proteins, and then we return to the balancing effects of crowding.
As noted in Results, the Tm for the high hydrophobicity protein (Nr = 154, Φ = 0.5) decreases from ∼364 K at infinite dilution to ∼354 K at
To check whether destabilizing protein-protein attractions can explain this trend, we simply balance
(calculated from HPC theory) with the attractive protein-protein interactions that we expect the protein to gain by unfolding in solution:
![]() |
(6) |
Here, ΔND and ΔDD characterize the range of the ND and DD attractions (calculated from the effective pair potentials), whereas ρN and ρD are the number densities of native and denatured proteins, respectively. In particular, the quantity Δij represents the width of the attractive potential well, and it is defined to be the distance between the protein-protein hard-core separation σij and the larger separation rij,u where the potential energy (Eq. 4) has risen to V(rij,u) = −0.05ɛij. Taking a look at the first term on the right-hand side of Eq. 6, the quantity
represents the number of new contacts a protein makes with native-state proteins upon unfolding, and −ɛND/2 is approximately the average energy of that contact interaction. Similarly, the second term quantifies the total energy of the new contacts that a protein makes with neighboring denatured proteins upon unfolding. For simplicity, we have assumed here that the local concentrations of protein states can be reasonably approximated by the corresponding bulk concentrations. To test whether the balancing idea presented above has merit, Eq. 6 is plotted along with the simulation data in Fig. 7. Note that it correctly, and even semiquantitatively, predicts the decrease in Tm exhibited by the RCMC simulation data for the high hydrophobicity protein.
We have also used Eq. 6 to analyze the attractive destabilizing forces for the low hydrophobicity protein solution. As can be seen by the dashed curve in Fig. 10, Eq. 6 predicts that, in the absence of entropic crowding, attraction destabilization should also be a noticeable effect for this protein. However, this prediction is clearly not born out by the full RCMC results of the low hydrophobicity protein solution. To understand why attractions give rise to a net destabilizing effect for the high hydrophobicity protein at low concentrations and not for the low hydrophobicity protein, we need to examine the concentration dependencies of the stabilizing crowding effects for the two protein solutions.
To isolate crowding effects from destabilizing attractions, we have performed additional RCMC simulations of our two model proteins, identical in all respects to the ones discussed previously except that we have now “turned off” all interprotein attractions. For the nonattracting high hydrophobicity protein, we find that the Tm is almost unchanged for concentrations below
(see the squares in Fig. 7). This absence of a strong crowding effect at low concentrations basically explains why the protein-protein attraction destabilization dominates under these conditions. Conversely, the native state of the nonattracting high hydrophobicity protein shows a very pronounced increase in stability for concentrations above
Since crowding stabilization (squares in Fig. 7) outweighs attraction destabilization (dashed line in Fig. 7) for high protein concentrations but not for low concentrations, we get a nonmonotonic concentration dependence for stability of the high hydrophobicity protein.
We also plot the Tm of the nonattracting low hydrophobicity protein solution (squares) in Fig. 10, along with Eq. 6 (dashed curve) and the Tm of the regular (i.e., attracting) low hydrophobicity protein solution (circles). Notice that the attraction destabilization contribution essentially exactly balances the crowding effect for low concentrations, whereas the crowding effect dominates for higher concentrations. The net result is that the native state of the low hydrophobicity protein shows nearly monotonic stabilization with increasing protein concentration.
Why does the crowding stabilization effect play a more important role, even at low concentrations, for the low hydrophobicity protein? As was hinted at above, it is not because the interprotein attractions have a weaker destabilizing effect for this protein. In fact, because lower hydrophobicity means both weaker protein-protein interactions and lower intrinsic thermal stability (i.e., less negative
), Eq. 6 predicts that the low hydrophobicity protein would still be significantly destabilized in the absence of the entropic crowding effect. However, the weaker intraprotein attractions give rise to a more expanded denatured state for the low hydrophobicity protein (see Fig. 11). As a result, there is a larger entropic penalty for unfolding of low hydrophobicity proteins in concentrated solutions, which manifests itself as larger relative thermodynamic stability of the native state. Because of these physical factors, one might generally expect low hydrophobicity proteins to be stabilized (or, at worst, unaffected) by increases in protein concentration despite their comparatively smaller intrinsic stability. As was discussed in the Introduction, this analysis is consistent with the slightly increasing experimental stability of the low hydrophobicity protein ribonuclease A (Φ = 0.33) with increasing protein concentration (61).
FIGURE 11.
Ratio of the effective hard-core diameter (σDD/σNN) as a function of temperature for both model proteins: Nr = 154, Φ = 0.4 and Nr = 154, Φ = 0.5. Note that the lower hydrophobicity protein has a much larger effective size in the denatured state, implying an increased excluded volume.
CONCLUSIONS AND FUTURE WORK
In this article, we have introduced a new coarse-grained approach for modeling protein stability in concentrated solution environments. Our treatment calculates state-dependent center-to-center interactions between proteins from basic structural and energetic characteristics of their native and denatured states. These characteristics are each derived from simple protein sequence information using an insightful heteropolymer collapse theory. We have integrated the thermodynamic and structural information obtained from the heteropolymer collapse theory and the resulting interprotein potentials into RCMC simulations to investigate the connections between protein sequence hydrophobicity, protein-protein interactions, protein concentration, and the thermodynamic stability of the native state.
The calculations based on this approach outline a simple physical picture for understanding the concentration dependencies of native-state stability. In particular, they predict that globular proteins with lower sequence hydrophobicity, such as ribonuclease A, should be stabilized by increases in protein concentration due to entropic crowding effects. They also predict that globular proteins with higher sequence hydrophobicity, such as metmyoglobin, should display concentration destabilization at lower protein concentrations due to protein-protein interactions and crowding-induced restabilization at higher protein concentrations. These findings are consistent with the available experimental data for these proteins. However, as is discussed below, we also recognize that there are still important open questions about the generality of our predicted trends that need to be explored through new calculations, including understanding the role of solution pH, salt concentration, and protein charge. A more comprehensive set of experimental data for the concentration dependencies of globular protein stabilities in solution also seems necessary for this effort.
Finally, we think that there are three promising directions for future research based on this work that will help to broaden our understanding of protein stability. The first is the calculation of the thermodynamic phase boundaries for our model protein solutions. In particular, we are interested in understanding whether molecular unfolding can induce liquid-liquid immiscibility between protein-rich and protein-poor solution phases. These types of thermodynamic transitions could give insights into the driving forces for aggregation or precipitation in protein systems. Moreover, we are interested in investigating how sequence hydrophobicity and protein size can impact the relative location of the liquid-liquid and solid-liquid (protein crystallization) transitions on the phase diagram of protein solutions. A second future area of inquiry is to use our approach to explore the precise molecular mechanisms for concentration-induced protein destabilization. For instance, how does the mechanism for protein destabilization (e.g., the reversible formation of non-native dimers and trimers) depend on the protein characteristics and solution conditions? The third direction is to account for protein charge, solution pH, and salt concentration in our model calculations. Alonso, Dill, and Stigter (92,93) have outlined a systematic methodology for incorporating these effects into the heteropolymer collapse theory used in this work, and we are currently investigating how this type of treatment could be extended into our general coarse-grained simulation strategy.
Acknowledgments
We thank Prof. W. R. Smith and Prof. B. Triska for their help in the development of an efficient RCMC code. We are grateful to Prof. Venkat Ganesan for his useful comments on the model and Prof. Christopher Roberts for providing insights into various aspects of protein stability. We thank Sabrina Bromberg for providing Fig. 2. We also thank Dr. Allen Minton, Dr. Vincent Shen, and Prof. Jeffrey Errington for their comments on an early version of this manuscript, and their ideas about future studies using this model. Finally, we thank Prof. Ken Dill for fruitful discussions regarding the HPC theory and for several useful references.
This work was generously supported by The Merck Company Foundation and the David and Lucile Packard Foundation.
APPENDIX A: ATTRACTIVE WELL-DEPTH DERIVATION
The effective protein-protein contact attractions in our model depend on the molecular characteristics of the proteins involved. Two of those characteristics come directly from the protein sequences, e.g., the total number of hydrophobic NrΦ and polar Nr(1 – Φ) amino acid residues (66). Other properties are structural and can be derived from HPC theory. These structural quantities, which generally take on different values for native (N) and denatured (D) proteins, include the radius of gyration (RN or RD(T)), the fraction of residues on the protein surface (fe(RN) or fe(RD), where fe(r) = 1 – (1 – r−1)3), and the fraction of surface residues that are hydrophobic (Θ(T) or Φ for N or D states, respectively). The protein-protein interactions also depend on the free energy associated with the hydration of a hydrophobic amino acid residue χ(T)kBT, and this term represents the only place where solvent effects enter the theory. The derivation and physical interpretation of each of the quantities mentioned above are described in detail in the original HPC articles (66,67). Here, we use the quantities to derive approximate expressions for the attractive well depths (see Fig. 4, inset) of our coarse-grained protein-protein potentials using geometric approximations and mean-field energetic arguments. These expressions are not intended to quantitatively reproduce the interactions for specific proteins, but rather to provide simple and transparent relations between the strength of globular protein-protein attractions and the molecular characteristics of the proteins themselves.
Here, we explicitly derive the general relationship for the ND contact attraction, and the analytical forms for the NN and DD attractions can be easily deduced from this expression. As a first step, we approximate the fraction of the denatured protein's surface desolvated by contact with a native protein to be
which is simply the projected surface area of the native protein divided by the total ND interaction area. If one analyzes this expression in the limit of RN = RD, then one finds that it makes the reasonable, but not quantitatively exact, prediction that one protein would be completely desolvated if it were closely surrounded by eight neighboring proteins. Of course there should also be an 𝒪(1) prefactor included to correct this formula to account for the specific packing properties of the proteins under study. However, to keep the ideas simple and general, we do not attempt to describe that level of detail here. To determine the surface fraction of a native protein that is desolvated by contact with a denatured protein, we use a similar argument: 
The magnitude of the effective energy change upon making the aforementioned ND contact is then given by
![]() |
(7) |
In this expression, Ns = Nr/1.4 is the total number of segments per protein (1.4 amino acid residues per segment), and σ = 2/3 is the fractional area of each surface residue that is not in direct contact with solvent (67). The product ΦΘ that appears in both terms accounts for the probability of forming a contact between two hydrophobic residues in the mutually desolvated area between proteins. This approximation implicitly assumes that hydrophobic residues are uniformly distributed on the surface of each protein molecule.
Using analogous arguments, the magnitudes of the effective NN and DD contact energies directly follow:
![]() |
(8) |
![]() |
(9) |
The contact energies derived above are used in our model for the well depths of the effective protein-protein potentials given by Eq. 4.
The derivations provided in this Appendix yield, at best, average contact interactions, and for simplicity they have not addressed some potentially important structural and energetic features of real proteins. Clearly, the development of more accurate and comprehensive theoretical expressions for protein-protein interactions and their experimental verification are important areas for future scientific inquiry.
APPENDIX B: OCCURRENCE OF ORDERED NON-NATIVE AGGREGATES
We noted in Simulation Details that a small number of higher-concentration state points did not converge to an equilibrium fluid solution with the same fraction of native proteins fN if they were initialized from all native (fN = 1) versus all denatured (fN = 0) FCC configurations. These state points corresponded to concentrations for which an FCC crystalline lattice of denatured proteins was very favorable energetically, with each particle taking full advantage of its neighbors' hydrophobic contacts. In the context of our simple model, one might view such a structure as ordered non-native aggregates (perhaps very roughly analogous to fibrillar structures). Although we were primarily interested in, and only report properties of, equilibrium fluid protein solutions in Results, we show below that it is also relatively straightforward to predict the general region of the phase diagram where an ordered FCC arrangement of non-native proteins would be favorable in this model.
From Eq. 4, one can determine the ratio of the minimum-energy separation σmin, DD(T) to the hard-core diameter σDD(T) for denatured proteins σmin, DD(T)/σDD(T) ≈ 1.158. However, the nearest-neighbor separation for an FCC crystal of denatured proteins is also given by
![]() |
(10) |
where ρ is the number density. By combining these two ideas, we can determine the ideal energetic conditions for forming an ordered aggregate of denatured proteins from the relation
![]() |
(11) |
Figs. 12 and 13 show that this relationship very closely approximates the conditions where ordered arrangements of denatured proteins actually occurred in our simulations.
FIGURE 12.
Complete set of RCMC simulation data in the temperature T-protein concentration
plane for the higher hydrophobicity protein (Nr = 154, Φ = 0.5). Solid circles represent Tm, plus signs indicate equilibrium fluid protein solutions, shaded triangles indicate the occurrence of ordered non-native aggregates, and the shaded line represents the predicted ordered non-native aggregate curve (see Eq. 11). The solid curve is a guide to the eye.
FIGURE 13.
Complete set of simulation data and results in the temperature T-protein concentration
plane for the lower hydrophobicity protein (Nr = 154, Φ = 0.4). Solid circles represent Tm, plus signs indicate equilibrium fluid protein solutions, shaded triangles indicate the occurrence of ordered non-native aggregates, and the shaded line represents the predicted ordered non-native aggregate curve (see Eq. 11). The solid curve is a guide to the eye.
References
- 1.Privalov, P. L. 1979. Stability of proteins: small globular proteins. Adv. Protein Chem. 33:167–241. [DOI] [PubMed] [Google Scholar]
- 2.Shortle, D. 1999. The denatured state (the other half of the folding equation) and its role in protein stability. FASEB J. 10:27–34. [DOI] [PubMed] [Google Scholar]
- 3.Wetzel, R. 1994. Mutations and off-pathway aggregation of proteins. Trends Biotechnol. 12:193–198. [DOI] [PubMed] [Google Scholar]
- 4.Fink, A. L. 1998. Protein aggregation: folding aggregates, inclusion bodies, and amyloids. Fold. Des. 3:R9–R23. [DOI] [PubMed] [Google Scholar]
- 5.Agussi, A., and C. Haass. 2003. Games played by rogue proteins in prion disorders and Alzheimer's disease. Science. 302:814–818. [DOI] [PubMed] [Google Scholar]
- 6.Hsu, A., C. T. Murphy, and C. Kenyon. 2003. Regulation of aging and age-related disease by DAF-16 and heat-shock factor. Science. 300:1142–1145. [DOI] [PubMed] [Google Scholar]
- 7.Kelly, J. W. 2002. Towards an understanding of amyloidogenesis. Nat. Struct. Biol. 9:323–325. [DOI] [PubMed] [Google Scholar]
- 8.Dobson, C. M. 1999. Protein misfolding, evolution and disease. Trends Biochem. Sci. 24:329–332. [DOI] [PubMed] [Google Scholar]
- 9.Dobson, C. M. 2001. The structural basis of protein folding and its links with human disease. Philos. Trans. R. Soc. Lond. B Biol. Sci. 356:133–145. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Cleland, J. L., M. F. Powell, and S. J. Shire. 1993. The development of stable protein formulations: a close look at protein aggregation, deamidation, and oxidation. Crit. Rev. Thera. Drug Carr. Sys. 10:307–377. [PubMed] [Google Scholar]
- 11.Kendrick, B. S., J. F. Carpenter, J. L. Cleland, and T. W. Randolph. 1998. A transient expansion of the native state precedes aggregation of recombinant human interferon-γ. Proc. Natl. Acad. Sci. USA. 95:14142–14146. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Rey, L., and J. C. May, editors. 1999. Mechanisms of protein stabilization during freeze-drying and storage: the relative importance of thermodynamic stabilization and glassy state relaxation dynamics. In Freeze-Drying/Lyophilization of Pharmaceutical and Biological Products. Marcel Dekker, NY. 161–198.
- 13.Bummer, P. M., and S. Koppenol. 2000. Chemical and physical considerations in protein and peptide stability. Drugs Pharm. Sci. 99:5–69. [Google Scholar]
- 14.Chi, E. Y., S. Krishnan, T. Randolph, and J. Carpenter. 2003. Physical stability of proteins in aqueous solution: mechanism and driving forces in nonnative protein aggregation. Pharm. Res. 20:1325–1336. [DOI] [PubMed] [Google Scholar]
- 15.Roberts, C. J. 2003. Kinetics of irreversible protein aggregation: analysis of extended Lumry-Eyring models and implications for predicting protein shelf life. J. Phys. Chem. B. 107:1194–1207. [Google Scholar]
- 16.Stillinger, F. H. 1973. Structure in aqueous solutions of nonpolar solutes from the standpoint of scaled-particle theory. J. Sol. Chem. 2:141–158. [Google Scholar]
- 17.Pratt, L. R., and D. Chandler. 1977. Theory of the hydrophobic effect. J. Chem. Phys. 67:3683–3704. [Google Scholar]
- 18.Hummer, G., S. Garde, A. E. Garcia, A. Pohorille, and L. R. Pratt. 1996. An information theory model of hydrophobic interactions. Proc. Natl. Acad. Sci. USA. 93:8951–8955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Lum, K., D. Chandler, and J. D. Weeks. 1999. Hydrophobicity at small and large length scales. J. Phys. Chem. B. 103:4570–4577. [Google Scholar]
- 20.Ashbaugh, H. S., T. M. Truskett, and P. G. Debenedetti. 2001. A simple molecular thermodynamic theory of hydrophobic hydration. J. Chem. Phys. 116:2907–2921. [Google Scholar]
- 21.Pratt, L. R. 2002. Molecular theories of hydrophobic effects: “She is too mean to have her name repeated.” Annu. Rev. Phys. Chem. 53:409–436. [DOI] [PubMed] [Google Scholar]
- 22.Southall, N. T., K. A. Dill, and A. D. J. Haymet. 2002. A view of the hydrophobic effect. J. Phys. Chem. B. 106:521–533. [Google Scholar]
- 23.Rajamani, S., T. M. Truskett, and S. Garde. 2005. Hydrophobic hydration from small to large lengthscales: understanding and manipulating the crossover. Proc. Natl. Acad. Sci. USA. 102:9475–9480. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Kauzmann, W. 1959. Some factors in the interpretation of protein denaturation. Adv. Protein Chem. 14:1–63. [DOI] [PubMed] [Google Scholar]
- 25.Dill, K. A. 1990. Dominant forces in protein folding. Biochemistry. 29:7133–7155. [DOI] [PubMed] [Google Scholar]
- 26.Dill, K. A., S. Bromberg, K. Z. Yue, K. M. Fiebig, D. Yee, P. D. Thomas, and H. Chan. 1995. Principles of protein folding—a perspective from simple exact models. Protein Sci. 4:561–602. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Duan, Y., and P. A. Kollman. 1998. Pathways to a protein folding intermediate observed in a 1-microsecond simulation in aqueous solution. Science. 282:740–744. [DOI] [PubMed] [Google Scholar]
- 28.Snow, C. D., H. Nguyen, V. S. Pande, and M. Gruebele. 2002. Absolute comparison of simulated and experimental protein-folding dynamics. Nature. 420:102–106. [DOI] [PubMed] [Google Scholar]
- 29.Garcia, A. E., and J. N. Onuchic. 2003. Folding a protein in a computer: an atomic description of the folding/unfolding of protein A. Proc. Natl. Acad. Sci. USA. 100:13898–13903. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Herges, T., and W. Wenzel. 2004. An all-atom force field for tertiary structure prediction of helical proteins. Biophys. J. 87:3100–3109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Rosenbaum, D. F., and C. F. Zukoski. 1996. Protein interactions and crystallization. J. Cryst. Growth. 169:752–758. [Google Scholar]
- 32.Neal, B. L., D. Asthagiri, and A. M. Lenhoff. 1998. Molecular origins of osmotic second virial coefficients of proteins. Biophys. J. 75:2469–2477. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Leckband, D., and S. Sivasanker. 1999. Forces controlling protein interactions: theory and experiment. Colloids Surf. B. 14:83–97. [Google Scholar]
- 34.ten Wolde, P. R., and D. Frenkel. 1997. Enhancement of protein crystal nucleation by critical density fluctuations. Science. 277:1975–1978. [DOI] [PubMed] [Google Scholar]
- 35.Foffi, G., G. D. McCullagh, A. Lawlor, E. Zaccarelli, K. A. Dawson, F. Sciortino, P. Tartaglia, D. Pini, and G. Stell. 2002. Phase equilibria and glass transition in colloidal systems with short-ranged attractive interactions: application to protein crystallization. Phys. Rev. E. 65:031407-1–031407-17. [DOI] [PubMed] [Google Scholar]
- 36.Dixit, N. M., and C. F. Zukoski. 2003. Competition between crystallization and gelation: a local description. Phys. Rev. E. 67:061501-1–061501-13. [DOI] [PubMed] [Google Scholar]
- 37.Hloucha, M., J. F. M. Lodge, A. M. Lenhoff, and S. I. Sandler. 2001. A patch-antipatch representation of specific protein interactions. J. Cryst. Growth. 232:195–203. [Google Scholar]
- 38.Lomakin, A., N. Asherie, and G. B. M. Benedek. 1999. Aeolotopic interactions of globular proteins. Proc. Natl. Acad. Sci. USA. 96:9465–9468. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Wu, J. Z., D. Bratko, H. W. Blanch, and J. M. Prausnitz. 2000. Interaction between oppositely charged micelles or globular proteins. Phys. Rev. E. 62:5273–5280. [DOI] [PubMed] [Google Scholar]
- 40.Curtis, R. A., C. Steinbrecher, A. Heinemann, H. W. Blanch, and J. M. Prausnitz. 2002. Hydrophobic forces between protein molecules in aqueous solutions of concentrated electrolyte. Biophys. Chem. 98:249–265. [DOI] [PubMed] [Google Scholar]
- 41.Minton, A. P. 1981. Excluded volume as a determinant of macromolecular structure and reactivity. Biopolymers. 20:2093–2120. [Google Scholar]
- 42.Fields, G. B., D. Alonso, D. Stigter, and K. A. Dill. 1992. Theory for the aggregation of proteins and copolymers. J. Phys. Chem. 96:3974–3981. [Google Scholar]
- 43.Zimmermann, S. B., and A. P. Minton. 1993. Macromolecular crowding: biochemical, biophysical and physiological consequences. Annu. Rev. Biophys. Biomol. Struct. 22:27–65. [DOI] [PubMed] [Google Scholar]
- 44.Minton, A. P. 1998. Molecular crowding: analysis of effects of high concentrations of inert co-solutes on biochemical equilibria and rates in terms of volume exclusion. Meth. Enzymol. 295:127–149. [DOI] [PubMed] [Google Scholar]
- 45.Smith, A. V., and C. K. Hall. 2001. α-helix formation: discontinuous molecular dynamics on an intermediate-resolution protein model. Proteins Struct. Funct. Genet. 44:344–360. [DOI] [PubMed] [Google Scholar]
- 46.Smith, A. V., and C. K. Hall. 2001. Assembly of a tetrameric α-helical bundle: computer simulations on an intermediate-resolution protein model. Proteins Struct. Funct. Genet. 44:376–391. [DOI] [PubMed] [Google Scholar]
- 47.Smith, A. V., and C. K. Hall. 2001. Protein refolding versus aggregation: computer simulations on an intermediate-resolution protein model. J. Mol. Biol. 312:187–202. [DOI] [PubMed] [Google Scholar]
- 48.Dima, R. I., and D. Thirumalai. 2002. Exploring protein aggregation and self-propagation using lattice models: phase diagrams and kinetics. Protein Sci. 11:1036–1049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Kinjo, A. R., and S. Takada. 2002. Effects of macromolecular crowding on protein folding and aggregation studied by density functional theory: dynamics. Phys. Rev. E. 66:051902-1–051902-10. [DOI] [PubMed] [Google Scholar]
- 50.Kinjo, A. R., and S. Takada. 2002. Effects of macromolecular crowding on protein folding and aggregation studied by density functional theory: statics. Phys. Rev. E. 66:031911-1–031911-9. [DOI] [PubMed] [Google Scholar]
- 51.Hall, D., and A. P. Minton. 2002. Effects of inert volume-excluding macromolecules on protein fiber formation. I. Equilibrium models. Biophys. Chem. 98:93–104. [DOI] [PubMed] [Google Scholar]
- 52.Hall, D., and A. P. Minton. 2003. Macromolecular crowding: qualitative and semiquantitative successes, quantitative challenges. Biochim. Biophys. Acta. 1649:127–139. [DOI] [PubMed] [Google Scholar]
- 53.Kinjo, A. R., and S. Takada. 2003. Competition between protein folding and aggregation with molecular chaperones in crowded solutions: insight from mesoscopic simulations. Biophys. J. 85:3521–3531. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Hall, D., and A. P. Minton. 2004. Effects of inert volume-excluding macromolecules on protein fiber formation. II. Kinetic models for nucleated fiber growth. Biophys. Chem. 107:299–316. [DOI] [PubMed] [Google Scholar]
- 55.Jang, H., C. K. Hall, and Y. Zhou. 2004. Assembly and kinetic folding pathways of a tetrameric β-sheet complex: molecular dynamics simulations on simplified off-lattice protein models. Biophys. J. 86:31–49. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Jang, H., C. K. Hall, and Y. Zhou. 2004. Thermodynamics and stability of a β-sheet complex: molecular dynamics simulations on simplified off-lattice protein models. Protein Sci. 13:40–53. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Nguyen, H. D., and C. K. Hall. 2004. Phase diagrams describing fibrillization by polyalanine peptides. Biophys. J. 87:4122–4134. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Nguyen, H. D., and C. K. Hall. 2004. Molecular dynamics simulations of spontaneous fibril formation by random-coil peptides. Proc. Natl. Acad. Sci. USA. 101:16180–16185. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Braun, F. N. 2002. Adhesion and liquid-liquid phase separation in globular protein solutions. J. Chem. Phys. 116:6826–6830. [Google Scholar]
- 60.Sear, R. P. 2004. Solution stability and variability in a simple model of globular proteins. J. Chem. Phys. 120:998–1005. [DOI] [PubMed] [Google Scholar]
- 61.Sochava, I. V., T. V. Belopolskaya, and O. I. Smirnova. 1985. DSC study of reversible and irreversible thermal denaturation of concentrated globular protein solutions. Biophys. Chem. 22:323–336. [DOI] [PubMed] [Google Scholar]
- 62.Cueto, M., M. J. Dorta, O. Munguia, and M. Llabres. 2003. New approach to stability assessment of protein solution formulations by differential scanning calorimetry. Int. J. Pharm. 252:159–166. [DOI] [PubMed] [Google Scholar]
- 63.Tomicki, P., R. L. Jackman, and D. W. Stanley. 1996. Thermal stability of metmyoglobin in a model system. Lebensm. Wiss. U. Technol. 29:547–551. [Google Scholar]
- 64.Shen, M., F. Davis, and A. Sali. 2005. The optimal size of a globular protein domain: a simple sphere-packing model. Chem. Phys. Lett. 405:224–228. [Google Scholar]
- 65.Tanford, C. 1962. Contribution of hydrophobic interactions to the stability of the globular conformation of proteins. J. Am. Chem. Soc. 84:4240–4247. [Google Scholar]
- 66.Dill, K. A., D. O. V. Alonso, and K. Hutchinson. 1989. Thermal stability of globular proteins. Biochemistry. 28:5439–5449. [DOI] [PubMed] [Google Scholar]
- 67.Dill, K. A. 1985. Theory for the folding and stability of globular proteins. Biochemistry. 24:1501–1509. [DOI] [PubMed] [Google Scholar]
- 68.Dill, K. A., and D. Shortle. 1991. Denatured states of proteins. Annu. Rev. Biochem. 60:795–825. [DOI] [PubMed] [Google Scholar]
- 69.Shortle, D., W. Stites, and A. Meeker. 1990. Contributions of large hydrophobic amino acids to the stability of staphylococcal nuclease. Biochemistry. 29:8033–8041. [DOI] [PubMed] [Google Scholar]
- 70.Shortle, D., H. S. Chan, and K. A. Dill. 1992. Modeling the effects of mutations on the denatured states of proteins. Protein Sci. 1:201–215. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Alonso, D. O. V., and K. A. Dill. 1991. Solvent denaturation and stabilization of globular proteins. Biochemistry. 20:5974–5985. [DOI] [PubMed] [Google Scholar]
- 72.Dill, K. A., and D. Stigter. 1995. Modeling protein stability as heteropolymer collapse. Adv. Protein Chem. 46:59–104. [DOI] [PubMed] [Google Scholar]
- 73.Petsev, D. N., X. Wu, O. Galkin, and P. G. Vekilov. 2003. Thermodynamic functions of concentrated protein solutions from phase equilibria. J. Phys. Chem. B. 107:3921–3926. [Google Scholar]
- 74.Shortle, D. 2002. The expanded denatured state: an ensemble of conformations trapped in a locally encoded topological space. Adv. Protein Chem. 62:1–23. [DOI] [PubMed] [Google Scholar]
- 75.Millet, I. S., S. Doniach, and K. W. Plaxco. 2002. Toward a taxonomy of the denatured state: small angle scattering studies of unfolded proteins. Adv. Protein Chem. 62:241–262. [DOI] [PubMed] [Google Scholar]
- 76.Vliegenthart, G. A., and H. N. W. Lekkerkerker. 2000. Predicting the gas-liquid critical point from the second virial coefficient. J. Chem. Phys. 112:5364–5369. [Google Scholar]
- 77.Tessier, P., A. M. Lenhoff, and S. I. Sandler. 2002. Rapid measurement of protein osmotic second virial coefficients by self-interaction chromatography. Biophys. J. 82:1620–1631. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.George, A., and W. W. Wilson. 1994. Predicting protein crystallization from dilute solution property. Acta Crystallogr. 50:361–365. [DOI] [PubMed] [Google Scholar]
- 79.Ruppert, S., S. I. Sandler, and A. M. Lenhoff. 2001. Correlation between the osmotic second virial coefficient and the solubility of proteins. Biotechnol. Prog. 17:182–187. [DOI] [PubMed] [Google Scholar]
- 80.Sear, R. P. 2002. Distribution of the second virial coeffcients of globular proteins. Europhys. Lett. 60:938–944. [Google Scholar]
- 81.Johnson, J. K., A. Z. Panagiotopoulos, and K. E. Gubbins. 1994. Reactive canonical Monte Carlo. Mol. Phys. 81:717–733. [Google Scholar]
- 82.Johnson, J. K. 1999. Reactive canonical Monte Carlo. Adv. Chem. Phys. 105:461–481. [Google Scholar]
- 83.Smith, W. R., and B. Triska. 1993. The reaction ensemble method for the computer simulation of chemical and phase equilibria. I. Theory and basic examples. J. Chem. Phys. 4:3019–3027. [Google Scholar]
- 84.Truskett, T. M., S. Torquato, and P. G. Debenedetti. 2000. Towards a quantification of disorder in materials: distinguishing equilibrium and glassy sphere packings. Phys. Rev. E. 62:993–1001. [DOI] [PubMed] [Google Scholar]
- 85.Torquato, S., T. M. Truskett, and P. G. Debenedetti. 2000. Is random close packing of sphere well defined? Phys. Rev. Lett. 84:2064–2067. [DOI] [PubMed] [Google Scholar]
- 86.Minton, A. P. 2005. Models for excluded volume interaction between an unfolded protein and rigid macromolecular co-solutes: macromolecular crowding and protein stability revisited. Biophys. J. 88:971–985. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 87.Young, L. R. D., K. A. Dill, and A. L. Fink. 1993. Aggregation and denaturation of apomyoglobin in aqueous urea solutions. Biochemistry. 32:3877–3886. [DOI] [PubMed] [Google Scholar]
- 88.Young, L. R. D., A. L. Fink, and K. A. Dill. 1993. Aggregation of globular proteins. Acc. Chem. Res. 26:614–620. [Google Scholar]
- 89.Krishnamurthy, R., and M. C. Manning. 2002. The stability factor: importance in formulation development. Curr. Pharm. Biotechnol. 3:361–371. [DOI] [PubMed] [Google Scholar]
- 90.Casares, S., M. Sadqi, O. Lopez-Mayorga, F. Conejero-Lara, and N. A. J. van Nuland. 2004. Detection and characterization of partially unfolded oligomers of the SH3 domain of α-spectrin. Biophys. J. 86:2403–2413. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Eggers, D. K., and J. S. Valentine. 2001. Molecular confinement influences protein structure and enhances thermal protein stability. Protein Sci. 10:250–261. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 92.Stigter, D., D. O. V. Alonso, and K. A. Dill. 1991. Protein stability: electrostatic and compact denatured states. Proc. Natl. Acad. Sci. USA. 88:4176–4180. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Alonso, D. O. V., K. A. Dill, and D. Stigter. 1991. The three states of globular proteins: acid denaturation. Biopolymers. 31:1631–1649. [DOI] [PubMed] [Google Scholar]

























