Abstract
The habitat in which proteins exert their function contains up to 400 g/L of macromolecules, most of which are proteins. The repercussions of this dense environment on protein behavior are often overlooked or addressed using synthetic agents such as poly(ethylene glycol), whose ability to mimic protein crowders has not been demonstrated. Here we performed a comprehensive atomistic molecular dynamic analysis of the effect of protein crowders on the structure and dynamics of three proteins, namely an intrinsically disordered protein (ACTR), a molten globule conformation (NCBD), and a one-fold structure (IRF-3) protein. We found that crowding does not stabilize the native compact structure, and, in fact, often prevents structural collapse. Poly(ethylene glycol) PEG500 failed to reproduce many aspects of the physiologically-relevant protein crowders, thus indicating its unsuitability to mimic the cell interior. Instead, the impact of protein crowding on the structure and dynamics of a protein depends on its degree of disorder and results from two competing effects: the excluded volume, which favors compact states, and quinary interactions, which favor extended conformers. Such a viscous environment slows down protein flexibility and restricts the conformational landscape, often biasing it towards bioactive conformations but hindering biologically relevant protein-protein contacts. Overall, the protein crowders used here act as unspecific chaperons that modulate the protein conformational space, thus having relevant consequences for disordered proteins.
Author Summary
Most in vitro and in silico biophysical experiments generally study proteins in an isolated environment, overlooking that their natural environment—the cell cytoplasm—is a solution that is highly populated by proteins. To address this knowledge gap, here we explored how a crowded environment alters the conformational sampling of three proteins, each with a different degree of disorder and flexibility. We simulated a crowded system composed by the three proteins and reaching a cell-like concentration and compared the protein behavior observed with that induced by PEG500, a synthetic crowding agent. Despite some similarities between the environments, protein crowders showed a number of characteristics that raise concerns about the use of diluted solutions or synthetic agents when studying protein behavior.
Introduction
Most in vitro and in silico biophysical experiments treat proteins as highly purified entities that act in isolation, overlooking their natural “habitat”, namely the cell cytoplasm. This “habitat” contains between 80 to 400 g/L of several other macromolecules, which together account for 5%-30% of volume occupancy [1]. Among the effects that a crowded environment exerts on protein behavior, volume exclusion is considered the most relevant [2]. Accordingly, crowders behave as inert molecules that do not interact with proteins, and their presence limits accessible space to proteins, thereby reducing the conformational entropy and favoring compact folded forms of the latter [3]. Following this view, most experimental studies on proteins in dense environments have been performed by adding large polymers, such as poly(ethylene glycol) (PEG), Dextran or Ficoll, to the media. These polymers, often referred to as “inert” crowders, are assumed to exclusively mimic the volume-exclusion effect [4]. However, recent experiments show that “inert crowders” exert a complex variety of effects on protein stability, and results largely dependent on the type and size of the crowder involved [3,5,6]. For example, calorimetric analysis concludes that Dextran, glucose and PEG lead to an enthalpic stabilization and an entropic destabilization of the protein; the latter predominant only in presence of PEG [7]. Indeed this synthetic compound appears to be less “inert” than expected due to attractive interactions with proteins, questioning its effectiveness in recreating a pure volume-exclusion effect. Despite so, PEG continues to be used as a reference agent to model macromolecular crowding [8,9]. Regarding the size, intuitively, the volume excluded by inert crowding agents is proportional to the crowder size and consequently small crowders might even help unfolding [5]. Recent studies in cell-like environments have further challenged such a model, suggesting that compacted conformations of proteins may not always be favored in physiological crowded environments [9–15].
Available data suggest that protein crowders have a dual nature. On the one hand, they display the classical volume-exclusion effect and, on the other, they have the ability to form weak and transient (quinary) soft interactions with solute protein [9,11,12]. These effects generates competition between destabilizing and stabilizing forces, the final result of which is difficult to predict [11,13,14]. To further complicate the scenario, crowding might also affect the folding landscape, leading to alternative states not present in dilute solutions and affecting protein functionality [15]. This distortion of the conformational landscape might have a dramatic impact on highly dynamic proteins, such as intrinsically disordered (IDPs) and molten globule proteins (MGPs) [16]. Unfortunately, most crowding studies performed with these proteins have used synthetic polymers and often report only the expected increase in the compactness of the structure [17–21]. Research into IDPs or MGPs in cell-like crowded environments is more rare and provides unclear conclusions [10], [18], [28–33].
A consensus theory—based on experimental data—on the nature of crowding is impeded by the intrinsic limitations of studying highly dynamic systems in which single molecule information is lost within the experimentally detected structural ensemble [20–26]. Theoretical calculations, particularly molecular dynamics (MD), give direct access to atomic information on single molecules in carefully controlled environments, and they are therefore the perfect complement to experimental ensemble-based techniques when addressing crowding effects [21], [35–38]. Here we took advantage of the power of MD simulations to explore the impact of small-sized synthetic (PEG500) and protein crowders (proteins) on the structure, dynamics and interactions of the following three proteins: i) an intrinsically ordered protein (IOP), the 191-residue interferon regulatory transcription factor (IRF-3); ii) a molten-globule conformation (MGP), the 51-residue nuclear coactivator-binding domain of CREB (NCBD); and iii) an intrinsically disordered protein (IDP), the 47-residue activator for thyroid hormone and retinoid receptor (ACTR). These three proteins not only model the three major types of protein conformational landscapes, but also define a specific biological network, with NCBD as the central partner (the hub), able to transiently interact with IRF-3 and ACTR, thanks to its structural promiscuity [39–42]. This is the first study to present calculations of the effect of crowding on proteins of distinct structural complexity that define a biologically relevant crowded microenvironment.
Results
We performed microseconds-long MD simulations of five crowded systems, each composed by eight conformations of the three protein types (6 NCBD, 1 ACTR and 1 IRF-3) at increasing concentrations [from 175 to 300 g/L] (Fig 1). Each conformation was individually simulated in solution with the synthetic crowder PEG500 and in water. The latter condition was used as a control of the behavior of proteins within the selected simulation protocol.
Control simulations in water
Trajectories in pure water (S1 Fig and Fig 2) showed the expected behavior for the proteins under study. Thus, the intrinsically ordered protein (IOP: IRF3) was stable during the entire trajectory, maintaining the pattern of secondary structure, fold and shape. Native contacts were well preserved, with sizeable movements localized only at the C-terminal helix, in a region with interface contacts in the crystal. A small, but detectable, tightening of the hydrophobic core of the protein occurred. The intrinsically disordered protein (IDP: ACTR) was extremely mobile in water, sampling a wide repertoire of conformations. In this regard, clustering analysis detected more than 250 distinct conformers (most of them compact; see S1 Fig), none of which populated more than 5.5% of the trajectory. The contact map was fuzzy (compare with IRF3 in S1 Fig and Fig 2), suggesting the absence of remote long-lasting contacts, thus hindering the formation of stable folds. Some segments of ACTR tended to form a secondary structure, especially an α-helix at the N-terminal—an observation that is consistent with the results from NMR experiments [32,33]. However, these helical elements were unstable and fuzzy, with local populations rarely above 50% and undefined boundaries, making them unable to nucleate the global structure. Finally, the molten globule protein (MGP: NCBD) showed slow diffusion along the conformational space, with strong memory effects in the trajectories [30,34–36]. When the NCBD trajectory started from the “folded” conformation, significant plasticity was observed (around 100 structural clusters). This plasticity is attributed to the distinct orientation of the three helical motifs (h1, h2 and h3, see below and S3 Fig), which generate a fuzzy contact map with helical arrangements of the prevailing ACTR-binding form, while the helical arrangements required for IRF-3 recognition were rare. When the NCBD starting conformation was “unfolded”, it rapidly collapsed into an amorphous globule; the protein formed many remote and unstable contacts (282 structural clusters), and only small nascent elements of secondary structure (particularly in h1 and h2) were observed. NCBD appears to be a protein that was not evolutionarily designed to collapse into a single well-defined minimum. We conclude that control simulations provide a reasonable picture of the conformational landscape of the three proteins representing IOPs, IDPs and MGPs in water. We can therefore confidently use the same force-field and simulation protocol to explore crowded environments.
Synthetic vs. protein crowders
As described above, most theoretical and experimental studies on crowding have been performed using polymers (as PEG500) as co-solvents, which theoretically act as “inert” crowders mimicking cellular crowding. However, whether polymers such as PEG500 are truly “inert” crowders and whether they correctly mimic the crowded environment in the cell remain to be confirmed. In order to answer these two questions, we compared the trajectories of the three model proteins in water, and in PEG500-crowding and protein-crowding conditions (using similar crowder concentrations in both cases) (Fig 2).
For IOP (IRF3), the effect of crowding was modest, and neither proteins nor PEG500 induced large changes in the local or global structure of this well-structured protein. Crowding stabilized the secondary structure, including the C-terminal helix, which was fragile in the simulations in water. When compared to water, both types of crowders produced an increase in the size of the protein (see Fig 2 for radius of gyration, and S2 Fig for solvent-accessible surface). This observation is not consistent with the “exclude volume” theory. Only protein crowders were observed to decrease the relative ratio of polar solvent-accessible surface, thereby suggesting that they attenuate the hydrophobic effect compared to water, the latter environment showing a more visible collapse of the core (cartoons in Fig 2 and S2 Fig). Interestingly, the crystal structure of IRF-3 was more similar to the conformations sampled in a crowded environment (especially in the protein media) than to those in dilute aqueous conditions. These findings thus suggest that crystals can, in some cases, mimic physiological conditions better than water.
For IDP (ACTR), crowding agents had a huge impact on the conformational landscape, (Fig 2); however, we were unable to find a pattern of general “crowding” effects, since the changes induced by PEG500 differed from those induced by a protein environment. Thus, PEG500 generated a large expansion of the sampled conformational space, which became dominated by extended conformers showing only a moderate amount of secondary structure. In contrast, protein crowders reduced the conformational space sampled, which was now dominated by relatively compact conformations, with well-defined α-helices localized in those regions required for NCBD binding [37,38]. These results demonstrate the inability of PEG500 to reproduce physiological-like crowded conditions around IDPs and suggest that protein crowding might contribute to IDP folding in the bioactive conformation.
For MGP (NCBD), the behavior of crowders largely depended on the starting conformation, mirroring the “memory effects” detected in the simulations in water and reinforcing the idea that NCBD (and probably other MGPs) moves across a complex and frustrated conformational landscape. In the trajectories starting from folded NCBD, crowders favored more extended conformations than those sampled in water, introducing significant changes in the fuzzy pattern of long-range contacts (Fig 2). The helical fragments were often arranged in the bioactive conformations, sometimes closer to the IRF-3-bound state, which has never been sampled in water (S3 Fig). The bias towards the bioactive state was especially visible for protein crowding, where collected ensembles were on average 0.34 nm closer to the bioactive conformation found in the NCBD-IRF-3 complex than those sampled in water. The effect of crowders was even more dramatic (and complex) for NCBD trajectories starting from an unfolded state. Both PEG500 and protein crowders hindered the hydrophobic collapse observed in water, thus favoring extended conformations (Fig 2) in which native helices—which were hardly distinguishable in water—showed significant populations and well-defined boundaries (especially for helix 1). These observations again support the notion that crowding might help disordered proteins to adopt bioactive conformations. When analyzed in detail, the effects of synthetic (PEG500) and protein (protein) crowding differed significantly (Fig 2), thus again raising concerns about the use of small-sized PEG as a model of physiological crowding.
Overall, compared to water, both synthetic and protein crowders favored open and moderately extended conformations with higher secondary structure content. These results are difficult to explain on the basis of the “excluded volume” hypothesis. The breakdown of the energies between each protein and its surroundings reveals that in presence of both protein and PEG500 the Van der Waals term (Lennard-Jones) increase its weight compared to dilute solutions, at the expense of Coulomb interactions (see S2 Table). However, the percentage of the vdW term in PEG500 is smaller but comparable to the one in protein crowding (~ 21.5% and ~ 19.5% respectively in protein crowding and in PEG500 for NCBD, ~ 16% and ~ 12.5% for ACTR, and ~ 13% and ~ 12.5% for IRF-3), confirming that PEG500 is a non-inert crowder. In general crowding behaves as an unexpected partner, favoring protein binding through the conformational selection paradigm and acting as a chaperon that modulates the conformational space of non-ordered proteins.
Concentration effects on crowding
The analysis of 5 independent trajectories obtained at protein concentrations from 175 to 296 g/L showed that the conformational landscape of the proteins was relatively robust to moderate changes in the concentration of the protein environment. However, detailed analysis revealed some subtle, but systematic, concentration-dependent changes (see Fig 3, and S4–S6 Figs). For example, a low concentration of protein crowders favored extended conformations, while increasing concentrations favored more collapsed structures (Fig 3). This observation suggests that the effect of protein crowding results from the combination of two opposing contributions: i) soft protein-protein interactions, which favor the exposure of protein moieties and the prevalence of extended conformations; and ii) the “excluded volume” effect, which favors collapsed structures. At low and moderate protein concentrations, the first effect dominates; however, as the number of possible protein-protein contacts is satisfied, the “excluded volume” effect gains relevance, leading to more collapsed structures. The navigation of proteins above their energy landscape can then be fine-tuned by modifying the protein concentration in different cell compartments, thereby creating an additional layer of regulation of protein structure and function.
Protein quinary contacts and crowding
The results above strongly suggest that soft protein-protein interactions are responsible for the crowding effect generated by a dense protein environment. A key question is whether these contacts correspond to unspecific transient (quinary) or specific interactions, the latter could not be bona fide annotated as crowding. To study this point, we compared the 3 replicas of NCBD (both for the folded and unfolded ones), where NCBD has different protein neighbors. If specific protein-protein interactions play a major role in modulating protein behavior, we can expect the 3 replicas to show distinct behaviors. This was not found to be the case (S3 Table, S4 and S7 Figs); specific interactions can therefore be ruled out as a major guide of the simulations. To further confirm this point, we performed additional trajectories with a 4x larger simulation box (4X CROW; 182 g/L protein concentration), which provided us with several replicas of the different proteins. Again, no remarkable differences were found between the sampling obtained here and the one in smaller simulation boxes (Fig 3, S5 and S8 Figs and Fig 4). Interestingly, the only remarkable exception was one of the copies of ACTR with an N-terminal exposed to a region of low protein density (labeled as A1, in red in Fig 4). There, the lack of protein-protein contacts caused an immediate response (within 100 ns) in ACTR, which underwent structural rearrangements (loss of helicity in the N-tal). These were not achieved when ACTR was surrounded by proteins. In summary, unspecific rather than specific protein-protein contacts appear as a major determinant of the effect of protein crowding.
The crowding shown here had a higher presence of disordered proteins, which were generally characterized by a higher content of charged residues. However, we did not find any significant enrichment in the type of residues located at the contact regions or any dramatic concentration-dependent changes in the inter-protein contacts (Fig 5). Intriguingly, the number of protein-protein interactions and the preference for protein vs. water contacts rose as the intrinsic disorder of the protein increased (Fig 6 and S3 Table). This observation explains why crowding effects are especially dramatic in disordered proteins. In summary, we conclude that our simulations reproduce bona-fide “crowding effects”, which are not contaminated by specific interactions that might occur in a biologically relevant cluster (IRF-3, ACTR and NCBD).
Promiscuity and frustration
As described above, the presence of a protein environment helps the protein adopt conformations that more closely resemble bioactive ones; however, it also generates contact frustration, as the prevalence of non-specific quinary contacts hinders specific partner recognition. This frustration becomes evident by analyzing the interactions between NCBD (a total of 40 trajectories of NCBD were collected) and its partners (ACTR and IRF-3). All the crowded systems failed to reproduce the contacts observed in the experimentally solved complexes (S9 Fig). When binary complexes (NCBD:ACTR and NCBD:IRF-3) were placed in water, they rapidly adjusted to form new contacts. Remarkably, each complex rebuilt a similar pattern of contacts in most of the simulated copies in water (8 out of 10 for ACTR / F1 and 7 for ACTR / U2, see S10 Fig), thereby suggesting that these intrinsically favored contacts are frustrated in crowded conditions as a result of the presence of many competing interaction partners. Overall, the crowded box appeared as a stagnant system, where contact promiscuity generated a frustrated pattern of interaction that hindered the formation of bioactive conformations.
The impact of crowding on dynamics
Protein crowding limited the accessible configurational space both globally (as noted in the number of recognized clusters) and locally (as noted by the fuzziness of the intra-protein contacts) (Fig 2 and S2 Table). The greater the “in-water” intrinsic disorder of a protein, the larger the effect of protein crowding in slowing down protein dynamics (see Fig 6 and for PEG500, where this effect was not observed, S11 Fig), Despite so, the protein underwent frequent but small oscillations that did not produce major conformational changes. As expected, the presence of protein crowders led to a significant increase in viscosity, which was reflected in the reduction of atomic movements. For example, the diffusion of water molecules was slowed down by ~ 25% from a pure aqueous environment (Table 1) [27], while global protein diffusion was reduced to 1/10 of the original value and within the same range as that reported in other studies (below 10X) [40–43]. Note that diffusion values in crowded environments should not be considered as quantitative predictions [35],[44], [51]; however, given that our results confirm those of other studies, we are confident that they still provide a valid qualitative insight. Indeed the impact on diffusion rates and related binding kinetics [8,43] is known to affect the basic functionality of the protein [40].
Table 1. Diffusion coefficient of water molecules and proteins in a crowded environment and in water.
Diffusion Coefficient [μm2 / s] | |||
---|---|---|---|
Water | Proteins | ||
175 g/L | 4900 | (120) | 11 |
192 g/L | 4847 | (199) | 10 |
239 g/L | 4291 | (91) | 7 |
273 g/L | 4219 | (162) | 8 |
296 g/L | 4041 | (123) | 11 |
WATER | 5108 | (226) | 86 |
Discussion
Our study provides a comprehensive picture of the impact of crowding on the conformational space of three proteins with different structural levels: well-structured, intrinsically disordered, and molten globule. Compared to dilute solution, both small-sized synthetic (PEG500) and protein crowders favored open and moderately extended conformations with higher secondary structure content. However, proteins in PEG500 experience a larger increase in conformational entropy, confirming observation from calorimetric analysis in presence of PEG molecules with similar dimension [7]. Overall we join the concerns regarding its employment in macromolecular crowding: detailed analysis showed that, in general, there were few similarities between the effect of PEG500 and protein crowders. This finding thus argues against the generalized use of PEG of low molecular weight to simulate crowding in physiological environments.
Our results suggest that as previously suggested in previous studies [7,11–14] the protein crowding represented here is a battlefield between two opposing forces, namely the soft protein-protein interactions and the “excluded volume” effects; the outcome depending on the concentration and type of crowder involved. Interestingly, the impact of such protein crowding strongly depends on the intrinsic structural level of proteins (in water). We found that crowding leaves the overall structure of folded proteins (such as IRF-3) almost unaffected but reduces their collapse into the hydrophobic core. This observation would explain the small expansion of the protein and the agreement with the available structures from crystals compared to pure water simulations. The impact of crowding on non-structured proteins was found to be more dramatic and complex, and it typically led to a gain in structure, bringing it closer to the ensemble of bioactive conformations and therefore favoring the conformational selection paradigm of protein binding.
Protein crowders (but not PEG500) limit the exploration of new intra- and inter-protein contacts, leading to a global decrease in conformational entropy, balanced by the enthalpic stabilization cause by quinary contacts, in good agreement again with some of the existing models of crowding [7,11–14]. The prevalence of these soft, transient and non-specific protein-protein contacts slows down solvent diffusion, as well as the global and local dynamics of proteins, thus producing frustration in native contacts, which may slow down functional flexible proteins. Crowding favors bioactive conformations, which may facilitate conformational selection processes, but on the other hand hinders the formation of functional contacts, showing then a dual effect whose impact in functionality is difficult to predict. Although we were unable to detect any bias in the type of contacts formed in the crowded box, we cannot exclude bias in the results caused by the high presence of highly charged disordered proteins. Further studies might address the differential impact as crowders for disordered and ordered proteins and discard eventual IDP-driven artifacts, as these proteins have a unique sequence composition that keep them unfolded under physiological conditions, which might bias their crowding properties. Additional work is also required to evaluate the ability of longer “inert” polymers to simulate protein crowding, as PEG500, which is very convenient to perform MD simulations shows a molecular volume much smaller than that of average proteins. We could expect that longer polymers might simulate better the crowding properties of proteins.
The cell interior differs from an aqueous dilute environment. However, it is far from a “bag full of molecules”and is possibly organized into compartments in which proteins are stabilized or destabilized in response to the specific surrounding environment [45], thereby creating an unexpected extra level of regulation of protein functionality, especially in the case of the ultra-sensitive IDPs, whose structure and dynamics can differ depending on the cellular context. From this perspective, crowding can be regarded as a collective chaperon that modulates protein conformational space.
Methods
Overview of the crowded models
We mixed NCBD, ACTR, and IRF-3 to obtain five dense (175, 192, 239, 273 and 296 g (of protein)/mL: protein volume fraction 20–30%) protein solutions. A stoichiometry of 6:1:1 (NCBD, ACTR and IRF-3) was used to better reproduce the central protein of the system: NCBD, for which we considered 6 starting conformations (one per copy), three of them taken from a NMR ensemble (PDB: 2KKJ) and corresponding to “folded” states (F1-F3 in the remaining), while the other three were taken from a 50-ns MD simulation at T = 500K, corresponding to a fully “unfolded” protein (U1-U3 in the remaining). The starting conformations for ACTR and IRF-3 were taken from the Protein Data Bank (PDB entry IDs 1KBH and 1ZOQ respectively) without the bounded partner. To remove any bias from the simulations, the starting positions and orientations of the proteins in the simulation boxes were random (see below) and the distance between conformations was increasingly reduced to reach more dense environments. Water molecules were added to fill the box size, calculated with a decreasing distance from the proteins (from 0.5 to 0.1 nm) and the final density was then calculated considering the proportion between water molecules and proteins. See Fig 1 for a map of the simulations performed. All these simulations were extended for at least 3 μs of unbiased dynamics.
Control simulations
Control simulations at a comparable timescale were performed in two environments: eight simulations (1 for ACTR, 6 for NCBD, and 1 for IRF-3) in pure water boxes; and eight additional simulations in a water:PEG500 mixture (200 g/L PEG500 concentration). In order to check for potential biases produced by the finite size of the simulation box and the use of a given set of relative orientations of the proteins, we performed one additional simulation with a ~4-times larger box containing 24 NCBD, 4 IRF-3 and 4 ACTR proteins. This huge system (~277,000 atoms at 182 g/L of concentration) was simulated for 100 ns, allowing us to collect information on each protein copy in many different surroundings.
To address the interaction of NCBD and its partners in a crowded environment, we extracted protein pairs formed by either a folded or an unfolded conformation of NCBD (F1 and U2 with ACTR, F3 and U3 with IRF-3) from the crowding simulation at 273 g/L and used them as starting seeds for multiple simulations in pure water and crowded conditions (273 g/L). For each of the four systems, 10 simulations of 10 ns were performed (reaching a total of 400 ns in water and in protein crowding respectively). These short times allowed us to exclusively study the fast relaxation of the potentially frustrated protein-protein contacts.
Simulation set-up
All starting structures were titrated, neutralized with monovalent ions, hydrated, minimized, thermalized, and pre-equilibrated using our standard procedure implemented in the MD-Web server [46]. In the case of PEG500 systems, proteins were immersed in a pre-equilibrated box of water/PEG molecules at a concentration of 200 g/L (starting PEG500 conformation PDB ID 4APO); the resulting systems were then pre-equilibrated by relaxing solvent for 10 ns prior to the general MD-Web equilibration procedure [46]. Unless otherwise stated, all the trajectories were collected with Gromacs 4.5 [47] using a time step of 2 fs in the isothermal (300 K) and isobaric (1atm) ensemble with Nose–Hoover thermostat and Berendsen barostat [48–50]. We applied periodic boundary conditions and particle Mesh Ewald corrections [51] for the representation of long-range electrostatic effects with a grid spacing of 1.0 nm and a cut-off of 1.0 nm for Lennard-Jones interactions. Constraints on chemical bonds were solved by the SHAKE algorithm [52]. The Parm99-SB-ILDN force field was used for proteins [53], TIP3P for water molecules [54], and modified TraPPE-UA parameters described by Fischer and colleagues for PEG molecules [55].
Analysis
Gromacs standard routines and analysis tools in MD-Web [46] were used to analyze the trajectories, with a minimum resolution of 20 ps. We evaluated overall protein compactness using the radius of gyration (Rgyr), the deviation from a reference structure with the root mean square deviation (RMSD), the exposed surface to the outside with the solvent-accessible surface area (SASA), and the movements of each residues with the root means square fluctuations (RMSF). The secondary structure was evaluated by STRIDE [56], while VMD was used to visualize molecules and to analyze contacts [57]. The Coulomb and Lennard-Jones energy terms were calculated using GROMACS energy groups for each protein against the rest of the system. Inter- and intra-protein contacts were defined by a cutoff of 0.8 nm between alpha Carbons (Cα). Intra-protein contacts were defined as “explored” when they were found in more than five frames. Conformations recurrently sampled were detected by a two-step clustering of backbone atoms using the standard GROMOS algorithm [58]: first we reduced the total number of conformations in each trajectory with a cutoff of 0.15 nm, and then, for each protein, the reduced ensembles in WAT, PEG500 and CROW were collected together and subjected to a second clustering with a cutoff of 0.35 nm. Following Knott-Best [34], the relative orientation of the helices of NCBD was calculated by the relative elevation and azimuth between the helix vectors, defined with the axis formed by the Cα atoms in the helix of the PDB structure. The translational mean square displacements (MSD) of the center of mass of molecules were calculated to gain information on intermolecular movements (time windows of 10 and 25 ns were used for water and proteins respectively). Self-diffusion coefficients were determined using the Einstein relation, as described elsewhere, and periodic box corrections were applied [59]. Conformational entropies were approximated at the quasi-harmonic level using the last 1 μs of the simulations [60]. Finally, to detect conformational changes, we clustered the all-atom trajectory using the GROMOS algorithm [58] with a cutoff of 0.15 nm (0.1 nm for IRF-3), labeling any change in the cluster as a large conformational change.
Supporting Information
Data Availability
All relevant data are within the paper and its Supporting Information files.
Funding Statement
This work has been supported by the Spanish Ministry of Science MINECO (BIO2015-64802-R), the Ministry of Economy and Competitiveness, the European Fund for the Regional Development, the Catalan SGR, the Instituto Nacional de Bioinformática, and the European H2020 program (BioExcel project). MO is an ICREA Academia Fellow. IRB Barcelona is the recipient of a Severo Ochoa Award of Excellence from MINECO (Government of Spain). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Zimmerman SB, Minton AP. Macromolecular crowding: biochemical, biophysical, and physiological consequences. Annu Rev Biophys Biomol Struct. 1993;22:27–65. [DOI] [PubMed] [Google Scholar]
- 2.Christiansen A, Wang Q, Cheung MS, Wittung-Stafshede P. Effects of macromolecular crowding agents on protein folding in vitro and in silico. Biophys Rev. 2013;5(2):137–45. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Elcock AH. Models of macromolecular crowding effects and the need for quantitative comparisons with experiment. Curr Opin Struct Biol. 2010;20(2):196–206. 10.1016/j.sbi.2010.01.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Singh LR, Mittal S. Denatured State Structural Property Determines Protein Stabilization by Macromolecular Crowding: A Thermodynamic and Structural Approach. PLoS ONE. 2013. November 12;8(11):e78936 10.1371/journal.pone.0078936 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Zhou H-X, Rivas G, Minton AP. Macromolecular crowding and confinement: biochemical, biophysical, and potential physiological consequences. Annu Rev Biophys. 2008;37(1):375–97. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Batra J, Xu K, Zhou H-X. Nonadditive effects of mixed crowding on protein stability. Proteins. 2009. October;77(1):133–8. 10.1002/prot.22425 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Senske M, Törk L, Born B, Havenith M, Herrmann C, Ebbinghaus S. Protein Stabilization by Macromolecular Crowding through Enthalpy Rather Than Entropy. J Am Chem Soc. 2014. June 25;136(25):9036–41. 10.1021/ja503205y [DOI] [PubMed] [Google Scholar]
- 8.Kuznetsova IM, Zaslavsky BY, Breydo L, Turoverov KK, Uversky VN. Beyond the excluded volume effects: mechanistic complexity of the crowded milieu. Mol Basel Switz. 2015. January;20(1):1377–409. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Smith AE, Zhang Z, Pielak GJ, Li C. NMR studies of protein folding and binding in cells and cell-like environments. Curr Opin Struct Biol. 2015. February;30:7–16. 10.1016/j.sbi.2014.10.004 [DOI] [PubMed] [Google Scholar]
- 10.Politou A, Temussi PA. Revisiting a dogma: the effect of volume exclusion in molecular crowding. Curr Opin Struct Biol. 2015. February;30:1–6. 10.1016/j.sbi.2014.10.005 [DOI] [PubMed] [Google Scholar]
- 11.Monteith WB, Cohen RD, Smith AE, Guzman-Cisneros E, Pielak GJ. Quinary structure modulates protein stability in cells. Proc Natl Acad Sci. 2015. February 10;112(6):1739–42. 10.1073/pnas.1417415112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Wang Y, Sarkar M, Smith AE, Krois AS, Pielak GJ. Macromolecular crowding and protein stability. J Am Chem Soc. 2012;134:16614–8. 10.1021/ja305300m [DOI] [PubMed] [Google Scholar]
- 13.Monteith WB, Pielak GJ. Residue level quantification of protein stability in living cells. Proc Natl Acad Sci U S A. 2014. August;111(31):11335–40. 10.1073/pnas.1406845111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Sarkar M, Smith AE, Pielak GJ. Impact of reconstituted cytosol on protein stability. Proc Natl Acad Sci U S A. 2013. November 26;110(48):19342–7. 10.1073/pnas.1312678110 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Harada R, Tochio N, Kigawa T, Sugita Y, Feig M. Reduced native state stability in crowded cellular environment due to protein-protein interactions. J Am Chem Soc. 2013;135:3696–701. 10.1021/ja3126992 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Uversky VN. A decade and a half of protein intrinsic disorder: biology still waits for physics. Protein Sci Publ Protein Soc. 2013;22:693–724. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Soranno A, Koenig I, Borgia MB, Hofmann H, Zosel F, Nettels D, et al. Single-molecule spectroscopy reveals polymer effects of disordered proteins in crowded environments. Proc Natl Acad Sci U S A. 2014. April 1;111(13):4874–9. 10.1073/pnas.1322611111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Roque A, Ponte I, Suau P. Macromolecular crowding induces a molten globule state in the C-terminal domain of histone H1. Biophys J. 2007. September 15;93(6):2170–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Hong J, Gierasch LM. Macromolecular crowding remodels the energy landscape of a protein by favoring a more compact unfolded state. J Am Chem Soc. 2010. August 4;132(30):10445–52. 10.1021/ja103166y [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Johansen D, Jeffries CMJ, Hammouda B, Trewhella J, Goldenberg DP. Effects of macromolecular crowding on an intrinsically disordered protein characterized by small-angle neutron scattering with contrast matching. Biophys J. 2011;100:1120–8. 10.1016/j.bpj.2011.01.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Qin S, Zhou H-X. Effects of Macromolecular Crowding on the Conformational Ensembles of Disordered Proteins. J Phys Chem Lett. 2013. October 17;4(20). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Goldenberg DP, Argyle B. Minimal effects of macromolecular crowding on an intrinsically disordered protein: a small-angle neutron scattering study. Biophys J. 2014. February 18;106(4):905–14. 10.1016/j.bpj.2013.12.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Szasz CS, Alexa A, Toth K, Rakacs M, Langowski J, Tompa P. Protein disorder prevails under crowded conditions. Biochemistry (Mosc). 2011. July 5;50(26):5834–44. [DOI] [PubMed] [Google Scholar]
- 24.Sotomayor-Pérez A-C, Subrini O, Hessel A, Ladant D, Chenal A. Molecular Crowding Stabilizes Both the Intrinsically Disordered Calcium-Free State and the Folded Calcium-Bound State of a Repeat in Toxin (RTX) Protein. J Am Chem Soc. 2013;135:11929–34. 10.1021/ja404790f [DOI] [PubMed] [Google Scholar]
- 25.Waudby CA, Camilloni C, Fitzpatrick AWP, Cabrita LD, Dobson CM, Vendruscolo M, et al. In-cell NMR characterization of the secondary structure populations of a disordered conformation of α-synuclein within E. coli cells. PloS One. 2013;8(8):e72286 10.1371/journal.pone.0072286 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Schuler B, Hofmann H. Single-molecule spectroscopy of protein folding dynamics—expanding scope and timescales. Curr Opin Struct Biol. Elsevier Ltd; 2013;23(1):36–47. [DOI] [PubMed] [Google Scholar]
- 27.Harada R, Sugita Y, Feig M. Protein Crowding Affects Hydration Structure and Dynamics. J Am Chem Soc. 2012. March 14;134(10):4842–9. 10.1021/ja211115q [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Predeus AV, Gul S, Gopal SM, Feig M. Conformational Sampling of Peptides in the Presence of Protein Crowders from AA/CG-Multiscale Simulations. J Phys Chem B. 2012. July 26;116(29):8610–20. 10.1021/jp300129u [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.McCully ME, Beck DAC, Daggett V. Multimolecule test-tube simulations of protein unfolding and aggregation. Proc Natl Acad Sci U S A. 2012. October 30;109(44):17851–6. 10.1073/pnas.1201809109 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Naganathan AN, Orozco M. The native ensemble and folding of a protein molten-globule: functional consequence of downhill folding. J Am Chem Soc. 2011;133:12154–61. 10.1021/ja204053n [DOI] [PubMed] [Google Scholar]
- 31.Kjaergaard M, Teilum K, Poulsen FM. Conformational selection in the molten globule state of the nuclear coactivator binding domain of CBP. Proc Natl Acad Sci. 2010. July 13;107(28):12535–40. 10.1073/pnas.1001693107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Kjaergaard M, Nø rholm A-B, Hendus-Altenburger R, Pedersen SF, Poulsen FM, Kragelund BB. Temperature-dependent structural changes in intrinsically disordered proteins: formation of alpha-helices or loss of polyproline II? Protein Sci Publ Protein Soc. 2010;19:1555–64. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Iešmantavičius V, bing Jensen MR, Ozenne V, Blackledge M, Poulsen FM, Kjaergaard M. Modulation of the Intrinsic Helix Propensity of an Intrinsically Disordered Protein Reveals Long-Range Helix–Helix Interactions. J Am Chem Soc. 2013;135(27):10155–63. 10.1021/ja4045532 [DOI] [PubMed] [Google Scholar]
- 34.Knott M, Best RB. A Preformed Binding Interface in the Unbound Ensemble of an Intrinsically Disordered Protein: Evidence from Molecular Simulations. PLoS Comput Biol. 2012. July 19;8(7):e1002605 10.1371/journal.pcbi.1002605 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Kjaergaard M, Andersen L, Nielsen LD, Teilum K. A folded excited state of ligand-free nuclear coactivator binding domain (NCBD) underlies plasticity in ligand recognition. Biochemistry (Mosc). 2013. March 12;52(10):1686–93. [DOI] [PubMed] [Google Scholar]
- 36.Kjaergaard M, Poulsen FM, Teilum K. Is a Malleable Protein Necessarily Highly Dynamic? The Hydrophobic Core of the Nuclear Coactivator Binding Domain Is Well Ordered. Biophys J. 2012. April 4;102(7):1627–35. 10.1016/j.bpj.2012.02.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Iešmantavičius V, Dogan J, Jemth P, Teilum K, Kjaergaard M. Helical Propensity in an Intrinsically Disordered Protein Accelerates Ligand Binding. Angew Chem Int Ed Engl. 2014;1–5. [DOI] [PubMed] [Google Scholar]
- 38.Dogan J, Mu X, Engström Å, Jemth P. The transition state structure for coupled binding and folding of disordered protein domains. Sci Rep [Internet]. 2013. June 25 [cited 2015 Jun 20];3 Available from: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3691887/ [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Romero P, Obradovic Z, Li X, Garner EC, Brown CJ, Dunker AK. Sequence complexity of disordered protein. Proteins. 2001. January 1;42(1):38–48. [DOI] [PubMed] [Google Scholar]
- 40.Wang Y, Li C, Pielak GJ. Effects of proteins on protein diffusion. J Am Chem Soc. 2010. July 14;132(27):9392–7. 10.1021/ja102296k [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.McGuffee SR, Elcock AH. Diffusion, Crowding & Protein Stability in a Dynamic Molecular Model of the Bacterial Cytoplasm. PLoS Comput Biol. 2010. March 5;6(3):e1000694 10.1371/journal.pcbi.1000694 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Feig M, Sugita Y. Variable Interactions between Protein Crowders and Biomolecular Solutes are Important in Understanding Cellular Crowding. J Phys Chem B. 2012. January 12;116(1):599–605. 10.1021/jp209302e [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Dix JA, Verkman AS. Crowding effects on diffusion in solutions and cells. Annu Rev Biophys. 2008;37:247–63. 10.1146/annurev.biophys.37.032807.125824 [DOI] [PubMed] [Google Scholar]
- 44.Andrews CT, Elcock AH. Molecular Dynamics Simulations of Highly Crowded Amino Acid Solutions: Comparisons of Eight Different Force Field Combinations with Experiment and with Each Other. J Chem Theory Comput. 2013. October 8;9(10):4585–602. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Gierasch LM, Gershenson A. Post-reductionist protein science, or putting Humpty Dumpty back together again. Nat Chem Biol. 2009. November;5(11):774–7. 10.1038/nchembio.241 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Hospital A, Andrio P, Fenollosa C, Cicin-Sain D, Orozco M, Gelpí JL. MDWeb and MDMoby: an integrated web-based platform for molecular dynamics simulations. Bioinformatics. 2012. May 1;28(9):1278–9. 10.1093/bioinformatics/bts139 [DOI] [PubMed] [Google Scholar]
- 47.Pronk S, Páll S, Schulz R, Larsson P, Bjelkmar P, Apostolov R, et al. GROMACS 4.5: a high-throughput and highly parallel open source molecular simulation toolkit. Bioinforma Oxf Engl. 2013;29:845–54. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Berendsen HJC, Postma JPM, van Gunsteren WF, DiNola A, Haak JR. Molecular dynamics with coupling to an external bath. J Chem Phys. 1984. October 1;81:3684–90. [Google Scholar]
- 49.Nosé S. Nosé S.: A molecular-dynamics method for simulations in the canonical ensemble. Mol. Phys. 52, 255–268. Mol Phys. 1984;52(2):255–68. [Google Scholar]
- 50.Hoover WG. Canonical dynamics: Equilibrium phase-space distributions. Phys Rev A. 1985. March 1;31(3):1695–7. [DOI] [PubMed] [Google Scholar]
- 51.Darden T, York D, Pedersen L. Particle mesh Ewald: An N log(N) method for Ewald sums in large systems. J Chem Phys. 1993. June 15;98(12):10089–92. [Google Scholar]
- 52.Ryckaert J-P, Ciccotti G, Berendsen HJ. Numerical integration of the cartesian equations of motion of a system with constraints: molecular dynamics of n-alkanes. J Comput Phys. 1977;23(3):327–41. [Google Scholar]
- 53.Lindorff-Larsen K, Piana S, Palmo K, Maragakis P, Klepeis JL, Dror RO, et al. Improved side-chain torsion potentials for the Amber ff99SB protein force field. Proteins. 2010. June;78(8):1950–8. 10.1002/prot.22711 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. Comparison of simple potential functions for simulating liquid water. J Chem Phys. 1983;79(2):926–35. [Google Scholar]
- 55.Fischer J, Paschek D, Geiger A, Sadowski G. Modeling of aqueous poly(oxyethylene) solutions: 1. Atomistic simulations. J Phys Chem B. 2008. February 28;112(8):2388–98. 10.1021/jp0765345 [DOI] [PubMed] [Google Scholar]
- 56.Frishman D, Argos P. Knowledge-based protein secondary structure assignment. Proteins Struct Funct Bioinforma. 1995. December 1;23(4):566–79. [DOI] [PubMed] [Google Scholar]
- 57.Humphrey W, Dalke A, Schulten K. VMD: visual molecular dynamics. J Mol Graph. 1996. February;14(1):33–8, 27–8. [DOI] [PubMed] [Google Scholar]
- 58.Daura X, Gademann K, Jaun B, Seebach D, van Gunsteren WF, Mark AE. Peptide Folding: When Simulation Meets Experiment. Angew Chem Int Ed. 1999. January 15;38(1–2):236–40. [Google Scholar]
- 59.Allen MP, Tildesley DJ. Computer Simulations of Liquids. Oxford: Oxford Science Publications; 1987. [Google Scholar]
- 60.Andricioaei I, Karplus M. On the calculation of entropy from covariance matrices of the atomic fluctuations. J Chem Phys. 2001. October 8;115(14):6289–92. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
All relevant data are within the paper and its Supporting Information files.