Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2018 Dec 14.
Published in final edited form as: J Phys Chem B. 2017 Nov 30;121(49):11072–11084. doi: 10.1021/acs.jpcb.7b08785

Slow-Down in Diffusion in Crowded Protein Solutions Correlates with Transient Cluster Formation

Grzegorz Nawrocki 1,#, Po-hung Wang 2,#, Isseki Yu 2,3, Yuji Sugita 2,3,4,5, Michael Feig 1,4,*
PMCID: PMC5951686  NIHMSID: NIHMS964256  PMID: 29151345

Abstract

For a long time the effect of crowded cellular environment on protein dynamics has been largely ignored. Recent experiments indicate that proteins diffuse much slower in a living cell than in a diluted solution and further studies suggest that the diffusion depends on the local surrounding. Here, detailed insight into how diffusion depends on protein-protein contacts is presented based on extensive all-atom molecular dynamics simulations of concentrated villin head piece solutions. After force field adjustments in the form of increased protein-water interactions to reproduce experimental data, translational and rotational diffusion was analyzed in detail. While internal protein dynamics remained largely unaltered, rotational diffusion was found to slow down more significantly than translational diffusion as the protein concentration increased. The decrease in diffusion is interpreted in terms of a transient formation of protein clusters. These clusters persist on sub-microsecond time scales and follow distributions that increasingly shift toward larger cluster size with increasing protein concentrations. Weighting diffusion coefficients estimated for different clusters extracted from the simulations with the distribution of clusters largely reproduces the overall observed diffusion rates, suggesting that transient cluster formation is a primary cause for a slow-down in diffusion upon crowding with other proteins.

TOC image

graphic file with name nihms964256u1.jpg

INTRODUCTION

The cellular environment is very crowded. Up to 40% of the cell volume, i.e. as much as 400 mg/ml, is occupied by macromolecules such as proteins, lipids, and nucleic acids.1 More than half of this fraction are proteins2 that draw particular attention because of their functional relevance. For a long time, the importance of the cellular surrounding has not been fully appreciated as most of the experimental and theoretical studies were restricted either to the analysis of proteins in dilute solution or to simplifying considerations of cellular crowding effects.3 Since recently, an increasing number of studies of proteins in realistic cellular environments46 indicate that in vivo conditions can have a substantial impact on protein structure and dynamics,79 with crucial consequences for their biological function.10

One of the most important properties in the context of cellular environments is protein diffusion as a key determinant of biological processes.1114 Studies conducted in vivo indicate that both translational and rotational motions of proteins are significantly slower in vivo than in dilute solution.1518 However, the degree of retardation varies significantly, even within the same cell depending on the subcellular environment.19 Reduced diffusion may slow down reactions or speed up others, e.g. by extending the presence of a ligand in the vicinity of a receptor.20 The best-understood effect of crowded cellular environments is the so-called excluded-volume effect where crowder molecules limit the available volume for a given solute.21 When the volume fraction of crowder molecules increases, diffusion is reduced simply because the free space is diminished, but there are additional factors beyond the excluded-volume effect that affect protein diffusion in a cell.22 Altered solvent properties such as higher viscosity and a reduced dynamics of water23 could influence macromolecular diffusion, although some experiments indicate that the intracellular viscosity is not a limiting factor for protein tumbling in cells.24, 25 Finally, weak non-specific transient interactions with cytoplasmic components, often termed quinary interactions26, 27, have recently been considered as key factors governing protein diffusion.8, 15, 16, 28 However, the nature and time scales of how such interactions may shape diffusive properties remains unclear.

It has been shown previously, mostly for lysozyme protein solutions,2931 that even at moderate concentrations proteins are prone to form dynamic clusters. As the protein concentration increases, solutions initially dominated by single proteins have been seen to develop large, branched and irregular clusters32 that persist on a finite time even above the geometric percolation threshold.30 Other recent studies indicate that, although dynamic, such clusters persist long enough to retard the overall protein diffusion,31 as diffusion is simply size-dependent.33 The general expectation has been that the cluster formation primarily affects short-time diffusion, whereas long-time diffusion is believed to be still determined by individual proteins which would be reflected in time-scale dependent diffusive behavior. Such behavior would be consistent with colloidal suspensions, where cluster formation has been seen as a result of weak short-range attraction and long-range repulsion.34 For both, protein solutions and colloids in general, it has been shown that shifting the balance toward more attractive interactions, e.g. increasing the concentration, decreasing temperature, or screening the repulsive electrostatic by high ionic strength or pH, enhances cluster formation and the possibility of liquid-liquid phase separations.29, 30, 3337 Studies of colloids, furthermore, show that as clusters grow, a transition to glass and gel phases occurs where dynamically arrested states are formed before crystallization occurs.38 Experiments, theory and simulations of colloid systems generally assume that besides the attraction-dominated states, where particles move together, a repulsion-dominated state exists, where particles are trapped in a cage formed by surrounding particles.39 All of these effects might be expected to contribute to the protein diffusion in a dense solution. Nevertheless, it is questionable to what extend the dynamics of a protein solution might be explained in terms of simple colloidal chemistry since the interactions between proteins are highly complex40, 41 and the present study aims to shed more light on such questions.

Experimental studies of protein dynamics under in vivo conditions or even in the presence of high concentrations of biological crowders such as other proteins are possible but do not always provide full detailed mechanistic insights.4244 Molecular dynamics (MD) simulations are available as an alternative to study crowded systems in atomistic detail. Recent MD simulations have investigated the properties of water23, 45, the stability of proteins8, 9, the conformational sampling of peptides46, and interactions between proteins28 under crowding conditions. A more extensive review is available elsewhere.6

In this study, we describe MD simulations of the well-studied globular protein villin headpiece (simply referred to as ‘villin’ throughout this manuscript) at a range of concentrations from dilute conditions to 10% volume fraction, close to the experimental solubility limit for villin.9 Since villin is not expected to aggregate or form specific clusters, it is used here as a model system for studying the effect of cellular crowding on macromolecular diffusion. Previous studies have indicated that current force fields may lead to artificial protein aggregation under self-crowding conditions47, 48, including for villin.48 A similar observation was also made for the sampling of disordered peptides that form ensembles that are generally too compact with most force fields.49, 50 Both findings suggest that protein-water interactions may be too weak relative to protein-protein interactions. To restore the balance, modified force fields enhance the Lennard-Jones potential between water and protein atoms.49, 51 We followed the same strategy here and initially calibrated a modified force field with scaled water-protein interactions to match solution properties of villin known from experiment. The trajectories with the optimal force field were then further analyzed with respect to diffusive properties and their dependence on transient cluster formation under crowding conditions.

METHODS

Crowded Protein Systems

The chicken villin headpiece HP-36 (PDB entry 1VII52) was simulated at concentrations of 8, 16, and 32 mM. Systems at 8 and 16 mM concentrations consisted of eight villin copies. Systems at 32 mM had either eight or 64 copies of villin (see Table 1). In the initial configuration of each system, the proteins were placed in a cubic box in non-overlapping random positions and orientations (see Fig. 1). The box sizes were chosen to achieve the desired concentrations (see Table 1) and to be significantly larger than the characteristic molecular scales. The box edges are long enough to accommodate 5–10 villin molecules if packed together and are several times larger than average inter-protein distances. Periodic boundary conditions (PBC) were applied to avoid boundary effects. The systems were then solvated with TIP3P water and random water molecules were substituted with two Cl ions per villin molecule to neutralize each system. A system consisting of one copy of villin was used as a reference for infinite dilution. The size of the dilute system was chosen to prevent the protein from interacting with its own images under PBC, even when partially unfolded, neither directly nor indirectly (via mediated water layers). Throughout the manuscript we use residue numbering 1–36 for the villin residues that correspond to residues 41–76 in the full biological sequence.

Table 1.

Simulated concentrated villin solutions

[villin] Np1 V (%)2 #atoms3 Box length4 (nm)
(mM) (g/L)
dilute 1 21 K 5.9
8 34 8 2.5 165 K 11.8
16 77 8 5.0 74 K 9.0
32 138 8 10.0 41 K 7.5
32 135 64 10.0 336 K 14.9
1

number of protein copies;

2

volume fraction occupied by the proteins;

3

total number of atoms including ions and water molecules;

4

after equilibration in the NPT ensemble.

Figure 1.

Figure 1

Configurations after 100 ns for concentrated systems of eight villins at 8 mM (A), 16 mM (B), and 32 mM (C) as well as 64 villins at 32 mM (D) concentration. Colors are used to distinguish different villin copies, water molecules are shown in grey.

Molecular Dynamics Simulations

All-atom MD simulations were carried out with NAMD53 (version 2.10), CHARMM54 (version 42a1) together with OpenMM55, and GENESIS56. The initial systems were subjected to energy minimization, heated to 298 K and then simulated for 2 μs. The CHARMM c36 force field57 was used to describe interactions involving villin and ions and TIP3P58 was used as the water model. Protein-water interactions were enhanced following the prescription by Best et al.49. Using NBFIX, scaling factors λ between 1.00 and 1.10 in a modified Lennard-Jones potential were applied to interactions between water atoms i and protein atoms j:

VLJ(rij)=λεij{(Rijminrij)122(Rijminrij)6} (1)

The Lennard-Jones and short-range electrostatic interactions were shifted to zero at 0.9 nm in the one- and eight-villin systems and were cut off at 0.9 nm with a switching function becoming effective at 0.8 nm in the 64-villin system. Long-range electrostatic interactions were calculated using the particle mesh Ewald (PME) summation method59 under PBC. A more extensive description of the MD protocol can be found in the Molecular Dynamics Simulations section of the SI.

Folding Free Energy via Replica Exchange Simulations

Simulations of the one-copy villin were performed with Hamiltonian Replica Exchange MD (H-REMD) using λ values of 1.05, and 1.10 to obtain folding free energies, ΔGf, of villin. Unfolded protein conformations were generated by biasing the distance between Cα of Arg15 (in the second α-helix) and Cα of Leu29 (in the third α-helix) to facilitate the folding-unfolding transition as introduced in a previous study60. 20 replicas were run where a harmonic potential using a force constant of k=4.5 kcal/mol was varied to target distances ranging from 1.05 to 2.19 nm with a spacing of 0.06 nm between replicas. Each replica was simulated over 100 ns with exchange attempts every 10 ps.

The potential of mean force (PMF) as a function of distance (d) and root-mean-square deviation (RMSD) was calculated using 2D Weighted Histogram Analysis Method (WHAM) implemented by Alan Grossfield.61 The 2D PMF was further integrated to obtain 1D PMFs as a function of RMSD. The ΔGf was calculated as a probability ratio of finding the protein in unfolded (UF) and folded (F) states, according to:

ΔGf=kBTlnK=kBTln[iFp(RMSDi)iUFp(RMSDi)] (2)

where kB is Boltzmann constant, T is temperature, and K is the equilibrium constant. The folded state was defined as RMSD<0.4 nm, whereas, the unfolded as RMSD>8.0 nm for λ=1.05, and RMSD>7.8 nm for λ=1.10. Averages and errors were calculated from 10-ns non-overlapping intervals. This protocol follows the analysis applied in Huang et al.60 for the same system with λ=1.00, the results of which are used here as well for comparison.

Analysis

The analysis of the MD trajectories was performed with the MMTSB Tool Set62 and the VMD visualization package63. Custom scripts were written for calculation of translational diffusion, contacts between proteins, time scales of contacts formation, and distribution of clusters. The SciPy package64 was applied for fitting data. The Matplotlib65 library was used for plotting the results.

The first 200 ns of each multiple protein copy trajectory were omitted from analysis to allow for full equilibration and avoid dependence on the initial configurations. Except for the calculation of order parameters (S2l), only the snapshots of Cα positions saved every 100 ps were used.

Translational diffusion coefficients, Dt, were obtained from mean square displacements (MSD) of protein centers of mass for a given protein:

MSD(τ)=(r(t+τ)r(t))2 (3)

where r is the position of the protein at time t, and τ is the lag time between the two positions. The MSD values were obtained by sliding an observation window of size τ from 0.1 ns to 100 ns along the trajectory in steps of Δt=0.1 ns. Dt was then calculated from the slope of a linear fit to MSD(τ) according to the Einstein relation:

Dt=MSD(τ)6τ (4)

Dt was further corrected for PBC artifacts taking account of altered water viscosity in the crowded systems (see Eq. S1 and S2 in the Analysis section in the SI).

The internal dynamic correlation function was calculated from the MD trajectories according to:

CI(t)=P2|μ^(0)μ^(t)| (5)

where P2(x) is the second-order Legendre polynomial, μ describes the orientation of the unit N-H bond vector with respect to the molecular reference frame. The resulting functions were fitted with:

C1(t)=Sl2+(1Sl2)etτe (6)

where S2l is the generalized order parameter that describes protein dynamics, and τe is the time constant for internal motions.

The rotational correlation function was determined from the trajectories following the protocol proposed by Wong and Case66. Randomly distributed unit vectors were centered and rotated along with the protein before fitting it to a reference structure for each frame. Correlation functions were then obtained as:

CO(ti,t)=P2(nj(ti+t)nj(ti))j (7)

where nj(ti) is one of the randomly distributed unit vectors (generated at time ti). The correlation functions were then fitted to a double-exponential function:

CO(t)=SR2etτRs+(1SR2)etτRf (8)

with fast, τRf, and slow, τRs, correlation times, and weighted by SR2. Rotational diffusion estimates were then obtained according to Eq. 9 and do not require a PBC correction.67

Dr=1l(l+1)τ (9)

where l=2 and the overall rotational correlation time τ is obtained from the fast and slow correlation times as follows:

τ=SR2τRs+(1SR2)τRf (10)

Additionally, averages of the inverse tumbling times were also calculated as they are more relevant for comparison with many experiments such as dynamic light scattering68:

τini(SR2τRs+1SR2τRf)1 (11)

The correlation functions obtained at the highest concentrations were additionally fitted to a triple-exponential function since a double-exponential fit did not fully describe the data:

CO(t)=SRs2etτRs+SRm2etτRm+(1SRs2SRm2)etτRf (12)

where τRf, τRm, and τRs are fast, medium and slow correlation times with the corresponding SRm2 and SRs2 weights.

For both, translational and rotational diffusion rates, errors were obtained from averaging over four subsequent 500 ns trajectory blocks in dilute and over the proteins copies in dense systems. To analyze transient cluster formation, two proteins were assumed to be in a contact if any of their Cα atoms were closer than 0.7 nm. Clusters were counted based on proteins in contact considering minimum images under PBC.

A residue contact map for interactions between proteins based on Cα coordinates was calculated as follows: For each residue i in a given protein copy, Pm, the minimum distance to residues j in other protein copies Pn, excluding m, was calculated as dmin(ri(iPm),rj(jPn)). The distance for a given pair was then averaged by taking each protein m in turn as the starting point and by averaging over simulation snapshots. The example of a minimum distance calculation shown in Fig. S1 indicates that the resulting contact map is not strictly symmetric. The kinetics of contact formation was described by a correlation function depicting whether a contact is still present after a given period of time (see Eq. S3 in the Analysis section in SI). The correlation function was then fitted to a triple-exponential function:

Cc(t)=Scsetτcs+Scmetτcm+(1SscScm)etτcl (13)

with short (τcs), medium (τcm), and long (τcl) relaxation times and weights Scss, Scm and Scl=1ScsScm. No constraints were applied when fitting Eq. 13 and the functional form implies a decay to zero at long times since we did not observe permanent contact formation as in aggregation.

Representative structures of clusters were extracted from the trajectories at the middle of a time interval when a cluster was formed for at least 1 ns. Uncertainties in the cluster distribution and in the diffusion of MD-derived structures were obtained by averaging over four subsequent 450 ns trajectory blocks (discarding 200 ns as equilibration).

Coarse-Grained Simulations

Coarse-grained simulations of purely attractive, neutral, and repulsive proteins were carried out for comparison. Systems of eight protein copies at 8, 16 and 32 mM concentrations were modeled and each of them was simulated with the three different types of interactions. For more details see the Coarse-Grained Simulations section in the SI.

RESULTS AND DISCUSSION

Force Field Optimization

Previous studies have indicated that current force fields may overestimate protein-protein interactions relative to protein-water interactions.47, 48 Therefore, we considered a modified force field based on CHARMM c3657 where the Lennard-Jones potential between water oxygen and protein atoms was scaled by a factor λ (see Eq. 1) as proposed by Best et al.49 We tested λ values between 1.0, corresponding to the unmodified force field, and 1.2, corresponding to protein-water interactions that are increased by 20%. Simulations with different scaling factors were carried out for the eight-copy villin systems at different concentrations (see Table 1) to determine which value of λ provides results that are most consistent with experiment.

Since the strength of the protein-water interactions modulates the hydrophobic effect, we first analyzed whether the stability of villin is maintained when protein-water interactions are enhanced. We found that λ values up to 1.09 do not significantly compromise the stability of the native state (with average RMSD values generally below 0.3 nm, see Fig. S2) while larger scaling factors led to significant destabilization of the native fold. With scaling factors of λ=1.09 and less, there was only occasional partial unfolding that lasted at most for tens of nanoseconds before refolding. With λ=1.10, we find more pronounced native state destabilization in the concentrated solutions, especially at 8 and 32 mM, but not under dilute conditions, based on the average RMSD values. We also calculated average internal order parameters Sl2 (according to Eq. 6, see Table S2) and found little change up to scaling factors of 1.09 but a significant increase in internal dynamics with λ=1.10. Partial destabilization of villin in simulations under crowding conditions has been noted previously,9 although, in that case, the observed partial unfolding of villin was related to interactions with protein G that are not present here. Previous NMR experiments clearly showed that villin maintains its stability even at 32 mM, as there were only minor changes in chemical shifts for residues at the surface.9 Therefore, based on the stability analysis, we concluded that protein-water scaling factors of 1.10 or larger are not consistent with the NMR experiments.

We further analyzed the formation of clusters based on the contact definition described in the Methods section. Generally, as expected, small clusters (monomers, dimers etc.) dominated at low concentrations, whereas larger clusters appeared increasingly at higher concentrations (see Fig. S3). Also as expected, increasing scaling factors shifted the cluster distributions within a given concentration toward smaller clusters (see Fig. S3). The systems are too small and sampling requirements are too large to directly determine solubility from these simulations.69 However, the fraction of the largest possible cluster (octamer) is an indicator of aggregation propensities and whether the solubility limit is approached or exceeded. At 8 mM, octamers were not formed to a significant extent with any scaling factors (see Fig. S3) indicating that villin remained fully soluble in all cases. Scaling factors of 1.00, 1.01, and 1.02 led to significant octamer populations at 16 mM (about 50%. see Fig. S3) while at 32 mM, scaling factors from 1.00 to 1.05 resulted in 50–100 % octamer formation. In experiments, villin is still soluble at 32 mM.9 This observation is not compatible with the strong aggregation found in the simulations at 16 and 32 mM with scaling factors below 1.03 and probably also not with larger scaling factors up to 1.05 where significant aggregation into large clusters is seen at 32 mM.

Since protein stability appears to be sensitive to the scaling factor in the studied range, as the RMSD results show, we calculated the folding free energy of a single villin using different λ values. PMFs as a function of RMSD are shown in Fig. S4. According to the analysis described in the Method section we determined ΔGf as −4.94 kcal/mol, −4.14 kcal/mol and −3.47 kcal/mol for λ equal to 1.00, 1.05, and 1.10, respectively. The comparison with experimental values of −2.4 kcal/mol70 and −3.1 kcal/mol71 (for villin without the N-terminal methionine) suggests that the largest scaling factor gives the best agreement.

Taking all of the evidence together, it is clear that the scaling of protein-water interactions improves the agreement between simulations and experiment. On one hand, scaling factors above 1.05 are needed to prevent strong aggregation while factors of 1.10 and larger lead to native state destabilization in the presence of other villin molecules. The calculated folding free energies for a single villin in dilute conditions furthermore suggest that scaling factors closer to 1.10 are best. Therefore, λ=1.09 was chosen as the optimal scaling factor to describe the concentrated villin solutions studied here as the largest value that does not lead to significant destabilization under crowded conditions. Incidentally, this value is very close to the value of λ=1.10 chosen by Best et al. for the AMBER force field to improve the sampling of disordered peptides and protein-protein affinities49, although it will require further studies to determine whether this exact scaling factor is generally optimal for simulating concentrated protein systems with the CHARMM force field.

The simulations at 8, 16, and 32 mM with λ=1.09 as well as an additional simulation of 64 villin copies at 32 mM, also with λ=1.09, were subsequently analyzed in more detail.

Internal Dynamics of Villin

The villin structures remained largely stable at all concentrations in the simulations. The average Cα RMSD values with respect to the NMR structure (PDB code: 1VII) were 0.26 ± 0.009 nm at 8 mM, 0.26 ± 0.007 nm at 16 mM, and 0.25 ± 0.005 nm and 0.31 ± 0.03 nm in the eight- and 64-villin systems at 32 mM. In the eight-villin simulations, two proteins spontaneously unfolded (8 mM and 16 mM concentrations) for tens of nanoseconds forming a structure of an open hairpin that mostly preserved the secondary structure before refolding. Four proteins unfolded in the 64-copy system partially losing the secondary structure and remaining extended for a longer time period. Native-state destabilization of villin upon crowding has been seen by us before in shorter simulations under different conditions,9 but the unfolding seen here is likely due to a much larger amount of sampling. Villin is only stable by 3.1 kcal/mol and folding/unfolding rates are on the order of 5 μs.71 Therefore it would be expected that villin can spontaneously unfold during microsecond-scale simulations even under dilute conditions.

The internal dynamics was further analyzed by calculating generalized order parameters Sl2 as a function of residue (see Fig. 2). The results suggest that there is little overall effect of crowding on the internal dynamics as a function of concentration, but it appears that villin flexibility may be slightly increased for residues 11–14 and 33–35 while being decreased for residues 4–9.

Figure 2.

Figure 2

Order parameter Sl2 for villin residues (except for Met1 and Pro22) at 8 mM (blue), 16 mM (green) and 32 mM (red: eight-villin system; orange: 64-villin system; magenta: 64-villin system without the four villins that unfold during the simulation). Results under dilute conditions are shown for reference as a black dashed line. Statistical uncertainties are shown as error bars.

Translational Diffusion of Villin

Translational diffusion coefficients were obtained from mean square displacements (see Fig. 3) and were corrected for PBC artifacts as described in the Methods section. The MSD curves display somewhat different slopes at different time scales indicative of transient behavior. Therefore, we obtained diffusion coefficients for three time regimes: < 1 ns, 1–10 ns, and 10–100 ns. Table 2 summarizes the results.

Figure 3.

Figure 3

Mean square displacement vs. time based on center of mass of villin at 8 mM (blue), 16 mM (green), and 32 mM (eight-copy: red; 64-copy: orange). Linear functions fitted to different time scales (<1 ns, 1–10 ns, and 10–100 ns) are shown as dashed lines (dotted, dash-dotted, dashed, respectively). The values are shown in regular (A), logarithmic (B) scales. The derivative of the log-log plot is shown in (C). Gray dashed lines with the slop equal to 1 are shown for reference.

Table 2.

Translational diffusion coefficients of villin at different concentrations

[villin]
(mM)
Np Dt ;< 1 ns
(nm2/ns; slow-down)
Dt ; 1–10 ns
(nm2/ns; slow-down)
Dt ;10–100 ns
(nm2/ns, slow-down)
dilute 1 0.49 (0.004) 1 0.48 (0.008) 1 0.46 (0.03) 1
8 8 0.43 (0.01) 0.88 0.41 (0.01) 0.85 0.40 (0.02) 0.87
16 8 0.35 (0.007) 0.71 0.34 (0.006) 0.71 0.31 (0.01) 0.67
32 8 0.25 (0.008) 0.51 0.24 (0.009) 0.50 0.22 (0.01) 0.48
32 64 0.21 (0.004) 0.43 0.18 (0.004) 0.38 0.16 (0.004) 0.35

Error estimates from averaging over blocks (dilute solution) and protein copies (concentrated solution) are given in parentheses. For each value, the slow-down relative to diffusion in dilute solvent is given in bold.

The TIP3P water model underestimates the viscosity of pure water (ηTIP3P = 0.35 cP72 (at 293 K) compared to the experimental value of ηexp = 0.89 cP73). Therefore, the diffusion of solutes in TIP3P is too fast and a scaling factor based on the ratio of the viscosities, ηexp/ηTIP3P = 2.54 (at 298 K), has previously been applied to correct diffusion coefficients obtained with TIP3P water.74 Based on this scaling factor, we obtain Dt = 0.187 nm2/ns under dilute conditions. Experimental data for Dt is not available for villin, but the program HYDROPRO provides theoretical estimates that generally match experimental values closely for dilute conditions.75 The HYDROPRO estimate is Dt = 0.177 nm2/ns, in excellent agreement with the MD-derived value after scaling.

Under crowded conditions, diffusion rates slow down as expected. Diffusion on different time scales decreases to a different extent but the effect is not very strong (see Table 2 and Figure 3). Time-dependent diffusion in the presence of protein crowders has been examined before for CI228 and discussed in the context of variable protein-protein interactions. Standard colloid theory based on hard sphere models predicts two main diffusive regimes associated with faster short time diffusion within the cage of surrounding proteins and longer time diffusion where a molecule escapes the cage and is then affected by macroscopic solvent viscosity. In the case of CI2, this classical two-regime picture was seen in the case of weaker interactions with protein crowders, but when strong interactions with the surrounding protein crowders were present, the overall diffusion was reduced more and the cage effect was less pronounced as short and long time diffusion were similar.28 The argument made in this earlier work was that diffusion was slowed down already at short time scales due to cluster formation while diminishing the cage effect. The absence of a strong cage effect is apparent from the log-log plot in Fig. 3B and the derivative analysis in Fig. 3C that shows slopes near 1 throughout the entire range of time scales with only a shallow minimum between 10–100 ns where the slope of the log-log curves decreases to 0.85–0.9 in the most concentrated system. This indicates only weak subdiffusive behavior for this system. The observation of similar short and long time diffusion in the villin simulations reported here then also suggests extensive cluster formation.

Similar results for the 8- and 64-copy systems at 32mM indicate that different box sizes do not have a strong effect on the findings reported here. On the other hand, the slightly more reduced diffusion rate in the 64-copy 32mM system can be rationalized simply because larger clusters can be formed than in the eight-copy system.

The simulations reported here are about an order of magnitude longer than our previous simulations of CI2 and the size of the villin molecules is smaller than then crowders in the CI2 study. More formally, the limits of short time diffusion are generally considered as τHtτI, with

τHR2ρηφandτIR2Dt,dilute (12)

where R is the particle radius, ρ is the solvent density, η is the solvent viscosity, and ϕ is the volume fraction.41 For the 32 mM villin solution, this amounts to a time range of 25 ps ≪ t ≪ 11 ns, which is well below the 2-μs simulation length of the present simulations. Therefore, villins would have enough time to diffuse out of a local cage environment and reach the long-time diffusion limit. Further evidence that this is indeed the case comes from analyzing the exchange of the nine closest partners, Nex9, based on a quantity introduced earlier,45, to characterize complete exchange of the local environment for a given villin molecule. In the 64-villin trajectory we found that more than half of the villins in the 64-villin system experienced at least one complete swap of the local environment. This finding also supports that the long time diffusion regime has been reached. As further illustration, Movie S1 shows the center-of-mass motion of selected villins and how one villin escapes from one cage and diffuses freely as a monomer and then enters another cage in the simulation. Another factor influencing the results may be finite-size effects, however, the results with the 8-copy and 64-copy are qualitatively similar with the quantitative differences assumed to be due to the formation of larger clusters in the 64-copy system as discussed above. Therefore, it is unlikely that the lack of clear two time regimes is due to insufficient simulation length or finite-size effects. Instead, it appears that almost uniform diffusion from picoseconds to at least microseconds is a feature of this system and a result of extensive cluster formation.

Rotational Diffusion of Villin

In the crowded systems the rotational correlation functions based on the reorientation of vectors attached to the molecular frame (see Methods) were calculated for each protein copy separately. As Figure 4 shows, there is significant variation between the correlation decays in different copies and especially at the highest concentration, some correlation functions display non-exponential behavior that we attribute to the inhomogeneity of local environment and limited statistics. In the following discussion, we focus on averages over individual protein copies.

Figure 4.

Figure 4

Correlation functions for scalar product between random vectors fixed to the molecular frame used for describing rotational diffusion (see Methods) of individual villin copies (color lines) at 8 mM (A), 16 mM (B), 32 mM with eight copies (C), and 32 mM with 64 copies (D). Correlation functions averaged over villin copies are shown with error bars (black lines).

The correlation functions were fit with a double-exponential with two time scales. The fits to the average correlation functions (black lines in Figure 4) are shown in Figure 5. For the highest concentration solutions, we also fitted a triple-exponential function (Eq. 12) to better represent the initial decay of the correlation functions. Anisotropy in rotational diffusion was not considered here. The resulting double-exponential fit parameters averaged over fits of individual correlation functions are given in Table 3. The results for the triple-exponential fit to the average correlation functions for the 64-copy 32mM system are:τRf = 0.845 ns, τRm = 5.762 ns, and τRs = 29.596 with the weights SRm2 = 0.390 and SRs2 = 0.200 resulting in Dr=0.011 nm2/ns, which is similar to the result from the double-exponential fit. The dilute value is again about three times faster than the value of Dr = 0.068 ns−1 obtained from HYDROPRO due to the underestimated viscosity with the TIP3P water model.

Figure 5.

Figure 5

Correlation functions for scalar product between random vectors fixed to the molecular frame used for describing rotational diffusion (see Methods) averaged over villin copies at 8 mM (blue), 16 mM (green), and 32 mM (eight-copy: red; 64-copy: orange). Villin in dilute solvent is shown as a black line (averaged over trajectory blocks). Exponential functions fit to the average correlation functions are shown as dotted lines. Triple-exponential fits (at 32 mM with 8 and 64 protein copies) are shown as dashed lines (B). All fits have R2 values of larger than 0.995 and χ2 values of less than 0.05. Values of τ are shown in regular (A) and logarithmic (B) scales.

Table 3.

Rotational diffusion parameters of villin at different concentrations

[villin]
(mM)
Np τRf
(ns)
τRf−1
(ns−1)
τRs
(ns)
τRs−1
(ns−1)
SR2 τ
(ns−1)
τini
(ns−1)
Dr
(ns−1)
Dr,ini
(ns−1)
Dr slow-down Dr, ini slow-down
dilute 1 0.59
(0.08)
1.870
(0.321)
1.48
(0.45)
0.887
(0.181)
0.49
(0.13)
0.820
(0.026)
0.722
(0.022)
0.204
(0.006)
0.232
(0.007)
1 1
8 8 0.97
(0.13)
1.142
(0.110)
3.96
(0.93)
0.342
(0.052)
0.17
(0.03)
1.388
(0.180)
1.095
(0.128)
0.134
(0.014)
0.166
(0.015)
0.65 0.72
16 8 1.00
(0.09)
1.069
(0.099)
3.80
(0.21)
0.271
(0.016)
0.42
(0.05)
2.223
(0.207)
1.489
(0.118)
0.082
(0.009)
0.119
(0.011)
0.41 0.51
32 8 1.83
(0.20)
0.601
(0.065)
13.81
(0.81)
0.075
(0.005)
0.52
(0.06)
8.151
(0.910)
3.574
(0.487)
0.024
(0.004)
0.057
(0.010)
0.12 0.25
32 64 2.70
(0.15)
0.445
(0.023)
24.50
(1.07)
0.047
(0.003)
0.54
(0.02)
14.445
(0.721)
5.550
(0.369)
0.014
(0.001)
0.038
(0.002)
0.07 0.16

Fast (τRf) and slow (τRs) correlation times and SR2 weight as in Eq. 8. Averaged inverse tumbling times are reported as well. Total tumbling times τ and τini were obtained according to Eqs. 10 and 11. Rotational diffusion coefficients Dr and Dr,ini were obtained according to Eq. 9. Error estimates are given in parentheses and the slow-down is relative to rotation in dilute solvent.

As the concentration increases in the lower range (8 – 16 mM), the contribution to the overall diffusion, described by SR2, shifts toward the slow time scales, whereas, the fast, τRf, and the slow, τRs, correlation times remain mostly constant. In the higher concentration range (16 – 32 mM), both τRf and SR2 are relatively unaltered, whereas τRs significantly increases. This is generally consistent with observations from NMR studies for the same systems.76 The resulting values of Dr decrease with increasing concentrations. We are reporting results both based on linear averaging of tumbling times according to Eq. 10 and averaging of inverse tumbling times according to Eq. 11. The latter quantity is more commonly discussed when analyzing experiments such as depolarized dynamic light scattering and focuses on the initial decay slope. One interpretation of this data is that as the solutions become denser, a diminishing fraction of villin rotates as a free monomer, while the remaining villin copies experience increasing crowding effects that cause the formation of larger clusters. Interestingly, the rotational diffusion slows down more significantly at higher concentrations (to 16% of dilute value at 32mM based on averaging inverse tumbling times) than the translational diffusion (48–56% of dilute value at 32mM). This generally agrees with the results from the Pielak group for crowded systems involving weakly interacting proteins15, but it differs from results from the Saalwächter group68 where rotational properties of largely non-interacting proteins were studied. This may be expected as the decoupling between translational and rotational diffusion under crowding conditions has been suggested to be protein-specific, although methodological differences may also play a role.68

Transient Cluster Formation

At high concentrations, frequent encounters between proteins are unavoidable. The previous analysis is already suggesting that such interactions may lead to transient clusters and we analyzed this aspect further. The interactions between the proteins were characterized by the calculations of an average number of contacts a single protein forms with others. The results are summarized in Table 4. The number of contacts doubled when the concentration increased from 8 to 16 mM, and rose by another 50% when the concentration increased further from 16 to 32 mM. To shed light on the nature of the interactions governing the contacts formation, we compared with coarse-grained (CG) simulations, where residues were assigned purely attractive, neutral, or repulsive interactions (see Methods section for more details) (Table 4). We find that the number of contacts in the all-atom simulations is roughly one order of magnitude larger than in the CG model with neutral interactions but still significantly less than with purely attractive interactions. We note that the available volume at any of the concentrations does not force interactions as there were no protein-protein contacts in the CG simulations with the repulsive potential. This comparison suggests that the villin interactions are weakly attractive.

Table 4.

Average number of contacts per protein obtained from all-atom and CG simulations

[villin] (mM) all-atom CG attractive CG neutral CG repulsive
8 0.48 (0.13) 3.73 (0.32) 0.065 (0.005) 0.00 (0.00)
16 0.94 (0.12) 3.90 (0.33) 0.093 (0.011) 0.00 (0.00)
32 1.46a (0.19); 2.05b (0.07) 3.85 (0.23) 0.229 (0.016) 0.00 (0.00)
a

eight copies,

b

64 copies

The weak attraction led to transient cluster formation. Fig. 6 shows the size of clusters as a function of concentration. At 8 mM, 60% of the proteins exist as monomers but there is a small fraction of dimers, trimers, and tetramers. At 16 mM, the fraction of monomers decreases to 35% and clusters as large as hexamers are observed. At 32 mM, the eight-copy system is too small to adequately characterize the cluster distribution. In the 64-copy system, we find significant fractions of clusters as large as 45-mers while the fraction of monomers is decreased to less than 10%. Since larger clusters than 50-mers were not observed, we conclude that villin still remains soluble at 32 mM in the simulations but higher-order clusters are present in the simulations with non-negligible fractions. The average cluster size in the 64-copy system, Nc, is 13, a number that is close to Nc = 3–5, previous SAXS experimental values for lysozyme at a volume fraction of 0.1.34 Given the differences in proteins, protein sizes and protein-protein interactions, this is reasonable qualitative agreement. Furthermore, the average number of contacts obtained at 32 mM from 64 protein copies system (Table 4), i.e. 2.05±0.07, corresponds well to the maximum number of neighbors (2) determined from colloids interacting by the centrosymmetric potential as well as patchy colloids at the volume fraction ϕ = 0.1.41

Figure 6.

Figure 6

Distribution of cluster sizes for simulations of eight proteins (A) at 8 mM (blue), 16 mM (green) and 32 mM (red) and of 64-copy system (B) at 32 mM based on contacts defined as Cα-Cα distances of less than 0.7 nm (see Methods).

Villin is not known to form specific clusters and we also did not find a strong preference for specific complexes as the population of cluster structures for a given cluster size is quite diverse (see Fig. S5). However, according to our simulations, certain parts of the villin structure are more likely to be involved in the protein-protein interfaces than others (see Fig. 7). Favorable interactions were found for the residues belonging to the N-terminal α-helix (residues 5–10) and the last few residues at the C-terminus. The contact maps are in rough agreement with a previous simulation study of villin contacts in a crowded mixture of villin and protein G.9 When mapped onto the three-dimensional villin fold (Fig. 8A), it becomes apparent that there is a preference for one side of the structure to interact. The residues involved in these interactions are largely acidic and basic suggesting that the interactions that lead to non-specific clusters are driven more by electrostatics and salt-bridge formation than by hydrophobic interactions. These interactions can also partially explain the observed weak concentration-dependency of specific regions in Sl2 (see Fig. 2). Favorable interactions at residues 5–10 and at the C-terminus are enhanced as the concentration increases. Enhanced interactions result in the decreased flexibility at residues 5–10. In contrast, favorable interactions at the C-terminus result in the increased flexibility at the C-terminus. The difference is that, bulkier residues are facing outward at the C-terminus (e.g. Phe36) but those residues are facing inward at residues 5–10 (e.g. Phe7), interacting with other phenylalanines (Phe11 and Phe18). In the crowded environment, favorable interactions at residues 5–10 are strengthened without being affected by Phe7 but these residues are weakened by a flexible Phe36. Also in residues 11–14, crowding partially affects the stability of the three interacting phenylalanines, so the flexibility in this region increases (See Fig. 2 and 8B).

Figure 7.

Figure 7

Contact map of residues between proteins at 8 mM (A), 16 mM (B), and 32 mM (C). Colors indicate the average Cα distances from a given residue to the closest residue of another nearby villin molecule. Residue numbers 1–36 correspond to 41–76 in the biological sequence.

Figure 8.

Figure 8

Structure of villin colored according to the average Cα distance to nearby villin molecules (A) (short: red, medium: green, long: blue) and by residue type (B) (hydrophobic: white, polar: green, acidic: red, basic: blue). Residue numbers 1–36 correspond to 41–76 in the biological sequence.

Although the range of minimum distance decreases with increasing concentration (Fig. 7) the pattern of favorable contact is very similar at all three concentrations. The contact maps obtained at 32 mM concentration with eight (Fig. 7C) and 64 (Fig. S7) protein copies are also in good agreement suggesting that contact preferences are not sensitive to finite-size effects.

The kinetic nature of cluster formation was characterized by defining a conditional contact function as given in Eq. S3 and fitting a triple-exponential function. The results in Table 5 reveal three distinct time scales: 0.74–1.9 ns, 12.5–42.0 ns, and 108–342 ns. The longest time scale contributes most at the lowest and medium concentrations, but at the highest concentration the contribution from all the three time scales is comparable. Furthermore, if the 8-copy 32 mM results are excluded, which may be too small to obtain reliable statistics, the longest time scale clearly increases with concentration as may be expected, but even at 32mM, cluster formation remains highly transient. However, clusters form mostly on time scales that are long (τcl) compared to characteristic rotational time scales of 1–25 ns (see Table 3) and the time it takes to diffuse by the size of villin (on the order of 10 ns; based on data in Table 2). Therefore, although cluster formation is dynamic, clusters persist long enough to significantly affect diffusional properties. This prevents a pronounced cage effect for single villins as they frequently associate with other molecules rather than moving independently in a cage of surrounding crowders. On the other hand, a cage affect at the larger cluster level is also prevented because clusters frequently reorganize and do not persist long enough to experience diffusion on different time scales. As a result, the effective diffusion of villin becomes less time-dependent than standard colloid theory would predict as discussed above.

Table 5.

Time scales of contact formation

C [mM] τcs [ns] τcm [ns] τcl [ns] Scs Scm Scl
8 1.084 38.279 108.374 0.287 0.096 0.617
16 0.740 12.490 242.680 0.263 0.217 0.520
32 (#8) 1.044 19.069 149.563 0.323 0.191 0.486
32 (#64) 1.882 42.043 341.801 0.374 0.295 0.331

Short (τcs), medium (τcm), and long (τcl) correlation times of contacts between proteins along with the corresponding weights Scs, Scm, and Scl obtained from fitting a triple-exponential to a correlation function based on conditional contact formation (Eq. S3).

The longest obtained time scales of contact formation between villins are consistent with findings for lysozyme solutions studied by neutron spin-echo spectroscopy (NSE) that set the lower limit of the cluster lifetime to 25 ns29. Other studies show that the lifetime of lysozyme clusters is finite on millisecond time scales77, 78. The presented lifetimes are also not in contradiction with reports for solutions of bovine β-lactoglobulin (BLG) that indicate that BLG clusters are static on NSE observation time scale, i.e. up to 50 ns33, but have a limited lifetime on the microsecond observation time scale of dynamic light scattering33.

Diffusion Slow-Down as a Consequence of Cluster Formation

The simulation results present a picture where villin molecules engage in the formation of transient clusters that persist on time scales on the same order or larger than rotational and translational diffusion time scales. This suggests that the observed slow-down in diffusion for the villin system is largely related to cluster formation.

We extracted representative structures of clusters of different sizes from the trajectories and estimated their translational and rotational diffusion coefficients via HYDROPRO.75 The resulting diffusion coefficients as a function of cluster size are shown in Fig. S6. Dr decreases more sharply with increasing clusters size than Dt. We then estimated overall diffusion coefficients by summing the cluster-size dependent diffusion coefficients with weights according to the cluster distributions observed in the simulations (see Fig. 6). The results are shown in Table 6 in comparison with the values of Dt and Dr extracted from the simulations. For translational diffusion, there is close agreement between the cluster-based estimates and the actual MD diffusion rates. Rotational diffusion rates also match well. Best agreement is found with the linear-average rotational diffusion constants for lower concentrations but with the inverse tumbling time averages at 32 mM villin concentration. This indicates that additional factors may affect rotational diffusion beyond simple cluster formation such as reduced solvent viscosity due to crowding or electrostatic friction. As electrostatic friction may be orientationally dependent, this could explain the larger discrepancy for rotational than for translational diffusion between the MD-derived results and the cluster-based estimates. Finally, a simple cluster-based estimate of rotational diffusion also does not reflect the significant heterogeneity and overall non-exponentiality seen in the rotational correlation functions for individual copies at the highest concentration (see Fig. 4) which may explain additional discrepancies.

Table 6.

Translation and rotational diffusion based on transient clusters

C [mM] Np Dt,MD* [nm2/ns] Dt,cluster [nm2/ns] Dr,MD* [ns−1] Dr,ini,MD* [ns−1] Dr,cluster [ns−1]
8 8 0.16 (0.006) 0.16 (0.002) 0.053 (0.006) 0.065 (0.006) 0.051 (0.002)
16 8 0.12 (0.004) 0.14 (0.003) 0.032 (0.004) 0.047 (0.004) 0.038 (0.002)
32 8 0.09 (0.005) 0.11 (0.003) 0.0095 (0.0017) 0.022 (0.004) 0.020 (0.001)
32 64 0.062 (0.002) 0.079 (0.012) 0.0057 (0.0004) 0.015 (0.0008) 0.013 (0.002)

Translational (Dt) and rotational (Dr) diffusion calculated either directly from the simulations (Dt,MD for 10–100 ns and Dr,MD) or as a sum of cluster-size dependent diffusion weighted by the observed cluster distributions (Dt,cluster and Dr,cluster);

*

values extracted from the MD simulations (see Tables 2 and 3) were divided by ηexp/ηTIP3P=2.54 to correct for overestimated diffusion with the TIP3P water model.

CONCLUSIONS

We describe out all-atom MD simulations of crowded villin systems in explicit solvent at different concentrations. Simulations of crowded protein systems remain challenging and we had to increase the strength of protein-water interactions to obtain good agreements with experiments.

After adjusting the force field, we found that both, translational and rotational diffusion, slow down to different extents in the crowded environment. Rotational diffusion is slowed down more significantly apparently because of the transient formation of clusters. In the simulations, we observe cluster formation due to weakly attractive but mostly non-specific interactions between villins. As the concentration increases, the distribution of clusters shifts towards higher-order clusters at higher concentrations but the peak of the distribution remains at monomers, suggesting that villin is still soluble in the simulations. The transient cluster formation also appears to be responsible for a lack of the typical cage effect expected for colloids that is manifest by two distinct diffusive regimes but not strongly present for the villin system studied here.

When we predicted diffusion coefficients from clusters observed in the simulations and weigh those values according to the observed cluster fractions, we largely reproduce the overall observed translational and rotational diffusion coefficients. This suggests that the slow-down in diffusion is primarily due to transient cluster formation rather than reduced solvent viscosity or other types of molecular friction. The observed rotational diffusion does not fully match the cluster-based estimates suggesting that additional factors are involved, with electrostatic friction being the most likely explanation. This aspect will be investigated in more detail in future studies.

Although we studied only villin here, we assume that the findings have broader implications for diffusive properties in crowded cellular environments. Essentially, the finding is that diffusion depends largely on transient cluster formation, which in turn depends on the strength of interactions between different macromolecules and ultimately the surface amino acid composition. Therefore, experimental studies could probe this aspect by determining diffusion rates for proteins with sequence variants that alter their surface properties.

The differential slow-down in rotational and translational diffusion rates is interesting and may have some implications for diffusion of proteins and ligands in a cell. In diffusion-limited enzymatic reactions, relatively slower rotational diffusion would allow enzymes to remain pre-oriented, especially when part of metabolic pathways and thereby increase metabolic efficiencies. On the other hand, the apparent prevalence of weak attractive interactions allows for related metabolic enzymes to be close so that products of one enzyme can quickly find the active site of the next enzyme in a given pathway. However, since we only studied villin here, further studies of metabolic enzymes in the presence of ligands will be needed to test these ideas.

Supplementary Material

Movie S1
Download video file (2.6MB, mpg)

Acknowledgments

Funding was provided by the National Institute of Health Grants R01 GM084953 and GM103695 (to MF), from the National Science Foundation Grant MCB 1330560 (to MF), the Fund from the High Performance Computing Infrastructure (HPCI) Strategic Program (hp120309, hp130003, hp140229, hp150233) and HPCI general trial use project (hp150145, hp160120) and FLAGSHIP 2020 project focused area 1 “Innovative drug discovery infrastructure through functional control of biomolecular systems (hp160207, hp170254) of the Ministry of Education, Culture, Sports, Science and Technology (MEXT) (to YS), a Grant-in-Aid for Scientific Research on Innovative Areas “Novel measurement techniques for visualizing ‘live’ protein molecules at work” (No. 26119006) (to YS), a grant from JST CREST on “Structural Life Science and Advanced Core Technologies for Innovative Life Science Research” (No. JPMJCR13M3) (to YS), RIKEN QBiC, iTHES, and Dynamic Structural Biology (to YS), a Grant-in-Aid for Scientific Research (C) from MEXT (No. 25410025) (to IY). Computer time was used at NSF XSEDE facilities (TG-MCB090003).

Footnotes

SUPPORTING INFORMATION AVAILABLE

Details of MD and coarse-grained simulations, details of the analysis (translational diffusion correction term, correlation function of contact formation, minimum distance for contact map), comparison of translational diffusion obtained with different thermostats, generalized internal order parameter, Sl2, RMSD, cluster distributions, conformational sampling of villin (PMF plots), representative transient cluster structures, translational and rotational diffusion coefficients of MD-derived cluster structures estimated using HYDROPRO, contact map (32 mM, 64 villin copies system), and a movie showing center of mass motion of selected villin molecules.

References

  • 1.Ellis RJ, Minton AP. Cell Biology ‐ Join the Crowd. Nature. 2003;425:27–28. doi: 10.1038/425027a. [DOI] [PubMed] [Google Scholar]
  • 2.Despa F, Orgill DP, Lee RC. Molecular Crowding Effects on Protein Stability. Ann N Y Acad Sci. 2005;1066:54–66. doi: 10.1196/annals.1363.005. [DOI] [PubMed] [Google Scholar]
  • 3.Zhou H‐X, Rivas G, Minton AP. Macromolecular Crowding and Confinement: Biochemical, Biophysical, and Potential Physiological Consequences. Annu Rev Biophys. 2008;37:375–397. doi: 10.1146/annurev.biophys.37.032807.125817. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Luchinat E, Banci L. In‐Cell NMR: A Topical Review. IUCrJ. 2017;4:108–118. doi: 10.1107/S2052252516020625. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Li C, Zhao J, Cheng K, Ge Y, Wu Q, Ye Y, Xu G, Zhang Z, Zheng W, Zhang X, et al. Magnetic Resonance Spectroscopy As a Tool for Assessing Macromolecular Structure and Function in Living Cells. Annu Rev Anal Chem. 2017;10:157–182. doi: 10.1146/annurev-anchem-061516-045237. [DOI] [PubMed] [Google Scholar]
  • 6.Feig M, Yu I, Wang P‐h, Nawrocki G, Sugita Y. Crowding in Cellular Environments at an Atomistic Level from Computer Simulations. J Phys Chem B. 2017;121:8009–8025. doi: 10.1021/acs.jpcb.7b03570. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Wang Y, Sarkar M, Smith AE, Krois AS, Pielak GJ. Macromolecular Crowding and Protein Stability. J Am Chem Soc. 2012;134:16614–16618. doi: 10.1021/ja305300m. [DOI] [PubMed] [Google Scholar]
  • 8.Yu I, Mori T, Ando T, Harada R, Jung J, Sugita Y, Feig M. Biomolecular Interactions Modulate Macromolecular Structure and Dynamics in Atomistic Model of a Bacterial Cytoplasm. eLife. 2016;5:e19274. doi: 10.7554/eLife.19274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Harada R, Tochio N, Kigawa T, Sugita Y, Feig M. Reduced Native State Stability in Crowded Cellular Environment Due to Protein‐Protein Interactions. J Am Chem Soc. 2013;135:3696–3701. doi: 10.1021/ja3126992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Gnutt D, Ebbinghaus S. The Macromolecular Crowding Effect – From in vitro into the Cell. Biol Chem. 2016;397:37–44. doi: 10.1515/hsz-2015-0161. [DOI] [PubMed] [Google Scholar]
  • 11.Heinen M, Zanini F, Roosen‐Runge F, Fedunova D, Zhang FJ, Hennig M, Seydel T, Schweins R, Sztucki M, Antalik M, et al. Viscosity and Diffusion: Crowding and Salt Effects in Protein Solutions. Soft Matter. 2012;8:1404–1419. [Google Scholar]
  • 12.Roosen‐Runge F, Hennig M, Zhang FJ, Jacobs RMJ, Sztucki M, Schober H, Seydel T, Schreiber F. Protein Self‐Diffusion in Crowded Solutions. Proc Natl Acad Sci US A. 2011;108:11815–11820. doi: 10.1073/pnas.1107287108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Dix JA, Verkman AS. Crowding Effects on Diffusion in Solutions and Cells. Ann Rev Biophys. 2008;37:247–263. doi: 10.1146/annurev.biophys.37.032807.125824. [DOI] [PubMed] [Google Scholar]
  • 14.Gabdoulline RR, Wade RC. Biomolecular Diffusional Association. Curr Opin Struc Biol. 2002;12:204–213. doi: 10.1016/s0959-440x(02)00311-1. [DOI] [PubMed] [Google Scholar]
  • 15.Wang Y, Li C, Pielak GJ. Effects of Proteins on Protein Diffusion. J Am Chem Soc. 2010;132:9392–9397. doi: 10.1021/ja102296k. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Li C, Wang Y, Pielak GJ. Translational and Rotational Diffusion of a Small Globular Protein under Crowded Conditions. J Phys Chem B. 2009;113:13390–13392. doi: 10.1021/jp907744m. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.London RE, Gregg CT, Matwiyoff NA. Nuclear Magnetic‐Resonance of Rotational Mobility of Mouse Hemoglobin Labeled with [2‐C‐13]Histidine. Science. 1975;188:266–268. doi: 10.1126/science.1118727. [DOI] [PubMed] [Google Scholar]
  • 18.Williams SP, Haggie PM, Brindle KM. F‐19 NMR Measurements of the Rotational Mobility of Proteins in vivo. Biophys J. 1997;72:490–498. doi: 10.1016/S0006-3495(97)78690-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Banks DS, Fradin C. Anomalous Diffusion of Proteins due to Molecular Crowding. Biophys J. 2005;89:2960–2971. doi: 10.1529/biophysj.104.051078. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Wilcox AE, LoConte MA, Slade KM. Effects of Macromolecular Crowding on Alcohol Dehydrogenase Activity Are Substrate‐Dependent. Biochemistry. 2016;55:3550–3558. doi: 10.1021/acs.biochem.6b00257. [DOI] [PubMed] [Google Scholar]
  • 21.Zimmerman SB, Minton AP. Macromolecular Crowding ‐ Biochemical, Biophysical, and Physiological Consequences. Ann Rev Biophys Biomol Struct. 1993;22:27–65. doi: 10.1146/annurev.bb.22.060193.000331. [DOI] [PubMed] [Google Scholar]
  • 22.Mukherjee SK, Gautam S, Biswas S, Kundu J, Chowdhury PK. Do Macromolecular Crowding Agents Exert only an Excluded Volume Effect? A Protein Solvation Study. J Phys Chem B. 2015;119:14145–14156. doi: 10.1021/acs.jpcb.5b09446. [DOI] [PubMed] [Google Scholar]
  • 23.Harada R, Sugita Y, Feig M. Protein Crowding Affects Hydration Structure and Dynamics. J Am Chem Soc. 2012;134:4842–4849. doi: 10.1021/ja211115q. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Wang QH, Zhuravleva A, Gierasch LM. Exploring Weak, Transient Protein‐Protein Interactions in Crowded In vivo Environments by In‐Cell Nuclear Magnetic Resonance Spectroscopy. Biochemistry. 2011;50:9225–9236. doi: 10.1021/bi201287e. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Ye Y, Liu X, Zhang Z, Wu Q, Jiang B, Jiang L, Zhang X, Liu M, Pielak GJ, Li C. 19F NMR Spectroscopy as a Probe of Cytoplasmic Viscosity and Weak Protein Interactions in Living Cells. Chem–Eur J. 2013;19:12705–12710. doi: 10.1002/chem.201301657. [DOI] [PubMed] [Google Scholar]
  • 26.Cohen RD, Pielak GJ. A Cell is More than the Sum of Its (Dilute) Parts: A Brief History of Quinary Structure. Protein Sci. 2017;26:403–413. doi: 10.1002/pro.3092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.McConkey EH. Molecular Evolution, Intracellular Organization, and the Quinary Structure of Proteins. Proc Natl Acad Sci US A. 1982;79:3236–3240. doi: 10.1073/pnas.79.10.3236. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Feig M, Sugita Y. Variable Interactions between Protein Crowders and Biomolecular Solutes are Important in Understanding Cellular Crowding. J Phys Chem B. 2012;116:599–605. doi: 10.1021/jp209302e. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Porcar L, Falus P, Chen WR, Faraone A, Fratini E, Hong KL, Baglioni P, Liu Y. Formation of the Dynamic Clusters in Concentrated Lysozyme Protein Solutions. J Phys Chem Lett. 2010;1:126–129. [Google Scholar]
  • 30.Cardinaux F, Zaccarelli E, Stradner A, Bucciarelli S, Farago B, Egelhaaf SU, Sciortino F, Schurtenberger P. Cluster‐Driven Dynamical Arrest in Concentrated Lysozyme Solutions. J Phys Chem B. 2011;115:7227–7237. doi: 10.1021/jp112180p. [DOI] [PubMed] [Google Scholar]
  • 31.Liu Y, Porcar L, Chen JH, Chen WR, Falus P, Faraone A, Fratini E, Hong KL, Baglioni P. Lysozyme Protein Solution with an Intermediate Range Order Structure. J Phys Chem B. 2011;115:7238–7247. doi: 10.1021/jp109333c. [DOI] [PubMed] [Google Scholar]
  • 32.Kowalczyk P, Ciach A, Gauden PA, Terzyk AP. Equilibrium Clusters in Concentrated Lysozyme Protein Solutions. J Colloid Interface Sci. 2011;363:579–584. doi: 10.1016/j.jcis.2011.07.043. [DOI] [PubMed] [Google Scholar]
  • 33.Braun MK, Grimaldo M, Roosen‐Runge F, Hoffmann I, Czakkel O, Sztucki M, Zhang FJ, Schreiber F, Seydel T. Crowding‐Controlled Cluster Size in Concentrated Aqueous Protein Solutions: Structure, Self‐ and Collective Diffusion. J Phys Chem Lett. 2017;8:2590–2596. doi: 10.1021/acs.jpclett.7b00658. [DOI] [PubMed] [Google Scholar]
  • 34.Stradner A, Sedgwick H, Cardinaux F, Poon WCK, Egelhaaf SU, Schurtenberger P. Equilibrium Cluster Formation in Concentrated Protein Solutions and Colloids. Nature. 2004;432:492–495. doi: 10.1038/nature03109. [DOI] [PubMed] [Google Scholar]
  • 35.Mani E, Lechner W, Kegel WK, Bolhuis PG. Equilibrium and Non‐Equilibrium Cluster Phases in Colloids with Competing Interactions. Soft Matter. 2014;10:4479–4486. doi: 10.1039/c3sm53058b. [DOI] [PubMed] [Google Scholar]
  • 36.Bucciarelli S, Casal‐Dujat L, De Michele C, Sciortino F, Dhont J, Bergenholtz J, Farago B, Schurtenberger P, Stradner A. Unusual Dynamics of Concentration Fluctuations in Solutions of Weakly Attractive Globular Proteins. J Phys Chem Lett. 2015;6:4470–4474. doi: 10.1021/acs.jpclett.5b02092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Pellicane G. Colloidal Model of Lysozyme Aqueous Solutions: A Computer Simulation and Theoretical Study. J Phys Chem B. 2012;116:2114–2120. doi: 10.1021/jp212048j. [DOI] [PubMed] [Google Scholar]
  • 38.Kulkarni AM, Dixit NM, Zukoski CF. Ergodic and Non‐Ergodic Phase Transitions in Globular Protein Suspensions. Faraday Discuss. 2003;123:37–50. doi: 10.1039/b204453f. [DOI] [PubMed] [Google Scholar]
  • 39.Zaccarelli E, Poon WCK. Colloidal Glasses and Gels: The Interplay of Bonding and Caging. Proc Natl Acad Sci US A. 2009;106:15203–15208. doi: 10.1073/pnas.0902294106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Woldeyes MA, Calero‐Rubio C, Furst EM, Roberts CJ. Predicting Protein Interactions of Concentrated Globular Protein Solutions Using Colloidal Models. J Phys Chem B. 2017;121:4756–4767. doi: 10.1021/acs.jpcb.7b02183. [DOI] [PubMed] [Google Scholar]
  • 41.Bucciarelli S, Myung JS, Farago B, Das S, Vliegenthart GA, Holderer O, Winkler RG, Schurtenberger P, Gompper G, Stradner A. Dramatic Influence of Patchy Attractions on Short‐Time Protein Diffusion under Crowded Conditions. Sci Adv. 2016;2:e1601432. doi: 10.1126/sciadv.1601432. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Pastore A, Temussi PA. The Emperor’s New Clothes: Myths and Truths of In‐Cell NMR. Arch Biochem Biophys. 2017;628:114–122. doi: 10.1016/j.abb.2017.02.008. [DOI] [PubMed] [Google Scholar]
  • 43.Gelman H, Wirth AJ, Gruebele M. ReAsH as a Quantitative Probe of In‐Cell Protein Dynamics. Biochemistry. 2016;55:1968–1976. doi: 10.1021/acs.biochem.5b01336. [DOI] [PubMed] [Google Scholar]
  • 44.Hänsel R, Luh LM, Corbeski I, Trantirek L, Dötsch V. In‐Cell NMR and EPR Spectroscopy of Biomacromolecules. Angew Chem Int Ed. 2014;53:10300–10314. doi: 10.1002/anie.201311320. [DOI] [PubMed] [Google Scholar]
  • 45.Wang P‐H, Yu I, Feig M, Sugita Y. Influence of Protein Crowder Size on Hydration Structure and Dynamics in Macromolecular Crowding. Chem Phys Lett. 2017;671:63–70. [Google Scholar]
  • 46.Predeus AV, Gul S, Gopal SM, Feig M. Conformational Sampling of Peptides in the Presence of Protein Crowders from AA/CG‐Multiscale Simulations. J Phys Chem B. 2012;116:8610–8620. doi: 10.1021/jp300129u. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Abriata LA, Dal Peraro M. Assessing the Potential of Atomistic Molecular Dynamics Simulations to Probe Reversible Protein‐Protein Recognition and Binding. Sci Rep. 2015;5:10549. doi: 10.1038/srep10549. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Petrov D, Zagrovic B. Are Current Atomistic Force Fields Accurate Enough to Study Proteins in Crowded Environments? PloS Comput Biol. 2014;10:e1003638. doi: 10.1371/journal.pcbi.1003638. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Best RB, Zheng W, Mittal J. Balanced Protein‐Water Interactions Improve Properties of Disorderd Proteins and Non‐Specific Protein Association. J of Chem Theory Comput. 2014;10:5113–5124. doi: 10.1021/ct500569b. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Rauscher S, Gapsys V, Gajda MJ, Zweckstetter M, de Groot BL, Grubmüller H. Structural Ensembles of Intrinsically Disordered Proteins Depend Strongly on Force Field: A Comparison to Experiment. J Chem Theory Comput. 2015;11:5513–5524. doi: 10.1021/acs.jctc.5b00736. [DOI] [PubMed] [Google Scholar]
  • 51.Piana S, Donchev AG, Robustelli P, Shaw DE. Water Dispersion Interactions Strongly Influence Simulated Structural Properties of Disordered Protein States. J Phys Chem B. 2015;119:5113–5123. doi: 10.1021/jp508971m. [DOI] [PubMed] [Google Scholar]
  • 52.McKnight CJ, Matsudaira PT, Kim PS. NMR Structure of the 35‐Residue Villin Headpiece Subdomain. Nat Struct Biol. 1997;4:180–184. doi: 10.1038/nsb0397-180. [DOI] [PubMed] [Google Scholar]
  • 53.Phillips JC, Braun R, Wang W, Gumbart J, Tajkhorshid E, Villa E, Chipot C, Skeel RD, Kale L, Schulten K. Scalable Molecular Dynamics with NAMD. J Comput Chem. 2005;26:1781–1802. doi: 10.1002/jcc.20289. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Brooks BR, Brooks CL, Mackerell AD, Nilsson L, Petrella RJ, Roux B, Won Y, Archontis G, Bartels C, Boresch S, et al. CHARMM: The Biomolecular Simulation Program. J Comput Chem. 2009;30:1545–1614. doi: 10.1002/jcc.21287. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Eastman P, Friedrichs MS, Chodera JD, Radmer RJ, Bruns CM, Ku JP, Beauchamp KA, Lane TJ, Wang LP, Shukla D, et al. OpenMM 4: A Reusable, Extensible, Hardware Independent Library for High Performance Molecular Simulation. J Chem Theory Comput. 2013;9:461–469. doi: 10.1021/ct300857j. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Jung J, Mori T, Kobayashi C, Matsunaga Y, Yoda T, Feig M, Sugita Y. GENESIS: A Hybrid‐Parallel and Multi‐Scale Molecular Dynamics Simulator with Enhanced Sampling Algorithms for Biomolecular and Cellular Simulations. Wiley Interdiscip Rev Comput Mol Sci. 2015;5:310–323. doi: 10.1002/wcms.1220. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Best RB, Zhu X, Shim J, Lopes P, Mittal J, Feig M, MacKerell AD. Optimization of the Additive CHARMM All‐Atom Protein Force Field Targeting Improved Sampling of the Backbone ϕ, ψ and Side‐Chain χ1 and χ2 Dihedral Angles. J Chem Theory Comput. 2012;8:3257–3273. doi: 10.1021/ct300400x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Jorgensen WL, Chandrasekhar J, Madura JD, Impey RW, Klein ML. Comparison of Simple Potential Functions for Simulating Liquid Water. J Chem Phys. 1983;79:926–935. [Google Scholar]
  • 59.Darden TA, York D, Pedersen LG. Particle‐Mesh Ewald: An N log(N) Method for Ewald Sums in Large Systems. J Chem Phys. 1993;98:10089–10092. [Google Scholar]
  • 60.Huang J, Rauscher S, Nawrocki G, Ran T, Feig M, de Groot BL, Grubmuller H, MacKerell AD. CHARMM36m: an improved force field for folded and intrinsically disordered proteins. Nat Methods. 2017;14:71–73. doi: 10.1038/nmeth.4067. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Grossfield A. WHAM: the Weighted Histogram Analysis Method. http://membrane.urmc.rochester.edu/content/wham (accessed November 10, 2017)
  • 62.Feig M, Karanicolas J, Brooks CL. MMTSB Tool Set: Enhanced Sampling and Multiscale Modeling Methods for Applications in Structural Biology. J Mol Graph Model. 2004;22:377–395. doi: 10.1016/j.jmgm.2003.12.005. [DOI] [PubMed] [Google Scholar]
  • 63.Humphrey W, Dalke A, Schulten K. VMD: Visual Molecular Dynamics. J Mol Graphics. 1996;14:33–38. doi: 10.1016/0263-7855(96)00018-5. [DOI] [PubMed] [Google Scholar]
  • 64.Oliphant TE. Python for Scientific Computing. Comput Sci Eng. 2007;9:10–20. [Google Scholar]
  • 65.Hunter JD. Matplotlib: A 2D Graphics Environment. Comput Sci Eng. 2007;9:90–95. [Google Scholar]
  • 66.Wong V, Case DA. Evaluating Rotational Diffusion from Protein MD Simulations. J Phys Chem B. 2008;112:6013–6024. doi: 10.1021/jp0761564. [DOI] [PubMed] [Google Scholar]
  • 67.Takemura K, Kitao A. Effects of Water Model and Simulation Box Size on Protein Diffusional Motions. J Phys Chem B. 2007;111:11870–11872. doi: 10.1021/jp0756247. [DOI] [PubMed] [Google Scholar]
  • 68.Roos M, Ott M, Hofmann M, Link S, Rössler E, Balbach J, Kruschelnitsky A, Saalwächter K. Coupling and Decoupling of Rotational and Translational Diffusion of Proteins under Crowding Conditions. J Am Chem Soc. 2016;138:10365–10372. doi: 10.1021/jacs.6b06615. [DOI] [PubMed] [Google Scholar]
  • 69.Karandur D, Wong K‐Y, Pettitt BM. Solubility and Aggregation of Gly5 in Water. J Phys Chem B. 2014;118:9565–9572. doi: 10.1021/jp503358n. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Wang MH, Tang YF, Sato SS, Vugmeyster L, McKnight CJ, Raleigh DP. Dynamic NMR Line‐ Shape Analysis Demonstrates that the Villin Headpiece Subdomain Folds on the Microsecond Time Scale. J Am Chem Soc. 2003;125:6032–6033. doi: 10.1021/ja028752b. [DOI] [PubMed] [Google Scholar]
  • 71.Kubelka J, Eaton WA, Hofrichter J. Experimental Tests of Villin Subdomain Folding Simulations. J Mol Biol. 2003;329:625–630. doi: 10.1016/s0022-2836(03)00519-9. [DOI] [PubMed] [Google Scholar]
  • 72.Feller SE, Pastor RW, Rojnuckarin A, Bogusz S, Brooks BR. Effect of Electrostatic Force Truncation on Interfacial and Transport Properties of Water. J Phys Chem. 1996;100:17011–17020. [Google Scholar]
  • 73.Korson L, Drosthan W, Millero FJ. Viscosity of Water at Various Temperatures. J Phys Chem. 1969;73:34–39. [Google Scholar]
  • 74.Lee H, Venable RM, MacKerell AD, Pastor RW. Molecular Dynamics Studies of Polyethylene Oxide and Polyethylene Glycol: Hydrodynamics Radius and Shape Anisotropy. Biophys J. 2008;95:1590–1599. doi: 10.1529/biophysj.108.133025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 75.Ortega A, Amorós D, García de la Torre J. Prediction of Hydrodynamic and Other Solution Properties of Rigid Proteins from Atomic‐ and Residue‐Level Models. Biophys J. 2011;101:892–898. doi: 10.1016/j.bpj.2011.06.046. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Okamura H, Kigawa T. NMR Analysis of the Protein Dynamics under Macromolecular Crowding Environment. 細胞工学 saiboukougaku. 2014;33:830–834. [Google Scholar]
  • 77.Barhoum S, Yethiraj A. NMR Detection of an Equilibrium Phase Consisting of Monomers and Clusters in Concentrated Lysozyme Solutions. J Phys Chem B. 2010;114:17062–17067. doi: 10.1021/jp108995k. [DOI] [PubMed] [Google Scholar]
  • 78.Liu YX, Wang XJ, Ching CB. Toward Further Understanding of Lysozyme Crystallization: Phase Diagram, Protein‐Protein Interaction, Nucleation Kinetics, and Growth Kinetics. Cryst Growth Des. 2010;10:548–558. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Movie S1
Download video file (2.6MB, mpg)

RESOURCES