Abstract
Energetics was analyzed for Trp‐cage miniprotein in water to elucidate the solvation effect in heat denaturation. The solvation free energy was computed for a set of protein structures at room and high temperatures with all‐atom molecular dynamics simulation combined with the solution theory in the energy representation, and its correlations were investigated against the intramolecular (structural) energy of the protein and the average interaction energy of the protein with the solvent water. It was observed both at room and high temperatures that the solvation free energy is anticorrelated to the structural energy and varies in parallel to the electrostatic component of the protein–water interaction energy without correlations to the van der Waals and excluded‐volume components. When the set of folded structures sampled at room temperature was compared with the set of unfolded ones at high temperature, it was found that the preference order of the two sets is in correspondence to the van der Waals and excluded‐volume components in the sum of the protein intramolecular and protein‐water intermolecular interactions and is not distinguished by the electrostatic component.
Keywords: heat denaturation, energetics, intermolecular interaction, Trp‐cage, free energy, solution theory, molecular dynamics simulation
Abbreviations
- LJ
Lennard‐Jones interaction
- MD
molecular dynamics
- PME
particle‐mesh Ewald
- SASA
solvent‐accessible surface area
Introduction
Heat is a common cause of protein denaturation. A protein may denature at elevated temperature through modification of the solvation effect since the protein structure in solution is determined by the balance between the intramolecular interaction within the protein and the intermolecular interaction with solvent. In statistical thermodynamics, the probability of finding a certain configuration (structure) of the protein in solvent (water) is given by the sum of the intramolecular (structural) energy and the free energy of solvation. The solvation free energy is thus of central importance in identifying the governing factor to control the temperature effect on the protein structure, and its analysis is desirably conducted at atomic resolution since the hydrogen bonding and hydrophobic effect play key roles and can be faithfully described with explicit solvent. The focus of the present study is the solvation free energy of Trp‐cage miniprotein (PDB code: 1L2Y)1 over a variety of configurations both at room and high temperatures. We employ the molecular dynamics (MD) simulation in combination with a statistical‐mechanical theory of solutions, and address the response of the protein‐water energetics to the temperature variation.
Trp‐cage is a 20‐residue miniprotein and exhibits a stable, folded structure at room temperature.1 It has been intensively studied both experimentally and computationally to explore the principle of structure formation of protein at atomic level.2, 3, 4, 5, 6, 7, 8, 9, 10, 11, 12, 13 In the present work, we focus on the energetics of solvation (hydration) of Trp‐cage. MD was performed with explicit solvent to sample a set of folded structures at room temperature and a set of unfolded ones at high temperature, and the correlations of the solvation free energy were examined over the two sets against the intramolecular energy of the protein, the electrostatic and van der Waals components of the protein‐water interaction energy, and the excluded‐volume effect to see how the structural variation of the protein is connected to the intramolecular and intermolecular interactions of the protein.
In a previous work, the correlation between the solvation free energy and the solute‐solvent interaction energy was analyzed to elucidate the urea‐induced effects on amino‐acid analog solutes.14 This analysis was done since the solute‐solvent interaction energy is more straightforward for interpretation and prediction than the free energy. For example, it is a rule of thumb that the electrostatic interaction in polar solvent is more favorable for a more polar solute and that the van der Waals interaction is stronger with a larger solute. The free energy reflects, in contrast, the effects of all the interaction components in nonlinear manner, though it determines the observable properties of solvation in the context of statistical thermodynamics. In Ref. 14, the interaction component which plays the decisive role in the energetics of transfer from pure‐water solvent to urea‐water mixed solvent was identified by examining the correlations of the solvation free energy against a variety of components in the solute–solvent interaction. In this work, the correlation analysis was conducted to see the interaction component which governs the effect of temperature elevation on the protein.
A key quantity in our analyses is the solvation free energy. Its computation requires much demand with explicit solvent, however, when such standard methods as free‐energy perturbation and thermodynamic integration are employed in molecular simulation; these methods are conducted in practice by introducing a number of intermediate states connecting the pure‐solvent and solution systems of interest.15, 16 In the present work, we resort to the method of energy representation to obtain the solvation free energy.17, 18, 19, 20 It is a theory of distribution functions in solution and is formulated by adopting the solute‐solvent pair interaction energy for the coordinate of the distribution functions constituting the free‐energy functional. Among a variety of approximate free‐energy methods,21, 22, 23, 24, 25, 26, 27, 28, 29, 30, 31, 32, 33, 34 the method of energy representation is unique in compromising the accuracy, the efficiency, and the range of applicability.14, 18, 19, 35, 36, 37, 38, 39, 40, 41 In the energy‐representation method, the simulations are performed only for the pure‐solvent and solution systems, and a set of energy distribution functions obtained from the simulations provides the solvation free energy through an approximate functional. The computational load is thus reduced, and the solvation free energy can be evaluated for a protein with a few hundred residues.42, 43, 44, 45, 46 It was observed, furthermore, that the error from the approximate functional is not larger than the error due to the use of force field.14, 39, 41
Results and Discussion
In the present work, two types of MD calculations were carried out. In one of them, the protein was treated as flexible. Its structure was allowed to fluctuate naturally in the presence of water, and a set of protein configurations (snapshot structures) were sampled during the course of equilibrium fluctuation. In the other, each protein structure thus sampled was kept rigid and the protein–water energetics was analyzed by allowing only the water molecules to move. The MD simulations were performed at two thermodynamics states of
R: 300 K and the solvent‐water density of 0.997 g/cm3,
H: 400 K and 0.937 g/cm3.
R is a room‐temperature state in which Trp‐cage keeps the folded structure, and H is a high‐temperature state in which the protein is mostly unfolded with the force field employed in this work.9, 10 They correspond to the (experimental) pressure of 1 atm, and the densities were taken from Ref. 47.
The protein structure was sampled from MD simulations at states R and H. In the following, the set of folded structures sampled at the room‐temperature state and the set of mostly unfolded ones at the high‐temperature state are called sets F and U, respectively; the preparation of these sets is described in detail in the subsection of “MD simulations of flexible protein”. Figure 1 shows the distributions of the solvent‐accessible surface area (SASA) for the protein structures in sets F and U. Evidently, the SASA in set U is larger and distributes over wider domain. The protein at high temperature is more extended with structural variations.
Figure 1.

Probability distribution functions of the solvent‐accessible surface area (SASA) for the protein structures in sets F and U. SASA was computed from the protein‐water boundary defined as the surface at contact with the spherical probe which mimics the water molecule and has a radius of 1.4 Å.
The protein–water interaction was analyzed at both of states R and H for each structure of the protein in sets F and U. There are thus four groups of computations: set F at state R, set F at state H, set U at state R, and set U at state H, which will be hereafter called FatR, FatH, UatR, and UatH, respectively, for brevity. The (relative) stabilities of the sets of folded and unfolded structures can then be examined by comparing FatR and UatR at room temperature and UatH and FatH at high temperature.
When the (snapshot) structure of protein is denoted as X, the probability distribution function of finding the structure X in solution is expressed as
| (1) |
where k B is the Boltzmann constant, T is the temperature, is the intramolecular (structural) energy of the protein solute, and is the free energy of solvation (hydration). consists of the bonded terms for the bond‐stretching, bending, and proper and improper torsions and the nonbonded terms of Coulombic and Lennard‐Jones forms; the nonbonded terms in will be examined later in the subsection of “Protein structure and interaction component”. The relative stabilities of sets F and U of protein structures are determined by the difference in the free energy given by
| (2) |
where S denotes either of sets F and U and specifies the domain of integration over the protein configuration (structure) X. The first term of Eq. (2) is the average intramolecular energy of the protein solute within set S, and the second term is the averaged free energy of solvation in set S. The third term corresponds to the conformational entropy and quantifies the structural variety of the protein. It is more negative (more favorable) when the protein covers a wider range of the configuration space.48, 49, 50, 51, 52, 53, 54, 55, 56, 57 Actually, is not a dimensionless quantity and the value of the third term of Eq. (2) depends on the choice of the unit of X. The term carrying the unit is common between sets F and U, however, and does not play any role in discussing the relative stability. We base our analysis on Eqs. (1) and (2) and examine the correlations among a variety of interaction components.
Intramolecular energy and solvation
To see the relationship between the protein structure and the solvation (hydration) effect on the energetic basis, in Figure 2 we present the correlation plot between the intramolecular energy and the solvation free energy . As noted above, there are four groups of data from FatR, FatH, UatR, and UatH. Over each of the data groups, and are anticorrelated to each other; the anticorrelation was also seen for cytochrome c.43 When sets F and U are compared, the ranges of variations of and are larger for UatR and UatH than for FatR and FatH. This reflects the property that the configuration space of the protein is broader for set U than for set F, as illustrated in Figure 1.
Figure 2.

Correlation plot between the intramolecular (structural) energy and the solvation (hydration) free energy , where each point in the figure refers to a (snapshot) structure in FatR, FatH, UatR, and UatH introduced at the beginning of the section of “Results and Discussion”. The correlation coefficient is −0.83, −0.84, −0.93, and −0.90 for FatR, FatH, UatR, and UatH, respectively. The slope in the linear regression with the least‐square fit of to is −1.1 for FatR and FatH and is −1.0 for UatR and UatH.
According to Figure 2, it is also seen that the average value of is more favorable (more negative) for FatR than for UatR. This shows that the sum of the first and second terms of Eq. (2) is more favorable for set F than for set U at state R. The third term will be more favorable for set U, on the other hand, since set U is an ensemble of (mostly) unfolded structures of the protein. Thus, the protein remains folded at state R because of the energetic effect represented by .
The argument is opposite at state H. Even when the temperature is elevated to 400 K, is still more favorable for set F than for set U. Set U is then more stable than set F due to the presence of the third term in Eq. (2). The conformational entropy plays a key role in heat denaturation.
Solvation free energy and solute–solvent interaction
In the previous subsection, we saw that the solvation free energy varies on the order of 100 kcal/mol over the sets of protein structures examined. It is then of interest to investigate which component of the intermolecular interaction between the protein solute and the solvent water drives the variation of . To do so, we correlate against the average sum of the solute–solvent interaction energy in the solution system (protein–water system) and its electrostatic and van der Waals components denoted as and , respectively. In the present work, was calculated for fixed structure X of the protein and was similarly computed at fixed structure; both of and are functions of X. The intermolecular interaction potential is expressed as a sum of electrostatic and van der Waals components, furthermore, and was correspondingly given by the sum of the electrostatic component and the van der Waals component . It should be noted that while in this study is calculated with an approximate functional, and its electrostatic and van der Waals components are exact (within statistical error) under the used set of force fields. The decomposition scheme of into and is described in more detail in the subsection of “Free‐energy calculation”.
In Figure 3, is correlated against , and in their dimensionless forms (divided by ; ). The correlation is clearly present for the total interaction energy and the electrostatic component . In fact, can be fit to a common straight line against over the four groups of data from FatR, FatH, UatR, and UatH. In a previous work,43 we saw that the equilibrium fluctuation of protein structure in water is driven by the electrostatic interaction between protein and water. Figure 3 shows that the correlation between and persists even over nonequilibrium ensembles of FatH and UatR. The slope of the least‐square fit of against is 2.2 with a correlation coefficient of 1.0, and this strong correlation is reminiscent of the linear response.21, 23 The van der Waals component is essentially invariant, on the other hand, over each of FatR, FatH, UatR, and UatH. It plays a minor role in determining the response of the solvation energetics to the protein structure.
Figure 3.

Correlation plots for the solvation free energy against the average sum of solute–solvent interaction energy in the solution system, the electrostatic component , the van der Waals component , and the excluded‐volume component in their dimensionless forms (divided by ; ), where each point in the figure refers to a (snapshot) structure in FatR, FatH, UatR, and UatH introduced at the beginning of the section of “Results and Discussion”. The solid line represents the linear regression from the least‐square fit of against .
When a solute is introduced into solvent, a number of solvent molecules need to be removed from the region to be occupied by the solute. The free‐energy penalty corresponding to this removal is denoted as the excluded‐volume effect. It is a major part of repulsive component of solute–solvent interaction and is pointed out as a driving force for structure formation of large molecule.56, 58, 59, 60 The excluded‐volume component is a nonobservable part of the solvation free energy, though, and its treatment requires a model of solvation. We examine the excluded‐volume effect within the framework of the approximate functional in the energy‐representation method.
The excluded volume is the region of solute‐solvent configuration where the solute molecule overlaps with the solvent and the interaction energy is prohibitively large. Within the formalism of the energy‐representation method, the solvation free energy (actually, the intermolecular part described in the subsection of “Free‐energy calculation”) is written as
| (3) |
where ϵ is the pair interaction energy between solute and solvent, denotes the average distribution (histogram) of ϵ in the solution system (protein–water system), and describes the effect of solvent reorganization including the excluded‐volume effect. is expressed as an integral over ϵ in Eq. (3), and its excluded‐volume component is obtained by restricting the integral to high‐energy domain ranging from ϵc to infinity. ϵc is the threshold for specifying the excluded‐volume domain, and its value can be set somewhat arbitrarily within a requirement that the domain of corresponds to the solute–solvent overlap and is inaccessible with vanishing in the solution system. In this work, we adopted kcal/mol, while the following discussion is valid irrespective of the (reasonable) choice of the ϵc value.
The correlation of the excluded‐volume component is examined in Figure 3 against the (total) solvation free energy in their dimensionless forms (divided by ). It is seen that the correlation is absent over each of FatR, FatH, UatR, and UatH. The role of the excluded‐volume effect is minor in describing the dependence of the solvation energetics on the protein structure.
Protein structure and interaction component
As described at the beginning of the section of “Results and Discussion”, the stability of a (snapshot) structure X of the protein is determined by . In the present subsection, we pursue the interaction component which is reflected in the protein stability. To do so, the relationships of are analyzed against the electrostatic, van der Waals, and excluded‐volume components in the intramolecular (structural) energy of the protein solute and the intermolecular interaction between the solute and the solvent water. Figure 4 shows these interaction components against , where and are the electrostatic and van der Waals components in the protein intramolecular energy , respectively, and and are the electrostatic and van der Waals components of the solute–solvent interaction energy, respectively. The excluded‐volume effect is not present in the intramolecular energy of the protein solute, and is expressed only with the intermolecular contribution .
Figure 4.

Correlation plots for against the electrostatic component in the sum of the intramolecular and intermolecular energies , the corresponding van der Waals component , and the excluded‐volume component at (a) state R (FatR and UatR) and (b) state H (FatH and UatH), where each point in the figure refers to a (snapshot) structure in FatR, FatH, UatR, and UatH introduced at the beginning of the section of “Results and Discussion”.
When the protein structure varies within set F, does not correlate to any of , , and . If taken alone, none of the electrostatic, van der Waals, and excluded‐volume components plays a decisive role in determining the preference of the protein structure in FatR and FatH. The structural variation extends over a larger scale in set U, on the other hand, as shown in Figure 1. The correlation is then observed over set U (for UatR and UatH) between and . This indicates that the structural preference of the protein in a large‐scale variation reflects the excluded‐volume effect.
When sets F and U are compared, it is observed in Figure 4 that the van der Waals and excluded‐volume components are more favorable (more negative) in set F than in set U, in correspondence to the relative stabilities of sets F and U shown in Figure 2 [when the conformational entropy expressed as the third term of Eq. (2) is disregarded]. To see this point more clearly, in Table 1 we list the averages and standard deviations of , , and in FatR, FatH, UatR, and UatH. According to Table 1, and are distinct between FatR and UatR and between FatH and UatH with the margins of their standard deviations. As a consequence, when the Gaussian distribution is assumed for the values of the interaction components in Table 1, the van der Waals and excluded‐volume components provide the correct order of preference between sets F and U with a probability of more than 0.7 ( ) when single structures are sampled from both sets and their interactions are compared. Detailed arguments are as follows. When m and σ are the average and standard deviation of a Gaussian distribution of a stochastic variable x, respectively, x is sampled with a probability of 0.84 either in or in . Let x F be the value of the van der Waals or excluded‐volume component for a protein structure sampled from set F and x U be the value from set U. The correct order of structural preference is then given with single samplings from sets F and U if x F < x U. Table 1 shows, in turn, that, for set F is similar to for set U, where m F, σ F, m U, and σ U refer to the statistical quantities in sets F and U. The probability of x F < x U is thus larger than that satisfying and since the latter is a sufficient condition of the former, and is more than 0.7 given that each of and is realized independently with a probability of 0.84. A similar argument does not hold for the electrostatic component since its distribution overlaps significantly between FatR and UatR and between FatH and UatH.
Table 1.
Averages and Standard Deviations of , , and in FatR, FatH, UatR, and UatH
| State R | State H | |||
|---|---|---|---|---|
| FatR | UatR | FatH | UatH | |
| Electrostatic | −1255 (18) | −1249 (52) | −1209 (17) | −1198 (48) |
| van der Waals | −110 (6) | −94 (10) | −103 (6) | −86 (10) |
| Excluded‐volume | 155 (1) | 161 (3) | 179 (2) | 186 (4) |
Each entry contains the average and standard deviation and the standard deviation is in parenthesis.
In Figure 3, we saw that the solvation free energy correlates strongly to the electrostatic component of the solute‐solvent interaction energy. Figure 4 and Table 1 shows, on the other hand, that the electrostatic component in the sum of the intramolecular and intermolecular energies does not vary with the protein structure in correspondence to . Actually, it is seen in Figure 5 that the (total) intramolecular energy is also correlated well to its electrostatic component when and are not large. Still, the correlations to the electrostatic components are canceled when the intramolecular and intermolecular effects are summed into , and within the context of correlation, the relative stabilities of sets F and U of protein structures are not governed by the electrostatic interaction at both of states R and H.
Figure 5.

Correlation plots for the protein intramolecular energy against its electrostatic component and the van der Waals component , where each point in the figure refers to a (snapshot) structure in sets F and U introduced at the beginning of the section of “Results and Discussion”.
The opposite argument holds for the van der Waals component. Figure 3 shows that the solvation free energy does not correlate to the van der Waals component of the solute‐solvent energy, and according to Figure 5, the correlation is absent between the intramolecular energy and its van der Waals component . When the intramolecular and intermolecular effects are summed, in contrast, the van der Waals component is in correspondence to the preference order of the protein structure in the large‐scale variation between sets F and U.
As noted above for the correlation within set U and the comparison between sets F and U, the preference of the protein structure is in correspondence to the excluded‐volume effect in a large‐scale variation of the structure. This is actually consistent with Kinoshita et al.'s argument that the protein structure is determined to maximize the “water entropy”.58, 59, 61 Indeed, the excluded‐volume component in the solvation free energy quantifies the free‐energy penalty due to the solute‐solvent overlap, and the solvent water can “move” over larger domain when the configuration space is smaller for the overlap.
Conclusions
Energetics was analyzed for Trp‐cage miniprotein in water at room and high temperatures over a set of folded structures and a set of mostly unfolded ones. It was seen that the sum of the intramolecular energy and the solvation free energy is more favorable (more negative) for the set of folded structures than for the set of unfolded ones at both the room and high temperatures. This shows that the protein is folded due to the energetic effect at room temperature, while it becomes unfolded by heat due to the conformational entropy. When the protein structure varies either at room or high temperature, the solvation free energy changes in parallel to the electrostatic component in the solute‐solvent interaction energy and does not correlate to the van der Waals and excluded‐volume components. When a large‐scale variation of the protein structure was examined through comparison of the sets of folded and unfolded structures, it was observed that the preference order of the two sets is provided by the van der Waals or excluded‐volume component in the sum of the protein intramolecular and protein–water intermolecular effects. The electrostatic component was not distinct between the sets of folded and unfolded structures.
In the present work, the energetics in heat denaturation was investigated in terms of the correlations among a variety of interaction components. It should be noted that although intuitively appealing, the electrostatic, van der Waals, and excluded‐volume interactions are not observable and cannot be separately tuned for real molecules. Still, these notions are useful for interpreting and predicting a large number of observable phenomena, and the correlation analysis may provide a route to justifying or even finding a concept of practical use.
Materials and Methods
MD setups
The solvent was pure water and the solute was Trp‐cage (PDB code: 1L2Y). Its N‐ and C‐termini were represented as and COO−, respectively, and the protein has 304 atoms with a total charge of +1. The original TIP3P model (without the Lennard‐Jones term for the hydrogen atom) was used for water, and the protein was treated with the Amber99sb force field.62, 63
All the molecular dynamics (MD) simulations were conducted in the canonical (NVT) ensemble using GROMACS 4.5.5.64 The MD unit cell was cubic, and the periodic boundary condition was employed with the minimum image convention. A single solute and 20000 water molecules were located in the unit cell. The edge length of the unit cell was 84.4 Å when the solvent density is 0.997 g/cm3 (state R listed at the beginning of Results and Discussion), and was 86.1 Å when the solvent density is 0.937 g/cm3 (state H). The equation of motion was integrated with the leap‐frog algorithm at a time step of 2 fs, and the stochastic dynamics integrator was used at a coupling constant of 2 ps to keep the system at the temperatures described in Results and Discussion.65 The lengths were fixed with LINCS for all the bonds in the protein, and the water molecules were kept rigid with SETTLE.66, 67
The electrostatic interaction was handled by the smooth particle‐mesh Ewald (PME) method with a real‐space cutoff of 13.5 Å, a spline order of 4, a relative tolerance of (inverse decay length of 0.23 Å– 1), and a reciprocal‐space mesh size of 81 for each of the x, y, and z directions.68 The Lennard‐Jones (LJ) interaction was truncated by applying the switching function in the switching range of 10–12 Å. The truncation was done on atom–atom basis both for the real‐space part of PME interaction and the LJ interaction, and the long‐range correction of LJ interaction was not included since it has no effect on the correlation analysis described in Results and Discussion.
MD simulations of flexible protein
First, the PDB structure of Trp‐cage was energy‐minimized in vacuum by the steepest descent method. The energy‐minimized protein was then solvated by water, and the water configuration was equilibrated over 1 ns at state R by freezing the protein structure. An MD simulation at state R was subsequently performed for 30 ns by treating the protein as flexible. The first 10 ns of the MD was discarded as the equilibration period, and after that, the solvent‐accessible surface area was calculated by sampling the protein structure every 50 ps and defining the protein‐water boundary as the surface at contact with a spherical probe of a radius of 1.4 Å. The snapshot structure of the protein employed for the analyses of energetics was saved every 1 ns between 11 and 30 ns, and set F defined at the beginning of Results and Discussion consists of 20 structures in total.
To prepare set U, the initial configuration to simulate state R described above was employed and MD was conducted at a temperature of 500 K with the MD cell size given in the preceding subsection for state H. The 500 K MD was done over 70 ns, and the protein‐water configurations at 30, 40, 50, 60, and 70 ns were used as initial configurations for the subsequent MD at state H. Each MD at state H was carried out over 35 ns, and the SASA was evaluated every 50 ps by discarding the first 20 ns of the 35 ns MD. The analyses of energetics were conducted over the snapshot structures of the protein saved every 5 ns between 20 and 35 ns. Set U thus consists of 20 structures of the protein.
It should be noted that in each simulation with the flexible protein, the protein center of mass was fixed at the center of the MD unit cell. The protein was kept at the cell center only to prevent possible complications in handling the simulation data due to the periodic boundary condition, and this procedure does not affect the statistical quantities of interest.
Free‐energy calculation
For each of the protein structures in sets F and U, the solvation free energy was computed at both of states R and H. The protein structure was kept rigid in the free‐energy calculation, and the method of energy representation was employed for the calculation of the solvation free energy.18, 20, 37 In this method, a set of energy distribution functions are obtained from simulation and are substituted into a functional for the solvation free energy. For each structure of the protein solute, MD was done for two systems. One is the solution system of interest, with the position and orientation of the solute frozen. In the solution system, the solute interacts with the solvent at full coupling, and the distribution function of solute–solvent pair energy is to be determined. The solution MD was conducted over 2 ns, and the instantaneous configuration (snapshot) was sampled every 100 fs. The other is of pure water, for which an MD is performed only with water for 500 ps at a sampling interval of 100 fs. The simulation setups including the MD cell size were identical between the solution and pure‐solvent systems, except for the simulation length. The protein solute was inserted into the pure‐solvent system as a test particle to obtain the density of states of solute–solvent pair interaction and the solvent–solvent pair correlation. Upon insertion, the solute almost always overlaps with solvent molecules, and the overlapping configurations account for the excluded‐volume term of the solvation free energy in the energy‐representation method.18, 20, 37 In the present study, the test‐particle insertion was carried out at random position and orientation with the fixed structure of the solute for 200 times per pure‐water configuration sampled, leading to the generation of 106 solute‐solvent configurations in total. An exception is the protein structure at 30 ns in set F, with which the convergence of the free‐energy calculation was examined by prolonging both the solution and pure‐water MD to 10 ns; the results are illustrated in the next paragraph. The solvation free energy was obtained through an approximate functional given in previous papers, where the methodological details are described.18, 20, 37
As described above, the convergence of the free‐energy calculation was examined at state R with the 30 ns structure in set F by conducting 10 ns MD for both the solution and pure water. Figure 6 shows the solvation free energy as function of the MD length. It is seen that the convergence within 0.2 kcal/mol is reached in 2 ns and 500 ps of the solution and pure‐solvent calculations, respectively, and thus the MD lengths were set to the ones given in the preceding paragraph.
Figure 6.

Convergence of the solvation free energy of the 30 ns structure in set F as function of the MD length at state R. The solid line refers to the dependence on the MD length of the solution system with keeping the pure‐solvent MD length 10 ns. The dashed line is for the pure water at fixed 10‐ns length of the solution MD. The cumulative number of insertions is also shown as the upper abscissa; 500 ps corresponds to 106 insertions since pure water was sampled every 100 fs and the number of insertions was 200 per solvent configuration sampled. The upper abscissa is meaningful only for the pure‐solvent plot.
Throughout the present study, no counterion was placed against the charged protein. The effect of the apparent non‐neutrality was corrected by adding the self‐energy term of electrostatic interaction to the solvation free energy.69, 70, 71 As described in the subsection of “MD setups”, the particle‐mesh Ewald method was adopted in our simulations to handle the electrostatic interaction,68 and its self‐energy term refers to the interaction energy of the solute with its own images and neutralizing background. When the self energy is denoted as , the solvation free energy is computed as
| (4) |
where is the free‐energy change of turning on the solute‐solvent intermolecular interaction. In our calculations, was evaluated exactly and was obtained approximately by the energy‐representation method. Actually, was found to stay in the range of −5 to −7 kcal/mol over the whole sets of protein structures. Thus, in Eq. (4) plays no role in the correlation analysis in Results and Discussion, though the values presented in this paper are those after adding . In addition, the intermolecular interaction potential is typically expressed as a sum of electrostatic and van der Waals components, as is so in the present work. The ensemble average of the total interaction energy between solute and solvent is then written as the sum of the electrostatic component and the van der Waals component . In our treatment, the self energy is not added to and and is incorporated to the (total) solute–solvent energy through
| (5) |
in correspondence to Eq. (4). In any case, the correlation results reflect only the intermolecular effect since the self‐energy contribution is essentially constant.
Acknowledgments
The simulations were conducted partly using computational resources of the HPCI systems provided by COMA at University of Tsukuba, TSUBAME2.5 at Tokyo Institute of Technology, CX400 at Nagoya University, FX10 at University of Tokyo, and Cray XC30 at Kyoto University through the HPCI System Research Project (Project IDs:hp140156, hp140214, hp150131, hp150137, and hp150231). We congratulate the 65th anniversary of Prof. Ronald M. Levy. N. M. started his theoretical/computational studies of solutions and proteins under Ron's supervision and owes a lot to him throughout the career. N. M. has spent fruitful decades with Ron both scientifically and personally, and is honored to make a contribution to the anniversary issue.
References
- 1. Neidigh JW, Fesinmeyer RM, Andersen NH (2002) Designing a 20‐residue protein. Nat Struct Mol Biol 9:425–430. [DOI] [PubMed] [Google Scholar]
- 2. Zhou R (2003) Trp‐cage: Folding free energy landscape in explicit water. Proc Natl Acad Sci 100:13280–13285. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3. Ahmed Z, Beta IA, Mikhonin AV, Asher SA (2005) UV‐resonance Raman thermal unfolding study of Trp‐cage shows that it is not a simple two‐state miniprotein. J Am Chem Soc 127:10943–10950. [DOI] [PubMed] [Google Scholar]
- 4. Juraszek J, Bolhuis PG (2006) Sampling the multiple folding mechanisms of Trp‐cage in explicit solvent. Proc Natl Acad Sci 103:15859–15864. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5. Chen J, Im W III, Brooks CL (2006) Balancing solvation and intramolecular interactions: Toward a consistent generalized born force field. J Am Chem Soc 128:3728–3736. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6. Barua B, Lin JC, Williams VD, Kummler P, Neidigh JW, Andersen NH (2008) The Trp‐cage: Optimizing the stability of a globular miniprotein. Protein Eng Des Sel 21:171–185. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Gattin Z, Riniker S, Hore PJ, Mok KH, van Gunsteren WF (2009) Temperature and urea induced denaturation of the TRP‐cage mini protein TC5b: A simulation study consistent with experimental observations. Prot Sci 18:2090–2099. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Canchi DR, Paschek D, García AE (2010) Equilibrium study of protein denaturation by urea. J Am Chem Soc 132:2338–2344. [DOI] [PubMed] [Google Scholar]
- 9. Canchi DR, García AE (2011) Backbone and side‐chain contributions in protein denaturation by urea. Biophys J 100:1526–1533. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10. Paschek D, Day R, García AE (2011) Influence of water‐protein hydrogen bonding on the stability of Trp‐cage miniprotein. A comparison between the TIP3P and TIP4P‐Ew water models. Phys Chem Chem Phys 13:19840–19847. [DOI] [PubMed] [Google Scholar]
- 11. Scian M, Lina JC, Le Trong I, Makhatadzed GI, Stenkampb RE, Andersen NH (2012) Crystal and NMR structures of a Trp‐cage mini‐protein benchmark for computational fold prediction. Proc Natl Acad Sci 109:12521–12525. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12. Meuzelaar H, Marino KA, Huerta‐Viga A, Panman MR, Smeenk LEJ, Kettelarij AJ, van Maarseveen JH, Timmerman P, Bolhuis PG, Woutersen S (2013) Folding dynamics of the trp‐cage miniprotein: Evidence for a native‐like intermediate from combined time‐resolved vibrational spectroscopy and molecular dynamics simulations. J Phys Chem B 117:11490–11501. [DOI] [PubMed] [Google Scholar]
- 13. English CA, García AE (2014) Folding and unfolding thermodynamics of the TC10b Trp‐cage miniprotein. Phys Chem Chem Phys 16:2748–2757. [DOI] [PubMed] [Google Scholar]
- 14. Karino Y, Matubayasi N (2013) Interaction‐component analysis of the urea effect on amino acid analogs. Phys Chem Chem Phys 15:4377–4391. [DOI] [PubMed] [Google Scholar]
- 15. Allen MP, Tildesley DJ (1987) Computer simulation of liquids. New York: Oxford Science Publications. [Google Scholar]
- 16. Frenkel D, Smit B (1996) Understanding molecular simulation: From algorithms to applications. London: Academic Press.. [Google Scholar]
- 17. Matubayasi N, Nakahara M (2000) Theory of solutions in the energetic representation. I. Formulation. J Chem Phys 113:6070–6081. [Google Scholar]
- 18. Matubayasi N, Nakahara M (2002) Theory of solutions in the energy representation. II. Functional for the chemical potential. J Chem Phys 117:3605–3616. erratum, (2003), J Chem Phys 118: 2446. [Google Scholar]
- 19. Matubayasi N, Nakahara M (2003) Theory of solutions in the energy representation. III. Treatment of the molecular flexibility. J Chem Phys 119:9686–9702. [Google Scholar]
- 20. Sakuraba S, Matubayasi N (2014) ERmod: Fast and versatile computation software for solvation free energy with approximate theory of solutions. J Comput Chem 35:1592–1608. [DOI] [PubMed] [Google Scholar]
- 21. Levy RM, Belhadj M, Kitchen DB (1991) Gaussian fluctuation formula for electrostatic free‐energy changes in solution. J Chem Phys 95:3627–3633. [Google Scholar]
- 22. Luzhkov V, Warshel A (1992) Microscopic models for quantum mechanical calculations of chemical processes in solutions: LD/AMPAC and SCAAS/AMPAC calculations of solvation energies. J Comput Chem 13:199–213. [Google Scholar]
- 23. Åqvist J, Medina C, Samuelsson JE (1994) A new method for predicting binding affinity in computer‐aided drug design. Prot Eng 7:385–391. [DOI] [PubMed] [Google Scholar]
- 24. Carlson HA, Jorgensen WL (1995) An extended linear response method for determining free energies of hydration. J Phys Chem 99:10667–10673. [Google Scholar]
- 25. Kast SM (2001) Combinations of simulation and integral equation theory. Phys Chem Chem Phys 3:5087–5092. [Google Scholar]
- 26. Vener MV, Leontyev IV, Dyakov Yu A, Basilevsky MV, Newton MD (2002) Application of the linearized MD approach for computing equilibrium solvation free energies of charged and dipolar solutes in polar solvents. J Phys Chem B 106:13078–13088. [Google Scholar]
- 27. Hirata F, editor (2003) Molecular theory of solvation. Dordrecht, Netherlands: Kluwer Academic Publishers.. [Google Scholar]
- 28. Fdez Galván I, Sánchez ML, Martín ME, Olivares del Valle FJ, Aguilar MA. Geometry optimization of molecules in solution: Joint use of the mean field approximation and the free‐energy gradient method. J Chem Phys 2003;118:255–263. [Google Scholar]
- 29. Freedman H, Truong TN (2004) Coupled reference interaction site model/simulation approach for thermochemistry of solvation: Theory and prospects. J Chem Phys 121:2187–2198. [DOI] [PubMed] [Google Scholar]
- 30. Imai T, Harano Y, Kinoshita M, Kovalenko A, Hirata F (2006) A theoretical analysis on hydration thermodynamics of proteins. J Chem Phys 125:024911. [DOI] [PubMed] [Google Scholar]
- 31. Chuev GN, Fedorov MV, Crain J (2007) Improved estimates for hydration free energy obtained by the reference interaction site model. Chem Phys Lett 448:198–202. [Google Scholar]
- 32. Yamamoto T (2008) Variational and perturbative formulations of quantum mechanical/molecular mechanical free energy with mean‐field embedding and its analytical gradients. J Chem Phys 129:244104–2441015. [DOI] [PubMed] [Google Scholar]
- 33. Frolov AI, Ratkova EL, Palmer DS, Fedorov MV (2011) Hydration thermodynamics using the reference interaction site model: Speed or accuracy? J Phys Chem B 115:6011–6022. [DOI] [PubMed] [Google Scholar]
- 34. Weber V, Asthagiri D (2012) Regularizing binding energy distributions and the hydration free energy of protein cytochrome c from all‐atom simulations. J Chem Theory Comput 8:3409–3415. [DOI] [PubMed] [Google Scholar]
- 35. Takahashi H, Matubayasi N, Nakahara M, Nitta T (2004) A quantum chemical approach to the free energy calculations in condensed systems: The QM/MM method combined with the theory of energy representation. J Chem Phys 121:3989–3999. [DOI] [PubMed] [Google Scholar]
- 36. Matubayasi N, Liang KK, Nakahara M (2006) Free‐energy analysis of solubilization in micelle. J Chem Phys 124:154908. [DOI] [PubMed] [Google Scholar]
- 37. Matubayasi N, Shinoda W, Nakahara M (2008) Free‐energy analysis of the molecular binding into lipid membrane with the method of energy representation. J Chem Phys 128:195107–1951013. [DOI] [PubMed] [Google Scholar]
- 38. Takahashi H, Ohno H, Kishi R, Nakano M, Matubayasi N (2008) Computation of the free energy change associated with one‐electron reduction of coenzyme immersed in water: A novel approach within the framework of the quantum mechanical/molecular mechanical method combined with the theory of energy representation. J Chem Phys 129:205103. [DOI] [PubMed] [Google Scholar]
- 39. Karino Y, Fedorov MV, Matubayasi N (2010) End‐point calculation of solvation free energy of amino‐acid analogs by molecular theories of solution. Chem Phys Lett 496:351–355. [Google Scholar]
- 40. Kawakami T, Shigemoto I, Matubayasi N (2012) Free‐energy analysis of water affinity in polymer studied by atomistic molecular simulation combined with the theory of solutions in the energy representation. J Chem Phys 137:234903. erratum, (2014), J. Chem. Phys. 140: 169903 (2 pages). [DOI] [PubMed] [Google Scholar]
- 41. Frolov AI (2015) Accurate calculation of solvation free energies in supercritical fluids by fully atomistic simulations: Probing the theory of solutions in energy representation. J Chem Theory Comput 11:2245–2256. [DOI] [PubMed] [Google Scholar]
- 42. Saito H, Matubayasi N, Nishikawa K, Nagao H (2010) Hydration property of globular proteins: An analysis of solvation free energy by energy representation method. Chem Phys Lett 497:218–222. [Google Scholar]
- 43. Karino Y, Matubayasi N (2011) Free‐energy analysis of hydration effect on protein with explicit solvent: Equilibrium fluctuation of cytochrome c . J Chem Phys 134:041105. [DOI] [PubMed] [Google Scholar]
- 44. Takemura K, Guo H, Sakuraba S, Matubayasi N, Kitao A (2012) Evaluation of protein‐protein docking model structures using all‐atom molecular dynamics simulations combined with the solution theory in the energy representation. J Chem Phys 137:215105. [DOI] [PubMed] [Google Scholar]
- 45. Mizukami T, Saito H, Kawamoto S, Miyakawa T, Iwayama M, Takasu M, Nagao H (2012) Solvation effect on the structural change of a globular protein: A molecular dynamics study. Int J Quant Chem 112:344–350. [Google Scholar]
- 46. Saito H, Iwayama M, Mizukami T, Kang J, Tateno M, Nagao H (2013) Molecular dynamics study on binding free energy of Azurin‐Cytochrome c551 complex. Chem Phys Lett 556:297–302. [Google Scholar]
- 47. Kell GS, Whalley E (1975) Reanalysis of the density of liquid water in the range 0–150°C and 0–1 kbar. J Chem Phys 62:3496–3503. [Google Scholar]
- 48. Sturtevant JM (1977) Heat capacity and entropy changes in processes involving proteins. Proc Natl Acad Sci 74:2236–2240. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49. Karplus M, Kushick JN (1981) Method for estimating the configurational entropy of macromolecules. Macromolecules 14:325–332. [Google Scholar]
- 50. Baldwin RL (1986) Temperature dependence of the hydrophobic interaction in protein folding. Proc Natl Acad Sci 83:8069–8072. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51. Schlitter J (1993) Estimation of absolute and relative entropies of macromolecules using the covariance matrix. Chem Phys Lett 215:617–621. [Google Scholar]
- 52. Privalov PL (1997) Thermodynamics of protein folding. J Chem Thermodyn 29:447–474. [Google Scholar]
- 53. Scháfer H, Mark AE, van Gunsteren WF (2000) Absolute entropies from molecular dynamics simulation trajectories. J Chem Phys 113:7809–7817. [Google Scholar]
- 54. Andricioaei I, Karplus M (2001) On the calculation of entropy from covariance matrices of the atomic fluctuations. J Chem Phys 115:6289–6292. [Google Scholar]
- 55. Fitter J (2003) A measure of conformational entropy change during thermal protein unfolding using neutron spectroscopy. Biophys J 84:3924–3930. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56. Harano Y, Kinoshita M (2005) Translational‐entropy gain of solvent upon protein folding. Biophys J 89:2701–2710. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57. Harpole KW, Sharp KA (2011) Calculation of configurational entropy with a boltzmann‐quasiharmonic model: The origin of high‐affinity protein‐ligand binding. J Phys Chem B 115:9461–9472. [DOI] [PubMed] [Google Scholar]
- 58. Yasuda S, Yoshidome T, Harano Y, Roth R, Oshima H, Oda K, Sugita Y, Ikeguchi M, Kinoshita M (2011) Free‐energy function for discriminating the native fold of a protein from misfolded decoys. Proteins 79:2161–2171. [DOI] [PubMed] [Google Scholar]
- 59. Kinoshita M (2013) A new theoretical approach to biological self‐assembly. Biophys Rev 5:283–293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60. Rodríguez‐Ropero F, van der Vegt NFA (2015) On the urea induced hydrophobic collapse of a water soluble polymer. Phys Chem Chem Phys 17:8491–8498. [DOI] [PubMed] [Google Scholar]
- 61. Kinoshita M (2009) Roles of translational motion of water molecules in sustaining life. Front Biosci 14:3419–3454. [DOI] [PubMed] [Google Scholar]
- 62. Jorgensen W. L, Chandrasekhar J, Madura J. D, Impey R. W, Klein M. L. Comparison of simple potential functions for simulating liquid water. J Chem Phys 1983;79:926–935. [Google Scholar]
- 63. Hornak V, Abel R, Okur A, Strockbine B, Roitberg A, Simmerling C (2006) Comparison of multiple amber force fields and development of improved protein backbone parameters. Proteins 65:712–725. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64. Hess B, Kutzner C, van der Spoel D, Lindahl E (2008) Gromacs 4: Algorithms for highly efficient, load‐balanced, and scalable molecular simulation. J Chem Theory Comput 4:435–447. [DOI] [PubMed] [Google Scholar]
- 65. van Gunsteren WF, Berendsen HJC (1988) A leap‐frog algorithm for stochastic dynamics. Mol Simul 1:173–185. [Google Scholar]
- 66. Miyamoto S, Kollman PA (1992) Settle: an analytical version of the SHAKE and RATTLE algorithm for rigid water models. J Comput Chem 13:952–962. [Google Scholar]
- 67. Hess B, Bekker H, Berendsen HJC, Fraaije JGEM (1997) Lincs: A linear constraint solver for molecular simulations. J Comput Chem 18:1463–1472. [Google Scholar]
- 68. Essmann U, Perera L, Berkowitz ML, Darden T, Lee H, Pedersen LG (1995) A smooth particle mesh Ewald method. J Chem Phys 103:8577–8593. [Google Scholar]
- 69. Figueirido F, Buono GS, Del Levy RM (1995) On finite‐size effects in computer simulations using the Ewald potential. J Chem Phys 103:6133–6142. [Google Scholar]
- 70. Hummer G, Pratt LR, García AE (1996) Free energy of ionic hydration. J Phys Chem 100:1206–1215. [Google Scholar]
- 71. Ekimoto T, Matubayasi N, Ikeguchi M (2015) Finite‐size effect on the charging free energy of protein in explicit solvent. J Chem Theory Comput 11:215–223. [DOI] [PubMed] [Google Scholar]
