Methods for calculating the entropy and free energy and their application to problems involving protein flexibility and ligand binding

Hagai Meirovitch; Srinath Cheluvaraja; Ronald P White

doi:10.2174/138920309788452209

. Author manuscript; available in PMC: 2010 Apr 1.

Published in final edited form as: Curr Protein Pept Sci. 2009 Jun;10(3):229–243. doi: 10.2174/138920309788452209

Methods for calculating the entropy and free energy and their application to problems involving protein flexibility and ligand binding

Hagai Meirovitch ¹, Srinath Cheluvaraja ¹, Ronald P White ¹

PMCID: PMC2716075 NIHMSID: NIHMS97861 PMID: 19519453

Abstract

The Helmholtz free energy, F and the entropy, S are related thermodynamic quantities with a special importance in structural biology. We describe the difficulties in calculating these quantities and review recent methodological developments. Because protein flexibility is essential for function and ligand binding, we discuss the related problems involved in the definition, simulation, and free energy calculation of microstates (such as the α-helical region of a peptide). While the review is broad, a special emphasize is given to methods for calculating the absolute F (S), where our HSMC(D) method is described in some detail.

I.Introduction

The absolute entropy, S and the absolute Helmholtz free energy, F (or G – Gibbs free energy) are fundamental quantities in statistical mechanics with a special importance in structural biology. S is a measure of order where changes in the S of water lead to the hydrophobic interaction – the main driving force in protein folding. F constitutes the criterion of stability, which is essential for studying the structure and function of peptides, proteins, nucleic acids, and other biological macromolecules. The free energy defines the binding affinities of protein-protein and protein-ligand interactions, it also quantifies many other important processes such as enzymatic reactions, electron transfer, ion transport through membranes, and the solvation of small molecules.

However, calculation of F(S) by computer simulation is extremely difficult, and considerable attention has thus been devoted in the last 50 years to this subject. While significant progress has been made (see reviews in [1-9]), in many cases the efficiency (or accuracy) of existing methods is unsatisfactory and the need for new ideas has kept this field highly active. We summarize here mainly recent developments in this area of research where the emphasis is on methodology issues and less on applications. The present article constitutes a substantial extension of a concise review appeared recently [7].

II. General theoretical considerations

In this section we define various thermodynamic quantities and discuss the problems involved in estimating them by computer simulation.

II.1 Statistical mechanics

The commonly used simulation techniques, Metropolis Monte Carlo (MC) [10] and molecular dynamics (MD) [11,12] are exact methods which enable one to generate samples of system configurations i distributed according to the Boltzmann probability, P_i^B (for simplicity we discuss a discrete system such as a lattice model of N particles),

P_{i}^{B} = \frac{exp [- E_{i} / k_{B} T]}{Z},

(1)

where T is the absolute temperature, k_B is the Boltzmann constant and E_i is the potential energy of configuration i; Z is the configurational partition function.

Z = \sum_{i} exp [- E_{i} / k_{B} T],

(2)

where the summation (integration for a continuum system) is carried out over the entire ensemble of configurations. The ensemble averages of the energy, <E>, and the absolute entropy, S, are given by

〈 E 〉 = \sum_{i} P_{i}^{B} E_{i}

(3)

and

S = 〈 S 〉 = - k_{B} \sum_{i} P_{i}^{B} ln P_{i}^{B} .

(4)

where the free energy, F, can also be expressed (formally) as an ensemble average,

F = 〈 F 〉 = - k_{B} T ln Z = \sum_{i} P_{i}^{B} [E_{i} + k_{B} T ln P_{i}^{B}] = 〈 E 〉 - T S .

(5)

An extremely important property of this representation of F (but not of other representations) is that its variance vanishes, σ²(F)=0; indeed, substituting the expression for P_i^B [equation (1)] in the brackets [equation (5)] leads to a constant, -k_BTlnZ for any i [13,14]. This means that the exact free energy can be obtained from any single structure i if P_i^B is known. Moreover, while F is an extensive variable, its zero fluctuation property holds for any number of atoms N. This important property is not shared by the entropy and the energy - their fluctuations increase as ∼N^1/2, which makes it difficult to calculate small entropy and energy changes [see discussion following equation (25)]. <E> can be estimated from a sample of size n generated with MC or MD (i.e., with the correct Boltzmann probability) by the arithmetic average, E̅

\bar{E} = n^{- 1} \sum_{t = 1}^{n} E_{t}

(6)

where the values of E_t are easily measured for each of the sampled configurations (e.g, the sum of the Lennard-Jones interactions for argon). (One has to distinguish between a summation over the entire ensemble which is denoted by the index i and a summation over a sample of n configurations which is denoted by the index t.) On the other hand, estimation of S by S̅,

\bar{S} = n^{- 1} \sum_{t = 1}^{n} ln P_{t}^{B},

(7)

is not straightforward because (unlike the energy) the value of ln P_t^B is not “written” directly on each of the sampled configuration, rather, it is a function of the entire ensemble through Z [equation (2)]; moreover, Z is not provided by the MC and MD simulation methods (methods which are of a “dynamic type”). The difficulty in calculating F stems from the relation F=E-TS.

II.2 The free energy of a partial region of space

The above discussion in terms of a discrete system also holds for a continuum system where the potential energy is E(x), x ∈ Ω is a 3N-dimensional vector of the Cartesian coordinates of the N atoms, and Ω is the entire configurational space. Thus, the summations (over the entire ensemble) are replaced by integrations over Ω. Moreover, this theory also applies to any partial region Ω_m of Ω, where a corresponding partial free energy, F_m can be defined,

F_{m} = - k_{B} T ln Z_{m} = - k_{B} T ln \int_{Ω_{m}} exp [- E (x) / k_{B} T] d x = E_{m} - T S_{m} .

(8)

Notice that the integral defining Z_m has the dimension of x, hence F_m and S_m are defined up to an additive term ∼lnx (which disappears if the velocity part of the partition function is considered). However, in most cases one is mainly interested in differences ΔF_mn and ΔS_mn for a system in regions Ω_m and Ω_n at a given T, where the additive factors are cancelled. Correspondingly, the dependence on dimensionality is cancelled in the ratio of populations, p_m/p_n (p_m=exp[–F_m/k_BT]/Z) which is much easier to calculate than the populations themselves,

p_{m} / p_{n} = Z_{m} / Z_{n} .

(9)

II.3 Microstates and intermediate flexibility

While the difficulty in calculating the absolute S (F) discussed above is common to all systems, biological macromolecules such as peptides and proteins, are particularly challenging due to their rugged potential energy surface, E(x). More specifically, this surface is “decorated” by a tremendous number of localized energy wells and “wider” ones that are defined over microstates (regions Ω_m), each consisting of many localized wells (Fig. 1); a microstate can be represented by a sample (trajectory) generated by a local MD simulation (e.g., the α-helical region of a peptide, see further discussion in II.4 below). MD studies have shown that a molecule will visit a localized well only for a very short time (as short as several fs) while staying for a much longer time within a microstate [15,16], meaning that the microstates are of a greater physical significance than the localized wells. A central aim of computational structural biology is to fold a protein, i.e., to identify its (single) Ω_m with global minimum F_m (out of trillions of microstates) – an unsolved optimization task. It is noted further that F_m of non-stable microstates, such as a transition state, might also be of interest.

Figure (1) — Schematic one-dimensional representation of part of the energy surface of a peptide or a protein, as a function of a coordinate X. The two large potential energy wells are defined over the corresponding microstates denoted Ω₁ and Ω₂. Each microstate consists of many localized potential wells denoted intermittently by solid and dashed lines. The partition function *Z_m* of microstate m is obtained by integrating exp[-E/k_BT] over Ω_m where *F_m* = - k_BT lnZ_m is the microstate's free energy. The figure suggests that the second microstate is the more stable among the two due to lower energy and higher entropy (Ω₂ is larger than Ω₁) hence lower free energy. If F₂ is also the global free energy minimum of a protein, Ω₂ is expected to describe the native microstate (assuming a perfect force field) and a simulation started from Ω₂ will keep the protein in this microstate for a long time. On the other hand, a peptide can populate significantly several of the most stable microstates in thermodynamic equilibrium.

Free energy calculations are also required in problems which are less challenging than protein folding, i.e., in cases of intermediate flexibility, where a flexible protein segment (e.g., a side chain or a surface loop), a cyclic peptide, or a ligand bound to an enzyme populates significantly several microstates in thermodynamic equilibrium. It is of interest, for example, to know whether the conformational change adopted by a loop (a side chain, ligand, etc.) upon protein binding has been induced by the other protein (induced fit [17,18]) or alternatively the free loop already interconverts among different microstates where one of them is selected upon binding (selected fit [19]). This analysis requires calculating the relative populations, p_m/p_n [equation (9)], which are also needed for a correct analysis of nuclear magnetic resonance (NMR) and x-ray data of flexible macromolecules [20,21].

II.4 On the practical definition of a microstate

Calculating populations, p_m or ratios p_m/p_n by the various techniques cannot be achieved without first establishing a practical definition of a microstate, which is however not trivial. Therefore, we elaborate below about this important issue that has been ignored to a large extent in the literature but has been given considerable thought by us over the course of the years [22-32]. For simplicity we consider an N-residue peptide with rigid geometry, i.e., constant bond lengths and bond angles meaning that its backbone conformation is solely defined by the dihedral angles, φ_k and ψ_k, where k=1,N. (ω_k, for the peptide bond, is fixed at 180°.) For a helical microstate (Ω_h), these angles are expected to vary within relatively small ranges Δφ_k and Δψ_k around φ_k = -60° and ψ_k = -50° (we ignore for a moment the side chains). However, if N is not too small, the correct limits of Ω_h in the [φ_k,ψ_k] space are unknown even for this simplified model since they constitute a complicated narrow “pipe” contained within the (larger) region defined by the product, Δφ₁×Δψ₁×Δφ₂×Δψ₂ ····· Δφ_N×Δψ_N due to the strong correlations among the dihedral angles. Obviously, these correlations are taken into account by an exact simulation method and thus, in practice, Ω_h can be defined (or more correctly, represented) by a local MC (MD) sample of conformations initiated from an α-helical structure, as mentioned earlier.

However, this definition should be used with caution. Thus, a short simulation will span only a small part of Ω_h and this part will grow constantly as the simulation continues; correspondingly, the calculated average potential energy, E_h and the entropy S_h (obtained by any method) will both increase and the free energy, F_h is expected to change as well. As the simulation time is increased further, side chain dihedrals will “jump” to different rotamers, which according to our definition should also be included within Ω_h; for a long enough simulation the peptide is expected to “leave” the α-helical region and move to a different microstate. Thus, in practice, the microstate size and the corresponding thermodynamic quantities can depend on the simulation time used to define the microstate. In some cases, one can better define Ω_h by discarding structures with dihedral angles beyond predefined Δφ_k and Δψ_k values or structures that do not satisfy a certain number of hydrogen bonds; one can also apply energetic restraints where their bias should be removed. However, these restrictions are somewhat arbitrary and are difficult to apply for calculating the differences ΔF_mn and ΔS_mn between microstates Ω_m and Ω_n, which is our main interest. Therefore, one should bear in mind that in practice there is always some arbitrariness in the definition of a microstate, which affects the calculated averages. This arbitrariness is severe with some methods and can be controlled (minimized) by others, as is discussed in the coming sections.

III. Methods for calculating the free energy

The various methods are divided into three main categories, the “counting approach”, thermodynamic integration/perturbation, and methods for calculating the absolute F and S. For brevity in what follows we denote microstates by m and n rather than by Ω_m and Ω_n.

III.1 The counting approach

As has been already pointed out, in many cases one is interested in differences ΔS_mn and ΔF_mn (or Z_m/Z_n) between two microstates (and less in S and F themselves). ΔF_mn can be calculated in the most straightforward way by a counting method, i.e., from a long MD (or MC) simulation that “covers” both microstates, where

Δ F_{m n} = k_{B} T ln [(# m) / (# n)]

(10)

and #m (#n) is the number of times the molecule visited m (n) during the simulation. However, because of high energy barriers, the transition between microstates at room temperature might require long times, nanoseconds or more even for side chain rotamers, meaning that reliable sampling of #m (#n) might become prohibitive. This problem can be alleviated by applying enhanced sampling techniques such as replica exchange [33] or the multicanonical method [34,35]; however, the conformational search capability of these methods is also limited and microstates of interest might be visited poorly (or not at all). The common analysis is based on projecting MD (MC) trajectories onto a small number of coordinates using principal component analysis, PCA (to help define/identify microstates), or in simpler cases calculating the populations along one or two physically significant reaction coordinates [36,37].

III.2 The thermodynamic integration approach

Differences ΔF and ΔS are commonly calculated by thermodynamic integration (TI) over physical quantities such as the energy, temperature, pressure, specific heat, etc. [38,39], as well as non-physical parameters, for instance, using a coupling parameter to act on the interaction potential to effect an “alchemical mutation”. In addition to TI, free energy perturbation (FEP) [1-9,40-47] and histogram analysis methods [48-50] can also be applied and will be included in this category. These are robust and highly versatile approaches, which have been reviewed extensively [1-9] and therefore only recent developments will be discussed here, some of them in detail.

III.2.a Advantages of TI and some pitfalls

An important application of TI is calculating the difference in the binding free energy of two ligands a and b bound to a protein (or a single ligand bound to a protein before or after a mutation in the protein). In this case two different simulations (integrations) are carried out in which a is mutated to b in water (aw→bw) and in the protein environment (Pa→Pb), and the corresponding differences in free energy, ΔF_aw_→_bw and ΔF_Pa_→_Pb are obtained. Because the free energy generated during a reversible thermodynamic cycle is zero, one can obtain the required overall difference in the binding free energy (see Fig. 2),

Figure (2) — A thermodynamic cycle for the binding of two ligands a and b to a protein P. In the experiment the ligands are transferred from the solvent to the active site where one measures the difference ΔΔF = ΔF_aw _→**_Pa** - ΔF_bw _→**_Pb**. In simulations the nonphysical transformation a→b is carried out in the protein and in solution and the corresponding free energies ΔF_Pa_→**_Pb** and ΔF_aw _→**_bw** are calculated. Because the free energy of the entire cycle is zero, the desired ΔΔF is obtained in terms of the nonphysical free energy differences ΔΔF = ΔF_Pa_→**_Pb** - ΔF_aw _→**_bw**.

Δ F_{aw \to Pa} - Δ F_{bw \to Pb} = Δ F_{aw \to bw} - Δ F_{Pa \to Pb} .

(11)

This procedure is extremely valuable because it enables one to calculate small free energy differences, ΔF_aw_→_bw and ΔF_Pa_→_Pb in large systems - a large container of water, and a large protein solvated by water. This stems from the fact that during the TI process only the interactions of the relatively small mutated part of the ligand with the system is directly considered (in the relevant derivatives to be integrated) and the resulting fluctuations are therefore small; on the other hand, obtaining ΔF_aw_→_bw (and ΔF_Pa_→_Pb) from ΔE_aw_→_bw - TΔS_aw_→_bw would be prohibitive, because both the energy and entropy depend on all of the system interactions, the fluctuations are large (∼N^1/2), and in practice the high precision required is not achievable [51-53].

However, TI has weaknesses that should be emphasized. Thus, if one seeks to calculate ΔF_mn between microstates m and n with significant structural variance (e.g., a helix and a hairpin of a peptide) the integration $\int_{m}^{n} dF$ becomes difficult (due to the complex path), and for a large molecule unfeasible. This difficulty may also be problematic in the calculation of the free energy of binding described above. Thus, whereas the mutation of a to b in the more homogeneous solvent environment might be well controlled, the simulation in the protein environment might not converge for very long times due to conformational changes (e.g., “jumps” of side chains among rotamers, etc.) occurring constantly in the entire protein; in other words, the microstate of Pb (and to some extent also of Pa) keep changing as the simulation time increases. Also, sometimes the mutation process does not lead to the required size and shape of the active site of Pb, or to the correct position of b and the correct number of water molecules in the active site of Pb [54,55].

As discussed below, these drawbacks can be overcome to a large extent with methods that calculate the absolute free energy. In this context it should be pointed out that the absolute F can also be obtained with TI provided that a reference state R with known F is available and an efficient integration path R→m can be defined. A classic example is the calculation of F of liquid argon or water by integrating the free energy from an ideal gas reference state. However, for non-homogeneous systems such integration might not be trivial, and in models of peptides and proteins defining adequate reference states is not straightforward (see later discussions in III.3.b.7). However, in spite of these problems, the TI approach is applied regularly for calculating the free energy of binding (and other properties) and the required computer programs are implemented in the commonly used molecular mechanics/molecular dynamics software packages, such as AMBER [56], CHARMM [57], NAMD [58], BOSS [59], GROMOS [60], GROMACS [61], TINKER [62], and others.

III.2.b The Adaptive Integration Method

An interesting development in the TI category is the Adaptive Integration Method (AIM) for computing free energies, radial distribution functions, and potentials of mean force [63]. A general TI process is based on the integral

Δ F = \int_{0}^{1} {〈 \frac{d H (λ, x)}{d λ} 〉}_{λ} d λ,

(12)

where 0 ≤ λ ≤ 1 defines a hybrid Hamiltonian, H(λ) = (1- λ)U₀ + λU₁, that is varied between two energy functions U₀ and U₁. (H(λ) can also be defined by more general nonlinear scalings.) This integral is commonly evaluated by carrying out l separate MD (or MC) simulations at l intermediate λ values, where the l corresponding averages (in conformational space x) of the derivative <dH(λ,x)/dλ>_λ = <U₁-U₀>_λ are calculated. With AIM, on the other hand, the sampling is performed within an MC procedure that allows transitions between coordinates as well as between different λ values. The parameter, λ, is therefore treated as an additional coordinate thus defining an expanded (λ,x) “super-system”. Thus, if the (a-priori unknown) partition function at λ is Z_λ (Z_λ = ∫ exp[−H(λ,x)/k_BT]dx), a normalized (Boltzmann) probability for the super-system to be at (λ,x) can be defined as

P^{B} (λ, x) = \frac{exp [- H (λ, x) / k_{B} T] / Z_{λ}}{∑_{λ^{'}} \int exp [- H (λ^{'}, x) / k_{B} T] d x / Z_{λ^{'}}} .

(13)

Note that each λ value has been weighted (by 1/Z_λ) to give the same probability, P^B(λ)=1/{Σ_λ′∫exp[-H(λ′,x)/k_BT]dx/Z_λ′}. The MC transition probabilities should satisfy the detailed balance condition, p(λ₁→ λ₂) / p(λ₂→ λ₁)=P^B(λ₂,x)/ P^B(λ₁,x), which leads to

p (λ_{1} \to λ_{2}) = min ⌊ 1, exp {- [H (λ_{2}, x) - H (λ_{1}, x) + {\bar{F}}_{λ_{1}} - {\bar{F}}_{λ_{2}}] / k_{B} T} ⌋

(14)

Because Z_λ is not known a-priori, the free energy, F̅_λ = −k_BT ln Z̅_λ in equation (14) is approximate and therefore appears with a bar. The values of F̅_λ (Z̅_λ) are calculated with an adaptive procedure. In particular, the transition probabilities (equation (14)) are estimated by using the current (running) estimates for the free energy derivatives, <dH(λ,x)/dλ>_λ, wherein, the free energy difference, $({\bar{F}}_{λ_{2}} - {\bar{F}}_{λ_{1}})$ , is thus approximated by a simple numerical integration. As the simulation continues, the running averages of the free energy derivatives become more accurate, making the estimated free energy differences increasingly accurate, and thus, the detailed balance condition will be satisfied (albeit asymptotically) with all λ values (bins) being visited an equal number of times. In a more traditional way for estimating F̅_λ, the simulation starts with Z̅_λ = 1 for all λ, where for each visit of a λ value Z̅_λ is increased by 1, and the simulation continues with the current (updated) Z̅_λ; thus, asymptotically (i.e., for a very long run) the ratios of the Z̅_λ values attain stability, Z̅_λi/Z̅_λj ≈ Z̅_λi/Z̅_λj. Notice also that for each λ the transitions between the coordinates (x) can be carried out by any canonical simulation technique (MC, MD etc.).

The authors claim that a larger number of bins (λ values) can be treated with AIM than with TI (for the same amount of computer time) which leads to a much finer resolution. Another potential advantage of AIM lies in the fact that a bin might be visited many times during the simulation, each visit starts from a different structure (seed) leading to an adequate sampling of the contributing microstate(s) for this λ. With TI, on the other hand, only a single simulation (starting from one seed) is typically performed and the coverage of the contributing microstates is expected to be more limited.

It should be pointed out that simulation techniques based on an adaptive calculation of (relative) free energies and entropies have been suggested before, starting with the multicanonical technique of Berg and Neuhaus [34], the method of expanded ensembles of Lyubartsev et al. [64], and the simulated tempering method of Marinari and Parisi [65]. The more recent (and relatively simple) random walk algorithm of Wang and Landau [66] has been used extensively, and has become the basis for more sophisticated techniques developed, for example, by de Pablo's group [67-69]. Also, to enhance efficiency, Escobedo and collaborators have devised methods [70-74], which combine the expanded ensembles idea with other known procedures (e.g., Bennett's method [75]). However, unlike AIM, which is aimed at calculating the free energy, most of these methods are designed primarily as simulation tools that enable a system with a rugged energy surface to cross energy barriers efficiently, while differences in free energy (or entropy) are obtained (like other properties) as byproducts of the simulation. A detailed discussion of these methods is beyond the scope of this review and extensive relevant literature can be found in the references cited above. Finally, it should be pointed out that further development of multicanonical ideas has also been pursued by the groups of Okamoto and Nakamura (also for MD simulations) and this approach has been applied extensively to peptides and proteins in explicit water (see for example [76-82] and references cited therein).

III.2.c Methods based on Jarzynski's identity

Another approach for calculating the (reversible) ΔF is based on Jarzynski's identity [47],

Δ F = - k_{B} T {〈 exp [- W_{f} / k_{B} T 〉}_{0}

(15)

where <⋯>₀ represents an average over non-reversible forward-directed work values, W_f, generated by starting an equilibrium simulation at U₀ which ends at U₁. However, if the transformation from U₀ to U₁ is rapid, a large number of these non-equilibrium simulations must be generated in order to sample the rare, most-contributing paths with low work values. Therefore, increasing efficiency has been a central aim, e.g., by biasing the selection of paths [83-89] or developing alternatives to Jarzynski's identity [90]. Notice however, that these procedures have been tested mostly on highly simplified models.

Shirts and Pande [91] reviewed and developed theoretical estimates for the bias and variance of Jarzynski's identity, TI, and Bennett's method [75]. They applied these methods to toy models but could not define a preferred method for calculating ΔF; however, in applications to simple atomistic models the lowest variance and bias were obtained with Bennett's method. Pande's group also developed efficient methods for calculating the absolute F of binding [92]. In a recent study [93] the accuracy and precision of nine free energy methods have been compared, where among them are, TI, AIM, FEP, Bennett's method, and single-ensemble path sampling [84]. ΔF was calculated for growing a (neutral) Lennard-Jones sphere in water and for charging a Lennard-Jones sphere in water. The efficiency was found to depend on the system and extent of accuracy sought, where overall AIM is the most efficient. Jarzynski's identity was also applied to realistic systems of proteins [94], where steered MD was used for calculating potential of mean force for unbinding acetylcholine from the alpha7 nicotinic acetylcholine receptor ligand-binding domain; four different procedures were checked in this study (see also [95]).

III.3 Calculations of the absolute S and F

Problems associated with the free energy difference-based approaches discussed earlier (e.g., TI) can be remedied to a large extent by calculating the absolute free energy; then, F_m and F_n can be obtained directly from two separate MD (MC) simulations of m and n, which leads to ΔF_mn = F_m – F_n and the need for an integration from m to n (or a long simulation that covers both m and n, as in the counting method) is avoided. Several methods have been developed in this category.

III.3.a Harmonic and quasi-harmonic techniques

A commonly used approach for estimating the absolute S is based on the harmonic approximation which was introduced to biomolecules by Gō and Scheraga [96,97]. They obtained

S = - (k_{B} / 2) ln [Det (Hessian)],

(16)

where Hessian is the matrix of second derivatives of the force field with respect to internal coordinates around an energy minimized structure; the quantum mechanical version (Einstein oscillators) was applied later for peptides by Hagler's group [98]. A related approach, “the second generation mining minima” method (M2) [99,100] has been developed by Gilson's group. With M2, low energy minimized structures (within a microstate) are initially identified, the free energies of the corresponding local potential wells are calculated with a method that considers both harmonic and an-harmonic effects, and the contribution of the individual wells is then accumulated.

An important development has been the introduction of the quasiharmonic (QH) method by Karplus and Kushick [101], where the Boltzmann probability density of structures defining a microstate is approximated by a multivariate Gaussian. Thus,

S_{QH} = \frac{k_{B}}{2} {N + ln [{(2 π)}^{N} Det(σ)]}

(17)

where the covariance matrix, σ, is obtained from a local MD (MC) sample and N is (usually) the number of internal coordinates. Clearly, S_QH constitutes an upper bound for S since correlations higher than quadratic are neglected; also, an-harmonic contributions are ignored, and QH is not suitable for diffusive systems such as water.

While QH has been used extensively during the years (see [102-104] and references cited therein), a systematic study of its performance has been carried out only recently by Gilson's group [105]. They studied linear alkanes and a host-guest system (urea receptor with the ethylenurea ligand) comparing the QH results to those obtained by the M2 method mentioned above. The conclusions of this study are that QH can be accurate for a highly populated single energy well, where the calculation is based on internal coordinates; the use of Cartesians, however, leads to errors of several kcal/mol. When the simulation covers several energy wells the errors of QH (in internal coordinates) can increase to tens of kcal/mol and are significantly larger with QH(Cartesians). Also, while errors sometimes get cancelled in entropy differences, the host-guest studies have shown that the errors in ΔS_QH are substantial. Finally, the convergence of the QH results is slow and in the host-guest system convergence has not been obtained even with 6 ns MD runs, which is in accord with previous studies. These conclusions probably apply to other versions of QH where σ is defined in Cartesian coordinates, such as the ad-hoc quantum mechanical approximation of Schlitter [106,103] and the exact derivation of quantum mechanical QH [107]; the performance of these two methods has been compared [108].

A new version of QH has been suggested recently by Wang and Brüschweiler [109], which enables one to estimate the contribution of different potential wells, e.g. rotameric states. Thus, defining a peptide conformation by the dihedral angles θ_j, a PCA analysis is carried out for a sample of conformations with respect to the complex variables $e^{i θ_{j}}$ (rather than θ_j) which eliminates the modulo 2 ambiguity in θ_j. The sample conformations (defined by the $e^{i θ_{j}}$ ) are then projected onto each of the eigenvectors m and the distribution of the resulting (complex) values leads to an entropy S_m. In practice, however, these distributions are smoothed by Gaussian functions, which depend on a variance parameter, σ. The total 2D entropy is S_2D= Σ S_m (up to an additive factor that controls to a large extent the effect of σ). The method was applied to several simplified models and the protein ubiquitin described by a force field. More recently this technique and the counting method were applied to the second β-hairpin of the B1 domain of streptococcal protein G, and the entropy results of these calculations were found to agree [110].

III.3.b Step-by-step reconstruction methods

Another approach for calculating the absolute S (F) has been suggested by Meirovitch and has been implemented initially in two approximate techniques of general applicability (i.e., not restricted to harmonic conditions), the local states (LS) [111,22-28] and the hypothetical scanning (HS) methods [112,115]. With both methods each conformation i of a sample [generated by MC or MD] is reconstructed step-by-step (from nothing) using transition probabilities (TPs); the product of these TPs leads to an approximation P_i for the correct P_i^B [equation (1)]. Recently, HS has been developed further to become the HSMC(D) method, where the approximate deterministic calculation of TP(HS) was replaced by a stochastic calculation carried out by MC(MD) simulations [116-121,29-32].

The philosophy of this approach is based on the ideas of the exact scanning method, which is thus described first [122,123]. While these methods are applicable to a wide range of systems, they are described here as applied to a simple peptide – a polyglycine molecule of N residues where its conformations are defined by the dihedral angles φ_i,ψ_i, and ω_i and the corresponding bond angles (bond lengths are assumed to be constant). These angles ordered along the chain are denoted by α_k, k=1,6N and the peptide is assumed to be in the helical microstate, Ω_h (however, the conclusions apply to any microstate Ω_m.) The potential energy of the peptide is defined by a force field in vacuum.

III.3.b.1 The exact scanning procedure

The exact scanning method [122,123] is a step-by-step construction procedure for a peptide conformation based on calculating (consecutively) TPs for the α_k, and determining their values and the positions of the related atoms [124]. For example, the angle φ defines the coordinates of the two hydrogens connected to C^α, and the position of C′. Thus, at step k (starting from nothing), k-1 angles α₁, …,α_k-₁ have already been determined and the related structure (the past) is kept constant. α_k is defined with the exact TP density ρ(α_k|α_k₋₁,⋯, α₁)

ρ (α_{k} {| α}_{k - 1}, \dots, α_{1}) = Z_{future} (α_{k}, \dots, α_{1}) / [Z_{future} (α_{k - 1}, \dots, α_{1})] .

(18)

That is, ρ(α_k|α_k₋₁,⋯, α₁)dα_k is the probability for the k^th angle to be found within a small increment, dα_k, centered at α_k, given that the angles, 1 through k-1, are at values α₁, …, α_k_-1. Z_future(α_k,⋯,α₁) is a “future partition function” where the term “future” indicates that the integration defining Z_future is carried out over the variables α_k+1,⋯, α_6N which will be determined only in the future steps of the build-up process. (Similarly, Z_future(α_k₋₁,⋯,α₁) is an integration over the angles α_k,⋯, α_6N.) In these integrations the atoms treated in the past are held fixed at their respective coordinates. More specifically, for Z_future(α_k,⋯,α₁), α₁⋯α_k are fixed, while α_k+1,⋯, α_6N are varied in a restrictive way where the corresponding conformations of the “future” part remain within Ω_h, and thus we write,

Z_{future} (α_{k}, \dots, α_{1}) = \int_{Ω_{h}} exp [- E (α_{6 N}, \dots, α_{1}) / k_{B} T] d α_{k + 1} \dots d α_{6 N} .

(19)

The product of the TPs [equation (18)] leads to the (Boltzmann) probability density of the entire conformation [equation (1)],

ρ^{B} (α_{6 N}, \dots, α_{1}) = \prod_{k = 1}^{6 N} ρ (α_{k} {| α}_{k - 1}, \dots, α_{1}) .

(20)

This construction procedure is not feasible for a large molecule because the scanning range grows exponentially and the helical region is not known, as discussed in II.4; therefore this method was used as a conformational search technique, where only a limited number of future angles were scanned [124]. However, the ideas of the exact scanning method constitute the basis for the three methods, HS, HSMC(D), and LS, as discussed below.

III.3.b.2 The philosophy of the hypothetical scanning approach

The exact scanning method is equivalent to any other exact simulation technique (in particular MC and MD) in the sense that large samples generated by such methods lead to the same averages and fluctuations within the statistical errors. Therefore, one can assume that a given MC or MD sample has rather been generated by the exact scanning method, which enables one to reconstruct each conformation i by calculating the TP densities that hypothetically were used to create it step-by-step. This idea has been implemented initially in two different ways in the LS and HS methods. However, an exact reconstruction of the TPs [equation (18)] is feasible only for a very small peptide. Therefore, calculation of future partition functions [equation (19)] by these methods has been carried out only approximately, by considering a partial future (or a limited past in the case of LS) as discussed in III.3.b.5. On the other hand, with HSMC(D) the entire future is considered and in this respect HSMC(D) can be considered to be exact.

III.3.b.3 The HSMC(D) method

Because HSMC and HSMD are based on the same theoretical grounds, we denote the related probability functions by ‘HS’, where the theory is described for HSMD, which for peptides has been found to be the more practical and efficient method among the two. One starts by generating an MD sample of the helical microstate; the conformations are then represented in terms dihedral and bond angles,1≤ α_k ≤ 6N, and the variability range Δα_k is calculated,

Δ α_{k} = α_{k} (max) - α_{k} (min),

(21)

where α_k(max) and α_k(min) are the maximum and minimum values of α_k found in the sample, respectively. Δα_k, α_k(max), and α_k(min) enable one to verify that the sample spans correctly the Ω_h microstate.

As mentioned in III.3.b.2, with our approach a sample conformation i is reconstructed step-by-step by calculating the TP density of each α_k [equation (18)] from the future partition functions Z_future [equation (19)]. However, a systematic integration of Z_future based on the entire future within the limits of Ω_h is difficult and becomes impractical for a large peptide where Ω_h is unknown (see II.4). The idea of the HSMD method is to obtain the TPs [equation (18)] by carrying out MD simulations of the future part of the chain rather than by evaluating the integrals defining Z_future [equation (19)] in a deterministic way. Thus, at reconstruction step k of conformation i the TP density, ρ(α_k|α_k₋₁,⋯,α_k) is calculated from an MD sample of n_f conformations (generated, in practice, in Cartesian coordinates), where the entire future of the peptide is moved (i.e., the atoms defined by α_k,⋯,α₆_N) while the past (the atoms defined by α₁,⋯,α_k₋₁) are kept fixed at their values in conformation i. A small segment (bin) δα_k [see also equation (18)] is centered at α_k(i) and n_visit, the number of visits of the future chain to this bin during the simulation is calculated; one obtains,

ρ (α_{k} {| α}_{k - 1}, \dots, α_{1}) \approx ρ^{HS} (α_{k} {| α}_{k - 1}, \dots, α_{1}) = n_{visit} / [n_{f} δ α_{k}]

(22)

where the relation becomes exact for very large n_f (n_f → ∞) and a very small bin (δα_k→ 0) (see [32]). This means that in practice ρ^HS(α_k|α_k₋₁,⋯,α₁ will be somewhat approximate due to insufficient future sampling (finite n_f), a relatively large bin size δα_k, an imperfect random number generator, etc.; therefore, we denote this TP and the probability densities derived from it by ‘HS’. (This equation is suitable also for HSMC). It is noted that for practical reasons, it is best with HSMD to treat a pair of angles simultaneously, where each pair consists of a dihedral angle and its successive bond angle (e.g., φ and the bond angle N-C^α-C′). Thus, at each step both α_k and α_k+₁ are considered and n_visit is increased by 1 only if α_k and α_k+₁ are located within the limits of δα_k and δα_k+₁, respectively; therefore equation (22) becomes (see Fig. 3),

Figure (3) — Illustration of the HSMC(D) reconstruction process of conformation i of a peptide consisting of three glycine residues. At each step the transition probability (TP) of a dihedral angle and the successive bond angle is determined and the related atoms are then fixed in their positions in i. The figure describes step 4 where the dihedral and bond angles considered are φ₂ (of the second residue) and the successive θ; these coordinates are also denoted α₇ and α₈, respectively (see text). In this process the already reconstructed part (the past) is depicted with solid lines and solid spheres (atoms); for simplicity the oxygens and most of the hydrogens are discarded. The TP is obtained by carrying out an MD simulation of the as yet unreconstructed part of the peptide (the future) which is depicted with dashed lines and empty spheres. In this simulation the “past” atoms remain fixed at their positions in i while the conformations of the future part should remain within the limits of the microstate; future-past interactions are taken into account. Small bins δφ₂ and δθ are centered at the values of φ₂ and θ in i. The TP is calculated from the number of simultaneous visits of the future part to δφ₂ and δθ during the simulation [see equation (23)]. After TP(4) has been determined the coordinates of the two hydrogen atoms of C^α (2) and those of C′(2) are fixed at their positions in i and the process continues.

ρ^{HS} (α_{k + 1}, α_{k} {| α}_{k - 1}, \dots, α_{1}) = n_{visit} / [n_{f} δ α_{k} δ α_{k + 1}] .

(23)

Notice that in the deterministic calculation of Z_future, [equation (19)] the limits of Ω_h are in practice unknown. On the other hand, with HSMD the future structures generated by MD at each step k remain in general within the limits of the microstate Ω_h defined by the analyzed MD sample due to the microstate's (meta) stability.

Similar to equation (20), the corresponding overall probability density for HSMD is

ρ^{HS} (α_{6 N}, \dots, α_{1}) = \prod_{k = 1}^{6 N} ρ^{HS} (α_{k + 1}, α_{k} {| α}_{k - 1}, \dots, α_{1}),

(24)

where in the product only odd values of k are used. ρ^HS(α₆_N,⋯,α₁) defines an approximate entropy and free energy functionals, S^A and F^A (over the ensemble) which can be shown using Jensen's inequality to constitute rigorous upper and lower bounds, respectively [117],

S^{A} = - k_{B} \int_{Ω_{h}} ρ^{B} ln ρ^{HS} d α_{1} \dots d α_{6 N}

(25)

F^{A} = 〈 E 〉 - T S^{A} = 〈 E 〉 + k_{B} T \int_{Ω_{h}} ρ^{B} ln ρ^{HS} d α_{1} \dots d α_{6 N} .

(26)

In these equations ρ^HS = ρ^HS(α₆_N,⋯,α₁), <E> is the Boltzmann average of the potential energy (force field), estimated from the MD sample and ρ^B [equation (20)] is the (correct) Boltzmann probability density with which the sample has been generated. S^A is estimated from a Boltzmann sample of size n by the arithmetic average of the ln(ρ^HS) values [see equation (7)]. As discussed in II.1, the fluctuation (standard deviation), σ_F of the correct free energy [equation (5)] is zero, while the approximate F^A has finite fluctuation, σ_A (estimated by its arithmetic average, $\bar{σ_{A}}$ ), which is expected to decrease as the approximation improves (i.e., as n_f increases and/or δα_k decreases) [13,14,115-117]

\bar{σ_{A}} = {[\frac{1}{n} \sum_{t = 1}^{n} {[{\bar{F}}^{A} - E_{t} - k_{B} T ln ρ_{t}^{HS}]}^{2}]}^{1 / 2} .

(27)

One can also define a free energy functional, F^B which constitutes a rigorous upper bound for the correct F [116,117]. Thus, by increasing computer time (and/or decreasing δα_k)) a set of improving bounds can be obtained which enable one to determine the accuracy from HSMC(D) results alone without the need to know the correct answer (a “self checking” property). Furthermore, F can be obtained from a very small sample and even from any single conformation (see discussion in II.1). A functional F^D has been also been defined, which leads to the correct F, and additionally, the correct F can also be estimated from the anti-correlation between improving results of F^A and its fluctuation, σ_A (F^A increases where σ_A decreases). However, F^B and F^D are statistically less reliable than F^A. These topics have been developed and tested in a systematic way in particular for argon, water, and self-avoiding walks on a lattice [116-118] (see below).

Unlike the limited applicability of methods that are based on harmonic approximations, HSMC(D) is applicable to fluids, random coil polymers, as well as microstates of a peptide. Thus, results for liquid argon, TIP3P water [117,125], and self-avoiding walks on a square lattice [118] were found to agree within error bars to TI results. Also, for polyglycine molecules, differences ΔF_mn and ΔS_mn for α-helix, extended, and hairpin microstates were calculated very reliably by HSMC [30]. However, in principle, HSMC(D) is not an efficient method because the number of build-up steps increases with system size. It turns out, however, that in calculations of differences ΔS_mn = S_m−S_n (or ΔF_mn) (that are of our main interest) the accuracy of $S_{m}^{A}$ and $S_{n}^{A}$ can be compromised significantly without degrading the accuracy of the calculated difference [e.g., by using small n_f and/or a large bin, δα_k; see equations (22) and (23)] due to cancellation of systematic errors. Thus, $S_{m}^{A} - S_{n}^{A}$ is equal to the correct value within the statistical errors, as has been demonstrated for peptides [29,30,32] as well as for the 7-residue mobile loop, 304-310 (Gly-His-Gly-Ala-Gly-Gly-Ser) of the enzyme porcine pancreatic α-amylase. This loop was modeled in vacuum by the AMBER force field, by AMBER with implicit solvation [120] and with explicit water [121]. Such cancellation of errors (discussed below) is typical of methods that calculate the absolute S (F) and it occurs especially for HSMD leading to a dramatic increase in its efficiency (see end of next section).

III.3.b.4 The cancellation of systematic errors with HSMD

It is important to understand the basis for the cancellation of errors discussed above. We examine first two one-dimensional harmonic microstates (oscillators) with the same mass defined by different spring constants f₁ and f₂. The exact entropy difference, ΔS_mn (here written ΔS_2,1 can be expressed in terms of the variances <x²> and <y²> of the corresponding coordinates,

Δ S_{2, 1} = (1 / 2) k_{B} ln (f_{1} / f_{2}) = k_{B} [ln (< x^{2} >^{1 / 2}) - ln (< y^{2} >^{1 / 2})]

(28)

One can estimate ΔS_2,1 from two separate MD simulations, where the corresponding variances are calculated. If f₁ is significantly smaller than f₂ (i.e., f₁ defines a flatter parabola) and the same step size is used in both simulations a longer simulation will be required for f₁ than for f₂ to gain the same statistical precision. Therefore, if the same sample size is used for both microstates the statistical precision of ΔS_2,1 will be determined mostly by that of S₁.

We now examine the entropy contributed by a backbone dihedral angle, α_k (denoted α for simplicity) in the course of the reconstruction process. α varies in microstates 1 and 2 within the ranges Δα₁ and Δα₂ [equation (21)] which we denote Δ₁ and Δ₂, respectively. The crudest (but sometimes quite reliable) HSMD approximation for the corresponding difference in entropy ΔS₀(α) is

Δ S_{0} (α) = k_{B} [ln Δ_{2} - ln Δ_{1}]

(29)

which is similar to that of equation (28) above (for brevity we shall omit α from the equations below). For better HSMD approximations, $Δ S_{0}^{n_{f}} (l)$ we define the bins δ₁=Δ₁/l and δ₂=Δ₂/l, where l is an increasing integer; the corresponding probabilities are $p_{1}^{n_{f}} (l)$ and $p_{2}^{n_{f}} (l)$ which are defined by n_visit/n_f [equations (22) and (23)]. One obtains,

\begin{array}{l} Δ S_{0}^{n_{f}} (l) = k_{B} [ln (p_{1}^{n_{f}} (l) / δ_{1}) - ln (p_{2}^{n_{f}} (l) / δ_{2})] = k_{B} {ln [p_{1}^{n_{f}} (l) / p_{2}^{n_{f}} (l)] + ln (Δ_{2} / Δ_{1})} \\ or Δ S_{0}^{n_{f}} (l) = Δ S^{n_{f}} (l) + Δ S_{0} \end{array}

(30)

where $Δ S^{n_{f}} (l)$ can be viewed as an an-harmonic term. One can write, $p_{i}^{exact} (l) = p_{i}^{n_{f}} (l) x_{i}^{n_{f}} (l)$ for i=1,2, where $p_{i}^{exact} (l) = p_{i}^{n_{f} = \infty} (l)$ and $x_{i}^{n_{f}} (l)$ are thus factors (systematic errors) satisfying $x_{i}^{n_{f}} (l) \to 1$ for very large n_f; for a given l (bin) one obtains,

Δ S^{n_{f}} (l) = k_{B} {ln p_{1}^{exact} (l) - ln p_{2}^{exact} (l) + ln [x_{2}^{n_{f}} (l) / x_{1}^{n_{f}} (l)]}

(31)

However, for large bins, δ (small l), one would expect to obtain probabilities that are close to the exact ones, $p_{1}^{exact} (l)$ and $p_{2}^{exact} (l)$ [i.e., $x_{1}^{n_{f}} (l)$ and $x_{2}^{n_{f}} (l)$ are ∼1] for a relatively small n_f due to adequate statistics, i.e., relatively large n_visit values. To obtain the exact probabilities (within the statistical errors) for decreased bin sizes, n_f should be increased adequately, which might increase computer time significantly. Thus, for practical values of n_f, $x_{1}^{n_{f}} (l)$ and $x_{2}^{n_{f}} (l)$ might differ significantly from 1 (i.e., large systematic errors). However, we argue that already for relatively small n_f, $x_{2}^{n_{f}} (l) \approx x_{1}^{n_{f}} (l)$ and the last logarithmic term [equation (31)] becomes smaller than the statistical error leading to the correct value, ΔS(l) within the statistical error. To obtain the correct contribution (ΔS) of dihedral angle α to the entropy difference one has to define small enough bins, i.e., large enough l_min, where for l>l_min ΔS(l) remains unchanged within the statistical error.

The relation $x_{2}^{n_{f}} (l) \approx x_{1}^{n_{f}} (l)$ stems from two reasons, where the first one is the fact that HSMD takes all interactions into account and thus for a given n_f the future part of the chain is treated with the same level of approximation in both microstates. Secondly, because with MD the atoms are moved along their potential gradients, the simulations are equally efficient in both microstates. For peptides [32] the condition $x_{2}^{n_{f}} (l) \approx x_{1}^{n_{f}} (l)$ occurs for much smaller n_f with HSMD than with HSMC [30] because the efficiency of the MC procedure used depends on the compactness of a structure (e.g., hairpin versus extended); correspondingly, the computer time required with HSMD was reduced by a factor of ∼100 as compared to that needed with HSMC. Again, as for the parabolas above, if one microstate is significantly “flatter” than the other (i.e., larger Δα_k values), the required n_f value for obtaining convergence of ΔS will be determined mainly by the flatter microstate.

III.3.b.5 The HS and LS methods

With the HS method one seeks to reconstruct a chain by a deterministic calculation of Z_future [equation (19)] for each α_k, based only on a partial future scanning defined by α_k,α_k+₁…α_k+f, where f is the scanning parameter, f«6N. HS works very well for self-avoiding walks on a lattice [14,112,115,118], argon [126], or magnetic models [114], because these systems are not limited to a microstate, i.e., the future scanning at each step is carried out over the entire (available) configurational space. On the other hand, for a peptide in a microstate, HS is practically not applicable, because of the difficulty to define the limits of the future part of a microstate in conformational space, as discussed in II.4.

With the LS method applied to a peptide, [22-28] the conformations of a given sample (of a microstate) are initially expressed in terms of internal coordinates and then a three-stage analysis is carried out where the sample is visited three times. In the first visit the variability range Δα_k is calculated, [equation (21)]. Each range, Δα_k is then divided into l equal segments, where l is the discretization parameter. We denote these segments by ν_k, (ν_k=1,l). Thus, an angle α_k is now represented by the segment ν_k to which it belongs and a conformation i is expressed by the corresponding vector of segments [ν₁(i), ν₂(i), …, ν₆_N (i)]. Under this discretization approximation a set of TP densities, ρ(α_k|α_k₋₁,⋯,α₁) can in principle be estimated by

ρ (α_{k} {| α}_{k - 1} \dots α_{1}) \approx n (ν_{k}, \dots, ν_{1}) / {n (ν_{k - 1}, \dots, ν_{1}) [Δ α_{k} / l]}

(32)

where n(ν_k,⋯,ν₁) is the number of times the local state [i.e., the partial vector (ν_k,⋯,ν₁) representing (α_k,⋯,α₁)] appears in the sample. Because the number of local states increases exponentially with k one has to resort to approximations based on smaller local states that consists of ν_k and the b angles preceding it along the chain, i.e., the vector (ν_k,ν_k-₁, …,ν_k-b), where b is the correlation parameter. The sample is visited for the second time and for a given b one calculates the number of occurrences n(ν_k,ν_k-₁, …,ν_k-b) of all the local states from which a set of transition probabilities p(ν_k| ν_k-₁,…, ν_k-b) are defined. The sample is then visited for the third time and for each member i of the sample one determines the 6N local states and the corresponding transition probabilities, whose product defines an approximate probability density ρ_i(b,l) for conformation i

ρ_{i} (b, l) = \prod_{k = 1}^{6 N} p (ν_{k} | ν_{k - 1}, …, ν_{k - b}) / (Δ α_{k} / l),

(33)

the larger are b and l the better the approximation (for enough statistics). ρ_i(b,l) allows one to define an approximate entropy and free energy functionals, S^A and F^A=<E>-TS^A [as in equations (25) and (26), where ρ_i(b,l) replaces ρ^HS(α₆_N,⋯,α₁)], which constitute rigorous upper and lower bounds for the correct values, respectively [22]. Thus, with LS, the past is treated approximately where the entire future is taken into account, in contrast to HS where the whole past is considered but only part of the future is taken into account. To improve the approximation of these methods, the parameters (b,l) and f should be increased, which requires, respectively, very large samples with LS (to get the adequate statistics), and a lot of computer time for calculating Z_f(HS). LS has been applied very successfully to peptides [22-25,27,28], loops [26] as well as magnetic lattice systems, lattice gas models [111,114] and fluid dynamics [127]. However, for random coil polymers (self-avoiding walks) LS is much less efficient than HS [115].

The above discussion demonstrates that LS (unlike HS) is of a “geometrical” character, i.e., calculation of the entropy does not depend directly on the interaction energy. Other methods that are based on calculating the distribution of local states (but not transition probabilities) have been suggested recently by Hnizdo et al. [128] and Killian et al. [129] who tested them on small molecules and peptides.

Finally, it should be pointed out that both LS, the above two methods, and the mining minima technique [99,100] can be applied to samples based on several microstates (where LS is also applicable to the random coil state), while QH lead to reliable results only for a single microstate [105]. However, QH, which considers the (quadratic) correlations among all variables, is expected to lead to better results than LS for a single microstate. Indeed, for peptides [30,32] and a surface loop of the protein α-amylase [120,121] the entropy results of QH were found to be better (i.e., smaller) than those of LS based on b=2 and l=10. However, the corresponding results for S[HSMC(D)] have always been the lowest (i.e., better).

III.3.b.6 Calculation of differences S_m - S_n

With QH, LS, and HSMC(D) calculation of ΔS_mn=S_m–S_n is based on the absolute values for each microstate. However, in section II.4 we have argued that the definition of a microstate m depends to a large extent on the simulation time t where typically m and its energy and entropy all grow with t. To be able to carry out a reliable estimation of ΔS_mn (ΔF_mn, etc.) we simulate both m and n for the same t looking for a range of t values where ΔF_mn(t), ΔS_mn(t) and ΔE_mn(t) are stable within the statistical errors [due to simultaneous increase of E_m(t), E_n(t), etc.]. For the QH method [equation (17)] such stable results constitute the best final answer. For the LS method, on the other hand, one can calculate $Δ S_{mn}^{A} (b, l)$ [and $Δ F_{mn}^{A} (b, l)$ ] for a set of improved approximations (by increasing b and l); if these differences converge within the statistical errors, the converged values are considered to be the correct differences due to cancellation of equal systematic errors in $S_{m}^{A} (b, l)$ and $S_{n}^{A} (b, l)$ (see III.3.b.4); this discussion also applies to different approximations of HSMC(D).

Obviously, if m is less stable than n the t values should be adjusted (i.e., decreased) to fit the stability of m. If m is significantly larger than n, t_m should be large enough to allow an adequate coverage of m, t_m ∼ t_n[ΠΔα_k(m)/[ΠΔα_k(n)], where t_n is the time required to obtain an adequate sample for n. However, if ΔS_mn(t) increases monotonically it constitutes a lower bound. If the microstate is restrictive, e.g., side chains should populate a single rotamer, the MD sample can be composed of several smaller samples that each starts from the same structure (seed) with a different set of velocities. It should be pointed out that with LS and QH relatively large samples are required for obtaining converged TPs [24] and converged terms of the correlation matrix σ [equation (17)] [100], respectively. Therefore, one should verify that the samples remain in the original microstates and have not “escaped” to neighboring ones. For that, methods have been developed which enable one to analyze the stability of a microstate by calculating distribution profiles of dihedral angles [25,27,28].

Unlike QH and LS, HSMC(D) is not based on gathering statistics from the studied sample; therefore, the required sample size is relatively small; also, F[HSMC(D)] (but not necessarily E and S[HSMC(D)]) can be obtained from a very small sample (even from a single conformation) [117]. Therefore, in our studies of peptides and loops populating significantly different microstates [29,32,120,121] the sample size for HSMC(D) is small and has been determined by the range of t values for which the average of E_m (E_n) is approximately constant (typically ∼600 conformations representing a 0.5 ns trajectory). Again, one can envisage extreme cases where m is significantly larger than n, which would require increasing the sample size for m as described above. In particular, with HSMC(D) the effect of sample size on ΔS_mn=S_m-S_n can be reduced, while controlling this effect with TI and the counting approaches is difficult.

III.3.b.7 Calculation of the absolute F by TI

As pointed out earlier, the absolute S (F) can be obtained in principle also by TI, provided that a convenient reference state with known F is defined. In the early work of Stoessel and Nowak [130] a harmonic reference state U_H=kΣ(r_i-r_i⁰)² is defined, where k is a spring constant and r_i and r_i⁰ are the instantaneous and equilibrium coordinates of atom i, respectively. The hybrid Hamiltonian (H) depends on U_H and the force field U, H(λ)=(1- λ)U + λU_H (0 ≤ λ ≤ 1). For decaglycine in an α-helical microstate the estimated error in F (∼2 kcal/mole) is relatively high. Very recently [131], a similar idea has been implemented somewhat differently where H(λ) = U + (1/2)λU_H and the free energy of the final state (λ=1) is calculated by a normal mode analysis. For the pentapeptide Met-enkephalin the maximum error in the absolute F of a microstate is again relatively high, ±1.5 kcal/mol (using HSMC(D), errors of ∼0.2 kcal/mol were obtained for ΔS (ΔF) between microstates of decaglycine and NH₂(Val)₂(Gly)₆(Val)₂CONH₂ [30,32]).

In a recent paper [132] Ytreberg and Zuckerman define a simple numerically calculable reference state designed to overlap the particular microstate of interest. Here, the microstate (say, an α-helical state of a peptide) is first simulated locally by MD, where the range of each internal coordinate, k, is divided into bins, and (normalized) populations are obtained from the (MD) sample. Using these probabilities a large sample of reference system structures, i, is then generated with known probabilities, $P_{i}^{ref} = Π p_{k} (i)$ , where p_k(i) is the probability of the bin of the kth internal coordinate in structure i. (Compare with the LS method above.) Each reference structure is assigned an energy defined by $E_{i}^{ref} = - k_{B} T ln P_{i}^{ref}$ (which leads to F^ref=0), and thus the desired free energy of the microstate can then be obtained through a standard perturbation expression, $F - F^{ref} = F = - k_{B} T ln {〈 exp [- (E_{i} - E_{i}^{ref}) / k_{B} T] 〉}_{ref}$ , where E_i is the actual force field energy calculated for the same set of coordinates (i). The method was found to work well for the leucine dipeptide; however, for a large microstate the overlap between the reference and real microstates might be small and therefore some enhancements should still be introduced.

III.3.b.8 methods based on Bennett's formula

Expressions for ΔF_mn based on two separate simulations of m and n had been suggested by Bennett [75], where one of them was developed further by Voter [133]. Thus, a local MC simulation of microstate m (the sample is defined by vectors R_m) will not cover a distant microstate n (with R_n) as required by Bennett's expression that depends on both potential energies as U_n-U_m. Therefore, for a simulation of microstate m, a vector D leading from R_m to R_n is defined and the energy difference is obtained by U_n(R_m+D)-U_m(R_m), and similarly U_m(R_n-D)-U_n(R_n) for a simulation of microstate n. Specifically

\frac{Z_{n}}{Z_{m}} = \frac{{〈 M_{T} [U_{n} (R_{m} + D) - U_{m} (R_{m})] 〉}_{n}}{{〈 M_{T} [U_{m} (R_{n} - D) - U_{n} (R_{n})] 〉}_{m}}

(34)

where M_T is the Metropolis function, M_T(ΔE)=min[1,exp(ΔE/k_BT)] and <>_n and <>_m denote averages over m and n, respectively. These ideas have been investigated recently further by Ytreberg and Zuckerman [134] who calculated free energy and entropy differences between microstates of peptides described by internal coordinates using a “single state shifting protocol”. However, the results depend on the number (and type) of shifted coordinates, for which no selection criterion has been provided.

Finally, we provide a short list of other recent methods for calculating F (S). Two methods involve TI [135,136] and one is based on the identity 1/Q=<w(p,x)exp[H(p,x)/k_BT]>, where Q is the partition function, p is the momentum and w(p,x) is a weight function [137]; this identity is used in HSMC(D) [117] and was used before with various w functions. With another method the joint probability density is represented by a two-dimensional Fourier series [138], and in a fourth method energy decomposition approach is used for evaluating S [139]. We mention also two methods for calculating the absolute protein-ligand binding free energy [140,141] and a method for calculating free energy profiles of enzymatic reactions by the linear response approximation [142].

IV. Conclusions

In this review we have discussed the difficulties in calculating the entropy S and the free energy F focusing on the related problems involved in the definition and simulation of microstates of peptides and proteins. While the review is broad, the emphasis is on efficiency issues related to recently developed techniques, in particular techniques for calculating the absolute F (S), where our HSMC(D) method is discussed in some detail. We describe equilibrium and non-equilibrium techniques for calculating the relative binding free energy of ligands to an active site; however, methods for calculating the standard absolute binding free energy have not been covered. Also, we do not elaborate on practical aspects of protein-ligand (DNA-ligand, etc.) interactions, such as modeling of the solvent and calculating its contribution to the free energy. These topics are dealt more extensively in other recent reviews [8,9].

In this context one should emphasize the strong effects of modeling (in particular of electrostatic interactions) on the results for F (S) and other thermodynamic and structural properties. In fact, incompatibility of theoretical results with experimental data due to unreliable modeling can be much more severe than method-related inaccuracies in the calculation of F (S). Therefore, to gain progress in computational structural biology, the existing force fields and solvation models should be improved, efficient techniques for simulation of biological macromolecules should be devised, as well as better techniques for calculating F (S).

TI is the most general methodology, which in many cases is also the easiest to implement. Furthermore, various versions of TI (in particular procedures for calculating the relative free energy of ligands bound to an active site) are already programmed in the commonly used molecular mechanics/molecular dynamics software packages (see II.2.a). Among the TI based techniques, AIM [63,93] appears to be very efficient (at least for the systems studied), but it has not been applied as yet to biological molecules. Simulation methods (e.g., the multicanonical method) that lead to an efficient conformational search and based (like AIM) on an adaptive buildup of the (relative) free energy, have been applied to small biological macromolecules - peptides and loops (see II.2.b); these simulations have been performed with in-house programs. Also, path-based limitations in TI have led to the development of techniques for computing the absolute F (S). Thus, calculation of ΔF_mn=F_m-F_n, which avoids the need to carry out reversible (or non-reversible) thermodynamic integration, has clear advantages, as discussed earlier. However, for an N-atom system the fluctuation in S_m (and practically also in an approximate F_m) is ∼N^1/2 and for large N estimating small ΔF_mn values would be unfeasible. Also, most methods for calculating the absolute F_m (S_m) discussed here are not applicable (at least as yet) to diffusive systems (e.g., water) and further developments in this direction are needed. Moreover, many methods do not provide criteria for estimating their accuracy; the QH method, which belongs to this category, should be used with caution [105]. In this respect HSMC(D) [29-32,116-119,120,121] (which still needs further development) has clear advantages: it is applicable to diffusive systems and to any chain flexibility (microstates as well as the random coil state), and it provides self-checking means for estimating its accuracy. Programming of QH, LS, and other techniques in this category (e.g., Bennett's procedure) is relatively easy and is usually carried out in-house. HSMC(D) is being developed within the framework of TINKER [62] and the software will become available when completed.

Acknowledgments

We thank Dan Zuckerman, Marty Ytreberg, and Mihail Mihailescu for valuable discussions. This work was partially supported by National Science Foundation Information Technology Research Grant, NSF 0225636 and NIH grant 2 R01 GM066090-04 A2.

Abbreviations

MC: Monte Carlo
MD: Molecular dynamics
HS: Hypothetical scanning method
HSMC(D): Hypothetical scanning MC (MD) method
LS: Local states method
AIM: Adaptive integration method
TI: Thermodynamic integration
FEP: Free energy perturbation

References

1.Beveridge DL, DiCapua FM. Free energy via molecular simulation: applications to chemical and biomolecular systems. Annu Rev Biophys Biophys Chem. 1989;18:431–492. doi: 10.1146/annurev.bb.18.060189.002243. [DOI] [PubMed] [Google Scholar]
2.Kollman PA. Free energy calculations: applications to chemical and biochemical phenomena. Chem Rev. 1993;93:2395–2417. [Google Scholar]
3.Jorgensen WL. Free energy calculations: a breakthrough for modeling organic chemistry in solution. Acc Chem Res. 1989;22:184–189. [Google Scholar]
4.Meirovitch H. Calculation of the free energy and entropy of macromolecular systems by computer simulation. In: Lipkowitz KB, Boyd DB, editors. Reviews in Computational Chemistry. Vol. 12. Wiley-VCH; New York: 1998. pp. 1–74. [Google Scholar]
5.Gilson MK, Given JA, Bush BL, McCammon JA. The statistical thermodynamic basis for computing of binding affinities: A critical review. Biophys J. 1997;72:1047–1069. doi: 10.1016/S0006-3495(97)78756-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Boresch S, Tttinger F, eitgeb M, Karplus M. Absolute binding free energies: A qualitative approach for their calculation. J Phys Chem B. 2003;107:9535–9551. [Google Scholar]
7.Meirovitch H. Recent developments in methodologies for calculating entropy and free energy of biological systems by computer simulation. Curr Opinion in Struct Biol. 2007;17:181–186. doi: 10.1016/j.sbi.2007.03.016. [DOI] [PubMed] [Google Scholar]
8.Gilson MK, Zhou HX. Calculation of protein-ligand binding affinities. Ann Rev Biophys Biomol Struct. 2007;36:21–42. doi: 10.1146/annurev.biophys.36.040306.132550. [DOI] [PubMed] [Google Scholar]
9.Foloppe N, Hubbard R. Towards predictive ligand design with free-energy based Computational methods? Curr Med Chemistry. 2007;13:3583–3608. doi: 10.2174/092986706779026165. [DOI] [PubMed] [Google Scholar]
10.Metropolis N, Rosenbluth AW, Rosenbluth MN, Teller AH, Teller E. Equation of state calculations by fast computing machines. J Chem Phys. 1953;21:1087–1092. [Google Scholar]
11.Alder BJ, Wainwright TE. Studies of molecular dynamics. I. General method. J Chem Phys. 1959;31:459–466. [Google Scholar]
12.McCammon JA, Gelin BR, Karplus M. Dynamics of folded proteins. Nature. 1977;267:585–590. doi: 10.1038/267585a0. [DOI] [PubMed] [Google Scholar]
13.Meirovitch H, Alexandrowicz Z. On the zero fluctuation of the microscopic free energy and its potential use. J Stat Phys. 1976;15:123–127. [Google Scholar]
14.Meirovitch H. Simulation of a free energy upper bound, based on the anti-correlation between an approximate free energy functional and its fluctuation. J Chem Phys. 1999;111:7215–7224. [Google Scholar]
15.Elber R, Karplus M. Multiple conformational states of proteins - a molecular dynamics analysis of myoglobin. Science. 1987;235:318–321. doi: 10.1126/science.3798113. [DOI] [PubMed] [Google Scholar]
16.Stillinger FH, Weber TA. Packing structures and transitions in liquids and solids. Science. 1984;225:983–989. doi: 10.1126/science.225.4666.983. [DOI] [PubMed] [Google Scholar]
17.Getzoff ED, Geysen HM, Rodda SJ, Alexander H, Tainer JA, Lerner RA. Mechanisms of antibody binding to a protein. Science. 1987;235:1191–1196. doi: 10.1126/science.3823879. [DOI] [PubMed] [Google Scholar]
18.Rini JM, Schulze-Gahmen U, Wilson IA. Structural evidence for induced fit as a mechanism for antibody- antigen recognition. Science. 1992;255:959–965. doi: 10.1126/science.1546293. [DOI] [PubMed] [Google Scholar]
19.Constantine KL, Friedrichs MS, Wittekind M, Jamil H, Chu CH, Parker RA, Goldfarb V, Mueller L, Farmer BT. Backbone and side chain dynamics of uncomplexed human adipocyte and muscle fatty acid-binding proteins. Biochemistry. 1998;37:7965–7980. doi: 10.1021/bi980203o. [DOI] [PubMed] [Google Scholar]
20.Korzhnev DM, Salvatella X, Vendruscolo M, Di Nardo AA, Davidson AR, Dobson CM, Kay LE. Low populated folding intermediates of Fyn SH3 characterized by relaxation dispersion NMR. Nature. 2004;430:586–590. [Google Scholar]
21.Eisenmesser EZ, Millet O, Labeikovski W, Korzhnev DM, Wolf-Watz M, Bosco DA, Skalicky JJ, Kay LE, Kern D. Intrinsic dynamics of an enzyme underlies catalysis. Nature. 2005;438:117–121. doi: 10.1038/nature04105. [DOI] [PubMed] [Google Scholar]
22.Meirovitch H, Vásquez M, Scheraga HA. Stability of polypeptides conformational states as determined by computer simulation of the free energy. Biopolymers. 1987;26:651–671. doi: 10.1002/bip.360260508. [DOI] [PubMed] [Google Scholar]
23.Meirovitch H, Kitson DH, Hagler AT. Computer simulation of the entropy of polypeptides using the local states method: Application to Cyclo-(Ala-Pro-D-Phe)2 in vacuum and the crystal. J Am Chem Soc. 1992;114:5386–5399. [Google Scholar]
24.Meirovitch H, Koerber SC, Rivier J, Hagler AT. Computer simulation of the free energy of peptides with the local states method: Analogues of gonadotropin releasing hormone in the random coil and stable states. Biopolymers. 1994;34:815–839. doi: 10.1002/bip.360340703. [DOI] [PubMed] [Google Scholar]
25.Meirovitch H, Meirovitch E. New theoretical methodology for elucidating the solution structure of peptides from NMR data. III. Solvation effects. J Phys Chem. 1996;100:5123–5133. doi: 10.1002/(sici)1097-0282(199601)38:1<69::aid-bip6>3.0.co;2-u. [DOI] [PubMed] [Google Scholar]
26.Meirovitch H, Hendrickson TF. The backbone entropy of loops as a measure of their flexibility. Application to a ras protein simulated by molecular dynamics. Proteins. 1997;29:127–140. [PubMed] [Google Scholar]
27.Baysal C, Meirovitch H. Free energy based populations of interconverting microstates of a cyclic peptide lead to the experimental NMR data. Biopolymers. 1999;50:329–344. doi: 10.1002/(SICI)1097-0282(199909)50:3<329::AID-BIP8>3.0.CO;2-4. [DOI] [PubMed] [Google Scholar]
28.Baysal C, Meirovitch H. Ab initio structure prediction of a cyclic pentapeptide in DMSO based on an implicit solvation model. Biopolymers. 2000;53:423–433. doi: 10.1002/(SICI)1097-0282(20000415)53:5<423::AID-BIP6>3.0.CO;2-C. [DOI] [PubMed] [Google Scholar]
29.Cheluvaraja S, Meirovitch H. Simulation method for calculating the entropy and free energy of peptides and proteins. Proc Natl Acad Sci USA. 2004;101:9241–9246. doi: 10.1073/pnas.0308201101. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Cheluvaraja S, Meirovitch H. Calculation of the entropy and free energy by the hypothetical scanning Monte Carlo Method: Application to peptides. J Chem Phys. 2005;122:054903–14. doi: 10.1063/1.1835911. [DOI] [PubMed] [Google Scholar]
31.Cheluvaraja S, Meirovitch H. Calculation of the entropy and free energy from Monte Carlo simulations of a peptide stretched by an external force. J Phys Chem B. 2005;109:21963–21970. doi: 10.1021/jp052969l. [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Cheluvaraja S, Meirovitch H. Calculation of the entropy and free energy of peptides by molecular dynamics simulations using the hypothetical scanning molecular dynamics method. J Chem Phys. 2006;125:024905–13. doi: 10.1063/1.2208608. [DOI] [PubMed] [Google Scholar]
33.Garcia AE, Sanbonmatsu KY. Exploring the energy landscape of a beta hairpin in explicit solvent. Proteins. 2001;42:345–354. doi: 10.1002/1097-0134(20010215)42:3<345::aid-prot50>3.0.co;2-h. [DOI] [PubMed] [Google Scholar]
34.Berg BA, Neuhaus T. Multicanonical algorithms for first order phase transition. Phys Lett B. 1991;267:249–253. doi: 10.1103/PhysRevLett.68.9. [DOI] [PubMed] [Google Scholar]
35.Ikeda K, Galzitskaya OV, Nakamura H, Higo J. β-hairpins, a-helices, and the intermediates among the secondary structures in the energy landscape of a peptide from a distal β-hairpin of SH3 domain. J Comput Chem. 2003;24:310–318. doi: 10.1002/jcc.10160. [DOI] [PubMed] [Google Scholar]
36.Nguyen PH, Stock G, Mittag E, Hu CK, Li MS. Free energy landscape and fold mechanism of a β-hairpin in explicit water: a replica exchange molecular dynamics study. Proteins. 2005;61:795–808. doi: 10.1002/prot.20696. [DOI] [PubMed] [Google Scholar]
37.Lange OF, Grubmüller H. Collective Langevin dynamics of conformational motions in proteins. J Chem Phys. 2006;124:214903–18. doi: 10.1063/1.2199530. [DOI] [PubMed] [Google Scholar]
38.MacDonald IR, Singer K. Machine calculation of thermodynamic properties of a simple fluid. J Chem Phys. 1967;47:4766–4772. [Google Scholar]
39.Hansen JP, Verlet L. Phase transition of the Lennard –Jones system. Phys Rev. 1969;184:151–161. [Google Scholar]
40.Hoover WG, Ree FH. Use of computer experiments to locate the melting transition and calculate the entropy in the solid phase. J Chem Phys. 1967;47:4873–4878. [Google Scholar]
41.Allen MP, Tildesley DJ. Computer Simulation of Liquids. Clarenden Press; Oxford: 1987. [Google Scholar]
42.Kirkwood JG. Statistical mechanics of fluid mixtures. J Chem Phys. 1935;3:300–313. [Google Scholar]
43.Zwanzig RW. High-temperature equation of state by a perturbation method. I. Nonpolar gases. J Chem Phys. 1954;22:1420–1426. [Google Scholar]
44.Squire DR, Hoover WG. Monte Carlo simulation of vacancies in rare-gas crystals. J Chem Phys. 1969;50:701–706. [Google Scholar]
45.Torrie GM, Valleau JP. Monte Carlo free energy estimates using non-Boltzmann sampling. Application to the subcritical Lennard-Jones fluid. Chem Phys Lett. 1974;28:578–581. [Google Scholar]
46.Torrie GM, Valleau JP. Nonphysical sampling distributions in Monte Carlo free energy estimation: Umbrella sampling. J Comp Phys. 1977;23:187–199. [Google Scholar]
47.Jarzynski C. Nonequilibrium equality for free energy differences. Phys Rev Lett. 1997;78:2690–2693. [Google Scholar]
48.Ferrenberg AM, Swendsen RH. Optimized Monte Carlo data analysis. Phys Rev Lett. 1989;63:1195–1198. doi: 10.1103/PhysRevLett.63.1195. [DOI] [PubMed] [Google Scholar]
49.Kumar S, Rosenberg JM, Bouzida D, Swendsen RH, Kolmann PA. Multidimensional free energy calculations using the weighted histogram analysis method. J Comput Chem. 1995;16:1339–1350. [Google Scholar]
50.Kumar S, Payne PW, Vásquez M. Method for free energy calculations using iterative technique. J Comput Chem. 1996;17:1269–1275. [Google Scholar]
51.Fleischman SH, Brooks CL., III Thermodynamics of aqueous solvation: Solution properties of alcohols and alkanes. J Chem Phys. 1987;87:3029–3037. [Google Scholar]
52.Peter C, Oostenbrink C, van Dorp A, van Gunsteren WF. Estimating entropies from molecular dynamics simulations. J Chem Phys. 2004;120:2652–2661. doi: 10.1063/1.1636153. [DOI] [PubMed] [Google Scholar]
53.Wan S, Stote RH, Karplus M. Calculation of the aqueous solvation energy and entropy, as well as free energy of simple polar solutes. J Chem Phys. 2004;121:9539–9548. doi: 10.1063/1.1789935. [DOI] [PubMed] [Google Scholar]
54.Miyamoto S, Kollman PA. Absolute and relative binding free energy calculations of the interaction of biotin and its analogs with streptavidin using molecular dynamics/free energy perturbation approaches. Proteins. 1993;16:226–245. doi: 10.1002/prot.340160303. [DOI] [PubMed] [Google Scholar]
55.Miyamoto S, Kollman PA. What determines the strength of noncovalent association of ligands to proteins in aqueous solution. Proc Natl Acad Sci USA. 1993;90:8402–8406. doi: 10.1073/pnas.90.18.8402. [DOI] [PMC free article] [PubMed] [Google Scholar]
56.Cornell WD, Cieplak P, Bayly CI, Gould IR, Merz KM, Jr, Ferguson DM, Spellmeyer DC, Fox T, Caldwell JW, Kollman PA. A second generation force field for the simulation of proteins,nucleic acids, and organic molecules. J Am Chem Soc. 1995;117:5179–5197. [Google Scholar]
57.MacKerell AD, Jr, Bashford D, Bellott M, Dunbrack RL, Jr, Evanseck JD, Field MJ, Fischer S, Gao J, Guo H, Ha S, Joseph-McCarthy D, Kuchnir L, Kuczera K, Lau FTK, Mattos C, Michnick S, Ngo T, Nguyen DT, Prodhom B, Reiher WE, III, Roux B, Schlenkrich M, Smith JC, Stote R, Straub J, Watanabe M, Wiórkiewicz-Kuczera J, Yin D, Karplus M. All-atom empirical potential for molecular modeling and dynamics studies of proteins. J Phys Chem B. 1998;102:3586–3616. doi: 10.1021/jp973084f. [DOI] [PubMed] [Google Scholar]
58.Phillips JC, Braun R, Wang W, Gumbart J, Tajkhorshid E, Villa E, Chipot C, Skeel RD, Kalé L, Schulten K. Scalable molecular dynamics with NAMD. J Comput Chem. 2005;26:1781–1802. doi: 10.1002/jcc.20289. [DOI] [PMC free article] [PubMed] [Google Scholar]
59.Jorgensen WL, Tirado-Rives J. Molecular modeling of organic and biomolecular systems using BOSS and MCPRO. J Comput Chem. 2005;26:1689–1700. doi: 10.1002/jcc.20297. [DOI] [PubMed] [Google Scholar]
60.Christen M, Hünenberger PH, Bakowies D, Baron R, Bürgi R, Geerke DP, Heinz TN, Kastenholz MA, Kraütler V, Oostenbrink C, Peter C, Trzesniak D, Van Gunsteren WF. The GROMOS software for biomolecular simulation: GROMOS05. J Comput Chem. 2005;26:1719–1751. doi: 10.1002/jcc.20303. [DOI] [PubMed] [Google Scholar]
61.Van Der Spoel D, Lindahl E, Hess B, Groenhof G, Mark AE, Berendsen HJC. GROMACS: Fast, flexible, and free. J Comput Chem. 2005;26:1701–1718. doi: 10.1002/jcc.20291. [DOI] [PubMed] [Google Scholar]
62.Ponder JW. TINKER - software tools for molecular design, version 4.2. 2004 doi: 10.1021/acs.jctc.8b00529. [DOI] [PMC free article] [PubMed] [Google Scholar]
63.Fasnacht M, Swendsen RH, Rosenberg JM. Adaptive integration method for Monte Carlo simulations. Phys Rev E. 2004;69:056704–15. doi: 10.1103/PhysRevE.69.056704. [DOI] [PubMed] [Google Scholar]
64.Lyubartsev AP, Martsinovski AA, Shevkunov SV, Vorontsov-Velyaminov PN. New approach to Monte Carlo calculation of the free energy: Method of expanded ensembles. J Chem Phys. 1992;96:1776–1783. [Google Scholar]
65.Marinari E, Parisi G. Simulated Tempering: a New Monte Carlo Scheme. Europhys Lett. 1992;19:451–455. [Google Scholar]
66.Wang F, Landau DP. Efficient multiple-range random walk algorithm to calculate the density of states. Phys Rev Lett. 2001;86:2050–2053. doi: 10.1103/PhysRevLett.86.2050. [DOI] [PubMed] [Google Scholar]
67.Rathore N, Knotts TA, III, de Pablo JJ. Density of states simulations of proteins. J Chem Phys. 2003;118:4285–4290. doi: 10.1016/S0006-3495(03)74810-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
68.Mastny EA, de Pablo JJ. Direct calculation of solid-liquid equilibria from density of states Monte Carlo simulations. J Chem Phys. 2005;122:124109–6. doi: 10.1063/1.1874792. [DOI] [PubMed] [Google Scholar]
69.Chopra M, Müller M, de Pablo JJ. Order-parameter-based Monte Carlo simulation of crystallization. J Chem Phys. 2006;124:134102–8. doi: 10.1063/1.2178324. [DOI] [PubMed] [Google Scholar]
70.Fenwick MK, Escobedo FA. Expanded ensemble and replica exchange methods for simulation of protein-like systems. J Chem Phys. 2003;119:11998–12010. [Google Scholar]
71.Fenwick MK, Escobedo FA. On the use of Bennett's acceptance ratio method in multi-canonical-type simulations. J Chem Phys. 2004;120:3066–3074. doi: 10.1063/1.1641000. [DOI] [PubMed] [Google Scholar]
72.Gospodinov ID, Escobedo FA. Multicanonical schemes for mapping out free energy landscapes of single-component and multicomponent systems. J Chem Phys. 2005;122:164103–10. doi: 10.1063/1.1884594. [DOI] [PubMed] [Google Scholar]
73.Abreu CRA, Escobedo FA. A general framework for non-Boltzmann Monte Carlo sampling. J Chem Phys. 2006;124:054116–12. doi: 10.1063/1.2165188. [DOI] [PubMed] [Google Scholar]
74.Escobedo FA. Optimized expanded ensembles for simulations involving molecular insertions and deletions. II. Open systems. J Chem Phys. 2007;127:174104–12. doi: 10.1063/1.2800321. [DOI] [PubMed] [Google Scholar]
75.Bennett CH. Efficient estimation of free energy differences from Monte Carlo data. J Comput Phys. 1976;22:245–268. [Google Scholar]
76.Mitsutake A, Sugita Y, Okamoto Y. Replica-exchange multicanonical and multicanonical replica-exchange Monte Carlo simulations of peptides. II. Application to a more complex system. J Chem Phys. 2003;118:6676–6688. [Google Scholar]
77.Sugita Y, Okamoto Y. Molecular mechanism for stabilizing a short helical peptide studied by generalized-ensemble simulations with explicit solvent. Biophys J. 2005;88:3180–3190. doi: 10.1529/biophysj.104.049429. [DOI] [PMC free article] [PubMed] [Google Scholar]
78.Yoda T, Sugita Y, Okamoto Y. Cooperative folding mechanism of a β-hairpin peptide studied by a multicanonical replica-exchange molecular dynamics simulation. Proteins. 2007;66:846–859. doi: 10.1002/prot.21264. [DOI] [PubMed] [Google Scholar]
79.Nakajima N, Nakamura H, Kidera A. Multicanonical ensemble generated by molecular dynamics simulation for enhanced sampling of peptides. J Phys Chem B. 1997;101:817–824. [Google Scholar]
80.Fukunishi Y, Mikami Y, Nakamura H. The filling potential method: A method for estimating the free energy surface for protein-ligand docking. J Phys Chem B. 2003;107:13201–13210. [Google Scholar]
81.Watanabe YS, Kim JG, Fukunishi Y, Nakamura H. Free energy landscapes of small peptides in an implicit solvent model determined by force-biased multicanonical molecular dynamics simulation. Chem Phys Lett. 2004;400:258–263. [Google Scholar]
82.Kamiya N, Yonezawa Y, Nakamura H, Higo J. Protein-inhibitor flexible docking by a multicanonical sampling: Native complex structure with the lowest free energy and a free-energy barrier distinguishing the native complex from the others. Proteins. 2008;70:41–53. doi: 10.1002/prot.21409. [DOI] [PubMed] [Google Scholar]
83.Ytreberg FM, Zuckerman DM. Efficient use of nonequilibrium measurement to estimate free energy differences for molecular systems. J Comput Chem. 2004;25:1749–1759. doi: 10.1002/jcc.20103. [DOI] [PubMed] [Google Scholar]
84.Ytreberg FM, Zuckerman DM. Single-ensemble nonequilibrium path-sampling estimates of free energy differences. J Chem Phys. 2004;120:10876–10879. doi: 10.1063/1.1760511. [DOI] [PubMed] [Google Scholar]
85.Jarzynski C. Rare events and convergence of exponentially averaged work values. Phys Rev E. 2006;73:046105–10. doi: 10.1103/PhysRevE.73.046105. [DOI] [PubMed] [Google Scholar]
86.Wu D, Kofke DA. Rosenbluth-sampled nonequilibrium work method for Calculation of free energies in molecular simulation. J Chem Phys. 2005;122:204104–13. doi: 10.1063/1.1906209. [DOI] [PubMed] [Google Scholar]
87.Oberhofer H, Dellago C, Geissler P. Biased sampling of nonequilibrium trajectories: can fast switching simulations outperform conventional free energy calculations methods? J Phys Chem. 2005;109:6902–6915. doi: 10.1021/jp044556a. [DOI] [PubMed] [Google Scholar]
88.Lechner W, Oberhofer H, Dellago C, Geissler P. Equilibrium free energies from fast-switching trajectories with large time step. J Chem Phys. 2006;124:044113–12. doi: 10.1063/1.2162874. 2006. [DOI] [PubMed] [Google Scholar]
89.MacFadyen J, Andricioaei I. A skewed-momenta method to efficiently generate conformational-transition trajectories. J Chem Phys. 2005;123:074107–9. doi: 10.1063/1.2000242. [DOI] [PubMed] [Google Scholar]
90.Adib AB. Free energy surfaces from nonequilibrium processes without work measurement. J Chem Phys. 2006;124:144111–5. doi: 10.1063/1.2188946. [DOI] [PubMed] [Google Scholar]
91.Shirts MR, Pande VS. Comparison of efficiency and bias of free energies computed by exponential averaging, the Bennett acceptance ratio, and thermodynamics integration. J Chem Phys. 2005;122:144107–16. doi: 10.1063/1.1873592. [DOI] [PubMed] [Google Scholar]
92.Jayachandran G, Shirts MR, Park S, Pande VJ. Parallelized-over-parts computation of absolute binding free energy with docking and molecular dynamics. J Chem Phys. 2006;125:084901–12. doi: 10.1063/1.2221680. [DOI] [PubMed] [Google Scholar]
93.Ytreberg FM, Swendsen RH, Zuckerman DM. Comparison of free energy methods for molecular simulations. J Chem Phys. 2006;125:184114–11. doi: 10.1063/1.2378907. [DOI] [PubMed] [Google Scholar]
94.Zhang D, Gullingsrud J, McCammon JA. Potential of mean force for a acetylcholine unbinding from the alpha7 nicotinic acetylcholine receptor ligand-binding domain. J Am Chem Soc. 2006;128:3019–3026. doi: 10.1021/ja057292u. [DOI] [PMC free article] [PubMed] [Google Scholar]
95.Xiong H, Crespo A, Marti M, Estrin D, Roitberg AE. Free energy calculations with non-equilibrium methods: applications of the Jarzynski relationship. Theor Chem Acc. 2006;116:338–346. [Google Scholar]
96.Gō N, Scheraga HA. Analysis of the contribution of internal vibrations to the statistical weights of equilibrium conformations of macromolecules. J Chem Phys. 1969;51:4751–4767. [Google Scholar]
97.Gō N, Scheraga HA. On the use of classical statistical mechanics in the treatment of polymer chain conformation. Macromolecules. 1976;9:535–542. [Google Scholar]
98.Hagler AT, Stern PS, Sharon R, Becker JM, Naider F. Computer simulation of the conformational properties of oligopeptides. Comparison of theoretical methods and analysis of experimental results. J Am Chem Soc. 1979;101:6842–6852. [Google Scholar]
99.Chang CE, Gilson MK. Tork: Conformational analysis method for molecules and complexes. J Comput Chem. 2003;24:1987–1998. doi: 10.1002/jcc.10325. [DOI] [PubMed] [Google Scholar]
100.Chen W, Chang CE, Gilson MK. Concepts in receptor optimization: targeting the RGD peptide. J Am Chem Soc. 2005;128:4675–4684. doi: 10.1021/ja056600l. [DOI] [PubMed] [Google Scholar]
101.Karplus M, Kushick JN. Method for estimating the configurational entropy of macromolecules. Macromolecules. 1981;14:325–332. [Google Scholar]
102.Rojas OL, Levy RM, Szabo A. Corrections to the quasiharmonic approximation for evaluating molecular entropies. J Chem Phys. 1986;85:1037–1043. [Google Scholar]
103.Schäfer H, Mark AE, van Gunsteren WF. Absolute entropies from molecular dynamics simulation trajectories. J Chem Phys. 2000;113:7809–7817. [Google Scholar]
104.Schäfer H, Daura X, Mark AE, van Gunsteren WF. Entropy calculations on a reversibly folding peptide: Changes in solute free energy cannot explain folding behavior. Proteins. 2001;43:45–56. doi: 10.1002/1097-0134(20010401)43:1<45::aid-prot1016>3.0.co;2-n. [DOI] [PubMed] [Google Scholar]
105.Chang CE, Chen W, Gilson MK. Evaluating the accuracy of the quasiharmonic approximation. J Chem Theory Comput. 2005;1:1017–1028. doi: 10.1021/ct0500904. [DOI] [PubMed] [Google Scholar]
106.Schlitter J. Estimation of absolute and relative entropies of macromolecules using the covariance matrix. Chem Phys Lett. 1993;215:617–621. [Google Scholar]
107.Andricioaei I, Kaplus M. On the calculation of entropy from covariance matrices of the atomic fluctuations. J Chem Phys. 2001;115:6289–6292. [Google Scholar]
108.Carlsson J, Åqvist J. Absolute and relative entropies from computer simulation with applications to ligand binding. J Phys Chem B. 2005;109:6448–6456. doi: 10.1021/jp046022f. [DOI] [PubMed] [Google Scholar]
109.Wang J, Brüschweiler R. 2D entropy of discrete molecular ensembles. J Chem Theory Comput. 2006;2:18–24. doi: 10.1021/ct050118b. [DOI] [PubMed] [Google Scholar]
110.Li DW, Khanlarzadeh M, Wang J, Huo S, Brüschweiler R. Evaluation of configurational entropy methods from peptide folding-unfolding simulation. J Phys Chem B. 2007;111:13807–13813. doi: 10.1021/jp075220e. [DOI] [PubMed] [Google Scholar]
111.Meirovitch H. Calculation of entropy with computer simulation methods. Chem Phys Lett. 1977;45:389–392. [Google Scholar]
112.Meirovitch H. Computer simulation of the free energy of polymer chains with excluded volume and with finite interactions. Phys Rev A. 1985;32:3709–3715. doi: 10.1103/physreva.32.3709. [DOI] [PubMed] [Google Scholar]
113.Meirovitch H, Scheraga HA. Computer simulation of the entropy of continuum chain models: the two dimensional freely-jointed chain of hard disks. J Chem Phys. 1986;84:6369–6375. [Google Scholar]
114.Meirovitch H. Methods for estimating the entropy with computer simulation. The simple cubic Ising lattice. J Phys A. 1983;16:839–846. [Google Scholar]
115.Meirovitch H. On the simulation of the entropy of macromolecules with different flexibilities. J Chem Phys. 2001;114:3859–3867. [Google Scholar]
116.White RP, Meirovitch H. Absolute entropy and free energy of fluids using the hypothetical scanning method. II. Transition probabilities from canonical Monte Carlo simulations of partial systems. J Chem Phys. 2003;119:12096–12105. [Google Scholar]
117.White RP, Meirovitch H. Lower and upper bounds for the absolute free energy by the hypothetical scanning Monte Carlo method: Application to liquid argon and water. J Chem Phys. 2004;121:10889–10904. doi: 10.1063/1.1814355. [DOI] [PubMed] [Google Scholar]
118.White RP, Meirovitch H. Calculation of the entropy of random coil polymers with the hypothetical scanning Monte Carlo Method. J Chem Phys. 2005;123:214908–11. doi: 10.1063/1.2132285. [DOI] [PMC free article] [PubMed] [Google Scholar]
119.White RP, Meirovitch H. Free volume hypothetical scanning molecular dynamics method for the absolute free energy of liquids. J Chem Phys. 2006;124:204108–13. doi: 10.1063/1.2199529. [DOI] [PMC free article] [PubMed] [Google Scholar]
120.Cheluvaraja S, Meirovitch H. Stability of the free and bound microstates of a mobile loop of α-amylase obtained from the absolute entropy and free energy. J Chem Theory Comput. 2008;4:192–208. doi: 10.1021/ct700116n. [DOI] [PubMed] [Google Scholar]
121.Cheluvaraja S, Mihailescu M, Meirovitch H. Entropy and free energy of a mobile loop in explicit water. J Phys Chem. 2008;112:9512–9522. doi: 10.1021/jp801827f. [DOI] [PMC free article] [PubMed] [Google Scholar]
122.Meirovitch H. A new method for simulation of real chains. Scanning future steps. J Phys A. 1982;15:L735–L740. [Google Scholar]
123.Meirovitch H. Statistical properties of the scanning simulation method for polymer chains. J Chem Phys. 1988;89:2514–2522. [Google Scholar]
124.Meirovitch H, Vásquez M, Scheraga HA. Stability of polypeptides conformational states: II. The free energy of the statistical coil obtained by the scanning simulation method. Biopolymers. 1988;27:1189–1204. doi: 10.1002/bip.360270802. [DOI] [PubMed] [Google Scholar]
125.Jorgensen WL, Chandrasekhar J, Madura JD. Comparison of simple potential functions for simulating liquid water. J Chem Phys. 1983;79:926–935. [Google Scholar]
126.Szarecka A, White RP, Meirovitch H. Absolute entropy and free energy of fluids using the hypothetical scanning method. I. Calculation of transition probabilities from local grand canonical partition functions. J Chem Phys. 2003;119:12084–12095. [Google Scholar]
127.Chorin AJ. Partition functions and equilibrium measures in two-dimensional and quasi-three-dimensional turbulence. Phys Fluids. 1996;8:2656–2660. [Google Scholar]
128.Hnizdo V, Darian E, Fedorowicz A, Demchuk E, Li S, Singh H. Nearest-neighbor nonparametric method for estimating the configurational entropy of complex molecules. J Comput Chem. 2007;28:655–668. doi: 10.1002/jcc.20589. [DOI] [PubMed] [Google Scholar]
129.Killian BJ, Kravitz JY, Gilson MK. Extraction of configurational entropy from molecular simulations via an expansion approximation. J Chem Phys. 2007;127:024107–16. doi: 10.1063/1.2746329. [DOI] [PMC free article] [PubMed] [Google Scholar]
130.Stoessel JP, Novak P. Absolute free energies in biomolecular systems. Macromolecules. Macromolecules. 1990;23:1961–1965. [Google Scholar]
131.Tyka MD, Clarke AR, Sessions RB. An efficient path-independent method for free energy calculations. J Phys Chem B. 2006;110:17212–17220. doi: 10.1021/jp060734j. [DOI] [PubMed] [Google Scholar]
132.Ytreberg FM, Zuckerman DM. Simple estimation of absolute free energies for biomolecules. J Chem Phys. 2006;124:104105–5. doi: 10.1063/1.2174008. [DOI] [PubMed] [Google Scholar]
133.Voter AF. A Monte Carlo method for determining free energy differences and transition state theory rate constants. J Chem Phys. 1985;82:1890–1899. [Google Scholar]
134.Ytreberg FM, Zuckerman DM. Peptide conformational equilibria computed via a single-stage shifting protocol. J Phys Chem B. 2005;109:9096–9103. doi: 10.1021/jp0510692. [DOI] [PubMed] [Google Scholar]
135.Rodinger T, Howell PL, Pomes R. Absolute free energy calculations by thermodynamic integration in four spatial dimensions. J Chem Phys. 2005;123:034104–11. doi: 10.1063/1.1946750. [DOI] [PubMed] [Google Scholar]
136.Blondel A. Ensemble variance in free energy calculations by thermodynamic integration: theory, optimal “alchemical” path, and practical solutions. J Comput Chem. 2004;25:985–993. doi: 10.1002/jcc.20025. [DOI] [PubMed] [Google Scholar]
137.Huang L, Makarov DE. On the calculation of absolute free energy from molecular-dynamics or Monte Carlo data. J Chem Phys. 2006;124:064108–9. doi: 10.1063/1.2166397. [DOI] [PubMed] [Google Scholar]
138.Darian E, Hnizdo V, Fedorowicz A, Singh H, Demchuck E. Estimation of the absolute internal-rotation entropy of molecules with two torsional degrees of freedom from stochastic simulations. J Comput Chem. 2005;26:651–660. doi: 10.1002/jcc.20198. [DOI] [PubMed] [Google Scholar]
139.Ohkubo YZ, Thorpe IF. Evaluating the conformational entropy of macromolecules using an energy decomposition approach. J Chem Phys. 2006;124:024910–6. doi: 10.1063/1.2138696. [DOI] [PubMed] [Google Scholar]
140.Lee MS, Olson MA. Calculation of absolute protein-ligand binding affinity using path and endpoint approaches. Biophys J. 2006;90:864–877. doi: 10.1529/biophysj.105.071589. [DOI] [PMC free article] [PubMed] [Google Scholar]
141.Woo HJ, Roux B. Calculation of absolute protein-ligand binding free energy from computer simulation. Proc Natl Acad Sci USA. 2005;102:6825–6830. doi: 10.1073/pnas.0409005102. [DOI] [PMC free article] [PubMed] [Google Scholar]
142.Rosta E, Klähn M, Warshel A. Toward accurate ab initio QM/MM calculations of free energy profiles of enzymatic reactions. J Phys Chem B. 2006;110:2934–2941. doi: 10.1021/jp057109j. [DOI] [PubMed] [Google Scholar]

[R1] 1.Beveridge DL, DiCapua FM. Free energy via molecular simulation: applications to chemical and biomolecular systems. Annu Rev Biophys Biophys Chem. 1989;18:431–492. doi: 10.1146/annurev.bb.18.060189.002243. [DOI] [PubMed] [Google Scholar]

[R2] 2.Kollman PA. Free energy calculations: applications to chemical and biochemical phenomena. Chem Rev. 1993;93:2395–2417. [Google Scholar]

[R3] 3.Jorgensen WL. Free energy calculations: a breakthrough for modeling organic chemistry in solution. Acc Chem Res. 1989;22:184–189. [Google Scholar]

[R4] 4.Meirovitch H. Calculation of the free energy and entropy of macromolecular systems by computer simulation. In: Lipkowitz KB, Boyd DB, editors. Reviews in Computational Chemistry. Vol. 12. Wiley-VCH; New York: 1998. pp. 1–74. [Google Scholar]

[R5] 5.Gilson MK, Given JA, Bush BL, McCammon JA. The statistical thermodynamic basis for computing of binding affinities: A critical review. Biophys J. 1997;72:1047–1069. doi: 10.1016/S0006-3495(97)78756-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R6] 6.Boresch S, Tttinger F, eitgeb M, Karplus M. Absolute binding free energies: A qualitative approach for their calculation. J Phys Chem B. 2003;107:9535–9551. [Google Scholar]

[R7] 7.Meirovitch H. Recent developments in methodologies for calculating entropy and free energy of biological systems by computer simulation. Curr Opinion in Struct Biol. 2007;17:181–186. doi: 10.1016/j.sbi.2007.03.016. [DOI] [PubMed] [Google Scholar]

[R8] 8.Gilson MK, Zhou HX. Calculation of protein-ligand binding affinities. Ann Rev Biophys Biomol Struct. 2007;36:21–42. doi: 10.1146/annurev.biophys.36.040306.132550. [DOI] [PubMed] [Google Scholar]

[R9] 9.Foloppe N, Hubbard R. Towards predictive ligand design with free-energy based Computational methods? Curr Med Chemistry. 2007;13:3583–3608. doi: 10.2174/092986706779026165. [DOI] [PubMed] [Google Scholar]

[R10] 10.Metropolis N, Rosenbluth AW, Rosenbluth MN, Teller AH, Teller E. Equation of state calculations by fast computing machines. J Chem Phys. 1953;21:1087–1092. [Google Scholar]

[R11] 11.Alder BJ, Wainwright TE. Studies of molecular dynamics. I. General method. J Chem Phys. 1959;31:459–466. [Google Scholar]

[R12] 12.McCammon JA, Gelin BR, Karplus M. Dynamics of folded proteins. Nature. 1977;267:585–590. doi: 10.1038/267585a0. [DOI] [PubMed] [Google Scholar]

[R13] 13.Meirovitch H, Alexandrowicz Z. On the zero fluctuation of the microscopic free energy and its potential use. J Stat Phys. 1976;15:123–127. [Google Scholar]

[R14] 14.Meirovitch H. Simulation of a free energy upper bound, based on the anti-correlation between an approximate free energy functional and its fluctuation. J Chem Phys. 1999;111:7215–7224. [Google Scholar]

[R15] 15.Elber R, Karplus M. Multiple conformational states of proteins - a molecular dynamics analysis of myoglobin. Science. 1987;235:318–321. doi: 10.1126/science.3798113. [DOI] [PubMed] [Google Scholar]

[R16] 16.Stillinger FH, Weber TA. Packing structures and transitions in liquids and solids. Science. 1984;225:983–989. doi: 10.1126/science.225.4666.983. [DOI] [PubMed] [Google Scholar]

[R17] 17.Getzoff ED, Geysen HM, Rodda SJ, Alexander H, Tainer JA, Lerner RA. Mechanisms of antibody binding to a protein. Science. 1987;235:1191–1196. doi: 10.1126/science.3823879. [DOI] [PubMed] [Google Scholar]

[R18] 18.Rini JM, Schulze-Gahmen U, Wilson IA. Structural evidence for induced fit as a mechanism for antibody- antigen recognition. Science. 1992;255:959–965. doi: 10.1126/science.1546293. [DOI] [PubMed] [Google Scholar]

[R19] 19.Constantine KL, Friedrichs MS, Wittekind M, Jamil H, Chu CH, Parker RA, Goldfarb V, Mueller L, Farmer BT. Backbone and side chain dynamics of uncomplexed human adipocyte and muscle fatty acid-binding proteins. Biochemistry. 1998;37:7965–7980. doi: 10.1021/bi980203o. [DOI] [PubMed] [Google Scholar]

[R20] 20.Korzhnev DM, Salvatella X, Vendruscolo M, Di Nardo AA, Davidson AR, Dobson CM, Kay LE. Low populated folding intermediates of Fyn SH3 characterized by relaxation dispersion NMR. Nature. 2004;430:586–590. [Google Scholar]

[R21] 21.Eisenmesser EZ, Millet O, Labeikovski W, Korzhnev DM, Wolf-Watz M, Bosco DA, Skalicky JJ, Kay LE, Kern D. Intrinsic dynamics of an enzyme underlies catalysis. Nature. 2005;438:117–121. doi: 10.1038/nature04105. [DOI] [PubMed] [Google Scholar]

[R22] 22.Meirovitch H, Vásquez M, Scheraga HA. Stability of polypeptides conformational states as determined by computer simulation of the free energy. Biopolymers. 1987;26:651–671. doi: 10.1002/bip.360260508. [DOI] [PubMed] [Google Scholar]

[R23] 23.Meirovitch H, Kitson DH, Hagler AT. Computer simulation of the entropy of polypeptides using the local states method: Application to Cyclo-(Ala-Pro-D-Phe)2 in vacuum and the crystal. J Am Chem Soc. 1992;114:5386–5399. [Google Scholar]

[R24] 24.Meirovitch H, Koerber SC, Rivier J, Hagler AT. Computer simulation of the free energy of peptides with the local states method: Analogues of gonadotropin releasing hormone in the random coil and stable states. Biopolymers. 1994;34:815–839. doi: 10.1002/bip.360340703. [DOI] [PubMed] [Google Scholar]

[R25] 25.Meirovitch H, Meirovitch E. New theoretical methodology for elucidating the solution structure of peptides from NMR data. III. Solvation effects. J Phys Chem. 1996;100:5123–5133. doi: 10.1002/(sici)1097-0282(199601)38:1<69::aid-bip6>3.0.co;2-u. [DOI] [PubMed] [Google Scholar]

[R26] 26.Meirovitch H, Hendrickson TF. The backbone entropy of loops as a measure of their flexibility. Application to a ras protein simulated by molecular dynamics. Proteins. 1997;29:127–140. [PubMed] [Google Scholar]

[R27] 27.Baysal C, Meirovitch H. Free energy based populations of interconverting microstates of a cyclic peptide lead to the experimental NMR data. Biopolymers. 1999;50:329–344. doi: 10.1002/(SICI)1097-0282(199909)50:3<329::AID-BIP8>3.0.CO;2-4. [DOI] [PubMed] [Google Scholar]

[R28] 28.Baysal C, Meirovitch H. Ab initio structure prediction of a cyclic pentapeptide in DMSO based on an implicit solvation model. Biopolymers. 2000;53:423–433. doi: 10.1002/(SICI)1097-0282(20000415)53:5<423::AID-BIP6>3.0.CO;2-C. [DOI] [PubMed] [Google Scholar]

[R29] 29.Cheluvaraja S, Meirovitch H. Simulation method for calculating the entropy and free energy of peptides and proteins. Proc Natl Acad Sci USA. 2004;101:9241–9246. doi: 10.1073/pnas.0308201101. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R30] 30.Cheluvaraja S, Meirovitch H. Calculation of the entropy and free energy by the hypothetical scanning Monte Carlo Method: Application to peptides. J Chem Phys. 2005;122:054903–14. doi: 10.1063/1.1835911. [DOI] [PubMed] [Google Scholar]

[R31] 31.Cheluvaraja S, Meirovitch H. Calculation of the entropy and free energy from Monte Carlo simulations of a peptide stretched by an external force. J Phys Chem B. 2005;109:21963–21970. doi: 10.1021/jp052969l. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R32] 32.Cheluvaraja S, Meirovitch H. Calculation of the entropy and free energy of peptides by molecular dynamics simulations using the hypothetical scanning molecular dynamics method. J Chem Phys. 2006;125:024905–13. doi: 10.1063/1.2208608. [DOI] [PubMed] [Google Scholar]

[R33] 33.Garcia AE, Sanbonmatsu KY. Exploring the energy landscape of a beta hairpin in explicit solvent. Proteins. 2001;42:345–354. doi: 10.1002/1097-0134(20010215)42:3<345::aid-prot50>3.0.co;2-h. [DOI] [PubMed] [Google Scholar]

[R34] 34.Berg BA, Neuhaus T. Multicanonical algorithms for first order phase transition. Phys Lett B. 1991;267:249–253. doi: 10.1103/PhysRevLett.68.9. [DOI] [PubMed] [Google Scholar]

[R35] 35.Ikeda K, Galzitskaya OV, Nakamura H, Higo J. β-hairpins, a-helices, and the intermediates among the secondary structures in the energy landscape of a peptide from a distal β-hairpin of SH3 domain. J Comput Chem. 2003;24:310–318. doi: 10.1002/jcc.10160. [DOI] [PubMed] [Google Scholar]

[R36] 36.Nguyen PH, Stock G, Mittag E, Hu CK, Li MS. Free energy landscape and fold mechanism of a β-hairpin in explicit water: a replica exchange molecular dynamics study. Proteins. 2005;61:795–808. doi: 10.1002/prot.20696. [DOI] [PubMed] [Google Scholar]

[R37] 37.Lange OF, Grubmüller H. Collective Langevin dynamics of conformational motions in proteins. J Chem Phys. 2006;124:214903–18. doi: 10.1063/1.2199530. [DOI] [PubMed] [Google Scholar]

[R38] 38.MacDonald IR, Singer K. Machine calculation of thermodynamic properties of a simple fluid. J Chem Phys. 1967;47:4766–4772. [Google Scholar]

[R39] 39.Hansen JP, Verlet L. Phase transition of the Lennard –Jones system. Phys Rev. 1969;184:151–161. [Google Scholar]

[R40] 40.Hoover WG, Ree FH. Use of computer experiments to locate the melting transition and calculate the entropy in the solid phase. J Chem Phys. 1967;47:4873–4878. [Google Scholar]

[R41] 41.Allen MP, Tildesley DJ. Computer Simulation of Liquids. Clarenden Press; Oxford: 1987. [Google Scholar]

[R42] 42.Kirkwood JG. Statistical mechanics of fluid mixtures. J Chem Phys. 1935;3:300–313. [Google Scholar]

[R43] 43.Zwanzig RW. High-temperature equation of state by a perturbation method. I. Nonpolar gases. J Chem Phys. 1954;22:1420–1426. [Google Scholar]

[R44] 44.Squire DR, Hoover WG. Monte Carlo simulation of vacancies in rare-gas crystals. J Chem Phys. 1969;50:701–706. [Google Scholar]

[R45] 45.Torrie GM, Valleau JP. Monte Carlo free energy estimates using non-Boltzmann sampling. Application to the subcritical Lennard-Jones fluid. Chem Phys Lett. 1974;28:578–581. [Google Scholar]

[R46] 46.Torrie GM, Valleau JP. Nonphysical sampling distributions in Monte Carlo free energy estimation: Umbrella sampling. J Comp Phys. 1977;23:187–199. [Google Scholar]

[R47] 47.Jarzynski C. Nonequilibrium equality for free energy differences. Phys Rev Lett. 1997;78:2690–2693. [Google Scholar]

[R48] 48.Ferrenberg AM, Swendsen RH. Optimized Monte Carlo data analysis. Phys Rev Lett. 1989;63:1195–1198. doi: 10.1103/PhysRevLett.63.1195. [DOI] [PubMed] [Google Scholar]

[R49] 49.Kumar S, Rosenberg JM, Bouzida D, Swendsen RH, Kolmann PA. Multidimensional free energy calculations using the weighted histogram analysis method. J Comput Chem. 1995;16:1339–1350. [Google Scholar]

[R50] 50.Kumar S, Payne PW, Vásquez M. Method for free energy calculations using iterative technique. J Comput Chem. 1996;17:1269–1275. [Google Scholar]

[R51] 51.Fleischman SH, Brooks CL., III Thermodynamics of aqueous solvation: Solution properties of alcohols and alkanes. J Chem Phys. 1987;87:3029–3037. [Google Scholar]

[R52] 52.Peter C, Oostenbrink C, van Dorp A, van Gunsteren WF. Estimating entropies from molecular dynamics simulations. J Chem Phys. 2004;120:2652–2661. doi: 10.1063/1.1636153. [DOI] [PubMed] [Google Scholar]

[R53] 53.Wan S, Stote RH, Karplus M. Calculation of the aqueous solvation energy and entropy, as well as free energy of simple polar solutes. J Chem Phys. 2004;121:9539–9548. doi: 10.1063/1.1789935. [DOI] [PubMed] [Google Scholar]

[R54] 54.Miyamoto S, Kollman PA. Absolute and relative binding free energy calculations of the interaction of biotin and its analogs with streptavidin using molecular dynamics/free energy perturbation approaches. Proteins. 1993;16:226–245. doi: 10.1002/prot.340160303. [DOI] [PubMed] [Google Scholar]

[R55] 55.Miyamoto S, Kollman PA. What determines the strength of noncovalent association of ligands to proteins in aqueous solution. Proc Natl Acad Sci USA. 1993;90:8402–8406. doi: 10.1073/pnas.90.18.8402. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R56] 56.Cornell WD, Cieplak P, Bayly CI, Gould IR, Merz KM, Jr, Ferguson DM, Spellmeyer DC, Fox T, Caldwell JW, Kollman PA. A second generation force field for the simulation of proteins,nucleic acids, and organic molecules. J Am Chem Soc. 1995;117:5179–5197. [Google Scholar]

[R57] 57.MacKerell AD, Jr, Bashford D, Bellott M, Dunbrack RL, Jr, Evanseck JD, Field MJ, Fischer S, Gao J, Guo H, Ha S, Joseph-McCarthy D, Kuchnir L, Kuczera K, Lau FTK, Mattos C, Michnick S, Ngo T, Nguyen DT, Prodhom B, Reiher WE, III, Roux B, Schlenkrich M, Smith JC, Stote R, Straub J, Watanabe M, Wiórkiewicz-Kuczera J, Yin D, Karplus M. All-atom empirical potential for molecular modeling and dynamics studies of proteins. J Phys Chem B. 1998;102:3586–3616. doi: 10.1021/jp973084f. [DOI] [PubMed] [Google Scholar]

[R58] 58.Phillips JC, Braun R, Wang W, Gumbart J, Tajkhorshid E, Villa E, Chipot C, Skeel RD, Kalé L, Schulten K. Scalable molecular dynamics with NAMD. J Comput Chem. 2005;26:1781–1802. doi: 10.1002/jcc.20289. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R59] 59.Jorgensen WL, Tirado-Rives J. Molecular modeling of organic and biomolecular systems using BOSS and MCPRO. J Comput Chem. 2005;26:1689–1700. doi: 10.1002/jcc.20297. [DOI] [PubMed] [Google Scholar]

[R60] 60.Christen M, Hünenberger PH, Bakowies D, Baron R, Bürgi R, Geerke DP, Heinz TN, Kastenholz MA, Kraütler V, Oostenbrink C, Peter C, Trzesniak D, Van Gunsteren WF. The GROMOS software for biomolecular simulation: GROMOS05. J Comput Chem. 2005;26:1719–1751. doi: 10.1002/jcc.20303. [DOI] [PubMed] [Google Scholar]

[R61] 61.Van Der Spoel D, Lindahl E, Hess B, Groenhof G, Mark AE, Berendsen HJC. GROMACS: Fast, flexible, and free. J Comput Chem. 2005;26:1701–1718. doi: 10.1002/jcc.20291. [DOI] [PubMed] [Google Scholar]

[R62] 62.Ponder JW. TINKER - software tools for molecular design, version 4.2. 2004 doi: 10.1021/acs.jctc.8b00529. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R63] 63.Fasnacht M, Swendsen RH, Rosenberg JM. Adaptive integration method for Monte Carlo simulations. Phys Rev E. 2004;69:056704–15. doi: 10.1103/PhysRevE.69.056704. [DOI] [PubMed] [Google Scholar]

[R64] 64.Lyubartsev AP, Martsinovski AA, Shevkunov SV, Vorontsov-Velyaminov PN. New approach to Monte Carlo calculation of the free energy: Method of expanded ensembles. J Chem Phys. 1992;96:1776–1783. [Google Scholar]

[R65] 65.Marinari E, Parisi G. Simulated Tempering: a New Monte Carlo Scheme. Europhys Lett. 1992;19:451–455. [Google Scholar]

[R66] 66.Wang F, Landau DP. Efficient multiple-range random walk algorithm to calculate the density of states. Phys Rev Lett. 2001;86:2050–2053. doi: 10.1103/PhysRevLett.86.2050. [DOI] [PubMed] [Google Scholar]

[R67] 67.Rathore N, Knotts TA, III, de Pablo JJ. Density of states simulations of proteins. J Chem Phys. 2003;118:4285–4290. doi: 10.1016/S0006-3495(03)74810-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R68] 68.Mastny EA, de Pablo JJ. Direct calculation of solid-liquid equilibria from density of states Monte Carlo simulations. J Chem Phys. 2005;122:124109–6. doi: 10.1063/1.1874792. [DOI] [PubMed] [Google Scholar]

[R69] 69.Chopra M, Müller M, de Pablo JJ. Order-parameter-based Monte Carlo simulation of crystallization. J Chem Phys. 2006;124:134102–8. doi: 10.1063/1.2178324. [DOI] [PubMed] [Google Scholar]

[R70] 70.Fenwick MK, Escobedo FA. Expanded ensemble and replica exchange methods for simulation of protein-like systems. J Chem Phys. 2003;119:11998–12010. [Google Scholar]

[R71] 71.Fenwick MK, Escobedo FA. On the use of Bennett's acceptance ratio method in multi-canonical-type simulations. J Chem Phys. 2004;120:3066–3074. doi: 10.1063/1.1641000. [DOI] [PubMed] [Google Scholar]

[R72] 72.Gospodinov ID, Escobedo FA. Multicanonical schemes for mapping out free energy landscapes of single-component and multicomponent systems. J Chem Phys. 2005;122:164103–10. doi: 10.1063/1.1884594. [DOI] [PubMed] [Google Scholar]

[R73] 73.Abreu CRA, Escobedo FA. A general framework for non-Boltzmann Monte Carlo sampling. J Chem Phys. 2006;124:054116–12. doi: 10.1063/1.2165188. [DOI] [PubMed] [Google Scholar]

[R74] 74.Escobedo FA. Optimized expanded ensembles for simulations involving molecular insertions and deletions. II. Open systems. J Chem Phys. 2007;127:174104–12. doi: 10.1063/1.2800321. [DOI] [PubMed] [Google Scholar]

[R75] 75.Bennett CH. Efficient estimation of free energy differences from Monte Carlo data. J Comput Phys. 1976;22:245–268. [Google Scholar]

[R76] 76.Mitsutake A, Sugita Y, Okamoto Y. Replica-exchange multicanonical and multicanonical replica-exchange Monte Carlo simulations of peptides. II. Application to a more complex system. J Chem Phys. 2003;118:6676–6688. [Google Scholar]

[R77] 77.Sugita Y, Okamoto Y. Molecular mechanism for stabilizing a short helical peptide studied by generalized-ensemble simulations with explicit solvent. Biophys J. 2005;88:3180–3190. doi: 10.1529/biophysj.104.049429. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R78] 78.Yoda T, Sugita Y, Okamoto Y. Cooperative folding mechanism of a β-hairpin peptide studied by a multicanonical replica-exchange molecular dynamics simulation. Proteins. 2007;66:846–859. doi: 10.1002/prot.21264. [DOI] [PubMed] [Google Scholar]

[R79] 79.Nakajima N, Nakamura H, Kidera A. Multicanonical ensemble generated by molecular dynamics simulation for enhanced sampling of peptides. J Phys Chem B. 1997;101:817–824. [Google Scholar]

[R80] 80.Fukunishi Y, Mikami Y, Nakamura H. The filling potential method: A method for estimating the free energy surface for protein-ligand docking. J Phys Chem B. 2003;107:13201–13210. [Google Scholar]

[R81] 81.Watanabe YS, Kim JG, Fukunishi Y, Nakamura H. Free energy landscapes of small peptides in an implicit solvent model determined by force-biased multicanonical molecular dynamics simulation. Chem Phys Lett. 2004;400:258–263. [Google Scholar]

[R82] 82.Kamiya N, Yonezawa Y, Nakamura H, Higo J. Protein-inhibitor flexible docking by a multicanonical sampling: Native complex structure with the lowest free energy and a free-energy barrier distinguishing the native complex from the others. Proteins. 2008;70:41–53. doi: 10.1002/prot.21409. [DOI] [PubMed] [Google Scholar]

[R83] 83.Ytreberg FM, Zuckerman DM. Efficient use of nonequilibrium measurement to estimate free energy differences for molecular systems. J Comput Chem. 2004;25:1749–1759. doi: 10.1002/jcc.20103. [DOI] [PubMed] [Google Scholar]

[R84] 84.Ytreberg FM, Zuckerman DM. Single-ensemble nonequilibrium path-sampling estimates of free energy differences. J Chem Phys. 2004;120:10876–10879. doi: 10.1063/1.1760511. [DOI] [PubMed] [Google Scholar]

[R85] 85.Jarzynski C. Rare events and convergence of exponentially averaged work values. Phys Rev E. 2006;73:046105–10. doi: 10.1103/PhysRevE.73.046105. [DOI] [PubMed] [Google Scholar]

[R86] 86.Wu D, Kofke DA. Rosenbluth-sampled nonequilibrium work method for Calculation of free energies in molecular simulation. J Chem Phys. 2005;122:204104–13. doi: 10.1063/1.1906209. [DOI] [PubMed] [Google Scholar]

[R87] 87.Oberhofer H, Dellago C, Geissler P. Biased sampling of nonequilibrium trajectories: can fast switching simulations outperform conventional free energy calculations methods? J Phys Chem. 2005;109:6902–6915. doi: 10.1021/jp044556a. [DOI] [PubMed] [Google Scholar]

[R88] 88.Lechner W, Oberhofer H, Dellago C, Geissler P. Equilibrium free energies from fast-switching trajectories with large time step. J Chem Phys. 2006;124:044113–12. doi: 10.1063/1.2162874. 2006. [DOI] [PubMed] [Google Scholar]

[R89] 89.MacFadyen J, Andricioaei I. A skewed-momenta method to efficiently generate conformational-transition trajectories. J Chem Phys. 2005;123:074107–9. doi: 10.1063/1.2000242. [DOI] [PubMed] [Google Scholar]

[R90] 90.Adib AB. Free energy surfaces from nonequilibrium processes without work measurement. J Chem Phys. 2006;124:144111–5. doi: 10.1063/1.2188946. [DOI] [PubMed] [Google Scholar]

[R91] 91.Shirts MR, Pande VS. Comparison of efficiency and bias of free energies computed by exponential averaging, the Bennett acceptance ratio, and thermodynamics integration. J Chem Phys. 2005;122:144107–16. doi: 10.1063/1.1873592. [DOI] [PubMed] [Google Scholar]

[R92] 92.Jayachandran G, Shirts MR, Park S, Pande VJ. Parallelized-over-parts computation of absolute binding free energy with docking and molecular dynamics. J Chem Phys. 2006;125:084901–12. doi: 10.1063/1.2221680. [DOI] [PubMed] [Google Scholar]

[R93] 93.Ytreberg FM, Swendsen RH, Zuckerman DM. Comparison of free energy methods for molecular simulations. J Chem Phys. 2006;125:184114–11. doi: 10.1063/1.2378907. [DOI] [PubMed] [Google Scholar]

[R94] 94.Zhang D, Gullingsrud J, McCammon JA. Potential of mean force for a acetylcholine unbinding from the alpha7 nicotinic acetylcholine receptor ligand-binding domain. J Am Chem Soc. 2006;128:3019–3026. doi: 10.1021/ja057292u. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R95] 95.Xiong H, Crespo A, Marti M, Estrin D, Roitberg AE. Free energy calculations with non-equilibrium methods: applications of the Jarzynski relationship. Theor Chem Acc. 2006;116:338–346. [Google Scholar]

[R96] 96.Gō N, Scheraga HA. Analysis of the contribution of internal vibrations to the statistical weights of equilibrium conformations of macromolecules. J Chem Phys. 1969;51:4751–4767. [Google Scholar]

[R97] 97.Gō N, Scheraga HA. On the use of classical statistical mechanics in the treatment of polymer chain conformation. Macromolecules. 1976;9:535–542. [Google Scholar]

[R98] 98.Hagler AT, Stern PS, Sharon R, Becker JM, Naider F. Computer simulation of the conformational properties of oligopeptides. Comparison of theoretical methods and analysis of experimental results. J Am Chem Soc. 1979;101:6842–6852. [Google Scholar]

[R99] 99.Chang CE, Gilson MK. Tork: Conformational analysis method for molecules and complexes. J Comput Chem. 2003;24:1987–1998. doi: 10.1002/jcc.10325. [DOI] [PubMed] [Google Scholar]

[R100] 100.Chen W, Chang CE, Gilson MK. Concepts in receptor optimization: targeting the RGD peptide. J Am Chem Soc. 2005;128:4675–4684. doi: 10.1021/ja056600l. [DOI] [PubMed] [Google Scholar]

[R101] 101.Karplus M, Kushick JN. Method for estimating the configurational entropy of macromolecules. Macromolecules. 1981;14:325–332. [Google Scholar]

[R102] 102.Rojas OL, Levy RM, Szabo A. Corrections to the quasiharmonic approximation for evaluating molecular entropies. J Chem Phys. 1986;85:1037–1043. [Google Scholar]

[R103] 103.Schäfer H, Mark AE, van Gunsteren WF. Absolute entropies from molecular dynamics simulation trajectories. J Chem Phys. 2000;113:7809–7817. [Google Scholar]

[R104] 104.Schäfer H, Daura X, Mark AE, van Gunsteren WF. Entropy calculations on a reversibly folding peptide: Changes in solute free energy cannot explain folding behavior. Proteins. 2001;43:45–56. doi: 10.1002/1097-0134(20010401)43:1<45::aid-prot1016>3.0.co;2-n. [DOI] [PubMed] [Google Scholar]

[R105] 105.Chang CE, Chen W, Gilson MK. Evaluating the accuracy of the quasiharmonic approximation. J Chem Theory Comput. 2005;1:1017–1028. doi: 10.1021/ct0500904. [DOI] [PubMed] [Google Scholar]

[R106] 106.Schlitter J. Estimation of absolute and relative entropies of macromolecules using the covariance matrix. Chem Phys Lett. 1993;215:617–621. [Google Scholar]

[R107] 107.Andricioaei I, Kaplus M. On the calculation of entropy from covariance matrices of the atomic fluctuations. J Chem Phys. 2001;115:6289–6292. [Google Scholar]

[R108] 108.Carlsson J, Åqvist J. Absolute and relative entropies from computer simulation with applications to ligand binding. J Phys Chem B. 2005;109:6448–6456. doi: 10.1021/jp046022f. [DOI] [PubMed] [Google Scholar]

[R109] 109.Wang J, Brüschweiler R. 2D entropy of discrete molecular ensembles. J Chem Theory Comput. 2006;2:18–24. doi: 10.1021/ct050118b. [DOI] [PubMed] [Google Scholar]

[R110] 110.Li DW, Khanlarzadeh M, Wang J, Huo S, Brüschweiler R. Evaluation of configurational entropy methods from peptide folding-unfolding simulation. J Phys Chem B. 2007;111:13807–13813. doi: 10.1021/jp075220e. [DOI] [PubMed] [Google Scholar]

[R111] 111.Meirovitch H. Calculation of entropy with computer simulation methods. Chem Phys Lett. 1977;45:389–392. [Google Scholar]

[R112] 112.Meirovitch H. Computer simulation of the free energy of polymer chains with excluded volume and with finite interactions. Phys Rev A. 1985;32:3709–3715. doi: 10.1103/physreva.32.3709. [DOI] [PubMed] [Google Scholar]

[R113] 113.Meirovitch H, Scheraga HA. Computer simulation of the entropy of continuum chain models: the two dimensional freely-jointed chain of hard disks. J Chem Phys. 1986;84:6369–6375. [Google Scholar]

[R114] 114.Meirovitch H. Methods for estimating the entropy with computer simulation. The simple cubic Ising lattice. J Phys A. 1983;16:839–846. [Google Scholar]

[R115] 115.Meirovitch H. On the simulation of the entropy of macromolecules with different flexibilities. J Chem Phys. 2001;114:3859–3867. [Google Scholar]

[R116] 116.White RP, Meirovitch H. Absolute entropy and free energy of fluids using the hypothetical scanning method. II. Transition probabilities from canonical Monte Carlo simulations of partial systems. J Chem Phys. 2003;119:12096–12105. [Google Scholar]

[R117] 117.White RP, Meirovitch H. Lower and upper bounds for the absolute free energy by the hypothetical scanning Monte Carlo method: Application to liquid argon and water. J Chem Phys. 2004;121:10889–10904. doi: 10.1063/1.1814355. [DOI] [PubMed] [Google Scholar]

[R118] 118.White RP, Meirovitch H. Calculation of the entropy of random coil polymers with the hypothetical scanning Monte Carlo Method. J Chem Phys. 2005;123:214908–11. doi: 10.1063/1.2132285. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R119] 119.White RP, Meirovitch H. Free volume hypothetical scanning molecular dynamics method for the absolute free energy of liquids. J Chem Phys. 2006;124:204108–13. doi: 10.1063/1.2199529. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R120] 120.Cheluvaraja S, Meirovitch H. Stability of the free and bound microstates of a mobile loop of α-amylase obtained from the absolute entropy and free energy. J Chem Theory Comput. 2008;4:192–208. doi: 10.1021/ct700116n. [DOI] [PubMed] [Google Scholar]

[R121] 121.Cheluvaraja S, Mihailescu M, Meirovitch H. Entropy and free energy of a mobile loop in explicit water. J Phys Chem. 2008;112:9512–9522. doi: 10.1021/jp801827f. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R122] 122.Meirovitch H. A new method for simulation of real chains. Scanning future steps. J Phys A. 1982;15:L735–L740. [Google Scholar]

[R123] 123.Meirovitch H. Statistical properties of the scanning simulation method for polymer chains. J Chem Phys. 1988;89:2514–2522. [Google Scholar]

[R124] 124.Meirovitch H, Vásquez M, Scheraga HA. Stability of polypeptides conformational states: II. The free energy of the statistical coil obtained by the scanning simulation method. Biopolymers. 1988;27:1189–1204. doi: 10.1002/bip.360270802. [DOI] [PubMed] [Google Scholar]

[R125] 125.Jorgensen WL, Chandrasekhar J, Madura JD. Comparison of simple potential functions for simulating liquid water. J Chem Phys. 1983;79:926–935. [Google Scholar]

[R126] 126.Szarecka A, White RP, Meirovitch H. Absolute entropy and free energy of fluids using the hypothetical scanning method. I. Calculation of transition probabilities from local grand canonical partition functions. J Chem Phys. 2003;119:12084–12095. [Google Scholar]

[R127] 127.Chorin AJ. Partition functions and equilibrium measures in two-dimensional and quasi-three-dimensional turbulence. Phys Fluids. 1996;8:2656–2660. [Google Scholar]

[R128] 128.Hnizdo V, Darian E, Fedorowicz A, Demchuk E, Li S, Singh H. Nearest-neighbor nonparametric method for estimating the configurational entropy of complex molecules. J Comput Chem. 2007;28:655–668. doi: 10.1002/jcc.20589. [DOI] [PubMed] [Google Scholar]

[R129] 129.Killian BJ, Kravitz JY, Gilson MK. Extraction of configurational entropy from molecular simulations via an expansion approximation. J Chem Phys. 2007;127:024107–16. doi: 10.1063/1.2746329. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R130] 130.Stoessel JP, Novak P. Absolute free energies in biomolecular systems. Macromolecules. Macromolecules. 1990;23:1961–1965. [Google Scholar]

[R131] 131.Tyka MD, Clarke AR, Sessions RB. An efficient path-independent method for free energy calculations. J Phys Chem B. 2006;110:17212–17220. doi: 10.1021/jp060734j. [DOI] [PubMed] [Google Scholar]

[R132] 132.Ytreberg FM, Zuckerman DM. Simple estimation of absolute free energies for biomolecules. J Chem Phys. 2006;124:104105–5. doi: 10.1063/1.2174008. [DOI] [PubMed] [Google Scholar]

[R133] 133.Voter AF. A Monte Carlo method for determining free energy differences and transition state theory rate constants. J Chem Phys. 1985;82:1890–1899. [Google Scholar]

[R134] 134.Ytreberg FM, Zuckerman DM. Peptide conformational equilibria computed via a single-stage shifting protocol. J Phys Chem B. 2005;109:9096–9103. doi: 10.1021/jp0510692. [DOI] [PubMed] [Google Scholar]

[R135] 135.Rodinger T, Howell PL, Pomes R. Absolute free energy calculations by thermodynamic integration in four spatial dimensions. J Chem Phys. 2005;123:034104–11. doi: 10.1063/1.1946750. [DOI] [PubMed] [Google Scholar]

[R136] 136.Blondel A. Ensemble variance in free energy calculations by thermodynamic integration: theory, optimal “alchemical” path, and practical solutions. J Comput Chem. 2004;25:985–993. doi: 10.1002/jcc.20025. [DOI] [PubMed] [Google Scholar]

[R137] 137.Huang L, Makarov DE. On the calculation of absolute free energy from molecular-dynamics or Monte Carlo data. J Chem Phys. 2006;124:064108–9. doi: 10.1063/1.2166397. [DOI] [PubMed] [Google Scholar]

[R138] 138.Darian E, Hnizdo V, Fedorowicz A, Singh H, Demchuck E. Estimation of the absolute internal-rotation entropy of molecules with two torsional degrees of freedom from stochastic simulations. J Comput Chem. 2005;26:651–660. doi: 10.1002/jcc.20198. [DOI] [PubMed] [Google Scholar]

[R139] 139.Ohkubo YZ, Thorpe IF. Evaluating the conformational entropy of macromolecules using an energy decomposition approach. J Chem Phys. 2006;124:024910–6. doi: 10.1063/1.2138696. [DOI] [PubMed] [Google Scholar]

[R140] 140.Lee MS, Olson MA. Calculation of absolute protein-ligand binding affinity using path and endpoint approaches. Biophys J. 2006;90:864–877. doi: 10.1529/biophysj.105.071589. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R141] 141.Woo HJ, Roux B. Calculation of absolute protein-ligand binding free energy from computer simulation. Proc Natl Acad Sci USA. 2005;102:6825–6830. doi: 10.1073/pnas.0409005102. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R142] 142.Rosta E, Klähn M, Warshel A. Toward accurate ab initio QM/MM calculations of free energy profiles of enzymatic reactions. J Phys Chem B. 2006;110:2934–2941. doi: 10.1021/jp057109j. [DOI] [PubMed] [Google Scholar]

PERMALINK

Methods for calculating the entropy and free energy and their application to problems involving protein flexibility and ligand binding

Hagai Meirovitch

Srinath Cheluvaraja

Ronald P White

Abstract

I.Introduction

II. General theoretical considerations

II.1 Statistical mechanics

II.2 The free energy of a partial region of space

II.3 Microstates and intermediate flexibility

Figure (1).

II.4 On the practical definition of a microstate

III. Methods for calculating the free energy

III.1 The counting approach

III.2 The thermodynamic integration approach

III.2.a Advantages of TI and some pitfalls

Figure (2).

III.2.b The Adaptive Integration Method

III.2.c Methods based on Jarzynski's identity

III.3 Calculations of the absolute S and F

III.3.a Harmonic and quasi-harmonic techniques

III.3.b Step-by-step reconstruction methods

III.3.b.1 The exact scanning procedure

III.3.b.2 The philosophy of the hypothetical scanning approach

III.3.b.3 The HSMC(D) method

Figure (3).

III.3.b.4 The cancellation of systematic errors with HSMD

III.3.b.5 The HS and LS methods

III.3.b.6 Calculation of differences Sm - Sn

III.3.b.7 Calculation of the absolute F by TI

III.3.b.8 methods based on Bennett's formula

IV. Conclusions

Acknowledgments

Abbreviations

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

III.3.b.6 Calculation of differences S_m - S_n