Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2001 Mar 27;98(7):3658–3661. doi: 10.1073/pnas.071034098

Free energy reconstruction from nonequilibrium single-molecule pulling experiments

Gerhard Hummer 1,*, Attila Szabo 1
PMCID: PMC31107  PMID: 11274384

Abstract

Laser tweezers and atomic force microscopes are increasingly used to probe the interactions and mechanical properties of individual molecules. Unfortunately, using such time-dependent perturbations to force rare molecular events also drives the system away from equilibrium. Nevertheless, we show how equilibrium free energy profiles can be extracted rigorously from repeated nonequilibrium force measurements on the basis of an extension of Jarzynski's remarkable identity between free energies and the irreversible work.


Recent advances in the micromanipulation of single molecules have led to new insights into the dynamics, interactions, structure, and mechanical properties of individual molecules (14). Single-molecule manipulation with an atomic force microscope (AFM) (59), laser tweezer stretching (10), and analogous computer experiments pioneered by Schulten and coworkers (1113) have revealed details about unfolding and unbinding events of individual proteins and their complexes. In an AFM experiment, a molecule is subjected to a time-varying external force, for instance by pulling on the end of a linear polymer (Fig. 1). The applied force is determined from the time-dependent position of the cantilever tip with respect to the sample. Thus, one can drive rare molecular events (14), determine their force characteristics, and simultaneously monitor them with atomic resolution. However, both experiments and simulations actively perturb the system, leading to hysteresis and nonequilibrium effects.

Figure 1.

Figure 1

Single-molecule force measuring experiments by using AFM (a) and laser tweezers (b). In the AFM experiment (a), the sample is moved at a constant speed v relative to the cantilever with spring constant k. The position zt = vt + δzt of the cantilever tip with respect to the sample is recorded, where δzt is the displacement of the cantilever tip. From repeated measurements of zt, the free energy profile G0(z) of the unperturbed system can be determined exactly (c).

How can one extract equilibrium properties from such measurements that drive the system away from equilibrium (15, 16)? From the second law of thermodynamics, we know that on average, the mechanical work of pulling will be larger than the free energy. Only if the experiment is performed reversibly, i.e., infinitely slowly, will the work equal the free energy. Thus, making rigorous thermodynamic measurements by pulling appears to require an extrapolation to zero pulling speed. However, Jarzynski (17, 18) recently discovered a remarkable identity between thermodynamic free energy differences and the irreversible work, thus extending the inequality of the second law of thermodynamics. This identity, although not directly applicable to atomic force measurements, suggests that in principle one should be able to extract free energy surfaces from repeated pulling experiments. In this paper, we show how this can be done in practice.

Theory.

We begin by showing that Jarzynski's identity follows almost immediately from the Feynman–Kac theorem for path integrals (19). This derivation leads directly to the appropriate extension, which forms the basis for the solution of the free energy reconstruction problem. This extension has been obtained by Crooks (20) as a special case of an even more general relation between forward and backward path averages. Consider a system whose phase–space density evolves according to a Liouville-type equation:

graphic file with name M1.gif 1

t is an explicitly time-dependent evolution operator that has the Boltzmann distribution as a stationary solution, ℒte−βℋ(x,t) = 0, where ℋ(x, t) is a time-dependent Hamiltonian, and β−1 = kBT, with T the temperature and kB Boltzmann's constant. For example, for diffusive dynamics on a potential V(x, t), the time evolution is governed by ℒt = De−βV(x,t)eβV(x,t), where D is the diffusion coefficient and ∇ = ∂/∂x. Other examples include systems that undergo Newtonian, Langevin, Nosé–Hoover thermostat, or Metropolis Monte Carlo dynamics. Now consider the unnormalized Boltzmann distribution at time t,

graphic file with name M2.gif 2

Because this distribution is stationary (ℒtp = 0), and because ∂p/∂t = −β(∂ℋ/∂t)p, it follows that the above p(x, t) is a solution of the sink equation,

graphic file with name M3.gif 3

as can be verified by direct substitution. However, the solution of this sink equation, starting from an equilibrium distribution at time t = 0, can also be expressed as a path integral by using the Feynman–Kac theorem (19). Equating these two different solutions immediately gives:

graphic file with name M4.gif 4

where δ(x) is Dirac's δ distribution. This identity between a weighted average of nonequilibrium trajectories (right) and the equilibrium Boltzmann distribution (left) is implicit in the work of Jarzynski (18), and is given explicitly by Crooks (equation 24, ref. 20). The average 〈…〉 is over an ensemble of trajectories starting from the equilibrium distribution at t = 0 and evolving according to Eq. 1. Each trajectory is weighted with the Boltzmann factor of the external work wt done on the system,

graphic file with name M5.gif 5

Integrating both sides of Eq. 4 with respect to x, we obtain Jarzynski's identity (17, 18)

graphic file with name M6.gif 6

between the Boltzmann-averaged work wt and the equilibrium free energy difference ΔG(t) between times t and 0.

Free Energy Surfaces from Cantilever Positions.

Eq. 4 allows us to reconstruct the underlying free energy surface from repeated single-molecule AFM experiments. The setup in Fig. 1a can be described by a Hamiltonian ℋ(x, t) = ℋ0(x) + k(zvt)2/2, where ℋ0(x) is the Hamiltonian of the resting, unperturbed system, and z = z(x). Here, we have assumed for simplicity that the cantilever is vibrating in a harmonic potential with spring constant k. Substituting this Hamiltonian into Eq. 4, multiplying both sides by δ[zz(x)], integrating over all x, and finally taking the logarithm, we have:

graphic file with name M7.gif 7
graphic file with name M8.gif

where G0(z) is (to within an additive constant) the unperturbed free energy profile along the pulling coordinate z; and Δwt is the external work minus the instantaneous biasing potential, Δwt = wtk(ztvt)2/2 = kv(vt2/2 − ∫Inline graphiczt dt′) − k(ztvt)2/2. At time t = 0, the trajectories are started from points z0 drawn from a Boltzmann distribution corresponding to Hamiltonian ℋ(x, 0) = ℋ0(x) + kz2/2.

Given the positions zt of the cantilever at time t obtained from repeated pulling experiments, we can reconstruct the unperturbed free energy profile G0(z) along z by using Eq. 7. At each time slice t, one can in principle obtain an estimate of the whole free energy surface. In practice, at any given time t, only a small window around the equilibrium position z = vt will be sampled adequately. Thus, an average over several time slices and repeated trajectories is required to obtain an optimal estimate of the free energy surface. At every time slice t, one obtains an ensemble of positions zt and corresponding wts. The positions zt are binned, and the corresponding histogram values are incremented by e−βwt. The complete free energy surface G0(z) can then be reconstructed by adapting the weighted histogram method (21):

graphic file with name M10.gif 8

where the sum is over time slices t and u(z, t) is the time-dependent biasing potential [here: u(z, t) = k(zvt)2/2]. As in the weighted histogram method (21), this procedure can be refined by making Eq. 8 self-consistent through replacement of 〈exp(−βwt)〉 with exp[−βΔG(t)] = ∫exp {−β[u(z, t) + G0(z)]}dz/∫exp{−β[u(z, 0) + G0(z)]}dz, thus requiring an iterative solution for ΔG(t).

To illustrate in detail how this formalism is implemented, we simulate a pulling experiment by conducting Brownian-dynamics simulations on a double-well free energy surface. Ten trajectories were generated on the time-dependent surface V(z) + k(zvt)2/2 with a pulling velocity v = 1 μm s−1, a diffusion coefficient D = 10−7 cm2 s−1, and a cantilever spring constant k = 20.6 pN nm−1 (=0.0206 mN m−1). The pulling velocities and cantilever spring constants are typical of AFM experiments (2, 6). For this diffusion coefficient, the escape rate from the stable minimum is about 10−6 s−1 on the basis of Kramers' theory, in the range of unfolding rates for fairly large proteins under native conditions. The free energy difference between the two states separated by a barrier is ≈14 kcal/mol (at 298 K), again typical of protein unfolding. Nevertheless, this simple one-dimensional model is only a crude approximation to real experiments where several other factors intrude, such as viscous damping of the cantilever motion or slow conformational relaxation orthogonal to the pulling direction (14).

In the pulling simulations, the “cantilever” is moved relative to the “sample” by 20 nm within a time of 20 ms. The position zt and accumulated work wt = kv(vt2/2 − ∫Inline graphiczt dt′) are recorded for 10 runs and analyzed by using Eq. 7. For every pulling trajectory k (k = 1,… ,K; here: K = 10), we determine at discrete times ti (i = 0,… ,N; here: N = 100 time points separated by 0.2 ms) the position of the cantilever zik with respect to the sample, and the corresponding deflection δzik = zikvti. From the deflection, we obtain the instantaneous deflection energy uik = kδzInline graphic/2. The external work is determined by numerical integration: wik = kv[vtInline graphic/2 − ΣInline graphic(tjtj−1)(zjk + zj−1,k)/2] with wik = 0 for i = 0. The results of repeated trajectories are then combined. At every time slice ti, we average 〈exp(−βwt)〉 ≈ ηiK−1ΣInline graphicexp(−βwik). To approximate the average in the top numerator of Eq. 8, we also collect histograms hi(l) at times ti with z-position intervals of Δz:

graphic file with name M16.gif 9

where θl(z) is one if z is in the lth interval [(l − 1)Δzz < lΔz] and zero otherwise. We then estimate the free energy profile by averaging over all time slices ti:

graphic file with name M17.gif 10

Fig. 2 compares the reconstructed and exact free energy surfaces. With the exception of the poorly sampled barrier region because of snapping motion (Fig. 2 Inset), the surface is accurately reproduced. We note that at high pulling speeds, the increasing variance of wt will lead to a systematic bias of the estimator for −β−1ln〈δ(zzt)exp(−βwt)〉, which can be approximately corrected for by using a cumulant expansion.

Figure 2.

Figure 2

Simulated pulling experiment over a 30 kBT barrier. The solid line and symbols show the reference free energy V(z) and the reconstruction from 10 pulling simulations, respectively. Inset shows the force-versus-extension curve for one of the pulling simulations.

Free Energy Surfaces from Pulling Forces.

So far, we made use of position-versus-time curves. However, it is force-versus-position curves that are commonly reported from pulling experiments. As above, we assume a Hamiltonian ℋ(x, t) = ℋ0(x) + u[z(x), t]. The unperturbed system described by ℋ0 is coupled to the measurement apparatus through u[z(x), t], which will typically be a harmonic potential, u[z(x), t] = k(zvt)2/2. The work entering the free energy expression Eq. 9 can be rewritten as

graphic file with name M18.gif 11

where the restoring force is F(z, t) = −∂u(z, t)/∂z, and the integral is along the position-versus-time contour connecting z0 and zt. This identity follows from du = (∂u/∂z)dz + (∂u/∂t)dt. Combining Eqs. 7 and 11 leads to the surprising result that the work entering into the free energy profile G0(z) is not given simply by the force-versus-distance integral, but by an additional factor reflecting the biased choice of the initial state,

graphic file with name M19.gif 12

where the initial points are drawn from a Boltzmann distribution according to ℋ(x, 0) (i.e., for the system interacting with the cantilever, as above). Therefore, if force integrals of two or more trajectories are combined, individual trajectories should be weighted by the reciprocal Boltzmann factor exp[βu(z0, 0)]; i.e., one should subtract from the integrated work the energy stored in the deflected cantilever at time t = 0.

In an AFM experiment, it is thus important to record the zero-force position (δz = 0 in Fig. 1) with the cantilever sufficiently far from the sample. As the cantilever tip is brought into contact with the sample, a sufficiently long relaxation is important to draw initial states from an equilibrium Boltzmann distribution. We note that this new equilibrium will normally not be centered around δz = 0, but according to the combined Hamiltonian ℋ(x, 0) of the sample coupled to the cantilever spring. Nevertheless, the center of this initial distribution can be used to align the z positions of repeated pulling traces. As the cantilever is retracted, both zt and δzt are recorded. With known cantilever spring constant k and pulling speed v, the work can then be obtained from time or position integration.

As a final illustration, we analyze force spectra for extracting individual bacteriorhodopsin molecules from purple membrane patches of Halobacterium salinarum as measured by Oesterhelt et al. (8). In these experiments, a cantilever tip was attached to bacteriorhodopsin, and the protein was slowly pulled out of the membrane. This led to sequential unraveling of the transmembrane helices, resulting in unfolding of the protein. Specifically, we analyze the force-versus-distance curves for one of the experiments (G241C mutant) and estimate a free energy profile for extraction and unfolding of helices D and E, corresponding to the second peak at pulling distances of 20–30 nm in figure 5 of ref. 8. Because the force is fluctuating around the baseline before and after this force peak, we assume (i) that the additional bias u(z0, 0) in Eq. 12 is of the order of kBT and thus negligible on the scale of the integrated work; and that (ii) a quasiequilibrium was established before the force peak. We also assume (iii) that the force curves are aligned. We can then integrate the forces and use Eq. 12 to estimate the underlying free energy profile G0(z), as shown in Fig. 3. Also shown is the mean restoring force, F(z) = dG0(z)/dz, corresponding to reversible pulling. The noise in the force F(z) reflects uncertainties in the z-position alignment and the finite number of measurements. For extraction and stretching of the 57-amino acid stretch with 56 amino acids contained in helices D and E, we estimate a free energy of about 64 kcal/mol, or about 1.1 kcal/mol per residue, and a mean-force peak of about 89 pN. Although these values seem overall reasonable, they should not be considered to be accurate before the above assumptions are removed by appropriate experimental design. In addition, one must account for instrument resolution, which leads to increased variation of the integrated work. Finally, the influence of the pulling speed needs to be investigated. In general, to extract accurate thermodynamic information from a small number of pulling experiments, it will be important to achieve small standard deviations of the work, possibly of a few kBT, by pulling sufficiently slowly.

Figure 3.

Figure 3

Free energy G0(z) (solid line, right-hand scale) and mean restoring force F(z) = dG0(z)/dz (dashed line, left-hand scale) of extracting and unfolding the D and E helix of the G241C mutant of bacteriorhodopsin. Eq. 12 was used to estimate G0(z) from linear approximations (Inset) to the second peak of eight force-versus-distance curves reported in figure 5, Oesterhelt et al. (8).

Conclusions

We have shown how free energy surfaces can be reconstructed rigorously from repeated molecular pulling experiments. With this formalism, thermodynamic properties such as binding equilibria can be extracted from repeated measurements by using AFM (2, 59, 12), laser tweezers (1, 4), or magnetic beads (3). In addition, this work has implications in the area of computer simulations. For example, when a time-dependent harmonic potential is added to the Hamiltonian as above, our procedure is a dynamic generalization of umbrella sampling (22). The weighted-histogram method permits the combination of dynamic umbrella sampling runs with different pulling speeds or directions. Moreover, this generalization has the immediate advantage that no equilibration is required in different windows, and thus the entire trajectory can be used to reconstruct the underlying free energy surface.

Abbreviation

AFM

atomic force microscopy

Footnotes

See commentary on page 3636.

References

  • 1.Perkins T T, Smith D E, Chu S. Science. 1994;264:819–822. doi: 10.1126/science.8171335. [DOI] [PubMed] [Google Scholar]
  • 2.Florin E L, Moy V T, Gaub H E. Science. 1994;264:415–417. doi: 10.1126/science.8153628. [DOI] [PubMed] [Google Scholar]
  • 3.Strick T R, Allemand J F, Bensimon D, Bensimon A, Croquette V. Science. 1996;271:1835–1837. doi: 10.1126/science.271.5257.1835. [DOI] [PubMed] [Google Scholar]
  • 4.Smith S B, Cui Y J, Bustamante C. Science. 1996;271:795–799. doi: 10.1126/science.271.5250.795. [DOI] [PubMed] [Google Scholar]
  • 5.Tskhovrebova L, Trinick J, Sleep J A, Simmons R M. Nature (London) 1997;387:308–312. doi: 10.1038/387308a0. [DOI] [PubMed] [Google Scholar]
  • 6.Rief M, Gautel M, Oesterhelt F, Fernandez J M, Gaub H E. Science. 1997;276:1109–1112. doi: 10.1126/science.276.5315.1109. [DOI] [PubMed] [Google Scholar]
  • 7.Oberhauser A F, Marszalek P E, Erickson H P, Fernandez J M. Nature (London) 1998;393:181–185. doi: 10.1038/30270. [DOI] [PubMed] [Google Scholar]
  • 8.Oesterhelt F, Oesterhelt D, Pfeiffer M, Engel A, Gaub H E, Müller D J. Science. 2000;288:143–146. doi: 10.1126/science.288.5463.143. [DOI] [PubMed] [Google Scholar]
  • 9.Merkel R, Nassoy P, Leung A, Ritchie K, Evans E. Nature (London) 1999;397:50–53. doi: 10.1038/16219. [DOI] [PubMed] [Google Scholar]
  • 10.Kellermayer M S Z, Smith S B, Granzier H L, Bustamante C. Science. 1997;276:1112–1116. doi: 10.1126/science.276.5315.1112. [DOI] [PubMed] [Google Scholar]
  • 11.Isralewitz B, Izrailev S, Schulten K. Biophys J. 1997;73:2972–2979. doi: 10.1016/S0006-3495(97)78326-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Marszalek P E, Lu H, Li H B, Carrion-Vazquez M, Oberhauser A F, Schulten K, Fernandez J M. Nature (London) 1999;402:100–103. doi: 10.1038/47083. [DOI] [PubMed] [Google Scholar]
  • 13.Paci E, Karplus M. J Mol Biol. 1999;288:441–459. doi: 10.1006/jmbi.1999.2670. [DOI] [PubMed] [Google Scholar]
  • 14.Dellago C, Bolhuis P G, Csajka F S, Chandler D. J Chem Phys. 1998;108:1964–1977. [Google Scholar]
  • 15.Balsera M, Stepaniants S, Izrailev S, Oono Y, Schulten K. Biophys J. 1997;73:1281–1287. doi: 10.1016/S0006-3495(97)78161-X. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Gullingsrud J R, Braun R, Schulten K. J Comp Phys. 1999;151:190–211. [Google Scholar]
  • 17.Jarzynski C. Phys Rev Lett. 1997;78:2690–2693. [Google Scholar]
  • 18.Jarzynski C. Phys Rev E. 1997;56:5018–5035. [Google Scholar]
  • 19.Schuss Z. Theory and Applications of Stochastic Differential Equations. New York: Wiley; 1980. [Google Scholar]
  • 20.Crooks G E. Phys Rev E. 2000;61:2361–2366. [Google Scholar]
  • 21.Ferrenberg A M, Swendsen R H. Phys Rev Lett. 1989;63:1195–1198. doi: 10.1103/PhysRevLett.63.1195. [DOI] [PubMed] [Google Scholar]
  • 22.Torrie G M, Valleau J P. Chem Phys Lett. 1974;28:578–581. [Google Scholar]

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES