Skip to main content
The Journal of Chemical Physics logoLink to The Journal of Chemical Physics
. 2017 Nov 28;147(20):201102. doi: 10.1063/1.5006456

Communication: Coordinate-dependent diffusivity from single molecule trajectories

Alexander M Berezhkovskii 1, Dmitrii E Makarov 2,3,2,3,a)
PMCID: PMC5705243  PMID: 29195291

Abstract

Single-molecule observations of biomolecular folding are commonly interpreted using the model of one-dimensional diffusion along a reaction coordinate, with a coordinate-independent diffusion coefficient. Recent analysis, however, suggests that more general models are required to account for single-molecule measurements performed with high temporal resolution. Here, we consider one such generalization: a model where the diffusion coefficient can be an arbitrary function of the reaction coordinate. Assuming Brownian dynamics along this coordinate, we derive an exact expression for the coordinate-dependent diffusivity in terms of the splitting probability within an arbitrarily chosen interval and the mean transition path time between the interval boundaries. This formula can be used to estimate the effective diffusion coefficient along a reaction coordinate directly from single-molecule trajectories.


While single-molecule experiments are routinely used to observe large conformational changes in biomolecules (e.g., protein folding), recent studies achieve temporal and spatial resolution that permits observation of finer details of molecular trajectories.1–7 In single-molecule force spectroscopy, for example, the extension x(t) of a protein or DNA can be monitored with microsecond time resolution, enabling observation of fast conformational fluctuations. The information contained in a trajectory x(t) can be used to develop more accurate dynamical models that go beyond phenomenological theories commonly employed to describe transition rates.

One such phenomenological theory that has been particularly successful in interpreting numerous experimental observations is that of one-dimensional diffusion along a reaction coordinate x in the potential of mean force V(x). In single-molecule studies, the potential of mean force can be directly inferred from the equilibrium probability distribution of x,8–10

peq(x)=eβV(x), (1)

where β=(kBT)1, while the value of the coordinate-independent diffusivity D is usually estimated by fitting the transition rates or, more recently, other statistical characteristics of the trajectory x(t) such as the distribution of the transition path time,11 with the corresponding predictions of the one-dimensional diffusion model.

Despite the remarkable success of this model in describing many experimental observations,12 theoretical studies have highlighted its limitations.13–17 Recent experimental work has also brought possible shortcomings of this model into spotlight: although its predictions18,19 seem to fit well the observed distributions of transition path times for biomolecular folding,3 the barrier height estimates obtained from such fits were inconsistent with the measured potentials of mean force and transition rates,3 motivating several theoretical proposals to account for this discrepancy.20–22

Perhaps the most straightforward improvement of the one-dimensional diffusion model is to allow the diffusion coefficient to be a coordinate-dependent function D(x) (for a justification of the position dependence of the diffusivity, see the discussion in Ref. 23). Estimation of this function from molecular simulations has been the subject of considerable theoretical work24–31 but has rarely been applied to experimental data (Ref. 32 being the only exception that we know). The purpose of this note is to point out a simple analytical formula for D(x) that can be used to estimate of the coordinate-dependent diffusivity directly from a trajectory x(t) (or a collection of such trajectories). Moreover, this formula highlights the connection between the function D(x) and two key characteristics of stochastic dynamics, the so-called splitting probability33 (also known as committor probability or Pfold34–40) and the mean transition path time (also known as mean direct transit time41 or simply mean transit time). As both of these quantities have already been evaluated from single-molecule trajectories,3,42 obtaining the function D(x) from the same data should be straightforward (provided that statistical errors are manageable).

Assuming the validity of the one-dimensional diffusion model with a coordinate-dependent diffusivity, we wish to find the function D(x) for a<x<b, where the interval boundaries a and b can be chosen at will. We start with the known expression43 for the spitting probability Φ(xb|a) that the system, having started from x, will reach the boundary b before reaching a,

Φ(xb|a)=axD1(y)eβV(y)dyabD1(y)eβV(y)dy. (2)

The probability of reaching a before reaching b is, obviously, Φ(xa|b)=1Φ(xb|a). Differentiating Eq. (2) with respect to x, we obtain

dΦ(xb|a)dx=D1(x)eβV(x)abD1(y)eβV(y)dy. (3)

The integral appearing in the denominator of Eq. (3) can be estimated if we consider the mean transition path time from a to b. A transition path from a to b originates from a and ends in b, continuously residing within the interval a<x(t)<b in between these two events without touching the interval endpoints. We emphasize that the mean transition path time differs from (is shorter than) the mean first passage time from a to b.41 For one-dimensional diffusion, the mean transition path time can be written in terms of the splitting probability in the form proposed by Szabo,

tTPab=abD1(y)eβV(y)dyabdxΦ(xb|a)×1Φ(xb|a)eβV(x). (4)

One can find a detailed derivation of this formula in Ref. 44. The mean time tTP(ba) for a transition path that originates from b and ends in a is also given by this expression, since tTP(ba) and tTP(ab) are identical.45–47 Combining Eqs. (3) and (4) and using Eq. (1), we arrive at the central result of this note,

D(x)=abdzΦ(zb|a)1Φ(zb|a)peq(z)peq(x)dΦ(xb|a)/dxtTPab (5)

for any x within the region a<x<b.

Equation (5) differs from and is complementary to the formula for D(x) derived by Hinczewski et al.27 Computation of the diffusion coefficient in their approach requires evaluation of the mean round-trip time between two points. The mean round-trip time between arbitrary points x1 and x2 is defined as the sum of the mean first passage times from x1 to x2 and from x2 to x1. Evaluation of these mean times involves considering excursions of the trajectories outside the interval x1<x<x2. In contrast, Eq. (5) is independent of the dynamics outside the interval a<x<b.

Figure 1 illustrates the utility of Eq. (5) with three numerical examples. Two different potentials were considered, the symmetric potential βV(x)=(x21)2 (a) and the asymmetric potential βV(x)=(x21)2x/2 (b). Two different functions D(x) were considered for the symmetric potential [Fig. 1(a)], D(x)=1+sin(7x)exp(3x2)/2 and D(x)=1+e3x2/2 (the latter function being an even function of x). For the second potential, the position dependence of the diffusivity is given by D(x)=1+tanh(7x)/2exp[7(x+0.5)2]/2. The boundaries of the transition region were the same in all cases: b=a=1. The diffusivity dependence on x was reconstructed from a single numerically generated long Brownian trajectory, which contained 7000 - 15 000 transition paths from a to b or b to a. The derivative of the splitting probability was simply estimated using finite differences on an equally spaced grid of 30 - 50 points between a and b.

FIG. 1.

FIG. 1.

Reconstructing position-dependent diffusivity using Eq. (5). The top of each panel shows the model double well potential used to generate a Brownian dynamics trajectory in each case. The boundaries of the transition region [within which the dependence D(x) was estimated] are shown as dashed lines. The bottom of each panel shows numerically estimated diffusivity values within the transition region (markers) vs the exact diffusivity (solid lines). See text for further details.

To conclude, let us comment on several practical issues arising when Eq. (5) is applied to molecular trajectories. Although splitting probabilities and transition path times are most commonly employed to describe the dynamics of transitions between two basins of attraction (associated with distinct molecular conformations and corresponding to minima of the potential of mean force), no assumption is made here regarding the physical nature of the boundaries a and b or about the shape of the underlying potential of mean force V(x). Equation (5) can be applied to a potential of any shape and to arbitrary interval boundaries a and b. In fact, independence of the estimated D(x) of the interval may be used as a test for the internal consistency of the 1D diffusion model as a description of the system’s dynamics.

As seen from the above numerical examples, rather long trajectories were used, each including 104 transition paths. Using trajectories that are significantly shorter gives rise to much greater numerical noise in the estimated D(x). Since accurate estimation of the mean transition path time is readily accomplished from 102103 transition paths22 and since the numerator of Eq. (5) is a constant, the main source of noise is the numerical differentiation of the splitting probability in Eq. (5). Although it may be possible to reduce the noise by employing a higher order finite difference estimate for the derivative or by using a numerical interpolation/fit of the estimated splitting probability (similarly to the approach of Ref. 27), we anticipate that our method would be most useful for analyzing long experimental trajectories rather than simulation data. In the latter case, long uninterrupted trajectories often come at a premium, and approaches that utilize more efficient sampling of long-time dynamics (such as milestoning28) or employ Bayesian analysis29 may be advantageous.

Finally, we note that experimental kinetic artifacts arising, e.g., from finite time resolution potentially complicate the analysis of experimental data. Having attracted recent attention,48–50 such artifacts can be accounted for by introducing an additional degree of freedom describing the instrument itself.

Acknowledgments

We are grateful to Mauro Mugnai for a helpful discussion. A.M.B. was supported by the Intramural Research Program of the NIH, Center for Information Technology. D.E.M. was supported by the Robert A. Welch Foundation (Grant No. F-1514) and the National Science Foundation (Grant No. CHE 1566001).

REFERENCES

  • 1.Chung H. S., Louis J. M., and Eaton W. A., Science 335, 981–984 (2012). 10.1126/science.1215768 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Truex K., Chung H. S., Louis J. M., and Eaton W. A., Phys. Rev. Lett. 115(1), 018101 (2015). 10.1103/physrevlett.115.018101 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Neupane K., Foster D. A., Dee D. R., Yu H., Wang F., and Woodside M. T., Science 352(6282), 239–242 (2016). 10.1126/science.aad0637 [DOI] [PubMed] [Google Scholar]
  • 4.Yu H., Liu X., Neupane K., Gupta A. N., Brigley A. M., Solanki A., Sosova I., and Woodside M. T., Proc. Natl. Acad. Sci. U. S. A. 109(14), 5283–5288 (2012). 10.1073/pnas.1107736109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Edwards D. T., Faulk J. K., Sanders A. W., Bull M. S., Walder R., LeBlanc M. A., Sousa M. C., and Perkins T. T., Nano Lett. 15(10), 7091–7098 (2015). 10.1021/acs.nanolett.5b03166 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Yu H., Siewny M. G., Edwards D. T., Sanders A. W., and Perkins T. T., Science 355(6328), 945–950 (2017). 10.1126/science.aah7124 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Schuler B. and Hofmann H., Curr. Opin. Struct. Biol. 23(1), 36–47 (2013). 10.1016/j.sbi.2012.10.008 [DOI] [PubMed] [Google Scholar]
  • 8.Hummer G. and Szabo A., Acc. Chem. Res. 38(7), 504–513 (2005). 10.1021/ar040148d [DOI] [PubMed] [Google Scholar]
  • 9.Woodside M. T. and Block S. M., Annu. Rev. Biophys. 43, 19–39 (2014). 10.1146/annurev-biophys-051013-022754 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Hummer G. and Szabo A., Proc. Natl. Acad. Sci. U. S. A. 107, 21442–21446 (2010). 10.1073/pnas.1015661107 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Woodside M. T., Lambert J., and Beach K. S., Biophys. J. 107(7), 1647–1653 (2014). 10.1016/j.bpj.2014.08.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Neupane K., Manuel A. P., and Woodside M., Nat. Phys. 12, 700–703 (2016). 10.1038/nphys3677 [DOI] [Google Scholar]
  • 13.Hyeon C. and Thirumalai D., J. Am. Chem. Soc. 130(5), 1538–1539 (2008). 10.1021/ja0771641 [DOI] [PubMed] [Google Scholar]
  • 14.Morrison G., Hyeon C., Hinczewski M., and Thirumalai D., Phys. Rev. Lett. 106(13), 138102 (2011). 10.1103/physrevlett.106.138102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Suzuki Y. and Dudko O. K., Phys. Rev. Lett. 104(4), 048101 (2010). 10.1103/physrevlett.104.048101 [DOI] [PubMed] [Google Scholar]
  • 16.Kirmizialtin S., Huang L., and Makarov D. E., J. Chem. Phys. 122, 234915 (2005). 10.1063/1.1931659 [DOI] [PubMed] [Google Scholar]
  • 17.Cheng R. R. and Makarov D. E., J. Chem. Phys. 134(8), 085104 (2011). 10.1063/1.3556750 [DOI] [PubMed] [Google Scholar]
  • 18.Zhang B. W., Jasnow D., and Zuckerman D. M., J. Chem. Phys. 126, 074504 (2007). 10.1063/1.2434966 [DOI] [PubMed] [Google Scholar]
  • 19.Chaudhury S. and Makarov D. E., J. Chem. Phys. 133, 034118 (2010). 10.1063/1.3459058 [DOI] [PubMed] [Google Scholar]
  • 20.Pollak E., Phys. Chem. Chem. Phys. 18(41), 28872–28882 (2016). 10.1039/c6cp05052b [DOI] [PubMed] [Google Scholar]
  • 21.Makarov D. E., J. Chem. Phys. 146(7), 071101 (2017). 10.1063/1.4977177 [DOI] [PubMed] [Google Scholar]
  • 22.Satija R., Das A., and Makarov D. E., J. Chem. Phys. 147(15), 152707 (2017). 10.1063/1.4993228 [DOI] [PubMed] [Google Scholar]
  • 23.Berezhkovskii A. and Szabo A., J. Chem. Phys. 135(7), 074108 (2011). 10.1063/1.3626215 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Best R. B. and Hummer G., Proc. Natl. Acad. Sci. U. S. A. 102(19), 6732–6737 (2005). 10.1073/pnas.0408098102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Best R. B. and Hummer G., Proc. Natl. Acad. Sci. U. S. A. 107(3), 1088–1093 (2010). 10.1073/pnas.0910390107 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Best R. B. and Hummer G., Phys. Chem. Chem. Phys. 13(38), 16902–16911 (2011). 10.1039/c1cp21541h [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Hinczewski M., von Hansen Y., Dzubiella J., and Netz R. R., J. Chem. Phys. 132(24), 245103 (2010). 10.1063/1.3442716 [DOI] [PubMed] [Google Scholar]
  • 28.Mugnai M. L. and Elber R., J. Chem. Phys. 142(1), 014105 (2015). 10.1063/1.4904882 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Hummer G., New J. Phys. 7, 34 (2005). 10.1088/1367-2630/7/1/034 [DOI] [Google Scholar]
  • 30.Comer J., Chipot C., and Gonzalez-Nilo F. D., J. Chem. Theory Comput. 9(2), 876–882 (2013). 10.1021/ct300867e [DOI] [PubMed] [Google Scholar]
  • 31.Zhang Q., Brujic J., and Vanden-Eijnden E., J. Stat. Phys. 144(2), 344–366 (2011). 10.1007/s10955-011-0242-7 [DOI] [Google Scholar]
  • 32.Turkcan S., Alexandrou A., and Masson J. B., Biophys. J. 102(10), 2288–2298 (2012). 10.1016/j.bpj.2012.01.063 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Gardiner C. W., Handbook of Stochastic Methods for Physics, Chemistry and the Natural Sciences (Springer-Verlag, Berlin, 1983). [Google Scholar]
  • 34.Peters B., Bolhuis P. G., Mullen R. G., and Shea J. E., J. Chem. Phys. 138(5), 054106 (2013). 10.1063/1.4775807 [DOI] [PubMed] [Google Scholar]
  • 35.Hummer G., J. Chem. Phys. 120, 516–523 (2004). 10.1063/1.1630572 [DOI] [PubMed] [Google Scholar]
  • 36.E W. and Vanden-Eijnden E., Annu. Rev. Phys. Chem. 61, 391–420 (2010). 10.1146/annurev.physchem.040808.090412 [DOI] [PubMed] [Google Scholar]
  • 37.E W. and Vanden-Eijnden E., J. Stat. Phys. 123(3), 503–523 (2006). 10.1007/s10955-005-9003-9 [DOI] [Google Scholar]
  • 38.Vanden-Eijnden E., in Computer Simulations in Condensed Matter: From Materials to Chemical Biology, edited by Ferrario M. M., Ciccotti G., and Binder K. (Springer, 2006). [Google Scholar]
  • 39.Du R., Pande V. S., Grosberg A. Y., Tanaka T., and Shakhnovich E. S., J. Chem. Phys. 108, 334–350 (1998). 10.1063/1.475393 [DOI] [Google Scholar]
  • 40.Berezhkovskii A. M. and Szabo A., J. Phys. Chem. B 117(42), 13115–13119 (2013). 10.1021/jp403043a [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Berezhkovskii A. M., Dagdug L., and Bezrukov S. M., J. Phys. Chem. B 121(21), 5455–5460 (2017). 10.1021/acs.jpcb.7b04037 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Manuel A. P., Lambert J., and Woodside M. T., Proc. Natl. Acad. Sci. U. S. A. 112(23), 7183–7188 (2015). 10.1073/pnas.1419490112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Berezhkovskii A. M., Pustovoit M. A., and Bezrukov S. M., J. Chem. Phys. 116, 9952–9956 (2002). 10.1063/1.1475758 [DOI] [Google Scholar]
  • 44.Chung H. S. and Gopich I. V., Phys. Chem. Chem. Phys. 16(35), 18644–18657 (2014). 10.1039/c4cp02489c [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Berezhkovskii A. M., Hummer G., and Bezrukov S. M., Physical review letters 97(2), 020601 (2006). 10.1103/physrevlett.97.020601 [DOI] [PubMed] [Google Scholar]
  • 46.Makarov D. E., Single Molecule Science: Physical Principles and Models (CRC Press, Taylor & Francis Group, Boca Raton, 2015). [Google Scholar]
  • 47.Berezhkovskii A. M., Pustovoit M. A., and Bezrukov S. M., J. Chem. Phys. 119(7), 3943–3951 (2003). 10.1063/1.1590957 [DOI] [Google Scholar]
  • 48.Nam G. M. and Makarov D. E., Protein Sci. 25, 123 (2015). 10.1002/pro.2727 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Cossio P., Hummer G., and Szabo A., Proc. Natl. Acad. Sci. U. S. A. 112(46), 14248–14253 (2015). 10.1073/pnas.1519633112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Neupane K. and Woodside M. T., Biophys. J. 111(2), 283–286 (2016). 10.1016/j.bpj.2016.06.011 [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from The Journal of Chemical Physics are provided here courtesy of American Institute of Physics

RESOURCES