Skip to main content
The Journal of Chemical Physics logoLink to The Journal of Chemical Physics
. 2019 Feb 6;150(5):054106. doi: 10.1063/1.5079742

Committors, first-passage times, fluxes, Markov states, milestones, and all that

Alexander M Berezhkovskii 1,a), Attila Szabo 2
PMCID: PMC6910584  PMID: 30736684

Abstract

Milestoning on a one-dimensional potential starts by choosing a set of points, called milestones, and initiating short trajectories from each milestone, which are terminated when they reach an adjacent milestone for the first time. From the average duration of these trajectories and the probabilities of where they terminate, a rate matrix can be constructed and then used to calculate the mean first-passage time (MFPT) between any two milestones. All these MFPT’s turn out to be exact. Here we adopt a point of view from which this remarkable result is not unexpected. In addition, we clarify the nature of the “states” whose interconversion is described by the rate matrix constructed using information obtained from short trajectories and provide a microscopic expression for the “equilibrium population” of these states in terms of equilibrium averages of the committors.

I. INTRODUCTION

Many recent advances in the theory of rare events were not made algebraically but rather by analyzing the behavior of trajectories. Both transition path theory1–3 and milestoning4–6 can be regarded as part of the emerging field of “statistical mechanics of trajectories.” This paper grew out of our efforts to understand Elber’s milestoning in the simplest context.

Operationally, Elber’s milestoning for one-dimensional diffusive dynamics on a potential can be formulated as follows: Choose an arbitrary set of points labeled by the index i, i = 1, 2, ..., N. Point i is called milestone i. Run Brownian dynamics trajectories starting from milestone i and terminate them when they reach either milestone i − 1 or milestone i + 1 for the first time. The average duration of these trajectories, denoted by ti, is by construction the mean first-passage time (MFPT) from milestone i to one of the two milestones, i − 1 and i + 1. In addition, the fraction of trajectories terminated at milestone i + 1 is determined. This is the splitting probability or committor denoted by φ(ii + 1). Clearly, φii1=1φii+1. This information is then used to construct an N × N rate matrix R that describes the interconversion of yet unspecified “states” I, I = 1, 2, ..., N. This is a three-diagonal matrix with off-diagonal matrix elements RI  I±1=φii±1/ti and diagonal elements RI  I=1/ti. This guarantees that the mean lifetime of state I, given by RII+1+RII11, is ti and the probability to go from state I to state I + 1, given by RII+1/RII+1+RII1, is φii+1, where ti and φii+1 were obtained by running many short trajectories. It is important to emphasize that the distributions of lifetimes of states I are single-exponential, whereas the exact distributions of the MFPT’s in general are not. Nevertheless, the MFPT between any two states obtained by using the rate matrix R turns out to be exact! The importance of this result is that the exact MFPT can be obtained, for example, for two milestones separated by a high barrier. To directly obtain this MFPT by running Brownian dynamics simulations would take virtually forever. On the other hand, using milestoning, all one has to do is run many relatively short trajectories starting from milestones.

Thus, milestoning maps the diffusive dynamics onto a nearest-neighbor continuous-time Markov state model. The fact that the MFPT’s between the states, obtained using the rate matrix, are identical to the MFPT’s between the corresponding milestones calculated using the Smoluchowski equation was initially surprising to us. One goal of this paper is to find a way of thinking that makes this, at first sight, remarkable result not unexpected. To do this, we need to understand the nature of the “states” I whose interconversion is described by the rate matrix R, and the meaning of their “equilibrium populations” obtained from the eigenvector of this rate matrix corresponding to zero eigenvalue.

The outline of this paper is as follows: In Sec. II, we consider just two milestones. This may seem too trivial to provide any insight because the MFPT’s of the two-state model are exact by construction. However, it turns out to be the simplest context where one can understand what the states are and what their equilibrium populations mean. In Sec. III, we consider three milestones, which is the simplest case that involves splitting probabilities. Some concluding remarks are presented in Sec. IV. In the Appendix, we algebraically derive some identities that have been obtained in Sec. II by analyzing a long equilibrium trajectory.

II. TWO MILESTONES

We begin by analyzing the simplest possible example of milestoning. Consider a particle diffusing in a bounded one-dimensional potential U(x), U(x±)=. Choose two arbitrary points (milestones), x = a, and x = b, a < b. The average duration of trajectories, initiated at point a and terminated at point b, is the mean first-passage time (MFPT) from a to b, τ(ab). Similarly, the MFPT τ(ba) can be obtained by starting trajectories from point b and terminating them when they reach point a.

If points a and b were located at the minima of two deep wells, A and B, of a bistable potential, separated by a high barrier, then the distributions of both first-passage times would be very nearly exponential. In this case, it would make sense to describe the dynamics of the system in terms of Markovian transitions between the two wells, i.e., by a two-state model

AkABkBAB, (2.1)

where the rate constants for inter-well transitions are given by

kAB=1τab,   kBA=1τba. (2.2)

If the two points are arbitrary, then the distributions of the first-passage times are no longer exponential. Nevertheless, one can formally adopt the above two-state description, i.e., use the MFPT’s and build a two-state model as discussed above. At least the mean lifetimes of the two states, A and B, kAB1 and kBA1, will be correct. The equilibrium populations, πA and πB, of these states are

πA=kBAkAB+kBA=τabτab+τba,   πB=1πA. (2.3)

The equilibrium unidirectional flux between them, i.e., the number of A-to-B (or B-to-A) transitions per unit time at equilibrium, JAB=JAB=JBA, is

JAB=πAkAB=πBkBA. (2.4)

For future reference, note that the rate constants in this model can be expressed as “flux-over-population,” i.e., kAB=JAB/πA, kBA=JAB/πB. Substituting in Eq. (2.4) the rate constants and equilibrium populations given in Eqs. (2.2) and (2.3), we find that

JAB=1τab+τba. (2.5)

The sum of the MFPT’s in the denominator has been called the mean round-trip time between points a and b.7 Thus the equilibrium unidirectional flux is the inverse mean round-trip time. However, it is not at all obvious what the two “states,” A and B, associated with milestones a and b are, what the equilibrium populations of these states mean, and what the unidirectional flux corresponds to.

A. Coloring an equilibrium trajectory

To answer these questions, consider a long equilibrium trajectory, x(t), of duration T, T → ∞. One way of obtaining the first passage times between points a and b is to use the trajectory coloring procedure introduced by Vanden-Eijnden and Venturoli8 (see also Ref. 3). Let us color the trajectory either red or blue using the following rules: the trajectory changes color from red to blue when it comes to point b from point a, and from blue to red when it comes to point a from point b, as show in the upper panel of Fig. 1. This is similar to the procedure used by Buchete and Hummer9 to define states when constructing Markov state models. In addition, a referee pointed out that it is closely related to transition interface sampling.10,11 Let Ta (Tb) be the total time the trajectory is red (blue) so that Ta + Tb = T. Let Nab be the number of times the color of the trajectory changes from red-to-blue or equivalently from blue-to-red during time T. The MFPT’s τab and τba are the average durations of the red and blue segments,

τab=TaNab,   τba=TbNab. (2.6)

Thus the mean lifetimes of the red and blue segments, τab and τba, are equal to the mean lifetimes of states A and B, kAB1 and kBA1, respectively.

FIG. 1.

FIG. 1.

Upper panel: A long equilibrium trajectory generated by Brownian dynamics simulations that is colored red and blue. The color changes from red to blue when the trajectory from a reaches b for the first time and remains blue until it reaches a for the first time whereupon it turns red. The color of the trajectory depends on the last milestone it touched: the color is red (blue), if the last touched milestone is a (b). Lower panel: The colored trajectory x(t) is converted into a two-state trajectory where the lifetime of state A (B) is equal to the time during which the trajectory x(t) is red (blue).

The probability that the trajectory is red, πa, is the fraction Ta/T=Ta/Ta+Tb, while the probability that the trajectory is blue, πb, is the fraction Tb/T=Tb/Ta+Tb. Using the relations between the MFPT’s and times Ta and Tb, Eq. (2.6), these can be written as

πa=TaTa+Tb=τabτab+τba,   πb=1πa. (2.7)

These are identical to the equilibrium populations in Eq. (2.3) obtained using the two-state model with rate constants given in Eq. (2.2). Therefore, state A (B) associated with milestone a (b) corresponds to red (blue) trajectory segments, as shown in the lower panel of Fig. 1. Note that states A and B are not thermodynamic states. The reason is that the trajectory can be both red and blue in the region between the milestones, a<x<b. The color is red (blue) if the last milestone touched by the trajectory before coming to point x is point a (b). Therefore, when a<x<b, the trajectory can be sometimes red (state A) and sometimes blue (state B) (see Fig. 1), while in thermodynamics any spatial point can belong to only one state. Thus, states A and B associated with the milestones are states of the trajectory rather than thermodynamic states.

The unidirectional flux between points a and b at equilibrium, Jab=Jab=Jba, by definition, is

Jab=NabT=NabTa+Tb. (2.8)

Using Eq. (2.6), this flux can be written in terms of the MFPT’s τab and τba as

Jab=1τab+τba. (2.9)

This is identical to the flux JAB, Eq. (2.5), obtained in the framework of the two-state model. Thus, the equilibrium unidirectional flux between points a and b is the same as with the number of transitions from trajectory state A to state B (and vice versa) per unit time, at equilibrium, Jab=JAB.

To summarize, using the trajectory coloring procedure, we found that states A and B, associated with milestones a and b, indicate the color of the trajectory (red in state A and blue in state B). The equilibrium populations of these states, πA and πB, are the fractions of time when the trajectory is red and blue, πa and πb, respectively. Finally, the equilibrium unidirectional flux between these states, JAB, is the equilibrium unidirectional flux, Jab, between the two milestones.

The two-state model using the rate constants defined as the reciprocals of the MFPT’s is remarkably successful in describing transitions between the trajectory segments of different color. While the MFPT’s calculated from the two-state model are exact by construction, this model gives the exact relation between the unidirectional flux at equilibrium and the MFPT’s, Eq. (2.9). Although at first glance this might seem surprising, there is a point of view from which it is not unexpected. The trajectory corresponding to the two-state model can be exactly generated using the Gillespie algorithm, where the lifetimes of states A and B are chosen from the exponential distributions. For example, for state A, the distribution is kABexpkABt=expt/τab/τab. While these exponential distributions differ from the exact lifetime distributions of the red and blue trajectory segments, the corresponding lifetimes are equal on average (the mean lifetimes are the same by construction). Thus, on average, the two-state trajectory is equivalent to the red-blue trajectory obtained by coloring the initial trajectory shown in the lower panel of Fig. 1. Therefore, the average number of unidirectional transitions per unit time calculated from the two-state trajectory must be the same as that obtained from the colored trajectory. As we shall see in Sec. III, this is the simplest way of understanding why a three-state model exactly gives all the MFPT’s and equilibrium unidirectional fluxes for three milestones.

B. Microscopic interpretation of equilibrium populations of states A and B

Next, we discuss a microscopic interpretation of the equilibrium populations of states A and B. According to the coloring rules, the trajectory is always red when xa and blue when xb. In the intermediate region, a<x<b, the color of the trajectory depends on the prehistory. The trajectory is red, if it entered the intermediate region through point a, and blue, if it entered through point b. Because of detailed balance or time-reversal symmetry, the probability that the trajectory reaches x coming from a is equal to the probability that starting form x, the trajectory reaches a before touching b. This probability, denoted by φxa, is referred to as the committor or splitting probability. Similarly, the probability that the trajectory reaches x having entered the intermediate region from b is equal to the probability that starting form x, the trajectory reaches b before touching a. This probability, denoted by φ(xb), is φ(xb) = 1 − φ(xa).

The probability that the trajectory is colored red (state A) at an arbitrary point x, denoted by φa(x), is

φax=1,  xaφxa,  axb0,  bx. (2.10a)

The probability that the trajectory at point x is colored blue (state B), denoted by φbx, is

φbx=0,  xaφxb,  axb1,  bx. (2.10b)

Since we are dealing with an equilibrium trajectory, the probability of finding the trajectory at point x is given by the Boltzmann distribution, expβUx/Z, where Z=expβUxdx with β=1/kBT, where kB and T denote the Boltzmann constant and the absolute temperature. Therefore, the equilibrium probability densities that an arbitrary point x is red or blue are φixexpβUx/Z, i = a, b. As a consequence, the equilibrium probability of finding the trajectory red (blue) is the Boltzmann average of φax (φbx),

πi=φi=φixeβUxdxeβUxdx,   i=a,b. (2.11)

This probability is also the equilibrium population πi of state I,I=A,B. Thus, the equilibrium population of state I is the Boltzmann average of the probability φi(x),πI=φi.

C. MFPT’s, rate constants, and equilibrium unidirectional flux

The MFPT’s between points a and b, given in Eq. (2.6), can be written in terms of equilibrium populations, Eqs. (2.7) and (2.11), and the equilibrium unidirectional flux, Eq. (2.8),

τab=TaNab=TaTa+TbTa+TbNab=φaJab,  τba=φbJab. (2.12)

These are exact identities which are verified in the Appendix using analytical expressions obtained in the framework of the theory of first-passage processes. It should be pointed out that the mean transition path or direct-transit time between points a and b, denoted by τ(ab), can be written2 as τ(ab)=φaφb/Jab.

Using the above results for the MFPT’s in Eq. (2.2), the rate constants for transitions between states A and B, defined by the color of the trajectory, can be written as

kAB=Jabφa,   kBA=Jabφb. (2.13)

These expressions have been previously obtained by Vanden-Eijnden and Venturoli.8 The relation between our notation and theirs is JνR (unidirectional flux at equilibrium), φa(x)q(x) (the probability that the trajectory at point x is colored red), and φa=ρa (the equilibrium probability that the trajectory is red).

D. Relation to Kramers theory

When the milestones a and b happen to be at the minima of two deep wells separated by a high barrier, the above expressions for the rate constants reduce to those of the Kramers theory. Kramers used flux divided by thermodynamic well population as the definition the rate constant.12 In the case of two deep wells, the probability φax is unity not only when xa but also is essentially unity for a wider range of x, a<x<x*, where x* is a point between point a and the barrier top, chosen so that the potential energy U(x*) is several kBT’s above the potential energy at the well bottom, U(a). Therefore, φa is essentially x*eβUxdx/eβUxdx. The value of the integral in the numerator is determined by the potential near the well bottom at x = a and is insensitive to the location of point x* as long as it lies sufficiently far from point a (the energy difference U(x*)U(a) exceeds several kBT’s). Consequently, the equilibrium population of non-thermodynamic state A, φa, is equal to the equilibrium thermodynamic population of well A. This happens because the interval, where points may have both colors, is located in the barrier region. When the barrier is high, the equilibrium population in this interval is vanishingly small.

As shown in the Appendix, the equilibrium unidirectional flux between points a and b is given by Jab=eβUxdxabeβUxdx/D(x)1, where D(x) is the position-dependent diffusivity. Substituting Jab and φa above into Eq. (2.13), we arrive at

kAB=1x*eβUxdxabeβUxdx/D(x). (2.14)

While the value of the first integral in the denominator is determined by the behavior of the potential near the A-well bottom, the value of the second integral is determined by the behavior of the potential near the barrier top. The value of the latter integral is insensitive to the integration limits when the potential energies U(a) and U(b) are several kBT’s below the barrier top. Assuming that U(x) is quadratic near the A-well bottom and the barrier top, and the diffusivity is position-independent, we replace U(x) in the integrands by the corresponding quadratic approximations, then let x*, a, and b, and perform the integrations. In this way, we recover the celebrated Kramers formula for the rate constant for diffusive barrier crossing (high-friction regime).

III. THREE MILESTONES

We now consider the much more interesting case of three milestones located at points x = a, x = b, and x = c, a<b<c. By starting trajectories at points a or c and terminating them when they reach point b, we can find the MFPT’s τab and τcb as before. The milestone located at point b is different because it has milestones on both sides. Here we initiate trajectories at point b and terminate them when they reach either point a or point c. In this way, we determine the MFPT, τabc, required to reach points a or c for the first time starting from point b. In addition, we determine the committor or splitting probability that a trajectory starting at point b reaches point a before point c, denoted by φba. Clearly, φbc=1φba.

We use these MFPT’s and splitting probabilities to build a three-state model of the dynamics

AkABkBABkBCkCBc. (3.1)

Here A, B, and C are yet unspecified “states” associated with milestones a, b, and c. The transitions between them are described by the rate constants defined as

kAB=1τab,   kBA=φ(ba)τabc,kCB=1τcb,   kBC=φ(bc)τabc. (3.2)

The rate constants kBA and kBC are defined so as to ensure that (1) the splitting probabilities for BA and BC transitions, kBA/kBA+kBC and kBC/kBA+kBC, are φba and φbc, and (2) the mean lifetime of state B, kBA+kBC1, is tB=τabc. The mean lifetimes of states A and C, kAB1 and kCB1, respectively, are tA=τab and tC=τcb. The time evolution of the probability of finding the system in state I, pI(t), I = A, B, C, is given by

dpdt=Rp, (3.3)

where p is the column vector, (pA, pB, pC), and the rate matrix, R, is

R=kABkBA0kABkBA+kBCkCB0kBCkCB. (3.4)

The normalized equilibrium populations of the three states, πA, πB, and πC, πA+πB+πC=1, are solutions of Rπ=0. Solving this equation and replacing the rate constants by the MFPT’s and splitting probabilities, using Eq. (3.2), we find

πA=φbaτabΔ,πB=τabcΔ,πA=φbcτcbΔ, (3.5)

where Δ is

Δ=φbaτab+τabc+φbcτcb. (3.6)

Given the rate matrix, R, we can calculate the MFPT’s between any two states by solving

RTτ=1 (3.7)

subject to the appropriate boundary condition (i.e., if we are interested in the MFPT’s to state I, then τII=0). In this equation, which is the discrete analog of Eq. (A6) in the Appendix, RT is the transpose of matrix R, τ is a column vector of the MFPT’s, and 1 is a column vector with components equal to unity. Solving Eq. (3.7) and using Eq. (3.2), we can express the MFPT’s between any two states in terms of the MFPT’s and splitting probabilities obtained from the simulations. For example, one can show that τAB=τab, as to be expected, and more interestingly, that

τBC=φbaτab+τabcφbc. (3.8)

The MFPT between states A and C is given by τAC=τAB+τBC.

Since matrix RT can be expressed as

RT=tA1000tB1000tC1110φba1φbc011, (3.9)

we can write Eq. (3.7) in the form5

IΦτ=t. (3.10)

Here t is the vector of mean lifetimes, and Φ is the matrix of the splitting probabilities,

Φ=010φba0φbc010. (3.11)

Equations (3.7) and (3.10) must be solved subject to the same boundary conditions. Alternatively, matrix Φ can be redefined so as to explicitly incorporate these boundary conditions.

A. Coloring an equilibrium trajectory

As in the case of two milestones, we are interested in determining the nature of the “states” associated with the milestones and microscopic meaning of their equilibrium populations, πA, πB, and πC, that are normalized solutions of Rπ=0. To this end, as before, consider how the MFPT’s can be obtained from a long equilibrium trajectory x(t) of duration T, T, by coloring it in red, blue, and orange, using the rules proposed by Vanden-Eijnden and Venturoli.8 The trajectory changes its color from red to blue when it comes to point b from point a and from orange to blue when it comes to this point from point c. Then the trajectory remains blue until it touches point a or c for the first time, when it turns red or orange, respectively, as shown in the upper panel of Fig. 2.

FIG. 2.

FIG. 2.

The same as Fig. 1 but for three milestones. Upper panel: The trajectory x(t) colored red, blue, and orange. Blue changes to orange (red) when the trajectory x(t) reaches point c (a) for the first time. When b<x<c (a<x<b), the trajectory can be either blue or orange (blue or red). The color of the trajectory depends on the last milestone it touched: the color is red (blue or orange), if the last touched milestone is a (b or c). Lower panes: A three-state trajectory corresponding to the colored trajectory x(t).

Denote the total times when the trajectory is red, blue, and orange by Ta, Tb, and Tc, and the numbers of times the trajectory changed color from red to blue and from blue to orange and vice versa by Na b and Nb c, respectively. The MFPT’s τab, τabc, and τcb are the average durations of the red, blue, and orange segments,

τab=TaNab,τabc=TbNab+Nbc,τcb=TcNbc. (3.12)

The splitting probability φba (φbc) is the number of times a blue segment turned red (orange), Na b (Nb c), divided by the total number of times it changed color,

φba=NabNab+Nbc,   φbc=NbcNab+Nbc. (3.13)

Using the relations in Eqs. (3.12) and (3.13), we can write the total observation time as

T=Ta+Tb+Tc==Nabτab+Nab+Nbcτabc+Nbcτcb=Nab+NbcΔ, (3.14)

where Δ is given in Eq. (3.6).

B. Equilibrium populations and MFPT’s

The relations in Eqs. (3.12) and (3.13) link the MFPT’s and splitting probabilities obtained from the short simulations initiated from the milestones with information obtained from a long trajectory. We use this to find the fractions of time when the trajectory is red, blue, and orange, πi=Ti/T,i=a,b,c, and show that these fractions are equal to the equilibrium populations of states I, πI, I=A,B,C, obtained from the rate matrix, R, i.e., πi=πI. For example, the fraction of time when the trajectory is red, πa=Ta/T, can be recast as

πa=TaT=NabτabNab+NbcΔ, (3.15)

where we have used Eq. (3.12) for Ta and Eq. (3.14) for T. Finally, using Eq. (3.13), we obtain

πa=TaT=φbaτabΔ. (3.16)

Comparing this with πA in Eq. (3.5), we see that πa=πA. Similarly, it can be shown that πb=Tb/T=πB and πc=Tc/T=πC.

In summary, the mean durations of the red, blue, and orange segments of the trajectory are equal to the mean lifetimes of states A, B, and C of the three-state model. In addition, the fractions of time when the trajectory is red, blue, and orange are equal to the equilibrium populations, πA, πB, and πC, of the three states. Consequently, states A, B, and C of the three-state model, associated with the milestones, correspond to red, blue, and orange segments of the long trajectory, respectively.

Times, Ta, Tb, and Tc, and the numbers of color changes, Na b and Nb c, obtained from the long trajectory can be used to find the exact MFPT’s between any two milestones, not just those in Eq. (3.12). For example, suppose we are interested in the MFPT between milestones b and c, τbc. If the color of the red trajectory fragments is changed to blue, then we are faced with a two (b and c) milestone problem where the duration of the blue trajectories is now Ta + Tb, and so

τbc=Ta+TbNbc. (3.17)

Now we use Eq. (3.12) to write Ta and Tb in terms of the MFPT’s, i.e., Ta=Nabτab and Tb=Nab+Nbcτabc. Substituting this into Eq. (3.17) and using the definitions of the splitting probabilities φba and φbc in terms of the numbers of color changes in Eq. (3.13), we arrive at

τbc=φbaφbcτab+1φbcτabc. (3.18)

Comparing this with τBC in Eq. (3.8), obtained using the three-state model, we see that τbc=τBC. Thus, the three-state model predicts the exact MFPT between points b and c even though this MFPT was not used to determine any of the rate constants. In a similar way, one can show that all possible MFPT’s predicted by the three-state model are exact.

How can we understand this remarkable success of the three-state model? Any quantity that can be calculated using the rate matrix R can also be obtained from a long equilibrium trajectory in the space of states A, B, and C generated using the Gillespie algorithm. All input parameters for this algorithm (i.e., the lifetimes of the states and splitting probabilities) can be obtained from the rate matrix R. Let us compare such a trajectory with the exact one shown in the lower panel of Fig. 2. The most dramatic difference is in the distributions of the state lifetimes and those of the durations of the colored segments. In the trajectory generated using the Gillespie algorithm, the distributions of the state lifetimes are single-exponential, while the distributions of the durations of the colored segments are in general not. However, the mean lifetimes and the splitting probabilities in both trajectories are the same because the Gillespie algorithm used the exact MFPT’s and splitting probabilities as input. Therefore, any average quantity (e.g., the MFPT’s) calculated using the rate matrix must be exact.

C. Microscopic interpretation of “equilibrium populations”

We will now express the equilibrium populations given by the three-state model, πI, (which are identical to the fractions of time, πi, when the trajectory is red, blue, and orange) in terms of the Boltzmann equilibrium averages of the committors. Let φix be the probabilities and pix be the probability densities that the trajectory at point x has the color associated with the milestone i, i = a, b, c. Just as in the two-milestone case, pi(x), are given by the products of the Boltzmann distribution and the probabilities φi(x), pix=φixexpβUx/Z. The probabilities φi(x) are

φax=1,xaφxa,axb0,bx, (3.19)
φbx=0,xaφxb,axc0,cx, (3.20)

and

φcx=0,xbφxc,bxc1,cx. (3.21)

Here φxa (φxc) is the probability of reaching point a (c) starting from point x, axb (bxc), before reaching point b, and φxb=1φxa, for axb, and φxb=1φxc, for bxc. One can see that the probabilities φix are normalized at every point x, φax+φbx+φcx=1. The equilibrium populations of states I, πI, I = A, B, C, given in terms of the Boltzmann equilibrium averages of the committors φi(x), are

πI=πi=pixdx=φix,I=A,B,C,i=a,b,c. (3.22)

D. Unidirectional fluxes and rate constants

It should be no surprise by now that unidirectional fluxes at equilibrium (i.e., the numbers of transitions per unit time) between states A and B, JAB=JAB=JBA, and between states B and C, JBC=JBC=JCB, calculated in the framework of the three-state model, are the same as those between the points a and b, Jab=Jab=Jba, and points b and c, Jbc=Jbc=Jcb, calculated from the long trajectory. To see this, consider the equilibrium unidirectional flux JBC,

JBC=πBkBC=πbφbcτabc, (3.23)

where we have used the fact that πB=πb and the expression for kBC given in Eq. (3.2). The equilibrium unidirectional flux Jbc between milestones b and c, by definition, is

Jbc=NbcT=TbTNbcNab+NbcNab+NbcTb=πbφbcτabc, (3.24)

where we have used the relations in Eqs. (3.12) and (3.13). Thus JBC=Jbc and similarly, JAB=Jab. Because of this and since πI=πi=φi, all the rate constants in Eq. (3.2) can be written as “flux-over-population”

kAB=Jabφa,   kBA=Jabφb,kCB=Jbcφc,   kBC=Jbcφb, (3.25)

just as in the two-milestone case [see Eq. (2.13)].

The mean lifetime of state B, tB, or the mean duration of the blue trajectory segment, tb, is tB=tb=kBA+kBC1=τabc. Using the expressions for the rate constants in Eq. (3.25), we obtain the following relation among the MFPT, τabc, the Boltzmann averaged committor, φb, and the equilibrium unidirectional fluxes:

τabc=φbJab+Jbc, (3.26)

which is to be compared with the results in Eq. (2.12) for τab and τba in the case of two milestones. The sum of fluxes, Jab+Jbc, is denoted by qb in the milestoning literature.6 The identity in Eq. (3.26) can be proved the old-fashioned way, algebraically, using the formalism given in the Appendix: τax0c is the solution of Eq. (A6), in which τx0b should be replaced by τax0c, with boundary conditions τaac=τacc=0, while φxb is the solution of Eq. (A11), in which φxa should be replaced by φxb, with boundary conditions φbb=1 and φab=φcb=0.

IV. CONCLUDING REMARKS

The above analysis can be readily generalized to an arbitrary number of milestones in one dimension. Thus, by running short trajectories starting from one milestone and stopping at an adjacent milestone, one can calculate the exact MFPT between any two milestones no matter how far apart they are. The importance of this result is that one can obtain very long MFPT’s, say, between two milestones separated by a high barrier, that would be virtually impossible to simulate directly. In simplest terms, the reason for this remarkable result is that a long trajectory generated by the Gillespie algorithm with parameters obtained from the rate matrix is “on the average” equivalent to the exact trajectory (i.e., the lower panels in Figs. 1 and 2). By “on the average,” we mean that while the distribution of the lifetimes of the various states are different, the average lifetimes as well as other average properties such as equilibrium populations, unidirectional fluxes, and MFPT’s will be the same.

How can this be generalized to multidimensional diffusive dynamics? In this case, milestones are no longer points but rather surfaces, and the first-passage times between the milestones depend on the location of the starting points on the surface. If we could choose the distribution of starting points in such a way that the MFPT’s obtained by running short trajectories would be identical to those obtained by coloring a long equilibrium trajectory, then all our arguments would immediately carry over. This distribution is related to the probability that the equilibrium trajectory crosses or hits a particular point on the surface (for an algebraic derivation, see the end of the Appendix). How this distribution can be found iteratively by running only short trajectories for non-diffusive dynamics is the focus of an extensive literature that can be accessed through some recent references13–15 and earlier ones therein. We hope that our work will help to reduce the “activation barrier” encountered in trying to understand these papers.

Finally, we would like to mention that once we did something16 in the context of solute transport through a membrane channel, which in retrospect was closely related to milestoning. We considered two models of solute transport through a membrane channel, one, where a solute diffuses through the channel in the presence of a potential of mean force, and the other, where the solute simply jumps between two sites located at the channel ends. We showed that the fluxes calculated within the framework of these models are identical when the jump rates between the two states are chosen as the reciprocal of the MFPT’s to diffuse from one end of the channel to the other. In this way, we provided a microscopic interpretation of the phenomenological rate constants of the widely used two-site model of the solute dynamics in the channel.

ACKNOWLEDGMENTS

We have benefitted from discussions with Ron Elber, Gerhard Hummer, Eric Vanden-Eijnden, and Vladimir Zitserman. We thank Robert Best for help with the manuscript. This study was supported by the Intramural Research Program of the NIH, Center for Information Technology and National Institute of Diabetes and Digestive and Kidney Diseases.

APPENDIX: ALGEBRAIC DERIVATION OF THE RELATION AMONG THE MFPT, FLUX, AND COMMITTOR

Here we will algebraically derive the relation among the MFPT, τab, the equilibrium-averaged committor, φa, and the equilibrium unidirectional flux, Jab.

Consider a particle diffusing in a potential U(x), U(x±)=, with position-dependent diffusivity, D(x). The probability density of finding the particle at point x at time t, given that it was initially (at t = 0) at point x0, px,t|x0,0, satisfies the Smoluchowski equation

pt=Lxp=xD(x)eβU(x)xeβU(x)p (A1)

subject to the initial condition px,0|x0,0=δxx0. It follows from the detailed balance condition that px,t|x0,0=px0,t|x,0eβU(x0)U(x). Substituting this into Eq. (A1) and interchanging the notations of the variables xx0, one finds that px,t|x0,0 satisfies

pt=Lx0+p=eβU(x0)x0D(x0)eβU(x0)px0, (A2)

which is the adjoint or backward Smoluchowski equation.

Let us now choose point x = b to be an absorbing boundary, i.e., pb,t|x0,0=0. The survival probability of a particle starting from x0, x0 < b, at t = 0, denoted by St|x0, is defined as

St|x0=bpx,t|x0,0dx. (A3)

Integrating both sides of Eq. (A2) over x from −∞ to b, we find that the survival probability satisfies

St|x0t=Lx0+St|x0=eβU(x0)x0D(x0)eβU(x0)St|x0x0 (A4)

subject to the initial condition S0|x0=1. The mean particle lifetime is the MFPT from point x0 to point b, τx0b,

τx0b=0tdSt|x0dtdt=0St|x0dt. (A5)

Integrating both sides of Eq. (A4) over time from 0 to ∞ and using the facts that S|x0=0 and S0|x0=1, we find that the MFPT satisfies

Lx0+τx0b=eβU(x0)x0D(x0)eβU(x0)τx0bx0=1. (A6)

This must be solved subject to the boundary condition τbb=0.

Multiplying both sides of Eq. (A6) by expβUx0, integrating over x0 from −∞ to x, where x<b, and using the fact that U()=, we obtain

D(x)eβU(x)τ(xb)x=xeβU(y)dy. (A7)

Dividing both sides by D(x)expβU(x), integrating the resulting equation from x to b, and using the fact that τbb=0, we arrive at

τ(xb)=xbeβU(z)dzD(z)zeβU(y)dy. (A8a)

Similarly, for the MFPT from x to a, xa, we have

τ(xa)=axeβU(z)dzD(z)zeβU(y)dy. (A8b)

According to Eq. (2.5), the equilibrium unidirectional flux Jab between milestones a and b is Jab=τab+τba1. Using Eqs. (A8a) and (A8b), it can be shown that this flux is given by

Jab=1τab+τba=1abeβU(z)dzD(z)eβU(y)dy. (A9)

This result can also be obtained from

Jab=D(x)eβU(x)xeβU(x)p(x),  axb, (A10)

where the probability density p(x) at the end points is p(a)=expβU(a)/expβU(x)dx and p(b) = 0, i.e., milestone b is absorbing and point a is kept at equilibrium. Dividing both sides of Eq. (A10) by D(x)expβU(x) and then integrating both sides of the resulting equation from a to b, we recover the result for the flux in Eq. (A9).

The committor (splitting probability) φ(xa), axb, satisfies the Onsager equation, Lx+φ(xa)=0, subject to the boundary conditions φ(aa)=1 and φ(ba)=0. Using Lx+, defined in Eq. (A2), we can write the Onsager equation as

ddxD(x)eβU(x)dφ(xa)dx=0. (A11)

Consequently, we have

D(x)eβU(x)dφ(xa)dx=C, (A12)

where C is a constant to be determined using the boundary conditions at x = a and x = b. Dividing both sides by D(x)expβU(x), integrating both sides of the resulting equation from x to b, and using the boundary condition at point b, φ(ba)=0, we arrive at

φ(xa)=CxbeβU(z)dzD(z). (A13)

Using the boundary condition at point a, φ(aa)=1, to find C, we finally obtain

φ(xa)=xbeβU(z)dzD(z)abeβU(z)dzD(z),   axb. (A14)

Now we are ready to calculate φa, where φa(x) is given in Eq. (2.10a),

φa=aeβU(x)dx+abφ(xa)eβU(x)dxeβU(y)dy=JababeβU(z)dzD(z)aeβU(x)dx+abxbeβU(z)dzD(z)eβU(x)dx. (A15)

In the last step, we used Eq. (A9) for Jab and Eq. (A14) for φ(xa). The sum in the square brackets is τab. To see this, one has to change the order of integration in the second term in the square brackets, write the sum of integrals as a single double integral, and then compare with τab obtained from Eq. (A8a). Thus, we have derived the result for τab in Eq. (2.12), τab=φa/Jab.

This can be done simpler in a way that can be readily generalized to many dimensions. Multiplying both sides of Eq. (A6) by φa(x0)expβU(x0), where φa(x) is given in Eq. (2.10a), and integrating both sides over all x0, one has

φa(x0)x0D(x0)eβU(x0)τx0bx0dx0  =φa(x0)eβU(x0)dx0. (A16)

Integrating the left-hand side by parts and taking advantage of the fact that U(±)=, we find

D(x0)eβU(x0)φa(x0)x0τx0bx0dx0=φaeβU(y)dy, (A17)

where we have used the definition of the equilibrium average, φa, in Eq. (2.11). Since the derivative of φa(x) is φ(x0a)/x0 inside the interval a<x0<b and zero outside, Eq. (A17) becomes

abD(x0)eβU(x0)φ(x0a)x0τx0bx0dx0  =φaeβU(y)dy. (A18)

Integrating the right-hand side by parts, using Eq. (A11) and the fact that τ(bb)=0, we obtain

D(a)eβU(a)φ(x0a)x0x0=aτab=φaeβU(y)dy. (A19)

By differentiating Eq. (A14) for φ(x0a) and using the expression for the flux Jab in Eq. (A9), one can see that the factor in front of τ(ab) on the left-hand side of the above equation is the product JabeβU(y)dy. Thus, we recover the expression for τab in Eq. (2.12), τab=φa/Jab.

Let us now generalize this to many dimensions with two milestones that are infinite non-intersecting surfaces Σa and Σb that divide the space into three regions: Ωa (left of Σa), Ωab (between Σa and Σb), and Ωb (right of Σb). The multidimensional generalization of Eq. (A16) is

φa(x0)D(x0)eβU(x0)τx0Σbdx0  =φa(x0)eβU(x0)dx0. (A20)

Here φa(x0)=1 for x0Ωa, φa(x0)=φx0Σa for x0Ωab, and φa(x0)=0 for x0Ωb is the multidimensional analog of φa(x) in Eq. (2.10a), D(x0) is the position-dependent diffusivity tensor at point x0, and τx0Σb is the MFPT from point x0 located in regions Ωa or Ωab to the surface Σb. Integrating the left-hand side of Eq. (A20) by parts and using the fact that the surface term vanishes, we obtain

φ(x0)D(x0)eβU(x0)τx0Σbdx0=φaeβU(y)dy, (A21)

where φa is the Boltzmann average of φa(x) analogous to that in Eq. (2.11). Equation (A21) is the multidimensional generalization of Eq. (A17).

Now φa(x0) is non-zero only in the region Ωab between the two surfaces, where φa(x0)=φ(x0Σa). Thus Eq. (A21) becomes

Ωabφ(x0Σa)D(x0)eβU(x0)τx0Σbdx0  =φaeβU(y)dy. (A22)

If we integrate the left-hand side by parts and apply Gauss’s divergence theorem, only the integral over the surface Σa survives because τxΣbxΣb=0 and φx0Σa satisfies, D(x0)eβU(x0)φx0Σa=0, which is the multidimensional version of the Onsager equation in Eq. (A11). As a result, Eq. (A22) reduces to

Σan(x0)D(x0)eβU(x0)φx0Σaτx0Σbdx0  =φaeβU(y)dy, (A23)

where n(x0) is a unit vector perpendicular to the surface Σa, pointing towards Σb.

It can be shown that the total unidirectional equilibrium flux Jab between the two surfaces is given by17

Jab=Σan(x0)D(x0)eβU(x0)φx0Σadx0eβU(y)dy. (A24)

Using this, Eq. (A23) can be written as

τΣaΣb=φaJab. (A25)

Here τΣaΣb is the MFPT between the two surfaces defined by

τΣaΣb=Σapflux(x0)τx0Σbdx0, (A26)

where pflux(x0),x0Σa, defined as

pflux(x0)=n(x0)D(x0)eβU(x0)φx0ΣaΣan(x0)D(x0)eβU(x0)φx0Σadx0, (A27)

is the normalized distribution of the unidirectional flux from surface Σa to surface Σb passing through the point x0 on surface Σa at equilibrium.

Note: This article is part of the Special Topic “Markov Models of Molecular Kinetics” in J. Chem. Phys.

REFERENCES

  • 1.Bolhuis P. G., Chandler D., Dellago C., and Geissler P. L., Annu. Rev. Phys. Chem. 53, 291 (2002). 10.1146/annurev.physchem.53.082301.113146 [DOI] [PubMed] [Google Scholar]
  • 2.Hummer G., J. Chem. Phys. 120, 516 (2004). 10.1063/1.1630572 [DOI] [PubMed] [Google Scholar]
  • 3.Vanden-Eijnden E. and W. E., Annu. Rev. Phys. Chem. 61, 391 (2010). 10.1146/annurev.physchem.040808.090412 [DOI] [PubMed] [Google Scholar]
  • 4.Faradjian A. K. and Elber R., J. Chem. Phys. 120, 10880 (2004). 10.1063/1.1738640 [DOI] [PubMed] [Google Scholar]
  • 5.Vanden-Eijnden E., Venturoli M., Ciccotti G., and Elber R., J. Chem. Phys. 129, 174102 (2008). 10.1063/1.2996509 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Elber R., Q. Rev. Biophys. 50, e8 (2017). 10.1017/s0033583517000063 [DOI] [PubMed] [Google Scholar]
  • 7.Hinczewski M., von Hansen Y., Dzubiella J., and Netz R. R., J. Chem. Phys. 132, 245103 (2010). 10.1063/1.3442716 [DOI] [PubMed] [Google Scholar]
  • 8.Vanden-Eijnden E. and Venturoli M., J. Chem. Phys. 131, 044120 (2009). 10.1063/1.3180821 [DOI] [PubMed] [Google Scholar]
  • 9.Buchete V. and Hummer G., J. Phys. Chem. B 112, 6057 (2008). 10.1021/jp0761665 [DOI] [PubMed] [Google Scholar]
  • 10.van Erp T. S., Moroni D., and Bolhuis P. G., J. Chem. Phys. 118, 7762 (2003). 10.1063/1.1562614 [DOI] [PubMed] [Google Scholar]
  • 11.Cabriolu R., Skjelbred Refsnes K. M., Bolhuis P. G., and van Erp T. S., J. Chem. Phys. 147, 152722 (2017). 10.1063/1.4989844 [DOI] [PubMed] [Google Scholar]
  • 12.Kramers H., Physica 7, 284 (1940). 10.1016/s0031-8914(40)90098-2 [DOI] [Google Scholar]
  • 13.Bello-Rivas J. M. and Elber R., J. Chem. Phys. 142, 094102 (2015). 10.1063/1.4913399 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Aristoff D., Bello-Rivas J. M., and Elber R., Multiscale Model. Simul. 14, 301 (2016). 10.1137/15m102157x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Lin L., Lu J. F., and Vanden-Eijnden E., Commun. Pure Appl. Math. 71, 1149 (2018). 10.1002/cpa.21725 [DOI] [Google Scholar]
  • 16.Bezrukov S. M., Berezhkovskii A. M., and Szabo A., J. Chem. Phys. 127, 115101 (2007). 10.1063/1.2766720 [DOI] [PubMed] [Google Scholar]
  • 17.Berezhkovskii A. M. and Szabo A., J. Phys. Chem. B 117, 13115 (2013). 10.1021/jp403043a [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from The Journal of Chemical Physics are provided here courtesy of American Institute of Physics

RESOURCES