Abstract
Milestoning on a one-dimensional potential starts by choosing a set of points, called milestones, and initiating short trajectories from each milestone, which are terminated when they reach an adjacent milestone for the first time. From the average duration of these trajectories and the probabilities of where they terminate, a rate matrix can be constructed and then used to calculate the mean first-passage time (MFPT) between any two milestones. All these MFPT’s turn out to be exact. Here we adopt a point of view from which this remarkable result is not unexpected. In addition, we clarify the nature of the “states” whose interconversion is described by the rate matrix constructed using information obtained from short trajectories and provide a microscopic expression for the “equilibrium population” of these states in terms of equilibrium averages of the committors.
I. INTRODUCTION
Many recent advances in the theory of rare events were not made algebraically but rather by analyzing the behavior of trajectories. Both transition path theory1–3 and milestoning4–6 can be regarded as part of the emerging field of “statistical mechanics of trajectories.” This paper grew out of our efforts to understand Elber’s milestoning in the simplest context.
Operationally, Elber’s milestoning for one-dimensional diffusive dynamics on a potential can be formulated as follows: Choose an arbitrary set of points labeled by the index i, i = 1, 2, ..., N. Point i is called milestone i. Run Brownian dynamics trajectories starting from milestone i and terminate them when they reach either milestone i − 1 or milestone i + 1 for the first time. The average duration of these trajectories, denoted by ti, is by construction the mean first-passage time (MFPT) from milestone i to one of the two milestones, i − 1 and i + 1. In addition, the fraction of trajectories terminated at milestone i + 1 is determined. This is the splitting probability or committor denoted by φ(i → i + 1). Clearly, . This information is then used to construct an N × N rate matrix R that describes the interconversion of yet unspecified “states” I, I = 1, 2, ..., N. This is a three-diagonal matrix with off-diagonal matrix elements and diagonal elements . This guarantees that the mean lifetime of state I, given by , is ti and the probability to go from state I to state I + 1, given by , is , where ti and were obtained by running many short trajectories. It is important to emphasize that the distributions of lifetimes of states I are single-exponential, whereas the exact distributions of the MFPT’s in general are not. Nevertheless, the MFPT between any two states obtained by using the rate matrix R turns out to be exact! The importance of this result is that the exact MFPT can be obtained, for example, for two milestones separated by a high barrier. To directly obtain this MFPT by running Brownian dynamics simulations would take virtually forever. On the other hand, using milestoning, all one has to do is run many relatively short trajectories starting from milestones.
Thus, milestoning maps the diffusive dynamics onto a nearest-neighbor continuous-time Markov state model. The fact that the MFPT’s between the states, obtained using the rate matrix, are identical to the MFPT’s between the corresponding milestones calculated using the Smoluchowski equation was initially surprising to us. One goal of this paper is to find a way of thinking that makes this, at first sight, remarkable result not unexpected. To do this, we need to understand the nature of the “states” I whose interconversion is described by the rate matrix R, and the meaning of their “equilibrium populations” obtained from the eigenvector of this rate matrix corresponding to zero eigenvalue.
The outline of this paper is as follows: In Sec. II, we consider just two milestones. This may seem too trivial to provide any insight because the MFPT’s of the two-state model are exact by construction. However, it turns out to be the simplest context where one can understand what the states are and what their equilibrium populations mean. In Sec. III, we consider three milestones, which is the simplest case that involves splitting probabilities. Some concluding remarks are presented in Sec. IV. In the Appendix, we algebraically derive some identities that have been obtained in Sec. II by analyzing a long equilibrium trajectory.
II. TWO MILESTONES
We begin by analyzing the simplest possible example of milestoning. Consider a particle diffusing in a bounded one-dimensional potential U(x), . Choose two arbitrary points (milestones), x = a, and x = b, a < b. The average duration of trajectories, initiated at point a and terminated at point b, is the mean first-passage time (MFPT) from a to b, τ(a → b). Similarly, the MFPT τ(b → a) can be obtained by starting trajectories from point b and terminating them when they reach point a.
If points a and b were located at the minima of two deep wells, A and B, of a bistable potential, separated by a high barrier, then the distributions of both first-passage times would be very nearly exponential. In this case, it would make sense to describe the dynamics of the system in terms of Markovian transitions between the two wells, i.e., by a two-state model
| (2.1) |
where the rate constants for inter-well transitions are given by
| (2.2) |
If the two points are arbitrary, then the distributions of the first-passage times are no longer exponential. Nevertheless, one can formally adopt the above two-state description, i.e., use the MFPT’s and build a two-state model as discussed above. At least the mean lifetimes of the two states, A and B, and , will be correct. The equilibrium populations, and , of these states are
| (2.3) |
The equilibrium unidirectional flux between them, i.e., the number of A-to-B (or B-to-A) transitions per unit time at equilibrium, , is
| (2.4) |
For future reference, note that the rate constants in this model can be expressed as “flux-over-population,” i.e., , . Substituting in Eq. (2.4) the rate constants and equilibrium populations given in Eqs. (2.2) and (2.3), we find that
| (2.5) |
The sum of the MFPT’s in the denominator has been called the mean round-trip time between points a and b.7 Thus the equilibrium unidirectional flux is the inverse mean round-trip time. However, it is not at all obvious what the two “states,” A and B, associated with milestones a and b are, what the equilibrium populations of these states mean, and what the unidirectional flux corresponds to.
A. Coloring an equilibrium trajectory
To answer these questions, consider a long equilibrium trajectory, x(t), of duration T, T → ∞. One way of obtaining the first passage times between points a and b is to use the trajectory coloring procedure introduced by Vanden-Eijnden and Venturoli8 (see also Ref. 3). Let us color the trajectory either red or blue using the following rules: the trajectory changes color from red to blue when it comes to point b from point a, and from blue to red when it comes to point a from point b, as show in the upper panel of Fig. 1. This is similar to the procedure used by Buchete and Hummer9 to define states when constructing Markov state models. In addition, a referee pointed out that it is closely related to transition interface sampling.10,11 Let Ta (Tb) be the total time the trajectory is red (blue) so that Ta + Tb = T. Let Nab be the number of times the color of the trajectory changes from red-to-blue or equivalently from blue-to-red during time T. The MFPT’s and are the average durations of the red and blue segments,
| (2.6) |
Thus the mean lifetimes of the red and blue segments, and , are equal to the mean lifetimes of states A and B, and , respectively.
FIG. 1.
Upper panel: A long equilibrium trajectory generated by Brownian dynamics simulations that is colored red and blue. The color changes from red to blue when the trajectory from a reaches b for the first time and remains blue until it reaches a for the first time whereupon it turns red. The color of the trajectory depends on the last milestone it touched: the color is red (blue), if the last touched milestone is a (b). Lower panel: The colored trajectory x(t) is converted into a two-state trajectory where the lifetime of state A (B) is equal to the time during which the trajectory x(t) is red (blue).
The probability that the trajectory is red, , is the fraction , while the probability that the trajectory is blue, , is the fraction . Using the relations between the MFPT’s and times Ta and Tb, Eq. (2.6), these can be written as
| (2.7) |
These are identical to the equilibrium populations in Eq. (2.3) obtained using the two-state model with rate constants given in Eq. (2.2). Therefore, state A (B) associated with milestone a (b) corresponds to red (blue) trajectory segments, as shown in the lower panel of Fig. 1. Note that states A and B are not thermodynamic states. The reason is that the trajectory can be both red and blue in the region between the milestones, . The color is red (blue) if the last milestone touched by the trajectory before coming to point x is point a (b). Therefore, when , the trajectory can be sometimes red (state A) and sometimes blue (state B) (see Fig. 1), while in thermodynamics any spatial point can belong to only one state. Thus, states A and B associated with the milestones are states of the trajectory rather than thermodynamic states.
The unidirectional flux between points a and b at equilibrium, , by definition, is
| (2.8) |
Using Eq. (2.6), this flux can be written in terms of the MFPT’s and as
| (2.9) |
This is identical to the flux JAB, Eq. (2.5), obtained in the framework of the two-state model. Thus, the equilibrium unidirectional flux between points a and b is the same as with the number of transitions from trajectory state A to state B (and vice versa) per unit time, at equilibrium, .
To summarize, using the trajectory coloring procedure, we found that states A and B, associated with milestones a and b, indicate the color of the trajectory (red in state A and blue in state B). The equilibrium populations of these states, and , are the fractions of time when the trajectory is red and blue, and , respectively. Finally, the equilibrium unidirectional flux between these states, JAB, is the equilibrium unidirectional flux, Jab, between the two milestones.
The two-state model using the rate constants defined as the reciprocals of the MFPT’s is remarkably successful in describing transitions between the trajectory segments of different color. While the MFPT’s calculated from the two-state model are exact by construction, this model gives the exact relation between the unidirectional flux at equilibrium and the MFPT’s, Eq. (2.9). Although at first glance this might seem surprising, there is a point of view from which it is not unexpected. The trajectory corresponding to the two-state model can be exactly generated using the Gillespie algorithm, where the lifetimes of states A and B are chosen from the exponential distributions. For example, for state A, the distribution is . While these exponential distributions differ from the exact lifetime distributions of the red and blue trajectory segments, the corresponding lifetimes are equal on average (the mean lifetimes are the same by construction). Thus, on average, the two-state trajectory is equivalent to the red-blue trajectory obtained by coloring the initial trajectory shown in the lower panel of Fig. 1. Therefore, the average number of unidirectional transitions per unit time calculated from the two-state trajectory must be the same as that obtained from the colored trajectory. As we shall see in Sec. III, this is the simplest way of understanding why a three-state model exactly gives all the MFPT’s and equilibrium unidirectional fluxes for three milestones.
B. Microscopic interpretation of equilibrium populations of states A and B
Next, we discuss a microscopic interpretation of the equilibrium populations of states A and B. According to the coloring rules, the trajectory is always red when and blue when . In the intermediate region, , the color of the trajectory depends on the prehistory. The trajectory is red, if it entered the intermediate region through point a, and blue, if it entered through point b. Because of detailed balance or time-reversal symmetry, the probability that the trajectory reaches x coming from a is equal to the probability that starting form x, the trajectory reaches a before touching b. This probability, denoted by , is referred to as the committor or splitting probability. Similarly, the probability that the trajectory reaches x having entered the intermediate region from b is equal to the probability that starting form x, the trajectory reaches b before touching a. This probability, denoted by φ(x → b), is φ(x → b) = 1 − φ(x → a).
The probability that the trajectory is colored red (state A) at an arbitrary point x, denoted by φa(x), is
| (2.10a) |
The probability that the trajectory at point x is colored blue (state B), denoted by , is
| (2.10b) |
Since we are dealing with an equilibrium trajectory, the probability of finding the trajectory at point x is given by the Boltzmann distribution, , where with , where kB and T denote the Boltzmann constant and the absolute temperature. Therefore, the equilibrium probability densities that an arbitrary point x is red or blue are , i = a, b. As a consequence, the equilibrium probability of finding the trajectory red (blue) is the Boltzmann average of (),
| (2.11) |
This probability is also the equilibrium population of state . Thus, the equilibrium population of state I is the Boltzmann average of the probability .
C. MFPT’s, rate constants, and equilibrium unidirectional flux
The MFPT’s between points a and b, given in Eq. (2.6), can be written in terms of equilibrium populations, Eqs. (2.7) and (2.11), and the equilibrium unidirectional flux, Eq. (2.8),
| (2.12) |
These are exact identities which are verified in the Appendix using analytical expressions obtained in the framework of the theory of first-passage processes. It should be pointed out that the mean transition path or direct-transit time between points a and b, denoted by , can be written2 as .
Using the above results for the MFPT’s in Eq. (2.2), the rate constants for transitions between states A and B, defined by the color of the trajectory, can be written as
| (2.13) |
These expressions have been previously obtained by Vanden-Eijnden and Venturoli.8 The relation between our notation and theirs is (unidirectional flux at equilibrium), (the probability that the trajectory at point x is colored red), and (the equilibrium probability that the trajectory is red).
D. Relation to Kramers theory
When the milestones a and b happen to be at the minima of two deep wells separated by a high barrier, the above expressions for the rate constants reduce to those of the Kramers theory. Kramers used flux divided by thermodynamic well population as the definition the rate constant.12 In the case of two deep wells, the probability is unity not only when but also is essentially unity for a wider range of x, , where is a point between point a and the barrier top, chosen so that the potential energy is several ’s above the potential energy at the well bottom, U(a). Therefore, is essentially . The value of the integral in the numerator is determined by the potential near the well bottom at x = a and is insensitive to the location of point as long as it lies sufficiently far from point a (the energy difference exceeds several ’s). Consequently, the equilibrium population of non-thermodynamic state A, , is equal to the equilibrium thermodynamic population of well A. This happens because the interval, where points may have both colors, is located in the barrier region. When the barrier is high, the equilibrium population in this interval is vanishingly small.
As shown in the Appendix, the equilibrium unidirectional flux between points a and b is given by , where D(x) is the position-dependent diffusivity. Substituting Jab and above into Eq. (2.13), we arrive at
| (2.14) |
While the value of the first integral in the denominator is determined by the behavior of the potential near the A-well bottom, the value of the second integral is determined by the behavior of the potential near the barrier top. The value of the latter integral is insensitive to the integration limits when the potential energies U(a) and U(b) are several ’s below the barrier top. Assuming that U(x) is quadratic near the A-well bottom and the barrier top, and the diffusivity is position-independent, we replace U(x) in the integrands by the corresponding quadratic approximations, then let , , and , and perform the integrations. In this way, we recover the celebrated Kramers formula for the rate constant for diffusive barrier crossing (high-friction regime).
III. THREE MILESTONES
We now consider the much more interesting case of three milestones located at points x = a, x = b, and x = c, . By starting trajectories at points a or c and terminating them when they reach point b, we can find the MFPT’s and as before. The milestone located at point b is different because it has milestones on both sides. Here we initiate trajectories at point b and terminate them when they reach either point a or point c. In this way, we determine the MFPT, , required to reach points a or c for the first time starting from point b. In addition, we determine the committor or splitting probability that a trajectory starting at point b reaches point a before point c, denoted by . Clearly, .
We use these MFPT’s and splitting probabilities to build a three-state model of the dynamics
| (3.1) |
Here A, B, and C are yet unspecified “states” associated with milestones a, b, and c. The transitions between them are described by the rate constants defined as
| (3.2) |
The rate constants and are defined so as to ensure that (1) the splitting probabilities for and transitions, and , are and , and (2) the mean lifetime of state B, , is . The mean lifetimes of states A and C, and , respectively, are and . The time evolution of the probability of finding the system in state I, pI(t), I = A, B, C, is given by
| (3.3) |
where is the column vector, (pA, pB, pC), and the rate matrix, , is
| (3.4) |
The normalized equilibrium populations of the three states, , , and , , are solutions of . Solving this equation and replacing the rate constants by the MFPT’s and splitting probabilities, using Eq. (3.2), we find
| (3.5) |
where is
| (3.6) |
Given the rate matrix, , we can calculate the MFPT’s between any two states by solving
| (3.7) |
subject to the appropriate boundary condition (i.e., if we are interested in the MFPT’s to state I, then ). In this equation, which is the discrete analog of Eq. (A6) in the Appendix, is the transpose of matrix , is a column vector of the MFPT’s, and 1 is a column vector with components equal to unity. Solving Eq. (3.7) and using Eq. (3.2), we can express the MFPT’s between any two states in terms of the MFPT’s and splitting probabilities obtained from the simulations. For example, one can show that , as to be expected, and more interestingly, that
| (3.8) |
The MFPT between states A and C is given by .
Since matrix can be expressed as
| (3.9) |
we can write Eq. (3.7) in the form5
| (3.10) |
Here is the vector of mean lifetimes, and is the matrix of the splitting probabilities,
| (3.11) |
Equations (3.7) and (3.10) must be solved subject to the same boundary conditions. Alternatively, matrix can be redefined so as to explicitly incorporate these boundary conditions.
A. Coloring an equilibrium trajectory
As in the case of two milestones, we are interested in determining the nature of the “states” associated with the milestones and microscopic meaning of their equilibrium populations, , , and , that are normalized solutions of . To this end, as before, consider how the MFPT’s can be obtained from a long equilibrium trajectory x(t) of duration T, , by coloring it in red, blue, and orange, using the rules proposed by Vanden-Eijnden and Venturoli.8 The trajectory changes its color from red to blue when it comes to point b from point a and from orange to blue when it comes to this point from point c. Then the trajectory remains blue until it touches point a or c for the first time, when it turns red or orange, respectively, as shown in the upper panel of Fig. 2.
FIG. 2.
The same as Fig. 1 but for three milestones. Upper panel: The trajectory x(t) colored red, blue, and orange. Blue changes to orange (red) when the trajectory x(t) reaches point c (a) for the first time. When (), the trajectory can be either blue or orange (blue or red). The color of the trajectory depends on the last milestone it touched: the color is red (blue or orange), if the last touched milestone is a (b or c). Lower panes: A three-state trajectory corresponding to the colored trajectory x(t).
Denote the total times when the trajectory is red, blue, and orange by Ta, Tb, and Tc, and the numbers of times the trajectory changed color from red to blue and from blue to orange and vice versa by Na b and Nb c, respectively. The MFPT’s , , and are the average durations of the red, blue, and orange segments,
| (3.12) |
The splitting probability () is the number of times a blue segment turned red (orange), Na b (Nb c), divided by the total number of times it changed color,
| (3.13) |
Using the relations in Eqs. (3.12) and (3.13), we can write the total observation time as
| (3.14) |
where is given in Eq. (3.6).
B. Equilibrium populations and MFPT’s
The relations in Eqs. (3.12) and (3.13) link the MFPT’s and splitting probabilities obtained from the short simulations initiated from the milestones with information obtained from a long trajectory. We use this to find the fractions of time when the trajectory is red, blue, and orange, , and show that these fractions are equal to the equilibrium populations of states I, πI, , obtained from the rate matrix, , i.e., . For example, the fraction of time when the trajectory is red, , can be recast as
| (3.15) |
where we have used Eq. (3.12) for Ta and Eq. (3.14) for T. Finally, using Eq. (3.13), we obtain
| (3.16) |
Comparing this with in Eq. (3.5), we see that . Similarly, it can be shown that and .
In summary, the mean durations of the red, blue, and orange segments of the trajectory are equal to the mean lifetimes of states A, B, and C of the three-state model. In addition, the fractions of time when the trajectory is red, blue, and orange are equal to the equilibrium populations, , , and , of the three states. Consequently, states A, B, and C of the three-state model, associated with the milestones, correspond to red, blue, and orange segments of the long trajectory, respectively.
Times, Ta, Tb, and Tc, and the numbers of color changes, Na b and Nb c, obtained from the long trajectory can be used to find the exact MFPT’s between any two milestones, not just those in Eq. (3.12). For example, suppose we are interested in the MFPT between milestones b and c, . If the color of the red trajectory fragments is changed to blue, then we are faced with a two (b and c) milestone problem where the duration of the blue trajectories is now Ta + Tb, and so
| (3.17) |
Now we use Eq. (3.12) to write Ta and Tb in terms of the MFPT’s, i.e., and . Substituting this into Eq. (3.17) and using the definitions of the splitting probabilities and in terms of the numbers of color changes in Eq. (3.13), we arrive at
| (3.18) |
Comparing this with in Eq. (3.8), obtained using the three-state model, we see that . Thus, the three-state model predicts the exact MFPT between points b and c even though this MFPT was not used to determine any of the rate constants. In a similar way, one can show that all possible MFPT’s predicted by the three-state model are exact.
How can we understand this remarkable success of the three-state model? Any quantity that can be calculated using the rate matrix R can also be obtained from a long equilibrium trajectory in the space of states A, B, and C generated using the Gillespie algorithm. All input parameters for this algorithm (i.e., the lifetimes of the states and splitting probabilities) can be obtained from the rate matrix R. Let us compare such a trajectory with the exact one shown in the lower panel of Fig. 2. The most dramatic difference is in the distributions of the state lifetimes and those of the durations of the colored segments. In the trajectory generated using the Gillespie algorithm, the distributions of the state lifetimes are single-exponential, while the distributions of the durations of the colored segments are in general not. However, the mean lifetimes and the splitting probabilities in both trajectories are the same because the Gillespie algorithm used the exact MFPT’s and splitting probabilities as input. Therefore, any average quantity (e.g., the MFPT’s) calculated using the rate matrix must be exact.
C. Microscopic interpretation of “equilibrium populations”
We will now express the equilibrium populations given by the three-state model, , (which are identical to the fractions of time, , when the trajectory is red, blue, and orange) in terms of the Boltzmann equilibrium averages of the committors. Let be the probabilities and be the probability densities that the trajectory at point x has the color associated with the milestone i, i = a, b, c. Just as in the two-milestone case, pi(x), are given by the products of the Boltzmann distribution and the probabilities φi(x), . The probabilities φi(x) are
| (3.19) |
| (3.20) |
and
| (3.21) |
Here () is the probability of reaching point a (c) starting from point x, (), before reaching point b, and , for , and , for . One can see that the probabilities are normalized at every point x, . The equilibrium populations of states I, πI, I = A, B, C, given in terms of the Boltzmann equilibrium averages of the committors φi(x), are
| (3.22) |
D. Unidirectional fluxes and rate constants
It should be no surprise by now that unidirectional fluxes at equilibrium (i.e., the numbers of transitions per unit time) between states A and B, , and between states B and C, , calculated in the framework of the three-state model, are the same as those between the points a and b, , and points b and c, , calculated from the long trajectory. To see this, consider the equilibrium unidirectional flux JBC,
| (3.23) |
where we have used the fact that and the expression for given in Eq. (3.2). The equilibrium unidirectional flux Jbc between milestones b and c, by definition, is
| (3.24) |
where we have used the relations in Eqs. (3.12) and (3.13). Thus and similarly, . Because of this and since , all the rate constants in Eq. (3.2) can be written as “flux-over-population”
| (3.25) |
just as in the two-milestone case [see Eq. (2.13)].
The mean lifetime of state B, tB, or the mean duration of the blue trajectory segment, tb, is . Using the expressions for the rate constants in Eq. (3.25), we obtain the following relation among the MFPT, , the Boltzmann averaged committor, , and the equilibrium unidirectional fluxes:
| (3.26) |
which is to be compared with the results in Eq. (2.12) for and in the case of two milestones. The sum of fluxes, , is denoted by qb in the milestoning literature.6 The identity in Eq. (3.26) can be proved the old-fashioned way, algebraically, using the formalism given in the Appendix: is the solution of Eq. (A6), in which should be replaced by , with boundary conditions , while is the solution of Eq. (A11), in which should be replaced by , with boundary conditions and .
IV. CONCLUDING REMARKS
The above analysis can be readily generalized to an arbitrary number of milestones in one dimension. Thus, by running short trajectories starting from one milestone and stopping at an adjacent milestone, one can calculate the exact MFPT between any two milestones no matter how far apart they are. The importance of this result is that one can obtain very long MFPT’s, say, between two milestones separated by a high barrier, that would be virtually impossible to simulate directly. In simplest terms, the reason for this remarkable result is that a long trajectory generated by the Gillespie algorithm with parameters obtained from the rate matrix is “on the average” equivalent to the exact trajectory (i.e., the lower panels in Figs. 1 and 2). By “on the average,” we mean that while the distribution of the lifetimes of the various states are different, the average lifetimes as well as other average properties such as equilibrium populations, unidirectional fluxes, and MFPT’s will be the same.
How can this be generalized to multidimensional diffusive dynamics? In this case, milestones are no longer points but rather surfaces, and the first-passage times between the milestones depend on the location of the starting points on the surface. If we could choose the distribution of starting points in such a way that the MFPT’s obtained by running short trajectories would be identical to those obtained by coloring a long equilibrium trajectory, then all our arguments would immediately carry over. This distribution is related to the probability that the equilibrium trajectory crosses or hits a particular point on the surface (for an algebraic derivation, see the end of the Appendix). How this distribution can be found iteratively by running only short trajectories for non-diffusive dynamics is the focus of an extensive literature that can be accessed through some recent references13–15 and earlier ones therein. We hope that our work will help to reduce the “activation barrier” encountered in trying to understand these papers.
Finally, we would like to mention that once we did something16 in the context of solute transport through a membrane channel, which in retrospect was closely related to milestoning. We considered two models of solute transport through a membrane channel, one, where a solute diffuses through the channel in the presence of a potential of mean force, and the other, where the solute simply jumps between two sites located at the channel ends. We showed that the fluxes calculated within the framework of these models are identical when the jump rates between the two states are chosen as the reciprocal of the MFPT’s to diffuse from one end of the channel to the other. In this way, we provided a microscopic interpretation of the phenomenological rate constants of the widely used two-site model of the solute dynamics in the channel.
ACKNOWLEDGMENTS
We have benefitted from discussions with Ron Elber, Gerhard Hummer, Eric Vanden-Eijnden, and Vladimir Zitserman. We thank Robert Best for help with the manuscript. This study was supported by the Intramural Research Program of the NIH, Center for Information Technology and National Institute of Diabetes and Digestive and Kidney Diseases.
APPENDIX: ALGEBRAIC DERIVATION OF THE RELATION AMONG THE MFPT, FLUX, AND COMMITTOR
Here we will algebraically derive the relation among the MFPT, , the equilibrium-averaged committor, , and the equilibrium unidirectional flux, Jab.
Consider a particle diffusing in a potential U(x), , with position-dependent diffusivity, D(x). The probability density of finding the particle at point x at time t, given that it was initially (at t = 0) at point x0, , satisfies the Smoluchowski equation
| (A1) |
subject to the initial condition . It follows from the detailed balance condition that . Substituting this into Eq. (A1) and interchanging the notations of the variables , one finds that satisfies
| (A2) |
which is the adjoint or backward Smoluchowski equation.
Let us now choose point x = b to be an absorbing boundary, i.e., . The survival probability of a particle starting from x0, x0 < b, at t = 0, denoted by , is defined as
| (A3) |
Integrating both sides of Eq. (A2) over x from −∞ to b, we find that the survival probability satisfies
| (A4) |
subject to the initial condition . The mean particle lifetime is the MFPT from point x0 to point b, ,
| (A5) |
Integrating both sides of Eq. (A4) over time from 0 to ∞ and using the facts that and , we find that the MFPT satisfies
| (A6) |
This must be solved subject to the boundary condition .
Multiplying both sides of Eq. (A6) by , integrating over x0 from −∞ to x, where , and using the fact that , we obtain
| (A7) |
Dividing both sides by , integrating the resulting equation from x to b, and using the fact that , we arrive at
| (A8a) |
Similarly, for the MFPT from x to a, , we have
| (A8b) |
According to Eq. (2.5), the equilibrium unidirectional flux Jab between milestones a and b is . Using Eqs. (A8a) and (A8b), it can be shown that this flux is given by
| (A9) |
This result can also be obtained from
| (A10) |
where the probability density p(x) at the end points is and p(b) = 0, i.e., milestone b is absorbing and point a is kept at equilibrium. Dividing both sides of Eq. (A10) by and then integrating both sides of the resulting equation from a to b, we recover the result for the flux in Eq. (A9).
The committor (splitting probability) , , satisfies the Onsager equation, , subject to the boundary conditions and . Using , defined in Eq. (A2), we can write the Onsager equation as
| (A11) |
Consequently, we have
| (A12) |
where C is a constant to be determined using the boundary conditions at x = a and x = b. Dividing both sides by , integrating both sides of the resulting equation from x to b, and using the boundary condition at point b, , we arrive at
| (A13) |
Using the boundary condition at point a, , to find C, we finally obtain
| (A14) |
Now we are ready to calculate , where φa(x) is given in Eq. (2.10a),
| (A15) |
In the last step, we used Eq. (A9) for Jab and Eq. (A14) for . The sum in the square brackets is . To see this, one has to change the order of integration in the second term in the square brackets, write the sum of integrals as a single double integral, and then compare with obtained from Eq. (A8a). Thus, we have derived the result for in Eq. (2.12), .
This can be done simpler in a way that can be readily generalized to many dimensions. Multiplying both sides of Eq. (A6) by , where is given in Eq. (2.10a), and integrating both sides over all x0, one has
| (A16) |
Integrating the left-hand side by parts and taking advantage of the fact that , we find
| (A17) |
where we have used the definition of the equilibrium average, , in Eq. (2.11). Since the derivative of is inside the interval and zero outside, Eq. (A17) becomes
| (A18) |
Integrating the right-hand side by parts, using Eq. (A11) and the fact that , we obtain
| (A19) |
By differentiating Eq. (A14) for and using the expression for the flux Jab in Eq. (A9), one can see that the factor in front of on the left-hand side of the above equation is the product . Thus, we recover the expression for in Eq. (2.12), .
Let us now generalize this to many dimensions with two milestones that are infinite non-intersecting surfaces and that divide the space into three regions: (left of ), (between and ), and (right of ). The multidimensional generalization of Eq. (A16) is
| (A20) |
Here for , for , and for is the multidimensional analog of in Eq. (2.10a), is the position-dependent diffusivity tensor at point x0, and is the MFPT from point x0 located in regions or to the surface . Integrating the left-hand side of Eq. (A20) by parts and using the fact that the surface term vanishes, we obtain
| (A21) |
where is the Boltzmann average of analogous to that in Eq. (2.11). Equation (A21) is the multidimensional generalization of Eq. (A17).
Now is non-zero only in the region between the two surfaces, where . Thus Eq. (A21) becomes
| (A22) |
If we integrate the left-hand side by parts and apply Gauss’s divergence theorem, only the integral over the surface survives because and satisfies, , which is the multidimensional version of the Onsager equation in Eq. (A11). As a result, Eq. (A22) reduces to
| (A23) |
where is a unit vector perpendicular to the surface , pointing towards .
It can be shown that the total unidirectional equilibrium flux between the two surfaces is given by17
| (A24) |
Using this, Eq. (A23) can be written as
| (A25) |
Here is the MFPT between the two surfaces defined by
| (A26) |
where , defined as
| (A27) |
is the normalized distribution of the unidirectional flux from surface to surface passing through the point on surface at equilibrium.
Note: This article is part of the Special Topic “Markov Models of Molecular Kinetics” in J. Chem. Phys.
REFERENCES
- 1.Bolhuis P. G., Chandler D., Dellago C., and Geissler P. L., Annu. Rev. Phys. Chem. 53, 291 (2002). 10.1146/annurev.physchem.53.082301.113146 [DOI] [PubMed] [Google Scholar]
- 2.Hummer G., J. Chem. Phys. 120, 516 (2004). 10.1063/1.1630572 [DOI] [PubMed] [Google Scholar]
- 3.Vanden-Eijnden E. and W. E., Annu. Rev. Phys. Chem. 61, 391 (2010). 10.1146/annurev.physchem.040808.090412 [DOI] [PubMed] [Google Scholar]
- 4.Faradjian A. K. and Elber R., J. Chem. Phys. 120, 10880 (2004). 10.1063/1.1738640 [DOI] [PubMed] [Google Scholar]
- 5.Vanden-Eijnden E., Venturoli M., Ciccotti G., and Elber R., J. Chem. Phys. 129, 174102 (2008). 10.1063/1.2996509 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Elber R., Q. Rev. Biophys. 50, e8 (2017). 10.1017/s0033583517000063 [DOI] [PubMed] [Google Scholar]
- 7.Hinczewski M., von Hansen Y., Dzubiella J., and Netz R. R., J. Chem. Phys. 132, 245103 (2010). 10.1063/1.3442716 [DOI] [PubMed] [Google Scholar]
- 8.Vanden-Eijnden E. and Venturoli M., J. Chem. Phys. 131, 044120 (2009). 10.1063/1.3180821 [DOI] [PubMed] [Google Scholar]
- 9.Buchete V. and Hummer G., J. Phys. Chem. B 112, 6057 (2008). 10.1021/jp0761665 [DOI] [PubMed] [Google Scholar]
- 10.van Erp T. S., Moroni D., and Bolhuis P. G., J. Chem. Phys. 118, 7762 (2003). 10.1063/1.1562614 [DOI] [PubMed] [Google Scholar]
- 11.Cabriolu R., Skjelbred Refsnes K. M., Bolhuis P. G., and van Erp T. S., J. Chem. Phys. 147, 152722 (2017). 10.1063/1.4989844 [DOI] [PubMed] [Google Scholar]
- 12.Kramers H., Physica 7, 284 (1940). 10.1016/s0031-8914(40)90098-2 [DOI] [Google Scholar]
- 13.Bello-Rivas J. M. and Elber R., J. Chem. Phys. 142, 094102 (2015). 10.1063/1.4913399 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Aristoff D., Bello-Rivas J. M., and Elber R., Multiscale Model. Simul. 14, 301 (2016). 10.1137/15m102157x [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Lin L., Lu J. F., and Vanden-Eijnden E., Commun. Pure Appl. Math. 71, 1149 (2018). 10.1002/cpa.21725 [DOI] [Google Scholar]
- 16.Bezrukov S. M., Berezhkovskii A. M., and Szabo A., J. Chem. Phys. 127, 115101 (2007). 10.1063/1.2766720 [DOI] [PubMed] [Google Scholar]
- 17.Berezhkovskii A. M. and Szabo A., J. Phys. Chem. B 117, 13115 (2013). 10.1021/jp403043a [DOI] [PMC free article] [PubMed] [Google Scholar]


