Committors, first-passage times, fluxes, Markov states, milestones, and all that

Alexander M Berezhkovskii; Attila Szabo

doi:10.1063/1.5079742

. 2019 Feb 6;150(5):054106. doi: 10.1063/1.5079742

Committors, first-passage times, fluxes, Markov states, milestones, and all that

Alexander M Berezhkovskii ^1,^a), Attila Szabo ²

PMCID: PMC6910584 PMID: 30736684

Abstract

Milestoning on a one-dimensional potential starts by choosing a set of points, called milestones, and initiating short trajectories from each milestone, which are terminated when they reach an adjacent milestone for the first time. From the average duration of these trajectories and the probabilities of where they terminate, a rate matrix can be constructed and then used to calculate the mean first-passage time (MFPT) between any two milestones. All these MFPT’s turn out to be exact. Here we adopt a point of view from which this remarkable result is not unexpected. In addition, we clarify the nature of the “states” whose interconversion is described by the rate matrix constructed using information obtained from short trajectories and provide a microscopic expression for the “equilibrium population” of these states in terms of equilibrium averages of the committors.

I. INTRODUCTION

Many recent advances in the theory of rare events were not made algebraically but rather by analyzing the behavior of trajectories. Both transition path theory^1–3 and milestoning^4–6 can be regarded as part of the emerging field of “statistical mechanics of trajectories.” This paper grew out of our efforts to understand Elber’s milestoning in the simplest context.

Operationally, Elber’s milestoning for one-dimensional diffusive dynamics on a potential can be formulated as follows: Choose an arbitrary set of points labeled by the index i, i = 1, 2, ..., N. Point i is called milestone i. Run Brownian dynamics trajectories starting from milestone i and terminate them when they reach either milestone i − 1 or milestone i + 1 for the first time. The average duration of these trajectories, denoted by t_i, is by construction the mean first-passage time (MFPT) from milestone i to one of the two milestones, i − 1 and i + 1. In addition, the fraction of trajectories terminated at milestone i + 1 is determined. This is the splitting probability or committor denoted by φ(i → i + 1). Clearly, $φ (i \to i - 1) = 1 - φ (i \to i + 1)$ . This information is then used to construct an N × N rate matrix R that describes the interconversion of yet unspecified “states” I, I = 1, 2, ..., N. This is a three-diagonal matrix with off-diagonal matrix elements $R_{I I \pm 1} = φ (i \to i \pm 1) / t_{i}$ and diagonal elements $R_{I I} = - 1 / t_{i}$ . This guarantees that the mean lifetime of state I, given by ${(R_{I I + 1} + R_{I I - 1})}^{- 1}$ , is t_i and the probability to go from state I to state I + 1, given by $R_{I I + 1} / (R_{I I + 1} + R_{I I - 1})$ , is $φ (i \to i + 1)$ , where t_i and $φ (i \to i + 1)$ were obtained by running many short trajectories. It is important to emphasize that the distributions of lifetimes of states I are single-exponential, whereas the exact distributions of the MFPT’s in general are not. Nevertheless, the MFPT between any two states obtained by using the rate matrix R turns out to be exact! The importance of this result is that the exact MFPT can be obtained, for example, for two milestones separated by a high barrier. To directly obtain this MFPT by running Brownian dynamics simulations would take virtually forever. On the other hand, using milestoning, all one has to do is run many relatively short trajectories starting from milestones.

Thus, milestoning maps the diffusive dynamics onto a nearest-neighbor continuous-time Markov state model. The fact that the MFPT’s between the states, obtained using the rate matrix, are identical to the MFPT’s between the corresponding milestones calculated using the Smoluchowski equation was initially surprising to us. One goal of this paper is to find a way of thinking that makes this, at first sight, remarkable result not unexpected. To do this, we need to understand the nature of the “states” I whose interconversion is described by the rate matrix R, and the meaning of their “equilibrium populations” obtained from the eigenvector of this rate matrix corresponding to zero eigenvalue.

The outline of this paper is as follows: In Sec. II, we consider just two milestones. This may seem too trivial to provide any insight because the MFPT’s of the two-state model are exact by construction. However, it turns out to be the simplest context where one can understand what the states are and what their equilibrium populations mean. In Sec. III, we consider three milestones, which is the simplest case that involves splitting probabilities. Some concluding remarks are presented in Sec. IV. In the Appendix, we algebraically derive some identities that have been obtained in Sec. II by analyzing a long equilibrium trajectory.

II. TWO MILESTONES

We begin by analyzing the simplest possible example of milestoning. Consider a particle diffusing in a bounded one-dimensional potential U(x), $U (x \to \pm \infty) = \infty$ . Choose two arbitrary points (milestones), x = a, and x = b, a < b. The average duration of trajectories, initiated at point a and terminated at point b, is the mean first-passage time (MFPT) from a to b, τ(a → b). Similarly, the MFPT τ(b → a) can be obtained by starting trajectories from point b and terminating them when they reach point a.

If points a and b were located at the minima of two deep wells, A and B, of a bistable potential, separated by a high barrier, then the distributions of both first-passage times would be very nearly exponential. In this case, it would make sense to describe the dynamics of the system in terms of Markovian transitions between the two wells, i.e., by a two-state model

A \underset{k_{B \to A}}{\overset{k_{A \to B}}{⇄}} B,

(2.1)

where the rate constants for inter-well transitions are given by

k_{A \to B} = \frac{1}{τ (a \to b)}, k_{B \to A} = \frac{1}{τ (b \to a)} .

(2.2)

If the two points are arbitrary, then the distributions of the first-passage times are no longer exponential. Nevertheless, one can formally adopt the above two-state description, i.e., use the MFPT’s and build a two-state model as discussed above. At least the mean lifetimes of the two states, A and B, $k_{A \to B}^{- 1}$ and $k_{B \to A}^{- 1}$ , will be correct. The equilibrium populations, $π_{A}$ and $π_{B}$ , of these states are

\begin{matrix} π_{A} = \frac{k_{B \to A}}{k_{A \to B} + k_{B \to A}} = \frac{τ (a \to b)}{τ (a \to b) + τ (b \to a)}, π_{B} = 1 - π_{A} . \end{matrix}

(2.3)

The equilibrium unidirectional flux between them, i.e., the number of A-to-B (or B-to-A) transitions per unit time at equilibrium, $J_{A B} = J_{A \to B} = J_{B \to A}$ , is

J_{A B} = π_{A} k_{A \to B} = π_{B} k_{B \to A} .

(2.4)

For future reference, note that the rate constants in this model can be expressed as “flux-over-population,” i.e., $k_{A \to B} = J_{A B} / π_{A}$ , $k_{B \to A} = J_{A B} / π_{B}$ . Substituting in Eq. (2.4) the rate constants and equilibrium populations given in Eqs. (2.2) and (2.3), we find that

J_{A B} = \frac{1}{τ (a \to b) + τ (b \to a)} .

(2.5)

The sum of the MFPT’s in the denominator has been called the mean round-trip time between points a and b.⁷ Thus the equilibrium unidirectional flux is the inverse mean round-trip time. However, it is not at all obvious what the two “states,” A and B, associated with milestones a and b are, what the equilibrium populations of these states mean, and what the unidirectional flux corresponds to.

A. Coloring an equilibrium trajectory

To answer these questions, consider a long equilibrium trajectory, x(t), of duration T, T → ∞. One way of obtaining the first passage times between points a and b is to use the trajectory coloring procedure introduced by Vanden-Eijnden and Venturoli⁸ (see also Ref. 3). Let us color the trajectory either red or blue using the following rules: the trajectory changes color from red to blue when it comes to point b from point a, and from blue to red when it comes to point a from point b, as show in the upper panel of Fig. 1. This is similar to the procedure used by Buchete and Hummer⁹ to define states when constructing Markov state models. In addition, a referee pointed out that it is closely related to transition interface sampling.^10,11 Let T_a (T_b) be the total time the trajectory is red (blue) so that T_a + T_b = T. Let N_ab be the number of times the color of the trajectory changes from red-to-blue or equivalently from blue-to-red during time T. The MFPT’s $τ (a \to b)$ and $τ (b \to a)$ are the average durations of the red and blue segments,

τ (a \to b) = \frac{T_{a}}{N_{a b}}, τ (b \to a) = \frac{T_{b}}{N_{a b}} .

(2.6)

Thus the mean lifetimes of the red and blue segments, $τ (a \to b)$ and $τ (b \to a)$ , are equal to the mean lifetimes of states A and B, $k_{A \to B}^{- 1}$ and $k_{B \to A}^{- 1}$ , respectively.

FIG. 1. — Upper panel: A long equilibrium trajectory generated by Brownian dynamics simulations that is colored red and blue. The color changes from red to blue when the trajectory from a reaches b for the first time and remains blue until it reaches a for the first time whereupon it turns red. The color of the trajectory depends on the last milestone it touched: the color is red (blue), if the last touched milestone is a (b). Lower panel: The colored trajectory x(t) is converted into a two-state trajectory where the lifetime of state A (B) is equal to the time during which the trajectory x(t) is red (blue).

The probability that the trajectory is red, $π_{a}$ , is the fraction $T_{a} / T = T_{a} / (T_{a} + T_{b})$ , while the probability that the trajectory is blue, $π_{b}$ , is the fraction $T_{b} / T = T_{b} / (T_{a} + T_{b})$ . Using the relations between the MFPT’s and times T_a and T_b, Eq. (2.6), these can be written as

π_{a} = \frac{T_{a}}{T_{a} + T_{b}} = \frac{τ (a \to b)}{τ (a \to b) + τ (b \to a)}, π_{b} = 1 - π_{a} .

(2.7)

These are identical to the equilibrium populations in Eq. (2.3) obtained using the two-state model with rate constants given in Eq. (2.2). Therefore, state A (B) associated with milestone a (b) corresponds to red (blue) trajectory segments, as shown in the lower panel of Fig. 1. Note that states A and B are not thermodynamic states. The reason is that the trajectory can be both red and blue in the region between the milestones, $a < x < b$ . The color is red (blue) if the last milestone touched by the trajectory before coming to point x is point a (b). Therefore, when $a < x < b$ , the trajectory can be sometimes red (state A) and sometimes blue (state B) (see Fig. 1), while in thermodynamics any spatial point can belong to only one state. Thus, states A and B associated with the milestones are states of the trajectory rather than thermodynamic states.

The unidirectional flux between points a and b at equilibrium, $J_{a b} = J_{a \to b} = J_{b \to a}$ , by definition, is

J_{a b} = \frac{N_{a b}}{T} = \frac{N_{a b}}{T_{a} + T_{b}} .

(2.8)

Using Eq. (2.6), this flux can be written in terms of the MFPT’s $τ (a \to b)$ and $τ (b \to a)$ as

J_{a b} = \frac{1}{τ (a \to b) + τ (b \to a)} .

(2.9)

This is identical to the flux J_AB, Eq. (2.5), obtained in the framework of the two-state model. Thus, the equilibrium unidirectional flux between points a and b is the same as with the number of transitions from trajectory state A to state B (and vice versa) per unit time, at equilibrium, $J_{a b} = J_{A B}$ .

To summarize, using the trajectory coloring procedure, we found that states A and B, associated with milestones a and b, indicate the color of the trajectory (red in state A and blue in state B). The equilibrium populations of these states, $π_{A}$ and $π_{B}$ , are the fractions of time when the trajectory is red and blue, $π_{a}$ and $π_{b}$ , respectively. Finally, the equilibrium unidirectional flux between these states, J_AB, is the equilibrium unidirectional flux, J_ab, between the two milestones.

The two-state model using the rate constants defined as the reciprocals of the MFPT’s is remarkably successful in describing transitions between the trajectory segments of different color. While the MFPT’s calculated from the two-state model are exact by construction, this model gives the exact relation between the unidirectional flux at equilibrium and the MFPT’s, Eq. (2.9). Although at first glance this might seem surprising, there is a point of view from which it is not unexpected. The trajectory corresponding to the two-state model can be exactly generated using the Gillespie algorithm, where the lifetimes of states A and B are chosen from the exponential distributions. For example, for state A, the distribution is $k_{A \to B} \exp (- k_{A \to B} t) = \exp (- t / τ (a \to b)) / τ (a \to b)$ . While these exponential distributions differ from the exact lifetime distributions of the red and blue trajectory segments, the corresponding lifetimes are equal on average (the mean lifetimes are the same by construction). Thus, on average, the two-state trajectory is equivalent to the red-blue trajectory obtained by coloring the initial trajectory shown in the lower panel of Fig. 1. Therefore, the average number of unidirectional transitions per unit time calculated from the two-state trajectory must be the same as that obtained from the colored trajectory. As we shall see in Sec. III, this is the simplest way of understanding why a three-state model exactly gives all the MFPT’s and equilibrium unidirectional fluxes for three milestones.

B. Microscopic interpretation of equilibrium populations of states A and B

Next, we discuss a microscopic interpretation of the equilibrium populations of states A and B. According to the coloring rules, the trajectory is always red when $x \leq a$ and blue when $x \geq b$ . In the intermediate region, $a < x < b$ , the color of the trajectory depends on the prehistory. The trajectory is red, if it entered the intermediate region through point a, and blue, if it entered through point b. Because of detailed balance or time-reversal symmetry, the probability that the trajectory reaches x coming from a is equal to the probability that starting form x, the trajectory reaches a before touching b. This probability, denoted by $φ (x \to a)$ , is referred to as the committor or splitting probability. Similarly, the probability that the trajectory reaches x having entered the intermediate region from b is equal to the probability that starting form x, the trajectory reaches b before touching a. This probability, denoted by φ(x → b), is φ(x → b) = 1 − φ(x → a).

The probability that the trajectory is colored red (state A) at an arbitrary point x, denoted by φ_a(x), is

φ_{a} (x) = \{\begin{matrix} 1, & x \leq a \\ φ (x \to a), & a \leq x \leq b \\ 0, & b \leq x \end{matrix} .

(2.10a)

The probability that the trajectory at point x is colored blue (state B), denoted by $φ_{b} (x)$ , is

φ_{b} (x) = \{\begin{matrix} 0, & x \leq a \\ φ (x \to b), & a \leq x \leq b \\ 1, & b \leq x \end{matrix} .

(2.10b)

Since we are dealing with an equilibrium trajectory, the probability of finding the trajectory at point x is given by the Boltzmann distribution, $\exp (- β U (x)) / Z$ , where $Z = \int_{- \infty}^{\infty} \exp (- β U (x)) d x$ with $β = 1 / (k_{B} T)$ , where k_B and T denote the Boltzmann constant and the absolute temperature. Therefore, the equilibrium probability densities that an arbitrary point x is red or blue are $φ_{i} (x) \exp (- β U (x)) / Z$ , i = a, b. As a consequence, the equilibrium probability of finding the trajectory red (blue) is the Boltzmann average of $φ_{a} (x)$ ( $φ_{b} (x)$ ),

π_{i} = ⟨φ_{i}⟩ = \frac{\int_{- \infty}^{\infty} φ_{i} (x) e^{- β U (x)} d x}{\int_{- \infty}^{\infty} e^{- β U (x)} d x}, i = a, b .

(2.11)

This probability is also the equilibrium population $π_{i}$ of state $I, I = A, B$ . Thus, the equilibrium population of state I is the Boltzmann average of the probability $φ_{i} (x), π_{I} = ⟨ φ_{i} ⟩$ .

C. MFPT’s, rate constants, and equilibrium unidirectional flux

The MFPT’s between points a and b, given in Eq. (2.6), can be written in terms of equilibrium populations, Eqs. (2.7) and (2.11), and the equilibrium unidirectional flux, Eq. (2.8),

\begin{matrix} τ (a \to b) = \frac{T_{a}}{N_{a b}} = \frac{T_{a}}{T_{a} + T_{b}} \frac{T_{a} + T_{b}}{N_{a b}} = \frac{⟨φ_{a}⟩}{J_{a b}}, τ (b \to a) = \frac{⟨φ_{b}⟩}{J_{a b}} . \end{matrix}

(2.12)

These are exact identities which are verified in the Appendix using analytical expressions obtained in the framework of the theory of first-passage processes. It should be pointed out that the mean transition path or direct-transit time between points a and b, denoted by $τ (a \leftrightarrow b)$ , can be written² as $τ (a \leftrightarrow b) = ⟨φ_{a} φ_{b}⟩ / J_{a b}$ .

Using the above results for the MFPT’s in Eq. (2.2), the rate constants for transitions between states A and B, defined by the color of the trajectory, can be written as

k_{A \to B} = \frac{J_{a b}}{⟨φ_{a}⟩}, k_{B \to A} = \frac{J_{a b}}{⟨φ_{b}⟩} .

(2.13)

These expressions have been previously obtained by Vanden-Eijnden and Venturoli.⁸ The relation between our notation and theirs is $J \leftrightarrow ν_{R}$ (unidirectional flux at equilibrium), $φ_{a} (x) \leftrightarrow q_{-} (x)$ (the probability that the trajectory at point x is colored red), and $⟨φ_{a}⟩ = ρ_{a}$ (the equilibrium probability that the trajectory is red).

D. Relation to Kramers theory

When the milestones a and b happen to be at the minima of two deep wells separated by a high barrier, the above expressions for the rate constants reduce to those of the Kramers theory. Kramers used flux divided by thermodynamic well population as the definition the rate constant.¹² In the case of two deep wells, the probability $φ_{a} (x)$ is unity not only when $x \leq a$ but also is essentially unity for a wider range of x, $a < x < x^{*}$ , where $x^{*}$ is a point between point a and the barrier top, chosen so that the potential energy $U (x^{*})$ is several $k_{B} T$ ’s above the potential energy at the well bottom, U(a). Therefore, $⟨φ_{a}⟩$ is essentially $\int_{- \infty}^{x^{*}} e^{- β U (x)} d x / \int_{- \infty}^{\infty} e^{- β U (x)} d x$ . The value of the integral in the numerator is determined by the potential near the well bottom at x = a and is insensitive to the location of point $x^{*}$ as long as it lies sufficiently far from point a (the energy difference $U (x^{*}) - U (a)$ exceeds several $k_{B} T$ ’s). Consequently, the equilibrium population of non-thermodynamic state A, $⟨φ_{a}⟩$ , is equal to the equilibrium thermodynamic population of well A. This happens because the interval, where points may have both colors, is located in the barrier region. When the barrier is high, the equilibrium population in this interval is vanishingly small.

As shown in the Appendix, the equilibrium unidirectional flux between points a and b is given by $J_{a b} = {[(\int_{- \infty}^{\infty} e^{- β U (x)} d x) (\int_{a}^{b} e^{β U (x)} d x / D (x))]}^{- 1}$ , where D(x) is the position-dependent diffusivity. Substituting J_ab and $⟨φ_{a}⟩$ above into Eq. (2.13), we arrive at

k_{A \to B} = \frac{1}{(\int_{- \infty}^{x^{*}} e^{- β U (x)} d x) (\int_{a}^{b} e^{β U (x)} d x / D (x))} .

(2.14)

While the value of the first integral in the denominator is determined by the behavior of the potential near the A-well bottom, the value of the second integral is determined by the behavior of the potential near the barrier top. The value of the latter integral is insensitive to the integration limits when the potential energies U(a) and U(b) are several $k_{B} T$ ’s below the barrier top. Assuming that U(x) is quadratic near the A-well bottom and the barrier top, and the diffusivity is position-independent, we replace U(x) in the integrands by the corresponding quadratic approximations, then let $x^{*} \to \infty$ , $a \to - \infty$ , and $b \to \infty$ , and perform the integrations. In this way, we recover the celebrated Kramers formula for the rate constant for diffusive barrier crossing (high-friction regime).

III. THREE MILESTONES

We now consider the much more interesting case of three milestones located at points x = a, x = b, and x = c, $a < b < c$ . By starting trajectories at points a or c and terminating them when they reach point b, we can find the MFPT’s $τ (a \to b)$ and $τ (c \to b)$ as before. The milestone located at point b is different because it has milestones on both sides. Here we initiate trajectories at point b and terminate them when they reach either point a or point c. In this way, we determine the MFPT, $τ (a \leftarrow b \to c)$ , required to reach points a or c for the first time starting from point b. In addition, we determine the committor or splitting probability that a trajectory starting at point b reaches point a before point c, denoted by $φ (b \to a)$ . Clearly, $φ (b \to c) = 1 - φ (b \to a)$ .

We use these MFPT’s and splitting probabilities to build a three-state model of the dynamics

A \underset{k_{B \to A}}{\overset{k_{A \to B}}{⇄}} B \underset{k_{C \to B}}{\overset{k_{B \to C}}{⇄}} c .

(3.1)

Here A, B, and C are yet unspecified “states” associated with milestones a, b, and c. The transitions between them are described by the rate constants defined as

\begin{matrix} k_{A \to B} = \frac{1}{τ (a \to b)}, k_{B \to A} = \frac{φ (b \to a)}{τ (a \leftarrow b \to c)}, \\ k_{C \to B} = \frac{1}{τ (c \to b)}, k_{B \to C} = \frac{φ (b \to c)}{τ (a \leftarrow b \to c)} . \end{matrix}

(3.2)

The rate constants $k_{B \to A}$ and $k_{B \to C}$ are defined so as to ensure that (1) the splitting probabilities for $B \to A$ and $B \to C$ transitions, $k_{B \to A} / (k_{B \to A} + k_{B \to C})$ and $k_{B \to C} / (k_{B \to A} + k_{B \to C})$ , are $φ (b \to a)$ and $φ (b \to c)$ , and (2) the mean lifetime of state B, ${(k_{B \to A} + k_{B \to C})}^{- 1}$ , is $t_{B} = τ (a \leftarrow b \to c)$ . The mean lifetimes of states A and C, $k_{A \to B}^{- 1}$ and $k_{C \to B}^{- 1}$ , respectively, are $t_{A} = τ (a \to b)$ and $t_{C} = τ (c \to b)$ . The time evolution of the probability of finding the system in state I, p_I(t), I = A, B, C, is given by

\frac{d p}{d t} = R p,

(3.3)

where $p$ is the column vector, (p_A, p_B, p_C), and the rate matrix, $R$ , is

R = (\begin{matrix} - k_{A \to B} & k_{B \to A} & 0 \\ k_{A \to B} & - (k_{B \to A} + k_{B \to C}) & k_{C \to B} \\ 0 & k_{B \to C} & - k_{C \to B} \end{matrix}) .

(3.4)

The normalized equilibrium populations of the three states, $π_{A}$ , $π_{B}$ , and $π_{C}$ , $π_{A} + π_{B} + π_{C} = 1$ , are solutions of $R π = 0$ . Solving this equation and replacing the rate constants by the MFPT’s and splitting probabilities, using Eq. (3.2), we find

\begin{matrix} π_{A} = \frac{φ (b \to a) τ (a \to b)}{Δ}, π_{B} = \frac{τ (a \leftarrow b \to c)}{Δ}, \\ π_{A} = \frac{φ (b \to c) τ (c \to b)}{Δ}, \end{matrix}

(3.5)

where $Δ$ is

Δ = φ (b \to a) τ (a \to b) + τ (a \leftarrow b \to c) + φ (b \to c) τ (c \to b) .

(3.6)

Given the rate matrix, $R$ , we can calculate the MFPT’s between any two states by solving

R^{T} τ = - 1

(3.7)

subject to the appropriate boundary condition (i.e., if we are interested in the MFPT’s to state I, then $τ (I \to I) = 0$ ). In this equation, which is the discrete analog of Eq. (A6) in the Appendix, $R^{T}$ is the transpose of matrix $R$ , $τ$ is a column vector of the MFPT’s, and 1 is a column vector with components equal to unity. Solving Eq. (3.7) and using Eq. (3.2), we can express the MFPT’s between any two states in terms of the MFPT’s and splitting probabilities obtained from the simulations. For example, one can show that $τ (A \to B) = τ (a \to b)$ , as to be expected, and more interestingly, that

τ (B \to C) = \frac{φ (b \to a) τ (a \to b) + τ (a \leftarrow b \to c)}{φ (b \to c)} .

(3.8)

The MFPT between states A and C is given by $τ (A \to C) = τ (A \to B) + τ (B \to C)$ .

Since matrix $R^{T}$ can be expressed as

R^{T} = (\begin{matrix} - t_{A}^{- 1} & 0 & 0 \\ 0 & - t_{B}^{- 1} & 0 \\ 0 & 0 & - t_{C}^{- 1} \end{matrix}) (\begin{matrix} 1 & - 1 & 0 \\ - φ (b \to a) & 1 & - φ (b \to c) \\ 0 & - 1 & 1 \end{matrix}),

(3.9)

we can write Eq. (3.7) in the form⁵

(I - Φ) τ = t .

(3.10)

Here $t$ is the vector of mean lifetimes, and $Φ$ is the matrix of the splitting probabilities,

Φ = (\begin{matrix} 0 & 1 & 0 \\ φ (b \to a) & 0 & φ (b \to c) \\ 0 & 1 & 0 \end{matrix}) .

(3.11)

Equations (3.7) and (3.10) must be solved subject to the same boundary conditions. Alternatively, matrix $Φ$ can be redefined so as to explicitly incorporate these boundary conditions.

A. Coloring an equilibrium trajectory

As in the case of two milestones, we are interested in determining the nature of the “states” associated with the milestones and microscopic meaning of their equilibrium populations, $π_{A}$ , $π_{B}$ , and $π_{C}$ , that are normalized solutions of $R π = 0$ . To this end, as before, consider how the MFPT’s can be obtained from a long equilibrium trajectory x(t) of duration T, $T \to \infty$ , by coloring it in red, blue, and orange, using the rules proposed by Vanden-Eijnden and Venturoli.⁸ The trajectory changes its color from red to blue when it comes to point b from point a and from orange to blue when it comes to this point from point c. Then the trajectory remains blue until it touches point a or c for the first time, when it turns red or orange, respectively, as shown in the upper panel of Fig. 2.

FIG. 2. — The same as Fig. 1 but for three milestones. Upper panel: The trajectory x(t) colored red, blue, and orange. Blue changes to orange (red) when the trajectory x(t) reaches point c (a) for the first time. When $b < x < c$ ( $a < x < b$ ), the trajectory can be either blue or orange (blue or red). The color of the trajectory depends on the last milestone it touched: the color is red (blue or orange), if the last touched milestone is a (b or c). Lower panes: A three-state trajectory corresponding to the colored trajectory x(t).

Denote the total times when the trajectory is red, blue, and orange by T_a, T_b, and T_c, and the numbers of times the trajectory changed color from red to blue and from blue to orange and vice versa by N_{a b} and N_{b c}, respectively. The MFPT’s $τ (a \to b)$ , $τ (a \leftarrow b \to c)$ , and $τ (c \to b)$ are the average durations of the red, blue, and orange segments,

\begin{matrix} τ (a \to b) = \frac{T_{a}}{N_{a b}}, τ (a \leftarrow b \to c) = \frac{T_{b}}{N_{a b} + N_{b c}}, τ (c \to b) = \frac{T_{c}}{N_{b c}} . \end{matrix}

(3.12)

The splitting probability $φ (b \to a)$ ( $φ (b \to c)$ ) is the number of times a blue segment turned red (orange), N_{a b} (N_{b c}), divided by the total number of times it changed color,

φ (b \to a) = \frac{N_{a b}}{N_{a b} + N_{b c}}, φ (b \to c) = \frac{N_{b c}}{N_{a b} + N_{b c}} .

(3.13)

Using the relations in Eqs. (3.12) and (3.13), we can write the total observation time as

\begin{matrix} T & = & T_{a} + T_{b} + T_{c} = \\ = & N_{a b} τ (a \to b) + (N_{a b} + N_{b c}) τ (a \leftarrow b \to c) + N_{b c} τ (c \to b) \\ = & (N_{a b} + N_{b c}) Δ, \end{matrix}

(3.14)

where $Δ$ is given in Eq. (3.6).

B. Equilibrium populations and MFPT’s

The relations in Eqs. (3.12) and (3.13) link the MFPT’s and splitting probabilities obtained from the short simulations initiated from the milestones with information obtained from a long trajectory. We use this to find the fractions of time when the trajectory is red, blue, and orange, $π_{i} = T_{i} / T, i = a, b, c$ , and show that these fractions are equal to the equilibrium populations of states I, π_I, $I = A, B, C$ , obtained from the rate matrix, $R$ , i.e., $π_{i} = π_{I}$ . For example, the fraction of time when the trajectory is red, $π_{a} = T_{a} / T$ , can be recast as

π_{a} = \frac{T_{a}}{T} = \frac{N_{a b} τ (a \to b)}{(N_{a b} + N_{b c}) Δ},

(3.15)

where we have used Eq. (3.12) for T_a and Eq. (3.14) for T. Finally, using Eq. (3.13), we obtain

π_{a} = \frac{T_{a}}{T} = \frac{φ (b \to a) τ (a \to b)}{Δ} .

(3.16)

Comparing this with $π_{A}$ in Eq. (3.5), we see that $π_{a} = π_{A}$ . Similarly, it can be shown that $π_{b} = T_{b} / T = π_{B}$ and $π_{c} = T_{c} / T = π_{C}$ .

In summary, the mean durations of the red, blue, and orange segments of the trajectory are equal to the mean lifetimes of states A, B, and C of the three-state model. In addition, the fractions of time when the trajectory is red, blue, and orange are equal to the equilibrium populations, $π_{A}$ , $π_{B}$ , and $π_{C}$ , of the three states. Consequently, states A, B, and C of the three-state model, associated with the milestones, correspond to red, blue, and orange segments of the long trajectory, respectively.

Times, T_a, T_b, and T_c, and the numbers of color changes, N_{a b} and N_{b c}, obtained from the long trajectory can be used to find the exact MFPT’s between any two milestones, not just those in Eq. (3.12). For example, suppose we are interested in the MFPT between milestones b and c, $τ (b \to c)$ . If the color of the red trajectory fragments is changed to blue, then we are faced with a two (b and c) milestone problem where the duration of the blue trajectories is now T_a + T_b, and so

τ (b \to c) = \frac{T_{a} + T_{b}}{N_{b c}} .

(3.17)

Now we use Eq. (3.12) to write T_a and T_b in terms of the MFPT’s, i.e., $T_{a} = N_{a b} τ (a \to b)$ and $T_{b} = (N_{a b} + N_{b c}) τ (a \leftarrow b \to c)$ . Substituting this into Eq. (3.17) and using the definitions of the splitting probabilities $φ (b \to a)$ and $φ (b \to c)$ in terms of the numbers of color changes in Eq. (3.13), we arrive at

τ (b \to c) = \frac{φ (b \to a)}{φ (b \to c)} τ (a \to b) + \frac{1}{φ (b \to c)} τ (a \leftarrow b \to c) .

(3.18)

Comparing this with $τ (B \to C)$ in Eq. (3.8), obtained using the three-state model, we see that $τ (b \to c) = τ (B \to C)$ . Thus, the three-state model predicts the exact MFPT between points b and c even though this MFPT was not used to determine any of the rate constants. In a similar way, one can show that all possible MFPT’s predicted by the three-state model are exact.

How can we understand this remarkable success of the three-state model? Any quantity that can be calculated using the rate matrix R can also be obtained from a long equilibrium trajectory in the space of states A, B, and C generated using the Gillespie algorithm. All input parameters for this algorithm (i.e., the lifetimes of the states and splitting probabilities) can be obtained from the rate matrix R. Let us compare such a trajectory with the exact one shown in the lower panel of Fig. 2. The most dramatic difference is in the distributions of the state lifetimes and those of the durations of the colored segments. In the trajectory generated using the Gillespie algorithm, the distributions of the state lifetimes are single-exponential, while the distributions of the durations of the colored segments are in general not. However, the mean lifetimes and the splitting probabilities in both trajectories are the same because the Gillespie algorithm used the exact MFPT’s and splitting probabilities as input. Therefore, any average quantity (e.g., the MFPT’s) calculated using the rate matrix must be exact.

C. Microscopic interpretation of “equilibrium populations”

We will now express the equilibrium populations given by the three-state model, $π_{I}$ , (which are identical to the fractions of time, $π_{i}$ , when the trajectory is red, blue, and orange) in terms of the Boltzmann equilibrium averages of the committors. Let $φ_{i} (x)$ be the probabilities and $p_{i} (x)$ be the probability densities that the trajectory at point x has the color associated with the milestone i, i = a, b, c. Just as in the two-milestone case, p_i(x), are given by the products of the Boltzmann distribution and the probabilities φ_i(x), $p_{i} (x) = φ_{i} (x) \exp (- β U (x)) / Z$ . The probabilities φ_i(x) are

φ_{a} (x) = \{\begin{matrix} 1, & x \leq a \\ φ (x \to a), & a \leq x \leq b \\ 0, & b \leq x \end{matrix},

(3.19)

φ_{b} (x) = \{\begin{matrix} 0, & x \leq a \\ φ (x \to b), & a \leq x \leq c \\ 0, & c \leq x \end{matrix},

(3.20)

and

φ_{c} (x) = \{\begin{matrix} 0, & x \leq b \\ φ (x \to c), & b \leq x \leq c \\ 1, & c \leq x \end{matrix} .

(3.21)

Here $φ (x \to a)$ ( $φ (x \to c)$ ) is the probability of reaching point a (c) starting from point x, $a \leq x \leq b$ ( $b \leq x \leq c$ ), before reaching point b, and $φ (x \to b) = 1 - φ (x \to a)$ , for $a \leq x \leq b$ , and $φ (x \to b) = 1 - φ (x \to c)$ , for $b \leq x \leq c$ . One can see that the probabilities $φ_{i} (x)$ are normalized at every point x, $φ_{a} (x) + φ_{b} (x) + φ_{c} (x) = 1$ . The equilibrium populations of states I, π_I, I = A, B, C, given in terms of the Boltzmann equilibrium averages of the committors φ_i(x), are

π_{I} = π_{i} = \int_{- \infty}^{\infty} p_{i} (x) d x = ⟨φ_{i} (x)⟩, I = A, B, C, i = a, b, c .

(3.22)

D. Unidirectional fluxes and rate constants

It should be no surprise by now that unidirectional fluxes at equilibrium (i.e., the numbers of transitions per unit time) between states A and B, $J_{A B} = J_{A \to B} = J_{B \to A}$ , and between states B and C, $J_{B C} = J_{B \to C} = J_{C \to B}$ , calculated in the framework of the three-state model, are the same as those between the points a and b, $J_{a b} = J_{a \to b} = J_{b \to a}$ , and points b and c, $J_{b c} = J_{b \to c} = J_{c \to b}$ , calculated from the long trajectory. To see this, consider the equilibrium unidirectional flux J_BC,

J_{B C} = π_{B} k_{B \to C} = \frac{π_{b} φ (b \to c)}{τ (a \leftarrow b \to c)},

(3.23)

where we have used the fact that $π_{B} = π_{b}$ and the expression for $k_{B \to C}$ given in Eq. (3.2). The equilibrium unidirectional flux J_bc between milestones b and c, by definition, is

J_{b c} = \frac{N_{b c}}{T} = \frac{T_{b}}{T} \cdot \frac{N_{b c}}{N_{a b} + N_{b c}} \cdot \frac{N_{a b} + N_{b c}}{T_{b}} = \frac{π_{b} φ (b \to c)}{τ (a \leftarrow b \to c)},

(3.24)

where we have used the relations in Eqs. (3.12) and (3.13). Thus $J_{B C} = J_{b c}$ and similarly, $J_{A B} = J_{a b}$ . Because of this and since $π_{I} = π_{i} = ⟨φ_{i}⟩$ , all the rate constants in Eq. (3.2) can be written as “flux-over-population”

\begin{matrix} k_{A \to B} = \frac{J_{a b}}{⟨φ_{a}⟩}, k_{B \to A} = \frac{J_{a b}}{⟨φ_{b}⟩}, \\ k_{C \to B} = \frac{J_{b c}}{⟨φ_{c}⟩}, k_{B \to C} = \frac{J_{b c}}{⟨φ_{b}⟩}, \end{matrix}

(3.25)

just as in the two-milestone case [see Eq. (2.13)].

The mean lifetime of state B, t_B, or the mean duration of the blue trajectory segment, t_b, is $t_{B} = t_{b} = {(k_{B \to A} + k_{B \to C})}^{- 1} = τ (a \leftarrow b \to c)$ . Using the expressions for the rate constants in Eq. (3.25), we obtain the following relation among the MFPT, $τ (a \leftarrow b \to c)$ , the Boltzmann averaged committor, $⟨φ_{b}⟩$ , and the equilibrium unidirectional fluxes:

τ (a \leftarrow b \to c) = \frac{⟨φ_{b}⟩}{J_{a b} + J_{b c}},

(3.26)

which is to be compared with the results in Eq. (2.12) for $τ (a \to b)$ and $τ (b \to a)$ in the case of two milestones. The sum of fluxes, $J_{a b} + J_{b c}$ , is denoted by q_b in the milestoning literature.⁶ The identity in Eq. (3.26) can be proved the old-fashioned way, algebraically, using the formalism given in the Appendix: $τ (a \leftarrow x_{0} \to c)$ is the solution of Eq. (A6), in which $τ (x_{0} \to b)$ should be replaced by $τ (a \leftarrow x_{0} \to c)$ , with boundary conditions $τ (a \leftarrow a \to c) = τ (a \leftarrow c \to c) = 0$ , while $φ (x \to b)$ is the solution of Eq. (A11), in which $φ (x \to a)$ should be replaced by $φ (x \to b)$ , with boundary conditions $φ (b \to b) = 1$ and $φ (a \to b) = φ (c \to b) = 0$ .

IV. CONCLUDING REMARKS

The above analysis can be readily generalized to an arbitrary number of milestones in one dimension. Thus, by running short trajectories starting from one milestone and stopping at an adjacent milestone, one can calculate the exact MFPT between any two milestones no matter how far apart they are. The importance of this result is that one can obtain very long MFPT’s, say, between two milestones separated by a high barrier, that would be virtually impossible to simulate directly. In simplest terms, the reason for this remarkable result is that a long trajectory generated by the Gillespie algorithm with parameters obtained from the rate matrix is “on the average” equivalent to the exact trajectory (i.e., the lower panels in Figs. 1 and 2). By “on the average,” we mean that while the distribution of the lifetimes of the various states are different, the average lifetimes as well as other average properties such as equilibrium populations, unidirectional fluxes, and MFPT’s will be the same.

How can this be generalized to multidimensional diffusive dynamics? In this case, milestones are no longer points but rather surfaces, and the first-passage times between the milestones depend on the location of the starting points on the surface. If we could choose the distribution of starting points in such a way that the MFPT’s obtained by running short trajectories would be identical to those obtained by coloring a long equilibrium trajectory, then all our arguments would immediately carry over. This distribution is related to the probability that the equilibrium trajectory crosses or hits a particular point on the surface (for an algebraic derivation, see the end of the Appendix). How this distribution can be found iteratively by running only short trajectories for non-diffusive dynamics is the focus of an extensive literature that can be accessed through some recent references^13–15 and earlier ones therein. We hope that our work will help to reduce the “activation barrier” encountered in trying to understand these papers.

Finally, we would like to mention that once we did something¹⁶ in the context of solute transport through a membrane channel, which in retrospect was closely related to milestoning. We considered two models of solute transport through a membrane channel, one, where a solute diffuses through the channel in the presence of a potential of mean force, and the other, where the solute simply jumps between two sites located at the channel ends. We showed that the fluxes calculated within the framework of these models are identical when the jump rates between the two states are chosen as the reciprocal of the MFPT’s to diffuse from one end of the channel to the other. In this way, we provided a microscopic interpretation of the phenomenological rate constants of the widely used two-site model of the solute dynamics in the channel.

ACKNOWLEDGMENTS

We have benefitted from discussions with Ron Elber, Gerhard Hummer, Eric Vanden-Eijnden, and Vladimir Zitserman. We thank Robert Best for help with the manuscript. This study was supported by the Intramural Research Program of the NIH, Center for Information Technology and National Institute of Diabetes and Digestive and Kidney Diseases.

APPENDIX: ALGEBRAIC DERIVATION OF THE RELATION AMONG THE MFPT, FLUX, AND COMMITTOR

Here we will algebraically derive the relation among the MFPT, $τ (a \to b)$ , the equilibrium-averaged committor, $⟨φ_{a}⟩$ , and the equilibrium unidirectional flux, J_ab.

Consider a particle diffusing in a potential U(x), $U (x \to \pm \infty) = \infty$ , with position-dependent diffusivity, D(x). The probability density of finding the particle at point x at time t, given that it was initially (at t = 0) at point x₀, $p (x, t | x_{0}, 0)$ , satisfies the Smoluchowski equation

\frac{\partial p}{\partial t} = L_{x} p = \frac{\partial}{\partial x} [D (x) e^{- β U (x)} \frac{\partial}{\partial x} (e^{β U (x)} p)]

(A1)

subject to the initial condition $p (x, 0 | x_{0}, 0) = δ (x - x_{0})$ . It follows from the detailed balance condition that $p (x, t | x_{0}, 0) = p (x_{0}, t | x, 0) e^{β (U (x_{0}) - U (x))}$ . Substituting this into Eq. (A1) and interchanging the notations of the variables $x \leftrightarrow x_{0}$ , one finds that $p (x, t | x_{0}, 0)$ satisfies

\frac{\partial p}{\partial t} = L_{x_{0}}^{+} p = e^{β U (x_{0})} \frac{\partial}{\partial x_{0}} [D (x_{0}) e^{- β U (x_{0})} \frac{\partial p}{\partial x_{0}}],

(A2)

which is the adjoint or backward Smoluchowski equation.

Let us now choose point x = b to be an absorbing boundary, i.e., $p (b, t | x_{0}, 0) = 0$ . The survival probability of a particle starting from x₀, x₀ < b, at t = 0, denoted by $S (t | x_{0})$ , is defined as

S (t | x_{0}) = \int_{- \infty}^{b} p (x, t | x_{0}, 0) d x .

(A3)

Integrating both sides of Eq. (A2) over x from −∞ to b, we find that the survival probability satisfies

\frac{\partial S (t | x_{0})}{\partial t} = L_{x_{0}}^{+} S (t | x_{0}) = e^{β U (x_{0})} \frac{\partial}{\partial x_{0}} [D (x_{0}) e^{- β U (x_{0})} \frac{\partial S (t | x_{0})}{\partial x_{0}}]

(A4)

subject to the initial condition $S (0 | x_{0}) = 1$ . The mean particle lifetime is the MFPT from point x₀ to point b, $τ (x_{0} \to b)$ ,

τ (x_{0} \to b) = \int_{0}^{\infty} t (- \frac{d S (t | x_{0})}{d t}) d t = \int_{0}^{\infty} S (t | x_{0}) d t .

(A5)

Integrating both sides of Eq. (A4) over time from 0 to ∞ and using the facts that $S (\infty | x_{0}) = 0$ and $S (0 | x_{0}) = 1$ , we find that the MFPT satisfies

L_{x_{0}}^{+} τ (x_{0} \to b) = e^{β U (x_{0})} \frac{\partial}{\partial x_{0}} [D (x_{0}) e^{- β U (x_{0})} \frac{\partial τ (x_{0} \to b)}{\partial x_{0}}] = - 1 .

(A6)

This must be solved subject to the boundary condition $τ (b \to b) = 0$ .

Multiplying both sides of Eq. (A6) by $\exp (- β U (x_{0}))$ , integrating over x₀ from −∞ to x, where $x < b$ , and using the fact that $U (- \infty) = \infty$ , we obtain

D (x) e^{- β U (x)} \frac{\partial τ (x \to b)}{\partial x} = - \int_{- \infty}^{x} e^{- β U (y)} d y .

(A7)

Dividing both sides by $D (x) \exp (- β U (x))$ , integrating the resulting equation from x to b, and using the fact that $τ (b \to b) = 0$ , we arrive at

τ (x \to b) = \int_{x}^{b} e^{β U (z)} \frac{d z}{D (z)} \int_{- \infty}^{z} e^{- β U (y)} d y .

(A8a)

Similarly, for the MFPT from x to a, $x \geq a$ , we have

τ (x \to a) = \int_{a}^{x} e^{β U (z)} \frac{d z}{D (z)} \int_{z}^{\infty} e^{- β U (y)} d y .

(A8b)

According to Eq. (2.5), the equilibrium unidirectional flux J_ab between milestones a and b is $J_{a b} = {(τ (a \to b) + τ (b \to a))}^{- 1}$ . Using Eqs. (A8a) and (A8b), it can be shown that this flux is given by

J_{a b} = \frac{1}{τ (a \to b) + τ (b \to a)} = \frac{1}{(\int_{a}^{b} e^{β U (z)} \frac{d z}{D (z)}) (\int_{- \infty}^{\infty} e^{- β U (y)} d y)} .

(A9)

This result can also be obtained from

J_{a b} = - D (x) e^{- β U (x)} \frac{\partial}{\partial x} (e^{β U (x)} p (x)), a \leq x \leq b,

(A10)

where the probability density p(x) at the end points is $p (a) = \exp (- β U (a)) / \int_{- \infty}^{\infty} \exp (- β U (x)) d x$ and p(b) = 0, i.e., milestone b is absorbing and point a is kept at equilibrium. Dividing both sides of Eq. (A10) by $D (x) \exp (- β U (x))$ and then integrating both sides of the resulting equation from a to b, we recover the result for the flux in Eq. (A9).

The committor (splitting probability) $φ (x \to a)$ , $a \leq x \leq b$ , satisfies the Onsager equation, $L_{x}^{+} φ (x \to a) = 0$ , subject to the boundary conditions $φ (a \to a) = 1$ and $φ (b \to a) = 0$ . Using $L_{x}^{+}$ , defined in Eq. (A2), we can write the Onsager equation as

\frac{d}{d x} [D (x) e^{- β U (x)} \frac{d φ (x \to a)}{d x}] = 0 .

(A11)

Consequently, we have

D (x) e^{- β U (x)} \frac{d φ (x \to a)}{d x} = C,

(A12)

where C is a constant to be determined using the boundary conditions at x = a and x = b. Dividing both sides by $D (x) \exp (- β U (x))$ , integrating both sides of the resulting equation from x to b, and using the boundary condition at point b, $φ (b \to a) = 0$ , we arrive at

φ (x \to a) = C \int_{x}^{b} e^{β U (z)} \frac{d z}{D (z)} .

(A13)

Using the boundary condition at point a, $φ (a \to a) = 1$ , to find C, we finally obtain

φ (x \to a) = \frac{\int_{x}^{b} e^{β U (z)} \frac{d z}{D (z)}}{\int_{a}^{b} e^{β U (z)} \frac{d z}{D (z)}}, a \leq x \leq b .

(A14)

Now we are ready to calculate $⟨φ_{a}⟩$ , where φ_a(x) is given in Eq. (2.10a),

\begin{matrix} ⟨φ_{a}⟩ & = & [\int_{- \infty}^{a} e^{- β U (x)} d x + \int_{a}^{b} φ (x \to a) e^{- β U (x)} d x] / \int_{- \infty}^{\infty} e^{- β U (y)} d y \\ = & J_{a b} [(\int_{a}^{b} e^{β U (z)} \frac{d z}{D (z)}) (\int_{- \infty}^{a} e^{- β U (x)} d x) \\ + \int_{a}^{b} (\int_{x}^{b} e^{β U (z)} \frac{d z}{D (z)}) e^{- β U (x)} d x] . \end{matrix}

(A15)

In the last step, we used Eq. (A9) for J_ab and Eq. (A14) for $φ (x \to a)$ . The sum in the square brackets is $τ (a \to b)$ . To see this, one has to change the order of integration in the second term in the square brackets, write the sum of integrals as a single double integral, and then compare with $τ (a \to b)$ obtained from Eq. (A8a). Thus, we have derived the result for $τ (a \to b)$ in Eq. (2.12), $τ (a \to b) = ⟨φ_{a}⟩ / J_{a b}$ .

This can be done simpler in a way that can be readily generalized to many dimensions. Multiplying both sides of Eq. (A6) by $φ_{a} (x_{0}) \exp (- β U (x_{0}))$ , where $φ_{a} (x)$ is given in Eq. (2.10a), and integrating both sides over all x₀, one has

\begin{matrix} \int_{- \infty}^{\infty} φ_{a} (x_{0}) \frac{\partial}{\partial x_{0}} [D (x_{0}) e^{- β U (x_{0})} \frac{\partial τ (x_{0} \to b)}{\partial x_{0}}] d x_{0} \\ = - \int_{- \infty}^{\infty} φ_{a} (x_{0}) e^{- β U (x_{0})} d x_{0} . \end{matrix}

(A16)

Integrating the left-hand side by parts and taking advantage of the fact that $U (\pm \infty) = \infty$ , we find

\begin{matrix} \int_{- \infty}^{\infty} (D (x_{0}) e^{- β U (x_{0})} \frac{\partial φ_{a} (x_{0})}{\partial x_{0}}) \frac{\partial τ (x_{0} \to b)}{\partial x_{0}} d x_{0} = ⟨φ_{a}⟩ \int_{- \infty}^{\infty} e^{- β U (y)} d y, \end{matrix}

(A17)

where we have used the definition of the equilibrium average, $⟨φ_{a}⟩$ , in Eq. (2.11). Since the derivative of $φ_{a} (x)$ is $\partial φ (x_{0} \to a) / \partial x_{0}$ inside the interval $a < x_{0} < b$ and zero outside, Eq. (A17) becomes

\begin{matrix} \int_{a}^{b} (D (x_{0}) e^{- β U (x_{0})} \frac{\partial φ (x_{0} \to a)}{\partial x_{0}}) \frac{\partial τ (x_{0} \to b)}{\partial x_{0}} d x_{0} \\ = ⟨φ_{a}⟩ \int_{- \infty}^{\infty} e^{- β U (y)} d y . \end{matrix}

(A18)

Integrating the right-hand side by parts, using Eq. (A11) and the fact that $τ (b \to b) = 0$ , we obtain

\begin{matrix} - D (a) e^{- β U (a)} {\frac{\partial φ (x_{0} \to a)}{\partial x_{0}}|}_{x_{0} = a} τ (a \to b) = ⟨φ_{a}⟩ \int_{- \infty}^{\infty} e^{- β U (y)} d y . \end{matrix}

(A19)

By differentiating Eq. (A14) for $φ (x_{0} \to a)$ and using the expression for the flux J_ab in Eq. (A9), one can see that the factor in front of $τ (a \to b)$ on the left-hand side of the above equation is the product $J_{a b} \int_{- \infty}^{\infty} e^{- β U (y)} d y$ . Thus, we recover the expression for $τ (a \to b)$ in Eq. (2.12), $τ (a \to b) = ⟨φ_{a}⟩ / J_{a b}$ .

Let us now generalize this to many dimensions with two milestones that are infinite non-intersecting surfaces $Σ_{a}$ and $Σ_{b}$ that divide the space into three regions: $Ω_{a}$ (left of $Σ_{a}$ ), $Ω_{a b}$ (between $Σ_{a}$ and $Σ_{b}$ ), and $Ω_{b}$ (right of $Σ_{b}$ ). The multidimensional generalization of Eq. (A16) is

\begin{matrix} \int φ_{a} (x_{0}) \nabla \cdot D (x_{0}) e^{- β U (x_{0})} \cdot \nabla τ (x_{0} \to Σ_{b}) d x_{0} \\ = - \int φ_{a} (x_{0}) e^{- β U (x_{0})} d x_{0} . \end{matrix}

(A20)

Here $φ_{a} (x_{0}) = 1$ for $x_{0} \in Ω_{a}$ , $φ_{a} (x_{0}) = φ (x_{0} \to Σ_{a})$ for $x_{0} \in Ω_{a b}$ , and $φ_{a} (x_{0}) = 0$ for $x_{0} \in Ω_{b}$ is the multidimensional analog of $φ_{a} (x)$ in Eq. (2.10a), $D (x_{0})$ is the position-dependent diffusivity tensor at point x₀, and $τ (x_{0} \to Σ_{b})$ is the MFPT from point x₀ located in regions $Ω_{a}$ or $Ω_{a b}$ to the surface $Σ_{b}$ . Integrating the left-hand side of Eq. (A20) by parts and using the fact that the surface term vanishes, we obtain

\begin{matrix} \int \nabla φ (x_{0}) \cdot D (x_{0}) e^{- β U (x_{0})} \cdot \nabla τ (x_{0} \to Σ_{b}) d x_{0} = ⟨φ_{a}⟩ \int e^{- β U (y)} d y, \end{matrix}

(A21)

where $⟨φ_{a}⟩$ is the Boltzmann average of $φ_{a} (x)$ analogous to that in Eq. (2.11). Equation (A21) is the multidimensional generalization of Eq. (A17).

Now $\nabla φ_{a} (x_{0})$ is non-zero only in the region $Ω_{a b}$ between the two surfaces, where $φ_{a} (x_{0}) = φ (x_{0} \to Σ_{a})$ . Thus Eq. (A21) becomes

\begin{matrix} \int_{Ω_{a b}} \nabla φ (x_{0} \to Σ_{a}) \cdot D (x_{0}) e^{- β U (x_{0})} \cdot \nabla τ (x_{0} \to Σ_{b}) d x_{0} \\ = ⟨φ_{a}⟩ \int e^{- β U (y)} d y . \end{matrix}

(A22)

If we integrate the left-hand side by parts and apply Gauss’s divergence theorem, only the integral over the surface $Σ_{a}$ survives because ${τ (x \to Σ_{b})|}_{x \in Σ_{b}} = 0$ and $φ (x_{0} \to Σ_{a})$ satisfies, $\nabla \cdot D (x_{0}) e^{- β U (x_{0})} \cdot \nabla φ (x_{0} \to Σ_{a}) = 0$ , which is the multidimensional version of the Onsager equation in Eq. (A11). As a result, Eq. (A22) reduces to

\begin{matrix} - \int_{Σ_{a}} [n (x_{0}) \cdot D (x_{0}) e^{- β U (x_{0})} \cdot \nabla φ (x_{0} \to Σ_{a})] τ (x_{0} \to Σ_{b}) d x_{0} \\ = ⟨φ_{a}⟩ \int e^{- β U (y)} d y, \end{matrix}

(A23)

where $n (x_{0})$ is a unit vector perpendicular to the surface $Σ_{a}$ , pointing towards $Σ_{b}$ .

It can be shown that the total unidirectional equilibrium flux $J_{a b}$ between the two surfaces is given by¹⁷

\begin{matrix} J_{a b} = - \int_{Σ_{a}} n (x_{0}) \cdot D (x_{0}) e^{- β U (x_{0})} \cdot \nabla φ (x_{0} \to Σ_{a}) d x_{0} / \int e^{- β U (y)} d y . \end{matrix}

(A24)

Using this, Eq. (A23) can be written as

τ (Σ_{a} \to Σ_{b}) = \frac{⟨φ_{a}⟩}{J_{a b}} .

(A25)

Here $τ (Σ_{a} \to Σ_{b})$ is the MFPT between the two surfaces defined by

τ (Σ_{a} \to Σ_{b}) = \int_{Σ_{a}} p_{f l u x} (x_{0}) τ (x_{0} \to Σ_{b}) d x_{0},

(A26)

where $p_{f l u x} (x_{0}), x_{0} \in Σ_{a}$ , defined as

p_{f l u x} (x_{0}) = \frac{n (x_{0}) \cdot D (x_{0}) e^{- β U (x_{0})} \cdot \nabla φ (x_{0} \to Σ_{a})}{\int_{Σ_{a}} n (x_{0}) \cdot D (x_{0}) e^{- β U (x_{0})} \cdot \nabla φ (x_{0} \to Σ_{a}) d x_{0}},

(A27)

is the normalized distribution of the unidirectional flux from surface $Σ_{a}$ to surface $Σ_{b}$ passing through the point $x_{0}$ on surface $Σ_{a}$ at equilibrium.

Note: This article is part of the Special Topic “Markov Models of Molecular Kinetics” in J. Chem. Phys.

REFERENCES

1.Bolhuis P. G., Chandler D., Dellago C., and Geissler P. L., Annu. Rev. Phys. Chem. 53, 291 (2002). 10.1146/annurev.physchem.53.082301.113146 [DOI] [PubMed] [Google Scholar]
2.Hummer G., J. Chem. Phys. 120, 516 (2004). 10.1063/1.1630572 [DOI] [PubMed] [Google Scholar]
3.Vanden-Eijnden E. and W. E., Annu. Rev. Phys. Chem. 61, 391 (2010). 10.1146/annurev.physchem.040808.090412 [DOI] [PubMed] [Google Scholar]
4.Faradjian A. K. and Elber R., J. Chem. Phys. 120, 10880 (2004). 10.1063/1.1738640 [DOI] [PubMed] [Google Scholar]
5.Vanden-Eijnden E., Venturoli M., Ciccotti G., and Elber R., J. Chem. Phys. 129, 174102 (2008). 10.1063/1.2996509 [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Elber R., Q. Rev. Biophys. 50, e8 (2017). 10.1017/s0033583517000063 [DOI] [PubMed] [Google Scholar]
7.Hinczewski M., von Hansen Y., Dzubiella J., and Netz R. R., J. Chem. Phys. 132, 245103 (2010). 10.1063/1.3442716 [DOI] [PubMed] [Google Scholar]
8.Vanden-Eijnden E. and Venturoli M., J. Chem. Phys. 131, 044120 (2009). 10.1063/1.3180821 [DOI] [PubMed] [Google Scholar]
9.Buchete V. and Hummer G., J. Phys. Chem. B 112, 6057 (2008). 10.1021/jp0761665 [DOI] [PubMed] [Google Scholar]
10.van Erp T. S., Moroni D., and Bolhuis P. G., J. Chem. Phys. 118, 7762 (2003). 10.1063/1.1562614 [DOI] [PubMed] [Google Scholar]
11.Cabriolu R., Skjelbred Refsnes K. M., Bolhuis P. G., and van Erp T. S., J. Chem. Phys. 147, 152722 (2017). 10.1063/1.4989844 [DOI] [PubMed] [Google Scholar]
12.Kramers H., Physica 7, 284 (1940). 10.1016/s0031-8914(40)90098-2 [DOI] [Google Scholar]
13.Bello-Rivas J. M. and Elber R., J. Chem. Phys. 142, 094102 (2015). 10.1063/1.4913399 [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Aristoff D., Bello-Rivas J. M., and Elber R., Multiscale Model. Simul. 14, 301 (2016). 10.1137/15m102157x [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Lin L., Lu J. F., and Vanden-Eijnden E., Commun. Pure Appl. Math. 71, 1149 (2018). 10.1002/cpa.21725 [DOI] [Google Scholar]
16.Bezrukov S. M., Berezhkovskii A. M., and Szabo A., J. Chem. Phys. 127, 115101 (2007). 10.1063/1.2766720 [DOI] [PubMed] [Google Scholar]
17.Berezhkovskii A. M. and Szabo A., J. Phys. Chem. B 117, 13115 (2013). 10.1021/jp403043a [DOI] [PMC free article] [PubMed] [Google Scholar]

[c1] 1.Bolhuis P. G., Chandler D., Dellago C., and Geissler P. L., Annu. Rev. Phys. Chem. 53, 291 (2002). 10.1146/annurev.physchem.53.082301.113146 [DOI] [PubMed] [Google Scholar]

[c2] 2.Hummer G., J. Chem. Phys. 120, 516 (2004). 10.1063/1.1630572 [DOI] [PubMed] [Google Scholar]

[c3] 3.Vanden-Eijnden E. and W. E., Annu. Rev. Phys. Chem. 61, 391 (2010). 10.1146/annurev.physchem.040808.090412 [DOI] [PubMed] [Google Scholar]

[c4] 4.Faradjian A. K. and Elber R., J. Chem. Phys. 120, 10880 (2004). 10.1063/1.1738640 [DOI] [PubMed] [Google Scholar]

[c5] 5.Vanden-Eijnden E., Venturoli M., Ciccotti G., and Elber R., J. Chem. Phys. 129, 174102 (2008). 10.1063/1.2996509 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c6] 6.Elber R., Q. Rev. Biophys. 50, e8 (2017). 10.1017/s0033583517000063 [DOI] [PubMed] [Google Scholar]

[c7] 7.Hinczewski M., von Hansen Y., Dzubiella J., and Netz R. R., J. Chem. Phys. 132, 245103 (2010). 10.1063/1.3442716 [DOI] [PubMed] [Google Scholar]

[c8] 8.Vanden-Eijnden E. and Venturoli M., J. Chem. Phys. 131, 044120 (2009). 10.1063/1.3180821 [DOI] [PubMed] [Google Scholar]

[c9] 9.Buchete V. and Hummer G., J. Phys. Chem. B 112, 6057 (2008). 10.1021/jp0761665 [DOI] [PubMed] [Google Scholar]

[c10] 10.van Erp T. S., Moroni D., and Bolhuis P. G., J. Chem. Phys. 118, 7762 (2003). 10.1063/1.1562614 [DOI] [PubMed] [Google Scholar]

[c11] 11.Cabriolu R., Skjelbred Refsnes K. M., Bolhuis P. G., and van Erp T. S., J. Chem. Phys. 147, 152722 (2017). 10.1063/1.4989844 [DOI] [PubMed] [Google Scholar]

[c12] 12.Kramers H., Physica 7, 284 (1940). 10.1016/s0031-8914(40)90098-2 [DOI] [Google Scholar]

[c13] 13.Bello-Rivas J. M. and Elber R., J. Chem. Phys. 142, 094102 (2015). 10.1063/1.4913399 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c14] 14.Aristoff D., Bello-Rivas J. M., and Elber R., Multiscale Model. Simul. 14, 301 (2016). 10.1137/15m102157x [DOI] [PMC free article] [PubMed] [Google Scholar]

[c15] 15.Lin L., Lu J. F., and Vanden-Eijnden E., Commun. Pure Appl. Math. 71, 1149 (2018). 10.1002/cpa.21725 [DOI] [Google Scholar]

[c16] 16.Bezrukov S. M., Berezhkovskii A. M., and Szabo A., J. Chem. Phys. 127, 115101 (2007). 10.1063/1.2766720 [DOI] [PubMed] [Google Scholar]

[c17] 17.Berezhkovskii A. M. and Szabo A., J. Phys. Chem. B 117, 13115 (2013). 10.1021/jp403043a [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Committors, first-passage times, fluxes, Markov states, milestones, and all that

Alexander M Berezhkovskii

Attila Szabo

Abstract

I. INTRODUCTION

II. TWO MILESTONES

A. Coloring an equilibrium trajectory

FIG. 1.

B. Microscopic interpretation of equilibrium populations of states A and B

C. MFPT’s, rate constants, and equilibrium unidirectional flux

D. Relation to Kramers theory

III. THREE MILESTONES

A. Coloring an equilibrium trajectory

FIG. 2.

B. Equilibrium populations and MFPT’s

C. Microscopic interpretation of “equilibrium populations”

D. Unidirectional fluxes and rate constants

IV. CONCLUDING REMARKS

ACKNOWLEDGMENTS

APPENDIX: ALGEBRAIC DERIVATION OF THE RELATION AMONG THE MFPT, FLUX, AND COMMITTOR

REFERENCES

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Committors, first-passage times, fluxes, Markov states, milestones, and all that

Alexander M Berezhkovskii

Attila Szabo

Abstract

I. INTRODUCTION

II. TWO MILESTONES

A. Coloring an equilibrium trajectory

FIG. 1.

B. Microscopic interpretation of equilibrium populations of states A and B

C. MFPT’s, rate constants, and equilibrium unidirectional flux

D. Relation to Kramers theory

III. THREE MILESTONES

A. Coloring an equilibrium trajectory

FIG. 2.

B. Equilibrium populations and MFPT’s

C. Microscopic interpretation of “equilibrium populations”

D. Unidirectional fluxes and rate constants

IV. CONCLUDING REMARKS

ACKNOWLEDGMENTS

APPENDIX: ALGEBRAIC DERIVATION OF THE RELATION AMONG THE MFPT, FLUX, AND COMMITTOR

REFERENCES

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases