Augmented transition path theory for sequences of events

Chatipat Lorpaiboon; Jonathan Weare; Aaron R Dinner

doi:10.1063/5.0098587

. 2022 Sep 7;157(9):094115. doi: 10.1063/5.0098587

Augmented transition path theory for sequences of events

Chatipat Lorpaiboon ¹, Jonathan Weare ², Aaron R Dinner ^1,^a)

PMCID: PMC9458294 PMID: 36075728

Abstract

Transition path theory provides a statistical description of the dynamics of a reaction in terms of local spatial quantities. In its original formulation, it is limited to reactions that consist of trajectories flowing from a reactant set A to a product set B. We extend the basic concepts and principles of transition path theory to reactions in which trajectories exhibit a specified sequence of events and illustrate the utility of this generalization on examples.

I. INTRODUCTION

Many reactions studied today proceed through competing pathways. Understanding such reactions relies on being able to assess the relative importance of the competing pathways and how they contribute to overall rates. When the pathways are well separated, they can be treated independently, often by traditional theories that assume a well-defined activated complex (transition state) and a simple form for the underlying (free) energy landscape governing the dynamics.^1,2 However, when the (observed) dynamics are stochastic, the pathways of reactions often overlap in configuration space. Approaches that treat competing pathways in a unified fashion are thus needed.

To this end, here, we build on transition path theory (TPT).^3–7 The core idea of transition path theory is that the statistics of the ensemble of reactive trajectories can be related to quantities that are local in space: probability currents of reactive trajectories (henceforth, reactive currents) and committors. These quantities enable TPT to go beyond traditional theories by providing information about mechanisms. Reactive currents quantify flows in phase space. Committors, which are probabilities of reaching one metastable state before another, by definition characterize the progress of stochastic reactions.⁸

In its traditional formulation, TPT focuses on transitions between two metastable states. In the present paper, we extend TPT to compute statistics for sequences of events, and we show how this significantly expands its applicability. Our work builds on but goes beyond previous studies. It is closely related to history-augmented Markov state models, in which states are labeled based on the last metastable state visited.⁹ Separating the ensemble of reactive trajectories using these labels enables rates to be computed from the flux into a metastable state^10,11 as well as reactive currents and committors from the underlying trajectories.¹² Our approach generalizes the labeling strategy to sequences of arbitrary numbers of states and allows specification of not just past events but also future ones.

Our work also has connections to that of Koltai and co-workers, who extended TPT to allow trajectories to leave and enter the region connecting the metastable states to analyze trajectory segments from satellite data for drifters in the ocean.¹³ Specifically, they redefined committors to exclude trajectories that were not wholly within this region and noted that this approach could be used to exclude trajectories that pass through selected states. They also considered computing statistics for trajectories beginning and ending in specific portions of metastable states. Both of these developments allow statistics to be computed for subsets of reactive trajectories based on the states that they visit.

We present our work as follows. First, in Secs. II A and II B, we review how TPT expresses path statistics in terms of spatially and temporally local quantities using committors. In Sec. II C, we present a motivating example in which this is not possible within the existing framework, but it can be made possible by augmenting the stochastic process with labels that account for sequences of events. This is the key idea of the paper. While this idea is straightforward, formulating the theory in full generality requires some technical development, and readers may wish to skim Secs. II D and IV initially, focusing on the brief summaries in the first paragraphs of Secs. II D and IV. In Sec. II D, we discuss conditions of consistency and Markovianity that must be satisfied for TPT to apply to the augmented process. We also generalize committors and integrals based on them. In Sec. III, we review the most commonly computed TPT statistics and show how they can be computed in our augmented TPT framework. In Sec. IV, we introduce a procedure for constructing the augmented process from pairs of successive time points rather than full trajectory segments, and we show how processes can be composed to construct more complex ones. We summarize the operational procedure in Sec. V. Then, in Sec. VI, we illustrate our approach on two systems with multiple pathways and intermediates. In Sec. VII, possible extensions and numerical strategies for treating more complicated systems are discussed. In the Appendix, we provide a method for calculating augmented TPT statistics using a finite difference scheme. Code implementing this method is available at github.com/dinner-group/atpt.

II. FRAMEWORK

In this section, we review TPT to show how it casts statistics for reactive trajectories in terms of local quantities. Then, we present an example that cannot be treated within the traditional TPT framework and show how it can be treated by introducing an augmented process. The essential idea is that the augmented process accounts for the order of events. The challenge in implementing this idea is that, for a finite-length trajectory segment, we generally do not know the events that occur before and after it.

For clarity, we present our results in terms of a discrete-time Markov process X_t with time step Δ, but our results generalize to continuous-time processes in the limit Δ → 0. We denote the time interval r, r + Δ, …, s by r:s and a trajectory segment on this time interval by X_r:s = (X_r, X_r+Δ, …, X_s). For conciseness, we denote an infinite trajectory X_−∞:∞ by X.

A. Ensemble of reactive trajectories

In both traditional and augmented TPT, statistics are computed over the ensemble of reactive trajectories. In this section, we define this ensemble and integrals over it. Here, we focus on traditional TPT, but the framework generalizes to augmented TPT immediately once we define the augmented process in Sec. II D.

Traditional TPT considers a reaction from a set A to a set B via trajectories that cross a region D. In anticipation of our augmented framework, we allow D ⊆ (A ∪ B)^c as in Ref. 13. We consider a trajectory X_r:s to be reactive if its first time point X_r is in the reactant set A, its last time point X_s is in the product set B, and all intervening time points X_r+Δ:s−Δ are in the region D. Mathematically, we implement this definition through the indicator function

ω (X_{r : s}) = 1_{A \times D \times \dots \times D \times B} (X_{r : s}),

(1)

where

1_{S} (x) = \{\begin{cases} 1 & if x \in S, \\ 0 & otherwise, \end{cases}

(2)

and $S_{1} \times \dots \times S_{n} = \{(x_{1}, \dots, x_{n}) ∣ x_{1} \in S_{1}, \dots, x_{n} \in S_{n}\}$ is the n-fold Cartesian product.

Given (1), we define the integral over the ensemble of reactive trajectories to be

I_{ω}^{X} [η] = \lim_{T \to \infty} \frac{1}{2 T} I^{X} [\sum_{\begin{array}{c} r = - T : T - Δ \\ s = r + Δ : T \end{array}} ω (X_{r : s}) η (X_{r : s})],

(3)

where I^X[f(X)] is the integral of f(X) over the distribution of infinite trajectories X, which we denote using the superscript X. When X_t is a stationary ergodic process and I^X is the expectation E^X over the distribution of infinite trajectories, as in traditional TPT, we can compute $I_{ω}^{X} [η]$ from a single infinite trajectory and so I^X can be omitted; however, this is not necessarily true for time-dependent processes, as in Ref. 14, or for augmented processes, as in this work. As X_t is a Markov process, we can compute this integral by sampling configurations X_−T from the distribution of states at time −T and propagating until time T. The prefactor 1/(2T) ensures that $I_{ω}^{X} [η]$ gives consistent results across different trajectory lengths 2T. We can then calculate expectations over the ensemble of reactive trajectories as

E_{ω}^{X} [η] = I_{ω}^{X} [η] / I_{ω}^{X} [1],

(4)

where the normalization factor $I_{ω}^{X} [1]$ is the expected number of reactive trajectories that start (or end) per unit time. The integral $I_{ω}^{X} [η]$ , thus, yields statistics that can be used to characterize and compare reaction pathways.

B. Transition path theory

In general, the ensemble of reactive trajectories can only be meaningfully interpreted through its statistics. Although these statistics can be computed directly from the ensemble of reactive trajectories, TPT enables them to be computed from other data as well by expressing them in terms of spatially and temporally local quantities.

TPT specifically considers functions that can be written as

η (X_{r : s}) = \sum_{t = r : s - Δ} γ (X_{t : t + Δ}) Δ,

(5)

where γ(X_t:t+Δ) is a function of successive time points X_t and X_t+Δ. In this case, substituting (5) into (3) and exchanging the order of the sums yields

I_{ω}^{X} [η] = \lim_{T \to \infty} \frac{Δ}{2 T} \sum_{t = - T : T - Δ} I^{X} [\sum_{\begin{array}{c} r = - T : t \\ s = t + Δ : T \end{array}} ω (X_{r : s}) γ (X_{t : t + Δ})]

(6)

= I^{X, t} [\sum_{\begin{array}{c} r = - \infty : t \\ s = t + Δ : \infty \end{array}} ω (X_{r : s}) γ (X_{t : t + Δ})],

(7)

where from (6) to (7) we have taken the limit T → ∞ and performed a time average over t, which we denote by the superscript t. That is,

I^{X, t} [f (X, t)] = \lim_{T \to \infty} \frac{Δ}{2 T} \sum_{t = - T : T - Δ} I^{X} [f (X, t)] .

(8)

We can, then, factor

\sum_{\begin{array}{c} r = - \infty : t \\ s = t + Δ : \infty \end{array}} ω (X_{r : s}) = 1_{A} (X_{τ_{-} (t)}) 1_{B} (X_{τ_{+} (t + Δ)}),

(9)

where

τ_{-} (t) = \max \{t^{'} \leq t ∣ X_{t^{'}} \in D^{c}\},

(10)

τ_{+} (t) = \min \{t^{'} \geq t ∣ X_{t^{'}} \in D^{c}\}

(11)

are the last exit time from D^c and the first entrance time to D^c, respectively. Equation (9) results from the identities

\sum_{r = - \infty : t} 1_{A \times D \times \dots \times D} (X_{r : t}) = 1_{A} (X_{τ_{-} (t)}),

(12)

\sum_{s = t : \infty} 1_{D \times \dots \times D \times B} (X_{t : s}) = 1_{B} (X_{τ_{+} (t)}) .

(13)

We arrive at (13) by observing that only one term in the sum can be nonzero: Because D and B are disjoint, 1_D×⋯×D×B(X_t:s) can be nonzero only when s = τ₊ (t) is the first time t′ ≥ t that X_t′ ∉ D. Similar logic applies for (12).

Consequently, for a Markov process, (7) can be expressed in terms of only local quantities as follows:

I_{ω}^{X} [η] = I^{X, t} [1_{A} (X_{τ_{-} (t)}) 1_{B} (X_{τ_{+} (t + Δ)}) γ (X_{t : t + Δ})]

(14)

= I^{X, t} [q_{-} (X_{t}) q_{+} (X_{t + Δ}) γ (X_{t : t + Δ})],

(15)

where we have defined the backward and forward committors, respectively, as

q_{-} (X_{t}) = E^{X} [1_{A} (X_{τ_{-} (t)}) ∣ X_{t}],

(16)

q_{+} (X_{t}) = E^{X} [1_{B} (X_{τ_{+} (t)}) ∣ X_{t}] .

(17)

The backward committor q₋(X_t) is the probability that X_t last came from A rather than (A ∪ D)^c, and the forward committor q₊(X_t) is the probability that X_t will go to B before (B ∪ D)^c.

The main result of this section is (15). The advantage of (15) over (3) is that the former involves only statistics that are local in space and time. This aids in the interpretation of these statistics, and it enables their estimation from short trajectories X_t:t+Δ, thus eliminating the need for trajectories that actually cross from A to B.

C. A motivating reaction

To motivate our augmented framework, we consider a reaction with an intermediate state C and compute statistics only for reactive trajectories that proceed through the intermediate. The function that selects trajectories of interest is

\begin{align} ω (X_{r : s}) & = \sum_{\begin{array}{c} t_{1} = r + Δ : s - Δ \\ t_{2} = t_{1} : s - Δ \end{array}} [1_{A \times D \times \dots \times D} (X_{r : t_{1} - Δ}) \\ \times 1_{C \times (C \cup D) \times \dots \times (C \cup D) \times C} (X_{t_{1} : t_{2}}) \\ \times 1_{D \times \dots \times D \times B} (X_{t_{2} + Δ : s})], \end{align}

(18)

where D = (A ∪ B ∪ C)^c, and the sum allows for any t₁ and t₂ satisfying r < t₁ ≤ t₂ < s. The sum searches for times t₁ and t₂, which are the first and last times that the trajectory is in C, respectively. Because determining t₁ and t₂ requires a search over the entire trajectory X_r:s, we cannot factor ω(X_r:s) as in (9).

Now suppose that, for each reactive trajectory segment X_r:s in the infinite trajectory X, we have a process Y_t with

Y_{t} = \{\begin{cases} 0 & if t \leq r, \\ 1 & if r < t < t_{1}, \\ 2 & if t_{1} \leq t \leq t_{2}, \\ 3 & if t_{2} < t < s, \\ 4 & if s \leq t . \end{cases}

(19)

An example reactive trajectory labeled with Y_t is shown in Fig. 1. We can apply TPT on the augmented process Z_t = (X_t, Y_t) because we can write (18) in the form of (1) as

ω (Z_{r : s}) = 1_{(A \times \{0\}) \times D^{'} \times \dots \times D^{'} \times (B \times \{4\})} (Z_{r : s}),

(20)

where $D^{'} = (D \times \{1\}) \cup ((C \cup D) \times \{2\}) \cup (D \times \{3\})$ .

FIG. 1. — Values of Y_t at each point of a reactive trajectory described by (19).

This approach suggests a general strategy. We identify events—in this case, the first time t₁ and last time t₂ that X_t is in C—and define a process Y_t that labels these events. Then, we define reactive trajectories on the augmented state space using (1). So long as Z_t satisfies the assumptions behind TPT, we can express statistics using local quantities in the same manner as in (15). In Sec. II D, we discuss conditions for this to be the case, allowing for the possibility that an infinite trajectory has multiple labelings (e.g., to account for multiple finite reactive segments).

D. Augmented transition path theory

In this section, we introduce a function Ω(Y|X) for constructing an ensemble of trajectories augmented with labels from the distribution of trajectories X. We first consider the case of infinite length trajectories Z = (X, Y) and then the case of finite-length trajectories Z_r:s = (X_r:s, Y_r:s), to which one is limited in practice. We discuss two conditions that must hold for our framework. First, Ω(Y_r:s|X_r:s) must be consistent with Ω(Y|X). Second, Z_t must be Markovian. Later, in Sec. IV, we detail a specific construction of Y from X that requires examining only successive pairs of time points X_t:t+Δ and Y_t:t+Δ.

We now present our augmented framework. We replace X_t with the augmented process Z_t = (X_t, Y_t), where Y_t augments X_t with information about past and future events. Often, a single Y is associated with each infinite trajectory X because the latter contains full information about the past and future of any X_t. However, cases arise in which multiple Y can be associated with a given infinite trajectory X. For example, in the motivating reaction above, we define a Y for each reactive trajectory segment in X (i.e., we consider multiple r and s). It is, thus, necessary to consider a distribution of Y, and we compute integrals over the distribution of infinite trajectories Z as

I^{Z} [f (Z)] = I^{X} [\int Ω (Y | X) f (Z) d Y],

(21)

where Ω(Y|X) is the distribution of Y (and thus Z) for a given infinite trajectory X.

This immediately yields analogs of (15), (16), and (17),

I_{ω}^{Z} [η] = I^{Z, t} [q_{-} (Z_{t}) q_{+} (Z_{t + Δ}) γ (Z_{t : t + Δ})],

(22)

q_{-} (Z_{t}) = E^{Z} [1_{A} (Z_{τ_{-} (t)}) ∣ Z_{t}],

(23)

q_{+} (Z_{t}) = E^{Z} [1_{B} (Z_{τ_{+} (t)}) ∣ Z_{t}],

(24)

where sets A, B, and D are now defined on the augmented state space, and τ₋ (t) and τ₊ (t) are now on the augmented process.

However, we cannot yet evaluate these integrals and expectations because Y_t and, thus, each time point Z_t depends on the infinite trajectory X. Instead, we must convert integrals over Z to integrals over X using

I^{Z} [f (Z_{r : s})] = I^{X} [\int Ω (Y_{r : s} | X_{r : s}) f (Z_{r : s}) d Y_{r : s}],

(25)

where the weight of Y_r:s (and thus Z_r:s) given the trajectory segment X_r:s is

Ω (Y_{r : s} | X_{r : s}) = E^{X} [\iint Ω (Y | X) d Y_{- \infty : r - Δ} d Y_{s + Δ : \infty} | X_{r : s}] .

(26)

When a single Y is associated with each infinite trajectory X, Ω(Y_r:s|X_r:s) is the probability of Y_r:s given X_r:s and so ∫Ω(Y_r:s|X_r:s) dY_r:s = 1; this is not true in the general case. Equation (26) is the first requirement of our augmented framework: We must be able to convert integrals involving Ω(Y|X), which depend on the infinite trajectory X, to those involving Ω(Y_r:s|X_r:s), which depend only on the finite trajectory X_r:s.

Using (25), we can then write (22), (23), and (24) as

\begin{align} I_{ω}^{Z} [η] & = I^{X, t} [\int Ω (Y_{t : t + Δ} | X_{t : t + Δ}) q_{-} (Z_{t}) q_{+} (Z_{t + Δ}) \\ \times γ (Z_{t : t + Δ}) d Y_{t : t + Δ}], \end{align}

(27)

q_{-} (Z_{t}) = \frac{E^{X} [\int Ω (Y_{- \infty : t} | X_{- \infty : t}) 1_{A} (Z_{τ_{-} (t)}) d Y_{- \infty : t - Δ} ∣ X_{t}]}{E^{X} [\int Ω (Y_{- \infty : t} | X_{- \infty : t}) d Y_{- \infty : t - Δ} ∣ X_{t}]}

(28)

= \frac{E^{X} [\int Ω (Y_{- \infty : t} | X_{- \infty : t}) 1_{A} (Z_{τ_{-} (t)}) d Y_{- \infty : t - Δ} ∣ X_{t}]}{Ω (Y_{t} | X_{t})},

(29)

q_{+} (Z_{t}) = \frac{E^{X} [\int Ω (Y_{t : \infty} | X_{t : \infty}) 1_{B} (Z_{τ_{+} (t)}) d Y_{t + Δ : \infty} ∣ X_{t}]}{E^{X} [\int Ω (Y_{t : \infty} | X_{t : \infty}) d Y_{t + Δ : \infty} ∣ X_{t}]}

(30)

= \frac{E^{X} [\int Ω (Y_{t : \infty} | X_{t : \infty}) 1_{B} (Z_{τ_{+} (t)}) d Y_{t + Δ : \infty} ∣ X_{t}]}{Ω (Y_{t} | X_{t})} .

(31)

For q₋(Z_t) and q₊(Z_t), we excluded Y_t from the variables over which we integrate because we conditioned on it. We note that when ω(Z_r:s) has no dependence on Y_r:s [i.e., ω(Z_r:s) = ω(X_r:s)] and there is a one-to-one correspondence between X and Z [i.e., ∫Ω(Y|X) dY = 1], we can recover the traditional TPT committors as

q_{-} (X_{t}) = \int Ω (Y_{t} | X_{t}) q_{-} (Z_{t}) d Y_{t},

(32)

q_{+} (X_{t}) = \int Ω (Y_{t} | X_{t}) q_{+} (Z_{t}) d Y_{t} .

(33)

In traditional TPT, X_t must be a Markov process so that, from (14) to (15), we could take expectations of $1_{A} (X_{τ_{-} (t)})$ and $1_{B} (X_{τ_{+} (t + Δ)})$ to obtain committors q₋(X_t) and q₊(X_t+Δ). For the augmented process Z_t to be similarly treatable, we also require it to be a Markov process. This requirement may be surprising because Y_t can depend on the future of X_t. This can be understood by observing that, for the augmented process, the probability distribution of X_t+Δ depends on both X_t and Y_t. For example, for q₊(Z_t) in (24), the distribution of X_t+Δ:∞ conditioned on Z_t is not the same as that of X_t+Δ:∞ conditioned on X_t alone, since Y_t specifies that X_t must undergo certain events in the future.

Since X_t and Z_t are Markov processes, we can factor the path probabilities P^X[X_r:s] and P^Z[Z_r:s] of the original and augmented processes as follows:

P^{X} [X_{r : s}] = P^{X} [X_{r}] \prod_{t = r : s - Δ} P^{X} [X_{t + Δ} ∣ X_{t}]

(34)

= P^{X} [X_{r}] \prod_{t = r : s - Δ} \frac{P^{X} [X_{t : t + Δ}]}{P^{X} [X_{t}]},

(35)

P^{Z} [Z_{r : s}] = P^{Z} [Z_{r}] \prod_{t = r : s - Δ} P^{Z} [Z_{t + Δ} ∣ Z_{t}]

(36)

= P^{Z} [Z_{r}] \prod_{t = r : s - Δ} \frac{P^{Z} [Z_{t : t + Δ}]}{P^{Z} [Z_{t}]} .

(37)

As above, the superscripts indicate distributions of infinite trajectories. Thus, for example,

P^{X} [X_{r : s}] = \iint P^{X} [X] d X_{- \infty : r - Δ} d X_{s + Δ : \infty}

(38)

is the probability of observing X_r:s with all possible semi-infinite segments X_{−∞:r−Δ} and X_s+Δ:∞ before and after r:s, respectively; P^X[X] is the probability of a specific infinite trajectory X. The probability distribution of Z is

P^{Z} [Z] = Ω (Y | X) P^{X} [X] / c

(39)

with c = ∫Ω(Y|X)P^X[X] dZ. Therefore, from (26),

Ω (Y_{r : s} | X_{r : s}) = \frac{\iint Ω (Y | X) P^{X} [X] d Z_{- \infty : r - Δ} d Z_{s + Δ : \infty}}{P^{X} [X_{r : s}]}

(40)

= \frac{\iint c P^{Z} [Z] d Z_{- \infty : r - Δ} d Z_{s + Δ : \infty}}{P^{X} [X_{r : s}]}

(41)

= c \frac{P^{Z} [Z_{r : s}]}{P^{X} [X_{r : s}]} .

(42)

To compute Ω(Y_r:s|X_r:s), we divide (37) by (35) and then apply (42),

Ω (Y_{r : s} | X_{r : s}) = Ω (Y_{r} | X_{r}) \prod_{t = r : s - Δ} \frac{Ω (Y_{t : t + Δ} | X_{t : t + Δ})}{Ω (Y_{t} | X_{t})} .

(43)

This factorization is the second requirement of our augmented framework: We must be able to construct Ω(Y|X), which depends on the infinite trajectory X, from Ω(Y_t:t+Δ|X_t:t+Δ), which can only depend on pairs of successive time points X_t:t+Δ.

III. REACTIVE STATISTICS

In this section, we discuss TPT statistics that provide information about mechanisms. These include committors, the reactive flux, the reactive density, the reactive current, and expectations over reactive trajectories that they enable computing. We present expressions for augmented TPT in the form of (22), which can be evaluated using (27). The corresponding expressions for traditional TPT can be obtained by replacing Z_t with X_t. The statistics are normalized so that different reactions that are specified through different ω(Z_r:s) but calculated from the same distribution of infinite trajectories X are directly comparable.

We note that augmented TPT is useful even for reactions that can be described using traditional TPT [i.e., ω(Z_r:s) = ω(X_r:s)]. The augmented process allows reaction mechanisms to be resolved in more detail, since committors and other statistics can be calculated on points that depend on both past and future behaviors of trajectories. Furthermore, the addition of past and future information enables the calculation of statistics with η(X_r:s) no longer restricted to the form in (5).

Several of the statistics that we discuss yield quantities on points v in a collective variable (CV) space θ, that is, a space of functions of a subset of the coordinates. We indicate this using the subscript θ. We express these statistics on a CV space rather than the state space of Z_t because, for complex systems, it is often the case that the full state space contains variables that are irrelevant to understanding the reaction. This is particularly true for the augmented state space, which must contain the information required to select reactive trajectories using (1) and compute statistics using (5), both of which rely on Y_t to obtain past or future information. Nevertheless, the theory holds for the choice θ(Z_t) = Z_t.

A. Reactive flux

The reactive flux $R = I_{ω}^{Z} [1]$ is the expected number of reactive trajectories that start (or end) per unit time. We can express the reactive flux in the form of (22) by choosing γ(Z_t:t+Δ) so that η(Z_r:s) = 1 when Z_r:s is reactive. Such choices of γ(Z_t:t+Δ) include 1_A(Z_t)/Δ and 1_B(Z_t+Δ)/Δ, which are nonzero only when Z_t:t+Δ is the first or last step of the reactive trajectory, respectively. Consequently, we can compute the reactive flux using

R = I^{Z, t} [1_{A} (Z_{t}) q_{+} (Z_{t + Δ}) / Δ]

(44)

= I^{Z, t} [q_{-} (Z_{t}) 1_{B} (Z_{t + Δ}) / Δ],

(45)

where we have applied the identities 1_A(Z_t)q₋(Z_t) = 1_A(Z_t) and 1_B(Z_t)q₊(Z_t) = 1_B(Z_t). Equation (44) counts the number of trajectories that exit A in the time interval Δ and then react; (45) is the analog for trajectories entering B.

The reactive flux is of interest not only in its own right but also for calculating expectations over reactive trajectories,

E_{ω}^{Z} [η] = I_{ω}^{Z} [η] / I_{ω}^{Z} [1] = I_{ω}^{Z} [η] / R .

(46)

For example, the duration N(Z_r:s) = s − r of a trajectory can be expressed in the form of (5) with γ(Z_t:t+Δ) = 1, and so the expected duration of a reactive trajectory is

E_{ω}^{Z} [N] = I^{Z, t} [q_{-} (Z_{t}) q_{+} (Z_{t + Δ})] / R .

(47)

B. Reactive density

The reactive density is the distribution of configurations that belong to reactive trajectories. For a point v in the CV space θ, the reactive density ρ_θ(v) is the probability that θ(Z_t) = v and Z_t is part of a reactive trajectory. Equivalently, it is the expected fraction of time an infinite trajectory spends reactive at v. It can be expressed in the form of (22) as

ρ_{θ} (v) = I^{Z, t} [q_{-} (Z_{t}) q_{+} (Z_{t + Δ}) \frac{δ_{v} (θ (Z_{t})) + δ_{v} (θ (Z_{t + Δ}))}{2}],

(48)

where δ is the Dirac delta function. When computing the expectation, δ_v(θ(Z_t)) selects the points Z_t with θ(Z_t) = v. The term [δ_v(θ(Z_t)) + δ_v(θ(Z_t+Δ))]/2 corresponds to assuming that half of the time of each step Z_t:t+Δ is spent in Z_t and half of the time is spent in Z_t+Δ.

In turn, the reactive density can be used to evaluate (22) when γ(Z_t:t+Δ) = [f(θ(Z_t)) + f(θ(Z_t+Δ))]/2 is a path-independent function on the CV space,

I_{ω}^{Z} [η] = \int ρ_{θ} (v) f (v) d v .

(49)

For instance, the expected fraction of time an infinite trajectory spends reactive can be obtained by setting f(v) = 1, so that

I_{ω}^{Z} [N] = \int ρ_{θ} (v) d v = I^{Z, t} [q_{-} (Z_{t}) q_{+} (Z_{t + Δ})],

(50)

where we have assumed the distribution of trajectories X to be a probability distribution, so that I^X[1] = 1.

We note that when the CV space θ is contained in the CV space θ′, i.e., we can write θ(Z_t) = ζ(θ′(Z_t)) for some ζ(v′), we can calculate ρ_θ(v) by projecting $ρ_{θ^{'}} (v^{'})$ onto θ,

ρ_{θ} (v) = \int δ_{v} (ζ (v^{'})) ρ_{θ^{'}} (v^{'}) d v^{'} .

(51)

We can do the same for functions f(v′) defined on the CV space θ′. We calculate A_θ[f](v), the expected value of f(v′) at a point v in the CV space θ, conditioned on trajectories passing through that point being reactive, as

A_{θ} [f] (v) = \frac{\int δ_{v} (ζ (v^{'})) ρ_{θ^{'}} (v^{'}) f (v^{'}) d v^{'}}{ρ_{θ} (v)} .

(52)

We emphasize that f(v′) can use Y_t to obtain information from the past and future of X_t, and so (52) is significantly more powerful than its traditional TPT counterpart. For instance, we can calculate the conditional mean first passage time, the expected time it takes for Z_t to hit B given that Z_t is part of a reactive trajectory, using (52) as discussed further in Sec. III E, whereas in traditional TPT, we would need to employ a Feynman–Kac formula (e.g., see Ref. 15).

C. Reactive current

The reactive current J_θ(v) through a point v in the CV space θ is the net flow of reactive trajectories within θ through v. It can be expressed in the form of (22) as

\begin{align} J_{θ} (v) & = I^{Z, t} [q_{-} (Z_{t}) q_{+} (Z_{t + Δ}) \frac{δ_{v} (θ (Z_{t})) + δ_{v} (θ (Z_{t + Δ}))}{2} \\ \times \frac{θ (Z_{t + Δ}) - θ (Z_{t})}{Δ}] . \end{align}

(53)

Conceptually, for each pair of successive time points Z_t:t+Δ that are part of a reactive trajectory, we compute the numerical derivative [θ(Z_t+Δ) − θ(Z_t)]/Δ and then split it equally between Z_t and Z_t+Δ. In fact, in the limit Δ → 0, when θ(Z_t) is differentiable, (53) becomes

J_{θ} (v) = I^{Z, t} [1_{A} (Z_{τ_{-} (t)}) 1_{B} (Z_{τ_{+} (t)}) δ_{v} (θ (Z_{t})) \frac{d θ (Z_{t})}{d t}],

(54)

which is the time derivative of θ(Z_t) integrated over the distribution of reactive trajectories Z_t with θ(Z_t) = v.

It can be useful to compute the reactive current along the gradient of a function f(v),

\begin{align} J_{θ} [f] (v) & = I^{Z, t} [q_{-} (Z_{t}) q_{+} (Z_{t + Δ}) \frac{δ_{v} (θ (Z_{t})) + δ_{v} (θ (Z_{t + Δ}))}{2} \\ \times \frac{f (θ (Z_{t + Δ})) - f (θ (Z_{t}))}{Δ}] . \end{align}

(55)

In the limit Δ → 0, for differentiable f(v), (55) is J_θ[f](v) = J_θ(v) · ∇_θf(v), which we can derive by observing that the finite differences in (53) and (55) are dθ(Z_t)/dt and df(θ(Z_t))/dt in this limit, respectively, and by the chain rule df(θ(Z_t))/dt = dθ(Z_t)/dt ⋅∇_θf(θ(Z_t)).

Like the reactive density, we can calculate J_θ(v) and J_θ[f](v) by projecting $J_{θ^{'}} (v^{'})$ and $J_{θ^{'}} [f ◦ ζ] (v^{'})$ onto θ,

J_{θ} (v) = \int δ_{v} (ζ (v^{'})) J_{θ^{'}} [ζ] (v^{'}) d v^{'},

(56)

J_{θ} [f] (v) = \int δ_{v} (ζ (v^{'})) J_{θ^{'}} [f ◦ ζ] (v^{'}) d v^{'},

(57)

where (f◦ζ) (v′) = f(ζ(v′)).

D. Committors

The committors q₋(Z_t) and q₊(Z_t) are defined on the state space of Z_t, which makes them useful for calculating other statistics but can make them hard to interpret. To address this issue, we can treat the committors as reaction coordinates and project them onto a CV space θ as A_θ[q₋](v) and A_θ[q₊](v). These quantities have a physical interpretation. For instance, A_θ[q₊](v) is the probability that a trajectory starting at a point Z_t that is drawn from configurations with θ(Z_t) = v in the ensemble of reactive trajectories enters B when it first leaves D. We note that, unlike most other reactive statistics, A_θ[q₋](v) and A_θ[q₊](v) with θ(Z_t) = X_t are not independent of the choice of Y_t, even when the same ensemble of reactive trajectories is selected because the likelihood that a trajectory contributes positively to the committor and the likelihood that it is reactive are correlated.

E. Conditional mean first and last passage times

The first passage time to the product is the time it takes for a trajectory starting at time t to reach the product B, at time τ₊ (t). It can be expressed as

f (Z_{t}^{'}) = Y_{t}^{'} = \{\begin{cases} Y_{t + Δ}^{'} + Δ & if Z_{t} \notin B, \\ 0 & otherwise, \end{cases}

(58)

where $Z_{t}^{'} = (Z_{t}, Y_{t}^{'})$ . This increments $Y_{t}^{'}$ by Δ for each time step backward in time when Z_t ∉ B and sets $Y_{t}^{'} = 0$ when Z_t ∈ B. The conditional mean first passage time, m₊(Z_t), is the expected first passage time to the product for a point Z_t that is part of a reactive trajectory. This statistic, and higher moments of the first passage time distribution, are useful for real-time forecasting, e.g., of weather.¹⁵ To compute the conditional mean first passage time, we take the conditional expectation of (58) with respect to the reactive density using (52), i.e.,

m_{+} (Z_{t}) = A_{θ^{'}} [f] (Z_{t}),

(59)

where $θ^{'} (Z_{t}^{'}) = Z_{t}$ . We can also calculate more general statistics on the distribution of the first passage time. For instance, the conditional variance of the first passage time to the product is $A_{θ^{'}} [f^{2}] (Z_{t}) - {(A_{θ^{'}} [f] (Z_{t}))}^{2}$ . Likewise, the last passage time from the reactant is the time it takes for a trajectory ending at time t to come from the reactant A, at time τ₋ (t), conditioned on Z_t being part of a reactive trajectory, and can be expressed as

g (Z_{t}^{''}) = Y_{t}^{''} = \{\begin{cases} Y_{t - Δ}^{''} + Δ & if Z_{t} \notin A, \\ 0 & otherwise, \end{cases}

(60)

where $Z_{t}^{''} = (Z_{t}, Y_{t}^{''})$ . The conditional mean last passage time from the reactant is then

m_{-} (Z_{t}) = A_{θ^{''}} [g] (Z_{t}),

(61)

where $θ^{''} (Z_{t}^{''}) = Z_{t}$ . These statistics can be projected onto points v on a CV space θ(Z_t) as A_θ[m₋](v) and A_θ[m₊](v).

IV. CONSTRUCTION OF THE AUGMENTED PROCESS

In this section, we describe a particularly useful way to define the augmented process: We decompose Ω(Y|X) into a product over functions of successive time points, κ(Y_t:t+Δ|X_t:t+Δ). We show that Ω(Y_r:s|X_r:s) so defined satisfies the required properties (26) and (43) by construction. We, then, demonstrate the construction of complex augmented processes from simpler ones by composition. Finally, we show how this machinery applies to the motivating reaction.

A. Decomposition of Ω

To apply our augmented framework, we need to construct the augmented process and, in turn, the distribution of infinite trajectories Z such that Ω(Y_r:s|X_r:s) satisfies (26) and (43). One way to do so is to define Ω(Y|X), calculate Ω(Y_r:s|X_r:s) from Ω(Y|X) using (26), and then verify that (43) holds. In this section, we present an alternative approach. We specify the augmented process through a function of pairs of successive time points, κ(Y_t:t+Δ|X_t:t+Δ), and then use it to calculate Ω(Y_r:s|X_r:s). This procedure satisfies (26) and (43) by construction.

We start by using the pair structure of (43) to factor

Ω (Y | X) = \prod_{t = - \infty : \infty} κ (Y_{t : t + Δ} | X_{t : t + Δ}) .

(62)

It follows immediately that

Ω (Y | X) = κ (Y_{- \infty : r} | X_{- \infty : r}) κ (Y_{r : s} | X_{r : s}) κ (Y_{s : \infty} | X_{s : \infty}),

(63)

where

κ (Y_{r : s} | X_{r : s}) = \prod_{t = r : s - Δ} κ (Y_{t : t + Δ} | X_{t : t + Δ}) .

(64)

This cleanly separates terms that depend on the past X_−∞:r and future X_s:∞ from those that depend on the trajectory segment X_r:s.

Using (63), we can compute (26) as

Ω (Y_{r : s} | X_{r : s}) = k_{-} (Y_{r} | X_{r}) κ (Y_{r : s} | X_{r : s}) k_{+} (Y_{s} | X_{s}),

(65)

where we have defined

k_{-} (Y_{t} | X_{t}) = E^{X} [\int κ (Y_{- \infty : t} | X_{- \infty : t}) d Y_{- \infty : t - Δ} | X_{t}],

(66)

k_{+} (Y_{t} | X_{t}) = E^{X} [\int κ (Y_{t : \infty} | X_{t : \infty}) d Y_{t + Δ : \infty} | X_{t}] .

(67)

We describe how to calculate k₋(Y_t|X_t) and k₊(Y_t|X_t) in Sec. V. The case r = s = t is the weight of Y_t given X_t,

Ω (Y_{t} | X_{t}) = k_{-} (Y_{t} | X_{t}) k_{+} (Y_{t} | X_{t}) .

(68)

We can verify that the resulting Z_t is a Markov process by substituting (65) and (68) into (43).

B. Building augmented processes by composition

The factorization in (62) has a number of advantages over (43). One is that it facilitates deriving useful expressions for treating multiple augmented processes. If we have Ω(Y|X) of the form

Ω (Y | X) = \prod_{n} Ω (Y^{(n)} | X^{(n)}),

(69)

where n labels different augmented processes, we can factor both sides by (62) to obtain

κ (Y_{t : t + Δ} | X_{t : t + Δ}) = \prod_{n} κ (Y_{t : t + Δ}^{(n)} | X_{t : t + Δ}^{(n)}),

(70)

which involves only successive time points. Each of the terms $κ (Y_{t : t + Δ}^{(n)} | X_{t : t + Δ}^{(n)})$ defines a process $Y_{t}^{(n)}$ using information from the original process X_t and other processes $Y_{t}^{(m)}$ , which we denote as $X_{t}^{(n)}$ . We can use this to combine multiple augmented processes, which may be defined independently or hierarchically.

For example, consider the augmented process $Z_{t}^{ω} = (X_{t}, Y_{t}^{ω})$ , where $Y_{t}^{ω}$ is used to define pathways and is defined using $κ (Y_{t : t + Δ}^{ω} | X_{t : t + Δ})$ . To compute statistics on the first and last passage times, we can augment this process with the augmented processes in (58) and (60) by defining

κ (Y_{t : t + Δ}^{'} | Z_{t : t + Δ}^{ω}) = \{\begin{cases} δ_{Y_{t}^{'}} (Y_{t + Δ}^{'} + Δ) & if Z_{t}^{ω} \notin B, \\ δ_{Y_{t}^{'}} (0) & otherwise, \end{cases}

(71)

κ (Y_{t : t + Δ}^{''} | Z_{t : t + Δ}^{ω}) = \{\begin{cases} δ_{Y_{t + Δ}^{''}} (Y_{t}^{''} + Δ) & if Z_{t + Δ}^{ω} \notin B, \\ δ_{Y_{t + Δ}^{''}} (0) & otherwise . \end{cases}

(72)

The combined process $Y_{t} = (Y_{t}^{ω}, Y_{t}^{'}, Y_{t}^{''})$ is then specified by

\begin{align} κ (Y_{t : t + Δ} | X_{t : t + Δ}) & = κ (Y_{t : t + Δ}^{ω} | X_{t : t + Δ}) \\ \times κ (Y_{t : t + Δ}^{'} | Z_{t : t + Δ}^{ω}) κ (Y_{t : t + Δ}^{''} | Z_{t : t + Δ}^{ω}) . \end{align}

(73)

This example furthermore shows how (62) allows forward-in-time and backward-in-time augmented processes to be treated in a unified manner and combined, which is not straightforward with (43).

C. Augmented process for the motivating reaction

We can also use (70) to construct augmented processes by combining simpler augmented processes. Here, we detail a possible construction of the augmented process (19) as a composite of three augmented processes. First, we define an augmented process $Z_{t}^{(0)} = (X_{t}, Y_{t}^{(0)})$ that selects all reactive trajectories regardless of pathway,

\begin{align} κ (Y_{t : t + Δ}^{(0)} | X_{t : t + Δ}) & = [1_{\{0\} \times \{0\}} (Y_{t : t + Δ}^{(0)}) \\ + 1_{((D \cup C) \times \{1\}) \times ((D \cup C) \times \{1\})} (Z_{t : t + Δ}^{(0)}) \\ + 1_{\{2\} \times \{2\}} (Y_{t : t + Δ}^{(0)}) \\ + 1_{((A \cup B) \times \{0\}) \times ((D \cup C) \times \{1\})} (Z_{t : t + Δ}^{(0)}) \\ + 1_{((D \cup C) \times \{1\}) \times ((A \cup B) \times \{2\})} (Z_{t : t + Δ}^{(0)}) \\ + 1_{((A \cup B) \times \{0\}) \times ((A \cup B) \times \{2\})} (Z_{t : t + Δ}^{(0)})] . \end{align}

(74)

For a reactive trajectory X_r:s from A to B, $Y_{t}^{(0)}$ splits the infinite trajectory X into three parts: time points X_−∞:r before the reaction $(Y_{t}^{(0)} = 0)$ , time points X_r+Δ:s−Δ during the reaction $(Y_{t}^{(0)} = 1)$ , and time points X_s:∞ after the reaction $(Y_{t}^{(0)} = 2)$ . We list the possible transitions of this augmented process in Fig. 2(a). The nodes are sets in which $Z_{t}^{(0)}$ may belong; an arrow from one set to another indicates that $Y_{t}^{(0)}$ may transition to $Y_{t + Δ}^{(0)}$ when $Z_{t}^{(0)}$ is in the first set and $Z_{t + Δ}^{(0)}$ is in the second set. For instance, the fourth term in (74) corresponds to the arrow from $(A \cup B) \times \{0\}$ to $(D \cup C) \times \{1\}$ .

FIG. 2. — Construction of the augmented process for a reaction with an intermediate. (a) Possible transitions of the augmented process defined by (74). (b) Possible transitions of the augmented process defined by (79). Each arrow from one set to another indicates that Y_t may transition to Y_t+Δ when Z_t is in the first set and Z_t+Δ is in the second set. (c) Determination of Y_r:s for the reactive trajectory X_r:s from Fig. 1. The black elements indicate trajectories Y_r:s that satisfy κ(Y_r:s|X_r:s) = 1. The gray elements indicate pairs Y_t:t+Δ that satisfy κ(Y_t:t+Δ|X_t:t+Δ) = 1 but do not belong to any Y_r:s with κ(Y_r:s|X_r:s).

Next, we employ additional augmented processes to find the first and last times t₁ and t₂ that the trajectory is in C. Using (74), we define the processes

Y_{t}^{(1)} = \{\begin{cases} Y_{t - Δ}^{(1)} & if X_{t} \in D and Y_{t}^{(0)} = 1, \\ 1 & if X_{t} \in C and Y_{t}^{(0)} = 1, \\ 0 & if Y_{t}^{(0)} \in \{0, 2\}, \end{cases}

(75)

Y_{t}^{(2)} = \{\begin{cases} Y_{t + Δ}^{(2)} & if X_{t} \in D and Y_{t}^{(0)} = 1, \\ 1 & if X_{t} \in C and Y_{t}^{(0)} = 1, \\ 0 & if Y_{t}^{(0)} \in \{0, 2\} . \end{cases}

(76)

During the reaction (i.e., $Y_{t}^{(0)} = 1$ ), $Y_{t}^{(1)} = 1$ for times t ≥ t₁ and $Y_{t}^{(2)} = 1$ for times t ≤ t₂. We can write (75) and (76) as

\begin{align} κ (Y_{t : t + Δ}^{(1)} | Z_{t : t + Δ}^{(0)}) & = [1_{D \times \{1\} \times \{(0, 0), (1, 1)\}} (X_{t + Δ}, Y_{t + Δ}^{(0)}, Y_{t : t + Δ}^{(1)}) \\ + 1_{C \times \{1\} \times \{1\}} (X_{t + Δ}, Y_{t + Δ}^{(0)}, Y_{t + Δ}^{(1)}) \\ + 1_{\{0, 2\} \times \{0\}} (Y_{t + Δ}^{(0)}, Y_{t + Δ}^{(1)})], \end{align}

(77)

\begin{align} κ (Y_{t : t + Δ}^{(2)} | Z_{t : t + Δ}^{(0)}) & = [1_{D \times \{1\} \times \{(0, 0), (1, 1)\}} (X_{t}, Y_{t}^{(0)}, Y_{t : t + Δ}^{(2)}) \\ + 1_{C \times \{1\} \times \{1\}} (X_{t}, Y_{t}^{(0)}, Y_{t}^{(2)}) \\ + 1_{\{0, 2\} \times \{0\}} (Y_{t}^{(0)}, Y_{t}^{(2)})] . \end{align}

(78)

We can then combine (74), (77), and (78) using (70),

\begin{align} κ (Y_{t : t + Δ} | X_{t : t + Δ}) = & [1_{\{0\} \times \{0\}} (Y_{t : t + Δ}) \\ + 1_{(D \times \{1\}) \times (D \times \{1\})} (Z_{t : t + Δ}) \\ + 1_{((C \cup D) \times \{2\}) \times ((C \cup D) \times \{2\})} (Z_{t : t + Δ}) \\ + 1_{(D \times \{3\}) \times (D \times \{3\})} (Z_{t : t + Δ}) \\ + 1_{\{4\} \times \{4\}} (Y_{t : t + Δ}) \\ + 1_{(D \times \{5\}) \times (D \times \{5\})} (Z_{t : t + Δ}) \\ + 1_{((A \cup B) \times \{0\}) \times (D \times \{1\})} (Z_{t : t + Δ}) \\ + 1_{((A \cup B) \times \{0\}) \times (C \times \{2\})} (Z_{t : t + Δ}) \\ + 1_{(D \times \{1\}) \times (C \times \{2\})} (Z_{t : t + Δ}) \\ + 1_{(C \times \{2\}) \times (D \times \{3\})} (Z_{t : t + Δ}) \\ + 1_{(C \times \{2\}) \times ((A \cup B) \times \{4\})} (Z_{t : t + Δ}) \\ + 1_{(D \times \{3\}) \times ((A \cup B) \times \{4\})} (Z_{t : t + Δ}) \\ + 1_{((A \cup B) \times \{0\}) \times (D \times \{5\})} (Z_{t : t + Δ}) \\ + 1_{(D \times \{5\}) \times ((A \cup B) \times \{4\})} (Z_{t : t + Δ}) \\ + 1_{((A \cup B) \times \{0\}) \times ((A \cup B) \times \{4\})} (Z_{t : t + Δ})], \end{align}

(79)

where Z_t = (X_t, Y_t), and to match (19), we have merged $Y_{t}^{(0)}$ , $Y_{t}^{(1)}$ , and $Y_{t}^{(2)}$ into

Y_{t} = \{\begin{cases} 0 & if (Y_{t}^{(0)}, Y_{t}^{(1)}, Y_{t}^{(2)}) = (0, 0, 0), \\ 1 & if (Y_{t}^{(0)}, Y_{t}^{(1)}, Y_{t}^{(2)}) = (1, 0, 1), \\ 2 & if (Y_{t}^{(0)}, Y_{t}^{(1)}, Y_{t}^{(2)}) = (1, 1, 1), \\ 3 & if (Y_{t}^{(0)}, Y_{t}^{(1)}, Y_{t}^{(2)}) = (1, 1, 0), \\ 4 & if (Y_{t}^{(0)}, Y_{t}^{(1)}, Y_{t}^{(2)}) = (2, 0, 0), \\ 5 & if (Y_{t}^{(0)}, Y_{t}^{(1)}, Y_{t}^{(2)}) = (1, 0, 0) . \end{cases}

(80)

We show the possible transitions of this augmented process in Fig. 2(b).

In Fig. 2(c), we illustrate the determination of Y_r:s from κ(Y_t:t+Δ|X_t:t+Δ) for the trajectory X_r:s in Fig. 1. For each time point X_t, we list the possible values of Y_t from (79). We then stitch together trajectories by connecting successive time point pairs Z_t:t+Δ and following the arrows in Fig. 2(b). The black elements in Fig. 2(c) indicate step pairs Y_t:t+Δ in trajectories Y_r:s that satisfy κ(Y_r:s|X_r:s) = 1, and the gray elements indicate step pairs Y_t:t+Δ that satisfy κ(Y_t:t+Δ|X_t:t+Δ) = 1 but do not belong to any Y_r:s that satisfies κ(Y_r:s|X_r:s) = 1.

There is one Y associated with each reactive trajectory segment in X. The diagonal path in Fig. 2(c) represents one such reactive trajectory segment and can be selected using augmented TPT. The horizontal paths are collections of augmented processes corresponding to times before each future reaction (Y_t = 0) and after each past reaction (Y_t = 4).

V. ALGORITHM

In this section, we summarize the operational aspects of the method. For the numerical examples that we consider in the present paper, we evaluate integrals of the form in (3) using a finite difference approximation, which we detail in the Appendix; more complex systems can be treated by extending the approach in Refs. ¹⁶ and ¹⁷, which we leave for future work. In terms of the finite difference approximation, the algorithm for evaluating these statistics is as follows:

1.
Define κ(Y_t:t+Δ|X_t:t+Δ), ω(Z_r:s), and γ(Z_t:t+Δ) for the statistic of interest.
2.
Compute k₋(Y_t|X_t) and k₊(Y_t|X_t), which account for weights associated with the past and future segments of trajectories. To this end, we express (66) and (67) as

\begin{align} k_{-} (Y_{t} | X_{t}) & = E^{X} [\int κ (Y_{t - Δ : t} | X_{t - Δ : t}) \\ \times k_{-} (Y_{t - Δ} | X_{t - Δ}) d Y_{t - Δ} | X_{t}], \end{align}

(81)

\begin{align} k_{+} (Y_{t} | X_{t}) & = E^{X} [\int κ (Y_{t : t + Δ} | X_{t : t + Δ}) \\ \times k_{+} (Y_{t + Δ} | X_{t + Δ}) d Y_{t + Δ} | X_{t}], \end{align}

(82)

and solve these equations using (A9) and (A10).

3.
Compute Ω(Y_t:t+Δ|X_t:t+Δ) and Ω(Y_t|X_t) by (65) and (68), respectively.
4.
Compute q₋(Z_t) and q₊(Z_t). To this end, we express the committors as solutions to boundary value problems. For Z_t ∈ D,

q_{-} (Z_{t}) = E^{Z} [q_{-} (Z_{t - Δ}) ∣ Z_{t}]

(83)

\begin{align} = E^{X} [\int Ω (Y_{t - Δ : t} | X_{t - Δ : t}) \\ \times q_{-} (Z_{t - Δ}) d Y_{t - Δ} | X_{t}]/ Ω (Y_{t} | X_{t}), \end{align}

(84)

q_{+} (Z_{t}) = E^{Z} [q_{+} (Z_{t + Δ}) ∣ Z_{t}]

(85)

\begin{align} = E^{X} [\int Ω (Y_{t : t + Δ} | X_{t : t + Δ}) \\ \times q_{+} (Z_{t + Δ}) d Y_{t + Δ} | X_{t}]/ Ω (Y_{t} | X_{t}) . \end{align}

(86)

For Z_t∉D, q₋(Z_t) = 1_A(Z_t) and q₊(Z_t) = 1_B(Z_t). Above, (83) and (85) result from applying the identities (12) and (13) to the definitions (16) and (17), and (84) and (86) follow, in turn, from (29) and (31). We solve (84) and (86) using (A9) and (A10).

5.
Evaluate (27) using (A11).

VI. NUMERICAL EXAMPLES

In this section, we demonstrate our augmented framework on simple examples that make the limitations of traditional TPT apparent. The examples we consider employ overdamped Langevin dynamics on a potential U(x) and satisfy the Fokker–Planck equation given by

\frac{\partial P^{X} [X_{t}]}{\partial t} = \nabla \cdot (P^{X} [X_{t}] \nabla U (X_{t})) + \nabla^{2} P^{X} [X_{t}] .

(87)

We calculate all statistics using a quadrature scheme adapted from Ref. 16, which we detail in the Appendix.

A. Reaction through an intermediate

In our first example, we demonstrate the use of augmented TPT to resolve individual reaction steps. We consider a reaction through an intermediate with the three-dimensional potential

\begin{matrix} U (x) & = & 5 [{(\frac{x_{1} - 1}{2})}^{4} + {(\frac{x_{2}}{3})}^{4} + {(\frac{x_{3}}{3})}^{4} - e^{- {(x_{1} - 2)}^{2} - x_{2}^{2}} \\ - 3 e^{- x_{1}^{2} - {(x_{2} - 2)}^{2} - {(x_{3} - 2)}^{2}} - 2 e^{- x_{1}^{2} - x_{2}^{2} - {(x_{3} - 2)}^{2}} \\ - 3 e^{- x_{1}^{2} - {(x_{2} + 2)}^{2} - {(x_{3} + 2)}^{2}} - 2 e^{- x_{1}^{2} - x_{2}^{2} - {(x_{3} + 2)}^{2}}], \end{matrix}

(88)

where x = (x₁, x₂, x₃). We visualize the U(x) = −3 isosurface and the probability density on the CV space θ(x) = (x₁, x₂) in Fig. 3.

Our reaction of interest is described by the indicator function

ω (X_{r : s}) = 1_{A \times (D \cup C) \times \dots \times (D \cup C) \times B} (X_{r : s}),

(89)

where we have defined the reactant A, product B, and intermediate C to be

\begin{aligned} A & = \{x ∣ x_{1}^{2} + {(x_{2} - 2)}^{2} \leq 0 . 5^{2}\}, \\ B & = \{x ∣ x_{1}^{2} + {(x_{2} + 2)}^{2} \leq 0 . 5^{2}\}, \\ C & = \{x ∣ {(x_{1} - 2)}^{2} + x_{2}^{2} \leq 0 . 5^{2}\}, \end{aligned}

(90)

and D = (A ∪ B ∪ C)^c. This reaction represents, for instance, a catalyzed reaction where the interaction of the substrate with the catalyst (represented by x₁) and a substrate internal coordinate (represented by x₂) can be observed while the status of the reaction (represented by x₃) cannot. The observable variables form the CV space (x₁, x₂), and the sets A, B, and C are defined on this CV space.

There are two pathways in this reaction: uncatalyzed and catalyzed. In the uncatalyzed pathway, the system transitions from the reactant A to S₁, then crosses directly to S₂ before entering the product B. In the catalyzed pathway, instead of directly crossing from S₁ to S₂, the system transitions from S₁ into the intermediate C and then to S₂.

We select trajectories that react through each pathway by applying augmented TPT to the reaction. We define an augmented process for each of the pathways by including only the terms from (79) that are involved in that pathway. For the uncatalyzed pathway, we remove all reactive trajectories that visit the intermediate C by removing all terms that contain $Y_{t} \in \{1, 2, 3\}$ from (79), yielding

\begin{align} κ (Y_{t : t + Δ} | X_{t : t + Δ}) = & [1_{\{0\} \times \{0\}} (Y_{t : t + Δ}) \\ + 1_{(D \times \{5\}) \times (D \times \{5\})} (Z_{t : t + Δ}) \\ + 1_{\{4\} \times \{4\}} (Y_{t : t + Δ}) \\ + 1_{((A \cup B) \times \{0\}) \times (D \times \{5\})} (Z_{t : t + Δ}) \\ + 1_{((A \cup B) \times \{0\}) \times ((A \cup B) \times \{4\})} (Z_{t : t + Δ}) \\ + 1_{(D \times \{5\}) \times ((A \cup B) \times \{4\})} (Z_{t : t + Δ})], \end{align}

(91)

and then select reactive trajectories using

ω (Z_{r : s}) = 1_{(A \times \{0\}) \times (D \times \{5\}) \times \dots \times (D \times \{5\}) \times (B \times \{4\})} (Z_{r : s}) .

(92)

For the catalyzed pathway, we retain only reactive trajectories that pass through the intermediate C by removing all terms that contain $Y_{t} \in \{5\}$ as well as the direct transition from $(A \cup B) \times \{0\}$ to $(A \cup B) \times \{4\}$ , which does not pass through C. This yields the augmented process

\begin{align} κ (Y_{t : t + Δ} | X_{t : t + Δ}) = & [1_{\{0\} \times \{0\}} (Y_{t : t + Δ}) \\ + 1_{(D \times \{1\}) \times (D \times \{1\})} (Z_{t : t + Δ}) \\ + 1_{((C \cup D) \times \{2\}) \times ((C \cup D) \times \{2\})} (Z_{t : t + Δ}) \\ + 1_{(D \times \{3\}) \times (D \times \{3\})} (Z_{t : t + Δ}) \\ + 1_{\{4\} \times \{4\}} (Y_{t : t + Δ}) \\ + 1_{((A \cup B) \times \{0\}) \times (D \times \{1\})} (Z_{t : t + Δ}) \\ + 1_{((A \cup B) \times \{0\}) \times (C \times \{2\})} (Z_{t : t + Δ}) \\ + 1_{(D \times \{1\}) \times (C \times \{2\})} (Z_{t : t + Δ}) \\ + 1_{(C \times \{2\}) \times (D \times \{3\})} (Z_{t : t + Δ}) \\ + 1_{(C \times \{2\}) \times ((A \cup B) \times \{4\})} (Z_{t : t + Δ}) \\ + 1_{(D \times \{3\}) \times ((A \cup B) \times \{4\})} (Z_{t : t + Δ})] . \end{align}

(93)

We then select reactive trajectories using

ω (Z_{r : s}) = 1_{(A \times \{0\}) \times D^{'} \times \dots \times D^{'} \times (B \times \{4\})} (Z_{r : s}),

(94)

where $D^{'} = (D \times \{1\}) \cup ((C \cup D) \times \{2\}) \cup (D \times \{3\})$ . We show the possible transitions of (91) in Fig. 4(a) and (93) in Fig. 4(b).

Our goal is to visualize the mechanism of the reaction in the CV space (x₁, x₂) and quantify the relative rates of the two pathways. To this end, we examine four reactive statistics: the reactive density ρ_θ, the reactive current J_θ, the forward committor A_θ[q₊], and the conditional mean first passage time to the product A_θ[m₊].

We plot the reactive statistics from traditional TPT in Fig. 5(a). The reactive density ρ_θ reveals that reactive trajectories spend much of their time around (x₁, x₂) = (0, 0) and (x₁, x₂) = (2, 0), which is consistent with the presence of intermediates S₁/S₂ and C. The reactive current J_θ (vector field) suggests that the reaction is dominated by the uncatalyzed pathway, although a significant fraction does react through the catalyzed pathway. The forward committor A_θ[q₊] changes rapidly around (x₁, x₂) = (0, 0), suggesting the presence of a bottleneck, corresponding to direct crossing from S₁ to S₂. It is almost uniform around C, suggesting the presence of the intermediate C. The conditional mean first passage time A_θ[m₊] can be interpreted in the same way as the forward committor; however, it allows us to visualize the order in which states are visited more clearly. The region below A has a higher value of A_θ[m₊] than the region around C, which suggests that reactive trajectories usually visit the former before the latter. Together, these reactive statistics suggest a cohesive picture. The reaction is dominated by the uncatalyzed pathway, which has a bottleneck around (x₁, x₂) = (0, 0). Reactive trajectories may leave this pathway before the bottleneck into the catalyzed pathway, which has an intermediate C, and return after the bottleneck (i.e., they appear to circumvent S₁/S₂).

Some of the results from traditional TPT are misleading. For example, traditional TPT suggests that the uncatalyzed pathway is dominant, yet the total reactive flux from A to B is 3.0 × 10⁻⁴, while the reactive flux for trajectories that visit C is 2.2 × 10⁻⁴, i.e., 73% of trajectories go through the intermediate. This results from the restriction of the observed coordinates to (x₁, x₂); in the full state space (x₁, x₂, x₃), traditional TPT is capable of correctly resolving the two pathways. However, we note that even given (x₁, x₂, x₃), traditional TPT cannot calculate dynamical statistics for the ensemble of trajectories that react through a particular pathway. Augmented TPT provides a solution to the overlap issue and enables the calculation of reactive statistics for individual steps of each pathway.

We first analyze the uncatalyzed pathway, which was wrongly suggested by traditional TPT to be the dominant pathway. Reactive statistics for this pathway are shown in Fig. 5(b). As we would expect, the reactive density ρ_θ shows that reactive trajectories spend much of their time around S₁/S₂, and the reactive current J_θ suggests that reactive trajectories flow directly from A to S₁/S₂ to B, without any notable deviation to the vicinity of C. On the uncatalyzed pathway, the forward committor A_θ[q₊] changes rapidly around (x₁, x₂) = (0, 0) due to the transition from S₁ to S₂. Off the uncatalyzed pathway, the forward committor has a higher value closer to A and a lower value closer to B. This is surprising and results from slight differences in the reactive density at different values of x₃. The conditional mean first passage time A_θ[m₊] rapidly decreases near S₁/S₂, suggesting the same single bottleneck.

We now analyze the catalyzed pathway through the intermediate C, which dominates the rate. We select trajectories that react through this pathway using (94). Augmented TPT enables us to split the pathway into individual steps and, so, to resolve the structure of each reaction step. The first step (Y_t = 1) starts when the reactive trajectory leaves the reactant A and ends when it first enters the intermediate C. The second step (Y_t = 2) starts at the first time the reactive trajectory enters C, and it ends at the last time the reactive trajectory leaves C. The third step (Y_t = 3) starts when the reactive trajectory last leaves the intermediate C and ends when it enters the product B. As we now explain, separating the catalyzed pathway into these steps leads to reactive statistics that lead to a different interpretation than those from traditional TPT.

In the first step, the reactive density ρ_θ and reactive current J_θ clearly show that most reactive trajectories flow through an intermediate near (x₁, x₂) = (0, 0), in this case S₁, rather than a more direct path from A to C, as suggested by the reactive current from traditional TPT. Likewise, in the last step, they show that most reactive trajectories flow through an intermediate near (x₁, x₂) = (0, 0), in this case S₂. The absence of any significant reactive current in the second step, along with the high reactive density near C, suggests that reactive trajectories predominantly remain in C during this step, with a few trajectories transitioning back and forth to S₁/S₂, where there is a lower value of reactive density. The reactive current from traditional TPT is misleading because the flows from S₁ to C and from C to S₂ cancel each other, since S₁ and S₂ overlap in the CV space.

The forward committor A_θ[q₊] on the catalyzed pathway is uniformly low in the first step and uniformly high in the third step. This suggests that the main bottleneck occurs in the second step, where A_θ[q₊] ≈ 0.5 around C. The abrupt changes between steps suggest that the dynamics of the variables not captured within the CV space are influential in determining whether the reaction occurs. For the second step, we note that the low value of A_θ[q₊] below A and high value above B reflect the full three-dimensional potential [Fig. 3(a)]. In traditional TPT, the high value of A_θ[q₊] in the first step and the low value in the third step cancel, giving rise to the apparent rapid change near (x₁, x₂) = (0, 0).

In the first step, A_θ[m₊] decreases from (x₁, x₂) = (0, 0) to C, suggesting the presence of a bottleneck between intermediates S₁ to C. The same holds for the second step, suggesting that if the system crosses back to an intermediate near (x₁, x₂) = (0, 0), it needs to overcome the same bottleneck to return to C. In the third step, A_θ[m₊] decreases from (x₁, x₂) = (0, 0) to B, marking the transition from S₂ to B. We note that this decrease occurs at a slightly lower value than in the uncatalyzed pathway. This separation of the two bottlenecks lies in contrast with A_θ[m₊] from traditional TPT, where the superposition of the two bottlenecks creates an apparent bottleneck near (x₁, x₂) = (0, 0), which conflates the dynamics of the catalyzed and uncatalyzed pathways.

Overall, we see that the statistics from traditional TPT qualitatively resemble a superposition of those for the uncatalyzed pathway and those associated with the second step of the catalyzed pathway, in which the system is mainly localized at C. The important contributions from the first and third steps of the catalyzed pathway mask each other in traditional TPT.

B. Reaction with multiple pathways

For our second example, we demonstrate the use of augmented TPT to separate pathways that overlap in the CV space. We consider overdamped Langevin dynamics on the three-dimensional potential

\begin{matrix} U (x) & = & 5 [{(\frac{x_{1}}{2})}^{4} + {(\frac{x_{2}}{3})}^{4} + {(\frac{x_{3}}{2})}^{4} + e^{- x_{2}^{2}} \\ - 3 e^{- x_{1}^{2} - {(x_{2} - 3)}^{2}} - 2 e^{- {(x_{1} - x_{2})}^{2} + {(x_{3} - 1)}^{2}} \\ - 3 e^{- x_{1}^{2} - {(x_{2} + 3)}^{2}} - 2 e^{- {(x_{1} + x_{2})}^{2} - {(x_{3} + 1)}^{2}}], \end{matrix}

(95)

where x = (x₁, x₂, x₃). As previously, these dynamics satisfy the Fokker–Planck equation in (87). The U(x) = −3 isosurface for this potential is shown in Fig. 6(a), and the probability distribution on the (x₁, x₂) coordinates, which we use as CVs, is shown in Fig. 6(b). We define the reactant A, product B, and intermediates C₁, C₂, C₃, and C₄ to be

\begin{gathered} A = \{x ∣ x_{1}^{2} + {(x_{2} - 3)}^{2} \leq 1\}, \\ B = \{x ∣ x_{1}^{2} + {(x_{2} + 3)}^{2} \leq 1\}, \\ C_{1} = \{x ∣ {(x_{1} + x_{2})}^{2} + {(x_{1} - x_{2} + 4)}^{2} / 4 \leq 1\}, \\ C_{2} = \{x ∣ {(x_{1} + x_{2} - 4)}^{2} / 4 + {(x_{1} - x_{2})}^{2} \leq 1\}, \\ C_{3} = \{x ∣ {(x_{1} + x_{2} + 4)}^{2} / 4 + {(x_{1} - x_{2})}^{2} \leq 1\}, \\ C_{4} = \{x ∣ {(x_{1} + x_{2})}^{2} + {(x_{1} - x_{2} - 4)}^{2} / 4 \leq 1\} . \end{gathered}

(96)

We also define the sets C = C₁ ∪ C₂ ∪ C₃ ∪ C₄ and D = (A ∪ B ∪ C)^c. The reaction of interest is specified through the indicator function

ω (X_{r : s}) = 1_{A \times (D \cup C) \times \dots \times (D \cup C) \times B} (X_{r : s}) .

(97)

FIG. 6. — Reaction with multiple pathways. (a) U(x) = −3 isosurface of (95). (b) Marginal distribution on the CV space (x₁, x₂).

We use the intermediates to define pathways. This is advantageous because it is not possible to divide the space into regions corresponding to different pathways owing to overlap. We define four pathways: major pathways I and II and minor pathways III and IV. We define pathway I to first hit intermediate C_i = C₁ after leaving A and last hit intermediate C_j = C₄ before hitting B. We likewise define pathway II with (C_i, C_j) = (C₂, C₃), pathway III with (C_i, C_j) = (C₁, C₃), and pathway IV with (C_i, C_j) = (C₂, C₄).

To define the means of selecting these pathways, we label a trajectory before the reaction with Y_t = 0 and after the reaction with Y_t = 4. When the reaction is in process, we label times before the reactive trajectory first enters C_i with Y_t = 1, after the reactive trajectory last exits C_j with Y_t = 3, and between those times with Y_t = 2. Then, we select pathways using

ω (Z_{r : s}) = 1_{(A \times \{0\}) \times D^{'} \dots D^{'} \times (B \times \{4\})} (Z_{r : s}),

(98)

where $D^{'} = (D \times \{1\}) \cup ((D \cup C) \times \{2\}) \cup (D \times \{3\})$ . This choice of Y_t corresponds to

\begin{align} κ (Y_{t : t + Δ} | X_{t : t + Δ}) = & [1_{\{0\} \times \{0\}} (Y_{t : t + Δ}) \\ + 1_{(D \times \{1\}) \times (D \times \{1\})} (Z_{t : t + Δ}) \\ + 1_{((C \cup D) \times \{2\}) \times ((C \cup D) \times \{2\})} (Z_{t : t + Δ}) \\ + 1_{(D \times \{3\}) \times (D \times \{3\})} (Z_{t : t + Δ}) \\ + 1_{\{4\} \times \{4\}} (Y_{t : t + Δ}) \\ + 1_{(A \times \{0\}) \times (D \times \{1\})} (Z_{t : t + Δ}) \\ + 1_{(A \times \{0\}) \times (C_{i} \times \{2\})} (Z_{t : t + Δ}) \\ + 1_{(D \times \{1\}) \times (C_{i} \times \{2\})} (Z_{t : t + Δ}) \\ + 1_{(C_{j} \times \{2\}) \times (D \times \{3\})} (Z_{t : t + Δ}) \\ + 1_{(C_{j} \times \{2\}) \times (B \times \{4\})} (Z_{t : t + Δ}) \\ + 1_{(D \times \{3\}) \times (B \times \{4\})} (Z_{t : t + Δ})] . \end{align}

(99)

The first five terms of (99) denote the sets in which X_t may be for each of the labels $\{0, 1, 2, 3, 4\}$ and the remaining six terms describe the permitted transitions between the labels. We represent (99) visually in Fig. 7. For instance, the seventh term, $1_{(A \times \{0\}) \times (C_{i} \times \{2\})} (Z_{t : t + Δ})$ , corresponds to the single timestep transition from A to C_i and associates this with a change in the label from Y_t = 0 to Y_t = 2.

FIG. 7. — Possible transitions for the augmented process of the reaction with multiple pathways, defined by (99).

We determine the reactive flux associated with each pathway and compare it with the total reactive flux. For the reaction specified by (97), the total reactive flux is 7.5 × 10⁻⁴. Each of the major pathways has a reactive flux of 2.9 × 10⁻⁴, which is 39% of the total reactive flux. Each of the minor pathways has a reactive flux of 6.4 × 10⁻⁴, which is 8.5% of the total reactive flux. These four pathways, thus, give rise to 95% of the total reactive flux and, so, are representative of the majority of reactive trajectories. The remaining 5% results from trajectories that do not conform to these pathways (e.g., ones that pass through only a single intermediate).

In Fig. 8, we plot four reactive statistics: the reactive density ρ_θ, the reactive current J_θ, the conditional mean last passage time from the reactant A_θ[m₋], and the conditional mean first passage time to the product A_θ[m₊].

Reactive statistics from traditional TPT are shown in Fig. 8(a). From the reactive density ρ_θ, we observe that reactive trajectories spend most of their time in the X-shaped region that connects the intermediates. The reactive current J_θ suggests that the majority of the reactive trajectories flow from the reactant A to either C₁ or C₂, then to either C₃ or C₄ via (x₁, x₂) = (0, 0), and finally to the product B. Importantly, even if we consider the full state space (x₁, x₂, x₃), we cannot determine the relative weights of these four possible pathways because the pathways are composed of segments that belong to multiple pathways (e.g., the first half of pathway III overlaps with pathway I and the second half of pathway III overlaps with pathway II) and the transitions through (x₁, x₂, x₃) = (0, 0, 0) along pathways III and IV occur in opposite directions. Other quantities calculated using traditional TPT have the same issue. Both A_θ[m₋] and A_θ[m₊] are unable to distinguish between the pathways and only indicate the presence of a bottleneck between C₁ ∪ C₂ and C₃ ∪ C₄.

In Fig. 8(b), we visualize the reactive statistics for the major pathways. As pathway I and pathway II are mirror images of one another, we discuss only pathway I. The reactive current J_θ clearly shows that the system transitions directly between on-pathway intermediates, from A to C₁ to C₄ to B. The reactive density ρ_θ corroborates this picture, with relatively little density in C₂ and C₃ compared to C₁ and C₄. We observe that A_θ[m₋] and A_θ[m₊] are highest near C₂ and C₃, suggesting that these configurations are dynamically disconnected from the main flow of the reactive trajectories. The transition from C₁ to C₄ is accompanied by an abrupt increase in A_θ[m₋](v) and an abrupt decrease in A_θ[m₊], which shows that a transition bottleneck is traversed. We note that the sharp increase in A_θ[m₋] from A to C₁ and the sharp decrease in A_θ[m₊] from C₄ to B imply the presence of bottlenecks between each of these pairs of states.

The minor pathways in Fig. 8(c) result from trajectories that switch between the major pathways. Since the two minor pathways are mirror images of each other, we discuss only pathway III, which involves a switch from pathway I to pathway II. In contrast with the major pathways, the reactive density ρ_θ indicates that the system is likely to visit off-pathway intermediates C₂ and C₄ on its way from C₁ to C₃. The conditional mean last passage time A_θ[m₋] from the reactant is nearly identical to that of pathway I, with higher values near C₂ and C₃ and lower values near C₁ and C₄. However, the conditional mean first passage time A_θ[m₊] to the product is nearly identical to that of pathway II. In conjunction with ρ_θ, the slight increase in A_θ[m₋] from C₁ to C₄ suggests that these intermediates readily interconvert, and similarly for the slight increase in A_θ[m₊] from C₃ to C₂. The larger change in A_θ[m₋] and A_θ[m₊] between C₂ ∪ C₃ and C₁ ∪ C₄ implies a bottleneck between C₂ ∪ C₃ and C₁ ∪ C₄. This bottleneck is significant because the transition from C₁ to C₃ must occur for this pathway. As with the major pathways, there are also bottlenecks between A and C₁, and C₃ and B.

VII. DISCUSSION

In this paper, we introduced an augmented process that labels sequences of events. This process enabled us to write statistics that depend on knowledge of past and future events in terms of quantities that are local in time and, in turn, to extend the TPT framework. We demonstrated how this framework can be used to separate statistics of competing pathways in reactions with intermediates to reveal features of mechanisms that are not apparent from traditional TPT analyses. Our framework can also be used to treat new classes of reactions that are not amenable to TPT analyses. For instance, reactions with the same reactant and product states, such as cycles of oscillators and excitable systems, can be handled using augmented TPT but not traditional TPT.

Our framework generalizes a previous extension of TPT¹³ and history-augmented approaches for computing rates^9–11 and reactive statistics.¹² The augmented process that we introduce is distinct from that in Ref. 14, in which the state space is expanded to include a time variable to treat time-dependent processes, including transient relaxations and systems with periodically varying dynamics. As a result, the two approaches can be combined to treat sequences of events of finite-time processes.

Our focus here was on establishing the conceptual framework for augmented TPT, and the examples that we showed were sufficiently simple that the Fokker–Planck equations defining their dynamics could be numerically integrated in the variables by quadrature. The dynamics of models with larger numbers of variables must instead be sampled through simulations that generate stochastic realizations of trajectories (i.e., the dynamics of the variables are numerically integrated in time). Because, like traditional TPT, the framework casts statistics in terms of quantities that are local in time, we can extend methods that compute reactive statistics from short trajectories.^12,16,17 Such efforts are underway.

ACKNOWLEDGMENTS

We thank Adam Antoszewski, Spencer Guo, John Strahan, and Bodhi Vani for useful discussions. We acknowledge the Research Computing Center at the University of Chicago for computational resources. This work was supported by National Institutes of Health Award No. R35 GM136381 and National Science Foundation Award No. DMS-2054306.

APPENDIX: FINITE DIFFERENCE SCHEME

For a time-reversible drift-diffusion process given by

\frac{\partial P^{X} [X_{t}]}{\partial t} = \nabla \cdot (P^{X} [X_{t}] \nabla U (X_{t})) + \nabla^{2} P^{X} [X_{t}]

(A1)

with stationary distribution π(X_t) ∝ exp(−U(X_t)), the infinitesimal generator can be used to compute expectations forward-in-time as

\frac{\partial E^{X} [f (X_{t + Δ}) ∣ X_{t}]}{\partial Δ} = - \nabla U (X_{t}) \cdot \nabla f (X_{t}) + \nabla^{2} f (X_{t}) .

(A2)

To evaluate expectations by quadrature, we adapt the finite difference scheme from Ref. 16, which we reproduce here. We approximate (A1) as a discrete-time Markov jump process with time step Δ on a grid with uniform spacing ϵ. For a small change ϵ_i in the direction of the ith coordinate with magnitude ϵ, we substitute −∇U(X_t) = ∇π(X_t)/π(X_t) and then make the approximation

\begin{matrix} \frac{E [f (X_{t + Δ}) ∣ X_{t}] - f (X_{t})}{Δ} \\ \approx \frac{1}{2} \sum_{i} \frac{(π (X_{t} + ϵ_{i}) - π (X_{t})) / ϵ}{(π (X_{t} + ϵ_{i}) + π (X_{t})) / 2} [\frac{f (X_{t} + ϵ_{i}) - f (X_{t})}{ϵ}] \\ + \frac{1}{2} \sum_{i} \frac{(π (X_{t}) - π (X_{t} - ϵ_{i})) / ϵ}{(π (X_{t}) + π (X_{t} - ϵ_{i})) / 2} [\frac{f (X_{t}) - f (X_{t} - ϵ_{i})}{ϵ}] \\ + \sum_{i} \frac{f (X_{t} + ϵ_{i}) + f (X_{t} - ϵ_{i}) - 2 f (X_{t})}{ϵ^{2}} . \end{matrix}

(A3)

Alternatively, we can write

\begin{align} \frac{E^{X} [f (X_{t + Δ}) ∣ X_{t}] - f (X_{t})}{Δ} \\ = \frac{1}{Δ} (\frac{\int P^{X} [X_{t : t + Δ}] f (X_{t + Δ}) d X_{t + Δ}}{P^{X} [X_{t}]} - f (X_{t})) \end{align}

(A4)

\begin{align} \approx \frac{1}{Δ} [(P (X_{t}, X_{t}) f (X_{t}) + \sum_{i} P (X_{t}, X_{t} + ϵ_{i}) f (X_{t} + ϵ_{i}) \\ + \sum_{i} P (X_{t}, X_{t} - ϵ_{i}) f (X_{t} - ϵ_{i}))/ π (X_{t}) - f (X_{t})], \end{align}

(A5)

where P(X_t:t+Δ) represents the approximation of P^X[X_t:t+Δ] on the grid. Above, the first equality follows from the definition of conditional expectation and the second assumes that all transitions within time Δ are to neighboring grid points.

By matching terms between (A5) and (A3), the only nonzero entries of P(x, x′) are

P (x, x + ϵ_{i}) = \frac{2 Δ}{ϵ^{2}} \frac{1}{1 / π (x) + 1 / π (x + ϵ_{i})},

(A6)

P (x, x - ϵ_{i}) = \frac{2 Δ}{ϵ^{2}} \frac{1}{1 / π (x) + 1 / π (x - ϵ_{i})},

(A7)

P (x, x) = π (x) - \sum_{i} [P (x, x + ϵ_{i}) + P (x, x - ϵ_{i})] .

(A8)

Then, we can use the above expressions to estimate expectations using

E^{X} [f (X_{t - Δ : t}) ∣ X_{t}] \approx \frac{\sum_{X_{t - Δ}} P (X_{t - Δ : t}) f (X_{t - Δ : t})}{π (X_{t})},

(A9)

E^{X} [f (X_{t : t + Δ}) ∣ X_{t}] \approx \frac{\sum_{X_{t + Δ}} P (X_{t : t + Δ}) f (X_{t : t + Δ})}{π (X_{t})},

(A10)

E^{X} [f (X_{t : t + Δ})] \approx \sum_{X_{t : t + Δ}} P (X_{t : t + Δ}) f (X_{t : t + Δ}),

(A11)

where the sums are over points on the grid.

AUTHOR DECLARATIONS

Conflict of Interest

The authors have no conflicts to disclose.

Author Contributions

Chatipat Lorpaiboon: Conceptualization (lead); Formal analysis (lead); Investigation (lead); Methodology (lead); Software (lead); Visualization (lead); Writing – original draft (lead); Writing – review & editing (equal). Jonathan Weare: Conceptualization (equal); Funding acquisition (equal); Supervision (equal); Writing – review & editing (equal). Aaron R. Dinner: Conceptualization (equal); Funding acquisition (equal); Supervision (equal); Writing – review & editing (equal).

DATA AVAILABILITY

Data sharing is not applicable to this article as no new data were created or analyzed in this study.

REFERENCES

1.Hänggi P., Talkner P., and Borkovec M., “Reaction-rate theory: Fifty years after Kramers,” Rev. Mod. Phys. 62, 251–341 (1990). 10.1103/revmodphys.62.251 [DOI] [Google Scholar]
2.Peters B., Reaction Rate Theory and Rare Events (Elsevier, 2017). [Google Scholar]
3.E W. and Vanden-Eijnden E., “Towards a theory of transition paths,” J. Stat. Phys. 123, 503–523 (2006). 10.1007/s10955-005-9003-9 [DOI] [Google Scholar]
4.Vanden-Eijnden E., “Transition path theory,” in Computer Simulations in Condensed Matter Systems: From Materials to Chemical Biology, Lecture Notes in Physics Vol. 1, edited by Ferrario M., Ciccotti G., and Binder K. (Springer, Berlin, Heidelberg, 2006), pp. 453–493. [Google Scholar]
5.E W. and Vanden-Eijnden E., “Transition-path theory and path-finding algorithms for the study of rare events,” Annu. Rev. Phys. Chem. 61, 391–420 (2010). 10.1146/annurev.physchem.040808.090412 [DOI] [PubMed] [Google Scholar]
6.Metzner P., Schütte C., and Vanden-Eijnden E., “Transition path theory for Markov jump processes,” Multiscale Model. Simul. 7, 1192–1219 (2009). 10.1137/070699500 [DOI] [Google Scholar]
7.Metzner P., Schütte C., and Vanden-Eijnden E., “Illustration of transition path theory on a collection of simple examples,” J. Chem. Phys. 125, 084110 (2006). 10.1063/1.2335447 [DOI] [PubMed] [Google Scholar]
8.Du R., Pande V. S., Grosberg A. Y., Tanaka T., and Shakhnovich E. S., “On the transition coordinate for protein folding,” J. Chem. Phys. 108, 334–350 (1998). 10.1063/1.475393 [DOI] [Google Scholar]
9.Suárez E., Adelman J. L., and Zuckerman D. M., “Accurate estimation of protein folding and unfolding times: Beyond Markov state models,” J. Chem. Theory Comput. 12, 3473–3481 (2016). 10.1021/acs.jctc.6b00339 [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Vanden-Eijnden E. and Venturoli M., “Exact rate calculations by trajectory parallelization and tilting,” J. Chem. Phys. 131, 044120 (2009). 10.1063/1.3180821 [DOI] [PubMed] [Google Scholar]
11.Dickson A., Warmflash A., and Dinner A. R., “Separating forward and backward pathways in nonequilibrium umbrella sampling,” J. Chem. Phys. 131, 154104 (2009). 10.1063/1.3244561 [DOI] [PubMed] [Google Scholar]
12.Vani B. P., Weare J., and Dinner A. R., “Computing transition path theory quantities with trajectory stratification,” J. Chem. Phys. 157, 034106 (2022). 10.1063/5.0087058 [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Miron P., Beron-Vera F. J., Helfmann L., and Koltai P., “Transition paths of marine debris and the stability of the garbage patches,” Chaos 31, 033101 (2021). 10.1063/5.0030535 [DOI] [PubMed] [Google Scholar]
14.Helfmann L., Ribera Borrell E., Schütte C., and Koltai P., “Extending transition path theory: Periodically driven and finite-time dynamics,” J. Nonlinear Sci. 30, 3321–3366 (2020). 10.1007/s00332-020-09652-7 [DOI] [Google Scholar]
15.Finkel J., Webber R. J., Gerber E. P., Abbot D. S., and Weare J., “Learning forecasts of rare stratospheric transitions from short simulations,” Mon. Weather Rev. 149, 3647–3669 (2021). 10.1175/mwr-d-21-0024.1 [DOI] [Google Scholar]
16.Thiede E. H., Giannakis D., Dinner A. R., and Weare J., “Galerkin approximation of dynamical quantities using trajectory data,” J. Chem. Phys. 150, 244111 (2019). 10.1063/1.5063730 [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Strahan J., Antoszewski A., Lorpaiboon C., Vani B. P., Weare J., and Dinner A. R., “Long-time-scale predictions from short-trajectory data: A benchmark analysis of the trp-cage miniprotein,” J. Chem. Theory Comput. 17, 2948–2963 (2021). 10.1021/acs.jctc.0c00933 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Data sharing is not applicable to this article as no new data were created or analyzed in this study.

[c1] 1.Hänggi P., Talkner P., and Borkovec M., “Reaction-rate theory: Fifty years after Kramers,” Rev. Mod. Phys. 62, 251–341 (1990). 10.1103/revmodphys.62.251 [DOI] [Google Scholar]

[c2] 2.Peters B., Reaction Rate Theory and Rare Events (Elsevier, 2017). [Google Scholar]

[c3] 3.E W. and Vanden-Eijnden E., “Towards a theory of transition paths,” J. Stat. Phys. 123, 503–523 (2006). 10.1007/s10955-005-9003-9 [DOI] [Google Scholar]

[c4] 4.Vanden-Eijnden E., “Transition path theory,” in Computer Simulations in Condensed Matter Systems: From Materials to Chemical Biology, Lecture Notes in Physics Vol. 1, edited by Ferrario M., Ciccotti G., and Binder K. (Springer, Berlin, Heidelberg, 2006), pp. 453–493. [Google Scholar]

[c5] 5.E W. and Vanden-Eijnden E., “Transition-path theory and path-finding algorithms for the study of rare events,” Annu. Rev. Phys. Chem. 61, 391–420 (2010). 10.1146/annurev.physchem.040808.090412 [DOI] [PubMed] [Google Scholar]

[c6] 6.Metzner P., Schütte C., and Vanden-Eijnden E., “Transition path theory for Markov jump processes,” Multiscale Model. Simul. 7, 1192–1219 (2009). 10.1137/070699500 [DOI] [Google Scholar]

[c7] 7.Metzner P., Schütte C., and Vanden-Eijnden E., “Illustration of transition path theory on a collection of simple examples,” J. Chem. Phys. 125, 084110 (2006). 10.1063/1.2335447 [DOI] [PubMed] [Google Scholar]

[c8] 8.Du R., Pande V. S., Grosberg A. Y., Tanaka T., and Shakhnovich E. S., “On the transition coordinate for protein folding,” J. Chem. Phys. 108, 334–350 (1998). 10.1063/1.475393 [DOI] [Google Scholar]

[c9] 9.Suárez E., Adelman J. L., and Zuckerman D. M., “Accurate estimation of protein folding and unfolding times: Beyond Markov state models,” J. Chem. Theory Comput. 12, 3473–3481 (2016). 10.1021/acs.jctc.6b00339 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c10] 10.Vanden-Eijnden E. and Venturoli M., “Exact rate calculations by trajectory parallelization and tilting,” J. Chem. Phys. 131, 044120 (2009). 10.1063/1.3180821 [DOI] [PubMed] [Google Scholar]

[c11] 11.Dickson A., Warmflash A., and Dinner A. R., “Separating forward and backward pathways in nonequilibrium umbrella sampling,” J. Chem. Phys. 131, 154104 (2009). 10.1063/1.3244561 [DOI] [PubMed] [Google Scholar]

[c12] 12.Vani B. P., Weare J., and Dinner A. R., “Computing transition path theory quantities with trajectory stratification,” J. Chem. Phys. 157, 034106 (2022). 10.1063/5.0087058 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c13] 13.Miron P., Beron-Vera F. J., Helfmann L., and Koltai P., “Transition paths of marine debris and the stability of the garbage patches,” Chaos 31, 033101 (2021). 10.1063/5.0030535 [DOI] [PubMed] [Google Scholar]

[c14] 14.Helfmann L., Ribera Borrell E., Schütte C., and Koltai P., “Extending transition path theory: Periodically driven and finite-time dynamics,” J. Nonlinear Sci. 30, 3321–3366 (2020). 10.1007/s00332-020-09652-7 [DOI] [Google Scholar]

[c15] 15.Finkel J., Webber R. J., Gerber E. P., Abbot D. S., and Weare J., “Learning forecasts of rare stratospheric transitions from short simulations,” Mon. Weather Rev. 149, 3647–3669 (2021). 10.1175/mwr-d-21-0024.1 [DOI] [Google Scholar]

[c16] 16.Thiede E. H., Giannakis D., Dinner A. R., and Weare J., “Galerkin approximation of dynamical quantities using trajectory data,” J. Chem. Phys. 150, 244111 (2019). 10.1063/1.5063730 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c17] 17.Strahan J., Antoszewski A., Lorpaiboon C., Vani B. P., Weare J., and Dinner A. R., “Long-time-scale predictions from short-trajectory data: A benchmark analysis of the trp-cage miniprotein,” J. Chem. Theory Comput. 17, 2948–2963 (2021). 10.1021/acs.jctc.0c00933 [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Augmented transition path theory for sequences of events

Chatipat Lorpaiboon

Jonathan Weare

Aaron R Dinner

Abstract

I. INTRODUCTION

II. FRAMEWORK

A. Ensemble of reactive trajectories

B. Transition path theory

C. A motivating reaction

FIG. 1.

D. Augmented transition path theory

III. REACTIVE STATISTICS

A. Reactive flux

B. Reactive density

C. Reactive current

D. Committors

E. Conditional mean first and last passage times

IV. CONSTRUCTION OF THE AUGMENTED PROCESS

A. Decomposition of Ω

B. Building augmented processes by composition

C. Augmented process for the motivating reaction

FIG. 2.

V. ALGORITHM

VI. NUMERICAL EXAMPLES

A. Reaction through an intermediate

FIG. 3.

FIG. 4.

FIG. 5.

B. Reaction with multiple pathways

FIG. 6.

FIG. 7.

FIG. 8.

VII. DISCUSSION

ACKNOWLEDGMENTS

APPENDIX: FINITE DIFFERENCE SCHEME

AUTHOR DECLARATIONS

Conflict of Interest

Author Contributions

DATA AVAILABILITY

REFERENCES

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases