Time-Ordered Product Expansions for Computational Stochastic Systems Biology

Eric Mjolsness

doi:10.1088/1478-3975/10/3/035009

Author manuscript; available in PMC: 2014 Jun 4.

Published in final edited form as: Phys Biol. 2013 Jun 4;10(3):035009. doi: 10.1088/1478-3975/10/3/035009

PMCID: PMC3786790 NIHMSID: NIHMS503994 PMID: 23735739

Abstract

The time-ordered product framework of quantum field theory can also be used to understand salient phenomena in stochastic biochemical networks. It is used here to derive Gillespie’s Stochastic Simulation Algorithm (SSA) for chemical reaction networks; consequently, the SSA can be interpreted in terms of Feynman diagrams. It is also used here to derive other, more general simulation and parameter-learning algorithms, including simulation algorithms for networks of stochastic reaction-like processes operating on parameterized objects, and also hybrid stochastic reaction/differential equation models in which systems of ordinary differential equations evolve the parameters of objects that can also undergo stochastic reactions. Thus, the time-ordered product expansion (TOPE) can be used systematically to derive simulation and parameter-fitting algorithms for stochastic systems.

1 Introduction

The master equation for a continuous-time stochastic dynamical system may be expressed as dp/dt = W · p where W is the time-evolution operator, often an infinite-dimensional matrix. Particular choices for W lead to the special case of the “chemical master equation” for stochastic chemical kinetics, often useful in biological applications; we will see this and other applications below. The general master equation has the formal solution p(t) = exp(tW) · p(0). If W can be decomposed as a sum W₀ + W₁, then there is a perturbation theory for exp(tW) in terms of exp(tW₀) and its perturbations by W₁. The time-ordered product expansion (which we refer to by the acronym TOPE) gives a formula for the solution of a master equation [1–3] which can be expressed as follows [4]:

\begin{array}{l} exp (t W) \cdot p_{0} = exp (t (W_{0} + W_{1})) \cdot p_{0} \\ = \sum_{k = 0}^{\infty} [\int_{0}^{t} {d t}_{k} \int_{0}^{t_{k}} {d t}_{k - 1} \dots \int_{0}^{t_{2}} {d t}_{1} exp ((t - t_{k}) W_{0}) \cdot W_{1} \\ \cdot exp ((t_{k} - t_{k - 1}) W_{0}) \dots \cdot W_{1} \cdot exp (t_{1} W_{0}) \cdot p_{0}] \end{array}

(1)

This expression can be derived (as in [4]) by expanding in powers of W₁, each expanded to all orders in W₀, and using the normalization formula for the Dirichlet distribution to subdivide the time interval [0, t] into k subintervals.
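As a worked illustration, the first three summands (k = 0, 1, 2) of Equation 1 can be written out explicitly. This is only an unrolling of the sum above, in the same notation; nothing beyond Equation 1 is assumed.

```latex
\begin{array}{l}
exp (t (W_{0} + W_{1})) \cdot p_{0} = exp (t W_{0}) \cdot p_{0} \\
+ \int_{0}^{t} {d t}_{1} exp ((t - t_{1}) W_{0}) \cdot W_{1} \cdot exp (t_{1} W_{0}) \cdot p_{0} \\
+ \int_{0}^{t} {d t}_{2} \int_{0}^{t_{2}} {d t}_{1} exp ((t - t_{2}) W_{0}) \cdot W_{1} \cdot exp ((t_{2} - t_{1}) W_{0}) \cdot W_{1} \cdot exp (t_{1} W_{0}) \cdot p_{0} \\
+ \dots
\end{array}
```

Read right to left, the k = 1 term evolves under W₀ for a time t₁, applies W₁ once, and then evolves under W₀ for the remaining time t − t₁.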

Since W₀ and W₁ do not generally commute, this expression involves alternation from right to left of W₀-related and W₁-related operations. Using the “time-ordered exponential” of operators [5], this result can be compactly reexpressed as:

exp (t (W_{0} + W_{1})) = exp ({t W}_{0}) {(exp (\int_{0}^{t} W_{1} (τ) d τ))}_{+}

(2)

where

W_{1} (τ) \equiv exp (- τ W_{0}) W_{1} exp (τ W_{0}) .

(3)

Here ${(exp (\int_{0}^{t} G (τ) d τ))}_{+}$ is obtained term by term from the Taylor series for the operator exponential, by reordering all monomials containing terms evaluated at different times so that they are indexed by ordered sequences of times (τ_k, …, τ₁) that increase right to left (details reviewed in Section 3.5.1 below). In field theory it is standard to prefer Equation 2 over Equation 1 for theoretical calculations, but for algorithmic concreteness this paper will favor the more explicit expression Equation 1 where possible. Indeed, each summand of Equation 1 already looks like a Markov chain in which the matrix or operator product operation “·”, which sums over states, is supplemented by integration over an extra time variable. This observation will be made precise in Section 3. In general there is a risk that the infinite sum over terms could diverge. However, the master equation must conserve total probability and this constrains W to have zero column-sums and also constrains the spectrum of W to have nonpositive real parts. In this setting some decompositions W = W₀ + W₁ converge well enough, as we will see by example below.

One particular specialization of TOPE lets us derive Gillespie’s Stochastic Simulation Algorithm (SSA): take W₀ = −D = the diagonal part of W, and W₁ = Ŵ = the off-diagonal part of W. Then for chemical reaction networks TOPE generates Feynman-like diagrams. An example is illustrated below for the simple reaction network with just two reactions, the forwards and backwards parts of the generic trivalent reaction A + B ⇌ C, to which others can be reduced.

The TOPE (Equation 1 or Equation 2) can be applied recursively, since it reduces one operator exponential exp(tW) to another one exp(tW₀). This fact will be exploited in Section 3 below. But eventually one must get to an operator exponential that is tractable by other means. One way to do this is to let W₀ = −D = the diagonal part of W, as in the SSA algorithm derivation below.

2 Methods

2.1 Creation/annihilation operator notation

We will use operator notation for molecule (or other reactant) creation and annihilation state changes [1–4]. Here we just review the notation as used in [4]. The elementary operators a and â act (respectively) to destroy and create identical particles of a given type. In the particle-number basis their elements have the entirely off-diagonal expressions

a_{i j} = j δ_{i j - 1} and {\hat{a}}_{i j} = δ_{i j + 1} forall i, j \in {0, 1, 2, \dots} .

(4)

Here δ_ij is the Kronecker delta function. The creation and annihilation operators satisfy the Heisenberg algebra [a, â] = I but are different from those of quantum mechanics because they are not conjugates or transposes of one another. (This is the reason we do not denote the creation operator a^†, as it is in quantum mechanics, or a^*.) Instead of being conjugate to â, the annihilator a encodes the chemical law of mass action since its nonzero entries are equal to the number of particles available to react or decay. The diagonal “number operator” is N ≡ âa.
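The matrix elements of Equation 4 can be checked on a finite truncation of the particle-number basis. The sketch below is illustrative only: the truncation size `n_states` and all function names are assumptions, and the truncation necessarily spoils the commutator in the last diagonal entry.

```python
def annihilation(n_states):
    """Truncated annihilation matrix: a[i][j] = j if i == j - 1, else 0."""
    return [[j if i == j - 1 else 0 for j in range(n_states)]
            for i in range(n_states)]


def creation(n_states):
    """Truncated creation matrix: a_hat[i][j] = 1 if i == j + 1, else 0."""
    return [[1 if i == j + 1 else 0 for j in range(n_states)]
            for i in range(n_states)]


def matmul(x, y):
    """Product of two square matrices stored as lists of rows."""
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def commutator(x, y):
    """[x, y] = x y - y x."""
    xy, yx = matmul(x, y), matmul(y, x)
    n = len(x)
    return [[xy[i][j] - yx[i][j] for j in range(n)] for i in range(n)]


n_states = 6                      # hypothetical truncation: n = 0..5 particles
a = annihilation(n_states)
a_hat = creation(n_states)
number_op = matmul(a_hat, a)      # N = a_hat a, diagonal with entries 0..5
comm = commutator(a, a_hat)       # identity except in the last diagonal entry
```

The entries of `a` grow with the particle number, which is how the annihilator carries the law of mass action; `a_hat` has unit entries, so the two matrices are not transposes of one another.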

The creation and annihilation operators may be represented in terms of their action on probability generating functions $g (z) = \sum_{n = 0}^{\infty} p_{n} z^{n}$ , where p_n is the probability that there exist n particles of a given type. In this case:

a = \partial_{z} \dots and \hat{a} = z \times \dots

(5)

In the presence of different types of particles (e.g. molecules or other objects) the creation/annihilation operator notation is generalized, e.g. to a_α and â_β for molecule types A_α, in which all operators for unequal types commute:

[a_{α}, {\hat{a}}_{β}] \equiv a_{α} {\hat{a}}_{β} - {\hat{a}}_{β} a_{α} = δ_{α β}

Operating on an empty “vacuum” state |0〉 with no objects, the monomials in the creation operators â_β span a Fock space. Molecule or object types indexed by α may even be taken to include arbitrary discrete-valued molecular attributes (or attributes of other objects) such as phosphorylation state or integer-valued parameters. Continuous-valued parameters such as position (in quantum field theory it would more naturally be the conserved momentum, unlike the typical viscous-medium dynamics in biology) may be encoded into a real-valued vector argument x which requires a Dirac delta function instead of a Kronecker delta function, so for example:

[a_{α} (x), {\hat{a}}_{β} (y)] = δ_{α β} δ (x - y)

(6)

A non-molecular example of such parameterized objects would be: cells of a given real-valued volume and/or lengthscale as in Section 3.5.5 below.

However for some attributes such as real-valued object positions one may wish to limit the state space to between zero and n_{max,α} molecules (or other objects) at each unique real value. The resulting commutator is still diagonal as described in [4]. The particular case n_{max,α} = 1 is not a stochastic version of fermions because particles with different types or values of the attributes still commute rather than anticommuting.

The basic rule for translating chemical reactions into creation/annihilation operator notation is: first, annihilate all objects on the incoming or left hand side of a reaction; then create all the objects on the outgoing or right hand side of a reaction. Thus, the off-diagonal part of the operator for a reaction

\begin{array}{l} {A_{α (p)} (x_{p}) ∣ 1 \leq p \leq p_{max}}_{*} \to {A_{β (q)} (y_{q}) ∣ 1 \leq q \leq q_{max}}_{*} \\ with reaction rate ρ_{r} ({[x_{p}]}_{1}^{p_{max}}, {[y_{q}]}_{1}^{q_{max}}) \end{array}

that converts an incoming multiset {···}_* of numerically parameterized reactants {A_{α(p)}(x_p)|p ∈ lhs(r)}_* each with parameter vector x_p (reactants can appear multiple times in a multiset) into an outgoing multiset {A_{β(q)}(y_q)|q ∈ rhs(r)}_* each with parameter vector y_q, is:

{\hat{O}}_{r} = ρ_{r} ([x_{p}], [y_{q}]) [\prod_{q \in rhs (r)} {\hat{a}}_{β (q)} (y_{q})] [\prod_{p \in lhs (r)} a_{α (p)} (x_{p})] .

(7)

There is one such operator for every possible set of values for the numerical parameters. Since time-evolution operators for different processes just add, a generic operator for all parameter values must sum and/or integrate the operator of Equation 7 over all the parameters, in the Cartesian product of measure spaces in which they take values:

{\hat{O}}_{r} = \int \int d {x} d {y} ρ_{r} ([x_{p}], [y_{q}]) [\prod_{q \in rhs (r)} {\hat{a}}_{β (q)} (y_{q})] [\prod_{p \in lhs (r)} a_{α (p)} (x_{p})]

(8)

The generalization is conceptually straightforward because we have simply used a function ρ_r([x_p], [y_q]) to express the possibly infinite number of different reaction rates that pertain to objects that differ only in their attributes. Because of the algebra of noncommuting basic creation and annihilation operators, reaction operators Ô_r and Ô_{r′} for reactions r ≠ r′ that produce and consume a shared reactant A_α(x) (or A_α for reactants with type α but no other parameters) generally also have nonzero commutators.

Equation 7 or Equation 8 adds probability to the new state of the system, but does not take it away from the old state of the system before a reaction. That job requires a negative diagonal matrix as shown in Equation 10 below. In the case of Equation 7, the corresponding diagonal operator is $D_{r} = ρ_{r} [\prod_{p \in lhs (r)} N_{α (p)} (x_{p})]$ . Examples are provided in [4] and below.

2.2 Solvable example: An exact solution for SSA behavior

For a few very simple examples, we can not only solve analytically for the behavior of the biochemical system, but we can even add in the behavior of the SSA simulation algorithm and solve for that exactly as well. For example consider the minimal bidirectional reaction A ↔ Ø. This case is analytically solvable, including the complete statistics of its SSA algorithm simulation. It has forward synthesis and backwards decay reactions. The operator expression is therefore:

W = k_{s} (α \hat{a} - I) + k_{d} (α a - N)

(9)

Here α = 1 is the generating function variable for the total number of reactions, corresponding to off-diagonal matrix elements of W. Power series in α will decompose total probability according to this number.

Translating the master equation for Equation 9 into a PDE in the two variables t and z using representation Equation 5, and solving analytically, this model has the exact solution

\begin{array}{l} g_{m} (z, t ∣ α) = {(α + (z - α) e^{- k_{d} t})}^{m} exp [- \frac{k_{s}}{k_{d}} ((1 - α^{2}) k_{d} t + (z α - α^{2}) (e^{- k_{d} t} - 1))] \\ = {(α + (z - α) e^{- k_{d} t})}^{m} exp [\frac{k_{s}}{k_{d}} (z α - 1) (1 - e^{- k_{d} t})] exp [\frac{k_{s}}{k_{d}} (α^{2} - 1) (k_{d} t + e^{- k_{d} t} - 1)] \\ = \text{Binomial initial condition with decay} * \text{Poisson on forward reactions} \\ * \text{Poisson on forward/backward reaction pairs} . \end{array}

As usual z is the generating function variable whose exponent is the total number n_A of A molecules or particles, m is the initial number of molecules, and t is continuous time. The * operation is a convolution of probability distributions. A product of generating functions with the same variable is a convolution of distributions [7]. Note the interpretation in terms of Binomials and Poissons with time-varying parameters. The third factor represents a linearly increasing number of canceling forward/backward reaction pairs as a function of time - a kind of random walk.
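At α = 1 the third factor equals 1 and the solution reduces to a Binomial distribution (survivors of the m initial molecules) convolved with a Poisson distribution. The sketch below compares that closed form against a direct numerical integration of the master equation for A ↔ Ø. It is a check under stated assumptions rather than part of the derivation: the truncation `n_max`, the rate constants, the fixed-step RK4 integrator and all function names are illustrative choices.

```python
import math


def rhs(p, k_s, k_d):
    """dp/dt for A <-> 0 on states n = 0..n_max (synthesis blocked at n_max)."""
    n_max = len(p) - 1
    dp = [0.0] * len(p)
    for n in range(len(p)):
        gain_s = k_s * p[n - 1] if n > 0 else 0.0
        gain_d = k_d * (n + 1) * p[n + 1] if n < n_max else 0.0
        loss_s = k_s * p[n] if n < n_max else 0.0
        dp[n] = gain_s + gain_d - loss_s - k_d * n * p[n]
    return dp


def rk4(p, k_s, k_d, t_end, steps):
    """Classical fixed-step fourth-order Runge-Kutta integration."""
    h = t_end / steps
    for _ in range(steps):
        k1 = rhs(p, k_s, k_d)
        k2 = rhs([x + 0.5 * h * k for x, k in zip(p, k1)], k_s, k_d)
        k3 = rhs([x + 0.5 * h * k for x, k in zip(p, k2)], k_s, k_d)
        k4 = rhs([x + h * k for x, k in zip(p, k3)], k_s, k_d)
        p = [x + h / 6.0 * (a + 2 * b + 2 * c + d)
             for x, a, b, c, d in zip(p, k1, k2, k3, k4)]
    return p


def exact_pmf(n, m, k_s, k_d, t):
    """Coefficient of z^n in g_m(z, t | alpha = 1): Binomial * Poisson."""
    q = math.exp(-k_d * t)               # survival probability of one molecule
    lam = (k_s / k_d) * (1.0 - q)        # Poisson mean of synthesized survivors
    total = 0.0
    for j in range(min(m, n) + 1):
        binom = math.comb(m, j) * q**j * (1.0 - q)**(m - j)
        pois = math.exp(-lam) * lam**(n - j) / math.factorial(n - j)
        total += binom * pois
    return total


m, k_s, k_d, t_end, n_max = 3, 2.0, 1.0, 1.0, 40   # illustrative values
p0 = [1.0 if n == m else 0.0 for n in range(n_max + 1)]
p_final = rk4(p0, k_s, k_d, t_end, steps=2000)
max_error = max(abs(p_final[n] - exact_pmf(n, m, k_s, k_d, t_end))
                for n in range(n_max + 1))
```

With these values the probability above n = 40 is negligible, so the truncation should not affect the comparison.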

The full derivation below will generalize this solvable example, again separating the diagonal from the off-diagonal terms in W.

2.3 Notation for SSA rederivation from TOPE

One specialization of TOPE lets us derive SSA for biochemical reaction networks, as follows. First decompose W into nonnegative off-diagonal and non-positive diagonal parts, as must be possible by the conservation and nonnegativity of probability. For example conservation of probability implies ∀p 0 = d(1 · p)/dt = (1 · W) · p ⇒ 1 · W = 0. Then

\begin{array}{l} W = \hat{W} - D, where D ≜ diag (1 \cdot \hat{W}), i . e . \\ {\hat{W}}_{I J} ≜ (1 - δ_{I J}) W_{I J} and D_{I J} ≜ δ_{I J} \sum_{K} {\hat{W}}_{K J} ≜ δ_{I J} D_{I} \end{array}

(10)

where I and J index the possible states of the system. To prevent negative probabilities from evolving under the master equation, all entries of Ŵ and therefore D must be nonnegative. In this circumstance the TOPE becomes:

\begin{array}{l} exp (t (\hat{W} - D)) = \sum_{k = 0}^{\infty} [\int_{0}^{t} \dots \int_{0}^{t} (\prod_{q = 0}^{k} d τ_{q}) δ (t - \sum_{q = 0}^{k} τ_{q}) \times exp (- τ_{k} D) \hat{W} \dots exp (- τ_{1} D) \hat{W} exp (- τ_{0} D)] \\ = \sum_{k = 0}^{\infty} \int_{0}^{t} \dots \int_{0}^{t} (\prod_{q = 0}^{k} d τ_{q}) δ (t - \sum_{q = 0}^{k} τ_{q}) exp (- τ_{k} D) [\prod_{q = k - 1 ↓}^{0} \hat{W} exp (- τ_{q} D)] \end{array}

Since the summands over k and integrands over ${[τ_{q}]}_{0}^{k}$ are mutually exclusive, exhaustive and nonnegative, we define the conditional probability distribution on k and ${[τ_{q}]}_{0}^{k}$ by these summand/integrands (where ${[τ_{q}]}_{0}^{k} ≜ [τ_{0}, \dots τ_{k}]$ denotes an ordered contiguous sequence of time intervals):

Pr (I, {[τ_{q}]}_{0}^{k}, k ∣ J, t) ≜ {exp (- τ_{k} D) [\prod_{q = k - 1 ↓}^{0} \hat{W} exp (- τ_{q} D)] δ (t - \sum_{q = 0}^{k} τ_{q})}_{I, J}

(11)

(where a product over zero terms such as $\prod_{q = - 1 ↓}^{0}$ is interpreted as the identity matrix, and products over negative numbers of terms such as $\prod_{q = - 2 ↓}^{0}$ should not occur). For D_II ≠ 0,

Pr (I, {[τ_{q}]}_{0}^{k}, k ∣ J, t) = {exp (- τ_{k} D) [\prod_{q = k - 1 ↓}^{0} (\hat{W} D^{- 1}) (D exp (- τ_{q} D))] δ (t - \sum_{q = 0}^{k} τ_{q})}_{I, J}

Either way,

Pr (I ∣ J, t) = \sum_{k = 0}^{\infty} \int_{0}^{t} \dots \int_{0}^{t} (\prod_{q = 0}^{k} d τ_{q}) Pr (I, {[τ_{q}]}_{0}^{k}, k ∣ J, t) = {[exp (t (\hat{W} - D))]}_{I, J} .

The bracket notation ${[X_{q}]}_{min}^{max} ≜ [X_{min}, \dots X_{max}]$ for an ordered set of components indexed by q will also be used for state variables ${[I_{q}]}_{min}^{max}$ . The notation “≜” means “equal by definition”. In what follows, the notation “Θ(Pred)” where Pred is a predicate is the Heaviside step function or indicator function taking the value 1 if the predicate is true and 0 if it is false.
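The decomposition W = Ŵ − D of Equation 10 can be illustrated on a small matrix. The sketch below builds a truncated W for the reaction A ↔ Ø of Section 2.2 (with α = 1), splits it into its off-diagonal part and the vector of diagonal entries D_J, and leaves the sign and column-sum properties to be checked. The truncation `n_max`, the rate values and the function names are assumptions made for illustration.

```python
def build_birth_death(k_s, k_d, n_max):
    """W for A <-> 0 on states n = 0..n_max; w[i][j] is the rate from j to i."""
    size = n_max + 1
    w = [[0.0] * size for _ in range(size)]
    for n in range(size):
        if n < n_max:
            w[n + 1][n] += k_s          # synthesis: n -> n + 1
        if n > 0:
            w[n - 1][n] += k_d * n      # decay: n -> n - 1 (mass action)
    for j in range(size):               # diagonal chosen so columns sum to zero
        w[j][j] = -sum(w[i][j] for i in range(size) if i != j)
    return w


def split(w):
    """Return (w_hat, d) with w_hat the off-diagonal part and d[j] = D_J."""
    size = len(w)
    w_hat = [[w[i][j] if i != j else 0.0 for j in range(size)]
             for i in range(size)]
    d = [sum(w_hat[i][j] for i in range(size)) for j in range(size)]
    return w_hat, d


w = build_birth_death(k_s=2.0, k_d=1.0, n_max=5)
w_hat, d = split(w)
column_sums = [sum(w[i][j] for i in range(len(w))) for j in range(len(w))]
```

For state n below the truncation boundary, `d[n]` equals k_s + k_d n, the total propensity to leave that state.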

2.4 Semigroup property

Suppose t = t₁ + t₂, all nonnegative. Then for any time-evolution equation we must have the semigroup property:

Pr (I ∣ J, t) = \sum_{K} Pr (I ∣ K, t_{2}) Pr (K ∣ J, t_{1}) .
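The time-interval form of the semigroup property can be checked numerically on the simplest nontrivial case, a two-state system with rates a (state 0 to 1) and b (state 1 to 0), for which exp(tW) has a closed form. The system, the rate values and the function names below are assumptions chosen for illustration; they are not taken from the derivation.

```python
import math


def two_state_propagator(a, b, t):
    """exp(tW) for W = [[-a, b], [a, -b]], whose columns sum to zero.

    Uses W = (a + b) (Pi - I), where both columns of the projector Pi equal
    the stationary distribution (b, a) / (a + b).
    """
    s = a + b
    decay = math.exp(-s * t)
    pi = (b / s, a / s)
    return [[pi[i] + decay * ((1.0 if i == j else 0.0) - pi[i])
             for j in range(2)] for i in range(2)]


def matmul(x, y):
    """Product of two 2x2 matrices stored as lists of rows."""
    return [[sum(x[i][k] * y[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]


a, b, t1, t2 = 0.7, 1.9, 0.4, 1.1          # illustrative values
whole = two_state_propagator(a, b, t1 + t2)
composed = matmul(two_state_propagator(a, b, t2),
                  two_state_propagator(a, b, t1))
max_diff = max(abs(whole[i][j] - composed[i][j])
               for i in range(2) for j in range(2))
```

Here `composed[i][j]` is the sum over the intermediate state K of Pr(I | K, t₂) Pr(K | J, t₁), and it should agree with `whole[i][j]` up to rounding.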

Is there a k-event version of this rule, for k = k₁ + k₂? In other words, can we add (nonnegative) numbers of reaction events rather than time intervals? We observe (where again ${[τ_{q}]}_{0}^{k} \equiv [τ_{0}, \dots τ_{k}]$ )

Pr (I, k ∣ J, t) = \int_{0}^{t} \dots \int_{0}^{t} (\prod_{q = 0}^{k} d τ_{q}) Pr (I, {[τ_{q}]}_{0}^{k}, k ∣ J, t) .

Then, according to a derivation given in Appendix I, if k = k₁ + k₂ and for any $τ_{k_{1}}^{'} \in [0, τ_{k_{1}}]$ , there is a semigroup law:

Pr (I, {[τ_{q}]}_{0}^{k}, k ∣ J, t) = \sum_{K} \int_{0}^{t} d τ Pr (I, [τ_{k_{1}}^{'}, τ_{k_{1} + 1}, \dots τ_{k}], k_{2} ∣ K, τ) \times Pr (K, [τ_{0}, \dots τ_{k_{1} - 1}, τ_{k_{1}} - τ_{k_{1}}^{'}], k_{1} ∣ J, t - τ) .

(12)

In this result there is an arbitrary choice of $τ_{k_{1}}^{'}$ from the interval [0, τ_k₁]. However this form does not yet pertain to conditional probabilities of the form Pr(I, t|k, J), as needed to obtain a computable Markov process algorithm.

3 Results and discussion

Given the foregoing notation, we undertake the derivation of a Markov chain representing the SSA algorithm. We then consider extensions of this result, including parameterized reactants, but focussing mainly on hybrid stochastic event/ordinary differential equation dynamical systems.

3.1 Derivation of a Markov chain

3.1.1 Bayesian recurrence

In Appendix I we argue that the correct Bayesian strategy for moving from Pr(I, k|t, J) to Pr(I, t|k, J), as needed to obtain a simulatable Markov chain, is to consider large stopping times T ≫ t which are overwhelmingly likely to have large reaction numbers n ≫ k; then to marginalize the probability distribution Pr([I], [τ], n|J, T) over all event numbers n > k and to conditionalize it over all event numbers q < k. By that means in Appendix I we derive the recurrence relation

\tilde{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J) = \tilde{Pr} (I_{k}, τ_{k - 1} ∣ 1, I_{k - 1}) \tilde{Pr} ({[I_{q}]}_{1}^{k - 1}, {[τ_{q}]}_{0}^{k - 2} ∣ k - 1, J)

(13)

where

\tilde{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J) ≜ lim_{T \to \infty} \sum_{n = k + 1}^{\infty} \sum_{{{[I_{q}]}_{k + 1}^{n}}} \int_{0}^{T} \dots \int_{0}^{T} {[d τ_{q}]}_{k}^{n} Pr ({[I_{q}]}_{1}^{n}, {[τ_{q}]}_{0}^{n}, n ∣ J, T)

(14)

and in particular

\begin{array}{l} \tilde{Pr} (I_{k}, τ_{k - 1} ∣ 1, I_{k - 1}) = \frac{\tilde{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J)}{\tilde{Pr} ({[I_{q}]}_{1}^{k - 1}, {[τ_{q}]}_{0}^{k - 2} ∣ k - 1, J)} \\ = {\hat{W}}_{I_{k} I_{k - 1}} exp (- τ_{k - 1} D_{I_{k - 1}}) Θ (τ_{k - 1} \geq 0) . \end{array}

(15)

Note that $\tilde{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J)$ marginalizes over τ_k, the time elapsed since the last event k, as well as all later times and events. It is a distribution on histories up to and including the “just-fired” k’th reaction event, within a much longer history.

3.1.2 Markov chain - Summary

From the foregoing Bayesian recurrence equation, and the definition

\tilde{Pr} (I, t_{k} ∣ k, J) ≜ \sum_{{{[I_{q}]}_{1}^{k - 1}}} \int_{0}^{\infty} \dots \int_{0}^{\infty} {[d τ_{q}]}_{0}^{k - 1} \tilde{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J) δ (t_{k} - \sum_{q = 0}^{k - 1} τ_{q}),

Appendix I shows $\tilde{Pr} (I, t_{k} ∣ J, k)$ is a probability density function and proves the following Markov-like property:

\tilde{Pr} (I, t ∣ k, J) = \sum_{K} \int_{0}^{t} d τ \tilde{Pr} (I, τ ∣ 1, K) \tilde{Pr} (K, t - τ ∣ k - 1, J) .

We may reexpress this result as

\tilde{Pr} (I, t ∣ k, J) \approx \sum_{K} \int_{0}^{t} d τ W (I, t ∣ K, t - τ) \tilde{Pr} (K, τ ∣ k - 1, J)

where we define the Markov chain kernel

W (I, t ∣ K, t - τ) ≜ \tilde{Pr} (I, τ ∣ 1, K) = {\hat{W}}_{I K} exp (- τ D_{K}) Θ (τ \geq 0)

(16)

In vector/operator notation, for k ≥ 2,

\tilde{Pr} (., . ∣ k, J) \equiv W \circ \tilde{Pr} (., . ∣ k - 1, J)

and finally the algorithmic Markov chain expression for SSA including now an initial distribution $\tilde{Pr} (J ∣ k = 0)$ over J at time t = 0, we have for all k ≥ 0:

\tilde{Pr} (. ∣ k) = W^{k} \circ \tilde{Pr} (. ∣ k = 0)

(17)

which expresses the iteration of a Markov chain of the SSA algorithm. Of course the factor [D_K exp(−τD_K)Θ(τ ≥ 0)] in Equation 16 is just the SSA exponential distribution of non-negative waiting times τ between reaction events, and Ŵ_IK/D_K is just the branching probability for immediately thereafter choosing a reaction that leads to state I.

The foregoing derivation was outlined in far less detail in [8]. A similar equation for SSA was reached by very different methods in [9], Theorem 10.1. To our knowledge this is the first complete derivation of SSA from field theory methods such as TOPE.

This Markov chain expression has also been used as the starting point for the derivation of exact accelerated stochastic simulation algorithms [10,11] that execute many reactions per step (i.e. they “leap” forward) and thus go much faster than SSA, while also sampling from the exact probability distribution given by the just-fired probabilities above. These derivations proceed by algebraic rearrangement of terms to express computationally efficient versions of rejection sampling. The algorithm of [11] has been parallelized, which is often difficult for discrete-event simulations.

3.2 Algorithm: SSA

The SSA algorithm represented by the Markov chain in Equation 16 and Equation 17 above may be written out in pseudocode as follows:

repeat {
   compute propensities k^(r)
   compute k^(total) = Σ_r k^(r)
   draw waiting time Δt from k^(total) exp(−Δt k^(total))
     t := t + Δt; // advance the clock by Δt
   draw reaction r from distribution k^(r)/k^(total) and execute reaction r
} until t ≥ t_max
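The pseudocode above can be sketched in Python for the two-reaction network A + B ⇌ C mentioned in the Introduction. This is a minimal illustration under stated assumptions, not a reference implementation: the mass-action propensities k_f·n_A·n_B and k_b·n_C, the rate values, and the function name `ssa` are hypothetical choices, and unlike the pseudocode the loop stops without executing a reaction whose firing time would exceed t_max.

```python
import random


def ssa(state, k_f, k_b, t_max, rng):
    """Gillespie SSA for A + B <-> C; state = (n_A, n_B, n_C)."""
    n_a, n_b, n_c = state
    t = 0.0
    trajectory = [(t, (n_a, n_b, n_c))]
    while True:
        # Propensities k^(r) of the forward and backward reactions.
        propensities = [k_f * n_a * n_b, k_b * n_c]
        k_total = sum(propensities)
        if k_total == 0.0:
            break                          # no reaction can fire
        # Waiting time drawn from k_total * exp(-dt * k_total).
        t += rng.expovariate(k_total)
        if t >= t_max:
            break
        # Choose reaction r with probability k^(r) / k_total and execute it.
        if rng.random() * k_total < propensities[0]:
            n_a, n_b, n_c = n_a - 1, n_b - 1, n_c + 1
        else:
            n_a, n_b, n_c = n_a + 1, n_b + 1, n_c - 1
        trajectory.append((t, (n_a, n_b, n_c)))
    return trajectory


trajectory = ssa((20, 15, 0), k_f=0.05, k_b=1.0, t_max=5.0,
                 rng=random.Random(1))
```

Each entry of `trajectory` is a pair (time, state) recorded just after a reaction fires, matching the “just-fired” distributions of Section 3.1.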

3.3 Extension: Parameterized rule and graph grammar SSA-like algorithm

For biological modeling, including spatial and mechanical modeling of biological systems, it is important to generalize from pure particles to particles with both discrete and continuous attributes. The complication is that reaction or process rates can then depend on the attributes both of the incoming and outgoing objects. A non-molecular example of such parameterized objects would be cells of a given size, whose propensity to divide may actually depend on their real-valued size parameter (as in Section 3.5.5). More generally, this capability enables agent-based modeling and simulation since it allows interacting objects to have dynamic internal state and even (as explained in Section 3.3.2 below) dynamic relationships.

The time-evolution operator of Equation 7 requires that each attribute or parameter vector consist of constants or variables, each variable appearing just once, and any relationships between variables (such as x_{2,1} = y_{1,2}, x_{2,1} = x_{1,2}, and/or y_{2,1} = y_{1,2}) enforced by the reaction rate ρ_r([x_p], [y_q]). Alternatively we can allow repeated appearances of symbolic variables X_c upon which the attributes may depend, through the identity function or otherwise. This is a useful improvement in reaction notation, although it may require special-purpose symbolic variable-binding algorithms to support efficiently. Generalizing from Equation 7 and Equation 8, as in [4], we include all possible instantiations of parameters x_p[X] and y_q[X], allowing for repeated occurrences of some or all of the variables X_c in [X], with the integrated off-diagonal process representation operator

{\hat{O}}_{r} = \int \dots \int d μ_{c} (X_{c}) ρ_{r} ([x_{p} [X]], [y_{q} [X]]) \times [\prod_{q \in rhs (r)} {\hat{a}}_{β (q)} (y_{q} [X])] [\prod_{p \in lhs (r)} a_{α (p)} (x_{p} [X])] .

(18)

As before, α(p) and β(q) represent the type of a parameterized object i.e. an object with attributes. Now the symbolic variables X_c each have a type c which has an integration measure μ_c. Again (as in Equation 7 or 8), summation over all discrete-valued parameters and integration over all continuous-valued parameters generalizes the operator to handle all possible sets of parameter values.

3.3.1 Algorithm: SSA with parametrized reactant objects

The resulting variant of the SSA algorithm for parameterized reactions can be expressed in pseudocode as follows (outlined briefly in [8]):

forall reactions r factor ρ^(r)(x_in, x_out) = k^(r)(x_in) p^(r)(x_out|x_in);
repeat {
   compute SSA propensities as k^(r)(x_in);
   compute k^(total) = Σ_r k^(r)(x_in);
   draw waiting time Δt from k^(total) exp(−Δt k^(total));
     t := t + Δt; // advance the clock by Δt
   draw reaction r from distribution k^(r)(x_in)/k^(total);
   draw x_out from p^(r)(x_out|x_in) and execute reaction r;
} until t ≥ t_max
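A minimal Python sketch of this parameterized variant follows, using the cell-division example mentioned above: each cell carries a real-valued volume, the propensity k(x_in) depends on that volume, and the daughter volumes are drawn from p(x_out | x_in). Every modeling choice here is an assumption made for illustration: the propensity `rate * volume`, the uniform split fraction, and the function names are hypothetical, and volumes do not grow between events because no differential equations are included at this stage.

```python
import random


def division_propensity(volume, rate):
    """k(x_in): assumed propensity for a cell of the given volume to divide."""
    return rate * volume


def sample_daughters(volume, rng):
    """Draw x_out from an assumed p(x_out | x_in): a near-even volume split."""
    fraction = rng.uniform(0.4, 0.6)
    return fraction * volume, (1.0 - fraction) * volume


def parameterized_ssa(volumes, rate, t_max, rng):
    """SSA in which every cell instantiates the division rule once."""
    volumes = list(volumes)
    t = 0.0
    while True:
        propensities = [division_propensity(v, rate) for v in volumes]
        k_total = sum(propensities)
        if k_total == 0.0:
            break
        t += rng.expovariate(k_total)        # waiting time to the next event
        if t >= t_max:
            break
        # Choose which cell divides, with probability k(x_in) / k_total.
        index = rng.choices(range(len(volumes)), weights=propensities)[0]
        parent = volumes.pop(index)
        volumes.extend(sample_daughters(parent, rng))
    return volumes


initial_volumes = [1.0, 1.0]
final_volumes = parameterized_ssa(initial_volumes, rate=1.0, t_max=3.0,
                                  rng=random.Random(7))
```

The factorization ρ = k · p appears directly in the code: `division_propensity` sets the timing and the choice of reacting object, and `sample_daughters` sets the attributes of the products.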

3.3.2 Structural matching

The functions ρ(x, y) appearing in Equation 18 may impose constraints including equality of variables; equivalently we may allow some variables to appear multiple times in object parameter lists. Either way there follows a mechanism to encode structural relations - graphs and labelled graphs - in the input and output variable lists. Object attributes can include Object ID codes which other objects can also include in their parameter lists. (Of course, the numeric values of Object IDs can be globally permuted without changing the structural relationships among extant objects.) In this way, the integrated version of the parameterized reaction operator above encodes structural pattern matching, including variable-binding in logical formulae, among the preconditions that can be enforced before such a generalized reaction or “rule” can fire.

From this point of view, syntactic variable-binding has the semantics of multiple integration [4]. In this way we can entrain pattern-matching systems such as the computer algebra system Mathematica, or logic-based programming languages, to the job of simulating complex process rules. As in rule-based expert systems, when multiple rules might fire the Rete algorithm [12] can be used to speed up the computations required to maintain knowledge of their relative rates.

The resulting systems have the power to model and simulate dynamic labelled graphs including growing multicellular tissues with dynamical cell-neighbor relationships [4] and molecular complexes with dynamical binding structure [13–15]. Thus, the TOPE operator algebra approach also explains why and how structural (graph-) matching computations arise naturally in biochemical and multicellular biological simulation.

3.4 Hybrid SSA/ODE setup

As will be shown in Section 3.4.1 below, the operator formulation for a system of ordinary differential equations is [4]:

{\hat{O}}_{drift} = - \int \int d {x} d {y} \hat{a} ({y}) a ({x}) [\sum_{i} \nabla_{y_{i}} (v_{i} ({y}) \prod_{k} δ (y_{k} - x_{k}))]

(19)

Here and in the calculations that follow, the Dirac delta function can be considered as a Gaussian with very small variance, which participates in a limiting process by which, at the end of each calculation, the limit of zero variance is taken.

In [4] this operator expression is generalized from ordinary differential equations to stochastic differential equations, for example those pertaining to the diffusion of particles, as equivalently represented by the Fokker-Planck equation.

3.4.1 Computation of matrix elements

From the commutator

[a (y), \hat{a} (x)] = δ (y - x) (I + Q (N (x) ∣ n_{max}) N (x)),

we may calculate matrix elements of Ô_drift in Equation 19 such as:

\begin{array}{l} 〈 w | {\hat{O}}_{drift} | z 〉 = - 〈 {w} | \int d {x} \int d {y} (\sum_{i} \nabla_{y_{i}} v_{i} ({y}) δ ({y} - {x})) \times \hat{a} ({y}) a ({x}) \hat{a} ({z}) | 0 〉 \\ = - 〈 {w} | \int d {x} \int d {y} (\sum_{i} \nabla_{y_{i}} v_{i} ({y}) δ ({y} - {x})) \times \hat{a} ({y}) δ ({x} - {z}) [I + Q (N (x)) N ({x})] | 0 〉 \\ = - \int d {x} \int d {y} (\sum_{i} \nabla_{y_{i}} v_{i} ({y}) δ ({y} - {x})) δ ({x} - {z}) 〈 {w} ∣ {y} 〉 \\ = - \int d {y} (\sum_{i} \nabla_{y_{i}} v_{i} ({y}) δ ({y} - {z})) δ ({w} - {y}) \\ = + \int d {y} δ ({y} - {z}) (\sum_{i} v_{i} ({y}) \nabla_{y_{i}} δ ({w} - {y})) \\ - \int_{\partial} d {y} (\sum_{i} v_{i} ({y}) δ ({y} - {z}) δ ({y} - {w})) \\ = \sum_{i} v_{i} ({z}) \nabla_{z_{i}} δ ({w} - {z}) + \text{boundary term} (\to 0 \text{ here}) \end{array}

The easiest treatment for the boundary terms is to add the assumptions that boundaries are at infinity in the space of parameters x, y and z, and that initial conditions place zero probability there, and that finite velocities v(x) ensure the probability remains zero at infinity at finite times. In that case boundary terms can be neglected. Alternatively, we can define O_drift = Ô_drift − diag(1·Ô_drift) which in this case subtracts off the boundary term. Then

O_{drift} = - \int \int d {x} d {y} (\hat{a} ({y}) a ({x}) - \hat{a} ({x}) a ({x})) \times [\sum_{i} \nabla_{y_{i}} (v_{i} ({y}) \prod_{k} δ (y_{k} - x_{k}))]

(20)

If we define x(t) as a time-varying version of z, satisfying

\frac{\partial x_{i}}{\partial t} = v_{i} ({x_{k}}),

then

\sum_{i} v_{i} ({z}) \nabla_{x_{i}} δ ({w} - {x (t)}) = \sum_{i} (\frac{\partial x_{i}}{\partial t}) (\frac{\partial}{\partial x_{i}}) δ ({w} - {x (t)}) = (\frac{d}{d t}) δ ({w} - {x (t)})

Next we calculate 〈w|exp(τO_drift)|z〉. To this end, Taylor’s theorem may be written

{Shift}_{τ} \circ f (t) = f (t + τ) ≃ \sum_{n = 0}^{\infty} \frac{τ^{n}}{n!} {(\frac{d}{d t})}^{n} f (t) ≃ e^{(τ D_{t})} f (t)

if τ is a constant. For small τ we have

\begin{array}{l} 〈 w ∣ exp (τ O_{drift}) ∣ x 〉 = 〈 w ∣ x 〉 + τ 〈 w ∣ O_{drift} ∣ x 〉 + O (τ^{2}) \\ = (1 + τ (\frac{d}{d t})) δ ({w} - {x (t)}) + O (τ^{2}) \\ = {Shift}_{τ} δ ({w} - {x (t)}) \equiv δ ({w} - {x (t + τ)}) + O (τ^{2}) \end{array}

For larger τ we have

\begin{array}{l} 〈 w ∣ exp (τ O_{drift}) ∣ x 〉 = {lim}_{n \to \infty} 〈 w | \prod_{i = 1}^{n} exp (\frac{τ}{n} O_{drift}) | x 〉 \\ = \int \dots \int {d x}_{n - 1} \dots {d x}_{1} [{Shift}_{τ / n} δ ({w} - {x_{n - 1}})] \dots [{Shift}_{τ / n} δ ({x_{1}} - {x})] \\ = {lim}_{n \to \infty} (\prod_{i = 1}^{n} {Shift}_{τ / n}) δ ({w} - {x (t)}) \\ = {Shift}_{τ} δ ({w} - {x (t)}) \equiv δ ({w} - {x (t + τ)}) \\ = δ ({w} - (z (t = 0) + \int_{0}^{t} v_{i} (z (t)) d t)) \end{array}

Thus (where “IC” means initial condition)

\begin{array}{l} 〈 w ∣ exp ({t O}_{{DE}}) ∣ z 〉 = exp (t \sum_{i} v_{i} ({z}) \nabla_{z_{i}}) δ ({w} - {z}) \\ = δ ({w} - ({z (0) = z} + \int_{0}^{t} v_{i} (z (t^{'})) {d t}^{'})) \\ = δ ({w} - (\text{Solution of } \frac{\partial x_{i}}{\partial t} = v_{i} ({x_{k}}) \text{ with IC } z (0) = z)) \end{array}

(21)

QED.

As far as we know this detailed derivation has not appeared previously, though our previous work [4] outlined a simplified version. As a corollary, using Equation 21 we may multiply by f(w) and integrate over w to calculate

\begin{array}{l} exp (t v ({z}) \cdot \nabla_{z}) δ (w - z) = δ (w - (z (0) + \int_{0}^{t} v (z (t^{'})) {d t}^{'})) \\ \Rightarrow exp (t v ({z}) \cdot \nabla_{z}) f (z) = f (z (0) + \int_{0}^{t} v (z (t^{'})) {d t}^{'}) . \end{array}

(22)
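Equation 22 can be checked numerically in one dimension when v and f are polynomials and v is linear, since then the operator series maps polynomials of a given degree to polynomials of the same degree and converges quickly. The sketch below sums the truncated series Σ tⁿ/n! (v ∂_z)ⁿ f and compares it with f evaluated along the exact flow. The choices v(z) = 1 − z and f(z) = z³, the number of terms, and the helper names are all assumptions made for illustration.

```python
import math


def poly_mul(p, q):
    """Multiply polynomials given as coefficient lists (lowest degree first)."""
    out = [0.0] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out


def poly_diff(p):
    """Derivative of a polynomial coefficient list."""
    return [i * c for i, c in enumerate(p)][1:] or [0.0]


def poly_add(p, q):
    """Add two coefficient lists of possibly different lengths."""
    n = max(len(p), len(q))
    return [(p[i] if i < len(p) else 0.0) + (q[i] if i < len(q) else 0.0)
            for i in range(n)]


def poly_eval(p, z):
    return sum(c * z**i for i, c in enumerate(p))


def exp_drift(v, f, t, terms=60):
    """Truncated series sum_n t^n / n! (v d/dz)^n f, as polynomial coefficients."""
    result = list(f)
    term = list(f)
    for n in range(1, terms):
        term = [c * t / n for c in poly_mul(v, poly_diff(term))]
        result = poly_add(result, term)
    return result


v = [1.0, -1.0]              # v(z) = 1 - z, so dz/dt = 1 - z
f = [0.0, 0.0, 0.0, 1.0]     # f(z) = z^3
z0, t = 0.5, 0.7
series_value = poly_eval(exp_drift(v, f, t), z0)
z_t = 1.0 + (z0 - 1.0) * math.exp(-t)   # exact solution of dz/dt = 1 - z
flow_value = poly_eval(f, z_t)
```

For nonlinear v the polynomial degree grows with each application of v ∂_z, so this particular check would need a different truncation strategy.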

3.5 Hybrid SSA/ODE: Operator algebra derivation

We now derive a new SSA-like simulation algorithm for hybrid combinations of discrete events and ODE dynamics, using operator algebra. The main idea is to replace the exponential distribution factor exp(−tD) with a time integral [15]:

exp (- t D) \to exp (- \int_{0}^{t} D (t^{'}) {d t}^{'}),

(23)

and to add an extra ODE to the system of ODEs in order to keep track of the integral. We will now use the more compact formulation of TOPE in Equation 2 to derive this method.

3.5.1 Heisenberg picture

Let the operators, rather than the states, evolve in time under W₀, as in Equation 3. This is traditionally called the “Heisenberg picture”, in distinction to the “Schroedinger picture”, in quantum mechanics. Recall Equation 2 and Equation 3, where (···)₊ is the time-ordering super-operator:

{(O (t_{i}) O (t_{j}))}_{+} = \begin{cases} O (t_{i}) O (t_{j}) & if t_{i} \geq t_{j} \\ O (t_{j}) O (t_{i}) & if t_{i} \leq t_{j} \end{cases}

(and likewise for higher order products). Note that if O(t_i) and O(t_j) commute for all pairs of times t_i and t_j, then (O(t_i)O(t_j))₊ = O(t_i)O(t_j), and the time-ordering operator (···)₊ can be dropped. Often the notation $\mathcal{T}$ (O(t_i)O(t_j)) is used in place of (O(t_i)O(t_j))₊ to denote the super-operator that time-orders operator products.
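For a finite list of operators tagged with times, the time-ordering super-operator amounts to sorting the factors so that times increase from right to left before multiplying. The sketch below shows this for small matrices; the function names and the representation of each factor as a (time, matrix) pair are assumptions made for illustration.

```python
def matmul(x, y):
    """Product of two square matrices stored as lists of rows."""
    n = len(x)
    return [[sum(x[i][k] * y[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def time_ordered_product(factors):
    """Multiply (time, matrix) pairs with later times placed to the left."""
    ordered = sorted(factors, key=lambda pair: pair[0], reverse=True)
    product = ordered[0][1]
    for _, op in ordered[1:]:
        product = matmul(product, op)
    return product


# Two non-commuting 2x2 matrices tagged with times 1.0 and 2.0.
op_early = [[0, 1], [0, 0]]
op_late = [[0, 0], [1, 0]]
ordered_1 = time_ordered_product([(1.0, op_early), (2.0, op_late)])
ordered_2 = time_ordered_product([(2.0, op_late), (1.0, op_early)])
```

Both calls return the product `op_late · op_early`, regardless of the order in which the factors are supplied; if the matrices commuted, the sort would have no effect on the result.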

3.5.2 Application to ODE + decay clock

The hybrid system consisting of chemical reactions (possibly parameterized) together with ordinary differential equations has the combined operator W = (Ô_react − D_react) + O_DE, which we can regroup as

W = (O_{DE} - D_{react}) + {\hat{O}}_{react}

and then apply TOPE to O_DE − D_react first with W₀₀ = O_DE and W₀₁(t_k) = − D_react(t_k), and then again to (O_DE − D_react)+ Ô_react with W₀ = W₀₀ + W₀₁ = O_DE − D_react and W₁ = Ô_react.

In the first application of TOPE to O_DE − D_react with W₀₀ = O_DE, the operators W₀₁(t_k) = −D_react(t_k) defined at different times are all diagonal in the same (particle number) basis and therefore commute:

[D_{react} (t_{i}), D_{react} (t_{j})] = 0.

In this circumstance, we can simply drop the time-ordering super-operator (···)₊ in Equation 2 and write

exp (t (O_{DE} - D_{react})) = exp ({t O}_{DE}) exp (- \int_{0}^{t} {d t}^{'} D_{react} (t^{'}))

(24)

where, as in Equation 3, D_react(t′) = exp(−t′O_DE)D_react exp(t′O_DE). In our case, Equation 24 specializes to:

\begin{array}{l} 〈 w ∣ exp (t (O_{{DE}} - D_{react})) ∣ z 〉 = exp (t \sum_{i} v_{i} ({z}) \nabla_{z_{i}}) exp (- \int_{0}^{t} {d t}^{'} D_{react} (t^{'})) δ ({w} - {z}) \\ = exp (- \int_{0}^{t} {d t}^{'} D_{react} (z (0) + \int_{0}^{t^{'}} v ({z}) d t^{″})) δ (w - (z (0) + \int_{0}^{t} v (z (t^{'})) {d t}^{'})) \end{array}

(25)

This result looks very similar to Equation 22 applied to

f (z) = exp (- \int_{0}^{t} {d t}^{'} D_{react} (t^{'})) δ ({w} - {z}),

and we now aim to understand and exploit this similarity.

3.5.3 Equivalent ODE

Consider the dynamics expressed in Equation 25. Can we obtain the first factor from ODE’s alone? Yes, if we introduce a new state variable τ involved in every ODE-related rule. Set τ(0) = 0 as the new variable’s initial condition, and augment the ODE operators as follows

\begin{array}{l} Z = (z, τ) \\ V (Z) = (v (z), D (z)) \\ \nabla_{Z} = (\nabla_{z}, \partial_{τ}) \\ {\tilde{O}}_{{DE}} = V (Z) \cdot \nabla_{Z} = v ({z}) \cdot \nabla_{z} + D (z) \partial_{τ} \end{array}

(26)

In other words, we have added a differential equation for τ to the ODE system

\frac{\partial x_{i}}{\partial t} = v_{i} ({x_{k}}) and \frac{d τ}{d t} = D (z) .

(27)

This equation is solvable in terms of a “warped time” coordinate

τ (t) = \int_{0}^{t} D_{react} (z (t^{'})) {d t}^{'} .

(28)

(Cf. Equation 23.) There are degenerate cases D_react = 0 only if there are terminal states in the reaction network.
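To make the warped time of Equation 28 concrete, the following sketch integrates a toy augmented system with a classical Runge-Kutta step and compares τ(t) with its closed form. The choices v(z) = V (constant) and D(z) = C·z, the constants, and the function names are assumptions made for illustration only.

```python
def rk4_step(f, y, h):
    """One classical fourth-order Runge-Kutta step for dy/dt = f(y)."""
    k1 = f(y)
    k2 = f([yi + 0.5 * h * ki for yi, ki in zip(y, k1)])
    k3 = f([yi + 0.5 * h * ki for yi, ki in zip(y, k2)])
    k4 = f([yi + h * ki for yi, ki in zip(y, k3)])
    return [yi + h / 6.0 * (a + 2 * b + 2 * c + d)
            for yi, a, b, c, d in zip(y, k1, k2, k3, k4)]

V, C = 0.5, 2.0     # hypothetical growth velocity and propensity coefficient

def augmented_rhs(state):
    """Right-hand side of the augmented system: dz/dt = v(z), dtau/dt = D(z)."""
    z, tau = state
    return [V, C * z]

def warped_time(z0, t_end, steps=1000):
    """Integrate (z, tau) from (z0, 0) to time t_end."""
    state = [z0, 0.0]
    h = t_end / steps
    for _ in range(steps):
        state = rk4_step(augmented_rhs, state, h)
    return state

z_end, tau_end = warped_time(z0=1.0, t_end=2.0)
# Closed form of Equation 28 for this toy case: tau(t) = C (z0 t + V t^2 / 2).
tau_exact = C * (1.0 * 2.0 + 0.5 * V * 2.0**2)
```

In a simulation the integration would instead be stopped when τ reaches a randomly drawn threshold τ_max, which requires a solver with root finding or event detection.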

To see that this is the correct procedure, calculate from Equation 21:

\begin{array}{l} 〈 (\begin{matrix} w \\ τ_{max} \end{matrix}) | exp (t {\tilde{O}}_{{DE}}) exp (- τ_{max}) | (\begin{matrix} z (0) \\ τ (0) = 0 \end{matrix}) 〉 \\ = exp (t (\sum_{i} v_{i} ({z}) \nabla_{z_{i}} + D (z) \partial_{τ})) δ ({w} - {z}) δ (τ_{max} - τ) exp (- τ_{max}) \\ = δ ({w} - (z (0) + \int_{0}^{t} v (z (t^{'})) {d t}^{'})) δ (τ_{max} - \int_{0}^{t} D_{react} (z (t^{'})) {d t}^{'}) exp (- τ_{max}) \\ = δ ({w} - (\text{solution of Equation 27, with IC } z (0), τ (0) = 0)) \times exp (- τ_{max}) \end{array}

(29)

This expression agrees with Equation 25, as required. But how do we ensure the IC on τ? That can be done as follows:

\begin{array}{l} 〈 (\begin{matrix} w \\ τ_{max} \end{matrix}) | exp (t {\tilde{O}}_{{DE}}) exp (- τ_{max}) | (\begin{matrix} z \\ 0 \end{matrix}) 〉 \\ = 〈 (\begin{matrix} w \\ τ_{max} \end{matrix}) | exp (t {\tilde{O}}_{{DE}}) exp (- τ_{max}) | (\begin{matrix} z \\ 0 \end{matrix}) 〉 \times (1 = \int d τ^{'} {d z}^{'} 〈 (\begin{matrix} z^{'} \\ τ^{'} \end{matrix}) | (\begin{matrix} z \\ τ \end{matrix}) 〉) \\ = 〈 (\begin{matrix} w \\ τ_{max} \end{matrix}) | exp (t {\tilde{O}}_{{DE}}) exp (- τ_{max}) (\int d τ^{'} {d z}^{'} | (\begin{matrix} z^{'} \\ 0 \end{matrix}) 〉 〈 (\begin{matrix} z^{'} \\ τ^{'} \end{matrix}) |) ∣ (\begin{matrix} z \\ τ \end{matrix}) 〉 \\ = 〈 (\begin{matrix} w \\ τ_{max} \end{matrix}) | exp (t {\tilde{O}}_{{DE}}) exp (- τ_{max}) P_{τ : = 0} | (\begin{matrix} z \\ τ \end{matrix}) 〉 \\ \text{where} \quad P_{τ : = 0} \equiv \int d τ^{'} {d z}^{'} | (\begin{matrix} z^{'} \\ 0 \end{matrix}) 〉 〈 (\begin{matrix} z^{'} \\ τ^{'} \end{matrix}) | \end{array}

(30)

is a projection operator (i.e. one that satisfies P · P = P) that resets the variable τ to zero after each use. In summary,

\begin{array}{l} 〈 (\begin{matrix} w \\ τ_{max} \end{matrix}) | exp (t {\tilde{O}}_{{DE}}) exp (- τ_{max}) P_{τ : = 0} | (\begin{matrix} z \\ τ \end{matrix}) 〉 \\ = δ ({w} - (\text{solution of Equation 27, with IC } z (0), τ (0) = 0)) \times exp (- τ_{max}) \end{array}

(31)

Clearly this result is equivalent to Equation 25 and is in the correct form for a Markov chain that can represent a computation. Of course, the matrix element calculated is only relevant if τ_max, as drawn from the exponential distribution, is constrained to be equal to the final value of τ in the final state $〈 (\begin{matrix} w \\ τ_{max} \end{matrix}) |$, as solved by the ODE system Õ_{DE}. We can implement this constraint with a factor of δ(τ − τ_max) in the Markov chain over states and times. Thus a step in the Markov chain in between reactions can be written as:

\begin{matrix} W_{betweenreactions} = δ (τ - τ_{max}) exp (t {\tilde{O}}_{{DE}}) P_{τ : = 0} exp (- τ_{max}) Θ (τ_{max} \geq 0) \\ W = {\hat{O}}_{react} \cdot W_{betweenreactions} \end{matrix}

As in the SSA derivation, the reaction step is given by factors of Ô_react which need to be normalized by D_react. Using δ(t − t_max(τ_max))dt = δ(τ − τ_max)dτ and dτ/dt = D_react(t), we find

\begin{matrix} M_{01} = exp (- τ_{max}) Θ (τ_{max} \geq 0) \\ M_{00} = δ (t - t_{max} (τ_{max})) exp (t {\tilde{O}}_{{DE}}) \cdot P_{τ : = 0} \\ M_{1} = {\hat{O}}_{react} / D_{react} \\ W = M_{1} \cdot M_{00} \cdot M_{01} \end{matrix}

(32)

where W represents the Markov chain corresponding to the simulation algorithm.

In implementations so far [4,15] we have used instead the equivalent differential operator

{\tilde{O}}_{{DE}} = V (Z) \nabla_{Z} = v ({z}) \cdot \nabla_{z} - D (z) p \partial_{p}

with p = exp(−τ), initialized at p₀ = 1, and a uniform distribution on p_final ∈ [0, 1]. This variant of the ODE was reported independently in [16], though the derivation there did not proceed by general field theory techniques.
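The equivalence of the τ and p = exp(−τ) formulations can be checked numerically. The sketch below computes the firing time both ways for one fixed uniform draw u, using τ_max = −ln u in the first variant and p_final = u in the second. The time-varying total propensity D(t), the step size and the function names are hypothetical; the midpoint rule stands in for whatever ODE solver an implementation would actually use.

```python
import math

def firing_time_tau(D, tau_max, h=1e-4):
    """Integrate dtau/dt = D(t) from tau = 0 until tau reaches tau_max."""
    t, tau = 0.0, 0.0
    while True:
        step = h * D(t + 0.5 * h)                    # midpoint rule
        if tau + step >= tau_max:
            return t + h * (tau_max - tau) / step    # interpolate inside the step
        tau += step
        t += h

def firing_time_p(D, p_final, h=1e-4):
    """Integrate dp/dt = -D(t) p from p = 1 until p falls to p_final."""
    t, p = 0.0, 1.0
    while True:
        p_next = p * math.exp(-h * D(t + 0.5 * h))   # exact if D is frozen over the step
        if p_next <= p_final:
            # log p is linear in time when D is frozen, so interpolate in log p
            return t + h * math.log(p / p_final) / math.log(p / p_next)
        p = p_next
        t += h

def D(t):
    """Hypothetical time-varying total propensity along the ODE trajectory."""
    return 2.0 * (1.0 + 0.5 * t)

u = 0.3                                  # one uniform draw on [0, 1], fixed for the check
t_tau = firing_time_tau(D, -math.log(u))
t_p = firing_time_p(D, u)
# Closed form for this D: tau(t) = 2 t + t^2 / 2 = -ln u.
t_exact = -2.0 + math.sqrt(4.0 - 2.0 * math.log(u))
```

Both variants stop at the same time up to rounding, which is the sense in which the two differential operators are equivalent.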

3.5.4 Algorithm: Hybrid SSA/ODE solver

By Equation 32 above, a Markov chain algorithm for simulating the hybrid system can be represented in the following SSA-like pseudocode:

factor ρ^(r)(x_in, x_out) = k^(r)(x_in) p^(r)(x_out|x_in);
repeat {
   initialize SSA propensities as k^(r)(x_in);
   initialize k^(total) := Σ_r k^(r)(x_in);
   initialize τ := 0;
   draw effective waiting time τ_max from exp(−τ_max);
   solve ODE system, including an extra ODE updating τ: dτ/dt = k^(total)(t),
     until τ = τ_max;
   draw reaction r from distribution k^(r)(x_in)/k^(total);
   draw x_out from p^(r)(x_out|x_in) and execute reaction r;
} until t ≥ t_max
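The pseudocode above can be sketched in Python as follows. This is a minimal illustration rather than the Plenum implementation: it assumes a forward-Euler integrator with fixed step h (so it carries ODE-solver error), and the function name `hybrid_ssa_ode`, its argument conventions and the toy model in the usage lines are all hypothetical.

```python
import math
import random

def hybrid_ssa_ode(x0, velocity, reactions, t_max, h=1e-3, rng=None):
    """Simulate ODE flow dx/dt = velocity(x) interrupted by stochastic reactions.

    reactions is a list of (propensity, fire) pairs, where propensity(x) returns
    k^(r)(x) and fire(x, rng) returns the post-reaction state x_out.
    """
    rng = rng or random.Random()
    t, x, events = 0.0, list(x0), []
    while t < t_max:
        tau, tau_max = 0.0, rng.expovariate(1.0)   # effective waiting time ~ exp(-tau_max)
        fired = False
        while t < t_max and not fired:
            k_total = sum(k(x) for k, _ in reactions)
            d_tau = h * k_total                    # Euler step of d tau/dt = k_total
            frac = 1.0
            if d_tau > 0.0 and tau + d_tau >= tau_max:
                frac = (tau_max - tau) / d_tau     # stop inside the step, where tau = tau_max
                fired = True
            x = [xi + frac * h * vi for xi, vi in zip(x, velocity(x))]
            tau += frac * d_tau
            t += frac * h
        if fired:
            weights = [k(x) for k, _ in reactions]
            r = rng.choices(range(len(reactions)), weights=weights)[0]
            x = reactions[r][1](x, rng)            # draw x_out and execute reaction r
            events.append((t, r))
    return t, x, events

# Toy usage (hypothetical model): one variable grows as dx/dt = 1, and a single
# reaction with propensity x resets it to zero, so the hazard grows linearly
# with the time elapsed since the last event.
t_end, x_end, events = hybrid_ssa_ode(
    x0=[0.0],
    velocity=lambda x: [1.0],
    reactions=[(lambda x: x[0], lambda x, rng: [0.0])],
    t_max=400.0, h=4e-3, rng=random.Random(0))
times = [t for t, _ in events]
mean_gap = times[-1] / len(times)
```

For this toy model the waiting time between events has survival function exp(−t²/2), whose mean is √(π/2), so the sample mean gap should lie near that value. A production implementation would replace the Euler loop with an adaptive solver that detects the stopping condition τ = τ_max.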

3.5.5 Application: Cell division

As a simplified model of stochastic cell division, we may consider constant growth of a linear dimension l of each cell, dl/dt = v, coupled with a stochastic cell division rule whose propensity depends on the ratio of l to a threshold length l₀ for likely division:

cell (l) \to cell (l / 2), cell (l / 2) with ρ_{division} σ (β (l / l_{0} - 1))

with a sigmoidal function such as σ(x) = 1/(1 + exp(−x)). In this model the parameter β varies the sharpness of the threshold, and ρ_division is the maximal propensity for division. Experimental evidence for stochastic dependence of division events on cell size in plant cells is reviewed in [17].

The differential equation for length can also be put in the form of a reaction rule that includes an ODE:

cell (l) \to cell (l + d l) solving d l / d t = v (l)

as described in [15]. Clearly this model could be augmented with other parameters such as growth signals with their own dynamics. This was done in models of biological stem cell niches in mouse olfactory epithelium and plant root growth, using the foregoing cell division rules. These systems were studied and simulated using the hybrid SSA/ODE algorithm above, in [15,18].
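For the simplified division model with constant growth velocity v, the extra ODE for τ happens to integrate in closed form, because the integral of σ(β(l/l₀ − 1)) along l(t) is a softplus function. The sketch below uses this observation (a convenience of this toy case, not a step taken in the text) to sample division lengths along a single lineage that follows one daughter of length l/2 after each division. Parameter values and function names are hypothetical.

```python
import math
import random

def softplus(x):
    """log(1 + exp(x)), an antiderivative of the sigmoid sigma(x)."""
    return math.log1p(math.exp(x))

def division_length(l_init, v, l0, beta, rho_div, rng):
    """Sample the length at which a cell growing as dl/dt = v divides.

    With propensity rho_div * sigma(beta * (l / l0 - 1)), the warped time is
    tau(l) = scale * (softplus(beta * (l / l0 - 1)) - softplus(beta * (l_init / l0 - 1))).
    """
    tau_max = rng.expovariate(1.0)           # effective waiting time
    scale = rho_div * l0 / (beta * v)
    s = softplus(beta * (l_init / l0 - 1.0)) + tau_max / scale
    x = math.log(math.expm1(s))              # invert softplus: solve tau(l) = tau_max
    return l0 * (1.0 + x / beta)

rng = random.Random(1)
l, lengths = 0.6, []
for _ in range(2000):
    l_div = division_length(l, v=1.0, l0=1.0, beta=20.0, rho_div=50.0, rng=rng)
    lengths.append(l_div)
    l = l_div / 2.0                          # follow one daughter cell
mean_length = sum(lengths) / len(lengths)
```

With a sharp threshold (large β) the sampled division lengths concentrate near l₀, which is the qualitative behaviour the sigmoidal propensity is meant to produce.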

3.5.6 Application: Time-varying propensity for complete polymerization

Consider the n-step polymerization reaction

\begin{matrix} {A \to X_{1} with k_{1}, X_{1} \to X_{2} with k_{2}, \dots, X_{n - 1} \to B with k_{n}} \\ τ_{i} = τ / n and k_{i} = n k . \end{matrix}

There is an n^(max). Then

\begin{matrix} \hat{W} = λ {\hat{a}}_{n + 1} \\ W = λ (({\hat{a}}_{n + 1} + c_{n + 1}) - I_{n + 1}) = λ ({\hat{b}}_{n + 1} - I_{n + 1}) \end{matrix}

where c_{n+1} is all zeros except for a “1” entry in the lower right corner. Since b̂ and I are matrices that commute, exp(tW) = exp(tλb̂_{n+1}) exp(−tλI_{n+1}) and we easily compute

P (t ∣ τ, n) = {[exp (t W)]}_{1, n + 1} = \frac{λ^{n} t^{n - 1} e^{- λ t}}{(n - 1)!}

This is the distribution on polymer completion times. It is an Erlang distribution (a Gamma distribution with integral values of n). If τ is held fixed and n tends towards infinity, this distribution approaches a delta function δ(t − τ), which can lead to differential-delay equation models for reaction networks involving polymerization processes such as transcription [19]. This probability distribution for termination times also corresponds to the time-varying propensity function

ρ (t ∣ τ, n) = P (t ∣ τ, n) / [1 - \int_{0}^{t} P (t^{'} ∣ τ, n) d t^{'}] = \frac{λ^{n} t^{n - 1} e^{- λ t}}{Γ (n, t λ)},

(33)

which increases monotonically in time.
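Equation 33 is straightforward to evaluate because, for integer n, the upper incomplete Gamma function reduces to a finite sum: Γ(n, x)/(n − 1)! = e^{−x} Σ_{k<n} x^k/k!. The sketch below uses this identity; it assumes each stage has rate λ = n/τ (so that the mean completion time is τ), and the function names are illustrative.

```python
import math

def erlang_pdf(t, tau, n):
    """P(t | tau, n) = lam^n t^(n-1) exp(-lam t) / (n-1)!, assuming lam = n / tau."""
    lam = n / tau
    return lam**n * t**(n - 1) * math.exp(-lam * t) / math.factorial(n - 1)

def erlang_survival(t, tau, n):
    """1 - int_0^t P dt' = Gamma(n, lam t) / (n-1)! = exp(-lam t) sum_{k<n} (lam t)^k / k!."""
    lam = n / tau
    return math.exp(-lam * t) * sum((lam * t)**k / math.factorial(k) for k in range(n))

def propensity(t, tau, n):
    """Time-varying propensity rho(t | tau, n) of Equation 33."""
    return erlang_pdf(t, tau, n) / erlang_survival(t, tau, n)

grid = [0.1 * i for i in range(1, 31)]
rho_one_stage = [propensity(t, 1.0, 1) for t in grid]    # n = 1: ordinary exponential decay
rho_five_stage = [propensity(t, 1.0, 5) for t in grid]   # n = 5: rising propensity
```

For n = 1 the propensity is the constant λ, recovering the ordinary SSA case, while for larger n it rises from zero towards λ.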

As in Equation 32, the resulting time-varying propensity still fits within the framework of a Markov chain W(I, t′|J, t) that advances the time variable by an increment that is a random variable. The method of the previous section can be used to implement an SSA-like algorithm, with differential equations that govern propensities replaced by algebraic equations (Equation 33) or, if differential equations are also present, by differential-algebraic equations.

3.5.7 Extended Application: Tissue-level model of Arabidopsis root growth

A full tissue-level model of a hybrid SSA/ODE system has been presented in [18], which details a mathematical model of auxin growth hormone patterning along the developing root of the plant Arabidopsis thaliana, including the pattern formation system in the root apical meristem (RAM). The model was first formulated using a fixed 1D geometry of cells along the central “stele” of the root, including both passive diffusion of auxin originating in the above ground part of the plant, and more importantly autoregulated active transport of auxin. This much of the model is formulated using ordinary differential equations and spatial discretization at the scale of one cell.

However, the real root involves cell growth, division and possibly biomechanics in an essential way, so the model was reimplemented in the “Plenum” implementation of the “Dynamical Grammars” modeling language. Dynamical Grammars support parameterized rules such as those of Section 3.5 above at multiple scales (e.g. cellular and/or molecular scales), and the Plenum implementation [15] uses the foregoing hybrid SSA/ODE algorithm as an essential part of its simulation engine. It also uses a data structure of pattern-matched objects (somewhat akin to that of the Rete algorithm [12]) for efficient handling of the variable-binding involved when there are many rules in a grammar, some of which include repeated variable names. The resulting root growth and patterning model includes rules for cell growth, cell division, mechanical forces between neighboring cells in 1D, cell death at the tip of the root, auxin influx from the shoot, production of a hypothetical second morphogen “Y” possibly playing a role similar to cytokinin, autoregulated active transport of auxin between neighboring cells, passive transport of auxin and Y between neighboring cells, degradation of auxin and Y, and dilution of auxin and Y due to cell growth. There are a total of 13 grammar rules that specify the foregoing mechanisms, with one or two rules per listed mechanism. As in the previous cell division example, each rule is either of “solving” keyword ODE type or of “with” keyword discrete event type. We now present the first four rules of this model.

In the root model there is just one type of parameterized object, a cell. Each cell carries its own internal state information in the form of the values of an ordered list of parameters, each of which is constrained to be of some type (often integers or real values) associated with a measure that can be summed or integrated over. In the plant root model, the parameter types of a cell object are as follows:

cell[currID : ℕ, mode : ℕ, l : ℝ, r : ℝ, A : ℝ, Y : ℝ, prevID : ℕ, nextID : ℕ]

Here currID is the integer-valued (or “integer-typed”, currID: ℕ) unique identification number (ID) of the current cell, prevID is the integer-valued ID of the previous cell in the 1D line, nextID is the integer-valued ID of the next cell, mode is an integer-valued label specifying the cell’s internal growth state, l is the real-valued (l: ℝ) current cell length, r is its real-valued size or “radius”, A is its real-valued concentration of auxin, Y is its real-valued concentration of hypothetical substance Y. Here A and Y could alternatively be typed as nonnegative integers in a stochastic molecular simulation, but that was not the modeling choice in this investigation. A slightly simplified version of the rules for cell growth, cell division, biomechanics, and passive diffusion of chemical species between neighboring cells is:

grammar root {
   /* cell growth: */
   cell[curr, mode, x, r, A, Y, prev, next] → cell[curr, mode, x, r + dr, A, Y, prev, next]
   solving {dr/dt = 1/τ_cycle}
   /* cell biomechanics (point masses, dissipation dominated) */
     C₁ = cell[curr, mode, x, r, A, Y, prev, next],
     C₂ = cell[next, mode′, x′, r′, A′, Y′, curr, nextnext]
     → C₁ = cell[curr, mode, x + dx, r, A, Y, prev, next], C₂
            solving {dx/dt = −∂_xV_spring(x − x′, r, r′)}
     /*plus similar rule exchanging next and prev; dx/dt just adds up over rules */
   /* switch from growth mode (mode=1) to division-waiting mode (=2): */
     cell[curr, 1, x, r, A, Y, prev, next] → cell[curr, 2, x, r, A, Y, prev, next]
         with ρ_stop/(1 + exp(−(r − r_lim)/T_div))
   /* cell replication, preserving 1D structure: */
     cell[curr, 2, x, r, A, Y, prev, next], cell[prev, mode′, x′, r′, A′, Y′, prevprev, curr],
        cell[next, mode, x″, r″, A″, Y″, curr, nextnext]
     → cell[new₁, 1, x − r + 2rα + r(1 − α), r(1 − α), A, Y, prev, new₂],
        cell[new₂, 1, x − r + rα, rα, A, Y, new₁, next],
        cell[prev, mode′, x′, r′, A′, Y′, prevprev, new₁],
        cell[next, mode, x″, r″, A″, Y″, new₂, nextnext]
     with  $[base + ampl (Y / Y_{0}) / (1 + {(Y / Y_{0})}^{5})] \times Θ (\frac{1}{2} - Δ \leq α \leq \frac{1}{2} + Δ)$ 
   /* auxin/Y passive transport between two neighboring cells: */
     C₁ = cell[curr, mode, x, r, A, Y, prev, next],
     C₂ = cell[next, mode′, x′, r′, A′, Y′, curr, nextnext]
     → C₁ = cell[curr, mode, x, r, A + dA, Y + dY, prev, next],
     C₂ = cell[next, mode′, x′, r′, A′ + dA′, Y ′ + dY′, curr, nextnext]
            solving {dA/dt = D_A(A′ − A), dA′/dt = D_A(A − A′),
                    dY/dt = D_Y (Y′ − Y), dY′/dt = D_Y (Y − Y′) }
}

The actual code for these rules is given in Appendix III. It is written using the Plenum implementation [15] of the Dynamical Grammars framework [4]. Plenum is embedded in the Mathematica computer algebra problem-solving environment. Thus, ordinary and partial derivatives as used above are actually a part of the language. The full model file is available as Supplementary Data to this paper. Repeated variables on the left hand side (LHS) must have identical values for the rule to apply. This situation occurs in the cell biomechanics, cell replication and passive transport rules above, where left-right cell neighbor pairs point to one another by sharing ID parameter values like “curr”, “prev” and “next” in the first and last two parameter positions. By contrast, there is no repetition of variables on the LHS of the autonomous cell growth or cell mode-switching rules above, since they have only one object on the LHS of each rule. Algorithmically such repeated variable matching is achieved by symbolic pattern matching or variable-binding; mathematically it is expressed by operator integrals such as Equation 18. The coordinate system used in this example may seem “backwards”: by a minor convention roots grow from left to right, but the quiescent center near the right tip is the origin of coordinates.
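The role of repeated variables can be illustrated outside Plenum with a short Python sketch. It is not the Plenum implementation: the `Cell` dataclass, the use of ID 0 for a missing neighbor, and the functions `neighbor_pairs` and `transport_rates` are assumptions made for this example. The matcher finds every pair (C₁, C₂) in which C₁'s next ID equals C₂'s current ID and C₂'s previous ID equals C₁'s current ID, and the passive-transport ODE right-hand sides are then summed over all matches.

```python
from dataclasses import dataclass

@dataclass
class Cell:
    curr: int     # unique ID of this cell
    mode: int     # internal growth state
    x: float      # position
    r: float      # size or "radius"
    A: float      # auxin concentration
    Y: float      # concentration of hypothetical substance Y
    prev: int     # ID of previous cell (0 = none, a convention assumed here)
    next: int     # ID of next cell (0 = none)

def neighbor_pairs(cells):
    """Match C1 = cell[curr, ..., prev, next] with C2 = cell[next, ..., curr, nextnext]."""
    by_id = {c.curr: c for c in cells}
    pairs = []
    for c1 in cells:
        c2 = by_id.get(c1.next)
        if c2 is not None and c2.prev == c1.curr:   # repeated variables must agree
            pairs.append((c1, c2))
    return pairs

def transport_rates(cells, D_A, D_Y):
    """Sum the passive-transport right-hand sides dA/dt, dY/dt over all matched pairs."""
    dA = {c.curr: 0.0 for c in cells}
    dY = {c.curr: 0.0 for c in cells}
    for c1, c2 in neighbor_pairs(cells):
        dA[c1.curr] += D_A * (c2.A - c1.A)
        dA[c2.curr] += D_A * (c1.A - c2.A)
        dY[c1.curr] += D_Y * (c2.Y - c1.Y)
        dY[c2.curr] += D_Y * (c1.Y - c2.Y)
    return dA, dY

cells = [Cell(1, 1, 0.0, 1.0, 4.0, 1.0, 0, 2),
         Cell(2, 1, 2.0, 1.0, 2.0, 1.0, 1, 3),
         Cell(3, 1, 4.0, 1.0, 0.0, 3.0, 2, 0)]
pairs = [(c1.curr, c2.curr) for c1, c2 in neighbor_pairs(cells)]
dA, dY = transport_rates(cells, D_A=0.08, D_Y=0.16)
```

Because each matched pair contributes equal and opposite terms, the summed rates conserve the totals of A and Y, as passive exchange between neighbors should.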

The Plenum implementation also performs several symbolic computations including variable-binding for efficient implementation of rules with repeated variables (Equation 18 above and the present extended example), and aggregating the ODE dτ/dt = k^(total)(t) of Section 3.5.4 by adding up the symbolic expressions from the individual rules.

Selected pattern formation snapshots are shown in Figure 3. The phenomenology of the resulting simulations, and of the actual root observations with which they largely agree, is discussed in [18]. Root apical meristem is an example of a stem cell niche in plants. A somewhat more complex stem cell niche model for mouse olfactory epithelium, in two dimensions using Plenum, is given in [15].

Figure 3. Snapshots of the root growth model, showing cell positions along the horizontal axis (root tip to the right), and concentrations of auxin (solid red curve with one or two peaks) and hypothetical substance Y (dashed blue curve with one or two peaks) with increasing time. Cell state (1=idling in preparation for cell division, vs. 0=growth) is shown as a green dotted curve. Parameters are: ρ_stop = 1, base = .005, ampl = 100, Y₀ = 5, r_lim = 1, T_div = .01, Δ= .2, D_A = 0.08, D_Y = 0.16. Some parameter sets including this one develop extra auxin peaks to the left of the Quiescent Center (~rightmost blue peak), which may specify the location for a new lateral root. Full interpretation of this model is given in [18].

4 Conclusion and outlook

We have shown that the time-ordered product expansion (TOPE) can be used systematically to derive computational simulation and parameter-fitting algorithms for stochastic systems, connecting two seemingly distant areas of research. In doing so we have developed the means to translate formally between field theory language and the language of computable Markov chains in which randomized algorithms can be expressed and derived. By this means we hope to open the door to the use of TOPE and related methods from quantum and statistical field theory in the computational simulation of stochastic biochemical kinetics, with broad applicability in physically based biology. The particular hybrid stochastic process/ordinary differential equation simulation algorithm derived here is very different from interleaving and operator splitting algorithms which are intrinsically approximate; instead, this algorithm is exact in the same sense that SSA is (that is, it draws from the same distribution of just-fired reactions), except for any errors introduced by the ODE solver and in the solver’s detection of the ODE stopping criterion, which is that an auxiliary variable reaches a threshold value. A future prospect for the field theory approach is application to reaction-diffusion systems in which the propagator for particles between reactions is the heat kernel Green’s function for the diffusion equation. The result may be an alternative avenue for derivation of novel particle-based, off-grid stochastic numerical solvers for reaction-diffusion problems as treated in [2], which, like the algorithms shown here, are also amenable to generalizations to exact “leaping” acceleration and to hybrid stochastic/differential equation solution algorithms.

Supplementary Material

PB446673suppdata.zip (301.1 KB, zip)

A time history of the reaction A + B ⇌ C. Time flows left to right. Open circles represent reaction events, with probability factor ×W₁. In between reaction events are unimolecular particle propagators exp((t_k − t_{k−1})W₀), labelled by arrows and particle names (repeated for clarity). This is a non-spatial version of the Lee model in quantum field theory (cf. for example [6]).

Erlang-derived time-dependent propensities for completion of a multistage process, with τ = 1 and n ∈ {1, …, 10}. Horizontal axis: time, t. Vertical axis: propensity, ρ(t|τ, n). Plots for varying n are superimposed. For larger n there is a “maturation” phenomenon whereby completion at small times is very unlikely, and when a process is “overdue” for completion then its propensity becomes very high. By comparison, propensities for very small n increase rapidly at first and are then relatively flat.

Acknowledgments

Research was supported by NIH grants R01 GM086883 and P50 GM76516 to UC Irvine. I also wish to acknowledge the hospitality, travel support, and research environments provided by the Center for Nonlinear Studies (CNLS) at the Los Alamos National Laboratory, the Sainsbury Laboratory Cambridge University, and the Pauli Center for Theoretical Studies at ETH Zürich and the University of Zürich.

Abbreviations list

IC: Initial Condition
ODE: Ordinary Differential Equation
MC: Markov Chain
SSA: (Gillespie) Stochastic Simulation Algorithm
TOPE: Time-Ordered Product Expansion
LHS: Left Hand Side
RHS: Right Hand Side

C Appendix I: Bayesian inference derivation

C.1 Semigroup property

Here we provide the omitted details for Section 2.4. For nonnegative times t = t₁ + t₂, any time-evolution equation must obey the semigroup property:

Pr (I ∣ J, t) = \sum_{K} Pr (I ∣ K, t_{2}) Pr (K ∣ J, t_{1})

i.e.

{[exp (t (\hat{W} - D))]}_{I, J} = \sum_{K} {[exp (t_{2} (\hat{W} - D))]}_{I, K} {[exp (t_{1} (\hat{W} - D))]}_{K, J} .
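The semigroup property is easy to verify numerically for a small example. The sketch below builds a three-state generator W = Ŵ − D whose columns sum to zero and compares exp((t₁ + t₂)W) with exp(t₂W) exp(t₁W). The rate matrix, the truncated Taylor series used for the matrix exponential, and the helper names are assumptions chosen for illustration.

```python
def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def expm(m, t, terms=60):
    """exp(t m) by a truncated Taylor series (adequate for small t * ||m||)."""
    n = len(m)
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms):
        term = [[t * v / k for v in row] for row in matmul(term, m)]   # (t m)^k / k!
        result = [[r + v for r, v in zip(rr, tr)] for rr, tr in zip(result, term)]
    return result

# Hypothetical off-diagonal rates: W_hat[i][j] is the rate of jumping from state j to state i.
W_hat = [[0.0, 1.0, 0.0],
         [2.0, 0.0, 3.0],
         [0.5, 0.0, 0.0]]
n = 3
D = [sum(W_hat[i][j] for i in range(n)) for j in range(n)]    # total exit rate of each state
W = [[W_hat[i][j] - (D[j] if i == j else 0.0) for j in range(n)] for i in range(n)]

t1, t2 = 0.3, 0.5
lhs = expm(W, t1 + t2)
rhs = matmul(expm(W, t2), expm(W, t1))
max_gap = max(abs(lhs[i][j] - rhs[i][j]) for i in range(n) for j in range(n))
col_sums = [sum(lhs[i][j] for i in range(n)) for j in range(n)]  # probability is conserved
```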

Is there a k-event version of this rule, for k = k₁ + k₂? We observe (where again ${[τ_{q}]}_{0}^{k} \equiv [τ_{0}, \dots τ_{k}]$ )

Pr (I, k ∣ J, t) = \int_{0}^{t} \dots \int_{0}^{t} (\prod_{q = 0}^{k} d τ_{q}) Pr (I, {[τ_{q}]}_{0}^{k}, k ∣ J, t)

and we calculate for 0 ≤ k₁ ≤ k:

\begin{array}{l} \sum_{K} \int_{0}^{t} d τ Pr (I, [τ_{k_{1}}^{'}, τ_{k_{1} + 1}, \dots τ_{k}], k_{2} ∣ K, τ) Pr (K, {[τ_{q}]}_{0}^{k_{1}}, k_{1} ∣ J, t - τ) \\ = \int_{0}^{t} d τ {[\prod_{q = k ↓}^{k_{1} + 1} exp (- τ_{q} D) \hat{W}] exp (- τ_{k_{1}}^{'} D) \\ {\times δ (τ - (\sum_{q = k_{1} + 1}^{k} τ_{q} + τ_{k_{1}}^{'})) exp (- τ_{k_{1}} D) [\prod_{q = k_{1} - 1 ↓}^{0} \hat{W} exp (- τ_{q} D)] δ (t - τ - \sum_{q = 0}^{k_{1} - 1} τ_{q})}}_{I, J} \\ = {[\prod_{q = k ↓}^{k_{1} + 1} exp (- τ_{q} D) \hat{W}] exp (- (τ_{k_{1}}^{'} + τ_{k_{1}}) D) \\ {\times [\prod_{q = k_{1} - 1 ↓}^{0} \hat{W} exp (- τ_{q} D)] δ (t - (\sum_{q = k_{1} + 1}^{k} τ_{q} + τ_{k_{1}}^{'} + \sum_{q = 0}^{k_{1} - 1} τ_{q}))}}_{I, J} \\ = Pr (I, [{[τ_{q}]}_{0}^{k_{1} - 1}, τ_{k_{1}}^{'} + τ_{k_{1}}, {[τ_{q}]}_{k_{1} + 1}^{k}], k ∣ J, t) . \end{array}

Thus, if k = k₁ + k₂ and for any $τ_{k_{1}}^{'} \in [0, τ_{k_{1}}]$ , there is a semigroup law:

Pr (I, {[τ_{q}]}_{0}^{k}, k ∣ J, t) = \sum_{K} \int_{0}^{t} d τ Pr (I, [τ_{k_{1}}^{'}, τ_{k_{1} + 1}, \dots τ_{k}], k_{2} ∣ K, τ) \times Pr (K, [τ_{0}, \dots τ_{k_{1} - 1}, τ_{k_{1}} - τ_{k_{1}}^{'}], k_{1} ∣ J, t - τ) .

In this derivation there is an arbitrary choice of $τ_{k_{1}}^{'}$ from the interval [0, τ_k₁].

C.2 Bayesian recurrence relation

Here we provide the omitted details for Section 3.1.1. We seek a version of the semigroup law that pertains to Pr(I, t|k, J) rather than to Pr(I, k|J, t). This is achieved by a somewhat involved application of Bayes’ rule.

We observe (where again by definition ${[τ_{q}]}_{0}^{k} ≜ [τ_{0}, \dots τ_{k}]$ )

Pr (I, {[τ_{q}]}_{0}^{k}, k ∣ J, t) = {[exp (- τ_{k} D) \prod_{q = k - 1 ↓}^{0} \hat{W} exp (- τ_{q} D) Θ (τ_{q} \geq 0)]}_{I, J} δ (t - \sum_{q = 0}^{k} τ_{q})

so we may define (where J = I₀ and I = I_k and D_{I_q} ≜ D_{I_qI_q})

Pr ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k}, k ∣ J, t) ≜ exp (- τ_{k} D_{I_{k}}) [\prod_{q = 0}^{k - 1} {\hat{W}}_{I_{q + 1} I_{q}} exp (- τ_{q} D_{I_{q}}) Θ (τ_{q} \geq 0)] δ (t - \sum_{q = 0}^{k} τ_{q})

We seek a simple expression for Pr(I, t|k, J), and claim that with suitable caveats it will be determined recursively by

Pr (I, t ∣ k = 1, J) = {\hat{W}}_{I J} exp (- {t D}_{J}),

the two factors of which have inverse cancelling normalizations. The obstacle to overcome is that, from the Bayesian point of view, simultaneous knowledge of the simulation end time and final event number can trickle backwards and influence the distribution of likely event firing times at earlier times and event numbers, which is a completely nonphysical artifact. To avoid this effect we must be careful to ask the right questions for Bayesian inference to answer. To begin with we consider simulation ending times T much longer than event times t that we wish to sample. All events after event k at time t are assumed to be of no interest, so we integrate them out. All earlier events are assumed to be known already, so we conditionalize over them. This is the correct Bayesian way to introduce time asymmetry into the global distribution $Pr ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k}, k ∣ \dots)$ over entire trajectories ( ${[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k}$ ) above.

The strategy then is to consider large times T ≫ t which are overwhelmingly likely to have large reaction numbers n ≫ k; then to marginalize the probability distribution Pr([I], [τ], n|J, T) over all event numbers n > k and to conditionalize it over all event numbers q < k.

C.2.1 Marginalizing

Define

\hat{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J, T) ≜ \sum_{n = k + 1}^{\infty} \sum_{{{[I_{q}]}_{k + 1}^{n}}} \int_{0}^{T} \dots \int_{0}^{T} {[d τ_{q}]}_{k}^{n} Pr ({[I_{q}]}_{1}^{n}, {[τ_{q}]}_{0}^{n}, n ∣ J, T)

(34)

This is a “just-fired” probability, in which any wait times τ and events after the kth event are integrated out (marginalized).

In the limit T → ∞ only summands with n ≫ k will contribute (assuming terminal states have been formally eliminated, e.g. by adding an extra, isolated, slow, reversible reaction). First, is this object really a probability distribution? Clearly every value is nonnegative. They also add up to one in the limit T → ∞:

\begin{array}{l} \sum_{{{[I_{q}]}_{1}^{k}}} \int_{0}^{T} {[d τ_{q}]}_{0}^{k - 1} \hat{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J, T) \\ = \sum_{n = k + 1}^{\infty} \sum_{{{[I_{q}]}_{1}^{n}}} \int_{0}^{T} {[d τ_{q}]}_{0}^{n} Pr ({[I_{q}]}_{1}^{n}, {[τ_{q}]}_{0}^{n}, n ∣ J, T) \\ \underset{T \to \infty}{\to} \sum_{n = 0}^{\infty} \sum_{{{[I_{q}]}_{1}^{n}}} \int_{0}^{T} {[d τ_{q}]}_{0}^{n} Pr ({[I_{q}]}_{1}^{n}, {[τ_{q}]}_{0}^{n}, n ∣ J, T) = 1 \end{array}

(35)

due to the normalization of $Pr ({[I_{q}]}_{1}^{n}, {[τ_{q}]}_{0}^{n}, n ∣ J, T)$ . So, $\tilde{Pr} (\dots ∣ k, J) = lim_{T \to \infty} \hat{Pr} (\dots ∣ k, J, T)$ is also a probability density function.

Next we compute $\hat{Pr} (\dots ∣ J, T)$ using TOPE:

\begin{array}{l} \hat{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J, T) = \sum_{n = k + 1}^{\infty} \sum_{{{[I_{q}]}_{k + 1}^{n}}} \int_{0}^{T} \dots \int_{0}^{T} {[d τ_{q}]}_{k}^{n} δ (T - \sum_{q = 0}^{n} τ_{q}) \\ \times exp (- τ_{n} D_{I_{n}}) [\prod_{q = 0}^{n - 1} {\hat{W}}_{I_{q + 1} I_{q}} exp (- τ_{q} D_{I_{q}}) Θ (τ_{q} \geq 0)] \\ = \sum_{n = k + 1}^{\infty} \sum_{{{[I_{q}]}_{k + 1}^{n}}} \int_{0}^{T} \int_{0}^{T} {[d τ_{q}]}_{k}^{n} δ (T - \sum_{q = 0}^{k - 1} τ_{q} - \sum_{q = k}^{n} τ_{q}) exp (- τ_{n} D_{I_{n}}) \\ \times [\prod_{q = 0}^{k - 1} {\hat{W}}_{I_{q + 1} I_{q}} exp (- τ_{q} D_{I_{q}}) Θ (τ_{q} \geq 0)] [\prod_{q = k}^{n - 1} {\hat{W}}_{I_{q + 1} I_{q}} exp (- τ_{q} D_{I_{q}})] \end{array}

But now the product [ $\prod_{q = 0}^{k - 1} \dots$ ] is a common factor and can be moved out of all the integrals and sums. Thus

\hat{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J, T) = [\prod_{q = 0}^{k - 1} {\hat{W}}_{I_{q + 1} I_{q}} exp (- τ_{q} D_{I_{q}}) Θ (τ_{q} \geq 0)] I (I_{k}, k, J, T - \sum_{q = 0}^{k - 1} τ_{q}),

(the first factor of which is independent of T), where

I (I_{k}, k, J, T^{'}) ≜ \sum_{n = k + 1}^{\infty} \sum_{{{[I_{q}]}_{k + 1}^{n}}} \int_{0}^{T} \dots \int_{0}^{T} {[d τ_{q}]}_{k}^{n} δ (T^{'} - \sum_{q = k}^{n} τ_{q}) \times exp (- τ_{n} D_{I_{n}}) [\prod_{q = k}^{n - 1} {\hat{W}}_{I_{q + 1} I_{q}} exp (- τ_{q} D_{I_{q}})] .

If we define new dummy variables I′_q ≜ I_{q+k}, τ′_q ≜ τ_{q+k}, and n′ ≜ n − k, then τ′_{n′} = τ_n, I′_{n′} = I_n, and

\begin{array}{l} I (I_{k}, k, J, T^{'}) = \sum_{n^{'} = 1}^{\infty} \sum_{{{[{I^{'}}_{q}]}_{1}^{n^{'}}}} \int_{0}^{T} \dots \int_{0}^{T} {[d {τ^{'}}_{q}]}_{0}^{n^{'}} δ (T^{'} - \sum_{q = 0}^{n^{'}} {τ^{'}}_{q}) \\ \times exp (- {τ^{'}}_{n^{'}} D_{{I^{'}}_{n^{'}}}) [\prod_{q = 0}^{n^{'} - 1} {\hat{W}}_{{I^{'}}_{q + 1} {I^{'}}_{q}} exp (- {τ^{'}}_{q} D_{{I^{'}}_{q}})] \\ = 1 \cdot (e^{T^{'} (\hat{W} - D)} - e^{- T^{'} D}) \cdot δ ({I^{'}}_{0} - I_{k}) \\ = 1 - 1 \cdot e^{- T^{'} D} \cdot δ ({I^{'}}_{0} - I_{k}), \end{array}

by adding and subtracting the missing n′= 0 summand and using TOPE again. Now we can take limits:

lim_{T \to \infty} I (I_{k}, k, J, T^{'}) = 1,

and

\begin{array}{l} \tilde{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J) ≜ lim_{T \to \infty} \hat{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J, T) \\ = \prod_{q = 0}^{k - 1} ({\hat{W}}_{I_{q + 1} I_{q}} exp (- τ_{q} D_{I_{q}}) Θ (τ_{q} \geq 0)) . \end{array}

(36)

As a special case, for k = 1 we find $\tilde{Pr} ([I_{1}], [τ_{0}] ∣ 1, J) = {\hat{W}}_{I_{1} I_{0}} exp (- τ_{0} D_{I_{0}}) Θ (τ_{0} \geq 0)$ .
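The k = 1 density is normalized because D_J is the column sum of Ŵ: summing Ŵ_{IJ} over I and integrating exp(−τD_J) over τ ≥ 0 gives D_J · (1/D_J) = 1. The sketch below confirms this by midpoint-rule quadrature for a small hypothetical rate matrix; the matrix entries, integration cutoff and function names are assumptions.

```python
import math

# Hypothetical rates: W_hat[i][j] is the rate of jumping from state j to state i.
W_hat = [[0.0, 1.0, 0.0],
         [2.0, 0.0, 3.0],
         [0.5, 0.0, 0.0]]
N = 3
D = [sum(W_hat[i][j] for i in range(N)) for j in range(N)]   # D_J = sum_I W_hat[I][J]

def just_fired_density(i, tau, j):
    """W_hat[i][j] * exp(-tau * D[j]) * Theta(tau >= 0)."""
    return W_hat[i][j] * math.exp(-tau * D[j]) if tau >= 0 else 0.0

def total_probability(j, tau_end=40.0, steps=20000):
    """Sum over next states I and integrate over tau in [0, tau_end] (midpoint rule)."""
    h = tau_end / steps
    total = 0.0
    for s in range(steps):
        tau = (s + 0.5) * h
        total += h * sum(just_fired_density(i, tau, j) for i in range(N))
    return total

totals = [total_probability(j) for j in range(N)]
```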

C.2.2 Conditionalizing

If 2 ≤ k < n, Bayes’ Rule implies:

\hat{Pr} (I_{k}, τ_{k - 1} ∣ {[I_{q}]}_{1}^{k - 1}, {[τ_{q}]}_{0}^{k - 2}, k, J, T) = \frac{\hat{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J, T)}{\hat{Pr} ({[I_{q}]}_{1}^{k - 1}, {[τ_{q}]}_{0}^{k - 2} ∣ k, J, T)}

(37)

The denominator is a new quantity (since the k’s don’t match up the way they do in the numerator) and it is the integral of the numerator that normalizes the left hand side. It can be evaluated in the limit of large T:

\begin{array}{l} \hat{Pr} ({[I_{q}]}_{1}^{k - 1}, {[τ_{q}]}_{0}^{k - 2} ∣ k, J, T) = \sum_{I_{k}} \int_{0}^{T} d τ_{k - 1} \hat{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J, T) \\ = \sum_{n = k + 1}^{\infty} \sum_{{{[I_{q}]}_{k}^{n}}} \int_{0}^{T} {[d τ_{q}]}_{k - 1}^{n} Pr ({[I_{q}]}_{1}^{n}, {[τ_{q}]}_{0}^{n}, n ∣ J, T) \\ \underset{T \to \infty}{\to} lim_{T \to \infty} \sum_{n = k}^{\infty} \sum_{{{[I_{q}]}_{k}^{n}}} \int_{0}^{T} {[d τ_{q}]}_{k - 1}^{n} Pr ({[I_{q}]}_{1}^{n}, {[τ_{q}]}_{0}^{n}, n ∣ J, T) \\ = \tilde{Pr} ({[I_{q}]}_{1}^{k - 1}, {[τ_{q}]}_{0}^{k - 2} ∣ k - 1, J) \end{array}

from Equation 34 and Equation 36. As before, the second step is justified by the fact that n ≤ k has probability that approaches zero as T → ∞. Defining

\overset{ˇ}{Pr} (I_{k}, τ_{k - 1} ∣ {[I_{q}]}_{1}^{k - 1}, {[τ_{q}]}_{0}^{k - 2}, k, J) ≜ lim_{T \to \infty} \hat{Pr} (I_{k}, τ_{k - 1} ∣ {[I_{q}]}_{1}^{k - 1}, {[τ_{q}]}_{0}^{k - 2}, k, J, T)

we find (from Equation 37)

\begin{array}{l} \overset{ˇ}{Pr} (I_{k}, τ_{k - 1} ∣ {[I_{q}]}_{1}^{k - 1}, {[τ_{q}]}_{0}^{k - 2}, k, J) = lim_{T \to \infty} \frac{\hat{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J, T)}{\hat{Pr} ({[I_{q}]}_{1}^{k - 1}, {[τ_{q}]}_{0}^{k - 2} ∣ k, J, T)} \\ = \frac{lim_{T \to \infty} \hat{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J)}{lim_{T \to \infty} \hat{Pr} ({[I_{q}]}_{1}^{k - 1}, {[τ_{q}]}_{0}^{k - 2} ∣ k, J)} \end{array}

since for valid nonnegative τs the ratio of limits exists and is finite, as we will shortly see. Thus

\begin{array}{l} \overset{ˇ}{Pr} (I_{k}, τ_{k - 1} ∣ {[I_{q}]}_{1}^{k - 1}, {[τ_{q}]}_{0}^{k - 2}, k, J) = \frac{\tilde{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J)}{\tilde{Pr} ({[I_{q}]}_{1}^{k - 1}, {[τ_{q}]}_{0}^{k - 2} ∣ k - 1, J)} \\ = \frac{\prod_{q = 0}^{k - 1} ({\hat{W}}_{I_{q + 1} I_{q}} exp (- τ_{q} D_{I_{q}}) Θ (τ_{q} \geq 0))}{\prod_{q = 0}^{k - 2} ({\hat{W}}_{I_{q + 1} I_{q}} exp (- τ_{q} D_{I_{q}}) Θ (τ_{q} \geq 0))} \\ = {\hat{W}}_{I_{k} I_{k - 1}} exp (- τ_{k - 1} D_{I_{k - 1}}) Θ (τ \geq 0) . \end{array}

The last line is actually independent (in the functional rather than probabilistic sense of “independent”) of the quantities ${[I_{q}]}_{1}^{k - 2}, {[τ_{q}]}_{0}^{k - 2}$, and k, so we can drop all these arguments from $\overset{ˇ}{Pr} (\dots)$. Restating,

\begin{array}{l} \overset{ˇ}{Pr} (I_{k}, τ_{k - 1} ∣ I_{k - 1}) = {\hat{W}}_{I_{k} I_{k - 1}} exp (- τ_{k - 1} D_{I_{k - 1}}) Θ (τ_{k - 1} \geq 0) \\ or \\ \overset{ˇ}{Pr} (I, τ ∣ K) = {\hat{W}}_{I K} exp (- τ D_{K}) Θ (τ \geq 0) . \end{array}
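The single-step density just derived is what one step of the SSA samples. The following is a minimal Python sketch rather than the paper's implementation. It assumes a small finite state space, a hypothetical nested-list layout `W_hat[I][K]` for the off-diagonal rates, and the probability-conserving choice D_K = Σ_I Ŵ_{IK} (an assumption made here so that the density normalizes). Under these assumptions the waiting time τ is exponential with rate D_K, and the next state I is chosen with probability Ŵ_{IK}/D_K.

```python
import random

def total_rates(W_hat):
    """D[K] = sum over I of W_hat[I][K] (assumed probability-conserving)."""
    n = len(W_hat)
    return [sum(W_hat[I][K] for I in range(n)) for K in range(n)]

def sample_step(W_hat, K, rng):
    """Draw (I, tau) from the density W_hat[I][K] * exp(-tau * D[K]), tau >= 0."""
    D_K = total_rates(W_hat)[K]
    if D_K <= 0.0:
        raise ValueError("state K has no outgoing rate")
    # The marginal density of tau is exponential with rate D_K.
    tau = rng.expovariate(D_K)
    # The next state I has probability W_hat[I][K] / D_K, independent of tau.
    u = rng.random() * D_K
    cumulative = 0.0
    for I in range(len(W_hat)):
        cumulative += W_hat[I][K]
        if u < cumulative:
            return I, tau
    # Floating-point fallback: last state reachable from K.
    last = max(I for I in range(len(W_hat)) if W_hat[I][K] > 0.0)
    return last, tau

# Hypothetical 3-state example: column K holds the rates out of state K.
W_hat = [[0.0, 0.0, 1.0],
         [2.0, 0.0, 0.0],
         [1.0, 3.0, 0.0]]
rng = random.Random(0)
next_state, tau = sample_step(W_hat, 0, rng)
```

Repeating `sample_step` from the returned state generates a trajectory one jump at a time.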

Importantly, this expression for $\overset{ˇ}{Pr} (I, τ ∣ K)$ is equal to $\tilde{Pr} (I, τ ∣ k = 1, K)$ as calculated at the end of the last section. The recursive statement of the Bayesian recurrence property (Equation 37) then becomes:

\tilde{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J) = \overset{ˇ}{Pr} (I_{k}, τ_{k - 1} ∣ I_{k - 1}) \tilde{Pr} ({[I_{q}]}_{1}^{k - 1}, {[τ_{q}]}_{0}^{k - 2} ∣ k - 1, J) .

Since $\overset{ˇ}{Pr} (I, τ ∣ J) = \tilde{Pr} (I, τ ∣ k = 1, J)$ , we find for all k ≥ 2 the Bayesian recurrence relation in terms of $\tilde{Pr} (\dots)$ alone:

\tilde{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J) = \tilde{Pr} (I_{k}, τ_{k - 1} ∣ 1, I_{k - 1}) \tilde{Pr} ({[I_{q}]}_{1}^{k - 1}, {[τ_{q}]}_{0}^{k - 2} ∣ k - 1, J) .

(38)
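Unrolling Equation 38 down to k = 1 expresses the joint density of a whole jump sequence as a product of single-step factors. The sketch below illustrates this in Python under stated assumptions: the nested-list layout `W_hat[I][K]`, the list `D` of diagonal decay rates, and the function names are all hypothetical choices made for this example.

```python
import math

def step_log_density(W_hat, D, I, tau, K):
    """log of W_hat[I][K] * exp(-tau * D[K]); -inf if the step is impossible."""
    if tau < 0.0 or W_hat[I][K] <= 0.0:
        return float("-inf")
    return math.log(W_hat[I][K]) - tau * D[K]

def trajectory_log_density(W_hat, D, J, states, taus):
    """Log of the k-step joint density, accumulated one factor at a time.

    states = [I_1, ..., I_k] and taus = [tau_0, ..., tau_{k-1}], starting from J.
    """
    if len(states) != len(taus):
        raise ValueError("need exactly one waiting time per jump")
    log_p = 0.0
    previous = J
    for I, tau in zip(states, taus):
        # One application of the recurrence: multiply by the single-step factor.
        log_p += step_log_density(W_hat, D, I, tau, previous)
        previous = I
    return log_p

# Hypothetical cyclic chain 0 -> 1 -> 2 -> 0 with rates 2, 3, 1.
W_hat = [[0.0, 0.0, 1.0],
         [2.0, 0.0, 0.0],
         [0.0, 3.0, 0.0]]
D = [2.0, 3.0, 1.0]
log_p = trajectory_log_density(W_hat, D, 0, [1, 2], [0.2, 0.1])
```

Working with logarithms avoids underflow for long trajectories; the product form itself is unchanged.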

C.3 Markov Chain Derivation

Here we provide the omitted details for Section 3.1.2. Continuing from the foregoing Bayesian recurrence property (Equation 38), we multiply both sides by a delta function that fixes the total elapsed time t_k, giving the following equation, which we then sum over all I_q except I_k = I and I₀ = J and integrate over all τ_q:

\begin{array}{l} \tilde{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J) δ (t_{k} - \sum_{q = 0}^{k - 1} τ_{q}) = \tilde{Pr} (I_{k}, τ_{k - 1} ∣ 1, I_{k - 1}) \\ \times \tilde{Pr} ({[I_{q}]}_{1}^{k - 1}, {[τ_{q}]}_{0}^{k - 2} ∣ k - 1, J) δ (t_{k} - \sum_{q = 0}^{k - 1} τ_{q}) . \end{array}

We define and calculate

\begin{array}{l} \tilde{Pr} (I, t_{k} ∣ k, J) ≜ \sum_{{{[I_{q}]}_{1}^{k - 1}}} \int_{0}^{\infty} \dots \int_{0}^{\infty} {[d τ_{q}]}_{0}^{k - 1} \tilde{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J) δ (t_{k} - \sum_{q = 0}^{k - 1} τ_{q}) \\ = \sum_{{{[I_{q}]}_{1}^{k - 1}}} \int_{0}^{\infty} \dots \int_{0}^{\infty} {[d τ_{q}]}_{0}^{k - 1} \tilde{Pr} (I_{k}, τ_{k - 1} ∣ 1, I_{k - 1}) \tilde{Pr} ({[I_{q}]}_{1}^{k - 1}, {[τ_{q}]}_{0}^{k - 2} ∣ k - 1, J) δ (t_{k} - \sum_{q = 0}^{k - 1} τ_{q}) \\ = \sum_{I_{k - 1}} \int_{0}^{\infty} d τ_{k - 1} \tilde{Pr} (I_{k}, τ_{k - 1} ∣ 1, I_{k - 1}) \\ \times \sum_{{{[I_{q}]}_{1}^{k - 2}}} \int_{0}^{\infty} \dots \int_{0}^{\infty} {[d τ_{q}]}_{0}^{k - 2} \tilde{Pr} ({[I_{q}]}_{1}^{k - 1}, {[τ_{q}]}_{0}^{k - 2} ∣ k - 1, J) δ (t_{k} - τ_{k - 1} - \sum_{q = 0}^{k - 2} τ_{q}) \\ = \sum_{I_{k - 1}} \int_{0}^{\infty} d τ_{k - 1} \tilde{Pr} (I_{k}, τ_{k - 1} ∣ 1, I_{k - 1}) \tilde{Pr} (I_{k - 1}, t_{k} - τ_{k - 1} ∣ k - 1, J) \end{array}

This $\tilde{Pr} (I, t_{k} ∣ k, J)$ is also a probability density:

\sum_{I_{k}} \int_{0}^{\infty} {d t}_{k} \tilde{Pr} (I_{k}, t_{k} ∣ k, J) = \sum_{{{[I_{q}]}_{1}^{k}}} \int_{0}^{\infty} \dots \int_{0}^{\infty} {[d τ_{q}]}_{0}^{k - 1} \tilde{Pr} ({[I_{q}]}_{1}^{k}, {[τ_{q}]}_{0}^{k - 1} ∣ k, J) = 1

as shown in Equation 35 above, using the definition of Equation 36. Summarizing its Markov property:

\tilde{Pr} (I, t ∣ k, J) = \sum_{K} \int_{0}^{t} d τ \tilde{Pr} (I, τ ∣ 1, K) \tilde{Pr} (K, t - τ ∣ k - 1, J) .

(39)
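Equation 39 can be checked numerically on a small example. The sketch below is an illustration under stated assumptions, not part of the original derivation: it uses a hypothetical three-state cyclic chain, the nested-list layout `W_hat[I][K]`, and a simple midpoint quadrature for the integral over τ. For k = 2 the convolution has the closed form Ŵ_{IK} Ŵ_{KJ} (e^{-D_J t} - e^{-D_K t}) / (D_K - D_J) when D_K ≠ D_J, which the recursion should reproduce.

```python
import math

def step_density(W_hat, D, I, tau, K):
    """Single-step density W_hat[I][K] * exp(-tau * D[K]) for tau >= 0."""
    if tau < 0.0:
        return 0.0
    return W_hat[I][K] * math.exp(-tau * D[K])

def k_step_density(W_hat, D, I, t, k, J, n_grid=400):
    """Density of being in I after exactly k jumps at total time t, from J.

    Implements the convolution recursion with a midpoint rule on [0, t].
    The cost grows as n_grid**(k - 1), so this is only meant for small k.
    """
    if k == 1:
        return step_density(W_hat, D, I, t, J)
    h = t / n_grid
    total = 0.0
    for K in range(len(D)):
        if W_hat[I][K] == 0.0:
            continue  # no direct jump from K to I
        for m in range(n_grid):
            tau = (m + 0.5) * h
            last_jump = step_density(W_hat, D, I, tau, K)
            earlier = k_step_density(W_hat, D, K, t - tau, k - 1, J, n_grid)
            total += last_jump * earlier * h
    return total

# Hypothetical cyclic chain 0 -> 1 -> 2 -> 0 with rates 2, 3, 1.
W_hat = [[0.0, 0.0, 1.0],
         [2.0, 0.0, 0.0],
         [0.0, 3.0, 0.0]]
D = [2.0, 3.0, 1.0]
numeric = k_step_density(W_hat, D, 2, 0.5, 2, 0)
closed_form = 3.0 * 2.0 * (math.exp(-2.0 * 0.5) - math.exp(-3.0 * 0.5)) / (3.0 - 2.0)
```

The quadrature is used only to keep the example short; any standard integrator would serve.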

D Appendix II: Maximum likelihood parameter inference

Application of the TOPE to maximum-likelihood parameter learning in stochastic reaction networks has previously been presented [20]. Here, for completeness of presentation for a different audience, we just show the essential gradient calculation step.

Suppose we have observations of the state of a chemical reaction network at times {t_s}, and wish to improve the probability P (Data|Model) of a reaction network model for the flow of probability at intermediate times. We will use the TOPE for each time interval in between observation times t_s:

[e^{(t_{s + 1} - t_{s}) \tilde{W}}] (x (t_{s + 1}), x (t_{s})) = \sum_{n = 0}^{\infty} \int_{t_{s}}^{t_{s + 1}} \dots \int_{t_{s}}^{t_{s + 1}} d {[τ]}_{0}^{n} δ ((t_{s + 1} - t_{s}) - \sum_{p = 0}^{n} τ_{p}) \times [e^{τ_{n} \tilde{D}} \hat{W} \dots e^{τ_{1} \tilde{D}} \hat{W} e^{τ_{0} \tilde{D}}] (x (t_{s + 1}), x (t_{s}))

(40)
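As a hedged sanity check of Equation 40, consider a pure birth process with constant rate ρ (an illustrative assumption, not an example taken from this appendix). Then Ŵ moves x to x + 1 with rate ρ, and each diagonal decay factor between events is assumed to be e^{-ρ τ_p}. Only the term whose number of jumps n equals x(t_{s+1}) − x(t_s) contributes, and with Δt = t_{s+1} − t_s the time integral reduces to the volume of a simplex:

```latex
[e^{\Delta t \, W}] (x + n, x) = ρ^{n} \int_{0}^{\infty} \dots \int_{0}^{\infty} d {[τ]}_{0}^{n} \, δ (\Delta t - \sum_{p = 0}^{n} τ_{p}) \, e^{- ρ \sum_{p = 0}^{n} τ_{p}} = ρ^{n} e^{- ρ \Delta t} \frac{{(\Delta t)}^{n}}{n !}
```

This is the Poisson distribution for the number of births in time Δt, as expected for a constant-rate birth process.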

We will need to compute the derivatives of the probability in Equation 40 with respect to reaction rates:

\begin{array}{l} ρ_{r} \frac{\partial}{\partial ρ_{r}} [e^{(t_{s + 1} - t_{s}) W}] (x (t_{s + 1}), x (t_{s})) = \sum_{n = 0}^{\infty} \int_{t_{s}}^{t_{s + 1}} \dots \int_{t_{s}}^{t_{s + 1}} d {[τ]}_{0}^{n} δ ((t_{s + 1} - t_{s}) - \sum_{p = 0}^{n} τ_{p}) \\ \times \sum_{p = 0}^{n} [e^{- τ_{n} D} \hat{W} \dots e^{- τ_{p + 1} D} (ρ_{r} {\hat{W}}_{r}) e^{- τ_{p} D} \dots e^{- τ_{1} D} \hat{W} e^{- τ_{0} D}] (x (t_{s + 1}), x (t_{s})) \\ - \sum_{n = 0}^{\infty} \int_{t_{s}}^{t_{s + 1}} \dots \int_{t_{s}}^{t_{s + 1}} d {[τ]}_{0}^{n} δ ((t_{s + 1} - t_{s}) - \sum_{p = 0}^{n} τ_{p}) \\ \times \sum_{p = 0}^{n} [e^{- τ_{n} D} \hat{W} \dots \hat{W} (ρ_{r} τ_{p} D_{r}) e^{- τ_{p} D} \hat{W} \dots e^{- τ_{1} D} \hat{W} e^{- τ_{0} D}] (x (t_{s + 1}), x (t_{s})) \\ ρ_{r} {[{\hat{W}}_{r}]}_{I J} = (\frac{ρ_{r} {[{\hat{W}}_{r}]}_{I J}}{\sum_{r} ρ_{r} {[{\hat{W}}_{r}]}_{I J}}) ({[\hat{W}]}_{I J}) = b_{rIJ} {[\hat{W}]}_{I J} \end{array}

where we defined the “branching ratio”

b_{rIJ} \equiv (\frac{ρ_{r} {[{\hat{W}}_{r}]}_{I J}}{\sum_{r} ρ_{r} {[{\hat{W}}_{r}]}_{I J}}) = {〈 δ_{r, R (I, J)} 〉}_{p (I ∣ J)}

for reaction r in state J, assuming each reaction r results in just one output state I per input state J. Here R(I, J) is the random variable denoting the actual reaction chosen in transitioning from state J to state I. Then

\begin{array}{l} ρ_{r} \frac{\partial}{\partial ρ_{r}} [e^{(t_{s + 1} - t_{s}) \tilde{W}}] (x (t_{s + 1}), x (t_{s})) = \sum_{n = 0}^{\infty} \int_{t_{s}}^{t_{s + 1}} \dots \int_{t_{s}}^{t_{s + 1}} d {[τ]}_{0}^{n} δ ((t_{s + 1} - t_{s}) - \sum_{p = 0}^{n} τ_{p}) \\ \times \sum_{p = 0}^{n} [e^{- τ_{n} D} \hat{W} \dots e^{- τ_{p + 1} D} (b_{r} \hat{W} - \hat{W} ρ_{r} τ_{p} D_{r}) e^{- τ_{p} D} \dots e^{- τ_{1} D} \hat{W} e^{- τ_{0} D}] (x (t_{s + 1}), x (t_{s})) \\ or \\ ρ_{r} \frac{\partial}{\partial ρ_{r}} [e^{(t_{s + 1} - t_{s}) \tilde{W}}] (x (t_{s + 1}), x (t_{s})) \\ = \sum_{n = 0}^{\infty} \sum_{p = 0}^{n} {〈 b_{r} (reaction event p out of n) 〉}_{\hat{W}, x (t_{s + 1}), x (t_{s})} \\ - ρ_{r} \sum_{n = 0}^{\infty} \sum_{p = 0}^{n} {〈 τ_{p} D_{r} 〉}_{\hat{W}, x (t_{s + 1}), x (t_{s})} \end{array}

This, finally, is a quantity that is easy to compute as a running average during a simulation of the network with the current, still incorrect, values of the parameters, thereby contributing to the calculation of an improved set of parameter values in a stochastic gradient descent algorithm. This is the key update equation in a learning algorithm for reaction rates in stochastic biochemical networks (extensible to other process networks). Algorithmic details can be found in [20], noting particularly Equation 2.4 therein. A related stochastic learning algorithm is proposed in [8].
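The two running sums in the final expression can be accumulated along a simulated trajectory as follows. This is a minimal Python sketch under stated assumptions, not the algorithm of [20]: it processes a single, fully recorded trajectory and does not implement the conditioning on the observed endpoints x(t_s) and x(t_{s+1}). The trajectory layout, the `propensities` callback (returning ρ_r times the diagonal element of D_r in a given state), and the function name are hypothetical. It also assumes each reaction leads to one output state per input state, so that the branching-ratio term reduces to counting firings of reaction r.

```python
def running_sums(trajectory, propensities, n_reactions):
    """Accumulate, per reaction r, the firing count and sum_p tau_p * a_r(x_p).

    trajectory: list of (state, tau, fired), where `fired` is the index of the
    reaction that ends the interval, or None for the final interval in which
    no reaction fires before the observation time.
    propensities(state): list [a_0(state), ..., a_{R-1}(state)].
    Returns (counts, exposure, difference) with difference = counts - exposure.
    """
    counts = [0.0] * n_reactions
    exposure = [0.0] * n_reactions
    for state, tau, fired in trajectory:
        a = propensities(state)
        for r in range(n_reactions):
            # Contribution of the tau_p * rho_r * D_r term for this interval.
            exposure[r] += tau * a[r]
        if fired is not None:
            # Contribution of the branching-ratio term for this event.
            counts[fired] += 1.0
    difference = [c - e for c, e in zip(counts, exposure)]
    return counts, exposure, difference

# Hypothetical birth-death example: reaction 0 is birth with constant
# propensity 2.0, reaction 1 is death with propensity 0.5 * x.
def birth_death_propensities(x):
    return [2.0, 0.5 * x]

trajectory = [(0, 0.5, 0),      # wait 0.5 in x = 0, then a birth
              (1, 1.0, 1),      # wait 1.0 in x = 1, then a death
              (0, 0.25, None)]  # final 0.25 in x = 0 with no event
counts, exposure, difference = running_sums(trajectory, birth_death_propensities, 2)
```

Averaging `difference` over many simulated trajectories gives the kind of running average described above; how trajectories are matched to the observed data is left to [20].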

E Appendix III: Dynamical grammar for root growth

Given the following simplified function definitions, among others:

gGrowthModelMult = 1;
growthConst = 1/gCellCyleTime;
yEffectOnDivisionFunc[y_] := Module[{δ, h1, h2},
 0.005 + 100 * ((y/q1)^pv1 / (1 + (y/q2)^pv2)) /. {q1 → 5, q2 → 1, pv1 → 1, pv2 → 5}
]
cellGrowthLocFunc[rad_] := Module[{},
gGrowthModelMult * growthConst
];
springXFunc[curPosx_, curRad_, nbrPosx_, nbrRad_] :=
-∂_curPosx springPotential[curPosx, curRad, nbrPosx, nbrRad]

The actual grammar for selected rules is shown here:

gRootGrowth := Grammar[rules→
{
(* continuous change in cell c1 radius *)
{c1 Equal cell[cellID1, 1(* growth mode *), loc1, rad1, auxin1, y1, cellIDP, cellIDN]}→ c1,
solving[rad1′ Equal cellGrowthLocFunc[rad1]],
(* continuous change in cell c1 location *)
{c1 Equal cell[cellID, cMode, loc, rad, auxin, y, cellIDPrev, cellIDNext],
c2 Equal cell[cellIDNext, cModeN, locN, radN, auxinN, yN, cellID, cellIDNN]}→ {c1, c2},
solving[loc′ Equal gGrowthModelMult * springXFunc[loc, rad, locN, radN]],
(* continuous change in cell c1 location *)
{c1 Equal cell[cellID, cMode, loc, rad, auxin, y, cellIDPrev, cellIDNext],
c2 Equal cell[cellIDPrev, cModeP, locP, radP, auxinP, yP, cellIDPP, cellID]}→ {c1, c2},
solving[loc′ Equal gGrowthModelMult * springXFunc[loc, rad, locP, radP]],
(* change cell mode from growth to wait, when over a radius threshold *)
cell[cellID, 1, loc, rad, auxin, y, cellIDPrev, cellIDNext]→
cell[cellID, 2, loc, rad, auxin, y, cellIDPrev, cellIDNext],
with[gGrowthModelMult * stopGrowthConst * grammarSigmoid[rad - gLimitCellRad, gDivideTemp]],
(* divide a cell when it is in wait mode *)
cell[cellID, 2, loc, rad, auxin, y, cellIDPrev, cellIDNext]→ {
cell[cellIDPrev, cModeP, locP, radP, auxinP, yP, cellIDPP, cellID]→
cell[cellIDPrev, cModeP, locP, radP, auxinP, yP, cellIDPP, grammarCreateObjectID[1]],
cell[cellIDNext, cModeN, locN, radN, auxinN, yN, cellID, cellIDNN]→
cell[cellIDNext, cModeN, locN, radN, auxinN, yN, grammarCreateObjectID[2], cellIDNN],
cell[grammarCreateObjectID[1], 1, loc - rad + 2rad * cellpart + rad * (1 - cellpart),
rad * (1 - cellpart), auxin, y, cellIDPrev, grammarCreateObjectID[2]],
cell[grammarCreateObjectID[2], 1, loc - rad + rad * cellpart,
rad * cellpart, auxin, y, grammarCreateObjectID[1], cellIDNext]},
with[gGrowthModelMult * yEffectOnDivisionFunc[y] *
grammarPDF[UniformDistribution[{0.5 - gRangeParam, 0.5 + gRangeParam}], cellpart]],
(* … more rules … *)
(* auxin/y passive transport between two neighboring cells *)
{c0 Equal cell[cellID0, cMode0, loc0, rad0, auxin0, y0, cellIDP0, cellID1],
c1 Equal cell[cellID1, cMode1, loc1, rad1, auxin1, y1, cellID0, cellIDNext]}→ {c0, c1},
solving[auxin1′ Equal pt(auxin0 - auxin1), auxin0′ Equal pt(auxin1 - auxin0),
y1′ Equal pty(y0 - y1), y0′ Equal pty(y1 - y0)],
(* … more rules … *)
}];

Note that for efficiency, the symbolic partial derivative is taken out of the grammar (rule 3, biomechanics) and precomputed. Also, the cell division rule above actually has the form of a compound rule, whose right hand side comprises two further rules. This point was simplified out of the notation in the main text. It is an efficiency measure that allows a rule firing to be a multistep process (similar to the subgrammars or macros of [4]) without slowing down the computational identification of cells likely to divide specified by the with clause of the rule. However, its use here relies on the dynamically invariant, domain-specific fact that each cell is the nth neighbor (in this case n=1 or 2) of at most one other cell.
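The arithmetic of the cell division rule can be checked in isolation. The following Python sketch is an illustration under stated assumptions: it reads `loc` as the centre of a cell along the root axis and `rad` as its half-length (an interpretation, though one under which the rule's expressions make the two daughters tile the parent exactly), and the class and function names are hypothetical. Neighbor links and the stochastic choice of `cellpart` are omitted.

```python
from dataclasses import dataclass

@dataclass
class Cell:
    loc: float  # assumed: centre of the cell along the root axis
    rad: float  # assumed: half-length of the cell

def divide(parent, cellpart):
    """Return the two daughters produced by the division rule's arithmetic.

    daughter1 corresponds to grammarCreateObjectID[1] and daughter2 to
    grammarCreateObjectID[2] in the grammar listing.
    """
    if not 0.0 < cellpart < 1.0:
        raise ValueError("cellpart must lie strictly between 0 and 1")
    loc, rad = parent.loc, parent.rad
    daughter1 = Cell(loc=loc - rad + 2 * rad * cellpart + rad * (1 - cellpart),
                     rad=rad * (1 - cellpart))
    daughter2 = Cell(loc=loc - rad + rad * cellpart,
                     rad=rad * cellpart)
    return daughter1, daughter2

parent = Cell(loc=10.0, rad=2.0)
d1, d2 = divide(parent, 0.25)
# d2 spans [8, 9] and d1 spans [9, 12]; together they cover the parent's [8, 12].
```

In the grammar itself, `cellpart` is drawn from the uniform distribution given in the rule's with clause.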

References

1. Doi J. Second quantization representation for classical many-particle system. J Phys A: Math Gen. 1976;9:1465.
2. Doi J. Stochastic theory of diffusion-controlled reaction. J Phys A: Math Gen. 1976;9:1479.
3. Mattis DC, Glasser ML. The uses of quantum field theory in diffusion-limited reactions. Rev Mod Phys. 1998;70:979–1001.
4. Mjolsness E, Yosiphon G. Stochastic Process Semantics for Dynamical Grammars. Annals of Mathematics and Artificial Intelligence. 2006;47(3–4).
5. Fried HM. Green’s Functions and Ordered Exponentials. Cambridge University Press; 2002.
6. Bender CM, Brandt SF, Chen JH, Wang Q. Ghost Busting: PT-Symmetric Interpretation of the Lee Model. Physical Review D. 2005;71:025014.
7. Zhang X, DeCock K, Bugallo MF, Djurić PM. A general method for the computation of probabilities in systems of first order chemical reactions. J Chem Phys. 2005;122:104101. doi: 10.1063/1.1855311.
8. Yosiphon G, Mjolsness E. Towards the Inference of Stochastic Biochemical Network and Parameterized Grammar Models. In: Lawrence N, Girolami M, Rattray M, Sanguinetti G, editors. Learning and Inference in Computational Systems Biology. MIT Press; 2010.
9. Wilkinson DJ. Stochastic Modelling for Systems Biology. Chapman & Hall/CRC Press; Boca Raton, Florida: 2006.
10. Mjolsness E, Orendorff D, Chatelain P, Koumoutsakos P. An Exact Accelerated Stochastic Simulation Algorithm. Journal of Chemical Physics. 2009;130:144110. doi: 10.1063/1.3078490.
11. Orendorff D. Exact and Hierarchical Reaction Leaping: Asymptotic Improvements to the Stochastic Simulation Algorithm. PhD thesis. UC Irvine Computer Science Department; June 2012. Thesis available at: http://computableplant.ics.uci.edu/~dorendor/thesis.
12. Forgy C. Rete: A fast algorithm for the many pattern/many object pattern match problem. Artificial Intelligence. 1982;19:17–37.
13. Hlavacek WS, Faeder JR, Blinov ML, Posner RG, Hucka M, Fontana W. Rules for modeling signal-transduction systems. Science’s STKE. 2006:re6. doi: 10.1126/stke.3442006re6.
14. Danos V, Feret J, Fontana W, Harmer R, Krivine J. Rule-based modelling of cellular signaling. Lect Notes Comput Sci. 2007;4703:17–41.
15. Yosiphon G. Stochastic Parameterized Grammars: Formalization, Inference, and Modeling Applications. PhD thesis. UC Irvine Computer Science Department; June 2009. Thesis and software: http://computableplant.ics.uci.edu/~guy/Plenum.html.
16. Crudu A, Debussche A, Radulescu O. Hybrid stochastic simplifications for multiscale gene networks. BMC Systems Biology. 2009;3:89. doi: 10.1186/1752-0509-3-89.
17. Roeder AHK. When and where plant cells divide: a perspective from computational modeling. Current Opinion in Plant Biology. 2012;15:1–7. doi: 10.1016/j.pbi.2012.08.002.
18. Mironova VV, Omelyanchuk NA, Yosiphon G, Fadeev SI, Kolchanov NA, Mjolsness E, Likhoshvai VA. A plausible mechanism for auxin patterning along the developing root. BMC Systems Biology. 2010;4:98. doi: 10.1186/1752-0509-4-98.
19. Likhoshvai VA, Demidenko GV, Fadeev SI. Modeling of Gene Expression by the Delay Equation. Bioinformatics of Genome Regulation and Structure II (2006): Part. 2006;3:421–431. doi: 10.1007/0-387-29455-4_40.
20. Wang Y, Christley S, Mjolsness E, Xie X. Parameter inference for discretely observed stochastic kinetic models using stochastic gradient descent. BMC Systems Biology. 2010;4:99. doi: 10.1186/1752-0509-4-99.

Supplementary Materials

PB446673suppdata.zip (301.1 KB, zip)
