Skip to main content
The Journal of Chemical Physics logoLink to The Journal of Chemical Physics
. 2014 May 28;140(20):204108. doi: 10.1063/1.4878662

Modeling delay in genetic networks: From delay birth-death processes to delay stochastic differential equations

Chinmaya Gupta 1, José Manuel López 1, Robert Azencott 1, Matthew R Bennett 2, Krešimir Josić 1,3, William Ott 1
PMCID: PMC4039735  PMID: 24880267

Abstract

Delay is an important and ubiquitous aspect of many biochemical processes. For example, delay plays a central role in the dynamics of genetic regulatory networks as it stems from the sequential assembly of first mRNA and then protein. Genetic regulatory networks are therefore frequently modeled as stochastic birth-death processes with delay. Here, we examine the relationship between delay birth-death processes and their appropriate approximating delay chemical Langevin equations. We prove a quantitative bound on the error between the pathwise realizations of these two processes. Our results hold for both fixed delay and distributed delay. Simulations demonstrate that the delay chemical Langevin approximation is accurate even at moderate system sizes. It captures dynamical features such as the oscillatory behavior in negative feedback circuits, cross-correlations between nodes in a network, and spatial and temporal information in two commonly studied motifs of metastability in biochemical systems. Overall, these results provide a foundation for using delay stochastic differential equations to approximate the dynamics of birth-death processes with delay.

INTRODUCTION

Gene regulatory networks play a central role in cellular function by translating genotype into phenotype. By dynamically controlling gene expression, gene regulatory networks provide cells with a mechanism for responding to environmental challenges. Therefore, creating accurate mathematical models of gene regulation is a central goal of mathematical biology.

Delay in protein production can significantly affect the dynamics of gene regulatory networks. For example, delay can induce oscillations in systems with negative feedback,1, 2, 3, 4, 5, 6, 7 and has been implicated in the production of robust, tunable oscillations in synthetic gene circuits containing linked positive and negative feedback.8, 9 Indeed, delayed negative feedback is thought to govern the dynamics of circadian oscillators,10, 11 a hypothesis experimentally verified in mammalian cells.12

In genetic regulatory networks, noise and delay interact in subtle and complex ways. Delay can affect the stochastic properties of gene expression and hence the phenotype of the cell.2, 13, 14, 15, 16 It is well known that noise can induce switching in bistable genetic circuits;17, 18, 19, 20, 21, 22, 23, 24, 25, 26 the infusion of delay dramatically enhances the stability of such circuits27 and can induce an analog of stochastic resonance.28, 29 Variability in the delay time (distributed delay) can accelerate signaling in transcriptional signaling cascades.30

Given the importance of delay in gene regulatory networks, it is necessary to develop methods to simulate and analyze such systems across spatial scales. In the absence of delay, it is well known that chemical reaction networks are accurately modeled by ordinary differential equations (ODEs) in the thermodynamic limit, i.e., when molecule numbers are sufficiently large. When molecule numbers are small, however, stochastic effects can dominate. In this case, the chemical master equation (CME) describes the evolution of the probability density function over all states of the system. Gillespie's stochastic simulation algorithm (SSA)31 samples trajectories from the probability distribution described by the CME.

While exact, the CME is difficult to analyze and the SSA can be computationally expensive. To address these issues, a hierarchy of coarse-grained approximations of the SSA has been developed32 (see Figure 1): Spatially discrete approximations, such as τ-leaping33, 34, 35, 36 and K-leaping37 trade exactness for efficiency. At the next level are chemical Langevin equations (CLEs), which are stochastic differential equations (SDEs) of dimension equal to the number of species in the biochemical system. CLEs offer two advantages. First, unlike the SSA, the well-developed ideas from random dynamical systems and stochastic differential equations apply to CLEs. Second, it is straightforward to simulate large systems using CLEs. Finally, in the thermodynamic limit, one arrives at the end of the Markovian hierarchy: the reaction rate equation (RRE).

Figure 1.

Figure 1

Schematic of the modeling hierarchy for biochemical systems. The black arrows link the various components of the theory for Markov systems (no delay); red arrows link the corresponding delay components. Numbers attached to arrows refer to papers that establish the corresponding links. Empty arrowheads denote heuristic derivations; in this paper, we rigorously establish the dSSA to dCLE and dCLE to dRRE links.

The Markovian hierarchy above (no delay) is well-understood,32, 40 but a complete analogue of the Markovian theory does not yet exist for systems with delay. The SSA has been generalized to a delay version – the dSSA – to allow for both fixed2, 41 and variable30, 38 delay. Notably, the modification of the SSA to incorporate non-Markov processes predates the dSSA;42 here, many exponential steps from a linear chain are replaced by a gamma-distributed delay. Some analogues of τ-leaping exist for systems with delay; see, e.g., D-leaping.43

Several methods have been used to formally derive a delay chemical Langevin equation (dCLE) from the delay chemical master equation (dCME); see Sec. 4 for details. Brett and Galla39 use the path integral formalism of Martin, Siggia, Rose, Janssen, and de Dominicis to derive a dCLE approximation without relying on a master equation. The Brett and Galla derivation produces the “correct” dCLE approximation of the underlying delay birth-death (dBD) process in the sense that the first and second moments of the dCLE match those of the dBD process. However, their derivation has some limitations (see Sec. 4). In particular, it gives no rigorous quantitative information about the distance between the dBD process and the dCLE; it does not clarify for what types of deterministic dynamics (in the thermodynamic limit) the dCLE accurately approximates the delay birth-death process, and on what timescales; and it does not clarify in what sense (distributionally or pathwise) the dCLE approximates the delay birth-death process. Our analysis answers these questions.

In this paper, we establish a rigorous link between dBD processes and dCLEs by proving that the distance between the dBD process and the correct approximating dCLE process converges to zero as system size tends to infinity (as measured by expectations of functionals of the processes). In particular, this result applies to all moments. It is natural to express distance in terms of expectations of functionals because the dBD process is spatially discrete while the correct dCLE produces continuous trajectories (see Figure 2). This first result is just a corollary of our main mathematical theorem. We rigorously estimate the probability that a trajectory of the correct dCLE remains inside a narrow tube around the corresponding trajectory of the dBD process. This quantitative tube estimate will allow practitioners to determine the accuracy of simulations of concrete biochemical reaction networks. The mathematical results hold for both fixed delay and distributed delay (see Figure 3a).

Figure 2.

Figure 2

(a) Typical trace obtained by using the dCLE. (b) Corresponding trace obtained using dSSA. By zooming into a small time segment, we see that the trace obtained from the dCLE (c) consists of equi-spaced time points (corresponding to an Euler discretization) and is continuous. The corresponding segment of the trace from the dSSA (d) consists of events that occur at random times and has jump discontinuities. The displayed traces were gathered after a long transient. The model simulated is the single-gene positive feedback model with N = 500; see Sec. 2C1.

Figure 3.

Figure 3

(a) The effect of distributed delay on the protein production process. The “input process” is the first step in the transcription process, while the “output process” is the final mature product that enters the population. The time delay τ accounts for the lag between the initialization of transcription and the production of mature product. In a system with distributed delay, different production events can have different delay times; the order of the output process may therefore not match that of the input process. (b)–(e) Simulated gene regulatory network motifs: a transcriptional cascade (b), oscillators (c), and metastable systems (d) and (e).

The correct dCLE approximation is distinguished within the class of Gaussian approximations of the dBD process by the fact that it matches both the first and second moments of the dBD process. As we will see, it performs remarkably well at moderate system sizes in a number of dynamical settings: steady state dynamics, oscillatory dynamics, and certain metastable switches. We will demonstrate via simulation and argue mathematically using characteristic functions that no other Gaussian process with appropriately scaled noise performs as well. In the following, the term “dCLE” shall refer specifically to the dCLE derived by Brett and Galla39 and expressed by 15, unless specifically stated otherwise. We provide a proof of our mathematical results in the supplementary material.44

SIMULATIONS/OUTLINE AND INTERPRETATION OF RESULTS

Genetic regulatory networks may be simulated using an exact dSSA to account for transcriptional delay.2, 30, 38, 41 Here, we provide a heuristic derivation of a related dCLE, and show that in a number of concrete examples it provides an excellent approximation of the system (see Figure 3). These simulations raise the following questions: Is the dCLE approximation valid in general? Can the expected quality of the approximation be quantified in general? We answer these questions mathematically in Sec. 3.

We will adopt the following notation for reactions with delay:

X+Yμα(X,Y)Z.

Here, α(X, Y) denotes the rate of the reaction, the dashed arrow indicates a reaction with delay, and μ is a probability measure that describes the delay distribution. Solid arrows indicate reactions without delay.

A transcriptional cascade

First, we consider a transcriptional cascade with two genes that code for proteins X and Y. Protein X is produced at a basal rate; production of Y is induced by the presence of X. The state of the system is represented by an ordered pair (X, Y). Note that we use X and Y to denote both protein names and protein numbers. The reactions in the network, and the associated state change vectors vi, are given by

μaXv1=(1,0), (1)
Xb1Xv2=(1,0), (2)
μψ(X)Yv3=(0,1), (3)
Yb2Yv4=(0,1). (4)

This system can be simulated exactly using the dSSA: Suppose the state of the system and the reactions in the queue are known at time t0 (the queued reactions can be thought of as the “input process”; see Figure 3a), and that the delay kernel μ is supported on a finite interval [0, τ0].45

  1. Sample a waiting time tw from an exponential distribution with parameter A0:=a+b1X+ψ(X)+b2Y.

  2. If there is a reaction in the queue that is scheduled to exit at time tq < t0 + tw, advance to time tq and set t0tq and (X,Y)(X,Y)+(vi(1),vi(2)), where vi=(vi(1),vi(2)) is the change in the system due to the scheduled reaction. Finally, sample a new waiting time for the next reaction.

  3. If no reaction exits before t0 + tw, set t0t0 + tw and sample a reaction type from the set {1, 2, 3, 4} with probabilities proportional to {a, b1X, ψ(X), b2Y}, respectively. If the reaction chosen is a non-delayed reaction, perform the update (X, Y) ↦ (X − 1, Y) (or (X, Y) ↦ (X, Y − 1)). However, if the reaction chosen is a delayed reaction, the state change vector (1, 0) (or (0, 1)) is put into the queue along with an exit time tq. The difference τ = tqt0 between the current and the exit time is sampled from the delay distribution μ.

We now heuristically derive the dCLE for the feed-forward system from this spatially discrete process.

Suppose the delay kernel μ is given by a probability density function κ supported on [0, τ0] (dμ(s) = κ(s)ds). We first approximate the number of reactions that produce Y (Eq. 3) that will be completed within the interval [t0, t0 + Δ], where t0 denotes the current time and Δ is a small increment. Since the production of Y involves delay, a reaction of this type that is completed within [t0, t0 + Δ] must have been initiated at some time within [t0 − τ0, t0]. Let t0 > t0 − Δ > t0 − 2Δ > ⋯ be the partition of [t0 − τ0, t0] into intervals of length Δ. The (random) number of reactions completed within [t0, t0 + Δ] and initiated within [t0 − (i + 1)Δ, t0iΔ] may be approximated by a Poisson random variable with mean

ψ(Xt0(i+1)Δ)Δ·κ((i+1)Δ)Δ.

Summing over i, the (random) number of reactions completed within [t0, t0 + Δ] may be approximated by a Poisson random variable with mean

Δi[ψ(Xt0(i+1)Δ)(κ((i+1)Δ)Δ)];

this is a Riemann sum that approximates the integral

Δ0τ0ψ(Xt0s)κ(s)ds.

Known as τ-leaping, this line of reasoning produces a Poissonian approximation of the dBD process

δXt=Poisn(aΔ)Poisn(b1XtΔ),δYt=PoisnΔ0τ0ψ(Xts)dμ(s)Poisn(b2YtΔ).

Here, Poisn(η) denotes a Poisson random variable with mean η.

If these Poisson random variables have large mean, they can be approximated by normal random variables. For example, the Poisson variable representing the number of reactions that produce Y can be approximated by a normal random variable with mean and variance equal to Δ0τ0ψ(Xts)dμ(s). Since each reaction changes the state of either X or Y (but never both), it follows that the evolution of the system can be approximated by the stochastic difference equation

δXt=Δ(ab1Xt)+Δ(a+b1Xt)N(0,1), (5a)
δYt=Δ0τ0ψ(Xts)dμ(s)b2Yt+Δ0τ0ψ(Xts)dμ(s)+b2YtN(0,1), (5b)

where N(0,1) is the standard normal random variable.

System 5 may be written in terms of concentrations. Let N be a system size parameter. We think of N as a characteristic system size; Xt/N and Yt/N therefore represent fractions of this characteristic value. Writing xt = Xt/N, yt = Yt/N, ψ˜(x)=ψ(Nx)/N, and assuming that the basal production rate a scales with N as a=a˜N, we obtain

δxt=Δ(a˜b1xt)+1NΔ(a˜+b1xt)N(0,1), (6a)
δyt=Δ0τ0ψ˜(xts)dμ(s)b2yt+1NΔ0τ0ψ˜(xts)dμ(s)+b2ytN(0,1). (6b)

Equation 6 is the Euler–Maruyama type discretization of a delay stochastic differential equation (dSDE). Replacing Δ with dt and ΔN(0,1) with dWt in 6, we obtain

dxt=(a˜b1xt)dt+1N(a˜+b1xt)dWt1, (7a)
dyt=0τ0ψ˜(xts)dμ(s)b2ytdt+1N0τ0ψ˜(xts)dμ(s)+b2ytdWt2. (7b)

This is the dCLE for the transcriptional cascade in this section.

Taking the formal thermodynamic limit, N → ∞, in Eq. 7 yields the reaction rate equations derived in Ref. 38

dxt=a˜b1xtdt, (8a)
dyt=0τ0ψ˜(xts)dμ(s)b2ytdt. (8b)

The dynamics described by Eq. 8 are quite simple; if b1 > 0 and b2 > 0, then 8 has a globally attracting stable stationary point.

To test the validity of the dCLE approximation given in Eq. 7, we examine if it captures the interaction between the two proteins in our transcriptional cascade network. Figure 4 shows the cross correlation functions obtained by simulating the system with N = 1000 using dSSA. From left to right, the curves correspond to fixed delay τ increasing from 0 to 3. The corresponding cross correlation curves for the dCLE approximation 7 are indistinguishable from those obtained using dSSA.

Figure 4.

Figure 4

Cross correlation functions for the two-species feed-forward architecture at system size N = 1000. The inset shows sample trajectories for X (black; top) and Y (blue; bottom). The mean field model has a fixed point, and for this reason, stochastic dynamics stay within a neighborhood of this fixed point (see Theorem 1). However, the effect of increasing delay is clearly seen in the cross correlation function. The color bar displays the delay size corresponding to the cross correlation curves. Parameter values are given by ψ˜(X)=a˜1x3/(m3+x3), a˜=0.92, a˜1=1.39, b1 = b2 = ln (2), m = 1.33, μ = δτ. The cross correlations have been normalized by dividing by the standard deviations σX and σY of X and Y. Time has been normalized to cell cycle length.

In the heuristic derivation above, we first fix N and let Δ → 0 to obtain the dCLE; we then separately let N → ∞ to obtain the thermodynamic limit. Brett and Galla39 also derive the dCLE by first fixing N and then sending Δ → 0. However, the two limits, Δ → 0 and N → ∞, cannot be taken independently; this is a common problem with heuristic derivations of stochastic differential equations, even in the absence of delay.46 The time discretization, Δ, can be thought of as a sampling frequency, while the system size, N, determines the rate at which reactions fire. If N becomes too large for a given Δ, then the number of reactions that fire within [t, t + Δ] no longer follows a Poisson distribution with mean dependent only on the state of the system at time t. On the other hand, if N is too small for a given Δ, then the Poisson distribution cannot be approximated by a normal distribution. In order to rigorously derive the Langevin approximation and estimate the distance between the dBD and dCLE processes, we will have to take a careful limit by relating Δ to N (with Δ → 0 as N → ∞). We describe the proper scaling in Sec. 3.

Applied to the transcriptional cascade, our main mathematical result (Theorem 1 in Sec. 3) provides an explicit bound on the probability that the dCLE trajectory deviates from a narrow tube around the dBD trajectory (provided Δ is correctly scaled with respect to N).

In the previous example, the limiting deterministic system has a fixed point. Time series for the stochastic system therefore stay within a small neighborhood of this fixed point (see inset, Fig. 4). In the next example, we show that the dCLE approximation remains excellent even when the limiting deterministic dynamics are non-trivial. We consider a degrade-and-fire oscillator for which the limiting deterministic system has a limit cycle. The dCLE correctly captures the peak height and the inter-peak times for the dSSA realization of the degrade and fire oscillator, in addition to statistics such as the mean and variance. The approximation does not break down at small instantaneous protein numbers. Indeed, the mathematical theory developed in this work makes an important point: protein numbers at any particular time do not limit the quality of the dCLE approximation (in the presence of delay, or otherwise). Instead, the quality of the dCLE approximation depends on the latent parameter N. Theorem 1 makes this more precise: if one fixes the allowable error ɛ in the approximation of the dBD process by the dCLE process, then the time T during which the approximation error stays smaller than ɛ increases with N.

In the analysis of the previous example, and in the rest of this work, we make an assumption: There exists an integer N which describes the characteristic number of proteins in the system and all protein numbers and propensity functions can be scaled by this characteristic number. Because of this assumption, our setting is different from that of Gillespie,47 who used the term chemical Langevin equation to differentiate his approximating Langevin equation from previous work such as that of van Kampen.48 While we assume the existence of a system size parameter, we obtain the approximating Langevin equation without reference to a chemical master equation.

Degrade and fire oscillator

The degrade and fire oscillator depicted schematically in Figure 3c consists of a single autorepressive gene and corresponds to the reaction network

μψ(X)XγX,X+Eη(X)E.

The production rate ψ(X) is given by ψ(X) = Nf(X/N), where f is the propensity function

f(x)=α1+x/C14;

the enzymatic degradation rate η(X) is given by η(X) = Ng(X/N). Here, g(x) = Vmaxx/(K + x); K is the Michaelis-Menten constant, Vmax the maximal enzymatic degradation rate, and γ the dilution rate coefficient. In the thermodynamic limit, the system is modeled by the delay differential equation

dxdt=0τ0α1+x(ts)C14dμ(s)γx(t)Vmaxx(t)K+x(t). (9)

As before, x(t) denotes the concentration of protein X. We model the formation of functional repressor protein using distributed delay (described by the probability measure μ); this delayed negative feedback can induce oscillations.5 Figure 5a depicts a sample realization of the stochastic version of the degrade and fire oscillator (the finite system size regime) generated by dSSA.

Figure 5.

Figure 5

Comparison of dSSA results to dSDE approximations for the degrade and fire oscillator. (a) depicts a stochastic realization of the oscillator generated by dSSA. We compare dSSA statistics (black dots in (b)–(e)) to those generated by the dCLE approximation given by Eq. 10 (black curves). We also show results for the dSDE approximation given by Eq. 11, obtained by removing delay from the diffusion term in Eq. 10 (red curves). At system size N = 50, the dCLE approximation given by Eq. 10 closely matches dSSA with respect to spike height distribution (b) and interspike interval distribution (c). In contrast, removing delay from the diffusion term results in a poor approximation of these distributions, as shown by the sizable shifts affecting the red curves. (d) and (e) illustrate mean repressor protein level and repressor protein variance, respectively, as functions of system size N. Equation 10 provides a good approximation for all simulated values of N while the performance of Eq. 11 improves as N increases. The quantity P represents protein number, not protein concentration. Parameter values are α = 20.8, C1 = 0.04, β = ln (2), Vmax = 5.55, γ0 = 0.01, μ = δ0.14. A soft boundary was added at 0 to ensure positivity.

The dCLE approximation is in this case given by

dxt=0τ0f(xts)dμ(s)γxtg(xt)dt+1N0τ0f(xts)dμ(s)+γxt+g(xt)12dWt. (10)

Figure 5 illustrates that Eq. 10 provides a good approximation of the dBD dynamics, even when system size is relatively small. At system size N = 50, the spike height distribution and interspike interval distribution obtained using the dSSA (black dots in Figures 5b, 5c) are nearly indistinguishable from those obtained using Eq. 10 (black curves in Figures 5b, 5c). Further, we see a close match with respect to mean repressor protein level and repressor protein variance across a range of system sizes (Figures 5d, 5e).

Interestingly, the dCLE approximation is very good even though the protein number approaches zero during part of the oscillation. This illustrates a central feature of the theory: the quality of the dCLE approximation is a function of a latent parameter N, not of the number of molecules present at any given time.

The exact form of the diffusion term is crucial to the accuracy of the dCLE approximation. If we remove delay from the diffusion term in Eq. 10, we obtain

dxt=0τ0f(xts)dμ(s)γxtg(xt)dt+1Nf(xt)+γxt+g(xt)1/2dWt. (11)

At system size N = 50 (red curves in Figures 5b, 5c), dSDE 11 produces dramatically different results from those generated by the correct dCLE approximation. The performance of Eq. 11 improves as N increases (Figures 5d, 5e). This is expected, as both Eqs. 10, 11 converge weakly to Eq. 9 as N → ∞.

Metastable systems

Understanding metastability in stochastic systems is of fundamental importance in the study of biological switches.17, 18, 19, 20, 21, 22, 23, 24, 25, 26 While metastability is well understood mathematically in the absence of delay, understanding the impact of delay on metastability remains a major theoretical and computational challenge.27, 49, 50 We examine two canonical examples to show that for certain systems, the dCLE can be used to study the impact of delay on metastability: a positive feedback circuit and a co-repressive genetic toggle switch.

We offer a cautionary note before we proceed with the examples. Even in the Markovian context, SDE and Fokker-Planck modeling does not always accurately capture large deviations statistics associated with an underlying spatially discrete stochastic process. Existing work for systems without delay (see, e.g., Refs. 51, 52, 53) illustrates the delicate relationship between continuum and discrete treatments of hitting times, extinction times, and the like. Determining when the dCLE accurately approximates the large deviations statistics of the underlying dBD process is an open problem.

Single species positive feedback circuit

The simplest metastable system consists of a single protein that drives its own production (Figure 3d). The chemical reaction network is given by

μψ(X)XγX

with ψ(X) = Nf(X/N) for the propensity

f(x)=α+βxbcb+xb.

In the thermodynamic limit, the dynamics of this model are described by the DDE

dxt=0τ0α+βx(ts)bcb+x(ts)bdμ(s)γx(t)dt. (12)

Here, x represents protein concentration and b is the Hill coefficient. In the thermodynamic limit, there are two stable stationary states, xl and xh, as well as an unstable stationary state xs. These states satisfy xl < xs < xh.

In the stochastic (finite N) regime, the stationary states xl and xh become metastable. We simulate the metastable dynamics using dSSA and the dCLE approximation 15 given in this case by

dxt=0τ0f(xts)dμ(s)γxtdt+1N0τ0f(xts)dμ(s)+γxt12dWt. (13)

Figure 6 displays hitting time distributions for the dSSA simulations (black curves) and Eq. 13 (blue curves). A hitting time is defined as follows: We choose neighborhoods (xl − δl, xl + δl) and (xs − δs, xs + δs) of xl and xs, respectively. We start the clock when a trajectory enters (xl − δl, xl + δl) from the right. The clock is stopped when that trajectory first enters (xs − δs, xs + δs). A hitting time is the amount of time that elapses from clock start to clock stop. We see that for no delay (Figure 6a) and fixed delay τ = 1 (Fig. 6b), the dCLE approximation accurately captures the hitting time distributions for Hill coefficients increasing from 15 to 25. Hence, the dCLE approximation accurately captures the rare events associated with a spatially discrete delay stochastic process. This is significant because dSDEs are more amenable to large deviations theoretical analysis than their spatially discrete counterparts.

Figure 6.

Figure 6

Hitting time distributions for the positive feedback circuit. Black curves represent dSSA data; blue curves represent data from the dCLE approximation. (a) corresponds to the Markov case τ = 0 and a Hill coefficient of b = 15. (b) corresponds to a delay τ = 0.75 and b = 25. The tail of the hitting times distribution becomes longer with both increasing delay and increasing Hill coefficient b. The dCLE captures the lengthening of the tail due to both effects. Parameter values are α = 0.35, β = 0.15, c = 0.615, γ = ln (2), μ = δτ, N = 1000.

Hitting times increase dramatically as the delay increases from 0 to 1, in accord with earlier analysis.27 A dramatic increase is also seen as the Hill coefficient increases. This is due to the fact that the potential wells around xl and xh deepen as b increases.

Co-repressive toggle switch

The co-repressive toggle switch (Figure 3e) is a two-dimensional metastable system described in the thermodynamic limit by the DDEs

dxt=0τ0β1+y(ts)2/kdμ1(s)γxdt, (14a)
dyt=0τ0β1+x(ts)2/kdμ1(s)γydt. (14b)

The measure μ1 describes the delay associated with production in this symmetric circuit. Equation 14 has two stable stationary points (xl, yh) and (xh, yl) separated by the unstable manifold associated with a saddle equilibrium point (xs, ys). In the stochastic (finite system size) regime, the stable stationary points become metastable. In this regime, a typical trajectory spends most of its time near the metastable points, occasionally moving between them.

Figure 7 displays density plots corresponding to trajectories that either successfully transition between metastable states (four panels on the right) or make failed transition attempts (four panels on the left). Even for the moderate system size N = 30, the density plots generated by dSSA (top four panels) closely match those generated by the dCLE approximation in Eq. 15 (bottom four panels).

Figure 7.

Figure 7

Conditional density plots for the co-repressive toggle switch. The two left panels illustrate trajectories that leave a neighborhood of the stable point (xh, yl) and fall back into the same neighborhood before transitioning into a neighborhood of the stable point (xl, yh); displayed densities are conditioned on this set of trajectories. The two right panels illustrate trajectories that leave a neighborhood of (xh, yl) and transition to a neighborhood of (xl, yh) before falling back into the first neighborhood; displayed densities are conditioned on the set of such trajectories. Top panels correspond to dSSA; bottom panels correspond to dCLE. Cartoons of typical trajectories corresponding to failed transitions (left) and successful transitions (right) are shown for the dCLE process. Plots are shown for N = 30. The values of the other parameters are β = 0.73, k = 0.05, γ = ln (2).

Given the importance of rare events throughout stochastic dynamics, it is encouraging that the dCLE approximation captures well the dynamics of two popular biochemical motifs of metastability.

MAIN MATHEMATICAL RESULTS

The simulations thus far described suggest that the dCLE closely approximates the dBD process provided that the time increment Δ scales properly with N. Here, we mathematically quantify the quality of the approximation and we determine the optimal scaling of Δ.

The general biochemical setup is as follows. Consider a reaction network of D biochemical species and M possible reactions. We are interested in describing the dynamics as a function of a latent system parameter N, the system size. The state of the system at time t is described by the vector BN(t)ZD that lists the number of molecules of each species present at time t.

Each reaction Rj is described by the following:

  • (a)

    A propensity function fj:RDR+. The firing rate of reaction Rj is given by Nfj(BN(t)/N). It is reasonable to use propensity functions to model firing rates in well-mixed chemical systems.47 We assume the propensities satisfy mild technical conditions;44 these conditions are essentially always satisfied by chemical reaction networks. Examples include reactions of first and second order, Michaelis-Menten kinetics, and propensities of Hill type.

  • (b)

    A state-change vector vjZD. The vector vj describes the change in the number of molecules of each species that results from the completion of a reaction of type j.

  • (c)

    A probability measure μj that models the delay between the initiation and completion of a reaction of type j. We allow the delay to be distributed (μj has a density) or fixed (μj is a Dirac mass δτ at a fixed delay τ). For instantaneous reactions, μj = δ0.

In order for the dCLE to closely approximate the dBD process, the time step Δ must be small enough so that the propensity functions remain essentially constant on time intervals of length Δ and large enough so that many reactions fire on such intervals. Gillespie47 calls such a Δ a macroscopic infinitesimal time for the reaction network. The primary mathematical result of this paper provides a scaling of Δ as a function of N for which Δ is rigorously shown to be a macroscopic infinitesimal time for the reaction network. Further, we quantify the quality of the dCLE approximation: with high probability, the solution trajectory of the dCLE will remain inside a narrow tube around the solution trajectory of the dBD process. We state the main result precisely and then discuss implications.

Since BN(t) is a vector of molecule numbers, we rescale by N to allow for a direct comparison of the dBD process and the dCLE. The correct dCLE approximation of (bN(t)) = (BN(t)/N) is given for 1 ⩽ kD by

dxk=j=1M0τ0vjkfj(x(ts))dμj(s)dt+1N(ΣdW)k, (15)

where W is a D-dimensional vector of independent standard Brownian motions and Σ2 is given by

Σlm2=j=1Mvjlvjm0τ0fj(x(ts))dμj(s).

Let ℓN denote the stochastic process defined by 15.

Theorem 1.For any fixed timeT > 0, Δ(N) = N−1/4is a macroscopic infinitesimal time on [0, T]: There exist constantsK1andK2, depending only onTand the chemical reaction network, such that

Pmaxt[0,T]bN(t)N(t)>K1N1/8K2e13N1/4. (16)

Theorem 1 has several interesting corollaries. First, for every continuous observable Ψ (real-valued function of trajectories of the processes), the difference between the expected value of Ψ with respect to the dBD and dCLE processes converges to zero as N → ∞. This implies in particular that the distance between every moment of the processes converges to zero as N → ∞. Second, since the right side of Eq. 16 is summable with respect to N, it follows from the Borel-Cantelli lemma that a trajectory of the dCLE process will, with probability one, lie within the tube of radius K1/N1/8 around the corresponding trajectory of the dBD process for all but finitely many values of N.

One would use Theorem 1 in a concrete situation by first fixing a desirable probability bound, say 1/1000. Using 16, one would then solve for the radius of the tube around the trajectory of the dBD process for which the dCLE trajectory will remain in this tube up to time T with probability at least 999/1000.

We prove Theorem 1 in three steps.44 First, we prove that the family of measures on trajectories associated with bN is a tight family, roughly meaning that they are all approximately supported on a common compact set of trajectories. Second, we prove a version of 16 for “discrete-time tubes” wherein the maximum on the left side of 16 is taken not over all t ∈ [0, T] but rather over only 0, Δ, 2Δ, …, (T/Δ)Δ. Third, we study the size of fluctuations of the dBD and dCLE processes on time intervals of length Δ in order to upgrade the discrete-time tube estimate to the continuous-time tube estimate 16. We determine the optimal scaling of Δ during this final step. At the beginning of the proof, we assume that Δ scales with N as Δ(N) = N−α and then look for the optimal value of α. In order to both prove the discrete-time tube estimate and control fluctuations of the dCLE process on intervals of length Δ, we scale the radius r(N) of the continuous-time tube as

r(N)=C1Nα/2+C2N(13α)/2.

The first term in the sum measures local fluctuations of the dCLE while the second comes from the proof of the discrete-time tube estimate. In order to minimize r(N), it is asymptotically optimal to have α = 1 − 3α, giving α = 1/4.

We conclude this section with a cautionary note about what the main theorem does not say. Theorem 1 is formulated for fixed time intervals: one must first fix T and only then allow N to vary. As a consequence, this theorem does not apply when one scales T with N such that T → ∞ as N → ∞, as one does in the large deviations context.

DISCUSSION

SDEs are one of our main tools for modeling noisy processes in nature. Interactions between the components of a system or network are frequently not instantaneous. It is therefore natural to include such delay into corresponding stochastic models. However, the relationship between dSDEs and the processes they model has not been fully established.

Delay stochastic differential equations have previously been formally derived from the dCME. Unlike the chemical master equation, however, the dCME is not closed; this complicates the derivation of dSDE approximations. Closure in this context means the following: Let P(n, t) denote the probability that the stochastic system is in state n at time t. The dCME expresses the time derivative of P(n, t) in terms of joint probabilities of the form P(j, t; k, t − τ) – the probability that the system is in state j at time t and was in state k at time t − τ, where τ is the delay. The one-point probability distribution P(·, t) is therefore expressed in terms of two-point joint distributions, resulting in a system that is not closed. Timescale separation assumptions have been used to close the dCME. If the delay time is large compared to the other timescales in the system, one may assume that events that occur at time t − τ are decoupled from those that occur at time t and close the dCME2, 54 by assuming the joint probabilities may be written as products

P(j,t;k,tτ)=P(j,t)P(k,tτ).

Having closed the dCME, one may then derive dSDE approximations54 as well as useful expressions for autocorrelations and power spectra.2

Approximations of dSDE type have also been derived using system size expansions such as van Kampen expansions48, 55 and Kramers-Moyal expansions for both fixed delay56 and distributed delay.57

Brett and Galla39 use the path integral formalism of Martin, Siggia, Rose, Janssen, and de Dominicis to derive the dCLE without relying on the dCME. Using this formalism, a moment generating functional may be expressed in terms of the system size parameter N and the sampling rate Δ. In the continuous-time limit, Δ → 0, the dCLE may be inferred from the moment generating functional. However, the Brett and Galla derivation has some limitations. First, the Δ → 0 limit cannot be taken without simultaneously letting N → ∞. Intuitively, this is because as Δ → 0, the Gaussian approximation to the Poisson distribution with mean NλΔ breaks down unless the parameter Nλ simultaneously diverges to infinity. Second, the derivation gives no quantitative information about the distance between the dCLE and the original dBD process.

Here, we addressed these shortcomings. We proved rigorously that the dBD process can be approximated by a class of Gaussian processes that includes the dCLE. In particular, we established that for most biophysically relevant propensity functions, the dCLE process will approximate all moments of the dBD process. Our proof includes bounds on the quality of the approximation in terms of the time T for which the approximation is desired to hold and the characteristic system size N (see Theorem 1). The error bounds also indicate that the quality of the dCLE approximation worsens with increasing upper bounds on the reaction propensity functions and state-change vectors. Physically, this means that high reaction rates and reactions that cause large changes in the protein populations are detrimental to the quality of the dCLE approximation.

The dCLE is one of many Gaussian processes that approximate the dBD process. Among all Gaussian approximations with noise components that scale as 1/N, the dCLE is optimal because it is the only such approximation that exactly matches the first and second moments of the dBD process. We formally justify this assertion in the supplementary material44 using characteristic functions. As our simulations of the degrade and fire oscillator demonstrate, the dCLE can significantly outperform other Gaussian approximations at moderate system sizes.

Nevertheless, the quantitative tube estimates in Theorem 1 apply to any Gaussian approximation of the dBD process provided the noise scales as 1/N. This is significant because it is often advantageous to use linear noise approximations of the dCLE. Delay appears in the drift component of a linear noise approximation but not in the diffusion component. Linear noise approximations are therefore easier to analyze than their dCLE counterparts. In particular, elements of the theory of large deviations for Markovian systems can be extended to SDEs with delay in the drift.58

For metastable systems, our simulations indicate that the dCLE may capture both temporal information (such as hitting times for the positive feedback model; see Fig. 6) and spatial information (such as densities for trajectories corresponding to failed and successful transitions; see Fig. 7) for some examples. While SDE approximations (without delay) do not in general capture the dynamics in the presence of metastability even for Markov systems,51, 52, 53 our work suggests that dCLE approximations may be used, on a case by case basis, to study rare events for metastable biochemical systems.

We have made a number of simplifying assumptions in order to be able to prove the existence of a macroscopic infinitesimal time postulated by Gillespie.47 In particular, we assumed the existence of a latent system size parameter N, in terms of which this macroscopic infinitesimal time Δ can be expressed. However, for many biochemical systems, it is often unclear what the system size is, and in this case, a more careful analysis will be required to rigorously justify the use of delay stochastic differential equations to approximate the dynamics of the underlying discrete systems. Further complications can arise when the system size itself changes as a function of time or if there is a scale separation between the fast and slow components of the system. In the non-delayed case, hybrid systems59, 60, 61 are often used to address these challenges. Developing a mathematical theory for hybrid systems in the presence of delays is an important future direction.

We have shown that the dCLE provides an accurate approximation of a number of stochastic processes. Although we chose gene regulatory networks in our examples, the theory is applicable to general birth-death processes with delayed events. SDEs, and the chemical Langevin equation in particular, are fundamental in modeling and understanding the behavior of natural and engineered systems. We therefore expect that the dCLE will be widely applicable when delays impact system dynamics.

ACKNOWLEDGMENTS

This work was funded by the National Institutes of Health (NIH), through the joint National Science Foundation (NSF)/National Institute of General Medical Sciences Mathematical Biology Program Grant No. R01GM104974 (to M.R.B., K.J., and W.O.), and the Robert A. Welch Foundation Grant No. C-1729 (to M.R.B.).

References

  1. Amir A., Meshner S., Beatus T., and Stavans J., Mol. Microbiol. 76, 428 (2010). 10.1111/j.1365-2958.2010.07111.x [DOI] [PubMed] [Google Scholar]
  2. Bratsun D., Volfson D., Tsimring L. S., and Hasty J., Proc. Natl. Acad. Sci. U.S.A. 102, 14593 (2005). 10.1073/pnas.0503858102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Goodwin B. C., Adv. Enzyme Regul. 3, 425 (1965). 10.1016/0065-2571(65)90067-1 [DOI] [PubMed] [Google Scholar]
  4. Lewis J., Curr. Biol. 13, 1398 (2003). 10.1016/S0960-9822(03)00534-7 [DOI] [PubMed] [Google Scholar]
  5. Mather W., Bennett M. R., Hasty J., and Tsimring L. S., Phys. Rev. Lett. 102, 068105 (2009). 10.1103/PhysRevLett.102.068105 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Monk N. A., Curr. Biol. 13, 1409 (2003). 10.1016/S0960-9822(03)00494-9 [DOI] [PubMed] [Google Scholar]
  7. Smolen P., Baxter D., and Byrne J., Am. J. Physiol. 277, C777 (1999). [DOI] [PubMed] [Google Scholar]
  8. Stricker J., Cookson S., Bennett M. R., Mather W. H., Tsimring L. S., and Hasty J., Nature (London) 456, 516 (2008). 10.1038/nature07389 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Tigges M., Marquez-Lago T. T., Stelling J., and Fussenegger M., Nature (London) 457, 309 (2009). 10.1038/nature07616 [DOI] [PubMed] [Google Scholar]
  10. Smolen P., Baxter D. A., and Byrne J. H., Biophys. J. 83, 2349 (2002). 10.1016/S0006-3495(02)75249-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Sriram K. and Gopinathan M., J. Theor. Biol. 231, 23 (2004). 10.1016/j.jtbi.2004.04.006 [DOI] [PubMed] [Google Scholar]
  12. Ukai-Tadenuma M., Yamada R. G., Xu H., Ripperger J. A., Liu A. C., and Ueda H. R., Cell 144, 268 (2011). 10.1016/j.cell.2010.12.019 [DOI] [PubMed] [Google Scholar]
  13. Grönlund A., Lötstedt P., and Elf J., Proc. Natl. Acad. Sci. U.S.A. 107, 8171 (2010). 10.1073/pnas.0913317107 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Grönlund A., Lötstedt P., and Elf J., Nat. Commun. 2, 419 (2011). 10.1038/ncomms1422 [DOI] [PubMed] [Google Scholar]
  15. Maithreye R., Sarkar R. R., Parnaik V. K., and Sinha S., PLoS ONE 3, e2972 (2008). 10.1371/journal.pone.0002972 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Scott M., Phys. Rev. E 80, 031129 (2009). 10.1103/PhysRevE.80.031129 [DOI] [PubMed] [Google Scholar]
  17. Wu M., Su R.-Q., Li X., Ellis T., Lai Y.-C., and Wang X., Proc. Natl. Acad. Sci. U.S.A. 110, 10610–10615 (2013). 10.1073/pnas.1305423110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Hong T., Xing J., Li L., and Tyson J. J., BMC Syst. Biol. 6, 66 (2012). 10.1186/1752-0509-6-66 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. He E., Kapuy O., Oliveira R. A., Uhlmann F., Tyson J. J., and Novák B., Proc. Natl. Acad. Sci. U.S.A. 108, 10016 (2011). 10.1073/pnas.1102106108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Ozbudak E. M., Thattai M., Lim H. N., Shraiman B. I., and Van Oudenaarden A., Nature (London) 427, 737 (2004). 10.1038/nature02298 [DOI] [PubMed] [Google Scholar]
  21. Kepler T. B. and Elston T. C., Biophys. J. 81, 3116 (2001). 10.1016/S0006-3495(01)75949-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Aurell E. and Sneppen K., Phys. Rev. Lett. 88, 048101 (2002). 10.1103/PhysRevLett.88.048101 [DOI] [PubMed] [Google Scholar]
  23. Warren P. B. and ten Wolde P. R., J. Phys. Chem. B 109, 6812 (2005). 10.1021/jp045523y [DOI] [PubMed] [Google Scholar]
  24. Balaban N. Q., Merrin J., Chait R., Kowalik L., and Leibler S., Science 305, 1622 (2004). 10.1126/science.1099390 [DOI] [PubMed] [Google Scholar]
  25. Gardner T. S., Cantor C. R., and Collins J. J., Nature (London) 403, 339 (2000). 10.1038/35002131 [DOI] [PubMed] [Google Scholar]
  26. Nevozhay D., Adams R. M., Van Itallie E., Bennett M. R., and Balázsi G., PLoS Comput. Biol. 8, e1002480 (2012). 10.1371/journal.pcbi.1002480 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Gupta C., López J. M., Ott W., Josić K. c. v., and Bennett M. R., Phys. Rev. Lett. 111, 058104 (2013). 10.1103/PhysRevLett.111.058104 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Fischer M. and Imkeller P., Stoch. Dyn. 5, 247 (2005). 10.1142/S0219493705001389 [DOI] [Google Scholar]
  29. Fischer M. and Imkeller P., Stoch. Anal. Appl. 24, 135 (2006). 10.1080/07362990500397715 [DOI] [Google Scholar]
  30. Josić K., López J. M., Ott W., Shiau L., and Bennett M. R., PLoS Comput. Biol. 7, e1002264 (2011). 10.1371/journal.pcbi.1002264 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Gillespie D., J. Phys. Chem. 81, 2340 (1977). 10.1021/j100540a008 [DOI] [Google Scholar]
  32. Higham D. J., SIAM Rev. 50, 347 (2008). 10.1137/060666457 [DOI] [Google Scholar]
  33. Cao Y., Gillespie D., and Petzold L., J. Chem. Phys. 124, 044109 (2006). 10.1063/1.2159468 [DOI] [PubMed] [Google Scholar]
  34. Cao Y., Gillespie D., and Petzold L., J. Chem. Phys. 126, 224101 (2007). 10.1063/1.2745299 [DOI] [PubMed] [Google Scholar]
  35. Gillespie D., J. Chem. Phys. 115, 1716 (2001). 10.1063/1.1378322 [DOI] [Google Scholar]
  36. Xu Z. and Cai X., J. Chem. Phys. 128, 154112 (2008). 10.1063/1.2894479 [DOI] [PubMed] [Google Scholar]
  37. Cai X. and Xu Z., J. Chem. Phys. 126, 074102 (2007). 10.1063/1.2436869 [DOI] [PubMed] [Google Scholar]
  38. Schlicht R. and Winkler G., J. Math. Biol. 57, 613 (2008). 10.1007/s00285-008-0178-y [DOI] [PubMed] [Google Scholar]
  39. Brett T. and Galla T., Phys. Rev. Lett. 110, 250601 (2013). 10.1103/PhysRevLett.110.250601 [DOI] [PubMed] [Google Scholar]
  40. Gillespie D., Hellander A., and Petzold L., J. Chem. Phys. 138, 170901 (2013). 10.1063/1.4801941 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Barrio M., Burrage K., Leier A., and Tian T., PLoS Comput. Biol. 2, e117 (2006). 10.1371/journal.pcbi.0020117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Gibson M. A. and Bruck J., J. Phys. Chem. A 104, 1876 (2000). 10.1021/jp993732q [DOI] [Google Scholar]
  43. Bayati B., Chatelain P., and Koumoutsakos P., J. Comput. Phys. 228, 5908 (2009). 10.1016/j.jcp.2009.05.004 [DOI] [Google Scholar]
  44. See supplementary material at http://dx.doi.org/10.1063/1.4878662 for proofs and precise statements of the mathematical results presented in this paper.
  45. While such an assumption is not needed for the dSSA, it is biochemically realistic and used in our mathematical analysis.
  46. Ethier S. N. and Kurtz T. G., Markov Processes: Characterization and Convergence, Wiley Series in Probability and Statistics (John Wiley & Sons, Inc., New York, 1986), pp. x+534. [Google Scholar]
  47. Gillespie D. T., J. Chem. Phys. 113, 297 (2000). 10.1063/1.481811 [DOI] [Google Scholar]
  48. van Kampen N. G., Can. J. Phys. 39, 551 (1961). 10.1139/p61-056 [DOI] [PubMed] [Google Scholar]
  49. Masoller C., Phys. Rev. Lett. 90, 020601 (2003). 10.1103/PhysRevLett.90.020601 [DOI] [PubMed] [Google Scholar]
  50. Tsimring L. S. and Pikovsky A., Phys. Rev. Lett. 87, 250602 (2001). 10.1103/PhysRevLett.87.250602 [DOI] [PubMed] [Google Scholar]
  51. Doering C. R., Sargsyan K. V., and Sander L. M., Multiscale Model. Simul. 3, 283 (2005). 10.1137/030602800 [DOI] [Google Scholar]
  52. Hanggi P., Grabert H., Talkner P., and Thomas H., Phys. Rev. A 29(3), 371 (1984). 10.1103/PhysRevA.29.371 [DOI] [Google Scholar]
  53. Kushner H. J., SIAM J. Appl. Math. 44, 160 (1984). 10.1137/0144012 [DOI] [Google Scholar]
  54. Tian T., Burrage K., Burrage P. M., and Carletti M., J. Comput. Appl. Math. 205, 696 (2007), special issue on evolutionary problems. 10.1016/j.cam.2006.02.063 [DOI] [Google Scholar]
  55. van Kampen N. G., Stochastic Processes in Physics and Chemistry (Elsevier, 1992), Vol. 1. [Google Scholar]
  56. Galla T., Phys. Rev. E 80, 021909 (2009). 10.1103/PhysRevE.80.021909 [DOI] [PubMed] [Google Scholar]
  57. Lafuerza L. F. and Toral R., Phys. Rev. E 84, 021128 (2011). 10.1103/PhysRevE.84.021128 [DOI] [PubMed] [Google Scholar]
  58. Schwartz I. B., Carr T. W., Billings L., and Dykman M., preprint arXiv:1207.7278 (2012).
  59. Haseltine E. L. and Rawlings J. B., J. Chem. Phys. 117, 6959 (2002). 10.1063/1.1505860 [DOI] [Google Scholar]
  60. Salis H. and Kaznessis Y., J. Chem. Phys. 122, 054103 (2005). 10.1063/1.1835951 [DOI] [PubMed] [Google Scholar]
  61. Salis H., Sotiropoulos V., and Kaznessis Y. N., BMC Bioinfo. 7, 93 (2006). 10.1186/1471-2105-7-93 [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from The Journal of Chemical Physics are provided here courtesy of American Institute of Physics

RESOURCES