The stochastic quasi-steady-state assumption: Reducing the model but not the noise

Rishi Srivastava; Eric L Haseltine; Ethan Mastny; James B Rawlings

doi:10.1063/1.3580292

. 2011 Apr 20;134(15):154109. doi: 10.1063/1.3580292

The stochastic quasi-steady-state assumption: Reducing the model but not the noise

Rishi Srivastava ^1,^a), Eric L Haseltine ^2,^b), Ethan Mastny ^1,^c), James B Rawlings ^1,^d)

PMCID: PMC3094464 PMID: 21513377

Abstract

Highly reactive species at small copy numbers play an important role in many biological reaction networks. We have described previously how these species can be removed from reaction networks using stochastic quasi-steady-state singular perturbation analysis (sQSPA). In this paper we apply sQSPA to three published biological models: the pap operon regulation, a biochemical oscillator, and an intracellular viral infection. These examples demonstrate three different potential benefits of sQSPA. First, rare state probabilities can be accurately estimated from simulation. Second, the method typically results in fewer and better scaled parameters that can be more readily estimated from experiments. Finally, the simulation time can be significantly reduced without sacrificing the accuracy of the solution.

INTRODUCTION

Intrinsic noise caused by discrete numbers of reacting molecules is an inherent feature of many cellular processes.¹^,²^,³^,⁴^,⁵ How individual cells control and exploit intrinsic noise has been an area of active research for the past decade. One useful tool for understanding this noise is the chemical master equation, which models a reacting system as a discrete Markov process in which jumps from one discrete state to another represent chemical reactions. The solution of the master equation is computationally tractable only for simple systems. Rather, approximation techniques such as finite state projections⁶ or stochastic simulation algorithm (SSA)⁷^,⁸ are employed to reconstruct the probability distribution and its statistics (usually the mean and variance). Applying these techniques to solve models of biological processes leads to significant improvements in understanding of intrinsic noise and its effect on cellular behavior. For example, Arkin and McAdams showed that in the bacteriophage λ infection, intrinsic noise could bifurcate identically infected cells to either dormant (lysogenic) or reproductive (lytic) states.⁹ Vilar et al. demonstrated that biochemical oscillators could still reliably function even in the presence of such noise.¹⁰ Weinberger et al. showed (both computationally and experimentally) that noise could generate transient bursts of activity in HIV-1 Tat transactivation, which contributes to latency in HIV-1.¹¹ Hensel et al. performed computational studies to illustrate the dynamics of an intracellular infection process demonstrating coupling between a highly reactive species and a rapidly increasing species.¹²

While these works are clearly successes of the modeling community in understanding the intricacies of intrinsic noise, the reality is that stochastic models of biologically relevant systems remain difficult to construct and even more difficult to understand. Many of these difficulties stem primarily from the significant cost of solving these models, particularly when using SSA simulation; see Gillespie¹³ for a recent review of progress made on this front. Moreover, for many systems biology models, the available experimental measurements are not sufficient to confidently estimate the parameters of the model.¹⁴ While these models can still make well-constrained predictions, the unnecessarily large number of parameters in these models increases the cost of calculating parameter sensitivities, i.e., how the model predictions change with perturbations to the parameters. Often these large number of parameters lead to model stiffness, a phenomenon in which significant computational expense is incurred in simulating some subset of the reactions in the model.

Stiffness can arise when some set of reversible reactions occur much more frequently than the remaining reactions, and the affected species typically remain at reasonable (nonzero) numbers. Such stiffness is analogous to the deterministic concept of reaction equilibrium and has been the subject of several recent studies.¹⁵^,¹⁶^,¹⁷^,¹⁸^,¹⁹^,²⁰^,²¹ A separate but equally important source of stiffness results from highly reactive species. This phenomenon results when reaction intermediates, known as quasi-steady-state (QSS) species, react so rapidly that their average number throughout the simulation is nearly zero or their average number is much smaller than the other species (the reactants and products).²²^,²³ Rao and Arkin hypothesized that the stochastic quasi-steady-state reduction should lead to the same reduced model as the deterministic quasi-steady-state reduction. Mastny et al.,²³ however, presented counterexamples in which the reduced models from deterministic and stochastic quasi-steady-state reduction are different. Using singular perturbation analysis, Mastny et al.²³ demonstrated that the QSS species can be removed from the master equation to yield a reduced master equation for the remaining species. They termed this method stochastic QSS singular perturbation analysis (sQSPA). Inspection of the reduced master equation yields reduced reaction expressions, reduced stoichiometries and fewer parameters in comparison to the original master equation. Notably, the reduced reaction expressions do not always correspond to those obtained by using the traditional deterministic quasi-steady-state analysis. Another source of stiffness can occur when there are two distinct type of species present in the system: a highly reactive species and a rapidly increasing species. This stiffness causes small time steps in SSA and leads to significant slow down of SSA simulations. For this stiffness, we use a variant of sQSPA, stochastic QSS singular perturbation analysis with Ω expansion (sQSPA-Ω).

In this paper, we demonstrate the utility of sQSPA and sQSPA-Ω for simulating and understanding stochastic reaction models. We choose two previously published models from the literature that appear to have highly-reactive, or QSSA species: the pap operon regulation⁶ and a biochemical oscillator.¹⁰ We also consider a simple system, a fast fluctuation, which contains a highly reactive species coupled to a rapidly increasing species. This coupling of highly reactive species with rapidly increasing species causes a large computational load for SSA simulations.¹² By reducing these three models using sQSPA and sQSPA-Ω, we show that we can lower the model complexity without altering the inherent noise characteristics of the full model. The reduced model for the pap operon regulation is also useful in estimating a rare state probability from simulation. This rare state probability is not accurately determined by either direct SSA simulation or Kuwahara and Mura's²⁴ recent general purpose method for estimating rare state probabilities in stochastic kinetic models. The reduced model for the biochemical oscillator leads to simplification in the parameter estimation problem as well as significant reduction in the simulation time. The reduced model for the fast fluctuation problem also significantly reduces the simulation time.

RESULTS

Example 1: Pap operon regulation.

Here we consider the gene state switch model of the Pyelonephritis-associated pili (Pap) regulatory network considered by Munsky et al.⁶ This model describes the states (g₁ to g₄) of the pap operon as a function of time. The schematic of four possible states and the mode of transition among them is shown in Fig. 1. The reaction stoichiometries and time invariant rates of transition (r₁ to r₈) are given in Table 1.

Schematic diagram of the Pap regulatory network. There are four possible states of the pap operon depending on the LRP-DNA binding.

Table 1.

Reaction stoichiometry and reaction rates for pap operon regulation.

Number	Reaction stoichiometry	Reaction rate (r_i)
1	g₁→g₂	100.
2	g₂→g₁	0.625
3	g₁→g₃	100.
4	g₃→g₁	1.033
5	g₂→g₄	0.99
6	g₄→g₂	1.033
7	g₃→g₄	0.99
8	g₄→g₃	0.625

Open in a new tab

The master equation for the system is

\begin{matrix} \frac{d P_{1}}{d t} & = - (r_{1} + r_{3}) P_{1} + r_{2} P_{2} + r_{4} P_{3}, \end{matrix}

(1)

\begin{matrix} \frac{d P_{2}}{d t} & = - (r_{2} + r_{5}) P_{2} + r_{1} P_{1} + r_{6} P_{4}, \end{matrix}

(2)

\begin{matrix} \frac{d P_{3}}{d t} & = - (r_{4} + r_{7}) P_{3} + r_{3} P_{1} + r_{8} P_{4}, \end{matrix}

(3)

\begin{matrix} \frac{d P_{4}}{d t} & = - (r_{6} + r_{8}) P_{4} + r_{5} P_{2} + r_{7} P_{3}, \end{matrix}

(4)

in which P_i: i = 1, 2, 3, 4 is the probability of state g_i and r_j: j = 1, 2, …, 8 is the rates of transition defined in Table 1. For the rates specified in Table 1, r₁ and r₃ are large compared to all other rates. We apply the previously described sQSPA reduction technique.²³ In the supporting information,²⁵ we show that Eqs. 1, 2, 3, 4 can be reduced appropriately for the two time regimes: the fast time scale and the slow time scale. The fast time scale regime occurs initially for a short period of time. During the fast time scale regime, probabilities of all the states directly affected by r₁ and r₃ change rapidly. The slow time scale follows this fast time scale regime. From Fig. 1, we can see that the only state that is not directly affected by r₁ and r₃ is state g₄.

On the fast time scale we denote the probability of state g_i as ${\hat{P}}_{i}$ and use the following power series expansion for ${\hat{P}}_{i}$

{\hat{P}}_{i} = {\hat{W}}_{i 0} + ε {\hat{W}}_{i 1} + ε^{2} {\hat{W}}_{i 2} + O (ε^{3}),

in which ${\hat{W}}_{i j}$ is the jth-order approximation of probability of g_i and ε is a small parameter. In the supporting information we show that for the fast time scale, the probability of different states is given by

\begin{matrix} {\hat{W}}_{10} & = e^{- τ}, \end{matrix}

(5)

\begin{matrix} {\hat{W}}_{20} & = K_{1} (1 - e^{- τ}), \end{matrix}

(6)

\begin{matrix} {\hat{W}}_{30} & = K_{3} (1 - e^{- τ}), \end{matrix}

(7)

\begin{matrix} {\hat{W}}_{40} & = 0, \end{matrix}

(8)

in which τ = t(r₁ + r₃) is the rescaled time. Equation 5 indicates that as τ tends to ∞, i.e., we approach the boundary between the fast time scale and the slow time scale regimes, the probability of state g₁ becomes small.

For the slow time scale we use following power series expansion for P_i

P_{i} = W_{i 0} + ε W_{i 1} + ε^{2} W_{i 2} + O (ε^{3}),

in which W_ij are the jth-order probabilities of state g_i: i = 1, 2, 3, 4. In the supporting information we show that in the slow time scale regime

\begin{matrix} W_{10} & = 0, \end{matrix}

(9)

\begin{matrix} \frac{d W_{20}}{d t} & = - [\tilde{r_{1}} + r_{5}] W_{20} + \tilde{r_{2}} W_{30} + r_{6} W_{40}, \end{matrix}

(10)

\begin{matrix} \frac{d W_{30}}{d t} & = - [\tilde{r_{2}} + r_{7}] W_{30} + \tilde{r_{1}} W_{20} + r_{8} W_{40}, \end{matrix}

(11)

\begin{matrix} \frac{d W_{40}}{d t} & = - [r_{6} + r_{8}] W_{40} + r_{5} W_{20} + r_{7} W_{30}, \end{matrix}

(12)

in which $\tilde{r_{1}} = r_{2} r_{3} ∕ (r_{1} + r_{3})$ and $\tilde{r_{2}} = r_{1} r_{4} ∕ (r_{1} + r_{3})$ are both O(1) constants. The reduced reaction mechanism in the slow time scale regime is shown in Fig. 2. Note that state g₁ is removed in the slow time scale regime. In Fig. 3, we show a comparison of probabilities of different states at time t = 10s as obtained from the master equation solution of the full model, Eqs. 1, 2, 3, 4 and from the O(1) approximation of the slow time scale solution of sQSPA, Eqs. 9, 10, 11, 12. We can see that the O(1) approximation of the slow time scale solution of sQSPA gives the probability of state g₁ as 0. This approximation is validated by the small value of the probability of state g₁ given by the master equation solution of the full model.

Reduced system in the slow time scale regime.

Comparison of full model and sQSPA slow time scale reduced model at t = 10 s.

Increasingly better approximations of the probability of state g₁ can be obtained by higher-order terms in the sQSPA reduction.²³ In the supporting information we show that in the slow time scale regime, a better approximation of state g₁ is given by

{\tilde{P}}_{1} = W_{10} + ε W_{11} = ε (r_{2} W_{20} + r_{4} W_{30}),

(13)

where ε = 1/(r₁ + r₃). Next we compare the probability estimate of state g₁ from the full model, Eqs. 1, 2, 3, 4 and sQSPA slow time scale reduced model, Eq. 13. The result is shown in the left side of Fig. 4. The comparison demonstrates that after a small initial time interval, the master equation solution of both the full model and the sQSPA slow time scale reduced model of state g₁ are in good agreement. Next we show that sQSPA is useful for rare state probability estimation. In the right side of Fig. 4, we compare the probability estimate of state g₁ from 5000 SSA simulations of the full model, Eqs. 1, 2, 3, 4 and from 5000 SSA simulations of sQSPA slow time scale reduced model, Eqs. 10, 11, 12. From simulations of the sQSPA slow time scale reduced model, we obtain values of W₂₀, W₃₀, and W₄₀. An estimate of rare state g₁ is then obtained from Eq. 13. The right side of Fig. 4 shows that the estimate of rare state g₁ is noisy and inaccurate from full model SSA simulations with 5000 samples, while this estimate is smooth and better with sQSPA slow time scale reduced model SSA simulations. Accurate estimation of rare state probabilities is a recurrent problem in stochastic simulation because this estimation requires large numbers of SSA simulations. The analysis of sQSPA enables these rare state probabilities to be determined from easily estimated probabilities of the nonrare states [e.g., use of Eq. 13].

A comparison of the full model and sQSPA reduced model. The left side shows the solution of the master equation for the full model and sQSPA-slow time scale reduced (sQSPA-stsr) model. The right side shows the estimate of the probability of state g₁ from full model SSA simulation and from sQSPA-stsr model SSA simulation.

It is also interesting to compare this analysis to general purpose methods for estimating rare state probabilities in stochastic kinetic models. Kuwahara and Mura²⁴ present a general method to calculate rare state probabilities using the idea of importance sampling. Their method was subsequently improved by Roh et al.²⁶ employing a state dependent importance sampling parameter. Kuwahara and Mura's method relies on increasing the sampling of the rare state. The rate constants causing the rare states to have low probability are first biased (changed) so that the rare events occur more frequently in simulation; then the importance sampling weights are adjusted to remove the bias introduced by changing the rate constants. We implemented Kuwahara and Mura's method on the pap operon example by decreasing by a factor of 1000 both rate constants r₁ and r₃ for leaving state g₁. Surprisingly, the rare state probability estimate with this change in rate constants turns out to be no better than the full SSA implementation. The reason for this counter-intuitive lack of improvement has to do with the nature of the rare event. In the pap operon example, the rare state g₁ is visited frequently, but the duration in this state is small. But the decrease in r₁ and r₂ has no effect on the simulation because the time for the next event is chosen using the original rate constants, not the biased ones. So the duration in rare state g₁ is unaffected. The modified simulation does not visit state g₁ more frequently, and it does not remain in this state longer than in the original SSA simulation. Similarly, increasing r₂ and r₄ also does not give better estimates.

Example 2: Biochemical Oscillator.

Sustained oscillations play a key role in biological processes such as circadian rhythms and cell-cycle dynamics. In many cases, these oscillations are driven by underlying biochemical reactions. Noise is an inherent component of cellular processes that causes cell-to-cell variation,¹^,²^,³^,⁴^,⁵ and is a potentially destabilizing influence that must be overcome to achieve sustained, reproducible oscillations. Vilar et al. examined this phenomenon using a model system,¹⁰ and we reconsider the same reaction system

\begin{matrix} A + D_{A} & ⇌_{θ_{A}}^{γ_{A}} & D_{A}^{'}, \\ D_{A} & \overset{α_{A}}{\to} & D_{A} + M_{A}, \\ D_{A}^{'} & \overset{α_{A}^{'}}{\to} & D_{A}^{'} + M_{A}, \\ M_{A} & \overset{δ_{M A}}{\to} & degraded, \\ M_{A} & \overset{β_{A}}{\to} & M_{A} + A, \\ A & \overset{δ_{A}}{\to} & degraded, \\ A + D_{R} & ⇌_{θ_{R}}^{γ_{R}} & D_{R}^{'}, \\ D_{R} & \overset{α_{R}}{\to} & D_{R} + M_{R}, \\ D_{R}^{'} & \overset{α_{R}^{'}}{\to} & D_{R}^{'} + M_{R}, \\ M_{R} & \overset{δ_{M R}}{\to} & degraded, \\ M_{R} & \overset{β_{R}}{\to} & M_{R} + R, \\ R & \overset{δ_{R}}{\to} & degraded, \\ A + R & \overset{γ_{C}}{\to} & C, \\ C & \overset{δ_{A}}{\to} & R . \end{matrix}

(14)

For the parameters given in Table 2, the mRNA species M_A is a QSS species. A full SSA simulation of the model demonstrates that this species predominantly samples [0, 1] over the course of a typical simulation. To understand how one can reduce the model to eliminate this species, consider the simpler reaction system

\begin{matrix} D_{A} & \overset{α_{A}}{\to} & D_{A} + M_{A}, \\ D_{A}^{'} & \overset{α_{A}^{'}}{\to} & D_{A}^{'} + M_{A}, \\ M_{A} & \overset{δ_{M A}}{\to} & degraded, \\ M_{A} & \overset{β_{A}}{\to} & M_{A} + A . \end{matrix}

(15)

In the supporting information, we show that when $β_{A}, δ_{M A} ≫ α_{A}, α_{A}^{'}, δ_{A}$ this simpler system 15 is well approximated by

\begin{matrix} D_{A} \to D_{A} + A r_{A}^{1} = {\bar{β}}_{A} ({\bar{δ}}_{M A} α_{A} D_{A}), \\ D_{A} \to D_{A} + 2 A r_{A}^{2} = {\bar{β}}_{A}^{2} ({\bar{δ}}_{M A} α_{A} D_{A}), \end{matrix}

(16)

\begin{matrix} ⋮ \\ D_{A} \to D_{A} + nA r_{A}^{n} = {\bar{β}}_{A}^{n} ({\bar{δ}}_{M A} α_{A} D_{A}), \\ D_{A}^{'} \to D_{A}^{'} + A r_{A}^{' 1} = {\bar{β}}_{A} ({\bar{δ}}_{M A} α_{A}^{'} D_{A}^{'}), \\ D_{A}^{'} \to D_{A}^{'} + 2 A r_{A}^{' 2} = {\bar{β}}_{A}^{2} ({\bar{δ}}_{M A} α_{A}^{'} D_{A}^{'}), \\ ⋮ \\ D_{A}^{'} \to D_{A}^{'} + nA r_{A}^{' n} = {\bar{β}}_{A}^{n} ({\bar{δ}}_{M A} α_{A}^{'} D_{A}^{'}), \end{matrix}

(17)

in which the M_A species no longer appears and ${\bar{β}}_{A} = β_{A} ∕ (δ_{M A} + β_{A})$ , ${\bar{δ}}_{M A} = 1 - {\bar{β}}_{A}$ . In the reduced system, the DNA species D_A and $D_{A}^{'}$ act as catalysts for production of the A protein. This system behaves similar to the catalytic example previously reported in Ref. 23 and is simulated in an analogous fashion. Namely, the sQSPA reduction indicates that the sum of all catalytic reactions involving D_A is

\begin{matrix} r_{A}^{Σ} & = \sum_{j = 1}^{\infty} r_{A}^{j} = {\bar{β}}_{A} α_{A} D_{A}, \end{matrix}

and the sum of all catalytic reactions involving $D_{A}^{'}$ is

\begin{matrix} r_{A}^{' Σ} & = \sum_{j = 1}^{\infty} r_{A}^{' j} = {\bar{β}}_{A} α_{A}^{'} D_{A}^{'} . \end{matrix}

Therefore, the catalytic production of A from D_A, catalytic reaction set 16 and $D_{A}^{'}$ , catalytic reaction set 17 can each be treated as a single reaction with rate $r_{A}^{Σ}$ or $r_{A}^{' Σ}$ . If the catalytic reaction set involving D_A is selected in the reduced model simulation, then one can use individual catalytic rates (the $r_{A}^{j}$ 's) to determine which catalytic reaction gets fired from the catalytic reaction set 16. A similar approach can be used if the catalytic reaction set involving $D_{A}^{'}$ gets selected.

Table 2.

Parameters for the biochemical oscillator. Initial conditions for unreported species are zero.

Parameter	Value	Units
γ_A	1.	(mol h)⁻¹
θ_A	50.	h⁻¹
α_A	50.	h⁻¹
$α_{A}^{'}$	500.	h⁻¹
δ_MA	1000.	h⁻¹
β_A	5000.	h⁻¹
δ_A	1.	h⁻¹
γ_R	1.	(mol h)⁻¹
θ_R	100.	h⁻¹
α_R	0.01	h⁻¹
$α_{R}^{'}$	50.	h⁻¹
δ_MR	50.	hr⁻¹
β_R	500.	h⁻¹
δ_R	0.2	h⁻¹
γ_C	2.	(mol h)⁻¹
D_A(t = 0)	1.	mol
D_R(t = 0)	1.	mol

Open in a new tab

Next we show that applying the stochastic and deterministic quasi-steady-state model reductions can lead to different reduced models. In the supporting information we show that if we apply deterministic quasi-steady-state classical dQC to the same reaction network 15, we get the following reduced mechanism:

\begin{matrix} D_{A} & \overset{α_{A}}{\to} & D_{A} + A r_{A} = {\bar{β}}_{A} α_{A} D_{A}, \end{matrix}

(18)

\begin{matrix} D_{A}^{'} & \overset{α_{A}^{'}}{\to} & D_{A} + A r_{A}^{'} = {\bar{β}}_{A} α_{A}^{'} D_{A}^{'} . \end{matrix}

(19)

The sQSPA reduced model of the biochemical oscillator replaces the reaction set 15 in the full model 14 with reactions sets 16, 17. The dQC reduced model of the biochemical oscillator replaces the reaction set 15 in the full model 14 with reactions sets 18, 19. The reduced biochemical oscillator model as obtained by sQSPA and as obtained by dQC reveals important features of the two reductions. Simulating the reduced system obtained from sQSPA reproduces the oscillatory characteristic of the full model (Fig. 5). However, the reduced system obtained from dQC is grossly inaccurate and completely fails to reproduce oscillatory characteristics of the full model (Fig. 5). The reason that dQC reduction fails to reproduce the oscillatory characteristics of the full model is that dQC reduction underestimates the rate of production of A compared to both the full model and the sQSPA reduced model.

Stochastic simulation of the biochemical oscillator. Proteins A and R vs time for the full model (left), sQSPA-reduced model (center), and dQC reduced model (right).

To more quantitatively compare the three different models—full, sQSPA reduced and dQC reduced, we compare expected value (E[A(t)]) and standard deviation (σ[A(t)]) of population of species A as obtained by the three models. To obtain the estimates of expected values and standard deviations at different time instants, we perform 3000 SSA simulations of each of the three models. To obtain E[A(t)] or σ[A(t)] for a particular model, we use the following formulas:

\begin{matrix} E [A (t)] & = & \frac{1}{N_{sim}} \sum_{j = 1}^{N_{sim}} A (t) \\ σ [A (t)] & = & \frac{1}{N_{sim} - 1} \sum_{j = 1}^{N_{sim}} {(A (t) - E [A (t)])}^{2} \end{matrix}

and N_sim = 3000 is the total number of SSA simulations performed. In Table 3 we see that at any given time E[A(t)] and σ[A(t)] as obtained by sQSPA reduced model are close to those obtained by the full model. However, the estimates of E[A(t)] and σ[A(t)] obtained with the dQC reduced model deviate significantly from those obtained by the full model.

Table 3.

Comparison of full, sQSPA reduced and dQC reduced models: E[A(t)] and σ[A(t)] obtained by sQSPA reduced model are close to those obtained by Full model. However, E[A(t)] and σ[A(t)] obtained by dQC reduced model is significantly different from those obtained by full model.

	E[A(t)]			σ[A(t)]
t in h	Full	sQSPA	dQC	Full	sQSPA	dQC
0	0	0	0	0	0	0
5	192.77	195.08	1.13	208.21	203.34	4.58
10	0.17	0.14	1.30	0.72	0.90	4.90
35	14.36	14.11	1.66	96.86	95.90	5.13
100	93.65	96.31	1.69	249.47	253.46	5.32
200	129.80	117.43	1.73	287.08	271.38	5.88
400	125.69	127.49	1.82	282.08	287.35	5.88

Open in a new tab

The sQSPA model reduction changes the full model parameter estimation problem of two large parameters (β_A, δ_MA) into the estimation of one well-scaled parameter ( ${\bar{β}}_{A}$ ).²⁷ The left column of Fig. 6 demonstrates how the full model simulations change for different values of β_A and δ_MA. Changing these parameters by a factor of 100 has almost no effect on the simulation. These parameters are essentially unidentifiable from measurements of species A and R. The right column of Fig. 6 shows how the reduced model changes with the parameter ${\bar{β}}_{A}$ . In the reduced model, changing ${\bar{β}}_{A}$ by a small amount has a large effect on the simulation, and this single parameter is easily estimated from measurements of A and R.

Sensitivity of the full and sQSPA reduced models to parameters. Left column: Output of the full model for different value of β_A and δ_MA while keeping their ratio fixed; Right column: Output of sQSPA reduced model for various value of ${\bar{β}}_{A}$ .

An additional benefit of the sQSPA model reduction of the biochemical oscillator is that the computation time is reduced to 160 fold. This reduction in computation time occurs because sQSPA reduced model removes the last two reactions of the reaction set 15. Reaction set 15 shows that formation of A from D_A or $D_{A}^{'}$ occurs through M_A. SSA simulation of the full model spends significant amount of time executing the fast reactions: degradation of M_A or formation of A from M_A. The sQSPA reduced model does not contain these two reactions and the formation of A occurs directly from D_A or $D_{A}^{'}$ , which results in significant computational savings.

Example 3: Fast fluctuation.

In a stochastic simulation of the infection cycle of vesicular stomatitis virus (VSV), fast fluctuation in a protein at low copy number along with a rapid increase in the population of the viral genome occurs. Such a system is expensive to simulate because the frequency of the fluctuation increases as the simulation progresses leading to small time steps in the SSA simulation.¹² To illustrate the phenomenon, consider the following simple three-species, three-reaction system:

\begin{matrix} A + G & \overset{k_{1}}{\to} & C + G, \end{matrix}

(20)

\begin{matrix} C + G & \overset{k_{2}}{\to} & 2 G + A, \end{matrix}

(21)

\begin{matrix} 2 G & \overset{k_{3}}{\to} & G, \\ with k_{2} & \sim & k_{1} ≫ k_{3} . \end{matrix}

(22)

This reaction system describes the interaction of three species in a simplified VSV replication process—two forms of viral polymerase, A and C, and viral genome G. The two forms of polymerase are motivated by the fact that VSV has two different complexes that serve as viral transcriptase and replicase.²⁸ The viral transcriptase form A is a complex of constituent VSV proteins L and P. The replicase form C is a complex of L, N, and P proteins. A is involved in the transcription reaction 20 to produce messenger RNA. The transcription reaction leads to the conversion of transcriptase A into replicase C. We further assume that produced mRNA from reaction 20 is short lived and hence we do not include it in the model. Species C and G are involved in replication reaction 21 to produce an additional viral genome G. The replication 21 reaction leads to the conversion of replicase C into transcriptase A. Finally, there is a second-order degradation reaction 22 of viral genome. The model 20, 21, 22 is insufficient to predict the full viral infection cycle, but it is instructive in understanding the simulation challenges of the full infection cycle model used by Hensel et al.¹² The reaction rate constants k₁, k₂, k₃ denote macroscopic reaction rate constants with units μm³∕(mol∕s). We express microscopic reaction rates in terms of macroscopic rate constants (k₁, k₂, k₃) and the system size Ω

r_{1} = \frac{1}{Ω} k_{1} a g r_{2} = \frac{1}{Ω} k_{2} c g r_{3} = \frac{1}{Ω} k_{3} g (g - 1),

in which the system size appears because the reactions are second order. For the purposes of this example we take Ω=10⁵μm³. A stochastic simulation with the parameter values given in Table 4 is shown in the first row of Fig. 7. Species G increases continuously and this increase forces species C to fluctuate with increasing frequency as shown in the first row of Fig. 7. The left side of Fig. 8 shows that as the simulation progresses, the time step decreases to small values (10⁻⁶ to 10⁻¹²s), which indicates a significant slow down of the SSA simulation. To generate the left side of Fig. 8, we have selected a few time points and plotted the time steps of SSA (Δt) at those time points. The right side of Fig. 8 shows that the frequency of the fluctuation of species C increases with time. To calculate the frequency of fluctuation, we select a few equidistant time points (0.25, 0.75, 1.25, … ) s and a 0.5 s interval containing each of the time points. We count the number of times C changes during these time intervals. The frequency of the fluctuation at a time point is obtained by dividing the number of times C changes by the length of the interval. Both the left and the right side of Fig. 8 demonstrate the slow down of SSA as simulation progresses due to the rapid increase in species G.

Table 4.

Initial population and reaction rate constants for the fast fluctuation example.

Species	initial number	Rate constant	value (μm³∕(mol∕s))
A	3	k₁	9 × 10⁵
C	0	k₂	5 × 10⁵
G	1	k₃	5 × 10⁻²

Open in a new tab

Top row: species C and G vs time from full model simulation. Bottom row: species C and G versus time from Hybrid SSA-Ω model simulation.

Left: The step size (Δt) of the SSA vs time; small time steps indicate slow simulation. Right: Frequency (ν) of fluctuation of C vs time.

To reduce this model, we apply the sQSPA-Ω technique in which we first represent G as a continuous variable

G = Ω φ_{G} + Ω^{1 ∕ 2} ξ,

in which ϕ_G is the deterministic mean and ξ is the noise. We substitute this expression into the master equation 20, 21, 22 to obtain a transformed master equation. Then we collect terms of the same order in Ω. As shown in the supporting information, the O(Ω⁰) terms lead to an approximate evolution equation for species A and C. This evolution equation is a time-invariant binomial distribution

\begin{matrix} W_{0 a} (a) & = (\binom{N_{0}}{a}) {(\frac{1}{K_{1} γ + 1})}^{a} {(\frac{K_{1} γ}{K_{1} γ + 1})}^{N_{0} - a}, \end{matrix}

(23)

\begin{matrix} W_{0 c} (c) & = (\binom{N_{0}}{c}) {(\frac{K_{1} γ}{K_{1} γ + 1})}^{c} {(\frac{1}{K_{1} γ + 1})}^{N_{0} - c}, \end{matrix}

(24)

in which γ = Ω/k₂.

Collecting O(Ω^−1/2) terms leads to an evolution equation for ϕ_G

\frac{d φ_{G}}{d t} = - γ^{- 1} ⟨ c ⟩ φ_{G} + k_{3} φ_{G}^{2},

(25)

where 〈c〉 = (N₀K₁γ)/(K₁γ + 1) is the mean of the population of species C. Equation 25 shows the dependence of the evolution of the mean concentration of G, ϕ_G on the mean of population of species C, 〈c〉. Collecting the O(Ω⁻¹) terms give an evolution equation for the probability density of the noise in G, ξ

\begin{matrix} \frac{\partial W_{0 ξ} (ξ)}{\partial t} & = & - \frac{\partial}{\partial ξ} [(\frac{⟨ c ⟩}{γ} - 2 k_{3} φ_{G}) ξ W_{0 ξ} (ξ)] \\ + \frac{1}{2} \frac{\partial^{2}}{\partial ξ^{2}} [(\frac{⟨ c ⟩ φ_{G}}{γ} + k_{3} φ_{G}^{2}) W_{0 ξ} (ξ)] . \end{matrix}

(26)

This evolution equation for W_0ξ(ξ, t) is a linear Fokker–Planck equation with time varying coefficients. The equivalent stochastic differential equation describing this noise process is²⁹

\begin{matrix} d ξ = (\frac{⟨ c ⟩}{γ} - 2 k_{3} φ_{G}) ξ d t + \sqrt{\frac{⟨ c ⟩ φ_{G}}{γ} + k_{3} φ_{G}^{2}} d W, \end{matrix}

(27)

in which W is the continuous time Wiener process, or integrated white noise. Equations 25, 27 together characterize the population of species G.

Initially when the population of G is small, we perform full SSA for the system. After the onset of the fast fluctuation, we switch to the description of the system given by Eqs. 23, 24, 25, 26, 27. The onset of the fast fluctuation is determined by setting a threshold value for G. We choose 10⁵ for the threshold in this example. We call this combined approach hybrid SSA-Ω, which uses SSA initially and sQSPA-Ω after the onset of the fast fluctuation. Figure 7 shows a comparison of the full SSA and the hybrid SSA-Ω simulations. The left column of Fig. 7 has a sample evolution of C from full SSA (top row) and hybrid SSA-Ω (bottom row), respectively. The right column of Fig. 7 shows a sample evolution of G from the full SSA (top row) and the hybrid SSA-Ω (bottom row). We see that the evolution of C and G from the two simulation approaches match closely. We next compare statistics of G from full and hybrid SSA-Ω simulations. The left side of Fig. 9 shows the mean of G from the full simulation and from the hybrid SSA-Ω simulation, and we see that the hybrid SSA-Ω accurately captures the mean of G. The right side of Fig. 9 shows the same comparison for the standard deviation of G, and we see that the hybrid SSA-Ω also accurately captures the standard deviation of G. Figure 10 compares probability density of C in the full model to the hybrid SSA-Ω model. To obtain an estimate of the probability density of C from the full SSA, we take simulation data from one full SSA simulation after the switch to the hybrid SSA-Ω has occurred. With this simulation data we obtain the frequency distribution of the C species. We take this frequency distribution as an estimate of the probability density of C from the full SSA. The probability density of C from the hybrid SSA-Ω is obtained using Eq. 24. The two densities match closely indicating that the hybrid SSA-Ω model captures the dynamics of the fast fluctuating species accurately.

Left side: Mean of G from full model and Hybrid SSA-Ω reduced models using 500 SSA simulations. Right side: Standard deviation of G from full model and Hybrid SSA-Ω reduced models using 500 simulations.

Comparison of probability density of C obtained from full SSA and Hybrid SSA-Ω.

Comparison of the simulation times for the two models shows an 80-fold reduction with hybrid SSA-Ω. A single simulation of SSA requires about 111 s whereas a single simulation of hybrid SSA-Ω requires 1.45 s.³⁰

CONCLUSIONS

This paper demonstrates the usefulness of the stochastic quasi-steady-state singular perturbation analysis and the stochastic quasi-steady-state singular perturbation analysis with Ω expansion (sQSPA-Ω).²³ These methods are well suited for situations in which some of the species in the system are highly reactive and sample mainly small (1 − 10) values. The reactant and product species may be present in either moderate values (<10³), in that case sQSPA is suitable, or in large values (≫10³), in that case sQSPA-Ω is suitable.

We considered three examples here. The first two examples, pap operon regulation and a biochemical oscillator, use sQSPA and the third one, fast fluctuation, uses sQSPA-Ω. The pap operon regulation full model consisted of four states, and the sQSPA reduction removed one low probability state (g₁). This example highlights difficulties in accurate rare state probability estimation using either SSA simulations or the recent technique proposed by Kuwahara and Mura based on importance sampling.²⁴ We showed that accurate estimation of the rare state probability (g₁) can be obtained using the probabilities of non-rare (g₂, g₃) states and the analysis provided by sQSPA. The biochemical oscillator example contained one species (M_A) that sampled mainly zero because of the rapid M_A degradation reactions. The sQSPA reduction converted the estimation problem of two large rate constants into the estimation of one well scaled parameter, their ratio. Also the reduced model showed a 160-fold decrease in simulation time. The fast fluctuation problem exhibited one species fluctuating rapidly and another increasing rapidly. We applied an extension of sQSPA-Ω that we call hybrid SSA-Ω. Hybrid SSA-Ω leads to an approximate probability distribution of the fast fluctuating species. It leads to a combination of an ordinary differential equation and a Langevin equation for the rapidly increasing species. The reduced model of the fast fluctuation problem showed an 80-fold speed up in the computation time.

In stochastic modeling of biological systems, certain characteristics are desirable: fast simulation times, ability to compute rare state probabilities directly from simulation, and ability to readily estimate model parameters from experimental data. Many detailed biological models do not have some or any of these characteristics. Model reduction tools such as sQSPA and sQSPA-Ω can play useful roles in developing reduced models that do have these characteristics.

ACKNOWLEDGMENTS

This work was supported by a National Institutes of Health (AI071197) award. E.L.H. gratefully acknowledges support from the National Institutes of Health under Ruth L. Kirschstein National Research Service Award 5F32CA120055. All simulations were performed using Octave (http://www.octave.org). Octave is freely distributed under the terms of the GNU General Public License.

References

Elowitz M. B., Levine A. J., Siggia E. D., and Swain P. S., Science 297, 1183 (2002). 10.1126/science.1070919 [DOI] [PubMed] [Google Scholar]
Ozbudak E. M., Thattai M., Kurtser I., Grossman A. D., and van Oudenaarden A., Nat. Genet. 31, 69 (2002). 10.1038/ng869 [DOI] [PubMed] [Google Scholar]
Blake W. J., Kaern M. A., Cantor C. R., and Collins J. J., Nature (London) 422, 633 (2003). 10.1038/nature01546 [DOI] [PubMed] [Google Scholar]
Raser J. M. and O’Shea E. K., Science 304, 1811 (2004). 10.1126/science.1098641 [DOI] [PMC free article] [PubMed] [Google Scholar]
Hooshangi S., Thiberge S., and Weiss R., Proc. Natl. Acad. Sci. U.S.A. 102, 3581 (2005). 10.1073/pnas.0408507102 [DOI] [PMC free article] [PubMed] [Google Scholar]
Munsky B. and Khammash M., J. Chem. Phys. 124, 044104 (2006). 10.1063/1.2145882 [DOI] [PubMed] [Google Scholar]
Gillespie D. T., J. Phys. Chem. 81, 2340 (1977). 10.1021/j100540a008 [DOI] [Google Scholar]
Gillespie D. T., Physica A 188, 404 (1992). 10.1016/0378-4371(92)90283-V [DOI] [Google Scholar]
Arkin A., Ross J., and McAdams H., Genetics 149, 1633 (1998). 10.1146/annurev.biophys.27.1.199 [DOI] [PMC free article] [PubMed] [Google Scholar]
Vilar J. M.G., Kueh H. Y., Barkai N., and Leibler S., Proc. Natl. Acad. Sci. U.S.A. 99, 5988 (2002). 10.1073/pnas.092133899 [DOI] [PMC free article] [PubMed] [Google Scholar]
Weinberger L. S., Burnett J. C., Toettcher J. E., Arkin A. P., and Schaffer D. V., Cell 122, 169 (2005). 10.1016/j.cell.2005.06.006 [DOI] [PubMed] [Google Scholar]
Hensel S., Rawlings J. B., and Yin J., Bull. Math. Biol. 71, 1671 (2009). 10.1007/s11538-009-9419-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
Gillespie D. T., Annu. Rev. Phys. Chem. 58, 35 (2007). 10.1146/annurev.physchem.58.032806.104637 [DOI] [PubMed] [Google Scholar]
Gutenkunst R. N., Waterfall J. J., Casey F. P., Brown K. S., Myers C. R., and Sethna J. P., PLOS Comput. Biol. 3, 1871 (2007). 10.1371/journal.pcbi.0030189 [DOI] [PMC free article] [PubMed] [Google Scholar]
Cao Y., Gillespie D. T., and Petzold L. R., J. Chem. Phys. 122, 014116 (2005). 10.1063/1.1824902 [DOI] [Google Scholar]
Goutsias J., J. Chem. Phys. 122, 184102 (2005). 10.1063/1.1889434 [DOI] [PubMed] [Google Scholar]
Haseltine E. L. and Rawlings J. B., J. Chem. Phys. 123, 164115 (2005). 10.1063/1.2062048 [DOI] [PubMed] [Google Scholar]
Samant A. and Vlachos D. G., J. Chem. Phys. 123, 144114 (2005). 10.1063/1.2046628 [DOI] [PubMed] [Google Scholar]
Salis H. and Kaznessis Y., J. Chem. Phys. 122, 054103 (2005). 10.1063/1.1835951 [DOI] [PubMed] [Google Scholar]
Liu W. E. D. and Vanden-Eijnden E., J. Chem. Phys. 123, 194107 (2005). 10.1063/1.2109987 [DOI] [PubMed] [Google Scholar]
Liu W. E. D. and Vanden-Eijnden E., J. Comput. Phys. 221, 158 (2007). 10.1016/j.jcp.2006.06.019 [DOI] [Google Scholar]
Rao C. V. and Arkin A. P., J. Chem. Phys. 118, 4999 (2003). 10.1063/1.1545446 [DOI] [Google Scholar]
Mastny E. A., Haseltine E. L., and Rawlings J. B., J. Chem. Phys. 127, 094106 (2007). 10.1063/1.2764480 [DOI] [PubMed] [Google Scholar]
Kuwahara H. and Mura I., J. Comput. Phys. 129, 165101 (2008). 10.1063/1.2987701 [DOI] [PubMed] [Google Scholar]
See supplementary material at http://dx.doi.org/10.1063/1.3580292 for the proof.
Roh M., Gillespie D., and Petzold L., J. Comp. Phys. 133, 174106 (2010). 10.1063/1.3493460 [DOI] [PMC free article] [PubMed] [Google Scholar]
Note 1, note that \documentclass[12pt]{minimal}\begin{document}$\protect {\bar\delta }_{MA}=1-\protect {\bar\beta }_A$\end{document}δ¯MA=1−β¯A, which means that \documentclass[12pt]{minimal}\begin{document}$\protect {\bar\delta }_{MA}$\end{document}δ¯MA is not an independent parameter in the reduced model.
Rose J. K. and Whitt M. A., in Fields Virology, edited by Knipe D. M. and Howley P. M. (Lippincot Williams & Wilkins, Philadelphia, 2001), vol. 1, pp. 1221–1244, 4th ed. [Google Scholar]
Kloeden P. E. and Platen E., Numerical solution of stochastic differential equations (Springer Verlag, Berlin, 1992). [Google Scholar]
Note 2, all computations were done with Octave on AMD Athlon Dual core processor with clock speed 2.2 GHz, L1 cache memory 256KB and L2 cache memory 2MB.

[c1] Elowitz M. B., Levine A. J., Siggia E. D., and Swain P. S., Science 297, 1183 (2002). 10.1126/science.1070919 [DOI] [PubMed] [Google Scholar]

[c2] Ozbudak E. M., Thattai M., Kurtser I., Grossman A. D., and van Oudenaarden A., Nat. Genet. 31, 69 (2002). 10.1038/ng869 [DOI] [PubMed] [Google Scholar]

[c3] Blake W. J., Kaern M. A., Cantor C. R., and Collins J. J., Nature (London) 422, 633 (2003). 10.1038/nature01546 [DOI] [PubMed] [Google Scholar]

[c4] Raser J. M. and O’Shea E. K., Science 304, 1811 (2004). 10.1126/science.1098641 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c5] Hooshangi S., Thiberge S., and Weiss R., Proc. Natl. Acad. Sci. U.S.A. 102, 3581 (2005). 10.1073/pnas.0408507102 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c6] Munsky B. and Khammash M., J. Chem. Phys. 124, 044104 (2006). 10.1063/1.2145882 [DOI] [PubMed] [Google Scholar]

[c7] Gillespie D. T., J. Phys. Chem. 81, 2340 (1977). 10.1021/j100540a008 [DOI] [Google Scholar]

[c8] Gillespie D. T., Physica A 188, 404 (1992). 10.1016/0378-4371(92)90283-V [DOI] [Google Scholar]

[c9] Arkin A., Ross J., and McAdams H., Genetics 149, 1633 (1998). 10.1146/annurev.biophys.27.1.199 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c10] Vilar J. M.G., Kueh H. Y., Barkai N., and Leibler S., Proc. Natl. Acad. Sci. U.S.A. 99, 5988 (2002). 10.1073/pnas.092133899 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c11] Weinberger L. S., Burnett J. C., Toettcher J. E., Arkin A. P., and Schaffer D. V., Cell 122, 169 (2005). 10.1016/j.cell.2005.06.006 [DOI] [PubMed] [Google Scholar]

[c12] Hensel S., Rawlings J. B., and Yin J., Bull. Math. Biol. 71, 1671 (2009). 10.1007/s11538-009-9419-5 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c13] Gillespie D. T., Annu. Rev. Phys. Chem. 58, 35 (2007). 10.1146/annurev.physchem.58.032806.104637 [DOI] [PubMed] [Google Scholar]

[c14] Gutenkunst R. N., Waterfall J. J., Casey F. P., Brown K. S., Myers C. R., and Sethna J. P., PLOS Comput. Biol. 3, 1871 (2007). 10.1371/journal.pcbi.0030189 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c15] Cao Y., Gillespie D. T., and Petzold L. R., J. Chem. Phys. 122, 014116 (2005). 10.1063/1.1824902 [DOI] [Google Scholar]

[c16] Goutsias J., J. Chem. Phys. 122, 184102 (2005). 10.1063/1.1889434 [DOI] [PubMed] [Google Scholar]

[c17] Haseltine E. L. and Rawlings J. B., J. Chem. Phys. 123, 164115 (2005). 10.1063/1.2062048 [DOI] [PubMed] [Google Scholar]

[c18] Samant A. and Vlachos D. G., J. Chem. Phys. 123, 144114 (2005). 10.1063/1.2046628 [DOI] [PubMed] [Google Scholar]

[c19] Salis H. and Kaznessis Y., J. Chem. Phys. 122, 054103 (2005). 10.1063/1.1835951 [DOI] [PubMed] [Google Scholar]

[c20] Liu W. E. D. and Vanden-Eijnden E., J. Chem. Phys. 123, 194107 (2005). 10.1063/1.2109987 [DOI] [PubMed] [Google Scholar]

[c21] Liu W. E. D. and Vanden-Eijnden E., J. Comput. Phys. 221, 158 (2007). 10.1016/j.jcp.2006.06.019 [DOI] [Google Scholar]

[c22] Rao C. V. and Arkin A. P., J. Chem. Phys. 118, 4999 (2003). 10.1063/1.1545446 [DOI] [Google Scholar]

[c23] Mastny E. A., Haseltine E. L., and Rawlings J. B., J. Chem. Phys. 127, 094106 (2007). 10.1063/1.2764480 [DOI] [PubMed] [Google Scholar]

[c24] Kuwahara H. and Mura I., J. Comput. Phys. 129, 165101 (2008). 10.1063/1.2987701 [DOI] [PubMed] [Google Scholar]

[c25] See supplementary material at http://dx.doi.org/10.1063/1.3580292 for the proof.

[c26] Roh M., Gillespie D., and Petzold L., J. Comp. Phys. 133, 174106 (2010). 10.1063/1.3493460 [DOI] [PMC free article] [PubMed] [Google Scholar]

[c27] Note 1, note that \documentclass[12pt]{minimal}\begin{document}$\protect {\bar\delta }_{MA}=1-\protect {\bar\beta }_A$\end{document}δ¯MA=1−β¯A, which means that \documentclass[12pt]{minimal}\begin{document}$\protect {\bar\delta }_{MA}$\end{document}δ¯MA is not an independent parameter in the reduced model.

[c28] Rose J. K. and Whitt M. A., in Fields Virology, edited by Knipe D. M. and Howley P. M. (Lippincot Williams & Wilkins, Philadelphia, 2001), vol. 1, pp. 1221–1244, 4th ed. [Google Scholar]

[c29] Kloeden P. E. and Platen E., Numerical solution of stochastic differential equations (Springer Verlag, Berlin, 1992). [Google Scholar]

[c30] Note 2, all computations were done with Octave on AMD Athlon Dual core processor with clock speed 2.2 GHz, L1 cache memory 256KB and L2 cache memory 2MB.

PERMALINK

The stochastic quasi-steady-state assumption: Reducing the model but not the noise

Rishi Srivastava

Eric L Haseltine

Ethan Mastny

James B Rawlings

Abstract

INTRODUCTION