A multiscale approximation in a heat shock response model of E. coli

Hye-Won Kang

doi:10.1186/1752-0509-6-143

. 2012 Nov 21;6:143. doi: 10.1186/1752-0509-6-143

A multiscale approximation in a heat shock response model of E. coli

Hye-Won Kang ^1,^✉

PMCID: PMC3608964 PMID: 23171439

Abstract

Background

A heat shock response model of Escherichia coli developed by Srivastava, Peterson, and Bentley (2001) has multiscale nature due to its species numbers and reaction rate constants varying over wide ranges. Applying the method of separation of time-scales and model reduction for stochastic reaction networks extended by Kang and Kurtz (2012), we approximate the chemical network in the heat shock response model.

Results

Scaling the species numbers and the rate constants by powers of the scaling parameter, we embed the model into a one-parameter family of models, each of which is a continuous-time Markov chain. Choosing an appropriate set of scaling exponents for the species numbers and for the rate constants satisfying balance conditions, the behavior of the full network in the time scales of interest is approximated by limiting models in three time scales. Due to the subset of species whose numbers are either approximated as constants or are averaged in terms of other species numbers, the limiting models are located on lower dimensional spaces than the full model and have a simpler structure than the full model does.

Conclusions

The goal of this paper is to illustrate how to apply the multiscale approximation method to the biological model with significant complexity. We applied the method to the heat shock response model involving 9 species and 18 reactions and derived simplified models in three time scales which capture the dynamics of the full model. Convergence of the scaled species numbers to their limit is obtained and errors between the scaled species numbers and their limit are estimated using the central limit theorem.

Keywords: Multiscale, Markov chains, Chemical reaction, Reaction networks, Heat shock

Background

Stochasticity may play an important role in biochemical systems. For example, stochasticity may be beneficial to give variability in gene expression, to produce population heterogeneity, and to adjust or respond to fluctuations in environment [1]. We are interested in local dynamics of biochemical networks involving some species with a small number of molecules so that the system is assumed to be well-mixed and relative fluctuations of small species numbers may play a role in the system dynamics.

The conventional stochastic model for the well-stirred biochemical network is based on the chemical master equation. The chemical master equation governs the evolution of the probability density of species numbers and is expressed as the balanced equation between influx and outflux of the probability density. When the biochemical network involves many species or bimolecular reactions, it is rarely possible to obtain an exact solution of the master equation in a closed form. Instead of searching for the solution of the master equation, stochastic simulation algorithms are used to obtain the temporal evolution of the species numbers. For example, Gillespie’s Stochastic Simulation Algorithm (SSA, or the direct method) is well known [2,3] and provides a realization of the exact trajectory of the sample path for the species numbers. As the biochemical network has more species and reactions, SSA becomes computationally expensive and more efficient algorithms were suggested by many authors [4-6]. The detailed review of stochastic simulation methods, stochastic approximations, and hybrid simulation methods is given in [7]. For models with well-separated time scales, numerous authors suggested stochastic simulation algorithms for biochemical reaction networks by assuming that “fast” subnetworks have reached a “partial equilibrium” [6] or a “quasi-steady state” [4]. Using these assumptions, the approximate stochastic simulation algorithms involve a reduced number of species or reactions.

On the other hand, Ball et al. [8] described the state of the biochemical reaction network in the well-stirred system directly using stochastic equations for species numbers, and suggested an approximation of the reaction network via limiting models derived using different scalings for the species numbers and for the reaction rate constants. Kang and Kurtz [9] extended this multiscale approximation method and gave a systematic way to obtain limiting models in the time scales of interest. Conditions are given to help identify appropriate values for a set of scaling exponents which determine the time scale of each species and reaction. Using this method, nonstationary behavior of biochemical systems can be analyzed. Moreover, application of the method is flexible in the sense that the method does not require the exact parameter values but gives approximations valid for a range of parameter values. More recently, Crude et al. [10] also proposed a reduction method to derive simplified models with preserving stochastic properties and with key parameters using averaging and hybrid simplification.

The multiscale approximation method in [9] requires consideration of magnitude of both species numbers and rate constants of the reactions involving the corresponding species. When a moderately fast reaction involves two species, one with a small number of molecules and the other with a large number of molecules, the effects of this reaction on these species are different. Net molecule changes of species with large numbers due to the reaction is less noticeable than those of species with small numbers. Therefore, though the same reaction governs these species, their time scales may be different from each other. Letting N₀ be a fixed constant and choosing a large value for N₀, for example N₀=100, we express magnitudes of species numbers and reaction rate constants in terms of powers of N₀ with different scaling exponents. For instance, 1 to 10molecules are expressed as $1 \times N_{0}^{0}$ to $10 \times N_{0}^{0} molecules$ , 500 to 800molecules are rewritten as 5×N₀ to 8×N₀molecules, and 0.0002 sec becomes $2 \times N_{0}^{- 2} sec$ . Assuming N₀ is large, we replace N₀by a large parameter N and stochastic equations for species numbers are expressed in terms of N. Then, N is an analogue of 1/ε where ε is a small parameter in perturbation theory.

A specific time scale of interest is expressed in terms of a power of N, and its exponent contributes to reaction rates due to change of variables in time. For each species (or linear combination of species), we compare a power of N for the species number and those for reaction rates involving this species. Consider a case when the power for the species number is larger than those for the rates of all reactions where the species is involved. Then net molecule changes due to the reactions are not large enough to be noticeable in this time scale, and the species number is approximated as constant. Next, consider a case when the power for the species number is smaller than those for some reaction rates involving the species. In this case, the species number fluctuates very rapidly due to the fast reactions in this time scale, and the averaged behavior of the species number can be described in terms of other species numbers. The method of averaging is similar to approximation of one variable in terms of others using a quasi-steady state assumption. Last, when the power for the species number is equal to those for the rates of reactions where the species is involved, the scaled species number is approximated by a nondegenerate limit describing nonstationary behavior of the species number in the specific time scale of interest. The limit could be described in various kinds of variables: a continuous time Markov chain, a deterministic model given by a system of ordinary differential equations, or a hybrid model with both discrete and continuous variables. Since some of the scaled species numbers are approximated as constants or the averaged behavior of some species numbers is expressed in terms of other variables, dimension of species in the approximation of the biochemical network is reduced.

In the multiscale approximation method, scaling exponents for species numbers and for reaction rate constants are not uniquely determined, since the choice of values for the exponents is flexible. For example, 0.005 sec can be expressed as $0.5 \times N_{0}^{- 1}$ or $5 \times N_{0}^{- 1.5}$ when N₀=100. The goal in this method is to find an appropriate set of scaling exponents to obtain a nondegenerate limit of the scaled species numbers. Orders of magnitude of species numbers in the propensities affect reaction rates, and reaction rates contribute to determining rates of net molecule changes of the species involved in the reactions. Since species numbers and reaction rates interact, it is not easy to determine scaling exponents for all species numbers and reaction rate constants so that the limits of the scaled species numbers become balanced.

Kang and Kurtz [9] introduced balance conditions for the scaling exponents, which help to determine values for a set of exponents. The key idea in these conditions is that for each species (or linear combination of species) the maximum of scaling exponents in the rates of the reactions where this species is produced should be the same as that in the rates of the reactions where this species is consumed, i.e. maximal production and consumption rates of the species should be balanced in the order of magnitude. In case the maximums of scaling exponents for productions and consumptions are not balanced for some species, an increase or decrease of the scaled species number can be described by its limit during a certain time period. However after this time period, the scaled species number will either become zero or blow up to infinity. Therefore, if some of the scaled species numbers are not balanced due to a difference between orders of magnitude of production and consumption rates, the chosen scaling is valid up to a certain time scale. After this time scale, we need to choose different values for scaling exponents. In each time scale of interest we derive a limiting model including a subset of species and reactions, which is used to approximate the state of the full reaction network. The multiscale approximation method is applicable in case some of reaction rates are not known accurately, since the chosen scaling is applicable in some ranges of the parameters. Therefore, based on the behavior of the limiting models, we may be able to estimate behavior for a range of parameter values without performing a huge number of stochastic simulations.

The paper [9] included several simple examples of biochemical networks involving two to four species, and derived limiting models in each time scale of interest. To apply this method, more scaling exponents must be determined as the biochemical network involves more species or reactions. Therefore, it is challenging to apply the method to complex biochemical systems and to determine appropriate values for scaling exponents so that the corresponding limiting models preserve important dynamical features of the full system. One of the goals of this paper is to illustrate how to apply this method to an example with significant complexity. In this paper, using a significantly complicated biochemical network, we derive limiting models, show convergence of the scaled species numbers to their limit, and estimate the error analytically between the scaled species numbers and their limit. We analyze a heat shock response model of Escherichia coli (E. coli) developed by Srivastava, Peterson, and Bentley in [11]. The model involves 9 species and 18 reactions with significant complexity as shown in Figure 1, and it has various time scales due to wide ranges of species numbers and reaction rate constants. Because of various scales involved, this model has been used as an example to show accuracy of the stochastic simulation algorithms which are developed to increase computational efficiency using the multiscale nature of the chemical reaction network [12,13]. Another version of a heat shock response model of E. coli is studied in [6] using an accelerated SSA that also exploits the multiscale nature of the system.

**A chemical reaction network in the heat shock response model of E. coli.** A dotted line represents the effect of the species acting as catalysts. $κ_{k}^{'}$ ’s represent stochastic reaction rate constants.

Applying the multiscale approximation method to the heat shock response model of E. coli, we derive limiting models in three time scales of our interests, which approximate the full network given in Figure 1. Denote ∅ as species we are not interested in. Let S_i represent the ith species and S₂₃be addition of species S₂and S₃. A→B denotes a reaction where one molecule of species A is converted to one molecule of species B. In the early stage of time period of order 1 sec, we obtain the following reduced network:

\begin{align} \emptyset \to S_{2} ⇌ S_{3}, \\ \emptyset \to S_{8} . \end{align}

The reduced network in the early stage has very simple structure without any bimolecular reactions, and all reactions involved are either production from a source or conversion. Moreover, the reduced network is well separated into two due to independence of S₈from S₂and S₃.

In the medium stage of time period of order 100 sec, the full network is reduced to

\begin{align} \emptyset \to S_{23}, \\ \emptyset \to S_{6} \overset{S_{8}}{\to} \emptyset, \emptyset \overset{S_{23}}{\to} S_{6}, \\ S_{7} \to S_{6}, \\ \emptyset \to S_{8}, \end{align}

where a species over the arrow accelerates or inhibits the corresponding reaction. The reaction does not change this species number, but the propensity of the corresponding reaction is a function of this species number. In this time scale, conversion between S₂ and S₃ occurs very frequently and S₂and S₃play a role as a single “virtual” species rather than separate species. The species numbers of S₂₃ and S₈are described as two independent birth processes and the species number of S₇ is governed by conversion. In this time scale, the species number of S₈is normalized and treated as a continuous variable. The interesting thing is that the behavior of the species S₈ which rapidly increases in time is well approximated in both first and second time scales.

In the late stage of time period of order 10,000 sec, we get a reduced network with more species involved than those in the previous time scales. However, the reduced network is still much simpler than the full network in Figure 1. At this time scale, we get

\begin{align} \emptyset \to S_{1} \to \emptyset, \\ \emptyset \overset{S_{1}}{\to} S_{23} \overset{S_{8}, S_{9}}{\to} \emptyset, \\ \emptyset \overset{S_{23}}{\to} S_{4} \to \emptyset, \\ \emptyset \overset{S_{23}}{\to} S_{5} \to \emptyset, \\ \emptyset \to S_{8} \to \emptyset, S_{8} \overset{S_{23}}{\to} \emptyset, \\ \emptyset \overset{S_{23}}{\to} S_{9} . \end{align}

As we see in Figure 1, the full network involves reactions with more than two reactants or products. However, all reactions in the reduced network at the times of order 10,000 sec consist of either production or degradation of each species, though most of the species (6 species out of 9) are involved in the reduced model. As in the medium stage of time period, S₂and S₃play a role as a single species. In the early and medium stages of time period propensities are in a form following the law of mass action, while in the late stage of time period the propensity for degradation of S₂₃ is a nonlinear function of the species numbers similar to the reaction rate appearing in the Michaelis-Menten approximation for an enzyme reaction. The nonlinear function involves the species numbers of S₂₃, S₈, and S₉, which come from averaging of the species numbers of S₂and S₆which fluctuate rapidly in the third time scale. Similarly, the propensity of catalytic degradation of S₈ is not proportional to the number of molecules of S₈.

In the late stage of time period of order 10,000 sec, we study the error between the scaled species numbers and their limit analytically using the central limit theorem derived in [14] and show that the error is of order 10⁻¹.

Methods

In the next several sections, we apply the multiscale approximation to the heat shock response model of E. coli and derive the limiting models. The multiscale approximation method is described in terms of the following steps so that the method can be applied to the general cases.

1. Write a chemical reaction network involving s₀species and r₀ reactions in the form of

\sum_{i = 1}^{s_{0}} ν_{ik} S_{i} \to \sum_{i = 1}^{s_{0}} ν_{ik}^{'} S_{i}, k = 1, \dots, r_{0},

where ν_ik and $ν_{ik}^{'}$ are nonnegative integers. Rearrange the reactions so that the reaction rate constants are decreasing monotonically as k gets large.

2. Derive a system of stochastic equations for species numbers.

(a) Letting X_i(t) be the number of molecules of species S_iat time t, the corresponding stochastic equation is

X_{i} (t) = X_{i} (0) + \sum_{k = 1}^{r_{0}} R_{k}^{t} (λ_{k} (X)) (ν_{ik}^{'} - ν_{ik}), i = 1, \dots s_{0},

where $R_{k}^{t} (\cdot)$ counts the number of times that the kth reaction occurs up to time t.

(b) λ_k(x) is determined by a stochastic version of mass action kinetics, and is expressed as a product of the rate constant and the numbers of molecules of reactants. If the kth reaction is second-order ( $\sum_{i = 1}^{s_{0}} ν_{ik} = 2$ ) with different types of reactants, $λ_{k} (x) = κ_{k}^{'} x_{p} x_{q}$ . When the reactants are two molecules of the same species, $λ_{k} (x) = κ_{k}^{'} x_{p} (x_{p} - 1)$ .

3. Derive a system of stochastic equations for the normalized species numbers after a time change, Z^N,γ(t).

(a) In the equation for X_i(t) obtained in Step 2 (a), replace X_iby $Z_{i}^{N, γ}$ and divide reaction terms by N^α_i. In the kth reaction term, put N^{γ + ρ}_k in the propensity and replace λ_k(X) by ${\hat{λ}}_{k} (Z^{N, γ})$ . Then, we have

\begin{array}{l} Z_{i}^{N, γ} (t) = Z_{i}^{N, γ} (0) + N^{- α_{i}} \sum_{k = 1}^{r_{0}} R_{k}^{t} (N^{γ + ρ_{k}} {\hat{λ}}_{k} (Z^{N, γ})) \\ \times (ν_{ik}^{'} - ν_{ik}), i = 1, \dots s_{0} . \end{array}

(b) In the equation in Step 3 (a), $ρ_{k} = β_{k} + \sum_{j = 1}^{s_{0}} α_{j} ν_{jk}$ .

(c) In the most reactions, ${\hat{λ}}_{k}$ is obtained by replacing $κ_{k}^{'}$ by κ_kin λ_k. In case the kth reaction is second-order with reactants of the same species, $λ_{k} (x) = κ_{k}^{'} x_{p} (x_{p} - 1)$ is replaced by ${\hat{λ}}_{k} (z) = κ_{k} z_{p} (z_{p} - N^{- α_{p}})$ .

4. Write a set of species balance equations and their time-scale constraints.

(a) Define $Γ_{i}^{+}$ and $Γ_{i}^{-}$ as subsets of reactions where the species number of S_iincreases or decreases every time the reaction occurs. Comparing ρ_k’s for $k \in Γ_{i}^{+}$ and those for $k \in Γ_{i}^{-}$ , set the balance equations as

max_{k \in Γ_{i}^{+}} ρ_{k} = max_{k \in Γ_{i}^{-}} ρ_{k}, i = 1, \dots, s_{0} .

(b) Time-scale constraints are given as

γ \leq max_{k \in Γ_{i}^{+} \cup Γ_{i}^{-}} ρ_{k}, i = 1, \dots, s_{0} .

5. Find a minimum set of linear combinations of species whose maximum of collective production (or consumption) rates may be different from that of one of any species. We construct a minimum set of linear combinations of species by selecting a linear combination of species if any reaction term involving the species consisting of the linear combination is canceled in the equation for the linear combination of species.

6. For each selected linear combination of species, write a collective species balance equation and its time-scale constraint. They are obtained similarly to the ones in Step 4 using subsets of reactions where the number of molecules of linear combinations of species either increases or decreases instead of using $Γ_{i}^{+}$ and $Γ_{i}^{-}$ .

7. Select a large value for N₀and choose an appropriate set of α_i’s and β_k’s so that

(a) the species number X_iand the reaction rate constant $κ_{k}^{'}$ are approximately of orders $N_{0}^{α_{i}}$ and $N_{0}^{β_{k}}$ ;

(b) the normalized species number $Z_{i}^{N, γ}$ and the scaled reaction rate constant κ_kare of order 1;

(d) β_k’s are monotone decreasing among each class of reactions which have the same number of molecules of reactants.

8. Plugging the chosen values for α_i’s and β_k’s in the time-scale constraints obtained in Steps 4 and 6, compute an upper bound (denoted as γ₀) for a time-scale exponent. Then, the chosen set of exponents α_i’s and β_k’s can be used for γsatisfying γ≤γ₀. For γ>γ₀, select another set of exponents α_i’s and β_k’s using Steps 7 and 8.

9. Using each set of values for α_i’s and β_k’s, identify a natural time scale exponent of each species (denoted as γ_i for species S_i) so that γ_i satisfies

max_{k \in Γ_{i}^{+} \cup Γ_{i}^{-}} (γ_{i} + ρ_{k}) = α_{i}, i = 1, \dots, s_{0} .

We collect γ_i’s with the same values, whose species are in the same time scales in the approximation.

10. Modify α_i’s and β_k’s so that the conditions in Step 7 are satisfied and that γ_i’s are divided into appropriate number of values, which gives the number of time scales, N^γ=N^γ_i, we are interested in.

11. For each chosen γ, derive a limiting equation for each species S_iwith γ_i=γ. Using the stochastic equation obtained in Step 3 (a), we let N go to infinity.

(a) For $k \in Γ_{i}^{+} \cup Γ_{i}^{-}$ , the kth reaction term converges to zero if α_i>γ + ρ_k.

(b) If α_i=γ + ρ_k, the kth reaction term appears as a limit in the limiting equation. The limit of the kth reaction term is discrete if α_i=0, while it is a continuous variable with the limit of its propensity if α_i>0.

(c) There is no k satisfying α_i<γ + ρ_kin the equation for species S_iwith γ=γ_idue to the definition of γ_igiven in Step 9.

12. In the limiting equation for each species S_iwith γ_i=γ, we approximate propensities in the reaction terms. Suppose that the normalized species number for S_jappears in the propensities.

(a) If γ_j>γ, the limit of the normalized species number for S_jis its initial value.

(b) If γ_j=γ, the limit of the normalized species number for S_jappears as a variable in the propensities in the limiting equation.

(c) If γ_j<γ, the limit of the normalized species number for S_jis expressed as a function of the limits of the normalized species numbers for S_iwith γ_i=γ. The function for S_jis obtained by dividing the equation for S_jby $N^{max_{k \in Γ_{j}^{+} \cup Γ_{j}^{-}} (γ + ρ_{k}) - α_{j}}$ and letting N go to infinity.

13. If a limiting model is not closed, consider limiting equations for some linear combinations of species selected in Step 5 whose natural time scale exponents are equal to the chosen γ.

The method for multiscale approximation described above can be applied to general chemical reaction networks containing different scales in species numbers and reaction rate constants. We can apply the method in case the rates of chemical reactions are determined by law of mass action and when there is no species whose number is either zero or infinity at all times. As given in [9], in the reaction network involving ∅→S₁, ∅→S₂, ∅→S₃, S₁ + S₂→∅, and S₁ + S₃→∅, convergence of the limit for the scaled species numbers may not be guaranteed at some time scales. Suppose that production rate of S₁ is larger than that of S₂but with the same order of magnitude, and that production rate of S₃ is much smaller than those of S₁and S₂. Then, X₁(t) may blow up to infinity and X₂(t) may go to zero at some time scales. In this case, the method is not applicable.

Results and discussion

Model description

We analyze a heat shock response model of E. coli developed by Srivastava, Peterson, and Bentley [11]. The heat shock response model gives a simplified mechanism occurring in the E. coli to respond to high temperature. Heat causes unfolding, misfolding, or aggregation of proteins, and cells overcome the heat stress by producing heat shock proteins, which refold or degrade denatured proteins. In E. coli, σ³²factors play an important role in recovery from the stress under the high temperature. σ³²factors catalyze production of the heat shock proteins such as chaperon proteins and other proteases. In this model, J denotes a chaperon complex, FtsH represents a σ³²-regulated stress protein, and GroEL is a σ³²-mediated stress response protein.

σ³² factors are in three different forms, free σ³²protein, σ³² combined with RNA polymerase (Eσ³²), and σ³² combined with a chaperon complex (σ³²-J). Under the normal situation without stress, most of the σ³² factors combine with chaperon complexes and form σ³²-J. A chaperon complex J keeps σ³²factors in an inactive form, and σ³²factors can directly respond to the stress by changing into different forms. When there exist σ³²factors combined with chaperon complexes, FtsH catalyzes degradation of σ³² factors. Thus, if enough σ³²-regulated stress proteins are produced, σ³²factors are degraded.

Not only σ³²factors, but recombinant proteins also require chaperon complexes to form a complex so that denatured protein can be fixed. Therefore, σ³²factors and recombinant proteins compete to bind chaperon complexes, and different levels of binding affinity of recombinant proteins to chaperon complexes change the evolution of the system state. In the model, we assume that σ³² factors and recombinant proteins have the same affinity to bind to chaperon complexes. The system is sensitive to the amount and forms of σ³² factors: a small decrease of σ³²factors causes a large reduction of production of chaperon complexes and σ³²-regulated stress proteins, and the ratio of three different forms of σ³²factors determines system dynamics in the stress response [11]. The total initial number of molecules of σ³² factors in each cell is small [11] (also see initial values for S₂, S₃, and S₇ which are 1, 1, and 7 in Table 1), and the stochastic model is appropriate to be considered.

Table 1.

Species in the heat shock response model of E. coli and their initial values


X₁	=	# of S₁	σ³² mRNA	X₁(0)	=	10
X₂	=	# of S₂	σ³² protein	X₂(0)	=	1
X₃	=	# of S₃	Eσ³²	X₃(0)	=	1
X₄	=	# of S₄	FtsH	X₄(0)	=	93
X₅	=	# of S₅	GroEL	X₅(0)	=	172
X₆	=	# of S₆	J	X₆(0)	=	54
X₇	=	# of S₇	σ³²-J	X₇(0)	=	7
X₈	=	# of S₈	Recombinant protein	X₈(0)	=	50
X₉	=	# of S₉	J-Recombinant protein	X₉(0)	=	0

Open in a new tab

The model involves 9 species and 18 reactions. Denote s₀ as the number of species and r₀ as the number of reactions. Let X(t) be a state vector whose ith component represents the number of molecules of species S_i at time t for i=1,⋯,s₀. Define a random process which counts the number of times that the kth reaction occurs by time t as

R_{k}^{t} (λ_{k} (X)) \equiv Y_{k} (\int_{0}^{t} λ_{k} (X (s)) ds), k = 1, \dots, r_{0},

where λ_k(X) is the propensity of the kth reaction and the Y_k’s are independent unit Poisson processes. Therefore, $R_{k}^{t} (\cdot)$ is a nonnegative integer-valued random process increasing by 1. As λ_k(·) gets large, the moment when $R_{k}^{t} (λ_{k} (\cdot))$ increases becomes more frequent. Let ν_ik( $ν_{ik}^{'}$ ) be the number of molecules of S_i that are consumed (produced) in the kth reaction. Define ν_k( $ν_{k}^{'}$ ) as an s₀-dimensional vector whose ith component is ν_ik( $ν_{ik}^{'}$ ). Then, X(t) is given as

X (t) = X (0) + \sum_{k = 1}^{r_{0}} R_{k}^{t} (λ_{k} (X)) (ν_{k}^{'} - ν_{k}) .

(1)

That is, species numbers at time t are expressed in terms of their initial values and sum of the number of times that each reaction occurs multiplied by net molecule changes in the corresponding reaction. In our model, the system of equations are derived using a set of reactions in Table 2 as:

Table 2.

Reactions in the heat shock response model of E. coli

	Reaction	Transition
R1	$\emptyset \overset{gene}{\to} S_{8}$	Recombinant protein synthesis
R2	S₂→S₃	Holoenzyme association
R3	S₃→S₂	Holoenzyme disassociation
R4	$\emptyset \overset{S_{1}}{\to} S_{2}$	σ³² translation
R5	$S_{3} \overset{gene}{\to} S_{2} + S_{5}$	GroEL synthesis
R6	$S_{3} \overset{gene}{\to} S_{2} + S_{4}$	FtsH synthesis
R7	$S_{3} \overset{gene}{\to} S_{2} + S_{6}$	J-production
R8	S₇→S₂ + S₆	σ³²-J-disassociation
R9	S₂ + S₆→S₇	σ³²-J-association
R10	S₆ + S₈→S₉	Recombinant protein-J association
R11	S₈→∅	Recombinant protein degradation
R12	S₉→S₆ + S₈	Recombinant protein-J disassociation
R13	$\emptyset \overset{gene}{\to} S_{1}$	σ³² transcription
R14	S₁→∅	σ³² mRNA decay
R15	$S_{7} \overset{S_{4}}{\to} S_{6}$	σ³² degradation
R16	$S_{5} \to \emptyset$	GroEL degradation
R17	$S_{6} \to \emptyset$	J-disassociation
R18	$S_{4} \to \emptyset$	FtsH degradation

Open in a new tab

In Reaction 5, 6, and 7, we assume that the number of molecules of each gene is 1 and that these reactions are effectively unimolecular. Similarly, Reactions 1 and 13 are treated as production from a source.

\begin{array}{l} X_{1} (t) = X_{1} (0) + R_{13}^{t} (κ_{13}^{'}) - R_{14}^{t} (κ_{14}^{'} X_{1}), \\ X_{2} (t) = X_{2} (0) + R_{3}^{t} (κ_{3}^{'} X_{3}) + R_{4}^{t} (κ_{4}^{'} X_{1}) + R_{5}^{t} (κ_{5}^{'} X_{3}) \\ + R_{6}^{t} (κ_{6}^{'} X_{3}) + R_{7}^{t} (κ_{7}^{'} X_{3}) + R_{8}^{t} (κ_{8}^{'} X_{7}) - R_{2}^{t} (κ_{2}^{'} X_{2}) \\ - R_{9}^{t} (κ_{9}^{'} X_{2} X_{6}), \\ X_{3} (t) = X_{3} (0) + R_{2}^{t} (κ_{2}^{'} X_{2}) - R_{3}^{t} (κ_{3}^{'} X_{3}) - R_{5}^{t} (κ_{5}^{'} X_{3}) \\ - R_{6}^{t} (κ_{6}^{'} X_{3}) - R_{7}^{t} (κ_{7}^{'} X_{3}), \\ X_{4} (t) = X_{4} (0) + R_{6}^{t} (κ_{6}^{'} X_{3}) - R_{18}^{t} (κ_{18}^{'} X_{4}), \\ X_{5} (t) = X_{5} (0) + R_{5}^{t} (κ_{5}^{'} X_{3}) - R_{16}^{t} (κ_{16}^{'} X_{5}), \\ X_{6} (t) = X_{6} (0) + R_{7}^{t} (κ_{7}^{'} X_{3}) + R_{8}^{t} (κ_{8}^{'} X_{7}) + R_{12}^{t} (κ_{12}^{'} X_{9}) \\ + R_{15}^{t} (κ_{15}^{'} X_{4} X_{7}) - R_{9}^{t} (κ_{9}^{'} X_{2} X_{6}) - R_{10}^{t} (κ_{10}^{'} X_{6} X_{8}) \\ - R_{17}^{t} (κ_{17}^{'} X_{6}), \\ X_{7} (t) = X_{7} (0) + R_{9}^{t} (κ_{9}^{'} X_{2} X_{6}) - R_{8}^{t} (κ_{8}^{'} X_{7}) - R_{15}^{t} (κ_{15}^{'} X_{4} X_{7}), \\ X_{8} (t) = X_{8} (0) + R_{1}^{t} (κ_{1}^{'}) + R_{12}^{t} (κ_{12}^{'} X_{9}) - R_{10}^{t} (κ_{10}^{'} X_{6} X_{8}) \\ - R_{11}^{t} (κ_{11}^{'} X_{8}), \\ X_{9} (t) = X_{9} (0) + R_{10}^{t} (κ_{10}^{'} X_{6} X_{8}) - R_{12}^{t} (κ_{12}^{'} X_{9}) . \end{array}

(2)

$κ_{k}^{'}$ represents the stochastic reaction rate constant for the kth reaction, and their values from [11] are given in Table 3.

Table 3.

Stochastic reaction rate constants in the heat shock response model of E. coli

Rates		Rates
$κ_{1}^{'}$	4.00×10⁰	$κ_{10}^{'}$	3.62×10⁻⁴
$κ_{2}^{'}$	7.00×10⁻¹	$κ_{11}^{'}$	9.99×10⁻⁵
$κ_{3}^{'}$	1.30×10⁻¹	$κ_{12}^{'}$	4.40×10⁻⁵
$κ_{4}^{'}$	7.00×10⁻³	$κ_{13}^{'}$	1.40×10⁻⁵
$κ_{5}^{'}$	6.30×10⁻³	$κ_{14}^{'}$	1.40×10⁻⁶
$κ_{6}^{'}$	4.88×10⁻³	$κ_{15}^{'}$	1.42×10⁻⁶
$κ_{7}^{'}$	4.88×10⁻³	$κ_{16}^{'}$	1.80×10⁻⁸
$κ_{8}^{'}$	4.40×10⁻⁴	$κ_{17}^{'}$	6.40×10⁻¹⁰
$κ_{9}^{'}$	3.62×10⁻⁴	$κ_{18}^{'}$	7.40×10⁻¹¹

Open in a new tab

We convert deterministic rate constants in [11] using the volume of E. coli which is assumed to be 1.5×10⁻¹⁵L.

We derive the limiting models in three time scales, which approximate a full network in a certain time period involving a subset of species and reactions. In what follows, $Z_{i}^{γ}$ is a limit of the scaled species number of S_i at some time scales depending on γ, and as γ gets larger the times are in the later stage. Note that the exponent γin $Z_{i}^{γ}$ does not imply (Z_i)^γ but it shows dependence of $Z_{i}^{γ}$ on γ. Let κ_k be a scaled reaction rate constant for the kth reaction. In the first time scale (when the times are in the early stage), the subnetwork governed by

\begin{array}{l} Z_{2}^{0} (t) = Z_{2}^{0} (0) + R_{3}^{t} (κ_{3} Z_{3}^{0}) + R_{4}^{t} (κ_{4} Z_{1}^{0} (0)) - R_{2}^{t} (κ_{2} Z_{2}^{0}), \\ Z_{3}^{0} (t) = Z_{3}^{0} (0) + R_{2}^{t} (κ_{2} Z_{2}^{0}) - R_{3}^{t} (κ_{3} Z_{3}^{0}), \\ Z_{8}^{0} (t) = Z_{8}^{0} (0) + R_{1}^{t} (κ_{1}), \end{array}

(3)

approximates the network when the times are of order 1 sec. Denote $Z_{23}^{1}$ as the limit of the addition of the scaled species numbers for S₂and S₃. In the second time scale (when the times are in the medium stage), the subnetwork governed by

\begin{array}{l} Z_{23}^{1} (t) = Z_{23}^{1} (0) + R_{4}^{t} (κ_{4} Z_{1}^{1} (0)), \\ Z_{6}^{1} (t) = Z_{6}^{1} (0) + R_{7}^{t} (\frac{κ_{2} κ_{7}}{κ_{2} + κ_{3}} Z_{23}^{1}) + R_{12}^{t} (κ_{12} Z_{9}^{1} (0)) \\ + R_{15}^{t} (κ_{15} Z_{4}^{1} (0) Z_{7}^{1}) - R_{10}^{t} (κ_{10} Z_{6}^{1} Z_{8}^{1}), \\ Z_{7}^{1} (t) = Z_{7}^{1} (0) - R_{15}^{t} (κ_{15} Z_{4}^{1} (0) Z_{7}^{1}), \\ Z_{8}^{1} (t) = Z_{8}^{1} (0) + κ_{1} t, \end{array}

(4)

approximates the network at the times of order 100 sec. In the third time scale, set the limit of the averaged scaled species numbers of fast-fluctuating species S₂, S₃, and S₆ as

\begin{array}{l} {\bar{Z}}_{2}^{2} (t) \equiv \frac{κ_{3}}{κ_{2} + κ_{3}} Z_{23}^{2} (t), \\ {\bar{Z}}_{3}^{2} (t) \equiv \frac{κ_{2}}{κ_{2} + κ_{3}} Z_{23}^{2} (t), \\ {\bar{Z}}_{6}^{2} (t) \equiv \frac{κ_{7} {\bar{Z}}_{3}^{2} (s) + κ_{12} Z_{9}^{2} (s)}{κ_{10} Z_{8}^{2} (s)} . \end{array}

When the times are in a late stage, the subnetwork governed by

\begin{array}{l} Z_{1}^{2} (t) = Z_{1}^{2} (0) + R_{13}^{t} (κ_{13}) - R_{14}^{t} (κ_{14} Z_{1}^{2}), \\ Z_{23}^{2} (t) = Z_{23}^{2} (0) + \int_{0}^{t} [κ_{4} Z_{1}^{2} (s) - \frac{κ_{3} κ_{9}}{κ_{2} + κ_{3}} Z_{23}^{2} (s)) \\ \times ((\frac{\frac{κ_{2} κ_{7}}{κ_{2} + κ_{3}} Z_{23}^{2} (s) + κ_{12} Z_{9}^{2} (s)}{κ_{10} Z_{8}^{2} (s)})] ds \\ \equiv Z_{23}^{2} (0) + \int_{0}^{t} [κ_{4} Z_{1}^{2} (s) - κ_{9} {\bar{Z}}_{2}^{2} (s) {\bar{Z}}_{6}^{2} (s)] ds, \\ Z_{4}^{2} (t) = Z_{4}^{2} (0) + \int_{0}^{t} (\frac{κ_{2} κ_{6}}{κ_{2} + κ_{3}} Z_{23}^{2} (s) - κ_{18} Z_{4}^{2} (s)) ds \\ \equiv Z_{4}^{2} (0) + \int_{0}^{t} (κ_{6} {\bar{Z}}_{3}^{2} (s) - κ_{18} Z_{4}^{2} (s)) ds, \\ Z_{5}^{2} (t) = Z_{5}^{2} (0) + \int_{0}^{t} (\frac{κ_{2} κ_{5}}{κ_{2} + κ_{3}} Z_{23}^{2} (s) - κ_{16} Z_{5}^{2} (s)) ds \\ \equiv Z_{5}^{2} (0) + \int_{0}^{t} (κ_{5} {\bar{Z}}_{3}^{2} (s) - κ_{16} Z_{5}^{2} (s)) ds, \\ Z_{8}^{2} (t) = Z_{8}^{2} (0) + \int_{0}^{t} (κ_{1} - \frac{κ_{2} κ_{7}}{κ_{2} + κ_{3}} Z_{23}^{2} (s) - κ_{11} Z_{8}^{2} (s)) ds \\ \equiv Z_{8}^{2} (0) + \int_{0}^{t} (κ_{1} - κ_{7} {\bar{Z}}_{3}^{2} (s) - κ_{11} Z_{8}^{2} (s)) ds, \\ Z_{9}^{2} (t) = Z_{9}^{2} (0) + \int_{0}^{t} \frac{κ_{2} κ_{7}}{κ_{2} + κ_{3}} Z_{23}^{2} (s) ds \\ \equiv Z_{9}^{2} (0) + \int_{0}^{t} κ_{7} {\bar{Z}}_{3}^{2} (s) ds, \end{array}

(5)

approximates the network at the times of order 10,000 sec. Detailed derivation is given in the later sections. Note that it is possible to identify different numbers of time scales depending on the scaling of the species numbers and reaction rate constants. In the heat shock response model of E. coli, it is possible to obtain approximate models with two or four time scales. However, if the number of time scales are too many, the limiting model in each time scale may involve one species and a few number of reactions and the model in this case may not be interesting to consider.

Derivation of the scaled models

The stochastic equations given in Equations (2) describe temporal evolution of the species numbers. For example, the equations for species S₂and S₃ are

\begin{array}{l} X_{2} (t) = X_{2} (0) + R_{3}^{t} (κ_{3}^{'} X_{3}) + R_{4}^{t} (κ_{4}^{'} X_{1}) + R_{5}^{t} (κ_{5}^{'} X_{3}) \\ + R_{6}^{t} (κ_{6}^{'} X_{3}) + R_{7}^{t} (κ_{7}^{'} X_{3}) + R_{8}^{t} (κ_{8}^{'} X_{7}) - R_{2}^{t} (κ_{2}^{'} X_{2}) \\ - R_{9}^{t} (κ_{9}^{'} X_{2} X_{6}), \end{array}

(6a)

\begin{array}{l} X_{3} (t) = X_{3} (0) + R_{2}^{t} (κ_{2}^{'} X_{2}) - R_{3}^{t} (κ_{3}^{'} X_{3}) - R_{5}^{t} (κ_{5}^{'} X_{3}) \\ - R_{6}^{t} (κ_{6}^{'} X_{3}) - R_{7}^{t} (κ_{7}^{'} X_{3}) . \end{array}

(6b)

In Equation (6), species numbers of S₂and S₃ are determined by the times when reactions occur and by the number of times that reactions happen. On the other hand, reaction time and frequency are determined by propensities which are some functions of species numbers. Therefore, reaction rates and species numbers interact one another. Reaction rates vary from O(10⁻¹¹) to O(1) as we see in Table 3, and species numbers in this model are from O(1) to O(10⁴) as we see later in the simulation of the full network. We express each species number and rate constant in terms of powers of a common number with different weights on exponents. Define N₀=100 as a fixed unitless constant used to express the magnitude of the species numbers and the reaction rate constants. Define α_i for i=1,⋯,s₀ and β_k for k=1,⋯,r₀ as the scaling exponent for species S_i and for the reaction rate constant $κ_{k}^{'}$ . We express the reaction rate constants in a form of $N_{0}^{β_{k}} κ_{k}$ where κ_k is of order 1 and is determined so that $κ_{k}^{'} = N_{0}^{β_{k}} κ_{k}$ . For example, we have $κ_{6}^{'} = 4.88 \times 1 0^{- 3}$ and we can choose β₆=−1 so that the reaction rate is expressed as $κ_{6}^{'} = 0.488 \times N_{0}^{β_{6}}$ . Assuming that N₀ is large, we replace N₀ by N and express the process as $X_{i}^{N} (t)$ to show dependence of the species numbers on N. Note that {X^N(t)} is a family of processes depending on N and $X_{i}^{N} (t) = X_{i} (t)$ when N=N₀. Then, the equation for $X_{3}^{N}$ is given as

\begin{array}{l} X_{3}^{N} (t) = X_{3}^{N} (0) + R_{2}^{t} (N^{β_{2}} κ_{2} X_{2}^{N}) - R_{3}^{t} (N^{β_{3}} κ_{3} X_{3}^{N}) \\ - R_{5}^{t} (N^{β_{5}} κ_{5} X_{3}^{N}) - R_{6}^{t} (N^{β_{6}} κ_{6} X_{3}^{N}) - R_{7}^{t} (N^{β_{7}} κ_{7} X_{3}^{N}), \end{array}

where $X_{3}^{N} (0)$ is defined later so that $X_{3}^{N} (0) = X_{3} (0)$ when N=N₀. Since the numbers of molecules of species are in different orders of magnitude, we scale the number of molecules of the ith species by N^α_i and set a normalized species number as

Z_{i}^{N} (t) = N^{- α_{i}} X_{i}^{N} (t) .

The ith species number may have different orders of magnitude at different times so α_i may have different values for different time scales. Now, we set the initial values as

X_{i}^{N} (0) \equiv ⌊{(\frac{N}{N_{0}})}^{α_{i}} X_{i} (0)⌋,

(7)

so that $X_{i}^{N_{0}} (0) = X_{i} (0)$ and $lim_{N \to \infty} Z_{i}^{N} (0) = lim_{N \to \infty}$ $N^{- α_{i}} X_{i}^{N} (0) = N_{0}^{- α_{i}} X_{i} (0)$ .

Next, we scale the propensities of reactions by replacing $X_{i}^{N}$ by $N^{α_{i}} Z_{i}^{N}$ and replacing $κ_{k}^{'}$ by N^β_kκ_k. For example, consider the 9th reaction term in (6a). Replacing $κ_{9}^{'}$ by N^β₉κ₉, $X_{2}^{N}$ by $N^{α_{2}} Z_{2}^{N}$ , and $X_{6}^{N}$ by $N^{α_{6}} Z_{6}^{N}$ , the 9th reaction term becomes

R_{9}^{t} (κ_{9}^{'} X_{2}^{N} X_{6}^{N}) = R_{9}^{t} (N^{β_{9} + α_{2} + α_{6}} κ_{9} Z_{2}^{N} Z_{6}^{N}) .

(8)

For simplicity, set ρ₉=β₉ + α₂ + α₆ and define a scaling exponent in the propensity of the kth reaction term as

ρ_{k} \equiv β_{k} + ν_{k} \cdot α,

where $α = {(α_{1}, \dots, α_{s_{0}})}^{T}$ and $ν_{k} = {(ν_{1 k}, \dots, ν_{s_{0} k})}^{T}$ . Here, ν_ik gives the number of molecules of species S_iconsumed in the kth reaction. Then, (8) is rewritten as

R_{9}^{t} (κ_{9}^{'} X_{2}^{N} X_{6}^{N}) = R_{9}^{t} (N^{ρ_{9}} κ_{9} Z_{2}^{N} Z_{6}^{N}) .

Dividing (6a) by N^α₂ and (6b) by N^α₃ and scaling the propensities, we get

\begin{array}{l} Z_{2}^{N} (t) = Z_{2}^{N} (0) + N^{- α_{2}} [R_{3}^{t} (N^{ρ_{3}} κ_{3} Z_{3}^{N}) + R_{4}^{t} (N^{ρ_{4}} κ_{4} Z_{1}^{N})) \\ + R_{5}^{t} (N^{ρ_{5}} κ_{5} Z_{3}^{N}) + R_{6}^{t} (N^{ρ_{6}} κ_{6} Z_{3}^{N}) \\ + R_{7}^{t} (N^{ρ_{7}} κ_{7} Z_{3}^{N}) + R_{8}^{t} (N^{ρ_{8}} κ_{8} Z_{7}^{N}) \\ (- R_{2}^{t} (N^{ρ_{2}} κ_{2} Z_{2}^{N}) - R_{9}^{t} (N^{ρ_{9}} κ_{9} Z_{2}^{N} Z_{6}^{N})], \end{array}

(9a)

\begin{array}{l} Z_{3}^{N} (t) = Z_{3}^{N} (0) + N^{- α_{3}} [R_{2}^{t} (N^{ρ_{2}} κ_{2} Z_{2}^{N}) - R_{3}^{t} (N^{ρ_{3}} κ_{3} Z_{3}^{N})) \\ - R_{5}^{t} (N^{ρ_{5}} κ_{5} Z_{3}^{N}) - R_{6}^{t} (N^{ρ_{6}} κ_{6} Z_{3}^{N}) \\ (- R_{7}^{t} (N^{ρ_{7}} κ_{7} Z_{3}^{N})] . \end{array}

(9b)

For each reaction, ρ_kis given in terms of α_iand β_k in the Additional file 1: Table S1.

We are interested in dynamics of species numbers $Z_{2}^{N} (t)$ and $Z_{3}^{N} (t)$ in various stages of time period. In the early stage of time period, normalized species numbers of S₂ and S₃ are very close to their scaled initial values, since these species numbers have not changed yet. In the medium stage of time period, the normalized species numbers of S₂and S₃ are asymptotically equal to non-constant limits. In the late stage of time period, the normalized species numbers of S₂ and S₃fluctuate very rapidly and their averaged behavior is captured in terms of some function of other species numbers.

We want to express the time scale of each species in terms of power of N. First, we express order of magnitude of a specific time period of interest as a power of N with a time scale exponent γ. Applying a time change by replacing t by N^γtin $Z_{i}^{N} (t)$ , we define a variable for the normalized species numbers after a time change as

Z_{i}^{N, γ} (t) \equiv N^{- α_{i}} X_{i}^{N} (t N^{γ}) = Z_{i}^{N} (t N^{γ}) .

(10)

Then, $Z_{i}^{N, γ} (t)$ gives a normalized species number at the times of order N^γ. A natural time scale of S_iis the time when $Z_{i}^{N, γ} (t)$ has a nonzero finite limit which is not constant and of order 1.

Changing a time variable by replacing t by N^γt in (9a) and (9b), the normalized species numbers of S₂and S₃after a time change satisfy

\begin{array}{l} Z_{2}^{N, γ} (t) = Z_{2}^{N, γ} (0) + N^{- α_{2}} [R_{3}^{t} (N^{γ + ρ_{3}} κ_{3} Z_{3}^{N, γ})) \\ + R_{4}^{t} (N^{γ + ρ_{4}} κ_{4} Z_{1}^{N, γ}) + R_{5}^{t} (N^{γ + ρ_{5}} κ_{5} Z_{3}^{N, γ}) \\ + R_{6}^{t} (N^{γ + ρ_{6}} κ_{6} Z_{3}^{N, γ}) + R_{7}^{t} (N^{γ + ρ_{7}} κ_{7} Z_{3}^{N, γ}) \\ + R_{8}^{t} (N^{γ + ρ_{8}} κ_{8} Z_{7}^{N, γ}) - R_{2}^{t} (N^{γ + ρ_{2}} κ_{2} Z_{2}^{N, γ}) \\ - (R_{9}^{t} (N^{γ + ρ_{9}} κ_{9} Z_{2}^{N, γ} Z_{6}^{N, γ})], \end{array}

(11a)

\begin{array}{l} Z_{3}^{N, γ} (t) = Z_{3}^{N, γ} (0) + N^{- α_{3}} [R_{2}^{t} (N^{γ + ρ_{2}} κ_{2} Z_{2}^{N, γ})) \\ - R_{3}^{t} (N^{γ + ρ_{3}} κ_{3} Z_{3}^{N, γ}) - R_{5}^{t} (N^{γ + ρ_{5}} κ_{5} Z_{3}^{N, γ}) \\ (- R_{6}^{t} (N^{γ + ρ_{6}} κ_{6} Z_{3}^{N, γ}) - R_{7}^{t} (N^{γ + ρ_{7}} κ_{7} Z_{3}^{N, γ})], \end{array}

(11b)

where N^γin each propensity comes from the change of the time variable. Here, the initial values may depend on γ, since we can choose different values for α_ifor each γdue to changes in order of magnitude of species numbers in time. The stochastic equations after scaling and a time change for all species are given in the Additional file 1: Section 1.

Balance conditions

Our goal is to approximate dynamics of the full network in the heat shock response model of E. coli in specific times of interest in terms of simplified subnetworks preserving significant biological features. In each time period of interest, we obtain a nondegenerate limiting model which is not equal to zero and does not blow up to infinity. In this section, we introduce balance conditions which help us to choose appropriate values for the scaling exponents α_i’s and β_k’s so that the limit is nonzero finite. For each time period of interest of order $N_{0}^{γ}$ where N₀=100, we choose values for scaling exponents so that orders of magnitude of the species number for S_i and the kth reaction rate constant are about $N_{0}^{α_{i}}$ and $N_{0}^{β_{k}}$ , respectively. That is,

\begin{align} Z_{i}^{N_{0}, γ} (t) = \frac{X_{i}^{N_{0}} (t N_{0}^{γ})}{N_{0}^{α_{i}}} = O (1), \\ κ_{k} = \frac{κ_{k}^{'}}{N_{0}^{β_{k}}} = O (1) . \end{align}

It is natural to choose β_k’s in monotone decreasing manner in k, since $κ_{k}^{'}$ ’s are in monotone decreasing order as shown in Table 3. In Table 3, the production rates from a source are the rates per second. The unimolecular reaction rates are the rates per molecule per second while the bimolecular reaction rates are the rates per a pair of molecules per second. Since the reaction rates are expressed in different units, we separate rate constants into three classes based on the number of reactants and assume that monotonicity of β_k’s holds in each class of reactions. In other words, we choose β_k’s so that

\begin{align} β_{1} & \geq β_{13}, \\ β_{2} & \geq β_{3} \geq β_{4} \geq β_{5} \geq β_{6} \geq β_{7} \geq β_{8} \geq β_{11} \geq β_{12} \geq β_{14} \\ \geq β_{16} \geq β_{17} \geq β_{18}, and \\ β_{9} & \geq β_{10} \geq β_{15} . \end{align}

Next, in order to make the normalized specie number $Z_{i}^{N, γ} (t)$ balanced, it is required that the rates of production and consumption of S_ishould be in the same order of magnitude. If the order of magnitude of production rate is larger than that of consumption, the normalized species number asymptotically goes to infinity. In the opposite case, the normalized species number asymptotically becomes zero. Therefore, for each species S_i, we set the balance equation for α_i’s and β_k’s so that the maximal exponent in the propensities of the reactions producing S_i is equal to that in the propensities of the reactions consuming S_i. For example, to obtain a balance equation for species S₂, we compare the scaling exponents in propensities of reactions involving S₂using (11a), and set the maximal exponents of production and consumption of S₂ equal. Similarly, using (11b), we set the maximal exponents in the production rates and the consumption rates of S₃ equal. Then, the balance equations for species S₂and S₃ are

max (ρ_{3}, ρ_{4}, ρ_{5}, ρ_{6}, ρ_{7}, ρ_{8}) = max (ρ_{2}, ρ_{9}),

(12a)

ρ_{2} = max (ρ_{3}, ρ_{5}, ρ_{6}, ρ_{7}) .

(12b)

If the maximal orders of magnitudes of production and consumption rates for S₂ are different from each other, the species number of S₂should be large enough so that a difference between production and consumption of S_i is not noticeable. In other words if α_i’s and β_k’s do not satisfy (12a), α₂should be at least as large as the scaling exponents located in all reaction terms in (11a) to prevent the limit becoming zero or blowing up to infinity. Similarly, in case (12b) is not satisfied, α₃ should be at least as large as the scaling exponents located in the reaction terms in (11b) to prevent the limit becoming zero or blowing up to infinity.

\begin{array}{l} α_{2} \geq γ + max (ρ_{2}, ρ_{3}, ρ_{4}, ρ_{5}, ρ_{6}, ρ_{7}, ρ_{8}, ρ_{9}), \\ α_{3} \geq γ + max (ρ_{2}, ρ_{3}, ρ_{5}, ρ_{6}, ρ_{7}), \end{array}

(13)

Solving (13) for γ, we get the following time-scale constraints:

γ \leq α_{2} - max (ρ_{2}, ρ_{3}, ρ_{4}, ρ_{5}, ρ_{6}, ρ_{7}, ρ_{8}, ρ_{9}) \equiv u_{2},

(14a)

γ \leq α_{3} - max (ρ_{2}, ρ_{3}, ρ_{5}, ρ_{6}, ρ_{7}) \equiv u_{3} .

(14b)

Inequalities in (14) mean that if maximal production and consumption rates are not balanced either for S₂ or S₃, the chosen set of values for scaling exponents can be used to approximate the dynamics of the full network up to times of order N^u₂ or N^u₃. For times later than those of order N^u₂or N^u₃, we need to choose another set of values for scaling exponents based on the balance equations. We call the balance equation and the time-scale constraint for each species as the species balance condition. If either (12a) or (??) is satisfied, we say that the species balance condition for S₂ is satisfied.

Even though species balance conditions for S₂and S₃ are satisfied, the limit of the normalized species numbers for S₂or S₃ may become degenerate. Consider addition of species S₂and S₃ as a single virtual species, and compare the collective rates of production and consumption of this species. Recall that S₂₃ denotes addition of species S₂and S₃. Since production of one species is canceled by consumption of the other species, maximal production rate of S₂₃ may be different from that of S₂or S₃. Suppose that the maximal collective rates of production or consumption of S₂₃ are slower than the maximal production or consumption rates of S₂and S₃. Also, suppose that the maximal collective rates of production and consumption of the complex have different orders of magnitude. Then, a limit of the normalized species number of S₂₃can be zero or infinity, even though the species balance conditions for S₂ and S₃ are satisfied. Therefore, we need an additional condition to obtain balance between collective production and consumption rates for S₂₃. To obtain a balance equation for S₂₃, we unnormalize (11a) and (11b) by multiplying N^α₂ and N^α₃, respectively. Adding the unnormalized equations for species S₂and S₃ and dividing it by $N^{max (α_{2}, α_{3})}$ , we get

\begin{array}{l} N^{- max (α_{2}, α_{3})} (N^{α_{2}} Z_{2}^{N, γ} (t) + N^{α_{3}} Z_{3}^{N, γ} (t)) \\ = N^{- max (α_{2}, α_{3})} (N^{α_{2}} Z_{2}^{N, γ} (0) + N^{α_{3}} Z_{3}^{N, γ} (0)) \\ + N^{- max (α_{2}, α_{3})} R_{4}^{t} (N^{γ + ρ_{4}} κ_{4} Z_{1}^{N, γ}) \\ + N^{- max (α_{2}, α_{3})} R_{8}^{t} (N^{γ + ρ_{8}} κ_{8} Z_{7}^{N, γ}) \\ - N^{- max (α_{2}, α_{3})} R_{9}^{t} (N^{γ + ρ_{9}} κ_{9} Z_{2}^{N, γ} Z_{6}^{N, γ}) . \end{array}

(15)

Comparing the maximal scaling exponents of production and consumption of S₂₃ in (15), a balance equation for S₂₃is given as

max (ρ_{4}, ρ_{8}) = ρ_{9} .

(16)

In case (16) is not satisfied, the order of magnitude of the species number for S₂₃ should be larger than those of collective production and consumption rates so that a difference between production and consumption is not noticeable. This gives

max (α_{2}, α_{3}) \geq γ + max (ρ_{4}, ρ_{8}, ρ_{9}) .

(17)

Solving (??) for γ, we get

γ \leq max (α_{2}, α_{3}) - max (ρ_{4}, ρ_{8}, ρ_{9}) \equiv u_{23} .

(18)

Similarly to the time-scale constraint in the species balance condition, (18) implies that if maximal collective production and consumption rates for S₂₃are not balanced, our choice of values for scaling exponents are valid up to times of order N^u₂₃.

We call (16) and (18) the collective species balance condition for S₂₃, that is, either (16) or (18) must hold. The species balance conditions for all species and the collective species balance conditions for all positive linear combinations of species should be satisfied to obtain a nondegenerate limit of $Z_{i}^{N, γ}$ (Condition 3.2 in [9]). Condition 3.2 can be reduced by Lemma 3.4-3.8 and Remark 3.9 in [9]. A key idea is to find a minimum subset of linear combinations of species so that production of one species is canceled by consumption of the other species when we combine the species. In that case, maximal collective production rate of the linear combination of the species may be different from that of each species. Therefore, species balance conditions may not imply the collective species balance condition for the linear combination of the species. For example, a collective species balance condition for addition of S₂ and S₃ should be satisfied, since reactions producing S₂or S₃ may not increase the species number of S₂₃. In Table 4, we choose linear combinations of species whose collective species balance conditions may not be satisfied by the species balance conditions. For other linear combinations of species, their collective species balance conditions are derived from the ones in Table 4. Satisfying all balance conditions in Table 4 guarantees satisfying balance conditions for all positive linear combination of species, and these conditions help to identify scaling exponents which give a nondegenerate limit of the normalized species numbers in the heat shock response model of E. coli. In most cases satisfying balance conditions gives nondegenerate limiting models in the times of interest, but we can still find counter examples as given in the last paragraph in the section for methods.

Table 4.

Balance equations and time-scale constraints for each species and for each collective species chosen

	Balance equations	Time-scale constraints
S₁	ρ₁₃=ρ₁₄	$γ \leq α_{1} - max (ρ_{13}, ρ_{14})$
S₂	$max (ρ_{3}, ρ_{4}, ρ_{5}, ρ_{6}, ρ_{7}, ρ_{8}) = max (ρ_{2}, ρ_{9})$	$γ \leq α_{2} - max (ρ_{2}, ρ_{3}, ρ_{4}, ρ_{5}, ρ_{6}, ρ_{7}, ρ_{8}, ρ_{9})$
S₃	$ρ_{2} = max (ρ_{3}, ρ_{5}, ρ_{6}, ρ_{7})$	$γ \leq α_{3} - max (ρ_{2}, ρ_{3}, ρ_{5}, ρ_{6}, ρ_{7})$
S₄	ρ₆=ρ₁₈	$γ \leq α_{4} - max (ρ_{6}, ρ_{18})$
S₅	ρ₅=ρ₁₆	$γ \leq α_{5} - max (ρ_{5}, ρ_{16})$
S₆	$max (ρ_{7}, ρ_{8}, ρ_{12}, ρ_{15}) = max (ρ_{9}, ρ_{10}, ρ_{17})$	$γ \leq α_{6} - max (ρ_{7}, ρ_{8}, ρ_{9}, ρ_{10}, ρ_{12}, ρ_{15}, ρ_{17})$
S₇	$ρ_{9} = max (ρ_{8}, ρ_{15})$	$γ \leq α_{7} - max (ρ_{8}, ρ_{9}, ρ_{15})$
S₈	$max (ρ_{1}, ρ_{12}) = max (ρ_{10}, ρ_{11})$	$γ \leq α_{8} - max (ρ_{1}, ρ_{10}, ρ_{11}, ρ_{12})$
S₉	ρ₁₀=ρ₁₂	$γ \leq α_{9} - max (ρ_{10}, ρ_{12})$
S₂ + S₃ + S₇	ρ₄=ρ₁₅	$γ \leq max (α_{2}, α_{3}, α_{7}) - max (ρ_{4}, ρ_{15})$
S₂ + S₃	$max (ρ_{4}, ρ_{8}) = ρ_{9}$	$γ \leq max (α_{2}, α_{3}) - max (ρ_{4}, ρ_{8}, ρ_{9})$
S₂ + S₇	$max (ρ_{3}, ρ_{4}, ρ_{5}, ρ_{6}, ρ_{7}) = max (ρ_{2}, ρ_{15})$	$γ \leq max (α_{2}, α_{7}) - max (ρ_{2}, ρ_{3}, ρ_{4}, ρ_{5}, ρ_{6}, ρ_{7}, ρ_{15})$
S₆ + S₇ + S₉	ρ₇=ρ₁₇	$γ \leq max (α_{6}, α_{7}, α_{9}) - max (ρ_{7}, ρ_{17})$
S₆ + S₇	$max (ρ_{7}, ρ_{12}) = max (ρ_{10}, ρ_{17})$	$γ \leq max (α_{6}, α_{7}) - max (ρ_{7}, ρ_{10}, ρ_{12}, ρ_{17})$
S₆ + S₉	$max (ρ_{7}, ρ_{8}, ρ_{15}) = max (ρ_{9}, ρ_{17})$	$γ \leq max (α_{6}, α_{9}) - max (ρ_{7}, ρ_{8}, ρ_{9}, ρ_{15}, ρ_{17})$
S₈ + S₉	ρ₁=ρ₁₁	$γ \leq max (α_{8}, α_{9}) - max (ρ_{1}, ρ_{11})$

Open in a new tab

In each case, either the balance equation or the time-scale constraint must hold.

Based on species and collective species balance equations in Table 4, we choose appropriate values for α_i’s and β_k’s so that most of the balance equations are satisfied. If some of the balance equations are not satisfied, corresponding time-scale constraints give a range of γ where the chosen α_i’s and β_k’s are valid. The time-scale constraint, γ≤γ₀, implies that the set of scaling exponents α_i’s and β_k’s chosen is appropriate only up to time whose order of magnitude is equal to N^γ₀. For the times larger than O(N^γ₀), we need to choose a different set of values for the scaling exponents, α_i’s. Assuming that reaction rate constants do not change in time and that the species numbers vary in time, we in general use one set of β_k’s for all time scales and may use several sets of α_i’s. A large change of the species numbers in time requires different α_i’s in different time scales. For the heat shock model we identify three different time scales as we will see in the section of limiting models in three time scales, and α₁, α₂, α₃, α₈, and α₉ may depend on the time scale. α₄, α₅, α₆, and α₇ are the same for all time scales.

Before we determine scaling exponents for S₁, S₂, and S₃, we run one realization of stochastic simulation to find ranges of the species numbers in time. Using initial values for species S₁, S₂, and S₃, X₁(0)=10 and X₂(0)=X₃(0)=1 as given in Table 1 and using N₀=100, we set $X_{1} (t) \approx O (100) = O (N_{0}^{α_{1}})$ , $X_{2} (t) \approx O (1) = O (N_{0}^{α_{2}})$ , and $X_{3} (t) \approx O (1) = O (N_{0}^{α_{3}})$ with α₁=1 and α₂=α₃=0 in the early stage of time period. Plugging in α_i’s and β_k’s in the balance equations for S₂, S₃, and S₂₃, equality holds in (12a) and (12b) but not in (16). Therefore, (18) gives

\begin{align} γ & \leq max (α_{2}, α_{3}) - max (ρ_{4}, ρ_{8}, ρ_{9}) \\ = max (0, 0) - max (0, - 2, - 2) = 0 . \end{align}

Then, the first set of scaling exponents with α₁=1 and α₂=α₃=0 is valid only when γ≤0. Next, based on the fact that X₂(t)≈O(10) and X₃(t)≈O(10) in the medium stage of time period, we choose α₂=α₃=0 for γ>0. At this stage of time period, we set $X_{1} (t) = O (10) \approx O (N_{0}^{α_{1}})$ with α₁=0. Then, (12a) and (12b) are satisfied but not (16). The condition (18) gives γ≤1, and the second set of scaling exponents with α₁=α₂=α₃=0 is valid when γ≤1. Finally, we set α₁=0 and α₂=α₃=1 for γ>1 based on the fact that the numbers of molecules of S₂and S₃ grow in time and are of order 100. Then, (12a), (12b), and (16) are all satisfied, and the third set of scaling exponents with α₁=0 and α₂=α₃=1 can be used for γ>1.

The three sets of values for the scaling exponents chosen are given in the Additional file 1: Table S4. With chosen values for the scaling exponents, we check whether each balance equation is satisfied and give a time-scale constraint in the Additional file 1: Table S6 in case the balance equation is not satisfied. Different choices of α_i’s and β_k’s from the ones in the Additional file 1: Table S4 give different limiting models. As long as the chosen values for α_i’s and β_k’s satisfy balance conditions, the limiting model will describe nontrivial behavior of the species numbers which are nonzero and finite in the specific time of interest.

Limiting models in three time scales

In the heat shock response model of E. coli, we identify a time scale of interest using the chosen set of scaling exponents and derive a limiting model which approximates dynamics of the full chemical reaction network. Each limiting model involves a subset of species and reactions, and gives features of the full network during the time interval of interest.

To identify a time scale involving a limiting model with interesting dynamics (nondegenerate), we first need to determine a natural time scale of each species. Recall that a natural time scale of species S_i is the time period of order N^γ_i when $Z_{i}^{N, γ_{i}} (t)$ is of order 1. The natural time scale exponent γ_ifor species S_i is rigorously determined by

max_{k \in Γ_{i}^{+} \cup Γ_{i}^{-}} (γ_{i} + ρ_{k}) = α_{i},

(19)

where Γi + denotes the collection of reactions where the species number of S_i increases every time the reaction occurs. Similarly, Γi− is the subset of reactions where the species number of S_idecreases every time the reaction occurs. In (19), the left-side term is the maximal order of magnitude of rates of reactions involving S_iand the right-side term is the order of magnitude of the species number for S_i. If times are earlier than those of order N^γ_i(γ<γ_i), fluctuations of species number of S_i due to the reactions involving S_iare not noticeable compared to magnitude of the species number of S_i. Then, the species number of S_i is approximated as its initial value. In the times of order N^γ_i(γ=γ_i), changes of species number of S_i due to the reactions and the species number of S_i are similar in magnitude and behavior of the species number of S_iis described by its nondegenerate limit. If times are later than those of order N^γ_i(γ>γ_i), the species number of S_i fluctuates very rapidly due to the reactions involving S_i compared to the magnitude of the species number of S_i. Then, the averaged behavior of the species number of S_iis approximated by some function of other species numbers. Note that γ_i depends on α_i’s and β_k’s, and the time scale of the ith species may change if we use several sets of α_i’s.

All values of α_i’s and ρ_k’s for three scalings which are used to derive limiting models are given in the Additional file 1: Table S4. The equations for normalized species numbers and the equation for $Z_{23}^{N, γ}$ which are used later in this section are given in the Additional file 1: Section 1 and Section 2, respectively. When we derive limiting models in three time scales, boundedness of the normalized species numbers is required. For first two time scales, we define stopping times so that the normalized species numbers are bounded up to those times. For the last time scale, we proved stochastic boundedness of some normalized species numbers in a finite time interval. For more details, see Additional file 1: Section 5.

Consider a model with the first set of scaling exponents including α₁=1 and α₂=α₃=0. Note that the first set of scaling exponents is valid when γ≤0 based on the time scale constraints given in Table 4. Substituting α₂=0 and ρ_k’s for the first scaling to the equation for $Z_{2}^{N, γ}$ given in (11a), we have

\begin{array}{l} Z_{2}^{N, γ} (t) = Z_{2}^{N, γ} (0) + R_{3}^{t} (N^{γ} κ_{3} Z_{3}^{N, γ}) + R_{4}^{t} (N^{γ} κ_{4} Z_{1}^{N, γ}) \\ + R_{5}^{t} (N^{γ - 1} κ_{5} Z_{3}^{N, γ}) + R_{6}^{t} (N^{γ - 1} κ_{6} Z_{3}^{N, γ}) \\ + R_{7}^{t} (N^{γ - 1} κ_{7} Z_{3}^{N, γ}) + R_{8}^{t} (N^{γ - 2} κ_{8} Z_{7}^{N, γ}) \\ - R_{2}^{t} (N^{γ} κ_{2} Z_{2}^{N, γ}) - R_{9}^{t} (N^{γ - 2} κ_{9} Z_{2}^{N, γ} Z_{6}^{N, γ}) . \end{array}

(20)

When γ=γ₂, the maximal scaling exponent in the propensities of all reaction terms in (20) should be equal to the scaling exponent for the species number of S₂. Therefore, γ₂satisfies

max (γ_{2}, γ_{2} - 1, γ_{2} - 2) = 0 = α_{2},

(21)

and we get γ₂=0. Similarly, we get γ₃=γ₈=0.

Next, we plug α₁=1 and ρ_k’s for the first scaling in the equation for $Z_{1}^{N, γ}$ and get

\begin{array}{l} Z_{1}^{N, γ} (t) = Z_{1}^{N, γ} (0) + N^{- 1} R_{13}^{t} (N^{γ - 2} κ_{13}) \\ - N^{- 1} R_{14}^{t} (N^{γ - 1} κ_{14} Z_{1}^{N, γ}) . \end{array}

(22)

By comparing the maximal scaling exponent in the propensities of all reaction terms in (22) and the scaling exponent for the species number of S₁, γ₁ satisfies

max (γ_{1} - 2, γ_{1} - 1) = 1 = α_{1},

(23)

and we get γ₁=2. Similarly, we get γ_i>0 for i=4,5,6,7,9. Among all natural time scale exponents of species, we choose the smallest one, γ=0, and set t∼O(N⁰)=O(1) as the first time scale we are interested in. Since γ₁>0, $Z_{1}^{N, 0} (t) \to Z_{1}^{0} (0)$ as N→∞. Similarly, $Z_{i}^{N, 0} (t) \to Z_{i}^{0} (0) = N_{0}^{- α_{i}} X_{i} (0)$ for i=4,5,6,7,9 as N→∞. To sum up, in this time scale with γ=0, the species numbers of S_i’s for i=1,4,5,6,7,9 change more slowly than other species numbers, and the species numbers with slow time scales are approximated as constant.

To derive the limiting equation for S₂, we set γ=0 in (20). Since the 2nd, 3rd, and 4th reaction terms have propensities with N⁰=1 and the species number of S₂ is of order 1, these reaction terms converge to nonzero limits in the limiting equation. On the other hand, the propensities of the 5th, 6th, 7th, 8th and 9th reaction terms are of order N⁻¹ or N⁻² which are smaller than the species number for S₂of order 1. Therefore, these reaction terms converge to zero as N→∞at least in the finite time interval. In the 2nd and 3rd reaction terms in (20), $Z_{2}^{N, 0} (s) \to Z_{2}^{0} (s)$ and $Z_{3}^{N, 0} (s) \to Z_{3}^{0} (s)$ as N→∞ since γ₂=γ₃=0. Then, using $Z_{1}^{N, 0} (s) \to Z_{1}^{0} (0)$ as N→∞, the limit of $Z_{2}^{N, 0}$ satisfies

Z_{2}^{0} (t) = Z_{2}^{0} (0) + R_{3}^{t} (κ_{3} Z_{3}^{0}) + R_{4}^{t} (κ_{4} Z_{1}^{0} (0)) - R_{2}^{t} (κ_{2} Z_{2}^{0}) .

Similarly, we get a limiting model with $Z_{2}^{0}$ , $Z_{3}^{0}$ , and $Z_{8}^{0}$ for γ=0 as given in (3).

Next, consider a model with the second set of scaling exponents including α₁=α₂=α₃=0. Note that the second set of scaling exponents is valid when γ≤1 based on the time scale constraints given in Table 4. To determine the natural time scale of S₆, substitute α₆=0 and ρ_k’s for the second scaling in the equation for $Z_{6}^{N, γ}$ , and we have

\begin{array}{l} Z_{6}^{N, γ} (t) = Z_{6}^{N, γ} (0) + R_{7}^{t} (N^{γ - 1} κ_{7} Z_{3}^{N, γ}) \\ + R_{8}^{t} (N^{γ - 2} κ_{8} Z_{7}^{N, γ}) + R_{12}^{t} (N^{γ - 1} κ_{12} Z_{9}^{N, γ}) \\ + R_{15}^{t} (N^{γ - 1} κ_{15} Z_{4}^{N, γ} Z_{7}^{N, γ}) \\ - R_{9}^{t} (N^{γ - 2} κ_{9} Z_{2}^{N, γ} Z_{6}^{N, γ}) \\ - R_{10}^{t} (N^{γ - 1} κ_{10} Z_{6}^{N, γ} Z_{8}^{N, γ}) \\ - R_{17}^{t} (N^{γ - 2} κ_{17} Z_{6}^{N, γ}) . \end{array}

(24)

Comparing the exponents inside and outside of the reaction terms in (24), γ₆ satisfies

max (γ_{6} - 1, γ_{6} - 2) = 0 = α_{6},

(25)

and we get γ₆=1. Similarly, we get γ₇=γ₈=1, γ_i<1 for i=2,3, and γ_i>1 for i=1,4,5,9. We already get the temporal behavior of species numbers of S₂, S₃, and S₈ through the limiting model when γ=0. Thus, we set t∼O(N¹) as the second time scale we are interested in, and derive a limiting model for S₆, S₇, and S₈ when γ=1. Note that species S₈ is involved in the limiting models for both γ=0 and γ=1, since we use different sets of scaling exponents in these models. For i=1,4,5,9 $Z_{i}^{N, 1} (t) \to Z_{i}^{1} (0)$ as N→∞, since γ_i>1. Thus, in the 12th and 15th reaction terms in (24), $Z_{9}^{N, 1} (s) \to Z_{9}^{1} (0)$ and $Z_{4}^{N, 1} (s) \to Z_{4}^{1} (0)$ as N→∞. Since the propensities of the 8th, 9th, and 17th reaction terms in (24) are of order N^γ−2=N⁻¹ for γ=1 and the species number of S₆ is of order 1, these reaction terms go to zero as N→∞. In the 10th and 15th reaction terms in (24), $Z_{6}^{N, 1} (s)$ , $Z_{7}^{N, 1} (s)$ , and $Z_{8}^{N, 1} (s)$ are asymptotically O(1) and converge to $Z_{6}^{1} (s)$ , $Z_{7}^{1} (s)$ , and $Z_{8}^{1} (s)$ as N→∞ since γ₆=γ₇=γ₈=1.

Now, consider the asymptotic behavior of the 7th reaction term in (24) when γ=1. Since γ₃<1, $Z_{3}^{N, 1} (t)$ fluctuates very much, and there exists no functional limit as N→∞. However, $\int_{0}^{t} Z_{3}^{N, 1} (s) ds$ still converges, which gives the averaged behavior of the normalized species number of S₃. To get the limit of $\int_{0}^{t} Z_{3}^{N, 1} (s) ds$ , we plug the second set of scaling exponents in the equation for $Z_{3}^{N, γ}$ and obtain

\begin{array}{l} Z_{3}^{N, 1} (t) = Z_{3}^{N, 1} (0) + R_{2}^{t} (N κ_{2} Z_{2}^{N, 1}) - R_{3}^{t} (N κ_{3} Z_{3}^{N, 1}) \\ - R_{5}^{t} (κ_{5} Z_{3}^{N, 1}) - R_{6}^{t} (κ_{6} Z_{3}^{N, 1}) - R_{7}^{t} (κ_{7} Z_{3}^{N, 1}) . \end{array}

(26)

The law of large numbers of Poisson processes gives an asymptotic limit of the scaled reaction terms as

lim_{N \to \infty} sup_{u \leq u_{0}} |\frac{Y_{k} (N^{α_{i}} u)}{N^{α_{i}}} - u| = 0, u_{0} > 0

(27)

where the Y_k’s are unit Poisson processes and α_i>0. For example, the 2nd reaction term in (26) divided by N is approximated as

\frac{R_{2}^{t} (N κ_{2} Z_{2}^{N, 1})}{N} = \frac{Y_{2} (\int_{0}^{t} N κ_{2} Z_{2}^{N, 1} (s) ds)}{N} \approx \int_{0}^{t} κ_{2} Z_{2}^{N, 1} (s) ds.

Dividing (26) by N and using the law of large numbers for Poisson processes, we get

\int_{0}^{t} (κ_{2} Z_{2}^{N, 1} (s) - κ_{3} Z_{3}^{N, 1} (s)) ds \to 0,

(28)

as N→∞.

We introduce an auxiliary variable to make the limiting model closed and define

Z_{23}^{N, γ} (t) \equiv Z_{2}^{N, γ} (t) + Z_{3}^{N, γ} (t) .

Plugging α₂=α₃=0 and ρ_k’s in the second scaling in the equation for $Z_{23}^{N, γ}$ , we get

\begin{array}{l} Z_{23}^{N, γ} (t) = Z_{23}^{N, γ} (0) + R_{4}^{t} (N^{γ - 1} κ_{4} Z_{1}^{N, γ}) + R_{8}^{t} (N^{γ - 2} κ_{8} Z_{7}^{N, γ}) \\ - R_{9}^{t} (N^{γ - 2} κ_{9} Z_{2}^{N, γ} Z_{6}^{N, γ}) . \end{array}

(29)

Since $Z_{23}^{N, γ_{23}} (t) \sim O (1)$ where γ₂₃ denotes a natural time scale exponent of S₂₃, we compare the scaling exponents of N in the reaction terms in (29) and the scaling exponent of N outside the reaction terms. Then γ₂₃satisfies

max (γ_{23} - 1, γ_{23} - 2) = 0 = max (α_{2}, α_{3}),

and we get γ₂₃=1. Since these reaction terms have N^γ−2=N⁻¹ in their propensities when γ=1, which is smaller than the species number for S₂₃ of order 1, these reaction terms converge to zero as N→∞. Using $Z_{1}^{N, 1} (s) \to Z_{1}^{1} (0)$ , the limit of $Z_{23}^{N, 1}$ satisfies

Z_{23}^{1} (t) = Z_{23}^{1} (0) + R_{4}^{t} (κ_{4} Z_{1}^{1} (0)) .

Adding and subtracting terms in (28) and dividing the equation by −(κ₂ + κ₃), we get

\int_{0}^{t} (Z_{3}^{N, 1} (s) - \frac{κ_{2}}{κ_{2} + κ_{3}} Z_{23}^{N, 1} (s)) ds \to 0,

as N→∞, and this is used to obtain the limit of the 7th reaction term in (24). Letting N→∞, the limiting equation for $Z_{6}^{N, 1}$ is given as

\begin{array}{l} Z_{6}^{1} (t) = Z_{6}^{1} (0) + R_{7}^{t} (\frac{κ_{2} κ_{7}}{κ_{2} + κ_{3}} Z_{23}^{1}) + R_{12}^{t} (κ_{12} Z_{9}^{1} (0)) \\ + R_{15}^{t} (κ_{15} Z_{4}^{1} (0) Z_{7}^{1}) - R_{10}^{t} (κ_{10} Z_{6}^{1} Z_{8}^{1}) . \end{array}

(30)

In (30), note that $R_{12}^{t} (κ_{12} Z_{9}^{1} (0)) = 0$ since X₉(0)=0 as given in Table 1. Limiting equations for $Z_{7}^{N, 1}$ and $Z_{8}^{N, 1}$ can be derived similarly, and a limiting model with $Z_{23}^{1}$ , $Z_{6}^{1}$ , $Z_{7}^{1}$ , and $Z_{8}^{1}$ for γ=1 is given in (4).

Last, consider a model with the third scaling exponents with α₁=0 and α₂=α₃=1. To derive a limiting equation for $Z_{23}^{N, 2}$ , we plug ρ_k’s and α₂=α₃=1 for the third scaling in the equation for $Z_{23}^{N, γ}$ and get

\begin{array}{l} Z_{23}^{N, 2} (t) = Z_{23}^{N, 2} (0) + N^{- 1} [R_{4}^{t} (N κ_{4} Z_{1}^{N, 2}) + R_{8}^{t} (κ_{8} Z_{7}^{N, 2})) \\ - (R_{9}^{t} (N κ_{9} Z_{2}^{N, 2} Z_{6}^{N, 2})] . \end{array}

(31)

In (31), the 8th reaction term is asymptotically zero, since the term is of order N⁻¹. Using the law of large numbers for Poisson processes in (27), the 4th and the 9th terms in (31) are asymptotically equal to

\int_{0}^{t} (κ_{4} Z_{1}^{N, 2} (s) - κ_{9} Z_{2}^{N, 2} (s) Z_{6}^{N, 2} (s)) ds.

(32)

Since γ₁=2, $Z_{1}^{N, 2} (s) \to Z_{1}^{2} (s)$ as N→∞. On the other hand, since γ₂,γ₆<2, both $Z_{2}^{N, 2} (s)$ and $Z_{6}^{N, 2} (s)$ in (32) fluctuate rapidly and we must identify the averaged limit. $Z_{3}^{N, 2}$ is also averaged, since γ₃<2. We actually show convergence of the fast-fluctuating species numbers of S₂ and S₃ to some limits in the Additional file 1: Section 5.1. For any ε>0 and for any t such that $ε < t \leq τ_{\infty}^{2}$ ,

Z_{2}^{N, 2} (t) \to {\bar{Z}}_{2}^{2} (t),

(33)

Z_{3}^{N, 2} (t) \to {\bar{Z}}_{3}^{2} (t),

(34)

uniformly as N→∞.

On the other hand, since γ₆<2, $\int_{0}^{t} Z_{6}^{N, 2} (s) ds$ converges to a limit which gives averaged behavior of the normalized species number of S₆. Using the equation for $Z_{6}^{N, γ}$ , we get

\begin{array}{l} Z_{6}^{N, 2} (t) = Z_{6}^{N, 2} (0) + R_{7}^{t} (N^{2} κ_{7} Z_{3}^{N, 2}) + R_{8}^{t} (κ_{8} Z_{7}^{N, 2}) \\ + R_{12}^{t} (N^{2} κ_{12} Z_{9}^{N, 2}) + R_{15}^{t} (N κ_{15} Z_{4}^{N, 2} Z_{7}^{N, 2}) \\ - R_{9}^{t} (N κ_{9} Z_{2}^{N, 2} Z_{6}^{N, 2}) - R_{10}^{t} (N^{2} κ_{10} Z_{6}^{N, 2} Z_{8}^{N, 2}) \\ - R_{17}^{t} (κ_{17} Z_{6}^{N, 2}) . \end{array}

(35)

Dividing (35) by N², using the law of large numbers for Poisson processes in (27), and using the stochastic boundedness of the propensities of the 8th, 9th, 15th, and 17th reaction terms in the finite time interval shown in the Additional file 1: Section 5.1, we get

\int_{0}^{t} (κ_{7} Z_{3}^{N, 2} (s) + κ_{12} Z_{9}^{N, 2} (s) - κ_{10} Z_{6}^{N, 2} (s) Z_{8}^{N, 2} (s)) ds \to 0,

(36)

as N→∞. Therefore, a difference between the 10th and 12th reaction terms is approximated in terms of

\int_{0}^{t} κ_{7} Z_{3}^{N, 2} (s) ds,

(37)

which converges to $\int_{0}^{t} κ_{7} {\bar{Z}}_{3}^{2} (s) ds$ from (34). Therefore, we get

\int_{0}^{t} (κ_{10} Z_{6}^{N, 2} (s) Z_{8}^{N, 2} (s) - κ_{12} Z_{9}^{N, 2} (s)) ds \to \int_{0}^{t} κ_{7} {\bar{Z}}_{3}^{2} (s) ds,

(38)

as N→∞.

Now, from the equations for $Z_{8}^{N, γ}$ and $Z_{9}^{N, γ}$ , we get

\begin{array}{l} Z_{8}^{N, 2} (t) = Z_{8}^{N, 2} (0) + N^{- 2} [R_{1}^{t} (N^{2} κ_{1}) + R_{12}^{t} (N^{2} κ_{12} Z_{9}^{N, 2})) \\ - (R_{10}^{t} (N^{2} κ_{10} Z_{6}^{N, 2} Z_{8}^{N, 2}) - R_{11}^{t} (N^{2} κ_{11} Z_{8}^{N, 2})], \end{array}

(39a)

\begin{array}{l} Z_{9}^{N, 2} (t) = Z_{9}^{N, 2} (0) + N^{- 2} [R_{10}^{t} (N^{2} κ_{10} Z_{6}^{N, 2} Z_{8}^{N, 2})) \\ - (R_{12}^{t} (N^{2} κ_{12} Z_{9}^{N, 2})] . \end{array}

(39b)

Using the law of large numbers of Poisson processes in (27), the reaction terms in (39a) and (39b) are asymptotically equal to

\begin{array}{l} N^{- 2} [R_{1}^{t} (N^{2} κ_{1}) + R_{12}^{t} (N^{2} κ_{12} Z_{9}^{N, 2}) - R_{10}^{t} (N^{2} κ_{10} Z_{6}^{N, 2} Z_{8}^{N, 2})) \\ - (R_{11}^{t} (N^{2} κ_{11} Z_{8}^{N, 2})] \\ \approx \int_{0}^{t} (κ_{1} + κ_{12} Z_{9}^{N, 2} (s) - κ_{10} Z_{6}^{N, 2} (s) Z_{8}^{N, 2} (s)) \\ (- κ_{11} Z_{8}^{N, 2} (s)) ds, \end{array}

(40a)

\begin{array}{l} N^{- 2} [R_{10}^{t} (N^{2} κ_{10} Z_{6}^{N, 2} Z_{8}^{N, 2}) - R_{12}^{t} (N^{2} κ_{12} Z_{9}^{N, 2})] \\ \approx \int_{0}^{t} (κ_{10} Z_{6}^{N, 2} (s) Z_{8}^{N, 2} (s) - κ_{12} Z_{9}^{N, 2} (s)) ds. \end{array}

(40b)

Using (40a), (40b), and (38), the limiting equations of (39a) and (39b) are given as

\begin{align} Z_{8}^{2} (t) & = Z_{8}^{2} (0) + \int_{0}^{t} (κ_{1} - κ_{7} {\bar{Z}}_{3}^{2} (s) - κ_{11} Z_{8}^{2} (s)) ds, \\ Z_{9}^{2} (t) & = Z_{9}^{2} (0) + \int_{0}^{t} κ_{7} {\bar{Z}}_{3}^{2} (s) ds. \end{align}

(41)

In (41), note that $Z_{9}^{2} (0) = 0$ since X₉(0)=0 as given in Table 1.

Since $Z_{8}^{N, 2} (0) > 0$ and balance conditions are satisfied, $Z_{8}^{N, 2} (t) \neq 0$ in the finite time interval. Since γ₈=2,

\frac{1}{κ_{10} Z_{8}^{N, 2} (t)} \to \frac{1}{κ_{10} Z_{8}^{2} (t)} .

(42)

Using (38) and (42), $\int_{0}^{t} Z_{6}^{N, 2} (s) ds$ is averaged as

\int_{0}^{t} Z_{6}^{N, 2} (s) ds \to \int_{0}^{t} \frac{κ_{7} {\bar{Z}}_{3}^{2} (s) + κ_{12} Z_{9}^{2} (s)}{κ_{10} Z_{8}^{2} (s)} ds.

(43)

From (33) and (43), we get

\begin{array}{l} \int_{0}^{t} κ_{9} Z_{2}^{N, 2} (s) Z_{6}^{N, 2} (s) ds \to \int_{0}^{t} κ_{9} {\bar{Z}}_{2}^{2} (s) \\ \times (\frac{κ_{7} {\bar{Z}}_{3}^{2} (s) + κ_{12} Z_{9}^{2} (s)}{κ_{10} Z_{8}^{2} (s)}) ds. \end{array}

(44)

For more details used in (43) and (44), see Lemma 1.5 and Theorem 2.1 in [15]. Finally, we get the limiting equation of $Z_{23}^{N, 2}$ as

\begin{array}{l} Z_{23}^{2} (t) = Z_{23}^{2} (0) + \int_{0}^{t} [κ_{4} Z_{1}^{2} (s) - κ_{9} {\bar{Z}}_{2}^{2} (s)) \\ \times ((\frac{κ_{7} {\bar{Z}}_{3}^{2} (s) + κ_{12} Z_{9}^{2} (s)}{κ_{10} Z_{8}^{2} (s)})] ds. \end{array}

Theorem 1

For γ=0, ${Z_{2}^{N, 0}, Z_{3}^{N, 0}, Z_{8}^{N, 0}}$ converges to the solution of (3) for $t \in [0, τ_{\infty}^{0})$ . For γ=1, ${Z_{23}^{N, 1}, Z_{6}^{N, 1}, Z_{7}^{N, 1}, Z_{8}^{N, 1}}$ converges to the solution of (4) for $t \in [0, τ_{\infty}^{1})$ . In (3), $Z_{8}^{0}$ is a discrete process, while $Z_{8}^{1}$ is a deterministic process in (4). For γ=2, ${Z_{1}^{N, 2}, Z_{23}^{N, 2}, Z_{4}^{N, 2}, Z_{5}^{N, 2}, Z_{8}^{N, 2}, Z_{9}^{N, 2}}$ converges to the solution of (5) for $t \in [0, τ_{\infty}^{2})$ .

Conditional equilibrium distributions

In the previous section, we derived limiting models in three different time scales. Except for the subset of species in the limiting model, the remaining species are approximated as constants in the first time scale, since their natural time scale exponents (γ_i) are larger than γ=0, i.e., species with γ_i>γ=0 did not start to fluctuate at these times yet. In the second and third time scales, there are subsets of species whose natural time scale exponents are smaller than γ=1 and 2, respectively. Normalized species numbers with γ_i<γfluctuate very rapidly at these times and their averaged behavior is approximated in terms of other variables which converge to a nondegenerate limit. For those species, the normalized species numbers do not converge to a limit in a functional sense, but still we can find a limit in a probabilistic sense (i.e. convergence in distribution) and their distribution. Conditioned on the normalized species numbers which converge to a nondegenerate limit in the time scale of interest, we can find the conditional equilibrium (or the local averaging) distributions of species numbers whose natural time scale exponents are smaller than the time scale exponents of interests. Conditioning on the normalized species numbers which converge to a nondegenerate limit is similar to fixing slowly-moving variables and describing behavior of the fast-fluctuating variables in terms of slowly-moving variables treating them as constants. In the next remark, we give a conditional equilibrium distribution of the subset of species with natural time scale exponents smaller than γ=1 and γ=2.

Remark 2

For γ=1, for each t>0, $(Z_{2}^{N, 1} (t), Z_{3}^{N, 1} (t))$ converges in distribution to $({\hat{Z}}_{2}^{1} (t), {\hat{Z}}_{3}^{1} (t))$ such that $({\hat{Z}}_{2}^{1} (t), {\hat{Z}}_{3}^{1} (t))$ conditioned on $Z_{23}^{1} (t)$ has a binomial distribution with parameter

\frac{κ_{3}}{κ_{2} + κ_{3}}, \frac{κ_{2}}{κ_{2} + κ_{3}},

respectively, that is,

\begin{array}{l} P \{{\hat{Z}}_{2}^{1} (t) = z_{2}, {\hat{Z}}_{3}^{1} (t) = m - z_{2} | Z_{23}^{1} (t) = m\} \\ = C (m, z_{2}) {(\frac{κ_{3}}{κ_{2} + κ_{3}})}^{z_{2}} {(\frac{κ_{2}}{κ_{2} + κ_{3}})}^{m - z_{2}} . \end{array}

For γ=2, for each t>0, $(Z_{6}^{N, 2} (t), Z_{7}^{N, 2} (t))$ converges in distribution to $({\hat{Z}}_{6}^{2} (t), {\hat{Z}}_{7}^{2} (t))$ where ${\hat{Z}}_{6}^{2} (t)$ and ${\hat{Z}}_{7}^{2} (t)$ are independent Poisson distributed random variables with parameters

\frac{\frac{κ_{2} κ_{7}}{κ_{2} + κ_{3}} Z_{23}^{2} (t) + κ_{12} Z_{9}^{2} (t)}{κ_{10} Z_{8}^{2} (t)},

and

\frac{\frac{κ_{3} κ_{9}}{κ_{2} + κ_{3}} Z_{23}^{2} (t)}{κ_{15} Z_{4}^{2} (t)} \cdot \frac{\frac{κ_{2} κ_{7}}{κ_{2} + κ_{3}} Z_{23}^{2} (t) + κ_{12} Z_{9}^{2} (t)}{κ_{10} Z_{8}^{2} (t)} .

The detailed method to compute conditional equilibrium distributions is given in Section 6 in [[9]].

Mean value of the random variable with a binomial distribution, B(n,p), is equal to np. Therefore, for γ=1, we treat $Z_{23}^{1} (t)$ as constant and get a limit of the averaged values for $Z_{2}^{N, 1} (t)$ and $Z_{3}^{N, 1} (t)$ as

\begin{align} {\bar{Z}}_{2}^{1} (t) & = Z_{23}^{1} (t) \times \frac{κ_{3}}{κ_{2} + κ_{3}}, \\ {\bar{Z}}_{3}^{1} (t) & = Z_{23}^{1} (t) \times \frac{κ_{2}}{κ_{2} + κ_{3}} . \end{align}

Mean value of the random variable with a Poisson distribution, Pois(λ), is equal to λ, and we obtain a limit of the averaged values for $Z_{6}^{N, 2} (t)$ and $Z_{7}^{N, 2} (t)$ as the parameters given in Remark 2.

Simulation results

Recall that the normalized species numbers after a time change are defined as

Z_{i}^{N, γ} (t) = N^{- α_{i}} X_{i}^{N} (t N^{γ}) .

Using the limiting models in the three time scales given in (3)-(5), we approximate the species numbers in the full model by unnormalizing the species numbers and applying time change backward as

\begin{array}{l} X_{i} (t) = X_{i}^{N_{0}} (t) \approx lim_{N \to \infty} {(\frac{N_{0}}{N})}^{α_{i}} X_{i}^{N} (t {(\frac{N}{N_{0}})}^{γ}) \\ = N_{0}^{α_{i}} Z_{i}^{γ} (t N_{0}^{- γ}), \end{array}

using a real value N₀=100 for the parameter. In Figures 2, 3, 4 and Figure 5(a)-(d), the panels located in the left column give mean and standard deviation from the mean of stochastic simulation for X_i(t) and the panels located in the right column give mean and standard deviation from the mean of simulation for $N_{0}^{α_{i}} Z_{i}^{γ} (t N_{0}^{- γ})$ using the limiting models. The mean and standard deviation of species numbers are computed from 3000 realizations of the sample path of the stochastic simulation.

**Simulation results when** γ **= 0.** Simulation of the full model (left) and that of approximation using the limiting model (right) when the time is of order $N_{0}^{0}$ (=1).

**Simulation results when** γ **= 1.** Simulation of the full model (left) and that of approximation using the limiting model (right) when the time is of order N₀(=100).

**Simulation results when** γ **= 2.** Simulation of the full model (left) and that of approximation using the limiting model (right) when the time is of order $N_{0}^{2}$ (=10000).

**Simulation results when** γ **= 2 (continued).** Simulation of the full model (left) and that of approximation using the limiting model (right) when the time is of order $N_{0}^{2}$ . Figures (e), (f), (g), and (h) are simulation results for species 6 and 7. The graphs (f) and (h) give approximation of the averaged species numbers of S₆and S₇.

In Figure 2, we compare the simulation for the full model and for the approximation using the limiting model in the first scaling. The first scaling (γ=0) is for the times of order $N_{0}^{0} = 1 sec$ , and we look at the evolution of mean and standard deviation of the species numbers up to 100 sec. The full model and the limiting model for γ=0 are stochastic, and the limiting model approximates the evolution of statistics of the species numbers quite precisely. As shown in Figure 2(f) $Z_{8}^{0} (t)$ overestimates X₈(t), since the limiting model does not include reactions consuming S₈. Therefore, consumption of S₈may not be captured well in the approximation.

In Figure 3, we compare the simulation for the full model and for the approximation using the limiting model in the second scaling. Since the second scaling (γ=1) is for the times of order $N_{0}^{1} = 100 sec$ , we observe the evolution of the species numbers up to 1000 sec. In this time scale, the evolution of S₈shown in Figure 3(h) is approximated by a deterministic variable. The evolution of the species number of S₈ in the full model given in Figure 3(g) is stochastic, but its standard deviation is very small. As in the previous time scale, $N_{0} Z_{8}^{1} (t N_{0}^{- 1})$ slightly overestimates X₈(t), since the limiting model does not include any consumptions of S₈. The remaining three species, S₂₃, S₆, and S₇are approximated by stochastic variables. The increasing species number of S₂₃ in time and the rapid decrease in species number of S₆are well captured by the limiting model. The species numbers of S₇are described by stochastic variables both in the full model and in the limiting model. The behavior of S₇in two models is not exactly the same, and discrepancy of the mean species numbers of S₇ comes from the approximation of X₄(t) in terms of its initial value. In the limiting model, S₇is approximated as a stochastic process decreasing by 1 with the propensity proportional to $Z_{4}^{1} (0)$ . However, X₄(t) increases during the times in [0,1000] sec in the full model, and this difference gives slower decreasing rate of the mean number of S₇ in the limiting model than that in the full model.

In Figure 4 and Figure 5(a)-(d), we compare the simulation for the full model and for the approximation obtained from the limiting model in the third scaling. Since the third scaling (γ=2) is for the times of order $N_{0}^{2} = 10, 000 sec$ , we look at the simulation up to 20,000 sec. In this time scale, the limiting model is stochastic. The species number of S₁ in the limiting model is approximated by a stochastic discrete variable increasing and decreasing by 1, and the remaining species numbers in the limiting model satisfy stochastic equations driven by the stochastic discrete variable $Z_{1}^{2}$ . As we have seen in the proof of Theorem 1 in the Additional file 1: Section 5.1, the processes for S₁ in the full model and in the limiting model are exactly the same. Therefore, we use a same series of random numbers, when we simulate the full and limiting models. In Figure 4(b), the process for S₁ is random, but its standard deviation is very small. Therefore, in one realization of simulation of the limiting model, behavior of S₁appears as constant. Since all the remaining variables in the limiting model are governed by the variable for S₁ and they satisfy the stochastic differential equations, evolution of one sample path of the species numbers for S₂₃, S₄, S₅, S₈, and S₉ in the limiting model looks like a solution of the system of ordinary differential equations.

In Figure 5, (e)-(h) are the species numbers for S₆and S₇in the full model and their averaged values in the limiting model. Note that the species numbers for S₆ and S₇ do not appear in the limiting model, since their values are approximated in terms of other species numbers. Therefore, the difference between mean species numbers for S₆and S₇in the full model and those in the approximation does not affect the error directly. For γ=2, $Z_{6}^{N, 2}$ and $Z_{7}^{N, 2}$ are asymptotically averaged out by the variables in the limiting model as given in Remark 2. Since the averaged value for S₆ plays an important role in the evolution of $Z_{23}^{2}$ in the limiting model and since the averaged value for S₇ gives the conditional mean value for S₇ in the limiting model, we compare the species numbers of S₆and S₇ in the full model and the approximated averaged values in the limiting model. In Figure 5(f) and (h), we plot the mean and standard deviation from the mean for

\begin{array}{l} {\bar{Z}}_{6}^{2} (t N_{0}^{- 2}) = \frac{\frac{κ_{2} κ_{7}}{κ_{2} + κ_{3}} Z_{23}^{2} (t N_{0}^{- 2}) + κ_{12} Z_{9}^{2} (t N_{0}^{- 2})}{κ_{10} Z_{8}^{2} (t N_{0}^{- 2})}, \\ {\bar{Z}}_{7}^{2} (t N_{0}^{- 2}) = \frac{κ_{9} (\frac{κ_{3}}{κ_{2} + κ_{3}} Z_{23}^{2} (t N_{0}^{- 2})) (\frac{\frac{κ_{2} κ_{7}}{κ_{2} + κ_{3}} Z_{23}^{2} (t N_{0}^{- 2}) + κ_{12} Z_{9}^{2} (t N_{0}^{- 2})}{κ_{10} Z_{8}^{2} (t N_{0}^{- 2})})}{κ_{15} Z_{4}^{2} (t N_{0}^{- 2})}, \end{array}

in time. They are stochastic variables determined by the ones in the limiting model with very small fluctuations. Since ${\bar{Z}}_{6}^{2} (t N_{0}^{- 2})$ and ${\bar{Z}}_{7}^{2} (t N_{0}^{- 2})$ describe averaged behavior of S₆and S₇, X₆(t) and X₇(t) in Figure 5(e) and (g) have more fluctuations than the averaged species numbers in Figure 5(f) and (h).

In Figure 5(e)-(h) there is a discrepancy between the species numbers and their averaged values in the very early time, and the discrepancy comes from a disagreement in initial values of the species numbers in the full model and those of the averaged values in the limiting model. The integrated species numbers for S₆ and S₇ up to times of order 10,000 are supposed to be approximated by the integrated averaged values over the time interval, and the initial difference is due to a boundary layer phenomenon.

Error estimates

In the previous sections, we scaled species numbers and derived their limit to approximate temporal behavior of the species numbers in the full network. Among three limiting models given in (3)-(5), the first two are systems with discrete variables (except for $Z_{8}^{1}$ ) which change by integer values. On the other hand, the last one is a hybrid system with both discrete and continuous variables. A discrete variable $Z_{1}^{2}$ increases or decreases by one and stochasticity of all other variables comes from how much $Z_{1}^{2}$ fluctuates. Since $Z_{1}^{2}$ rarely changes at the times of our interest, the rest of the variables in (5) behaves such as a solution of systems of ordinary differential equations. Our choice of the scaling parameter value, N₀=100, is not very large and it is possible that the limiting model does not contain enough fluctuations as much as the full network actually has due to our assumption that N₀is replaced by a large parameter N.

In this section, we estimate an error between the normalized species numbers and their limit given in (5) at the times of 10,000 sec. Define

\begin{array}{l} U^{N} (t) = N^{1 / 2} (Z_{23}^{N, 2} (t) - Z_{23}^{2} (t), Z_{4}^{N, 2} (t) - Z_{4}^{2} (t), Z_{5}^{N, 2} (t)) \\ - {(Z_{5}^{2} (t), Z_{8}^{N, 2} (t) - Z_{8}^{2} (t), Z_{9}^{N, 2} (t) - Z_{9}^{2} (t))}^{T}, \end{array}

and denote U(t)=(U₂₃(t),U₄(t),U₅(t),U₈(t),U₉(t))^T as a limit of U^N(t) as N goes to infinity. Note that we do not consider an error between $Z_{1}^{N, 2} (t)$ and $Z_{1}^{2} (t)$ , since they are exactly the same processes. In the next remark, we show that U^N(t) converges to U(t) in the probabilistic sense and thus the error between $Z_{i}^{N, 2} (t)$ and $Z_{i}^{2} (t)$ is approximately of order $N_{0}^{- 1 / 2} = 0.1$ . Since U(t) gives an explicit form of the error, we have better approximation of X_i(t) for γ=2 as

X_{i} (t) \approx N_{0}^{α_{i}} (Z_{i}^{2} (t N_{0}^{- 2}) + N_{0}^{- 1 / 2} U_{i} (t N_{0}^{- 2})),

for S₂₃, S₄, S₅, S₈, and S₉.

Remark 3

For γ=2, for each t>0, U^N(t) converges in distribution to U(t) which is a solution of

\begin{array}{l} U (t) = U (0) + \int_{0}^{t} {(1, 0, 0, 0, 0)}^{T} \sqrt{κ_{4} Z_{1}^{2} (s) + κ_{9} {\bar{Z}}_{2}^{2} (s) {\bar{Z}}_{6}^{2} (s)} dW (s) \\ + \int_{0}^{t} [\begin{array}{l} C_{23} (s) U_{23} (s) + C_{8} (s) U_{8} (s) + C_{9} (s) U_{9} (s) \\ \frac{κ_{2} κ_{6}}{κ_{2} + κ_{3}} U_{23} (s) - κ_{18} U_{4} (s) \\ \frac{κ_{2} κ_{5}}{κ_{2} + κ_{3}} U_{23} (s) - κ_{16} U_{5} (s) \\ - \frac{κ_{2} κ_{7}}{κ_{2} + κ_{3}} U_{23} (s) - κ_{11} U_{8} (s) \\ \frac{κ_{2} κ_{7}}{κ_{2} + κ_{3}} U_{23} (s) \end{array}] ds, \end{array}

where W(t) is a standard Brownian motion and

\begin{align} C_{23} (s) & = - \frac{κ_{9}}{κ_{2} + κ_{3}} (κ_{3} {\bar{Z}}_{6}^{2} (s) + \frac{κ_{2} κ_{7}}{κ_{10}} \cdot \frac{{\bar{Z}}_{2}^{2} (s)}{Z_{8}^{2} (s)}), \\ C_{8} (s) & = κ_{9} \frac{{\bar{Z}}_{2}^{2} (s) {\bar{Z}}_{6}^{2} (s)}{Z_{8}^{2} (s)}, \\ C_{9} (s) & = - \frac{κ_{9} κ_{12}}{κ_{10}} \cdot \frac{{\bar{Z}}_{2}^{2} (s)}{Z_{8}^{2} (s)} . \end{align}

The detailed method to compute an error using the central limit theorem is derived in [14].

Estimating order of magnitude of an error is an analogue of that in van Kampen’s system size expansion [16]. A difference is that in the system size expansion, the system state representing the species numbers is scaled by the system size Ω and noise between the scaled process and its deterministic value is approximated as a random variable of order Ω^−1/2. In our approach N is not a system size but a parameter for scaling, and species numbers are scaled by powers of N. Though the limiting model for γ=2 is not deterministic, it is still possible to estimate an error analytically due to the fact that $Z_{1}^{2} (t)$ which produces stochasticity in the limiting model is an exact process equal to $Z_{1}^{N, 2} (t)$ . Another difference between our approach and van Kampen’s system size expansion is that a subset of species numbers is averaged in terms of other species numbers which appear in the limiting model for γ=2 due to the various scales involved.

Our estimates of the error is also different from diffusion approximations. In the diffusion approximations, the reaction terms centered by their propensities in the stochastic equations for discrete variables of species numbers are approximated in terms of time-changed Brownian motion. On the other hand, the noise term in the error estimates is determined by both the centered reaction terms in the equations for discrete variables and a difference between the discrete variables for the normalized species number and their continuous limit.

To find the asymptotic order of magnitude of $Z_{i}^{N, 2} (t) - Z_{i}^{2} (t)$ , we show convergence of $r_{N} (Z_{i}^{N, 2} (t) - Z_{i}^{2} (t))$ to a nonzero finite limit for some r_N. Among the species S₂₃, S₄, S₅, S₈, and S₉, the species number of S₂₃is scaled with the smallest exponent, and thus noise in the limit of $r_{N} (Z_{i}^{N, 2} (t) - Z_{i}^{2} (t))$ is determined dominantly by the component $r_{N} (Z_{23}^{N, 2} (t) - Z_{23}^{2} (t))$ . Since $Z_{23}^{N, 2} (t)$ is the species number scaled by N, we expect that r_N=N^1/2and the error between the scaled species numbers and their limit is of order $N_{0}^{- 1 / 2}$ . For a detailed approach to derive r_Nand U(t), see more about the central limit theorem in [14]. The fact that all components but the first one in the diffusion term in the equation for U(t) are zero supports the idea that noise is dominantly determined by the error between $Z_{23}^{N, 2} (t)$ and $Z_{23}^{2} (t)$ . A sketch of the proof of Remark 3 is given in the Additional file 1: Section 6.

Conclusions

We considered a stochastic model for a well-stirred biochemical network with small numbers of molecules for some species. As the biochemical network consists of more species and reactions, network topology becomes more complex and it is harder to analyze. Therefore, how to reduce the biochemical network while preserving its important biochemical features is a very important issue.

In this paper, we applied the multiscale approximation method introduced by Ball et al. [8] and extended by Kang and Kurtz [9] to a heat shock response model of E. coli developed by Srivastava et al. [11]. Using the fact that the species numbers and the reaction rate constants in the model vary over several orders of magnitude, we scaled them using a scaling parameter with different exponents both of which contribute to determining the time scales of species. We derived balance conditions for each species and for a subset of linear combinations of species explicitly in this model, and chose appropriate values for the scaling exponents satisfying the balance conditions. Assuming that initial values of the species numbers are positive, satisfying the balance conditions is required to get a nondegenerate limiting model. We assumed that the reaction rate constants do not change in time, while we may use several sets of scaling exponents for the species numbers due to rapid changes in some species numbers in time. In this analysis, we chose three sets of scaling exponents, and they are used to derive limiting models in different time scales.

In each time scale we derived a limiting model, and used it to approximate the species numbers in the full network. In the limiting model, species numbers whose scaling exponents are larger than those of all rates of reactions involving the species are treated as constants, since changes of the species numbers due to the reactions are not noticeable at these times. When the scaling exponent of the species number is smaller than the scaling exponents of the rates of some productions and consumptions of the species and in case the scaling exponents for both kinds of reactions are equal, the scaled species number is averaged out and is approximated in terms of other variables. Therefore, the limiting model includes a subset of species and reactions and network topology in it becomes simpler. We derived the conditional equilibrium distributions of the fast-fluctuating species numbers and studied errors between the scaled species numbers and their limits in the third time scale.

Using the limiting models, we approximated the temporal evolution of species numbers in three time scales. By comparing stochastic simulation of the full model and approximations using the limiting models, we see that the main features of evolution of species numbers are well captured by the limiting models.

Competing interests

The author(s) declare that they have no competing interests.

Authors’ contributions

Based on the model of heat shock response of E. coli developed in [11], the author applied the multiscale approximation method introduced in [9] to the model. The author derived limiting models, showed convergence of the scaled species numbers to their limits, and estimated errors analytically. The author simulated the full network model and approximate processes using the limiting models and compared the results.

Supplementary Material

Additional file 1

Supplementary material for “A multiscale approximation in a heat shock response model of E. coli.” This is a supplementary material of the paper including calculations and tables.

Click here for file^{(201KB, pdf)}

Acknowledgements

The author would like to greatly thank Thomas G. Kurtz for his continuous support and many helpful discussion. This work is an extension of the author’s Ph.D work at the University of Wisconsin, is proceeded while the author held a postdoctoral appointment under Hans G. Othmer at the University of Minnesota, and is completed while the author held a postdoctoral appointment in the Mathematical Biosciences Institute at the Ohio State University. The support provided by three appointments is acknowledged. This research has been supported in part by the National Science Foundation under grant DMS 05-53687, 08-05793, and 09-31642 and the Mathematical Biosciences Institute.

References

Kærn M, Elston T, Blakem W, Collins J. Stochasticity in gene expression: from theories to phenotypes. Nat Rev Genet. 2005;6(6):451–464. doi: 10.1038/nrg1615. [DOI] [PubMed] [Google Scholar]
Gillespie D. A general method for numerically simulating the stochastic time evolution of coupled chemical reactions. J Comput Phys. 1976;22(4):403–434. [Google Scholar]
Gillespie D. Exact stochastic simulation of coupled chemical reactions. J Phys Chem. 1977;81(25):2340–2361. [Google Scholar]
Rao C, Arkin A. Stochastic chemical kinetics and the quasi-steady-state assumption: application to the Gillespie algorithm. J Chem Phys. 2003;118(11):4999–5010. [Google Scholar]
Haseltine E, Rawlings J. Approximate simulation of coupled fast and slow reactions for stochastic chemical kinetics. J Chemi Phys. 2002;117(15):6959–6969. [Google Scholar]
Cao Y, Gillespie D, Petzold L. Multiscale stochastic simulation algorithm with stochastic partial equilibrium assumption for chemically reacting systems. J Comput Phys. 2005;206(2):395–411. [Google Scholar]
Pahle J. Biochemical simulations: stochastic, approximate stochastic and hybrid approaches. Briefings Bioinf. 2009;10(1):53–64. doi: 10.1093/bib/bbn050. [DOI] [PMC free article] [PubMed] [Google Scholar]
Ball K, Kurtz T, Popovic L, Rempala G. Asymptotic analysis of multiscale approximations to reaction networks. Ann Appl Probability. 2006;16(4):1925–1961. [Google Scholar]
Kang HW, Kurtz T. Separation of time-scales and model reduction for stochastic reaction networks. 2012. arXiv preprint arXiv:1011.1672, to appear in Annals of Applied Probability.
Crudu A, Debussche A, Radulescu O. Hybrid stochastic simplifications for multiscale gene networks. BMC Syst Biol. 2009;3:89. doi: 10.1186/1752-0509-3-89. [DOI] [PMC free article] [PubMed] [Google Scholar]
Srivastava R, Peterson M, Bentley W. Stochastic kinetic analysis of the Escherichia coli stress circuit using σ32-targeted antisense. Biotechnol Bioeng. 2001;75(1):120–129. doi: 10.1002/bit.1171. [DOI] [PubMed] [Google Scholar]
Takahashi K, Kaizu K, Hu B, Tomita M. A multi-algorithm, multi-timescale method for cell simulation. Bioinformatics. 2004;20(4):538–546. doi: 10.1093/bioinformatics/btg442. [DOI] [PubMed] [Google Scholar]
Weinan E, Vanden-Eijnden E. Nested stochastic simulation algorithm for chemical kinetic systems with disparate rates. J Chem Phys. 2005;123(19):194107. doi: 10.1063/1.2109987. [DOI] [PubMed] [Google Scholar]
Kang HW, Popovic L, Kurtz T. Central limit theorems and diffusion approximations for multiscale Markov chain models. 2012. arXiv preprint arXiv:1208.3783, submitted.
Kurtz T. Averaging for martingale problems and stochastic approximation. Appl Stochastic Anal. 1992. pp. 186–209.
Van Kampen NG. Stochastic processes in physics and chemistry (North-Holland Personal Library) Elsevier; 2007. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Additional file 1

Supplementary material for “A multiscale approximation in a heat shock response model of E. coli.” This is a supplementary material of the paper including calculations and tables.

Click here for file^{(201KB, pdf)}

[B1] Kærn M, Elston T, Blakem W, Collins J. Stochasticity in gene expression: from theories to phenotypes. Nat Rev Genet. 2005;6(6):451–464. doi: 10.1038/nrg1615. [DOI] [PubMed] [Google Scholar]

[B2] Gillespie D. A general method for numerically simulating the stochastic time evolution of coupled chemical reactions. J Comput Phys. 1976;22(4):403–434. [Google Scholar]

[B3] Gillespie D. Exact stochastic simulation of coupled chemical reactions. J Phys Chem. 1977;81(25):2340–2361. [Google Scholar]

[B4] Rao C, Arkin A. Stochastic chemical kinetics and the quasi-steady-state assumption: application to the Gillespie algorithm. J Chem Phys. 2003;118(11):4999–5010. [Google Scholar]

[B5] Haseltine E, Rawlings J. Approximate simulation of coupled fast and slow reactions for stochastic chemical kinetics. J Chemi Phys. 2002;117(15):6959–6969. [Google Scholar]

[B6] Cao Y, Gillespie D, Petzold L. Multiscale stochastic simulation algorithm with stochastic partial equilibrium assumption for chemically reacting systems. J Comput Phys. 2005;206(2):395–411. [Google Scholar]

[B7] Pahle J. Biochemical simulations: stochastic, approximate stochastic and hybrid approaches. Briefings Bioinf. 2009;10(1):53–64. doi: 10.1093/bib/bbn050. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B8] Ball K, Kurtz T, Popovic L, Rempala G. Asymptotic analysis of multiscale approximations to reaction networks. Ann Appl Probability. 2006;16(4):1925–1961. [Google Scholar]

[B9] Kang HW, Kurtz T. Separation of time-scales and model reduction for stochastic reaction networks. 2012. arXiv preprint arXiv:1011.1672, to appear in Annals of Applied Probability.

[B10] Crudu A, Debussche A, Radulescu O. Hybrid stochastic simplifications for multiscale gene networks. BMC Syst Biol. 2009;3:89. doi: 10.1186/1752-0509-3-89. [DOI] [PMC free article] [PubMed] [Google Scholar]

[B11] Srivastava R, Peterson M, Bentley W. Stochastic kinetic analysis of the Escherichia coli stress circuit using σ32-targeted antisense. Biotechnol Bioeng. 2001;75(1):120–129. doi: 10.1002/bit.1171. [DOI] [PubMed] [Google Scholar]

[B12] Takahashi K, Kaizu K, Hu B, Tomita M. A multi-algorithm, multi-timescale method for cell simulation. Bioinformatics. 2004;20(4):538–546. doi: 10.1093/bioinformatics/btg442. [DOI] [PubMed] [Google Scholar]

[B13] Weinan E, Vanden-Eijnden E. Nested stochastic simulation algorithm for chemical kinetic systems with disparate rates. J Chem Phys. 2005;123(19):194107. doi: 10.1063/1.2109987. [DOI] [PubMed] [Google Scholar]

[B14] Kang HW, Popovic L, Kurtz T. Central limit theorems and diffusion approximations for multiscale Markov chain models. 2012. arXiv preprint arXiv:1208.3783, submitted.

[B15] Kurtz T. Averaging for martingale problems and stochastic approximation. Appl Stochastic Anal. 1992. pp. 186–209.

[B16] Van Kampen NG. Stochastic processes in physics and chemistry (North-Holland Personal Library) Elsevier; 2007. [Google Scholar]

PERMALINK

A multiscale approximation in a heat shock response model of E. coli

Hye-Won Kang

Abstract

Background

Results

Conclusions

Background

Figure 1.

Methods

Results and discussion

Model description

Table 1.

Table 2.

Table 3.

Derivation of the scaled models

Balance conditions

Table 4.

Limiting models in three time scales

Theorem 1

Conditional equilibrium distributions

Remark 2

Simulation results

Figure 2.

Figure 3.

Figure 4.

Figure 5.

Error estimates

Remark 3

Conclusions

Competing interests

Authors’ contributions

Supplementary Material

Acknowledgements

References

Associated Data

Supplementary Materials

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases