Skip to main content
Biophysical Journal logoLink to Biophysical Journal
. 2010 Nov 17;99(10):3172–3181. doi: 10.1016/j.bpj.2010.09.057

Correlation Resonance Generated by Coupled Enzymatic Processing

William H Mather †,, Natalie A Cookson †,§, Jeff Hasty †,‡,§, Lev S Tsimring , Ruth J Williams †,¶,
PMCID: PMC2980735  PMID: 21081064

Abstract

A major challenge for systems biology is to deduce the molecular interactions that underlie correlations observed between concentrations of different intracellular molecules. Although direct explanations such as coupled transcription or direct protein-protein interactions are often considered, potential indirect sources of coupling have received much less attention. Here we show how correlations can arise generically from a posttranslational coupling mechanism involving the processing of multiple protein species by a common enzyme. By observing a connection between a stochastic model and a multiclass queue, we obtain a closed form expression for the steady-state distribution of the numbers of molecules of each protein species. Upon deriving explicit analytic expressions for moments and correlations associated with this distribution, we discover a striking phenomenon that we call correlation resonance: for small dilution rate, correlations peak near the balance-point where the total rate of influx of proteins into the system is equal to the maximum processing capacity of the enzyme. Given the limited number of many important catalytic molecules, our results may lead to new insights into the origin of correlated behavior on a global scale.

Introduction

Significant correlations in the concentrations of different intracellular molecules are often observed in the context of a cellular response to an external perturbation such as stress. It is a major challenge in systems biology to deduce the molecular interactions that underlie such correlations. Intracellular interactions are frequently described by the forward propagation of a signal through specific molecular complexes, as found, for example, in MAPK cascades and feed-forward gene networks (1,2).

These forward interactions are often used as the building blocks for systems with bidirectional interactions, such as gene networks with feedback. A general and less obvious bidirectional interaction between components can arise when components share and compete for common machinery. This type of indirect interaction increases the apparent coupling between seemingly disparate components. An example of this type of coupling has recently been observed in a system where two fluorescent reporters were tagged for fast degradation by a common protease (N. A. Cookson, W. H. Mather, T. Danino, O. Mondragón-Palomino, R. J. Williams, L. S. Tsimring, and J. Hasty, unpublished). In this study, it was shown that a rate-limited interaction can lead to strong coupling between the two fluorescent reporters.

As a theoretical complement to the aforementioned experimental study, here we investigate, in detail, a stochastic model for the processing of multiple protein species by a common enzyme. Our stochastic model tracks the dynamics of the numbers of molecules of finitely many protein species subject to processing by a limited number of copies of a common enzyme. By observing a connection between this stochastic model and a multiclass queue, we obtain a closed form expression for the steady-state distribution of the numbers of molecules of each protein species. We derive explicit analytic expressions for moments and correlations of this distribution and discover a striking phenomenon that we call “correlation resonance”: for small dilution rate, correlations between the numbers of molecules for different protein species peak near the balance-point, where the total rate of influx of proteins into the system is equal to the maximum processing capacity of the enzyme. The correlations at the peak become perfect when dilution becomes negligible.

Although there are various interesting studies of correlations in biochemical reaction network models, including some considering correlations in time as well as across species (see, e.g., (4,5)), we believe this is the first work to analyze correlations between different species in a stochastic model where the species are processed by a common but limited number of copies of an enzyme. Indeed, our formulas and qualitative findings concerning correlations appear to be new both for biological systems and queuing theory. Simulation results are in excellent agreement with our analytic results and suggest that our qualitative findings are robust to variations in the model that transcend our exact analysis.

To our knowledge, this is the first use of the theory of multiclass queues to investigate correlations in a genetic network model, and for this, multiclass queues seem well suited. Historically, multiclass queues have been used in business and electrical engineering to model congestion and delay arising when different classes of customers or packets “wait in line” for processing by a common limited resource. For example, in the Internet, packets from qualitatively different classes of content, e.g., voice and video, can be processed by a single server. The server has a maximum processing speed and so even though voice messages are distinct from video, an increase in voice traffic will slow video processing. Accordingly, there is a correlation between video throughput and voice throughput.

Multiclass queues are also used in modeling manufacturing, computer, and service systems where the entities being processed can be products, tasks, or people, and are usually referred to generically as “jobs”—a term we shall adopt henceforth in describing multiclass queues. For genetic networks, molecules are analogous to “jobs”, copies of the enzyme are analogous to “servers”, and protein species correspond to “classes”. As we illustrate in this article, the modern theory of multiclass queues can enable one to derive formulas and qualitative properties of correlations between multiple protein species processed by common enzymatic machinery. Single-class queuing models recently considered in Arazi et al. (6) and Levine and Hwa (7) in the context of genetic networks allow only a single chemical species to be processed by a given enzyme, and do not capture the kinds of correlations treated here.

The article is organized as follows. In Stochastic Model, beginning with a standard set of biochemical reactions, we define precisely our stochastic model of coupled enzymatic processing. In Main Results, we summarize the main results of the article which are derived in Queuing Connection and Steady-State Distribution, Moments and Correlations, and Generalization for Reversible Binding. These results are based on the new observation that there is a connection between our stochastic model and a multiclass queue (see Fig. 1). This connection is explained in detail in Queuing Connection and Steady-State Distribution, where we use it to obtain a closed form expression for the steady-state distribution of the numbers of molecules of each protein species in our stochastic model. In Moments and Correlations, we derive explicit formulas for moments and correlations associated with this steady-state distribution.

Figure 1.

Figure 1

Schematic for m = 2 (two species of protein) for the stochastic model and an associated multiclass queue. The molecular count process in the stochastic model is equivalent in distribution to the job count process in the multiclass queue. (a) The stochastic model where proteins of species i are added to a volume at rate λi and are selected at random (without regard to type) for enzymatic processing. (b) A multiclass queue associated with the stochastic model where jobs are analogous to molecules and servers are analogous to copies of the enzyme. Jobs of type i arrive at rate λi and are inserted randomly into a queue of jobs awaiting processing; when a server becomes free, a new job is selected for processing from the head of the queue of waiting jobs.

Our observations concerning correlation resonance are based on these formulas, which are illustrated for sample parameters in Figs. 2 and 3. Generalization for Reversible Binding derives similar results to those in Moments and Correlations for a generalization of our stochastic model considered as an approximate model for the biochemical reaction system with reversible binding of the enzyme. We conclude the article with a discussion of the potential implications that correlations generated by indirect coupling may have for the behavior of genetic networks. In anticipation of potential future network applications, in W. H. Mather, J. Hasty, L. S. Tsimring, and R. J. Williams (unpublished), we generalize the queuing results used here from the situation of a single multiclass queue to networks of multiclass queues.

Figure 2.

Figure 2

Results from stochastic simulation of the biochemical reactions compared to analytic and numerical results for two types of proteins (m = 2). Here “analytic” corresponds to analytic formulas obtained for the stochastic model with η+ = +∞ when K = 0, and with the Michaelis-Menten-like approximation for degradation rate when K > 0. The label “numeric” represents results for a refinement of the Michaelis-Menten-like approximation (when L > 1 and K > 0) where the distribution of N is computed numerically using a one-dimensional lattice method. (a, c, and e) Mean and coefficient of variation for Q2, and correlation, all as functions of λ1 for L = 1 (one copy of the enzyme) and K = 0, 5. Parameters used were μ = 10, λ2 = 5, and γ = 0.01. For simulations, we set η+ = 108 when K = 0 and η+ = 200, and η = 1000 when K = 5. Averages (in time) for stochastic simulations were taken from trajectories of duration 2 × 105. (bd, and f) Similar to panels a, c, and e, with the only parameter differences being L = 10, μ = 1, and η+ = 20, η = 100 when K = 5.

Figure 3.

Figure 3

Dependence on system parameters of the peak (maximum) correlation and its location (when the correlation is considered as a function of Λ = λ1 + λ2 with L, μ, θ = log10(λ1/λ2), and γ fixed), for m = 2 (two species of protein). The peak correlation rmax and the corresponding total production rate Λmax at the peak were found by numerical methods using the analytic formula for the correlation. (a) Plot of rmax as a function of θ and γ for L = 10, μ = 1, and K = 0. Only values of peak correlation for θ ≥ 0 are shown because rmax (θ,γ) = rmax(−θ,γ) due to symmetry. (b) Peak correlation as a function of γ for θ = 0 and different values of L with  = 10 and K = 0. (c) Plot of Λmax/ as a function of γ for several different values of L. Other parameters are the same as in panel b. The value of Λmax is independent of θ (see the Supporting Material), though the value of the peak correlation does depend on θ. For small γ, Λmax, i.e., peak correlation occurs near balance.

Stochastic Model

For concreteness, we consider a stochastic model of enzymatic degradation of proteins as a prototype for common enzymatic processing. Other types of processing could be treated similarly, provided that products of the processing do not interact further with the other components of the model.

We consider a system with m protein species X1,…,Xm, which are being produced from their corresponding DNA templates D1,…,Dm at constant rates λ1,…, λm, respectively. Binding of a copy of the protein Xi to an (unbound) copy of the enzyme E occurs at rate η and the complex XiE is degraded with rate μ. We also take into account dilution of proteins at rate γ, in both their bound and unbound forms. This system is governed by the following set of biochemical reactions. For i = 1,…,m,

DiλiXi+Di, (1)
Xi+EηXiE, (2)
XiEμE, (3)
XiEγE, (4)
Xiγ. (5)

All reaction times are exponentially distributed. We assume that the binding reaction rate η is so large that the associated binding reactions are effectively instantaneous.

More precisely, our stochastic model for this system may be described as follows. New molecules of protein species i are produced at the jump times of a Poisson process with rate λ1 > 0, for each i = 1,…,m. We assume there are L > 0 copies of the enzyme and that, when one of the L copies becomes free, it selects a protein molecule at random, binds to it instantly, and begins to degrade it. The degradation time is exponentially distributed with mean 1/μ where μ > 0. In addition, dilution is modeled as follows (we view this as an approximation to the discrete process of cell growth and division). Each protein molecule will remain in the system for, at most, an exponentially distributed amount of time with mean 1/γ where γ > 0. A molecule may be removed from the system by degradation before its dilution lifetime is up, or vice versa. Because the number of copies of the enzyme is often tightly regulated, we assume that dilution does not remove copies of the enzyme.

Production, degradation, and dilution are all statistically independent, production of one protein species is independent of that of any other species, and times to degrade distinct molecules as well as times until removal by dilution for distinct molecules are all independent of one another. Note that μ and γ are the same for all protein species. Arguably, a common value of γ is reasonable, as dilution is a communal effect. Although our exact analysis uses the assumptions of a common degradation rate μ and a fixed number of copies of the enzyme, simulations of the biochemical reactions for two protein species with differing degradation rates for the two protein species and a randomly varying number of copies of the enzyme (due to dilution and production) suggest that our qualitative findings regarding correlations are robust to such changes in the model (see the Supporting Material for more details).

We let Qi(t) denote the total number of molecules of protein species i that are present in the system at time t; this includes free molecules of species i as well as molecules of the species that are in the process of being degraded by enzyme. Let

N(t)=i=1mQi(t)

be the total number of molecules of all protein species that are present in the system at time t. The stochastic process {Qi(t), t ≥ 0} is the molecular count process for species i and

{Q(t)=(Q1(t),,Qm(t)),t0}

is the vector-valued count process for all species. (The letter Q here is mnemonic for quantity and the letter N is mnemonic for total number.) Because μ and γ do not depend upon i, the process N = {N(t), t ≥ 0} is a one-dimensional birth-death process and consequently is Markov. However, Q is not a Markov process. To obtain a full Markovian state descriptor for our model, we need to include some further information about the numbers of each species that are being processed at any given time. In Queuing Connection and Steady-State Distribution, we describe a multiclass queuing model whose job count process is distributionally equivalent to the molecular count process Q and give a natural Markovian state descriptor for the multiclass queue (see Fig. 1).

Main Results

Using a connection between our stochastic model and a multiclass queue, we show that the steady-state distribution for the molecular count process, Q, can be described as follows. Conditioned on the total number of protein molecules N in the system being n, the steady-state distribution for Q is a multinomial distribution with parameters (n; p1,…,pm), where pi = λi/Λ, i = 1,…,m, and

Λ=i=1mλi

is the total rate at which protein molecules (of all species) are being produced. In other words, in steady state, given that there is a total of n molecules in the system, the distribution of the quantities of each type is as if each of the n molecules, independently of the others, is of type i with probability pi, the proportion of the newly produced proteins that are of type i.

Consequently, the steady-state moments for Q can be expressed in terms of the steady-state moments for N. In particular, for i = 1,…,m, the steady-state mean and squared coefficient of variation (SCV = variance divided by the square of the mean) for Qi are given by

E[Qi]=piE[N], (6)
SCV(Qi)=SCV(N)1E[N]+1E[Qi], (7)

and the correlation between Qi and Qj for ji is given by

rij=E[QiQj]E[Qi]E[Qj]Var(Qi)Var(Qj)=ν(N)1(ν(N)1+1pi)1/2(ν(N)1+1pj)1/2, (8)

where E[Y] denotes expectation (or mean) of a steady-state random variable Y, Var(Y) denotes the variance of Y, and ν(N) = Var(N)/E[N] is the Fano factor for N.

The steady-state distribution for N (being that of a one-dimensional birth-death process) is readily seen to be given by

P(N=n)=cΛn=1nϕ(),n=0,1,2,, (9)

where

ϕ(n)=min(n,L)μ+nγ,n=0,1,2,, (10)

is the total rate at which protein molecules are being removed from the system due to the combined effects of degradation and dilution and c is the normalizing constant chosen so that the right-hand side of Eq. 9 represents a probability distribution. Explicit expressions for the steady-state moment-generating function and first and second moments for N are derived later in Moments and Correlations when there is one copy of the enzyme (L = 1) and in the Supporting Material when there are multiple copies (L > 1). These yield corresponding analytic formulas for steady-state moments and correlations for Q. In Figs. 2 and 3, we illustrate these formulas with plots for some sample parameters in the case of two protein species (m=2).

In Fig. 2, sample plots are given for the mean of Q2, coefficient of variation for Q2, and correlation between Q1 and Q2, all as functions of λ1 for L = 1 and L = 10, while the other parameters are held fixed. The plots labeled K = 0 correspond to the stochastic model described so far, whereas the plots for K > 0 correspond to a generalization considered further below. The plots show results produced by stochastic simulation of the set of biochemical reactions as well as those obtained using our analytic formulas. Comparison of the two shows excellent agreement.

The peak in correlation apparent in Fig. 2, e and f, illustrates a general phenomenon that we observed in numerical exploration of our analytic correlation formulas: with small dilution rate (γ << μ), the correlation has a peak near the balance-point where the total rate of influx of proteins is equal to the maximum processing capacity of the enzyme, i.e., where λ1 + λ2 = . We call this phenomenon “correlation resonance”. These observations are consistent with the following simple asymptotic formula derived for L = 1 which shows that, in the zero dilution limit (γ → 0), the correlation becomes perfect as the balance-point (ρ =Δ Λ/μ = 1) is approached from below:

rij=(1+1pi(1ρ1))1/2(1+1pj(1ρ1))1/2,i,j=1,,m,andji. (11)

In Fig. 3, based on numerical analysis of our analytic formulas for two species, we illustrate the dependence of the size and location of the peak correlation on the model parameters. Fixing the number of copies of the enzyme L, the logarithm of the ratio of the production rates θ = log10(λ1/λ2), the degradation rate μ, and the dilution rate γ, and considering the correlation r12 as a function of Λ = λ1 + λ2, the peak correlation rmax and the value of Λ = Λmax (where the peak occurs) were computed numerically.

The plots in Fig. 3 are representative of the results we found in numerical exploration of our formulas. Fig. 3 a illustrates the dependence of rmax on θ and γ, for L = 10 and μ = 1. We observe that the peak correlation is large (near 1) for small dilution rate γ and nearly equal production rates (θ ≈ 0), and decays monotonically as either γ or |θ| is increased. For γ fixed, the maximum value of rmax occurs at θ = 0. This maximum value is close to 1 for small γ and decays to 0 as γ is increased. Fig. 3 b illustrates, for θ = 0 and fixed processivity  = 10, the dependence of rmax on γ for several different values of L. For large and small γ, rmax approaches 0 and 1, respectively, independent of L. But for intermediate γ, rmax decays with L as expected, because increasing L reduces competition between the species for enzyme. Fig. 3 c illustrates the dependence of Λmax on γ for several different values of L with  = 10. For fixed L, μ and γ, Λmax is independent of θ (see the Supporting Material), and for small values of γ, Λmax.

The generalization of our stochastic model considered later in Generalization for Reversible Binding is intended as an approximation for the biochemical reaction system where the binding reactions in Eq. 2 are allowed to be reversible,

Xi+Eηη+XiE,i=1,,m, (12)

and η+ and η are both large. This generalization is obtained by simply replacing the degradation rate μ in our stochastic model by a Michaelis-Menten-like rate function of the form /(K + n), where K = η/η+ and n is the total number of proteins in the system. With this change in the stochastic model, the formulas in Eqs. 6–8 for the steady-state means, SCVs, and correlations of Q in terms of the steady-state moments of N still hold, but the moments of N are now different. We derive explicit formulas for these for L = 1 and K > 0 in Generalization for Reversible Binding and for L > 1 and K > 0 in the Supporting Material. Plots in Fig. 2 labeled with K = 5 illustrate these formulas for a two-species model.

In addition to the plots resulting from stochastic simulation of the set of biochemical reactions and from our analytic formulas, for L = 10 and K = 5, additional curves labeled “numeric” result from a refinement of the Michaelis-Menten-like approximation which gives a better fit to the simulation results. For this refinement, the degradation rate μ in the original stochastic model is replaced by a state-dependent rate μk(n) where n is the total number of protein molecules in the system. The function k(n) is the steady-state fraction of the enzymatic processors that are bound to protein molecules, computed for the fast binding/unbinding reactions given by Eq. 12, and conditioned on the total number of protein molecules in the system being fixed at n. The plot is labeled as “numeric” because even though an analytic expression is available for k(n), computing the moments of N with this generalization was done numerically.

Fig. 4 shows sample trajectories illustrating the dynamic behavior of the molecular count process Q when there are 10 copies of the enzyme (L = 10); the other parameter values (except for λ1) are the same as in Fig. 2, b, d, and f, when K = 5 and λ1 is allowed to take the three values 1, 5, and 20, corresponding to underloaded, balanced, and overloaded regimes, respectively. Although our mathematical analysis does not treat dynamic behavior, the correlations exhibited here are consistent with our steady-state results and the behavior in the λ1 = 5 case is reminiscent of process-level correlation (or state space collapse) observed for various multiclass queues near the balance-point (9,10).

Figure 4.

Figure 4

Sample trajectories for Q1 and Q2 resulting from stochastic simulations of the biochemical reactions in Eqs. 1, 3–5, and 12 when there are two species of protein (m = 2), 10 copies of the enzyme (L = 10), and λ2 = 5, μ = 1, γ = 0.01, η+ = 20, and η = 100 (compare to Fig. 2, b, d, and f). Trajectories are shown in panels ac for λ1 = 1, λ1 = 5, and λ1 = 20, respectively. In each case, the last 1000 time units of a 4 × 107 time-unit run are shown. Time averages computed using the last half of each run yielded correlations of (a) 0.23, (b) 0.72, and (c) 0.20, respectively, which are consistent with the results displayed in Fig. 2f.

In the next three sections, we give a full description of our results and explain how these results are derived.

Queuing Connection and Steady-State Distribution

We can envisage the evolution of the stochastic model introduced in Stochastic Model as that of a multiclass queue in which molecules are analogous to jobs, copies of the enzyme are analogous to servers, and degradation corresponds to processing of a job by a server. In addition, dilution corresponds to a phenomenon called reneging, wherein jobs waiting or in service abandon the system before their processing is completed. The job count process for such a multiclass queue has the same distribution as the molecular count process Q for our stochastic model.

We now describe the multiclass queuing model in detail. Jobs of type i arrive to a single queue according to a Poisson process with rate λi for i = 1,…,m. There are L servers available for processing jobs (each server can process only one job at a time). We keep jobs in the queue while they are being processed; jobs are ordered in the queue and the jobs currently being processed are at the head of the queue. When a new job arrives to the queue, if there are fewer than L jobs in the system, the new job joins the end of the queue and immediately commences being processed by one of the free servers; on the other hand, if there are L or more jobs already in the system, then all of the servers are busy processing jobs and the new job is placed in a random position amongst those jobs in the queue that are not currently being processed. (The reason for this random placement of newly arriving jobs is explained in the next paragraph.) If n denotes the total number of jobs in the system at a given time, then when nL, each job is being processed at an exponential rate of μ and is independently reneging at an exponential rate of γ. And when n > L, the first L jobs are each being processed at an exponential rate of μ, and each of the n jobs is independently reneging at an exponential rate of γ. When a job finishes being processed or reneges, the job departs from the system and the size of the queue is reduced by one, with the previous order of the remaining jobs being maintained. A newly freed server immediately selects the job that is at the head-of-the-line of waiting jobs and starts processing it. If there is no such job, the server idles until a new job is available for processing.

Random placement of new jobs amongst the queue of waiting jobs ensures that the job count process will have the same distribution as would be obtained if the server, in selecting a new job to be processed, were to choose at random from amongst the jobs waiting to be processed. This observation is the key to the fact that the job count process in this multiclass queuing model has the same distribution as the molecular count process in the stochastic model. Fig. 1 depicts the setup for the stochastic model (Fig. 1 a) and for the multiclass queuing model (Fig. 1 b) when there are two protein species/types of jobs (m = 2).

We have chosen to do the modeling in terms of random placement in the queue (rather than random selection to begin processing) so as to make a correspondence with the multiclass queuing model framework in Kelly (11).

To enable the reader to clearly see the correspondence with that setup, we give a few more details of this correspondence below. In Kelly's notation, when a new job arrives to the queue with n jobs in it already, if nL, with probability Δ(ℓ, n + 1) = 1/(n + 1 − L) , the new job is placed in position ℓ ∈{L + l,…,n +1} in the queue, and if ℓ ≤ n, jobs previously in positions ℓ, ℓ + 1,…,n move to positions ℓ + 1, ℓ + 2,…,n + 1, respectively. If n < L, the new job is placed behind the other jobs in the queue, i.e., it is placed in position ℓ = n + 1 with probability Δ(n + 1, n + 1) = 1.

The exponential processing and reneging are realized in the following way. Each job requires an exponentially distributed amount of service with mean 1. When there are n jobs in the queue, total service effort (from both processing and reneging) is given by ϕ(n) (see Eq. 10); this is the overall rate at which jobs are departing from the system when there are n jobs in the system. The proportion Γ(ℓ, n) of this effort that is directed to the job in position ℓ is (μ + γ)/ϕ(n) for ℓ = 1,…,min(n,L) and is γ/ϕ(n) for ℓ = L + 1,…,n when n > L. A job departs from the system whenever the amount of service effort it has received is equal to its random service requirement. When a job in position ℓ ∈{l,…,n} departs, the jobs in positions ℓ + 1,…,n move to positions, ℓ, ℓ + 1,…, n − 1, respectively. (Here we have used the capital Greek letters Δ,Γ instead of the lower case Greek letters used in Kelly (11) to avoid confusion with our use of the latter for other parameters.)

The m-dimensional process that tracks the number of jobs of each type that are in the queuing system (both waiting and being processed), has the same distribution as the molecular count process of the stochastic model introduced back in Stochastic Model. Accordingly, we shall use the same symbol Q to denote both processes and, as before, let

N=i=1mQi.

At any time t, let S(t) be the vector of length N(t) where the th component of this vector is the type (a member of the set {1,…,m}) of the job that is in position ℓ in the queue at time t. A vector of length zero (the empty vector) corresponds to the queue being empty. Thus, S(t) is simply the list of jobs in the queue where an entry in the list specifies the type of the job in that position. The process S is a continuous time Markov chain.

It follows immediately from Theorem 3.1 and Corollary 3.4 of Kelly (11) that the steady-state distribution for S, and hence for Q and N, can be given explicitly. (Because the Markov chain S is irreducible, its steady-state distribution is the same as its stationary distribution.)

Theorem 1

The continuous time Markov chain S describing the state of the multiclass queuing model associated with our stochastic model of enzymatic processing has a unique steady-state distribution given by

P(S=(s1,,sn))=c=1n(psΛϕ()), (13)

for s1,,sn{1,,m}, and n = 0,1,2,…, where pi = λi/Λ, i = 1,…,m,

Λ=i=1mλi,

and c is the normalizing constant:

c=(n=0Λn=1nϕ())1. (14)

The associated steady-state distribution for

N=i=1mQi

is given by Eq. 9 and, conditioned on N = n > 0, the steady-state distribution of Q is a multinomial distribution with parameters (n; p1,…,pm), so that for any nonnegative integers q1,…,qm, and

n=i=1mqi,

the steady-state distribution of Q is given by

P(Q=(q1,,qm))=P(N=n)n!q1!…qm!i=1mpiqi. (15)

A key assumption for the analysis in Kelly (11) to hold is that the service effort provided to a job does not depend on its type, just its position in the queue. The above result is a special case of well-known results on equilibrium distributions in multiclass queuing theory and is associated with the fact that the queue is quasireversible. The concept of quasireversibility is specific to queuing systems; it was first identified by Muntz (12) and independently discovered and used by Kelly (11,13,14), who also coined the term “quasireversibility”.

A multiclass queue is quasireversible if the queue, when initialized with its steady-state distribution and run in reverse time, taking the departure processes as arrival processes and interchanging the parameters Γ(ℓ, n) and Δ(ℓ, n), is a queue of a similar type to the original queue run in forward time. In the language of the Markov chain S, for a stationary version of S, the process run in reverse time is a Markovian state descriptor associated with a multiclass queuing model with Poisson inputs having the same parameters as the original input processes, and with the parameters Γ(ℓ, n) and Δ(ℓ, n) interchanged. In particular, the multiclass queuing model has the input-output property that, in steady state, the departure processes (where a departure can occur either due to completion of processing or reneging) for each of the job types are the same as the arrival processes, i.e., independent Poisson processes with rates λi, i = 1,…,m.

Moments and Correlations

Using the fact that the job count process in the multiclass queue of the previous section has the same distribution as the molecular count process for our stochastic model, by Theorem 1, the steady-state distribution for this process Q is given by Eqs. 14–15 and Eq. 9. In this section, we derive explicit analytic expressions for moments and correlations of this distribution. In particular, we obtain succinct expressions for the first two moments and for the correlations when there is one copy of the enzyme (L = 1) in Eqs. 22–24 below. (Formulas for L > 1 are derived in the Supporting Material.) We believe that our explicit formulas for the correlation and our findings concerning correlation resonance are new even to queuing theory.

In the following derivation, Q and

N=i=1mQi

are assumed to have their steady-state distributions. Due to the factorized form of Eq. 15, the moments of Q can be expressed in terms of the moments of N. In particular, the means, variances, and second moments satisfy the following for i, j = 1,…,m,

E[Qi]=piE[N], (16)
Var(Qi)=pi2(Var(N)E[N])+piE[N], (17)
E[Qi2]=pi(1pi)E[N]+pi2E[N2], (18)
E[QiQj]=pipj(E[N2]E[N])forji, (19)

from which Eqs. 6–8 follow immediately. Furthermore, the moment-generating function for Q is given by

E[eu·Q]=n=0(i=1mpieui)nP(N=n)foruRm.

Here N has the steady-state distribution of a one-dimensional birth-death process with constant birth rate Λ and state-dependent death rate given by ϕ; this distribution is given by Eq. 9. Below, for L = 1, we derive formulas for the first two moments of N and for its moment-generating function; these formulas are written succinctly in terms of confluent hypergeometric functions.

Fix L = 1 and let β = (μ/γ) + 1, and δ = Λ/γ. To obtain an expression for the moment-generating function of N, we note that

=1nϕ()=γn(β)n,

where

(x)n=x(x+1)(x+2)(x+n1)

is the rising factorial or Pochhammer symbol. Thus, the normalizing constant c satisfies

c1=n=0δn(β)n=M(1,β,δ),

where M is the confluent hypergeometric function given by

M(x,y,z)=n=0(x)nzn(y)nn!. (20)

This function M is sometimes also denoted by 1F1 and is a standard function in computer packages such as MATLAB (The MathWorks, Natick, MA), Maple (Waterloo Maple, Waterloo, Ontario, Canada), and Mathematica (Wolfram Research, Champaign, IL). We then have the following formula for the moment-generating function of N:

E[euN]=cn=0(euΛ)n=1nϕ()=M(1,β,euδ)M(1,β,δ),uR. (21)

The function M has the property that

ddzM(x,y,z)=xyM(x+1,y+1,z).

Using this, and differentiating the moment-generating function in Eq. 21, we obtain

E[N]=δM(2,β+1,δ)βM(1,β,δ)

and

E[N2]=E[N]+2δ2M(3,β+2,δ)β(β+1)M(1,β,δ).

Substitution of these formulas into Eqs. 16 –19 and Eq. 8 yields the following key result.

Theorem 2

For our stochastic model of enzymatic processing associated with Eqs. 1–5, when there is one copy of the enzyme (L = 1), the steady-state means, variances, and correlations for the molecular count process Q are given by the following formulas. For i, j = 1,…,m and ji,

E[Qi]=piδM(2,β+1,δ)βM(1,β,δ), (22)
Var(Qi)=2pi2δ2M(3,β+2,δ)β(β+1)M(1,β,δ)(piδM(2,β+1,δ)βM(1,β,δ))2+piδM(2,β+1,δ)βM(1,β,δ), (23)
rij=h(β,δ)(h(β,δ)+pi1)1/2(h(β,δ)+pj1)1/2, (24)

where β = (μ/γ) + 1, δ = Λ/γ, and

Λ=i=1mλi,
h(β,δ)=2δM(3,β+2,δ)(β+1)M(2,β+1,δ)δM(2,β+1,δ)βM(1,β,δ),

and M is the confluent hypergeometric function given by Eq. 20. The above expression for the correlation rij is nonnegative (see the Supporting Material for a proof).

In the zero dilution limit (γ → 0), the correlation formula simplifies. Using the asymptotic result given in (4.3.7) of Slater (15),

M(x,y,yw)=(1w)x(1x(x+1)2y(w1w)2+O(|y|2)), (25)

for x fixed, w bounded, and y → ∞, we obtain that in the zero dilution limit, for ρ = Λ/μ < 1, i, j = 1,…,m and j ≠ i, rij is given by Eq. 11. As ρ ↑ 1 in Eq. 11, rij converges monotonically upwards to its maximum value of 1, indicating approach towards perfect correlation of the steady-state numbers of molecules of different protein species. This is reminiscent of process-level state space collapse observed for various multiclass queues in heavy traffic (9,10) and is the same limiting behavior as seen in the heavy traffic asymptotics (as ρ ↑ 1) obtained by Rege and Sengupta (16) for a stationary processor sharing queue. (Processor sharing is not the same service discipline as the random-order-of-service queuing discipline considered here, but it turns out that the steady-state queue length distribution is the same for both disciplines under our model assumptions.) Despite these prior observations in the heavy traffic regime (ρ ≈ 1), our quantitative formulas for the correlations rij, as well as our observations concerning the qualitative behavior of rij for various parameters (see Figs. 2 and 3), appear to be new.

Generalization For Reversible Binding

In this section, we consider a generalization of the stochastic model which is intended as an approximate model for the biochemical reactions where the binding reactions in Eq. 2 are replaced by the reversible reactions in Eq. 12. Our generalization of the stochastic model is obtained by simply replacing the degradation rate μ by a Michaelis-Menten-like rate function of the form /(K + n), where K = η/η+ and n is the total number of proteins in the system. This is a commonly used approximation when the binding/unbinding reactions are fast (17). With this change in the stochastic model, the molecular count process still corresponds to the job count process in a quasireversible multiclass queue for which the steady-state distribution is given explicitly by Eq. 9 and Eqs. 14 and 15 with ϕ now given by

ϕ(n)=min(n,L)nμK+n+nγ,n=0,1,2,. (26)

The formulas in Eqs. 16–19 and Eq. 8 for the moments and correlations of Q in terms of the moments of N still hold, but the moments of N are now different. We derive formulas for these for L = 1 and K > 0 below; derivations for L > 1 can be found in the Supporting Material.

Fix L = 1 and set α = K + 1, β = (μ/γ) + α, and δ = Λ/γ. Then,

=1nϕ()=γnn!(β)n(α)n

and the normalizing constant c satisfies c−1 = M(α, β, δ), where M is the confluent hypergeometric function given by Eq. 20. In a similar manner to that in the previous section, we then have

E[euN]=M(α,β,euδ)M(α,β,δ),E[N]=αδM(α+1,β+1,δ)βM(α,β,δ),E[N2]=E[N]+α(α+1)δ2M(α+2,β+2,δ)β(β+1)M(α,β,δ).

This yields the following generalization of the theorem of the previous section.

Theorem 3

For the generalization of our stochastic model of enzymatic processing associated with reversible binding, when there is one copy of the enzyme (L = 1), the steady-state means, variances, and correlations for the molecular count process Q are given by the following formulas. For i, j = 1,…, m, ji,

E[Qi]=αpiδM(α+1,β+1,δ)βM(α,β,δ),
Var(Qi)=α(α+1)pi2δ2M(α+2,β+2,δ)β(β+1)M(α,β,δ)(αpiδM(α+1,β+1,δ)βM(α,β,δ))2+αpiδM(α+1,β+1,δ)βM(α,β,δ),
rij=h(α,β,δ)(h(α,β,δ)+pi1)1/2(h(α,β,δ)+pj1)1/2,

where α = K + 1, β = (μ/γ) + α, and δ = Λ/γ,

h(α,β,δ)=(α+1)δM(α+2,β+2,δ)(β+1)M(α+1,β+1,δ)αδM(α+1,β+1,δ)βM(α,β,δ).

The above expression for the correlation rij is nonnegative (see the Supporting Material for a proof).

Using the asymptotic result given in Eq. 25, we obtain in the zero dilution limit that for ρ = Λ/μ < 1, rij is still given by the expression in Eq. 11. If we set K = 0 in the above formulas, we recover the formulas for the stochastic model with instantaneous irreversible binding reactions. The above formulas are illustrated in Figs. 2 and 3 for sample parameters.

Discussion

In intracellular signaling and metabolic systems, enzymes often interact with multiple substrates, which may represent different pathways or branches of the same pathway (18,19). As we showed in this article, this resource sharing provides an indirect but strong mechanism for correlation among concentrations of different substrates and may lead to apparent coupling among the pathways. This result is somewhat counterintuitive, because, generally, it is assumed that correlations arise from direct interactions. By exploiting a connection between enzymatic processing and multiclass queuing theory, we were able to obtain closed-form expressions for the steady-state joint distributions of multiple substrates and to derive explicit analytic formulas for the correlations and other moments of the distributions.

We demonstrated that, for a small dilution rate, the correlation reaches a peak near balance, where the total production rate of the substrates equals the maximal processivity of the common enzyme. This transition is also characterized by a sharp increase in the mean substrate concentrations. For much smaller production rates, the queue is short or absent, and competition for the enzyme is weak, resulting in small correlations. For production rates much larger than the processivity of the enzyme, the queue is very long, the production is mainly balanced by dilution, and correlations due to common enzymatic processing become small again. Our explicit formulas for correlations and our findings concerning correlation resonance appear to be new both in the context of biological systems and in queuing theory.

In deriving the results for the distributions and their moments, we exploited the quasireversible nature of the corresponding multiclass queuing model (11). Our theoretical results obtained here are corroborated by our experimental findings in N. A. Cookson, W. H. Mather, T. Danino, O. Mondragón-Palomino, R. J. Williams, L. S. Tsimring, and J. Hasty (unpublished). Using a synthetic two-gene circuit in Escherichia coli and multicolor fluorescent flow cytometry and microscopy, we demonstrated that modulating the transcription rate of one fluorescent protein affects the concentration of another if they both are endowed with the same LAA tag, and hence targeted for degradation by the common enzymatic degradation machine ClpXP.

We believe that substrate correlations due to coupled enzymatic processing play an important role in many cellular processes and should be incorporated in future computational models. This form of posttranslational regulation may explain some of the existing connections in wiring diagrams of proteomic and metabolic networks or introduce new links. Furthermore, this effect may provide a significant contribution to the apparent extrinsic noise in gene expression which is usually determined by cross-correlations of protein concentrations (20,21). This effect also has to be taken into consideration in the design of synthetic gene circuits which often rely on enzymatic processing of network components for increasing the speed of their turnover (22,23).

Acknowledgments

R.J.W. thanks Frank Kelly for conversations concerning quasireversibility in multiclass queues and Estelle Basor for help in identifying a connection with the confluent hypergeometric function.

This work was supported in part by the National Institute of General Medical Sciences of the National Institutes of Health (grants Nos. GM079333 and GM089976) and the Department of Energy Computational Science Graduate Fellowship Program (grant No. DE-FG02-97ER25308 to N.A.C. and W.H.M.). L.S.T. was supported, in part, by Office of Naval Research, Multidisciplinary University Research Initiative grant No. N00014-07-0741. Research of R.J.W. was supported, in part, by National Science Foundation grants Nos. DMS-0825686 and DMS-0906535.

Supporting Material

Document S1. Further details, a figure, and a reference
mmc1.pdf (227.5KB, pdf)

References

  • 1.Jordan J.D., Landau E.M., Iyengar R. Signaling networks: the origins of cellular multitasking. Cell. 2000;103:193–200. doi: 10.1016/s0092-8674(00)00112-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Alon U. CRC Press; Boca Raton, FL: 2006. An Introduction to Systems Biology: Design Principles of Biological Circuits. [Google Scholar]
  • 3.Reference deleted in proof.
  • 4.Arkin A., Shen P., Ross J. A test case of correlation metric construction of a reaction pathway from measurements. Science. 1997;277:1275–1279. [Google Scholar]
  • 5.Heuett W.J., Qian H. Grand canonical Markov model: a stochastic theory for open nonequilibrium biochemical networks. J. Chem. Phys. 2006;124:044110. doi: 10.1063/1.2165193. [DOI] [PubMed] [Google Scholar]
  • 6.Arazi A., Ben-Jacob E., Yechiali U. Bridging genetic networks and queueing theory. Physica A. 2004;332:585–616. [Google Scholar]
  • 7.Levine E., Hwa T. Stochastic fluctuations in metabolic pathways. Proc. Natl. Acad. Sci. USA. 2007;104:9224–9229. doi: 10.1073/pnas.0610987104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Reference deleted in proof.
  • 9.Bramson M. State space collapse with application to heavy traffic limits for multiclass queueing networks. Queue. Sys. Theory Appl. 1998;30:89–148. [Google Scholar]
  • 10.Williams R.J. Diffusion approximations for open multiclass queueing networks: sufficient conditions involving state space collapse. Queue. Sys. Theory Appl. 1998;30:27–88. [Google Scholar]
  • 11.Kelly F.P. John Wiley and Sons; New York: 1979. Reversibility and Stochastic Networks. [Google Scholar]
  • 12.Muntz, R. R. 1972. Poisson Departure Processes and Queueing Networks. IBM Research Report No. RC 4145. IBM Thomas J. Watson Research Center, Yorktown Heights, NY.
  • 13.Kelly F.P. Networks of queues with customers of different types. J. Appl. Probab. 1975;12:542–554. [Google Scholar]
  • 14.Kelly F.P. Networks of queues. Adv. Appl. Probab. 1976;8:416–432. [Google Scholar]
  • 15.Slater L.J. Cambridge University Press; Cambridge, UK: 1960. Confluent Hypergeometric Functions. [Google Scholar]
  • 16.Rege K.M., Sengupta B. Queue-length distribution for the discriminatory processor-sharing queue. Oper. Res. 1996;44:653–657. [Google Scholar]
  • 17.Keener J., Sneyd J. Springer-Verlag; New York: 1998. Mathematical Physiology. [Google Scholar]
  • 18.Elf J., Paulsson J., Ehrenberg M. Near-critical phenomena in intracellular metabolite pools. Biophys. J. 2003;84:154–170. doi: 10.1016/S0006-3495(03)74839-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Ptashne M. Regulated recruitment and cooperativity in the design of biological regulatory systems. Philos. Trans. Math. Phys. Eng. Sci. 2003;361:1223–1234. doi: 10.1098/rsta.2003.1195. [DOI] [PubMed] [Google Scholar]
  • 20.Swain P.S., Elowitz M.B., Siggia E.D. Intrinsic and extrinsic contributions to stochasticity in gene expression. Proc. Natl. Acad. Sci. USA. 2002;99:12795–12800. doi: 10.1073/pnas.162041399. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Elowitz M.B., Levine A.J., Swain P.S. Stochastic gene expression in a single cell. Science. 2002;297:1183–1186. doi: 10.1126/science.1070919. [DOI] [PubMed] [Google Scholar]
  • 22.Hasty J., McMillen D., Collins J.J. Engineered gene circuits. Nature. 2002;420:224–230. doi: 10.1038/nature01257. [DOI] [PubMed] [Google Scholar]
  • 23.Heinemann M., Panke S. Synthetic biology—putting engineering into biology. Bioinformatics. 2006;22:2790–2799. doi: 10.1093/bioinformatics/btl469. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Document S1. Further details, a figure, and a reference
mmc1.pdf (227.5KB, pdf)

Articles from Biophysical Journal are provided here courtesy of The Biophysical Society

RESOURCES