Significance
Modeling intracellular processes has long relied on the markovian assumption. However, as soon as a reactant interacts with its environment, molecular memory definitely exists and its effects cannot be neglected. Since the Markov theory cannot translate directly to modeling and analysis of nonmarkovian processes, this leads to many significant challenges. We develop a formulation, namely the stationary generalized chemical-master equation, to model intracellular processes with molecular memory. This formulation converts a nonmarkovian question to a markovian one while keeping the stationary probabilistic behavior unchanged. Both a stationary generalized Fokker–Planck equation and a generalized linear noise approximation are further developed for the fast evaluation of fluctuations. These formulations can have broad applications and may help us discover new biological knowledge.
Keywords: chemical-master equation, biochemical-reaction system, nonmarkovian reaction kinetics, gene-expression noise, molecular memory
Abstract
Many cellular processes are governed by stochastic reaction events. These events do not necessarily occur in single steps of individual molecules, and, conversely, each birth or death of a macromolecule (e.g., protein) could involve several small reaction steps, creating a memory between individual events and thus leading to nonmarkovian reaction kinetics. Characterizing this kinetics is challenging. Here, we develop a systematic approach for a general reaction network with arbitrary intrinsic waiting-time distributions, which includes the stationary generalized chemical-master equation (sgCME), the stationary generalized Fokker–Planck equation, and the generalized linear-noise approximation. The first formulation converts a nonmarkovian issue into a markovian one by introducing effective transition rates (that explicitly decode the effect of molecular memory) for the reactions in an equivalent reaction network with the same substrates but without molecular memory. Nonmarkovian features of the reaction kinetics can be revealed by solving the sgCME. The latter 2 formulations can be used in the fast evaluation of fluctuations. These formulations can have broad applications, and, in particular, they may help us discover new biological knowledge underlying memory effects. When they are applied to generalized stochastic models of gene-expression regulation, we find that molecular memory is in effect equivalent to a feedback and can induce bimodality, fine-tune the expression noise, and induce switch.
Quantitative understanding of the dynamics of single living cells is one of the main goals of modern molecular-systems biology. Traditionally, modeling of intracellular biochemical processes is based on the markovian hypothesis, i.e., the stochastic motion of the reactants is uninfluenced by previous states, only by the current state. This memoryless property implies that markovian reaction kinetics can be described by poissonian processes with constant rates, which are characterized by exponential waiting-time distributions (1, 2). The mathematical tractability of markovian reaction processes enables great simplifications in problem formulation, leading to important successes in the description of many intracellular processes (1–4). These studies revealed that the characteristic parameters in the waiting-time distributions, as well as the complex properties of the intracellular process, can have a strong impact on the reaction kinetics.
However, intracellular reaction processes are not necessarily markovian but may be nonmarkovian. First, as a general rule, the dynamics of a given reactant resulting from its interactions with the environment cannot be described as a markovian process since this interaction can create “molecular memory” characterized by nonexponential waiting-time distributions. Second, the reduction of multistep reactions into a single step one may lead to a nonmarkovian process (5). Third, synthesis of a macromolecule would involve several small single- or multimolecular reaction steps (referring to Fig. 1A), creating a memory between individual events (6–9). This is evidenced by the fact that inactive phases of the promoter involving the prolactin gene in a mammalian cell are differently distributed, showing strong memory (10). More generally, the complex control process of gene expression, which would involve several repressors, transcription factors (TFs), and mediators as well as chromatin remodeling or changes in supercoiling, can generate nonexponential time intervals between transcription windows. Indeed, molecular memory, which can result in nonmarkovian (or nonpoissonian) reaction kinetics, has been confirmed by the increasing availability of time-resolved data on different kinds of interactions (10–16).
Fig. 1.
A framework for analysis of stochastic reaction processes with molecular memory. (A) A possibly involved multistep multimolecular reaction process with exponential waiting times is mapped onto a single-step reaction process with nonexponential waiting times, where the initial and final states are observable or measurable, but the intermediate states, which are consequences of single-molecular or multimolecular reactions, would be unknown or have not been unspecified. Note that does not represent a reaction but represents the transformational relation between states and , which possibly involves single- or multimolecular reactions. (B) Our theory shows that a nonmarkovian problem can be converted into a markovian one via an sgCME.
The extensive existence of molecular memory raises important yet unsolved questions: e.g., how does this memory affect nonmarkovian reaction kinetics? In what way and how accurately do life forms achieve the order required to develop and sustain their lives from the disordered reaction events? These questions were partially addressed in a seminal paper (6) wherein Pedraza and Paulsson analyzed a queuing model of stochastic gene expression with molecular memory, which was further generalized and analyzed (7, 17, 18). Recently, Park et al. (19) analyzed another class of queuing models of stochastic gene expression with molecular memory and presented the chemical fluctuation theorem, which provides an accurate relationship between the environment-coupled chemical dynamics of gene expression and gene-expression variability. These analyzed models are based on queuing theory, which is useful in treating simple single-molecular reaction networks such as birth–death processes, but it seems difficult to extend it to complex bi- or multimolecular reactions, which are a characteristic of many intracellular reaction networks. On the other hand, the continuous time random walk (CTRW) framework provides a different modeling way for stochastic reaction processes and has been extensively used in modeling and analysis of stochastic processes (20–22). Despite this, we still lack a general theory for modeling and analysis of general reaction networks with intrinsic waiting-time distributions. Since the sources of nonmarkovianity and network topology can jointly influence the behavior of the whole biochemical system, the reliable tools and mathematical machinery of the Markov theory do not translate directly to modeling and analysis of nonmarkovian reaction processes, leading to many significant challenges. It is needed to develop new methods to characterize nonmarkovian reaction kinetics.
In recent years, the interest in nonmarkovian processes within scientific communities has been blossoming, from the viewpoints of experimental observations (10–16), mathematical modeling (5–7, 17–30), and numerical simulation (30–38). However, because of difficulties in treating nonmarkovian reaction kinetics, only a few exact master equations are known in the literature, most of which are rather formal, or concerning specific systems or stochastic processes (6, 7, 17–20, 39, 40). The aim of this paper is to fill this gap by introducing useful formulations for a general biochemical-reaction system with molecular memory. The basic idea is that we first construct a topology-equivalent, memoryless reaction network by introducing an effective transition rate (ETR) for each reaction in the original nonmarkovian system and then establish an effective master equation for the equivalent reaction network, whose stationary equation is termed as a stationary generalized chemical-master equation (sgCME). The sgCME exactly captures the stationary probabilistic behavior of the original nonmarkovian system. In other words, the original nonmarkovian question is converted into a markovian one (Fig. 1B), greatly simplifying analysis of nonmarkovian processes. Based on the sgCME, we further develop another 2 useful techniques for studying stochastic reaction processes on networks: stationary generalized Fokker–Planck equation (sgFPE) and generalized linear noise approximation (gLNA), each enabling fast yet effective evaluation of fluctuations, including the effect of molecular memory.
We demonstrate the power of the above 3 techniques by analyzing 4 examples: a generalized model of constitutive gene expression, a generalized model of gene self-regulation, a generalized ON-OFF model, and a generalized model of genetic toggle switch, each considering molecular memory. These examples are chosen so that analytical or numerical approximations of solutions to the sgCMEs can easily be obtained, and these approximations are then used to clarify the origins of nonmarkovianity and fluctuations in each case and to trace the effects of different parameters on the stochastic properties of the systems. The results reveal the importance of the effect of molecular memory on reaction kinetics.
Methods
A Theoretical Framework for Reaction Processes on a Network.
Before presenting our sgCME, let us give the representation of a general reaction network in terms of intrinsic waiting-time distributions. In order to cast the dynamics of different species that participate in different reactions (numbered by with ) into a CTRW framework, we first define a state space. Denote by the chemical species and by the corresponding particle number, where . Let represent the state vector of particle numbers, where denotes transpose. Denote by and the stoichiometric coefficients for the reaction, i.e., and represent the loss and gain in particle number during the reaction, respectively. On the other hand, a single event of reaction is characterized by the intrinsic waiting time , whose probability density function (PDF), denoted by , depends, in general, on the system state . To that end, the above reaction network can be represented by
| [1] | 
Note that represents the net change of in the reaction. Function is called the intrinsic waiting-time distribution for reaction (20, 39, 40). We emphasize that such a waiting-time distribution is an extension of exponential waiting-time distribution in the markovian models but is different from waiting-time distributions in queuing theory (21). In addition, intrinsic waiting-time distributions are suitable to the description of any reactions, including bimolecular or multimolecular reactions, so they are also extensions of delay distributions introduced in refs. 5 and 28, wherein the authors proposed a methodology to represent chains of single-molecular reactions by simpler, reduced models.
If all of the reaction events happen in a markovian (or memoryless) manner, then reduces to for all (hereafter, we always assume ), where is the reaction-propensity function for reaction . This expression is natural since exponential waiting-time distributions, which have a clear biophysical foundation (41), are a main characteristic of markovian reaction processes. In contrast to the times until the next molecular events happen in single-step processes (elementary reactions), the waiting times between reaction events are not exponentially distributed in general. If the process goes through a series of (identical) exponential steps, the waiting times will be gamma-distributed (42). If we consider an intrinsic waiting-time distribution of the gamma type: , where may be understood as the number of small yet possibly unspecified reaction steps, then corresponds to an exponential waiting-time distribution, whereas (called memory index) to a nonexponential waiting-time distribution with the noise intensity (defined as the ratio of the variance over the squared mean) being . Note that for a reaction network, as long as there is a reaction such that the corresponding memory index is not equal to 1, the whole reaction kinetics is nonmarkovian. In this paper, we will only consider intrinsic waiting-time distributions of (exponential) and (nonexponential).
sgCME.
Effective transition rates.
For convenience, let be the cumulative distribution function of , i.e., . If we define , then represents the probability that the reaction happens and the reaction waiting time is in interval . Note that the conservative condition always holds.
Now, we introduce a memory function for reaction , denoted by , which is defined through the Lapalace transform: , where , and the above bar represents the Laplace transform of a function, e.g., . To help the reader understand function , let us consider waiting-time distributions of special forms: , with being the reaction-propensity function for reaction , implying that all of the reaction events happen in markovian manners. Then, by the expression of in combination with the inverse of the Laplace transform, we can show for all . In this case, the memory functions reduce to the common reaction-propensity functions.
Interestingly, we find that the limit exists, given waiting-time distributions for a reaction network. Moreover, if this limit is denoted by , then it can be analytically expressed as (see SI Appendix for details)
| [2] | 
which is termed as the ETR for reaction , where . If there exists some subscript such that for all , i.e., if the waiting time of the reaction follows an exponential distribution, function reduces to the common reaction-propensity function, i.e., (see SI Appendix for details). If waiting-time distributions for all of the reactions are exponential, the nonmarkovian reaction network reduces to a markovian reaction one. Therefore, ETRs are extensions of reaction-propensity functions. However, different from function that is a polynomial of , function is in general a rational function of . For example, consider a generalized birth–death process: and , where represents the number of molecules. If and with being a positive integer, we can show and , which is apparently a rational function of if . SI Appendix also gives the explicit expressions of ETRs in the case of environmental perturbations or external noise that can lead to stochastic reaction delays.
Effective CME.
For the above (nonmarkovian) reaction network, if we let be the probability that the system is in state at time , then according to the CTRW theory (20, 39, 40), we can derive the following CME expressed in Laplace transforms
| [3] | 
By the final value theorem (42), we know that if the limit exists, 2 limits and are equal, i.e., . Multiplying on both sides of Eq. 3, letting and using , we can obtain the following stationary equation (SI Appendix):
| [4] | 
which is called the sgCME, where is the stationary distribution corresponding to and is assumed to exist (numerical simulations verifies this, referring to Fig. 2C). Eq. 4 with Eq. 2 is one of the main results of this paper.
Fig. 2.
Analysis of a generalized model of constitutive gene expression without self-regulation, i.e., is independent of . (A) Schematic representation of the model. (B) Transitions between states. (C) Protein distributions at several different time points, where curves with empty circles correspond to the original nonmarkovian model, whereas colored curves correspond to the constructed markovian model. (D) Influences of memory index on mean protein levels, where empty circles represent the results obtained by the modified “first-reaction” Gillespie algorithm (41), whereas the lines represent the results obtained by theoretical predictions. (E) Stationary protein distributions for different values of and . Parameter values are set as: , and (C) and (D and E).
On the other hand, we can use to construct a reaction network. This is done by taking as the reaction-propensity function for the reaction in an topologically equivalent reaction network with the same substrates but without molecular memory. For this reaction network, we can establish its CME. In fact, if we let be the probability that the system is in state at time , the corresponding CME takes the form
| [5] | 
which is called an effective CME for the original nonmarkovian reaction system, where “effective” will be interpreted below.
Notably, the stationary equation corresponding to Eq. 5 is exactly the same as Eq. 4 except for notation, implying that the stationary probabilistic behavior of the original nonmarkovian reaction system is exactly the same as that of the constructed markovian reaction network. In this sense, the original nonmarkovian problem is converted to a markovian one. However, there would exist differences in dynamic probability behavior between the 2 reaction networks, referring to Fig. 2C, which also verifies that the stationary distribution indeed exists even in the presence of molecular memory. Despite this difference, Eq. 5 provides a way for studying complex nonmarkovian reaction kinetics.
Generalized Linear Noise Approximation.
In analysis of reaction networks, an extensively used technique is the linear noise approximation (LNA) (1, 2). Here, we derive a generalized LNA (gLNA) for the above general reaction system with arbitrary waiting-time distributions.
First, the rate equations corresponding to the constructed-above markovian reaction network read
| [6] | 
where , represents the concentration of reactive species , i.e., with being the volume of the system, and is given by Eq. 2 if is replaced with . Let be the stoichiometric matrix and be a column vector of ETRs. Then, the algebraic equation
| [7] | 
determines the steady state of the deterministic system described by Eq. 6, which is denoted by .
Then, we adopt the Ω-expansion method (1, 2) to derive an algebraic equation for covariance matrix. Write , where is a vector of random variables. Under this transform, the original becomes another . According to SI Appendix, we can derive the following Lyapunov matrix equation:
| [8] | 
where is a matrix with elements , is a noise matrix with elements , and is a covariance matrix to be unknown. Note that the diagonal elements of represent the variances of the random variables, and the vector of the mean concentrations of the reactive species is approximately given by .
We point out that the above analysis framework is convenient for both clarifying the origins of nonmarkovianity (including fluctuations) and tracing the effects of different parameters on the stochastic properties of the underlying systems. SI Appendix provides details for examples of analysis by the gLNA. We point out that a slow-scale LNA (ssLNA) has been developed, which describes a class of nonmarkovian systems stemming from timescale separation (43). Specifically, starting with a markovian system composed of fast and slow species, the authors of that paper derived the LNA for the nonmarkovian system, which describes only the slow species (the observables).
sgFPE.
As an effective approximation of the CME in some situations, the Fokker–Planck equation (FPE) has extensively been used (1, 2, 44, 45), mainly because the latter is more easily analyzed and can often provide more intuitive understanding of a biochemical system than the former. However, the FPE has not been established in the presence of molecular memory. Here, we derive a sgFPE for a general nonmarkovian reaction network with arbitrary waiting-time distributions.
First, although Eq. 3 holds for the discrete variables, it also holds for the corresponding continuous variables. Second, Taylor-expanding the CME in the case of continuous variables to the second-order term yields
| [9] | 
Multiplying on both sides of Eq. 9, letting and making use of the facts: and , we can arrive at the following sgFPE:
| [10] | 
In the next section, we will use Eq. 10 to analyze generalized stochastic models of gene expression and obtain some interesting results on the effect of molecular memory.
In the following section, we will apply the above general theory to 4 gene models: a generalized model of constitutive gene expression, a generalized model of gene self-regulation, a generalized ON-OFF model, and a generalized model of genetic toggle switch, where by “generalized,” we mean that each model considers molecular memory or nonmarkovianity. Then, we discover biological knowledge, e.g., molecular memory is in effect equivalent to a feedback and can induce bimodality, fine-tune the expression noise, and induce switch. (The related data will be available from the corresponding author upon request.)
Results
The Effect of Molecular Memory Is Equivalent to the Introduction of a Feedback.
Understanding how a gene is turned on at a mechanistic level has been one of the big challenges in molecular biology and has received extensive attention over decades. Identifying the actual sequence of events during gene expression and establishing the method of recruitment have turned out to be a surprisingly difficult task (46). Here, we introduce a generalized model to mimic complex biochemical processes underlying gene expression, referring to Fig. 2A, where the proteins are assumed to be produced instantaneously after messenger RNAs (mRNAs) are produced. Fig. 2B is a schematic representation of transitions between protein states with time. Let and be waiting-time distributions for protein synthesis and degradation, respectively, where represents the number of protein molecules.
First, consider the case without regulation but with molecular memory. Two waiting-time distributions are set as and , where and are positive constants (which may be understood as the mean synthesis and degradation rates, respectively). Before presenting analytical results, we perform numerical calculation with results shown in Fig. 2, where Fig. 2C demonstrates that the stationary protein distribution indeed exists even in the presence of molecular memory. Hereafter, we will vary memory index ( or ) while keeping the constant average time between successive reactions by scaling parameter appropriately with , i.e., keeping the ratio fixed.
As pointed out above, the stationary probabilistic behavior of the original nonmarkovian reaction system is exactly the same as that of the constructed markovian reaction network, but there would exist differences in dynamic probability behavior between the 2 networks. Fig. 2C shows that 2 dynamic distributions are different at the initial stage, but this difference gradually reduces and finally disappears with time. Fig. 2E demonstrates how molecular memory (i.e., or ) affects the stationary protein distributions, whereas Fig. 2D shows that always decreases the mean protein number but always increases this number. These numerical results imply that the effect of molecular memory is equivalent to the introduction of a feedback. For this, we give an intuitive interpretation. First, note that in simulation, we keep the average waiting times between successive reactions constant by scaling appropriately with , i.e., we keep ratios and constant. This implies that the average of waiting times remains unchanged, but their variances decrease with increasing . Second, the waiting-time distribution will collapse onto a Dirac delta function due to zero variance if tends to infinity. Third, note that the reaction event that actually occurs is the one whose waiting time is minimum. Therefore, for a fixed , if increases, the variability in birth waiting times decreases. As a result, the probability of birth events decreases and hence the effective reaction rate decreases. Similarly, if the protein-decay rate decreases with , the mean protein number will increase. Finally, we emphasize that such a memory-induced feedback stems from fluctuations in waiting times rather than changes in their means.
In order to obtain analytical results, we consider the case of and . In this case, 2 ERTs are given by with and . Note that function with has the following properties: and the derivative is less than zero, i.e., for all . Therefore, the effect of molecular memory is equivalent to the introduction of a negative feedback. In addition, we can show that the stationary protein distribution is given by with , where symbol is a confluent hypergeometric function (47), and symbol is defined as . Constants are determined by comparing the coefficients for the same power of in the equality of , where are assumed to be real (the case of complex roots can be similarly analyzed). This form of the distribution is similar to that of the stationary mRNA distribution in a stochastic gene model with a DNA loop (48).
Molecular Memory Can Induce Bimodality.
In the markovian case, gene self-regulating systems have been extensively studied, and some analytical results have been obtained (49–52). However, gene self-regulating processes are in general nonmarkovian as pointed out in the introduction, raising the question of how nonmarkovianity impacts gene-product distributions. To address this question, let us consider the following gene model: , , where represents the number of protein molecules, and and are the intrinsic waiting-time distributions for protein synthesis and degradation, which take the forms and , where is a regulation function of Hill type, is the Hill coefficient, and , (representing the feedback strength), , and are positive parameters. This setting enables us to have a description in terms of one promoter state rather than 2 switching states but also hides the fact that proteins produced by the gene in the ON state bind to the gene and take it back to the OFF state. If promoter switching between ON and OFF states is very quick and if the protein–DNA binding rate is not much larger than the unbinding rate, the use of an effective Hill-type function is reasonable (53).
Similar to the case of no regulation, 2 effective transition rates, and , can also be given analytically. Although the exact stationary protein distribution cannot be analytically given, the solution to Eq. 10, i.e., the stationary distribution of continuous variables, can be approximately expressed as (see SI Appendix for details)
| [11] | 
where is a normalization factor.
Numerical results are demonstrated in Fig. 3, where Fig. 3 A and C shows how the number of the most probable protein molecules obtained by a statistical method depends on memory index , whereas Fig. 3 B and D demonstrates that the stationary distribution predicted by Eq. 11 can well approximate that obtained by solving the sgCME. Note that corresponds to the markovian case (red dashed lines), whereas or to the nonmarkovian case. In Fig. 3 A and C, the shadowed areas represent that bimodality exists. Also note that the number of the most probable protein molecules shown in Fig. 3 A and C is in agreement with that predicted by a deterministic system (SI Appendix). From Fig. 3 B and D, we observe that bimodal protein distribution exists only for moderately large with or only for moderately small with . Since memory index determines the strength of molecular memory, Fig. 3 implies that the molecular memory only with moderate strengths can induce bimodality. Here, we also give an intuitive explanation for the demonstrated numerical results. First, the occurrence of bimodality needs appropriate nonlinearity. Second, if is far away from 1, then the system’s nonlinearity enhances. However, values of must be appropriately chosen to generate bimodal protein distributions since only the appropriate nonlinearity can lead to bimodality.
Fig. 3.
Molecular memory can induce bimodal protein expression in the presence of feedback: the gene model depicted in Fig. 2A, where depends on . Empty circles represent the results obtained by the sgCME, whereas the solid lines represent the results predicted by Eq. 11. A and B correspond to the case of , where A demonstrates the dependence of most probable protein numbers on memory index , whereas B shows stationary protein distributions. C and D correspond to the case of , where C demonstrates the dependence of most probable protein numbers on , whereas D shows stationary protein distributions. Parameter values are set as: (A and B); (C and D).
Molecular Memory Can Fine-Tune the Gene Expression Noise.
Transcription is a key step in gene expression. Biochemical processes associated with transcription often involve a variety of TFs, which regulate the promoter kinetics. For bacterial cells, promoters can exist in a surprisingly large number of regulatory states, e.g., the PRM promoter of phage lambda in Escherichia coli is regulated by 2 different TFs binding to 2 sets of 3 operators that can be brought together by looping out the intervening DNA, and, as a result, the number of regulatory states of the PRM promoter is up to 128 (54). In contrast, eukaryotic promoters are more complex, involving nucleosomes competing with or being removed by TFs (55). In addition to the conventional regulation by TFs, the eukaryotic promoters can also be epigenetically regulated via histone modifications (56–58), and such regulation may lead to very complex promoter structures (59). Given this complexity, we introduce intrinsic waiting-time distributions to model promoter kinetics. Specifically, assume that the switch times from OFF to ON and vice versa follow gamma distributions given, respectively, by and , where represents the number of mRNA molecules. In addition, assume that the waiting-time distributions for transcription and degradation are given by and . The corresponding gene model is schematically depicted in Fig. 4A.
Fig. 4.
Effect of molecular memory on gene expression. (A) Schematic representation of a model of stochastic transcription. (B) Dependence of the mean mRNA and the noise intensity on , where . (C) Dependence of the mean mRNA and the noise intensity on , where . (D) Dependence of the mean mRNA and the noise intensity on , where . (E) Dependence of the mean mRNA and the noise intensity on , where . The parameter values are set as and . This setting implies the average time between successive reactions is kept fixed by scaling appropriately with .
Let be ETRs for transition from OFF to ON, for transition from ON to OFF, for transcription, and for degradation, respectively. According to Eq. 2, we can obtain the analytical expressions of the ETRs, which are given in SI Appendix. Moreover, the analytical expression of with or with can imply that the effect of molecular memory is equivalent to the introduction of a feedback, as interpreted above. Let and be the probabilities that mRNA has molecules at states OFF and ON, respectively. Then, the corresponding sgCME takes the following form:
| [12] | 
In general, Eq. 12 has no analytical solution but can be solved numerically (see SI Appendix for details). Fig. 4 shows numerical results, where we vary memory index but always keep the constant average time between successive reactions by scaling the characteristic parameter (e.g., ) appropriately with .
From this figure, we observe that the mRNA mean is monotonically decreasing in memory index (Fig. 4B) or in memory index (Fig. 4D) if the other memory indices are set as 1, but monotonically increasing in memory index (Fig. 4C) or in memory index (Fig. 4E) if the other memory indices are set as 1. Fig. 4B shows that memory index amplifies the mRNA noise, whereas Fig. 4 C–E demonstrates that the other 3 memory indices can reduce the mRNA noise. In a word, Fig. 4 indicates that molecular memory plays an unneglectable role in gene expression.
Molecular Memory Can Induce Switch.
Recall that a toggle-switch network (Fig. 5A) can model the cross-repression between the determinants of different cellular states, which can result in a definite choice between 2 outcomes (60–62). Conventional models of genetic toggle switch consider exponential waiting-time distributions. However, the expression of a gene in general involves a multistep process. Indeed, transcriptional repressor monomer (A or B) binds first to dimers and then to specific DNA sequences near the promoter, repressing the production of transcriptional repressor monomer (B or A). This multistep process can lead to nonexponential waiting times, creating a memory between individual reaction events. Here, we consider a generalized model of genetic toggle switch, which is schematically shown in Fig. 5A with 4 reactions listed in Fig. 5B, where and are intrinsic waiting-time distributions for the synthesis and degradation of protein A and protein B, respectively, and with and representing the numbers of protein A and protein B molecules, respectively. Assume that these waiting-time distributions are given by with , ; with , . Note that and correspond to the markovian case, whereas or corresponds to the nonmarkovian case.
Fig. 5.
(A) Schematic representation of a genetic toggle-switch model with molecular memory, where 2 genes are repressed by each other. (B) Four reactions corresponding to A, where waiting times for synthesis and degradation of each protein follow distributions. Default parameter values are taken as: . (C and F) Joint distributions of proteins A and B, obtained by a numerical algorithm (SI Appendix). (D and G) Heat maps in the plane of protein A and B. (E and H) Time series of the difference between the levels of protein A and B, obtained by sgGA. C–E correspond to exponential waiting times, where parameter values are set as . F–H correspond to nonexponential waiting times, where parameter values are set as . Prob., probability.
Numerical results are demonstrated in Fig. 5 C–H, where Fig. 5 C–E corresponds to the case of exponential waiting times, whereas Fig. 5 F–H to the case of nonexponential waiting times. We observe that if the waiting times for synthesis of protein A and B follow exponential distributions (i.e., if we set ), the steady-state joint distribution of proteins A and B is unimodal, referring to Fig. 5 C and D. However, if the waiting times for synthesis of protein A and B follow nonexponential distributions (e.g., if we set ), the steady-state joint distribution of proteins A and B is bimodal, referring to Fig. 5 F and G. To examine the time dependence of the populations of 2 proteins in a single cell, we first perform stochastic simulations with a numerical algorithm (see SI Appendix for details) and then calculate the difference between the levels of proteins A and B. Numerical results are shown in Fig. 5 E and H. Comparing Fig. 5E with Fig. 5H, we find that 2 switching states occur only in the case of nonexponential waiting times or molecular memory. Thus, we conclude from Fig. 5 that molecular memory can induce bimodal distributions in the toggle-switch model depicted in Fig. 5A or Fig. 5B.
Discussion
Previous studies of biochemical-reaction processes on networks are mainly based on markovian (i.e., memoryless) hypothesis. However, as soon as a reactant interacts with its environment, the effect of molecular memory cannot be neglected. We have derived an exact sgCME, an sgLNA, and an sgFPE for a general biochemical-reaction network with molecular memory characterized by nonexponential waiting-time distributions. These derived equations allow one to retain analytical and/or numerical tractability, being general in scope, and thus are of a potential applicability in a wide variety of problems that transcend pure physics applications. The derived sgCME is particularly useful in finding stationary distributions in a number of nonmarkovian biochemical systems, as demonstrated in this article. Analysis of stochastic gene expression examples has indicated that the sgCME can help us find new biological knowledge, e.g., the effect of molecular memory is equivalent to the introduction of a feedback, and molecular memory can induce bimodality, although the distribution is not bimodal in the corresponding markovian case. The power of the sgCME can be enhanced by analyzing other examples, such as nonmarkovian random walks and diffusion on networks (21, 63–65) and nonmarkovian open quantum systems (66).
Our general theory can reproduce some known results for queuing models of biological processes. First, recall that Pedraza and Paulsson (6) analyzed a (including ) model of gene expression with a general queuing waiting-time distribution for the arrival of bursts and an exponential waiting-time distribution for the decay of mRNAs and derived an approximate formula for the mRNA noise. In SI Appendix, we have used the above theory to reproduce this formula. Second, environmental perturbations or external noise, which is often inevitable in cellular processes, can be modeled with time delay (20). We have derived the analytical expressions of effective transition rates and further established the corresponding gLNA (see SI Appendix for details). Functionally, this gLNA may be analogous to the chemical fluctuation theorem for models of gene expression (19), where subscript “” represents that the corresponding waiting-time distributions are time-varying.
Our theoretical framework can also be used in the inference of the structure and parameters involved in system modes for a broad class of nonmarkovian biochemical-reaction processes on networks. For example, the structure of gene promoters and their kinetics, which would be complex due to, e.g., TF regulation, can be inferred based on experimental data. In fact, we can first infer the key parameter in the Erlang waiting-time distribution from experimental data, since it can represent the number of small, difficultly specified reaction steps involved in transitions from ON to OFF states or vice versa, implying that the promoter structure can be determined. Then, we can use the standard method (e.g., the maximum likelihood estimation) to infer the values of other parameters from the experimental data, such as the mean switching rates between ON and OFF states, the mean transcription or translational rate. These inferred kinetic parameters in turn determine promoter kinetics and gene-expression dynamics. Furthermore, the sgCME can be used in the analysis of the corresponding stationary probabilistic behavior. In a word, we expect that our analytical framework will be of use for studying a variety of phenomena in biological and physical sciences and, indeed, in other areas where individual-based models with general waiting-time distributions and/or delayed interactions are relevant.
In the realistic world, “non-Markov is the rule, Markov is the exception,” as remarked by N. G. van Kampen (67). A stochastic process (i.e., the biological phenomenon evolving in time) may be or may not be markovian, depending on the variables used to describe it. If all of the variables are observable or measurable, the process is markovian. In general, however, this is impractical and even impossible. Therefore, most of real stochastic processes we observe are nonmarkovian. To model real stochastic processes with some unobservable variables, many different methods of modeling have been proposed, e.g., queuing models (6, 7, 17–19), delay models (5, 28), Langevin equations with color noise (68), and CTRW models (20–22, 26). Correspondingly, some simulation algorithms have also been developed, e.g., those based on general renewal processes (30–33) and the one by introducing some exogenously reaction channels (38). These approaches, despite their own advantages, have finite applications, e.g., queuing models are inconvenient to treating bi- or multimolecular reaction networks.
Finally, we point out that CTRWs used in our theory incorporate the timing of move, where a random walker waits between 2 moves for a duration that independently follows an intrinsic waiting-time distribution. In other words, the move events are generated by a renewal process. On the other hand, CTRWs can be categorized into the 2 classes of active CTRWs and passive CTRWs, depending on whether a random walker actively initializes them as it travels or passively follows states when available (21). In active CTRWs, the interevent time of a state is reinitialized when a random walker lands on it. In passive CTRWs to which queuing models correspond, however, the interevent time of a state is not reset, and the waiting time depends on the last activation time. Usually, active CTRWs generates interevent times from a given PDF, more suitable to the analytical study of random processes, whereas passive CTRWs use interevent times observed in real data, less favorable to model and analyze. Based on the active CTRW framework, we have established a set of theories for a general biochemical network with arbitrary intrinsic waiting-time distributions. Our analysis of generalized birth and death processes based on the passive CTRW framework (SI Appendix) has provided a general thinking by establishing the relationship between the active and passive CTRWs.
Supplementary Material
Acknowledgments
T.S. was supported by National Natural Science Foundation of China (Grants 11932019, 11775314, and 91530320). J.J. was supported by National Natural Science Foundation of China (Grants 11475273 and 11631005), Science and Technology Program of Guangzhou (Grant 201707010117), and Guangdong Key Research and Development Project 2019B0233002.
Footnotes
The authors declare no competing interest.
This article is a PNAS Direct Submission. H.Q. is a guest editor invited by the Editorial Board.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1913926116/-/DCSupplemental.
References
- 1.Gardiner C., Stochastic Methods-A Handbook for the Natural and Social Sciences (Springer, New York, NY, 2009). [Google Scholar]
- 2.Van Kampen N. G., Stochastic Processes in Physics and Chemistry (North-Holland, Amsterdam, The Netherlands, 2007). [Google Scholar]
- 3.Pardoux E., Markov Processes and Applications: Algorithms, Networks, Genome and Finance (Wiley & Sons, New York, NY, 2008). [Google Scholar]
- 4.Andersson H., Britton T., Stochastic Epidemic Models and Their Statistical Analysis (Springer, New York, 2000). [Google Scholar]
- 5.Barrio M., Leier A., Marquez-Lago T. T., Reduction of chemical reaction networks through delay distributions. J. Chem. Phys. 138, 104114 (2013). [DOI] [PubMed] [Google Scholar]
- 6.Pedraza J. M., Paulsson J., Effects of molecular memory and bursting on fluctuations in gene expression. Science 319, 339–343 (2008). [DOI] [PubMed] [Google Scholar]
- 7.Jia T., Kulkarni R. V., Intrinsic noise in stochastic models of gene expression with molecular memory and bursting. Phys. Rev. Lett. 106, 058102 (2011). [DOI] [PubMed] [Google Scholar]
- 8.Nishinari K., Okada Y., Schadschneider A., Chowdhury D., Intracellular transport of single-headed molecular motors KIF1A. Phys. Rev. Lett. 95, 118101 (2005). [DOI] [PubMed] [Google Scholar]
- 9.Basu A., Chowdhury D., Traffic of interacting ribosomes: Effects of single-machine mechanochemistry on protein synthesis. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 75, 021902 (2007). [DOI] [PubMed] [Google Scholar]
- 10.Harper C. V., et al. , Dynamic analysis of stochastic transcription cycles. PLoS Biol. 9, e1000607 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Salathé M., et al. , A high-resolution human contact network for infectious disease transmission. Proc. Natl. Acad. Sci. U.S.A. 107, 22020–22025 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Corral A., Long-term clustering, scaling, and universality in the temporal occurrence of earthquakes. Phys. Rev. Lett. 92, 108501 (2004). [DOI] [PubMed] [Google Scholar]
- 13.Stumpf P. S., et al. , Stem cell differentiation as a non-Markov stochastic process. Cell Syst. 5, 268–282.e7 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Suter D. M., et al. , Mammalian genes are transcribed with widely different bursting kinetics. Science 332, 472–474 (2011). [DOI] [PubMed] [Google Scholar]
- 15.Guérin T., Bénichou O., Voituriez R., Non-Markovian polymer reaction kinetics. Nat. Chem. 4, 568–573 (2012). [DOI] [PubMed] [Google Scholar]
- 16.Barabási A. L., The origin of bursts and heavy tails in human dynamics. Nature 435, 207–211 (2005). [DOI] [PubMed] [Google Scholar]
- 17.Schwabe A., Rybakova K. N., Bruggeman F. J., Transcription stochasticity of complex gene regulation models. Biophys. J. 103, 1152–1161 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Kumar N., Singh A., Kulkarni R. V., Transcriptional bursting in gene expression: Analytical results for general stochastic models. PLoS Comput. Biol. 11, e1004292 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Park S. J., et al. , The Chemical Fluctuation Theorem governing gene expression. Nat. Commun. 9, 297 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Aquino T., Dentz M., Chemical continuous time random walks. Phys. Rev. Lett. 119, 230601 (2017). [DOI] [PubMed] [Google Scholar]
- 21.Masuda N., Porter M. A., Lambiotte R., Random walks and diffusion on networks. Phys. Rep. 716, 1–58 (2017). [Google Scholar]
- 22.Kutner R., Masoliver J., The continuous time random walk, still trendy: Fifty-year history, state of art, and outlook. Eur. Phys. J. B 90, 50 (2017). (A collection of papers in this special issue). [Google Scholar]
- 23.Black A. J., McKane A. J., Nunes A., Parisi A., Stochastic fluctuations in the susceptible-infective-recovered model with distributed infectious periods. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 80, 021922 (2009). [DOI] [PubMed] [Google Scholar]
- 24.Van Mieghem P., van de Bovenkamp R., Non-Markovian infection spread dramatically alters the susceptible-infected-susceptible epidemic threshold in networks. Phys. Rev. Lett. 110, 108701 (2013). [DOI] [PubMed] [Google Scholar]
- 25.Starnini M., Gleeson J. P., Boguñá M., Equivalence between non-Markovian and Markovian dynamics in epidemic spreading processes. Phys. Rev. Lett. 118, 128301 (2017). [DOI] [PubMed] [Google Scholar]
- 26.Jo H. H., Perotti J. I., Kaski K., Kertész J., Analytically solvable model of spreading dynamics with non-Poissonian processes. Phys. Rev. X 4, 011041 (2014). [Google Scholar]
- 27.Kiss I. Z., Röst G., Vizi Z., Generalization of pairwise models to non-Markovian epidemics on networks. Phys. Rev. Lett. 115, 078701 (2015). [DOI] [PubMed] [Google Scholar]
- 28.Leier A., Marquez-Lago T. T., Delay chemical master equation: Direct and closed-form solutions. Proc. Math. Phys. Eng. Sci. 471, 20150049 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Brett T., Galla T., Stochastic processes with distributed delays: Chemical Langevin equation and linear-noise approximation. Phys. Rev. Lett. 110, 250601 (2013). [DOI] [PubMed] [Google Scholar]
- 30.Gillespie D. T., Monte Carlo simulation of random walks with residence time dependent transition probability rates. J. Comput. Phys. 28, 395–407 (1978). [Google Scholar]
- 31.Boguñá M., Lafuerza L. F., Toral R., Serrano M. Á., Simulating non-Markovian stochastic processes. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 90, 042108 (2014). [DOI] [PubMed] [Google Scholar]
- 32.Vestergaard C. L., Génois M., Temporal Gillespie algorithm: Fast simulation of contagion processes on time-varying networks. PLoS Comput. Biol. 11, e1004579 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Masuda N., Rocha L. E. C., A Gillespie algorithm for non-Markovian stochastic processes. SIAM Rev. 60, 95–115 (2018). [Google Scholar]
- 34.Bratsun D., Volfson D., Tsimring L. S., Hasty J., Delay-induced stochastic oscillations in gene regulation. Proc. Natl. Acad. Sci. U.S.A. 102, 14593–14598 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Barrio M., Burrage K., Leier A., Tian T., Oscillatory regulation of Hes1: Discrete stochastic delay modelling and simulation. PLoS Comput. Biol. 2, e117 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Anderson D. F., A modified next reaction method for simulating chemical systems with time dependent propensities and delays. J. Chem. Phys. 127, 214107 (2007). [DOI] [PubMed] [Google Scholar]
- 37.Cai X., Exact stochastic simulation of coupled chemical reactions with delays. J. Chem. Phys. 126, 124108 (2007). [DOI] [PubMed] [Google Scholar]
- 38.Voliotis M., Thomas P., Grima R., Bowsher C. G., Stochastic simulation of biomolecular networks in dynamic environments. PLoS Comput. Biol. 12, e1004923 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Kenkre V. M., Montroll E. W., Shlesin M. F., Generalized master equations for continuous-time random walks. J. Stat. Phys. 9, 45–50 (1973). [Google Scholar]
- 40.Landman U., Montroll E. W., Shlesinger M. F., Random walks and generalized master equations with internal degrees of freedom. Proc. Natl. Acad. Sci. U.S.A. 74, 430–433 (1977). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Gillespie D. T., A general method for numerically simulating the stochastic time evolution of coupled chemical reactions. J. Comput. Phys. 22, 403–434 (1976). [Google Scholar]
- 42.Feller W., “Special densities. Randomization” in An Introduction to Probability Theory and its Applications (John Wiley & Sons, New York, NY, 2008), vol. 2, p. 47. [Google Scholar]
- 43.Thomas P., Straube A. V., Grima R., The slow-scale linear noise approximation: An accurate, reduced stochastic description of biochemical networks under timescale separation conditions. BMC Syst. Biol. 6, 39 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Schnoerr D., et al. , Approximation and inference methods for stochastic biochemical kinetics-a tutorial review. J. Phys. A Math. Theor. 50, 093001 (2017). [Google Scholar]
- 45.Angstmann C. N., et al. , Generalized continuous time random walks, master equations, and fractional Fokker-Planck equations. SIAM J. Appl. Math. 75, 1445–1468 (2015). [Google Scholar]
- 46.Green M. R., Eukaryotic transcription activation: Right on target. Mol. Cell 18, 399–402 (2005). [DOI] [PubMed] [Google Scholar]
- 47.Lebedev N. N., “Hypergfome functions” in Special Functions and Their Applications (Dover, New York, NY, 1972), pp. 238–280. [Google Scholar]
- 48.Zhang J., Zhou T., Promoter-mediated transcriptional dynamics. Biophys. J. 106, 479–488 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Ge H., Qian H., Xie X. S., Stochastic phenotype transition of a single cell in an intermediate region of gene state switching. Phys. Rev. Lett. 114, 078101 (2015). [DOI] [PubMed] [Google Scholar]
- 50.Grima R., Schmidt D. R., Newman T. J., Steady-state fluctuations of a genetic feedback loop: An exact solution. J. Chem. Phys. 137, 035104 (2012). [DOI] [PubMed] [Google Scholar]
- 51.Friedman N., Cai L., Xie X. S., Linking stochastic dynamics to population distribution: An analytical framework of gene expression. Phys. Rev. Lett. 97, 168302 (2006). [DOI] [PubMed] [Google Scholar]
- 52.Cao Z., Grima R., Linear mapping approximation of gene regulatory networks with stochastic dynamics. Nat. Commun. 9, 3305 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Holehouse J., Grima R., Revisiting the reduction of stochastic models of genetic feedback loops with fast promoter switching. Biophys. J. 117, 1311–1330 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Vilar J. M. G., Saiz L., CplexA: A mathematica package to study macromolecular-assembly control of gene expression. Bioinformatics 26, 2060–2061 (2010). [DOI] [PubMed] [Google Scholar]
- 55.Hornung G., et al. , Noise-mean relationship in mutated promoters. Genome Res. 22, 2409–2417 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Halme A., Bumgarner S., Styles C., Fink G. R., Genetic and epigenetic regulation of the FLO gene family generates cell-surface variation in yeast. Cell 116, 405–415 (2004). [DOI] [PubMed] [Google Scholar]
- 57.Octavio L. M., Gedeon K., Maheshri N., Epigenetic and conventional regulation is distributed among activators of FLO11 allowing tuning of population-level heterogeneity in its expression. PLoS Genet. 5, e1000673 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Weinberger L., et al. , Expression noise and acetylation profiles distinguish HDAC functions. Mol. Cell 47, 193–202 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Stavreva D. A., Varticovski L., Hager G. L., Complex dynamics of transcription regulation. Biochim. Biophys. Acta 1819, 657–666 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Gardner T. S., Cantor C. R., Collins J. J., Construction of a genetic toggle switch in Escherichia coli. Nature 403, 339–342 (2000). [DOI] [PubMed] [Google Scholar]
- 61.Lipshtat A., Loinger A., Balaban N. Q., Biham O., Genetic toggle switch without cooperative binding. Phys. Rev. Lett. 96, 188101 (2006). [DOI] [PubMed] [Google Scholar]
- 62.Biancalani T., Assaf M., Genetic toggle switch in the absence of cooperative binding: Exact results. Phys. Rev. Lett. 115, 208101 (2015). [DOI] [PubMed] [Google Scholar]
- 63.Scholtes I., et al. , Causality-driven slow-down and speed-up of diffusion in non-Markovian temporal networks. Nat. Commun. 5, 5024 (2014). [DOI] [PubMed] [Google Scholar]
- 64.Gleeson J. P., O’Sullivan K. P., Banos R. A., Moreno Y., Effects of network structure, competition and memory time on social spreading phenomena. Phys. Rev. X 6, 021019 (2016). [Google Scholar]
- 65.Delvenne J. C., Lambiotte R., Rocha L. E. C., Diffusion on networked systems is a question of time or structure. Nat. Commun. 6, 7366 (2015). [DOI] [PubMed] [Google Scholar]
- 66.Vega I. D., Alonso D., Dynamics of non-Markovian open quantum systems. Rev. Mod. Phys. 89, 015001 (2017). [Google Scholar]
- 67.van Kampen N. G., Remarks on non-Markov processes. Braz. J. Phys. 28, 90 (1998). [Google Scholar]
- 68.Łuczka J., Non-markovian stochastic processes: Colored noise. Chaos 15, 26107 (2005). [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.





