Skip to main content
PLOS ONE logoLink to PLOS ONE
. 2014 Feb 13;9(2):e88167. doi: 10.1371/journal.pone.0088167

Constructing the Energy Landscape for Genetic Switching System Driven by Intrinsic Noise

Cheng Lv 1,#, Xiaoguang Li 2,#, Fangting Li 1,3,*, Tiejun Li 2,4,*
Editor: Derek Abbott5
PMCID: PMC3923795  PMID: 24551081

Abstract

Genetic switching driven by noise is a fundamental cellular process in genetic regulatory networks. Quantitatively characterizing this switching and its fluctuation properties is a key problem in computational biology. With an autoregulatory dimer model as a specific example, we design a general methodology to quantitatively understand the metastability of gene regulatory system perturbed by intrinsic noise. Based on the large deviation theory, we develop new analytical techniques to describe and calculate the optimal transition paths between the on and off states. We also construct the global quasi-potential energy landscape for the dimer model. From the obtained quasi-potential, we can extract quantitative results such as the stationary distributions of mRNA, protein and dimer, the noise strength of the expression state, and the mean switching time starting from either stable state. In the final stage, we apply this procedure to a transcriptional cascades model. Our results suggest that the quasi-potential energy landscape and the proposed methodology are general to understand the metastability in other biological systems with intrinsic noise.

Introduction

Stochasticity is an inherent property of living cells. Especially when the low copy number of species like the DNA and mRNA are taken into account, stochastic fluctuations can become significant and may qualitatively affect the behavior of the whole system [1], [2]. To deal with these fluctuations, cells have evolved many mechanisms, of which genetic switch is a typical example. Cellular systems performing genetic switches usually consists of one positive feedback or double negative feedbacks [3], [4]. Depending on the robustness of the feedbacks, cells can perform switches either spontaneously or on call [5].

Previous kinetic studies about cellular stochasticity have been formulated by using the generating function [6], system size expansion [7], [8], large deviation theory (LDT) [9][13], or by employing WKB approximation to the chemical master equations (CMEs) [14], [15], etc. However, only few of them take transcriptional noise into account explicitly. Some recent studies have shown that correlations between mRNA and protein levels do not always perform equally well in revealing genetic regulatory relationships [16], [17], and the involvement of mRNA has a large effect on the switching times [18], [19]. On the other hand, ever since Waddington’s “epigenetic landscape” proposed in 1957 [20], the energy landscape have been widely used to provide intuitive illustration of the dynamics and evolution of genetic regulatory systems [1], [11], [21]. Thus it is important and desired to have an approach which can effectively determine the key features of a noisy gene expression system, such as constructing the corresponding “Waddington potential”, identifying the transition paths between metastable states and computing the transition rates, etc.

In this paper, we present a methodology to understand the metastability of the genetic switches in gene expression driven by the intrinsic noise based on LDT for Markov processes [22][24]. By explicitly taking mRNA noise into account, we obtain the most probable transition paths for off-to-on and on-to-off genetic switches through the geometric minimum action method (gMAM) [25]. Furthermore, we construct the global quasi-potential energy landscape, which is the rationalized version of the Waddington potential in this context. Based on the obtained quasi-potential, we obtain quantitative results for transition rates between metastable states and the intrinsic noise strength of gene expression state. We also consider the reduction of redundant dimensions if we are only interested in the energy landscape for partial components of the whole system. We successfully apply this methodology to a transcriptional cascades model. The relation between our and other approaches in literature is also discussed. From the authors’ opinion, this framework is generally applicable for studying transitions between stable-saddle-stable fixed points with jump type noise generated by Gillespie type birth-death dynamics [26]. The quasi-potential energy landscape will be a powerful tool to unravel the metastable properties in more general biological processes.

Models and Methods

We illustrate our method through a specific two-state gene expression model in Fig. 1. We will refer to it as the “dimer model” throughout the remainder of this paper. In this model, the gene at the active state transcribes mRNA with a much larger rate than it at the inactive state. Proteins translated from mRNA can aggregate into dimers that bind to the promotor site of the gene via a positive feedback. All of the processes are modeled as elementary reactions and all reaction rates are rescaled by protein decay rate (i.e. we will set Inline graphic unless stated otherwise). Here we assume that the reaction rates of dimer binding and dropping from DNA are much larger than the other reactions.

Figure 1. The autoregulatory dimer model with positive feedback.

Figure 1

Promoter transitions are regulated by the dimerized transcription factor with rate Inline graphic and Inline graphic. Inline graphic is the transcription rate of active promotor, with a very small transcription rate of inactive promotor Inline graphic. Inline graphic is kinetic rate of translation, Inline graphic and Inline graphic are degradation rates of mRNA and protein, Inline graphic and Inline graphic are the rates of dimerization and de-dimerization. All the processes are modeled as elementary reactions and all reaction rates are rescaled by the protein decay rate (i.e. Inline graphic unless stated otherwise).

Classical Methods and Issues

The deterministic mean-field description of this dimer model through quasi-steady state approximation (QSSA) yields the ODEs.

Inline graphic

Inline graphic

Inline graphic

where Inline graphic, Inline graphic and Inline graphic are the mean number of the mRNA, protein and dimer respectively, and the parameters are shown in Fig. 1. This system has two stable fixed points and one saddle in physically reasonable parameter regime (see Text S1:I). These two stable fixed points correspond to the expressed and unexpressed states at which the copy number of proteins is at high or low state, respectively. With this deterministic description, once the system settles in one of its two attractive fixed points, it will stay there forever. However, in the presence of intrinsic noise, the system will fluctuate around its attractive fixed points and switch between these two metastable states on a large timescale. This type of switching among metastable states under small perturbations is exactly the rare events studied in the literature. The large deviation theory (LDT) is an appropriate tool to quantitatively describe the rare transitions [23], [25], [27]. Roughly it tells that when the system size Inline graphic is sufficiently large, the probability that the trajectory of the stochastic dynamics Inline graphic stays in a small Inline graphic-neighborhood around a specific path Inline graphic can be given as

graphic file with name pone.0088167.e022.jpg (1)

where Inline graphic is called the rate functional. Thus the most probable transition path can be obtained by minimizing Inline graphic associated with the Lagrangian the Inline graphic. Our task is to find the Inline graphic for specific models. For Gillespie type birth-death dynamics, Inline graphic has no closed form and only its dual Hamiltonian can be obtained in the large volume limit Inline graphic, i.e. the number of all types of molecules goes to infinity. However, this approach encounters difficulty if we take the DNA switching into consideration since there is only one DNA copy in the considered model. Thus the straightforward utilization of the existed Hamiltonian in the large volume limit is invalid here.

Large Deviation Theory

To solve this issue, we develop the LDT directly for this specific system following the way in [28] with further extension. The biologically relevant choice of parameters in our model suggests the scaling Inline graphic, Inline graphic and the others are Inline graphic, where Inline graphic is the system size which is usually chosen as the typical number of proteins in the expressed state. This is also the correct scaling under which the mean field limit of the CMEs gives the ODE system derived from QSSA (see Text S1:I). Define the rescaled concentration variable Inline graphic where Inline graphic and Inline graphic is the state vector for the number of mRNAs, proteins and dimers. Correspondingly define Inline graphic and Inline graphic to transform all the parameters to Inline graphic magnitude. However for notational simplicity we will drop the tilde symbol on these parameters in the rest of the paper. It turns out that the Lagrangian of our model has the form.

graphic file with name pone.0088167.e039.jpg (2)

which combines the LDT result for large volume limit as shown in (1) and the Donsker-Varadhan type LDT result [22], [29] for DNA fast switching. Here Inline graphic resembles the velocity in classical mechanics, Inline graphic is a probabilistic 2-vector describes the residence distribution of DNA at the inactive or active state.

Let us illustrate the construction of (2) via an intuitive way as follows. The net effect of the fast switching of DNA induces a residence distribution Inline graphic with components Inline graphic and Inline graphic characterizing the probability of DNA staying at the inactive and active states, respectively. Whenever DNA is at the inactive or active state, we can apply the traditional LDT result in the large volume limit. This leads to the first part Inline graphic in (2). The second part describes how the visiting distribution induced by the random fast switching of DNA is close to a prescribed residence distribution Inline graphic given the current state Inline graphic. This is exactly what the Donsker-Varadhan type LDT gives [22], [29], which is described by Inline graphic. Since we are only interested in the LDT for the state variable Inline graphic, the overall Lagrangian should be taken infimum with respect to all of the possible residence distributions Inline graphic. All of the statements will be made clear in the continued paragraphs.

Similar as the case in the large volume limit, it is not feasible to get the explicit form of the Lagrangian Inline graphic in general, but its dual Hamiltonian Inline graphic can be available, where Inline graphic is the generalized momentum conjugate to Inline graphic as in classical mechanics. They are connected via the Legendre transform.

Inline graphic

Inline graphic

Inline graphic

Inline graphic

graphic file with name pone.0088167.e059.jpg (3)

For the Gillespie type birth-death process with Inline graphic reaction channels and the propensity function Inline graphic and stoichiometric vector Inline graphic for Inline graphic, the existed LDT result gives the Hamiltonian [23].

graphic file with name pone.0088167.e064.jpg (4)

Specifically in our model we have.

graphic file with name pone.0088167.e065.jpg (5)

with the Hamiltonian

graphic file with name pone.0088167.e066.jpg (6)

when the DNA is at the inactive state and

graphic file with name pone.0088167.e067.jpg (7)

when the DNA is at the active state. Here Inline graphic and Inline graphic corresponds to the part of the Hamiltonian for the gene expression, i.e. the transcription and translation processes. On the other hand, the famous Donsker-Varadhan LDT gives the Lagrangian

graphic file with name pone.0088167.e070.jpg (8)

where Inline graphic is any 2-vector and Inline graphic is the infinitesimal generator for the DNA two-states jumping process at a given state Inline graphic defined as

graphic file with name pone.0088167.e074.jpg (9)

The direct calculation shows that

graphic file with name pone.0088167.e075.jpg (10)

Combining Eqs. (3), (5), (6), (7) and (10), we obtain the final explicit Hamiltonian by optimization.

graphic file with name pone.0088167.e076.jpg (11)

where Inline graphic and Inline graphic. This derivation can be easily extended to similar problems.

It is worth noting that one can show the Hessian of Inline graphic with respect to Inline graphic has the form.

graphic file with name pone.0088167.e081.jpg (12)

where Inline graphic, Inline graphic and Inline graphic Since Inline graphic are positive when Inline graphic are positive, the decomposition in (12) shows that the Hessian is positive definite. This means the Hamiltonian is convex with respect to Inline graphic in physically meaningful domain. It is quite different from that obtained by WKB asymptotics [15] (See Text S1: II). The convexity of the Hamiltonian is testified to be essential for the robustness and efficiency of the numerical algorithm [25] both theoretically and practically. It can be rigorously proved that it is a natural by-product from the LDT analysis.

Quasi-Potential and Optimal Transition Path

With the obtained LDT, we can get the optimal transition path through variational optimization. Moreover, we can define the local quasi-potential Inline graphic with respect to a meta-stable state Inline graphic as.

graphic file with name pone.0088167.e090.jpg (13)

From classical mechanics, the local quasi-potential Inline graphic satisfies a steady-state Hamilton-Jacobi equation characterized by the Hamiltonian shown in Equation (11).

graphic file with name pone.0088167.e092.jpg (14)

The LDT also confirms the equilibrium distribution of the system through a global quasi-potential function Inline graphic.

graphic file with name pone.0088167.e094.jpg (15)

where the symbol Inline graphic means the equality relation on a logarithmic scale, and Inline graphic can be obtained from its local version Inline graphic by a suitable sticking procedure which we will describe in the next subsection. This function Inline graphic, which naturally serves as a rationalized version of the Waddington potential, is one main point of this article.

The classical Hamilton-Jacobi theory enables one to solve the local quasi-potential Inline graphic satisfying (14) with variational methods. Here we employ the powerful geometric minimum action method (gMAM) proposed in [25] to compute Inline graphic by minimizing the action functional with a prescribed Hamiltonian (11). The key idea of gMAM is essentially the Maupertuis principle in classical mechanics, which reformulates the action functional on the space of curves with intrinsic parameter, thus frees the time variable in the minimization process and still keeps its efficiency in high dimensions. This approach also resolves the issue of the singular boundary value problem by solving Hamilton’s equations directly [15], [30]. Specifically after each run of gMAM with fixed starting and ending points, one obtains the minimized action Inline graphic and the corresponding optimal path. The readers may be referred to Text S1:IV for more details.

It is worth asking whether the choice of the large parameter Inline graphic affects the final results since any choice is artificial in practice. An affirmative answer is given in Text S1:III that only the scaling matters and the final systems are equivalent with respect to different choices of the large parameter Inline graphic.

Construction of Global Quasi-Potential Energy Landscape

Based on the obtained local quasi-potential Inline graphic starting from the on and off states, we may construct the global quasi-potential energy landscape for genetic switching model by sticking them together. The system with only two metastable states and one saddle point, as our dimer model, can be handled conveniently as the way shown below. The readers may refer to [24] for systematic methods of sticking the global quasi-potential for more complex systems.

In our dimer model, we first compute the local quasi-potential Inline graphic starting from two metastable states Inline graphic and Inline graphic. We define Inline graphic and Inline graphic, where Inline graphic is the saddle point. Denote Inline graphic. Suppose Inline graphic, then the global quasi-potential Inline graphic is given by.

graphic file with name pone.0088167.e114.jpg (16)

otherwise Inline graphic has the form

graphic file with name pone.0088167.e116.jpg (17)

It is not difficult to observe that sticking the two local quasi-potential via the linking saddle Inline graphic is the key point in this construction.

In most cases, the considered system is in high dimensions while we are only interested in partial variables which is in low dimensions. This is also the case in our dimer model. Although the global quasi-potential Inline graphic is in three dimensions, we are mainly interested in its 2D configuration in the mRNA-Protein plane. So we need to reduce the redundant dimension Inline graphic to obtain a 2-D potential Inline graphic. We proceed with the following arguments.

According to the LDT analysis (15), we obtain.

graphic file with name pone.0088167.e121.jpg (18)

With the same reason, we have

graphic file with name pone.0088167.e122.jpg (19)

where Inline graphic is the reduced distribution for mRNA and protein. By definition, this distribution is given by

graphic file with name pone.0088167.e124.jpg (20)

From the Laplace asymptotics [29] we get a simple reduction strategy.

graphic file with name pone.0088167.e125.jpg (21)

This argument is general for any high dimensional situations and indeed it is also applied to the noise cascading model considered in our later text.

Results

Optimal Transition Path

The large-deviation theory predicts that when events with little likelihood occurs, they will follow the optimal transition path which minimizes the action (13) with high probability. The probability of those paths deviated from the optimal one will decay exponentially. By choosing two stable states as the starting and ending points respectively, we obtained the switching path from either of the two states (see Fig. 2). For the convenience of visualization, we project the transition paths onto the mRNA-protein plane.

Figure 2. Switching paths (A) from off to on state (purple solid curve) and (B) from on to off state (red solid curve) and MC simulations for both switching trajectories.

Figure 2

We take the two stable fixed points in the deterministic dynamics as the starting and ending points. Darkness of the shading points represents the number of visits for reactive trajectories with smoothing. (C) Averaged switching trajectories from MC simulation. For each number of protein, we average in the mRNA dimension using probability as weight. Here the statistical results around each stable state is not shown because of the restrictions by our MC simulation algorithm (see Text SI:VI-A). The results are obtained from 1000 independent long time MC simulations. The parameters here are Inline graphic, Inline graphic, Inline graphic, Inline graphic, Inline graphic, Inline graphic, Inline graphic, Inline graphic, and Inline graphic

Figure 2 shows clearly that when switch occurs, the trajectory prefers to be around the most probable path characterized by the Hamiltonian (11). The fact that the off-to-on and on-to-off paths are not identical agrees with the previous studies that the switching process is irreversible. The irreversibility is fundamental in chemical reaction kinetics due to the non-gradient nature of the considered system and can be considered as a form of hysteresis. However, in contrast with the previous study [11], our results indicate that when the noise level goes to zero, both optimal transition paths pass through the same bottleneck, i.e. the saddle point obtained from the corresponding deterministic model. This suggests that the saddle point has the lowest barrier height along its stable manifold and is in accordance with the energy landscape shown later.

Global Quasi-potential Energy Landscape

Applying the method of constructing global quasi-potential, we can compute the 2-D potential Inline graphic for our dimer model. The result is shown in Fig. 3.

Figure 3. Quasipotential energy landscape of the whole genetic switching system with (A) two and (B) three dimensional view as well as switching paths between two stable fixed points.

Figure 3

Each path passes through the saddle point. Here, the parameters are the same as in Fig. 2.

In Fig. 3, we observe that the on and off states correspond to two local minimum on the quasi-potential energy landscape, the saddle of the deterministic dynamical system exactly corresponds to the saddle point on the quasi-potential energy landscape too. The flatness along the mRNA direction keeps in good accordance with the large fluctuation observed in the reactive trajectories.

To further characterize the switching path, we denote the first half (i.e. the part between the starting point and the saddle point) as the uphill path and the latter half as the downhill path. One may note that the transition path is also given by the Hamilton’s equations Inline graphic, Inline graphic. Therefore based on the fact Inline graphic, we obtained Inline graphic when Inline graphic. At the saddle point in any transition path, we have Inline graphic [25], and thus Inline graphic along the whole downhill path. With this result we obtain the downhill equations Inline graphic, which exactly corresponds to the corresponding deterministic dynamics. This fact explains that after climbing the saddle point the biological system relaxes to its attracting state fast without costing any action. This fact was also pointed out in [30][32].

On the other hand, the Hamilton-Jacobi theory also yields the uphill dynamics.

graphic file with name pone.0088167.e144.jpg (22)

It is difficult to give a thorough understanding about the whole uphill path because of the general nonlinearity of Inline graphic. However, an analysis around the critical points is instructive. Based on the fact Inline graphic at critical points (i.e. the metastable states and saddle), we have by Taylor expansion.

graphic file with name pone.0088167.e147.jpg (23)

Recall that Inline graphic corresponds to the deterministic mean field ODEs, the equation (23) is exactly the uphill path of a chemical Langevin dynamics [33] (See more details in Text S1:VI). However, this chemical Langevin dynamics is not a straightforward generalization from the mean field ODEs like the usual large volume limit. Indeed, this reflects the specialty in our model setup which is related to the DNA fast switching. To see this more concretely, we have the approximated uphill dynamics for the Inline graphic-component in our dimer model as.

graphic file with name pone.0088167.e150.jpg (24)

The corresponding effective Langevin dynamics for the Inline graphic-component reads.

graphic file with name pone.0088167.e152.jpg (25)

where Inline graphic and Inline graphic are two standard temporal Gaussian white noise with mean Inline graphic and covariance Inline graphic, and Inline graphic It is remarkable that besides the mean field transcription rate Inline graphic, we have an additional term Inline graphic in Inline graphic. This is quite different from the usual chemical Langevin equations where for the Inline graphic-th reaction the diffusion term Inline graphic has the same form as its corresponding drift part Inline graphic except a square root operation. The additional term in Inline graphic makes that the fluctuation of the transcription is larger than that in gene expression processes without DNA switching, yet has the same mean field transcription rate Inline graphic This observation coincides with the theoretical analysis in [34] although no feedback is considered there.

The quasi-potential energy landscape not only provides the pictorial illustration for the dynamical transitions, it also contains many quantitative information to understand the metastability in genetic switching models. Once the global energy landscape is obtained, one can get the stationary distribution of the whole system via Eq. (15). Furthermore, it is very easy to calculate two main characteristic quantities used to describe a genetic switching system through only small amount of computational efforts. One is the transition rates, corresponding to the lowest barrier heights between two metastable states, and the other is the noise strength, corresponding to the steepness of quasi-potential around each metastable state.

Global Property: Mean Switching Time (MST)

The transition rate of switching systems is often characterized by Mean Switching Time (MST). We can compute the MST from either metastable state in the dimer model. For example, according to [24], the MST Inline graphic from on-to-off transition can be estimated from an asymptotic analysis.

graphic file with name pone.0088167.e167.jpg (26)

Here Inline graphic is a prefactor, and the quasi-potential energy barrier is Inline graphic, where Inline graphic and Inline graphic are the action values at the saddle and on states, respectively. The result for the MST of off-to-on transition Inline graphic is similar. Although for one dimensional system the prefactor of MST can be obtained [31], there are no available results in high dimensions because of the geometry problem and the non-gradient nature of the system [35], [36]. Fortunately, the prefactor varies slowly in many cases, therefore we can compare the MC simulations with the exponential time part and adjust the prefactor Inline graphic to fit the numerical results.

The sensitivity of both MSTs, Inline graphic and Inline graphic, to the change of transcription rate Inline graphic and mRNA decay rate Inline graphic are investigated and compared with MC simulations in Fig. 4A and Fig. 4B, where the prefactors of off-to-on and on-to-off transitions are estimated as 2300 and 29, respectively. It can be observed that the MST is excellently predicted by Eq. (26) up to a slowly varying prefactor. And it is worth noting that when the MST becomes very large, the efficiency of the classical MC simulations gets extremely low while it is well kept in our approach.

Figure 4. The mean switching time (MST) and quasipotential energy landscape as a function of parameters.

Figure 4

(A) and (B): MST as a function of transcription rate Inline graphic. Promoter transition rates Inline graphic, the gMAM results with numerical prefactor of off-to-on transition (red solid line) and on-to-off transition (blue dashed line), compared with MC simulations (Inline graphic) and (Inline graphic), respectively. (C) and (D): The gMAM results with different promoter transition rates of off-to-on transition (red) and on-to-off transition (blue), where solid line with Inline graphic is same as (A) and (B), the faster transition rate in dashed line with Inline graphic, the slower transition rate in dotted line with Inline graphic. Other parameters are Inline graphic; in (A,C), Inline graphic and (B,D) Inline graphic

The positive feedbacks in genetic circuit usually provide cellular memory or all-or-none switch. The results in Fig. 4 reveal the robustness and sensitivity of the dimer model to the change of different kinetic parameters. In Fig. 4A and Fig. 4B, the promoter transition rates Inline graphic. When the transcription rate Inline graphic increases from 360 to 420, the MST from on-to-off states Inline graphic increases exponentially, while the MST of off-to-on transition Inline graphic decreases slowly; it means the on-state becomes more stable while the off-state can still keep its stability. Therefore when the transcription rate Inline graphic is increased, our genetic dimer circuit with positive feedback provides a stable cellular memory at the on state, but the off state remains its stability. Thus the system can not switch from the off state to the on state effectively.

How to turn on the genetic switch? The results in Fig. 5 provide two possible effective choices. The first choice is to pose an additional source of the mRNA production, which we call as the trigger signal. We denote the additional mRNA production rate as Inline graphic. If Inline graphic increases from 0 to 20, then MST of off-to-on transition Inline graphic will decrease exponentially and the genetic switch is turned on. We also show how the global energy landscape changes with different trigger rate Inline graphic in Fig. 5C and Inline graphic in Fig. 5D. It is evident to see from the figures that when the trigger signal increases, the barrier height from off-state to on-state decreases, which is in accordance with the MST of off-to-on transition Inline graphic. The relevant biological switch can be found in the start point of budding yeast cell cycle process, where the additional trigger signal in G1 cyclin Cln3 causes the activation of G1 transcription factor SBF and MBF [37]. The second possible choice is to decrease the degradation rate of protein Inline graphic. When the degradation rate of protein Inline graphic decreases from 1 to 0.5 in Fig. 5B, MST from off-to-on transition Inline graphic will also decrease exponentially from Inline graphic to Inline graphic and turn on the genetic switch. This is the case in Bacillus subtilis, where the transitions into competent state is caused by decreasing the degradation rate of protein ComK [38]. All the other relative parameters are listed in the caption of Figures.

Figure 5. The mean switching time (MST) of off-to-on transition as a function of (A) trigger signal strength that transcribes mRNA at constant rate and (B) degradation rate of protein Inline graphic.

Figure 5

(C) and (D): Quasipotential energy landscape with different trigger strength. Inline graphic in (C), and Inline graphic in (D). Other parameters are Inline graphic; Inline graphic in (A,C,D), and Inline graphic in (B).

Furthermore, we calculate the MST of both from off to on and on to off states in the different promoter transition rates. We show the results with the fast rates (Inline graphic) in Fig. 4C and slow rates (Inline graphic) in Fig. 4D. These results indicate that the system with slow promoter transition rates tend to have short MST of off-to-on transition (red dashed lines) while fast rates lead to long MST of off-to-on transition (red dash-dotted lines). This is due to the reason that faster promoter transition rates lead to smaller mRNA and protein noise strength (see Fig. 6 for more detailed information). Ignoring the difference of the mechanism of initial transcription between prokaryotes and eukaryotes, in the simple case, the faster promoter transition rates correspond to the gene expression process in prokaryotes, and the slower promoter transition rates correspond to the slow chromatin remodeling process in eukaryotic case [39]. The results suggest that prokaryotes may have stronger cellular memory than eukaryotes.

Figure 6. The coefficient of variation (CV) versus mean number of (A) mRNA and (B) protein induced by varying transcription rate Inline graphic with different promoter transition rates.

Figure 6

The lines and discrete dots correspond to analytical results and MC simulations, respectively. The results with fast promoter transition rates are shown in blue dash-dotted line and Inline graphic, medium rates in red solid line and Inline graphic, and slow rates in magenta dashed line and ◊. The parameters here are the same as in Fig. 4 (A,C).

Local Property: Fluctuation Around Stable States

Another quantitative information that quasi-potential energy landscape can provide is the noise strength of stable states. Here we use the coefficient of variation (CV, i.e. the standard deviation over the mean) to measure the strength of fluctuation instead of the Fano factor, for the system here has positive feedback thus deviates far from Poisson statistics. Notice that the stationary distribution Inline graphic, we can expand Inline graphic in the vicinity of high stable state Inline graphic up to second order thus get the Gaussian approximation.

graphic file with name pone.0088167.e218.jpg (27)

Here, Inline graphic, Inline graphic, and Inline graphic is the determinant of matrix Inline graphic. Eq. (27) holds only in the vicinity of the on state with standard deviations Inline graphic and Inline graphic. With the Inline graphic and Inline graphic above, we can easily obtain the CV as shown in Fig. 6.

Figure 6 demonstrates that when the average expression levels increase, the noise strength of mRNA and protein decreases in our positive feedback model. The fluctuation of mRNA is usually larger than that of protein. Furthermore, the noise level with slow promoter transition rates is almost always larger than the one with fast promoter transition rates. This is in accordance with the results of MST that the system with long MST has small noise and vise versa. The inconsistent portion between analytical and simulation results (the left part of the line with slow promoter transition rates in Fig. 6B) is due to the inapplicability of Eq. (27) during the low barrier crossing process for the on state. More details may be referred to Text S1:V.

Application in Transcriptional Cascades

To further illustrate the power of quasi-potential energy landscape and the abundant quantitative information it contains, we apply our methodology to a transcriptional cascades model based on the previous work of S. Hooshangi et al. [40]. In their work, S. Hooshangi et al. synthesized transcriptional cascades comprised of one, two, and three repression layers and analyzed the sensitivity and noise propagation as a function of network complexity. They used different concentrations of anhydrotetracycline (aTc) as inducer and measured the fluorescence intensities of protein eyfp (the last layer of each cascade) by the flow cytometer.

Here we simplify the 3-layer cascades as Inline graphic, where Inline graphic denotes the concentration of aTc as inducing signal and Inline graphic denote the output of proteins in different layers respectively. Then we directly construct the quasi-potential energy landscape for each layer and obtained the normalized probability distribution of the output to certain signal Inline graphic from Eq. (15). The dose response curves to increasing signal Inline graphic are shown in Fig. 7, which are consistent well with the previous experimental results. Further more, two features of transcriptional cascades can be observed. Firstly, the more layers the transcriptional cascades have, the sharper the response curves are (as the Hill coefficient of the 3-layer cascades is 2.00, 3.15 and 4.08 respectively). Thus the sensitivity is increased in the cascades. Secondly, the fluctuation of output can be described by the spreading width of its distribution, so more layers of cascades amplify the cell-cell variability (see Fig. S2). In short, when a cascade has more layers, its response curve gets steeper with a wider probability distribution and thus larger fluctuations. The straightforward calculation of CV based on Fig. 7 has been done and it agrees well with the MC simulations (see Text S1:VI and Figure S2).

Figure 7. The dose response curves and probability distribution of the output protein in the 3-layer cascades (denoted by Inline graphic) as a function of inducing signal Inline graphic, 1-layer in (A), 2-layer (B) and 3-layer (C).

Figure 7

The probability distribution can be directly obtained from Eq. (15) after normalization. The Hill coefficient for each cascade is fitted as 2.00, 3.15 and 4.08 respectively.

Limitations of The Study, Open Questions, and Future Works

We have already illustrated a general methodology based on LDT to quantitatively understand the metastability in gene expression processes perturbed by the intrinsic noise and applied it to a dimer auto-regulatory circuit model. It is clear that this methodology can be extended to more general systems, provide one can explicitly write down the Hamiltonian of the system. If all of the considered species have relatively large numbers, the Hamiltonian is simply the Eq. (4). For the case where the large volume limit fails to be true, our method is also applicable under an additional assumption that the low copy number of species reach their stationary distribution much faster than the others. This is the situation that we treat DNA in our dimer model. However, we would like to mention the limitations of our work, which of course motivates us for future studies.

The main limitations or the corresponding open questions can be summarized into the following three aspects:

  1. The case where the large volume limit and the fast switching mechanism are both invalid. This prevents us to construct the LDT for the considered system. Thus there is no Hamiltonian and the current methodology fails. How to quantitatively study such systems and define the proper Waddington energy landscape is an issue.

  2. The curse of dimensionality. When the problem is extended to high dimensions, although the computation of optimal transition path and the analysis of MST and CV can be achieved with a reasonable cost, constructing the whole global energy landscape is not feasible in general. However as we have argued before, the whole energy landscape is even not necessary since we are only interested in its configuration for partial components. How to extract these information directly from the Hamiltonian by smart utilization of Eq. (21) is what we are trying to do.

  3. Transition rate formula in the high dimensional case. Despite the transition rate formula, i.e. the Arrhenius type formula, for the equilibrium models are well developed [41], there is no complete answer for the non-equilibrium case. The rate with form Inline graphic has long been established in [24], but the prefactor is not known. In one dimensional case, partial result is given [31]. But its high dimensional form is still an open question.

To understand the transition behavior for more general biological systems driven by noise, the above open problems should be overcome in the future studies.

Conclusion and Discussion

In this paper, we have presented a methodology to construct the quasi-potential energy landscape of genetic switching system while explicitly taking mRNA noise into account. This global potential, which is a rationalized version of Waddington potential, can provide a quantitative tool to understand the metastability in more general biological processes with intrinsic noise. The results also provide some insights in gene-expression switching circuit with positive feedback, especially the robustness and sensitivity of the genetic switching system under different promotor transition rates.

For the connection with previous general methodology in literature [11], we focus more on the energy landscape and metastability properties for systems with intrinsic noise. Although one can principally compute the stationary distribution by solving a steady state chemical master equation on a truncated domain, our approach sufficiently utilizes the special structure of the system. Indeed, the global quasi-potential Inline graphic employed in this paper connects with the potential defined in [11] through Inline graphic, which is independent of Inline graphic. The reason we can do this is simply because the system size Inline graphic is large enough here.

With regard to the WKB and reduction approach in [15], we obtain a convex Hamiltonian based on rigorous mathematical analysis and explicitly take mRNA noise into account through the gMAM method. The convexity proves to be essential for the computational efficiency and robustness. Our derivations can be also easily extended to similar problems.

Overall, the quasi-potential energy landscape and the proposed methodology can serve as a useful tool to explore the gene expression process with intrinsic noise. Further developments such as high dimensionality issue and its applications to other biological systems like complex cellular decision making and the developmental process of cells are deserved to be investigated. The biological meaning of optimal transition path and transition states remains to be uncovered in the future studies.

Supporting Information

Figure S1

The network design of three synthetic transcriptional cascades.

(EPS)

Figure S2

Coefficient of variation as a function of mean.

(EPS)

Text S1

This file contains details that needed to understand the main body. It is arranged as follows: I. Mean field limit of CMEs, II. Comparison of Hamiltonian, III. Scale independence on the choice of system size, IV. Introduction of the gMAM, V.Stochastic Simulation, VI. Analysis of Uphill Path, VII. Application in Transcriptional Cascades.

(PDF)

Acknowledgments

The authors are grateful to Weinan E, Xiang Zhou, Qi Ouyang, Chunbo Lou, Hongli Wang and Feng Lin for helpful discussions.

Funding Statement

The work is supported by NSFC grants no. 11174011, 11021463 (F.Li), 11171009 and 91130005 and the National Science Foundation for Excellent Young Scholars (Grant No. 11222114) (T.Li). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1. Balázsi G, van Oudenaarden A, Collins JJ (2011) Cellular decision making and biological noise: From microbes to mammals. Cell 144: 910–925. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2. Munsky B, Neuert G, van Oudenaarden A (2012) Using gene expression noise to understand gene regulation. Proc Natl Acad Sci USA 336: 183–187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Ptashne M (1992) A genetic switch: Phage Lambda and higher organisms, 2nd edition. Blackwell Science, Malden, Massachusetts and Cell Press, Cambridge, Massachusetts.
  • 4. Ozbudak EM, Thattai M, Lim HN, Shraiman BI, van Oudenaarden A (2004) Multistability in the lactose utilization network of Escherichia coli. Nature 427: 737–740. [DOI] [PubMed] [Google Scholar]
  • 5. Kussell E, Leibler S (2005) Phenotypic diversity, population growth, and information in fluctuating environments. Science 309: 2075–2078. [DOI] [PubMed] [Google Scholar]
  • 6. Thattai M, van Oudenaarden A (2001) Intrinsic noise in gene regulatory networks. Proc Natl Acad Sci USA 98: 8614–8619. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.van Kampen NG (1981) Stochastic processes in physics and chemistry. North-Holland, Amsterdam.
  • 8. Hornos JEM, Schultz D, Innocentini GC, Wang J, Walczak AM, et al. (2005) Self-regulating gene: An exact solution. Phys Rev E 72: 051907. [DOI] [PubMed] [Google Scholar]
  • 9. Wang J, Xu L, Wang E (2008) Potential landscapes and flux framework of nonequilibrium networks: robustness, dissipation, and coherence of biochemical oscillations. Proc Natl Acad Sci USA 105: 12271–12276. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Liu D (2008) A numerical scheme for optimal transition paths of stochastic chemical kinetic systems. J Comp Phys 227: 8672–8684. [DOI] [PubMed] [Google Scholar]
  • 11. Wang J, Zhang K, Wang E (2010) Kinetic paths, time scale, and underlying landscapes: A path integral framework to study global natures of nonequilibrium systems and networks. J Chem Phys 133: 1–13. [DOI] [PubMed] [Google Scholar]
  • 12. Zhou JX, Aliyu MDS, Aurell E, Huang S (2012) Quasi-potential landscape in complex multi-stable systems. J R Soc Interface 9: 3539–3553. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Ge H, Qian H (2012) Analytical mechanics in stochastic dynamics: most probable path, large-deviation rate function and Hamilton-Jacobi equation. Int J Mod Phys B 26: 1230012. [Google Scholar]
  • 14. Meerson B, Sasorov PV (2008) Noise-driven unlimited population growth. Phys Rev E 78: 060103. [DOI] [PubMed] [Google Scholar]
  • 15. Assaf M, Roberts E, Luthey-Schulten Z (2011) Determining the stability of genetic switches: Explicitly accounting for mRNA noise. Phys Rev Lett 106: 248102. [DOI] [PubMed] [Google Scholar]
  • 16. Taniguchi Y, Choi PJ, Li GW, Chen HY, Babu M, et al. (2010) Quantifying E. coli proteome and transcriptome with single-molecule sensitivity in single cells. Science 329: 533–538. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Gandhi SJ, Zenklusen D, Lionnet T, Singer RH (2011) Transcription of functionally related constitutive genes is not coordinated. Nat Struct Mol Biol 18: 27–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18. Mehta P, Mukhopadhyay R, Wingreen NS (2008) Exponential sensitivity of noise-driven switching in genetic networks. Phys Biol 5: 1–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Zong C, So LH, Sepúlveda LA, Skinner SO, Golding I (2010) Lysogen stability is determined by the frequency of activity bursts from the fate-determining gene. Mol Syst Biol 6: 440. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Waddington CH, Kacser H (1957) The strategy of the genes: A discussion of some aspects of theoretical biology. George Allen and Unwin, London.
  • 21. Ao P (2004) Potential in stochastic differential equations: Novel construction. J Phys A 37: L25–L30. [Google Scholar]
  • 22.Varadhan SRS (1984) Large deviations and applications. SIAM, Philadelphia.
  • 23.Shwartz A, Weiss A (1995) Large deviations for performance analysis: Queues, communications and computing. Chapman and Hall, London.
  • 24.Freidlin MI, Wentzell AD (1998) Random perturbations of dynamical systems, 2nd edition. Springer, New York.
  • 25. Heymann M, Vanden-Eijnden E (2008) The geometric minimum action method: A least action principle on the space of curves. Comm Pure Appl Math 61: 1052–1117. [Google Scholar]
  • 26. Gillespie DT (1977) Exact stochastic simulation of coupled chemical reactions. J Phys Chem 81: 2340–2361. [Google Scholar]
  • 27. Weinan E, Ren W, Vanden-Eijnden E (2004) Minimum action method for the study of rare events. Comm Pure Appl Math 57: 637–656. [Google Scholar]
  • 28. Liptser RS (1996) Large deviation for two scaled diffusion. Prob Theory Relat Fields 106: 71–104. [Google Scholar]
  • 29. Touchette H (2009) The large deviation approach to statistical mechanics. Phys Rep 478: 1–69. [Google Scholar]
  • 30. Dykman MI, Mori E, Ross J, Hunt PM (1994) Large fluctuations and optimal paths in chemical kinetics. J Phys Chem 100: 5735. [Google Scholar]
  • 31. Escudero C, Kamenev A (2009) Switching rates of multistep reactions. Phys Rev E 79: 041149. [DOI] [PubMed] [Google Scholar]
  • 32. Assaf M, Meerson B (2010) Extinction of metastable stochastic populations Phys Rev E. 81: 021116. [DOI] [PubMed] [Google Scholar]
  • 33. Gillespie DT (2000) The chemical Langevin equations. J Chem Phys 113: 297–306. [Google Scholar]
  • 34. Somkid I, Higham DJ (2010) Zero, one and two-switch models of gene regulation. Discrete Cont Dyn Syst Ser B 14: 495. [Google Scholar]
  • 35. Naeh T, Klosek MM, Matkowsky BJ, Schuss Z (1990) A direct approach to the exit problem. SIAM J Appl Math 50: 595–627. [Google Scholar]
  • 36. Maier RS, Stein DL (1997) Limiting exit location distributions in the stochastic exit problem. SIAM J Appl Math 57: 752–790. [Google Scholar]
  • 37. Skotheim JM, Di Talia S, Siggia ED, Cross FR (2008) Positive feedback of G1 cyclins ensures coherent cell cycle entry. Nature 454: 291–296. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Maamar H, Raj A, Dubnau D (2007) Noise in gene expression determines cell fate in bacillus subtilis. Science 317: 526–529. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39. Raser JM, O’Shea EK (2004) Control of stochasticity in eukaryotic gene expression. Science 304: 1811–1814. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Hooshangi S, Thiberge S, Weiss R (2005) Ultrasensitivity and noise propagation in a synthetic transcriptional cascade. Proc Natl Acad Sci USA 102: 3581–3586. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Weinan E, Ren W, Vanden-Eijnden E (2002) String method for the study of rare events. Phy Rev B 66: 052301. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Figure S1

The network design of three synthetic transcriptional cascades.

(EPS)

Figure S2

Coefficient of variation as a function of mean.

(EPS)

Text S1

This file contains details that needed to understand the main body. It is arranged as follows: I. Mean field limit of CMEs, II. Comparison of Hamiltonian, III. Scale independence on the choice of system size, IV. Introduction of the gMAM, V.Stochastic Simulation, VI. Analysis of Uphill Path, VII. Application in Transcriptional Cascades.

(PDF)


Articles from PLoS ONE are provided here courtesy of PLOS

RESOURCES