A Stochastic Model of Gene Expression with Polymerase Recruitment and Pause Release

Zhixing Cao; Tatiana Filatova; Diego A Oyarzún; Ramon Grima

doi:10.1016/j.bpj.2020.07.020

2020 Aug 3;119(5):1002–1014. doi: 10.1016/j.bpj.2020.07.020

PMCID: PMC7474183 PMID: 32814062

Abstract

Transcriptional bursting is a major source of noise in gene expression. The telegraph model of gene expression, whereby transcription switches between on and off states, is the dominant model for bursting. Recently, it was shown that the telegraph model cannot explain a number of experimental observations from perturbation data. Here, we study an alternative model that is consistent with the data and which explicitly describes RNA polymerase recruitment and polymerase pause release, two steps necessary for messenger RNA (mRNA) production. We derive the exact steady-state distribution of mRNA numbers and an approximate steady-state distribution of protein numbers, which are given by generalized hypergeometric functions. The theory is used to calculate the relative sensitivity of the coefficient of variation of mRNA fluctuations for thousands of genes in mouse fibroblasts. This indicates that the size of fluctuations is mostly sensitive to the rate of burst initiation and the mRNA degradation rate. Furthermore, we show that 1) the time-dependent distribution of mRNA numbers is accurately approximated by a modified telegraph model with a Michaelis-Menten-like dependence of the effective transcription rate on RNA polymerase abundance, and 2) the model predicts that if the polymerase recruitment rate is comparable to or less than the pause release rate, then upon gene replication, the mean number of RNA per cell remains approximately constant. This gene dosage compensation property has been experimentally observed and cannot be explained by the telegraph model with constant rates.

Significance

The random nature of gene expression is well established experimentally. Mathematical modeling provides a means of understanding the factors leading to the observed stochasticity. There is evidence that the classical two-state model of stochastic messenger RNA (mRNA) dynamics (the telegraph model) cannot describe perturbation experiments, and a new model that includes polymerase dynamics has been proposed. In this article, we present the first detailed study of this model, deriving an exact solution for the mRNA distribution in steady-state conditions and an approximate time-dependent solution and showing that the model can explain gene dosage compensation. We also use the theory together with transcriptomic data to deduce which parameters, when perturbed, lead to a maximal change in the size of mRNA fluctuations.

Introduction

There is widespread evidence that mammalian genes are expressed in bursts: infrequent periods of transcriptional activity that produce a large number of messenger RNA (mRNA) transcripts within a short period of time (1, 2, 3). This is in contrast to constitutive expression in which mRNAs are produced in random, uncorrelated events, with a time-independent probability (4). The size and frequency of transcriptional bursts affect the magnitude of temporal fluctuations in mRNA and the protein content of a cell and thus constitute an important source of intracellular noise (5).

A large number of studies have sought to elucidate the mechanisms leading to bursting by constructing simple stochastic models that can explain the data. The simplest of these models is the telegraph model whereby 1) a gene is in two states, an ON state where mRNA is expressed, and an OFF state where there is no expression, and 2) mRNA degrades in the cytoplasm. These first-order reactions are effective because each encapsulates the effect of a large number of underlying biochemical reactions. The chemical master equation of this model has been solved exactly to obtain the probability distribution of mRNA numbers as a function of time (6). For parameter conditions consistent with bursty expression, the steady-state distribution is well approximated by a negative binomial that fits some of the experimental data (7).

Recent studies have extended the telegraph model in various directions (see (8,9) for a recent review). Mammalian cells have been shown to display complex promoter dynamics during the switch from transcriptionally inactive to active states. Such dynamics cannot be described by a single reaction step whose time is exponentially distributed (2), as assumed by the telegraph model. In (10), this complexity is accounted for by deriving analytical expressions linking the Fano factor of mRNA distributions to the general waiting-time distribution of the time to switch from inactive to active states. In contrast, other works (11, 12, 13) have sought to describe promoter dynamics with transitions between a number of discrete promoter states, only some of which are active; in special cases of such models, the steady-state distribution of mRNA fluctuations can be derived analytically. Moreover, dynamic regulation of eve stripe 2 expression in living Drosophila (14) suggests the occurrence of multiple rates of RNA polymerase II (Pol II) loading, which argues in favor of the multistate model rather than the simpler telegraph model. Another study, based on live cell imaging of the amoeba Dictyostelium, postulates a continuum of transcriptional states (15) rather than discrete states. All these models share a common property with the telegraph model, namely that when a transcript is produced, the gene state is unchanged.

Bartman et al. (16) recently argued that it is unclear how polymerase recruitment and pause release, two well-known steps in mRNA production, map onto the active and inactive states assumed by the telegraph model. This argument also applies to the various multistate variants of the telegraph model. In particular, in these models, one cannot tell whether the initiation of a burst permits polymerase recruitment to occur or whether it permits release from the paused state. In (16), the telegraph model and several possible models of transcription were considered that incorporated bursting (burst initiation and termination steps) together with polymerase recruitment and pause release steps. Using stochastic simulations in conjunction with RNA fluorescence in situ hybridization and Pol II chromatin immunoprecipitation sequencing measurements, they showed that the only model compatible with the data is one in which 1) polymerase recruitment follows after burst initiation, and 2) only one polymerase is permitted to bind each promoter-proximal region at a time, and this bound polymerase has to undergo pause release before a second polymerase can be recruited to a gene copy (in line with the findings in (17,18)). We emphasize that although this model has three effective gene states, it is not a special case of the multistate gene models studied in (11, 12, 13). These models assume that the gene state does not change upon production of mRNA because they model the production of a mature transcript without detailed modeling of the steps between transcriptional initiation and termination. However, the model expounded in (16) models transcription at a finer level of detail, which requires that the production of nascent mRNA results in a change of gene state, a property that is crucial to capture the second property above. Note the number of nascent mRNA molecules, irrespective of their length, is equal to the number of polymerases currently transcribing the gene (19). 
An interesting recent review discussing the assumptions behind common gene expression models including those with polymerase dynamics can be found in (20).

In this article, we present the first detailed study of the model proposed by Bartman et al. (16). The article is organized as follows. In Model, we introduce the chemical master equation formulation of the model. In Exact Solution, we obtain an exact steady-state solution of this model, and in Sensitivity Analysis, we use the theoretical results and transcriptomic data to investigate the sensitivity of the size of mRNA fluctuations to the five parameters. In Effective Telegraph Model, we show that by mapping the model onto an effective telegraph model, we can obtain an approximate time-dependent solution. In Connection to the Refractory Model, we show that although our model has three effective promoter states, it is not the same as the refractory model of gene expression devised by Naef and co-workers (2). In Protein Dynamics, we show that the protein number distribution can also be obtained in the limit of fast mRNA decay and that this is generally different from that obtained using the conventional three-stage model of gene expression (21). We finish with a discussion of the biological implications of our results in Conclusions.

Results and Discussion

Model

We consider a stochastic transcriptional bursting model (recently introduced in (16) and henceforth referred to as the multiscale model; see Fig. 1 A), whereby a gene fluctuates between three states: two permissive states (D₁₀ and D₁₁) and a nonpermissive state (D₀).

The transition from D₀ to D₁₀ (burst initiation) is mediated by transcription factor binding with rate constant σ_u, which is reversible with rate constant σ_b (this transition may alternatively represent other processes such as nucleosome remodeling). Subsequently, the binding of Pol II to D₁₀ with rate constant λ (which is proportional to Pol II abundance) leads to D₁₁. This represents a state in which Pol II is paused and models the experimental observation that Pol II pauses downstream of the transcription initiation site preceding productive elongation (18). The polymerase is released from this state with rate constant ρ, leading to two simultaneous processes: 1) because now the polymerase can actively transcribe RNA, it implies the production of nascent mRNA (denoted as N) with rate ρ; and 2) the gene state changes from D₁₁ to D₁₀. This step models the experimental observation that unless the polymerase is unpaused, there is no binding of new Pol II (17,18). In the paused state D₁₁, both the polymerase and the transcription factor can unbind from the gene and lead to the nonpermissive state D₀ (burst termination). Both reversible switches operate at different timescales (hours versus minutes) with max{σ_b, σ_u} ≪ min {ρ, λ}, leading to multiscale transcriptional bursting (16,22). After termination, the nascent mRNA becomes a mature mRNA (denoted by M); this occurs with rate r. Subsequently, the mature mRNA decays with rate constant d. Note that we assume all reactions to be first order, characterized by exponentially distributed waiting times between successive reactions.

In what follows, for simplicity, we assume that the lifetime of nascent mRNA is very short, i.e., r is large, such that the reaction D₁₁ → D₁₀ + N, N → M can be approximated by the single reaction step D₁₁ → D₁₀ + M. In the next section, we derive the steady-state distribution of mature mRNA (simply called mRNA henceforth).
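The reduced reaction scheme can be explored with a direct stochastic simulation. The following is a minimal sketch of the Gillespie algorithm for the three gene states and mature mRNA, using only the Python standard library; the function name, the choice of returned summaries, and the parameter values in the usage line are illustrative assumptions and are not taken from (16).

```python
import random


def simulate_multiscale(sigma_u, sigma_b, lam, rho, d, t_end, seed=0):
    """Gillespie simulation of the reduced model (D11 -> D10 + M, M -> 0).

    Returns (time-averaged mRNA number, fraction of time spent in D0).
    Starts from the hypothetical initial condition D0 with zero mRNA.
    """
    rng = random.Random(seed)
    state, n, t = "D0", 0, 0.0
    n_integral = 0.0   # integral of n(t) dt
    time_in_d0 = 0.0   # total time spent in the nonpermissive state
    while t < t_end:
        # Reactions available in the current gene state:
        # (propensity, new gene state, change in mRNA number)
        if state == "D0":
            reactions = [(sigma_u, "D10", 0)]
        elif state == "D10":
            reactions = [(sigma_b, "D0", 0), (lam, "D11", 0)]
        else:  # D11: paused polymerase
            reactions = [(sigma_b, "D0", 0), (rho, "D10", 1)]
        reactions.append((d * n, state, -1))  # mRNA decay
        total = sum(r[0] for r in reactions)
        tau = rng.expovariate(total)          # waiting time to next reaction
        step = min(tau, t_end - t)
        n_integral += n * step
        if state == "D0":
            time_in_d0 += step
        t += step
        if tau > step:                        # horizon reached first
            break
        # Choose which reaction fires, with probability propensity / total.
        pick = rng.random() * total
        for rate, new_state, dn in reactions:
            if pick < rate:
                state, n = new_state, n + dn
                break
            pick -= rate
    return n_integral / t_end, time_in_d0 / t_end


if __name__ == "__main__":
    # Illustrative parameter values only.
    print(simulate_multiscale(1.0, 1.0, 5.0, 5.0, 1.0, t_end=20000.0, seed=1))
```

For long runs, the time-averaged mRNA number should approach the steady-state mean of the model and the fraction of time in D₀ should approach σ_b/(σ_u + σ_b).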

Exact solution

Let P_θ (n, t) (θ = 0, 10, 11) denote the probability of a cell being in state D_θ with n mRNAs at time t (arguments n and t are hereafter omitted for brevity). The dynamics of probability P_θ are described by the set of coupled master equations

\begin{array}{l} \partial_{t} P_{0} = (E^{1} - 1) d n P_{0} - σ_{u} P_{0} + σ_{b} (P_{10} + P_{11}), \\ \partial_{t} P_{10} = (E^{1} - 1) d n P_{10} - (σ_{b} + λ) P_{10} + σ_{u} P_{0} + ρ E^{- 1} P_{11}, \\ \partial_{t} P_{11} = (E^{1} - 1) d n P_{11} - (ρ + σ_{b}) P_{11} + λ P_{10}, \end{array}

(1)

where the step operator $E^{i}$ acts on a general function g(n) as $E^{i} g (n) = g (n + i)$ (23). To solve Eq. 1, we use the generating function method and define $G_{θ} (z) = \sum_{n} z^{n} P_{θ} (n)$ for θ = 0, 10, 11 so that Eq. 1 can be recast as a set of coupled partial differential equations

\partial_{t} G_{0} + d (z - 1) \partial_{z} G_{0} = - σ_{u} G_{0} + σ_{b} G_{10} + σ_{b} G_{11},

(2a)

\partial_{t} G_{10} + d (z - 1) \partial_{z} G_{10} = ρ z G_{11} - (σ_{b} + λ) G_{10} + σ_{u} G_{0},

(2b)

\partial_{t} G_{11} + d (z - 1) \partial_{z} G_{11} = - ρ G_{11} - σ_{b} G_{11} + λ G_{10},

(2c)

wherein the variable z is dropped for brevity. By setting z = 1 and the time derivatives to zero (considering steady-state conditions), we can deduce that the probability of being in the nonpermissive state D₀ is G₀(1) = σ_b/(σ_u + σ_b) and the probability of being in one of the two permissive states D₁₀ or D₁₁ is G₁₀(1) + G₁₁(1) = σ_u/(σ_u + σ_b).

To solve Eqs. 2a, 2b, and 2c for G₀(z), G₁₀(z), and G₁₁(z) in steady-state conditions, we set $\partial_{t} G_{θ} = 0$ , solve Eq. 2c for G₁₀ as a function of G₁₁, and substitute the result into Eq. 2b to obtain G₀ as a function of G₁₁, so that Eq. 2a becomes a differential equation with G₁₁ as the only unknown

d^{3} u^{2} \partial_{u}^{3} G_{11} + (3 d + γ_{1} + γ_{2}) d^{2} u \partial_{u}^{2} G_{11} + [(d + γ_{1}) (d + γ_{2}) - ρ λ u] d \partial_{u} G_{11} - (d + σ_{u}) ρ λ G_{11} = 0,

(3)

with u = z − 1, γ₁ = σ_b + σ_u, and γ₂ = ρ + λ + σ_b. By defining a new variable x = ρλu/d², Eq. 3 can be further simplified to

x^{2} \partial_{x}^{3} G_{11} + (1 + \frac{γ_{1} + d}{d} + \frac{γ_{2} + d}{d}) x \partial_{x}^{2} G_{11} + (\frac{γ_{1} + d}{d} \frac{γ_{2} + d}{d} - x) \partial_{x} G_{11} - \frac{σ_{u} + d}{d} G_{11} = 0,

which is in the canonical form of the differential equation for the generalized hypergeometric function

x^{2} \partial_{x}^{3} f (x) + (1 + b_{1} + b_{2}) x \partial_{x}^{2} f (x) + (b_{1} b_{2} - x) \partial_{x} f (x) - a_{1} f (x) = 0,

admitting the solution $f (x) = C \cdot {}_{1}F_{2} (a_{1}; b_{1}, b_{2}; x)$ , with C being an integration constant. Hence, the solution for G₁₁ is in terms of the generalized hypergeometric function

G_{11} = C \cdot {}_{1}{F_{2}} (\frac{σ_{u} + d}{d}; \frac{γ_{1} + d}{d}, \frac{γ_{2} + d}{d}; \frac{ρ λ}{d^{2}} u) .

(4)

On the other hand, summing Eqs. 2a, 2b, and 2c and denoting $G = \sum_{θ} G_{θ}$ , one can get $\partial_{u} G = ρ G_{11} / d$ , which together with Eq. 4 leads to

G (u) = C_{2} \cdot {}_{1}{F_{2}} (\frac{σ_{u}}{d}; \frac{σ_{b} + σ_{u}}{d}, \frac{σ_{b} + ρ + λ}{d}; \frac{ρ λ}{d^{2}} u) .

Note that in the last step, we made use of the general relation $\partial_{z} \, {}_{1}F_{2} (a; b, c; z) = (a / (b c)) \cdot {}_{1}F_{2} (a + 1; b + 1, c + 1; z)$ . The integration constant C₂ is found to be 1 by using the normalization condition G(0) = 1. Hence, the exact solution for the generating function is

G (u) = {}_{1}F_{2} (\frac{σ_{u}}{d}; \frac{γ_{1}}{d}, \frac{γ_{2}}{d}; \frac{ρ λ}{d^{2}} u) .

(5)

Hence, it follows that the marginal probability of finding n mRNAs in a cell is

P (n) = \frac{1}{n!} {\frac{d^{n} G (u)}{d u^{n}} |}_{u = - 1} = \frac{1}{n!} {(\frac{ρ λ}{d^{2}})}^{n} \frac{{(\frac{σ_{u}}{d})}_{n}}{{(\frac{σ_{b} + σ_{u}}{d})}_{n} {(\frac{σ_{b} + ρ + λ}{d})}_{n}} \, {}_{1}F_{2} (\frac{σ_{u}}{d} + n; \frac{σ_{b} + σ_{u}}{d} + n, \frac{σ_{b} + ρ + λ}{d} + n; - \frac{ρ λ}{d^{2}}),

(6)

where ${(\cdot)}_{n}$ is the Pochhammer symbol. In Fig. 1 B, we show that distributions obtained from Eq. 6 as well as the corresponding modality (a phenotypic signature (24)) are indistinguishable from distributions produced using the stochastic simulation algorithm (SSA) (25). Note that here, we have solved for the mature mRNA distribution under the assumption that nascent mRNA is short lived. In cases in which this assumption is not physiologically meaningful and one is interested in the nascent mRNA distribution, then the latter is given by Eq. 6 with d replaced by r (the rate at which nascent mRNA changes to mature mRNA because of the termination of transcription).
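As an illustration of how Eq. 6 can be evaluated in practice, the sketch below sums the power series of ₁F₂ directly and multiplies by the prefactor (ρλ/d²)ⁿ (σ_u/d)ₙ / [n! (γ₁/d)ₙ (γ₂/d)ₙ] that arises from differentiating Eq. 5 n times. The function names and parameter values are assumptions made for this example. Direct summation is only reliable for moderate values of ρλ/d², because the alternating series suffers from cancellation in floating-point arithmetic when the argument is large.

```python
def hyp1f2(a, b1, b2, x, tol=1e-17, max_terms=5000):
    """Power series of 1F2(a; b1, b2; x); adequate for moderate |x|."""
    term, total = 1.0, 1.0
    for k in range(max_terms):
        # Ratio of consecutive series terms.
        term *= (a + k) * x / ((b1 + k) * (b2 + k) * (k + 1))
        total += term
        if abs(term) < tol * abs(total):
            break
    return total


def mrna_pmf(n, sigma_u, sigma_b, lam, rho, d):
    """Steady-state probability of n mRNAs, following Eq. 6."""
    a = sigma_u / d
    b1 = (sigma_u + sigma_b) / d        # gamma_1 / d
    b2 = (sigma_b + rho + lam) / d      # gamma_2 / d
    x = rho * lam / d ** 2
    # Prefactor x^n (a)_n / (n! (b1)_n (b2)_n), built up factor by factor.
    pref = 1.0
    for k in range(n):
        pref *= x * (a + k) / ((k + 1) * (b1 + k) * (b2 + k))
    return pref * hyp1f2(a + n, b1 + n, b2 + n, -x)


if __name__ == "__main__":
    # Illustrative parameter values only.
    probs = [mrna_pmf(n, 1.0, 1.0, 5.0, 5.0, 1.0) for n in range(60)]
    print(sum(probs), sum(n * p for n, p in enumerate(probs)))
```

Two quick consistency checks are that the probabilities sum to one and that their mean equals σ_uλρ/(dγ₁γ₂).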

Special case of bursty transcription

It can be further shown by perturbation theory (Appendix A) that when ρ, λ, and σ_b are much greater than the rest of the parameters, the exact solution Eq. 6 reduces to the negative binomial distribution $P (n) = NB (σ_{u} / d, ρ / (ρ + α))$ with α = σ_bγ₂/λ. Note that the constraint on the parameters leads to a time series with large, short-lived bursts of transcription (because ρ, λ, and σ_b are large) separated by long silent intervals (because σ_u is small). Such bursty transcription is common in mammalian cells (3).
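For concreteness, the negative binomial can be written out explicitly. The parameterization below is an assumption on our part (the first argument is the shape parameter and the second is the factor multiplying each additional transcript); it is chosen because its mean agrees with the mean of the multiscale model, σ_uλρ/(dγ₁γ₂), when γ₁ ≈ σ_b:

P (n) = \frac{Γ (n + σ_{u} / d)}{n! \, Γ (σ_{u} / d)} {(\frac{ρ}{ρ + α})}^{n} {(\frac{α}{ρ + α})}^{σ_{u} / d}, \quad 〈 n 〉 = \frac{σ_{u}}{d} \cdot \frac{ρ}{α} = \frac{σ_{u} λ ρ}{d σ_{b} γ_{2}} .

In this reading, σ_u/d is the number of bursts per mRNA lifetime and ρ/α is the mean number of transcripts per burst.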

Relationship to the telegraph model

It can also be shown that in the limit of large ρ, the exact solution of Eq. 6 reduces to the confluent hypergeometric solution of the telegraph model (see Appendix B). This is equivalent to the steady-state solution of the two-state system $D_{0} ⇌_{σ_{b}}^{σ_{u}} D_{10} \overset{λ}{\to} D_{10} + M, M \overset{d}{\to} \emptyset$ . The reduction to a two-state model results from genes spending a short time in state D₁₁ because of the large value of ρ. The production of an mRNA molecule involves the slow reaction step from D₁₀ to D₁₁ with rate λ followed by a very fast reverse step with rate ρ. Hence, the rate of mRNA production is determined by the slowest reaction, i.e., it is equal to λ. By similar reasoning, we can deduce that in the limit of large λ, the gene spends a short time in the state D₁₀, and the multiscale model reduces to the two-state telegraph model with a rate of mRNA production equal to ρ.

Sensitivity analysis

The exact solution in Eq. 5 allows us to examine the stochastic properties of the multiscale model over large swathes of parameter space. We investigate the relative sensitivity of the coefficient of variation of mRNA fluctuations, $CV = \sqrt{Var (n)} / 〈 n 〉$ , which is typically employed as a measure of the magnitude of transcriptional noise. To this end, we calculate the mean and variance, $〈 n 〉$ and $Var (n)$ , from Eq. 5 using $〈 n 〉 = {\partial_{u} G |}_{u = 0}$ and $Var (n) = {\partial_{u}^{2} G |}_{u = 0} + 〈 n 〉 - {〈 n 〉}^{2}$ . The mean and CV are then given by

〈 n 〉 = \frac{σ_{u} λ ρ}{d γ_{1} γ_{2}},

(7a)

{CV}^{2} = \frac{1}{〈 n 〉} + \frac{σ_{u} + d}{σ_{u}} \cdot \frac{γ_{1}}{γ_{1} + d} \cdot \frac{γ_{2}}{γ_{2} + d} - 1 .

(7b)

Note that because the parameters ρ and λ appear symmetrically in Eqs. 7a and 7b, for simplicity, we enforce the constraint ρ = λ (we will relax this constraint later). We therefore study the relative sensitivity of the quantity $\bar{CV} = {CV |}_{ρ = λ}$ , which serves as a gauge of transcriptional noise. It is defined as $Λ_{p} = (p / \bar{CV}) \partial \bar{CV} / \partial p$ for a model parameter p, meaning that a 1% change in p leads to a Λ_p% change in $\bar{CV}$ . The parameter values for the sensitivity analysis were sampled from experimental distributions recently inferred for 3575 genes of CAST allele in mouse fibroblasts (3) using the telegraph model. To obtain values for ρ and λ, we equate the mean of the telegraph model (with ON switching rate σ_u, OFF switching rate σ_b, transcription rate ρ_u, and degradation rate d) ${〈 n 〉}_{tel} = σ_{u} ρ_{u} / (γ_{1} d)$ with the mean of the multiscale model (Eq. 7a) under the constraint ρ = λ, giving

ρ = ρ_{u} (1 + \sqrt{1 + \frac{σ_{b}}{ρ_{u}}}) .

(8)

Distributions for each parameter in the data set are presented in Fig. 2 A, and the box plots in Fig. 2 B show the relative sensitivity for each parameter. The parameters in order of most sensitive first are σ_u, d, σ_b, and ρ = λ. This order is the same as obtained by ranking parameters according to the inverse of their mean experimental values (the mean of the distributions in Fig. 2 A), implying that changes to the CV are most easily accomplished by perturbations to the slowest reactions. Given the vectors $Λ_{p_{1}}$ and $Λ_{p_{2}}$ for any pair $p_{1} \neq p_{2}$ with p₁, p₂ in the set {ρ, λ, σ_b, σ_u, d}, where each entry corresponds to a different gene, in Fig. 2 C, we calculate the Pearson correlation coefficient between the vectors and show the corresponding joint distributions. This shows that (σ_u, σ_b) is the least dependent pairing, and hence, they constitute a quasiorthogonal decomposition of the sensitivity. In other words, a change in the CV due to a change in σ_u is practically uncorrelated with a change in the CV due to a change in σ_b, and hence, these two parameters can be seen as independent “control knobs” to change the CV; this is of interest in synthetic biology, in which an engineering design approach is taken to modify a biological system for improved functionality (26,27). The same ranking of parameters by sensitivity is obtained if, instead of setting λ = ρ, we consider ρ ≫ λ or λ ≫ ρ, and hence, it appears that our results in this section are robust with respect to the ratio λ/ρ.
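The sensitivity calculation for a single gene can be sketched as follows. The CV is obtained from the first two u-derivatives of Eq. 5 at u = 0, ρ = λ is set from Eq. 8, and Λ_p is approximated by central finite differences. The function names, the step size h, and the parameter values are assumptions made for this example; they are not values from the data set in (3).

```python
import math


def cv_multiscale(sigma_u, sigma_b, lam, rho, d):
    """CV of mRNA numbers from the u-derivatives of Eq. 5 at u = 0."""
    a = sigma_u / d
    b1 = (sigma_u + sigma_b) / d       # gamma_1 / d
    b2 = (sigma_b + rho + lam) / d     # gamma_2 / d
    x = rho * lam / d ** 2
    mean = a * x / (b1 * b2)                                        # dG/du
    second = a * (a + 1) * x ** 2 / (b1 * (b1 + 1) * b2 * (b2 + 1))  # d2G/du2
    var = second + mean - mean ** 2
    return math.sqrt(var) / mean


def rho_from_telegraph(rho_u, sigma_b):
    """Eq. 8: rho = lambda that matches the telegraph-model mean."""
    return rho_u * (1.0 + math.sqrt(1.0 + sigma_b / rho_u))


def relative_sensitivities(sigma_u, sigma_b, rho_u, d, h=1e-6):
    """Lambda_p = (p / CV) dCV/dp under rho = lam, by central differences."""
    base = {"sigma_u": sigma_u, "sigma_b": sigma_b,
            "rho": rho_from_telegraph(rho_u, sigma_b), "d": d}

    def cv_bar(p):
        return cv_multiscale(p["sigma_u"], p["sigma_b"], p["rho"], p["rho"], p["d"])

    out = {}
    for name in base:
        up, down = dict(base), dict(base)
        up[name] *= 1.0 + h      # perturb one parameter by a relative amount h
        down[name] *= 1.0 - h
        out[name] = (cv_bar(up) - cv_bar(down)) / (2.0 * h * cv_bar(base))
    return out


if __name__ == "__main__":
    # Illustrative parameter values only.
    print(relative_sensitivities(sigma_u=0.1, sigma_b=2.0, rho_u=20.0, d=0.5))
```

Because the CV depends only on ratios of rates, rescaling all rates by a common factor leaves it unchanged, so the four relative sensitivities returned here should sum to zero up to numerical error; this gives a simple check on the implementation.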

Figure 2. Relative sensitivity analysis of the coefficient of variation $\bar{CV}$ of mRNA noise over five kinetic parameters for 3575 genes of CAST allele data for mouse fibroblasts. (A) Distributions of the kinetic parameters in the dataset (obtained from (3)); values of ρ or λ are calculated using Eq. 8. (B) Box plots indicate the median (values shown at bottom), the 25% and 75% quantiles, and mean and outliers of relative sensitivity. (C) Joint distributions and Pearson correlation between the relative sensitivity vectors for each pair of parameters suggest that (σ_b, σ_u) and (σ_u, d) are the least-dependent pairs. To see this figure in color, go online.

Effective telegraph model

Earlier, we showed that in the limit of large ρ or large λ, the solution of the multiscale model tends to the solution of the telegraph model. Next, we use the first-passage time method to reduce the multiscale model into an effective telegraph model, without making the aforementioned assumptions. To this end, we consider the transcription motif of the multiscale model, $D_{10} \overset{λ}{\to} D_{11} \overset{ρ}{\to} D_{10} + M$ , whose corresponding master equations for producing newborn mRNA starting from state D₁₀ are

\begin{array}{l} \partial_{t} P_{10} = - λ P_{10}, \\ \partial_{t} P_{11} = λ P_{10} - ρ P_{11}, \\ \partial_{t} P_{M} = ρ P_{11}, \end{array}

(9)

where P₁₀, P₁₁, and P_M represent the probability of staying in states D₁₀, D₁₁, or producing a new mRNA, respectively. We remark that the reaction D₁₁ → D₀ is absent from the motif because of its relatively small reaction rate σ_b compared to ρ and λ. The initial conditions for Eq. 9 are ${P_{10} |}_{t = 0} = 1$ and ${P_{11} |}_{t = 0} = {P_{M} |}_{t = 0} = 0$ . Solving for P_M in Eq. 9, we can calculate the mean first-passage time for mRNA production

〈 t_{f} 〉 = \int_{0}^{\infty} t P_{f} d t = \frac{ρ + λ}{λ ρ},

(10)

where $P_{f} = \partial_{t} P_{M}$ is the first-passage time distribution (28). Because the effective transcription rate is the inverse of the mean first-passage time, it immediately follows that the effective telegraph model is

D_{0} ⇌_{σ_{b}}^{σ_{u}} D_{10} \overset{ρ_{u} = \frac{λ ρ}{λ + ρ}}{\to} D_{10} + M, M \overset{d}{\to} \emptyset .

(11)

Alternatively, one can obtain this result by equating the means of our model Eq. 7a and of the telegraph model ${〈 n 〉}_{tel} = ρ_{u} σ_{u} / (γ_{1} d)$ and solving for the effective production rate ρ_u, giving $ρ_{u} = λ ρ / γ_{2} ≃ λ ρ / (λ + ρ)$ because, typically, ρ, λ ≫ σ_b.
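The mean first-passage time in Eq. 10 can be checked numerically. The sketch below integrates Eq. 9 with a fourth-order Runge-Kutta scheme, starting from P₁₀ = 1, and accumulates the integral of t ρ P₁₁(t) with the trapezoidal rule. The integration horizon, step size, and function name are assumptions chosen for this illustration.

```python
def mean_first_passage_time(lam, rho, t_max=50.0, dt=1e-3):
    """Integrate Eq. 9 and return the integral of t * rho * P11(t) dt."""

    def deriv(p10, p11):
        # Right-hand sides of the first two lines of Eq. 9.
        return -lam * p10, lam * p10 - rho * p11

    p10, p11, t = 1.0, 0.0, 0.0
    integral = 0.0
    prev = 0.0  # integrand t * rho * P11 evaluated at t = 0
    for _ in range(int(round(t_max / dt))):
        k1 = deriv(p10, p11)
        k2 = deriv(p10 + 0.5 * dt * k1[0], p11 + 0.5 * dt * k1[1])
        k3 = deriv(p10 + 0.5 * dt * k2[0], p11 + 0.5 * dt * k2[1])
        k4 = deriv(p10 + dt * k3[0], p11 + dt * k3[1])
        p10 += dt * (k1[0] + 2.0 * k2[0] + 2.0 * k3[0] + k4[0]) / 6.0
        p11 += dt * (k1[1] + 2.0 * k2[1] + 2.0 * k3[1] + k4[1]) / 6.0
        t += dt
        cur = t * rho * p11           # P_f(t) = dP_M/dt = rho * P11(t)
        integral += 0.5 * (prev + cur) * dt
        prev = cur
    return integral


if __name__ == "__main__":
    lam, rho = 2.0, 5.0               # illustrative values only
    t_f = mean_first_passage_time(lam, rho)
    print(t_f, (rho + lam) / (lam * rho), 1.0 / t_f)
```

The inverse of the returned value is the effective transcription rate λρ/(λ + ρ) used in the reaction scheme of Eq. 11.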

In Fig. 3, we show the high accuracy of the effective telegraph model approximation from Eq. 11. In particular, Fig. 3 A shows a heatmap of the distance between the distributions of mRNA numbers predicted by the effective telegraph model and the multiscale model. As a distance measure, we use the Hellinger distance (HD), a Euclidean distance-based metric normalized to the interval between 0 and 1. The effective telegraph model is naturally a more accurate description of the multiscale model when there is one rate-limiting step (large difference between ρ and λ) than when there are two rate-limiting steps (ρ = λ).
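A minimal sketch of the distance measure is given below, assuming the standard definition of the Hellinger distance for discrete distributions, HD = ‖√p − √q‖₂/√2, which is consistent with the description above. The function name and the zero-padding of the shorter distribution are choices made for this example.

```python
import math


def hellinger(p, q):
    """Hellinger distance between two discrete distributions on 0, 1, 2, ..."""
    m = max(len(p), len(q))
    # Pad the shorter distribution with zero-probability entries.
    p = list(p) + [0.0] * (m - len(p))
    q = list(q) + [0.0] * (m - len(q))
    s = sum((math.sqrt(pi) - math.sqrt(qi)) ** 2 for pi, qi in zip(p, q))
    return math.sqrt(s / 2.0)


if __name__ == "__main__":
    print(hellinger([0.5, 0.5], [1.0, 0.0]))
```

Identical distributions give a distance of 0, and distributions with disjoint support give a distance of 1.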

Figure 3. An effective telegraph model (given by reaction scheme (11)) approximates the distribution of mRNA numbers of the multiscale model. (A) Hellinger distance (HD) between steady-state distributions of mRNA numbers for the effective telegraph model and the multiscale model as a function of ρ and λ with σ_u = 0.2, σ_b = 0.1, and d = 1. The discrepancy between the two distributions grows as ρ and λ approach the line ρ = λ. (B) Shown are the time-dependent distributions for Point I in (A) (the point with the largest HD) predicted by the effective model compared to those computed by the SSA for the multiscale model. (C) Heat map of HD between both distributions as a function of σ_b and σ_u with ρ = λ = 23 and d = 1. (D) Stochastic bifurcation diagram for the number of modes of the steady-state distributions predicted by the two models. The small dark blue region is where the modalities of the two models disagree. Insets show distributions corresponding to the points marked in (C and D). To see this figure in color, go online.

Because the time-dependent distribution of the telegraph model is known in closed form (6,29), the effective model in Eq. 11 also provides an approximation for the time-dependent distribution of the multiscale model. The accuracy of this approximation is shown in Fig. 3 B, where it is compared to the time-dependent distributions computed using the SSA for the multiscale model. The parameters here correspond to those of Point I in Fig. 3 A (the largest HD). Differences between the distributions of the two models are negligible except near time t = 0. We further investigate how burst initiation and termination rates (σ_u, σ_b) affect the approximation error with a heatmap of HD as a function of σ_u and σ_b (Fig. 3 C) and a stochastic bifurcation diagram for the number of modes of the effective telegraph and multiscale model distributions (Fig. 3 D) at steady state. The point of maximal HD in Fig. 3 C (Point II) displays distributions that differ only slightly from each other; see upper right inset of Fig. 3 D. The two models display the same number of modes in all regions of parameter space except for a narrow region in which modality detection is challenging because the distributions have a broad plateau; see lower right inset of Fig. 3 D (Point III). This again confirms the high accuracy of the effective telegraph model approximation. The biological implications of the Michaelis-Menten dependence of the transcription rate ρ_u in Eq. 11 on λ and ρ are discussed in Conclusions; in particular, there we argue how this special feature of our model can explain gene dosage compensation observed in experiments.

Connection to the refractory model

Besides the telegraph model, another prevalent stochastic transcriptional model is the refractory model (2) (a three-state model, see Fig. 4 A, left), wherein the burst initiation requires two steps. This model was devised to explain the experimental observation that the distribution of “off” intervals is not exponential but rather has a peak at a nonzero value. To understand the connection between our model and the refractory model, we first exactly solve the refractory model for the steady-state distribution of mRNA numbers.

Figure 4. Effective telegraph model approximation for the refractory model. (A) Schematics of both models. (B) Hellinger distance between the steady-state distributions of mRNA numbers predicted by both models and a bifurcation diagram of their number of modes (black lines) as a function of σ_u and λ with σ_b = 0.8, ρ_u = 30, and d = 1. (C) Distributions for Points I and II in (B), showing significant disagreement in the height of the zero mode (insets show a zoom at the mode at zero). To see this figure in color, go online.

Given the reaction scheme illustrated in Fig. 4 A, it follows that the temporal evolution of probability P_θ(n) of finding n mRNAs and gene state D_θ (θ = 0, 1, or 2) can be described by the following master equations:

\begin{cases} \partial_{t} P_{0} (n) = (E^{1} - 1) d n P_{0} (n) + σ_{b} P_{2} (n) - σ_{u} P_{0} (n), \\ \partial_{t} P_{1} (n) = (E^{1} - 1) d n P_{1} (n) + σ_{u} P_{0} (n) - λ P_{1} (n), \\ \partial_{t} P_{2} (n) = (E^{1} - 1) d n P_{2} (n) + (E^{- 1} - 1) ρ_{u} P_{2} (n) + λ P_{1} (n) - σ_{b} P_{2} (n) . \end{cases}

The corresponding generating function equations are given by

\partial_{t} G_{0} + d (z - 1) \partial_{z} G_{0} = σ_{b} G_{2} - σ_{u} G_{0},

(12a)

\partial_{t} G_{1} + d (z - 1) \partial_{z} G_{1} = σ_{u} G_{0} - λ G_{1},

(12b)

\partial_{t} G_{2} + d (z - 1) \partial_{z} G_{2} = ρ_{u} (z - 1) G_{2} + λ G_{1} - σ_{b} G_{2},

(12c)

where $G_{θ} = \sum_{n} z^{n} P_{θ} (n)$ . We intend to solve Eqs. 12a, 12b, and 12c at steady state and thus set $\partial_{t} G_{θ} = 0$ . Then, we solve G₁ as a function of G₂ from Eq. 12c, subsequently substitute it into Eq. 12b, and solve G₀ as a function of G₂. After that, Eq. 12a becomes an ordinary differential equation with G₂ being the only variable to be solved

u^{2} \partial_{u}^{3} G_{2} + (3 + \tilde{λ} + {\tilde{σ}}_{b} + {\tilde{σ}}_{u} - {\tilde{ρ}}_{u} u) u \partial_{u}^{2} G_{2} + [1 + {\tilde{σ}}_{b} + {\tilde{σ}}_{u} + {\tilde{σ}}_{b} {\tilde{σ}}_{u} - {\tilde{ρ}}_{u} (3 + {\tilde{σ}}_{u}) u + \tilde{λ} (1 + {\tilde{σ}}_{b} + {\tilde{σ}}_{u} - {\tilde{ρ}}_{u} u)] \partial_{u} G_{2} - (1 + \tilde{λ}) (1 + {\tilde{σ}}_{u}) {\tilde{ρ}}_{u} G_{2} = 0,

(13)

where ${\tilde{ρ}}_{u}$ , $\tilde{λ}$ , ${\tilde{σ}}_{b}$ , and ${\tilde{σ}}_{u}$ are the kinetic parameters normalized with respect to d and u = z − 1. Eq. 13 is the canonical form of the differential equation for the generalized hypergeometric function ₂F₂, admitting the solution

G_{2} (u) = C \cdot {}_{2}F_{2} (\tilde{λ} + 1, {\tilde{σ}}_{u} + 1; β_{1} - β_{2} + 1, β_{1} + β_{2} + 1; {\tilde{ρ}}_{u} u),

(14)

where C is an integration constant, and β₁ and β₂ denote

β_{1} = \frac{{\tilde{σ}}_{u} + {\tilde{σ}}_{b} + \tilde{λ}}{2}, β_{2} = \frac{1}{2} \sqrt{{\tilde{λ}}^{2} - 2 \tilde{λ} {\tilde{σ}}_{b} + {\tilde{σ}}_{b}^{2} - 2 \tilde{λ} {\tilde{σ}}_{u} - 2 {\tilde{σ}}_{b} {\tilde{σ}}_{u} + {\tilde{σ}}_{u}^{2}} .

Summing Eqs. 12a, 12b, and 12c leads to $\partial_{u} G = \partial_{u} (\sum_{θ} G_{θ}) = {\tilde{ρ}}_{u} G_{2}$ , from which, using Eq. 14, one obtains G in the form of the generalized hypergeometric function

G (u) = C_{2} \cdot {}_{2}F_{2} (\tilde{λ}, {\tilde{σ}}_{u}; β_{1} - β_{2}, β_{1} + β_{2}; {\tilde{ρ}}_{u} u),

(15)

and C₂ is found to be 1 by the normalization condition G(0) = 1. Eq. 15 together with $P (n) = (1 / n!) {(d^{n} G / d u^{n}) |}_{u = - 1}$ defines the distribution of mRNA numbers for the refractory model in steady-state conditions. A similar solution is also known for a generalization of the refractory model (11).

The next step is to map the refractory model onto an effective telegraph model by matching the mean mRNA numbers

{〈 n 〉}_{ref} = \frac{λ σ_{u} ρ_{u}}{d (λ σ_{u} + λ σ_{b} + σ_{u} σ_{b})}, \quad {〈 n 〉}_{tel} = \frac{ρ_{u} {\bar{σ}}_{u}}{d (σ_{b} + {\bar{σ}}_{u})},

leading to an effective burst initiation rate ${\bar{σ}}_{u} = σ_{u} λ / (σ_{u} + λ)$ and the corresponding effective model shown in Fig. 4 A (right). Note that whereas the multiscale model is approximately equivalent to an effective telegraph model with a renormalized mRNA production rate, the refractory model’s telegraph approximation leads to a renormalized rate of switching to the active state.
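The mean-matching step can be verified with a few lines of code. The sketch below evaluates the two means and confirms that they coincide when the telegraph model uses the effective initiation rate σ_uλ/(σ_u + λ). The function names are hypothetical; σ_b, ρ_u, and d follow the values quoted for Fig. 4 B, whereas σ_u and λ are arbitrary illustrative choices.

```python
def mean_refractory(sigma_u, sigma_b, lam, rho_u, d):
    """Steady-state mean mRNA number of the refractory model."""
    return lam * sigma_u * rho_u / (
        d * (lam * sigma_u + lam * sigma_b + sigma_u * sigma_b))


def mean_telegraph(sigma_on, sigma_off, rho_u, d):
    """Steady-state mean mRNA number of the telegraph model."""
    return rho_u * sigma_on / (d * (sigma_off + sigma_on))


def effective_initiation_rate(sigma_u, lam):
    """Effective burst initiation rate: inverse of the summed mean waiting times."""
    return sigma_u * lam / (sigma_u + lam)


if __name__ == "__main__":
    sigma_u, sigma_b, lam, rho_u, d = 0.5, 0.8, 2.0, 30.0, 1.0
    sigma_bar = effective_initiation_rate(sigma_u, lam)
    print(mean_refractory(sigma_u, sigma_b, lam, rho_u, d),
          mean_telegraph(sigma_bar, sigma_b, rho_u, d))
```

Matching the means fixes only the first moment; it does not by itself guarantee that the full distributions agree.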

We then compare the steady-state distributions of the refractory model and its effective telegraph model. A heatmap of HD quantifying their distributional difference and a modality diagram (marked as black lines) of the two distributions are shown in Fig. 4 B. Both the regions of high HD and Region 2, where only the telegraph model predicts bimodality, are large, whereas Region 1, where both models predict bimodality, is small. This shows that the refractory model, in general, is not well approximated by the telegraph model; in particular, the latter's probability for low mRNA numbers is not accurate (see Fig. 4 C). Given the telegraph model's excellent approximation to the multiscale model, it is clear that the multiscale model and refractory model can be distinguished.

Protein dynamics

Finally, for completeness, we extend the multiscale model to provide analytic steady-state distributions of protein numbers. This allows interpretations of single-cell data of protein expression (see, for example, (30)). We consider the network in Fig. 1 A with two additional reactions: 1) a first-order reaction modeling the translation of mRNA to proteins with rate constant k and 2) a first-order reaction modeling the decay of protein with rate constant d_p. It is shown in Appendix C that under the classic short-lived mRNA assumption (d ≫ d_p) (21), the generating function corresponding to the steady-state distribution of protein numbers is given by

G (v) = {}_{3}F_{2} (a_{1}, a_{2}, a_{3}; b_{1}, b_{2}; b v),

(16)

with b₁ = (σ_b + σ_u)/d_p, b₂ = (σ_b + λ + ρ)/d_p, the mean translational burst size b = k/d, and the parameters a₁, a₂, and a₃ being solutions of the equations

\begin{array}{l} a_{1} a_{2} a_{3} = σ_{u} λ ρ / d_{p}^{3}, \\ a_{1} + a_{2} + a_{3} = b_{1} + b_{2}, \\ a_{1} a_{2} + a_{1} a_{3} + a_{2} a_{3} = b_{1} b_{2} + λ ρ / d_{p}^{2} . \end{array}

In the limit of large λ or ρ, we show in Appendix C that Eq. 16 reduces to the Gaussian hypergeometric function (₂F₁), which was reported in (21), for the classical three-stage model of gene expression in the limit of fast mRNA decay.
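As a quick consistency check on Eq. 16 (a sketch that relies only on the standard series expansion of the generalized hypergeometric function and is not a result stated in the text), the mean protein number follows from the first-order coefficient of G in v, combined with the first of the three conditions on a₁, a₂, and a₃:

```latex
\langle m \rangle = \left. \frac{d G}{d v} \right|_{v = 0}
  = b \, \frac{a_{1} a_{2} a_{3}}{b_{1} b_{2}}
  = \frac{k}{d} \, \frac{σ_{u} λ ρ}{d_{p} (σ_{b} + σ_{u}) (σ_{b} + λ + ρ)} .
```

The mean therefore depends on a₁, a₂, and a₃ only through their product, so it can be evaluated without solving for the individual parameters.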

Conclusions

Here, we performed the first detailed analytical study of a multiscale model of bursty gene expression based on recent experimental data from mammalian cells (16). The conventional telegraph model does not include an independently regulated pause release step and hence cannot differentiate the effects of changing polymerase pause release versus polymerase recruitment rates, whereas the multiscale model studied here can distinguish these effects. Although our model has three effective gene states (one of which regulates pause release), it is not a special case of existing multistate models because in our model, the gene state changes upon production of new nascent mRNAs to model the experimental observation that unless the polymerase is unpaused (and nascent mRNA starts being actively transcribed by this polymerase), there can be no binding of new Pol II. In contrast, current models assume the gene state does not change upon production of mRNA because they model the production of a mature transcript without detailed modeling of the steps between transcriptional initiation and termination.

We have derived simple closed-form expressions for the approximate time evolution of the mRNA numbers and used the theory to understand which reactions contribute most to fluctuations. We also showed that 1) this model can be distinguished from the refractory model, another three-gene-state model popular in the literature, and 2) a number of previous models in the literature are special cases of our model, valid only in certain parameter regimes. Specifically, the mRNA and protein distributions of the conventional three-stage model of gene expression provide a good approximation to the multiscale bursting model in certain regions of parameter space, as shown in Appendices B and C.

The simplicity of the equations for the mean and the variance allows the inference of rate parameters from single-cell data using maximum likelihood methods (31). Potential extensions include 1) the impact of cell cycle effects such as binomial partitioning and variability in the cell cycle duration and 2) a detailed description of polymerase movement along the gene during elongation. The recently developed linear mapping approximation (32) appears to be a promising means to extend the analytical solution of this model to include feedback loops via DNA-protein interactions (33,34).

An important result of the article is that the time-dependent mRNA distribution of the multiscale model with polymerase dynamics and three states can be accurately approximated by the two-state telegraph model, modified with a Michaelis-Menten-like dependence of the effective transcription rate on polymerase abundance. Specifically, by Eq. 11, the transcription rate of a gene locus is ρ_u = λρ/(λ + ρ), where λ is the binding rate of Pol II (see Fig. 1 A), which is proportional to the local number of Pol II molecules at the gene locus with active transcription (35). This equation implies that the transcription rate is proportional to the local number of Pol II molecules if λ is approximately less than ρ, i.e., if the Pol II binding rate is less than or equal to the rate at which Pol II is unpaused. In contrast, if unpausing is the rate-limiting step (ρ ≪ λ), then the transcription rate is practically independent of the local Pol II number.

Now, when the number of gene copies doubles during replication, the local number of Pol II molecules will correspondingly decrease because of increased sharing of Pol II. Hence, if we are in the regime $λ ≲ ρ$, the transcription rate per gene copy decreases; thus, the total transcription rate for a gene per cell postreplication will be less than twice the total transcription rate prereplication. This implies that the mean number of RNA per cell is not significantly affected by replication; indeed, this “dosage compensation” has been observed experimentally for some genes in mouse embryonic stem cells (36), though a different explanation from the one above was suggested. In one study (37), it was estimated that for six yeast genes (RPB2, RPB3, TAF5, TAF6, TAF12, and KAP104), the rate of formation of the preinitiation complex at the promoter (λ) is approximately equal to the rate at which the RNA polymerase escapes the promoter (ρ); hence, gene dosage compensation via polymerase sharing, as implied by our model, may be common. In contrast, if we are in the regime ρ ≪ λ, the transcription rate per gene copy before and after replication is the same, and hence, the total transcription rate for a gene per cell postreplication will be twice the total transcription rate prereplication. This is also what is predicted by the telegraph model with constant burst initiation and termination rates and observed experimentally for a reporter gene expressed from a strong synthetic promoter (36). Note that because the mean burst size is the mean number of RNAs transcribed when the gene is on, by our reasoning above, it also follows that when $λ ≲ ρ$, the mean burst size is altered upon gene replication. The idea that the number of RNA polymerases is the limiting factor in transcription has been recently hypothesized (38) and has implications for the mitigation of burden imposed by gene circuits in synthetic biology (39). Our model here goes one step further by deriving the explicit relationship between the transcription rate and the number of RNA polymerases. Generally, our model supports the observation that there are differences in transcriptional activity between different stages of the cell cycle (40) that cannot be explained by the conventional telegraph model.
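The dosage compensation argument can be made concrete with a short calculation. The sketch below uses the effective transcription rate ρ_u = λρ/(λ + ρ) and assumes, purely for illustration, that the Pol II binding rate λ per gene copy is exactly halved when the copy number doubles (perfect sharing); this halving, the `sharing` parameter, and the function names are assumptions of the sketch and are not taken from the paper.

```python
def transcription_rate(lam, rho):
    """Effective per-copy transcription rate rho_u = lam * rho / (lam + rho)."""
    return lam * rho / (lam + rho)


def post_to_pre_replication_ratio(lam, rho, sharing=0.5):
    """Total rate for two gene copies divided by the rate for one copy.

    Assumption of this sketch: after replication the binding rate of each
    copy becomes sharing * lam (sharing = 0.5 means Pol II is split equally).
    """
    return 2.0 * transcription_rate(sharing * lam, rho) / transcription_rate(lam, rho)


# Illustrative regimes (rho fixed to 1 in arbitrary units).
for lam in (0.01, 1.0, 100.0):
    ratio = post_to_pre_replication_ratio(lam, rho=1.0)
    print(f"lambda/rho = {lam:g}: post/pre total transcription rate = {ratio:.3f}")
```

Under this assumption the ratio equals (λ + ρ)/(λ/2 + ρ), which approaches 1 (full compensation) when λ ≪ ρ, equals 4/3 when λ = ρ, and approaches 2 (no compensation) when ρ ≪ λ.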

Author Contributions

Z.C. formulated the research question, performed the calculations, produced the figures, and wrote an initial draft of the manuscript. T.F. performed some of the calculations for protein distributions. D.O. supervised the research and edited the manuscript. R.G. formulated the research question, supervised the research, and wrote the manuscript with assistance from the co-authors.

Acknowledgments

Z.C. gratefully acknowledges support of the UK Research Councils Synthetic Biology for Growth programme and of the Biotechnology and Biological Sciences Research Council, Engineering and Physical Sciences Research Council, and Medical Research Council (B/M018040/1) and careful proofreading by J. Holehouse. R.G. acknowledges support from Biotechnology and Biological Sciences Research Council grant BB/M025551/1. D.O. acknowledges support from the Human Frontier Science Program (grant RGY076/2015).

Editor: Alexander Berezhkovskii.

Appendix A: Analytic Distribution for mRNA Numbers When $ρ$, $λ$, and $σ_{b}$ Are Large

Given the large values of ρ, λ, and σ_b, we implement the following parametrization:

σ_{b} \mapsto σ_{b} δ, ρ \mapsto ρ δ, λ \mapsto λ δ,

where δ is a large real number.

By means of the method of characteristics, solving Eqs. 2a, 2b, and 2c is tantamount to seeking a solution to the ordinary differential equation system

\partial_{s} t = 1 \Rightarrow t = s

\partial_{s} z = d (z - 1) \Rightarrow z - 1 = r e^{d s}

\partial_{s} G = ρ δ (z - 1) G_{11},

(17a)

\partial_{s} G_{10} = ρ δ z G_{11} - σ_{b} δ G_{10} - λ δ G_{10} + σ_{u} (G - G_{10} - G_{11}),

(17b)

\partial_{s} G_{11} = - ρ δ G_{11} - σ_{b} δ G_{11} + λ δ G_{10} .

(17c)

Dividing both sides of Eqs. 17a, 17b, and 17c by δ, one obtains a singular system consisting of

\begin{cases} ϵ \partial_{s} G = ρ (z - 1) G_{11}, \\ ϵ \partial_{s} G_{10} = ρ z G_{11} - σ_{b} G_{10} - λ G_{10} + ϵ σ_{u} (G - G_{10} - G_{11}), \\ ϵ \partial_{s} G_{11} = - ρ G_{11} - σ_{b} G_{11} + λ G_{10}, \end{cases}

(18)

with $ϵ = 1 / δ ≃ 0$. Expanding G, G₁₀, and G₁₁ in Eq. 18 as a series in powers of $ϵ$,

G = G^{(0)} + ϵ G^{(1)} + O (ϵ^{2}), G_{10} = G_{10}^{(0)} + ϵ G_{10}^{(1)} + O (ϵ^{2}), G_{11} = G_{11}^{(0)} + ϵ G_{11}^{(1)} + O (ϵ^{2}),

and matching the orders of $ϵ$, we have

Order of ϵ^{0} : \begin{array}{lcl} ρ (z - 1) G_{11}^{(0)} = 0 & \Rightarrow & G_{11}^{(0)} = 0 \\ ρ z G_{11}^{(0)} - σ_{b} G_{10}^{(0)} - λ G_{10}^{(0)} = 0 & \Rightarrow & G_{10}^{(0)} = 0 \end{array}

and

Order of ϵ^{1} : \begin{array}{lcl} \partial_{s} G^{(0)} = ρ (z - 1) G_{11}^{(1)} & & \\ \partial_{s} G_{10}^{(0)} = ρ z G_{11}^{(1)} - σ_{b} G_{10}^{(1)} - λ G_{10}^{(1)} + σ_{u} (G^{(0)} - G_{10}^{(0)} - G_{11}^{(0)}) & \Rightarrow & ρ z G_{11}^{(1)} - σ_{b} G_{10}^{(1)} - λ G_{10}^{(1)} + σ_{u} G^{(0)} = 0 \\ \partial_{s} G_{11}^{(0)} = - ρ G_{11}^{(1)} - σ_{b} G_{11}^{(1)} + λ G_{10}^{(1)} & \Rightarrow & - ρ G_{11}^{(1)} - σ_{b} G_{11}^{(1)} + λ G_{10}^{(1)} = 0 . \end{array}

Then, we have

\partial_{s} G^{(0)} = - \frac{ρ u σ_{u}}{ρ u - α} G^{(0)},

where α = σ_bγ₂/λ and $u = z - 1 = r e^{d s}$. Its solution immediately follows as

G^{(0)} = C (r) {(ρ r e^{d s} - α)}^{- \frac{σ_{u}}{d}},

(19)

with C(r) being a function of r to be determined from the initial condition. Suppose that the initial condition for this process is $g (u) = {G^{(0)} |}_{t = 0}$, which is known a priori. For instance, if the initial distribution of mRNA numbers is P(n) = p_n, then $g (u) = \sum_{n} p_{n} {(u + 1)}^{n}$. Letting s be equal to 0 (or equivalently t = 0), it follows that u = r and g(u) = g(r), and we can establish the following relation

g (r) = C (r) {(ρ r - α)}^{- \frac{σ_{u}}{d}},

from which we can solve C(r) as

C (r) = g (r) {(ρ r - α)}^{\frac{σ_{u}}{d}} .

Substituting the latter back into Eq. 19 and replacing $r = u e^{- d t}$, we can calculate the leading-order solution of G from Eq. 19 as

G (u) = g (u e^{- d t}) {(\frac{ρ u e^{- d t} - α}{ρ u - α})}^{\frac{σ_{u}}{d}} .

(20)
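To illustrate Eq. 20 with a concrete case, assume (purely for illustration) that there is no mRNA initially, so that p₀ = 1 and g(u) = 1. Differentiating Eq. 20 with respect to u at u = 0, where G = 1, then gives the leading-order time evolution of the mean mRNA number:

```latex
\langle n \rangle (t) = \left. \partial_{u} G \right|_{u = 0}
  = \frac{σ_{u}}{d} \left( \frac{ρ e^{- d t}}{- α} - \frac{ρ}{- α} \right)
  = \frac{σ_{u} ρ}{d α} \left( 1 - e^{- d t} \right) .
```

In this sketch the mean relaxes exponentially, on the timescale 1/d of mRNA decay, to the value σ_u ρ/(dα).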

At steady state, the leading-order solution in Eq. 20 becomes

G (z) = {(\frac{α}{α - ρ (z - 1)})}^{\frac{σ_{u}}{d}},

and the corresponding distribution of mRNA numbers is a negative binomial distribution $NB (\frac{σ_{u}}{d}, \frac{ρ}{ρ + α})$ .
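The negative binomial distribution above can be evaluated directly. The following is a minimal sketch using only the Python standard library; it treats α as a given input, uses the convention in which the generating function is ((1 − p)/(1 − pz))^r with r = σ_u/d and p = ρ/(ρ + α), and the parameter values and function names are illustrative assumptions.

```python
import math


def nb_pmf(n, r, p):
    """P(n) = Gamma(n + r) / (n! Gamma(r)) * p**n * (1 - p)**r, computed in log space."""
    log_pmf = (math.lgamma(n + r) - math.lgamma(r) - math.lgamma(n + 1)
               + n * math.log(p) + r * math.log(1.0 - p))
    return math.exp(log_pmf)


def steady_state_mrna_pmf(n, sigma_u, d, rho, alpha):
    """Leading-order steady-state mRNA distribution NB(sigma_u/d, rho/(rho + alpha))."""
    return nb_pmf(n, sigma_u / d, rho / (rho + alpha))


# Illustrative values only (assumed for this sketch, not from the paper).
sigma_u, d, rho, alpha = 2.0, 1.0, 3.0, 6.0
probabilities = [steady_state_mrna_pmf(n, sigma_u, d, rho, alpha) for n in range(400)]
mean = sum(n * p_n for n, p_n in enumerate(probabilities))
print("normalization:", sum(probabilities))
print("mean from pmf:", mean, " expected sigma_u*rho/(d*alpha):", sigma_u * rho / (d * alpha))
```

Working in log space with `math.lgamma` keeps the evaluation stable for noninteger σ_u/d and large n.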

Appendix B: Convergence to Telegraph Model for Large $ρ$

To show this convergence, we parametrize ρ as $ρ \mapsto ρ δ$, where δ is a large real number. With this parametrization, Eqs. 2a, 2b, and 2c can be recast as

\partial_{t} G_{0} + d (z - 1) \partial_{z} G_{0} = - σ_{u} G_{0} + σ_{b} G_{10} + σ_{b} G_{11},

(21a)

\partial_{t} G_{10} + d (z - 1) \partial_{z} G_{10} + (σ_{b} + λ) G_{10} - σ_{u} G_{0} = ρ δ z G_{11},

(21b)

\partial_{t} G_{11} + d (z - 1) \partial_{z} G_{11} + σ_{b} G_{11} - λ G_{10} = - ρ δ G_{11} .

(21c)

Dividing both sides of Eqs. 21b and 21c by δ and setting ϵ = δ⁻¹, we have that

ϵ (\partial_{t} G_{10} + d (z - 1) \partial_{z} G_{10} + (σ_{b} + λ) G_{10} - σ_{u} G_{0}) = ρ z G_{11},

(22a)

ϵ (\partial_{t} G_{11} + d (z - 1) \partial_{z} G_{11} + σ_{b} G_{11} - λ G_{10}) = - ρ G_{11} .

(22b)

Again using the same method as before, we expand G₀, G₁₀, and G₁₁ in Eqs. 21a, 22a, and 22b as a series in powers of $ϵ$, collect the terms for ϵ⁰ and ϵ¹, and obtain

Order of ϵ^{0} : \begin{cases} \partial_{t} G_{0}^{(0)} + d (z - 1) \partial_{z} G_{0}^{(0)} = - σ_{u} G_{0}^{(0)} + σ_{b} G_{10}^{(0)} + σ_{b} G_{11}^{(0)}, \\ ρ z G_{11}^{(0)} = 0, \\ ρ G_{11}^{(0)} = 0, \end{cases}

(23)

and

Order of ϵ^{1} : \begin{cases} \partial_{t} G_{10}^{(0)} + d (z - 1) \partial_{z} G_{10}^{(0)} + (σ_{b} + λ) G_{10}^{(0)} - σ_{u} G_{0}^{(0)} = ρ z G_{11}^{(1)}, \\ \partial_{t} G_{11}^{(0)} + d (z - 1) \partial_{z} G_{11}^{(0)} + σ_{b} G_{11}^{(0)} - λ G_{10}^{(0)} = - ρ G_{11}^{(1)} . \end{cases}

(24)

From Eq. 23, we find that $G_{11}^{(0)} = 0$, which, substituted into the second equation of Eq. 24, gives $λ G_{10}^{(0)} = ρ G_{11}^{(1)}$. Given both results, Eqs. 23 and 24 can be simplified to

\begin{cases} \partial_{t} G_{0}^{(0)} + d (z - 1) \partial_{z} G_{0}^{(0)} = - σ_{u} G_{0}^{(0)} + σ_{b} G_{10}^{(0)}, \\ \partial_{t} G_{10}^{(0)} + d (z - 1) \partial_{z} G_{10}^{(0)} = λ (z - 1) G_{10}^{(0)} - σ_{b} G_{10}^{(0)} + σ_{u} G_{0}^{(0)}, \end{cases}

which are exactly the generating function equations of the telegraph model (see Eqs. A2 and A3 in (29)), thus showing that the multiscale transcriptional bursting model converges to the telegraph model when ρ → ∞. A similar proof can be constructed to show that the telegraph model is also obtained in the limit λ → ∞.
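The convergence for large ρ can be illustrated at the level of the mean. The sketch below is not part of the original derivation: it assumes the three gene states D₀, D₁₀, and D₁₁ with the transition rates of the model, solves their steady-state balance by hand, and uses ⟨n⟩ = ρP(D₁₁)/d. The telegraph limit is taken with mRNA production rate λ, as in the reduced equations above. Parameter values and function names are illustrative assumptions.

```python
def mean_mrna_multiscale(sigma_u, sigma_b, lam, rho, d):
    """Steady-state mean mRNA number of the multiscale model.

    Obtained here (not quoted from the text) from the steady-state balance of
    the three gene states: <n> = rho * P(D11) / d, with
    P(D11) = sigma_u / (sigma_u + sigma_b) * lam / (rho + sigma_b + lam).
    """
    p11 = sigma_u / (sigma_u + sigma_b) * lam / (rho + sigma_b + lam)
    return rho * p11 / d


def mean_mrna_telegraph(sigma_u, sigma_b, production_rate, d):
    """Steady-state mean mRNA number of the two-state telegraph model."""
    return production_rate * sigma_u / (d * (sigma_u + sigma_b))


# Illustrative values only (assumed for this sketch, not from the paper).
sigma_u, sigma_b, lam, d = 1.0, 2.0, 5.0, 1.0
limit = mean_mrna_telegraph(sigma_u, sigma_b, lam, d)
for rho in (1e0, 1e2, 1e4, 1e6):
    mean = mean_mrna_multiscale(sigma_u, sigma_b, lam, rho, d)
    print(f"rho = {rho:g}: multiscale mean = {mean:.6f}, telegraph limit = {limit:.6f}")
```

The relative gap between the two means is (σ_b + λ)/(ρ + σ_b + λ) in this sketch, so it shrinks as 1/ρ. Agreement of the means is only a necessary check; the appendix establishes convergence of the full generating function equations.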

Appendix C: Analytic Marginal Distribution for Protein Numbers for the Multiscale Model in the Limit of Fast mRNA Decay

To the reaction scheme illustrated in Fig. 1 A, we add two reactions: 1) a first-order reaction modeling the translation of mRNA to proteins with rate constant k and 2) a first-order reaction modeling the decay of protein with rate constant d_p. The following coupled master equations describe the time evolution of the probability P_θ(n, m) of finding n mRNAs, m proteins, and gene state D_θ (θ = 0, 10, 11) in a cell:

\begin{matrix} \partial_{t} P_{0} (n, m) = d (n + 1) P_{0} (n + 1, m) - d n P_{0} (n, m) + d_{p} (m + 1) P_{0} (n, m + 1) - d_{p} m P_{0} (n, m) \\ + k n P_{0} (n, m - 1) - k n P_{0} (n, m) - σ_{u} P_{0} (n, m) + σ_{b} P_{10} (n, m) + σ_{b} P_{11} (n, m), \\ \partial_{t} P_{10} (n, m) = d (n + 1) P_{10} (n + 1, m) - d n P_{10} (n, m) + d_{p} (m + 1) P_{10} (n, m + 1) - d_{p} m P_{10} (n, m) \\ + k n P_{10} (n, m - 1) - k n P_{10} (n, m) + σ_{u} P_{0} (n, m) - (σ_{b} + λ) P_{10} (n, m) + ρ P_{11} (n - 1, m), \\ \partial_{t} P_{11} (n, m) = d (n + 1) P_{11} (n + 1, m) - d n P_{11} (n, m) + d_{p} (m + 1) P_{11} (n, m + 1) - d_{p} m P_{11} (n, m) \\ + k n P_{11} (n, m - 1) - k n P_{11} (n, m) + λ P_{10} (n, m) - (ρ + σ_{b}) P_{11} (n, m) . \end{matrix}

(25)

By defining $G_{θ} = \sum_{n} \sum_{m} z_{m}^{n} z_{p}^{m} P_{θ} (n, m)$, solving Eq. 25 is tantamount to seeking solutions to the set of differential equations

\begin{cases} \partial_{t} G_{0} + [d (z_{m} - 1) - k (z_{p} - 1) z_{m}] \partial_{z_{m}} G_{0} + d_{p} (z_{p} - 1) \partial_{z_{p}} G_{0} = - σ_{u} G_{0} + σ_{b} G_{10} + σ_{b} G_{11}, \\ \partial_{t} G_{10} + [d (z_{m} - 1) - k (z_{p} - 1) z_{m}] \partial_{z_{m}} G_{10} + d_{p} (z_{p} - 1) \partial_{z_{p}} G_{10} = σ_{u} G_{0} - (σ_{b} + λ) G_{10} + ρ z_{m} G_{11}, \\ \partial_{t} G_{11} + [d (z_{m} - 1) - k (z_{p} - 1) z_{m}] \partial_{z_{m}} G_{11} + d_{p} (z_{p} - 1) \partial_{z_{p}} G_{11} = λ G_{10} - (ρ + σ_{b}) G_{11} . \end{cases}

(26)

By means of the method of characteristics, Eq. 26 is equivalently represented as

\partial_{s} t = 1, \partial_{s} z_{m} = d (z_{m} - 1) - k (z_{p} - 1) z_{m}, \partial_{s} z_{p} = d_{p} (z_{p} - 1),

and

\begin{cases} \partial_{s} G_{0} = - σ_{u} G_{0} + σ_{b} G_{10} + σ_{b} G_{11}, \\ \partial_{s} G_{10} = σ_{u} G_{0} - (σ_{b} + λ) G_{10} + ρ z_{m} G_{11}, \\ \partial_{s} G_{11} = λ G_{10} - (ρ + σ_{b}) G_{11} . \end{cases}

Assuming that mRNA decays much faster than protein such that $\partial_{s} z_{m} ≃ 0$ (21), we get that

z_{m} = \frac{1}{1 - b v}, and v = z_{p} - 1,

(27)

and b = k/d is the mean translational burst size. Using Eq. 27, we can reduce Eq. 26 to

v \partial_{v} G_{0} = - {\tilde{σ}}_{u} G_{0} + {\tilde{σ}}_{b} G_{10} + {\tilde{σ}}_{b} G_{11},

(28a)

v \partial_{v} G_{10} = {\tilde{σ}}_{u} G_{0} - ({\tilde{σ}}_{b} + \tilde{λ}) G_{10} + \frac{\tilde{ρ}}{1 - b v} G_{11},

(28b)

v \partial_{v} G_{11} = \tilde{λ} G_{10} - (\tilde{ρ} + {\tilde{σ}}_{b}) G_{11},

(28c)

where ${\tilde{σ}}_{b}$, ${\tilde{σ}}_{u}$, $\tilde{ρ}$, and $\tilde{λ}$ are the kinetic parameters normalized by the protein degradation rate d_p. It follows from summing Eqs. 28a, 28b, and 28c that

G_{11} = \frac{(1 - b v) \partial_{v} G}{\tilde{ρ} b} .

(29)

Using the definitions $b_{1} = {\tilde{σ}}_{b} + {\tilde{σ}}_{u}$ and $b_{2} = {\tilde{σ}}_{b} + \tilde{λ} + \tilde{ρ}$ and plugging Eq. 29 into Eqs. 28b and 28c, we obtain

(1 - b v) v^{2} \partial_{v}^{3} G + [1 + b_{1} + b_{2} - b v (3 + b_{1} + b_{2})] v \partial_{v}^{2} G + \{ b_{1} b_{2} - b v [(1 + b_{1}) (1 + b_{2}) + \tilde{λ} \tilde{ρ}] \} \partial_{v} G - b {\tilde{σ}}_{u} \tilde{λ} \tilde{ρ} G = 0,

which admits a solution

G (v) = {}_{3}F_{2} (a_{1}, a_{2}, a_{3}; b_{1}, b_{2}; b v),

(30)

with a₁, a₂, and a₃ being roots of

\begin{cases} a_{1} a_{2} a_{3} = {\tilde{σ}}_{u} \tilde{λ} \tilde{ρ}, \\ a_{1} + a_{2} + a_{3} = b_{1} + b_{2}, \\ a_{1} a_{2} + a_{1} a_{3} + a_{2} a_{3} = b_{1} b_{2} + \tilde{λ} \tilde{ρ} . \end{cases}

In summary, Eq. 30 and $P (m) = {(1 / m!) d^{m} G (v) / d v^{m} |}_{v = - 1}$ define the steady-state distribution of protein numbers, which is

P (m) = \frac{b^{m}}{m!} \frac{{(a_{1})}_{m} {(a_{2})}_{m} {(a_{3})}_{m}}{{(b_{1})}_{m} {(b_{2})}_{m}} \, {}_{3}F_{2} (a_{1} + m, a_{2} + m, a_{3} + m; b_{1} + m, b_{2} + m; - b),

given that mRNA is short lived.
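The three symmetric conditions on a₁, a₂, and a₃ are Vieta's relations, so these parameters are the roots of the cubic x³ − (b₁ + b₂)x² + (b₁b₂ + λ̃ρ̃)x − σ̃_uλ̃ρ̃ = 0. The following is a minimal sketch for computing them with the Python standard library only; it uses the Durand-Kerner iteration as one possible root finder (any polynomial solver would do), and the function names, starting guesses, iteration count, and parameter values are assumptions of the sketch.

```python
def cubic_roots(e1, e2, e3, iterations=500):
    """Return the three (possibly complex) roots of x^3 - e1*x^2 + e2*x - e3 = 0.

    Uses the Durand-Kerner iteration, which updates all root estimates
    simultaneously and needs no derivative information.
    """
    def poly(x):
        return ((x - e1) * x + e2) * x - e3

    # Distinct starting guesses: powers of 0.4 + 0.9i (a common heuristic).
    roots = [complex(0.4, 0.9) ** k for k in range(3)]
    for _ in range(iterations):
        updated = []
        for i, r in enumerate(roots):
            denom = 1.0
            for j, s in enumerate(roots):
                if j != i:
                    denom *= r - s
            updated.append(r - poly(r) / denom)
        roots = updated
    return roots


def protein_3f2_parameters(sigma_u, sigma_b, lam, rho, d_p):
    """Parameters (a1, a2, a3) and (b1, b2) of the 3F2 generating function."""
    s_u, s_b, l, r = (x / d_p for x in (sigma_u, sigma_b, lam, rho))
    b1 = s_b + s_u
    b2 = s_b + l + r
    a1, a2, a3 = cubic_roots(b1 + b2, b1 * b2 + l * r, s_u * l * r)
    return (a1, a2, a3), (b1, b2)


# Illustrative values only (assumed for this sketch, not from the paper).
(a1, a2, a3), (b1, b2) = protein_3f2_parameters(1.0, 2.0, 5.0, 10.0, 1.0)
print("a:", a1, a2, a3)
print("b:", b1, b2)
```

The roots are returned as complex numbers; when all three are real, the imaginary parts are at the level of rounding error. Evaluating the ₃F₂ function itself for |b| > 1 requires analytic continuation and is best left to a dedicated special-function library.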

Next, we show that the solution in Eq. 30 converges to the Gaussian hypergeometric function (₂F₁) of the three-stage gene expression model (21) when ρ is large. To this end, we parametrize $\tilde{ρ}$ in Eqs. 28b and 28c as $\tilde{ρ} \mapsto \tilde{ρ} δ$, where δ is a large number. Dividing both sides of Eqs. 28b and 28c by δ, we have

ϵ (v \partial_{v} G_{10} - {\tilde{σ}}_{u} G_{0} + ({\tilde{σ}}_{b} + \tilde{λ}) G_{10}) = \frac{\tilde{ρ}}{1 - b v} G_{11},

(31a)

ϵ (v \partial_{v} G_{11} - \tilde{λ} G_{10} + {\tilde{σ}}_{b} G_{11}) = - \tilde{ρ} G_{11},

(31b)

where $ϵ = 1 / δ ≃ 0$. As before, we expand G₀, G₁₀, and G₁₁ in Eqs. 28a, 31a, and 31b as a series in powers of $ϵ$, collect the terms for ϵ⁰ and ϵ¹, and obtain

Order of ϵ^{0} : \begin{cases} v \partial_{v} G_{0}^{(0)} = - {\tilde{σ}}_{u} G_{0}^{(0)} + {\tilde{σ}}_{b} G_{10}^{(0)} + {\tilde{σ}}_{b} G_{11}^{(0)}, \\ \frac{\tilde{ρ}}{1 - b v} G_{11}^{(0)} = 0, \\ \tilde{ρ} G_{11}^{(0)} = 0, \end{cases}

(32)

and

Order of ϵ^{1} : \begin{cases} v \partial_{v} G_{10}^{(0)} - {\tilde{σ}}_{u} G_{0}^{(0)} + ({\tilde{σ}}_{b} + \tilde{λ}) G_{10}^{(0)} = \frac{\tilde{ρ}}{1 - b v} G_{11}^{(1)}, \\ v \partial_{v} G_{11}^{(0)} - \tilde{λ} G_{10}^{(0)} + {\tilde{σ}}_{b} G_{11}^{(0)} = - \tilde{ρ} G_{11}^{(1)} . \end{cases}

(33)

From Eq. 32, we get $G_{11}^{(0)} = 0$, which is used to reduce Eq. 33 and the first equation in Eq. 32 to

\begin{cases} v \partial_{v} G_{0}^{(0)} = - {\tilde{σ}}_{u} G_{0}^{(0)} + {\tilde{σ}}_{b} G_{10}^{(0)}, \\ v \partial_{v} G_{10}^{(0)} = {\tilde{σ}}_{u} G_{0}^{(0)} - {\tilde{σ}}_{b} G_{10}^{(0)} + \frac{\tilde{λ} b v}{1 - b v} G_{10}^{(0)} . \end{cases}

(34)

Note that Eq. 34, which is the leading order of Eqs. 28a, 28b, and 28c, is exactly the same as the generating function equations of the three-stage gene expression model reported in (21) (see Eqs. 68–69). By means of similar arguments, one can show the corresponding reduction of our model when λ is large.

References

1. Bahar Halpern K., Tanami S., Itzkovitz S. Bursty gene expression in the intact mammalian liver. Mol. Cell. 2015;58:147–156. doi: 10.1016/j.molcel.2015.01.027.
2. Suter D.M., Molina N., Naef F. Mammalian genes are transcribed with widely different bursting kinetics. Science. 2011;332:472–474. doi: 10.1126/science.1198817.
3. Larsson A.J.M., Johnsson P., Sandberg R. Genomic encoding of transcriptional burst kinetics. Nature. 2019;565:251–254. doi: 10.1038/s41586-018-0836-1.
4. Zenklusen D., Larson D.R., Singer R.H. Single-RNA counting reveals alternative modes of gene expression in yeast. Nat. Struct. Mol. Biol. 2008;15:1263–1271. doi: 10.1038/nsmb.1514.
5. Sanchez A., Golding I. Genetic determinants and cellular constraints in noisy gene expression. Science. 2013;342:1188–1193. doi: 10.1126/science.1242975.
6. Peccoud J., Ycart B. Markovian modeling of gene-product synthesis. Theor. Popul. Biol. 1995;48:222–234.
7. Raj A., Peskin C.S., Tyagi S. Stochastic mRNA synthesis in mammalian cells. PLoS Biol. 2006;4:e309. doi: 10.1371/journal.pbio.0040309.
8. Tunnacliffe E., Chubb J.R. What is a transcriptional burst? Trends Genet. 2020;36:288–297. doi: 10.1016/j.tig.2020.01.003.
9. Cao Z., Grima R. Analytical distributions for detailed models of stochastic gene expression in eukaryotic cells. Proc. Natl. Acad. Sci. USA. 2020;117:4682–4692. doi: 10.1073/pnas.1910888117.
10. Kumar N., Kulkarni R.V. Constraining the complexity of promoter dynamics using fluctuations in gene expression. Phys. Biol. 2019;17:015001. doi: 10.1088/1478-3975/ab4e57.
11. Zhou T., Zhang J. Analytical results for a multistate gene model. SIAM J. Appl. Math. 2012;72:789–818.
12. Rodriguez J., Ren G., Larson D.R. Intrinsic dynamics of a human gene reveal the basis of expression heterogeneity. Cell. 2019;176:213–226.e18. doi: 10.1016/j.cell.2018.11.026.
13. Zhang J.J., Zhou T.S. Stationary moments, distribution conjugation and phenotypic regions in stochastic gene transcription. Math. Biosci. Eng. 2019;16:6134–6166. doi: 10.3934/mbe.2019307.
14. Bothma J.P., Garcia H.G., Levine M. Dynamic regulation of eve stripe 2 expression reveals transcriptional bursts in living Drosophila embryos. Proc. Natl. Acad. Sci. USA. 2014;111:10598–10603. doi: 10.1073/pnas.1410022111.
15. Corrigan A.M., Tunnacliffe E., Chubb J.R. A continuum model of transcriptional bursting. eLife. 2016;5:e13051. doi: 10.7554/eLife.13051.
16. Bartman C.R., Hamagami N., Raj A. Transcriptional burst initiation and polymerase pause release are key control points of transcriptional regulation. Mol. Cell. 2019;73:519–532.e4. doi: 10.1016/j.molcel.2018.11.004.
17. Shao W., Zeitlinger J. Paused RNA polymerase II inhibits new transcriptional initiation. Nat. Genet. 2017;49:1045–1051. doi: 10.1038/ng.3867.
18. Gressel S., Schwalb B., Cramer P. CDK9-dependent RNA polymerase II pausing controls transcription initiation. eLife. 2017;6:e29736. doi: 10.7554/eLife.29736.
19. Xu H., Skinner S.O., Golding I. Stochastic kinetics of nascent RNA. Phys. Rev. Lett. 2016;117:128101. doi: 10.1103/PhysRevLett.117.128101.
20. Phillips R., Belliveau N.M., Scholes C. Figure 1 theory meets figure 2 experiments in the study of gene expression. Annu. Rev. Biophys. 2019;48:121–163. doi: 10.1146/annurev-biophys-052118-115525.
21. Shahrezaei V., Swain P.S. Analytical distributions for stochastic gene expression. Proc. Natl. Acad. Sci. USA. 2008;105:17256–17261. doi: 10.1073/pnas.0803850105.
22. Tantale K., Mueller F., Bertrand E. A single-molecule view of transcription reveals convoys of RNA polymerases and multi-scale bursting. Nat. Commun. 2016;7:12248. doi: 10.1038/ncomms12248.
23. Van Kampen N.G. Elsevier; Amsterdam, the Netherlands: 1992. Stochastic Processes in Physics and Chemistry.
24. Thomas P., Popović N., Grima R. Phenotypic switching in gene regulatory networks. Proc. Natl. Acad. Sci. USA. 2014;111:6994–6999. doi: 10.1073/pnas.1400049111.
25. Gillespie D.T. Exact stochastic simulation of coupled chemical reactions. J. Phys. Chem. 1977;81:2340–2361.
26. Mannan A.A., Liu D., Oyarzún D.A. Fundamental design principles for transcription-factor-based metabolite biosensors. ACS Synth. Biol. 2017;6:1851–1859. doi: 10.1021/acssynbio.7b00172.
27. Arpino J.A.J., Hancock E.J., Polizzi K. Tuning the dials of synthetic biology. Microbiology. 2013;159:1236–1253. doi: 10.1099/mic.0.067975-0.
28. Redner S. Cambridge University Press; Cambridge, UK: 2001. A Guide to First-Passage Processes.
29. Iyer-Biswas S., Hayot F., Jayaprakash C. Stochasticity of gene products from transcriptional pulsing. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 2009;79:031911. doi: 10.1103/PhysRevE.79.031911.
30. Bothma J.P., Norstad M.R., Garcia H.G. Llamatags: a versatile tool to image transcription factor dynamics in live embryos. Cell. 2018;173:1810–1822.e16. doi: 10.1016/j.cell.2018.03.069.
31. Cao Z., Grima R. Accuracy of parameter estimation for auto-regulatory transcriptional feedback loops from noisy data. J. R. Soc. Interface. 2019;16:20180967. doi: 10.1098/rsif.2018.0967.
32. Cao Z., Grima R. Linear mapping approximation of gene regulatory networks with stochastic dynamics. Nat. Commun. 2018;9:3305. doi: 10.1038/s41467-018-05822-0.
33. Holehouse J., Grima R. Revisiting the reduction of stochastic models of genetic feedback loops with fast promoter switching. Biophys. J. 2019;117:1311–1330. doi: 10.1016/j.bpj.2019.08.021.
34. Holehouse J., Cao Z., Grima R. Stochastic modeling of auto-regulatory genetic feedback loops: a review and comparative study. Biophys. J. 2020;118:1517–1525. doi: 10.1016/j.bpj.2020.02.016.
35. Cisse I.I., Izeddin I., Darzacq X. Real-time dynamics of RNA polymerase II clustering in live human cells. Science. 2013;341:664–667. doi: 10.1126/science.1239053.
36. Skinner S.O., Xu H., Golding I. Single-cell analysis of transcription kinetics across the cell cycle. eLife. 2016;5:e12175. doi: 10.7554/eLife.12175.
37. Choubey S., Kondev J., Sanchez A. Deciphering transcriptional dynamics in vivo by counting nascent RNA molecules. PLoS Comput. Biol. 2015;11:e1004345. doi: 10.1371/journal.pcbi.1004345.
38. Lin J., Amir A. Homeostasis of protein and mRNA concentrations in growing cells. Nat. Commun. 2018;9:4496. doi: 10.1038/s41467-018-06714-z.
39. Nikolados E.M., Weiße A.Y., Oyarzún D.A. Growth defects and loss-of-function in synthetic gene circuits. ACS Synth. Biol. 2019;8:1231–1240. doi: 10.1021/acssynbio.8b00531.
40. Zopf C.J., Quinn K., Maheshri N. Cell-cycle dependence of transcription dominates noise in gene expression. PLoS Comput. Biol. 2013;9:e1003161. doi: 10.1371/journal.pcbi.1003161.

[bib1] 1.Bahar Halpern K., Tanami S., Itzkovitz S. Bursty gene expression in the intact mammalian liver. Mol. Cell. 2015;58:147–156. doi: 10.1016/j.molcel.2015.01.027. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib2] 2.Suter D.M., Molina N., Naef F. Mammalian genes are transcribed with widely different bursting kinetics. Science. 2011;332:472–474. doi: 10.1126/science.1198817. [DOI] [PubMed] [Google Scholar]

[bib3] 3.Larsson A.J.M., Johnsson P., Sandberg R. Genomic encoding of transcriptional burst kinetics. Nature. 2019;565:251–254. doi: 10.1038/s41586-018-0836-1. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib4] 4.Zenklusen D., Larson D.R., Singer R.H. Single-RNA counting reveals alternative modes of gene expression in yeast. Nat. Struct. Mol. Biol. 2008;15:1263–1271. doi: 10.1038/nsmb.1514. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib5] 5.Sanchez A., Golding I. Genetic determinants and cellular constraints in noisy gene expression. Science. 2013;342:1188–1193. doi: 10.1126/science.1242975. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib6] 6.Peccoud J., Ycart B. Markovian modeling of gene-product synthesis. Theor. Popul. Biol. 1995;48:222–234. [Google Scholar]

[bib7] 7.Raj A., Peskin C.S., Tyagi S. Stochastic mRNA synthesis in mammalian cells. PLoS Biol. 2006;4:e309. doi: 10.1371/journal.pbio.0040309. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib8] 8.Tunnacliffe E., Chubb J.R. What is a transcriptional burst? Trends Genet. 2020;36:288–297. doi: 10.1016/j.tig.2020.01.003. [DOI] [PubMed] [Google Scholar]

[bib9] 9.Cao Z., Grima R. Analytical distributions for detailed models of stochastic gene expression in eukaryotic cells. Proc. Natl. Acad. Sci. USA. 2020;117:4682–4692. doi: 10.1073/pnas.1910888117. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib10] 10.Kumar N., Kulkarni R.V. Constraining the complexity of promoter dynamics using fluctuations in gene expression. Phys. Biol. 2019;17:015001. doi: 10.1088/1478-3975/ab4e57. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib11] 11.Zhou T., Zhang J. Analytical results for a multistate gene model. SIAM J. Appl. Math. 2012;72:789–818. [Google Scholar]

[bib12] 12.Rodriguez J., Ren G., Larson D.R. Intrinsic dynamics of a human gene reveal the basis of expression heterogeneity. Cell. 2019;176:213–226.e18. doi: 10.1016/j.cell.2018.11.026. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib13] 13.Zhang J.J., Zhou T.S. Stationary moments, distribution conjugation and phenotypic regions in stochastic gene transcription. Math. Biosci. Eng. 2019;16:6134–6166. doi: 10.3934/mbe.2019307. [DOI] [PubMed] [Google Scholar]

[bib14] 14.Bothma J.P., Garcia H.G., Levine M. Dynamic regulation of eve stripe 2 expression reveals transcriptional bursts in living Drosophila embryos. Proc. Natl. Acad. Sci. USA. 2014;111:10598–10603. doi: 10.1073/pnas.1410022111. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib15] 15.Corrigan A.M., Tunnacliffe E., Chubb J.R. A continuum model of transcriptional bursting. eLife. 2016;5:e13051. doi: 10.7554/eLife.13051. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib16] 16.Bartman C.R., Hamagami N., Raj A. Transcriptional burst initiation and polymerase pause release are key control points of transcriptional regulation. Mol. Cell. 2019;73:519–532.e4. doi: 10.1016/j.molcel.2018.11.004. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib17] 17.Shao W., Zeitlinger J. Paused RNA polymerase II inhibits new transcriptional initiation. Nat. Genet. 2017;49:1045–1051. doi: 10.1038/ng.3867. [DOI] [PubMed] [Google Scholar]

[bib18] 18.Gressel S., Schwalb B., Cramer P. CDK9-dependent RNA polymerase II pausing controls transcription initiation. eLife. 2017;6:e29736. doi: 10.7554/eLife.29736. [DOI] [PMC free article] [PubMed] [Google Scholar]

[bib19] 19.Xu H., Skinner S.O., Golding I. Stochastic kinetics of nascent RNA. Phys. Rev. Lett. 2016;117:128101. doi: 10.1103/PhysRevLett.117.128101.

[bib20] 20.Phillips R., Belliveau N.M., Scholes C. Figure 1 theory meets figure 2 experiments in the study of gene expression. Annu. Rev. Biophys. 2019;48:121–163. doi: 10.1146/annurev-biophys-052118-115525.

[bib21] 21.Shahrezaei V., Swain P.S. Analytical distributions for stochastic gene expression. Proc. Natl. Acad. Sci. USA. 2008;105:17256–17261. doi: 10.1073/pnas.0803850105.

[bib22] 22.Tantale K., Mueller F., Bertrand E. A single-molecule view of transcription reveals convoys of RNA polymerases and multi-scale bursting. Nat. Commun. 2016;7:12248. doi: 10.1038/ncomms12248.

[bib23] 23.Van Kampen N.G. Elsevier; Amsterdam, the Netherlands: 1992. Stochastic Processes in Physics and Chemistry.

[bib24] 24.Thomas P., Popović N., Grima R. Phenotypic switching in gene regulatory networks. Proc. Natl. Acad. Sci. USA. 2014;111:6994–6999. doi: 10.1073/pnas.1400049111.

[bib25] 25.Gillespie D.T. Exact stochastic simulation of coupled chemical reactions. J. Phys. Chem. 1977;81:2340–2361.

[bib26] 26.Mannan A.A., Liu D., Oyarzún D.A. Fundamental design principles for transcription-factor-based metabolite biosensors. ACS Synth. Biol. 2017;6:1851–1859. doi: 10.1021/acssynbio.7b00172.

[bib27] 27.Arpino J.A.J., Hancock E.J., Polizzi K. Tuning the dials of synthetic biology. Microbiology. 2013;159:1236–1253. doi: 10.1099/mic.0.067975-0.

[bib28] 28.Redner S. Cambridge University Press; Cambridge, UK: 2001. A Guide to First-Passage Processes.

[bib29] 29.Iyer-Biswas S., Hayot F., Jayaprakash C. Stochasticity of gene products from transcriptional pulsing. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 2009;79:031911. doi: 10.1103/PhysRevE.79.031911.

[bib30] 30.Bothma J.P., Norstad M.R., Garcia H.G. LlamaTags: a versatile tool to image transcription factor dynamics in live embryos. Cell. 2018;173:1810–1822.e16. doi: 10.1016/j.cell.2018.03.069.

[bib31] 31.Cao Z., Grima R. Accuracy of parameter estimation for auto-regulatory transcriptional feedback loops from noisy data. J. R. Soc. Interface. 2019;16:20180967. doi: 10.1098/rsif.2018.0967.

[bib32] 32.Cao Z., Grima R. Linear mapping approximation of gene regulatory networks with stochastic dynamics. Nat. Commun. 2018;9:3305. doi: 10.1038/s41467-018-05822-0.

[bib33] 33.Holehouse J., Grima R. Revisiting the reduction of stochastic models of genetic feedback loops with fast promoter switching. Biophys. J. 2019;117:1311–1330. doi: 10.1016/j.bpj.2019.08.021.

[bib34] 34.Holehouse J., Cao Z., Grima R. Stochastic modeling of auto-regulatory genetic feedback loops: a review and comparative study. Biophys. J. 2020;118:1517–1525. doi: 10.1016/j.bpj.2020.02.016.

[bib35] 35.Cisse I.I., Izeddin I., Darzacq X. Real-time dynamics of RNA polymerase II clustering in live human cells. Science. 2013;341:664–667. doi: 10.1126/science.1239053.

[bib36] 36.Skinner S.O., Xu H., Golding I. Single-cell analysis of transcription kinetics across the cell cycle. eLife. 2016;5:e12175. doi: 10.7554/eLife.12175.

[bib37] 37.Choubey S., Kondev J., Sanchez A. Deciphering transcriptional dynamics in vivo by counting nascent RNA molecules. PLoS Comput. Biol. 2015;11:e1004345. doi: 10.1371/journal.pcbi.1004345.

[bib38] 38.Lin J., Amir A. Homeostasis of protein and mRNA concentrations in growing cells. Nat. Commun. 2018;9:4496. doi: 10.1038/s41467-018-06714-z.

[bib39] 39.Nikolados E.M., Weiße A.Y., Oyarzún D.A. Growth defects and loss-of-function in synthetic gene circuits. ACS Synth. Biol. 2019;8:1231–1240. doi: 10.1021/acssynbio.8b00531.

[bib40] 40.Zopf C.J., Quinn K., Maheshri N. Cell-cycle dependence of transcription dominates noise in gene expression. PLoS Comput. Biol. 2013;9:e1003161. doi: 10.1371/journal.pcbi.1003161.
