Coordination of gene expression noise with cell size: analytical results for agent-based models of growing cell populations

Philipp Thomas; Vahid Shahrezaei

doi:10.1098/rsif.2021.0274

. 2021 May 26;18(178):20210274. doi: 10.1098/rsif.2021.0274

Coordination of gene expression noise with cell size: analytical results for agent-based models of growing cell populations

Philipp Thomas ^1,^✉, Vahid Shahrezaei ¹

PMCID: PMC8150024 PMID: 34034535

Abstract

The chemical master equation and the Gillespie algorithm are widely used to model the reaction kinetics inside living cells. It is thereby assumed that cell growth and division can be modelled through effective dilution reactions and extrinsic noise sources. We here re-examine these paradigms through developing an analytical agent-based framework of growing and dividing cells accompanied by an exact simulation algorithm, which allows us to quantify the dynamics of virtually any intracellular reaction network affected by stochastic cell size control and division noise. We find that the solution of the chemical master equation—including static extrinsic noise—exactly agrees with the agent-based formulation when the network under study exhibits stochastic concentration homeostasis, a novel condition that generalizes concentration homeostasis in deterministic systems to higher order moments and distributions. We illustrate stochastic concentration homeostasis for a range of common gene expression networks. When this condition is not met, we demonstrate by extending the linear noise approximation to agent-based models that the dependence of gene expression noise on cell size can qualitatively deviate from the chemical master equation. Surprisingly, the total noise of the agent-based approach can still be well approximated by extrinsic noise models.

Keywords: stochastic gene expression, single-cell analysis, chemical master equation, agent-based modelling

1. Introduction

Cells must continuously synthesize molecules to grow and divide. At a single-cell level, gene expression and cell size are coordinated but heterogeneous which can drive phenotypic variability and decision making in cell populations [1–5]. The interplay between these sources of cell-to-cell variability is not well understood since they have traditionally been studied separately. A general stochastic theory integrating size-dependent biochemical reactions with the dynamics of growing and dividing cells is hence still missing.

Many models of noisy gene expression and its regulation are based on the chemical master equation that describes the stochastic dynamics of biochemical reactions in a fixed reaction volume [6–8]. The small scale of compartmental sizes of cells implies that a small number of molecules is present at any time leading to large variability of reaction rates from cell to cell, commonly referred to as gene expression noise [9–11]. Another factor contributing to gene expression noise is that cells are continuously growing and dividing causing molecule numbers to (approximately) double over the course of a growth-division cycle. A common approach to account for cell growth is to include extra degradation reactions that describe dilution of gene expression levels due to cell growth [9–13] akin to what is done in deterministic rate equation models [14,15]. We will refer to this approach as the effective dilution model (EDM, figure 1a). However, little is known of how well this approach represents the dependence of gene expression noise on cell size observed in a growing population.

Figure 1. — Modelling approaches for cell size dependence of gene expression. (a) The *effective dilution model* describes cells at constant size with intracellular reactions coupled to effective dilution reactions. (b) The *extrinsic noise model* incorporates static cell size variability as a source of extrinsic noise coupled with effective dilution models. (c) The *agent-based approach* models intracellular reactions occurring across a growing and dividing cell population without the need for effective dilution reactions.

Cells achieve concentration homeostasis through coupling reaction rates to cell size via highly abundant upstream factors like cell cycle regulators, polymerases or ribosomes that approximately double over the division cycle [3,16,17]. Cell size fluctuates in single cells, however, providing a source of extrinsic noise in reaction rates that can be identified via noise decompositions [18,19]. A few studies combined EDMs with static cell size variations as an explanatory source of extrinsic noise [20–22]. In brief, the total noise in these models amounts to intrinsic fluctuations due to gene expression and dilution, and extrinsic variation across cell sizes in the population. We refer to this class of models as extrinsic noise models (ENMs, figure 1b). Yet it remains unclear how reliably these effective models describe cells that continuously synthesize molecules, grow and divide.

An increasing number of studies are investing efforts towards quantifying the dependence of gene expression noise on cell cycle progression and growth, either experimentally via ergodic principles or pseudo-time [23,24] and time-lapse imaging [22,25,26] or theoretically through noise decomposition [27–29], master equations including cell cycle dynamics [4,17,30–35] and agent-based approaches including age-structure of growing populations [35–40]. The essence of agent-based models (ABMs) is that each cell in a population is represented by an agent whose physiological state is tracked along with their molecular reaction networks. In principle, these models are able to predict gene expression distributions of cells progressing through well-defined cell cycle states as measured by time-lapse microscopy and snapshots of heterogeneous populations. The unprecedented detail of these models must cast doubt on the predictions of master equation models (EDMs and ENMs) in which growth and division are modelled by effective dilution reactions. Yet it is presently unclear why these effective models have fared reasonably well in predicting gene expression noise reported by single-cell experiments [10,17,41].

Nevertheless, most ABMs still ignore cell size, a major physiological factor affecting both intracellular reactions and cell division dynamics alike. Since cell size varies at least twofold as required by size homeostasis in a growing population, and it scales some reaction rates as required by concentration homeostasis, it is expected that cell size must significantly contribute to gene expression variation across a population. In this article, we bridge the gap between the chemical master equation and agent-based approaches by integrating cell size dynamics with the stochastic kinetics of molecular reaction networks.

The outline of the paper is as follows. First, we explain the analytical framework for EDMs, ENMs and ABMs (§2). Then we introduce the concept of stochastic concentration homeostasis (SCH), a rigorous condition under which the chemical master equations of the EDM and ENM agree exactly with the ABM (§3.1). This new condition is met by some but not all common models of gene expression. We show that when these conditions are not met, the effective models agree with the ABM only on average (§3.2). To address this problem, we propose a comprehensive theoretical framework extending the linear noise approximation to agent-based dynamics with which we quantify cell size scaling of gene expression in growing cells (§3.3). Our findings indicate that the EDM can qualitatively fail to predict this dependence but our novel approximation method accurately describes gene expression noise in the presence of cell size control variations and division errors. We further show that ENMs present surprisingly accurate approximations for the total noise statistics (§3.4).

2. Methods

We consider a biochemical reaction network of N molecular species S = (S₁, S₂, …, S_N)^T embedded in a cell of size s. The network then has the general form:

\sum_{i = 1}^{N} ν_{i r}^{-} S_{i} \overset{k_{r}}{⟶} \sum_{i = 1}^{N} ν_{i r}^{+} S_{i}, r = 1, \dots, R,

2.1

where $ν_{r}^{\pm} = {(ν_{1 r}^{\pm}, ν_{2 r}^{\pm}, \dots, ν_{N r}^{\pm})}^{T}$ are the stoichiometric coefficients and k_r is the reaction rate constant of the rth reaction. In the following, we outline deterministic, effective dilution and extrinsic noise models and develop a new agent-based approach coupling stochastic reaction dynamics to cell size in growing and dividing cells (figure 1).

2.1. Effective dilution models, extrinsic noise models and the chemical master equation

2.1.1. Rate equation models and concentration homeostasis

Deterministically, the vector of molecular concentrations $\bar{X} = {({\bar{X}}_{1}, {\bar{X}}_{2}, \dots, {\bar{X}}_{N})}^{T}$ is governed by rate equation models in balanced growth conditions. The balanced growth condition states that there exists a steady state between reaction and dilution rates

α \bar{X} = \sum_{r = 1}^{R} (ν_{r}^{+} - ν_{r}^{-}) f_{r} (\bar{X}) .

2.2

Here, $f_{r} (\bar{X})$ are macroscopic reaction-rate functions and α is the exponential growth rate of cells determining the dilution rate due to growth. Since these quantities are independent of cell size, the balanced growth condition (2.2) implies concentration homeostasis in rate equation models.

2.1.2. Effective dilution model

The chemical master equation [6] and equivalently the stochastic simulation algorithm [7] are state-of-the-art stochastic models of reaction kinetics inside cells. Although well-established, they are strictly valid only when describing cellular fluctuations at constant cell size s. A straightforward approach to circumvent this limitation is to supplement (2.1) by additional degradation reactions of rate α that model dilution of molecules due to cell growth:

S_{i} \overset{α}{⟶} Ø, i = 1, 2, \dots, N,

2.3

akin to what is traditionally for reaction rate equations (2.2). The chemical master equation of this EDM then takes the familiar form

0 = \frac{\partial Π_{EDM} (x | s)}{\partial t} = [Q (s) + α D] Π_{EDM} (x | s),

2.4

governing the conditional probability of molecule numbers x = (x₁, x₂, …, x_N)^T of the species S in a cell of size s and where

Q_{x, x^{'}} (s) = \sum_{r = 1}^{R} w_{r} (x^{'}, s) (δ_{x, x^{'} + ν_{r}^{+} - ν_{r}^{-}} - δ_{x, x^{'}}),

2.5

are the elements of the transition matrix of the molecular reactions (2.1) and we included the extra dilution reactions (2.3) via $D_{x, x^{'}} (s) = \sum_{i = 1}^{N} x_{i}^{'} (δ_{x_{i}, x_{i}^{'} - 1} - δ_{x_{i}, x_{i}^{'}})$ . We are here interested in the stationary solution and hence set the time-derivative in equation (2.4) to zero. Such effective models are motivated through the fact [6,42] that when the microscopic propensities w_r are linked to the macroscopic rate functions f_r of the rate equation models via mass-action kinetics

w_{r} (x, s) \approx s f_{r} (X),

2.6

where X = x/s is the concentration, the mean concentrations of EDMs follow the concentrations $\bar{X}$ of the rate equations (2.2) (see §2.1.4).

2.1.3. Extrinsic noise model

A common way to incorporate static size variability between cells in the model is to consider cell size s to be distributed across cells according to a cell size distribution $Π (s)$ . We will refer to this approach as the ENM, which leads to a mixture model of concentrations X = x/s,

Π_{ENM} (X) = \int_{0}^{\infty} d s Π_{EDM} (x = X s | s) Π (s),

2.7

and analogous expressions for the molecule number distributions.

2.1.4. Analytical solutions and noise decomposition

The advantage of the EDM and ENM is that its noise statistics can be approximated in closed-form using the linear noise approximation [6,43,44]. In this approximation, the mean concentrations are approximated by the solution $\bar{X}$ of the rate equations (2.2) and the probability distribution $Π_{EDM} (x | s)$ is approximated by a Gaussian. In the same limit, the covariance matrix $Σ_{Y}$ can be decomposed into intrinsic and extrinsic components, $Σ_{Y}^{int}$ and $Σ_{Y}^{ext}$ , using the law of total variance [18,19]

Σ_{Y} = \underset{gene expression}{\underset{⏟}{Σ_{Y}^{int}}} + \underset{cell size variation}{\underset{⏟}{Σ_{Y}^{ext}}},

2.8

which correspond to molecular fluctuations due to gene expression and cell size variation, respectively, for Y ∈ {EDM, ENM}. Specifically, for molecule numbers x, we have $Σ_{Y}^{int} = E_{Π} [{Cov}_{Π_{Y}} [x | s]]$ and $Σ_{Y}^{ext} = {Cov}_{Π} [E_{Π_{Y}} [x | s]]$ , where $E_{Π}$ denotes the expectation value with respect to the distribution $Π$ , and analogously for concentrations. The intrinsic components $Σ_{Y}^{int}$ satisfy a Lyapunov equation called the linear noise approximation:

0 = J_{d} Σ_{Y}^{int} + Σ_{Y}^{int} J_{d}^{T} + Ω_{Y}^{- 1} D_{d} (\bar{X}),

2.9

where $Ω_{Y}$ has to be chosen depending on whether concentration or number covariances are of interest:

2.10

The matrix $J_{d}$ is the Jacobian of the rate equations (2.2) and $D_{d}$ denotes the diffusion matrix obeying

J_{d} (\bar{X}) = J (\bar{X}) - α \underline{1}, D_{d} (\bar{X}) = D (\bar{X}) + α diag (\bar{X}),

2.11

where $J (\bar{X}) = \sum_{r = 1}^{R} (ν_{r}^{+} - ν_{r}^{-}) \nabla_{\bar{X}}^{T} f_{r} (\bar{X})$ and $D (\bar{X}) = \sum_{r = 1}^{R} f_{r} (\bar{X})$ $(ν_{r}^{+} - ν_{r}^{-}) {(ν_{r}^{+} - ν_{r}^{-})}^{T}$ . The extrinsic components $Σ_{Y}^{ext}$ follow from the dependence of the mean on cell size, which features only in the molecule number variance of the ENM:

2.12

where the last cell follows from $Σ_{ENM}^{ext} = {Cov}_{Π} [E_{Π} [x | s]]$ with $E_{Π} [x | s] = s \bar{X}$ .

As a concrete example, we consider transcription of mRNAs with a size-dependent transcription rate that are translated into stable proteins:

Ø \overset{k_{0} s}{⟶} M \overset{k_{dm}}{⟶} Ø, M \overset{k_{tl}}{⟶} M + P .

2.13

We then account for dilution through the additional reactions

M \overset{α}{⟶} Ø, P \overset{α}{⟶} Ø .

2.14

The mean protein concentration is given by $\bar{P} = k_{0} b / α$ and the coefficient of variation predicted by the EDM and ENM follow the familiar expression [10]

{CV}_{Y}^{2} = \frac{1}{Ω_{Y} \bar{P}} (1 + b \frac{δ}{1 + δ}) + \frac{Σ_{Y}^{ext}}{{\bar{P}}^{2}},

2.15

where we account for size variability via $Ω_{Y}$ and $Σ_{Y}^{ext}$ given by equations (2.10) and (2.12), respectively, and the parameters

δ = 1 + \frac{k_{dm}}{α} and b = \frac{k_{tl}}{k_{dm} + α},

2.16

correspond to the ratio of mRNA and protein degradation/dilution rates and the translational burst size, respectively. From equations (2.15) with (2.10) and (2.12), it is clear that size variation acts on the intrinsic noise component of molecule concentrations (via $E_{Π} [s^{- 1}] \approx E_{Π} {[s]}^{- 1} (1 + {CV}_{Π}^{2} [s])$ ) but the extrinsic noise component of molecule numbers (via $Σ_{Y}^{ext}$ (2.12)).

2.2. Agent-based modelling

Little is known about the accuracy of EDMs and ENMs in predicting cellular noise in growing populations. In the following, we introduce an agent-based modelling approach that serves as a gold standard to assess the validity of these effective models. The ABM represents cells as agents that progressively synthesize molecules via intracellular reactions (2.1), grow in size and undergo cell division. Every division gives rise to two daughter cells of varying birth sizes, each of which inherits a proportion of molecules from the mother cell via stochastic size-dependent partitioning at division.

The ABM simulation algorithm is given in box 1, which combines the First-Division algorithm, previously introduced for agent-based cell populations [38], with the Extrande method adapted to simulate reaction networks embedded in a growing cell [48]. In the following, we describe the exact analytical framework with which we characterize the snapshot distributions that underlie such a population of agents.

Box 1. First-Division Algorithm for agent-based simulations of size-dependent gene regulatory networks.

Exact simulation algorithm of general stochastic reaction networks within growing cells (agents) undergoing binary cell division according to cell size control rules [45–47]. The algorithm combines the Extrande method [48] for simulating reaction networks embedded in a growing cell and the First-Division algorithm [38] for the population dynamics. The state of each cell is given by birth time t₀, birth size s₀, present cell size s and the vector of molecule numbers x.

Box 1. First-Division Algorithm for agent-based simulations of size-dependent gene regulatory networks.

2.2.1. Master equation for agent-based populations

We consider the number of cells n(τ, s, x, t) with age τ (time since the last division), cell size s and molecule counts x in a snapshot at time t, which evolves as

\begin{matrix} \underset{growth}{\underset{⏟}{(\frac{\partial}{\partial t} + \frac{\partial}{\partial τ} + \frac{\partial}{\partial s} α s + \bar{γ} (s, τ))}} n (τ, s, x, t) = \underset{stochastic reactions}{\underset{⏟}{Q (s) n (τ, s, x, t)}}, \\ \underset{no . newborn cells}{\underset{⏟}{n (0, s, x, t)}} = 2 \int_{0}^{\infty} d τ^{'} \int_{0}^{\infty} d s^{'} \underset{division error}{\underset{⏟}{B (s | s^{'})}} \\ \times \sum_{x^{'}} \underset{partitioning of molecules}{\underset{⏟}{B (x | x^{'}, s / s^{'})}} \underset{no . dividing cells}{\underset{⏟}{\bar{γ} (s^{'}, τ^{'}) n (τ^{'}, s^{'}, x^{'}, t)}} \end{matrix}}

2.17

and describes cell growth, stochastic reaction kinetics and a boundary condition for cell division that ensures that the number of newborn cells is twice the number of dividing cells after partitioning their size and molecular contents. These evolution equations have been derived in [38,39] for age-dependent snapshots but here we extend such ABMs to include also cell size dynamics and size-dependent reaction dynamics. We allow for the following generalizations: (i) size increases exponentially in single cells, (ii) cells divide with rate $\bar{γ} (s, τ)$ that is both size- and age-dependent, (iii) the transition matrix $Q (s)$ of the molecular reactions depends on cell size s via the propensities (see definition after equation (2.4)), and (iv) the molecular partitioning kernel B(x|x′, s/s′) depends on the inherited size fraction s/s′ of a daughter cell. We now describe in detail how we model the individual noise sources associated with cell size control, division errors, and molecule partitioning.

Cell size control fluctuations. Recent studies [45,49] have shown that the distribution of sizes with which cells divide does not explicitly depend on cell age but on the birth size s₀. Assuming that $\bar{γ} (s, τ) = α s γ (s, s^{- α τ})$ , where γ(s, s₀) is the division rate per unit size (see also [50,51]), the division-size distribution is given by

φ (s_{d} | s_{0}) = γ (s_{d}, s_{0}) e^{- \int_{s_{0}}^{s_{d}} d s γ (s, s_{0})} .

2.18

As a concrete example of (2.18), we consider a model where the division size is linearly related to birth size [46,51,52]

s_{d} = a s_{0} + Δ .

2.19

The division rate can be calculated from the distribution $\tilde{φ} (Δ)$ of the noise term Δ in (2.19) via $\tilde{γ} (Δ) = \tilde{φ} (Δ) / (\int_{Δ}^{\infty} d u \tilde{φ} (u))$ and setting $γ (s, s_{0}) = \tilde{γ} (s - a s_{0})$ , which gives the correct division-size distribution $φ (s_{d} | s_{0}) = \tilde{φ} (s_{d} - a s_{0})$ as expected. The model generalizes the sizer (a = 0) to concerted cell size controls such as the adder (a = 1) and timer-like (2 > a > 1) models [45,47,49]. In the following, we will refer to CV_φ[Δ] as the size-control noise.

Division errors. After division, size is partitioned between cells and the birth size of the two daughter cells is obtained from s₀′ = θs_d and s₀″ = (1 − θ)s_d where θ is the inherited size fraction, a random variable between 0 and 1 with distribution $\bar{π} (θ)$ (see box 1). This can be modelled using the division kernel

B (s_{0} | s^{'}) = \int_{0}^{1} d θ π (θ) δ (θ - \frac{s_{0}}{s^{'}}),

where $π (θ) = (1 / 2) \bar{π} (θ) + (1 / 2) \bar{π} (1 - θ)$ including the case of asymmetric division. We will refer to ${CV}_{π} [θ]$ as the division error about the centre $E_{π} [θ] = 1 / 2$ .

Molecule partitioning at cell division. The partitioning kernel B(x|x′, θ) denotes the probability that a cell inherits x molecules from a total of x′ molecules from its mother and this probability depends on the daughter’s inherited size fraction θ. We assume that cells are sufficiently well mixed and each molecule is partitioned independently with probability θ such that the division kernel is binomial

B (x | x^{'}, θ) = \prod_{i = 1}^{N} (\begin{aligned} x_{i}^{'} \\ x_{i} \end{aligned}) θ^{x_{i}} {(1 - θ)}^{x_{i}^{'} - x_{i}} .

2.20

To make analytical progress, we assume that the population establishes a long-term stationary distribution $Π (s, s_{0}, x)$ characterizing the fraction of cells with molecule numbers x, cell size s and birth size s₀ that is invariant in time. To this end, we let $n (τ, s, x, t) \propto e^{α t} Π (s, τ, x)$ and change variables from cell age τ to birth size s₀ such that $Π (s, s_{0}, x) = {(α s)}^{- 1} Π (s, τ = \ln (s / s_{0}) / α, x)$ . We find that this transformation reduces the PDE (2.17) to an integro-ODE:

(α + \frac{\partial}{\partial s} α s + α s γ (s, s_{0})) Π (s, s_{0}, x) = Q (s) Π (s, s_{0}, x)

2.21a

and

\begin{aligned} s_{0} Π (s_{0}, s_{0}, x) \\ = 2 \sum_{x^{'}} \int_{0}^{\infty} d s^{'} \int_{0}^{s^{'}} d s_{0}^{'} B (x | x^{'}, s_{0} / s^{'}) B (s_{0} | s^{'}) s^{'} γ (s^{'}, s_{0}^{'}) Π (s^{'}, s_{0}^{'}, x^{'}) . \end{aligned}

2.21b

We finally characterize the marginal cell size distribution $Π (s, s_{0})$ and the conditional molecule number distribution $Π (x | s, s_{0})$ via Bayes’ formula

Π (s, s_{0}) = \sum_{x} Π (s, s_{0}, x), Π (x | s, s_{0}) = \frac{Π (s, s_{0}, x)}{Π (s, s_{0})},

2.22

which together provide the full information about the population snapshot.

2.2.2. Cell size distribution

The evolution of the size distribution $Π (s, s_{0})$ is obtained by summing equations (2.21) over all possible x, which yields:

(α + \frac{\partial}{\partial s} α s + α s γ (s, s_{0})) Π (s, s_{0}) = 0

2.23a

and

s_{0} Π (s_{0}, s_{0}) = 2 \int_{0}^{\infty} d s^{'} \int_{0}^{s^{'}} d s_{0}^{'} B (s_{0} | s^{'}) s^{'} γ (s^{'}, s_{0}^{'}) Π (s^{'}, s_{0}^{'}) .

2.23b

Equations (2.23) can be solved analytically

Π (s, s_{0}) = \frac{2}{Z} ψ_{bw} (s_{0}) Φ (s | s_{0}) \frac{1}{s^{2}},

2.24

where ψ_bw(s₀) is the birth size distribution in a backward lineage (see [50] for details), $Φ (s | s_{0}) = \exp (- \int_{s_{0}}^{s} d s^{'} γ (s^{'}, s_{0}))$ is the probability that a cell born at size s₀ has not divided before reaching size s, and $Z = E_{ψ_{bw}} [s_{0}^{- 1}]$ is a normalizing constant.

2.2.3. Molecule number distributions for cells of a certain size

The conditional molecule number distribution $Π (x | s, s_{0})$ gives the probability to find the molecule numbers x in a cell of size s that was born at size s₀ and satisfies

α s \frac{\partial}{\partial s} Π (x | s, s_{0}) = Q (s) Π (x | s, s_{0})

2.25a

and

Π (x | s_{0}, s_{0}) = \sum_{x^{'}} \int_{0}^{\infty} d s^{'} \int_{0}^{s^{'}} d s_{0}^{'} B (x | x^{'}, \frac{s_{0}}{s^{'}}) ρ (s^{'}, s_{0}^{'} | s_{0}) Π (x^{'} | s^{'}, s_{0}^{'}) .

2.25b

Equations (2.25) follow directly from substituting equation (2.22) into (2.21) and using (2.18) and (2.23). The solution of these equations depends implicitly on the ancestral cell size distribution ρ,

ρ (s^{'}, s_{0}^{'} | s_{0}) = \frac{1}{ψ_{bw} (s_{0})} \frac{s_{0}}{s^{'}} B (s_{0} | s^{'}) φ (s^{'} | s_{0}^{'}) ψ_{bw} (s_{0}^{'}),

2.25c

that gives the probability of a cell born at size s₀ having an ancestor with division size s′ and birth size s₀′. The main difference between the molecule number distributions of the ABM and the EDM/ENM is the boundary condition at cell division, which as we shall see can have a significant effect on the reaction dynamics.

3. Results

We here introduce the concept of SCH as a generalization of concentration homeostasis in deterministic systems (see §2.1.1) to higher moments and distributions in stochastic reaction networks. SCH is a homeostatic condition for the distribution p(x|s) of a size-dependent stochastic process to be expressed as a mixture of Poisson random variables drawn from an underlying continuous stochastic concentration vector κ = (κ₁, κ₂, …, κ_N)^T that is statistically independent of s:

p (x | s) = \int_{K} d κ χ (κ) \prod_{i = 1}^{N} \frac{{(κ_{i} s)}^{x_{i}}}{x_{i}!} e^{- s κ_{i}} .

3.1

The fact that κ and its density χ(κ) are independent of s ensures concentration homeostasis in the stochastic sense.

3.1. The effective dilution model is valid for reaction networks with stochastic concentration homeostasis

Theorem A.2 (appendix A) is a central result of our analysis and it states that if the EDM (2.4) satisfies SCH, i.e. equation (3.1) holds for $Π_{EDM} (x | s)$ , then its stationary solution is also a solution of the ABM (2.25):

Π (x | s, s_{0}) = Π_{EDM} (x | s),

3.2

and the solution is independent of the birth size s₀. Equivalently (appendix A), we can say that the EDM/ABM satisfies SCH if the factorial-moment generating function is of the form

G_{EDM} (z | s) = \sum_{x} \prod_{i = 1}^{N} z_{i}^{x_{i}} Π_{EDM} (x | s) = F (s (z - 1)),

3.3

where $F (t) = E_{χ} [e^{\sum_{i = 1}^{N} t_{i} κ_{i}}]$ , the moment-generating function of the concentration vector, is cell-size (s) independent. An interesting observation is that SCH implies that the mean numbers and coefficients of variation for cells of the same size s are given by

E_{Π} [x | s] = s E_{χ} [κ], {CV}_{Π}^{2} (x | s) = \frac{1}{s E_{χ} [κ]} + {CV}_{χ}^{2} (κ) .

3.4

Since κ is independent of s, SCH implies homeostasis of the mean concentrations in (3.4) but concentration homeostasis on average does not necessarily imply SCH. The coefficients of variation coincide both for concentrations and molecule numbers and have size-dependent and size-independent components. In the following, we provide examples of reaction networks for which SCH holds for all values of the rate constants and demonstrate the validity of the EDM by comparing its distribution solutions to ABM simulations.

It can be seen from (3.1) and (3.4) that when the EDM’s stationary distribution is Poissonian with deterministic concentration vector κ, this distribution satisfies SCH and hence is also a solution of the ABM. More generally, SCH can be checked without solving for G_EDM(z|s) (or $Π_{EDM} (x | s)$ ). Assuming mass-action kinetics (2.6), for example, a sufficient condition for SCH is that the network consists entirely of mono-molecular reactions (see appendix A) of the form:

\begin{matrix} Ø \overset{D (t) s}{⟶} S_{1}, or Ø \overset{D (t)}{⟶} m_{s} \times S_{1}, \\ or & Ø \overset{s}{⟶} S_{1}, or S_{1} \to Ø, or S_{1} \to S_{2}, \end{matrix}}

3.5

where S₁ and S₂ denote any pair of species that are partitioned at cell division and D(t) is a exogenous stationary stochastic process modelling a genetic state which is copied but not partitioned at cell division and does not scale with cell size. The propensities of zero-order reactions in SCH networks must either be proportional to cell size or include size-dependent random bursts m_s whose burst distribution satisfies SCH itself. We illustrate the predictive power of this result by demonstrating SCH for common gene expression models involving reactions of the form (3.5) and show that the analytical solution of the chemical master equations agrees exactly with the ABMs (figure 2a–c).

mRNA expression involving a two-state promoter [53] (figure 2a),

D_{off} ⇌_{k_{off}}^{k_{on}} D_{on} \overset{s k_{0}}{⟶} D_{on} + M, M \overset{k_{dm}}{⟶} Ø

satisfies SCH for all parameter values since the network is of the form (3.5) whenever the transcription rate is proportional to cell size. The stochastic concentration variable is distributed as κ ∼ (k₀/(k_dm + α))Beta(k_on/(k_dm + α), k_off/(k_dm + α)).

Bursty protein expression (figure 2b) of a stable (non-degrading) protein arising from a two-stage model of gene expression can be modelled using stochastic bursts:

Ø \overset{k_{0}}{⟶} m_{s} \times P .

According to (3.5) the model satisfies SCH for all parameter values when the burst distribution obeys SCH. This is the case for geometrically distributed bursts m_s whose mean is proportional to cell size, E[m_s|s] = bs. It can be shown that the stochastic concentration variable follows κ ∼ Gamma(k₀/α, b).

Similarly, bursty protein expression from a two-state promoter arising from a three-stage gene expression model [54,56] (figure 2c),

D_{off} ⇌_{k_{off}}^{k_{on}} D_{on} \overset{k_{0}}{⟶} D_{on} + m_{s} \times P,

also satisfies SCH for geometrically distributed bursts (with mean bs) but the concentration variable is doubly stochastic¹ κ ∼ Gamma((a₋/α), br), r ∼ Beta(a₋/α, (a₊ + k_on + k_off)/α) where $a_{\pm} = k_{off} + k_{on} + k_{0} \pm \sqrt{(k_{off} + k_{on} + k_{0})^{2} - 4 k_{0} k_{on}}$ . We observe excellent numerical agreement between the ABM simulations and analytical EDM solutions in all these cases validating our theoretical predictions (figure 2a–c).

A complex example of the reactions (3.5) that obeys SCH for all parameter values but yet defies analytical solution is

\begin{aligned} D_{i} \overset{g_{i j}}{⟶} D_{j} \forall i, j = 1, \dots, N_{D} \\ D_{i} \overset{s t_{i}}{⟶} D_{i} + M_{1}, \forall i = 1, \dots, N_{D} \\ M_{1} \overset{k_{1}}{⟶} M_{2} \overset{k_{2}}{⟶} \dots \overset{k_{S}}{⟶} M_{S}, \\ M_{i} \overset{δ_{i}}{⟶} Ø \forall i = 1, \dots, N_{M} \end{aligned}

where the exogenous genetic states D_i undergo switching with rates g_ij and are copied but not partitioned at cell division, transcription rates s t_i are assumed to be proportional to cell size s, and processing of transcripts M_i follows a multi-step process with rates k_i and degradation with rates δ_i. For example, it can be checked that for N_M = 1 we recover the 2^m-multistate model [57] as a special case whose EDM has a factorial-moment generating function (compare eqn (7) in [57] with (3.3)) satisfies SCH precisely when the transcription rates are proportional to cell size.

On the other hand, discrepancies between the EDM and ABM solutions will be apparent when reactions do not obey SCH. To illustrate this point, we return to the gene expression model with transcriptional size-scaling and explicit protein translation reaction (2.13). Note that in the EDM extra reactions are being added for the dilution of mRNAs and proteins, while for the ABM proteins are diluted through growth and divisions. Using our condition (3.3), it is straight-forward to verify that the Poissonian mRNA distributions of the EDM coincide exactly with the distributions of the ABM (figure 2d). However, this condition is not met for the protein distribution since the translation reaction is not a monomolecular reaction of the form (3.5). To demonstrate the breakdown of the EDM, we compare the analytical steady-state distributions obtained by Bokes et al. [55] against ABM simulations at various cell sizes (figure 2e). We observe that the error of the EDM (as quantified by the ℓ₁-distance of the two distributions, figure 2f) is pronounced both for newborn and dividing cells. The remainder of this article is dedicated to investigate the sources and consequences of these discrepancies.

3.2. The EDM approximates the mean concentrations of ABMs lacking SCH

SCH provides a general criterion with which to probe the validity of the EDM probability distributions. In practice, however, approximate agreement of the first few moments, e.g. mean and variances, often suffices. Here, we establish that under the mass-action scaling assumption (2.6) the mean concentrations of the ABM and EDM agree approximately, and they satisfy concentration homeostasis on average. This can be seen by multiplying equation (2.25) by x and averaging, which yields ODEs for the mean numbers:

α s \frac{\partial}{\partial s} E_{Π} [x | s, s_{0}] = \sum_{r = 1}^{R} (ν_{r}^{+} - ν_{r}^{-}) E_{Π} [w_{r} (x) | s, s_{0}],

3.6a

and the boundary conditions

E_{Π} [x | s_{0}, s_{0}] = E_{ρ} [\frac{s_{0}}{s^{'}} E_{Π} [x | s^{'}, s_{0}^{'}] | s_{0}] .

3.6b

Unfortunately, equations (3.6) are not necessarily closed since the equation for the mean may involve higher order moments when w_r(x) depends nonlinearly on x and we need to resort to approximations. Analogously to the linear noise approximation, we set $E [x | s, s_{0}] = s \bar{X}$ and $E_{Π} [w_{r} (x) | s, s_{0}] \approx s f_{r} (\bar{X})$ and insert the resulting expression into equations (3.6). It follows that $\bar{X}$ is independent of size and satisfies the rate equations (2.2). We conclude that, for mass-action kinetics, the EDM agrees exactly with the ABM on average for networks with linear propensities and approximately for large cell size for nonlinear reaction networks.

3.3. Scaling of fluctuations with size in individual cells manifests the breakdown of the EDM lacking SCH

Next we investigate the scaling of fluctuations with cell size. Under the linear noise approximation the covariance matrix $Σ (s, s_{0}) = {Cov}_{Π} [x | s, s_{0}]$ evolves according to

α s \frac{\partial}{\partial s} Σ (s, s_{0}) = J Σ (s, s_{0}) + Σ (s, s_{0}) J^{T} + s D (\bar{X}),

3.7a

where $J (\bar{X})$ and $D (\bar{X})$ are the Jacobian and diffusion matrices defined after equation (2.11). To make analytical progress, we assume for now that cell division is deterministic ( ${CV}_{φ} [Δ] = {CV}_{π} [θ] = 0$ ), which implies the following boundary condition

4 Σ (s_{0}, s_{0}) = 2 s_{0} diag (\bar{X}) + Σ (2 s_{0}, s_{0}) .

3.7b

The first term is due to binomial partitioning of molecules and the second stems from gene expression noise at division. It is implicit in the deterministic division assumption ( ${CV}_{φ} [Δ] = {CV}_{π} [θ] = 0$ ) that the birth size s₀ across cells is fixed and that the size distribution in equation (2.24) reduces to

Π (s) = Π (s | s_{0}) = \frac{2 s_{0}}{s^{2}}

3.8

for s₀ ≤ s ≤ 2s₀ and zero otherwise, in agreement with previous results [58,59]. Similarly, the ancestral distribution (2.25c) reduces to ρ(s′, s₀′|s₀) = δ(s′ − 2s₀)δ(s₀′ − s₀).

Equations (3.7) can be solved in closed form using the eigendecomposition of the Jacobian $J$ . The solution to (3.7a) that respects the boundary condition (3.7b) is (appendix D)

\begin{aligned} Σ (s, s_{0}) = \sum_{i j} \frac{s {\hat{u}}_{i} {\hat{u}}_{j}^{†}}{(α - λ_{i} - λ_{j}^{*})} \\ \times [{\tilde{D}}_{i j} + \frac{{\tilde{D}}_{i j} + {\tilde{X}}_{i j} (λ_{i} + λ_{j}^{*} - α)}{2^{((λ_{i} + λ_{j}^{*}) / α) - 1} - 2} {(\frac{s_{0}}{s})}^{1 - (λ_{i} + λ_{j}^{*}) / α}], \end{aligned}

3.9

where ^† denotes the conjugate-transpose and we defined the matrices $\tilde{D} = U^{- 1} D U^{- †}$ , $\tilde{X} = U^{- 1} diag (\bar{X}) U^{- †}$ and $U = ({\hat{u}}_{1}, \dots, {\hat{u}}_{N})$ whose columns are the eigenvectors of $J$ such that $U^{- 1} J U = diag (λ)$ .

We demonstrate the implications of this result using the gene expression example with transcriptional size-scaling and explicit translation reaction (2.13). The mean of mRNA numbers m and protein numbers p are

E_{Π} [m | s] = s \frac{k_{0}}{α δ}, E_{Π} [p | s] = s \frac{b k_{0}}{α},

3.10

where the constants are defined in equation (2.16). These expressions hold both for the EDM and the ABM, and they exhibit concentration homeostasis on average as shown in the previous section. The exact agreement between EDM and ABM is also confirmed by ABM simulations (figure 3b).

Figure 3. — Comparing the statistics of the effective dilution and agent-based models. (a) Simple model of mRNA transcription and protein translation transcriptional size-scaling (2.13). (b) Mean mRNA (top) and protein levels (bottom) agree with the EDM (solid grey lines) and ABM simulations (blue dots). (c) mRNA statistics display unit Fano factor indicating Poisson statistics in agreement with EDM. (d) ABM simulations (dots, box 1) display non-monotonic cell size scaling of protein noise, which are predicted by the agent-based theory (solid red) but not by the EDM (solid grey). Parameters are k₀ = 10, k_dm = 10, k_tl = 100, α = 1. Cell size control parameters are as in figure 2.

Using equation (3.7), we find that the cell size-dependent fluctuations satisfy

\begin{matrix} {Var}_{Π} [m | s, s_{0}] = E_{Π} [m | s], \\ {Cov}_{Π} [m, p | s, s_{0}] = \frac{s b k_{0}}{α δ} (1 + {(\frac{s_{0}}{s})}^{δ} \frac{2^{δ}}{1 - 2^{δ + 1}}), \\ {Var}_{Π} [p | s, s_{0}] = E_{Π} [p | s] \\ \times (1 + 2 b - \frac{s_{0}}{s} \frac{4 b δ}{3 (δ - 1)} + {(\frac{s_{0}}{s})}^{δ} \frac{b}{δ - 1} \frac{2^{δ + 1}}{(2^{δ + 1} - 1)}) . \end{matrix}}

3.11

We note that the mRNA variance of the ABM agrees precisely with the EDM (figure 3c). The agreement is a direct consequence of SCH exhibited by the mRNA transcription and degradation reactions (cf. (3.5)). However, the expressions for the predicted mRNA–protein covariance and protein variance disagree with their EDM counterpart since the reactions involving the protein violate SCH. To explore this dependence, we compare the corresponding coefficients of variation of both models (figure 3d). The EDM overestimates cell-to-cell variation of small cells but underestimates it for large cells. Moreover, the EDM’s coefficient of variation decreases monotonically with cell size, but this is not the case for the ABM.

Strikingly, the coefficient of variation peaks as cells progress through the cell cycle (figure 3d, solid red line), which is in excellent agreement with the ABM simulations (blue dots) but is not seen in the EDM (solid grey). This can be seen directly from equation (3.11) for which protein fluctuations can be approximated in the limit of fast mRNA degradation (δ ≫ 1) as

{CV}_{ABM}^{2} [p | s, s_{0}] \approx \frac{1}{E_{Π} [p | s]} (1 + b (2 - \frac{s_{0}}{s} \frac{4}{3})),

3.12

which has a maximum at a cell size of s = s₀ 8b/(3(2b + 1)) as confirmed by agent-based simulations (figure 3d). Depending on the burst size b, the peak shifts from s = s₀ for b = 3/2 to s = 4/3 s₀ for b ≫ 1. The qualitative difference between the scalings of gene expression noise with cell size of EDMs and ABMs manifests the breakdown of the EDM, which is observed both for concentrations and molecule numbers since their coefficients of variation coincide when considering cells of the same size.

3.3.1. Effect of cell size control on gene expression dynamics

Next, we ask how fluctuations in the cell size control affect gene expression noise. It may be intuitively expected that noise in cell size control and division errors cause variable birth sizes, variable division times and hence noisy expression levels. mRNA fluctuations in the gene expression model with transcriptional size-scaling (2.13) obey SCH and hence are unaffected by these noise sources. The effect on protein noise remains yet to be elucidated.

To this end, we assume small birth-size variations and approximate the actual birth size with an averaged estimate $E_{Π} [s_{0} | s]$ of the retrospective birth size for a cell of size s. The covariance matrix (or any other moment) can then be approximated as

Σ (s) = E_{Π} [Σ (s, s_{0}) | s] \approx Σ (s, E_{Π} [s_{0} | s])) .

3.13a

This simplification can formally be justified through a saddle-point approximation as the joint distribution $Π (s, s_{0})$ has a maximum at $Π (s, E_{Π} [s_{0} | s])$ ). Generally no analytical expression of $E_{Π} [s_{0} | s]$ can be derived from equation (2.24) in the presence of cell size control fluctuations, however, and we approximate $E_{Π} [s_{0} | s]$ by a matched asymptotic expansion (appendix C):

E_{Π} [s_{0} | s] \approx {\bar{s}}_{0} - \underset{small cells}{\underset{⏟}{σ \frac{\sqrt{2 / π} e^{- ({(s - {\bar{s}}_{0})}^{2} / 2 σ^{2})}}{1 + \erf ((s - {\bar{s}}_{0}) / \sqrt{2} σ)}}} + \underset{large cells}{\underset{⏟}{2 a σ^{2} γ (s - a {\bar{s}}_{0})}},

3.13b

which holds for the linear cell size control model (2.19). The first term is the average birth size in the absence of cell size control fluctuations, the second term denotes the contributions from small cells, while the third term stems from large cells. The interpretation of this conditional expectation is that small cells were born with sizes smaller than average while larger cells were born with sizes above average depending on their size control (figure 6 in appendix C). The parameters in equation (3.13b) are given by the mean birth size ${\bar{s}}_{0}$ and variance σ² in a backward lineage tracing the ancestors of a random cell in the population (see [50] for details):

\begin{matrix} {\bar{s}}_{0} = \frac{(2 - a) (1 + {CV}_{θ}^{2})}{2 - a (1 + {CV}_{θ}^{2})}, \\ σ^{2} = {\bar{s}}_{0}^{2} \frac{{CV}_{Δ}^{2} (1 + 3 {CV}_{θ}^{2}) {(2 - a (1 + {CV}_{θ}^{2}))}^{2} + 4 {CV}_{θ}^{2} (1 - {CV}_{θ}^{2})}{{({CV}_{θ}^{2} + 1)}^{2} (4 - a^{2} (1 + 3 {CV}_{θ}^{2}))} . \end{matrix}}

3.13c

Equation (3.13) provide a closed-form approximation of the cell size dependence of any given moment accurate to order O(σ³).

Figure 6. — Retrospective averages of birth size. Matched asymptotic expansions of $E_{Π} [s_{0} | s]$ (equation (3.13b), lines) for the adder size control (a = 1, s₀ = 1) and agent-based simulations (dots) for (a) varying size control noise CV[Δ] and (b) division errors CV[θ].

To verify the accuracy of proposed approximation, we test the theory for various strengths of size control noise and division errors (figure 4). We observe that increasing cell size noise results in the monotonic decrease of gene expression noise with cell size (figure 4a–d) in good agreement with ABM simulations, even for large cell size fluctuations. We further ask about the effects of partitioning noise, which shows a similar dependence but agrees less well with the ABM simulations for cells smaller than the mean birth size (figure 4e–h), presumably since the effect of large variability in birth sizes is not captured in our approximation. Nevertheless, the present approximation qualitatively captures the overall cell size dependence of the ABM simulations (figure 4). Our findings confirm that birth size variation contributes significantly to the cell size dependence of gene expression noise of networks lacking SCH.

3.4. ENMs provide surprisingly accurate approximations of total noise in ABMs lacking SCH

We go on to compare the ENM introduced in §2.1 with the ABM. In contrast to the EDM, the ENM predicts the total noise statistics including the variability introduced by the cell size distribution. It is clear that the ENM agrees exactly with the ABM whenever the network obeys SCH. In particular, the marginal factorial-moment generating function of the ABM’s molecule numbers $G (z) = E_{Π} [\prod_{i = 1}^{N} z_{i}^{x_{i}}]$ (irrespective of cell size) follows from equation (3.1) as

G (z) = E_{χ} [\prod_{i = 1}^{N} K (κ_{i} (z_{i} - 1))],

3.14

when the concentration distribution χ has been identified (as we did for the models in §3.1) and the moment-generating function $K (t) = E_{Π} [e^{t s}]$ of the cell size distribution (2.24) is known. When SCH does not hold, the ABM statistics can in principle be obtained through integrating equation (3.9) against the size distribution $Π (s, s_{0})$ . Specifically, denoting molecule numbers by x and concentrations by X = x/s, as before, we have

\begin{matrix} {Cov}_{Π} [X] = E_{Π} [\frac{Σ (s, s_{0})}{s^{2}}], \\ {Cov}_{Π} [x] = E_{Π} [Σ (s, s_{0})] + {Var}_{Π} (s) \bar{X} {\bar{X}}^{T}, \end{matrix}}

3.15

where $Σ (s, s_{0})$ is the size-dependent covariance matrix discussed in §3.3.

To illustrate this dependence, we consider the gene expression model with transcriptional size-scaling (2.13) and integrate equation (3.11) numerically against the size distribution (2.24). We observe that the mRNA noise-mean relationship of the ABM follows exactly the ENM predictions when the mean varies through the transcription rate (figure 5a). This agreement is confirmed for various strengths of cell size control fluctuations and division errors, both for mRNA concentrations and numbers, which validates our theoretical predictions that the mRNA distribution satisfies SCH and hence the ENM is exact.

However, the protein noise–mean relationships of the ABM and ENM differ (figure 5b). The discrepancy, albeit small, exists even for deterministic divisions ( ${CV}_{φ} [Δ] = {CV}_{π} [θ] = 0$ ) for which the averages (3.15) over the size distribution can be carried out analytically and result in:

\begin{matrix} {CV}_{ABM}^{2} [P] = \frac{1}{Ω_{P} \bar{P}} [1 + \frac{2 b (12 - (24 (2^{δ - 1} - 1)) / ((2^{δ + 1} - 1) (δ - 1)) + 13 δ)}{27 (δ + 2)}], \\ {CV}_{ABM}^{2} [p] = {CV}_{Π}^{2} [s] \\ + \frac{1}{Ω_{p} \bar{P}} [1 + \frac{2 b ((2 (2^{δ - 1} - 1) / ((2^{δ + 1} - 1) (δ - 1))) + δ (\ln (8) - 1) - 1)}{3 δ \ln (2)}], \end{matrix}}

3.16

where $Ω_{P} = E_{Π} {[s^{- 1}]}^{- 1}$ and $Ω_{p} = E_{Π} [s]$ and δ, b are defined as in equation (2.16). It can be verified by optimizing (3.16) over δ that the ENM underestimates ABM noise of protein numbers by at most $2 %$ , while it overestimates noise in protein concentrations by the same amount. The difference between the ENM and ABM predictions increases with cell size control noise for concentration measures but appears to be practically independent of cell size control noise for protein number fluctuations (figure 5b, insets). The protein number noise, but not concentration noise, exhibits an extrinsic noise floor ( ${CV}_{Π}^{2} [s]$ in (3.16)) for large mean numbers due to extrinsic cell size variability across the population and this noise floor increases with noise in size control (CV_φ[Δ]) and division errors ( ${CV}_{π} [θ]$ ) as it is also predicted by the ENM (figure 5b).

Similar conclusions hold for the noise–mean relationship when the mean is varied through translation rate (figure 5c) but there appears an additional intrinsic noise floor due to stochastic bursting (figure 5c and equation (3.16)) that is present both in the protein concentration and number noise. This phenomenon is in qualitative agreement with previous findings [27,35,40] and similarly predicted by the ENM (2.15). Presumably, the better quantitative agreement for molecule numbers as compared to concentrations (figure 5b,c) is due the fact that the ENM and ABM predictions are dominated by extrinsic noise, which has the same effect in both models. Our observations suggest that, surprisingly, the ENM provides much more accurate approximations of the ABM statistics than the EDM.

4. Discussion

We presented an agent-based framework to study gene expression noise coupled to cell size dynamics across growing and dividing cell populations. The framework consists of an exact algorithm for simulating the stochastic dynamics of dividing cells (box 1), which generalizes previous algorithms for isolated lineages [4,17,32,60–62] towards growing cell populations, and a master equation framework (§2) that exactly characterizes the snapshot-distribution of gene expression and cell size across such a agent-based population.

Our theory shows that the newly defined SCH (cf. §3.1 and theorem A.2) provides a necessary and sufficient condition for the stationary distributions of the chemical master equation (EDMs and ENMs) to agree exactly with the snapshot distributions of detailed ABMs. A broad class of gene networks (3.5), involving mono-molecular reactions, multi-state promoters and bursting, satisfy SCH irrespective of network parameters when the reaction rates scale with cell size according to the law of mass-action. SCH is however not restricted to this particular class of network and can generally be checked on a case-by-case basis using the generating function equations, which can be accomplished without solving the chemical master equation analytically (see appendix A).

SCH guarantees that gene expression distributions for cells of a given size are entirely independent of extrinsic noise sources affecting birth size such as cell size control and division noise. They thus reveal whether a network embedded in a growing cell can be insulated against such noise sources, an important feature that can guide the design of synthetic circuits.

Nevertheless, most gene regulatory networks of interest do not obey SCH. To address this issue, we developed the linear noise approximation for ABMs lacking SCH embedded in growing and dividing cells. The theory provides ODEs for the mean molecule numbers (3.6) and their covariances (3.7), which—unlike the conventional linear noise approximation describing EDMs [6,43]—describe their evolution across sizes and features a boundary condition describing the stochastic partitioning of molecules at cell division. We showed that, when the reactions follow mass-action size-scaling, the mean concentrations of EDMs and ABMs agree, because they exhibit concentration homeostasis (§3.2). The theory further provides closed-form analytical expressions for the covariance matrix of gene expression fluctuations in the absence of SCH. We note that like the conventional linear noise approximation, the linear noise approximation for ABMs is exact for linear reaction networks but represents an approximation for nonlinear reaction networks (§3.3).

While the EDM always predicts birth-size-independent noise, the ABM’s covariance matrices generally depend on it (§3.3.1), both for concentrations and molecule numbers. This means that, unlike in SCH conditions, size-control noise and division errors can propagate to gene expression levels, and we unveiled quantitative and qualitative differences between EDMs and ABMs regarding the dependence of expression noise on cell size. Such differences prevail even for relatively simple gene networks involving protein expression (see (2.13)) for which our linear noise approximation readily provides exact expressions for mean and noise statistics.

Despite these discrepancies, we found that the ENM of these simple gene expression models provides surprisingly accurate total noise estimates (§3.4). In fact, we showed analytically that the ENM (and EDM) of bursty production with translational size-scaling agrees exactly with the ABM since it obeys SCH. For transcriptional size-scaling, which implies the absence of SCH, the ENM deviates at most a few per cent from ABM’s total protein noise prediction. To resolve such small differences experimentally, one would need to probe on the order of 10 000 cells for measuring the squared coefficient of variation accurate to three leading digits (assuming sampling errors inversely proportional to cell number), which is achievable only with high-throughput techniques.

An outstanding question is whether the good agreement we observed is specific to the particular model or parameter values we have chosen or whether the ENM is more generally valid. We have made an initial step in this direction by providing closed-form expressions for the ABM’s linear noise approximation of any single-species reaction network with deterministic size-distribution (appendix D). These results demonstrate that the ENM overestimates the ABM’s coefficient of variation of concentrations by at most $8 %$ but underestimates it by at most $2 %$ , and vice versa for molecule numbers, and these bounds hold independently of the choice of parameters. This suggests that ENMs could be surprisingly accurate approximation of ABMs. Other effective models of bursty protein production without any size-scaling as proposed in [63] cannot obey SCH since they ignore cell size and generally produce larger errors than the ENM even for deterministic cell cycles.

A limitation of our study is that we assumed the validity of the linear noise approximation for the noise statistics of networks lacking SCH. Mean and covariance of the linear noise approximation are exact for linear reaction networks, as those we have studied here, but it represents an approximation for networks with nonlinear propensities valid in the limit of large molecule numbers. To improve the estimates of our theory, one could consider higher-order terms in the system size expansion [44], resort to moment-closure approximations [64] or to compute moment bounds [65] for nonlinear reaction networks.

Another limitation is that we neglected growth rate variability, which is a significant source of noise at the single-cell level [49]. It would be interesting to include these features in our ABMs, compare them to the effective models, and investigate whether SCH can be generalized to this case. Previous studies [3] have investigated the dependence of gene expression noise on growth rate dynamics in isolated lineages using small noise approximations similar to the one used here. Nevertheless, it may be expected that selection plays a pronounced role in populations where cells compete for growth unlike in isolated lineages, which in turn may lead to significant deviations of ENMs from ABMs [38,66,67] that we have not studied here.

In summary, we proposed SCH as a general condition for exactness of EDMs. In the absence of SCH, we found that despite qualitative differences in the predictions of EDMs, ENMs closely approximate the total noise statistics of ABMs. Our results reinstate the validity of effective models as approximations of the agent-based dynamics, and thus they significantly extend the scope of state-of-the-art master equation methods to a broad range of single-cell analyses in growing cell populations.

Appendix A. Stochastic concentration homeostasis and the validity of EDMs and ENMs

Appendices A and B use multi-index notation. In brief, a multi-index is a N-tuple α = (α₁, α₂, …, α_N). One defines powers of a vector x via $x^{α} = x_{1}^{α_{1}} x_{2}^{α_{2}} \dots x_{N}^{α_{N}}$ , derivatives $\partial_{x}^{α} = \frac{\partial^{α_{1}}}{\partial x^{α_{1}}} \frac{\partial^{α_{2}}}{\partial x^{α_{2}}} \dots \frac{\partial^{α_{N}}}{\partial x^{α_{N}}}$ , sum of components |α| = α₁ + α₂ + · · · + α_N, and the factorials α! = α₁! · α₂! · · · α_N! and analogously $((\binom{α}{β})) = α! / β! (α - β)!$ .

Definition A.1 (Stochastic concentration homeostasis). —

A probability mass function $Π (x | s)$ with state space $X$ obeys SCH if for all s ∈ [0, ∞) there exists a random variable κ on $K$ with density χ(κ) satisfying:

$Π (x | s) = \int_{K} d κ χ (κ) \frac{{(κ s)}^{x}}{x!} e^{- s | κ |}, \forall x \in X .$ A 1

Definition A 1 implies that if κ has a moment-generating function $F (t) = E_{χ} [e^{t^{T} κ}]$ then using (A 1) one finds the factorial-moment generating function

\begin{aligned} E_{Π} [z^{x} | s] & = \sum_{x \in X} z^{x} Π (x | s) = \int_{K} d κ χ (κ) e^{- s | κ |} \sum_{x \in X} z^{x} \frac{{(κ s)}^{x}}{x!} \\ = E_{χ} [e^{s {(z - 1)}^{T} κ}] = F (s (z - 1)) \end{aligned}

A 2

and similarly the factorial moments

μ_{n} (s) = E_{Π} [\frac{x!}{(x - n)!} | s] = {(\partial_{z}^{n} E_{Π} [z^{x} | s])}_{z = 1} = s^{| n |} E_{χ} [κ^{n}] .

A 3

for a multi-index n. Since the factorial-moment generating function uniquely determines the distribution, equations (A 2) and (A 3) may equivalently serve as definitions of SCH. Furthermore, when {x(s)}_s∈[0,∞) is interpreted as a point processes along the size coordinate s, SCH emphasizes the fact that it is mixed Poisson process with stationary concentration vector κ.

Theorem A.2. —

Assume that the partitioning kernel B(x|x′, θ) is binomial with probability θ given by the ratio of daughter birth size and mother division size. A stationary solution of the EDM (2.4), if it exists, is also a solution of the ABM (2.25):

$Π (x | s, s_{0}) = Π_{EDM} (x | s),$ A 4

if and only if the EDM’s solution (2.4) obeys SCH.

The utility of the theorem is that SCH can be checked without solving the chemical master equation. We demonstrate this aspect for a general reaction network of the form (2.1) with mass-action propensities $w_{r} (x, s) = s^{1 - | ν_{r}^{-} |} k_{r} (x! / (x - ν_{r}^{-})!)$ , whose factorial-moment generating function (see ch. 7 in [68]) obeys:

α s \frac{\partial}{\partial s} G (z | s, s_{0}) = \sum_{r = 1}^{R} k_{r} s^{1 - | ν_{r}^{-} |} (z^{ν_{r}^{+}} - z^{ν_{r}^{-}}) \partial_{z}^{ν_{r}^{-}} G (z | s, s_{0})

A 5

Substituting G(z|s, s₀) = F(s(z − 1)) and x = s(z − 1) gives

α x \cdot \nabla F (x) = \sum_{r = 1}^{R} k_{r} s ({(x + s)}^{ν_{r}^{+}} s^{- | ν_{r}^{+} |} - {(x + s)}^{ν_{r}^{-}} s^{- | ν_{r}^{-} |}) \partial_{x}^{ν_{r}^{-}} F (x) .

It can now be seen that the right-hand side of the above equation is independent of s if either (i) $| ν_{r}^{-} | = 0$ and $| ν_{r}^{+} | = 1$ , (ii) $| ν_{r}^{-} | = 1$ and $| ν_{r}^{+} | = 0$ , or (iii) $| ν_{r}^{-} | = | ν_{r}^{+} | = 1$ . Thus, the EDM and the ABM solutions coincide for mass-action reaction networks (2.1) when they comprise only the mono-molecular reactions given in (3.5).

Similarly, we check that adding bursty reactions of the form $Ø \overset{k}{⟶} m_{s} \times X$ leads to a generating function equation

α s \frac{\partial}{\partial s} G (z | s, s_{0}) = k [g (z | s) - 1] G (z | s, s_{0}) + \dots,

A 6

where $g (z | s) = E [z^{m_{s}} | s]$ is the factorial-moment generating function of the burst distribution. Equation (A 6) transforms to $α x \cdot \nabla F (x) = k [g (x / s + 1 | s) - 1] F (x)$ after substituting G(z|s, s₀) = F(s(z − 1)) and x = s(z − 1). Hence, bursty reactions obey SCH if and only if the burst distribution obeys SCH, i.e. if there exist a moment-generating function f satisfying f(x) = g(x/s + 1|s) as in (A 2).

Appendix B. Proof of theorem A.2

The proof of theorem A.2 is divided in three steps. The first step shows that a general condition (B 1) guarantees snapshot distributions that are independent of birth size. We then show that (B 1) satisfies the effective dilution model and reduces to SCH for binomial partitioning at cell division. The proof also clarifies that the assumption of binomial partitioning cannot be removed under biological constraints conserving the total number of molecule numbers at cell division. General conditions for the existence of the EDM’s stationary distributions have been discussed in [69].

B.1. Step 1: Distributions invariant of birth size

The conditional distribution $Π (x | s, s_{0})$ is independent of birth size s₀ if and only if

Π (x | θ s, s_{0}) = \sum_{x^{'}} B (x | x^{'}, θ) Π (x^{'} | s, s_{0}),

B 1

where B(x|x′, θ) is the partitioning kernel in equation (2.25b) that depends only the inherited size fraction θ = s₀/s′. This fact can be verified using (B 1) in the boundary condition (2.25b), which leads to

Π (x | s_{0}, s_{0}) = \int_{0}^{\infty} d s^{'} \int_{0}^{s^{'}} d s_{0}^{'} ρ (s^{'}, s_{0}^{'} | s_{0}) Π (x | s_{0}, s_{0}^{'}) .

This implies that $Π (x | s, s_{0})$ must be independent of birth size

Π (x | s, s_{0}) = Π (x | s) .

In the following, we show that under condition (B 1) $Π (x | s)$ coincides with the EDM solution.

B.2. Step 2: Transformation into an effective dilution model

Let us denote the factorial-moment generating function of the partitioning kernel by $G_{B} (z | x^{'}, θ) = \sum_{x} z^{x} B (x | x^{'}, θ)$ such that the invariance condition (B 1) becomes

G (z | θ s) = \sum_{x^{'}} G_{B} (z | x^{'}, θ) Π (x^{'} | s) .

B 2

Assume that additionally

θ \partial_{θ} G_{B} (z | x^{'}, θ) = \sum_{i = 1}^{N} (z_{i} - 1) \partial_{z_{i}} G_{B} (z | x^{'}, θ),

B 3

which holds for the binomial partition kernel G_B(z|x′, θ) = (1 − θ + θz)^x′. Differentiating equation (B 2) with respect to θ then gives

\begin{aligned} θ \partial_{θ} G (z | θ s) & = \sum_{x^{'}} θ \partial_{θ} G_{B} (z | x^{'}, θ) Π (x^{'} | s) \\ = \sum_{i = 1}^{N} (z_{i} - 1) \partial_{z_{i}} G (z | θ s), \end{aligned}

B 4

where in the last line we used assumption (B 3). Changing variables (θs → s) in (B 4) yields

s \partial_{s} G (z | s) = \sum_{i = 1}^{N} (z_{i} - 1) \frac{\partial}{\partial z_{i}} G (z | s),

or equivalently the EDM

\begin{aligned} (α s \frac{\partial}{\partial s}) Π (x | s) \\ = - α \sum_{i = 1}^{N} [(x_{i} + 1) Π (x_{1}, \dots, x_{i} + 1, \dots, x_{N} | s) - x_{i} Π (x | s)] \\ = - D Π (x | s) . \end{aligned}

B 5

Using the above relation, we see that (2.25a) coincides with (2.4) and (A 4) follows.

B.3. Step 3: SCH and the necessity of binomial partitioning

Finally, we show that condition (B 3) required for the validity of the EDM implies independent binomial partitioning of molecules. (B 3) is a linear PDE that can be solved using the method of characteristics, which leads to

θ \frac{\partial z_{i}}{\partial θ} = (1 - z_{i}), θ \frac{\partial G_{B}}{\partial θ} = 0.

The general solution is G_B(z|x′, θ) = J(1 − θ + θz) where the function J is fixed by the condition that for θ = 1 all molecules are partitioned deterministically, i.e. J(z) = z^x′. Hence, we obtain

G_{B} (z | x^{'}, θ) = {(1 - θ + θ z)}^{x^{'}},

which corresponds to independent binomial partitioning of molecules in (2.20). It then follows that (B 2) (and (B 1)) are equivalent to

G (z | θ s) = G ((1 - θ) + θ z | s) .

B 6

Finally, we show that (B 6) is equivalent to G(z|s) = F(s(z − 1)) for binomial partitioning. Expanding (B 6) around z = 1 and identifying the series coefficients with the factorial moments $μ_{n} (s)$ in (A 3), we find that the factorial moments are homogeneous functions of order $| n | = \sum_{i} n_{i}$ : μ_n(θs) = θ^|n|μ_n(s). Then by Euler’s homogeneous function theorem, it follows that the factorial moments with index n, satisfy s(∂/∂s)μ_n(s) = |n|μ_n(s) and hence μ_n(s) = s^|n|μ_n(1). This implies that the factorial-moment generating function is

G (z | s) = \sum_{n} s^{| n |} μ_{n} (1) \frac{{(z - 1)}^{n}}{n!} = F (s (z - 1)),

with $F (x) = \sum_{n} x^{n} μ_{n} (1) / n!$ . It remains to be shown that F is indeed a moment-generating function. To this end, we note that $\frac{\partial^{k} G (z | s)}{\partial z^{k}} = s^{| k |} F^{(k)} (s (z - 1)) \geq 0$ for z ∈ (1, −∞) and hence F(− x) is a completely monotone function on x ∈ (0, ∞), which implies that there exists a distribution χ for which $F (- x) = E_{χ} [e^{- s x}]$ is a Laplace transform, which concludes the proof of theorem A.2.

Appendix C. Approximation of birth size moments

We here derive an analytical approximation (3.13b) for the conditional birth size moments. We start by rewriting $E_{Π} [s_{0} | s]$ in terms of the backward lineage distribution ψ_bw using equation (2.24):

\begin{aligned} E_{Π} [s_{0} | s] & = \int_{0}^{s} d s_{0} s_{0} Π (s_{0} | s) \\ = \frac{\int_{0}^{s} d s_{0} s_{0} Π (s_{0}, s)}{\int_{0}^{s} d s_{0} Π (s_{0}, s)} = \frac{E_{ψ} [s_{0} Φ (s | s_{0}) 1_{s_{0} \leq s}]}{E_{ψ} [Φ (s | s_{0}) 1_{s_{0} \leq s}]} . \end{aligned}

C 1

We now apply matched asymptotic expansion to this expression.

C.1. Large cell asymptotics

For large cells s ≫ s₀, we can extend the range of integration in equation (C 1) and compute the expectation value as follows

\begin{aligned} E_{ψ} [f (s_{0}, s)] = \int_{0}^{\infty} d s_{0} ψ_{b w} (s_{0}) f (s_{0}, s) \\ = \int_{0}^{\infty} d s_{0} \int_{- \infty}^{\infty} \frac{d k}{2 π} e^{- i k (s_{0} - {\bar{s}}_{0})} (1 - \frac{k^{2} σ^{2}}{2}) f (s_{0}, s) + O (σ^{3}) \\ = f ({\bar{s}}_{0}, s) + \frac{σ^{2}}{2} \frac{\partial^{2} f ({\bar{s}}_{0}, s)}{\partial {\bar{s}}_{0}^{2}} + O (σ^{3}) . \end{aligned}

Using $f (s_{0}, s) = s_{0} Φ (s | s_{0})$ and $f (s_{0}, s) = Φ (s | s_{0})$ in equation (C 1), the conditional moments of birth size can be approximated by

\begin{aligned} E_{Π}^{large} [s_{0} | s] & = {\bar{s}}_{0} (1 + \frac{2 σ^{2}}{{\bar{s}}_{0}} \frac{\partial \ln Φ (s | {\bar{s}}_{0})}{\partial {\bar{s}}_{0}}) + O (σ^{3}) \\ = ({\bar{s}}_{0} + 2 a σ^{2} γ (s - a {\bar{s}}_{0})) + O (σ^{3}), \end{aligned}

where the last equality follows from γ(s, s₀) = γ(s − as₀) for the linear cell size control model (2.19), and ${\bar{s}}_{0}$ and σ are the mean and standard deviation of the backward lineage distribution ψ_bw given by equations (3.13c).

C.2. Small cell asymptotics

Next we consider small cells by noting that $Φ (s | s_{0})$ is practically constant when s ≈ s₀, the integral in (C 1) can be approximated by

E_{Π} [s_{0} | s] \approx \frac{E_{ψ} [s_{0} 1_{s_{0} \leq s}]}{E_{ψ} [1_{s_{0} \leq s}]} .

C 2

Assuming that ψ, is approximately Gaussian with mean ${\bar{s}}_{0}$ and variance σ², we find that near $s \approx {\bar{s}}_{0}$ , we have

\begin{matrix} E_{ψ} [s_{0} 1_{s_{0} \leq s}] \approx \frac{1}{2} {\bar{s}}_{0} (1 + \erf (\frac{s - {\bar{s}}_{0}}{\sqrt{2} σ})) - \frac{σ e^{- ({(s - {\bar{s}}_{0})}^{2}) / 2 σ^{2}}}{\sqrt{2 π}} \\ ann & E_{ψ} [1_{{\bar{s}}_{0} \leq s}] \approx \frac{1}{2} (1 + \erf (\frac{s - {\bar{s}}_{0}}{\sqrt{2} σ})) \end{matrix}} (C 3)

C 3

and hence

E_{Π}^{small} [s_{0} | s] = {\bar{s}}_{0} - \frac{\sqrt{2 / π} σ e^{- ({(s - {\bar{s}}_{0})}^{2}) / 2 σ^{2}}}{1 + \erf ((s - {\bar{s}}_{0}) / \sqrt{2} σ)} + O (σ^{3}),

which is accurate to order σ³.

C.3. Global asymptotics

The two asymptotic solutions can be matched at the boundary layer. Since

lim_{s \to \infty} E_{Π}^{small} [s_{0} | s] = lim_{s \to s_{0}} E_{Π}^{large} [s_{0} | s] = {\bar{s}}_{0},

the uniformly valid matched asymptotic expansion is

E_{Π} [s_{0} | s] \approx E_{Π}^{small} [s_{0} | s] + E_{Π}^{large} [s_{0} | s] - {\bar{s}}_{0},

which gives equation (3.13b) (figure 6).

Appendix D. Analytical solutions and error bounds using the linear noise approximation for deterministic cell division

We begin by outlining the solution of (3.7). Defining $\tilde{Σ} (s, s_{0}) = U^{- 1} Σ (s, s_{0}) U^{- †}$ , equation (3.7) of the main text becomes

α s \partial_{s} {\tilde{Σ}}_{i j} = (λ_{i} + λ_{j}^{*}) {\tilde{Σ}}_{i j} + s {\tilde{D}}_{i j}

D 1

and

4 {\tilde{Σ}}_{i j} (s_{0}, s_{0}) = {\tilde{Σ}}_{i j} (2 s_{0}, s_{0}) + 2 s_{0} {\tilde{X}}_{i j} .

D 2

Equation (D 1) has the solution

{\tilde{Σ}}_{i j} (s, s_{0}) = c_{i j} s^{(λ_{i} + λ_{j}^{*}) / α} + \frac{{\tilde{D}}_{i j} s}{α (1 - (λ_{i} + λ_{j}^{*}) / α)},

D 3

where the constants c_ij are fixed using the boundary condition (D 2) which gives the solution of the EDM, equation (3.9) of the main text.

Using equation (3.15) and averaging (D 3) over the deterministic size distribution (3.8), we find the covariance matrix of concentrations X,

\begin{aligned} {Cov}_{Π} [X] & = \frac{1}{Ω} \sum_{i, j = 1}^{N} {\hat{u}}_{i} {\hat{u}}_{j}^{†} \frac{β_{i j} (\coth ((1 / 2) β_{i j} \ln 2) + 3)}{3 (β_{i j} + 1)} \frac{α {\tilde{X}}_{i j}}{ξ_{i j}} \\ + \frac{β_{i j} (3 β_{i j} - c o t h ((1 / 2) β_{i j} \ln 2))}{3 (β_{i j}^{2} - 1)} \frac{{\tilde{D}}_{i j}}{ξ_{i j}}, \end{aligned}

D 4

where $ξ_{i j} = 2 α - λ_{i} - λ_{j}^{*}$ , β_ij = ξ_ij/α, $Ω^{- 1} = E_{Π} [s^{- 1}] = (3 / 4) 1 / s_{0}$ , and ${\hat{u}}_{i}$ are the eigenvectors of the Jacobian $J$ introduced before equation (3.9). Similarly, considering molecule numbers x, we have ${Cov}_{Π} [x] = Σ_{ABM}^{int} + {Var}_{Π} (s) \bar{X} {\bar{X}}^{T}$ where the intrinsic noise contribution is given by

\begin{aligned} Σ_{ABM}^{int} & = Ω \sum_{i, j = 1}^{N} {\hat{u}}_{i} {\hat{u}}_{j}^{†} [\frac{(2^{β_{i j}} - 2) β_{i j}}{(2^{β_{i j}} - 1) (β_{i j} - 1) \ln 4} \frac{α {\tilde{X}}_{i j}}{ξ_{i j}} \\ + \frac{β_{i j} ((2^{β_{i j}} - 1) β_{i j} \ln 4 - 2^{β_{i j}} (1 + \ln 4) + 2 + \ln 4)}{(2^{β_{i j}} - 1) {(β_{i j} - 1)}^{2} \ln 4} \frac{{\tilde{D}}_{i j}}{ξ_{i j}}], \end{aligned}

D 5

with $Ω = E_{Π} [s] = s_{0} \ln 4$ .

The expressions greatly simplify for a single species since $\hat{u} = {\hat{u}}_{i}$ , β = β_ij and ξ = ξ_ij. We note that in this case (D 4) increases monotonically with β while (D 5) decreases monotonically with β. Using the limits β → 0 and β → ∞, we find that the ABM’s coefficients of variation can be bounded by the EDM’s coefficients:

\frac{6}{7} \leq \frac{{CV}_{ENM}^{2} [X]}{{CV}_{ABM}^{2} [X]} \leq \frac{3}{2} \ln 2,

D 6

and

2 \ln^{2} (2) \leq \frac{{CV}_{ENM}^{2} [x]}{{CV}_{ABM}^{2} [x]} \leq \frac{4 \ln 2}{1 + \ln 4},

D 7

where we have used the fact that $D (\bar{X}) \geq α \bar{X}$ and the ENM solution of (2.9) is $Σ_{ENM}^{int} = α \bar{X} / Ω ξ + (D / Ω ξ)$ . The result implies that the ENM overestimates the ABM’s coefficient of variation of concentrations by at most $8 %$ but underestimates it by at most $2 %$ , and vice versa for molecule numbers.

Endnote

This follows from the fact that F(x) = ₂F₁(a, λ; γ + λ; b x) is the moment-generating function of κ ∼ Gamma(a, (1 + b r)⁻¹) and r ∼ Beta(λ, γ). Letting $E_{Π} [z^{p} | s] = F (s (z - 1))$ , a = a₋/α, λ = a₋α and λ + γ = (k_on + k_off)/α then yields the factorial-moment generating function solutions in [54,56].

Data accessibility

An implementation of the First-Division Algorithm (box 1) in Julia is available at github.com/pthomaslab/fda.

Authors' contributions

P.T. and V.S. designed the study and interpreted the results. P.T. developed the theory and analysed the data.

Competing interests

We declare we have no competing interests.

Funding

This work has been supported by a UKRI Future Leaders Fellowship (grant no. MR/T018429/1) to P.T. and the EPSRC Centre for Mathematics of Precision Healthcare (grant no. EP/N014529/1).

References

1.Kiviet DJ, Nghe P, Walker N, Boulineau S, Sunderlikova V, Tans SJ. 2014. Stochasticity of metabolism and growth at the single-cell level. Nature 514, 376-379. ( 10.1038/nature13582) [DOI] [PubMed] [Google Scholar]
2.Bruggeman FJ, Teusink B. 2018. Living with noise: on the propagation of noise from molecules to phenotype and fitness. Curr. Opin. Syst. Biol. 8, 144-150. ( 10.1016/j.coisb.2018.02.010) [DOI] [Google Scholar]
3.Thomas P, Terradot G, Danos V, Weiße AY. 2018. Sources, propagation and consequences of stochasticity in cellular growth. Nat. Commun. 9, 1-11. ( 10.1038/s41467-017-02088-w) [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Bertaux F, Marguerat S, Shahrezaei V. 2018. Division rate, cell size and proteome allocation: impact on gene expression noise and implications for the dynamics of genetic circuits. R. Soc. Open. Sci. 5, 172234. ( 10.1098/rsos.172234) [DOI] [PMC free article] [PubMed] [Google Scholar]
5.Vargas-Garcia CA, Ghusinga KR, Singh A. 2018. Cell size control and gene expression homeostasis in single-cells. Curr. Opin. Syst. Biol. 8, 109-116. ( 10.1016/j.coisb.2018.01.002) [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Van Kampen NG. 1992. Stochastic processes in physics and chemistry, vol. 1. Amsterdam, The Netherlands: Elsevier. [Google Scholar]
7.Gillespie DT. 1977. Exact stochastic simulation of coupled chemical reactions. J. Phys. Chem. 81, 2340-2361. ( 10.1021/j100540a008) [DOI] [Google Scholar]
8.Gillespie DT. 1992. A rigorous derivation of the chemical master equation. Physica A 188, 404-425. ( 10.1016/0378-4371(92)90283-V) [DOI] [Google Scholar]
9.McAdams HH, Arkin A. 1997. Stochastic mechanisms in gene expression. Proc. Natl Acad. Sci. USA 94, 814-819. ( 10.1073/pnas.94.3.814) [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Thattai M, Van Oudenaarden A. 2001. Intrinsic noise in gene regulatory networks. Proc. Natl Acad. Sci. USA 98, 8614-8619. ( 10.1073/pnas.151588598) [DOI] [PMC free article] [PubMed] [Google Scholar]
11.Swain PS, Elowitz MB, Siggia ED. 2002. Intrinsic and extrinsic contributions to stochasticity in gene expression. Proc. Natl Acad. Sci. USA 99, 12 795-12 800. ( 10.1073/pnas.162041399) [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Lei X, Tian W, Zhu H, Chen T, Ao P. 2015. Biological sources of intrinsic and extrinsic noise in cI expression of lysogenic phage lambda. Sci. Rep. 5, 13597. ( 10.1038/srep13597) [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Tonn MK, Thomas P, Barahona M, Oyarzún DA. 2019. Stochastic modelling reveals mechanisms of metabolic heterogeneity. Commun. Biol. 2, 1-9. ( 10.1038/s42003-018-0242-0) [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Ingalls BP. 2013. Mathematical modeling in systems biology: an introduction. New York, NY: MIT press. [Google Scholar]
15.Charlebois DA, Balázsi G. 2019. Modeling cell population dynamics. In Silico Biol. 13, 21-39. ( 10.3233/ISB-180470) [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Lin J, Amir A. 2018. Homeostasis of protein and mRNA concentrations in growing cells. Nat. Commun. 9, 1-11. ( 10.1038/s41467-017-02088-w) [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Sun XM et al. 2020. Size-dependent increase in RNA polymerase II initiation rates mediates gene expression scaling with cell size. Curr. Biol. 30, 1217-1230. ( 10.1016/j.cub.2020.01.053) [DOI] [PubMed] [Google Scholar]
18.Hilfinger A, Paulsson J. 2011. Separating intrinsic from extrinsic fluctuations in dynamic biological systems. Proc. Natl Acad. Sci. USA 108, 12 167-12 172. ( 10.1073/pnas.1018832108) [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Bowsher CG, Swain PS. 2012. Identifying sources of variation and the flow of information in biochemical networks. Proc. Natl Acad. Sci. USA 109, E1320-E1328. ( 10.1073/pnas.1119407109) [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Kempe H, Schwabe A, Crémazy F, Verschure PJ, Bruggeman FJ. 2015. The volumes and transcript counts of single cells reveal concentration homeostasis and capture biological noise. Mol. Biol. Cell 26, 797-804. ( 10.1091/mbc.E14-08-1296) [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Blasi T, Buettner F, Strasser MK, Marr C, Theis FJ. 2017. cgCorrect: a method to correct for confounding cell–cell variation due to cell growth in single-cell transcriptomics. Phys. Biol. 14, 036001. ( 10.1088/1478-3975/aa609a) [DOI] [PubMed] [Google Scholar]
22.Nordholt N, Van Heerden J, Kort R, Bruggeman FJ. 2017. Effects of growth rate and promoter activity on single-cell protein expression. Sci. Rep. 7, 1-11. ( 10.1038/s41598-017-05871-3) [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Kafri R, Levy J, Ginzberg MB, Oh S, Lahav G, Kirschner MW. 2013. Dynamics extracted from fixed cells reveal feedback linking cell growth to cell cycle. Nature 494, 480-483. ( 10.1038/nature11897) [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Kuritz K, Stöhr D, Maichl DS, Pollak N, Rehm M, Allgöwer F. 2020. Reconstructing temporal and spatial dynamics from single-cell pseudotime using prior knowledge of real scale cell densities. Sci. Rep. 10, 1-10. ( 10.1038/s41598-020-60400-z) [DOI] [PMC free article] [PubMed] [Google Scholar]
25.Zopf C, Quinn K, Zeidman J, Maheshri N. 2013. Cell-cycle dependence of transcription dominates noise in gene expression. PLoS Comput. Biol. 9, e1003161. ( 10.1371/journal.pcbi.1003161) [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Walker N, Nghe P, Tans SJ. 2016. Generation and filtering of gene expression noise by the bacterial cell cycle. BMC Biol. 14, 11. ( 10.1186/s12915-016-0231-z) [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Schwabe A, Bruggeman FJ. 2014. Contributions of cell growth and biochemical reactions to nongenetic variability of cells. Biophys. J. 107, 301-313. ( 10.1016/j.bpj.2014.05.004) [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Antunes D, Singh A. 2015. Quantifying gene expression variability arising from randomness in cell division times. J. Math. Biol. 71, 437-463. ( 10.1007/s00285-014-0811-x) [DOI] [PubMed] [Google Scholar]
29.Soltani M, Vargas-Garcia CA, Antunes D, Singh A. 2016. Intercellular variability in protein levels from stochastic expression and noisy cell cycle processes. PLoS Comput. Biol. 12, e1004972. ( 10.1371/journal.pcbi.1004972) [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Johnston IG, Gaal B, Enver T, Iborra FJ, Jones NS. 2012. Mitochondrial variability as a source of extrinsic cellular noise. PLoS Comput. Biol. 8, e1002416. ( 10.1371/journal.pcbi.1002416) [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Luo R, Ye L, Tao C, Wang K. 2013. Simulation of E. coli gene regulation including overlapping cell cycles, growth, division, time delays and noise. PLoS ONE 8, e62380. ( 10.1371/journal.pone.0062380) [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Gomez D, Marathe R, Bierbaum V, Klumpp S. 2014. Modeling stochastic gene expression in growing cells. J. Theor. Biol. 348, 1-11. ( 10.1016/j.jtbi.2014.01.017) [DOI] [PubMed] [Google Scholar]
33.Johnston IG, Jones NS. 2015. Closed-form stochastic solutions for non-equilibrium dynamics and inheritance of cellular components over many cell divisions. Proc. R. Soc. A 471, 20150050. ( 10.1098/rspa.2015.0050) [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Nieto CA, Garcia CAV, Sanchez C, Arias-Castro JC, Pedraza JM. 2020. Correlation between protein concentration and bacterial cell size can reveal strategies of gene expression. Phys. Biol. 17, 045002. ( 10.1088/1478-3975/ab891c) [DOI] [PubMed] [Google Scholar]
35.Perez-Carrasco R, Beentjes C, Grima R. 2020. Effects of cell cycle variability on lineage and population measurements of messenger RNA abundance. J. R. Soc. Interface 17, 20200360. ( 10.1098/rsif.2020.0360) [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Berg OG. 1978. A model for the statistical fluctuations of protein numbers in a microbial population. J. Theor. Biol. 71, 587-603. ( 10.1016/0022-5193(78)90326-0) [DOI] [PubMed] [Google Scholar]
37.Brenner N, Farkash K, Braun E. 2006. Dynamics of protein distributions in cell populations. Phys. Biol. 3, 172. ( 10.1088/1478-3975/3/3/002) [DOI] [PubMed] [Google Scholar]
38.Thomas P. 2017. Making sense of snapshot data: ergodic principle for clonal cell populations. J. R. Soc. Interface 14, 20170467. ( 10.1098/rsif.2017.0467) [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Thomas P. 2019. Intrinsic and extrinsic noise of gene expression in lineage trees. Sci. Rep. 9, 474. ( 10.1038/s41598-018-35927-x) [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Jedrak J, Ochab-Marcinek A. 2020. Contributions to the ‘noise floor’ in gene expression in a population of dividing cells. Sci. Rep. 10, 1-13. ( 10.1038/s41598-020-69217-2) [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Dessalles R, Fromion V, Robert P. 2020. Models of protein production along the cell cycle: an investigation of possible sources of noise. PLoS ONE 15, e0226016. ( 10.1371/journal.pone.0226016) [DOI] [PMC free article] [PubMed] [Google Scholar]
42.Kurtz TG. 1978. Strong approximation theorems for density dependent Markov chains. Stoch. Process Their. Appl. 6, 223-240. ( 10.1016/0304-4149(78)90020-0) [DOI] [Google Scholar]
43.Elf J, Ehrenberg M. 2003. Fast evaluation of fluctuations in biochemical networks with the linear noise approximation. Genome Res. 13, 2475-2484. ( 10.1101/gr.1196503) [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Thomas P, Matuschek H, Grima R. 2012. Computation of biochemical pathway fluctuations beyond the linear noise approximation using iNA. In 2012 IEEE Int. Conf. on Bioinformatics and Biomedicine, Philadelphia, PA, USA, 4–7 October 2012. pp. 1–5. New York, NY: IEEE.
45.Osella M, Nugent E, Lagomarsino MC. 2014. Concerted control of Escherichia coli cell division. Proc. Natl Acad. Sci. USA 111, 3431-3435. ( 10.1073/pnas.1313715111) [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Tanouchi Y, Pai A, Park H, Huang S, Stamatov R, Buchler NE, You L. 2015. A noisy linear map underlies oscillations in cell size and gene expression in bacteria. Nature 523, 357-360. ( 10.1038/nature14562) [DOI] [PMC free article] [PubMed] [Google Scholar]
47.Sauls JT, Li D, Jun S. 2016. Adder and a coarse-grained approach to cell size homeostasis in bacteria. Curr. Opin. Cell Biol. 38, 38-44. ( 10.1016/j.ceb.2016.02.004) [DOI] [PMC free article] [PubMed] [Google Scholar]
48.Voliotis M, Thomas P, Grima R, Bowsher CG. 2016. Stochastic simulation of biomolecular networks in dynamic environments. PLoS Comput. Biol. 12, e1004923. ( 10.1371/journal.pcbi.1004923) [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Taheri-Araghi S, Bradde S, Sauls JT, Hill NS, Levin PA, Paulsson J, Vergassola M, Jun S. 2015. Cell-size control and homeostasis in bacteria. Curr. Biol. 25, 385-391. ( 10.1016/j.cub.2014.12.009) [DOI] [PMC free article] [PubMed] [Google Scholar]
50.Thomas P. 2018. Analysis of cell size homeostasis at the single-cell and population level. Front. Phys. 6, 64. ( 10.3389/fphy.2018.00064) [DOI] [Google Scholar]
51.Martins BM, Tooke AK, Thomas P, Locke JC. 2018. Cell size control driven by the circadian clock and environment in cyanobacteria. Proc. Natl Acad. Sci. USA 115, E11415-E11424. ( 10.1073/pnas.1811309115) [DOI] [PMC free article] [PubMed] [Google Scholar]
52.Priestman M, Thomas P, Robertson BD, Shahrezaei V. 2017. Mycobacteria modify their cell size control under sub-optimal carbon sources. Front. Cell Dev. Biol. 5, 64. ( 10.3389/fcell.2017.00064) [DOI] [PMC free article] [PubMed] [Google Scholar]
53.Peccoud J, Ycart B. 1995. Markovian modeling of gene-product synthesis. Theor. Popul. Biol. 48, 222-234. ( 10.1006/tpbi.1995.1027) [DOI] [Google Scholar]
54.Shahrezaei V, Swain PS. 2008. Analytical distributions for stochastic gene expression. Proc. Natl Acad. Sci. USA 105, 17 256-17 261. ( 10.1073/pnas.0803850105) [DOI] [PMC free article] [PubMed] [Google Scholar]
55.Bokes P, King JR, Wood AT, Loose M. 2012. Exact and approximate distributions of protein and mRNA levels in the low-copy regime of gene expression. J. Math. Biol. 64, 829-854. ( 10.1007/s00285-011-0433-5) [DOI] [PubMed] [Google Scholar]
56.Choudhary K, Narang A. 2020. Urn models for stochastic gene expression yield intuitive insights into the probability distributions of single-cell mRNA and protein counts. Phys. Biol. 17, 066001. ( 10.1088/1478-3975/aba50f) [DOI] [PubMed] [Google Scholar]
57.Ham L, Schnoerr D, Brackston RD, Stumpf MP. 2020. Exactly solvable models of stochastic gene expression. J. Chem. Phys. 152, 144106. ( 10.1063/1.5143540) [DOI] [PubMed] [Google Scholar]
58.Maclean F, Munson R. 1961. Some environmental factors affecting the length of Escherichia coli organisms in continuous cultures. Microbiology 25, 17-27. ( 10.1099/00221287-25-1-17) [DOI] [PubMed] [Google Scholar]
59.Koch A, Schaechter M. 1962. A model for statistics of the cell division process. Microbiology 29, 435-454. ( 10.1099/00221287-29-3-435) [DOI] [PubMed] [Google Scholar]
60.Lu T, Volfson D, Tsimring L, Hasty J. 2004. Cellular growth and division in the Gillespie algorithm. IET Syst. Biol. 1, 121-128. ( 10.1049/sb:20045016) [DOI] [PubMed] [Google Scholar]
61.Van Heerden JH, Kempe H, Doerr A, Maarleveld T, Nordholt N, Bruggeman FJ. 2017. Statistics and simulation of growth of single bacterial cells: illustrations with B. subtilis and E. coli. Sci. Rep. 7, 1-11. ( 10.1038/s41598-017-15895-4) [DOI] [PMC free article] [PubMed] [Google Scholar]
62.Blanco C, Nieto C, Vargas C, Pedraza J. 2020. PyEcoLib: a python library for simulating E. coli stochastic size dynamics. bioRxiv 319152. ( 10.1101/2020.09.29.319152) [DOI] [Google Scholar]
63.Beentjes CH, Perez-Carrasco R, Grima R. 2020. Exact solution of stochastic gene expression models with bursting, cell cycle and replication dynamics. Phys. Rev. E 101, 032403. ( 10.1103/PhysRevE.101.032403) [DOI] [PubMed] [Google Scholar]
64.Schnoerr D, Sanguinetti G, Grima R. 2017. Approximation and inference methods for stochastic biochemical kinetics—a tutorial review. J. Phys. A 50, 093001. ( 10.1088/1751-8121/aa54d9) [DOI] [Google Scholar]
65.Kuntz J, Thomas P, Stan GB, Barahona M. 2019. Bounding the stationary distributions of the chemical master equation via mathematical programming. J. Chem. Phys 151, 034109. ( 10.1063/1.5100670) [DOI] [PubMed] [Google Scholar]
66.Nozoe T, Kussell E, Wakamoto Y. 2017. Inferring fitness landscapes and selection on phenotypic states from single-cell genealogical data. PLoS Genet. 13, e1006653. ( 10.1371/journal.pgen.1006653) [DOI] [PMC free article] [PubMed] [Google Scholar]
67.Ciechonska M, Sturrock M, Grob A, Larrouy-Maumus G, Shahrezaei V, Isalan M. 2020. Ohm’s Law for increasing fitness gene expression with selection pressure. bioRxiv 693234. ( 10.1101/693234) [DOI] [Google Scholar]
68.Gardiner C. 2009. Stochastic methods, vol. 4. Berlin, Germany: Springer. [Google Scholar]
69.Kuntz J, Thomas P, Stan GB, Barahona M. 2021. Stationary distributions of continuous-time Markov chains: a review of theory and truncation-based approximations. SIAM Rev. 63, 3-64. ( 10.1137/19M1289625) [DOI] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

An implementation of the First-Division Algorithm (box 1) in Julia is available at github.com/pthomaslab/fda.

[RSIF20210274C1] 1.Kiviet DJ, Nghe P, Walker N, Boulineau S, Sunderlikova V, Tans SJ. 2014. Stochasticity of metabolism and growth at the single-cell level. Nature 514, 376-379. ( 10.1038/nature13582) [DOI] [PubMed] [Google Scholar]

[RSIF20210274C2] 2.Bruggeman FJ, Teusink B. 2018. Living with noise: on the propagation of noise from molecules to phenotype and fitness. Curr. Opin. Syst. Biol. 8, 144-150. ( 10.1016/j.coisb.2018.02.010) [DOI] [Google Scholar]

[RSIF20210274C3] 3.Thomas P, Terradot G, Danos V, Weiße AY. 2018. Sources, propagation and consequences of stochasticity in cellular growth. Nat. Commun. 9, 1-11. ( 10.1038/s41467-017-02088-w) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C4] 4.Bertaux F, Marguerat S, Shahrezaei V. 2018. Division rate, cell size and proteome allocation: impact on gene expression noise and implications for the dynamics of genetic circuits. R. Soc. Open. Sci. 5, 172234. ( 10.1098/rsos.172234) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C5] 5.Vargas-Garcia CA, Ghusinga KR, Singh A. 2018. Cell size control and gene expression homeostasis in single-cells. Curr. Opin. Syst. Biol. 8, 109-116. ( 10.1016/j.coisb.2018.01.002) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C6] 6.Van Kampen NG. 1992. Stochastic processes in physics and chemistry, vol. 1. Amsterdam, The Netherlands: Elsevier. [Google Scholar]

[RSIF20210274C7] 7.Gillespie DT. 1977. Exact stochastic simulation of coupled chemical reactions. J. Phys. Chem. 81, 2340-2361. ( 10.1021/j100540a008) [DOI] [Google Scholar]

[RSIF20210274C8] 8.Gillespie DT. 1992. A rigorous derivation of the chemical master equation. Physica A 188, 404-425. ( 10.1016/0378-4371(92)90283-V) [DOI] [Google Scholar]

[RSIF20210274C9] 9.McAdams HH, Arkin A. 1997. Stochastic mechanisms in gene expression. Proc. Natl Acad. Sci. USA 94, 814-819. ( 10.1073/pnas.94.3.814) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C10] 10.Thattai M, Van Oudenaarden A. 2001. Intrinsic noise in gene regulatory networks. Proc. Natl Acad. Sci. USA 98, 8614-8619. ( 10.1073/pnas.151588598) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C11] 11.Swain PS, Elowitz MB, Siggia ED. 2002. Intrinsic and extrinsic contributions to stochasticity in gene expression. Proc. Natl Acad. Sci. USA 99, 12 795-12 800. ( 10.1073/pnas.162041399) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C12] 12.Lei X, Tian W, Zhu H, Chen T, Ao P. 2015. Biological sources of intrinsic and extrinsic noise in cI expression of lysogenic phage lambda. Sci. Rep. 5, 13597. ( 10.1038/srep13597) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C13] 13.Tonn MK, Thomas P, Barahona M, Oyarzún DA. 2019. Stochastic modelling reveals mechanisms of metabolic heterogeneity. Commun. Biol. 2, 1-9. ( 10.1038/s42003-018-0242-0) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C14] 14.Ingalls BP. 2013. Mathematical modeling in systems biology: an introduction. New York, NY: MIT press. [Google Scholar]

[RSIF20210274C15] 15.Charlebois DA, Balázsi G. 2019. Modeling cell population dynamics. In Silico Biol. 13, 21-39. ( 10.3233/ISB-180470) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C16] 16.Lin J, Amir A. 2018. Homeostasis of protein and mRNA concentrations in growing cells. Nat. Commun. 9, 1-11. ( 10.1038/s41467-017-02088-w) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C17] 17.Sun XM et al. 2020. Size-dependent increase in RNA polymerase II initiation rates mediates gene expression scaling with cell size. Curr. Biol. 30, 1217-1230. ( 10.1016/j.cub.2020.01.053) [DOI] [PubMed] [Google Scholar]

[RSIF20210274C18] 18.Hilfinger A, Paulsson J. 2011. Separating intrinsic from extrinsic fluctuations in dynamic biological systems. Proc. Natl Acad. Sci. USA 108, 12 167-12 172. ( 10.1073/pnas.1018832108) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C19] 19.Bowsher CG, Swain PS. 2012. Identifying sources of variation and the flow of information in biochemical networks. Proc. Natl Acad. Sci. USA 109, E1320-E1328. ( 10.1073/pnas.1119407109) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C20] 20.Kempe H, Schwabe A, Crémazy F, Verschure PJ, Bruggeman FJ. 2015. The volumes and transcript counts of single cells reveal concentration homeostasis and capture biological noise. Mol. Biol. Cell 26, 797-804. ( 10.1091/mbc.E14-08-1296) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C21] 21.Blasi T, Buettner F, Strasser MK, Marr C, Theis FJ. 2017. cgCorrect: a method to correct for confounding cell–cell variation due to cell growth in single-cell transcriptomics. Phys. Biol. 14, 036001. ( 10.1088/1478-3975/aa609a) [DOI] [PubMed] [Google Scholar]

[RSIF20210274C22] 22.Nordholt N, Van Heerden J, Kort R, Bruggeman FJ. 2017. Effects of growth rate and promoter activity on single-cell protein expression. Sci. Rep. 7, 1-11. ( 10.1038/s41598-017-05871-3) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C23] 23.Kafri R, Levy J, Ginzberg MB, Oh S, Lahav G, Kirschner MW. 2013. Dynamics extracted from fixed cells reveal feedback linking cell growth to cell cycle. Nature 494, 480-483. ( 10.1038/nature11897) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C24] 24.Kuritz K, Stöhr D, Maichl DS, Pollak N, Rehm M, Allgöwer F. 2020. Reconstructing temporal and spatial dynamics from single-cell pseudotime using prior knowledge of real scale cell densities. Sci. Rep. 10, 1-10. ( 10.1038/s41598-020-60400-z) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C25] 25.Zopf C, Quinn K, Zeidman J, Maheshri N. 2013. Cell-cycle dependence of transcription dominates noise in gene expression. PLoS Comput. Biol. 9, e1003161. ( 10.1371/journal.pcbi.1003161) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C26] 26.Walker N, Nghe P, Tans SJ. 2016. Generation and filtering of gene expression noise by the bacterial cell cycle. BMC Biol. 14, 11. ( 10.1186/s12915-016-0231-z) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C27] 27.Schwabe A, Bruggeman FJ. 2014. Contributions of cell growth and biochemical reactions to nongenetic variability of cells. Biophys. J. 107, 301-313. ( 10.1016/j.bpj.2014.05.004) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C28] 28.Antunes D, Singh A. 2015. Quantifying gene expression variability arising from randomness in cell division times. J. Math. Biol. 71, 437-463. ( 10.1007/s00285-014-0811-x) [DOI] [PubMed] [Google Scholar]

[RSIF20210274C29] 29.Soltani M, Vargas-Garcia CA, Antunes D, Singh A. 2016. Intercellular variability in protein levels from stochastic expression and noisy cell cycle processes. PLoS Comput. Biol. 12, e1004972. ( 10.1371/journal.pcbi.1004972) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C30] 30.Johnston IG, Gaal B, Enver T, Iborra FJ, Jones NS. 2012. Mitochondrial variability as a source of extrinsic cellular noise. PLoS Comput. Biol. 8, e1002416. ( 10.1371/journal.pcbi.1002416) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C31] 31.Luo R, Ye L, Tao C, Wang K. 2013. Simulation of E. coli gene regulation including overlapping cell cycles, growth, division, time delays and noise. PLoS ONE 8, e62380. ( 10.1371/journal.pone.0062380) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C32] 32.Gomez D, Marathe R, Bierbaum V, Klumpp S. 2014. Modeling stochastic gene expression in growing cells. J. Theor. Biol. 348, 1-11. ( 10.1016/j.jtbi.2014.01.017) [DOI] [PubMed] [Google Scholar]

[RSIF20210274C33] 33.Johnston IG, Jones NS. 2015. Closed-form stochastic solutions for non-equilibrium dynamics and inheritance of cellular components over many cell divisions. Proc. R. Soc. A 471, 20150050. ( 10.1098/rspa.2015.0050) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C34] 34.Nieto CA, Garcia CAV, Sanchez C, Arias-Castro JC, Pedraza JM. 2020. Correlation between protein concentration and bacterial cell size can reveal strategies of gene expression. Phys. Biol. 17, 045002. ( 10.1088/1478-3975/ab891c) [DOI] [PubMed] [Google Scholar]

[RSIF20210274C35] 35.Perez-Carrasco R, Beentjes C, Grima R. 2020. Effects of cell cycle variability on lineage and population measurements of messenger RNA abundance. J. R. Soc. Interface 17, 20200360. ( 10.1098/rsif.2020.0360) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C36] 36.Berg OG. 1978. A model for the statistical fluctuations of protein numbers in a microbial population. J. Theor. Biol. 71, 587-603. ( 10.1016/0022-5193(78)90326-0) [DOI] [PubMed] [Google Scholar]

[RSIF20210274C37] 37.Brenner N, Farkash K, Braun E. 2006. Dynamics of protein distributions in cell populations. Phys. Biol. 3, 172. ( 10.1088/1478-3975/3/3/002) [DOI] [PubMed] [Google Scholar]

[RSIF20210274C38] 38.Thomas P. 2017. Making sense of snapshot data: ergodic principle for clonal cell populations. J. R. Soc. Interface 14, 20170467. ( 10.1098/rsif.2017.0467) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C39] 39.Thomas P. 2019. Intrinsic and extrinsic noise of gene expression in lineage trees. Sci. Rep. 9, 474. ( 10.1038/s41598-018-35927-x) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C40] 40.Jedrak J, Ochab-Marcinek A. 2020. Contributions to the ‘noise floor’ in gene expression in a population of dividing cells. Sci. Rep. 10, 1-13. ( 10.1038/s41598-020-69217-2) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C41] 41.Dessalles R, Fromion V, Robert P. 2020. Models of protein production along the cell cycle: an investigation of possible sources of noise. PLoS ONE 15, e0226016. ( 10.1371/journal.pone.0226016) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C42] 42.Kurtz TG. 1978. Strong approximation theorems for density dependent Markov chains. Stoch. Process Their. Appl. 6, 223-240. ( 10.1016/0304-4149(78)90020-0) [DOI] [Google Scholar]

[RSIF20210274C43] 43.Elf J, Ehrenberg M. 2003. Fast evaluation of fluctuations in biochemical networks with the linear noise approximation. Genome Res. 13, 2475-2484. ( 10.1101/gr.1196503) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C44] 44.Thomas P, Matuschek H, Grima R. 2012. Computation of biochemical pathway fluctuations beyond the linear noise approximation using iNA. In 2012 IEEE Int. Conf. on Bioinformatics and Biomedicine, Philadelphia, PA, USA, 4–7 October 2012. pp. 1–5. New York, NY: IEEE.

[RSIF20210274C45] 45.Osella M, Nugent E, Lagomarsino MC. 2014. Concerted control of Escherichia coli cell division. Proc. Natl Acad. Sci. USA 111, 3431-3435. ( 10.1073/pnas.1313715111) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C46] 46.Tanouchi Y, Pai A, Park H, Huang S, Stamatov R, Buchler NE, You L. 2015. A noisy linear map underlies oscillations in cell size and gene expression in bacteria. Nature 523, 357-360. ( 10.1038/nature14562) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C47] 47.Sauls JT, Li D, Jun S. 2016. Adder and a coarse-grained approach to cell size homeostasis in bacteria. Curr. Opin. Cell Biol. 38, 38-44. ( 10.1016/j.ceb.2016.02.004) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C48] 48.Voliotis M, Thomas P, Grima R, Bowsher CG. 2016. Stochastic simulation of biomolecular networks in dynamic environments. PLoS Comput. Biol. 12, e1004923. ( 10.1371/journal.pcbi.1004923) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C49] 49.Taheri-Araghi S, Bradde S, Sauls JT, Hill NS, Levin PA, Paulsson J, Vergassola M, Jun S. 2015. Cell-size control and homeostasis in bacteria. Curr. Biol. 25, 385-391. ( 10.1016/j.cub.2014.12.009) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C50] 50.Thomas P. 2018. Analysis of cell size homeostasis at the single-cell and population level. Front. Phys. 6, 64. ( 10.3389/fphy.2018.00064) [DOI] [Google Scholar]

[RSIF20210274C51] 51.Martins BM, Tooke AK, Thomas P, Locke JC. 2018. Cell size control driven by the circadian clock and environment in cyanobacteria. Proc. Natl Acad. Sci. USA 115, E11415-E11424. ( 10.1073/pnas.1811309115) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C52] 52.Priestman M, Thomas P, Robertson BD, Shahrezaei V. 2017. Mycobacteria modify their cell size control under sub-optimal carbon sources. Front. Cell Dev. Biol. 5, 64. ( 10.3389/fcell.2017.00064) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C53] 53.Peccoud J, Ycart B. 1995. Markovian modeling of gene-product synthesis. Theor. Popul. Biol. 48, 222-234. ( 10.1006/tpbi.1995.1027) [DOI] [Google Scholar]

[RSIF20210274C54] 54.Shahrezaei V, Swain PS. 2008. Analytical distributions for stochastic gene expression. Proc. Natl Acad. Sci. USA 105, 17 256-17 261. ( 10.1073/pnas.0803850105) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C55] 55.Bokes P, King JR, Wood AT, Loose M. 2012. Exact and approximate distributions of protein and mRNA levels in the low-copy regime of gene expression. J. Math. Biol. 64, 829-854. ( 10.1007/s00285-011-0433-5) [DOI] [PubMed] [Google Scholar]

[RSIF20210274C56] 56.Choudhary K, Narang A. 2020. Urn models for stochastic gene expression yield intuitive insights into the probability distributions of single-cell mRNA and protein counts. Phys. Biol. 17, 066001. ( 10.1088/1478-3975/aba50f) [DOI] [PubMed] [Google Scholar]

[RSIF20210274C57] 57.Ham L, Schnoerr D, Brackston RD, Stumpf MP. 2020. Exactly solvable models of stochastic gene expression. J. Chem. Phys. 152, 144106. ( 10.1063/1.5143540) [DOI] [PubMed] [Google Scholar]

[RSIF20210274C58] 58.Maclean F, Munson R. 1961. Some environmental factors affecting the length of Escherichia coli organisms in continuous cultures. Microbiology 25, 17-27. ( 10.1099/00221287-25-1-17) [DOI] [PubMed] [Google Scholar]

[RSIF20210274C59] 59.Koch A, Schaechter M. 1962. A model for statistics of the cell division process. Microbiology 29, 435-454. ( 10.1099/00221287-29-3-435) [DOI] [PubMed] [Google Scholar]

[RSIF20210274C60] 60.Lu T, Volfson D, Tsimring L, Hasty J. 2004. Cellular growth and division in the Gillespie algorithm. IET Syst. Biol. 1, 121-128. ( 10.1049/sb:20045016) [DOI] [PubMed] [Google Scholar]

[RSIF20210274C61] 61.Van Heerden JH, Kempe H, Doerr A, Maarleveld T, Nordholt N, Bruggeman FJ. 2017. Statistics and simulation of growth of single bacterial cells: illustrations with B. subtilis and E. coli. Sci. Rep. 7, 1-11. ( 10.1038/s41598-017-15895-4) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C62] 62.Blanco C, Nieto C, Vargas C, Pedraza J. 2020. PyEcoLib: a python library for simulating E. coli stochastic size dynamics. bioRxiv 319152. ( 10.1101/2020.09.29.319152) [DOI] [Google Scholar]

[RSIF20210274C63] 63.Beentjes CH, Perez-Carrasco R, Grima R. 2020. Exact solution of stochastic gene expression models with bursting, cell cycle and replication dynamics. Phys. Rev. E 101, 032403. ( 10.1103/PhysRevE.101.032403) [DOI] [PubMed] [Google Scholar]

[RSIF20210274C64] 64.Schnoerr D, Sanguinetti G, Grima R. 2017. Approximation and inference methods for stochastic biochemical kinetics—a tutorial review. J. Phys. A 50, 093001. ( 10.1088/1751-8121/aa54d9) [DOI] [Google Scholar]

[RSIF20210274C65] 65.Kuntz J, Thomas P, Stan GB, Barahona M. 2019. Bounding the stationary distributions of the chemical master equation via mathematical programming. J. Chem. Phys 151, 034109. ( 10.1063/1.5100670) [DOI] [PubMed] [Google Scholar]

[RSIF20210274C66] 66.Nozoe T, Kussell E, Wakamoto Y. 2017. Inferring fitness landscapes and selection on phenotypic states from single-cell genealogical data. PLoS Genet. 13, e1006653. ( 10.1371/journal.pgen.1006653) [DOI] [PMC free article] [PubMed] [Google Scholar]

[RSIF20210274C67] 67.Ciechonska M, Sturrock M, Grob A, Larrouy-Maumus G, Shahrezaei V, Isalan M. 2020. Ohm’s Law for increasing fitness gene expression with selection pressure. bioRxiv 693234. ( 10.1101/693234) [DOI] [Google Scholar]

[RSIF20210274C68] 68.Gardiner C. 2009. Stochastic methods, vol. 4. Berlin, Germany: Springer. [Google Scholar]

[RSIF20210274C69] 69.Kuntz J, Thomas P, Stan GB, Barahona M. 2021. Stationary distributions of continuous-time Markov chains: a review of theory and truncation-based approximations. SIAM Rev. 63, 3-64. ( 10.1137/19M1289625) [DOI] [Google Scholar]

PERMALINK

Coordination of gene expression noise with cell size: analytical results for agent-based models of growing cell populations

Philipp Thomas

Vahid Shahrezaei

Abstract

1. Introduction

Figure 1.

2. Methods

2.1. Effective dilution models, extrinsic noise models and the chemical master equation

2.1.1. Rate equation models and concentration homeostasis

2.1.2. Effective dilution model

2.1.3. Extrinsic noise model

2.1.4. Analytical solutions and noise decomposition

2.2. Agent-based modelling

Box 1. First-Division Algorithm for agent-based simulations of size-dependent gene regulatory networks.

2.2.1. Master equation for agent-based populations

2.2.2. Cell size distribution

2.2.3. Molecule number distributions for cells of a certain size

3. Results

3.1. The effective dilution model is valid for reaction networks with stochastic concentration homeostasis

Figure 2.

3.2. The EDM approximates the mean concentrations of ABMs lacking SCH

3.3. Scaling of fluctuations with size in individual cells manifests the breakdown of the EDM lacking SCH

Figure 3.

3.3.1. Effect of cell size control on gene expression dynamics

Figure 6.

Figure 4.

3.4. ENMs provide surprisingly accurate approximations of total noise in ABMs lacking SCH

Figure 5.

4. Discussion

Appendix A. Stochastic concentration homeostasis and the validity of EDMs and ENMs

Definition A.1 (Stochastic concentration homeostasis). —

Theorem A.2. —

Appendix B. Proof of theorem A.2

B.1. Step 1: Distributions invariant of birth size

B.2. Step 2: Transformation into an effective dilution model

B.3. Step 3: SCH and the necessity of binomial partitioning

Appendix C. Approximation of birth size moments

C.1. Large cell asymptotics

C.2. Small cell asymptotics

C.3. Global asymptotics

Appendix D. Analytical solutions and error bounds using the linear noise approximation for deterministic cell division

Endnote

Data accessibility

Authors' contributions

Competing interests

Funding

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases