Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2017 Aug 10.
Published in final edited form as: Phys Rev Lett. 2017 Jan 12;118(2):028102. doi: 10.1103/PhysRevLett.118.028102

Suppression of Beneficial Mutations in Dynamic Microbial Populations

Philip Bittihn 1,2,*, Jeff Hasty 1,2,3,4, Lev S Tsimring 1,2
PMCID: PMC5552243  NIHMSID: NIHMS886950  PMID: 28128631

Abstract

Quantitative predictions for the spread of mutations in bacterial populations are essential to interpret evolution experiments and to improve the stability of synthetic gene circuits. We derive analytical expressions for the suppression factor for beneficial mutations in populations that undergo periodic dilutions, covering arbitrary population sizes, dilution factors, and growth advantages in a single stochastic model. We find that the suppression factor grows with the dilution factor and depends nontrivially on the growth advantage, resulting in the preferential elimination of mutations with certain growth advantages. We confirm our results by extensive numerical simulations.


The fixation of random mutations is the driving force of evolutionary adaptation. Mutations can also be problematic, e.g., in synthetic biology, where long-term stability is required to reliably and safely translate more than a decade of circuit design to in vivo or industrial settings, but disabling a synthetic gene network is often beneficial to the cell [1]. Maintaining a bacterial strain over long times in a lab setting to study its evolution [2,3] requires enforcing certain population dynamics. To be able to interpret the results and build quantitative models, it is therefore essential to understand how these dynamics themselves alter the impact of beneficial mutations.

We consider the most widely used protocol, serial passage [3], which is characterized by phases of exponential growth alternating with strong reductions in population size (“bottlenecks”). A constant population size maintained, e.g., in a turbidostat [4] or in microfluidic traps [5] serves as the reference scenario for which theoretical results are well known [69]. These established results were recently extended to include populations that vary in size [10,11], transmission phases [12], or clonal interference [13]. While repeated pruning of an exponentially growing population was considered before [1417], closed-form predictions currently only exist for certain limiting cases and, as we show below, their range of applicability is even more limited than previously thought. Therefore, a complete and consistent picture of the fixation process during serial passage is still lacking. Using two complementary approaches for a single evolutionary model, we derive closed-form analytical expressions which provide a quantitative characterization of mutant fixation for arbitrary dilution factors, population sizes, and selective advantages that agrees with direct numerical simulations.

We use a stochastic model of division by binary fission:

Xα(1-μ)2X;XαμX+Y;Y(1+s)α2Y. (1)

X and Y represent wild-type and mutant cells, respectively. To obtain analytical results, memoryless reactions are assumed, resulting in exponentially distributed division times. However, we will later extend some of our results to more realistic distributions. In Eq. (1), α is the wild-type division rate, μ ≤ 1 is the mutation probability upon division, and s ≥ 0 is the growth rate change of the mutant. For a constant population with Nc individuals, a random individual is removed after each division (Moran process). In dynamic populations, cells divide freely for some time T, then the population is pruned (“diluted”) to a fixed number of survivors Ns and the cycle repeats. The latter resembles subculturing in fresh growth medium in the serial passage protocol. The survival probability is assumed to be equal for all cells. On average, the population size before dilution is fNs, where f = exp(αT) is the dilution factor. We use Nc = Ns(f − 1)/log f, rounded to the nearest integer, to achieve approximately the same time-averaged population size in both cases (for μ = 0).

Typical trajectories of the model are depicted in Figs. 1(a) and 1(b). The fixation time τ is defined as the time until the population consists of only mutants. We numerically computed the average fixation times τc and τd for a constant population and the dynamic protocol, respectively [see Fig. 1(c)]. Figure 1(d) shows that τd > τc across all s, meaning that the dynamic population can withstand the evolutionary pressure of beneficial mutations longer.

FIG. 1.

FIG. 1

Typical trajectories of the model Eq. (1) for (a) a dynamic population undergoing repeated pruning and (b) a constant population. Parameters are α = 1, Ns = 10, T = log 10, μ = 10−3, resulting in f = 10, Nc = 39. (c) Fixation times in constant (crosses) and dynamic (squares) populations from 10 000 stochastic simulations using an accelerated algorithm [18]. (d) Fixation time ratio. Solid lines in (c) and (d) indicate exact numerical values from Markov models for a constant population and a population pruned when reaching a fixed size fNs.

Below, we will calculate the fixation probability p of a single mutation under the influence of the above population dynamics. If μ is sufficiently small, there is a direct correspondence between τ and p: Based on the idea of the slow-scale stochastic simulation algorithm [19], the mutation rate can be approximated by αμnX¯ [20], where nX¯ is the time-averaged number of wild-type cells for μ = 0. Since only a fraction p of mutations becomes fixed eventually, the average fixation time is τ=(αμnX¯p)-1, implying τd/τc = pc/pd.

We will first use a diffusion approach to characterize p for s close to zero and then employ a recently developed branching process approach for larger s.

Diffusion approximation

This approach was initially developed by Kimura [9] and is valid for weakly beneficial mutations when the fixation process is dominated by genetic drift. In contrast to Wahl and co-workers [14,15], we will consider all contributions to stochastic fluctuations, including cell division, and model random selection upon dilution with the exact hypergeometric distribution. Let Mδy(y) and Vδy(y) be the mean and variance, respectively, of the change of the fraction of mutants from the current generation to the next, given that the current fraction of mutants is y = nY/(nX + nY). Then, with the definition G(y)=exp[-20yMδy(y)/Vδy(y)dy], the probability of fixation u(y) is given by u(y)=0yG(y)dy/01G(y)dy. Let Λ be the Taylor expansion of Mδy(y)/Vδy(y) near s = 0 up to the order of s chosen such that Λ is independent of y. Then, the fixation probability for an initial fraction of mutants y is

u(y)=1-exp(-2Λy)1-exp(-2Λ). (2)

For dynamic populations, we define y as the fraction of mutants at the beginning of each cycle. Therefore, Mδy(y)/Vδy(y) describes the effect of one growth cycle and subsequent pruning. Mutations occur at any time during the growth phase, implying that the initial fraction of mutants y for Eq. (2) (i.e., at the beginning of the cycle following the mutation’s introduction) is a random variable. We accommodate this by approximating pdud(ȳd), where ȳd is the average initial fraction of mutants. Hence, estimating pd amounts to calculating Λd and ȳd.

To obtain Mδy(y)/Vδy(y) for dynamic populations and subsequently Λd, we note that, for the growth phase, the wild-type and mutant subpopulations are described by simple birth processes which start with (1 − y0)Ns and y0Ns individuals, respectively. At the end of a cycle, t = T, the mean and variance for a population starting with N0 individuals and a division rate λ are

Mλ(N0)=N0exp(λT), (3a)
Vλ(N0)=ξ2N0exp(λT)[exp(λT)-1]. (3b)

ξ2 will allow us later to scale the stochastic fluctuations during growth, but we will initially evaluate only the case ξ2 = 1, which corresponds to the model Eq. (1).

As dilution does not, on average, alter the fraction of mutants, Eq. (3) can be used directly to obtain Mδy, whereas for Vδy, Eq. (3) is combined with the variance of the hypergeometric distribution for the dilution event [20]. A first-order expansion of Mδy/Vδy around s = 0 then leads to Λd ≈ 2s(Nsξ2)f log f/[(f − 1)(1 + ξ2)]. To estimate ȳd, we consider a mutant subpopulation that first appears at time θ into a cycle and initially consists of ms individuals. For the model Eq. (1), ms = 1, since the second reaction produces a single mutant cell. By the end of the initial cycle, the mutant subpopulation will have grown for a time Tθ to a size larger than ms. As might be intuitive (and can be shown explicitly [20]), the probability distribution of θ is pmut(θ)=exp(αθ)/0Texp(αθ)dθ, i.e., proportional to the average rate of division events at a given time θ within a cycle. Using pmut(θ), we obtain the average sizes of the wild-type and mutant subpopulations for random θ as weighted averages of Eq. (3b) and estimate the average initial fraction of mutants as limNs→∞Nsȳd = [ms(fs − 1)/s(f − 1)], independent of ξ2 [20].

Substituting ȳd and Λd into Eq. (2), we obtain

pd=1-exp(-2ms1+ξ2(Ns-ξ2)flogf(fs-1)Ns(f-1)2)1-exp(-21+ξ2(Ns-ξ2)flogff-1s). (4)

Equation (4) can also be used for the constant population case by replacing NsNc and taking the limit f → 1, so it reduces to

pc=1-exp[-2mss/(1+ξ2)]1-exp[-2Ncs/(1+ξ2)]. (5)

Note that a more accurate approximation of pc can be obtained by calculating Λc directly [20]. For ξ2 = 1 and ms = 1, the theoretical estimates Eqs. (4) and (5) are plotted in Fig. 2(a). As expected, the theory matches the numerical data towards s = 0. The accuracy is remarkable considering it being a continuous approximation of a discrete process in small populations and the usage of a large-Ns approximation for ȳd. A Taylor expansion of τd/τc = pc/pd around s = 0 yields (for large Ns)

FIG. 2.

FIG. 2

(a) Fixation probabilities from numerical simulations (symbols) compared to diffusion approximation, Eqs. (5) and (4) (lines). (b) Numerical τd/τc (symbols) and diffusion approximation (lines). Colored dashed lines show initial slope at s = 0 according to Eq. (6); gray dashed line is the asymptotic ratio Δ, Eq. (7). For all data in (a) and (b) f = 20, ξ2 = 1, ms = 1. (c),(d) Numerical p and τd/τc (symbols) compared to branching process approximation, Eqs. (9) and (11) (lines). The y intercept in (d) is also Δ.

τd/τc1+sNs(f-1)[1-Δ-1(f)](1+ξ2)logf+O(s2), (6)

with Δ(f) = [(f − 1)/log f]2/f. For any f > 1, Δ is larger than 1, so the slope of τd/τc at s = 0 is positive. Thus, periodic dilutions do not impact neutral mutations as expected, while beneficial mutations are suppressed by a factor that grows with s in the vicinity of s = 0. The slope of τd/τc increases with Ns, in agreement with Fig. 2(b).

First taking the limit Ns → ∞ and subsequently considering small s, we obtain pd ≈ [2ms/(1 + ξ2)]Δ−1s and pc ≈ [2ms/(1 + ξ2)]s from Eqs. (4) and (5), respectively, which corresponds to the only limit for which analytical results were previously available [15]. The factor by which mutations are suppressed by serial dilutions in this limit is, therefore,

lims0limNsτdτc=Δ(f), (7)

which is shown as a gray dashed line in Fig. 2(b). For any finite population size, there is a smooth transition of τd/τc towards Δ with the rate indicated by Eq. (6).

Branching process approximation

As a complementary approach, we employ the framework developed by Uecker and Hermisson [21]. Based on an inhomogeneous branching processes, they derived the following expression for the fixation probability:

p=2[1+0(λ+δ)(t)exp(-0t(λ-δ)(t)dt)dt]-1, (8)

where λ(t) and δ(t) are the per capita birth and death rates, respectively, of the mutant subpopulation in the “branching limit.” It implicitly assumes that stochastic fluctuations of the wild-type population size can be ignored and, thus, we do not expect this approximation to capture finite population size effects present for small s.

For a constant population, we have the per capita birth rate λc(t) = (1 + s)α for the mutant subpopulation. Mutant individuals are replaced by wild-type individuals when a wild-type individual is born with rate αnX and a mutant is chosen for removal with probability [nY/(nX + nY + 1)] ≈ (nY/nX). The per capita death rate is therefore δc(t) = α. Substitution into Eq. (8) yields

pc=s1+s. (9)

For the population undergoing serial dilutions, we assume small time intervals of length σT during which cells die with rate σ−1 log f, reducing the population size from fNs to Ns exactly in the branching limit. Assuming the mutation is introduced at time θ into a cycle, these “windows of death” occur at times ti = iTθ, i = 1, 2, …. Therefore, we have

λd(t)=(1+s)α, (10a)
δd(t)={logfσti<t<ti+σ,i=1,2,0otherwise. (10b)

Substituting these rates into Eq. (8) and taking the limit σ → 0 yields the fixation probability pd(θ) conditioned on the time of appearance θ [20]. By averaging over pmut(θ), we obtain the unconditional fixation probability

pd=1f-1[fF(f-11-f-s)-F(f-1f1+s-f)], (11)

where F(·) is defined using the hypergeometric function 2F1(a, b; c; z) as F(x)=2F1(1,1/(1+s);1+1/(1+s);−x). Figures 2(c) and 2(d) show a comparison of this theory with numerical simulations. As s → 0, pd converges to 0, but τd/τc = pc/pd approaches the finite value Δ, which is identical to the result derived earlier from the diffusion approach, Eq. (7), and therefore consistent with the implicit assumption of large populations. In contrast, for finite population sizes, only the diffusion approximation correctly captures the immediate vicinity of s = 0, where τd/τc → 1 [compare Figs. 2(b) and 2(d)].

Exponential division time distributions, which have zero mode, are unrealistic, because cells need time to mature before the next division. In reality, the division time distribution has a clear peak with a Fano factor smaller than 1 [22]. According to Eqs. (6) and (7), the initial increase of τd/τc should be faster for a less stochastic division process (i.e., smaller ξ2), while the plateau value Δ should not depend on ξ2.

To test these predictions, we consider an extension of Eq. (1), where the simple memoryless division is replaced by a process with k stages, which has been characterized in detail by Kendall [23]: The total division time d of, e.g., a wild-type cell is distributed according to 2kαd~X2k2. After individuals have established an equilibrium distribution across the k different stages, the population grows like exp[αk(21/k − 1)t], leading to deterministic growth ∝ 2αt as k → ∞. We use an adjusted growth rate of α[k(21/k − 1)]−1 in numerical simulations to maintain an effective population growth according to exp(αt), resulting in the same average population size for unchanged cycle lengths T. For a population of individuals starting in the first stage, the initial population growth is delayed, reducing the effective initial size of the mutant subpopulation from 1 to ms = 1/[2k(1 − 2−1/k)]. The variance of the population size is different from that of a memoryless division process by a factor of ξ2 ≈ [2(log 2)2/k].

Figure 3(a) shows numerical simulations for different k, along with the approximation of Eq. (6), substituting the changed value for ξ2. Note that this neglects the fact that division events in the mutant subpopulation are initially correlated. Nevertheless, there is good quantitative agreement with Eq. (6). Figure 3(b) shows that there are some quantitative differences for larger s, but, according to Fig. 3(d), for small s beyond the initial region of increase for finite population sizes [cf. Fig. 2(b)], τd/τc is indeed at most weakly dependent on k, as predicted by Eq. (7).

FIG. 3.

FIG. 3

Fixation probabilities and ratios in the multistage model. (a) τd/τc for small s for different k in numerical simulations (symbols). Lines indicate the slope predicted by Eq. (6) with ξ2 = 2(log 2)2/k. (b) τd/τc from numerical simulations for larger s. (c) pc and pd as functions of the dilution factor f. (d) τd/τc for the data shown in (c) compared to the analytical approximation, Eq. (7). Parameters are Ns = 50, f = 20 (a), (b) and Ns = 20, s = 0.2 (c), (d).

In this study, we have developed a complete analytical characterization of the fixation probability of beneficial mutations in exponentially growing populations with repeated bottlenecks, akin to serial passage. The most intriguing result is that the impact of serial passage on the fixation probability depends nontrivially on the growth rate change s, a novel effect not seen in the previously considered large-population, low-s limit, where all fixation probabilities were found to be proportional to s and, therefore, the ratio pc/pd is a constant [15,17]. Therefore, the experimental protocol acts as a filter which biases the distribution of selective advantages of fixed mutations with respect to a constant population or serial passage with different f. Our results provide quantitative predictions for three distinct regimes: Firstly, starting from pc/pd = 1 at s = 0 (no suppression of neutral mutations), the suppression factor increases gradually (and more quickly for larger Ns) as s increases, which is captured by the diffusion approach, Eqs. (4)(6). Secondly, in an intermediate regime, pc/pd reaches a plateau value Δ, Eq. (7), which only depends on the dilution factor f. Thirdly, towards large s, the fixation probability for the serial passage protocol slowly returns to that for a constant population, as described by the branching process approximation, Eq. (11).

To our knowledge, no previous analytical results existed for the first and third regime. For the plateau, we find Δ to be monotonic with respect to f [cf. Fig. 3(d)] and to approach 1 for f → 1, which is in contrast to the prediction of an optimal dilution factor f from the previously derived formula Δ′ = f/(log f)2 [15]. However, this earlier result used a binomial distribution for the dilution process, which is only a good approximation for large f [20], and, indeed, in this regime, Δ′ ≈ Δ. Our prediction is not only confirmed by full numerical simulations, but also intuitive as frequent but mild dilutions are experimentally indistinguishable from a constant population. Furthermore, as can be shown explicitly [20], it is consistent with Ref. [17].

Equation (6) reaches Δ at a selective advantage δs = {[(f − 1)(1 + ξ2)]/[fNs log f]}, providing an order-of-magnitude estimate for regimes of validity of Eq. (6) (sδs) versus Eq. (11) (sδs), which is particularly important for very small population sizes, when δs is large. For larger populations, the value of s at which Δ is attained is negligible compared to the region in which τd/τc ≈ Δ [cf. Fig. 2(b), Ns = 100], which indicates equal suppression of mutations conferring arbitrary moderate growth rate changes.

While, in reality, cell division and mutation are far more complex than described by the model Eq. (1), our results establish a baseline that can be used to gauge the influence of other effects. We confirmed that they hold qualitatively for more realistic division statistics and even quantitatively through the proxy parameters ξ2 and ms in the low-s regime (cf. Fig. 3). Generalizing the branching process approximation to take these statistics into account for larger s presents an interesting direction of future research. We also found that the exact periodicity of dilutions is not essential, as pruning at a fixed number of cells fNs leads to almost identical numerical results [20]. Another possible extension is to consider nonexponential growth, although a previous study found little effect for the specific case considered there [15].

Our quantitative analytical results provide a framework for the interpretation of evolution experiments involving serial passage by predicting how the experimental protocol itself can facilitate or suppress the fixation of mutations with certain selective advantages, which is a prerequisite for investigating the relation between population level adaptation and its molecular basis for other than neutral mutations [24]. They may also provide guidance for limiting the impact of undesired mutations in engineered bacteria by adjusting the experimental protocol or employing synthetic ecologies to shape their inherent population dynamics.

Supplementary Material

Supplementary Text

Acknowledgments

All authors acknowledge funding from NIH Grant No. R01-GM069811, from NSF Grant No. MCB-1121748 and partial support from the San Diego Center for Systems Biology, NIH Grant No. P50-GM085764. P. B. also acknowledges support from HFSP fellowship LT000840/2014-C.

References

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Text

RESOURCES