Scaling solution in the large population limit of the general asymmetric stochastic Luria-Delbrück evolution process

David A Kessler; Herbert Levine

doi:10.1007/s10955-014-1143-3

. Author manuscript; available in PMC: 2016 Feb 19.

Published in final edited form as: J Stat Phys. 2014 Nov 15;158(4):783–805. doi: 10.1007/s10955-014-1143-3

Scaling solution in the large population limit of the general asymmetric stochastic Luria-Delbrück evolution process

David A Kessler ¹, Herbert Levine ²

PMCID: PMC4760705 NIHMSID: NIHMS757845 PMID: 26900175

Abstract

One of the most popular models for quantitatively understanding the emergence of drug resistance both in bacterial colonies and in malignant tumors was introduced long ago by Luria and Delbrück. Here, individual resistant mutants emerge randomly during the birth events of an exponentially growing sensitive population. A most interesting limit of this process occurs when the population size N is large and mutation rates are low, but not necessarily small compared to 1/N. Here we provide a scaling solution valid in this limit, making contact with the theory of Levy α-stable distributions, in particular one discussed long ago by Landau. One consequence of this association is that moments of the distribution are highly misleading as far as characterizing typical behavior. A key insight that enables our solution is that working in the fixed population size ensemble is not the same as working in a fixed time ensemble. Some of our results have been presented previously in shortened form [11]

Keywords: Luria-Delbruck, mutants, growth, alpha-stable distribution

1 Introduction

In 1943, Luria and Delbrück [16] (LD hereafter) used a simple evolutionary model together with laboratory experiments to argue that bacterial resistance (to bacteriophage, in their specific case) arose via the selection of pre-existing mutations. Their model postulated a constant probability at which dividing normal cells would have a resistant daughter; these mutations were assumed to not affect the growth rate until the bacteria were challenged. The LD model and various extensions thereof have been used extensively in recent years to study the emergence of antibiotic-resistant microbes [14,15] as well as chemotherapy-resistant cancer cells [4,18,13,19,6]. Similar models are used to study the generation of differentiated cells from stem-like precursors [3].

There has been considerable effort in the mathematics and statistical physics communities devoted to understanding the probability distribution that arises from the LD process [10,5,1,20,8,9]. While early efforts focused on various quasi-deterministic versions of the model where both wild-type and mutant cells or alternatively just the wild-type are assumed to undergo continuous exponential growth, more recently interest has focussed on the fully stochastic model. A major achievement has very recently been achieved by Antal and Krapivsky [2] who solve for the generating function of the full joint distribution of the number of wild-types and mutants as a function of time, starting from a single wild-type individual. Also noteworthy is the solution by Angerer [1] for the probability distribution of the number of mutants given a fixed total population, N, in the case of the symmetric birth-only version of the stochastic model, where both wild-type and mutant have identical birth rates and cell death is excluded. However, it has proven difficult to obtain much physical insight from these solutions, and to relate these solutions to those of the simpler quasi-deterministic models. In particular, the remarkable fact that the distribution is these simpler models becomes [17] a universal α-stable (Lévy) distribution in the limit of small mutation rate μ, and large N such that μN ≫ 1 has had no known analogue in the fully stochastic models.

Here, we introduce a new strategy to solve this model, taking advantage of the fact that for most cases of interest the population size N is very large and the mutation rate μ is very small. For example, a tumor may have upwards of 10⁸ cells when it is first detected and the mutation rate, even if elevated above normal human mutation rates by genomic instability, would still be 10⁻⁶ or smaller (depending on exactly what range of mutations are assumed to be able to confer resistance). Note that μN is in general large, and hence a perturbative treatment would not be accurate. We show that if one focuses attention on the fixed N ensemble, one can derive an alternative generating function equation which leads immediately to a simple integral expression for the probability distribution. One can recover the asymptotic properties of this distribution for the general case where the resistant population may have a different fitness than the wild-type.

The outline of this work is as follows. To illustrate our methodology and to lay out the general features of the LD process, in particular that it gives rise to a distribution P(m) for the number m of mutants which is heavy-tailed, we first present the special case of zero death rate. We show the general features of the quasi-deterministic model, including the fact that at large μN one obtains a universal α-stable distribution is actually valid for the fully stochastic version as well. Next, we consider cell death in addition to division and mutation, and show that even for this case we recover for large μN the limiting Landau distribution (for equal birth rates) and its generalization for the asymmetric case. Finally, we compare our results to those of Antal and Krapivsky, who formally solve the same problem with a fixed time ensemble. Some of our results have appeared previously in abbreviated form [11].

2 The symmetric pure birth model

To illustrate our basic ideas, we start with the relatively simple case of no cell death (i.e. a pure birth/mutation process) and with equal birth rates for both the wild-type and mutant subpopulations. If we start from a fixed number N₀ of initial wild-type cells and wish to study the ensemble at fixed total population N, the number of events is just N − N₀. Rather than computing the probability function at fixed time, we instead use the event number as the independent variable. Then, P_N(0) is just the probability that in the N − N₀ births since the beginning, there were no mutations, thus

P_{N} (0) = {(1 - μ)}^{N - N_{0}} .

(1)

The master equation for N > N₀, m > 0 reads

P_{N + 1} (m) = \frac{1}{N} [P_{N} (m - 1) μ (N - m + 1) + P_{N} (m) (1 - μ) (N - m) + P_{N} (m - 1) (m - 1)] .

(2)

To derive what we will refer to as the scaling solution for the probability distribution, we will keep the leading order in N term for each order in μ. Later, we will see exactly when this is valid. It is easy to check that this involves keeping the coefficient of N^k for terms proportional to μ^k. Let us denote this coefficient in P_N(m) as $A_{m}^{k}$ , so that

P_{N} (m) \approx \sum_{k = 0}^{\infty} A_{m}^{k} y^{k}; y \equiv μ N .

(3)

Substituting this term into the master equation, we can check that the leading order in N condition (the (μN)^k term is automatically satisfied, but matching the next order in N term (proportional to μ^kN^k−1) leads to the non-trivial condition for m, k ≥ 1:

(k + m) A_{m}^{k} = A_{m - 1}^{k - 1} - A_{m}^{k - 1} + (m - 1) A_{m - 1}^{k},

(4)

with

A_{0}^{k} = {(- 1)}^{k} / k!; A_{m}^{0} = 0, (m = 1 \dots \infty) .

(5)

The former condition comes from our exact solution of P_N(0), Eq. (1), above and the latter from the fact that P_N(m) for all m > 0 vanishes if μ = 0. To solve this equation, we define the generating function

F (x, y) \equiv \sum_{k = 1, m = 1} A_{m}^{k} x^{m} y^{k},

(6)

in terms of which the generating function, P_N(x), of the probability distribution is given by

P_{N} (x) = F (x, μ N) + e^{- y} .

(7)

From the recursion relation Eq. (4) we easily obtain the differential equation

[y \frac{\partial}{\partial y} + x (1 - x) \frac{\partial}{\partial x}] F (x, y) = - y (1 - x) F + x y e^{- y} .

(8)

As already discussed in [11], we can solve this equation by first converting it to a homogeneous equation via the transformation F = G − e^−y, so that

[y \frac{\partial}{\partial y} + x (1 - x) \frac{\partial}{\partial x}] G (x, y) = - y (1 - x) G m,

(9)

with G(y = 0) = 1. The method of characteristics tells us to write G(x, y) = G(x(y), y), choosing dx/dy = x(1 − x)/y, so that the characteristic is

x = \frac{y}{α + y},

(10)

where α labels the characteristic. Then the equation reads

y \frac{d G}{d y} = - \frac{α y}{α + y} G,

(11)

with the solution

G = {(1 + \frac{y}{α})}^{- α}; α = \frac{y (1 - x)}{x},

(12)

so that

F = {(1 - x)}^{y (1 - x) / x} - e^{- y},

(13)

and thus

P_{N} (x) = {(1 - x)}^{μ N (1 - x) / x} .

(14)

This result was derived initially by Lea and Coulson [14], for a model which neglected the stochastic nature of the birth-death dynamics for the wild-type population. In our derivation, going to the scaling limit of N large, m ≪ N, has resulted in a distribution which has no N₀ dependence. Since when N₀ is large, the wild-type stochasticity is negligible, the fully stochastic treatment has to reduce to the Lea-Coulson result in the scaling limit.

One can numerically evaluate the Lea-Coulson distribution from the generating function via a contour integration to pick out P_N(m):

P_{N} (m) = \frac{1}{2 π i} \oint \frac{d z}{z^{m + 1}} {(1 - z)}^{μ N (1 - z) / z} .

(15)

This is more conveniently computed by moving the contour to hug the branch cut along the real axis x > 1:

P_{N} (m) \approx \frac{1}{π} \int_{1}^{\infty} d x x^{- m - 1} {(x - 1)}^{- μ N (x - 1) / x} \sin \frac{π μ N (x - 1)}{x} .

(16)

A computation of this for the case μN = 2 is shown in Fig. 1, (labelled r = 1), together with results of a direct numerical solution of the master equation for the parameters μ = 0.004, N = 500. The agreement is excellent even for this not so tiny value of μ.

It is clear by looking at the small x behavior that Eq. (14) correctly reproduces P_N(0) ≈ e^−μN. In the small μN limit,

P_{N} (x) \approx 1 + μ N \frac{1 - x}{x} \ln (1 - x) = 1 - μ N \frac{1 - x}{x} \sum_{m = 1}^{\infty} \frac{x^{m}}{m},

(17)

so that for m ≥ 1

P_{N} (m) \approx μ N (\frac{1}{m} - \frac{1}{m + 1}) = \frac{μ N}{m (m + 1)},

(18)

which is monotonically decreasing and has a fat-tailed 1/m² decay. This 1/m² behavior at large m is in fact true for all μN, and is a direct result of the logarithmic singularity of P_N(x) at x = 1.

The large μN limit is also tractable, for m’s of order μN. We write $m = μ N \tilde{m}$ , and write x = 1 + t/(μN), giving

P_{N} (m) = \frac{1}{μ N π} \int_{0}^{\infty} d t e^{- \tilde{m} t - t \ln t / y} \sin π t = \frac{1}{μ N π} \int_{0}^{\infty} d t e^{- (\tilde{m} - \ln y) t - t \ln t} \sin π t .

(19)

This is just a Landau distribution, the one-sided α-stable (Lévy) distribution with index α = 1, written in terms of the original variables as P_Landau(m/μN − ln μN) times the normalization factor 1/μN. The Landau distribution rises very rapidly from zero, reaches it maximum and then falls with a 1/m² fat tail. The appearance of the α-stable distribution in the quasi-deterministic model [17,11] is a consequence of the fact that the total number of mutants is the sum of the mutants descended from each mutation event, and from large N these are independent and identically distributed with a power-law tail, and thus the conditions of the generalized central-limit theorem are satisfied. The Landau distribution arises in the fully stochastic model for the same reasons; the only difference being an overall shift in the distribution.

Thus, the Lea-Coulson distribution is seen to interpolate between the monotonic μN/m(m + 1) distribution at small μN to the fat-tailed one-sided α-stable distribution with its sharp rise for small m and 1/m² falloff at large m. A signpost value is μN = 2, above which P_N(m) is no longer monotonic. The position of the maximum moves rightward with increasing μN, asymptotically behaving as m_max ≈ μN(ln μN − 0.22). This analysis is summarized in Fig. 2.

Fig. 2 — The probability distributions *P_N*(m)*/μN* as given by the Lea-Coulson formula for μN = 0.2, 2 and 10. In addition, the small μN limiting form 1/m(m+1) and the large μN Landau distribution are also shown. It is seen that the Landau distribution correctly describes the distribution except for m’s of order unity.

For m of order N, the Lea-Coulson result breaks down because we have neglected terms that become of comparable order in this new regime. This outer regime [11] describes how the distribution must drop to zero as we approach m = N − N₀; after all, the most extreme possibility is that every division resulted in adding one more mutant. Since μ is small, this part of the distribution arises when one of the first births lead to a mutant and hence the mutant sub-population has as much chance to expand exponentially as does the wild-type. To describe this scenario, one need only keep terms linear in μ. This can be done explicitly in the symmetric birth-only model [11]. However, even this problem is difficult to solve exactly in the general case, and is only of mathematical interest, since this outer part of the distribution describes extremely rare events and have essentially no physical relevance.

3 The non-symmetric pure birth problem

We now extend the previous results to account for a fitness (aka birth rate) difference between the wild-type and the resistant subpopulations. Normally it would be exacted that alleles conferring resistance are less fit in the absence of any drug, and hence we expect the division rate ratio r of wild-type to resistant cell to be greater than one. The master equation connecting the mutant number distribution at one overall population size to the next overall population size, must take into account the relative probability that it is a wild-type cell that divides versus that it is a mutant cell which divides. If the current population has m mutant cells and hence N − m wild-type ones, this relative probability is (N − m)r/((N − m)r + m). Then the equation clearly becomes

\begin{array}{l} P_{N + 1} (m) = (1 - μ) P_{N} (m) \frac{(N - m) r}{(N - m) r + m} \\ + \frac{P_{N} (m - 1)}{(N - m + 1) r + (m - 1)} [(N - m + 1) r μ + (m - 1)] . \end{array}

(20)

Again, we solve this equation in the limit of large N, small μ by keeping only the leading term in N for each poet of μ, again using the expansion Eq. (3). Substituting this into the master equation, we obtain a generalization of Eq. (4):

(k + \frac{m}{r}) A_{m}^{k} = A_{m - 1}^{k - 1} - A_{m}^{k - 1} + \frac{(m - 1)}{r} A_{m - 1}^{k} .

(21)

Following the same procedure as before, we transform this into an equation for the generating function,

[y \frac{\partial}{\partial y} + \frac{x (1 - x)}{r} \frac{\partial}{\partial x}] F (x, y) = - y (1 - x) F (x, y) + x y e^{- y} .

(22)

Again, the inhomogeneous solution is −e^y, and the homogeneous solution satisfies

[y \frac{\partial}{\partial y} + \frac{x (1 - x)}{r} \frac{\partial}{\partial x}] G (x, y) = - y (1 - x) G (x, y),

(23)

with G(y = 0) = 1. Again, we solve by the method of characteristics, with the characteristic labelled by α given by

x_{α} (y) = \frac{y^{1 / r}}{α + y^{1 / r}},

(24)

so that

\frac{d}{d y} G (y, x (y)) = - \frac{α}{α + y^{1 / r}} G,

(25)

with the solution

G = \exp (- \int_{0}^{y} \frac{α}{α + t^{1 / r}} d t) = \exp (- y_{2} F_{1} (1, r; 1 + r; - \frac{y^{1 / r}}{α})),

(26)

where ₂F₁ is a hypergeometric function. Substituting α = y¹^/r(1 − x)/x, we get

G = \exp (- y_{2} F_{1} (1, r; 1 + r; - \frac{x}{1 - x})) = \exp (- y (1 - x)_{2} F_{1} (1, 1; 1 + r; x)),

(27)

where we have employed a linear transformation identity, so that finally

P_{N} (x) = \exp (- μ N (1 - x)_{2} F_{1} (1, 1; 1 + r; x)) .

(28)

Again, to evaluate P_m we integrate around the branch cut, which again extends rightward from x = 1. Using standard identities involving hypergeometric functions, we have

\begin{array}{l} _{2} F_{1} (1, r; 1 + r; - \frac{x}{1 - x}) = \frac{r}{r - 1} (\frac{1 - x}{x})_{2} F_{1} (1, 1 - r; 2 - r; - \frac{1 - x}{x}) \\ + \frac{π r}{\sin π r} {(\frac{1 - x}{x})}^{r}_{2} F_{1} (r; 0; r; - \frac{1 - x}{x}) \\ = \frac{r}{r - 1} (\frac{1 - x}{x})_{2} F_{1} (1, 1 - r; 2 - r; - \frac{1 - x}{x}) \\ + \frac{π r}{\sin π r} {(\frac{1 - x}{x})}^{r} . \end{array}

(29)

This allows us to evaluate the discontinuity along the branch cut, giving

\begin{array}{r} P_{N} (m) = \frac{1}{π} \int_{1}^{\infty} \frac{d x}{x^{m + 1}} \exp (μ N \frac{r (x - 1)}{(r - 1) x}_{2} F_{1} (1, 1 - r, 2 - r, \frac{x - 1}{x})) \\ \times \exp (- μ N \frac{π r}{\tan π r} {(\frac{x - 1}{x})}^{r}) \sin (μ N π r {(\frac{x - 1}{x})}^{r}) . \end{array}

(30)

Again, Fig. 1 presents a comparison of this expression with results of a direct numerical solution of the master equation for the parameters μ = 0.004, N = 500. The agreement is again excellent.

The non-analyticity at x = 1 as before governs the large m asymptotics (subject to m ≪ N, as usual). To investigate the behavior near x = 1, we use Eq. (29) to obtain (t ≡ 1 − x)

_{2} F_{1} (1, r; 1 + r; - \frac{x}{1 - x}) \approx \frac{r}{r - 1} t + \frac{r π}{\sin π r} t^{r} .

(31)

By standard Tauberian theorems, this corresponds to a m¹⁺^r decay at large m, independent at μN. In particular, for small μN, we have

\begin{array}{l} P_{N} (x) \approx 1 - μ N (1 - x)_{2} F_{1} (1, 1; 1 + r; x) \\ = 1 - μ N + μ N \sum_{m = 1}^{\infty} \frac{(m - 1)! Γ (1 + r) r}{Γ (r + m + 1)}, \end{array}

(32)

so that for m ≥ 1:

P_{N} (m) \approx μ N \frac{(m - 1)! Γ (1 + r) r}{Γ (r + m + 1)} \underset{m \to \infty}{\to} μ N \frac{r Γ (1 + r)}{m^{1 + r}} .

(33)

For large μN, as we saw in the case of r = 1, the contour integral giving P_N(m) is dominated by the x ≈ 1 singularity of the integrand. Thus,

P_{N} (m) \approx \int_{0}^{\infty} dt e^{- m t} \exp (- μ N [\frac{r}{r - 1} t + \frac{r π}{\sin π r} t^{r}]) .

(34)

This is nothing but the one-sided α-stable distribution, now with index α = r, consistent with the power-law decay. To see this, note that the characteristic function, ϕ (the Fourier transform of the probability density) of the α-stable distribution with index r has the form

ϕ = \exp [i t ν - {| c t |}^{r} (1 - i sgn (t) \tan (π r / 2))],

(35)

where ν is called the shift parameter, which is just the mean of the distribution, and c is the scale parameter. By rotating t → −it in Eq. (34), we get for the shift parameter

ν = μ N \frac{r}{r - 1},

(36)

which is indeed the mean of the distribution Eq. (32). For the scale parameter, we get

c = {[μ N \frac{r π}{\sin π r} \cos \frac{π r}{2}]}^{1 / r} = {[μ N \frac{r π}{2 \sin π r / 2}]}^{1 / r} .

(37)

4 General Asymmetric process with death

We now study the general asymmetric LD model with death, so that each event is either a birth event with rates b_w, b_m or a death event with rates d_w, d_m. We start from the probability distribution as a function of time. The master equations for P(N, m, t), where N denotes the total (w.t. + mutated) population and m is the number of mutants, are:

\begin{array}{l} \dot{P} (0, 0) = d_{w} P (1, 0) + d_{b} P (1, 1), \\ \dot{P} (N, m) = b_{w} (N - 1 - m) (1 - μ) P (N - 1, m) + b_{w} μ (N - m) P (N - 1, m - 1) \\ + b_{m} (m - 1) P (N - 1, m - 1) \\ + d_{w} (N + 1 - m) P (N + 1, m) + d_{m} (m + 1) P (N + 1, m + 1) \\ - [(b_{w} + d_{w}) (N - m) + (b_{m} + d_{m}) m] P (N, m); (N > 1), \end{array}

(38)

with P (N₀, 0) = 1 at t = 0.

To simplify these expressions, we make use of the fact that we are interested in a fixed value of N, independent of how much time has elapsed since the beginning. From the biology perspective, what one knows (ideally) is the size of the tumor at discovery, with no idea how long the tumor has been growing. Thus, we are interested in the conditional probabilities

P (m | N) \equiv {(Z_{N})}^{- 1} \int_{0}^{\infty} P (N, m; t) d t \equiv {(Z_{N})}^{- 1} T_{N} (m); Z_{N} \equiv \sum_{m} T_{N} (m) .

(39)

Integrating the master equation over time leads to the following equation for T_N(m), N ≥ 1:

\begin{array}{l} [(b_{w} + d_{w}) (N - m) + (b_{m} + d_{m}) m] T_{N} (m) - δ_{N, N_{0}} δ_{m, 0} \\ = b_{w} (N - 1 - m) (1 - μ) T_{N - 1} (m) \\ + b_{w} μ (N - m) T_{N - 1} (m - 1) + b_{m} (m - 1) T_{N - 1} (m - 1) \\ + d_{w} (N + 1 - m) T_{N + 1} (m) + d_{m} (m + 1) T_{N + 1} (m + 1) . \end{array}

(40)

with the definition T₀(m) ≡ 0. Again we assume $T_{N} (m) = \sum_{k} {(μ N)}^{k} A_{m}^{k}$ . Then, plugging this into the recursion relation, and noting that as always the N^k term cancels, we find the generalized scaling-limit recursion

\begin{array}{l} A_{m}^{k} [m (b_{m} + d_{m}) + k (b_{w} - d_{w})] = b_{w} A_{m - 1}^{k - 1} - b_{w} A_{m}^{k - 1} + b_{m} (m - 1) A_{m - 1}^{k} \\ + d_{m} (m + 1) A_{m + 1}^{k} . \end{array}

(41)

By now, the procedure for solving this equation should be obvious. We rewrite this systems as an equation for the generating function’

[(b_{w} - d_{w}) y \frac{\partial}{\partial y} + (x (b_{m} + d_{m}) - b_{m} x^{2} - d_{m}) \frac{\partial}{\partial x}] F (x, y) = - b_{w} y (1 - x) F + ℐ,

(42)

where ℐ is the inhomogeneous term. Again x = 1 is the singular point with the coefficient of ∂F/∂x vanishes. Defining r ≡ (b_w − d_w)/(b_m − d_m), the ratio of net birth rates, the characteristic is given by

x (y) = \frac{y^{1 / r} + α d_{m}}{y^{1 / r} + α b_{m}} .

(43)

The homogeneous equation now reads

(b_{w} - d_{w}) y \frac{d G}{d y} = - b_{w} y \frac{α (b_{m} - d_{m})}{y^{1 / r} + α b_{m}} G .

(44)

Notice that except for factors, this is the exact same equation we encountered in the asymmetric pure-birth case, and so the solution is immediate:

\begin{array}{l} G = N \exp (- \frac{b_{w}}{r b_{m}} y_{2} F_{1} (1, r; 1 + r; - \frac{y^{1 / r}}{α})) \\ = N \exp (- y \frac{b_{w}}{r b_{m}}_{2} F_{1} (1, r; 1 + r; - \frac{b_{m} x - d_{m}}{b_{m} (1 - x)})), \end{array}

(45)

where $N$ is a normalization prefactor. Our desired conditional probability generating function, which we denote by P_N(x) is then

P_{N} (x) = \exp (- μ N \frac{b_{w}}{r b_{m}}_{2} F_{1} (1, r; 1 + r; - \frac{b_{m} x - d_{m}}{b_{m} (1 - x)})) .

(46)

We see that in the pure birth case, d_w = d_m = 0, this reproduces the results of the previous section.

The large m behavior is, as we have seen, determined by the behavior in the vicinity of x = 1. Writing x = 1 − t, we have

_{2} F_{1} (1, r; 1 + r; - \frac{b_{m} x - d_{m}}{b_{m} (1 - x)}) \approx \frac{r b_{m}}{(r - 1) (b_{m} - d_{m})} t + \frac{r π}{\sin π r} {(\frac{b_{m}}{b_{m} - d_{m}} t)}^{r} .

(47)

The analytic structure is identical to the asymmetric pure-birth case, and so P(m|N) again decays as m¹⁺^r:

P_{N} (m | N) \approx μ N {(\frac{b_{m}}{b_{m} - d_{m}})}^{r} \frac{b_{w} Γ (1 + r)}{b_{m} m^{1 + r}} .

(48)

Though the exponent is only a function of the the ratio of the net birth rates, the coefficient is more complicated.

We can again evaluate P(m|N) from our generating function via an integration along the branch cut:

\begin{array}{l} P_{N} (m) = \frac{1}{π} \int_{1}^{\infty} \frac{d x}{x^{m + 1}} \exp (μ N \frac{b_{w} (x - 1)}{(r - 1) (b_{m} x - d)}_{2} F_{1} (1, 1 - r, 2 - r, \frac{b_{m} (x - 1)}{b_{m} x - d})) \\ \times \exp (- μ N \frac{π b_{w}}{b_{m} \tan π r} {(\frac{b_{m} (x - 1)}{b_{m} x - d})}^{r}) \sin (μ N \frac{π b_{w}}{b_{m}} {(\frac{b_{m} (x - 1)}{b_{m} x - d})}^{r}) . \end{array}

(49)

We present in Fig. 3 such a calculation for the case b_w = 3/4, d_w = 1/4, b_m = b_w/r, d_m = d_w/r, with r = 1.3. Also shown is a direct numerical solution of the master equation, Eq. (40), for μ = 0.004, N = 500, which confirms our asymptotic analysis. For comparison, the pure birth distribution with the same r = 1.3 is shown.

For small μN, we can write P(m|N) explicitly, since

\begin{array}{l} P_{N} (x) \approx 1 - μ N \frac{b_{w}}{r b_{m}}_{2} F_{1} (1, r; 1 + r; - \frac{b_{m} x - d_{m}}{b_{m} (1 - x)}) \\ = 1 - μ N \frac{b_{w}}{r b_{m}} \sum_{k = 0}^{\infty} \frac{1}{k!} {(\frac{d_{m}}{b_{m} (1 - x)})}^{k} \frac{Γ (1 + k) Γ (r + k) Γ (1 + r)}{Γ (1) Γ (r) Γ (1 + r + k)} \\ \times_{2} F_{1} (1 + k, r + k; 1 + r + k; - \frac{x}{1 - x}) \\ = 1 - μ N \frac{b_{w}}{r b_{m}} \sum_{k = 0}^{\infty} {(\frac{d_{m}}{b_{m}})}^{k} \frac{r}{r + k} (1 - x)_{2} F_{1} (1, 1 + k; 1 + r + k; x) \\ = 1 - μ N \frac{b_{w}}{b_{m}} \sum_{k = 0}^{\infty} {(\frac{d_{m}}{b_{m}})}^{k} \frac{1 - x}{r + k} \sum_{m = 0}^{\infty} \frac{Γ (1 + m) Γ (1 + k + m) Γ (1 + r + k)}{Γ (1) Γ (1 + k) Γ (1 + r + k + m) m!} x^{m} \\ = 1 + μ N \frac{b_{w}}{b_{m}} \sum_{k = 0}^{\infty} {(\frac{d_{m}}{b_{m}})}^{k} [- \frac{1}{r + k} + \sum_{m = 1}^{\infty} \frac{r Γ (k + m) Γ (r + k)}{Γ (1 + k) Γ (1 + r + k + m)} x^{m}] \\ = 1 + μ N \frac{b_{w}}{r b_{m}} [-_{2} F_{1} (1, r; 1 + r; \frac{d_{m}}{b_{m}}) + \sum_{m = 1}^{\infty} \frac{r^{2} Γ (r) Γ (m)}{Γ (r + m + 1)}_{2} F_{1} (r, m; r + m + 1; \frac{d_{m}}{b_{m}}) x^{m}], \end{array}

(50)

so that, for m > 0,

P (m | N) \approx μ N r \frac{b_{w}}{b_{m}} \frac{Γ (r) Γ (m)}{Γ (r + m + 1)}_{2} F_{1} (r, m; r + m + 1; \frac{d_{m}}{b_{m}}) \equiv p (m | N),

(51)

and

P (0 | N) \approx 1 - μ N \frac{b_{w}}{r b_{m}}_{2} F_{1} (1, r; 1 + r; \frac{d_{m}}{b_{m}}) \equiv 1 + p (0 | N) .

(52)

It is straightforward, though tedious, to show that

t_{N} (m) = \frac{1}{b_{w} N} p (m | N)

(53)

is an exact solution of the small μ version of Eq. (40) (using the fact that to zeroth order in μ, T_N(0) ≈ 1/(b_wN):

\begin{array}{l} [(b_{w} + d_{w}) (N - m) + (b_{m} + d_{m}) m] t_{N} (m) \\ = b_{w} (N - 1 - m) t_{N - 1} (m) + b_{m} (m - 1) t_{N - 1} (m - 1) \\ + d_{w} (N + 1 - m) t_{N + 1} (m) + d_{m} (m + 1) t_{N + 1} (m + 1), \\ [(b_{w} + d_{w}) (N - 1) + (b_{m} + d_{m})] t_{N} (1) \\ = b_{w} (N - 2) t_{N - 1} (1) + d_{w} N t_{N + 1} (1) + 2 d_{m} t_{N + 1} (2) + μ, \\ [(b_{w} + d_{w}) N] t_{N} (0) = b_{w} (N - 1) t_{N - 1} (0) + d_{w} (N + 1) t_{N + 1} (0) \\ + d_{m} t_{N + 1} (1) - μ, \end{array}

(54)

with 0 ≤ m ≤ ∞. However, the finite N system is truncated, with the equation for t_N(N) being different, so that this solution does not represent an exact solution of this truncated system. Nevertheless, it turns out that t_N(0) and t_N(N) decouple from the other equations, and in fact T_N(m) = t_N(m) is the exact solution of the N₀ = 1 small μ problem for 0 < m < N.

The perspicacious reader will have noticed that the problem we are solving in this section does not reduce exactly to the problem we posed when we considered the pure-birth asymmetric model. The equations are the same with the identification that T_N(m) in our section maps to P_N(m)/((N − m)r + m) in that section. However, the probabilities we construct from the T_N do not correspond to the P’s in the previous section. Here, we weight each state by the time spent in the state, which we did not do there. To leading order in N, however, the time spent in a state is independent of the state (as long as m ≪ N), so the large-N solution is identical in the two ensembles.

Note that when x = 0, we get

P (0 | N) = \exp (- μ N \frac{b_{w}}{r b_{m}}_{2} F_{1} (1, r; 1 + r; \frac{d_{m}}{b_{m}})),

(55)

which is equivalent to a well-known result of Ref. [8]. The advantage of our approach is that it is clear what approximations have to be made to obtain this result, something that is not overly clear in this earlier reference. It should also be noted in this context that the result obtained in Ref. [8] for P(m|N) in the small μN limit is not correct.

We can use Eq. (47) to find the large μN, m ∼ μN limiting distribution, where the branch cut integral is as usual dominated by the singular point at y = 1. Given that the analytic structure here, as we have seen, is exactly the same as for the asymmetric pure-birth case, we immediately can conclude that the distribution for large μN reproduces the one-sided α-stable distribution with index r. The shift parameter is

\begin{array}{l} ν = μ N \frac{b_{w}}{b_{m}} \frac{b_{m} - d_{m}}{b_{w} - d_{w}} \frac{b_{w} - d_{w}}{b_{w} - d_{w} - b_{m} + d_{m}} \frac{b_{m}}{b_{m} - d_{m}} \\ = μ N \frac{b_{w}}{b_{w} - d_{w} - b_{m} + d_{m}} . \end{array}

(56)

That this is correct can be seen from the calculation of $\bar{m}$ in the original DL model:

\begin{array}{l} \bar{m} = \int_{0}^{T} μ b_{w} N_{0} e^{(b_{w} - d_{w}) t} e^{(b_{m} - d_{m}) (T - t)} d t \\ = μ (N - N_{0}) \frac{b_{w}}{b_{w} - d_{m} - b_{m} + d_{m}} . \end{array}

(57)

For the scale parameter, we get

\begin{array}{l} c = \frac{b_{m}}{b_{m} - d_{m}} {[μ N (\frac{b_{w}}{b_{m}}) (\frac{b_{m} - d_{m}}{b_{w} - d_{w}}) \frac{b π}{\sin π b} \cos \frac{π b}{2}]}^{1 / b} \\ = {(\frac{b_{m}}{b_{m} - d_{m}})}^{1 - 1 / b} {[μ N (\frac{b_{w}}{b_{w} - d_{w}}) \frac{b π}{2 \sin \frac{π b}{2}}]}^{1 / b} . \end{array}

(58)

Lastly, for completeness, we write down the results for the symmetric birth-death process. Here r = 1 and the large μN limit is thus again a Landau distribution. For P(0|N) we have

P (0 | N) \equiv = e^{- μ N b_{w} / d_{w} \ln \frac{b_{w}}{b_{w} - d_{w}}} .

(59)

For the large μN limit, we have

P (m | N) \approx \frac{b - d}{π μ N b} \int_{0}^{\infty} d t e^{- t m / (μ N b / (b - d) - t \ln t + t \ln μ N} \sin π t,

(60)

which is again a Landau distribution, P_Landau(m(b − d)/μNb − lnμN). Amazingly, introducing death to the LD problem hardly makes a change in the answer, just renormalizing the variable m by the ratio (b − d)/b.

5 Comparison with the fixed time ensemble

5.1 Pure birth model

The most general solution of LD problem is given in the recent paper by Antal and Krapivsky (AK) [2], who solve the problem of determining the probability distribution at a fixed time. We first present their solution, modifying their notation slightly so as not to interfere destructively with the notation we have been using until now. They write down the backward version of the master equation for the probabilities of w wild-types and m mutants in the form

\begin{array}{l} \frac{d P^{w}}{d t} (w, m, t) = b_{w} (1 - μ) P^{w w} (w, m, t) + μ b_{w} P^{w w} (w, m, t) \\ - (b_{w} + d_{w}) P^{w} (w, m, t) + d_{w} δ_{w, 0} δ_{m, 0}, \\ \frac{d P^{m}}{d t} (w, m, t) = b_{m} P^{m m} (w, m, t) - (b_{m} + d_{m}) P^{m} (w, m, t) + d_{m} δ_{N, 0} δ_{m, 0} . \end{array}

(61)

In this expression the division and death rates of the wild-type (w) and mutant (m) subpopulations are denoted by b and d with the proper subscript. The superscript denotes a process which starts from a single wild-type individual (superscript w), a single mutant individual (superscript m), two wild-type (ww) and one of each (wm). A great simplification occurs in passing to the generating function version, in that the independent time evolution of any two individuals leads to factorization of the two-individual probabilities, so that the generating function for P^ww is the square of the generating function for (P^w), with similar results for P^wm and P^ww. This then leads to a pair of coupled ODE’s for W(x, y) and M(x, y), the generating functions defined as

W (x, y; t) \equiv \sum x^{m} y^{w} P^{w} (w, m; t); M (x, y; t) \equiv \sum x^{m} y^{w} P^{m} (w, m; t),

(62)

and the equations are

\begin{array}{l} \frac{\partial W}{d t} = b_{w} (1 - μ) W^{2} + μ b_{w} M W + d_{w} - (b_{w} + d_{w}) W, \\ \frac{\partial M}{d t} = b_{m} M^{2} + d_{m} - (b_{m} + d_{m}) M . \end{array}

(63)

To see how the solution of this system works and how it relates to our approach, we return for the moment to the simpler case of no death and symmetric division b_w = b_m ≡ b, and recall that the AK derivation so far is for N₀ = 1. We can rewrite these equations by picking the unit of time to set the wild-type growth rate b(1 − μ) equal to 1. This leads to the new system

\begin{array}{l} \frac{\partial W}{d t} = W^{2} + ν M W - (1 + ν) W \\ \frac{\partial M}{d t} = λ M^{2} - λ M \end{array}

(64)

with ν ≡ μ/(1 − μ), λ ≡ 1/(1 − μ). The initial condition for the M equation is that M(t = 0) = x, since by definition the state defining M starts with one mutant cell. The solution of this equation is

M = - \frac{z}{1 - z}; z = - \frac{x}{1 - x} e^{- λ t} .

(65)

Substituting this into the W equation, one obtains the solution

W = - \frac{C z}{{(1 - z)}^{μ} + C - C z} .

(66)

The constant of integration C is picked to satisfy the initial condition W (t = 0) = y, yielding

C = - \frac{y {(1 - x)}^{1 - μ}}{y - x} .

(67)

From this, one can obtain directly the distribution of mutant numbers, independent of the number of wild-type, given by F (x, t) ≡ W (x, y = 1; t); setting y = 1 gives C = 1 and using the mean population size $\bar{N} \equiv e^{λ t}$ , we obtain

F (x) = \frac{x}{x + \bar{N} (1 - x) [1 - {(1 - x + x / \bar{N})}^{μ}]} .

(68)

To compare this result with our finding given above in Eq. 13, we go to the scaling limit, which is small μ, large $\bar{N}$ , finite m (so that the number of wild-type is approximately equal to the total population). We can then drop the $x / \bar{N}$ term, and we have

F (x) \approx \frac{x}{x - (1 - x) μ \bar{N} \ln (1 - x)},

(69)

so that indeed $μ \bar{N}$ is the relevant control parameter. This is not the Lea-Coulson generating function, as has already been noted in our abbreviated initial paper [11]. In particular, $P_{0} = 1 / (1 + μ \bar{N})$ , so that it does not decay exponentially with $μ \bar{N}$ as we found above. The difference lies in the fact that the AK calculation is performed for fixed time, not at fixed total population N as before. For large time, the distribution of the number of wild-type cells, independent of the number of mutants can be shown to be geometric:

P^{w} (w) \approx \frac{1}{\bar{w} (t)} e^{- w / \bar{w} (t)}; \bar{w} (t) = e^{b (1 - μ) t} .

(70)

From this, given that the probability of zero mutants given w wild-type cells is, for small μ, exp(−μ(w −1)), it immediately follows that the probability of zero mutants is given by

P_{0} \approx \int_{0}^{\infty} \frac{d w}{\bar{w} (t)} e^{- μ w} e^{- w / \bar{w} (t)} \approx \frac{1}{1 + μ \bar{N}},

(71)

in agreement with the AK calculation. Clearly, then, the configurations at fixed time with small numbers of wild-types and thus a relatively large probability of never having spawned a resistant cell increase P₀ drastically. In Fig. 4 we present the distribution resulting from the AK fixed time ensemble, for times corresponding to $μ \bar{N} = 0.1$ , 1 and 10. We see that in all cases the distribution is monotonic, and looks nothing like the Lea-Coulson result, nor does it go over to the Landau distribution at large $μ \bar{N}$ .

Fig. 4 — The scaled distribution of mutants, $P_{t} (m) / μ \bar{N}$ , in the AK fixed time ensemble, for times corresponding to $μ \bar{N} = 0.1$ , 1 and 10.

Looking at F (x) for small $μ \bar{N}$ , we see that it does reproduce the small μN limit of the Lea-Coulson answer. More generally, the structure of the singularity at x = 1 is the same of for Lea-Coulson, so that fixed-time ensemble does show the same 1/m² behavior as our fixed-N ensemble for the intermediate asymptotic regime 1 ≪ m ≪ N. The behavior in the cut-off regime m ~ N is not surprisingly very different from the fixed-N behavior, since the shift of the pole of F (x) to $x = 1 / (1 + 1 / \bar{N})$ means that the decay of P(m) is exponential for m ~ N

Of course, the full AK solution contains information about the wild-type distribution encoded in its dependence on y which has been dropped in the derivation of F. If we return to the full solution for W, we can of course restrict ourself to a fixed value of $m = \bar{N}$ by finding the coefficient of the term that goes like y^N. This is still not exactly the same problem we solved. Our initial ensemble allows the elapsed time to vary, whereas this new result fixes both the time and the total population. However, the distribution of mutants as we have seen is controlled by the total population, and we do not expect the time taken to reach this population to matter significantly, at least as long as it is close to the average time. Further, as we have already noted, for m ≪ N, the total population N and the wild-type population w are the same to leading order in N. Thus, undaunted, we continue. The projection of W(x, y) onto fixed $m = \bar{N}$ , which we denote $W_{\bar{N}} (x)$ can be done exactly, since the y dependence of W is simple and takes the form

W = A_{1} y / (A_{2} y + A_{3})

(72)

with

A_{1} = z, A_{2} = z - 1 + \frac{{[(1 - z) (1 - x)]}^{μ}}{1 - x}, A_{3} = - \frac{x {[(1 - z) (1 - x)]}^{μ}}{1 - x},

(73)

and where as before $z = - \frac{x / \bar{N}}{1 - x}$ . Thus,

W_{\bar{N}} (x) = N \frac{A_{1}}{A_{3}} {(- \frac{A_{2}}{A_{3}})}^{\bar{N} - 1} .

(74)

where $N$ is a normalization factor, chosen so that $W_{\bar{N}} (1) = 1$ . The non-trivial normalization factor arises because we do not know a priori the overall probability of finding exactly the correct value of N in the overall ensemble, and hence this must be divided out to obtain a meaningful probability distribution. Now, in the limit x → 1,

\begin{array}{l} z \to - \frac{1}{\bar{N} (1 - y)} \to - \infty, A_{2} \to - \frac{1}{\bar{N} (1 - y)} + \frac{{\bar{N}}^{- μ}}{1 - y} = \frac{{\bar{N}}^{- μ} (1 - {\bar{N}}^{μ - 1})}{1 - y}, \\ A_{3} \to - \frac{{\bar{N}}^{- μ}}{1 - y} . \end{array}

(75)

Thus, we can find the value of the normalization constant

N = {\bar{N}}^{1 - μ} {(1 - {\bar{N}}^{μ - 1})}^{1 - \bar{N}} \approx {\bar{N}}^{1 - μ} e^{1 + μ \ln \bar{N}} = \bar{N} e^{1} .

(76)

Given this, we can now calculate the leading $large- \bar{N}$ behavior of $W_{\bar{N}} (x)$ , which we wish to compare to F(x). For fixed x and large $\bar{N}$ , z → 0 and

A_{2} / A_{3} = \frac{{[(1 - z) (1 - x)]}^{1 - μ}}{x} - \frac{1}{x} \approx - 1 + \frac{1}{\bar{N}} - μ \frac{(1 - x) \ln (1 - x)}{x} .

(77)

Also,

A_{1} / A_{3} = - \frac{z (1 - x)}{x {[(1 - z) (1 - x)]}^{μ}} \approx \frac{1}{N} .

(78)

Finally we obtain

W_{\bar{N}} (x) \approx N e^{1} (\frac{1}{N}) e^{- 1 + μ N \frac{(1 - x) \ln (1 - x)}{x}} = {(1 - x)}^{μ N (1 - x) / x} .

(79)

This is just the Lea-Coulson result that we had obtained above. Thus, our expectations that post-projecting the fixed-time ensemble for fixed wild-type abundance would reproduce the fixed population ensemble result in the intermediate asymptotic regime are indeed realized.

The above calculation was for N₀ = 1. For general N₀, the AK result for the generating function W(x, y; N₀) is immediate:

W (x, y; N_{0}) = {[W (x, y)]}^{N_{0}},

(80)

due to the independence of the life histories of the initial N₀ wild-type. Clearly then, the AK result, i.e. conditioning on fixed time, is N₀ dependent, as opposed to the N₀ independence of the fixed total population result, for m ≪ N. We can generalize our calculation of the fixed $m = \bar{N}$ projection of the AK result to general $N_{0} ≪ \bar{N}$ . Now, we have exp $(λ t) = \bar{N} / N_{0}$ . Then, performing the Taylor expansion of $W^{N_{0}}$ we get

W_{\bar{N}} (x; N_{0}) = N {(\frac{A_{1}}{A_{3}})}^{N_{0}} (\begin{array}{l} \bar{N} - 1 \\ N_{0} - 1 \end{array}) {(- \frac{A_{2}}{A_{3}})}^{\bar{N} - N_{0}} .

(81)

The normalization factor is now

\begin{array}{l} N = {(\frac{\bar{N}}{N_{0}})}^{N_{0} (1 - μ)} {\bar{N}}^{1 - N_{0}} (N_{0} - 1)! {[1 - {(\frac{\bar{N}}{N_{0}})}^{μ - 1}]}^{N_{0} - \bar{N}} \\ \approx \bar{N} {(\frac{e}{N_{0}})}^{N_{0}} (N_{0} - 1)! . \end{array}

(82)

Then,

\begin{array}{l} W_{\bar{N}} (x; N_{0}) = N {(\frac{N_{0}}{\bar{N}})}^{N_{0}} (\begin{array}{l} \bar{N} - 1 \\ N_{0} - 1 \end{array}) {(1 - \frac{N_{0}}{\bar{N}} + μ \frac{(1 - x) \ln (1 - x)}{x})}^{\bar{N} - N_{0}} \\ \approx {(1 - x)}^{μ \bar{N} (1 - x) / x}, \end{array}

(83)

recovering, as we expect, the N₀-independent Lea-Coulson result. It is remarkable that the fixed population ensemble results are much more similar to the original semi-deterministic model of Luria and Delbrück, with its universal aspects such as N₀ independence and large μN α-stable distribution than is the fixed time ensemble.

5.2 General Asymmetric Birth-Death Model

We chose to first present our projection method in the simplest possible context of the pure-birth, symmetric model, as in this case the AK solution is expressible in terms of elementary functions and the structure of the calculation is most transparent. The procedure, however, works in essentially the same way for the most general asymmetric birth-death model for which AK presented their solution for the fixed time ensemble. The model is now parameterized in terms of wild-type and mutant birth rates, b_w, b_m as well as wild-type and mutant death rates, d_w, d_m. AK choose to scale time so that the non-mutating wild-type birth process W → WW has rate unity. AK then introduce the following rates:

\begin{matrix} α_{m} \equiv \frac{b_{m}}{b_{w} (1 - μ)}, β_{w} \equiv \frac{d_{w}}{b_{w} (1 - μ)}, β_{m} \equiv \frac{d_{m}}{b_{w} (1 - μ)}, v = \frac{μ}{1 - μ}, \\ λ_{w} = 1 - β_{w} - v, λ_{m} = α_{m} - β_{m} . \end{matrix}

(84)

AK also introduce the auxiliary parameter

ω \equiv - \frac{λ_{w}}{2} - \frac{v β_{m}}{2 α_{m}} + \sqrt{{(\frac{λ_{w}}{2} + \frac{v β_{m}}{2 α_{m}})}^{2} + \frac{v λ_{m}}{α_{m}} .}

(85)

Note that in the small μ limit, ω is of order μ:

ω \approx μ \frac{b_{m} - d_{m}}{b_{w} - d_{w}} \frac{b_{w}}{b_{m}} .

(86)

AK define as well

\begin{array}{l} a \equiv \frac{ω}{λ_{m}} \approx \frac{μ b_{w}^{2}}{b_{m} (b_{w} - d_{w})}, b \equiv \frac{ω + 1 - β_{w}}{λ_{m}} \approx \frac{b_{w} - d_{w}}{b_{m} - d_{m}}, \\ c \equiv 1 + a + b - \frac{ν}{α_{m}} \approx b + 1, \end{array}

(87)

where the approximations are correct to leading order in μ. Then, the full solution derived in AK takes the extremely complicated form

W (x, y; t) = 1 + ω + \frac{T_{1} y + T_{2}}{T_{3} y + T_{4}} = 1 + ω + \frac{(T_{1} y + T_{2})}{T_{4}} \sum_{k} {(- 1)}^{k} {(y \frac{T_{3}}{T_{4}})}^{k},

(88)

where

\begin{array}{l} T_{1} = z^{c + 1} z_{0} F_{3} (z) F_{2} (z_{0}) + (c - 1) z_{0}^{c} F_{2} (z) F_{1} (z_{0}) z - z^{2} z_{0}^{c} F_{4} (z) F_{1} (z_{0}), \\ T_{2} = z^{c + 1} z_{0} F_{3} (z) [F_{2} (z_{0}) (λ_{m} (c - 1) - 1 - ω) - z_{0} F_{4} (z_{0}) λ_{m}] \\ - z_{0}^{c} z (c - 1) F_{2} (z) [F_{1} (z_{0}) (1 + ω) + λ_{m} F_{3} (z_{0}) z_{0}] \\ + z_{0}^{c} z^{2} F_{4} (z) [F_{1} (z_{0}) (1 + ω) + λ_{m} z_{0} F_{3} (z_{0})], \\ T_{3} = z^{c} z_{0} F_{1} (z) F_{2} (z_{0}) - z_{0}^{c} z F_{2} (z) F_{1} (z_{0}), \\ T_{4} = z^{c} z_{0} F_{1} (z) [F_{2} (z_{0}) ((c - 1) λ_{m} - 1 - ω) - z_{0} λ_{m} F_{4} (z_{0})] \\ + z z_{0}^{c} F_{2} (z) [F_{1} (z_{0}) (1 + ω) + λ_{m} z_{0} F_{3} (z_{0})], \end{array}

(89)

with the function definitions

\begin{array}{l} F_{1} =_{2} F_{1} (a, b; c; z), \\ F_{2} =_{2} F_{1} (1 + a - c, 1 + b - c; 2 - c; z), \\ F_{3} = \frac{a b}{c}_{2} F_{1} (1 + a, 1 + b; 1 + c; z), \\ F_{4} = \frac{(1 + a - c) (1 + b - c)}{2 - c}_{2} F_{1} (2 + a - c, 2 + b - c; 3 - c; z), \end{array}

(90)

and

z = \frac{1}{\bar{M}} [1 - \frac{λ_{m}}{α_{m} (1 - x)}], z_{0} = \bar{M} z .

(91)

Here, $\bar{M} \equiv e^{λ t}$ and is related to $\bar{N}$ , the average number of wild-type cells, by

\bar{N} \equiv e^{λ_{w} t} = {\bar{M}}^{λ_{w} / λ_{m}} \approx {\bar{M}}^{b} .

(92)

Then, $W_{\bar{N}} (x)$ is given by

W_{\bar{N}} (x) = N \frac{T_{1} T_{4} - T_{2} T_{3}}{T_{4}^{2}} {(- \frac{T_{3}}{T_{4}})}^{N - 1} .

(93)

For the calculation of the T₁T₄ − T₂T₃ factor, it is sufficient to work to zeroth order in μ. To this order F₁ = F₂ = 1 and F₃ = F₄ = 0. Then, we have

\begin{array}{l} T_{1} T_{4} - T_{2} T_{3} \approx [b z_{0}^{c} z] [z^{c} z_{0} (b λ_{m} - 1) + z z_{0}^{c}] - [- b z_{0}^{c} z] [z^{c} z_{0} - z_{0}^{c} z] \\ = b^{2} λ_{m} z_{0}^{c + 1} z^{c + 1} . \end{array}

(94)

For the $T_{4}^{2}$ term in the denominator, things are even simpler since we can use the fact that z ≪ z₀:

T_{4}^{2} \approx z_{0}^{2 c} z^{2},

(95)

so that

\frac{T_{1} T_{4} - T_{2} T_{3}}{T_{4}^{2}} \approx b^{2} λ_{m} {(\frac{z}{z_{0}})}^{b} \approx b λ_{m} / N .

(96)

For the calculation of T₃/T₄ we have to keep both the first 1/N and μ corrections since the ratio is to be raised to the $\bar{N} th$ power. We have, since $a ~ O (μ)$

\begin{array}{l} F_{1} (x) \approx 1 + a \sum_{n = 1}^{\infty} \frac{Γ (b + n) Γ (b + 1)}{Γ (b) Γ (b + 1 + n) n} x^{n} \\ = 1 + \frac{a}{b} [1 - b \ln (1 - x) -_{2} F_{1} (1, b; 1 + b; x)] \equiv 1 + μ F_{1}^{1} (x), \end{array}

(97)

and

\begin{array}{l} F_{3} (x) \approx a \frac{Γ (b + 1)}{Γ (b)} \sum_{n = 1}^{\infty} \frac{n Γ (n) Γ (b + n)}{n! Γ (b + n + 1)} x^{n - 1} \\ = a \frac{_{2} F_{1} (1, b; b + 1; x) - 1}{x} \equiv μ F_{3}^{1} (x), \end{array}

(98)

and so

T_{3} \approx - z_{0}^{c} z [1 + μ F_{1}^{1} (z_{0}) - \frac{1}{\bar{N}}] .

(99)

Similarly,

T_{4} \approx z_{0}^{c} z [1 + ω + μ λ_{m} z_{0} F_{3}^{1} (z_{0}) + μ F_{1}^{1} (z_{0}) + \frac{b λ_{m} - 1)}{\bar{N}}] .

(100)

Thus

\frac{T_{3}}{T_{4}} \approx - [1 - \frac{b λ_{m}}{\bar{N}} - ω - μ λ_{m} z_{0} F_{3}^{1} (z_{0})] .

(101)

Thus, we have our result

W_{\bar{N}} (x) \approx N (\frac{b λ_{m}}{\bar{N}}) \exp (- b λ_{m} - \bar{N} ω - μ \bar{N} λ_{m} z_{0} F_{3}^{1} (z_{0})) .

(102)

Since, when x → 1, z₀ → −∞ and $μ F_{3}^{1} (z_{0}) \to - a / z_{0}$ , the condition $W_{\bar{N}} (1) = 1$ implies

\begin{array}{l} W_{\bar{N}} (x) \approx \exp (- μ \bar{N} λ_{m} z_{0} (F_{3}^{1} (z_{0}) + a / z_{0})) \\ = \exp (- μ N (\frac{b_{w}}{b_{m}}) (\frac{b_{m} - d_{m}}{b_{w} - d_{w}})_{2} F_{1} (1, b; b + 1; z_{0})) . \end{array}

(103)

This recovers the result we obtained above, as expected.

It is amazing how our relatively simple scaling distribution is encoded in the immensely more complicated AK formalism. Being exact, their equation must contain all the details regarding the outer region (when m is the same order as N), must allow for arbitrary values of μ and N, and must reflect the intricate interplay of time and population size. Our approach is much less general, but has the distinct advantage of leading to answers that are simple enough for us to gain some insight into what is going on. In the last part of this section, we will see that we can employ a trick to recover our results from the AK formalism in a much simpler fashion.

5.3 An alternate method

There is an alternate way to extract our desired fixed population distribution from the AK results. We have seen that in the regime of interest, m ≪ N, the distribution is independent of N₀. If we take N₀ large however, the wild-type dynamics becomes deterministic and so the fixed time and fixed population ensembles become identical. Thus, all we need to do is to take the AK results for large N₀, and this gives the desired generating function directly. We now proceed to carry this out for the general non-symmetric case.

Following the notation in AK, we focus on

W = 1 + ω + λ_{m} Ψ (z),

(104)

with the definition

Ψ (z) \equiv \frac{z^{g} F_{3} (z) + C (1 - g) F_{2} (z) + C z F_{4} (z)}{z^{g - 1} F_{1} (z) + C F_{2} (z)},

(105)

where the F′s are the same as given above, with the constants taking the values (valid to leading order in μ):

\begin{array}{l} ω \approx μ \frac{b_{m} - d_{m}}{b_{w} - d_{w}} \frac{b_{w}}{b_{m}}; λ_{m} \approx \frac{b_{m} - d_{m}}{b_{w}}; \\ e \approx μ \frac{b_{w}^{2}}{b_{m} (b_{w} - d_{w})}; f \approx \frac{b_{w} - d_{w}}{b_{m} - d_{m}}; g \approx d + 1, \end{array}

(106)

and where the constant C is determined by the normalization condition W (t = 0) = y:

C = z_{0}^{g} \frac{κ F_{1} (z_{0}) - F_{3} (z_{0})}{(1 - f - κ z_{0}) F_{2} (z_{0}) + z_{0} F_{4} (z_{0})},

(107)

with z₀ the value of z at t = 0. Note that in general z is proportional to $e^{λ_{m} t}$ . Finally $κ \equiv - \frac{ω}{λ_{m} z_{0}} \approx - \frac{e}{z_{0}}$ . This seems to be an completely unwieldy mess. But, we get an enormous simplification if we recall that we are interested in small μ and therefore can use the leading order results F₁ = 1, F₂ = 1, and F₄ is O(μ) and hence can be dropped in the denominator, since C is also O(μ). with the definition $F_{3}^{1} (z) = F_{3}^{1} (z) / μ$ , we can derive the much simpler leading order expression

C \approx z_{0}^{g} \frac{e / z_{0} + μ F_{3}^{1} (z_{0})}{f} \equiv μ C_{1} .

(108)

Finally, we use the fact that z is small as it is related $e^{- λ_{m} t}$ which equals N₀/N raised to the positive power f. Thus, W(x) is given by

W (x) = {(1 + λ_{m} Ψ (z))}^{N_{0}} = \exp (- μ N (\frac{b_{m} - d_{m}}{b_{w}}) (d / μ + z_{0} F_{3}^{1} (z_{0}))) .

(109)

We need an explicit expression for $F_{3}^{1} (x)$ . We have

\begin{array}{l} F_{3}^{1} (x) = \frac{a}{μ} \frac{Γ (b + 1)}{Γ (b)} \sum_{n = 1}^{\infty} \frac{n Γ (n) Γ (b + n)}{n! Γ (b + n + 1)} x^{n - 1} \\ = \frac{a b}{μ} \sum_{n = 1} \frac{x^{n - 1}}{b + n} \\ = \frac{a}{μ} \frac{_{2} F_{1} (1, b; b + 1; x) - 1}{x} . \end{array}

(110)

Thus, we have the final expression

\begin{array}{l} W (x) = \exp (- μ N (\frac{b_{m} - d_{m}}{b_{w}}) (\frac{b_{w}^{2}}{b_{m} (b_{w} - d_{w})})_{2} F_{1} (1, b; b + 1; z_{0})) \\ = \exp (- μ N (\frac{b_{w}}{b_{m}}) (\frac{b_{m} - d_{m}}{b_{w} - d_{w}})_{2} F_{1} (1, b; b + 1; z_{0})), \end{array}

(111)

again reproducing what we have previously calculated.

6 Conclusions

We have analyzed here the pure birth as well as the birth-death stochastic Luria-Delbrück model in the limit of small mutation rate μ and large current population, N, where the product μN is unrestricted. The answer in all cases exhibits an intermediate asymptotic power-law decay m¹⁺^r for 1 ≪ m ≪ N, where r is the ratio of the wild-type net growth rate to the mutant net growth rate. For small μN, the distribution is monotonically decreasing, and approaches the pure power-law already for m’s of order unity. As μN increases the probability of a small number of mutants decreases sharply, and above a critical μN of order 1, the distribution develops a peak. The power-law decay sets in then only a distance past this peak. For large μN, the distribution becomes the one-sided Lévy distribution with index r. The distribution for m ≪ N is in all cases independent of the initial wide type colony size N₀. It is also independent of the precise nature of the ensemble, as long as there are sufficient numbers of wild-type individuals in all members of the ensemble with non-vanishingly small weighting. This can be guaranteed by fixing $N = \bar{N}$ at either fixed time or in a time average, or alternatively by having a large number for the number N₀ of initial wild-type individuals. The power-law tail, as well as the strong dependence of the distribution on N₀ for m ∼ N has significant implications for tasks such as using measured data to infer mutation rate [10,7]. This also has implications for the variation expected in patient responses to administered therapy, although it seems that the extrinsic variation due to different rates dominates over the intrinsic variation due to process stochasticity [12].

Acknowledgments

This work was supported by the NSF Center for Theoretical Biological Physics, (grant no. PHY-1308264). HL was also supported by CPRIT Scholar program of the State of Texas, and DK was also supported by the Israeli Science Foundation.

Footnotes

PACS 87.23.Kg,05.40.-a

Contributor Information

David A. Kessler, Email: kessler@dave.ph.biu.ac.il, Department of Physics, Bar-Ilan University, Ramat-Gan, IL52900 Israel.

Herbert Levine, Email: Herbert.Levine@rice.edu, Center for Theoretical Biological Physics, Rice University, Houston, TX 77096, USA.

References

1.Angerer WP. An explicit representation of the luria-delbrück distribution. J Math Biol. 2001;42(2):145–174. doi: 10.1007/s002850000053. [DOI] [PubMed] [Google Scholar]
2.Antal T, Krapivsky P. Exact solution of a two-type branching process: models of tumor progression. J of Stat Mech, Theory and Experiment. 2011;2011(8):P08, 018. [Google Scholar]
3.Clayton E, Doupé DP, Klein AM, Winton DJ, Simons BD, Jones PH. A single type of progenitor cell maintains normal epidermis. Nature. 2007;446(7132):185–189. doi: 10.1038/nature05574. [DOI] [PubMed] [Google Scholar]
4.Coldman AJ, Goldie JH. A stochastic model for the origin and treatment of tumors containing drug-resistant cells. Bull Math Biol. 1986;69(3):279–292. doi: 10.1007/BF02459682. [DOI] [PubMed] [Google Scholar]
5.Dewanji A, Luebeck EG, Moolgavkar SH. A generalized luria-delbrück model. Math Biosci. 2005;197(2):140–152. doi: 10.1016/j.mbs.2005.07.003. [DOI] [PubMed] [Google Scholar]
6.Diaz LA, Jr, Willians RT, Wu J, Kinde I, Hecht JR, Berlin J, Allen B, Bozic I, Reiter JG, Nowak MA, Kinzler KW, Oliner KS, Vogelstein B. The molecular evolution of acquired resistance to targeted EGFR blockade in colorectal cancers. Nature. 2012;486(7404):537–540. doi: 10.1038/nature11219. [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Gerrish P. A simple formula for obtaining markedly improved mutation rate estimates. Genetics. 2008;180(3):1773–1778. doi: 10.1534/genetics.108.091777. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Iwasa Y, Nowak MA, Michor F. Evolution of resistance during clonal expansion. Genetics. 2006;172(4):2557–2566. doi: 10.1534/genetics.105.049791. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Kashdan E, Pareschi L. Mean field mutation dynamics and the continuous luria-delbrück distribution. Math Biosci. 2012;240(2):223–230. doi: 10.1016/j.mbs.2012.08.001. [DOI] [PubMed] [Google Scholar]
10.Kepler TB, Oprea M. Improved inference of mutation rates: 1. an integral representation for the luria-delbrück distribution. Theor Pop Biol. 2001;59(1):41–48. doi: 10.1006/tpbi.2000.1498. [DOI] [PubMed] [Google Scholar]
11.Kessler DA, Levine H. Large population solution of the stochastic luria-delbrück evolution model. Proc Nat Acad Sci. 2013;110687(29):11, 682–11. doi: 10.1073/pnas.1309667110. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Kessler DA, Levine H, Austin R. Resistance to chemotherapy: Patient variability and cellular heterogeneity. Cancer Res. doi: 10.1158/0008-5472.CAN-14-0118. [DOI] [PubMed] [Google Scholar]
13.Komarova N. Stochastic modeling of drug resistance in cancer. J Theor Biol. 2006;239(3):351–366. doi: 10.1016/j.jtbi.2005.08.003. [DOI] [PubMed] [Google Scholar]
14.Lea DE, Coulson CA. The distribution of the numbers of mutants in bacterial populations. J Genetics. 1949;49(3):264–285. doi: 10.1007/BF02986080. [DOI] [PubMed] [Google Scholar]
15.Lenski RE, Slatkin M, Ayala FJ. Mutation and selection in bacterial populations: alternatives to the hypothesis of directed mutation. Proc Nat Acad Sci. 1989;86(8):2775–2778. doi: 10.1073/pnas.86.8.2775. [DOI] [PMC free article] [PubMed] [Google Scholar]
16.Luria SE, Delbrück M. Mutation of bacteria from virus sensitivity to virus resistance. Genetics. 1943;28(6):491–511. doi: 10.1093/genetics/28.6.491. [DOI] [PMC free article] [PubMed] [Google Scholar]
17.Mandelbrot B. A population birth-and-mutation process, i: Explicit distributions for the number of mutants in an old culture of bacteria. J Appl Prob. 1974;11(3):437–444. [Google Scholar]
18.Moolgavkar SH. Carcinogenesis modeling: from molecular biology to epidemiology. Ann Rev Publ Health. 1986;7(1):151–169. doi: 10.1146/annurev.pu.07.050186.001055. [DOI] [PubMed] [Google Scholar]
19.Tomasetti C, Levy D. An elementary approach to modeling drug resistance in cancer. Math Biosci Eng. 2010;7(4):905–918. doi: 10.3934/mbe.2010.7.905. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Zheng Q. Progress of a half-century in the study of the luria-delbrück distribution. Math Biosci. 1999;162(1):1–32. doi: 10.1016/s0025-5564(99)00045-0. [DOI] [PubMed] [Google Scholar]

[R1] 1.Angerer WP. An explicit representation of the luria-delbrück distribution. J Math Biol. 2001;42(2):145–174. doi: 10.1007/s002850000053. [DOI] [PubMed] [Google Scholar]

[R2] 2.Antal T, Krapivsky P. Exact solution of a two-type branching process: models of tumor progression. J of Stat Mech, Theory and Experiment. 2011;2011(8):P08, 018. [Google Scholar]

[R3] 3.Clayton E, Doupé DP, Klein AM, Winton DJ, Simons BD, Jones PH. A single type of progenitor cell maintains normal epidermis. Nature. 2007;446(7132):185–189. doi: 10.1038/nature05574. [DOI] [PubMed] [Google Scholar]

[R4] 4.Coldman AJ, Goldie JH. A stochastic model for the origin and treatment of tumors containing drug-resistant cells. Bull Math Biol. 1986;69(3):279–292. doi: 10.1007/BF02459682. [DOI] [PubMed] [Google Scholar]

[R5] 5.Dewanji A, Luebeck EG, Moolgavkar SH. A generalized luria-delbrück model. Math Biosci. 2005;197(2):140–152. doi: 10.1016/j.mbs.2005.07.003. [DOI] [PubMed] [Google Scholar]

[R6] 6.Diaz LA, Jr, Willians RT, Wu J, Kinde I, Hecht JR, Berlin J, Allen B, Bozic I, Reiter JG, Nowak MA, Kinzler KW, Oliner KS, Vogelstein B. The molecular evolution of acquired resistance to targeted EGFR blockade in colorectal cancers. Nature. 2012;486(7404):537–540. doi: 10.1038/nature11219. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] 7.Gerrish P. A simple formula for obtaining markedly improved mutation rate estimates. Genetics. 2008;180(3):1773–1778. doi: 10.1534/genetics.108.091777. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] 8.Iwasa Y, Nowak MA, Michor F. Evolution of resistance during clonal expansion. Genetics. 2006;172(4):2557–2566. doi: 10.1534/genetics.105.049791. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] 9.Kashdan E, Pareschi L. Mean field mutation dynamics and the continuous luria-delbrück distribution. Math Biosci. 2012;240(2):223–230. doi: 10.1016/j.mbs.2012.08.001. [DOI] [PubMed] [Google Scholar]

[R10] 10.Kepler TB, Oprea M. Improved inference of mutation rates: 1. an integral representation for the luria-delbrück distribution. Theor Pop Biol. 2001;59(1):41–48. doi: 10.1006/tpbi.2000.1498. [DOI] [PubMed] [Google Scholar]

[R11] 11.Kessler DA, Levine H. Large population solution of the stochastic luria-delbrück evolution model. Proc Nat Acad Sci. 2013;110687(29):11, 682–11. doi: 10.1073/pnas.1309667110. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] 12.Kessler DA, Levine H, Austin R. Resistance to chemotherapy: Patient variability and cellular heterogeneity. Cancer Res. doi: 10.1158/0008-5472.CAN-14-0118. [DOI] [PubMed] [Google Scholar]

[R13] 13.Komarova N. Stochastic modeling of drug resistance in cancer. J Theor Biol. 2006;239(3):351–366. doi: 10.1016/j.jtbi.2005.08.003. [DOI] [PubMed] [Google Scholar]

[R14] 14.Lea DE, Coulson CA. The distribution of the numbers of mutants in bacterial populations. J Genetics. 1949;49(3):264–285. doi: 10.1007/BF02986080. [DOI] [PubMed] [Google Scholar]

[R15] 15.Lenski RE, Slatkin M, Ayala FJ. Mutation and selection in bacterial populations: alternatives to the hypothesis of directed mutation. Proc Nat Acad Sci. 1989;86(8):2775–2778. doi: 10.1073/pnas.86.8.2775. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R16] 16.Luria SE, Delbrück M. Mutation of bacteria from virus sensitivity to virus resistance. Genetics. 1943;28(6):491–511. doi: 10.1093/genetics/28.6.491. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R17] 17.Mandelbrot B. A population birth-and-mutation process, i: Explicit distributions for the number of mutants in an old culture of bacteria. J Appl Prob. 1974;11(3):437–444. [Google Scholar]

[R18] 18.Moolgavkar SH. Carcinogenesis modeling: from molecular biology to epidemiology. Ann Rev Publ Health. 1986;7(1):151–169. doi: 10.1146/annurev.pu.07.050186.001055. [DOI] [PubMed] [Google Scholar]

[R19] 19.Tomasetti C, Levy D. An elementary approach to modeling drug resistance in cancer. Math Biosci Eng. 2010;7(4):905–918. doi: 10.3934/mbe.2010.7.905. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R20] 20.Zheng Q. Progress of a half-century in the study of the luria-delbrück distribution. Math Biosci. 1999;162(1):1–32. doi: 10.1016/s0025-5564(99)00045-0. [DOI] [PubMed] [Google Scholar]

PERMALINK

Scaling solution in the large population limit of the general asymmetric stochastic Luria-Delbrück evolution process

David A Kessler

Herbert Levine

Abstract

1 Introduction

2 The symmetric pure birth model

Fig. 1.

Fig. 2.

3 The non-symmetric pure birth problem

4 General Asymmetric process with death

Fig. 3.

5 Comparison with the fixed time ensemble

5.1 Pure birth model

Fig. 4.

5.2 General Asymmetric Birth-Death Model

5.3 An alternate method

6 Conclusions

Acknowledgments

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Scaling solution in the large population limit of the general asymmetric stochastic Luria-Delbrück evolution process

David A Kessler

Herbert Levine

Abstract

1 Introduction

2 The symmetric pure birth model

Fig. 1.

Fig. 2.

3 The non-symmetric pure birth problem

4 General Asymmetric process with death

Fig. 3.

5 Comparison with the fixed time ensemble

5.1 Pure birth model

Fig. 4.

5.2 General Asymmetric Birth-Death Model

5.3 An alternate method

6 Conclusions

Acknowledgments

Footnotes

Contributor Information

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases