Evolutionary dynamics of tumor progression with random fitness values

Rick Durrett; Jasmine Foo; Kevin Leder; John Mayberry; Franziska Michor

doi:10.1016/j.tpb.2010.05.001

. Author manuscript; available in PMC: 2011 Aug 1.

Published in final edited form as: Theor Popul Biol. 2010 May 19;78(1):54–66. doi: 10.1016/j.tpb.2010.05.001

Evolutionary dynamics of tumor progression with random fitness values

Rick Durrett ^1,^*, Jasmine Foo ^2,^†, Kevin Leder ^2,^‡, John Mayberry ^1,^§,^¶, Franziska Michor ^2,^||

PMCID: PMC2929987 NIHMSID: NIHMS212822 PMID: 20488197

Abstract

Most human tumors result from the accumulation of multiple genetic and epigenetic alterations in a single cell. Mutations that confer a fitness advantage to the cell are known as driver mutations and are causally related to tumorigenesis. Other mutations, however, do not change the phenotype of the cell or even decrease cellular fitness. While much experimental effort is being devoted to the identification of the functional effects of individual mutations, mathematical modeling of tumor progression generally considers constant fitness increments as mutations are accumulated. In this paper we study a mathematical model of tumor progression with random fitness increments. We analyze a multi-type branching process in which cells accumulate mutations whose fitness effects are chosen from a distribution. We determine the effect of the fitness distribution on the growth kinetics of the tumor. This work contributes to a quantitative understanding of the accumulation of mutations leading to cancer.

Keywords: cancer evolution, branching process, fitness distribution, beneficial fitness effects, mutational landscape

1 Introduction

Tumors result from an evolutionary process occurring within a tissue (Nowell, 1976). From an evolutionary point of view, tumors can be considered as collections of cells that accumulate genetic and epigenetic alterations. The phenotypic changes that these alterations confer to cells are subjected to the selection pressures within the tissue and lead to adaptations such as the evolution of more aggressive cell types, the emergence of resistance, induction of angiogenesis, evasion of the immune system, and colonization of distant organs with metastatic growth. Advantageous heritable alterations can cause a rapid expansion of the cell clone harboring such changes, since these cells are capable of outcompeting cells that have not evolved similar adaptations. The investigation of the dynamics of cell growth, the speed of accumulating mutations, and the distribution of different cell types at various timepoints during tumorigenesis is important for an understanding of the natural history of tumors. Further, such knowledge aids in the prognosis of newly diagnosed tumors, since the presence of cell clones with aggressive phenotypes lead to less optimistic predictions for tumor progression. Finally, a knowledge of the composition of tumors allows for the choice of optimum therapeutic interventions, as tumors harboring pre-existing resistant clones should be treated differently than drug-sensitive cell populations.

Mathematical models have led to many important insights into the dynamics of tumor progression and the evolution of resistance (Goldie and Coldman, 1983 and 1984; Bodmer and Tomlinson, 1995; Coldman and Murray, 2000; Knudson, 2001; Maley and Forrest, 2001; Michor et al., 2004; Iwasa et al., 2005; Komarova and Wodarz, 2005; Michor et al., 2006; Michor and Iwasa, 2006; Frank 2007; Wodarz and Komarova, 2007; Schweinsberg, 2008; Durrett, Schmidt, and Schweinsberg, 2009). These mathematical models generally fall into one of two classes: (i) constant population size models, and (ii) models describing exponentially growing populations. Many theoretical investigations of exponentially growing populations employ multi-type branching process models (e.g., Iwasa et al., 2006; Haeno et al., 2007; Durrett and Moseley, 2009), while others use population genetic models for homogeneously mixing exponentially growing populations (e.g., Beerenwinkel et al., 2007; Durrett and Mayberry, 2010). In this paper, we focus on branching process models. In these models, cells with i ≥ 0 mutations are denoted as type-i cells, and Z_i(t) specifies the number of type-i cells at time t. Type-i cells die at rate b_i, give birth to one new type-i cell at rate a_i, and give birth to one new type-(i + 1) cell at rate u_i₊₁. Some authors (e.g., Haeno et al., 2007) consider an alternate version of our model in which mutations occur with probability μ_i₊₁ during birth events which occur at rate α_i, but the two versions are equivalent provided u_i₊₁ = α_iμ_i₊₁ and a_i = α_i (1 − μ_i₊₁). This relationship between the parameters must be kept in mind when comparing results across papers.

One biologically unrealistic aspect of this model as presented in the literature is that all type-i cells are assumed to have the same birth and death rates. This assumption describes situations during tumorigenesis in which the order of mutations is predetermined, i.e. the genetic changes can only be accumulated in a particular sequence and all other combinations of mutations lead to lethality. Furthermore, in this interpretation of the model, there cannot be any variability in phenotype among cells with the same number of mutations. In many situations arising in biology there is marked heterogeneity in phenotype even if genetically, the cells are identical (Elowitz et al., 2002; Becskei et al., 2005; Kaern et al., 2005; Feinerman et al., 2008). This variability may be driven by stochasticity in gene expression or in post-transcriptional or post-translational modifications. In this paper, we modify the branching process model so that mutations alter cell birth rates by a random amount.

An important consideration for this endeavor is the choice of the mutational fitness distribution. The exponential distribution has become the preferred candidate in theoretical studies of the genetics of adaptation. The first theoretical justification of this choice was given by Gillespie (1983, 1984), who argued that if the number of possible alleles is large and the current allele is close to the top of the rank ordering in fitness values, then extreme value theory should provide insight into the distribution of the fitness values of mutations. For many distributions including the normal, Gamma, and lognormal distributions, the maximum of n independent draws, when properly scaled, converges to the Gumbel or double exponential distribution, Λ(x) = exp(−e⁻^x). In the biological literature, it is generally noted that this class of distributions only excludes exotic distributions like the Cauchy distribution, which has no moments. However, in reality, it eliminates all distributions with P(X > x) ~ Cx⁻^α. For distributions in the domain of attraction of the Gumbel distribution, and if Y₁ > Y₂ ··· >Y_k are the k largest observations in a sample of size n, then there is a sequence of constants b_n so that the spacings Z_i = i(Y_i−Y_i₊₁)/b_n converge to independent exponentials with mean 1, see e.g., Weissman (1978). Following up on Gillespie’s work, Orr (2003) added the observation that in this setting, the distribution of the fitness increases due to beneficial mutations has the same distribution as Z₁ independent of the rank i of the wild type cell.

To infer the distribution of fitness effects of newly emerged beneficial mutations, several experimental studies were performed; for examples, see Imhoff and Schlotterer (2001), Sanjuan et al. (2004), and Kassen and Bataillon (2006). The data from these experiments is generally consistent with an exponential distribution of fitness effects. However, there is an experimental caveat that cannot be neglected (Rozen et al., 2002): if only those mutations are considered that reach 100% frequency in the population, then the exponential distribution is multiplied by the fixation probability. By this operation, a distribution with a mode at a positive value develops. In a study of a quasi-empirical model of RNA evolution in which fitness was based on secondary structures, Cowperthwaite et al. (2005) found that fitnesses of randomly selected genotypes appeared to follow a Gumbel-type distribution. They also discovered that the fitness distribution of beneficial mutations appeared exponential only when the vast majority of small-effect mutations were ignored. Furthermore, it was determined that the distribution of beneficial mutations depends on the fitness of the parental genotype (Cowperthwaite et al., 2005; MacLean and Buckling, 2009). However, since the exceptions to this conclusion arise when the fitness of the wild type cell is low, these findings do not contradict the picture based on extreme value theory.

In contrast to the evidence above, recent work of Rokyta et al. (2008) has shown that in two sets of beneficial mutations arising in the bacteriophage ID11 and in the phage φ6 – for which the mutations were identified by sequencing – beneficial fitness effects are not exponential. Using a statistical method developed by Biesal et al. (2007), they tested the null hypothesis that the fitness distribution has an exponential tail. They found that the null hypothesis could be rejected in favor of a distribution with a right truncated tail. Their data also violated the common assumption that small-effect mutations greatly outnumber those of large effect, as they were consistent with a uniform distribution of beneficial effects. A possible explanation for the bounded fitness distribution may be found in the culture conditions utilized in the experiments: they evolved ID11 on E. coli at an elevated temperature (37° C instead of 33° C). There may be a limited number of mutations that will enable ID11 to survive in increased temperatures. The latter situation may be similar to scenarios arising during tumorigenesis, where, in order to develop resistance to a drug or to progress to a more aggressive stage, the conformation of a particular protein must be changed or a certain regulatory network must be disrupted. If there is a finite, but large, number of possible beneficial mutations, then it is convenient to use a continuous distribution as an approximation.

In this paper, we consider both bounded distributions and unbounded distributions for the fitness advance and derive asymptotic results for the number of type-k individuals at time t. We determine the effects of the fitness distribution on the growth kinetics of the population, and investigate the rates of expansion for both bounded and unbounded fitness distributions. This model provides a framework to investigate the accumulation of mutations with random fitness effects.

The remainder of this section is dedicated to statements and discussion of our main results. Proofs of these results can be found in Sections 2–5.

1.1 Bounded distributions

The model we consider is a multi-type branching process in which type-i cells have accumulated i ≥ 0 advantageous mutations. All cells in the population die at rate b₀. The initial population consists entirely of type-0 cells that give birth at rate a₀ > b₀ to new type-0 cells and produce type-1 cells at rate u₁. We assume that the population of type-0 cells starts at a sufficiently large population V₀ so that we can approximate its size by Z₀(t) = V₀e^λ₀t, where λ₀ = a₀ − b₀. When a type-0 cell produces a type-1 cell, the new cell gives birth to type-1 cells at rate a₀ + x, where x ≥ 0 is drawn according to a probability distribution ν and produces type-2 cells at rate u₂. In general, a type-k cell with birth rate a produces a new type-(k + 1) cell at rate u_k and the new type-(k + 1) cell assumes an increased birth rate a + x where x ≥ 0 is drawn according to ν. We let Z_k(t) denote the total number of type-k cells in the population at time t. When we refer to the kth generation of mutants, we mean the set of all type-k cells.

We begin by considering situations in which the distribution of the increase in the birth rate is concentrated on [0, b]. In particular, suppose that ν has density g with support in [0, b] and assume that g satisfies:

g is continuous at b, g (b) > 0, g (x) \leq G for x \in [0, b]

(*)

Our first result describes the mean number of first generation mutants at time t, EZ₁(t).

Theorem 1

If (*) holds, then

{E Z}_{1} (t) \sim \frac{V_{0} u_{1} g (b)}{b t} e^{(λ_{0} + b) t}

where a(t) ~ b(t) means a(t)/b(t) → 1.

The next result shows that the actual growth rate of type-1 cells is slower than the mean. Here, and in what follows, we use ⇒ to indicate convergence in distribution.

Theorem 2

If (*) holds and p = b/λ₀, then for θ ≥ 0,

E exp (- θ t^{1 + p} e^{- (λ_{0} + b) t} Z_{1} (t)) \to exp (- V_{0} u_{1} θ^{λ_{0} / (λ_{0} + b)} c_{1} (λ_{0}, b)),

(1.1)

where c₁(λ₀, b) is a constant whose value will be given in (3.8). In particular, we have

t^{1 + p} e^{- (λ_{0} + b) t} Z_{1} t) \Rightarrow V_{1},

where V₁ has Laplace transform given by the righthand side of (1.1).

Theorem 2 is similar to Theorem 3 in Durrett and Moseley (2009) which assumes a deterministic fitness distribution so that all type-1 cells have growth rate λ₁ = λ₀ + b. There, the asymptotic growth rate of the first generation is exp(λ₁t). In contrast, the continuous fitness distribution we consider here has the effect of slowing down the growth rate of the first generation by the polynomial factor t¹⁺^p. To explain this difference, we note that the calculation of the mean given in Section 3 shows that the dominant contribution to Z₁(t) comes from growth rates x = b − O(1/t). However, mutations with this growth rate are unlikely until the number of type-0 cells is O(t), i.e., roughly at time r₁ = (1/λ₀) log t. Thus at time t, the number of type-1 cells will be roughly exp((λ₀+b)(t − r₁)) = exp((λ₀ +b)t)/t¹⁺^p.

To prove Theorem 2, we look at mutations as a point process in [0, t] × [0, b]: there is a point at (s, x) if there was a mutant with birth rate a₀ + x at time s. This allows us to derive the following explicit expression for the Laplace transform of Z₁(t):

E (e^{- θ Z_{1} (t)}) = exp (- u_{1} \int_{0}^{b} d x g (x) \int_{0}^{t} d s V_{0} e^{λ_{0} s} (1 - {\tilde{φ}}_{x, t - s} (θ)))

where ${\tilde{φ}}_{x, r} (θ) = {E e}^{- θ {\tilde{Z}}_{r}^{x}}$ and ${\tilde{Z}}_{r}^{x}$ is a continuous-time branching process with birth rate a₀+x, death rate b₀, and initial population ${\tilde{Z}}_{0}^{x} = 1$ . In Figure 1, we compare the exact Laplace transform of t¹⁺^p exp(−(λ₀+b)t)Z₁(t) with the results of simulations and the limiting Laplace transform from Theorem 2, illustrating the convergence as t → ∞.

Plot of the exact Laplace transform (LT) for t⁽¹⁺^p⁾ e^{−(λ₀+b)t} Z₁ (t) at times t = 60, 80,100,120, the approximations from Monte Carlo (MC) simulations at the corresponding times, and the asymptotic Laplace transform from Theorem 2. Parameter values: a₀ = 0.2, b₀ = 0.1, b = 0.01, and u₁ = 10⁻³. g is uniform on [0, .01].

Notice that the Laplace transform of V₁ has the form exp(C θ^α) where α = λ₀/(λ₀ + b) which implies that P(V₁ > v) ~ v⁻^α as v → ∞ (see, for example, the argument in Section 3 of Durrett and Moseley (2009)). To gain some insight into how this limit comes about, we give a second proof of the convergence that tells us the limit is the sum of points in a nonhomogeneous Poisson process. Each point in the limiting process represents the contribution of a different mutant lineage to Z₁(t). More precisely, we define a three dimensional point process Inline graphic (t) on [0, t] × [0, b] × (0, ∞) by the following rule: there is a point at (s, x, v) if there was a type-1 mutant with birth rate a₀ + x at time s and the number of its type-1 descendants at time t, $Z_{1}^{s, x} (t)$ , has $e^{- (λ_{0} + x) (t - s)} Z_{1}^{s, x} (t) \to v$ as t → ∞ with v > 0. We define F : [0, ∞)³ → [0, ∞) by

F (s, x, v) = v t^{1 + p} e^{- (λ_{0} + b) t} e^{(λ_{0} + x) (t - s)}

i.e. F maps a point in Inline graphic (t) onto its contribution to V₁ = lim_t_→∞ t¹⁺^pe^{−(λ₀+b)t}Z₁(t).

Theorem 3

As t → ∞, F( Inline graphic (t)) ⇒ Λ where Λ is a Poisson process on (0, ∞) with mean measure μ(z, ∞) = A₁(λ₀, b)u₁V₀z^{−λ₀/(λ₀+b)} and A₁(λ₀, b) is a constant whose value is given in (3.9). In particular, V₁ = lim_t→∞ t^1+pe^{−(λ₀+b)t}Z₁(t) is the sum of the points in Λ.

A similar result can be obtained for deterministic fitness distributions, see the Corollary to Theorem 3 in Durrett and Moseley (2009). However, the new result shows that the point process limit is not an artifact of assuming that all first generation mutants have the same growth rate. Even when the fitness advances are random, different mutant lines contribute to the limit. This result is consistent with observations of Maley et al. (2006) and Shah et al. (2009) that tumors contain cells with different mutational haplotypes. Theorem 3 also gives quantitative predictions about the relative contribution of different mutations to the total population. These implications will be explored further in a follow-up paper currently in progress.

With the behavior of the type-1 individuals analyzed, we are ready to proceed to the study of type-k individuals. The computation of the mean is straightforward.

Theorem 4

If (*) holds, then

{E Z}_{k} (t) \sim \frac{V_{0} \cdot u_{1} \dots u_{k} \cdot g {(b)}^{k}}{t^{k} b^{k} k!} e^{(λ_{0} + k b) t}

As in the k = 1 case, the mean involves a polynomial correction to the exponential growth and again, does not give the correct growth rate for the number of type-k cells. To state the correct limit theorem describing the growth rate of Z_k(t), we will define p_k and u_1,_k by

k + p_{k} = \sum_{j = 0}^{k - 1} \frac{λ_{0} + k b}{λ_{0} + j b} and u_{1, k} = \prod_{j = 1}^{k} u_{j}^{λ_{0} / (λ_{0} + (j - 1) b)}

for all k ≥ 1.

Theorem 5

If (*) holds, then for θ ≥ 0

E exp (- θ t^{k + p_{k}} e^{- (λ_{0} + k b) t} Z_{k} (t)) \to exp (- c_{k} (λ_{0}, b) V_{0} u_{1, k} θ^{λ_{0} / (λ_{0} + k b)})

(1.2)

where c_k(λ₀, b) is a constant whose value will be given in (4.9). In particular, we have

t^{k + p_{k}} e^{- (λ_{0} + k b) t} Z_{k} (t) \Rightarrow V_{k},

where V_k has Laplace transform given by the righthand side of (1.2).

If we let $Z_{k}^{s, x, v} (t)$ be the number of type-k descendants at time t of the 1 mutant at (s, x, v) ∈ Inline graphic (t) where (t) is the three dimensional point process described in the paragraph preceding Theorem 3, then $Z_{k}^{s, x, v}$ is the same as a process in which the initial type (here type-1 cells) behaves like ve^{(λ₀ + x)(t − s)} instead of Z₀(t) = V₀e^λ₀t. Therefore, Theorem 5 can be proved by induction. To explain the form of the result we consider the case k = 2. Breaking things down according to the times and the sizes of the mutational changes, we have

{E Z}_{2} (t) = \int_{0}^{b} d x_{1} g (x_{1}) \int_{0}^{b} d x_{2} g (x_{2}) \int_{0}^{t} d s_{1} \int_{s_{1}}^{t} d s_{2} V_{0} e^{λ_{0} s_{1}} u_{1} e^{(λ_{0} + x_{1}) (s_{2} - s_{1})} u_{2} e^{(λ_{0} + x_{1} + x_{2}) (t - s_{2})}

As in the result for Z₁(t) the dominant contribution comes from x₁, x₂ = b − O(1/t) and as in the discussion preceding the statement of Theorem 2, the time of the first mutation to b − O(1/t) is ≈ r₁ = (log t)/λ₀. The descendants of this mutation grow at exponential rate λ₀ + b − O(1/t), so the time of the first mutation to 2b − O(1/t) is ≈ r₂ = r₁ + (log t)/(λ₀ + b). Noticing that

exp ((λ_{0} + 2 b) (t - r_{1} - r_{2})) = exp ((λ_{0} + 2 b) t) t^{- (λ_{0} + 2 b) / λ_{0} - (λ_{0} + 2 b) / (λ_{0} + b)}

tells us what to guess for the polynomial term: t^−(2+p₂) where

2 + p_{2} = \frac{λ_{0} + 2 b}{λ_{0}} + \frac{λ_{0} + 2 b}{λ_{0} + b}

In Figure 2, we compare the asymptotic Laplace transform from Theorem 5 with the results of simulations in the case k = 2. To explain the slow convergence to the limit, we note that if we take account of the mutation rates u₁, u₂ in the heuristic from the previous paragraph (which becomes important when u₁, u₂ are small), then the first time we see a type-1 cell with growth rate b − O(1/t) will not occur until time $λ_{0}^{- 1} log (t / u_{1})$ when the type-0 cells reach O(t/u₁) and so the first type-2 cell with growth rate 2b − O(1/t) will not be born until time $r = λ_{0}^{- 1} log (t / u_{1}) + {(λ_{0} + b)}^{- 1} log (t / u_{2})$ when the descendants of the type-1 cells with growth rate b − O(1/t) reach size O(t/u₂). When u₁ = u₂ = 10⁻³, λ₀ = .1, and b = .01, r ≈ 223. The mutations created at this point will need some time to grow and become dominant in the population. It would be interesting to compare simulations at time 300, but we have not been able to do this due to the large number of different growth rates in generation 1.

Plot of the approximations to the Laplace transform of t^2+p₂e^{−(λ₀+2b)t} Z₂(t) from Monte Carlo (MC) simulations at times t = 80,100,120 along with the asymptotic Laplace transform from Theorem 5. Parameter values: a₀ = 0.2, b₀ = 0.1, b = 0.01, and u₁ = u₂ = 10⁻³. g is uniform on [0, 0.01].

1.2 Unbounded distributions

In this section, we consider situations in which the fitness distribution is unbounded. We will suppose that the fitness distribution ν has tail

ν (x, \infty) \sim K x^{β} exp (- γ x^{α})

(1.3)

as x → ∞ for some α, γ, K > 0, and β ∈ ℝ. Our assumption (1.3) on the tail of ν is satisfied by a number of natural distributions including the gamma(β +1, γ) distribution which has α = 1 (and includes the exponential distribution as the special case β = 0) and the normal distribution which has α = 2, β = −1.

To analyze this situation, we will again take a Poisson process viewpoint and look at the contribution from a mutation at time s with increased growth rate x. A mutation that increases the growth rate by x at time s will, if it does not die out, grow to e^{(λ₀ + x)(t − s)} ζ at time t where ζ has an exponential distribution. The growth rate (λ₀ + x)(t − s) ≥ z when

x \geq \frac{z}{t - s} - λ_{0} .

Therefore,

\begin{array}{l} μ (z, \infty) \equiv E (# mutations with (λ_{0} + x) (t - s) \geq z) \\ = V_{0} u_{1} \int_{0}^{t} e^{λ_{0} s} ν (z / (t - s) - λ_{0}, \infty) d s \\ = K V_{0} u_{1} \int_{0}^{t} e^{λ_{0} s} q (z / (t - s) - λ_{0}) {(\frac{z}{t - s} - λ_{0})}^{β} exp (- γ {(\frac{z}{t - s} - λ_{0})}^{α}) d s \\ = K V_{0} u_{1} \int_{0}^{t} q (z / (t - s) - λ_{0}) {(\frac{z}{t - s} - λ_{0})}^{β} exp (φ (s, z)) d s \end{array}

where

q (x) = \frac{ν (x, \infty)}{K x^{β} exp (- γ x^{α})} \to 1

(1.4)

as x → ∞ and

φ (s, z) = λ_{0} s - γ {(\frac{z}{t - s} - λ_{0})}^{α} .

(1.5)

The size of this integral can be found by maximizing the exponent φ over s for fixed z. Since

\frac{\partial φ}{\partial s} (s, z) = λ_{0} - α γ {(\frac{z}{t - s} - λ_{0})}^{α - 1} \frac{z}{{(t - s)}^{2}}

(1.6)

and

\frac{\partial^{2} φ}{\partial s^{2}} (s, z) = - α (α - 1) γ {(\frac{z}{t - s} - λ_{0})}^{α - 2} \frac{z^{2}}{{(t - s)}^{4}} - α γ {(\frac{z}{t - s} - λ_{0})}^{α - 1} \frac{2 z}{{(t - s)}^{3}}

(1.7)

we can see that ∂² φ/∂s²(s, z) < 0 when αz > λ₀(t − s) so that for all z in this range, φ(s, z) is concave as a function of s and achieves its maximum at a unique value s_z.

When α = 1, it is easy to set (1.6) to 0 and solve for s_z. This in turn leads to an asymptotic formula for μ(z, ∞) and allows us to derive the following limit theorem for Z₁(t).

Theorem 6

Suppose α = 1 and let c₀ = λ₀/4γ. Then t⁻² log Z₁(t) → c₀ and

\frac{1}{t} [log Z_{1} (t) - c_{0} t^{2} (1 + \frac{(2 β + 1) log t}{λ_{0} t})] \Rightarrow y^{*}

where y* is the rightmost point in the point process with intensity given by

{(2 c_{0})}^{β} {(π / λ_{0})}^{1 / 2} K V_{0} u_{1} exp (γ λ_{0} - λ_{0} y / 2 c_{0}) .

(1.8)

When α ≠ 1, solving for s_z becomes more difficult, but we are still able to prove the following limit theorem for Z₁(t).

Theorem 7

For any integer α > 1, there exist explicitly calculable constants c_k = c_k(α, γ), 0 ≤ k < α, and κ = κ(β, α, γ) so that t^−(α+1)/α log Z₁(t) → c₀ and

\frac{1}{t^{1 / α}} [log Z_{1} (t) - c_{0} t^{(α + 1) / α} (1 + \sum_{1 \leq k < α} c_{k} t^{- k / α} + κ \frac{log t}{t})] \Rightarrow y^{*}

where y* is the rightmost particle in a point process with explicitly calculable intensity.

The complicated form of the result is due to the fact that the fluctuations are only of order t^1/^α so we have to be very precise in locating the maximum. The explicit formulas for the constants and the intensity of the point process are given in (5.11) and (5.12). With more work this result could be proved for general α > 1, but we have not tried to do this or prove Conjecture 1 below because the super-exponential growth rates in the unbounded case are too fast to be realistic.

We conclude this section with two comments. First, the proof of Theorem 7 shows that in contrast to the bounded case, in the unbounded case, most type-1 individuals are descendants of a single mutant. Second, the proof shows that the distribution of the mutant with the largest growth rate is born at time s ~ t/(α + 1) (see Remark 1 at the end of Section 5) and has growth rate z = O(t⁽^α^+1)/^α). The intuition behind this is that since the type-0 cells have growth rate e^λ₀s and the distribution of the increase in fitness has tail ≈ e^{−γx^α}, the largest advance x attained by time t should occur when s = O(t) and satisfy

e^{C λ_{0} t} e^{- γ x^{α}} = O (1) or x = O (t^{1 / α}) .

The growth rate of its family is then (λ₀ + x)(t − s) = O(t⁽^α^+1)/^α).

Since the type-1 cells grow at exponential rate c₁t⁽^α^+1)/^α, if we apply this same reasoning to type-2 mutants, then the largest additional fitness advance x attained by type-2 individuals should satisfy

e^{c_{1} t (α + 1) / α} e^{- γ x^{α}} = O (1) or x = O (t^{1 / α + 1 / α^{2}}) .

and the growth rate of its family will be O(t^{1+1/α+1/α²}). Extrapolating from the first two generations, we make the following

Conjecture 1

Let $q (k) = \sum_{j = 0}^{k} α^{- j}$ . As t → ∞,

\frac{1}{t^{q (k)}} log Z_{k} (t) \to c_{k}

Note that in the case of the exponential distribution, q(k) = k + 1.

The rest of the paper is organized as follows. Sections 2–5 are devoted to proofs of our main results. After some preliminary notation and definitions in Section 2, Theorems 1–3 are proved in Section 3, Theorems 4–5 in Section 4, and Theorems 6–7 in Section 5. We conclude with a discussion of our results in Section 6.

2 Preliminaries

This section contains some preliminary notation and definitions which we will need for the proofs of our main results. We denote by Inline graphic (t) the points in a two-dimensional Poisson process on [0, t] × [0, ∞) with mean measure

V_{0} e^{λ_{0} s} d s ν (d x),

where in Sections 3–4, ν(dx) = g(x)dx with g satisfying (*) and in Section 5, ν has tail satisfying (1.3). In other words, we have a point at (s, x) if there was a mutant with birth rate a₀ + x at time s. Define a collection of independent birth/death branching processes $Z_{1}^{s, x} (t)$ indexed by (s, x) ∈ Inline graphic (t) with $Z_{1}^{s, x} (s) = 1$ , individual birth rate a₀ + x, and death rate b. $Z_{1}^{s, x} (t)$ is the contribution of the mutation at (s, x) and

Z_{1} (t) = \sum_{(s, x) \in N (t)} Z_{1}^{s, x} (t) .

It is well known that

e^{- (λ_{0} + x) (t - s)} Z_{1}^{s, x} (t) \to \frac{b}{a_{0} + x} δ_{0} + \frac{λ_{0} + x}{a_{0} + x} ζ,

where ζ ~ exp((λ₀ + x)/(a₀ + x)) (see, for example, equation (1) in Durrett and Moseley (2009)). In several results, we shall make use of the three-dimensional Poisson process Inline graphic (t) on [0, t] × [0, ∞) × (0, ∞) with intensity

V_{0} e^{λ_{0} s} ν (d x) {(\frac{λ_{0} + x}{a_{0} + x})}^{2} e^{- v (λ_{0} + x) / (a_{0} + x)} d v .

In words, (s, x, v) ∈ Inline graphic (t) if there was a mutant with birth rate a₀ + x at time s and the number of its descendants at time t, $Z_{1}^{s, x} (t)$ , has $Z_{1}^{s, x} (t) \sim v e^{(λ_{0} + x) (t - s)}$ . It is also convenient to define the mapping z: [0, ∞) × [0, t] → [0, ∞) which maps a point (s, x) ∈ Inline graphic (t) to the growth rate of the induced branching process if it survives: z(s, x) = (λ₀ + x)(t − s) and let

μ (A) = E ∣ {(s, x) \in N (t) : z (s, x) \in A} ∣

for A ⊂ [0, ∞).

We shall use C to denote a generic constant whose value may change from line to line. We write f(t) ~ g(t) if f(t)/g(t) → 1 as t → ∞ and f(t) = o(g(t)) is f(t)/g(t) → 0. f(t) ≫ (≪)g(t) means that f(t)/g(t) → ∞ (resp. 0) as t → ∞ and f(t) = O(g(t)) means |f(t)| ≤ Cg(t) for all t > 0. We also shall use the notation f(t) ≃ g(t) if f(t) − g(t) → 0 as t → ∞.

3 Bounded distributions, Z₁

In this section, we prove Theorems 1–3.

Proof of Theorem 1

Mutations to type-1 cells occur at rate V₀u₁e^λ₀s so

\begin{array}{l} {E Z}_{1} (t) = u_{1} \int_{0}^{t} \int_{0}^{b} e^{(t - s) (λ_{0} + x)} g (x) d x V_{0} e^{λ_{0} s} d s \\ = u_{1} V_{0} e^{λ_{0} t} \int_{0}^{b} d x g (x) \int_{0}^{t} e^{(t - s) x} d s \\ = u_{1} V_{0} e^{λ_{0} t} \int_{0}^{b} d x g (x) \frac{e^{t x} - 1}{x} . \end{array}

(3.1)

We begin by showing that the contribution from x ∈ [0, b − (1 + k) (log t)/t] can be ignored for any k ∈ [0, ∞). The Mean Value theorem implies that

\frac{e^{t x} - 1}{x} \leq t e^{t x} .

(3.2)

Using this and the fact that $\int_{c}^{d} t e^{t x} d x \leq e^{t d}$ for any c < d, we can see that

t^{k} e^{- b t} \int_{0}^{b - (1 + k) (log t) / t} d x g (x) \frac{e^{t x} - 1}{x} \cdot \leq G t^{k} e^{- (1 + k) log t} \to 0

(3.3)

To handle the other piece of the integral, we take k = 1 and note that

\int_{b - (2 log t) / t}^{b} d x g (x) \frac{e^{t x} - 1}{x} \sim \frac{g (b)}{b} e^{b t} \int_{b - 2 log t / t}^{b} e^{t (x - b)} d x .

After changing variables y = (b − x)t, dx = −dy/t, the last integral

= \frac{1}{t} \int_{0}^{2 log t} e^{- y} d y \sim 1 / t,

which proves the result.

The above proof tells us that the dominant contribution to the type-1 cells comes from mutations with fitness increase x ≥ b_t = b − 2log t/t. To describe the times at which the dominant contributions occur, let S(t) = (2/b) log log t. Then the contribution to the mean from x ∈ [b_t, b] and s ≥ S(t) is by (3.1)

\begin{array}{l} \leq G u_{1} V_{0} e^{(λ_{0} + b) t} \frac{2 (log t)}{t} \int_{S (t)}^{\infty} e^{- s b_{t}} d s \\ \leq G u_{1} V_{0} e^{(λ_{0} + b) t} \frac{2 (log t)}{t b_{t}} e^{- b_{t} S (t)} . \end{array}

Since b_tS(t) ≥ (3/2) log log t for all t sufficiently large, this quantity is o(t⁻¹e^(λ₀+b)t). In words, the dominant contribution to the mean comes from points close to (0, b) or more precisely from [0, (2/b) log log t] × [b − (2 log t)/t, b].

Proof of Theorem 2

It suffices to prove (1.1). The computation in (3.3) with k = 2 + p implies that the contribution from mutations with x ≤ b_t = b − (3 + p)(log t)/t can be ignored. Therefore, we have

E exp (- θ Z_{1} (t) e^{- t (λ_{0} + b)} t^{1 + p}) ≃ E (exp (- θ Z_{1} (t) e^{- t (λ_{0} + b)} t^{1 + p}); A_{t})

where A_t = {(s, x) ∈ Inline graphic (t): x > b_t}. Lemma 2 of Durrett and Moseley (2009) implies that

E (e^{- θ Z_{1} (t)}; A_{t}) = exp (- u_{1} \int_{b_{t}}^{b} d x g (x) \int_{0}^{t} d s V_{0} e^{λ_{0} s} (1 - {\tilde{φ}}_{x, t - s} (θ)))

where ${\tilde{φ}}_{x, r} (θ) = {E e}^{- θ {\tilde{Z}}_{r}^{x}}$ and ${\tilde{Z}}_{r}^{x}$ is a birth/death branching process with birth rate a₀ + x, death rate b₀, and initial population ${\tilde{Z}}_{0}^{x} = 1$ . Using

e^{- (λ_{0} + b) t} = e^{- (λ_{0} + x) (t - s)} e^{- (λ_{0} + x) s} e^{- (b - x) t}

(3.4)

we have

E (exp (- θ Z_{1} (t) e^{- t (λ_{0} + b)} t^{1 + p}); A_{t}) = exp (- u_{1} V_{0} \int_{b_{t}}^{b} d x g (x) \int_{0}^{t} d s e^{λ_{0} s} {1 - {\tilde{φ}}_{x, t - s} (θ e^{- (λ_{0} + x) (t - s)} e^{- (λ_{0} + x) s} e^{- (b - x) t} t^{1 + p})})

Changing variables s = r_x + r where $r_{x} = \frac{1}{λ_{0} + x} log (t^{1 + p})$ on the inside integral, y = (b − x)t, dy/t = −dx on the outside, and continuing to write x as short hand for b − y/t, the above

= exp (- u_{1} V_{0} \int_{0}^{(3 + p) log t} \frac{d y}{t} g (x) t^{(1 + p) λ_{0} / (λ_{0} + x)} \int_{- r_{x}}^{t - r_{x}} d r e^{λ_{0} r} {1 - {\tilde{φ}}_{x, t - r - r_{x}} (θ e^{- (λ_{0} + x) (t - r - r_{x})} e^{- (λ_{0} + x) r} e^{- y})})

(3.5)

Formula (20) in Durrett and Moseley (2009) implies that as u → ∞,

1 - {\tilde{φ}}_{x, u} (θ e^{- (λ_{0} + x) u}) \to \frac{λ_{0} + x}{a_{0} + x} \cdot \frac{θ}{θ + \frac{λ_{0} + x}{a_{0} + x}}

(3.6)

and therefore, letting t → ∞ and using (1 + p) λ₀/(λ₀ + b) = 1, we can see that the expression in (3.5)

\to exp (- u_{1} V_{0} g (b) \int_{0}^{\infty} d y \frac{λ_{0} + b}{a_{0} + b} \int_{- \infty}^{\infty} d r e^{λ_{0} r} \frac{θ e^{- (λ_{0} + b) r} e^{- y}}{θ e^{- (λ_{0} + b) r} e^{- y} + \frac{λ_{0} + b}{a_{0} + b}})

Changing variables $r = \frac{1}{λ_{0} + b} {q + log [θ e^{- y} (a_{0} + b) / (λ_{0} + b)]}$ , dr = dq/(λ₀ + b) gives

= exp (- u_{1} V_{0} g (b) θ^{λ_{0} / (λ_{0} + b)} {(\frac{λ_{0} + b}{a_{0} + b})}^{b / (λ_{0} + b)} \int_{0}^{\infty} d y e^{- y λ_{0} / (λ_{0} + b)} \int_{- \infty}^{\infty} \frac{d q}{λ_{0} + b} e^{q λ_{0} / (λ_{0} + b)} \frac{e^{- q}}{e^{- q} + 1})

To simplify the first integral we note that

\int_{0}^{\infty} d y e^{- y λ_{0} / (λ_{0} + b)} = \frac{λ_{0} + b}{λ_{0}}

For the second integral, we prove

Lemma 1

If 0 < c < 1

\int_{- \infty}^{\infty} d q e^{q c} \frac{e^{- q}}{e^{- q} + 1} = Γ (c) Γ (1 - c)

(3.7)

Proof

We can rewrite the integral as

\int_{- \infty}^{\infty} d q e^{q c} \int_{0}^{\infty} d x e^{- x} e^{- q} exp (- e^{- q} x)

so that after interchanging the order of integration and changing variables w = e⁻^qx, dw = −dqe⁻^qx so that w/x = e⁻^q, dw/x = −dqe⁻^q, we have

= \int_{0}^{\infty} d x \int_{0}^{\infty} \frac{d w}{x} {(w / x)}^{- c} e^{- x} e^{- w} = \int_{0}^{\infty} d x x^{- 1 + c} e^{- x} \int_{0}^{\infty} d w w^{- c} e^{- w}

which is = Γ(c)Γ(1 − c).

Taking c = λ₀/(λ₀ + b) and letting

c_{1} (λ_{0}, b) = g (b) \frac{λ_{0} + b}{λ_{0}} \cdot \frac{1}{λ_{0} + b} {(\frac{a_{0} + b}{λ_{0} + b})}^{- b / (λ_{0} + b)} Γ (λ_{0} / (λ_{0} + b)) Γ (1 - λ_{0} / (λ_{0} + b))

(3.8)

we have proved Theorem 2.

Recall that we have assumed Z₀(t) = V₀e^λ₀t is deterministic. This assumption can be relaxed to obtain the following generalization of Theorem 2 which is used in Section 4.

Lemma 2

Suppose that Z₀(t) is a stochastic process with Z₀(t) ~ e^λ₀tV₀ for some constant V₀ as t → ∞. Then the conclusions of Theorem 2 remain valid.

To see why this is true, we can use a variant of Lemma 2 from Durrett and Moseley (2009) to conclude that

E (e^{- θ Z_{1} (t)} ∣ F_{t}^{0}) = exp (- u_{1} \int_{0}^{b} d x g (x) \int_{0}^{t} d s Z_{0} (s) (1 - {\tilde{φ}}_{x, t - s} (θ))),

where $F_{t}^{0}$ is the σ-field generated by Z₀(s) for s ≤ t. Therefore,

E (e^{- θ Z_{1} (t)}) = E exp (- u_{1} \int_{0}^{b} d x g (x) \int_{0}^{t} d s Z_{0} (s) (1 - {\tilde{φ}}_{x, t - s} (θ))),

Given ε > 0, we can choose t_ε > 0 so that

| \frac{Z_{0} (t)}{V_{0} exp (λ_{0} t)} - 1 | < ε

for all t > t_ε. Since the contribution from t ≤ t_ε will not affect the limit and the term inside the expectation is bounded, the rest of the proof can be completed in the same manner as the proof of Theorem 2.

We conclude this section with the

Proof of Theorem 3

Let Inline graphic (t) be the three dimensional Poisson process defined in Section 2. Recall that

F (s, x, v) = v t^{1 + p} e^{- (λ_{0} + b) t} e^{(λ_{0} + x) (t - s)}

i.e. F maps a point in Inline graphic (t) to its contribution to the limit t^1+pe^{−(λ₀+b)t}Z₁(t). Using (3.4), we see that in order to have F(s, x, v) > z we need

v > z t^{- (1 + p)} e^{(b - x) t} e^{(λ_{0} + x) s}

Therefore, the expected number of mutations that contribute more than z to the limit is

u_{1} V_{0} \int_{0}^{b} d x g (x) \int_{0}^{t} d s e^{λ_{0} s} \frac{λ_{0} + x}{a_{0} + x} exp (- \frac{λ_{0} + x}{a_{0} + x} \cdot z t^{- (1 + p)} e^{(b - x) t} e^{(λ_{0} + x) s})

The exponential term can be simplified by making the change of variables

s = \frac{1}{λ_{0} + x} log (\frac{r}{z t^{- (1 + p)} e^{(b - x) t} \frac{λ_{0} + x}{a_{0} + x}}),

ds = dr/r(λ₀ + x) yielding the equivalent expression

u_{1} V_{0} \int_{0}^{b} d x g (x) z^{- λ_{0} / (λ_{0} + x)} {(\frac{λ_{0} + x}{a_{0} + x})}^{x / (λ_{0} + x)} \cdot t^{(1 + p) λ_{0} / (λ_{0} + x)} e^{- (b - x) t λ_{0} / (λ_{0} + x)} \int_{α (x, t)}^{β (x, t)} \frac{d r}{λ_{0} + x} r^{- x / (λ_{0} + x)} e^{- r}

where α(x, t) = zt⁻⁽¹⁺^p⁾e⁽^b⁻^x⁾^t(λ₀ + x)/(a₀ + x) and β(x, t) = α(x, t) e^(λ₀+x)t. As in the previous proof, the main contribution comes from x ∈ [b_t, b] so when we change variables y = (b − x)t, dx = −dy/t, replace the x’s by b’s and use 1 = (1 + p)λ₀/(λ₀ + b) we convert the above into

g (b) z^{- λ_{0} / (λ_{0} + b)} \frac{u_{1} V_{0}}{λ_{0} + b} {(\frac{λ_{0} + b}{a_{0} + b})}^{b / (λ_{0} + b)} \int_{0}^{\infty} d y e^{- y λ_{0} / (λ_{0} + b)} \int_{0}^{\infty} r^{- b / (λ_{0} + b)} e^{- r} d r

Performing the integrals gives the result with

A_{1} (λ_{0}, b) = g (b) \frac{1}{λ_{0}} {(\frac{λ_{0} + b}{a_{0} + b})}^{b / (λ_{0} + b)} Γ (λ_{0} / (λ_{0} + b))

(3.9)

4 Bounded distributions, Z_k

We now move on to the proofs of Theorems 4 and 5. Recall that we have defined p_k by the relation

k + p_{k} = \sum_{j = 0}^{k - 1} \frac{λ_{0} + k b}{λ_{0} + j b} .

Proof of Theorem 4

Breaking things down according to the times and the sizes of the mutational changes we have

\begin{array}{l} {E Z}_{k} (t) = \int_{0}^{b} d x_{1} g (x_{1}) \dots \int_{0}^{b} d x_{k} g (x_{k}) \int_{0}^{t} d s_{1} \dots \int_{s_{k - 1}}^{t} d s_{k} V_{0} e^{λ_{0} s_{1}} u_{1} e^{(λ_{0} + x_{1}) (s_{2} - s_{1})} \dots u_{k} e^{(λ_{0} + x_{1} + \dots + x_{k}) (t - s_{k})} \\ = \int_{0}^{b} d x_{1} g (x_{1}) \dots \int_{0}^{b} d x_{k} g (x_{k}) \int_{0}^{t} d s_{1} \dots \int_{s_{k - 1}}^{t} d s_{k} V_{0} u_{1} \dots u_{k} e^{λ_{0} t} e^{x_{1} (t - s_{1})} \dots e^{x_{k} (t - s_{k})} . \end{array}

(4.1)

The first step is to show

Lemma 3

Let b_t = b − (2k + 1)(log t)/t. The contribution to EZ_k(t) from points (x₁,…x_k) with some xi ≤ b_t is o(t^−2ke^(λ₀+kb)t).

Proof

(3.2) implies that

\int_{s_{j - 1}}^{t} d s_{j} e^{(x_{j} + \dots + x_{k}) (t - s_{j})} = \frac{e^{(x_{j} + \dots + x_{k}) (t - s_{j - 1})} - 1}{x_{j} + \dots + x_{k}} \leq t e^{(x_{j} + \dots + x_{k}) (t - s_{j - 1})} .

Applying this and working backwards in the above expression for EZ_k(t), we get

{E Z}_{k} (t) \leq t^{k} V_{0} u_{1} \dots u_{k} \int_{0}^{b} d x_{1} g (x_{1}) \dots \int_{0}^{b} d x_{k} g (x_{k}) e^{(λ_{0} + x_{1} + \dots + x_{k}) t}

and the desired result follows.

With the Lemma established, when we work backwards

\int_{s_{j - 1}}^{t} d s_{j} e^{(x_{j} + \dots + x_{k}) (t - s_{j})} = \frac{e^{(x_{j} + \dots + x_{k}) (t - s_{j - 1})} - 1}{x_{j} + \dots + x_{k}} \sim \frac{e^{(x_{j} + \dots + x_{k}) (t - s_{j - 1})}}{(k - j + 1) b}

From this and induction, we see that the contribution from points {x_1, … x_k) with x_i ∈ [b_t, b] for all i is

\sim \frac{V_{0} u_{1} \dots u_{k}}{b^{k} k!} g {(b)}^{k} \int_{b_{t}}^{b} d x_{1} \dots \int_{b_{t}}^{b} d x_{k} e^{(λ_{0} + x_{1} + \dots + x_{k}) t}

Changing variables y_i = t(b − x_i) the above

\sim \frac{V_{0} u_{1} \dots u_{k} g {(b)}^{k}}{b^{k} t^{k} k!} e^{(λ_{0} + k b) t}

which proves the desired result.

In the proof of the last result, we showed that the dominant contribution comes from mutations with x_i > b_t. To prove our limit theorem we will also need a result regarding the times at which the mutations to the dominant types occur.

Lemma 4

Let $α_{k} = \frac{2 k + 1}{k b}$ . The contribution to EZ_k{t) from points with s₁ ≥ α_k log t is o(t^−2ke^(λ₀+kb)t).

Proof

Replace the X_i’s in the exponents by b’s, we can see from (4.1) that the expected contribution from points with s₁ ≥ α_k log t is

\begin{array}{l} \leq b^{k} G^{k} V_{0} u_{1} \dots u_{k} \int_{α_{k} log t}^{t} d s_{1} \int_{s_{1}}^{t} d s_{2} \dots \int_{s_{k - 1}}^{t} d s_{k} e^{λ_{0} t} e^{b (t - s_{1})} \dots e^{b (t - s_{k})} \\ \leq C e^{λ_{0} t} \int_{α_{k} log t}^{t} e^{k b (t - s_{1})} d s_{1} \\ \leq C e^{(λ_{0} + k b) t} t^{- α_{k} k b} \end{array}

and the desired result follows.

Recall that

k + p_{k} = \sum_{j = 0}^{k - 1} \frac{λ_{0} + k b}{λ_{0} + j b} .

For the induction used in the next proof, we will also need the corresponding quantity with λ₀ replaced by λ₀ + x and k by k − 1

k - 1 + p_{k - 1} (x) = \sum_{j = 0}^{k - 2} \frac{λ_{0} + x + (k - 1) b}{λ_{0} + x + j b}

which means

p_{k - 1} (x) = \sum_{j = 0}^{k - 2} \frac{(k - 1 - j) b}{λ_{0} + x + j b}

The limit will depend on the mutation rates through

u_{1, k} = \prod_{j = 1}^{k} u_{j}^{λ_{0} / (λ_{0} + (j - 1) b)}

Again we will need the corresponding quantity with k − 1 terms

u_{2, k} (x) = \prod_{j = 1}^{k - 1} u_{j + 1}^{(λ_{0} + x) / (λ_{0} + x + (j - 1) b)} .

We shall write u_2,_k = u_2,_k(b) and note that

u_{1, k} = u_{1} u_{2, k}^{λ_{0} / (λ_{0} + b)}

(4.2)

Proof of Theorem 5

We shall prove the result under the more general assumption that Z₀(t) ~ V₀e^λ₀t for some constant V₀. The result then holds for k = 1 by Lemma 2. We shall prove the general result by induction on k. To this end, suppose the result holds for k −1. Let $Z_{k}^{s, x, v} (t)$ be the type-k descendants at time t of the 1 mutant at (s,x,v) ∈ Inline graphic (t). Since $Z_{1}^{s, x} (t) \sim v e^{(λ_{0} + x) (t - s)}$ compared to Z₀(t) ~ V₀e^λ₀t, it follows from the induction hypothesis that

E exp (- θ {(t - s)}^{k - 1 + p_{k - 1} (x)} e^{- (λ_{0} + x + (k - 1) b) (t - s)} Z_{k}^{s, x, v} (t)) \to exp (- c_{k - 1} (λ_{0} + x, b) v u_{2, k} (x) θ^{(λ_{0} + x) / (λ_{0} + x + (k - 1) b)})

(4.3)

Integrating over the contributions from the three-dimensional point process we have

E exp (- θ Z_{k} (t)) = exp (- \int_{0}^{b} d x g (x) \int_{0}^{t} d s u_{1} V_{0} e^{λ_{0} s} \int_{0}^{\infty} d v {(\frac{λ_{0} + x}{a_{0} + x})}^{2} exp (- \frac{λ_{0} + x}{a_{0} + x} v) (1 - φ_{x, v, t - s}^{k - 1} (θ)))

where $φ_{x, v, t - s}^{k - 1} (θ) = E exp (- θ Z_{k}^{0, x, v} (t - s))$ . To prove the desired result we need to replace θ by θt^k⁺^pke^{−(λ₀+kb)t}. Doing this with (4.3) in mind we have

E exp (- θ t^{k + p_{k}} e^{- (λ_{0} + k b) t} Z_{k} (t)) = exp (- \int_{0}^{b} d x g (x) \int_{0}^{t} d s u_{1} V_{0} e^{λ_{0} s} \int_{0}^{\infty} d v {(\frac{λ_{0} + x}{a_{0} + x})}^{2} exp (- \frac{λ_{0} + x}{a_{0} + x} v) {1 - φ_{x, v, t - s}^{k - 1} (θ t^{k + p_{k}} e^{- (λ_{0} + x + (k - 1) b) (t - s)} e^{- (b - x) t} e^{- (λ_{0} + x + b (k - 1)) s})})

(4.4)

By Lemmas 3 and 4, we can restrict attention to x ∈ [b_t, b] and s ≤ α_k log t. The first restriction implies that all of the x’s except the one in (b − x) can be set equal to b and the second that we can replace t by t − s. Since (k + p_k) −(k − 1 + p_k₋₁ (b)) = (λ₀ + kb)/λ_0, the term in the exponential on the righthand side of (4.4) is

≃ - \int_{b_{t}}^{b} d x g (x) \int_{0}^{α_{k} log t} d s u_{1} V_{0} e^{λ_{0} s} \int_{0}^{\infty} d v {(\frac{λ_{0} + b}{a_{0} + b})}^{2} exp (- \frac{λ_{0} + b}{a_{0} + b} v) (1 - φ_{x, v, t - s}^{k - 1} (θ {(t - s)}^{k - 1 + p_{k - 1} (b)} e^{- (λ_{0} + k b) (t - s)} t^{(λ_{0} + k b) / λ_{0}} e^{- (b - x) t} e^{- (λ_{0} + k b) s}))

Changing variables s = R(t) + r where R(t) = (1/λ₀)(log t), and y = (b − x)t, dy = −tdx the above becomes

= - g (b) \int_{0}^{(2 k + 1) log t} d y \int_{0}^{\infty} d y {(\frac{λ_{0} + b}{a_{0} + b})}^{2} exp (- \frac{λ_{0} + b}{a_{0} + b} v) \int_{- R (t)}^{α_{k} log t - R (t)} d r u_{1} V_{0} e^{λ_{0} r} (1 - φ_{x, v, t - s}^{k - 1} (θ {(t - s)}^{k - 1 + p_{k - 1} (b)} e^{- (λ_{0} + k b) (t - s)} e^{- y} e^{- (λ_{0} + k b) r}))

Using (4.3) now we have that the 1 − φ term converges to

1 - exp (- c_{k - 1} (λ_{0} + b, b) v u_{2, k} {[θ e^{- y}]}^{(λ_{0} + b) / (λ_{0} + k b)} e^{- (λ_{0} + b) r})

To simplify this expression, we let

r = \frac{1}{λ_{0} + b} (q + Q (v, y)) where Q (v, y) = log {c_{k - 1} (λ_{0} + b, b) v u_{2, k} {[θ e^{- y}]}^{(λ_{0} + b) / (λ_{0} + k b)}}

dr = dq/(λ₀ + b). Plugging this into e^λ₀r results in

e^{q λ_{0} / (λ_{0} + b)} {(c_{k - 1} (λ_{0} + b, b) v u_{2, k})}^{λ_{0} / (λ_{0} + b)} θ^{λ_{0} / (λ_{0} + k b)} e^{- y λ_{0} / (λ_{0} + k b)}

so we conclude that the term in the exponential on the righthand side of (4.4) converges to

- g (b) \frac{c_{k - 1} {(λ_{0} + b, b)}^{λ_{0} / (λ_{0} + b)}}{λ_{0} + b} V_{0} u_{1} u_{2, k}^{λ_{0} / (λ_{0} + b)} θ^{λ_{0} / (λ_{0} + k b)} \int_{0}^{\infty} d v {(\frac{λ_{0} + b}{a_{0} + b})}^{2} v^{λ_{0} / (λ_{0} + b)} exp (- \frac{λ_{0} + b}{a_{0} + b} v) \int_{0}^{\infty} d y e^{- y λ_{0} / (λ_{0} + k b)} \int_{- \infty}^{\infty} \frac{d q}{λ_{0} + b} e^{q λ_{0} / (λ_{0} + b)} (1 - exp (- e^{- q}))

(4.5)

To obtain a cleaner expression for the constants, we begin by noting that the change of variables w = v (λ₀ + b)/(a₀ + b), dw = dv(λ₀ + b)/(a₀ + b) implies that

\int_{0}^{\infty} d v {(\frac{λ_{0} + b}{a_{0} + b})}^{2} v^{λ_{0} / (λ_{0} + b)} exp (- \frac{λ_{0} + b}{a_{0} + b} v) = {(\frac{a_{0} + b}{λ_{0} + b})}^{- 1 + λ_{0} / (λ_{0} + b)} Γ (1 + λ_{0} / (λ_{0} + b))

(4.6)

Furthermore, we also have

\int_{0}^{\infty} d y e^{- y λ_{0} / (λ_{0} + k b)} = \frac{λ_{0} + k b}{λ_{0}}

(4.7)

Finally, to evaluate the third integral in (4.5), we make the change of variables x = e⁻^q, dx = −e⁻^q dq, or dq = −dx/x to show that it is

= \int_{0}^{\infty} d x x^{- 1 - λ_{0} / (λ_{0} + b)} (1 - e^{- x}) d x .

Integrating by parts f (x) = 1−e⁻^x, g′(x) = x^{−1−λ₀/(λ₀+b)}, f′ (x) = e⁻^x, g(x) = x^{−λ₀/(λ₀+b)} (λ₀+ b)/λ₀ shows that the previous expression is

\frac{λ_{0} + b}{λ_{0}} Γ (1 - λ_{0} / (λ_{0} + b))

(4.8)

Combining (4.6) – (4.8) and using (4.2), we conclude that the expression in (4.5) is

= c_{k - 1} {(λ_{0} + b, b)}^{λ_{0} / (λ_{0} + b)} \cdot g (b) \frac{λ_{0} + k b}{λ_{0}} \cdot V_{0} u_{1, k} θ^{λ_{0} / (λ_{0} + k b)} \cdot \frac{1}{λ_{0}} {(\frac{a_{0} + b}{λ_{0} + b})}^{- 1 + λ_{0} / (λ_{0} + b)} Γ (1 + λ_{0} / (λ_{0} + b)) Γ (1 - λ_{0} / (λ_{0} + b))

(4.9)

Setting c_k(λ₀,b) equal to the quantity in (4.9) divided by V₀u_1,kθ^{λ₀/(λ₀+kb)} we have proved the result.

To work out an explicit formula for the constant and to compare with Durrett and Moseley (2009), it is useful to let λ_j = λ₀ + jb, a_j = a₀ + jb and

c_{h, j} = \frac{1}{λ_{j - 1}} {(\frac{a_{j}}{λ_{j}})}^{- 1 + λ_{j - 1} / λ_{j}} Γ (1 + λ_{j - 1} / λ_{j}) Γ (1 - λ_{j - 1} / λ_{j})

From this we see that

\begin{array}{l} c_{k} (λ_{0}, b) = c_{k - 1} {(λ_{1}, b)}^{λ_{0} / λ_{1}} g (b) \frac{λ_{k}}{λ_{0}} c_{h, 1} \\ = c_{k - 2} {(λ_{2}, b)}^{λ_{0} / λ_{2}} \cdot {(g (b) \frac{λ_{k - 1}}{λ_{0}} c_{h, 2})}^{λ_{0} / λ_{1}} \cdot g (b) \frac{λ_{k}}{λ_{0}} c_{h, 1} \end{array}

and hence

c_{k} (λ_{0}, b) = \prod_{j = 1}^{k} {(g (b) \frac{λ_{k - j + 1}}{λ_{0}} c_{h, j})}^{λ_{0} / λ_{j - 1}}

In Durrett and Moseley (2009) if we let Inline graphic be the σ-field generated by Z_j(t) for j ≤ k and all t ≥ 0 then

E (e^{- θ V_{k}} ∣ F_{k - 1}) = exp (- u_{k} V_{k - 1} c_{h, k} θ^{λ_{k - 1} / λ_{k}})

Iterating we have

\begin{array}{l} E (e^{- θ V_{k}} ∣ F_{k - 2}) = E (exp (- u_{k} V_{k - 1} c_{h, k} θ^{λ_{k - 1} / λ_{k}}) ∣ F_{k - 2}) \\ = exp (- u_{k - 1} u_{k}^{λ_{k - 2} / λ_{k - 1}} V_{k - 2} c_{h, k - 1} c_{h, k}^{λ_{k - 2} / λ_{k - 1}} θ^{λ_{k - 2} / λ_{k}}) \end{array}

and hence

E (e^{- θ V_{k}} ∣ V_{0}) = exp (- c_{θ, k} V_{0} u_{1, k} θ^{λ_{0} / λ_{k}})

where $c_{θ, k} = \prod_{j = 1}^{k} c_{h, j}^{λ_{0} / λ_{j - 1}}$ .

5 Proofs for unbounded distributions

In this Section, we prove Theorems 6 and 7. The first step is to show that unlike in the case of bounded mutational advances, for unbounded distributions, the main contribution to the limit is given by the descendants of a single mutation. We will later show that the largest growth rate will come from z = O(t⁽^α^+1)/^α) so the next result is enough. Recall that z(s, x) = (λ₀ + x) (t − s) is the growth rate of the family descended from a mutant at (s, x).

Lemma 5

For any z̄, t > 0, we have

E (\sum_{(s, x) : z (s, x) \leq \bar{z}} Z_{1}^{s, x} (t)) \leq V_{0} u_{1} t e^{λ_{0} t + \bar{z}}

Proof

z(s, y) ≤ z̄ if and only if we have a mutation at time s which increases fitness by y ≤ z̄/(t − s) − λ₀ and hence, the expected number of individuals produced by mutations with growth rates ≤ z̄ is

V_{0} u_{1} \int_{0}^{t} \int_{0}^{\frac{\bar{z}}{t - s} - λ_{0}} e^{λ_{0} s} \cdot e^{z (s, y)} ν (d y) d s \leq V_{0} u_{1} t e^{λ_{0} t + \bar{z}}

since $\int_{0}^{\infty} ν (d y) = 1$ .

To motivate the proof of the general result, we begin with the case when α = 1. Recall that the mean number of mutations with growth rate larger than z has

μ (z, \infty) = K V_{0} u_{1} \int_{0}^{t} q (z / (t - s) - λ_{0}) {(\frac{z}{t - s} - λ_{0})}^{β} exp (φ (s, z)) d s

where q, φ are as in (1.4), (1.5).

Proof of Theorem 6

Since

Z_{1} (t) = \sum_{(s, x) \in N (t)} Z_{1}^{s, x} (t) = \sum_{(s, x) : z (s, x) \leq z} Z_{1}^{s, x} (t) + \sum_{(s, x) : z (s, x) > z} Z_{1}^{s, x} (t)

for any z > 0, we have

\frac{1}{t} log Z_{1} (t) \sim \frac{1}{t} [log (\sum_{(s, x) : z (s, x) \leq z} Z_{1}^{s, x} (t)) \lor log (\sum_{(s, x) : z (s, x) > z} Z_{1}^{s, x} (t))]

as t → ∞. Lemma 5 tells us that if there is a mutation with growth rate z = O(t²), then the contribution from mutations with growth rates smaller than z − ε can be ignored so it suffices to describe the distribution of the largest growth rates. We will show that if

z_{t} = c_{0} t^{2} (1 + \frac{(2 β + 1) log t}{λ_{0} t} + f (t))

then

μ (z_{t}, \infty) \to {\begin{array}{l} 4 c_{0}^{β} {(π / λ_{0})}^{1 / 2} V_{0} u_{1} exp (γ λ_{0} - 2 λ_{0} x / 2 c_{0}) & if & t f (t) \to \frac{x}{c_{0}}, x \geq 0 \\ 0 & if & t f (t) \to \infty \end{array}

(5.1)

so that the largest growth rate is O(t²) and comes from the rightmost particle in the point process with intensity given by (1.8).

To prove (5.1), we first need to locate the maximum of φ. Let z > λ₀t so that there exists a unique maximum s_z. Solving φ_s(s, z) = 0 and using the expression for φ_s in (1.6) yields

s_{z} = t - a_{0} z^{1 / 2}

where a₀ = (γ/λ₀)^1/2 = (4c₀)^−1/2 which leads to the expression

\begin{array}{l} φ (s_{z}, z) = λ_{0} t - λ_{0} (t - s_{z}) - γ (\frac{z}{t - s} - λ_{0}) \\ = λ_{0} t - λ_{0} a_{0} z^{1 / 2} - γ z^{1 / 2} / a_{0} + γ λ_{0} \\ = λ_{0} (t - 2 a_{0} z^{1 / 2}) + γ λ_{0} . \end{array}

(5.2)

If we take

z_{t} = c_{0} t^{2} (1 + \frac{(2 β + 1) log t}{λ_{0} t} + f (t)) = {(\frac{t}{2 a_{0}})}^{2} (1 + \frac{(2 β + 1) log t}{λ_{0} t} + f (t))

in (5.2) and use (1 + y)^1/2 = 1 + y/2 + O(y²), we obtain

φ (s_{z_{t}}, z_{t}) = - \frac{(2 β + 1) log t}{2} - λ_{0} t f (t) + γ λ_{0} + o (1)

(5.3)

as t → ∞. Furthermore, (1.7) implies that

\begin{array}{l} φ_{s s} (s_{z_{t}}, z_{t}) = - \frac{2 γ z_{t}}{{(t - s_{z_{t}})}^{3}} = - \frac{2 γ}{a_{0}^{3} z_{t}^{1 / 2}} = - \frac{a}{t} + o (1) \\ φ_{sss} (s_{z_{t}}, z_{t}) = - \frac{6 γ z_{t}}{{(t - s_{z_{t}})}^{4}} = - \frac{6 γ}{a_{0}^{4} z_{t}} = - \frac{24 γ}{a_{0}^{t} t^{2}} + o (1) \end{array}

as t → ∞ with $a = 4 γ / a_{0}^{2}$ . Since φ_s(s_z, z) = 0, taking a Taylor expansion around s_z yields

φ (s, z_{t}) = φ (s_{z_{t}}, z_{t}) - \frac{a}{2 t} {(s - s_{z_{t}})}^{2} + g (s, z_{t})

(5.4)

where |g(s, z)| ≤ C|s − s_z|³/t² for all s. Also note that letting

ψ (s, z) = q (z / (t - s) - λ_{0}) {(\frac{z}{t - s} - λ_{0})}^{β}

and recalling (1.4), we have

\begin{array}{l} ψ (s_{z_{t}}, z_{t}) = q (z_{t} / (t - s_{z_{t}}) - λ_{0}) {(\frac{z}{t - s_{z_{t}}} - λ_{0})}^{β} \\ = c_{t} z_{t}^{β / 2} / a_{0}^{β} + o (z_{t}^{β / 2}) \\ = c_{t} {(2 c_{0})}^{β} t^{β} + o (t^{β}) \end{array}

where c_t → 1 as t → ∞ so that

ψ (s, z_{t}) = c_{t} {(2 c_{0})}^{β} t^{β} + g_{2} (s, z_{t})

where |g₂(s, z)||s − s_z|⁻¹t⁻^β = o(1).

Write

\int_{0}^{t} ψ (s, z_{t}) e^{φ (s, z_{t})} d s = \int_{A} ψ (s, z_{t}) e^{φ (s, z_{t})} d s + \int_{A^{c}} ψ (s, z_{t}) e^{φ (s, z_{t})} d s

where A = {s: |s − s_{z_t}| ≤ C(t log t)^1/2} ∩ [0, t]. Since concavity implies that for s ∈ A^c and C sufficiently large, we have

exp (φ (s, z_{t})) \leq \frac{1}{t^{2 + β}} exp (φ (s_{z_{t}}, z_{t}))

the contribution of the second integral is negligible. After the change of variables s = s_{z_t} + (t/a)^1/2r, when t is large, the first integral becomes

\int_{A} ψ (s, z_{t}) e^{φ (s, z_{t})} d s = (c_{t} {(2 c_{0})}^{β} t^{β} + o (1)) e^{φ (s_{z_{t},} z_{t})} \int_{- C {(log t)}^{1 / 2}}^{C {(log t)}^{1 / 2}} e^{g (s, z_{t})} e^{- r^{2} / 2} {(t / a)}^{1 / 2} d r .

and therefore since |g(s, z_t)| ≤ C(t log t)^3/2/t² when s ∈ A, we have

μ (z_{t}, \infty) = K V_{0} u_{1} \int_{0}^{t} ψ (s, z_{t}) e^{φ (s, z_{t})} d s \sim K V_{0} u_{1} b t^{β + 1 / 2} e^{φ (s_{z_{t}}, z_{t})}

(5.5)

where $b = {(2 c_{0})}^{β} \sqrt{2 π / a} = {(2 c_{0})}^{β} {(π / λ_{0})}^{1 / 2}$ . Since

φ (s_{z_{t}}, z_{t}) = - \frac{(2 β + 1) λ_{0} log t}{2} - λ_{0} t f (t) + γ λ_{0}

we can conclude that

μ (z_{t}, \infty) \to {\begin{array}{l} K V_{0} u_{1} b exp (γ λ_{0} - 2 λ_{0} a_{0}^{2} x) = V_{0} u_{1} b exp (γ λ_{0} - 2 λ_{0} x / 2 c_{0}) & if & t f (t) \to \frac{x}{c_{0}} \\ 0 & if & t f (t) \to \infty \end{array}

which proves (5.1).

When α ≠ 1, we no longer have an explicit formula for the maximum value s_z which complicates the process of identifying the largest growth rate. We shall assume for convenience that α > 1 is an integer.

Proof of Theorem 7

As in the proof of Theorem 6, it suffices to describe the distribution for the largest growth rates. Let z > λ₀t so the maximum s_z exists. To find a useful expression for the value of φ(s_z, z), we write

φ (s, z) = λ_{0} t - λ_{0} (t - s) - γ {(\frac{z}{t - s} - λ_{0})}^{α} .

Using the definition of s_z as the solution to φ_s(s_z, z) = 0 yields the condition that

{(t - s_{z})}^{α + 1} = \frac{α γ}{λ_{0}} z^{α} {(1 - λ_{0} \frac{t - s_{z}}{z})}^{α - 1}

i.e.,

t - s_{z} = {(\frac{α γ}{λ_{0}})}^{1 / (α + 1)} z^{α / (α + 1)} {(1 - λ_{0} \frac{t - s_{z}}{z})}^{(α - 1) / (α + 1)} .

If we substitute the right side of this equation back in for t − s_z in the parenthesis, then writing a₀ = (αγ/λ₀)^1/(^α⁺¹⁾ we have

\begin{array}{l} t - s_{z} = a_{0} z^{α / (α + 1)} {(1 - λ_{0} a_{0} z^{- 1 / (α + 1)} {(1 - \frac{λ_{0} (t - s_{z})}{z})}^{\frac{α - 1}{α + 1}})}^{\frac{α - 1}{α + 1}} \\ = a_{0} z^{α / (α + 1)} {(1 - λ_{0} a_{0} z^{- 1 / (α + 1)} {(1 - λ_{0} a_{0} z^{- 1 / (α + 1)} {(1 - \frac{λ_{0} (t - s_{z})}{z})}^{\frac{α - 1}{α + 1}})}^{\frac{α - 1}{α + 1}})}^{\frac{α - 1}{α + 1}} \end{array}

Repeating this α times and then using the approximation (1 − x)ⁿ = 1 − nx + O(x²) with n = (α − 1)/(α + 1) yields

t - s_{z} = z^{α / (α + 1)} (\sum_{j = 0}^{α} a_{j} z^{- j / (α + 1)} + O (z^{- 1}))

(5.6)

where

a_{j} = a_{0} {(\frac{λ_{0} a_{0} (α - 1)}{α + 1})}^{j}

for j ≥ 1. The error term is O(z⁻¹) because

0 < (1 - λ_{0} (t - s) / z) \leq 1

for all z > λ₀t and s ≤ t. Factoring out a₀ in (5.6) and using (1 + x)⁻¹ = Σ(−x)^j when |x| < 1, we have that

\begin{array}{l} \frac{z}{t - s} - λ_{0} = a_{0}^{- 1} z^{1 / (α + 1)} (1 - \sum_{i_{1} = 1}^{α} a_{0}^{- 1} a_{i_{1}} z^{- i_{1} / (α + 1)} + \sum_{i_{1}, i_{2} = 1}^{α} a_{0}^{- 2} a_{i_{1}} z^{- (i_{1} + i_{2}) / (α + 1)} - \dots + {(- 1)}^{α} \sum_{i_{1}, \dots, i_{α} = 1}^{α} a_{0}^{- α} \prod_{j = 1}^{α} a_{i_{j}} z^{- \sum_{j = 1}^{α} i_{j} / (α + 1)} + O (z^{- 1})) - λ_{0} z^{1 / (α + 1)} z^{- 1 / (α + 1)} \\ = z^{1 / (α + 1)} (\sum_{j = 0}^{α} b_{j} z^{- j / (α + 1)} + O (z^{- 1})) \end{array}

(5.7)

for large z where the b_j are given by

\begin{array}{l} b_{0} = 1 / a_{0} \\ b_{1} = - a_{1} / a_{0}^{2} - λ_{0} \\ b_{2} = - (a_{2} - a_{1}^{2}) / a_{0}^{3} \\ b_{3} = - (a_{4} - 2 a_{1} a_{3} - a_{2}^{2} - 3 a_{1}^{2} a_{2} + a_{1}^{4}) / a_{0}^{4} \end{array}

and in general,

b_{i} = \sum_{k = 1}^{α} \sum_{\begin{matrix} i_{1}, \dots, i_{k} = 1 \\ i_{1} + \dots + i_{k} = i \end{matrix}}^{α} {(- a_{0})}^{- (k + 1)} \prod_{j = 1}^{k} a_{i_{j}} .

(5.7) implies that

- γ {(\frac{z}{t - s} - λ_{0})}^{α} = - γ z^{α / (α + 1)} (b_{0}^{α} + α b_{0}^{α - 1} b_{1} z^{- 1 / (α + 1)} + (α b_{0}^{α - 1} b_{2} + (\begin{matrix} α \\ 2 \end{matrix}) b_{0}^{α - 2} b_{1}^{2}) z^{- 2 / (α + 1)} + \dots + (α b_{0}^{α - 1} b_{α} + \dots + b_{1}^{α}) z^{α / (α + 1)} + O (z^{- 1}))

and therefore,

\begin{array}{l} φ (s_{z}, z) = λ_{0} t + λ_{0} (t - s) - γ {(\frac{z}{t - s} - λ_{0})}^{α} \\ = λ_{0} t + \sum_{j = 0}^{α} d_{j} z^{\frac{α - j}{α + 1}} + O (z^{- 1 / (α + 1)}) \end{array}

(5.8)

where the d_j can be calculated explicitly, for example:

\begin{array}{l} d_{0} = - λ_{0} a_{0} - γ b_{0}^{α} \\ d_{1} = - λ_{0} a_{1} - γ α b_{0}^{α - 1} c_{1} \\ d_{2} = - λ_{0} a_{2} - γ (α b_{0}^{α - 1} b_{2} + (\begin{matrix} α \\ 2 \end{matrix}) b_{0}^{α - 2} b_{1}^{2}) \\ d_{3} = - λ a_{3} - γ (α b_{0}^{α - 1} b_{3} + (\begin{matrix} α \\ 2 \end{matrix}) b_{0}^{α - 2} b_{1} b_{2} + (\begin{matrix} α \\ 3 \end{matrix}) b_{0}^{α - 3} b_{1}^{3}) . \end{array}

To figure out the distribution of the growth rate for the largest mutant, we let c₀ = (−λ₀/d₀)⁽^α^+1)/^α and then search for κ_j, j = 1, …, α − 1 and κ so that plugging

z_{t} = c_{0} t^{(α + 1) / α} (1 + \sum_{j = 1}^{α - 1} κ_{j} t^{- j / α} + \frac{κ log t}{t} + f (t))

into (5.8) yields

φ (s_{z_{t}}, z_{t}) = k_{1} - k_{2} t f (t) - k_{3} log t

(5.9)

for some constants k₁, k₂, k₃. Substituting z_t into (5.8) and writing κ₀ = 1, κ_α = x/c₀ to ease the notation we obtain

φ (s_{z_{t}}, z_{t}) = λ_{0} t + \sum_{j = 0}^{α} d_{j} {(- \frac{λ_{0} t}{d_{0}})}^{(α - j) / α} {(\sum_{j = 0}^{α} κ_{j} t^{- j / α} + κ t^{- 1} log t)}^{(α - j) / (α + 1)} + O (t^{- 1 / α}) .

Since λ₀t + d₀(−λ₀t/d₀) = 0, the first order terms in this expansion is t⁽^α^−1)/^α and after using the Taylor series expansion

{(1 + x)}^{p} = 1 + p x + p (p - 1) x^{2} / 2 + \dots + p (p - 1) \dots (p - α + 1) x^{α} / α! + O (x^{α + 1})

we obtain

φ (s_{z_{0}}, z_{0}) = \sum_{j = 1}^{α} ρ_{j} t^{(α - j) / α} + ρ log t + O (t^{- 1 / α} log t)

(5.10)

where

\begin{array}{l} ρ = d_{0} (- \frac{λ_{0}}{d_{0}}) (\frac{α}{α + 1}) κ = - \frac{α λ_{0}}{α + 1} κ \\ ρ_{1} = d_{0} (- \frac{λ_{0}}{d_{0}}) (\frac{α}{α + 1}) c_{1} + d_{1} {(- \frac{λ_{0}}{d_{0}})}^{(α - 1) / α} \\ ρ_{2} = d_{0} (- \frac{λ_{0}}{d_{0}}) [\frac{α}{α + 1} c_{2} + \frac{α}{α + 1} (\frac{α}{α + 1} - 1) c_{1}^{2}] + d_{1} {(- \frac{λ_{0}}{d_{0}})}^{(α - 1) / α} (\frac{α - 1}{α}) c_{1} + d_{2} {(- \frac{λ_{0}}{d_{0}})}^{(α - 2) / α} \end{array}

and in general

ρ_{j} = \sum_{i = 0}^{j} d_{i} {(- \frac{λ_{0}}{d_{0}})}^{(α - i) / α} \sum_{k = 1}^{j - i} \prod_{ℓ = 1}^{k} (\frac{α - i}{α + 1} - ℓ + 1) κ_{i_{ℓ}}

j = 1, 1, …, α where for each i and k, in the inner product, i₁, …, i_k are always chosen to satisfy i₁ + i₂ + ··· + i_k = j − i. Since ρ_j depends only on κ_i, i ≤ j, then after noting that the coefficient of κ_j in ρ_j is −αλ₀/(α + 1), we can use forward substitution to solve the system ρ_j = 0, j = 1, 2, …, α − 1 for κ_j to obtain the recursive formulas

c_{j} \equiv κ_{j} = - \frac{α + 1}{α λ_{0}} (ρ_{j} - \frac{- α λ_{0}}{α + 1} κ_{j})

(5.11)

for i = 1, 2, …, α − 1. Setting ρ = −k₃ yields

κ = \frac{(α + 1) k_{3}}{α λ_{0}}

and for this choice of c_j, κ, we obtain (5.9) with

k_{2} = - \frac{α}{α + 1} \frac{d_{0}}{c_{0}} (- \frac{λ_{0}}{d_{0}}) = \frac{α λ_{0}}{(α + 1) c_{0}}

and k₁ = −(ρ_α − k₂x). Since

\begin{array}{l} {(\frac{z_{t}}{t - s_{z_{t}}} - λ_{0})}^{β} = z_{t}^{β / (α + 1)} / a_{0}^{β} + o (z_{t}^{β / (α + 1)}) \\ = {(\frac{c_{0}^{1 / (α + 1)}}{a_{0}})}^{β} t^{β / α} + o (z_{t}^{β / (α + 1)}) \end{array}

choosing k₃ = (2β/α + 1)/2 replaces (5.3) in the proof of Theorem 6.

Now substituting (5.6) and (5.7) in (1.7) yields

\begin{array}{l} φ_{s s} (s_{z}, z) = - α (α - 1) γ z^{\frac{α - 2}{α + 1}} {(\sum_{j = 0}^{α} b_{j} z^{- j / (α + 1)} + O (z^{- 1}))}^{α - 2} \times \frac{z^{2}}{z^{4 α / (α + 1)} {(\sum_{j = 0}^{α} a_{j} z^{- j / (α + 1)} + O (z^{- 1}))}^{4}} - α γ z^{\frac{α - 1}{α + 1}} {(\sum_{j = 0}^{α} b_{j} z^{- j / (α + 1)} + O (z^{- 1}))}^{α - 1} \times \frac{2 z}{z^{3 α / (α + 1)} {(\sum_{j = 0}^{α} a_{j} z^{- j / (α + 1)} + O (z^{- 1}))}^{3}} \\ = [- α (α - 1) γ b_{0}^{α - 2} / a_{0}^{4} - α γ b_{0}^{α - 1} / a_{0}^{3}] z^{- α / (α + 1)} + o (z^{- α / (α + 1)}) \\ = - \frac{α^{2} γ}{a_{0}^{α + 2}} z^{- α / (α + 1)} + o (z^{- α / (α + 1)}) \end{array}

where in the second to last line we have used the fact that $b_{0} = a_{0}^{- 1}$ . When z = z_t, this becomes

φ_{s s} (s_{z_{t}}, z_{t}) = - \frac{a}{t} + o (t^{- 1})

where

a = \frac{α^{2} γ}{a_{0}^{α + 2} c_{0}^{α / (α + 1)}} .

Since φ_s(s_z, z) = 0 and a calculation similar to the one above shows that φ_sss(s_{z_t}, z_t) = O(t⁻²), we have

φ (s, z_{t}) = φ (s_{z_{t}}, z_{t}) - \frac{a}{2} {(s - s_{z_{t}})}^{2} + g (s_{z_{t}}, z_{t})

where |g(s,z)| ≤ C|s − s_z|³/t² for all s. This replaces (5.4) from the α = 1 proof and the rest of the proof is the same. Note that the intensity for the limiting point process is given by

K V_{0} u_{1} {(\frac{c_{0}^{1 / (α + 1)}}{a_{0}})}^{β} \sqrt{2 π / a} exp (k_{1} - k_{2} x) .

(5.12)

Remark 1

From (5.6), we have

t - s_{z_{t}} \sim a_{0} {(c_{0} t^{(α + 1) / α})}^{(α + 1) / α} = \frac{α t}{α + 1}

which tells us that the time at which the mutant with largest growth rate is born is ~ t/(α + 1).

6 Discussion

In this paper, we have analyzed a multi-type branching process model of tumor progression in which mutations increase the birth rates of cells by a random amount. We studied both bounded and unbounded distributions for the random fitness advances and calculated the asymptotic rate of expansion for the kth generation of mutants.

In the bounded setting, we found that there are only two parameters of the distribution that affect the limiting growth rate of the kth generation (see Theorems 1, 2, 4, and 5): the upper bound for the support of the distribution and the value of its density at the upper bound. This is a rather intuitive result since one would expect that in the long run, the kth generation will be dominated by mutants with the maximum possible fitness. In addition, we found that there is a polynomial correction to the exponential growth of the kth generation. This correction is not present in the case where the fitness advances are deterministic. We have discussed this point in further detail in Section 1.1 and after the proof of Theorem 5 in Section 4. Finally, we showed that the limiting population is descended from several different mutations (see Theorem 3).

In the unbounded setting, we assumed that the distribution of the fitness advance has tail

ν (x, \infty) \sim K x^{β} e^{- γ x^{α}}

(6.1)

where α, β, γ, and K are parameters. We found that the population of cells with a single mutation grows asymptotically at a super-exponential rate exp(t⁽^α^+1)/^α) (see Theorems 6 and 7) and at large times, most of the first generation is derived from a single mutation (see Lemma 5 and the preceding paragraph). The super-exponential growth rate suggests that distributions of the form (6.1), which includes the exponential distribution that is often used to model fitness advances of organisms under selective pressure, is not a good choice for modeling the mutational advances in the progression to cancer where there is very little evidence for populations growing at a super-exponential rate.

These conclusions provide several interesting contributions to the existing literature on evolutionary models of cancer progression. First, our model generalizes previous multi-type branching models of tumor progression by allowing for random fitness advances as mutations are accumulated and provides a mathematical framework for further investigations into the role played by the fitness distribution of mutational advances in driving tumorigenesis. Second, we have discovered that bounded distributions lead to exponential growth whereas unbounded distributions lead to super-exponential growth. This dichotomy might provide a new method for testing whether a tumor population has evolved with an unbounded distribution of mutational advances. Third, we observe that in the case of bounded distributions, the growth rate of the tumor is somewhat ‘robust’ with respect to the mutational fitness distribution and depends only on its upper endpoint. Finally, our calculations of the growth rates for the kth generation of mutants serve as a groundwork for studying the evolution and role of heterogeneity in tumorigenesis. These implications will be explored further in future work.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

Becskei A, Kaufmann BB, van Oudenaarden A. Contributions of low molecule number and chromosomal positioning to stochastic gene expression. Nature Genetics. 2005;9:937–944. doi: 10.1038/ng1616. [DOI] [PubMed] [Google Scholar]
Beerenwinkel N, Antal T, Dingli D, Traulsen A, Kinzler KW, Velculescu VE, Vogelstein B, Nowak MA. Genetic progression and the waiting time to cancer. PLoS Computational Biology. 2007;3 doi: 10.1371/journal.pcbi.0030225. paper e225. [DOI] [PMC free article] [PubMed] [Google Scholar]
Beisel CJ, Rokyta DR, Wichman HA, Joyce P. Testing the extreme value domain of attraction for distributions of beneficial fitness effects. Genetics. 2007;176:2441–2449. doi: 10.1534/genetics.106.068585. [DOI] [PMC free article] [PubMed] [Google Scholar]
Bodmer W, Tomlinson I. Failure of programmed cell death and differentiation as causes of tumors: some simple mathematical models. Proc Natl Acad Sci USA. 1995;92:11130–11134. doi: 10.1073/pnas.92.24.11130. [DOI] [PMC free article] [PubMed] [Google Scholar]
Coldman AJ, Murray JM. Optimal control for a stochastic model of cancer chemotherapy. Mathematical Biosciences. 2000;168:187–200. doi: 10.1016/s0025-5564(00)00045-6. [DOI] [PubMed] [Google Scholar]
Cowperthwaite MC, Bull JJ, Meyers LA. Distributions of beneficial fitness effects in RNA. Gentics. 2005;170:1449–1457. doi: 10.1534/genetics.104.039248. [DOI] [PMC free article] [PubMed] [Google Scholar]
Durrett R, Mayberry J. Traveling waves of selective sweeps. Ann Appl Prob. 2009 to appear. [Google Scholar]
Durrett R, Moseley S. Evolution of resistance and progression to disease during clonal expansion of cancer. Theor Pop Biol. 2009 doi: 10.1016/j.tpb.2009.10.008. to appear. [DOI] [PubMed] [Google Scholar]
Durrett R, Schmidt D, Schweinsberg J. A waiting time problem arising from the study of multi-stage carcinogenesis. Ann Appl Prob. 2009;19:676–718. [Google Scholar]
Elowitz MB, et al. Stochastic gene expression in a single cell. Science. 2002;297:1183–1186. doi: 10.1126/science.1070919. [DOI] [PubMed] [Google Scholar]
Feinerman O, et al. Variability and robustness in T cell activation from regulated heterogeneity in protein levels. Science. 2008;321:1081. doi: 10.1126/science.1158013. [DOI] [PMC free article] [PubMed] [Google Scholar]
Frank SA. Dynamics of Cancer: Incidence, Inheritance and Evolution. Princeton Series in Evolutionary Biology. 2007 [PubMed] [Google Scholar]
Gillespie JH. A simple stochastic gene substitution model. Theor Pop Biol. 1983;23:202–215. doi: 10.1016/0040-5809(83)90014-x. [DOI] [PubMed] [Google Scholar]
Gillespie JH. Molecular evolution over the mutational landscape. Evolution. 1984;38:1116–1129. doi: 10.1111/j.1558-5646.1984.tb00380.x. [DOI] [PubMed] [Google Scholar]
Goldie JH, Coldman AJ. Quantitative model for multiple levels of drug resistance in clinical tumors. Cancer Treatment Reports. 1983;67:923–931. [PubMed] [Google Scholar]
Goldie JH, Coldman AJ. The genetic origin of drug resistance in neoplasms: implications for systemic therapy. Cancer Research. 1984;44:3643–3653. [PubMed] [Google Scholar]
Haeno H, Iwasa Y, Michor F. The evolution of two mutations during clonal expansion. Genetics. 2007;177:2209–2221. doi: 10.1534/genetics.107.078915. [DOI] [PMC free article] [PubMed] [Google Scholar]
Iwasa Y, Michor F, Komorova NL, Nowak MA. Population genetics of tumor suppressor genes. J Theor Biol. 2005;233:15–23. doi: 10.1016/j.jtbi.2004.09.001. [DOI] [PubMed] [Google Scholar]
Iwasa Y, Nowak MA, Michor F. Evolution of resistance during clonal expansion. Genetics. 2006;172:2557–2566. doi: 10.1534/genetics.105.049791. [DOI] [PMC free article] [PubMed] [Google Scholar]
Kassen R, Bataillon T. Distribution of fitness effects among beneficial mutations before selection in experimental populations of bacteria. Nature Genetics. 2006;38:484–488. doi: 10.1038/ng1751. [DOI] [PubMed] [Google Scholar]
Kaern M, et al. Stochasticity in gene expression: from theories to phenotypes. Nature Reviews Genetics. 2005;6:451. doi: 10.1038/nrg1615. [DOI] [PubMed] [Google Scholar]
Knudson AD. Two genetic hits (more or less) to cancer. Nature Reviews Cancer. 2001;1:157–162. doi: 10.1038/35101031. [DOI] [PubMed] [Google Scholar]
Komarova NL, Wodarz D. Drug resistance in cancer: principles of emergence and prevention. Proc Natl Acad Sci USA. 2005;102:9714–9719. doi: 10.1073/pnas.0501870102. [DOI] [PMC free article] [PubMed] [Google Scholar]
Maley CC, et al. Genetic clonal diveresity predicts progression to esophageal adenocarcinoma. Nature Genetics. 2006;38:468–473. doi: 10.1038/ng1768. [DOI] [PubMed] [Google Scholar]
Maley CC, Forrest Exploring the relationship between neutral and selective mutations in cancer. Artif Life. 2001;6:325–345. doi: 10.1162/106454600300103665. [DOI] [PubMed] [Google Scholar]
Michor F, Iwasa Y, Nowak MA. Dynamics of cancer progression. Nature Reviews Cancer. 2004;4:197–205. doi: 10.1038/nrc1295. [DOI] [PubMed] [Google Scholar]
Michor F, Nowak MA, Iwasa Y. Stochastic dynamics of metastasis formation. J Theor Biol. 2006;240:521–530. doi: 10.1016/j.jtbi.2005.10.021. [DOI] [PubMed] [Google Scholar]
Michor F, Iwasa Y. Dynamics of metastasis suppressor gene inactivation. J Theor Biol. 2006;241:676–689. doi: 10.1016/j.jtbi.2006.01.006. [DOI] [PubMed] [Google Scholar]
Nowak MA, Michor F, Iwasa Y. Genetic instability and clonal expansion. J Theor Biol. 2006;241:26–32. doi: 10.1016/j.jtbi.2005.11.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
Nowell PC. The cloncal evolution of tumor cell populations. Science. 1976;194:23–28. doi: 10.1126/science.959840. [DOI] [PubMed] [Google Scholar]
Orr HA. The distribution of fitness effects among beneficial mutations. Genetics. 2003;163:1519–1526. doi: 10.1093/genetics/163.4.1519. [DOI] [PMC free article] [PubMed] [Google Scholar]
Otto SP, Jones CD. Detecting the undetected: Estimating the total number of loci underlying a quantitative trait. Genetics. 2002;156:2093–2107. doi: 10.1093/genetics/156.4.2093. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rokyta DR, Beisel CJ, Joyce P, Ferris MT, Burch CL, Wichman HA. Beneficial fitness effects are not exponential in two viruses. J Mol Evol. 2008;67:368–376. doi: 10.1007/s00239-008-9153-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
Rozen DE, de Visser JAGM, Gerrish PJ. Fitness effects of fixed beneficial mutations in microbial populations. Curret Biology. 2002;12:1040–1045. doi: 10.1016/s0960-9822(02)00896-5. [DOI] [PubMed] [Google Scholar]
Sanjuán R, Moya A, Elena SF. The distribution of fitness effects caused by single-nucleotide substitutions in an RNA virus. Proc Natl Acad Sci, USA. 2004;101:8396–8401. doi: 10.1073/pnas.0400146101. [DOI] [PMC free article] [PubMed] [Google Scholar]
Schweinsberg J. The waiting time for m mutations. Electron J Probab. 2008;13:1442–1478. [Google Scholar]
Shah SP, et al. Mutational evolution in a lobular breast tumour profiled at single nucleotide resolution. Nature. 2009;461:809–813. doi: 10.1038/nature08489. [DOI] [PubMed] [Google Scholar]
Weissman I. Estimation of parameters and large quantiles based on the k largest observations. j Amer Stat Assoc. 1978;73:812–815. [Google Scholar]
Wodarz D, Komarova NL. Can loss of apoptosis protect against cancer? Trends Genet. 2007;23:232–237. doi: 10.1016/j.tig.2007.03.005. [DOI] [PubMed] [Google Scholar]

[R1] Becskei A, Kaufmann BB, van Oudenaarden A. Contributions of low molecule number and chromosomal positioning to stochastic gene expression. Nature Genetics. 2005;9:937–944. doi: 10.1038/ng1616. [DOI] [PubMed] [Google Scholar]

[R2] Beerenwinkel N, Antal T, Dingli D, Traulsen A, Kinzler KW, Velculescu VE, Vogelstein B, Nowak MA. Genetic progression and the waiting time to cancer. PLoS Computational Biology. 2007;3 doi: 10.1371/journal.pcbi.0030225. paper e225. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] Beisel CJ, Rokyta DR, Wichman HA, Joyce P. Testing the extreme value domain of attraction for distributions of beneficial fitness effects. Genetics. 2007;176:2441–2449. doi: 10.1534/genetics.106.068585. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] Bodmer W, Tomlinson I. Failure of programmed cell death and differentiation as causes of tumors: some simple mathematical models. Proc Natl Acad Sci USA. 1995;92:11130–11134. doi: 10.1073/pnas.92.24.11130. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R5] Coldman AJ, Murray JM. Optimal control for a stochastic model of cancer chemotherapy. Mathematical Biosciences. 2000;168:187–200. doi: 10.1016/s0025-5564(00)00045-6. [DOI] [PubMed] [Google Scholar]

[R6] Cowperthwaite MC, Bull JJ, Meyers LA. Distributions of beneficial fitness effects in RNA. Gentics. 2005;170:1449–1457. doi: 10.1534/genetics.104.039248. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R7] Durrett R, Mayberry J. Traveling waves of selective sweeps. Ann Appl Prob. 2009 to appear. [Google Scholar]

[R8] Durrett R, Moseley S. Evolution of resistance and progression to disease during clonal expansion of cancer. Theor Pop Biol. 2009 doi: 10.1016/j.tpb.2009.10.008. to appear. [DOI] [PubMed] [Google Scholar]

[R9] Durrett R, Schmidt D, Schweinsberg J. A waiting time problem arising from the study of multi-stage carcinogenesis. Ann Appl Prob. 2009;19:676–718. [Google Scholar]

[R10] Elowitz MB, et al. Stochastic gene expression in a single cell. Science. 2002;297:1183–1186. doi: 10.1126/science.1070919. [DOI] [PubMed] [Google Scholar]

[R11] Feinerman O, et al. Variability and robustness in T cell activation from regulated heterogeneity in protein levels. Science. 2008;321:1081. doi: 10.1126/science.1158013. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] Frank SA. Dynamics of Cancer: Incidence, Inheritance and Evolution. Princeton Series in Evolutionary Biology. 2007 [PubMed] [Google Scholar]

[R13] Gillespie JH. A simple stochastic gene substitution model. Theor Pop Biol. 1983;23:202–215. doi: 10.1016/0040-5809(83)90014-x. [DOI] [PubMed] [Google Scholar]

[R14] Gillespie JH. Molecular evolution over the mutational landscape. Evolution. 1984;38:1116–1129. doi: 10.1111/j.1558-5646.1984.tb00380.x. [DOI] [PubMed] [Google Scholar]

[R15] Goldie JH, Coldman AJ. Quantitative model for multiple levels of drug resistance in clinical tumors. Cancer Treatment Reports. 1983;67:923–931. [PubMed] [Google Scholar]

[R16] Goldie JH, Coldman AJ. The genetic origin of drug resistance in neoplasms: implications for systemic therapy. Cancer Research. 1984;44:3643–3653. [PubMed] [Google Scholar]

[R17] Haeno H, Iwasa Y, Michor F. The evolution of two mutations during clonal expansion. Genetics. 2007;177:2209–2221. doi: 10.1534/genetics.107.078915. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R18] Iwasa Y, Michor F, Komorova NL, Nowak MA. Population genetics of tumor suppressor genes. J Theor Biol. 2005;233:15–23. doi: 10.1016/j.jtbi.2004.09.001. [DOI] [PubMed] [Google Scholar]

[R19] Iwasa Y, Nowak MA, Michor F. Evolution of resistance during clonal expansion. Genetics. 2006;172:2557–2566. doi: 10.1534/genetics.105.049791. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R20] Kassen R, Bataillon T. Distribution of fitness effects among beneficial mutations before selection in experimental populations of bacteria. Nature Genetics. 2006;38:484–488. doi: 10.1038/ng1751. [DOI] [PubMed] [Google Scholar]

[R21] Kaern M, et al. Stochasticity in gene expression: from theories to phenotypes. Nature Reviews Genetics. 2005;6:451. doi: 10.1038/nrg1615. [DOI] [PubMed] [Google Scholar]

[R22] Knudson AD. Two genetic hits (more or less) to cancer. Nature Reviews Cancer. 2001;1:157–162. doi: 10.1038/35101031. [DOI] [PubMed] [Google Scholar]

[R23] Komarova NL, Wodarz D. Drug resistance in cancer: principles of emergence and prevention. Proc Natl Acad Sci USA. 2005;102:9714–9719. doi: 10.1073/pnas.0501870102. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R24] Maley CC, et al. Genetic clonal diveresity predicts progression to esophageal adenocarcinoma. Nature Genetics. 2006;38:468–473. doi: 10.1038/ng1768. [DOI] [PubMed] [Google Scholar]

[R25] Maley CC, Forrest Exploring the relationship between neutral and selective mutations in cancer. Artif Life. 2001;6:325–345. doi: 10.1162/106454600300103665. [DOI] [PubMed] [Google Scholar]

[R26] Michor F, Iwasa Y, Nowak MA. Dynamics of cancer progression. Nature Reviews Cancer. 2004;4:197–205. doi: 10.1038/nrc1295. [DOI] [PubMed] [Google Scholar]

[R27] Michor F, Nowak MA, Iwasa Y. Stochastic dynamics of metastasis formation. J Theor Biol. 2006;240:521–530. doi: 10.1016/j.jtbi.2005.10.021. [DOI] [PubMed] [Google Scholar]

[R28] Michor F, Iwasa Y. Dynamics of metastasis suppressor gene inactivation. J Theor Biol. 2006;241:676–689. doi: 10.1016/j.jtbi.2006.01.006. [DOI] [PubMed] [Google Scholar]

[R29] Nowak MA, Michor F, Iwasa Y. Genetic instability and clonal expansion. J Theor Biol. 2006;241:26–32. doi: 10.1016/j.jtbi.2005.11.012. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R30] Nowell PC. The cloncal evolution of tumor cell populations. Science. 1976;194:23–28. doi: 10.1126/science.959840. [DOI] [PubMed] [Google Scholar]

[R31] Orr HA. The distribution of fitness effects among beneficial mutations. Genetics. 2003;163:1519–1526. doi: 10.1093/genetics/163.4.1519. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R32] Otto SP, Jones CD. Detecting the undetected: Estimating the total number of loci underlying a quantitative trait. Genetics. 2002;156:2093–2107. doi: 10.1093/genetics/156.4.2093. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R33] Rokyta DR, Beisel CJ, Joyce P, Ferris MT, Burch CL, Wichman HA. Beneficial fitness effects are not exponential in two viruses. J Mol Evol. 2008;67:368–376. doi: 10.1007/s00239-008-9153-x. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R34] Rozen DE, de Visser JAGM, Gerrish PJ. Fitness effects of fixed beneficial mutations in microbial populations. Curret Biology. 2002;12:1040–1045. doi: 10.1016/s0960-9822(02)00896-5. [DOI] [PubMed] [Google Scholar]

[R35] Sanjuán R, Moya A, Elena SF. The distribution of fitness effects caused by single-nucleotide substitutions in an RNA virus. Proc Natl Acad Sci, USA. 2004;101:8396–8401. doi: 10.1073/pnas.0400146101. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R36] Schweinsberg J. The waiting time for m mutations. Electron J Probab. 2008;13:1442–1478. [Google Scholar]

[R37] Shah SP, et al. Mutational evolution in a lobular breast tumour profiled at single nucleotide resolution. Nature. 2009;461:809–813. doi: 10.1038/nature08489. [DOI] [PubMed] [Google Scholar]

[R38] Weissman I. Estimation of parameters and large quantiles based on the k largest observations. j Amer Stat Assoc. 1978;73:812–815. [Google Scholar]

[R39] Wodarz D, Komarova NL. Can loss of apoptosis protect against cancer? Trends Genet. 2007;23:232–237. doi: 10.1016/j.tig.2007.03.005. [DOI] [PubMed] [Google Scholar]

PERMALINK

Evolutionary dynamics of tumor progression with random fitness values

Rick Durrett

Jasmine Foo

Kevin Leder

John Mayberry

Franziska Michor

Abstract

1 Introduction

1.1 Bounded distributions

Theorem 1

Theorem 2

Figure 1.

Theorem 3

Theorem 4

Theorem 5

Figure 2.

1.2 Unbounded distributions

Theorem 6

Theorem 7

Conjecture 1

2 Preliminaries

3 Bounded distributions, Z1

Proof of Theorem 1

Proof of Theorem 2

Lemma 1

Proof

Lemma 2

Proof of Theorem 3

4 Bounded distributions, Zk

Proof of Theorem 4

Lemma 3

Proof

Lemma 4

Proof

Proof of Theorem 5

5 Proofs for unbounded distributions

Lemma 5

Proof

Proof of Theorem 6

Proof of Theorem 7

Remark 1

6 Discussion

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

3 Bounded distributions, Z₁

4 Bounded distributions, Z_k