Stochastic dynamics of cancer initiation

Jasmine Foo; Kevin Leder; Franziska Michor

doi:10.1088/1478-3975/8/1/015002

. Author manuscript; available in PMC: 2013 Feb 11.

Published in final edited form as: Phys Biol. 2011 Feb 7;8(1):015002. doi: 10.1088/1478-3975/8/1/015002

Stochastic dynamics of cancer initiation

Jasmine Foo ^1,^*, Kevin Leder ^1,^*, Franziska Michor ^1,^**

PMCID: PMC3569097 NIHMSID: NIHMS308328 PMID: 21301064

Abstract

Most human cancer types result from the accumulation of multiple genetic and epigenetic alterations in a single cell. Once the first change (or changes) have arisen, tumorigenesis is initiated and the subsequent emergence of additional alterations drives progression to more aggressive and ultimately invasive phenotypes. Elucidation of the dynamics of cancer initiation is of importance for an understanding of tumor evolution and cancer incidence data. In this paper, we develop a novel mathematical framework to study the processes of cancer initiation. Cells at risk of accumulating oncogenic mutations are organized into small compartments of cells and proliferate according to a stochastic process. During each cell division, an (epi)genetic alteration may arise which leads to a random fitness change, drawn from a probability distribution. Cancer is initiated when a cell gains a fitness sufficiently high to escape from the homeostatic mechanisms of the cell compartment. To investigate cancer initiation during a human lifetime, a ‘race’ between this fitness process and the aging process of the patient is considered; the latter is modeled as a second stochastic Markov process in an aging dimension. This model allows us to investigate the dynamics of cancer initiation and its dependence on the mutational fitness distribution. Our framework also provides a methodology to assess the effects of different life expectancy distributions on lifetime cancer incidence. We apply this methodology to colorectal tumorigenesis while considering life expectancy data of the US population to inform the dynamics of the aging process. We study how the probability of cancer initiation prior to death, the time until cancer initiation, and the mutational profile of the cancer-initiating cell depends on the shape of the mutational fitness distribution and life expectancy of the population.

1. Introduction

Tissues of multi-cellular organisms are organized into morphologically stable compartments or niches [1, 2]. Such compartments are made up of separate clones of cells, which proliferate to fulfill their organ-specific tasks [3, 4]. In healthy tissues, compartment sizes are stabilized by homeostatic mechanisms which induce compensatory regulatory responses via cellular signaling, apoptosis, and other processes [5]. During each cellular replication, a genetic or epigenetic alteration may arise. Many of those changes do not alter the reproductive fitness of the cell and are selectively neutral [6]. Some alterations, however, provide the cell with a fitness advantage due to increased proliferation capabilities, decreased death, enhanced migration and invasion, or the ability to induce angiogenesis [7]. Once a cell has evolved a sufficiently aggressive phenotype, it can escape from homeostatic control mechanisms and initiate tumorigenesis [8, 9, 10, 11, 12, 13, 14]. During the expansion of this initiated clone, additional genetic and/or epigenetic changes are accumulated that drive cancer progression and lead to more malignant and ultimately invasive phenotypes [7].

While tumorigenesis has classically been defined as a disease resulting from the accumulation of genetic alterations [15], it is becoming increasingly apparent that epigenetic modifications are similarly important for the initiation and progression of human cancer [16]. Patterns of DNA methylation and chromatin structure are significantly altered in cancer cells and include genome-wide losses of, and regional gains in, DNA methylation. Aberrant promoter methylation, for instance, is associated with the loss of gene function that can provide a selective advantage to tumor cells, as do genetic alterations. For example, the von Hippel-Lindau (VHL) gene, which causes familial renal cancer if mutated in the germ line, is often epigenetically silenced in the sporadic form of this tumor type [17]. Similarly, the breast cancer 1, early onset (BRCA1) and serine/threonine kinase 11 (STK11) genes are often epigenetically inactivated in sporadic cases of breast and colon cancer, respectively, while they predispose to these cancer types in carriers of a germ line mutation [18, 19]. In addition to gene silencing events that are associated with methylation changes, methylation patterns can also influence tumorigenesis by other mechanisms; cytosine methylation, for instance, can cause spontaneous hydrolytic deamination of cytosine and lead to C to T transitions [20]. This enhanced mutagenesis might lead to many gene alterations observed in tumors – as many as 50% of inactivating point mutations in the coding region of the human TP53 tumor suppressor gene in somatic cells occur at methylated cytosines [21]. Epigenetic alterations therefore cannot be neglected in the study of cancer.

Due to its importance for an understanding of tumorigenesis, the dynamics of cancer initiation have been the subject of several mathematical investigations. These studies considered cancer initiation to occur when a single cell has accumulated n specific alterations [22, 23, 24, 25, 26, 27]. The fitness effects of these alterations were assumed to be either neutral or advantageous; the latter was realized as an additive jump in the growth rate of the mutated cell. Several authors studied the situation in which the accumulation of two alterations is sufficient to initiate tumorigenesis [22, 23, 24, 25]; cells were considered to proliferate according to the Moran model in a population of fixed size [28]. Later on, scenarios in which n ≥ 2 mutations are necessary for cancer initiation were investigated [26, 27]. These contributions are part of a literature of mathematical approaches to cancer initiation and progression [29, 30, 31, 32, 33, 34, 35, 36, 37, 38, 39, 40, 41, 42, 43, 44, 45, 46, 47, 48, 49, 50, 51].

In this paper, we propose a novel mathematical model of cancer intiation. Unlike earlier efforts, we do not consider the situation in which cancer is initiated once a specified number of genetic alterations has been accumulated; rather, cancer initiation occurs as soon as the fitness of a cell passes a threshold value. Once this threshold is reached, the cell can escape from homeostatic control and initiates clonal expansion. In particular, we model a mechanism for the breakdown of homeostasis in a normally fixed-size compartment of cells via the accumulation of random mutational fitness changes emerging during cell replication. The fitness changes conferred by (epi)genetic alterations are modeled as random variates selected from a mutational fitness landscape. The fitness threshold necessary for cancer initiation can then be reached via a large number of mutations each conferring small fitness changes, a few mutations each conferring large fitness effects, or a mixture of large and small effects. We then investigate the dynamics of cancer initiation conditioned on the event that initiation occurs prior to death of the patient due to causes other than the cancer type of interest. This conditioning is addressed by adding a second dimension representing an aging process to the model. This framework is used to determine the probability of cancer initiation prior to death as well as the expected waiting time until cancer initiation. Furthermore, we investigate the expected number of neutral and non-neutral mutations that are present in the cancer-initiating clone. This quantity is of clinical interest since it provides insight into the genotype of the cancer-initiating cell and thus the genotypic composition and potentially the drug sensitivity of the resulting tumor. Our mathematical framework sheds light onto the effects of the shape and characteristics of the mutational fitness landscape on the dynamics of cancer initiation, and provides a methodology to assess the consequences of different life expectancy distributions on lifetime cancer incidence and the mutational composition of cancer-initiating cells.

2. The Model

Consider a compartment of N cells that proliferate according to a stochastic Moran process (see the Appendix for a discussion of this process) [28, 52]. This compartment consists of those cells that are at risk of accumulating the mutations leading to cancer initiation. If only tissue-specific stem cells live long enough to accumulate the necessary changes, then the population is made up of stem cells [53]; alternatively, the compartment additionally contains progenitor cells [50, 54]. Since healthy tissues are subdivided into small compartments of cells [1, 2], N is small. Initially, all cells are unmutated and have relative fitness 1. The time intervals between reproduction events are independent, identically distributed exponential random variables with mean 1/N. During each reproduction event, one cell is chosen at random to die, and one cell is chosen to reproduce according to its relative fitness. The population size is strictly constant. The probability of a specific cell being chosen for reproduction during an event is determined by the contribution of its fitness to the total fitness of the population; if there is a single cell of fitness s in a population of N − 1 cells of fitness 1, then the cell with fitness s is chosen to reproduce with probability s/[s + (N − 1)]. Here fitness is defined as the relative reproductive success of cells.

Mutational fitness distribution f_ψ

During each cell division event, an (epi)genetic alteration may occur with probability u ≪ 1; thus alterations arise in the compartment of cells at rate Nu. The fitness effects of individual alterations are random variates drawn from a probability distribution, f_ψ, which represents a mutational fitness landscape. This distribution f_ψ may be state-dependent and thus vary according to the fitness of the parent cell (e.g. $f_{ψ}^{x}$ is dependent on the fitness x of the parent cell). Figure 1 shows a schematic of this process for a state-independent distribution f_ψ. This flexible framework allows us to study the effects of mutational fitness landscapes on the dynamics of cancer initiation.

We consider a population of N cells residing in a compartment or niche of fixed size. These cells replicate according to a stochastic Moran process. During each elementary time step of the process, a cell is chosen at random proportional to fitness to divide and its offspring replaces another randomly chosen cell. During each cell division, a genetic or epigenetic alteration arises with probability u. Each alteration may confer a random additive fitness change to the cell. The parent cell, on the left, thus gives rise to a daughter cell with the same fitness, x, with probability 1 − u (upper right). With probability u, the parent cell produces a mutated offspring with fitness x + ψ, where ψ is a random additive fitness change selected according to the mutational fitness distribution *f_ψ* (lower right). If a cell within the compartment gains a sufficiently large fitness value, then tumorigenesis is initiated.

Dynamics in fitness space

Suppose the population consists of two types of cells at time t = 0: N − 1 normal cells and one cell carrying a single mutation. Define the waiting time until a cell in this population accumulates a second mutation as T_mut and the time until homogenization (i.e., until the cell carrying the mutation reaches zero or one hundred percent frequency in the population) as T_hom. Then we have

P (T_{mut} \leq T_{hom}) \leq 3 Nu (\log N + γ),

(1)

where γ is the Euler-Mascheroni constant. In other words, if 3Nu(log N + γ) ≈ 0, a cell harboring a mutation is far more likely to reach fixation in the compartment or die out before a second mutation arises. See the Appendix for the proof of equation (1). Therefore, when u ≪ 1/N², the periods of time during which the compartment is not homogeneous comprise a negligible amount of time. As the population size and/or the mutation rate increase during tumor progression, more complex dynamics emerge [55]; this scenario, however, is likely not applicable to situations of cancer initiation since in healthy tissues, compartment sizes and mutation rates are small. For the parameter ranges of interest for the study of cancer initiation, a cell compartment is almost always homogeneous.

Based on these considerations, the process describing the evolution of fitness values in a compartment of cells is approximated by a Markov process Z(·), where Z(t) represents the fitness of the homogeneous compartment at time t. The process Z jumps whenever a cell harboring a novel non-neutral mutation reaches fixation in the compartment, and takes values in the space of all possible fitness values dictated by the fitness landscape. By equation (1), this process closely approximates the behavior of cellular fitness values in a small compartment for the vast majority of time. Note that by focusing on the times when the process is homogenous, the state space of the system is significantly reduced, thus allowing for more feasible computational analysis.

Consider the fitness values a and b, for some 0 ≤ a < b, such that a compartment of cells remains in a homeostatic state for fitnesses within the range [a, b]; thus Z(t) ∈ [a, b] for all times t. Once a cell in the compartment gains a fitness greater than the threshold b, it escapes from homeostasis and cancer is initiated. The fitness value a is defined as a reflecting boundary for Z; any cell with fitness below a is immediately replaced by a cell with fitness a. For computational purposes, we consider the process Z living on a discretized state space on the range [a, b]. Since we consider a homeostatic cell compartment before initiation of tumorigenesis, a natural choice for the parameters is a = 1 − 1/N and b = 1 + 1/N since these values signify the boundaries for neutral evolution [56].

Let us now consider in detail the dynamics of Z in fitness space. We introduce the mutation kernel M(·, ·), where M(x, y) represents the probability that a cell with fitness x produces a daughter cell with fitness y (i.e. $M (x, y) = f_{ψ}^{x} (y - x)$ ). If y > x, then the fitness of the daughter cell is advantageous as compared to the fitness of its parent cell; if y < x, it is disadvantageous, and if y = x, it is neutral. Since neutral mutations are allowed to occur, M(x, x) can be non-zero. If a single cell of fitness y arises in a population of N − 1 cells of fitness x, the probability that the cell with fitness y reaches fixation in the population is given by

ρ_{x, y} = \frac{1 - x ∕ y}{1 - {(x ∕ y)}^{N}} .

(2)

This expression is called the fixation probability and can easily be found by first step analysis. By symmetry, we have ρ_x,x = 1/N. The intensity matrix for the Markov process Z is then defined as

Q (x, y) = N u ρ_{x, y} M (x, y),

for y < b. This expression becomes Q(x, y) = NuM(x, y) when y ≥ b. The reflecting behavior at the lower boundary is realized through the mutation kernel M. By definition, we have

Q (x, x) = \sum_{y \neq x} Q (x, y) .

Since the process stops as soon as one cell gains a fitness value greater than b, the states x ≥ b are absorbing, so Q(x, y) = 0 for all y and x ≥ b. The summation is over all y in the discrete fitness space.

Modeling the lifetime of a patient

The Moran model dynamics described by the coarse-grained process Z represent the change in the fitness values of cells in the compartment over time. In order to model the dynamics of cancer initiation during a human lifetime, a ‘race’ between this fitness process Z and the aging process of the patient is considered. To this end, we introduce a second stochastic process, L(t), which is a continuous-time Markov chain with an absorbing state representing death of the patient. Thus this process, L, hits the absorbing state with probability one and the first passage time of this event represents the age of the patient at death. The transition probabilities of this Markov chain can be tuned to match mortality data for the population of interest. We fit the dynamics of the process L to qualitatively describe the current life expectancy in the United States [57].

To address a similar problem, Liu and Lin [58] utilized a Coxian phase-type distribution to construct an aging process to fit life expectancy data. We follow a simplified version of this approach, and define L to be a Coxian process with d states. In particular, the process L(t) is initialized at state 1 and takes values in the state space {1,…,d}. If L(t) = ℓ < d, it jumps monodirectionally to state ℓ + 1 at rate 1, and the final state d is an absorbing state. The time for the process to hit the absorbing state is described by a Gamma(d − 1, 1) distribution. The transition intensity matrix for this process is (S)_i,j = S_i,j, which satisfies S_i,i = −S_i,i+1 and is constant along the diagonal. Let us denote this value along the main diagonal by S instead of unity, since changing the diagonal allows for flexibility in the choice of time scales with respect to the replication rate of cells in the compartment. Figure 2 shows that death of a patient occurs when the process L makes d − 1 upward steps; note that the time between each step is an exponential random variable with mean 1. The Gamma distribution provides a good qualitative approximation for the lifetime distribution of a population. When modeling the fraction of people that survive longer than x years via P(G > x), where G is a Gamma(81,1) random variable, then this tail probability stays relatively flat until around 68 years, at which point it begins a sharp decay towards zero.

We consider a stochastic process governing the evolution of fitness values of cells within the compartment (see Fig. 1) coupled with a stochastic process representing aging and death of the patient. The figure shows two sample path simulations of this two-dimensional process. The fitness of cells within the compartment is shown on the horizontal axis, and the lifetime of the patient is displayed on the vertical axis. If the trajectory hits the right boundary before reaching the upper border, then cancer initiation occurs before death of the patient (sample path (1)); if the trajectory hits the top boundary before reaching the right border, then death occurs before cancer initiation (sample path (2)). The fitness values bounding the area of interest are dictated by the cutoffs for neutral evolution [56].

In summary, we have designed a model of cancer initiation during a human lifetime using atwo-dimensional Markov process. The first dimension of this process, Z, represents the fitness of cells at risk of accumulating the mutations initiating tumorigenesis; these cells are organized into small compartments or niches of a constant size. The second dimension L represents the aging process of a patient. The state of the process Z is governed by the more detailed dynamics of a Moran process at the cellular level, and the process L is chosen so that its absorption time, which has a phase-type distribution ϕ, matches mortality data. This model has several parameters: the mutation rate u, the lifetime distribution ϕ(t), the number of cells in the compartment N, and the mutational fitness distribution f_ψ. This model is then used to determine the dynamics of the event that cancer initiation occurs prior to death, i.e. that the right boundary of Z is reached (corresponding to one cell reaching a fitness value greater than or equal to the boundary value a) prior to absorption in the L-direction. Figure 2 shows a diagram of this process and two possible sample paths. All paths initiate at (Z = 1, L = 1) and sample path (1) demonstrates a trajectory in which cancer initiation occurs prior to death; such paths, however, comprise only a fraction of all possible outcomes. Sample path (2) visualizes a trajectory in which death occurs prior to cancer initiation. Note that cancer initiation is not equivalent to diagnosis of the disease, so that the initiation statistics cannot be compared to cancer incidence data.

3. Analysis

The stochastic model outlined above is used to determine several quantities: (i) the probability of cancer initiation prior to the time of death; (ii) the waiting time until cancer initiation prior to death, and (iii) the expected number of neutral and non-neutral mutations present in the cancer-initiating cell, conditional to cancer initiation occurring before death. The estimates for these quantities are obtained in the form of the solution to simple linear systems in order to keep the analysis applicable to general mutation kernels M. These estimates are later used to study the effect of the mutation kernel and of lifetime distributions on the dynamics of cancer initiation.

3.1. The probability of cancer initiation prior to death

For a given state in the two dimensional space (x, r), let I(x, r) denote the probability of Z(t) reaching a fitness value above b (i.e. initiating cancer) before L(t) reaches the absorbing state d (i.e. death), starting from the initial condition Z(0) = x, L(0) = r. For the process Z, there can be a transition of the form x → y and for the process L(t), there can be a transition of the form r → s. Thus when the current state of the two processes is (x, r), then conditional on a jump occurring, the jump v has the following probability distribution:

\begin{matrix} P (v = (y - x, 0)) = \frac{N u ρ_{x, y} M (x, y)}{- Q (x, x) - S} ≐ P_{Z} (x, y ∣ r), \\ P (v = (0, s - r)) = \frac{S}{- Q (x, x) - S} ≐ P_{L} (r, s ∣ x), \end{matrix}

for any y ≠ x in the fitness space and any s ∈ {1,…,d}, s ≠ r.

By first step analysis, we obtain a linear system that the probabilities I(x, r) satisfy:

I (x, r) = \sum_{y} I (y, r) P_{Z} (x, y ∣ r) + \sum_{s} I (x, s) P_{L} (r, s ∣ x) .

(3)

We also have the boundary conditions I(x, d) = 0 for all x ∈ [a, b] and I(b, r) = 1 for r ∈ [0, d). This linear system can easily be solved to obtain I(x, r). Cancer initiation is then defined as the event that the fitness of one cell reaches a = 1+1/N before death of the patient; the probability of initiation prior to death is p_i ≡ I(1, 1). The transition probabilities for the compound process (Z,L) conditioned upon initiation prior to death are given by

{\tilde{P}}_{Z} (x, y ∣ r) ≐ P_{Z} (x, y ∣ r) \frac{I (y, r)}{I (x, r)}

(4)

and

{\tilde{P}}_{L} (r, s ∣ x) ≐ P_{L} (r, s ∣ x) \frac{I (x, s)}{I (x, r)} .

(5)

3.2. The waiting time until cancer initiation

Let τ represent the time of cancer initiation, and σ the time at which the process L is absorbed, i.e. the patient dies. Denote the time until the first jump of the process (Z, L) by T₁. Then we have

E [θ^{T_{1}}] = E [θ^{T_{1}} ∣ τ < σ] = \frac{- S - Q (x, x)}{\log θ - S - Q (x, x)} .

This result is important because of the following:

E [θ^{τ} ∣ (Z (T_{1}), L (T_{1})) = (y, r)] = E [θ^{T_{1}}] E_{(y, r)} [θ^{τ}] .

This expression results from the independence of the duration of the first jump time and all subsequent jump times. The generating function for the conditioned initiation time starting from state (x, r) is defined as

G (x, r, θ) = E_{(x, r)} [θ^{τ} ∣ τ < σ] .

(6)

Then, performing first step analysis, we obtain the linear system:

\begin{matrix} G (x, r, θ) \\ = \frac{- S - Q (x, x)}{\log θ - S - Q (x, x)} (\sum_{y} {\tilde{P}}_{Z} (x, y ∣ r) G (y, r, θ) + \sum_{s} {\tilde{P}}_{L} (r, s ∣ x) G (x, s, θ)) . \end{matrix}

The boundary condition is G(b, r, θ) = 1.

We perform a simpler calculation to determine the mean waiting time until cancer initiation. Define w(x, r) to be the expected time until initiation starting from the state (x, r), conditioned on initiation prior to death: $w (x, r) = E_{x, r} [τ ∣ τ < σ]$ . Then we have

w (x, r) = \sum_{y} {\tilde{P}}_{Z} (x, y ∣ r) w (r, y) + \sum_{s} {\tilde{P}}_{L} (r, s ∣ x) w (x, s) + \frac{1}{- Q (x, x) - S}

(7)

with boundary condition w(b, r) = 0.

3.3. The number of neutral mutations in the cancer-initiating cell

An accurate understanding of the genomic composition of the cell that leads to clonal expansion and cancer initiation may aid in the identification of drug targets. Furthermore, it helps to elucidate the distribution of advantageous and passenger (selectively neutral) mutations in cancer. Let us first consider the number of neutral mutations that have been accumulated in the compartment before cancer is initiated. Denote the number of neutral mutations present in the compartment at time t by m(t). Next we compute the function

μ (x, r) = E_{(x, r)} [m (τ) ∣ τ < σ] .

(8)

Thus, μ represents the number of neutral mutations accumulated in the cancer-initiating cell conditional to cancer initiation occuring before death of the patient.

Between jumps of the two-dimensional process (Z, L), a random number of neutral mutations can reach fixation within the compartment of cells. Let T_j and T_j+1 be the jump times of (Z, L), and for simplicity denote (Z(T_j), L(T_j)) = X_j. During the transition from X_j to X_j+1, the compartment can accumulate Y_j (X_j) neutral mutations. Define

η (x, r) = \frac{u M (x, x)}{u M (x, x) - Q (x, x) - S} .

The top of the fraction represents the rate at which neutral mutations which eventually reach fixation arise within the compartment, and the bottom of the fraction represents the total rate at which fixating mutations arrive and the time process changes. With this definition, Y_j(x, r) is distributed like a geometric random variable with

P (Y_{j} (x, r) = n) = η {(x, r)}^{n} (1 - η (x, r)),

which gives

E [Y_{j} (x, r)] = \frac{η (x, r)}{1 - η (x, r)} .

By conditioning on the first step, we obtain that μ(·, ·) satisfies the following linear system for each possible fitness x in [a, b]:

μ (x, r) = \frac{η (x, r)}{(1 - η (x, r))} + \sum_{y} {\tilde{P}}_{Z} (x, y ∣ r) μ (y, r) + \sum_{s} {\tilde{P}}_{L} (r, s ∣ x) μ (x, s) .

Note that μ(x, r) = 0 for those fitnesses that lie outside of [a, b]. Therefore μ(·, ·) can be determined by solving the linear system above.

3.4. The number of non-neutral mutations in the cancer-initiating cell

Let us next determine the mean number of advantageous and disadvantageous mutations in the intiating cell. Let n(t) be the number of non-neutral mutations present in the compartment of cells at time t. Then, the number of non-neutral mutations at initiation is given by

ν (x, r) = E_{(x, r)} [n (τ) ∣ τ < σ],

where ν(x, r) satisfies the system

ν (x, r) = \sum_{y \neq x} {\tilde{P}}_{Z} (x, y ∣ r) (ν (y, r) + 1) + \sum_{s} {\tilde{P}}_{L} (r, s ∣ x) ν (x, s)

with boundary condition ν(b, r) = 0. The expected number of advantageous or disadvantageous mutations present can be obtained in a similar fashion. For example, by defining n_a(t) to be the number of advantageous mutations present at time t, the number of advantageous mutations present at initiation is given by

ν_{a} (x, r) = E (x, r) [n_{a} (τ) ∣ τ < σ]

and ν_a(x, r) satisfies the linear system

\begin{matrix} ν_{a} (x, r) = & \sum_{y > x} {\tilde{P}}_{Z} (x, y ∣ r) (ν_{a} (y, r) + 1) + \sum_{y < x} {\tilde{P}}_{Z} (x, y ∣ r) ν_{a} (y, r) \\ + \sum_{s} {\tilde{P}}_{L} (r, s ∣ x) ν_{a} (x, s), \end{matrix}

(9)

where the boundary condition is again ν_a(b, r) = 0.

4. Application of the model to colorectal cancer initiation

Let us now apply this mathematical framework to colorectal cancer as a specific example. Colorectal tumors progress through four distinct clinical stages: dysplastic crypts, small benign tumors, malignant tumors invading surrounding tissues, and finally metastatic cancer [11]. This progression is driven by the accumulation of several genetic changes [59]. Mutations of the adenomatous polyposis coli (APC) gene are considered the earliest and most prevalent genetic changes in colorectal tumorigenesis [11]; other important contributors are mutations in the KRAS and TP53 genes [11]. Colonic tumors arise from the rapidly proliferating epithelium of the colon. This epithelium is organized into N_crypts = 10⁷ compartments of cells called crypts [60]. Each crypt contains about 1000 - 4000 cells. Approximately 4 - 10 of those cells are stem cells, residing at the base of each crypt [4, 61]. The progeny of stem cells migrate up the crypt, continuing to divide until they reach its mid-portion. Then they stop dividing and differentiate into mature cells. When the cells reach the top of the crypt, they undergo apoptosis and are engulfed by stromal cells or shed into the gut lumen. The cell migration from the base to the top of the crypt takes about 3 - 6 days [62].

To study the probability of cancer initiation from any crypt, we estimate the probability of initiation from a single crypt prior to death, p_i. Then the number of crypts containing cancer-initiating cells is binomially distributed with parameters N_crypts and p_i. Taking a Poisson approximation, the probability of cancer initiation prior to death from any crypt becomes approximately 1 − exp[−p_iN_crypts], and the average number of crypts containing cancer-initiating cells emerging prior to death is N_crypts × p_i. Note that if p_i is of order 10⁻⁶ or greater, the probability of having at least one cancer-initiated crypt before death is 1. If p_i is on the order of 10⁻⁸ or less, the probability of having at least one such crypt before death is much less than 1. Unfortunately no data is available on the frequency of cancer-initiated crypts in the population since these crypts are generally too small to be detected by routine colonoscopies, but slightly larger growths – adenomatous polyps – are observed in approximately 50% of people above age 70 [63, 64, 65]. These statistics can be used to guide parameter choices for the mutational fitness distribution.

For the purpose of our mathematical model, we consider a compartment size of N = 10 colonic stem cells and a stem cell division frequency of approximately once per week [66, 67]. The overall mutation rate – giving rise to neutral, advantageous or deleterious mutations – is about u = 0.001 per cell division, since there are 3 · 10⁹ base pairs in the human genome and the per base pair mutation rate is about 10⁻¹¹ to 10⁻¹² [68]. There are no estimates for the shape of the mutational fitness distribution for human colonic epithelial cells. Therefore, to investigate the dependence of the dynamics of cancer initiation on the mutational fitness distribution f_ψ, we construct a general family of fitness distributions with a mode at zero. For simplicity, assume that the mutational fitness distribution is state independent: $f_{ψ} = f_{ψ}^{x}$ . The general form of the mutational distribution is then given by

P (ψ = k Δ) = c α^{k}, k \in N,

and

P (ψ = - k Δ) = c β^{k}, k \in N,

where c is a normalization constant. In practice, we use a truncated approximation of this unbounded distribution. The point mass at the origin represents neutral mutations, such as alterations in non-coding DNA. The parameter Δ represents the fitness space discretization, which is set as Δ = 0.004. This distribution thus has two key shape parameters: α, which controls the decay rate on the right (i.e. advantageous mutations), and β, which controls the decay rate on the left (i.e. deleterious mutations). When α and β are set to 1, the distribution is uniform. Note that a larger probability of neutral mutations can be added by considering a mixture of this distribution with a point mass at 0. Figure 3 illustrates this family of distributions. An increase in the shape parameter β while holding α constant results in a more slowly decaying deleterious tail of the distribution; this leads to a higher percentage of disadvantageous mutations with more negative fitness jumps possible. Similarly, an increase in α while holding β constant results in a higher percentage of advantageous mutations with larger positive fitness jumps possible (distribution not shown). This general form of fitness distributions is next used to determine the consequences of changing the distribution on the dynamics of cancer initiation.

The stochastic process governing the evolution of fitness values of cells within a compartment depends on the shape of the fitness distribution conferred by mutations. For simplicity, we consider a general family of fitness distributions determined by the shape parameters α and β, which govern the decay of advantageous and deleterious mutations, respectively. This flexible setup allows us to study the effects of varying the mutational fitness distribution on the dynamics of cancer initiation. As specific examples, the figure displays the probability density function of *f_ψ* for α = 0.5 and β = 0.4, 0.5, 0.6.

4.1. Effects of varying the mutational fitness distribution, f_ψ

Let us now investigate the results of modifying the shape parameters of the mutational fitness distribution on the dynamics of cancer initiation from a single compartment of cells. Throughout this section, we consider an aging process L whose properties are qualitatively fitted to the US mortality data [57], with d = 82 states.

We first study the consequences of varying the decay rates, α and β, of the positive and negative tails of the mutational fitness distribution f_ψ on the probability of cancer initiation. Figure 4a and b displays the probability of initiation before death, p_i, as the shape parameters α and β are varied; here p_i is determined by solving equation (3) for I(1, 1). Fig. 4a shows the probability of cancer initiation as a function of α, for different values of β; note that as α increases, the probability of initiation is enhanced since the frequency and fitness advantages of beneficial mutations increase. Similarly, as β increases (Fig. 4b), the probability of cancer initiation decreases. The incidence of human tumors provides some clues as to the relative magnitude of α and β. Based on the frequency of adenomatous polyps in the US population (≈ 50%) [63, 64, 65], the probability of harboring a cancer-initiated crypt before death is estimated between 50 – 90%. Thus it is unlikely that p_i ≫ 10⁻⁷, since this choice would result in an unrealistically high probability of cancer initiation. Therefore, it is unlikely that α ≫ β, corresponding to a vast majority of mutations being advantageous, because the probability of cancer initiation would also be unrealistically high for this regime.

(a) and (b) The panels display the probability of cancer initiation from a single compartment of cells, *p_i*, before death of the patient due to causes other than the cancer type of interest. The panels are plotted on a semilogarithmic plane for clarity. The probability of cancer initiation increases with α and decreases with β. (c) and (d) The panels display the expected time of cancer initiation, conditioned on initiation occurring before death of the patient. (a) and (c) We vary the shape parameter α, which governs the decay rate on the right of the distribution (i.e. advantageous mutations). (b) and (d) We vary the shape parameter β, which governs the decay rate on the left of the distribution (i.e. deleterious mutations). The aging process L is fit to US life expectancy data [57] and parameters are u = 0.001 and N = 10.

Let us now investigate the expected time until cancer initiation conditional upon initiation occurring prior to death. This quantity is found by solving for w(1, 1) in equation (7), and corresponds to the age of a patient at the time of cancer initiation. Note again that this quantity is not synonymous to the age of a patient at diagnosis with a tumor. Figure 4c and d shows the conditional expected time of cancer initiation as a function of varying α (Fig. 4c) and β (Fig. 4d). The dynamics of this system display complex nonlinear behaviors; when α is large, the initiation time declines sharply, since a mutational fitness distribution placing more mass on advantageous mutations results in faster cancer initiation. In this regime, increasing β results in a larger conditional initiation time since increasing mass is placed on disadvantageous mutations, and therefore more time is required for cancer initiation. However, for small values of α, alarger β results in exactly the opposite behavior – shorter average conditional initiation times. This observation is counterintuitive, since distributions with large β (and hence mostly disadvantageous mutations) are expected to result in a longer initiation time as in the case of large α. Similar dynamics can be observed in Fig. 4d, where the initiation time, τ_init, is shown as a function of varying β for several values of α. When β is small, the probability of initiation is high and increasing α results in smaller initiation times; when β is large, increasing α results in the opposite effect.

Figures 4c and d demonstrate that these interesting dynamics occur in regimes that coincide with small cancer initiation probabilities (see Figures 4a and b), suggesting that the decrease in the conditional initiation time is caused by a selection effect on the sample paths due to the conditioning event. This regime includes the biologically relevant range in which the probability of initiation is realistically small (e.g., approximately 10⁻⁷) for asingle compartment of cells. Thus we hypothesize that in the regime in which the probability of initiation, p_i, is small, the sample paths that result in initiation are the relatively rare paths in which the average advantageous mutational fitness jump is high, resulting in a low initiation time. As β increases, the probability of initiation decreases so this selection effect is enhanced. In other words, paths reaching cancer initiation in this low p_i regime experience fewer, but more beneficial, advantageous mutations as the mutational fitness distribution is skewed more to the left (i.e. as β increases).

To investigate this behavior in greater detail, let us next examine the conditional average number of advantageous and disadvantageous mutations that reach 100% frequency in the population prior to cancer initiation. This quantity is given by equation (9). Figure 5a shows the conditional average number of advantageous mutations as a function of increasing α (Fig. 5a). At high values of α (corresponding to a high p_i regime), increasing β results in more advantageous mutations, whereas at low values of α (corresponding to a low p_i regime), increasing β results in fewer advantageous mutations. This observation agrees with our hypothesis that the rare sample paths that reach initiation in the low p_i regime experience fewer but stronger advantageous mutations as β increases. To test this hypothesis, we performed Monte Carlo simulations of the two dimensional process (Z, L) conditioned upon initiation to evaluate the average fitness advantage of beneficial mutations. We investigated low p_i regimes and observed that as β increases, the distribution of average fitness changes of advantageous mutations shifts to the right (Fig. 5b). For parameters in the high p_i regime (i.e., high α), this phenomenon does not occur (data not shown). This observation, together with the analysis of the expected number of advantageous mutations in the initiating cell, independently confirm the hypothesis that in the regime of low p_i, conditioning on initiation prior to death results in the selection of rare sample paths with fewer, and more beneficial, advantageous mutations as β increases.

(a) The panel displays the expected number of advantageous mutations in the cell that reaches the fitness threshold 1 + 1/N and initiates tumorigenesis; these numbers of mutations are conditioned on cancer initiation occurring before death of the patient due to causes other than the cancer type of interest. We vary α, for β = 0.4, 0.5, 0.6, and display the expected number of advantageous mutations in the cancer initiating cell. The aging process L is fit to US life expectancy data [57] and parameters are u = 0.001 and N = 10. (b) The panel shows the empirical density (histogram) of the average fitness change of advantageous mutations that reach fixation in the compartment of cells prior to cancer initiation, conditioned on initiation occurring before death. Parameter values are α = 0.15 and β = 0.3(top), α = 0.15 and β = 0.5(middle), and α = 0.15 and β = 0.7(bottom). The otherparameters are as in (a). (c) The panel showsthe conditional expected number of advantageous, neutral and disadvantageous mutations that have reached fixation in the compartment prior to cancer initiation. The mode of the lifetime distribution *ϕ_mode* is varied. Parameters are u = 0.001, α = 0.25, β = 0.5 and N = 10.

In summary, in the biologically relevant regime with a relatively small probability of cancer initiation before death (approximately 10⁻⁷ per compartment), the dynamics of initiation display interesting dependencies on the shape parameters of the mutational fitness distribution f_ψ. As the parameter β increases and the distribution is skewed towards deleterious mutations, the average number of advantageous mutations and the average time until initiation both decrease, somewhat counterintuitively. This behavior occurs due to the strong selection effect of conditioning on cancer initiation prior to death; very few sample paths reach the event of cancer initiation, and these paths have selectively fewer, but effectively stronger, advantageous mutations. Therefore, an explicit consideration of the temporal axis (and conditioning on initiation occurring prior to death) reveals counterintuitive, qualitatively different dynamics that would not be observed in models without this essential feature. Determining the shape characteristics of the mutational fitness distribution would elucidate key properties of the dynamics of emergence and genotype of colorectal cancer-initiating cells. Towards this end, our analyses suggest that it is unlikely that α ≫ β (i.e. many more advantageous than disadvantageous mutations) in the mutational fitness distribution of stem cells in the colonic crypt, since this regime would lead to unrealistically high cancer initiation probabilities. Additionally, in the parameter space in which the probability of initiation is plausibly small, we expect approximately O(10) advantageous mutations in the initiating cell (Figure 5a); recall, however, that the scenario studied is bounded by the fitness values of neutral evolution and therefore, advantageous mutations confer a higher fitness to the cell but are not considered true ‘driver’ mutations.

4.2. Effects of varying the mode of the lifetime distribution

Let us now investigate the sensitivity of the dynamics of colon cancer initiation to the life expectancy distribution. Since lifetime distributions vary between different countries, socioeconomic classes and races, this methodology can provide insight into the differing cancer incidence rates among these populations. To investigate this dependency, we retain the same phase-type Gamma distribution structure for the lifetime distribution, but vary the mode, ϕ_mode, and consider the effects of longer and shorter overall life expectancies on the probability of cancer initiation. Table 1 displays the probability of cancer initiation prior to death for a range of values of ϕ_mode. The data in Table 1 is shown for three mutational fitness distributions where α < β (corresponding to more disadvantageous mutations), α = β (corresponding to a symmetric distribution), and α > β (corresponding to more advantageous mutations). The mode of the data fitting current US life expectancy data is ϕ_mode = 80. As the life expectancy increases, the probability of cancer initiation before death increases subexponentially. Furthermore, the ratio of α to β significantly impacts the initiation probability; when β > α, disadvantageous mutations are more frequent than advantageous mutations, and the probability of a cell accumulating fitness 1 + 1/N (i.e. cancer initiation) is low. When β < α, advantageous mutations are more frequent and thus the probability of initiation is increased. These results suggest that the probability of cancer initiation is highly sensitive to the mutational fitness distribution, and in particular to the ratio of beneficial to deleterious mutations.

Table 1. The effects of the life expectancy distribution on cancer initiation.

The table displays the probability of cancer initiation prior to death of the patient for a varying mode of the lifetime distribution, ϕ_mode. Three mutational fitness distributions are considered: α < β, α = β, and α > β. As the life expectancy increases, the probability of cancer initiation before death also increases. Furthermore, the ratio of α to β significantly impacts the initiation probability; when β > α, disadvantageous mutations are more frequent than advantageous mutations, and the probability of a cell accumulating fitness 1 + 1/N (i.e. cancer initiation) is low. When β < α, advantageous mutations are more frequent and thus the probability of initiation is enhanced. The mode of the data fitting current US life expectancy data is ϕ_mode = 80. Parameters used are u = 0.001 and N = 10.

ϕ _mode	α = 0.25, β = 0.5	α = 0.3, β = 0.3	α = 0.5, β = 0.25

45	4.0450e-10	1.0608e-06	1.5979e-02
50	6.3142e-10	1.9604e-06	2.4384e-02
55	9.3832e-10	3.4264e-06	3.5433e-02
60	1.3372e-09	5.7100e-06	4.9395e-02
65	1.8381e-09	9.1313e-06	6.6441e-02
70	2 4486e-09	1 4086e-05	8.6642e-02
75	3.1736e-09	2.1050e-05	1 0996e-01
80	4.0147e-09	3.0584e-05	1 3627e-01
85	4.9705e-09	4.3330e-05	1.6533e-01
90	6.0369e-09	6.0017e-05	1.9686e-01

Open in a new tab

We also determined the conditional expected number of neutral and non-neutral mutations that reached fixation in the population prior to cancer initiation. Figure 5c shows the conditional expected number of advantageous, neutral, and disadvantageous mutants in the initiating cell. As the mode of the lifetime distribution increases, the average number of each type of mutation increases linearly. In addition, the ratio of the conditional expected number of neutral to conditional expected number of advantageous/disadvantageous mutations appears invariant to ϕ_mode. Note that the number of disadvantageous, neutral, and advantageous mutations in the initiating cell does not correspond to the frequencies of these mutations in the mutational fitness distribution f_ψ. For example, Figure 5c demonstrates that the frequency of neutral mutations in the total mutation incidence is approximately 30 – 35%; however, the mutational fitness distribution, f_ψ, used in this example places approximately 40% of its mass on neutral mutations. This difference is once again an effect of conditioning; sample paths in the set in which cancer initiation occurs prior to death exhibit a different mutational fitness distribution than the original distribution. This observation also demonstrates the necessity of explicitly considering a temporal process and conditioning on cancer initiation prior to death, since this difference would not be observed in models without this feature. We conclude that colon cancer incidence is predicted to be lower in patient populations with lower life expectancies; in those cases in which tumorigenesis is initiated, fewer passenger mutations are present in the cancer-initiating cell as compared to cases in populations with larger life expectancies.

4.3. Sensitivity to the mutation rate

Finally, let us investigate the sensitivity of the probability of colon cancer initiation prior to death to the mutation rate u. Figure 6 shows the probability of initiation as a function of the mutation rate in the setting β > α. The probability of cancer initiation decreases super-exponentially as the mutation rate decreases. This functional dependence is studied for two different values of ϕ_mode; the super-exponential decay is similar in both scenarios, but the curves are shifted by a constant. Thus we expect that patient populations with higher mutation rates accordingly have a higher colon cancer incidence; this observation is consistent with increased cancer incidence after exposure to radiation and other carcinogens.

The figure shows the probability of cancer initiation prior to death of the patient for varying mutation rate u. Two values for the mode of the lifetime distribution are shown: *ϕ_mode* = 50, 80. Parameters are α = 0.4, β = 0.6 and N = 10.

5. Discussion

In this paper, we have introduced a novel stochastic model to investigate the dynamics of cancer initiation prior to the death of the patient. In the context of this model, cancer initiation occurs when a single cell within a small fixed-size compartment of cells acquires the characteristics necessary to initiate clonal expansion of cells. We have introduced several novel features with this model:

Genetic and epigenetic alterations confer random fitness changes to the cell.
Cancer initiation is a fitness-dependent event – once a cell gains a sufficiently large fitness, clonal expansion ensues.
The stochastic process governing the evolution of cellular fitness is coupled with a temporal aging process to determine the dynamics of cancer initiation prior to death of the individual.

These features are unlike earlier models of cancer initiation [22, 23, 24, 25, 26, 27]; in these models, cancer initiation was assumed to occur as soon as a single cell has accumulated a specific number and type of mutations, which were considered to confer fixed fitness effects on cells. Furthermore, these earlier models did not couple the dynamics of mutations arising in populations of cells with the processes of aging and death of individuals. Without considering the latter dynamics, however, the consideration of cancer initiation is incomplete.

We have considered a Moran process describing a compartment of cells in which genetic and epigenetic alterations confer random fitness changes which are variates drawn from a mutational fitness distribution. This approach allows for the inclusion of all possible (epi)genetic alterations arising in proliferating cells rather than a single class of mutations. We considered a spectrum of mutational fitness effects, from mutations which are positively selected during tumorigenesis, to passenger mutations which are selectively neutral, and deleterious alterations which confer a fitness cost to the cell. The consideration of non-advantageous mutations is important because most alterations have the potential to affect the evolutionary dynamics of cancer initiation, even though they may not play a direct and causative role in carcinogenesis. For instance, the accumulation of mutations can alter the fitness of a cell and thus affect the likelihood and timing of cancer initiation. We have introduced a second dimension to the process of cancer initiation to ensure that the dynamics are dictated by tumors that initiate within a human lifetime. This goal is achieved by modeling the lifetime of a patient as the first passage time of a simple continuous-time Markov chain in an aging dimension, and investigating the conditioned dynamics on the set for which cancer initiation occurs before patient death. The explicit consideration of the temporal axis and conditioning on cancer initiation occurring prior to death reveals counterintuitive, qualitatively different dynamics that would not be observed in models without this essential feature.

We then examined the dynamics of colon cancer initiation as a specific example. We utilized parameter values dictated by the geometry of colonic crypts, the number of colonic stem cells, and the mutation rate estimated for human cells. Furthermore, we considered the life expectancy data of the US population to inform the dynamics of the aging process. Since no estimates of the shape of the mutational fitness distribution are available, we chose a specific parametric family of distributions representing a broad class of distributions that includes exponential decay and uniform distributions. We then studied how the probability of cancer initiation prior to death, the time until cancer initiation, and the mutational profile of the cancer-initiating cell depends on the shape of the mutational fitness distribution, the life expectancy distribution, and the mutation rate. We observed qualitatively different dynamics in parameter regimes where the probability of cancer initiation is low as compared to those biologically unrealistic scenarios in which tumorigenesis is initiated often and quickly. In particular, we observed that tumors that are initiated in the regime of low initiation probabilities experience fewer, but more beneficial, advantageous mutations as the mutational fitness distribution is skewed more towards disadvantageous alterations. This behavior arises due to the strong selection effect of conditioning on cancer initiation prior to death; very few evolutionary trajectories lead to initiation before the death of the patient, and these trajectories accumulate fewer but selectively stronger advantageous mutations. Furthermore, this analysis demonstrated that human populations with shorter average life expectancies have fewer cases of cancer initiation; in those cases in which tumorigenesis is initiated, fewer passenger mutations are present in the cancer-initiating cell. Note that passenger mutations are defined as those that do not confer any fitness change to the cell; the number of passenger mutations in the cancer-initiating cell we determine using this model might differ from that identified by cancer genome screens since the latter analyses might identify all neutral and near-neutral mutations as ‘passengers’. This number might then represent an overestimation of the number of true (strictly neutral) passenger mutations. Note also that the mutational distribution of the cancer-initiating cell is substantially different from the originating distribution of mutational fitness effects, exhibiting another differential result of conditioning on cancer initiation prior to death. This analysis could be extended to comparatively study the dynamics of cancer initiation in populations with varying life expectancy distributions, such as different countries, ethnic groups or socio-economic classes. To perform such analyses, it is necessary to consider differential dynamics in Z as well as the aging process L. In particular, certain populations may have higher mutation rates due to behavior, diet and other environmental factors which are in turn linked to life expectancy.

This mathematical framework represents only one out of many possibilities of modeling the processes leading to cancer. For simplicity, we have neglected spatial effects that could lead to different dynamics in separate areas of the same compartment or tissue. We have therefore considered colonic crypts to be independent from one another. However, crypt division and replacement by neighboring crypts may induce a complex local dependence between cellular compartments; these issues will be explored in future work. Furthermore, we have not considered interactions of normal and mutated cells with the immune system; such interactions may modulate the dynamics of cancer initiation since certain cell types may be inhibited by immune system cells. Also, we have not investigated the effects of temporally or spatially varying fitness values of cells which could result from their interactions with the microenvironment or other cell types. Our work can further be extended to consider a dependency between the dynamics of the fitness process Z and the aging process L. Specifically, as a patient ages, DNA repair mechanisms deteriorate, thus resulting in higher mutation rates; consideration of such scenarios will be the topic of future work. For an accurate understanding of the dynamics of tumorigenesis in human populations, the elucidation of mutational fitness distributions as well as cell type-specific mutation rates is essential. The determination of these parameters in experimental systems is an important goal of the field and would contribute to the investigation of the evolutionary dynamics of human cancer.

Acknowledgements

This work is supported by NCI grant U54CA143798.

6. Appendix

6.1. Standard Moran Process

The Moran process [28] is a standard tool for studying populations that maintain a constant size N (see e.g., [24, 69]). In the traditional setting, two types of cells are considered – cell types a and A – whichhave respective fitness values (i.e. growth rates) f and 1. Reproduction events arrive according to a Poisson process with rate N. If the number of type A cells is j, then the probability that a type A cell will reproduce during the next birth is given by j/[j + f(N − j)]. In addition, after each birth event a cell is randomly chosen to die. The Moran process is the Markov process that tracks the number of type A (or a cells) cells over time.

6.2. Multi-Type Moran Process

As a departure from the Moran process described in the previous subsection, we consider afixed-size population of cells containing greater than two types of cells. Each cell may have a fitness in the set $F = {f_{1}, \dots, f_{K}}$ . The state of the system at time t is described by $Z_{t} = (Z_{t}^{1}, \dots, Z_{t}^{K})$ , where $Z_{t}^{K}$ represents the number of cells with fitness f_k at time t. Reproduction events occur at the arrival times of a Poisson process with rate N; these times are denoted by {T_i}_i≥1. Given that the current state of system is z = (z¹,…,z^K), during the next reproduction event a cell of fitness f_k will be chosen to reproduce with probability

\frac{z^{k} f_{k}}{\sum_{j = 1}^{K} z^{j} f_{j}},

and a cell is chosen at random to die. During each cell division event, there is a probability u of producing a daughter cell with a new fitness value which is chosen from amutational fitness distribution. Equation (1) states that this process can be replaced with a simpler process which jumps between homogeneous states (i.e., all cells in the compartment have the same fitness). To investigate the accuracy of this approximation, let us consider a system that starts in state z₀, where N − 1 cells have fitness f and one cell has fitness $\hat{f}$ . We will use the following notation: $P_{z} (\cdot) ≐ P (\cdot ∣ Z_{0} = z)$ . Then we have

Lemma 1

P_z0(T_mut < T_hom) ≤ 3Nu(log N + γ).

Proof of Lemma 1

Define N(t) to be the number of replication events by time t, and let m(n) = 1 if a mutation occurs at replication event n. Then note that

\begin{matrix} P_{z_{0}} (T_{mut} < T_{hom}) = & \sum_{n = 1}^{\infty} P_{z_{0}} (N (T_{mut}) = n, N (T_{hom}) > n) \\ \leq & \sum_{n = 1}^{\infty} P_{z_{0}} (N (T_{mut}) = n, N (T_{hom}) > n - 1) \\ \leq & \sum_{n = 1}^{\infty} P_{z_{0}} (m (n) = 1, N (T_{mut}) > n - 1, N (T_{hom}) > n - 1) \\ = & u \sum_{n = 1}^{\infty} P_{z_{0}} (N (T_{mut}) > n - 1, N (T_{hom}) > n - 1) \\ \leq & u \sum_{n = 1}^{\infty} P_{z_{0}} (N (T_{hom}) > n - 1 ∣ N (T_{mut}) > n - 1) . \end{matrix}

Then a conditioning argument shows that

P_{z_{0}} (Z_{T_{1}} = z_{1} ∣ N (T_{mut}) > n - 1) = \frac{1}{1 - u} P (Z_{T_{1}} = z_{1} ∣ Z_{0} = z_{0})

for all z₁ such that ${1 \leq k \leq K : z_{1}^{k} > 0} \subset {1 \leq k \leq K : z_{0}^{k} > 0}$ . For all other z₁ we have that

P_{z_{0}} (Z_{T_{1}} = z_{1} ∣ N (T_{mut}) > n - 1) = 0 .

Let $\tilde{P} (\cdot)$ denote the probability measure associated with a Moran process with only two types of cells and no further mutation allowed. Then it follows that

P_{z_{0}} (N (T_{hom}) > n - 1 ∣ N (T_{mut}) > n - 1) = {\tilde{P}}_{z_{0}} (N (T_{hom}) > n - 1) .

Thus it remains to bound ${\tilde{E}}_{z_{0}} (N (T_{hom}))$ . This will be done by following a similar approach as the proof of Thm 6.3 of [69]. First let $r = \hat{f} ∕ f$ , then for each 1 ≤ y ≤ N − 1 define N_y to be the number of visits of the Moran process to the state y. During each visit of the process to the site y, B_y birth events occur, where B_y is a geometric random variable with success probability

ρ_{y} = \frac{y (N - y) (\tilde{f} + f)}{N (\hat{f} j + f (N - j))},

It follows that

{\tilde{E}}_{z_{0}} (N (T_{hom})) = \frac{1}{ρ_{y}} \sum_{y = 1}^{N - 1} E_{z_{0}} N_{y} .

Using formula for E_z0N_y from the proof of Thm. 6.3 of [69], we arrive at

{\tilde{E}}_{z_{0}} (N (T_{hom})) = N \sum_{y = 1}^{N - 1} (\frac{r^{N - y} - 1}{r^{N} - 1}) (\frac{1}{N - y} + \frac{r}{y}) .

Note that for all r ≥ 0, we have

\frac{r^{N - y} - 1}{r^{N} - 1} \leq 1 .

It then follows that

\tilde{E} (N (T_{hom})) \leq N \sum_{y = 1}^{N - 1} (\frac{1}{N - y} + \frac{r}{y}) \approx (1 + f_{K}) N (\log N + γ)

where γ is the Euler-Mascheroni constant and we used the approximation

\sum_{j = 1}^{N} \frac{1}{j} \approx \log N + γ .

The result follows by recalling that f_K ≤ 2.

References

[1].Mintz B. Clonal basis of mammalian differentiation. Symp. Soc. Exp. Biol. 1971;25:345–370. [PubMed] [Google Scholar]
[2].Mintz B. Malignancy versus normal differentiation of stem cells as analyzed in genetically mosaic animals. Adv. Pathobiol. 1977;6:153–157. [PubMed] [Google Scholar]
[3].Kovacs L, Potten CS. An estimation of proliferative population size in stomach, jejenum and colon of dba-2 mice. Cell Tissue Kinet. 1973;6:125–134. doi: 10.1111/j.1365-2184.1973.tb01601.x. [DOI] [PubMed] [Google Scholar]
[4].Bach SP, Renehan AG, Potten CS. Stem cells: the intestinal stem cell as a paradigm. Carcinogenesis. 2000;21:469–476. doi: 10.1093/carcin/21.3.469. [DOI] [PubMed] [Google Scholar]
[5].Jacobson MD, Weil M, Raff MC. Programmed cell death in animal development. Cell. 1997;88:347–354. doi: 10.1016/s0092-8674(00)81873-5. [DOI] [PubMed] [Google Scholar]
[6].Kimura M. Evolutionary rate at the molecular level. Nature. 1968;217:624–626. doi: 10.1038/217624a0. [DOI] [PubMed] [Google Scholar]
[7].Hanahan D, Weinberg RA. The hallmarks of cancer. Cell. 2000;100:57–70. doi: 10.1016/s0092-8674(00)81683-9. [DOI] [PubMed] [Google Scholar]
[8].Levine AJ. The tumor suppressor genes. A. Rev. Biochem. 1993;62:623–651. doi: 10.1146/annurev.bi.62.070193.003203. [DOI] [PubMed] [Google Scholar]
[9].Mitelman F, Johansson B, Mertens F. Catalog of chromosome aberrations in cancer. Wiley-Liss; New York: 1994. [Google Scholar]
[10].Kinzler KW, Vogelstein B. Gatekeepers and caretakers. Nature. 1997;386:761–763. doi: 10.1038/386761a0. [DOI] [PubMed] [Google Scholar]
[11].Kinzler KW, Vogelstein B. The genetic basis of human cancer. McGraw-Hill; Toronto: 1998. [Google Scholar]
[12].Lengauer C, Kinzler KW, Vogelstein B. Genetic instabilities in human cancers. Nature. 1998;396:643–649. doi: 10.1038/25292. [DOI] [PubMed] [Google Scholar]
[13].Knudson AG. Two genetic hits to cancer. Nature Rev. Cancer. 2001;1:156–162. doi: 10.1038/35101031. [DOI] [PubMed] [Google Scholar]
[14].Hahn WC, Weinberg RA. Rules for making human tumor cells. New Eng. J. Med. 2002;347:1593–1603. doi: 10.1056/NEJMra021902. [DOI] [PubMed] [Google Scholar]
[15].Vogelstein B, Kinzler K. The genetic basis of human cancer. McGraw-Hill; 2002. [Google Scholar]
[16].Jones PA, Baylin SB. The fundamental role of epigenetic events in cancer. Nature Reviews Genetics. 2002;3:415–428. doi: 10.1038/nrg816. [DOI] [PubMed] [Google Scholar]
[17].Herman JG, et al. Silencing of the vhl tumor-suppressor gene by dna methylation in renal carcinoma. Proc Natl Adac Sci USA. 1994;91:9700–9704. doi: 10.1073/pnas.91.21.9700. [DOI] [PMC free article] [PubMed] [Google Scholar]
[18].Esteller M, et al. Promoter hypermethylation and brca1 inactivation in sporadic breast and ovarian tumors. J Natl Cancer Inst. 2000;92:564–569. doi: 10.1093/jnci/92.7.564. [DOI] [PubMed] [Google Scholar]
[19].Esteller M, et al. Epigenetic inactivation of lkb1 in primary tumors associated with the peutz-jeghers syndrome. Oncogene. 2000;19:164–168. doi: 10.1038/sj.onc.1203227. [DOI] [PubMed] [Google Scholar]
[20].Coulondre C, Miller JH, Farabaugh PlJ., Gilbert W. Molecular basis of base substitution hotspots in escherichia coli. Nature. 1978;274:775–780. doi: 10.1038/274775a0. [DOI] [PubMed] [Google Scholar]
[21].Pfeifer GP, Tang M, Denissenko MF. Mutation hotspots and dna methylation. Curr Top Microbiol Immunol. 2000;249:1–19. doi: 10.1007/978-3-642-59696-4_1. [DOI] [PubMed] [Google Scholar]
[22].Iwasa Y, Michor F, Komarova N, Nowak M. Population genetics of tumor suppressor genes. Journal of Theoretical Biology. 2005;233:15–23. doi: 10.1016/j.jtbi.2004.09.001. [DOI] [PubMed] [Google Scholar]
[23].Michor F, Iwasa Y, Nowak MA. Dynamics of cancer progression. Nature Reviews Cancer. 2004;4:197–206. doi: 10.1038/nrc1295. [DOI] [PubMed] [Google Scholar]
[24].Komarova NL, Wodarz W. Computational Biology of Cancer. World Scientific; 2005. [Google Scholar]
[25].Komarova NL, Sengupta A, Nowak MA. Mutation-selection networks of cancer initiation: Tumor suppressor genes and chromosone instability. Journal of Theoretical Biology. 2003;223:433–450. doi: 10.1016/s0022-5193(03)00120-6. [DOI] [PubMed] [Google Scholar]
[26].Schweinsberg J. Waiting for n mutations. Electronic Journal of Probability. 2008;13:1442–1478. [Google Scholar]
[27].Durrett R, Schmidt D, Schweinsberg J. A waiting time problem arising from the study of multi-stage carcinogenesis. Annals of Applied Probability. 2009;19:676–718. [Google Scholar]
[28].Moran PAP. The statistical processes of evolutionary theory. Clarendon Press; Oxford: 1962. [Google Scholar]
[29].Charles D, Luce-Clausen E. The kinetics of papilloma formation in benzpyrene-treated mice. Cancer Research. 1942;2:261–263. [Google Scholar]
[30].Fisher JC, Hollomon JH. A hypothesis for the origin of cancer foci. Cancer. 1951;4:916–918. doi: 10.1002/1097-0142(195109)4:5<916::aid-cncr2820040504>3.0.co;2-7. [DOI] [PubMed] [Google Scholar]
[31].Nordling CO. A new theory on cancer-inducing mechanism. Br. J. Cancer. 1953;7:68–72. doi: 10.1038/bjc.1953.8. [DOI] [PMC free article] [PubMed] [Google Scholar]
[32].Armitage P, Doll R. A two-stage theory of carcinogenesis in relation to the age distribution of human cancer. Br. J. Cancer. 1957;11 doi: 10.1038/bjc.1957.22. [DOI] [PMC free article] [PubMed] [Google Scholar]
[33].Fisher JC. Multiple-mutation theory of carcinogenesis. Nature. 1958;181:651–652. doi: 10.1038/181651b0. [DOI] [PubMed] [Google Scholar]
[34].Ashley DJ. Colonic ccncer arising in polyposis coli. Journal of Medical Genetics. 1969;6:376–378. doi: 10.1136/jmg.6.4.376. [DOI] [PMC free article] [PubMed] [Google Scholar]
[35].Knudson AG. Mutation and cancer: statistical study of retinoblastoma. Proc. Natl. Acad. Sci. USA. 1971;68:820–823. doi: 10.1073/pnas.68.4.820. [DOI] [PMC free article] [PubMed] [Google Scholar]
[36].Bodmer W, Tomlinson I. Failure of programmed cell death and differentiation as causes of tumors: some simple mathematical models. Proc. Natl. Acad. Sci. USA. 1995:11130–11134. doi: 10.1073/pnas.92.24.11130. [DOI] [PMC free article] [PubMed] [Google Scholar]
[37].Tomlinson IP, Novelli MR, Bodmer WF. The mutation rate and cancer. Proc. Natl. Acad. Sci. USA. 1996;93:14800–14803. doi: 10.1073/pnas.93.25.14800. [DOI] [PMC free article] [PubMed] [Google Scholar]
[38].Maley CC, Forrest S. Exploring the relationship between neutral and selective mutations in cancer. Artif. Life. 2001;6:325–345. doi: 10.1162/106454600300103665. [DOI] [PubMed] [Google Scholar]
[39].Nunney L. The population genetics of multistage carcinogenesis. Proc. Biol. Soc. 2003;270:1183–1191. doi: 10.1098/rspb.2003.2351. [DOI] [PMC free article] [PubMed] [Google Scholar]
[40].Frank SA. Genetic predisposition to cancer - insights from population genetics. Nat. Rev. Genetics. 2004;5:764–772. doi: 10.1038/nrg1450. [DOI] [PubMed] [Google Scholar]
[41].Frank SA. Age-soecific incidence of inherited versus sporadic cancers: a test of the multistage theory of carcinogenesis. Proc. Natl. Acad. Sci. USA. 2005;102:1071–1075. doi: 10.1073/pnas.0407299102. [DOI] [PMC free article] [PubMed] [Google Scholar]
[42].Michor F, Iwasa Y, Nowak MA. The age incidence of chronic myeloid leukemia can be explained by a one-mutation model. Proc. Natl. Acad. Sci. USA. 2006;103:14931–14934. doi: 10.1073/pnas.0607006103. [DOI] [PMC free article] [PubMed] [Google Scholar]
[43].Michor F, Nowak MA, Iwasa Y. Stochastic dynamics of metastasis formation. J. Theor. Biol. 2006;240:521–530. doi: 10.1016/j.jtbi.2005.10.021. [DOI] [PubMed] [Google Scholar]
[44].Michor F, Iwasa Y. Dynamics of metastasis suppressor gene inactivation. J. Theor. Biol. 2006;241:676–689. doi: 10.1016/j.jtbi.2006.01.006. [DOI] [PubMed] [Google Scholar]
[45].Beerenwinkel N, Antal T, Dingli D, Traulsen A, Kinzler KW, Velculescu VE, et al. Genetic progression and the waiting time to cancer. PLoS Computational Biology. 2007;3:e225. doi: 10.1371/journal.pcbi.0030225. [DOI] [PMC free article] [PubMed] [Google Scholar]
[46].D’Onofrio A, Tomlinson IP. A nonlinear mathematical model of cell turnover, differentiation, and tumorigenesis in the intestinal crypt. J. Theor Biol. 2007;244:367–374. doi: 10.1016/j.jtbi.2006.08.022. [DOI] [PubMed] [Google Scholar]
[47].Johnston MD, Edwards CM, Bodmer WF, Maini PK, Chapman SJ. Mathematical modeling of cell population dynamics int he colonic crypt and in colorectal cancer. Proc. Natl. Acad. Sci. USA. 2007;104:4008–4013. doi: 10.1073/pnas.0611179104. [DOI] [PMC free article] [PubMed] [Google Scholar]
[48].Wodarz D, Komarova NL. Can loss of apoptosis protect against cancer? Trends Genet. 2007;23:232–237. doi: 10.1016/j.tig.2007.03.005. [DOI] [PubMed] [Google Scholar]
[49].Jones S, Chen WD, Parmigiani G, Diehl F, Beerenwinkel N, Antal T, et al. Comparative lesion sequencing provides insights into tumor evolution. Proc. Natl. Acad. Sci. USA. 2008;105:4283–4288. doi: 10.1073/pnas.0712345105. [DOI] [PMC free article] [PubMed] [Google Scholar]
[50].Haeno H, Levine RL, Gilliland DG, Michor F. The cell of origin of hematopoietic malignancies. Proc. Natl. Acad. Sci. USA. 2009;106:16616–16621. doi: 10.1073/pnas.0908107106. [DOI] [PMC free article] [PubMed] [Google Scholar]
[51].Durrett R, Foo J, Leder K, Mayberry J, Michor F. Evolutionary dynamics of tumor progression with random fitness values. Theor Popul Biol. 2010;78:54–66. doi: 10.1016/j.tpb.2010.05.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
[52].Wodarz D, Komarova N. Computational Biology of Cancer. World Scientific; 2005. [Google Scholar]
[53].Reya T, Morrison S, Clarke M, Weissman I. Stem cells, cancer, and cancer stem cells. Nature. 2001;414:105–111. doi: 10.1038/35102167. [DOI] [PubMed] [Google Scholar]
[54].Huntly BJP, Shigematsu H, DEguchi K, Lee BH, Mizuno S, Duclos N, Rowan R, et al. Moz-tif2, but not bcr-abl, confers properties of leukemic stem cells to committed murine hematopoietic progenitors. Cancer Cel. 2004;6:587–696. doi: 10.1016/j.ccr.2004.10.015. [DOI] [PubMed] [Google Scholar]
[55].Iwasa Y, Michor F, Nowak M. Stochastic tunnels in evolutionary dynamics. Genetics. 2004;166:1571–1579. doi: 10.1534/genetics.166.3.1571. [DOI] [PMC free article] [PubMed] [Google Scholar]
[56].Crow JF, Kimura M. An Introduction to Population Genetics Theory. Harper and Row; 1970. [Google Scholar]
[57].Arias E. United States life tables, 2004. National Vital Statistics Reports. 2007;56(9) [PubMed] [Google Scholar]
[58].Lin XS, Liu X. Markov aging process and phase-type law of mortality. North American Actuarial Journal. 2007;11(4) [Google Scholar]
[59].Fearon ER, Vogelstein B. A genetic model for colorectal tumorigenesis. Cell. 1990;61:759–767. doi: 10.1016/0092-8674(90)90186-i. [DOI] [PubMed] [Google Scholar]
[60].Frank SA, Nowak M. Problems of somatic mutation and cancer. BioEssays. 2004;26:291–299. doi: 10.1002/bies.20000. [DOI] [PubMed] [Google Scholar]
[61].Yatabe Y, Tavare S, Shibata D. Investigating stem cells in human colon by using methylation patterns. Proc. Natl. Acad. Sci. USA. 2001;98:10839–10844. doi: 10.1073/pnas.191225998. [DOI] [PMC free article] [PubMed] [Google Scholar]
[62].Lipkin M, Sherlock PJ, Bell B. Generation time of epithelial cells in the human colon. Nature. 1962;195 doi: 10.1038/195175b0. [DOI] [PubMed] [Google Scholar]
[63].Lieberman D. Cost effectiveness of colon cancer screening. Am. J. Gastroenterol. 1991;33:1789. [PubMed] [Google Scholar]
[64].Jass JR, Stewart SM. Evolution of hereditary non-polyposis colorectal cancer. Gut. 1992;33:783. doi: 10.1136/gut.33.6.783. [DOI] [PMC free article] [PubMed] [Google Scholar]
[65].Ransohoff D, Lang C. Stem cells, cancer, and cancer stem cells. N. Engl. J. Med. 1991;325:37. doi: 10.1056/NEJM199107043250107. [DOI] [PubMed] [Google Scholar]
[66].Nicolas P, Kim K-M, Shibata D, Tavare S. The stem cell population of the human colon crypt: Analysis via methylation patterns. PLoS Computational Biology. 2007;3:e28. doi: 10.1371/journal.pcbi.0030028. [DOI] [PMC free article] [PubMed] [Google Scholar]
[67].Kim K-M, Shibata D. Methylation reveals a niche: stem cell succession in human colon crypts. Oncogene. 2002;21:5441–5449. doi: 10.1038/sj.onc.1205604. [DOI] [PubMed] [Google Scholar]
[68].Kunkel TA, Bebenek K. Dna replication fidelity. Annu Rev Biochem. 2000;69:487–529. doi: 10.1146/annurev.biochem.69.1.497. [DOI] [PubMed] [Google Scholar]
[69].Durrett R. Probability Models for DNA Sequence Evolution. Springer; 2008. [Google Scholar]

[R1] [1].Mintz B. Clonal basis of mammalian differentiation. Symp. Soc. Exp. Biol. 1971;25:345–370. [PubMed] [Google Scholar]

[R2] [2].Mintz B. Malignancy versus normal differentiation of stem cells as analyzed in genetically mosaic animals. Adv. Pathobiol. 1977;6:153–157. [PubMed] [Google Scholar]

[R3] [3].Kovacs L, Potten CS. An estimation of proliferative population size in stomach, jejenum and colon of dba-2 mice. Cell Tissue Kinet. 1973;6:125–134. doi: 10.1111/j.1365-2184.1973.tb01601.x. [DOI] [PubMed] [Google Scholar]

[R4] [4].Bach SP, Renehan AG, Potten CS. Stem cells: the intestinal stem cell as a paradigm. Carcinogenesis. 2000;21:469–476. doi: 10.1093/carcin/21.3.469. [DOI] [PubMed] [Google Scholar]

[R5] [5].Jacobson MD, Weil M, Raff MC. Programmed cell death in animal development. Cell. 1997;88:347–354. doi: 10.1016/s0092-8674(00)81873-5. [DOI] [PubMed] [Google Scholar]

[R6] [6].Kimura M. Evolutionary rate at the molecular level. Nature. 1968;217:624–626. doi: 10.1038/217624a0. [DOI] [PubMed] [Google Scholar]

[R7] [7].Hanahan D, Weinberg RA. The hallmarks of cancer. Cell. 2000;100:57–70. doi: 10.1016/s0092-8674(00)81683-9. [DOI] [PubMed] [Google Scholar]

[R8] [8].Levine AJ. The tumor suppressor genes. A. Rev. Biochem. 1993;62:623–651. doi: 10.1146/annurev.bi.62.070193.003203. [DOI] [PubMed] [Google Scholar]

[R9] [9].Mitelman F, Johansson B, Mertens F. Catalog of chromosome aberrations in cancer. Wiley-Liss; New York: 1994. [Google Scholar]

[R10] [10].Kinzler KW, Vogelstein B. Gatekeepers and caretakers. Nature. 1997;386:761–763. doi: 10.1038/386761a0. [DOI] [PubMed] [Google Scholar]

[R11] [11].Kinzler KW, Vogelstein B. The genetic basis of human cancer. McGraw-Hill; Toronto: 1998. [Google Scholar]

[R12] [12].Lengauer C, Kinzler KW, Vogelstein B. Genetic instabilities in human cancers. Nature. 1998;396:643–649. doi: 10.1038/25292. [DOI] [PubMed] [Google Scholar]

[R13] [13].Knudson AG. Two genetic hits to cancer. Nature Rev. Cancer. 2001;1:156–162. doi: 10.1038/35101031. [DOI] [PubMed] [Google Scholar]

[R14] [14].Hahn WC, Weinberg RA. Rules for making human tumor cells. New Eng. J. Med. 2002;347:1593–1603. doi: 10.1056/NEJMra021902. [DOI] [PubMed] [Google Scholar]

[R15] [15].Vogelstein B, Kinzler K. The genetic basis of human cancer. McGraw-Hill; 2002. [Google Scholar]

[R16] [16].Jones PA, Baylin SB. The fundamental role of epigenetic events in cancer. Nature Reviews Genetics. 2002;3:415–428. doi: 10.1038/nrg816. [DOI] [PubMed] [Google Scholar]

[R17] [17].Herman JG, et al. Silencing of the vhl tumor-suppressor gene by dna methylation in renal carcinoma. Proc Natl Adac Sci USA. 1994;91:9700–9704. doi: 10.1073/pnas.91.21.9700. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R18] [18].Esteller M, et al. Promoter hypermethylation and brca1 inactivation in sporadic breast and ovarian tumors. J Natl Cancer Inst. 2000;92:564–569. doi: 10.1093/jnci/92.7.564. [DOI] [PubMed] [Google Scholar]

[R19] [19].Esteller M, et al. Epigenetic inactivation of lkb1 in primary tumors associated with the peutz-jeghers syndrome. Oncogene. 2000;19:164–168. doi: 10.1038/sj.onc.1203227. [DOI] [PubMed] [Google Scholar]

[R20] [20].Coulondre C, Miller JH, Farabaugh PlJ., Gilbert W. Molecular basis of base substitution hotspots in escherichia coli. Nature. 1978;274:775–780. doi: 10.1038/274775a0. [DOI] [PubMed] [Google Scholar]

[R21] [21].Pfeifer GP, Tang M, Denissenko MF. Mutation hotspots and dna methylation. Curr Top Microbiol Immunol. 2000;249:1–19. doi: 10.1007/978-3-642-59696-4_1. [DOI] [PubMed] [Google Scholar]

[R22] [22].Iwasa Y, Michor F, Komarova N, Nowak M. Population genetics of tumor suppressor genes. Journal of Theoretical Biology. 2005;233:15–23. doi: 10.1016/j.jtbi.2004.09.001. [DOI] [PubMed] [Google Scholar]

[R23] [23].Michor F, Iwasa Y, Nowak MA. Dynamics of cancer progression. Nature Reviews Cancer. 2004;4:197–206. doi: 10.1038/nrc1295. [DOI] [PubMed] [Google Scholar]

[R24] [24].Komarova NL, Wodarz W. Computational Biology of Cancer. World Scientific; 2005. [Google Scholar]

[R25] [25].Komarova NL, Sengupta A, Nowak MA. Mutation-selection networks of cancer initiation: Tumor suppressor genes and chromosone instability. Journal of Theoretical Biology. 2003;223:433–450. doi: 10.1016/s0022-5193(03)00120-6. [DOI] [PubMed] [Google Scholar]

[R26] [26].Schweinsberg J. Waiting for n mutations. Electronic Journal of Probability. 2008;13:1442–1478. [Google Scholar]

[R27] [27].Durrett R, Schmidt D, Schweinsberg J. A waiting time problem arising from the study of multi-stage carcinogenesis. Annals of Applied Probability. 2009;19:676–718. [Google Scholar]

[R28] [28].Moran PAP. The statistical processes of evolutionary theory. Clarendon Press; Oxford: 1962. [Google Scholar]

[R29] [29].Charles D, Luce-Clausen E. The kinetics of papilloma formation in benzpyrene-treated mice. Cancer Research. 1942;2:261–263. [Google Scholar]

[R30] [30].Fisher JC, Hollomon JH. A hypothesis for the origin of cancer foci. Cancer. 1951;4:916–918. doi: 10.1002/1097-0142(195109)4:5<916::aid-cncr2820040504>3.0.co;2-7. [DOI] [PubMed] [Google Scholar]

[R31] [31].Nordling CO. A new theory on cancer-inducing mechanism. Br. J. Cancer. 1953;7:68–72. doi: 10.1038/bjc.1953.8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R32] [32].Armitage P, Doll R. A two-stage theory of carcinogenesis in relation to the age distribution of human cancer. Br. J. Cancer. 1957;11 doi: 10.1038/bjc.1957.22. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R33] [33].Fisher JC. Multiple-mutation theory of carcinogenesis. Nature. 1958;181:651–652. doi: 10.1038/181651b0. [DOI] [PubMed] [Google Scholar]

[R34] [34].Ashley DJ. Colonic ccncer arising in polyposis coli. Journal of Medical Genetics. 1969;6:376–378. doi: 10.1136/jmg.6.4.376. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R35] [35].Knudson AG. Mutation and cancer: statistical study of retinoblastoma. Proc. Natl. Acad. Sci. USA. 1971;68:820–823. doi: 10.1073/pnas.68.4.820. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R36] [36].Bodmer W, Tomlinson I. Failure of programmed cell death and differentiation as causes of tumors: some simple mathematical models. Proc. Natl. Acad. Sci. USA. 1995:11130–11134. doi: 10.1073/pnas.92.24.11130. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R37] [37].Tomlinson IP, Novelli MR, Bodmer WF. The mutation rate and cancer. Proc. Natl. Acad. Sci. USA. 1996;93:14800–14803. doi: 10.1073/pnas.93.25.14800. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R38] [38].Maley CC, Forrest S. Exploring the relationship between neutral and selective mutations in cancer. Artif. Life. 2001;6:325–345. doi: 10.1162/106454600300103665. [DOI] [PubMed] [Google Scholar]

[R39] [39].Nunney L. The population genetics of multistage carcinogenesis. Proc. Biol. Soc. 2003;270:1183–1191. doi: 10.1098/rspb.2003.2351. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R40] [40].Frank SA. Genetic predisposition to cancer - insights from population genetics. Nat. Rev. Genetics. 2004;5:764–772. doi: 10.1038/nrg1450. [DOI] [PubMed] [Google Scholar]

[R41] [41].Frank SA. Age-soecific incidence of inherited versus sporadic cancers: a test of the multistage theory of carcinogenesis. Proc. Natl. Acad. Sci. USA. 2005;102:1071–1075. doi: 10.1073/pnas.0407299102. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R42] [42].Michor F, Iwasa Y, Nowak MA. The age incidence of chronic myeloid leukemia can be explained by a one-mutation model. Proc. Natl. Acad. Sci. USA. 2006;103:14931–14934. doi: 10.1073/pnas.0607006103. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R43] [43].Michor F, Nowak MA, Iwasa Y. Stochastic dynamics of metastasis formation. J. Theor. Biol. 2006;240:521–530. doi: 10.1016/j.jtbi.2005.10.021. [DOI] [PubMed] [Google Scholar]

[R44] [44].Michor F, Iwasa Y. Dynamics of metastasis suppressor gene inactivation. J. Theor. Biol. 2006;241:676–689. doi: 10.1016/j.jtbi.2006.01.006. [DOI] [PubMed] [Google Scholar]

[R45] [45].Beerenwinkel N, Antal T, Dingli D, Traulsen A, Kinzler KW, Velculescu VE, et al. Genetic progression and the waiting time to cancer. PLoS Computational Biology. 2007;3:e225. doi: 10.1371/journal.pcbi.0030225. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R46] [46].D’Onofrio A, Tomlinson IP. A nonlinear mathematical model of cell turnover, differentiation, and tumorigenesis in the intestinal crypt. J. Theor Biol. 2007;244:367–374. doi: 10.1016/j.jtbi.2006.08.022. [DOI] [PubMed] [Google Scholar]

[R47] [47].Johnston MD, Edwards CM, Bodmer WF, Maini PK, Chapman SJ. Mathematical modeling of cell population dynamics int he colonic crypt and in colorectal cancer. Proc. Natl. Acad. Sci. USA. 2007;104:4008–4013. doi: 10.1073/pnas.0611179104. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R48] [48].Wodarz D, Komarova NL. Can loss of apoptosis protect against cancer? Trends Genet. 2007;23:232–237. doi: 10.1016/j.tig.2007.03.005. [DOI] [PubMed] [Google Scholar]

[R49] [49].Jones S, Chen WD, Parmigiani G, Diehl F, Beerenwinkel N, Antal T, et al. Comparative lesion sequencing provides insights into tumor evolution. Proc. Natl. Acad. Sci. USA. 2008;105:4283–4288. doi: 10.1073/pnas.0712345105. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R50] [50].Haeno H, Levine RL, Gilliland DG, Michor F. The cell of origin of hematopoietic malignancies. Proc. Natl. Acad. Sci. USA. 2009;106:16616–16621. doi: 10.1073/pnas.0908107106. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R51] [51].Durrett R, Foo J, Leder K, Mayberry J, Michor F. Evolutionary dynamics of tumor progression with random fitness values. Theor Popul Biol. 2010;78:54–66. doi: 10.1016/j.tpb.2010.05.001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R52] [52].Wodarz D, Komarova N. Computational Biology of Cancer. World Scientific; 2005. [Google Scholar]

[R53] [53].Reya T, Morrison S, Clarke M, Weissman I. Stem cells, cancer, and cancer stem cells. Nature. 2001;414:105–111. doi: 10.1038/35102167. [DOI] [PubMed] [Google Scholar]

[R54] [54].Huntly BJP, Shigematsu H, DEguchi K, Lee BH, Mizuno S, Duclos N, Rowan R, et al. Moz-tif2, but not bcr-abl, confers properties of leukemic stem cells to committed murine hematopoietic progenitors. Cancer Cel. 2004;6:587–696. doi: 10.1016/j.ccr.2004.10.015. [DOI] [PubMed] [Google Scholar]

[R55] [55].Iwasa Y, Michor F, Nowak M. Stochastic tunnels in evolutionary dynamics. Genetics. 2004;166:1571–1579. doi: 10.1534/genetics.166.3.1571. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R56] [56].Crow JF, Kimura M. An Introduction to Population Genetics Theory. Harper and Row; 1970. [Google Scholar]

[R57] [57].Arias E. United States life tables, 2004. National Vital Statistics Reports. 2007;56(9) [PubMed] [Google Scholar]

[R58] [58].Lin XS, Liu X. Markov aging process and phase-type law of mortality. North American Actuarial Journal. 2007;11(4) [Google Scholar]

[R59] [59].Fearon ER, Vogelstein B. A genetic model for colorectal tumorigenesis. Cell. 1990;61:759–767. doi: 10.1016/0092-8674(90)90186-i. [DOI] [PubMed] [Google Scholar]

[R60] [60].Frank SA, Nowak M. Problems of somatic mutation and cancer. BioEssays. 2004;26:291–299. doi: 10.1002/bies.20000. [DOI] [PubMed] [Google Scholar]

[R61] [61].Yatabe Y, Tavare S, Shibata D. Investigating stem cells in human colon by using methylation patterns. Proc. Natl. Acad. Sci. USA. 2001;98:10839–10844. doi: 10.1073/pnas.191225998. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R62] [62].Lipkin M, Sherlock PJ, Bell B. Generation time of epithelial cells in the human colon. Nature. 1962;195 doi: 10.1038/195175b0. [DOI] [PubMed] [Google Scholar]

[R63] [63].Lieberman D. Cost effectiveness of colon cancer screening. Am. J. Gastroenterol. 1991;33:1789. [PubMed] [Google Scholar]

[R64] [64].Jass JR, Stewart SM. Evolution of hereditary non-polyposis colorectal cancer. Gut. 1992;33:783. doi: 10.1136/gut.33.6.783. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R65] [65].Ransohoff D, Lang C. Stem cells, cancer, and cancer stem cells. N. Engl. J. Med. 1991;325:37. doi: 10.1056/NEJM199107043250107. [DOI] [PubMed] [Google Scholar]

[R66] [66].Nicolas P, Kim K-M, Shibata D, Tavare S. The stem cell population of the human colon crypt: Analysis via methylation patterns. PLoS Computational Biology. 2007;3:e28. doi: 10.1371/journal.pcbi.0030028. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R67] [67].Kim K-M, Shibata D. Methylation reveals a niche: stem cell succession in human colon crypts. Oncogene. 2002;21:5441–5449. doi: 10.1038/sj.onc.1205604. [DOI] [PubMed] [Google Scholar]

[R68] [68].Kunkel TA, Bebenek K. Dna replication fidelity. Annu Rev Biochem. 2000;69:487–529. doi: 10.1146/annurev.biochem.69.1.497. [DOI] [PubMed] [Google Scholar]

[R69] [69].Durrett R. Probability Models for DNA Sequence Evolution. Springer; 2008. [Google Scholar]

PERMALINK

Stochastic dynamics of cancer initiation

Jasmine Foo

Kevin Leder

Franziska Michor

Abstract

1. Introduction

2. The Model

Mutational fitness distribution f_ψ

Figure 1. A schematic representation of the stochastic process governing cellular fitness.

Dynamics in fitness space

Modeling the lifetime of a patient

Figure 2. Sample path simulations of the two-dimensional stochastic process.

3. Analysis

3.1. The probability of cancer initiation prior to death

3.2. The waiting time until cancer initiation

3.3. The number of neutral mutations in the cancer-initiating cell

3.4. The number of non-neutral mutations in the cancer-initiating cell

4. Application of the model to colorectal cancer initiation

Figure 3. The mutational fitness distribution.

4.1. Effects of varying the mutational fitness distribution, f_ψ

Figure 4. The probability of and time until cancer initiation prior to death.

Figure 5. Advantageous mutations in the cancer-initiating cell.

4.2. Effects of varying the mode of the lifetime distribution

Table 1. The effects of the life expectancy distribution on cancer initiation.

4.3. Sensitivity to the mutation rate

Figure 6. The effects of the mutation rate on cancer initiation.

5. Discussion

Acknowledgements

6. Appendix

6.1. Standard Moran Process

6.2. Multi-Type Moran Process

Lemma 1

Proof of Lemma 1

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Stochastic dynamics of cancer initiation

Jasmine Foo

Kevin Leder

Franziska Michor

Abstract

1. Introduction

2. The Model

Mutational fitness distribution fψ

Figure 1. A schematic representation of the stochastic process governing cellular fitness.

Dynamics in fitness space

Modeling the lifetime of a patient

Figure 2. Sample path simulations of the two-dimensional stochastic process.

3. Analysis

3.1. The probability of cancer initiation prior to death

3.2. The waiting time until cancer initiation

3.3. The number of neutral mutations in the cancer-initiating cell

3.4. The number of non-neutral mutations in the cancer-initiating cell

4. Application of the model to colorectal cancer initiation

Figure 3. The mutational fitness distribution.

4.1. Effects of varying the mutational fitness distribution, fψ

Figure 4. The probability of and time until cancer initiation prior to death.

Figure 5. Advantageous mutations in the cancer-initiating cell.

4.2. Effects of varying the mode of the lifetime distribution

Table 1. The effects of the life expectancy distribution on cancer initiation.

4.3. Sensitivity to the mutation rate

Figure 6. The effects of the mutation rate on cancer initiation.

5. Discussion

Acknowledgements

6. Appendix

6.1. Standard Moran Process

6.2. Multi-Type Moran Process

Lemma 1

Proof of Lemma 1

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Mutational fitness distribution f_ψ

4.1. Effects of varying the mutational fitness distribution, f_ψ