Skip to main content
Evolutionary Applications logoLink to Evolutionary Applications
. 2016 Mar 8;9(4):565–582. doi: 10.1111/eva.12361

The implications of small stem cell niche sizes and the distribution of fitness effects of new mutations in aging and tumorigenesis

Vincent L Cannataro 1,, Scott A McKinley 2, Colette M St Mary 1
PMCID: PMC4831459  PMID: 27099622

Abstract

Somatic tissue evolves over a vertebrate's lifetime due to the accumulation of mutations in stem cell populations. Mutations may alter cellular fitness and contribute to tumorigenesis or aging. The distribution of mutational effects within somatic cells is not known. Given the unique regulatory regime of somatic cell division, we hypothesize that mutational effects in somatic tissue fall into a different framework than whole organisms; one in which there are more mutations of large effect. Through simulation analysis, we investigate the fit of tumor incidence curves generated using exponential and power‐law distributions of fitness effects (DFE) to known tumorigenesis incidence. Modeling considerations include the architecture of stem cell populations, that is, a large number of very small populations, and mutations that do and do not fix neutrally in the stem cell niche. We find that the typically quantified DFE in whole organisms is sufficient to explain tumorigenesis incidence. Further, deleterious mutations are predicted to accumulate via genetic drift, resulting in reduced tissue maintenance. Thus, despite there being a large number of stem cells throughout the intestine, its compartmental architecture leads to the accumulation of deleterious mutations and significant aging, making the intestinal stem cell niche a prime example of Muller's Ratchet.

Keywords: aging, biomedicine, evolutionary theory, fitness, population genetics‐theoretical, stem cells, tumorigenesis

Introduction

Evolution in somatic tissue

The epithelial tissues within many animals are continually replenished by populations of stem cells that divide throughout the organism's lifetime. For instance, the epithelial lining of the intestinal tract is replaced weekly by millions of independent populations of stem cells located in intestinal crypts [reviewed in Barker (2014)]. This continual division provides an opportunity for mutation, resulting in the accumulation of mutant lineages and somatic evolution (Lynch 2010). Stem cell lineages with decreased fitness, or a diminished ability to divide and survive, will represent a failure in this tissue renewal process and the aging of tissues and multicellular organisms as a whole (López‐Otín et al. 2013; Moskalev et al. 2013). Lineages with increased fitness, or faster division rates and an increased propensity to survive, will result in the accumulation of cells and neoplasia (Merlo et al. 2006). Although considered premalignant at the onset, the accumulation of cells into a polyp, in which cells continually divide and accumulate subsequent mutations, can develop a cancerous phenotype over time (Winawer 1999).

Distribution of fitness effects

The effect that a new mutation will have on an individual's fitness can be characterized by a distribution of fitness effects (DFE). The DFE of several organisms have been experimentally estimated using mutation accumulation experiments or directed mutagenesis experiments in the laboratory (Eyre‐Walker and Keightley 2007; Halligan and Keightley 2009). The majority of random mutations to a genome that affect fitness have a deleterious effect on fitness, while a small subset increase fitness (Eyre‐Walker and Keightley 2007). Additionally, many mutations that affect fitness have a small effect, while few have a large effect. In general, both beneficial (Imhof and Schlotterer 2001; Orr 2003; Kassen and Bataillon 2006) and deleterious (Elena et al. 1998) mutational fitness effects can be described well using an exponential distribution. We note that certain beneficial DFE may not be exponentially distributed, and are better classified as having either a bounded or heavier‐than‐exponential tail (Rokyta et al. 2008; Bank et al. 2014; Levy et al. 2015), and compound distributions or distributions with more parameters may better fit empirical measures of DFE (Sanjuán et al. 2004).

By understanding the mutational DFE in somatic tissue, we can predict the evolutionary trajectories of tissues within multicellular organisms as they age. Absolute fitness is typically measured as the reproductive success of a genetically identical lineage, which can be measured empirically as the growth rate of a population and interpreted ecologically as the death rate of individuals in a population subtracted from the birth rate. Within stem cell populations, this is analogous to the differentiation rate of the stem cell lineage subtracted from the division rate. However, the total growth rate of the healthy stem cell population is necessarily zero to insure tissue homeostasis. As we describe in the next section, the stem cells exist in two populations: a static niche population with cells that are undergoing division and migrating into the second population, containing cells that are undergoing division and differentiation. Therefore, although only mutations to stem cell division rate would confer a change in selection pressure within the stem cell niche, fixed mutations to both division and differentiation rate will alter the rate of growth of the total stem cell lineage, the expected size of the stem cell population as a whole, and contribute to the probability of a tumorigenesis event. Hence, mutations to division and differentiation rate affect the reproductive success of stem cell lineages, that is, fitness, and we consider distributions of mutational effects on these two rates in this work. Although there has been no direct measurement of the distribution of fitness effects in somatic tissue [but see Vermeulen et al. (2013), Snippert et al. (2014) for estimations of the selective advantage for some known cancer drivers], the evolution of cancer progression has been previously modeled using discrete (Beerenwinkel et al. 2007; Bozic et al. 2010; McFarland et al. 2013) and continuous (Foo et al. 2011) fitness effects. Here, we differ from these previous models by investigating different mutational effect frameworks using parameters derived from whole organisms to explore mutation accumulation in crypts initialized at their measured healthy size in mice and humans and quantify both aging and tumorigenesis.

When quantifying tumor incidence, we are concerned with the moment that the regulatory regime in the intestinal crypt breaks down: when the stem cell division rate exceeds its differentiation rate. We call this point the tumorigenesis threshold. The resulting population will accumulate stem cells without bound, which is thought to be the cause of crypt fission and the main mechanism of polyp or adenoma growth (Loeffler and Grossmann 1991; Wong et al. 2002). We investigate the full spectrum of deleterious and beneficial mutational effects on the progression of a healthy crypt to tumor initiation using empirically measured rates of division.

The evolution of multicellularity has necessitated the evolution of regulatory systems that hold somatic stem cells at a relatively low fitness (when compared to their maximum potential) in order to ensure the cooperation of the different cellular systems constituting a whole organism. As such, in addition to the beneficial mutations that would be accounted for by a typical DFE for whole organisms [which are commonly assumed to already be highly fit (Orr 2010)], we expect mutations of large effect in somatic tissue as regulatory processes become dysfunctional, such as the deactivation of tumor suppressor genes or the activation of oncogenes. It is reasonable to hypothesize that a heavy‐tailed distribution could better classify mutational effects that have a beneficial effect in somatic stem cells by capturing both the mutations of small effect and also having a nontrivial probability of capturing the mutations of large effect often associated with cancer.

We evaluate whether or not the DFE estimated in whole organisms can explain known tumor incidence in the intestine. Further, we explore whether or not tumor incidence is better explained by a heavy‐tailed distribution for mutations beneficial to fitness. Thus, we create a model of an evolving intestinal stem cell pool and implement alternate DFE and compare the resultant incidence curves to known tumor incidence curves.

Materials and methods

Description of the model

Crypt population structure

The base of each intestinal crypt harbors a population of symmetrically dividing cells expressing markers associated with the stem cell phenotype (Lopez‐Garcia et al. 2010; Snippert et al. 2010). Within this population, there exists a subpopulation niche that is responsible for maintaining tissue homeostasis (Kozar et al. 2013; Vermeulen et al. 2013). We model the stem cells of the intestinal crypt as two populations of cells, the first being this stem cell niche, which consists of a fixed population of stem cells, N, and the second consisting of the stem cells displaced from this niche but not yet committed to differentiation. The sum of these populations represent the total number of stem cells within the crypt, NT. In our model, cells within the niche divide at rate λ and displace their neighbors through overcrowding, as proposed by Lopez‐Garcia et al. (2010) and revealed by in vivo live imaging by Ritsma et al. (2014). This population of cells experiences genetic drift and selection; cells that have a higher division rate are more likely to push their neighbors out of the niche [as demonstrated by Snippert et al. (2014)] and cells with lower division rates are more likely to be displaced. Mutations may occur at division with mutation rate μ and result in either a lineage with a new division rate or a lineage with a new rate of committing to differentiation. Displaced stem cells divide at the rate of their progenitor cells in the niche and commit to differentiation at rate ν, hereafter referred to as the differentiation rate. We assume that once a lineage commits to differentiation, it is destined to be expelled from the crypt. We define tumorigenesis in the crypt as the moment a lineage of stem cells with a division rate greater than its differentiation rate has become fixed in the niche, resulting in exponential population growth. We note that, although a stem cell's propensity to commit to differentiation in healthy tissue is partially dependent on external signaling queues, such as Wnt signals from Paneth cells in the small intestinal crypt stem cell niche (Clevers 2013), the ability of a stem cell to interpret and respond to, or even gain independence from, external signals is an intrinsic and heritable property of the stem cell (Reya and Clevers 2005). The parameters λ, N, and NT have been previously estimated (Kozar et al. 2013; Vermeulen et al. 2013), and we calculate ν according to eqn (4) in the Appendix, where ν=1+Ny¯(t) λ and y¯ is the average number of stem cells outside of the niche.

Distribution of fitness effects

We first describe our model of mutations that affect the division rate of stem cells and address mutations that affect differentiation rate later in section “Mutations that alter the differentiation rate of stem cells result in rapid aging and tumorigenesis" When mutations occur, the new division rate is greater than the previous rate with probability PB, and the mean positive change of rate is s+. We consider positive and negative changes that are exponentially distributed for deleterious effects and exponentially or Pareto distributed for beneficial effects, see the Appendix. The mean negative change is s. We define the exponential DFE in eqn (1) and the power‐law DFE in eqn (2).

m(λ;λ0)exp=(1PB)βλ0eβ1λλ0λ<λ0PBαλ0eαλλ01λ>λ0 (1)
m(λ;λ0)Pareto=(1PB)βλ0eβ1λλ0λ<λ0PBα1λ0λλ0αλ>λ0 (2)

The power‐law distribution is well defined if α > 1 and is considered to be heavy‐tailed (having infinite variance) if 1 < α < 3.

Selection assumptions

We are concerned with the mutations that arise and reach fixation within the stem cell niche. Due to drift, all stem cells with the same division rate as the background population have an equal probability of reaching fixation, commonly referred to as neutral drift dynamics (Lopez‐Garcia et al. 2010; Snippert et al. 2010). Following Wodarz and Komarova (2005), we use a Moran model to estimate the probability that a mutant lineage fixes in the stem cell niche:

pfix(λ;λold)=1λoldλ1(λoldλ)N (3)

where N is the number of cells in the niche. The mutation rate is low relative to the division rate, so we assume that there are at most two competing division rates at any given time.

Using the above formula (3), we can use Bayes' theorem to compute the probability density Φ(λ|λold) of a new fixed division rate λ given that the previous division rate is λold:

Φ(λ|λold)=pfix(λ;λold)m(λ;λold)0pfix(;λold)m(;λold)d. (4)

As described above, tumorigenesis occurs when the division rate λ is greater than differentiation rate ν and we define the point at which this happens to be the tumorigenesis threshold. In our modeling framework, each new fixed mutation presents a new possibility that the division rate exceeds the threshold for tumorigenesis. From (4), we can iteratively derive the sequence of functions {fn} that represent the density of the distribution of the stem cell division rates conditioned that n mutations have fixed in the stem cell niche and tumorigenesis has not occurred as of mutation n−1. If we let λ0 denote the initial stem cell division rate, then f1(λ)=Φ(λ|λold=λ0). For each n, let pn denote the probability that tumorigenesis occurs due to the nth mutation (given that n mutations have occurred). Then, p1=νf1(λ)dλ. From there, we can write the recursive formulae

fn+1(λ)=11pn0νΦ(λ|λold=)fn()dandpn+1=νfn+1(λ)dλ. (5)

From this, we have a recursive formula for the probabilities, {qn}, that tumorigenesis has not occurred given n fixed mutations: q1=1p1 and

qn+1=(1pn+1)qn. (6)

To translate this result to an individual's lifetime, we model the time‐dependent arrival of new mutations as a Poisson process with fixed rate parameter μ mutations per cell division. We keep track of the time‐dependent number M(t) of mutations that fix in the stem cell niches by time t. Then, using T to denote the time that tumorigenesis occurs in a given crypt, we can write the probability that tumorigenesis has not occurred as of time t by the equation

P(T>t)=n=1qnP(M(t)=n) (7)

Depending on the species, an individual has hundreds of thousands or even millions of crypts. The probability that an individual has at least one crypt that has undergone tumorigenesis can be calculated by considering the distribution of fixed mutations that have accumulated among the individual's crypts and the probability that these mutations result in tumorigenesis. This can then be extrapolated to the incidence rate of tumors among a population of individuals (Fig. 1). Let T represent the time that tumorigenesis first occurs in any of an individual's crypts. We use the following estimate to calculate tumorigenesis incidence data reported in the Results section. In the Supporting Information, we describe the full calculation and a few simplifying assumptions we make to develop a computationally tractable model.

Figure 1.

Figure 1

A representation of our model. (A) A cross section of an intestinal crypt, blue circles at the base of the crypt represent stem cells, while yellow circles represent cells that have committed to differentiation. The oval cross section at the base encompasses the stem cell niche, while stem cells above this niche are destined to commit to differentiation. Taking a top‐down look at the oval, large circles represent a cross section of the intestinal crypt base, which houses the intestinal stem cells, represented by smaller blue and red circles. Mutations may occur to a single cell in the stem cell niche. These mutations alter the fitness of the cell according to a specified distribution of fitness effects. Given the new fitness, the mutated lineage has a certain probability, pfix(λ;λold), of reaching fixation within the stem cell niche. (B) Here, the rectangles represent a cross section of the intestinal epithelium with the numbers representing the locations of individual crypts and describing the number of fixed mutations for each crypt. An organism accumulates fixed mutations over its lifetime.

P(Notumorigenesisattimet)n=1n^P(Nocrypttumorigenesisattimet|nmutations)E(#ofcryptswithnmutations)

i.e.,

P(T>t)n=1nmax(qn)C(μ^t)neμ^t/n!. (8)

In the above, C is the number of crypts in the length of intestine being investigated, nmax is the maximum number of mutations simulated, and μ^=Nμλ00pfix(λ;λ0)m(λ;λ0)dλ, where, as above, N is the number of cells in the stem cell niche, μ is the mutation rate per cell division, and pfix is defined by eqn (3).

Parameter choices

Some estimates of crypt dynamics parameters have shifted over time, for example, the stem cell division rate in the mouse was formerly thought to be once every 1–1.5 days (Lopez‐Garcia et al. 2010), but more recent estimates indicate they divide once every 3–10 days (Kozar et al. 2013). Kozar et al. (2013) demonstrated that the division rate of stem cells in the stem cell niche of mice varied from approximately 0.1 to 0.2 to 0.3 divisions per day along the proximal small intestine, distal small intestine, and colon, respectively. Likewise the estimated number of stem cells within the mouse stem cell niche varies from approximately five to six to seven, respectively. The total number of cells in crypts expressing stem cell markers has been reported to be 14–16 in mice (Lopez‐Garcia et al. 2010; Snippert et al. 2010; Clevers 2013). For the analysis of our mouse model, we chose the middle value of these parameter ranges, a crypt with 15 total cells expressing stem cell markers, with 6 of the cells constituting the stem cell niche dividing 0.20 times per stem cell per day. To estimate the differentiation rate of stem cells outside the stem cell niche, we used a continuous time Markov chain, described in the Appendix. According to this model, in order for the total stem cells in the crypt of a mouse to stay at a constant population size, the differentiation rate of stem cells outside of the stem cell niche must be 0.333 per stem cell per day.

The parameters associated with crypt dynamics in mice have been well described; however, we were unable to obtain any data on population incidence of intestinal polyps or tumors in wild type mice. On the other hand, while crypt dynamics in humans have not been as well studied, there exist incidence data for large intestine polyps (Chapman 1963). To parameterize the human colon crypt system, we considered a few sources. Nicolas et al. (2007) analyzed the methylation patterns within the human colon crypt and their Bayesian analysis suggests a posterior density mode between 15 and 20 stem cells maintaining homeostasis and constituting the stem cell niche within the crypt. Their posterior density provides more support for numbers of stem cells larger than this mode than for numbers smaller, so we chose 20 as an initial value for the number of stem cells within the stem cell niche. Bravo and Axelrod (2013) report an average of 35.7 quiescent stem cells within the human colon crypt through a staining experiment, so we assume there are 36 total stem cells within the human colon crypt. Human colon stem cells divide about once every 7 days (Potten et al. 2003), which would mean they would have to differentiate at a rate of about 0.321 per day to maintain homeostasis at the assumed initial parameters.

We parameterized the initial DFE based on those measured in whole organisms to evaluate whether they can account for known tumorigenesis incidence. The distribution of fitness effects has been estimated in mutation accumulation experiments and directed mutagenesis experiments. We consider the DFE proposed by Joseph and Hall (2004) in a mutation accumulation study because they report the expected effect size of deleterious and beneficial mutations, as well as the mutation rate and the proportion of mutations that were beneficial in a diploid eukaryotic system (Saccharomyces cerevisiae). They report an average beneficial heterozygous fitness effect of 0.061, which is slightly lower but within an order of magnitude of the effect of average beneficial mutation measured for vesicular stomatitis virus of 0.07 (Sanjuán et al. 2004) and E.coli of 0.087 (Kassen and Bataillon 2006). They found that 5.75% of accumulated mutations were beneficial and that the overall mutation rate to alleles that alter fitness was 6.3×105 mutations per haploid genome per generation. This would result in a diploid beneficial mutation rate of 2×6.3×105×0.0575=7.245×106. This is within an order of magnitude of the beneficial mutation rate reported for E. coli (Wiser et al. 2013).

Mutation accumulation experiments may not capture the true distribution of fitness effects because they rely on observing the mutations of lineages that survive and persist in a population. Because of this, they are biased against mutations of large deleterious effect. Additionally, the random passaging of individuals to repopulate new generations may result in drastically different estimates of average mutational effect size for the same species. For instance, average deleterious effect of mutations in Saccharomyces cerevisiae has been estimated to be 0.061 (Joseph and Hall 2004), 0.086 (Wloch et al. 2001), and 0.217 (Zeyl and DeVisser 2001). Directed mutagenesis of random genome targets in an RNA virus revealed an average nonlethal deleterious fitness effect of 0.244 (Sanjuán et al. 2004). It is likely that the inherent average effect size of a mutation of deleterious effect would be better reflected by the larger estimates because mutations of large deleterious effect may be lost in mutation accumulation experiments.

After building our model with DFE parameters estimated from whole organisms, we describe overall patterns of mutation accumulation and risk of tumorigenesis and then we utilized least squares analysis to explore the best fit among a series of plausible choices of μ and the expected value of s+ for the human incidence curves and compared to data from Chapman (1963) (best fit figures available as Figs S1–S3). For the division rate scenario, both for the exponential and for power‐law DFE, we vary the expected value of s+ from 0.041 to 0.07 and μ from 2.5×105 to 7×104. For the differentiation rate scenario, we vary s+ from 0.041 to 0.07 and μ from 2.5×107 to 7×104.

The model described above was executed using R version 3.1.1. R scripts developed for this study are available at https://github.com/vcannataro/Somatic‐Evo‐DFE.

Results

Mutations result in both aging and tumorigenesis within the intestine

Because stem cell niche populations are small, it is possible for mutant lineages with a fitness disadvantage to fix in the niche. This, coupled with the fact that the vast majority of mutations that occur will have a deleterious effect on stem cell fitness, results in the expected value of the probability density describing the new division rates to move away from the tumorigenesis threshold with subsequent fixed mutations (Fig. 2A,C,E). In general, the accumulation of fixed mutations within crypts results in impaired stem cell maintenance and lower stem cell production, contributing to the aging of the tissue and organism.

Figure 2.

Figure 2

The accumulation of probability densities describing stem cell division rate. (A) Exponentially distributed fitness effects on division rate using the parameters in Table 1 for the mouse. The first density is a green dashed line. Each probability density represents the division rate of a fixed lineage after n fixed mutations, with n indicated by an arrow. (B) Zooming in on the tumorigenesis threshold, we see that the area of the division rate density that is over the tumorigenesis threshold increases at first and then decreases with subsequent mutation. There is a change in slope of the densities at the tumorigenesis threshold because subsequent densities are calculated from the previous density which has had the area to the right of the tumorigenesis threshold removed and the area to the left renormalized to 1. (C,D) are the same as (A) and (B), respectively, but are for the human scenario. The larger population size decreases the strength of drift. Order of mutations in (C) proceeds as in (A) and proceeds from 1 through 8 from bottom to top in (D). (E) The expected values of the probability densities in (A) and (B) divided by their original values over subsequent fixed mutations.

The probability that a particular fixed mutation will result in tumorigenesis in the crypt, pn (eqn (5)), is equal to the area under these densities that crosses the tumorigenesis threshold (Fig. 2B,D). For the initial parameterization in mice and humans, this increases at first, but then decreases with subsequent fixed mutations as the probability densities describing division rate move away from the tumorigenesis threshold.

Predicted incidence curves in mice and humans using DFE derived from a whole organism

Using the model described in “Selection‐assumptions", we determined the cumulative probability distribution of tumorigenesis within a population of crypts in an individual organism. For mice, using the initial parameters in Table 1 and exponentially distributed beneficial fitness effects, we find that the incidence of tumorigenesis is predicted to increase linearly with age, with close to nine percent of mice experiencing tumorigenesis at 3 years of age (Fig. 3A). Human tumorigenesis incidence in the large intestine is predicted to be approximately 36% at 80 years of age (Fig. 3B), using an exponentially distributed beneficial fitness effects and the initial null parameters from Table 1.

Table 1.

Initial model parameters, combining whole organism DFE with organismal crypt parameters. See text above for reasoning behind initial parameter choices

Parameter Description Value in mouse (Ref) Value in human (Ref)
PB
Percent of mutations with a beneficial effect 0.0575 (Joseph and Hall 2004) 0.0575 (Joseph and Hall 2004)
s+
Effect size of a mutation of beneficial effect 0.061 (Joseph and Hall 2004) 0.061 (Joseph and Hall 2004)
s
Effect size of a mutation of deleterious effect 0.217 (Zeyl and DeVisser 2001) 0.217 (Zeyl and DeVisser 2001)
μ Mutation rate per genes influencing fitness per division 2×6.3×105 (Joseph and Hall 2004) 2×6.3×105 (Joseph and Hall 2004)
λ0
Normal stem cell division rate per day 0.2 (Kozar et al. 2013) 0.143 (Potten et al. 2003)
ν0
Normal stem cell differentiation rate per day 0.333 (this study) 0.321 (this study)
N Number of stem cells in the stem cell niche at the base of the crypt 6 (Kozar et al. 2013) 20 (Nicolas et al. 2007)
NT
Total number of stem cells expressing stem cell markers in the crypt 15 (Clevers 2013) 36 (Bravo and Axelrod 2013)
Crypts Number of crypts in the small and large intestine, respectively 7.5×105, 4.5×105 (Potten et al. 2003) 5×107, 2×107 (Potten et al. 2003)

Figure 3.

Figure 3

Tumorigenesis incidence in mice and humans using whole organism DFE parameters. (A) The population incidence of tumorigenesis throughout the entire intestinal tract of the mouse. (B) The population incidence of tumorigenesis throughout the large intestine in humans. The black dashed lines are generated from the species specific parameters listed in Table 1. The solid red line connects large intestine polyp incidence data found during autopsy (Chapman 1963).

The only incidence data for early tumors or polyps were found for the large intestine in humans. The predicted incidence curve derived from an exponentially distributed DFE follows the same qualitative dynamics as the tumor incidence data. Incidence curves that are derived from a power‐law distribution using the initial parameters in Table 1 predict nearly 100% tumorigenesis by 80 years of age and do not follow the incidence data dynamics. Hence, we performed a least squares analysis, varying parameters that have not been characterized for human somatic tissue, to find the parameter set in our exploratory space with the best fit to the observed incidence curve to the data.

Altering the expected beneficial fitness effects and the mutation rate provides better fits for both exponential and power‐law derived incidence curves

The expected mean fitness effects (s+,s) of the DFE and the mutation rate (μ) per division of a mutation that alters the stem cell fitness were inferred from whole organisms as an initial parameter choice (Table 1). A parameter space around the initial choices was explored, and a least squares analysis was performed to find a better fit to the data (additional information in the Appendix, Figs S1–S3). Just the mutation rate (μ) and expected beneficial fitness effect (s+) are presented because changes to the expected deleterious fitness effect (s) had little effect on the resultant tumorigenesis incidence curves. We found that both the exponential and power‐law scenarios can provide similarly good fits to the data, however, with distinctly different parameters. The exponential DFE derived curve provided the best fit with the same mutation rate as our initial choice (Table 1), with a slightly larger expected beneficial fitness effect (E[s+]=0.064, Fig. 4A, red dashed line). Interestingly, assuming the same expected beneficial fitness effect as in Table 1 and varying the mutation rate provides a reasonable fit with a slightly larger mutation rate (μ=1.75×104, 4A, blue dashed line). The power‐law DFE provided a similarly good fit to the incidence curve, but for a parameter space that assumes a much smaller expected beneficial fitness effect and a large mutation rate (E[s+]=0.044, μ=5×104, Fig. 4B red dashed line).

Figure 4.

Figure 4

Tumorigenesis incidence curves resulting from least squares parameter fitting. (A) Incidence curves derived from the assumption of an exponential beneficial DFE. The best fit to the data out of the explored parameter space has the same μ as the yeast reported in Table 1 and E[s+] = 0.064 (red dashed line). Black dashed lines derived from E[s+] = 0.064 and, from bottom to top, μ=7.5×105 to 1.75×104 by 2.5×105. Blue dashed line is the predicted incidence curve with the best fit with E[s+] = 0.061 (initial DFE derived from yeast reported in Table 1), which had μ=1.75×104. (B) Incidence curves derived from the assumption of a power‐law beneficial DFE. All parameters are the same as in Table 1, except E[s+] = 0.044 for each curve and, ranging from top to bottom, μ ranges from 4.5×104 to 5.5×104 by 2.5×105, with 5×104 providing the best fit.

Mutations that alter the differentiation rate of stem cells result in rapid aging and tumorigenesis

Mutations affecting differentiation rate influence the lifetime of a stem cell lineage. Mutations that increase differentiation rate will decrease the fitness of the lineage, while mutations that decrease differentiation rate increase fitness. Mutations affecting differentiation rate neutrally drift to fixation in the stem cell niche because the differentiation phenotype is not expressed in the niche, hence all cells divide at the same rate. Thus, the probability of fixation of mutations to differentiation rate is (1/N), regardless of mutational effect. We only considered an exponential mutational effect distribution because the distinction between exponential and power‐law distributions is only significant in prevalence of large deviations from the mean, and beneficial mutational effects in this scenario exist between ν0 and zero. Because all mutations that solely affect differentiation rate drift neutrally, and the majority of mutations decrease fitness (by increasing differentiation rate), the majority of fixed mutations move stem cell pools away from the tumorigenesis threshold (Fig. 5).

Figure 5.

Figure 5

The accumulation of probability densities describing stem cell differentiation rate. (A) Exponentially distributed fitness effects on differentiation rate using the parameters in Table 1 for the mouse. The first density is a green dashed line. Each probability density represents the differentiation rate of a fixed lineage after n fixed mutations, with subsequent mutations traveling away from the original differentiation rate. (B) Zooming in on the tumorigenesis threshold, we see that the area of the differentiation rate density that is over the tumorigenesis threshold decreases with subsequent mutation. There is a change in slope of the densities at the tumorigenesis threshold because subsequent densities are calculated from the previous density which has had the area to the left of the tumorigenesis threshold removed and the area to the right renormalized to 1. (C,D) are the same as (A) and (B), respectively, but are for the human scenario. Order of mutations in (C) proceeds as in (A). (E) The expected values of the probability densities in (A) and (B) divided by their original values over subsequent fixed mutations.

Mutational effects are typically described as a proportion of the phenotype they are affecting, and as such, the same DFE applied to a larger rate will have a larger absolute expected effect. The differentiation rate of stem cells displaced from the niche is necessarily larger than the intrinsic division rate because only a subpopulation of the entire stem cell population is exposed to committing to differentiation; however, all cells are dividing (Ritsma et al. 2014), and the stem cell population is maintained at a steady‐state equilibrium. Thus, mutations affecting differentiation rate in our model have a larger absolute effect for the same proportional change in rate when compared to the previous analysis on mutations to division rate. Hence, given a fixed mutation, we see a high incidence of tumorigenesis when the mutation affects differentiation rate (Fig. 6A,B). Fitting analyses along a range of plausible parameter space revealed a poorer fit when compared to mutations that alter division rate because mutations that alter differentiation rate will always result in large tumor incidence at early age.

Figure 6.

Figure 6

The tumorigenesis incidence resulting from stem cell mutational effects on differentiation rate. Calculations presented for tumor incidence in (A) mice, (B) humans, and (C) Best fit incidence curve in red; expected beneficial fitness effect of 0.057 and mutation rate of 2.5 ×106. The other curves have the same mutation rate but vary around the expected beneficial fitness effect by increments of 0.001.

Discussion

Whole organism DFE are sufficient to explain tumorigenesis

We hypothesized that mutations in somatic tissues would differ in their distribution, compared to unicellular whole organisms, because of the regulatory processes that control cell division and differentiation rates in multicellular organisms. However, we found that whole organism DFE were sufficient to account for patterns of tumorigenesis in the intestines. This suggests that somatic evolution is not unique, but instead is based on the same patterns of mutation that we see in whole organisms. Hence, the differences in evolutionary patterns between somatic tissues and whole organisms, such as the tendency of tissues to age via the accumulation of deleterious mutations while populations of whole organisms instead evolve to greater mean fitness in benign environments, arise as a consequence of the small populations of stem cells within multicellular organisms and the asexual nature of cell division. Somatic aging via mutation is thus akin to the action of Muller's ratchet, the accumulation of deleterious mutations in organisms that cannot eliminate them via recombination. Indeed, the ratchet acts more strongly than in populations of organisms as a result of the relative importance of drift versus selection in very small stem cell populations (i.e., niches). This raises the interesting question of why somatic tissues are organized in this way and whether small stem cell pools predominate to minimize tumorigenesis at the expense of aging, as has been suggested by Michor et al. (2003).

Given the role of well‐known large effect mutations in cancer, it is tempting, from a mathematical modeling point of view, to adopt a heavy‐tailed (infinite variance) distribution for the DFE. In contrast to the DFE employed for modeling populations of whole organisms (e.g., an exponential distribution), which tend to exhibit small incremental changes, a heavy‐tailed regime enables a significant contribution from “one‐shot” large mutations. To probe this possibility, we included in our simulations a power‐law (Pareto) distribution which, through its shape parameter α, can be either heavy‐tailed (1 < α ≤ 3) or not (α > 3). It is noteworthy, then, that the best fit parameters were very far from the heavy‐tailed regime (α ≈ 16). The prevalence of large outlier mutations for such a distribution is comparable to what would be seen from an exponential distribution, meaning that the heavy‐tailed regime is not an appropriate modeling framework to explain the data.

Small populations and genetic drift lead to aging

One of our primary findings is that mutation effects drive crypt aging as much, if not more so, than tumorigenesis. Tumor formation and aging are two manifestations of the accumulation of cellular genetic damage. This damage is especially relevant to the aging process when it affects the functional competence of stem cells and compromises their ability to replenish the various cell populations of their constituent tissue (López‐Otín et al., 2013). Mutations to stem cells that result in aging have been associated with diminishing the stem cell's potential to proliferate (Rossi et al. 2007), competitively exclude healthy stem cells (Nijnik et al. 2007), and self‐renew or differentiate (Jones and Rando 2011; Moskalev et al. 2013). These effects on stem cell dynamics would decrease the number of functional stem and nonstem cells in tissues, thus resulting in tissue aging, as defined in aging reviews and experimental work (above) and previous mathematical models (Wodarz 2007). As mutations become fixed in the intestinal stem cell niche, the expected value of the probability density describing new stem cell lineage division rates decreases when we consider mutations that affect division rate, and the expected value for differentiation rate increases when we consider mutations that affect differentiation rate, and thus, crypts are predominately aging. The intestinal stem cell niche is maintained at a population size smaller than the effective population sizes of whole organisms and our findings derive directly from this population structure.

Our study, which emphasizes small healthy crypt populations, contrasts with previous studies that have investigated the accumulation of deleterious mutations in somatic tissue. These studies have looked at larger initial population sizes and in effect model hyperplasia or growing tumors. For instance, McFarland et al. (2013) modeled populations with an initial population size of approximately 1000 cells based on estimates from hyperplasia in mice 2 weeks after APC deletion. Similarly, McFarland et al. (2014) investigated mutation accumulation in models of hyperplasia and growing cancers. Datta et al. (2013) modeled deleterious mutations in housekeeping genes in an exponentially growing tumor initialized at 1×106 cells. Beckman and Loeb (2005) assumed their population of cells was sufficiently large to ensure a deleterious mutation of any strength could not become fixed. These approaches are useful to describe tumor growth in initiated tumors but fail to capture the relative importance of drift in evolving stem cell niches and the process of tumorigenesis from healthy stem cell niches, which exist as very small populations.

We find that crypts with fixed mutations are distributed along a range of both aging and tumor formation. The expected value of the division rate density moving away from the tumorigenesis threshold causes the probability of tumorigenesis per fixed mutation to eventually decrease. For example, in mice, there is a smaller probability that the fourth fixed mutation in a stem cell niche will result in tumorigenesis when compared to the third mutation in the exponential beneficial DFE scenario. The human intestinal crypt stem cell niche consists of a larger number of stem cells so drift plays a smaller role in the evolutionary trajectory of these crypts. Nonetheless, the mode of the distribution of division rate still moves away from the tumorigenesis threshold, albeit at a slower rate.

Although our model assumes that the size of the stem cell niche (N) remains constant and mutations only change the division rate or differentiation rate of lineages, it is possible that mutations could alter the niche size. If mutations altered the size of a crypt's stem cell niche, they would change the probability of fixation of subsequent mutations.

Mutations that only affect differentiation rate do not match incidence data curves

Analyses of colon cancer genomes from different individuals reveals that a small number of genes, associated with large fitness advantage, are commonly mutated among cancers (Wood et al. 2007). For instance, many colon cancers contain cells that have mutations in genes involved in the Wnt‐signaling cascade responsible for maintaining “stemness" (Clevers and Nusse 2012). A study by Smith et al. (2002) found that 56% of 106 sequenced tumors had mutations in the APC gene, which, when nonfunctional, results in the activation of the Wnt cascade (Reya and Clevers 2005). Additionally, cancers that have a mutation in the APC gene tend to have the mutation distributed throughout the tumor, suggesting the mutations occurred early in tumor growth (Sottoriva et al. 2015). Because the Wnt‐signaling cascade is involved with maintaining a stem cell phenotype, mutations in this cascade would influence the propensity for stem cells to differentiate. Additionally, Smith et al. (2002) also found that 61.3% of colorectal cancers had mutations in p53, involved in regulating apoptosis, and 27.4% of colorectal cancers had mutations in K‐ras, thought to drive cancer growth by accelerating stem cell division and leading to enhanced crypt fission (Snippert et al. 2014).

Our modeling scenario of mutations only having an effect on the differentiation rate of stem cells and having effect sizes equal to those measured in whole organisms results in rapid tumorigenesis, with nearly 100% of human individuals having a polyp in their large intestine at young age. Indeed, individuals with familial adenomatous polyposis (FAP), who already have a germline mutation in one copy of their APC gene and only need one mutational hit on the other to form an adenoma, regularly develop adenomas as teenagers (Bozic et al. 2010). In our model, the large tumorigenesis incidence associated with mutations solely affecting differentiation rate is due to both the mutations having a larger absolute effect toward the tumorigenesis threshold and there being a higher overall probability of fixation of new mutations among all the crypts due to the mutations fixing through neutral drift. Even when we decrease the expected beneficial mutational effect size and decrease the mutation rate in an attempt to better fit the tumorigenesis incidence data, we find that mutations only affecting differentiation rate still result in more tumorigenesis than predicted at young age. However, the data were derived from autopsies on individuals >10 years of age, so data for tumorigenesis are lacking in this age group. Additionally, we modeled scenarios where mutations only affect division or differentiation, nature is certainly more complex, and mutation in both differentiation and division rates are likely to co‐occur within a crypt population. Indeed, the APC protein discussed above contributes directly or indirectly to cellular division, differentiation, migration, cell orientation, and apoptosis (Dikovskaya et al. 2007; McCartney and Näthke 2008).

We model stem cell dynamics and mutational effects on those dynamics as a property that is controlled by an individual stem cell's genome, that is, a stem cell's heritable ability to produce or respond to internal signals, or respond to external signals, to divide or differentiate. The external signals regulating stem cell phenotype, such as Wnt signals produced by Paneth cells in the small intestine (Clevers 2013), are produced by cells differentiated from stem cells. Mutations to the stem cell genome may eventually influence the production of these signals in daughter cells. These mutations would drift neutrally in the niche, as they are not expressed until after stem cell differentiation, and, unless the lineage harboring the mutation reaches fixation in the niche, would eventually be lost from the crypt because Paneth cells die in approximately 20 days (Bry et al. 1994). Thus, mutations that result in differential signaling output by daughter cells can be modeled as neutrally fixed mutations acting intrinsically in the stem cells.

The influence of organism specific factors on somatic evolution

We find less tumor incidence in mice than humans throughout their respective lifetimes using the same DFE parameters. Mice only live a few years and have an order of magnitude fewer crypts in their entire intestine than humans have in just their large intestine (Potten et al. 2003). They also have smaller numbers of stem cells within their crypts, although those stem cells are dividing at a faster rate than human stem cells. Overall, this results in a lower chance of mutant lineages reaching fixation within crypts during the shorter mouse lifetime, and therefore, a reduction in the overall number of crypts with fixed mutations is lower. For instance, using the distribution of fixed mutations derived in the Appendix, at 2 years old, a mouse is expected to have about 75 crypts with two mutations, and only about 28% of mice will have a single crypt with three mutations. At 85 years old, a human is expected to have about 44 crypts with five mutations, one crypt with six mutations, and about four percent of humans at 85 years old will have a crypt with seven fixed mutations. As humans age they experience more fixed mutations, each of which confers a higher probability of tumorigenesis than the previous, whereas mice are expected to experience the accumulation of fewer mutations, possibly explaining the near linearity of the mouse incidence curve and the upwards curvature of the human incidence curve. Of note, given that a tumorigenesis event has occurred, it is likely the product of one mutation in the mouse model, whereas multiple mutations may contribute to the initiation of a tumor in the human model (the Appendix, Fig. S4).

The incidence of polyps at autopsy reported by Chapman (1963) was based on visual observations of discernible elevations of the mucosa in the entire large intestine during autopsy. It would take time for an initiated tumor to grow to a visible mass, so the true tumorigenesis incidence curve may lie in front of the data recorded in this study, with a lag time of growth before the tumor is visible. This lag time would be a function of the individual mutational spectrum of the initiated tumor and the tumor's environment.

Overall, we have shown that small homeostatic populations of stem cells, typical of somatic tissues in multicellular organisms, accumulate mutations that affect cellular fitness, contributing both to aging and tumorigenesis over an organism's lifetime. We show that the evolution of intestinal stem cell populations under the assumption of an organismal DFE, as opposed to the assumption of a heavy‐tailed beneficial DFE, best predicted early tumor formation. However, aging, rather than tumorigenesis, predominated among crypts in the intestine. Our modeling approach emphasizes tumorigenesis in the context of aging, and vice versa, and demonstrates the importance of mutational processes within very small populations in both these phenomena.

Data archiving statement

All data used in this manuscript have already been published or archived elsewhere.

Supporting information

Figure S1. Heat map depicting values for the least squares analysis of predicted tumor incidence and human tumor incidence data for the exponential beneficial DFE on division rate scenario.

Figure S2. Heat map depicting values for the least squares analysis of predicted tumor incidence and human tumor incidence data for the power‐law beneficial DFE on division rate scenario.

Figure S3. Heat map depicting values for the least squares analysis of predicted tumor incidence and human tumor incidence data for the mutations affecting differentiation rate scenario.

Figure S4. Mutation profiles of a tumor at the onset of tumorigenesis.

Figure S5. The simulated stem cell dynamics within a human crypt for both the displaced stem cells (A,B) and the stem cell niche (C,D) showing a fixation event of a mutant lineage.

Acknowledgement

We thank Charlie Baer, Andres J. Garcia, and Jake M. Ferguson for useful discussion. This research was partially supported by the National Science Foundation under Grant No. 0801544 in the Quantitative Spatial Ecology, Evolution and Environment Program at the University of Florida.

Description of the mathematical methodology

In this section, we describe the mathematical model underlying our research approach. It is a multiscale model, including the dynamics of the stem cell niche, the consequences for the larger stem cell population and a crypt, the dynamics in a population of crypts that comprise an individual's colon, and the dynamics of tumorigenesis among many individuals in a population. Due to the very large number of crypts in the colon and the desire to analyze population level incidence curves, we used several principles of rare‐event analysis in our numerical computations and also introduced a few approximations to make computations tractable.

Population dynamics within the stem cell niche

First, we develop a model for the population dynamics of a crypt immediately after a mutation has occurred. Suppose that there are N cells in the stem cell niche and let X(t) represent the number of cells that are descended from the original mutated cell at time t. We model X(t) as a continuous time Markov chain (CTMC) that takes its values in the set {0, … ,N} with X(0) = 1.

In accordance with the Markov process assumption, the time between divisions of a given stem cell is independent of all other cells and exponentially distributed with rate parameters λold or λ for the old and the new lineages, respectively. When a cell divides, we assume that there is crowding in the stem cell niche and an old cell is forced out. [In fact, the actual order of events remains unclear. It has also been hypothesized that a cell may leave the niche first, then triggering a cell division to replace it (Lopez‐Garcia et al. 2010, but see Ritsma et al. 2014).] Whether or not the value of the process X(t) changes depends on whether the cell that has been forced out is from the same lineage as the one that divided. The assumption that leads to the simplest mathematical model is nearest neighbor displacement. There are two cases (i) the dividing cell is of the same lineage as both of its neighbors, and (ii) the dividing cell is adjacent to a cell of the opposing lineage. In the first case, the value of X(t) does not change as a result of the cell division. In the latter case, there is a one‐half probability that a cell of the opposing lineage will be displaced. As such, for X(t) ∈ {1,2, …, N−1}, the Markov transition rates are given by

Nearest neighbor displacement:X(t)X(t)+1at rateλX(t)X(t)1at rateλ0. (9)

(The rate of one of the two border cells dividing is 2λ for the new lineage and 2λold for the old lineage and then each is multiplied by the one‐half probability of displacing an opposing lineage cell.) An alternate hypothesis is that after division, any other cell in the crypt might be displaced. The corresponding transition rates would be

Nonlocal displacement:X(t)X(t)+1at rate1NX(t)(NX(t))λX(t)X(t)1at rate1NX(t)(NX(t))λold (10)

The probability of fixation is actually the same for both models (though the expected time until fixation will differ). Let {t1,t2,} be the sequence of times when X(t) changes values. Disregarding the role of time in the process, we track the values with the process {Xn}n0 defined by Xn:=X(tn). The probability of a transition XX+1 is the rate at which the size of the mutant lineage increases divided by the total rate of change in lineage count. For both models, this probability of an increase in the mutant lineage size is p=λ/(λ+λold). Using the classical theory of hitting probabilities for biased random walks (Wodarz and Komarova 2005), one can readily derive the probability pfix(λ;λold) recorded in eqn (3) in the main text.

The intervals between mutations that fix in the stem cell niche

The DFEs used in this work are both considered in terms of percentage increase or decrease, rather than in terms of absolute quantities of change. In mathematical terms, this means that the densities can be expressed in terms of the ratio λ/λold. A remarkable consequence of this assumption is that the probability of a new lineage fixing in the niche is independent of the prevailing division rate λold. To see this, consider the probability of that a new lineage fixes after a mutation drawn from the exponential DFE. Recalling eqn (3), we note that the probability of fixation formula can be written in terms of the ratio of the new to the old division rate, r=λ/λold,

pfix(λ;λold)=pfix(r)=1r11rN.

We can then write

p^=PFixation|λold=0pfix(λ;λold)m(λ;λold)dλ=0λoldpfixλλold1PBβλoldeβ(1λλold)dλ+λoldpfixλλoldPBαλoldeα(1λλold)dλ=01pfix(r)1PBβeβ(1r)dr+1pfix(r)PBαeα(r1)dr,

which is independent of the choice of value λold. A similar result holds for the power‐law DFE. Generally, this property holds for any DFE that can be expressed in terms of the ratio λ/λold. It follows that the number of mutations that must occur in order for a new division rate to fix is distributed Geometrically with success probability p^. By standard properties of CTMC, we can then say that the time between the arrivals of “successful” mutations is exponentially distributed with rate parameter μp^λoldN.

Population dynamics outside the stem cell niche

Once outside the niche, a stem cell can either divide (at rate λ), or it can differentiate into transient amplifying cells (at rate ν). For the purposes of this model, we consider differentiated cells to be dead. There is a chance that the lineage of a stem cell outside the niche can undergo sufficiently many mutations to cause tumorigenesis, but we found by way of numerical investigations that this does not significantly contribute to overall incidence of these cancers. As such, let Y(t) denote the number of stem cells outside the niche that have not yet differentiated. Assuming, for the moment, that all members of the stem cell niche have a division rate λ, the CTMC Y(t) is defined by the transition rates

Y(t)Y(t)+1at rate(N+Y(t))λY(t)Y(t)1at rateY(t)ν.

The form of the rate of increase follows from the observation that Y(t) increases anytime a stem cell divides, whether that stem cell is in the crypt or not. On the other hand, because stem cells in the niche are assumed to not differentiate, the total rate of decrease is proportional to the number of stem cells outside the niche. Because the population size is so small, there is high variability and we note that Y(t) can regularly hit the value zero. Because the niche is protected by unrelated biological processes, this does not constitute extinction of the full stem cell population. As soon as another stem cell in the niche divides, the population outside the niche is renewed again. A typical trace for Y(t) can be seen in Fig. S5.

The law of this CTMC, yn(t)=PY(t)=n, satisfies the system of master equations

ddtyn(t)=(N+(n1))λyn1(t)1n1(n)+(n+1)νyn+1(t)(N+n)λ+nνyn(t). (11)

One can then show that the mean y¯(t)=n=1nyn(t) satisfies the ODE

ddty¯(t)=Nλ+(λν)y¯(t). (12)

If λ < ν, this ODE converges to a steady‐state value /(νλ). Otherwise the mean diverges to infinity with exponential growth. For this reason, we consider this threshold to be the initiation of tumorigenesis.

An alternate way to view the dynamics is to note that each time a stem cell in the niche divides it creates a new independent lineage outside the crypt. Let Yj(t) be the number of living stem cells outside the crypt that are descended from (and include) the product of the jth stem cell division in the niche. As such Y(t)=j=1Yj(t). Each process Yj(t) can be understood as a branching process, with an offspring distribution that is geometrically distributed with “success probability" q = ν/(ν λ). (The number of offspring is determined by the number of times the cell divides before differentiating. This is a sequence of independent trials where the probability of having another offspring, rather than differentiating, is λ/(ν + λ).) As long as the mean of this offspring distribution is less than or equal to one, these lineages will eventually go extinct. Therefore, the critical stem cell division rate corresponds to when the mean of the offspring distribution (which can be shown to be (1 − q)/q = λ/ν) is less than one. In other words, the critical division rate λ is simply λ=ν.

Population dynamics in the crypt

Of course, tumorigenesis in a given crypt is exceedingly unlikely, even over the lifetime of an individual. We model the colon as a collection of C107 individual crypts that are mathematically identical and independent. The number of fixed mutations in the ith crypt at time t is denoted Mi(t), and let {λ0i,λ1i} denote the sequence of division rates that become fixed in the ith crypt at times {0,τ1i,τ2i}, respectively. It follows that the inter‐arrival times are independent and distributed as

τk+1iτkiExp(p^μλkiN). (13)

Whether tumorigenesis has occurred in the ith crypt will be tracked by the function χi(t), defined by

χi(t)=1,ifνmaxλki:kMi(t)0,otherwise.

That is to say, χi(t)=1 if tumorigenesis has occurred before time t. It follows that the time of first tumorigenesis in the colon is given by the time

T:=inft>0:i=1Cχi(t)1. (14)

The per capita population incidence curves are then just the cumulative distribution function of the random variable T, which can be expressed in terms of the individual crypt dynamics as follows:

PT>t=Pχi(t)=0for alli{1,,C}.

To prepare for our numerical approximation of this quantity, we introduce one last bit of notation, {Nm(t)}m=0, which represents the number of crypts that have seen the arrival of m new fixed lineages as of time t. Then

PT>t|Nm(t)=nmfor allm=m=0Pχ(t)=0|M(t)=mnm. (15)

These dynamics can be simulated by Gillespie's method (Gillespie 1977), but such an approach is computationally intensive. For this reason, we introduced a few simplifying assumptions. For example, we model the arrival rates of new fixed lineages in the crypts as being constant over time [having fixed rate μ^=p^μλ0N, rather than a sequence of rates given in eqn (13)]. This allows us to assume that the number of mutations in each crypt at time t is Poisson distributed with mean μ^t. With such a tremendously large number of crypts in the colon, it is in turn reasonable to assume that the number of crypts takes the form nmCPM(t)=mCeμ^t(μ^t)m/m!. To complete the derivation of eqn (8) in the main text, we truncate the infinite product in eqn (15) and note that in the notation of the main text, Pχ(t)=0|M(t)=m=qm.

DFE equations

To define the parameters of the system, we specified the probability PB of a beneficial (versus deleterious) mutation and the respective means s+ and s of the DFE conditioned on the event that the mutation is beneficial or deleterious. The form of the mean of the DFE depends on whether it is exponential or heavy tailed. The exponential form DFE has mean

E(λnew|λold)exp=λold(1+PBα(1PB)β)

while the heavy‐tailed DFE has mean

E(λnewλold)Pareto=PBα1α2λold+(1PB)λoldλoldβ.

The conditional means have the form

s+:=E(λnewλold,beneficial)exp=λold1+1αs:=E(λnewλold,deleterious)exp=λold11β

for the exponential DFE, and

s+:=E(λnewλold,beneficial)Pareto=λoldα1α2s:=E(λnewλold,deleterious)Pareto=λold11β

for the heavy‐tailed DFE.

Least squares analysis

We generated tumor incidence curves and used a least squares analysis to determine which set of these parameters best fit the tumor incidence data described in Chapman (1963). The best fit has the smallest sum of squared residuals of the parameter space explored in Figs S1–S3.

Tumor mutational profile

Figure S4 contains the probabilities that each individual mutational profile was the culprit in tumorigenesis given that tumorigenesis occurred in a single crypt. They were calculated using Bayes' theorem to compute the probability that a certain mutational load fixed in the crypt given that a tumorigenesis event happened,

PM(T)=nT=t=fT(tM(T)=n)P(M(T)=n)j=1n^fT(tM(T)=j)P(M(T)=j)

where we recall that M(t) is the number of fixed mutations as of time t, T is the precise time that tumorigenesis occurs in the crypt, and fT refers to density of the random variable T. The quantity P(M(T)=n) is the probability that tumorigenesis occurs exactly on the nth mutation, a quantity we defined earlier as pn and gave a recursive formula for in eqn (5) in the main text. To compute the quantity fT(tM(T)=n), note that because the arrival time of the nth mutation is independent of the event that it causes tumorigenesis, we have that fT(tM(T)=n)=fτn(t), where τn is the arrival time of the nth mutation. By hypothesis, τn is Poisson distributed with mean μ^t with μ^ being defined after eqn (8) in the main text. The distributions of mutational profiles given the tumorigenesis event occurred at a certain point in time throughout an organism's lifetime are given in Fig. S4.

Literature Cited

  1. Bank, C. , Hietpas R. T., Wong A., Bolon D. N., and Jensen J. D. 2014. A bayesian MCMC approach to assess the complete distribution of fitness effects of new mutations: uncovering the potential for adaptive walks in challenging environments. Genetics 196:841–852. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Barker, N. 2014. Adult intestinal stem cells: critical drivers of epithelial homeostasis and regeneration. Nature Reviews Molecular Cell Biology 15:19–33. [DOI] [PubMed] [Google Scholar]
  3. Beckman, R. A. and Loeb L. A. 2005. Negative clonal selection in tumor evolution. Genetics 171:2123–2131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Beerenwinkel, N. , Antal T., Dingli D., Traulsen A., Kinzler K. W., Velculescu V. E., Vogelstein B., and Nowak M. A. 2007. Genetic progression and the waiting time to cancer. PLoS Computational Biology 3:e225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bozic, I. , Antal T., Ohtsuki H., Carter H., Kim D., Chen S., Karchin R., Kinzler K. W., Vogelstein B., and Nowak M. A. 2010. Accumulation of driver and passenger mutations during tumor progression. Proceedings of the National Academy of Sciences of the United States of America 107:18545–18550. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bravo, R. and Axelrod D. E. 2013. A calibrated agent‐based computer model of stochastic cell dynamics in normal human colon crypts useful for in silico experiments. Theoretical Biology and Medical Modelling 10:66. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bry, L. , Falk P., Huttner K., Ouellette A., Midtvedt T., and Gordon J. I. 1994. Paneth cell differentiation in the developing intestine of normal and transgenic mice. Proceedings of the National Academy of Sciences of the United States of America 91:10335–10339. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Chapman, I. 1963. Adenomatous polypi of large intestine: incidence and distribution. Annals of Surgery 157:223–226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Clevers, H. 2013. The intestinal crypt, a prototype stem cell compartment. Cell 154:274–284. [DOI] [PubMed] [Google Scholar]
  10. Clevers, H. and Nusse R. 2012. Wnt‐catenin signaling and disease. Cell 149:1192–1205. [DOI] [PubMed] [Google Scholar]
  11. Datta, R. S. , Gutteridge A., Swanton C., Maley C. C., and Graham T. 2013. Modelling the evolution of genetic instability during tumour progression. Evolutionary Applications 6:20–33. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Dikovskaya, D. , Schiffmann D., Newton I. P., Oakley A., Kroboth K., O. Sansom , Jamieson T. J., Meniel V., Clarke A., and Näthke I. S. 2007. Loss of APC induces polyploidy as a result of a combination of defects in mitosis and apoptosis. Journal of Cell Biology 176:183–195. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Elena, S. F. , Ekunwe L., Hajela N., Oden S. A., and Lenski R. E. 1998. Distribution of fitness effects caused by random insertion mutations in Escherichia coli . Genetica 102‐103:349–358. [PubMed] [Google Scholar]
  14. Eyre‐Walker, A. and Keightley P. D. 2007. The distribution of fitness effects of new mutations. Nature Reviews Genetics 8:610–618. [DOI] [PubMed] [Google Scholar]
  15. Foo, J. , Leder K., and Michor F. 2011. Stochastic dynamics of cancer initiation. Physical Biology 8:015002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Gillespie, D. T. 1977. Exact Stochastic Simulation of couple chemical reactions. The Journal of Physical Chemistry 81:2340–2361. [Google Scholar]
  17. Halligan, D. L. and Keightley P. D. 2009. Spontaneous mutation accumulation studies in evolutionary genetics. Annual Review of Ecology, Evolution, and Systematics 40:151–172. [Google Scholar]
  18. Imhof, M. and Schlotterer C. 2001. Fitness effects of advantageous mutations in evolving Escherichia coli populations. Proceedings of the National Academy of Sciences of the United States of America 98:1113–1117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Jones, D. L. and Rando T. A. 2011. Emerging models and paradigms for stem cell ageing. Nature Cell Biology 13:506–512. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Joseph, S. B. and Hall D. W. 2004. Spontaneous mutations in diploid Saccharomyces cerevisiae: more beneficial than expected. Genetics 168:1817–1825. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Kassen, R. and Bataillon T. 2006. Distribution of fitness effects among beneficial mutations before selection in experimental populations of bacteria. Nature Genetics 38:484–488. [DOI] [PubMed] [Google Scholar]
  22. Kozar, S. , Morrissey E., Nicholson A. M., van der Heijden M., Zecchini H. I., Kemp R., Tavaré S., Vermeulen L., and Winton D. J. 2013. Continuous clonal labeling reveals small numbers of functional stem cells in intestinal crypts and adenomas. Cell Stem Cell 13:626–633. [DOI] [PubMed] [Google Scholar]
  23. Levy, S. F. , Blundell J. R., Venkataram S., Petrov D. A., Fisher D. S., and Sherlock G. 2015. Quantitative evolutionary dynamics using high‐resolution lineage tracking. Nature 519:181–186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Loeffler, M. and Grossmann B. 1991. A stochastic branching model with formation of subunits applied to the growth of intestinal crypts. Journal of Theoretical Biology 150:175–191. [DOI] [PubMed] [Google Scholar]
  25. Lopez‐Garcia, C. , Klein A. M., Simons B., and Winton D. J. 2010. Intestinal stem cell replacement follows a pattern of neutral drift. Science 330:822–825. [DOI] [PubMed] [Google Scholar]
  26. López‐Otín, C. , Blasco M. A., Partridge L., Serrano M., and Kroemer G. 2013. The hallmarks of aging. Cell 153:1194–1217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Lynch, M. 2010. Evolution of the mutation rate. Trends in Genetics 26:345–352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. McCartney, B. M. and Näthke I. S. 2008. Cell regulation by the Apc protein. Apc as master regulator of epithelia. Current Opinion in Cell Biology 20:186–193. [DOI] [PubMed] [Google Scholar]
  29. McFarland, C. D. , Korolev K. S., Kryukov G. V., Sunyaev S. R., and Mirny L. A. 2013. Impact of deleterious passenger mutations on cancer progression. Proceedings of the National Academy of Sciences of the United States of America 110:2910–2915. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. McFarland, C. D. , Mirny L., and Korolev K. S. 2014. A tug‐of‐war between driver and passenger mutations in cancer and other adaptive processes. Proceedings of the National Academy of Sciences of the United States of America 111:15138–15143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Merlo, L. M. F. , Pepper J. W., Reid B. J., and Maley C. C. 2006. Cancer as an evolutionary and ecological process. Nature Reviews Cancer 6:924–935. [DOI] [PubMed] [Google Scholar]
  32. Michor, F. , Frank S., May R. M., Iwasa Y., and Nowak M. A. 2003. Somatic selection for and against cancer. Journal of Theoretical Biology 225:377–382. [DOI] [PubMed] [Google Scholar]
  33. Moskalev, A. A. , Shaposhnikov M. V., Plyusnina E. N., Zhavoronkov A., Budovsky A., Yanai H., and Fraifeld V. E. 2013. The role of DNA damage and repair in aging through the prism of Koch‐like criteria. Ageing Research Reviews 12:661–684. [DOI] [PubMed] [Google Scholar]
  34. Nicolas, P. , Kim K.‐M., Shibata D., and Tavaré S. 2007. The stem cell population of the human colon crypt: analysis via methylation patterns. PLoS Computational Biology 3:e28. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Nijnik, A. , Woodbine L., Marchetti C., Dawson S., Lambe T., Liu C., Rodrigues N. P., Crockford T. L., Cabuy E., Vindigni A., Enver T., Bell J. I., Slijepcevic P., Goodnow C. C., Jeggo P. A., and Cornall R. J. 2007. DNA repair is limiting for haematopoietic stem cells during ageing. Nature 447:686–690. [DOI] [PubMed] [Google Scholar]
  36. Orr, H. A. 2003. The distribution of fitness effects among beneficial mutations. Genetics 163:1519–1526. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Orr, H. A. 2010. The population genetics of beneficial mutations. Philosophical Transactions of the Royal Society of London. Series B, Biological Sciences 365:1195–1201. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Potten, C. S. , Booth C., and Hargreaves D. 2003. The small intestine as a model for evaluating adult tissue stem cell drug targets. Cell Proliferation 36:115–129. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Reya, T. and Clevers H. 2005. Wnt signalling in stem cells and cancer. Nature 434:843–850. [DOI] [PubMed] [Google Scholar]
  40. Ritsma, L. , Ellenbroek S. I. J., Zomer A., Snippert H. J., de Sauvage F. J., Simons B. D., Clevers H., and van Rheenen J. 2014. Intestinal crypt homeostasis revealed at single‐stem‐cell level by in vivo live imaging. Nature 507:362–365. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Rokyta, D. R. , Beisel C. J., Joyce P., Ferris M. T., Burch C. L., and Wichman H. A. 2008. Beneficial fitness effects are not exponential for two viruses. Journal of Molecular Evolution 67:368–376. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Rossi, D. , Bryder D., Seita J., Nussenzweig A., Hoeijmakers J., and Weissman I. L. 2007. Deficiencies in DNA damage repair limit the function of haematopoietic stem cells with age. Nature 447:725–729. [DOI] [PubMed] [Google Scholar]
  43. Sanjuán, R. , Moya A., and Elena S. F. 2004. The distribution of fitness effects caused by single‐nucleotide substitutions in an RNA virus. Proceedings of the National Academy of Sciences of the United States of America 101:8396–8401. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Smith, G. , Carey F. A., Beattie J., Wilkie M. J. V., Lightfoot T. J., Coxhead J., Garner R. C., Steele R. J. C., and Wolf C. R. 2002. Mutations in APC, Kirsten‐ras, and p53‐alternative genetic pathways to colorectal cancer. Proceedings of the National Academy of Sciences of the United States of America 99:9433–9438. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Snippert, H. J. , van der Flier L. G., Sato T., van Es J. H., van den Born M., Kroon‐Veenboer C., Barker N., Klein A. M., van Rheenen J., Simons B., and Clevers H. 2010. Intestinal crypt homeostasis results from neutral competition between symmetrically dividing Lgr5 stem cells. Cell 143:134–144. [DOI] [PubMed] [Google Scholar]
  46. Snippert, H. J. , Schepers A. G., van Es J. H., Simons B., and Clevers H. 2014. Biased competition between Lgr5 intestinal stem cells driven by oncogenic mutation induces clonal expansion. EMBO Reports 15:62–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Sottoriva, A. , Kang H., Ma Z., Graham T., Salomon M. P., Zhao J., Marjoram P., Siegmund K., Press M. F., Shibata D., and Curtis C. 2015. A Big Bang model of human colorectal tumor growth. Nature Genetics 47:209–216. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Vermeulen, L. , Morrissey E., van der Heijden M., Nicholson A. M., Sottoriva A., Buczacki S., Kemp R., Tavare S., and Winton D. J. 2013. Defining stem cell dynamics in models of intestinal tumor initiation. Science 342:995–998. [DOI] [PubMed] [Google Scholar]
  49. Winawer, S. J. 1999. Natural history of colorectal cancer. The American Journal of Medicine 106: 3S–6S; discussion 50S–51S. [DOI] [PubMed] [Google Scholar]
  50. Wiser, M. J. , Ribeck N., and Lenski R. E. 2013. Long‐term dynamics of adaptation in asexual populations. Science 342:1364–1367. [DOI] [PubMed] [Google Scholar]
  51. Wloch, D. M. , Szafraniec K., Borts R. H., and Korona R. 2001. Direct estimate of the mutation rate and the distribution of fitness effects in the yeast Saccharomyces cerevisiae . Genetics 159:441–452. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Wodarz, D. 2007. Effect of stem cell turnover rates on protection against cancer and aging. Journal of Theoretical Biology 245:449–458. [DOI] [PubMed] [Google Scholar]
  53. Wodarz, D. and Komarova N. L. 2005. Computational Biology of Cancer. World Scientific, Singapore. [Google Scholar]
  54. Wong, W.‐M. , Mandir N., Goodlad R. A., Wong B. C. Y., Garcia S. B., Lam S.‐K., and Wright N. A. 2002. Histogenesis of human colorectal adenomas and hyperplastic polyps: the role of cell proliferation and crypt fission. Gut 50:212–217. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Wood, L. , Parsons D., and Jones S. 2007. The genomic landscapes of human breast and colorectal cancers. Science 318:1108–1113. [DOI] [PubMed] [Google Scholar]
  56. Zeyl, C. and DeVisser J. 2001. Estimates of the rate and distribution of fitness effects of spontaneous mutation in Saccharomyces cerevisiae . Genetics 157:53–61. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Figure S1. Heat map depicting values for the least squares analysis of predicted tumor incidence and human tumor incidence data for the exponential beneficial DFE on division rate scenario.

Figure S2. Heat map depicting values for the least squares analysis of predicted tumor incidence and human tumor incidence data for the power‐law beneficial DFE on division rate scenario.

Figure S3. Heat map depicting values for the least squares analysis of predicted tumor incidence and human tumor incidence data for the mutations affecting differentiation rate scenario.

Figure S4. Mutation profiles of a tumor at the onset of tumorigenesis.

Figure S5. The simulated stem cell dynamics within a human crypt for both the displaced stem cells (A,B) and the stem cell niche (C,D) showing a fixation event of a mutant lineage.


Articles from Evolutionary Applications are provided here courtesy of Wiley

RESOURCES