Skip to main content
Genetics logoLink to Genetics
. 2015 Jan 26;199(4):1213–1228. doi: 10.1534/genetics.114.171553

Stochastic Tunneling and Metastable States During the Somatic Evolution of Cancer

Peter Ashcroft *, Franziska Michor , Tobias Galla *,1
PMCID: PMC4391561  PMID: 25624316

Abstract

Tumors initiate when a population of proliferating cells accumulates a certain number and type of genetic and/or epigenetic alterations. The population dynamics of such sequential acquisition of (epi)genetic alterations has been the topic of much investigation. The phenomenon of stochastic tunneling, where an intermediate mutant in a sequence does not reach fixation in a population before generating a double mutant, has been studied using a variety of computational and mathematical methods. However, the field still lacks a comprehensive analytical description since theoretical predictions of fixation times are available only for cases in which the second mutant is advantageous. Here, we study stochastic tunneling in a Moran model. Analyzing the deterministic dynamics of large populations we systematically identify the parameter regimes captured by existing approaches. Our analysis also reveals fitness landscapes and mutation rates for which finite populations are found in long-lived metastable states. These are landscapes in which the final mutant is not the most advantageous in the sequence, and resulting metastable states are a consequence of a mutation–selection balance. The escape from these states is driven by intrinsic noise, and their location affects the probability of tunneling. Existing methods no longer apply. In these regimes it is the escape from the metastable states that is the key bottleneck; fixation is no longer limited by the emergence of a successful mutant lineage. We used the so-called Wentzel–Kramers–Brillouin method to compute fixation times in these parameter regimes, successfully validated by stochastic simulations. Our work fills a gap left by previous approaches and provides a more comprehensive description of the acquisition of multiple mutations in populations of somatic cells.

Keywords: stochastic modeling, population genetics, cancer, Moran process, WKB method


UNDERSTANDING the dynamics of an evolving population structure has long been the goal of population genetics. Several authors have constructed probabilistic models to study allele frequency distributions in populations subject to mutation, selection, and genetic drift (Fisher 1930; Wright 1931; Moran 1962). The mathematical analysis of these models leads to an improved understanding of the underlying system and has been crucial for the interpretation of the laws of evolution. This is most evident in the quantitative analysis of cancer, which has seen numerous studies throughout the 20th century that addressed the kinetics of cancer initiation and progression (Nordling 1953; Armitage and Doll 1954; Fisher 1958; Knudson 1971; Moolgavkar 1978). Due to these and other studies (see Weinberg 2013 for a review), we now know that human cancer initiates when cells within a proliferating tissue accumulate a certain number and type of genetic and/or epigenetic alterations. These alterations can be point mutations, amplification and deletion of genomic material, structural changes such as translocations, loss or gain of DNA methylation and histone modifications, and others (Weinberg 2013).

The dynamics of mutation acquisition is governed by evolutionary parameters such as the rate at which alterations arise, the selection effect that these alterations confer to cells, and the size of the population of cells that proliferate within a tissue. Much effort has been devoted to modeling these processes mathematically and computationally and to analyzing the rates at which mutations arise within precancerous tissues (Moolgavkar and Luebeck 1992; Nunney 1999; Gatenby and Vincent 2003; Michor et al. 2004; Haeno et al. 2009). In particular, several investigators have studied the dynamics of two mutations arising sequentially in a population of a fixed finite number of cells. This scenario describes the inactivation of a tumor-suppressor gene (TSG), which directly regulates the growth and differentiation pathways of the cells (Weinberg 2013). This may or may not lead directly to cancer. Cells in which the TSG is inactivated can take a variety of fitness values. For instance embryonic retina cells with an inactivated RB1 gene can proliferate uncontrollably and create retinoblastomas (Knudson 1971). By definition these cells have a higher fitness than the wild-type cells. Alternatively, if chromosomal instability (CIN) is taken into account, cells with deactivated TSG can have a lower fitness than the wild type (Michor et al. 2005). Empirical evidence for the exact fitness (dis)advantage conferred to cells as a result of accumulating mutations is in general difficult to obtain, since in vitro growth assays of nontransformed cells are challenging. For this reason and to provide general methods, the modeling literature has addressed a range of fitness values for single- and double-mutant cells (e.g., Michor et al. 2004).

Subsequent modeling work on mutation acquisition (Komarova et al. 2003; Iwasa et al. 2004; Proulx 2011; Haeno et al. 2013) has revealed a more detailed picture; a homogeneous population harboring no mutations can move to a homogeneous state in which all cells carry two mutations without ever visiting a homogeneous state in which all cells harbor just one mutation. This phenomenon is referred to as “stochastic tunneling” and represents an additional route to the homogeneous state with two mutations; the sequential route is still available to the system, but it becomes less likely in certain parameter regimes. In this context the term tunneling refers only to overlapping transitions between the homogeneous states; it does not imply a statement about the structure of the underlying fitness landscape. The process we refer to as tunneling is not limited to valley-crossing scenarios. Figure 1A provides a schematic illustration of the tunneling process.

Figure 1.

Figure 1

Stochastic tunneling and fitness landscape examples. (A) Schematic of stochastic tunneling. The population can reach the all-2 state via two routes. The first is the sequential fixation route in which the first mutation takes over the population, and where this is then followed by the second mutation. The second route does not visit the all-1 state. This is the stochastic tunneling route. The arrows are related to the change of state only and imply nothing about the fitness of these states. (B) Examples of fitness landscapes. The first example is the conventional landscape, with r1r0 and r2>max(r0,r1). The second example has r1>r2, and as a result this landscape permits a metastable state (a fixed point in the deterministic dynamics as discussed below).

As with much of the existing literature on the stochastic tunneling, our work is not just limited to the case of cancer initiation. Instead our results are related and applicable to more general scenarios in population genetics, including situations in which a heterogeneous population is maintained through mutation–selection balance or the case of Muller’s ratchet when increasingly deleterious mutations become fixed (Muller 1964).

So far, most analytical investigations of stochastic tunneling (Komarova et al. 2003; Iwasa et al. 2004; Nowak et al. 2004; Proulx 2011) have been limited to considering transitions between homogeneous (or monomorphic) states of the population, as indicated in Figure 1A. These investigations were performed assuming that cells proliferate according to the Moran process—a stochastic model of overlapping generations in which one cell division and one death event occur during each time step (Moran 1962). Nowak et al. (2004) analyzed the effect of the population size and mutation rates on the rate of appearance of a single cell with two mutations. These authors noted that for small, intermediate, and large populations, it takes two hits, one hit, and zero hits, respectively, for a cell to accumulate two mutations. Here a hit is defined as a rate-limiting step, such as the appearance of an alteration when mutation rates are small. Considering fixation of cells with two mutations, Komarova et al. (2003), Iwasa et al. (2004), and Haeno et al. (2013) obtained explicit expressions for the tunneling rate. Subsequently, Iwasa et al. (2005) used the assumption of independent lineages (i.e., individual lineages of cells harboring one mutation were considered to behave independently from each other) to compute the probability distribution for the time of emergence of a single second mutant in intermediate or large populations. Proulx (2011) used a similar branching-process approach to derive a further tunneling rate.

Other types of dynamics such as the Wright–Fisher process have been studied as well; see, e.g., Proulx (2011). In the Wright–Fisher process, cell generations are assumed to be nonoverlapping, so that many birth and death events occur during each time step (Ewens 2004). Using this process, Weinreich and Chao (2005) determined the critical population sizes for sequential fixation or stochastic tunneling, and Weissman et al. (2009) calculated the rate of tunneling as a function of the mutation rates, population size, and relative fitness of cells harboring one mutation. Finally, these results were extended to investigate the effects of recombination, or sexual reproduction, on the rate of stochastic tunneling (Weissman et al. 2010; Lynch 2010; Altland et al. 2011). These authors found that the time to establishment of the double-mutant cells can be reduced by several orders of magnitude when sexual reproduction is considered (Weissman et al. 2010).

Recently, Proulx (2012) studied stochastic tunneling in a model that was not built upon the homogeneous-state approach. The author constructed a mutational network to study gene duplication. Although the setting of the model is very different from the setting we consider here, the underlying principles are similar.

The existing approaches for the Moran model in an asexual population provide accurate analytical approximations for a subset of the parameter space. We present a systematic overview of the scope of existing quantitative work. Extensive regions of parameter space, up to date, have been left unexplored by analytical approaches. These are predominantly situations in which the double mutant is not the most advantageous in the sequence. Before the double mutant reaches fixation, the population must travel across a fitness hill or move constantly downhill in fitness space, as illustrated in Figure 1B. The dynamics can then become trapped in quasi-equilibria—a consequence of the mutation–selection balance (Crow and Kimura 1970). In these long-lived equilibria, the population is heterogeneous, and so previous approaches are not justified. Throughout our article, these states are referred to as “metastable.” When these states exist, fixation is driven purely by demographic fluctuations. We address this regime based on ideas from mathematical statistical physics. Specifically we employ the Wentzel–Kramers–Brillouin (WKB) method to derive quantitative predictions for fixation times. Examples of using the WKB method to describe the escape from metastable states include the computation of mixing times in evolutionary games (Black et al. 2012), investigating extinction processes in coexisting bacteria (Lohmar and Meerson 2011) or predator–prey systems (Gottesman and Meerson 2012), and investigating epidemic models (van Herwaarden and Grasman 1995; Kamenev and Meerson 2008; Dykman et al. 2008; Black and McKane 2011; Billings et al. 2013). In the presence of recombination, Altland et al. (2011) have shown that metastable states can appear when recombination rates are large, even if the double mutant is advantageous. The authors note that the time to escape across this “recombination barrier” increases exponentially with system size, as explained based on a WKB approach.

In this article, we first classified the generic types of behavior that can occur in a population of cells that acquire two subsequent mutations in a Moran process: we determined when metastable states occur, when fixation is driven by intrinsic noise as opposed to deterministic flow, and where in parameter space fixation occurs in several subsequent hits. This classification was achieved by systematically studying the underlying deterministic dynamics of the process. We then obtained expressions for fixation times in parameter regimes that could not be captured by previous methods, i.e., regimes in which metastable states are found. We thus employed the WKB method to provide a more complete analytical description of the fixation dynamics in these parameter regimes. Our work fills the gap left by the existing literature and leads to a more comprehensive understanding of mutation acquisition and stochastic tunneling in evolving populations.

The Model

We considered a well-mixed, finite population of N cells. Each cell can be of one of three possible types, labeled type 0, a wild-type cell harboring no mutations, type 1, a cell harboring one mutation, and type 2, a cell harboring two mutations. Initially, all cells are of type 0. The evolution of the population is determined by a Moran process (Moran 1962). During each elementary time step of this process, a cell is randomly chosen to reproduce proportional to its fitness. In the same time step another cell is randomly removed, such that the total population size remains constant. The daughter cell can either inherit its type from the parent or acquire a mutation during division. The relative fitness values of type 0, type 1, and type 2 cells are denoted by r0, r1, and r2. Without loss of generality, we use r0=1 throughout. The mutation rates u1 and u2 denote the probability that the daughter of a type 0 cell is of type 1 and the probability that the daughter of a type 1 cell is of type 2, respectively. We neglect all other combinations of mutations. The assumption of no backmutation is commonly used in the population genetics literature (Ma et al. 2008). It is justifiable since the human genome is very large, 3×109 bp, and the probability of mutating a specific base per cell division very small, 10101011 (Kunkel and Bebenek 2000). Therefore the chance of undoing a specific point mutation is vanishingly small. The probability that a second critical alteration occurs at a different locus is much higher.

In our model, finite populations will eventually reach a state in which all cells have acquired two mutations. This state is “absorbing”; i.e., once this state has been reached, no further dynamics can occur. There are of course physical processes beyond the second mutation. In precancerous tissues, for example, there will be a finite probability that cells progress from this state to accumulate further changes (see Discussion). These processes are not the focus of our work though, and so are not included in the model.

Let us denote the number of type 0, type 1, and type 2 cells by n0, n1, and n2, respectively; we have N=n1+n2+n3. The transition rates for the Moran process are given by

T10=(1u1)r0(n0/N)r¯n1,T20=(1u1)r0(n0/N)r¯n2,T01=u1r0(n0/N)+(1u2)r1(n1/N)r¯n0,T21=u1r0(n0/N)+(1u2)r1(n1/N)r¯n2,T02=u2r1(n1/N)+r2(n2/N)r¯n0,T12=u2r1(n1/N)+r2(n2/N)r¯n1. (1)

The quantity r¯=(r0n0+r1n1+r2n2)/N is the average fitness in the population. The transition labeled ij represents a process in which a cell of type i is replaced by a cell of type j. In a process labeled 10, for example, the state of the population changes from (n0,n1,n2) to state (n0+1,n11,n2). As an example, the first reaction rate, T10, in Equation 1 can be broken down as follows: a type 0 cell is chosen to reproduce at rate (n0/N)×(r0/r¯). The offspring does not mutate with probability (1u1). Finally, a type 1 cell is chosen to be removed at rate n1. The rates for the other processes can be interpreted analogously. We choose a continuous-time setup, and correspondingly all rates in Equation 1 scale linearly in the population size N. Simulations are carried out using a standard Gillespie algorithm (Gillespie 1977), and times are measured in cellular generations.

This process is described exactly by a master equation (van Kampen 2007), which governs the behavior of the probability, Pn(t), such that the population is in state n=(n0,n1,n2) at time t and is given by

P˙n(t)=v[TnvvPnv(t)TnvPn(t)]. (2)

The vector v indicates a change in the composition of the population due to the corresponding reaction, and P˙n(t) represents the partial derivative of P˙n(t) with respect to time. This equation states that the probability that the population is in state n at time t increases due to transitions into the state n and decreases due to transitions out of the state n.

The master equation contains full information about the stochastic population dynamics. In particular, the detailed statistics of the population at any time can be derived from it, and it captures effects driven by intrinsic noise, such as extinction and fixation. Obtaining a full solution of the master equation is difficult or impossible though in all but the simplest of cases. As a starting point, it is often useful to first consider the deterministic limit of infinite populations. In this limit, the distribution P˙n(t) is sharply peaked around its average, and so the dynamics reduces to a set of equations for this mean. This approach does not capture any of the stochastic effects. However, the types of stochastic trajectories that can be observed for different parameter sets are, to some extent, set by the underlying deterministic dynamics. We thus first analyze the deterministic limit of the model.

Deterministic Analysis and Types of Stochastic Behavior

In the limit N, the population evolves according to a deterministic set of equations. Writing xi=limNni/N, we have the relation x0+x1+x2=1, and the average fitness is given by r¯=r0x0+r1x1+r2x2. The equations governing the dynamics of the population are then

r¯x˙0=[(1u1)r0r¯]x0,r¯x˙1=u1r0x0+[(1u2)r1r¯]x1,r¯x˙2=u2r1x1+(r2r¯)x2. (3)

These equations can be derived systematically using a system-size expansion of the master equation (2); see, e.g., van Kampen (2007). Note that xi refers to the relative concentration of cells of type i in the population (not to be confused with the probability of being found in a homogeneous state of type i cells as studied in Iwasa et al. 2004 and Haeno et al. 2013). For example, x0=x1=x2=13 would indicate that the population is in a state in which all three types are present with equal numbers.

Given the relation x0+x1+x2=1, the dynamics has only two independent degrees of freedom. Time courses of the system can hence be thought of as a trajectory in a “concentration simplex,” as depicted in the satellite diagrams of Figure 2. Each point in the simplex represents one particular state (1x1x2,x1,x2) of the population. At points in the interior of the simplex all three types of cells are present in the population (xi>0 for i=0,1,2). Points on the edges of the simplex represent states in which one of the three types is not present, for example, x0=0 for points along the edge connecting the bottom-right corner of the simplex with the top corner. We refer to this as the 1–2 edge in the following, and similarly for the other edges. The three corners of the simplex represent the homogeneous states, i.e., x0=1 (bottom left corner), x1=1 (bottom right) and x2=1 (top corner).

Figure 2.

Figure 2

Phase diagram with satellite diagrams showing the deterministic flow. Fixed points are found on the 1–2 boundary and/or in the interior of the simplex. Boundary fixed points occur when r2<(1u2)r1 (regions I and II, mutation–selection balance between types 1 and 2). Stable interior fixed points occur when r2<(1u1)r0 and (1u2)r1<(1u1)r0 (regions II and III, mutation–selection balance between all three types). No fixed points are found in regions IV and V (beneficial type 2 mutation). The satellite diagrams show the deterministic flow, Equation 3. Thick (red) lines show the deterministic flow from the all-wild-type initial condition. Solid circles indicate stable fixed points, and the open circle for region II corresponds to a saddle point that is stable along the 1–2 boundary. In all cases, the point x2=1 is an absorbing state and is therefore a fixed point as well. Below each simplex we illustrate the shape of the fitness landscape, which generates each type of behavior.

The deterministic equations (3) have a trivial fixed point (a point (1x1*x2*,x1*,x2*) at which x˙i=0 for all i=0,1,2) at x2=1, corresponding to the absorbing state. The equations can have a further zero, one, or two nontrivial fixed points, depending on the values of the fitness parameters and the mutation rates. These fixed points correspond to points at which mutation and selection balance (Crow and Kimura 1970). Each fixed point can be either stable (i.e., attracting from all directions) or a saddle (attracting from some directions, repelling in others). The system is found not to contain any fully repelling fixed points (Appendix A). Figure 2 shows the deterministic dynamics in different parameter regimes, indicated as regions I–V. Below we discuss the stochastic behavior in each of these parameter regimes.

Region I (mutation–selection balance between type 1 and type 2 cells)

In region I, the deterministic dynamics flows toward a fixed point on the 1–2 edge of the concentration simplex (x0=0). The type 0 cells have the lowest fitness and are deterministically lost by selection. The fixed point is a consequence of mutation–selection balance between type 1 and type 2 cells (Crow and Kimura 1970). Writing r2=(1s)r1, the existence condition for the equilibrium (r2<(1u2)r1) reduces to u2<s. It is a well-known result from population genetics that this condition prevents the deterministic loss of the type 1 cells (Crow and Kimura 1970). The deterministic system gets stuck at this fixed point, but a finite population will eventually reach fixation in the all-2 state. At large but finite population sizes, the stochastic dynamics are expected to approximately follow the deterministic path shown in Figure 2 such that type 0 cells quickly become extinct. The lack of backward mutations means the population cannot depart from this edge and the problem reduces to one degree of freedom. The mutation–selection balance maintains the heterogeneous population state of type 1 and type 2 cells. The intrinsic noise then has to drive the system from this metastable state into the absorbing all-2 state against the direction of selection. Fixation times are expected to grow exponentially with the population size (Antal and Scheuring 2006; Mobilia 2010; Altland et al. 2011).

Region II (mutation–selection balance between all three types and, separately, between types 1 and 2)

In region II, the deterministic flow from the all-wild-type state is toward a stable fixed point in the interior of the simplex. This point corresponds to the mutation–selection balance point of all three species. There is a second fixed point located on the 1–2 edge, which corresponds to mutation–selection balance between types 1 and 2 in the absence of type 0 cells (analogous to region I). As type 0 cells have the highest fitness in this regime, selection is directed away from the 1–2 edge. Thus the fixed point on this edge is a saddle. As before the stochastic dynamics in finite populations will reach the all-2 state eventually. The population will closely follow the deterministic trajectory (see Figure 2) before reaching the metastable state about the stable interior fixed point. Here the mutation–selection balance maintains the heterogeneous state with all three species present. The population will fluctuate about this fixed point until it eventually overcomes the adverse selection and escapes. There are two possibilities for the subsequent behavior: (i) Type 0 cells become extinct and the population settles into the metastable state on the 1–2 edge. Intrinsic fluctuations enable the population to overcome the adverse selection along the edge and reach the absorbing all-2 state. This corresponds to sequential extinction, first of type 0 cells and then of type 1 cells. This process is equivalent to a minimal model of Muller’s ratchet (Muller 1964), in which the most advantageous phenotypes are sequentially lost. A trajectory of this type is illustrated in Figures 3, A and C. (ii) Cells of type 0 and type 1 can in principle go extinct (almost) simultaneously. The trajectory of the system then hits the 1–2 edge infinitesimally close to the all-2 corner of the simplex (here infinitesimally close means a distance of order 1/N away from the upper corner). It does not become trapped in the metastable state located on the 1–2 edge. In numerical simulations (data not shown) we find that this second path is realized only very rarely, and so our mathematical analysis of region II below focuses on sequential extinction.

Figure 3.

Figure 3

Routes to fixation in cases with a stable interior fixed point. (A) The concentration simplex for a combination of parameters in region II (r1=0.96, r2=0.91, u1=0.01, u2=0.01). The red line is the dominant path for sequential extinction, as obtained from the WKB analysis (see text). The thick purple line indicates the dominant path for simultaneous extinction of types 0 and 1, which is rare. Faint lines indicate the deterministic flow (Equation 3). The thin orange line represents the trajectory of a single stochastic simulation (N=300). (B) Dominant trajectory, flow lines, and stochastic trajectory for a combination of parameters in region III (r1=0.90, r2=0.97, u1=0.01, u2=0.01). Cells of types 0 and 1 go extinct (essentially) at the same time t44,000. The dominant trajectory as obtained from the WKB calculation (thick purple line) runs directly into the all-2 corner of the concentration simplex. (C) The concentrations of the three types of cells as a function of time. These are obtained from the same stochastic simulation as shown in the simplex A. A moving average has been taken to improve clarity. As seen from the data, cells go extinct sequentially: initially all three types are present, cells of type 0 go extinct at time t28,000 and cells of type 1 go extinct at time t45,000. (D) The concentrations (moving average) of the three types of cells as a function of time for the stochastic simulation shown in the simplex B, showing the simultaneous extinction of types 0 and 1.

Region III (mutation–selection balance between all three types)

In region III the deterministic dynamics has a single stable fixed point in the interior of the concentration simplex. This point again corresponds to the mutation–selection balance point of all three species. Large, but finite, populations will behave as discussed in case (ii) for region II. They will initially become trapped in the metastable state about the mutation–selection balance point, before intrinsic fluctuations eventually drive the system to the absorbing all-2 state. In region III, type 0 cells and type 1 cells go extinct at essentially the same time. If the type 0 cells become extinct first, then type 1 cells quickly become extinct as selection along the 1–2 edge is directed toward the absorbing state (r2>(1u2)r1). This is illustrated in Figures 3, B and D.

Regions IV and V (beneficial type 2 mutation)

In a subset of the parameter space, shown as regions IV and V in Figure 2, the deterministic flow from the all-wild-type state is directly to the absorbing all-2 state. For such model parameters we expect that fixation in finite populations will be quick as type 2 cells are favored by selection (and mutation). These scenarios agree with the theory of natural selection, in which the populations fitness increases over time (Fisher 1930). In region IV this is achieved by crossing a fitness valley, and in region V it is achieved by sequentially selecting the most advantageous phenotype.

Figure 4 illustrates which parameter regimes have previously been studied in the stochastic tunneling literature. These existing studies almost exclusively focus on regions IV and V, i.e., cases in which fixation is driven not primarily by demographic noise, but by the underlying deterministic flow. As mentioned above fixation is typically fast in regions IV and V. Based on similar studies in evolutionary game theory one would expect the fixation time to grow logarithmically with the population size, τlnN (Antal and Scheuring 2006), and this is indeed what we find in simulations (data not shown). The regions containing nontrivial fixed points are largely unexplored by previous investigations. Fixation is controlled by stochastic effects so that fixation times are large and broadly distributed. As we discuss below, fixation times grow exponentially with the population size in such cases. This is perfectly in line with the findings of Haeno et al. (2013), who point out that fixation in these regions takes a very long time. Efficient measurements of fixation time in simulations are hence difficult. Methods that require the numerical solutions of, for example, the backward Fokker–Planck equation or a backward master equation reach their limits here as well Haeno et al. (2013). The contribution of our work is to analyze precisely these previously inaccessible cases. We compute the fixation properties of systems in which the underlying deterministic flow has one or more attracting fixed points away from the absorbing all-2 state.

Figure 4.

Figure 4

Phase diagram highlighting previous work. The colored area roughly corresponds to the regions in Haeno et al. (2013) in which the fixation probability is successfully predicted [see Haeno et al. (2013, Figure 6)]. The southwest–northeast striped region, with r1 deleterious or slightly advantageous, and r2 very advantageous, is the approximate region of interest of the studies Komarova et al. (2003), Nowak et al. (2004), and Iwasa et al. (2005). These studies focused on the time to emergence of a single type 2 cell. The northwest–southeast striped region, with r1 neutral or deleterious, and r2 advantageous, is approximately the region of interest of Iwasa et al. (2004) and Proulx (2011). These studies were concerned with computing fixation times of the advantageous type 2 cells and rely on the assumption that the number of type 1 cells is small. Finally, the horizontal striped region approximately corresponds to the literature of crossing fitness valleys, notably Weinreich and Chao (2005), Weissman et al. (2009), Lynch (2010), and Altland et al. (2011). These studies are concerned with 1/N<r2r01.

WKB Analysis

Let us now analyze the dynamics in regions I, II, and III, i.e., situations in which the deterministic dynamics has one or two nontrivial fixed points. In large but finite populations these fixed points correspond to metastable states in which the effect of mutation and selection balance. The aim of the following analysis is to calculate the rate at which the population will escape this state and enter the absorbing state in which all cells harbor two mutations. To proceed with the analysis we make the following simplifying assumptions, which are justified by the previous deterministic analysis:

  1. We assume that the population first settles into a distribution about the mutation–selection balance point. This distribution is calculated below.

  2. We assume that the population will leak into the absorbing state on a very long time scale from this distribution. With this assumption we can also say that the time taken for the population to reach the metastable state is negligible when compared to the escape time.

With these assumptions we can compute [from the master equation (2)] the distribution about the mutation–selection balance point and the escape rate. These assumptions (and hence the subsequent analysis) require the selective pressure to be greater than the effect of noise, such that the metastable states are long lived. For this reason, the approach described here is valid only for large N, which satisfy this condition (the minimum value of N for which our analysis is valid is dependent on the remaining model parameters, but comparisons with simulation results show it is accurate for N100).

Mathematically we formulate the problem as follows: the shape of the distribution about the mutation–selection balance point is given by πn and is henceforth referred to as the quasi-stationary distribution (QSD) in line with existing literature (Assaf and Meerson 2010). The mean time taken to escape from the metastable state, τ, is much greater than the time taken to initially reach the metastable state tr, i.e., τtr. Provided this condition holds, we can assume that after a short time the probability to find the population in state n is given by

Pnnabs(ttr)πnet/τ,Pnabs(ttr)1et/τ. (4)

The exponential decay factor, et/τ, describes the leaking process from the metastable state into the absorbing state, nabs=(0,0,N). The second equation follows from normalization.

To find the mean fixation time of the type 2 cells, we substitute Equation 4 into the master equation (2) to obtain the quasi-stationary master equation (QSME)

01τπnnabs=vTnvvπnvTnvπn. (5)

For n=nabs (the absorbing state) we have

1τ=vTnabsvvπnabsv=T(0,1,N1)12π(0,1,N1)+T(1,0,N1)02π(1,0,N1), (6)

where we have used Tnabsv=0. Hence if we find the QSD πn by solving the QSME (5), we can determine the mean fixation time, τ, and the probability to have reached fixation by time t, Pnabs(t)=1et/τ.

By separating variables in Equation 4, we have reduced the complexity of the master equation (2) [time does not feature in the QSME (5)]. If we now replace the discrete variables n with continuous variables x=n/N, we further reduce the complexity. This continuous approximation is valid as we have already stated that we require N to be large. We now employ the WKB ansatz to represent the QSD as

πNx=ψ(x)=Cexp[NS(x)S1(x)+O(N1)], (7)

where S(x)O(1) is known as the action, and exp[S1(x)] (S1O(1)) is the so-called amplitude (Assaf and Meerson 2010). We have introduced C as a normalization constant. To find the QSD, we follow e.g., Assaf and Meerson (2010) and expand the QSME (5) in powers of N1. Further analysis can be carried out if the QSME has only one variable. This is relevant in regions I and II of our model, where the population must escape from a metastable state on the 1–2 edge of the concentration simplex. An example of the QSD obtained from this procedure is shown in Figure 5.

Figure 5.

Figure 5

Quasi-stationary distribution compared with simulation results. We show the approximation Equation 4 (solid lines; solid bars for n1=0) against the distribution of states n1 obtained from an ensemble of 105 simulation runs (symbols). Here the metastable state is located along the 1–2 boundary (region I: here the solution is obtained from the theory with S(x1) and S1(x1) given by Equation B7), and the normalization C given by Equation B9). Dashed lines correspond to the boundary-layer solution of the master equation, valid for 0n1<N1/2 (Equation B12). Note that the distributions away from n1=0 have been rescaled by factors of 100 for t=400 and 500 for t=2000 for optical convenience. The arrow indicates the location of the fixed point. Parameters are r0=1.0, r1=1.05, r2=1.0, u1=1×102, u2=2×102, N=200.

Escape from an interior metastable state can also be studied using the WKB approach. The QSME then retains 2 degrees of freedom, and explicit expressions for the action and escape time cannot be obtained. In the following we briefly describe the main steps for each of the two situations.

Analysis of mutation–selection balance between types 1 and 2

We first consider the case in which the population must escape from a metastable state on the 1–2 edge (regions I and II). As the population cannot depart from the 1–2 boundary, the system then reduces to 1 degree of freedom. We parameterize the system in terms of n1, such that n1=0 is the absorbing state (all cells of type 2) and n2=Nn1. The analysis now closely follows the work of Assaf and Meerson (2010), specifically their scenario A. The outcome of the analysis are expressions for the action S(x1), S1(x1) (determined up to an additive constant), and the normalization constant C. With this we find an explicit expression for the mean escape time, τ, from the metastable state. In region I in which there exists just a single deterministic fixed point, this time corresponds to the mean fixation time, as documented above. The fixation time in region II is discussed in the next section. This analysis is covered in detail in Appendix B; we here list only the main steps and the result of the solution procedure:

  1. Substitute the WKB ansatz (7) into the QSME (5), and then expand in powers of N1. The equation at leading order in N can be solved to find the action S(x1). The equation at next-leading order can be solved to find the function S1(x1). The QSD (7) is normalized by considering a Gaussian approximation of ψ(x1). Note that with expressions for S, S1, and C, we have derived the distribution about the mutation–selection balance point. An example of this is plotted in Figure 5.

  2. Find a so-called boundary-layer solution by recursively solving the QSME (5) for 0<n1N1/2, using Equation 6 to express the solution as a function of the expected escape time τ.

  3. Matching the recursive boundary-layer solution and the WKB solution in the domain 0<n1<N1/2 gives the expected escape time from the metastable state to the absorbing state,
    τ=2πr1r2Nu2(1u2)(r1r2)[(1u2)r1r2]2×exp{N[u2r1r2u2r1ln(u2(1u2)r12r1r2)+ln((1u2)r1)r2r2u2r1lnr2]}. (8)

Analysis of mutation–selection balance between all three types

When a fixed point exists away from the state-space boundaries, as found in regions II and III in Figure 2, the solution procedure described above is no longer viable; the equations recovered from step 1 above can no longer be integrated directly subject to the appropriate boundary conditions (see Appendix C). Instead, minimum-action Hamiltonian trajectories must be computed. Problems of this type are usually tackled in one of two ways: first, the equations of motion could be integrated using a shooting method to find the optimal (most likely) trajectory with a given final point (Kamenev and Meerson 2008; Black and Mckane 2011; Gottesman and Meerson 2012); second, the equations can be integrated using an iterative scheme, which converges to the optimal trajectory (Lohmar and Meerson 2011) connecting given start and end points. We found that the second method quickly converges for our problem, so results presented in the following use this method. The details of the procedure are described in Appendix C.

Using this method we calculated the action, S, accumulated over the trajectory from the stable interior fixed point to the boundary saddle fixed point in region II or directly to the absorbing state in region III. We then used a known result presented by Van Herwaarden and Grasman (1995) for the escape time from a metastable state,

τCNeNS, (9)

where the constant C can be found by fitting to simulation data. This expression has the same functional dependence on N as the one given in Equation 8.

In principle, one could numerically find the action of a trajectory starting at the stable fixed point and finishing at a general position x in concentration space. This would yield the values S(x), and the distribution about the metastable state is then given by ψ(x)exp[NS(x)]. This very technical analysis is tedious and beyond the scope of this article.

Results

We analyze the results separately for each region (I, II, and III) of parameter space. In particular we discuss the implications the model parameters have for the probability with which tunneling occurs, and for the fixation time of type 2 cells.

Region I

In this region, type 1 cells have a fitness advantage over both type 0 and type 2 cells, such that the fitness landscape has an intermediate maximum (a fitness “hill;” see Figure 2). As a result, type 0 cells are deterministically lost and the population relaxes to the QSD (i.e., the mutation–selection equilibrium) on the 1–2 edge, as described above, and probability slowly leaks into the absorbing state.

To test the accuracy of this approach, in Figure 5 we compare the QSD measured in simulations with the theoretical approximation. The data in the figure reveal good agreement between theory and model simulations for N1/2n1N, and for n1=0. In the region just above n1=0 the agreement between the theoretical result for Pn1 and simulation data breaks down; here the theoretical value from Equation 7 diverges. This is a result of taking the limit 1/τ0 in the QSME (5), in particular for the case n1=0, which corresponds to 1/τ=T(0,1,N1)12π1. The value of τ is crucial to determining the value of π1. When calculating the mean fixation time, τ, we circumvented this known problem by considering a so-called boundary-layer approach (Assaf and Meerson 2010). The boundary-layer solutions (dashed lines in Figure 5; for details of the calculation see Appendix B) show better agreement with simulation results close to n1=0 than the QSD obtained from the WKB ansatz (solid lines).

Results for the mean fixation time in region I are shown in Figure 6A. In Figure 6B, we plot the probability that type 2 cells have reached fixation by time t=104 (including fixation earlier than that). We refer to this quantity as the fixation probability. Fixation times are shown to increase exponentially with r1. This is a consequence of the increasing height of the selection “barrier,” which must be overcome for type 2 cells to reach fixation. Also, increasing r1 pushes the mutation–selection balance point toward the all-1 state. This results in a further increase in fixation time (or decrease in fixation probability). As the metastable state approaches the all-1 state, the probability of the population reaching the all-1 state due to demographic fluctuations increases. Thus increasing r1 decreases the probability of tunneling.

Figure 6.

Figure 6

Results for region I. (A) Mean fixation time of type 2 cells in region I. Symbols correspond to mean fixation time from simulations of the model (averaged over an ensemble of 2–5×103 samples) initiated in the all-wild-type state. Shape of symbol indicates fitness of type 2 cells (see legend); solid symbols are for u1=u2=102 and open symbols are for u1=u2=103. Solid lines (high mutation) and dashed lines (low mutation) are the WKB prediction for fixation time, Equation 8. The approximation breaks down when (1u2)r1r2, which is when the fixed point approaches the absorbing state. (B) Fixation probability of type 2 cells in region I, evaluated at time t=104. Lines correspond to the WKB prediction, Equations 4 and 8, and colors and symbols follow the same convention as in A. Remaining parameters are r0=1.0 and N=100.

Increasing the fitness of type 2 cells, on the other hand, pushes the metastable state closer to the absorbing state. This leads to a significant reduction in the fixation time (increase in fixation probability) as also shown in Figure 6A (Figure 6B). Increasing the mutation rate u2 has a similar effect to increasing r2; the mutation–selection balance point approaches the absorbing state, and the net effect of selection away from the absorbing state is reduced, leading to a decrease in the fixation time. In line with the previous literature (Iwasa et al. 2004; Haeno et al. 2013), increasing the mutation rate increases the probability of tunneling.

In Figure 6, A and B, the theoretical predictions from the WKB method are in excellent agreement with simulation results. This is the case even at the moderate population size of N=100. Small deviations occur when mutation rates are low (dashed lines and open symbols in Figure 6). The theory then slightly underestimates the fixation time (overestimates the fixation probability). This is a consequence of assuming that the population approaches the metastable state in a negligible amount of time. For very small mutation rates, it takes an increasing period of time for successful (i.e., nonvanishing) mutant lineages to appear. Deviations between theory and simulation results occur when (1u2)r1r2. At this point the theory breaks down as the fixed point on the 1–2 edge approaches the absorbing state. The barrier associated with adverse selection is then negligible and the assumptions underlying the WKB approximation are no longer justified.

Region II

To reach fixation in this region the population must accumulate successive mutations of lower fitness (r0r1r2). The population first approaches a metastable state corresponding to the mutation–selection balance point of all three species. From here there are two possible routes to fixation, sequential or (almost) simultaneous extinction of types 0 and 1, as described previously and shown in Figure 3. By computing the action accumulated along both routes, we have shown that the path of least action—the most probable path to fixation—corresponds to the path of sequential extinction. We treat this two-hit process as two separate problems: (i) escape from the interior metastable state to the boundary (loss of the advantageous type 0 phenotype) and (ii) escape from the boundary metastable state to the absorbing all-2 state (analogous to region I). A typical realization of this sequence of events is shown in Figure 3C.

As in region I, the probability of tunneling decreases as the fitness advantage of type 1 cells over type 2 cells increases. This is because the fixed point on the 1–2 edge approaches the n1=N state. For the same reason, the tunneling rate decreases as the mutation rates decrease.

Following the convention used by Gottesman and Meerson (2012), we labeled the time to reach the 1–2 boundary as τ32, indicating that the three-species system turns into a two-species system when the wild-type cell goes extinct. Similarly, the time to travel from the boundary fixed point (two species present in the population) to the absorbing state (one species) is denoted by τ21. With this notation we also labeled the action accumulated along each segment as S32 and S21. As τ21 is given by Equation 8, we express the mean fixation time of type 2 cells as

τ=C32NeNS32+τ21. (10)

The coefficient C32 is found by fitting to simulation data for the time taken to reach the 1–2 boundary as a function of the population size.

Small changes to the parameters now have significant effects on the fixation time, as shown in Figure 7 (solid symbols/solid lines). Increasing the fitness of the type 2 cells moves both the interior fixed point and the boundary fixed point toward the absorbing all-2 state. It also reduces the strength of selection away from the absorbing state. These combined effects dramatically reduce the mean fixation time and its rate of increase with the population size.

Figure 7.

Figure 7

Results for regions II and III. Fixation time in region II (solid symbols) and region III (open symbols) of Figure 2 as a function of system size from simulations, averaged over 100 realizations. Lines are from the theory, see Equation 10 for region II (solid lines) and Equation 9 for region III (dashed lines). Remaining parameters are r0=1.0 and u1=u2=102.

Region III

In this region fixation is controlled by the escape from the interior metastable state; it is a one-hit process. The stable interior fixed point, corresponding to the mutation–selection balance point of the three species, is located close to the all-wild-type state as type 0 cells are the most advantageous. The direct path from the metastable state to the all-2 state is the dominant (least action) path, as shown in Figure 3B. As a result, the probability of tunneling is higher than in the previous cases. It increases as the fitness of type 2 cells and the mutation rates increase as the stable interior fixed point moves to lower numbers of type 1 cells (away from the all-1 state).

The fixation time is computed from Equation 9, where S=S31 is the action accumulated along the direct path from the stable interior fixed point to the all-2 state. The coefficient C31 is again found by fitting to simulation data. We see in Figure 7 (open symbols/dashed lines) that varying the model parameters has a lesser effect on fixation times than in region II. In region III, fixation is a one-hit process—the population must only escape the stable fixed point—and not a two-hit process as in region II where the effects of the two steps are compounded. Contrary to the results for region I, the mean fixation time is a decreasing function of r1 in region III. This can be explained as follows: by increasing r1, the selection strength away from the 1–2 boundary decreases and the stable state moves to higher type 1 numbers, such that the population has an improved chance of reaching the 1–2 boundary. From there selection is directed toward the absorbing state, and the time spent on the 1–2 boundary is negligible compared to the time to reach this edge. Hence, the fixation time reduces as type 1 cells become more fit. The rate of increase of the fixation time with the population size reduces as well (see Figure 7).

Note that there are systematic deviations between theory and simulation results in the data set shown as open triangles in Figure 7, and to a lesser extent also for the data shown as open diamonds. This is attributed to the fact that the fitness parameters r1 and r2 are very similar to each other or equal for these instances, and they are also close to the fitness of the wild type. Selection is then close to neutral and the metastable state is only weakly attracting. The WKB approach then reaches its limits as the assumption of a long-lived metastable state begins to break down.

Discussion

In this article we investigated the fixation of two successive mutations in a finite population of individuals proliferating according to the Moran process. We discussed this in the context of the somatic evolution of a compartment of cells. The accumulation of two mutations can correspond to the inactivation of a tumor-suppressor gene or alteration of genes causing CIN (Weinberg 2013). If the cell carrying two mutations is deleterious, as can be the case with recessive CIN genes (Michor et al. 2005), it will generally have low concentrations within the tissue. Then the chance of a cancerous phenotype emerging (further mutation) is very low. Demographic fluctuations can drive the double mutant to higher numbers, but these states are short lived. If the double mutant reaches fixation, the state is maintained until a further mutation occurs and hence the chance of a cancerous phenotype emerging is much greater.

We first analyzed the deterministic limit of the evolutionary dynamics. We identified parameter regimes in which mutation and selection balance. These are regimes in which the double mutant is not the most advantageous in the sequence. In finite populations, this mutation–selection balance gives rise to long-lived metastable states. Our analysis identified the escape from these metastable states as the key bottleneck to fixation of cells with two mutations. For parameter values for which there is no mutation–selection balance (i.e., type 2 cells have the highest fitness), the fixation dynamics is largely governed by the deterministic flow. The rate-limiting steps are then the appearance of successful mutant lineages (Nowak et al. 2004), and the subsequent fixation of cells with two mutations is a zero-hit process. As such the progression from healthy tissue (all wild type) to susceptible tissue (all type 2; inactivated TSG) will be fast relative to the cases in which a mutation–selection balance exists. If there is one stable fixed point in the deterministic dynamics, the process becomes a one-hit phenomenon limited by the escape from the corresponding metastable state. In regions with two fixed points one observes a two-hit process. The population becomes trapped in a first metastable state, escapes to a second metastable state, and then reaches full fixation.

In addition to this qualitative classification, we calculated fixation times in parameter regimes previously inaccessible to existing analytical approaches. These are precisely the regions of parameter space in which mutation–selection balance exists. We used the WKB method to calculate the mean escape time from the corresponding metastable states to the absorbing all-type 2 state. This escape time is identified as the fixation time of type 2 cells. We tested the analytical expressions and numerical results obtained from the WKB approach against individual-based simulations of the population dynamics. Our theoretical predictions in principle rely on a limit of large but finite populations, and so they can be expected to be valid only for large enough populations. The comparison against simulations demonstrates the accuracy of our theory even at moderate population sizes of N=100 cells. For populations much smaller than this the assumptions of the WKB method break down. The rate-limiting step is then the occurrence of a successful lineage of mutants and not the escape from metastable states. The expressions obtained from the WKB approach become more accurate as the population size increases.

This analysis allowed us to classify how changes to the fitness landscape, mutation rates, and population size affect the probability of tunneling and the time-to-fixation of cells harboring two mutations. In terms of the development of tumors, our analysis shows that the path to accumulating mutations is not simply limited by the mutation rates, but also by escape from metastable states. Populations can exist in a heterogeneous state for very long periods of time before fluctuations eventually drive the second mutation to fixation. The probability with which stochastic tunneling occurs is, in part, determined by the location of these metastable states. If they are located close to the all-type 1 state, then the probability of tunneling is low. This occurs when cells with one mutation have a higher fitness than those with two mutations (regions I and II). The probability of tunneling decreases as the fitness gap between these two types of cell increases or as mutation rates decrease. The mean fixation time increases exponentially as the fitness gap increases. Cells with one and two mutations are present in the tissue compartment for long periods of time; their numbers are maintained by the mutation–selection balance. Cell types are lost sequentially. Wild-type cells can be driven to extinction by selection (region I) or by demographic fluctuations (region II). The extinction of type 1 cells is driven exclusively by fluctuations. When type 2 cells have a higher fitness than type 1 cells, and when both are less fit than the wild type (region III), selection is always against cells of type 1. Mutation–selection balance maintains a low concentration of type 1 cells in the tissue, and hence the probability of tunneling is high. In this regime the mean fixation time is a decreasing function of the fitness of type 1 cells. As for all escape problems from metastable states, the fixation time scales exponentially with the size of the population. Fixation is noise driven, and as the population size is increased the noise strength decreases, and hence fixation takes longer.

Although our theory is aimed at large population sizes and exponentially growing fixation times, we have shown that it can also make accurate predictions on biologically relevant timescales. Assuming a cell generation lasts for 1 day, our theory can capture fixation times of around 3 years or more (>103 generations). Related studies on the progression of cancer suggest a typical time scale on the order of 10 years to accumulate a sufficient number of mutations (Beerenwinkel et al. 2007; Bozic et al. 2010), which is well within the scope of our theory. However, the times predicted by our theory are extremely sensitive to parameter variation. This limits the parameter ranges for which biologically relevant time scales can be generated. Specifically selective (dis)advantages need to be small (10%). This is in agreement with selection coefficients in related studies (Bozic et al. 2010). Of course the length of a cellular generation can vary by an order of magnitude or so, depending on the specific cell type (Weinberg 2013).

Our results do, however, allow an extrapolation to situations when fixation times become very long, for instance, for very large populations and/or when selection is strongly against the invading mutants. In these scenarios, stochastic simulations can become too expensive computationally to provide meaningful measurements. Analytical methods based on backward master equations or backward Fokker–Planck equations suffer from computational limitations as well in such cases. Our mathematical work complements existing analytical approaches to the Moran model of cells acquiring two successive mutations. Previous work has provided an appropriate machinery with which to compute the time-to-fixation of the second mutation in situations without metastable states. The present article specifically addresses cases in which fixation is limited by the escape from long-lived states. This contribution closes a gap in the analytical characterization of fixation in this model and a more complete picture is now available. We have added a new method to the toolbox used to study stochastic tunneling. Our deterministic analysis provides a systematic procedure to determine which tool to use for a given set of parameters. This accomplishment removes the need for stochastic simulations altogether, or at the very least it limits the circumstances under which they are needed.

This work has clear limitations in that it focuses on the Moran model with only two successive mutations. We have not considered any processes beyond the second mutation; however, such cases can exist in physical systems. If the type 2 cells are not cancerous, one would be interested in, for example, calculating when a metastatic cell (three mutations) first arises. In general this does not require the fixation of type 2 cells and is related to the total number of type 2 cells over time. If metastable states are present, the cumulative number of type 2 cells prior to fixation is small, as described above. While we do not analyze this further, the typical number of type 2 cells at any time can in principle be computed from the quasi-stationary distribution, Equation 4.

Our systematic approach, along with the combined theoretical apparatus of previous work and the WKB method, is readily transferable to more complex models of cancer initiation and progression. One possible extension to this study is the generalization to more than two mutations. If a cell can accumulate d possible mutations, metastable states are found provided rd<maxi=0,1,,d1ri. A similar analysis can then be carried out. If the fitness landscape is arranged such that r0>r1>>rd, the problem is analogous to Muller’s ratchet (Muller 1964), which describes the accumulation of successive maladaptive mutations. Metzger and Eule (2013) recently studied a special case of this problem using a WKB approach. Finally Gokhale et al. (2009) studied valley-crossing dynamics with d possible mutations. They have shown that allowing multiple paths to accumulate, the mutations reduce the fixation time. As such, allowing multiple paths in our model could reduce the fixation times we have measured. Work along both of these lines is in progress.

Acknowledgments

P.A. acknowledges support from the Engineering and Physical Sciences Council (United Kingdom), EPSRC. F.M. acknowledges support from the Dana-Farber Cancer Institute Physical Sciences-Oncology Center (National Institutes of Health U54CA143798). T.G. acknowledges support from the EPSRC, grant reference EP/K037145/1.

Appendix A

Fixed Points and Stability

From the deterministic equations (3) it can be seen that the state x=(0,0,1) is a fixed point; i.e., x˙i=0 at this point for i=0,1,2. This is the absorbing state, so this result is rather obvious. Nontrivial fixed points exist away from the absorbing state in some parameter regions.

The stability of a fixed point is determined by the eigenvalues of the Jacobian of the deterministic equations (3). Due to the overall constraint x0+x1+x2=1, the system is effectively two dimensional. We can write the Jacobian in terms of two variables, x1 and x2, as

J(x1,x2)=(x1x˙1x2x˙1x1x˙2x2x˙2). (A1)

Along the 1–2 boundary of the concentration simplex, Equations 3 can be expressed in terms of a single variable, x1. A fixed point, x1*, on this boundary satisfies the equations

x˙1=[(1u2)r1r¯]x1*=0x˙2=u2r1x1*+(r2r¯)(1x1*)=0, (A2)

where r¯=r1x1*+r2(1x1*). These equalities are satisfied by the value x1*=1u2r1/(r1r2) (along with x2*=1x1*). The parameter range in which this fixed point exists is determined by the condition 0<x1*<1, which we can write as r2<(1u2)r1. The fixed point on the 1–2 edge therefore exists when type 1 cells have a fitness advantage over type 2 cells; the factor 1u2 accounts for effects of mutation. Increasing this fitness advantage moves the fixed point toward x1=1, or equivalently away from the absorbing state at x1=0. For vanishing mutation rate u2, the fixed point approaches the x1=1 state.

Evaluating the eigenvalues of the 2×2 Jacobian in Equation A1 at this fixed point, we find that the point is stable if (1u2)r1>(1u1)r0 and that it is a saddle if (1u2)r1<(1u1)r0. These two cases correspond to regions I and II in Figure 2.

A fixed point of Equations 3 with x0>0 is found as

x1*=[(1u1)r0r2]u1r0u2r1(r0r2)+(r0r1)[(1u1)r0r2],x2*=u1u2r0r1u2r1(r0r2)+(r0r1)[(1u1)r0r2], (A3)

provided the model parameters satisfy (1u1)r0>(1u2)r1 and (1u1)r0>r2. Further analysis of the Jacobian (A1) at this point shows that the fixed point is stable whenever it exists. This is the region of parameter space in which cells with one and two mutations, respectively, are both less fit than the wild type. This is the case in regions II and III in Figure 2. The fixed point moves closer to x0=1 when the fitness advantage of the wild type cells is increased (e.g., by lowering the fitness of type 1 cells, r10). Decreasing the mutation rates also moves the fixed point closer to x0=1.

Appendix B

Solving the Quasi-stationary Master Equation

In terms of the variable x=n/N, we can write the QSME (5) for nnabs as

01τψ(x)=v[Nwv(xvN)ψ(xvN)Nwv(x)ψ(x)], (B1)

where wv(x)=TNxv/N and ψ(x)=πNx. Substituting the WKB ansatz, Equation 7, into Equation B1 and expanding in powers of N1 we arrive at

0=vwv(x){evS(x)[1+1NvS1(x)12(v)2NS(x)]1}vevS(x)vNwv(x)+O(N2), (B2)

where we have ignored the term O((Nτ)1) as this term is smaller than O(N2) (τ scales as eN). The leading-order terms of this equation are equivalent to the Hamilton–Jacobi equation,

H(q,p)=vwv(q)(evp1)=0, (B3)

where q=x is the so-called “position” variable, and p=qS(q) is the so-called “momentum” variable. This equation is best solved using the method of characteristics; i.e., we look for parametric solutions, (q,p)=(q(t),p(t)). These trajectories fulfill Hamilton’s equations,

q=pH(q,p),p=qH(q,p). (B4)

They satisfy the principle of least action and correspond to the most likely path taken in the so-called phase-space, the space spanned by (q,p). The Hamilton–Jacobi equation has the trivial solution p=0, which corresponds to the deterministic “relaxation” trajectory, for which the equation of motion is simply

q=pH(q,p)|p=0=vvwv(q). (B5)

As we are interested in escape from a stable fixed point, we seek the nontrivial “activation” trajectory, for which pa(q)0 in general. The relevant boundary condition is pa(q*)=0, where q* indicates the fixed point of the deterministic dynamics from which the trajectory emanates.

In one dimension, i.e., in the case of a single fixed point on the 1–2 boundary (region I of Figure 2), the Hamilton–Jacobi equation (B3) can be written as

H(q,p)=w+(q)(ep1)+w(q)(ep1)=0, (B6)

where q=x1 is the concentration of cells of type 1, p=S(q), w+(q)=T21(q)/N, and w(q)=T12(q)/N (reaction rates as described in Equation 1). This equation can be solved to obtain the activation trajectory pa(q), and hence S(q). We can now substitute pa(q) into the equation consisting of next-leading-order terms [O(N1)] of Equation B2 to find S1(q). Following this procedure we find

S(q)=dqln[w+(q)w(q)],S1(q)=12ln[w+(q)w(q)]. (B7)

The QSD is now determined up to a normalization factor. The QSD is peaked about the fixed point located at q*=x1*=1u2r1/(r1r2); see Appendix A. Hence we can expand the QSD (7) about this fixed point such that

ψ(q)Cexp{N[S(q*)+12(qq*)2S(q*)]S1(q*)}, (B8)

where we have used S(q*)=pa(q*)=0. Normalizing to unity then determines the normalization coefficient,

C=S(q*)2πNeNS(q*)+S1(q*). (B9)

The QSD determined above breaks down when n1N1/2, or equivalently when qN1/2, i.e., close to the absorbing state. In this region we consider a recursive solution of Equation 5 that does not rely on a specific form for the QSD; i.e., we do not use the WKB ansatz (7). We expand Equation 5 about n1=0 to obtain

0=v1wv(0)[(n1v)πn1vn1πn1]. (B10)

This is to be solved for πn1 (1<n1N1/2). Using w(0)=1, we can write this as

fn1+1=[1+w+(0)]fn1w+(0)fn11, (B11)

where fn1=n1πn1. This recursive system can be solved to arrive at

πn1=π1{1[w+(0)]n1}n1[1w+(0)]π1[w+(0)]n1n1[w+(0)1], (B12)

where the second step follows from w+(0)>1. Using Equation 6 and expanding the relevant transition rate, Tnabsvv=T(0,1,N1)12, about n1=0 we can write π11/τ. By matching the recursively obtained boundary-layer solution of Equation B12 with the WKB solution in Equation B8 at 0<qN1/2, we obtain an expression for the fixation time τ, as shown in Equation 8.

Appendix C

Numerical Solutions for Fixation Time

We now address the case in which there is an internal stable fixed point of the deterministic dynamics. The problem then retains 2 degrees of freedom. We follow the initial steps of Appendix B to arrive at the Hamilton–Jacobi Equation B3. Given that the original system is two dimensional, we now find four variables for the Hamilton–Jacobi problem, two position variables q1 and q2 (equivalent to x1 and x2), and two corresponding momenta p1 and p2. These are defined by p=qS(q). As the “energy” is fixed (H=0) we have three effective degrees of freedom and no obvious solution to H(q,p)=0. We consider again Hamilton’s equations (B4). These equations describe the trajectory that minimizes the action, and hence by solving these we can then determine the fixation time. To determine the boundary conditions we need to find the fixed points of Equations (B4). We first note that there are three zero-momentum fixed points, which correspond to the fixed points of the deterministic equations (3). Following Gottesman and Meerson (2012), we label these as M1 for the absorbing state [q=(0,0,1)], M2 for the 1–2 boundary fixed point, and M3 for the stable interior fixed point defined by Equation A3. As we seek to determine the activation trajectory, we need to find fixed points of Equations B4 with nonzero momenta, but with positions corresponding to M1 and M2 (the possible end points of the trajectories). These so-called fluctuational fixed points are labeled as F1 and F2.

The relevant trajectory is then found using an iterative method to solve the two-boundary problem. Consider the scenario in which the stable interior fixed point M3 is the only fixed point of the deterministic system for x1 and x2, other than the absorbing state, i.e. for parameters in region III of Figure 2. Here the activation trajectory that leads to fixation starts at M3 and finishes at F1. To start the iteration, we fix the momenta for all times to the values at F1, and then numerically integrate the equations of motion (B4) for the position vector q forward in time, starting at M3 and keeping the momenta constant. This integration is carried out for a sufficient range of time to reach the vicinity of the fixed point F1, but not too long to avoid numerical errors building up. In the next step the relations for the momenta in Equation B4 are then integrated backward in time using the trajectory q(t) found in the previous iteration. The momenta at the start of this backward integration are chosen as those corresponding to F1. This procedure is then iterated, with alternating forward and backward integration of Hamilton’s equation. At each step of the procedure the action of the path is found as

S31=0tdtp(t)q(t). (C1)

The iteration of alternating forward and backward integration is then repeated until S31 has reached convergence. The action can then be used in Equation 9 to determine the fixation time τ31.

In region II of parameter space a similar procedure is applied. In this case the minimizing trajectory in (p,q)-space which connects M3 and F2.

Footnotes

Communicating editor: L. M. Wahl

Literature Cited

  1. Altland A., Fischer A., Krug J., Szendro I. G., 2011.  Rare events in population genetics: stochastic tunneling in a two-locus model with recombination. Phys. Rev. Lett. 106: 088101. [DOI] [PubMed] [Google Scholar]
  2. Antal T., Scheuring I., 2006.  Fixation of strategies for an evolutionary game in finite populations. Bull. Math. Biol. 68: 1923–1944. [DOI] [PubMed] [Google Scholar]
  3. Armitage P., Doll R., 1954.  The age distribution of cancer and a multi-stage theory of carcinogenesis. Br. J. Cancer 8: 1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Assaf M., Meerson B., 2010.  Extinction of metastable stochastic populations. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 81: 021116. [DOI] [PubMed] [Google Scholar]
  5. Beerenwinkel N., Antal T., Dingli D., Traulsen A., Kinzler K. W., et al. , 2007.  Genetic progression and the waiting time to cancer. PLOS Comput. Biol. 3: e225. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Billings L., Mier-Y L.-Teran-Romero, B. Lindley, and I. B. Schwartz, 2013.  Intervention-based stochastic disease eradication. PLoS ONE 8: e70211. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Black A. J., McKane A. J., 2011.  WKB calculation of an epidemic outbreak distribution. J. Stat. Mech. 2011: P12006. [Google Scholar]
  8. Black A. J., Traulsen A., Galla T., 2012.  Mixing times in evolutionary game dynamics. Phys. Rev. Lett. 109: 028101. [DOI] [PubMed] [Google Scholar]
  9. Bozic I., Antal T., Ohtsuki H., Carter H., Kim D., et al. , 2010.  Accumulation of driver and passenger mutations during tumor progression. Proc. Natl. Acad. Sci. USA 107: 18545–18550. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Crow J. F., Kimura M., 1970.  An Introduction to Population Genetics Theory. Harper & Row, New York. [Google Scholar]
  11. Dykman M. I., Schwartz I. B., Landsman A. S., 2008.  Disease extinction in the presence of random vaccination. Phys. Rev. Lett. 101: 078101. [DOI] [PubMed] [Google Scholar]
  12. Ewens W. J., 2004.  Mathematical Population Genetics. I. Theoretical Introduction, 2nd Ed Springer-Verlag, New York. [Google Scholar]
  13. Fisher J. C., 1958.  Multiple-mutation theory of carcinogenesis. Nature 181: 651–652. [DOI] [PubMed] [Google Scholar]
  14. Fisher R. A., 1930.  The Genetical Theory of Natural Selection. Clarendon Press, Oxford. [Google Scholar]
  15. Gatenby R. A., Vincent T. L., 2003.  An evolutionary model of carcinogenesis. Cancer Res. 63: 6212–6220. [PubMed] [Google Scholar]
  16. Gillespie D. T., 1977.  Exact stochastic simulation of coupled chemical reactions. J. Phys. Chem. 81: 2340–2361. [Google Scholar]
  17. Gokhale C. S., Iwasa Y., Nowak M. A., Traulsen A., 2009.  The pace of evolution across fitness valleys. J. Theor. Biol. 259: 613–620. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Gottesman O., Meerson B., 2012.  Multiple extinction routes in stochastic population models. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 85: 021140. [DOI] [PubMed] [Google Scholar]
  19. Haeno H., Levine R. L., Gilliland D. G., Michor F., 2009.  A progenitor cell origin of myeloid malignancies. Proc. Natl. Acad. Sci. USA 106: 16616–16621. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Haeno H., Maruvka Y. E., Iwasa Y., Michor F., 2013.  Stochastic tunneling of two mutations in a population of cancer cells. PLoS ONE 8: e65724. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Iwasa Y., Michor F., Nowak M. A., 2004.  Stochastic tunnels in evolutionary dynamics. Genetics 166: 1571–1579. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Iwasa Y., Michor F., Komarova N. L., Nowak M. A., 2005.  Population genetics of tumor suppressor genes. J. Theor. Biol. 233: 15–23. [DOI] [PubMed] [Google Scholar]
  23. Kamenev A., Meerson B., 2008.  Extinction of an infectious disease: a large fluctuation in a nonequilibrium system. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 77: 061107. [DOI] [PubMed] [Google Scholar]
  24. Knudson A. G., 1971.  Mutation and cancer: statistical study of retinoblastoma. Proc. Natl. Acad. Sci. USA 68: 820–823. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Komarova N. L., Sengupta A., Nowak M. A., 2003.  Mutation–selection networks of cancer initiation: tumor suppressor genes and chromosomal instability. J. Theor. Biol. 223: 433–450. [DOI] [PubMed] [Google Scholar]
  26. Kunkel T. A., Bebenek K., 2000.  DNA replication fidelity. Annu. Rev. Biochem. 69: 497–529. [DOI] [PubMed] [Google Scholar]
  27. Lohmar I., Meerson B., 2011.  Switching between phenotypes and population extinction. Phys. Rev. E Stat. Nonlin. Soft Matter Phys. 84: 051901. [DOI] [PubMed] [Google Scholar]
  28. Lynch M., 2010.  Scaling expectations for the time to establishment of complex adaptations. Proc. Natl. Acad. Sci. USA 107: 16577–16582. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Ma J., Ratan A., Raney B. J., Suh B. B., Miller W., et al. , 2008.  The infinite sites model of genome evolution. Proc. Natl. Acad. Sci. USA 105: 14254–14261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Metzger J. J., Eule S., 2013.  Distribution of the Fittest Individuals and the Rate of Muller’s ratchet in a model with overlapping generations. PLOS Comput. Biol. 9: e1003303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Michor F., Iwasa Y., Nowak M. A., 2004.  Dynamics of cancer progression. Nat. Rev. Cancer 4: 197–205. [DOI] [PubMed] [Google Scholar]
  32. Michor F., Iwasa Y., Vogelstein B., Lengauer C., Nowak M. A., 2005.  Can chromosomal instability initiate tumorigenesis? Semin. Cancer Biol. 15: 43–49. [DOI] [PubMed] [Google Scholar]
  33. Mobilia M., 2010.  Oscillatory dynamics in rock–paper–scissors games with mutations. J. Theor. Biol. 264: 1–10. [DOI] [PubMed] [Google Scholar]
  34. Moolgavkar S. H., 1978.  The multistage theory of carcinogenesis and the age distribution of cancer in man. J. Natl. Cancer Inst. 61: 49–52. [DOI] [PubMed] [Google Scholar]
  35. Moolgavkar S. H., Luebeck E. G., 1992.  Multistage carcinogenesis: population-based model for colon cancer. J. Natl. Cancer Inst. 84: 610–618. [DOI] [PubMed] [Google Scholar]
  36. Moran P. A. P., 1962.  The Statistical Processes of Evolutionary Theory. Clarendon Press, Oxford. [Google Scholar]
  37. Muller H. J., 1964.  The relation of recombination to mutational advance. Mutat. Res. 1: 2–9. [DOI] [PubMed] [Google Scholar]
  38. Nordling C. O., 1953.  A new theory on the cancer-inducing mechanism. Br. J. Cancer 7: 68–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Nowak M. A., Michor F., Komarova N. L., Iwasa Y., 2004.  Evolutionary dynamics of tumor suppressor gene inactivation. Proc. Natl. Acad. Sci. USA 101: 10635–10638. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Nunney L., 1999.  Lineage selection and the evolution of multistage carcinogenesis. Proc. Biol. Sci. 266: 493–498. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Proulx S. R., 2011.  The rate of multi-step evolution in Moran and Wright–Fisher populations. Theor. Popul. Biol. 80: 197–207. [DOI] [PubMed] [Google Scholar]
  42. Proulx S. R., 2012.  Multiple routes to subfunctionalization and gene duplicate specialization. Genetics 190: 737–751. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. van Herwaarden O. A., Grasman J., 1995.  Stochastic epidemics: major outbreaks and the duration of the endemic period. J. Math. Biol. 33: 581–601. [DOI] [PubMed] [Google Scholar]
  44. van Kampen N. G., 2007.  Stochastic Processes in Physics and Chemistry, 3rd Ed Elsevier, Amsterdam. [Google Scholar]
  45. Weinberg R. A., 2013.  The Biology of Cancer, 2nd Ed Garland Science, New York. [Google Scholar]
  46. Weinreich D. M., Chao L., 2005.  Rapid evolutionary escape by large populations from local fitness peaks is likely in nature. Evolution 59: 1175–1182. [PubMed] [Google Scholar]
  47. Weissman D. B., Desai M. M., Fisher D. S., Feldman M. W., 2009.  The rate at which asexual populations cross fitness valleys. Theor. Popul. Biol. 75: 286–300. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Weissman D. B., Feldman M. W., Fisher D. S., 2010.  The rate of fitness-valley crossing in sexual populations. Genetics 186: 1389–1410. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Wright S., 1931.  Evolution in Mendelian populations. Genetics 16: 97. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Genetics are provided here courtesy of Oxford University Press

RESOURCES