Abstract
We describe a cell-molecular based evolutionary mathematical model of tumor development driven by a stochastic Moran birth-death process. The cells in the tumor carry molecular information in the form of a numerical genome which we represent as a four-digit binary string used to differentiate cells into 16 molecular types. The binary string is able to undergo stochastic point mutations that are passed to a daughter cell after each birth event. The value of the binary string determines the cell fitness, with lower fit cells (e.g. 0000) defined as healthy phenotypes, and higher fit cells (e.g. 1111) defined as malignant phenotypes. At each step of the birth-death process, the two phenotypic sub-populations compete in a prisoner’s dilemma evolutionary game with the healthy cells playing the role of cooperators, and the cancer cells playing the role of defectors. Fitness, birth-death rates of the cell populations, and overall tumor fitness are defined via the prisoner’s dilemma payoff matrix. Mutation parameters include passenger mutations (mutations conferring no fitness advantage) and driver mutations (mutations which increase cell fitness). The model is used to explore key emergent features associated with tumor development, including tumor growth rates as it relates to intratumor molecular heterogeneity. The tumor growth equation states that the growth rate is proportional to the logarithm of cellular diversity/heterogeneity. The Shannon entropy from information theory is used as a quantitative measure of heterogeneity and tumor complexity based on the distribution of the 4-digit binary sequences produced by the cell population. To track the development of heterogeneity from an initial population of healthy cells (0000), we use dynamic phylogenetic trees which show clonal and sub-clonal expansions of cancer cell sub-populations from an initial malignant cell. We show tumor growth rates are not constant throughout tumor development, and are generally much higher in the subclinical range than in later stages of development, which leads to a Gompertzian growth curve. We explain the early exponential growth of the tumor and the later saturation associated with the Gompertzian curve which results from our evolutionary simulations using simple statistical mechanics principles related to the degree of functional coupling of the cell states. We then compare dosing strategies at early stage development, mid-stage (clinical stage), and late stage development of the tumor. If used early during tumor development in the subclinical stage, well before the cancer cell population is selected for growth, therapy is most effective at disrupting key emergent features of tumor development.
Keywords: cancer model, evolutionary game theory, Moran process, gompertzian tumor growth, tumor heterogeneity, birth-death process
1. Introduction
At the molecular and cellular levels, cancer is an evolutionary process [1, 2, 3, 4] driven by random mutational events [5, 6, 7, 8] responsible for genetic diversification which typically arises via waves of clonal and sub-clonal expansions [9, 10], operating over an adaptive fitness landscape in which Darwinian selection favors highly proliferative cell phenotypes which in turn drive rapid tumor growth [11, 12, 13]. The tumor environment should be viewed as a complex Darwinian adaptive eco-system consisting of cell types which have evolved over many years [1]. As a result, all but the most well designed and tailored therapeutic strategies often deliver disappointing outcomes and potentially introduce a potent new source of selective pressure for the proliferation of variant cells which develop an enhanced ability to resist future therapeutic assaults [14, 15, 16, 17, 18]. The prospects for influencing and controlling such a system are likeliest at the emerging early stages of tumor development when the cell population has not yet been selected for growth and survival, and the tumor size is small. But by the time a typical tumor becomes clinically detectable (often after several years of growth), it already contains O(108) or more malignant cells (and potentially occupies a volume of 1 – 2 mm3), some of which may have entered the blood circulation [12]. Since there is very little human data available in this early subclinical stage of tumor development, computational models can serve as a useful surrogate in this critical developmental stage which clearly influences and determines many important emergent features of the tumor at later stages.
Our goals in this paper are to describe a mathematical model for stochastic cell kinetics in the beginning stages of tumor development (from a single malignant cell) that includes cell reproduction and death, mutations, evolution, and the subsequent emergence of genetic heterogeneity well documented in many soft-tissue tumors [19, 20, 21, 22, 23, 24, 25, 46, 54]. The model is a computational one, driven by a stochastic Moran (birth-death) process with a finite cell population, in which birth-death rates are functions of cell fitness. The fitness is determined by the cell’s numerical genome in the form of a four-digit binary string capable of undergoing point mutational dynamics with one digit in the string flipping values stochastically. The corresponding numerical value of the binary string determines whether the cell is healthy (low-fitness) or cancerous (high fitness). These two classes of cells compete against each other at each birth-death event, with fitness calculated according to the payoff matrix associated with the prisoner’s dilemma evolutionary zero-sum game [26, 27, 28, 29]. The healthy cells play the role of cooperators, while the cancer cells play the role of defectors [27, 29]. Our goal is to understand how the model parameters: passenger (mp) and driver mutation rates (md), selection strength (w), birth and death rates, affect tumor growth characteristics, such as tumor growth rates, fixation probabilities of malignant and healthy cell types, saturation rates of cancer cells, and the emergence of genetic heterogeneity in a tumor at later stages of development when the tumor is clinically detectable.
An important outcome of the model is that growth of the cancer cell population is directly influenced by the intratumor heterogeneity (represented as the distribution of the 4-digit binary strings throughout the cell population), with high heterogeneity driving more rapid growth. The connection between heterogeneity and growth has been discussed in the literature [30, 31, 32, 33, 34, 54]. We quantify heterogeneity in a tumor using tools from information theory [36, 37], as well as quantitative analysis of phylogenetic trees associated with clonal and sub-clonal expansions [38, 46] in the developing tumor. Because our numerical simulations are carried out from initial conditions corresponding to a homogeneous population of healthy cells (0000) all the way to a saturated population of cancer cells, we can use the model to test basic dose and scheduling strategies [39, 40] at the very early stages of tumor development in the subclinical range, well before a tumor would be clinically detectable by current technology. Our point of view is that this emerging subclinical tumor should be more amenable (and potentially vulnerable) to a well planned therapeutic assault than a more mature tumor comprised (on average) of larger numbers of cells with more aggressive proliferative capabilities (having undergone generations of selection), that are potentially in the early stages of migration to other organs. More complex features that might influence early stage dynamics, like human-immune response [32] and the tumor microenvironment [50] are not included in this model in order to keep things as simple and clear as possible.
2. Description of the model
The ingredients in our model includes a stochastic birth-death process that is the engine which drives tumor growth, with heritable mutations operating over a fitness landscape so that natural selection can play out over many cell division timescales. Genetic mutations (point mutations) are modeled using a four-digit binary string of information that each cell carries with it.1 This simple sequence divides the cells into 16 different “genotypes”, ranging from 0000 up to 1111, and this information is passed on to the daughter cell during a birth event. The birth-death replacement process is based on a fitness function defined in terms of interactions quantified by the prisoner’s dilemma payoff matrix which operates on two general classes of cells: healthy (the cooperators), and cancerous (the defectors). Natural selection acts on each generation of the cell population as the computational simulation proceeds on a cell division timescale. In this version of the model we typically simulate up to O(1011) cell divisions, so our mutation rates are chosen to be relatively high to accommodate these somewhat modest timescales. See [7] for discussions on mutation rates in cancer.
2.1. The Moran birth-death process
The stochastic engine [41] that drives tumor growth in our model is a finite cell-based Moran process consisting of a population of N cells, divided into two sub-populations consisting of i cancer cells, and N − i healthy cells. In all of our simulations, N is large enough so that there is not a significant difference between the results from our finite-cell model and the (deterministic) replicator equation approach for infinite populations, a connection that is discussed in detail in [53]. At each time-step in the simulation, one cell is chosen for reproduction and one cell is chosen for elimination. The cells are chosen randomly, based on their prevalence in the population pool which, in turn, is weighted by the fitness function based on a chosen payoff matrix. The probability of choosing a cancer cell at any given step is i/N, while the probability of choosing a healthy cell is (N − i)/N. As it unfolds, the process is a stochastic birth-death process where the total population size, N, stays constant and the number of cancer cells in the population, i, is the stochastic state variable. At any given step, the probability of transitioning from i cancer cells to j cancer cells is denoted Pij, with j = i + 1 or j = i − 1. These probabilities are determined by the birth/death rates associated with the cancer cell population, which in turn are determined by a cell population fitness function. Each cell carries with it a binary string in the form of a four digit binary sequence from 0000 up to 1111. This defines 16 different cell types, which are course-grained into two groups: healthy cells (0000 - 1010), and cancer cells (1011-1111). These two sub-populations interact at each birth-death time-step with fitness defined in terms of the prisoner’s dilemma payoff matrix. The algorithmic details are shown in the appendix Figure A.1. To set the stage for more complex simulations, Figure 2.1 shows the result of a stochastic simulation (depicting i) driven by the Moran process alone, with no mutations, and no selection. Figure 2.1 shows three different simulations, one leading to the elimination of all cancer cells via random drift (red), another fluctuates between a mixed cell population after 10,000 cell divisions (yellow), and a third leading to fixation of the cancer cell population (blue) after around 5000 cell divisions. The average of 25 stochastic simulations is also plotted (note that the average will converge to half cancer cells and half healthy cells by the law of large numbers). The mean time to fixation of the cancer cell population which starts with ‘i’ cells in this simple setting (no mutations, no selection) is given by
(2.1) |
With no mechanism for natural selection, there is no shape to the growth curves.
2.2. The prisoner’s dilemma payoff matrix
To introduce the effect of selection which will regulate cell birth and death rates, we use the prisoner’s dilemma evolutionary game in which two players compete against each other for the best payoff. Each has to decide whether to cooperate (healthy cell) or defect (cancer cell) and each receives a payoff determined from the prisoner’s dilemma payoff matrix2, A:
(2.2) |
The essence of the prisoner’s dilemma game is the two players compete against each other, and each has to decide what best strategy to adopt in order to maximize their payoff. This 2×2 matrix assigns the payoff (e.g. reward) to each player on each interaction. My options, as a strategy or, equivalently, as a cell type, are listed along the rows, with row 1 associated with my possible choice to cooperate, or equivalently my cell type being healthy, and row 2 associated with my possible choice to defect, or equivalently my cell type being cancerous. Your options are listed down the columns, with column 1 associated with your choice to cooperate (or you being a healthy cell), and column 2 associated with your choice to defect (or you being a cancer cell). The analysis of a rational player in a prisoner’s dilemma game runs as follows. I do not know what strategy you will choose, but suppose you choose to cooperate (column 1). In that case, I am better off defecting (row 2) since I receive a payoff of 5 instead of 3 (if I also cooperate). Suppose instead you choose to defect (column 2). In that case, I am also better off defecting (row 2) since I receive a payoff of 1 instead of 0 (if I were to have cooperated). Therefore, no matter what you choose, I am better off (from a pure payoff point of view) if I defect. What makes this game such a useful paradigm for strategic interactions ranging from economics, political science, biology [55], and even psychology [27] is the following additional observation. You will analyze the game in exactly the same way I did (just switch the roles of me and you in the previous rational analysis), so you will also decide to defect no matter what I do. The upshot if we both defect is that we will each receive a payoff of 1, instead of each receiving a payoff of 3 if we had both chosen to cooperate. The defect-defect combination is a Nash equilibrium [26], and yet it is sub-optimal for both players and for the system as a whole. Rational thought rules out the cooperate-cooperate combination which would be better for each player (3 points each) and for both players combined (6 points). In fact, the Nash equilibrium strategy of defect-defect is the worst possible system wide choice, yielding a total payoff of 2 points, compared to the cooperate-defect or defect-cooperate combination, which yields a total payoff of 5 points, or the best system-wide strategy of cooperate-cooperate yielding a total payoff of 6 points.
The game becomes even more interesting if it is played repeatedly [26, 27, 28, 29], with each player allowed to decide what strategy to use on each interaction so as to accumulate a higher payoff than the competition over a sequence of N games. In order to analyze this kind of an evolving set-up, a fitness function must be introduced based on the payoff matrix A. Let us now switch our terminology so that the relevance to tumor cell kinetics becomes clear. In this case, we randomly select pairs of cells out of the total population at each step, and subject them to a birth-death process, basing our birth rates and death rates on the prisoner’s dilemma payoff matrix. Thus, in our context, it is not the strategies that evolve, as cells cannot change type based on strategy (only based on mutations), but the prevalence of each cell type in the population is evolving, with the winner identified as the sub-type that first reaches fixation in the population. As the populations evolve, the fitness of the two competing sub-populations can be tracked, as well as the overall fitness of the combined total population of cells.
2.3. The fitness landscape
Let us start by laying out the various probabilities of pairs of cells interacting and clearly defining payoffs when there are i cancer cells, and N − i healthy cells in the population. The probability that a healthy cell interacts with another healthy cell is given by (N − i − 1)/(N − 1), whereas the probability that a healthy cell interacts with a cancer cell is i/(N − 1). The probability that a cancer cell interacts with a healthy cell is (N − i)/(N − 1), whereas the probability that a cancer cell interacts with another cancer cell is (i − 1)/(N − 1). The payoffs associated with the healthy cells and cancer cells, obtained by weighting the payoff matrix values with appropriate probabilities, are given by (following notation in [53]):
(2.3) |
(2.4) |
This gives rise to the average payoff associated with the population of cells:
(2.5) |
Based on these formulas, we define the fitness of the healthy cells as:
(2.6) |
and the fitness of the cancer cells as:
(2.7) |
Here, (WH, WC) are ‘selection strength’ parameters, 0 ≤ WH ≤ 1, 0 ≤ WC ≤ 1, that measure the strength of selection pressure on each of the population of cells. If WH = 0, there is no natural selection acting on the healthy cell population and the dynamics is driven purely by the Moran process. When WH = 1, the selection pressure on the healthy cell population is strongest and the prisoner’s dilemma payoff matrix has maximum effect. Likewise for the parameter WC and how it controls selection pressure in the cancer cell population. Since therapy imposes selection pressure on different sub-populations of cells, WH and WC are the two parameters we alter to administer simulated therapeutic responses. We discuss this in section §3.5.
The expected fitness of each of the sub-populations are:
(2.8) |
(2.9) |
with total expected fitness:
(2.10) |
From these formulas, we can define the transition probability of going from i to i + 1 cancer cells on a given step:
(2.11) |
The first term represents that probability that a cancer cell is selected for reproduction (weighted by fitness), and a healthy cell is selected for death. Likewise, the transition probability of going from i to i − 1 cancer cells on a given step is:
(2.12) |
Here, the first term is the probability healthy cell is selected for reproduction (weighted by fitness), and a cancer cell is selected for death. The remaining transition probabilities are as follows:
(2.13) |
It is these simple formulas that drive the subsequent dynamics of the competing populations of cells and determine the emergent features of the forming tumor (cancer cell population). A typical set of simulations of the evolving fitness of the healthy cell population, ϕH, the cancer cell population ϕC, and the total fitness, ϕ, is shown in Figure 2.2 as the selection parameter varies from 0 to 1 (wH = wC ≡ w). As the population evolves, the fitness of the healthy cell population decreases, the fitness of the cancer cell population increases (sometimes reaching a maximum point), while the total population fitness decreases.
2.4. Passenger and driver mutations
Two remaining parameters in our model are the passenger mutation rate, mp and the driver mutation rate, md [7]. Passenger mutations confer no fitness advantage, hence mp controls point mutations that act on the digit strings that define the 11 levels of healthy cells 0000-1010, and the 5 levels of cancer cells 1011-1111. A mutation diagram is shown in Figure 2.3 depicting all of the possible point mutation transitions at each step. Mutations that stay within either of those two ranges do not alter the cell fitness. On the other hand, the driver mutation parameter controls mutations that take a binary string from a healthy cell and, via a point mutation, alter it so that the string becomes a cancer cell3. A simple example would be a mutation that alters 1010 (healthy) to 1011 (cancer) by stochastically flipping the first digit from 0 to 1. The interested reader can consult the flow diagram in Figure A.1 of the Appendix for more details of the algorithm. The full code is available from the authors upon request.
3. Results
Gompertzian growth arising from multicellular systems occurs in many settings with different physical and biological constraints acting in concert. Hence it appears as if this universal growth curve does not depend on specific physical mechanisms (e.g. oxygen diffusion, blood supply, tumor microenvironment, etc.) but more on multi-cellularity and the ability for populations of cells to assume a heterogeneous distribution of functional states, as was described most clearly in Kendal’s 1985 paper [30] and documented clinically in breast [40] and other tumor types. Alternative bio-mechanistic models of tumor growth at the cellular level have been developed (see [47, 48, 49, 50, 51, 52]) although do not generally include molecular information or evolutionary effects. Features of the Gompertzian growth curve defined by eqns (3.1), (3.2) allow us to clearly describe three distinct growth regimes, the earliest being subclinical and the most critical regime in which to influence future tumor kinetics, the second being the clinical regime where growth measurements are typically obtained [12], and the third being the lethal burden phase where growth saturates. The growth equation, (3.9), relates tumor heterogeneity to growth rates, and we quantify heterogeneity via the Shannon entropy [36, 37] of the cellular population. One of the main features of our evolutionary simulations is to show how it (i) leads to Gompertzian growth, (ii) how growth is driven by heterogeneity quantitated via Shannon entropy, (iii) how the initiation of heterogeneity and fitness can be tracked via dynamic phylogenetic trees, and (iv) how tumor kinetics can be influenced via therapeutic strategies that target heterogeneity best in earlier growth regimes. In keeping consistent with the notation of the Gompeterzian growth curve, we now represent the tumor growth as the proportion of cancer cells in the population, nG(t).
3.1. Gompertzian tumor growth and three growth regimes
The basic (top-down) equations giving rise to pure Gompertzian growth [43, 44, 45] are the coupled equations:
(3.1) |
(3.2) |
Here, is the proportion of growing cancer cells in the mixed population, which grows exponentially according to (3.1), but with a time-dependent growth rate which is exponentially decaying according to (3.2). It is straightforward to integrate (3.1) to obtain:
(3.3) |
Then, (3.2) is solved with:
(3.4) |
Plugging (3.4) into (3.3) and integrating yields the Gompertzian curve:
(3.5) |
where in the long-time limit, the population saturates to the value
(3.6) |
which we normalize to one (without loss of generality). The key features of Gompertzian growth are shown in Figure 3.1. As the cancer cell proportion nG(t) increases (Figure 3.1a), there are three distinct growth regimes defined by the inflection point on the nG growth curve , and the two inflection points on the growth-velocity curve . As shown in Figure 3.1(a), there are three points that divide the growth curve into four distinct regions. For convenience, and symmetry, we lump the second and third regions together and define three basic growth regimes:
Regime 1 (Subclinical): Increasing velocity , increasing acceleration d2nG/dt2. Cell population and tumor volume grows at an exponential rate;
Regime 2 (Clinical): In this regime, reaches its maximum value. In the early part of the regime, is increasing while d2nG/dt2 decreases. In the later part of the regime, is decreasing and d2nG/dt2 becomes negative (deceleration). Growth rates are clinically typically measured as linear;
Regime 3 (Saturation/Lethal): Decreasing tumor velocity with decreasing deceleration. Growth rate rapidly slows towards full saturation of the cancer cell population.
Regime 1, generally speaking, is the subclinical growth regime where the developing tumor has substantially fewer than 108 malignant cells with a tumor size smaller than 1 or 2 mm3. Typically, the clinically measurable regime is Regime 2, while the lethal stage when the tumor saturates is associated with Regime 3. In reality, the boundaries of these regimes are, of course, not sharp and depend on tumor type and location which influence detectability. But the clarity of the pure Gompertzian curve gives a useful framework which delineates the three distinct growth regimes based on clear principles associated with growth, velocity, and acceleration. The growth rate curve is shown in Figure 3.1c, with its derivatives shown in Figure 3.1d. It is most instructive to show the average growth rates defined in each of the three regimes, also shown in the Figure 3.1c. The average growth rate in the time interval from t1 to t2 is defined as:
(3.7) |
The subclinical regime 1 has the highest average growth, whereas regime 2, where tumor growth is typically measured, average growth rates are lower, followed by the lowest average growth in the clinically lethal regime 3. This implies that clinically measured growth rates typically underestimate growth rates that preceded it in the subclinical stage. It also implies that linear extrapolation back from clinically measured growth rates to estimate tumor initiation times (see [11, 12, 13]) will systematically overestimate the amount of time the tumor has been developing before being measured. While this might generally be seen as good news (since the cancer initiation event was more recent than estimated via linear extrapolation), it also gives the clinician a shorter window of time in which to act.
3.2. Heterogeneity and growth via statistical mechanics
Kendal [30] lays out a clear argument of how this growth curve arises from a purely statistical mechanics point of view. In a nutshell, his argument can be explained by considering a population of n cells, let the jth cell (j = 1, 2, 3,…,n) have the potential to assume one of qj possible states. The number of combinations of states possible within the population, P, can be thought of as a measure of intra-neoplastic diversity:
(3.8) |
and is related to the growth rate of a tumor via the equation:
(3.9) |
where n(t) is the number of cells capable of proliferation at a given time t and α is a parameter that sets the timescale of growth4. There are two basic cases to consider. First, suppose the cells have no interaction at all, say in the earliest stages of tumor development, and let each of the n cells have the ability to assume one of m possible states. Then, P = mn, and the growth equation becomes
(3.10) |
The solution to this equation is the exponentially growing population:
(3.11) |
Thus, early stage development is characterized by exponential growth (regime 1), with a growth rate proportional to the log of the number of assumable states of the cells comprising the tumor population. This stage is characterized by the Gompertzian curve shown in Figure 3.1a to the left of the first blue dot, in regime 1. Contrast this with later stages of tumor growth, when the sub-populations of cells communicate and influence each other’s growth characteristics, either via competition, or cooperation (regime 3) within the tumor microenvironment. In effect, this will constrain (reduce) the number of assumable states of each cell, since the population is effectively coupled. In the extreme, suppose P = mn/nn. In other words, suppose P is now inversely related to the total number of possible intercellular interactions. Inserting this into (3.9) yields
(3.12) |
The solution to this equation is exactly the Gompertzian growth curve (3.5) and accounts for regimes 2 and 3 previously discussed in which tumor growth slows down. The growth equation (3.9) which relates cancer cell population growth to tumor heterogeneity is capable of producing a family of growth curves, depending on details of intercellular coupling, which of course is influenced by details of the biological and physical constraints influencing the tumor microenvironment. Thus, the growth equation (3.9) has the ability to produce different detailed shapes based on assumptions associated with intercellular coupling. Table 3.1 shows the average growth rates in the three regimes as a function of the key parameters in the model.
Table 3.1.
md | mp | tEmax | tSAT | nd | np | γave,1 | γave,2 | γave,3 |
---|---|---|---|---|---|---|---|---|
0.4 | 0.1 | 5.50e+5 | 1.830e+6 | 1.289e+4 | 4.68e+4 | 3.14e−5 | 3.68e−6 | 1.448e−7 |
0.3 | 0.2 | 4.88e+5 | 1.753e+6 | 1.682e+4 | 8.26e+4 | 4.04e−5 | 4.31e−6 | 1.677e−7 |
0.2 | 0.3 | 4.85e+5 | 1.761e+6 | 1.715e+4 | 1.230e+4 | 3.86e−5 | 4.41e−6 | 1.729e−7 |
0.1 | 0.4 | 5.40e+5 | 1.426e+6 | 1.362e+4 | 1.836e+4 | 3.04e−5 | 3.81e−6 | 1.658e−7 |
3.3. Quantitative measures of tumor heterogeneity and growth
For our purposes, we measure heterogeneity using the Shannon entropy from information theory [36]:
(3.13) |
(here, log is defined as base 2). The probability pi measures the proportion of cells of type i, with i = 1,…, 16 representing the distribution of binary strings ranging from 0000 to 1111. We then course-grain this distribution further so that cells having strings ranging from 0000 up to 1010 are called “healthy”, while those ranging from 1011 to 1111 are “cancerous”5. The growth equation (3.9) then becomes
(3.14) |
It follows from (3.14) that the cancer cell proportion nE(t) can be written in terms of entropy as:
(3.15) |
The panel in Figure 3.2 shows the results from our cell-based simulations. Figure 3.2a shows the Gompertzian curve associated with the proportion of cancer cells in the population, while Figure 3.2b shows the velocity and accelerations associated with growth, and can be compared with Figure 3.1b. In Figure 3.2c we show the entropy during a typical simulation, marking the maximum entropy point which peaks relatively early in the simulation before the entropy returns back down to zero, reflecting the fact that cancer cells have reached fixation and have saturated the population. Figure 3.2d shows the fitness of the cancer cell sub-population, healthy cell sub-population, and the overall tumor fitness (wH = wC ≡ w = 0.5). As a typical simulation proceeds, the cancer cell sub-population fitness increases, the healthy cell sub-population fitness decreases, while the overall tumor fitness decreases. Figure 3.2e, 3.2f shows the Gompertzian growth curves as the selection pressure increases (Figure 3.2e) and as the mutation rate increases (Figure 3.2f). High values for either of these parameters leads to a very steep growth curve, as is expected.
Figure 3.3 shows the growth curves linearly extrapolated back to give a prediction of when the first driver mutation occurred that initiated tumor growth. The growth rates from regime 2 (linear regime) are used to extrapolate back to the initiation event. Since the actual growth rate in regime 1 is much faster than linear, the linear extrapolation extends the event too far back in time as compared to when the event actually occurred. The inset of Figure 3.3 shows histograms of the average growth rates in each of the three regimes as a function of the mutation rate m (here, we take mp = md = m).
A typical stochastic simulation showing the evolution of all 16 possible cell types is shown in Figure 3.4. We also show E(t), where entropy is computed using the most extreme course-grained two-state system comprised of the two sub-populations of healthy cells and cancer cells. We compare in Figure 3.4 the Gompertzian growth curve (eqn. (3.5)) and the corresponding curve obtained from eqn. (3.15) to the stochastic simulation and the agreement is excellent. Likewise, we also show a comparison of dn/dt with eqn. (3.14) and eqn. (3.1) with E(t) normalized so that limiting values match the stochastic simulation, and the agreement is also excellent. In the beginning, entropy is zero, since the population consists purely of healthy cells, and in the end of the simulation, entropy is again zero as the population consists purely of cancer cells. Entropy peaks somewhere early in the simulation when the mixture of cell types is equally distributed over cancer and healthy types. It is this intermediate but important heterogeneously distributed state that is the key driver of growth, as is clear from eqn. (3.14).
3.4. Dynamic phylogenetic trees and evolution of fitness
To track the initiation of cellular heterogeneity from an initially homogeneous state, we follow all of the mutations that take place during the course of a simulation, and organize this in the form of a phylogenetic tree in Figure 3.5 showing the typical size of the genotypic space and the evolution of the genotypic landscape. As the simulation proceeds, the phylogenetic tree dynamically branches out into an increasingly complex structure, with fitness characteristics color coded in Figure 3.5a. We also show the bins associated with each of the 16 cell types, the number of cancer cells i(t), and the entropy associated with the sub-population of cell types as a simulation proceeds, in Figure 3.5b. Knowing exactly the types of cells comprising the tumor at any given time allows us to target cell distributions for simulated therapies to test different strategies, which we describe next.
3.5. A comparison of early vs. late therapy
In Figure 4.1 we show the results from asking the simple question of how early therapy (administered in regime 1) compares with therapy in the middle stages of tumor development (regime 2), or in the later stages of development (regime 3). Eqns (2.11), (2.12) are the governing equations controlling birth/death rates of the cancer cell, healthy cell sub-populations as natural selection plays out. Since the proliferation of cancer cells can be thought of as an imbalance of selection pressures on the competing sub-populations in favor of the cancer sub-population, the goal of any therapeutic intervention is to alter this complex imbalance in favor of the healthy cell sub-population. We implement this by adjusting the selection pressure parameters (wH, wC) in the formulas (2.6), (2.7). In particular, when therapy is ‘on’, we choose wC = 0, and wH = 1, tilting the selection pressure in favor of the healthy cell sub-population. When therapy is ‘off’, the two parameters return to their original baseline values, which here we take as wH = 0.1, wC = 0.1. Figure 4.1 depicts the proportion of cancer cells in the population both in the absence of therapy, and when therapy is administered. As a comparative tool, in each case, we administer the therapy until a fixed number of cancer cells remains (in each case, we take this threshold number to be 25 cancer cells), and we compare the amount of time, Δt, it takes to achieve this low level. The figure clearly shows Δt1 < Δt2 < Δt3 < Δt4, while if therapy is administered too late, as in Δt5, the low threshold is never achieved. The simulations show that a shorter therapeutic time-period is needed if administered earlier to gain the same level of success. The topic of how best to optimize computational therapies is complex, and these simulations are only meant as a confirmation and quantification of how early stage therapy is more effective than late stage therapy.
4. Discussion
To summarize the main points forming the framework of our model:
A tumor is a complex Darwinian ecosystem of competing cells operating on an adaptive fitness landscape driven by mutational dynamics and shaped by evolutionary pressures;
The basic competitors in an evolutionary game theory model of tumor development are cell populations with a broad distribution of fitness characteristics course grained into two types: healthy cells (cooperators) and cancer cells (defectors). Each of these cell sub-populations attempts to maximize its own fitness;
Cell fitness is associated with reproductive prowess and in this respect, healthy cells are less fit than cancer cells;
Primary tumors initiate from a single malignant cell that has undergone the appropriate mutational steps and subsequently undergoes clonal and sub-clonal expansion. Polyclonality and heterogeneity are thus seen as emergent features of tumor development;
Parameters and distributions measured in the detectable range of tumor growth, such as tumor growth rates and fixation probabilities, are emergent features that have developed from a monoclonal state via cell kinetics and evolutionary development taking place in the subclinical regime;
Tumor growth is driven by molecular heterogeneity of the cell population comprising the tumor and reflected in the growth equation (3.9);
Tumor cell populations are more amenable to therapeutic strategies in the early stages of development, before selection for growth and survival have shaped the environment.
We believe the simple evolutionary model described in this paper, driven by a Moran process and shaped by heritable mutations with a fitness landscape based on the prisoner’s dilemma evolutionary game is useful in helping to understand early stage tumor growth and how it is influenced by the interplay of a few select small number of key parameters. When a malignant tumor cell population has already exceeded O(108 – 1010) cells, some of which may have entered the circulation or lymphatics and migrated to other sites, the opportunity to control or even shape future events may be limited. Attacking tumor heterogeneity as soon as it develops seems to be a useful strategy, particularly if heterogeneity is the driver of growth, as in eqn. (3.9). Whether these concepts can be developed in the more general context when cell dissemination to other sites is included in the model, and then translated into actionable clinical strategies is a challenge for the future.
Box 1.
mutation rate m (where m = mp + md), selection pressure w and the initial state vector x containing N total cells are the inputs for a simulation. Box 2: the prisoner’s dilemma game (a = 3; b = 0; c = 5; d = 1) is used to calculate the fitness of each healthy and cancer cell type, which is a function of the payoff values and the state vector, x. Box 3, 4: a single cell is chosen for death according to the relative proportion of the cell type in the cell population. Simultaneously, a single cell is selected for birth according to the relative proportion, weighted by cell fitness. Box 5: During the replication process, the daughter cell inherits a replica of the parent cell’s genetic string, with errors occurring at a rate of m. A single bit of the daughter cell’s genetic string may flip during each cell division. The possible mutations can be thought of as a single step random walk on the Markov diagram shown in Figure 2.3.
Acknowledgments
The project was supported in part by Award Number U54CA143906 from the National Cancer Institute. The content is solely the responsibility of the authors and does not necessarily represent the official view of the National Cancer Institute or the National Institutes of Health. PKN would like to thank Profs. Peter Kuhn, Jim Hicks, and Dr. Jorge Nieva for helpful comments on the manuscript.
Appendix A
Footnotes
To be clear, the four digit sequence is not meant as a bare-bones representation of the full human genome, but as a simple representation of the relevant differences in genetic information contained in different cells, allowing us to course-grain the cells into 16 different categories based on their genetic/epigenetic profiles.
What defines a prisoner’s dilemma matrix are the inequalities c > a > d > b. The chosen values in (2.2) are relatively standard, but not unique. More discussion of why the prisoner’s dilemma matrix, which models the evolution of defection, is a useful paradigm for cancer can be found in [55] and some of the references therin.
In our simulations, we assume that driver mutations cannot revert to passenger mutations, i.e. once a cancer cell is born, it stays in that category. We do not know of any evidence in the literature that shows the reversion of a cancer cell to a healthy cell, nor is this particularly a focus of this manuscript.
Kendal’s formulation [30] assumes a cell population made up of three sub-groups: (1) proliferative cells; (2) nonproliferative and nonclonogentic cells; (3) nonproliferative but clonogenic cells, with an assumption that the neoplasm’s growth rate is influenced by the proportion of proliferating to nonproliferating cells and an expression of each clone’s growth potential. The log is chosen based on the fact that heterogeneity is measured as the multiplicative combination of achievable states in the tumor, and the requirement that G(P1 ·P2) = G(P1)+G(P2) for any two sub-populations P1, P2 and growth function G. The discussion of the relationship between tumor heterogeneity and growth is an ongoing topic in the current literature [19, 20, 23, 25, 46].
Our results are relatively insensitive to where we draw the dividing line between healthy and cancerous.
References
- 1.Merlo LM, Pepper JW, Reid BJ, Maley CC. Cancer as an evolutionary and ecological process. Nat Rev Cancer. 2006 Dec;6(12):924–35. doi: 10.1038/nrc2013. [DOI] [PubMed] [Google Scholar]
- 2.Attolini CS-O, Michor F. Evolutionary theory of cancer. Ann NY Acad Sci. 2009;1168:23–51. doi: 10.1111/j.1749-6632.2009.04880.x. [DOI] [PubMed] [Google Scholar]
- 3.Frank SA. (Princeton Series in Evolutionary Biology).Dynamics of Cancer: Incidence, Inheritance, and Evolution. 2007 [PubMed] [Google Scholar]
- 4.Greaves M. Cancer: The Evolutionary Legacy. Oxford University Press; 2000. [Google Scholar]
- 5.Nordling C. Br J Cancer. 1953;7(1):68–72. doi: 10.1038/bjc.1953.8. A new theory of cancer-inducing mechanism. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Knudson A. Mutation and cancer: statistical study of retinoblastoma. PNAS USA. 1971;68(4):820–823. doi: 10.1073/pnas.68.4.820. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Tomlinson IPM, Novelli MR, Bodmer WF. The mutation rate and cancer. PNAS USA. 1996 Dec;93:14800–14803. doi: 10.1073/pnas.93.25.14800. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Vogelstein B, Kinzler KW. The Genetic Basis of Human Cancer. 2nd. McGraw-Hill; 2002. [Google Scholar]
- 9.Greaves M, Maley CC. Clonal evolution in cancer. Nature. 2012;481(7381):306–313. doi: 10.1038/nature10762. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Nowell PC. The clonal evolution of tumor cell populations. Science. 1976;194:23–28. doi: 10.1126/science.959840. [DOI] [PubMed] [Google Scholar]
- 11.Laird AK. Dynamics of tumor growth. Br J Cancer. 1964 Sep;18(3):490–502. doi: 10.1038/bjc.1964.55. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Friberg S, Mattson S. On the growth rates of human malignant tumors: Implications for medical decision making. J Surg Onc. 1997;65:284–297. doi: 10.1002/(sici)1096-9098(199708)65:4<284::aid-jso11>3.0.co;2-2. [DOI] [PubMed] [Google Scholar]
- 13.Benzekry S, Lamont C, Beheshti A, Tracz A, Ebos JML, Hlatky L, Hahnfeldt P. Classical mathematical models for description and prediction of experimental tumor growth. PLoS Comp Bio Aug. 2014;10(8):e1003800. doi: 10.1371/journal.pcbi.1003800. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Iwasa Y, Nowak MA, Michor F. Evolution of resistance during clonal expansion. Genetics April. 2006;172(4):2557–2566. doi: 10.1534/genetics.105.049791. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Durrett R, Moseley S. Evolution of resistance and progression to disease during clonal expansion of cancer. Theo Pop Bio. 2010;77:42–48. doi: 10.1016/j.tpb.2009.10.008. [DOI] [PubMed] [Google Scholar]
- 16.Komarova NL, Wodarz D. Drug resistance in cancer: principles of emergence and prevention. PNAS USA. 2005;102:9714–9719. doi: 10.1073/pnas.0501870102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Komarova NL. Stochastic modeling of drug resistance in cancer. J Theo Bio. 2006;239:351–366. doi: 10.1016/j.jtbi.2005.08.003. [DOI] [PubMed] [Google Scholar]
- 18.Goldie JH, Coleman AJ. Drug Resistance in Cancer: Mechanisms and Models. Cambridge University Press; 1998. [Google Scholar]
- 19.Navin N, et al. Inferring tumor progression from genomic heterogeneity. Genome Res. 2010;20:68–80. doi: 10.1101/gr.099622.109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Marusyk A, Polyak K. Tumor heterogeneity: causes and consequences. Biochim Biophys Acta. 2010;1805:105–117. doi: 10.1016/j.bbcan.2009.11.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Durrett R, Foo J, Leder K, Mayberry J, Michor F. Intratumor heterogeneity in evolutionary models of tumor progression. Genetics. 2011 Jun;188:461–477. doi: 10.1534/genetics.110.125724. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Abbott RG, Forest S, Pienta KJ. Simulating the hallmarks of cancer. Artificial Life. 2006;12:617–634. doi: 10.1162/artl.2006.12.4.617. [DOI] [PubMed] [Google Scholar]
- 23.Heppner GH. Tumor heterogeneity. Cancer Research. 1984 Jun;44:2259–2265. [PubMed] [Google Scholar]
- 24.Shipitsin M, Campbell LL, Argani P, et al. Molecular definition of breast tumor heterogeneity. Cancer Cell. 2007 Mar;11:259–273. doi: 10.1016/j.ccr.2007.01.013. [DOI] [PubMed] [Google Scholar]
- 25.Gerlinger M, Rowan AJ, Horswell S, et al. Intratumor heterogeneity and branched evolution revealed by multiregion sequencing. New England J Medicine. 2012 Mar 8;366(10):883–892. doi: 10.1056/NEJMoa1113205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Smith J Maynard. Evolution and The Theory of Games. Cambridge University Press; 1982. [Google Scholar]
- 27.Nowak MA. Evolutionary Dynamics: Exploring the Equations of Life. Harvard University Press; 2006. [Google Scholar]
- 28.Vincent T. Carcinogenesis as an evolutionary game. Adv In Complex Sys. 2006;9(4):369–382. [Google Scholar]
- 29.Axelrod R, Axelrod DE, Pienta KJ. Evolution of cooperation among tumor cells. PNAS USA. 2006 Sep 5;103(36):13474–13479. doi: 10.1073/pnas.0606053103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Kendal WS. Gompertzian growth as a consequence of tumor heterogeneity, Math. Biosciences. 1985;73:103–107. [Google Scholar]
- 31.Laird AK. Dynamics of tumor growth: Comparison of growth rates and extrapolation of growth curve to one cell. British J Cancer. 1965 doi: 10.1038/bjc.1965.32. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Adam JA, Bellomo N, editors. A Survey of Models for Tumor-Immune System Dynamics] Springer-Verlag; 1997. Chapter 3: Mathematical Modeling of Tumor Growth Kinetics, Z. Bajzer, S. Vuk-Pavlovic, M. Huzak.; Chapter 7: Tumor Heterogeneity and Growth Control, S. Michelson, J.T. Leith. [Google Scholar]
- 33.West GB, Brown JH, Enquist BJ. A general model for ontogenetic growth. Nature. 2001 Oct;413:11. doi: 10.1038/35098076. [DOI] [PubMed] [Google Scholar]
- 34.Araujo RP, McElwain DLS. A history of the study of solid tumor growth: The contribution of mathematical modeling. Bull of Math Bio. 2004;66:1039–1091. doi: 10.1016/j.bulm.2003.11.002. [DOI] [PubMed] [Google Scholar]
- 35.Burrell RA, McGranahan N, Bartek J, Swanton C. The causes and consequences of genetic heterogeneity in cancer evolution. Nature Review. 2013 Sep;509:19. 338–345. doi: 10.1038/nature12625. Nature Insight: Tumour Heterogeneity 19 Sept 2013 Vol 501 No 7467. [DOI] [PubMed] [Google Scholar]
- 36.Cover TM, Thomas JA. Elements of Information Theory. 2nd. Wiley-Interscience; 2006. [Google Scholar]
- 37.Newton PK, Mason J, Hurt B, Bethel K, Bazhenova L, Nieva J, Kuhn P. Entropy, complexity, and Markov diagrams for random walk cancer models. Nature Scientific Reports. 2014 Dec;4:7558. doi: 10.1038/srep07558. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Schuh RT. Biological Systematics: Principles and Applications. Cornell University Press; 2000. [Google Scholar]
- 39.Traina TA, Dugan U, Higgins B, Kolinsky K, Theodoulou M, Hudis CA, Norton L. Optimizing chemotherapy dose and schedule by Norton-Simon mathematical modeling. Breast Dis. 2010 Jan 1;311:7–18. doi: 10.3233/BD-2009-0290. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Abbott LH, Michor F. Mathematical models of targeted cancer therapy. Br J Cancer. 2006;95:1136–1141. doi: 10.1038/sj.bjc.6603310. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Ross SM. Introduction to Probability Models. 10th. Academic Press; 2010. [Google Scholar]
- 42.Norton L. A Gompertzian model of human breast cancer growth. Cancer Research. 1988 [PubMed] [Google Scholar]
- 43.Frenzen CL, Murray JD. A cell kinetics justification for Gompertz’ equation. SIAM J Appl Math. 1986 Aug;46(4):614–629. [Google Scholar]
- 44.Kozusko F, Bajzer Z. Combining Gompertzian growth and cell population dynamics. Math Biosci. 2003;185:153–167. doi: 10.1016/s0025-5564(03)00094-4. [DOI] [PubMed] [Google Scholar]
- 45.Tracqui P. Biophysical models of tumor growth. Rep Prog Phys. 2009;72:056701. [Google Scholar]
- 46.Swanton C. Intratumor heterogeneity: evolution through space and time. Cancer Res. 2012 Oct 1;72(19):4875–4882. doi: 10.1158/0008-5472.CAN-12-2217. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Rejniak KA, Anderson ARA. Hybrid models of tumor growth. Wiley Interdisciplinary Reviews, Systems Biology and Medicine. 2011 Jan-Feb;3(1):115–125. doi: 10.1002/wsbm.102. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Cristini V, Lowengrub J, Nie Q. Nonlinear simulation of tumor growth. J Math Bio. 2003;46(3):191–224. doi: 10.1007/s00285-002-0174-6. [DOI] [PubMed] [Google Scholar]
- 49.Wise SM, Lowengrub JS, Frieboes HB, Cristini V. Three-dimensional multispecies nonlinear tumor growth I: Model and numerical method. J Theor Bio. 2008;253(3):524–543. doi: 10.1016/j.jtbi.2008.03.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Macklin P, Lowengrub J. Nonlinear simulation of the effect of microenvironment on tumor growth. J Theor Bio. 2007;245:677–704. doi: 10.1016/j.jtbi.2006.12.004. [DOI] [PubMed] [Google Scholar]
- 51.Frieboes HB, Jin F, Chuang YL, Wise SM, Lowengrub JS. Three-dimensional multispecies nonlinear tumor growth II: Tumor invasion and angiogenesis. J Theor Bio. 2010;264(4):1254–1278. doi: 10.1016/j.jtbi.2010.02.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Kim Y, Stolarska MA, Othmer HG. The role of the microenvironment in tumor growth and invasion. Prog In Biophys and Mol Bio. 2011;106(2):353–379. doi: 10.1016/j.pbiomolbio.2011.06.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Traulsen A, Clauseen JC, Hauert C. Coevolutionary dynamics: from finite to infinite populations. Phys Rev Letters. 2005;95:238701. doi: 10.1103/PhysRevLett.95.238701. [DOI] [PubMed] [Google Scholar]
- 54.Heppner GH, Miller BE. Tumor heterogeneity: Biological implications and therapeutic consequences. Cancer Metastasis Rev. 1983;2:5–23. doi: 10.1007/BF00046903. [DOI] [PubMed] [Google Scholar]
- 55.West J, Hasnain Z, Mason J, Newton PK. The prisoner’s dilemma as a cancer model. arXiv:1512.04591v4. 2015 doi: 10.1088/2057-1739/2/3/035002. [DOI] [PMC free article] [PubMed] [Google Scholar]