Skip to main content
British Journal of Cancer logoLink to British Journal of Cancer
. 2017 Feb 9;116(6):785–792. doi: 10.1038/bjc.2017.5

Cancer treatment scheduling and dynamic heterogeneity in social dilemmas of tumour acidity and vasculature

Artem Kaznatcheev 1,*, Robert Vander Velde 2, Jacob G Scott 1,3, David Basanta 1
PMCID: PMC5355932  PMID: 28183139

Abstract

Background:

Tumours are diverse ecosystems with persistent heterogeneity in various cancer hallmarks like self-sufficiency of growth factor production for angiogenesis and reprogramming of energy metabolism for aerobic glycolysis. This heterogeneity has consequences for diagnosis, treatment and disease progression.

Methods:

We introduce the double goods game to study the dynamics of these traits using evolutionary game theory. We model glycolytic acid production as a public good for all tumour cells and oxygen from vascularisation via vascular endothelial growth factor production as a club good benefiting non-glycolytic tumour cells. This results in three viable phenotypic strategies: glycolytic, angiogenic and aerobic non-angiogenic.

Results:

We classify the dynamics into three qualitatively distinct regimes: (1) fully glycolytic; (2) fully angiogenic; or (3) polyclonal in all three cell types. The third regime allows for dynamic heterogeneity even with linear goods, something that was not possible in prior public good models that considered glycolysis or growth factor production in isolation.

Conclusions:

The cyclic dynamics of the polyclonal regime stress the importance of timing for anti-glycolysis treatments like lonidamine. The existence of qualitatively different dynamic regimes highlights the order effects of treatments. In particular, we consider the potential of vascular normalisation as a neoadjuvant therapy before follow-up with interventions like buffer therapy.

Keywords: evolutionary game theory, public goods game, angiogenesis, aerobic glycolysis, VEGF, heterogeneity


Tumours are highly heterogeneous ecosystems (Welch, 2016), with various cancerous and non-cancerous sub-populations of cells competing for access to space, growth-factors, nutrients, oxygen and other limited resources. This emergence and persistence of heterogeneity has implications for diagnosis, treatment and disease progression (Dexter and Leith, 1986; Maley et al, 2006; Venkatesan and Swanton, 2015). Current explanations of intra-tumour heterogeneity include evolutionary neutrality (Iwasa and Michor, 2011), niche specialisation (Nagy, 2004; Gatenby and Gillies, 2008), non-equilibrium dynamics (Gonzalez-Garcia et al, 2002) and frequency-dependent selection (Tomlinson, 1997; Tomlinson and Bodmer, 1997; Basanta et al, 2008). It remains an open problem to identify which, or how many, of these mechanisms are at work in any given neoplasm (Merlo et al, 2006).

The progression of neoplasms to metastatic disease is marked by the acquisition of a number of hallmarks (Hanahan and Weinberg, 2000; Hanahan and Weinberg, 2011), including self-sufficiency of growth factor production for angiogenesis and reprogramming energy metabolism for aerobic glycolysis. As with many of the other hallmarks, there is evidence of intra-tumour heterogeneity in both the production of cytokines like vascular endothelial growth factor (VEGF) (Achilles et al, 2001; Marusyk and Polyak, 2010) and glycolysis (Kallinowski et al, 1988; Willmann et al, 2015; Hensley et al, 2016). Given that it is possible for an individual cancer cell to not invest (as heavily) in angiogenesis or not forgo the benefit of oxygen by avoiding aerobic glycolysis then how do these population level traits evolve, and how are they maintained? We answer this question with a mathematical model that treats acid production through glycolysis as a tumour-wide public good that is coupled to the club good of oxygen from better vascularisation.

By investing in better vascularisation—by (over)-producing VEGF, for example—the whole tumour can benefit from an improvement in vascularisation and the subsequent rise in availability of nutrients and oxygen (Nishida et al, 2006). An individual cancer cell, however, could reap these benefits from mere proximity to (over)-producers and save on the energetic cost of producing the relevant growth-factor: free-riding on the benefits created by other cancer cells. These free-riding cancer cells could out-compete the (over)-producers and take over the tumour. Such a switch away from growth factor (over)-production, however, decreases the overall fitness of the tumour—hurting the society of cancer cells—by making fewer nutrients and less oxygen available to all cancer cells. What is favourable at the individual level is unfavourable at the tumour population level. This represents a classic example of an evolutionary social dilemma.

A similar social dilemma exists for the increase of acidification from glycolysis. It is striking that the upregulation of glycolysis—the so called Warburg effect (Warburg et al, 1926; Warburg, 1956a, 1956b)—is observed even in the absence of hypoxia (Pauwels et al, 2000; Gambhir, 2002; Hanahan and Weinberg, 2011). Glycolysis is comparatively inefficient in terms of ATP yield when oxygen is not a limiting factor, raising the conundrum of what selective advantage it provides to a cell to compensate for its energetic cost. The acid-mediated tumour invasion hypothesis suggests that this advantage comes from the acidification of the tumour micro-environment that leads to higher proliferation and invasiveness through increased extracelluar matrix degradation and an increase in normal cell death (Gatenby, 1995; Gatenby and Gawlinski, 1996; Gatenby and Gawlinski, 2003; Gatenby and Gillies, 2004). Tumour cells typically achieve a maximum proliferation rate with an extracellular pH that is acidic enough (pHe=6.8) to lead to normal cells' death from activation of p53-dependent apoptosis pathways and loss of function of critical pH-sensitive genes (Rubin, 1971; Casciari et al, 1992; Park et al, 1999; Webb et al, 2011). Thus, acidosis provides a (relative) benefit to acid-resistant tumour cells competing against the acid-sensitive non-cancerous cells.

Early models have assumed that resistance to acidity is available only to glycolytic cells (Gatenby and Gawlinski, 1996; Basanta et al, 2008; Basanta et al, 2011), but there is little evidence to suggest that aerobic cancer cells could not also develop this resistance. In human colonic adenoma, for example, low extracellular pH is accompanied by p53-dependent apoptosis in wild type. But cells that lost wild-type p53—a common mutation among cancer cells—receive a selective growth advantage independent of their metabolic process (Williams et al, 1999). In fact, recent in silico models parameterised from experimental data suggest that tumours acquire glycolytic capacity after acid resistance (Robertson-Tessi et al, 2015). In the absence of hypoxia, resistant non-glycolytic cells could benefit from aerobic metabolism to out-compete the glycolytic cancer cells without contributing to acidosis. Such a switch away from glycolysis decreases the overall acid production by the tumour, normalising pH, and negating part of the advantage acid-resistant cells have against non-cancerous soma. This individual benefit hurts the society of cancer cells and qualifies acid production as another evolutionary social dilemma.

Evolutionary game theory (EGT) is a tool to make sense of these sort of social dilemmas. Originating with Maynard Smith and Price (1973), EGT is a mathematical approach to modelling frequency-dependent selection where players interact via phenotypic strategies. EGT has been used in oncology to study the conditions that select for more aggressive tumour phenotypes in gliomas (Basanta et al, 2008; Basanta et al, 2011), colorectal cancer (Gatenby and Vincent, 2003; Gatenby et al, 2005), multiple myeloma (Dingli et al, 2009) and prostate cancer (Basanta et al, 2012); as well as the effects of treatment on the progression of cancer (Basanta et al, 2012; Orlando et al, 2012).

Recently, Archetti introduced the public goods game to oncology for looking separately at two-strategy problems like growth-factor production (Archetti, 2013) and the production of acid in the Warburg effect (Archetti, 2014). With colleagues, he has implemented the growth-factor production game in an experimental system (Archetti et al, 2015). He concluded that a heterogeneous (polyclonal) equilibrium of producers and non-producers cannot exist unless the benefits that these goods provide are nonlinear in the number of producers. In the linear cases, considered separately, this would predict that VEGF (over)-producers (VOP) and glycolytic (GLY) cells would both go extinct, leaving a population of aerobic cells that do not call for more vasculature (DEF). This is in accord with the intuition that free-riders always win in social dilemmas, and would result in the elimination of heterogeneity of production of VEGF and acidity. However, angiogenesis and glycolysis are intimately related and should not be considered in isolation because the benefits of oxygen affect the degree of hypoxia and thus the relative cost of glycolysis when compared with aerobic metabolism.

We focus on this inter-dependence of these two hallmarks by coupling the two goods—the public good of acidification and the club good of vascularisation—in a three-strategy game. Our model reveals dynamics that cannot be predicted from treating micro-environmental acidification and vascularisation in isolation from each other. In particular, we show dynamic cycles of constantly changing proportions of cancer cell types—a realisation of intra-tumour heterogeneity even with linear goods. This has consequences for the design of treatments, as it suggests that the timing, and order, of therapeutic interventions could drastically affect the outcome. For example, it shows the importance of preparatory treatment or neoadjuvant therapy that manages the low-frequency cell types in contrast to simply targeting the most common clone, and of the advantages of targeting the tumour micro-environment instead of just targeting the cancer cell.

Materials and methods

Double goods game

Consider a focal glycolytic cell interacting with n other nearby cells of which nG∈[0, n] are also glycolytic. Together they produce a relative benefit ba(nG+1) due to acidity to be distributed among the n+1 cells. Therefore, this focal glycolytic cells receives a net benefit of Inline graphic.

By averaging over all possible focal glycolytic cells and interaction group compositions, we get that the expected fitness of a population of glycolytic (GLY) cells with random assortment is

graphic file with name bjc20175e2.jpg

where xG is the proportion of GLY in the population, and the angle brackets represent averaging with, in this case, nG sampled from the binomial distribution with n trials and xG as the probability of success (i.e., choosing a GLY cell). It is possible to include an explicit cost of glycolysis in equation (1), but this is mathematically equivalent to adjustment to ba or (relative) fitness of the aerobic cells (for more detail, see Supplementary Appendix A.2).

For an aerobic cell—non-GLY—in a similar case, the benefit due to acid is only Inline graphic, since it does not itself produce acid. But, in addition to the effects of acid, an aerobic cell benefits from a more efficient metabolism by using the oxygen delivered by the vasculature. In particular, a VEGF (over)-producer (VOP) will receive the benefit Inline graphic but pay a cost c for the higher production, and the aerobic non-(over)-producer of VEGF (DEF) will receive the benefit Inline graphic but pay no cost, essentially free-riding.

Therefore, the expected fitness of the two aerobic populations with random assortment are

graphic file with name bjc20175e6.jpg
graphic file with name bjc20175e7.jpg

where xV, xD are the proportions of VOP and DEF in the population, and the averages in the second summands are taken with nG, nV sampled from the multinomial distribution with n trials and xG as the probability of the first outcome (i.e., choosing a GLY cell), and xV as the probability of the second outcome (i.e., choosing a VOP cell). In each equation, the first summand is the benefit due to acidification and the second is the benefit from the club good of vascularisation. These fitness functions are described in more detail in Supplementary Appendix A.

Note that, unlike acidity, the benefit of oxygen from vasculature is divided among only the nnG+1 non-glycolytic cells in the interacting group and not all n+1 cells in the interaction group. Such a good that excludes some (GLY) but is distributed evenly among the others—regardless of if they contributed (VOP) or not (DEF)—is known in economics as a club good (Buchanan, 1965). Alternatively, for an ecological connection, we can think of glycolytic cells as loners in the optional public goods game (Hauert et al, 2002).

Evolutionary dynamics

The evolutionary dynamics of the population are given by the replicator equation (Taylor and Jonker, 1978; Hofbauer and Sigmund, 1998):

graphic file with name bjc20175e8.jpg
graphic file with name bjc20175e9.jpg
graphic file with name bjc20175e10.jpg

where Inline graphic is the average fitness of the population.

Alternatively, we can write down these dynamics in their factored form (see Supplementary Appendix B for a proof of equivalence) as:

graphic file with name bjc20175e12.jpg
graphic file with name bjc20175e13.jpg

where p=xG is the proportion of GLY, Inline graphic is the proportion of aerobic cells that (over)-produce VEFG, and Inline graphic is the average fitness of the aerobic cells.

These equations are accurate for large populations at carrying capacity—in vivo tumours up against a resource limitation or managed by an immune response—or in their exponential growth phase—typical of in vitro experiments. In other cases they serve as an approximation. See Supplementary Appendix C for more discussion on interpreting replicator dynamics.

Results

We can apply the double goods game described in the previous section to study the temporal evolution of different populations in various scenarios. These scenarios are set by the four micro-environmental parameters of our model: ba, the benefit per unit of acidification; bv, the benefit from oxygen per unit of vascularisation; c, the cost of (over)-producing VEGF; and, n, the number of interaction partners in the public good. Within a scenario, the last piece of information is the initial proportions of cells xG(0), xV(0), xD(0) (or p(0), q(0) in the factored form). The values of these variables will depend on the particular cancer ecosystem (i.e., patient and type of tumour). Since these variables can be difficult or impossible to measure clinically, it is important to understand what broad qualitative relationships between them mean for long-term dynamics.

Although the fitness functions in equations (1, 2, 3) are Bernstein polynomials of degree n, we can use properties of binomial coefficients to simplify the corresponding gain functions of the factored replicator dynamics without any approximation (see Supplementary Appendix D). This allows us to rewrite equations (7) and (8) as

graphic file with name bjc20175e16.jpg
graphic file with name bjc20175e17.jpg

Notice that when p=0, equation (9) recovers the social dilemma of angiogenesis that we discussed in the introduction with the free-riding DEF cells taking over the population of aerobic cells, driving q towards 0. When p≠0 and q=0, we have a hypoxic tumour, and glycolytic cells are favoured, driving p towards 1. On the other hand, if p≠0 and q=1, then we recover the social dilemma of aerobic glycolysis, with VOP cells favoured if ba<(bvc)(n+1), and GLY cells otherwise. In other words, if one of the strategies is absent in the population then no persistence of heterogeneity is possible among the remaining two strategies. This is consistent with predictions from two-strategy linear public goods games.

The more interesting case that is unique to our model is when all three strategies are initially expressed in the population. In this setting, we can analytically characterise the population dynamics into one of three the qualitatively different regimes based on the values of the four main micro-environmental parameters by solving how the above gain functions for p and q cross zero (for more information, see Supplementary Appendix D). This classification is shown visually in Figure 1. We name these three regimes by their end points (and evolutionarily stable strategies); fully glycolytic (green region in Figure 1), fully angiogenic (red) and heterogeneous (yellow). The following three subsections describe each of these three regimes in turn. Example dynamics from each regime are given in the inset simplexes of Figure 1.

Figure 1.

Figure 1

The three possible dynamic regimes for the double goods game.The possible parameter settings for bv are varied horizontally, starting at c. The possible parameter settings for Inline graphic are varied vertically, starting at 0. Each of the three inset simplexes have the same coordinates, with the top vertex corresponding to all GLY, left to all VOP, and right to all DEF. Each simplex is a typical example of dynamics within its regime. The specific micro-environmental parameters for each example: (1) ba=37.5, bv=2, c=1, n=4 for green; (2) ba=2.5, bv=7, c=1, n=4 for red; and (3) ba=2.5, bv=2, c=1, n=4 for yellow.

Fully glycolytic tumours: Inline graphic

We solve for the average fitness of aerobic cells as

graphic file with name bjc20175e19.jpg

where the first summand is the benefit from acidification, and the second summand is the benefit due to oxygen from vascularisation. Since DEF only consume the club good from vascularisation, without producing any, equation (11) is maximised to bap+bvc when all aerobic cells are producers of the club good (i.e., all VOP, q=1).

Similarly, we can solve for the fitness of GLY:

graphic file with name bjc20175e20.jpg

where the first summand is the benefit from acidosis that all cancer cells receive, and the second summand is the slight increase in acidification that glycolytic cells get from always being in a group with an extra acid producer (themselves).

From this, we see that if the fitness benefit of a single unit of acidification Inline graphic is higher than the maximum benefit from the club good for aerobic cells (bvc), then the difference between equations (12) and (11) is always positive. Thus, GLY will always have a strictly higher fitness than aerobic cells, and be selected for. In this scenario, the population will converge towards all GLY, regardless of the initial proportions (as long as there is at least some GLY in the population). This dynamic regime is achieved for any micro-environmental parameter settings corresponding to the green region in Figure 1.

Fully angiogenic tumours: Inline graphic

Consider an interaction group with nV VOP and nD DEF cells. If the focal agent interacting with this group is an (over)-producer, then it will receive a benefit from oxygen of Inline graphic. If the focal agent is a defector, then they will receive Inline graphic. Since, by definition, nV+nD is less than the interaction group size n, then regardless of the number of glycolytic cells:

graphic file with name bjc20175e25.jpg

Thus, if the benefit to (over)-producers from their extra unit of vascularisation Inline graphic is higher than the cost c to produce that unit (or, equivalently, if bvc>cn) then VOP will always have a strictly higher fitness than DEF, selecting q towards 1. In addition, if the maximum possible benefit of the club good to aerobic cells (bvc) is higher than the benefit of an extra unit of acidification Inline graphic then (based on the difference of equations (12) and (11), or the negation of the conditions in the previous section) for sufficiently high number of (over)-producers (q close enough to 1), GLY will have lower fitness than aerobic cells. When both conditions are satisfied, the population will converge towards all VOP. This dynamic regime is achieved for any micro-environmental parameter settings corresponding to the red region in Figure 1. Notice in the typical example dynamics inset Figure 1 that if a population starts with mostly aerobic cells (p close to 0) that are not overproducing VEGF (q close to 0), then the population might see a transient decrease in the number of aerobic cells on the way to the all-VOP equilibrium.

Heterogeneous tumours: Inline graphic

From equation (13), we know that if the benefit from an extra unit of vascularisation in a fully aerobic group Inline graphic is lower than the cost c to produce that unit, then for a sufficiently low proportion of GLY and thus sufficiently large number of aerobic cells sharing the club good, DEF will have higher fitness than VOP. This will lead to a decrease in the proportion q of (over)-producers among aerobic cells and thus a decrease in the average fitness of aerobic cells Inline graphic (see equation (11)). A lower fitness in aerobic cells will lead to an increase in the proportion of GLY until the aerobic groups (among which the club good is split) get sufficiently small and fitness starts to favour VOP over DEF, swinging the dynamics back. Thus, resulting in cyclic dynamics, examples of which can be seen in the top panel of Figure 2 or inset Figure 1.

Figure 2.

Figure 2

Dynamics for an untreated tumour and examples of two different sets of dynamics resulting from changed timings of a given intervention.Each graph is proportion of cells vs time, with GLY (xG) in green, VOP (xV) in solid red and DEF (xD) in blue. The dashed red lines show the proportion of VOP among aerobic cells Inline graphic. All three graphs start with same initial conditions (p(0)(=xG(0))=0.9, q(0)=0.6) and the same tumour micro-environment (ba=2.5, bv=2, c=1, n=4). In the top panel, we highlight the cyclic behaviour by noting when p and q return to their initial values given by the green circles and red hollow circles, respectively. In the second and third panel, we consider an anti-glycolytic treatment of the same strength (subtracting 3 from the fitness of GLY) and the same duration (three time steps). The difference between the second and third panel is in timing, marked in grey: the second panel starts treatment at t=0, and the third at t=10. If the proportion of a cell-type goes below 10−4 then it dies off entirely. At t=30, no cell types are extinct in the top panel; in the second and third panel, the only non-extinct cell types are GLY and DEF, respectively. This highlights the importance of the timing of therapy for evolutionary outcome.

Under this scenario, the population will orbit around an internal fixed-point at Inline graphic and Inline graphic. The exact position of p* is the solution to the polynomial equation Inline graphic (see Supplementary Appendix D for details). The amplitude of the orbit will depend on the distance between p(0), q(0) and p*, q*. This dynamic regime is achieved for any micro-environmental parameter settings corresponding to the yellow region in Figure 1.

Therapeutic implications

With the possible dynamic regimes in mind, we can think about treatment in one of two ways: (1) treatments that target the player by directly reducing the proportion of a given strategy in the population; or (2) treatments that target the game by changing the parameters (ba, bv, c or n) and taking us from one dynamic regime to another. In both cases, we need to be mindful of counter-intuitive phenomena, such as timing and order effects and the importance of managing heterogeneity.

Treat the player: targeting cell types

A treatment that targets a given strategy and can be applied long enough to drive that strategy to extinction can be considered a viable intervention. If we were in the first dynamic regime (green in Figure 1; Inline graphic) or the second dynamic regime (red in Figure 1; Inline graphic), then the population will always converge towards all-GLY or all-VOP, respectively. This means that unless the strategy-targeting therapy is strong and long enough to drive that strategy to extinction, it will not affect the overall outcome beyond a potential transient delay. In particular, the timing of the therapy will not have a qualitatively significant effect. In the heterogeneous case (yellow in Figure 1; Inline graphic); however, counter-intuitive results are possible and the timing of treatment becomes important.

As an example in Figure 2, we consider a tumour described by the micro-environmental parameters ba=2.5, bv=2, c=1 and n=4 and an initial composition xG=0.9, xV=0.06, xD=0.04. If left untreated then the proportions of strategies would cycle around an internal fixed point at Inline graphic, as seen in the top panel. In that panel, the proportion of glycolytic cells xG will oscillate between about 0.9 and 0.34 with a period of about 26 time units.

In the standard paradigm of personalised medicine, treatments target the most abundant subclone. In this case, it would be the glycolytic cells—the most common, and least fit, strategy at t=0. We can imagine targeting these cells specifically, with a therapy that imposes a large fitness cost, like lonidamine (Floridi et al, 1981), for example. Here, we choose to set the fitness cost of this therapy to 3, which leads to very quick and aggressive reduction in the GLY population. In a perfect world, the therapy would be applied long enough to drive GLY to extinction. But what happens if it is only applied for three time units, enough to drastically reduce the proportion of GLY cells—below detectable levels—but not below the extinction threshold? The cyclic dynamics then allow the glycolytic cells to recover by out-competing the mostly DEF population. To make matters worse, as GLY recovers, VOP is pushed below the extinction threshold leaving just the other two cell types to compete. Without VEGF (over)-production, the remaining aerobic cells are less fit than the glycolytic cells and are also driven to extinction. The overall result at t=30 is a relapse with even more glycolytic cells than before treatment. In this scenario, while the goal was to eliminate glycolytic cells, the opposite occurred: elimination of all the aerobic cells and creation of a completely glycolytic tumour.

If instead the treatment was delayed until t=10—when the GLY cells are at their lowest proportions, and highest fitness, in the tumour's natural cycle—then much more favourable results could be achieved. With the lower initial proportion of GLY cells, three units of time are long enough to drive the cells to extinction. Without glycolytic cells, the competition between DEF and VOP becomes a classic social dilemma and the VEGF (over)-producers are driven to extinction. The result is now what was desired: an aerobic tumour with no—or significantly diminished—ability to recruit blood vessels.

In this case, we can think of the natural tumour dynamics as a neoadjuvant ‘treatment' that lowered the GLY population slightly while preparing the VOPDEF composition to be in a favourable position after therapy.

Treat the game: targeting micro-environment

Consider the hypothetical case-study in Figure 3. Here, we have a situation with poor vasculature (bv=2, c=1, n=4) and a highly glycolytic tumour (p0=0.9, ba=37.5). If left untreated, the tumour would quickly reach all-GLY, driving the other two strategies extinct. Thus, the primary goal is to eliminate GLY, and create an easier-to-target all-VOP tumour. The secondary goal is to do so with minimal transient tumour heterogeneity. We consider two possible interventions. The treatments can be applied sequentially with a window of one time-step between them or simultaneously. One treatment is a buffer therapy (Robey et al, 2009) that reduces the benefit of acidity; setting ba=2.5 from then on. Another is vascular normalisation therapy (VNT) (Jain, 2013); setting bv=7 from then on. In Figure 3, the top panel considers giving buffer therapy followed by VNT, the middle panel has the VNT preceding the buffer therapy and the bottom panel has both treatments at given simultaneously at t=0.

Figure 3.

Figure 3

Three possible orders of therapeutic intervention.Each graph on the left shows the proportion of cells vs time, with GLY (xG) in green, VOP (xV) in red and DEF (xD) in blue. The dashed red lines show the proportion of VOP among aerobic cells Inline graphic. Each graph on the right is ba/(n+1) vs bv and shows how treatment moves the tumour between qualitatively different dynamic regimes through the space on micro-environmental parameters (for detailed explanation see Figure 1). All graphs start with same initial proportions (xG(0)=0.9, q(0)=0.6). The untreated tumour has parameters ba=37.5, bv=2, c=1 and n=4. In the top two panels, two treatments are applied: the first at time 0 and the second at time 1. In the last panel, a single treatment is applied at time 0. We consider two game-targeting treatments: (1) a buffer therapy that reduces the benefit of acidity (setting ba=2.5 from then on; horizontal arrows in the right panel); and (2) a vascular normalisation therapy (VNT; setting bv=7 from then on; vertical arrows in the right panel). In the top panel, buffer therapy is followed by VNT, the second panel shows VNT followed by buffer therapy, and in the final panel both treatments are given simultaneously (diagonal arrow in the bottom right panel).

The buffer then VNT (top panel) ordering produces immediate results, with the proportion of GLY no longer increasing—and even starting to decline—right away and reaching minimal levels earlier than VNT followed by buffer therapy (middle panel). However, the top ordering increases the heterogeneity among the anaerobic cells and although the tumour will eventually move to a state of all VOP, by time-step 3 (when it is nearly all VOP for the middle panel) there is a high level of VOPDEF heterogeneity, and earlier during treatment (say t=2) the top panel has heterogeneity in all three cell types. Something that the physician wanted to avoid.

In contrast, the middle ordering sees no immediate results from the VNT. Instead, this first treatment can be thought of as a neoadjuvant therapy that eliminates the VOPDEF heterogeneity among the rare aerobic cells before targeting the predominantly GLY population. By time-step 3, the middle ordering sees a similarly high level of response in the GLY phenotype, but without creating a high level of heterogeneity in the tumour.

We might expect that both normalising the benefit due to oxygen from vascularisation and decreasing the benefit from acidification at the same time would offer superior results to sequential therapy. However, as we can see from the bottom panel of Figure 3, this is not necessarily the case. Although the same reduction in glycolytic cells is reached 1 time unit faster than the sequential therapies, the heterogeneity among aerobic cells remains high; like buffer therapy followed by VNT.

Staggering buffer therapy after VNT might not produce immediately evident results but it lets us reach those results without encountering a highly heterogeneous tumour. When targeting the micro-environment, it matters which qualitatively different dynamic regimes the game goes through, even if the final micro-environmental parameters are the same.

Discussion

We introduce and classify the double goods game of acidity and vasculature. There are three qualitatively different dynamic regimes that end in a tumour that is either: (1) fully glycolytic (all-GLY); (2) fully angiogenic (all-VOP); or (3) a heterogeneous (polyclonal) cycle in proportions of all three cell types. Which of these regimes is achieved depends on the micro-environmental parameters like the benefit per producer due to acidity (ba), due to vasculature (bv) and the cost of (over)-producing VEGF (c). We show that polyclonal tumours made up of three different cellular strategies are evolutionarily stable even with linear goods. This stands in stark contrast to the all-DEF equilibrium that we would expect from considering the linear goods in isolation. Our results highlight the difficulty of ruling out possible dynamics from overly reductionist accounts of cancer, and the importance of modelling both the vascularisation and acidity when studying the Warburg effect (Warburg, 1956a, 1956b).

The dynamic nature of the polyclonal equilibrium reminds us of the importance of tracking the tumour composition through time, not basing treatment on measurements from a single time-point, and optimising the timing of treatment. As an example, we consider an anti-glycolysis treatment like lonidamine (Floridi et al, 1981). If timed correctly and applied for long enough, then this treatment can drive the glycolytic cells extinct and reduce the tumour to the two-strategy case of VEGF-production considered in prior work (Archetti, 2013). From there, somatic evolution will drive VEGF (over)-producers extinct, leaving us with an all- DEF tumour (lower panel of Figure 2). However, if the same treatment is applied at the wrong time in the cycle of heterogeneity (or not for long enough) then the glycolytic population can recover while the VEGF (over)-producers are driven extinct by non-producers. Without VEGF (over)-producers, the glycolytic cells can out-compete the aerobic cells and drive them to extinction, resulting in a fully glycolytic tumour (middle panel of Figure 2). A backfire effect for treatment.

Since the heterogeneous equilibrium is not the only possible outcome of these game dynamics, it is also important to measure the micro-environmental parameters such as the benefit per unit of vascularisation and per unit of acidification that determine the game. This dichotomy between tumour composition and micro-environmental parameters carries over from measurement to treatment. In section ‘Treat the game: targeting micro-environment', we consider treatments like buffer therapy (Robey et al, 2009) and VNT (Jain, 2013) to change the micro-environmental parameters and thus target the game. By shifting to a more desirable game, we allow natural somatic evolution to lead us to a better outcome. The order in which we shift between games is important, especially for transient heterogeneity.

For highly glycolytic tumours, it is important to consider neoadjuvant VNT before buffer therapy. VNT allows us to reduce the heterogeneity in aerobic cells before targeting the more common glycolytic cells. Thus, when buffer therapy turns the game against glycolytic cells, the tumour is prepped in a low heterogeneity state and moves to an all-VOP phenotype without high levels of transient polyclonality. If buffer therapy is applied before VNT, or even if the two are applied simultaneously, then the response of glycolytic cells is not only quicker but also prone to creating much longer lasting heterogeneity in the tumour. We expect that similar considerations of neoadjuvant therapy for managing the rarer cancer sub-types might prove effective in other treatment settings.

There are several different types of heterogeneity that can work for or against the patient. These include strategy heterogeneity (i.e., a polyclonal tumour) and game heterogeneity (i.e., differences in micro-environmental parameters). Physicians have to be mindful of both types when treating a given patient. So far, we concentrated on difference in micro-environmental parameters due to variation between patients, tissues and the effects of therapy. However, there can also be game heterogeneity within different regions of the same tumour. For example, Kaznatcheev (2015) have previously shown that the game within the bulk of the tumour can differ from the game at static boundaries. But it is through static boundaries like blood vessels, organ capsules or basement membranes that metastatic invasion happens. Theoretical future work could more explicitly model invasiveness by coupling our double goods game to the go-vs-grow game (Basanta et al, 2008) and also broadening the analysis to ex situ cells. If physicians want to minimise the risk of metastates then it is important to measure the micro-environmental parameters at such boundaries.

To make our EGT empirical, we need to concentrate on building experiments that apply recently developed protocols for measurements of microscopic games (Gore et al, 2009; Archetti et al, 2015; Li et al, 2015) to classic model systems of acid production and oxygen consumption (Rubin, 1971; Casciari et al, 1992; Park et al, 1999; Williams et al, 1999). Although the linear goods we consider in this paper are useful for defining protocols and approximating initial measurements, in future it would be helpful to fully analyse the nonlinear double goods game that we define in Supplementary Appendix A. This can be done by either adapting general tools such as the (Pena et al, 2014) analysis of sign changes of gain sequences, or by considering specific extreme cases like step functions. However, it is only through experimental work that we can determine the kinds of games that cancer cells play, and thus which modelling extensions are most fruitful to pursue.

Acknowledgments

JGS thanks the NIH for its generosity in providing a Loan Repayment Grant. DB acknowledges funding from NCI U01CA202958-01.

Footnotes

Supplementary Information accompanies this paper on British Journal of Cancer website (http://www.nature.com/bjc)

This work is published under the standard license to publish agreement. After 12 months the work will become freely available and the license terms will switch to a Creative Commons Attribution-NonCommercial-Share Alike 4.0 Unported License.

The authors declare no conflict of interest.

Supplementary Material

Supplementary Appendix

References

  1. Achilles E-G, Fernandez A, Allred EN, Kisker O, Udagawa T, Beecken W-D, Flynn E, Folkman J (2001) Heterogeneity of angiogenic activity in a human liposarcoma: a proposed mechanism for ‘no take' of human tumors in mice. J Natl Cancer Inst 93(14): 1075–1081. [DOI] [PubMed] [Google Scholar]
  2. Archetti M, Ferraro DA, Christofori G (2015) Heterogeneity for IGF-II production maintained by public goods dynamics in neuroendocrine pancreatic cancer. Proc Natl Acad Sci USA 112(6): 1833–1838. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Archetti M (2013) Evolutionary game theory of growth factor production: implications for tumour heterogeneity and resistance to therapies. Br J Cancer 109(4): 1056–1062. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Archetti M (2014) Evolutionary dynamics of the warburg effect: glycolysis as a collective action problem among cancer cells. J Theor Biol 341: 1–8. [DOI] [PubMed] [Google Scholar]
  5. Basanta D, Gatenby RA, Anderson ARA (2012) Exploiting evolution to treat drug resistance: combination therapy and the double bind. Mol Pharm 9(4): 914–921. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Basanta D, Scott JG, Fishman MN, Ayala G, Hayward SW, Anderson ARA (2012) Investigating prostate cancer tumour-stroma interactions: clinical and biological insights from an evolutionary game. Br J Cancer 106(1): 174–181. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Basanta D, Scott JG, Rockne R, Swanson KR, Anderson ARA (2011) The role of IDH1 mutated tumour cells in secondary glioblastomas: an evolutionary game theoretical view. Phys Biol 8(1): 015016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Basanta D, Simon M, Hatzikirou H, Deutsch A (2008) Evolutionary game theory elucidates the role of glycolysis in glioma progression and invasion. Cell Prolif 41(6): 980–987. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Buchanan JM (1965) An economic theory of clubs. Economica 32(125): 1–14. [Google Scholar]
  10. Casciari JJ, Sotirchos SV, Sutherland RM (1992) Variations in tumor cell growth rates and metabolism with oxygen concentration, glucose concentration, and extracellular pH. J Cell Physiol 151(2): 386–394. [DOI] [PubMed] [Google Scholar]
  11. Dexter DL, Leith JT (1986) Tumor heterogeneity and drug resistance. J Clin Oncol 4(2): 244–257. [DOI] [PubMed] [Google Scholar]
  12. Dingli D, Chalub FACC, Santos FC, Van Segbroeck S, Pacheco JM (2009) Cancer phenotype as the outcome of an evolutionary game between normal and malignant cells. Br J Cancer 101(7): 1130–1136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Floridi A, Paggi MG, Marcante ML, Silvestrini B, Caputo A, de Martino C (1981) Lonidamine, a selective inhibitor of aerobic glycolysis of murine tumor cells. J Natl Cancer Inst 66(3): 497–499. [PubMed] [Google Scholar]
  14. Gambhir SS (2002) Molecular imaging of cancer with positron emission tomography. Nat Rev Cancer 2(9): 683–693. [DOI] [PubMed] [Google Scholar]
  15. Gatenby RA, Gawlinski ET (1996) A reaction-diffusion model of cancer invasion. Cancer Res 56(24): 5745–5753. [PubMed] [Google Scholar]
  16. Gatenby RA, Gawlinski ET (2003) The glycolytic phenotype in carcinogenesis and tumor invasion insights through mathematical models. Cancer Res 63(14): 3847–3854. [PubMed] [Google Scholar]
  17. Gatenby RA, Gillies RJ (2004) Why do cancers have high aerobic glycolysis? Nat Rev Cancer 4(11): 891–899. [DOI] [PubMed] [Google Scholar]
  18. Gatenby RA, Gillies RJ (2008) A microenvironmental model of carcinogenesis. Nat Rev Cancer 8(1): 56–61. [DOI] [PubMed] [Google Scholar]
  19. Gatenby RA, Vincent TL, Gillies RJ (2005) Evolutionary dynamics in carcinogenesis. Math Models Methods Appl Sci 15(11): 1619–1638. [Google Scholar]
  20. Gatenby RA, Vincent TL (2003) An evolutionary model of carcinogenesis. Cancer Res 63: 6212–6220. [PubMed] [Google Scholar]
  21. Gatenby RA (1995) The potential role of transformation-induced metabolic changes in tumor-host interaction. Cancer Res 55(18): 4151–4156. [PubMed] [Google Scholar]
  22. Gonzalez-Garcia I, Sole RV, Costa J (2002) Metapopulation dynamics and spatial heterogeneity in cancer. Proc Natl Acad Sci USA 99(20): 13085–13089. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Gore J, Youk H, Van Oudenaarden A (2009) Snowdrift game dynamics and facultative cheating in yeast. Nature 459(7244): 253–256. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Hanahan D, Weinberg RA (2000) The hallmarks of cancer. Cell 100(1): 57–70. [DOI] [PubMed] [Google Scholar]
  25. Hanahan D, Weinberg RA (2011) Hallmarks of cancer: the next generation. cell 144(5): 646–674. [DOI] [PubMed] [Google Scholar]
  26. Hauert C, De Monte S, Hofbauer J, Sigmund K (2002) Replicator dynamics for optional public good games. J Theor Biol 218(2): 187–194. [DOI] [PubMed] [Google Scholar]
  27. Hensley CT, Faubert B, Yuan Q, Lev-Cohain N, Jin E, Kim J, Jiang L, Ko B, Skelton R, Loudat L, Wodzak M, Klimko C, McMillan E, Butt Y, Ni M, Oliver D, Torrealba J, Malloy CR, Kernstine K, Lenkinski RE, DeBerardinis RJ (2016) Metabolic heterogeneity in human lung tumors. Cell 164(4): 681–694. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Hofbauer J, Sigmund K (1998) Evolutionary Games and Population Dynamics. Cambridge University Press. [Google Scholar]
  29. Iwasa Y, Michor F (2011) Evolutionary dynamics of intratumor heterogeneity. PLoS One 6(3): e17866. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Jain RK (2013) Normalizing tumor microenvironment to treat cancer: bench to bedside to biomarkers. J Clin Oncol 31(17): 2205–2218. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Kallinowski F, Vaupel P, Runkel S, Berg G, Fortmeyer HP, Baessler KH, Wagner K, Mueller-Klieser W, Walenta S (1988) Glucose uptake, lactate release, ketone body turnover, metabolic micromilieu, and pH distributions in human breast cancer xenografts in nude rats. Cancer Res 48(24 Part 1): 7264–7272. [PubMed] [Google Scholar]
  32. Kaznatcheev A, Scott J, Basanta D (2015) Edge effects in game-theoretic dynamics of spatially structured tumours. J R Soc Interface 12(108): 20150154. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Li X-Y, Pietschke C, Fraune S, Altrock PM, Bosch TCG, Traulsen A (2015) Which games are growing bacterial populations playing? J R Soc Interface 12(108): 20150121. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Maley CC, Galipeau PC, Finley JC, Jon Wongsurawat V, Li X, Sanchez CA, Paulson TG, Blount PL, Risques R-A, Rabinovitch PS, Reid BJ (2006) Genetic clonal diversity predicts progression to esophageal adenocarcinoma. Nat Genet 38(4): 468–473. [DOI] [PubMed] [Google Scholar]
  35. Marusyk A, Polyak K (2010) Tumor heterogeneity: causes and consequences. Biochim Biophys Acta 1805(1): 105–117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Maynard Smith J, Price GR (1973) The logic of animal conflict. Nature 246: 15. [Google Scholar]
  37. Merlo LMF, Pepper JW, Reid BJ, Maley CC (2006) Cancer as an evolutionary and ecological process. Nat Rev Cancer 6(12): 924–935. [DOI] [PubMed] [Google Scholar]
  38. Nagy JD (2004) Competition and natural selection in a mathematical model of cancer. Bull Math Biol 66(4): 663–687. [DOI] [PubMed] [Google Scholar]
  39. Nishida N, Yano H, Nishida T, Kamura T, Kojiro M (2006) Vascular health and risk management. Angiogenesis Cancer 2: 213–219. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Orlando PA, Gatenby RA, Brown JS (2012) Cancer treatment as a game: integrating evolutionary game theory into the optimal control of chemotherapy. Phys Biol 9(6): 065007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Park HJ, Lyons JC, Ohtsubo T, Song CW (1999) Acidic environment causes apoptosis by increasing caspase activity. Br J Cancer 80(12): 1892. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Pauwels EKJ, Sturm EJC, Bombardieri E, Cleton FJ, Stokkel MPM (2000) Positron-emission tomography with [18f] fluorodeoxyglucose. J Cancer Res Clin Oncol 126(10): 549–559. [DOI] [PubMed] [Google Scholar]
  43. Pena J, Lehmann L, Noldeke G (2014) Gains from switching and evolutionary stability in multi-player matrix games. Journal of Theoretical Biology 346: 23–33. [DOI] [PubMed] [Google Scholar]
  44. Robertson-Tessi M, Gillies RJ, Gatenby RA, Anderson ARA (2015) Impact of metabolic heterogeneity on tumor growth, invasion, and treatment outcomes. Cancer Res 75(8): 1567–1579. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Robey IF, Baggett BK, Kirkpatrick ND, Roe DJ, Dosescu J, Sloane BF, Hashim AI, Morse DL, Raghunand N, Gatenby RA, Gillies RJ (2009) Bicarbonate increases tumor pH and inhibits spontaneous metastases. Cancer Res 69(6): 2260–2268. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Rubin H (1971) pH and population density in the regulation of animal cell multiplication. J Cell Biol 51(3): 686–702. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Taylor PD, Jonker LB (1978) Evolutionary stable strategies and game dynamics. Math Biosci 40(1): 145–156. [Google Scholar]
  48. Tomlinson IP, Bodmer WF (1997) Modelling the consequences of interactions between tumour cells. Br J Cancer 75(2): 157. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Tomlinson IP (1997) Game-theory models of interactions between tumour cells. Eur J Cancer 33(9): 1495–1500. [DOI] [PubMed] [Google Scholar]
  50. Venkatesan S, Swanton C (2015) Tumor evolutionary principles: how intratumor heterogeneity influences cancer treatment and outcome. In American Society of Clinical Oncology Educational Book/ASCO. Meeting, Vol. 35, pp e141-e149. [DOI] [PubMed]
  51. Warburg O, Wind F, Negelein E (1926) Ueber den stoffwechsel von tumoren im körper. J Mol Med 5(19): 829–832. [Google Scholar]
  52. Warburg O (1956a) On the origin of cancer cells. Science 123(3191): 309–314. [DOI] [PubMed] [Google Scholar]
  53. Warburg O (1956b) On respiratory impairment in cancer cells. Science 124(3215): 269–270. [PubMed] [Google Scholar]
  54. Webb BA, Chimenti M, Jacobson MP, Barber DL (2011) Dysregulated pH: a perfect storm for cancer progression. Nat Rev Cancer 11(9): 671–677. [DOI] [PubMed] [Google Scholar]
  55. Welch DR (2016) Tumor heterogeneity—a ‘contemporary concept' founded on historical insights and predictions. Cancer Res 76(1): 4–6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Williams AC, Collard TJ, Paraskeva C (1999) An acidic environment leads to p53 dependent induction of apoptosis in human adenoma and carinoma cell lines: implications for clonal selection during colorectal carcinogenesis. Oncogene 18(21): 3199–3204. [DOI] [PubMed] [Google Scholar]
  57. Willmann L, Schlimpert M, Halbach S, Erbes T, Stickeler E, Kammerer B (2015) Metabolic profiling of breast cancer: differences in central metabolism between subtypes of breast cancer cell lines. J Chromatogr B Analyt Technol Biomed Life Sci 1000: 95–104. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Appendix

Articles from British Journal of Cancer are provided here courtesy of Cancer Research UK

RESOURCES