Abstract
The evolution of resistance remains one of the primary challenges for modern medicine from infectious diseases to cancers. Many of these resistance-conferring mutations often carry a substantial fitness cost in the absence of treatment. As a result, we would expect these mutants to undergo purifying selection and be rapidly driven to extinction. Nevertheless, pre-existing resistance is frequently observed from drug-resistant malaria to targeted cancer therapies in non-small cell lung cancer (NSCLC) and melanoma. Solutions to this apparent paradox have taken several forms from spatial rescue to simple mutation supply arguments. Recently, in an evolved resistant NSCLC cell line, we found that frequency-dependent ecological interactions between ancestor and resistant mutant ameliorate the cost of resistance in the absence of treatment. Here, we hypothesize that frequency-dependent ecological interactions in general play a major role in the prevalence of pre-existing resistance. We combine numerical simulations with robust analytical approximations to provide a rigorous mathematical framework for studying the effects of frequency-dependent ecological interactions on the evolutionary dynamics of pre-existing resistance. First, we find that ecological interactions significantly expand the parameter regime under which we expect to observe pre-existing resistance. Next, even when positive ecological interactions between mutants and ancestors are rare, these resistant clones provide the primary mode of evolved resistance because even weak positive interaction leads to significantly longer extinction times. We then find that even in the case where mutation supply alone is sufficient to predict pre-existing resistance, frequency-dependent ecological forces still contribute a strong evolutionary pressure that selects for increasingly positive ecological effects (negative frequency-dependent selection). Finally, we genetically engineer several of the most common clinically observed resistance mechanisms to targeted therapies in NSCLC, a treatment notorious for pre-existing resistance. We find that each engineered mutant displays a positive ecological interaction with their ancestor. As a whole, these results suggest that frequency-dependent ecological effects can play a crucial role in shaping the evolutionary dynamics of pre-existing resistance.
Introduction
The rapid, and often inevitable, evolution of therapy resistance is the primary threat to modern medicine’s successful treatment of cancer, and infectious disease (e.g. bacterial, viral, fungal, and parasitic infections)1–5. The story of resistance and treatment failure is strikingly similar across biological kingdoms. A patient is diagnosed and undergoes an initially successful treatment, only for a small resistant subclone of the original disease to relapse, resulting in treatment failure. For decades, the response to this paradigm has been the development of novel, more efficient drugs, targeting orthogonal pathways in hopes of winning the evolutionary arms race. While this response has undeniably resulted in major success stories when considering individual cancers or infections, the overall outlook for drug-resistant disease remains grim6–9.
As a result, growing efforts have been made to study these diseases in an evolutionary context, whereby scientists seek to understand the ecological and evolutionary forces that seem inevitably to result in the untreatable disease state. Understanding these evolutionary forces that lead to resistance, should allow scientists and physicians to not only design more effective drugs, but perhaps more crucially, design more effect treatments. For example, recent work has focused on improving and prolonging the efficacy of our already established drugs via optimal dose scheduling10–12, drug combinations13–17, understanding spatial dynamics18–20, understanding ecological interactions between competing subclones21–24, and exploiting collateral sensitivity25–29.
In a similar spirit, this work seeks to understand the evolutionary fates of potential resistance-conferring mutations that emerge before treatment has occurred. The fraction of these mutants that survive to see treatment are often the primary cause of treatment failure, referred to as “pre-existing resistance”30–33. While these resistant populations provide a large fitness advantage once treatment begins, they often carry a significant fitness disadvantage, or fitness cost , in the absence of treatment34–38. Nevertheless, resistance-conferring mutants often persist until treatment, at which time their treatment-sensitive ancestors are preferentially killed, resulting in the competitive release and relapse of the resistant population and inevitable treatment failure39,40. Understanding how these resistant clones - with a fitness disadvantage - persist in the disease population prior to treatment may allow us to prevent resistance from emerging.
This interest is derived from recent work where we measured the frequency-dependent ecological interaction between an evolved EGFR tyrosine kinase inhibitor (TKI) resistant non-small-cell lung cancer (NSCLC) population and its TKI-sensitive ancestor40. The focus of that work was on the ecological interaction under TKI treatment, and the inevitable competitive release. Strikingly, we observed an interaction between the resistant mutant and its ancestor in the absence of any treatment. The resistant population was observed to grow about twenty percent slower than the ancestor when cultured separately, however when the resistant population was co-cultured with a majority ancestor population, that difference in fitness nearly vanished. This observation, referred to as negative frequency-dependent selection (negative because the fitness for the mutant decreases as the mutant frequency increases), is a long-studied phenomenon41–43, and has been described as the most “intuitively obvious explanation for polymorphisms in nature”44. Despite its long history and potential for potent evolutionary effects, frequency-dependent selection remains understudied in the context of drug resistance. This is especially surprising, because a resistant population typically first emerges as a single individual in a predominantly ancestor population, and as a result frequency-dependent ecological interactions have a profound potential to effect the dynamics of a resistant clone (Fig. 1).
Figure 1. Cartoon abstraction demonstrating how frequency-dependent ecological interactions might increase the likelihood of pre-existing resistance.
(A) Cartoon visualization of a typical frequency-dependent growth experiment. The ancestor (black line) is assumed to grow at a constant rate. Two hypothetical resistant mutants are depicted. Both mutants share the same intrinsic fitness and fitness cost, however the positive ecological mutant (red, growth increases as the fraction of ancestor cells increases) has a significantly higher ecological fitness ) than the negative ecological mutant (blue, growth decreases as the fraction of ancestor cells increases). (B) Top: Cartoon visualization of an evolving population with no ecological interactions. All mutants are assumed to have a non-insignificant fitness cost, , and as a result go extinct. Bottom: The same evolving population, assuming ecological interactions are present. Note that an identical number of mutants emerge, however semi-rare mutants with positive ecological interactions demonstrate an increased time to extinction. As a result, when a drug intervention is administered, pre-existence is much more likely to be present.
In this work, we seek to develop a rigorous theory of pre-treatment evolution that incorporates frequency-dependent ecological interactions between the emerging resistant subclones and the ancestor from which they evolve. Using both a generalized Moran process and Wright-Fisher simulations, we show that mutants with the same intrinsic fitness (monoculture fitness) can have mean extinction times that vary by several orders of magnitude as a function of their ecological fitness (fitness when co-cultured in a predominantly ancestor environment). Next, we calculate the expected number of resistance-conferring mutants in the population as a function of the cost of resistance, as well as the population size, and rate at which resistance-conferring mutations occur. When comparing the result of this calculation both when we assume ecological interactions exist, and when they are forbidden, we identify a wide parameter space where pre-existence is only likely to occur if ecological-interactions are assumed. We then investigate the “many mutant regime” where pre-existence is likely even without ecological interactions, and demonstrate that these ecological interactions would play a prominent role in shaping the distribution of mutants, dramatically increasing the prevalence of mutants with high ecological fitness. Importantly, we show that these ecological effects drive the evolutionary outcomes even when mutants with high ecological fitness are rare. Surprisingly, despite the complexity of the model, we obtain analytical approximations for extinction rates, expected number of resistance-conferring mutants, and the distribution of observed mutants over the full range of ecological fitness. These analytical approximates both support our numerical simulations and allow us to extend our results to population sizes too large to simulate.
Finally, we test our theory experimentally by engineering several of the most common clinically-observed mutations to TKI-therapy in EGFR-driven NSCLC and compete these mutants against the TKI-sensitive ancestor. In all cases we observed an ecological interaction that resulted in mutant ecological fitnesses larger than their intrinsic fitness. Taken together, these theoretical and experimental results argue that frequency-dependent ecological interactions between resistance mutants and their ancestor confer the primary mode by which resistance emerges in modern cancer therapeutics, and potentially all evolutionary diseases.
Results
Ecologically-dependent extinction time distributions with a generalized Moran process
We begin by considering a one-step birth-death process45–47 with states , where is the total population size, is the mutant population, and is the ancestor population. We do not consider mutation and as a result the states (extinction) and (fixation) are absorbing. To account for ecological interactions, the mutant’s growth rate is defined to be a function of , or the fraction of the population that is of the ancestral type, and assumed to be linear. In addition, for simplicity, we define the ancestor’s growth rate to be constant and, without loss of generality, normalized to 1 (see Materials and Methods for full model details). First, we are interested in how the distribution of extinction times differs between recently emerged mutants with identical fitness costs, but distinct ecological interactions (Fig. 2A, left). In particular we assume one (neutral) mutant has no ecological interaction with the ancestor, and thus (Fig. 2A, blue), while the comparative (positive) mutant has an interaction that ameliorates the fitness cost of the mutant at extremely large ancestor fractions, (Fig. 2A, red). Here is the fitness cost or difference between the ancestor’s growth rate and the mutant’s mono-culture growth rate, is the mutant’s intrinsic fitness or mono-culture growth rate, and is the mutant’s ecological fitness or a mutant’s fitness in an otherwise purely ancestor co-culture environment (definitions depicted visually in Fig. 2A, left). In population genetics literature this positive interaction is known as negative frequency-dependent selection and fitness is typically plotted as a function of the mutant’s frequency rather than the ancestor. Throughout this work we have chosen to plot fitness relative to the ancestor’s frequency in order to more intuitively connect a positive ecological interaction with a positive slope in frequency-dependent growth plots. In the case of a mutant with a positive ecological interaction, we see that the extinction time distribution is heavily right-skewed in comparison to a neutral ecological effect. As a result, if these two mutants were equally likely to emerge in a population, we would expect to observe a mutant with a positive ecological interaction significantly more often than an equivalent mutant with a neutral ecological interaction. However, ecological interactions are not always positive. Repeating this process in comparing a neutral mutant with a mutant that has a negative ecological interaction with the ancestor reveals a distinct shift to shorter extinction times (Fig. S1).
Figure 2. Analytical approximations and simulations predict that extinction times depend on ecological interactions.
(A) Closed form extinction time distributions are calculated and visualized for a generalized Moran process . The red distribution results from a mutant with a positive ecological interaction with the ancestor , while the blue population has no ecological interaction with the ancestor . (B) Wright-Fisher simulations are used to numerically calculate the mean extinction time as a function of , 500 generations, is drawn uniformly in is drawn uniformly in [0,1]). (C) Wright-Fisher simulations are repeated for varying values of , and to confirm theoretical prediction that the extinction time distribution depends only on . (D) Phase diagram depicting the three regimes of pre-existing resistance.
Extinction times depend on ecological interactions in a Wright-Fisher model
While formulating our system as a generalized Moran process allows for convenient closed-form solutions to quantities of interest such as extinction time distributions, this representation becomes computationally expensive as the population size approaches increasingly realistic values. In addition, we have completely ignored mutation, as well as more realistic conditions where many mutants are competing within an evolving population. As such, we switch to a Wright-Fisher formulation of our system48–50). In the Wright-Fisher model, populations are still constant in population size , however each individual of the population is replaced every generation with offspring inheriting the parent’s genotype with probability proportional to the parent’s fitness. (Note that in many cases systems with varying populations can be approximately mapped onto a Wright-Fisher model that describes similar evolutionary dynamics, in which case is interpreted as an effective population size51.) In addition, individuals acquire mutations with some probability and we assume mutant populations are sufficiently small that we can ignore both ecological and genetic mutant-mutant interactions. Still, several characteristics from the generalized Moran process remain. Namely, the ancestor’s growth is defined to be constant and normalized to 1, and the mutant growth rate is assumed to vary linearly between and (as a result, a mutant’s growth is fully characterized by these two fitness values along with the fraction of the population that is ancestor).
Each simulation begins with an exclusively ancestor population and with each generation cells mutate with probability . Each mutant that arises has an intrinsic fitness drawn with uniform probability in and a corresponding ecological fitness drawn with uniform probability in [0,1]. Each Wright-Fisher ‘generation’ consists of a mutation step, followed by an offspring/selection step. For each mutant that emerges we record its intrinsic and ecological fitness values and track its evolutionary trajectory, and thus extinction time . A mutant that emerges but does not survive the subsequent selection step is defined to survive 0 generations. Employing this model we find that the mean extinction time varies nearly five orders of magnitude between the most positive (≈10 generations) and deleterious (≈0.001 generations) ecological interactions (Fig. 2B). In order to develop a more rigorous understanding of the evolutionary dynamics, we sought an analytical approximation for the extinction time of a mutant under the same Wright-Fisher conditions. Strikingly, we find a robust approximation across the whole range of :
| (1) |
Despite its simple form, this approximation agrees with simulation results with a typical error of 5% (Fig. 2B, full derivation and details found in the SI). Interestingly, the approximation is only a function of ecological fitness, and not mutation rate (assuming ), population size, or fitness cost. This finding is supported by our simulation results (Fig. 2C). The dependence of solely on in Eq. (1) is due to two factors: i) the small total proportion of mutants in the population, which means the fitness of a mutant is approximately ; ii) at each time step (new generation) the chances of the mutants achieving a population comparable to are vanishingly small, so the extinction probability becomes approximately independent of .
Ecological interactions can increase the probability of pre-existing resistance
Next we consider the model’s implications for pre-existing resistance. Specifically, we are interested in quantifying the expected number of mutants in an evolving population. While it might be tempting to quickly conclude that including ecological interactions will necessarily increase the probability of pre-existing resistance because positive interactions will lead to longer extinction times, it is important to note that mutants with a high intrinsic fitness are more likely to acquire a relatively deleterious ecological fitness, than one that is beneficial. As such, a careful mathematical treatment is required. When the expected number of mutants in a population is low (), potential resistance-conferring mutations are unlikely to be present at time of treatment. Contrarily, when the expected number of mutants is greater than 1, we expect treatment threatening resistance to be present when a drug is administered. We begin adapting our analytical model to calculate the mean number of mutants (see SI for full derivation and details). To begin we consider the case where no ecological interactions are present ( completely describes the growth rate of the mutant). In this case it can be shown that , the mean number of mutants ignoring ecological interactions, is:
| (2) |
Next, we seek to find an analytical approximation for , the mean number of mutants assuming ecological interactions exist. In the case of a sufficiently small mutation rate, we can approximate the total mutant fraction as,
| (3) |
Here, is the maximum value that can take. While we can set arbitrarily close to 1, it can never be exactly 1 for a well-defined normalization. Interestingly, for sufficiently small , the ratio is constant with . While the simplicity of the approximation is appealing, unfortunately it breaks down as gets large. As a result, a more robust, though significantly more complex, approximation was derived (see SI for full derivation):
| (4) |
Here is the Lambert function, which is the solution of the equation . This approximation allows for efficient calculation across several decades of within 10% of our numerical simulations. Employing these analytical approximations we identify three regimes of interest. The least interesting regime is the small regime (Fig. 2D, green). Here the effective population size is insufficient to maintain a mutant subpopulation regardless of the strength or frequency of ecological interactions. This regime corresponds to extremely rare pre-existing resistance and high likelihood of treatment success.
As gets larger (Fig. 2D, yellow) we enter a regime where ecological interactions would suggest pre-existing resistance is likely , while ignoring ecological interactions would suggest pre-existence is still rare . In this regime mutants have yet to become abundant, however, mutants with strong ecological interactions persist sufficiently long to threaten treatment efficacy. Representative simulation trajectories of this “rare mutant regime” are shown in Fig. 3A. Without ecological interactions (Fig. 3A, top panel) the mutation rate alone is insufficient to maintain a mutant subpopulation capable of threatening future treatment efficacy. However, with the introduction of ecological interactions (Fig. 3A, bottom panel and Fig. S2), rare positive ecological mutants climb to significant fractions of the population, and have measurably longer extinction times that may threaten future treatments. As one might intuitively expect, the size of this regime where ecological effects drive pre-existence is heavily dependent on the imposed fitness cost of resistant mutants. We find that the larger the fitness cost imposed by resistance, the larger the comparative increase provided by allowing ecological effects.
Figure 3. Positive ecological interactions make pre-existence more likely and dominate the stationary distribution of mutants.
(A) Representative Wright-Fisher trajectory in the “rare mutant regime”. Black corresponds to the ancestral population. Mutants exist in higher fractions and for longer periods with ecological interactions. Each mutant is colored by its ecological fitness where red represents an value near 1 and blue represents an value near 0. (B) Representative trajectory in the “many mutant regime”. Strong positive ecological interactions dominate the stationary distribution of mutants (visually the mutants appear red, not blue). (C) Left: Stationary distribution of mutant ecological fitnesses when the mutant generating function is uniform across ecological fitness. Right: joint distribution density plot between intrinsic and ecological fitness. (D) Same as C, however the mutant generating function is now Gaussian centered about .
Ecological interactions significantly influence the distribution of mutants
Next, we consider the final regime when is large (Fig. 2D, blue). In this regime the mutational supply is sufficiently large to self-sustain a small, resistant subpopulation, regardless of ecological interactions (that is, both and ). Representative simulation trajectories of this “many mutant regime” are shown in Fig. 3B. At first glance one might assume this regime is uninteresting. In both cases mutants are sufficiently common to threaten future treatments, albeit ecological interactions significantly increase the steady-state fraction of resistant mutants. However, the results become more interesting when we consider the shape of the resistant subpopulation distribution. In each trajectory plot, the color is proportional to the mutants ecological fitness with red representing an ecological fitness near 1 and blue representing an ecological fitness near 0. By inspection it is immediately clear that the most positive ecological mutants are over represented in the mutant population, considering they emerge with equal probability. However, we can do better and extract this relationship explicitly from our simulations (Fig. 3C, left). We find, similar to the impact of ecological effects on extinction times, that the frequency of a mutant spans multiple orders of magnitude as a mutant’s ecological fitness varies from 0 to 1.
Extending our previous analytical work, it is straightforward to show that the stationary distribution of mutant ecological fitnesses goes as,
| (5) |
The above approximation works remarkably well despite the simplicity of its form. From this equation we find that the frequency of a mutant is invariant with respect to fitness cost and population size. This is shown explicitly via numerical simulations and visualization of the joint distribution of fitness cost and ecological fitness (Fig. 2C, right).
Non-uniform ecological distributions show similar qualitative results
An important context to keep in mind with the work is that up to this point we have assumed emerging mutants are assigned an ecological fitness with uniform probability in [0,1]. This assumption was not made for simplicity, but instead out of necessity. While evolutionary biologists have spent significant time both theorizing about, and measuring the distribution of fitness effects (DFE), very little time has been spent quantifying either the frequency or magnitude of ecological effects (which we propose calling the distribution of ecological effects, DEE). As a result, it is difficult to even speculate on what the null model ought to be.
Crucially, the analytical approximations derived herein can be generalized to fit any assumed, or future measured, DEE. While we assumed a uniform distribution, a Gaussian model where the most positive and negative ecological interactions are rare relative to more modest, or non-interacting mutants, may be more accurate. As an example, the general stationary distribution of mutant ecological fitnesses would become,
| (6) |
Here, can be any theorized or measured distribution of ecological effects. Eq. (6) can be roughly interpreted as a balancing of two opposing forces to produce a stationary state: the probability that a new mutant with fitness arises and survives is counterbalanced by the probability that an existing mutant with fitness disappears. As proof of principle, we numerically simulate the distribution of mutant ecological fitnesses under an assumed Gaussian DEE, and show the above analytical approximation still holds. The results are qualitatively similar to the uniform DEE and, strikingly, despite the rarity of mutants with positive ecological interactions, they still manage to dominate the predicted stationary distribution of mutants (Fig. 2D).
Sufficiently large positive ecological interactions result in a stable fixed point between mutant and ancestor
We now briefly consider the regime wherein the ecological fitness of a mutant can sample values greater than 1. Put another way, when the mutant population emerges, it may emerge into an environment where it out-competes its ancestor. Importantly, all emerging mutants still have a nonzero fitness cost relative to the wild-type and therefore as selection increases their frequency, their relative growth advantage becomes a growth disadvantage, preventing a strong selective sweep. Though our earlier analytical approximations do not apply for , the numerical simulations are robust in this regime. We find that the majority of stationary distribution mutants are mutants with ecological fitnesses larger than the ancestor, or (Fig. 4A). This qualitative change in behavior above can be explained in evolutionary game theory terms by a switch in the evolutionary game being played. When , the ancestor out-competes the mutant population at all population frequencies. As a result, it is a question of when, not if, the mutant population will be driven to extinction. When , however, the mutant population out-competes the ancestor at high ancestor frequencies, while the ancestor out-competes the mutant at high mutant frequencies (as a result of the mutant fitness cost). This leads to a stable fixed point at some ancestor frequency where the two populations have an equal growth rate. This is particularly worrying in the case of therapy-resistant mutants, because it suggests if such a mutant emerges and survives the initial stochasticity of drift, it will coexist at a sizeable frequency in the population until treatment.
Figure 4. Positive ecological fitnesses above 1 result in a stable fixed point between mutant and ancestor.
(A) Stationary distribution of mutant ecological fitnesses when the mutant generating function has uniform probability in [0,1.10]. (B) Stationary distribution of mutant intrinsic fitnesses. (C) Joint distribution density plot between intrinsic and ecological fitness reveals the size of the fitness cost now has a significant impact on mutant survival. (D) Cartoon illustration of why two mutants with identical values of can result in different extinction times. Colored arrows point to stabled fixed points between mutant and ancestor.
Next, contrary to our previous results, the fitness cost of the mutant plays an important role in determining the stationary distribution of mutants (Fig. 4B,C). Here we see that only the mutants with the largest positive ecological interactions and smallest fitness costs (intrinsic fitness = 1 − fitness cost) are represented at meaningful frequencies. This result can be explained by the qualitative shift in evolutionary game for mutants where . Previously, regardless of the fitness cost, any mutant with would grow at that ecological fitness, as the mutant population never became a meaningful fraction of the whole population (Fig. 4D, left). However, as hinted at in the numerical simulations, and combine to determine the stable fixed point between the mutant and the ancestor (Fig. 4D, right). As a result, even mutants with are no longer characterized by their ecological fitness, instead they are characterized by their fitness at the frequency determined by the stable fixed point.
Clinically observed lung cancer mutations confer positive ecological interactions
Epidermal growth factor (EGFR) tyrosine kinase inhibitors (TKIs) are the first-line treatment for patients diagnosed with advanced non-small cell lung cancer (NSCLC). While the development of targeted TKIs has importantly extended overall survival times, these drugs are rarely curative39 and patients often recur with TKI-resistant tumors. As a result, EGFR-mutant NSCLC is an ideal system for studying pre-existing resistance and where we would expect to find positive ecological interactions between mutants and their ancestor.
To test our theory we genetically engineered (see Materials and Methods) three of the most commonly clinically observed resistance mutations found in response to TKIs:52,53 BRAF-V600E, KRAS-G12V, and PIK3CA-E545K. Then, using our previously described evolutionary game assay23,40, we measured the ecological interaction between each of these mutants and the ancestor PC9 cell line from which they emerged.
Excitingly, we found that each of the three engineered mutants had positive ecological interactions with their ancestor (Fig. 5, bottom). In addition, these positive ecological interactions are strikingly similar in both magnitude and shape to the measured ecological interaction between the evolved mutant and its ancestor (Fig. 5, top). While we already reported on the ecological interaction between the evolved mutant and its ancestor as it was the motivator of this study40, we performed additional sequencing analysis (WXS and RNA-seq) and identified several common clinical mutations present, distinct from the engineered mutations: MET overexpression, CCND1 amplification, and KRASG12D mutation (Fig. 5, top). Taken together, the engineered and evolved mutants combine to survey approximately 70% of clinically known resistance mechanisms to TKIs in NSCLC, and in all cases we observed large positive ecological interactions ameliorating a sizeable fitness cost of resistance. Next, as a control, we measured the ecological interaction between the engineered BRAF-V600E and KRAS-G12V cell lines in co-culture and did not observe a positive ecological interaction (Fig. S3). This is unsurprising because a key aspect of our model is that positive ecological interactions are enriched during co-evolution and thus should be more likely between ancestor and mutant, not two mutants. These experimental results are harmonious with our theoretical work and strongly support the hypothesis that frequency-dependent ecological interactions can play a critical role in the acquisition of resistance in evolutionary diseases.
Figure 5. Common clinically observed resistance mutations in NSCLC harbor strong postitive ecological interactions with their ancestor in a model system of pre-existing resistance.
Stacked bar chart: Visual representation of the known resistance mechanisms to Osimertinib, a third generation TKI and the current standard of care for EGFR-positive NSCLC. Mutation frequencies and categorical definitions from Leonetti et al54. Top: Evolved gefitinib-resistant NSCLC PC9 mutant (previously reported40) exhibits a positive ecological interaction with its ancestor. Fresh sequencing analysis identifies clinically observed resistant mutations including: KRASG12D, MET amplification, and CCND1 amplification (cell cycle gene). Bottom: Measured positive ecological interactions between engineered resistant mutants (KRAS-G12V top-left, PIK3CA-E545K top-right, BRAF-V600F (bottom) and their ancestor.
Discussion
While much work has gone into quantifying clinically problematic resistant bacteria, cancers and viruses, we nearly always characterize these clones in monoculture - entirely outside the eco-evolutionary forces that selected for (or against) them in the first place. In this work we set out to provide the foundation for a rigorous and generalizable mathematical framework that incorporates frequency-dependent ecological interactions and can be used to study their role in pre-existing resistance. This work both compliments and builds off of recent studies from a wide range of disciplines ranging from theoretical population genetics and ecology to clinical trials across several biological kingdoms. We demonstrate that the presence of ecological interactions can significantly increase the probability of pre-existing resistance, in addition to shaping the distribution of mutants likely to be present before treatment. We derive analytical approximations of several quantities of interest including extinction time, mean mutant population numbers, and the underlying distribution of mutants each as a function of ecological fitness. Importantly, these results can easily be generalized to any theorized distribution of ecological effects, or future experimentally measured distribution. As an important example, we show that even when we assume positive ecological interactions are rare, they still end up as a plurality of the stationary mutant frequency distribution. Finally, in an model system for pre-existing resistance, we show common clinically observed mutants harbor positive, frequency-dependent ecological interactions when co-cultured with their ancestor, providing strong evidence for our theory in cancer. In addition, recent exciting work in bacteria provides additional evidence, as frequency-dependent interactions resulted in maintenance of otherwise costly antibiotic resistant populations in Escherichia coli55 and Pseudomonas aeruginosa56.
It is also important to address several limitations of our work. As we mentioned earlier, the distribution of ecological effects (DEE) has never been experimentally measured. As a result, assumptions regarding the distributional parameters have to made in order to calculate meaningful quantities of interest. While we did our best to combat this by developing analytical models that are agnostic to this distribution, the quantitative aspect of our results are subject to the specifics of a model. Our hope is that the analytical and numerical results herein, when combined with the promising experimental work in NSCLC, motivate future measurements of the DEE across diverse model systems. Similarly, our own experimental validation is constrained to one subsystem. Our predictions are broad and should apply to many evolving populations where pre-existence is evolved. Therefore it is important that future studies should should aim to test these theories not just in other cancers, but in other organisms from HIV to drug resistant bacteria. It is possible that these principles provide the most explanatory power in cancer and bacteria where it is common to find highly dense heterogeneous populations in contrast to viruses, for example. Finally, while the model aims to generally capture major evolutionary forces that may underlie pre-existing resistance, it is still an abstraction of a much more complex clinical scenario where the immune system, spatial dynamics, and treatment adherence, to name only a few, can play major roles.
Materials and Methods
Cell culture
PC-9 are human adenocarcinoma cells derived from undifferentiated lung tissue were obtained from Sigma (Sigma, USA). PC-9 cells were cultured in RPMI-1640 medium supplemented with 10% heat inactivated fetal bovine serum (FBS) and 1% penicillin streptomycin solution at 37°C with humidity containing 5% CO2. Cells were split every four days to maintain optimum confluency of .
Engineering of mutant cell lines
To establish PC-9 cells stably expressing target genes, HEK-293T cells were co-transfected using TransIT-Lenti transfection reagent (Mirus, USA), with 500ng psPAX2 (addgene, USA), 100ng PMD2 (addgene, USA) and 500ng of target genes. Viral particles were collected after 48hrs and used to transduce PC-9 cells. Then, to establish ancestor PC-9 cells stably expressing nuclear localized GFP, cells were transduced with pLVX-eGFP-Hygro (Vectorbuild, USA). In addition, to establish query cells expressing fluorescently labeled PC-9 cells with a gene of interest cells were co-transduced with pLVX-mCherry-Hygro or pLVX-mCherry-Puro and each of pLVX-PIK3CA-E545K-Bsd (Vectorbuild, USA). Next, 72hrs after transduction, cells were selected with hygromycin, puromycin and blasticidin.
Drug sensitivity assay
Cells were harvested at 70–80% confluence, stained with trypan blue (Corning, USA), and counted with a TC20 Automated Cell Counter (Bio-Rad, USA). Luminescent based cell viability assays using CellTiter-Glo (CTG) reagent (Promega, USA) were performed in 96 well plate (Corning, USA). A total number of 3,000 cells were plated in of complete medium per well in three replicate per drug concentration with Multidrop reagent dispenser (Thermo Fishers, USA). After 3hrs of incubation, of gefitinib, osimertinib and erlotinib (Cayman, USA) diluted in complete RPMI-1640 medium were added to the cells. Compounds were tested in a threefold dilution in a range of and for gefitinib, osimertinib and erlotinib respectively. After 72hrs of incubation, CTG reagent was add to the cells; incubated for 10 minutes at room temperature and luminescence was measured.
Game assay
PC-9 mutants stably expressing nuclear localized fluorescent signal ancestor PC-9 stably expressing nuclear localized GFP were co-cultured at different initial proportion of ancestor cells at a density of 1,500 cells in of fresh medium. After 3hrs of incubation, of DMSO diluted in complete RPMI-1640 medium (final DMSO concentration of ) were added to the cells in three replicates per initial proportion. Then time-lapse microscopy images were obtained for GFP and mCherry using BioSpa automated incubator (BioTek, USA) every 4 hours over the course of 96hrs. Then, images were processed with the open-source software CellProfiler57. Images were background subtracted, converted to 8-bit, contrast enhanced, and thresholded, then raw cell numbers were extracted.
Generalized Moran model details
In this work we consider a well known generalized Moran process, a model previously used to study frequency dependent evolutionary dynamics45–47. Briefly, we consider a one-step birth-death process with states and characterized by birth and death rates and with . As a result this model describes a fixed population size , with resistant mutants and ancestor population. We forgo mutation rate and as a consequence the states and are absorbing. We consider an evolutionary game with a 2×2 payoff matrix such that:
As a result, we can write the expected payoffs in a population of mutants and ancestor individuals as:
RNA-Seq
Paired-end reads are preprocessed using fastp to trim and quality filter the reads. Following the filtering, reads are aligned to GRCh38 reference genome via STAR aligner. Read quantification is done using Salmon on the extracted transcriptome locations from spliced STAR alignment. Gene-level abundance are aggregated from bootstrapped transcript abundances using R package tximport. The Arriba tool is coupled with spliced alignments for fusion-transcript detection as well. Pathway level expression activities are quantified using R package GSVA and msigdbR for the Hallmark Pathways. The R package ComplexHeatmap was used to generate heatmaps.
Whole exome sequencing
Paired-end whole-exome reads of ancestor and parental lines were preprocessed using fastp similar to RNA-Seq. Alignment to GATK (GATK best practices bucket) version of GRCh38 reference is done using bwa-mem aligner. Following the alignment, variant calling pipeline according to the GATK workflow including duplicate marking and variant calling via HaplotypeCaller was conducted. Variants passing filtering based on hard-filtering are further annotated using Variant Effect Predictor (VEP) tool. Exome alignments are further input to CNVkit for copy-number alterations. Using a flat-reference for bias correction log2 scaled abundances are generated for ancestor and resistant strains. Copy number segments are captured using circual binary segmentation and assigned to genes mapping to the segment.
Supplementary Material
Acknowledgments
This work was made possible by the National Institute of Health T32CA094186 (JAM), 5R37CA244613-03 (JGS), 5T32GM00725046 (JGS), The Research Council of Norway 325628/IAR (DST), and American Cancer Society RSG-20-096-01 (JGS).
Footnotes
Code availability
Code used in this study will be made openly available on GitHub.
References
- 1.Fisher M. C. et al. Tackling the emerging threat of antifungal resistance to human health. Nat. Rev. Microbiol. 20, 557–571 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Goldberg D. E., Siliciano R. F. & Jacobs W. R. Outwitting evolution: fighting drug-resistant tb, malaria, and hiv. Cell 148, 1271–1283 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Darby E. M. et al. Molecular mechanisms of antibiotic resistance revisited. Nat. Rev. Microbiol. 1–16 (2022). [DOI] [PubMed] [Google Scholar]
- 4.Pennings P. S. Hiv drug resistance: problems and perspectives. Infect. disease reports 5 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Menard D. & Dondorp A. Antimalarial drug resistance: a threat to malaria elimination. Cold Spring Harb. perspectives medicine 7, a025619 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Sandru A., Voinea S., Panaitescu E. & Blidaru A. Survival rates of patients with metastatic malignant melanoma. J. medicine life 7, 572 (2014). [PMC free article] [PubMed] [Google Scholar]
- 7.Boucher H. W. et al. Bad bugs, no drugs: no eskape! an update from the infectious diseases society of america. Clin. infectious diseases 48, 1–12 (2009). [DOI] [PubMed] [Google Scholar]
- 8.Organization, W. H. Antimicrobial resistance: global report on surveillance (World Health Organization, 2014). [Google Scholar]
- 9.Arnold M. et al. Progress in cancer survival, mortality, and incidence in seven high-income countries 1995–2014 (icbp survmark-2): a population-based study. The Lancet Oncol. 20, 1493–1505 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Maltas J., Singleton K. R., Wood K. C. & Wood K. B. Drug dependence in cancer is exploitable by optimally constructed treatment holidays. bioRxiv DOI: 10.1101/2022.07.01.498458 (2022). https://www.biorxiv.org/content/early/2022/07/03/2022.07.01.498458.full.pdf. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Iram S. et al. Controlling the speed and trajectory of evolution with counterdiabatic driving. Nat. Phys. 17, 135–142 (2021). [Google Scholar]
- 12.Gatenby R. A., Silva A. S., Gillies R. J. & Frieden B. R. Adaptive therapy. Cancer research 69, 4894–4903 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Feder A. F., Harper K. N., Brumme C. J. & Pennings P. S. Understanding patterns of hiv multi-drug resistance through models of temporal and spatial drug heterogeneity. eLife 10, e69032, DOI: 10.7554/eLife.69032 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Anderson G. R. et al. A landscape of therapeutic cooperativity in kras mutant cancers reveals principles for controlling tumor evolution. Cell reports 20, 999–1015 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Hegreness M., Shoresh N., Damian D., Hartl D. & Kishony R. Accelerated evolution of resistance in multidrug environments. Proc. Natl. Acad. Sci. 105, 13977–13981 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Torella J. P., Chait R. & Kishony R. Optimal drug synergy in antimicrobial treatments. PLoS computational biology 6, e1000796 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Dean Z., Maltas J. & Wood K. B. Antibiotic interactions shape short-term evolution of resistance in e. faecalis. PLoS pathogens 16, e1008278 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Fu F., Nowak M. A. & Bonhoeffer S. Spatial heterogeneity in drug concentrations can facilitate the emergence of resistance to cancer therapy. PLoS Comput. Biol 11, e1004142 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Zhang Q. et al. Acceleration of emergence of bacterial antibiotic resistance in connected microenvironments. Sci. 333, 1764–1767 (2011). [DOI] [PubMed] [Google Scholar]
- 20.De Jong M. G. & Wood K. B. Tuning spatial profiles of selection pressure to modulate the evolution of drug resistance. Phys. review letters 120, 238102 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Korolev K. S., Xavier J. B. & Gore J. Turning ecology and evolution against cancer. Nat. Rev. Cancer 14, 371–380 (2014). [DOI] [PubMed] [Google Scholar]
- 22.Wargo A. R., Huijben S., De Roode J. C., Shepherd J. & Read A. F. Competitive release and facilitation of drug-resistant parasites after therapeutic chemotherapy in a rodent malaria model. Proc. Natl. Acad. Sci. 104, 19914–19919 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Kaznatcheev A., Peacock J., Basanta D., Marusyk A. & Scott J. G. Fibroblasts and alectinib switch the evolutionary games played by non-small cell lung cancer. Nat. ecology & evolution 3, 450–456 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Good B. H., McDonald M. J., Barrick J. E., Lenski R. E. & Desai M. M. The dynamics of molecular evolution over 60,000 generations. Nat. 551, 45–50 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Maltas J. & Wood K. B. Pervasive and diverse collateral sensitivity profiles inform optimal strategies to limit antibiotic resistance. PLoS biology 17, e3000515 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Maltas J. & Wood K. B. Dynamic collateral sensitivity profiles highlight challenges and opportunities for optimizing antibiotic sequences. bioRxiv DOI: 10.1101/2021.12.19.473361 (2021). https://www.biorxiv.org/content/early/2021/12/21/2021.12.19.473361.full.pdf. [DOI] [Google Scholar]
- 27.Scarborough J. A. et al. Identifying states of collateral sensitivity during the evolution of therapeutic resistance in ewing’s sarcoma. Iscience 23, 101293 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Maltas J., McNally D. M. & Wood K. B. Evolution in alternating environments with tunable interlandscape correlations. Evol. 75, 10–24 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Zhao B. et al. Exploiting temporal collateral sensitivity in tumor clonal evolution. Cell 165, 234–246 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Lin J. J. & Shaw A. T. Resisting resistance: targeted therapies in lung cancer. Trends cancer 2, 350–364 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Boshuizen J. et al. Reversal of pre-existing ngfr-driven tumor and immune therapy resistance. Nat. communications 11, 3946 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Jangir P. K. et al. Pre-existing chromosomal polymorphisms in pathogenic E. coli potentiate the evolution of resistance to a last-resort antibiotic. eLife 11, e78834, DOI: 10.7554/eLife.78834 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Robinson M., Tian Y., Delaney IV W. E. & Greenstein A. E. Preexisting drug-resistance mutations reveal unique barriers to resistance for distinct antivirals. Proc. Natl. Acad. Sci. 108, 10290–10295 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Szakacs G. et al. Targeting the achilles heel of multidrug-resistant cancer by exploiting the fitness cost of resistance. Chem. reviews 114, 5753–5774 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Sampah M. E. S., Shen L., Jilek B. L. & Siliciano R. F. Dose-response curve slope is a missing dimension in the analysis of hiv-1 drug resistance. Proc. Natl. Acad. Sci. 108, 7613–7618 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Melnyk A. H., Wong A. & Kassen R. The fitness costs of antibiotic resistance mutations. Evol. applications 8, 273–283 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Weinreich D. M., Delaney N. F., DePristo M. A. & Hartl D. L. Darwinian evolution can follow only very few mutational paths to fitter proteins. science 312, 111–114 (2006). [DOI] [PubMed] [Google Scholar]
- 38.Kühnert D. et al. Quantifying the fitness cost of hiv-1 drug resistance mutations through phylodynamics. PLoS pathogens 14, e1006895 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Hayashi H. et al. Overall treatment strategy for patients with metastatic nsclc with activating egfr mutations. Clin. Lung Cancer 23, e69–e82 (2022). [DOI] [PubMed] [Google Scholar]
- 40.Farrokhian N. et al. Measuring competitive exclusion in non–small cell lung cancer. Sci. Adv. 8, eabm7212, DOI: 10.1126/sciadv.abm7212 (2022). https://www.science.org/doi/pdf/10.1126/sciadv.abm7212. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Ayala F. J. & Campbell C. A. Frequency-dependent selection. Annu. review Ecol. systematics 5, 115–138 (1974). [Google Scholar]
- 42.Clarke B. C., Shelton P. & Mani G. Frequency-dependent selection, metrical characters and molecular evolution. Philos. Transactions Royal Soc. Lond. B, Biol. Sci. 319, 631–640 (1988). [DOI] [PubMed] [Google Scholar]
- 43.Levin B. Frequency-dependent selection in bacterial populations. Philos. Transactions Royal Soc. Lond. B, Biol. Sci. 319, 459–472 (1988). [DOI] [PubMed] [Google Scholar]
- 44.Trotter M. V. & Spencer H. G. Frequency-dependent selection and the maintenance of genetic variation: exploring the parameter space of the multiallelic pairwise interaction model. Genet. 176, 1729–1740 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Traulsen A., Claussen J. C. & Hauert C. Coevolutionary dynamics in large, but finite populations. Phys. Rev. E 74, 011901 (2006). [DOI] [PubMed] [Google Scholar]
- 46.Ashcroft P., Traulsen A. & Galla T. When the mean is not enough: Calculating fixation time distributions in birth-death processes. Phys. Rev. E 92, 042154, DOI: 10.1103/PhysRevE.92.042154 (2015). [DOI] [PubMed] [Google Scholar]
- 47.Nowak M. A., Sasaki A., Taylor C. & Fudenberg D. Emergence of cooperation and evolutionary stability in finite populations. Nat. 428, 646–650 (2004). [DOI] [PubMed] [Google Scholar]
- 48.Imhof L. A. & Nowak M. A. Evolutionary game dynamics in a wright-fisher process. J. mathematical biology 52, 667–681 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Wright S. Evolution in mendelian populations. Genet. 16, 97 (1931). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Fisher R. A. Xxi.-on the dominance ratio. Proc. royal society Edinb. 42, 321–341 (1923). [Google Scholar]
- 51.Sjodin P., Kaj I., Krone S., Lascoux M. & Nordborg M. On the meaning and existence of an effective population size. Genet. 169, 1061–1070 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Chmielecki J. et al. Candidate mechanisms of acquired resistance to first-line osimertinib in egfr-mutated advanced non-small cell lung cancer. Nat. Commun. 14, 1070 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Camidge D. R., Pao W. & Sequist L. V. Acquired resistance to tkis in solid tumours: learning from lung cancer. Nat. reviews Clin. oncology 11, 473–481 (2014). [DOI] [PubMed] [Google Scholar]
- 54.Leonetti A. et al. Resistance mechanisms to osimertinib in egfr-mutated non-small cell lung cancer. Br. journal cancer 121, 725–737 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Leale A. M. & Kassen R. The emergence, maintenance, and demise of diversity in a spatially variable antibiotic regime. Evol. Lett. 2, 134–143 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Dimitriu T. et al. Negative frequency dependent selection on plasmid carriage and low fitness costs maintain extended spectrum β-lactamases in escherichia coli. Sci. reports 9, 17211 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Stirling D. R. et al. Cellprofiler 4: improvements in speed, utility and usability. BMC bioinformatics 22, 1–11 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Neuts M. F. Matrix-geometric solutions in stochastic models: an algorithmic approach (Dover, 1994). [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.





