Abstract
We study evolution driven by spatial heterogeneity in a stochastic model of source-sink ecologies. A sink is a habitat where mortality exceeds reproduction so that a local population persists only due to immigration from a source. Immigrants can, however, adapt to conditions in the sink by mutation. To characterize the adaptation rate, we derive expressions for the first arrival time of adapted mutants. The joint effects of migration, mutation, birth and death result in two distinct parameter regimes. These results may pertain to the rapid evolution of drug-resistant pathogens and insects.
Biological evolution and ecology are intimately linked, because the reproductive success or “fitness” of an organism depends crucially on its ecosystem. The biotic and abiotic factors defining an ecosystem often have complex time- and space-dependent dynamics [1]. Yet, most models of evolution describe homogeneous, fixed-size populations subjected to a constant selection pressure [2]. Even though such minimal descriptions have lead to invaluable insights into some of the major evolutionary forces [2], their scope is clearly limited. An important challenge in evolutionary biology is therefore to understand the interaction between ecology and evolution.
In this Letter, we study how spatial heterogeneities can drive evolution in ecologies displaying so-called source–sink dynamics (SSD) [3-6]. A species’ habitat is called a sink if the local mortality exceeds the local reproduction so that a population can persist only due to continuous immigration from a source habitat, where reproduction exceeds mortality. Sinks often occur at the border of a species’ range: if an environmental variable such as temperature or humidity varies in space and limits a species’ range, the conditions at the border are often poor. SSD results in a sustained presence of poorly adapted immigrants in the sink; this suggests that it could assist adaptation to the sink conditions. Immigrants that, due to mutations, acquire the ability to reproduce efficiently in the sink have an opportunity to establish a population there. Such a mutant can be successful even if it does not have a competitive advantage in its original habitat (the source); it is sufficient if the mutation allows it to colonize the sink. Importantly, this adaptive process driven by the opportunity to establish a new niche is qualitatively different from the conventional notion of a population climbing a fitness gradient [7, 8].
Evolution in source-sink systems is not merely of theoretical interest. For instance, SSD may accelerate the evolution of drug resistance in bacteria [9]. In humans or livestock treated with antibiotics, drug levels can vary between different organs, creating sources and sinks within a host [10]. Also, SSD can emerge when bacteria migrate between treated and untreated individuals [9]. Likewise, the migration of fruit flies between plantations using different (amounts of) insecticides could assist the emergence of insecticide resistance [11, 12]. The evolution of virulence has been associated with SSD as well [13-15].
Here we analyze a minimal stochastic model of adaptation driven by SSD. In particular, we study how the rate of adaptation depends on parameters such as migration and mutation rates. Earlier studies examined source–sink systems from various perspectives. Although the models proposed differ from each other in various respects, they can be divided into three classes. First, several models are based on deterministic population dynamics [16-21]. Such models are convenient but ignore the intrinsic stochasticity of the demographic processes, which, as we will see below, can be important. Second, some studies employ the formalism of quantitative genetics [22, 23]. This type of models describes the response of a quantitative trait to selection in a population characterized by a distribution of phenotypes. This line of attack is appropriate only for multi-locus traits and is again deterministic. Third, individual-based simulations have been used [24]. Unfortunately, such studies are necessarily limited to a narrow set of parameters. In contrast, our model is fully stochastic and yields analytical results that are valid for a wide range of parameter values.
We consider a haploid population in an environment consisting of two “patches” (Fig. 1(a)). Individuals migrate between the patches at a rate ν and die at a rate δ. Each organism has one of two possible genotypes, called “wild-type” (w) and “mutant” (m). Mutations turn a wild-type into a mutant at a rate μf; the reverse occurs at a rate μb. The reproduction rate of genotype g ∈ {W, M} in patch i obeys the logistic form γig(Ni) ≡ max {rig(1 − Ni/K), 0}, where Ni is the population size in patch i. The logistic form keeps Ni finite and introduces competition between organisms sharing a patch. K is called the carrying capacity and rig is the maximal reproduction rate of type g in patch i. The wild-type can reproduce in patch 1 but not in patch 2 (r2w = 0); this introduces SSD. By contrast, mutants can reproduce in both patches. We initially choose r1w = r1M ≡ r, so that both types are equally “fit” in patch 1. Later we will relax this to consider the effect of a possible fitness cost conferred by the mutation.
FIG. 1.

Source-sink model. We consider 2 patches and 2 genotypes. The wild-type can reproduce only in patch 1 while the mutant can grow in both patches. (a) Organisms mutate and migrate (indicated by arrows) and die at rate δ. (b) A wild-type population in patch 1 will in time give rise to a mutant in patch 2. Arrows indicate the two competing pathways.
We make a few biologically motivated assumptions. First, we limit our analysis to large populations (K >> 1). Second, we assume that mutation rates are low (μb, μf << ν, δ). Third, we assume that ν < δ, i.e., organisms are unlikely to migrate multiple times within their lifetime.
The adaptation process consists of two parts. First a mutant has to appear in the sink; next this mutant has to establish a population. The latter process has been treated by (stochastic) models of colony growth [18, 19, 25]. We therefore focus on the question how long it takes before the first mutant arrives in patch 2 starting from a wild-type population in patch 1.
Fig. 1(b) shows the two pathways that can generate a mutant in patch 2. In the upper path (denoted by ↱) first a mutation occurs in patch 1 and later a mutant migrates to patch 2. In the lower path, ⬏, a wild-type first migrates to patch 2 and then mutates. A priori it is unclear which path is more likely. Below, we derive the first arrival time (FAT) distributions for both paths.
We start with path ↱. Let nig be the number of organisms with genotype g in patch i. Since K >> 1, N1 ≡ n1w + n1M is approximately constant; it fluctuates around the value N ≡ [1-(δ + ν)/r]K for which the reproduction rate equals the rate at which organisms disappear from the first patch (γ1m(N) = δ + ν) [26]. Below, we assume that N1 = N; this allows us to study path ↱ in isolation. We write a Master Equation for the probability P↱(n, t) that at time t no mutant has yet migrated to patch 2 and n1M has value n. It reads
| (1) |
with w− ≡ (δ + μb)n, w+(n) ≡ γ1M(N)n + μf(N − n), u(n) = νn and initial condition P↱(n, 0) = δn,0 (δn,m is the Kronecker delta function).
We rewrite Eq. 1 in terms of the generating function and solve the resulting partial differential equation. The probability that at time t no mutant has migrated yet, S↱(t) = G↱(1, t), follows as
| (2) |
with a↱ ≡ μfN/(δ + ν − μf), b↱ ≡ (μb + μf)/2, and . Now the desired FAT probability density F↱ (t) ≡ −dS↱(t)/dt can be expressed as
| (3) |
where 〈n1M(t)〉 is the mean value of n1M at time t given that no mutant has migrated yet; it obeys
| (4) |
These results show that path ↱ is governed by two time scales. First, 〈n1M(t)〉 builds up in a time scale τ↱ = 1/2c↱. Second, at large times F↱(t) decays with time scale due to the time dependence of S↱(t); this time scale reflects the total migration rate after 〈n1M〉 has equilibrated. We define .
We note that b↱ << c↱; thus κ↱ ≈ μfN/[2(δ + ν)] and the mean first arrival time (MFAT) T↱ is given by
| (5) |
where Γ(x) is the Gamma function. Interestingly, only two lumped parameters remain: c↱ and κ↱. If κ↱ << 1 (i.e., μfN << δ + ν) the process is limited by the generation of mutants whereas if κ↱ >> 1 (i.e., μfN >> δ + ν) it is migration limited. By expanding Eq. 5 and using Stirling’s approximation we obtain in these limits:
| (6) |
The plot in Fig. 2(a) clearly reveals these two regimes.
FIG. 2.

Mean first arrival time (MFAT) vs. parameters, for (a) path ↱ and (b) path ⬏. In each case the MFAT depends on only two lumped parameters: ci and κi. In both plots clearly two regimes can be discerned as a function of κi.
We now turn to the lower path, ⬏. As we expect the number of mutants in patch 1 to be small we assume that n1w ≈ N. A Master Equation can then be written for the probability P⬏(n, t) that at time t no mutation has yet occurred in patch 2 and n2w = n. It is identical to Eq. 1 but with w−(n) ≡ (δ + ν)n, w+(n) ≡ νN, and u(n) ≡ μfn. The probability S↱(t) that no mutation occurs before time t and the FAT distribution F⬏(t) can now be obtained by a similar derivation as for the ↱ case.
Again the process has two time scales: τ⬏ = 1/c⬏, with c⬏ ≡ δ + ν + μf, and . We define ; the MFAT of path ⬏, called T⬏, then reads
| (7) |
where γ(x, y) is the lower incomplete Gamma function. Fig. 2(b) is a plot of Eq. 7; it is again characterized by two regimes. Indeed, in the limits κ⬏ << 1 and κ⬏ >> 1,
| (8) |
We are now in the position to calculate the full FAT distribution F(t) taking into account both paths ↱ and ⬏. Since both paths are nearly independent the probability that neither path has completed at time t is S(t) ≡ S↱(t)S⬏(t), from which F(t) and the combined MFAT T follow. However, comparing Eq. 6 to Eq. 8 we recognize that T↱ < T⬏ unless κ⬏ >> 1, in which case T↱ ≈ T⬏. Therefore we conclude that path is dominant and we should expect F(t) ≈ F↱(t) and T ≈ T↱.
We tested these results using kinetic Monte Carlo simulations. Fig. 3(a) shows T as a function of K and μf, and Fig. 3(b) plots T versus ν for various δ. The curves are theoretical predictions calculated by numerical integration of S(t). Fig. 3(c) shows all data of Figs 3(a) and (b) in a single scaling plot. Since T ≈ T↱, Eq. 5 predicts that all points should collapse on one curve if c↱T is plotted against κ↱. This is indeed the case.
FIG. 3.
Mean first arrival time T as a function of parameters. Unless specified, parameters are: r = 1, δ = 10−1, ν = 10−3, μb = 10−4, μb = 10−7, K = 105. (a) T as a function of the carrying capacity K, for various mutation rates μf. Data points are averages over 104 simulations; lines are corresponding theoretical predictions. (b) T versus migration rate ν, for various death rates δ. (c) As predicted by the theory all data points from Figs (a) and (b) collapse on a single curve after rescaling. The plot shows two regimes, for κ↱ << 1 and κ↱ >> 1, corresponding to mutation- or migration-limited dynamics.
So far we assumed that the wild-type cannot reproduce at all in patch 2. One may expect that path ⬏ could become more relevant if it does, albeit at a reduced rate r2w < r. We now derive results for this extended model. In the derivations for path ↱ the rate r2w plays no role, but the results for path ⬏ do change. The dynamics of n2w can be approximated by the rate equation
With initial condition n2w(0) = 0 it is solved by
| (9) |
with b⬏ ≡ (δ + ν − r2w)/2 and . The expression for S⬏(t) has the same form as Eq. 2, but with a⬏ ≡ μfK/r2w, and we obtain
| (10) |
Again we find two time scales: τ⬏ = 1/2c⬏ and . If (i.e., if μfK << r2w or c⬏ ≈ b↱) the first time scale can be ignored and T⬏ ≈ τ↱.
In Fig. 4 we test this result with simulations. Evidently, as long as r2w << δ + ν the MFAT is insensitive to changes in r2w. Only when r2w approaches δ + ν, T⬏ decreases rapidly. Indeed, if r2w = δ + ν we obtain T↱ ≈ T⬏, which means that paths ↱ and ⬏ each contribute about equally to the MFAT T. We therefore conclude that in source–sink systems, where r2w < δ + ν (i.e., to the left of the vertical line in Fig. 4) a nonzero r2w can result in at most a two-fold increase in the rate of adaptation.
FIG. 4.

T vs. reproduction rate of wild-type in the sink, r2w. As long as r2w ≈ δ + ν, T is insensitive to r2w because path ⬏ is dominant. When r2w ≈ δ + ν, T↱ ≈ T⬏ and the process speeds up about two-fold. If r2w ≈ δ + ν patch 2 is no longer a sink. Also shown is the mean time before n2m = 500. Clearly the growth of the mutant colony slows down dramatically as r2w approaches r. Parameters used: r = 1, δ = 10−1, ν = 10−3, K = 105, μf = 10−7, μb = 10−4.
For r2w > δ + ν, although the FAT of a mutant in patch 2 is much reduced (see the red line to the right of the vertical line in Fig. 4), it is more difficult for the mutant to conquer patch 2 because it has to compete with the wild-type. To demonstrate this, Fig. 4 also shows the mean waiting time before n2M = 500, obtained from simulations. Clearly, as r2w → r the competition dramatically slows down the growth of the mutant population.
Finally, we extend the model to include a fitness cost s of adaptation, i.e., we consider r1M = (1 − s)r1w. This means that, in Eq. 1, γ1M(N) = (δ + ν)(1-s). S↱ and F↱ are still given by Eqs 2-4, but with modified constants a↱ ≡ μfN/[(δ + ν)(1−s)−μf], b↱ ≡ [μb + μf + s(δ + ν)]/2, and . It follows that, in the parameter regime considered (μb, μf << δ,ν, and δ ≳ ν), s becomes important only when . For example, even at a low migration rate ν/δ = 10−2 (that is, 99% of the organisms never migrate in their lifetime) the fitness cost is noticable only if s ≳ 0.1. In population genetics this is considered a very large fitness pressure [2]. To conclude, the adaptation is not slowed down by a fitness cost conferred by the mutation unless the cost is very large. This apparently surprising result may be understood by noting that the mutant is initially very rare in the source so that a weak selection pressure is overwhelmed the demographic noise [27]. This result underscores the importance of stochasticity in the process.
To summarize, we formulated a stochastic model for adaptation in source-sink ecologies. In contrast with most traditional models of evolution, in which adaptation results from competition within a single well-stirred population, here adaptation is driven by spatial heterogeneity. To characterize the speed of adaptation, we derived analytical results for the arrival time of the first adapted mutant in the sink. The behavior of the system is found to have two qualitatively distinct regimes in which the system is either mutation-limited (μfN << δ + ν) or migration-limited (μfN >> δ + ν). In the latter regime the mean FAT does not scale as (μfN)−1, as one might naively expect, but as (μfN)−1/2. Because real beneficial mutation rates and population sizes vary considerably (mutation rates in the range 10−5–10−10 per generation and population sizes 104–1010 are reasonable), both regimes can be relevant [12]. Furthermore the results demonstrate that the first adapted mutant found in the sink usually originates as a neutral mutation in the source which by chance migrates to the sink (the so-called “Dykhuizen–Hartl effect” [28, 29]). Mutations arising in the sink contribute only if the system is migration-limited (κ⬏ >> 1), or if r2w ≈ δ + ν (when patch 2 can hardly be called a sink). Strikingly, these results hold even if the mutation is mildly deleterious in the source habitat.
Many variations on the current model can be envisioned. For instance, we assumed that the reproduction rate in patch 2 depended on the genotype while the death rate was assumed constant. Yet, in reality both rates could vary in space. Also, the model can be extended to include more than two patches, which would presumably allow for a stepwise adaptation to an environmental gradient. In future work such variations and extensions could be explored within the formalism presented here.
Acknowledgments
This work was supported by the Center for Theoretical Biological Physics sponsored by the National Science Foundation (NSF) (Grant PHY-0822283).
Footnotes
PACS numbers: 87.23.Cc, 87.23.Kg, 87.10.Mn
References
- [1].May RM, McLean AR. Theoretical ecology: principles and applications. Oxford University Press; Oxford: 2007. [Google Scholar]
- [2].Hartl DL. A primer of population genetics. 3rd ed. Sinauer Associates; Sunderland, Mass.: 2000. [Google Scholar]
- [3].Holt RD. Theor Popul Biol. 1985;28:181. [Google Scholar]
- [4].Pulliam HR. Am Nat. 1988;132:652. [Google Scholar]
- [5].Pulliam HR, Danielson BJ. Am Nat. 1991;137:S50. [Google Scholar]
- [6].Watkinson AR, Sutherland WJ. J Anim Ecol. 1995;64:126. doi: 10.1111/j.1365-2656.2006.01121.x. [DOI] [PubMed] [Google Scholar]
- [7].Wright S. Proceedings of the Sixth International Congress on Genetics. 1932:355. [Google Scholar]
- [8].Poelwijk FJ, et al. Nature. 2007;445:383. doi: 10.1038/nature05451. [DOI] [PubMed] [Google Scholar]
- [9].Perron GG, Gonzalez A, Buckling A. Proc Biol Sci. 2007;274:2351. doi: 10.1098/rspb.2007.0640. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [10].Baquero F, Negri MC. Bioessays. 1997;19:731. doi: 10.1002/bies.950190814. [DOI] [PubMed] [Google Scholar]
- [11].Caprio MA, Tabashnik BE. J Econ Entomol. 1992;85:611. [Google Scholar]
- [12].Karasov T, Messer PW, Petrov DA. PLoS Genet. 2010:6. doi: 10.1371/journal.pgen.1000924. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [13].Sokurenko EV, et al. Nat Rev Microbiol. 2006;4:548. doi: 10.1038/nrmicro1446. [DOI] [PubMed] [Google Scholar]
- [14].Chattopadhyay S, et al. J Mol Evol. 2007;64:204. doi: 10.1007/s00239-006-0063-5. [DOI] [PubMed] [Google Scholar]
- [15].Dennehy JJ, et al. P Roy Soc B-Biol Sci. 2010 [Google Scholar]
- [16].Holt R, Gaines M. Evol Ecol. 1992;6:433. [Google Scholar]
- [17].Kawecki TJ. Evolutionary Ecology. 1995;9:38. [Google Scholar]
- [18].Holt RD, Gomukiewicz R. Case studies in mathematical modeling: ecology, physiology, and cell biology. Prentice-Hall; Englewood Cli s, NJ: 1997. pp. 25–50. [Google Scholar]
- [19].Holt R, Gomulkiewicz R. Am Nat. 1997;149:563. [Google Scholar]
- [20].Gomulkiewicz R, Holt RD, Barfield M. Theor Popul Biol. 1999;55:283. doi: 10.1006/tpbi.1998.1405. [DOI] [PubMed] [Google Scholar]
- [21].Kawecki TJ. Proc Biol Sci. 2000;267:1315. doi: 10.1098/rspb.2000.1144. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [22].Ronce O, Kirkpatrick M. Evolution. 2001;55:1520. doi: 10.1111/j.0014-3820.2001.tb00672.x. [DOI] [PubMed] [Google Scholar]
- [23].Holt RD, Gomulkiewicz R, Barfield M. Proc Biol Sci. 2003;270:215. doi: 10.1098/rspb.2002.2219. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [24].Holt RD, Knight TM, Barfield M. Am Nat. 2004;163:253. doi: 10.1086/381408. [DOI] [PubMed] [Google Scholar]
- [25].Goel NS, Dyn N. Stochastic models in biology. Blackburn Press; Caldwell, N.J.: 2003. [Google Scholar]
- [26].We ignore back-migration from patch 2; this is allowed if ν << δ or N2 remains small; also, we show below that the results are insensitive to perturbations of γ1M(N1).
- [27].Ohta, Gillespie Theor Popul Biol. 1996;49:128. doi: 10.1006/tpbi.1996.0007. [DOI] [PubMed] [Google Scholar]
- [28].Kimura M. The neutral theory of molecular evolution. Cambridge University Press; Cambridge: 1983. [Google Scholar]
- [29].Dykhuizen D, Hartl DL. Genetics. 1980;96:801. doi: 10.1093/genetics/96.4.801. [DOI] [PMC free article] [PubMed] [Google Scholar]

