Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Aug 1.
Published in final edited form as: J Math Biol. 2014 Aug 24;71(2):325–359. doi: 10.1007/s00285-014-0824-5

PROTECTED POLYMORPHISMS AND EVOLUTIONARY STABILITY OF PATCH-SELECTION STRATEGIES IN STOCHASTIC ENVIRONMENTS

STEVEN N EVANS 1, ALEXANDRU HENING 2, SEBASTIAN J SCHREIBER 3
PMCID: PMC4486641  NIHMSID: NIHMS692112  PMID: 25151369

Abstract

We consider a population living in a patchy environment that varies stochastically in space and time. The population is composed of two morphs (that is, individuals of the same species with different genotypes). In terms of survival and reproductive success, the associated phenotypes differ only in their habitat selection strategies. We compute invasion rates corresponding to the rates at which the abundance of an initially rare morph increases in the presence of the other morph established at equilibrium. If both morphs have positive invasion rates when rare, then there is an equilibrium distribution such that the two morphs coexist; that is, there is a protected polymorphism for habitat selection. Alternatively, if one morph has a negative invasion rate when rare, then it is asymptotically displaced by the other morph under all initial conditions where both morphs are present. We refine the characterization of an evolutionary stable strategy for habitat selection from [Schreiber, 2012] in a mathematically rigorous manner. We provide a necessary and sufficient condition for the existence of an ESS that uses all patches and determine when using a single patch is an ESS. We also provide an explicit formula for the ESS when there are two habitat types. We show that adding environmental stochasticity results in an ESS that, when compared to the ESS for the corresponding model without stochasticity, spends less time in patches with larger carrying capacities and possibly makes use of sink patches, thereby practicing a spatial form of bet hedging.

Keywords: density-dependent, frequency-dependent, protected polymorphism, evolutionarily stable strategy, exclusion, dimorphic, ideal-free, invasion rate, habitat selection, bet hedging

1. Introduction

Habitat selection by individuals impacts key attributes of a population including its spatial distribution, temporal fluctuations in its abundance, and its genetic composition. In environmentally heterogeneous landscapes, individuals selecting more favorable habitats are more likely to survive or reproduce. As population densities increase in these habitats, individuals may benefit by selecting previously unused habitats. Thus, both environmental conditions and density-dependent feedbacks generate selective pressures on habitat selection. Under equilibrium conditions, spatial heterogeneity can select for populations exhibiting an ideal-free distribution–equal per-capita growth rates in all occupied patches and lower per-capita growth rates if individuals moved into unoccupied patches (Fretwell and Lucas, 1969). Under non-equilibrium conditions, spatial-temporal heterogeneity can select for individuals occupying sink habitats in which the per-capita growth rate is always negative (Holt, 1997; Jansen and Yoshimura, 1998). Environmental heterogeneity can also promote coexistence of genotypes only differing in their habitat choices (Jaenike and Holt, 1991). Despite significant advances in the mathematical theory for habitat selection under equilibrium conditions, a mathematical theory for habitat selection in stochastic environments is largely lacking. Here, we take a step to addressing this mathematical shortfall while at the same gaining new insights into the evolution of habitat selection for populations living in stochastic, patchy environments.

Since the classic paper Fretwell and Lucas (1969), the ideal-free distribution has been studied extensively from empirical, theoretical, and mathematical perspectives. Empirical support for ideal-free distributions exists for many taxa including fish (Godin and Keenleyside, 1984; Oksanen et al., 1995; Haugen et al., 2006), birds (Harper, 1982; Doncaster et al., 1997), mammals (Beckmann and Berger, 2003), and insects (Dreisig, 1995). For example, Oksanen et al. (1995) found that armored catfish in Panamanian stream pools were distributed such that the resource availability per catfish was equal in all occupied pools, despite significant variation in light availability across these occupied pools. Theoreticians have identified several “non-ideal” mechanisms (e.g. sedentarism, adaptive movement with finite speed, density-dependent dispersal) that, under equilibrium conditions, generate an ideal-free distribution (Hastings, 1983; Cosner, 2005; Gejji et al., 2012). For example, at equilibrium, sedentary populations achieve an ideal-free distribution provided, paradoxically, the populations initially occupied all habitat patches. While many early studies asserted that the ideal free distribution is an evolutionarily stable strategy (ESS) (Fretwell and Lucas, 1969; van Baalen and Sabelis, 1993; Schreiber et al., 2000), only recent advanced nonlinear analyses fully verified this assertion (Cressman et al., 2004; Cressman and Křivan, 2006, 2010; Cantrell et al., 2007, 2010, 2012).

In nature, observed habitat occupancies are frequently less extreme than predicted by the ideal-free distribution: individuals underuse higher quality habitats and overuse lower quality habitats compared to theoretical predictions (Milinski, 1979; Tregenza, 1995). Notably, populations occupying sink habitats have been documented in many species (Sokurenko et al., 2006; Tittler et al., 2006; Robinson et al., 2008; Anderson and Geber, 2010). One possible explanation for these observations is that populations experience temporal as well as spatial variation in environmental conditions and, consequently, theory based on equilibrium assumptions tells an incomplete story. In support of this explanation, several theoretical studies have shown that occupation of sink habitats should evolve when temporal variation is sufficiently great in other habitats (Holt, 1997; Jansen and Yoshimura, 1998; Holt and Barfield, 2001; Schreiber, 2012). These theoretical developments, however, rely on linearizations of density-dependent models, and do not analyze the dynamics of competing genotypes, the ultimate basis for evolutionary change due to natural selection. Hence, these studies leave unanswered the question, “Does the linear analysis correctly identify competitive exclusion in pairwise interactions that is the basis for the analysis of evolutionarily stable strategies?”

Within populations, individuals can exhibit different habitat selection strategies, and there is some evidence these differences can be genetically based (Via, 1990; Jaenike and Holt, 1991). For instance, some individuals of the fruit fly species Drosophila tripunctata prefer tomato host plants (one potential habitat for its larvae) while others prefer mushrooms (another potential habitat), and these differences are based on two genetically independent traits, settling behavior and ovipositor site preference (Jaenike, 1985). Jaenike and Holt (1991) found that genetic variation in habitat selection is common, especially in arthropods and mollusks. Furthermore, they demonstrated using mathematical models that this genetic variation can stem from density-dependent regulation occurring locally within each habitat. Specifically, Jaenike and Holt write “frequency-dependent selection favors alleles that confer upon their carriers a preference for underused habitats, even if there is no genetic variation in how well individuals are adapted to the different habitat” (Jaenike and Holt, 1991, p.S78). Their analysis, however, doesn’t account for temporal fluctuations in environmental conditions and this raises the question, “Does environmental stochasticity facilitate or hinder the maintenance of genetic variation in habitat selection?”

To answer the aforementioned questions, we provide an in-depth analysis of a model introduced in (Schreiber, 2012). The single genotype (i.e. monomorphic) version of this model and a characterization of its dynamics are given in Section 2. The competing genotype (i.e. dimorphic) version of the model and invasion rates of each genotype when rare are introduced in Section 3. In Section 4, we prove that these invasion rates determine the long-term fate of each of the genotypes. Specifically, if both genotypes have positive invasion rates when rare, then there is a positive stationary distribution under which the genotypes coexist. Alternatively, if one genotype has a negative invasion rate when rare, then it is asymptotically displaced by the other genotype. These result allows us to use the invasion rates when rare to explore conditions supporting a protected polymorphism for habitat selection. In Section 5, we refine the characterization of an evolutionary stable strategy for habitat selection from (Schreiber, 2012) in a mathematically rigorous manner, and provide an explicit formula for this ESS when there are two habitat types. Section 6 concludes with a discussion of how our results relate to the existing literature and identifies future challenges for the theory of habitat selection in stochastic environments.

2. The Monomorphic Model

To set the stage for two competing populations spread over several patches, we start with a single population living in one patch. Let Zt be the population abundance at time t ≥ 0. The stochastic process (Zt)t≥0 is governed by the Itô stochastic logistic equation

dZt=Zt(μκZt)dt+σZtdWt, (2.1)

where μ is the intrinsic rate of growth of the population in the absence of stochasticity, κ is the strength of intraspecific competition, σ2 > 0 is the infinitesimal variance parameter of the stochastic growth rate, and (Wt)t≥0 is a standard Brownian motion. The process (Zt)t≥0 is a strong Markov process with continuous paths. We call an object with such properties a diffusion.

As shown in our first proposition, the process (Zt)t≥0 lives in the positive half line ++(0,); that is, if we start it in a strictly positive state, then it never hits zero. Furthermore, the long-term behavior of the process is determined by the stochastic rate of growth μσ22. When the stochastic growth rate is negative the population abundance converges asymptotically to zero with probability one. On the other hand, when this parameter is positive the distribution of the abundance converges to an equilibrium given by a Gamma distribution. These results are well-known, but, as introduction to the methods used to prove our main results, we provide a proof in Appendix A.

Proposition 2.1. Consider the diffusion process (Zt)t≥0 given by the stochastic differential equation (2.1).

  • The stochastic differential equation has a unique strong solution that is defined for all t ≥ 0 and is given by
    Zt=Z0exp((μσ22)t+σWt)1+Z0μκ0texp((μσ22)s+σWs)ds.
  • If Z0 = z > 0, then Zt > 0 for all t ≥ 0 almost surely.

  • If μσ22<0, then limt→∞ Zt = 0 almost surely.

  • If μσ22=0, then lim inft→∞ Zt = 0 almost surely, lim supt→∞Zt = ∞ almost surely, and limt1t 0tZsds=0 almost surely.

  • If μσ22>0, then (Zt)t≥0 has a unique stationary distribution ρ on ++ with Gamma density g(x)=1Γ(k)θkxk1exθ, where
    θσ22κandk2μσ21.

Moreover, if Z0 = z > 0, then

limt1t0th(Zs)ds=0h(x)g(x)dxalmostsurely

for any Borel function h:++ with 0h(x)g(x)dx<. In particular,

limt1t0tZsds=1κ(μσ22)almostsurely.

Next, we consider a population living in a spatially heterogeneous environment with n different patches. These patches may represent distinct habitats, patches of the same habitat type, or combinations thereof. The abundance of the population in the i-th patch at time t ≥ 0 is Xti. Let (Xti)t0 be given by

dXti=Xti(μiκiXti)dt+XtidEti, (2.2)

where μi is the intrinsic rate of growth the population in patch i in the absence of stochasticity, κi is the strength of intraspecific competition in patch i, and Eti=ΣjγjiBtj for a standard multivariate Brownian motion (B1, …, Bn)T on n and an n × n matrix Γ := (γij). The infinitesimal covariance matrix for the non-standard Brownian motion (Et1,,Etn) is Σ = (σij):= ΓTΓ.

The populations in the various patches described by equation (2.2) are coupled only by the spatial correlations present in the driving Brownian motion (Et1,,Etn). We further couple the population dynamics across patches by assuming the fraction of population in patch i equals αi for all time. This spatial distribution can be realized at the scale of the individual when, as described in greater detail in Remark 2.2, individuals disperse rapidly and independently of one another in such a manner that the fraction of time spent in patch i equals αi for each individual. Under this assumption, we call α = (α1,α2, ⋯ αn), with αi ≥ 0 for all 1 ≤ in and Σi=1n αi = 1, a patch-selection strategy. Continuing to denote the abundance of the population in the i-th patch at time t ≥ 0 as Xti, we have Xti=αiXt, where Xt=Σi=1nXti is the total population abundance at time t ≥ 0. If we impose these constraints on (X1,Xn), then it is heuristically reasonable that the process X is an autonomous Markov process that satisfies the SDE

dXt=Xti=1nαi(μiκiαiXt)dt+Xti=1nαidEti. (2.3)

Remark 2.2. One way to justify the formulation of (2.3) rigorously is to first modify (2.2) to obtain a system of SDEs explicitly accounting for dispersal. Suppose that individuals disperse from patch i to patch j at a rate δdij for some fixed rate matrix D = (dij). As usual, we adopt the convention dii = −Σji dij. The resulting system of SDEs is

dX~ti=X~ti(μiκiX~ti)dt+δjX~tjdjidt+X~tidEti. (2.4)

Assume that the rate matrix D has a unique stationary distribution α; that is, αj > 0 for 1 ≤ jn, Σi=1n αi = 1,

j=1nαjdji=0 (2.5)

for 1 ≤ in. In this case, a vector (y1, …, yn) satisfies

j=1nyjdji=0 (2.6)

for 1 ≤ in if and only if

yj=αjc (2.7)

for 1 ≤ jn for some constant c. Moreover, summing (2.7) we find that

c=i=1nyi. (2.8)

Note that by (2.5) we can write the drift term in (2.4) that contains δ as

δjX~tjdjidt=δj(X~tjαjX~t)djidt (2.9)

where X~tΣi=1nX~ti. using (2.7) and (2.8), we see that (x1, …, xn) and xΣi=1nxi are such that

j=1n(xjαjx)dji=0 (2.10)

for i = 1, …, n if and only if

xjαjx=αji=1n(xiαix)=0 (2.11)

for 1 ≤ jn.

It follows from (2.9) and the equivalence between (2.10) and (2.11) that as δ increases the solution of (2.4) experiences an increasingly strong drift towards the one-dimensional subspace

{(x1,,xn):xi=αi(x1++xn),i=1,,n}.

In the limit δ → ∞, it is plausible that the system (2.4) converges to one for which

X~ti=αiX~t,

where X~tX~t1++X~tn, and the total population size X~t satisfies the autonomous one-dimensionl SDE (2.3) with Xt replaced by X~t. This heuristic for obtaining (2.3) as a high dispersal rate limit of (2.4) can be made rigorous by applying Theorem 6.1 from Katzenberger (1991).

Let xy=Σi=1nxiyi denote the standard Euclidean inner product and define another inner product 〈·, ·〉κ by x,yκΣi=1nκixiyi. Since (α · Et)t≥0 is a Brownian motion with infinitesimal variance parameter α · Σα, (2.3) can be expressed more simply as

dXt=Xt(αμα,ακXt)dt+XtαΣαdWt, (2.12)

where Wt is a standard Brownian motion.

The total population (Xt)t0 defined by (2.12) behaves exactly like the one-patch case defined by (2.1) with the parameters μμ · κ → 〈α, ακ and σαΣα. In particular, (Xt)t0 is a diffusion process and we have the following immediate consequence of Proposition 2.1

Proposition 2.3. Consider the diffusion process (X)t0 given by (2.12).

  • If X0=x>0, then Xt>0 for all t ≥ 0 almost surley.

  • If αμαΣα2<0, then limtXt=0 almost surley.

  • If αμαΣα2=0, then liminftXt=0 almost surley, limsuptXt= almost surley, and limt1t 0t Xsds=0 almost surely.

  • If αμαΣα2>0, then the process (Xt)t0 has a unique stationary distribution ρX on ++ with Gamma density g(x)=1Γ(k)θkxk1exθ, where
    θαΣα2α,ακandk2αμαΣα1.

Moreover,

limt1t0th(Xs)ds=0h(x)g(x)dxalmostsurely

for any Borel function h:++ with 0h(x)g(x)dx<. In particular,

limt1t0tXsds=1α,ακα(μΣα2)almostsurely.

For the dynamics (2.2) in patch i, Proposition 2.1 implies that if there was no coupling between patches by dispersal, then then population abundance in patch i would converge to 0 if μiσii/2 < 0 and converge to a non-trivial equilibrium if μiσii/2 > 0. As noted by Schreiber (2012) and illustrated below, the spatially coupled model is such that the population can persist and converge to an equilibrium even when μiσii/2 < 0 for all patches.

Persistence of coupled sink populations in symmetric landscapes

Consider a highly symmetric landscape where μi = r, σii = σ2 > 0 for all i, κi = a for all i, and σij = 0 for all ij. If individuals are equally distributed across the landscape (αi = 1/n for all i), then

μiσii2=rσ22andαμαΣα2=rσ22n.

The increase in the stochastic growth rate from rσ2/2 for an isolated population to rσ2/(2n) for the spatially coupled population stems from individuals spending equal time in patches with uncorrelated environmental fluctuations. Specifically, the environmental variance experienced by individuals distributing their time equally amongst n uncorrelated patches is n times smaller than the environmental variance experienced by an individual spending their time entirely in one patch. Whenever σ2 > 2r > σ2/n, this reduction in variance allows the entire population to persist despite patches, in and of themselves, not supporting population growth.

3. Dimorphic model and invasion rates

To understand the evolution of patch-selection strategies, we now consider competition between populations that only differ in their patch-selection strategy. Let Xt and Yt be the total population sizes at time t ≥ 0 of two populations playing the respective patch selection strategies α = (α1,α2,…,αn) and β = (β1,β2,…,βn), so that the densities of the populations in patch i are αiXt and βiYt at time t ≥ 0. The dynamics of these two strategies are described by the pair of stochastic differential equations

dXt=Xti=1nαi(μiκi(αiXt+βiYt))dt+Xti=1nαidEtidYt=Yti=1nβi(μiκi(αiXt+βiYt))dt+Yti=1nβidEti. (3.1)

Since

d[X,X]t=Xt2αΣαdtd[Y,Y]t=Yt2βΣβdtd[X,Y]t=XtYtαΣβdt,

the diffusion process ((Xt, Yt))t≥0 for the spatially coupled, competing strategies can be represented more compactly as

dXt=Xt[μαα,βκYtα,ακXt]dt+XtαΣαdUtdYt=Yt[μβα,βκXtβ,βκYt]dt+YtβΣβdVt, (3.2)

where (U, V) is a (non-standard) Brownian motion with covariance structure d[U, U]t = dt, d[V, V]t = dt, and d[U,V]t=αΣβαΣαβΣβdt. Using a construction similar from Remark 2.2, system (3.2) can be seen as a high dispersal limit. This system exhibits a degeneracy when U = V i.e. αΣβαΣαβΣβ=1. If Σ is nonsingular, then, by the Cauchy-Schwarz inequality, this degeneracy only occurs if α = β. We do not consider this possibility in what follows.

To determine whether the two populations coexist or one displaces the other, we introduce the invasion rate I(α,β) of a population playing strategy β when introduced at small densities into a resident population playing strategy α. As shown in the next proposition, this invasion rate is defined by linearizing the dynamics of Y and computing the long-term population growth rate I(α,β) associated with this linearization. When I(α,β)>0, the population playing strategy β tends to increase when rare. When I(α,β)<0, the population playing strategy β tends to decrease when rare.

Proposition 3.1. Consider the partially linearized system

dXt=Xt[μαα,ακXt]dt+XtαΣαdUtdY^t=Y^t[μβα,βκXt]dt+Y^tβΣβdVt. (3.3)

Assume X0>0 and Y^0>0.

If α · (μ − Σα/2) > 0, so the Markov process X has a stationary distribution concentrated on ++, then the limit limtlogY^tt exists almost surely and is given by

I(α,β)=β(μΣβ2)α,βκα,ακα(μΣα2). (3.4)

On the other hand, if α·(μ − Σα/2) ≤ 0, so that limt1t 0t Xsds=0 almost surely, then the limit limtlogY^tt exists almost surely and is given by

I(α,β)=β(μΣβ2). (3.5)

Proof. By Itô’s lemma,

dlogY^t=(μβα,βκXt)dt+βΣβdVt12(βΣβ)dt.

Assume that μααΣα2>0. By Proposition 2.3,

limt1t0tXsds=α2α,ακ(2μΣα)almost surely.

Therefore,

limtlogY^tt=βμα,βκ2α,ακα(2μΣα)12βΣβalmost surely,

as claimed.

On the other hand, assume that μααΣα20. By Proposition 2.3,

limt1t0tXsds=0almost surely.

Therefore,

limtlogY^tt=μβ12βΣβalmost surely,

again as claimed.

In the next proposition, we show that if a population playing strategy β cannot invade a population playing strategy α (i.e. I(α,β)<0, ), then the population strategy α can invade the population playing strategy β (i.e. I(β,α)>0). This suggests, as we will show in the next section, that such a strategy α should exclude strategy β.

Proposition 3.2. Suppose that α·(μ − Σα2) > 0 and I(α,β)<0. Then, I(β,α)>0.

Proof. Set A:= α·(μ − Σα2) and B:= β·(μ − Σβ2). Assume that A > 0 and I(α,β)<0. To show that I(β,α)>0, we consider two cases, B ≤ 0 and B > 0. Suppose B ≤ 0. Then, I(β,α)=A>0 by Proposition 3.1 and by assumption.

Alternatively, suppose that B > 0. Then

I(α,β)=BAα,βκα,ακandI(β,α)=ABα,βκβ,βκ

by Proposition 3.1. Assume, contrary to our claim, that I(α,β)<0 and I(β,α)0. From the Cauchy-Schwarz inequality x,yκx,xκ12y,yκ12 we get

Bα,ακ12β,βκ12Bα,βκAα,βκAα,ακ12β,βκ12Aα,βκ>Bα,ακ.

The above inequalities yield the contradiction Bα,ακ12Aβ,βκ12 and Bα,ακ12<Aβ,βκ12.

An immediate consequence of Proposition 3.1 is the following corollary. This corollary implies that if a population playing strategy β can invade a population playing strategy α and a population playing strategy α can invade a population playing strategy β, then a single population playing strategy β converges to a non-trivial equilibrium and the same is true of a single population playing strategy β. This suggests, as we show in the next section, that under these conditions these two strategies should coexist.

Corollary 3.3. The invasion rate satisfies I(α,β)β(μΣβ2). In particular, if I(α,β)>0 and I(β,α)>0, then α · (μ − Σα/2) > 0 and β · (μ − Σβ/2) > 0.

4. Exclusion and protected polymorphisms

Our main results about the dimorphic process (X, Y) is that the invasion rates determine the long-term fate of competing strategies. If the invasion rates predict that strategy β cannot invade a population playing strategy α, then the population playing strategy α drives the population playing strategy β asymptotically to extinction as shown in the following theorem. We give a proof in Appendix B.

Theorem 4.1. If α · (μ − Σα/2) > 0 and I(α,β)<0, then, for (x,y)++2, the probability measures

1t0t(x,y){(Xs,Ys)}ds

converge weakly as t → ∞ to ρXδ0, where ρX is the unique stationary distribution of Xt concentrated on ++, and δ0 is the point mass at 0.

On the other hand, if the invasion rates predict that each strategy can invade when rare, then the following theorem proves that the competing strategies coexist: for any initial conditions the joint distribution of (Xt, Yt) converges as t → ∞ to a probability distribution on ++2 with density ψ and, moreover, for any Borel set B++2 the long term proportion of times t for which (Xt, Yt) spends in B converges to

Bψ(x,y)dxdy.

A proof is given in Appendix C. In order to appreciate the assumptions of the theorem, it helps to recall Corollary 3.3 which says that if I(α,β)>0 and I(β,α)>0 then α · (μ − Σα/2) > 0 and β · (μ − Σβ/2) > 0 so that a single population playing strategy α or β will persist.

Theorem 4.2. Suppose that I(α,β)>0 and I(β,α)>0. Then, there exists a unique stationary distribution π of (X; Y ) on ++2 that is absolutely continuous with respect to Lebesgue measure. Moreover, for any bounded, measurable function f:++2

limt1t0tf(Xs,Ys)ds=++2f(x,y)π(dx,dy)almostsurely. (4.1)

Furthermore, the process (X, Y) is strongly ergodic, so that for any initial distribution q one has

limtdTV(q{(Xt,Yt)},π)=0, (4.2)

where dTV is the total variation distance.

From the perspective of population genetics, the coexistence of these two strategies corresponds to a protected polymorphism: each strategy (a morph) increases when rare and, therefore, is protected from extinction. This protection from extinction, however, is only ensured over ecological time scales as mutations may result in new morphs that can displace one or both coexisting morphs (Ravigné et al., 2004). The concept of protected polymorphisms was introduced by Prout (1968) when studying deterministic models of competing haploid populations in a spatially heterogenous with overlapping generations. Turelli et al. (2001) extended this concept to stochastic difference equations for competing haploid populations with a constant population size. Theorem 4.2 provides a mathematically rigorous characterization of protected polymorphisms for our stochastic models with fluctuating population sizes.

Theorem 4.2 implies that coexistence depends on the intrinsic stochastic growth rate of the populations and the competitive effect of each population on the other. The intrinsic stochastic growth rates are given by

rα=α(μΣα2)andrβ=β(μΣβ2).

While the competitive effect of the population with strategy α on the population with strategy β is given by the ratio of the magnitude of α projected in the β direction (i.e. 〈β/||β||κ, ακ, where βκ=β,βκ) divided by the magnitude of β (i.e. ||βκ). Mathematically, the competitive effect of α on β and the competitive effect of β on α are given by

Cα,β=ββκ,ακβκandCβ,α=αακ,βκακ.

Provided rα and rβ are positive, Theorem 4.2 implies that there is a protected polymorphism if

rαrβ>Cβ,αandrβrα>Cα,β. (4.3)

In words, the relative intrinsic stochastic growth rate of each population must exceed the competitive effect on itself due to the other population. Conversely, if one of the inequalities in (4.3) is reversed, then Theorem 4.1 implies that one population excludes the other. Unlike the standard Lotka-Volterra competition equations, Proposition 3.2 implies that both inequalities in (4.3) cannot be simultaneously reversed and, consequently, bistable dynamics are impossible.

Environmental stochasticity impedes protected polymorphisms in symmetric landscapes.

Consider a landscape where all patches have the same carrying capacities (e.g. κi = 1 for all i), the same intrinsic rates of growth (i.e. μi = a for all i), and the same amount of uncorrelated environmental stochasticity (e.g. σii = σ2 for all i and σij = 0 for ij). Then the protected polymorphism inequalities (4.3) become

aσ2α22aσ2β22>Cβ,αandaσ2β22aσ2α22>Cα,β (4.4)

where the only σ2 dependency is on the left hand sides of both inequalities. As aσ2β22rσ2α22 is a decreasing function of σ2 whenever αβ<1 and an increasing function of σ2 whenever αβ>1, it follows that the set of set of strategies supporting a protected polymorphism

A(σ2)={(α,β):(4.4)holds}

is a decreasing function of σ2 i.e A(σ22) is a proper subset of A(σ12) whenever σ2 > σ1 ≥ 0. Figure 1A illustrates this conclusion for a two-patch landscape. Intuitively, increasing environmental stochasticity in these symmetric landscapes reduces the stochastic growth rate for all strategies and, thereby, makes it less likely for populations to persist let alone coexist. For asymmetric landscapes, how the set A(σ2) of protected polymorphisms varies with σ2 is more subtle, as illustrated in Figure 1B. In this case, some protected polymorphisms are facilitated by environmental stochasticity, while other protected polymorphisms are disrupted by environmental stochasticity.

Figure 1.

Figure 1

Protected polymorphisms and exclusion in two-patch landscapes. Contour plots of I(α,β) where lighter shades correspond to higher values of I(α,β). The regions where I(α,β)I(β,α)>0 are delineated by the solid curves and correspond to parameter combinations supporting a protected polymorphism. Regions where I(α,β)I(α,β)<0 correspond to strategies that cannot coexist. The dashed-dotted and dotted curves indicate how regions of coexistence and exclusion change for higher and lower levels of environmental stochasticity σ2, respectively. In panel A, the landscape is spatially homogeneous with μ = (1, 1), κ = (1, 1) and Σ = σ2I where I is the 2 × 2 identity matrix. In B, the landscape is spatial heterogeneous with respect to the deterministic carrying capacities κ = (3, 1) and the remaining parameters as A.

For the symmetric landscapes, we can identify a strategy that displaces all others. Namely, the strategy α=(1n,,1n) of visiting all patches with equal frequency. This strategy maximizes the function function αaσ2α22. Hence, if we consider a competing strategy βα, then αβ=α2=1n and

I(α,β)=aσ2β22αβα2(aσ2α22)=aσ2β22(aσ2α22)<0.

e.g. the invasion rates are negative along the vertical transect α1 = 1/2 in Figure. 1A. This strategy α is an example of an evolutionarily stable strategy that we discuss further in the next section.

5. Evolutionarily stable strategies

The concept of an evolutionary stable strategy was introduced by Maynard Smith and Price (1973). Loosely stated, an evolutionary strategy is a strategy that cannot be invaded by any other strategy and, consequently, can be viewed as an evolutionary endpoint. For our models, we say patch selection strategy α is an evolutionarily stable strategy (ESS) if I(α,β)<0 for all strategies βα. In light of Theorem 4.1, an ESS not only resists invasion attempts by all other strategies, but can displace all other strategies. An ESS α is called a pure ESS if αi = 1 for some patch i, otherwise it is a mixed ESS. Our next result provides an algebraic characterization of mixed and pure ESSs. However, it remains to be understood whether these ESSs can be reached by small mutational steps in the strategy space (i.e. are convergently stable (Geritz et al., 1997)).

Theorem 5.1. Assume that the covariance matrix Σ is positive definite and that there is at least one patch selection strategy which persists in the absence of competition with another strategy; that is, that maxα α · (μ − Σα/2) > 0.

Mixed strategy

An ESS α with αi > 0 for iI with I ⊆ {1, 2, …, n} and |I| ≥ 2 satisfies

αΣα2=μiκiαiα(2μΣα)2α,ακj=1nσijαj (5.1)

for all iI. Conversely, if |I| = n, then a strategy α satisfying (5.1) is an ESS.

Pure strategy

The strategy αi = 1 and αj = 0 for ji is an ESS if and only if

μjσjj2<σjj2+σijσii2 (5.2)

for all ji.

Furthermore, in the case of n = 2, there exists a mixed ESS whenever the reversed inequalities

μjσjj2>σjj2+σijσii2 (5.3)

hold for i = 1, 2 and ji.

The first statement of Theorem 5.1 provides a sufficient and necessary condition for a mixed ESS utilizing all patches. For example, in a symmetric landscape (as described in the previous section), this ESS condition is only satisfied for α = (1/n, 1/n, …, 1/n).

The second statement of Theorem 5.1 provides a characterization of when using only a single patch is an ESS. Since the right hand side of equation (5.2) is negative, using patch i can only be an ESS if all other patches have a negative stochastic rate of growth, μjσjj2<0 for all ji. However, even if only patch i has a positive stochastic growth rate, an ESS may use the other patches, as we illustrate next for two-patch landscapes.

ESSs in two-patch, uncorrelated landscapes

For an uncorrelated two patches landscape (i.e. n = 2 and σ12 = 0), Theorem 5.1 implies that there is a mixed ESS whenever

μ1>σ222andμ2>σ112 (5.4)

and this ESS satisfies

αi=μi+αΣακi(μααΣα2)α,ακ+σii. (5.5)

Equation (5.4) implies that even if deterministic growth in patch 2 is strictly negative (i.e. μ2 < 0), then there is selection for movement into this patch provided the variance of the fluctuations in patch 1 are sufficiently large relative to the intrinsic rate of decline in patch 2 (Fig. 2).

Figure 2.

Figure 2

ESS for patch selection (left) and mean population abundance (right) in a source-sink landscape. Parameter values: n = 2, σ11 = σ2, σ22 = σ12 = 0, μ = (1, μ2), and κ = (1, 1).

In the limit of no noise (i.e. σii ↓ 0 for i = 1, 2), equation (5.5) becomes

αi=μiκiμαα,ακ.

While our results do not apply to the deterministic case, this limiting expression for the ESS suggests, correctly, that the ESS for the deterministic model satisfies

αi=μiκijμjκjwheneverμi>0.

In other words, the fraction of individuals selecting patch i is proportional to the equilibrium density μi/κi supported by patch i. Equation (5.5) implies that adding stochasticity in equal amounts to all patches (i.e. σii = σ2 for all i) results in an ESS where, relative to the deterministic ESS, fewer individuals select patches supporting the highest mean population abundance and more individuals selecting patches supporting lower mean population abundances (Fig. 3).

Figure 3.

Figure 3

The effect of the deterministic carrying capacities and environmental stochasticity on the ESS for patch selection (left) and mean population abundance (right) in a two-patch landscape. The ratio κ1/(κ2 + κ2) corresponds to the ratio of the deterministic carrying capacity (μ2/κ2) in patch 2 to the sum of the deterministic carrying capacities (μ1/κ2 + μ2/κ2) when μ1 = μ2 = 1. Parameter values: n = 2, σ11 = σ22 = σ2, σ12 = 0, μ = (1, 1), and κ = (1, κ2).

6. Discussion

Habitat selection by organisms is a complex process determined by a mixture of genetic, developmental, ecological, and environmental factors. For ecologists, habitat selection plays a fundamental role in determining the spatial and temporal distribution of a population (Rosenzweig, 1981; Orians and Wittenberger, 1991). For evolutionary biologists, habitat selection determines the suite of environmental factors driving local adaptation (Edelaar and Bolnick, 2012). Indeed, in the words of the eminent evolutionary biologist Ernst Mayr, “With habitat and food selection – behavioral phenomena – playing a major role in the shift into new adaptive zones, the importance of behavior in initiating new evolutionary events is self-evident” (Mayr, 1963, p. 604). Here, we examined how spatial and temporal heterogeneity in demographic rates across multiple habitat patches influence the dynamics of competing populations who only differ in their habitat patch selection preferences. We assume that habitat selection has a genetic basis (e.g. genes that influence the physiological or neurological capacity of individuals to detect and respond to habitat cues) and that genetic differences in habitat choice have no pleiotropic effects on habitat specific fitness. Our analysis reveals that, generically, only two outcomes are possible, coexistence or displacement of one population by the other for all initial conditions, and that these outcomes are determined by the invasion rates of populations when rare. In addition to providing a mathematically rigorous justification of prior work, our analysis provides new insights into protected polymorphisms for habitat selection and raises several questions about evolutionary stable strategies for habitat selection.

Protected polymorphisms correspond to populations of competing genotypes exhibiting negative frequency-dependence: each population tends to increase when rare (Prout, 1968). In the case of patch selection, these competing populations differ in the frequencies in which they select habitat patches. In a survey of the empirical literature, Jaenike and Holt (1991) found “that genetic variation for habitat selection is common, especially in arthropods and mollusks, the groups that have been studied most frequently.” Moreover, they argued that some of this variation may be maintained through protected polymorphism. Specifically, “in a haploid model without intrinsic fitness differences among genotypes [i.e. soft selection], genetic variation in fixed habitat preferences may be maintained stably” (Jaenike and Holt, 1991, pg. S83). We provide a general analytic criterion (see, inequality (4.3)) characterizing these protected polymorphisms for spatially and temporally variable environments. This criterion depends on the intrinsic fitnesses (rα and rβ) of each population and their competitive coefficients (Cα,β and Cβ,α) that characterize the effect of each population on the other. Competitive effects are greatest when there is an overlap in patch use and one population tends to select the patches with the higher carrying capacities more than the other population. Intuitively, by occupying patches with a larger carrying capacities, populations achieve higher regional densities. Coupled with overlap in patch use, these higher densities result in a greater competitive impact of one population on another. A protected polymorphism occurs when the relative fitness of each population (e.g. rα/rβ for strategy α) is greater than the competitive effect of the other population on it (e.g. rα/rβ > Cβ,α for the population playing strategy α). Hence, as in the case of species coexistence (Chesson, 2000), protected polymorphism are most likely when fitness differences are small (i.e. rα/rβ ≈ 1) and competitive effects are small (i.e. both Cα,β and Cβ,α < 1). Environmental stochasticity solely effects the intrinsic fitness terms and can facilitate or inhibit protected polymorphisms. For landscapes in which all patches experience the same degree of uncorrelated, temporal variation, environmental stochasticity has an inhibitory effect as it magnifies fitness differences between competing strategies (e.g. rα/rβ increases with environmental stochasticity). For asymmetric landscapes, however, temporal variability can facilitate polymorphisms by reducing fitness differences of competing strategies.

In contrast to protected polymorphisms, our analysis reveals that populations playing an evolutionarily stable strategy (ESS) for patch selection not only thwart invasion attempts by all other strategies but also can invade and displace a population playing any other strategy. Furthermore, our analysis provides a mathematically rigorous justification of an earlier characterization of ESSs (Schreiber, 2012). This characterization implies that populations playing the ESS always occupy source habitats (i.e. patches where μiσii22>0). Indeed, consider a population playing strategy α that does not occupy some source patch, say patch i. Then a different behavioral genotype β that only selects patch i can invade as I(α,β)=μiσi22>0. In the limiting case of a deterministic environment, our characterization of the ESS recovers the classic result of McPeek and Holt (1992): the fraction of time spent in a patch is proportional to the carrying capacity of the patch. Adding environmental stochasticity generally results in populations playing the ESS decreasing the time spent in the patches with larger carrying capacities and possibly making use of sink patches (i.e. patches where μiσi22<0. This shift in patch choice can be viewed as a spatial form of bet hedging: individuals increase fitness by decreasing the variance in their stochastic growth rate at the expense of their mean growth rate (Childs et al., 2010).

We are able to show that for two patch landscapes there always exists an ESS for patch selection. However, several questions remain unanswered. First, what happens for landscapes with more than two patches? Is there always an ESS? Second, while we know that a population playing an ESS can displace a monomoprhic population playing a different strategy, can it displace polymorphic populations? Finally, are ESSs always convergently stable (Geritz et al., 1997)? If there are positive answers to this final suite of questions, then ESSs can be generally viewed as the ultimate evolutionary end state for patch selection strategies.

Going beyond the models considered here, studying the evolution habitat use faces many challenges. Our models assume that populations spend a fixed fraction of time in each patch and do so instantaneously. What happens if we relax these assumptions? For example, if populations are more ideal and able to track changes in population density instantaneously, then we have something closer to the classical notion of ideal free movement (Fretwell and Lucas, 1969). For these populations, what is the optimal (in an evolutionary sense) density-dependent strategy? Moreover, can such a strategy displace the static strategies considered here? Alternatively, if populations are less ideal and diffusing randomly on the landscape, what happens then? The linear version of this question was tackled in part by Evans et al. (2013). However, the mathematical analysis for analogous stochastic models with density-dependent feedbacks is largely unexplored. Going beyond single species, the coevolution of patch selection among interacting species has a rich history for spatially heterogeneous, but temporally homogeneous environments (van Baalen and Sabelis, 1993; Křivan, 1997; Schreiber et al., 2000; van Baalen et al., 2001; Schreiber et al., 2002; Cressman et al., 2004; Schreiber and Vejdani, 2006; Cantrell et al., 2007). For example, spatial heterogeneity can select for the evolution of contrary choices in which the prey prefers low quality patches to escape the predator and the predator prefers high quality patches to capture higher quality food items (Fox and Eisenbach, 1992; Schreiber et al., 2000). Understanding how environmental stochastic influences this coevolution of patch choice and the community level consequences of these coevolutionary outcomes provides a plethora of important, yet largely untouched challenges for future work.

Acknowledgments

The authors thank Dan Crisan, Alison Etheridge, Tom Kurtz, and Gregory Roth for helpful discussions.

S.N.E. was supported in part by NSF grant DMS-0907639 and NIH grant 1R01GM109454-01.

A.H. was supported by EPSRC grant EP/K034316/1.

S.J.S. was supported in part by NSF grants EF-0928987 and DMS-1022639.

Appendix A: Proof of Proposition 2.1

The stochastic differential equation for Z is of the form

dZt=b(Zt)dt+σ(Zt)dWt, (6.1)

where b(z) := μzκz2 and σ(z) := σz. It follows from Itô’s existence and uniqueness theorem for strong solutions of stochastic differential equations that this equation has a unique strong solution up to possibly a finite but strictly positive explosion time.

Set Rt := log Zt for t ≥ 0. By Itô’s lemma,

dRt=(μσ22κexp(Rt))dt+σdWt. (6.2)

It follows from the comparison principle of Ikeda and Watanabe (see Chapter VI Theorem 1.1 of Ikeda and Watanabe (1989)), Theorem 1.4 of Le Gall (1983), or Theorem V.43.1 of Rogers and Williams (2000)) that

RtR0+(μσ22)t+σWt, (6.3)

and so Z does not explode to +∞ in finite time. Moreover, since rμκer is a bounded, uniformly Lipschitz function on (∞, 0] it follows from Itô’s existence and uniqueness theorem that R does not explode −∞ to in finite time, so that Z does not hit 0 in finite time. We could have also established this result by using the scale function and speed measure calculated below to check Feller’s necessary and sufficient for the boundary point of a one-dimensional diffusion to be inaccessible – see Theorem 23.12 of Kallenberg (2002).

It is not hard to check using Itô’s lemma that an explicit solution of the SDE is

Zt=Z0exp((μσ22)t+σWt)1+Z0μκ0texp((μσ22)s+σWs)ds.

We see from the inequality (6.3) that if μ − σ2/2 < 0, then limt→∞ Zt = 0 almost surely.

We use the theory based on the scale function and speed measure of a one-dimensional diffusion (see, for example, Chapter 23 of Kallenberg (2002) or Sections V.6-7 of Rogers and Williams (2000)) below to establish that Z is positive recurrent with a unique stationary distribution when μσ2/2 > 0. Similar calculations show that Z is null recurrent when μσ2/2 = 0, and hence lim inft→∞ Zt = 0 almost surely and lim supt→∞ Zt = ∞. It follows from (6.2) and the comparison principle that if Z′ and Z″ are two solutions of (6.1) with respective parameters μ′, κ′, σ′ and μ″, κ″, σ″ satisfying μ′ ≤ μ″, κ′ = κ″, σ′ = σ″ and the same initial conditions, then ZtZt. We will show below that

limt1t0tZsds=1κ(μσ22)

almost surely when μσ2/2 > 0, and hence

limt1t0tZsds=0

almost surely when μσ2/2 > 0.

We now identify the scale function and speed measure of the one-dimensional diffusion Z. A choice for the scale function is

s(x)=cxexp(ay2b(z)σ2(z)dz)dy=cx(ya)2μσ2e2κσ2(ya)dy (6.4)

for arbitrary a,c++ (recall that the scale function is only defined up to affine transformations). If we set σ~=(σs)s1, then

ds(Zt)=σ~(s(Zt))dW~t

and the diffusion process s(Z) is in natural scale on the state space s(++) with speed measure m that has density 1σ~2.

The total mass of the speed measure is

m(++)=s(++)1σ~2(x)dx=s(++)1((σs)s1)2(x)dx=01σ2(u)s(u)du=01(σu)2(ua)2μσ2e2κσ2(ua)du=1σ2a2μσ0u2μσ22e2κσ2(ua)du. (6.5)

By Theorem 23.15 of Kallenberg (2002), the diffusion process Z has a stationary distribution concentrated on ++ if and only if the process s(Z) has (−∞, +∞) as its state space and the speed measure has finite total mass or s(Z) has a finite interval as its state space and the boundaries are reflecting. The introduction of an extra negative drift to geometric Brownian motion cannot make zero a reflecting boundary, so we are interested in conditions under which s(++)=(,) and the speed measure has finite total mass. We see from (6.4) and (6.5) that this happens if and only if μσ2/2 > 0, a condition we assume holds for the remainder of the proof.

The diffusion s(Z) has a stationary distribution with density f1m(++)σ~2 on s(++)=(,+), and so the stationary distribution of Z is the distribution on ++ that has density

g(x)=f(s(x))s(x)=1m(++)σ~2(s(x))s(x)=1m(++)σ2(x)s(x)=1m(++)x2σ2(xa)2μσ2e2κσ2(xa),x++.

This has the form of a Gamma(k,θ) density with parameters θσ22κ and k=2μσ21. Therefore,

g(x)=1Γ(k)θkxk1exθ=1Γ(2μσ21)(σ22κ)2μσ21x2μσ22e2κxσ2,x++.

Theorem 20.21 from Kallenberg (2002) implies that the shift-invariant σ-field is trivial for all starting points. The ergodic theorem for stationary stochastic processes then tells us that, if we start Z with its stationary distribution,

limt1t0th(Zs)ds=0h(x)g(x)dx

for any Borel function h:++ with 0h(x)g(x)dx<. Since Z has positive continuous transition densities we can conclude that

limt1t0th(Zs)ds=0h(x)g(x)dx

x-almost surely for any x++.

In particular,

++xg(x)dx=kθ=1κ(μσ22).

Appendix B: Proof of Theorem 4.1

To simplify our presentation, we re-write the joint dynamics of X and Y as

dXt=Xt(μα(aXt+cYt))dt+σXXtdUtdYt=Yt(μβ(cXt+bYt))dt+σYYtdVt, (6.6)

where aα,ακ, bβ,βκ, cα,βκ, σXαΣα and σYβΣβ.

To prove Theorem 4.1, we need several preliminary results. First, we prove existence and uniqueness of solutions to the system (6.6) as well as a useful comparison result in Theorem 6.1. Second, in Proposition 6.3, we establish that (Xt, Yt) remains in ++2=(0,)2 for all t ≥ 0 whenever (X0,Y0)++2. Third, in Proposition 6.4, we show that weak limit points of the empirical measures 1t0t(x,y){Xs,Ys)} ds are stationary distributions for the process (X, Y) thought of as a process on +2 (rather than ++2). Finally, we show that limt→∞ Yt = 0 with probability one in Proposition 6.5 and conclude by showing that 1t0t(x,y){Xs,Ys)} ds converges weakly to ρXδ0 concentrated on ++×{0}.

Theorem 6.1. The stochastic differential equation in (6.6) has a unique strong solution and Xt,YtLp((x,y)) for all t, p > 0 for all (x,y)++2. This solution satisfies Xt > 0 and Yt > 0 for all t ≥ 0, (x,y)-almost surely for all (x,y)++2. Let ((Xt,Yt))t0 be the stochastic process defined by the pair of stochastic differential equations

dXt=Xt(μαaXt)dt+σXXtdUtdYt=Yt(μβbYt)dt+σYYtdVt (6.7)

If (X0,Y0)=(X0,Y0), then

XtXt

and

YtYt

for all t ≥ 0.

Proof. The uniqueness and existence of strong solutions is fairly standard, see, for example, Theorem 2.1 in Li and Mao (2009). One notes that the drift coefficients are locally Lipschitz so strong solutions exist and are unique up to the explosion time. It is easy to show this explosion time is almost surely infinite (see Theorem 2.1 in Li and Mao (2009)). Next, suppose that X0=X0. We adapt the comparison principle of Ikeda and Watanabe (Chapter VI Theorem 1.1 from Ikeda and Watanabe (1989)) proved by the local time techniques of Le Gall (see Theorem 1.4 from Le Gall (1983) and Theorem V.43.1 in Rogers and Williams (2000)) to show that XtXt0 for all t ≥ 0.

Define ρ:++ by ρ(x) = |x|2. Note that

0tρ(XsXs)1𝟙{XsXs>0}d[XX]s=0tρ(XsXs)1(σXXsσXXs)2𝟙{XsXs>0}dsσX2t.

Since 0+ρ(u)1du=, by Proposition V.39.3 from Rogers and Williams (2000) the local time at 0 of XX is zero for all t ≥ 0. Put x+ := x∨0. By Tanaka’s formula (see equation IV.43.6 in Rogers and Williams (2000)),

(XtXt)+=0t𝟙{XsXs>0}(σXXsσXXs)dUt+0t𝟙{XsXs>0}[(μα(aXs+cYs))Xs(μαaXs)Xs]ds.

For K > 0 define the stopping time

TKinf{t>0:XtKorXtK}

and the stopped processes XtK=XTKt and XtK=XTKt. Then, stopping the processes at TK and taking expectations yields

0𝔼(XtKXtK)+=𝔼0tTK𝟙{XsXs>0}[(μαXsXs(aXs+cYs))(μαXsaXs2)]ds=𝔼0tTK𝟙{XsXs>0}[μα(XsXs)a(Xs2Xs2)cXsYs]ds𝔼0tTK𝟙{XsXs>0}μα(XsXs)dsμα𝔼0tTK(XsXs)+dsμα𝔼0t(XsKXsK)+ds.

By Gronwall’s Lemma (see, for example, Appendix 5 of Ethier and Kurtz (2005)) 𝔼[(XtKXtK)+]=0 for all t ≥ 0, so XtKXtK for all t ≥ 0. Now let K → ∞ and that X does not explode to get that XtXt for all t ≥ 0. Since we have shown before that X is dominated by a geometric Brownian motion, a process that has finite moments of all orders, we get that Xt,YtLp((x,y)) for all t, p > 0 and for all (x,y)++2.

Remark 6.2. Note that the SDEs for all the processes considered here have unique strong solutions in Lp for all t ≥ 0, p > 0 and for all strictly positive starting points. This follows by arguments similar to those that are in Theorem 2.1 from Li and Mao (2009) and in Theorem 6.1 by noting that our SDEs for (X, Y), (X, Y) etc. are all of the form

dX˘t=X˘t[λ1λ2Y˘tλ3X˘t]dt+X˘tσXdUtdY˘t=Y˘t[λ4λ5X˘tλ6Y˘t]dt+Y˘tσYdVtX˘0=xY˘0=y

for λ1,,λ6+ and x,y++.

The next proposition tells us that none of our processes hit zero in finite time.

Proposition 6.3. Let (X, Y) be the process given by (6.6). If (X0,Y0)++2, then (Xt,Yt)++2 for all t ≥ 0 almost surely. A similar conclusion holds for all of the other processes we work with.

Proof. As an example of the method of proof, we look at the process (X, Y) given by (6.6). Taking logarithms and using Itô’s lemma,

dlogXt=(μα(aXt+cYt)12σX2)dt+σXdUt.

Therefore,

logXt=0t(μα(aXs+cYs)12σX2)ds+σXUt.

can’t go to −∞ in finite time because Xt and Yt do not blow up.

Proposition 6.4. Let (X, Y) be the process given by (6.6) and fix (x,y)++2. Any sequence {tn}n such that tn → ∞ has a subsequence {un}n such that the sequence of probability measures

1un0un(x,y){(Xs,Ys)}ds

converges in the topology of weak convergence of probability measures on +2. Any such limit is a stationary distribution for the process (X, Y) thought of as a process with state space +2.

Proof. Set φ(x, y) := x + y so that φ ≥ 0 for x,y > 0. Put ψ (x, y) = μ · αx + μ · βyx(ax + cy) − y(cx + by). Note that ψ is bounded above on the quadrant x, y ≥ 0 and lim||(x,y)||→∞ψ(x, y) = −∞ where ||·|| is the Euclidean distance on 2. Using Itô’s lemma we get

φ(Xt,Yt)0tψ(Xs,Ys)ds=0tσYYsdVs+0tσXXsdUs.

Therefore, φ(Xt,Yt)0tψ(Xs,Ys)ds is a martingale. Applying Theorem 9.9 of Ethier and Kurtz (2005) completes the proof.

The following result is essentially Theorem 10 in Liu et al. (2011). We include the proof for completeness.

Proposition 6.5. Suppose that α·μ−α·Σα/2 > 0, β·μ−β·Σβ/2 > 0, and I(α,β)<0. If (X, Y) is the process given by (6.6), then limt→∞ limtYt=0(x,y)a.s. for all (x,y)++2.

Proof. Using Ito’s lemma and the definition of I(α,β),

alog(YtY0)tclog(XtX0)t=a(μβσY22)c(μασX22)(abc2)0tYsdst+aσYVttcσXUtt=aI(α,β)(abc2)0tYsdst+aσYVttcσXUtt.

By the Cauchy-Schwarz inequality, (abc2) = 〈α, ακβ, βκ − (〈α, βκ)2 ≥ 0, and so

log(YtY0)tcalog(XtX0)t+I(α,β)+σYVttcaσXUtt.

Let X be the process defined by (6.7) with X0=X0. Proposition 2.3 implies

limt1t0tXsds=(μασX22)aalmost surely. (6.8)

It follows from Theorem 6.1 that XtXt for all t ≥ 0. Thus, with probability one,

limsuptlogXttlimsuptlogXtt=(μασX22)alimt1t0tXsds+σXlimtUtt=(μασX22)a(μασX22)a=0.

Since U and V are Brownian motions, limtUtt=limtVtt=0, and limsuptlogXtt0 almost surely, so

limsuptlogYttI(α,β)<0almost surely.

In particular, limtt→∞ Yt = 0 almost surely.

We can now finish the proof of Theorem 4.1. Fix ε > 0 and η > 0 sufficiently small. Define the stopping time

Tϵinf{t0:Ytϵ}.

and the stopped process XtϵXtTϵ. By Proposition 6.5, there exists T > 0 such that

(x,y){Ytϵfor alltT}1η

Define the process Xˇ via

dXˇt=Xˇt[(μαcϵ)aXˇt]dt+σXXˇtdUt

and the stopped process XˇtϵXˇtTϵ. Start the process Xˇ at time T with the condition XˇT=XT. We want to show that the process Xˇϵ is dominated by the process Xε, that is XtϵXˇtϵ for all tT. By the strong Markov property, we can assume T = 0.

The proof is very similar to the one from Theorem 6.1. With the notation from the proof of Theorem 6.1, we have

0tρ(XˇsϵXsϵ)1𝟙{XˇsϵXsϵ>0}d[XˇϵXϵ]s=0tρ(XˇsϵXsϵ)1(σXXˇsϵσXXsϵ)2𝟙{XˇsϵXsϵ>0}]dsσX2t

so the local time of the process XˇϵXϵ at zero is identically zero. Then, using Tanaka’s formula

(XˇtϵXtϵ)+=0tTϵ𝟙{XˇsXs>0}(σXXˇsσXXs)dUt+0tTϵ𝟙{XˇsXs>0}[((μαcϵ)XˇsaXˇs2)(μαXsXs(cYs+aXs))]ds.

Taking expectations,

𝔼[(XˇtϵXtϵ)+]=𝔼0tTϵ𝟙{XˇsXs>0}[(μα(XˇsXs)(cϵXˇscXsYs)a(Xˇs2Xs2))ds]μα𝔼0tTϵ(XˇsXs)+dsμα𝔼0t(XˇsϵXsϵ)+ds.

By Gronwall’s Lemma, 𝔼[(XˇtϵXtϵ)+]=0. As a result, remembering we assumed T = 0, we have XˇtϵXtϵ for all tT. For ε small enough we know that Xˇ has a stationary distribution concentrated on ++. For any sequence an → ∞, if the Cesaro averages 1an0an(x,y){(Xs,Ys)} converge weakly, then the limit is a distribution of the form φ ⊗ δ0, where φ is a mixture of the unique stationary distribution ρX described in Proposition 2.3 and the point mass at 0. By the above, the limit of 1an0an(x,y){(Xs,Ys)} cannot have any mass at (0, 0) because XˇtXt on the event {Ytε for all tT} that has probability (x,y){Ytϵfor alltT}1η. Since η > 0 was arbitrary, we conclude that φ=ρXδ0 required.

Appendix C: Proof of Theorem 4.2

Our proof is along the same lines as the proofs of Theorems 4 and 5 in Schreiber et al. (2011).

We will once again simplify our notation by re-writing the SDE for the pair (X, Y) as in (6.6). We assume throughout this appendix that the hypotheses of Theorem 4.2 hold; that is, I(α,β)>0 and I(β,α)>0.

Let ((Xt,Yt))t0 be the stochastic process defined by the pair of stochastic differential equations in (6.7) with initial conditions (X0,Y0)=(X0,Y0). We know from Theorem 6.1 that XtXt and YtYt for all t ≥ 0.

Note from Corollary 3.3 that α · (μ − Σα/2) > 0 and β · (μ − Σβ/2) > 0 and hence, by Proposition 2.3, the process (X,Y) has a unique stationary distribution on ++2 and is strongly ergodic.

Let

Πt()1t0t𝟙{(Xs,Ys)}ds

be the normalized occupation measures of (X, Y). We know that the random probability measures

Πt()1t0t𝟙{(Xs,Ys)}ds

converge almost surely and so, in particular, they are tight on +2=[0,)2; that is, for any ε > 0 we can find a box [0,K] × [0,K] such that

1t0t𝟙{(Xs,Ys)[0,K]×[0,K]}ds>1ϵfor allt>0.

Therefore,

1t0t𝟙{(Xs,Ys)[0,K]×[0,K]}ds1t0t𝟙{(Xs,Ys)[0,K]×[0,K]}ds>1ϵfor allt>0,

and hence the normalized occupation measures of (X, Y) are also tight on +2. By Prohorov’s theorem (Kallenberg, 2002, Theorem 16.3), there exists a random probability measure ν on +2 and a (possibly random) sequence (tn)++ such that tn → ∞ for which

Πtnν (6.9)

as n → ∞ almost surely, where ⇒ denotes weak convergence of probability measures on +2. That is, with probability one for all bounded and continuous function u:+2 we have

+u(x,y)Πtn(dx,dy)+u(x,y)ν(dx,dy)

as n → ∞.

Proposition 6.6. The probability measure ν is almost surely a stationary distribution for (X, Y) thought of as a process with state space +2.

Proof. Let (Pt)t≥0 be the semigroup of the process (X, Y) thought of as a process on +2. For simplicity let us write Zt := (Xt, Yt) for all t ≥ 0 and νn := Πtn.

By the Strong Law of Large Numbers for martingales, we have that for all r+ and all bounded measurable functions f

limκ1ki=0k1[f(Zr+(i+1)t)Ptf(Zr+it)]=0almost surely.

As a result,

1k(tktf(Zs)ds0(kt)tPtf(Zs)ds)=1ki=0k10t[f(Zr+(i+1)t)Ptf(Zr+it)]dr0askalmost surely.

This implies that

limu1u0u[f(Zs+t)Pt(Zs)]ds=0almost surely.

Thus,

fdνPtfdν=limn(fdνnPtfdνn)=limn1tn[0tn(f(Zs)Ptf(Zs))ds]=limn1tn[0tnt(f(Zs+t)Ptf(Zs))ds+0tf(Zs)dstnttnPtf(Zs)ds]=limn1tn[0tnt(f(Zs+t)Ptf(Zs))ds]=0almost surely. (6.10)

The last result is equivalent to saying that ν is almost surely a stationary distribution for (X, Y).

Proposition 6.7. There exists a stationary distribution π of (X, Y) that assigns all of its mass to ++2.

Proof. We argue by contradiction. Because the process stays in one of the four sets ++, ++×{0}, {0}×++, {(0,0)} when it is started in the set, any stationary distribution (X, Y) thought of as a process on +2 can be written as a convex combination of stationary distributions that respectively assign all of their masses to one of the four sets, should such a stationary distribution exist for the given set. Suppose there is no stationary distribution that is concentrated on ++2. Then, any stationary distribution is the convex combination of stationary distributions that respectively assign all of their mass to the three sets ++×{0}, {0}×++, and {(0,0)}, and hence any stationary distribution of the form

pXμX+pYμY+p0δ(0,0),

where the random variables pX, pY, p0 are nonnegative and pX + pY + p0 = 1 almost surely, and μX=ρXδ0 and μY=δ0ρY for ρX and ρY the unique stationary distributions of X and Y. Next, we proceed as in Proposition 6.5 to find the limit of logXtntn. Let us first argue that

limn1tn0tnXsds=+2xν(dx,dy)limn1tn0tnYsds=+2yν(dx,dy)almost surely. (6.11)

Note that the infinitesimal generator of (log X, log Y) thought of as a process on 2 is uniformly elliptic with smooth coefficients and so it has smooth transition densities (see, for example, Section 3.3.4 of Stroock (2008)). Moreover, an application of a suitable minimum principle for the Kolmogorov forward equation (see, for example, Theorem 5 in Section 2 of Chapter 2 of Friedman (1964)) shows that the transition densities are everywhere strictly positive. It follows that (X, Y) thought of as a process on +2 has smooth transition densities that are everywhere positive.

Because the process X also has smooth, every positive transition densities for similar reasons, the almost sure behavior of the X started from a fixed point is the same as it is starting from its stationary distribution ρX. As a result, we get by Birkhoff’s pointwise ergodic theorem (Kallenberg, 2002, Theorem 10.6) that, for all K > 0,

limn1tn0tnXs𝟙{Xs>K}ds=𝔼ρX[Xs𝟙{Xs>K}]

x almost surely for any x+. Therefore, by dominated convergence

limKlimn1tn0tnXs𝟙{Xs>K}ds=limm𝔼ρX[Xs𝟙{Xs>K}]=0.

The following inequalities are immediate due to the positivity of the terms

1tn0tnXs𝟙{XsK}ds1tn0tnXsds=1tn0tnXs𝟙{XsK}ds+1tn0tnXs𝟙{Xs>K}ds. (6.12)

Recall that XtXt for all t ≥ 0 and hence

1tn0tnXs𝟙{Xs>K}ds1tn0tnXs𝟙{Xs>K}ds.

This implies

limsupn1tn0tnXs𝟙{Xs>K}dslimsupn1tn0tnXs𝟙{Xs>K}ds,

and therefore

0limKlimsupn1tn0tnXs𝟙{Xs>K}dslimKlimsupn1tn0tnXs𝟙{Xs>K}ds=0. (6.13)

By (6.9) and Theorem 4.27 of Kallenberg (2002),

limn1tn0tnXs𝟙{XsK}ds=++2x𝟙{xK}ν(dx,dy).

for any K such that

ν({K}×+)=0.

While this last condition need not hold a priori for all K, we can only have

ν({K}×+)>0

for countably many K, so there exists a sequence (Km)+ such that Km → ∞ as m → ∞ with

ν({Km}×+)=0.

By dominated convergence,

limmlimn1tn0tnXs𝟙{XsKm}ds=limK+2x𝟙{xK}ν(dx,dy)=+2xν(dx,dy). (6.14)

Combining (6.12), (6.13) and (6.14) gives (6.11).

It follows from Itô’s formula, the observation I(α,α)=0, (6.11), and the fact that limnUtntn=0 that

limnlogXtntn=μασX22𝔼ν[aXt+bYt]=pX(μαa𝔼ρX[Xt]σX22)+pY(μαb𝔼ρY[Yt]σX22)+p0(μασX22)=pXI(α,α)+pYI(α,β)+p0(μασX22)=pYI(α,β)+p0(μασX22)almost surely.

By assumption, I(α,β)>0 and we have already observed that μασX22>0. Because Xt converges in distribution as t → ∞ to a distribution that assigns all of its mass to ++2, it follows that log logXtntn converges in probability to 0. However, since XtXt for all t ≥ 0 it follows that pYI(α,β)+p0(μασX22)0 and hence

pY=p0=0almost surely. (6.15)

The same argument applied to (Yt)t≥0 establishes

pX=p0=0almost surely. (6.16)

Therefore, pX = pY = p0 = 0, and this contradicts the assumption that pX + pY + p0 = 1.

We can now finish the proof of Theorem 4.2.

Proof. Proposition 6.7 implies that (X, Y) has a stationary distribution π on ++2. By Theorem 20.17 from Kallenberg (2002), our process (X, Y) is either Harris recurrent or uniformly transient. We say that (Xt, Yt) → ∞ almost surely as t → ∞ if 𝟙K(Xt,Yt)0 as t → ∞ for any compact set K++2. Theorem 20.21 from Kallenberg (2002) gives that if (X,Y) is transient, then (Xt, Yt) → ∞ and so (X, Y) cannot have a stationary distribution. Hence, since we know our process has a stationary distribution π, it must be Harris recurrent. Theorem 20.21 from Kallenberg (2002) then gives us equation (4.1).

Theorem 20.18 from Kallenberg (2002), 20.18 gives that any Harris recurrent Feller process on ++2 with strictly positive transition densities has a locally finite invariant measure that is equivalent to Lebesgue measure and is unique up to a normalization. We already know that we have a stationary distribution, so this distribution is unique and has an almost everywhere strictly positive density with respect to Lebesgue measure. Theorem 20.12 from Kallenberg (2002) says that any Harris recurrent Feller process is strongly ergodic, and so equation (4.2) holds.

Remark 6.8. In Theorem 3.1 of Zhang and Chen (2013), the authors claim to show that the system of SDE describing (X, Y) always has a unique stationary distribution. We note that their use of moments just checks tightness in +2[0,)2 and not in ++2=(0,)2. It does not stop mass going off to +2\++2=(+×{0})({0}×+), which is exactly what can happen in our case. Thus, their proof only shows the existence of a stationary distribution on +2 − it does not show the existence of a stationary distribution on ++2. Furthermore, their proof for the uniqueness of a stationary distribution on +2 breaks down because their assumption of irreducibility is false. The process (X, Y) is irreducible on ++2, but it is not irreducible on +2 since Pt((0,0),U)(0,0){(Xt,Yt)U}=0 for any open subset U that lies in the interior of +2. If we work on +2, it is not true that the diffusion (X, Y) has a unique stationary distribution. We can obtain infinitely many stationary distributions on +2 of the form (uρX+vδ0)δ0 where ρX is the unique stationary distribution of X on ++ and u,v+ satisfy u + v = 1.

Appendix D: Proof of Theorem 5.1

Assume that the matrix Σ is positive definite and that the dispersion proportion vector α is such that μ · αα · Σα/2 > 0 so that a population playing the strategy α persists. Under these assumptions the function βI(α,β) is strictly concave. Hence, by the method of Lagrange multipliers, I(α,β)<0 for all βα and αi > 0 for all i if and only if there exists a constant, which we denote by λ, such that

λ=Iβi(α,β)β=α=μiκiαi(μααΣα2)α,ακjαjσij (6.17)

for all i. Multiplying (6.17) by αi and summing with respect to i, we get

λ=μαα,ακ(μααΣα2)α,ακαΣα=αΣα2

This expression for the Lagrange multiplier and (6.17) provide the characterization of a mixed ESS in equation (5.1) when αi > 0 for all i. The characterization of the more general case of αi > 0 for at least two patches follows similarly by restricting the method of Lagrange multiples to the appropriate face of the probability simplex.

Suppose that μiσii/2 > 0 so that a population remaining in patch i and not dispersing to other patches persists. The strategy αi = 1 and αj = 0 for all ji is an ESS only if

Iβj(α,β)β=αIβi(α,β)β=α<0

for all ji. Evaluating these partial derivatives gives the criterion (5.2) for the pure ESS.

We conclude by considering the case n = 2. Define the function g:[0,1] by

g(a)=Iβ1((a1,a2),(b1,b2))(a1,a2)=(a,1a),(b1,b2)=(a,1a)Iβ2((a1,a2),(b1,b2))(a1,a2)=(a,1a),(b1,b2)=(a,1a).

The inequalities (5.2) for the pure strategies (1, 0) and (0, 1), respectively, correspond to g(0) < 0 and g(1) > 0, respectively. Hence, when these inequalities are reversed, the intermediate value theorem implies there exists a ∈ (0, 1) such that g(a) = 0. Such an a satisfies the mixed ESS criterion (5.1) and, therefore, is an ESS.

Contributor Information

STEVEN N. EVANS, Department of Statistics #3860, 367 Evans Hall, University of California, Berkeley, CA 94720-3860, USA, evans@stat.berkeley.edu

ALEXANDRU HENING, Department of Statistics, 1 South Parks Road, Oxford OX1 3TG, United Kingdom, hening@stats.ox.ac.uk.

SEBASTIAN J. SCHREIBER, Department of Evolution and Ecology, University of California, Davis, CA 956116, USA, sschreiber@ucdavis.edu

References

  1. Anderson JT, Geber MA. Demographic source-sink dynamics restrict local adaptation in Elliott’s blueberry (Vaccinium elliottii) Evolution. 2010;64:370–384. doi: 10.1111/j.1558-5646.2009.00825.x. [DOI] [PubMed] [Google Scholar]
  2. Beckmann JP, Berger J. Using black bears to test ideal-free distribution models experimentally. Journal of Mammalogy. 2003;84:594–606. [Google Scholar]
  3. Cantrell RS, Cosner C, Deangelis DL, Padron V. The ideal free distribution as an evolutionarily stable strategy. Journal of Biological Dynamics. 2007;1:249–271. doi: 10.1080/17513750701450227. [DOI] [PubMed] [Google Scholar]
  4. Cantrell RS, Cosner C, Lou Y. Evolution of dispersal and the ideal free distribution. Mathematical Biosciences and Engineering. 2010;7:17–36. doi: 10.3934/mbe.2010.7.17. [DOI] [PubMed] [Google Scholar]
  5. Cantrell RS, Cosner C, Lou Y. Evolutionary stability of ideal free dispersal strategies in patchy environments. Journal of Mathematical Biology. 2012;65:943–965. doi: 10.1007/s00285-011-0486-5. [DOI] [PubMed] [Google Scholar]
  6. Chesson PL. General theory of competitive coexistence in spatially-varying environments. Theoretical Population Biology. 2000;58:211–237. doi: 10.1006/tpbi.2000.1486. [DOI] [PubMed] [Google Scholar]
  7. Childs DZ, Metcalf CJE, Rees M. Evolutionary bet-hedging in the real world: empirical evidence and challenges revealed by plants. Proceedings of the Royal Society B: Biological Sciences. 2010;277:3055–3064. doi: 10.1098/rspb.2010.0707. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Cosner C. A dynamic model for the ideal-free distribution as a partial differential equation. Theoretical Population Biology. 2005;67:101–108. doi: 10.1016/j.tpb.2004.09.002. [DOI] [PubMed] [Google Scholar]
  9. Cressman R, Křivan V. Migration dynamics for the ideal free distribution. American Naturalist. 2006;168:384–397. doi: 10.1086/506970. [DOI] [PubMed] [Google Scholar]
  10. Cressman R, Křivan V. The ideal free distribution as an evolutionarily stable state in density-dependent population games. Oikos. 2010;119:1231–1242. [Google Scholar]
  11. Cressman R, Křivan V, Garay J. Ideal free distributions, evolutionary games, and population dynamics in multiple-species environments. American Naturalist. 2004;164:473–489. doi: 10.1086/423827. [DOI] [PubMed] [Google Scholar]
  12. Doncaster CP, Clobert J, Doligez B, Danchin E, Gustafsson L. Balanced dispersal between spatially varying local populations: an alternative to the source-sink model. American Naturalist. 1997;150(4):425–445. doi: 10.1086/286074. [DOI] [PubMed] [Google Scholar]
  13. Dreisig H. Ideal free distributions of nectar foraging bumblebees. Oikos. 1995;72:161–172. [Google Scholar]
  14. Edelaar P, Bolnick DI. Non-random gene flow: an underappreciated force in evolution and ecology. Trends in Ecology & Evolution. 2012;27:659–665. doi: 10.1016/j.tree.2012.07.009. [DOI] [PubMed] [Google Scholar]
  15. Ethier SN, Kurtz TG. Markov Processes: Characterization and Convergence. Wiley; Hoboken, NJ: 2005. [Google Scholar]
  16. Evans SN, Ralph P, Schreiber SJ, Sen A. Stochastic growth rates in spatiotemporal heterogeneous environments. Journal of Mathematical Biology. 2013;66:423–476. doi: 10.1007/s00285-012-0514-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Fox LR, Eisenbach J. Contrary choices: possible exploitation of enemy-free space by herbivorous insects in cultivated vs. wild crucifers. Oecologia. 1992;89:574–579. doi: 10.1007/BF00317166. [DOI] [PubMed] [Google Scholar]
  18. Fretwell SD, Lucas HL., Jr. On territorial behavior and other factors influencing habitat distribution in birds. Acta Biotheoretica. 1969;19:16–36. [Google Scholar]
  19. Friedman A. Partial differential equations of parabolic type. Prentice-Hall Inc.; Englewood Cliffs, N.J.: 1964. [Google Scholar]
  20. Gejji R, Lou Y, Munther D, Peyton J. Evolutionary convergence to ideal free dispersal strategies and coexistence. Bulletin of Mathematical Biology. 2012;74:257–299. doi: 10.1007/s11538-011-9662-4. [DOI] [PubMed] [Google Scholar]
  21. Geritz SAH, Metz JAJ, Kisdi E, Meszena G. Dynamics of adaptation and evolutionary branching. Physical Review Letters. 1997;78:2024–2027. [Google Scholar]
  22. Godin JJ, Keenleyside MHA. Foraging on patchily distributed prey by a cichlid fish (Teleostei, Cichlidae): a test of the ideal free distribution theory. Animal Behaviour. 1984;32:120–131. [Google Scholar]
  23. Harper DGC. Competitive foraging in mallards: Ideal free ducks. Animal Behaviour. 1982;30:575–584. [Google Scholar]
  24. Hastings A. Can spatial variation alone lead to selection for dispersal? Theoretical Population Biology. 1983;24:244–251. [Google Scholar]
  25. Haugen TO, Winfield IJ, Vøllestad LA, Fletcher JM, James JB, Stenseth NC. The ideal free pike: 50 years of fitness-maximizing dispersal in Windermere. Proceedings of the Royal Society B: Biological Sciences. 2006;273:2917–2924. doi: 10.1098/rspb.2006.3659. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Holt RD. On the evolutionary stability of sink populations. Evolutionary Ecology. 1997;11:723–731. [Google Scholar]
  27. Holt RD, Barfield M. On the relationship between the ideal free distribution and the evolution of dispersal. In: Clobert J, Danchin E, Dhondt A, Nichols J, editors. Dispersal. Oxford University Press; USA: 2001. pp. 83–95. [Google Scholar]
  28. Ikeda N, Watanabe S. Stochastic differential equations and diffusion processes, volume 24 of North-Holland Mathematical Library. second edition North-Holland Publishing Co.; Amsterdam: 1989. [Google Scholar]
  29. Jaenike J. Genetic and environmental determinants of food preference in Drosophila tripunctata. Evolution. 1985;39:362–369. doi: 10.1111/j.1558-5646.1985.tb05673.x. [DOI] [PubMed] [Google Scholar]
  30. Jaenike J, Holt RD. Genetic variation for habitat preference: evidence and explanations. American Naturalist. 1991;137:S67–S90. [Google Scholar]
  31. Jansen VAA, Yoshimura J. Populations can persist in an environment consisting of sink habitats only. Proceeding of the National Academy of Sciences USA. 1998;95:3696–3698. doi: 10.1073/pnas.95.7.3696. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Kallenberg O. Foundations of Modern Probability. Springer; New York: 2002. [Google Scholar]
  33. Katzenberger GS. Solutions of a stochastic differential equation forced onto a manifold by a large drift. The Annals of Probability. 1991;19:1587–1628. [Google Scholar]
  34. Křivan V. Dynamic ideal free distribution: effects of optimal patch choice on predator-prey dynamics. American Naturalist. 1997;149:164–178. [Google Scholar]
  35. Le Gall J-F. Seminar on probability, XVII, volume 986 of Lecture Notes in Math. Springer; Berlin: 1983. Applications du temps local aux équations différentielles stochastiques unidimensionnelles; pp. 15–31. [Google Scholar]
  36. Li X, Mao X. Population dynamical behavior of non-autonomous Lotka-Volterra competitive system with random perturbation. Discrete and Continuous Dynamical Systems. 2009;24:523–545. [Google Scholar]
  37. Liu M, Wang K, Wu Q. Survival analysis of stochastic competitive models in a polluted environment and stochastic competitive exclusion principle. Bulletin of Mathematical Biology. 2011;73:1969–2012. doi: 10.1007/s11538-010-9569-5. [DOI] [PubMed] [Google Scholar]
  38. Maynard Smith J, Price GR. The logic of animal conflict. Nature. 1973;246:15–18. [Google Scholar]
  39. Mayr E. Animal species and evolution. Harvard University Press; 1963. [Google Scholar]
  40. McPeek MA, Holt RD. The evolution of dispersal in spatially and temporally varying environments. American Naturalist. 1992;6:1010–1027. [Google Scholar]
  41. Milinski M. An evolutionarily stable feeding strategy in sticklebacks. Zeitschrift für Tierpsychologie. 1979;51:36–40. [Google Scholar]
  42. Oksanen T, Power ME, Oksanen L. Ideal free habitat selection and consumer-resource dynamics. American Naturalist. 1995;146:565–585. [Google Scholar]
  43. Orians GH, Wittenberger JF. Spatial and temporal scales in habitat selection. American Naturalist. 1991;137:S29–S49. [Google Scholar]
  44. Prout T. Sufficient conditions for multiple niche polymorphism. American Naturalist. 1968;102:493–496. [Google Scholar]
  45. Ravigné V, Olivieri I, Dieckmann U. Implications of habitat choice for protected polymorphisms. Evolutionary Ecology Research. 2004;6:125–145. [Google Scholar]
  46. Robinson HS, Wielgus RB, Cooley HS, Cooley SW. Sink populations in carnivore management: Cougar demography and immigration in a hunted population. Ecological Applications. 2008;18:1028–1037. doi: 10.1890/07-0352.1. [DOI] [PubMed] [Google Scholar]
  47. Rogers LCG, Williams D. Diffusions, Markov processes, and martingales. Vol. 2. Cambridge Mathematical Library. Cambridge University Press; Cambridge: 2000. Itô calculus, Reprint of the second (1994) edition. [Google Scholar]
  48. Rosenzweig ML. A theory of habitat selection. Ecology. 1981;62:327–335. [Google Scholar]
  49. Schreiber SJ. Evolution of patch selection in stochastic environments. American Naturalist. 2012;180:17–34. doi: 10.1086/665655. [DOI] [PubMed] [Google Scholar]
  50. Schreiber SJ, Vejdani M. Handling time promotes the coevolution of aggregation in predator-prey systems. Proceedings of the Royal Society: Biological Sciences. 2006;273:185–191. doi: 10.1098/rspb.2005.3236. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Schreiber SJ, Fox LR, Getz WM. Coevolution of contrary choices in host-parasitoid systems. American Naturalist. 2000;155:637–648. doi: 10.1086/303347. [DOI] [PubMed] [Google Scholar]
  52. Schreiber SJ, Fox LR, Getz WM. Parasitoid sex allocation a ects coevolution of patch selection in host-parasitoid systems. Evolutionary Ecology Research. 2002;4:701–718. [Google Scholar]
  53. Schreiber SJ, Benäim M, Atchadé KAS. Persistence in fluctuating environments. Journal of Mathematical Biology. 2011;62:655–683. doi: 10.1007/s00285-010-0349-5. [DOI] [PubMed] [Google Scholar]
  54. Sokurenko EV, Gomulkiewicz R, Dykhuizen DE. Source–sink dynamics of virulence evolution. Nature Reviews Microbiology. 2006;4:548–555. doi: 10.1038/nrmicro1446. [DOI] [PubMed] [Google Scholar]
  55. Stroock DW. Partial differential equations for probabilists, volume 112 of Cambridge Studies in Advanced Mathematics. Cambridge University Press; Cambridge: 2008. [Google Scholar]
  56. Tittler R, Fahrig L, Villard MA. Evidence of large-scale source-sink dynamics and long-distance dispersal among Wood Thrush populations. Ecology. 2006;87:3029–3036. doi: 10.1890/0012-9658(2006)87[3029:eolsda]2.0.co;2. [DOI] [PubMed] [Google Scholar]
  57. Tregenza T. Building on the ideal free distribution. Advances in Ecological Research. 1995;26:253–307. [Google Scholar]
  58. Turelli M, Schemske DW, Bierzychudek P. Stable two-allele polymorphisms maintained by fluctuating fitnesses and seed banks: protecting the blues in Linanthus parryae. Evolution. 2001;55:1283–1298. doi: 10.1111/j.0014-3820.2001.tb00651.x. [DOI] [PubMed] [Google Scholar]
  59. van Baalen M, Sabelis MW. Coevolution of patch selection strategies of predator and prey and the consequences for ecological stability. American Naturalist. 1993;142:646–670. doi: 10.1086/285562. [DOI] [PubMed] [Google Scholar]
  60. van Baalen M, Křivan V, van Rijn PCJ, Sabelis MW. Alternative food, switching predators, and the persistence of predator-prey systems. American Naturalist. 2001;157:512–524. doi: 10.1086/319933. [DOI] [PubMed] [Google Scholar]
  61. Via S. Ecological genetics and host adaptation in herbivorous insects: The experimental study of evolution in natural and agricultural systems. Annual Review for Entomology. 1990;35:421–446. doi: 10.1146/annurev.en.35.010190.002225. [DOI] [PubMed] [Google Scholar]
  62. Zhang Z, Chen D. A new criterion on existence and uniqueness of stationary distribution for diffusion processes. Advances in Difference Equations. 2013;2013:13. [Google Scholar]

RESOURCES