PROTECTED POLYMORPHISMS AND EVOLUTIONARY STABILITY OF PATCH-SELECTION STRATEGIES IN STOCHASTIC ENVIRONMENTS

STEVEN N EVANS; ALEXANDRU HENING; SEBASTIAN J SCHREIBER

doi:10.1007/s00285-014-0824-5

Author manuscript; available in PMC: 2015 Aug 1.

Published in final edited form as: J Math Biol. 2014 Aug 24;71(2):325–359. doi: 10.1007/s00285-014-0824-5


PMCID: PMC4486641 NIHMSID: NIHMS692112 PMID: 25151369

Abstract

We consider a population living in a patchy environment that varies stochastically in space and time. The population is composed of two morphs (that is, individuals of the same species with different genotypes). In terms of survival and reproductive success, the associated phenotypes differ only in their habitat selection strategies. We compute invasion rates corresponding to the rates at which the abundance of an initially rare morph increases in the presence of the other morph established at equilibrium. If both morphs have positive invasion rates when rare, then there is an equilibrium distribution such that the two morphs coexist; that is, there is a protected polymorphism for habitat selection. Alternatively, if one morph has a negative invasion rate when rare, then it is asymptotically displaced by the other morph under all initial conditions where both morphs are present. We refine the characterization of an evolutionarily stable strategy (ESS) for habitat selection from [Schreiber, 2012] in a mathematically rigorous manner. We provide a necessary and sufficient condition for the existence of an ESS that uses all patches and determine when using a single patch is an ESS. We also provide an explicit formula for the ESS when there are two habitat types. We show that adding environmental stochasticity results in an ESS that, when compared to the ESS for the corresponding model without stochasticity, spends less time in patches with larger carrying capacities and possibly makes use of sink patches, thereby practicing a spatial form of bet hedging.

Keywords: density-dependent, frequency-dependent, protected polymorphism, evolutionarily stable strategy, exclusion, dimorphic, ideal-free, invasion rate, habitat selection, bet hedging

1. Introduction

Habitat selection by individuals impacts key attributes of a population including its spatial distribution, temporal fluctuations in its abundance, and its genetic composition. In environmentally heterogeneous landscapes, individuals selecting more favorable habitats are more likely to survive or reproduce. As population densities increase in these habitats, individuals may benefit by selecting previously unused habitats. Thus, both environmental conditions and density-dependent feedbacks generate selective pressures on habitat selection. Under equilibrium conditions, spatial heterogeneity can select for populations exhibiting an ideal-free distribution, that is, equal per-capita growth rates in all occupied patches and lower per-capita growth rates if individuals moved into unoccupied patches (Fretwell and Lucas, 1969). Under non-equilibrium conditions, spatial-temporal heterogeneity can select for individuals occupying sink habitats in which the per-capita growth rate is always negative (Holt, 1997; Jansen and Yoshimura, 1998). Environmental heterogeneity can also promote coexistence of genotypes only differing in their habitat choices (Jaenike and Holt, 1991). Despite significant advances in the mathematical theory for habitat selection under equilibrium conditions, a mathematical theory for habitat selection in stochastic environments is largely lacking. Here, we take a step toward addressing this mathematical shortfall while at the same time gaining new insights into the evolution of habitat selection for populations living in stochastic, patchy environments.

Since the classic paper Fretwell and Lucas (1969), the ideal-free distribution has been studied extensively from empirical, theoretical, and mathematical perspectives. Empirical support for ideal-free distributions exists for many taxa including fish (Godin and Keenleyside, 1984; Oksanen et al., 1995; Haugen et al., 2006), birds (Harper, 1982; Doncaster et al., 1997), mammals (Beckmann and Berger, 2003), and insects (Dreisig, 1995). For example, Oksanen et al. (1995) found that armored catfish in Panamanian stream pools were distributed such that the resource availability per catfish was equal in all occupied pools, despite significant variation in light availability across these occupied pools. Theoreticians have identified several “non-ideal” mechanisms (e.g. sedentarism, adaptive movement with finite speed, density-dependent dispersal) that, under equilibrium conditions, generate an ideal-free distribution (Hastings, 1983; Cosner, 2005; Gejji et al., 2012). For example, at equilibrium, sedentary populations achieve an ideal-free distribution provided, paradoxically, the populations initially occupied all habitat patches. While many early studies asserted that the ideal free distribution is an evolutionarily stable strategy (ESS) (Fretwell and Lucas, 1969; van Baalen and Sabelis, 1993; Schreiber et al., 2000), only recent advanced nonlinear analyses fully verified this assertion (Cressman et al., 2004; Cressman and Křivan, 2006, 2010; Cantrell et al., 2007, 2010, 2012).

In nature, observed habitat occupancies are frequently less extreme than predicted by the ideal-free distribution: individuals underuse higher quality habitats and overuse lower quality habitats compared to theoretical predictions (Milinski, 1979; Tregenza, 1995). Notably, populations occupying sink habitats have been documented in many species (Sokurenko et al., 2006; Tittler et al., 2006; Robinson et al., 2008; Anderson and Geber, 2010). One possible explanation for these observations is that populations experience temporal as well as spatial variation in environmental conditions and, consequently, theory based on equilibrium assumptions tells an incomplete story. In support of this explanation, several theoretical studies have shown that occupation of sink habitats should evolve when temporal variation is sufficiently great in other habitats (Holt, 1997; Jansen and Yoshimura, 1998; Holt and Barfield, 2001; Schreiber, 2012). These theoretical developments, however, rely on linearizations of density-dependent models, and do not analyze the dynamics of competing genotypes, the ultimate basis for evolutionary change due to natural selection. Hence, these studies leave unanswered the question, “Does the linear analysis correctly identify competitive exclusion in pairwise interactions that is the basis for the analysis of evolutionarily stable strategies?”

Within populations, individuals can exhibit different habitat selection strategies, and there is some evidence these differences can be genetically based (Via, 1990; Jaenike and Holt, 1991). For instance, some individuals of the fruit fly species Drosophila tripunctata prefer tomato host plants (one potential habitat for its larvae) while others prefer mushrooms (another potential habitat), and these differences are based on two genetically independent traits, settling behavior and ovipositor site preference (Jaenike, 1985). Jaenike and Holt (1991) found that genetic variation in habitat selection is common, especially in arthropods and mollusks. Furthermore, they demonstrated using mathematical models that this genetic variation can stem from density-dependent regulation occurring locally within each habitat. Specifically, Jaenike and Holt write “frequency-dependent selection favors alleles that confer upon their carriers a preference for underused habitats, even if there is no genetic variation in how well individuals are adapted to the different habitat” (Jaenike and Holt, 1991, p.S78). Their analysis, however, doesn’t account for temporal fluctuations in environmental conditions and this raises the question, “Does environmental stochasticity facilitate or hinder the maintenance of genetic variation in habitat selection?”

To answer the aforementioned questions, we provide an in-depth analysis of a model introduced in (Schreiber, 2012). The single genotype (i.e. monomorphic) version of this model and a characterization of its dynamics are given in Section 2. The competing genotype (i.e. dimorphic) version of the model and invasion rates of each genotype when rare are introduced in Section 3. In Section 4, we prove that these invasion rates determine the long-term fate of each of the genotypes. Specifically, if both genotypes have positive invasion rates when rare, then there is a positive stationary distribution under which the genotypes coexist. Alternatively, if one genotype has a negative invasion rate when rare, then it is asymptotically displaced by the other genotype. These results allow us to use the invasion rates when rare to explore conditions supporting a protected polymorphism for habitat selection. In Section 5, we refine the characterization of an evolutionarily stable strategy for habitat selection from (Schreiber, 2012) in a mathematically rigorous manner, and provide an explicit formula for this ESS when there are two habitat types. Section 6 concludes with a discussion of how our results relate to the existing literature and identifies future challenges for the theory of habitat selection in stochastic environments.

2. The Monomorphic Model

To set the stage for two competing populations spread over several patches, we start with a single population living in one patch. Let Z_t be the population abundance at time t ≥ 0. The stochastic process (Z_t)_t≥0 is governed by the Itô stochastic logistic equation

{dZ}_{t} = Z_{t} (μ - κ Z_{t}) dt + σ Z_{t} {dW}_{t},

(2.1)

where μ is the intrinsic rate of growth of the population in the absence of stochasticity, κ is the strength of intraspecific competition, σ² > 0 is the infinitesimal variance parameter of the stochastic growth rate, and (W_t)_t≥0 is a standard Brownian motion. The process (Z_t)_t≥0 is a strong Markov process with continuous paths. We call an object with such properties a diffusion.

As shown in our first proposition, the process (Z_t)_t≥0 lives in the positive half line $ℝ_{+ +} ≔ (0, \infty)$; that is, if we start it in a strictly positive state, then it never hits zero. Furthermore, the long-term behavior of the process is determined by the stochastic rate of growth $μ - \frac{σ^{2}}{2}$. When the stochastic growth rate is negative, the population abundance converges asymptotically to zero with probability one. On the other hand, when this parameter is positive, the distribution of the abundance converges to an equilibrium given by a Gamma distribution. These results are well-known, but, as an introduction to the methods used to prove our main results, we provide a proof in Appendix A.

Proposition 2.1. Consider the diffusion process (Z_t)_t≥0 given by the stochastic differential equation (2.1).

The stochastic differential equation has a unique strong solution that is defined for all t ≥ 0 and is given by
$Z_{t} = \frac{Z_{0} \exp ((μ - σ^{2} ∕ 2) t + σ W_{t})}{1 + κ Z_{0} \int_{0}^{t} \exp ((μ - σ^{2} ∕ 2) s + σ W_{s}) ds} .$
If Z₀ = z > 0, then Z_t > 0 for all t ≥ 0 almost surely.
If $μ - \frac{σ^{2}}{2} < 0$, then lim_t→∞ Z_t = 0 almost surely.
If $μ - \frac{σ^{2}}{2} = 0$, then lim inf_t→∞ Z_t = 0 almost surely, lim sup_t→∞ Z_t = ∞ almost surely, and $\lim_{t \to \infty} \frac{1}{t} \int_{0}^{t} Z_{s} ds = 0$ almost surely.
If $μ - \frac{σ^{2}}{2} > 0$, then (Z_t)_t≥0 has a unique stationary distribution ρ on $ℝ_{+ +}$ with Gamma density $g (x) = \frac{1}{Γ (k) θ^{k}} x^{k - 1} e^{- \frac{x}{θ}}$, where
$θ ≔ \frac{σ^{2}}{2 κ}$ and $k ≔ \frac{2 μ}{σ^{2}} - 1 .$

Moreover, if Z₀ = z > 0, then

\lim_{t \to \infty} \frac{1}{t} \int_{0}^{t} h (Z_{s}) ds = \int_{0}^{\infty} h (x) g (x) dx almost surely

for any Borel function $h : ℝ_{+ +} \to ℝ$ with $\int_{0}^{\infty} ∣ h (x) ∣ g (x) d x < \infty$ . In particular,

\lim_{t \to \infty} \frac{1}{t} \int_{0}^{t} Z_{s} ds = \frac{1}{κ} \cdot (μ - \frac{σ^{2}}{2}) almost surely .
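As a numerical illustration of Proposition 2.1 (not part of the original analysis), the following Python sketch computes the Gamma parameters k and θ and compares their product, the stationary mean, with a time average along one simulated path of (2.1). The path is generated by an Euler scheme applied to log Z_t, which keeps the discretized abundance positive. The function names, the step size, the seed, and the parameter values are all illustrative assumptions.

```python
import math
import random

def stationary_gamma(mu, kappa, sigma):
    """Shape k and scale theta of the stationary Gamma density in Proposition 2.1.

    Only meaningful when the stochastic growth rate mu - sigma**2 / 2 is positive.
    """
    if mu - sigma ** 2 / 2 <= 0:
        raise ValueError("mu - sigma^2/2 <= 0: no stationary distribution on (0, inf)")
    k = 2 * mu / sigma ** 2 - 1
    theta = sigma ** 2 / (2 * kappa)
    return k, theta

def simulate_time_average(mu, kappa, sigma, z0=1.0, dt=0.01, steps=200_000, seed=0):
    """Time average of Z along one discretized path of (2.1).

    By Ito's lemma, d log Z = (mu - sigma^2/2 - kappa Z) dt + sigma dW,
    and the Euler scheme is applied to this equation for log Z.
    """
    rng = random.Random(seed)          # fixed seed: an arbitrary, reproducible choice
    log_z = math.log(z0)
    total = 0.0
    sqrt_dt = math.sqrt(dt)
    for _ in range(steps):
        z = math.exp(log_z)
        total += z
        noise = sigma * sqrt_dt * rng.gauss(0.0, 1.0)
        log_z += (mu - sigma ** 2 / 2 - kappa * z) * dt + noise
    return total / steps

# Illustrative parameter values (assumptions, not taken from the paper).
mu, kappa, sigma = 1.0, 2.0, 0.5
k, theta = stationary_gamma(mu, kappa, sigma)
stationary_mean = k * theta                      # mean of a Gamma(k, theta) law
predicted = (mu - sigma ** 2 / 2) / kappa        # last display of Proposition 2.1
simulated = simulate_time_average(mu, kappa, sigma)
```

For these values, k θ and (μ − σ²/2)/κ coincide, and the simulated time average should be close to both, up to Monte Carlo and discretization error.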

Next, we consider a population living in a spatially heterogeneous environment with n different patches. These patches may represent distinct habitats, patches of the same habitat type, or combinations thereof. The abundance of the population in the i-th patch at time t ≥ 0 is ${\overset{‒}{X}}_{t}^{i}$ . Let ${({\overset{‒}{X}}_{t}^{i})}_{t \geq 0}$ be given by

d {\overset{‒}{X}}_{t}^{i} = {\overset{‒}{X}}_{t}^{i} (μ_{i} - κ_{i} {\overset{‒}{X}}_{t}^{i}) dt + {\overset{‒}{X}}_{t}^{i} {dE}_{t}^{i},

(2.2)

where μ_i is the intrinsic rate of growth the population in patch i in the absence of stochasticity, κ_i is the strength of intraspecific competition in patch i, and $E_{t}^{i} = Σ_{j} γ_{j i} B_{t}^{j}$ for a standard multivariate Brownian motion (B¹, …, Bⁿ)^T on $ℝ^{n}$ and an n × n matrix Γ := (γ_ij). The infinitesimal covariance matrix for the non-standard Brownian motion $(E_{t}^{1}, \dots, E_{t}^{n})$ is Σ = (σ_ij):= Γ^TΓ.

The populations in the various patches described by equation (2.2) are coupled only by the spatial correlations present in the driving Brownian motion $(E_{t}^{1}, \dots, E_{t}^{n})$ . We further couple the population dynamics across patches by assuming the fraction of population in patch i equals α_i for all time. This spatial distribution can be realized at the scale of the individual when, as described in greater detail in Remark 2.2, individuals disperse rapidly and independently of one another in such a manner that the fraction of time spent in patch i equals α_i for each individual. Under this assumption, we call α = (α₁,α₂, ⋯ α_n), with α_i ≥ 0 for all 1 ≤ i ≤ n and $Σ_{i = 1}^{n}$ α_i = 1, a patch-selection strategy. Continuing to denote the abundance of the population in the i-th patch at time t ≥ 0 as ${\overset{‒}{X}}_{t}^{i}$ , we have ${\overset{‒}{X}}_{t}^{i} = α_{i} {\overset{‒}{X}}_{t}$ , where ${\overset{‒}{X}}_{t} = Σ_{i = 1}^{n} {\overset{‒}{X}}_{t}^{i}$ is the total population abundance at time t ≥ 0. If we impose these constraints on $({\overset{‒}{X}}^{1} \dots, {\overset{‒}{X}}^{n})$ , then it is heuristically reasonable that the process $\overset{‒}{X}$ is an autonomous Markov process that satisfies the SDE

d {\overset{‒}{X}}_{t} = {\overset{‒}{X}}_{t} \sum_{i = 1}^{n} α_{i} (μ_{i} - κ_{i} α_{i} {\overset{‒}{X}}_{t}) dt + {\overset{‒}{X}}_{t} \sum_{i = 1}^{n} α_{i} {dE}_{t}^{i} .

(2.3)

Remark 2.2. One way to justify the formulation of (2.3) rigorously is to first modify (2.2) to obtain a system of SDEs explicitly accounting for dispersal. Suppose that individuals disperse from patch i to patch j at a rate δd_ij for some fixed rate matrix D = (d_ij). As usual, we adopt the convention d_ii = −Σ_j≠i d_ij. The resulting system of SDEs is

d {\tilde{X}}_{t}^{i} = {\tilde{X}}_{t}^{i} (μ_{i} - κ_{i} {\tilde{X}}_{t}^{i}) dt + δ \sum_{j} {\tilde{X}}_{t}^{j} d_{ji} dt + {\tilde{X}}_{t}^{i} {dE}_{t}^{i} .

(2.4)

Assume that the rate matrix D has a unique stationary distribution α; that is, α_j > 0 for 1 ≤ j ≤ n, $Σ_{i = 1}^{n}$ α_i = 1,

\sum_{j = 1}^{n} α_{j} d_{ji} = 0

(2.5)

for 1 ≤ i ≤ n. In this case, a vector (y¹, …, yⁿ) satisfies

\sum_{j = 1}^{n} y^{j} d_{ji} = 0

(2.6)

for 1 ≤ i ≤ n if and only if

y^{j} = α_{j} c

(2.7)

for 1 ≤ j ≤ n for some constant c. Moreover, summing (2.7) we find that

c = \sum_{i = 1}^{n} y^{i} .

(2.8)

Note that by (2.5) we can write the drift term in (2.4) that contains δ as

δ \sum_{j} {\tilde{X}}_{t}^{j} d_{ji} dt = δ \sum_{j} ({\tilde{X}}_{t}^{j} - α_{j} {\tilde{X}}_{t}) d_{ji} dt

(2.9)

where ${\tilde{X}}_{t} ≔ Σ_{i = 1}^{n} {\tilde{X}}_{t}^{i}$. Using (2.7) and (2.8), we see that (x¹, …, xⁿ) and $x ≔ Σ_{i = 1}^{n} x^{i}$ are such that

\sum_{j = 1}^{n} (x^{j} - α_{j} x) d_{ji} = 0

(2.10)

for i = 1, …, n if and only if

\begin{matrix} x^{j} - α_{j} x & = α_{j} \sum_{i = 1}^{n} (x^{i} - α_{i} x) \\ & = 0 \end{matrix}

(2.11)

for 1 ≤ j ≤ n.

It follows from (2.9) and the equivalence between (2.10) and (2.11) that as δ increases the solution of (2.4) experiences an increasingly strong drift towards the one-dimensional subspace

{(x_{1}, \dots, x_{n}) : x_{i} = α_{i} (x_{1} + \dots + x_{n}), i = 1, \dots, n} .

In the limit δ → ∞, it is plausible that the system (2.4) converges to one for which

{\tilde{X}}_{t}^{i} = α_{i} {\tilde{X}}_{t},

where ${\tilde{X}}_{t} ≔ {\tilde{X}}_{t}^{1} + \dots + {\tilde{X}}_{t}^{n}$, and the total population size ${\tilde{X}}_{t}$ satisfies the autonomous one-dimensional SDE (2.3) with ${\overset{‒}{X}}_{t}$ replaced by ${\tilde{X}}_{t}$. This heuristic for obtaining (2.3) as a high dispersal rate limit of (2.4) can be made rigorous by applying Theorem 6.1 from Katzenberger (1991).
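To make the balance condition (2.5) of Remark 2.2 concrete, here is a small Python sketch for the two-patch case, where the stationary distribution of the rate matrix is available in closed form. The helper names and the numerical dispersal rates are hypothetical choices made for illustration only.

```python
def two_patch_stationary(d12, d21):
    """Stationary distribution alpha of D = [[-d12, d12], [d21, -d21]].

    Solves (2.5) together with alpha_1 + alpha_2 = 1 for two patches.
    """
    if d12 <= 0 or d21 <= 0:
        raise ValueError("both dispersal rates must be positive")
    total = d12 + d21
    return (d21 / total, d12 / total)

def balance_residuals(alpha, D):
    """Left-hand sides of (2.5): sum_j alpha_j d_ji for each patch i."""
    n = len(alpha)
    return [sum(alpha[j] * D[j][i] for j in range(n)) for i in range(n)]

# Illustrative rates: individuals leave patch 1 three times as fast as patch 2.
d12, d21 = 3.0, 1.0
D = [[-d12, d12], [d21, -d21]]      # rows sum to zero, as in the convention for d_ii
alpha = two_patch_stationary(d12, d21)
residuals = balance_residuals(alpha, D)
```

With these rates, individuals spend one quarter of their time in patch 1 and three quarters in patch 2, and both residuals of (2.5) vanish.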

Let $x \cdot y = Σ_{i = 1}^{n} x_{i} y_{i}$ denote the standard Euclidean inner product and define another inner product 〈·, ·〉_κ by ${〈 x, y 〉}_{κ} ≔ Σ_{i = 1}^{n} κ_{i} x_{i} y_{i}$ . Since (α · E_t)_t≥0 is a Brownian motion with infinitesimal variance parameter α · Σα, (2.3) can be expressed more simply as

d {\overset{‒}{X}}_{t} = {\overset{‒}{X}}_{t} (α \cdot μ - {〈 α, α 〉}_{κ} {\overset{‒}{X}}_{t}) dt + {\overset{‒}{X}}_{t} \sqrt{α \cdot Σ α} {dW}_{t},

(2.12)

where W_t is a standard Brownian motion.

The total population ${({\overset{‒}{X}}_{t})}_{t \geq 0}$ defined by (2.12) behaves exactly like the one-patch case defined by (2.1) with the parameter substitutions μ → α · μ, κ → 〈α, α〉_κ, and $σ \to \sqrt{α \cdot Σ α}$. In particular, ${({\overset{‒}{X}}_{t})}_{t \geq 0}$ is a diffusion process and we have the following immediate consequence of Proposition 2.1.

Proposition 2.3. Consider the diffusion process ${({\overset{‒}{X}}_{t})}_{t \geq 0}$ given by (2.12).

If ${\overset{‒}{X}}_{0} = x > 0$, then ${\overset{‒}{X}}_{t} > 0$ for all t ≥ 0 almost surely.
If $α \cdot μ - \frac{α \cdot Σ α}{2} < 0$, then $\lim_{t \to \infty} {\overset{‒}{X}}_{t} = 0$ almost surely.
If $α \cdot μ - \frac{α \cdot Σ α}{2} = 0$, then $\liminf_{t \to \infty} {\overset{‒}{X}}_{t} = 0$ almost surely, $\limsup_{t \to \infty} {\overset{‒}{X}}_{t} = \infty$ almost surely, and $\lim_{t \to \infty} \frac{1}{t} \int_{0}^{t} {\overset{‒}{X}}_{s} ds = 0$ almost surely.
If $α \cdot μ - \frac{α \cdot Σ α}{2} > 0$, then the process ${({\overset{‒}{X}}_{t})}_{t \geq 0}$ has a unique stationary distribution $ρ_{\overset{‒}{X}}$ on $ℝ_{+ +}$ with Gamma density $g (x) = \frac{1}{Γ (k) θ^{k}} x^{k - 1} e^{- \frac{x}{θ}}$, where
$θ ≔ \frac{α \cdot Σ α}{2 {〈 α, α 〉}_{κ}}$ and $k ≔ \frac{2 α \cdot μ}{α \cdot Σ α} - 1 .$

Moreover,

\lim_{t \to \infty} \frac{1}{t} \int_{0}^{t} h ({\overset{‒}{X}}_{s}) ds = \int_{0}^{\infty} h (x) g (x) dx almost surely

for any Borel function $h : ℝ_{+ +} \to ℝ$ with $\int_{0}^{\infty} ∣ h (x) ∣ g (x) d x < \infty$ . In particular,

\lim_{t \to \infty} \frac{1}{t} \int_{0}^{t} {\overset{‒}{X}}_{s} ds = \frac{1}{{〈 α, α 〉}_{κ}} α \cdot (μ - \frac{Σ α}{2}) almost surely .
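The reduction from the n-patch model to the one-dimensional equation (2.12) only involves three scalars: α · μ, 〈α, α〉_κ, and α · Σα. The following Python sketch, with hypothetical function names and illustrative parameter values, computes these quantities together with the stochastic growth rate and the long-run mean abundance given in Proposition 2.3.

```python
def dot(x, y):
    """Euclidean inner product x . y."""
    return sum(a * b for a, b in zip(x, y))

def mat_vec(M, x):
    """Matrix-vector product M x for a matrix stored as a list of rows."""
    return [dot(row, x) for row in M]

def monomorphic_summary(alpha, mu, kappa, Sigma):
    """Coefficients of (2.12) and the long-run mean abundance from Proposition 2.3."""
    drift = dot(alpha, mu)                                        # alpha . mu
    competition = sum(k * a * a for k, a in zip(kappa, alpha))    # <alpha, alpha>_kappa
    variance = dot(alpha, mat_vec(Sigma, alpha))                  # alpha . Sigma alpha
    growth = drift - variance / 2                                 # stochastic growth rate
    # The time average of the abundance is zero when the growth rate is not positive.
    mean_abundance = growth / competition if growth > 0 else 0.0
    return {
        "drift": drift,
        "competition": competition,
        "variance": variance,
        "growth": growth,
        "mean_abundance": mean_abundance,
    }

# Illustrative two-patch landscape (values are assumptions, not from the paper).
summary = monomorphic_summary(
    alpha=[0.5, 0.5],
    mu=[1.0, 2.0],
    kappa=[1.0, 3.0],
    Sigma=[[1.0, 0.0], [0.0, 1.0]],
)
```

In this example α · μ = 1.5, 〈α, α〉_κ = 1, and α · Σα = 0.5, so the stochastic growth rate and the long-run mean abundance both equal 1.25.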

For the dynamics (2.2) in patch i, Proposition 2.1 implies that if there were no coupling between patches by dispersal, then the population abundance in patch i would converge to 0 if μ_i − σ_ii/2 < 0 and converge to a non-trivial equilibrium if μ_i − σ_ii/2 > 0. As noted by Schreiber (2012) and illustrated below, the spatially coupled model is such that the population can persist and converge to an equilibrium even when μ_i − σ_ii/2 < 0 for all patches.

Persistence of coupled sink populations in symmetric landscapes

Consider a highly symmetric landscape where μ_i = r, σ_ii = σ² > 0 for all i, κ_i = a for all i, and σ_ij = 0 for all i ≠ j. If individuals are equally distributed across the landscape (α_i = 1/n for all i), then

μ_{i} - \frac{σ_{ii}}{2} = r - \frac{σ^{2}}{2} and α \cdot μ - \frac{α \cdot Σ α}{2} = r - \frac{σ^{2}}{2 n} .

The increase in the stochastic growth rate from r − σ²/2 for an isolated population to r − σ²/(2n) for the spatially coupled population stems from individuals spending equal time in patches with uncorrelated environmental fluctuations. Specifically, the environmental variance experienced by individuals distributing their time equally amongst n uncorrelated patches is n times smaller than the environmental variance experienced by an individual spending their time entirely in one patch. Whenever σ² > 2r > σ²/n, this reduction in variance allows the entire population to persist despite patches, in and of themselves, not supporting population growth.
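The variance reduction in this symmetric example can be checked directly from the definitions. With α_i = 1/n, σ_ii = σ², and σ_ij = 0 for i ≠ j, one obtains the following; the numerical values in the last line (r = 1, σ² = 3, n = 2) are an illustrative choice and do not come from the original text.

```latex
\begin{aligned}
\alpha \cdot \Sigma \alpha
  &= \sum_{i=1}^{n} \sum_{j=1}^{n} \alpha_i \sigma_{ij} \alpha_j
   = \sum_{i=1}^{n} \frac{\sigma^{2}}{n^{2}}
   = \frac{\sigma^{2}}{n},
  \qquad
  \alpha \cdot \mu = \sum_{i=1}^{n} \frac{r}{n} = r, \\
r - \frac{\sigma^{2}}{2} &= 1 - \frac{3}{2} = -\frac{1}{2} < 0,
  \qquad
  r - \frac{\sigma^{2}}{2n} = 1 - \frac{3}{4} = \frac{1}{4} > 0 .
\end{aligned}
```

In this instance each patch is a sink in isolation, yet the coupled population has a positive stochastic growth rate.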

3. Dimorphic model and invasion rates

To understand the evolution of patch-selection strategies, we now consider competition between populations that only differ in their patch-selection strategy. Let X_t and Y_t be the total population sizes at time t ≥ 0 of two populations playing the respective patch selection strategies α = (α₁,α₂,…,α_n) and β = (β₁,β₂,…,β_n), so that the densities of the populations in patch i are α_iX_t and β_iY_t at time t ≥ 0. The dynamics of these two strategies are described by the pair of stochastic differential equations

\begin{matrix} {dX}_{t} & = X_{t} \sum_{i = 1}^{n} α_{i} (μ_{i} - κ_{i} (α_{i} X_{t} + β_{i} Y_{t})) dt + X_{t} \sum_{i = 1}^{n} α_{i} {dE}_{t}^{i} \\ {dY}_{t} & = Y_{t} \sum_{i = 1}^{n} β_{i} (μ_{i} - κ_{i} (α_{i} X_{t} + β_{i} Y_{t})) dt + Y_{t} \sum_{i = 1}^{n} β_{i} {dE}_{t}^{i} . \end{matrix}

(3.1)

Since

\begin{matrix} d {[X, X]}_{t} & = X_{t}^{2} α \cdot Σ α dt \\ d {[Y, Y]}_{t} & = Y_{t}^{2} β \cdot Σ β dt \\ d {[X, Y]}_{t} & = X_{t} Y_{t} α \cdot Σ β dt, \end{matrix}

the diffusion process ((X_t, Y_t))_t≥0 for the spatially coupled, competing strategies can be represented more compactly as

\begin{matrix} {dX}_{t} & = X_{t} [μ \cdot α - {〈 α, β 〉}_{κ} Y_{t} - {〈 α, α 〉}_{κ} X_{t}] dt + X_{t} \sqrt{α \cdot Σ α} {dU}_{t} \\ {dY}_{t} & = Y_{t} [μ \cdot β - {〈 α, β 〉}_{κ} X_{t} - {〈 β, β 〉}_{κ} Y_{t}] dt + Y_{t} \sqrt{β \cdot Σ β} {dV}_{t}, \end{matrix}

(3.2)

where (U, V) is a (non-standard) Brownian motion with covariance structure d[U, U]_t = dt, d[V, V]_t = dt, and $d {[U, V]}_{t} = \frac{α \cdot Σ β}{\sqrt{α \cdot Σ α} \sqrt{β \cdot Σ β}} d t$. Using a construction similar to that in Remark 2.2, system (3.2) can be seen as a high dispersal limit. This system exhibits a degeneracy when U = V, i.e. when $\frac{α \cdot Σ β}{\sqrt{α \cdot Σ α} \sqrt{β \cdot Σ β}} = 1$. If Σ is nonsingular, then, by the Cauchy-Schwarz inequality, this degeneracy only occurs if α = β. We do not consider this possibility in what follows.
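For completeness, the covariation computation behind the passage from (3.1) to (3.2) can be written out as follows. This is a routine step added as a reading aid; it uses only $d[E^i, E^j]_t = σ_{ij} dt$ and assumes α · Σα > 0 and β · Σβ > 0 so that the normalizations are well defined.

```latex
\begin{aligned}
d[X, Y]_t
  &= X_t Y_t \sum_{i=1}^{n} \sum_{j=1}^{n} \alpha_i \beta_j \, d[E^i, E^j]_t
   = X_t Y_t \sum_{i=1}^{n} \sum_{j=1}^{n} \alpha_i \sigma_{ij} \beta_j \, dt
   = X_t Y_t \, \alpha \cdot \Sigma \beta \, dt, \\
U_t &= \frac{\alpha \cdot E_t}{\sqrt{\alpha \cdot \Sigma \alpha}},
  \qquad
  V_t = \frac{\beta \cdot E_t}{\sqrt{\beta \cdot \Sigma \beta}},
  \qquad
  d[U, V]_t = \frac{\alpha \cdot \Sigma \beta}
                   {\sqrt{\alpha \cdot \Sigma \alpha} \, \sqrt{\beta \cdot \Sigma \beta}} \, dt .
\end{aligned}
```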

To determine whether the two populations coexist or one displaces the other, we introduce the invasion rate $I (α, β)$ of a population playing strategy β when introduced at small densities into a resident population playing strategy α. As shown in the next proposition, this invasion rate is defined by linearizing the dynamics of Y and computing the long-term population growth rate $I (α, β)$ associated with this linearization. When $I (α, β) > 0$ , the population playing strategy β tends to increase when rare. When $I (α, β) < 0$ , the population playing strategy β tends to decrease when rare.

Proposition 3.1. Consider the partially linearized system

\begin{matrix} d {\overset{‒}{X}}_{t} & = {\overset{‒}{X}}_{t} [μ \cdot α - {〈 α, α 〉}_{κ} {\overset{‒}{X}}_{t}] dt + {\overset{‒}{X}}_{t} \sqrt{α \cdot Σ α} {dU}_{t} \\ d {\hat{Y}}_{t} & = {\hat{Y}}_{t} [μ \cdot β - {〈 α, β 〉}_{κ} {\overset{‒}{X}}_{t}] dt + {\hat{Y}}_{t} \sqrt{β \cdot Σ β} {dV}_{t} . \end{matrix}

(3.3)

Assume ${\overset{‒}{X}}_{0} > 0$ and ${\hat{Y}}_{0} > 0$ .

If α · (μ − Σα/2) > 0, so the Markov process $\overset{‒}{X}$ has a stationary distribution concentrated on $ℝ_{+ +}$ , then the limit $\lim_{t \to \infty} \frac{\log {\hat{Y}}_{t}}{t}$ exists almost surely and is given by

I (α, β) = β \cdot (μ - Σ β ∕ 2) - \frac{{〈 α, β 〉}_{κ}}{{〈 α, α 〉}_{κ}} α \cdot (μ - Σ α ∕ 2) .

(3.4)

On the other hand, if α·(μ − Σα/2) ≤ 0, so that $\lim_{t \to \infty} \frac{1}{t}$ $\int_{0}^{t}$ ${\overset{‒}{X}}_{s} d s = 0$ almost surely, then the limit $\lim_{t \to \infty} \frac{\log {\hat{Y}}_{t}}{t}$ exists almost surely and is given by

I (α, β) = β \cdot (μ - Σ β ∕ 2) .

(3.5)

Proof. By Itô’s lemma,

d \log {\hat{Y}}_{t} = (μ \cdot β - {〈 α, β 〉}_{κ} {\overset{‒}{X}}_{t}) dt + \sqrt{β \cdot Σ β} {dV}_{t} - \frac{1}{2} (β \cdot Σ β) dt .

Assume that $μ \cdot α - \frac{α \cdot Σ α}{2} > 0$ . By Proposition 2.3,

\lim_{t \to \infty} \frac{1}{t} \int_{0}^{t} {\overset{‒}{X}}_{s} ds = \frac{α}{2 {〈 α, α 〉}_{κ}} \cdot (2 μ - Σ α) almost surely .

Therefore,

\lim_{t \to \infty} \frac{\log {\hat{Y}}_{t}}{t} = β \cdot μ - \frac{{〈 α, β 〉}_{κ}}{2 {〈 α, α 〉}_{κ}} α \cdot (2 μ - Σ α) - \frac{1}{2} β \cdot Σ β almost surely,

as claimed.

On the other hand, assume that $μ \cdot α - \frac{α \cdot Σ α}{2} \leq 0$ . By Proposition 2.3,

\lim_{t \to \infty} \frac{1}{t} \int_{0}^{t} {\overset{‒}{X}}_{s} ds = 0 almost surely .

Therefore,

\lim_{t \to \infty} \frac{\log {\hat{Y}}_{t}}{t} = μ \cdot β - \frac{1}{2} β \cdot Σ β almost surely,

again as claimed.
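The two cases (3.4) and (3.5) of Proposition 3.1 translate into a short function. The Python sketch below is one possible transcription; the function names and the example landscape (two identical, uncorrelated patches with μ_i = κ_i = σ_ii = 1) are assumptions chosen for illustration.

```python
def dot(x, y):
    """Euclidean inner product x . y."""
    return sum(a * b for a, b in zip(x, y))

def kappa_inner(x, y, kappa):
    """Weighted inner product <x, y>_kappa = sum_i kappa_i x_i y_i."""
    return sum(k * a * b for k, a, b in zip(kappa, x, y))

def stochastic_growth(strategy, mu, Sigma):
    """strategy . (mu - Sigma strategy / 2)."""
    sigma_s = [dot(row, strategy) for row in Sigma]
    return dot(strategy, mu) - dot(strategy, sigma_s) / 2

def invasion_rate(alpha, beta, mu, kappa, Sigma):
    """I(alpha, beta): long-term growth rate of rare beta against resident alpha."""
    r_alpha = stochastic_growth(alpha, mu, Sigma)
    r_beta = stochastic_growth(beta, mu, Sigma)
    if r_alpha <= 0:
        return r_beta                       # equation (3.5): the resident does not persist
    ratio = kappa_inner(alpha, beta, kappa) / kappa_inner(alpha, alpha, kappa)
    return r_beta - ratio * r_alpha         # equation (3.4)

# Illustrative symmetric two-patch landscape.
mu, kappa = [1.0, 1.0], [1.0, 1.0]
Sigma = [[1.0, 0.0], [0.0, 1.0]]
specialist_1, specialist_2, generalist = [1.0, 0.0], [0.0, 1.0], [0.5, 0.5]

rate_specialist = invasion_rate(specialist_1, specialist_2, mu, kappa, Sigma)
rate_generalist = invasion_rate(specialist_1, generalist, mu, kappa, Sigma)
rate_self = invasion_rate(generalist, generalist, mu, kappa, Sigma)
```

In this landscape a specialist on patch 2 and a generalist both invade a resident specialist on patch 1 at rate 0.5, while a strategy introduced against itself has invasion rate 0, as (3.4) requires.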

In the next proposition, we show that if a population playing strategy β cannot invade a population playing strategy α (i.e. $I (α, β) < 0$), then the population playing strategy α can invade the population playing strategy β (i.e. $I (β, α) > 0$). This suggests, as we will show in the next section, that such a strategy α should exclude strategy β.

Proposition 3.2. Suppose that α · (μ − Σα/2) > 0 and $I (α, β) < 0$. Then, $I (β, α) > 0$.

Proof. Set A := α · (μ − Σα/2) and B := β · (μ − Σβ/2). Assume that A > 0 and $I (α, β) < 0$. To show that $I (β, α) > 0$, we consider two cases, B ≤ 0 and B > 0. Suppose B ≤ 0. Then, $I (β, α) = A > 0$ by Proposition 3.1 and by assumption.

Alternatively, suppose that B > 0. Then

I (α, β) = B - A \frac{{〈 α, β 〉}_{κ}}{{〈 α, α 〉}_{κ}} and I (β, α) = A - B \frac{{〈 α, β 〉}_{κ}}{{〈 β, β 〉}_{κ}}

by Proposition 3.1. Assume, contrary to our claim, that $I (α, β) < 0$ and $I (β, α) \leq 0$ . From the Cauchy-Schwarz inequality ${〈 x, y 〉}_{κ} \leq {〈 x, x 〉}_{κ}^{1 ∕ 2} {〈 y, y 〉}_{κ}^{1 ∕ 2}$ we get

\begin{matrix} B {〈 α, α 〉}_{κ}^{1 ∕ 2} {〈 β, β 〉}_{κ}^{1 ∕ 2} & \geq B {〈 α, β 〉}_{κ} \geq A {〈 β, β 〉}_{κ} \\ A {〈 α, α 〉}_{κ}^{1 ∕ 2} {〈 β, β 〉}_{κ}^{1 ∕ 2} & \geq A {〈 α, β 〉}_{κ} > B {〈 α, α 〉}_{κ} . \end{matrix}

The above inequalities yield the contradiction $B {〈 α, α 〉}_{κ}^{1 ∕ 2} \geq A {〈 β, β 〉}_{κ}^{1 ∕ 2}$ and $B {〈 α, α 〉}_{κ}^{1 ∕ 2} < A {〈 β, β 〉}_{κ}^{1 ∕ 2}$ .
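Proposition 3.2 can also be probed numerically. The Python sketch below draws random landscapes and strategy pairs, keeps those satisfying the hypotheses (A > 0 and I(α, β) < 0), and counts how often the conclusion I(β, α) > 0 fails. It is a sanity check under stated assumptions (three patches, diagonal Σ, arbitrary sampling ranges, hypothetical function names), not a substitute for the proof.

```python
import random

def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def growth(s, mu, Sigma):
    """Stochastic growth rate s . (mu - Sigma s / 2)."""
    return dot(s, mu) - dot(s, [dot(row, s) for row in Sigma]) / 2

def invasion_rate(resident, invader, mu, kappa, Sigma):
    """I(resident, invader) as in Proposition 3.1."""
    r_res, r_inv = growth(resident, mu, Sigma), growth(invader, mu, Sigma)
    if r_res <= 0:
        return r_inv
    cross = sum(k * a * b for k, a, b in zip(kappa, resident, invader))
    own = sum(k * a * a for k, a in zip(kappa, resident))
    return r_inv - (cross / own) * r_res

def random_strategy(rng, n):
    """A random point of the probability simplex (normalized uniform weights)."""
    weights = [rng.random() for _ in range(n)]
    total = sum(weights)
    return [w / total for w in weights]

def check_proposition(trials=5000, n=3, seed=1):
    """Return (number of cases satisfying the hypotheses, number of violations)."""
    rng = random.Random(seed)
    checked = violations = 0
    for _ in range(trials):
        mu = [rng.uniform(-1.0, 2.0) for _ in range(n)]
        kappa = [rng.uniform(0.1, 3.0) for _ in range(n)]
        # Diagonal Sigma (uncorrelated patches) is an assumption made for brevity.
        Sigma = [[rng.uniform(0.1, 2.0) if i == j else 0.0 for j in range(n)]
                 for i in range(n)]
        alpha, beta = random_strategy(rng, n), random_strategy(rng, n)
        if growth(alpha, mu, Sigma) > 0 and invasion_rate(alpha, beta, mu, kappa, Sigma) < 0:
            checked += 1
            if invasion_rate(beta, alpha, mu, kappa, Sigma) <= 0:
                violations += 1
    return checked, violations

checked, violations = check_proposition()
```

Consistent with the proposition, no violations should be found among the sampled cases.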

An immediate consequence of Proposition 3.1 is the following corollary. This corollary implies that if a population playing strategy β can invade a population playing strategy α and a population playing strategy α can invade a population playing strategy β, then a single population playing strategy α converges to a non-trivial equilibrium and the same is true of a single population playing strategy β. This suggests, as we show in the next section, that under these conditions these two strategies should coexist.

Corollary 3.3. The invasion rate satisfies $I (α, β) \leq β \cdot (μ - Σ β ∕ 2)$ . In particular, if $I (α, β) > 0$ and $I (β, α) > 0$ , then α · (μ − Σα/2) > 0 and β · (μ − Σβ/2) > 0.

4. Exclusion and protected polymorphisms

Our main result about the dimorphic process (X, Y) is that the invasion rates determine the long-term fate of competing strategies. If the invasion rates predict that strategy β cannot invade a population playing strategy α, then the population playing strategy α drives the population playing strategy β asymptotically to extinction, as shown in the following theorem. We give a proof in Appendix B.

Theorem 4.1. If α · (μ − Σα/2) > 0 and $I (α, β) < 0$ , then, for $(x, y) \in ℝ_{+ +}^{2}$ , the probability measures

\frac{1}{t} \int_{0}^{t} ℙ^{(x, y)} {(X_{s}, Y_{s}) \in \cdot} ds

converge weakly as t → ∞ to $ρ_{\overset{‒}{X}} \otimes δ_{0}$ , where $ρ_{\overset{‒}{X}}$ is the unique stationary distribution of ${\overset{‒}{X}}_{t}$ concentrated on $ℝ_{+ +}$ , and δ₀ is the point mass at 0.

On the other hand, if the invasion rates predict that each strategy can invade when rare, then the following theorem proves that the competing strategies coexist: for any initial conditions the joint distribution of (X_t, Y_t) converges as t → ∞ to a probability distribution on $ℝ_{+ +}^{2}$ with density ψ and, moreover, for any Borel set $B \subset ℝ_{+ +}^{2}$ the long-term proportion of time that (X_t, Y_t) spends in B converges to

\int_{B} ψ (x, y) dxdy .

A proof is given in Appendix C. In order to appreciate the assumptions of the theorem, it helps to recall Corollary 3.3 which says that if $I (α, β) > 0$ and $I (β, α) > 0$ then α · (μ − Σα/2) > 0 and β · (μ − Σβ/2) > 0 so that a single population playing strategy α or β will persist.

Theorem 4.2. Suppose that $I (α, β) > 0$ and $I (β, α) > 0$. Then, there exists a unique stationary distribution π of (X, Y) on $ℝ_{+ +}^{2}$ that is absolutely continuous with respect to Lebesgue measure. Moreover, for any bounded, measurable function $f : ℝ_{+ +}^{2} \to ℝ$

\lim_{t \to \infty} \frac{1}{t} \int_{0}^{t} f (X_{s}, Y_{s}) ds = \int_{ℝ_{+ +}^{2}} f (x, y) π (dx, dy) almost surely .

(4.1)

Furthermore, the process (X, Y) is strongly ergodic, so that for any initial distribution q one has

\lim_{t \to \infty} d_{TV} (ℙ^{q} {(X_{t}, Y_{t}) \in \cdot}, π) = 0,

(4.2)

where d_TV is the total variation distance.
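The coexistence described by Theorem 4.2 can be visualized by simulating (3.1). The Python sketch below applies an Euler scheme to (log X_t, log Y_t), which keeps both discretized abundances positive, and returns the time averages of X and Y along one path. It assumes independent patch noises (diagonal Σ); the function name, step size, seed, and parameter values are illustrative assumptions, and the output is a single noisy realization rather than a proof of anything.

```python
import math
import random

def simulate_dimorphic(alpha, beta, mu, kappa, sigma, x0=0.5, y0=0.5,
                       dt=0.01, steps=100_000, seed=0):
    """Time averages of X and Y along one discretized path of (3.1).

    sigma[i] is the standard deviation of the noise in patch i, so Sigma is
    diagonal with entries sigma[i] ** 2 (an assumption to keep the sketch short).
    """
    rng = random.Random(seed)
    n = len(mu)
    var_a = sum((alpha[i] * sigma[i]) ** 2 for i in range(n))   # alpha . Sigma alpha
    var_b = sum((beta[i] * sigma[i]) ** 2 for i in range(n))    # beta . Sigma beta
    log_x, log_y = math.log(x0), math.log(y0)
    sum_x = sum_y = 0.0
    sqrt_dt = math.sqrt(dt)
    for _ in range(steps):
        x, y = math.exp(log_x), math.exp(log_y)
        sum_x += x
        sum_y += y
        # Increments of E^i over one step; both morphs experience the same patch noise.
        dE = [sigma[i] * sqrt_dt * rng.gauss(0.0, 1.0) for i in range(n)]
        # Per-capita growth in patch i, reduced by the total local density.
        local = [mu[i] - kappa[i] * (alpha[i] * x + beta[i] * y) for i in range(n)]
        drift_x = sum(alpha[i] * local[i] for i in range(n)) - var_a / 2
        drift_y = sum(beta[i] * local[i] for i in range(n)) - var_b / 2
        log_x += drift_x * dt + sum(alpha[i] * dE[i] for i in range(n))
        log_y += drift_y * dt + sum(beta[i] * dE[i] for i in range(n))
    return sum_x / steps, sum_y / steps

# Two near-specialists on different patches; both invasion rates (3.4) are positive here.
mean_x, mean_y = simulate_dimorphic(
    alpha=[0.9, 0.1], beta=[0.1, 0.9],
    mu=[1.0, 1.0], kappa=[1.0, 1.0], sigma=[0.5, 0.5],
)
```

For the parameter values above, both strategies have the same stochastic growth rate and each can invade the other, so both time averages are expected to stay bounded away from zero.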

From the perspective of population genetics, the coexistence of these two strategies corresponds to a protected polymorphism: each strategy (a morph) increases when rare and, therefore, is protected from extinction. This protection from extinction, however, is only ensured over ecological time scales as mutations may result in new morphs that can displace one or both coexisting morphs (Ravigné et al., 2004). The concept of protected polymorphisms was introduced by Prout (1968) when studying deterministic models of competing haploid populations in a spatially heterogeneous environment with overlapping generations. Turelli et al. (2001) extended this concept to stochastic difference equations for competing haploid populations with a constant population size. Theorem 4.2 provides a mathematically rigorous characterization of protected polymorphisms for our stochastic models with fluctuating population sizes.

Theorem 4.2 implies that coexistence depends on the intrinsic stochastic growth rate of the populations and the competitive effect of each population on the other. The intrinsic stochastic growth rates are given by

r_{α} = α \cdot (μ - Σ α ∕ 2) and r_{β} = β \cdot (μ - Σ β ∕ 2) .

The competitive effect of the population with strategy β on the population with strategy α is given by the magnitude of α projected in the β direction (i.e. 〈β/||β||_κ, α〉_κ, where ${∣ ∣ β ∣ ∣}_{κ} = \sqrt{{〈 β, β 〉}_{κ}}$) divided by the magnitude of β (i.e. ||β||_κ). Mathematically, the competitive effect of β on α and the competitive effect of α on β are given by

C_{β, α} = \frac{{〈 β ∕ {∣ ∣ β ∣ ∣}_{κ}, α 〉}_{κ}}{{∣ ∣ β ∣ ∣}_{κ}} and C_{α, β} = \frac{{〈 α ∕ {∣ ∣ α ∣ ∣}_{κ}, β 〉}_{κ}}{{∣ ∣ α ∣ ∣}_{κ}} .

Provided r_α and r_β are positive, Theorem 4.2 implies that there is a protected polymorphism if

\frac{r_{α}}{r_{β}} > C_{β, α} and \frac{r_{β}}{r_{α}} > C_{α, β} .

(4.3)

In words, the relative intrinsic stochastic growth rate of each population must exceed the competitive effect on itself due to the other population. Conversely, if one of the inequalities in (4.3) is reversed, then Theorem 4.1 implies that one population excludes the other. Unlike the standard Lotka-Volterra competition equations, Proposition 3.2 implies that both inequalities in (4.3) cannot be simultaneously reversed and, consequently, bistable dynamics are impossible.
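The dichotomy described by Theorems 4.1 and 4.2 can be summarized as a small decision procedure based directly on the invasion rates of Proposition 3.1. The Python sketch below is one way to write it; the function names, the returned labels, and the example landscape are hypothetical, and boundary cases (an invasion rate exactly zero, or no persistent resident) are reported as not covered because the theorems stated here do not address them.

```python
def dot(x, y):
    return sum(a * b for a, b in zip(x, y))

def growth(s, mu, Sigma):
    """Intrinsic stochastic growth rate s . (mu - Sigma s / 2)."""
    return dot(s, mu) - dot(s, [dot(row, s) for row in Sigma]) / 2

def invasion_rate(resident, invader, mu, kappa, Sigma):
    """I(resident, invader) from Proposition 3.1."""
    r_res, r_inv = growth(resident, mu, Sigma), growth(invader, mu, Sigma)
    if r_res <= 0:
        return r_inv
    cross = sum(k * a * b for k, a, b in zip(kappa, resident, invader))
    own = sum(k * a * a for k, a in zip(kappa, resident))
    return r_inv - (cross / own) * r_res

def classify(alpha, beta, mu, kappa, Sigma):
    """Long-term outcome predicted by the invasion rates."""
    i_ab = invasion_rate(alpha, beta, mu, kappa, Sigma)   # beta invading alpha
    i_ba = invasion_rate(beta, alpha, mu, kappa, Sigma)   # alpha invading beta
    if i_ab > 0 and i_ba > 0:
        return "protected polymorphism"                   # Theorem 4.2
    if growth(alpha, mu, Sigma) > 0 and i_ab < 0:
        return "alpha excludes beta"                      # Theorem 4.1
    if growth(beta, mu, Sigma) > 0 and i_ba < 0:
        return "beta excludes alpha"                      # Theorem 4.1, roles swapped
    return "not covered"

# Illustrative symmetric two-patch landscape.
mu, kappa = [1.0, 1.0], [1.0, 1.0]
Sigma = [[1.0, 0.0], [0.0, 1.0]]
two_specialists = classify([1.0, 0.0], [0.0, 1.0], mu, kappa, Sigma)
generalist_vs_specialist = classify([0.5, 0.5], [1.0, 0.0], mu, kappa, Sigma)
```

In this landscape two specialists on different patches form a protected polymorphism, whereas a generalist that splits its time equally excludes a specialist.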

Environmental stochasticity impedes protected polymorphisms in symmetric landscapes.

Consider a landscape where all patches have the same carrying capacities (e.g. κ_i = 1 for all i), the same intrinsic rates of growth (i.e. μ_i = a for all i), and the same amount of uncorrelated environmental stochasticity (e.g. σ_ii = σ² for all i and σ_ij = 0 for i ≠ j). Then the protected polymorphism inequalities (4.3) become

\frac{a - σ^{2} {∣ ∣ α ∣ ∣}^{2} ∕ 2}{a - σ^{2} {∣ ∣ β ∣ ∣}^{2} ∕ 2} > C_{β, α} and \frac{a - σ^{2} {∣ ∣ β ∣ ∣}^{2} ∕ 2}{a - σ^{2} {∣ ∣ α ∣ ∣}^{2} ∕ 2} > C_{α, β}

(4.4)

where the only σ² dependency is on the left hand sides of both inequalities. As $\frac{a - σ^{2} {∣ ∣ β ∣ ∣}^{2} ∕ 2}{a - σ^{2} {∣ ∣ α ∣ ∣}^{2} ∕ 2}$ is a decreasing function of σ² whenever $\frac{∣ ∣ α ∣ ∣}{∣ ∣ β ∣ ∣} < 1$ and an increasing function of σ² whenever $\frac{∣ ∣ α ∣ ∣}{∣ ∣ β ∣ ∣} > 1$ , it follows that the set of strategies supporting a protected polymorphism

A (σ^{2}) = {(α, β) : (4.4) holds}

is a decreasing function of σ², i.e. $A (σ_{2}^{2})$ is a proper subset of $A (σ_{1}^{2})$ whenever σ₂ > σ₁ ≥ 0. Figure 1A illustrates this conclusion for a two-patch landscape. Intuitively, increasing environmental stochasticity in these symmetric landscapes reduces the stochastic growth rate for all strategies and, thereby, makes it less likely for populations to persist, let alone coexist. For asymmetric landscapes, how the set A(σ²) of protected polymorphisms varies with σ² is more subtle, as illustrated in Figure 1B. In this case, some protected polymorphisms are facilitated by environmental stochasticity, while other protected polymorphisms are disrupted by environmental stochasticity.
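The shrinking of the coexistence set in a symmetric landscape can be checked numerically. The sketch below, which is only an illustration under assumed parameter values, counts the pairs of two-patch strategies on a coarse grid for which both invasion rates are positive, using the symmetric-landscape form I(α, β) = r_β − (α · β/||α||²) r_α with r_α = a − σ²||α||²/2. The grid resolution and the function names are hypothetical choices.

```python
def invasion_rate_symmetric(alpha, beta, a, sigma2):
    """I(alpha, beta) when kappa_i = 1, mu_i = a and Sigma = sigma2 * identity."""
    norm_a = sum(x * x for x in alpha)
    norm_b = sum(x * x for x in beta)
    overlap = sum(x * y for x, y in zip(alpha, beta))
    r_alpha = a - sigma2 * norm_a / 2.0
    r_beta = a - sigma2 * norm_b / 2.0
    return r_beta - (overlap / norm_a) * r_alpha

def count_polymorphic_pairs(a, sigma2, steps=20):
    """Number of grid pairs (alpha_1, beta_1) with both invasion rates positive."""
    grid = [i / steps for i in range(steps + 1)]
    count = 0
    for a1 in grid:
        for b1 in grid:
            alpha, beta = (a1, 1.0 - a1), (b1, 1.0 - b1)
            if (invasion_rate_symmetric(alpha, beta, a, sigma2) > 0
                    and invasion_rate_symmetric(beta, alpha, a, sigma2) > 0):
                count += 1
    return count

# Compare a lower and a higher noise level (illustrative values).
for sigma2 in (0.25, 1.0):
    print(sigma2, count_polymorphic_pairs(1.0, sigma2))
```

The count at the higher noise level should not exceed the count at the lower one, in line with the inclusion of A(σ₂²) in A(σ₁²).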

Figure 1. Protected polymorphisms and exclusion in two-patch landscapes. Contour plots of $I (α, β)$ where lighter shades correspond to higher values of $I (α, β)$ . The regions where $I (α, β) I (β, α) > 0$ are delineated by the solid curves and correspond to parameter combinations supporting a protected polymorphism. Regions where $I (α, β) I (β, α) < 0$ correspond to strategies that cannot coexist. The dashed-dotted and dotted curves indicate how regions of coexistence and exclusion change for higher and lower levels of environmental stochasticity σ², respectively. In panel A, the landscape is spatially homogeneous with μ = (1, 1), κ = (1, 1) and Σ = σ²I where I is the 2 × 2 identity matrix. In panel B, the landscape is spatially heterogeneous with respect to the deterministic carrying capacities, with κ = (3, 1) and the remaining parameters as in A.

For the symmetric landscapes, we can identify a strategy that displaces all others, namely the strategy $α = (\frac{1}{n}, \dots, \frac{1}{n})$ of visiting all patches with equal frequency. This strategy maximizes the function $α \mapsto a - σ^{2} {∣ ∣ α ∣ ∣}^{2} ∕ 2$ . Moreover, if we consider a competing strategy β ≠ α, then $α \cdot β = {∣ ∣ α ∣ ∣}^{2} = \frac{1}{n}$ because the entries of β sum to one, and

\begin{matrix} I (α, β) & = a - σ^{2} {∣ ∣ β ∣ ∣}^{2} ∕ 2 - \frac{α \cdot β}{{∣ ∣ α ∣ ∣}^{2}} (a - \frac{σ^{2} {∣ ∣ α ∣ ∣}^{2}}{2}) \\ & = a - σ^{2} {∣ ∣ β ∣ ∣}^{2} ∕ 2 - (a - \frac{σ^{2} {∣ ∣ α ∣ ∣}^{2}}{2}) < 0 . \end{matrix}

For example, the invasion rates are negative along the vertical transect α₁ = 1/2 in Figure 1A. This strategy α is an example of an evolutionarily stable strategy that we discuss further in the next section.

5. Evolutionarily stable strategies

The concept of an evolutionarily stable strategy was introduced by Maynard Smith and Price (1973). Loosely stated, an evolutionarily stable strategy is a strategy that cannot be invaded by any other strategy and, consequently, can be viewed as an evolutionary endpoint. For our models, we say patch selection strategy α is an evolutionarily stable strategy (ESS) if $I (α, β) < 0$ for all strategies β ≠ α. In light of Theorem 4.1, an ESS not only resists invasion attempts by all other strategies, but can displace all other strategies. An ESS α is called a pure ESS if α_i = 1 for some patch i; otherwise it is a mixed ESS. Our next result provides an algebraic characterization of mixed and pure ESSs. However, it remains to be understood whether these ESSs can be reached by small mutational steps in the strategy space (i.e. are convergently stable (Geritz et al., 1997)).

Theorem 5.1. Assume that the covariance matrix Σ is positive definite and that there is at least one patch selection strategy which persists in the absence of competition with another strategy; that is, that max_α α · (μ − Σα/2) > 0.

Mixed strategy

An ESS α with α_i > 0 for i ∈ I with I ⊆ {1, 2, …, n} and |I| ≥ 2 satisfies

- \frac{α \cdot Σ α}{2} = μ_{i} - κ_{i} α_{i} \frac{α \cdot (2 μ - Σ α)}{2 {〈 α, α 〉}_{κ}} - \sum_{j = 1}^{n} σ_{ij} α_{j}

(5.1)

for all i ∈ I. Conversely, if |I| = n, then a strategy α satisfying (5.1) is an ESS.

Pure strategy

The strategy α_i = 1 and α_j = 0 for j ≠ i is an ESS if and only if

μ_{j} - \frac{σ_{jj}}{2} < - \frac{σ_{jj}}{2} + σ_{ij} - \frac{σ_{ii}}{2}

(5.2)

for all j ≠ i.

Furthermore, in the case of n = 2, there exists a mixed ESS whenever the reversed inequalities

μ_{j} - \frac{σ_{jj}}{2} > - \frac{σ_{jj}}{2} + σ_{ij} - \frac{σ_{ii}}{2}

(5.3)

hold for i = 1, 2 and j ≠ i.

The first statement of Theorem 5.1 provides a sufficient and necessary condition for a mixed ESS utilizing all patches. For example, in a symmetric landscape (as described in the previous section), this ESS condition is only satisfied for α = (1/n, 1/n, …, 1/n).

The second statement of Theorem 5.1 provides a characterization of when using only a single patch is an ESS. Since the right hand side of equation (5.2) is negative, using patch i can only be an ESS if all other patches have a negative stochastic rate of growth, $μ_{j} - \frac{σ_{j j}}{2} < 0$ for all j ≠ i. However, even if only patch i has a positive stochastic growth rate, an ESS may use the other patches, as we illustrate next for two-patch landscapes.
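The pure-strategy condition (5.2) is easy to evaluate for a given landscape. The following sketch, with hypothetical function name and parameter values, checks inequality (5.2) for every patch j ≠ i; patches are indexed from zero in the code.

```python
def is_pure_ess(i, mu, Sigma):
    """Return True if using only patch i satisfies inequality (5.2) for all j != i."""
    for j in range(len(mu)):
        if j == i:
            continue
        lhs = mu[j] - Sigma[j][j] / 2.0
        rhs = -Sigma[j][j] / 2.0 + Sigma[i][j] - Sigma[i][i] / 2.0
        if not lhs < rhs:
            return False
    return True

# Illustrative two-patch landscapes with uncorrelated noise of variance 0.5.
Sigma = [[0.5, 0.0], [0.0, 0.5]]
print(is_pure_ess(0, [1.0, -1.0], Sigma))   # strongly declining second patch
print(is_pure_ess(0, [1.0, -0.1], Sigma))   # mildly declining second patch
```

In the second landscape the second patch is a sink (μ₂ − σ₂₂/2 = −0.35 < 0), yet inequality (5.2) fails, so specializing on the first patch is not an ESS.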

ESSs in two-patch, uncorrelated landscapes

For an uncorrelated two patches landscape (i.e. n = 2 and σ₁₂ = 0), Theorem 5.1 implies that there is a mixed ESS whenever

μ_{1} > - σ_{22} ∕ 2 and μ_{2} > - σ_{11} ∕ 2

(5.4)

and this ESS satisfies

α_{i} = \frac{μ_{i} + α \cdot Σ α ∕ 2}{κ_{i} (μ \cdot α - α \cdot Σ α ∕ 2) ∕ {〈 α, α 〉}_{κ} + σ_{ii}} .

(5.5)

Equation (5.4) implies that even if deterministic growth in patch 2 is strictly negative (i.e. μ₂ < 0), there is selection for movement into this patch provided the variance of the fluctuations in patch 1 is sufficiently large relative to the intrinsic rate of decline in patch 2 (Fig. 2).
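A mixed ESS in a two-patch, uncorrelated landscape can be located numerically from condition (5.1). Writing α = (p, 1 − p), the right-hand sides of (5.1) for the two patches must agree; their difference equals μ₁ + σ₂₂/2 > 0 at p = 0 and −(μ₂ + σ₁₁/2) < 0 at p = 1 when (5.4) holds, so bisection finds a root. The sketch below is a minimal illustration under these assumptions; the function names, tolerance, and parameter values are hypothetical, and uniqueness of the root is not checked.

```python
def ess_gap(p, mu, kappa, s11, s22):
    """Difference of the right-hand sides of (5.1) for patches 1 and 2
    at alpha = (p, 1 - p), assuming sigma_12 = 0."""
    q = 1.0 - p
    quad = s11 * p * p + s22 * q * q              # alpha . Sigma alpha
    r = mu[0] * p + mu[1] * q - quad / 2.0        # stochastic growth rate r_alpha
    norm = kappa[0] * p * p + kappa[1] * q * q    # <alpha, alpha>_kappa
    f1 = mu[0] - kappa[0] * p * r / norm - s11 * p
    f2 = mu[1] - kappa[1] * q * r / norm - s22 * q
    return f1 - f2

def two_patch_mixed_ess(mu, kappa, s11, s22, tol=1e-12):
    """Bisection for a strategy satisfying (5.1) when condition (5.4) holds."""
    if not (mu[0] > -s22 / 2.0 and mu[1] > -s11 / 2.0):
        raise ValueError("condition (5.4) fails; a mixed ESS is not guaranteed")
    lo, hi = 0.0, 1.0   # the gap is positive at 0 and negative at 1
    while hi - lo > tol:
        mid = (lo + hi) / 2.0
        if ess_gap(mid, mu, kappa, s11, s22) > 0:
            lo = mid
        else:
            hi = mid
    p = (lo + hi) / 2.0
    return (p, 1.0 - p)

# Source-sink example: patch 2 declines on average but patch 1 is noisy.
print(two_patch_mixed_ess([1.0, -0.1], [1.0, 1.0], 1.0, 0.01))
```

For the source-sink parameters above, the returned strategy places positive weight on the sink patch.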

Figure 2. ESS for patch selection (left) and mean population abundance (right) in a source-sink landscape. Parameter values: n = 2, σ₁₁ = σ², σ₂₂ = σ₁₂ = 0, μ = (1, μ₂), and κ = (1, 1).

In the limit of no noise (i.e. σ_ii ↓ 0 for i = 1, 2), equation (5.5) becomes

α_{i} = \frac{μ_{i}}{κ_{i} μ \cdot α ∕ {〈 α, α 〉}_{κ}} .

While our results do not apply to the deterministic case, this limiting expression for the ESS suggests, correctly, that the ESS for the deterministic model satisfies

α_{i} = \frac{μ_{i} ∕ κ_{i}}{\sum_{j} μ_{j} ∕ κ_{j}} whenever μ_{i} > 0 .

In other words, the fraction of individuals selecting patch i is proportional to the equilibrium density μ_i/κ_i supported by patch i. Equation (5.5) implies that adding stochasticity in equal amounts to all patches (i.e. σ_ii = σ² for all i) results in an ESS where, relative to the deterministic ESS, fewer individuals select patches supporting the highest mean population abundance and more individuals select patches supporting lower mean population abundances (Fig. 3).
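The deterministic limit is simple enough to compute directly. The short sketch below evaluates α_i = (μ_i/κ_i)/Σ_j (μ_j/κ_j); as an assumption made here for illustration, patches with μ_i ≤ 0 receive weight zero and are excluded from the sum. The function name and example values are hypothetical.

```python
def deterministic_ess(mu, kappa):
    """Patch-use fractions proportional to mu_i / kappa_i over patches with mu_i > 0."""
    weights = [m / k if m > 0 else 0.0 for m, k in zip(mu, kappa)]
    total = sum(weights)
    if total <= 0:
        raise ValueError("no patch has a positive growth rate")
    return [w / total for w in weights]

print(deterministic_ess([1.0, 1.0], [1.0, 3.0]))
print(deterministic_ess([1.0, -0.5], [1.0, 1.0]))
```

In the first call the patch with the smaller κ (the larger deterministic equilibrium density) is used three times as often as the other; in the second, the patch with negative growth is not used at all.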

Figure 3. The effect of the deterministic carrying capacities and environmental stochasticity on the ESS for patch selection (left) and mean population abundance (right) in a two-patch landscape. The ratio κ₁/(κ₁ + κ₂) corresponds to the ratio of the deterministic carrying capacity (μ₂/κ₂) in patch 2 to the sum of the deterministic carrying capacities (μ₁/κ₁ + μ₂/κ₂) when μ₁ = μ₂ = 1. Parameter values: n = 2, σ₁₁ = σ₂₂ = σ², σ₁₂ = 0, μ = (1, 1), and κ = (1, κ₂).

6. Discussion

Habitat selection by organisms is a complex process determined by a mixture of genetic, developmental, ecological, and environmental factors. For ecologists, habitat selection plays a fundamental role in determining the spatial and temporal distribution of a population (Rosenzweig, 1981; Orians and Wittenberger, 1991). For evolutionary biologists, habitat selection determines the suite of environmental factors driving local adaptation (Edelaar and Bolnick, 2012). Indeed, in the words of the eminent evolutionary biologist Ernst Mayr, “With habitat and food selection – behavioral phenomena – playing a major role in the shift into new adaptive zones, the importance of behavior in initiating new evolutionary events is self-evident” (Mayr, 1963, p. 604). Here, we examined how spatial and temporal heterogeneity in demographic rates across multiple habitat patches influences the dynamics of competing populations that differ only in their habitat patch selection preferences. We assume that habitat selection has a genetic basis (e.g. genes that influence the physiological or neurological capacity of individuals to detect and respond to habitat cues) and that genetic differences in habitat choice have no pleiotropic effects on habitat specific fitness. Our analysis reveals that, generically, only two outcomes are possible, coexistence or displacement of one population by the other for all initial conditions, and that these outcomes are determined by the invasion rates of populations when rare. In addition to providing a mathematically rigorous justification of prior work, our analysis provides new insights into protected polymorphisms for habitat selection and raises several questions about evolutionarily stable strategies for habitat selection.

Protected polymorphisms correspond to populations of competing genotypes exhibiting negative frequency-dependence: each population tends to increase when rare (Prout, 1968). In the case of patch selection, these competing populations differ in the frequencies in which they select habitat patches. In a survey of the empirical literature, Jaenike and Holt (1991) found “that genetic variation for habitat selection is common, especially in arthropods and mollusks, the groups that have been studied most frequently.” Moreover, they argued that some of this variation may be maintained through protected polymorphism. Specifically, “in a haploid model without intrinsic fitness differences among genotypes [i.e. soft selection], genetic variation in fixed habitat preferences may be maintained stably” (Jaenike and Holt, 1991, pg. S83). We provide a general analytic criterion (see inequality (4.3)) characterizing these protected polymorphisms for spatially and temporally variable environments. This criterion depends on the intrinsic fitnesses (r_α and r_β) of each population and their competitive coefficients (C_α,β and C_β,α) that characterize the effect of each population on the other. Competitive effects are greatest when there is an overlap in patch use and one population tends to select the patches with the higher carrying capacities more than the other population. Intuitively, by occupying patches with larger carrying capacities, populations achieve higher regional densities. Coupled with overlap in patch use, these higher densities result in a greater competitive impact of one population on another. A protected polymorphism occurs when the relative fitness of each population (e.g. r_α/r_β for strategy α) is greater than the competitive effect of the other population on it (e.g. r_α/r_β > C_β,α for the population playing strategy α). Hence, as in the case of species coexistence (Chesson, 2000), protected polymorphisms are most likely when fitness differences are small (i.e. r_α/r_β ≈ 1) and competitive effects are small (i.e. both C_α,β and C_β,α < 1). Environmental stochasticity solely affects the intrinsic fitness terms and can facilitate or inhibit protected polymorphisms. For landscapes in which all patches experience the same degree of uncorrelated, temporal variation, environmental stochasticity has an inhibitory effect as it magnifies fitness differences between competing strategies (e.g. r_α/r_β increases with environmental stochasticity). For asymmetric landscapes, however, temporal variability can facilitate polymorphisms by reducing fitness differences of competing strategies.

In contrast to protected polymorphisms, our analysis reveals that populations playing an evolutionarily stable strategy (ESS) for patch selection not only thwart invasion attempts by all other strategies but also can invade and displace a population playing any other strategy. Furthermore, our analysis provides a mathematically rigorous justification of an earlier characterization of ESSs (Schreiber, 2012). This characterization implies that populations playing the ESS always occupy source habitats (i.e. patches where $μ_{i} - σ_{i i} ∕ 2 > 0$ ). Indeed, consider a population playing strategy α that does not occupy some source patch, say patch i. Then a different behavioral genotype β that only selects patch i can invade as $I (α, β) = μ_{i} - σ_{i i} ∕ 2 > 0$ . In the limiting case of a deterministic environment, our characterization of the ESS recovers the classic result of McPeek and Holt (1992): the fraction of time spent in a patch is proportional to the carrying capacity of the patch. Adding environmental stochasticity generally results in populations playing the ESS decreasing the time spent in the patches with larger carrying capacities and possibly making use of sink patches (i.e. patches where $μ_{i} - σ_{i i} ∕ 2 < 0$ ). This shift in patch choice can be viewed as a spatial form of bet hedging: individuals increase fitness by decreasing the variance in their stochastic growth rate at the expense of their mean growth rate (Childs et al., 2010).

We are able to show that for two-patch landscapes there always exists an ESS for patch selection. However, several questions remain unanswered. First, what happens for landscapes with more than two patches? Is there always an ESS? Second, while we know that a population playing an ESS can displace a monomorphic population playing a different strategy, can it displace polymorphic populations? Finally, are ESSs always convergently stable (Geritz et al., 1997)? If there are positive answers to this final suite of questions, then ESSs can be generally viewed as the ultimate evolutionary end state for patch selection strategies.

Going beyond the models considered here, studying the evolution of habitat use faces many challenges. Our models assume that populations spend a fixed fraction of time in each patch and do so instantaneously. What happens if we relax these assumptions? For example, if populations are more ideal and able to track changes in population density instantaneously, then we have something closer to the classical notion of ideal free movement (Fretwell and Lucas, 1969). For these populations, what is the optimal (in an evolutionary sense) density-dependent strategy? Moreover, can such a strategy displace the static strategies considered here? Alternatively, if populations are less ideal and diffusing randomly on the landscape, what happens then? The linear version of this question was tackled in part by Evans et al. (2013). However, the mathematical analysis for analogous stochastic models with density-dependent feedbacks is largely unexplored. Going beyond single species, the coevolution of patch selection among interacting species has a rich history for spatially heterogeneous, but temporally homogeneous environments (van Baalen and Sabelis, 1993; Křivan, 1997; Schreiber et al., 2000; van Baalen et al., 2001; Schreiber et al., 2002; Cressman et al., 2004; Schreiber and Vejdani, 2006; Cantrell et al., 2007). For example, spatial heterogeneity can select for the evolution of contrary choices in which the prey prefers low quality patches to escape the predator and the predator prefers high quality patches to capture higher quality food items (Fox and Eisenbach, 1992; Schreiber et al., 2000). Understanding how environmental stochasticity influences this coevolution of patch choice and the community level consequences of these coevolutionary outcomes provides a plethora of important, yet largely untouched challenges for future work.

Acknowledgments

The authors thank Dan Crisan, Alison Etheridge, Tom Kurtz, and Gregory Roth for helpful discussions.

S.N.E. was supported in part by NSF grant DMS-0907639 and NIH grant 1R01GM109454-01.

A.H. was supported by EPSRC grant EP/K034316/1.

S.J.S. was supported in part by NSF grants EF-0928987 and DMS-1022639.

Appendix A: Proof of Proposition 2.1

The stochastic differential equation for Z is of the form

{dZ}_{t} = b (Z_{t}) dt + σ (Z_{t}) {dW}_{t},

(6.1)

where b(z) := μz − κz² and σ(z) := σz. It follows from Itô’s existence and uniqueness theorem for strong solutions of stochastic differential equations that this equation has a unique strong solution up to possibly a finite but strictly positive explosion time.

Set R_t := log Z_t for t ≥ 0. By Itô’s lemma,

{dR}_{t} = (μ - \frac{σ^{2}}{2} - κ exp (R_{t})) dt + σ {dW}_{t} .

(6.2)

It follows from the comparison principle of Ikeda and Watanabe (see Chapter VI Theorem 1.1 of Ikeda and Watanabe (1989), Theorem 1.4 of Le Gall (1983), or Theorem V.43.1 of Rogers and Williams (2000)) that

R_{t} \leq R_{0} + (μ - \frac{σ^{2}}{2}) t + σ W_{t},

(6.3)

and so Z does not explode to +∞ in finite time. Moreover, since r ↦ μ − κe^r is a bounded, uniformly Lipschitz function on (−∞, 0], it follows from Itô’s existence and uniqueness theorem that R does not explode to −∞ in finite time, so that Z does not hit 0 in finite time. We could have also established this result by using the scale function and speed measure calculated below to check Feller’s necessary and sufficient condition for the boundary point of a one-dimensional diffusion to be inaccessible; see Theorem 23.12 of Kallenberg (2002).
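The log-scale equation (6.2) is also convenient for simulation, because exponentiating R keeps the simulated abundance strictly positive. The following Euler–Maruyama sketch is only an illustration of (6.2) under assumed step size, horizon, and seed; the function name and default arguments are hypothetical, and the discretization is not part of the proof.

```python
import math
import random

def simulate_log_sde(z0, mu, kappa, sigma, dt=0.01, n_steps=5000, seed=0):
    """Euler-Maruyama for dR = (mu - sigma^2/2 - kappa*exp(R)) dt + sigma dW, R = log Z.

    Returns the final abundance Z_T and the time average of Z over [0, T]."""
    rng = random.Random(seed)
    r = math.log(z0)
    running_integral = 0.0
    for _ in range(n_steps):
        z = math.exp(r)
        running_integral += z * dt                  # left Riemann sum of Z
        dw = rng.gauss(0.0, math.sqrt(dt))          # Brownian increment
        r += (mu - sigma ** 2 / 2.0 - kappa * z) * dt + sigma * dw
    return math.exp(r), running_integral / (n_steps * dt)

# Without noise the scheme relaxes to the deterministic equilibrium mu / kappa.
print(simulate_log_sde(0.1, 1.0, 2.0, 0.0))
# With noise, the time average can be compared with (mu - sigma^2/2) / kappa.
print(simulate_log_sde(0.1, 1.0, 2.0, 0.5, n_steps=200000))
```

With σ = 0 the recursion reduces to an explicit Euler scheme for the logistic equation on the log scale, which provides a simple sanity check of the implementation.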

It is not hard to check using Itô’s lemma that an explicit solution of the SDE is

Z_{t} = \frac{Z_{0} exp ((μ - σ^{2} ∕ 2) t + σ W_{t})}{1 + κ Z_{0} \int_{0}^{t} exp ((μ - σ^{2} ∕ 2) s + σ W_{s}) ds} .

We see from the inequality (6.3) that if μ − σ²/2 < 0, then lim_t→∞ Z_t = 0 almost surely.

We use the theory based on the scale function and speed measure of a one-dimensional diffusion (see, for example, Chapter 23 of Kallenberg (2002) or Sections V.6-7 of Rogers and Williams (2000)) below to establish that Z is positive recurrent with a unique stationary distribution when μ−σ²/2 > 0. Similar calculations show that Z is null recurrent when μ − σ²/2 = 0, and hence lim inf_t→∞ Z_t = 0 almost surely and lim sup_t→∞ Z_t = ∞. It follows from (6.2) and the comparison principle that if Z′ and Z″ are two solutions of (6.1) with respective parameters μ′, κ′, σ′ and μ″, κ″, σ″ satisfying μ′ ≤ μ″, κ′ = κ″, σ′ = σ″ and the same initial conditions, then $Z_{t}^{'} \leq Z_{t}^{″}$ . We will show below that

\lim_{t \to \infty} \frac{1}{t} \int_{0}^{t} Z_{s} ds = \frac{1}{κ} \cdot (μ - σ^{2} ∕ 2)

almost surely when μ − σ²/2 > 0, and hence

\lim_{t \to \infty} \frac{1}{t} \int_{0}^{t} Z_{s} ds = 0

almost surely when μ − σ²/2 = 0.

We now identify the scale function and speed measure of the one-dimensional diffusion Z. A choice for the scale function is

\begin{matrix} s (x) & = \int_{c}^{x} exp (- \int_{a}^{y} \frac{2 b (z)}{σ^{2} (z)} dz) dy \\ = \int_{c}^{x} {(\frac{y}{a})}^{- 2 μ ∕ σ^{2}} e^{\frac{2 κ}{σ^{2}} (y - a)} dy \end{matrix}

(6.4)

for arbitrary $a, c \in ℝ_{+ +}$ (recall that the scale function is only defined up to affine transformations). If we set $\tilde{σ} = (σ s^{'}) \circ s^{- 1}$ , then

ds (Z_{t}) = \tilde{σ} (s (Z_{t})) d {\tilde{W}}_{t}

and the diffusion process s(Z) is in natural scale on the state space $s (ℝ_{+ +})$ with speed measure m that has density $\frac{1}{{\tilde{σ}}^{2}}$ .

The total mass of the speed measure is

\begin{matrix} m (ℝ_{+ +}) & = \int_{s (ℝ_{+ +})} \frac{1}{{\tilde{σ}}^{2} (x)} dx = \int_{s (ℝ_{+ +})} \frac{1}{{((σ s^{'}) \circ s^{- 1})}^{2} (x)} dx = \int_{0}^{\infty} \frac{1}{σ^{2} (u) s^{'} (u)} du \\ & = \int_{0}^{\infty} \frac{1}{{(σ u)}^{2} {(\frac{u}{a})}^{- 2 μ ∕ σ^{2}} e^{\frac{2 κ}{σ^{2}} (u - a)}} du \\ & = \frac{1}{σ^{2} a^{2 μ ∕ σ^{2}}} \int_{0}^{\infty} u^{\frac{2 μ}{σ^{2}} - 2} e^{- \frac{2 κ}{σ^{2}} (u - a)} du . \end{matrix}

(6.5)

By Theorem 23.15 of Kallenberg (2002), the diffusion process Z has a stationary distribution concentrated on $ℝ_{+ +}$ if and only if the process s(Z) has (−∞, +∞) as its state space and the speed measure has finite total mass or s(Z) has a finite interval as its state space and the boundaries are reflecting. The introduction of an extra negative drift to geometric Brownian motion cannot make zero a reflecting boundary, so we are interested in conditions under which $s (ℝ_{+ +}) = (- \infty, \infty)$ and the speed measure has finite total mass. We see from (6.4) and (6.5) that this happens if and only if μ − σ²/2 > 0, a condition we assume holds for the remainder of the proof.

The diffusion s(Z) has a stationary distribution with density $f ≔ \frac{1}{m (ℝ_{+ +}) {\tilde{σ}}^{2}}$ on $s (ℝ_{+ +}) = (- \infty, + \infty)$ , and so the stationary distribution of Z is the distribution on $ℝ_{+ +}$ that has density

\begin{matrix} g (x) & = f (s (x)) s^{'} (x) \\ = \frac{1}{m (ℝ_{+ +}) {\tilde{σ}}^{2} (s (x))} s^{'} (x) \\ = \frac{1}{m (ℝ_{+ +}) σ^{2} (x) s^{'} (x)} \\ = \frac{1}{m (ℝ_{+ +}) x^{2} σ^{2} {(\frac{x}{a})}^{- 2 μ ∕ σ^{2}} e^{\frac{2 κ}{σ^{2}} (x - a)}}, x \in ℝ_{+ +} . \end{matrix}

This has the form of a Gamma(k,θ) density with parameters $θ ≔ \frac{σ^{2}}{2 κ}$ and $k = \frac{2 μ}{σ^{2}} - 1$ . Therefore,

g (x) = \frac{1}{Γ (k) θ^{k}} x^{k - 1} e^{- \frac{x}{θ}} = \frac{1}{Γ (\frac{2 μ}{σ^{2}} - 1) {(\frac{σ^{2}}{2 κ})}^{\frac{2 μ}{σ^{2}} - 1}} x^{\frac{2 μ}{σ^{2}} - 2} e^{\frac{- 2 κ x}{σ^{2}}}, x \in ℝ_{+ +} .

Theorem 20.21 from Kallenberg (2002) implies that the shift-invariant σ-field is trivial for all starting points. The ergodic theorem for stationary stochastic processes then tells us that, if we start Z with its stationary distribution,

\lim_{t \to \infty} \frac{1}{t} \int_{0}^{t} h (Z_{s}) ds = \int_{0}^{\infty} h (x) g (x) dx

for any Borel function $h : ℝ_{+ +} \to ℝ$ with $\int_{0}^{\infty} ∣ h (x) ∣ g (x) d x < \infty$ . Since Z has positive continuous transition densities we can conclude that

\lim_{t \to \infty} \frac{1}{t} \int_{0}^{t} h (Z_{s}) ds = \int_{0}^{\infty} h (x) g (x) dx

$ℙ^{x}$ -almost surely for any $x \in ℝ_{+ +}$ .

In particular,

\int_{ℝ_{+ +}} xg (x) dx = k θ = \frac{1}{κ} \cdot (μ - \frac{σ^{2}}{2}) .
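The Gamma form of the stationary density can be checked numerically. The sketch below computes the shape k = 2μ/σ² − 1 and scale θ = σ²/(2κ), and uses a simple trapezoidal rule to confirm that the density integrates to one and has mean kθ = (μ − σ²/2)/κ. The function names, the truncation of the integral at a finite upper limit, and the example parameters are assumptions made for illustration; the quadrature as written is only suitable when k ≥ 1, since the density is unbounded near zero otherwise.

```python
import math

def stationary_gamma_params(mu, kappa, sigma2):
    """Shape k and scale theta of the stationary Gamma law (requires mu - sigma2/2 > 0)."""
    if mu - sigma2 / 2.0 <= 0:
        raise ValueError("no stationary distribution on (0, infinity)")
    k = 2.0 * mu / sigma2 - 1.0
    theta = sigma2 / (2.0 * kappa)
    return k, theta

def gamma_density(x, k, theta):
    """Gamma(k, theta) density evaluated at x >= 0 (assumes k >= 1 when x = 0)."""
    return x ** (k - 1.0) * math.exp(-x / theta) / (math.gamma(k) * theta ** k)

def trapezoid(f, lo, hi, n):
    """Composite trapezoidal rule with n subintervals."""
    h = (hi - lo) / n
    total = 0.5 * (f(lo) + f(hi))
    for i in range(1, n):
        total += f(lo + i * h)
    return total * h

k, theta = stationary_gamma_params(1.0, 1.0, 0.5)
mass = trapezoid(lambda x: gamma_density(x, k, theta), 0.0, 10.0, 20000)
mean = trapezoid(lambda x: x * gamma_density(x, k, theta), 0.0, 10.0, 20000)
print(k, theta, mass, mean)
```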

Appendix B: Proof of Theorem 4.1

To simplify our presentation, we re-write the joint dynamics of X and Y as

\begin{matrix} {dX}_{t} & = X_{t} (μ \cdot α - ({aX}_{t} + {cY}_{t})) dt + σ_{X} X_{t} {dU}_{t} \\ {dY}_{t} & = Y_{t} (μ \cdot β - ({cX}_{t} + {bY}_{t})) dt + σ_{Y} Y_{t} {dV}_{t}, \end{matrix}

(6.6)

where $a ≔ {〈 α, α 〉}_{κ}$ , $b ≔ {〈 β, β 〉}_{κ}$ , $c ≔ {〈 α, β 〉}_{κ}$ , $σ_{X} ≔ \sqrt{α \cdot Σ α}$ and $σ_{Y} ≔ \sqrt{β \cdot Σ β}$ .

To prove Theorem 4.1, we need several preliminary results. First, we prove existence and uniqueness of solutions to the system (6.6) as well as a useful comparison result in Theorem 6.1. Second, in Proposition 6.3, we establish that (X_t, Y_t) remains in $ℝ_{+ +}^{2} = {(0, \infty)}^{2}$ for all t ≥ 0 whenever $(X_{0}, Y_{0}) \in ℝ_{+ +}^{2}$ . Third, in Proposition 6.4, we show that weak limit points of the empirical measures $\frac{1}{t} \int_{0}^{t} ℙ^{(x, y)} {(X_{s}, Y_{s}) \in \cdot} d s$ are stationary distributions for the process (X, Y) thought of as a process on $ℝ_{+}^{2}$ (rather than $ℝ_{+ +}^{2}$ ). Finally, we show that lim_t→∞ Y_t = 0 with probability one in Proposition 6.5 and conclude by showing that $\frac{1}{t} \int_{0}^{t} ℙ^{(x, y)} {(X_{s}, Y_{s}) \in \cdot} d s$ converges weakly to $ρ_{\overset{‒}{X}} \otimes δ_{0}$ concentrated on $ℝ_{+ +} \times {0}$ .

Theorem 6.1. The stochastic differential equation in (6.6) has a unique strong solution and $X_{t}, Y_{t} \in L^{p} (ℙ^{(x, y)})$ for all t, p > 0 for all $(x, y) \in ℝ_{+ +}^{2}$ . This solution satisfies X_t > 0 and Y_t > 0 for all t ≥ 0, $ℙ^{(x, y)}$ -almost surely for all $(x, y) \in ℝ_{+ +}^{2}$ . Let ${(({\overset{‒}{X}}_{t}, {\overset{‒}{Y}}_{t}))}_{t \geq 0}$ be the stochastic process defined by the pair of stochastic differential equations

\begin{matrix} d {\overset{‒}{X}}_{t} & = {\overset{‒}{X}}_{t} (μ \cdot α - a {\overset{‒}{X}}_{t}) dt + σ_{X} {\overset{‒}{X}}_{t} {dU}_{t} \\ d {\overset{‒}{Y}}_{t} & = {\overset{‒}{Y}}_{t} (μ \cdot β - b {\overset{‒}{Y}}_{t}) dt + σ_{Y} {\overset{‒}{Y}}_{t} {dV}_{t} \end{matrix}

(6.7)

If $(X_{0}, Y_{0}) = ({\overset{‒}{X}}_{0}, {\overset{‒}{Y}}_{0})$ , then

X_{t} \leq {\overset{‒}{X}}_{t}

and

Y_{t} \leq {\overset{‒}{Y}}_{t}

for all t ≥ 0.

Proof. The uniqueness and existence of strong solutions is fairly standard; see, for example, Theorem 2.1 in Li and Mao (2009). One notes that the drift coefficients are locally Lipschitz, so strong solutions exist and are unique up to the explosion time. It is easy to show this explosion time is almost surely infinite (see Theorem 2.1 in Li and Mao (2009)). Next, suppose that $X_{0} = {\overset{‒}{X}}_{0}$ . We adapt the comparison principle of Ikeda and Watanabe (Chapter VI Theorem 1.1 from Ikeda and Watanabe (1989)) proved by the local time techniques of Le Gall (see Theorem 1.4 from Le Gall (1983) and Theorem V.43.1 in Rogers and Williams (2000)) to show that ${\overset{‒}{X}}_{t} - X_{t} \geq 0$ for all t ≥ 0.

Define $ρ : ℝ_{+} \to ℝ_{+}$ by ρ(x) = |x|². Note that

\begin{matrix} & \int_{0}^{t} ρ {(∣ {\overset{‒}{X}}_{s} - X_{s} ∣)}^{- 1} 𝟙 {{\overset{‒}{X}}_{s} - X_{s} > 0} d {[\overset{‒}{X} - X]}_{s} \\ = & \int_{0}^{t} ρ {(∣ {\overset{‒}{X}}_{s} - X_{s} ∣)}^{- 1} {(σ_{X} {\overset{‒}{X}}_{s} - σ_{X} X_{s})}^{2} 𝟙 {{\overset{‒}{X}}_{s} - X_{s} > 0} ds \\ \leq & σ_{X}^{2} t . \end{matrix}

Since $\int_{0 +} ρ {(u)}^{- 1} d u = \infty$ , by Proposition V.39.3 from Rogers and Williams (2000) the local time at 0 of $X - \overset{‒}{X}$ is zero for all t ≥ 0. Put x⁺ := x∨0. By Tanaka’s formula (see equation IV.43.6 in Rogers and Williams (2000)),

{(X_{t} - {\overset{‒}{X}}_{t})}^{+} = \int_{0}^{t} 𝟙 {X_{s} - {\overset{‒}{X}}_{s} > 0} (σ_{X} X_{s} - σ_{X} {\overset{‒}{X}}_{s}) {dU}_{s} + \int_{0}^{t} 𝟙 {X_{s} - {\overset{‒}{X}}_{s} > 0} [(μ \cdot α - ({aX}_{s} + {cY}_{s})) X_{s} - (μ \cdot α - a {\overset{‒}{X}}_{s}) {\overset{‒}{X}}_{s}] ds .

For K > 0 define the stopping time

T_{K} ≔ inf {t > 0 : X_{t} \geq K or {\overset{‒}{X}}_{t} \geq K}

and the stopped processes $X_{t}^{K} = X_{T_{K} \land t}$ and ${\overset{‒}{X}}_{t}^{K} = {\overset{‒}{X}}_{T_{K} \land t}$ . Then, stopping the processes at T_K and taking expectations yields

\begin{matrix} 0 & \leq 𝔼 {(X_{t}^{K} - {\overset{‒}{X}}_{t}^{K})}^{+} \\ = 𝔼 \int_{0}^{t \land T_{K}} 𝟙 {X_{s} - {\overset{‒}{X}}_{s} > 0} [(μ \cdot α X_{s} - X_{s} ({aX}_{s} + {cY}_{s})) - (μ \cdot α {\overset{‒}{X}}_{s} - a {\overset{‒}{X}}_{s}^{2})] ds \\ = 𝔼 \int_{0}^{t \land T_{K}} 𝟙 {X_{s} - {\overset{‒}{X}}_{s} > 0} [μ \cdot α (X_{s} - {\overset{‒}{X}}_{s}) - a (X_{s}^{2} - {\overset{‒}{X}}_{s}^{2}) - {cX}_{s} Y_{s}] ds \\ \leq 𝔼 \int_{0}^{t \land T_{K}} 𝟙 {X_{s} - {\overset{‒}{X}}_{s} > 0} μ \cdot α (X_{s} - {\overset{‒}{X}}_{s}) ds \\ \leq μ \cdot α 𝔼 \int_{0}^{t \land T_{K}} {(X_{s} - {\overset{‒}{X}}_{s})}^{+} ds \\ \leq μ \cdot α 𝔼 \int_{0}^{t} {(X_{s}^{K} - {\overset{‒}{X}}_{s}^{K})}^{+} ds . \end{matrix}

By Gronwall’s Lemma (see, for example, Appendix 5 of Ethier and Kurtz (2005)) $𝔼 [{(X_{t}^{K} - {\overset{‒}{X}}_{t}^{K})}^{+}] = 0$ for all t ≥ 0, so $X_{t}^{K} \leq {\overset{‒}{X}}_{t}^{K}$ for all t ≥ 0. Now let K → ∞ and use the fact that $\overset{‒}{X}$ does not explode to get that $X_{t} \leq {\overset{‒}{X}}_{t}$ for all t ≥ 0. The same argument shows that $Y_{t} \leq {\overset{‒}{Y}}_{t}$ for all t ≥ 0. Since we have shown before that $\overset{‒}{X}$ is dominated by a geometric Brownian motion, a process that has finite moments of all orders, we get that $X_{t}, Y_{t} \in L^{p} (ℙ^{(x, y)})$ for all t, p > 0 and for all $(x, y) \in ℝ_{+ +}^{2}$ .

Remark 6.2. Note that the SDEs for all the processes considered here have unique strong solutions in L^p for all t ≥ 0, p > 0 and for all strictly positive starting points. This follows by arguments similar to those that are in Theorem 2.1 from Li and Mao (2009) and in Theorem 6.1 by noting that our SDEs for (X, Y), ( $\overset{‒}{X}$ , $\overset{‒}{Y}$ ) etc. are all of the form

\begin{matrix} d {\overset{˘}{X}}_{t} & = {\overset{˘}{X}}_{t} [λ_{1} - λ_{2} {\overset{˘}{Y}}_{t} - λ_{3} {\overset{˘}{X}}_{t}] dt + {\overset{˘}{X}}_{t} σ_{X} {dU}_{t} \\ d {\overset{˘}{Y}}_{t} & = {\overset{˘}{Y}}_{t} [λ_{4} - λ_{5} {\overset{˘}{X}}_{t} - λ_{6} {\overset{˘}{Y}}_{t}] dt + {\overset{˘}{Y}}_{t} σ_{Y} {dV}_{t} \\ {\overset{˘}{X}}_{0} & = x \\ {\overset{˘}{Y}}_{0} & = y \end{matrix}

for $λ_{1}, \dots, λ_{6} \in ℝ_{+}$ and $x, y \in ℝ_{+ +}$ .

The next proposition tells us that none of our processes hit zero in finite time.

Proposition 6.3. Let (X, Y) be the process given by (6.6). If $(X_{0}, Y_{0}) \in ℝ_{+ +}^{2}$ , then $(X_{t}, Y_{t}) \in ℝ_{+ +}^{2}$ for all t ≥ 0 almost surely. A similar conclusion holds for all of the other processes we work with.

Proof. As an example of the method of proof, we look at the process (X, Y) given by (6.6). Taking logarithms and using Itô’s lemma,

d \log X_{t} = (μ \cdot α - ({aX}_{t} + {cY}_{t}) - \frac{1}{2} σ_{X}^{2}) dt + σ_{X} {dU}_{t} .

Therefore,

\log X_{t} = \log X_{0} + \int_{0}^{t} (μ \cdot α - ({aX}_{s} + {cY}_{s}) - \frac{1}{2} σ_{X}^{2}) ds + σ_{X} U_{t}

cannot go to −∞ in finite time because X_t and Y_t do not blow up.

Proposition 6.4. Let (X, Y) be the process given by (6.6) and fix $(x, y) \in ℝ_{+ +}^{2}$ . Any sequence ${t_{n}}_{n \in ℕ}$ such that t_n → ∞ has a subsequence ${u_{n}}_{n \in ℕ}$ such that the sequence of probability measures

\frac{1}{u_{n}} \int_{0}^{u_{n}} ℙ^{(x, y)} {(X_{s}, Y_{s}) \in \cdot} ds

converges in the topology of weak convergence of probability measures on $ℝ_{+}^{2}$ . Any such limit is a stationary distribution for the process (X, Y) thought of as a process with state space $ℝ_{+}^{2}$ .

Proof. Set φ(x, y) := x + y so that φ ≥ 0 for x,y > 0. Put ψ (x, y) = μ · αx + μ · βy − x(ax + cy) − y(cx + by). Note that ψ is bounded above on the quadrant x, y ≥ 0 and lim_{||(x,y)||→∞}ψ(x, y) = −∞ where ||·|| is the Euclidean distance on $ℝ^{2}$ . Using Itô’s lemma we get

φ (X_{t}, Y_{t}) - \int_{0}^{t} ψ (X_{s}, Y_{s}) ds = \int_{0}^{t} σ_{Y} Y_{s} {dV}_{s} + \int_{0}^{t} σ_{X} X_{s} {dU}_{s} .

Therefore, $φ (X_{t}, Y_{t}) - \int_{0}^{t} ψ (X_{s}, Y_{s}) d s$ is a martingale. Applying Theorem 9.9 of Ethier and Kurtz (2005) completes the proof.

The following result is essentially Theorem 10 in Liu et al. (2011). We include the proof for completeness.

Proposition 6.5. Suppose that α·μ−α·Σα/2 > 0, β·μ−β·Σβ/2 > 0, and $I (α, β) < 0$ . If (X, Y) is the process given by (6.6), then $\lim_{t \to \infty} Y_{t} = 0$ $ℙ^{(x, y)}$ -a.s. for all $(x, y) \in ℝ_{+ +}^{2}$ .

Proof. Using Itô’s lemma and the definition of $I (α, β)$ ,

\begin{matrix} a \frac{\log (\frac{Y_{t}}{Y_{0}})}{t} - c \frac{\log (\frac{X_{t}}{X_{0}})}{t} & = a (μ \cdot β - \frac{σ_{Y}^{2}}{2}) - c (μ \cdot α - \frac{σ_{X}^{2}}{2}) - (a b - c^{2}) \frac{\int_{0}^{t} Y_{s} ds}{t} + a σ_{Y} \frac{V_{t}}{t} - c σ_{X} \frac{U_{t}}{t} \\ = a I (α, β) - (ab - c^{2}) \frac{\int_{0}^{t} Y_{s} ds}{t} + a σ_{Y} \frac{V_{t}}{t} - c σ_{X} \frac{U_{t}}{t} . \end{matrix}

By the Cauchy-Schwarz inequality, (ab − c²) = 〈α, α〉_κ〈β, β〉_κ − (〈α, β〉_κ)² ≥ 0, and so

\frac{\log (\frac{Y_{t}}{Y_{0}})}{t} \leq \frac{c}{a} \frac{\log (\frac{X_{t}}{X_{0}})}{t} + I (α, β) + σ_{Y} \frac{V_{t}}{t} - \frac{c}{a} σ_{X} \frac{U_{t}}{t} .

Let $\overset{‒}{X}$ be the process defined by (6.7) with ${\overset{‒}{X}}_{0} = X_{0}$ . Proposition 2.3 implies

\lim_{t \to \infty} \frac{1}{t} \int_{0}^{t} {\overset{‒}{X}}_{s} ds = (μ \cdot α - σ_{X}^{2} ∕ 2) ∕ a almost surely .

(6.8)

It follows from Theorem 6.1 that $X_{t} \leq {\overset{‒}{X}}_{t}$ for all t ≥ 0. Thus, with probability one,

\begin{matrix} \underset{t \to \infty}{\lim \sup} \frac{\log X_{t}}{t} & \leq \underset{t \to \infty}{\lim \sup} \frac{\log {\overset{‒}{X}}_{t}}{t} \\ = (μ \cdot α - \frac{σ_{X}^{2}}{2}) - a \lim_{t \to \infty} \frac{1}{t} \int_{0}^{t} {\overset{‒}{X}}_{s} ds + σ_{X} \lim_{t \to \infty} \frac{U_{t}}{t} \\ = (μ \cdot α - \frac{σ_{X}^{2}}{2}) - a (μ \cdot α - σ_{X}^{2} ∕ 2) ∕ a \\ = 0 . \end{matrix}

Since U and V are Brownian motions, $\lim_{t \to \infty} \frac{U_{t}}{t} = \lim_{t \to \infty} \frac{V_{t}}{t} = 0$ , and $\lim \sup_{t \to \infty} \frac{\log X_{t}}{t} \leq 0$ almost surely, so

\underset{t \to \infty}{\lim \sup} \frac{\log Y_{t}}{t} \leq I (α, β) < 0 almost surely .

In particular, $\lim_{t \to \infty} Y_{t} = 0$ almost surely.
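The conclusion of Proposition 6.5 can be probed numerically. The following is a minimal sketch, not part of the proof: it applies an Euler–Maruyama scheme in logarithmic coordinates to a system of the form (6.6), together with the single-morph process $\overset{‒}{X}$ driven by the same noise. The parameter values, the function name `simulate_competition`, and the choice of independent Brownian motions U and V are illustrative assumptions, not taken from the paper.

```python
import math
import random

def simulate_competition(r_x, r_y, a, b, c, s_x, s_y,
                         x0=0.5, y0=0.5, n_steps=20_000, dt=0.01, seed=1):
    """Euler-Maruyama in log coordinates for
        dX = X (r_x - a X - c Y) dt + s_x X dU,
        dY = Y (r_y - c X - b Y) dt + s_y Y dV,
    plus the single-morph process Xbar (same equation as X with Y = 0),
    driven by the same increments dU. U and V are taken independent here,
    which is a simplifying assumption of this sketch.
    Returns ((log Y_T - log Y_0) / T, flag that X <= Xbar held at every step)."""
    rng = random.Random(seed)
    log_x, log_y, log_xbar = math.log(x0), math.log(y0), math.log(x0)
    sqrt_dt = math.sqrt(dt)
    ordered = True
    for _ in range(n_steps):
        x, y, xbar = math.exp(log_x), math.exp(log_y), math.exp(log_xbar)
        du = sqrt_dt * rng.gauss(0.0, 1.0)
        dv = sqrt_dt * rng.gauss(0.0, 1.0)
        log_x += (r_x - 0.5 * s_x ** 2 - a * x - c * y) * dt + s_x * du
        log_y += (r_y - 0.5 * s_y ** 2 - c * x - b * y) * dt + s_y * dv
        log_xbar += (r_x - 0.5 * s_x ** 2 - a * xbar) * dt + s_x * du
        # Small tolerance so that floating-point rounding is not reported
        # as a violation of the comparison X <= Xbar.
        ordered = ordered and (log_x <= log_xbar + 1e-12)
    total_time = n_steps * dt
    return (log_y - math.log(y0)) / total_time, ordered

# Hypothetical parameters: r_x plays the role of mu.alpha, r_y of mu.beta.
r_x, r_y, a, b, c, s_x, s_y = 1.0, 0.5, 1.0, 1.0, 0.9, 0.3, 0.3
# I(alpha, beta) as it appears in the proof; negative for these values.
invasion = (r_y - 0.5 * s_y ** 2) - (c / a) * (r_x - 0.5 * s_x ** 2)
rate, ordered = simulate_competition(r_x, r_y, a, b, c, s_x, s_y)
print("I(alpha, beta) =", invasion)
print("empirical log-growth rate of Y =", rate)
print("X <= Xbar along the path:", ordered)
```

For these values ab − c² ≥ 0, as the Cauchy-Schwarz step requires. A single sample path is of course only suggestive; the empirical log-growth rate of Y should be read as an estimate whose sign is expected to agree with that of $I (α, β)$ over long horizons.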

We can now finish the proof of Theorem 4.1. Fix ε > 0 and η > 0 sufficiently small. Define the stopping time

T_{ϵ} ≔ inf {t \geq 0 : Y_{t} \geq ϵ}

and the stopped process $X_{t}^{ϵ} ≔ X_{t \land T_{ϵ}}$ . By Proposition 6.5, there exists T > 0 such that

ℙ^{(x, y)} {Y_{t} \leq ϵ for all t \geq T} \geq 1 - η

Define the process $\overset{ˇ}{X}$ via

d {\overset{ˇ}{X}}_{t} = {\overset{ˇ}{X}}_{t} [(μ \cdot α - c ϵ) - a {\overset{ˇ}{X}}_{t}] dt + σ_{X} {\overset{ˇ}{X}}_{t} {dU}_{t}

and the stopped process ${\overset{ˇ}{X}}_{t}^{ϵ} ≔ {\overset{ˇ}{X}}_{t \land T_{ϵ}}$ . Start the process $\overset{ˇ}{X}$ at time T with the condition ${\overset{ˇ}{X}}_{T} = X_{T}$ . We want to show that the process ${\overset{ˇ}{X}}^{ϵ}$ is dominated by the process X^ε, that is $X_{t}^{ϵ} \geq {\overset{ˇ}{X}}_{t}^{ϵ}$ for all t ≥ T. By the strong Markov property, we can assume T = 0.

The proof is very similar to the one from Theorem 6.1. With the notation from the proof of Theorem 6.1, we have

\begin{matrix} & \int_{0}^{t} ρ {(∣ {\overset{ˇ}{X}}_{s}^{ϵ} - X_{s}^{ϵ} ∣)}^{- 1} 𝟙 {{\overset{ˇ}{X}}_{s}^{ϵ} - X_{s}^{ϵ} > 0} d {[{\overset{ˇ}{X}}^{ϵ} - X^{ϵ}]}_{s} \\ = & \int_{0}^{t} ρ {(∣ {\overset{ˇ}{X}}_{s}^{ϵ} - X_{s}^{ϵ} ∣)}^{- 1} {(σ_{X} {\overset{ˇ}{X}}_{s}^{ϵ} - σ_{X} X_{s}^{ϵ})}^{2} 𝟙 {{\overset{ˇ}{X}}_{s}^{ϵ} - X_{s}^{ϵ} > 0} ds \\ \leq & σ_{X}^{2} t \end{matrix}

so the local time of the process ${\overset{ˇ}{X}}^{ϵ} - X^{ϵ}$ at zero is identically zero. Then, using Tanaka’s formula

{({\overset{ˇ}{X}}_{t}^{ϵ} - X_{t}^{ϵ})}^{+} = \int_{0}^{t \land T_{ϵ}} 𝟙 {{\overset{ˇ}{X}}_{s} - X_{s} > 0} (σ_{X} {\overset{ˇ}{X}}_{s} - σ_{X} X_{s}) {dU}_{s} + \int_{0}^{t \land T_{ϵ}} 𝟙 {{\overset{ˇ}{X}}_{s} - X_{s} > 0} [((μ \cdot α - c ϵ) {\overset{ˇ}{X}}_{s} - a {\overset{ˇ}{X}}_{s}^{2}) - (μ \cdot α X_{s} - X_{s} ({cY}_{s} + {aX}_{s}))] ds .

Taking expectations,

\begin{matrix} 𝔼 [{({\overset{ˇ}{X}}_{t}^{ϵ} - X_{t}^{ϵ})}^{+}] & = 𝔼 \int_{0}^{t \land T_{ϵ}} 𝟙 {{\overset{ˇ}{X}}_{s} - X_{s} > 0} [(μ \cdot α ({\overset{ˇ}{X}}_{s} - X_{s}) - (c ϵ {\overset{ˇ}{X}}_{s} - {cX}_{s} Y_{s}) - a ({\overset{ˇ}{X}}_{s}^{2} - X_{s}^{2})) ds] \\ \leq μ \cdot α 𝔼 \int_{0}^{t \land T_{ϵ}} {({\overset{ˇ}{X}}_{s} - X_{s})}^{+} ds \\ \leq μ \cdot α 𝔼 \int_{0}^{t} {({\overset{ˇ}{X}}_{s}^{ϵ} - X_{s}^{ϵ})}^{+} ds . \end{matrix}

By Gronwall’s Lemma, $𝔼 [{({\overset{ˇ}{X}}_{t}^{ϵ} - X_{t}^{ϵ})}^{+}] = 0$ . As a result, remembering we assumed T = 0, we have ${\overset{ˇ}{X}}_{t}^{ϵ} \leq X_{t}^{ϵ}$ for all t ≥ T. For ε small enough we know that $\overset{ˇ}{X}$ has a stationary distribution concentrated on $ℝ_{+ +}$ . For any sequence a_n → ∞, if the Cesàro averages $\frac{1}{a_{n}} \int_{0}^{a_{n}} ℙ^{(x, y)} {(X_{s}, Y_{s}) \in \cdot} ds$ converge weakly, then by Propositions 6.4 and 6.5 the limit is a distribution of the form φ ⊗ δ₀, where φ is a mixture of the unique stationary distribution $ρ_{\overset{‒}{X}}$ described in Proposition 2.3 and the point mass at 0. By the above, this limit cannot have any mass at (0, 0) because ${\overset{ˇ}{X}}_{t} \leq X_{t}$ for all t ≥ T on the event {Y_t ≤ ε for all t ≥ T}, which has probability at least 1 − η. Since η > 0 was arbitrary, we conclude that $φ = ρ_{\overset{‒}{X}}$ , so the limit is $ρ_{\overset{‒}{X}} \otimes δ_{0}$ , as required.

Appendix C: Proof of Theorem 4.2

Our proof is along the same lines as the proofs of Theorems 4 and 5 in Schreiber et al. (2011).

We will once again simplify our notation by re-writing the SDE for the pair (X, Y) as in (6.6). We assume throughout this appendix that the hypotheses of Theorem 4.2 hold; that is, $I (α, β) > 0$ and $I (β, α) > 0$ .
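To make the hypotheses of Theorem 4.2 concrete, the two invasion rates can be evaluated directly from patch-level parameters. The sketch below assumes, consistently with the formulas used in Appendix B and Appendix D, that $I (α, β) = μ \cdot β - β \cdot Σ β / 2 - ({〈 α, β 〉}_{κ} / {〈 α, α 〉}_{κ}) (μ \cdot α - α \cdot Σ α / 2)$ and that ${〈 u, v 〉}_{κ} = \sum_{i} κ_{i} u_{i} v_{i}$ . The function names and the numerical values are hypothetical and chosen only for illustration.

```python
def dot(u, v):
    """Euclidean inner product of two equal-length sequences."""
    return sum(ui * vi for ui, vi in zip(u, v))

def kappa_inner(u, v, kappa):
    """Weighted inner product <u, v>_kappa = sum_i kappa_i u_i v_i (assumed form)."""
    return sum(k * ui * vi for k, ui, vi in zip(kappa, u, v))

def growth_rate(strategy, mu, Sigma):
    """mu . s - s . Sigma s / 2 for a patch-selection strategy s."""
    Sigma_s = [dot(row, strategy) for row in Sigma]
    return dot(mu, strategy) - 0.5 * dot(strategy, Sigma_s)

def invasion_rate(resident, invader, mu, Sigma, kappa):
    """I(resident, invader): growth rate of the rare invader strategy when
    the resident strategy is established."""
    ratio = (kappa_inner(resident, invader, kappa)
             / kappa_inner(resident, resident, kappa))
    return growth_rate(invader, mu, Sigma) - ratio * growth_rate(resident, mu, Sigma)

# Hypothetical two-patch example with uncorrelated environmental noise.
mu = [1.0, 1.0]
Sigma = [[0.2, 0.0], [0.0, 0.2]]
kappa = [1.0, 1.0]
alpha = [0.9, 0.1]   # morph that mostly uses patch 1
beta = [0.1, 0.9]    # morph that mostly uses patch 2

I_ab = invasion_rate(alpha, beta, mu, Sigma, kappa)
I_ba = invasion_rate(beta, alpha, mu, Sigma, kappa)
mutually_invasible = I_ab > 0 and I_ba > 0
print("I(alpha, beta) =", I_ab)
print("I(beta, alpha) =", I_ba)
print("both invasion rates positive:", mutually_invasible)
```

In this example the two morphs specialize on different patches, so each can increase when rare and the hypotheses of Theorem 4.2 are satisfied. Calling `invasion_rate(alpha, alpha, mu, Sigma, kappa)` returns zero, matching the observation $I (α, α) = 0$ used later in this appendix.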

Let ${(({\overset{‒}{X}}_{t}, {\overset{‒}{Y}}_{t}))}_{t \geq 0}$ be the stochastic process defined by the pair of stochastic differential equations in (6.7) with initial conditions $({\overset{‒}{X}}_{0}, {\overset{‒}{Y}}_{0}) = (X_{0}, Y_{0})$ . We know from Theorem 6.1 that $X_{t} \leq {\overset{‒}{X}}_{t}$ and $Y_{t} \leq {\overset{‒}{Y}}_{t}$ for all t ≥ 0.

Note from Corollary 3.3 that α · (μ − Σα/2) > 0 and β · (μ − Σβ/2) > 0 and hence, by Proposition 2.3, the process $(\overset{‒}{X}, \overset{‒}{Y})$ has a unique stationary distribution on $ℝ_{+ +}^{2}$ and is strongly ergodic.
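The ergodic behavior of the single-morph process invoked here can be illustrated numerically through the time-average identity (6.8). The sketch below is a rough check under stated assumptions rather than a proof: it simulates the logistic SDE for $\overset{‒}{X}$ with an Euler–Maruyama scheme in logarithmic coordinates and compares the empirical time average with $(μ \cdot α - σ_{X}^{2} / 2) / a$ . The scalar parameters `r` (standing for μ · α), `a`, and `sigma` (standing for σ_X), as well as the step size and horizon, are illustrative choices.

```python
import math
import random

def logistic_time_average(r, a, sigma, x0=0.5, n_steps=200_000, dt=0.01, seed=7):
    """Simulate dX = X (r - a X) dt + sigma X dU in log coordinates and
    return the time average (1/T) * integral_0^T X_s ds (left Riemann sum)."""
    rng = random.Random(seed)
    log_x = math.log(x0)
    sqrt_dt = math.sqrt(dt)
    integral = 0.0
    for _ in range(n_steps):
        x = math.exp(log_x)
        integral += x * dt
        log_x += (r - 0.5 * sigma ** 2 - a * x) * dt + sigma * sqrt_dt * rng.gauss(0.0, 1.0)
    return integral / (n_steps * dt)

# Hypothetical parameter values; r - sigma^2 / 2 > 0 so that the population persists.
r, a, sigma = 1.0, 1.0, 0.3
empirical = logistic_time_average(r, a, sigma)
predicted = (r - 0.5 * sigma ** 2) / a
print("empirical time average:", empirical)
print("predicted by (6.8):    ", predicted)
```

The agreement is only approximate for any finite horizon: the discrepancy is governed by the term $σ_{X} U_{t} / t$ , which decays like $t^{- 1 / 2}$ .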

Let

Π_{t} (\cdot) ≔ \frac{1}{t} \int_{0}^{t} 𝟙 {(X_{s}, Y_{s}) \in \cdot} ds

be the normalized occupation measures of (X, Y). We know that the random probability measures

{\overset{‒}{Π}}_{t} (\cdot) ≔ \frac{1}{t} \int_{0}^{t} 𝟙 {({\overset{‒}{X}}_{s}, {\overset{‒}{Y}}_{s}) \in \cdot} ds

converge almost surely and so, in particular, they are tight on $ℝ_{+}^{2} = {[0, \infty)}^{2}$ ; that is, for any ε > 0 we can find a box [0,K] × [0,K] such that

\frac{1}{t} \int_{0}^{t} 𝟙 {({\overset{‒}{X}}_{s}, {\overset{‒}{Y}}_{s}) \in [0, K] \times [0, K]} ds > 1 - ϵ for all t > 0 .

Therefore,

\begin{matrix} \frac{1}{t} \int_{0}^{t} 𝟙 {(X_{s}, Y_{s}) \in [0, K] \times [0, K]} ds & \geq \frac{1}{t} \int_{0}^{t} 𝟙 {({\overset{‒}{X}}_{s}, {\overset{‒}{Y}}_{s}) \in [0, K] \times [0, K]} ds \\ > 1 - ϵ for all t > 0, \end{matrix}

and hence the normalized occupation measures of (X, Y) are also tight on $ℝ_{+}^{2}$ . By Prohorov’s theorem (Kallenberg, 2002, Theorem 16.3), there exists a random probability measure ν on $ℝ_{+}^{2}$ and a (possibly random) sequence $(t_{n}) \subset ℝ_{+ +}$ such that t_n → ∞ for which

Π_{t_{n}} \Rightarrow ν

(6.9)

as n → ∞ almost surely, where ⇒ denotes weak convergence of probability measures on $ℝ_{+}^{2}$ . That is, with probability one, for all bounded and continuous functions $u : ℝ_{+}^{2} \to ℝ$ we have

\int_{ℝ_{+}^{2}} u (x, y) Π_{t_{n}} (dx, dy) \to \int_{ℝ_{+}^{2}} u (x, y) ν (dx, dy)

as n → ∞.

Proposition 6.6. The probability measure ν is almost surely a stationary distribution for (X, Y) thought of as a process with state space $ℝ_{+}^{2}$ .

Proof. Let (P_t)_t≥0 be the semigroup of the process (X, Y) thought of as a process on $ℝ_{+}^{2}$ . For simplicity let us write Z_t := (X_t, Y_t) for all t ≥ 0 and ν_n := Π_{t_n}.

By the Strong Law of Large Numbers for martingales, we have that for all $r \in ℝ_{+}$ and all bounded measurable functions f

\lim_{k \to \infty} \frac{1}{k} \sum_{i = 0}^{k - 1} [f (Z_{r + (i + 1) t}) - P_{t} f (Z_{r + it})] = 0 almost surely .

As a result,

\begin{matrix} \frac{1}{k} (\int_{t}^{(k + 1) t} f (Z_{s}) ds - \int_{0}^{kt} P_{t} f (Z_{s}) ds) & = \frac{1}{k} \sum_{i = 0}^{k - 1} \int_{0}^{t} [f (Z_{r + (i + 1) t}) - P_{t} f (Z_{r + it})] dr \\ & \to 0 as k \to \infty almost surely . \end{matrix}

This implies that

\lim_{u \to \infty} \frac{1}{u} \int_{0}^{u} [f (Z_{s + t}) - P_{t} f (Z_{s})] ds = 0 almost surely .

Thus,

\begin{matrix} \int f d ν - \int P_{t} f d ν & = \lim_{n \to \infty} (\int f d ν_{n} - \int P_{t} f d ν_{n}) \\ = \lim_{n \to \infty} \frac{1}{t_{n}} [\int_{0}^{t_{n}} (f (Z_{s}) - P_{t} f (Z_{s})) ds] \\ = \lim_{n \to \infty} \frac{1}{t_{n}} [\int_{0}^{t_{n} - t} (f (Z_{s + t}) - P_{t} f (Z_{s})) ds + \int_{0}^{t} f (Z_{s}) ds - \int_{t_{n} - t}^{t_{n}} P_{t} f (Z_{s}) ds] \\ = \lim_{n \to \infty} \frac{1}{t_{n}} [\int_{0}^{t_{n} - t} (f (Z_{s + t}) - P_{t} f (Z_{s})) ds] \\ = 0 almost surely . \end{matrix}

(6.10)

The last result is equivalent to saying that ν is almost surely a stationary distribution for (X, Y).

Proposition 6.7. There exists a stationary distribution π of (X, Y) that assigns all of its mass to $ℝ_{+ +}^{2}$ .

Proof. We argue by contradiction. Because the process stays in one of the four sets $ℝ_{+ +}^{2}$ , $ℝ_{+ +} \times {0}$ , ${0} \times ℝ_{+ +}$ , {(0,0)} when it is started in that set, any stationary distribution of (X, Y) thought of as a process on $ℝ_{+}^{2}$ can be written as a convex combination of stationary distributions that respectively assign all of their mass to one of the four sets, should such a stationary distribution exist for the given set. Suppose there is no stationary distribution that is concentrated on $ℝ_{+ +}^{2}$ . Then any stationary distribution is a convex combination of stationary distributions that respectively assign all of their mass to the three sets $ℝ_{+ +} \times {0}$ , ${0} \times ℝ_{+ +}$ , and {(0,0)}, and hence any stationary distribution, in particular ν, is of the form

p_{X} μ_{X} + p_{Y} μ_{Y} + p_{0} δ_{(0, 0)},

where the random variables p_X, p_Y, p₀ are nonnegative and p_X + p_Y + p₀ = 1 almost surely, and $μ_{X} = ρ_{\overset{‒}{X}} \otimes δ_{0}$ and $μ_{Y} = δ_{0} \otimes ρ_{\overset{‒}{Y}}$ for $ρ_{\overset{‒}{X}}$ and $ρ_{\overset{‒}{Y}}$ the unique stationary distributions of $\overset{‒}{X}$ and $\overset{‒}{Y}$ . Next, we proceed as in Proposition 6.5 to find the limit of $\frac{\log X_{t_{n}}}{t_{n}}$ . Let us first argue that

\begin{matrix} \lim_{n \to \infty} \frac{1}{t_{n}} \int_{0}^{t_{n}} X_{s} ds & = \int_{ℝ_{+}^{2}} x ν (dx, dy) \\ \lim_{n \to \infty} \frac{1}{t_{n}} \int_{0}^{t_{n}} Y_{s} ds & = \int_{ℝ_{+}^{2}} y ν (dx, dy) almost surely . \end{matrix}

(6.11)

Note that the infinitesimal generator of (log X, log Y) thought of as a process on $ℝ^{2}$ is uniformly elliptic with smooth coefficients and so it has smooth transition densities (see, for example, Section 3.3.4 of Stroock (2008)). Moreover, an application of a suitable minimum principle for the Kolmogorov forward equation (see, for example, Theorem 5 in Section 2 of Chapter 2 of Friedman (1964)) shows that the transition densities are everywhere strictly positive. It follows that (X, Y) thought of as a process on $ℝ_{+}^{2}$ has smooth transition densities that are everywhere positive.

Because the process $\overset{‒}{X}$ also has smooth, everywhere positive transition densities for similar reasons, the almost sure behavior of $\overset{‒}{X}$ started from a fixed point is the same as when it is started from its stationary distribution $ρ_{\overset{‒}{X}}$ . As a result, we get by Birkhoff’s pointwise ergodic theorem (Kallenberg, 2002, Theorem 10.6) that, for all K > 0,

\lim_{n \to \infty} \frac{1}{t_{n}} \int_{0}^{t_{n}} {\overset{‒}{X}}_{s} 𝟙 {{\overset{‒}{X}}_{s} > K} ds = 𝔼^{ρ_{\overset{‒}{X}}} [{\overset{‒}{X}}_{s} 𝟙 {{\overset{‒}{X}}_{s} > K}]

$ℙ^{x}$ almost surely for any $x \in ℝ_{+}$ . Therefore, by dominated convergence

\lim_{K \to \infty} \lim_{n \to \infty} \frac{1}{t_{n}} \int_{0}^{t_{n}} {\overset{‒}{X}}_{s} 𝟙 {{\overset{‒}{X}}_{s} > K} ds = \lim_{K \to \infty} 𝔼^{ρ_{\overset{‒}{X}}} [{\overset{‒}{X}}_{s} 𝟙 {{\overset{‒}{X}}_{s} > K}] = 0 .

The following inequalities are immediate due to the positivity of the terms

\begin{matrix} \frac{1}{t_{n}} \int_{0}^{t_{n}} X_{s} 𝟙 {X_{s} \leq K} ds & \leq \frac{1}{t_{n}} \int_{0}^{t_{n}} X_{s} ds \\ = \frac{1}{t_{n}} \int_{0}^{t_{n}} X_{s} 𝟙 {X_{s} \leq K} ds + \frac{1}{t_{n}} \int_{0}^{t_{n}} X_{s} 𝟙 {X_{s} > K} ds . \end{matrix}

(6.12)

Recall that $X_{t} \leq {\overset{‒}{X}}_{t}$ for all t ≥ 0 and hence

\frac{1}{t_{n}} \int_{0}^{t_{n}} X_{s} 𝟙 {X_{s} > K} ds \leq \frac{1}{t_{n}} \int_{0}^{t_{n}} {\overset{‒}{X}}_{s} 𝟙 {{\overset{‒}{X}}_{s} > K} ds .

This implies

\underset{n \to \infty}{\lim \sup} \frac{1}{t_{n}} \int_{0}^{t_{n}} X_{s} 𝟙 {X_{s} > K} ds \leq \underset{n \to \infty}{\lim \sup} \frac{1}{t_{n}} \int_{0}^{t_{n}} {\overset{‒}{X}}_{s} 𝟙 {{\overset{‒}{X}}_{s} > K} ds,

and therefore

\begin{matrix} 0 & \leq \lim_{K \to \infty} \underset{n \to \infty}{\lim \sup} \frac{1}{t_{n}} \int_{0}^{t_{n}} X_{s} 𝟙 {X_{s} > K} ds \\ \leq \lim_{K \to \infty} \underset{n \to \infty}{\lim \sup} \frac{1}{t_{n}} \int_{0}^{t_{n}} {\overset{‒}{X}}_{s} 𝟙 {{\overset{‒}{X}}_{s} > K} ds = 0 . \end{matrix}

(6.13)

By (6.9) and Theorem 4.27 of Kallenberg (2002),

\lim_{n \to \infty} \frac{1}{t_{n}} \int_{0}^{t_{n}} X_{s} 𝟙 {X_{s} \leq K} ds = \int_{ℝ_{+}^{2}} x 𝟙 {x \leq K} ν (dx, dy)

for any K such that

ν ({K} \times ℝ_{+}) = 0 .

While this last condition need not hold a priori for all K, we can only have

ν ({K} \times ℝ_{+}) > 0

for countably many K, so there exists a sequence $(K_{m}) \subset ℝ_{+}$ such that K_m → ∞ as m → ∞ with

ν ({K_{m}} \times ℝ_{+}) = 0 .

By dominated convergence,

\begin{matrix} \lim_{m \to \infty} \lim_{n \to \infty} \frac{1}{t_{n}} \int_{0}^{t_{n}} X_{s} 𝟙 {X_{s} \leq K_{m}} ds & = \lim_{m \to \infty} \int_{ℝ_{+}^{2}} x 𝟙 {x \leq K_{m}} ν (dx, dy) \\ & = \int_{ℝ_{+}^{2}} x ν (dx, dy) . \end{matrix}

(6.14)

Combining (6.12), (6.13) and (6.14) gives (6.11).

It follows from Itô’s formula, the observation $I (α, α) = 0$ , (6.11), and the fact that $\lim_{n \to \infty} \frac{U_{t_{n}}}{t_{n}} = 0$ that

\begin{matrix} \lim_{n \to \infty} \frac{\log X_{t_{n}}}{t_{n}} & = μ \cdot α - \frac{σ_{X}^{2}}{2} - 𝔼^{ν} [{aX}_{t} + {cY}_{t}] \\ & = p_{X} (μ \cdot α - a 𝔼^{ρ_{\overset{‒}{X}}} [X_{t}] - \frac{σ_{X}^{2}}{2}) + p_{Y} (μ \cdot α - c 𝔼^{ρ_{\overset{‒}{Y}}} [Y_{t}] - \frac{σ_{X}^{2}}{2}) + p_{0} (μ \cdot α - \frac{σ_{X}^{2}}{2}) \\ & = p_{X} I (α, α) + p_{Y} I (β, α) + p_{0} (μ \cdot α - \frac{σ_{X}^{2}}{2}) \\ & = p_{Y} I (β, α) + p_{0} (μ \cdot α - \frac{σ_{X}^{2}}{2}) almost surely . \end{matrix}

By assumption, $I (β, α) > 0$ and we have already observed that $μ \cdot α - \frac{σ_{X}^{2}}{2} > 0$ . Because ${\overset{‒}{X}}_{t}$ converges in distribution as t → ∞ to a distribution that assigns all of its mass to $ℝ_{+ +}$ , it follows that $\frac{\log {\overset{‒}{X}}_{t_{n}}}{t_{n}}$ converges in probability to 0. However, since $X_{t} \leq {\overset{‒}{X}}_{t}$ for all t ≥ 0 it follows that $p_{Y} I (β, α) + p_{0} (μ \cdot α - \frac{σ_{X}^{2}}{2}) \leq 0$ and hence

p_{Y} = p_{0} = 0 almost surely .

(6.15)

The same argument applied to (Y_t)_t≥0 establishes

p_{X} = p_{0} = 0 almost surely .

(6.16)

Therefore, p_X = p_Y = p₀ = 0, and this contradicts the assumption that p_X + p_Y + p₀ = 1.

We can now finish the proof of Theorem 4.2.

Proof. Proposition 6.7 implies that (X, Y) has a stationary distribution π on $ℝ_{+ +}^{2}$ . By Theorem 20.17 from Kallenberg (2002), our process (X, Y) is either Harris recurrent or uniformly transient. We say that (X_t, Y_t) → ∞ almost surely as t → ∞ if $𝟙_{K} (X_{t}, Y_{t}) \to 0$ as t → ∞ for any compact set $K \subset ℝ_{+ +}^{2}$ . Theorem 20.21 from Kallenberg (2002) gives that if (X,Y) is transient, then (X_t, Y_t) → ∞ and so (X, Y) cannot have a stationary distribution. Hence, since we know our process has a stationary distribution π, it must be Harris recurrent. Theorem 20.21 from Kallenberg (2002) then gives us equation (4.1).

Theorem 20.18 from Kallenberg (2002) gives that any Harris recurrent Feller process on $ℝ_{+ +}^{2}$ with strictly positive transition densities has a locally finite invariant measure that is equivalent to Lebesgue measure and is unique up to a normalization. We already know that we have a stationary distribution, so this distribution is unique and has an almost everywhere strictly positive density with respect to Lebesgue measure. Theorem 20.12 from Kallenberg (2002) says that any Harris recurrent Feller process is strongly ergodic, and so equation (4.2) holds.

Remark 6.8. In Theorem 3.1 of Zhang and Chen (2013), the authors claim to show that the system of SDEs describing (X, Y) always has a unique stationary distribution. We note that their use of moments just checks tightness in $ℝ_{+}^{2} ≔ {[0, \infty)}^{2}$ and not in $ℝ_{+ +}^{2} = {(0, \infty)}^{2}$ . It does not stop mass going off to $ℝ_{+}^{2} \setminus ℝ_{+ +}^{2} = (ℝ_{+} \times {0}) \cup ({0} \times ℝ_{+})$ , which is exactly what can happen in our case. Thus, their proof only shows the existence of a stationary distribution on $ℝ_{+}^{2}$ ; it does not show the existence of a stationary distribution on $ℝ_{+ +}^{2}$ . Furthermore, their proof of the uniqueness of a stationary distribution on $ℝ_{+}^{2}$ breaks down because their assumption of irreducibility is false. The process (X, Y) is irreducible on $ℝ_{+ +}^{2}$ , but it is not irreducible on $ℝ_{+}^{2}$ since $P_{t} ((0, 0), U) ≔ ℙ^{(0, 0)} {(X_{t}, Y_{t}) \in U} = 0$ for any open subset U that lies in the interior of $ℝ_{+}^{2}$ . If we work on $ℝ_{+}^{2}$ , it is not true that the diffusion (X, Y) has a unique stationary distribution. We can obtain infinitely many stationary distributions on $ℝ_{+}^{2}$ of the form $(u ρ_{\overset{‒}{X}} + v δ_{0}) \otimes δ_{0}$ where $ρ_{\overset{‒}{X}}$ is the unique stationary distribution of $\overset{‒}{X}$ on $ℝ_{+ +}$ and $u, v \in ℝ_{+}$ satisfy u + v = 1.

Appendix D: Proof of Theorem 5.1

Assume that the matrix Σ is positive definite and that the dispersion proportion vector α is such that μ · α − α · Σα/2 > 0 so that a population playing the strategy α persists. Under these assumptions the function $β \mapsto I (α, β)$ is strictly concave. Hence, by the method of Lagrange multipliers, $I (α, β) < 0$ for all β ≠ α and α_i > 0 for all i if and only if there exists a constant, which we denote by λ, such that

λ = {\frac{\partial I}{\partial β_{i}} (α, β) ∣}_{β = α} = μ_{i} - κ_{i} α_{i} (μ \cdot α - α \cdot Σ α ∕ 2) ∕ {〈 α, α 〉}_{κ} - \sum_{j} α_{j} σ_{ij}

(6.17)

for all i. Multiplying (6.17) by α_i and summing with respect to i, we get

\begin{matrix} λ & = μ \cdot α - {〈 α, α 〉}_{κ} (μ \cdot α - α \cdot Σ α ∕ 2) ∕ {〈 α, α 〉}_{κ} - α \cdot Σ α \\ = - α \cdot Σ α ∕ 2 \end{matrix}

This expression for the Lagrange multiplier and (6.17) provide the characterization of a mixed ESS in equation (5.1) when α_i > 0 for all i. The characterization of the more general case of α_i > 0 for at least two patches follows similarly by restricting the method of Lagrange multipliers to the appropriate face of the probability simplex.

Suppose that μ_i − σ_ii/2 > 0 so that a population remaining in patch i and not dispersing to other patches persists. The strategy α_i = 1 and α_j = 0 for all j ≠ i is an ESS only if

{\frac{\partial I}{\partial β_{j}} (α, β) ∣}_{β = α} - {\frac{\partial I}{\partial β_{i}} (α, β) ∣}_{β = α} < 0

for all j ≠ i. Evaluating these partial derivatives gives the criterion (5.2) for the pure ESS.

We conclude by considering the case n = 2. Define the function $g : [0, 1] \to ℝ$ by

g (a) = {\frac{\partial I}{\partial β_{1}} ((a_{1}, a_{2}), (b_{1}, b_{2})) ∣}_{(a_{1}, a_{2}) = (a, 1 - a), (b_{1}, b_{2}) = (a, 1 - a)} - {\frac{\partial I}{\partial β_{2}} ((a_{1}, a_{2}), (b_{1}, b_{2})) ∣}_{(a_{1}, a_{2}) = (a, 1 - a), (b_{1}, b_{2}) = (a, 1 - a)} .

The inequalities (5.2) for the pure strategies (1, 0) and (0, 1) correspond to g(1) > 0 and g(0) < 0, respectively. Hence, when both inequalities are reversed, so that g(0) > 0 and g(1) < 0, the intermediate value theorem implies there exists a ∈ (0, 1) such that g(a) = 0. Such an a satisfies the mixed ESS criterion (5.1) and, therefore, is an ESS.
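The intermediate value argument suggests a simple numerical procedure for the two-patch case. The sketch below evaluates the partial derivatives from (6.17) and locates a zero of g by bisection when g(0) > 0 > g(1). It assumes ${〈 α, α 〉}_{κ} = \sum_{i} κ_{i} α_{i}^{2}$ ; the function names and parameter values are hypothetical and serve only as an illustration.

```python
def grad_invasion(alpha, mu, Sigma, kappa):
    """Partial derivatives dI/dbeta_i (alpha, beta) at beta = alpha, following (6.17)."""
    n = len(alpha)
    norm_kappa = sum(kappa[i] * alpha[i] ** 2 for i in range(n))
    Sigma_alpha = [sum(Sigma[i][j] * alpha[j] for j in range(n)) for i in range(n)]
    growth = (sum(mu[i] * alpha[i] for i in range(n))
              - 0.5 * sum(alpha[i] * Sigma_alpha[i] for i in range(n)))
    return [mu[i] - kappa[i] * alpha[i] * growth / norm_kappa - Sigma_alpha[i]
            for i in range(n)]

def g(a, mu, Sigma, kappa):
    """Difference of the two partial derivatives at the strategy (a, 1 - a)."""
    d1, d2 = grad_invasion([a, 1.0 - a], mu, Sigma, kappa)
    return d1 - d2

def two_patch_mixed_ess(mu, Sigma, kappa, tol=1e-12):
    """Bisection for a zero of g on (0, 1), assuming g(0) > 0 > g(1)."""
    lo, hi = 0.0, 1.0
    if not (g(lo, mu, Sigma, kappa) > 0.0 > g(hi, mu, Sigma, kappa)):
        raise ValueError("need g(0) > 0 > g(1): neither pure strategy may satisfy (5.2)")
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if g(mid, mu, Sigma, kappa) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Hypothetical example: patch 1 is more productive but also more variable.
mu = [1.0, 0.5]
Sigma = [[0.5, 0.0], [0.0, 0.1]]
kappa = [1.0, 1.0]

a_star = two_patch_mixed_ess(mu, Sigma, kappa)
alpha_star = [a_star, 1.0 - a_star]
d1, d2 = grad_invasion(alpha_star, mu, Sigma, kappa)
# Value -alpha . Sigma alpha / 2 that the Lagrange multiplier takes in Appendix D.
lagrange = -0.5 * sum(alpha_star[i] * Sigma[i][j] * alpha_star[j]
                      for i in range(2) for j in range(2))
print("candidate mixed ESS:", alpha_star)
print("partial derivatives:", d1, d2, " multiplier:", lagrange)
```

At the computed strategy both partial derivatives agree, up to the bisection tolerance, with the multiplier λ = −α · Σα/2 obtained above. Bisection only needs the sign change of g; whether the zero is unique depends on the parameters and is not checked by this sketch.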

Contributor Information

STEVEN N. EVANS, Department of Statistics #3860, 367 Evans Hall, University of California, Berkeley, CA 94720-3860, USA, evans@stat.berkeley.edu

ALEXANDRU HENING, Department of Statistics, 1 South Parks Road, Oxford OX1 3TG, United Kingdom, hening@stats.ox.ac.uk.

SEBASTIAN J. SCHREIBER, Department of Evolution and Ecology, University of California, Davis, CA 95616, USA, sschreiber@ucdavis.edu

References

Anderson JT, Geber MA. Demographic source-sink dynamics restrict local adaptation in Elliott’s blueberry (Vaccinium elliottii). Evolution. 2010;64:370–384. doi: 10.1111/j.1558-5646.2009.00825.x.
Beckmann JP, Berger J. Using black bears to test ideal-free distribution models experimentally. Journal of Mammalogy. 2003;84:594–606.
Cantrell RS, Cosner C, Deangelis DL, Padron V. The ideal free distribution as an evolutionarily stable strategy. Journal of Biological Dynamics. 2007;1:249–271. doi: 10.1080/17513750701450227.
Cantrell RS, Cosner C, Lou Y. Evolution of dispersal and the ideal free distribution. Mathematical Biosciences and Engineering. 2010;7:17–36. doi: 10.3934/mbe.2010.7.17.
Cantrell RS, Cosner C, Lou Y. Evolutionary stability of ideal free dispersal strategies in patchy environments. Journal of Mathematical Biology. 2012;65:943–965. doi: 10.1007/s00285-011-0486-5.
Chesson PL. General theory of competitive coexistence in spatially-varying environments. Theoretical Population Biology. 2000;58:211–237. doi: 10.1006/tpbi.2000.1486.
Childs DZ, Metcalf CJE, Rees M. Evolutionary bet-hedging in the real world: empirical evidence and challenges revealed by plants. Proceedings of the Royal Society B: Biological Sciences. 2010;277:3055–3064. doi: 10.1098/rspb.2010.0707.
Cosner C. A dynamic model for the ideal-free distribution as a partial differential equation. Theoretical Population Biology. 2005;67:101–108. doi: 10.1016/j.tpb.2004.09.002.
Cressman R, Křivan V. Migration dynamics for the ideal free distribution. American Naturalist. 2006;168:384–397. doi: 10.1086/506970.
Cressman R, Křivan V. The ideal free distribution as an evolutionarily stable state in density-dependent population games. Oikos. 2010;119:1231–1242.
Cressman R, Křivan V, Garay J. Ideal free distributions, evolutionary games, and population dynamics in multiple-species environments. American Naturalist. 2004;164:473–489. doi: 10.1086/423827.
Doncaster CP, Clobert J, Doligez B, Danchin E, Gustafsson L. Balanced dispersal between spatially varying local populations: an alternative to the source-sink model. American Naturalist. 1997;150(4):425–445. doi: 10.1086/286074.
Dreisig H. Ideal free distributions of nectar foraging bumblebees. Oikos. 1995;72:161–172.
Edelaar P, Bolnick DI. Non-random gene flow: an underappreciated force in evolution and ecology. Trends in Ecology & Evolution. 2012;27:659–665. doi: 10.1016/j.tree.2012.07.009.
Ethier SN, Kurtz TG. Markov Processes: Characterization and Convergence. Wiley; Hoboken, NJ: 2005.
Evans SN, Ralph P, Schreiber SJ, Sen A. Stochastic growth rates in spatiotemporal heterogeneous environments. Journal of Mathematical Biology. 2013;66:423–476. doi: 10.1007/s00285-012-0514-0.
Fox LR, Eisenbach J. Contrary choices: possible exploitation of enemy-free space by herbivorous insects in cultivated vs. wild crucifers. Oecologia. 1992;89:574–579. doi: 10.1007/BF00317166.
Fretwell SD, Lucas HL Jr. On territorial behavior and other factors influencing habitat distribution in birds. Acta Biotheoretica. 1969;19:16–36.
Friedman A. Partial differential equations of parabolic type. Prentice-Hall Inc.; Englewood Cliffs, N.J.: 1964.
Gejji R, Lou Y, Munther D, Peyton J. Evolutionary convergence to ideal free dispersal strategies and coexistence. Bulletin of Mathematical Biology. 2012;74:257–299. doi: 10.1007/s11538-011-9662-4.
Geritz SAH, Metz JAJ, Kisdi E, Meszena G. Dynamics of adaptation and evolutionary branching. Physical Review Letters. 1997;78:2024–2027.
Godin JJ, Keenleyside MHA. Foraging on patchily distributed prey by a cichlid fish (Teleostei, Cichlidae): a test of the ideal free distribution theory. Animal Behaviour. 1984;32:120–131.
Harper DGC. Competitive foraging in mallards: Ideal free ducks. Animal Behaviour. 1982;30:575–584.
Hastings A. Can spatial variation alone lead to selection for dispersal? Theoretical Population Biology. 1983;24:244–251.
Haugen TO, Winfield IJ, Vøllestad LA, Fletcher JM, James JB, Stenseth NC. The ideal free pike: 50 years of fitness-maximizing dispersal in Windermere. Proceedings of the Royal Society B: Biological Sciences. 2006;273:2917–2924. doi: 10.1098/rspb.2006.3659.
Holt RD. On the evolutionary stability of sink populations. Evolutionary Ecology. 1997;11:723–731.
Holt RD, Barfield M. On the relationship between the ideal free distribution and the evolution of dispersal. In: Clobert J, Danchin E, Dhondt A, Nichols J, editors. Dispersal. Oxford University Press; USA: 2001. pp. 83–95.
Ikeda N, Watanabe S. Stochastic differential equations and diffusion processes, volume 24 of North-Holland Mathematical Library. Second edition. North-Holland Publishing Co.; Amsterdam: 1989.
Jaenike J. Genetic and environmental determinants of food preference in Drosophila tripunctata. Evolution. 1985;39:362–369. doi: 10.1111/j.1558-5646.1985.tb05673.x.
Jaenike J, Holt RD. Genetic variation for habitat preference: evidence and explanations. American Naturalist. 1991;137:S67–S90.
Jansen VAA, Yoshimura J. Populations can persist in an environment consisting of sink habitats only. Proceedings of the National Academy of Sciences USA. 1998;95:3696–3698. doi: 10.1073/pnas.95.7.3696.
Kallenberg O. Foundations of Modern Probability. Springer; New York: 2002.
Katzenberger GS. Solutions of a stochastic differential equation forced onto a manifold by a large drift. The Annals of Probability. 1991;19:1587–1628.
Křivan V. Dynamic ideal free distribution: effects of optimal patch choice on predator-prey dynamics. American Naturalist. 1997;149:164–178.
Le Gall J-F. Applications du temps local aux équations différentielles stochastiques unidimensionnelles. In: Seminar on probability, XVII, volume 986 of Lecture Notes in Math. Springer; Berlin: 1983. pp. 15–31.
Li X, Mao X. Population dynamical behavior of non-autonomous Lotka-Volterra competitive system with random perturbation. Discrete and Continuous Dynamical Systems. 2009;24:523–545.
Liu M, Wang K, Wu Q. Survival analysis of stochastic competitive models in a polluted environment and stochastic competitive exclusion principle. Bulletin of Mathematical Biology. 2011;73:1969–2012. doi: 10.1007/s11538-010-9569-5.
Maynard Smith J, Price GR. The logic of animal conflict. Nature. 1973;246:15–18.
Mayr E. Animal species and evolution. Harvard University Press; 1963.
McPeek MA, Holt RD. The evolution of dispersal in spatially and temporally varying environments. American Naturalist. 1992;6:1010–1027.
Milinski M. An evolutionarily stable feeding strategy in sticklebacks. Zeitschrift für Tierpsychologie. 1979;51:36–40.
Oksanen T, Power ME, Oksanen L. Ideal free habitat selection and consumer-resource dynamics. American Naturalist. 1995;146:565–585.
Orians GH, Wittenberger JF. Spatial and temporal scales in habitat selection. American Naturalist. 1991;137:S29–S49.
Prout T. Sufficient conditions for multiple niche polymorphism. American Naturalist. 1968;102:493–496.
Ravigné V, Olivieri I, Dieckmann U. Implications of habitat choice for protected polymorphisms. Evolutionary Ecology Research. 2004;6:125–145.
Robinson HS, Wielgus RB, Cooley HS, Cooley SW. Sink populations in carnivore management: Cougar demography and immigration in a hunted population. Ecological Applications. 2008;18:1028–1037. doi: 10.1890/07-0352.1.
Rogers LCG, Williams D. Diffusions, Markov processes, and martingales. Vol. 2: Itô calculus. Cambridge Mathematical Library. Cambridge University Press; Cambridge: 2000. Reprint of the second (1994) edition.
Rosenzweig ML. A theory of habitat selection. Ecology. 1981;62:327–335.
Schreiber SJ. Evolution of patch selection in stochastic environments. American Naturalist. 2012;180:17–34. doi: 10.1086/665655.
Schreiber SJ, Vejdani M. Handling time promotes the coevolution of aggregation in predator-prey systems. Proceedings of the Royal Society: Biological Sciences. 2006;273:185–191. doi: 10.1098/rspb.2005.3236.
Schreiber SJ, Fox LR, Getz WM. Coevolution of contrary choices in host-parasitoid systems. American Naturalist. 2000;155:637–648. doi: 10.1086/303347.
Schreiber SJ, Fox LR, Getz WM. Parasitoid sex allocation affects coevolution of patch selection in host-parasitoid systems. Evolutionary Ecology Research. 2002;4:701–718.
Schreiber SJ, Benaïm M, Atchadé KAS. Persistence in fluctuating environments. Journal of Mathematical Biology. 2011;62:655–683. doi: 10.1007/s00285-010-0349-5.
Sokurenko EV, Gomulkiewicz R, Dykhuizen DE. Source–sink dynamics of virulence evolution. Nature Reviews Microbiology. 2006;4:548–555. doi: 10.1038/nrmicro1446.
Stroock DW. Partial differential equations for probabilists, volume 112 of Cambridge Studies in Advanced Mathematics. Cambridge University Press; Cambridge: 2008.
Tittler R, Fahrig L, Villard MA. Evidence of large-scale source-sink dynamics and long-distance dispersal among Wood Thrush populations. Ecology. 2006;87:3029–3036. doi: 10.1890/0012-9658(2006)87[3029:eolsda]2.0.co;2.
Tregenza T. Building on the ideal free distribution. Advances in Ecological Research. 1995;26:253–307.
Turelli M, Schemske DW, Bierzychudek P. Stable two-allele polymorphisms maintained by fluctuating fitnesses and seed banks: protecting the blues in Linanthus parryae. Evolution. 2001;55:1283–1298. doi: 10.1111/j.0014-3820.2001.tb00651.x.
van Baalen M, Sabelis MW. Coevolution of patch selection strategies of predator and prey and the consequences for ecological stability. American Naturalist. 1993;142:646–670. doi: 10.1086/285562.
van Baalen M, Křivan V, van Rijn PCJ, Sabelis MW. Alternative food, switching predators, and the persistence of predator-prey systems. American Naturalist. 2001;157:512–524. doi: 10.1086/319933.
Via S. Ecological genetics and host adaptation in herbivorous insects: The experimental study of evolution in natural and agricultural systems. Annual Review of Entomology. 1990;35:421–446. doi: 10.1146/annurev.en.35.010190.002225.
Zhang Z, Chen D. A new criterion on existence and uniqueness of stationary distribution for diffusion processes. Advances in Difference Equations. 2013;2013:13.

