Stochastic population growth in spatially heterogeneous environments: the density-dependent case

Alexandru Hening; Dang H Nguyen; George Yin

2017 Jul 3;76(3):697–754. doi: 10.1007/s00285-017-1153-2

PMCID: PMC5772867 PMID: 28674928

Abstract

This work is devoted to studying the dynamics of a structured population that is subject to the combined effects of environmental stochasticity, competition for resources, spatio-temporal heterogeneity and dispersal. The population is spread throughout n patches whose population abundances are modeled as the solutions of a system of nonlinear stochastic differential equations living on ${[0, \infty)}^{n}$ . We prove that r, the stochastic growth rate of the total population in the absence of competition, determines the long-term behaviour of the population. The parameter r can be expressed as the Lyapunov exponent of an associated linearized system of stochastic differential equations. Detailed analysis shows that if $r > 0$ , the population abundances converge polynomially fast to a unique invariant probability measure on ${(0, \infty)}^{n}$ , while when $r < 0$ , the population abundances of the patches converge almost surely to 0 exponentially fast. This generalizes and extends the results of Evans et al. (J Math Biol 66(3):423–476, 2013) and proves one of their conjectures. Compared to recent developments, our model incorporates very general density-dependent growth rates and competition terms. Furthermore, we prove that persistence is robust to small, possibly density dependent, perturbations of the growth rates, dispersal matrix and covariance matrix of the environmental noise. We also show that the stochastic growth rate depends continuously on the coefficients. Our work allows the environmental noise driving our system to be degenerate. This is relevant from a biological point of view since, for example, the environments of the different patches can be perfectly correlated. We show how one can adapt the nondegenerate results to the degenerate setting. As an example we fully analyze the two-patch case, $n = 2$ , and show that the stochastic growth rate is a decreasing function of the dispersion rate. 
In particular, coupling two sink patches can never yield persistence, in contrast to the results from the non-degenerate setting treated by Evans et al. which show that sometimes coupling by dispersal can make the system persistent.

Keywords: Stochastic population growth, Density-dependence, Ergodicity, Spatial and temporal heterogeneity, Lotka–Volterra model, Lyapunov exponent, Habitat fragmentation, Stochastic environment, Dispersion

Introduction

The survival of an organism is influenced by both biotic (competition for resources, predator-prey interactions) and abiotic (light, precipitation, availability of resources) factors. Since these factors are space-time dependent, all types of organisms have to choose their dispersal strategies: If they disperse they can arrive in locations with different environmental conditions while if they do not disperse they face the temporal fluctuations of the local environmental conditions. The dispersion strategy impacts key attributes of a population including its spatial distribution and temporal fluctuations in its abundance. Individuals selecting more favorable habitats are more likely to survive or reproduce. When population densities increase in these habitats, organisms may prosper by selecting habitats that were previously unused. There have been numerous studies of the interplay between dispersal and environmental heterogeneity and how this influences population growth; see Hastings (1983), Gonzalez and Holt (2002), Schmidt (2004), Roy et al. (2005), Schreiber (2010), Cantrell et al. (2012), Durrett and Remenik (2012), Evans et al. (2013) and references therein. The mathematical analysis for stochastic models with density-dependent feedbacks is less explored. In the setting of discrete-space discrete-time models there have been thorough studies by Benaïm and Schreiber (2009); Schreiber (2010); Schreiber et al. (2011). Continuous-space discrete-time population models that disperse and experience uncorrelated, environmental stochasticity have been studied by Hardin et al. (1988a, b, 1990). They show that the leading Lyapunov exponent r of the linearization of the system around the extinction state almost determines the persistence and extinction of the population. For continuous-space continuous-time population models Mierczyński and Shen (2004) study the dynamics of random Kolmogorov type PDE models in bounded domains. 
Once again, it is shown that the leading Lyapunov exponent r of the linearization around the trivial equilibrium 0 almost determines when the population goes extinct and when it persists. In the current paper we explore the question of persistence and extinction when the population dynamics are given by a system of stochastic differential equations. In our setting, even though our methods and techniques are very different from those used by Hardin et al. (1988a) and Mierczyński and Shen (2004), we still make use of the system linearized around the extinction state. The Lyapunov exponent of this linearized system plays a key role throughout our arguments.

Evans et al. (2013) studied a linear stochastic model that describes the dynamics of populations that continuously experience uncertainty in time and space. Their work has shed some light on key issues from population biology. Their results provide fundamental insights into “ideal free” movement in the face of uncertainty, the evolution of dispersal rates, the single large or several small (SLOSS) debate in conservation biology, and the persistence of coupled sink populations. In this paper, we propose a density-dependent model of stochastic population growth that captures the interactions between dispersal and environmental heterogeneity and complements the work of Evans et al. (2013). We then present a rigorous and comprehensive study of the proposed model based on stochastic analysis.

The dynamics of a population in nature is stochastic. This is due to environmental stochasticity—the fluctuations of the environment make the growth rates random. One of the simplest models for a population living in a single patch is

\begin{matrix} d U (t) = U (t) (a - b U (t)) d t + σ U (t) d W (t), t \geq 0, \end{matrix}

1.1

where U(t) is the population abundance at time t, a is the mean per-capita growth rate, $b > 0$ is the strength of intraspecific competition, $σ^{2}$ is the infinitesimal variance of fluctuations in the per-capita growth rate and ${(W (t))}_{t \geq 0}$ is a standard Brownian motion. The long-term behavior of (1.1) is determined by the stochastic growth rate $a - \frac{σ^{2}}{2}$ in the following way (see Evans et al. 2015; Dennis and Patil 1984):

If $a - \frac{σ^{2}}{2} > 0$ and $U (0) = u > 0$, then ${(U (t))}_{t \geq 0}$ converges weakly to its unique invariant probability measure $ρ$ on $(0, \infty)$.
If $a - \frac{σ^{2}}{2} < 0$ and $U (0) = u > 0$, then $\lim_{t \to \infty} U (t) = 0$ almost surely.
If $a - \frac{σ^{2}}{2} = 0$ and $U (0) = u > 0$, then $\liminf_{t \to \infty} U (t) = 0$ almost surely, $\limsup_{t \to \infty} U (t) = \infty$ almost surely, and $\lim_{t \to \infty} \frac{1}{t} \int_{0}^{t} U (s) d s = 0$ almost surely.
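The trichotomy above can be explored numerically. The following is a minimal sketch, assuming an Euler-Maruyama discretization of $\ln U (t)$; the function names, step size and parameter values are illustrative and are not taken from the paper. By Itô's formula, $d \ln U = (a - σ^{2} / 2 - b U) d t + σ d W$, so the sign of $a - σ^{2} / 2$ decides in which direction $\ln U$ drifts at low density.

```python
import math
import random


def stochastic_growth_rate(a, sigma):
    """Return a - sigma^2 / 2, the quantity that governs the long-term behavior of (1.1)."""
    return a - 0.5 * sigma ** 2


def simulate_log_abundance(a, b, sigma, u0, horizon, dt, seed=0):
    """Euler-Maruyama scheme for ln U(t); working on the log scale keeps U(t) positive.

    Illustrative discretization (an assumption of this sketch), based on
    d ln U = (a - sigma^2/2 - b U) dt + sigma dW.
    """
    rng = random.Random(seed)
    log_u = math.log(u0)
    for _ in range(int(round(horizon / dt))):
        drift = stochastic_growth_rate(a, sigma) - b * math.exp(log_u)
        log_u += drift * dt + sigma * math.sqrt(dt) * rng.gauss(0.0, 1.0)
    return log_u


# Example: a - sigma^2/2 = 0.1 - 0.5 < 0, so ln U(t) should drift to -infinity.
final_log_u = simulate_log_abundance(a=0.1, b=1.0, sigma=1.0, u0=1.0, horizon=200.0, dt=0.01)
```

With a positive value of `stochastic_growth_rate(a, sigma)` the same routine can be used to sample the fluctuations of $\ln U (t)$ around its stationary regime, although a single path is of course only suggestive.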

Organisms are always affected by temporal heterogeneities, but they are subject to spatial heterogeneities only when they disperse. Population growth is influenced by spatial heterogeneity through the way organisms respond to environmental signals (see Hastings 1983; Cantrell and Cosner 1991; Chesson 2000; Schreiber and Lloyd-Smith 2009). There have been several analytic studies that contributed to a better understanding of the separate effects of spatial and temporal heterogeneities on population dynamics. However, few theoretical studies have considered the combined effects of spatio-temporal heterogeneities, dispersal, and density-dependence for discretely structured populations with continuous-time dynamics.

As seen in both the continuous (Evans et al. 2013) and the discrete (Palmqvist and Lundberg 1998) settings, the extinction risk of a population is greatly affected by the spatio-temporal correlation between the environment in the different patches. For example, if spatial correlations are weak, one can show that populations coupled via dispersal can survive even though every patch, on its own, would go extinct (see Evans et al. 2013; Jansen and Yoshimura 1998; Harrison and Quinn 1989). Various species usually exhibit spatial synchrony. Ecologists are interested in this pattern as it can lead to the extinction of rare species. Possible causes for synchrony are dispersal and spatial correlations in the environment (see Legendre 1993; Kendall et al. 2000; Liebhold et al. 2004). Consequently, it makes sense to look at stochastic patch models coupled by dispersion for which the environmental noise of the different patches can be strongly correlated. We do this by extending the setting of Evans et al. (2013) by allowing the environmental noise driving the system to be degenerate.

The rest of the paper is organized as follows. In Sect. 2, we introduce our model for a population living in a patchy environment. It takes into account the dispersal between different patches and density-dependent feedback. The temporal fluctuations of the environmental conditions of the various patches are modeled by Brownian motions that are correlated. We start by considering the relative abundances of the different patches in a low density approximation. We show that these relative abundances converge in distribution to their unique invariant probability measure asymptotically as time goes to infinity. Using this invariant probability measure we derive an expression for r, the stochastic growth rate (Lyapunov exponent) in the absence of competition. We show that this r is key in analyzing the long-term behavior of the populations. In Appendix A we show that if $r > 0$ then the abundances converge weakly, polynomially fast, to their unique invariant probability measure on ${(0, \infty)}^{n}$ . In Appendix B, we show that if $r < 0$ then all the population abundances go extinct asymptotically, at an exponential rate (with exponential constant r). Appendix C is dedicated to the case when the noise driving our system is degenerate (that is, the dimension of the noise is lower than the number of patches). In Appendix D, we show that r depends continuously on the coefficients of our model and that persistence is robust—that is, small perturbations of the model do not make a persistent system become extinct. We provide some numerical examples and possible generalizations in Sect. 4.

Model and results

We study a population with overlapping generations that lives in a spatio-temporally heterogeneous environment consisting of n distinct patches. The growth rate in each patch is determined by both deterministic and stochastic environmental inputs. We denote by $X_{i} (t)$ the population abundance at time $t \geq 0$ of the ith patch and write $X (t) = (X_{1} (t), \dots, X_{n} (t))$ for the vector of population abundances. Following Evans et al. (2013), it is appropriate to model $X (t)$ as a Markov process with the following properties when $0 \leq Δ t \ll 1$:

the conditional mean is
$\begin{matrix} E [X_{i} (t + Δ t) - X_{i} (t) \mid X (t) = x] \approx [a_{i} x_{i} - x_{i} b_{i} (x_{i}) + \sum_{j \neq i} (x_{j} D_{j i} - x_{i} D_{i j})] Δ t, \end{matrix}$
where $a_{i} \in R$ is the per-capita growth rate in the ith patch, $b_{i} (x_{i})$ is the per-capita strength of intraspecific competition in patch i when the abundance of the patch is $x_{i}$, and $D_{i j} \geq 0$ is the dispersal rate from patch i to patch j;
the conditional covariance is
$\begin{matrix} \operatorname{Cov} [X_{i} (t + Δ t) - X_{i} (t), X_{j} (t + Δ t) - X_{j} (t) \mid X (t) = x] \approx σ_{i j} x_{i} x_{j} Δ t \end{matrix}$
for some covariance matrix $Σ = (σ_{i j})$.

The difference between our model and the one from Evans et al. (2013) is that we added density-dependent feedback through the $x_{i} b_{i} (x_{i})$ terms.
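The two infinitesimal characteristics above translate directly into code. The sketch below is illustrative only: the function names and the list-of-lists matrix representation are assumptions of this example, and the competition functions $b_{i}$ are passed in as ordinary Python callables.

```python
def mean_increment_rate(x, a, b, D):
    """Conditional mean of X_i(t + dt) - X_i(t), per unit of time, given X(t) = x.

    x : list of abundances, a : list of per-capita growth rates,
    b : list of callables b_i, D : matrix of dispersal rates (only D[i][j], i != j, is used).
    """
    n = len(x)
    return [
        a[i] * x[i] - x[i] * b[i](x[i])
        + sum(x[j] * D[j][i] - x[i] * D[i][j] for j in range(n) if j != i)
        for i in range(n)
    ]


def covariance_rate(x, Sigma):
    """Conditional covariance of the increments, per unit of time: sigma_ij * x_i * x_j."""
    n = len(x)
    return [[Sigma[i][j] * x[i] * x[j] for j in range(n)] for i in range(n)]


# Pure dispersal (a = 0, b = 0): what leaves one patch arrives in the other.
no_competition = [lambda u: 0.0, lambda u: 0.0]
rates = mean_increment_rate([1.0, 2.0], [0.0, 0.0], no_competition, [[0, 2], [3, 0]])
```

In the pure-dispersal example the entries of `rates` sum to zero, reflecting the fact that dispersal only redistributes individuals among patches.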

We work on a complete probability space $(Ω, F, \{F_{t}\}_{t \geq 0}, P)$ with filtration $\{F_{t}\}_{t \geq 0}$ satisfying the usual conditions. We consider the system

\begin{matrix} d X_{i} (t) = \left( X_{i} (t) (a_{i} - b_{i} (X_{i} (t))) + \sum_{j = 1}^{n} D_{j i} X_{j} (t) \right) d t + X_{i} (t) \, d E_{i} (t), \quad i = 1, \dots, n, \end{matrix}

2.1

where $D_{i j} \geq 0$ for $j \neq i$ is the per-capita rate at which the population in patch $i$ disperses to patch $j$; $D_{i i} = - \sum_{j \neq i} D_{i j}$, so that $- D_{i i}$ is the total per-capita emigration rate out of patch $i$; $E (t) = {(E_{1} (t), \dots, E_{n} (t))}^{⊤} = Γ^{⊤} B (t)$; $Γ$ is an $n \times n$ matrix such that $Γ^{⊤} Γ = Σ = {(σ_{i j})}_{n \times n}$; and $B (t) = {(B_{1} (t), \dots, B_{n} (t))}^{⊤}$ is a vector of independent standard Brownian motions adapted to the filtration $\{F_{t}\}_{t \geq 0}$. Throughout the paper, we work with the following assumption regarding the growth of the intraspecific competition rates.

Assumption 2.1

For each $i = 1, \dots, n$ the function $b_{i} : R_{+} \to R$ is locally Lipschitz and vanishes at 0. Furthermore, there are $M_{b} > 0$ and $γ_{b} > 0$ such that

\begin{matrix} \frac{\sum_{i = 1}^{n} x_{i} (b_{i} (x_{i}) - a_{i})}{\sum_{i = 1}^{n} x_{i}} > γ_{b} \quad \text{for any } x_{i} \geq 0, \ i = 1, \dots, n, \text{ satisfying } \sum_{i = 1}^{n} x_{i} \geq M_{b} . \end{matrix}

2.2

Remark 2.1

Note that if we set $x_{j} = x \geq M_{b}$ and $x_{i} = 0, i \neq j$ , we get from (2.2) that

\begin{matrix} b_{j} (x) - a_{j} > γ_{b}, x \geq M_{b}, j = 1, \dots, n . \end{matrix}

Remark 2.2

Note that condition (2.2) is biologically reasonable because it holds if the $b_{i}$ ’s are sufficiently large for large $x_{i}$ ’s. We provide some simple scenarios when Assumption 2.1 is satisfied.

(a) Suppose $b_{i} : [0, \infty) \to [0, \infty)$, $i = 1, \dots, n$, are locally Lipschitz and vanish at 0. Assume that there exist $γ_{b} > 0$ and ${\tilde{M}}_{b} > 0$ such that
$\begin{matrix} \inf_{x \in [{\tilde{M}}_{b}, \infty)} b_{i} (x) - a_{i} - γ_{b} > 0, \quad i = 1, \dots, n . \end{matrix}$
It is then easy to show that Assumption 2.1 holds.
(b) Particular cases of (a) are, for example, any $b_{i} : R_{+} \to R$ that are locally Lipschitz, vanish at 0 and satisfy $\lim_{x \to \infty} b_{i} (x) = \infty$.
(c) One natural choice for the competition functions, which is widely used throughout the literature, is $b_{i} (x) = κ_{i} x$, $x \in (0, \infty)$, for some $κ_{i} > 0$. In this case the competition terms become $- x_{i} b_{i} (x_{i}) = - κ_{i} x_{i}^{2}$.

Remark 2.3

Note that if we have the SDE

\begin{matrix} d X_{i} (t) = \left( X_{i} (t) f_{i} (X_{i} (t)) + \sum_{j = 1}^{n} D_{j i} X_{j} (t) \right) d t + X_{i} (t) \, d E_{i} (t), \quad i = 1, \dots, n, \end{matrix}

2.3

where the $f_{i}$ are locally Lipschitz, this can always be rewritten in the form (2.1) with

\begin{matrix} a_{i} : = f_{i} (0) \quad \text{and} \quad b_{i} (x) : = f_{i} (0) - f_{i} (x), \quad i = 1, \dots, n . \end{matrix}

Therefore, our setting is in fact very general and incorporates both nonlinear growth rates and nonlinear competition terms.
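The rewriting in Remark 2.3 is mechanical, as the following small sketch shows. The helper name `split_growth` and the example growth function are hypothetical and serve only as an illustration.

```python
def split_growth(f):
    """Decompose a per-capita growth function f into (a, b) as in Remark 2.3.

    a = f(0) is the growth rate at zero density and b(x) = f(0) - f(x)
    is the induced competition term, so that f(x) = a - b(x) and b(0) = 0.
    """
    a = f(0.0)

    def b(x):
        return a - f(x)

    return a, b


# Hypothetical example: a linearly decreasing per-capita growth rate.
a, b = split_growth(lambda x: 1.5 - 0.5 * x)
```

By construction $b (0) = 0$, and $x (a - b (x)) = x f (x)$ for every $x$, so the drift of (2.3) is unchanged by the decomposition.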

The drift $\tilde{f} (x) = ({\tilde{f}}_{1} (x), \dots, {\tilde{f}}_{n} (x))$, where ${\tilde{f}}_{i} (x) = x_{i} (a_{i} - b_{i} (x_{i})) + \sum_{j = 1}^{n} D_{j i} x_{j}$, is sometimes said to be cooperative. This is because ${\tilde{f}}_{i} (x) \leq {\tilde{f}}_{i} (y)$ whenever $x, y \in R_{+}^{n}$ satisfy $x_{i} = y_{i}$ and $x_{j} \leq y_{j}$ for $j \neq i$. A distinctive property of cooperative systems is that comparison arguments are generally applicable. We refer to Chueshov (2002) for more details.

Remark 2.4

If the dispersal matrix $(D_{i j})$ has a normalized dominant left eigenvector $α = (α_{1}, \dots, α_{n})$ then one can show that the system

\begin{matrix} d X_{i} (t) = \left( X_{i} (t) (a_{i} - b_{i} X_{i} (t)) + δ \sum_{j = 1}^{n} D_{j i} X_{j} (t) \right) d t + X_{i} (t) \, d E_{i} (t), \quad i = 1, \dots, n, \end{matrix}

converges as $δ \to \infty$ to a system $({\tilde{X}}_{1} (t), \dots, {\tilde{X}}_{n} (t))$ for which

\begin{matrix} {\tilde{X}}_{i} (t) = α_{i} \tilde{X} (t), t \geq 0, i = 1, \dots, n, \end{matrix}

where $\tilde{X} (t) = {\tilde{X}}_{1} (t) + \dots + {\tilde{X}}_{n} (t)$ and $\tilde{X}$ is an autonomous Markov process that satisfies the SDE

\begin{matrix} d \tilde{X} (t) = \tilde{X} (t) \sum_{i = 1}^{n} α_{i} (a_{i} - b_{i} α_{i} \tilde{X} (t)) d t + \tilde{X} (t) \sum_{i = 1}^{n} α_{i} d E_{i} (t) . \end{matrix}

As such, our system is a general version of the system treated in Evans et al. (2015). One can recover the system from Evans et al. (2015) as an infinite dispersion limit of ours.

We denote by $X^{x} (t)$ the solution of (2.1) started at $X (0) = x \in R_{+}^{n}$ . Following Evans et al. (2013), we call matrices D with zero row sums and non-negative off-diagonal entries dispersal matrices. If D is a dispersal matrix, then it is a generator of a continuous-time Markov chain. Define $P_{t} : = exp (t D), t \geq 0$ . Then $P_{t}, t \geq 0$ is a matrix with non-negative entries that gives the transition probabilities of a Markov chain: The (i, j)th entry of $P_{t}$ gives the proportion of the population that was initially in patch i at time 0 but has dispersed to patch j at time t and D is the generator of this Markov chain. If one wants to include mortality induced because of dispersal, one can add cemetery patches in which dispersing individuals enter and experience a killing rate before moving to their final destination. Our model is a density-dependent generalization of the one by Evans et al. (2013). We are able to prove that the linearization of the density-dependent model fully determines the non-linear density-dependent behavior, a fact which was conjectured by Evans et al. (2013). Furthermore, we prove stronger convergence results and thus extend the work of Evans et al. (2013). Analogous results for discrete-time versions of the model have been studied by Benaïm and Schreiber (2009) for discrete-space and by Hardin et al. (1988a, b) for continuous-space.
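To make the interpretation of $P_{t} = exp (t D)$ concrete, the following sketch computes the matrix exponential of a small dispersal matrix. It uses a plain scaling-and-squaring scheme with a truncated Taylor series, which is an implementation choice of this example (any standard matrix-exponential routine would do); the function names and default parameters are illustrative.

```python
def mat_mul(A, B):
    """Product of two square matrices stored as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def expm(D, t, squarings=10, terms=12):
    """Approximate exp(t D) by scaling and squaring with a truncated Taylor series."""
    n = len(D)
    scale = 2.0 ** squarings
    M = [[t * D[i][j] / scale for j in range(n)] for i in range(n)]
    result = [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for k in range(1, terms + 1):
        term = [[v / k for v in row] for row in mat_mul(term, M)]  # M^k / k!
        result = [[result[i][j] + term[i][j] for j in range(n)] for i in range(n)]
    for _ in range(squarings):
        result = mat_mul(result, result)  # undo the scaling: exp(M)^(2^squarings)
    return result


# Two patches with dispersal rate 2 from patch 1 to 2 and rate 3 from patch 2 to 1.
D = [[-2.0, 2.0], [3.0, -3.0]]
P = expm(D, t=0.7)
```

Each row of `P` sums to one and all entries are positive, so row i can be read as the distribution over patches at time t of an individual that started in patch i.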

We will work under the following assumptions.

Assumption 2.2

The dispersal matrix D is irreducible.

Assumption 2.3

The covariance matrix $Σ$ is non-singular.

Assumption 2.2 is equivalent to forcing the entries of the matrix $P_{t} = exp (t D)$ to be strictly positive for all $t > 0$ . This means that it is possible for the population to disperse between any two patches. We can always reduce our problem to this setting by working with the maximal irreducible subsets of patches. Assumption 2.3 says that our randomness is non-degenerate, and thus truly n-dimensional. We show in Appendix C how to get the desired results when Assumption 2.3 does not hold.
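Irreducibility of a given dispersal matrix can be checked by a graph search: draw an arrow from patch i to patch j whenever $D_{i j} > 0$ and test whether every patch can reach every other patch. The sketch below is a hedged illustration of this check; the function name is hypothetical.

```python
def is_irreducible(D):
    """Return True if the directed graph with edges i -> j for D[i][j] > 0 is strongly connected."""
    n = len(D)

    def reachable(start, forward):
        seen = {start}
        stack = [start]
        while stack:
            i = stack.pop()
            for j in range(n):
                rate = D[i][j] if forward else D[j][i]
                if j != i and rate > 0 and j not in seen:
                    seen.add(j)
                    stack.append(j)
        return seen

    # Strongly connected: patch 0 reaches every patch, and every patch reaches patch 0.
    return len(reachable(0, True)) == n and len(reachable(0, False)) == n


two_way = is_irreducible([[-2.0, 2.0], [3.0, -3.0]])   # dispersal in both directions
one_way = is_irreducible([[-2.0, 2.0], [0.0, 0.0]])    # nothing ever leaves patch 2
```

When the check fails, the strongly connected components of the same graph give the maximal irreducible subsets of patches mentioned above.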

Throughout the paper we set $R_{+}^{n} : = {[0, \infty)}^{n}$ and $R_{+}^{n, \circ} : = {(0, \infty)}^{n}$ . We define the total abundance of our population at time $t \geq 0$ via $S (t) : = \sum_{i = 1}^{n} X_{i} (t)$ and let $Y_{i} (t) : = \frac{X_{i} (t)}{S (t)}$ be the proportion of the total population that is in patch i at time $t \geq 0$ . Set $Y (t) = (Y_{1} (t), \dots, Y_{n} (t))$ . An application of Itô’s lemma to (2.1) yields

\begin{aligned} d Y_{i} (t) = {} & Y_{i} (t) \left( a_{i} - \sum_{j = 1}^{n} a_{j} Y_{j} (t) - b_{i} (S (t) Y_{i} (t)) + \sum_{j = 1}^{n} Y_{j} (t) b_{j} (S (t) Y_{j} (t)) \right) d t \\ & + \sum_{j = 1}^{n} D_{j i} Y_{j} (t) d t + Y_{i} (t) \left( \sum_{j, k = 1}^{n} σ_{k j} Y_{k} (t) Y_{j} (t) - \sum_{j = 1}^{n} σ_{i j} Y_{j} (t) \right) d t \\ & + Y_{i} (t) \left[ d E_{i} (t) - \sum_{j = 1}^{n} Y_{j} (t) d E_{j} (t) \right], \\ d S (t) = {} & S (t) \left( \sum_{i = 1}^{n} (a_{i} Y_{i} (t) - Y_{i} (t) b_{i} (S (t) Y_{i} (t))) \right) d t + S (t) \sum_{i = 1}^{n} Y_{i} (t) d E_{i} (t) . \end{aligned}

2.4

We can rewrite (2.4) in the following compact equation for $(Y (t), S (t))$ where $b (x) = (b_{1} (x_{1}), \dots, b_{n} (x_{n}))$ .

\begin{aligned} d Y (t) = {} & \left( \operatorname{diag} (Y (t)) - Y (t) Y^{⊤} (t) \right) Γ^{⊤} d B (t) + D^{⊤} Y (t) d t \\ & + \left( \operatorname{diag} (Y (t)) - Y (t) Y^{⊤} (t) \right) \left( a - Σ Y (t) - b (S (t) Y (t)) \right) d t, \\ d S (t) = {} & S (t) {[a - b (S (t) Y (t))]}^{⊤} Y (t) d t + S (t) {Y (t)}^{⊤} Γ^{⊤} d B (t), \end{aligned}

2.5

where $Y (t)$ lies in the simplex $Δ : = \{(y_{1}, \dots, y_{n}) \in R_{+}^{n} : y_{1} + \dots + y_{n} = 1\}$. Let $Δ^{\circ} = \{(y_{1}, \dots, y_{n}) \in R_{+}^{n, \circ} : y_{1} + \dots + y_{n} = 1\}$ be the interior of $Δ$.

Consider Equation (2.5) on the boundary $\{(y, s) : y \in Δ, s = 0\}$ (that is, we set $S (t) \equiv 0$ in the equation for $Y (t)$). We obtain the following system

\begin{aligned} d \tilde{Y} (t) = {} & \left( \operatorname{diag} (\tilde{Y} (t)) - \tilde{Y} (t) {\tilde{Y}}^{⊤} (t) \right) Γ^{⊤} d B (t) + D^{⊤} \tilde{Y} (t) d t \\ & + \left( \operatorname{diag} (\tilde{Y} (t)) - \tilde{Y} (t) {\tilde{Y}}^{⊤} (t) \right) (a - Σ \tilde{Y} (t)) d t \end{aligned}

2.6

on the simplex $Δ$ . We also introduce the linearized version of (2.1), where the competition terms $b_{i} (x_{i})$ are all set to 0,

\begin{matrix} d X_{i} (t) = \left( X_{i} (t) a_{i} + \sum_{j = 1}^{n} D_{j i} X_{j} (t) \right) d t + X_{i} (t) \, d E_{i} (t), \quad i = 1, \dots, n, \end{matrix}

2.7

and let $S (t) = \sum_{i = 1}^{n} X_{i} (t)$ be the total population abundance, in the absence of competition. The processes $(X_{1} (t), \dots, X_{n} (t)), \tilde{Y} (t)$ and $S (t)$ have been studied by Evans et al. (2013).

Evans et al. (2013, Proposition 3.1) proved that the process ${(\tilde{Y} (t))}_{t \geq 0}$ is an irreducible Markov process, which has the strong Feller property and admits a unique invariant probability measure $ν^{*}$ on $Δ$. Let $\tilde{Y} (\infty)$ be a random variable on $Δ$ with distribution $ν^{*}$. We define

\begin{matrix} r : = & \int_{Δ} (a^{⊤} y - \frac{1}{2} y^{⊤} Σ y) ν^{*} (d y) \\ = & \sum_{i} a_{i} E [{\tilde{Y}}_{i} (\infty)] - \frac{1}{2} E [\sum_{i j} σ_{i j} {\tilde{Y}}_{i} (\infty) {\tilde{Y}}_{j} (\infty)] \end{matrix}

2.8

Remark 2.5

We note that r is the stochastic growth rate (or Lyapunov exponent) of the total population $S (t)$ in the absence of competition. That is,

\begin{matrix} P \left\{ \lim_{t \to \infty} \frac{\ln S^{x} (t)}{t} = r \right\} = 1 . \end{matrix}

The expression (2.8) for r coincides with the one derived by Evans et al. (2013).

We use superscripts to denote the starting points of our processes. For example $(Y^{y, s} (t), S^{y, s} (t))$ denotes the solution of (2.4) with $(Y (0), S (0)) = (y, s) \in Δ \times (0, \infty)$ . Fix $x \in R_{+}^{n}$ and define the normalized occupation measures,

\begin{matrix} Π_{t}^{(x)} (\cdot) = \frac{1}{t} \int_{0}^{t} 1_{\{X^{x} (u) \in \cdot\}} d u . \end{matrix}

2.9

These random measures describe the distribution of the observed population dynamics up to time t. If we define the sets

\begin{matrix} S_{η} : = \{x = (x_{1}, \dots, x_{n}) \in R_{+}^{n, \circ} : | x_{i} | \leq η \text{ for some } i = 1, \dots, n\}, \end{matrix}

then $Π_{t}^{(x)} (S_{η})$ is the fraction of the time in the interval [0, t] that the total abundance of some patch is less than $η$ given that our population starts at $X (0) = x$ .
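For a trajectory recorded on an evenly spaced time grid, $Π_{t}^{(x)} (S_{η})$ can be approximated by a simple average over the samples. The sketch below illustrates this; the function name and the toy path are hypothetical, and the time integral in (2.9) is replaced by a sum over grid points, which is an approximation.

```python
def occupation_fraction(path, eta):
    """Approximate the occupation measure of S_eta along a sampled path.

    path : list of abundance vectors sampled at evenly spaced times,
    eta  : threshold; a sample counts if some patch abundance is at most eta.
    """
    hits = sum(1 for x in path if any(abs(xi) <= eta for xi in x))
    return hits / len(path)


# Toy path with four samples of a two-patch population.
toy_path = [[1.0, 2.0], [0.05, 3.0], [2.0, 0.01], [1.5, 1.5]]
fraction = occupation_fraction(toy_path, eta=0.1)
```

In the toy path two of the four samples have a patch abundance below the threshold, so the returned fraction is 0.5.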

Definition 2.1

One can define a distance on the space of probability measures living on the Borel measurable subsets of $R_{+}^{n}$ , that is on the space $(R_{+}^{n}, B (R_{+}^{n}))$ . This is done by defining ${‖ \cdot, \cdot ‖}_{TV}$ , the total variation norm, via

\begin{matrix} {‖ μ, ν ‖}_{TV} : = \sup_{A \in B (R_{+}^{n})} | μ (A) - ν (A) | . \end{matrix}
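The supremum in this definition is easiest to visualize for probability vectors on a finite set, a simplified stand-in for measures on $R_{+}^{n}$ that is used here only for illustration. In that case the supremum over all events can be computed by brute force and agrees with half of the $\ell^{1}$ distance. The function names below are hypothetical.

```python
from itertools import combinations


def tv_by_supremum(p, q):
    """sup over all events A of |p(A) - q(A)| for two distributions on {0, ..., n-1}."""
    n = len(p)
    best = 0.0
    for size in range(n + 1):
        for A in combinations(range(n), size):
            best = max(best, abs(sum(p[i] for i in A) - sum(q[i] for i in A)))
    return best


def tv_half_l1(p, q):
    """Equivalent closed form on a finite set: half of the l1 distance."""
    return 0.5 * sum(abs(pi - qi) for pi, qi in zip(p, q))


p = [0.5, 0.25, 0.25]
q = [0.25, 0.25, 0.5]
distance = tv_by_supremum(p, q)
```

The brute-force version enumerates all $2^{n}$ events and is only meant to mirror the definition; the closed form is what one would use in practice.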

Theorem 2.1

Suppose that Assumptions 2.2 and 2.3 hold and that $r > 0$ . The process $X (t) = {(X_{1} (t), \dots, X_{n} (t))}_{t \geq 0}$ has a unique invariant probability measure $π$ on $R_{+}^{n, \circ}$ that is absolutely continuous with respect to the Lebesgue measure and for any $q^{*} > 0$ ,

\begin{matrix} \lim_{t \to \infty} t^{q^{*}} {‖ P_{X} (t, x, \cdot) - π (\cdot) ‖}_{TV} = 0, \quad x \in R_{+}^{n, \circ}, \end{matrix}

2.10

where $P_{X} (t, x, \cdot)$ is the transition probability of ${(X (t))}_{t \geq 0}$. Moreover, for any initial value $x \in R_{+}^{n} \setminus \{0\}$ and any $π$-integrable function f we have

\begin{matrix} P \left\{ \lim_{T \to \infty} \frac{1}{T} \int_{0}^{T} f (X^{x} (t)) d t = \int_{R_{+}^{n, \circ}} f (u) π (d u) \right\} = 1 . \end{matrix}

2.11

Remark 2.6

Theorem 2.1 is a direct consequence of Theorem A.2, which will be proved in Appendix A. As a corollary we get the following result.

Definition 2.2

Following Roth and Schreiber (2014), we say that the model (2.1) is stochastically persistent if for all $ε > 0$ , there exists $η > 0$ such that with probability one,

\begin{matrix} Π_{t}^{(x)} (S_{η}) \leq ε \end{matrix}

for t sufficiently large and $x \in S_{η} \setminus \{0\}$.

Corollary 2.1

If Assumptions 2.2 and 2.3 hold, and $r > 0$ , then the process $X (t)$ is stochastically persistent.

Proof

By Theorem 2.1, we have that for all $x \in R_{+}^{n, \circ}$ ,

\begin{matrix} P \left\{ Π_{t}^{(x)} \Rightarrow π \text{ as } t \to \infty \right\} = 1 . \end{matrix}

Since $π$ is supported on $R_{+}^{n, \circ}$ , we get the desired result. $□$

Biological interpretation of Theorem 2.1 The quantity r is the Lyapunov exponent or stochastic growth rate of the total population process ${(S (t))}_{t \geq 0}$ in the absence of competition. This number describes the long-term growth rate of the population in the presence of a stochastic environment. According to (2.8) r can be written as the difference $\bar{μ} - \frac{1}{2} {\bar{σ}}^{2}$ where

$\bar{μ}$ is the average of per-capita growth rates with respect to the asymptotic distribution $\tilde{Y} (\infty)$ of the population in the absence of competition.
${\bar{σ}}^{2}$ is the infinitesimal variance of the environmental stochasticity averaged according to the asymptotic distribution of the population in the absence of competition.

We note by (2.8) that r depends on the dispersal matrix, the growth rates at 0 and the covariance matrix of the environmental noise. As such, the stochastic growth rate can change due to the dispersal strategy or environmental fluctuations.

When the stochastic growth rate of the population in absence of competition is strictly positive (i.e. $r > 0$ ) our population is persistent in a strong sense: for any starting point $(X_{1} (0), \dots, X_{n} (0)) = (x_{1}, \dots, x_{n}) \in R_{+}^{n, \circ}$ the distribution of the population densities at time t in the n patches $(X_{1} (t), \dots, X_{n} (t))$ converges as $t \to \infty$ to the unique probability measure $π$ that is supported on $R_{+}^{n, \circ}$ .

Definition 2.3

We say the population of patch i goes extinct if for all $x \in R_{+}^{n} \setminus \{0\}$

\begin{matrix} P \left\{ \lim_{t \to \infty} X_{i}^{x} (t) = 0 \right\} = 1 . \end{matrix}

We say the population goes extinct if the populations from all the patches go extinct, that is, if for all $x \in R_{+}^{n} \setminus \{0\}$

\begin{matrix} P \left\{ \lim_{t \to \infty} X^{x} (t) = 0 \right\} = 1 . \end{matrix}

Theorem 2.2

Suppose that Assumptions 2.2 and 2.3 hold and that $r < 0$ . Then for any $i = 1, \dots, n$ and any $x = (x_{1}, \dots, x_{n}) \in R_{+}^{n}$ ,

\begin{matrix} P \left\{ \lim_{t \to \infty} \frac{\ln X_{i}^{x} (t)}{t} = r \right\} = 1 . \end{matrix}

2.12

Biological interpretation of Theorem 2.2 If the stochastic growth rate of the population in the absence of competition is negative (i.e. $r < 0$), the population densities of the n patches $(X_{1} (t), \dots, X_{n} (t))$ go extinct exponentially fast, at rate $r$, with probability 1 for any starting point $(X_{1} (0), \dots, X_{n} (0)) = (x_{1}, \dots, x_{n}) \in R_{+}^{n}$.

In Appendix A, we prove Theorem 2.1 while Theorem 2.2 is proven in Appendix B.

Degenerate noise

We consider the evolution of the process ${(X (t))}_{t \geq 0}$ given by (2.1) when Assumption 2.3 does not hold. If the covariance matrix $Σ = Γ^{⊤} Γ$ associated with the Brownian motions $E (t) = {(E_{1} (t), \dots, E_{n} (t))}^{⊤} = Γ^{⊤} B (t)$ is singular, the environmental noise driving our SDEs has a lower dimension than the dimension n of the underlying state space. It then becomes much more difficult to prove that our process is Feller and irreducible. In order to verify the Feller property, we have to verify the so-called Hörmander condition, and to verify the irreducibility, we have to investigate the controllability of a related control system.

We are able to prove the following extinction and persistence results.

Theorem 2.3

Assume that $\tilde{Y} (t)$ has a unique invariant probability measure $ν^{*}$ . Define r by (2.8). Suppose that $r < 0$ . Then for any $i = 1, \dots, n$ and any $x = (x_{1}, \dots, x_{n}) \in R_{+}^{n}$

\begin{matrix} P \left\{ \lim_{t \to \infty} \frac{\ln X_{i}^{x} (t)}{t} = r \right\} = 1 . \end{matrix}

2.13

In particular, for any $i = 1, \dots, n$ and any $x = (x_{1}, \dots, x_{n}) \in R_{+}^{n}$

\begin{matrix} P \left\{ \lim_{t \to \infty} X_{i}^{x} (t) = 0 \right\} = 1 . \end{matrix}

Remark 2.7

The extra assumption in this setting is that the Markov process describing the proportions of the populations of the patches evolving without competition, $\tilde{Y} (t)$ , has a unique invariant probability measure. In fact, we conjecture that $\tilde{Y} (t)$ always has a unique invariant probability measure. We were able to prove this conjecture when $n = 2$ —see Remark 3.1 for details.

Theorem 2.4

Assume that $\tilde{Y} (t)$ has a unique invariant probability measure $ν^{*}$. Define r by (2.8). Suppose that Assumption 2.2 holds and that $r > 0$. Assume further that there is a sufficiently large $T > 0$ such that the Markov chain ${(Y (k T), S (k T))}_{k \in N}$ is irreducible and aperiodic, and that every compact set in $Δ^{\circ} \times (0, \infty)$ is petite for this Markov chain.

Then the process $X (t) = {(X_{1} (t), \dots, X_{n} (t))}_{t \geq 0}$ has a unique invariant probability measure $π$ on $R_{+}^{n, \circ}$ that is absolutely continuous with respect to the Lebesgue measure and for any $q^{*} > 0$,

\begin{matrix} \lim_{t \to \infty} t^{q^{*}} {‖ P_{X} (t, x, \cdot) - π (\cdot) ‖}_{TV} = 0, \quad x \in R_{+}^{n, \circ}, \end{matrix}

2.14

where ${‖ \cdot, \cdot ‖}_{TV}$ is the total variation norm and $P_{X} (t, x, \cdot)$ is the transition probability of ${(X (t))}_{t \geq 0}$. Moreover, for any initial value $x \in R_{+}^{n} \setminus \{0\}$ and any $π$-integrable function f, we have

\begin{matrix} P \left\{ \lim_{T \to \infty} \frac{1}{T} \int_{0}^{T} f (X^{x} (t)) d t = \int_{R_{+}^{n, \circ}} f (u) π (d u) \right\} = 1 . \end{matrix}

2.15

Remark 2.8

We require as before that $\tilde{Y} (t)$ has a unique invariant probability measure. Furthermore, we require that there exists some time $T > 0$ such that if we observe the process $(Y (t), S (t))$ at the fixed times $T, 2 T, 3 T, \dots, k T, \dots$ it is irreducible (loosely speaking this means that the process can visit any state) and aperiodic (returns to a given state occur at irregular times).

Case study: $n = 2$

Note that the two theorems above require some extra assumptions. We show how one can express these conditions explicitly as functions of the various parameters of the model. For the sake of a clean exposition we chose to fully treat the case when $n = 2$ and $b_{i} (x) = b_{i} x, x \geq 0, i = 1, 2$ for some $b_{1}, b_{2} > 0$ (each specific case would have to be studied separately as the computations change in each setting). As a result, (2.1) becomes

\begin{matrix} \{\begin{matrix} d X_{1} (t) = (X_{1} (t) (a_{1} - b_{1} X_{1} (t)) - α X_{1} (t) + β X_{2} (t)) d t + σ_{1} X_{1} (t) d B (t) \\ d X_{2} (t) = (X_{2} (t) (a_{2} - b_{2} X_{2} (t)) + α X_{1} (t) - β X_{2} (t)) d t + σ_{2} X_{2} (t) d B (t), \end{matrix} \end{matrix}

where $σ_{1}, σ_{2}$ are non-zero constants and ${(B (t))}_{t \geq 0}$ is a one-dimensional Brownian motion. Here $Σ = {(σ_{i} σ_{j})}_{i, j = 1, 2}$, so for $y = (y_{1}, 1 - y_{1}) \in Δ$ one has $y^{⊤} Σ y = {(σ_{2} + (σ_{1} - σ_{2}) y_{1})}^{2}$, and by (2.8) the Lyapunov exponent can be expressed as (see Remark 3.1)

\begin{aligned} r = {} & a_{2} - \frac{σ_{2}^{2}}{2} + (a_{1} - a_{2} - σ_{2} (σ_{1} - σ_{2})) \int_{0}^{1} y ρ_{1}^{*} (y) d y \\ & - \frac{{(σ_{1} - σ_{2})}^{2}}{2} \int_{0}^{1} y^{2} ρ_{1}^{*} (y) d y \end{aligned}

2.16

where $ρ_{1}^{*}$ is given in (3.5) later.

If $σ_{1} = σ_{2} = : σ$ , one has (see Remark 3.1)

\begin{matrix} r = a_{2} - \frac{σ^{2}}{2} + (a_{1} - a_{2}) y^{⋆} . \end{matrix}

2.17

Theorem 2.5

Define r by (2.16) if $σ_{1} \neq σ_{2}$ and by (2.17) if $σ_{1} = σ_{2} = σ$ . If $r < 0$ then for any $i = 1, 2$ and any $x = (x_{1}, x_{2}) \in R_{+}^{2}$

\begin{matrix} P \{lim_{t \to \infty} \frac{ln X_{i}^{x} (t)}{t} = r\} = 1 . \end{matrix}

2.18

Theorem 2.6

Suppose that $σ_{1} \neq σ_{2}$ or $β + (b_{2} / b_{1}) (a_{1} - a_{2} - α + β) - α {(b_{2} / b_{1})}^{2} \neq 0$ . Define r as in Theorem 2.5. If $r > 0$ then the conclusion of Theorem 2.4 holds.

Remark 2.9

Once again the parameter r tells us when the population goes extinct and when it persists. To obtain the conclusion of Theorem 2.4 when $r > 0$ , we need $σ_{1} \neq σ_{2}$ or $β + (b_{2} / b_{1}) (a_{1} - a_{2} - α + β) - α {(b_{2} / b_{1})}^{2} \neq 0 .$ The condition $σ_{1} \neq σ_{2}$ tells us that the noise must at least differ through its variance. If $σ_{1} = σ_{2}$ then we require

\begin{matrix} a_{1} + β \frac{b_{1} + b_{2}}{b_{2}} \neq a_{2} + α \frac{b_{1} + b_{2}}{b_{1}} . \end{matrix}

The term $β \frac{b_{1} + b_{2}}{b_{2}}$ measures the dispersion rate of individuals from patch 2 to patch 1 averaged by the inverse relative competition strength of patch 2. In particular, if $b_{1} = b_{2}$ we have that

\begin{matrix} 2 (β - α) \neq a_{2} - a_{1}, \end{matrix}

that is twice the difference of the dispersal rates cannot equal the difference of the growth rates. The dynamics of the system is very different if these conditions do not hold (see Sect. 3.2 and Theorem 2.7).
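The hypothesis of Theorem 2.6 discussed in Remark 2.9 can be checked numerically. The following Python sketch is only an illustration: the function names and the tolerance `tol` are assumptions introduced here and do not come from the paper. It evaluates the quantity $β + (b_{2} / b_{1}) (a_{1} - a_{2} - α + β) - α {(b_{2} / b_{1})}^{2}$ together with the rewritten form from Remark 2.9; the two differ only by the positive factor $b_{2} / b_{1}$ .

```python
def theorem_2_6_quantity(a1, a2, b1, b2, alpha, beta):
    """beta + (b2/b1)(a1 - a2 - alpha + beta) - alpha (b2/b1)^2."""
    ratio = b2 / b1
    return beta + ratio * (a1 - a2 - alpha + beta) - alpha * ratio ** 2


def remark_2_9_gap(a1, a2, b1, b2, alpha, beta):
    """(a1 + beta (b1+b2)/b2) - (a2 + alpha (b1+b2)/b1)."""
    return (a1 + beta * (b1 + b2) / b2) - (a2 + alpha * (b1 + b2) / b1)


def persistence_hypothesis_holds(a1, a2, b1, b2, alpha, beta,
                                 sigma1, sigma2, tol=1e-12):
    """True if sigma1 != sigma2 or the quantity above is non-zero.

    `tol` is a hypothetical floating-point tolerance, not part of the theorem.
    """
    if abs(sigma1 - sigma2) > tol:
        return True
    return abs(theorem_2_6_quantity(a1, a2, b1, b2, alpha, beta)) > tol
```

For instance, with $b_{1} = b_{2}$ , $σ_{1} = σ_{2}$ and $2 (β - α) = a_{2} - a_{1}$ the function `persistence_hypothesis_holds` returns `False`, which is the setting of Theorem 2.7.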

Theorem 2.7

Suppose that $σ_{1} = σ_{2} = σ, b_{1} = b_{2} = : b$ and $2 (β - α) = a_{2} - a_{1}$ . In this setting one can show that the stochastic growth rate is given by $r = a_{1} - α + β - \frac{σ^{2}}{2}$ . Assume that $(X_{1} (0), X_{2} (0)) = x = (x_{1}, x_{2}) \in R_{+}^{2, \circ}$ and let U(t) be the solution to

\begin{matrix} d U (t) = U (t) (a_{1} - α + β - b U (t)) d t + σ U (t) d B (t), U (0) = x_{2} . \end{matrix}

Then we get the following results

If $x_{1} = x_{2}$ then $P (X_{1}^{x} (t) = X_{2}^{x} (t) = U (t), t \geq 0) = 1 .$
If $x_{1} \neq x_{2}$ then $P (X_{1}^{x} (t) \neq X_{2}^{x} (t), t \geq 0) = 1 .$
If $r < 0$ then $X_{1} (t)$ and $X_{2} (t)$ converge to 0 exponentially fast. If $r > 0$ then
$\begin{matrix} P \{lim_{t \to \infty} \frac{X_{1}^{x} (t)}{U^{x} (t)} = lim_{t \to \infty} \frac{X_{2}^{x} (t)}{U^{x} (t)} = 1\} = 1 . \end{matrix}$
Thus, both $X_{1} (t)$ and $X_{2} (t)$ converge to a unique invariant probability measure $ρ$ on $(0, \infty)$ , which is the invariant probability measure of U(t). The invariant probability measure of ${(X_{1} (t), X_{2} (t))}_{t \geq 0}$ is concentrated on the one-dimensional manifold $\{x = (x_{1}, x_{2}) \in R_{+}^{2, \circ} : x_{1} = x_{2}\}$ .

The proof of Theorem 2.7 is presented in Sect. 3.2.

Robust persistence and extinction

The model we work with is an approximation of the real biological models. As a result, it is relevant to see if ‘close models’ behave similarly to ours. This reduces to studying the robustness of our system. Consider the process

\begin{matrix} d {\hat{X}}_{i} = {\hat{X}}_{i} ({\hat{a}}_{i} - {\hat{b}}_{i} (X_{i})) d t + {\hat{D}}_{i j} (\hat{X}) {\hat{X}}_{i} d t + {\hat{X}}_{i} \hat{Γ} (\hat{X}) d B (t) \end{matrix}

2.19

where $\hat{b} (\cdot), \hat{D} (\cdot), \hat{Γ} (\cdot)$ are locally Lipschitz functions and ${\hat{D}}_{i j} (x) \geq 0$ for all $x \in R_{+}^{n}, i \neq j$ and ${\hat{D}}_{i i} (x) = - \sum_{j \neq i} {\hat{D}}_{i j} (x) .$ If there exists $θ > 0$ such that

\begin{matrix} sup_{x \in R_{+}^{n, \circ}} \{‖ a - \hat{a} ‖, ‖ b (x) - \hat{b} (x) ‖, ‖ D - \hat{D} (x) ‖, ‖ Γ - \hat{Γ} (x) ‖\} < θ, \end{matrix}

2.20

then we call $\hat{X}$ a $θ$ -perturbation of $X$ .

Theorem 2.8

Suppose that the dynamics of ${(X (t))}_{t \geq 0}$ satisfy the assumptions of Theorem 2.1. Then there exists $θ > 0$ such that any $θ$ -perturbation ${(\hat{X} (t))}_{t \geq 0}$ of ${(X (t))}_{t \geq 0}$ is persistent. Moreover, the process ${(\hat{X} (t))}_{t \geq 0}$ has a unique invariant probability measure $\hat{π}$ on $R_{+}^{n, \circ}$ that is absolutely continuous with respect to the Lebesgue measure and for any $q^{*} > 0$

\begin{matrix} lim_{t \to \infty} t^{q^{*}} {‖ P_{\hat{X}} (t, x, \cdot) - \hat{π} (\cdot) ‖}_{TV} = 0, x \in R_{+}^{n, \circ}, \end{matrix}

where $P_{\hat{X}} (t, x, \cdot)$ is the transition probability of ${(\hat{X} (t))}_{t \geq 0}$ .

Biological interpretation of Theorem 2.8 As long as the perturbation of our model is small, persistence does not change to extinction. Our model, even though it is only an approximation of reality, can therefore provide relevant information regarding biological systems: small enough changes in the growth rates, the competition rates, the dispersion matrix and the covariance matrix do not destroy the persistence of a persistent system.

Theoretical and numerical examples

This subsection is devoted to some theoretical and numerical examples. We choose the dimension to be $n = 2$ , so that we can compute the stochastic growth rate explicitly.

Remark 3.1

If an explicit expression for r is desirable, one needs to determine the first and second moments for the invariant probability measure $ν^{*}$ . One can show that $ρ^{*}$ , the density of $ν^{*}$ with respect to Lebesgue measure, satisfies

\begin{matrix} - \sum_{i} \frac{\partial}{\partial y_{i}} [μ_{i} (y) ρ^{*} (y)] + \frac{1}{2} \sum_{i, j} \frac{\partial^{2}}{\partial y_{i} \partial y_{j}} [v_{i j} (y) ρ^{*} (y)] = 0, y \in Δ, \end{matrix}

3.1

where $μ_{i} (y)$ and $v_{i j} (y)$ are the entries of

\begin{matrix} \begin{matrix} μ (y) & = D^{⊤} y + (diag (y) - y y^{⊤}) (a - Σ y), \\ v (y) & = (diag (y) - y y^{⊤}) Γ^{⊤} Γ (diag (y) - y y^{⊤}), \end{matrix} \end{matrix}

and $ρ^{*}$ is constrained by $\int_{Δ} ρ^{*} (y) d y = 1$ with appropriate boundary conditions. The boundary conditions are usually found by characterizing the domain of the infinitesimal generator of the Feller diffusion process $\tilde{Y} (t)$ , which is usually a very difficult problem.

However, following Evans et al. (2013), in the case of two patches ( $n = 2$ ) and non-degenerate noise the problem is significantly easier. Let $Σ = diag (σ_{1}^{2}, σ_{2}^{2})$ . The system becomes

\begin{matrix} \{\begin{matrix} d X_{1} (t) = (X_{1} (t) (a_{1} - b X_{1} (t)) - α X_{1} (t) + β X_{2} (t)) d t + σ_{1} X_{1} (t) d B_{1} (t) \\ d X_{2} (t) = (X_{2} (t) (a_{2} - b X_{2} (t)) + α X_{1} (t) - β X_{2} (t)) d t + σ_{2} X_{2} (t) d B_{2} (t) . \end{matrix} \end{matrix}

3.2

It is easy to find the density $ρ_{1}^{*}$ of ${\tilde{Y}}_{1} (\infty)$ explicitly (by solving (3.1) and noting that 0, 1 are both entrance boundaries for the diffusion ${\tilde{Y}}_{1} (t)$ ). Then

\begin{matrix} ρ_{1}^{*} (x) = C x^{β - α_{1}} {(1 - x)}^{- β - α_{2}} exp (- \frac{2}{σ_{1}^{2} + σ_{2}^{2}} (\frac{β}{x} + \frac{α}{1 - x})), x \in (0, 1) \end{matrix}

where $C > 0$ is a normalization constant and

\begin{matrix} \begin{matrix} α_{i} & : = \frac{2 σ_{i}^{2}}{σ_{1}^{2} + σ_{2}^{2}}, i = 1, 2 \\ β & : = \frac{2}{σ_{1}^{2} + σ_{2}^{2}} (a_{1} - a_{2} + β - α) . \end{matrix} \end{matrix}

One can then get the following explicit expression for the Lyapunov exponent

\begin{matrix} r = & a_{2} - \frac{σ_{2}^{2}}{2} + (a_{1} - a_{2} + σ_{2}^{2}) \int_{0}^{1} y ρ_{1}^{*} (y) d y \\ - \frac{σ_{1}^{2} + σ_{2}^{2}}{2} \int_{0}^{1} y^{2} ρ_{1}^{*} (y) d y . \end{matrix}

3.3
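As a rough numerical illustration of (3.3), the two moments of $ρ_{1}^{*}$ can be approximated by quadrature. The sketch below uses a plain midpoint rule in Python. The function name, the grid size `n` and the variable `kappa` (which stands for the constant that the text introduces with " $β : =$ ", renamed here to avoid clashing with the dispersal rate $β$ ) are assumptions made for this example. The weights are normalized in log scale, so the constant C is never computed.

```python
import math


def growth_rate_nondegenerate(a1, a2, alpha, beta, sigma1, sigma2, n=20000):
    """Approximate r from (3.3) by midpoint quadrature on (0, 1).

    Returns (r, first_moment, second_moment) of the density rho_1^*.
    """
    s = sigma1 ** 2 + sigma2 ** 2
    alpha1 = 2.0 * sigma1 ** 2 / s
    alpha2 = 2.0 * sigma2 ** 2 / s
    kappa = 2.0 / s * (a1 - a2 + beta - alpha)

    xs = [(k + 0.5) / n for k in range(n)]
    # Logarithm of the unnormalized density at each midpoint.
    logw = [
        (kappa - alpha1) * math.log(x)
        + (-kappa - alpha2) * math.log(1.0 - x)
        - (2.0 / s) * (beta / x + alpha / (1.0 - x))
        for x in xs
    ]
    top = max(logw)
    w = [math.exp(v - top) for v in logw]  # underflows harmlessly to 0.0
    total = sum(w)
    m1 = sum(x * wi for x, wi in zip(xs, w)) / total
    m2 = sum(x * x * wi for x, wi in zip(xs, w)) / total

    r = a2 - sigma2 ** 2 / 2.0 + (a1 - a2 + sigma2 ** 2) * m1 - s / 2.0 * m2
    return r, m1, m2
```

In the symmetric situation $a_{1} = a_{2}$ , $α = β$ , $σ_{1} = σ_{2}$ the density is symmetric about $1 / 2$ , so the first moment returned should be close to $1 / 2$ ; this gives a simple sanity check of the quadrature.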

Next, consider the degenerate case

\begin{matrix} \{\begin{matrix} d X_{1} (t) = (X_{1} (t) (a_{1} - b_{1} X_{1} (t)) - α X_{1} (t) + β X_{2} (t)) d t + σ_{1} X_{1} (t) d B (t) \\ d X_{2} (t) = (X_{2} (t) (a_{2} - b_{2} X_{2} (t)) + α X_{1} (t) - β X_{2} (t)) d t + σ_{2} X_{2} (t) d B (t), \end{matrix} \end{matrix}

3.4

where $σ_{1}, σ_{2}$ are non-zero constants and ${(B (t))}_{t \geq 0}$ is a one dimensional Brownian motion. Since ${\tilde{Y}}_{1} (t) + {\tilde{Y}}_{2} (t) = 1$ , to find the invariant probability measure of $\tilde{Y} (t)$ , we only need to find the invariant probability measure of ${\tilde{Y}}_{1} (t)$ .

If $σ_{1} \neq σ_{2}$ we can find the invariant density $ρ_{1}^{*}$ of ${\tilde{Y}}_{1} (\infty)$ explicitly (by solving (3.1)). Then

\begin{matrix} ρ_{1}^{*} (x) = & C x^{\hat{β} - {\hat{α}}_{1}} {(1 - x)}^{- \hat{β} - {\hat{α}}_{2}} exp (- \frac{2}{{(σ_{1} - σ_{2})}^{2}} (\frac{β}{x} + \frac{α}{1 - x})), \\ x \in (0, 1) \end{matrix}

3.5

where $C > 0$ is a normalization constant and

\begin{matrix} \begin{matrix} {\hat{α}}_{1} & : = \frac{- 2 σ_{1}}{(σ_{1} - σ_{2})}, {\hat{α}}_{2} : = \frac{2 σ_{2}}{(σ_{1} - σ_{2})}, \\ \hat{β} & : = \frac{2}{{(σ_{1} - σ_{2})}^{2}} (a_{1} - a_{2} + β - α) . \end{matrix} \end{matrix}

The Lyapunov exponent can now be expressed as

\begin{matrix} r = & a_{2} - \frac{σ_{2}^{2}}{2} + (a_{1} - a_{2} + σ_{2}^{2}) \int_{0}^{1} y ρ_{1}^{*} (y) d y \\ - \frac{{(σ_{1} - σ_{2})}^{2}}{2} \int_{0}^{1} y^{2} ρ_{1}^{*} (y) d y . \end{matrix}

We note that the structure of the stochastic growth rate r for non-degenerate noise (3.3) and for degenerate noise (2.16) with $σ_{1} \neq σ_{2}$ is the same. The only difference is that one needs to make the substitution $σ_{1}^{2} + σ_{2}^{2} \mapsto {(σ_{1} - σ_{2})}^{2}$ and the changes in ${\hat{α}}_{i}$ .

If $σ_{1} = σ_{2} = : σ$ the system (2.6) for $\tilde{Y} (t) = ({\tilde{Y}}_{1} (t), {\tilde{Y}}_{2} (t))$ can be written as

\begin{matrix} \{\begin{matrix} d {\tilde{Y}}_{1} (t) = & ({\tilde{Y}}_{1} (t) (a_{1} - a_{1} {\tilde{Y}}_{1} (t) - a_{2} {\tilde{Y}}_{2} (t)) - α {\tilde{Y}}_{1} (t) + β {\tilde{Y}}_{2} (t)) d t \\ + σ^{2} {\tilde{Y}}_{1} (t) [{({\tilde{Y}}_{1} (t) + {\tilde{Y}}_{2} (t))}^{2} - ({\tilde{Y}}_{1} (t) + {\tilde{Y}}_{2} (t))] d t \\ d {\tilde{Y}}_{2} (t) = & ({\tilde{Y}}_{2} (t) (a_{2} - a_{1} {\tilde{Y}}_{1} (t) - a_{2} {\tilde{Y}}_{2} (t)) - β {\tilde{Y}}_{2} (t) + α {\tilde{Y}}_{1} (t)) d t \\ + σ^{2} {\tilde{Y}}_{2} (t) [{({\tilde{Y}}_{1} (t) + {\tilde{Y}}_{2} (t))}^{2} - ({\tilde{Y}}_{1} (t) + {\tilde{Y}}_{2} (t))] d t . \end{matrix} \end{matrix}

3.6

Using the fact that ${\tilde{Y}}_{1} (t) + {\tilde{Y}}_{2} (t) = 1$ this reduces to

\begin{matrix} d {\tilde{Y}}_{1} (t) = ((a_{1} - a_{2}) [1 - {\tilde{Y}}_{1} (t)] {\tilde{Y}}_{1} (t) + β - (α + β) {\tilde{Y}}_{1} (t)) d t . \end{matrix}

3.7

The unique equilibrium of (3.7) in [0,1] is the root $y^{⋆}$ in [0,1] of $(a_{1} - a_{2}) (1 - y) y + β - (α + β) y = 0 .$ Hence, the unique invariant probability measure of $\tilde{Y} (t)$ in this case is the Dirac measure concentrated in $(y^{⋆}, 1 - y^{⋆})$ . Thus

\begin{matrix} r = a_{2} - \frac{σ^{2}}{2} + (a_{1} - a_{2}) y^{⋆} . \end{matrix}

The degenerate case $σ_{1} = σ_{2}, α = β$

Consider the following system, where $α, σ, a_{i}, b_{i}, i = 1, 2$ are positive constants.

\begin{matrix} \{\begin{matrix} d X_{1} (t) = (X_{1} (t) (a_{1} - b_{1} X_{1} (t)) - α X_{1} (t) + α X_{2} (t)) d t + σ X_{1} (t) d B (t) \\ d X_{2} (t) = (X_{2} (t) (a_{2} - b_{2} X_{2} (t)) + α X_{1} (t) - α X_{2} (t)) d t + σ X_{2} (t) d B (t) . \end{matrix} \end{matrix}

3.8

Suppose that $a_{1} \neq a_{2}$ or that $b_{1} \neq b_{2}$ . This system is degenerate since both equations are driven by a single Brownian motion. In this case, the unique equilibrium of (3.7) in [0,1] is the root $y^{⋆}$ in [0,1] of $(a_{1} - a_{2}) (1 - y) y + α (1 - 2 y) = 0 .$ Solving this quadratic equation, we have $y^{⋆} = \frac{a_{1} - a_{2} - 2 α + \sqrt{{(a_{1} - a_{2})}^{2} + 4 α^{2}}}{2 (a_{1} - a_{2})}$ if $a_{1} \neq a_{2}$ and $y^{⋆} = \frac{1}{2}$ if $a_{1} = a_{2}$ .

It can be proved easily that this equilibrium is asymptotically stable and that ${lim}_{t \to \infty} {\tilde{Y}}_{1} (t) = y^{⋆}$ . Thus, if $a_{1} \neq a_{2}$

\begin{matrix} \begin{matrix} r = & a_{1} y^{⋆} + a_{2} (1 - y^{⋆}) - \frac{σ^{2}}{2} \\ = & a_{2} + \frac{a_{1} - a_{2} - 2 α + \sqrt{{(a_{1} - a_{2})}^{2} + 4 α^{2}}}{2} - \frac{σ^{2}}{2} \\ = & \frac{a_{1} + a_{2} - 2 α + \sqrt{{(a_{1} - a_{2})}^{2} + 4 α^{2}}}{2} - \frac{σ^{2}}{2} . \end{matrix} \end{matrix}

As a result

\begin{matrix} r = \{\begin{matrix} \frac{a_{1} + a_{2} - 2 α + \sqrt{{(a_{1} - a_{2})}^{2} + 4 α^{2}}}{2} - \frac{σ^{2}}{2} & if a_{1} \neq a_{2}, b_{1} = b_{2} \\ a_{1} - \frac{σ^{2}}{2} & if a_{1} = a_{2}, b_{1} \neq b_{2} . \end{matrix} \end{matrix}

3.9
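The computation leading to (3.9) can be reproduced numerically. The Python sketch below is only illustrative and its function names are assumptions made here. It computes the root $y^{⋆}$ of $(a_{1} - a_{2}) (1 - y) y + α (1 - 2 y) = 0$ , evaluates $r = a_{1} y^{⋆} + a_{2} (1 - y^{⋆}) - \frac{σ^{2}}{2}$ , and compares the result with the closed form in (3.9).

```python
import math


def y_star(a1, a2, alpha):
    """Root in [0, 1] of (a1 - a2)(1 - y) y + alpha (1 - 2 y) = 0."""
    d = a1 - a2
    if d == 0:
        return 0.5
    return (d - 2.0 * alpha + math.sqrt(d * d + 4.0 * alpha * alpha)) / (2.0 * d)


def growth_rate_from_root(a1, a2, alpha, sigma):
    """r = a1 y* + a2 (1 - y*) - sigma^2 / 2."""
    y = y_star(a1, a2, alpha)
    return a1 * y + a2 * (1.0 - y) - sigma ** 2 / 2.0


def growth_rate_closed_form(a1, a2, alpha, sigma):
    """Closed form from (3.9); for a1 == a2 it reduces to a1 - sigma^2 / 2."""
    d = a1 - a2
    return (a1 + a2 - 2.0 * alpha + math.sqrt(d * d + 4.0 * alpha * alpha)) / 2.0 \
        - sigma ** 2 / 2.0
```

Note that the closed form covers both lines of (3.9): when $a_{1} = a_{2}$ the square root equals $2 α$ and the expression collapses to $a_{1} - \frac{σ^{2}}{2}$ .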

Note that if $a_{1} \neq a_{2}$ and $b_{1} = b_{2}$

\begin{matrix} α + (b_{2} / b_{1}) (a_{1} - a_{2}) - α {(b_{2} / b_{1})}^{2} = a_{1} - a_{2} \neq 0 \end{matrix}

and that if $a_{1} = a_{2}$ and $b_{1} \neq b_{2}$

\begin{matrix} α + (b_{2} / b_{1}) (a_{1} - a_{2}) - α {(b_{2} / b_{1})}^{2} = α (1 - {(b_{2} / b_{1})}^{2}) \neq 0 . \end{matrix}

Therefore, the assumptions of Theorem 2.6 hold. If $r < 0$ , by Theorem 2.5 the population goes extinct, while if $r > 0$ , the population persists by Theorem 2.6.

The degenerate case when the conditions of Theorem 2.6 are violated

We analyse the system

\begin{matrix} \{\begin{matrix} d X_{1} (t) = (X_{1} (t) (a_{1} - b X_{1} (t)) - α X_{1} (t) + β X_{2} (t)) d t + σ X_{1} (t) d B (t) \\ d X_{2} (t) = (X_{2} (t) (a_{2} - b X_{2} (t)) + α X_{1} (t) - β X_{2} (t)) d t + σ X_{2} (t) d B (t), \end{matrix} \end{matrix}

3.10

when $2 (β - α) = a_{2} - a_{1}$ . In this case $σ_{1} = σ_{2} = σ$ ,

\begin{matrix} β + (b_{2} / b_{1}) (a_{1} - a_{2} - α + β) - α {(b_{2} / b_{1})}^{2} = 0 \end{matrix}

and

\begin{matrix} r = a_{1} - α + β - \frac{σ^{2}}{2} . \end{matrix}
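As a quick check of this value, assume the representation $r = a_{1} y^{⋆} + a_{2} (1 - y^{⋆}) - \frac{σ^{2}}{2}$ used for the case $σ_{1} = σ_{2}$ , where $y^{⋆}$ is the root in [0,1] of $(a_{1} - a_{2}) (1 - y) y + β - (α + β) y = 0$ . Substituting $y = \frac{1}{2}$ and using $2 (β - α) = a_{2} - a_{1}$ gives

\begin{matrix} \frac{a_{1} - a_{2}}{4} + β - \frac{α + β}{2} = \frac{(a_{1} - a_{2}) + 2 (β - α)}{4} = 0, \end{matrix}

so that $y^{⋆} = \frac{1}{2}$ and

\begin{matrix} r = \frac{a_{1} + a_{2}}{2} - \frac{σ^{2}}{2} = a_{1} + \frac{a_{2} - a_{1}}{2} - \frac{σ^{2}}{2} = a_{1} - α + β - \frac{σ^{2}}{2} . \end{matrix}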

If $r < 0$ then ${lim}_{t \to \infty} X_{1} (t) = {lim}_{t \to \infty} X_{2} (t) = 0$ almost surely by Theorem 2.5.

We focus on the case $r > 0$ and show that some of the results violate the conclusions of Theorem 2.6.

If we set $Z (t) = X_{1} (t) / X_{2} (t)$ then (see (C.6))

\begin{matrix} d Z (t) = ((1 - Z (t)) Z (t) X_{2} (t) + β + {\hat{a}}_{1} Z (t) - α Z^{2} (t)) d t . \end{matrix}

Noting that ${\hat{a}}_{1} = a_{1} - a_{2} - α + β = α - β$ yields

\begin{matrix} d (Z (t) - 1) = (- (Z (t) - 1) Z (t) X_{2} (t) - (Z (t) - 1) (α Z (t) + β)) d t . \end{matrix}

Assume $Z (0) \neq 1$ and without loss of generality suppose $Z (0) > 1$ . This implies

\begin{matrix} Z (t) - 1 = (Z (0) - 1) exp (- \int_{0}^{t} [Z (s) X_{2} (s) + (α Z (s) + β)] d s) . \end{matrix}

3.11

Since Z(t) and $X_{2} (t)$ do not explode to $\pm \infty$ in finite time we can conclude that if $Z (0) \neq 1$ then $Z (t) \neq 1$ for any $t \geq 0$ with probability 1. In other words, if $x = (x_{1}, x_{2}) \in R_{+}^{2, \circ}$ with $x_{1} \neq x_{2}$ then

\begin{matrix} P (X_{1}^{x} (t) = X_{2}^{x} (t), t \geq 0) = 0 . \end{matrix}

One can further see from (3.11) that $Z (t) - 1$ tends to 0 exponentially fast. If $Z (0) = 1$ let $X_{1} (0) = X_{2} (0) = x > 0$ . Similar arguments to the above show that

\begin{matrix} P (X_{1}^{x} (t) \neq X_{2}^{x} (t), t \geq 0) = 0 . \end{matrix}

To gain more insight into the asymptotic properties of $(X_{1} (t), X_{2} (t))$ , we study

\begin{matrix} \begin{matrix} d X_{2} (t) = & X_{2} (t) (({\hat{a}}_{2} - b X_{2} (t)) + α Z (t)) d t + σ X_{2} (t) d B (t) \\ = & X_{2} (t) (a_{1} - α + β - b X_{2} (t) + α (Z (t) - 1)) d t + σ X_{2} (t) d B (t) \end{matrix} \end{matrix}

We have from Itô’s formula that,

\begin{matrix} \begin{matrix} d \frac{1}{X_{2} (t)} & = (b + (- a_{1} + α - β + σ^{2} - α (Z (t) - 1)) \frac{1}{X_{2} (t)}) d t \\ - σ \frac{1}{X_{2} (t)} d B (t) . \end{matrix} \end{matrix}

By the variation-of-constants formula (see Mao 1997, Section 3.4), we have

\begin{matrix} \frac{1}{X_{2} (t)} = ϕ^{- 1} (t) [\frac{1}{x_{2}} + b \int_{0}^{t} ϕ (s) d s] \end{matrix}

where

\begin{matrix} ϕ (t) : = exp [r t + α \int_{0}^{t} (Z (s) - 1) d s + σ B (t)] . \end{matrix}

Thus,

\begin{matrix} X_{2} (t) = \frac{ϕ (t)}{x_{2}^{- 1} + b \int_{0}^{t} ϕ (s) d s} . \end{matrix}

It is well-known that

\begin{matrix} U (t) : = \frac{e^{r t + σ B (t)}}{x_{2}^{- 1} + b \int_{0}^{t} e^{r s + σ B (s)} d s}, \end{matrix}

is the solution to the stochastic logistic equation

\begin{matrix} d U (t) = U (t) (a_{1} - α + β - b U (t)) d t + σ U (t) d B (t), U (0) = x_{2} . \end{matrix}

By the law of the iterated logarithm, almost surely

\begin{matrix} lim_{t \to \infty} ϕ (t) = lim_{t \to \infty} e^{r t + σ B (t)} = \infty . \end{matrix}

3.12

We have

\begin{matrix} \frac{X_{2} (t)}{U (t)} = \frac{exp (α \int_{0}^{t} (Z (s) - 1) d s) [x_{2}^{- 1} + b \int_{0}^{t} e^{r s + σ B (s)} d s]}{x_{2}^{- 1} + b \int_{0}^{t} ϕ (s) d s} . \end{matrix}

In view of (3.12), we can use L'Hôpital's rule to obtain

\begin{matrix} lim_{t \to \infty} \frac{X_{2} (t)}{U (t)} \\ = lim_{t \to \infty} \frac{exp (α \int_{0}^{t} (Z (s) - 1) d s) e^{r t + σ B (t)}}{ϕ (t)} \\ + lim_{t \to \infty} \frac{α (Z (t) - 1) exp (α \int_{0}^{t} (Z (s) - 1) d s) [x_{2}^{- 1} + b \int_{0}^{t} e^{r s + σ B (s)} d s]}{b ϕ (t)} \\ = 1 + lim_{t \to \infty} \frac{α (Z (t) - 1) [x_{2}^{- 1} + b \int_{0}^{t} e^{r s + σ B (s)} d s]}{b e^{r t + σ B (t)}} \end{matrix}

3.13

almost surely. By the law of the iterated logarithm, ${lim}_{t \to \infty} \frac{e^{r t + σ B (t)}}{e^{(r - ε) t}} = \infty$ and ${lim}_{t \to \infty} \frac{e^{r t + σ B (t)}}{e^{(r + ε) t}} = 0$ for any $ε > 0$ . Applying this and (3.11) to (3.13), it is easy to show that with probability 1

\begin{matrix} lim_{t \to \infty} \frac{X_{2} (t)}{U (t)} = 1 . \end{matrix}

Since ${lim}_{t \to \infty} Z (t) = 1$ almost surely, we also have ${lim}_{t \to \infty} \frac{X_{1} (t)}{U (t)} = 1$ almost surely. Thus, the long term behavior of $X_{1} (t)$ and $X_{2} (t)$ is governed by the one-dimensional diffusion U(t). In particular, both $X_{1} (t)$ and $X_{2} (t)$ converge to a unique invariant probability measure $ρ$ on $(0, \infty)$ , which is the invariant probability measure of U(t). In this case, the invariant probability measure of $X (t) = {(X_{1} (t), X_{2} (t))}_{t \geq 0}$ is not absolutely continuous with respect to the Lebesgue measure on $R_{+}^{2, \circ}$ . Instead, the invariant probability measure is concentrated on the one-dimensional manifold $\{x = (x_{1}, x_{2}) \in R_{+}^{2, \circ} : x_{1} = x_{2}\}$ .
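The convergence of $X_{1} (t) / X_{2} (t)$ and $X_{2} (t) / U (t)$ to 1 can be observed in a simple simulation. The Python sketch below is an illustration under stated assumptions, not the method of the paper: it applies an Euler scheme to the logarithms of $X_{1}, X_{2}, U$ (so the iterates stay positive), drives all three with the same Brownian increments, and enforces $2 (β - α) = a_{2} - a_{1}$ by construction. The function name, step size `dt`, horizon `T` and `seed` are hypothetical choices.

```python
import math
import random


def simulate_equal_noise_case(a1, alpha, beta, b, sigma, x1, x2,
                              T=20.0, dt=1e-3, seed=0):
    """Simulate system (3.10) and the logistic diffusion U with U(0) = x2.

    Returns (X1(T), X2(T), U(T)) from a log-Euler discretization.
    """
    a2 = a1 + 2.0 * (beta - alpha)      # enforces 2(beta - alpha) = a2 - a1
    half_var = 0.5 * sigma ** 2         # Ito correction for the logarithm
    rng = random.Random(seed)
    l1, l2, lu = math.log(x1), math.log(x2), math.log(x2)
    for _ in range(int(round(T / dt))):
        X1, X2, U = math.exp(l1), math.exp(l2), math.exp(lu)
        dB = rng.gauss(0.0, math.sqrt(dt))   # one shared Brownian increment
        l1 += (a1 - b * X1 - alpha + beta * X2 / X1 - half_var) * dt + sigma * dB
        l2 += (a2 - b * X2 + alpha * X1 / X2 - beta - half_var) * dt + sigma * dB
        lu += (a1 - alpha + beta - b * U - half_var) * dt + sigma * dB
    return math.exp(l1), math.exp(l2), math.exp(lu)
```

Because the same increment `dB` enters both logarithms, the noise cancels exactly in $ln Z (t) = ln X_{1} (t) - ln X_{2} (t)$ , which mirrors the fact that the equation for Z(t) above has no $d B (t)$ term.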

Biological interpretation The stochastic growth rate in this degenerate setting is given by $r = a_{1} - α + β - \frac{σ^{2}}{2}$ . We note that this term is equal to the stochastic growth rate of patch 1, $a_{1} - \frac{σ^{2}}{2}$ , to which we add $β$ , the rate of dispersal from patch 2 to patch 1, and subtract $α$ , the rate of dispersal from patch 1 to patch 2. When

\begin{matrix} a_{1} - \frac{σ^{2}}{2} > α - β \end{matrix}

one has persistence, while when

\begin{matrix} a_{1} - \frac{σ^{2}}{2} < α - β \end{matrix}

one has extinction. In particular, if the patches on their own are sink patches, so that $a_{1} - \frac{σ^{2}}{2} < 0$ and $a_{2} - \frac{σ^{2}}{2} < 0$ , dispersion cannot lead to persistence since

\begin{matrix} a_{1} - \frac{σ^{2}}{2} > α - β and a_{2} - \frac{σ^{2}}{2} > β - α \end{matrix}

cannot hold simultaneously. The behavior of the system when $r > 0$ is different from the behavior in the non-degenerate setting of Theorem 2.1 or the degenerate setting of Theorem 2.6. Namely, if the patches start with equal populations then the patch abundances remain equal for all times and evolve according to the one-dimensional logistic diffusion U(t). If the patches start with different population abundances then $X_{1} (t)$ and $X_{2} (t)$ are never equal but tend to each other asymptotically as $t \to \infty$ . Furthermore, the long term behavior of $X_{1} (t)$ and $X_{2} (t)$ is once again determined by the logistic diffusion U(t) as almost surely $\frac{X_{i} (t)}{U (t)} \to 1$ as $t \to \infty$ . As such, if $r > 0$ we have persistence but the invariant measure the system converges to no longer has $R_{+}^{2, \circ}$ as its support. Instead the invariant measure has the line $\{x = (x_{1}, x_{2}) \in R_{+}^{2, \circ} : x_{1} = x_{2}\}$ as its support.
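The incompatibility of the two inequalities for sink patches, used in the interpretation above, can be seen by adding them. Here the second inequality is the condition $r > 0$ rewritten using $a_{1} - α + β = a_{2} + α - β$ , which follows from $2 (β - α) = a_{2} - a_{1}$ :

\begin{matrix} (a_{1} - \frac{σ^{2}}{2}) + (a_{2} - \frac{σ^{2}}{2}) > (α - β) + (β - α) = 0 . \end{matrix}

If both patches are sinks, the left-hand side is a sum of two negative numbers, which is a contradiction.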

Example 3.1

We discuss the case when $a_{1} \neq a_{2}$ and $σ_{1} = σ_{2}$ . The stochastic growth rate can be written by the analysis in the sections above as

\begin{matrix} r = \{\begin{matrix} \frac{a_{1} + a_{2} - 2 α + \sqrt{{(a_{1} - a_{2})}^{2} + 4 α^{2}}}{2} - \frac{σ^{2}}{2} & if α = β, b_{1} = b_{2} \\ a_{1} - α + β - \frac{σ^{2}}{2} & if a_{2} - a_{1} = 2 (β - α), b_{1} = b_{2} . \end{matrix} \end{matrix}

3.14

Biological interpretation In the case when $a_{1} = a_{2}, σ_{1} = σ_{2}$ and $b_{1} \neq b_{2}$ (so that the two patches only differ in their competition rates) the stochastic growth rate r does not depend on the dispersal rate $α$ . The system behaves just as a single-patch system with stochastic growth rate $a_{1} - \frac{σ^{2}}{2}$ . In contrast to Evans et al. (2013, Example 1), coupling two sink patches by dispersion cannot yield persistence.

However, if the growth rates of the patches are different $a_{1} \neq a_{2}$ then the expression for r given in (3.14) yields for $α ≫ | a_{1} - a_{2} |$ that

\begin{matrix} r \approx \frac{a_{1} + a_{2}}{2} - \frac{σ^{2}}{2} + \frac{{(a_{1} - a_{2})}^{2}}{8 α} . \end{matrix}

In particular

\begin{matrix} lim_{α \to \infty} r (α) = \frac{a_{1} + a_{2}}{2} - \frac{σ^{2}}{2} . \end{matrix}

We note that r is a decreasing function of the dispersal rate $α$ for large values of $α$ (also see Fig. 1). This is different from the result of Evans et al. (2013, Example 1) where r was shown to be an increasing function of $α$ . In contrast to the non-degenerate case, coupling patches by dispersal decreases the stochastic growth rate and as such makes persistence less likely. This highlights the negative effect of spatial correlations on population persistence and why one may no longer get the rescue effect. This is one of our main biological conclusions. Furthermore, we also recover that dispersal has a negative impact on the stochastic growth rate when there is spatial heterogeneity (i.e. $a_{1} \neq a_{2}$ ). This fact has a long history, going back to the work by Karlin (1982).
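The dependence of r on $α$ in the first line of (3.14) is easy to explore numerically. The Python sketch below (function names are assumptions made here) evaluates the closed form and the large- $α$ approximation given above. Differentiating the closed form gives $\frac{d r}{d α} = - 1 + \frac{2 α}{\sqrt{{(a_{1} - a_{2})}^{2} + 4 α^{2}}} < 0$ when $a_{1} \neq a_{2}$ , so the computed values should decrease along any increasing grid of $α$ .

```python
import math


def growth_rate(alpha, a1, a2, sigma):
    """First line of (3.14): the case alpha = beta, b1 = b2, sigma1 = sigma2."""
    d = a1 - a2
    return (a1 + a2 - 2.0 * alpha + math.sqrt(d * d + 4.0 * alpha * alpha)) / 2.0 \
        - sigma ** 2 / 2.0


def growth_rate_large_alpha(alpha, a1, a2, sigma):
    """Approximation (a1 + a2)/2 - sigma^2/2 + (a1 - a2)^2 / (8 alpha)."""
    return (a1 + a2) / 2.0 - sigma ** 2 / 2.0 + (a1 - a2) ** 2 / (8.0 * alpha)


def growth_rate_limit(a1, a2, sigma):
    """Limit of r(alpha) as alpha tends to infinity."""
    return (a1 + a2) / 2.0 - sigma ** 2 / 2.0
```

At $α = 0$ the closed form reduces to $max (a_{1}, a_{2}) - \frac{σ^{2}}{2}$ , the growth rate of the better patch in isolation, and it decreases from there towards the limit $\frac{a_{1} + a_{2}}{2} - \frac{σ^{2}}{2}$ .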

Discussion and generalizations

For numerous models of population dynamics it is natural to assume that time is continuous. One reason for this is that often environmental conditions change continuously with time and therefore can naturally be described by continuous time models. There have been a few papers dedicated to the study of stochastic differential equation models of interacting, unstructured populations in stochastic environments (see Benaïm et al. 2008; Schreiber et al. 2011; Evans et al. 2015). These models however do not account for population structure or correlated environmental fluctuations.

Examples of structured populations can be found by looking at a population in which individuals can live in one of n patches (e.g. fish swimming between basins of a lake or butterflies dispersing between meadows). Dispersion is viewed by many population biologists as an important mechanism for survival. Not only does dispersion allow individuals to escape unfavorable landscapes (due to environmental changes or lack of resources), it also helps populations smooth out local spatio-temporal environmental changes. Patch models of dispersion have been studied extensively in the deterministic setting (see for example Hastings 1983; Cantrell et al. 2012). In the stochastic setting, there have been results for discrete time and space by Benaïm and Schreiber (2009), for continuous time and discrete space by Evans et al. (2013) and for structured populations that evolve continuously both in time and space.

We analyze the dynamics of a population that is spread throughout n patches, evolves in a stochastic environment (that can be spatially correlated), disperses among the patches and whose members compete with each other for resources. We characterize the long-term behavior of our system as a function of r—the growth rate in the absence of competition. The quantity r is also the Lyapunov exponent of a suitable linearization of the system around 0. Our analysis shows that $r < 0$ implies extinction and $r > 0$ persistence. The limit case $r = 0$ cannot be analyzed in our framework. We expect that new methods have to be developed in order to tackle the $r = 0$ scenario.

Since mathematical models are always approximations of nature it is necessary to study how the persistence and extinction results change under small perturbations of the parameters of the models. The concept of robust persistence (or permanence) has been introduced by Hutson and Schmitt (1992). They showed that for certain systems persistence holds even when one has small perturbations of the growth functions. There have been results on robust persistence in the deterministic setting for Kolmogorov systems by Schreiber (2000) and Garay and Hofbauer (2003). Recently, robust permanence for deterministic Kolmogorov equations with respect to perturbations in both the growth functions and the feedback dynamics has been analyzed by Patel and Schreiber (2016). In the stochastic differential equations setting results on robust persistence and extinction have been proven by Schreiber et al. (2011) and Benaïm et al. (2008). We prove analogous results in our framework where the populations are coupled by dispersal. For robust persistence we show in Appendix D that even with density-dependent perturbations of the growth rates, dispersion matrix and environmental covariance matrix, if these perturbations are sufficiently small and if the unperturbed system is persistent then the perturbed system is also persistent. In the case of extinction we can prove robustness when there are small constant perturbations of the growth rates, dispersal matrices and covariance matrices.

In ecology there has been an increased interest in the spatial synchrony present in population dynamics. This refers to the changes in the time-dependent characteristics (i.e. abundances etc) of structured populations. One of the mechanisms which creates synchrony is the dependence of the population dynamics on a synchronous random environmental factor such as temperature or rainfall. The synchronizing effect of environmental stochasticity, or the so-called Moran effect, has been observed in multiple population models. Usually this effect is the result of random but correlated weather effects acting on spatially structured populations. Following Legendre (1993) one could argue that our world is a spatially correlated one. For many biotic and abiotic factors, like population density, temperature or growth rate, values at close locations are usually similar. For an in-depth analysis of spatial synchrony see Kendall et al. (2000) and Liebhold et al. (2004). Most stochastic differential models appearing in population dynamics treat only the case when the noise is non-degenerate (although see Rudnicki 2003; Dieu et al. 2016). This simplifies the technical proofs significantly. However, from a biological point of view it is not clear that the noise should never be degenerate. For example if one models a system with multiple populations then all populations can be influenced by the same factors (a disease, changes in temperature and sunlight etc). Environmental factors can intrinsically create spatial correlations and as such it makes sense to study how these degenerate systems compare to the non-degenerate ones. In our setting the n different patches could be strongly spatially correlated. Actually, in some cases it could be more realistic to have the same one-dimensional Brownian motion ${(B_{t})}_{t \geq 0}$ driving the dynamics of all patches. 
We were able to find conditions under which the proofs from the non-degenerate case can be generalized to the degenerate setting. This is a first step towards a model that tries to explain the complex relationship between dispersal, stochastic environments and spatial correlations.

We fully analyze what happens if there are only two patches, $n = 2$ , and the noise is degenerate. Our results show unexpectedly, and in contrast to the non-degenerate results by Evans et al. (2013), that coupling two sink patches cannot yield persistence. More generally, we show that the stochastic growth rate is a decreasing function of the dispersal rate. In specific instances of the degenerate setting, even when there is persistence, the invariant probability measure the system converges to does not have $R_{+}^{2, \circ}$ as its support. Instead, the abundances of the two patches converge to an invariant probability measure supported on the line $\{x = (x_{1}, x_{2}) \in R_{+}^{2, \circ} : x_{1} = x_{2}\}$ . These examples show that degenerate noise is not just an added technicality: the results can be completely different from those in the non-degenerate setting. The negative effect of spatial correlations (including the fully degenerate case) has been studied in several papers for discrete-time models (see Schreiber 2010; Harrison and Quinn 1989; Palmqvist and Lundberg 1998; Bascompte et al. 2002; Roy et al. 2005). The negative impact of dispersal on the stochastic growth rate r when there is spatial heterogeneity (i.e. $a_{1} \neq a_{2}$ ) has a long history going back to the work of Karlin (1982) on the Reduction Principle. Following Altenberg (2012) the reduction principle can be stated as the widely exhibited phenomenon that mixing reduces growth, and differential growth selects for reduced mixing. The first use of this principle in the study of the evolution of dispersal can be found in Hastings (1983). The work of Kirkland et al. (2006) provides an independent proof of the Reduction Principle and applications to nonlinear competing species in discrete-time, discrete-space models. In the case of continuous-time, discrete-space models (given by branching processes) a version of the Reduction Principle is analysed by Schreiber and Lloyd-Smith (2009).

k species competing and dispersing in n patches

Real populations do not evolve in isolation and as a result much of ecology is concerned with understanding the characteristics that allow two species to coexist, or one species to take over the habitat of another. It is of fundamental importance to understand what will happen to an invading species. Will it invade successfully or die out in the attempt? If it does invade, will it coexist with the native population? Mathematical models for invasibility have contributed significantly to the understanding of the epidemiology of infectious disease outbreaks (Cross et al. 2005) and ecological processes (Law and Morton 1996; Caswell 2001). There is widespread empirical evidence that heterogeneity, arising from abiotic (precipitation, temperature, sunlight) or biotic (competition, predation) factors, is important in determining invasibility (Davies et al. 2005; Pyšek and Hulme 2005). However, few theoretical studies have investigated this; see, e.g., Schreiber and Lloyd-Smith (2009), Schreiber and Ryan (2011) and Schreiber (2012).

In this paper we have considered the dynamics of one population that disperses through n patches. One possible generalization would be to look at k populations $(X^{1}, \dots, X^{k})$ that compete with each other for resources, have different dispersion strategies and possibly experience the environmental noise differently. Looking at such a model could shed light upon fundamental problems regarding invasions in spatio-temporally heterogeneous environments.

The extension of our results to competition models could lead to the development of a stochastic version of the treatment of the evolution of dispersal developed for patch models in the deterministic setting by Hastings (1983) and Cantrell et al. (2012). In the current paper we have focused on how spatio-temporal variation influences the persistence and extinction of structured populations. In a follow-up paper we intend to look at the dispersal strategies in terms of evolutionarily stable strategies (ESS) which can be characterized by showing that a population having a dispersal strategy $(D_{i j})$ cannot be invaded by any other population having a different dispersal strategy $({\tilde{D}}_{i j})$ . The first thing to check would be whether this model has ESS and, if they exist, whether they are unique. One might even get that there are no ESS in our setting. For example, Schreiber and Li (2011) show that there exist no ESS for periodic non-linear models and instead one gets a coalition of strategies that act as an ESS. We expect to be able to generalize the results of Cantrell et al. (2012) to a stochastic setting using the methods from this paper.

Acknowledgements

We thank Sebastian J. Schreiber and three anonymous referees for their detailed comments which helped improve this manuscript.

Appendix A: The case $r > 0$

The next sequence of lemmas and propositions is used to prove Theorem 2.1. We start by showing that our processes are well-defined Markov processes.

Proposition A.1

The SDE (stochastic differential equation) defined by (2.1) has unique strong solutions $X (t) = (X_{1} (t), \dots, X_{n} (t)), t \geq 0$ for any $x = (x_{1}, \dots, x_{n}) \in R_{+}^{n}$ . Furthermore, $X (t)$ is a strong Markov process with the Feller property, is irreducible on $R_{+}^{n} \setminus \{0\}$ and $P \{X_{i} (t) > 0, t > 0, i = 1, \dots, n\} = 1$ if $X (0) \in R_{+}^{n} \setminus \{0\}$ .

Proof

Since the coefficients of (2.1) are locally Lipschitz, there exists a unique local solution to (2.1) with a given initial value. In other words, for any initial value, there is a stopping time $τ_{e} > 0$ and a process ${(X (t))}_{t \geq 0}$ satisfying (2.1) up to $τ_{e}$ and $lim_{t \to τ_{e}} ‖ X (t) ‖ = \infty$ (see e.g. Khasminskii 2012, Section 3.4). Clearly, if $X (0) = 0$ then $X (t) = 0, t \in [0, τ_{e})$ which implies that $τ_{e} = \infty$ . By a comparison theorem for SDEs (see Geiß and Manthey (1994, Theorem 1.2) and Remark A.2 below),

\begin{matrix} P \{X_{i} (t) < \mathcal{X}_{i} (t), t \in (0, τ_{e}), i = 1, \dots, n\} = 1 if X_{i} (0) = \mathcal{X}_{i} (0) \geq M_{b} \end{matrix}

A.1

where ${(\mathcal{X}_{i} (t))}_{t \geq 0}$ is given by (2.7). Since (2.7) has a global solution due to the Lipschitz property of its coefficients, we have from (A.1) that $τ_{e} = \infty$ almost surely. Define the process

\begin{matrix} d {\bar{X}}_{i} (t) = (- |\frac{3 a_{i}}{2}| {\bar{X}}_{i} (t) + \sum_{j = 1}^{n} D_{j i} {\bar{X}}_{j} (t)) d t + {\bar{X}}_{i} (t) d E_{i} (t), i = 1, \dots, n . \end{matrix}

Since the $b_{i}$ s are continuous and vanish at 0, there exists $r > 0$ such that for $| x | \leq r$ we have

\begin{matrix} - |\frac{3 a_{i}}{2}| \leq a_{i} - b_{i} (x_{i}), i = 1, \dots, n . \end{matrix}

A.2

Let $τ$ be the stopping time

\begin{matrix} τ : = inf \{t : |\bar{X} (t)| > r\} \end{matrix}

A.3

Now, consider the case $X (0) \in R_{+}^{n} \setminus \{0\}$ . By Evans et al. (2013, Proposition 3.1), (A.2), (A.3) and a comparison argument (see Remark A.2 and the proof of Evans et al. (2015, Theorem 4.1)), we can show that

\begin{matrix} P \{X_{i} (t) \geq {\bar{X}}_{i} (t) > 0, t \in (0, τ)\} = 1, \end{matrix}

which implies

\begin{matrix} P \{X_{i} (t) > 0, t \in (0, \infty)\} = 1 for all X (0) \in R_{+}^{n} \setminus \{0\} . \end{matrix}

A.4

Moreover, since $P \{0 \leq X_{i} (t) \leq \mathcal{X}_{i} (t) for all t \geq 0, i = 1, \dots, n\} = 1$ , where $\mathcal{X}$ denotes the solution to (2.7), we can use standard arguments (e.g., Mao 1997, Theorem 2.9.3) to obtain the Feller property of the solution to (2.1). $□$

Remark A.1

There are different possible definitions of “Feller” in the literature. What we mean by Feller is that the semigroup ${(T_{t})}_{t \geq 0}$ of the process maps the set of bounded continuous functions $C_{b} (R_{+}^{n})$ into itself, i.e.,

\begin{matrix} T_{t} (C_{b} (R_{+}^{n})) \subset C_{b} (R_{+}^{n}), t \geq 0 . \end{matrix}

Definition A.1

We call a mapping $f : R^{d} \to R^{d}$ quasi-monotonously increasing, if for $j = 1, \dots, d$

\begin{matrix} f_{j} (x) \leq f_{j} (y), \end{matrix}

whenever $x_{j} = y_{j}$ and $x_{l} \leq y_{l}, l \neq j$ .
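To make Definition A.1 concrete, the following sketch searches numerically for a violation of the defining inequality on random pairs $x \leq y$ with $x_{j} = y_{j}$ . The helper name `is_quasi_monotone`, the two sample drifts, and all numerical values are illustrative assumptions rather than objects from this paper; the linear drift mimics the form $a_{j} x_{j} + \sum_{l} D_{l j} x_{l}$ with nonnegative off-diagonal dispersal rates. A random search can only refute the property, never prove it.

```python
import random


def is_quasi_monotone(f, dim, trials=2000, scale=5.0, seed=0):
    """Randomized search for a violation of Definition A.1.

    For a random coordinate j, draw x <= y componentwise with x_j == y_j and
    check f_j(x) <= f_j(y). Returns False as soon as a violation is found.
    """
    rng = random.Random(seed)
    for _ in range(trials):
        j = rng.randrange(dim)
        x = [rng.uniform(0.0, scale) for _ in range(dim)]
        # y dominates x in every coordinate except j, where they agree.
        y = [xi + rng.uniform(0.0, scale) for xi in x]
        y[j] = x[j]
        if f(x)[j] > f(y)[j] + 1e-12:
            return False
    return True


# Hypothetical two-patch parameters (assumptions for illustration only).
A = [0.5, -0.2]                   # per-patch growth rates a_j
D = [[-1.0, 1.0], [2.0, -2.0]]    # D[l][j]: dispersal rate from patch l to j


def linear_drift(x):
    """Drift a_j x_j + sum_l D[l][j] x_l; off-diagonal rates are >= 0."""
    n = len(x)
    return [A[j] * x[j] + sum(D[l][j] * x[l] for l in range(n)) for j in range(n)]


def competitive_drift(x):
    """Drift with a cross term -x_j x_l, which breaks quasi-monotonicity."""
    return [x[0] * (1.0 - x[1]), x[1] * (1.0 - x[0])]


print(is_quasi_monotone(linear_drift, 2))       # no violation is found
print(is_quasi_monotone(competitive_drift, 2))  # a violation is found
```

The second drift is included only as a contrast: increasing the other coordinate lowers $f_{j}$ , so the inequality in Definition A.1 fails.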

Remark A.2

One often wants to apply the well-known comparison theorem for one-dimensional SDEs (see Ikeda and Watanabe 1989) to a multidimensional setting. Below we explain why we can make use of comparison theorems for stochastic differential equations in our setting. Consider the following two systems

\begin{matrix} d R_{j} (t) = a_{j} (t, R (t)) d t + \sum_{k = 1}^{r} σ_{j k} (t, R (t)) d W_{k} (t) \end{matrix}

A.5

and

\begin{matrix} d S_{j} (t) = b_{j} (t, S (t)) d t + \sum_{k = 1}^{r} σ_{j k} (t, S (t)) d W_{k} (t) \end{matrix}

A.6

for $j = 1, \dots, d, t \geq 0$ together with the initial condition

\begin{matrix} R_{j} (0) \leq S_{j} (0), j = 1, \dots, d P - a.s., \end{matrix}

A.7

where $W = {(W_{1} (t), \dots, W_{r} (t))}_{t \geq 0}$ is an r-dimensional standard Brownian motion, and the coefficients $a_{j}, b_{j}, σ_{j k}$ are continuous mappings on $R_{+} \times R^{d}$ . Suppose (A.5) and (A.6) have explosion times $θ_{R}$ and $θ_{S}$ , respectively.

Let (C0), (C1), and (C2) be the following conditions.

(C0)
The solution to (A.5) is pathwise unique and the drift coefficient a(t, x) is quasi-monotonously (see Definition A.1) increasing with respect to x.
(C1)
For every $t \geq 0, j = 1, \dots, d$ and $x \in R^{d}$ the following inequality holds
$\begin{matrix} a_{j} (t, x) \leq b_{j} (t, x) . \end{matrix}$
(C2)
There exists a strictly increasing function $ρ : R_{+} \to R_{+}$ with $ρ (0) = 0$ and
$\begin{matrix} \int_{0^{+}}^{\infty} \frac{1}{ρ^{2} (u)} d u = \infty \end{matrix}$
such that for each $j = 1, \dots, d$
$\begin{matrix} \sum_{k = 1}^{r} | σ_{j k} (t, x) - σ_{j k} (t, y) | \leq ρ (| x_{j} - y_{j} |) for all t \geq 0, x, y \in R^{d} . \end{matrix}$

Sometimes it is assumed incorrectly that conditions (C1) and (C2) suffice to conclude that $P \{R (t) \leq S (t), t \in [0, θ_{R} \land θ_{S})\} = 1$ . Some illuminating counterexamples regarding this issue can be found in Assing and Manthey (1995, Section 3). However, if in addition to conditions (C1) and (C2), one also has condition (C0), then Geiß and Manthey (1994, Theorem 1.2) indicates that $P \{R (t) \leq S (t), t \in [0, θ_{R} \land θ_{S})\} = 1$ . Note that, in the setting of our paper, the drift coefficient of (2.7) is quasi-monotonously increasing and we can pick $ρ (x) = x, x \in R_{+}$ . Therefore, conditions (C0), (C1), and (C2) hold, which allows us to use the comparison results. In special cases one can prove comparison theorems even when quasi-monotonicity fails; see Evans et al. (2015, Theorem 6.1) and Blath et al. (2007, Corollary A.2).

To proceed, let us recall some technical concepts and results needed to prove the main theorem. Let $Φ = (Φ_{0}, Φ_{1}, \dots)$ be a discrete-time Markov chain on a general state space $(E, \mathcal{E})$ , where $\mathcal{E}$ is a countably generated $σ$ -algebra. Denote by $P$ the Markov transition kernel for $Φ$ . If there is a non-trivial $σ$ -finite positive measure $φ$ on $(E, \mathcal{E})$ such that for any $A \in \mathcal{E}$ satisfying $φ (A) > 0$ we have

\begin{matrix} \sum_{n = 1}^{\infty} P^{n} (x, A) > 0, x \in E \end{matrix}

where $P^{n}$ is the n-step transition kernel of $Φ$ , then the Markov chain $Φ$ is called $φ$ -irreducible. It can be shown (see Nummelin 1984) that if $Φ$ is $φ$ -irreducible, then there exists a positive integer d and disjoint subsets $E_{0}, \dots, E_{d - 1}$ such that for all $i = 0, \dots, d - 1$ and all $x \in E_{i}$ , we have

\begin{matrix} P (x, E_{j}) = 1 where j = i + 1 (mod d) \end{matrix}

and

\begin{matrix} φ (E \setminus \bigcup_{i = 0}^{d - 1} E_{i}) = 0 . \end{matrix}

The smallest positive integer d satisfying the above is called the period of $Φ .$ An aperiodic Markov chain is a chain with period $d = 1$ .
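For intuition, on a finite state space the period of an irreducible chain can be computed as the greatest common divisor of the possible return times to a fixed state. The sketch below does this for small transition matrices; the function name `period`, the search bound, and the example matrices are illustrative assumptions, and the finite-state setting is a simplification of the general state space considered here.

```python
from math import gcd


def period(P, state=0):
    """Period of `state` for a finite transition matrix P (list of rows).

    Computes gcd{n >= 1 : P^n(state, state) > 0} by tracking the set of states
    reachable in exactly n steps, for n up to a generous bound chosen for
    this sketch. Returns 0 if `state` is never revisited within the bound.
    """
    m = len(P)
    reachable = {state}  # states reachable in exactly n steps
    g = 0
    for n in range(1, 2 * m * m + 1):
        reachable = {j for i in reachable for j in range(m) if P[i][j] > 0}
        if state in reachable:
            g = gcd(g, n)
    return g


# Deterministic 3-cycle: returns to a state happen only at multiples of 3.
cycle3 = [[0, 1, 0], [0, 0, 1], [1, 0, 0]]
# Same cycle with a holding probability at state 0, which makes it aperiodic.
lazy = [[0.5, 0.5, 0], [0, 0, 1], [1, 0, 0]]

print(period(cycle3))  # 3
print(period(lazy))    # 1
```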

A set $C \in \mathcal{E}$ is called petite, if there exists a non-negative sequence ${(a_{n})}_{n \in N}$ with $\sum_{n = 1}^{\infty} a_{n} = 1$ and a nontrivial positive measure $ν$ on $(E, \mathcal{E})$ such that

\begin{matrix} \sum_{n = 1}^{\infty} a_{n} P^{n} (x, A) \geq ν (A), x \in C, A \in \mathcal{E} . \end{matrix}

The following theorem is extracted from Jarner and Roberts (2002, Theorem 3.6).

Theorem A.1

Suppose that $Φ$ is irreducible and aperiodic and fix $0 < γ < 1$ . Assume that there exists a petite set $C \subset E$ , positive constants $κ_{1}, κ_{2}$ and a function $V : E \to [1, \infty)$ such that

\begin{matrix} P V \leq V - κ_{1} V^{γ} + κ_{2} 1_{C} . \end{matrix}

Then there exists a probability measure $π$ on $(E, \mathcal{E})$ such that

\begin{matrix} {(n + 1)}^{\frac{γ}{1 - γ}} {‖ P^{n} (x, \cdot) - π (\cdot) ‖}_{T V} \to 0 as n \to \infty for all x \in E . \end{matrix}
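As a quick check of how the drift exponent $γ$ translates into a convergence rate, the following worked arithmetic evaluates $\frac{γ}{1 - γ}$ for the two exponents that appear later in this appendix, namely $γ = \frac{1}{2}$ in (A.51) and $γ = \frac{1}{1 + q}$ in (A.55):

\begin{matrix} γ = \frac{1}{2} : & \frac{γ}{1 - γ} = \frac{1 / 2}{1 / 2} = 1, \\ γ = \frac{1}{1 + q} : & \frac{γ}{1 - γ} = \frac{1 / (1 + q)}{q / (1 + q)} = \frac{1}{q} . \end{matrix}

So a drift condition involving $V^{1 / 2}$ corresponds to the linear rate $n$ , while one involving $V^{1 / (1 + q)}$ corresponds to the polynomial rate $n^{1 / q}$ , which grows without bound as $q$ decreases to 0.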

The next series of lemmas and propositions are used to show that we can construct a function V satisfying the assumptions of Theorem A.1.

Lemma A.1

For any $T > 0$ , there exists an open set $N_{0} \subset R_{+}^{n, \circ}$ such that the Markov chain $\{(Y (k T), S (k T)), k \in N\}$ on $Δ \times (0, \infty)$ is $φ$ -irreducible and aperiodic, where $φ (\cdot) = m (\cdot \cap N_{0})$ and $m (\cdot)$ is Lebesgue measure. Moreover, every compact set $K \subset Δ \times (0, \infty)$ is petite. Similarly, $Δ$ is a petite set of the Markov chain $\{\tilde{Y} (k T), k \in N\}$ .

Proof

To prove this lemma, it is more convenient to work with the process $X (t)$ that lives on $R_{+}^{n} \setminus \{0\}$ . Since ${(X (t))}_{t \geq 0}$ is a nondegenerate diffusion with smooth coefficients in $R_{+}^{n, \circ}$ , by Rey-Bellet (2006, Corollary 7.2), the transition semigroup $P_{X} (t, x, \cdot)$ of ${(X (t))}_{t \geq 0}$ has a smooth, positive density $(0, \infty) \times R_{+}^{2 n, \circ} ∋ (t, x, x^{'}) \mapsto p_{X} (t, x, x^{'}) \in [0, \infty)$ . Fix a point $x_{0} \in R_{+}^{n, \circ}$ . Since $\int_{R_{+}^{n, \circ}} p_{X} (\frac{T}{2}, x_{0}, x) d x = 1$ there exists $x_{1} \in R_{+}^{n, \circ}$ such that $p_{X} (\frac{T}{2}, x_{0}, x_{1}) > 0$ . By the continuity of $p_{X}$ , there exist bounded open sets $N_{0} ∋ x_{0}, N_{1} ∋ x_{1}$ satisfying

\begin{matrix} \hat{p} : = inf \{p_{X} (\frac{T}{2}, x, x^{'}) : x \in N_{0}, x^{'} \in N_{1}\} > 0 . \end{matrix}

A.8

Slightly modifying the proof of Evans et al. (2013, Proposition 3.1) (the part proving the irreducibility of the solution process), we have that ${\tilde{p}}_{x} : = P_{X} (\frac{T}{2}, x, N_{0}) > 0$ for all $x \in R_{+}^{n} \setminus \{0\} .$ Since ${(X (t))}_{t \geq 0}$ has the Feller property, there is a neighborhood $N_{x} ∋ x$ such that

\begin{matrix} P_{X} (\frac{T}{2}, x^{'}, N_{0}) > \frac{{\tilde{p}}_{x}}{2}, x^{'} \in N_{x} . \end{matrix}

A.9

For any compact set $K \subset R_{+}^{n} \setminus \{0\}$ , there are finitely many points $x_{2}, \dots, x_{k} \in K$ such that $K \subset \bigcup_{i = 2}^{k} N_{x_{i}}$ . As a result,

\begin{matrix} P_{X} (\frac{T}{2}, x^{'}, N_{0}) > {\tilde{p}}_{K} : = min \{\frac{{\tilde{p}}_{x_{i}}}{2}, i = 2, \dots, k\}, x^{'} \in K . \end{matrix}

A.10

In view of (A.8), (A.9), and (A.10), an application of the Chapman-Kolmogorov equations yields that for any $x \in K$ and any measurable set $A \subset R_{+}^{n, \circ}$ ,

\begin{matrix} P_{X} (T, x, A) \geq & \int_{N_{0}} P_{X} (\frac{T}{2}, x, d x^{'}) P_{X} (\frac{T}{2}, x^{'}, A) \\ \geq & {\tilde{p}}_{K} \hat{p} m (A \cap N_{1}), \end{matrix}

where $m (\cdot)$ is Lebesgue measure on $R_{+}^{n, \circ}$ . Since the measure $ν (\cdot) = m (\cdot \cap N_{1})$ is non-trivial, we can easily obtain that K is a petite set of the Markov chain $\{X (k T), k \in N\} .$ Moreover, K can be chosen arbitrarily. Hence, for any $x \in R_{+}^{n} \setminus \{0\}$ there is ${\bar{p}}_{x} > 0$ such that

\begin{matrix} P_{X} (T, x, \cdot) \geq {\bar{p}}_{x} m (\cdot \cap N_{1}), \end{matrix}

A.11

which means that $\{X (k T), k \in N\}$ is irreducible.

Suppose that $\{X (k T), k \in N\}$ is not aperiodic. Then there are disjoint subsets of $R_{+}^{n} \setminus \{0\}$ , denoted by $A_{0}, \dots, A_{d - 1}$ with $d > 1$ such that for any $x \in A_{i}$ ,

\begin{matrix} P_{X} (T, x, A_{j}) = 1 where j = i + 1 (mod d) . \end{matrix}

Since $P (T, x, \cdot)$ has a density, $m (A_{i}) > 0$ for $i = 0, \dots, d - 1$ . In view of (A.11), we must have $m (N_{0} \cap A_{i}) = 0$ for any $i = 0, \dots, d - 1$ . This contradicts the fact that

\begin{matrix} m (N_{0} \cap (E \setminus \bigcup_{i = 0}^{d - 1} A_{i})) = 0 . \end{matrix}

This contradiction implies that $\{X (k T), k \in N\}$ is aperiodic. In the same manner, we can prove that $\tilde{Y} (t)$ is irreducible, aperiodic and its state space, $Δ$ , is petite. $□$

Lemma A.2

There exists a positive constant $K_{1}$ such that

\begin{matrix} E S^{y, s} (t) \leq e^{- γ_{b} t} s + K_{1}, (y, s) \in Δ \times (0, \infty), t \geq 0 . \end{matrix}

A.12

Moreover, for any $H > 0, T > 0$ , and $ε > 0$ , there is a $\tilde{k} = \tilde{k} (H, T, ε) > 0$ such that

\begin{matrix} P \{S^{y, s} (t) < \tilde{k}, t \in [0, T]\} > 1 - ε, (y, s) \in Δ \times (0, H] . \end{matrix}

A.13

Proof

In view of (2.2), if $s \geq M_{b}$ then $- {[b (s y)]}^{⊤} y + a^{⊤} y + γ_{b} \leq 0$ . Let

\begin{matrix} {\tilde{K}}_{1} = sup_{y \in Δ, s \leq M_{b}} \{s (- {[b (s y)]}^{⊤} y + a^{⊤} y + γ_{b})\} < \infty . \end{matrix}

For $k \in N$ , define the stopping time

\begin{matrix} η_{k}^{y, s} = inf \{t \geq 0 : S^{y, s} (t) \geq k\} . \end{matrix}

A.14

Dynkin’s formula for the function $f (t, s) : = e^{γ_{b} t} s$ and the bounded stopping time $t \land η_{k}^{y, s}$ yields

\begin{matrix} E [e^{γ_{b} (t \land η_{k}^{y, s})} S^{y, s} (t \land η_{k}^{y, s})] = & s + E \int_{0}^{t \land η_{k}^{y, s}} e^{γ_{b} u} S^{y, s} (u) (γ_{b} + {[a - b (S^{y, s} (u) Y^{y, s} (u))]}^{⊤} Y^{y, s} (u)) d u \\ \leq & s + E \int_{0}^{t \land η_{k}^{y, s}} {\tilde{K}}_{1} e^{γ_{b} u} d u \leq s + \frac{{\tilde{K}}_{1}}{γ_{b}} (e^{γ_{b} t} - 1) . \end{matrix}

A.15

The claim (A.13) follows directly from (A.15). Moreover, by letting $k \to \infty$ in (A.15), we obtain from Fatou’s lemma that

\begin{matrix} E e^{γ_{b} t} S^{y, s} (t) \leq s + K_{1} e^{γ_{b} t} for K_{1} = \frac{{\tilde{K}}_{1}}{γ_{b}}, \end{matrix}

A.16

which implies (A.12). $□$
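The passage from (A.15) to (A.13) is not spelled out; one standard way to fill it in is sketched here, under the assumptions that $γ_{b} > 0$ , that $k > H \geq s$ (so that $S^{y, s} (η_{k}^{y, s}) = k$ by path continuity), and that ${\tilde{K}}_{1} \geq 0$ (which holds because the expression inside the supremum tends to 0 as $s \downarrow 0$ ). Since $S^{y, s} \geq 0$ and $e^{γ_{b} (T \land η_{k}^{y, s})} \geq 1$ , restricting the expectation in (A.15) with $t = T$ to the event $\{η_{k}^{y, s} \leq T\}$ gives

\begin{matrix} k P \{η_{k}^{y, s} \leq T\} \leq E [e^{γ_{b} (T \land η_{k}^{y, s})} S^{y, s} (T \land η_{k}^{y, s})] \leq H + \frac{{\tilde{K}}_{1}}{γ_{b}} (e^{γ_{b} T} - 1) . \end{matrix}

Choosing $\tilde{k} > H$ so large that the right-hand side divided by $\tilde{k}$ is less than $ε$ gives $P \{η_{\tilde{k}}^{y, s} \leq T\} < ε$ , which is (A.13) because $\{η_{\tilde{k}}^{y, s} > T\} = \{S^{y, s} (t) < \tilde{k}, t \in [0, T]\}$ .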

Proposition A.2

For any $ε > 0$ and $T > 0$ , there is a $δ = δ (ε, T) > 0$ such that

\begin{matrix} P \{‖ (Y^{y, s} (t), S^{y, s} (t)) - ({\tilde{Y}}^{y} (t), 0) ‖ \leq ε, 0 \leq t \leq T\} > 1 - ε \end{matrix}

given that $(y, s) \in Δ \times (0, δ) .$

Proof

In view of (A.13), for any $ε > 0, T > 0$ , there is $\tilde{k} = \tilde{k} (ε, T) > 0$ such that

\begin{matrix} P \{η_{\tilde{k}}^{y, s} > T\} \geq 1 - \frac{ε}{3}, (y, s) \in Δ \times (0, ε) \end{matrix}

A.17

where $η_{k}^{y, s}$ is defined by (A.14). Since the coefficients of equation (2.4) are locally Lipschitz, using the arguments from Mao (1997, Lemma 9.4) and noting $S^{y, 0} (t) \equiv 0$ , we obtain for any $(y, s) \in Δ \times (0, ε)$ that

\begin{matrix} E (sup_{0 \leq t \leq T \land η_{\tilde{k}}^{y, s} \land η_{\tilde{k}}^{y, 0}} {∥(Y^{y, s} (t), S^{y, s} (t)) - (Y^{y, 0} (t), 0)∥}^{2}) \leq C s^{2}, \end{matrix}

A.18

where C is a constant that depends on $H, T, \tilde{k}$ . Applying Chebyshev’s inequality to (A.18), there is a $δ \in (0, ε)$ such that for all $(y, s) \in Δ \times (0, δ)$

\begin{matrix} P \{sup_{0 \leq t \leq T \land η_{\tilde{k}}^{y, s} \land η_{\tilde{k}}^{y, 0}} ∥(Y^{y, s} (t), S^{y, s} (t)) - (Y^{y, 0} (t), 0)∥ < ε\} > 1 - \frac{ε}{3} . \end{matrix}

A.19

Combining (A.17) and (A.19) yields

\begin{matrix} P \{sup_{0 \leq t \leq T} ∥(Y^{y, s} (t), S^{y, s} (t)) - (Y^{y, 0} (t), 0)∥ < ε\} > 1 - ε . \end{matrix}

A.20

for any $(y, s) \in Δ \times (0, δ)$ . The desired result is obtained by noting that $Y^{y, 0} (t) = {\tilde{Y}}^{y} (t), t \geq 0$ . $□$
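One way to make the Chebyshev step and the final combination explicit is sketched below; the shorthand $D_{T}$ and the particular bound on $δ$ are assumptions introduced for illustration, and $C$ is the constant from (A.18). Write $D_{T}$ for the supremum of $∥(Y^{y, s} (t), S^{y, s} (t)) - (Y^{y, 0} (t), 0)∥$ over $0 \leq t \leq T \land η_{\tilde{k}}^{y, s}$ , recalling that $η_{\tilde{k}}^{y, 0} = \infty$ because $S^{y, 0} \equiv 0$ . For $s < δ \leq \sqrt{ε^{3} / (3 C)}$ , Chebyshev’s inequality gives

\begin{matrix} P \{D_{T} \geq ε\} \leq \frac{E D_{T}^{2}}{ε^{2}} \leq \frac{C s^{2}}{ε^{2}} < \frac{C δ^{2}}{ε^{2}} \leq \frac{ε}{3} . \end{matrix}

The supremum over $[0, T]$ coincides with $D_{T}$ on the event $\{η_{\tilde{k}}^{y, s} > T\}$ , and (A.13), applied with $H = ε$ and with $ε / 3$ in place of $ε$ , bounds the probability of the complementary event by $ε / 3$ . A union bound then yields

\begin{matrix} P \{sup_{0 \leq t \leq T} ∥(Y^{y, s} (t), S^{y, s} (t)) - (Y^{y, 0} (t), 0)∥ \geq ε\} \leq P \{η_{\tilde{k}}^{y, s} \leq T\} + P \{D_{T} \geq ε\} < \frac{ε}{3} + \frac{ε}{3} < ε . \end{matrix}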

Lemma A.3

There are positive constants $K_{2}$ and $K_{3}$ such that for any $(y, s) \in Δ \times (0, \infty), T \geq 0$ , one has

\begin{matrix} E ({[ln S^{y, s} (T)]}^{2}) \leq ({(ln s)}^{2} + 1) K_{2} exp (K_{3} T) . \end{matrix}

A.21

Proof

In view of Itô’s formula,

\begin{matrix} d {ln}^{2} S (t) = & ({Y (t)}^{⊤} Σ Y (t) + 2 ln S (t) (a^{⊤} Y (t) - {[b (S (t) Y (t))]}^{⊤} Y (t) - \frac{1}{2} {Y (t)}^{⊤} Σ Y (t))) d t \\ & + 2 ln S (t) {Y (t)}^{⊤} d E (t) . \end{matrix}

A.22

Now, we estimate $g (y, s) = y^{⊤} Σ y + 2 ln s (a^{⊤} y - {[b (s y)]}^{⊤} y - \frac{1}{2} y^{⊤} Σ y)$ for $(y, s) \in Δ \times (0, \infty)$ . Let $M_{b}$ be as in (2.2). If $s > M_{b}$ then $ln s > 0$ and $(a^{⊤} y - {[b (s y)]}^{⊤} y - \frac{1}{2} y^{⊤} Σ y) < 0$ . Letting

\begin{matrix} M_{1} : = sup_{{(y, s) \in Δ \times (0, M_{b}]}} \{|a^{⊤} y - {[b (s y)]}^{⊤} y - \frac{1}{2} y^{⊤} Σ y|\} < \infty \end{matrix}

and

\begin{matrix} ‖ Σ ‖ : = sup \{y^{⊤} Σ y : y \in Δ\}, \end{matrix}

we have

\begin{matrix} g (y, s) \leq ‖ Σ ‖ + 2 M_{1} | ln s | \leq M_{1} {ln}^{2} s + 2 M_{1} + ‖ Σ ‖ for all (y, s) \in Δ \times (0, \infty) . \end{matrix}

With this estimate, we can apply Dynkin’s formula to (A.22) and use standard arguments (e.g., Mao 1997, Theorem 2.4.1) to obtain

\begin{matrix} E ({[ln S^{y, s} (T)]}^{2}) \leq ({(ln s)}^{2} + 1) K_{2} exp (K_{3} T) for all (y, s) \in Δ \times (0, \infty) \end{matrix}

for some positive constants $K_{2}$ and $K_{3}$ . $□$

Lemma A.4

There is a positive constant $K_{4}$ such that for any $(y, s) \in Δ \times (0, 1)$ , $T \geq 0$ , and $A \in \mathcal{F}$ ,

\begin{matrix} E ({[ln S^{y, s} (T \land ζ^{y, s})]}_{-}^{2} 1_{A}) \leq {[ln s]}_{-}^{2} P (A) + K_{4} \sqrt{P (A)} (T + 1) {[ln s]}_{-} + K_{4} T^{2}, \end{matrix}

A.23

where ${[ln x]}_{-} : = max \{0, - ln x\},$ and

\begin{matrix} ζ^{y, s} : = inf \{t \geq 0 : S^{y, s} (t) = 1\} . \end{matrix}

A.24

Proof

Let

\begin{matrix} M_{2} = sup_{{(y, s) \in Δ \times (0, 1)}} \{- a^{⊤} y + \frac{1}{2} y^{⊤} Σ y + {[b (s y)]}^{⊤} y\} < \infty . \end{matrix}

Using Itô’s formula,

\begin{matrix} - ln S^{y, s} (T \land ζ^{y, s}) = & - ln s - M^{y, s} (T \land ζ^{y, s}) \\ & + \int_{0}^{T \land ζ^{y, s}} (- a^{⊤} Y^{y, s} (t) + {[b (S^{y, s} (t) Y^{y, s} (t))]}^{⊤} Y^{y, s} (t) + \frac{1}{2} {Y^{y, s} (t)}^{⊤} Σ Y^{y, s} (t)) d t \\ \leq & {[ln s]}_{-} + M_{2} T + | M^{y, s} (T \land ζ^{y, s}) |, \end{matrix}

A.25

where

\begin{matrix} M^{y, s} (t) = \int_{0}^{t} {Y^{y, s} (u)}^{⊤} d E (u) = \int_{0}^{t} {Y^{y, s} (u)}^{⊤} Γ^{⊤} d B (u) . \end{matrix}

A.26

It follows from (A.25) that

\begin{matrix} {[ln S^{y, s} (T \land ζ^{y, s})]}_{-}^{2} 1_{A} \leq & {({[ln s]}_{-} + M_{2} T + | M^{y, s} (T \land ζ^{y, s}) |)}^{2} 1_{A} \\ \leq & ({[ln s]}_{-}^{2} + 2 (M_{2} T + | M^{y, s} (T \land ζ^{y, s}) |) {[ln s]}_{-}) 1_{A} \\ + & (2 {(M_{2} T)}^{2} + 2 {| M^{y, s} (T \land ζ^{y, s}) |}^{2}) 1_{A} \end{matrix}

A.27

An application of Itô’s isometry yields

\begin{matrix} E [{| M^{y, s} (T \land ζ^{y, s}) |}^{2} 1_{A}] \leq E {| M^{y, s} (T \land ζ^{y, s}) |}^{2} \leq ‖ Σ ‖ T . \end{matrix}

A.28

By a straightforward use of Hölder’s inequality and (A.28),

\begin{matrix} E [| M^{y, s} (T \land ζ^{y, s}) | 1_{A}] \leq & {(P (A) E {| M^{y, s} (T \land ζ^{y, s}) |}^{2})}^{1 / 2} \\ \leq & \sqrt{P (A)} \sqrt{‖ Σ ‖ T} \leq \sqrt{P (A) ‖ Σ ‖} (T + 1) . \end{matrix}

A.29

Taking expectation on both sides of (A.27) and using the estimates from (A.28) and (A.29), we have

\begin{matrix} E [{[ln S^{y, s} (T \land ζ^{y, s})]}_{-}^{2} 1_{A}] \leq {[ln s]}_{-}^{2} P (A) + K_{4} (T + 1) \sqrt{P (A)} {[ln s]}_{-} + K_{4} T^{2}, \end{matrix}

for some positive constant $K_{4} .$ $□$

Let $M_{3}$ be a positive constant such that

\begin{matrix} |(a^{⊤} y - \frac{1}{2} y^{⊤} Σ y) - (a^{⊤} y^{'} - \frac{1}{2} {y^{'}}^{⊤} Σ y^{'})| \leq M_{3} ‖ y^{'} - y ‖, y, y^{'} \in Δ . \end{matrix}

A.30

From now on, we assume that $ε \in (0, 1)$ is chosen small enough to satisfy the following

\begin{matrix} (M_{3} + 2) ε + sup_{{0 \leq s \leq ε, y \in Δ}} \{b {(s y)}^{⊤} y\} < \frac{r}{4}, \\ - \frac{3 r}{2} (1 - 3 ε) + 2 K_{4} \sqrt{ε} < - r . \end{matrix}

A.31

Lemma A.5

For $ε$ satisfying (A.31), there is $δ (ε) = δ \in (0, 1)$ and $T^{*} (ε) = T^{*} > 1$ such that

\begin{matrix} P \{ln s + \frac{3 r T^{*}}{4} \leq ln S^{y, s} (T^{*}) < 0\} \geq 1 - 3 ε \end{matrix}

A.32

for all $(y, s) \in Δ \times (0, δ) .$

Proof

Since $Δ$ is a petite set of $\{\tilde{Y} (t) : t \geq 0\}$ , in view of Meyn and Tweedie (1993, Theorem 6.1), there are $γ_{1} > 0$ and $γ_{2} > 0$ such that

\begin{matrix} ‖ \tilde{P} (t, y, \cdot) - ν^{*} ‖_{T V} \leq γ_{1} exp (- γ_{2} t), y \in Δ, t \in [0, \infty) . \end{matrix}

A.33

where $\tilde{P} (t, y, \cdot)$ is the transition probability of $\{\tilde{Y} (t) : t \geq 0\}$ . Let $M_{4} = max_{y \in Δ} \{| a^{⊤} y - \frac{1}{2} y^{⊤} Σ y |\} < \infty .$ In view of (2.8) and (A.33), we have

\begin{matrix} \frac{1}{T} & | E \int_{0}^{T} (a^{⊤} {\tilde{Y}}^{y} (t) - \frac{1}{2} {{\tilde{Y}}^{y} (t)}^{⊤} Σ {\tilde{Y}}^{y} (t)) d t - r T | \\ \leq & \frac{1}{T} \int_{0}^{T} | \int_{Δ} (a^{⊤} y^{'} - \frac{1}{2} {y^{'}}^{⊤} Σ y^{'}) (\tilde{P} (t, y, d y^{'}) - ν^{*} (d y^{'})) | d t \\ \leq & \frac{M_{4}}{T} \int_{0}^{T} {‖ \tilde{P} (t, y, \cdot) - ν^{*} ‖}_{T V} d t \leq \frac{M_{4} γ_{1}}{γ_{2} T} . \end{matrix}

A.34

Moreover, letting $M^{y, s} (T)$ be defined as in (A.26), we have from Itô’s isometry that

\begin{matrix} E {[\frac{M^{y, s} (T)}{T}]}^{2} = \frac{1}{T^{2}} E \int_{0}^{T} {Y^{y, s} (t)}^{⊤} Σ Y^{y, s} (t) d t \leq \frac{‖ Σ ‖}{T} . \end{matrix}

A.35

With standard estimation techniques, it follows from (A.34) and (A.35) that for any $ε > 0$ , there is a $T^{*} = T^{*} (ε)$ such that

\begin{matrix} P \{|\frac{1}{T^{*}} \int_{0}^{T^{*}} (a^{⊤} {\tilde{Y}}^{y} (t) - \frac{1}{2} {{\tilde{Y}}^{y} (t)}^{⊤} Σ {\tilde{Y}}^{y} (t)) d t - r| < ε\} > 1 - ε, y \in Δ, \end{matrix}

A.36

and

\begin{matrix} P \{|\frac{M^{y, s} (T^{*})}{T^{*}}| < ε\} > 1 - ε, (y, s) \in Δ \times (0, \infty) . \end{matrix}

A.37

By virtue of Proposition A.2, (A.30), and (A.36), there is $δ = δ (ε, T^{*}) \in (0, ε)$ such that

\begin{matrix} P (Ω_{1}^{y, s}) > 1 - 2 ε, (y, s) \in Δ \times (0, δ) \end{matrix}

where

\begin{matrix} Ω_{1}^{y, s} : = & \{\int_{0}^{T^{*}} (a^{⊤} Y^{y, s} (t) - \frac{1}{2} {Y^{y, s} (t)}^{⊤} Σ Y^{y, s} (t)) d t > T^{*} (r - (M_{3} + 1) ε)\} \\ \cap \{S^{y, s} (t) < ε, t \in [0, T^{*}]\} . \end{matrix}

Using $y^{⊤} b (s y) < \frac{r}{4}$ for all $(y, s) \in Δ \times (0, ε)$ from (A.31) we have that on the set $Ω_{2}^{y, s} : = Ω_{1}^{y, s} \cap \{|\frac{M^{y, s} (T^{*})}{T^{*}}| < ε\}$ the following holds

\begin{matrix} 0 > ln ε \geq ln S^{y, s} (T^{*}) = & ln s + M^{y, s} (T^{*}) - \int_{0}^{T^{*}} {Y^{y, s} (t)}^{⊤} b (S^{y, s} (t) Y^{y, s} (t)) d t \\ & + \int_{0}^{T^{*}} (a^{⊤} Y^{y, s} (t) - \frac{1}{2} {Y^{y, s} (t)}^{⊤} Σ Y^{y, s} (t)) d t \\ \geq & ln s + (r - (M_{3} + 2) ε - sup_{{0 \leq s \leq ε, y \in Δ}} \{b {(s y)}^{⊤} y\}) T^{*} \\ \geq & ln s + \frac{3 r}{4} T^{*} . \end{matrix}

A.38

Noting

\begin{matrix} P (Ω_{2}^{y, s}) \geq 1 - 3 ε for all (y, s) \in Δ \times (0, δ), \end{matrix}

the proof is complete. $□$

Proposition A.3

Assume $r > 0$ . Let $δ$ and $T^{*}$ be as in Lemma A.5. There exists a positive constant $K^{*} = K^{*} (δ, T^{*})$ such that

\begin{matrix} E {[ln S^{y, s} (T^{*})]}_{-}^{2} \leq {[ln s]}_{-}^{2} - r T^{*} {[ln s]}_{-} + K^{*} \end{matrix}

A.39

for any $(y, s) \in Δ \times (0, \infty) .$

Proof

We look at three cases of the initial data $(y, s)$ .

Case I $s \in (0, δ)$ . We have from Lemma A.5 that $P (Ω_{2}^{y, s}) \geq 1 - 3 ε$ where $Ω_{2}^{y, s}$ is defined as in the proof of Lemma A.5. On $Ω_{2}^{y, s}$ , we have

\begin{matrix} - ln s - \frac{3 r T^{*}}{4} \geq - ln S^{y, s} (T^{*}) > 0 . \end{matrix}

Hence,

\begin{matrix} 0 \leq {[ln S^{y, s} (T^{*})]}_{-} \leq {[ln s]}_{-} - \frac{3 r T^{*}}{4} . \end{matrix}

Squaring both sides yields

\begin{matrix} {[ln S^{y, s} (T^{*})]}_{-}^{2} \leq {[ln s]}_{-}^{2} - \frac{3 r T^{*}}{2} {[ln s]}_{-} + \frac{9 r^{2} {T^{*}}^{2}}{16}, \end{matrix}

which implies that

\begin{matrix} E [1_{Ω_{2}^{y, s}} {[ln S^{y, s} (T^{*})]}_{-}^{2}] \leq & P (Ω_{2}^{y, s}) {[ln s]}_{-}^{2} - \frac{3 r T^{*}}{2} P (Ω_{2}^{y, s}) {[ln s]}_{-} \\ + \frac{9 r^{2} {T^{*}}^{2}}{16} P (Ω_{2}^{y, s}) . \end{matrix}

A.40

On $Ω_{3}^{y, s} : = \{ζ^{y, s} < T^{*}\}$ with $ζ^{y, s}$ defined in (A.24), since $ln S^{y, s} (ζ^{y, s}) = 0$ , we have from Lemma A.3 and the strong Markov property of $(Y (t), S (t))$ that

\begin{matrix} E [1_{Ω_{3}^{y, s}} {[ln S^{y, s} (T^{*})]}_{-}^{2}] \leq P (Ω_{3}^{y, s}) K_{2} exp (K_{3} T^{*}) . \end{matrix}

A.41

On the set $Ω_{4}^{y, s} : = Ω \setminus (Ω_{2}^{y, s} \cup Ω_{3}^{y, s})$ , applying Lemma A.4 and noting that $ζ^{y, s} \geq T^{*}$ in $Ω_{4}^{y, s}$ and $T^{*} > 1$ , we obtain

\begin{matrix} E [1_{Ω_{4}^{y, s}} {[ln S^{y, s} (T^{*})]}_{-}^{2}] \leq & {[ln s]}_{-}^{2} P (Ω_{4}^{y, s}) \\ + 2 K_{4} T^{*} \sqrt{P (Ω_{4}^{y, s})} {[ln s]}_{-} + K_{4} {T^{*}}^{2} . \end{matrix}

A.42

Adding (A.40), (A.41), and (A.42) side by side, we get

\begin{matrix} E {[ln S^{y, s} (T^{*})]}_{-}^{2} \leq & {[ln s]}_{-}^{2} + (- \frac{3 r}{2} (1 - 3 ε) + 2 K_{4} \sqrt{ε}) T^{*} {[ln s]}_{-} + K_{5}^{*} (T^{*}) \\ \leq & {[ln s]}_{-}^{2} - r T^{*} {[ln s]}_{-} + K_{5}^{*} (T^{*}), \end{matrix}

A.43

where $K_{5}^{*} (T^{*})$ is a positive constant independent of $(y, s) \in Δ \times (0, δ)$ .

Case II $s \in [δ, 1]$ . We have from Lemma A.3 that

\begin{matrix} E {[ln S^{y, s} (T^{*})]}_{-}^{2} \leq & E {[ln S^{y, s} (T^{*})]}^{2} \leq ({[ln s]}^{2} + 1) K_{2} exp (K_{3} T^{*}) \\ \leq & ({[ln δ]}^{2} + 1) K_{2} exp (K_{3} T^{*}) . \end{matrix}

A.44

Case III $s \in (1, \infty)$ . Note that if $ζ^{y, s} > T^{*}$ , then ${[ln S^{y, s} (T^{*})]}_{-}^{2} = 0$ . Thus, using Lemma A.3 and the strong Markov property of $(Y (t), S (t))$ once more, we obtain

\begin{matrix} E {[ln S^{y, s} (T^{*})]}_{-}^{2} = E (1_{\{ζ^{y, s} < T^{*}\}} {[ln S^{y, s} (T^{*})]}_{-}^{2}) \leq K_{2} exp (K_{3} T^{*}) . \end{matrix}

A.45

Combining (A.43), (A.44), and (A.45), and setting $K^{*} = max \{K_{5}^{*} (T^{*}), ({[ln δ]}^{2} + 1) K_{2} exp (K_{3} T^{*})\}$ , the proof is concluded. $□$

Theorem A.2

Suppose that Assumptions 2.2 and 2.3 hold and that $r > 0$ . Let $P (t, (y, s), \cdot)$ be the semigroup of the process ${(Y (t), S (t))}_{t \geq 0}$ . Then, there exists an invariant probability measure $π^{*}$ of the process ${(Y (t), S (t))}_{t \geq 0}$ on $Δ \times (0, \infty)$ . Moreover, $π^{*} (Δ^{\circ} \times (0, \infty)) = 1$ , $π^{*}$ is absolutely continuous with respect to the Lebesgue measure on $Δ \times (0, \infty)$ and

\begin{matrix} lim_{t \to \infty} t^{q^{*}} {‖ P (t, (y, s), \cdot) - π^{*} (\cdot) ‖}_{T V} = 0, (y, s) \in Δ^{\circ} \times (0, \infty), \end{matrix}

A.46

where ${‖ \cdot ‖}_{T V}$ is the total variation norm and $q^{*}$ is any positive number. In addition, for any initial value $(y, s) \in Δ \times (0, \infty)$ and any $π^{*}$ -integrable function f, we have

\begin{matrix} P \{lim_{T \to \infty} \frac{1}{T} \int_{0}^{T} f (Y^{y, s} (t), S^{y, s} (t)) d t \\ = \int_{Δ^{\circ} \times (0, \infty)} f (y^{'}, s^{'}) π^{*} (d y^{'}, d s^{'})\} = 1 . \end{matrix}

A.47

Proof

By virtue of Lemma A.2, there is an $h_{1} : = 1 - exp (- γ_{b} T^{*}) > 0$ satisfying

\begin{matrix} E S^{y, s} (T^{*}) + 1 \leq & s + 1 - h_{1} s + K_{1} \leq s + 1 - h_{1} \sqrt{s + 1} + K_{1} + h_{1}, \\ & (y, s) \in Δ \times (0, \infty) . \end{matrix}

A.48

Let $V (s) = s + 1 + {[ln s]}_{-}^{2}$ . In view of Proposition A.3 and (A.48),

\begin{matrix} E V (S^{y, s} (T^{*})) \leq & s + 1 + {[ln s]}_{-}^{2} - h_{2} (\sqrt{s + 1} + {[ln s]}_{-}) + H_{2} \\ \leq & V (s) - \frac{h_{2}}{2} \sqrt{V (s)} + H_{2} for all (y, s) \in Δ \times (0, \infty), \end{matrix}

A.49

where $h_{2} = min \{h_{1}, r T^{*}\}, H_{2} = K^{*} + h_{1} + K_{1}$ , with $K^{*}$ as in Proposition A.3. Let $κ > 1$ be such that

\begin{matrix} \frac{h_{2}}{4} \sqrt{V (s)} \geq H_{2} for all s \notin [κ^{- 1}, κ] . \end{matrix}

A.50

Combining (A.49) and (A.50), we arrive at

\begin{matrix} E V (S^{y, s} (T^{*})) \leq & V (s) - \frac{h_{2}}{4} \sqrt{V (s)} + H_{2} 1_{{(y, s) \in Δ \times [κ^{- 1}, κ]}} \\ for all (y, s) \in Δ \times (0, \infty) . \end{matrix}

A.51
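The step from (A.48) and Proposition A.3 to (A.49) rests on two elementary inequalities, $s \geq \sqrt{s + 1} - 1$ and $\sqrt{s + 1} + {[ln s]}_{-} \geq \sqrt{V (s)}$ for $s > 0$ . The sketch below checks both numerically on a logarithmic grid. The function names, the tolerance and the grid are illustrative assumptions, and a numerical check is of course not a proof (both inequalities follow by squaring).

```python
import math


def neg_log(s):
    """[ln s]_- = max(0, -ln s)."""
    return max(0.0, -math.log(s))


def V(s):
    """Lyapunov function V(s) = s + 1 + [ln s]_-^2."""
    return s + 1.0 + neg_log(s) ** 2


def check_inequalities(grid, tol=1e-12):
    """Return True if both elementary inequalities hold at every grid point."""
    for s in grid:
        first = s >= math.sqrt(s + 1.0) - 1.0 - tol
        second = math.sqrt(s + 1.0) + neg_log(s) >= math.sqrt(V(s)) - tol
        if not (first and second):
            return False
    return True


# Logarithmic grid from 1e-8 to 1e8 (an arbitrary illustrative range).
grid = [10.0 ** (k / 10.0) for k in range(-80, 81)]
print(check_inequalities(grid))
```

The first inequality turns $- h_{1} s$ into $- h_{1} \sqrt{s + 1} + h_{1}$ , and the second turns the sum $\sqrt{s + 1} + {[ln s]}_{-}$ into a lower bound in terms of $\sqrt{V (s)}$ .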

Using the estimate (A.51), Lemma A.1, and Theorem A.1, the Markov chain ${(Y (k T^{*}), S (k T^{*}))}_{k \geq 0}$ has a unique invariant probability measure $π^{*}$ and

\begin{matrix} k ‖ P (k T^{*}, (y, s), \cdot) - π^{*} ‖_{T V} \to 0 as k \to \infty . \end{matrix}

A.52

As a direct consequence, for fixed $y_{0}, s_{0}$ , the family ${P (k T^{*}, (y_{0}, s_{0}), \cdot), k \in N}$ is tight, that is, for any $θ > 0$ , there is a compact set $K_{θ} \subset Δ \times (0, \infty)$ such that

\begin{matrix} P (k T^{*}, (y_{0}, s_{0}), K_{θ}) > 1 - θ for all k \in N . \end{matrix}

A.53

Since $s^{2} + {ln}^{2} s \to \infty$ as $s \to 0$ or $s \to \infty$ , in view of Lemmas A.2 and A.3 and a standard estimate, there is a $κ_{θ} > 1$ such that

\begin{matrix} P \{S^{y, s} (t) \in [κ_{θ}^{- 1}, κ_{θ}]\} > 1 - θ, for all (y, s) \in K_{θ}, t \in [0, T^{*}], \end{matrix}

or equivalently,

\begin{matrix} P (t, (y, s), Δ \times [κ_{θ}^{- 1}, κ_{θ}]) > 1 - θ for all (y, s) \in K_{θ}, t \in [0, T^{*}] . \end{matrix}

A.54

Using the Chapman-Kolmogorov relation together with (A.53) and (A.54) yields

\begin{matrix} P (u, (y_{0}, s_{0}), Δ \times [κ_{θ}^{- 1}, κ_{θ}]) > 1 - 2 θ for all u \geq 0, \end{matrix}

which implies that the family of empirical measures $\{\frac{1}{T} \int_{0}^{T} P (u, (y_{0}, s_{0}), \cdot) d u, T > 0\}$ is tight in $Δ \times (0, \infty)$ . Thus $(Y (t), S (t))$ has an invariant probability measure $π_{*}$ on $Δ \times (0, \infty)$ (see e.g., Evans et al. 2015, Proposition 6.4). As a result, the Markov chain ${(Y (k T^{*}), S (k T^{*}))}_{k \in N}$ has an invariant probability measure $π_{*}$ . In view of (A.52), $π_{*}$ must coincide with $π^{*}$ . Thus, $π^{*}$ is an invariant probability measure of the process ${(Y (t), S (t))}_{t \geq 0}$ on $Δ \times (0, \infty)$ .

In the proofs, we used the function ${[ln s]}_{-}^{2}$ for the sake of simplicity. In fact, we can treat ${[ln s]}_{-}^{1 + q}$ for any small $q \in (0, 1)$ in the same manner. We can show that there are $h_{q}, H_{q} > 0$ , and a compact set $K_{q} \subset Δ \times (0, \infty)$ satisfying

\begin{matrix} E V_{q} (S^{y, s} (T^{*})) \leq & V_{q} (s) - h_{q} {[V_{q} (s)]}^{\frac{1}{1 + q}} + H_{q} 1_{{(y, s) \in K_{q}}}, \\ (y, s) \in Δ \times (0, \infty), \end{matrix}

A.55

where $V_{q} (s) : = s + 1 + {[ln s]}_{-}^{1 + q} .$ Then applying Theorem A.1, we obtain

\begin{matrix} k^{1 / q} {‖ P (k T^{*}, (y, s), \cdot) - π^{*} ‖}_{T V} \to 0 as k \to \infty . \end{matrix}

A.56

Let $f : Δ \times (0, \infty) \to [- 1, 1]$ be a measurable function. Since $π^{*}$ is an invariant measure, for any $u \geq 0$ ,

\begin{matrix} \int_{Δ \times (0, \infty)} f (y^{'}, s^{'}) π^{*} (d y^{'}, d s^{'}) \\ = \int_{Δ \times (0, \infty)} π^{*} (d y_{1}, d s_{1}) \int_{Δ \times (0, \infty)} P (u, (y_{1}, s_{1}), d y^{'}, d s^{'}) f (y^{'}, s^{'}) . \end{matrix}

Using this equality and the Chapman–Kolmogorov equation, we have

\begin{matrix} | \int_{Δ \times (0, \infty)} f (y^{'}, s^{'}) (P (t + u, (y, s), d y^{'}, d s^{'}) - π^{*} (d y^{'}, d s^{'})) | \\ = | \int_{Δ \times (0, \infty)} (P (t, (y, s), d y_{1}, d s_{1}) - π^{*} (d y_{1}, d s_{1})) \int_{Δ \times (0, \infty)} f (y^{'}, s^{'}) P (u, (y_{1}, s_{1}), d y^{'}, d s^{'}) | \\ \leq {‖ P (t, (y, s), \cdot) - π^{*} ‖}_{T V} \\ (since | \int_{Δ \times (0, \infty)} f (y^{'}, s^{'}) P (u, (y_{1}, s_{1}), d y^{'}, d s^{'}) | \leq 1 for all y_{1}, s_{1}), \end{matrix}

which means that ${‖ P (t, (y, s), \cdot) - π^{*} ‖}_{T V}$ is decreasing in t. As a result, we deduce from (A.56) that

\begin{matrix} t^{q^{*}} {‖ P (t, (y, s), \cdot) - π^{*} ‖}_{T V} \to 0 as t \to \infty, \end{matrix}

where $q^{*} = 1 / q \in (1, \infty)$ .
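The passage from the skeleton times $k T^{*}$ to continuous time can be spelled out as follows; this is a sketch of a standard interpolation argument that uses only the monotonicity in t established above and the discrete-time rate (A.56). For $t \in [k T^{*}, (k + 1) T^{*})$ with $k \geq 1$ , and using $k + 1 \leq 2 k$ ,

\begin{matrix} t^{q^{*}} {‖ P (t, (y, s), \cdot) - π^{*} ‖}_{T V} \leq & {((k + 1) T^{*})}^{q^{*}} {‖ P (k T^{*}, (y, s), \cdot) - π^{*} ‖}_{T V} \\ \leq & {(2 T^{*})}^{q^{*}} k^{q^{*}} {‖ P (k T^{*}, (y, s), \cdot) - π^{*} ‖}_{T V}, \end{matrix}

and the right-hand side tends to 0 as $k \to \infty$ by (A.56), since $q^{*} = 1 / q$ .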

In view of Proposition A.1, for any $t > 0$ , $P \{Y^{y, s} (t) \in Δ^{\circ}\} = 1$ . Thus,

\begin{matrix} π^{*} (Δ^{\circ} \times (0, \infty)) = \int_{Δ \times (0, \infty)} P \{Y^{y, s} (t) \in Δ^{\circ}\} π^{*} (d y, d s) = π^{*} (Δ \times (0, \infty)) = 1 . \end{matrix}

By Kallenberg (2002, Theorem 20.17), our process ${(Y (t), S (t))}_{t \geq 0}$ is either Harris recurrent or uniformly transient on $Δ^{\circ} \times (0, \infty)$ . Using Kallenberg (2002, Theorem 20.21), our process cannot be uniformly transient and also have an invariant probability measure. Therefore, our process is Harris recurrent. Kallenberg (2002, Theorem 20.17) further indicates that any Harris recurrent Feller process on $Δ^{\circ} \times (0, \infty)$ with strictly positive transition densities has a locally finite invariant measure that is equivalent to Lebesgue measure and is unique up to normalization. Since we already know that ${(Y (t), S (t))}_{t \geq 0}$ has a unique invariant probability measure, this probability measure has an almost everywhere strictly positive density with respect to the Lebesgue measure. $□$

Appendix B: The case $r < 0$

Theorem B.1

Suppose that $r < 0$ . Then for any $i = 1, \dots, n$ and any $x = (x_{1}, \dots, x_{n}) \in R_{+}^{n}$ ,

\begin{matrix} P \{lim_{t \to \infty} \frac{ln X_{i}^{x} (t)}{t} = r\} = 1 . \end{matrix}

B.1

In particular, for any $i = 1, \dots, n$ and any $x = (x_{1}, \dots, x_{n}) \in R_{+}^{n}$

\begin{matrix} P \{lim_{t \to \infty} X_{i}^{x} (t) = 0\} = 1 . \end{matrix}

Proof

Let $θ > 0$ and ${\overset{ˇ}{a}}_{i} = a_{i} + θ$ , and define the process ${\overset{ˇ}{X}}^{x} (t) = ({\overset{ˇ}{X}}_{1}^{x} (t), \dots, {\overset{ˇ}{X}}_{n}^{x} (t))$ as the solution to

\begin{matrix} d {\overset{ˇ}{X}}_{i} (t) = ({\overset{ˇ}{X}}_{i} (t) ({\overset{ˇ}{a}}_{i}) + \sum_{j = 1}^{n} D_{j i} {\overset{ˇ}{X}}_{j} (t)) d t + {\overset{ˇ}{X}}_{i} (t) d E_{i} (t), i = 1, \dots, n \end{matrix}

B.2

started at $x = (x_{1}, \dots, x_{n}) \in R_{+}^{n}$ . Letting $\overset{ˇ}{S} (t) = \sum {\overset{ˇ}{X}}_{i} (t)$ and $\overset{ˇ}{Y} (t) = \frac{\overset{ˇ}{X} (t)}{\overset{ˇ}{S} (t)}$ , we have

\begin{matrix} \begin{matrix} d \overset{ˇ}{Y} (t) = & (diag (\overset{ˇ}{Y} (t)) - \overset{ˇ}{Y} (t) {\overset{ˇ}{Y}}^{⊤} (t)) Γ^{⊤} d B (t) \\ + D^{⊤} \overset{ˇ}{Y} (t) d t + (diag (\overset{ˇ}{Y} (t)) - \overset{ˇ}{Y} (t) {\overset{ˇ}{Y}}^{⊤} (t)) (\overset{ˇ}{a} - Σ \overset{ˇ}{Y} (t)) d t \\ d ln \overset{ˇ}{S} (t) = & ({\overset{ˇ}{a}}^{⊤} \overset{ˇ}{Y} (t) - \frac{1}{2} {\overset{ˇ}{Y} (t)}^{⊤} Σ \overset{ˇ}{Y} (t)) d t + {\overset{ˇ}{Y} (t)}^{⊤} Γ^{⊤} d B (t) \end{matrix} \end{matrix}

B.3

Let $({\overset{ˇ}{Y}}^{y} (t), {\overset{ˇ}{S}}^{y, s} (t))$ be the solution to (B.3) with initial condition $(y, s)$ . Note that ${\overset{ˇ}{Y}}^{y} (t)$ does not depend on s. First, fix $y_{0} \in Δ$ . We have that

\begin{matrix} lim_{t \to \infty} \frac{1}{t} (\int_{0}^{t} ({\overset{ˇ}{a}}^{⊤} {\overset{ˇ}{Y}}^{y_{0}} (u) - \frac{1}{2} {{\overset{ˇ}{Y}}^{y_{0}} (u)}^{⊤} Σ {\overset{ˇ}{Y}}^{y_{0}} (u)) d u + \int_{0}^{t} {{\overset{ˇ}{Y}}^{y_{0}} (u)}^{⊤} Γ^{⊤} d B (u)) \\ = \overset{ˇ}{r} : = \int_{Δ} ({\overset{ˇ}{a}}^{⊤} y - \frac{1}{2} y^{⊤} Σ y) {\overset{ˇ}{ν}}^{*} (d y), P - a.s., \end{matrix}

B.4

where ${\overset{ˇ}{ν}}^{*}$ is the unique invariant probability measure of ${(\overset{ˇ}{Y} (t))}_{t \geq 0}$ . By the continuous dependence of r on the coefficients (established in Proposition D.1), there is $θ > 0$ such that $\overset{ˇ}{r} < \frac{r}{2} < 0$ . Let $δ > 0$ be such that $sup \{- b_{i} (x) : x < δ, i = 1, \dots, n\} < θ$ (this is possible since the $b_{i}$ ’s are continuous and vanish at 0). Because $\overset{ˇ}{r} < 0$ , it follows from (B.4) that

\begin{matrix} sup_{t \in [0, \infty)} (\int_{0}^{t} ({\overset{ˇ}{a}}^{⊤} {\overset{ˇ}{Y}}^{y_{0}} (s) - \frac{1}{2} {{\overset{ˇ}{Y}}^{y_{0}} (s)}^{⊤} Σ {\overset{ˇ}{Y}}^{y_{0}} (s)) d s \\ + \int_{0}^{t} {{\overset{ˇ}{Y}}^{y_{0}} (s)}^{⊤} Γ^{⊤} d B (s)) < \infty P - a.s. \end{matrix}

As a result, for any $ε > 0$ , there is an $H_{ε} > 0$ satisfying

\begin{matrix} P \{sup_{t \in [0, \infty)} (\int_{0}^{t} ({\overset{ˇ}{a}}^{⊤} {\overset{ˇ}{Y}}^{y_{0}} (u) - \frac{1}{2} {{\overset{ˇ}{Y}}^{y_{0}} (u)}^{⊤} Σ {\overset{ˇ}{Y}}^{y_{0}} (u)) d u \\ + \int_{0}^{t} {{\overset{ˇ}{Y}}^{y_{0}} (u)}^{⊤} Γ^{⊤} d B (u)) < H_{ε}\} > 1 - ε, \end{matrix}

which combined with (B.3) implies that

\begin{matrix} P \{sup_{t \in [0, \infty)} {\overset{ˇ}{S}}^{y_{0}, s_{0}} (t) < δ\} > 1 - ε if s_{0} < δ exp (- H_{ε}) . \end{matrix}

B.5

Then, a comparison argument shows (see Remark A.2) that for $x_{0} = s_{0} y_{0} \in R_{+}^{n}$ and $i = 1, \dots, n$

\begin{matrix} P \{X_{i}^{x_{0}} (t) \leq {\overset{ˇ}{X}}_{i}^{x_{0}} (t), t \in [0, ξ^{x_{0}})\} = 1 \end{matrix}

B.6

where $ξ^{x_{0}} = inf {t \geq 0 : \sum_{i = 1}^{n} {\overset{ˇ}{X}}_{i}^{x_{0}} (t) \geq δ}$ . By virtue of (B.5), $P {ξ^{x_{0}} = \infty} > 1 - ε$ if $s_{0} < δ exp (- H_{ε})$ . Using (B.4) and (B.6) yields that

\begin{matrix} P \{\underset{t \to \infty}{lim sup} \frac{ln S^{y_{0}, s_{0}} (t)}{t} \leq \overset{ˇ}{r} < 0\} > 1 - ε if s_{0} < δ exp (- H_{ε}) . \end{matrix}

B.7

Thus, the process ${(Y (t), S (t))}_{t \geq 0}$ is not a recurrent diffusion process in $Δ \times (0, \infty)$ . Hence, it must be transient with probability 1, that is, for any compact $K \subset (0, \infty)$ and any initial value $(y, s) \in Δ \times (0, \infty)$ we have

\begin{matrix} P \{lim_{t \to \infty} 1_{{S^{y, s} (t) \in K}} = 0\} = 1 . \end{matrix}

B.8

In view of Lemma A.2,

\begin{matrix} P \{{lim}_{t \to \infty} S^{y, s} (t) = \infty\} = 0 . \end{matrix}

B.9

It follows from (B.8) and (B.9) that $P \{{lim}_{t \to \infty} S^{y, s} (t) = 0\} = 1$ for any $(y, s) \in Δ \times (0, \infty)$ . Moreover, since ${(\tilde{Y} (t))}_{{t \geq 0}}$ has a unique invariant probability measure $ν^{*}$ , on the boundary $Δ \times \{0\}$ the process $(Y (t), S (t))$ has a unique invariant probability measure $ν^{*} \times δ_{0}^{*}$ , where $δ_{0}^{*}$ is the Dirac measure concentrated on $\{0\}$ . Fix $(y, s) \in Δ \times (0, \infty)$ , and define the normalized occupation measures,

\begin{matrix} Π_{t} (\cdot) = \frac{1}{t} \int_{0}^{t} 1_{{(Y^{y, s} (u), S^{y, s} (u)) \in \cdot}} d u . \end{matrix}
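For a path sampled on a uniform time grid, the normalized occupation measure defined above can be approximated by a left Riemann sum. The following is a minimal sketch; the function name `occupation_measure`, the grid spacing `dt`, and the sample path are hypothetical and serve only as an illustration.

```python
def occupation_measure(path, dt, indicator):
    """Approximate Pi_t(A) = (1/t) * int_0^t 1{(Y(u), S(u)) in A} du.

    `path` holds samples (y, s) taken every `dt` time units, and
    `indicator` returns True when a sample lies in the set A.
    """
    t = dt * len(path)                                # total elapsed time
    hits = sum(1 for point in path if indicator(point))
    return (hits * dt) / t                            # fraction of time spent in A


# Hypothetical sample path whose S-component decays towards 0.
sample_path = [(0.5, 1.0), (0.5, 0.4), (0.5, 0.2), (0.5, 0.1)]
fraction = occupation_measure(sample_path, 0.5, lambda p: p[1] < 0.5)
print(fraction)
```

When $S^{y, s} (t) \to 0$ , the fraction of time spent in any fixed neighbourhood of $Δ \times \{0\}$ approaches 1, which is the mechanism behind the tightness claim in the text.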

Since $P \{lim_{t \to \infty} S^{y, s} (t) = 0\} = 1$ , the family $\{Π_{k} (\cdot), k \in N\}$ is tight in the space $Δ \times [0, \infty)$ for almost every $ω$ . In view of the proofs of Evans et al. (2015, Theorem 4.2) or Schreiber et al. (2011, Theorems 4, 5) the set of ${weak}^{*}$ limit points of ${Π_{k}, k \in N}$ is a nonempty set of invariant probability measures of the process $(Y (t), S (t))$ . As pointed out above, the process $(Y (t), S (t))$ has only one invariant probability measure, namely, $ν^{*} \times δ_{0}^{*}$ . Thus, for almost every $ω \in Ω, {Π_{k} (\cdot), k \in N}$ converges weakly to $ν^{*} \times δ_{0}^{*}$ as $k \to \infty$ . As a result, for any bounded continuous function $g (\cdot, \cdot) : Δ \times [0, \infty) \mapsto R$ we have $lim_{k \to \infty} \frac{1}{k} \int_{0}^{k} g (Y^{y, s} (t), S^{y, s} (t)) d t = \int_{Δ} g (y^{'}, 0) ν^{*} (d y^{'}) P -a.s .$ Since $g (\cdot, \cdot)$ is bounded, we easily obtain

\begin{matrix} lim_{T \to \infty} \frac{1}{T} \int_{0}^{T} g (Y^{y, s} (t), S^{y, s} (t)) d t = \int_{Δ} g (y^{'}, 0) ν^{*} (d y^{'}) P -a.s . \end{matrix}

B.10

Consequently,

\begin{matrix} lim_{T \to \infty} \frac{1}{T} \int_{0}^{T} (a^{⊤} Y^{y, s} (t) - \frac{1}{2} {Y^{y, s} (t)}^{⊤} Σ Y^{y, s} (t)) d t = r P -a.s \end{matrix}

B.11

Since $P \{{lim}_{t \to \infty} S^{y, s} (t) = 0\} = 1$ and $b_{i} (0) = 0, i = 1, \dots, n$ , we have by Dominated Convergence that

\begin{matrix} lim_{T \to \infty} \frac{1}{T} \int_{0}^{T} {Y^{y, s} (t)}^{⊤} b (S^{y, s} (t) Y^{y, s} (t)) d t = 0 P -a.s. \end{matrix}

B.12

Applying the strong law of large numbers for martingales to the process ${(M^{y, s} (t))}_{t \geq 0}$ defined by (A.26), we deduce

\begin{matrix} lim_{T \to \infty} \frac{M^{y, s} (T)}{T} = 0 P -a.s. \end{matrix}

B.13

Note that

\begin{matrix} \begin{matrix} \frac{ln S^{y, s} (T)}{T} = & \frac{ln s}{T} + \frac{M^{y, s} (T)}{T} - \frac{1}{T} \int_{0}^{T} {Y^{y, s} (t)}^{⊤} b (S^{y, s} (t) Y^{y, s} (t)) d t \\ + \frac{1}{T} \int_{0}^{T} (a^{⊤} Y^{y, s} (t) - \frac{1}{2} {Y^{y, s} (t)}^{⊤} Σ Y^{y, s} (t)) d t \end{matrix} \end{matrix}

B.14

Applying (B.11), (B.12), and (B.13) to (B.14), we obtain

\begin{matrix} lim_{T \to \infty} \frac{ln S^{y, s} (T)}{T} = r, P -a.s. \end{matrix}

B.15

In light of (B.15), to derive $P \{{lim}_{T \to \infty} \frac{ln X_{i}^{x} (T)}{T} = r\} = 1$ , it suffices to show $P \{{lim}_{T \to \infty} \frac{ln Y_{i}^{y, s} (T)}{T} = 0\} = 1$ for each $i = 1, \dots, n$ . In view of Itô’s lemma,

\begin{matrix} \frac{ln Y_{i}^{y, s} (T)}{T} = & \frac{ln y_{i}}{T} + \frac{1}{T} \int_{0}^{T} (a_{i} - \sum_{j = 1}^{n} a_{j} Y_{j}^{y, s} (t) - D_{i i} - \frac{σ_{i i}}{2} \\ + \sum_{j, k = 1}^{n} \frac{σ_{k j}}{2} Y_{k}^{y, s} (t) Y_{j}^{y, s} (t)) d t \\ + \frac{1}{T} \int_{0}^{T} (- b_{i} (S^{y, s} (t) Y_{i}^{y, s} (t)) + \sum_{j = 1}^{n} Y_{j}^{y, s} (t) b_{j} (S^{y, s} (t) Y_{j}^{y, s} (t))) d t \\ + \frac{1}{T} \int_{0}^{T} (\sum_{j = 1, j \neq i}^{n} D_{j i} \frac{Y_{j}^{y, s} (t)}{Y_{i}^{y, s} (t)}) d t \\ + \frac{1}{T} \int_{0}^{T} [d E_{i} (t) - \sum_{j = 1}^{n} Y_{j}^{y, s} (t) d E_{j} (t)], \end{matrix}

B.16

and

\begin{matrix} \frac{ln {\tilde{Y}}_{i}^{y} (T)}{T} = & \frac{ln y_{i}}{T} + \frac{1}{T} \int_{0}^{T} (a_{i} - \sum_{j = 1}^{n} a_{j} {\tilde{Y}}_{j}^{y} (t) - D_{i i} \\ - \frac{σ_{i i}}{2} + \sum_{j, k = 1}^{n} \frac{σ_{k j}}{2} {\tilde{Y}}_{k}^{y} (t) {\tilde{Y}}_{j}^{y} (t)) d t \\ + \frac{1}{T} \int_{0}^{T} (\sum_{j = 1, j \neq i}^{n} D_{j i} \frac{{\tilde{Y}}_{j}^{y} (t)}{{\tilde{Y}}_{i}^{y} (t)}) d t \\ + \frac{1}{T} \int_{0}^{T} [d E_{i} (t) - \sum_{j = 1}^{n} {\tilde{Y}}_{j}^{y} (t) d E_{j} (t)] . \end{matrix}

B.17

By the strong laws of large numbers for martingales,

\begin{matrix} lim_{T \to \infty} \frac{1}{T} \int_{0}^{T} [d E_{i} (t) - \sum_{j = 1}^{n} {\tilde{Y}}_{j}^{y} (t) d E_{j} (t)] = 0, P -a.s. \end{matrix}

B.18

Let $G_{i} = sup_{y \in Δ} \{|a_{i} - \sum_{j = 1}^{n} a_{j} y_{j} - D_{i i} - \frac{σ_{i i}}{2} + \sum_{j, k = 1}^{n} \frac{σ_{k j}}{2} y_{k} y_{j}|\} < \infty .$ As a result of (B.17) and (B.18) and the fact that ${lim sup}_{T \to \infty} \frac{ln {\tilde{Y}}_{i}^{y} (T)}{T} \leq 0$ almost surely, we obtain

\begin{matrix} \underset{T \to \infty}{lim sup} \frac{1}{T} \int_{0}^{T} (\sum_{j = 1, j \neq i}^{n} D_{j i} \frac{{\tilde{Y}}_{j}^{y} (t)}{{\tilde{Y}}_{i}^{y} (t)}) d t \leq G_{i}, P -a.s. \end{matrix}

B.19

For any $k > 0$ , it follows from (B.18) and the strong law of large numbers that

\begin{matrix} \int_{Δ} k \land (\sum_{j = 1, j \neq i}^{n} D_{j i} \frac{y_{j}}{y_{i}}) ν^{*} (d y) = & lim_{T \to \infty} \frac{1}{T} \int_{0}^{T} k \land (\sum_{j = 1, j \neq i}^{n} D_{j i} \frac{{\tilde{Y}}_{j}^{y} (t)}{{\tilde{Y}}_{i}^{y} (t)}) d t \\ \leq & G_{i} \end{matrix}

Letting $k \to \infty$ we have

\begin{matrix} ρ_{i} : = \int_{Δ} \sum_{j = 1, j \neq i}^{n} D_{j i} \frac{y_{j}}{y_{i}} ν^{*} (d y) \leq G_{i}, \end{matrix}

which implies

\begin{matrix} lim_{T \to \infty} \frac{1}{T} \int_{0}^{T} (\sum_{j = 1, j \neq i}^{n} D_{j i} \frac{{\tilde{Y}}_{j}^{y} (t)}{{\tilde{Y}}_{i}^{y} (t)}) d t = ρ_{i} . \end{matrix}

B.20

Using (B.18), (B.20), and applying the strong law of large numbers for the process ${(\tilde{Y} (t))}_{t \geq 0}$ , we arrive at

\begin{matrix} \begin{matrix} lim_{T \to \infty} \frac{ln {\tilde{Y}}_{i}^{y} (T)}{T} = β_{i} + ρ_{i} \leq 0, P -a.s., \end{matrix} \end{matrix}

B.21

where

\begin{matrix} β_{i} : = \int_{Δ} (a_{i} - \sum_{j = 1}^{n} a_{j} y_{j} - D_{i i} - \frac{σ_{i i}}{2} + \sum_{j, k = 1}^{n} \frac{σ_{k j}}{2} y_{k} y_{j}) ν^{*} (d y) . \end{matrix}

If $β_{i} + ρ_{i} < 0$ , then ${\tilde{Y}}_{i}^{y} (T) \to 0$ almost surely as $T \to \infty$ , which contradicts the fact that ${\tilde{Y}}^{y} (T)$ converges weakly to $ν^{*}$ that is concentrated on $Δ^{\circ}$ . As a result, $β_{i} + ρ_{i} = 0$ . For any $θ > 0$ , there is $k_{θ} > 0$ such that

\begin{matrix} \int_{Δ} k_{θ} \land (\sum_{j = 1, j \neq i}^{n} D_{j i} \frac{y_{j}}{y_{i}}) ν^{*} (d y) > ρ_{i} - θ . \end{matrix}

Using (B.10), we have with probability 1 that

\begin{matrix} \underset{T \to \infty}{lim inf} \frac{1}{T} \int_{0}^{T} (\sum_{j = 1, j \neq i}^{n} D_{j i} \frac{Y_{j}^{y, s} (t)}{Y_{i}^{y, s} (t)}) d t \\ \geq lim_{T \to \infty} \frac{1}{T} \int_{0}^{T} k_{θ} \land (\sum_{j = 1, j \neq i}^{n} D_{j i} \frac{Y_{j}^{y, s} (t)}{Y_{i}^{y, s} (t)}) d t \\ \geq ρ_{i} - θ . \end{matrix}

B.22

and

\begin{matrix} lim_{T \to \infty} \frac{1}{T} \int_{0}^{T} (a_{i} - \sum_{j = 1}^{n} a_{j} Y_{j}^{y, s} (t) - D_{i i} - \frac{σ_{i i}}{2} \\ + \sum_{j, k = 1}^{n} \frac{σ_{k j}}{2} Y_{k}^{y, s} (t) Y_{j}^{y, s} (t)) d t = β_{i} . \end{matrix}

B.23

Applying (B.22), (B.23), and the fact $P \{lim_{T \to \infty} S^{y, s} (T) = 0\} = 1$ to (B.16), we obtain that

\begin{matrix} \underset{T \to \infty}{lim inf} \frac{ln Y_{i}^{y, s} (T)}{T} \geq β_{i} + ρ_{i} - θ = - θ, P -a.s. \end{matrix}

Since it holds for any $θ > 0$ , we have

\begin{matrix} lim_{T \to \infty} \frac{ln Y_{i}^{y, s} (T)}{T} = 0, P -a.s. \end{matrix}

The above equality combined with (B.15) and $X_{i} (T) = Y_{i} (T) S (T)$ yields the desired result. $□$

Appendix C: Degenerate diffusion in $R^{n}$

If the covariance matrix $Σ$ is degenerate, the diffusion $\tilde{Y} (t)$ from (2.6) still has an invariant probability measure $ν^{*}$ since it is a Feller-Markov process in a compact set. Moreover, $ν^{*} (Δ^{\circ}) = 1$ because the property that $P \{\tilde{Y} (t) \in Δ^{\circ}, t > 0\} = 1$ is satisfied as long as Assumption 2.2 holds, that is, the dispersion matrix $(D_{i j})$ is irreducible. It is readily seen that the following is true.

Theorem C.1

\begin{matrix} P \{lim_{t \to \infty} \frac{ln X_{i}^{x} (t)}{t} = r\} = 1 . \end{matrix}

C.1

In particular, for any $i = 1, \dots, n$ and any $x = (x_{1}, \dots, x_{n}) \in R_{+}^{n}$

\begin{matrix} P \{lim_{t \to \infty} X_{i}^{x} (t) = 0\} = 1 . \end{matrix}

Remark C.1

The Markov process ${\tilde{Y} (t), t \geq 0}$ has a unique invariant probability measure if it is irreducible. Moreover, since $P {{\tilde{Y}}^{y} (t) > 0 for all t > 0} = 1$ for any $y \in Δ$ , we need only check its irreducibility in $Δ^{\circ}$ . To prove that the diffusion ${\tilde{Y} (t), t \geq 0}$ is irreducible in $Δ^{\circ}$ , we pursue the following approach:

First, we show that the process ${\tilde{Y} (t), t \geq 0}$ verifies Hörmander’s condition. As a result, the process ${\tilde{Y} (t), t \geq 0}$ has a smooth density function for any $t > 0$ ; see e.g., Rey-Bellet (2006).
Next, we show that there is an open set $N \subset Δ^{\circ}$ such that for any open set $N_{0} \subset N$ , and $y \in Δ^{\circ}$ , there is a $t_{0} > 0$ such that $P {{\tilde{Y}}^{y} (t_{0}) \in N_{0}} > 0$ . This claim is usually proved by analyzing the control systems corresponding to the diffusion and using the support theorem. We refer to Kliemann (1987) and Rey-Bellet (2006) for more details. This then shows that the process ${\tilde{Y} (t), t \geq 0}$ is irreducible in $Δ^{\circ}$ .

Now we consider the case $r > 0$ . We still assume that ${\tilde{Y} (t) : t \geq 0}$ has a unique invariant probability measure. In order to obtain Theorem 2.1 for our degenerate process, we have to show that there is a sufficiently large $T > 0$ such that the Markov chain ${(Y (k T), S (k T))}_{k \in N}$ is irreducible and aperiodic and every compact subset of $Δ^{\circ} \times (0, \infty)$ is petite for this Markov chain. Note that if every compact subset of $Δ^{\circ} \times (0, \infty)$ is petite with respect to ${(Y (k T), S (k T))}_{k \in N}$ , then any compact subset of $Δ \times (0, \infty)$ is petite with respect to ${(Y (k T), S (k T))}_{k \in N}$ by the arguments in the proof of Lemma A.1.

Sufficient conditions for the above properties can be obtained by verifying the well-known Hörmander condition as well as investigating the control systems associated with the diffusion (2.4). Once we have the Markov chain ${(Y (k T), S (k T))}_{k \in N}$ being irreducible and aperiodic, and every compact subset of $Δ^{\circ} \times (0, \infty)$ being petite for sufficiently large T, we can follow the steps from Appendix A to obtain the following result.

Theorem C.2

Assume that $\tilde{Y} (t)$ has a unique invariant probability measure $ν^{*}$ . Define r by (2.8). Suppose that Assumption 2.2 holds and that $r > 0$ . Assume further that there is a sufficiently large $T > 0$ such that the Markov chain ${(Y (k T), S (k T))}_{k \in N}$ is irreducible and aperiodic, and that every compact set in $Δ^{\circ} \times (0, \infty)$ is petite for this Markov chain.

\begin{matrix} lim_{t \to \infty} t^{q^{*}} {‖ P_{X} (t, x, \cdot) - π (\cdot) ‖}_{TV} = 0, x \in R_{+}^{n, \circ}, \end{matrix}

C.2

\begin{matrix} P \{lim_{T \to \infty} \frac{1}{T} \int_{0}^{T} f (X^{x} (t)) d t = \int_{R_{+}^{n, \circ}} f (u) π (d u)\} = 1 . \end{matrix}

C.3

C.1: Case study: $n = 2$

In what follows, we show that if $r > 0$ , there is a sufficiently large $T > 0$ such that the Markov chain ${(Y (k T), S (k T))}_{k \in N}$ is irreducible and aperiodic, and that every compact set in $Δ^{\circ} \times (0, \infty)$ is petite for the Markov chain.

For simplicity of presentation, we restrict ourselves to the $n = 2$ case, and assume that $b_{i} (x) = b_{i} x, x \geq 0, i = 1, 2$ for some $b_{1}, b_{2} > 0$ . As a result, (2.1) becomes

\begin{matrix} \{\begin{matrix} d X_{1} (t) = (X_{1} (t) (a_{1} - b_{1} X_{1} (t)) - α X_{1} (t) + β X_{2} (t)) d t + σ_{1} X_{1} (t) d B (t) \\ d X_{2} (t) = (X_{2} (t) (a_{2} - b_{2} X_{2} (t)) + α X_{1} (t) - β X_{2} (t)) d t + σ_{2} X_{2} (t) d B (t), \end{matrix} \end{matrix}

C.4

where $σ_{1}, σ_{2}$ are non-zero constants and ${(B (t))}_{t \geq 0}$ is a one dimensional Brownian motion.
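To experiment with the two-patch model (C.4), one option is an Euler–Maruyama scheme applied to $ln X_{1}$ and $ln X_{2}$ , which keeps the simulated abundances positive. The sketch below is only illustrative: the function name `simulate_two_patch`, the step size, and all numerical values are assumptions, and the log-coordinate drift is obtained by applying Itô's formula to (C.4).

```python
import math
import random


def simulate_two_patch(x1, x2, a, b, alpha, beta, sigma, T, dt=0.01, seed=0):
    """Euler-Maruyama for (C.4) in log-coordinates; returns (X1(T), X2(T))."""
    rng = random.Random(seed)
    u1, u2 = math.log(x1), math.log(x2)
    for _ in range(int(round(T / dt))):
        x1, x2 = math.exp(u1), math.exp(u2)
        # One Brownian increment drives both patches (perfectly correlated noise).
        dB = rng.gauss(0.0, math.sqrt(dt))
        # Ito's formula for ln X_i applied to (C.4).
        du1 = (a[0] - b[0] * x1 - alpha + beta * x2 / x1
               - 0.5 * sigma[0] ** 2) * dt + sigma[0] * dB
        du2 = (a[1] - b[1] * x2 + alpha * x1 / x2 - beta
               - 0.5 * sigma[1] ** 2) * dt + sigma[1] * dB
        u1, u2 = u1 + du1, u2 + du2
    return math.exp(u1), math.exp(u2)


# Hypothetical parameter values, chosen only for illustration.
end_state = simulate_two_patch(0.5, 0.3, (1.0, 0.5), (1.0, 2.0),
                               0.3, 0.2, (0.4, 0.6), T=5.0, seed=1)
print(end_state)
```

Setting `sigma = (0.0, 0.0)` and `alpha = beta = 0.0` reduces the scheme to two uncoupled logistic equations, which gives a quick sanity check against the equilibria $a_{i} / b_{i}$ .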

Setting $S (t) = X_{1} (t) + X_{2} (t)$ and $Y_{i} (t) = X_{i} (t) / S (t), i = 1, 2$ , we have from Itô’s Lemma,

\begin{matrix} \begin{matrix} d Y_{i} (t) & = Y_{i} (t) (a_{i} - \sum_{j = 1}^{2} a_{j} Y_{j} (t) - b_{i} S (t) Y_{i} (t) + S (t) \sum_{j = 1}^{2} b_{j} Y_{j}^{2} (t)) d t \\ + {(- 1)}^{i} (α Y_{1} (t) - β Y_{2} (t)) d t \\ + Y_{i} (t) (\sum_{j, k = 1}^{2} σ_{k} σ_{j} Y_{k} (t) Y_{j} (t) - \sum_{j = 1}^{2} σ_{i} σ_{j} Y_{j} (t)) d t \\ + {(- 1)}^{i} (σ_{2} - σ_{1}) Y_{1} (t) Y_{2} (t) d B (t) \\ d S (t) & = S (t) (\sum_{i = 1}^{2} (a_{i} Y_{i} (t) - Y_{i} (t) b_{i} S (t) Y_{i} (t))) d t \\ + S (t) (σ_{1} Y_{1} (t) + σ_{2} Y_{2} (t)) d B (t) . \end{matrix} \end{matrix}

C.5
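The stochastic growth rate r is the long-run time average of $a^{⊤} y - \frac{1}{2} y^{⊤} Σ y$ along the competition-free proportion process, as in (B.4) and (B.11). For the two-patch model this average can be approximated numerically. The sketch below assumes that the competition-free dynamics are (C.5) with $b_{1} = b_{2} = 0$ and that $Σ = (σ_{i} σ_{j})$ because a single Brownian motion drives both patches; the function name `estimate_r`, the clipping step, and all parameter values are hypothetical.

```python
import math
import random


def estimate_r(a, alpha, beta, sigma, T=200.0, dt=0.01, y1=0.5, seed=0):
    """Time-average estimate of r for the competition-free two-patch model."""
    rng = random.Random(seed)
    n = int(round(T / dt))
    total = 0.0
    for _ in range(n):
        y2 = 1.0 - y1
        m = sigma[0] * y1 + sigma[1] * y2        # y^T Sigma y equals m**2 here
        total += (a[0] * y1 + a[1] * y2 - 0.5 * m * m) * dt
        dB = rng.gauss(0.0, math.sqrt(dt))
        # Drift and noise of Y_1 read off from (C.5) with b_1 = b_2 = 0.
        drift = (y1 * (a[0] - a[0] * y1 - a[1] * y2)
                 - alpha * y1 + beta * y2
                 + y1 * (m * m - sigma[0] * m))
        y1 += drift * dt + (sigma[0] - sigma[1]) * y1 * y2 * dB
        # Crude projection back into (0, 1); an assumption of this sketch only.
        y1 = min(max(y1, 1e-12), 1.0 - 1e-12)
    return total / (n * dt)


# Identical patches: the estimate should equal a - sigma^2 / 2.
r_identical = estimate_r((1.0, 1.0), 0.3, 0.2, (0.5, 0.5))
print(r_identical)
```

When both patches share the same growth rate a and the same noise intensity σ, the integrand is constant and the estimate reduces to $a - σ^{2} / 2$ , which is a convenient check before trying heterogeneous parameters.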

We use the process ${(Y_{1} (t), Y_{2} (t), S (t))}_{t \geq 0}$ to construct a Lyapunov function for a suitable skeleton ${(Y_{1} (k T^{*}), Y_{2} (k T^{*}), S (k T^{*}))}_{k \in N}$ as we have done in Appendix A. However, to simplify the computations when verifying the hypotheses of Theorems C.1 and C.2, instead of working with $(Y_{1} (t), Y_{2} (t), S (t))$ , we treat the system $(Z (t), X_{2} (t))$ where $Z (t) : = X_{1} (t) / X_{2} (t)$ . An application of Itô’s Lemma yields

\begin{matrix} \begin{matrix} d Z (t) & = ((b_{2} - b_{1} Z (t)) Z (t) X_{2} (t) + β + {\hat{a}}_{1} Z (t) - α Z^{2} (t)) d t \\ + Z (t) [σ_{1} - σ_{2}] d B (t) \\ d X_{2} (t) & = X_{2} (t) (({\hat{a}}_{2} - b_{2} X_{2} (t)) + α Z (t)) d t + σ_{2} X_{2} (t) d B (t), \end{matrix} \end{matrix}

C.6

where ${\hat{a}}_{1} = a_{1} - a_{2} - α + β + σ_{2}^{2} - σ_{1} σ_{2}$ and ${\hat{a}}_{2} = a_{2} - β$ .

To proceed, we first convert (C.6) to Stratonovich form to facilitate the verification of Hörmander’s condition. System (C.6) can be rewritten as

\begin{matrix} \begin{matrix} d Z (t) = & ((b_{2} - b_{1} Z (t)) Z (t) X_{2} (t) + β + ({\hat{a}}_{1} - \frac{{(σ_{1} - σ_{2})}^{2}}{2}) Z (t) - α Z^{2} (t)) d t \\ + Z (t) [σ_{1} - σ_{2}] \circ d B (t) \\ d X_{2} (t) = & X_{2} (t) (({\hat{a}}_{2} - \frac{σ_{2}^{2}}{2} - b_{2} X_{2} (t)) + α Z (t)) d t + σ_{2} X_{2} (t) \circ d B (t) . \end{matrix} \end{matrix}

C.7

Let

\begin{matrix} A_{0} (z, y) = (\begin{matrix} (b_{2} - b_{1} z) z y + β + ({\hat{a}}_{1} - \frac{{(σ_{1} - σ_{2})}^{2}}{2}) z - α z^{2} \\ y ({\hat{a}}_{2} - \frac{σ_{2}^{2}}{2} - b_{2} y) + α z y \end{matrix}), \end{matrix}

and

\begin{matrix} A_{1} (z, y) = (\begin{matrix} (σ_{1} - σ_{2}) z \\ σ_{2} y \end{matrix}) . \end{matrix}

Recall that the diffusion (C.7) is said to satisfy Hörmander’s condition if the set of vector fields $A_{1},$ $[A_{1}, A_{0}],$ $[A_{1}, [A_{1}, A_{0}]],$ $[A_{0}, [A_{1}, A_{0}]],$ $\dots$ spans $R^{2}$ at every $(z, y) \in R_{+}^{2, \circ}$ , where $[\cdot, \cdot]$ is the Lie bracket, which is defined as follows (see Rey-Bellet 2006 for more details). If $Φ (z, y) = {(Φ_{1} (z, y), Φ_{2} (z, y))}^{⊤}$ and $Ψ (z, y) = {(Ψ_{1} (z, y), Ψ_{2} (z, y))}^{⊤}$ are vector fields on $R^{2}$ (where $z^{⊤}$ denotes the transpose of z), then the Lie bracket $[Φ, Ψ]$ is a vector field given by

\begin{matrix} {[Φ, Ψ]}_{j} (z, y) & = (Φ_{1} (z, y) \frac{\partial Ψ_{j}}{\partial z} (z, y) - Ψ_{1} (z, y) \frac{\partial Φ_{j}}{\partial z} (z, y)) \\ + (Φ_{2} (z, y) \frac{\partial Ψ_{j}}{\partial y} (z, y) - Ψ_{2} (z, y) \frac{\partial Φ_{j}}{\partial y} (z, y)), j = 1, 2 . \end{matrix}
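The bracket formula above is easy to check numerically with central finite differences. The sketch below evaluates $\frac{1}{σ_{2}} [A_{0}, A_{1}]$ at one point and compares it with the closed-form expression for $A_{2}$ stated in the proof of Proposition C.1. The helper `lie_bracket`, the step size `h`, and all parameter values are hypothetical choices made for illustration.

```python
def lie_bracket(phi, psi, z, y, h=1e-5):
    """[phi, psi](z, y) in the convention of the displayed formula,
    with partial derivatives replaced by central differences."""
    def partials(f, j):
        d_dz = (f(z + h, y)[j] - f(z - h, y)[j]) / (2.0 * h)
        d_dy = (f(z, y + h)[j] - f(z, y - h)[j]) / (2.0 * h)
        return d_dz, d_dy

    p, q = phi(z, y), psi(z, y)
    bracket = []
    for j in range(2):
        dpsi_dz, dpsi_dy = partials(psi, j)
        dphi_dz, dphi_dy = partials(phi, j)
        bracket.append((p[0] * dpsi_dz - q[0] * dphi_dz)
                       + (p[1] * dpsi_dy - q[1] * dphi_dy))
    return tuple(bracket)


# Hypothetical parameter values, chosen only for illustration.
a1, a2, b1, b2 = 1.0, 0.8, 1.0, 2.0
alpha, beta, s1, s2 = 0.3, 0.2, 0.4, 0.6
ahat1 = a1 - a2 - alpha + beta + s2 ** 2 - s1 * s2
ahat2 = a2 - beta


def A0(z, y):
    return ((b2 - b1 * z) * z * y + beta
            + (ahat1 - 0.5 * (s1 - s2) ** 2) * z - alpha * z ** 2,
            y * (ahat2 - 0.5 * s2 ** 2 - b2 * y) + alpha * z * y)


def A1(z, y):
    return ((s1 - s2) * z, s2 * y)


def A2_closed(z, y):
    sig = (s1 - s2) / s2
    return (sig * (beta + alpha * z ** 2) + (sig + 1.0) * b1 * z ** 2 * y - z * y * b2,
            -sig * alpha * z * y + b2 * y ** 2)


z0, y0 = 0.7, 1.3
numeric = tuple(c / s2 for c in lie_bracket(A0, A1, z0, y0))
print(numeric, A2_closed(z0, y0))
```

The two printed vectors should agree up to discretization error, which gives some confidence in the sign convention before iterating the bracket to higher order.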

Proposition C.1

Suppose that $σ_{1} \neq σ_{2}$ or $β + (b_{2} / b_{1}) (a_{1} - a_{2} - α + β) - α {(b_{2} / b_{1})}^{2} \neq 0$ . Then Hörmander’s condition holds for the diffusion ${(Z (t), X_{2} (t))}_{t \geq 0}$ given by (C.7). As a result, the transition probability $P (t, (z, y), \cdot)$ of ${(Z (t), X_{2} (t))}_{t \geq 0}$ has a smooth density $R_{+} \times R_{+}^{4} ∋ (t, z, y, z^{'}, y^{'}) \mapsto p (t, z, y, z^{'}, y^{'}) \in R_{+}$ with respect to Lebesgue measure.

Proof

Set $σ : = \frac{σ_{1} - σ_{2}}{σ_{2}}$ . By a direct calculation,

\begin{matrix} A_{2} (z, y) : = & \frac{1}{σ_{2}} [A_{0}, A_{1}] (z, y) = (\begin{matrix} σ (β + α z^{2}) + (σ + 1) b_{1} z^{2} y - z y b_{2} \\ - σ α z y + b_{2} y^{2} \end{matrix}), \end{matrix}

and for $k > 2$ , we have

\begin{matrix} A_{k} (z, y) : = & \frac{1}{σ_{2}} [A_{k - 1}, A_{1}] (z, y) \\ = & (\begin{matrix} σ^{k - 1} (β + {(- 1)}^{k} α z^{2}) + {(- 1)}^{k} {(σ + 1)}^{k - 1} b_{1} z^{2} y + {(- 1)}^{k + 1} z y b_{2} \\ {(- 1)}^{k + 1} σ^{k - 1} α z y + {(- 1)}^{k} b_{2} y^{2} \end{matrix}) . \end{matrix}

If $σ \neq 0$ or equivalently $σ_{1} \neq σ_{2}$ , a straightforward but tedious computation shows that the rank of the matrix with columns $A_{1}, A_{2}, A_{3}, A_{4}$ is always 2 for any $(z, y) \in R_{+}^{2, \circ}$ . As a result, if $σ_{1} \neq σ_{2}$ , Hörmander’s condition is satisfied for the diffusion (C.7). Therefore, the transition probability $P (t, (z, y), \cdot)$ of $(Z (t), X_{2} (t))$ has a smooth density function, denoted by $p (t, z, y, z^{'}, y^{'})$ ; see Rey-Bellet (2006, Corollary 7.2).

Now, we show that Hörmander’s condition holds if $σ_{1} = σ_{2}$ and $β + (b_{2} / b_{1}) (a_{1} - a_{2} - α + β) - α {(b_{2} / b_{1})}^{2} \neq 0$ . In this case,

\begin{matrix} A_{2} (z, y) = [A_{0}, A_{1}] (z, y) = (\begin{matrix} - α y z (b_{2} - b_{1} z) \\ α b_{2} y^{2} \end{matrix}), \end{matrix}

and

\begin{matrix} C (z, y) = (\begin{matrix} C_{1} (z, y) \\ C_{2} (z, y) \end{matrix}) : = [A_{0}, \frac{1}{α b_{2}} A_{2}] (z, y), \end{matrix}

where

\begin{matrix} C_{1} (z, y) & = y (2 b_{1} z / b_{2} - 1) A_{0, 1} (z, y) + y z (1 - z b_{1} / b_{2}) \frac{\partial A_{0, 1} (z, y)}{\partial z} \\ + z (z b_{1} / b_{2} - 1) A_{0, 2} (z, y) + y^{2} z (z b_{1} / b_{2} - 1) . \end{matrix}

Here $A_{0, i} (z, y)$ denotes the i-th component of $A_{0} (z, y) .$ Observe that $A_{1} (z, y), A_{2} (z, y)$ span $R^{2}$ for any $(z, y) \in R_{+}^{2, \circ}$ satisfying $z \neq b_{2} / b_{1} .$ If $z = b_{2} / b_{1}$ we have $C_{1} (b_{2} / b_{1}, y) = y A_{0, 1} (b_{2} / b_{1}, y) = y [β + (b_{2} / b_{1}) (a_{1} - a_{2} - α + β) - α {(b_{2} / b_{1})}^{2}] \neq 0$ , hence $C (b_{2} / b_{1}, y)$ and $A_{2} (b_{2} / b_{1}, y)$ span $R^{2}$ for all $y > 0$ . As a result, Hörmander’s condition also holds in this case.

$□$
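The equal-noise case $σ_{1} = σ_{2}$ of the proof can be checked with a few lines of arithmetic. The sketch below uses the expressions for $A_{1}$ , $A_{2}$ and $C_{1}$ exactly as displayed in the proof and confirms that $A_{1}, A_{2}$ become collinear on the line $z = b_{2} / b_{1}$ , where $C_{1}$ reduces to $y A_{0, 1}$ . All parameter values and function names are hypothetical.

```python
# Hypothetical parameter values with sigma_1 = sigma_2 = s.
a1, a2, b1, b2 = 1.0, 0.8, 1.0, 2.0
alpha, beta, s = 0.3, 0.2, 0.5
ahat1 = a1 - a2 - alpha + beta      # sigma_2^2 - sigma_1 * sigma_2 vanishes here
ahat2 = a2 - beta


def A01(z, y):
    """First component of A_0 when sigma_1 = sigma_2."""
    return (b2 - b1 * z) * z * y + beta + ahat1 * z - alpha * z ** 2


def dA01_dz(z, y):
    """Partial derivative of A01 with respect to z."""
    return (b2 - 2.0 * b1 * z) * y + ahat1 - 2.0 * alpha * z


def A02(z, y):
    """Second component of A_0 when sigma_1 = sigma_2."""
    return y * (ahat2 - 0.5 * s ** 2 - b2 * y) + alpha * z * y


def det_A1_A2(z, y):
    """Determinant of the 2x2 matrix with columns A_1 and A_2."""
    A1 = (0.0, s * y)
    A2 = (-alpha * y * z * (b2 - b1 * z), alpha * b2 * y ** 2)
    return A1[0] * A2[1] - A1[1] * A2[0]


def C1(z, y):
    """First component of C, following the displayed formula."""
    k = z * b1 / b2
    return (y * (2.0 * k - 1.0) * A01(z, y)
            + y * z * (1.0 - k) * dA01_dz(z, y)
            + z * (k - 1.0) * A02(z, y)
            + y ** 2 * z * (k - 1.0))


z_crit, y0 = b2 / b1, 1.3
print(det_A1_A2(1.0, y0), det_A1_A2(z_crit, y0), C1(z_crit, y0))
```

With these values the determinant vanishes only at `z_crit`, and `C1(z_crit, y0)` is non-zero, so the pair $C, A_{2}$ takes over exactly where $A_{1}, A_{2}$ fail to span.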

To proceed, we consider the following control system, which is associated with (C.7).

\begin{matrix} \{\begin{matrix} d z_{ϕ} (t) = & (b_{2} - b_{1} z_{ϕ} (t)) z_{ϕ} (t) y_{ϕ} (t) + β \\ + ({\hat{a}}_{1} - \frac{{(σ_{1} - σ_{2})}^{2}}{2}) z_{ϕ} (t) - α z_{ϕ}^{2} (t) + (σ_{1} - σ_{2}) z_{ϕ} ϕ (t) \\ d y_{ϕ} (t) = & y_{ϕ} (t) ({\hat{a}}_{2} - \frac{σ_{2}^{2}}{2} - b_{2} y_{ϕ} (t)) + α z_{ϕ} (t) y_{ϕ} (t) + σ_{2} y_{ϕ} (t) ϕ (t) \end{matrix} \end{matrix}

C.8

Let $(z_{ϕ} (t, z, y),$ $y_{ϕ} (t, z, y))$ be the solution to equation (C.8) with control $ϕ$ and initial value (z, y). Denote by $O_{1}^{+} (z, y)$ the reachable set from (z, y), that is the set of $(z^{'}, y^{'}) \in R_{+}^{2, \circ}$ such that there exists a $t \geq 0$ and a control $ϕ (\cdot)$ satisfying $z_{ϕ} (t, z, y) = z^{'}, y_{ϕ} (t, z, y) = y^{'}$ . We first recall some concepts introduced in Kliemann (1987). Let U be a subset of $R_{+}^{2, \circ}$ satisfying $u_{2} \in \bar{O_{1}^{+} (u_{1})}$ for any $u_{1}, u_{2} \in U$ . Then there is a unique maximal set $V \supset U$ such that this property still holds for V. Such V is called a control set. A control set C is said to be invariant if $\bar{O_{1}^{+} (w)} \subset \bar{C}$ for all $w \in C$ .

Finding invariant control sets for (C.8) is facilitated by using a change of variables argument. Put $w_{ϕ} (t) = z_{ϕ} (t) y_{ϕ}^{r + 1} (t)$ with $r = \frac{- σ_{1}}{σ_{2}}$ . We have

\begin{matrix} \{\begin{matrix} d w_{ϕ} (t) = & h (w_{ϕ} (t), y_{ϕ} (t)) d t \\ d y_{ϕ} (t) = & y_{ϕ} (t) ({\hat{a}}_{2} - \frac{σ_{2}^{2}}{2} - b_{2} y_{ϕ} (t)) + α w_{ϕ} (t) y_{ϕ}^{- r} (t) + σ_{2} y_{ϕ} (t) ϕ (t), \end{matrix} \end{matrix}

C.9

where

\begin{matrix} h (w, y) = & w (a_{1} - \frac{σ_{1}^{2}}{2} + r (a_{2} - \frac{σ_{2}^{2}}{2}) + r β - α \\ - b_{1} w y^{r} - b_{2} r y + β y^{1 - r} w^{- 1} + α r w y^{r - 1}) . \end{matrix}
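The point of the substitution $w = z y^{r + 1}$ with $r = - σ_{1} / σ_{2}$ is that the control $ϕ$ drops out of the equation for w. This can be confirmed numerically by evaluating the time derivative of $z y^{r + 1}$ along the vector field of (C.8) for two different control values. The sketch below does only this cancellation check; the function name `dw_dt` and the default parameter values are hypothetical.

```python
def dw_dt(z, y, phi, a1=1.0, a2=0.8, b1=1.0, b2=2.0,
          alpha=0.3, beta=0.2, s1=0.4, s2=0.6):
    """Time derivative of w = z * y**(r + 1) along the control system (C.8)."""
    r = -s1 / s2
    ahat1 = a1 - a2 - alpha + beta + s2 ** 2 - s1 * s2
    ahat2 = a2 - beta
    # Right-hand sides of (C.8) for a constant control value phi.
    dz = ((b2 - b1 * z) * z * y + beta
          + (ahat1 - 0.5 * (s1 - s2) ** 2) * z
          - alpha * z ** 2 + (s1 - s2) * z * phi)
    dy = y * (ahat2 - 0.5 * s2 ** 2 - b2 * y) + alpha * z * y + s2 * y * phi
    # Chain rule: dw = y^(r+1) dz + (r + 1) z y^r dy.
    return y ** (r + 1.0) * dz + (r + 1.0) * z * y ** r * dy


no_control = dw_dt(0.7, 1.3, 0.0)
strong_control = dw_dt(0.7, 1.3, 5.0)
print(no_control, strong_control)
```

The two values coincide up to rounding because the control contributes $z y^{r + 1} ϕ (σ_{1} - σ_{2} + (r + 1) σ_{2}) = 0$ , so the control can only move the state "vertically" in the $(w, y)$ plane.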

Denote by $O_{2}^{+} (w, y)$ the set of $(w^{'}, y^{'}) \in R_{+}^{2, \circ}$ such that there is a $t > 0$ and a control $ϕ (\cdot)$ such that $w_{ϕ} (t, w, y) = w^{'}, y_{ϕ} (t, w, y) = y^{'}$ .

Lemma C.1

The control system (C.9) has only one invariant control set $\tilde{C}$ and $\bar{O_{2}^{+} (w, y)} \supset \tilde{C}$ for any $(w, y) \in R_{+}^{2, \circ}$ . The set $\tilde{C}$ is defined by $\tilde{C} = \{(w, y) \in R_{+}^{2, \circ} : w < c^{*}\}$ , where

\begin{matrix} c^{*} = sup \{w : sup_{y > 0} {h (w^{'}, y)} \geq 0 for all w^{'} < w\} . \end{matrix}

Consequently, the control system (C.8) has only one invariant control set $C$ and $\bar{O_{1}^{+} (z, y)} \supset C$ for any $(z, y) \in R_{+}^{2, \circ}$ , where $C : = \{(z, y) \in R_{+}^{2, \circ} : z y^{r + 1} \leq c^{*}\}$ . Moreover, by Kliemann (1987, Lemma 4.1), $(Z (t), X_{2} (t))$ has at most one invariant probability measure whose support is $C$ .

Proof

First, we need to show that $c^{*}$ is well-defined (although it can be $+ \infty$ ). Since $lim_{w \to 0} h (w, y) = \infty$ , the set $\{w : sup_{y > 0} {h (w^{'}, y)} \geq 0 for all w^{'} \leq w\}$ is nonempty. Hence $c^{*}$ is well-defined. The claim that $\bar{O_{2}^{+} (w, y)} \supset \tilde{C}$ for any $(w, y) \in R_{+}^{2, \circ}$ can be proved by standard arguments. Let us explain the main ideas here. On the phase space $(w, y) \in R_{+}^{2, \circ}$ , since the control $ϕ (t)$ only appears in the equation of $y_{ϕ}$ , we can easily control vertically, that is, for any initial points $y_{0}$ and $w_{0}$ , there is a control so that $y_{ϕ}$ can reach any given point $y_{1}$ while $w_{ϕ}$ stays in a given neighborhood of $w_{0}$ . If $h (w_{0}, y_{0}) < 0$ , we can choose a feedback control such that $(w_{ϕ} (t), y_{ϕ} (t))$ reaches a point to the ‘left’ $(w_{1}, y_{0})$ with $w_{1} < w_{0}$ as long as $h (w, y_{0}) < 0$ for $w \in [w_{1}, w_{0}]$ . Likewise, for $h (w_{0}, y_{0}) > 0$ , we can choose a feedback control such that $(w_{ϕ} (t), y_{ϕ} (t))$ can reach a point to the ‘right’ $(w_{1}, y_{0})$ with $w_{1} > w_{0}$ as long as $h (w, y_{0}) > 0$ for $w \in [w_{0}, w_{1}]$ . We also have that $inf_{y > 0} {h (w, y)} = - \infty$ for any $w > 0$ . Using these facts, we can follow the steps from Du et al. (2016, Section 3) to obtain the desired results. $□$

Lemma C.2

There is a point $(z^{*}, y^{*}) \in C$ such that for any open set $N^{*} ∋ (z^{*}, y^{*})$ and $T > 0$ , there is an open neighborhood $W^{*} ∋ (z^{*}, y^{*})$ and a control $ϕ^{*}$ such that

\begin{matrix} (z_{ϕ^{*}} (t, z, y), y_{ϕ^{*}} (t, z, y)) \in N^{*} for all (z, y) \in W^{*}, t \in [0, T] . \end{matrix}

Proof

To obtain the result, we work on (C.9), which is equivalent to (C.8). By the definition of $\tilde{C}$ and the fact that $lim_{y \to \infty} h (w, y) = - \infty$ if $r > 0$ and $lim_{y \to 0} h (w, y) = - \infty$ if $r < 0$ , there is a point $(w^{*}, y^{*}) \in \tilde{C}$ such that $h (w^{*}, y^{*}) = 0$ . We can design a feedback control $ϕ^{*}$ such that

\begin{matrix} \{\begin{matrix} d w_{ϕ^{*}} (t) = & h (w_{ϕ^{*}} (t), y^{*}) d t \\ d y_{ϕ^{*}} (t) = & 0 . \end{matrix} \end{matrix}

C.10

Since $h (w^{*}, y^{*}) = 0$ , if $w_{ϕ^{*}} (0) = w^{*}$ then $w_{ϕ^{*}} (t) = w^{*}$ for all $t > 0$ . By the continuous dependence on initial data of solutions to differential equations, for any given neighborhood ${\tilde{N}}^{*}$ of $(w^{*}, y^{*})$ , we can find a neighborhood ${\tilde{W}}^{*}$ of $(w^{*}, y^{*})$ such that $(w_{ϕ^{*}} (t, w, y), y_{ϕ^{*}} (t, w, y)) \in {\tilde{N}}^{*}$ for any $t \in [0, T]$ and $(w, y) \in {\tilde{W}}^{*}$ , which proves the lemma. $□$

Proposition C.2

Suppose $σ_{1} \neq σ_{2}$ or $β + (b_{2} / b_{1}) (a_{1} - a_{2} - α + β) - α {(b_{2} / b_{1})}^{2} \neq 0$ . For any $T > 0$ , every compact set $K \subset R_{+}^{2, \circ}$ is a petite set with respect to the Markov chain ${(Z (k T), X_{2} (k T))}_{k \in N}$ .

Proof

Let $(z^{*}, y^{*})$ be as in Lemma C.2. Pick $(z^{⋄}, y^{⋄}) \in R_{+}^{2, \circ}$ such that $p (T, z^{*}, y^{*}, z^{⋄}, y^{⋄}) > 0$ . By the smoothness of $p (T, \cdot, \cdot, \cdot, \cdot)$ , there exist an open neighborhood $N^{*} ∋ (z^{*}, y^{*})$ and an open set $N^{⋄} ∋ (z^{⋄}, y^{⋄})$ such that

\begin{matrix} p (T, z, y, z^{'}, y^{'}) \geq p^{⋄} > 0 for all (z, y) \in N^{*}, (z^{'}, y^{'}) \in N^{⋄} . \end{matrix}

C.11

Let $W^{*}$ be a neighborhood of $(z^{*}, y^{*})$ satisfying

\begin{matrix} (z_{ϕ^{*}} (t, z, y), y_{ϕ^{*}} (t, z, y)) \in N^{*} for all (z, y) \in W^{*}, t \in [0, T] . \end{matrix}

C.12

For each $(z, y) \in R_{+}^{2, \circ}$ , noting that $(z^{*}, y^{*}) \in C \subset \bar{O_{1}^{+} (z, y)}$ , there is a control $ϕ$ and $t_{z, y} > 0$ such that

\begin{matrix} (z_{ϕ} (t_{z, y}, z, y), y_{ϕ} (t_{z, y}, z, y)) \in W^{*} . \end{matrix}

C.13

Let $n_{z, y} \in N$ be such that $(n_{z, y} - 1) T < t_{z, y} \leq n_{z, y} T$ and let $\tilde{ϕ}$ be defined as $\tilde{ϕ} (t) = ϕ (t)$ if $t < t_{z, y}$ and $\tilde{ϕ} (t) = ϕ^{*} (t)$ if $t > t_{z, y}$ . Using the control $\tilde{ϕ}$ , we obtain from (C.12) and (C.13) that

\begin{matrix} (z_{\tilde{ϕ}} (n_{z, y} T, z, y), y_{\tilde{ϕ}} (n_{z, y} T, z, y)) \in N^{*} . \end{matrix}

C.14

In view of the support theorem (see Ikeda and Watanabe 1989, Theorem 8.1, p. 518),

\begin{matrix} P (n_{z, y} T, z, y, N^{*}) : = 2 ρ_{z, y} > 0 . \end{matrix}

Since $(Z_{z, y} (t), Y_{z, y} (t))$ is a Markov–Feller process, there exists an open set $V_{z, y} ∋ (z, y)$ such that $P (n_{z, y} T, z^{'}, y^{'}, N^{*}) \geq ρ_{z, y}$ for all $(z^{'}, y^{'}) \in V_{z, y} .$ Since K is a compact set, there is a finite number of $V_{z_{i}, y_{i}}, i = 1, \dots, k_{0}$ satisfying $K \subset ⋃_{i = 1}^{k_{0}} V_{z_{i}, y_{i}} .$ Let $ρ_{K} = min \{ρ_{z_{i}, y_{i}}, i = 1, \dots, k_{0}\} .$ For each $(z, y) \in K$ , there exists $n_{z_{i}, y_{i}}$ such that

\begin{matrix} P (n_{z_{i}, y_{i}} T, z, y, N^{*}) \geq ρ_{K} . \end{matrix}

C.15

From (C.11) and (C.15), for all $(z, y) \in K$ , there exists $n_{z_{i}, y_{i}}$ such that

\begin{matrix} p ((n_{z_{i}, y_{i}} + 1) T, z, y, z^{'}, y^{'}) \geq ρ_{K} p^{⋄} for all (z^{'}, y^{'}) \in N^{⋄} . \end{matrix}

C.16

It follows from (C.16) that

\begin{matrix} \frac{1}{k_{0}} \sum_{i = 1}^{k_{0}} P ((n_{z_{i}, y_{i}} + 1) T, z, y, A) \geq & \frac{1}{k_{0}} ρ_{K} p^{⋄} m (N^{⋄} \cap A) \\ for all A \in B (R_{+}^{2, \circ}), \end{matrix}

C.17

where $m (\cdot)$ is the Lebesgue measure on $R_{+}^{2, \circ} .$ Equation (C.17) implies that every compact set $K \subset R_{+}^{2, \circ}$ is petite for the Markov chain ${(Z (k T), X_{2} (k T))}_{k \in N} .$ $□$

We have shown in the beginning of Sect. 2.2. that $\tilde{Y} (t)$ has a unique invariant probability measure $ν^{*}$ . Having Proposition C.2, we note that the assumptions, and therefore the conclusions, of Theorems C.1 and C.2 hold for model (C.4). This argument proves Theorems 2.5 and 2.6.

Appendix D: Robustness of the model

The robustness is studied from several angles, including continuous dependence of r on the coefficients of the stochastic differential equation, robustness of persistence, and robust attenuation against extinction. They are presented in a couple of subsections.

D.1: Continuous dependence of r on the coefficients

We show that r depends continuously on the coefficients of the stochastic differential equation (2.6). Consider the equation

\begin{matrix} d \hat{Y} (t) = & (diag (\hat{Y} (t)) - \hat{Y} (t) {\hat{Y}}^{⊤} (t)) {\hat{Γ}}^{⊤} d B (t) \\ & + {\hat{D}}^{⊤} \hat{Y} (t) d t + (diag (\hat{Y} (t)) - \hat{Y} (t) {\hat{Y}}^{⊤} (t)) (\hat{a} - \hat{Σ} \hat{Y} (t)) d t \end{matrix}

D.1

on the simplex $Δ$ . Suppose that $\hat{Σ} : = {\hat{Γ}}^{⊤} \hat{Γ}$ is positive definite. In this case, ${(\hat{Y} (t))}_{t \geq 0}$ has a unique invariant probability measure ${\hat{ν}}^{*}$ . Define

\begin{matrix} \hat{r} : = \int_{Δ} ({\hat{a}}^{⊤} y - \frac{1}{2} y^{⊤} \hat{Σ} y) {\hat{ν}}^{*} (d y) \end{matrix}

D.2
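The definition (D.2) suggests a simple numerical check: by ergodicity, a long time average of ${\hat{a}}^{⊤} y - \frac{1}{2} y^{⊤} \hat{Σ} y$ along a path of (D.1) should approximate $\hat{r}$ . The following is a minimal sketch, not the authors' method. It assumes constant coefficients, $\hat{Σ} = {\hat{Γ}}^{⊤} \hat{Γ}$ , a dispersal matrix with zero row sums, and an Euler–Maruyama discretization with a renormalization step that keeps the iterate on the simplex. The function names, step size and time horizon are illustrative choices.

```python
import numpy as np

def simplex_step(y, a, D, Gamma, dt, dB):
    """One Euler-Maruyama step of (D.1), then a projection back onto the simplex."""
    Sigma = Gamma.T @ Gamma                      # assumption: Sigma = Gamma^T Gamma
    proj = np.diag(y) - np.outer(y, y)           # diag(Y) - Y Y^T
    drift = D.T @ y + proj @ (a - Sigma @ y)
    y_new = y + drift * dt + proj @ (Gamma.T @ dB)
    y_new = np.clip(y_new, 1e-12, None)          # guard against discretization overshoot
    return y_new / y_new.sum()                   # renormalize so the entries sum to 1

def estimate_growth_rate(a, D, Gamma, T=200.0, dt=1e-2, seed=0):
    """Time average of a^T y - 0.5 y^T Sigma y along one simulated path of (D.1)."""
    rng = np.random.default_rng(seed)
    n = len(a)
    Sigma = Gamma.T @ Gamma
    y = np.full(n, 1.0 / n)                      # start at the centre of the simplex
    steps = int(T / dt)
    total = 0.0
    for _ in range(steps):
        total += a @ y - 0.5 * y @ Sigma @ y
        dB = rng.normal(0.0, np.sqrt(dt), size=n)
        y = simplex_step(y, a, D, Gamma, dt, dB)
    return total / steps

if __name__ == "__main__":
    a = np.array([0.3, 0.1])                     # hypothetical per-patch growth rates
    D = np.array([[-1.0, 1.0], [1.0, -1.0]])     # hypothetical dispersal, zero row sums
    Gamma = 0.4 * np.eye(2)                      # hypothetical noise matrix
    print(estimate_growth_rate(a, D, Gamma))
```

The estimate carries both Monte Carlo and discretization error, so it should be read as a rough indication of the sign and size of $\hat{r}$ rather than as a precise value.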

Fix the coefficients of (2.6).

Proposition D.1

For any $ε > 0$ , there is a $θ_{2} > 0$ such that if

\begin{matrix} max \{‖ a - \hat{a} ‖, ‖ D - \hat{D} ‖, ‖ Γ - \hat{Γ} ‖\} < θ_{2} \end{matrix}

then

\begin{matrix} | r - \hat{r} | < ε . \end{matrix}

Proof

First, let $θ_{1} > 0$ be such that if $max \{‖ a - \hat{a} ‖, ‖ D - \hat{D} ‖, ‖ Γ - \hat{Γ} ‖\} < θ_{1}$ , then

\begin{matrix} |({\hat{a}}^{⊤} y - \frac{1}{2} y^{⊤} \hat{Σ} y) - (a^{⊤} y - \frac{1}{2} y^{⊤} Σ y)| < \frac{ε}{3} for all y \in Δ . \end{matrix}

D.3

Let $γ_{1}, γ_{2}, M_{3}, M_{4}$ be defined as in the proof of Lemma A.5. Pick $T = T (ε) > 0$ such that

\begin{matrix} ‖ \tilde{P} (T, y, \cdot) - ν^{*} ‖_{T V} \leq γ_{1} exp (- γ_{2} T) < \frac{ε}{3 M_{4}} for all y \in Δ . \end{matrix}

D.4

By standard arguments, there is a $θ_{2} \in (0, θ_{1})$ such that if $max \{‖ a - \hat{a} ‖, ‖ D - \hat{D} ‖, ‖ Γ - \hat{Γ} ‖\} < θ_{2}$ , then

\begin{matrix} P \{‖ {\tilde{Y}}^{y} (T) - {\hat{Y}}^{y} (T) ‖ < \frac{ε}{6 M_{3}}\} > 1 - \frac{ε}{6 M_{4}} for all y \in Δ . \end{matrix}

D.5
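The pathwise closeness behind (D.5) can be explored numerically by driving the original and the perturbed equation with the same Brownian increments and measuring the distance at time T. The sketch below is only an illustration under stated assumptions: constant coefficients, $Σ = Γ^{⊤} Γ$ , an Euler–Maruyama scheme with renormalization onto the simplex, and hypothetical names (`em_step`, `coupled_gap`) and parameter values that do not come from the paper.

```python
import numpy as np

def em_step(y, a, D, Gamma, dt, dB):
    """One Euler-Maruyama step of the simplex equation, renormalized onto the simplex."""
    Sigma = Gamma.T @ Gamma                      # assumption: Sigma = Gamma^T Gamma
    proj = np.diag(y) - np.outer(y, y)
    y_new = y + (D.T @ y + proj @ (a - Sigma @ y)) * dt + proj @ (Gamma.T @ dB)
    y_new = np.clip(y_new, 1e-12, None)
    return y_new / y_new.sum()

def coupled_gap(coeffs, coeffs_hat, y0, T=5.0, dt=1e-3, seed=0):
    """Euclidean distance at time T between two paths driven by the same noise."""
    rng = np.random.default_rng(seed)
    y, y_hat = y0.copy(), y0.copy()
    for _ in range(int(T / dt)):
        dB = rng.normal(0.0, np.sqrt(dt), size=len(y0))   # shared Brownian increment
        y = em_step(y, *coeffs, dt, dB)
        y_hat = em_step(y_hat, *coeffs_hat, dt, dB)
    return np.linalg.norm(y - y_hat)

if __name__ == "__main__":
    a = np.array([0.3, 0.1])
    D = np.array([[-1.0, 1.0], [1.0, -1.0]])
    Gamma = 0.4 * np.eye(2)
    y0 = np.array([0.5, 0.5])
    for theta in (1e-1, 1e-2, 1e-3):
        a_hat = a + np.array([theta, 0.0])       # perturb only the first growth rate
        print(theta, coupled_gap((a, D, Gamma), (a_hat, D, Gamma), y0))
```

Repeating the run over many seeds and starting points gives an empirical estimate of the probability appearing in (D.5) for a given perturbation size.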

Let $y^{*}$ be a $Δ$ -valued and $F_{0}$ -measurable random variable whose distribution is ${\hat{ν}}^{*}$ . Clearly,

\begin{matrix} \int_{Δ} (a^{⊤} y - \frac{1}{2} y^{⊤} Σ y) {\hat{ν}}^{*} (d y) = E (a^{⊤} {\hat{Y}}^{y^{*}} (T) - \frac{1}{2} {({\hat{Y}}^{y^{*}} (T))}^{⊤} Σ {\hat{Y}}^{y^{*}} (T)) . \end{matrix}

D.6

In view of (D.4),

\begin{matrix} |E (a^{⊤} {\tilde{Y}}^{y^{*}} (T) - \frac{1}{2} {({\tilde{Y}}^{y^{*}} (T))}^{⊤} Σ {\tilde{Y}}^{y^{*}} (T)) - r| \leq & M_{4} sup_{y \in Δ} {‖ \tilde{P} (T, y, \cdot) - ν^{*} ‖}_{T V} \\ \leq & \frac{ε}{3} . \end{matrix}

D.7

It follows from (D.5) that

\begin{matrix} E |a^{⊤} {\hat{Y}}^{y^{*}} (T) - \frac{1}{2} {({\hat{Y}}^{y^{*}} (T))}^{⊤} Σ {\hat{Y}}^{y^{*}} (T) - a^{⊤} {\tilde{Y}}^{y^{*}} (T) + \frac{1}{2} {({\tilde{Y}}^{y^{*}} (T))}^{⊤} Σ {\tilde{Y}}^{y^{*}} (T)| \\ \leq M_{3} \frac{ε}{6 M_{3}} P \{‖ {\tilde{Y}}^{y^{*}} (T) - {\hat{Y}}^{y^{*}} (T) ‖ < \frac{ε}{6 M_{3}}\} + M_{4} P \{‖ {\tilde{Y}}^{y^{*}} (T) - {\hat{Y}}^{y^{*}} (T) ‖ \geq \frac{ε}{6 M_{3}}\} \leq \frac{ε}{3} . \end{matrix}

D.8

In view of (D.2), (D.3), (D.6), (D.7), and (D.8), if

\begin{matrix} max \{‖ a - \hat{a} ‖, ‖ D - \hat{D} ‖, ‖ Γ - \hat{Γ} ‖\} < θ_{2} \end{matrix}

then $| r - \hat{r} | < ε$ , which completes the proof. $□$
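For the reader's convenience, the last step of the proof can be spelled out as a three-term estimate. The shorthand $f (y) : = a^{⊤} y - \frac{1}{2} y^{⊤} Σ y$ and $\hat{f} (y) : = {\hat{a}}^{⊤} y - \frac{1}{2} y^{⊤} \hat{Σ} y$ is introduced here only for this display and is not part of the original notation. With it, $\hat{r} = \int_{Δ} \hat{f} (y) {\hat{ν}}^{*} (d y)$ by (D.2), and

\begin{matrix} | \hat{r} - r | \leq & |\int_{Δ} (\hat{f} (y) - f (y)) {\hat{ν}}^{*} (d y)| + |E f ({\hat{Y}}^{y^{*}} (T)) - E f ({\tilde{Y}}^{y^{*}} (T))| + |E f ({\tilde{Y}}^{y^{*}} (T)) - r| \\ < & \frac{ε}{3} + \frac{ε}{3} + \frac{ε}{3} = ε . \end{matrix}

The first term is bounded using (D.3). The second term uses (D.6) to rewrite $\int_{Δ} f (y) {\hat{ν}}^{*} (d y)$ as $E f ({\hat{Y}}^{y^{*}} (T))$ and is then bounded by (D.8). The third term is bounded by (D.7).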

Remark D.1

The continuous dependence of r on the coefficients can also be proved by generalizing the arguments from the proof of Evans et al. (2013, Proposition 3). Since Evans et al. (2013, Proposition 3) focuses only on the continuity for a specific parameter rather than all parameters, we provided an alternative proof for the sake of completeness.

D.2: Robust persistence and extinction

Sketch of proof of Theorem 2.8

As usual, we work with

\begin{matrix} d \hat{Y} (t) = & (diag (\hat{Y} (t)) - \hat{Y} (t) {\hat{Y}}^{⊤} (t)) {\hat{Γ}}^{⊤} (\hat{S} (t) \hat{Y} (t)) d B (t) + \hat{D} {(\hat{S} (t) \hat{Y} (t))}^{⊤} \hat{Y} (t) d t \\ & + (diag (\hat{Y} (t)) - \hat{Y} (t) {\hat{Y}}^{⊤} (t)) (\hat{a} - \hat{Σ} (\hat{S} (t) \hat{Y} (t)) \hat{Y} (t) - \hat{b} (\hat{S} (t) \hat{Y} (t))) d t, \\ d \hat{S} (t) = & \hat{S} (t) {[\hat{a} - \hat{b} (\hat{S} (t) \hat{Y} (t))]}^{⊤} \hat{Y} (t) d t + \hat{S} (t) {\hat{Y} (t)}^{⊤} {\hat{Γ}}^{⊤} (\hat{S} (t) \hat{Y} (t)) d B (t), \end{matrix}

D.9

where $\hat{S} (t) : = \sum_{i} {\hat{X}}_{i} (t), \hat{Y} (t) : = \frac{\hat{X} (t)}{\hat{S} (t)}$ . To obtain a complete proof of this theorem, one can follow the steps from Appendix A. First, since $Σ$ is positive definite, so is $\hat{Σ} (x) : = \hat{Γ} {(x)}^{⊤} \hat{Γ} (x)$ provided that ${sup}_{x \in R_{+}^{n, \circ}} ‖ \hat{Γ} (x) - Γ ‖$ is sufficiently small. As a result, ${(\hat{X} (t))}_{t \geq 0}$ is a nondegenerate diffusion in $R_{+}^{n, \circ}$ and Lemma A.1 holds for ${(\hat{Y} (n T), \hat{S} (n T))}_{n \in N}$ . We also have the following estimates: there exist positive constants ${\hat{K}}_{i}, i = 1, \dots, 4$ , which do not depend on $θ$ as long as $θ$ is sufficiently small, such that

\begin{matrix} E {\hat{S}}^{y, s} (t) \leq & e^{- γ_{b} t / 2} s + {\hat{K}}_{1}, (y, s) \in Δ \times (0, \infty), t \geq 0 . \end{matrix}

D.10

\begin{matrix} E ({[ln {\hat{S}}^{y, s} (T)]}^{2}) \leq & ({(ln s)}^{2} + 1) {\hat{K}}_{2} exp {{\hat{K}}_{3} T}, (y, s) \in Δ \times (0, \infty), T \geq 0, \end{matrix}

D.11

and

\begin{matrix} E ({[ln {\hat{S}}^{y, s} (T \land {\hat{ζ}}^{y, s})]}_{-}^{2}) \leq {(ln s)}^{2} + {\hat{K}}_{4} \sqrt{P (A)} (T + 1) {[ln s]}_{-} + {\hat{K}}_{4} T^{2} \end{matrix}

D.12

for all $(y, s) \in Δ \times (0, 1), A \in F$ where

\begin{matrix} {\hat{ζ}}^{y, s} : = inf {t \geq 0 : {\hat{S}}^{y, s} (t) = 1} . \end{matrix}

On the other hand, standard arguments show that for any $ε > 0, T > 0$ , there is a $θ = θ (ε, T) > 0$ such that

\begin{matrix} P \{∥(Y^{y, s} (t), S^{y, s} (t)) - ({\hat{Y}}^{y, s} (t), {\hat{S}}^{y, s} (t))∥ \leq ε, 0 \leq t \leq T\} > 1 - ε \end{matrix}

given that $(y, s) \in Δ \times [0, 1] .$ Combining this fact with Proposition A.2, one can find $δ = δ (ε, T) > 0$ and $θ = θ (ε, T) > 0$ such that

\begin{matrix} P \{∥({\tilde{Y}}^{y, s} (t), 0) - ({\hat{Y}}^{y, s} (t), {\hat{S}}^{y, s} (t))∥ \leq ε, 0 \leq t \leq T\} > 1 - ε \end{matrix}

given that $(y, s) \in Δ \times (0, δ)$ and (2.20) holds. Using this fact, we can apply Lemma A.5 with a slight modification to show that, for any $ε > 0$ , there are $T^{*} = T^{*} (ε)$ and $δ = δ (ε, T^{*}), θ = θ (ε, T^{*})$ such that

\begin{matrix} P \{ln s + \frac{3 r T^{*}}{4} \leq ln {\hat{S}}^{y, s} (T^{*}) < 0\} \geq 1 - 3 ε for all (y, s) \in Δ \times (0, δ) \end{matrix}

D.13

given that (2.20) holds. Having (D.10), (D.11), (D.12), and (D.13), we can use the arguments from Proposition A.3 and Theorem A.2 to finish the proof. $□$

Remark D.2

If $r < 0$ , $X (t)$ converges to $0$ with probability 1. By virtue of Proposition D.1, if $\hat{D}, \hat{Γ}$ are constant matrices and $max \{‖ a - \hat{a} ‖, ‖ D - \hat{D} ‖, ‖ Γ - \hat{Γ} ‖\}$ is sufficiently small, then $\hat{X} (t)$ converges to $0$ at an exponential rate almost surely. We conjecture that this result holds for any $θ$ -perturbation of $X (t)$ defined by (2.20). However, when $\hat{D} : = \hat{D} (x), \hat{Γ} : = \hat{Γ} (x)$ , comparison arguments may not be applicable. Moreover, it is also difficult to analyze the asymptotic behavior of the equation without competition terms, namely

\begin{matrix} d \hat{X} (t) = & (diag (\hat{X} (t)) \hat{a} + {\hat{D} (\hat{X} (t))}^{⊤} \hat{X} (t)) d t \\ & + diag (\hat{X} (t)) {\hat{Γ} (\hat{X} (t))}^{⊤} d B (t) . \end{matrix}

D.14

Footnotes

A. Hening was in part supported by EPSRC Grant EP/K034316/1.

The research of D. Nguyen and G. Yin was supported in part by the National Science Foundation under grant DMS-1710827.

Contributor Information

Alexandru Hening, Email: Alexandru.Hening@tufts.edu, Email: a.hening@imperial.ac.uk.

Dang H. Nguyen, Email: dangnh.maths@gmail.com

George Yin, Email: gyin@math.wayne.edu.

References

Altenberg L. The evolution of dispersal in random environments and the principle of partial control. Ecol Monogr. 2012;82(3):297–333. doi: 10.1890/11-1136.1.
Assing S, Manthey R. The behavior of solutions of stochastic differential inequalities. Probab Theory Relat Fields. 1995;103(4):493–514. doi: 10.1007/BF01246336.
Bascompte J, Possingham H, Roughgarden J. Patchy populations in stochastic environments: critical number of patches for persistence. Am Nat. 2002;159(2):128–137. doi: 10.1086/324793.
Benaïm M, Schreiber SJ. Persistence of structured populations in random environments. Theor Popul Biol. 2009;76(1):19–34. doi: 10.1016/j.tpb.2009.03.007.
Benaïm M, Hofbauer J, Sandholm WH. Robust permanence and impermanence for stochastic replicator dynamics. J Biol Dyn. 2008;2(2):180–195. doi: 10.1080/17513750801915269.
Blath J, Etheridge A, Meredith M. Coexistence in locally regulated competing populations and survival of branching annihilating random walk. Ann Appl Probab. 2007;17(5–6):1474–1507. doi: 10.1214/105051607000000267.
Cantrell RS, Cosner C. The effects of spatial heterogeneity in population dynamics. J Math Biol. 1991;29(4):315–338. doi: 10.1007/BF00167155.
Cantrell RS, Cosner C, Lou Y. Evolutionary stability of ideal free dispersal strategies in patchy environments. J Math Biol. 2012;65(5):943–965. doi: 10.1007/s00285-011-0486-5.
Caswell H. Matrix population models. New York: Wiley Online Library; 2001.
Chesson P. General theory of competitive coexistence in spatially-varying environments. Theor Popul Biol. 2000;58(3):211–237. doi: 10.1006/tpbi.2000.1486.
Chueshov I. Monotone random systems theory and applications. Berlin: Springer Science & Business Media; 2002.
Cross PC, Lloyd-Smith JO, Johnson PLF, Getz WM. Duelling timescales of host movement and disease recovery determine invasion of disease in structured populations. Ecol Lett. 2005;8(6):587–595. doi: 10.1111/j.1461-0248.2005.00760.x.
Davies KF, Chesson P, Harrison S, Inouye BD, Melbourne B, Rice KJ. Spatial heterogeneity explains the scale dependence of the native-exotic diversity relationship. Ecology. 2005;86(6):1602–1610. doi: 10.1890/04-1196.
Dennis B, Patil GP. The gamma distribution and weighted multimodal gamma distributions as models of population abundance. Math Biosci. 1984;68(2):187–212. doi: 10.1016/0025-5564(84)90031-2.
Dieu NT, Nguyen DH, Du NH, Yin G. Classification of asymptotic behavior in a stochastic SIR model. SIAM J Appl Dyn Syst. 2016;15(2):1062–1084. doi: 10.1137/15M1043315.
Du NH, Nguyen DH, Yin G. Conditions for permanence and ergodicity of certain stochastic predator-prey models. J Appl Probab. 2016;53:187–202. doi: 10.1017/jpr.2015.18.
Durrett R, Remenik D. Evolution of dispersal distance. J Math Biol. 2012;64(4):657–666. doi: 10.1007/s00285-011-0444-2.
Evans SN, Ralph PL, Schreiber SJ, Sen A. Stochastic population growth in spatially heterogeneous environments. J Math Biol. 2013;66(3):423–476. doi: 10.1007/s00285-012-0514-0.
Evans SN, Hening A, Schreiber SJ. Protected polymorphisms and evolutionary stability of patch-selection strategies in stochastic environments. J Math Biol. 2015;71(2):325–359. doi: 10.1007/s00285-014-0824-5.
Garay BM, Hofbauer J. Robust permanence for ecological differential equations, minimax, and discretizations. SIAM J Math Anal. 2003;34(5):1007–1039. doi: 10.1137/S0036141001392815.
Geiß C, Manthey R. Comparison theorems for stochastic differential equations in finite and infinite dimensions. Stoch Process Appl. 1994;53(1):23–35. doi: 10.1016/0304-4149(94)90055-8.
Gonzalez A, Holt RD. The inflationary effects of environmental fluctuations in source-sink systems. Proc Nat Acad Sci. 2002;99(23):14872–14877. doi: 10.1073/pnas.232589299.
Hardin DP, Takáč P, Webb GF. Asymptotic properties of a continuous-space discrete-time population model in a random environment. J Math Biol. 1988;26(4):361–374. doi: 10.1007/BF00276367.
Hardin DP, Takáč P, Webb GF. A comparison of dispersal strategies for survival of spatially heterogeneous populations. SIAM J Appl Math. 1988;48(6):1396–1423. doi: 10.1137/0148086.
Hardin DP, Takáč P, Webb GF. Dispersion population models discrete in time and continuous in space. J Math Biol. 1990;28(1):1–20. doi: 10.1007/BF00171515.
Harrison S, Quinn JF. Correlated environments and the persistence of metapopulations. Oikos. 1989;56(3):293–298. doi: 10.2307/3565613.
Hastings A. Can spatial variation alone lead to selection for dispersal? Theor Popul Biol. 1983;24(3):244–251. doi: 10.1016/0040-5809(83)90027-8.
Hutson V, Schmitt K. Permanence and the dynamics of biological systems. Math Biosci. 1992;111(1):1–71. doi: 10.1016/0025-5564(92)90078-B.
Ikeda N, Watanabe S. Stochastic differential equations and diffusion processes. Amsterdam: North-Holland Publishing Co.; 1989.
Jansen VAA, Yoshimura J. Populations can persist in an environment consisting of sink habitats only. Proc Nat Acad Sci. 1998;95(7):3696–3698. doi: 10.1073/pnas.95.7.3696.
Jarner SF, Roberts GO. Polynomial convergence rates of Markov chains. Ann Appl Probab. 2002;12(1):224–247. doi: 10.1214/aoap/1015961162.
Kallenberg O. Foundations of modern probability. Berlin: Springer; 2002.
Karlin S. Classifications of selection-migration structures and conditions for a protected polymorphism. Evol Biol. 1982;14(61):204.
Kendall BE, Bjørnstad ON, Bascompte J, Keitt TH, Fagan WF. Dispersal, environmental correlation, and spatial synchrony in population dynamics. Am Nat. 2000;155(5):628–636. doi: 10.1086/303350.
Khasminskii R (2012) Stochastic stability of differential equations, volume 66 of Stochastic modelling and applied probability, 2nd edn. Springer, Heidelberg (With contributions by G. N. Milstein and M. B. Nevelson)
Kirkland S, Li C-K, Schreiber SJ. On the evolution of dispersal in patchy landscapes. SIAM J Appl Math. 2006;66(4):1366–1382. doi: 10.1137/050628933.
Kliemann W. Recurrence and invariant measures for degenerate diffusions. Ann Probab. 1987;15:690–707. doi: 10.1214/aop/1176992166.
Law R, Morton RD. Permanence and the assembly of ecological communities. Ecology. 1996;77:762–775. doi: 10.2307/2265500.
Legendre P. Spatial autocorrelation: trouble or new paradigm? Ecology. 1993;74(6):1659–1673. doi: 10.2307/1939924.
Liebhold A, Koenig WD, Bjørnstad ON. Spatial synchrony in population dynamics. Annu Rev Ecol Evolut Syst. 2004;35:467–490. doi: 10.1146/annurev.ecolsys.34.011802.132516.
Mao X. Stochastic differential equations and their applications. Chichester: Horwood Publishing Limited; 1997.
Meyn SP, Tweedie RL. Stability of Markovian processes. III. Foster–Lyapunov criteria for continuous-time processes. Adv Appl Probab. 1993;24(3):518–548.
Mierczyński J, Shen W. Lyapunov exponents and asymptotic dynamics in random Kolmogorov models. J Evol Equ. 2004;4(3):371–390. doi: 10.1007/s00028-004-0160-0.
Nummelin E. General irreducible Markov chains and nonnegative operators, volume 83 of Cambridge tracts in mathematics. Cambridge: Cambridge University Press; 1984.
Palmqvist E, Lundberg P (1998) Population extinctions in correlated environments. Oikos 83(2):359–367
Patel S, Schreiber SJ (2016) Robust permanence for ecological equations with internal and external feedbacks. arXiv:1612.06554
Pyšek P, Hulme PE. Spatio-temporal dynamics of plant invasions: linking pattern to process. Ecoscience. 2005;12(3):302–315. doi: 10.2980/i1195-6860-12-3-302.1.
Rey-Bellet L (2006) Ergodic properties of Markov processes. In: Attal S, Joye A, Pillet CA (eds) Open quantum systems. II, volume 1881 of Lecture notes in mathematics, pp 1–39. Springer, Berlin
Roth G, Schreiber SJ. Persistence in fluctuating environments for interacting structured populations. J Math Biol. 2014;69(5):1267–1317. doi: 10.1007/s00285-013-0739-6.
Roy M, Holt RD, Barfield M. Temporal autocorrelation can enhance the persistence and abundance of metapopulations comprised of coupled sinks. Am Nat. 2005;166(2):246–261. doi: 10.1086/431286.
Rudnicki R. Long-time behaviour of a stochastic prey-predator model. Stoch Process Appl. 2003;108(1):93–107. doi: 10.1016/S0304-4149(03)00090-5.
Schmidt KA. Site fidelity in temporally correlated environments enhances population persistence. Ecol Lett. 2004;7(3):176–184. doi: 10.1111/j.1461-0248.2003.00565.x.
Schreiber SJ. Criteria for Cr robust permanence. J Differ Equ. 2000;162(2):400–426. doi: 10.1006/jdeq.1999.3719.
Schreiber SJ (2010) Interactive effects of temporal correlations, spatial heterogeneity and dispersal on population persistence. Proc R Soc Lond B Biol Sci. http://rspb.royalsocietypublishing.org/content/early/2010/02/12/rspb.2009.2006.full. Accessed 01 Dec 2016
Schreiber SJ. The evolution of patch selection in stochastic environments. Am Nat. 2012;180(1):17–34. doi: 10.1086/665655.
Schreiber SJ, Li C-K. Evolution of unconditional dispersal in periodic environments. J Biol Dyn. 2011;5(2):120–134. doi: 10.1080/17513758.2010.525667.
Schreiber SJ, Lloyd-Smith JO. Invasion dynamics in spatially heterogeneous environments. Am Nat. 2009;174(4):490–505. doi: 10.1086/605405.
Schreiber SJ, Ryan ME. Invasion speeds for structured populations in fluctuating environments. Theor Ecol. 2011;4(4):423–434. doi: 10.1007/s12080-010-0098-5.
Schreiber SJ, Benaïm M, Atchadé KAS. Persistence in fluctuating environments. J Math Biol. 2011;62(5):655–683. doi: 10.1007/s00285-010-0349-5.

