Skip to main content
Springer logoLink to Springer
. 2021 Apr 28;82(6):53. doi: 10.1007/s00285-021-01596-0

Separation of timescales for the seed bank diffusion and its jump-diffusion limit

Jochen Blath 1, Eugenio Buzzoni 1, Adrián González Casanova 2, Maite Wilke Berenguer 3,
PMCID: PMC8081708  PMID: 33909136

Abstract

We investigate scaling limits of the seed bank model when migration (to and from the seed bank) is ‘slow’ compared to reproduction. This is motivated by models for bacterial dormancy, where periods of dormancy can be orders of magnitude larger than reproductive times. Speeding up time, we encounter a separation of timescales phenomenon which leads to mathematically interesting observations, in particular providing a prototypical example where the scaling limit of a continuous diffusion will be a jump diffusion. For this situation, standard convergence results typically fail. While such a situation could in principle be attacked by the sophisticated analytical scheme of Kurtz (J Funct Anal 12:55–67, 1973), this will require significant technical efforts. Instead, in our situation, we are able to identify and explicitly characterise a well-defined limit via duality in a surprisingly non-technical way. Indeed, we show that moment duality is in a suitable sense stable under passage to the limit and allows a direct and intuitive identification of the limiting semi-group while at the same time providing a probabilistic interpretation of the model. We also obtain a general convergence strategy for continuous-time Markov chains in a separation of timescales regime, which is of independent interest.

Keywords: Strong seed bank, Two-island model, Separation of timescales, Diffusion limits, Jump-diffusion, Duality

Motivation and main results

In this extended introductory section, we first provide some background on the biological concept of dormancy and its relevance in particular in microbial communities. This is followed by a short review of modelling approaches for dormancy in population genetics, where we think that dormancy might be seen as an additional evolutionary force, interacting with other forces such as genetic drift in complex ways. Since dormancy periods vary over several orders of magnitude (depending on the underlying species and environmental conditions), we aim for a systematic classification of relevant timescales, leading to the distinction of three separate scaling regimes. While the first two regimes have been modelled and analysed in population genetics before, the last one, leading to a separation of timescales between genetic drift and dormancy periods, is new, and completes the picture (at least on the level of ‘toy models’) of modelling scenarios. Our results for this regime will be presented in this introduction both for the forward-in time population model as well as for the dual genealogical processes, leading to novel scaling limits, which are interesting also from a purely mathematical perspective.

The proofs of these results can be found in Sects. 2 and 3 for the results going backwards and forwards in time, respectively. We believe that our rather direct method of proof to obtain and characterise these limits, making extensive use of duality for Markov processes, can be applied in a variety of situations, so that in each section, we first present the corresponding methodology in a general set-up and then discuss its application to our concrete motivation.

Background on dormancy Dormancy is a complex trait that has developed independently in many species across the tree of life and comes in many different guises. Originally, theory for dormancy and the resulting seed banks has be developed in the context of bet-hedging strategies for plants Cohen (1966). However, dormancy is also a highly common trait in microbial communities, with important consequences for their evolutionary, ecological and pathogenic properties.

Here, we define dormancy as the ability of (micro-) organisms to enter and leave a state of vanishing metabolic activity. It has been observed for many habitats that at any given time a large fraction of micro-organisms can be in such a dormant state. For example, more than 80% of bacteria in soil are reported to be metabolically inactive, forming large ‘seed banks’ comprised of dormant individuals, see Lennon and Jone (2011). While dormancy seems to be an efficient and wide-spread strategy, e.g. to withstand unfavourable environmental conditions, competitive pressure, or antibiotic treatment, it is at the same time a costly trait whose maintenance involves energy and a sophisticated ‘switching machinery’.

Dormancy also plays a role in various (human) diseases. So-called persister cells, that may evade antibiotic treatment by remaining in a state of low activity, play a major role in chronic infections, cf. Fisher et al. (2017), and individual cell dormancy is linked to relapses in cancer, cf. Marx (2018), Endo and Inoue (2019).

In this paper, we will focus on microbial seed banks. Lennon and Jone (2011) and Shoemaker and Lennon (2018) provide a broad overview of this rich and fascinating field and serve as a motivation in the present paper. Given the relevance of biological systems exhibiting dormancy, investigating the mathematical implications of dormancy in large populations seems to be a timely and interesting task.

Classification of the duration of dormancy: Known models and motivation for this paper As indicated above, dormancy comes in many different forms, specific to the involved species and environments. One variation lies in the duration of dormancy periods: While in some microbial species dormancy periods last at most a few days, others stay dormant for prolonged periods of time, and some, e.g. bacterial endospores, have been reported to successfully resuscitate from dormancy after millions of years (Shoemaker and Lennon 2018; Cano and Borucki 1995; Johnson et al. 2007; Morono et al. 2020). The theoretical derivation and analysis of mathematical models may help to identify, understand and classify the different effects of dormancy, on suitable timescales, on the population dynamics and genealogical processes of the underlying populations.

Hence, in this paper, we consider the consequences of dormancy and seed banks in the framework of population genetics. More precisely, we are interested in the interplay of dormancy and the classical evolutionary force of random genetic drift, in particular with respect to its sensitivity to the duration of dormancy periods.

In a bi-allelic, haploid population that reproduces according to the Wright-Fisher model, the frequency of a given allele converges to the Wright-Fisher diffusion, given as the solution to

dZ(t)=Z(t)(1-Z(t))dB(t),

where (B(t))t0 is a standard Brownian motion, if one measures time in the coalescent timescale (also known as the evolutionary timescale), i.e. on the order of the population size as this tends to infinity. This diffusion is dual to the block-counting process of the Kingman coalescent which in turn describes the genealogy of the population. These objects serve as a reference for populations without dormancy and are widely studied and applied in biology and mathematics alike. See e.g. Wakeley (2009) or Etheridge (2011) for an overview. We will consider suitable extensions incorporating dormancy.

We propose to distinguish three regimes comparing the duration of dormancy periods to the coalescent timescale, i.e. the scale at which the random genetic drift acts.

1. Dormancy periods are small compared to the coalescent timescale.

In 2001, Kaj et al. (2001) introduced a model for dormancy in the following fashion: instead of always choosing the ancestor in the preceding generations like in the Wright-Fisher model, individuals are allowed to choose an ancestor several generations in the past. Their lineages thus ‘jump’ this number of generations and can be interpreted as dormant during that time. If we denote by B1 the expected size of the ‘jump’, the genealogy of the model converges on the coalescent timescale to a delayed Kingman coalescent, depicted in Fig. 1b, where coalescences occur at rate β2, where β:=1/B, instead of at rate 1, cf. Kaj et al. (2001), Blath et al. (2013). This in turn is dual to the delayed Wright-Fisher diffusion

dZ~(t)=β2Z~(t)(1-Z~(t))dB(t), 1

that again describes the frequency of a given allele in the population, cf. Fig. 2a. Note that β does not depend on the population size, whence its qualitatively weak impact on the coalescent timescale.

Fig. 1.

Fig. 1

Typical realisations of a the Kingman coalescent, where lineages merge at rate 1 per pair, b a delayed Kingman coalescent, where lineages merge at rate β2<1 per pair, and c the seed bank coalescent, see Def. 1.2. In the seed bank coalescent there are two kinds of lines: blue lines are active lineages, while purple lines are dormant lineages. The differences can be seen in the (asymptotic) expected time to the most recent ancestor when started with a sample of n (active and m dormant) individuals given on the time-axis (colour figure online)

Fig. 2.

Fig. 2

Typical realisations of the trajectory of a a time-changed Wright-Fisher diffusion, where the time-change is an effect of a weak seed bank, b the seed bank diffusion, with the frequency of a given allele in the active population displayed in blue and in the dormant population, in purple, c the frequency process (X~(t),Y~(t)), using the same colour code (colour figure online)

2. Dormancy periods on the order of the coalescent timescale

For microbial species, however, dormancy times can be much longer than just a few ‘generations’, In this set-up, Lennon and Jone (2011) proposed a model based on two reservoirs, the ‘active’ and the ‘dormant’ population, between which individuals ‘migrate/switch’ via initiation of and resuscitation from dormancy, at fixed rates. A mathematical model for ‘spontaneous/stochastic’ switching (observed in nature under stable environmental conditions, cf. Epstein 2009; Shoemaker and Lennon 2018), was introduced and studied in Blath et al. (2016). This is reminiscent of the ‘two-island model’ (Wright 1931; Moran 1959) with the notable difference of the absence of reproduction on the second island.

If the size of the active and dormant population are proportional with the ratio given by some K>0, the frequencies X(t) and Y(t) of a given allele in the active and dormant population, respectively, when time is measured on the coalescent timescale, are described by the seed bank diffusion, cf. Fig. 2b. This diffusion was first introduced in Corollary 2.5 in Blath et al. (2016). The existence of a unique strong solution that is Feller follows from Theorem 3.2 and Remark 3.2 in Shiga and Shimizu (1980), see also Greven et al. (2020) for a more general seed bank diffusion.

Definition 1.1

(Seed bank diffusion) Let (B(t))t0 be a standard Brownian motion and cK finite positive constants. The [0,1]2-valued continuous strong Markov process (X(t),Y(t))t0 given as the unique strong solution of the initial value problem

dX(t)=c(Y(t)-X(t))dt+X(t)(1-X(t))dB(t),[.1cm]dY(t)=Kc(X(t)-Y(t))dt, 2

with (X(0),Y(0))=(x,y)[0,1]2, is called seed bank diffusion with parameters cK, starting at (x,y)[0,1]2.

The genealogy of such a population is given by the seed bank coalescent, introduced in Definition 3.2 in Blath et al. (2016). Here, lineages can switch between an active and a dormant state independently (hence ‘spontaneous’ switching) at a given rate c>0. While the active lineages behave like the Kingman coalescent, dormant lineages are prohibited from coalescing, as depicted in Fig. 1c.

That dormancy appears in such a prominent form in the coalescent and in the diffusion and therefore is visible on the coalescent timescale is due to the underlying scaling assumptions of the model. These imply that dormancy times are of the order of the population size and therefore on the coalescent timescale. Here, many population genetic quantities and statistics are affected in non-trivial ways, see Blath et al. (2015), Blath et al. (2016) and Blath et al. (2020b) for a discussion of the scaling assumptions and further extensions of the model. Since the seed bank here has a major qualitative effect on both the diffusion and the coalescent, this is sometimes referred to as the strong seed bank model.

As in the previous models, an important mathematical tool in our analysis will be the formal duality relation between the seed bank diffusion (X(t),Y(t))t0 and the block-counting process of the seed bank coalescent (N(t),M(t))t0. Note that the notion of a ‘block’ comes from the mathematical definition of a coalescent as a partition-valued process. In the biological context, the process could as well be denoted the line-counting process, keeping track of the number of ancestral lines presents at each time in the past.

Definition 1.2

(Block-counting process of the seed bank coalescent) Let E:=N0×N0. Let c,K>0. We define (N(t),M(t))t0 to be the continuous-time Markov chain taking values in E with conservative Q-matrix R given by

R(n,m),(n¯,m¯)=n2if(n¯,m¯)=(n-1,m),cnif(n¯,m¯)=(n-1,m+1),cKmif(n¯,m¯)=(n+1,m-1),1-n2-cn-cKm,if(n¯,m¯)=(n,m),0,otherwise. 3

This continuous-time Markov chain introduced in Definition 2.7 in Blath et al. (2016), satisfies the moment duality

Ex,y[X(t)nY(t)m]=En,m[xN(t)yM(t)] 4

for every t>0, for every (x,y)[0,1] and for every n,mN0, see Theorem 2.8 in Blath et al. (2016). In other words, the distribution of the seed bank diffusion at any time t is uniquely determined by the moment dual at said time.

3. Dormancy periods are large compared to the coalescent timescale.

In view of the (potentially) extreme duration of dormancy times of bacterial spores, it is natural to ask: What happens in the third natural scaling-regime, when dormancy times are long in comparison to the scale on which genetic drift acts? This is the question answered in this manuscript in the following subsections.

To this end, we consider scaling limits of the above seed bank/two-island model when migration between active and dormant states (say at rate c) and reproduction (say at rate 1) act on different timescales, that is c being much smaller than 1. Interesting limits can only be expected when switching to a ‘fast’ super-evolutionary timescale. Indeed, if one just lets c0, then one obtains the trivial limit where the active population follows a Wright-Fisher diffusion and a Kingman coalescent, respectively, and is completely separated from the dormant population, as can be readily seen from (2) and (3). Hence, in order to capture the effect of long dormancy times one needs to speed up time by a factor 1/c, as c0, thus switching to a new timescale, which we will refer to as the super-evolutionary timescale. At this super-evolutionary timescale migration between the active and the dormant population occurs at rate 1 while reproduction, and hence genetic drift, acts ‘instantaneously’. Intuitively, fast reproduction should drive the X coordinate of the diffusion process immediately towards the boundaries 0 and 1, which then only rarely switches between these states due to immigration of ‘ancient’ alleles. This is indeed what we will see below.

This scaling regime also leads to mathematically appealing problems. The naïve scaling limit would lead to a coefficient of “” for the genetic drift in the seed bank diffusion and an infinite coalescent rate in the seed bank coalescent, respectively, and we thus need to find a way to rigorously identify and describe such a ‘degenerate’ mathematical limit.

Main results under separation of timescales: the frequency process The following two theorems provide the main results for the frequency processes of Wright-Fisher models with seed banks, if dormancy times are sufficiently long for the timescales of dormancy and genetic drift to separate. Note that we switch to the super-evolutionary timescale.

Theorem 1.3

Let (Xc(t),Yc(t))t0 be the seed bank diffusion given in Definition 1.1 with migration rate c>0. Assume that the initial distributions (Xc(0),Yc(0)) converge weakly to an (x,y)[0,1]2 as c0. Then, there exists a strong Markov process (X~(t),Y~(t))t0, started in (X~(0),Y~(0))=(x,y) with the property that for any sequence of migration rates with cκ0 when κ,

Xcκ1cκt,Ycκ1cκtt0f.d.d.(X~(t),Y~(t))t0asκ.

Furthermore,

limt0P{X~(t)=1}=1-limt0P{X~(t)=0}=x 5

and we may choose (X~(t),Y~(t))t0 to be cádlág and such that for every t>0 (X~(t),Y~(t)){0,1}×[0,1].

Here, càdlàg stands for continue à droite, limite à gauche, i.e. the property of a path to be right-continuous for every t0 and have a limit from the left for every t>0.

Note that the above convergence is in the sense of the finite-dimensional distributions (f.d.d.), which uniquely determines the law of the limit. As indicated above, it will have jumps in the first component X~, which is remarkable since the prelimiting processes all have continuous paths. In order to understand this, we prove in Proposition 3.8 that, if started in {0,1}×[0,1], (X~(t),Y~(t))t0 coincides in distribution with a Feller process (X¯(t),Y¯(t))t0 taking values in {0,1}×[0,1] which is defined via the generator

A¯f(x,y)=(1-x)y(f(1,y)-f(x,y))+x(1-y)(f(0,y)-f(x,y))+K(x-y)fy(x,y), 6

for functions f in {f:{0,1}×[0,1]Rf(0,·),f(1,·)C1([0,1],R)}.

The dynamics of the process (X~(t),Y~(t))t0 are therefore as follows: The first component X~ is indeed a piece-wise deterministic process, switching between states 0 and 1. The switching rate at time t for jumps from 0 to 1 is just given by the value of the second component Y~(t), and from 1 to 0 with complementary rate 1-Y~(t). In-between jump times of X~, the second component Y~ behaves deterministically, following the equation

dY~(t)=K(X~(t)-Y~(t))dt,

So while X~(t) is in state 0, Y~(t) decreases deterministically with exponential rate -KY~(t), and while X~(t) is in state 1, Y~(t) increases with exponential rate K(1-Y~(t)). This is illustrated in Fig. 2c.

Interpretation: dormancy versus genetic drift on different timescales In the classical Wright-Fisher model without dormancy, genetic drift drives the frequency process (Z(t))t0 of a given allele towards the boundaries 0 and 1, where it fixates. This occurs on timescales of the order the (effective total) population size.

In the weak seed bank regime frequencies are described by (Z~(t))t0 and genetic drift is ‘slowed down’ in a quantitative sense by a factor β2, since dormant individuals may jump generations, increasing the effective population size accordingly. For example, expected fixation times will be stretched by the factor β-2.

In the strong seed bank regime, dormancy times and genetic drift both act on the same timescale. The resulting additional seed bank ‘island’ in the diffusion (X(t),Y(t))t0 will slow down the effect of genetic drift in a qualitative sense. In fact, although the active population may fixate briefly in 0 or 1, the seed bank component will then quickly reintroduce variability via the migration term, hence the memory in the seed bank prevents final fixation in finite time (at least for non-trivial initial states). This interesting effect is discussed in detail in Blath et al. (2019), where it is also shown that the seed bank introduces ‘variability’ into the population model in a suitable sense, by means of a delay-equation reformulation of the seed bank diffusion.

Finally, in the extreme case where dormancy periods are much longer than the timescale of genetic drift, if time is measured in the super-evolutionary scale, fixation/extinction in the active population of (X~(t),Y~(t))t0 will happen instantaneously, and last for a finite time. The switches of the frequency in the active population between 0 and 1 can be explained as follows: When a single ‘ancient’ allele ‘resuscitates’, it will usually not be able to fixate in the population and go extinct again. However, on the super-evolutionary timescale, these ‘trials’ reoccur many times, and eventually a resuscitating allele will fixate. If it is of the same type as the allele currently present in the active population, nothing changes and there will be no jump. However, if it is of the other type, this will cause X~ to switch to the opposite boundary. The probabilities of the allele resuscitating at time t being of the given type or of the opposite type are Y~(t) and 1-Y~(t), which explains the form of the rates in Theorem 6.

These observations regarding fixation or coexistence of types can be summed up as follows. In the Wright-Fisher diffusion without mutation (Z(t))t0, ultimately, one type will fixate. In the weak seed bank regime described by (Z~(t))t0, there will also be one type that fixates, but the (expected) time until this happens is increased by a factor of β-2. In the strong seed bank regime, we will occasionally see fixation of one type in the active population, but then the seed bank will reintroduce variability immediately, so that coexistence is visible almost all the time. Finally, in the case of dormancy on the super-evolutionary timescale, at any given time, the active population will always be homomorphic, but the dominant type will switch from time to time, and there are no visible periods of coexistence at all.

Duality and genealogical interpretation of the scaling regimes

As we have seen, the processes describing the forward-in-time frequency of a given allele in a Wright-Fisher model with seed bank have natural dual processes describing their genealogies. Such genealogical processes shed light on the effect of dormancy on the ancestral processes of samples, but are also useful tools for the proofs of the previous theorems, as they tend to be mathematically simpler objects. Our new scaling regime is no exception.

In the super-evolutionary scaling regime of Theorem 1.3 we obtain the block-counting process of the ancient ancestral lines process as a scaling limit of the genealogies (see Theorem 1.5 below). Intuitively, since we are considering a population for which dormancy times are of a larger order than the times of coalescences, at the super-evolutionary timescale, coalescences occur instantaneously, while migration between the active and the dormant state occurs at order 1, cf. Fig. 3. Hence, in the limit, for each time t>0, there will be at most one active line. More formally, we obtain the following definition.

Fig. 3.

Fig. 3

A typical realisation of an ancient ancestral lines process. Blue lines are active lineages, purple lines are dormant. At the macroscopic time-scale coalescence occurs instantaneously, which is what we see between the times 0 and 0+. Afterwards we have at most one active lineage at any given time. If a dormant lineage activates, it coalesces immediately with the active lineage (colour figure online)

Definition 1.4

(The ancient ancestral lines process) Let (n0,m0)N0×N0. The (n0,m0)-ancient ancestral lines process is the continuous-time Markov chain (N~(t),M~(t))t0 with initial value (N~(0),M~(0))=(n0,m0), taking values in the state space

E(n0,m0):={0,,n0+m0}2,

with semi-group

Π(t):=PetG,t>0,

where Π(0) is defined as IE, the identity on E(n0,m0). P is a projection (P2=P) given by

P(n,m),(n¯,m¯):=1,ifn¯=1,n1,m¯=m,1,ifn¯=n=0,m¯=m,0,otherwise, 7

for all (n,m),(n¯,m¯)E(n0,m0) and G is defined as

G(n,m),(n¯,m¯):=Km,ifn¯=1,n0,m¯=m-1,1,ifn¯=0,n1,m¯=m+1,-1-Km,ifn¯=1,n1,m¯=m,-Km,ifn¯=n=0,m¯=m,0,otherwise.

Note the form of the semi-group of the Markov chain which in particular is not standard, i.e. limt0Π(t)=PIdE (cf. Chung 1960). Since the projection P acts for all t>0, this process takes values in the smaller space {0,1}×{0,,m0+1} P-a.s. for every (fixed) t>0. The first two “rates” given in the definition of G correspond to the events of resuscitation (with immediate coalescence if applicable) and initiation of dormancy. G is, however, not a Q-matrix, since for any n¯2 it has negative values off the diagonal. These only regard states that will be collapsed by P into the smaller state space.

The technical challenges due to the degenerate form of the semi-group of the scaling limit coming from “separation of timescales phenomena” (cf. for example Wakeley 2009, Chapter 6 from the population genetics perspective) require special care as we detail in Sect. 2.1. Subsequently, we apply the above strategy to our model in Sect. 2.2 proving that the ancient ancestral lines process arises as the scaling limit of the block-counting process of the seed bank coalescent in the sense of convergence of the finite-dimensional distributions.

Theorem 1.5

Denote by (Nc(t),Mc(t))t0 the block counting process of the seed bank coalescent as defined in Definition 1.2 with migration rate c>0 and assume that it starts at some (n0,m0)N×N, P-a.s.

Furthermore let (N~(t),M~(t)))t0 be the ancient ancestral lines process from Definition 1.4 with the same initial condition. Then, for any sequence of migration rates (cκ)κN with cκ0 when κ, we have

Ncκ1cκt,Mcκ1cκtt0f.d.d.(N~(t),M~(t))t0.

Without loss of generality, we assume (N~(t),M~(t))t0 to be càdlàg.

Spontaneous and simultaneous switching

One should note that for the above models, we assumed a ‘spontaneous’ switching. ‘Simultaneous’ switching, where transition to and from the dormant population are triggered by environmental cues, are currently an active area of research, see e.g. Blath et al. (2020a).

Scaling limits for continuous-time Markov chains

Motivated by the example of the super-evolutionary scaling in the introductory section, as a first step, we consider scaling limits of continuous-time Markov chains. Indeed, when speeding up time, some transition rates diverge to , thus obstructing direct Q-matrix computations and producing states that are vacated immediately. This effect is frequently observed when dealing with “separation of timescales phenomena” and can in a ‘well-behaved’ scenario still lead to a scaling limit with potentially “degenerate”, i.e. non-standard transition semi-group of the form

PetG,t0,

where P is a projection to a subspace of the original state space as a result of “immediately vacated states” and satisfies G=PG=GP. For discrete-time Markov chains, this situation was considered e.g. in Möhle (1998), Birkner et al. (2013) and recently also Möhle and Notohara (2016). Since the handling of such situations for continuous-time Markov chains (such as the above block counting process) might be of general interest and is somewhat more involved than the discrete case, we give a detailed “recipe” for such convergence proofs in Sect. 2.1. Note that all of these results can in principle be seen as specialised and ready-to-use variants of the general operator-theoretic scheme derived in Kurtz (1973) in the context of ‘random evolutions’ (see also Ethier and Kurtz 1986, Sect. 1.7). Recent applications of this scheme can also be found in Bobrowski (2015).

Separation of timescales phenomena for continuous-time Markov chains: a strategy

Given a sequence of continuous-time Markov chains (ξκ(t))t0, κN with finite state-space E (equipped with a metric d), suppose that our aim is to prove its convergence in finite-dimensional distributions under a suitable time-rescaling (Cκ)κN to a continuous-time Markov chain (ξ(t))t0 when κ.

Our programme to carry out such a proof has two steps:

First, consider an appropriate time discretisation of (ξκ(t))t0, κN. Employing the machinery from Birkner et al. (2013), Möhle (1998) and Möhle and Notohara (2016) available in this context, one can prove convergence of a rescaling of the discretised processes to a continuous-time Markov chain (ξ(t))t0 when κ in the sense of weak convergence in finite-dimensional distributions.

Second, we prove a continuity result to show that the suitably rescaled original process converges in finite-dimensional distributions to the same limit.

In order to formulate the conditions on the time-rescaling and the original sequence of Markov chains, we rewrite the time-rescaling as Cκ=bκ/aκ, where further assumptions on the non-negative sequences (aκ)κN and (bκ)κN will be specified below.

Step (i) Time discretisation and its convergence

The following lemma is an immediate application of Lemma 1.7 in Birkner et al. (2013) analogous to Theorem 1 in Möhle (1998). We rephrase it in this framework for the convenience of the reader and as reference for the examples we will consider below.

Observe that for a non-negative sequence (aκ)κN, (ξκ(i/aκ))iN is a discrete-time Markov chain with finite state-space E for each κN. We equip the matrices A=(Ae,e¯)e,e¯E on E with the matrix norm A:=maxeEe¯EAe,e¯. Since E is finite, convergence in the matrix norm is equivalent to pointwise convergence.

Lemma 2.1

Let (aκ)κN and (bκ)κN be non-negative sequences such that aκ, bκ, bκ/aκ as κ. For each κN denote by Πκ the transition matrix of the discrete-time, time-homogeneous Markov chain (ξκ(i/aκ))iN.

Assume that for every κN we have a representation of the transition matrix of the form

Πκ=Aκ+1bκBκ, 8

such that the following holds: Aκ is a stochastic matrix and

limClimκsuprCaκ(Aκ)r-P=0 9

for some matrix P. Furthermore, we require that the matrix limit with respect to the matrix norm

G:=limκPBκPexists. 10

Then, we obtain the following convergence (with respect to the matrix norm):

limκΠκtbκ=limκAκ+1bκBκtbκ=PetG=:Π(t)for allt>0. 11

In particular, if we define Π(0):=IdE, then (Π(t))t0 is a semi-group that generates a continuous-time Markov chain which we denote by (ξ(t))t0.

If ξκ(0)wξ(0) as κ, Eq. (11) implies

ξκbκtaκt0f.d.d.(ξ(t))t0,asκ.

Here, w denotes weak convergence.

Before proceeding to the proof of this lemma, let us make a few remarks about the assumptions and results observed in it.

Remark 2.2

  1. Since Πκ is the transition matrix of the (ξκ(t))t0 under a time-change by aκ-1, in a representation like (8), Aκ is a stochastic matrix that contains only entries of order 1 and aκ-1, and Bκ contains only entries of order 1 and o(1). Since we then speed-up time by a factor bκ, we obtain a separation of timescales, where the entries in Aκ give rise to a projection matrix P acting on the probability distributions on E, while the entries in Bκ give rise to a “Q-matrix”. The Aκ contain the transition rates of (ξκ(t))t0 that occur at a faster rate than the new timescale, hence they occur “instantaneously” in the limit. The entries in Bκ correspond to the transitions of (ξκ(t))t0 that either occur on the new timescale or are slower, hence describing the transitions visible in the limit and those that vanish.

  2. Note that given (9), the matrix P is necessarily a projection on E, i.e. satisfies P2=P. Since P=P2, we have PG=GP=G and hence PetG=etGP=P-I+etG for any t0. In particular, (Π(t))t0 is not standard, as limt0Π(t)=PΠ(0)=IdE. P effectively restricts the state-space of the limiting chain to a subspace of E.

    Observe that G differs from a normal Q-matrix as it may have negative entries off the diagonal.

Proof of Lemma 2.1

Conditions (8), (9) and (10) above are precisely conditions (36), (46) and (48) in Birkner et al. (2013). Hence (11) is the claim of (49) in Lemma 1.7 and Remark 1.8 in Birkner et al. (2013). Remark 2.2 in particular implies that the Chapman-Kolmogorov equations hold for (Π(t))t0 and hence this generates a continuous-time Markov chain which we denote by (ξ(t))t0 (see, for example, Kallenberg 2002, Thm. 8.4). The convergence in Eq. (11) and the Markov property then imply the convergence in finite-dimensional distributions.

Step (ii) Convergence of the continuous-time Markov chains

The previous step ensured the existence of a limit for suitably discretised versions of the original sequence of continuous-time Markov chains (ξκ(t))t0. The following lemma tells us under what conditions such a discretisation is sufficiently fine to also imply the convergence of the (ξκ(t))t0 to the same limit.

Lemma 2.3

Let (ξκ(t))t0,κN be a sequence of continuous-time, time-homogeneous Markov chains with finite state space E (equipped with some metric d). Let (aκ)κN and (bκ)κN be non-negative sequences.

Denote by Gκ the Q-matrix of (ξκ(t))t0 for each κN and set qκ:=maxeE-Ge,eκ. If

  1. qκaκ0 as κ, and

  2. ξκbκtaκt0f.d.d.(ξ(t))t0 as κ,

then also

ξκbκaκtt0f.d.d.(ξ(t))t0asκ.

Proof

When started at eE, the time to the first jump of ξκ is exponentially distributed with parameter -Ge,eκ. Hence on sees that condition a) was chosen precisely such that

P(ξκ(t))t0has a jump in0,1aκ1-exp-qκaκ0,κ. 12

Observe that for the distance between ξκbκtaκ and ξκbκtaκ at any time t0 we have

dξκbκtaκ,ξκbκtaκ>0

only if the process (ξκ(t))t0 has a jump in the interval bκtaκ,bκtaκ. Since the length of this interval can be estimated through

0bκtaκ-bκtaκ1aκ

and the Markov chains are time-homogeneous we can in turn estimate the probability of a jump in the interval using (12) and obtain

Pdξκbκtaκ,ξκbκtaκ>0P(ξκ(t))t0has a jump inbκtaκ,bκtaκ1-exp-qκaκ0,κ. 13

In order to prove the convergence of the finite-dimensional distributions, recall that weak convergence of measures is equivalent to convergence in the Prohorov metric (see, e.g. Whitt (2002), Section 3.2). Hence, assumption (b) yields that for all time points 0t0,,tl<, states e0,,elE and any ε>0 sufficiently small there exists a κ¯N such that for all κκ¯:

Pξκbκt0aκ=e0,,ξκbκtlaκ=elPξ(t0)=e0,,ξ(tl)=el-ε2.

Combining this with (13) we see that for all time points 0t0,,tl<, states e0,,elE and any ε>0 sufficiently small there exists a κ¯N such that for all κκ¯

Pξκbκt0aκ=e0,,ξκbκtlaκ=elP{ξκbκt0aκ=e0,,ξκbκtlaκ=el,dξκbκt0aκ,ξκbκt0aκ==dξκbκtlaκ,ξκbκtlaκ=0}Pξκbκt0aκ=e0,,ξκbκtlaκ=el-ε2Pξ(t0)=e0,,ξ(tl)=el-ε.

This implies the convergence of the finite-dimensional distributions of ξκbκaκtt0 to the finite-dimensional distributions of (ξ(t))t0 in the Prohorov metric and hence weakly, which completes the proof.

The ancient ancestral lines process (and other scaling limits)

Let us apply this machinery to the “ancestral lines process” introduced in Sect. 1. Indeed, consider the block-counting process of the seed bank coalescent defined in Definition 1.2 with vanishing migration rate c.

If we let c0 and simultaneously speed up time by a factor 1/c, we obtain a new structure given in Definition 1.4, thus uncovering a separation-of-timescales phenomenon. Theorem 1.5 formalises this heuristic and establishes the ancient ancestral lines process as scaling limit in finite-dimensional distributions of the block-counting process of the seed bank coalescent. Note that indeed P is a projection matrix and PG=GP=G, for P and G as in Definition 1.4.

Proof of Theorem 1.5

Let (cκ)κN be a positive sequence such that cκ0. Without loss of generality assume cκ1 for all κN. We prove the result using the machinery outlined in the previous section with aκ:=cκ-2 and bκ:=cκ-3.

Recall that (Ncκ(t),Mcκ(t))t0 is the block counting process of the seed bank coalescent as defined in Definition 1.2 with migration rate cκ>0 and assume that it starts at some (n0,m0)N×N, P-a.s. Let N0 be equipped with the discrete topology.

Step (i) In analogy to the notation in the previous section we abbreviate

(ξκ(t))t0:=(Ncκ(t),Mcκ(t))t0

and consider a discretised process with time steps of length aκ-1=cκ2 by defining

ηκ(i):=ξκ(icκ2),iN0.

Let Πκ be the transition matrix of the Markov chain (ηκ(i))iN0. The transition probabilities of this chain are

(Πκ)(n,m),(n¯,m¯)=P{ηκ(1)=(n¯,m¯)ηκ(0)=(n,m)}=P(Ncκ(cκ2),Mcκ(cκ2))=(n¯,m¯)(Ncκ(0),Mcκ(0))=(n,m)=n2cκ2+o(cκ3),ifn¯=n-1,m¯=m,cκncκ2+o(cκ3),ifn¯=n-1,m¯=m+1,cκKmcκ2+o(cκ3),ifn¯=n+1,m¯=m-1,1-n2cκ2-o(cκ2)-cκncκ2-cκKmcκ2-o(cκ3),ifn¯=n,m¯=m,o(cκ3),otherwise, 14

for any sensible (n,m),(n¯,m¯)E(n0,m0), recalling the convention of n2=0 for n1. This can be seen as follows.

Denote by T1 the time of the first jump of ξκ and by T2 the time between the first and the second jump of ξκ. By the strong Markov property we know that T1 and ξκ(T1), as well as T1 and T2 are independent. Conditioning on ξκ to start in (nm), we also know that T1 follows an exponential distribution with parameter n2+cκn+cκKm and that T2 dominates an exponential random variable with parameter 2n-12+n+12+cκ(3n+1+3Km) (condition on the possible values of ξκ(T1), then take the minimum of the possible exponential random variables describing the waiting time to the next jump). Using this one can check that

PT1+T2cκ2o(cκ3). 15

To calculate the transition probabilities in (14), note that (15) tells us that the probability of seeing more than one jump by ξκ in the interval [0,cκ2] is in o(cκ3). In particular, this gives us the order of the transition probabilities for ηκ to states summarised under “otherwise”, i.e. those that require more than one jump by ξκ. The transitions that are possible with just one jump are “coalescence”, “dormancy” and “resuscitation” in the order in which they appear in (14). We calculate the case of “coalescence”: Note that in order to see such a transition at least one jump must have happened. Hence,

P{ηκ(1)=(n-1,m)ηκ(0)=(n,m)}=P{ξκ(T1)=(n-1,m),T1cκ2,T1+T2>cκ2ηκ(0)=(n,m)}+o(cκ3)=P{ξκ(T1)=(n-1,m),T1cκ2ηκ(0)=(n,m)}+o(cκ3)=n2n2+cκ(n+Km)1-PT1cκ2+o(cκ3)=n2cκ2+o(cκ3)

where we used (15) for the third equality, the independence of ξκ(T1) and T1 for the fourth and a Taylor expansion and the distribution of T1 for the fifth equality. The transition probabilities for “dormancy” and “resuscitation” can be calculated analogously. The calculation of the transition probability to the same state the chain originated from is obvious.

With the representation in (14) we now obtain the decomposition as in (8)

Πκ=Aκ+Bκbκ

with bκ=cκ-3 as defined above and

(Aκ)(n,m),(n¯,m¯)=n2cκ2,ifn¯=n-1,m¯=m,1-n2cκ2,ifn¯=n,m¯=m,0,otherwise,

and

(Bκ)(n,m),(n¯,m¯)=n+o(1),ifn¯=n-1,m¯=m+1,Km+o(1),ifn¯=n+1,m¯=m-1,-n-Km+o(1),ifn¯=n,m¯=m,o(1),otherwise. 16

In order to apply Lemma 2.1, we now need to check condition (9), i.e.

limClimκsuprCcκ-2(Aκ)r-P=0 17

for P given in (7). Since Aκ is a stochastic matrix, let (Zrκ)rN0 be the Markov chain associated to it. This is a pure death process in the first component and constant in the second. By definition of the matrix norm, we get

(Aκ)r-P=max(n,m)E(n0,m0)(n¯,m¯)E(n0,m0)|(Aκ)(n,m),(n¯,m¯)r-P(n,m),(n¯,m¯)|=max(n,m)E(n0,m0)(|(Aκ)(n,m),(1,m)r-1|+n¯=2n|(Aκ)(n,m),(n¯,m)r-0|)=max(n,m)E(n0,m0)2(1-(Aκ)(n,m),(1,m)r)=2max(n,m)E(n0,m0)P{Zrκ(1,m)Z0κ=(n,m)}=2P{Zrκ(1,m0)Z0κ=(n0,m0)}

Observe that for all n{2,,n0} (and all m{0,,m0}) the probability of Zκ to jump to (n-1,m) in the next step can be bounded:

(Aκ)(n,m),(n-1,m)=n2cκ2cκ2.

Hence, the number of time-steps required for Zκ to reach (1,m0) if it is started in (n0,m0) is dominated by the sum of n0-1 independent geometric random variables γ1κ,,γn0-1κ with success probability cκ2. More precisely, if we define T:=inf{r¯N0Zr¯κ=(1,m0)}, then

P{Zrκ(1,m0)Z0κ=(n0,m0)}P{TrZ0κ=(n0,m0)}P{γ1κ++γn0-1κr}.

By Markov’s inequality, we get

Pγ1κ++γn0-1κr1rE[γ1κ++γn0-1κ]=1r(n0-1)cκ2.

Combining these observations we obtain

limClimκsuprCcκ-2(Aκ)r-PlimClimκsuprCcκ-22P{γ1κ++γn0-1κr}=limClimκ2P{γ1κ++γn0-1κCcκ-2}=limClimκcκ2C(n0-1)cκ2=0

and (17) holds. We are now left to establish the matrix-norm limit (10) and show that coincides with the G given in Definition 1.4. Notice that Bκ itself converges when κ uniformly and in the matrix norm (recalling that the state space E(n0,m0) is finite):

B:=limκBκ=n,ifn¯=n-1,m¯=m+1,Km,ifn¯=n+1,m¯=m-1,-n-Km,ifn¯=n,m¯=m,0,otherwise.

Simply multiplying the matrices on the left-hand-side we obtain PBP=G and therefore

limκPBκP=PBP=G 18

and thus (10). Since we have proven the assumptions, Lemma 2.1 yields

limκΠκtcκ-3=limκAκ+cκ3Bκtcκ-3=PetG=:Π(t)for allt>0,

and under the additional assumption that ηκ(0)=(Ncκ(0),Mcκ(0))=(N~(0),M~(0)), also

(ηκ(cκ-3t))t0f.d.d.(N~(t),M~(t))t0,κ,

where (N~(t),M~(t))t0 is the ancient ancestral lines process defined in Definition 1.4.

Step (ii) We would now like to apply Lemma 2.3. Denote by Qcκ the Q-matrix of the process (ξκ(t))t0 as given in Definition 1.2. We can estimate

qκ:=max(n,m)E(n0,m0)-(Qcκ)(n,m),(n,m)n0+m02+cκ(n0+m0)+cκK(n0+m0).

As we can see, condition (a) of Lemma 2.3 holds with

qκaκ=n0+m02+cκ(n0+m0)+cκK(n0+m0)cκ-20,κ.

Condition (b) was proven in Step (i). Therefore we may conclude

Ncκ(cκ-1t),Mcκ(cκ-1t)t0=ξκcκ-3cκ-2tt0f.d.d.N~(t),M~(t)t0

when κ and the proof of Theorem 1.5 is complete.

Remark 2.4

(Imbalanced Island Size) It is straightforward to pursue the same consideration for the two-island model and its structured coalescentHerbots (1994); Notohara (1990). The two-island model considers two populations much like the seed bank model, but allows for coalescence in the second population. Its genealogy is then given by the structured coalescent, whose block-counting process allows for the same transition rates described in (3) adding r(n,m),(n¯,m¯)=m2 for n¯=n and m¯=m-1, i.e. coalescence in the second island (and adapting the diagonal entries accordingly).

Letting the migration rate converge c0 while speeding up time by 1/c as we have done for the block counting process of the seed bank coalescent above will lead to a structure with instantaneous coalescences in both islands, leaving us with a single line migrating between them.

In this set-up it is much more interesting to consider a two-island model with different scalings of the coalescence rates in the islands. In order to do this, we introduce the parameters α and α such that the Q-matrix of the block-counting process of the structured coalescent now is

R^(n,m),(n¯,m¯)=αn2,if(n¯,m¯)=(n-1,m),αm2,if(n¯,m¯)=(n,m-1),cn,if(n¯,m¯)=(n-1,m+1),cKm,if(n¯,m¯)=(n+1,m-1),1-αn2-αm2-cn-cKm,if(n¯,m¯)=(n,m),0,otherwise. 19

α and α are associated with the notion of effective population size (cf. e.g. Wakeley 2009) so a different scaling corresponds to a significant difference in population size on the two islands. If, in addition to c0 we assume the coalescence rate α=α(c)>0 in the second island to scale as c, i.e. α/c1, the result is a two-island model with instantaneous coalescences in the first island, but otherwise ‘normal’ migration and coalescence behaviour in the second.

In order to formalise this heuristic observation, denote by (Nc,α(t),Mc,α(t))t0 the block-counting process of the structured coalescent as defined by the rates in (19) with migration rate c>0 and coalescence rate α>0 in the second island and assume that it starts at some (n0,m0)N×N, P-a.s. (The parameters α,K>0 are arbitrary but fixed.)

Define (N^(t),M^(t))t0 to be the continuous-time Markov chain with initial value (N^(0),M^(0))=(n0,m0), taking values in the state space E(n0,m0):={0,,n0+m0}2, with transition matrix Π(t):=PetG^, for t>0 and Π(0)=IdE, where P is given by (7) (as in the case of seed banks) and G^ is now a matrix of the form

G^(n,m),(n¯,m¯):=Km+m2,ifn¯=1,n1,m¯=m-1,Km,ifn¯=1,n=0,m¯=m-1,m2,ifn¯=0,n=0,m¯=m-1,1,ifn¯=0,n1,m¯=m+1,-m2-1-Km,ifn¯=1,n1,m¯=m,-m2-Km,ifn¯=n=0,m¯=m,0,otherwise.

Then, for any sequence of migration rates (cκ)κN and any sequence of coalescence rates (ακ)κN with cκ0 and cκ/ακ1 when κ

Ncκ,ακ1cκt,Mcκ,ακ1cκtt0f.d.d.(N^(t),M^(t))t0,κ.

This observation for the two-island model is analogous to Theorem 1.5 for seed banks. Its proof is a close parallel to that of Theorem 1.5. Considering, again, the sequences aκ:=cκ-2 and bκ:=cκ-3, Aκ and P coincide with those in the proof of Theorem 1.5, hence the hardest work has already been done. Small alterations to Bκ immediately yield the result and we therefore omit any further details.

Scaling limits for the diffusion

We would now also like to observe similar scaling limits for the diffusion (2). As we saw in the case of Markov chains, rescaling time may lead to a limiting process that is still Markovian, but whose semi-group is not standard, i.e. not continuous in 0. We can use moment duality to obtain this limit.

Convergence of the finite-dimensional distributions obtained from duality

We present a method to obtain convergence in finite-dimensional distributions of a sequence of Markov processes using moment duality and the convergence in finite-dimensional distributions of the dual processes. The result does not depend on whether time is rescaled, too, or not. It is, however, of particular interest in the rescaled case, since it might lead to the identification of limiting objects which rather “ill-behaved”. Indeed, we will see examples in Sect. 3.2 where the limit does not have a generator with a sufficiently large domain and hence the common approach of proving convergence through generator convergence fails.

For any tuples n:=(n1,,nd)N0d and x:=(x1,,xd)[0,1]d, define the mixed-moment function m as m(x,n):=x1n1xdnd.

Theorem 3.1

Let (ζκ(t))t0, κN0, be a sequence of Feller processes taking values in [0,1]d (for some dN), and (ξκ(t))t0, κN0, a sequence of Markov chains with values in N0d such that they are pairwise moment duals, i.e.

κN0,t0x[0,1]d,nN0d:En[m(x,ξκ(t))]]=Ex[m(ζκ(t),n)].

As usual, Pn and Px denote the distributions for which ξ and ζ, start in n and x, respectively.

If (ξκ)κN0 converges to some Markov chain ξ in the f.d.d.-sense, then there exists a Markov process ζ with values in [0,1]d such that it is the f.d.d.-limit of (ζκ)κN0 and the moment dual to ξ, i.e.

t0x[0,1]d,nN0d:En[m(x,ξ(t))]]=Ex[m(ζ(t),n)]. 20

Remark 3.2

At first glance one might suspect that this result should also hold in a more general set-up as long as the employed duality function yields convergence determining families for the respective semi-groups. Indeed, most of the steps of the proof would still go through. However, note that we did not assume existence of a limiting Markov process beforehand. We can conclude this by the solvability of Hausdorff’s moment problem on [0,1]dHildebrandt and Schoenberg (1933), which precisely treats the existence (and uniqueness) of a distribution with a given sequence of moments and therefore “matches” the moment duality function in our theorem.

Proof of Theorem 3.1

The proof can roughly be split into three steps: We first use duality to prove the convergence of the one-dimensional distributions of (ζκ)κN0. This, together with the Markov property will give us the convergence of the finite-dimensional distributions of (ζκ)κN0 to a family of limiting distributions. Then we prove consistency of this family of distributions and hence by Kolmogorov’s Extension-Theorem the existence of a limiting process ζ, which must then be Markovian.

Since the mixed-moment function m is continuous and bounded as a function on N0d, the convergence of the finite-dimensional distributions of (ξκ)κN and the assumed moment duality yield

Ex[m(ζκ(t),n)]=En[m(x,ξκ(t))]κEn[m(x,ξ(t))]=:γ(n,x,t) 21

for any t0, x[0,1]d and nN0d. For fixed x[0,1]d and t0 this is a monotonic sequence, i.e.

nN0d,k1n1,,kdnd:Δ1k1Δdkdγ(n,x,t)0,

where Δiγ(n,x,t):=γ((n1,,ni-1,ni-1,ni+1,,nl),x,t)-γ(n,x,t) is the difference operator acting on the ith component of n, i=1,,d. This can be seen from (21):

Δ1k1Δdkdγ(n,x,t)=limκΔ1k1ΔlkdEn[m(x,ξκ(t))]=limκΔ1k1ΔlkdEx[m(ζκ(t),n)]=limκEx[m(ζκ(t),n)(1-ζκ1(t))k1(1-ζκd(t))kd]0. 22

Hence, the Hausdorff moment problem for (γ(n,x,t))nN0d is solvable according to Theorem 1 in Hildebrandt and Schoenberg (1933), which means that there exists a measure μx,t on ([0,1]d,B([0,1]d)) (where B is the Borel-σ-algebra) such that

nN0d:γ(n,x,t)=[0,1]dm(x¯,n)dμx,t(x¯).

In particular, this holds for n=(0,,0), hence μx,t([0,1]d)=E0[1]=1 and μx,t is therefore a distribution. Since the polynomials are dense in the continuous functions, (21) implies the convergence of the one-dimensional distributions to (μx,t)t0 (for each starting point x[0,1]d).

To check the convergence in finite-dimensional distributions, let us first make a general observation regarding weak convergence. Let Pκ, κN, and P be distributions on ([0,1],B([0,1])) such that the Pκ converge weakly to P. Furthermore, let fκ,f:[0,1]R, κN, be continuous such that fκ(x) is uniformly bounded in κ and x and converges to f pointwise (and therefore uniformly). Then we can estimate

|[0,1]dfκ(x)Pκ(dx)-[0,1]df(x)P(dx)|maxx[0,1]dfκ(x)-f(x)+[0,1]df(x)Pκ(dx)-[0,1]df(x)P(dx)κ0 23

Returning to the task at hand, let Pκ be the probability transition function of ζκ and recall that we assumed the ζκ to be Feller, hence x[0,1]df(x¯)Pκ(x,t,dx¯) is continuous and bounded by 1 for any f continuous and bounded by 1. For 0t1<<tl<, x[0,1]d and n(1),,n(l)N0d then observe

Ex[m(ζκ(t1),n(1))m(ζκ(tl),n(l))]=[0,1]d[0,1]dm(x¯(1),n(1))m(x¯(l),n(l))Pκ(x¯(l-1),tl-tl-1,dx¯(l))Pκ(x,t1,dx¯(1))=[0,1]dm(x¯(1),n(1))[0,1]dm(x¯(l),n(l))Pκ(x¯(l-1),tl-tl-1,dx¯(l))Pκ(x,t1,dx¯(1))κ:γ(n(1),,n(l),x,t1,,tl). 24

Here we used the Markov property of ζκ in the first equality. For the convergence to some constant γ(n(1),,n(l),x,t1,,tl) we used the convergence of the finite-dimensional distributions shown above together with the observation that xm(x,n) is continuous and bounded by 1 (on [0,1]d) and a recursive application of (23).

By the same argument as in (22), for fixed x[0,1]d and t0, γ(n(1),,n(l),x,t1,,tl), (n(1),,n(l))(N0d)l, is a monotonic sequence and Theorem 1 in Hildebrandt and Schoenberg (1933) yields the existence of a distribution μI,x on (([0,1]d)I,B([0,1]d)I) for any finite set of indices I={t1,,tl}[0,) and starting point x[0,1]d. In addition, (24) implies the convergence of the finite-dimensional distributions of (ζκ)κN to a respective μI,x. Since these μI,x are the limits of a consistent family they are themselves consistent and according to Kolmogorov’s Extension-Theorem there exists a unique measure μx on the product-space (([0,1]d)[0,),B([0,1]d)[0,)) which is the distribution of the desired process ζ. This is a Markov process, because the (ζκ)κN0 are Markov processes.

The duality of ζ and ξ follows from the duality of the prelimiting processes.

Ancient ancestral material scaling regime

As an application of Theorem 3.1 we consider the diffusion (2) with the scaling regime of Sect. 2.2, namely, with the migration rate c0 while simultaneously speeding up time by a factor 1/c and obtain Theorem 1.3 stating the convergence of the rescaled diffusions to a Markovian limit (X~(t),Y~(t))t0.

Theorem 3.3

Let (Xc(t),Yc(t))t0 be the seed bank diffusion given in Definition 1.1 with migration rate c>0. Assume that the initial distributions (Xc(0),Yc(0)) converge weakly to a (x,y)[0,1]2 as c0. Then, there exists a Markov process (X~(t),Y~(t))t0, started in (X~(0),Y~(0))=(x,y) with the property that for any sequence of migration rates with cκ0 when κ,

Xcκ1cκt,Ycκ1cκtt0f.d.d.(X~(t),Y~(t))t0asκ

and (X~(t),Y~(t))t0 is the moment dual of (N~(t),M~(t))t0 given in Definition 1.4.

Note that (X~(t),Y~(t))t0 makes sense for more general initial conditions in [0,1]2. In any case, the limiting process (X~(t),Y~(t))t0 instantaneously jumps into the smaller state space {0,1}×[0,1] at time 0+. The jump probabilities to 0 and 1 are the fixation probabilities of the ordinary Wright-Fisher diffusion. This corresponds to an instantaneous application of a projection operator P~ defined as the limit (in a suitable sense)

P~:=limtP~t,

where (P~t)t0 is the semi-group associated to the classical Wright-Fisher diffusion, cf. Kurtz (1973) (or Bobrowski 2015, Equation (3)). Intuitively, this can be explained as follows: In the regime, where dormancy duration is significantly larger than the effect of genetic drift, the population evolves according to a Wright-Fisher diffusion without dormancy and has the chance to be absorbed in 0 or 1, before ever seeing a resuscitation/migration into the population from the seed bank. Hence, on the super-evolutionary time-scale the probabilitiesIntuitively to immediately jump to 0 or 1 are precisely given by the corresponding fixation probabilities of the Wright-Fisher diffusion.

Remark 3.4

(Convergence on path space?) Once convergence of the finite-dimensional distributions is established in Theorem 1.3, it is natural (at least for mathematicians) to ask whether it is possible to prove tightness on the space of càdlàg paths space in order to obtain weak convergence. However, since the set of continuous paths form a closed subset of the càdlàg paths in the classical Skorohod (J1) topology (cf. Skorohod (1956)), and the solutions to our pre-limiting seed bank diffusions are continuous, convergence in the above topologies would predict a limit with continuous paths, which we know not to be correct at least in 0. This makes weak convergence on path space impossible. However, the set of jump times of the above process is finite on finite time intervals, and in particular has Lebesgue-measure zero, so that we expect that convergence is true in weaker topologies, such as the Meyer-Zheng topology corresponding to convergence in measure (Meyer and Zheng 1984; Kurtz 1991). However, we refrain from going into these technicalities here, which we consider to be outside the scope of this manuscript.

Remark 3.5

Remark 3.2 in Shiga and Shimizu (1980) implies that the unique strong solution to the SDE (1.1) which is the seed bank diffusion from Definition 2 is a Feller process. This is considered in more generality in Theorem 2.4 in Greven et al. (2020).

Proof of Theorem 3.3

Since the Xcκt/cκ,Ycκt/cκt0 are constant time-changes of the seed bank diffusion introduced in Definition 1.1, they are Feller, as well.

Since the moment duality of the block-counting process of the seed bank coalescent and the seed bank diffusion (4) holds for every time t0, it is preserved for the time-changed processes Ncκt/cκ,Mcκt/cκt0 and Xcκt/cκ,Ycκt/cκt0. Together with Theorem 1.5 all assumptions of Theorem 3.1 hold and we get the existence of a Markov process (X~(t),Y~(t))t0 that is the dual of (N~(t),M~(t))t0. By the uniqueness of the solution to the Hausdorff moment problem (Theorem 2 in in Hildebrandt and Schoenberg 1933) a distribution on [0,1]2 is uniquely determined by all its mixed-moments. The moment duality of the limit with a process that does not depend on the scaling sequence (cκ)κN0 therefore implies that the one-dimensional distributions of the limit do not depend on the choice of scaling sequence, either. Since the limit is a Markov-process the one-dimensional distributions uniquely determine its entire distribution. Hence, the distribution of the limit does not depend on the choice of scaling sequence (cκ)κN0.

So far we have characterised the process (X~(t),Y~(t))t0 only as the moment dual of the continuous-time Markov chain (N~(t),M~(t))t0 whose semi-group we could give explicitly in Definition 1.4. We now use this characterisation to better understand the process (X~(t),Y~(t))t0 itself. More precisely, since (20) holds in particular for t>0, m=0 and any n1, x,y[0,1] we see

Ex,y[X~(t)nY~(t)0]=En,0[xN~(t)yM~(t)]=xPn,0(N~(t)=1,M~(t)=0)+yPn,0(N~(t)=0,M~(t)=1)=x(PetG)(n,0),(1,0)+y(PetG)(n,0),(0,1)=x(etG)(1,0),(1,0)+y(etG)(1,0),(0,1). 25

We used the fact that the first component of the ancient ancestral lines process (N~(t),M~(t))t0 takes values in {0,1} for any t>0 in the second equality and the definition of the projection in the last equality. Since the right-hand side does not depend on n1, we can conclude that

X~(t){0,1}Px,y-a.s.\ for anyt>0and any(x,y)[0,1]2. 26

We can use this observation together with (25) to obtain

limt0Px,y{X~(t)=1}=limt0Ex,y[X~(t)n]=x(IN02)(1,0),(1,0)+y(IN02)(1,0),(0,1)=x. 27

(Here (IN02) is the identity matrix on N02.)

This small observation has an important consequence: Much like in the case of its dual (N~(t),M~(t))t0, the semi-group of the ancient ancestral material process (X~(t),Y~(t))t0 is not right-continuous in 0.

Intuitively, the reproduction mechanism (in the active population) acts so fast, that fixation (or extinction) in the active population happens instantaneously. Whenever there is an invasion from the seed bank, the chances that this is by an individual of the type extinct in the active population (thereby causing a change of type here) are given by the frequency of said type in the dormant population. The limit is thus a pure jump process in the active component that moves between the states 0 and 1 at rates proportional to the frequency in the dormant population of the allele that is extinct in the active population, while the seed bank component retains its classical behaviour. We can formalise this observation if we restrict the process to the smaller state space {0,1}×[0,1], see Proposition 3.8 below.

Definition 3.6

Let (N¯(t),M¯(t))t0 be the Markov chain on {0,1}×N0 given by the Q-matrix

G¯(n,m),(n¯,m¯)=Km,ifn¯=1,n{0,1},m¯=m-1,1,ifn¯=0,n=1,m¯=m+1,-n-Km,ifn¯=n,m¯=m,0,otherwise.

for any (n,m),(n¯,m¯){0,1}×N0.

Furthermore, let (X¯(t),Y¯(t))t0 be the Markov process on {0,1}×[0,1] defined by the generator given in (6).

Proposition 3.7

(X¯(t),Y¯(t))t0 is well-defined i.e. the closure of A¯ given in (6) is indeed the generator of a Markov process and this process is Feller. Furthermore, we may assume that (X¯(t),Y¯(t))t0 is cádlág on [0,).

Proof

Define E:={0,1}×[0,1] and

C:=C(E,R)={f:ERf(0,·),f(1,·):[0,1]Rare continuous},DA¯:=C1:={f:ERf(0,·),f(1,·):[0,1]Rare differentiable}.

We verify the conditions of the Hille–Yosida Theorem, cf. Theorem 19.11 in Kallenberg (2002), for (A¯,DA¯), where A¯ is given in (6). First note that

{f:ERf(0,·),f(1,·):[0,1]Rare polynomials}DA¯

hence DA¯ is dense in C. In order to verify the maximum principle choose an arbitrary fDA¯ and let (x¯,y¯)E be such that f(x¯,y¯)f(x,y)0 for all (x,y)E. Then

Af(x¯,y¯)=(1-x¯)y¯(f(1,y¯)-f(x¯,y¯))+x¯(1-y¯)(f(0,y¯)-f(x¯,y¯))+K(x¯-y¯)fy(x¯,y¯)

Since we assumed f to have a maximum in (x¯,y¯), the first two summands are non-positive. If y¯(0,1), a maximum in (x¯,y¯) implies fy(x¯,y¯)=0. If y¯=0, a maximum in (x¯,y¯) implies fy(x¯,y¯)0 and therefore (x¯-0)fy(x¯,y¯)0. Likewise, if y¯=1, a maximum in (x¯,y¯) implies fy(x¯,y¯)0 and therefore (x¯-1)fy(x¯,y¯)0. Hence, A¯f(x¯,y¯)0 and the maximum principle holds. We are left to prove that there exists a λ>0 such that (λ-A¯)DA¯ is dense in C. First, observe that fC if and only if it can be written in the form f(x,y)=(1-x)f0(y)+xf1(y), where f0(·),f1(·):[0,1]R are continuous. Since the polynomials are dense in the continuos functions on [0, 1] and A¯ is a linear operator, it suffices to show that for any rN0 we can find f(r),f(r)DA¯ such that (λ-A¯)f(r)(x,y)=(1-x)yr and (λ-A¯)f(r)(x,y)=xyr. In an intuitive abuse of notation, we will in the following denote maps of the form (x,y)xyr by xyr and likewise for (1-x)yr. We begin by calculating, for any rN0

(λ-A¯)xyr=λxyr-(1-x)y(yr-xyr)-x(1-y)(0-xyr)-K(x-y)xryr-1=(1-x)(-yr+1)+x(-rKyr-1+(λ+1+rK)yr-yr+1),=x(-rKyr-1+(λ+1+rK)yr)-yr+1(λ-A¯)(1-x)yr=λ(1-x)yr-(1-x)y(0-(1-x)yr)-x(1-y)(yr-(1-x)yr)-K(x-y)(1-x)ryr-1=(1-x)((λ+rK)yr+yr+1))+x(-yr+yr+1),=(1-x)(λ+rK)yr+x(-yr)+yr+1(λ-A¯)yr=x(-rKyr-1)+(λ+rK)yr.

Proceed by induction on the degree r, beginning with r=0. Observe that (λ-A¯)1=λ and

(λ-A¯)(1-x)-1λ+Ky=(1-x)λ-x+y-1λ+K-xK+(λ+K)y=(1-x)λ+x-λλ+K,

therefore

(λ-A¯)λ+K-λ(λ+K+1)(1-x)-1λ+Ky-1=λ+K-λ(λ+K+1)x(-λ)+x-λλ+K=xy0

and immediately also

(λ-A¯)1λ-λ+K-λ(λ+K+1)(1-x)-1λ+Ky-1=1-x=(1-x)y0.

Now let nN and assume that for any rn-1 there exist f(r),f(r)DA¯ such that (λ-A¯)f(r)(x,y)=(1-x)yr and (λ-A¯)f(r)(x,y)=xyr. Note that

(λ-A¯)yn+nKf(n-1)(x,y)=(λ+nK)yn.

In addition, similarly to the above,

(λ-A¯)(1-x)yn-1λ+(n+1)Kyn+1=(1-x)(λ+nK)yn+x-λλ+(n+1)Kyn.

Hence we may again obtain

(λ-A¯)-λ-(n+1)Kλ+(λ+nK)(λ+(n+1)K)×(1-x)yn-1λ+(n+1)Kyn+1-yn+nKf(n-1)=-λ-(n+1)Kλ+(λ+nK)(λ+(n+1)K)x(λ+nK)(-yn)+x-λλ+(n+1)Kyn=xyn

and with this also

(λ-A¯){1λ+nKyn+nKf(n-1)(x,y)+λ+(n+1)Kλ+(λ+nK)(λ+(n+1)K)×(1-x)yn-1λ+(n+1)Kyn+1-yn+nKf(n-1)}=yn-xyn=(1-x)yn.

This completes the proof that (λ-A¯)DA¯ is dense in C. Hence, the closure of A¯ generates a Feller semigroup on C. According to Kallenberg (2002, Proposition 19.14) this Feller semigroup then generates a Feller process, which we may assume to be cádlág paths thanks to Kallenberg (2002, Theorem 19.15).

Both processes correspond to the ancient ancestral material scaling when considering only the reduced “effective” state spaces:

Proposition 3.8

The processes (N¯(t),M¯(t))t0 and (X¯(t),Y¯(t))t0 introduced in Definition 3.6 are moment duals, i.e.

t0(x,y)[0,1]2,(n,m)N02:En,mxN¯(t)yM¯(t)=Ex,yX¯(t)nY¯(t)m. 28

Furthermore, (N¯(t),M¯(t))t0 coincides in distribution with (N~(t),M~(t))t0 if (both are) started in the reduced state-space {0,1}×N0.

Likewise, (X¯(t),Y¯(t))t0 coincides in distribution with (X~(t),Y~(t))t0 if (both are) started in the reduced state-space {0,1}×[0,1].

Moment duality of the involved processes will be important for the proof of the last statement, which is crucial for the proof of Theorem 1.3.

Fig. 4.

Fig. 4

Strategy of the proof of Proposition 3.8. The moment duality of (N~(t),M~(t))t0 and (X~(t),Y~(t))t0 is a consequence of Theorem 3.3. The laws of (N~(t),M~(t))t0 and (N¯(t),M¯(t))t0 agree when restricted to the reduced state-space {0,1}×N0. We show the moment duality of (N¯(t),M¯(t))t0 and (X¯(t),Y¯(t))t0, which then allows us to conclude that the restricted laws of (X~(t),Y~(t))t0 and (X¯(t),Y¯(t))t0 also agree on {0,1}×[0,1]

Proof

We prove the claims in order of appearance.

The duality of (N¯(t),M¯(t))t0 and (X¯(t),Y¯(t))t0 can be shown using the respective generators: Define S((x,y),(n,m)):=Sx,y(n,m):=Sn,m(x,y):=xnym for (n,m){0,1}×N0 and (x,y){0,1}×[0,1]. Applying A¯ to Sn,m yields

A¯Sn,m(x,y)=y(ym-0nym)1l{0}(x)+(1-y)(0nym-ym)1l{1}(x)+K(x-y)xnmym-1=-(x-y)(ym-0nym)1l{0}(x)+(x-y)(0nym-ym)1l{1}(x)+K(x-y)xnmym-1=-n(x-y)ym+Km(x-y)xnym-1=Kmxn+1ym-1+(-Kmxn-nx)ym+nym+1

where we continue to use 00=1, the fact that n{0,1} and simply sorted the terms by powers of y for easier comparison in the last line.

In order to do the analogous calculation for (N¯(t),M¯(t))t0 we need its generator. Since G¯ is the conservative Q-matrix, the generator G is given by

G¯f(n,m):=(n¯,m¯){0,1}×N0G¯(n,m),(n¯,m¯)f(n¯,m¯)=Km(f(1,m-1)-f(n,m))+(f(0,m+1)-f(n,m))1l{1}(n)

for all f:{0,1}×N0R which are bounded. If we apply G to S as a function in (n,m){0,1}×N0, we get

G¯Sx,y(n,m)=Km(xym-1-xnym)+(ym+1-xym)1l{1}(n)=Km(xym-1-xym)1l{1}(n)=n+Km(xym-1-ym)1l{0}(n)=1-n+1(ym+1-xym)1l{1}(n)=n=Kmxym-1+(-Kmnx-Km(1-n)-nx)ym+nym+1.

A close look noting that for our choices of variables we have xn+1=x and nx+(1-n)=xn shows that

(x,y){0,1}×[0,1],(n,m){0,1}×N0:A¯Sn,m(x,y)=G¯Sx,y(n,m).

S is bounded and continuous. For any t0, (x,y){0,1}×[0,1] the functions Sx,y and {0,1}×N0(n,m)Ex,y[X¯(t)nY¯(t)m] are bounded. Furthermore, for any t0, (n,m){0,1}×N0, the functions Sn,m and {0,1}×[0,1](x,y)En,m[xN~(t)yM~(t)] are continuously differentiable on (0, 1) and continuous on [0, 1] in the second component and continuous due to the theorem of bounded convergence. Hence, all assumptions of Jansen and Kurt (2014, Prop. 1.2) hold and we have proven the duality.

Next, we want to prove the equality of (N~(t),M~(t))t0 and (N¯(t),M¯(t))t0 in distribution, if both processes have the same initial distribution (which then must be in the smaller space {0,1}×N0)). Recall that (PetG)t0 is the semi-group of (N~(t),M~(t))t0 from Definition 1.4. On the other hand, the semi-group of (N¯(t),M¯(t))t0 is given by (etG¯)t0. Since both are Markov processes it suffices to prove that they both have the same semi-group. Note that technically these semi-groups have different dimensions, so to be precise, we want to prove that the restriction of (PetG)t0 to the space {0,1}×N0 coincides with (etG¯)t0, i.e.

t0:((PetG)i,j)i,j{0,1}×N0=etG¯.

This will be true, because of the structure of G that reflects that the space {0,1}×N0 is absorbing for (N~(t),M~(t))t0. More precisely, we prove by induction that

(Gi,jk)i,j{0,1}×N0=G¯k 29

for all kN. Comparing the definitions of G in Definition 1.4 and of G¯ in Definition 3.6, we see that (29) holds for k=1. Assume this is true for some fixed kN. Then, for (n,m),(n¯,m¯){0,1}×N0

G(n,m),(n¯,m¯)k+1=(l1,l2)N02G(n,m),(l1,l2)1G(l1,l2),(n¯,m¯)k=(l1,l2){0,1}×N0G(n,m),(l1,l2)1=G¯(n,m),(l1,l2)1G(l1,l2),(n¯,m¯)k=G¯(l1,l2),(n¯,m¯)k=G¯(n,m),(n¯,m¯)k+1

where we used that G(n,m),(l1,l2)=0 if (l1,l2){0,1}×N0 in and then applied the induction assumption. Hence (29) does indeed hold for any kN. Hence, for every choice of (n,m),(n¯,m¯){0,1}×N0 and t0, recalling that PG=G,

(PetG)(n,m),(n¯,m¯)=kN0tkk!G(n,m),(n¯,m¯)k=kN0tkk!G¯(n,m),(n¯,m¯)k=(etG¯)(n,m),(n¯,m¯).

Therefore the processes (N~(t),M~(t))t0 and (N¯(t),M¯(t))t0 do indeed coincide in distribution if started in the same state (n,m){0,1}×N0.

Since we now in particular have the equality of the one-dimensional distributions, we can use the duality (28) and the duality given in Theorem 3.3 to obtain

Ex,yX~(t)nY~(t)m=En,mxN~(t)yM~(t)=En,mxN¯(t)yM¯(t)=Ex,yX¯(t)nY¯(t)m 30

for all t0 and all (x,y){0,1}×[0,1] and (n,m){0,1}×N0.

Recall from (26) that for any t>0 we have (X~(t),Y~(t)){0,1}×[0,1], Px,y-a.s., (x,y)[0,1]2. Since a distribution on {0,1}×[0,1] is uniquely determined by its moments of order (n,m){0,1}×N0, (30) implies that (X~(t),Y~(t))(X¯(t),Y¯(t)) for any t>0 (when started in the same (x,y){0,1}×[0,1]). Since they are both Markovian, this implies that the distributions of (X¯(t),Y¯(t))t0 and (X~(t),Y~(t))t0 coincide when started in the reduced state-space {0,1}×[0,1].

Combining these results we obtain the proof of Theorem 1.3.

Proof of Theorem 1.3

Theorem 3.3 already yields the existence of (X~(t),Y~(t))t0 as the limit in finite-dimensional distributions.

(5) is simply the observation of (27).

Hence we are left to prove that we can choose a process with the above properties (determined only by the distribution!) with nice path-properties.

Fix, (x,y)[0,1]2. Now, let (X¯(t),Y¯(t))t0 and (X¯(t),Y¯(t))t0 be independent copies of (X¯(t),Y¯(t))t0, starting at (0,y){0,1}×[0,1] and (1,y){0,1}×[0,1], respectively. Furthermore, let B be an independent Bernoulli random variable with success parameter x. With this, define the process

t0:(X~~(t),Y~~(t)):=B(X¯(t),Y¯(t))t0+(1-B)(X¯(t),Y¯(t))t0.

This process is cádlág (with a random initial distribution (By)). We now prove that (X~(t),Y~(t))t>0 and (X~~(t),Y~~(t))t>0 are equal in distribution. (Note that we claim this for t>0 only.) We prove this using duality. Recall that for t>0, and any (n,m)N02, Pn,m{N~(t){0,1}}=1 and we can therefore calculate

E[X~~(t)nY~~(t)m]=xE1,y[X¯(t)nY¯(t)m]+(1-x)E0,y[X¯(t)nY¯(t)m]=xE1,y[X~(t)nY~(t)m]+(1-x)E0,y[X~(t)nY~(t)m]=xEn,m[1N~(t)yM~(t)]+(1-x)En,m[0N~(t)yM~(t)]=En,m[(x+(1-x)1l{N~(t)=0})yM~(t)]=En,m[xN~(t)yM~(t)]=Ex,y[X~(t)nY~(t)m].

Here, we used Proposition 3.8 in the second equality, the duality between (X~(t),Y~(t))t0 and (N~(t),M~(t))t0 from Theorem 3.3 in the third and last equality, and the observation, that Pn,m{N~(t){0,1}}=1, in the fifth equality. Since (n,m)N02 was arbitrary, we have shown that for every t>0, (X~~(t),Y~~(t)) and (X~(t),Y~(t)) are equal in distribution. Since both processes are time-homogeneous Markov processes, this implies the claim. Thus, the process (X^(t),Y^(t))t0, defined as (X^(0),Y^(0)):=(x,y) and

t0:(X^(t),Y^(t)):=(X~~(t),Y~~(t)),

is cádlág for all t>0 and coincides in distribution with (X~(t),Y~(t))t0 started in (X~(0),Y~(0))=(x,y).

Remark 3.9

(Imbalanced Island size: Part 2) We return to the example discussed in Remark 2.4 of the two-island model and its close relation to the seed bank model. The frequency process of the given allele is then described by the two-island diffusion (Kermany et al. 2008),

dX(t)=c(Y(t)-X(t))dt+αX(t)(1-X(t))dB(t),dY(t)=cK(X(t)-Y(t))dt+αY(t)(1-Y(t))dB(t), 31

where (B(t)t0 and (B(t))t0 are independent Brownian Motions.

Again, the interesting consideration here is to use different scalings of the coalescence rates in the islands, i.e. different scalings for α0 and α0. If, in addition to c0 we assume the coalescence rate α>0 in the second island to scale as c, i.e. α/c1, the result is a two-island model with instantaneous coalescences in the first island, but otherwise regular migration and diffusive behaviour in the second. For more precision, denote by (Xc,α(t),Yc,α(t))t0 the two-island diffusion with migration rate c>0 and island 2 of size α>0 and assume that it starts at some (x,y)[0,1]2, P-a.s.. Repeating the calculations we did for the seed bank model, it can be shown that the sequence (Xcκ,ακ(t),Ycκ,ακ(t))t0 will converge to a Markovian degenerate limit coinciding in distribution with a Markov process with generator

L^f(x,y)=(1-x)y(f(1,y)-f(x,y))+x(1-y)(f(0,y)-f(x,y))+K(x-y)fy(x,y)+12y(1-y)2y2f(x,y)

for functions f in {f:{0,1}×[0,1]Rf(0,·),f(1,·)C2([0,1],R) whenever started in the smaller state-space {0,1}×[0,1].

Acknowledgements

JB and MWB were supported by DFG Priority Programme 1590 “Probabilistic Structures in Evolution”, Project BL 1105/5-1, EB and AGC by the Berlin Mathematical School.

Funding Information

Open Access funding enabled and organized by Projekt DEAL.

Footnotes

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  1. Birkner M, Blath J, Eldon B. An ancestral recombination graph for diploid populations with skewed offspring distribution. Genetics. 2013;193(1):255–290. doi: 10.1534/genetics.112.144329. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Blath J, González Casanova A, Kurt N, Wilke Berenguer M. The seed bank coalescent with simultaneous switching. Electron J Probab. 2020;25:27. doi: 10.1214/19-EJP401. [DOI] [Google Scholar]
  3. Blath J, Buzzoni E, Koskela J, Wilke Berenguer M. Statistical tools for seed bank detection. Theor Popul Biol. 2020;132:1–15. doi: 10.1016/j.tpb.2020.01.001. [DOI] [PubMed] [Google Scholar]
  4. Blath J, González Casanova A, Kurt N, Spanò D. The ancestral process of long-range seed bank models. J Appl Probab. 2013;50(3):741–759. doi: 10.1239/jap/1378401233. [DOI] [Google Scholar]
  5. Blath J, González Casanova A, Eldon B, Kurt N, Wilke Berenguer M. Genetic variability under the seedbank coalescent. Genetics. 2015;200(3):921–934. doi: 10.1534/genetics.115.176818. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Blath J, González Casanova A, Kurt N, WilkeBerenguer M. A new coalescent for seed-bank models. Ann Appl Probab. 2016;26(2):857–891. doi: 10.1214/15-AAP1106. [DOI] [Google Scholar]
  7. Blath J, Buzzoni E, González Casanova A, Wilke Berenguer M. Structural properties of the seed bank and the two island diffusion. J Math Biol. 2019;79(1):369–392. doi: 10.1007/s00285-019-01360-5. [DOI] [PubMed] [Google Scholar]
  8. Bobrowski A. Singular perturbations involving fast diffusion. J Math Anal Appl. 2015;427(2):1004–1026. doi: 10.1016/j.jmaa.2015.02.029. [DOI] [Google Scholar]
  9. Cano R, Borucki M. Revival and identification of bacterial spores in 25- to 40-million-year-old dominican amber. Science (New York, NY) 1995;268:1060–1064. doi: 10.1126/science.7538699. [DOI] [PubMed] [Google Scholar]
  10. Chung KL (1960) Markov chains with stationary transition probabilities. Grundlehren der Mathematischen Wissenschaften, 1st edn, vol. 104. Springer, Berlin
  11. Cohen D. Optimizing reproduction in a randomly varying environment. J Theor Biol. 1966;12:119–129. doi: 10.1016/0022-5193(66)90188-3. [DOI] [PubMed] [Google Scholar]
  12. Endo H, Inoue M. Dormancy in cancer. Cancer Sci. 2019;110(2):474–480. doi: 10.1111/cas.13917. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Epstein SS. Microbial awakenings. Nature. 2009;457(7233):1083. doi: 10.1038/4571083a. [DOI] [PubMed] [Google Scholar]
  14. Etheridge A (2011) Some mathematical models from population genetics. Lecture notes in mathematics, vol 2012. Springer, Heidelberg. Lectures from the 39th Probability Summer School held in Saint-Flour, 2009, École d’Été de Probabilités de Saint-Flour. [Saint-Flour Probability Summer School]
  15. Ethier SN, Kurtz TG (1986) Markov processes: characterization and convergence. In: Wiley series in probability and mathematical statistics: probability and mathematical statistics. Wiley, New York
  16. Fisher RA, Gollan B, Helaine S. Persistent bacterial infections and persister cells. Nat Rev Microbiol. 2017;15(8):453. doi: 10.1038/nrmicro.2017.42. [DOI] [PubMed] [Google Scholar]
  17. Greven A, den Hollander WTF, Oomen M (2020) Spatial populations with seed-bank: well-posedness, duality and equilibrium. arXiv:2004.14137(Preprint)
  18. Herbots HM (1994) Stochastic models in population genetics: genealogical and genetic differentiation in structured populations. PhD thesis, University of London
  19. Hildebrandt TH, Schoenberg IJ. On linear functional operations and the moment problem for a finite interval in one or several dimensions. Ann Math. 1933;34(2):317–328. doi: 10.2307/1968205. [DOI] [Google Scholar]
  20. Jansen S, Kurt N. On the notion(s) of duality for Markov processes. Probab Surv. 2014;11:59–120. doi: 10.1214/12-PS206. [DOI] [Google Scholar]
  21. Johnson SS, Hebsgaard MB, Christensen TR, Mastepanov M, Nielsen R, Munch K, Brand T, Gilbert MT, Zuber MT, Bunce M, Ronn R. Ancient bacteria show evidence of DNA repair. Proc Natl Acad Sci. 2007;104(36):14401–14405. doi: 10.1073/pnas.0706787104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Kaj I, Krone SM, Lascoux M. Coalescent theory for seed bank models. J Appl Probab. 2001;38(2):285–300. doi: 10.1239/jap/996986745. [DOI] [Google Scholar]
  23. Kallenberg O. Foundations of modern probability. Probability and its applications. 2. New York: Springer; 2002. [Google Scholar]
  24. Kermany ARR, Zhou X, Hickey DA. Joint stationary moments of a two-island diffusion model of population subdivision. Theor Popul Biol. 2008;74(3):226–232. doi: 10.1016/j.tpb.2008.07.004. [DOI] [PubMed] [Google Scholar]
  25. Kurtz TG. A limit theorem for perturbed operator semigroups with applications to random evolutions. J Funct Anal. 1973;12:55–67. doi: 10.1016/0022-1236(73)90089-X. [DOI] [Google Scholar]
  26. Kurtz TG. Random time changes and convergence in distribution under the meyer-zheng conditions. Ann Probab. 1991;19:1010–1034. [Google Scholar]
  27. Lennon JT, Jone SE. Microbial seed banks: the ecological and evolutionary implications of dormancy. Nat Rev Microbiol. 2011;9:119. doi: 10.1038/nrmicro2504. [DOI] [PubMed] [Google Scholar]
  28. Marx V. How to pull the blanket off dormant cancer cells. Nat Methods. 2018;15:249–252. doi: 10.1038/nmeth.4640. [DOI] [PubMed] [Google Scholar]
  29. Meyer P-A, Zheng WA. Tightness criteria for laws of semimartingales. Ann Inst Henri Poincaré Probab Stat. 1984;20(4):353–372. [Google Scholar]
  30. Möhle M. A convergence theorem for Markov chains arising in population genetics and the coalescent with selfing. Adv Appl Probab. 1998;30(2):493–512. doi: 10.1239/aap/1035228080. [DOI] [Google Scholar]
  31. Möhle M, Notohara M. An extension of a convergence theorem for Markov chains arising in population genetics. J Appl Probab. 2016;53(3):953–956. doi: 10.1017/jpr.2016.54. [DOI] [Google Scholar]
  32. Moran PAP. The theory of some genetical effects of population subdivision. Aust J Bio Sci. 1959;12(2):109–116. doi: 10.1071/BI9590109. [DOI] [Google Scholar]
  33. Morono Y, Ito M, Hoshino T, et al. Aerobic microbial life persists in oxic marine sediment as old as 101.5 million years. Nat Commun. 2020;11:3626. doi: 10.1038/s41467-020-17330-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Notohara M. The coalescent and the genealogical process in geographically structured population. J Math Biol. 1990;29(1):59–75. doi: 10.1007/BF00173909. [DOI] [PubMed] [Google Scholar]
  35. Shiga T, Shimizu A. Infinite-dimensional stochastic differential equations and their applications. J Math Kyoto Univ. 1980;20(3):395–416. [Google Scholar]
  36. Shoemaker WR, Lennon JT. Evolution with a seed bank: the population genetic consequences of microbial dormancy. Evol Appl. 2018;11(1):60–75. doi: 10.1111/eva.12557. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Skorohod AV. Limit theorems for stochastic processes. Teor Veroyatnost i Primenen. 1956;1:289–319. [Google Scholar]
  38. Wakeley J. Coalescent theory: an introduction. Greenwood Village: Roberts & Company Publishers; 2009. [Google Scholar]
  39. Whitt W. Stochastic-process limits: an introduction to stochastic-process limits and their application to queues. Springer series in operations research. New York: Springer; 2002. [Google Scholar]
  40. Wright S. Evolution in Mendelian populations. Genetics. 1931;16(2):97–159. doi: 10.1093/genetics/16.2.97. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from Journal of Mathematical Biology are provided here courtesy of Springer

RESOURCES