Abstract
The goal of this article is to contribute towards the conceptual and quantitative understanding of the evolutionary benefits for (microbial) populations to maintain a seed bank consisting of dormant individuals when facing fluctuating environmental conditions. To this end, we discuss a class of ‘2-type’ branching processes describing populations of individuals that may switch between ‘active’ and ‘dormant’ states in a random environment oscillating between a ‘healthy’ and a ‘harsh’ state. We incorporate different switching strategies and suggest a method of ‘fair comparison’ to incorporate potentially varying reproductive costs. We then use this concept to compare the fitness of the different strategies in terms of maximal Lyapunov exponents. This gives rise to a ‘fitness map’ depicting the environmental regimes where certain switching strategies are uniquely supercritical.
Keywords: Dormancy, Persistence, Bienaymé–Galton–Watson process, Branching process, Seed bank, Fitness
Introduction
Biological motivation
Dormancy is an evolutionary trait that comes in many guises and has evolved independently multiple times across the tree of life. In particular, it is ubiquitous among microbial communities. As a general definition, we say that an individual exhibits a dormancy trait if it is able to enter a reversible state of vanishing metabolic activity. It has been observed that a large fraction of microbes on earth is currently in a dormant state, thus creating seed banks consisting of inactive individuals (see e.g. Lennon and Jones 2011; Shoemaker and Lennon 2018; Lennon et al. 2020 for recent overviews). A common ecological and evolutionary explanation for the emergence of the corresponding complex dormancy traits is that the maintenance of a seed bank serves as a bet-hedging strategy to ensure survival in fluctuating and potentially unfavourable environmental conditions. Recent theory has also shown that dormancy traits can already be beneficial in competing species models in the presence of sufficiently strong competitive pressure for limited resources (even in otherwise stable environments) (see Blath and Tobiás 2020). However, maintaining a dormancy trait is costly and comes with a substantial trade-off: For example, microbes need to invest resources into resting structures and the machinery required for switching into and out of a dormant state, which are then unavailable for reproduction.
Dormancy also has implications for the pathogenic character of microbial communities and plays an important role in human health. For example, dormancy in the form of persister cells can lead to chronic infections, since these cells can withstand antibiotic treatment (Balaban et al. 2004; Fisher et al. 2017; Lewis 2010). Further, dormancy, both on the level of individual cells as well as the tumor level, plays a crucial role in cancer dynamics (Endo and Inoue 2019). In all of the above situations external treatment can be seen as a form of environmental stress for the pathogens.
Hence, improving the conceptual and quantitative understanding of the mechanisms leading to fitness advantages for individuals with a dormancy trait in fluctuating environments, incorporating the potentially different costs of forming active and dormant offspring (and potential reproductive trade-offs due to the maintenance of dormancy traits), seems to be a worthwhile task.
Deterministic versus stochastic models and known results
In the last two decades, dormancy-related population dynamic modeling based on deterministic dynamical systems has expanded rather rapidly, often with a focus on phenotypic plasticity in microbial communities (see e.g. Bär et al. 2002; Balaban et al. 2004; Kussell et al. 2005; Kussell and Leibler 2005; Malik and Smith 2008; Fowler and Winstanley 2018). The important paper by Balaban et al. (2004) for example describes ‘persistence’ (which can be seen as a form of dormancy) as a phenotypic switch, and several of the above papers deal with models incorporating various switching strategies and fluctuating environments. Kussell et al. (2005) consider periodic antibiotic treatment (and also treat a stochastic version of their model via simulation), and Kussell and Leibler (2005) incorporate randomly changing environments, however under the condition that the random changes are slow. Fitness is typically measured in terms of the (maximal) Lyapunov exponents of the underlying dynamical systems, which is often difficult to evaluate analytically in the presence of random environments. Kussel and Leibler approximate the Lyapunov exponent under a ‘slow environment condition’, reducing the model to an essentially one-dimensional system, which is a strategy that we will meet again in different forms later on. These models have been taken up in a mathematical article by Malik and Smith (2008), who provide a set of rigorous results regarding the maximal Lyapunov exponents of dynamical systems explicitly incorporating dormancy, considering both stochastic and responsive switches in (periodically) changing environments. They also compare these to the corresponding results for populations without dormancy trait (so-called ‘sleepless population’). However, for truly random environments, they do not provide exact representations for the maximal Lyapunov exponents and instead give relatively simple (yet useful) bounds.
Recently, also stochastic (individual-based) models for seed banks and dormancy have gained increasing attention, in particular in population genetics (Kaj et al. 2001; Blath et al. 2015, 2016, 2020). However, these models are mainly concerned with genealogical implications of seed banks and typically require constant population size (without random environment). In stochastic population dynamics, while there are interesting recent simulation studies such as Locey et al. (2017), rigorous mathematical modeling and results are still relatively rare. Here, a suitable framework for individual-based seed bank models with fluctuating population size is given by the theory of multi-type branching processes (in random environments). Indeed, dormancy has been described in a brief example in the book by Haccou et al. (2007, Example 5.3) as a 2-type branching process, which served as a motivation for this paper. For quiescence (which is a similar concept as dormancy), a multi-type branching-process model has been proposed in Alarcón and Jensen (2011), including a simulation study. On the theoretical side, in the context of phenotypic switches, Dombry et al. (2011) and Jost and Wang (2014), again building on motivation from Kussell and Leibler (2005), have developed a branching-process based framework for phenotypic plasticity and obtained interesting rigorous results on the optimality of switching strategies in random environments. However, their set-ups and results, though closely related, do not focus on dormancy, and only partially cover the reproductive and switching strategies that we are going to discuss in Sect. 2.2 (we will explicitly comment on the differences with respect to our model and results as appropriate). They are able to determine Lyapunov exponents in random environments, but similarly to Kussell and Leibler (2005) under a condition which essentially restricts the problem to a one-dimensional system.
Modeling approach and organization of the present paper
Our approach, motivated by the example in Haccou et al. (2007), is based on a 2-type branching process with denoting the active and denoting the dormant individuals at time/generation n, which we embed in a random environment that is described by a stochastic process and governs the respective sequence of random reproductive laws . As in Kussell and Leibler (2005), Malik and Smith (2008), Dombry et al. (2011), we will discuss results related to both stochastic/spontaneous as well as responsive switching strategies. Further, we also consider mixed strategies. We aim for explicit results under ‘fair comparison’ regarding resource limitations including potentially different costs for active and dormant offspring, and also in comparison with a 1-type branching process without dormancy trait (‘sleepless case’), expressing qualitative and quantitative fitness advantages in terms of the maximal Lyapunov exponent.
By providing a model tailored to dormancy in a random environment, we close a gap related to the multi-type branching process models and results provided in Dombry et al. (2011) and Jost and Wang (2014) related to phenotypic switching, which only partially cover our dormancy models and results. We also extend and refine results of Malik and Smith (2008) which explicitly model dormancy in the deterministic dynamical systems case, but with a smaller set of switching strategies and few results for truly random environments. Additionally, we pay particular attention to reproductive costs related to dormancy.
All models and results will be introduced and discussed in Sect. 2. We observe that there are natural parameter regimes under which either the spontaneous or the responsive switching strategies, or even a mixture of strategies, will be fit, while all the others are detrimental. We discuss the corresponding parameter regimes in detail and illustrate them in certain important cases, see e.g. Figs. 3, 4 and 5. This shows that already in our relatively simple random environment (involving only two states), dormancy leads to a rather rich picture regarding the long-term behaviour of the embedded branching processes.
However, while our results will capture several prototypical scenarios corresponding to both stochastic/spontaneous and responsive switching (and mixtures), we are still far from being able to provide a mathematically complete classification in the full space of switching strategies. One theoretical reason for this is that computing the maximal Lyapunov exponent of a random multiplicative sequence of positive matrices, which is the mathematical core of the problem, is infeasible in general (see e.g. Bougerol and Lacroix (1985); Ledrappier (1984) for an overview of the mathematical theory), and works only if the underlying matrices exhibit additional algebraic properties.
Hence, a further aim of this paper is to provide a small review of current methods to compute/estimate maximal Lyapunov exponents. The reason which allows Dombry et al. (2011) and Jost and Wang (2014) to treat the responsive switching regime in their papers is that their assumptions reduce the system to an essentially one-dimensional case, which interestingly has a similar effect as the ‘slow variation assumption’ of Kussell and Leibler (2005), and we will investigate similar cases. While the exponents are easily accessible for the corresponding ‘rank-1 matrices’ (and, of course, scalars), spontaneous/stochastic switching strategies can a priori involve both ‘rank-1’ and ‘rank-2 matrices’. We will see that while special cases of the stochastic switching regime can again be treated with the rank-1 approach, for the general stochastic switching regime involving rank-2 matrices, techniques used in Malik and Smith (2008) are available, which lead at least to bounds on the Lyapunov exponents. We also provide further bounds and estimators. These theoretical considerations can be found in Sect. 3.
Finally, in Sect. 4, we discuss some open questions and potential further steps in modeling and analysis of dormancy and seed banks in random environments, from a somewhat theoretical perspective.
Models and main results
Recall that for a classical (1-type) Bienaymé–Galton–Watson process, say , it is assumed that individuals die and reproduce independently of each other according to some given common offspring distribution on . We extend this framework by introducing a second component that acts as a reservoir of dormant individuals, also referred to as seed bank. Moreover, we allow the offspring distribution in each generation n to depend on the state of a random environment process . This gives rise to a particular class of 2-type Bienaymé–Galton–Watson processes in random environment that we introduce formally in Definition 2.1 and which will be the main object of study in this paper. However, later on, we will also discuss more general p-type branching processes (for ) in random environments, so that we will first introduce the corresponding general notation, which is standard in the theory of multi-type branching processes.
Notation
Let be a stationary and ergodic Markov chain on some probability space taking values in some measurable space and denote by its stationary distribution. Such a process will be called random environment process. For , we write to denote the space of probability measures on , and set . Elements of will be interpreted as the collection of the p offspring distributions on (one for each type). An infinite sequence generated by and a random variable will be called sequence of random offspring distributions with respect to the environment process . Finally, a sequence of -valued random variables will be called a p-type Bienaymé–Galton–Watson process in random environment (p-type BGWPRE), if is independent of and if for each given realization of E (and thus also of ) the process is a Markov chain whose law satisfies
for every and , where the are independent random variables taking values in , and for each , the are identically distributed according to . In the language of branching processes, if the state of the environment at time n is , then each of the individuals of type i alive at time n produces offspring according to the probability distribution , independent of the offspring production of all the other individuals. For notational clarity, we will often write to denote the random variable Q that is used in the definition of a branching process Z.
We are now ready to define the class of branching processes modeling dormancy:
Definition 2.1
With the above notation (for ), a 2-type BGWPRE will be called a Bienaymé–Galton–Watson process with dormancy in random environment , abbreviated BGWPDRE, if, -almost surely,
2.1 |
Particles of type 1 are called active and particles of type 2 are called dormant.
Note that Condition (2.1) ensures that a (dormant) type 2 particle can either switch its state to type 1 (active), remain in the dormant state 2, or die—no other transitions are possible. There is no restriction on the offspring reproduction of active type 1 particles other than the following first moment condition.
Throughout we assume, for -a.e. realization of and any , that the distribution is such that the corresponding random variables distributed according to satisfy for all . Moreover, we write
to denote the expected number of offspring of type j produced by a single particle of type i in generation n in the environment , and we denote by
2.2 |
the corresponding mean matrix. Suppose that, for any , the matrix is irreducible. Then, by the Perron–Frobenius-Theorem, the spectral radius of is a simple eigenvalue with for any (other) eigenvalue of .
Branching processes with dormancy in constant environment
As a gentle warm-up, we first compare the survival probabilities and extinction times of a classical 1-type BGWP with those of a 2-type BGWPDRE in the absence of environmental fluctuations (in this case, we use the abbreviation BGWPD). For simplicity, we further restrict ourselves to the binary branching case (following the set-up of Example 5.3 in Haccou et al. 2007), which can be thought of as a model for bacterial reproduction via binary fission and sporulation as exhibited e.g. by Bacillus subtilis, and summarize several standard results (that nevertheless will be proved in the appendix for the reader’s convenience). These results will then serve as a motivation and reference point for our later results involving fluctuating environments, which will also deal with more general reproductive mechanisms.
Let , and be a 1-type BGWP with offspring distribution , where denotes the Dirac measure. This mechanism can be seen as a caricature of reproduction via cell division: Every individual in each generation independently either splits in two (cell division) with probability p or dies with probability .
Furthermore, for , and , let and be a 2-type BGWPD with offspring distribution, , given by
Figure 1 illustrates the model. The parameters can be interpreted as follows: is the probability with which an active individual either exhibits a reproductive or a switching event. In this case, a binary split will happen with probability b (reproduction), and a switching event into a dormant state (e.g. by sporulation) with probability . Note that a ‘switch’ can be thought of as the simultaneous death of an active individual and the corresponding birth of a dormant individual. With probability , an active individual will die. Dormant individuals resuscitate (“wake up”) with probability w, and die with probability d, otherwise they stay in their dormant state (with probability ). Note that for and , the active component of equals in distribution. Hence, can be seen as a way of incorporating a reproductive trade-off that arises from the maintenance costs of the dormancy trait, delivering reduced splitting and increased death probability in comparison to the 1-type model. Additionally, note that in our model the potential to switch into dormancy also reduces the reproductive capability, since entering the seed bank is only possible during a ‘reproduction-or-switching event’ at a chance of .
From the following more general result, which we will prove in the Appendix, we obtain a comparison of long-term survival behaviour of X and Z.
Proposition 2.2
Let be a 1-type BGWP, and a BGWPD with and . Assume that the offspring distributions and , respectively, are of finite variance with and . Set and denote by
the extinction times and survival probabilities of Z and X, respectively. If
2.3 |
for all , then the following statements hold:
If , then .
If , then and .
- If , then . However, can be chosen in such a way that , and for some
2.4
Condition (2.3) ensures that the total amount of offspring of active individuals in Z is stochastically dominated by the amount of offspring in X.
Proposition 2.2 shows that—at least in the simple binary model—in the super-critical case (i.e. ) maintaining a seed bank always leads to a decreased survival probability. Indeed, the reproductive trade-off, incorporated by the penalty , is always detrimental. The same holds for the critical regime ( and ), where both processes always go extinct: Here, the expected time to extinction is even finite for the 2-type process Z.
However, in the sub-critical regime (3), while both processes do go extinct with probability 1, for small w and d (i.e. and ) the population with dormancy trait can be more likely to survive for extended periods of time, since by (2.4) for , i.e. . This is in line with basic intuition, since for small w and d individuals spend a long time in the dormant state delaying extinction. Notably, the condition on w and d can be given explicitly, i.e. (3) holds whenever
This suggests that the ‘prolonged survival in the sub-critical regime’ effect could lead to a fitness advantage in the presence of a random environment, fluctuating between a healthy (super-critical) and a harsh (sub-critical) scenario, even if the dormancy trait exhibits a reproductive trade-off in the healthy scenario, since dormancy could potentially compensate for this during harsh times by delaying extinction. A central goal of this article is to identify and classify scenarios in which this is indeed the case. We thus now extend our models to incorporate such a fluctuating environment.
Branching processes with dormancy in randomly fluctuating environment
As indicated by Proposition 2.2 above, evolutionary fitness advantages resulting from a dormancy trait may be expected to manifest themselves in the presence of a random environment, where prolonged survival times may help to survive during harsh times. Here, we even expect that dormancy may turn an otherwise (overall) sub-critical process into a super-critical one, even in the presence of reproductive trade-offs. We now introduce a simple model for a fluctuating environment, randomly oscillating between two states “1” and “2”, where “1” corresponds to healthy and “2” to harsh conditions, which is identical to the environment given as an example in Dombry et al. (2011, Sections 1.1.1 and 1.1.2.).
Definition 2.3
(Binary random environment). Let and . Define a discrete-time homogeneous Markov chain with values in via the transition matrix
where and denote the environmental switching probabilities. Further, denote by the stationary distribution of and let .
Remark 2.4
The initial condition for as well as the assertion ensure that is stationary and ergodic. Hence, the process is a random environment process as defined at the beginning of this section, taking only two values.
In the remainder of this section we will not give complete definitions of any further BGWPDRE’s. We will only be interested in the mean matrices, while the exact offspring distributions will typically be irrelevant. We will also restrict our attention to the binary random environment from Definition 2.3. As indicated in (2.2), for environmental states , these matrices will be given by
where and represent the average amount of active and dormant offspring of active individuals respectively, while and denote resuscitation (‘waking’) and death probabilities of dormant individuals. Our main results will concern the following three particular strategies.
Definition and Remark 2.5
(Switching strategies). The following idealized switching strategies represent important special cases that have been discussed in the literature, see e.g. Malik and Smith (2008), Lennon and Jones (2011), Dombry et al. (2011). In particular, one distinguishes between responsive switching, triggered by environmental conditions, and spontaneous or stochastic switching, which is assumed to happen in each individual with a certain probability, independently of the environment and the behaviour of other individuals.
- Responsive switching: We consider the case where individuals behave “optimally” in the sense that they invest all their resources into the production of active individuals during the healthy environmental spells (choosing , ), while during harsh environmental conditions they invest everything into dormant offspring (choosing ). Hence, in this case, the offspring mean matrices are of the form
for some and . - Stochastic switching: Here, the population assumes a reproductive strategy which is independent of its environmental state. This can be modeled by choosing and for and . That way, , which means that active individuals split up their resources into the production of active and dormant offspring in the same way in both environments. Then, the offspring mean matrices for equate to
Note that represents the effect of harsh environments: Since , the expected number of offspring of active individuals is reduced in harsh environments. For , all active individuals are killed immediately during such conditions, without producing any offspring. - Preliminary (or anticipatory) switching: In this strategy, we consider individuals that invest all their resources into the production of dormant individuals during the healthy environmental spells (choosing ), while during harsh environmental conditions they invest everything into active offspring (choosing , ). This can be understood as counter-intuitive responsive switching, always prefering the unfavorable type in reproduction, in anticipation of a coming environmental change. The corresponding offspring mean matrices, for some and , are given by
In population genetic models with seed bank, recently, similar types of switching have been distinguished (spontaneous vs. simultaneous switching) and these lead to topologically different limiting coalescent models describing the ancestry of a sample (see Blath et al. 2015, 2016, 2020). Here, in the presence of a random environment, we will see that the right choice of switching strategy can lead to qualitative fitness advantages, depending on the distribution of the environmental process.
Of course, less extreme variants, or even mixtures, of the above switching strategies should be interesting in practice. For example, as reported in van Vliet (2015) and Sturm and Dworkin (2015), phenotypic diversity in Bacillus subtilis with respect to the ‘exit from dormancy mechanisms’ seems to combine stochastic switching of some individuals with responsive switching due to environmental cues of others on the population level. However, the special form of the mean matrices in these ‘pure’ strategies makes it possible to explicitly compute or obtain suitable bounds on the corresponding maximal Lyapunov exponents, which are crucial to assess and compare the fitness of the corresponding BGWDREs, as we will see later. Interestingly, these building blocks will also allow the analysis of certain mixtures of strategies (Remark 2.21)
Remark 2.6
(Comparison to multi-type branching process models considered in Dombry et al. (2011) and Jost and Wang (2014)). Note that the model in Definition 2.1 is closely related to a very general multi-type branching process in random environment (MBPRE) modeling phenotypic diversity (with many, even a continuum of, possible types) considered in Dombry et al. (2011), and Jost and Wang (2014), who themselves are inspired by the earlier work of Kussell and Leibler (2005). However, in their models, the authors follow a two step procedure, where in a first step each particle gives birth to a random amount of offspring (depending on its type and the state of the environment), and then in a second step, independently of that amount, the offspring particles are fitted (individually) with their new phenotypes. As the authors point out, this clearly disentangles the birth and “migration” (between phenotypes) phases. The BGWPDRE-model that we propose here is tailored to dormancy and does not disentangle these steps. This has consequences for the possible switching strategies. In fact it turns out that some of our reproductive strategies in Definition 2.5 are not covered by the framework of Dombry et al. (2011) and Jost and Wang (2014). For example, they do not cover the case that active offspring in the healthy environment can either split into two active offspring (cell division), or switch to a dormant state (e.g. by sporulation), as in the example of Sect. 2.1, since the phenotype distribution in this case would have to be allowed to depend on the number of offspring of the parent, see also Remark 3.3.
Remark 2.7
(Comparison to switching strategies employed in Malik and Smith (2008)). Malik and Smith in Malik and Smith (2008) consider a related, but less general switching model. Again, there are two possible environmental states, however, the bad environment here always completely prevents the reproduction of active individuals. Exact analytic expressions for the Lyapunov exponents are obtained only for the case where the environment is deterministic. In the random environment case, still some bounds are provided. It turns out that we can adapt the corresponding methods to obtain bounds for Lyapunov exponents of our models (cf. Remark 3.7).
Asymptotic growth of BGWPDREs and Lyapunov exponents
Of particular interest is the asymptotic behaviour of the process Z. It is well known, cf. Kaplan (1974), that
The study of such products of random matrices has a long and venerable history dating back to first results by Furstenberg and Kesten (1960). For stationary and ergodic sequences of non-negative matrices satisfying , where and is a matrix norm, Kingman (1973), see also Oseledec (1968), proved that, by means of the subadditive ergodic theorem,
2.5 |
exists -a.s. and satisfies
Note that, by norm equivalence, is independent of the chosen matrix norm. The limit, , is called maximal Lyapunov exponent.
Remark 2.8
(Exact computation of Lyapunov exponents). There are only a few cases where the maximal Lyapunov exponent can be computed explicitly. For instance, if is a stationary and ergodic process of positive matrices. Then, recalling that denotes the spectral radius of the matrix M, trivially for all , and if an application of Birkhoff’s ergodic theorem yields, -a.s.,
2.6 |
A further simple case is given by a sequence of stationary and ergodic matrices with such that the matrices are either mutually diagonizable, i.e. for all , or upper (lower) triangular. Then, -a.s.,
Further cases in which the maximal Lyapunov exponent can be computed explicitly are discussed in Key (1987). For the general case, where the computation of is infeasible, there are various strategies for giving bounds known in the literature, see also Crisanti et al. (1993). We will discuss and employ possible methods in Sect. 3.2.
Remark 2.9
(Approximation of Lyapunov exponents). Under certain further assumptions on the stationary and ergodic sequence, , of non-negative matrices with , Key (1990) proved that, -a.s. and in mean,
for any one-homogeneous, non-negative, super-multiplicative function, f, such that . By defining
then it follows from the sub-multiplicativity of , the super-multiplicativity of f, and the stationarity of the sequence that increases monotonically to , whereas decreases monotonically to . Although this provides an easy way to derive upper and lower bounds on the maximal Lyapunov exponent, the computational effort increases exponentially in k. For i.i.d. sequences Pollicott (2010) and Jurga and Morris (2019) established efficient approximation schemes with super-exponential convergence rates, see also Protasov and Jungers (2013) for further bounds.
Remark 2.10
(Lyapunov exponent and long-term behaviour). Notice that for a p-type BGWPRE, Z, the sequence of mean matrices, , forms a stationary and ergodic process. Thus, provided that , the corresponding maximal Lyapunov exponent, , exists describing the asymptotic rate of growth/decay of the expected value of Z.
The almost sure behaviour of the process Z has also been studied intensively. For instance, if Z is a p-type BGWPRE such that for all and , then it follows from Tanny (1981, Theorem 9.10) that, for almost all realizations of the environment, implies that Z becomes extinct almost surely, whereas for there exists a positive probability that Z never becomes extinct. Moreover, conditioned on survival, we have that almost surely as . In particular, the almost sure growth of the stochastic switching model, cf. Definition 2.5-b), conditioned on non-extinction, is given by .
Actually, Tanny established in (1981, Theorem 9.6 and Theorem 9.10) a classification theorem for more general multi-type BGWPRE with non-negative mean matrices satisfying certain regularity conditions. Notice that these conditions are not satisfied by our responsive and preliminary switching model, cf. Definition 2.5-a). However, due to the particular structure that allows a reduction of this BGWPDRE to a 1-type BGWPRE (cf. Dombry et al. 2011, Proposition 7), an analogous classification theorem can then be deduced from Tanny (1981, Theorem 9.6), see also Athreya and Karlin (1971a, 1971b).
Remark 2.11
(Lyapunov exponent, fitness and survival-probability). The previously mentioned features of the maximal Lyapunov exponent, describing various growth properties of population models, justifies the use of as a measure of fitness of population models, as is common in the literature. However, there is no direct monotone relationship between and the survival probability of the underlying population in the super-critical case, as the following example confirms: Taking the setting from Sect. 2.1, choosing X with parameter , one can compute that and . Then, for Z letting , , , and it holds that
However (cf. (5.1) below), . Hence, the comparison of Lyapunov exponents of distinct population models does not necessarily provide a complete picture of the advantages of one growth strategy over the other.
This phenomenon has been studied in more detail by Jost and Wang (2014), where the authors illustrate that different optimization criteria (i.e. largest growth rate vs. smallest extinction probability) can lead to distinct optimal strategies.
We now move on to some of the main results of this paper. Indeed, the Lyapunov exponent for a BGWPDRE with responsive and preliminary switching strategy can be computed explicitly:
Theorem 2.12
(Lyapunov exponent of the responsive switcher). Let be a BGWPDRE with environment process from Definition 2.3, following the responsive switching regime in Definition 2.5-a). Then, -a.s.,
Theorem 2.13
(Lyapunov exponent of the preliminary switcher). Let be a BGWPDRE with environment process from Definition 2.3, following the preliminary switching regime in Definition 2.5-c). Then, -a.s.,
A proof will be provided in the next section. For the stochastic switcher we obtain the following analytic result under the additional assumption that the determinant of the mean matrices vanishes:
Theorem 2.14
(Lyapunov exponent of the stochastic switcher). Let be a BGWPDRE with environment process from Definition 2.3, following the stochastic switching regime in Definition 2.5-b) with Then, -a.s.,
2.7 |
These results are closely related to results in Dombry et al. (2011), considering that mean matrices of determinant zero correspond to the non-hereditary-with-sensing case therein (cf. Sect. 3.1 for the proof and further remarks). In the hereditary case, i.e. the case of non-zero determinants, neither (Dombry et al. 2011) nor the article at hand obtain an explicit result for . However, various bounds will be discussed in Sect. 3.2. We provide one of them here in a special case, for illustration:
Theorem 2.15
Let be a BGWPDRE with environment process from Definition 2.3, following the stochastic switching regime in Definition 2.5 with , and . Then, -a.s.,
Note that since w and d do not depend on e, we get . Notably, when , this lower bound equates to the result from Theorem 2.14.
Further bounds will be provided in the next section, where we also try to shed light on the structures of switching mechanisms that allow for the computation of analytical results and bounds. Indeed, we will distinguish the so-called ‘rank-1’-case (which is closely related to the results in Dombry et al. (2011)), allowing explicit computations, and the ‘rank-2’-case, where often only bounds can be provided. Here, we refer to the rank of mean matrices of the reproduction and switching mechanisms. Obviously, the mean matrices of the responsive and preliminary switcher in Definition 2.5 are degenerate and of rank 1, as are the mean matrices of the stochastic switcher in Theorem 2.14, due to the vanishing determinant, while the stochastic switcher of Theorem 2.15 has mean matrices of rank 2. Yet, this switching mechanism also has particular properties that will be exploited in the next section.
Before we carry out these considerations and prove the above results, we first investigate the selective advantages of the switching strategies in different environments.
Fair comparison of BGWPDREs with different switching strategies
To decide which switching strategy of two different BGWPDREs is superior in an environment given by , one needs to impose a condition that ensures that both processes “may use an equal amount of available resources”. One way to do this would be to require that both processes can produce in expectation the same amount of offspring in each generation, be it active or dormant offspring, and to assume that the death probabilities of both processes are the same in both the active and dormant states each. The processes thus can adapt to the environment only by means of their specific switching strategies while using the same amount of resources. This motivates our notion of fitness advantages under “fair comparison”. We formulate this concept in the general framework of p-type branching processes in random environments .
Definition 2.16
(Fitness advantage under fair comparison). For let and two p-type BGWPRE with respect to the same environmental process such that -a.s. for all and their mean matrices satiesfy
2.8 |
Then, if , we say that Z is fitter than at fair comparison. If additionally , we say that Z has a strong (or qualitative) fitness advantage over under fair comparison.
Remark 2.17
The concept of Definition 2.16 is in the same spirit as the comparison of strategies in Dombry et al. (2011), since equation (2.8) assures that for each t, type-t-individuals in both populations produce in expectation the same amount of offspring, only varying the distribution of types among offspring.
- For BGWPDREs in environment , equation (2.8) is equivalent to
for each . - To allow a comparison of a BGWPDRE to a 1-type process (i.e. without dormancy), let be a 1-type BGWPRE with environment with conditional offspring means and (referring to healthy and harsh environmental states respectively). This process can be understood as a 2-type BGWPRE process in the sense of Definition 2.1, starting in (1, 0), and having mean matrices
for and some arbitrary . Although these matrices are reducible, this makes a fair comparison feasible. Note that the notion of fair comparison alone does not yet imply any kind of reproductive trade-off. However, every Lyapunov exponent or bound of such we will compute in the rest of this paper is continuous in the model parameters. This continuity and the strictness of the inequality in the definition of fitness advantages very well include the possibility of advantages under ‘disadvantageous’ comparison, including (sufficiently small but non-trivial) trade-offs.
One of the main goals of this article is to identify situations, under fair comparison, in which one switching strategy can be super-critical, whereas the other switching strategies and the process without dormancy is sub-critical. Note that this is impossible in the absence of a random environment, as pointed out in the discussion after Proposition 2.2. This is now obtained in the context of fair comparison and making use of Remark 2.17:
Theorem 2.18
Denote by an environment process as in Definition 2.3, by X a 1-type branching process as in Remark 2.17 and by BGWPDREs following either the responsive, preliminary or stochastic switching strategy as in Definition 2.5. Then, for either of the four processes there are non-trivial parameter regimes and environmental distributions, under which this process has a strong fitness advantage over the other three in the sense of Definition 2.16.
We prove this Theorem by means of examples of dominant strategies combining the results of Theorem 2.12, Theorem 2.13, Theorem 2.14 and (2.6) from Remark 2.8 after fitting the parameters to the regime of fair comparison.
Example 2.19
(Strong fitness advantages of seed bank switching strategies). Let X be a 1-type BGWPRE as in Remark 2.17 above with and , where such that X is sub-critical in the second environment. Further, let , and be three BGWPDREs with mean matrices
Note that both the responsive and preliminary switching matrices correspond to and the stochastic switching matrices additionally to . In particular, these processes satisfy (2.8), the condition of fair comparison. Also note that , such that we obtain an exact result from Theorem 2.14.
Now, only the environment-related parameters and are left to play with, describing the severity of harsh environments and the lengths of the environmental phases. The following cases prove Theorem 2.18:
- For , and we obtain
- For , and we get
- Letting , and implies
- Finally, choosing , and yields
Remark 2.20
(Interpretation of advantageous strategies). Figure 2 provides more insight into the behaviour of the four strategies than Example 2.19, by taking the parameter regimes (1) and (3) thereof and plotting the respective Lyapunov exponents as functions of . Furthermore, Fig. 3 shows the fitness advantage landscapes of the four models in dependence of , where strong advantages are colorized.
The responsive switcher, when , suffers most upon entering or exiting the harsh environment. Hence, in a scenario where environments rarely change (cf. Figs. 2 (left) and 3), responsive switching does well compared to the other strategies.
The stochastic switcher however, performs a bet hedging strategy, i.e. investing in dormant offspring even in good times to have better chances in worse times. This can often be very costly, but really pays off when environments change with a moderate frequency (cf. Fig. 3), especially when bad environments get very harsh, i.e. when is small.
The preliminary switcher is a somewhat paradoxical form of responsive switching, since it invests under good environmental conditions all its resources in producing dormant offspring, whereas in bad environments it only produces active offspring. In the extreme case , where deterministically the environment changes at every generation, it is intuitive that the preliminary switcher is optimal. Figure 3 even shows a non-trivial parameter region, in which this strategy is dominant.
Note that the 1-type process without dormancy trait will always dominate the switching strategies when becomes sufficiently big—i.e. when the process gets less and less sub-critical in bad environments—as illustrated in Fig. 3 (right). In fact, in that particular parameter setting, when , only the region of will appear in the phase diagram, meaning that for any environmental parameters . This corresponds to Proposition 2.2 from the beginning of this paper, where we saw that seed bank strategies are at a disadvantage in super-critical environments.
Lastly, note that, for general values of , the case of iid environments—which corresponds to the line on which —would not at all capture the strong advantage of responsive and preliminary switching in the settings of Fig. 3. Hence, for providing a complete understanding of the fitness landscapes, the iid case is insufficient.
Remark 2.21
(Combining basic strategies). The presence of phenotypic diversity regarding different switching strategies within the same Bacillus population [at least with respect to the exit strategy from dormancy, see van Vliet (2015) and Sturm and Dworkin (2015)] suggests also to investigate in mixtures of switching strategies. To model this, we let each individual choose at birth whether it behaves according to the preliminary (with probability ) or the responsive (with probability ) switching mechanism. The resulting mean matrices are given by
still maintaining fair comparison. Furthermore, these matrices also have determinant 0. (In fact, linear combinations of rank-1-matrices under fair comparison always retain rank 1). In particular, under fair comparison, the stochastic switcher with can be represented as the convex combination of the responsive and preliminary switcher with and . Hence, we can again compute their fitness explicitly, and this leads to very interesting behaviour.
Inserting the matrices of Example 2.19, we
obtain for
Figure 4 illustrates—in comparison to Fig. 3—which influence the convex combination of the basic strategies can have. Very intuitively, the regions where and have an advantage lie between the regions of the responsive/stochastic and stochastic/preliminary strategies. Remarkably, e.g. around for there is even a region where the convex combination of the responsive and preliminary strategy with yields a strong advantage possibly preventing extinction which, however, is certain for the responsive, stochastic and preliminary strategy.
This can be motivated as follows: For both either small or large, one of the pure strategies (responsive or preliminary) seems to be optimal. However, for moderate environmental variation is high and both fast switching and slow switching environmental phases might occur. Then, a combination of both strategies ensures that the worst case for neither strategy can affect the whole population. If one considers the strategy of stochastic switching as a bet-hedging strategy, then using phenotypic diversity to employ a mixture of extreme strategies might be seen as a ‘second-level’ bet-hedging strategy, now with respect to switching behaviour.
Remark 2.22
(Refinement of fair comparison). The notion of fair comparison implicitly assumes that the production of or conversion into dormant forms is equally costly as the production of active offspring. In many scenarios this will not be realistic. In fact, the production of inactive individuals can be both very efficient (e.g. in seed plants) as well as rather costly (e.g. sporulation of Bacillus subtilis, see Piggot and Hilbert 2004). Exchanging (2.8) in Definition 2.16 by
for some leads us to the notion of “-weighted fair comparison”. For BGWDPREs in the environment we are mainly interested in -weighted fair comparison with for some . Hence, the condition above reads
2.9 |
The idea behind (2.9) is to ensure that both populations still make use of the same amount of resources when producing dormant offspring becomes either less () or more () resource consuming than producing active offspring. This can be seen as one particular way of incorporating a reproductive trade-off (another natural one is the introduction of the parameter in the BGWPWD from Sect. 2.1). Obviously, we recover the notion of fair comparison for .
To get some intuition on the influence of on the fitness under fair comparison, we provide an example: Indeed, we adjust Example 2.19 by setting
With this, the four processes from the example satisfy the condition for -weighted fair comparison, while we still maintain to obtain an exact result from Theorem 2.14. Also, a convex combination yields -weighted fair comparison, although not necessarily retaining rank 1 anymore such that requires simulation.
Figure 5 illustrates the influence of on the phase diagram in Fig. 3 (left). Here, we see that halving the cost parameter largely enhances the advantages of carrying any dormancy trait, where the advantageous region for seems to increase the most. Naturally, the switching strategies will always dominate the 1-type process as approaches 0, i.e. as dormant offspring become very cost-efficient. On the other hand, having a cost parameter shifts the landscape in such a way that the 1-type process overtakes the strong-advantage-region from the stochastic switcher.
Figure 6 (left) shows the phase diagram of stochastic versus responsive switching for various values of , where the separatrix in general is given by the equation
2.10 |
With the parameters specified above, we observe that, for , the separatrix becomes a constant function with . Remarkably, this effect leads to parameter regimes where the fitness of the responsive switcher exceeds that of the stochastic switcher if is either small or big, while stochastic switching wins for intermediate , e.g. at the point marked by . This particular case is further depicted in Fig. 6 (right), where the respective Lyapunov exponents are plotted as functions of . (Note that the 1-type-fitness is constant here, since it is not influenced by the cost of dormant offspring.)
Remark 2.23
(Non-zero determinant case for mean matrices). Rather than combining the pure strategies, one can also compare different stochastic switching strategies under fair comparison, e.g. by choosing mean matrices of non-zero determinant, as illustrated in Fig. 7. Here, we add to the setting of Example 2.19 two further stochastic switchers with matrices
for . These satisfy the conditions of fair comparison to the processes in Example 2.19 while and .
In contrast to the stochastic switcher given by in Example 2.19, the -matrices describe a strategy that focuses more on the production of active offspring, making better use of good times. Note that a positive determinant could also come from a decrease of the resuscitation rate , increasing the chance of enduring long-lasting harsh times. Both effects increase the fitness in less frequently changing environments, Fig. 7 (left).
The -matrices, however, describe a population that mostly produces dormant offspring, making for a strategy that prevails in frequently changing and sufficiently harsh environments as seen in Fig. 7 (right).
Figure 8 illustrates the influence of the determinant on the phase diagram: In comparison to Fig. 3 (left), roughly, the -strategy with positive determinant wins ground on the left half, but loses ground on the right half of the diagram, while the -strategy with negative determinant generates the opposite effect.
Of course, the above examples invite a much larger and systematic study of parameter ranges and switching strategies, but we think that this goes beyond the scope of the present paper, with its focus on mathematical methods.
Technical results for rank-1 and rank-2 switching strategies
In this section, we provide theoretical results for the explicit computation and bounds for maximal Lyapunov exponents related to switching strategies in BGWPDRE. We prove Theorems 2.12, 2.13, 2.14 (rank-1 case) and 2.15 (rank-2 case) and provide bounds in the rank-2 case and a short literature review.
Rank-1-matrices and exact results
Consider a stationary and ergodic sequence of non-negative matrices such that, for any , the rank of is equal to one. Hence, for any , we can find column vectors such that and the sequence is stationary. Note that in this case, only one eigenvalue of is non-zero and, as a consequence, .
Lemma 3.1
Let be a stationary and ergodic sequence of non-negative matrices with for any . Suppose that and . Then, -a.s. and in mean,
3.1 |
Proof
First, note that both and . The latter ensures that the maximal Lyapunov exponent, , exists -a.s. and in mean. Thus it remains to show that is equal to the expression on the right-hand side of (3.1).
In order to apply (Kingman 1973, Theorem 1) which ensures that is finite, we first assume
3.2 |
This implies that , and . In particular, -a.s., for all . Since
3.3 |
we immediately deduce from Birkhoff’s ergodic theorem that the first term on the right-hand side of (3.3) converges, -a.s. and in , to as . Since , it follows that , and (3.1) holds in mean. Moreover, for any
where the constant appearing in the computation above results from the comparison of equivalent matrix norms. Thus, by using the Borel–Cantelli Lemma we conclude that -a.s., and (3.1) follows.
However, if the additional assumption in (3.2) is violated then we conclude that . Thus, by Kingman (1973, Theorem 2), the maximal Lyapunov exponent, , as well as the limit of the sum on the right-hand side of (3.3) exists with probability one, and . Using that concludes the proof.
Corollary 3.2
Let Z be a p-type BGWPRE in environment given in Definition 2.3. Suppose that , for any , and . Then, -a.s.,
Proof
By Lemma 3.1, it holds that, -a.s.,
By using that for any , and , the assertion follows.
Proof of Theorem 2.12, 2.13 and 2.14
This follows directly from Corollary 3.2.
Remark 3.3
(Connection to Dombry et al. 2011).
Similarly to Dombry et al. (2011, Propositions 1 and 7), the responsive switcher can be regarded as a 1-type BGWPRE process in a more complex random environment, here given by with corresponding offspring means and for . With this, Theorem 2.12 follows by applying the Ergodic Theorem.
For a given fixed mean offspring per type and environment, say for , consider a collection of distributions of offspring types—say for , —as reproduction strategy. In Dombry et al. (2011) the maximal Lyapunov exponent is only computed explicitly in the so-called non-hereditary case, that is, when the distributions of offspring types do not depend on the parent type, . Regarding and as column vectors in , the corresponding mean matrices are and thus, of rank 1. Hence, the case in which Dombry et al. (2011) obtain exact results for aligns with the case where we do. A natural generalization is to give type-t-individuals an offspring type distribution depending on e as a convex combination of two distributions, say and . This provides a simple example of the hereditary case and produces mean matrices of rank at most 2.
Rank-2-matrices and bounds
Let a stationary and ergodic sequence of non-negative matrices such that, for any , the rank of is at most two, i.e. there are column vectors , such that
for any . Further, for any , we denote by a non-negative matrix that is defined by
3.4 |
Note that the sequence is as well stationary and ergodic.
Remark 3.4
There are several ways to decompose a non-negative -matrix into the sum of two products of non-negative vectors, e.g. for any and it holds
3.5 |
3.6 |
3.7 |
3.8 |
where the entries in (3.5) and (3.6) are always non-negative, in (3.7) they are non-negative if and in the last if . Notably, (3.6) corresponds to writing M(e) as convex combination of responsive and preliminary switchers as indicated in Remark 2.21.
Lemma 3.5
Let be a stationary and ergodic sequence of non-negative matrices with for any and for any . Then, -a.s. and in mean,
3.9 |
Proof
First, by an elementary computation, we get that and . Thus, Kingman (1973, Theorem 6) implies that
exist -a.s. and in mean. Thus, we are left with showing that both limits coincide. Recall that the limit does not depend on the chosen matrix norm. Choosing , we obtain
By setting for any , it follows that
Thus, by using the same argument as in the proof of Lemma 3.1, we obtain that, -a.s. and in mean, , which concludes the proof.
Next, we focus on establishing various bounds for the maximal Lyapunov exponent for the resulting product of matrices.
Proposition 3.6
Let be a stationary and ergodic Markov chain with values in as given in Definition 2.3, and such that . Denoting , the following holds:
- For , let for . Then, -a.s. and in mean,
where denotes the spectral radius of the -matrix - For denote by the set of probability density functions on . Then, -a.s. and in mean,
where , denotes integration by with respect to and the entropy of , i.e.
Proof
(a) First, by the ergodic theorem, letting , we have that, -a.s. and in mean,
Moreover, it is well known that an upper bound for the maximal Lyapunov exponent of the stationary and ergodic sequence follows immediately from Jensen’s inequality. Indeed, using the matrix norm , ,
where . Hence, the assertion follows from Horn and Johnson (1990, Corollary 5.6.14).
(b) Note that, for any , equates to
The result follows from Jensen’s inequality, taking supremum and limit superior. Note that the right-hand side converges almost surely and hence in mean by monotone convergence.
Proof of Theorem 2.15
Since w and d do not depend on e, it holds . Notably, when , this lower bound equates to the result from Theorem 2.14. Hence, in what follows we assume .
For , using representation (3.7) we obtain
where . Otherwise, if and we use (3.8),
Hence, in both cases it holds and Proposition 3.6-(b) concludes the proof by considering and applying the Ergodic Theorem.
Remark 3.7
(Further bounds on the maximal Lyapunov exponent). Let us consider the stochastic switching model with mean matrices M(1) and M(2). In view of Remark 2.9 any sub-multiplicative function yields that . Likewise, for any super-multiplicative function we obtain that . Examples of super-multiplicative functions are the minimal column and row sums, respectively, any diagonal element, or the permanent of a matrix A.
For a slightly improved upper bound note that for any sub-multiplicative matrix norm
which takes into account the effects of one type of environmental change. Hence, we obtain that , where
As one can see in Fig. 9 (right), for small in some cases this can give a better upper bound than the one from Hautphenne and Latouche (2016).
A more evolved approach is to choose a sequence such that can be interpreted as path of a Markov chain. Combining this ansatz with Markov chain limit results leads to
Corollary 3.8
For let , such that the stochastic matrix Q defined as
is irreducible and aperiodic, and denote by q its stationary distribution. Then,
where if and otherwise.
Let us point out that this result can also be deduced directly from Arnold et al. (1994, Theorem 4.3), where the authors, based on concepts from equilibrium statistical mechanics, established a variational characterization of the maximal Lyapunov exponent for general ergodic sequences of positive matrices satisfying certain integrability conditions. Nevertheless, for the sake of being self-contained we provide a proof of Corollary 3.8 at the end of this section. A similar upper bound has been derived by Gharavi and Anantharam (2005).
Note that Corollary 3.8 in this special case offers an analytical approach for finding an over-all reliable lower bound by adjusting the eight -parameters. Additionally, it provides a way to give an approximate uniform lower bound, which, in some regimes, outperforms the other lower bounds discussed here—see Fig. 9 (mid).
Remark 3.9
(Connection to Hautphenne and Latouche 2016and Kussell and Leibler 2005).
Using Eq. (3.5) for both mean matrices of the stochastic switcher yields . Hence, letting , Proposition 3.6-(a) gives the same upper bound as in Hautphenne and Latouche (2016, Theorem 2). Changing the values of allows to influence the loss from the estimation by Jensen’s inequality, offering potential improvement for this upper bound.
- Analogous to Corollary 3.8 the method in Hautphenne and Latouche (2016) is based on constructing via transition matrices of the form
where v(e) denotes the respective and suitably normalized right-eigenvectors of M(e). In fact, the lower bound in Hautphenne and Latouche (2016, Theorem 3) can be achieved from Corollary 3.8 by choosing and, as above, by decomposition (3.5). Then, the lower bound in Corollary 3.8 becomes
which illustrates the connection. Figure 9 demonstrates that there are choices for the parameters that can be made to improve the lower bound from Hautphenne and Latouche (2016), especially for small . This particular choice of transition matrices defines a Markov chain closely related to the so-called retrospective process (cf. Wang 2014, Chapter 3).3.10 - Kussell and Leibler (2005) approximate the maximal Lyapunov exponent under a slow environment condition. The soundness of this approximation can be verified by the previously discussed bounds of Hautphenne and Latouche (2016): As and , gets close to and hence, the second addend in (3.10) approaches . On the other hand, the -matrix obtained in Remark 3.9-(1) tends to
and . Thus, both bounds approach and so does .
Proof of Corollary 3.8
Denote by and by random variables on holding and
for all . Then, is a time-homogeneous stationary Markov chain with transition matrix Q. Furthermore, with is a homogeneous Markov chain with stationary distribution given by
for . Now, with . Letting the distribution of conditional on , it follows by stationarity of the and ergodicity that
amounting to the first addend of the lower bound. On the other hand, using the Markov property,
Hence, by stationarity of and the Ergodic Theorem it follows
and the corollary holds by Proposition 3.6-(b).
Discussion and outlook
Discussion
In the previous sections we evaluated and compared the fitness of 2-type branching processes with dormancy. We incorporated different switching strategies between active and dormant states, type-specific relative reproductive costs and fluctuating environments changing between healthy and harsh states. We will now discuss our results from a somewhat elevated perspective, formulating simplified ‘take-home messages’, and put them in the context of earlier work. Further, we will comment on the role of stochasticity in our model and sketch areas for future research.
‘Take-home messages’ and the ‘rules of thumb’ of Malik and Smith
In Malik and Smith (2008), the authors work in a dynamical-systems based set-up and provide several ‘rules of thumb’ summarizing their findings. We try to formulate a corresponding set of such rules; however, one should be careful with these necessarily vague statements—in doubt one should always refer to the exact mathematical results. The following statements hold under our ‘fair comparison’ assumption ().
(1) Each switching strategy can be more fit than the ‘sleepless’ strategy when i) good times are rare and ii) bad times are sufficiently harsh. Otherwise, the sleepless population has a (potentially strong) fitness advantage.
This was essentially also observed in Malik and Smith (2008). Here, i) corresponds in our setting to the condition . In this case indeed we see that is small—cf. Fig. 3. However, one should note that this observation also depends on the severity of the ‘harsh’ environmental state, as will eventually always dominate as , thus explaining the additional condition ii). In fact, we can strengthen this rule by adding that each switching strategy can even be exclusively super-critical (“strong fitness advantage”), if the ratio between good times and bad times is sufficiently adjusted, cf. the colored areas of Fig. 3. From now on, we will always assume a sufficiently severe harsh environment (that is, is sufficiently small).
Note that in the opposite limit the horizontal line in Fig. 3 tends to a horizontal line through the point 1, the slope of the linear function through the origin tends to , and the separatrix between the responsive and spontaneous switcher tends to the function . In this case, the stochastic switcher completely dominates the diagram, but has no strong fitness advantage.
(2) The responsive switcher is more fit than the stochastic switcher when environmental states change rarely, i.e. when is small.
Note that a similar rule has been stated in Kussell and Leibler (2005). The corresponding rule in Malik and Smith (2008) is that the “responsive switcher is more fit than the stochastic switcher when either good times are very rare or are very common”. In our case, this would correspond to the condition that either or , suggesting that the point lies below the graph of a suitable ‘hyperbola’. Indeed, we are able to compute the exact boundary of the region where , which is given by (2.10), see Fig. 3 and Remark 2.22. Hence we are able to provide a very explicit classification of fitness advantage areas. However, we also see that when environmental states both change rarely, i.e. also if and are small. Since Malik and Smith (2008) only consider environmental cycles of fixed length , they cannot observe this effect. In contrast, Malik and Smith (2008) also provide results for the limit of extremely quickly fluctuating environments, which are meaningless in our model, since we assume discrete time/generations.
(3) When the environment changes almost every generation, i.e. when is large, the preliminary switcher is the fittest.
The preliminary switcher does not appear explicitly in Malik and Smith (2008) and might seem counter-intuitive at first glance. However, in the discrete generation context, when the environment becomes predictable (e.g. seasonal/periodic changes), this strategy can be optimal, as Fig. 3 shows.
(4) For intermediate values of , stochastic switching emerges as optimal strategy.
As discussed in Remark 2.21, stochastic switching can be constructed via convex combinations of responsive and preliminary switching, retaining rank 1. Their Lyapunov exponents, , can then be computed by Theorem 2.14 and maximized in q(1) and q(2) to obtain the optimal rank-1-strategy under fair comparison, which is a non-trivial convex combination for intermediate as Fig. 4 suggests. Indeed, mixed strategies can be uniquely super-critical (‘strong fitness advantage’). This observation has no analogue in Malik and Smith (2008), where although ‘hybrid’ strategies are mentioned (p. 1144), no results are being provided. However, in general the optimal strategy might not be of rank 1, as Fig. 7 suggests that non-zero determinants might further increase fitness in certain scenarios.
The model of Dombry, Mazza and Bansaye
In Dombry et al. (2011), the authors deal with a branching process model for phenotypic switches between potentially many types, and with a general class of random environments (though still assumed to be stationary and ergodic). However, in some regards their model and results are also more restrictive than ours, and some scenarios of dormancy-related reproduction are not covered. To understand the differences, let us recall their distinction between hereditary and non-hereditary reproduction strategies. In non-hereditary strategies, in a first step, the offspring numbers of individuals are sampled, and then, in a second step, independently, the new phenotypes are attached to the offspring individuals. This case disentangles reproduction and phenotype-allocation and allows to obtain a wealth of elegant results on the fitness and optimality of switching strategies. In contrast, the hereditary case does not feature this disentanglement, and type allocation may depend on the type of the parents. Because of this, this case is mathematically much harder to investigate. In fact, here, Dombry et al. (2011) provide no systematic results for the Lyapunov exponents of the system.
Unfortunately, the non-hereditary case already excludes our simple example for dormancy-related reproduction being the result of either binary fission or sporulation from Sect. 2.1, since here, the offspring number determines the offspring type. It is still possible to transfer some of their machinery to the cases which in our models correspond to rank-1 matrices, but not to the rank-2 case.
Nevertheless it is interesting to compare some of the results for the non-hereditary case with our results. In the case with spontaneous switching (related to the ‘no-sensing’ case in the language of Dombry et al. (2011)), they show that there are situations where a diversified strategy with several phenotypes can have a strong fitness advantage over a ‘single-type’ strategy, cf. their Section 1.1.1. For the case ‘with sensing’ (responsive switching) and for , they show that for rarely changing environments, responsive switching is optimal, whereas in highly fluctuating environments, preliminary switching dominates. Further, a mixed strategy—e.g. stochastic switching—is optimal in intermediate regimes. This corresponds to our observation in Fig. 3 (left). However, similarly to the iid case given on the line where , this covers only a small part of the fitness landscape.
The role of relative reproductive costs and ‘weighted fair comparison’
Note that while (Dombry et al. 2011) do consider mixed switching strategies, they implicitly always assume a ‘fair-comparison’ of reproductive strategies (cf. the fixed type distributions on p. 377), corresponding to our comparison with . If this assumption is violated, that is, dormant offspring are either ‘more cost efficient’ than active offspring (), or ‘more expensive’ (), the picture regarding optimal strategies becomes very rich and exhibits novel effects. Note that situations in which dormant offspring are more expensive than active offspring could relate for example to the sporulation process of Bacillus subtilis, which takes much longer than producing an active offspring by binary fission (Piggot and Hilbert 2004), and thus leads to fewer dormant offspring per time unit, resulting in higher ‘effective’ costs. On the other hand, plants often produce many seeds at a low cost, and this is clearly the optimal strategy in the presence of extremely harsh environments (‘winter’) that effectively kills all ‘active’ individuals from the current generation of a species.
In our model, the following picture emerges (cf. Fig. 5): Under reduced costs for dormant individuals (here, ), rule (1) still holds in a qualitative sense; however, the region where the sleepless case is optimal is further reduced, while the relation between the regions of the switching strategies does not seem to change qualitatively. Under increased costs for dormant offspring (here, ), a new effect appears. Here, the ‘sleepless’ population can suddenly gain a strong fitness advantage in moderately fluctuating environments, at the cost of the stochastic switcher.
Interestingly, both the responsive and the preliminary strategies seem less severely affected by variable relative reproductive costs as both retain their ‘strong advantages’ in Fig. 5 (right). However, for severe relative fitness differences, the qualitative picture may again change drastically. For example, in Fig. 6 (left) the separatrix between the regions of dominance of the responsive and the stochastic switcher performs a phase transition, where for it becomes a straight line. This invites a more comprehensive study of the sensitivity of the optimal strategies on the relative reproductive costs, which however seems beyond the scope of the present paper.
Stochastic versus deterministic modeling
Stochasticity enters in our model in two distinct ways. The first is environmental stochasticity, where random changes between harsh and healthy states affect all individuals in the population simultaneously. Second, there is demographic stochasticity, due to random reproduction/switching (described by a branching process), which is independent between individuals. Despite these two sources of randomness, a first observation is that several qualitative results remain valid under both approaches (e.g. rule 1).
One reason is that the reproductive mechanism enters our results on the maximal Lyapunov exponents only through the mean matrices of the offspring distributions, which yields the same information as is contained in the transition rates of dynamical systems. Further, for large populations, the law of large numbers implies that branching processes in discrete time, resp. birth-death processes in continuous time, can be well approximated by dynamical systems, see e.g, Kurtz (1974), Ethier and Kurtz (1986) and Fournier and Méléard (2004) (the latter for spatial and measure-valued set-ups) on the ecological time-scale.
However, demographic stochasticity does play a major role in situations when population sizes may fluctuate strongly, and in particular may be very small. This could for example be the case in scenarios when new dormancy traits invade a resident population without this trait (as in Blath and Tobiás 2020), or infections in an early stage. It is certainly also relevant when considering extinction probabilities, which by definition involve small population sizes. See e.g. Jost and Wang (2014), who investigate the extinction probability of branching processes under optimal phenotype allocation.
Environmental stochasticity determines the ‘random order’ in which the mean matrices corresponding to the environmental states enter the formula for the Lyapunov exponent. The formula of course also holds for deterministic periodic environments. In the rank-1 case, exponents can e.g. be computed via Lemma 3.1, and this can lead in (suitably chosen) environments to similar effects as in corresponding random environments (e.g. a periodic environment, where two harsh periods are always followed by two healthy periods and vice versa, and a random environment with ). However, in the (non-commutative) rank two case, the relation between stochastic and deterministic environments, and the impact on optimal switching strategies, is not clear and requests future work.
Outlook
As indicated above, our results are still incomplete and our study invites further research in several directions.
For example, progress regarding the exact computation of Lyapunov exponents is certainly desirable, but this is known to be difficult and probably needs particular and sophisticated methods for particular switching strategies, depending on the algebraic properties of the underlying mean matrices. This also holds true for more general environments. An alternative to analytic solutions could be the exploration of the ‘strategy space’ via simulation, e.g. involving genetic algorithms. Simulation methods in more complex models could also allow the assessment of antibiotic treatment protocols.
A readily accessible task is the extension of our model to continuous-time birth-death processes. This then suggests to explore the relation to ‘adaptive dynamics’ related set-ups, in which one could try to merge random environments and dormancy with competition and mutation. A starting point could be the recent model on dormancy under competition in Blath and Tobiás (2020), extended by rates depending on the state of a fluctuating environment. Having more than one species with potentially different dormancy strategies, perhaps even in a spatial set-up à la (Fournier and Méléard 2004), could lead to truly ecological models. However, this would also require to disentangle different notions of fitness (e.g. invasion fitness vs. long-term fitness).
Finally, such general models could form the basis to understand effects of long-term changes in the distribution of the random environment (for example due to climate change). It is well-known that climate change can have a serious impact on seed banks, see e.g. Ooi (2012), and it would be interesting to understand the robustness of switching methods under such scenarios.
Acknowledgements
The authors gratefully acknowledge support by DFG SPP 1590 “Probabilistic structures in evolution”, Project BL 1105/5-1. The authors also thank Matthias Hammer for valuable comments and discussions.
Appendix
Proof of Proposition 2.2
For the classical BGW process with offspring distribution we write , to denote the corresponding offspring probability generating function. Set and . Recall that, by assumption, the variance of is finite. It is well-known (cf. Athreya and Ney 1972, Theorem I.5.1), that the survival probability is given by , where is the smallest fixed point of . Furthermore it is known that for converges to a positive limit if , as does if (cf. Athreya and Ney 1972, Corollary I.11.1 and Athreya and Ney 1972, Theorem I.9.1, respectively). Since iff , it follows that
is infinite if and finite if .
Coming to , its offspring probability generating function is given by
Now, by Athreya and Ney (1972, Theorem V.3.2), can be given as the first component of the smallest fixed point of g. Denoting and as in the example in Sect. 2.1, basic transformations yield that iff
such that by monotonicity .
From (2.3) we obtain for that
which implies that , where the inequality is strict, if .
It remains to prove the results regarding . For this, denoting by and by , the mean matrix of the offspring distribution of Z is given by
along with its largest eigenvalue
5.1 |
Then, survives with positive probability iff , while , if (cf. Athreya and Ney 1972, Theorem V.3.2). More importantly, for case (3), when d is so small that , (5.1) implies . Thus, by continuity, there also is such that .
We now apply Athreya and Ney (1972, Theorem V.4.4): Since the offspring distributions are of finite variance, the second moment condition holds and the theorem implies
which concludes the proof of (3).
For the critical case (2) note that . (The maximum row sum can be seen as an operator norm and hence is an upper bound for all eigenvalues.) Hence, in the case the proof is complete, since . At last, it remains to show that even if , . Thus, letting note that iff
which can only hold if either or , both contradicting our assumptions.
Funding
Open Access funding enabled and organized by Projekt DEAL.
Footnotes
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Contributor Information
Jochen Blath, Email: blath@math.tu-berlin.de.
Felix Hermann, Email: hermann@math.tu-berlin.de.
Martin Slowik, Email: slowik@math.uni-mannheim.de.
References
- Alarcón T, Jensen HJ. Quiescence: a mechanism for escaping the effects of drug on cell populations. J R Soc Interface. 2011;8(54):99–106. doi: 10.1098/rsif.2010.0130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Arnold L, Demetrius L, Gundlach VM. Evolutionary formalism for products of positive random matrices. Ann Appl Probab. 1994;4(3):859–901. [Google Scholar]
- Athreya KB, Ney PE (1972) Branching processes. Springer, New York-Heidelberg. Die Grundlehren der mathematischen Wissenschaften, Band 196
- Athreya KB, Karlin S. On branching processes with random environments. I. Extinction probabilities. Ann Math Stat. 1971;42:1499–1520. [Google Scholar]
- Athreya KB, Karlin S. Branching processes with random environments. II. Limit theorems. Ann Math Stat. 1971;42:1843–1858. [Google Scholar]
- Balaban NQ, Merrin J, Chait R, Kowalik L, Leibler S. Bacterial persistence as a phenotypic switch. Science. 2004;305:0036–8075. doi: 10.1126/science.1099390. [DOI] [PubMed] [Google Scholar]
- Bär M, von Hardenberg J, Meron E, Provenzale A. Modelling the survival of bacteria in drylands: the advantage of being dormant. Proc R Soc Lond B. 2002;269:937–942. doi: 10.1098/rspb.2002.1958. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blath J, Tobiás A. Invasion and fixation of microbial dormancy traits under competitive pressure. Stoch Process Appl. 2020;130(12):7363–7395. [Google Scholar]
- Blath J, González Casanova A, Eldon B, Kurt N, Wilke-Berenguer M. Genetic variability under the seedbank coalescent. Genetics. 2015;200(3):921–934. doi: 10.1534/genetics.115.176818. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blath J, González Casanova A, Kurt N, Wilke-Berenguer M. A new coalescent for seed-bank models. Ann Appl Probab. 2016;26(2):857–891. [Google Scholar]
- Blath J, González Casanova A, Kurt N, Wilke-Berenguer M. The seedbank coalescent with simultaneous switching. Electron J Probab. 2020;25(27):1–21. [Google Scholar]
- Bougerol P, Lacroix J (1985) Products of random matrices with applications to Schrödinger operators. Progress in Probability and Statistics, vol 8. Birkhäuser
- Crisanti A, Paladin G, Vulpiani A (1993) Products of random matrices in statistical physics. Springer Series in Solid-State Sciences, vol 104. Springer, Berlin. With a foreword by Giorgio Parisi
- Dombry C, Mazza C, Bansaye V. Phenotypic diversity and population growth in a fluctuating environment. Adv Appl Probab. 2011;43(2):375–398. [Google Scholar]
- Endo H, Inoue M. Dormancy in cancer. Cancer Sci. 2019;110(2):474–480. doi: 10.1111/cas.13917. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ethier SN, Kurtz TG (1986) Markov processes. Wiley Series in Probability and Mathematical Statistics: Probability and Mathematical Statistics. Wiley, New York. Characterization and convergence
- Fisher RA, Gollan B, Helaine S. Persistent bacterial infections and persister cells. Nat Rev Microbiol. 2017;15(8):453–464. doi: 10.1038/nrmicro.2017.42. [DOI] [PubMed] [Google Scholar]
- Fournier N, Méléard S. A microscopic probabilistic description of a locally regulated population and macroscopic approximations. Ann Appl Probab. 2004;14(4):1880–1919. [Google Scholar]
- Fowler AC, Winstanley HF. Microbial dormancy and boom-and-bust population dynamics under starvation stress. Theor Popul Biol. 2018;120:114–120. doi: 10.1016/j.tpb.2018.02.001. [DOI] [PubMed] [Google Scholar]
- Furstenberg H, Kesten H. Products of random matrices. Ann Math Stat. 1960;31:457–469. [Google Scholar]
- Gharavi R, Anantharam V. An upper bound for the largest Lyapunov exponent of a Markovian product of nonnegative matrices. Theoret Comput Sci. 2005;332(1–3):543–557. [Google Scholar]
- Haccou P, Jagers P, Vatutin VA (2007) Branching processes: variation, growth, and extinction of populations. Cambridge Studies in Adaptive Dynamics, vol 5. Cambridge University Press, Cambridge; IIASA, Laxenburg
- Hautphenne S, Latouche G. Lyapunov exponents for branching processes in a random environment: the effect of information. J Stat Phys. 2016;163(2):393–410. [Google Scholar]
- Horn RA, Johnson CR. Matrix analysis. Cambridge: Cambridge University Press; 1990. [Google Scholar]
- Jost J, Wang Y. Optimization and phenotype allocation. Bull Math Biol. 2014;76(1):184–200. doi: 10.1007/s11538-013-9915-5. [DOI] [PubMed] [Google Scholar]
- Jurga N, Morris I. Effective estimates on the top Lyapunov exponents for random matrix products. Nonlinearity. 2019;32(11):4117–4146. [Google Scholar]
- Kaj I, Krone SM, Lascoux M. Coalescent theory for seed bank models. J Appl Probab. 2001;38(2):285–300. [Google Scholar]
- Kaplan N. Some results about multidimensional branching processes with random environments. Ann Probab. 1974;2:441–455. [Google Scholar]
- Key ES. Computable examples of the maximal Lyapunov exponent. Probab Theory Relat Fields. 1987;75(1):97–107. [Google Scholar]
- Key ES. Lower bounds for the maximal Lyapunov exponent. J Theor Probab. 1990;3(3):477–488. [Google Scholar]
- Kingman JFC. Subadditive ergodic theory. Ann Probab. 1973;1:883–909. [Google Scholar]
- Kurtz TG. Convergence of sequences of semigroups of nonlinear operators with an application to gas kinetics. Trans Am Math Soc. 1974;186(259–272):1973. [Google Scholar]
- Kussell E, Leibler S. Phenotypic diversity, population growth, and information in fluctuating environments. Science. 2005;309:2075–2078. doi: 10.1126/science.1114383. [DOI] [PubMed] [Google Scholar]
- Kussell E, Kishony R, Balaban NQ, Leibler S. Bacterial persistence. Genetics. 2005;169:1807–1814. doi: 10.1534/genetics.104.035352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ledrappier F (1984) Quelques propriétés des exposants caractéristiques. In: École d’été de probabilités de Saint-Flour, XII—1982, Lecture Notes in Math., vol 1097. Springer, pp 305–396
- Lennon JT, den Hollander FTW, Wilke Berenguer M, Blath J (2020) Principles of seed banks: complexity emerging from dormancy, pp 1–45. Nat Commun (accepted). arXiv:2012.00072 [DOI] [PMC free article] [PubMed]
- Lennon JT, Jones SE. Microbial seed banks: the ecological and evolutionary implications of dormancy. Nat. Rev. Microbiol. 2011;9:119–130. doi: 10.1038/nrmicro2504. [DOI] [PubMed] [Google Scholar]
- Lewis K. Persister cells. Ann Rev Microbiol. 2010;64:357–372. doi: 10.1146/annurev.micro.112408.134306. [DOI] [PubMed] [Google Scholar]
- Locey KJ, Fisk MC, Lennon JT. Microscale insight into microbial seed banks. Front Microbiol. 2017;7:2040. doi: 10.3389/fmicb.2016.02040. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Malik T, Smith HL. Does dormancy increase fitness of bacterial populations in time-varying environments? Bull Math Biol. 2008;70(4):1140–1162. doi: 10.1007/s11538-008-9294-5. [DOI] [PubMed] [Google Scholar]
- Ooi MKJ. Seed bank persistence and climate change. Seed Sci Res. 2012;22:S53–S60. [Google Scholar]
- Oseledec VI. A multiplicative ergodic theorem. Characteristic Ljapunov, exponents of dynamical systems. Trudy Moskov Mat Obšč. 1968;19:179–210. [Google Scholar]
- Piggot PJ, Hilbert DW. Sporulation of bacillus subtilis. Curr Opin Microbiol. 2004;7(6):579–586. doi: 10.1016/j.mib.2004.10.001. [DOI] [PubMed] [Google Scholar]
- Pollicott M. Maximal Lyapunov exponents for random matrix products. Invent Math. 2010;181(1):209–226. [Google Scholar]
- Protasov VY, Jungers RM. Lower and upper bounds for the largest Lyapunov exponent of matrices. Linear Algebra Appl. 2013;438(11):4448–4468. [Google Scholar]
- Shoemaker WR, Lennon JT. Evolution with a seed bank: the population genetic consequences of microbial dormancy. Evol Appl. 2018;11(1):60–75. doi: 10.1111/eva.12557. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sturm A, Dworkin J. Phenotypic diversity as a mechanism to exit cellular dormancy. Curr Biol. 2015;25(17):2272–2277. doi: 10.1016/j.cub.2015.07.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tanny D. On multitype branching processes in a random environment. Adv Appl Probab. 1981;13(3):464–497. [Google Scholar]
- van Vliet S. Bacterial dormancy: how to decide when to wake up. Curr Biol. 2015;25(17):R753–R755. doi: 10.1016/j.cub.2015.07.039. [DOI] [PubMed] [Google Scholar]
- Wang Y. Variational characterization of multitype Galton–Watson branching processes. Acta Math Appl Sin Engl Ser. 2014;30(1):27–36. [Google Scholar]