Abstract
Continuum limits in the form of stochastic differential equations are typically used in theoretical population genetics to account for genetic drift or more generally, inherent randomness of the model. In evolutionary game theory and theoretical ecology, however, this method is used less frequently to study demographic stochasticity. Here, we review the use of continuum limits in ecology and evolution. Starting with an individual‐based model, we derive a large population size limit, a (stochastic) differential equation which is called continuum limit. By example of the Wright–Fisher diffusion, we outline how to compute the stationary distribution, the fixation probability of a certain type, and the mean extinction time using the continuum limit. In the context of the logistic growth equation, we approximate the quasi‐stationary distribution in a finite population.
Keywords: continuum limit, diffusion approximation, extinction time, fixation probability, stationary distribution
We review the use and the derivation of continuum limits in the context of ecology and evolution and provide key analytic expressions of the fixation probability, the mean extinction time and the stationary distribution. By example, we compute these quantities for the Wright–Fisher process and the stationary distribution of the logistic growth equation.

1. INTRODUCTION
The huge computational power available today allows researchers to develop individual‐based models of high complexity to explore dynamical processes in ecology and evolution. Here, we aim to make a link between these individual‐based descriptions and continuous models like (stochastic) differential equations that remain amenable to analysis. We review these techniques and apply them to some frequently used models in ecology and evolution.
In ecology, probably the most common description of population dynamics is the logistic growth equation (Verhulst, 1838). Its attractiveness draws from its simplicity. It has a globally attractive fixed point (when started with any nonzero population size), the carrying capacity of the population. This simplicity comes at the cost that biological observations like population size fluctuations or even extinction events are not captured by this deterministic model. To account for these stochastic effects, one needs to change to a stochastic differential equation which can be derived from individual‐based reactions (Champagnat et al., 2006). Stochastic differential equations consist of a deterministic and a stochastic term, the latter also often referred to as noise. In theoretical ecology, the effect of random environmental fluctuations on deterministic dynamics like the logistic equation has been studied in great detail (e.g., Allen et al., 1993; Schaffer et al., 1986). Demographic stochasticity, which arises from the inherent randomness of birth and death events of individuals, is often modeled in discrete‐time models (e.g., Melbourne & Hastings, 2008; Schreiber et al., 2018). Models including demographic stochasticity in continuous time are more scarce in ecology. Typically, these models are studied in the context of evolutionary ecology, for example to model co‐evolution in ecological communities (Dieckmann & Law, 1996). More generally, questions related to extinction and coexistence, transitions between different stable deterministic equilibria or the maintenance of quasi‐cycles can be studied within the framework of stochastic differential equations (Boettiger, 2018; Jeltsch et al., 2019). Here, we outline how to derive stochastic differential equations from an individual‐based model. We do this along similar lines as reviewed in Black and McKane (2012) but with a larger emphasis on the technical details.
In evolutionary game theory, the Moran process has become a popular model for stochastic dynamics in finite populations (Nowak et al., 2004). It is a model describing the dynamics of different alleles in a population of fixed size and overlapping generations. As this is a birth–death process, quantities like fixation probabilities, fixation or extinction times, and the stationary distribution can be calculated based on recursions (Allen, 2011; Goel & Richter‐Dyn, 1974; Karlin & Taylor, 1975; Traulsen & Hauert, 2009). A continuum approximation for quantities that are known exactly may thus make limited sense at first sight—but it can provide a very useful new perspective. Another important process in population genetics is the Wright–Fisher process—a model for allele frequency dynamics in a population of fixed size and nonoverlapping generations (Wright, 1931). It is more popular in population genetics but is also used in evolutionary game theory (e.g., Imhof & Nowak, 2006; Taylor & Maciejewski, 2012; Traulsen et al., 2006; Wakano & Lehmann, 2014). The Wright–Fisher process is mathematically more challenging to analyze exactly than the Moran process. Therefore, continuum approximations resulting in stochastic differential equations are used to compute typical quantities of interest such as the probability of fixation of a certain genotype or the mean time until this fixation event occurs (Crow & Kimura, 1970; Ewens, 2004). Additional to the derivation of the continuum limit, we also present how to compute these quantities. In evolutionary game theory, demographic stochasticity is particularly important for behaviors that only evolve in small populations, such as spite (Gardner & West, 2004; Nowak et al., 2004). Initially, stochastic replicator dynamics have been developed to address the problem of equilibrium selection in evolutionary games (Cabrales, 2000; Foster & Young, 1990; Fudenberg & Harris, 1992). With the more recent development to look at games with a larger number of possible strategies, an approach based on weak mutation which allows to reduce the analysis to pairwise comparison between strategies became popular (Fudenberg & Imhof, 2006; Wu et al., 2011). These methods are particularly useful when many strategies have to be taken into account (García & Traulsen, 2019; Rand & Nowak, 2011; Sigmund et al., 2010; Traulsen et al., 2012; Vasconcelos et al., 2017). However, while the focus on weak mutation allows an analysis that takes demographic noise into account, additional approaches are necessary when, for example, internal equilibria are present (Black et al., 2012; Vasconcelos et al., 2017). The stochastic tools discussed here may be a further step into that direction.
Even though similar in the questions they try to answer, evolutionary game theory and population genetics are developing in parallel, sometimes with little interaction between them. As this is partly arising from the different methods applied, here we aim to provide an introduction to the continuum limit for those less comfortable with these methods and hesitant to go into the extensive, more mathematical, literature. In summary, for the theoretical population geneticist with a probabilistic background, we provide a summary of some key results on stochastic differential equations; for the evolutionary game theorist, we give a new perspective on the derivations of results obtained when using discrete birth–death processes; and lastly, for the theoretical ecologist familiar with deterministic modeling, we outline how to derive and work with stochastic versions of classical ecological and evolutionary processes.
Since our goal is to illustrate how to apply a continuum limit to individual‐based descriptions of a biological process, the calculations and derivations below may remain vague where more mathematical theory is necessary. For a mathematically rigorous presentation of this topic, we refer to the lecture notes by Etheridge (2012) or the book by Ewens (2004). A more application‐oriented treatment of stochastic processes in biology can be found in the books by Lande et al. (2003), Otto and Day (2007), and Allen (2011).
2. EVOLUTIONARY AND ECOLOGICAL PROTO‐TYPE PROCESSES
We outline the derivation of continuum limits by application to exemplary processes from evolution and ecology, the Wright–Fisher and Moran process, and the logistic equation. By showing the explicit derivation in these examples, we provide the necessary tool set to derive continuum limits of more complex processes motivated by individual‐based models. In this section, we define the models by their microscopic descriptions, that is, we describe the model dynamics as viewed from an individual's perspective.
2.1. Wright–Fisher and Moran process
The two most popular processes to model (stochastic) evolutionary dynamics are the Wright–Fisher and the Moran process. While in the Wright–Fisher process generations are nonoverlapping and time is measured in discrete steps, generations in the Moran model are overlapping and measured in either discrete or continuous time. Originally, both processes describe the stochastic variation of allele frequencies due to finite population size effects referred to as genetic drift.
2.1.1. Wright–Fisher model
One of the oldest population genetics model is the finite size Wright–Fisher process (Fisher, 1930; Wright, 1931). Given a population of constant size, it describes the change in frequencies of alleles in nonoverlapping generations over time, measured in (discrete) generations.
Classically, one considers a population of N individuals where each individual is of type A or B. The population is considered to be in its ecological equilibrium. The population size N is therefore constant over time. One interpretation of the dynamics is that every generation each individual chooses, independently of all other individuals, an ancestor from the previous generation and inherits its type. Under selection, the likelihood of drawing type A individuals increases (or decreases) which introduces a sampling bias. The probability for an offspring to have a parent of type A, conditional on k individuals being of type A in the parental generation, is
| (1) |
where is the selective advantage of type A. The number of type A individuals in the next generation is then given by a binomial distribution with sample size N and success probability pk. Denoting the number of type A individuals in generation n by Xn, we have
| (2) |
Unfortunately, the Wright–Fisher model, even though very illustrative, is difficult to study analytically. Through the developments in stochastic modeling in the last century, a lot of this new theory could be adopted to overcome this problem (e.g., Ewens, 2004; Kimura, 1983).
2.1.2. Moran model
Another way to resolve the difficulties associated with the Wright–Fisher process is provided by the Moran process (Moran, 1958). The setup is the same as for the Wright–Fisher process (constant population size N with two types or – in population genetics – alleles A and B) with one exception: Time is not measured in generations but each change in the population configuration affects only one individual, the one that dies and gets replaced by an offspring of another randomly selected individual. Therefore, generations are overlapping, and time can be measured in discrete steps or continuously.
Discrete time
The Moran process in discrete time progresses as follows. Every time step, one individual is randomly chosen to reproduce and the offspring replaces a randomly chosen individual among the remaining N − 1 individuals (sometimes, the replacement mechanism is not restricted to the remaining individuals but also includes the parent). Therefore, in a population with k type‐A individuals, the probability that one of these replaces a type‐B individual is given by
| (3) |
with pk as defined in Equation 1. Analogously, the probability for the number of type‐A individuals to decrease from k to k − 1 reads
| (4) |
We have implemented selection on the reproduction step, but it could also affect the replacement step. In a nonspatial setting, as considered here, this leads to the same transition probabilities. However, the Moran model can also be studied on a graph which aims to model spatial structure. In that case, the order of reproduction and replacement, and which of these steps is affected by selection, matters and can potentially give rise to different evolutionary dynamics (Kaveh et al., 2015; Lieberman et al., 2005). We note further that without selection (s = 0) we have pk = k/N, that is, the increase and decrease probabilities are equal for any choice of k. Dynamics with this property are called neutral.
Continuous time
The same dynamics (albeit on a different time scale) are obtained by assuming that each pair of individuals is associated with a random exponentially distributed time (also described as exponential clocks). The next pair to update their types is determined by the smallest random time (or the clock that rings first). At these updating times, one of the two individuals is chosen to reproduce, the offspring replacing the other individual of the pair. There is no standard choice when it comes to choosing the rate of these exponential times.
Both formulations of the Moran process are Markov chains, either in discrete or continuous time, with the special property of having jumps of ±1 only. These processes are called birth–death processes. The theory of these is well developed, see for example the books of Karlin and Taylor (1975, 1981), Gardiner (2004), or Allen (2011), so that the dynamics of Moran processes are often amenable to analysis (typically by solutions of recursion equations).
To sum up the introduction to these two evolutionary models, the difference between the Wright–Fisher model and the Moran model is the progression of populations in time. In the Wright–Fisher process, generations are nonoverlapping, that is, all individuals update their type at the same time. Therefore, the distribution of types in the offspring generation is binomial. In contrast, generations are overlapping in the Moran model and the dynamics are described by a birth–death process since only one individual is updated at a time.
2.2. Logistic growth
In ecology, one is typically interested in population sizes or densities rather than allele frequencies. The simplest population growth model is that of exponential growth. Obviously, a population cannot grow exponentially forever. Its growth will be limited at some point, for example due to spatial constraints or resource depletion. This form of density regulation suffices to stabilize a population around its carrying capacity, the positive population size at which in the deterministic process the growth rate equals zero.
Here, we give the mechanistic basis that could potentially describe such a process. We denote a single individual of the population by Y. The birth and death processes can then be written as
| (5) |
The parameters β and δ correspond to the rates at which the two reactions happen, that is, each reaction corresponds to an exponential clock with rate either β or δ. For β > δ, the population grows to infinity (exponential growth), whereas for β <δ, it goes extinct.
Population regulation is achieved through a nonlinear term that is interpreted as an interaction between two individuals, for example, competition for space. The corresponding microscopic process is given by
| (6) |
The parameter γ is referred to as the intraspecific competition coefficient, and K is a measure of the number of individuals at carrying capacity. The division by K in the competition rate is accounting for the probability of interaction of two individuals in a well‐mixed population where space is measured by the parameter K so that Y/K becomes a density (or rate of encountering an individual when randomly moving in space). For a more detailed derivation of these type of interaction rates, we refer to Anderson and Kurtz (2015).
The logistic process is, like the Moran process, a birth–death process. We will see in the next section that in the infinite population size limit (we let K tend to infinity), the mechanistic description above yields the logistic equation
| (7) |
where r = β − δ is the per‐capita growth rate, c = (β − δ)/γ is the rescaled carrying capacity, and y = Y/K is the density of the population.
3. INFINITE POPULATION SIZE LIMIT
The microscopic descriptions can be implemented by a stochastic simulation algorithm. Yet, the theoretical analysis of finite size populations can be challenging. A common technique to overcome this challenge is to consider a continuum approximation, that is, studying the limiting model for N (or K) to infinity. The limit is a (stochastic) differential equation of the form
| (8) |
where (Wt)t ≥ 0 is a standard Brownian motion. This equation describes the population dynamics, that is, the macroscopic evolution of a certain model. For a general introduction to stochastic differential equations, see for example the books by Karlin and Taylor (1981) and Gard (1988).
The term μ(xt) is called the infinitesimal mean, that is, the expected change of the stochastic process (xt)t ≥ 0 in a very short (infinitesimal) time interval. It represents the deterministic dynamics of the process. The term σ 2 (xt) is called the infinitesimal variance, that is, the expected variation of the continuum limit in very small time steps. It quantifies the random fluctuations. In the case where σ 2 (x) is zero, the limit is deterministic and Equation 8 reduces to an ordinary differential equation.
One can formally show that the terms μ(x) and σ 2 (x) indeed correspond to the changes of the mean and the variance in infinitesimally small time steps if they are derived formally (Appendix 1). This allows us to compute them by the following identities:
| (9) |
where and denote the expectation and variance of the process xt.
A solution of a stochastic differential equation of the form in Equation 8 is called a diffusion. Another common representation of Equation 8 is the following integral equation
| (10) |
where the stochastic integral is interpreted in the sense of Itô. For a discussion of the different choices of stochastic integrals and their consequences in terms of modeling, we refer for example to Turelli (1977).
We now present how to derive Equation 8 for the three introduced models. The strategy is rather simple: Compute the infinitesimal mean and variance as given in Equation 9 for the individual‐based model.
3.1. Discrete‐time derivation: Wright–Fisher model
For the reason of illustration, we assume a Wright–Fisher model as outlined in Section 2.1 without selection, s = 0. We need to compute the expectations in Equation 9 using the probability distribution given in Equation 2 (with s = 0). Let us ignore the time step Δt for the moment and simply compute the change in expectation and variance from one generation to the other. Writing Xt for the number of individuals of type A and setting ΔXt = Xt − Xt −1, we find
| (11) |
where we used that the number of individuals of a certain type in the next generation is binomially distributed. Analogously, the infinitesimal variance is
| (12) |
It remains to account for the transition from discrete to continuous time. In this case, the natural choice is Δt = N −1. This can be seen by examining the infinitesimal variance. To obtain a limit different from zero or infinity in Equation 12 for N to infinity, we need to divide that equation by N. Comparing Equation 12 to the corresponding line in Equation 9, we set Δt = N −1. Replacing k/N by x and taking the limit N → ∞, we find the neutral Wright–Fisher diffusion for the allele frequency dynamics
| (13) |
This equation is called Wright–Fisher diffusion and describes the dynamics of a neutral allele due to genetic drift. In other words, the allele frequency behaves like a random walk in continuous time and space. A similar derivation as above can be done by including selection and mutation. The more lengthy computation is relegated to Appendix 2.
Conclusion 1: To derive a continuum limit of a finite population size model in discrete time, one computes the infinitesimal mean and variance as given in Equation 9 and rescales time so that the two quantities converge in a meaningful way, that is, do not tend to infinity.
3.2. Continuous‐time derivation: general case
In principle, the same methodology as above is applicable for the derivation of the continuum limit of a process measured in continuous time. However, in view of our subsequent analysis of the limit Equation 8, we will introduce a new tool, the infinitesimal generator.
The change in infinitesimal time of any continuous‐time Markov process (xt)t ≥ 0 can be described by the infinitesimal generator, denoted (Ethier & Kurtz, 1986, chapter 1, eq. (1.10)). Intuitively, one can think of it as the derivative of the expectation (of an arbitrary function) of a stochastic process. Formally, it is defined by
| (14) |
where denotes the conditional expectation of the stochastic process f(xt) at time Δt given the initial value x 0 = x. Here, f is an arbitrary function so that the limit is well‐defined. For example, applying to f(x) = x describes the dynamics of the mean of xt, and for f(x) = x 2, we obtain the dynamics of the second moment of xt. From the first two moments, we can recover the variance, so that from Equation 14, one can derive the infinitesimal mean and variance.
The infinitesimal generator is useful in our context since it can be related to a diffusion process. More precisely, the infinitesimal generator associated with the stochastic diffusion
| (15) |
is given by (we refer to Appendix 1 for a derivation by the Itô formula)
| (16) |
Our strategy is to find a limit of the infinitesimal generator associated with a finite population size process, which corresponds to the form given in Equation 16. We consider a continuous‐time birth–death process with transition rates Tk + and Tk − for 0 ≤ k ≤ N. Due to the exponentially distributed waiting times, the probability for a single update until time t is λt exp(−λt), where λ is the rate of the corresponding exponential clock. Setting x = X/N, the frequency of type‐A individuals, we find the infinitesimal generator for the model with finite population size N, , to be of the form
| (17) |
We have used the Landau notation o(Δt) to summarize processes that scale with order (Δt)1 + ε for any ε > 0. We will also use the Big‐O notation O(Δt) for processes that scale with order Δt or higher. Doing a Taylor expansion for large N and neglecting the terms of order higher than 1/N 2, we find
| (18) |
Translating this equation to a stochastic differential equation, we identify the single components as
| (19) |
Note that we have made no assumption on the dependence of the transition probabilities on the frequency x, such that this approach is applicable for constant selection, linear frequency dependence arising in two player games (Traulsen et al., 2005) or multiplayer games with polynomial frequency dependence (Gokhale & Traulsen, 2010; Peña et al., 2014).
Conclusion 2: For time‐continuous finite population size models with jumps of ±1, that is, a birth–death process, the terms of the continuum limit can be computed by Equation 19.
3.2.1. Example: Moran process with selection and mutation
Returning to our proto‐type processes, we explicitly derive the stochastic differential equation corresponding to the Moran model with selection and mutation. We decouple reproduction and mutation processes, but similar derivations can be made if we assume a coupling of mutations to reproduction events. The selection coefficient is denoted by s, and the mutation rates from type A to B and type B to A are given by uA → B and uB → A, respectively. Then, the transition rates are
| (20) |
Inserting these into Equation 19 yields
| (21) |
Depending on the choice of selection and mutation rates, these equations result in different limits. Typically, one is interested in nontrivial limits for these equations, that is, a limit so that not both components equal zero. Often this can be achieved by rescaling time (Section 3.3) and/or defining the strength of selection and mutation in terms of the population size N. As an example, we will focus on two specific limits: (a) strong selection and strong mutation and (b) weak selection and weak mutation.
Strong selection and mutation
We consider large selection and mutation rates. We assume that s and ui do not depend on N but are constant. To obtain a limit equation for the frequency of individuals of type A, x = X/N, we rescale time by N, that is, t→Nt, which transforms Equation 21 to
| (22) |
The first equation is independent of N and the vanishing variance in the second equation implies that the limit process is deterministic. We find the ordinary differential equation
| (23) |
which describes the change in allele frequency in a population under strong selection and mutation over time.
Weak selection and mutation
In contrast to the previous scenario, we now assume that both selection and mutation are weak. We let s and ui scale inversely with N and define the constants α = sN and νi = uiN. Inserting these into Equation 21 (and here without rescaling time) yields
| (24) |
which gives the diffusion limit
| (25) |
Note that compared to the Wright–Fisher diffusion in Equation 13 this limit has twice as much variance. This difference is explained by the different sampling schemes in the individual‐based description of the model. To see this, we assume no selection, s = 0, and no mutation, uA → B = uB → A = 0.
In the Wright–Fisher process, individuals are updated by binomial sampling. The variance of this sampling procedure is Nx(1 − x), where the factor N vanishes by rescaling the time. This gives σ 2 (x) = x(1 − x).
In the Moran model, or more generally for a birth–death process, the variance is computed by the sum of the transition rates, cf. Equation 19. In our example, both transitions happen at rate 1, which explains the additional factor 2. The difference between the variances is therefore a result of the different sampling schemes of the individual‐based models.
As a consequence of this difference in the variance σ 2 (x) between the two models, the Moran diffusion limit progresses twice as fast as the Wright–Fisher diffusion limit which can be seen by the scaling property of the Brownian motion. In terms of the original discrete processes, this means that N individual jumps, like in the Moran process, accumulate more variance than one update of the whole population, like in the Wright–Fisher process. The sampling therefore determines the variance and consequently the speed of the continuum limit.
Conclusion 3: The Moran process, by definition, has the same mean behavior as the Wright–Fisher model. However, its variance in the diffusion limit is twice the variance of the corresponding Wright–Fisher diffusion. This difference arises from the different sampling schemes of the individual‐based models.
3.3. Change of time scales in the derivation of a continuum limit
In the derivation of the continuum limit, we have repeatedly rescaled time to obtain a nontrivial limit, for example, right before Equations (13), (22) and 22. Rescaling the time speeds up (or slows down) the original process so that the dynamics of interest, for example, allele frequency changes, become observable. For example, if the dynamics were to be very fast in the original process, we would need to slow down time appropriately to observe the changes of the quantity of interest more gradually. In general, we are free to chose any scaling of time. However, it is important to keep in mind the scaling when interpreting results obtained in the limiting process in terms of the original process. Especially so, if one is interested in quantities involving time, for example, fixation or extinction times.
Conclusion 4: Different assumptions on the model dynamics, for example, on selection and mutation, can lead to different continuum limits on the population level. To identify parameter combinations that result in a reasonable continuum limit, one needs to study Equation 19 to match the orders of the scaling parameter. Rescaling time gives an additional degree of freedom when trying to match these orders to obtain a reasonable limit.
4. DIFFUSION APPROXIMATION
We have seen that if we let the population size N tend to infinity, we can derive a (stochastic) differential equation describing the studied evolutionary or ecological process. A natural question that arises is how these results relate to finite population size models. To study this difference between the finite population process and the continuum limit, we consider the logistic growth equation. The transition rates are given by
| (26) |
Repeating the steps from the previous section with y = j/K, we find the following expressions for the infinitesimal mean and variance:
| (27) |
Thus, the classical deterministic logistic equation is obtained in the infinite population size limit, K → ∞:
| (28) |
How well does the finite population size description approximate this deterministic limit? One way to approach this question is to simply not take the limit of K to infinity. The finite population size logistic equation, derived from Equation 27, is then approximated by
| (29) |
where the superscript K in indicates the order of magnitude of the carrying capacity.
The approximation in Equation 29 is called Diffusion approximation. One can prove formally that this approximation, under the assumptions that the function μ(x) and σ 2(x) are (twice) continuously differentiable, performs equally well as a more accurate analysis based on the central limit theorem (Ethier & Kurtz, 1986, Theorem 11.3.2). For a rigorous discussion of diffusion approximations and their relation to the central limit theorem, we refer to Ethier and Kurtz (1986, Chapter 11).
In terms of performance of the diffusion approximation, the population size measure K does not need to be very large for the individual‐based model to approach the deterministic limit (K ≈ 1,000 is enough in this example, Figure 1).
FIGURE 1.

Individual‐based simulations of the logistic growth model. (a) For low population sizes, the individual‐based simulation (solid lines) fluctuates strongly around the deterministic solution of the population (dashed lines) given by Equation 28. (b) Increasing the scaling parameter K, the stochastic fluctuations around the deterministic prediction decrease, until eventually the individual‐based simulation is indistinguishable from the deterministic curve. The parameter values are chosen as follows: β = 2, δ = 1, γ = 1, and (a) K=100, (b) K=1,000. The initial population sizes are stated in subfigure (b)
Conclusion 5: Not taking the limit in Equation 19 yields the diffusion approximation of the studied model. This approximation is a stochastic differential equation (Equation 29) where the variance (typically) scales inversely with the square root of the scaling parameter.
5. STATIONARY DISTRIBUTIONS
For the Moran model, we have derived two different limits that differ in their assumptions on selection and mutation. If both selection and mutation are strong, the infinite population size limit is an ordinary differential equation. For weak selection and weak mutation, we derived a stochastic differential equation. One qualitative difference between these two limits is that trajectories of the deterministic limit will always converge to a fixed point (other limits are possible in general, e.g., limit cycles) while the stochastic differential equation fluctuates indefinitely for positive mutation rates. The deterministic fixed point of the Moran model is given by the solution of Equation 23 equal to zero. In our example, a single fixed point x* lies within the interval between 0 and 1 and is stable. Therefore, all trajectories will approach this value, for example, Figure 2a.
FIGURE 2.

Allele frequency dynamics with selection and mutation. (a) The deterministic system given by Equation 23 converges to the fixed point (dashed line) and remains there. (b) The stochastic process given by Equation 25 fluctuates strongly in frequency space and for the chosen parameter values spends most time close to the monotypic states x = 0 and x = 1
In contrast, Equation 25 is a stochastic equation. Thus, even if the trajectories approach or hit the deterministic fixed point they will not stay there due to the stochasticity of the Brownian motion, cf. Figure 2b. Still, we can make predictions about the time a trajectory spends in certain allele configurations. This information is summarized in the stationary distribution, the stochastic equivalent of a deterministic fixed point. If the initial state of the population is given by the stationary distribution, then the distribution of all future time points will not change. For birth–death processes, the stationary distribution can be calculated based on detailed balance, that is, the incoming and outgoing rates need to be equal for every state of the process (Antal et al., 2009; Claussen & Traulsen, 2005; Gardiner, 2004). Formally, the stationary distribution, denoted ψ, is defined as the solution of
| (30) |
where x 0∼ψ denotes that x 0 is distributed according to ψ and f is an arbitrary function. Importantly, this condition means that the distribution of allele frequencies does not change over time because its derivative in time is zero (for any choice of f). The above equation can be solved with the infinitesimal generator (Etheridge (2012, Chapter 3.6)). The solution is expressed in terms of the speed measure density m(x), which we introduce in Box 1, and is given by
| (31) |
BOX 1. Scale function and speed measure of a one‐dimensional diffusion.
A one‐dimensional stochastic diffusion can be transformed into a standard Brownian motion. Since the Brownian motion is well‐studied, a lot of results can then be translated to the stochastic diffusion by the transformation functions, the scale function, and the speed measure.
First, we rescale the space of the original process by the scale function. It is defined by
| (32) |
where the lower boundaries of the integrals can be chosen arbitrarily. The name of this function derives from the fact that for a one‐dimensional diffusion xt satisfying
| (33) |
the scaled process S(xt) becomes a time‐changed Brownian motion on the interval [S(0), S(1)], that is, there is no deterministic contribution in the scaled process. The process S(xt) is a Brownian motion with a “non‐standard” time scale. To map this time‐changed Brownian motion to the time scale of a standard Brownian motion, one needs to rescale time by the speed measure M. It defines how much faster (or slower) the process S(xt) is evolving compared to a standard Brownian motion. The speed measure is defined by
| (34) |
the density of the speed measure. The time is then rescaled by
Compactly written, we have changed the stochastic diffusion xt to the standard Brownian motion by the following steps:
Intuitively, the speed measure at a point x, m(x), quantifies the time which the process spends in this state. Therefore, ψ(x) is nothing but the average time spent in state x.
Conclusion 6: The stationary distribution of a one‐dimensional diffusion can be expressed in terms of the density of the speed measure m(x) through Equation 31. The density of the speed measure is given by the scale function corresponding to the stochastic diffusion process, Equations (32), (34) and (22), (34) in Box 1.
5.1. Stationary distribution of the Wright–Fisher diffusion
As an example, let us consider the Wright–Fisher diffusion with selection and mutation as derived in Equation 13 (and Equation 25 when derived from the Moran model), that is,
| (35) |
Computing Equation 31 with help of the quantities defined in Box 1, one obtains
| (36) |
where is the Gamma function and 1 F 1 (a, b, z) is the generalized hypergeometric function.
The equation itself does not provide much insight. To illustrate the possible shapes of stationary distributions, we plot several choices of mutation rates and selection coefficients in Figure 3. We see that for higher mutation rates, more probability mass is allocated to intermediate allele frequencies (compare the solid and dashed lines). In this case, the Wright–Fisher diffusion spends more time in states of coexistence than in monotypic states (the boundaries of the allele frequency space in Figure 3) because temporary extinction events are prevented by recurrent mutations. If the mutation rates are asymmetric (dotted line), the stationary distribution is skewed toward the type with the lower mutation rate. If one type is favored selectively (dash‐dotted line), the stationary distribution is skewed toward the favored type.
FIGURE 3.

Stationary distribution of the Wright–Fisher diffusion with selection and mutation. The lines are given by Equation 36. Larger mutation rates accumulate more probability on intermediate allele frequencies (compare solid and dashed lines). Selection (or asymmetric mutation) skews the stationary distribution toward the selectively favored type (or type with the lower mutation rate), dash‐dotted (dotted) line
Stationary distributions are the stochastic equivalents of deterministic fixed points and as such provide a basic description and a starting point for further analysis of the qualitative behavior of a stochastic model, especially in situations where polymorphisms of alleles, coexistence of species, or spatial population distributions are to be expected (e.g., Czuppon & Rogers, 2019; Gaston & He, 2002; Lehmann, 2012; Polansky, 1979; Turelli, 1981).
5.2. Quasi‐stationary distribution of the logistic process
Next, we consider the diffusion approximation of the logistic growth model, that is,
| (37) |
The logistic process for finite K has a (unique) absorbing state, y = 0 because there is no transition from this state to positive population densities. Once the population went extinct, it remains so. Since the extinction state is accessible from all values y > 0 (and because the population size remains finite for all times), the population will go extinct with probability 1. The only stationary distribution is the point measure on 0, that is, ψ ∼ δ 0 (δx is the Dirac measure at point x).
In contrast, the positive deterministic population equilibrium, y* = (β − δ)/γ, is a stable fixed point of the deterministic system. Considering large values of the deterministic equilibrium (), we expect the finite population size process from Equation 37 to remain close to this value for long times. In fact, the expected extinction time of the logistic growth process when started in the positive population equilibrium is of order exp(K) (Champagnat, 2006). This suggests that the process will be in a quasi‐stationary state, that is, before its extinction the population is described by the stationary distribution of the corresponding logistic process conditioned on nonextinction.
Formally, the quasi‐stationary distribution is computed by conditioning the original process on its survival. This means that the transition rates change and the novel process can be analyzed by the techniques described above. However, this method goes beyond the scope of this manuscript. For a theoretical treatment of this topic in the context of the logistic equation, we refer to Cattiaux et al. (2009), Assaf et al. (2010), Méléard and Villemonais (2012). For a general review on methods related to quasi‐stationary distributions, see Ovaskainen and Meerson (2010).
Another way to approximate the quasi‐stationary distribution when extinction is very unlikely for long times (which is the case for large K) is provided by the central limit theorem (sometimes also called linear noise approximation in this context). Here, the distribution of the process is derived from its local dynamics around the deterministic fixed point y* (Ethier & Kurtz, 1986; van Kampen, 2007). The underlying assumption is that the population stays close to its positive steady state and just slightly fluctuates around this value. This is only a valid assumption when the probability of extinction within the studied time frame is essentially zero. These small fluctuations are described by a Gaussian distribution. Formally this translates to
| (38) |
where U is a Gaussian random variable and yt the deterministic trajectory. Writing μ(y) = (β − δ − γy)y and σ 2 (y) = (β + δ + γy)y, the dynamics of U can be rewritten as
| (39) |
Evaluating the process U at , we obtain a description of the variance in the deterministic fixed point. For this fixed choice of yt and , U becomes an Ornstein–Uhlenbeck process. Its stationary distribution is then given by
| (40) |
This distribution describes the fluctuations of the process around the deterministic steady state y*. Therefore, when plugging the distribution ψU into the original process from Equation 38, we find the quasi‐stationary distribution of around the deterministic equilibrium y* which yields
| (41) |
For increasing population sizes K, the variance is decreasing and vanishes in the limit K→∞, as is to be expected by the deterministic limit.
Conclusion 7: If the deterministic process has a stable steady state but is almost surely going extinct for finite population sizes, a quasi‐stationary distribution can be computed to describe the behavior of the process conditioned on survival. If the extinction probability is very low, an approximation of this distribution is given by the linear noise approximation where the variance around the deterministic steady state is modeled by the Ornstein–Uhlenbeck process derived from Equation 39.
6. FIXATION PROBABILITIES
We have seen that stochastic descriptions of processes can lead to outcomes that are different from their deterministic counterparts. Here, we study one of these phenomena: the probability for a certain type to become fixed in a population. For one‐dimensional stochastic differential equations, fixation probabilities can be computed explicitly. As before, we denote by xt the frequency of type A individuals at time t ≥ 0 in the population. We use the fact that the mean of the scaled process S(xt) in Equation 32 does not change over time. Stochastic processes with this property are called martingales. With we find
| (42) |
where we have used the martingale property of the scaled process S(x) in the first equality. The second equality is explained by the process being absorbed at one of the two boundaries x = 0 or x = 1 at large times. For a formal derivation, we refer to Otto and Day (2007, Chapter 15.3.3) or Etheridge (2012, Lemma 3.14).
As an example, let us consider the Wright–Fisher diffusion with selection and without mutations (νA → B = νB → A = 0) given in Equation 35. We have μ(x) = αx(1 − x) and σ 2(x) = x(1 − x) such that the scale function simplifies to
| (43) |
Recalling the definition of α = sN for a finite population of size N and plugging this into Equation 42 yields
| (44) |
which for x 0 = 1/N becomes , the result of Haldane for the fixation of a single mutant copy in a population of size N (Haldane, 1927). The first line of Equation 44, the classical result of fixation probabilities when derived from diffusion theory, and its applicability has been the subject of extensive research, for example, Bürger and Ewens (1995) and references therein; for a more general review on fixation probabilities, we refer to Patwa and Wahl (2008).
Of course, the fixation probability can also be calculated for more complicated stochastic differential equations where the sign of the deterministic dynamics μ(x) depends on the population configuration. Most classically, these frequency‐dependent problems were studied in deterministic evolutionary game theory introduced by Maynard Smith and Price (1973) (see also Hofbauer and Sigmund (1998) for an introduction to evolutionary game dynamics). In Appendix 3, we (re‐)derive the fixation probability in case of frequency‐dependent selection.
Conclusion 8: The fixation probability of a one‐dimensional diffusion is given by the scale function as stated in Equation 42.
7. MEAN TIME TO FIXATION
A related quantity of interest is the expected time to fixation (or extinction from the other type's point of view), that is, the average time of coexistence of two types. Again, the calculation relies on a special function, this time Green's function G(x, y), which can be interpreted as the average time that a diffusion started in x spends in the interval [y, y + dy) before reaching one of the boundaries (Etheridge, 2012, Chapter 3.5). It is therefore also called sojourn time density (Ewens, 2004). It is defined as
| (45) |
where S(x) is the previously defined scale function and m(x) denotes the speed measure density (Box 1).
The expected time to fixation for a process started at frequency x, denoted , is then given by (Ewens, 2004, section 4.4).
| (46) |
This integral corresponds to the summation of the sojourn times in the discrete case, for example, we refer to Ohtsuki et al. (2007) for an application in finite populations. In some cases, the result of this equation yields an analytically tractable result, for example, for the neutral Wright–Fisher diffusion
| (47) |
In this case, the scale function and speed measure density are given by
| (48) |
Then, the expected time to fixation of one of the two alleles can be expressed as
| (49) |
In Appendix 3, we consider the more involved example of frequency‐dependent selection (Altrock & Traulsen, 2009; Pfaffelhuber & Wakolbinger, 2018).
Similarly to fixation probabilities, the mean time to fixation has been studied extensively through stochastic diffusions, for example, Kimura and Ohta (1969). It is especially important in population genetics where one is interested in the time to extinction or fixation of newly arising alleles (van Herwaarden & van der Wal, 2002). In ecology, the mean time to extinction or fixation is for example applied in the context of population extinction (Lande, 1994) and speciation events (Yamaguchi & Iwasa, 2013).
Conclusion 9: Expected unconditional fixation times, that is, the expected time of coexistence of two types in a population, can be calculated by integrating over Green's function (the mean occupation time of a certain frequency until extinction), as shown in Equation 46.
8. DISCUSSION AND CONCLUSION
We have outlined how to derive a stochastic differential equation from an individual‐based description of two classical models in evolutionary theory and theoretical ecology, the Wright–Fisher diffusion and the logistic growth equation. The resulting stochastic differential equations in one dimension describe the dynamics of the allele frequency and population density, respectively. Using probabilistic properties of this equation, that is, transforming it to a standard Brownian motion (Box 1), it is possible to analytically derive the (quasi‐) stationary distribution, fixation probability, and the mean time to fixation. As an example, we derived these quantities for the Wright–Fisher diffusion.
The diffusion process emerges as the infinite population size limit. However, as we have shown in Section 4, one can also derive a finite population size approximation of the dynamics, the diffusion approximation. The fixation probability, mean extinction time, and stationary distribution are accessible by the same means as for the continuum limit. Applications of diffusion approximations are abundant and cover diverse topics (e.g., Assaf & Mobilia, 2011; Constable et al., 2016; Czuppon & Gokhale, 2018; Czuppon & Traulsen, 2018; Débarre & Otto, 2016; Houchmandzadeh, 2015; Kang & Park, 2017; Koopmann et al., 2017; McLeod & Day, 2019; Parsons et al., 2018; Reichenbach et al., 2007; Schenk et al., 2020; Serrao & Täuber, 2017; Traulsen et al., 2005).
Apart from the fixation probability and the mean time to fixation, the (quasi‐)stationary distribution is a commonly used measure to describe stochastic processes. Its calculation through the speed measure of the associated scaled process (Box 1) is (in many cases) numerically straightforward. If the process has an absorbing state, for example, an extinction boundary of the population, the stationary distribution is not meaningful. Here, the quasi‐stationary distribution describes the stationary distribution conditioned on the survival of the population. For negligible extinction probabilities, that is, very large survival probabilities of the population, the functional central limit theorem (or linear noise approximation) can be used to approximate this quasi‐stationary distribution. In the theoretical biology literature, this method is frequently used in models of gene regulatory networks (see Anderson and Kurtz (2015) for a mathematical introduction), and less so in the context of ecology or evolution (e.g., Boettiger et al. (2010); Kopp et al. (2018); Wienand et al. (2018); Czuppon and Constable (2019); and Assaf and Meerson (2017) for a review of the physics literature related to this topic).
Lastly, we did not cover multi‐dimensional or spatially explicit stochastic differential equations in this methods review. These processes are often much more complicated to analyze. Here, we aimed to give a brief introduction into the derivation of a continuum limit from an individual‐based model. We hope, that with our basic comparisons between different approaches used in different subfields of theoretical biology, we help to clarify the common foundation, the individual‐based model, on which these different methods are based.
CONFLICT OF INTEREST
The authors declare no conflict of interest.
AUTHOR CONTRIBUTION
Peter Czuppon: Conceptualization (equal); methodology (lead); writing–original draft (lead); writing–review and editing (equal). Arne Traulsen: Conceptualization (equal); writing–review and editing (equal).
ACKNOWLEDGMENTS
We are very grateful to Florence Débarre who carefully read an earlier version of the manuscript and made numerous suggestions that led to the current version. Both authors appreciate generous funding from the Max Planck Society. PC also received funding from the Agence Nationale de la Recherche, grant number ANR‐14‐ACHN‐0003 provided to Florence Débarre and the European Union's Horizon 2020 research and innovation program under the Marie Skłodowska ‐Curie grant agreement PolyPath 844369.
APPENDIX 1.
INFINITESIMAL MEAN AND VARIANCE
Given the stochastic differential equation
| (50) |
the corresponding infinitesimal generator is defined by
| (51) |
The connection between the infinitesimal generator and its associated stochastic differential equation is outlined in more detail in, for example, Kallenberg (2002, Chapter 23). Briefly, one needs to apply Itô's formula (Kallenberg, 2002, Theorem 17.18) to the process f(xt), where xt solves the stochastic differential equation in Equation 50:
| (52) |
Taking the expectation yields the result since the last term on the right‐hand side vanishes (the mean of a standard Brownian motion is zero).
Additionally, we show that μ(x) is indeed the infinitesimal mean of the stochastic process with infinitesimal generator as given in Equation 51. Setting f 1(x) = x (and thus ) we find for the infinitesimal change of the mean
| (53) |
Similarly, we see that σ 2(x) is the infinitesimal variance. With f 2(x) = x 2, we have
| (54) |
This justifies that calculating the infinitesimal mean and variance (right‐hand sides in Equation 9) indeed yields the functions μ and σ 2 of the diffusion.
APPENDIX 2.
DERIVING A STOCHASTIC DIFFERENTIAL EQUATION FROM THE WRIGHT–FISHER MODEL WITH SELECTION AND MUTATION
In the main text, we have derived the Wright–Fisher diffusion in the absence of selection and mutation. Here, we provide the calculation steps when including both these processes.
We say that type A alleles are beneficial (deleterious) if s > 0 (s < 0). Given that there are k type A individuals in the population, the probability for an offspring to choose a type A individual as a parent is given by
| (55) |
We can also add mutations to the Wright–Fisher model, that is, type A individuals can mutate to type B and vice versa. We set uA → B as the probability to mutate from type A to B and uB → A as the mutation probability from B to A. Then, the probability for an individual to be of type A given k type A individuals in the parental generation reads
| (56) |
In this model, mutation is intimately connected with the reproduction mechanism. For the Moran model, compare Section 3, these processes do not necessarily need to be coupled (even though this would, biologically speaking, make the most sense).
Following the same methodology as for the neutral Wright–Fisher model in the main text, we can derive a diffusion process by computing the infinitesimal mean and variance. Writing xt = Xt/N and setting Δt = 1/N, we obtain for the infinitesimal change in allele frequency
| (57) |
This is a rather unhandy expression. However, we can make further progress by assuming that selection and mutation are weak, that is, we set s = α/N and ui = νi/N. Rewriting the equation in terms of α and νi, expanding the equation in terms of 1/N, and neglecting terms of order 1/N 3 and higher we find
| (58) |
Thus, for the infinitesimal mean in the infinite population size limit, we find
| (59) |
The infinitesimal variance in terms of α and νi derives to
| (60) |
where we used
| (61) |
Therefore, the infinitesimal variance for N → ∞ is given by
| (62) |
Putting together the final results for the infinitesimal mean and variance, we get the weak selection and mutation limit of the Wright–Fisher model with selection and mutation, that is,
| (63) |
APPENDIX 3.
FREQUENCY‐DEPENDENT SELECTION
In the main text, we have exclusively considered situations with constant selection coefficients s (or α). Here instead, we apply the derived formulas for the fixation probability and the mean time to fixation for the case of frequency‐dependent selection. More precisely, we consider a stochastic diffusion with linear frequency dependence (we refer to Traulsen et al. (2006) for a physical formulation of this and Pfaffelhuber and Wakolbinger (2018) for a more general mathematical analysis). We denote the strength of selection by α and let u, v be arbitrary real numbers. We write αx(1 − x)(ux + v) for the linear frequency‐dependent dynamics of selection. Then, the allele frequency evolves according to the following equation
| (64) |
We have μ(x) = αx(1 − x)(ux + v) and σ 2(x) = x(1 − x).
Fixation probability
Recall the formula for the fixation probability, Equation 42
| (65) |
where S(x) is the scale function given in Equation 32 and given by
| (66) |
For α = 1, we can linearize the exponential and write the scale function as
| (67) |
Plugging this into Equation 65, we obtain
| (68) |
In the context of evolutionary game theory, this result is a rederivation of the 1/3‐law (Nowak et al., 2004) (generalized by Lessard & Ladret, 2007). It states that for an allele starting with one individual, it is more likely to become fixed in the population than under neutral dynamics if the deterministic fixed point is smaller than 1/3. This can be seen by plugging in u = a − b − c + d and v = b − d, where a, b, c, and d represent the payoffs of an evolutionary game.
Mean time to fixation
The mean time to fixation is given by Equation 46 that was given as
| (69) |
where G(x,y) is Green's function and defined as (Equation 45)
| (70) |
Similar to the computation of the fixation probability, we will consider the case of small initial frequencies and weak selection, that is, α,x << 1. More precisely, we neglect terms of order α 2 and αx 2. We recall the approximation of the scale function in this case that we derived in Equation 67
| (71) |
Employing these approximations, the first integral in Equation 69 yields
| (72) |
Approximating the second integral in a similar way, we find
Taking these two expressions together we rederived the results already known in the literature (Altrock & Traulsen, 2009; Pfaffelhuber & Wakolbinger, 2018), that is,
| (74) |
for α and x sufficiently small.
Czuppon P, Traulsen A. Understanding evolutionary and ecological dynamics using a continuum limit. Ecol Evol. 2021;11:5857–5873. 10.1002/ece3.7205
DATA AVAILABILITY STATEMENT
No data are used.
REFERENCES
- Allen, J. C. , Schaffer, W. M. , & Rosko, D. (1993). Chaos reduces species extinction by amplifying local population noise. Nature, 364(6434), 229–232. 10.1038/364229a0 [DOI] [PubMed] [Google Scholar]
- Allen, L. (2011). An introduction to stochastic processes with applications to biology (2nd ed.). Pearson Education. [Google Scholar]
- Altrock, P. , & Traulsen, A. (2009). Fixation times in evolutionary games under weak selection. New Journal of Physics, 11(1), 013012. 10.1088/1367-2630/11/1/013012 [DOI] [Google Scholar]
- Anderson, D. , & Kurtz, T. G. (2015). Stochastic analysis of biochemical systems. Springer. [Google Scholar]
- Antal, T. , Nowak, M. , & Traulsen, A. (2009). Strategy abundance in 2£2 games for arbitrary mutation rates. Journal of Theoretical Biology, 257, 340–344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Assaf, M. , & Meerson, B. (2017). WKB theory of large deviations in stochastic populations. Journal of Physics A: Mathematical and Theoretical, 50(26), 263001. 10.1088/1751-8121/aa669a [DOI] [Google Scholar]
- Assaf, M. , Meerson, B. , & Sasorov, P. (2010). Large fluctuations in stochastic population dynamics: Momentum‐space calculations. Journal of StatisticalMechanics, 2010, P07018. 10.1088/1742-5468/2010/07/P07018 [DOI] [Google Scholar]
- Assaf, M. , & Mobilia, M. (2011). Fixation of a deleterious allele under mutation pressure and finite selection intensity. Journal of Theoretical Biology, 275, 93–103. 10.1016/j.jtbi.2011.01.025 [DOI] [PubMed] [Google Scholar]
- Black, A. J. , & McKane, A. J. (2012). Stochastic formulation of ecological models and their applications. Trends in Ecology and Evolution, 27, 337–345. 10.1016/j.tree.2012.01.014 [DOI] [PubMed] [Google Scholar]
- Black, A. , Traulsen, A. , & Galla, T. (2012). Mixing times in evolutionary game dynamics. Physical Review Letters, 109(2), 028101. 10.1103/physrevlett.109.028101 [DOI] [PubMed] [Google Scholar]
- Boettiger, C. (2018). From noise to knowledge: How randomness generates novel phenomena and reveals information. Ecology Letters, 21(8), 1255–1267. 10.1111/ele.13085 [DOI] [PubMed] [Google Scholar]
- Boettiger, C. , Dushoff, J. , & Weitz, J. (2010). Fluctuation domains in adaptive evolution. Theoretical Population Biology, 77(1), 6–13. 10.1016/j.tpb.2009.10.003 [DOI] [PubMed] [Google Scholar]
- Bürger, R. , & Ewens, W. J. (1995). Fixation probabilities of additive alleles in diploid populations. Journal of Mathematical Biology, 33(5), 557–575. 10.1007/BF00163042 [DOI] [Google Scholar]
- Cabrales, A. (2000). Stochastic replicator dynamics. International Economic Review, 41(2), 451–481. 10.1111/1468-2354.00071 [DOI] [Google Scholar]
- Cattiaux, P. , Collet, P. , Lambert, A. , Martínez, S. , Méléard, S. , & San Martín, J. (2009). Quasistationary distributions and diffusion models in population dynamics. The Annals of Probability, 37(5), 1926–1969. 10.1214/09-AOP451 [DOI] [Google Scholar]
- Champagnat, N. (2006). A microscopic interpretation for adaptive dynamics trait substitution sequencemodels. Stochastic Processes and their Applications, 116(8), 1127–1160. 10.1016/j.spa.2006.01.004 [DOI] [Google Scholar]
- Champagnat, N. , Ferrière, R. , & Méléard, S. (2006). Unifying evolutionary dynamics: From individual stochastic processes to macroscopic models. Theoretical Population Biology, 69(3), 297–321. 10.1016/j.tpb.2005.10.004 [DOI] [PubMed] [Google Scholar]
- Claussen, J. , & Traulsen, A. (2005). Non‐Gaussian fluctuations arising from finite populations: Exact results for the evolutionaryMoran process. Physical Review E, 71, 025101(R). [DOI] [PubMed] [Google Scholar]
- Constable, G. , Rogers, T. , McKane, A. , & Tarnita, C. (2016). Demographic noise can reverse the direction of deterministic selection. Proceedings of the National Academy of Sciences of the United States of America, 113(32), E4745–E4754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Crow, J. , & Kimura, M. (1970). An introduction to population genetics theory. Harper and Row. [Google Scholar]
- Czuppon, P. , & Constable, G. (2019). Invasion and extinction dynamics of mating types under facultative sexual reproduction. Genetics, 213(2), 567–580. 10.1534/genetics.119.302306 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Czuppon, P. , & Gokhale, C. (2018). Disentangling eco‐evolutionary effects on trait fixation. Theoretical Population Biology, 124, 93–107. 10.1016/j.tpb.2018.10.002 [DOI] [PubMed] [Google Scholar]
- Czuppon, P. , & Rogers, D. (2019). Evolution of mating types in finite populations: The precarious advantage of being rare. Journal of Evolutionary Biology, 32(11), 1290–1299. 10.1111/jeb.13528 [DOI] [PubMed] [Google Scholar]
- Czuppon, P. , & Traulsen, A. (2018). Fixation probabilities in populations under demographic fluctuations. Journal ofMathematical Biology, 77(4), 1233–1277. 10.1007/s00285-018-1251-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Débarre, F. , & Otto, S. (2016). Evolutionary dynamics of a quantitative trait in a finite asexual population. Theoretical Population Biology, 108, 75–88. 10.1016/j.tpb.2015.12.002 [DOI] [PubMed] [Google Scholar]
- Dieckmann, U. , & Law, R. (1996). The dynamical theory of coevolution: A derivation from stochastic ecological processes. Journal of Mathematical Biology, 34(5–6), 579–612. 10.1007/bf02409751 [DOI] [PubMed] [Google Scholar]
- Etheridge, A. (2012). Some mathematical models from population genetics. Lecture notes in mathematics (119pp.). Berlin Heidelberg: Springer‐Verlag. [Google Scholar]
- Ethier, S. , & Kurtz, T. (1986). Markov processes: Characterization and convergence. Wiley series in probability and mathematical statistics. J. Wiley & Sons. [Google Scholar]
- Ewens, W. (2004). Mathematical population genetics. I. Theoretical Introduction. Springer. [Google Scholar]
- Fisher, R. (1930). The genetical theory of natural selection. Clarendon Press. [Google Scholar]
- Foster, D. , & Young, P. (1990). Stochastic evolutionary game dynamics¤. Theoretical Population Biology, 38(2), 219–232. 10.1016/0040-5809(90)90011-j [DOI] [Google Scholar]
- Fudenberg, D. , & Harris, C. (1992). Evolutionary dynamics with aggregate shocks. Journal of Economic Theory, 57(2), 420–441. 10.1016/0022-0531(92)90044-i [DOI] [Google Scholar]
- Fudenberg, D. , & Imhof, L. A. (2006). Imitation processes with small mutations. Journal of Economic Theory, 131(1), 251–262. 10.1016/j.jet.2005.04.006 [DOI] [Google Scholar]
- García, J. , & Traulsen, A. (2019). Evolution of coordinated punishment to enforce cooperation from an unbiased strategy space. Journal of the Royal Society Interface, 16(156), 20190127. 10.1098/rsif.2019.0127 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gard, T. (1988). Introduction to stochastic differential equations. Marcel Dekker, Inc. [Google Scholar]
- Gardiner, C. (2004). Handbook of stochastic methods (3rd edition). Springer. [Google Scholar]
- Gardner, A. , & West, S. A. (2004). Spite and the scale of competition. Journal of Evolutionary Biology, 17(6), 1195–1203. 10.1111/j.1420-9101.2004.00775.x [DOI] [PubMed] [Google Scholar]
- Gaston, K. J. , & He, F. (2002). The distribution of species range size: A stochastic process. Proceedings of the Royal Society of London. Series B: Biological Sciences, 269(1495), 1079–1086. 10.1098/rspb.2002.1969 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goel, N. , & Richter‐Dyn, N. (1974). Stochastic models in biology. Academic Press. [Google Scholar]
- Gokhale, C. , & Traulsen, A. (2010). Evolutionary games in the multiverse. Proceedings of the National Academy of Sciences of the United States of America, 107, 5500–5504. 10.1073/pnas.0912214107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Haldane, J. (1927). A mathematical theory of natural and artificial selection, part V: Selection and mutation. Proceedings of the Cambridge Philosophical Society, 23(838), 838–844. 10.1017/S0305004100015644 [DOI] [Google Scholar]
- Hofbauer, J. , & Sigmund, K. (1998). Evolutionary games and population dynamics. Cambridge University Press. [Google Scholar]
- Houchmandzadeh, B. (2015). Fluctuation driven fixation of cooperative behavior. Biosystems, 127, 60–66. 10.1016/j.biosystems.2014.11.006 [DOI] [PubMed] [Google Scholar]
- Imhof, L. , & Nowak, M. (2006). Evolutionary game dynamics in a Wright‐Fisher process. Journal of Mathematical Biology, 52, 667–681. 10.1007/s00285-005-0369-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jeltsch, F. , Grimm, V. , Reeg, J. , & Schlägel, U. E. (2019). Give chance a chance: From coexistence to coviability in biodiversity theory. Ecosphere, 10(5), e02700. 10.1002/ecs2.2700 [DOI] [Google Scholar]
- Kallenberg, O. (2002). Foundations of modern probability. Springer Verlag. [Google Scholar]
- Kang, Y.‐G. , & Park, J.‐M. (2017). Demographic‐noise‐induced fixation in subdivided populations with migration. Journal of Physics A: Mathematical General, 50(47), 475001. 10.1088/1751-8121/aa8ce0 [DOI] [Google Scholar]
- Karlin, S. , & Taylor, H. (1975). A first course in stochastic processes (2nd ed.). Academic. [Google Scholar]
- Karlin, S. , & Taylor, H. (1981). A second course in stochastic processes (1st ed.). Academic. [Google Scholar]
- Kaveh, K. , Komarova, N. L. , & Kohandel, M. (2015). The duality of spatial death‐birth and birthdeath processes and limitations of the isothermal theorem. Journal of the Royal Society Open Science, 2(4), 140465. 10.1098/rsos.140465 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kimura, M. (1983). The neutral theory of molecular evolution. Cambridge University Press. [Google Scholar]
- Kimura, M. , & Ohta, T. (1969). The average number of generations until fixation of amutant gene in a finite population. Genetics, 61(3), 763–771. 10.1093/genetics/61.3.763 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Koopmann, B. , Müller, J. , Tellier, A. , & Živković, D. (2017). Fisher‐Wright model with deterministic seed bank and selection. Theoretical Population Biology, 114, 29–39. 10.1016/j.tpb.2016.11.005 [DOI] [PubMed] [Google Scholar]
- Kopp, M. , Nassar, E. , & Pardoux, E. (2018). Phenotypic lag and population extinction in the moving‐optimum model: Insights from a small‐jumps limit. Journal of Mathematical Biology, 77(5), 1431–1458. 10.1007/s00285-018-1258-2 [DOI] [PubMed] [Google Scholar]
- Lande, R. (1994). Risk of population extinction from fixation of new deleterious mutations. Evolution, 48(5), 1460–1469. 10.1111/j.1558-5646.1994.tb02188.x [DOI] [PubMed] [Google Scholar]
- Lande, R. , Engen, S. , & Saether, B.‐E. (2003). Stochastic population dynamics in ecology and conservation. Oxford University Press. [Google Scholar]
- Lehmann, L. (2012). The stationary distribution of a continuously varying strategy in a classstructured population under mutation‐selection‐drift balance. Journal of Evolutionary Biology, 25(4), 770–787. 10.1111/j.1420-9101.2012.02472.x [DOI] [PubMed] [Google Scholar]
- Lessard, S. , & Ladret, V. (2007). The probability of fixation of a singlemutant in an exchangeable selectionmodel. Journal ofMathematical Biology, 54, 721–744. 10.1007/s00285-007-0069-7 [DOI] [PubMed] [Google Scholar]
- Lieberman, E. , Hauert, C. , & Nowak, M. A. (2005). Evolutionary dynamics on graphs. Nature, 433, 312–316. 10.1038/nature03204 [DOI] [PubMed] [Google Scholar]
- Maynard Smith, J. , & Price, G. (1973). The logic of animal conflict. Nature, 246, 15–18. 10.1038/246015a0 [DOI] [Google Scholar]
- McLeod, D. , & Day, T. (2019). Why is sterility virulence most common in sexually transmitted infections? Examining the role of epidemiology. Evolution, 73(5), 872–882. 10.1111/evo.13718 [DOI] [PubMed] [Google Scholar]
- Melbourne, B. A. , & Hastings, A. (2008). Extinction risk depends strongly on factors contributing to stochasticity. Nature, 454(7200), 100–103. 10.1038/nature06922 [DOI] [PubMed] [Google Scholar]
- Méléard, S. , & Villemonais, D. (2012). Quasi‐stationary distributions and population processes. Probability Surveys, 9, 340–410. 10.1214/11-PS191 [DOI] [Google Scholar]
- Moran, P. (1958). Random processes in genetics. Mathematical Proceedings of the Cambridge Philosophical Society, 54(1), 60–71. 10.1017/S0305004100033193 [DOI] [Google Scholar]
- Nowak, M. , Sasaki, A. , Taylor, C. , & Fudenberg, D. (2004). Emergence of cooperation and evolutionary stability in finite populations. Nature, 428, 646–650. 10.1038/nature02414 [DOI] [PubMed] [Google Scholar]
- Ohtsuki, H. , Bordalo, P. , & Nowak, M. (2007). The one‐third law of evolutionary dynamics. Journal of Theoretical Biology, 249, 289–295. 10.1016/j.jtbi.2007.07.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Otto, S. P. , & Day, T. (2007). A biologist’s guide to mathematical modeling in ecology and evolution. Princeton Univ. Press. [Google Scholar]
- Ovaskainen, O. , & Meerson, B. (2010). Stochastic models of population extinction. Trends in Ecology and Evolution, 25(11), 643–652. 10.1016/j.tree.2010.07.009 [DOI] [PubMed] [Google Scholar]
- Parsons, T. , Lambert, A. , Day, T. , & Gandon, S. (2018). Pathogen evolution in finite populations: Slow and steady spreads the best. Journal of the Royal Society Interface, 15(147), 20180135. 10.1098/rsif.2018.0135 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Patwa, Z. , & Wahl, L. (2008). The fixation probability of beneficialmutations. Journal of the Royal Society Interface, 5(28), 1279–1289. 10.1098/rsif.2008.0248 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peña, J. , Lehmann, L. , & Nöldeke, G. (2014). Gains from switching and evolutionary stability in multi‐player matrix games. Journal of Theoretical Biology, 346, 23–33. 10.1016/j.jtbi.2013.12.016 [DOI] [PubMed] [Google Scholar]
- Pfaffelhuber, P. , & Wakolbinger, A. (2018). Fixation probabilities and hitting times for low levels of frequency‐dependent selection. Theoretical Population Biology, 124, 61–69. 10.1016/j.tpb.2018.09.003 [DOI] [PubMed] [Google Scholar]
- Polansky, P. (1979). Invariant distributions for multi‐population models in random environments. Theoretical Population Biology, 16(1), 25–34. 10.1016/0040-5809(79)90004-2 [DOI] [PubMed] [Google Scholar]
- Rand, D. G. , & Nowak, M. A. (2011). The evolution of antisocial punishment in optional public goods games. Nature Communications, 2(1). 10.1038/ncomms1442 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reichenbach, T. , Mobilia, M. , & Frey, E. (2007). Noise and correlations in a spatial population model with cyclic competition. Physical Review Letters, 99, 238105. 10.1103/PhysRevLett.99.238105 [DOI] [PubMed] [Google Scholar]
- Schaffer, W. M. , Ellner, S. , & Kot, M. (1986). Effects of noise on some dynamical models in ecology. Journal ofMathematical Biology, 24(5), 479–523. 10.1007/bf00275681 [DOI] [PubMed] [Google Scholar]
- Schenk, H. , Schulenburg, H. , & Traulsen, A. (2020). How long do Red Queen dynamics survive under genetic drift? A comparative analysis of evolutionary and eco‐evolutionary models. BMC Evolutionary Biology, 20(8). 10.1186/s12862-019-1562-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schreiber, S. J. , Levine, J. M. , Godoy, O. , Kraft, N. J. , & Hart, S. P. (2018). Does deterministic coexistence theory matter in a finite world? bioRxiv. 10.1101/290882 [DOI] [PubMed] [Google Scholar]
- Serrao, S. , & Täuber, U. (2017). A stochastic analysis of the spatially extended may–leonard model. Journal of Physics A: Mathematical and Theoretical, 50(40), 404005. 10.1088/1751-8121/aa87a8 [DOI] [Google Scholar]
- Sigmund, K. , Silva, H. D. , Traulsen, A. , & Hauert, C. (2010). Social learning promotes institutions for governing the commons. Nature, 466(7308), 861–863. 10.1038/nature09203 [DOI] [PubMed] [Google Scholar]
- Taylor, P. , & Maciejewski, W. (2012). An inclusive fitness analysis of synergistic interactions in structured populations. Proceedings of the Royal Society B: Biological Sciences, 279(1747), 4596–4603. 10.1098/rspb.2012.1408 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Traulsen, A. , Claussen, J. , & Hauert, C. (2005). Coevolutionary dynamics: From finite to infinite populations. Physical Review Letters, 95, 238701. [DOI] [PubMed] [Google Scholar]
- Traulsen, A. , Claussen, J. C. , & Hauert, C. (2012). Stochastic differential equations for evolutionary dynamics with demographic noise andmutations. Physical Review E, 85(4), 041901. 10.1103/physreve.85.041901 [DOI] [PubMed] [Google Scholar]
- Traulsen, A. , & Hauert, C. (2009). Stochastic evolutionary game dynamics. In Schuster H. G. (Ed.), Reviews of nonlinear dynamics and complexity (Vol. II, pp. 25–61). Wiley‐VCH. [Google Scholar]
- Traulsen, A. , Pacheco, J. , & Imhof, L. (2006). Stochasticity and evolutionary stability. Physical Review E, 74, 021905. 10.1103/PhysRevE.74.021905 [DOI] [PubMed] [Google Scholar]
- Turelli, M. (1977). Random environments and stochastic calculus. Theoretical Population Biology, 12(2), 140–178. 10.1016/0040-5809(77)90040-5 [DOI] [PubMed] [Google Scholar]
- Turelli, M. (1981). Temporally varying selection on multiple alleles: A diffusion analysis. Journal of Mathematical Biology, 13(1), 115–129. 10.1007/bf00276870 [DOI] [Google Scholar]
- van Herwaarden, O. A. , & van der Wal, N. J. (2002). Extinction time and age of an allele in a large finite population. Theoretical Population Biology, 61(3), 311–318. 10.1006/tpbi.2002.1576 [DOI] [PubMed] [Google Scholar]
- van Kampen, N. (2007). Stochastic processes in physics and chemistry. North Holland. [Google Scholar]
- Vasconcelos, V. V. , Santos, F. P. , Santos, F. C. , & Pacheco, J. M. (2017). Stochastic dynamics through hierarchically embedded markov chains. Physical Review Letters, 118(5), 058301. 10.1103/physrevlett.118.058301 [DOI] [PubMed] [Google Scholar]
- Verhulst, P.‐F. (1838). Notice sur la loi que la population suit dans son accroissement. CorrespondanceMathématique Et Physique Publiée Par A. Quetelet, 10, 113–121. [Google Scholar]
- Wakano, J. , & Lehmann, L. (2014). Evolutionary branching in deme‐structured populations. Journal of Theoretical Biology, 351, 83–95. 10.1016/j.jtbi.2014.02.036 [DOI] [PubMed] [Google Scholar]
- Wienand, K. , Frey, E. , & Mobilia, M. (2018). Eco‐evolutionary dynamics of a population with randomly switching carrying capacity. Journal of the Royal Society Interface, 15(145), 20180343. 10.1098/rsif.2018.0343 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wright, S. (1931). Evolution in mendelian populations. Genetics, 16(2), 97–159. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wu, B. , Gokhale, C. S. , Wang, L. , & Traulsen, A. (2011). How small are small mutation rates? Journal of Mathematical Biology, 64(5), 803–827. 10.1007/s00285-011-0430-8 [DOI] [PubMed] [Google Scholar]
- Yamaguchi, R. , & Iwasa, Y. (2013). First passage time to allopatric speciation. Interface Focus, 3(6), 20130026. 10.1098/rsfs.2013.0026 [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
No data are used.
