Demographic noise can reverse the direction of deterministic selection

George W A Constable; Tim Rogers; Alan J McKane; Corina E Tarnita

doi:10.1073/pnas.1603693113

Proc Natl Acad Sci U S A. 2016 Jul 22;113(32):E4745–E4754. doi: 10.1073/pnas.1603693113

PMCID: PMC4987790 PMID: 27450085

Significance

Demographic stochasticity—the population-level randomness that emerges when the timing of birth, death, and interaction events is unpredictable—can profoundly alter the dynamics of a system. We find that phenotypes that pay a cost to their birth rate to modify the environment by increasing the global carrying capacity can be stochastically selected for, where they would otherwise be deterministically disfavored. Our results hold for a general class of mathematical models but we use a model of public good production for illustration. In this case, demographic stochasticity is exploited by populations of cooperators to turn selection in their favor; it therefore operates as a mechanism that supports the evolution of public good production.

Keywords: stochastic dynamics, nonfixed population size, cooperation, public goods, timescale separation

Abstract

Deterministic evolutionary theory robustly predicts that populations displaying altruistic behaviors will be driven to extinction by mutant cheats that absorb common benefits but do not themselves contribute. Here we show that when demographic stochasticity is accounted for, selection can in fact act in the reverse direction to that predicted deterministically, instead favoring cooperative behaviors that appreciably increase the carrying capacity of the population. Populations that exist in larger numbers experience a selective advantage by being more stochastically robust to invasions than smaller populations, and this advantage can persist even in the presence of reproductive costs. We investigate this general effect in the specific context of public goods production and find conditions for stochastic selection reversal leading to the success of public good producers. This insight, developed here analytically, is missed by the deterministic analysis as well as by standard game theoretic models that enforce a fixed population size. The effect is found to be amplified by space; in this scenario we find that selection reversal occurs within biologically reasonable parameter regimes for microbial populations. Beyond the public good problem, we formulate a general mathematical framework for models that may exhibit stochastic selection reversal. In this context, we describe a stochastic analog to $r - K$ theory, by which small populations can evolve to higher densities in the absence of disturbance.

Over the past century, mathematical biology has provided a framework with which to begin to understand the complexities of evolution. Historically, development has focused on deterministic models (1). However, when it comes to questions of invasion and migration in ecological systems, it is widely acknowledged that stochastic effects may be paramount, because the incoming number of individuals is typically small. The importance of demographic (intrinsic) noise has long been argued for in population genetics; it is the driver of genetic drift and can undermine the effect of selection in small populations (2, 3). This concept has also found favor in game theoretic models of evolution that seek to understand how apparently altruistic traits can invade and establish in populations (4). However, the past decade has seen an increase in the awareness of some of the more exotic and counterintuitive aspects of demographic noise: It has the capacity to induce cycling of species (5), pattern formation (6, 7), speciation (8), and spontaneous organization in systems that do not display such behavior deterministically.

Here we explore the impact of demographic noise on the direction of selection in interactions between multiple phenotypes or species. Historically, a key obstacle to progress in this area has been the analytical intractability of multidimensional stochastic models. This is particularly apparent when trying to investigate problems related to invasion, where systems are typically far from equilibrium. A promising avenue of analysis has recently become apparent, however, through stochastic fast-variable elimination (9, 10). If a system consists of processes that act over very different timescales, it is often possible to eliminate fast modes, assumed to equilibrate quickly in the multidimensional model, and obtain a reduced dimensional description that is amenable to analysis (11). This approach has been used multiple times over the past decade to study a stochastic formulation of the classical Lotka–Volterra competition model for two competing phenotypes/species. In refs. 9, 10, and 12–14, such models were analyzed under the assumption that the dynamics regulating the total population size (birth, death, and competition) occurred on a much faster timescale than the change in population composition. In particular, refs. 9, 10, 12, and 13 have shown that it is possible for systems that appear neutral in a deterministic setting to become nonneutral once stochasticity is included. If the two phenotypes have equal deterministic fitness, but one is subject to a larger amount of demographic noise than the other, then the effect of this noise alone can induce a selective drift in favor of the phenotype experiencing less noise. This result stems from the fact that it is easier to invade a noisy population than a stable one; furthermore, the direction of this induced selection can vary with the system’s state (15). The idea has been further generalized mathematically in ref. 16.

Here we show more generally that not only can stochasticity break deterministic neutrality, but it also has the capacity to reverse the direction of selection predicted deterministically. Thus, whereas in a deterministic setting a certain phenotype will always reach fixation (and is resistant to invasions), in a stochastic setting its counterpart can in fact be more likely to invade and fixate (and be less susceptible to invasions). These results generalize recent work on modified Moran- and Wright–Fisher-type models (17, 18) to a large class of models consisting of two phenotypes interacting with their environment. We begin with the analysis of a prototypical public good model, which is used to illustrate our method. We find that stochastic selection reversal can alleviate the public good production dilemma. We further show how space can amplify this phenomenon, allowing the reversal of selection to emerge over a greater parameter range. Finally, we extend the ideas to a more general model framework and explore the types of system in which we expect this behavior to be relevant. In particular, we discuss the similarities with $r - K$ selection theory (19).

Public Good Model

It is generally accepted that random events play a strong role in the evolution of cooperative behavior, which is deterministically selected against (4). The standard formulation of evolutionary game theory involves setting the problem in terms of a modified Moran model (20, 21). The Moran model is a population genetic model first developed as an abstract illustration of the effect of genetic drift in a haploid population of two phenotypes; an individual is picked to reproduce with a probability proportional to its fitness, whereas simultaneously a second individual is chosen randomly to die (22). Coupling birth and death events keeps the population size fixed, which increases the tractability of the system.

The specification of fixed population size is, however, restrictive and can be problematic. Most prominently, a phenotype with increased fitness can be no more abundant in isolation than its ailing counterpart. Additional difficulties are encountered if one attempts to use simple game-theoretic models to quantitatively understand more complex experimental data. Whereas, for example, assuming some arbitrary nonlinearity in the model’s game payoff matrix may enable experimental findings to be elegantly recapitulated, it is more difficult to justify the origin of these assumptions on a mechanistic level (23). In light of such issues, it has been suggested that a more ecologically grounded take on the dynamics of cooperation might be preferable (24, 25), one in which the population size is not fixed and that is sufficiently detailed that mechanistic (rather than phenomenological) parameters can be inferred experimentally. In the following, we take such an approach. We begin by considering a prototypical model of public good production and consumption.

In our model, we consider a phenotype X having the ability to produce a public good Q that catalyzes its growth. We wish to capture the stochastic dynamics of the system. To this end we assume that the system is described by a set of probability transition rates, which describe the probability per unit time of each reaction occurring:

\begin{array}{l} X \underset{κ / R^{2}}{\overset{b_{x}}{\rightleftarrows}} X + X, \quad X + Q \overset{r / R^{2}}{\to} X + X + Q, \\ X \overset{p_{x}}{\to} X + Q, \quad Q \overset{δ}{\to} \emptyset . \end{array}

[1]

In the absence of the public good, the producer phenotype X reproduces at a baseline birthrate $b_{x} .$ The phenotypes encounter each other and the public good at a rate $R^{- 2};$ the quantity $R^{2}$ can be interpreted as a measure of the area (or volume) to which the system is confined. Death of the phenotype occurs solely due to crowding effects at rate κ, multiplied by the encounter rate. Phenotypes encounter and use the public good at a rate $r / R^{2} .$ We study the case where this reaction is catalytic (i.e., the public good is conserved) and leads to a phenotype reproduction. Examples of catalytic (reusable) public goods are the enzyme invertase produced by the yeast Saccharomyces cerevisiae (26) or the siderophore pyoverdine produced by the bacterium Pseudomonas aeruginosa (27). The total rate at which the phenotype reproduces is thus increased in the presence of the public good. The public good itself is produced by the producer phenotype at a rate $p_{x}$ and decays at a rate δ. Note that as well as controlling the spatial scale of the well-mixed system, the magnitude of R will also control the typical number of individuals in the system, because larger R (more space) allows the population to grow to greater numbers. We next introduce a mutant phenotype Y that does not produce the public good (i.e., $p_{y} = 0$ ); consequently, it has a different baseline birth rate $b_{y}$ that we expect to be at least as high as that of the producer, due to the nonproducer’s reduced metabolic expenditure. Its interactions with the public good are otherwise similar to those of X (Eq. 1).

The state of the system is specified by the discrete variables $n_{x}$ , $n_{y},$ and $n_{q},$ the number of each phenotype and public good, respectively. For the system described, we wish to know the probability of being in any given state at any given time. To answer this, we set up an infinite set of ordinary differential equations (ODEs) [one for each unique state $(n_{x}, n_{y}, n_{q})$ ] that measures the flow of probability between neighboring states (controlled by the transitions in Eq. 1). These equations govern the time evolution of a probability density function $P (n_{x}, n_{y}, n_{q}, t)$ (Eq. S2). Such a model is sometimes termed a microscopic description (28), because it takes account of the dynamics of discrete interactions between the system variables.
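As a rough illustration of what simulating this microscopic description involves, the sketch below runs a Gillespie-type simulation of the transitions in Eq. 1 for both phenotypes. It is a minimal sketch under stated assumptions rather than the authors' simulation code: the function names are hypothetical, the parameter values are the rounded entries of Table S2, and the crowding propensity is written so that an individual is removed through encounters with any other individual, which reproduces the $κ (x + y)$ term of the deterministic limit but is not taken from Eq. S2.

```python
import numpy as np

# Rounded values from Table S2 (rates in s^-1); b_x = b(1 - eps), b_y = b.
B, EPS = 6.94e-6, 0.06
PARAMS = dict(b_x=B * (1 - EPS), b_y=B, r=1.58e-5, kappa=1e-6,
              p_x=1.14e-4, delta=0.002, R=2.0)

# Change in (n_x, n_y, n_q) caused by each of the eight reactions.
CHANGES = np.array([
    [1, 0, 0],    # X -> X + X (baseline birth)
    [1, 0, 0],    # X + Q -> X + X + Q (catalysed birth)
    [-1, 0, 0],   # X removed by crowding
    [0, 1, 0],    # Y -> Y + Y (baseline birth)
    [0, 1, 0],    # Y + Q -> Y + Y + Q (catalysed birth)
    [0, -1, 0],   # Y removed by crowding
    [0, 0, 1],    # X -> X + Q (public good production)
    [0, 0, -1],   # Q -> nothing (public good decay)
])


def propensities(state, p):
    """Rate of each reaction in the current state (same order as CHANGES)."""
    n_x, n_y, n_q = (int(n) for n in state)
    area = p["R"] ** 2
    # ASSUMPTION: an individual is crowded out by any *other* individual,
    # hence the (n_x + n_y - 1) factor. This choice is illustrative only.
    others = max(n_x + n_y - 1, 0)
    return np.array([
        p["b_x"] * n_x,
        p["r"] / area * n_x * n_q,
        p["kappa"] / area * n_x * others,
        p["b_y"] * n_y,
        p["r"] / area * n_y * n_q,
        p["kappa"] / area * n_y * others,
        p["p_x"] * n_x,
        p["delta"] * n_q,
    ], dtype=float)


def gillespie(state, p, t_max, rng):
    """Advance (n_x, n_y, n_q) until t_max or until no reaction can fire."""
    state = np.array(state, dtype=np.int64)
    t = 0.0
    while t < t_max:
        rates = propensities(state, p)
        total = rates.sum()
        if total == 0.0:
            break
        t += rng.exponential(1.0 / total)            # waiting time to next event
        reaction = rng.choice(len(rates), p=rates / total)
        state += CHANGES[reaction]
    return state, t


rng = np.random.default_rng(1)
final_state, final_time = gillespie((50, 1, 0), PARAMS, t_max=1e4, rng=rng)
print(final_state, final_time)
```

Each run yields one stochastic trajectory; estimating an invasion probability would require repeating such runs until one phenotype is lost and averaging over many realizations.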

Although the probabilistic model is straightforward to formalize, it is difficult to solve in its entirety. We apply an approximation that makes the model more tractable, while maintaining the system’s probabilistic nature. Such approximations, which assume that the system under consideration has a large but finite number of individuals, are well practiced and understood (28) and are analogous to the diffusion approximation (22) of population genetics. Assuming that R is large, but finite (which implies a large number of individuals in the system), we transform the system into the approximately continuous variables $(x, y, q) = (n_{x}, n_{y}, n_{q}) / R^{2}$ and expand the partial difference equations in $1 / R^{2} .$ This allows us to express the infinite set of ODEs as a single partial differential equation in four continuous variables, $(x, y, q, t) .$ However, because the partial differential equation (PDE) results from a Taylor expansion, it has infinite order. Truncating the expression after the first term (at order $R^{- 2}$ ), one obtains a deterministic approximation of the dynamics (valid for $R \to \infty$ or equivalently for infinite population sizes). Because we aim to make the system tractable but still retain some stochastic element in the dynamics, we truncate the expansion after the second term (at order $R^{- 4};$ Eq. S4). The resulting model can be conveniently expressed as a set of Itō stochastic differential equations (SDEs):

\begin{array}{l} \dot{x} = x [b_{x} + r q - κ (x + y)] + R^{- 1} η_{x} (t), \\ \dot{y} = y [b_{y} + r q - κ (x + y)] + R^{- 1} η_{y} (t), \\ \dot{q} = p_{x} x - δ q + R^{- 1} η_{q} (t) . \end{array}

[2]

The $η_{i} (t)$ represent Gaussian white noise terms whose correlations depend on the state of the system (the noise is multiplicative). Importantly, because Eq. 2 has been developed as a rigorous approximation of the underlying stochastic model, Eq. 1, the precise functional form of the noise can be determined explicitly, rather than posited on an ad hoc basis (SI Obtaining the SDE System from the Microscopic Individual-Based Model). Taking $R \to \infty,$ the population size increases with the interaction scale and one recovers the deterministic limit. Because Eq. 2 is a coarse-grained approximation of the underlying microscopic model but retains an inherent stochasticity, it is often referred to as the mesoscopic limit (29).

First, we analyze the dynamics of Eq. 2 in the deterministic, $R \to \infty$ limit. There exist three fixed points or equilibria. The first one, at the origin, is always unstable. The remaining fixed points occur when the system contains only a single phenotype: the producer fixed point, $(x, y, q) = (K_{x}, 0, p_{x} K_{x} / δ),$ and the nonproducer fixed point, $(x, y, q) = (0, K_{y}, 0) .$ Thus, $K_{x}$ and $K_{y}$ are measures of the phenotypes’ frequency (carrying capacity) in isolation, with precise forms

K_{x} = \frac{b_{x} δ}{κ δ - p_{x} r}, K_{y} = \frac{b_{y}}{κ} .

[3]

If $b_{y} > b_{x},$ then the nonproducer fixed point is always stable whereas the producer fixed point is always unstable. However, the nonproducer fixed point is globally attracting only if $κ δ > r p_{x} .$ If this condition is not met, then there exist initial conditions for which the producers produce and process the public good faster than they die and faster than the public good degrades, resulting in unbounded exponential growth of the system. This biologically unrealistic behavior arises because we have assumed, for simplicity, that public good uptake does not saturate; we therefore work in the regime $κ δ > r p_{x}$ for the remainder of this paper. Finally, we are interested in systems where the size of the producer population in isolation is larger than that of the nonproducer, $K_{x} > K_{y};$ substituting Eq. 3, this is true if the condition $b_{x} > b_{y} (1 - r p_{x} / (δ κ))$ holds. Thus, deterministically, a nonproducing mutant will always take over a producer population and, due to the absence of the public good, it will yield a smaller population at equilibrium.
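The stability statements above can be checked numerically. The following sketch evaluates Eq. 3 and the per capita growth rate of a rare phenotype at the other phenotype's fixed point, read off from the deterministic part of Eq. 2. The function names are hypothetical and the parameter values are the rounded entries of Table S2, so the numbers are indicative only.

```python
import math


def carrying_capacities(b_x, b_y, r, kappa, p_x, delta):
    """Return (K_x, K_y) from Eq. 3; only valid when kappa*delta > r*p_x."""
    if kappa * delta <= r * p_x:
        raise ValueError("Eq. 3 needs kappa*delta > r*p_x (bounded growth)")
    return b_x * delta / (kappa * delta - p_x * r), b_y / kappa


def invasion_growth_rates(b_x, b_y, r, kappa, p_x, delta):
    """Per capita growth rate of a rare type at the resident's fixed point."""
    K_x, K_y = carrying_capacities(b_x, b_y, r, kappa, p_x, delta)
    q_star = p_x * K_x / delta                   # public good at producer fixed point
    y_into_x = b_y + r * q_star - kappa * K_x    # reduces to b_y - b_x
    x_into_y = b_x - kappa * K_y                 # reduces to b_x - b_y (q = 0 there)
    return y_into_x, x_into_y


# Rounded Table S2 values (s^-1), with b_x = b(1 - eps) and b_y = b.
b, eps = 6.94e-6, 0.06
pars = dict(b_x=b * (1 - eps), b_y=b, r=1.58e-5,
            kappa=1e-6, p_x=1.14e-4, delta=0.002)

K_x, K_y = carrying_capacities(**pars)
y_into_x, x_into_y = invasion_growth_rates(**pars)
print(K_x > K_y, y_into_x > 0, x_into_y < 0)
```

A rare nonproducer grows in a producer population at rate $b_{y} - b_{x} > 0,$ whereas a rare producer declines in a nonproducer population at rate $b_{x} - b_{y} < 0,$ even though $K_{x} > K_{y}$ for these parameters.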

This deterministic analysis predicts, unsurprisingly, that a population composed entirely of nonproducers is the only stable state. We next explore the behavior of the system in Eq. 1 when demographic stochasticity is considered.

Mesoscopic Selection Reversal.

Due to noise, a stochastic system will not be positioned precisely on deterministic fixed points, but rather it will fluctuate around them. In the above system, these fluctuations will occur along the y axis for the nonproducer fixed point whereas in the absence of nonproducers they will occur in the $(x, q)$ plane for the producer fixed point. We can define $N_{x} = R^{2} K_{x}$ and $N_{y} = R^{2} K_{y}$ to be the mean number of the phenotypes X and Y in isolation in the respective stationary states. We assume that the nonproducing phenotype has a greater per capita birth rate than the producer phenotype, i.e., $b_{y} > b_{x},$ and we introduce a single nonproducing mutant into a producer population. Whereas the deterministic theory predicts that the nonproducer should sweep through the population until it reaches fixation, in the stochastic setting fixation of the nonproducer is by no means guaranteed: There is a high probability that the single mutant might be lost due to demographic noise. However, because the nonproducer is deterministically selected for, we might expect the probability of a nonproducer mutant invading and fixating in a resident producer population to be greater than the probability of a producer mutant invading and fixating in a resident nonproducer population. We explore this question below.

To make analytic predictions about the stochastic model, we need to reduce the complexity of the system. This can be done if we use methods based on the elimination of fast variables (30) to obtain an effective one-dimensional description of the system dynamics. To this end, we begin by assuming that the public good production and decay, $p_{x}$ and δ, and the phenotypes’ reproduction and death, $b_{x}$ , $b_{y},$ and κ, occur on a much faster timescale than the rate of change of population composition, which is governed by the difference in birth rates, $b_{x} - b_{y} .$ Essentially this assumption requires that the cost of public good production is marginal. In the case of S. cerevisiae, this assumption is supported by empirical work (Table S2). To mathematically investigate this timescale separation we define

b_{x} = b (1 - ε), b_{y} = b,

[4]

where the parameter ε represents the metabolic cost that X pays for producing the public good. The parameter ε now controls the rate of change of the population composition, and if $1 ≫ ε,$ we have our desired timescale separation in the deterministic system. Because the parameters $K_{x},$ $K_{y},$ $N_{x},$ and $N_{y}$ depend on ε, we find it convenient to define their values when $ε = 0$ as $K_{x}^{(0)},$ $K_{y}^{(0)},$ $N_{x}^{(0)},$ and $N_{y}^{(0)},$ respectively. To maintain our assumption that the composition of the phenotype population changes slowly in the stochastic system, we additionally require that the noise is small. However, this assumption has already been implicitly made in the derivation of Eq. 2, where it is assumed that R is large, and thus $R^{- 1},$ the prefactor for the noise terms, is small. To formalize this, we find it convenient to assume $R^{- 2} \approx O (ε) .$

Table S2.

List of parameters used in the simulation, with the exception of $p_{x},$ $p_{u},$ ε, m, and D, which are varied

Parameter	Value	Justification
σ	4,000	Assumed parameter. Presence of 4,000 invertase molecules required for yeast reproduction.
$p_{y}$	0	True nonproducer does not produce invertase.
$p_{x}$	$1.14 \times 10^{- 4} s^{- 1}$ $(0.41 h^{- 1})$	Experimental value of molecular invertase production rate (Table S1) scaled by σ (Eq. S88).
$p_{u}$	$1.2 \times 10^{- 4} s^{- 1}$ $(0.43 h^{- 1})$	Leads to factor 1.7 increase in the steady-state invertase from producing to hyperproducing population, consistent with ref. 33.
b	$6.94 \times 10^{- 6} s^{- 1}$ $(0.025 h^{- 1})$	Small baseline yeast birth rate assumed.
r	$1.58 \times 10^{- 5} s^{- 1}$ $(0.057 h^{- 1})$	Chosen to give per capita yeast reproduction rate $(b + r q) \approx λ_{exp}$ when system is entirely producers (Table S1).
δ	$0.002 s^{- 1}$	Taken from experimentally measured values (Table S1).
κ	$1 \times 10^{- 6} s^{- 1}$	Suggested parameter for illustrating effects in this work; restricted by $δ κ > p_{i} r,$ $i = x, y, u .$
R	2	Suggested parameter for illustrating effects in this work.
ε	0.06	Taken from experiments (Table S1).
$N_{y}$	28	Eq. S20.
$N_{x}$	302	Eq. S20.
$N_{u}$	499	See Eq. S20 for $N_{x}$ and substitute $p_{u}$ for $p_{x} .$
$L_{p}$	$67 μ m$	Eq. S92.
m	$3.4 \times 10^{- 7} s^{- 1}$	Yields a migration to birth-rate ratio between $m / b = 4.9 \times 10^{- 2}$ (all nonproducers) and $m / (b + r q) = 4.5 \times 10^{- 3}$ (all producers).
D	$2.22 \times 10^{- 5} s^{- 1}$	Obtained using experimental value $D_{exp}$ from Table S1 and Eq. S93.

Under the above assumptions, the system features a separation of timescales. Next, we take advantage of this timescale separation to reduce the complexity of the system. Deterministically, the existence of a set of fast timescales suggests the existence of a lower-dimensional subspace, the slow manifold (SM), shown in Fig. 1A, to which the system quickly relaxes, and along which it slowly moves, until it reaches the system’s stable fixed point. This behavior can be exploited if we assume that the system reaches the SM instantaneously. We can then describe the dynamics of the entire system in this lower-dimensional space and thus reduce the number of variables in our description of the deterministic system. However, we are interested in the stochastic dynamics.

Fig. 1. — System dynamics in the phenotype plane. Deterministic trajectories are shown as gray arrows. (A) Trajectories rapidly collapse to the slow manifold (SM; black dashed line), before slowly moving to the nonproducing Y fixed point. Stochastic trajectories (histogram overlaid in orange) remain in the region of the SM but may fluctuate away from it. (B) Illustration of the origin of noise-induced selection. The orange ellipse depicts the SD of Gaussian fluctuations originating at its center. Fluctuations (black dashed arrows) to points α are equally likely; however, when projected back to the center manifold (CM; black dashed line) to points β, a bias for producing the X phenotype is observed. Parameters used are $p_{x} = 9.5 \times 10^{- 4},$ $ε = 0.08$ in A, $ε = 0$ in B, and the remaining parameters are given in Table S2.

The stochastic trajectories initially collapse to the region around the SM, about which they are confined, but along which they can move freely until one of the phenotypes fixates (Fig. 1A). Fluctuations that take the system off the SM are quickly quashed back to another point on the SM; however, the average position on the SM to which a fluctuation returns is not necessarily the same as that from which the fluctuation originated. A crucial element of the dynamics in this stochastic setting is that the form of the noise, combined with that of the trajectories back to the SM, can induce a bias in the dynamics along the SM (Fig. 1B and Fig. S1). This bias is the origin of the stochastic selection reversal that we explore. To capture this behavior while simultaneously removing the fast timescales in the stochastic system, we map all fluctuations off the SM along deterministic trajectories back to the SM (30). This procedure essentially assumes that any noisy event that takes the system off the SM is instantaneously projected back to another point on the SM.

Fig. S1. — Illustration of the origin of stochastically induced drift along the CM. The gray dashed line shows the form of the deterministic CM, which intersects the x axis at a higher value than the y axis (phenotype X has a higher carrying capacity due to the production of the public good). The orange ellipse illustrates the form of the Gaussian noise centered on the point $x^{(0)}$ on the CM. Fluctuations in the population are equally likely to increase or decrease the frequency of the Y phenotype to the points $x^{(1)} .$ Away from the CM, the deterministic pressure to the CM becomes prominent, forcing the system along quasi-deterministic trajectories back to the CM, at the points $x^{(2)} .$ The resulting distribution of $x^{(2)}$ does not have a mean centered on $x^{(0)} .$ Rather, the distribution is shifted, inducing a drift in favor of the producing X phenotype.

For clarity, we briefly describe the dynamics when $ε = 0.$ In this case the birth rates of phenotypes X and Y are identical. Instead of the two nonzero fixed points, $K_{x}$ and $K_{y},$ found above, the deterministic system now has a line of fixed points, referred to as a center manifold (CM) (31). The CM is identical to the SM in the limit $ε \to 0 .$ It is given by

y = \frac{K_{y}^{(0)}}{K_{x}^{(0)}} (K_{x}^{(0)} - x), q = \frac{p_{x}}{δ} x,

[5]

and shown graphically in Fig. 1B. The separation of timescales in the system is now at its most pronounced, because there are strictly no deterministic dynamics along the CM following the fast transient to the CM. However, the stochastic system still features dynamics along the CM. Applying the procedure outlined in ref. 30, we arrive at a description of the stochastic dynamics in a single variable, the frequency of producers along the CM,

\dot{x} = \frac{b}{R^{2}} x (1 - \frac{x}{K_{x}^{(0)}}) ℱ (x) + \frac{1}{R} ζ (t),

[6]

where

ℱ (x) = 2 (\frac{K_{x}^{(0)} - K_{y}^{(0)}}{{(K_{x}^{(0)} K_{y}^{(0)})}^{2}}) [K_{x}^{(0)} K_{y}^{(0)} + (K_{x}^{(0)} - K_{y}^{(0)}) x] .

Here $ζ (t)$ is a Gaussian white noise term with a correlation structure given in Eq. S18. Together with Eq. 5, Eq. 6 approximates the dynamics of the entire system. Note that whereas Eq. 6 predicts a noise-induced directional drift along the CM [controlled by $ℱ (x)$ ], a deterministic analysis predicts no dynamics, because the CM is by definition a line of fixed points. This directional drift along the CM results from the projection bias illustrated in Fig. 1B. If $p_{x} > 0,$ then $K_{x}^{(0)} > K_{y}^{(0)},$ and so $ℱ (x) > 0;$ thus the public good production by phenotype X induces a selective pressure that selects for X along the center manifold.
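To make the sign argument concrete, the sketch below evaluates $ℱ (x)$ and the deterministic part of Eq. 6 along the CM. The function names are hypothetical, the noise term $ζ (t)$ is omitted because its correlation structure (Eq. S18) is not reproduced here, and the parameter values are the rounded entries of Table S2 with $ε = 0.$

```python
def noise_induced_selection(x, K_x0, K_y0):
    """The function F(x) appearing in Eq. 6."""
    diff = K_x0 - K_y0
    return 2.0 * diff / (K_x0 * K_y0) ** 2 * (K_x0 * K_y0 + diff * x)


def neutral_drift(x, b, R, K_x0, K_y0):
    """Drift term of Eq. 6 (the noise term zeta(t) is deliberately left out)."""
    return b / R**2 * x * (1.0 - x / K_x0) * noise_induced_selection(x, K_x0, K_y0)


# Rounded Table S2 values (s^-1) with eps = 0, so that b_x = b_y = b.
b, r, kappa, p_x, delta, R = 6.94e-6, 1.58e-5, 1e-6, 1.14e-4, 0.002, 2.0
K_x0 = b * delta / (kappa * delta - p_x * r)   # Eq. 3 at eps = 0
K_y0 = b / kappa

for fraction in (0.1, 0.5, 0.9):
    x = fraction * K_x0
    print(fraction, noise_induced_selection(x, K_x0, K_y0),
          neutral_drift(x, b, R, K_x0, K_y0))
```

The drift vanishes at $x = 0$ and $x = K_{x}^{(0)}$ and is positive in between whenever $K_{x}^{(0)} > K_{y}^{(0)};$ setting the two carrying capacities equal makes $ℱ (x)$ vanish identically.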

The origin of the term $ℱ (x)$ in Eq. 6 can be understood more fully by exploring its implications for the invasion probabilities of X and Y, denoted $ϕ_{x}$ and $ϕ_{y} .$ These can be straightforwardly calculated because the system is one dimensional (SI Probability of Fixation for the Reduced Public Good Model). We find

ϕ_{x} = \frac{1}{N_{y}}, \quad \text{and} \quad ϕ_{y} = \frac{1}{N_{x}},

[7]

where $ϕ_{x} > ϕ_{y}$ as long as $p_{x} > 0$ (Eq. 3). The term $ℱ (x)$ can thus be interpreted as resulting from the stochastic advantage the producers have at the population level from reaching higher carrying capacities in isolation, which makes them more stochastically robust to invasion attempts. This result is independent of the spatial scale R (and therefore of the population size) as long as R is finite.

If $ε \neq 0,$ the system does not collapse to the CM, but rather to the SM. At leading order in ε, the equation for the SM is given by Eq. 5. Upon removing the fast dynamics, the effective dynamics of x can now be shown to take the form (Eq. S23)

\dot{x} = b x (1 - \frac{x}{K_{x}^{(0)}}) (\frac{1}{R^{2}} ℱ (x) - ε) + \frac{1}{R} ζ (t),

[8]

where $ζ (t)$ and $ℱ (x)$ are the same as in Eq. 6. The SDE now consists of two components. The deterministic contribution, governed by ε, exerts a selective pressure against phenotype X, due to its reduced birth rate. The stochastic term $ℱ (x)$ exerts a pressure in favor of phenotype X, resulting, as in the case $ε = 0$ discussed above, from the producers’ stochastic robustness to invasions.

Thus, when $ε > 0,$ a trade-off emerges in the stochastic system between the stochastic advantage to public good production (due to increased population sizes) and the deterministic cost producers pay (in terms of birth rates). If the birth costs are not too high, producers will be selected for, which constitutes a reversal in the direction of selection from the deterministic prediction. Specifically, we can calculate the condition on the metabolic cost that ensures that the producers are fitter than the nonproducers (i.e., $ϕ_{x} > ϕ_{y}$ ):

ε < \frac{κ}{b R^{2}} \log \left[ \frac{δ κ}{δ κ - p_{x} r} \right] .

[9]

With no metabolic cost, producers have a stochastic advantage regardless of typical population size (Eq. 7). For nonzero production costs, however, the population must be sufficiently small that stochastic effects, governed by $R^{- 2},$ are dominant. Fig. 2 and Fig. S2 show that the theory predicts well the trade-off in the underlying stochastic model (Eq. 1).
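The threshold in Eq. 9 is straightforward to evaluate. The sketch below does so with the rounded parameter values of Table S2; the function name is hypothetical. Because $δ κ - p_{x} r$ is a small difference between two nearly equal numbers, the result is sensitive to rounding in the tabulated values and should be read as indicative. With these inputs the threshold comes out at roughly 0.08, above the tabulated cost $ε = 0.06.$

```python
import math


def critical_cost(b, r, kappa, p_x, delta, R):
    """Right-hand side of Eq. 9: largest cost eps for which phi_x > phi_y."""
    if delta * kappa <= p_x * r:
        raise ValueError("Eq. 9 needs delta*kappa > p_x*r")
    return kappa / (b * R**2) * math.log(delta * kappa / (delta * kappa - p_x * r))


# Rounded Table S2 values (rates in s^-1).
table_s2 = dict(b=6.94e-6, r=1.58e-5, kappa=1e-6, p_x=1.14e-4, delta=0.002, R=2.0)

eps_critical = critical_cost(**table_s2)
print(f"critical cost: {eps_critical:.3f}; tabulated cost: 0.06")

# Doubling R quarters the threshold, so reversal needs a smaller cost.
print(critical_cost(**{**table_s2, "R": 4.0}) / eps_critical)
```

The $R^{- 2}$ prefactor is what restricts selection reversal to small populations: increasing R shrinks the range of costs for which producers are favored.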

Fig. 2. — Stochasticity can render nonproducers more susceptible to invasion by producers than vice versa. Shown are plots of the difference in invasion probabilities between producers X and nonproducers Y as a function of the cost to birth for production, ε, and good production rate $p_{x} .$ The remaining parameters are taken from Table S2. (Left) Analytic results for a single small patch (Eq. 9). The critical cost ε for selection reversal, Eq. 9, is shown here as a black dashed line. (Right) Results from Gillespie simulations (46) of the stochastic process Eq. 1, averaged over 2,000 runs.

Fig. S2. — Illustration of the larger range of values for the parameter ε over which the approximation Eq. S28 is accurate. Parameters are given in Table S2, with the exception of $p_{x}$ and ε, which are varied. Note that this figure is similar to Fig. 2, in the main text, but plotted over a greater range of ε and $ϕ_{x} - ϕ_{y} .$ The parameter region plotted in black is that for which $κ δ > r p_{x} .$

We have shown that stochastic selection reversal is more prevalent when R is not large. Meanwhile our analytic results have been obtained under the assumption that R is large, which allowed us to use the diffusion approximation leading to Eq. 2 and aided the timescale elimination procedure that yielded Eq. 8. We therefore expect that although stochastic selection reversal will become more prominent as R is reduced, the quality of our analytic predictions may suffer. Despite this caveat, it is the small R regime, in which stochastic selection reversal is a more prominent force, that is interesting to us. Small values of R are associated with small population sizes. Although it is conceivable that populations of macroorganisms may consist of a small number of individuals, this limit is not so pertinent to the study of microorganisms. In the next section, however, we show that by incorporating space, the constraint of small population size can be relaxed.

Spatial Amplification

In this section we consider a metapopulation on a grid: Each subpopulation (patch) has a small size so that demographic noise continues to be relevant locally, but the number of subpopulations is large so that the overall population in the system is large. This method of incorporating demographic stochasticity into spatial systems has proved to be successful in the modeling of microbial populations (7). We consider a grid of C patches. The dynamics within each patch are given by the transitions in Eq. 1 and coupled to the surrounding patches by the movement of the phenotypes and public good. A patch will produce migrants at a rate proportional to its density. Producers X and nonproducers Y disperse with a probability rate m to a surrounding region, whereas the public good diffuses into neighboring regions at a rate D. Once again the diffusion approximation can be applied to obtain a set of SDEs approximating the system dynamics,

\begin{array}{l} \frac{d x_{i j}}{d τ} = x_{i j} (b_{x} + r q_{i j} - κ (x_{i j} + y_{i j})) + m {(L x)}_{i j} + \frac{η_{x i j} (t)}{R}, \\ \frac{d y_{i j}}{d τ} = y_{i j} (b_{y} + r q_{i j} - κ (x_{i j} + y_{i j})) + m {(L y)}_{i j} + \frac{η_{y i j} (t)}{R}, \\ \frac{d q_{i j}}{d τ} = p_{x} x_{i j} - δ q_{i j} + D {(L q)}_{i j} + \frac{η_{q i j} (t)}{R}, \end{array}

[10]

where $i j$ is the patch on row i and column j. The operator L is the discrete Laplacian operator ${(L x)}_{i j} = - 4 x_{i j} + x_{(i - 1) j} + x_{(i + 1) j} + x_{i (j - 1)} + x_{i (j + 1)} .$ If $b_{y} > b_{x},$ the deterministic dynamics predict that the producers will always go extinct.
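As an illustration of how the operator L enters Eq. 10, the following is a minimal sketch of the deterministic part of the dynamics (the $R \to \infty$ limit, with the noise terms dropped). Periodic boundary conditions, the explicit Euler step, and the function names `laplacian`, `deterministic_rhs`, and `euler_step` are assumptions made for this example; the text does not specify the boundary treatment or an integration scheme.

```python
import numpy as np

def laplacian(field):
    """Five-point discrete Laplacian (L x)_ij on a 2D grid.

    Periodic boundaries are an assumption of this sketch.
    """
    return (-4.0 * field
            + np.roll(field, 1, axis=0) + np.roll(field, -1, axis=0)
            + np.roll(field, 1, axis=1) + np.roll(field, -1, axis=1))

def deterministic_rhs(x, y, q, *, b_x, b_y, r, kappa, p_x, delta, m, D):
    """Right-hand side of Eq. 10 without the noise terms (R -> infinity)."""
    crowding = kappa * (x + y)                       # shared competition term
    dx = x * (b_x + r * q - crowding) + m * laplacian(x)
    dy = y * (b_y + r * q - crowding) + m * laplacian(y)
    dq = p_x * x - delta * q + D * laplacian(q)
    return dx, dy, dq

def euler_step(x, y, q, dt, **params):
    """One explicit Euler step of the deterministic system."""
    dx, dy, dq = deterministic_rhs(x, y, q, **params)
    return x + dt * dx, y + dt * dy, q + dt * dq
```

With spatially uniform fields the Laplacian terms vanish, so each patch follows the well-mixed deterministic dynamics, and for $b_{y} > b_{x}$ the producer fraction decreases at every step.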

First, we discuss some important limiting-case behavior for this system. In the limit of large dispersal rate m and diffusion rate D, the stochastic system behaves like a well-mixed population with a spatial scale $C R^{2}$ (i.e., the spatial structure is lost). In this case, as the size of the spatial system is increased, the effective population size also increases, and as a consequence selection reversal for producing phenotypes becomes less likely (Eq. 9).

We next consider the low-dispersal, zero diffusion limit. For sufficiently low dispersal, any incoming mutant will first either fixate or go to extinction locally before any further dispersal event occurs. Because each dispersal/invasion/extinction event resolves quickly, at the population level, the system behaves like a Moran process on a graph (4), with each node representing a patch. The “fitness” of a patch is the probability that it produces a migrant and that that migrant successfully invades a homogeneous patch of the opposite type, following the approach used in ref. 17. Denoting the fitness of producing and nonproducing patches by $W_{x}$ and $W_{y},$ respectively, we have

W_{x} = m N_{x} ϕ_{x}, W_{y} = m N_{y} ϕ_{y},

[11]

where $N_{i}$ ( $i = x, y$ ) is the mean carrying capacity of phenotype i in a homogeneous patch, and $ϕ_{i}$ are the invasion probabilities of a type i mutant in a type $j \neq i$ patch. The fixation probabilities of a homogeneous patch in a population of the opposite phenotype can now be calculated using standard results (4). Let $ρ_{i}$ ( $i = x, y$ ) denote the probability that type i takes over the metapopulation when starting from one patch of type i in a population otherwise composed entirely of patches of the opposite phenotype. Then

ρ_{i} = \frac{1 - r_{i}^{- 1}}{1 - r_{i}^{- C}}, for i = x, y and r_{x} = \frac{W_{x}}{W_{y}}, r_{y} = \frac{W_{y}}{W_{x}} .

[12]

If we start from a single invading mutant, the probability that it takes over the entire population (i.e., its invasion probability) is the product of the probability that it takes over its home patch, $ϕ_{i}$ , and the probability that the newly invaded home patch fixates in the metapopulation, $ρ_{i}$ :

Π_{x} = ϕ_{x} ρ_{x}, Π_{y} = ϕ_{y} ρ_{y} .

[13]

In the infinite patch limit ( $C \to \infty$ ), $ρ_{x}$ and $ρ_{y}$ depend on $r_{x},$ the patch fitness ratio defined in Eq. 12. If $r_{x} > 1,$ $ρ_{x} \to 1 - r_{x}^{- 1}$ and $ρ_{y} \to 0$ , whereas if $r_{x} < 1,$ the converse is true. This means that, in the infinite patch, low dispersal, zero diffusion limit, the condition for the stochastic reversal of selection is weakened from $ϕ_{x} > ϕ_{y}$ to

N_{x} ϕ_{x} > N_{y} ϕ_{y} .

[14]
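Eqs. 11–14 can be evaluated numerically with a short sketch such as the one below. The function names and the parameter values in the usage lines are hypothetical and chosen only for illustration. For $r < 1$ the expression in Eq. 12 is rearranged (numerator and denominator multiplied by $r^{C}$) so that large C does not cause a floating-point overflow.

```python
def patch_fitness(m, N, phi):
    """Eq. 11: patch fitness W = m * N * phi."""
    return m * N * phi

def metapop_fixation(r, C):
    """Eq. 12: rho = (1 - r**-1) / (1 - r**-C) for fitness ratio r and C patches."""
    if abs(r - 1.0) < 1e-12:
        return 1.0 / C                      # neutral limit of Eq. 12
    if r > 1.0:
        # r**(-C) underflows harmlessly to 0.0 for large C
        return (1.0 - 1.0 / r) / (1.0 - r ** (-C))
    # r < 1: equivalent form obtained by multiplying through by r**C
    return r ** (C - 1) * (1.0 - r) / (1.0 - r ** C)

def invasion_probabilities(m, N_x, N_y, phi_x, phi_y, C):
    """Eq. 13: Pi_i = phi_i * rho_i for producers (x) and nonproducers (y)."""
    W_x = patch_fitness(m, N_x, phi_x)
    W_y = patch_fitness(m, N_y, phi_y)
    rho_x = metapop_fixation(W_x / W_y, C)
    rho_y = metapop_fixation(W_y / W_x, C)
    return phi_x * rho_x, phi_y * rho_y

# Hypothetical values: phi_x < phi_y (no reversal in a single patch),
# but N_x * phi_x > N_y * phi_y, so Eq. 14 is satisfied.
Pi_x, Pi_y = invasion_probabilities(
    m=1e-5, N_x=120, N_y=100, phi_x=0.009, phi_y=0.010, C=10**4
)
```

In this example the producers have the smaller within-patch invasion probability, yet their metapopulation invasion probability is the larger of the two, which is the weakening of the reversal condition described above.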

Spatial structure therefore has the ability to enhance the stochastic reversal observed in the small well-mixed system. An approximate analytic form for the above condition can be obtained in terms of the original parameters:

ε < 2 \frac{κ}{b R^{2}} \log \left[ \frac{δ κ}{δ κ - p_{x} r} \right] .

[15]
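The bound in Eq. 15 is straightforward to evaluate. The sketch below is a direct transcription of that expression; the function name `max_cost_spatial` is hypothetical, and the check that $δ κ > p_{x} r$ is added here only so that the logarithm is well defined.

```python
import math

def max_cost_spatial(kappa, b, R, delta, p_x, r):
    """Upper bound on the production cost eps given by Eq. 15."""
    gap = delta * kappa - p_x * r
    if gap <= 0:
        # Outside this range the logarithm in Eq. 15 is not defined.
        raise ValueError("Eq. 15 requires delta * kappa > p_x * r")
    return 2.0 * kappa / (b * R ** 2) * math.log(delta * kappa / gap)
```

The bound scales as $R^{- 2}$, so doubling R reduces the tolerable cost by a factor of four, and it grows with the production rate $p_{x}$ .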

Once again, our analytical results are well supported by simulations (Fig. 3). The critical production rate for the invasion probability of producers to exceed that of nonproducers has been decreased, as predicted by Eqs. 9 and 15. Producers can therefore withstand higher production costs in spatially structured environments.

It is important to note that whereas Eq. 14 depends on the mean number of producers and nonproducers on a homogeneous patch ( $N_{x}$ and $N_{y}$ ), it is independent of the number of individuals in the entire metapopulation in the large C limit. The interaction between these two spatial scales leads to results that can appear counterintuitive. Demographic noise, as we have discussed, leads to producing patches being “more fit” at the patch level (Eq. 11). However, when a large number of patches are considered, the demographic noise at the metapopulation level is reduced. This leads to the system following trajectories that appear deterministic at the level of the metapopulation, even though the path they follow is entirely the result of demographic stochasticity at the within-patch level (Fig. 4). Movie S1 displays the individual dynamics of the patches that compose the trajectory illustrated in Fig. 4.

Fig. 4. — Demographic stochasticity at the local “patch” scale profoundly alters the system dynamics at the population level. Results are obtained from stochastic and deterministic ( $R \to \infty$ ) simulations of Eq. 10 with a grid of $C = 100 \times 100$ patches, $p_{x} = 1 \times 10^{- 4},$ $ε = 0.02,$ and $m = 3.7 \times 10^{- 5}$ and the remaining parameters are taken from Table S2. Initial conditions are a single producer and nonproducer on each patch. The initial (fast) transient collapse to the SM occurs in the shaded gray region. Following this, the deterministic system slowly moves along the SM until the nonproducers fixate, whereas in the stochastic system, the producers experience a selective pressure in their favor. For dynamics at the patch level, see Movie S1.

Away from the small dispersal, zero diffusion limit, the dramatic selection reversal predicted by the analytical results is clearly weakened (Fig. 3). Although selection reversal is still found across a range of m and D values, if either dispersal or diffusion is too high, the selection reversal breaks down. It is therefore important to understand what order of magnitude estimates for the values of m and D may be biologically reasonable.

Insights from S. cerevisiae.

In the following section, we attempt to contextualize our model with reference to an S. cerevisiae yeast system, which has been previously identified as a biological example of a population that features public good producers and nonproducers. The model we have presented is general and therefore cannot capture the full biological detail of this particular system. For instance, it has been noted that some degree of privatization of the public good occurs in even the well-mixed experimental system (23), a behavior we do not consider in our model. However, setting our model in this context can provide some insight into the scenarios in which we might expect stochastic selection reversal to be a biologically relevant phenomenon.

An S. cerevisiae yeast cell metabolizes simple sugars, such as glucose, to function. However, when simple sugars are scarce, the yeast can produce invertase, an enzyme that breaks down complex sugars, such as sucrose, to release glucose (32). Invertase is produced at a metabolic cost and, because digestion of sucrose occurs extracellularly, most of the benefits of its production are shared by the population. Specifically in the case of S. cerevisiae, $SUC2$ , the wild-type strain, produces invertase, whereas the laboratory-cultured mutant $suc2$ does not (33). In terms of our model parameters, the baseline birth rates, $b_{x}$ and $b_{y},$ represent, respectively, $SUC2$ and $suc2$ reproduction in the absence of invertase. This could be understood as arising from yeast directly metabolizing sucrose [a less energetically beneficial metabolic route (32)] or as the result of some extrinsically imposed low glucose concentration in the system. The rate r would then represent the additional birth rate in the presence of invertase. The form of our specified reactions (Eq. 1) assumes that the presence of invertase leads directly to a yeast reproduction event. In reality invertase must first break down the sucrose into glucose, which the yeast must then absorb. We are therefore essentially assuming that the sucrose is abundant, its breakdown by invertase instantaneous, and the glucose absorption rapid and occurring in discrete packets, with each packet absorbed leading to a reproduction event.

In the well-mixed system, our analytic predictions indicate that stochastic selection reversal can occur only if the population is very small. Because this is an unrealistic assumption in the case of yeast cultures, we would predict that nonproducers should come to dominate a well-mixed population. In a spatially structured population, however, this constraint is relaxed because it requires only small interaction regions. For S. cerevisiae, we can obtain order of magnitude estimates for the majority of parameters in our model, including the rate of public good diffusion (SI Order of Magnitude Parameter Estimates and Table S1). Using these estimates together with our analytic results for the spatial public goods system, we find that stochastic selection reversal could feasibly be an important phenomenon for promoting the evolution of microbial public goods production in spatial settings (Fig. 3B). Given this finding, we now consider a spatial experiment on S. cerevisiae and ask how its results might be interpreted in light of the insights developed with our simple model.

Table S1.

List of experimental parameters obtained from the literature

Experimental parameter	Value	Description
$p_{mol}$	$0.46 mol \cdot s^{- 1}$	Production rate of a molecule of invertase per producing yeast cell (26)
δ	$2 \times 10^{- 3} mol \cdot s^{- 1}$	Estimated efficacy decay rate of invertase (ref. 49, figure 5)
$λ_{exp}$	$0.31 - 0.5 h^{- 1}$	Yeast reproduction rate in producing population (50, 51)
$ε_{exp}$	0.06	Cost of public good production to yeast reproduction rate (50)
$D_{exp}$	$100 {μ m}^{2} \cdot s^{- 1}$	Diffusion rate of invertase molecules estimated in ref. 41
$L_{c}$	$3 μ m$	Cell length physical approximation (41)


In ref. 33, $SUC2$ and $suc2$ were experimentally competed on an agar plate. It was found that nonproducing $suc2$ could not invade from rare ( $1\%$ of initial yeast population) and in fact decreased in frequency, becoming undetectable at long times (around 800 generations). This result suggests that in a spatial setting, invertase-producing $SUC2$ yeast are robust to invasions, which is in qualitative agreement with our theoretical predictions. The experiments yielded an additional result, the appearance of a hyperproducing mutant. This hyperproducing phenotype produced invertase at ∼1.5 times the rate of standard producers and existed at higher densities. The hyperproducer appeared to evolve naturally and establish robust colonies during the competition experiments between nonproducers and producers. However, when separate competition experiments were conducted between the hyperproducers and the producers, the hyperproducers failed to demonstrate any appreciable fitness advantage over the producers. This finding potentially suggests an optimal invertase production rate, whereby the hyperproducers managed to establish and grow during the $SUC2$ – $suc2$ competition experiments by exploiting nonproducing regions due to a relative fitness advantage, but could not invade regions of space occupied by producers. Interestingly, our model also predicts that an intermediate optimal production rate may exist, depending on how the cost of production scales with the production rate. Suppose a hyperproducer, U, produces at a rate $p_{u} = a_{p} p_{x},$ paying a metabolic cost $a_{b} ε$ to its birth rate, such that $b_{u} = b (1 - a_{b} ε) .$ The pairwise invasion probabilities of each phenotype can then be calculated (SI Pairwise Invasibility for Nonproducers, Producers, and Hyperproducers). We define the fitter phenotype in a pair as that with the larger invasion probability. The potential fitness rankings are investigated in Fig. 5 as a function of $p_{x}$ and ε (which we recall also alter $p_{u}$ and $b_{u}$ ). We draw particular attention to Fig. 5, Right, in which $a_{b} > a_{p} .$ In this scenario, the hyperproducers pay a disproportionate cost for their increased production rate compared with the producers. This can be interpreted as diminishing returns for production. In this case, there exist regions where the producer is the optimal phenotype (regions A and B, in purple and cyan, respectively). Specifically, region A displays a similar behavior to that observed in ref. 33, in which producers win out over both nonproducers and hyperproducers, but hyperproducers are more likely to invade nonproducing populations.

Fig. 5. — Plots of the pairwise invasibility scenarios possible for nonproducing (NP), producing (P), and hyperproducing (HP) phenotypes. Arrows point away from the dominant phenotype in a pair, which is defined as that with a larger invasion probability (Fig. S3). Nontransitive dynamics are not possible. It is possible, however, for an optimal intermediate good production rate to emerge (cyan and purple regions), if $a_{p} < a_{b} .$ In this scenario the hyperproducer receives diminishing good production as a function of cost to birth rate compared with the producer. (*Left*) $a_{b} = 1.3$ and $a_{p} = 1.5.$ (*Right*) $a_{b} = 3$ and $a_{p} = 1.5.$ Remaining parameters given in Table S2.

Generality of Results

We have shown that demographic stochasticity can reverse the direction of selection in a public good model. In this section we show that the mechanism responsible for this phenomenon is by no means particular to this model. We consider a general scenario, with a phenotype $X_{1},$ which is the focus of our study, and a number of discrete ecosystem constituents, $E_{i} .$ In the public good model for instance, we would label the public good itself as an ecosystem constituent; however, more generally this could be a food source, a predator, or anything else that interacts with the phenotypes. The state of the ecosystem influences the birth and death of the phenotype and in turn the presence of the phenotype influences the state of the ecosystem, altering the abundances of the constituents. We assume that the system lies at a unique, stable stationary state, precluding the possibility of periodic behavior. Suppose that a new phenotype, $X_{2},$ arises. We assume that the second phenotype is only slightly better at exploiting the ecosystem than $X_{1},$ although its influence on the ecosystem may be very different. For instance, in the public good model, nonproducers have a small birth rate advantage over producers, but do not produce the public good. Which phenotype is more likely to invade and fixate in a resident population of the opposite type?

The stochastic model for this system can be constructed in a similar manner to the public good model; the dynamics are described by a set of probability transition rates (analogous to Eq. 1). We restrict the transitions by specifying that although the two phenotypes compete, there is no reaction that instantaneously changes both of their numbers in the population. This final condition simply means that they should not, for instance, be able to mutate from one type to another during their lifetime or to prey on each other. A parameter R is introduced, to once again govern the typical scale of the system. The model is analyzed in the mesoscopic limit, by introducing the continuous phenotype $(x_{1}, x_{2})$ and ecosystem $(e_{i})$ variables as $(x_{1}, x_{2}, e) = (n_{x 1}, n_{x 2}, n_{e}) / R^{2}$ and applying the diffusion approximation. For large but finite R, the mesoscopic description takes the form

\begin{matrix} {\dot{x}}_{1} = x_{1} F^{(0)} (x, e) - ε x_{1} F^{(ε)} (x, e) + R^{- 1} η_{1} (t), \\ {\dot{x}}_{2} = x_{2} F^{(0)} (x, e) + R^{- 1} η_{2} (t), \\ {\dot{e}}_{i} = F_{i} (x, e) + R^{- 1} β_{i} (t), \forall i = 3, \dots J, \end{matrix}

[16]

where ε is small and governs selective pressure against $X_{1} .$ The assumption that there is no reaction that instantaneously changes the number of both phenotypes ensures that the correlation structure of the noise terms takes the form

\begin{matrix} 〈 η_{1} (t) η_{1} (t') 〉 = δ (t - t') x_{1} H^{(0)} (x, e), \\ 〈 η_{2} (t) η_{2} (t') 〉 = δ (t - t') x_{2} H^{(0)} (x, e), 〈 η_{1} (t) η_{2} (t') 〉 = 0, \end{matrix}

with ε taken to be of order $R^{- 2} .$ This assumption, made here to isolate the effect of varying carrying capacity from any other intraspecies dynamics, means that whereas the magnitude of fluctuations in the number of both phenotypes is dependent on the state of the system, $(x, e),$ the fluctuations themselves are not correlated with each other. Restrictions on the microscopic model that yield the above SDE description are addressed more thoroughly in SI Generality of Results. The form of Eq. 16 makes the nature of the system we describe more clear; it consists of two competing phenotypes, which reproduce according to replicator dynamics (1) with equal fitness at leading order in ε.

In the special case $ε = 0,$ both phenotypes are equally fit, regardless of their influence on the ecosystem variables $e_{i} .$ The degeneracy of the dynamics in $x_{1}$ and $x_{2}$ ensures the existence of a deterministic CM. We assume that the structure of $F^{(0)} (x)$ and $F_{i} (x)$ is such that the CM is one dimensional (there are no further degenerate ecosystem variables) and that it is the only stable state in the interior region $x_{i} > 0 .$ A separation of timescales is present if the system collapses to the CM much faster than the stochastic dynamics. In practical terms, the timescale of collapse can be inferred as the inverse of the nonzero eigenvalues of the system, linearized about the CM (34), whereas the timescale of fluctuations will be of order $R^{- 2}$ (35). When $ε > 0,$ the timescale elimination procedure can still be applied if $ε \approx O (R^{- 2}) .$ The effective one-dimensional description of the system now takes the form

{\dot{x}}_{1} = - ε D (x_{1}) + R^{- 2} S (x_{1}) + R^{- 1} ζ (t),

[17]

where the term $D (x_{1})$ is the deterministic contribution to the effective dynamics and $S (x_{1})$ is the stochastic contribution, whereas $ζ (t)$ is an effective noise term. The form these functions take is dependent on $F^{(0)} (x, e),$ $F^{(ε)} (x, e),$ and $F_{i} (x, e),$ as well as on the noise correlation structure, $H^{(0)} (x, e);$ however, it is independent of the structure of the demographic noise acting on the ecosystem variables (Eqs. S55, S56, and S66).

The core assumption we have made to derive Eq. 17 is essentially that the system’s ecological processes act on a faster timescale than its evolutionary processes. Even in this general setting, insights about the system’s stochastic dynamics can still be drawn (SI Generality of Results). If $ε = 0,$ the fixation probability of phenotype $X_{1}$ is independent of the initial conditions of the ecosystem variables $e .$ In fact, it is equal to the initial fraction of $X_{1}$ in the population, $n_{10} / (n_{10} + n_{20}) .$ The invasion probability of mutant $X_{1}$ phenotype fixating in a resident $X_{2}$ population, however, depends on the stationary state of the $X_{2}$ population; this stationary state defines the initial invasion conditions (the denominator for the fixation probability of $X_{1}$ ). Denoting by $N_{1}$ and $N_{2}$ the average numbers of phenotypes $X_{1}$ and $X_{2}$ in their respective stationary states, we find $ϕ_{1} = 1 / N_{2}$ and $ϕ_{2} = 1 / N_{1},$ generalizing Eq. 7. Therefore, for $ε = 0$ the phenotype that exists at higher densities is more likely to invade and fixate than its competitor, a consequence of its robustness to invasions. This result holds for any choice of finite R. In an ensemble of disconnected populations subject to repeated invasions, we would observe the emergence of high-density phenotypes if this phenotype does not carry a cost. Although this seems like a reasonable and indeed natural conclusion, it is one entirely absent from the deterministic analysis.

If $ε > 0,$ general results for the phenotype fixation probabilities cannot be obtained. However, if $N_{1} > N_{2},$ in the limit $ε \to 0$ we have shown that $ϕ_{1} > ϕ_{2} .$ From this result, it can be inferred that the term $S (x_{1})$ is positive on average along the slow manifold (Eq. S62). Therefore, if phenotype $X_{1}$ exists in isolation at higher densities than phenotype $X_{2},$ there will exist a stochastically induced pressure favoring the invasion of phenotype $X_{1} .$ Meanwhile, by construction we expect $D (x_{1})$ to be positive along the SM because phenotype $X_{1}$ exploits the ecosystem environment less effectively than phenotype $X_{2} .$ There is therefore a trade-off for competing phenotypes between increasing their phenotype population density and increasing their per capita growth rate. Note that the noise-induced selection function $S (x_{1})$ need not be strictly positive; indeed it may become negative along regions of the SM. This behavior potentially allows for stochastically induced “fixed points” along the SM, around which the system might remain for unusually large periods of time. This result may provide a theoretical understanding of the coexistence behavior observed in ref. 36.

The term $S (x_{1})$ is moderated by a factor $R^{- 2}$ (Eq. 17) or, more physically, by the typical size of the population. The stochastically induced selection for the high-density phenotype therefore becomes weaker as typical system sizes increase. The trade-off will be most crucial in small populations or, as illustrated in the public good model, in systems with a spatial component. If the phenotypes and ecosystem variables move sufficiently slowly in space, the results of Eqs. 13 and 14 can be imported, with the understanding that $ϕ_{1}$ and $ϕ_{2}$ must be calculated for the new model under consideration.
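As a short consistency check, the neutral-case invasion probabilities quoted earlier in this section, $ϕ_{1} = 1 / N_{2}$ and $ϕ_{2} = 1 / N_{1}$ for $ε = 0,$ can be substituted into the spatial condition of Eq. 14. This is only a sketch: it assumes that the patch fitness of Eq. 11 carries over unchanged to the general model, with the same dispersal rate m for both phenotypes, and the labels $W_{1},$ $W_{2},$ and $r_{1}$ are introduced here by analogy with Eqs. 11 and 12.

```latex
N_{1} ϕ_{1} > N_{2} ϕ_{2}
\iff \frac{N_{1}}{N_{2}} > \frac{N_{2}}{N_{1}}
\iff N_{1} > N_{2},
\qquad
r_{1} = \frac{W_{1}}{W_{2}} = \frac{N_{1} ϕ_{1}}{N_{2} ϕ_{2}} = \left( \frac{N_{1}}{N_{2}} \right)^{2} .
```

Under these assumptions the high-density phenotype is favored in the neutral case both within a patch and at the metapopulation level, and the patch fitness ratio entering Eq. 12 is the square of the ratio of carrying capacities.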

It is worth noting that the precise functional form of $ϕ_{1}$ and $ϕ_{2}$ identified in the deterministically neutral case ( $ε = 0$ ) is dependent on the assumption that phenotype noise fluctuations are uncorrelated. Although correlated fluctuations (for instance, resulting from mutual predation of the phenotypes) can still be addressed with similar methods to those used here, there is then the potential for the emergence of further noise-induced selection terms (SI Generality of Results). Careful specification of the phenotype interaction terms is therefore needed to determine to what degree these additional processes might amplify or dampen the induced selection we have identified.

Discussion

In this paper, we have shown that stochastic effects can profoundly alter the dynamics of systems of phenotypes that change the carrying capacity of the total population. Most strikingly, selection can act in the opposite direction from that of the deterministic prediction if the phenotype that is deterministically selected for also reduces the carrying capacity of the population. The methods used to analyze the models outlined in this paper are based on the removal of fast degrees of freedom (30). The conclusions drawn are therefore expected to remain valid as long as the phenotype population composition changes on a longer timescale than the remaining ecological processes.

By illustrating this phenomenon in the context of public good production, we have revealed a mechanism by which the dilemma of cooperation can be averted in a very natural way: by removing the unrealistic assumptions of fixed population size inherent in Moran-type game theoretic models. The potential for such behavior has been previously illustrated with the aid of a modified Moran model (17) and a single-variable Wright–Fisher-type model (18) that assumes discrete generations. However, we have shown that the mechanism can manifest more generally in multivariate continuous-time systems. Our analysis may also provide a mathematical insight into the related phenomenon of fluctuation-induced coexistence that has been observed in simulations of a similar public good model featuring exogenous additive noise (36): Such coexistence may rely on a similar conflict between noise-induced selection for producing phenotypes and deterministic selection against them.

For biologically reasonable public good production costs, selection reversal is observed only in systems that consist of a very small number of individuals. However, by building a metapopulation analog of the model to account for spatial structure, the range of parameters over which selection reversal is observed can be dramatically increased, as long as public good diffusion and phenotype dispersal between populations are not large. Two distinct mechanisms are responsible for these results. First, including spatial structure allows for small, local effective population sizes, even as the total size of the population increases. This facilitates the stochastic effects that lead to selection reversal. Second, because producer populations tend to exist at greater numbers (or higher local densities), they produce more migrants. The stochastic advantage received by producers is thus amplified, as they are not only more robust stochastically to invasions, but also more likely to produce invaders. Away from the low-dispersal, zero public good diffusion limit, the effect of selection reversal is diminished, but is still present across a range of biologically reasonable parameters. The analytical framework we have outlined may prove insightful for understanding the simulation results observed in ref. 37, where a similar metapopulation public good model was considered. In addition to fixation of producers (in the low dispersal–diffusion limit) and fixation of nonproducers (in the high dispersal–diffusion limit), ref. 37 observed an intermediate parameter range in which noise-induced coexistence was possible. Although our model does not feature such a regime, extending our mathematical analysis to their model would be an interesting area for future investigations. However, it must be noted that coexistence in a stochastic setting is inherently difficult to quantify analytically, as for infinite times some phenotype will always go extinct.

That space can aid the maintenance of cooperation is well known (38, 39). Generally, however, this is a result of spatial correlations between related phenotypes, so that cooperators are likely to be born neighboring other cooperators (and share the benefits of cooperation) whereas defectors can extract benefits only at the perimeter of a cooperating cluster. This is not what occurs in the model presented in this paper. Indeed, whereas we have assumed in our analytic derivation of the invasion probability that dispersal is small enough that each patch essentially contains a single phenotype, we find that the phenomenon of selection reversal manifests outside this limit (see Movie S2 in which a majority of patches contain a mix of producers and nonproducers). Instead, producing phenotypes have a selective advantage due to the correlation between the fraction of producers on a patch and the total number of individuals on a patch, which provides both resistance to invasions and an increased dispersal rate.

Most commonly in spatial game theoretic models of cooperation–defection, individuals are placed at discrete locations on a graph (40, 41). In contrast, by using a metapopulation modeling framework we have been able to capture the effect of local variations in phenotype densities across space, which is the driver of selection amplification in our model. Nevertheless, the question that remains is which modeling methodology is more biologically reasonable. The answer clearly depends on the biological situation. However, in terms of testability, our model makes certain distinct predictions. In ref. 41, producers and nonproducers were modeled as residing on nodes of a spatial network, with a public good diffusing between them. The investigation concludes that both lower public good diffusion and lower spatial dimensions (e.g., systems on a surface rather than in a volume) should encourage public good production, essentially by limiting the “surface area” of producing clusters. Whereas our investigation certainly predicts that lower public good diffusion is preferable, stochastic selection reversal does not require that the spatial dimension of the system is low. In fact, the result used in Eq. 12 holds for patches arranged on any regular graph (where each vertex has the same number of neighbors) and thus could be used to describe patches arranged on a cubic, or even hexagonal, lattice.

In our final investigation, we have shown that stochastic selection reversal is not an artifact of a specific model choice, but may be expected across a wide range of models. These models consist of two phenotypes, competing under weak deterministic selection strength, reproducing according to replicator dynamics and interacting with their environment. Thus, the phenomenon of selection reversal is very general; however, it depends strongly on how one specifies a selective gradient. We take one phenotype to have a stochastic selective advantage over the other if a single mutant is more likely to invade a resident population of the opposite type. Such a definition is also used in standard stochastic game theoretic models (4). A key difference here, however (where the population size is not fixed), is that the invasion probability is not specified by a unique initial condition; we must also specify the size of the resident population. We have assumed that the invading mutant encounters a resident population in its stationary state. This is by no means an unusual assumption; it is the natural analog of the initial conditions in a fixed population size model. Essentially it assumes a very large time between invasion or mutation events, an approach often taken in adaptive dynamics (42).

If instead we assumed a well-mixed system far from the steady state, our results would differ. For instance, suppose the system initially contains equal numbers of the two phenotypes. For the case when the two phenotypes have equal reproductive rates ( $ε = 0$ ), the phenotypes have equal fixation probability. For $ε > 0,$ the phenotype with the higher birth rate has the larger fixation probability, regardless of its influence on the system’s carrying capacity. This apparent contradiction with the results we developed in the body of this paper echoes the observations of $r - K$ selection theory (43): Selection for higher birth rates (r selection) acts on frequently disturbed systems that lie far from equilibrium, whereas selection for improved competitive interactions or carrying capacities (K selection) acts on rarely disturbed systems. In addition, $r - K$ selection theory suggests that K-selected species are typically larger in size and, as a consequence, consist of a lower number of individuals (19). This finding indicates a further parallel with our stochastic model framework, because selection for higher carrying capacities requires that the typical number of individuals (of both the low and high carrying capacity phenotypes) is small. Although the mechanism that leads us to these conclusions is distinct, our stochastic analysis provides a complementary view of $r - K$ -selection theory, which may be applicable to simple microorganisms. In exploring this analogous behavior further, future investigations may also benefit from considering the results of ref. 15, where it was shown that stochastically induced selection can change direction near carrying capacity.

Although we have implicitly developed our results in the low mutation limit, including mutation explicitly in the modeling framework is possible. This would be an interesting extension to the framework. In the well-mixed scenario, it is likely that the inclusion of mutation will complicate the intuition developed here: Although larger populations are more robust to invasions, they are also more prone to mutations, by virtue of their size. Whereas this result may be offset by the additional benefits garnered in the spatial analog of the model, a complex set of timescale-dependent behaviors is likely to emerge.

Finally, we propose a rigorous analytical investigation of existing models that conform to the framework we have outlined; an example is the work conducted in refs. 36 and 37, which we believe to be mathematically explainable within our formalism. In the context of induced selection, whereby deterministically neutral systems become nonneutral in the stochastic setting, similar ideas have already been extended to disease dynamics (16) and the evolution of dispersal (44, 45). The extension of selection reversal to such novel ecological models may provide further insight. Furthermore, this general scheme may be of relevance to many other systems in ecological and biological modeling, such as cancer, for which the evolution of phenotypes that profoundly alter the carrying capacity of a cell type can be of primary importance.

SI Obtaining the SDE System from the Microscopic Individual-Based Model

We begin with a model consisting of a discrete number of entities: two phenotypes of a species, X and Y, and a public good, Q. They interact according to the transitions

\begin{array}{l} X ⇄_{κ / R^{2}}^{b_{x}} X + X, Y + X \overset{κ / R^{2}}{\to} X, X + Q \overset{r / R^{2}}{\to} X + X + Q, X \overset{p_{x}}{\to} X + Q, \\ Y ⇄_{κ / R^{2}}^{b_{y}} Y + Y, Y + X \overset{κ / R^{2}}{\to} Y, Y + Q \overset{r / R^{2}}{\to} Y + Y + Q, Q \overset{δ}{\to} \emptyset . \end{array}

[S1]

The term $R^{- 2}$ occurs in all terms involving two reactants. It thus controls the interaction probability between instances of the phenotypes and the public good. Taking larger R decreases the interaction probability of phenotypes X and Y and the public good and allows the populations to grow to greater numerical abundances. The parameter R can thus be understood as a measure of the spatial scale of the system; when R is increased, the probability of interactions in the well-mixed system is decreased whereas the number of individuals the system can contain is increased.

Let us denote $n = (n_{x}, n_{y}, n_{q})$ the numbers of X, Y, and Q, respectively. Then the dynamics of this system can be described by the set of ODEs

\frac{d P (n, t)}{d t} = \sum_{n' \neq n} [T (n | n') P (n', t) - T (n' | n) P (n, t)],

[S2]

where $P (n, t)$ is the probability of the system being in state $n$ at time t, and $T (n' | n),$ the probability transition rate, is the probability per unit time of transitioning from state $n$ to $n' .$ Formally this is known as the master equation (47). Given the reactions in Eq. S1 the probability transition rates can be expressed as

\begin{array}{l} T_{1} (n_{x} + 1, n_{y}, n_{q} | n_{x}, n_{y}, n_{q}) = b_{x} n_{x} + \frac{r}{R^{2}} n_{x} n_{q}, \\ T_{2} (n_{x}, n_{y} + 1, n_{q} | n_{x}, n_{y}, n_{q}) = b_{y} n_{y} + \frac{r}{R^{2}} n_{y} n_{q}, \\ T_{3} (n_{x} - 1, n_{y}, n_{q} | n_{x}, n_{y}, n_{q}) = \frac{κ}{R^{2}} n_{x} (n_{x} + n_{y}), \\ T_{4} (n_{x}, n_{y} - 1, n_{q} | n_{x}, n_{y}, n_{q}) = \frac{κ}{R^{2}} n_{y} (n_{x} + n_{y}), \\ T_{5} (n_{x}, n_{y}, n_{q} + 1 | n_{x}, n_{y}, n_{q}) = p_{x} n_{x}, \\ T_{6} (n_{x}, n_{y}, n_{q} - 1 | n_{x}, n_{y}, n_{q}) = δ n_{q} . \end{array}

[S3]
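The transition rates in Eq. S3 are sufficient to simulate the individual-based model directly. The following is a minimal sketch of one step of a standard Gillespie-type simulation, not the authors' own code; the function names and the numerical values in `PARAMS` are illustrative assumptions, not values from the paper.

```python
import random

# Net change in (n_x, n_y, n_q) for transitions T1..T6 of Eq. S3, in order.
STATE_CHANGES = [(1, 0, 0), (0, 1, 0), (-1, 0, 0), (0, -1, 0), (0, 0, 1), (0, 0, -1)]


def transition_rates(nx, ny, nq, bx, by, r, kappa, px, delta, R):
    """Return [T1, ..., T6] of Eq. S3 evaluated at state (nx, ny, nq)."""
    R2 = R * R
    crowding = kappa * (nx + ny) / R2
    return [
        bx * nx + r * nx * nq / R2,  # T1: birth of X (intrinsic + public good)
        by * ny + r * ny * nq / R2,  # T2: birth of Y (intrinsic + public good)
        nx * crowding,               # T3: death of X through crowding
        ny * crowding,               # T4: death of Y through crowding
        px * nx,                     # T5: production of Q by X
        delta * nq,                  # T6: decay of Q
    ]


def gillespie_step(state, t, rng, params):
    """Advance the state by one reaction; returns (new_state, new_time)."""
    rates = transition_rates(*state, **params)
    total = sum(rates)
    if total <= 0.0:
        return state, float("inf")  # no reaction can fire (empty system)
    waiting_time = rng.expovariate(total)
    # Pick a reaction with probability proportional to its rate.
    index = rng.choices(range(len(rates)), weights=rates)[0]
    new_state = tuple(n + d for n, d in zip(state, STATE_CHANGES[index]))
    return new_state, t + waiting_time


# Illustrative parameter values only (hypothetical, chosen so kappa*delta > r*px).
PARAMS = dict(bx=1.0, by=1.0, r=1.0, kappa=1.0, px=0.5, delta=1.0, R=10.0)

if __name__ == "__main__":
    rng = random.Random(1)
    state, t = (50, 50, 25), 0.0
    for _ in range(1000):
        state, t = gillespie_step(state, t, rng, PARAMS)
    print(state, t)
```

Repeating `gillespie_step` until `nx` or `ny` reaches zero gives one realization of a fixation event; averaging over many realizations estimates a fixation probability.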

Let us now make a change of variables into the scaled expressions $x = (x, y, q) = (n_{x}, n_{y}, n_{q}) / R^{2} .$ Substituting the probability transition rates into Eq. S2, we find recurrent factors of $1 / R^{2}$ appearing in the resulting expression. These terms are associated with the local transitions from state $n$ to the surrounding states. If $R^{2}$ is sufficiently large, the population grows larger (as the crowding terms in Eq. S1 grow small). We may then Taylor expand Eq. S2 in $R^{- 2},$ assuming that the variables $(x, y, q)$ are approximately continuous (28). Truncating at second order in $R^{- 2}$ (that is, retaining terms up to $R^{- 4}$), we arrive at a PDE for $p (x, y, q, t)$ of the form

\frac{\partial p (x, t)}{\partial t} = - \frac{1}{R^{2}} \sum_{i} \frac{\partial}{\partial x_{i}} [A_{i} (x) p (x, t)] + \frac{1}{2 R^{4}} \sum_{i, j} \frac{\partial^{2}}{\partial x_{i} \partial x_{j}} [B_{i j} (x) p (x, t)], x = (x_{1}, x_{2}, x_{3}) \equiv (x, y, q) .

[S4]

This is a diffusion approximation in a population genetics context (22), but more generally is akin to the Kramers–Moyal expansion (28) or a nonlinear analog of the van Kampen expansion (47). The forms of $A (x)$ and $B (x),$ given transition rates in Eq. S3, are found to be

A_{x} (x) = x (b_{x} + r q - κ (x + y)), A_{y} (x) = y (b_{y} + r q - κ (x + y)), A_{q} (x) = p_{x} x - δ q,

[S5]

and

\begin{array}{l} B_{x x} (x) = x (b_{x} + r q + κ x + κ y), B_{y y} (x) = y (b_{y} + r q + κ x + κ y), \\ B_{q q} (x) = p_{x} x + δ q, B_{i j} = 0 \forall i \neq j . \end{array}

[S6]

Further, it can be shown that the above PDE is equivalent to the set of Itō SDEs (48)

\frac{d x}{d τ} = A (x) + \frac{1}{R} η (τ),

[S7]

where $τ = t / R^{2}$ and $η (τ)$ are Gaussian white noise terms with zero mean and correlations

〈 η_{i} (τ) η_{j} (τ') 〉 = δ (τ - τ') B_{i j} (x) .

[S8]

Note that the correlations are multiplicative and thus dependent on the state of the system.
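As an illustration of how Eqs. S5 to S8 fit together, the sketch below integrates the SDE system with a simple Euler–Maruyama scheme. This is one common discretization of an Itō SDE and is an assumption on our part rather than the method used in the paper; the function names, the clamping of negative values to zero, and the parameter values used to call it are likewise hypothetical.

```python
import math
import random


def drift(x, y, q, bx, by, r, kappa, px, delta):
    """Deterministic part A(x) of Eq. S5."""
    crowding = kappa * (x + y)
    return (x * (bx + r * q - crowding),
            y * (by + r * q - crowding),
            px * x - delta * q)


def noise_strength(x, y, q, bx, by, r, kappa, px, delta):
    """Diagonal entries (B_xx, B_yy, B_qq) of Eq. S6; off-diagonals vanish."""
    crowding = kappa * (x + y)
    return (x * (bx + r * q + crowding),
            y * (by + r * q + crowding),
            px * x + delta * q)


def euler_maruyama_step(state, dtau, R, rng, params):
    """One Euler-Maruyama step of Eq. S7 with noise correlations from Eq. S8."""
    a = drift(*state, **params)
    b = noise_strength(*state, **params)
    new_state = []
    for value, a_i, b_i in zip(state, a, b):
        # Gaussian increment with variance B_ii * dtau / R^2 (multiplicative noise).
        kick = rng.gauss(0.0, 1.0) * math.sqrt(max(b_i, 0.0) * dtau) / R
        # Clamping at zero is an ad hoc assumption to keep densities nonnegative.
        new_state.append(max(value + a_i * dtau + kick, 0.0))
    return tuple(new_state)
```

Because the noise amplitude is evaluated at the start of each step, the scheme is consistent with the Itō interpretation stated above.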

SI Obtaining a One-Dimensional Effective Public Good Model

In this section we seek to identify and remove the fast modes of the SDE system (Eq. S7) and thus obtain an effective one-dimensional description of the dynamics. We make use of methods of fast-mode elimination described in ref. 30. First, we note that the deterministic nullcline for q is given by

q = \frac{p_{x} x}{δ} \equiv Z_{q} (x, y) .

[S9]

Therefore, if the production and decay of public good occur much faster than the processes associated with the phenotypes, we would expect the public good to quickly attain this value, after which its dynamics would be slaved to those of x and y. Note that deterministically, substituting Eq. S9 into Eq. S5 recovers a Lotka–Volterra competition model for two competing species.

To make further analytic progress, we begin by considering the quasi-neutral limit in which $b_{x} = b_{y} \equiv b .$ Under these conditions, the deterministic system exhibits a CM given by Eq. S9 and

y = \frac{[b δ - (δ κ - r p_{x}) x]}{δ κ} \equiv Z_{y} (x) .

[S10]

The CM is stable for $κ δ > r p_{x},$ and we assume that this condition holds throughout this paper. Calculating the intersection of the CM with the boundaries $y = 0$ and $x = 0$ allows us to determine the mean population size in the quasi-neutral ( $ε = 0$ ) limit when it consists of only producers and nonproducers, respectively:

N_{x}^{(0)} = R^{2} K_{x}^{(0)}, K_{x}^{(0)} = (\frac{b δ}{δ κ - r p_{x}}),

[S11]

N_{y}^{(0)} = R^{2} K_{y}^{(0)}, K_{y}^{(0)} = \frac{b}{κ} .

[S12]

These parameters will be useful in the following analysis.

Deterministically, the system comes to rest on a point along the CM (defined by Eqs. S9 and S10), which depends on the system’s initial conditions. When stochasticity is included, the CM ceases to exist in any true sense. However, when the noise is small [already assumed in the derivation of SDEs (Eq. S7)], we can say that far from the CM, we expect the dynamics to be dominated by the deterministic collapse to the CM, whereas in the vicinity of the CM, we expect noise to play a more important role, driving the slow change in population composition until one or the other of the phenotypes fixates. We wish to exploit this timescale separation and obtain an effective description of the dynamics in terms of a single variable.

To begin, we note that the stochastic dynamics along the CM have two components. First, noise can move the system neutrally along the CM. Second, noise can take the system off the CM, at which point we expect the deterministic component of the dynamics to become more prevalent, driving the system back to the CM. To capture the effect of both of these processes on the effective dynamics along the CM, we implement a nonlinear projection of the stochastic system to the CM. Essentially this assumes that fluctuations that take the system away from the manifold are instantaneously mapped along deterministic trajectories back to the CM. To formalize this, the mapping $z = f (x, y, q)$ is introduced, where $f (x, Z_{y} (x), Z_{q} (x)) = x;$ that is, z gives the position on the CM, parameterized by x, which intersects a deterministic trajectory beginning at $(x, y, q) .$ The mapping can be determined analytically from the observation that the quantity $x / y$ in Eq. S7 is invariant in this quasi-neutral ( $b_{x} = b_{y}$ ) scenario. Therefore

\frac{z}{Z_{y} (z)} = \frac{x}{y}, z = \frac{b δ x}{(δ κ - p_{x} r) x + δ κ y} .

[S13]
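The projection in Eq. S13 can be checked numerically. The sketch below (function names and parameter values are illustrative assumptions) implements the CM from Eqs. S9 and S10 together with the mapping $z = f (x, y, q),$ and confirms two properties stated in the text: points already on the CM map to themselves, and the ratio $x / y$ is preserved by the mapping.

```python
def Z_q(x, px, delta):
    """Public good density on the CM, Eq. S9."""
    return px * x / delta


def Z_y(x, b, kappa, r, px, delta):
    """Nonproducer density on the CM, Eq. S10."""
    return (b * delta - (delta * kappa - r * px) * x) / (delta * kappa)


def project_to_cm(x, y, b, kappa, r, px, delta):
    """Eq. S13: producer density z on the CM reached from (x, y, q).

    The public good density q does not enter the mapping.
    """
    return b * delta * x / ((delta * kappa - px * r) * x + delta * kappa * y)


# Hypothetical parameter values satisfying kappa * delta > r * px.
params = dict(b=1.0, kappa=1.0, r=1.0, px=0.5, delta=1.0)

x_on = 0.8
z_same = project_to_cm(x_on, Z_y(x_on, **params), **params)  # stays at x_on

x_off, y_off = 0.8, 0.3                                       # a point off the CM
z_off = project_to_cm(x_off, y_off, **params)
ratio_after = z_off / Z_y(z_off, **params)                    # equals x_off / y_off
```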

The effective dynamics for z can now be straightforwardly calculated by differentiating Eq. S13 with respect to t. We must note, however, that because the original SDE system is defined in the Itō sense, the normal rules of calculus no longer apply. Applying Itō’s rules of calculus appropriately (30, 47), we find that the effective dynamics along the CM take the form

\dot{z} = \frac{1}{R^{2}} S (z) + \frac{1}{R} ζ (t),

[S14]

where

S (z) = {\frac{1}{2} (\frac{\partial^{2} z}{\partial x^{2}} B_{x x} (x) + \frac{\partial^{2} z}{\partial y^{2}} B_{y y} (x)) |}_{x = z, y = Z_{y} (z), q = Z_{q} (z)},

[S15]

\begin{array}{l} = \frac{2 p_{x} r}{δ} z {1 + \frac{1}{b^{2} δ^{2}} z [b δ (2 p_{x} r - δ κ) + p_{x} r (p_{x} r - δ κ) z]}, \\ = 2 b (\frac{K_{x}^{(0)} - K_{y}^{(0)}}{{(K_{x}^{(0)})}^{3} {(K_{y}^{(0)})}^{2}}) z (K_{x}^{(0)} - z) [K_{x}^{(0)} K_{y}^{(0)} + (K_{x}^{(0)} - K_{y}^{(0)}) z], \end{array}

[S16]

and

〈 ζ (t) 〉 = 0, 〈 ζ (t) ζ (t') 〉 = δ (t - t') ℬ (z),

with

\begin{matrix} ℬ (z) = {({[\frac{\partial z}{\partial x}]}^{2} B_{x x} (x) + {[\frac{\partial z}{\partial y}]}^{2} B_{y y} (x)) |}_{x = z, y = Z_{y} (z), q = Z_{q} (z)}, \\ = 2 z {b + \frac{1}{b^{2} δ^{3}} z [b^{2} δ^{2} (3 p_{x} r - δ κ) + b p_{x} r δ (3 p_{x} r - 2 δ κ) z + p_{x}^{2} r^{2} (p_{x} r - δ κ) z^{2}]}, \end{matrix}

[S17]

= 2 b (\frac{1}{{(K_{x}^{(0)})}^{3} {(K_{y}^{(0)})}^{2}}) z (K_{x}^{(0)} - z) {[K_{x}^{(0)} K_{y}^{(0)} + (K_{x}^{(0)} - K_{y}^{(0)}) z]}^{2} .

[S18]
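The two forms given for $S (z)$ in Eq. S16, and for $ℬ (z)$ in Eqs. S17 and S18, can be compared numerically as a consistency check. In the sketch below the function names and parameter values are illustrative assumptions, and $K_{y}^{(0)}$ is taken to be $b / κ,$ the value of $Z_{y}$ at $x = 0$ in Eq. S10 (the nonproducer makes no public good).

```python
def carrying_capacities(b, kappa, r, px, delta):
    """Scaled quasi-neutral carrying capacities K_x^(0) and K_y^(0)."""
    Kx = b * delta / (delta * kappa - r * px)  # Eq. S11
    Ky = b / kappa                             # CM (Eq. S10) evaluated at x = 0
    return Kx, Ky


def S_from_parameters(z, b, kappa, r, px, delta):
    """First form of S(z) in Eq. S16."""
    a, c = px * r, delta * kappa
    bracket = b * delta * (2 * a - c) + a * (a - c) * z
    return (2 * a / delta) * z * (1 + z * bracket / (b * delta) ** 2)


def S_from_capacities(z, b, Kx, Ky):
    """Second form of S(z) in Eq. S16."""
    prefactor = 2 * b * (Kx - Ky) / (Kx ** 3 * Ky ** 2)
    return prefactor * z * (Kx - z) * (Kx * Ky + (Kx - Ky) * z)


def B_from_parameters(z, b, kappa, r, px, delta):
    """Second line of Eq. S17."""
    a, c = px * r, delta * kappa
    poly = ((b * delta) ** 2 * (3 * a - c)
            + b * a * delta * (3 * a - 2 * c) * z
            + a * a * (a - c) * z * z)
    return 2 * z * (b + z * poly / (b ** 2 * delta ** 3))


def B_from_capacities(z, b, Kx, Ky):
    """Eq. S18."""
    prefactor = 2 * b / (Kx ** 3 * Ky ** 2)
    return prefactor * z * (Kx - z) * (Kx * Ky + (Kx - Ky) * z) ** 2


# Hypothetical parameter values with kappa * delta > r * px.
pars = dict(b=1.2, kappa=0.9, r=0.7, px=0.4, delta=1.5)
Kx, Ky = carrying_capacities(**pars)
```

Both forms vanish at $z = 0$ and $z = K_{x}^{(0)},$ which is why these points act as absorbing boundaries of the reduced dynamics.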

Note that because the mapping in Eq. S13 is independent of q, neither Eq. S15 nor Eq. S17 depends on the noise correlations in q.

Whereas the deterministic system features no dynamics along the CM, the effective SDE (Eq. S14) does feature a drift in the mean state, embodied by $S (z) .$ Understanding the origin of this induced drift term requires considering the following. We envisage fluctuations arising from a single point on the CM, $x^{(0)},$ which take the system to a point off the CM, $x^{(1)}$ (Fig. S1). The point $x^{(1)}$ is clearly stochastic, but its distribution is approximately Gaussian, with a variance defined by $B (x^{(0)}) .$ The fluctuation is now mapped back along a deterministic trajectory to a point $x^{(2)}$ on the CM. The location $x^{(2)}$ is also stochastic [dependent as it is on $x^{(1)}$ ] and has its own distribution. The presence of the term $S (z)$ in Eq. S14 is indicative of the fact that the mean of the distribution of $x^{(2)}$ is not $x^{(0)};$ fluctuation events on average are mapped back to the CM with a preferred direction, inducing drift along the CM. Note that $S (z)$ is positive along the length of the CM, which is defined on the interval $[0, K_{x}^{(0)}] .$

We now turn our attention to the case when $ε > 0 .$ As long as ε is small, a separation of timescales is still present, although now no CM exists. Instead there is a SM, to which the deterministic system quickly relaxes, before slowly moving along it until phenotype Y fixates. The equations for the population size at the boundaries of the SM are formally given by

N_{x} = R^{2} K_{x}, K_{x} = (\frac{b_{x} δ}{δ κ - r p_{x}}) \equiv K_{x}^{(0)} + O (ε),

[S19]

N_{y} = R^{2} K_{y}, K_{y} = \frac{b_{y}}{κ} \equiv K_{y}^{(0)} + O (ε) .

[S20]

To proceed with the stochastic calculation, we assume $ε \approx O (R^{- 2})$ and work order by order in $R^{- 1} .$ At leading order, the equation for the SM is identical to that for the CM, Eqs. S9 and S10. The mapping to the SM is also unchanged at leading order from the quasi-neutral case (Eq. S13). We proceed as before to obtain an effective description of the system dynamics in terms of z (30), now obtaining the dynamics

\dot{z} = - ε D (z) + \frac{1}{R^{2}} S (z) + \frac{1}{R} ζ (t),

[S21]

where

\begin{matrix} D (z) = - {(\frac{d z}{d x} A_{x} (x) + \frac{d z}{d y} A_{y} (x)) |}_{x = z, y = Z_{y} (z), q = Z_{q} (z)}, \\ = b z [1 - (\frac{δ κ - p_{x} r}{b δ}) z], \\ = \frac{b}{K_{x}^{(0)}} z (K_{x}^{(0)} - z), \end{matrix}

[S22]

and $S (z)$ and $ζ (t)$ retain their form from the quasi-neutral case, Eqs. S16 and S18. The function $D (z)$ is the deterministic contribution to the dynamics along the SM. This expression is what would be obtained using standard fast variable elimination techniques on the deterministic system. From Eq. S22, we can see that $D (z)$ is positive along the length of the SM and therefore acts (as we would expect) to increase the selective advantage of the nonproducers, phenotype Y. There is therefore a conflict between the two components of the drift in the system. The term $D (z)$ works against producers along the length of the SM, whereas $S (z)$ creates a selective pressure in favor of producers. Ultimately, which term is more prevalent is dependent on the parameters ε and R (Eq. S21); small R leads to a small population size in which stochastic effects are stronger, and so producers are more likely to be selected for. In contrast, when the deterministic cost for good production is increased, the nonproducers have an increased advantage over producers.

Adopting the notation used in the main text, in which we set $z = x$ (which is valid on the CM and SM at leading order), the expression for the SDE (Eq. S21) can alternatively be written

\dot{x} = \frac{b}{K_{x}^{(0)}} x (K_{x}^{(0)} - x) (\frac{1}{R^{2}} ℱ (x) - ε) + \frac{1}{R} ζ (t),

[S23]

where

ℱ (x) = 2 (\frac{K_{x}^{(0)} - K_{y}^{(0)}}{{(K_{x}^{(0)})}^{2} {(K_{y}^{(0)})}^{2}}) [K_{x}^{(0)} K_{y}^{(0)} + (K_{x}^{(0)} - K_{y}^{(0)}) x] .

[S24]

SI Probability of Fixation for the Reduced Public Good Model

The fixation probability for a phenotype in a single variable system can be calculated using standard methods (28). To conduct the calculation, we need expressions for the absorbing boundaries of the problem. For the reduced system given in Eq. S21, these lie at $z = 0$ and $z = K_{x}^{(0)} .$ The fact that the boundary for the problem exists at $z = K_{x}^{(0)},$ rather than at $z = K_{x},$ is a consequence of the order to which we are working in ε. At this order the SM is approximated by the expression for the CM, which intersects the absorbing boundaries $x = 0$ and $y = 0$ at $z = 0$ and $z = K_{x}^{(0)},$ respectively. Denoting $Q (z_{0})$ the fixation probability of producing phenotype X given an initial frequency $z_{0}$ on the CM/SM, the fixation probability can conveniently be expressed as

Q (z_{0}) = \frac{\int_{z = 0}^{z_{0}} ψ (z) d z}{\int_{z = 0}^{K_{x}^{(0)}} ψ (z) d z}, ψ (z) = exp [- \int_{0}^{z} \frac{2 (- ε R^{2} D (z') + S (z'))}{ℬ (z')} d z'] .

[S25]

Substituting for $D (z),$ $S (z),$ and $ℬ (z)$ from Eqs. S22, S16, and S18, we find

Q (z_{0}) = \frac{1 - G (z_{0})}{1 - G (K_{x}^{(0)})}, G (z_{0}) = exp [\frac{ε N_{y}^{(0)} K_{x}^{(0)} z_{0}}{K_{x}^{(0)} K_{y}^{(0)} + (K_{x}^{(0)} - K_{y}^{(0)}) z_{0}}] .

[S26]

The nature of these expressions can be understood more intuitively if we move from considering the initial frequency of X on the CM, $z_{0} = n_{x 0} / R^{2},$ to considering the initial fraction of phenotype X on the CM, $f_{z 0} .$ The fraction and number of phenotype X on the CM are related by

f_{z 0} = \frac{z}{z + Z_{y} (z)}, \to z = \frac{K_{x}^{(0)} K_{y}^{(0)} f_{z 0}}{K_{x}^{(0)} - (K_{x}^{(0)} - K_{y}^{(0)}) f_{z 0}} .

[S27]

Substituting this into Eq. S26, we find

Q (f_{z 0}) = \frac{1 - exp [ε N_{y}^{(0)} f_{z 0}]}{1 - exp [ε N_{y}^{(0)}]}, {Q (f_{z 0}) |}_{ε = 0} = f_{z 0} .

[S28]

On first appraisal, the fixation probabilities in Eq. S28 appear to share the form of the well-mixed Moran model with weak selection. There is, however, one crucial distinction: The relation between $f_{z 0}$ and $(x_{0}, y_{0}, q_{0})$ is dependent on the form of the CM/SM and is not necessarily symmetric under the interchange of X and Y. For instance, let us consider the quasi-neutral case ( $ε = 0$ ) with the population initially consisting of a mutant X in a population of the Y phenotype in its stationary state. Then $f_{z 0} = 1 / N_{y} .$ In contrast, if the mutant is of phenotype Y, and the resident population consists of phenotype X in the stationary state, $f_{z 0} = 1 - 1 / N_{x} .$ Because $N_{x}$ and $N_{y}$ are distinct, these frequencies are not the same, and Eq. S28 is not symmetric under the interchange of phenotypes, undermining its apparent similarities to the Moran model.
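The asymmetry described above can be made concrete with a short calculation. The sketch below evaluates Eq. S28 for a single mutant of each phenotype, using the initial fractions $1 / N_{y}$ and $1 - 1 / N_{x}$ from the text; the function names and the population sizes used in the example are illustrative assumptions.

```python
import math


def fixation_probability(f0, eps, Ny0):
    """Eq. S28: probability that producers X fixate from initial fraction f0."""
    s = eps * Ny0
    if s == 0.0:
        return f0  # quasi-neutral limit
    # expm1 keeps precision when s is small; very large s would overflow.
    return math.expm1(s * f0) / math.expm1(s)


def invasion_probabilities(eps, Nx, Ny0):
    """Invasion probability of one X mutant in Y, and of one Y mutant in X."""
    phi_x = fixation_probability(1.0 / Ny0, eps, Ny0)
    # Y fixates whenever X does not, starting from X fraction 1 - 1/Nx.
    phi_y = 1.0 - fixation_probability(1.0 - 1.0 / Nx, eps, Ny0)
    return phi_x, phi_y


# Hypothetical sizes: producers alone reach twice the nonproducer population.
neutral = invasion_probabilities(0.0, Nx=200.0, Ny0=100.0)
small_cost = invasion_probabilities(0.001, Nx=200.0, Ny0=100.0)
large_cost = invasion_probabilities(0.05, Nx=200.0, Ny0=100.0)
```

With these assumed values, producers have the larger invasion probability in the quasi-neutral and small-cost cases, and nonproducers have the larger one when the cost is large.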

In this section a crucial aspect of the selection reversal has been elucidated. The selection reversal along the SM is a result of the differing densities at which the populations of X and Y phenotypes reside in isolation. In a deterministic system, we would define the fitter phenotype as the one that fixates at long times. In a stochastic Moran-type model, the fitter phenotype is defined as that with the greater invasion probability. Because Moran-type models feature a constant population size, N, the invasion probability of a mutant phenotype is defined by a unique initial condition, a single mutant, and $N - 1$ residents. In systems such as the public good model discussed in this paper, the invasion probability is no longer defined uniquely by the specification of a single invading mutant; we must also define the size of the resident phenotype population and the public good density. If the system has been allowed to relax to a stationary state before the mutant is introduced, then selection reversal along the CM may be present, and it is possible for the producing phenotype to have a larger fixation probability than the nonproducing phenotype. Thus, the producing phenotype may be fitter.

SI Pairwise Invasibility for Nonproducers, Producers, and Hyperproducers

In this section we explore the pairwise invasibility of three separate phenotypes, nonproducers, producers, and hyperproducers. We begin by noting that, under the assumption that the birth rates differ by only a small amount from phenotype to phenotype, the invasion probability of phenotype i in a resident population j, $ϕ_{i | j},$ can be expressed as

ϕ_{i | j} = \frac{1 - exp [(b_{i} - b_{j}) / (κ N_{j}^{(0)})]}{1 - exp [(b_{i} - b_{j}) R^{2} / κ]} .

[S29]

We therefore define phenotype i as fitter than phenotype j if $ϕ_{i | j} > ϕ_{j | i} .$ Let us now explicitly express the birth rates of each of the phenotypes as

Nonproducer : b_{y} = b, Producer : b_{x} = b (1 - ε), Hyperproducer : b_{u} = b (1 - a_{b} ε) .

[S30]

We now wish to obtain an expression for the critical costs to birth rate ε at which producers are fitter than nonproducers, hyperproducers are fitter than nonproducers, and hyperproducers are fitter than producers. To do this we must solve $ϕ_{i | j} = ϕ_{j | i}$ for ε for each pair of phenotypes. An analytic solution is available if we set $ε = \tilde{ε} R^{- 2}$ with $\tilde{ε}$ of order one and Taylor expand in $R^{- 2} .$ Truncating at first order, we find that the critical cost for species i to be fitter than species j, $ε_{i | j}$ is given by

ε_{i | j} = \frac{κ \log [(p_{i} r - δ κ) / (p_{j} r - δ κ)]}{[(b_{i} - b_{j}) / ε] R^{2}} .

[S31]

We note that this provides eight different possible scenarios of fitness ranking, described in Fig. S3. Substituting in our equations for the birth rates, Eq. S30, these expressions become

ε_{x | y} = \frac{κ}{b R^{2}} log [- \frac{δ κ}{p_{x} r - δ κ}],

[S32]

ε_{u | y} = \frac{κ}{a_{b} b R^{2}} log [- \frac{δ κ}{p_{u} r - δ κ}],

[S33]

ε_{u | x} = \frac{κ}{(a_{b} - 1) b R^{2}} log [\frac{p_{x} r - δ κ}{p_{u} r - δ κ}] .

[S34]
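Eqs. S32 to S34 are special cases of Eq. S31 once the birth rates of Eq. S30 are inserted. The sketch below encodes Eq. S31 for birth rates of the form $b (1 - a ε),$ with $a = 0$ for nonproducers, $a = 1$ for producers, and $a = a_{b}$ for hyperproducers. The function name and all numerical values are illustrative assumptions.

```python
import math


def critical_cost(p_i, p_j, a_i, a_j, b, kappa, r, delta, R):
    """Eq. S31 for birth rates b_k = b * (1 - a_k * eps).

    p_i, p_j are production rates; a_i, a_j are cost multipliers.
    """
    slope = -b * (a_i - a_j)  # (b_i - b_j) / eps
    ratio = (p_i * r - delta * kappa) / (p_j * r - delta * kappa)
    return kappa * math.log(ratio) / (slope * R * R)


# Hypothetical parameter values with kappa * delta > r * p_u > r * p_x.
b, kappa, r, delta, R = 1.0, 1.0, 1.0, 1.0, 10.0
p_x, a_b, a_p = 0.1, 2.0, 1.5
p_u = a_p * p_x  # Eq. S35

eps_xy = critical_cost(p_x, 0.0, 1.0, 0.0, b, kappa, r, delta, R)   # Eq. S32
eps_uy = critical_cost(p_u, 0.0, a_b, 0.0, b, kappa, r, delta, R)   # Eq. S33
eps_ux = critical_cost(p_u, p_x, a_b, 1.0, b, kappa, r, delta, R)   # Eq. S34
```

Comparing a given cost ε with these three thresholds determines which of the pairwise rankings applies for that parameter set.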

Clearly the exact scenarios that emerge for a given set of parameters depend on the relationship between $p_{x}$ and $p_{u} .$ We make the assumption

p_{u} = a_{p} p_{x} .

[S35]

For $a_{p} > a_{b},$ the hyperproducer pays a discounted cost to its birth rate for its additional good production. In this situation, only scenarios C–F are possible in Fig. S3. It is always better to be a hyperproducer or a nonproducer, depending on the production rate $p_{x}$ and ε. This “all or nothing” result makes intuitive sense; if the hyperproducer produces much more than the producer, but pays only fractionally more to its birth rate, any region in which production is favored will be disproportionately advantageous to the hyperproducers. In contrast, if $a_{p} < a_{b},$ the hyperproducer receives decreasing production returns as a function of the cost it pays to birth in comparison with the producer. In this case, scenarios in Fig. S3 A and B and Fig. S3 E and F are possible. Either producers or nonproducers are favored, and hyperproducers are never favored.

Fig. S3. — (A–H) Eight different fitness rankings are possible based on the pairwise invasibility probabilities of nonproducers, producers, and hyperproducers. (A) Producers have a larger invasion probability than both hyperproducers and nonproducers, while hyperproducers have a larger invasion probability than nonproducers. (B) Producers have a larger invasion probability than both hyperproducers and nonproducers, while nonproducers have a larger invasion probability than hyperproducers. (C) Hyperproducers have a larger invasion probability than both producers and nonproducers, while nonproducers have a larger invasion probability than producers. (D) Hyperproducers have a larger invasion probability than both producers and nonproducers, while producers have a larger invasion probability than nonproducers. (E) Nonproducers have a larger invasion probability than both producers and hyperproducers, while hyperproducers have a larger invasion probability than producers. (F) Nonproducers have a larger invasion probability than both producers and hyperproducers, while producers have a larger invasion probability than hyperproducers. (G) Producers have a larger invasion probability than hyperproducers. Hyperproducers have a larger invasion probability than nonproducers. Nonproducers have a larger invasion probability than producers. (H) Producers have a larger invasion probability than nonproducers. Nonproducers have a larger invasion probability than hyperproducers. Hyperproducers have a larger invasion probability than producers. The nontransitive dynamics of G and H are not seen in the public good model.

SI Generality of Results

We begin by specifying in a very general way the dynamics of an arbitrary individual based model (IBM) with m distinct types of constituent, fully described by a set of u reaction rates. The model can be expressed in chemical reaction notation as

\sum_{i = 1}^{m} a_{μ i} X_{i} \overset{r_{μ}}{\to} \sum_{i = 1}^{m} b_{μ i} X_{i}, \forall μ = 1, \dots u,

[S36]

where $a_{μ i}$ and $b_{μ i}$ respectively specify the reactants and products of the $μ th$ reaction, and $r_{μ}$ are the reaction rate constants (for example, Eq. S1). The stoichiometric matrix is defined by $ν_{i μ} = b_{μ i} - a_{μ i},$ whose elements give the change in number of the $i th$ species due to the $μ th$ reaction. Together with the rate constants $r_{μ},$ the stoichiometric matrix allows us to express the transition rates

T_{μ} (n + ν_{μ} | n) = r_{μ} \prod_{i = 1}^{m} a_{μ i} \frac{n_{i}}{R^{2}},

[S37]

where $R^{2}$ once again is a controlled measure of how often constituents interact (Eq. S3). In the well-mixed model, it therefore directly controls the typical area of the system. Together with the master equation [S2], the full stochastic dynamics are specified.

With a general notation in hand, we now define the specific type of system we will analyze. We consider a system consisting of two phenotypes, $X_{1}$ and $X_{2},$ that interact with a set of discrete ecosystem variables $X_{i},$ for $i = 3, \dots, N .$ The state of the system at any time is given by the number of each phenotype and ecosystem constituent $n = (n_{1}, n_{2}, n_{3}, \dots, n_{N}) .$ The situation we envisage is as follows: Whereas the interplay between the phenotypes and the ecosystem is relevant for the dynamics, we are primarily interested in the evolutionary dynamics and outcome of competition between the two phenotypes. We make the following assumptions on their dynamics:

i)
Each phenotype birth and death event is proportional to the number of that phenotype:
$If ν_{1 μ} \neq 0, then a_{μ 1} > 0, and if ν_{2 μ} \neq 0, then a_{μ 2} > 0 .$ [S38]
ii)
The phenotypes are very similar in their utilization of the ecosystem. For each $μ th$ reaction that changes the frequency of $X_{1},$ there therefore exists a similar reaction $μ'$ that changes the frequency of $X_{2}$ such that
$ν_{1 μ} r_{μ} = ν_{2 μ'} (r_{μ'} + O (ε)) .$ [S39]
iii)
There is no reaction that simultaneously changes the frequencies of the phenotypes (i.e., no cannibalization or simultaneous killing):
$ν_{1 μ} ν_{2 μ} = 0 \forall μ .$ [S40]

The phenotypes may, however, differ significantly in their effect on the ecosystem, so that one phenotype may deplete or increase ecosystem constituents in an entirely distinct way from the other (for instance, the production of a public good by phenotype X in Eq. S1).
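To see how the public good model fits this framework, the reactions of Eq. S1 can be written as reactant and product vectors and tested against assumptions i and iii. The sketch below is an illustration only; the species ordering (X, Y, Q), the data layout, and the function names are assumptions made for this example.

```python
# Species order: (X, Y, Q). Each reaction is (reactants a_mu, products b_mu).
REACTIONS = [
    ((1, 0, 0), (2, 0, 0)),  # X -> X + X
    ((2, 0, 0), (1, 0, 0)),  # X + X -> X
    ((1, 1, 0), (0, 1, 0)),  # X + Y -> Y
    ((1, 0, 1), (2, 0, 1)),  # X + Q -> X + X + Q
    ((1, 0, 0), (1, 0, 1)),  # X -> X + Q
    ((0, 1, 0), (0, 2, 0)),  # Y -> Y + Y
    ((0, 2, 0), (0, 1, 0)),  # Y + Y -> Y
    ((1, 1, 0), (1, 0, 0)),  # X + Y -> X
    ((0, 1, 1), (0, 2, 1)),  # Y + Q -> Y + Y + Q
    ((0, 0, 1), (0, 0, 0)),  # Q -> (nothing)
]


def stoichiometry(reactions):
    """Rows nu_mu = b_mu - a_mu: net change in each species per reaction."""
    return [tuple(b - a for a, b in zip(reac, prod)) for reac, prod in reactions]


def satisfies_assumption_i(reactions):
    """Eq. S38: a reaction that changes a phenotype must consume that phenotype."""
    for (reac, _), nu in zip(reactions, stoichiometry(reactions)):
        for k in (0, 1):  # the two phenotypes
            if nu[k] != 0 and reac[k] <= 0:
                return False
    return True


def satisfies_assumption_iii(reactions):
    """Eq. S40: no reaction changes both phenotype numbers at once."""
    return all(nu[0] * nu[1] == 0 for nu in stoichiometry(reactions))
```

A reaction such as X + Y → ∅, which removes one individual of each phenotype simultaneously, would fail the second check.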

As R is increased, so too is the number of each phenotype and ecosystem constituent. If R is sufficiently large, once again a system-size expansion of the master equation can be conducted. Making the change of variables $x_{1} = n_{1} / R^{2},$ $x_{2} = n_{2} / R^{2},$ and $e_{i} = n_{i + 2} / R^{2},$ we obtain the set of Itō SDEs

\begin{array}{l} \frac{d x_{1}}{d t} = x_{1} [F^{(0)} (x, e) - ε F^{(1)} (x, e)] + \frac{1}{R} η_{1} (t), \\ \frac{d x_{2}}{d t} = x_{2} F^{(0)} (x, e) + \frac{1}{R} η_{2} (t), \\ \frac{d e_{i}}{d t} = h_{i} (x, e) + \frac{1}{R} β_{i} (t), \forall i = 1, \dots N . \end{array}

[S41]

The deterministic contribution to the SDEs can be determined from the transitions via

x_{1} [F^{(0)} (x, e) - ε F^{(1)} (x, e)] = \sum_{μ = 1}^{u} ν_{1 μ} T_{μ} [R^{2} {(x, e)}^{T} + ν_{μ} | {(x, e)}^{T}],

[S42]

x_{2} F^{(0)} (x, e) = \sum_{μ = 1}^{u} ν_{2 μ} T_{μ} [R^{2} {(x, e)}^{T} + ν_{μ} | {(x, e)}^{T}],

[S43]

h_{i} (x, e) = \sum_{μ = 1}^{u} ν_{(i + 2) μ} T_{μ} [R^{2} {(x, e)}^{T} + ν_{μ} | {(x, e)}^{T}] .

[S44]

Note that the relationship between Eqs. S42 and S43 is controlled by assumption ii. The correlations in the noise meanwhile are given by

〈 η_{1} (t) η_{1} (t') 〉 = δ (t - t') lim_{ϵ \to 0} \sum_{μ = 1}^{u} ν_{1 μ}^{2} T_{μ} [R (x, e) + ν_{μ} | (x, e)],

[S45]

〈 η_{2} (t) η_{2} (t') 〉 = δ (t - t') lim_{ϵ \to 0} \sum_{μ = 1}^{u} ν_{2 μ}^{2} T_{μ} [R (x, e) + ν_{μ} | (x, e)],

[S46]

〈 η_{1} (t) η_{2} (t') 〉 = 0,

[S47]

〈 η_{i} (t) β_{j} (t') 〉 = δ (t - t') lim_{ϵ \to 0} \sum_{μ = 1}^{u} ν_{i μ} ν_{(j + 2) μ} T_{μ} [R (x, e) + ν_{μ} | (x, e)],

[S48]

〈 β_{i} (t) β_{j} (t') 〉 = δ (t - t') lim_{ϵ \to 0} \sum_{μ = 1}^{u} ν_{(i + 2) μ} ν_{(j + 2) μ} T_{μ} [R (x, e) + ν_{μ} | (x, e)],

[S49]

at leading order in ε. The lack of noise correlation between the phenotypes, Eq. S47, is a consequence of assumption iii. Assumption ii allows us to rewrite Eqs. S45 and S46 as

\begin{array}{l} 〈 η_{1} (t) η_{1} (t') 〉 = δ (t - t') x_{1} H (x, e), \\ 〈 η_{2} (t) η_{2} (t') 〉 = δ (t - t') x_{2} H (x, e) . \end{array}

[S50]

An example of a system where this condition is not enforced is explored in SI Illustrating Generality with Reference to a Complementary System: The Stochastic Lotka–Volterra System.

To begin our analysis of the SDEs, a quasi-neutral limit is considered in which $ϵ = 0 .$ Then the deterministic ODEs for the system (the SDEs in the limit $R \to \infty$ ) lead to a manifold of fixed points associated with the focus phenotypes. We now make two additional assumptions:

v)
There exists a single stable, well-behaved manifold.
vi)
This manifold is one dimensional and so can be parameterized by a single variable.

We then choose to parameterize the manifold in terms of $x_{1},$ which for clarity we label z on the CM. The CM is then defined by the set of equations

x_{1} = z, x_{2} = Z_{2} (z), e_{i} = Z_{e i} (z) \forall i = 2, \dots N .

[S51]

The system dynamics are now entirely analogous to those of the public good model in the quasi-neutral limit. Deterministically, the system comes to rest at a point on the CM (which depends on the system’s initial conditions) at which it stays indefinitely and, when stochasticity is included, the system moves along the CM until one of the phenotypes fixates. A timescale separation is present as long as the composition of the population changes on a slower timescale than that of the collapse to the CM. In practice, the timescale of the collapse to the CM can be inferred from the eigenvalues of Eq. S41 linearized about the CM. The magnitude of the smallest nonzero eigenvalue is indicative of the slowest component of collapse to the CM (34). This should be much larger than the rate at which the system moves along the CM, which is of order $R^{- 2}$ (35).

To implement the timescale separation, a nonlinear projection is applied to the system that maps fluctuations back to the CM. This can be seen to be equivalent to transforming into the deterministically invariant variable whose existence is guaranteed by the existence of the CM (31), setting the dynamics in all other variables equal to zero, and evaluating the variables themselves on the CM. What form does this mapping take, in the quasi-neutral limit, for Eq. S41? Because the dynamical equations for the phenotypes take on the form of degenerate replicator equations in the limit $ε \to 0,$ the ratio $x_{1} / x_{2}$ is deterministically invariant, regardless of the other parameters. Therefore, the nonlinear mapping may be obtained by solving the following equation for z:

\frac{z}{Z_{2} (z)} = \frac{x_{1}}{x_{2}}, \to z = Y (x_{1}, x_{2}) .

[S52]

The resulting effective description for the quasi-neutral system on the CM can be denoted

\dot{z} = \frac{1}{R^{2}} S (z) + \frac{1}{R} ζ (t) .

[S53]

Note that whereas the deterministic system evaluated on the CM had no drift dynamics, the reduced system may. Mathematically, this is a consequence of the fact that the equations are defined strictly in the Itō sense (from the underlying IBM) and therefore the normal rules of calculus do not apply. Instead, any nonlinear transformation induces a drift, in general given by

S (z) = {\frac{1}{2} [\sum_{i j}^{2} (\frac{\partial^{2} z}{\partial x_{i} \partial x_{j}} B_{i j}) + \sum_{i j}^{N} (\frac{\partial^{2} z}{\partial e_{i} \partial e_{j}} B_{e i j})] |}_{x_{1} = z, x_{2} = Z_{2} (z), e_{i} = Z_{e i} (z)} .

[S54]

However, because the mapping z is independent of the ecosystem variables $e$ (Eq. S52), Eq. S54 can be simplified to

S (z) = {\frac{1}{2} \sum_{i j}^{2} (\frac{\partial^{2} z}{\partial x_{i} \partial x_{j}} B_{i j}) |}_{x_{1} = z, x_{2} = Z_{2} (z), e_{i} = Z_{e i} (z)} .

[S55]

The form of the correlations in $ζ (t)$ is now given by

ℬ (z) = {\sum_{i j}^{2} (\frac{\partial z}{\partial x_{i}} \frac{\partial z}{\partial x_{j}} B_{i j} (x)) |}_{x_{1} = z, x_{2} = Z_{2} (z), e_{i} = Z_{e i} (z)},

[S56]

where once again we have taken advantage of the property $(d z / d e_{i}) = 0$ for all i.

In this very general scenario, what inferences can we make about $S (z) ?$ To answer this question, it is convenient to return to our original SDEs, Eq. S41, and implement the timescale separation in a different fashion. We begin by transforming into variables measuring the total size of the $x_{1}$ and $x_{2}$ population and the fraction of type $x_{1} :$

N_{T} = x_{1} + x_{2}, f_{1} = \frac{x_{1}}{x_{1} + x_{2}}, \to x_{1} = f_{1} N_{T}, x_{2} = N_{T} (1 - f_{1}) .

[S57]

Applying this transformation, taking care to account for the impact of Itō calculus, we arrive at the following SDEs for the system:

\begin{matrix} \frac{d f_{1}}{d t} = \frac{1}{2 R^{2}} \sum_{i, j = 1}^{2} \frac{\partial^{2} f_{1}}{\partial x_{i} \partial x_{j}} B_{i j} + \frac{1}{R} {\tilde{η}}_{1} (t), \\ \frac{d N_{T}}{d t} = N_{T} F^{(0)} (x, e) + \frac{1}{R} {\tilde{η}}_{2} (t), \\ \frac{d e_{i}}{d t} = h_{i} (x, e) + \frac{1}{R} {\tilde{β}}_{i} (t), \forall i = 1, \dots N . \end{matrix}

[S58]

By conducting the transformation, we immediately notice a few things. Most trivially, the forms of the noise correlations are now altered in all variables. Second, because the transformation into the variable $N_{T}$ was linear, its governing SDE contains no noise-induced elements. Finally, the nonlinear transformation into $f_{1}$ has resulted in a noise-induced drift term. This drift term, however, is dependent only on the noise correlation structure between $x_{1}$ and $x_{2} .$ Evaluating the dynamics for $N_{T}$ and $e$ on the CM and substituting in the remaining expressions from Eqs. S47 and S50, we obtain the following one-dimensional SDE for $f_{1},$

\frac{d f_{1}}{d t} = \frac{1}{R} {\tilde{η}}_{1} (t),

[S59]

where ${\tilde{η}}_{1} (t)$ is evaluated on the CM. There are no deterministic dynamics in our reduced-dimension description of $f_{1} .$ This result is a consequence of assumptions ii and iii. The equation for the fixation probability of phenotype $X_{1}$ given an initial fraction $f_{10}$ on the CM, $Q (f_{10}),$ is then, regardless of the noise form,

Q (f_{10}) = f_{10} .

[S60]

Crucially however, $f_{1}$ is evaluated on the CM, which may vary depending on the constitution of the population:

f_{10} = \frac{x_{10}}{x_{10} + Z_{2} (x_{10})} .

[S61]

If $[d Z_{2} (x_{10}) / d x_{10}] < 1,$ then the total phenotype population decreases with increasing $x_{20},$ and phenotype $X_{1}$ has a larger invasion probability than $X_{2} .$ From this we can infer that $S (z)$ will be positive on average along the length of the CM:

\int_{z = 0}^{N_{1} / R^{2}} S (z) d z > 0 .

[S62]

Therefore, the phenotype with the higher carrying capacity will be stochastically selected for in this quasi-neutral case, regardless of their interaction with the environment. We note once again that this result is in general dependent on assumption ii. If assumption ii does not hold, then there will be correlations between the fluctuations $η_{1} (t)$ and $η_{2} (t)$ and, rather than the equation for the time evolution of $f_{1}$ featuring no mean drift (as in Eq. S59), there will be a noise-induced drift term favoring one or the other of the phenotypes. The exact form of this term will be highly dependent on the exact form of the interactions between the phenotypes, a full treatment of which lies outside the scope of this paper.
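As a rough numerical illustration of Eqs. S60 and S61, the sketch below evaluates the quasi-neutral invasion probabilities for a single invader of each type. It assumes, purely for illustration, a linear CM of the form $c_{1} x_{1} + c_{2} x_{2} = \tilde{b}$ (the form that arises in Eq. S70) and that one individual corresponds to a density $1 / R^{2}$; the function name and the parameter values are hypothetical.

```python
def quasi_neutral_invasion(b_tilde, c1, c2, R2):
    """Invasion probabilities (phi1, phi2) from Q(f) = f evaluated on the CM.

    Assumes a linear centre manifold c1*x1 + c2*x2 = b_tilde and that a
    single individual has density 1/R2 (both are illustrative assumptions).
    """
    x_inv = 1.0 / R2  # density of one invading individual

    # One X1 invader; the resident X2 density is read off the CM.
    x2_res = (b_tilde - c1 * x_inv) / c2
    phi1 = x_inv / (x_inv + x2_res)  # Eq. S61 inserted into Eq. S60

    # One X2 invader; the resident X1 density is read off the CM.
    x1_res = (b_tilde - c2 * x_inv) / c1
    phi2 = x_inv / (x_inv + x1_res)

    return phi1, phi2


# Hypothetical parameters with c2 > c1, so X1 has the larger carrying capacity.
phi1, phi2 = quasi_neutral_invasion(b_tilde=2.0, c1=1.0, c2=2.0, R2=16.0)
print(phi1, phi2)
```

With $c_{2} > c_{1}$ the sketch returns a larger invasion probability for $X_{1}$ than for $X_{2}$, in line with the argument above.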

Now suppose that $ε > 0,$ so that the system is nonneutral. In this case no CM exists: There is no line of deterministic fixed points and therefore no invariant variable onto which we can project the dynamics to reduce the problem. However, under the assumption that ε is small there is still a separation of timescales. If ε is sufficiently small, the slow manifold (and the projection to it) can be approximated by the results from the quasi-neutral case (Eqs. S51 and S52), plus an ε correction. A perturbative analysis can thus be conducted, and, under the assumption that $ε \approx O (R^{- 2}),$ at leading order we have

\dot{z} = - ε D (z) + \frac{1}{R^{2}} S (z) + \frac{1}{R} \bar{η} (t) .

[S63]

The form of $S (z)$ is unchanged from Eq. S54, whereas the new deterministic contribution to the drift takes the form

D (z) = - {\sum_{i = 1}^{N} (\frac{d z}{d x_{i}} \frac{d x_{i}}{d t}) |}_{x_{1} = z, x_{2} = Z_{2} (z), e = Z_{e} (z)} .

[S64]

Once again, however, the projection is simply a function of $x_{1}$ and $x_{2},$ and so

D (z) = - {(x_{1} F^{(0)} (x) \frac{d z}{d x_{1}} + x_{2} F^{(0)} (x) \frac{d z}{d x_{2}} - ε x_{1} F^{(1)} \frac{d z}{d x_{1}}) |}_{x_{1} = z, x_{2} = Z_{2} (z), e = Z_{e} (z)} .

[S65]

Finally, we also know that in the limit $ε \to 0$ this deterministic contribution to the dynamics on the CM, $D (z),$ should disappear. Therefore, the first two terms in the above equation must cancel, leaving us with

D (z) = {ε z (F^{(1)} (x) \frac{d z}{d x_{1}}) |}_{x_{1} = z, x_{i} = Z_{i} (z)} .

[S66]

We now have a much simpler system to deal with. Say that $F^{(1)} (x)$ is strictly positive. Then this will be a term that consistently decreases the value of $x_{1} .$ Based on physical arguments, we would expect that, regardless of the form of the mapping z, $D (z)$ must be positive. We still require the exact form of z (Eq. S52) to make analytic progress and specific predictions. Generally, however, we have shown that $S (z)$ will be positive as long as species $X_{1}$ has a larger carrying capacity (subject to the above conditions). A consideration of Eq. S63 shows that even when the system is nonneutral, for sufficiently weak selection/small R, there will be a trade-off between stochastic “strength in numbers” and deterministic costs for high-density behavior.

SI Illustrating Generality with Reference to a Complementary System: The Stochastic Lotka–Volterra System

In SI Obtaining a One-Dimensional Effective Public Good Model it was noted that deterministically the public good model reduces to a competitive Lotka–Volterra model under the elimination of the fast public good dynamics. However, it is important to note that although the two systems may be deterministically equivalent at long times, alterations in the demographic noise structure give them distinct behaviors. Despite this, the qualitative picture remains the same; for the quasi-neutral system, the fixation probability of each type is simply proportional to its initial fraction in the population, whereas when selection is introduced, there is a trade-off between stochastic and deterministic effects. To illustrate this finding, we investigate the stochastic Lotka–Volterra competition model (SLVC), derived from first principles.

In this section we analyze a stochastic Lotka–Volterra competition model, using the methods developed in SI Generality of Results. We assume a population composed of two phenotypes, $X_{1}$ and $X_{2},$ whose numbers in the system are measured by $n_{1}$ and $n_{2},$ respectively. The phenotypes are born, die, and compete with each other. In particular, we define the system to be governed by the probability transition rates

\begin{array}{l} T_{1} (n_{1} + 1, n_{2} | n_{1}, n_{2}) = b_{1} n_{1}, \\ T_{2} (n_{1} - 1, n_{2} | n_{1}, n_{2}) = d_{1} n_{1} + \frac{c_{1}}{R^{2}} n_{1}^{2} + \frac{c_{2}}{R^{2}} n_{1} n_{2}, \\ T_{3} (n_{1}, n_{2} + 1 | n_{1}, n_{2}) = b_{2} n_{2}, \\ T_{4} (n_{1}, n_{2} - 1 | n_{1}, n_{2}) = d_{2} n_{2} + \frac{c_{1}}{R^{2}} n_{1} n_{2} + \frac{c_{2}}{R^{2}} n_{2}^{2} . \end{array}

Together with Eq. S2, this system fully specifies the stochastic dynamics. Taking the limit of large R, we can once again obtain a mesoscopic description of the system,

\begin{array}{l} \frac{d x_{1}}{d t} = x_{1} ((b_{1} - d_{1}) - c_{1} x_{1} - c_{2} x_{2}) + \frac{1}{R} η_{1} (t), \\ \frac{d x_{2}}{d t} = x_{2} ((b_{2} - d_{2}) - c_{1} x_{1} - c_{2} x_{2}) + \frac{1}{R} η_{2} (t), \end{array}

[S67]

where $η_{i} (t)$ have correlation structure Eq. S8 with the $B_{i j} (x)$ term given by

\begin{matrix} B_{11} (x) = x_{1} ((b_{1} + d_{1}) + c_{1} x_{1} + c_{2} x_{2}), \\ B_{22} (x) = x_{2} ((b_{2} + d_{2}) + c_{1} x_{1} + c_{2} x_{2}), \\ B_{12} (x) \equiv B_{21} (x) = 0 . \end{matrix}

[S68]
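The transition rates $T_{1}$ to $T_{4}$ given above can be simulated directly with the Gillespie algorithm (46). The following is a minimal sketch, not the simulation code used for the figures; the function names, the stopping rule, and the parameter values in the usage note are assumptions made for illustration.

```python
import random


def slvc_rates(n1, n2, b1, d1, b2, d2, c1, c2, R2):
    """Rates T_1..T_4 of the SLVC model, in the order listed in the text."""
    return [
        b1 * n1,                                                  # birth of X1
        d1 * n1 + (c1 / R2) * n1 * n1 + (c2 / R2) * n1 * n2,      # death of X1
        b2 * n2,                                                  # birth of X2
        d2 * n2 + (c1 / R2) * n1 * n2 + (c2 / R2) * n2 * n2,      # death of X2
    ]


# Change in (n1, n2) caused by each of the four events above.
STEPS = [(1, 0), (-1, 0), (0, 1), (0, -1)]


def simulate_to_fixation(n1, n2, params, rng, max_events=1_000_000):
    """Run the process until one phenotype is lost or max_events is reached.

    params = (b1, d1, b2, d2, c1, c2, R2); rng is a random.Random instance.
    Returns the final (n1, n2), the elapsed time, and the number of events.
    """
    t = 0.0
    events = 0
    while n1 > 0 and n2 > 0 and events < max_events:
        rates = slvc_rates(n1, n2, *params)
        total = sum(rates)
        t += rng.expovariate(total)                         # waiting time
        k = rng.choices(range(4), weights=rates)[0]         # which event fires
        n1 += STEPS[k][0]
        n2 += STEPS[k][1]
        events += 1
    return n1, n2, t, events
```

For example, `simulate_to_fixation(1, 16, (3.0, 1.0, 3.0, 1.0, 1.0, 2.0, 16.0), random.Random(0))` starts from a single $X_{1}$ invader in a resident $X_{2}$ population (hypothetical values). Averaging the outcome over many independent runs gives a Monte Carlo estimate of an invasion probability.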

Note that the noise structure is not the same as that in Eq. S50; two phenotypes with an equal effective reproduction rate $b_{1} - d_{1} = b_{2} - d_{2}$ have the same deterministic fitness, but distinct multiplicative noise. Phenotypes that are reproducing and dying more quickly are subject to greater noise as they have a larger rate of population turnover. We will, however, proceed to consider this more general scenario to illustrate what can happen when this assumption is not enforced. Finally, we impose a separation of timescales by setting

b_{1} - d_{1} = \tilde{b} (1 - ε), b_{2} - d_{2} = \tilde{b} .

[S69]

A CM thus exists if $ε = 0$ and an SM when ε is small. The parameter $\tilde{b}$ is an effective birth rate encompassing birth and death, whereas ε is a fitness cost paid by phenotype $X_{1}$ in terms of either a decreased birth rate or an increased death rate, relative to phenotype $X_{2} .$

In the case $ε = 0,$ the system is quasi-neutral, and so a CM exists. The equation for the CM $x_{2} = Z_{2} (x_{1})$ (Eq. S51) and its intersection with the boundaries $x_{2} = 0$ and $x_{1} = 0,$ $K_{1}^{(0)}$ and $K_{2}^{(0)},$ respectively, are

Z_{2} (x_{1}) = \frac{1}{c_{2}} (\tilde{b} - c_{1} x_{1}), K_{1}^{(0)} = \frac{\tilde{b}}{c_{1}}, K_{2}^{(0)} = \frac{\tilde{b}}{c_{2}} .

[S70]

The parameters $K_{1}^{(0)}$ and $K_{2}^{(0)}$ give the densities of the $X_{1}$ and $X_{2}$ phenotypes in isolation. We assume that $c_{2} > c_{1}$ and thus that phenotype $X_{1}$ exists at higher densities than phenotype $X_{2} .$ Finally, the mapping from any point $(x_{1}, x_{2})$ to a coordinate $z = x_{1}$ on the CM is determined from Eq. S52:

z = \frac{\tilde{b} x_{1}}{c_{1} x_{1} + c_{2} x_{2}} .

[S71]

We can now obtain expressions for $D (z),$ $S (z),$ and $ℬ (z)$ directly from Eqs. S66, S54, and S56:

D (z) = z (\tilde{b} - c_{1} z),

[S72]

S (z) = \frac{2}{{\tilde{b}}^{2}} z (\tilde{b} - c_{1} z) (c_{2} (\tilde{b} + d_{2}) - c_{1} (\tilde{b} + d_{1})),

[S73]

ℬ (z) = \frac{2}{{\tilde{b}}^{2}} z (\tilde{b} - c_{1} z) [z (c_{2} (\tilde{b} + d_{2}) - c_{1} (\tilde{b} + d_{1})) + \tilde{b} (d_{1} + \tilde{b})] .

[S74]
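Equation S73 can be checked numerically against its definition. The sketch below evaluates the sum in Eq. S55 by central finite differences of the mapping in Eq. S71, using the noise terms of Eq. S68 on the CM with $ε = 0$ (so that $b_{i} = \tilde{b} + d_{i}$), and compares the result with the closed form. The function names, step size, and parameter values are illustrative assumptions.

```python
def z_map(x1, x2, b_tilde, c1, c2):
    """Projection onto the CM, Eq. S71."""
    return b_tilde * x1 / (c1 * x1 + c2 * x2)


def drift_numeric(z, b_tilde, d1, d2, c1, c2, h=1e-4):
    """S(z) from Eq. S55, with second derivatives taken by finite differences."""
    x1 = z
    x2 = (b_tilde - c1 * z) / c2  # point on the CM, Eq. S70

    def f(a, b):
        return z_map(a, b, b_tilde, c1, c2)

    d2z_dx1 = (f(x1 + h, x2) - 2.0 * f(x1, x2) + f(x1 - h, x2)) / h**2
    d2z_dx2 = (f(x1, x2 + h) - 2.0 * f(x1, x2) + f(x1, x2 - h)) / h**2

    # Eq. S68 with b_i + d_i = b_tilde + 2*d_i (valid for eps = 0).
    crowding = c1 * x1 + c2 * x2
    B11 = x1 * ((b_tilde + 2.0 * d1) + crowding)
    B22 = x2 * ((b_tilde + 2.0 * d2) + crowding)

    # B12 = 0, so the mixed derivative does not contribute.
    return 0.5 * (d2z_dx1 * B11 + d2z_dx2 * B22)


def drift_closed_form(z, b_tilde, d1, d2, c1, c2):
    """S(z) as written in Eq. S73."""
    return (2.0 / b_tilde**2) * z * (b_tilde - c1 * z) * (
        c2 * (b_tilde + d2) - c1 * (b_tilde + d1)
    )


args = (0.8, 2.0, 1.0, 0.5, 1.0, 2.0)  # z, b_tilde, d1, d2, c1, c2 (hypothetical)
print(drift_numeric(*args), drift_closed_form(*args))
```

The two printed values should agree up to the finite-difference error, which supports the closed form for this choice of parameters but is not a substitute for the derivation.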

The equation can now be solved to calculate the fixation probability of phenotype $X_{1}$ along the CM. In terms of the initial fraction of $X_{1},$ $f_{1},$ we find

Q (f_{1}) = \frac{1 - χ (f_{1})}{1 - χ (1)}, χ (f_{1}) = {[(\frac{K_{1}^{(0)}}{d_{1} + \tilde{b}}) (\frac{d_{1} (1 - f_{1}) + d_{2} f_{1} + \tilde{b}}{K_{1}^{(0)} (1 - f_{1}) + f_{1} K_{2}^{(0)}})]}^{- θ},

[S75]

where θ is a parameter given by

θ = (1 + \frac{K_{1}^{(0)} K_{2}^{(0)} R^{2} \tilde{b} ϵ}{K_{2}^{(0)} (d_{1} + \tilde{b}) - K_{1}^{(0)} (d_{2} + \tilde{b})}) .

[S76]

Let us consider the special case $ϵ = 0 .$ The fixation probability then becomes

{Q (f_{1}) |}_{ϵ = 0} = \frac{f_{1} (d_{2} + \tilde{b})}{d_{1} (1 - f_{1}) + d_{2} f_{1} + \tilde{b}} .

[S77]

The species with the lower death rate (and birth rate, because $\tilde{b}$ is fixed) has a greater probability of fixation than the species with the higher birth rate/death rate. This insight, made in refs. 9 and 10, is a result of the higher levels of noise experienced by the phenotype with the high birth and death rates. The higher levels of demographic noise experienced by the short-lived phenotype make it easier for the longer-lived phenotype (lower birth/death rates) to invade and fixate. For the purposes of this paper, we ignore such effects to focus on systems in which the carrying capacity of the phenotypes alone is responsible for the differences in noise experienced by the phenotypes on the CM/SM.
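The reduction from Eq. S75 to Eq. S77 can be illustrated numerically. The sketch below implements Eqs. S75 and S76 as written and compares the $ε = 0$ case with Eq. S77; the function names and parameter values are hypothetical, and the code assumes parameters for which the denominators in Eqs. S75 and S76 do not vanish.

```python
def theta(K1, K2, R2, b_tilde, d1, d2, eps):
    """Exponent from Eq. S76."""
    return 1.0 + K1 * K2 * R2 * b_tilde * eps / (
        K2 * (d1 + b_tilde) - K1 * (d2 + b_tilde)
    )


def chi(f1, K1, K2, b_tilde, d1, d2, th):
    """chi(f1) from Eq. S75."""
    base = (K1 / (d1 + b_tilde)) * (
        (d1 * (1.0 - f1) + d2 * f1 + b_tilde) / (K1 * (1.0 - f1) + f1 * K2)
    )
    return base ** (-th)


def fixation_probability(f1, K1, K2, R2, b_tilde, d1, d2, eps):
    """Q(f1) from Eq. S75."""
    th = theta(K1, K2, R2, b_tilde, d1, d2, eps)
    num = 1.0 - chi(f1, K1, K2, b_tilde, d1, d2, th)
    den = 1.0 - chi(1.0, K1, K2, b_tilde, d1, d2, th)
    return num / den


def fixation_probability_neutral(f1, b_tilde, d1, d2):
    """Q(f1) at eps = 0, Eq. S77."""
    return f1 * (d2 + b_tilde) / (d1 * (1.0 - f1) + d2 * f1 + b_tilde)


# Hypothetical values: X1 has the lower death rate (d1 < d2).
q_general = fixation_probability(0.3, 2.0, 1.0, 16.0, 2.0, 0.5, 1.0, eps=0.0)
q_neutral = fixation_probability_neutral(0.3, 2.0, 0.5, 1.0)
print(q_general, q_neutral)
```

For these values the two expressions coincide and exceed the initial fraction $f_{1},$ consistent with the advantage of the longer-lived phenotype described above.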

To this end, we now focus on the case $b_{1} = b_{2} \equiv b,$ $d_{1} = d_{2} \equiv d .$ In this case, ${Q (f_{1}) |}_{ϵ = 0} = f_{1},$ and $Q (f_{1})$ in general becomes

Q (f_{1}) = \frac{1 - χ (f_{1})}{1 - χ (1)}, χ (f_{1}) = {(\frac{K_{1}^{(0)}}{K_{1}^{(0)} (1 - f_{1}) + f_{1} K_{2}^{(0)}})}^{- θ},

[S78]

where θ is now given by

θ = (1 + \frac{K_{1}^{(0)} K_{2}^{(0)} R^{2} (b - d) ϵ}{(K_{2}^{(0)} - K_{1}^{(0)}) b}) .

[S79]

The invasion probabilities $ϕ_{1}$ and $ϕ_{2}$ meanwhile are given by

ϕ_{1} = Q (N_{2}^{- 1}), ϕ_{2} = 1 - Q (1 - N_{1}^{- 1}) .

[S80]

We can use the above expressions to obtain an approximate value for the maximum cost to birth rate that can be paid such that selection reversal is observed. Assuming $N_{1}^{- 1}$ and $N_{2}^{- 1}$ are of order ε and Taylor expanding in ε, we find the cost to birth must obey

\frac{1}{N_{2}} (\frac{b}{b - d}) (1 - \frac{N_{2}}{N_{1}}) > ε

[S81]

for the direction of selection to be reversed. This is analogous to Eq. 8 in the main text.
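The condition in Eq. S81 can be compared with the invasion probabilities of Eqs. S78 to S80. The sketch below assumes $N_{i} = R^{2} K_{i}^{(0)}$ for the resident population sizes; that identification, the function names, and the parameter values are assumptions made for illustration. The expressions are undefined when $θ = 0$ exactly, so the example evaluates costs on either side of the threshold.

```python
def theta_equal_rates(K1, K2, R2, b, d, eps):
    """Exponent from Eq. S79 (equal birth and death rates)."""
    return 1.0 + K1 * K2 * R2 * (b - d) * eps / ((K2 - K1) * b)


def Q(f1, K1, K2, th):
    """Fixation probability from Eq. S78."""
    def chi(f):
        return (K1 / (K1 * (1.0 - f) + f * K2)) ** (-th)
    return (1.0 - chi(f1)) / (1.0 - chi(1.0))


def invasion_probabilities(K1, K2, R2, b, d, eps):
    """(phi1, phi2) from Eq. S80, assuming N_i = R2 * K_i."""
    N1, N2 = R2 * K1, R2 * K2
    th = theta_equal_rates(K1, K2, R2, b, d, eps)
    phi1 = Q(1.0 / N2, K1, K2, th)
    phi2 = 1.0 - Q(1.0 - 1.0 / N1, K1, K2, th)
    return phi1, phi2


def max_cost(K1, K2, R2, b, d):
    """Left-hand side of Eq. S81: approximate largest cost allowing reversal."""
    N1, N2 = R2 * K1, R2 * K2
    return (1.0 / N2) * (b / (b - d)) * (1.0 - N2 / N1)


# Hypothetical parameters with K1 > K2.
K1, K2, R2, b, d = 2.0, 1.0, 16.0, 3.0, 1.0
eps_max = max_cost(K1, K2, R2, b, d)
print(eps_max)
print(invasion_probabilities(K1, K2, R2, b, d, 0.5 * eps_max))  # below threshold
print(invasion_probabilities(K1, K2, R2, b, d, 2.0 * eps_max))  # above threshold
```

For these values $ϕ_{1} > ϕ_{2}$ at half the threshold cost and $ϕ_{1} < ϕ_{2}$ at twice the threshold cost, which matches the direction of the inequality in Eq. S81.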

SI Order of Magnitude Parameter Estimates

In this section we seek to identify an illustrative set of parameters to use in the model in order to emphasize that the insights developed are biologically reasonable. We wish to obtain order of magnitude estimates for the set of parameters b, $p_{x},$ $p_{u},$ r, δ, κ, R, m, and D. We choose the yeast S. cerevisiae as our model organism. Whereas our model is more physically realistic than many mathematical public good models, we note that there are still choices that must be made in relating this physical system to our general framework.

Our model is constructed such that the uptake of one constituent of the public good, Q, by a phenotype, results in a reproduction event. In the context of S. cerevisiae, the type Q is thus shorthand for the amount of invertase that must be present in the system to break down sucrose into sufficient glucose for a reproduction event of the yeast. Let us define σ to be the scaling between $n_{q}$ and the total number of invertase molecules, such that the number of invertase molecules is $σ n_{q} .$ To understand the relationship between our model parameters and physically measurable parameters, we begin by considering a simplified ODE system of our model:

\frac{d x}{d t} = x (b + r q - κ x),

[S82]

\frac{d q}{d t} = p x - δ q .

[S83]

Whereas the total number of discrete invertase constituents is $n_{q} \approx R^{2} q,$ the total number of invertase molecules is $R^{2} σ q .$ Let θ be a measure of the number of invertase molecules, such that $θ = σ q .$ The ODEs in this more natural variable read

\frac{d x}{d t} = x (b + \frac{r}{σ} θ - κ x),

[S84]

\frac{d θ}{d t} = σ p x - δ θ .

[S85]

The decay rate δ is independent of the number of molecules that make up an invertase constituent Q, so we can take experimental measurements of the invertase molecular decay rate as values for δ. Meanwhile the molecular invertase production rate and reproduction rate due to invertase take on scaled forms of the parameters in our original ODEs:

r_{mol} = \frac{r}{σ},

[S86]

p_{mol} = σ p .

[S87]
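The rescaling in Eqs. S86 and S87 leaves the yeast dynamics unchanged, which can be seen by integrating both forms of the ODEs. The sketch below uses a simple forward Euler scheme; the integrator, the step size, and all parameter values (including σ) are hypothetical choices made for illustration.

```python
def integrate(b, r, kappa, p, delta, x0, q0, dt, steps):
    """Forward Euler integration of Eqs. S82 and S83.

    Eqs. S84 and S85 have the same form with r -> r/sigma and p -> sigma*p,
    so the same routine serves for both parameterizations.
    """
    x, q = x0, q0
    for _ in range(steps):
        dx = x * (b + r * q - kappa * x)
        dq = p * x - delta * q
        x, q = x + dt * dx, q + dt * dq
    return x, q


# Hypothetical model parameters and scaling factor.
b, r, kappa, p, delta, sigma = 1.0, 1.0, 2.0, 1.0, 1.0, 50.0

# Original variables (x, q).
x_q, q_end = integrate(b, r, kappa, p, delta, 0.1, 0.0, 1e-3, 5000)

# Molecular variables (x, theta) with r_mol = r/sigma and p_mol = sigma*p.
x_th, theta_end = integrate(b, r / sigma, kappa, sigma * p, delta, 0.1, 0.0, 1e-3, 5000)

print(x_q, x_th)                   # yeast densities agree
print(theta_end, sigma * q_end)    # theta equals sigma * q
```

Only the products and ratios $p_{mol} = σ p$ and $r_{mol} = r / σ$ enter the rescaled equations, so σ affects the public good variable but not the trajectory of x.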

Whereas measurements of $p_{mol}$ are obtainable in the literature (Table S1), our estimation of $r_{mol}$ is complicated by the fact that it is an effective parameter. It must capture the increase in the reproductive rate due to invertase, which in reality is coupled to the reaction rate of invertase and sucrose into glucose, as well as to the uptake rate of glucose by yeast and the energy conversion to reproduction. We do, however, know the typical range of yeast reproduction rates. Let us define $λ_{exp}$ as the yeast reproduction rate measured experimentally. In turn, let $λ_{eff}$ be the effective per capita reproduction rate of yeast in the model:

λ_{eff} = b + r q .

[S88]

The yeast reproduction rate clearly depends on the amount of public good in the system, typically varying from

λ_{eff} = b \quad \text{(all nonproducers)} \quad \text{to} \quad λ_{eff} = b \frac{δ κ}{δ κ - p_{x} r} \quad \text{(all producers)} .

[S89]

In reality, the reproduction rate of yeast in a system without any invertase is effectively zero; we have assumed some baseline birth rate for convenience in the model, which could be physically interpreted as being associated with an exogenous glucose concentration in the system. We assume that this is typically low, such that b is small, whereas the yeast approaches its maximum reproductive rate when it consists entirely of producers.
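The upper limit in Eq. S89 follows from the stationary state of Eqs. S82 and S83, where $q = p x / δ$ and $b + r q = κ x .$ The sketch below evaluates that stationary state and the resulting $λ_{eff}$; it writes the producer's production rate as `p` (corresponding to $p_{x}$ in Eq. S89), requires $δ κ > p r$ so that the stationary state is finite and positive, and uses hypothetical parameter values.

```python
def stationary_state(b, r, kappa, p, delta):
    """Nonzero fixed point (x*, q*) of Eqs. S82 and S83."""
    if delta * kappa <= p * r:
        raise ValueError("requires delta * kappa > p * r")
    x_star = b * delta / (delta * kappa - p * r)
    q_star = p * x_star / delta
    return x_star, q_star


def lambda_eff(b, r, q):
    """Effective per capita reproduction rate, Eq. S88."""
    return b + r * q


# Hypothetical parameters.
b, r, kappa, p, delta = 1.0, 1.0, 2.0, 1.0, 1.0

# All producers: compare Eq. S88 at the fixed point with Eq. S89.
x_star, q_star = stationary_state(b, r, kappa, p, delta)
print(lambda_eff(b, r, q_star), b * delta * kappa / (delta * kappa - p * r))

# All nonproducers (p = 0): no public good, so lambda_eff reduces to b.
x0_star, q0_star = stationary_state(b, r, kappa, 0.0, delta)
print(lambda_eff(b, r, q0_star))
```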

The parameter κ controls death due to crowding. For simplicity this is the only form of death in the model. This choice leads, perhaps unnaturally, to the nonproducers (who exist at typically lower densities) having a much smaller death rate than producers. For the parameters chosen, however, we obtain per capita death timescales (inverse death rates) on the order of 1 h for producers and 10 h for nonproducers. The parameter R meanwhile measures the assumed spatial interaction scale. It determines the typical number of individuals on each patch. We can use this value to infer the size of a patch. Denoting the diameter of a yeast cell as $L_{c}$ and assuming that the hyperproducing cells in the stationary state can be packed on a grid, the size of each patch, $L_{p},$ can be approximated by

L_{p} = L_{c} \sqrt{N_{u}},

[S90]

= L_{c} R \sqrt{K_{u}} .

[S91]

The parameters m and D are effective migration and diffusion rates in our model. To map these to physical parameters, they must in turn be scaled by the patch length. The public good diffusion rate must also be scaled by σ, which maps the discrete number of invertase constituents Q to the number of invertase molecules. Denoting by $m_{exp}$ and $D_{exp}$ the physical migration and public good diffusion rates, it can be shown that (28)

D_{exp} = σ L_{p}^{2} D, m_{exp} = L_{p}^{2} m .

[S92]
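The conversions in Eqs. S90 to S92 amount to simple arithmetic, sketched below. The function names and all numerical values are placeholders chosen for illustration and are not the values reported in the tables.

```python
import math


def patch_length(L_c, R, K_u):
    """Patch length L_p from Eqs. S90 and S91, with N_u = R**2 * K_u."""
    return L_c * R * math.sqrt(K_u)


def physical_rates(D, m, sigma, L_p):
    """Physical diffusion and migration rates (D_exp, m_exp) from Eq. S92."""
    D_exp = sigma * L_p**2 * D
    m_exp = L_p**2 * m
    return D_exp, m_exp


# Placeholder inputs: cell diameter, interaction scale, scaled carrying capacity.
L_p = patch_length(L_c=4.0, R=4.0, K_u=9.0)
D_exp, m_exp = physical_rates(D=2.0, m=0.5, sigma=10.0, L_p=L_p)
print(L_p, D_exp, m_exp)
```

The units of $D_{exp}$ and $m_{exp}$ follow from the units chosen for $L_{c}$ and for the model rates D and m.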

The parameter choices that follow from these calculations are summarized in Table S2.

When considering the parameter choices summarized in Table S2, it is important to make a final point. Whereas both the approximations we have used, the system size expansion in SI Obtaining the SDE System from the Microscopic Individual-Based Model and the fast-variable elimination in SI Obtaining a One-Dimensional Effective Public Good Model, rely formally on $R^{2}$ being large and ε being small ( $O (R^{- 2}) > ε$ ), in practical terms the procedures are relatively robust to this restriction. Indeed, throughout the body of the main text, $R = 4,$ whereas ε is varied on the interval $[0,0.1] .$ In fact, we find that the approximate analytic expression we obtain for the invasion probabilities of the phenotypes, Eq. S28, describes the results obtained from simulation well, up to $ε = R^{- 2},$ as illustrated in Fig. S2.

In terms of the system size expansion, this robustness can in part be explained by the fact that the typical population sizes ( $N_{x},$ $N_{y},$ and $N_{q}$ ) are proportional to $R^{2} .$ For populations of fixed size N, SDEs for the system can be obtained by means of a Taylor expansion of the master equation (for example, Eq. S2) as a series in $1 / N .$ A crucial feature of the system we are concerned with here, however, is that population sizes may vary, and so this technique is unavailable. Instead we conduct an expansion in the interaction scale R, the square of which is proportional to the mean population size. Although R may not be a large number itself, increasing R leads to an associated increase in population size (Table S2). In turn, this leads to terms of higher order in the Taylor expansion of the master equation becoming subdominant (47), justifying the truncation that leads to Eq. S4. In contrast, the resilience of the fast-variable elimination approximation to such large values of ε is surprising.

Supplementary Material

Supplementary File

Download video file^{(9.4MB, mp4)}

Supplementary File

Download video file^{(863.5KB, mp4)}

Acknowledgments

T.R. acknowledges funding from the Royal Society of London. C.E.T. acknowledges support from the Alfred P. Sloan Foundation (FR-2015-65382).

Footnotes

The authors declare no conflict of interest.

This article is a PNAS Direct Submission.

This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1603693113/-/DCSupplemental.

References

1. Hofbauer J, Sigmund K. Evolutionary Games and Population Dynamics. Cambridge Univ Press; Cambridge, UK: 1998.
2. Fisher RA. The Genetical Theory of Natural Selection. Clarendon Press; Oxford: 1930.
3. Wright S. Evolution in Mendelian populations. Genetics. 1931;16(2):97–159. doi: 10.1093/genetics/16.2.97.
4. Nowak MA. Evolutionary Dynamics: Exploring the Equations of Life. Harvard Univ Press; Cambridge, MA: 2006.
5. McKane AJ, Newman TJ. Predator-prey cycles from resonant amplification of demographic stochasticity. Phys Rev Lett. 2005;94(21):218102. doi: 10.1103/PhysRevLett.94.218102.
6. Butler T, Goldenfeld N. Robust ecological pattern formation induced by demographic noise. Phys Rev E Stat Nonlin Soft Matter Phys. 2009;80(3 Pt 1):030902. doi: 10.1103/PhysRevE.80.030902.
7. Hallatschek O, Hersen P, Ramanathan S, Nelson DR. Genetic drift at expanding frontiers promotes gene segregation. Proc Natl Acad Sci USA. 2007;104(50):19926–19930. doi: 10.1073/pnas.0710150104.
8. Rossberg AG, Rogers T, McKane AJ. Are there species smaller than 1 mm? Proc Biol Sci. 2013;280(1767):20131248. doi: 10.1098/rspb.2013.1248.
9. Parsons TL, Quince C. Fixation in haploid populations exhibiting density dependence I: The non-neutral case. Theor Popul Biol. 2007;72(1):121–135. doi: 10.1016/j.tpb.2006.11.004.
10. Lin YT, Kim H, Doering CR. Features of fast living: On the weak selection for longevity in degenerate birth-death processes. J Stat Phys. 2012;148:646–662.
11. Gunawardena J. Time-scale separation--Michaelis and Menten’s old idea, still bearing fruit. FEBS J. 2014;281(2):473–488. doi: 10.1111/febs.12532.
12. Parsons TL, Quince C. Fixation in haploid populations exhibiting density dependence II: The quasi-neutral case. Theor Popul Biol. 2007;72(4):468–479. doi: 10.1016/j.tpb.2007.04.002.
13. Chotibut T, Nelson DR. Evolutionary dynamics with fluctuating population sizes and strong mutualism. Phys Rev E Stat Nonlin Soft Matter Phys. 2015;92(2):022718. doi: 10.1103/PhysRevE.92.022718.
14. Constable GWA, McKane AJ. Models of genetic drift as limiting forms of the Lotka-Volterra competition model. Phys Rev Lett. 2015;114(3):038101. doi: 10.1103/PhysRevLett.114.038101.
15. Parsons TL, Quince C, Plotkin JB. Some consequences of demographic stochasticity in population genetics. Genetics. 2010;185(4):1345–1354. doi: 10.1534/genetics.110.115030.
16. Kogan O, Khasin M, Meerson B, Schneider D, Myers CR. Two-strain competition in quasineutral stochastic disease dynamics. Phys Rev E Stat Nonlin Soft Matter Phys. 2014;90(4):042149. doi: 10.1103/PhysRevE.90.042149.
17. Houchmandzadeh B, Vallade M. Selection for altruism through random drift in variable size populations. BMC Evol Biol. 2012;12:61. doi: 10.1186/1471-2148-12-61.
18. Houchmandzadeh B. Fluctuation driven fixation of cooperative behavior. Biosystems. 2015;127:60–66. doi: 10.1016/j.biosystems.2014.11.006.
19. Reznick D, Bryant MJ, Bashey F. r- and k-selection revisited: The role of population regulation in life-history evolution. Ecology. 2002;83:1509–1520.
20. Nowak MA, Sasaki A, Taylor C, Fudenberg D. Emergence of cooperation and evolutionary stability in finite populations. Nature. 2004;428(6983):646–650. doi: 10.1038/nature02414.
21. Rice SH. Evolutionary Theory. Sinauer; Sunderland, MA: 2004.
22. Crow JF, Kimura M. An Introduction to Population Genetics Theory. Blackburn Press; Caldwell, NJ: 1970.
23. Gore J, Youk H, van Oudenaarden A. Snowdrift game dynamics and facultative cheating in yeast. Nature. 2009;459(7244):253–256. doi: 10.1038/nature07921.
24. Hauert C, Holmes M, Doebeli M. Evolutionary games and population dynamics: Maintenance of cooperation in public goods games. Proc Biol Sci. 2006;273(1600):2565–2570. doi: 10.1098/rspb.2006.3600.
25. Huang W, Hauert C, Traulsen A. Stochastic game dynamics under demographic fluctuations. Proc Natl Acad Sci USA. 2015;112(29):9064–9069. doi: 10.1073/pnas.1418745112.
26. Koschwanez JH, Foster KR, Murray AW. Sucrose utilization in budding yeast as a model for the origin of undifferentiated multicellularity. PLoS Biol. 2011;9(8):e1001122. doi: 10.1371/journal.pbio.1001122.
27. Kümmerli R, Brown SP. Molecular and regulatory properties of a public good shape the evolution of cooperation. Proc Natl Acad Sci USA. 2010;107(44):18921–18926. doi: 10.1073/pnas.1011154107.
28. Gardiner CW. Handbook of Stochastic Methods. Springer; Berlin: 2009.
29. Black AJ, McKane AJ. Stochastic formulation of ecological models and their applications. Trends Ecol Evol. 2012;27(6):337–345. doi: 10.1016/j.tree.2012.01.014.
30. Parsons TL, Rogers T. 2015. Dimension reduction via timescale separation in stochastic dynamical systems. arXiv:1510.07031.
31. Arnold L. Random Dynamical Systems, Springer Monographs in Mathematics. Springer; Berlin: 2003.
32. MaClean RC, Fuentes-Hernandez A, Greig D, Hurst LD, Gudelj I. A mixture of “cheats” and “co-operators” can enable maximal group benefit. PLoS Biol. 2010;8(9):e1000486. doi: 10.1371/journal.pbio.1000486.
33. Craig Maclean R, Brandon C. Stable public goods cooperation and dynamic social interactions in yeast. J Evol Biol. 2008;21(6):1836–1843. doi: 10.1111/j.1420-9101.2008.01579.x.
34. Constable GWA, McKane AJ, Rogers T. Stochastic dynamics on slow manifolds. J Phys A Math Theor. 2013;46(29):295002.
35. Constable GWA, McKane AJ. Fast-mode elimination in stochastic metapopulation models. Phys Rev E Stat Nonlin Soft Matter Phys. 2014;89(3):032141. doi: 10.1103/PhysRevE.89.032141.
36. Behar H, Brenner N, Ariel G, Louzoun Y. 2016. Fluctuations-induced coexistence in public good dynamics. Phys Bio, in press.
37. Behar H, Brenner N, Louzoun Y. Coexistence of productive and non-productive populations by fluctuation-driven spatio-temporal patterns. Theor Popul Biol. 2014;96:20–29. doi: 10.1016/j.tpb.2014.06.002.
38. Nowak MA, Tarnita CE, Antal T. Evolutionary dynamics in structured populations. Philos Trans R Soc Lond B Biol Sci. 2010;365(1537):19–30. doi: 10.1098/rstb.2009.0215.
39. Wakano JY, Hauert C. Pattern formation and chaos in spatial ecological public goods games. J Theor Biol. 2011;268(1):30–38. doi: 10.1016/j.jtbi.2010.09.036.
40. Nowak MA, May RM. Evolutionary games and spatial chaos. Nature. 1992;359:826–829.
41. Allen B, Gore J, Nowak MA. Spatial dilemmas of diffusible public goods. eLife. 2013;2:e01169. doi: 10.7554/eLife.01169.
42. Waxman D, Gavrilets S. 20 questions on adaptive dynamics. J Evol Biol. 2005;18(5):1139–1154. doi: 10.1111/j.1420-9101.2005.00948.x.
43. Pianka ER. On r- and k-selection. Am Nat. 1970;104:592–597.
44. Lin YT, Kim H, Doering CR. Demographic stochasticity and evolution of dispersion I. Spatially homogeneous environments. J Math Biol. 2015;70(3):647–678. doi: 10.1007/s00285-014-0776-9.
45. Lin YT, Kim H, Doering CR. Demographic stochasticity and evolution of dispersion II: Spatially inhomogeneous environments. J Math Biol. 2015;70(3):679–707. doi: 10.1007/s00285-014-0756-0.
46. Gillespie DT. A general method for numerically simulating the stochastic time evolution of coupled chemical reactions. J Comput Phys. 1976;22:403–434.
47. van Kampen NG. Stochastic Processes in Physics and Chemistry. Elsevier; Amsterdam: 2007.
48. Risken H. The Fokker-Planck Equation. Springer; Berlin: 1989.
49. Gomez L, Ramirez HL, Cabrera G, Simpson BK, Villalonga R. Immobilization of invertase chitosan conjugate on hyaluronic-acid-modified chitin. J Food Biochem. 2008;32:264–277.
50. Sanchez A, Gore J. Feedback between population and evolutionary dynamics determines the fate of social microbial populations. PLoS Biol. 2013;11(4):e1001547. doi: 10.1371/journal.pbio.1001547.
51. Snoep JL, Mrwebi M, Schuurmans JM, Rohwer JM, Teixeira de Mattos MJ. Control of specific growth rate in Saccharomyces cerevisiae. Microbiology. 2009;155(Pt 5):1699–1707. doi: 10.1099/mic.0.023119-0.

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary File

Download video file^{(9.4MB, mp4)}

Supplementary File

Download video file^{(863.5KB, mp4)}

[r1] 1.Hofbauer J, Sigmund K. Evolutionary Games and Population Dynamics. Cambridge Univ Press; Cambridge, UK: 1998. [Google Scholar]

[r2] 2.Fisher RA. The Genetical Theory of Natural Selection. Clarendon Press; Oxford: 1930. [Google Scholar]

[r3] 3.Wright S. Evolution in Mendelian populations. Genetics. 1931;16(2):97–159. doi: 10.1093/genetics/16.2.97. [DOI] [PMC free article] [PubMed] [Google Scholar]

[r4] 4.Nowak MA. Evolutionary Dynamics: Exploring the Equations of Life. Harvard Univ Press; Cambridge, MA: 2006. [Google Scholar]

[r5] 5.McKane AJ, Newman TJ. Predator-prey cycles from resonant amplification of demographic stochasticity. Phys Rev Lett. 2005;94(21):218102. doi: 10.1103/PhysRevLett.94.218102. [DOI] [PubMed] [Google Scholar]

[r6] 6.Butler T, Goldenfeld N. Robust ecological pattern formation induced by demographic noise. Phys Rev E Stat Nonlin Soft Matter Phys. 2009;80(3 Pt 1):030902. doi: 10.1103/PhysRevE.80.030902. [DOI] [PubMed] [Google Scholar]

[r7] 7.Hallatschek O, Hersen P, Ramanathan S, Nelson DR. Genetic drift at expanding frontiers promotes gene segregation. Proc Natl Acad Sci USA. 2007;104(50):19926–19930. doi: 10.1073/pnas.0710150104. [DOI] [PMC free article] [PubMed] [Google Scholar]

[r8] 8.Rossberg AG, Rogers T, McKane AJ. Are there species smaller than 1 mm? Proc Biol Sci. 2013;280(1767):20131248. doi: 10.1098/rspb.2013.1248. [DOI] [PMC free article] [PubMed] [Google Scholar]

[r9] 9.Parsons TL, Quince C. Fixation in haploid populations exhibiting density dependence I: The non-neutral case. Theor Popul Biol. 2007;72(1):121–135. doi: 10.1016/j.tpb.2006.11.004. [DOI] [PubMed] [Google Scholar]

[r10] 10.Lin YT, Kim H, Doering CR. Features of fast living: On the weak selection for longevity in degenerate birth-death processes. J Stat Phys. 2012;148:646–662. [Google Scholar]

[r11] 11.Gunawardena J. Time-scale separation--Michaelis and Menten’s old idea, still bearing fruit. FEBS J. 2014;281(2):473–488. doi: 10.1111/febs.12532. [DOI] [PMC free article] [PubMed] [Google Scholar]

[r12] 12.Parsons TL, Quince C. Fixation in haploid populations exhibiting density dependence II: The quasi-neutral case. Theor Popul Biol. 2007;72(4):468–479. doi: 10.1016/j.tpb.2007.04.002. [DOI] [PubMed] [Google Scholar]

[r13] 13.Chotibut T, Nelson DR. Evolutionary dynamics with fluctuating population sizes and strong mutualism. Phys Rev E Stat Nonlin Soft Matter Phys. 2015;92(2):022718. doi: 10.1103/PhysRevE.92.022718. [DOI] [PubMed] [Google Scholar]

[r14] 14.Constable GWA, McKane AJ. Models of genetic drift as limiting forms of the Lotka-Volterra competition model. Phys Rev Lett. 2015;114(3):038101. doi: 10.1103/PhysRevLett.114.038101. [DOI] [PubMed] [Google Scholar]

[r15] 15.Parsons TL, Quince C, Plotkin JB. Some consequences of demographic stochasticity in population genetics. Genetics. 2010;185(4):1345–1354. doi: 10.1534/genetics.110.115030. [DOI] [PMC free article] [PubMed] [Google Scholar]

[r16] 16. Kogan O, Khasin M, Meerson B, Schneider D, Myers CR. Two-strain competition in quasineutral stochastic disease dynamics. Phys Rev E Stat Nonlin Soft Matter Phys. 2014;90(4):042149. doi: 10.1103/PhysRevE.90.042149.

[r17] 17. Houchmandzadeh B, Vallade M. Selection for altruism through random drift in variable size populations. BMC Evol Biol. 2012;12:61. doi: 10.1186/1471-2148-12-61.

[r18] 18. Houchmandzadeh B. Fluctuation driven fixation of cooperative behavior. Biosystems. 2015;127:60–66. doi: 10.1016/j.biosystems.2014.11.006.

[r19] 19. Reznick D, Bryant MJ, Bashey F. r- and K-selection revisited: The role of population regulation in life-history evolution. Ecology. 2002;83:1509–1520.

[r20] 20. Nowak MA, Sasaki A, Taylor C, Fudenberg D. Emergence of cooperation and evolutionary stability in finite populations. Nature. 2004;428(6983):646–650. doi: 10.1038/nature02414.

[r21] 21. Rice SH. Evolutionary Theory. Sinauer; Sunderland, MA: 2004.

[r22] 22. Crow JF, Kimura M. An Introduction to Population Genetics Theory. Blackburn Press; Caldwell, NJ: 1970.

[r23] 23. Gore J, Youk H, van Oudenaarden A. Snowdrift game dynamics and facultative cheating in yeast. Nature. 2009;459(7244):253–256. doi: 10.1038/nature07921.

[r24] 24. Hauert C, Holmes M, Doebeli M. Evolutionary games and population dynamics: Maintenance of cooperation in public goods games. Proc Biol Sci. 2006;273(1600):2565–2570. doi: 10.1098/rspb.2006.3600.

[r25] 25. Huang W, Hauert C, Traulsen A. Stochastic game dynamics under demographic fluctuations. Proc Natl Acad Sci USA. 2015;112(29):9064–9069. doi: 10.1073/pnas.1418745112.

[r26] 26. Koschwanez JH, Foster KR, Murray AW. Sucrose utilization in budding yeast as a model for the origin of undifferentiated multicellularity. PLoS Biol. 2011;9(8):e1001122. doi: 10.1371/journal.pbio.1001122.

[r27] 27. Kümmerli R, Brown SP. Molecular and regulatory properties of a public good shape the evolution of cooperation. Proc Natl Acad Sci USA. 2010;107(44):18921–18926. doi: 10.1073/pnas.1011154107.

[r28] 28. Gardiner CW. Handbook of Stochastic Methods. Springer; Berlin: 2009.

[r29] 29. Black AJ, McKane AJ. Stochastic formulation of ecological models and their applications. Trends Ecol Evol. 2012;27(6):337–345. doi: 10.1016/j.tree.2012.01.014.

[r30] 30. Parsons TL, Rogers T. 2015. Dimension reduction via timescale separation in stochastic dynamical systems. arXiv:1510.07031.

[r31] 31. Arnold L. Random Dynamical Systems, Springer Monographs in Mathematics. Springer; Berlin: 2003.

[r32] 32. MacLean RC, Fuentes-Hernandez A, Greig D, Hurst LD, Gudelj I. A mixture of “cheats” and “co-operators” can enable maximal group benefit. PLoS Biol. 2010;8(9):e1000486. doi: 10.1371/journal.pbio.1000486.

[r33] 33. MacLean RC, Brandon C. Stable public goods cooperation and dynamic social interactions in yeast. J Evol Biol. 2008;21(6):1836–1843. doi: 10.1111/j.1420-9101.2008.01579.x.

[r34] 34. Constable GWA, McKane AJ, Rogers T. Stochastic dynamics on slow manifolds. J Phys A Math Theor. 2013;46(29):295002.

[r35] 35. Constable GWA, McKane AJ. Fast-mode elimination in stochastic metapopulation models. Phys Rev E Stat Nonlin Soft Matter Phys. 2014;89(3):032141. doi: 10.1103/PhysRevE.89.032141.

[r36] 36. Behar H, Brenner N, Ariel G, Louzoun Y. 2016. Fluctuations-induced coexistence in public good dynamics. Phys Biol, in press.

[r37] 37. Behar H, Brenner N, Louzoun Y. Coexistence of productive and non-productive populations by fluctuation-driven spatio-temporal patterns. Theor Popul Biol. 2014;96:20–29. doi: 10.1016/j.tpb.2014.06.002.

[r38] 38. Nowak MA, Tarnita CE, Antal T. Evolutionary dynamics in structured populations. Philos Trans R Soc Lond B Biol Sci. 2010;365(1537):19–30. doi: 10.1098/rstb.2009.0215.

[r39] 39. Wakano JY, Hauert C. Pattern formation and chaos in spatial ecological public goods games. J Theor Biol. 2011;268(1):30–38. doi: 10.1016/j.jtbi.2010.09.036.

[r40] 40. Nowak MA, May RM. Evolutionary games and spatial chaos. Nature. 1992;359:826–829.

[r41] 41. Allen B, Gore J, Nowak MA. Spatial dilemmas of diffusible public goods. eLife. 2013;2:e01169. doi: 10.7554/eLife.01169.

[r42] 42. Waxman D, Gavrilets S. 20 questions on adaptive dynamics. J Evol Biol. 2005;18(5):1139–1154. doi: 10.1111/j.1420-9101.2005.00948.x.

[r43] 43. Pianka ER. On r- and K-selection. Am Nat. 1970;104:592–597.

[r44] 44. Lin YT, Kim H, Doering CR. Demographic stochasticity and evolution of dispersion I. Spatially homogeneous environments. J Math Biol. 2015;70(3):647–678. doi: 10.1007/s00285-014-0776-9.

[r45] 45. Lin YT, Kim H, Doering CR. Demographic stochasticity and evolution of dispersion II: Spatially inhomogeneous environments. J Math Biol. 2015;70(3):679–707. doi: 10.1007/s00285-014-0756-0.

[r46] 46. Gillespie DT. A general method for numerically simulating the stochastic time evolution of coupled chemical reactions. J Comput Phys. 1976;22:403–434.

[r47] 47. van Kampen NG. Stochastic Processes in Physics and Chemistry. Elsevier; Amsterdam: 2007.

[r48] 48. Risken H. The Fokker-Planck Equation. Springer; Berlin: 1989.

[r49] 49. Gomez L, Ramirez HL, Cabrera G, Simpson BK, Villalonga R. Immobilization of invertase chitosan conjugate on hyaluronic-acid-modified chitin. J Food Biochem. 2008;32:264–277.

[r50] 50. Sanchez A, Gore J. Feedback between population and evolutionary dynamics determines the fate of social microbial populations. PLoS Biol. 2013;11(4):e1001547. doi: 10.1371/journal.pbio.1001547.

[r51] 51. Snoep JL, Mrwebi M, Schuurmans JM, Rohwer JM, Teixeira de Mattos MJ. Control of specific growth rate in Saccharomyces cerevisiae. Microbiology. 2009;155(Pt 5):1699–1707. doi: 10.1099/mic.0.023119-0.