Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2009 Aug 21.
Published in final edited form as: J Theor Biol. 2008 Dec 6;257(2):340–344. doi: 10.1016/j.jtbi.2008.11.023

Strategy abundance in 2 × 2 games for arbitrary mutation rates

Tibor Antal a, Martin A Nowak a, Arne Traulsen b
PMCID: PMC2699184  NIHMSID: NIHMS119039  PMID: 19111558

Abstract

We study evolutionary game dynamics in a well-mixed populations of finite size, N. A well-mixed population means that any two individuals are equally likely to interact. In particular we consider the average abundances of two strategies, A and B, under mutation and selection. The game dynamical interaction between the two strategies is given by the 2 × 2 payoff matrix (abcd). It has previously been shown that A is more abundant than B, if a(N − 2) + bN > cN + d(N − 2). This result has been derived for particular stochastic processes that operate either in the limit of asymptotically small mutation rates or in the limit of weak selection. Here we show that this result holds in fact for a wide class of stochastic birth-death processes for arbitrary mutation rate and for any intensity of selection.

Keywords: Evolutionary game theory, Finite populations, Stochastic effects

1 Introduction

Evolutionary dynamics describe how successful strategies spread in a population through genetic reproduction or cultural imitation. In mutation-selection processes where the fitness of each individual is constant, the competition between two types A and B is straightforward. If both fitness values are identical, then we have neutral evolution (Kimura, 1968) and the average abundances of A and B are the same. (Throughout the paper we assume symmetric mutation rates: the mutation probability from A to B is the same as from B to A.) If A is fitter than B, then A is more abundant than B in the mutation selection equilibrium. Thus, for constant selection the comparison of the abundances of the two types is trivial.

For frequency dependent selection, where fitness depends on the types and abundances of others, the situation is more complex. One way to model these systems are evolutionary games, where the reproductive success of individuals depends on their payoff derived from interaction with others (Maynard Smith and Price, 1973; Maynard Smith, 1982; Weibull, 1995; Samuelson, 1997; Hofbauer and Sigmund, 1998; Fudenberg and Tirole, 1998; Hofbauer and Sigmund, 2003; Nowak and Sigmund, 2004; Szabó and Fáth, 2007; Sandholm, 2007). In the simplest case, interactions are described by a 2 × 2 payoff matrix,

(ABAabBcd). (1)

Here, an A individual obtains a from other A individuals, but b from B individuals. Similarly, B obtains c from A, and d from other B individuals.

For a > c and b > d, strategy A dominates strategy B: Regardless of the composition of the population, strategy A has the higher payoff. Thus, evolutionary dynamics will always lead to a population with more A individuals than B individuals. Equivalently, for a < c and b < d, strategy B dominates. In this case, the average abundance of B will be higher than the average abundance of A at the mutation-selection equilibrium.

For a > c and b < d, both strategies are best replies to themselves. This is a ‘coordination game’. In a population of mostly A individual, rare B mutants have a lower payoff. In a population of mostly B individuals, rare A mutants have a lower payoff. Therefore, in a mutation selection process it is not a priori clear whether strategy A or strategy B will be more abundant at the equilibrium distribution. Two concepts are important: (i) pareto efficiency and (ii) risk-dominance. Strategy A is Pareto efficient if a > d. In this case, an all-A population has a higher average payoff than an all-B population. Strategy A is risk-dominant if a + b > c + d (Harsanyi and Selten, 1988). In this case, strategy A has a larger basin of attraction than strategy B. In a game between two players, if I do not know what the other person will do, it is less risky for me to choose the risk dominant strategy, but it would be more rewarding for both of us to choose the Pareto efficient strategy. A coordination game is especially interesting, if the risk-dominant strategy is not Pareto efficient.

(Kandori et al., 1993) have shown that A is chosen over B if a(N − 2) + bN > cN + d(N − 2). This means a large population selects the risk dominant equilibrium in the long run. They analyze a process which is stochastic in the generation of mutants, but deterministic in following the gradient of selection. Their calculation assumes asymptotically small mutation rates, but holds for a wide range of evolutionary processes, in contrast to previous approaches that make specific assumptions on the source of noise (Foster and Young, 1990; Fudenberg and Harris, 1992). However, since the dynamics follows the gradient of selection, the processes can only leave a Nash equilibrium if a sufficient number of mutations occurs simultaneously. In large populations, it is very unlikely that this happens, and the time until the population moves from one equilibrium to another becomes exponentially large in N.

If the dynamics is stochastic, a single mutation can be sufficient to leave a Nash equilibrium. Nowak et al. (2004) have studied the competition of two strategies A and B in a frequency dependent Moran process (and similar processes) which model fully stochastic evolutionary dynamics. This means that the selection steps are stochastic. They have shown that the fixation probability of A is greater than that of B if and only if a(N − 2)+bN > cN +d(N − 2). This calculation assumes weak selection in a process with selection only. The result is also valid for stronger intensities of selection, if we use the pairwise comparison process (Szabó and Tőke, 1998; Traulsen et al., 2007) or a slightly modified version of the Moran process for evolutionary updating (Traulsen et al., 2008).

Here, we extend these results to both arbitrary mutation rates and arbitrary intensities of selection. We show that A is more abundant than B in the mutation-selection distribution for a wide range of stochastic processes for any mutation rate and any intensity of selection if and only if a(N − 2) + bN > cN + d(N − 2). Higher mutation rates seem to be very relevant for social evolution, because humans (and higher animals) tend to try new strategies fairly often. We therefore expect that evolutionary dynamics in the cultural learning and imitation context operate under fairly high mutation (=‘exploration’) rates. We also show that the condition holds for any payoff matrix, not only for coordination games.

2 Pairwise comparison process

The payoffs of A and B in a population of j individuals of type A and Nj individuals of type B are

πA(j)=j1N1a+NjN1bπB(j)=jN1c+Nj1N1d. (2)

Each individual interacts with N − 1 other individuals. The payoff difference Δπ(j) is a linear function of j

Δπ(j)=πA(j)πB(j)=abc+dN1j+a+bNdN+dN1. (3)

First, we need to specify how strategies spread in the population, depending on their payoffs. We adopt the following process: a random (focal) individual i is selected. It compares its payoff πi to the payoff πj of a randomly chosen role model, individual j, and takes over the strategy of that individual with probability [1 + eβ(πiπj)]−1 (Blume, 1993; Szabó and Tőke, 1998; Traulsen et al., 2007; Sandholm, 2007). This process occurs with probability 1−μ. With probability μ, a mutation occurs and the focal individual produces an offspring with random strategy, A or B. The quantity β determines the intensity of selection (0 ≤ β < ∞). Strong selection (large β) means that the probability to adopt a better strategy approaches one, and the probability to adopt an inferior strategy vanishes. For weak selection (small β), these two probabilities are close to 1/2. The transition probabilities to increase or decrease the number of A players by one, are given by

Tj+=jNNjN1μ1+eβΔπ(j)+NjNμ2Tj=jNNjN1μ1+e+βΔπ(j)+jNμ2. (4)

With these transition probabilities, we can determine whether A is more abundant than B or vice versa. Before discussing the problem for general mutation rates we first consider the special case of low mutation rates, because of its importance in previous papers (Kandori et al., 1993; Nowak et al., 2004; Taylor et al., 2004; Imhof and Fudenberg, 2006). We will also present a simple approximation for very high mutation rates, which illustrates the dynamics in this extreme limit.

2.1 Low mutation rates

First, we consider low mutation rates, μN−2. In this limit, a mutation reaches extinction or fixation before a second mutation arises. Thus, the important quantity is the probability that a single mutant takes over the population in a process without mutations (Nowak et al., 2004; Taylor et al., 2004; Imhof and Fudenberg, 2006; Antal and Scheuring, 2006). The ratio of the fixation probability ρA of a single A player and that of a single B player can be written as (Karlin and Taylor, 1975; Nowak, 2006)

ρBρA=j=1N1TjTj+. (5)

Without mutations, μ = 0, the ratio of the transition probabilities (4), simplifies to Tj/Tj+=eβΔπ(j) (Traulsen et al., 2006, 2007). Thus, ρB/ρA reduces to

ρBρA=exp[βj=1N1Δπ(j)]=exp[β(a+bcd2Na+d)]. (6)

Therefore, ρA = ρB is equivalent to

(a+bcd)N=2a2d. (7)

From the payoff matrix (2) it is clear, that strategy A is favored whenever a or b increases, or c or d decreases. Therefore, A is more abundant than B if

a(N2)+bN>cN+d(N2). (8)

This condition has been derived previously in the low mutation limit for different evolutionary processes (Kandori et al., 1993; Nowak, 2006; Traulsen et al., 2008). The strategy with the higher fixation probability is the more abundant one, because more of its mutants reach fixation. Thus for low mutation rates, A has a higher abundance than B whenever condition (8) is fulfilled.

For weak selection, a wide range of processes fulfills the 1/3-rule, which states that the fixation probability of a single A mutant in a coordination game is larger than 1/N if a(N − 2) + b(2N − 1) > c(N + 1) + d(2N − 4) (Nowak et al., 2004; Imhof and Nowak, 2006; Lessard and Ladret, 2007; Bomze and Pawlowitsch, 2008). The fixation probability for large N can thus be written as

ρA=1N+ω(13dbabc+d), (9)

where ω is a small positive number that may depend on the payoffs and on N. Writing down the analogous equation for ρB and subtracting yields

ρAρB=ω(acdcb+adbabc+d)a+bcd. (10)

Thus, ρA > ρB is equivalent to a + b > c + d, which is identical to our condition (8) for large N. Lessard and Ladret (2007) have shown that any process within the domain of Kingman’s coalescent (Kingman, 1982) fulfills the 1/3-rule (for large N and weak selection and mutation). If an evolutionary process fulfills the 1/3-rule, ρA and ρB can be written in the form of (9). Hence we find again that ρA > ρB is equivalent to a + b > c + d. Thus, we conclude that our result is valid for any process within the domain of Kingman’s coalescent for large N and weak selection and mutation.

2.2 High mutation rates

For high mutation rates μ →1, mutations dominate the process, and drive the system towards equal abundance of A and B, that is close to j = N/2. Hence the relevant transition probabilities are TN/2+ and TN/2, and it is plausible to assume that TN/2+>TN/2 implies that A is more abundant than B in the stationary state. From (4) we obtain

TN/2+TN/2=1μ4tanh(β2Δπ(N/2)), (11)

where the payoff difference is given by

Δπ(N/2)=(a+bcd)N2(ad)2(N1). (12)

Since Δπ (N/2) > 0 implies TN/2+TN/2>0, it also implies a more abundant A, for arbitrary intensity of selection β. Thus, we have again the same condition a(N − 2) + bN > cN + d(N − 2) for the dominance of A.

2.3 Arbitrary mutation rates

So far we have shown that both for low and high mutation rates the same condition determines whether A or B is more abundant, regardless of the intensity of selection. Now, we turn to general mutation rates. Whenever equation (7) holds, the payoff difference (3) has the following symmetry property

Δπ(j)=Δπ(Nj). (13)

This antisymmetry in the payoff differences then implies the following symmetry in our transition probabilities (4),

Tj+=TNj. (14)

For symmetric transition probabilities, however, the stationary probabilities pj of having j number of A players, are also symmetric, pj = pNj. This can be shown by first writing the stationary probability distribution explicitly (Kampen, 1997; Claussen and Traulsen, 2005)

pj=p0i=0j1Ti+Ti+1 (15)

where p0 follows from the normalization j=0Npj=1. For symmetric transition probabilities (14) and for j < Nj we can write

pNjpj=i=jNj1Ti+Ti+1=i=jNj1Ti+TNi1+=1. (16)

Hence the distribution is symmetric pj = pNj.

For such a symmetric distribution, the number of A and B players are of course the same, since

j=j=0Njpj=j=0NjpNj=j=0N(Nj)pj=Nj. (17)

Hence we have demonstrated that when (7) is fulfilled, then the average abundance of A players and B players are the same.

On the other hand, when condition (8) is fulfilled, then the average abundance of A players is higher than the average abundance of B players. This is quite obvious by looking at the payoff matrix (1): increasing a or b favors A, while increasing c or d favors B. More formally, we could say that from (2), (3) and (4) it is clear that Tj+ and Tj are monotone increasing functions of a and b, and decreasing functions of c and d. This implies condition (8). Interestingly, other features of the game do not matter, e.g. whether a single A player has a larger disadvantage in a B population than a single B player in an A population.

Condition (8) can also be written in the form

a(12N)+b>c+d(12N), (18)

which highlights the fact that there is a 1/N correction compared to simple risk dominance (a + b > c + d), which is the N → ∞ limit. This implies that a strategy can be more abundant than the other strategy for large N, but less abundant for small N. For example for the payoff matrix (103111), strategy A is more abundant in large populations (N ≥19), but B is more abundant in small populations (N ≤17). They are equally abundant for N = 18. In general, there is a threshold population size N*= 2(ad)/(ad+bc). If N*> 2, then A is more abundant than B either for N > N* or for N < N*.

2.4 Including self-interactions

So far, we have adopted the usual convention that individuals cannot interact with themselves. If instead we allow individuals to derive a payoff from self-interaction, we obtain

fA(j)=jNa+NjNbfB(j)=jNc+NjNd. (19)

Hence, the payoff difference is Δπ (j) = (abc + d) j/N + bd. Now for all mutation rates and all intensities of selection the condition for the average abundance of A to exceed the average abundance of B is simple risk dominance:

a+b>c+d. (20)

We conclude that the finite N correction to risk dominance in (18) results from the exclusion of self-interactions.

3 Frequency dependent Moran process

Although condition (8) is valid for a large class of evolutionary processes (see Sec. 4), it does not always hold if we depart from weak selection. As an example we discuss the frequency dependent Moran process (Nowak et al., 2004; Taylor et al., 2004; Antal and Scheuring, 2006). As we shall see, for this model our condition (8) is only valid in the limit of weak selection.

Let the fitness be a convex combination of a background fitness (which we set to 1) and the payoff, fA(j) = 1 − ω + ωπA(j) and fB(j) = 1 − ω + ωπB(j). Here, ω is the intensity of selection (0 ≤ω ≤1 for payoff matrices with positive entries). An individual is selected for reproduction at random, but proportional to fitness. The selected individual produces an offspring, which replaces a randomly chosen individual. Mutation can occur during reproduction. This leads to the transition probabilities

Tj+=jfA(j)F(j)NjN(1μ)+(Nj)fB(j)F(j)NjNμTj=(Nj)fB(j)F(j)jN(1μ)+jfA(j)F(j)jNμ. (21)

Here F(j) = jfA(j) + (Nj)fB(j) is the total fitness of the whole population. These transition probabilities depend on the fitness values only through the ratio fA(j)=fB(j). Yet, there is no simple condition for the equilibrium abundance of A players. In the weak selection limit, ω ≪ 1, however, only the payoff differences enter into the transition probabilities (21),

fA(j)fB(j)=1+wΔπ(j)+O(w2). (22)

Thus, for weak selection, condition (8) again ensures the higher abundance of A.

We can also consider a variant of the frequency dependent Moran process where fitness is an exponential function of payoff, fA(j) = exp[+ωπA(j)] and fB(j) = exp[+ωπB(j)] (Traulsen et al., 2008). In this case, the transition probabilities again only depend on the payoff differences, because fA(j)/fB(j) = exp(ωΔπ (j)). Hence, the condition for abundance is again (8) for any intensity of selection and any mutation rate.

4 General birth death processes

Here, we show that our finding holds for a wide range of birth-death processes. We need two requirements to be fulfilled. The first is that the payoffs received from other players should be additive. In this case the payoffs πA(j), πB(j), and also the payoff difference Δπ (j) are linear functions of j. Consequently,

Δπ(j)+Δπ(Nj)=λ(a,b,c,d,N) (23)

is independent of j. Hence, when λ(a, b, c, d, N) = 0 holds, we have an antisymmetric payoff function

Δπ(j)=Δπ(Nj). (24)

The second requirement is that the difference between the two types of players should manifest itself in the transition probabilities Tj+ and Tj, only through the the payoff difference Δπ (j). Hence, we can write Tj=Tj[Δπ(j),μ], and the probability of decreasing the number of mutants is

Tj=Tj[Δπ(j),μ]=TNj[Δπ(j),μ]. (25)

That is, the only difference between the two types is the change of sign of the payoff difference Δπ (j). Otherwise, the transition probabilities are the same, but with the number of the opposite type of individuals Nj. Now, simply setting the index to Nj in (25), we have

TNj=Tj+[Δπ(Nj),μ]=Tj+[Δπ(j),μ]=Tj+. (26)

For the second equality we have used (24). These symmetric transition probabilities then imply an equal abundance of the two types of players in any birth-death process, as it has been shown in Section 2.3. The condition for a higher abundance of A compared to B is then λ (a, b, c, d, N) > 0. This general condition takes the form of (8) or (20) in our examples when self-interaction are excluded or included, respectively. Note that similar arguments appear in (Claussen, 2007).

5 Discussion

In this paper, we have shown that a(N − 2) + bN > cN + d(N − 2) is the crucial condition for A to be favored over B in wide range of evolutionary processes for any mutation rate, any intensity of selection and any finite population size. By A being favored over B we mean that A is more abundant than B in the mutation-selection equilibrium of the stochastic process.

The relative abundance in the stationary state is a natural way of comparing two strategies. In the low mutation limit, when the population is typically homogeneous, a strategy being more abundant is equivalent of having a larger fixation probability. Hence, our study of abundance can be viewed as a generalization of that concept to arbitrary mutation rates. This is of special interest for cultural dynamics, where mutation rates are not necessarily small.

In particular, we have generalized the famous result of Kandori et al. (1993) to any mutation rate. We have generalized the result of Nowak et al. (2004) to any intensity of selection. Our results are valid for any mutation rate and any intensity of selection.

It turns out that for our result to hold, a birth-death process has to fulfill two requirements: (i) additive payoffs, and (ii) that that the evolutionary dynamics depend only on the payoff differences (the players are identical otherwise). These requirements hold, for example, for the pairwise comparison process described by Traulsen et al. (2006) and for a frequency dependent Moran process with exponential fitness function (Traulsen et al, 2008). For the standard frequency dependent Moran process with linear fitness function these requirements only hold in the limit of weak selection.

Finally, we note that in coordination games with strong selection both the fixation probability of A and the fixation probability of B are very small. For small mutation rates, the system will stay in one equilibrium for a very long time. Therefore, it can take a very long time for the system to obtain a representative sample of the stationary mutation-selection distribution.

Acknowledgments

We are grateful for financial support from NIH Grant R01GM078986, The John Templeton Foundation, Jeffrey Epstein (T.A. and M.A.N.) and the Emmy-Noether program of the DFG (A.T.).

References

  1. Antal T, Scheuring I. Fixation of strategies for an evolutionary game in finite populations. Bull Math Biol. 2006;68:1923–1944. doi: 10.1007/s11538-006-9061-4. [DOI] [PubMed] [Google Scholar]
  2. Blume LE. The statistical mechanics of strategic interaction. Games and Economic Behavior. 1993;4:387–424. [Google Scholar]
  3. Bomze I, Pawlowitsch C. One-third rules with equality: Second-order evolutionary stability conditions in finite populations. J Theor Biol. 2008;254:616–620. doi: 10.1016/j.jtbi.2008.06.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Claussen JC, Traulsen A. Non-gaussian fluctuations arising from finite populations: Exact results for the evolutionary moran process. Phys Rev E. 2005;71:025101(R). doi: 10.1103/PhysRevE.71.025101. [DOI] [PubMed] [Google Scholar]
  5. Claussen JC. Drift reversal in asymmetric coevolutionary conflicts: influence of microscopic processes and population size. Eur Phys J B. 2007;60:391–399. [Google Scholar]
  6. Foster D, Young P. Stochastic evolutionary game dynamics. Theo Pop Biol. 1990;38:219–232. [Google Scholar]
  7. Fudenberg D, Harris C. Evolutionary dynamics with aggregate shocks. J Econ Theory. 1992;57:420–441. [Google Scholar]
  8. Fudenberg D, Tirole J. Game Theory. Vol. 6. MIT Press; Cambridge: 1998. [Google Scholar]
  9. Harsanyi JC, Selten R. A general theory of equilibrium selection in games. MIT Press; Cambridge: 1988. [Google Scholar]
  10. Hofbauer J, Sigmund K. Evolutionary Games and Population Dynamics. Cambridge University Press; Cambridge: 1998. [Google Scholar]
  11. Hofbauer J, Sigmund K. Evolutionary game dynamics. Bull Am Math Soc. 2003;40:479–519. [Google Scholar]
  12. Imhof LA, Fudenberg D. Imitation process with small mutations. Journal of Economic Theory. 2006;131:251–262. [Google Scholar]
  13. Imhof LA, Nowak MA. Evolutionary game dynamics in a Wright Fisher process. J Math Biol. 2006;52:667–681. doi: 10.1007/s00285-005-0369-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Kampen NGv. Stochastic Processes in Physics and Chemistry. 2. Elsevier; Amsterdam: 1997. [Google Scholar]
  15. Kandori M, Mailath GJ, Rob R. Learning, mutation, and long run equilibria in games. Econometrica. 1993;61:29–56. [Google Scholar]
  16. Karlin S, Taylor HMA. A first course in stochastic processes. 2. Academic; London: 1975. [Google Scholar]
  17. Kimura M. DNA and the neutral theory. Phil Trans Roy Soc London B. 1968;312:343–354. doi: 10.1098/rstb.1986.0012. [DOI] [PubMed] [Google Scholar]
  18. Kingman JFC. The coalescent. Stochastic Processes and Their Applications. 1982;13:235–248. [Google Scholar]
  19. Lessard S, Ladret V. The probability of fixation of a single mutant in an exchangeable selection model. J Math Biol. 2007;54:721–744. doi: 10.1007/s00285-007-0069-7. [DOI] [PubMed] [Google Scholar]
  20. Maynard Smith J. Evolution and the Theory of Games. Cambridge University Press; Cambridge: 1982. [Google Scholar]
  21. Maynard Smith J, Price GR. The logic of animal conflict. Nature. 1973;246:15–18. [Google Scholar]
  22. Nowak MA. Evolutionary Dynamics. Harvard University Press; Cambridge, MA: 2006. [Google Scholar]
  23. Nowak MA, Sasaki A, Taylor C, Fudenberg D. Emergence of cooperation and evolutionary stability in finite populations. Nature. 2004;428:646–650. doi: 10.1038/nature02414. [DOI] [PubMed] [Google Scholar]
  24. Nowak MA, Sigmund K. Evolutionary dynamics of biological games. Science. 2004;303:793–799. doi: 10.1126/science.1093411. [DOI] [PubMed] [Google Scholar]
  25. Samuelson L. Evolutionary Games and Equilibrium Selection. MIT Press; Cambridge, MA: 1997. [Google Scholar]
  26. Sandholm WH. Population games and evolutionary dynamics. MIT Press; Cambridge, MA: 2007. [Google Scholar]
  27. Szabó G, Fáth G. Evolutionary games on graphs. Physics Reports. 2007;446:97–216. [Google Scholar]
  28. Szabó G, Tőke C. Evolutionary Prisoner’s Dilemma game on a square lattice. Phys Rev E. 1998;58:69. [Google Scholar]
  29. Taylor C, Fudenberg D, Sasaki A, Nowak MA. Evolutionary game dynamics in finite populations. Bull Math Biol. 2004;66:1621–1644. doi: 10.1016/j.bulm.2004.03.004. [DOI] [PubMed] [Google Scholar]
  30. Traulsen A, Nowak MA, Pacheco JM. Stochastic dynamics of invasion and fixation. Phys Rev E. 2006;74:11909. doi: 10.1103/PhysRevE.74.011909. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Traulsen A, Pacheco JM, Nowak MA. Pairwise comparison and selection temperature in evolutionary game dynamics. J Theor Biol. 2007;246:522–529. doi: 10.1016/j.jtbi.2007.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Traulsen A, Shoresh N, Nowak MA. Analytical results for individual and group selection of any intensity. Bull Math Biol. 2008;63:363. doi: 10.1007/s11538-008-9305-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Weibull J. Evolutionary Game Theory. MIT Press; Cambridge: 1995. [Google Scholar]

RESOURCES