Skip to main content
Springer logoLink to Springer
. 2014 Mar 8;70:465–484. doi: 10.1007/s00285-014-0770-2

Evolutionary dynamics in finite populations with zealots

Yohei Nakajima 1, Naoki Masuda 1,
PMCID: PMC4289535  PMID: 24610380

Abstract

We investigate evolutionary dynamics of two-strategy matrix games with zealots in finite populations. Zealots are assumed to take either strategy regardless of the fitness. When the strategy selected by the zealots is the same, the fixation of the strategy selected by the zealots is a trivial outcome. We study fixation time in this scenario. We show that the fixation time is divided into three main regimes, in one of which the fixation time is short, and in the other two the fixation time is exponentially long in terms of the population size. Different from the case without zealots, there is a threshold selection intensity below which the fixation is fast for an arbitrary payoff matrix. We illustrate our results with examples of various social dilemma games.

Mathematics Subject Classification: 91A22, 60J70

Introduction

A standard assumption underlying evolutionary game dynamics, regardless of whether a player is social agent or gene, is that players tend to imitate successful others. In actual social evolutionary dynamics, however, there may be zealous players that stick to one option according to their idiosyncratic preferences regardless of the payoff that they or their peers earn. Collective social dynamics in the presence of zealots started to be examined for non-game situations such as the voter model representing competition between two equally strong opinions (i.e., neutral invasions) (Mobilia 2003; Galam and Jacobs 2007; Mobilia et al. 2007; Xie et al. 2011; Singh et al. 2012). Zealots seem to be also relevant in evolutionary game dynamics. For example, voluntary immunization behavior of individuals when epidemic spreading possibly occurs in a population can be examined by a public-goods dilemma game (Fu et al. 2011). In this situation, some individuals may behave as zealot such that they try to immunize themselves regardless of the cost of immunization (Liu et al. 2012).

In our previous work, we examined evolutionary dynamics of the prisoner’s dilemma and snowdrift games in infinite populations with zealots (Masuda 2012). Specifically, we assumed zealous cooperators and asked the degree to which the zealous cooperators facilitate cooperation in the entire population. We showed that cooperation prevails if the temptation of unilateral defection is weak or the selection strength is weak. For the prisoner’s dilemma, we analytically obtained the condition of cooperation.

In the present paper, we conduct a finite population analysis of evolutionary dynamics of a general two-person game with zealots. Evolutionary games in finite populations have been recognized as a powerful analytical tool for understanding properties of evolutionary games such as conditions of cooperation in social dilemma games. In addition, the outcome for finite populations is often different from that for infinite populations (Nowak et al. 2004; Taylor et al. 2004; Nowak 2006). We take advantage of this method to understand evolutionary dynamics of games with zealots for general matrix games.

It should be noted that the fixation probability, i.e., the probability that a given strategy eventually dominates the population as a result of stochastic evolutionary dynamics, is a primary quantity to be pursued in evolutionary dynamics in finite populations. Nevertheless, fixation trivially occurs in the presence of zealots if all zealots are assumed to take the same strategy; the zealots’ strategy always fixates. For example, if there is a single zealous cooperator in the population, cooperation always fixates even in the conventional prisoner’s dilemma game. However, in this adverse case, fixation of cooperation is expected to take long time; the relevant question here is the fixation time (Antal and Scheuring 2006; Traulsen et al. 2007; Altrock and Traulsen 2009; Altrock et al. 2010; Assaf and Mobilia 2010; Ewens 2010; Wu et al. 2010; Altrock et al. 2011; Assaf and Mobilia 2012; Kreindler and Young 2013). Here we examine the mean fixation time of the strategy selected by the zealots. This quantity serves as a probe to understand the extent to which zealots influence non-zealous players in the population. The fixation time would be affected by the payoff matrix, population size, number of zealous players, and strength of selection. We derive the asymptotic dependence of the mean fixation time on the population size when the fraction of zealots in the population is fixed. Mathematically, we extend the approach taken in Antal and Scheuring (2006) to the case with zealots.

Model

We assume a well-mixed population of N+M players under evolutionary dynamics defined as follows. In each discrete time unit, each player selects either of the two strategies A or B. Each player plays a symmetric two-person game with all the other N+M-1 players in a unit time. The payoff matrix of the single game for the row player is given by

graphic file with name 285_2014_770_Equ1_HTML.gif 1

The fitness of a player on which the selection pressure operates is defined as the payoff summed over the N+M-1 opponents.

We assume that N players may flip the strategy according to the Moran process (Moran 1958; Ewens 2010). We call these players the ordinary players. The other M players are zealots that never change the strategy irrespectively of their fitness. Because our primary interest is in the possibility of cooperation in social dilemma games induced by zealous cooperators, we assume that all zealots take strategy A; A is identified with cooperation in the case of a social dilemma game. We also assume that a,b,c,d0 for the Moran process to be well-defined.

Because we have assumed a well-mixed population, the state of the evolutionary process is specified by the number of ordinary players selecting A, which we denote by i. In each time step, we select an ordinary player with the equal probability 1/N. The strategy of the selected player is updated. Then, we select a player, called the parent, whose strategy replaces that of the previously selected player. The parent is selected with the probability proportional to the fitness among the N+M players including the zealots and the player whose strategy is to be replaced. The population size N is constant over time. It should be noted that a player is updated once on average in time N.

Because the zealots always select A, the Moran process ends up with the unanimous population of A players (we impose a>0 for this to be true). In other words, fixation of A always occurs such that the issue of fixation probability is irrelevant to our model.

Results

We calculate the mean fixation time and its approximation in the case of a large population size by extending the framework developed in Antal and Scheuring (2006) (also see Van Kampen 2007; Redner 2001; Krapivsky et al. 2010; Ewens 2010).

Mean fixation time: exact solution

Consider the state of the population in which i (0iN) ordinary players select strategy A. A total of i+M and N-i players, including the zealots, select strategies A and B, respectively. The Moran process is equivalent to a random walk on the i space in which i=0 is a reflecting boundary, and i=N is the unique absorbing boundary.

The fitness of an A and B player is given by

fi=(i+M-1)a+(N-i)bN+M-1 2

and

gi=(i+M)c+(N-i-1)dN+M-1, 3

respectively. In a single time step, i increases by one, does not change, or decreases by one. We denote by Ti+ and Ti- the probabilities that i shifts to i+1 and i-1, respectively. These probabilities are given by

Ti+=N-iN(i+M)fi(i+M)fi+(N-i)gi 4

and

Ti-=iN(N-i)gi(i+M)fi+(N-i)gi. 5

We denote by ti the mean fixation time when there are initially i ordinary players with strategy A. As shown in Ewens (2010), pp. 86–91 (see Appendix A for a full derivation), we obtain

ti=j=iN-1qjk=0j1Tk+qk, 6

where

qk=j=1kTj-Tj+. 7

In Eq. (7), we interpret q0=1.

Deterministic approximation of the random walk

In this section we classify the deterministic dynamics driven by the expected bias of the random walk (i.e., Ti+-Ti-) into three cases, as is done in the analysis of populations without zealots (Taylor et al. 2004; Antal and Scheuring 2006). The obtained classification determines the dependence of the mean fixation time on N, as we will show in Sect. 3.3.

We first identify the equilibrium points of the deterministic dynamics, i.e., i satisfying Ti+=Ti-. Equations (4) and (5) indicate that i=N always yields Ti+=Ti-=0, corresponding to the fact that i=N is the unique absorbing state. Other equilibria are derived from

(i+M)(i+M-1)a+(N-i)b-i(i+M)c+(N-i-1)d=0. 8

We set yi/N (0y<1), mM/N, and ignore O(N-1) terms in Eq. (8) to obtain

f(y)(a-b-c+d)y2+2ma+(1-m)b-mc-dy+m2a+mb=0. 9

We define

y~=-12(a-b-c+d)2ma+1-mb-mc-d, 10
D=2ma+(1-m)b-mc-d2-4m(ma+b)(a-b-c+d), 11
y1=y~-D2(a-b-c+d)(a-b-c+d0),-m(ma+b)2ma+(1-m)b-mc-d(a-b-c+d=0), 12

and

y2=y~+D2(a-b-c+d). 13

We will use y2 only when a-b-c+d>0. In the continuous state limit, the deterministic dynamics driven by Ti+-Ti- is classified into the following three cases, as summarized in Table 1. The derivation is shown in Appendix B.

  • Case (i)
    : f(y)>0 holds true for all y (0y1) such that the dynamics starting from any initial condition tends to y=1 (Fig. 1a). In an infinite population, A dominates B. In a finite population, we expect that the fixation time is short. This case occurs when c<(m+1)a and one of the following conditions is satisfied:
    • a-b-c+d0.
    • a-b-c+d>0 and y10 (i.e., 2ma+(-m+1)b-mc-d0).
    • a-b-c+d>0, 0<y1<1 (i.e., 2ma+(-m+1)b-mc-d<0 and -(2m+2)a+(m+1)b+(m+2)c-d<0), and D0.
    • a-b-c+d>0 and y11 (i.e., -(2m+2)a+(m+1)b+(m+2)c-d0).
  • Case(ii)

    : f(y)=0 has a unique solution y1 (0<y1<1) such that the dynamics starting from any initial condition converges to y1 (Fig. 1b). In an infinite population, A and B coexist. In a finite population, we expect that the fixation time is long. This case occurs when c>(m+1)a.

  • Case (iii)
    : f(y)=0 has two solutions 0<y1<y2<1. Dynamics starting from 0y<y2 converges to y1, and that starting from y2<y<1 converges to y=1 (Fig. 1c). In an infinite population, a mixture of A and B and the pure A configuration are bistable. In a finite population, we expect that the fixation time is long if the dynamics starts with 0y<y2 and short if it starts with y2<y<1. This case occurs when
    c<(m+1)a, 14
    a-b-c+d>0, 15
    0<y~<1, 16
    and
    D>0 17
    are satisfied.

Table 1.

Classification of the three cases of the mean fixation time when N is large

a-b-c+d0 a-b-c+d>0
c<(m+1)a Case (i) Case (i) or (iii)
c>(m+1)a Case (ii) Case (ii)

Fig. 1.

Fig. 1

Schematic classification of the deterministic dynamics driven by Ti+-Ti-. ac Populations with zealots. dg Populations without zealots. Filled and open circles represent stable and unstable equilibria, respectively. Filled squares represent the absorbing boundary condition. It should be noted that we identify y=i/N

The condition given by Eq. (14) is related to the so-called cooperation facilitator assumed in a previous model (Mobilia 2012) as follows. Consider a hypothetical infinite population in which almost all players select A, i.e., y1. Then, the payoff that a player with strategy A gains by being matched with the other ordinary players and zealous players is equal to (m+1)a. The payoff that a player with strategy B gains by being matched with the other ordinary players, but not zealous players, is equal to c. Therefore, Eq. (14) represents the condition for the stability of the homogeneous population of strategy A against invasion by B when zealous players somehow contribute to the payoff of ordinary A players and not to that of ordinary B players. Such a zealous player is equivalent to the cooperation facilitator assumed in  Mobilia (2012).

In the corresponding model without zealots, there are four scenarios: A dominates B (Fig. 1d), B dominates A (Fig. 1e), a mixture of A and B is stable (Fig. 1f), and A and B are bistable (Fig. 1g) (Antal and Scheuring 2006). The cases shown in Fig. 1d, f, and g are analogous to cases (i), (ii), and (iii), respectively, for the game with zealots. The case shown in Fig. 1e never occurs in the game with zealots because y tends to increase in the absence of A owing to the fact that unanimity of B among the ordinary players is a reflecting boundary of our model. In fact, this case corresponds to case (ii) for the presence of zealots (Fig. 1b). If we set m0, we obtain case (i) when a-c>0 and b-d>0, case (ii) when a-c<0, and case (iii) when a-c>0 and b-d<0. As is consistent with Antal and Scheuring (2006), the classification depends only on the a-c and b-d values. However, the scenario in which B dominates A (Fig. 1e) does not happen even with the vanishing density of zealots (i.e., m0) because the unanimity of B remains to be a reflecting boundary as long as there is at least one zealot.

Mean fixation time: large N limit

In this section, we analyze the order of the mean fixation time in terms of N when N is large. We assume that the fraction of zealots in the population, i.e., m=M/N, is fixed. Because the mean fixation time is by definition the largest for i=0, i.e., the initial condition in which all ordinary players select B, we focus on t0. To evaluate t0, we rewrite Eq. (6) for i=0 as

t0=k=0N-11Tk+qkj=kN-1qj. 18

Case (i)

We obtain

Ti-Ti+=1-Nf(i/N)(i+M)[(i+M-1)a+(N-i)b], 19

where f(y) (0y<1) is given by Eq. (9). In case (i), f(y)>0 holds true. Therefore,

Ti-Ti+sup0y<11-f(y)(y+m)[(y+m)a+(1-y)b]ε<1 20

is satisfied for 0iN-1. By using Eq. (20), we obtain

1qkj=kN-1qj=j=kN-1=k+1jT-T+j=kN-1ε(j-k-1)1ε11-ε. 21

Because the left-hand side of Eq. (21) is at least unity, we obtain

t0k=0N-11Tk+. 22

The substitution of y=i/N and m=M/N in Eq. (4) yields

1Ti+=11-y+(y+m)c+(1-y)d(y+m)(y+m)a+(1-y)b. 23

In particular, we obtain

1Ti+11-y(y1). 24

Equation (24) implies that

t0NlnN. 25

This result coincides with the previous result for the absence of zealots (Antal and Scheuring 2006).

Case (ii)

In Case (ii), Ti+-Ti->0 for 0i<Ny1 and Ti+-Ti-<0 for Ny1<i<N. Therefore, qi takes the minimum at iNy1. We denote the value of i that satisfies i<Ny1 and qiqN-1 by i. Such an i exists if q0qN-1. If q0<qN-1, we regard that i=0. Using the relationship qi=q~(i/N)N for a function q~(y) (0y<1) (Antal and Scheuring 2006) (also see Appendix C), we obtain

t0=k=0N-11Tk+qkj=kN-1qjk=0N-11Tk+qkmax{qk,qN-1}k=iN-1qN-1qkNq~(1)q~(y)N, 26

where

q~(y)=min0y<1q~(y). 27

To derive the last line in Eq. (26), we used the steepest descent method (Antal and Scheuring 2006) (also see Appendix D).

Equations (23) and (24) imply that 1/Tk+ in Eq. (26) is safely ignored near the singularity at y1 because it would contribute at most NlnN to the fixation time. Therefore, we obtain

t0Nexp(γN), 28

where γ>0 is a constant that depends on a,b,c,d, and m. The dependence of γ on m is shown in Fig. 2 for sample payoff matrices for the prisoner’s dilemma game (solid line) and snowdrift game (dotted line). For both games, γ monotonically decreases with m, implying that the fixation time decreases with m. In particular, γ is equal to zero, which corresponds to t0NlnN, when m is larger than a threshold value.

Fig. 2.

Fig. 2

The exponent γ for the mean fixation time [Eq. (28)] plotted against the density of zealots m for the prisoner’s dilemma game with a=1, b=0, c=1.2, and d=0 (solid line) and the snowdrift game with a=β-0.5, b=β, c=β-1, d=0, with β=1.5 (dotted line). We calculated γ on the basis of Eqs. (26), (27), and (55)

Case (iii)

In this case, qi takes a local minimum at i=Ny1 and a local maximum at i=Ny2. Therefore, behavior of the random walk in the range 0i<Ny2 is qualitatively the same as that for case (ii), and that in the range Ny2<i<N is qualitatively the same as that for case (i). Because the former part makes the dominant contribution to the fixation time, the scaling of the mean fixation time is given by Eq. (28).

Case (iii) occurs when strategy A is disadvantageous when it is rare and advantageous when it is frequent. The coordination game provides such an example (Sect. 5.4).

Summary and the borderline case

In summary, the mean fixation time in the limit of large N is given by t0NlnN in case (i) and t0Nexp(γN) (γ>0) in cases (ii) and (iii). For the parameter values on the boundary between the two scaling regimes, the same arguments as those for the model without zealots (Antal and Scheuring 2006) lead to t0N3/2.

Dependence of the mean fixation time on the selection strength

We examine the influence of the selection strength, denoted by w, on the mean fixation time. To this end, we redefine the fitness to an A and B player by 1-w+wfi and 1-w+wgi, respectively, where fi and gi are given by Eqs. (2) and (3) (e.g., Nowak et al. 2004; Nowak 2006). Consequently, we replace the payoff matrix given by Eq. (1) by

graphic file with name 285_2014_770_Equ29_HTML.gif 29

Equation (1) is reproduced with w=1.

For sufficiently weak selection, we obtain t0NlnN, i.e., case (i), regardless of the payoff matrix. To prove this statement, we note that, by using the payoff matrix shown in Eq. (29), condition c<(m+1)a in the case of w=1 is generalized to

c<(m+1)a+m1w-1. 30

Therefore, if the original game in the case of w=1 belongs case (ii), i.e., c>(m+1)a, the game belongs to case (i) or (iii) (Table 1) if Eq. (30), or equivalently,

w<w1mc-(m+1)a+m 31

is satisfied. For a fixed payoff matrix, w1 monotonically increases with m, consistent with the intuition that existence of zealots would lessen the fixation time.

Next, the sign of a-b-c+d is not affected by the selection strength. Therefore, we assume a-b-c+d>0 and prove that a condition for case (iii), i.e., Eq. (17), is violated with a sufficiently small w. Because the value of y~ given by Eq. (10) is also unaffected by w, we start with assuming 0<y~<1, which is a necessary condition for case (iii) [Eq. (16); see Appendix B]. The condition D<0 in the case of w=1, where D is defined by Eq. (11), is generalized to

wD-4(1-w)m(m+1)(a-b-c+d)<0. 32

Because the condition imposed on D, which distinguishes cases (i) and (iii), is relevant only for a-b-c+d>0 (Table 1), Eq. (32) is satisfied for an arbitrary payoff matrix if

w<w24m(m+1)(a-b-c+d)D+4m(m+1)(a-b-c+d). 33

Therefore, case (iii) is excluded with a sufficiently small w value.

The threshold value of w below which t0NlnN, which we denote by wc, is given by

wc=min{w1,w2,1}(w1>0,w2>0),min{w1,1}(w1>0,w2<0),min{w2,1}(w1<0,w2>0),1(w1<0,w2<0). 34

We can alternatively introduce the selection strength by replacing Eqs. (2) and (3) to redefine the fitness by

fi=expβ(i+M-1)a+(N-i)bN+M-1 35

and

gi=expβ(i+M)c+(N-i-1)dN+M-1, 36

where β is the selection strength (Traulsen et al. 2008). In Appendix E, we show that qualitatively the same result holds true in the sense that there is a threshold value of β below which the fixation is fast irrespective of the a, b, c, d, and m values. It should be noted that, with Eqs. (35) and (36), a, b, c, and d are allowed to take negative values.

Examples

We compare the mean fixation time for some games with that for the neutral game, i.e., a=b=c=d>0. In the absence of zealots, the neural game yields Ti+=Ti- (1iN-1). The random walk is unbiased, and the so-called mean conditional fixation time is equal to N(N-1) (Antal and Scheuring 2006). The mean conditional fixation time is defined as the mean fixation time starting from state i=1 under the condition that the absorbing state at i=N, not i=0, is reached.

The neutral game in the presence of zealots yields T0+>T0-=0 and Ti+/Ti-=(i+M)/i (1iN-1). Therefore, the random walk is biased toward i=N for all i. More precisely, we obtain

t0=N(N+M)0kiN-1i!(k+M)!k!(i+M)!1(N-i)(i+M). 37

As in Antal and Scheuring (2006), we say that fixation is fast (slow) if t0 is smaller (larger) than the value given by Eq. (37). It should be noted that t0NlnN for the neutral game because it corresponds to w=0<wc.

Constant selection

As a first example, consider the case of frequency-independent selection such that A and B are equipped with fitness r and 1 (under w=1), respectively. When a=b=r and c=d=1, the threshold selection strength below which t0NlnN, i.e., case (i), holds true is given by

wc=m(m+1)(1-r)r1m+1,1r1m+1. 38

If w>wc, case (ii) occurs. Even if A is disadvantageous to B, A fixates fast with the help of zealots regardless of the selection strength if 1/(m+1)<r<1. This condition is more easily satisfied when m is larger.

Prisoner’s dilemma game

Consider the prisoner’s dilemma game with a standard payoff matrix given by a=1, b=0, c=T, and d=0, where T>1. Strategies A and B represent cooperation and defection, respectively. It should be noted that a-b-c+d<0. With a general selection strength, the conditions derived in Sect. 3.2 imply that t0NlnN, i.e., case (i), if T<1+m/w, and t0Nexp(γN) with case (ii) if T>1+m/w. This condition coincides with that for the dominance of cooperators in the case of the infinite population (Masuda 2012).

The mean fixation time with w=1 and m=0.2 obtained by direct calculations of Eq. (18) is shown in Fig. 3a. In this and the following figures, the t0 values are those normalized by that for the neutral game [Eq. (37)]. The behavior of t0 is qualitatively different according to whether T is larger or smaller than 1+m/w=1.2. If T<1.2, the ratio of t0 for the prisoner’s dilemma game to t0 for the neutral game seems to approach a constant as N. This is consistent with case (i). In contrast, if T>1.2, t0 grows rapidly, which is consistent with case (ii). To be more quantitative, 400Nexp(γN) divided by the t0 value for the neutral game is shown by the dashed line in Fig. 3a. It should be noted that 400 is a constant for fitting and that γ value is theoretically determined as described in Sect. 3.3.2. The theory (dashed line) agrees well with the exact numerical results (thinnest solid line). We remark that the normalized t0 behaves non-monotonically in N; it takes a minimum at an intermediate value of N.

Fig. 3.

Fig. 3

The normalized mean fixation time for the prisoner’s dilemma game as a function of N. We set a=1, b=0, c=T, and d=0. In a, we set m=0.2 and w=1. In b, we set T=1.2 and m=0.1. In c, we set T=1.2 and w=1. The dashed lines represent 400Nexp(γN) divided by the t0 value for the neutral game

Next, to examine the effect of the selection strength, we set T=1.2 and m=0.1. The mean fixation time as a function of N and w is shown in Fig. 3b. Equation (34) implies that t0NlnN when w<wc=0.5. Consistent with this result, t0 grows fast as a function of N when w is large (i.e., w=0.7 and 1). In particular, for w=1, 400Nexp(γN) normalized by the t0 value for the neutral game (dashed line in Fig. 3b) agrees well with the exact results (thin solid line). For small w (i.e., w=0.4), t0 seems to scale with NlnN (thick solid line).

Figure 3c shows the dependence of t0 on N for different densities of zealots (i.e., m). It should be noted that the baseline t0 value derived from the neutral game depends on the value of m. Because we set T=1.2 and w=1 in Fig. 3c, the threshold value of m is equal to 0.2. In fact, the normalized t0 diverges according to Nexp(γN) when m=0.1 (dashed line and thick solid line), whereas it seems to converge to a constant value when m=0.3 (thin solid line).

Figure 3 indicates that t0 for the prisoner’s dilemma game is always larger than that for the neutral game (i.e., the normalized t0 is larger than unity). This is consistent with the intuition that cooperation is difficult to attain in the prisoner’s dilemma game as compared to the neutral game.

Finally, consider the symmetrized donation game, which is another standard form of the prisoner’s dilemma game, given by a=b-c, b=-c, c=b, and d=0, where b is the benefit, and c(<b) is the cost. For the Moran process to be well-defined, we require 1-w+wb0, i.e., w<1/(1+c). For this payoff matrix, we obtain

wc=mm-b+(1+m)cbcm+1m,1bcm+1m. 39

Fixation occurs fast for a large benefit-to-cost ratio, large m, or small selection strength.

Snowdrift game

In this section, we examine the snowdrift game (Maynard Smith 1982; Sugden 1986; Hauert and Doebeli 2004) defined by a=β-0.5, b=β-1, c=β, and d=0, where β>1. Strategies A an B are identified as cooperation and defection, respectively. Each player is tempted to defect if the other player cooperates, as in the prisoner’s dilemma game. However, different from the prisoner’s dilemma game, a player is better off by cooperating if the partner defects; mutual defection is the worst outcome. In the infinite well-mixed population without zealots, the game has the unique mixed Nash equilibrium in which the fraction of cooperation is equal to (2β-2)/(2β-1).

Numerical evidence for the replicator dynamics, corresponding to an infinite population, suggests that cooperation is dominant if m is large or w is small (Masuda 2012). For the finite population, we obtain

wc=2m3m-2mβ+1βm+12m,1βm+12m. 40

If w<wc, we obtain t0NlnN, i.e., case (i). If w>wc, we obtain t0Nexp(γN) with case (ii). A large value of β or m makes the fixation time smaller. This result makes sense because a large β generally favors cooperation.

Coordination game

The coordination game given by a=d>0 and b=c=0 has two pure Nash equilibria in the infinite well-mixed population without zealots. For a finite population in the presence of zealots, Eq. (34) yields

wc=8m(m+1)a(-4m2-4m+1)+8m(m+1)0m2-12,12-12m1. 41

If w<wc, we obtain t0NlnN, i.e., case (i). It should be noted that any strength of selection 0w1 yields t0NlnN if there are sufficiently many zealots, similar to the game with constant selection, prisoner’s dilemma game, and snowdrift game. If w>wc, we obtain t0Nexp(γN) with case (iii).

The mean first-passage time from state 0 (i.e., all ordinary players select B) to state i, i.e., j=0i-1σj, is shown in Fig. 4. It should be noted that t0 is equal to this first-passage time to exit i=N. We set N=200, a=d=1, b=c=0, m=0.2, and w=1. Equation (41) implies wc=48/49 for these parameter values. Because w=1>wc, we obtain case (iii).

Fig. 4.

Fig. 4

Mean first-passage time for the coordination game. We set N=200, a=d=1, b=c=0, m=0.2, and w=1

The first-passage time increases slowly as i increases when i is small. It rapidly increases with i for intermediate values of i, Once the random walker passes the critical i value, it feels a positive bias such that the first-passage time only gradually increases with i for large i. The values of i that separate the three regimes are roughly consistent with the analytical estimates y1=0.1 and y2=0.2 [Eqs. (12), (13)]. It should be noted that the first-passage time shows representative behavior of case (iii) although w is only slightly larger than wc.

Discussion

We extended the results for the fixation time under the Moran process (Antal and Scheuring 2006) to the case of a population with zealous players. Similar to the case without zealots (Antal and Scheuring 2006), we identified three regimes in terms of the payoff matrix, number of zealots, and selection strengths. In one regime, the fixation time is small (i.e., NlnN). In the other two regimes, it is large (i.e., Nexp(γN) with γ>0). We illustrated our results with representative games including the prisoner’s dilemma game, snowdrift game, and coordination game.

Zealots have several impacts on evolutionary dynamics in finite populations. First, fixation of one strategy A always occurs with zealots because we assumed that all zealots permanently take A. Second, there is a case in which fixation is fast if the fraction of A players is sufficiently large, whereas fixation is slow if the fraction of A is small. This scenario occurs for the coordination game. In the absence of zealots, the same game shows bistability such that the fixation to the unanimity of A or that of B occurs fast (Antal and Scheuring 2006). Third, for a selection strength smaller than a threshold value, the fixation is fast for any payoff matrix. In the absence of zealots, the dependence of the mean fixation time on N for large N values is completely determined by the signs of a-c and b-d (Antal and Scheuring 2006). Therefore, the scaling of the mean fixation time on N is independent of the selection strength because manipulating the selection strength does not change the sign of the effective a-c or b-d value. If the payoff matrix is given in the slow fixation regime, the fixation is exponentially slow even for a small selection strength. In contrast, in the presence of zealots, slow fixation can be accelerated if we lessen the selection strength.

Mobilia examined the prisoner’s dilemma game with cooperation facilitators (Mobilia 2012). A cooperation facilitator was assumed to cooperate with cooperators and not to play with defectors. The cooperation facilitator and zealous cooperator in the present study are common in that they never change the strategy. However, they are different. First, zealous cooperators are embedded in a well-mixed population such that they myopically cooperate with defectors as well as cooperators. Second, the ordinary players may imitate the zealous cooperator’s strategy (i.e., cooperation). In contrast, players do not imitate the cooperation facilitator’s strategy (i.e., cooperation) in Mobilia’s model. As a consequence, cooperation does not always fixate in his model.

Examination of the case of imperfect zealots, in which zealots change the strategy with a small probability (Masuda 2012), warrants future work.

Acknowledgments

We thank Bin Wu for carefully reading the manuscript. NM acknowledges the support provided through Grants-in-Aid for Scientific Research (No. 23681033) from MEXT, Japan, the Nakajima Foundation, CREST JST, and the Aihara Innovative Mathematical Modelling Project, the Japan Society for the Promotion of Science (JSPS) through the “Funding Program for World-Leading Innovative R&D on Science and Technology (FIRST Program),” initiated by the Council for Science and Technology Policy (CSTP).

Appendix A: Derivation of Eq. (6)

Denote by Pi(t) the probability that the random walker starting from state i at time 0 is absorbed to state N at time t. The normalization is given by t=0Pi(t)=1. It should be noted that PN(0)=1 and PN(t)=0 (t1). The mean fixation time when i ordinary players initially select strategy A is given by

ti=t=0tPi(t). 42

It should be noted that tN=0.

Pi(t) satisfies the recursion relation given by

Pi(t)=Ti-Pi-1(t-1)+(1-Ti--Ti+)Pi(t-1)+Ti+Pi+1(t-1). 43

By multiplying both sides of Eq. (43) by t and taking the summation over t, we obtain

ti=Ti-ti-1+(1-Ti--Ti+)ti+Ti+ti+1+1. 44

In terms of σiti-ti+1, Eq. (44) can be rewritten as

Ti-σi-1-Ti+σi+1=0. 45

The solution of Eq. (45) is given by

σi=σ0qi+qik=1i1Tk+qk, 46

where 0iN-1 and qi is given by Eq. (7).

We set i=0 in Eq. (44) and use T0-=0 to obtain

t0=(1-T0+)t0+T0+t1+1. 47

Therefore, we obtain

σ0=t0-t1=1T0+. 48

Using Eq. (48), we reduce Eq. (46) to

σi=qik=0i1Tk+qk. 49

The mean fixation time is given by

ti=j=iN-1σj+tN=j=iN-1qjk=0j1Tk+qk. 50

Appendix B: Classification of the deterministic dynamics induced by the biased random walk

B.1 When a-b-c+d<0

We obtain d2f(y)/dy2<0 for a-b-c+d<0. Because

f(0)=m2a+mb>0, 51
f(1)=(m+1)2a-(m+1)c, 52

where we used the assumption a>0 in Eq. (51), we distinguish the following two cases. If c<(m+1)a, f(y)>0 is satisfied for 0y1, yielding case (i) in the main text. If c>(m+1)a, a certain y1(0<y1<1) exists such that f(y)>0 for 0y<y1, and f(y)<0 for y1<y1. Therefore, case (ii) occurs.

B.2 When a-b-c+d>0

We obtain d2f(y)/dy2>0 for a-b-c+d>0. In this situation, Eq. (51) holds true.

If f(1)<0, i.e., c>(m+1)a, a certain y1 (0<y1<1) exists such that f(y)>0 for 0y<y1, and f(y)<0 for y1<y1. Therefore, case (ii) occurs.

Suppose that f(1)>0, i.e., c<(m+1)a. To analyze this case, let us write

f(y)=(a-b-c+d)(y-y~)2+m2a+mb-2ma+(1-m)b-mc-d24(a-b-c+d), 53

where

y~=-2ma+(1-m)b-mc-d2(a-b-c+d). 54
  • (i)

    If y~0, i.e., 2ma+(-m+1)b-mc-d0, we obtain f(y)f(0)>0 for y0. Therefore, case (i) occurs.

  • (ii)

    If y~1, i.e., -(2m+2)a+(m+1)b+(m+2)c-d0, then f(y)f(1)>0, yielding case (i).

  • (iii)
    If 0<y~<1, we have the following two subcases:
    1. If D=2ma+(1-m)b-mc-d2-4m(ma+b)(a-b-c+d)>0, f(y)=0 has two solutions 0<y1<y2<1. In the deterministic dynamics driven by the bias Ti+-Ti-, y1 and y2 are stable and unstable, respectively. Therefore, case (iii) occurs.
    2. If D0, we obtain f(y)0 for all 0y<1, where the equality holds true only when D=0 and y=y~. Therefore, case (i) occurs.

B.3 When a-b-c+d=0

The quadratic term in f(y) disappears when a-b-c+d=0. The classification of the dynamics in this case coincides with that for a-b-c+d<0.

Appendix C: Derivation of q~(y)

To derive the relationship qi=q~(i/N)N, we write

qi=expk=1ilnTk-Tk+=expk=1ilnk(k+M)c+(N-k-1)d(k+M)(k+M-1)a+(N-k)bexpN0ylny(y+m)c+(1-y)d(y+m)(y+m)a+(1-y)bdy, 55

where y=i/N and y=k/N. Because the integral on the right-hand side of Eq. (55) is independent of N, we obtain qi=q~(y)N. It should be noted that q~(0)=1 is consistent with q0=1.

Appendix D: Steepest descent method

As done in Antal and Scheuring (2006), we use the steepest descent method to evaluate k=iN-1(qN-1/qk) in Eq. (26) as follows:

k=iN-1qN-1qkk=iN-1q~(1)q~(k/N)NNiN1q~(1)q~(y)Ndy=Nq~(1)q~(y)NiN1exp-lnq~(y)q~(y)1Ndy 56

where q~(y)=min0y<1q~(y). We approximate the integral by a Gaussian integral to obtain

exp-F(y)λdy2πλF(y)exp-F(y)λ 57

with F(y)=lnq~(y)/q~(y) and λ=1/N such that

k=kN-1qN-1qkNq~(1)q~(y)N. 58

Appendix E: Weak selection introduced via an exponential function leads to fast fixation

Assume that the fitness of an A and B player is given by Eqs. (35) and (36), respectively. Then, we obtain

Ti-Ti+=igi(i+M)fi=expβ(i+M)c+(N-i-1)d-(i+M-1)a+(N-i)bN+M-1+lnii+M. 59

If Ti-/Ti+<1, i.e.,

β(i+M)c+(N-i-1)d-(i+M-1)a+(N-i)bN+M-1+lnii+M<0 60

holds true for any i (1iN-1) and N, the fixation occurs fast (i.e., t0NlnN). By substituting y=i/N and m=M/N in Eq. (60) and ignoring O(N-1) terms, we obtain

β(x+m)(c-a)+(1-x)(d-b)1+m<lnx+mx. 61

Because the right-hand side of Eq. (61) is positive, there exists βc>0 such that t0NlnN when 0β<βc. It should be noted that, in contrast to the assumption throughout the present article, a, b, c, and d are allowed to be negative in the present analysis because fi and gi given by Eqs. (35) and (36) are positive irrespective of the a, b, c, and d values.

References

  1. Altrock PM, Traulsen A. Fixation times in evolutionary games under weak selection. New J Phys. 2009;11:013012. doi: 10.1088/1367-2630/11/1/013012. [DOI] [Google Scholar]
  2. Altrock PM, Gokhale CS, Traulsen A. Stochastic slowdown in evolutionary processes. Phys Rev E. 2010;82:011925. doi: 10.1103/PhysRevE.82.011925. [DOI] [PubMed] [Google Scholar]
  3. Altrock PM, Traulsen A, Reed FA. Stability properties of underdominance in finite subdivided populations. PLOS Comput Biol. 2011;7:e1002260. doi: 10.1371/journal.pcbi.1002260. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Antal T, Scheuring I. Fixation of strategies for an evolutionary game in finite populations. Bull Math Biol. 2006;68:1923–1944. doi: 10.1007/s11538-006-9061-4. [DOI] [PubMed] [Google Scholar]
  5. Assaf M, Mobilia M. Large fluctuations and fixation in evolutionary games. J Stat Mech. 2010;2010:P09009. [Google Scholar]
  6. Assaf M, Mobilia M. Metastability and anomalous fixation in evolutionary games on scale-free networks. Phys Rev Lett. 2012;109:188701. doi: 10.1103/PhysRevLett.109.188701. [DOI] [PubMed] [Google Scholar]
  7. Ewens WJ. Mathematical population genetics I. Theoretical introduction. New York: Springer; 2010. [Google Scholar]
  8. Fu F, Rosenbloom DI, Wang L, Nowak MA. Imitation dynamics of vaccination behaviour on social networks. Proc R Soc B. 2011;278:42–49. doi: 10.1098/rspb.2010.1107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Galam S, Jacobs F. The role of inflexible minorities in the breaking of democratic opinion dynamics. Physica A. 2007;381:366–376. doi: 10.1016/j.physa.2007.03.034. [DOI] [Google Scholar]
  10. Hauert C, Doebeli M. Spatial structure often inhibits the evolution of cooperation in the snowdrift game. Nature. 2004;428:643–646. doi: 10.1038/nature02360. [DOI] [PubMed] [Google Scholar]
  11. Van Kampen NG. Stochastic processes in physics and chemistry. 3. Netherlands: Elsevier; 2007. [Google Scholar]
  12. Krapivsky PL, Redner S, Ben-Naim E. A kinetic view of statistical physics. Cambridge: Cambridge University Press; 2010. [Google Scholar]
  13. Kreindler GE, Young HP. Fast convergence in evolutionary equilibrium selection. Games Econ Behav. 2013;80:39–67. doi: 10.1016/j.geb.2013.02.004. [DOI] [Google Scholar]
  14. Liu XT, Wu ZX, Zhang L. Impact of committed individuals on vaccination behavior. Phys Rev E. 2012;86:051132. doi: 10.1103/PhysRevE.86.051132. [DOI] [PubMed] [Google Scholar]
  15. Masuda N. Evolution of cooperation driven by zealots. Sci Rep. 2012;2:646. doi: 10.1038/srep00646. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Maynard Smith J. Evolution and the theory of games. Cambridge: Cambridge University Press; 1982. [Google Scholar]
  17. Mobilia M. Does a single zealot affect an infinite group of voters? Phys Rev Lett. 2003;91:028701. doi: 10.1103/PhysRevLett.91.028701. [DOI] [PubMed] [Google Scholar]
  18. Mobilia M. Stochastic dynamics of the prisoner’s dilemma with cooperation facilitators. Phys Rev E. 2012;86:011134. doi: 10.1103/PhysRevE.86.011134. [DOI] [PubMed] [Google Scholar]
  19. Mobilia M, Petersen A, Redner S (2007) On the role of zealotry in the voter model. J Stat Mech: P08029
  20. Moran PAP. Random processes in genetics. Proc Cambridge Philos Soc. 1958;54:60–71. doi: 10.1017/S0305004100033193. [DOI] [Google Scholar]
  21. Nowak MA, Sasaki A, Taylor C, Fudenberg D. Emergence of cooperation and evolutionary stability in finite populations. Nature. 2004;428:646–650. doi: 10.1038/nature02414. [DOI] [PubMed] [Google Scholar]
  22. Nowak MA. Evolutionary dynamics. MA: The Belknap Press of Harvard University Press; 2006. [Google Scholar]
  23. Redner S. A guide to first-passage processes. Cambridge: Cambridge University Press; 2001. [Google Scholar]
  24. Singh P, Sreenivasan S, Szymanski BK, Korniss G. Accelerating consensus on coevolving networks: the effect of committed individuals. Phys Rev E. 2012;85:046104. doi: 10.1103/PhysRevE.85.046104. [DOI] [PubMed] [Google Scholar]
  25. Sugden R. The economics of rights, co-operation and welfare. New York: Blackwell; 1986. [Google Scholar]
  26. Taylor C, Fudenberg D, Sasaki A, Nowak MA. Evolutionary game dynamics in finite populations. Bull Math Biol. 2004;66:1621–1644. doi: 10.1016/j.bulm.2004.03.004. [DOI] [PubMed] [Google Scholar]
  27. Traulsen A, Pacheco JM, Nowak MA. Pairwise comparison and selection temperature in evolutionary game dynamics. J Theor Biol. 2007;246:522–529. doi: 10.1016/j.jtbi.2007.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Traulsen A, Shoresh N, Nowak MA. Analytical results for individual and group selection of any intensity. Bull Math Biol. 2008;70:1410–1424. doi: 10.1007/s11538-008-9305-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Wu B, Altrock PM, Wang L, Traulsen A. Universality of weak selection. Phys Rev E. 2010;82:046106. doi: 10.1103/PhysRevE.82.046106. [DOI] [PubMed] [Google Scholar]
  30. Xie J, Sreenivasan S, Korniss G, Zhang W, Lim C, Szymanski BK. Social consensus through the influence of committed minorities. Phys Rev E. 2011;84:011130. doi: 10.1103/PhysRevE.84.011130. [DOI] [PubMed] [Google Scholar]

Articles from Journal of Mathematical Biology are provided here courtesy of Springer

RESOURCES