Skip to main content
Proceedings of the Royal Society B: Biological Sciences logoLink to Proceedings of the Royal Society B: Biological Sciences
. 2006 Dec 12;274(1610):689–695. doi: 10.1098/rspb.2006.3759

Tag-based indirect reciprocity by incomplete social information

Naoki Masuda 1,*, Hisashi Ohtsuki 2
PMCID: PMC2197208  PMID: 17254993

Abstract

Evolution of altruistic behaviour in interacting individuals is accounted for by, for example, kin selection, direct reciprocity, spatially limited interaction and indirect reciprocity. Real social agents, particularly humans, often take actions based on similarity between themselves and others. Although tag-based indirect reciprocity in which altruism occurs exclusively among similar flocks is a natural expectation, its mechanism has not really been established. We propose a model of tag-based indirect reciprocity by assuming that each player may note strategies of others. We show that tag-based altruism can evolve to eradicate other strategies, including unconditional defectors for various initial strategy configurations and parameter sets. A prerequisite for altruism is that the strategy is sometimes, but not always, visible to others. Without visibility of strategies, policing does not take place and defection is optimal. With perfect visibility, what a player does is always witnessed by others and cooperation is optimal. In the intermediate regime, discriminators based on tag proximity, rather than mixture of generous players and defectors, are most likely to evolve. In this situation, altruism is realized based on homophily in which players are exclusively good to similar others.

Keywords: altruism, evolutionary game, indirect reciprocity, homophily

1. Introduction

According to classical game theory, defection rather than cooperation is the optimal behaviour from individualistic points of view. However, altruistic behaviour evolves in real social systems, particularly in human societies (Fehr & Fischbacher 2003). Altruism is explained by mechanisms such as kin selection (Hamilton 1964), assortative mating (Eshel & Cavalli-Sforza 1982; Bergstrom 2003), direct reciprocity (Trivers 1971; Axelrod & Hamilton 1981; Axelrod 1984), spatial reciprocity based on short-range interaction (Nowak & May 1992) and indirect reciprocity (Nowak & Sigmund 1998a,b, 2005; Brandt & Sigmund 2004, 2005; Ohtsuki & Iwasa 2004). In particular, indirect reciprocity occurs if each player carries an image score that indicates how often the player has cooperated before. Then, unconditional cooperators with high image scores are trusted by others, but they are too good to be resistant to invasion by unconditional defectors. Defectors have low scores and cannot count on altruism from others. Discriminators with moderate scores can evolve because good players trust them and defectors cannot exploit them.

In many situations, individuals are equipped with biological and social tags according to which they are categorized. Similar birds tend to flock together based on tags, which is assortative interaction (McPherson & Smith-Lovin 1987; Marsden 1988; McPherson et al. 2001; Newman 2003). Intuitively, players may exclusively help similar others to lead to indirect reciprocity. Imagine that we meet a stranger P from the same town or school by chance. Then, we may be inclined to help P even if we do not expect to see P later on. However, defection is theoretically better in such a one-shot encounter.

Tag-based reciprocity was proposed as a mechanism of indirect reciprocity (Riolo et al. 2001). Each player has a tolerance value as strategy. Altruistic behaviour happens when the difference between the opponent's tag and the tag of the focal player does not exceed the tolerance of the focal player. Altruism can evolve among players with close tags even if they have not met before. A key factor for tag-based altruism in this model is the assumption that players with the same tag value always help each other. If this condition is removed, which may be more realistic, cooperation does not evolve (Roberts & Sherratt 2002).

Nevertheless, these debates do not exclude the possible importance of tag-based altruism. In combination with population viscosity, which is known to elicit altruism in the Prisoner's Dilemma (Nowak & May 1992), use of tags reinforces altruism (Hammond & Axelrod 2006). Another scenario that revives tag-based altruism is that the tag and the tolerance genes sit on different alleles and inheritance of these genotypes are only loosely coupled. Thus, if many tag values are allowed and a population is spatially organized, tag-based altruism is viable (Jansen & van Baalen 2006).

In this work, we propose a mechanism of indirect reciprocity based on tags, without resorting to a prefabricated cooperative tendency (Riolo et al. 2001) and spatial structure of populations (Nowak & May 1992; Hammond & Axelrod 2006; Jansen & van Baalen 2006). The main assumption is that players may observe the tolerance (strategy) of opponents. In reality, third parties may communicate the information about others' strategies to a focal player. We show that tag-based indirect reciprocity emerges for a variety of initial configurations when tolerances are observable with moderate probabilities.

2. Model: strategy and pay-off

We assume n individuals involved in the evolutionary Prisoner's Dilemma game. In each round, a donor and a recipient are taken from the population randomly. If the donor decides to donate, the donor loses cost c and the recipient gains benefit b, which is larger than c. If the donor refuses to donate, both the donor and the recipient get nothing. A generation consists of a sufficiently small number of rounds that prohibits two specific players from interacting more than once on average. Therefore, there is no room for direct reciprocity. After a generation, the strategy of players with larger total pay-offs is more likely to be disseminated, whereas poor strategies are eliminated. Since b>c, the population pay-off is maximized when everybody is always altruistic. However, unconditional defection, but not cooperation, is the evolutionary stable strategy (ESS) (Axelrod & Hamilton 1981; Axelrod 1984); here is a dilemma.

We assume that the ith individual (1≤in) has tag wi and tolerance μi. Everybody can see others' tags. Tags may take nominal or graduated values. In either case, the distance between two tags 0≤l(wi,wj)≤lmax is defined, where lmax is the maximal distance. The tag wi can be assumed to vary in time (Riolo et al. 2001; Jansen & van Baalen 2006), which is typically the case in, for example, the preference of political parties in democratic regimes and fashions in clothing. Tags evolve much more slowly than do tolerances in other situations. For example, inborn biological or social traits and strong beliefs can be considered invariant for a player, whereas tolerances can adapt via social learning on a relatively short time-scale. We focus on the latter case; we consider the evolution of tolerances with tags fixed.

Suppose that player i and player j are, respectively, chosen as donor and recipient in a single round. Then, the player i may donate to the player j if i and j are close enough according to i's criterion. In the previous model (Riolo et al. 2001), i pays c to benefit j by b if l(wi,wj)≤μi, where μi is the tolerance level of i. Players with μilmax always donate, whereas players with μi<0 never donate. These players do not resort to tags. Riolo et al. (2001) excluded unconditional defectors by setting μi≥0 and observed indirect reciprocity. However, we allow negative μi to investigate whether tag users can evolve in the presence of unconditional defectors (μi<0). Without additional mechanisms, unconditional defectors outperform unconditional cooperators and tag users; hence, there is no altruism (Roberts & Sherratt 2002).

To explore the possibility of tag-based indirect reciprocity, we introduce the probability q that the donor i detects μj, namely, the tolerance of the recipient j. We can tune the reputations of others' tolerances to be public with a large q or private with a small q (Nowak & Sigmund 1998a,b; Brandt & Sigmund 2004, 2005), whereas perfect social information (q=1) is assumed in other models (e.g. Ohtsuki & Iwasa 2004). If i knows that l(wi,wj)>μj, i is not motivated to be nice to j. Note that this is not because i does not expect cooperation back from j; i and j hardly interact more than once per generation. Rather, this is because i expects that j will not be nice to other players whose tags are close to wi. If other players do the same, natural selection will eliminate μj, which may lead to an altruistic population.

If q=0, the donor never sees the tolerance of the recipient. In this case, μ<0 will be stable (Roberts & Sherratt 2002). If q=1, the tolerance of everybody is transparent to the entire population. Thus, small μi will disappear rapidly, and everybody will be eventually altruistic (μ=lmax). We examine what occurs for intermediate values of q.

3. Two-tag model

We begin with a minimal model in which each player has tag wa or wb. Such a minimal model was used to analyse the replicator dynamics of the original model by Riolo et al. (Traulsen & Schuster 2003). We define l(wa,wa)=l(wb,wb)=0 and l(wa,wb)=1. It suffices to consider three tolerance values μ=−1, 0 and 1. Players with μ=−1 are unconditional defectors. Players with μ=0 are tag users. Players with μ=1 are unconditional cooperators if q=0. When tolerances of opponents are visible (q>0), these players do not necessarily cooperate. We call them generous players. We could introduce unconditional cooperators and players who neglect others' tolerances even when available. However, these strategies are outperformed by players using tags and tolerances. Therefore, we consider only the three strategies (μ=1, 0 and −1).

Suppose that a population consists of n players and that nt (n(1−t)) players are endowed with tag wa (wb). We denote by pμa the probability that a player with tag wa, who we call wa-player, has tolerance μ. Similarly, a wb-player has tolerance μ with probability pμb. If the population is well mixed, any player meets a wa-player (wb-player) and tolerance μ with probability tpμa((1t)pμb). Since the tag is visible to others, players may prefer to interact with conspecifics. Such assortative interaction may stem from spatial structure of the population. To take assortativity into account, we assume that the two players with the same tag can meet more frequently. Precisely, a player who meets an opponent with the opposite tag looks for an alternative opponent with the same tag with probability h (0≤h≤1). Thus, a wa-player interacts with a wa-player (wb-player) with probability t+h(1−t) ((1−h)(1−t)), whereas a wb-player interacts with a wa-player (wb-player) with probability (1−h)t (1−t+ht) (see table 1; Eshel & Cavalli-Sforza 1982; Bergstrom 2003). Non-assortative interaction is reproduced by h=0, and h=1 defines completely assortative interaction in which players with opposite tags never interact.

Table 1.

Probability that two players meet in the two-tag model.

  tag of the opponent
 
tag of the focal player wa wb
wa t+h(1−t) (1−h)(1−t)
wb (1−h)t 1−t+ht

Disregarding normalization, the pay-off Πμa(Πμb) for a wa-player (wb-player) with tolerance μ is given by

Π1a=(bcq)[(t+hht)(p0a+p1a)+(1h)(1t)p1b]c(1q), (3.1)
Π0a=b[(t+hht)(p0a+p1a)+(1q)(1h)(1t)p1b]c(1q)(t+hht)cq(t+hht)(p0a+p1a), (3.2)
Π1a=b(1q)[(t+hht)(p0a+p1a)+(1h)(1t)p1b], (3.3)
Π1b=(bcq)[(1t+ht)(p0b+p1b)+(1h)tp1a]c(1q), (3.4)
Π0b=b[(1t+ht)(p0b+p1b)+(1q)(1h)tp1a]c(1q)(1t+ht)cq(1t+ht)(p0b+p1b) (3.5)

and

Π1b=b(1q)[(1t+ht)(p0b+p1b)+(1h)tp1a]. (3.6)

Then, we examine the replicator dynamics (Taylor & Jonker 1978; Hofbauer & Sigmund 1998) of the population density described by

p˙μa=pμa(Πμaμ=1,0,1pμaΠμa), (3.7)
p˙μb=pμb(Πμbμ=1,0,1pμbΠμb). (3.8)

The pay-off of wa-players depends on the tolerance distribution of wb-players, who do not compete with wa-players, and vice versa. There are six variables (p1a, p0a, p1a, p1b, p0b and p1b). Since we are interested in competition among tolerances with tags fixed, the number of wa-players (=nt) and that of wb-players (=n(1−t)) is preserved. Then, we obtain p1a+p0a+p1a=1, p1b+p0b+p1b=1, and the replicator system represented by equations (3.7) and (3.8) is four-dimensional.

(a) Symmetric strategies

Let us start with a simple case in which wa-players equal wb-players in number (t=0.5) and the initial condition is symmetric: pμa=pμb (μ=−1, 0 and 1). Then, the dynamics represented by equations (3.7) and (3.8) preserve the relations pμpμa=pμb and ΠμΠμa=Πμb (μ=−1, 0 and 1), and are equivalent to the two-dimensional replicator dynamics with p−1, p0 and p1 (p−1+p0+p1=1). Equations (3.1)–(3.6) are reduced to

Π1=(bcq)(1+h2p0+p1)c(1q), (3.9)
Π0=[b(1q)+(bc)q(1+h)2]p1+(bcq)(1+h)2p0c(1q)(1+h)2, (3.10)

and

Π1=b(1q)(1+h2p0+p1). (3.11)

Then, we have

Π1>Π0(bc)qp1>c(1q), (3.12)
Π0>Π1(bc)q(p0+p1)>c(1q), (3.13)

and

Π1>Π1(bc)q(1+h2p0+p1)>c(1q). (3.14)

As schematically shown in figure 1, the tolerance distribution of a population is mapped to a point in the simplex defined by p0+p1+p2=1 and p0, p1, p2≥0. On the line p1=0 (or p0+p1=1), we have Π1>Π0p1>A, where A=c(1−q)/[(bc)q]≥0. Similarly, we derive Π1>Π1p1>A on p0=0 and Π0>Π1p0>A on p1=0. Therefore, if A<1, then (p1, p0, p1)=(1,0,0),(0,1,0),(0,0,1) are ESSs. The condition A<1 is equivalent to bq>c, which coincides with the requirement for altruism in the image scoring model (Nowak & Sigmund 1998a,b). If bqc, Π−1 is dominant, and unconditional defectors eventually occupy the population.

Figure 1.

Figure 1

Replicator dynamics with binary tags. (ac) Schematics of population dynamics for h=0 and (a) small, (b) intermediate, and (c) large q. (d, e) Comparison of different degrees of assortativity with b=1.0, c=0.3 and q=0.5. (d) h=0 and (e) h=0.8.

Let us concentrate on the case bq>c (A<1). In addition to the three corners of the simplex, (p1,p0,p1)=(1−A,0,A) is a degenerated source (see Appendix A). Dynamics with non-assortative interaction (h=0) are compared in figure 1a–c for different values of q. For a small q (bqc), most initial configuration leads to domination by unconditional defectors (p−1=1 and p0=p1=0), as shown in figure 1a. Since players can keep their tolerances secret, it is better to behave selfishly (Roberts & Sherratt 2002). As q increases (equivalently, A decreases), tolerances are observed by others, and it is rational to be nice. An increase in q, indeed, enlarges the attractive basins of the unanimity of tag users (p0=1 and p−1=p1=0) and that of generous players (p1=1 and p−1=p0=0). By contrast, the attractive basin of p−1=1 shrinks. For intermediate q, the attractive basin of p0=1 (shaded areas in figure 1a–c) is large. Particularly, when q=2c/(b+c) (namely, A=0.5), the attractive basin of p0=1 is largest and twice as large as those of p−1=1 and p1=1 (figure 1b). When q is close to 2c/(b+c), tag users are likely to outperform generous players and defectors to establish tag-based altruism for many initial conditions. If q is even larger, being generous is optimal for many configurations (figure 1c).

Numerically calculated replicator dynamics for q=0.5 are shown in figure 1d,e for two values of h. The results for h=0 (non-assortative encountering) resemble figure 1b. When h is large (assortative encountering), μ=0 and 1 are not so distinct because players do not often play with others with the opposite tag. Then, the trajectories inside the simplex are more or less symmetric about p0=p1 (figure 1e). By contrast, if players with the opposite tags are more likely to interact, μ=−1 and 0 are less distinguished. Note that assortativity (h>0) is not necessary for tag-based reciprocity.

(b) Asymmetric strategies

The assumption that wa- and wb-subpopulations share the identical number of players and the identical tolerance distributions is too restrictive. Here, we relax t=0.5 and pia=pib (i=0,1,2). Now the population dynamics described by equations (3.7) and (3.8) are four dimensional. Since analytical treatment of the four-dimensional system is intractable, we look for pure-strategy ESSs. If a pure strategy of the wa-subpopulation is the best response to the wb-subpopulation with a pure strategy and vice versa, this pair of tolerances is an ESS. Therefore, we investigate compatibility of 3×3=9 pure strategies.

Let us fix the tolerance distribution of the wb-players. Then, the best response for wa-players can be derived from equations (3.1)–(3.3). The best response depending on the tolerance density of wa-players is shown in figure 2a for p1b>A. Since X[A(1(t+hht))p1b]/(t+hht)<A for p1b>A, the region defined by p1a>A belongs to the basin of p1a=1. The same conclusion obeys with a and b swapped and t+hht replaced by 1−t+ht. Starting from p1a>A and p1b>A, generous players will eventually eradicate the other two strategies in both subpopulations. This extends the result for the symmetric case.

Figure 2.

Figure 2

The best response for wa-players when (a) p1b>A and (b) p1b<A. We set X=[A(1(t+hht))p1b]/(t+hht).

The best response for wa-players when p1b<A is shown in figure 2b. Since p1b<A, possible pure strategies for the wb-subpopulation are μ=−1 and 0. Figure 2b implies that the pure strategy p1a=1 is incompatible with p1b=1 and p0b=1. Similarly, p1b=1 is incompatible with p1a=1 and p0a=1. Therefore, we exclude (p1a,p0a,p1a,p1b,p0b,p1b)=(1,0,0,0,0,1),(0,1,0,0,0,1),(0,0,1,1,0,0)and(0,0,1,0,1,0). By contrast, four pure strategies: (p1a,p0a,p1a,p1b,p0b,p1b)=S1:(1,0,0,1,0,0),S2:(1,0,0,0,1,0),S3:(0,1,0,1,0,0)andS4:(0,1,0,0,1,0) are self-consistent stable steady states. Solutions S1 (all defectors) and S4 (all tag users) are symmetric, as discovered in §3a. With S2 (wa-defectors and wb-tag users) or S3 (wa-tag users and wb-defectors), players do not cooperate with dissimilar others. Consequently, how nice a wa- (wb-) player is to other wa- (wb-) players does not affect the pay-off and the tolerances of wb- (wa-) players. Since two groups are independent of each other, asymmetric solutions are allowed.

To investigate which of the five ESSs identified above is more feasible, we perform 3000 runs of evolutionary dynamics with different initial conditions. The initial tolerance density is chosen according to the uniform distribution on the simplex: p1x+p0x+p1x=1 (p1x,p0x,p1x0) independently for x=a and b, respectively. The initial tolerance of each player is picked according to the tolerance distributions generated in this way. We set n=300, b=1.0 and c=0.3. One generation consists of 8n rounds so that each player appears eight times as donor and eight times as recipient on average. Since 8×2≪n, a specific pair of players hardly meet more than once, and direct reciprocity is ruled out. We study asynchronous updating: after each generation, we randomly choose a small number of players (equal to 8). For each of these players, the tolerance is replaced by that of player i with probability (πiminj=1nπj)/i=1n(πiminj=1nπj), where πi (1≤in) is the ith player's generation pay-off. Players with larger πi are more likely to bear offspring. The players with the minimal pay-off (=minj=1nπj) do not reproduce themselves.

The final proportion of each pure strategy is shown in figure 3. For small q, μ=−1 results from most initial conditions for both subpopulations (dotted line). By contrast, μ=1 is reached for both subpopulations when q is large (dash-dotted line). For intermediate q, three pure strategies comprising tag users can evolve. The probability that any of the three ESSs, with tag users fixates is pretty large (≥0.5) for 0.4≤q≤0.55. This range of q agrees with q=2c/(b+c)≅0.46, which is the optimal q for prosperity of tag users in the symmetric population (§3a). The probability that any of the other four pure strategies dominates the population is tiny and is thus omitted in figure 3.

Figure 3.

Figure 3

The size of the basin of attraction of five main pure strategies.

4. Continuous-tag model

Let us consider the case of continuously distributed tags, which is more realistic in some practical situations. Similar to Riolo et al. (2001), we assume that tags are uniformly distributed on [0,1], where 0 and 1 are identified, and that the interaction is non-assortative. The difference between two tags is naturally defined as the distance on the ring. Formally, l(wi,wj)=min(|wiwj|, 1−|wiwj|) and 0≤l(wi,wj)≤lmax≡0.5. We set μi∈[μmin,μmax], where μmin=−10−6 and μmax=0.5. Players with negative μi are unconditional defectors. Along the lines of the two-tag model, it is straightforward to show that a population of players with μi=μ (1≤in) for any μ is not invaded by a player with a different tolerance, given bq>c. Such a unanimous population is an ESS for any μ. We are interested in which μ will evolve.

To see this, imagine that each player initially owns a tolerance value taken from the uniform distribution. Without losing generality, we assume that a reference player P1 has tag w=0.5 and tolerance μ. Player P1 potentially donates to others whose tag w′∈[0.5−μ, 0.5+μ]. Owing to the uniform tag density, a proportion of 2Δl players, or players with tag w′∈[0.5+l, 0.5+ll] or w′∈[0.5−l−Δl, 0.5−l], are in the range of distance [l,ll] from P1.

Since encounters occur in a uniform manner, P1’s pay-off is equal to

Πμ=(bc)q[μminμdμ0μdl+μμmaxdμ0μdl]+b(1q)μminμmaxdμ0μdlc(1q)μminμmaxdμ0μdl, (4.1)

up to normalization. The first term indicates that, when the tag is visible with probability q, the tolerance of both P1 (μ) and the opponents (μ′) must be larger than l for altruism. The second term corresponds to the situation in which P1 sneaks benefit b without exposing that P1 is selfish. The third term represents the opposite situation. Since μmin≅0, equation (4.1) is reduced to

Πμ=(bc)q2[μbqc(bc)qμmax]2+μmax22[(bqc)2(bc)q+b(1q)]. (4.2)

If bqc, the optimal tolerance is μ=0≅μmin, namely, unconditional defection; the same conclusion as that for the two-tag model. If bq>c, the optimal tolerance is given by

μ=bqc(bc)qμmax, (4.3)

which is smaller than μmax unless q=1. Now, it is beneficial to use tags. As a function of q, the best response to the uniform tolerance density is plotted in figure 4 for the three values of b/c. The optimal tolerance increases with q. Tag users prosper for a wider range of q in more benign environments (for b/c large).

Figure 4.

Figure 4

The best response in the continuous-tag model when tags and tolerances are uniformly distributed. b/c=1.2 (thinnest line), 2 (moderate line) and 4 (thickest line).

We numerically simulate the continuous-tag model to confirm the above prediction. Initially, the tag and the tolerance of n players are uniformly and independently distributed on [0,1] and [μmin, μmax], respectively. We set n=800, b=1.0 and c=0.3. Noiseless simulations are performed as described in §3b. In noisy simulations, after each generation with natural selection, eight randomly chosen players are mutated so that their tolerances are drifted by a random amount chosen uniformly from [−0.03,0.03].

The population average of the tolerance after transient is shown in figure 5a with mutation present (filled squares) and absent (open circles). The estimation based on equation (4.3) (solid line, figure 5a) agrees with the stationary tolerance pretty well. Tag users can evolve for intermediate values of q, whereas extreme values of q foster unconditional defectors or generous players. The pay-off averaged over the population increases linearly with the final μ (results not shown). The standard deviations of the tolerance based on n players stay small in each run (figure 5b). This indicates that μi tends to align even under continuously distributed tags and dynamical noise.

Figure 5.

Figure 5

Results for a population of randomly interacting n=800 players with continuous tags and tolerances. For each q, we perform 10 runs starting with the uniform tag and tolerance densities. The duration of each run is 30 000 generations. (a) Population average of μi based on simulations with no mutation (open circles), simulations with mutation (filled squares), and the prediction by equation (4.3) (solid line). (b) Standard deviation of μi after each run.

5. Discussion

We have shown that partial visibility of the tolerance (strategy) leads to tag-based indirect reciprocity. Our results extend Roberts & Sherratt's (2002) work, which showed that unconditional defection is the evolutionary outcome when the strategy of each player is completely sealed from others' eyes. If the tolerance is always observed by others, intolerant players reveal themselves to be uncooperative and cannot receive altruism from others. Thus, being generous is optimal; no room to cheat. When bq>c, a population in which all the players share an identical tolerance (whatever the value) is an ESS. This implies that a homogeneous population entirely of tag users with a shared tolerance level, as well as one of generous players and one of defectors, can be an ESS. Which ESS is more likely to be reached is sensitive to q, the probability that a player observes tolerance of an opponent. For an intermediate q, the unanimity of tag users results from a wide range of initial tolerance distributions including the uniform distribution. Naturally, the ultimate tolerance of the population monotonically increases with q. More generous players prosper for larger q (Nowak & Sigmund 1998a,b; Nowak et al. 2000; Brandt & Sigmund 2004, 2005).

The range of q hosting evolution of tag users, which is determined by b and c, is not necessarily large. However, starting from a population with more tag users than expected from the uniform density, the unanimity of tag users is more easily reached. In addition, our social lives are full of partial information. We usually have some but not complete information about others. Therefore, it is plausible to assume an intermediate value of q, for which we showed the possibility of tag-based altruism.

The mechanism that we have identified is that for indirect reciprocity, not direct reciprocity. Suppose that the player P1 is tolerant enough to behave altruistically to player P2. Even so, P1 is discouraged to be nice to P2 if P2 is revealed to be intolerant with probability q. In other words, P1 is not motivated to cooperate with those who will not cooperate back to P1. Although it may sound like direct reciprocity, every decision is based on tags and tolerances, and P1 and P2 do not have to interact in repetition.

In the previous models of tag-based indirect reciprocity, unconditional defectors are eliminated by definition (Riolo et al. 2001; Traulsen & Schuster 2003). Tag users can easily evolve under this condition. By contrast, our model allows unconditional defectors. Furthermore, the boundary condition was such that random mutation somehow tended to make players more cooperative. Our model is without such a drift. A positive tolerance drift would induce a cyclic-dominance relation between different phenotypes (Riolo et al. 2001; Traulsen & Schuster 2003), which is not the case in our model. With unconditional defectors and without the positive drift, survival of tag users is more challenging. We have related tag-based actions to altruism by introducing the probability that tolerances are observed by others.

The proposed mechanism of indirect reciprocity does not necessitate the memory of previous acts or strategies, in contrast to image scoring (Nowak & Sigmund 1998a,b; Brandt & Sigmund 2004, 2005; Ohtsuki & Iwasa 2004) and direct reciprocity (Trivers 1971; Axelrod & Hamilton 1981; Axelrod 1984). In our model, the action depends only on present tags and tolerances. In addition, assortative mixing has small effects on the conclusions (compare figure 1d,e). Assortativity is not mandatory for our mechanism. Neither do we need population viscosity (Nowak & May 1992; Hammond & Axelrod 2006; Jansen & van Baalen 2006), which is one way to implement assortativity. Furthermore, tag-based reciprocity does not necessitate weak coupling of tag and tolerance (Jansen & van Baalen 2006). This study is a first important step towards understanding the evolution of the tag-based indirect reciprocity.

Acknowledgments

We thank Mayuko Nakamaru for helpful comments on this work. N.M. acknowledges support from the Special Postdoctoral Researchers Program of RIKEN, and H.O. acknowledges support from the Japan Society for the Promotion of Science (JSPS).

Appendix A.

We examine the stability of the steady-state (p1,p0,p1)=(1−A,0,A) of the symmetric two-tag model. By eliminating (p1=1−p0p1), we derive the two-dimensional linear dynamics around the steady state up to the second order,

Δp˙0=Δp0[((bc)q(1+h)2+cqc)Δp1+(bqc)(1+h)2Δp0+o(Δp1,Δp2)], (A1)
Δp˙1=(bqc)AΔp1+(bqc)(1+h)A2Δp0+o(Δp1,Δp2). (A2)

Here, Δp1 and Δp2 are perturbations to the steady state, and op1, Δp2) summarizes small quantities relative to Δp1 and Δp0. The first-order terms in the r.h.s. of equations (A 1) and (A 2) indicate that the Jacobian has eigenvalues λ1=(bqc)A and λ2=0 with corresponding eigenvectors (p1,p0)=(1,0) and ((1+h)/2, −1). Provided A>0 (equivalently, bq>c), λ1>0 follows, and the steady-state (1−A,0,A) is unstable along the boundary of the simplex (p0=0). When the perturbation along the first eigenmode is small (Δp1≅0), the eigenmode with λ2=0 matters for stability. When Δp1≅0, equation (A 1) indicates that Δp˙0>0 inside the simplex (Δp0>0). Therefore, the steady state is a degenerated source.

References

  1. Axelrod R. Basic Books; New York, NY: 1984. Evolution of cooperation. [Google Scholar]
  2. Axelrod R, Hamilton W.D. The evolution of cooperation. Science. 1981;211:1390–1396. doi: 10.1126/science.7466396. doi:10.1126/science.7466396 [DOI] [PubMed] [Google Scholar]
  3. Bergstrom T.C. The algebra of assortative encounters and the evolution of cooperation. Int. Game Theory Rev. 2003;5:211–228. doi:10.1142/S0219198903001021 [Google Scholar]
  4. Brandt H, Sigmund K. The logic of reprobation: assessment and action rules for indirect reciprocation. J. Theor. Biol. 2004;231:475–486. doi: 10.1016/j.jtbi.2004.06.032. doi:10.1016/j.jtbi.2004.06.032 [DOI] [PubMed] [Google Scholar]
  5. Brandt H, Sigmund K. Indirect reciprocity, image scoring, and moral hazard. Proc. Natl Acad. Sci. USA. 2005;102:2666–2670. doi: 10.1073/pnas.0407370102. doi:10.1073/pnas.0407370102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Eshel I, Cavalli-Sforza L.L. Assortment of encounters and evolution of cooperativeness. Proc. Natl Acad. Sci. USA. 1982;79:1331–1335. doi: 10.1073/pnas.79.4.1331. doi:10.1073/pnas.79.4.1331 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Fehr E, Fischbacher U. The nature of human altruism. Nature. 2003;425:785–791. doi: 10.1038/nature02043. doi:10.1038/nature02043 [DOI] [PubMed] [Google Scholar]
  8. Hamilton W.D. The genetical evolution of social behaviour I and II. J. Theor. Biol. 1964;7:1–16. doi: 10.1016/0022-5193(64)90038-4. doi:10.1016/0022-5193(64)90038-4 see also 17–52. [DOI] [PubMed] [Google Scholar]
  9. Hammond R.A, Axelrod R. Evolution of contingent altruism when cooperation is expensive. Theor. Popul. Biol. 2006;69:333–338. doi: 10.1016/j.tpb.2005.12.002. doi:10.1016/j.tpb.2005.12.002 [DOI] [PubMed] [Google Scholar]
  10. Hofbauer J, Sigmund K. Cambridge University Press; Cambridge, UK: 1998. Evolutionary games and population dynamics. [Google Scholar]
  11. Jansen V.A.A, van Baalen M. Altruism through beard chromodynamics. Nature. 2006;440:663–666. doi: 10.1038/nature04387. doi:10.1038/nature04387 [DOI] [PubMed] [Google Scholar]
  12. Marsden P.V. Homogeneity in confiding relations. Soc. Netw. 1988;10:57–76. doi:10.1016/0378-8733(88)90010-X [Google Scholar]
  13. McPherson J.M, Smith-Lovin L. Homophily in voluntary organizations: status distance and the composition of face-to-face groups. Am. Sociol. Rev. 1987;52:370–379. doi:10.2307/2095356 [Google Scholar]
  14. McPherson M, Smith-Lovin L, Cook J.M. Birds of a feather: homophily in social networks. Annu. Rev. Sociol. 2001;27:415–444. doi:10.1146/annurev.soc.27.1.415 [Google Scholar]
  15. Newman M.E.J. Mixing patterns in networks. Phys. Rev. E. 2003;67:026126. doi: 10.1103/PhysRevE.67.026126. doi:10.1103/PhysRevE.67.026126 [DOI] [PubMed] [Google Scholar]
  16. Nowak M.A, May R.M. Evolutionary games and spatial chaos. Nature. 1992;359:826–829. doi:10.1038/359826a0 [Google Scholar]
  17. Nowak M.A, Sigmund K. Evolution of indirect reciprocity by image scoring. Nature. 1998a;393:573–577. doi: 10.1038/31225. doi:10.1038/31225 [DOI] [PubMed] [Google Scholar]
  18. Nowak M.A, Sigmund K. The dynamics of indirect reciprocity. J. Theor. Biol. 1998b;194:561–574. doi: 10.1006/jtbi.1998.0775. doi:10.1006/jtbi.1998.0775 [DOI] [PubMed] [Google Scholar]
  19. Nowak M.A, Sigmund K. Evolution of indirect reciprocity. Nature. 2005;437:1291–1298. doi: 10.1038/nature04131. doi:10.1038/nature04131 [DOI] [PubMed] [Google Scholar]
  20. Nowak M.A, Page K.M, Sigmund K. Fairness versus reason in the ultimatum game. Science. 2000;289:1773–1775. doi: 10.1126/science.289.5485.1773. doi:10.1126/science.289.5485.1773 [DOI] [PubMed] [Google Scholar]
  21. Ohtsuki H, Iwasa Y. How should we define goodness?—reputation dynamics in indirect reciprocity. J. Theor. Biol. 2004;231:107–120. doi: 10.1016/j.jtbi.2004.06.005. doi:10.1016/j.jtbi.2004.06.005 [DOI] [PubMed] [Google Scholar]
  22. Riolo R.L, Cohen M.D, Axelrod R. Evolution of cooperation without reciprocity. Nature. 2001;414:441–443. doi: 10.1038/35106555. doi:10.1038/35106555 [DOI] [PubMed] [Google Scholar]
  23. Roberts G, Sherratt T.N. Does similarity breed cooperation? Nature. 2002;418:499–500. doi: 10.1038/418499b. doi:10.1038/418499b [DOI] [PubMed] [Google Scholar]
  24. Taylor P.D, Jonker L.B. Evolutionary stable strategies and game dynamics. Math. Biosci. 1978;40:145–156. doi:10.1016/0025-5564(78)90077-9 [Google Scholar]
  25. Traulsen A, Schuster H.G. Minimal model for tag-based cooperation. Phys. Rev. E. 2003;68:046129. doi: 10.1103/PhysRevE.68.046129. doi:10.1103/PhysRevE.68.046129 [DOI] [PubMed] [Google Scholar]
  26. Trivers R. The evolution of reciprocal altruism. Q. Rev. Biol. 1971;46:35–57. doi:10.1086/406755 [Google Scholar]

Articles from Proceedings of the Royal Society B: Biological Sciences are provided here courtesy of The Royal Society

RESOURCES