Indirect reciprocity with simple records

Daniel Clark; Drew Fudenberg; Alexander Wolitzky

doi:10.1073/pnas.1921984117

. 2020 May 12;117(21):11344–11349. doi: 10.1073/pnas.1921984117

Indirect reciprocity with simple records

Daniel Clark ^a,¹, Drew Fudenberg ^a,^1,², Alexander Wolitzky ^a,¹

PMCID: PMC7260940 PMID: 32398366

Significance

Indirect reciprocity is a foundational mechanism of human cooperation, and understanding the social structures that allow it to arise continues to be a core issue in both the social sciences and evolutionary biology. This paper analyzes a model of indirect reciprocity in steady-state equilibria, where players observe only their partners’ records, and each individual’s record depends on the individual’s own past behavior alone. We show that tolerant trigger strategies based on these simple records can robustly support positive social cooperation in games with sufficient “strategic complementarity,” both in the prisoner’s dilemma and in some multiplayer public goods games, and we show that the resulting cooperative equilibria have strong stability properties.

Keywords: indirect reciprocity, robust cooperation, strategic complementarity, strategic substitutability

Abstract

Indirect reciprocity is a foundational mechanism of human cooperation. Existing models of indirect reciprocity fail to robustly support social cooperation: Image-scoring models fail to provide robust incentives, while social-standing models are not informationally robust. Here we provide a model of indirect reciprocity based on simple, decentralized records: Each individual’s record depends on the individual’s own past behavior alone, and not on the individual’s partners’ past behavior or their partners’ partners’ past behavior. When social dilemmas exhibit a coordination motive (or strategic complementarity), tolerant trigger strategies based on simple records can robustly support positive social cooperation and exhibit strong stability properties. In the opposite case of strategic substitutability, positive social cooperation cannot be robustly supported. Thus, the strength of short-run coordination motives in social dilemmas determines the prospects for robust long-run cooperation.

People (and perhaps also other animals) often trust each other to cooperate even when they know they will never meet again. Such indirect reciprocity relies on individuals having some information about how their partners have behaved in the past. Existing models of indirect reciprocity fall into two paradigms. In the image-scoring paradigm, each individual carries an image that improves when the individual helps others, and (at least some) individuals help only those with good images (1, 2). In the standing paradigm, each individual carries a standing that typically improves when the individual helps others with good standing, but not when the individual helps those with bad standing, and individuals with good standing help only other good-standing individuals (3, 4).

Neither of these paradigms provides a robust explanation for social cooperation. In image-scoring models, there is no reason for an individual to help only partners with good images: Since the partner’s image does not affect one’s future payoff, helping some partners and not others is optimal only if one is completely indifferent between helping and not helping. In game-theoretic terms, individuals never have strict incentives to follow image-scoring strategies, and hence such strategies can form at best a weak equilibrium. Closely related to this point, image-scoring equilibria are unstable in several environments (5, 6). Standing models do yield strict, stable equilibria, but they fail to be informationally robust: An individual’s standing is a function of not only the individual’s past behavior, but also the individual’s past partners’ behavior, their partners’ partners’ behavior, and so on ad infinitum. In the absence of centralized record keeping or some way of physically marking bad-standing individuals, computing such a function requires information that is likely unavailable in many groups (7).

We develop a theoretical paradigm for modeling indirect reciprocity that supports positive social cooperation as a strict, stable equilibrium while relying only on simple, individualistic information: When two players meet, they observe each other’s records and nothing else, and each individual’s record depends only on the individual’s own past behavior. [Individualistic information is also called “first-order” (8–10).]

As our model of individual interaction, we use the classic prisoner’s dilemma (“PD”) with actions $C, D$ (“cooperate,” “defect”) and a standard payoff normalization, where the gain from unilateral defection, $g$ , and the loss from unilateral cooperation, $l$ , are both positive and satisfy the condition $g < l + 1$ , which means that joint payoffs are maximized by mutual cooperation (Fig. 1, Left). This canonical game can capture many two-sided interactions, such as business partnerships (11), management of public resources (12, 13), and risk sharing in developing societies (14), as well as many well-documented animal behaviors (15).

Fig. 1. — The prisoner’s dilemma. (*Left*) Matrices show how any prisoner’s dilemma can be represented by the standard normalization with $g = (T - R) / (R - P)$ and $l = (P - S) / (R - P)$ , where $T > R > P > S$ . (*Right*) Matrices illustrate this normalization for “donation games” in which choosing $G$ ( $G i v e$ ) instead of $S$ ( $S h i r k$ ) incurs a personal cost $c$ and gives benefit $b > c$ to the opponent.

A critical feature of the PD is whether it exhibits strategic complementarity or strategic substitutability. Strategic complementarity means that the gain from playing $D$ is greater when the opponent also plays $D$ . In the PD payoff matrix displayed in Fig. 1, this corresponds to the condition

g < l .

The opposite case of strategic substitutability arises when the gain from playing $D$ is greater when the opponent plays $C$ : Mathematically, this occurs when

g > l .

Many previous studies of indirect reciprocity restrict attention to the “donation game” instance of the PD where $g = l$ , as in Fig. 1, Right (16).* Our analysis reveals this to be a knife-edge case that obscures the distinction between strategic complementarity ( $g < l$ ) and substitutability ( $g > l$ ). This distinction has long been known to be of critical importance in economics (18, 19), while its implications for cooperation in the repeated prisoner’s dilemma have been noted more recently (8, 9). When a player’s record depends only on the player’s own past actions, the future reward for cooperation (or future penalty for defection) is independent of the player’s current opponent’s record. Therefore, to obtain an equilibrium where a player has a strict incentive to cooperate if and only if the opponent’s record is good, the cost of cooperation must be lower against an opponent with a good record (who cooperates) than against one with a bad record (who defects): That is, cooperation requires $g < l$ .

Strategic complementarity is a common case in realistic social dilemmas. It implies that although $D$ is always selfishly optimal (a defining feature of the PD), the social dilemma nonetheless retains some aspect of a coordination game, so that playing $C$ is less costly when one’s partner also plays $C$ . For example, mobbing a predator is always risky (hence costly) for each individual, but it is much less risky when others also mob (20).

In our model, each player’s record is an integer, which evolves as a function of the player’s history of plays of $C$ and $D$ . We assume the system is subject to some noise, so that, whenever an individual plays $C$ , with probability $ε$ the individual’s record updates as if the individual had played $D$ instead.^† Here the level of noise $ε \in (0,1)$ can reflect either errors in recording or errors in executing the intended action.

A simple example of such a record system is the “counting $D$ s” system where a player’s record is just a count of the number of times the player has defected (or cooperated and was hit by noise). More complicated record systems could also count the number of times a player cooperated and could also keep track of the time path of plays of $C$ and $D$ . We analyze a fairly broad class of strategies, with the following three defining properties: 1) The set of all possible records can be partitioned into two classes, “good records” and “bad records.” 2) When two players with good records meet each other, they cooperate; if instead either partner has a bad record, both players defect. 3) The class of bad records is absorbing: Once a player obtains a bad record, the record remains bad forever. We refer to this as the class of trigger strategies.

Examples of trigger strategies include strategies where a player’s record becomes bad once the absolute number of times the player has defected crosses a threshold $K$ , as well as strategies where the player’s record becomes bad the first time the fraction of times the player has defected crosses a threshold. We call strategies of the former type tolerant grim trigger strategies or $G r i m K$ , as they are a form of the well-known grim trigger strategies (21) with a “tolerance” of $K$ recorded plays of $D$ . We will see that $G r i m K$ strategies succeed in supporting cooperation for a broad range of payoff parameters. Moreover, if the payoff parameters preclude cooperation under $G r i m K$ strategies, they also preclude cooperation under any other trigger strategy.

We analyze the steady-state equilibria of a system where the total population size is constant, but each individual has a geometrically distributed lifetime with survival probability $γ \in (0,1)$ . Players play the PD with random rematching every period and receive no information about their current partner other than the partner’s record. To ensure robustness, we insist that equilibrium behavior is strictly optimal at every record; in classical (normal-form) games, this implies that the equilibrium is evolutionarily stable (22, 23).

Results

Steady-State Cooperation.

We show that $G r i m K$ strategies can form a strict steady-state equilibrium if and only if the PD exhibits substantial strategic complementarity, in that the gain from playing $D$ rather than $C$ is significantly greater when the opponent plays $D$ : The precise condition required in the PD payoff matrix displayed in Fig. 1 is

g < \frac{l}{1 + l} .

Under this condition, the tolerance level $K$ can be tuned so that $G r i m K$ strategies support positive social cooperation in a steady-state equilibrium.

To see how to tune the threshold $K$ , note that since even individuals who always try to cooperate are sometimes recorded as playing $D$ due to noise, $K$ must be large enough that the steady-state share of the population with good records is sufficiently high: With any fixed value of $K$ , a population of sufficiently long-lived players would almost all have bad records. However, $K$ also cannot be too high, as otherwise an individual with a very good record (that is, with a very low number of $D$ s) can safely play $D$ until the individual’s record approaches the threshold. Another constraint is that an individual with record $K - 1$ who meets a partner with a bad record must not be tempted to deviate to $C$ to preserve the individual’s own good record. These constraints lead to an upper bound on the maximum share of cooperators in equilibrium. As lifetimes become long and noise becomes small, this upper bound converges to 0 whenever $g > l / (1 + l)$ and to $l / (1 + l)$ whenever $g < l / (1 + l)$ (Fig. 2), and we show that this share of cooperators can in fact be attained in equilibrium in the $(γ, ε) \to (1,0)$ limit. Thus, greater strategic complementarity (higher $l$ and lower $g$ ) not only helps support some cooperation; it also increases the maximum level of cooperation in the limit, as shown in Fig. 3.

Fig. 2. — Upper bounds on cooperation. The entries are upper bounds on the share of cooperators possible in a $G r i m K$ equilibrium for various $γ$ and $ε$ values when $g = 0.5$ and $l = 2.5$ , with a darker shade indicating a higher value as shown in the scale at *Right*. As we move to the bottom right, the upper bound converges to $l / (1 + l) \approx 0.7143$ , which is the maximum share of cooperators sustainable in the limit, but away from the limit the upper bound can be different (the values in this table are all higher, but this is not the case for small $γ$ or large $ε$ ).

Fig. 3. — Limit performance of $G r i m K$ strategies. (A) In the green region ( $l > g / (1 - g)$ ), $G r i m K$ strategies sustain a positive limit share of cooperators, which increases with $l$ , as indicated by a deeper shade of green. In the orange region ( $g < l < g / (1 - g)$ ), the limit share of cooperators with $G r i m K$ is 0, but other strategies may sustain positive cooperation in the limit. In the red region ( $l \leq g$ ), individualistic records preclude cooperation. (B) The limit share of cooperators as a function of $l$ when $g = 1 / 2$ . At $l = 1$ , there is a discontinuity; as $l \to \infty$ , the limit share of cooperators approaches 1.

We also show that, in the $(γ, ε) \to (1,0)$ limit, no trigger strategies can support a positive equilibrium share of cooperators if $g > l / (1 + l)$ , and no trigger strategies can support an equilibrium share of cooperations greater than $l / (1 + l)$ if $g < l / (1 + l)$ . Thus, when lifetimes are long and noise is small, $G r i m K$ strategies attain optimum equilibrium cooperation within the class of trigger strategies. The logic of this result is that the constraints on the performance of $G r i m K$ strategies imposed by players’ incentives and the presence of noise apply equally to any strategy in the trigger class.

Stability, Convergence, and Evolutionary Properties.

$G r i m K$ strategies also satisfy desirable stability and convergence properties. These derive from an important monotonicity property of $G r i m K$ strategies: When the distribution of individual records is more favorable today, the same will be true tomorrow, because players with better records both behave more cooperatively and induce more cooperative behavior from their partners. (See Methods for a precise statement.) From this observation it can be shown that, whenever the initial distribution of records is more favorable than the best steady-state record distribution, the record distribution converges to the best steady state. Similarly, whenever the initial distribution is less favorable than the worst steady state, convergence to the worst steady state obtains (Fig. 4). These additional robustness properties are not shared by more complicated, nonmonotone strategies that can sometimes support cooperation for a wider range of parameters than $G r i m K$ .

Fig. 4. — Convergence of the share of cooperators. A depicts trajectories for the share of cooperators when $γ = 0.8$ , $ε = 0.02$ , and players use the $G r i m 1$ strategy; B does the same for the $G r i m 2$ strategy. In A, all trajectories converge to the unique steady state; in B, there are three steady states. Here “high” trajectories converge to the most cooperative steady state, while “low” trajectories converge to the least cooperative steady state. See *Methods* for details.

We also analyze evolutionary properties of $G r i m K$ equilibria. When $g < l / (1 + l)$ , there is a sequence of $G r i m K$ equilibria that are “steady-state robust to mutants” and attain the maximum limit cooperation share of $l / (1 + l)$ . By this we mean that, when a small fraction of players adopt some mutant $G r i m K^{'}$ strategy where $K^{'} \neq K$ , there is a steady-state distribution of records where it remains strictly optimal to play according to $G r i m K$ . We also perform simulations of dynamic evolution when a population playing a $G r i m K$ equilibrium is infected by a mutant population playing $G r i m K^{'}$ for some $K^{'} \neq K$ (SI Appendix, Fig. S1).

Multiplayer Public Goods Games.

Although our main analysis takes the basic unit of social interaction to be the standard two-player PD, many social interactions involve multiple players: The management of the commons and other public resources is a leading example (12, 13). In SI Appendix we establish that, when strategic complementarity is sufficiently strong, robust cooperation in the multiplayer public goods game can be supported by a simple variant of $G r i m K$ strategies, wherein a player contributes to the public good if and only if all of the player’s current partners have good records. In contrast, with strategic substitutability the unique strict equilibrium involves zero contribution. As the $n$ -player public good game is a generalization of the PD, this implies that individualistic records preclude cooperation in the PD with strategic substitutability, as indicated in the red region in Fig. 3A.

Discussion

We have shown how individualistic records robustly support indirect reciprocity in supermodular PD and multiplayer public goods games. To place our results in context, recall that scoring models do not provide robust incentives, while standing models compute records as a recursive function of a player’s partners’ past actions and standing, their partners’ actions and standing, and so on, and thus require more information than may typically be available. The simplicity and power of individualistic records suggest that they may be usefully adapted to specific settings where cooperation is based on indirect reciprocity, such as online rating systems (24, 25), credit ratings (10, 26), decentralized currencies (27, 28), and monitoring systems for conflict resolution (29). Individualistic records may also prove useful in modeling the role of costly punishment in the evolution of cooperation (30–33).

We interpret individualistic records and $G r i m K$ strategies as both a theoretical demonstration that simple strategies can sometimes support cooperation using only first-order information and an approximation of human behavior in a range of environments. For example, when meeting a potential business partner for the first time, it is common to contact the person’s past partners and inquire about the potential partner’s past behavior, typically without delving into the past partners’ own past behavior or the past partners’ partners’ behavior. Similarly, in online marketplaces such as eBay or Airbnb, one typically rates one’s current partner’s behavior in the absence of any information about the current partner’s past partners’ behavior. Users then observe summary statistics that depend only on their current partner’s own past behavior, which is an example of individualistic (first-order) records. Moreover, if users behave honestly only with partners who have not received too many negative reviews, their behavior can be approximated by $G r i m K$ strategies.

We conclude by discussing possible extensions of our analysis.

First, while we have analyzed the evolutionary stability of the $G r i m K$ equilibrium, we have not analyzed how this equilibrium could first arise. In our model, it is a strict equilibrium for all agents to always defect, so that equilibrium is also an evolutionarily stable strategy. To explain how society might move from such a state to a more cooperative equilibrium such as $G r i m K$ , we could appeal to random mutations. Given our continuum population, this could be modeled as a deterministic drift as in ref. 34, but we do not develop that argument here.

We have also assumed that everyone shares the same assessment of each individual’s record. This “public information” assumption is known to be critical in some prior models of indirect reciprocity. In our model, allowing heterogeneous assessments of a player’s record would not change the analysis very much, so long as both partners learn their opponents’ assessments of their records before taking actions (35–37). The more complex situation where each partner’s assessment of the other’s record is private information would be interesting to study in future research.

Methods

Here we summarize the model and mathematical results; further details are provided in SI Appendix.

A Model of Social Cooperation with Individualistic Records.

Time is discrete and doubly infinite: $t \in {\dots, - 2, - 1,0,1,2, \dots}$ . There is a population of individuals of unit mass, each with survival probability $γ \in (0,1)$ , so each individual’s lifespan is geometrically distributed with mean $1 / (1 - γ)$ . An inflow of $1 - γ$ newborn players each period keeps the total population size constant. We thus have an infinite-horizon dynamic model with overlapping generations of players (38).

Every period, individuals randomly match in pairs to play the PD (Fig. 1). Each individual carries a record $k \in N ≔ {0,1,2, \dots}$ . Newborns have record 0. Under the counting $D$ s record system, whenever an individual plays $D$ , the individual’s record increases by 1, while whenever an individual plays $C$ , the individual’s record remains constant with probability $1 - ε$ and increases by 1 with probability $ε$ ; thus, $ε \in (0,1)$ measures the amount of noise in the system (39–43). More generally, a record system specifies an arbitrary next-period record as a function of the current-period record and the current-period recorded action, which equals $D$ if the individual plays $D$ , equals $C$ with probability $1 - ε$ , and equals $D$ with probability $ε$ if the individual plays $C$ .

When two players meet, they observe each other’s records and nothing else. A strategy is a mapping $s : N \times N \to {C, D}$ , with the convention that the first component of the domain is a player’s own record and the second component is the current opponent’s record. We assume that all players use the same strategy, noting that this must be the case in every strict equilibrium in a symmetric, continuum-agent model like ours. (Of course, players who have different records and/or meet opponents with different records may take different actions.)

The state of the system $μ \in Δ (N)$ describes the share of the population with each record, where $μ_{k} \in [0,1]$ denotes the share with record $k$ . When all players use strategy $s$ , let $f_{s} : Δ (N) \to Δ (N)$ denote the resulting update map governing the evolution of the state. (The formula for $f_{s} (μ)$ is in SI Appendix.) A steady state under strategy $s$ is a state $μ$ such that $f_{s} (μ) = μ$ .

Given a strategy $s$ and state $μ$ , the expected flow payoff of a player with record $k$ is $π_{k} (s, μ) = \sum_{k^{'}} μ_{k^{'}} u (s (k, k^{'}), s (k^{'}, k))$ , where $u$ is the PD payoff function. Denote the probability that a player with current record $k$ has record $k^{'} t$ periods in the future by $ϕ_{k} {(s, μ)}^{t} (k^{'})$ . The continuation payoff of a player with record $k$ is then $V_{k} (s, μ) = (1 - γ) \sum_{t = 0}^{\infty} γ^{t} \sum_{k^{'}} ϕ_{k} {(s, μ)}^{t} (k^{'}) π_{k^{'}} (s, μ)$ . Note that we have normalized continuation payoffs by $(1 - γ)$ to express them in per-period terms. A player’s objective is to maximize the expected lifetime payoff.

A pair $(s, μ)$ is an equilibrium if $μ$ is a steady state under $s$ and, for each own record $k$ and opponent’s record $k^{'}$ , the prescribed action $s (k, k^{'}) \in {C, D}$ maximizes the expected lifetime payoff from the current period onward, given by $(1 - γ) u (a, s (k^{'}, k)) + γ \sum_{k^{″}} (ρ (k, a) [k^{″}]) V_{k^{″}} (s, μ)$ , over $a \in {C, D}$ , where $ρ (k, a) [k^{″}]$ denotes the probability that a player with record $k$ who takes action $a$ acquires next-period record $k^{″}$ . Note that this expression depends on the opponent’s record only through the predicted current-period opponent action, $s (k^{'}, k)$ . In addition, the ratio $(1 - γ) / γ$ captures the weight that players place on their current payoff relative to their continuation payoff from tomorrow on. We study limits where this ratio converges to 0, as opposed to time-average payoffs which give exactly 0 wt to any one period’s payoff, because in the latter case optimization and equilibrium impose unduly weak restrictions (44). An equilibrium is strict if the maximizer is unique for all pairs $(k, k^{'})$ ; i.e., the optimal action is always unique. Note that this equilibrium definition allows agents to maximize over all possible strategies, as opposed to only strategies from some preselected set. We focus on strict equilibria because they are robust: They remain equilibria under “small” perturbations of the model. Note that the strategy “always defect,” i.e., $s (k, k') = D$ for all $(k, k^{'})$ , together with any steady state is always a strict equilibrium. SI Appendix, Lemma 2 characterizes the steady states for any $G r i m K$ strategy, as well as the $γ, ε, g, l$ parameters for which the steady states are equilibria.

Limit Cooperation under $G r i m K$ Strategies.

Under $G r i m K$ strategies, a matched pair of players cooperate if and only if both records are below a prespecified cutoff $K$ : That is, $s (k, k^{'}) = C$ if $max {k, k^{'}} < K$ , and $s (k, k^{'}) = D$ if $max {k, k^{'}} \geq K$ .

We call an individual a cooperator if the individual’s record is below $K$ and a defector otherwise. Note that individuals may be a cooperator for some periods of their life and a defector for other periods, rather than being preprogrammed to cooperate or defect for their entire life.

Given an equilibrium strategy $G r i m K$ , let $μ^{C} = \sum_{k = 0}^{K - 1} μ_{k}$ denote the corresponding steady-state share of cooperators. Note that, in a steady state with cooperator share $μ^{C}$ , mutual cooperation is played in share ${(μ^{C})}^{2}$ of all matches. Let ${\bar{μ}}^{C} (γ, ε)$ be the maximal share of cooperators in any tolerant grim trigger equilibrium (allowing for every possible $K$ ) when the survival probability is $γ$ and the noise level is $ε$ .

SI Appendix, Theorem 1 characterizes the performance of equilibria in $G r i m K$ strategies in the double limit where the survival probability approaches 1—so that players expect to live a long time and the “shadow of the future” looms large—and the noise level approaches 0—so that records are reliable enough to form the basis for incentives. [This long-lifespan/low-noise limit is the leading case of interest in theoretical analyses of indirect reciprocity (8, 45–49).] SI Appendix, Theorem 1 shows that, in the double limit $(γ, ε) \to (1,0)$ , ${\bar{μ}}^{C} (γ, ε)$ converges to $l / (1 + l)$ when $g < l / (1 + l)$ and converges to 0 when $g > l / (1 + l)$ . The formal statement and proof of this result are contained in SI Appendix.

Barring knife-edge cases, tolerant grim trigger strategies can thus robustly support positive cooperation in the double limit $(γ, ε) \to (1,0)$ if and only if the gain from defecting against a partner who cooperates is significantly smaller than the loss from cooperating against a partner who defects: $g < l / (1 + l)$ . Moreover, the maximum level of cooperation in this case is $l / (1 + l)$ . Here we explain the logic of this result.

We first show that $g < μ^{C}$ in any $G r i m K$ equilibrium. Newborn individuals have continuation payoff equal to the average payoff in the population, which is ${(μ^{C})}^{2}$ . Thus, since a newborn player plays $C$ if and only if matched with a cooperator, ${(μ^{C})}^{2} = (1 - γ) μ^{C} + γ μ^{C} V_{0}^{C} + γ (1 - μ^{C}) V_{0}^{D}$ , where $V_{0}^{C}$ and $V_{0}^{D}$ are the expected continuation payoffs of a newborn player after playing $C$ and $D$ , respectively. Newborn players have the highest continuation payoff in the population, so $V_{0}^{C} \leq V_{0} = {(μ^{C})}^{2}$ . For a newborn player to prefer not to cheat a cooperative partner, it must be that $V_{0}^{D} < V_{0}^{C} - (1 - γ) g / γ$ , so when $μ^{C} < 1$ (as is necessarily the case with any noise),

{(μ^{C})}^{2} < (1 - γ) μ^{C} + γ {(μ^{C})}^{2} - (1 - γ) (1 - μ^{C}) g .

This inequality can hold only if $g < μ^{C}$ .

We next show that $γ (1 - ε) μ^{C} < l / (1 + l)$ in any $G r i m K$ equilibrium. The continuation payoff $V_{K - 1}$ of an individual with record $K - 1$ satisfies $V_{K - 1} = (1 - γ) μ^{C} + γ (1 - ε) μ^{C} V_{K - 1}$ , or $V_{K - 1} = (1 - γ) μ^{C} / (1 - γ (1 - ε) μ^{C})$ . A necessary condition for an individual with record $K - 1$ to prefer to play $D$ against a defector partner is $(1 - γ) (- l) + γ (1 - ε) V_{K - 1} < 0$ , or $l > γ (1 - ε) V_{K - 1} / (1 - γ)$ . Combining this inequality with the expression for $V_{K - 1}$ yields $γ (1 - ε) μ^{C} < l / (1 + l)$ , which in the $(γ, ε) \to (1,0)$ limit gives $μ^{C} \leq l / (1 + l)$ .

We have established that tolerant grim trigger strategies can support positive cooperation in the $(γ, ε) \to (1,0)$ limit only if $g \leq l / (1 + l)$ and that the maximum cooperation share cannot exceed $l / (1 + l)$ . The proof of SI Appendix, Theorem 1 is completed by showing that when $g < l / (1 + l)$ , by carefully choosing the tolerance level $K$ , $G r i m K$ can support cooperation shares arbitrarily close to any value between $g$ and $l / (1 + l)$ in equilibrium when the survival probability is close to 1 and the noise level is close to 0.

Limit Cooperation under General Trigger Strategies.

$G r i m K$ strategies are an instance of the more general class of trigger strategies, which are defined by the following properties: 1) The set of all possible records can be partitioned into two classes, good records $G$ and bad records $B$ . 2) Partners cooperate if and only if they both have good records: $s (k, k^{'}) = C$ for all pairs $(k, k^{'}) \in G \times G$ , and $s (k, k^{'}) = D$ for all other pairs $(k, k^{'})$ . 3) The class $B$ is absorbing: If $k \in B$ , then every record $k^{'}$ that can be reached starting at record $k$ is also in $B$ .

SI Appendix, Theorem 9 shows that, in the $(γ, ε) \to (1,0)$ double limit, the maximum steady-state share of good-record players that can be supported in any trigger strategy equilibrium converges to zero if $g > l / (1 + l)$ and converges to $l / (1 + l)$ if $g < l / (1 + l)$ . Thus, in this double limit, tolerant grim trigger strategies attain the most equilibrium cooperation that any trigger strategy can support.

The intuition for this result is that the necessary conditions $g < μ^{C}$ and $γ (1 - ε) μ^{C} < l / (1 + l)$ derived above for $G r i m K$ strategies apply equally to any trigger strategy. The argument to establish the necessity of $g < μ^{C}$ is similar to that for $G r i m K$ strategies, except we must now consider the incentives of a player with whichever record $k$ yields the greatest equilibrium continuation payoff, which is no longer necessarily a newborn (i.e., we may now have $k \neq 0$ ). The argument to establish necessity of $γ (1 - ε) μ^{C} < l / (1 + l)$ is also similar to that for $G r i m K$ strategies, but now we consider the incentives of any player with a “marginal” good record that will become bad if the player is recorded as playing one additional $D$ , which is no longer necessarily a player who has been recorded as playing $K - 1 D$ s for some fixed cutoff $K$ .

Convergence of $G r i m K$ Strategies.

Fix an arbitrary initial record distribution $μ^{0} \in Δ (N)$ . When all individuals use $G r i m K$ strategies, the population share with record $k$ at time $t$ , $μ_{k}^{t}$ , evolves according to

\begin{aligned} μ_{0}^{t + 1} & = 1 - γ + γ (1 - ε) μ^{C, t} μ_{0}^{t}, \\ μ_{k}^{t + 1} & = γ (1 - (1 - ε) μ^{C, t}) μ_{k - 1}^{t} + γ (1 - ε) μ^{C, t} μ_{k}^{t} for 0 < k < K, \end{aligned}

where $μ^{C, t} = \sum_{k = 0}^{K - 1} μ_{k}^{t}$ .

Fixing $K$ , we say that distribution $μ$ dominates (or is more favorable than) distribution $\tilde{μ}$ if, for every $k < K$ , $\sum_{\tilde{k} = 0}^{k} μ_{\tilde{k}} \geq \sum_{\tilde{k} = 0}^{k} {\tilde{μ}}_{\tilde{k}}$ ; that is, if for every $k < K$ the share of the population with record no worse than $k$ is greater under distribution $μ$ than under distribution $\tilde{μ}$ . Under the $G r i m K$ strategy, let $\bar{μ}$ denote the steady state with the largest share of cooperators, and let $\underline{μ}$ denote the steady state with the smallest share of cooperators.

SI Appendix, Theorem 12 shows that, if the initial record distribution is more favorable than $\bar{μ}$ , then the record distribution converges to $\bar{μ}$ ; similarly, if the initial record distribution is less favorable than $\underline{μ}$ , then the record distribution converges to $\underline{μ}$ . Formally, if $μ^{0}$ dominates $\bar{μ}$ , then $lim_{t \to \infty} μ^{t} = \bar{μ}$ ; similarly, if $μ^{0}$ is dominated by $\underline{μ}$ , then $lim_{t \to \infty} μ^{t} = \underline{μ}$ .

In Fig. 4A the blue trajectory corresponds to the initial distribution where all players have record 0, the red trajectory is constant at the unique steady-state value $μ^{C} \approx 0.2484$ , and the yellow trajectory corresponds to the initial distribution where all players have defector records. Here all of the trajectories converge to the unique steady state. In Fig. 4B, the red trajectory is constant at the largest steady-state value $μ^{C} \approx 0.9855$ , the yellow trajectory is constant at the intermediate steady-state value $μ^{C} \approx 0.9184$ , and the purple trajectory is constant at the smallest steady-state value $μ^{C} \approx 0.6471$ . The blue trajectory corresponds to the initial distribution where all players have record 0 and converges to the largest steady-state share of cooperators. The green trajectory corresponds to the initial distribution where all players have defector records and converges to the smallest steady-state share of cooperators.

Code Availability.

All simulations and numerical calculations have been performed with MATLAB R2017b and Wolfram Mathematica 11.3.0.0. In SI Appendix, we provide the MATLAB scripts used to generate Fig. 4 as well as those to simulate evolutionary dynamics and generate SI Appendix, Fig. S1.

Supplementary Material

Supplementary File

pnas.1921984117.sapp.pdf^{(642.2KB, pdf)}

Acknowledgments

This work was supported by National Science Foundation Grants SES-1643517 and SES-1555071 and Sloan Foundation Grant 2017-9633.

Footnotes

The authors declare no competing interest.

*However, the $g \neq l$ case has also received significant attention: For example, the seminal article of Axelrod and Hamilton (17) took $g = 1$ and $l = 1 / 2$ .

^†It would not substantively affect our results to assume that there is also noise when an individual plays $D$ , so we exclude this possibility for simplicity.

This article contains supporting information online at https://www.pnas.org/lookup/suppl/doi:10.1073/pnas.1921984117/-/DCSupplemental.

References

1.Nowak M. A., Sigmund K., Evolution of indirect reciprocity by image scoring. Nature 393, 573–577 (1998). [DOI] [PubMed] [Google Scholar]
2.Nowak M. A., Sigmund K., The dynamics of indirect reciprocity. J. Theor. Biol. 194, 561–574 (1998). [DOI] [PubMed] [Google Scholar]
3.Sugden R., New developments in the theory of choice under uncertainty. Bull. Econ. Res. 38, 1–24 (1986). [Google Scholar]
4.Kandori M., Social norms and community enforcement. Rev. Econ. Stud. 59, 63 (1992). [Google Scholar]
5.Leimar O., Hammerstein P., Evolution of cooperation through indirect reciprocity. Proc. R. Soc. Lond. Ser. B Biol. Sci. 268, 745–753 (2001). [DOI] [PMC free article] [PubMed] [Google Scholar]
6.Panchanathan K., Boyd R., A tale of two defectors: The importance of standing for evolution of indirect reciprocity. J. Theor. Biol. 224, 115–126 (2003). [DOI] [PubMed] [Google Scholar]
7.Nowak M. A., Sigmund K., Evolution of indirect reciprocity. Nature 437, 1291–1298 (2005). [DOI] [PubMed] [Google Scholar]
8.Takahashi S., Community enforcement when players observe partners’ past play. J. Econ. Theor. 145, 42–62 (2010). [Google Scholar]
9.Heller Y., Mohlin E., Observations on cooperation. Rev. Econ. Stud. 85, 2253–2282 (2018). [Google Scholar]
10.Bhaskar V., Thomas C., Community enforcement of trust with bounded memory. Rev. Econ. Stud. 86, 1010–1032(2018). [Google Scholar]
11.Klein B., Leffler K. B., The role of market forces in assuring contractual performance. J. Polit. Econ. 89, 615–641 (1981). [Google Scholar]
12.Hardin G., The tragedy of the commons. Science 162, 1243–1248 (1968). [PubMed] [Google Scholar]
13.Ostrom E., Governing the Commons: The Evolution of Institutions for Collective Action (Cambridge University Press, 1990). [Google Scholar]
14.Coate S., Ravallion M., Reciprocity without commitment: Characterization and performance of informal insurance arrangements. J. Dev. Econ. 40, 1–24 (1993). [Google Scholar]
15.Dugatkin L. A., Cooperation Among Animals: An Evolutionary Perspective (Oxford University Press on Demand, 1997). [Google Scholar]
16.Sigmund K., The Calculus of Selfishness (Princeton University Press, 2010), vol. 6. [Google Scholar]
17.Axelrod R., Hamilton W. D., The evolution of cooperation. Science (New York, N.Y.) 211, 1390–1396 (1981). [DOI] [PubMed] [Google Scholar]
18.Bulow J. I., Geanakoplos J. D., Klemperer P. D., Multimarket oligopoly: Strategic substitutes and complements. J. Polit. Econ. 93, 488–511 (1985). [Google Scholar]
19.Fudenberg D., Tirole J., The fat-cat effect, the puppy-dog ploy, and the lean and hungry look. Am. Econ. Rev. 74, 361–366 (1984). [Google Scholar]
20.Zahavi A., Altruism as a handicap: The limitations of kin selection and reciprocity. J. Avian Biol. 26, 1–3 (1995). [Google Scholar]
21.Friedman J. W., A non-cooperative equilibrium for supergames. Rev. Econ. Stud. 38, 1–12 (1971). [Google Scholar]
22.Smith J. M., Evolution and the Theory of Games (Cambridge University Press, 1982). [Google Scholar]
23.Weibull J. W., Evolutionary Game Theory (MIT Press, 1997). [Google Scholar]
24.Resnick P., Kuwabara K., Zeckhauser R., Friedman E., Reputation systems. Commun. ACM 43, 45–48 (2000). [Google Scholar]
25.Dellarocas C., Reputation mechanism design in online trading environments with pure moral hazard. Inf. Syst. Res. 16, 209–230 (2005). [Google Scholar]
26.Klein D. B., Promise keeping in the great society: A model of credit information sharing. Econ. Polit. 4, 117–136 (1992). [Google Scholar]
27.Kocherlakota N., Wallace N., Incomplete record-keeping and optimal payment arrangements. J. Econ. Theor. 81, 272–289 (1998). [Google Scholar]
28.Biais B., Bisiere C., Bouvard M., Casamatta C., The blockchain folk theorem. Rev. Financ. Stud. 32, 1662–1715 (2019). [Google Scholar]
29.Fearon J. D., Laitin D. D., Explaining interethnic cooperation. Am. Polit. Sci. Rev. 90, 715–735 (1996). [Google Scholar]
30.Fehr E., Gächter S., Fairness and retaliation: The economics of reciprocity. J. Econ. Perspect. 14, 159–181 (2000). [Google Scholar]
31.Bhui R., Chudek M., Henrich J., How exploitation launched human cooperation. Behav. Ecol. Sociobiol. 73, 78 (2019). [Google Scholar]
32.Boyd R., Gintis H., Bowles S., Richerson P. J., The evolution of altruistic punishment. Proc. Natl. Acad. Sci. U.S.A. 100, 3531–3535 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Henrich J., et al. , Costly punishment across human societies. Science 312, 1767–1770 (2006). [DOI] [PubMed] [Google Scholar]
34.Fudenberg D., Harris C., Evolutionary dynamics with aggregate shocks. J. Econ. Theor. 57, 420–441 (1992). [Google Scholar]
35.Uchida S., Effect of private information on indirect reciprocity. Phys. Rev. 82, 036111 (2010). [DOI] [PubMed] [Google Scholar]
36.Hilbe C., Schmid L., Tkadlec J., Chatterjee K., Nowak M. A., Indirect reciprocity with private, noisy, and incomplete information. Proc. Natl. Acad. Sci. U.S.A. 115, 12241–12246 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Ohtsuki H., Iwasa Y., Nowak M. A., Reputation effects in public and private interactions. PLoS Comput. Biol. 11, e1004527 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Fudenberg D., He K., Learning and type compatibility in signaling games. Econometrica 86, 1215–1255 (2018). [Google Scholar]
39.Le S., Boyd R., Evolutionary dynamics of the continuous iterated prisoner’s dilemma. J. Theor. Biol. 245, 258–267 (2007). [DOI] [PubMed] [Google Scholar]
40.Fudenberg D., Maskin E., Evolution and cooperation in noisy repeated games. Am. Econ. Rev. Pap. Proc. 80, 274–279 (1990). [Google Scholar]
41.Fudenberg D., Rand D. G., Dreber A., Slow to anger and fast to forgive: Cooperation in an uncertain world. Am. Econ. Rev. 102, 720–749 (2012). [Google Scholar]
42.McNamara J. M., Barta Z., Houston A. I., Variation in behaviour promotes cooperation in the prisoner’s dilemma game. Nature 428, 745–748 (2004). [DOI] [PubMed] [Google Scholar]
43.Bendor J., Kramer R. M., Stout S., When in doubt... cooperation in a noisy prisoner’s dilemma. J. Conflict Resolut. 35, 691–719 (1991). [Google Scholar]
44.Fudenberg D., Maskin E., The folk theorem in repeated games with discounting or with incomplete information. Econometrica 54, 533–554 (1986). [Google Scholar]
45.Ohtsuki H., Iwasa Y., How should we define goodness?—Reputation dynamics in indirect reciprocity. J. Theor. Biol. 231, 107–120 (2004). [DOI] [PubMed] [Google Scholar]
46.Ohtsuki H., Iwasa Y., The leading eight: Social norms that can maintain cooperation by indirect reciprocity. J. Theor. Biol. 239, 435–444 (2006). [DOI] [PubMed] [Google Scholar]
47.Brandt H., Sigmund K., Indirect reciprocity, image scoring, and moral hazard. Proc. Natl. Acad. Sci. U.S.A. 102, 2666–2670 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
48.Ellison G., Cooperation in the prisoner’s dilemma with anonymous random matching. Rev. Econ. Stud. 61, 567–588 (1994). [Google Scholar]
49.Hörner J., Olszewski W., The folk theorem for games with private almost-perfect monitoring. Econometrica 74, 1499–1544 (2006). [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary File

pnas.1921984117.sapp.pdf^{(642.2KB, pdf)}

[r1] 1.Nowak M. A., Sigmund K., Evolution of indirect reciprocity by image scoring. Nature 393, 573–577 (1998). [DOI] [PubMed] [Google Scholar]

[r2] 2.Nowak M. A., Sigmund K., The dynamics of indirect reciprocity. J. Theor. Biol. 194, 561–574 (1998). [DOI] [PubMed] [Google Scholar]

[r3] 3.Sugden R., New developments in the theory of choice under uncertainty. Bull. Econ. Res. 38, 1–24 (1986). [Google Scholar]

[r4] 4.Kandori M., Social norms and community enforcement. Rev. Econ. Stud. 59, 63 (1992). [Google Scholar]

[r5] 5.Leimar O., Hammerstein P., Evolution of cooperation through indirect reciprocity. Proc. R. Soc. Lond. Ser. B Biol. Sci. 268, 745–753 (2001). [DOI] [PMC free article] [PubMed] [Google Scholar]

[r6] 6.Panchanathan K., Boyd R., A tale of two defectors: The importance of standing for evolution of indirect reciprocity. J. Theor. Biol. 224, 115–126 (2003). [DOI] [PubMed] [Google Scholar]

[r7] 7.Nowak M. A., Sigmund K., Evolution of indirect reciprocity. Nature 437, 1291–1298 (2005). [DOI] [PubMed] [Google Scholar]

[r8] 8.Takahashi S., Community enforcement when players observe partners’ past play. J. Econ. Theor. 145, 42–62 (2010). [Google Scholar]

[r9] 9.Heller Y., Mohlin E., Observations on cooperation. Rev. Econ. Stud. 85, 2253–2282 (2018). [Google Scholar]

[r10] 10.Bhaskar V., Thomas C., Community enforcement of trust with bounded memory. Rev. Econ. Stud. 86, 1010–1032(2018). [Google Scholar]

[r11] 11.Klein B., Leffler K. B., The role of market forces in assuring contractual performance. J. Polit. Econ. 89, 615–641 (1981). [Google Scholar]

[r12] 12.Hardin G., The tragedy of the commons. Science 162, 1243–1248 (1968). [PubMed] [Google Scholar]

[r13] 13.Ostrom E., Governing the Commons: The Evolution of Institutions for Collective Action (Cambridge University Press, 1990). [Google Scholar]

[r14] 14.Coate S., Ravallion M., Reciprocity without commitment: Characterization and performance of informal insurance arrangements. J. Dev. Econ. 40, 1–24 (1993). [Google Scholar]

[r15] 15.Dugatkin L. A., Cooperation Among Animals: An Evolutionary Perspective (Oxford University Press on Demand, 1997). [Google Scholar]

[r16] 16.Sigmund K., The Calculus of Selfishness (Princeton University Press, 2010), vol. 6. [Google Scholar]

[r17] 17.Axelrod R., Hamilton W. D., The evolution of cooperation. Science (New York, N.Y.) 211, 1390–1396 (1981). [DOI] [PubMed] [Google Scholar]

[r18] 18.Bulow J. I., Geanakoplos J. D., Klemperer P. D., Multimarket oligopoly: Strategic substitutes and complements. J. Polit. Econ. 93, 488–511 (1985). [Google Scholar]

[r19] 19.Fudenberg D., Tirole J., The fat-cat effect, the puppy-dog ploy, and the lean and hungry look. Am. Econ. Rev. 74, 361–366 (1984). [Google Scholar]

[r20] 20.Zahavi A., Altruism as a handicap: The limitations of kin selection and reciprocity. J. Avian Biol. 26, 1–3 (1995). [Google Scholar]

[r21] 21.Friedman J. W., A non-cooperative equilibrium for supergames. Rev. Econ. Stud. 38, 1–12 (1971). [Google Scholar]

[r22] 22.Smith J. M., Evolution and the Theory of Games (Cambridge University Press, 1982). [Google Scholar]

[r23] 23.Weibull J. W., Evolutionary Game Theory (MIT Press, 1997). [Google Scholar]

[r24] 24.Resnick P., Kuwabara K., Zeckhauser R., Friedman E., Reputation systems. Commun. ACM 43, 45–48 (2000). [Google Scholar]

[r25] 25.Dellarocas C., Reputation mechanism design in online trading environments with pure moral hazard. Inf. Syst. Res. 16, 209–230 (2005). [Google Scholar]

[r26] 26.Klein D. B., Promise keeping in the great society: A model of credit information sharing. Econ. Polit. 4, 117–136 (1992). [Google Scholar]

[r27] 27.Kocherlakota N., Wallace N., Incomplete record-keeping and optimal payment arrangements. J. Econ. Theor. 81, 272–289 (1998). [Google Scholar]

[r28] 28.Biais B., Bisiere C., Bouvard M., Casamatta C., The blockchain folk theorem. Rev. Financ. Stud. 32, 1662–1715 (2019). [Google Scholar]

[r29] 29.Fearon J. D., Laitin D. D., Explaining interethnic cooperation. Am. Polit. Sci. Rev. 90, 715–735 (1996). [Google Scholar]

[r30] 30.Fehr E., Gächter S., Fairness and retaliation: The economics of reciprocity. J. Econ. Perspect. 14, 159–181 (2000). [Google Scholar]

[r31] 31.Bhui R., Chudek M., Henrich J., How exploitation launched human cooperation. Behav. Ecol. Sociobiol. 73, 78 (2019). [Google Scholar]

[r32] 32.Boyd R., Gintis H., Bowles S., Richerson P. J., The evolution of altruistic punishment. Proc. Natl. Acad. Sci. U.S.A. 100, 3531–3535 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]

[r33] 33.Henrich J., et al. , Costly punishment across human societies. Science 312, 1767–1770 (2006). [DOI] [PubMed] [Google Scholar]

[r34] 34.Fudenberg D., Harris C., Evolutionary dynamics with aggregate shocks. J. Econ. Theor. 57, 420–441 (1992). [Google Scholar]

[r35] 35.Uchida S., Effect of private information on indirect reciprocity. Phys. Rev. 82, 036111 (2010). [DOI] [PubMed] [Google Scholar]

[r36] 36.Hilbe C., Schmid L., Tkadlec J., Chatterjee K., Nowak M. A., Indirect reciprocity with private, noisy, and incomplete information. Proc. Natl. Acad. Sci. U.S.A. 115, 12241–12246 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]

[r37] 37.Ohtsuki H., Iwasa Y., Nowak M. A., Reputation effects in public and private interactions. PLoS Comput. Biol. 11, e1004527 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]

[r38] 38.Fudenberg D., He K., Learning and type compatibility in signaling games. Econometrica 86, 1215–1255 (2018). [Google Scholar]

[r39] 39.Le S., Boyd R., Evolutionary dynamics of the continuous iterated prisoner’s dilemma. J. Theor. Biol. 245, 258–267 (2007). [DOI] [PubMed] [Google Scholar]

[r40] 40.Fudenberg D., Maskin E., Evolution and cooperation in noisy repeated games. Am. Econ. Rev. Pap. Proc. 80, 274–279 (1990). [Google Scholar]

[r41] 41.Fudenberg D., Rand D. G., Dreber A., Slow to anger and fast to forgive: Cooperation in an uncertain world. Am. Econ. Rev. 102, 720–749 (2012). [Google Scholar]

[r42] 42.McNamara J. M., Barta Z., Houston A. I., Variation in behaviour promotes cooperation in the prisoner’s dilemma game. Nature 428, 745–748 (2004). [DOI] [PubMed] [Google Scholar]

[r43] 43.Bendor J., Kramer R. M., Stout S., When in doubt... cooperation in a noisy prisoner’s dilemma. J. Conflict Resolut. 35, 691–719 (1991). [Google Scholar]

[r44] 44.Fudenberg D., Maskin E., The folk theorem in repeated games with discounting or with incomplete information. Econometrica 54, 533–554 (1986). [Google Scholar]

[r45] 45.Ohtsuki H., Iwasa Y., How should we define goodness?—Reputation dynamics in indirect reciprocity. J. Theor. Biol. 231, 107–120 (2004). [DOI] [PubMed] [Google Scholar]

[r46] 46.Ohtsuki H., Iwasa Y., The leading eight: Social norms that can maintain cooperation by indirect reciprocity. J. Theor. Biol. 239, 435–444 (2006). [DOI] [PubMed] [Google Scholar]

[r47] 47.Brandt H., Sigmund K., Indirect reciprocity, image scoring, and moral hazard. Proc. Natl. Acad. Sci. U.S.A. 102, 2666–2670 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]

[r48] 48.Ellison G., Cooperation in the prisoner’s dilemma with anonymous random matching. Rev. Econ. Stud. 61, 567–588 (1994). [Google Scholar]

[r49] 49.Hörner J., Olszewski W., The folk theorem for games with private almost-perfect monitoring. Econometrica 74, 1499–1544 (2006). [Google Scholar]

PERMALINK

Indirect reciprocity with simple records

Daniel Clark

Drew Fudenberg

Alexander Wolitzky

Significance

Abstract

Fig. 1.