Directed reciprocity subverts cooperation in highly adaptive populations

Jean-Jacques Herings; Ronald Peeters; Anastas P Tenev

doi:10.1038/s41598-024-80408-z

. 2024 Nov 26;14:29311. doi: 10.1038/s41598-024-80408-z

Directed reciprocity subverts cooperation in highly adaptive populations

Jean-Jacques Herings ¹, Ronald Peeters ^2,^✉, Anastas P Tenev ³

PMCID: PMC11599575 PMID: 39592712

Abstract

We examine the generally accepted hypothesis that directed reciprocity is a powerful driver for cooperation. To do so, we consider a framework where agents situated on a circle network interact with their neighbors and have the choice to be egoistic, altruistic, or partially cooperative. We study the interaction between reciprocity, the likelihood that an agent reproduces value to the neighbor who has recently produced value for the agent, and inertia, the tendency of agents to repeat their previous choices even if other strategies are more successful. On the basis of extensive simulations, we conclude that for high levels of inertia, reciprocity enhances cooperation, while for low levels of inertia reciprocity rather subverts cooperation. For intermediate levels of inertia, we find a U-shaped effect. Reciprocity therefore interacts with the level of inertia in a non-monotonic fashion.

Supplementary Information

The online version contains supplementary material available at 10.1038/s41598-024-80408-z.

Keywords: Social dilemma, Cooperation, Reciprocity, Inertia

Subject terms: Human behaviour, Evolution

Introduction

Understanding what motivates cooperation on the individual level and what drives cooperation on the population level, particularly within social dilemma situations, is a major topic of study across the social sciences^1–4. Direct reciprocity, as captured by the “you scratch my back, and I’ll scratch yours” or the “I scratch your back, and you’ll scratch mine” principle, is generally considered to be a powerful mechanism for producing high levels of cooperation^5,6. Direct reciprocity is listed as one of the five rules for the evolution of cooperation, together with kin selection, group selection and two other reciprocity-based rules: indirect reciprocity and network reciprocity⁷.

Within evolutionary models of natural selection involving heterogeneous behavioral types, reciprocators are often implemented via (forgiving or generous) tit-for-tat play^8,9. Experimental work¹⁰ suggests that subjects who realize the value of cooperation would establish a punishment system to guarantee in-group cooperation and not so much to induce cooperation from others directly through cooperative actions; alternatively they could develop some kind of credible commitment devices¹¹. Other literature stresses the importance of strong reciprocators who punish norm violators, even though they receive lower payoffs^12,13. Theoretical studies have established that cooperation can be sustained in a local interaction framework, where agents are imitators^3,14–17.

The model of Herings, Peeters, Tenev and Thuijsman (henceforth, HPTT)¹⁸ builds on the study of Eshel, Samuelson and Shaked (henceforth, ESS)¹⁴ to investigate how sustainable cooperation can be in the presence of partial cooperators. The model has a number of agents positioned on a circular network interacting with their two neighbors, with each of the agents choosing either an egoistic, an altruistic or a partially cooperative strategy. Egoists do not produce any value for their neighbors, altruists produce value for both of their neighbors, and partial cooperators produce value for only one of their neighbors. The flip of a fair coin decides which of the two neighbors the partial cooperator produces value for. In every period, agents revise their strategies and adopt the one that was on average most successful among the strategies observed in their immediate neighborhood. In this study we report on findings obtained via extensive numerical simulations of this dynamic process, where we vary two parameters which were held fixed in the previous study: the levels of the updating inertia and the reciprocity probability.

First, reciprocity (captured by the parameter Inline graphic ) is used to allow partial cooperators to discriminate between the two neighbors based on the previous period’s “kindness” towards them. Discriminating strategies have been found to foster cooperation¹⁹. The situation reflects the flip of the fair coin in HPTT¹⁸. A value of Inline graphic above 1/2 implements direct reciprocity: agents are more likely to produce value to the neighbor who has recently produced value for them. A value of below 1/2 does the opposite: agents are less likely to produce value to the neighbor who has recently produced value for them. Although at first glance this seems to be in the spirit of indirect reciprocity as captured by the “you scratch my back, and I’ll scratch someone else’s” or the “I’ll scratch your back, and you’ll scratch someone else’s” principle^7,20, within our context it is probably more appropriate to label such behavior ‘antireciprocity’, and this is the nomenclature we will implement henceforth. While the latter type of behavior may appear unnatural, it aligns with the ‘Pay It Forward’ idea of encouraging a positive chain of altruistic acts; though, returning the favor directly remains a possibility in our framework. This type of behavior is also referred to as ‘serial reciprocity’²¹.

Second, inertia (captured by the parameter Inline graphic ) refers to the agents’ tendency to repeat previous decisions even after having received disconfirming information^22,23. Such inertia can be a consequence of technological restrictions, but also result from behavioral factors such as status-quo bias, stubbornness, and procrastination^24–26. In our simulations inertia means that agents stick to their current strategy with probability Inline graphic , even if another strategy was observed to be more successful. That is, inertia is exogenous to realized outcomes, independent of time and length of strategy usage, and homogeneous across the population. Recent research has considered implementations of inertia in the context of cooperation, where inertia is endogenous to observed payoff changes resulting from recent updates and observed payoff differences to other reference individuals^27–30, where inertia is increasing in the length a certain strategy has been used (habit formation)³¹, and where inertia is heterogeneous in society^32–34 including the presence of ‘zealous cooperators’ who never update their strategy³⁵. Partly due to the variations in how inertia has been implemented, its impact on the evolution of cooperative behavior has both been found to be positive^{32–34,36–38} and to be negative^35,39. One article⁴⁰ finds a non-monotonic relationship between inertia and cooperation; its results suggest that small inertia impedes cooperators, large inertia keeps the cooperation level the same as in the initial state, while medium inertia induces the greatest cooperation.

Although inertia produces higher levels of cooperation in our study (in line with some of the existing literature), we also document a remarkable influence in the role of reciprocity, and find the effect of reciprocity to interact with the level of inertia. While for high levels of inertia we find reciprocity to enhance cooperation, for low levels of inertia it rather subverts cooperation. The latter effect is caused by (i) reciprocity making the partially cooperative strategy (which is only half as cooperative as the altruist strategy) strong relative to the altruistic strategy (ii) while not being effective in eliminating the egoist strategy in case of low inertia.

Model setup and methods

There are Inline graphic agents situated on a circle network (see Fig. 2). Agents interact with their two direct neighbors and exhibit either egoistic or altruistic behavior towards each of them. All agents have three possible strategies at their disposal: a fully altruistic strategy A, a fully egoistic strategy E, and a partially cooperative strategy denoted by P.

Altruistic acts/contributions are directed: they produce a benefit to the contributor’s neighbors these acts are targeted at, but come at a cost to the contributor. The altruistic strategy (A) targets both neighbors. In contrast, agents who have adopted strategy E have no costs as they refrain from altruistic contributions altogether; yet, this does not preclude them from benefiting from their neighbors’ contributions targeted towards them. The partially cooperative strategy (P), however, enables agents to be altruistic to only one of their neighbors. Strategy P manifests itself in two possible decisions: L, representing altruistic behavior towards the left-hand neighbor and egoistic behavior towards the right-hand one, and R, representing altruistic behavior towards the right-hand neighbor and egoistic behavior towards the left-hand one. The P-strategy realizes as either L or R.

Without loss of generality, the value of a single altruistic contribution is normalized to 1. Hence, an agent who uses (i) strategy A provides value 1 to each of the neighbours; (ii) strategy E provides no value for the neighbours; and (iii) strategy P provides value 1 to only one of the two neighbors. While strategy E is costless, each altruistic act comes with a cost c, so that strategy P costs c and strategy A costs 2c. It is assumed that Inline graphic . The condition ensures that for all agents strategy E is the best reply against any play of the other agents; hence, it is the only rationalizable strategy. The condition ensures that the optimal strategy of an imitator is uniquely determined and that cooperative behavior is not impossibly costly. The level of cooperation in society is gauged by the preponderance of altruist acts, with more altruist acts corresponding to higher levels of cooperation. The socially efficient outcome is achieved when all agents use strategy A (with payoff Inline graphic for each agent and societal payoff of ), while when all agents use the only rationalizable strategy E, the societal payoff is 0. Hence, the situation constitutes a social dilemma. The possible payoffs of using any of the three strategies are summarized in Fig. 1, which once again highlights the fact that on an individual level it is most beneficial to employ E, while on a societal level it is best to use A. That is, fixing a specific pair of left-hand and right-hand neighbors, it is always individually better to use E, which corresponds to the lowest level of cooperative behavior, while the choice which produces the highest value to the neighbors (and is the most cooperative) is A.

Fig. 1 — The tables outline the possible payoffs for every strategy, when its left-hand neighbor uses the strategy/strategy realization in the leftmost column and its right-hand neighbor uses the strategy/strategy realization in the top row of the table. Below the table we specify the total value provided to immediate neighbors by an agent employing this strategy.

The model considers recurrent interaction within the fixed circular network. The agents’ behavior follows naive imitation, a heuristic decision rule, whereby at each stage they adopt the strategy that yielded the highest average payoff among the strategies observed in their immediate neighborhood; hence, only the agent’s own strategy and those of the agent’s immediate neighbors are in the consideration set.

In an extension of ESS¹⁴, this setup is used by HPTT¹⁸ who show that if strategy P allows contributions to either of the two neighbors with a strictly positive probability, there are five types of absorbing sets: (i) all-A, where all agents employ strategy A; (ii) all-P, where all agents employ strategy P; (iii) all-E, where all agents employ strategy E; (iv) mixed-A/E singleton absorbing sets in which A and E strategies coexist, but E’s exclusively appear in pockets of two adjacent agents; and (v) mixed-A/E non-singleton absorbing sets which cycle between two states, dubbed “blinkers”¹⁴, with pockets of adjacent E strategies alternating between singletons and triples, in addition to possibly pairs of E’s as in (iv). For a detailed description of the blinker states’ constitution, see Lemma 1 in HPTT¹⁸. For a description of the stationary states, see Proposition 1 there.

In choosing the topology of the circle network and the interaction structure, we follow the influential seminal paper by ESS¹⁴. The model as such does not correspond to a particular real-world situation, but is a metaphor for commonly occurring situations where agents interact much more with agents nearby than with agents far away and have a choice between acting more or less cooperatively. These agents can be either people living in a particular district, firms operating in geographically neighboring locations, or municipalities that interact with neighboring municipalities.

Building further on HPTT¹⁸, the current paper considers variations along two main dimensions:

Reciprocity Whenever the partially altruistic strategy P is employed, reciprocity captures the probability to act altruistically towards any of the neighbors depending on the outcome in the previous stage of the process. If only one neighbor of agent i created value 1 to agent i, then a strategy P used by agent i with probability results in providing value 1 to this neighbor and with probability to the other neighbor. If either both neighbors provided value 1 to agent i or none of them provided value 1, then strategy P treats both neighbors equally and results in an altruistic act towards only one of them as decided by a fair coin flip. Observe that the extreme case of corresponds to directed reciprocity, while can be dubbed antireciprocity. All cases in which capture a higher probability of directed reciprocity and will be referred to as directed reciprocity, while all cases in which capture a higher probability of antireciprocity and will be referred to as antireciprocity.
Inertia Inertia concerns the updating probability of all agents at every stage, i.e. how likely each agent is to implement the naive imitation rule specified above in every period of the imitation process. In every iteration, each agent keeps their strategy with probability and assumes the strategy which is best according to the decision rule with the remaining probability . This probability captures the ability to have a quick adaptive response to a changing environment or conversely the propensity to keep the status quo, hence the term inertia. Probability means no updating whatsoever, while implies that the agents reevaluate their chosen strategies at every stage of the process. Of course, in the latter case, the result of the process could still lead to an agent choosing the same strategy in two consecutive periods.

To illustrate the imitation dynamic, consider a circular network with Inline graphic agents. Let the agents start with the strategies (A, A, P, P, E, E). The strategies P played by the two middle agents can each realize in either an altruistic act towards the left neighbor or an altruistic act to the right neighbor, such that there are four possible ways in which the dynamic can progress. First, if the two middle agents’ strategies realize as (L, L), the six agents receive the payoffs Inline graphic such that on average, the best-performing strategies the agents observe within their neighborhood are (A, P, A, P, E, A), which is the strategy profile the agents would move to according to our imitation dynamic. This situation is depicted in Fig. 2. Second, if the strategies realize as (L, R), the agents receive the payoffs Inline graphic , and they would move to (A, A, A, E, E, E). Third, were the two strategies to realize as (R, L), and produce corresponding payoffs , the agents would move to (E, P, P, P, P, A). Fourth, if the strategies realize as (R, R), based on the payoffs agents would move to (E, P, P, E, E, E).

As shown above, with all agents updating ( Inline graphic ) there are four ways for the process to develop. However, positive inertia () can already result in as many as eight different states being reached after only the first possible realization (L, L) of the two P strategies in the starting state described above. When all agents update, three of the six agents change their strategy, and with positive inertia each of them can keep their current strategy with a positive probability. This means that instead of one, there are eight possible outcomes after a realization (L, L), which are summarized in Fig. 3: (i) all three change their strategy (ii) two of the three change their strategy (there are three such cases) (iii) one of the three changes the strategy (there are three such cases) (iv) all three keep their strategy.

Fig. 3 — All possible states which can result from the initial state (A, A, P, P, E, E) for . The positions in bold are the agents who want to revise their strategies, but might not do that due to positive inertia.

With all agents updating ( Inline graphic ), the realization (L, L) of the two P strategies leads to a total of eight altruistic acts in the next round. That is, in (A, P, A, P, E, A), there are three A’s, each of which provides two altruistic acts and there are two P’s, each of which provides one altruistic act. In contrast, the realization (R, R) only results in a total of two altruistic acts in the next round. That is, in (E, P, P, E, E, E) the two P’s produce one altruistic act each.

Ultimately, both parameters Inline graphic and affect the probability to act altruistically to a neighbor, but while the first one does this explicitly, the second one captures a factor which has an indirect effect on that. This paper focuses on comparing the incidence and efficiency of the absorbing states of the model by means of computer simulations. The situation Inline graphic and is extensively addressed in HPTT¹⁸, and serves as an important benchmark in the present study.

For the simulations, the size of the circle network is taken to be Inline graphic . In HPTT¹⁸, shows the most interesting variations, which are indicative of the results for greater values of n and do not suffer from the volatility observed at very small n. We used nine different values for the reciprocity parameter : 0.00, 0.05, 0.20, 0.35, 0.50, 0.65, 0.80, 0.95 and 1.00; and eight values for the inertia parameter Inline graphic : 0.00, 0.05, 0.20, 0.35, 0.50, 0.65, 0.80 and 0.95. For each pair , we followed the state transition process from initial state until convergence. In order to account for potential path dependencies, we used 171 different initial conditions, related to the number of A, P and E strategies at the onset. For each strategy, the number varied in multiples of three, and was never zero. For each of the 12,312 combinations of Inline graphic pairs and initial conditions, we run 1,000 simulations varying in how the A, P and E strategies were initially situated on the circle network. While the total number of runs, 12,312,000, may not appear extremely high, the number of iterations required for the process to converge is very large for high levels of inertia.

Results

This section presents the simulation outcomes. The main variable of interest is Efficiency, which is defined as the percentage of altruistic acts within the population. Notice, in this regard, that the total number of acts is 2n: two acts by each of the n agents. Strategy A produces two altruistic acts, strategy P one altruistic act and strategy E zero altruistic acts. In the present setting there is a one-to-one relation between the number of altruistic acts and the population’s aggregate payoff.

The graphs that are presented below show averages of the specific outcome variables, which are taken over the 1,000 runs of each of the 171 initial conditions. To assess the robustness of the reported findings in terms of their invariance to initial conditions, we divide the initial condition parameters into eight categories: many-A, many-P, many-E, few-A, few-P, few-E, mixed, and equal. The first seven categories partition the full set of initial conditions. The “equal” category is included in the “mixed” category and consists of the three initial conditions where all strategies are almost evenly represented. The initial conditions and the specifics of the categories are presented in Fig. 4. Overall, the findings we report are replicated within these subsamples; the rare exceptions will be discussed in the text.

Fig. 4 — Initial states and how they are categorized. Each hexagon represents one of the 171 initial conditions regarding the distribution of the initial seed of the three strategies. These initial conditions are grouped in different categories with the colors accentuating the different categories. The three categories labeled ‘many-X’ comprises of the 15 initial conditions where at least 42 out of 60 nodes are seeded with the X strategy. The three categories labeled ‘few-X’ comprises of the 27 initial conditions where at most 9 out of 60 nodes are seeded with the X strategy and there are no more than 42 nodes seeded with any of the other two strategies. The category labeled ‘mixed’ consists of the 45 initial conditions where for each of the three strategies at least 12 out of 60 nodes are seeded with this strategy. The category labeled ‘equal’ contains the 3 initial conditions in the category ‘mixed’ where the strategies are most evenly present. Finally, the category ‘all’ contains all 171 initial conditions.

Absorbing sets: efficiency

Figure 5 shows the average efficiency of the absorbing sets as a function of reciprocity ( Inline graphic ) for different levels of inertia (). The figure showcases the main findings of this paper, which are outlined and formulated below. The analysis shows that and have distinct effects regarding the ultimate level of cooperative behavior of the population, and that they interact in a non-trivial manner.

Fig. 5 — Efficiency. Efficiency of the imitation process relative to the reciprocity probability for some different values of the inertia parameter .

First, we consider the impact of inertia on efficiency. We see that, with the exception of the extreme Inline graphic , efficiency levels are increasing in .

Finding 1

For a given level of reciprocity, efficiency is increasing in the level of inertia.

Finding 1 implies that, for a given reciprocity level, a population benefits from inertia. The effect of inertia on efficiency is small at low levels of reciprocity, while it is large for high levels of reciprocity. At this stage, the reasons for this are not completely transparent, given that the same levels of efficiency can result from totally different states. For instance, a population comprising solely of partial cooperators is equally efficient as a population that is a perfect mix of altruists and egoists. The subsequent explorations will shed more light on the processes underlying Finding 1.

While there is high level of monotonicity in the relation between inertia and efficiency, this is not the case when considering the impact of directed reciprocity on efficiency. The latter relation is sensitive to the population’s level of inertia. This is observed in the figure: the curves are generally decreasing for low values of Inline graphic , they are U-shaped for intermediate values, and increasing for high values of .

Finding 2

For levels of inertia which are:

Low, efficiency is decreasing in directed reciprocity;
Medium, the relationship between efficiency and directed reciprocity is U-shaped;
High, efficiency is increasing in directed reciprocity.

Finding 2 implies that the impact of an increase in directed reciprocity on efficiency depends on the population’s level of inertia, and, if this is at medium level, it also depends on the level of directed reciprocity. The naive assumption that directed reciprocity boosts cooperation does not hold universally in our model. This is found to be true only for high levels of inertia, or at medium levels of inertia and already high levels of directed reciprocity. Otherwise, we observe an increase in directed reciprocity to be harmful. The next section explores this in more detail.

Findings 1 and 2 are not sensitive to changes in the initial conditions; the supporting graphs are presented in Fig. 6.

Absorbing sets: strategies

In order to understand better the findings related to efficiency, it is useful to consider the composition of the absorbing sets in terms of the strategies that are adopted. For instance, this helps identifying differences between the situation where all agents play P and the equally efficient situation where only half of them play A and the other half play E. Figure 7 presents the average percentage of the population that is using a particular strategy for the various levels of inertia and reciprocity, where each of the plots relates to one of the three different strategies. Again, the percentages presented are aggregated over all different initial conditions.

Fig. 7 — Absorbing sets: strategies. Percentage of strategies A, P and E in absorbing states.

Notice that for the P strategy, the plotted percentage is identical to the percentage of times the process converged to the all-P absorbing state, since the strategy P never co-exists with the other strategies in an absorbing set. For the A and E strategies, reported percentages are an aggregation over the all-A, all-E absorbing states respectively, and the two types of mixed-A/E absorbing sets. Figure 8 presents detailed information regarding the fraction of times a particular absorbing set has been reached. Comparing Fig. 8 with Fig. 7 makes clear that the mixed-A/E absorbing states are overall dominated by a greater number of A’s, while the all-A absorbing states are relatively infrequent. Therefore, the plot for strategy A in Fig. 7 shows a high correlation with the fraction of times the mixed-A/E absorbing set was reached, and the plot for strategy E is indicative of the fraction of times the all-E absorbing state was reached.

Fig. 8 — Frequencies of absorbing states. The graphs plot the fraction of times a certain type of absorbing set is reached. These fractions are plotted as a function of for various values of , where lighter colors refer to lower values of . The fractions are based on averages over all 171 initial conditions. The mixed-A/E singleton and non-singleton absorbing sets are pooled in one graph (labeled ‘mixed’).

We do not find a relationship between the level of inertia and the presence of a specific strategy that consistently applies to all reciprocity probabilities. However, we concisely report on some general tendencies below.

Finding 3

At most levels of directed reciprocity, inertia works against the spread of the egoist strategy and fosters the spread of the altruist strategy.

Next, we consider the impact of reciprocity. Like for efficiency, this impact varies across the different levels of inertia.

Finding 4

For levels of inertia which are:

Low, the spread of the altruist strategy is decreasing, and the spread of the partially cooperative and the egoist strategies are increasing in the reciprocity probability;
Medium, the spread of the altruist strategy is U-shaped in the reciprocity probability, while the presence of the partially cooperative/egoist strategy is increasing/decreasing;
High, the spread of the altruist/egoist strategy is increasing/decreasing in the reciprocity probability, while the spread of the partially cooperative strategy is inverse-U-shaped.

Overall, the observed impact of directed reciprocity on efficiency is mainly driven by the impact on the altruist and egoist strategies at all levels of inertia. Only at medium levels of inertia combined with high levels of directed reciprocity does the partially cooperative strategy have a substantial contribution to the generated efficiency (there is also a smaller positive effect for high levels of inertia and middle range of reciprocity probability).

The only negative effect of reciprocity on the likelihood to converge to an all-P state is for high level of inertia at high levels of reciprocity. Since the all-P states are not very efficient (overall, they achieve 50% of the potential efficiency), this is another channel through which inertia combined with directed reciprocity boosts efficiency.

Dynamics

To enhance the understanding about the full dynamics of the imitation process and how this is influenced by reciprocity and inertia, Fig. 9 presents the development of the proportions of every strategy from initial states with equal initial shares of the three strategies over the course of 1,000 iterations. The figure presents this for three values of the directed reciprocity parameter: low ( Inline graphic ; top graphs), medium (), and high (; bottom graphs); and three levels of inertia: low (; left graphs), medium () and high (; right graphs). Each of the graphs is based on aggregated data from 1,000 independent simulations of the dynamic process. Graphs for all values of and are available as Supplementary Information.

Beyond the unsurprising fact that an absorbing set is reached faster with lower levels of inertia, the imitation process is characterized by three phases.

Phase 1 attack of the egoists in the first phase, the E strategy eliminates the isolated A and P strategies. For all three reciprocity levels, we see this effect is larger at lower levels of inertia. Intuitively, inertia makes the A’s and P’s more resistant in this phase. This is because this gives them a time window with more opportunities to consolidate and form larger clusters, which are more resistant to the E’s. Further, for each of the three levels of inertia, we see that the E strategy is less successful for higher levels of reciprocity. Reciprocity helps small clusters of P and A strategies retain their cooperative attitudes.
Phase 2 the altruists strike back after the first phase, strong clusters of A’s and P’s have survived the attack of the egoists, and strike back. In this part of the process, the A and P strategies again benefit from a high level of inertia. Inertia ensures that during the slow but long march no losses are incurred and the E’s are whittled down. Overall, this happens because in this setup, clusters of identical strategies only change at their edges, and if one edge of a cluster of P’s or a cluster of A’s is preserved, this can help the whole cluster survive. Similarly, the A and P strategies again jointly profit from a high level of reciprocity. However, this benefits the P’s more than the A’s.
Phase 3 return of the partial cooperators after the E’s are decimated, the P’s find potential to combat the A’s. In this process the P’s again benefit from higher levels of reciprocity. At low levels of reciprocity, the A’s benefit from inertia; at high levels, the P’s benefit. Intuitively, in this case they can be locked into a mutually beneficial relationship longer. The only exception is the situation in the bottom-right of Fig. 9: high levels of inertia in combination with high levels of reciprocity prevent the tipping point for this phase being reached. Given the strategies found in the absorbing sets (see Fig. 7) this appears not to be an artefact from the dynamics only being displayed for the first 1,000 iterations.

In Finding 1 we report inertia to positively impact efficiency for (almost) all levels of reciprocity. Comparing each row of graphs in Fig. 9, we see that inertia leads to a less extreme drop in the A’s and the P’s during the first phase of the dynamics, and less E’s to survive the second phase of the dynamics. While the latter effect is smaller at lower levels of reciprocity, this is compensated by the A’s being stronger than the P’s in the third part of the process (where E’s remain constant).

In case of low levels of inertia, we document in Finding 2 that reciprocity negatively impacts efficiency. Looking in the first column of graphs in Fig. 9 we see that higher reciprocity leads to more A’s and P’s surviving the first phase, which would suggest the reversed impact. However, we find the share of E’s at the end of the second phase, and throughout the remainder of the process, not to be different for different levels of reciprocity. The negative impact of reciprocity on efficiency is caused by reciprocity making the P’s stronger relative to the A’s, and this effect being highly visible throughout the second and third phase of the process.

For medium levels of inertia, we found the impact of reciprocity on efficiency to be non-monotonic. This refers to the second column of graphs. Again, like for low levels of inertia, reciprocity makes the P’s stronger relative to the A’s, in particular throughout the second and third phase of the process. However, in the current case reciprocity unambiguously reduces the share of E’s that survive the first two phases. For low levels of reciprocity, the latter effect only dominates the former effect at higher levels of reciprocity, which explains the U-shaped effect on efficiency. We note that the U-shaped effect is, compared to Fig. 5, less prominent for medium level of inertia in Fig. 9. The notable difference is that Fig. 5 is based on averages over all initial states, while Fig. 9 is based on averages from the three initial states in the category “equal”.

Finally, for high levels of inertia, we found reciprocity to impact efficiency unambiguously positively. The third column of graphs represents this situation. Here we see that reciprocity has a negative impact on the E’s. Opposite to that, we find reciprocity to positively effect the A’s. There is a non-monotonic effect on the fraction of P’s, with a larger fraction of P’s surviving at intermediate levels of reciprocity (an effect that is better visible in Fig. 7). Nevertheless, this non-monotonic effect has no differential impact on efficiency given that it is dominated by the effect on the E’s and the A’s.

Discussion

We study the repeated interaction between agents situated on a circular network who have to choose between altruistic, egoistic, and partially cooperative actions. We examine the influence of two crucial parameters on the amount of cooperation in society: inertia and reciprocity. Inertia reflects the probability that agents in a given period do not consciously choose their action, but simply repeat the action they chose in the previous period. With the remaining probability, they take the action that generated the highest average payoffs in the previous period, where the choice is restricted to actions played by themselves and their neighbors in the previous period. Reciprocity corresponds to the probability that an altruistic action of their neighbors is responded to by an altruistic action in case of partially cooperative agents.

We find that inertia is always favorable for cooperation. However, contrary to conventional wisdom, reciprocity does not always stimulate cooperative behavior. The interaction between inertia and reciprocity is complicated. For low levels of inertia, reciprocity is harmful for cooperation, for intermediate levels of inertia the effect of reciprocity on cooperation is U-shaped, whereas for high levels of inertia, reciprocity is beneficial for cooperation. To better understand these effects, we subdivide the dynamic process in three stages: attack of the egoists, the altruists strike back, and return of the partial cooperators. We explain how inertia and reciprocity affect the behavior of agents during these stages.

Our model of interaction on a circular network is very stylized and invites further research on the effect of different network topologies on cooperation. Moreover, a further generalization could be to distinguish between the agents whose actions one observes and the agents who are affected by one’s actions.

Another avenue for further research concerns the extension of our framework to other forms of moral behavior^41,42. Recent works in social physics explore the evolution of trust⁴³ and honesty⁴⁴. A natural research question is therefore to study how inertia and reciprocity affects these types of behavior.

Supplementary Information

Supplementary Information.^{(1.8MB, pdf)}

Acknowledgements

A.P.T. gratefully acknowledges funding by the Hungarian National Research, Development and Innovation Office, Project Number K-143276.

Author contributions

J.J.H., R.P., and A.P.T. conceived the setup. R.P. conducted the simulations. All authors analysed the results and reviewed the manuscript.

Data availability

The study reports on data generated via simulations in Matlab. Matlab codes and the generated output are available for download from the OSF repository at 10.17605/osf.io/8jyeg.

Declarations

Competing interests

The authors declare no competing interests.

Footnotes

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1.Nowak, M. A. & May, R. M. Evolutionary games and spatial chaos. Nature359, 826–829 (1992). [Google Scholar]
2.Bergstrom, T. C. & Stark, O. How altruism can prevail in an evolutionary environment. Am. Econ. Rev.83, 149–155 (1993). [Google Scholar]
3.Levine, D. K. & Pesendorfer, W. The evolution of cooperation through imitation. Games Econom. Behav.58, 293–315 (2007). [Google Scholar]
4.Fosco, C. & Mengel, F. Cooperation through imitation and exclusion in networks. J. Econ. Dyn. Control35, 641–658 (2011). [Google Scholar]
5.Trivers, R. L. The evolution of reciprocal altruism. Q. Rev. Biol.46, 35–57. 10.1086/406755 (1971). [Google Scholar]
6.van Veelen, M., García, J., Rand, D. G. & Nowak, M. A. Direct reciprocity in structured populations. Proc. Natl. Acad. Sci.109, 9929–9934. 10.1073/pnas.1206694109 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Nowak, M. A. Five rules for the evolution of cooperation. Science314, 1560–1563. 10.1126/science.1133755 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Axelrod, R. & Hamilton, W. D. The evolution of cooperation. Science211, 1390–1396. 10.1126/science.7466396 (1981). [DOI] [PubMed] [Google Scholar]
9.Axelrod, R. The Evolution of Cooperation (Basic Books, 1984). [Google Scholar]
10.Yamagishi, T. The provision of a sanctioning system as a public good. J. Pers. Soc. Psychol.51, 110–116 (1986). [Google Scholar]
11.Ostrom, E., Walker, J. & Gardner, R. Covenants with and without a sword: Self-governance is possible. Am. Pol. Sci. Rev.86, 404–417 (1992). [Google Scholar]
12.Fehr, E. & Gächter, S. Altruistic punishment in humans. Nature415, 137–140 (2002). [DOI] [PubMed] [Google Scholar]
13.Bowles, S. & Gintis, H. The evolution of strong reciprocity: Cooperation in heterogeneous populations. Theor. Popul. Biol.65, 17–28. 10.1016/j.tpb.2003.07.001 (2004). [DOI] [PubMed] [Google Scholar]
14.Eshel, I., Samuelson, L. & Shaked, A. Altruists, egoists, and hooligans in a local interaction model. Am. Econ. Rev.88, 157–179 (1998). [Google Scholar]
15.Young, H. P. Individual Strategy and Social Structure: An Evolutionary Theory of Institutions (Princeton University Press, 1998). [Google Scholar]
16.Bergstrom, T. C. Evolution of social behavior: Individual and group selection. J. Econ. Perspect.16, 67–88 (2002).15179979 [Google Scholar]
17.Matros, A. Altruistic versus egoistic behavior in a public good game. J. Econ. Dyn. Control36, 642–656 (2012). [Google Scholar]
18.Herings, P.J.-J., Peeters, R., Tenev, A. P. & Thuijsman, F. Naïve imitation and partial cooperation in a local public goods model. J. Econ. Behav. Org.191, 162–185. 10.1016/j.jebo.2021.07.025 (2021). [Google Scholar]
19.Kirchkamp, O. Spatial evolution of automata in the prisoners’ dilemma. J. Econ. Behav. Org.43, 239–262 (2000). [Google Scholar]
20.Nowak, M. A. & Sigmund, K. Evolution of indirect reciprocity. Nature437, 1291–1298 (2005). [DOI] [PubMed] [Google Scholar]
21.Moody, M. Serial reciprocity: A preliminary statement. Soc. Theory26, 130–151 (2008). [Google Scholar]
22.Pitz, G. F. & Geller, E. S. Revision of opinion and decision times in an information-seeking task. J. Exp. Psychol.83, 400–405 (1970). [Google Scholar]
23.Traulsen, A., Semmann, D., Sommerfeld, R. D., Krambeck, H.-J. & Milinski, M. Human strategy updating in evolutionary games. Proc. Natl. Acad. Sci.107, 2962–2966 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Samuelson, W. & Zeckhauser, R. Status quo bias in decision making. J. Risk Uncertain.1, 7–59 (1988). [Google Scholar]
25.Kahneman, D., Knetsch, J. L. & Thaler, R. H. Anomalies: The endowment effect, loss aversion, and status quo bias. J. Econ. Perspect.5, 193–206 (1991). [Google Scholar]
26.O’Donoghue, T. & Rabin, M. Doing it now or later. Am. Econ. Rev.89, 103–124 (1999). [Google Scholar]
27.Jia, C.-X., Ma, L. & Liu, R.-R. Enhancing cooperation through payoff-related inertia in networked prisoner’s dilemma games. Chaos Solitons Fractals186, 115333 (2024). [Google Scholar]
28.Wang, C., Zhu, W. & Szolnoki, A. The conflict between self-interaction and updating passivity in the evolution of cooperation. Chaos Solitons Fractals173, 113667 (2023). [Google Scholar]
29.Wang, C., Zhu, W. & Szolnoki, A. When greediness and self-confidence meet in a social dilemma. Phys. A625, 129033 (2023). [Google Scholar]
30.Wang, C. & Szolnoki, A. Evolution of cooperation under a generalized death-birth process. Phys. Rev. E107, 024303 (2023). [DOI] [PubMed] [Google Scholar]
31.He, J., Wang, J. & Yu, F. Behavior inertia of individuals promotes cooperation in spatial prisoner’s dilemma game. Europhys. Lett.132, 38002. 10.1209/0295-5075/132/38002 (2020). [Google Scholar]
32.Szolnoki, A. & Szabó, G. Cooperation enhanced by inhomogeneous activity of teaching for evolutionary prisoner’s dilemma games. Europhys. Lett.77, 30004 (2007). [Google Scholar]
33.Szolnoki, A., Perc, M., Szabó, G. & Stark, H.-U. Impact of aging on the evolution of cooperation in the spatial prisoner’s dilemma game. Phys. Rev. E80, 021901. 10.1103/PhysRevE.80.021901 (2009). [DOI] [PubMed] [Google Scholar]
34.Chang, S., Zhang, Z., Wu, Y. & Xie, Y. Cooperation is enhanced by inhomogeneous inertia in spatial prisoner’s dilemma game. Physica A490, 419–425 (2018). [Google Scholar]
35.Wang, C. & Sun, C. Zealous cooperation does not always promote cooperation in public goods games. Chaos33, 063111 (2023). [DOI] [PubMed] [Google Scholar]
36.Liu, R.-R., Rong, Z., Jia, C.-X. & Wang, B.-H. Effects of diverse inertia on scale-free-networked prisoner’s dilemma games. Europhys. Lett.91, 20002 (2010). [Google Scholar]
37.Zhang, Y., Fu, F., Wu, T., Xie, G. & Wang, L. Inertia in strategy switching transforms the strategy evolution. Phys. Rev. E84, 066103 (2011). [DOI] [PubMed] [Google Scholar]
38.Jia, D., Jin, J., Du, C. & Shi, L. Effects of inertia on the evolution of cooperation in the voluntary prisoner’s dilemma game. Physica A509, 817–826 (2018). [Google Scholar]
39.Wang, C. & Szolnoki, A. Inertia in spatial public goods games under weak selection. Appl. Math. Comput.449, 127941 (2023). [Google Scholar]
40.Du, W.-B., Cao, X.-B., Liu, R.-R. & Wang, Z. Effects of inertia on evolutionary prisoner’s dilemma game. Commun. Theor. Phys.58, 451–455 (2012). [Google Scholar]
41.Capraro, V. & Perc, M. Grand challenges in social physics: In pursuit of moral behavior. Front. Phys.6, 107 (2018). [Google Scholar]
42.Capraro, V. & Perc, M. Mathematical foundations of moral preferences. J. R. Soc. Interface18, 20200880 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Kumar, A., Capraro, V. & Perc, M. The evolution of trust and trustworthiness. J. R. Soc. Interface17, 20200491 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Capraro, V., Perc, M. & Vilone, D. Lying on networks: The role of structure and topology in promoting honesty. Phys. Rev. E101, 032305 (2020). [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information.^{(1.8MB, pdf)}

Data Availability Statement

The study reports on data generated via simulations in Matlab. Matlab codes and the generated output are available for download from the OSF repository at 10.17605/osf.io/8jyeg.

[CR1] 1.Nowak, M. A. & May, R. M. Evolutionary games and spatial chaos. Nature359, 826–829 (1992). [Google Scholar]

[CR2] 2.Bergstrom, T. C. & Stark, O. How altruism can prevail in an evolutionary environment. Am. Econ. Rev.83, 149–155 (1993). [Google Scholar]

[CR3] 3.Levine, D. K. & Pesendorfer, W. The evolution of cooperation through imitation. Games Econom. Behav.58, 293–315 (2007). [Google Scholar]

[CR4] 4.Fosco, C. & Mengel, F. Cooperation through imitation and exclusion in networks. J. Econ. Dyn. Control35, 641–658 (2011). [Google Scholar]

[CR5] 5.Trivers, R. L. The evolution of reciprocal altruism. Q. Rev. Biol.46, 35–57. 10.1086/406755 (1971). [Google Scholar]

[CR6] 6.van Veelen, M., García, J., Rand, D. G. & Nowak, M. A. Direct reciprocity in structured populations. Proc. Natl. Acad. Sci.109, 9929–9934. 10.1073/pnas.1206694109 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR7] 7.Nowak, M. A. Five rules for the evolution of cooperation. Science314, 1560–1563. 10.1126/science.1133755 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR8] 8.Axelrod, R. & Hamilton, W. D. The evolution of cooperation. Science211, 1390–1396. 10.1126/science.7466396 (1981). [DOI] [PubMed] [Google Scholar]

[CR9] 9.Axelrod, R. The Evolution of Cooperation (Basic Books, 1984). [Google Scholar]

[CR10] 10.Yamagishi, T. The provision of a sanctioning system as a public good. J. Pers. Soc. Psychol.51, 110–116 (1986). [Google Scholar]

[CR11] 11.Ostrom, E., Walker, J. & Gardner, R. Covenants with and without a sword: Self-governance is possible. Am. Pol. Sci. Rev.86, 404–417 (1992). [Google Scholar]

[CR12] 12.Fehr, E. & Gächter, S. Altruistic punishment in humans. Nature415, 137–140 (2002). [DOI] [PubMed] [Google Scholar]

[CR13] 13.Bowles, S. & Gintis, H. The evolution of strong reciprocity: Cooperation in heterogeneous populations. Theor. Popul. Biol.65, 17–28. 10.1016/j.tpb.2003.07.001 (2004). [DOI] [PubMed] [Google Scholar]

[CR14] 14.Eshel, I., Samuelson, L. & Shaked, A. Altruists, egoists, and hooligans in a local interaction model. Am. Econ. Rev.88, 157–179 (1998). [Google Scholar]

[CR15] 15.Young, H. P. Individual Strategy and Social Structure: An Evolutionary Theory of Institutions (Princeton University Press, 1998). [Google Scholar]

[CR16] 16.Bergstrom, T. C. Evolution of social behavior: Individual and group selection. J. Econ. Perspect.16, 67–88 (2002).15179979 [Google Scholar]

[CR17] 17.Matros, A. Altruistic versus egoistic behavior in a public good game. J. Econ. Dyn. Control36, 642–656 (2012). [Google Scholar]

[CR18] 18.Herings, P.J.-J., Peeters, R., Tenev, A. P. & Thuijsman, F. Naïve imitation and partial cooperation in a local public goods model. J. Econ. Behav. Org.191, 162–185. 10.1016/j.jebo.2021.07.025 (2021). [Google Scholar]

[CR19] 19.Kirchkamp, O. Spatial evolution of automata in the prisoners’ dilemma. J. Econ. Behav. Org.43, 239–262 (2000). [Google Scholar]

[CR20] 20.Nowak, M. A. & Sigmund, K. Evolution of indirect reciprocity. Nature437, 1291–1298 (2005). [DOI] [PubMed] [Google Scholar]

[CR21] 21.Moody, M. Serial reciprocity: A preliminary statement. Soc. Theory26, 130–151 (2008). [Google Scholar]

[CR22] 22.Pitz, G. F. & Geller, E. S. Revision of opinion and decision times in an information-seeking task. J. Exp. Psychol.83, 400–405 (1970). [Google Scholar]

[CR23] 23.Traulsen, A., Semmann, D., Sommerfeld, R. D., Krambeck, H.-J. & Milinski, M. Human strategy updating in evolutionary games. Proc. Natl. Acad. Sci.107, 2962–2966 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR24] 24.Samuelson, W. & Zeckhauser, R. Status quo bias in decision making. J. Risk Uncertain.1, 7–59 (1988). [Google Scholar]

[CR25] 25.Kahneman, D., Knetsch, J. L. & Thaler, R. H. Anomalies: The endowment effect, loss aversion, and status quo bias. J. Econ. Perspect.5, 193–206 (1991). [Google Scholar]

[CR26] 26.O’Donoghue, T. & Rabin, M. Doing it now or later. Am. Econ. Rev.89, 103–124 (1999). [Google Scholar]

[CR27] 27.Jia, C.-X., Ma, L. & Liu, R.-R. Enhancing cooperation through payoff-related inertia in networked prisoner’s dilemma games. Chaos Solitons Fractals186, 115333 (2024). [Google Scholar]

[CR28] 28.Wang, C., Zhu, W. & Szolnoki, A. The conflict between self-interaction and updating passivity in the evolution of cooperation. Chaos Solitons Fractals173, 113667 (2023). [Google Scholar]

[CR29] 29.Wang, C., Zhu, W. & Szolnoki, A. When greediness and self-confidence meet in a social dilemma. Phys. A625, 129033 (2023). [Google Scholar]

[CR30] 30.Wang, C. & Szolnoki, A. Evolution of cooperation under a generalized death-birth process. Phys. Rev. E107, 024303 (2023). [DOI] [PubMed] [Google Scholar]

[CR31] 31.He, J., Wang, J. & Yu, F. Behavior inertia of individuals promotes cooperation in spatial prisoner’s dilemma game. Europhys. Lett.132, 38002. 10.1209/0295-5075/132/38002 (2020). [Google Scholar]

[CR32] 32.Szolnoki, A. & Szabó, G. Cooperation enhanced by inhomogeneous activity of teaching for evolutionary prisoner’s dilemma games. Europhys. Lett.77, 30004 (2007). [Google Scholar]

[CR33] 33.Szolnoki, A., Perc, M., Szabó, G. & Stark, H.-U. Impact of aging on the evolution of cooperation in the spatial prisoner’s dilemma game. Phys. Rev. E80, 021901. 10.1103/PhysRevE.80.021901 (2009). [DOI] [PubMed] [Google Scholar]

[CR34] 34.Chang, S., Zhang, Z., Wu, Y. & Xie, Y. Cooperation is enhanced by inhomogeneous inertia in spatial prisoner’s dilemma game. Physica A490, 419–425 (2018). [Google Scholar]

[CR35] 35.Wang, C. & Sun, C. Zealous cooperation does not always promote cooperation in public goods games. Chaos33, 063111 (2023). [DOI] [PubMed] [Google Scholar]

[CR36] 36.Liu, R.-R., Rong, Z., Jia, C.-X. & Wang, B.-H. Effects of diverse inertia on scale-free-networked prisoner’s dilemma games. Europhys. Lett.91, 20002 (2010). [Google Scholar]

[CR37] 37.Zhang, Y., Fu, F., Wu, T., Xie, G. & Wang, L. Inertia in strategy switching transforms the strategy evolution. Phys. Rev. E84, 066103 (2011). [DOI] [PubMed] [Google Scholar]

[CR38] 38.Jia, D., Jin, J., Du, C. & Shi, L. Effects of inertia on the evolution of cooperation in the voluntary prisoner’s dilemma game. Physica A509, 817–826 (2018). [Google Scholar]

[CR39] 39.Wang, C. & Szolnoki, A. Inertia in spatial public goods games under weak selection. Appl. Math. Comput.449, 127941 (2023). [Google Scholar]

[CR40] 40.Du, W.-B., Cao, X.-B., Liu, R.-R. & Wang, Z. Effects of inertia on evolutionary prisoner’s dilemma game. Commun. Theor. Phys.58, 451–455 (2012). [Google Scholar]

[CR41] 41.Capraro, V. & Perc, M. Grand challenges in social physics: In pursuit of moral behavior. Front. Phys.6, 107 (2018). [Google Scholar]

[CR42] 42.Capraro, V. & Perc, M. Mathematical foundations of moral preferences. J. R. Soc. Interface18, 20200880 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR43] 43.Kumar, A., Capraro, V. & Perc, M. The evolution of trust and trustworthiness. J. R. Soc. Interface17, 20200491 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR44] 44.Capraro, V., Perc, M. & Vilone, D. Lying on networks: The role of structure and topology in promoting honesty. Phys. Rev. E101, 032305 (2020). [DOI] [PubMed] [Google Scholar]

PERMALINK

Directed reciprocity subverts cooperation in highly adaptive populations

Jean-Jacques Herings

Ronald Peeters

Anastas P Tenev

Abstract

Supplementary Information

Introduction

Model setup and methods

Fig. 2.

Fig. 1.

Fig. 3.

Results

Fig. 4.

Absorbing sets: efficiency

Fig. 5.

Finding 1

Finding 2

Fig. 6.

Absorbing sets: strategies

Fig. 7.

Fig. 8.

Finding 3

Finding 4

Dynamics

Fig. 9.

Discussion

Supplementary Information

Acknowledgements

Author contributions

Data availability

Declarations

Competing interests

Footnotes

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases