Skip to main content
PLOS One logoLink to PLOS One
. 2021 Aug 6;16(8):e0254860. doi: 10.1371/journal.pone.0254860

Evolution of prosocial punishment in unstructured and structured populations and in the presence of antisocial punishment

Mohammad Salahshour 1,*
Editor: The Anh Han2
PMCID: PMC8345862  PMID: 34358254

Abstract

A large body of empirical evidence suggests that altruistic punishment abounds in human societies. Based on such evidence, it is suggested that punishment serves an important role in promoting cooperation in humans and possibly other species. However, as punishment is costly, its evolution is subject to the same problem that it tries to address. To suppress this so-called second-order free-rider problem, known theoretical models on the evolution of punishment resort to one of the few established mechanisms for the evolution of cooperation. This leaves the question of whether altruistic punishment can evolve and give rise to the evolution of cooperation in the absence of such auxiliary cooperation-favoring mechanisms unaddressed. Here, by considering a population of individuals who play a public goods game, followed by a public punishing game, introduced here, we show that altruistic punishment indeed evolves and promotes cooperation in the absence of a cooperation-favoring mechanism. In our model, the punishment pool is considered a public resource whose resources are used for punishment. We show that the evolution of a punishing institution is facilitated when resources in the punishment pool, instead of being wasted, are used to reward punishers when there is nobody to punish. Besides, we show that higher returns to the public resource or punishment pool facilitate the evolution of prosocial instead of antisocial punishment. We also show that an optimal cost of investment in the punishment pool facilitates the evolution of prosocial punishment. Finally, our analysis shows that being close to a physical phase transition facilitates the evolution of altruistic punishment.

Introduction

Cooperation requires a cooperator to incur a cost for others to benefit. As such, cooperation is costly and expected to diminish by natural selection [1, 2]. Contrary to this rational expectation, cooperation is everywhere-present in the biological and social world [36]. Empirical studies suggest, by enforcing cooperation in animal [710] and human societies [6, 1115], altruistic punishment can play an important role in the evolution of cooperation. However, just as cooperation does, altruistic punishment, being costly, goes against an individual’s self-interest, and its evolution is yet another puzzle [16, 17]. Many studies have tried to address this so-called second-order free-rider problem, according to which free-riding on prosocial punishers who punish defectors results in the elimination of punishers, and subsequently, to the extinction of cooperators due to first-order free-riding on cooperators. Theoretical models have shown that these two problems, first-order and second-order free-rider problems, can simultaneously be solved in cooperation favoring environments; that is when one of the few known mechanisms for the evolution of cooperation is at work. In this regard, group selection [18], indirect reciprocity [1921], voluntary participation [2225], spatial selection and network structure [2629], prior commitment in binary interactions (i.e., in the context of prisoner’s dilemma) [30], stochastically bribing punishers [31], and probabilistic sharing of the cost of punishment [32] have been successfully appealed to show how cooperation and punishment can co-evolve in cooperation favoring environments. Other studies have shown once exogenously established, punishment can have a positive effect on the evolution of cooperation [33]. However, the important question of whether altruistic punishment on its own can promote cooperation has remained unaddressed.

The theoretical grounds appear more disappointing, when it is noticed that in many cooperation favoring environments where it is argued that punishment and cooperation can co-evolve, the inclusion of a complete set of possible strategies, by adding antisocial punishers (who defect and punish cooperators) to the population, leads to the invasion of antisocial punishers, and thus, undermines the co-evolution of prosocial punishment and cooperation [3436]. A priory, there is no reason why antisocial punishment should be excluded in the model. Instead, especially given the empirical evidence that antisocial punishment is abundant in human and animal societies [3740], its exclusion is an important point which the theory needs to address, especially if the endogenous evolution of social norms is concerned. Although this problem is solved in some cases of cooperation-favoring environments (for example, in the case of structured populations [28], when a reputation mechanism is at work [20], or when participation is voluntary, and the type of institutions are observable [25]), a satisfactory understanding of the extent to which antisocial punishment can prevent the evolution of prosocial behavior is still lacking. These two problems, the second-order free-rider problem and, to a lesser degree, the antisocial punishment problem, raise important questions about the evolution of altruistic punishment and its role in the evolution of cooperation, not only in a general environment but also in many cooperation-favoring environments.

Here, by considering a well-mixed population of individuals who play a public goods game, followed by a public punishing game, introduced here, we show that the first-order and the second-order free-rider problems can be solved simultaneously in a general environment, i.e., in the absence of any cooperation-favoring mechanism and in the presence of antisocial punishment. This establishes altruistic punishment as a fundamental road to the evolution of cooperation and explains its evolution. Furthermore, by considering the same model in a structured population, we show that the mechanism is further strengthened in a cooperation-favoring environment. Besides, we show that an optimal weight of second-order versus first-order punishment can facilitate the evolution of prosocial punishment. Our analysis also shows that higher returns to investment in the punishment pool facilitate the evolution of prosocial as opposed to antisocial punishment. We also show that while a too low cost of punishment leads to the evolution of antisocial punishment, a too high punishment cost prohibits the evolution of punishing institutions, be it prosocial or antisocial. On the other hand, an optimal cost of investment in the punishment pool, comparable to the cost of investment in the public resource, facilitates the evolution of prosocial punishment. Finally, we provide evidence that being close to a physical phase transition facilitates the evolution of prosocial punishment in structured populations. This parallels some arguments according to which being close to a physical phase transition can facilitates biological functions [4144].

The public punishing game introduced here can be thought of as a model of pool or institutional punishment. In contrast to peer punishment, where agents individually decide whether to punish others or not, in pool punishment, agents can contribute to erect a punishment institution for conducting punishment [24, 29, 45]. However, in contrast to the previous models of pool punishment [24, 29, 45], in our model, punishment pool is modeled as a public goods game, the resources of which are used for punishment. Besides, while most of the previous models only consider a prosocial punishment pool [24, 29, 45], we allow for the existence of both a prosocial and an antisocial punishment pool. We argue how the public punishing game admits an intuitive interpretation in terms of the law enforcing institutions commonly observed in human societies [46, 47]. In this regard, we argue that increasing the adaptivity of the model in a way that it more closely resembles human punishing institutions can make the co-evolution of cooperation and prosocial punishment possible even in more hostile conditions for the evolution of cooperation. To do so, we show that the evolution of punishing institutions is facilitated if, instead of being wasted, the resources in the punishment pool can be used for rewarding purposes when there is nobody to punish.

The model

To see how altruistic punishment can evolve and promote cooperation, we consider a population of N individuals in which groups of g individuals are formed at random to play a public goods game (PGG). This game is frequently appealed to in studies on the evolution of cooperation [1114, 16, 17, 4850]. In this game, each individual can either cooperate or defect. Cooperators pay a cost c to invest in a public resource. Defectors invest nothing. All the investments are multiplied by an enhancement factor r and are divided equally among all the group members. In addition to playing the public goods game, individuals can engage in prosocial or antisocial punishment. For this purpose, cooperators can invest an amount c′ in a prosocial punishment pool. In the same way, defectors can invest the same amount c′ in an antisocial punishment pool. All the investments in the prosocial and antisocial punishment pools are multiplied by a punishment enhancement factor ρ and are used for punishment purposes. To this goal, a fraction 1 − α of the total resources in the prosocial punishment pool is spent to punish defectors in the group, and a fraction α is used to punish cooperators who do not contribute to the prosocial punishment pool. In the same way, a fraction 1 − α of the total resources in the antisocial punishment pool is spent to punish cooperators, while a fraction α is used to punish defectors who do not contribute to the antisocial punishment pool. Thus, α determines the relative strength of second-order with respect to first-order punishment.

Individuals gather payoffs according to the payoff structure of the game and reproduce with a probability proportional to the exponential of their payoff such that the population size remains constant. That is, each individual in the next generation is offspring to an individual in the last generation with a probability proportional to the exponential of its payoff. Offspring inherit the strategies of their parent subject to mutations. We assume mutations in the decisions of the individuals to contribute to the public pool, and their decision to contribute to the punishment pool occurs independently, each with probability ν. In this study, unless otherwise stated, we set c = c′ = 1.

We note that the model of punishment, introduced here, has similarities with the models of pool punishment [24, 29]. Pool punishment is introduced in two slightly different ways in the literature. In a popular variant [45], considered, for instance, in ref. [29], punishers invest an amount B to a punishment pool, and free-riders get punished by an amount G provided at least one pool punisher exists in their group. In another variant, considered in ref. [24], punishers invest an amount B in a punishment pool, and free-riders get fined by an amount G per punisher in their group. In this way, a free-rider gets fined by GnP, where nP is the number of punishers in its group. In both variants of pool punishment models discussed above, the amount of resources available for punishment is variable and depends on the number of defectors in the group: It is equal to nD G in the first variant and nD nP G in the second variant, where nD is the number of defectors in the group. However, it is unclear why the same contribution B to the punishment pool should yield different outcomes (from G to nD G) depending on the number of defectors in the group. Contrary to this feature of the past models, it can be argued that punishment resources should be determined based on the amount of contribution made by punishers to the punishment pool. This is the case in our model, where the total amount of resources available for punishment is determined by the amount of contribution to the punishment pool and is equal to βnPC c′ (in the case of prosocial punishment pool) or βnPD c′ (in the case of antisocial punishment pool), where, nPC and nPD are, respectively, the number of prosocial and antisocial punishers in the group. In this regard, the punishment pool is regarded as a public resource, the contributions to which are multiplied by an enhancement factor and are used for punishment purposes.

Looking at the punishment pool as a public resource allows for a modification of the model with interesting consequences. In the model considered so far, if there is nobody to punish, the resources in the punishment pool are wasted. Based on this feature, we call this model the wasteful punishment model. This feature, while keeping the model simple, might not be realistic. In real-world punishing institutions, wealth is not destroyed. Instead, if not necessary for sanctioning purposes, it can be used for other purposes, such as charity or reward. To take this fact into account, we also consider a second model, a non-wasteful punishment model, in which if there is nobody to punish, resources in the punishment pool are divided among its contributors. More precisely, in the non-wasteful punishment model, a fraction α of the resources in the prosocial punishment pool is spent to punish non-punishing cooperators. However, if there is no non-punishing cooperator in the group, instead of being wasted, this is divided equally among the contributors to the prosocial punishment pool (i.e., among the punishing cooperators). Similarly, a fraction 1 − α of the resources in the prosocial punishment pool is spent to punish defectors. However, if there is no defector to punish in a group, this fraction is divided equally among the contributors to the prosocial punishment pool. The same holds for the antisocial punishment pool. That is, a fraction α of the resources in the antisocial punishment pool is spent to punish non-punishing defectors. However, if there is no non-punishing defector in the group, this is divided equally among the contributors to the antisocial punishment pool. Similarly, a fraction 1 − α of the resources in the antisocial punishment pool is spent to punish cooperators. However, if there is no cooperator to punish in a group, this fraction is divided equally among the contributors to the antisocial punishment pool. All the other details of the model remain the same as before. As we will see, adding this conservation of punishment resources to the model facilitates the evolution of punishing institutions.

Results

Wasteful punishment

The phase diagrams of the model in the rρ and rα planes are presented in, respectively, Fig 1(a) and 1(b). The blue circles present the results of simulations in a population of size N = 10000, and the red lines result from the numerical solutions of the replicator dynamics, developed in the Methods section. To drive the phase diagram, we have determined the system’s equilibrium state starting from a random initial condition in which the strategies of the individuals are randomly assigned. For the replicator dynamics, this amounts to an initial condition in which the frequency of all the strategies is the same.

Fig 1. The phase diagram of the wasteful punishment model in a well-mixed population.

Fig 1

Blue circles denote the results of a simulation in a population of size N = 10000, and the red lines denote the results of the replicator dynamics. Depending on the parameters of the model, the model shows four different phases. C, D, PC, and PD denote different phases in which, respectively, cooperators, defectors, punishing cooperators, and punishing defectors dominate. Here, g = 9, ν = 0.001, c = c′ = 1. In (a) α = 0.5 and in (b) ρ = 5.

Depending on the model’s parameters, the system can be found in one of the four possible phases. As can be seen in Fig 1(a), for small punishment enhancement factors ρ, punishing strategies do not evolve. In this region, for r smaller than the group size g = 9, the population settles into a defective phase in which only non-punishing defectors survive. D denotes this phase in the figures. As r increases, for a value of r close to g = 9, a phase transition to a phase where non-punishing cooperators survive occurs. C denotes this phase. On the other hand, for large values of ρ, punishing strategies evolve and eliminate other strategies. However, the nature of the evolving punishment depends on the value of r. For small r and large values of ρ, such that the return to the investment in the public pool is low, but that to the investment in the punishment pool is high, the population settles into the antisocial punishment phase, where antisocial punishers dominate the population. PD indicates this phase. On the other hand, for large enough values of r and ρ, such that the returns to investments in both the public pool and the punishment pool are high, the dynamics settle into the prosocial punishment phase, where prosocial punishers dominate the population. PC denotes this phase.

The phase diagram in the rα plane shows similar phases. For small α, such that there is not enough investment in punishing second-order free-riders, punishment does not evolve. In this region, for r smaller than a value close to g = 9 non-punishing defectors survive, and for larger values of r non-punishing cooperators survive. As α increases, a discontinuous transition occurs above which punishing strategies evolve. In this regime, for small r punishing defectors dominate. However, for larger values of r, punishing cooperators dominate. For very large values of α (close to 1), another transition occurs above which punishment does not evolve. This shows that enough investment in punishing first-order free-riders is also necessary for the evolution of punishing institutions. Interestingly, for larger values of α, the evolution of prosocial punishment requires a larger value of r. This shows, an optimal value of α, that is, an optimal weight of second-order with respect to first-order punishment exists, which facilitates the evolution of prosocial punishment. Altogether, our analysis reveals that for punishing institutions to evolve, they need to punish both first-order free-riders, who do not contribute to the public good, and second-order free-riders, who do not contribute to the punishment pool. This suggests that issuing a fine for non-contributors to the policing institutions is necessary for the evolution of such institutions.

An interesting question is how the relative cost of investment in the public pool and the punishment pool affects the evolution of punishment? To address this question, in Fig 2(a) and 2(b), we plot the phase diagram of the model in, respectively, rc′ and ρc′ planes. As can be seen, for too small cost of punishment, c′, antisocial punishment evolves (We note that for c′ = 0, the fine is zero, and thus, punishment is not possible. The strategies PD and D coexist in this case. A non-zero c′ can give rise to a non-zero fine and lead to the domination of antisocial punishers). A non-zero cost of investment in the punishment pool can give rise to the evolution of prosocial punishment. However, the evolution of prosocial punishment is facilitated for medium values of c′, comparable to the cost of investment in the public resource, c = 1. For too large c′, punishing institutions do not evolve. Instead, non-punishing defectors or non-punishing cooperators (for r larger than approximately g = 9) dominate the population.

Fig 2. Dependence on the cost of investment in punishment pool, c′.

Fig 2

The phase diagram of the wasteful punishment model in rc′ (a) and ρc′ planes (b). For too small c′ antisocial punishment evolves, and for too large c′ non-punishing defectors survive. For medium values of c′ prosocial punishment evolves. An optimal c′, which is comparable to c = 1, facilitates the evolution of prosocial punishment. Parameter values: g = 9, ν = 10−3, α = 0.5, and c = 1. In (a) ρ = 5 and in (b) r = 5. The replicator dynamics is used to derive the phase diagrams.

So far, we have considered a well-mixed population. As population structure favors the evolution of cooperation, one might expect that prosocial punishment to evolve in structured populations as well. To see this is indeed the case, we present the phase diagram of the model for a structured population, in Fig 3(a) and 3(b). The phase diagram is derived by performing simulations in a population of N = 40000 individuals residing on a 200 × 200 first nearest neighbor square lattice with Moore connectivity and periodic boundaries. In a structured population, the model shows similar phases to those that appeared for a well-mixed population. However, two shifts are observable in the position of the phase transitions. First, due to network reciprocity, the DC transition shifts to smaller enhancement factors. We note that, in the absence of punishing strategies, the DC transition for the same network structure and size occurs for a larger value of r. This shows the beneficial effect of introducing the punishing strategies for the evolution of cooperation, even in the parameter regimes where such punishing strategies do not evolve. This interesting phenomenon results from the synergistic effect of rock-paper-scissor-like dynamics and spatial structure, according to which punishing cooperators facilitate the evolution of non-punishing cooperators by eliminating defectors and facilitating the formation of small cooperators block (See S.4.4 and S.5). Second, away from the DC transition, the phase transition from non-punishing to punishing strategies shifts to larger values of ρ, compared to the mixed population. This shows, surprisingly, population structure can hinder the evolution of punishing institutions, be it prosocial or antisocial. On the other hand, the transition from the PD phase to the PC phase occurs for much smaller values of r in a structured population compared to a mixed population. This shows that network structure facilitates the evolution of prosocial as opposed to antisocial punishment. In addition, the analysis of the model reveals that the evolution of prosocial punishment is facilitated close to the DC transition. This can be observed to be the case by noting that close to the DC transition, the value of ρ above which prosocial punishment evolves decreases. As shown in the Supplementary Information S1 Text (S.4.2), this result holds for other parameter values and shows the beneficial effect of being close to a continuous transition for the evolution of prosocial behavior. Finally, we note that the PDPC transition occurs for a smaller value of r compared to the DC transition. Furthermore, by increasing ρ, the PDPC transition shifts to smaller values of r. This is the case in both a well-mixed population and a structured population and shows that the more effective the punishment, the easier, and for smaller enhancement factors, prosocial punishment and cooperation evolve.

Fig 3. The phase diagram of the wasteful punishment model in a structured population.

Fig 3

The phase diagram is derived by running simulations in a population of size 40000, residing on a 200 × 200 first nearest neighbor lattice with Moore connectivity and periodic boundaries. Depending on the parameters of the model, the model shows four different phases. C, D, PC, and PD denote different phases in which, respectively, cooperators, defectors, punishing cooperators, and punishing defectors dominate. Here, g = 9, ν = 0.001, and c = c′ = 1. In (a) α = 0.5 and in (b) ρ = 5.

To take a closer look at the mechanism by which cooperation and punishment co-evolve, in Fig 4(a)–4(c), we present the snapshots of the time evolution of the system close to the CPC phase transition. Here, a population of N = 90000 individuals residing on a 300 × 300 lattice with Moore connectivity and periodic boundaries is considered. The simulation starts with a random assignment of the strategies. The frequency of different strategies as a function of time is plotted in Fig 4(d). Starting from a random initial condition, punishing strategies rapidly grow, while the non-punishing strategies decline. Punishing defectors have the highest growth rate at the beginning of the simulation. This results in a sharp increase in their number by driving other solitary strategies into extinction. However, after small blocks of punishing cooperators are formed, they obtain the highest growth by reaping the benefit of cooperation among themselves and avoiding being punished by rival punishing defectors, and rapidly drive punishing and non-punishing defectors into extinction. As argued below, by setting the stage for the invasion of non-punishing cooperators, this phenomenon facilitates the evolution of cooperation. The initial rapid growth of punishing cooperators sets the stage for the second stage of the system’s time evolution, in which small domains of non-punishing cooperators are formed in a sea of prosocial punishers. As here, the system is in the C phase, cooperators experience advantage over prosocial punishers. Consequently, cooperators’ blocks start to grow slowly along the horizontal and vertical boundaries until they dominate the population. While defectors can not survive in the sea of prosocial punishers, they survive by forming narrow bands within the domain of non-punishing cooperators. Consequently, once non-punishing cooperators start to dominate the population, the frequency of defectors increases as well. We note that, as shown in the S1 Text (S.4.4 and S.5) and the Supplementary Videos (S1S5 Videos), this coarsening pattern is characteristic of the evolution of punishing strategies.

Fig 4. Time evolution of the wasteful punishment model in a structured population.

Fig 4

(a) to (c) present the snapshots of the system’s time evolution, and (d) shows the frequency of different strategies as a function of time. Here, a population of N = 90000 individuals lives on a two-dimensional 300 × 300 lattice with periodic boundaries and Moore connectivity. Different colors indicate different strategies. Light blue shows non-punishing cooperators, dark blue shows punishing cooperators, light red shows non-punishing defectors, and dark red shows punishing defectors. The simulation started with a random initial condition. Here, ν = 10−3, c = c′ = 1, r = 4 and ρ = 3.47.

Finally, we note that in a mixed population, the model is multi-stable in the entire phase diagram; for r < g, all the three strategies, D, PD, and PC are stable. This implies that all the transitions but the DC transition are discontinuous. As shown in the S1 Text (S.3.2), the nature of the DC transition depends on the value of ρ. While this transition is discontinuous for large ρ, for small ρ there is a cross-over from the D phase to the C phase without passing any singularity. In between, the transition becomes a continuous transition at a critical point. Similarly, for a structured population, all the transitions but the DC transition are discontinuous. The DC transition, in contrast, shows no discontinuity and appears to occur continuously (S.4.3).

Non-wasteful punishment

The phase diagram of the non-wasteful punishment model for a mixed population is presented in Fig 5(a). Fig 5(a), presents the phase diagram of the model in the rρ plane, and Fig 5(b), presents the phase diagram of the model in the rα plane. Blue circles represent the result of a simulation in a population of size N = 40000, and the red lines represent the result of the replicator dynamics. As can be seen, the result of the replicator dynamics is in good agreement with the result of simulations.

Fig 5. The phase diagram of non-wasteful punishment model for a mixed population.

Fig 5

Blue circles denote the results of a simulation in a population of size N = 40000, and the red lines denote the results of the replicator dynamics. Depending on the parameters of the model, the model shows four different phases separated by discontinuous transitions. C, D, PC, and PD denote different phases in which, respectively, non-punishing cooperators, non-punishing defectors, punishing cooperators, and punishing defectors dominate the population. Here, g = 9, ν = 0.001, and c = c′ = 1.

As in the wasteful punishment model, the dynamics in the non-wasteful punishment model is multi-stable: depending on the initial conditions, the dynamics settle into a phase where one of the strategies dominates the population and drives all the other strategies into extinction. For small punishing enhancement factors, ρ, punishing strategies do not evolve. In this region, for r smaller than a value close to g, the dynamics settle into a defective phase in which non-punishing defectors dominate the population. D denotes this phase in the figure. On the other hand, for r larger than (approximately) g, cooperators survive and dominate the population. C denotes this phase in the figure. As ρ increases, a phase transition to a phase where punishing strategies evolve occurs. However, the nature of the evolved punishing strategy depends on the value of r. For small r punishing defectors dominate the population. PD denotes this phase in the figure. On the other hand, for large values of r, the dynamics settle into a phase where punishing cooperators dominate the population. PC denotes this phase in the figure.

Comparison with the wasteful punishment model shows that the evolution of punishment is facilitated in the non-wasteful punishment model, be it prosocial or antisocial. This can be seen by noting that the phase transition to the punishing phase occurs for a smaller value of the punishment enhancement factor, ρ, in the non-wasteful punishment model. This shows that a smaller return to the investment in the punishment pool is sufficient to give rise to the evolution of punishment, when, instead of being wasted, the resources in the punishment pool are redistributed among its contributors in case there is nobody to punish. We note that similarly to the wasteful punishment model, higher returns to the investments in the punishment pool facilitate the evolution of prosocial as opposed to antisocial punishment. This can be seen by noting that for higher values of ρ, the transition to the prosocial punishment phase shifts to smaller values of r. That is, for more effective punishment mechanisms, a smaller enhancement factor for the public resource is sufficient to promote social punishment.

The phase diagram of the non-wasteful model in the αr plane is presented in Fig 5(b). As can be seen, in the non-wasteful model, punishing strategies evolve even for α = 0. That is, punishment evolves even in the absence of second-order punishment. The reason is that the prospect of receiving a return from the punishment pool in case there is nobody to punish can act as a reward which solves the second-order free-riding problem. Consequently, second-order punishment is not necessary to ensure the evolution of punishment. Furthermore, the value of r for which prosocial punishment evolves increases by increasing α. This shows that second-order punishment can be detrimental to the evolution of prosocial punishment in a situation where the resources of the punishment pool are rewarded to its contributors when there is nobody to punish.

Finally, we note that similarly to the wasteful punishment model, the non-wasteful punishment model shows multistability in the whole region of the phase diagram: For r < g, three stable phases, D, PD, and PC exist. Depending on the initial conditions, the dynamics settle into one of these phases. On the other hand, for r > g, both the C and the PC phases are stable. This implies that the phase transitions involving different punishment phases in this model are discontinuous.

The non-wasteful punishment model in a structured population is studied in the S1 Text and it is shown that occasional rewarding of the resources in the punishment pool can facilitate the evolution of prosocial punishment in a structured population too.

Discussion

As the evolution of altruistic punishment is riddled by the same kind of free-riding problem that the evolution of cooperation is, it was believed that resorting to another cooperation favoring mechanism is necessary to explain the evolution of altruistic punishment and its role in the evolution of cooperation [18, 19, 2124, 2628]. As we have shown, this is not necessarily the case. Instead, the efficient coupling of the second-order and first-order free-rider problems provides a surprising way for the simultaneous solution of both dilemmas. This establishes altruistic punishment as a fundamental road to the evolution of cooperation and can explain its overwhelming presence in human and many animal societies. Furthermore, our study brings new insights into the beneficial conditions for the evolution of punishing institutions and prosocial behavior. In this regard, our analysis shows an optimal weight for second-order, with respect to first-order punishment, exists, which facilitates the evolution of altruistic punishment. Besides, the more efficient the punishment mechanism, the more likely that prosocial punishment, as opposed to antisocial punishment, evolves. Our analysis also reveals that network structure can be detrimental to the evolution of punishing institutions, be it prosocial or antisocial. This theoretical prediction has been observed recently in spatial public goods experiments [51], and parallels some arguments that network structure can sometimes be surprisingly harmful for the evolution of prosocial behavior [52]. On the other hand, network structure can facilitate the evolution of prosocial punishment instead of antisocial punishment, provided favorable conditions for the evolution of punishing institutions are satisfied. We have also seen that an optimal cost of investment in the punishment pool facilitates the evolution of prosocial punishment: While for a too low cost of punishment, antisocial punishment evolves, for too high investment cost punishing institutions, be it prosocial or antisocial, does not evolve.

Finally, we have seen that being close to a physical phase transition is beneficial for the evolution of punishing institutions in a structured population. This parallels many arguments, according to which being close to physical phase transitions can provide optimal conditions for many biological functions and extends such arguments to the evolution of prosocial behavior [4144].

Past studies have considered the evolution or pool or institutional punishment [24, 29, 45]. The point of departure in our model from the previous models of pool punishment is that the punishment pool is considered a public resource in our model, the resources of which are used for punishing purposes. In other words, in our model, the value of the fines received by punished individuals depends of the investment made by punishers. In addition, while most of the previous models exclude an antisocial punishment pool [24, 29, 45], in our model, an antisocial punishment pool exists as well. Using previous models of pool punishment, past studies have not detected the evolution of cooperation in a mixed population and in the absence of auxiliary mechanisms. However, as we show by exploring a broad parameter range, the model of pool punishment introduced here can give rise to the evolution of cooperation for sufficiently large values of punishment enhancement factor ρ, and proper choice of second-order punishment ratio α.

Just as the public goods game is thought of as a metaphor for a social dilemma, the public punishing game introduced here can be thought of as a metaphor for public punishing institutions, such as formal and informal policing institutions at work in human societies [46, 47]. In this regard, the contribution to the public punishing pool can be thought of as a tax paid by individuals to establish a policing institution. Similarly, the punishment of second-order free-riders can be considered as a fine for not paying the tax, and the punishment of first-order free-riders can be considered a fine for not contributing to the public good. Our model can be thought of as a simple and minimal model which grasps the essential aspects of the evolution of such punishing institutions. However, human punishing institutions are adaptive institutions that have accumulated a high level of sophistication in the course of their evolution [46, 47]. In terms of this analogy, the model can be made more adaptive to resemble human sanctioning institutions more closely. Such adaptivity is expected to increase the effectiveness of the punishing institutions and thus, facilitates the evolution of cooperation and prosocial punishment, just as it arguably does in real-world sanctioning institutions. We have considered one such modification in which the resources in the punishing pool, instead of being wasted if they remain unused for punishing purposes, can be used to reward its contributors. As we have seen, such a modification facilitates the evolution of punishment and cooperation. Furthermore, in such a non-wasteful punishment model, second-order punishment is not necessary for the evolution of punishment. Instead, occasional rewarding of punishers can provide an incentive to contribute to the punishing pool, which can solve the second-order free-riding problem. This feature of the non-wasteful punishment model is reminiscent of some arguments according to which combining reward and punishment [33] or second-order reward [53] can be an effective mechanism for promoting social behavior.

Methods and materials

The replicator dynamics

In a well-mixed population, it is possible to drive a set of equations for the time evolution of the system in terms of the replicator-mutation equations. In a general case, the replicator-mutation equations can be written as follows:

mx(t+1)=xνxxmx(t)wx(t)xmx(t)wx(t). (1)

Here, x, x′, and x″ refer to the strategies, and can be PC, C, PD, or D, referring to, respectively, punishing cooperators, non-punishing cooperators, punishing defectors, and non-punishing defectors. mx is the frequency of the strategy x, wx is the expected fitness of an individual with strategy x, and νxx is the mutation rate from the strategy x′ to the strategy x. Under our assumption that mutations in the strategies of the individuals in the public goods pool, and in the public punishing pool occur independently, these can be written in terms of the probability of mutation ν, as follows. For those transformations which require no mutations, that is x = x′, we have νxx=1-2ν+ν2 (this is the probability that no mutation, neither in the strategy to contribute to the public pool, nor in the strategy to contribute to the punishing pool, occurs). For those rates which require two mutations, one in the decision of the individuals to contribute to the public pool, and one in their decision to contribute to the punishing pool, we have νDPC=νCPD=νPCD=νPDC=ν2. All the other rates, which require only one mutation, are equal to νCD=νDC=νPCPD=νPDPC=νCPC=νPCC=νPDD=νDPD=ν-ν2.

To use the replicator-mutation equation, Eq (1), we need expressions for the expected fitness of different strategies. These are given by the following equations:

wPC=nPD=0g-1-nPC-nCnC=0g-1-nPCnPC=0g-1exp[cr1+nC+nPCg-(1-α)cρnPD1+nPC+nC-c-c]ρPCnPCρCnCρPDnPDρDg-1-nPC-nC-nPD(g-1nPC,nC,nPD,g-1-nPC-nC-nPD),wC=nPD=0g-1-nPC-nCnC=0g-1-nPCnPC=0g-1exp[cr1+nC+nPCg-(1-α)cρnPD1+nPC+nC-αcρnPC1+nC-c]ρPCnPCρCnCρPDnPDρDg-1-nPC-nC-nPD(g-1nPC,nC,nPD,g-1-nPC-nC-nPD),wPD=nPD=0g-1-nPC-nCnC=0g-1-nPCnPC=0g-1exp[crnC+nPCg-(1-α)cρnPC1+nPD+nD-c]ρPCnPCρCnCρPDnPDρDg-1-nPC-nC-nPD(g-1nPC,nC,nPD,g-1-nPC-nC-nPD),wD=nPD=0g-1-nPC-nCnC=0g-1-nPCnPC=0g-1exp[crnC+nPCg-(1-α)cρnPC1+nPD+nD-αcρnPD1+nD]ρPCnPCρCnCρPDnPDρDg-1-nPC-nC-nPD(g-1nPC,nC,nPD,g-1-nPC-nC-nPD). (2)

In the following we explain how these expressions can be derived. In the process, we consider a focal individual in a group where there are nPC punishing cooperators, nC non-punishing cooperators, nPD punishing defectors, and nD = g − 1 − nPCnCnPD non-punishing defectors in the group. The term in the large bracket in Eq (2), is the payoff of such a focal individual. In the following, we explain why this is so.

Using the previously mentioned notation for the group composition of a focal individual, cr1+nPC+nCg-c is the payoff of a focal punishing or non-punishing cooperator, and crnPC+nCg is the payoff of a focal punishing or non-punishing defector from the public goods game. These are the first terms in the large bracket in Eq (2). A focal punishing or non-punishing cooperator, receives a punishment from the punishing defectors in its group equal to (1-α)cρnPD1+nPC+nC. This is the second term in the large bracket in the expressions for wPC and wC. In addition, a focal non-punishing cooperator is punished by punishing cooperators in its group, by an amount equal to αcρnPC1+nC. This is the third term in the large bracket in the expression for wC. In the same way, a focal punishing or non punishing defector, receives a punishment from the punishing cooperators in its group equal to (1-α)cρnPC1+nPD+nD. This is the second term in the large bracket in the expressions for wPD and wD. In addition, a focal non-punishing defector is punished by punishing defectors in its group, by an amount equal to αcρnPD1+nD. This is the third term in the large bracket in the expression for wD. Finally, as punishing cooperators contribute to both the public pool and the prosocial punishing pool, they pay a cost of c+c′. On the other hand, non-punishing cooperators only pay a cost of c to contribute to the public pool. Similarly, punishing defectors, pay a cost of c′ to contribute to the antisocial punishing pool. Defectors contribute to none of the pools and pay no cost.

As individuals reproduce with a probability proportional to their payoff, the expected fitness of a strategy can be defined as the expected value of the exponential of the payoff of that strategy. To calculate the expected value of fitness, we note that ρPCnPCρCnCρPDnPDρDg-1-nPC-nC-nPD(g-1nPC,nC,nPD,g-1-nPC-nC-nPD), is the probability that a focal individual finds itself in a group with nPC punishing cooperators, nC non-punishing cooperators, nPD punishing defectors, and nD non-punishing defectors. Here, (g-1nPC,nC,nPD,g-1-nPC-nC-nPD)=(g-1)!nPC!,nC!,nPD!,(g-1-nPC-nC-nPD)! is the multinational coefficient. This is the number of ways that among the g − 1 group-mates of a focal individual, nPC, nC, nPD, and g − 1 − nPCnCnPD individuals are respectively, punishing cooperators, non-punishing cooperators, punishing defectors, and non-punishing defectors. Summation over all the possible configurations gives the expected fitness of different strategies. Using the expressions in Eq (2) for the expected fitness of different strategies in Eq (1), we have a set of four equations which gives an analytical description of the model, in the limit of infinite population size.

As show in the S1 Text (S.2), the replicator dynamics for the non-wasteful punishment model can be derived in the same way.

The simulations and numerical solutions

Analytical solutions result from numerically solving the replicator dynamics of the model. Simulations of the model are performed according to the model definition. Both simulations and analytical solutions are performed with an initial condition in which all the strategies are found in similar frequencies in the population pool. For the solutions of the replicator dynamics, this is assured by setting the initial frequency of all the four strategies equal to 1/4. For simulations, this is assured by a random assignment of the strategies.

Supporting information

S1 Text. Supplemental information text.

An overview of the models is given. The replicator dynamics of the non-wasteful punishment model is derived. The models are further analyzed and the robustness of the results is argued. Supplementary Videos are explained.

(PDF)

S1 Video. Supplementary video 1.

An example of the time evolution of the wasteful punishment model in the structured population. See S1 Text for a description of video.

(AVI)

S2 Video. Supplementary video 2.

An example of the time evolution of the wasteful punishment model in the structured population. See S1 Text for a description of video.

(AVI)

S3 Video. Supplementary video 3.

An example of the time evolution of the wasteful punishment model in the structured population. See S1 Text for a description of video.

(AVI)

S4 Video. Supplementary video 4.

An example of the time evolution of the wasteful punishment model in the structured population. See S1 Text for a description of video.

(AVI)

S5 Video. Supplementary video 5.

An example of the time evolution of the wasteful punishment model in the structured population. See S1 Text for a description of video.

(AVI)

Data Availability

All relevant data are within the paper and its Supporting information files.

Funding Statement

The author acknowledges funding from Alexander von Humboldt Foundation in the framework of the Sofja Kovalevskaja Award endowed by the German Federal Ministry of Education and Research. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1. Axelrod R, Hamilton WD. The evolution of cooperation. science. 1981. Mar 27;211(4489):1390–6. doi: 10.1126/science.7466396 [DOI] [PubMed] [Google Scholar]
  • 2. Nowak MA. Five rules for the evolution of cooperation. science. 2006. Dec 8;314(5805):1560–3. doi: 10.1126/science.1133755 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. West SA, Griffin AS, Gardner A, Diggle SP. Social evolution theory for microorganisms. Nature reviews microbiology. 2006. Aug;4(8):597–607. doi: 10.1038/nrmicro1461 [DOI] [PubMed] [Google Scholar]
  • 4. Clutton-Brock T., 2009. Cooperation between non-kin in animal societies. Nature, 462 (7269), pp.51–57. doi: 10.1038/nature08366 [DOI] [PubMed] [Google Scholar]
  • 5. Rand DG, Nowak MA. Human cooperation. Trends in cognitive sciences. 2013. Aug 1;17(8):413–25. doi: 10.1016/j.tics.2013.06.003 [DOI] [PubMed] [Google Scholar]
  • 6. Fehr E, Fischbacher U. Social norms and human cooperation. Trends in cognitive sciences. 2004. Apr 1;8(4):185–90. doi: 10.1016/j.tics.2004.02.007 [DOI] [PubMed] [Google Scholar]
  • 7. Tibbetts EA, Dale J. A socially enforced signal of quality in a paper wasp. Nature. 2004. Nov;432(7014):218–22. doi: 10.1038/nature02949 [DOI] [PubMed] [Google Scholar]
  • 8. Hauser MD. Costs of deception: cheaters are punished in rhesus monkeys (Macaca mulatta). Proceedings of the National Academy of Sciences. 1992. Dec 15;89(24):12137–9. doi: 10.1073/pnas.89.24.12137 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Wenseleers T, Ratnieks FL. Enforced altruism in insect societies. Nature. 2006. Nov;444(7115):50–. doi: 10.1038/444050a [DOI] [PubMed] [Google Scholar]
  • 10. Ratnieks FL, Foster KR, Wenseleers T. Conflict resolution in insect societies. Annu. Rev. Entomol. 2006. Jan 7;51:581–608. doi: 10.1146/annurev.ento.51.110104.151003 [DOI] [PubMed] [Google Scholar]
  • 11. Fehr E. and Gächter S., 2002. Altruistic punishment in humans. Nature, 415 (6868), pp.137–140. doi: 10.1038/415137a [DOI] [PubMed] [Google Scholar]
  • 12. Fehr E, Gächter S. Cooperation and punishment in public goods experiments. American Economic Review. 2000. Sep;90(4):980–94. doi: 10.1257/aer.90.4.980 [DOI] [Google Scholar]
  • 13. Gächter S, Renner E, Sefton M. The long-run benefits of punishment. Science. 2008. Dec 5;322(5907):1510–. doi: 10.1126/science.1164744 [DOI] [PubMed] [Google Scholar]
  • 14. Gürerk Ö, Irlenbusch B, Rockenbach B. The competitive advantage of sanctioning institutions. Science. 2006. Apr 7;312(5770):108–11. doi: 10.1126/science.1123633 [DOI] [PubMed] [Google Scholar]
  • 15. Mathew S, Boyd R. Punishment sustains large-scale cooperation in prestate warfare. Proceedings of the National Academy of Sciences. 2011. Jul 12;108(28):11375–80. doi: 10.1073/pnas.1105604108 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Dreber A, Rand DG, Fudenberg D, Nowak MA. Winners don’t punish. Nature. 2008. Mar;452(7185):348–51. doi: 10.1038/nature06723 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17. Milinski M, Rockenbach B. Punisher pays. Nature. 2008. Mar;452(7185):297–8. doi: 10.1038/452297a [DOI] [PubMed] [Google Scholar]
  • 18. Boyd R, Gintis H, Bowles S, Richerson PJ. The evolution of altruistic punishment. Proceedings of the National Academy of Sciences. 2003. Mar 18;100(6):3531–5. doi: 10.1073/pnas.0630443100 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19. Panchanathan K, Boyd R. Indirect reciprocity can stabilize cooperation without the second-order free rider problem. Nature. 2004. Nov;432(7016):499–502. doi: 10.1038/nature02978 [DOI] [PubMed] [Google Scholar]
  • 20. Hilbe C, Traulsen A. Emergence of responsible sanctions without second order free riders, antisocial punishment or spite. Scientific reports. 2012. Jun 13;2(1):1–4. doi: 10.1038/srep00458 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21. Sigmund K, Hauert C, Nowak MA. Reward and punishment. Proceedings of the National Academy of Sciences. 2001. Sep 11;98(19):10757–62. doi: 10.1073/pnas.161155698 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22. Fowler JH. Altruistic punishment and the origin of cooperation. Proceedings of the National Academy of Sciences. 2005. May 10;102(19):7047–9. doi: 10.1073/pnas.0500938102 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Hauert C, Traulsen A, Brandt H, Nowak MA, Sigmund K. Via freedom to coercion: the emergence of costly punishment. science. 2007. Jun 29;316(5833):1905–7. doi: 10.1126/science.1141588 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24. Sigmund K, De Silva H, Traulsen A, Hauert C. Social learning promotes institutions for governing the commons. Nature. 2010. Aug;466(7308):861–3. doi: 10.1038/nature09203 [DOI] [PubMed] [Google Scholar]
  • 25. García J, Traulsen A. Evolution of coordinated punishment to enforce cooperation from an unbiased strategy space. Journal of the Royal Society Interface. 2019. Jul 26;16(156):20190127. doi: 10.1098/rsif.2019.0127 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Nakamaru M, Iwasa Y. The evolution of altruism by costly punishment in lattice-structured populations: score-dependent viability versus score-dependent fertility. Evolutionary ecology research. 2005;7(6):853–70. [Google Scholar]
  • 27. Brandt H, Hauert C, Sigmund K. Punishment and reputation in spatial public goods games. Proceedings of the royal society of London. Series B: biological sciences. 2003. May 22;270(1519):1099–104. doi: 10.1098/rspb.2003.2336 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28. Szolnoki A, Perc M. Second-order free-riding on antisocial punishment restores the effectiveness of prosocial punishment. Physical Review X. 2017. Oct 30;7(4):041027. doi: 10.1103/PhysRevX.7.041027 [DOI] [Google Scholar]
  • 29. Perc M. Sustainable institutionalized punishment requires elimination of second-order free-riders. Scientific reports. 2012. Mar 30;2(1):1–6. doi: 10.1038/srep00344 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Han TA. Emergence of social punishment and cooperation through prior commitments. InProceedings of the Thirtieth AAAI Conference on Artificial Intelligence 2016 Feb 12 (pp. 2494–2500).
  • 31. Liu L, Chen X, Szolnoki A. Evolutionary dynamics of cooperation in a population with probabilistic corrupt enforcers and violators. Mathematical Models and Methods in Applied Sciences. 2019. Oct 20;29(11):2127–49. doi: 10.1142/S0218202519500428 [DOI] [Google Scholar]
  • 32. Chen X, Szolnoki A, Perc M. Probabilistic sharing solves the problem of costly punishment. New Journal of Physics. 2014. Aug 7;16(8):083016. doi: 10.1088/1367-2630/16/8/083016 [DOI] [Google Scholar]
  • 33. Chen X, Sasaki T, Brännström Å, Dieckmann U. First carrot, then stick: how the adaptive hybridization of incentives promotes cooperation. Journal of the royal society interface. 2015. Jan 6;12(102):20140935. doi: 10.1098/rsif.2014.0935 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34. Rand DG, Armao JJ IV, Nakamaru M, Ohtsuki H. Anti-social punishment can prevent the co-evolution of punishment and cooperation. Journal of theoretical biology. 2010. Aug 21;265(4):624–32. doi: 10.1016/j.jtbi.2010.06.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35. Rand DG, Nowak MA. The evolution of antisocial punishment in optional public goods games. Nature communications. 2011. Aug 16;2(1):1–7. doi: 10.1038/ncomms1442 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36. Hauser OP, Nowak MA, Rand DG. Punishment does not promote cooperation under exploration dynamics when anti-social punishment is possible. Journal of theoretical biology. 2014. Nov 7;360:163–71. doi: 10.1016/j.jtbi.2014.06.041 [DOI] [PubMed] [Google Scholar]
  • 37. Herrmann B, Thöni C, Gächter S. Antisocial punishment across societies. Science. 2008. Mar 7;319(5868):1362–7. doi: 10.1126/science.1153808 [DOI] [PubMed] [Google Scholar]
  • 38. Denant-Boemont L, Masclet D, Noussair CN. Punishment, counterpunishment and sanction enforcement in a social dilemma experiment. Economic theory. 2007. Oct 1;33(1):145–67. doi: 10.1007/s00199-007-0212-0 [DOI] [Google Scholar]
  • 39. Gächter S, Herrmann B. Reciprocity, culture and human cooperation: previous insights and a new cross-cultural experiment. Philosophical Transactions of the Royal Society B: Biological Sciences. 2009. Mar 27;364(1518):791–806. doi: 10.1098/rstb.2008.0275 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Nikiforakis N. Punishment and counter-punishment in public good games: Can we really govern ourselves?. Journal of Public Economics. 2008. Feb 1;92(1-2):91–112. doi: 10.1016/j.jpubeco.2007.04.008 [DOI] [Google Scholar]
  • 41. Mora T, Bialek W. Are biological systems poised at criticality?. Journal of Statistical Physics. 2011. Jul;144(2):268–302. doi: 10.1007/s10955-011-0229-4 [DOI] [Google Scholar]
  • 42. Hidalgo J, Grilli J, Suweis S, Munoz MA, Banavar JR, Maritan A. Information-based fitness and the emergence of criticality in living systems. Proceedings of the National Academy of Sciences. 2014. Jul 15;111(28):10095–100. doi: 10.1073/pnas.1319166111 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Bialek W, Cavagna A, Giardina I, Mora T, Silvestri E, Viale M, et al. Statistical mechanics for natural flocks of birds. Proceedings of the National Academy of Sciences. 2012. Mar 27;109(13):4786–91. doi: 10.1073/pnas.1118633109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44. Salahshour M. Phase Diagram and Optimal Information Use in a Collective Sensing System. Physical review letters. 2019. Aug 6;123(6):068101. doi: 10.1103/PhysRevLett.123.068101 [DOI] [PubMed] [Google Scholar]
  • 45. Perc M, Jordan JJ, Rand DG, Wang Z, Boccaletti S, Szolnoki A. Statistical physics of human cooperation. Physics Reports. 2017. May 8;687:1–51. doi: 10.1016/j.physrep.2017.05.004 [DOI] [Google Scholar]
  • 46. Ostrom E. Governing the commons: The evolution of institutions for collective action. Cambridge university press; 1990. Nov 30. [Google Scholar]
  • 47. Veszteg RF, Narhetali E. Public-good games and the Balinese. International Journal of Social Economics. 2010. Aug 3. doi: 10.1108/03068291011062461 [DOI] [Google Scholar]
  • 48. Salahshour M. Freedom to choose between public resources promotes cooperation. PLoS computational biology. 2021. Feb 8;17(2):e1008703. doi: 10.1371/journal.pcbi.1008703 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Salahshour M. Evolution of cooperation in costly institutes. arXiv preprint arXiv:2102.11432. 2021 Feb 23.
  • 50.Salahshour M. Evolution of cooperation and consistent personalities in public goods games. arXiv preprint arXiv:2010.05307. 2020 Oct 11. [DOI] [PMC free article] [PubMed]
  • 51. Li X, Jusup M, Wang Z, Li H, Shi L, Podobnik B, et al. Punishment diminishes the benefits of network reciprocity in social dilemma experiments. Proceedings of the National Academy of Sciences. 2018. Jan 2;115(1):30–5. doi: 10.1073/pnas.1707505115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Hauert C, Doebeli M. Spatial structure often inhibits the evolution of cooperation in the snowdrift game. Nature. 2004. Apr;428(6983):643–6. doi: 10.1038/nature02360 [DOI] [PubMed] [Google Scholar]
  • 53. Sasaki T, Uchida S, Chen X. Voluntary rewards mediate the evolution of pool punishment for maintaining public goods in large populations. Scientific Reports. 2015. Mar 10;5(1):1–6. doi: 10.1038/srep08917 [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision Letter 0

The Anh Han

7 May 2021

PONE-D-21-10893

The evolution of punishing institutions

PLOS ONE

Dear Dr. Salahshour,

Thank you for submitting your manuscript to PLOS ONE. After careful consideration, we feel that it has merit but does not fully meet PLOS ONE’s publication criteria as it currently stands. Therefore, we invite you to submit a revised version of the manuscript that addresses the points raised during the review process.

The two reviewers have provided constructive and detailed comments. They both agreed that the work is interesting, relevant and would provide a good contribution. However, there are several aspects of the paper that need improvements, for which the reviewers have provided constructive suggestions. Please carefully consider them in the revision of your manuscript.

Please submit your revised manuscript by Jun 21 2021 11:59PM. If you will need more time than this to complete your revisions, please reply to this message or contact the journal office at plosone@plos.org. When you're ready to submit your revision, log on to https://www.editorialmanager.com/pone/ and select the 'Submissions Needing Revision' folder to locate your manuscript file.

Please include the following items when submitting your revised manuscript:

  • A rebuttal letter that responds to each point raised by the academic editor and reviewer(s). You should upload this letter as a separate file labeled 'Response to Reviewers'.

  • A marked-up copy of your manuscript that highlights changes made to the original version. You should upload this as a separate file labeled 'Revised Manuscript with Track Changes'.

  • An unmarked version of your revised paper without tracked changes. You should upload this as a separate file labeled 'Manuscript'.

If you would like to make changes to your financial disclosure, please include your updated statement in your cover letter. Guidelines for resubmitting your figure files are available below the reviewer comments at the end of this letter.

If applicable, we recommend that you deposit your laboratory protocols in protocols.io to enhance the reproducibility of your results. Protocols.io assigns your protocol its own identifier (DOI) so that it can be cited independently in the future. For instructions see: http://journals.plos.org/plosone/s/submission-guidelines#loc-laboratory-protocols. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols.

We look forward to receiving your revised manuscript.

Kind regards,

The Anh Han, Ph.D.

Academic Editor

PLOS ONE

Journal Requirements:

When submitting your revision, we need you to address these additional requirements.

  1. Please ensure that your manuscript meets PLOS ONE's style requirements, including those for file naming. The PLOS ONE style templates can be found at

https://journals.plos.org/plosone/s/file?id=wjVg/PLOSOne_formatting_sample_main_body.pdf and

https://journals.plos.org/plosone/s/file?id=ba62/PLOSOne_formatting_sample_title_authors_affiliations.pdf

  1. Please include captions for your Supporting Information files at the end of your manuscript, and update any in-text citations to match accordingly. Please see our Supporting Information guidelines for more information: http://journals.plos.org/plosone/s/supporting-information.

Additional Editor Comments:

The three reviewers have provided constructive and detailed comments. They all agreed that the work is interesting, relevant and would provide a good contribution. However, there are several aspects of the paper that need improvements, for which the reviewers have provided constructive suggestions. Please carefully consider them in the revision of your manuscript.

[Note: HTML markup is below. Please do not edit.]

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

2. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: N/A

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: Remarks to the Author:

In this manuscript, the author has investigated the coevolution of social punishment and antisocial punishment in the public goods game. By using numerical calculations and simulations, they find that there exists an optimal weight for second-order that facilitates the evolution of cooperation. Furthermore, the author has extended the model to structured populations, and found that the mechanism is further strengthened in a cooperation favoring environment. The motivation of this manuscript is clear and the idea is novel. Thus I recommend the publication of this meaningful manuscript after some minor corrections detailed below:

1. In the Model section, the author uses the term “anti-social punishment”. As a corresponding, I suggest that the author change “social” into “pro-social” (line 61).

2. I suggest that the author change the C, D, PC and PD in Figures 1 and 2 to italics.

3. The author analyzes the replication dynamics by used numerical calculation and simulations, so I suggest the author change the title "The simulations and analytical solutions" to "The simulations and numerical solutions" (line 287).

4. In introduction section, the authors mentioned that altruistic punishment can play an important role in the evolution of cooperation. However, there are still some important literatures have not been reviewed, such as PLoS ONE, 16 (1), e0244592; New Journal of Physics, 16 (8), 083016; Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence. 2016: 2494-2500;.Journal of the Royal Society Interface, 12 (102), 20140935; Mathematical Models and Methods in Applied Sciences, 29 (11), 2127-2149. These literatures are very relevant to the subject of punishment and cooperation.

5. The reference format should be adjusted to meet the requirements of PLoS ONE.

Reviewer #2: The author studied how the punishing institution can evolve with cooperation in the presence of an antisocial punishment strategy. The results showed that punishing cooperators (who investigate both public and punishment pools) can evolve even without cooperation favoring mechanisms when the punishment enhancement factor is high enough and the institution punishes both defectors and the second-order free riders in an appropriate ratio. It implies that punishing both the first-order and the second-order free-riders is essential for the evolution of the punishing institution. The model explanation is clear, and the paper is well organized. However, there are some places to be improved for better readability. So, I recommend the publication of the manuscript in PLOS ONE after a minor revision.

[minor suggestions]

1. In the abstract and introduction, "general environment" sounds unclear. It would be better to describe it specifically (e.g., environments with cooperation favoring mechanisms).

2. In the last sentence of the first paragraph on page 1, the author said "However, the important question that whether altruistic punishment on its own can promote cooperation has remained unaddressed." But I am not sure that the main text can answer this. Did you mention it because the transition from PD to PC happens much lower r-value than the transition from D to C? If so, it would be good to highlight it.

3. Fig. S.4. is explained before Fig. S.3. It would be better to change the order of figures.

Reviewer #3: The manuscript presents a model of evolution of cooperation considering pool punishment. The model considers the evolution of both punisher cooperators and defectors doing antisocial punishment. The main contribution of this model is to show that under the right conditions, cooperation and punishment can evolve without requiring other mechanisms favouring cooperation.

Overall, the paper is well written, the goal and the contribution are well presented. The model used is straightforward, easy to understand and adapted to answer the research question. The results are well presented, with the author using phase transition plots to provide a global view of the results. I am in favour of accepting the manuscript with minor modifications.

My main comment is that the manuscript could be improved by explaining clearly why the model presented obtains cooperation using pool punishment when previous models did not. For instance, in the paper of (Sigmund et al. 2010), they do not observe the evolution of punishment and cooperation when they consider that the participation to pool punishment is compulsory. Yet, they appear to have a similar model than the one presented here. Why does the model presented here find different results? Is it because the author explores a larger parameter space, e.g. high punishment enhancement factor? Or is it because the group size is fixed to a low value of 9? Explaining this difference of results would help the claim of the manuscript to get accepted by a reader, in particular as the novelty of contributions in social evolution are often debated (Kay, Keller, and Lehmann 2020).

On the same note, the authors also claims that it introduces a “public punishing game” where individuals can pay a fee to an institution that punish both free-riders not contributing to the public good game and free-riders not contributing to the punishment pool. However, I struggle to see the difference between this “public punishing game” and pool-punishment systems introduced in previous models , such as in (Perc 2012; Sigmund et al. 2010). If there are no fundamental differences, the author needs to cite these previous work. If there are differences, the author should make them clear in the manuscript.

The supplementary material is very long and I am not sure it is necessary to include all these information (I have read it but I did not proofread it in details). It is not very clear to me the interest of characterising the transitions and I do not know of a community with an interest on this particular topic. If I am not mistaken, the author could cut down the supplementary materials.

A list of minor comments:

• I recommend changing the title because institutions are not truly evolving in this model.

• Abstract: “Nevertheless, theoretical work has been unable to show how this is possible.” . This is not exactly true (as the author explains after) and it downplays a lot of theoretical work. I suggest removing it.

• L29: Remove “its exclusion is a point which a proper theory needs to address.”. This sounds like previous theory were not “proper”.

• L62: What would happen if the amount invested in social punishment can differ from the amount invested in antisocial punishment?

• Add a sentence in the model definition that says that the extended description of the method can be found in supplementary material.

• I think the most common term to describe the proportion of individual with a given strategy is frequency instead of density.

• L125: Why do the simulations not always use the same population size? There are results with population size of 10 000, 40 000 and 90 000.

• L137: I am not sure this result comes (exclusively) from the rock-paper-scissor dynamic. Are not punishers rare for this transition because the punishment enhancement factor is low? Is this result not coming from the spatial structure which is known to favour the evolution of cooperation?

• L180: “This implies that all the transitions but the D - C transition are discontinuous.” What does a discontinuous transition mean exactly?

• L201: “It also reveals network structure can be detrimental to the evolution of punishing institutions.” Is it truly the case? Yes, the punishment enhancement factor needs to be slightly higher, but the public good enhancement factor can be much lower (2 instead of 9).

• L205: “Finally, we have seen that being close to a physical phase transition is beneficial for the evolution of punishing institutions.” Is this the case only in the spatial model? If yes, specify it.

• L211: “public punishing game introduced here,” Again, it is not clear in which way the public punishing game of the manuscript is new. It looks similar to previous pool punishment models.

• The model presented in (Perc 2012) is very similar to the spatial model in this manuscript. Could they be quickly compared in the discussion?

• An important parameter in the model is the punishment enhancement factor and the range of value considered can be high (up to 9). Is there a reason to think that such value would exist in real world? If yes, could the author provide an example?

• Supplementary materials S3.1 L4: a “e” is missing in “th”

Bibrliography:

Kay, Tomas, Laurent Keller, and Laurent Lehmann. 2020. “The Evolution of Altruism and the Serial Rediscovery of the Role of Relatedness.” Proceedings of the National Academy of Sciences, 202013596.

Perc, Matjaž. 2012. “Sustainable Institutionalized Punishment Requires Elimination of Second-Order Free-Riders.” Scientific Reports.

Sigmund, Karl, Hannelore De Silva, Arne Traulsen, and Christoph Hauert. 2010. “Social Learning Promotes Institutions for Governing the Commons.” Nature 466 (7308): 861–63.

**********

6. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

[NOTE: If reviewer comments were submitted as an attachment file, they will be attached to this email and accessible via the submission site. Please log into your account, locate the manuscript record, and check for the action link "View Attachments". If this link does not appear, there are no attachment files.]

While revising your submission, please upload your figure files to the Preflight Analysis and Conversion Engine (PACE) digital diagnostic tool, https://pacev2.apexcovantage.com/. PACE helps ensure that figures meet PLOS requirements. To use PACE, you must first register as a user. Registration is free. Then, login and navigate to the UPLOAD tab, where you will find detailed instructions on how to use the tool. If you encounter any issues or have any questions when using PACE, please email PLOS at figures@plos.org. Please note that Supporting Information files do not need this step.

PLoS One. 2021 Aug 6;16(8):e0254860. doi: 10.1371/journal.pone.0254860.r002

Author response to Decision Letter 0


21 Jun 2021

All the reviewer's suggestions are applied in the revised version. A point-by-point response to reviewers is submitted as a separate file, where changes in the manuscript is addressed or responses to their comments is provided.

Attachment

Submitted filename: response.pdf

Decision Letter 1

The Anh Han

6 Jul 2021

Evolution of prosocial punishment in unstructured and structured populations and in the presence of antisocial punishment

PONE-D-21-10893R1

Dear Dr. Salahshour,

We’re pleased to inform you that your manuscript has been judged scientifically suitable for publication and will be formally accepted for publication once it meets all outstanding technical requirements.

Within one week, you’ll receive an e-mail detailing the required amendments. When these have been addressed, you’ll receive a formal acceptance letter and your manuscript will be scheduled for publication.

An invoice for payment will follow shortly after the formal acceptance. To ensure an efficient process, please log into Editorial Manager at http://www.editorialmanager.com/pone/, click the 'Update My Information' link at the top of the page, and double check that your user information is up-to-date. If you have any billing related questions, please contact our Author Billing department directly at authorbilling@plos.org.

If your institution or institutions have a press office, please notify them about your upcoming paper to help maximize its impact. If they’ll be preparing press materials, please inform our press team as soon as possible -- no later than 48 hours after receiving the formal acceptance. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information, please contact onepress@plos.org.

Kind regards,

The Anh Han, Ph.D.

Academic Editor

PLOS ONE

Additional Editor Comments (optional):

All comments from reviewers have been satisfactorily addressed. Only a few minor things that can be easily corrected during a later stage.

Reviewers' comments:

Reviewer's Responses to Questions

Comments to the Author

1. If the authors have adequately addressed your comments raised in a previous round of review and you feel that this manuscript is now acceptable for publication, you may indicate that here to bypass the “Comments to the Author” section, enter your conflict of interest statement in the “Confidential to Editor” section, and submit your "Accept" recommendation.

Reviewer #1: All comments have been addressed

Reviewer #2: All comments have been addressed

Reviewer #3: (No Response)

**********

2. Is the manuscript technically sound, and do the data support the conclusions?

The manuscript must describe a technically sound piece of scientific research with data that supports the conclusions. Experiments must have been conducted rigorously, with appropriate controls, replication, and sample sizes. The conclusions must be drawn appropriately based on the data presented.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

3. Has the statistical analysis been performed appropriately and rigorously?

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

4. Have the authors made all data underlying the findings in their manuscript fully available?

The PLOS Data policy requires authors to make all data underlying the findings described in their manuscript fully available without restriction, with rare exception (please refer to the Data Availability Statement in the manuscript PDF file). The data should be provided as part of the manuscript or its supporting information, or deposited to a public repository. For example, in addition to summary statistics, the data points behind means, medians and variance measures should be available. If there are restrictions on publicly sharing data—e.g. participant privacy or use of data from a third party—those must be specified.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

5. Is the manuscript presented in an intelligible fashion and written in standard English?

PLOS ONE does not copyedit accepted manuscripts, so the language in submitted articles must be clear, correct, and unambiguous. Any typographical or grammatical errors should be corrected at revision, so please note any specific errors here.

Reviewer #1: Yes

Reviewer #2: Yes

Reviewer #3: Yes

**********

6. Review Comments to the Author

Please use the space provided to explain your answers to the questions above. You may also include additional comments for the author, including concerns about dual publication, research ethics, or publication ethics. (Please upload your review as an attachment if it exceeds 20,000 characters)

Reviewer #1: The authors have carefully and thoughtfully revised this paper and it is now suitable for publication.

Reviewer #2: The authors have addressed all comments, and the answers and the revised manuscript are satisfying. Hence, I recommend the publication of the revised manuscript in Plos ONE.

Reviewer #3: I thank very much the author for taking in account my comments and for providing detailed responses to each of them. The author has made the modifications asked or has well justified when he did not. I am for accepting this revised manuscript. Find below minor and facultative comments on the new parts of this revised manuscript.

• I think the new title can still be improved. Based on previous title, the author could go for “The evolution of cooperation with pool-punishment institutions” (the problem I stated in my previous review was that evolution of institutions would consider that institutions change along time in this model while they emerge here). Another alternative is “Institutionalised pool-punishment can lead to the evolution of sustainable cooperation”. Those are just suggestions, and the author is free to keep the title as it is.

• The paragraph L108 to L135 could be simplified. I appreciate that the author gives all these details, but I think that a shorter paragraph could reach the reader more easily. Something along the lines of “Previous model of pool-punishment considered that the value of the fine does not depend of the amount invested by punisher. Contrary to past models, we argued that punishment resources should be based on the amount of contribution made by punishers to the punishment pool. This is the case in our model, where the punishment pool is regarded as a public resource, the contributions to which are multiplied by an enhancement factor and are used for punishment purposes.”

• The lines 130 to 135 would fit better in the discussion. I suggest moving them to L373.

• In Figure 2, the author talks about free punishment, that is c’ = 0. What is the amount of the fine in this case? From the model definition, it should be c’ * rho = 0 so no punishment?

• L371. Add a sentence to explain what the author means by the “punishment pool is considered a public resource”. Something like “In other words, in our model, the value of the fines received by punished individuals depends of the investment made by punishers”.

• The author added reference 7 (Kay et al) following my suggestion. However, I believe there was a misunderstanding. I put this reference in my review to support one of my point but I do not think it fits in the paper. I understand that this was confusing in the review but I suggest to remove it from the references in the manuscript.

**********

7. PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #3: No

Acceptance letter

The Anh Han

29 Jul 2021

PONE-D-21-10893R1

Evolution of prosocial punishment in unstructured and structured populations and in the presence of antisocial punishment

Dear Dr. Salahshour:

I'm pleased to inform you that your manuscript has been deemed suitable for publication in PLOS ONE. Congratulations! Your manuscript is now with our production department.

If your institution or institutions have a press office, please let them know about your upcoming paper now to help maximize its impact. If they'll be preparing press materials, please inform our press team within the next 48 hours. Your manuscript will remain under strict press embargo until 2 pm Eastern Time on the date of publication. For more information please contact onepress@plos.org.

If we can help with anything else, please email us at plosone@plos.org.

Thank you for submitting your work to PLOS ONE and supporting open access.

Kind regards,

PLOS ONE Editorial Office Staff

on behalf of

Dr. The Anh Han

Academic Editor

PLOS ONE

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Text. Supplemental information text.

    An overview of the models is given. The replicator dynamics of the non-wasteful punishment model is derived. The models are further analyzed and the robustness of the results is argued. Supplementary Videos are explained.

    (PDF)

    S1 Video. Supplementary video 1.

    An example of the time evolution of the wasteful punishment model in the structured population. See S1 Text for a description of video.

    (AVI)

    S2 Video. Supplementary video 2.

    An example of the time evolution of the wasteful punishment model in the structured population. See S1 Text for a description of video.

    (AVI)

    S3 Video. Supplementary video 3.

    An example of the time evolution of the wasteful punishment model in the structured population. See S1 Text for a description of video.

    (AVI)

    S4 Video. Supplementary video 4.

    An example of the time evolution of the wasteful punishment model in the structured population. See S1 Text for a description of video.

    (AVI)

    S5 Video. Supplementary video 5.

    An example of the time evolution of the wasteful punishment model in the structured population. See S1 Text for a description of video.

    (AVI)

    Attachment

    Submitted filename: response.pdf

    Data Availability Statement

    All relevant data are within the paper and its Supporting information files.


    Articles from PLoS ONE are provided here courtesy of PLOS

    RESOURCES