Abstract
Selfishness is seldom considered a group-beneficial strategy. In the typical evolutionary formulation, altruism benefits the group, selfishness undermines altruism, and the purpose of the model is to identify mechanisms, such as kinship or reciprocity, that enable altruism to evolve. Recent models have explored punishment as an important mechanism favoring the evolution of altruism, but punishment can be costly to the punisher, making it a form of second-order altruism. This model identifies a strategy called “selfish punisher” that involves behaving selfishly in first-order interactions and altruistically in second-order interactions by punishing other selfish individuals. Selfish punishers cause selfishness to be a self-limiting strategy, enabling altruists to coexist in a stable equilibrium. This polymorphism can be regarded as a division of labor, or mutualism, in which the benefits obtained by first-order selfishness help to “pay” for second-order altruism.
Keywords: punishment, cooperation, mutualism, game theory, public goods
Selfishness is rarely described as a group-beneficial strategy. Selfish strategies are labeled as deviant, cheating, free-riding, egoistic (1), but most of all, as undermining altruism and cooperation (2). In contrast, altruistic and cooperative strategies, almost by definition, benefit the group, often at the expense of the individual actor (2). In the typical evolutionary model, the invasion of selfish strategies into a group leads to the dissolution of altruism. Examples include scroungers among foraging groups (3, 4), infanticide of unrelated infants (5), sneaking worker reproduction in eusocial insect colonies (6, 7), and failure to help in territorial defense (8, 9). The experimental economics literature amply demonstrates the corrosive effects of selfishness in human social interactions. In public-goods games, participants start out moderately generous but quickly withdraw their cooperation in the presence of selfish cheaters (10–13).
Earlier evolutionary models focused on how altruism can evolve through nonrandom interactions or guarded cooperation in dyadic interactions (14–17). More recently, interest has focused on punishment as a mechanism for maintaining altruism in sizeable groups (10, 11, 13, 18–23). Punishment can be effective in curtailing selfish behavior within a group, but it can also be costly for the punisher, compared with cooperators in the same group who do not punish, thereby qualifying as a form of second-order altruism. Individuals who are altruistic in first-order and second-order interactions are at a double disadvantage (10, 23–25). The solution to this problem might lie where least expected.
An often overlooked aspect of game theory is that selfish individuals also have an incentive to punish other selfish individuals, thereby increasing the proportion of cooperators for them to exploit. This behavior might seem hypocritical in moral terms, but it makes sense as an evolutionary strategy. It can even be looked upon as a division of labor, or mutualism, whereby cheating during first-order interactions becomes a “payment” for altruism (punishment) in second-order interactions. A combination of strategies (selfish punisher plus altruistic nonpunisher) that split the costs of first- and second-order altruism can be superior to a single-altruist/punisher strategy that bears both costs.
In an earlier computer simulation model (26, 27), we showed that when first- and second-order altruism are modeled as initially uncorrelated traits, a negative correlation robustly develops between the two, although the size of the correlation depends upon a number of parameters. Here, we present an analytical model demonstrating how altruistic nonpunishers and selfish punishers, through the benefit of division of labor, can exist in a stable equilibrium.
The Model
First-Order Altruism and Selfishness.
The model emulates an experimental economics game in which each member of a group is provided an endowment, b, that can be kept or invested in a public good. The combined investment in the public good is multiplied by a factor, m, and distributed equally to everyone in the group. The total payoff of each individual (the proportion of the endowment kept for oneself plus one's share of the public good) is assumed to be linearly related to fitness. This scenario can easily be related to biological situations, such as investing effort in a hunt in which everything captured will be shared. The model considers the two pure strategies of investing all (altruist) or none (selfish) of one's endowment. For a model that includes a continuum of endowment strategies, see ref. 26. If p is the proportion of altruists within a single group, then the increments in the fitness of the altruists and selfish types after one round of interactions is:
Baseline fitness is ignored because it is the same for both types. It is clear that in the absence of punishment, selfish individuals always have the highest fitness within the group because all group members obtain an equal share of the contributions from the altruists, yet selfish individuals keep rather than donate b. Therefore, an altruist would obtain a greater fitness by switching to the selfish noncontributing alternative, resulting in a net gain of
(selfish gain of b minus the now share of the reduced group payoff caused by the loss of a single altruist). However, the group has the highest fitness when everyone is an altruist, resulting in the classic prisoner's dilemma situation.
Punishment.
Now consider a method of social control in which individuals can punish selfish group members at a personal cost c, which results in the selfish individual losing its acquired energy for a given round of the game. The cost is incurred for every selfish member of the group, and punished individuals risk the loss of their previously acquired energy (b + pbm) at a probability equal to the frequency of punishers that reside in the group. Punishment can be exhibited by either altruists or selfish individuals, yielding four possible strategies (altruistic punishers, selfish punishers, altruistic nonpunishers, selfish nonpunishers) at frequencies of pap, psp, pan, and psn, respectively. In a single group, the average fitness W of the four strategies are as follows:
The fitness of altruistic nonpunishers (Eq. 3) is exclusively based on the altruistic contributions of group members and remains unaffected by punishment.
The fitness of selfish nonpunishers (Eq. 4) is determined by the sum of withholding cooperation b, and the exploitation of altruists (pan + pap)bm, however, is only retained in the proportion that punishers are not present in the group [1 − (pap + psp)].
Altruistic punishers (Eq. 5) retain the same fitness as altruistic nonpunishers yet bear a cost for punishing all selfish members within the group.
Selfish punishers (Eq. 6) exploit the altruistic contributions of group members yet punish all other selfish group members.
Punishment reduces the fitness of selfish strategies, lowering their fitness within groups. However, punishers in turn have a lower fitness than nonpunishers within the same group. The act of punishment, taken by itself, can be regarded as altruistic because punishers pay a personal cost to provide a benefit (the removal of selfish individuals) that is shared by nonpunishers. We will explore the dynamics of the model by first considering two of the strategies and then adding additional strategies.
Altruistic Punishers vs. Selfish Nonpunishers.
Altruistic punishers and selfish nonpunishers form an unstable equilibrium with each strategy exhibiting positive frequency dependence, as shown in Fig. 1. Altruistic punishment can evolve if it begins above a threshold frequency. However, its stability is eliminated with the addition of altruistic nonpunishers, as shown in Fig. 2. By not partaking in the costs of punishment, altruistic nonpunishers outcompete altruistic punishers and undermine punishment, resulting in the establishment of selfishness as an ESS (for population trajectories, see Fig. 3). To summarize the model so far, the viability of altruistic punishment requires two conditions: their initial frequency must be sufficiently high, and second-order free-riding (altruistic nonpunishment) must be excluded as a superior strategy.
To curtail second-order free-riding, punishment can be expanded to include altruists who do not punish. This tactic is vulnerable to third-order free-riding, leading to a seemingly infinite regress (for a possible solution to this problem, see ref. 28). Alternatively, punishers can recoup their cost of punishment by being selfish in first-order interactions.
Selfish Punishment.
Selfish punishment is a self-limiting strategy that provides a form of second-order altruism by punishing selfishness. In the course of punishing all selfish individuals (including other selfish punishers), selfish punishers remain restricted through negative density dependence, promoting altruism within groups. In a simple two-strategy model, selfish punishers and altruistic nonpunishers form a stable equilibrium, as shown in Fig. 4. Under certain conditions, this equilibrium is robust against the invasion of selfish nonpunishers.
Because selfish punishers refrain from expelling themselves from the group, an individual punisher has a
less chance of expulsion than a selfish nonpunisher within the same group. Therefore, if the benefit of remaining in the group
is greater than the cost of punishing all other selfish individuals
the equilibrium between selfish punishers and altruistic nonpunishers remains stable. This equilibrium range is expanded with increasing the values of b and m as well as increasing the frequency of altruists residing in the group. Furthermore, decreasing group size N results in a greater discrepancy between the chances of expulsion, favoring selfish punishers over selfish nonpunishers within a group.
Groups comprising selfish punishers and altruistic nonpunishers will reconstitute the stable equilibrium regardless of initial frequencies, provided selfish nonpunishers are sufficiently scarce in the population (Fig. 5; for population trajectories, see Fig. 6). Once established, the within-group stability will maintain altruism at a constant frequency in a multigroup population.
Discussion
The majority of evolutionary models of social behavior have treated selfishness as an impediment to altruism (14, 16, 29, 30). When punishment is modeled as a mechanism that facilitates the evolution of altruism, the punishers are usually assumed to be altruists (13, 18, 19–23). Additional mechanisms that might facilitate the evolution of altruism include conformance transmission within groups (31) and withholding cooperation from selfish individuals (32). In all of these models, the possibility that selfishness might facilitate the evolution of altruism has been largely overlooked.
Inherent in evolutionary game theory is reduction in the fitness of selfish individuals with the increasing frequency of selfishness within groups, which implies that selfish individuals, in addition to altruists, have an incentive to punish selfishness (33). Selfish individuals are in a better position to do so because they have increased their fitness by their first-order interactions. Altruistic punishment is the least-fit strategy within a single group because of the double cost of first- and second-order altruism.
Here, we have shown that selfish strategies during first-order interactions can provide a more stable form of punishment within groups than altruistic punishers. Although this model is constrained to one round of interactions, a multiround computer simulation model with a continuum of altruism and punishment strategies (26) reaches similar conclusions, resulting in a high frequency of altruistic nonpunishers and a low frequency of selfish punishers that keep selfishness at bay. Consistent with predictions from the analytic model presented here, as punishment costs rise or group size is increased in the simulation model, punishers contribute less toward cooperation in the simulation model, exemplifying the tradeoff between first- and second-order altruism. Interestingly, selfish punishers do not evolve to overexploit altruistic nonpunishers beyond retrieving the costs of punishment.
A discussion of terminology is in order because the definition of the terms “altruism” and “selfishness” are notoriously variable (2, 34). For example, altruism is defined in terms of relative fitness within groups in multilevel selection models and in terms of absolute fitness in inclusive fitness theory models (35). When an individual increases the absolute fitness of a recipient at an absolute cost to itself, its behavior appears unambiguously altruistic. Yet, when the same individual is paired with another altruist, the trading of benefits makes the behavior appear like a form of self-interested mutualism. Selfishness can be individually advantageous in a group without punishers but maladaptive in a group with punishers, and so on.
We do not insist upon a given set of definitions (for a discussion of pluralism in evolutionary models of social behavior, see refs. 36–38), as long as sufficient information is provided to translate among perspectives. We have based our definitions on what takes place during each stage of a two-stage interaction. Thus, contributing to a public good counts as altruistic, and keeping one's endowment counts as selfish during the first stage, even though punishment during the second stage might cause selfishness during the first stage to become maladaptive. Similarly, punishment during the second stage counts as altruistic, and failing to punish counts as selfish, even though the combined two-stage strategy of selfish punishment can be selectively advantageous within groups, given a sufficient number of altruistic nonpunishers. These definitions enable us to retain the same definitions for behaviors during a single stage, as opposed to changing definitions based on whether the behaviors are advantageous or disadvantageous in various combinations. However, we acknowledged that the combination of selfish punishers and altruistic nonpunishers can be viewed as an internally stable mutualism and therefore self-interested in that sense of the word.
In most models of altruism, including altruistic punishment, the altruistic behavior is selectively disadvantageous within groups and requires the differential productivity of groups (between-group selection) to be maintained in the population. The outcome depends upon the balance between levels of selection. A trait that has a large effect on collective fitness at a very small personal cost can easily evolve by group selection (e.g., in randomly formed sizeable groups), despite qualifying as altruistic, as the term is defined within multilevel selection theory.
Some evolutionary models of social behavior result in multiple local equilibria, which are internally stable by definition but can differentially contribute to the total gene pool. Group selection among local equilibria can result in social interactions that are simultaneously good for the group and selectively advantageous within groups. Conformance transmission can cause punishment to become internally stable within groups (31). Our model provides another way for punishment to become internally stable, enabling groups with punishers to persist indefinitely and out-compete groups without punishers over the long term. Group selection might also favor the evolution of modifier traits that adjust the internal equilibrium so that most individuals are altruistic nonpunishers with only a few selfish punishers.
In nonhuman species, selfish punishment (which was termed “corrupt policing”) has been observed in the tree wasp Dolichovespula sylvestris. Workers that police the reproduction of other workers are more likely to lay eggs themselves (7). Worker policing in monogynous and monandrous colonies of the wasp Polistes chinensis antennalis exceeds that predicted by relatedness and was partly attributed to the immediate reproductive benefit of selfishly laid eggs (39). Scrub jays that tend to steal caches from other jays are also more defensive of their own caches, providing evidence of selfish behavior as a self-limiting strategy (40).
Game theorists refer to a “replicator dynamic” as any process whereby the most successful behavioral strategy increases in frequency in a population, which includes but goes beyond genetic evolution (41). The concept of selfish punishment therefore might apply to a diversity of human social systems, from the Castellians of medieval Europe (42), to mafia-like protection rackets, to the formal compensation of specialized punishers (police) for their services. In psychological research, a study using fictional scenarios to investigate altruistic punishment revealed that individuals who were most inclined to cheat were also most inclined to punish and spent more to punish, in part because they had amassed more as a result of their selfish behavior (33). We hope that our model stimulates interest in the concept of selfish punishment in both humans and nonhuman species.
Acknowledgments.
We are indebted to D. L. Farrell, A. B. Clark, J. Shepherd, P. Richerson, K. Panchanathan, M. Dlugos, D. O'Brien, The E. N. Huyck Preserve and members of EvoS, Binghamton University Evolutionary Studies Program, for helpful discussion. This work was supported by the National Science Foundation Alliance for Graduate Education and the Professoriate.
Footnotes
The authors declare no conflict of interest.
This article is a PNAS Direct Submission.
References
- 1.Becker GS. Altruism, egoism and genetic fitness: Economics and sociobiology. J Econ Lit. 1976;14:817–826. [Google Scholar]
- 2.Wilson DS, Wilson EO. Rethinking the theoretical foundation of sociobiology. Q Rev Biol. 2007;82:327–348. doi: 10.1086/522809. [DOI] [PubMed] [Google Scholar]
- 3.Barnard CJ, Sibly RM. Producers and scroungers: A general model and its application to captive flocks of house sparrows. Anim Behav. 1981;29:343–550. [Google Scholar]
- 4.Beauchamp G, Giraldeau LA. Group foraging revisited: Information-sharing or producer-scrounger game? Am Nat. 1996;147:738–743. [Google Scholar]
- 5.Packer C, Pusey AE. Adaptations of female lions to infanticidal incoming males. Am Nat. 1983;121:716–728. [Google Scholar]
- 6.Monnin T, Ratnieks FLW. Policing in queenless ponerine ants. Behav Ecol. 2001;50:97–108. [Google Scholar]
- 7.Wenseleers T, Tofilski A, Ratnieks FLW. Queen and worker policing in the tree wasp Dolichovespula sylvestris. Behav Ecol Sociobiol. 2005;58:80–86. [Google Scholar]
- 8.Grinnell J. Modes of cooperation during territorial defense by African lions. Hum Nat Int Bios. 2002;13:85–104. doi: 10.1007/s12110-002-1015-4. [DOI] [PubMed] [Google Scholar]
- 9.Grinnell J, Packer C, Pusey AE. Cooperation in male lions: Kinship, reciprocity or mutualism? Anim Behav. 1995;49:95–105. [Google Scholar]
- 10.Yamagishi T. The provision of a sanctioning system as a public good. J Pers Soc Psychol. 1986;51:110–116. [Google Scholar]
- 11.Yamagishi T. Seriousness of social dilemmas and the provision of a sanctioning system. Soc Psych Q. 1988;51:32–42. [Google Scholar]
- 12.Fehr E, Gachter S. Cooperation and punishment in public goods experiments. Am Econ Rev. 2000;90:980–994. [Google Scholar]
- 13.Fehr E, Gachter S. Altruistic punishment in humans. Nature. 2002;415:137–140. doi: 10.1038/415137a. [DOI] [PubMed] [Google Scholar]
- 14.Hamilton WD. The genetical evolution of social behaviour, I and II. J Theor Biol. 1964;7:1–16. doi: 10.1016/0022-5193(64)90038-4. [DOI] [PubMed] [Google Scholar]
- 15.Hamilton WD. In: Biosocial Anthropology. Fox R, editor. London: Malaby Press; 1975. pp. 133–155. [Google Scholar]
- 16.Axelrod R, Hamilton WD. The evolution of cooperation. Science. 1981;211:1390–1396. doi: 10.1126/science.7466396. [DOI] [PubMed] [Google Scholar]
- 17.Maynard Smith J. Evolution and the Theory of Games. Cambridge, UK: Cambridge Univ Press; 1982. [Google Scholar]
- 18.Gintis H. Strong reciprocity and human sociality. J Theor Biol. 2000;206:169–179. doi: 10.1006/jtbi.2000.2111. [DOI] [PubMed] [Google Scholar]
- 19.Fehr E, Fischbacher U, Gachter S. Strong reciprocity, human cooperation, and the enforcement of social norms. Hum Nat Int Bios. 2002;13:1–25. doi: 10.1007/s12110-002-1012-7. [DOI] [PubMed] [Google Scholar]
- 20.Boyd R, Gintis H, Bowles S, Richerson PJ. The evolution of altruistic punishment. Proc Natl Acad Sci USA. 2003;100:3531–3535. doi: 10.1073/pnas.0630443100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Fehr E, Fischbacher U. The nature of human altruism. Nature. 2003;425:785–791. doi: 10.1038/nature02043. [DOI] [PubMed] [Google Scholar]
- 22.Gintis H, Bowles S, Boyd R, Fehr E. Explaining altruistic behavior in humans. Evol Hum Behav. 2003;24:153–172. [Google Scholar]
- 23.Fehr E. Don't lose your reputation. Nature. 2004;432:449–450. doi: 10.1038/432449a. [DOI] [PubMed] [Google Scholar]
- 24.Bowles S, Gintis H. Homo reciprocans. Nature. 2002;415:125–128. doi: 10.1038/415125a. [DOI] [PubMed] [Google Scholar]
- 25.Bowles S, Gintis H. The evolution of strong reciprocity: Cooperation in heterogeneous populations. J Theor Popul Biol. 2004;65:17–28. doi: 10.1016/j.tpb.2003.07.001. [DOI] [PubMed] [Google Scholar]
- 26.Eldakar OT, Farrell DL, Wilson DS. Selfish punishment: Altruism can be maintained by competition among cheaters. J Theor Biol. 2007;249:198–205. doi: 10.1016/j.jtbi.2007.07.024. [DOI] [PubMed] [Google Scholar]
- 27.Nakamaru M, Iwasa Y. The coevolution of altruism and punishment: Role of the selfish punisher. J Theor Biol. 2006;240:475–488. doi: 10.1016/j.jtbi.2005.10.011. [DOI] [PubMed] [Google Scholar]
- 28.Boyd R, Richerson PJ. Punishment allows the evolution of cooperation (or anything else) in sizable groups. Ethol Sociobiol. 1992;13:171–195. [Google Scholar]
- 29.Sober E, Wilson DS. Unto Others: The Evolution and Psychology of Unselfish Behavior. Cambridge, MA: Harvard Univ Press; 1998. [Google Scholar]
- 30.Trivers RL. The evolution of reciprocal altruism. Q Rev Biol. 1971;46:35–57. [Google Scholar]
- 31.Henrich J, Boyd R. Why people punish defectors: Weak conformist transmission can stabilize costly enforcement of norms in cooperative dilemmas. J Theor Biol. 2001;208:79–89. doi: 10.1006/jtbi.2000.2202. [DOI] [PubMed] [Google Scholar]
- 32.Panchanathan K, Boyd R. Indirect reciprocity can stabilize cooperation without the second-order free-riding problem. Nature. 2004;432:499–502. doi: 10.1038/nature02978. [DOI] [PubMed] [Google Scholar]
- 33.Eldakar OT, Wilson DS, O'Gorman R. Emotions and actions associated with altruistic helping and punishment. Evol Psychol. 2006;4:274–286. [Google Scholar]
- 34.Wilson DS, Dugatkin LA. Altruism. In: Keller EF, Lloyd EA, editors. Key Words in Evolutionary Biology. Cambridge, MA: Harvard Univ Press; 1992. pp. 29–33. [Google Scholar]
- 35.Wilson DS. What is wrong with absolute individual fitness? Trends Ecol Evol. 2004;19:245–248. doi: 10.1016/j.tree.2004.02.008. [DOI] [PubMed] [Google Scholar]
- 36.West SA, Griffin AS, Gardner A. Social semantics: How useful has group selection been? J Evol Biol. 2007;21:405–420. doi: 10.1111/j.1420-9101.2006.01258.x. [DOI] [PubMed] [Google Scholar]
- 37.West SA, Griffin AS, Gardner A. Altruism, cooperation, mutualism, strong reciprocity, and group selection. J Evol Biol. 2008;20:415–432. doi: 10.1111/j.1420-9101.2006.01258.x. [DOI] [PubMed] [Google Scholar]
- 38.Wilson DS. Social semantics: Toward a genuine pluralism in the study of social behaviour. J Evol Biol. 2008;21:368–373. doi: 10.1111/j.1420-9101.2007.01396.x. [DOI] [PubMed] [Google Scholar]
- 39.Saigo T, Tsuchida K. Queen and worker policing in monogynous and monandrous colonies of a primitively eusocial wasp. Proc R Soc London Ser B. 2004;271:S509–S512. doi: 10.1098/rsbl.2004.0238. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Emery NJ, Clayton NS. Effects of experience and social context on prospective caching strategies by scrub jays. Nature. 2001;414:443–446. doi: 10.1038/35106560. [DOI] [PubMed] [Google Scholar]
- 41.Gintis H. Game Theory Evolving. Princeton, NJ: Princeton Univ Press; 2000. [Google Scholar]
- 42.Bisson TN. The feudal revolution. Past Present. 1994;142:6–42. [Google Scholar]