Abstract
Experimental evidence suggests that many people are willing to deviate from materially maximizing strategies to punish unfair behavior. Even though little is known about the origins of such fairness preferences, it has been suggested that they have deep evolutionary roots and that they are crucial for maintaining and understanding cooperation among non-kin. Here we report the results of an ultimatum game, played for real monetary stakes, using twins recruited from the population-based Swedish Twin Registry as our subject pool. Employing standard structural equation modeling techniques, we estimate that >40% of the variation in subjects' rejection behavior is explained by additive genetic effects. Our estimates also suggest a very modest role for common environment as a source of phenotypic variation. Based on these findings, we argue that any attempt to explain observed ultimatum bargaining game behavior that ignores this genetic influence is incomplete.
Keywords: cooperation, experimental economics
It is frequently pointed out that humans exhibit unusually high rates of cooperation among non-kin (1), and it has further been suggested that one important factor for enhancing cooperation is that humans appear willing to forego material payoffs to punish unfair behavior (2–4). Such fairness preferences have been widely studied by using experimental games, in particular the ultimatum game (5–7).
In the ultimatum game, two subjects are assigned the role of either proposer or receiver, and then they bargain over a sum of money (the “cake”). The proposer makes an offer on how to divide the cake. If the receiver accepts the proposer's offer, the players are paid accordingly, whereas if the offer is rejected, both players receive a zero payoff. In a one-shot game, rational and money-maximizing responders should accept any positive offer because the alternative is a zero payoff. Two stylized facts about responder behavior emerge from the ultimatum game literature: first, that unfair offers are often rejected and second, that the acceptance threshold varies substantially between individuals (5, 6). The average responder behavior has been shown to be relatively stable across Western cultures (8), whereas more variation has been observed among non-Western small-scale societies (9).
Although there is a voluminous literature discussing the cultural and evolutionary origins of observed fairness preferences, the relative social and genetic contributions have hitherto been left unexplored. In this work, we use the classical twin design to estimate the heritability of the propensity to reject unfair offers in the ultimatum bargaining game. In doing so, we not only provide the first decomposition of the social and genetic contributions to ultimatum game rejection behavior but also to behavior in experimental games in general. The virtue of the twin design is that by comparing monozygotic (MZ) twins, who share the same set of genes, and dizygotic (DZ) twins, whose genes are imperfectly correlated, we can estimate the proportion of variance in phenotype due to genetic, shared, and nonshared environmental effects (10).
Results
In Fig. 1 we report the distribution of acceptance thresholds for MZ and DZ twins. As can be seen, the distributions are very similar, and we cannot reject the null hypothesis of identical distributions (χ2 test; χ2 value = 4.292, df = 5, n = 653, P = 0.491).¶ The average acceptance threshold is 32.68 SEK, implying that on average a responder demands ≈33% of the cake to accept the proposer's offer in the ultimatum game.‖ Fig. 2 illustrates the correlation in acceptance thresholds within MZ and DZ twin pairs. The Spearman rank correlation is 0.39 (95% confidence interval, 0.26–0.49) for MZ twin pairs and −0.04 (95% confidence interval, −0.25 to 0.18) for DZ twin pairs. The difference in the correlation coefficients is highly significant (t test; t = 3.212, P < 0.01, two-sided) and thus provides strong evidence in favor of a genetic effect on ultimatum game responder behavior.
We also use a standard threshold model to decompose phenotypic variation into additive genetic effects, common environmental effects, and nonshared environmental effects. The model assumes that there is an underlying, continuous and normal distribution of fairness preferences, with arbitrary cutoffs (thresholds) observed by the researcher. In Table 1 we present maximum likelihood estimates of the model parameters as well as its nested submodels.** Additive genetic effects are estimated to account for 42% of the variation in the best-fitting model, and this estimate is significantly different from zero (P < 0.01). The point estimate of the effect of common environment is zero, and its confidence interval has an upper boundary at 21%, suggesting that common environmental influences are at most a moderately important source of variation. The remaining variation is accounted for by nonshared environment. The estimate of nonshared environment also includes any measurement error.
Table 1.
Model | χ2 value | P | df | Mean squared error | A (genetic contribution) | C (common environmental contribution) | E (unique environmental contribution) |
---|---|---|---|---|---|---|---|
ACE | 79.44 | 0.08 | 63 | 0.02 | 0.42 (0.17–0.54) |
0.00(0.00–0.21) | 0.58(0.46–0.72) |
AE | 79.44 | 0.09 | 64 | 0.02 | 0.42 (0.28–0.54) |
0.58(0.46–0.72) | |
CE | 86.96 | 0.01 | 64 | 0.04 | 0.32(0.19–0.4) | 0.68(0.56–0.81) | |
E | 109.30 | 0.00 | 65 | 0.04 | 1 |
The genetic contribution (A) is highly significant in the full ACE model with a point estimate of 42% of the variation. The common environmental contribution (C) has a point estimate of zero and is not significant. The AE submodel is the best-fitting model; comparing this model with the ACE model, we cannot reject the null hypothesis of a zero effect of common environment (χ 2 test; χ 2 value = 0, df = 1, P = 1). The ACE model significantly outperforms the CE submodel (χ 2 test; χ 2 value = 7.52, df = 1, P < 0.01) and the E submodel (χ 2 test; χ 2 value = 29.88, df = 2, P < 0.001).
Finally, we consider the results stratified by sex because pooling may not always be appropriate. Our data set is fairly unbalanced with respect to sex, approximately three-quarters of subjects being women. There was no evidence that pooling by sex is inappropriate because we could not reject the null hypothesis that the distribution of the acceptance threshold is the same for men and women (χ2 test; χ2 value = 2.653, df = 5, n = 653, P = 0.778) (11). When we estimate separate models for men and women, the estimates of additive genetic effects were practically identical: 0.412 for women and 0.435 for men.
Discussion
This work has demonstrated that genetic influences are important determinants of rejection behavior in the ultimatum game. In our best-fitting model, additive genetic effects account for 42% of the observed variation in responder behavior, and our point estimate for common environment is zero. However, note that the estimate for the genetic effect should be considered a lower boundary because it presumes perfect reliability in the measurement of responder behavior. If there is noise in eliciting acceptance thresholds, the estimate of additive genetic effects will be downward-biased.
Our finding of substantial genetic effects on ultimatum game rejection behavior is consistent with previous research in behavioral genetics and neuroscience. For instance, survey based studies repeatedly find sizeable genetic effects on a wide range of economically relevant social attitude variables such as personality and political preferences (15–18). Furthermore, recent studies have shown that the responder stage in the ultimatum game is associated with increased activation in the dorsolateral prefrontal cortex as well as responders' circulating testosterone levels (19–21). The dorsolateral prefrontal cortex is a brain region whose structure is under pronounced genetic influence (22, 23), as are individual testosterone levels (24).
The etiology of fairness preferences has been intensely debated. Some authors have argued that the willingness to engage in costly punishment reflects fundamental and universal fairness preferences that evolved through a process of gene–culture coevolution in early modern Homo sapiens, preferences that are considered crucial for maintaining cooperation among non-kin (3, 25). Others have argued that indirect reciprocity models based on an evolved psychology for reputation management provide a more parsimonious explanation (1, 26–31). Although our results are consistent with an evolutionary origin for fairness preferences, it is important to remember that heritability measures the genetically determined variation around some average behavior. Hence, it does not provide us with any direct evidence with regard to the evolutionary dynamics that brought it about.
However, the fact that there seems to be a substantial genetically determined heterogeneity in ultimatum game rejection behavior is an interesting finding in and of itself. First of all, it suggests that economic theory and policy need to address the potential importance of genetic influences on economic preferences, behavior, and outcomes (32). It also suggests that the current debate on the evolutionary origins of ultimatum game rejection behavior, and the preferences it proxies, should perhaps be brought into the broader context of the evolution of variation in personality. If we observe a mean phenotypic expression in behavior with substantial genetic variation, then any evolutionary model of its origin must not only account for average phenotype but also phenotypic variation.
In humans as well as in other species there is substantial genetically determined variation for a great number of personality traits (33–35). Unfortunately, this variation is not well understood because the observed patterns seldom correspond to the simplest evolutionary genetic models of fixation. Many authors have therefore resorted to the default explanation that observed variation is nonadaptive (33). This is, however, not necessarily the case, and a number of recent papers have examined these questions in more detail. For instance, Dall et al. (33) emphasize the combination of frequency-dependent selection and state-dependent behavioral specialization as a possible source of adaptive variation, whereas Penke et al. (35) argue that balancing selection by environmental heterogeneity explains individual variation in personality.††
Finally, on a more general note, our findings suggest that it is time to take seriously the proposition that experimental behavior may be substantially heritable, which may explain why, despite ample experimental evidence, the origins of individual variation remain elusive, and most attempts to find theoretically appealing and empirically stable correlates to experimentally derived preferences have had only mixed success.
Materials and Methods
This work was undertaken in collaboration with the Swedish Twin Registry at Karolinska Institutet. The registry is the largest twin registry in the world and has been described in detail elsewhere (36). Same-sex twins born 1960–1985 were solicited by e-mail and recruited in all major Swedish cities through the summer and fall of 2006. A condition for participation was that both twins in a pair be able to attend the same experimental session. In total, 658 individuals (71 DZ and 258 MZ pairs of twins) participated. Zygosity was assigned by questionnaire items that have been shown to have a reliability of up to 98% (36).
In the first stage of the experiment, all subjects played the role of proposer and were asked to divide 100 SEK (approximately $15) between themselves and a randomly selected anonymous counterpart not partaking in the same experimental session. In the second and final stage, all subjects played the role of responder and were once again matched with a randomly selected anonymous counterpart, different from that in stage 1, not partaking in the same session. We used the strategy method to extract acceptance thresholds (6). Each subject determined whether he or she would accept or reject every possible proposal in multiples of 10% before learning the actual proposal. This method allows for the recovery of the entire strategy of each participant. We then recorded the lowest offer that the responder indicated a willingness to accept in the range of offers between 0% and 50%. In this region, all subjects, except for the very few idiosyncratic responses referred to below, exhibit simple, monotonic behavior. The mean acceptance threshold is therefore uniquely defined.‡‡
Of the 658 participants, two failed to respond to the ultimatum game question and had to be dropped from the analysis. In addition, eight subjects provided inconsistent responses.§§ Of these inconsistent responses, five were clarified by an e-mail follow-up question (two subjects did not respond to the e-mail, and one subject wanted to keep the inconsistent answer). Thus, our dataset in the analysis consists of 653 individual observations (511 MZ twins and 142 DZ twins), and 324 complete twin pairs (253 MZ pairs and 71 DZ pairs).¶¶
Acknowledgments
We thank Terry Burnham, Ernst Fehr, Herbert Gintis, Erik Lindqvist, Moses Ndulu, Martin Nowak, Paul Schrimpf, Lucius Vinicius, and Robert Östling for helpful comments. This work was supported by the Jan Wallander and Tom Hedelius Foundation and the Swedish Research Council. The Swedish Twin Registry is also supported by grants from the Swedish Research Council, the Ministry for Higher Education, and AstraZeneca.
Abbreviations
- DZ
dizygotic
- MZ
monozygotic.
Footnotes
The authors declare no conflict of interest.
This article is a PNAS Direct Submission.
There is too little variation in the proposal stage to estimate model parameters with reasonable precision, and hence we focus on responder behavior. The equality of distributions of acceptance thresholds by zygosity was tested with a design-based independence test, which takes the correlation between twins into account. The χ2 statistic is adjusted by using the second-order correction of Rao and Scott (11).
To estimate the mean acceptance threshold, the acceptance threshold is set at the middle value of the acceptance threshold intervals.
The maximum likelihood estimation is implemented in Mx, a numerical optimizer for behavior genetics (12). We estimate a threshold model based on the response categories for the acceptance threshold in the experiment. The programming code for the threshold model we estimate is adapted from the digital scripts library at the University of Amsterdam (13). The algorithm that estimates confidence intervals for the parameters is explained in some detail by Neale and Miller (14).
One interpretation, suggested to us by a referee, is that the high heritability casts doubt on accounts of ultimatum rejections relying on notions of evolutionary disequilibrium. In light of our findings, it seems plausible that the trait has been under stabilizing selection in the post-Pleistocene era.
Previous experimental research has shown that it is fairly common to observe some subjects rejecting both low (<50%) and high (>50%) offers (37). In the parlance of experimental economics, these subjects are “hyperfair.” In our data, slightly fewer than one-third of the subjects exhibit such behavior.
For example, a response in which an individual accepts an offer of 10% but then rejects an offer of 20% is considered inconsistent because an acceptance threshold is not uniquely defined on the interval from 0 to 50%.
Furthermore, our results are robust to the manner in which we deal with these inconsistent responses. Dropping the five inconsistent responses clarified by e-mail has no discernible effect on the heritability estimates or their significance levels.
References
- 1.Nowak MA. Science. 2006;314:1560–1563. doi: 10.1126/science.1133755. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Fehr E, Gächter S. Nature. 2002;415:137–140. doi: 10.1038/415137a. [DOI] [PubMed] [Google Scholar]
- 3.Fehr E, Fischbacher U. Nature. 2003;425:785–791. doi: 10.1038/nature02043. [DOI] [PubMed] [Google Scholar]
- 4.de Quervain DJF, Fischbacher U, Treyer V, Schellhammer M, Schnyder U, Buck A, Fehr E. Science. 2004;305:1254–1258. doi: 10.1126/science.1100735. [DOI] [PubMed] [Google Scholar]
- 5.Fehr E, Schmidt K. Q J Econ. 1999;114:817–868. [Google Scholar]
- 6.Camerer CF. Behavioral Game Theory: Experiments in Strategic Interaction. Princeton: Princeton Univ Press; 2003. [Google Scholar]
- 7.Güth W, Schmittberger R, Schwarze B. J Econ Behav Organ. 1982;3:367–388. [Google Scholar]
- 8.Roth AE, Prasnikar V, Okuno-Fujiwara M, Shmuel Z. Am Econ Rev. 1991;81:1068–1095. [Google Scholar]
- 9.Henrich J, Boyd R, Bowles S, Camerer C, Fehr E, Gintis H. Foundations of Human Sociality: Economic Implications and Ethnographic Evidence from Fifteen Small-Scale Societies. New York: Oxford Univ Press; 2004. [Google Scholar]
- 10.Neale MC, Cardon LR. Methodology for Genetic Studies of Twins and Families. Dordrecht: Kluwer Academic Publishers; 1992. [Google Scholar]
- 11.Rao JNK, Scott AJ. Ann Stat. 1984;12:46–60. [Google Scholar]
- 12.Neale MC, Boker SM, Xie G, Maes HH. Mx: Statistical Modeling. Richmond, VA: VCU Department of Psychiatry; 2002. [Google Scholar]
- 13.Posthuma D, Boomsma DI. Behav Genet. 2005;35:499–505. doi: 10.1007/s10519-005-2791-5. [DOI] [PubMed] [Google Scholar]
- 14.Neale MC, Miller MB. Behav Genet. 1997;27:113–120. doi: 10.1023/a:1025681223921. [DOI] [PubMed] [Google Scholar]
- 15.Bouchard TJ, Jr, Lykken DT, McGue M, Segal NL, Tellegen A. Science. 1990;250:223–226. doi: 10.1126/science.2218526. [DOI] [PubMed] [Google Scholar]
- 16.Bouchard TJ, Jr, McGue M. J Neurobiol. 2003;54:4–45. doi: 10.1002/neu.10160. [DOI] [PubMed] [Google Scholar]
- 17.Rushton JP. Proc R Soc London Ser B. 2004;271:2583–2585. doi: 10.1098/rspb.2004.2941. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Alford J, Funk C, Hibbing JR. Am Pol Sci Rev. 2005;99:153–167. [Google Scholar]
- 19.Sanfey AG, Rilling JK, Aronson JA, Nystrom LE, Cohen JD. Science. 2003;300:1755–1758. doi: 10.1126/science.1082976. [DOI] [PubMed] [Google Scholar]
- 20.Knoch D, Pascual-Leone A, Meyer K, Treyer V, Fehr E. Science. 2006;314:829–832. doi: 10.1126/science.1129156. [DOI] [PubMed] [Google Scholar]
- 21.Burnham TC. Proc R London Soc Ser B. 2007;274:2327–2330. doi: 10.1098/rspb.2007.0546. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Gray JR, Thompson PM. Nat Rev Neurosci. 2004;5:471–492. doi: 10.1038/nrn1405. [DOI] [PubMed] [Google Scholar]
- 23.Toga AW, Thompson PM. Annu Rev Neurosci. 2005;28:1–23. doi: 10.1146/annurev.neuro.28.061604.135655. [DOI] [PubMed] [Google Scholar]
- 24.Harris JA, Vernon PA, Boomsma DI. Behav Gent. 1998;28:165–171. doi: 10.1023/a:1021466929053. [DOI] [PubMed] [Google Scholar]
- 25.Boyd R, Gintis H, Bowles S, Richerson PJ. Proc Natl Acad Sci USA. 2003;100:3531–3535. doi: 10.1073/pnas.0630443100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Gale J, Binmore KG, Samuelsson L. Game Econ Behav. 1995;8:56–90. [Google Scholar]
- 27.Nowak MA, Sigmund K. Nature. 1998;393:573–577. doi: 10.1038/31225. [DOI] [PubMed] [Google Scholar]
- 28.Nowak MA, Page KM, Sigmund K. Science. 2000;289:1773–1775. doi: 10.1126/science.289.5485.1773. [DOI] [PubMed] [Google Scholar]
- 29.Milinski M, Semmann D, Krambeck HJ. Nature. 2002;415:424–426. doi: 10.1038/415424a. [DOI] [PubMed] [Google Scholar]
- 30.Nowak MA, Sigmund K. Nature. 2005;437:1291–1298. doi: 10.1038/nature04131. [DOI] [PubMed] [Google Scholar]
- 31.Burnham TC, Johnson DDP. Anal Kritik. 2005;27:113–135. [Google Scholar]
- 32.Zizzo DJ. Cambridge J Econ. 2003;27:867–880. [Google Scholar]
- 33.Dall SRX, Houston AI, McNamara JM. Ecol Lett. 2004;7:734–739. [Google Scholar]
- 34.Nettle D. Am Psychol. 2006;61:622–631. doi: 10.1037/0003-066X.61.6.622. [DOI] [PubMed] [Google Scholar]
- 35.Penke L, Denissen JJA, Miller GF. Eur J Personality. 2007;21:549–587. [Google Scholar]
- 36.Lichtenstein P, Sullivan PF, Cnattingius S, Gatz M, Johansson S, Carlström E, Björk C, Svartengren M, Volk A, Klareskog L, et al. Twin Res Hum Genet. 2006;9:875–882. doi: 10.1375/183242706779462444. [DOI] [PubMed] [Google Scholar]
- 37.Gintis H, Bowles S, Boyd, Fehr E. Evol Hum Beh. 2003;24:153–172. [Google Scholar]