Abstract
The ways in which natural selection can allow the proliferation of cooperative behavior have long been seen as a central problem in evolutionary biology. Most of the literature has focused on interactions between pairs of individuals and on linear public goods games. This emphasis has led to the conclusion that even modest levels of migration would pose a serious problem to the spread of altruism through population viscosity in group structured populations. Here we challenge this conclusion, by analyzing evolution in a framework which allows for complex group interactions and random migration among groups. We conclude that contingent forms of strong altruism that benefits equally all group members, regardless of kinship and without greenbeard effects, can spread when rare under realistic group sizes and levels of migration, due to the assortment of genes resulting only from population viscosity. Our analysis combines group-centric and gene-centric perspectives, allows for arbitrary strength of selection, and leads to extensions of Hamilton’s rule for the spread of altruistic alleles, applicable under broad conditions.
Introduction
The evolution of cooperation and altruism are fundamental scientific challenges highlighted by their role in the major transitions in life’s history, when natural selection acted simultaneously on several competing levels [1]–[8]. In this context, the relevance of basic concepts, including group selection and Hamilton’s rule remain controversial [9]–[18]. Here we address these problems by studying a framework for evolution in group structured populations that incorporates inter- and intra-group competition and migration. Combining group-centric with gene-centric perspectives in a constructive group/kin selection approach, we build methodology that allows for the analysis of arbitrary non-linear fitness functions, resulting from complex multi-individual interactions across life cycles. We obtain the conditions for a rare social allele to invade the population. This is obtained in a mathematically rigorous way, by analyzing the stability of the equilibrium in which this allele is absent. This analysis is done for arbitrary strength of selection, but when selection is weak and groups are large the condition for invasion simplifies significantly into a form that is easy to apply and provides substantial intuition. In the case of linear fitness functions, the condition for invasion is identical to Hamilton’s rule, and it is natural to regard the more general non-linear cases as generalizations of that rule. Our results also show that one of the most widely used approaches to analyzing kin selection models, [19], [7](condition (6.7)), and [16](Box 6), yields incorrect results in some biologically relevant situations.
Our results reveal conditions that are biologically realistic and under which altruism can evolve when rare even with modest genetic relatedness in groups, without kin recognition or greenbeard effects (the altruistic acts benefit all group members equally). In this way we challenge a common understanding according to which inter-group selection favoring altruism could only override intra-group selection favoring selfishness under exceptional conditions, namely small group size and very low migration rates [13], [17], [20]–[28]. The issue is illustrated by quoting from the recent review [13], p.12: “For group selection to overcome selection within groups, less than one succesfully reproducing migrant may be exchanged per two populations per population lifetime.” The fact that this idea is still incorporated in mainstream evolutionary biology is illustrated by the theoretical considerations on pp. 11, 12 of the influential recent textbook [29], where one reads: “[…] for group selection to work populations must be isolated, such that individuals cannot migrate among them. Otherwise there would be nothing to stop the migration of selfish individuals […]. Once selfish individuals arrive, their genotype would soon spread. In nature groups are rarely isolated sufficiently to prevent such immigration.”
We identify the emphasis on linear public goods games in the literature, including most of the papers quoted in the previous paragraph, as having supported this belief in exceptionality. For these games, the condition for altruism to proliferate is Hamilton’s classical rule, requiring the relatedness in groups to exceed the ratio of cost to benefit for each altruistic act. Therefore, in this setting, altruism can only spread when either relatedness is large, or the cost/benefit ratio is low. And since relatedness is often low [28] (Table 8.3), [30], [31] (Tables 6.4 and 6.5), [32], [33] (Table 4.9), exceptionally low cost/benefit ratios are required, as observed for instance in [34]. In the absence of reasons to expect low cost/benefit ratios to be common, researchers whose intuition is molded by the linear public goods game are naturally led to the belief that altruism could only spread through population viscosity in exceptional cases.
In contrast, we show that for iterated public goods games, in which altruists cooperate or not in each round based on previous outcomes [35], [36], altruism can spread even when cost/benefit ratios for each altruistic act are reasonably high, groups are large, selection is weak and migration rates are substantially larger than the inverse of group size (high gene flow, realistically low relatedness). This result corrects [36], who predicted that large group size would not allow cooperation to spread when rare in this model. For species that live in groups, several vital group activities repeat themselves periodically and behavior changes as feedback is obtained from previous iterations. The iterated public goods game that we study is therefore often more realistic than a simple one shot public goods game. A proper analysis of this model fills therefore an important gap in the literature. (To illustrate the fact that the incorrect conclusion from [36] is still incorporated in the literature, we refer the reader to, e.g., Section 4.5 of the textbook [37] and p. 359 of the very recent textbook [29]).
To obtain our result in the case of weak selection and large groups we show that in the absence of selection, when groups are large, the fraction of group members that are close relatives of a randomly chosen individual has a non-Gaussian distribution with a fatter tail. As a consequence, even when altruistic alleles are rare in the population, they have a significant probability of concentrating in some groups, accruing substantial reproductive gains through multi-individual synergy.
The Two-Level Fisher-Wright Framework
When members of a species live in groups, their reproductive success depends on the behavior of all group members. More efficient groups may grow faster and split, outcompeting the less efficient ones that die out. On the other hand, individuals may free ride on the cooperation of other members of their group, and in this way outcompete them. This picture is further complicated by migration among groups. The Two-level Fisher-Wright framework with selection and migration (2lFW) captures all these elements, in a simplified fashion. (In the last two paragraphs of this section we explain how it relates to the trait group framework and budding-viscosity models.) In 2lFW haploid individuals live in a large number of groups of size , and are of two genetically determined phenotypic types, A or N. Generations do not overlap, reproduction is asexual and the type is inherited by the offspring (mutations will be considered briefly later). The relative fitness () of a type A, and that of a type N, in a group that has types A, are, respectively, and with the convention that , i.e., . The quantities and represent life-cycle payoffs derived from behavior, physiology, etc. The parameter indicates the strength of selection. Figure 1 describes the creation of a new generation in the 2lFW through inter- and intra- group competition, followed by migration at rate .
Cases in which types A behave in some altruistic fashion are of particular interest [38]. Most of the literature concerns the very special case of a linear public goods game (PG), defined by
with , in which each type A cooperates, at a cost to herself, providing a benefit shared by the other members of her group. The need to consider more complex intra-group interactions and non-linear payoff functions is, nevertheless, well known [10], [14], [30], [35], [36], [39]–[49]. Non-linearities appear naturally whenever activities involve many group members simultaneously. They result from threshold phenomena, increasing returns to scale, saturation, etc. For instance, to hunt large prey may require a large minimum number of hunters, the likelihood of success may first increase rapidly with the number of hunters, but it may plateau when this number becomes very large. Allowing for the analysis of such synergistic multi-individual interactions and activities is a central feature of our approach, distinguishing it from theoretical frameworks based on pairwise interactions, or single actors benefiting a group [4], [50], [51].
The 2lFW framework can be seen as a generalization of the trait-group framework (see Sec. 2.3.2 of [5]), which corresponds to the case . One can interpret as a viscosity, or an assortment parameter. Because migration is completely random in 2lFW, this assortment represents a worst case scenario, abstracting away additional assortment caused by kin recognition, greenbeard effects, selective acceptance of migrants, joint migration of individuals, etc. We notice that, according to [52], the 2lFW can be classified as a Type II group selection model, as the intergroup competition component qualifies as an explicit group level event that is absent in a trait-group model. It is well known [35], [36], [43] that even when non-linearities in fitness functions allow for coexistence of cooperators and defectors. But under the strong altruism condition (meaning that each type A would be better off mutating into a type N), this is not the case [38], [53]. One of our goals is to determine the level of migration compatible with invasion by rare strong altruists.
The model with the population structure of 2lFW and PG payoffs was studied in [54] with the name “budding-viscosity model”. But, as the authors explained on p.1714, this name may not be appropriate in some applications. We propose the name “two-level Fisher-Wright framework with selection and migration” to denote a mathematical structure applicable to several demographic/reproductive biological systems, including group fissioning and budding, and admitting arbitrary payoff functions. This name also emphasizes that selection occurs at the individual and the group levels. In [54] motivation for the population structure is discussed in detail, references to previous work leading to it are provided (notably [55]) and an explanation of how it allows viscosity to increase relatedness without increasing in the same measure the competition among relatives (as is the case in an island model with inelastic group size [56]) is presented. The analysis in [54] relied on the assumptions of PG payoffs and weak selection (). The paper [57] provides an alternative analysis, which depends on the same strong assumptions, in the section called “typical kin selection model”. This paper was a response to [58], where group selection was argued to be an important mechanism for the evolution of cooperation, and a multilevel selection model based on Moran’s model was introduced. In [57] it was argued that kin selection is a better tool for studying evolution in group structured populations. In the context of this debate, our analysis of 2lFW with non-linear fitness functions highlights the importance of combining group-centric with gene-centric perspectives, and shows that group selection can be an important force in evolution under realistic conditions. It also shows that mathematically rigorous analysis can be carried out, to a large extent, even when selection is strong, fitness functions are non-linear and migration rates are arbitrary. And it shows that one has to be very careful in applying mathematically non-rigorous methodology, as it can produce substantially incorrect results, even when selection is weak. We will see this when we observe that methods based on the approximation of regression coefficients by partial derivatives lead to such incorrect results in important examples.
Results
A Basic Example: Iterated Public Goods Game
Non-linearities in life-cycle payoffs can result from activities repeating themselves during a lifetime, and behavior being contingent on previous outcomes. A basic example is the iterated public goods game (IPG) [35], [36]. In IPG a PG is repeated an average of times in a life-cycle. We will suppose that types N never cooperate, while types A cooperate in the first round and later cooperate only if at least a fraction of group members cooperated in the previous round. The payoffs (see Figure S1 in File S1) are, therefore, given by
for constants , and ( and are costs and benefits in each iteration). Mathematically, this model generalizes the iterated prisoner dilemma and tit-for-tat, from the dyadic setting of [59] and [60] to the multi-individual setting. But while direct or indirect reciprocity requires the identification of individuals in the group, this is not the case here. The behavior of types A in the IPG can be triggered by individuals simply discontinuing cooperative behavior when previous cooperation produced negative feedback to them, for instance, when they received a negative payoff. In other words, allele A can predispose individuals to cooperate, but as they do it and obtain feedback from that behavior, they may continue it or discontinue it. The IPG is in this sense closely related to generalized reciprocity mechanisms [61], [62] with low cognitive requirements. (In generalized reciprocity models individuals interact in pairs, and generalized reciprocators help any other member of the group, but only in the first iteration, or when they were helped by some group member in the previous iteration. In contrast, in the IPG the actions involve several individuals simultaneously, but as in generalized reciprocity, individuals react to their own previous experience, without having to remember who did what.) Negative feedback from cooperation should occur if the fraction of group members that cooperated was less than , but not if it was larger than that threshold, since in the former case the payoff to a cooperator is negative, while in the latter case it is positive. This gives a special role to the value .
When are types A altruistic in the IPG? There is more than one way in which the concept of altruism in the context of the trait-group framework has been defined [38]. These different definitions carry over to the 2lFW. A particularly simple concept is called in [38] the “multilevel interpretation” of altruism. That definition requires the two conditions , for all , and increasing in . The first one means that types A are always worse off than types N in the same group, and the second one means that the more types A in a group, the better for the group. Both conditions are clearly always satisfied in the IPG.
There are nevertheless good arguments for considering other definitions of altruism [38]. A particularly appealing definition is called “focal-complement interpretation” in [38], and is often known as “mutation condition”, or “strong altruism”. Suppose that a type N mutated into a type A, everything else remaining unchanged. Would this cause a decrease in the fitness of the mutant? Would it cause an increase in the average fitness of the other members of the mutant’s group? Since the average fitness of the members of the mutant’s group increases with the mutation, the answer to the second question will be affirmative whenever the answer to the first question is affirmative. Therefore the condition for types A to be strongly altruistic is that for all . In File S1 (Section S8, last subsection) this condition is shown to be satisfied when and to fail otherwise.
To sum up, if , the behavior of types A is altruistic in the strong sense that each type A individual would increase its fitness if it behaved as a type N, everything else being equal, i.e., . Moreover, types N always free ride and have greater fitness than types A in the same group, regardless of the values of and , i.e., .
Figure 2 displays a detailed analysis of some instances of the IPG, giving conditions for allele A to spread when rare (In this figure, , so that, in particular, types A are strongly altruistic). For many species that live and interact in groups for many years, several vital activities, including collective hunting and food sharing, can repeat themselves hundreds or thousands of times in a life-cycle, giving plausibility to the values of in Figure 2. The assumption that individuals discontinue behavior after a single unsuccessful participation is a simplification. When this is not a realistic assumption, we can, however, interpret the parameter as the ratio between the typical number of repetitions of the activity and the typical number of unsuccessful attempts before cooperation is discontinued by a type A.
Panel C, in which selection is weak and groups are large, shows two important contrasting results. When , and the IPG is identical to the PG, allele A can only invade under Hamilton’s condition . But as increases, the level of relatedness needed for invasion drops substantially, so that for modest values of , allele A can invade under significantly lower than , compatible with levels observed in several species, including humans [28] (Table 8.3), [30], [31] (Tables 6.4 and 6.5), [32], [33] (Table 4.9). The corresponding number of migrants per group per generation, , can be of the order of 10. Further examples showing the spread of altruism and cooperation under high levels of gene flow and low levels of relatedness are provided in Figure S5, S6, S7 and S8, S20 and S21 in File S1.
The 2-player iterated prisoner dilemma has been analysed in detail in the literature, as reviewed, for instance in Chapter 4 of [37]. In this setting, when types N are defectors and types A play tit-for-tat, types A will not be able to invade when rare, if assortment is random (because then they are typically paired with defectors and lose in fitness to those by cooperating in the first iteration). A very modest level of relatedness is nevertheless sufficient to allow tit-for-tat to proliferate when rare, as computed in Section 4.1.2 of [37]. Our computations here provide similar results for the -player iterated public goods game and correct the computations in Section 4.5.1 of that text, which had indicated that types A would require very high levels of relatedness to invade. In File S1 (Section S8, next to last subsection), we explain in detail what assumption in [36], reproduced in [37], led to that incorrect conclusion.
It is important to also emphasize a relevant difference between tit-for-tat in 2-player iterated prisoner’s dilemmas and types A with in -player public goods games (as in Figure 2). The former can proliferate under random assortment provided that initially they are not very rare. This is so because, when common, tit-for-tat is not altruistic; cooperation in each iteration assures continuation of cooperation, and in the long run benefits the cooperator. In contrast, the latter is strongly altruistic in the sense that and therefore never proliferates under random assortment ([53], reviewed in [38]). In other words, while 2-player reciprocity is sometimes not altruistic, but rather cooperative, the behavior of types A that we are studying in Figure 2, is genuinely altruistic rather than simply cooperative.
Condition for Invasion Under Strong Selection
To analyze the 2lFW, denote by , the fraction of groups in generation that have exactly types A. Denote by the frequency of types A in the population. The state of the population in generation is described by the vector , since . We will suppose that , so that, by the law of large numbers, evolves as a deterministic (non-linear) dynamical system in dimension . Here we will study its linearization close to the fixed point , with no types A. This means that we are restricting ourselves to the case in which , and studying the conditions for allele A to invade the population when rare. With the notation introduced in Fig. 1, we have then , where
where represents the binomial distribution with trials with probability of success and we use the standard Kronecker notation if and if . Matrix represents the production of groups in the new generation, through the two-level competition, prior to migration. Matrix represents the effect of types A migrating out of groups, and matrix represents the effect of these migrant types A joining groups that previous to migration had no types A. (Explanation for and : When , the migrant types A are a small fraction of the migrant population, and therefore each one is likely to settle in a different group that had no types A before migration. A group that had types A prior to migration will therefore have types A after migration with the probability given by above. To understand the form of now, note that such a group that had types A prior to migration will contribute an average of migrant types A, who will therefore produce that same number of groups with exactly one type each.)
A standard application of the Perron-Frobenius Theorem (See Section 2 of File S1) implies that when , we have, in good approximation, , where is a constant that depends on , is the leading eigenvalue of and is its corresponding left-eigenvector normalized as a probability vector. This means that, regardless of the initial distribution , with , demographics and natural selection drive towards multiples of , in what can be seen as self-organization of copies of A in the optimal stable way for them to spread. Once this has happened, grows at rate . Consequently, allele A will proliferate, when rare, if the viability condition holds, and it will vanish if (see Figure 2 and Sections S1 and S2 in File S1 for applications and further explanations, see also Figure S2 to S9 in File S1 for illustrations). When , even if allele A is initially absent, a small rate of mutation will introduce it, allowing it to then invade the population. In the terminology of evolutionary game theory (see, e.g., [28](Chapter 7)), phenotype N is an evolutionary stable strategy (ESS) when and N is not an ESS when .
The viability condition has a gene-centric (kin-selection) interpretation in terms of average (neighbor modulated) fitnesses. For this purpose, define . Then it is well known that , where and are the average fitnesses of types A and N, and is the average fitness of all individuals. If we choose a random type A, it will have probability proportional to of being in a group with exactly types A (Bayesian sampling bias, reviewed in Section S3 of File S1). Therefore . When , if we choose a random individual, it is likely to be in a group with no types A. Therefore, in good approximation, . Since is driven towards multiples of , we obtain.
(1) |
provided and (the error term is of order +, with ). The viability condition can also be stated as , in (1). It is important to observe that does not need to be monotone, and that is the proper condition for invasion only when, as in (1), one is considering the stationary regime, . (See Figures S10, S11, S12 and S13 in File S1 for illustrations of the onset of this stationary regime.)
Weak Selection
If selection is weak, i.e., , migration acts much faster than selection, providing a separation of time scales [3], [57], [63]–[65]. This allows us to replace in (1) with , obtained by assuming within error of order (see also Figure S15 and S16 in File S1). Defining now , allows us then to rewrite the neighbor modulated fitness relation (1) in the form
(2) |
(within error term of order ). Algebraic simplifications (presented in Section S5 of File S1) transform the eigenvalue equation for into the following equations for :
with the matrix given by
Matrix is a Markov transition matrix (i.e., and ) and is its invariant probability distribution (see Section S5 in File S1 and Figure S17 in File S1 for details). They have natural interpretations in terms of identity by descent (IBD) under neutral genetic drift, as we explain next when we provide a second, independent, derivation of (2). This derivation is gene-centered, and is more intuitive and simpler than the derivations in the previous section. But it relies heavily on the assumption that selection is weak.
Two individuals are said to be IBD if following their lineages back in time, they coalesce before a migration event affects either one (see Figure S14 in File S1 for an illustration of the concept). The separation of time scales implies that when selection acts, the demographic distribution is well approximated by that obtained in equilibrium with . This means that in good approximation , where is the equilibrium probability that in the group of a randomly chosen focal type A there are exactly types A (focal included). But because we are supposing that types A are rare, the only individuals that are type A in this group are those that are IBD to the focal, so that is also the probability that exactly individuals in this group are IBD to the focal. As in the derivation of (1), since types A are rare, we have and hence , which is (2).
Now, the probability that the focal is IBD to exactly other members of her group is if the focal is a migrant (probability ), while if she is not a migrant (probability ), then we have to consider how many individuals in her mother’s group were IBD to her mother. If, counting her mother, that number was (probability , assuming demographic equilibrium) then the probability that the focal is IBD to exactly other members of her group is equal to the probability that of the other members of her group, exactly are non-migrants who chose for mother one of the candidates (among possibilities) that were IBD to the focal’s mother (probability ). Combining these pieces, we have
This is exactly the same as the set of equations .
The IBD distribution contains all the relevant information about genetic relatedness in the groups, including and exceeding that given by the average relatedness between group members, (see Figure S18 in File S1), obtained from lineages, regression coefficients, or Wright’s statistics. (See Sections S3 and S5 in File S1 for a review on these alternative, equivalent, definitions of relatedness, and its computation.) Specifically, we can define as the probability that a second member chosen from the focal’s group is IBD to the focal and then obtain (from linearity of expected values) that is a linear function of ’s first moment. When is a non-linear function of , more information contained in , including its higher moments, is needed to decide whether in (2). (In [66] the second moment of was used to obtain conditions for invasion by rare greenbeards in an island population structure.) It is important to also stress that (2) can be easily used for applications in which even the knowledge of all the moments of (see [63]) would be cumbersome to apply, as for instance in the computation of the short horizontal red lines in Figure 2, Panels A and B.
Large Groups Under Weak Selection
The stationarity condition allows for a recursive computation of all the moments of (see Section S5 in File S1). These moments can then be used to show the powerful result that if is large and is small, then , when properly rescaled, is close to a beta distribution, with mean (see Section S6 and Figure S19 in File S1). In this case, if in addition to the assumption of weak selection, also is well approximated by , for some piecewise continuous function , then (2) takes the easy to apply form
(3) |
where . Equations (1) and (3) play complementary roles in the analysis of 2lFW. Both provide the condition for invasion by allele A; (1) holds in full generality, while (3) requires special assumptions (small , large ), but is computationally much simpler and provides a great deal of intuition, as we discuss next.
Equation (3) should be contrasted with what [36] predicted by supposing that the number of individuals in a group that are IBD to a focal individual would be well approximated by a binomial with attempts and probability of success. That would lead to a normal distribution, narrowly concentrated close to its mean , in place of the beta distribution above. Our result reveals a strong dependency structure among lineages, producing the beta distribution, with a standard deviation comparable to its mean, and a tail that decays slowly compared to a Gaussian distribution. As a consequence, fitness functions that are large only when the fraction of types A in a group is above a threshold value, as in the IPG, will allow for proliferation of types A under levels of relatedness substantially lower than that predicted under the assumption in [36] (for another example see also Section S7 and in File S1 and Figure S20 in File S1). We will refer to the fact that the fraction of group members that are IBD to a focal individual has a non-vanishing standard deviation, even when selection is weak and groups are large, as persistence of variability. This phenomenon poses a severe limitation to the applicability of covariance-regression methods in which regression of fitness on genotype is replaced with derivatives, as in [19], [16](Box 6), [7](condition (6.7)). Both the assumptions in [36], or in [19] applied to the IPG would have implied incorrectly that when selection is weak and groups are large, types A could only invade the population when rare if (these computations are presented in Sections S8 an S9 in File S1). In a companion paper [37] we show that methodologies in which one expresses the fitness of a focal individual in terms of partial derivatives with respect to the focal individual’s phenotype and the phenotype of the individuals with whom the focal interacts, as in [3], [7], [16], [19], [51], [57], require to be a linear function of .
For the PG, (2) and (3) clearly reduce to the well known . The same is also true for the more general (1), as was shown in [22], where in case of strong selection the relatedness depends on the payoff functions. (The derivation in [22] was based on the Price equation. An alternative derivation is provided in Section 4 of File S1.) In contrast, if we are under the conditions of (3) with , then
(4) |
where is the -th moment of the beta distribution.
For the IPG, , if , and , if . The viability condition derived from (3) can be analyzed in detail, by simple, but long, computations, presented in Section S8 in File S1 and illustrated by Figure S21–S26 in File S1. In the case , the viability condition reads . When is large, this yields the following approximation for the critical relatedness :
(5) |
If also , then
(6) |
The simplicity and transparency of (5) and (6) illustrate the power of (3), and Fig. 2 shows how well they compare to the more general, but less transparent (1). Note also how (5) and (6) provide a direct grasp on the effect of the number of repetitions in the game, and a nice comparison between the PG and the IPG. Both Figure 2 and (6) show that alleles that promote contingent cooperative behavior, which is discontinued when participation is low, can spread under levels of genetic relatedness () more than 5 times smaller than . This mechanism should, therefore, be seriously investigated as a possible route for the proliferation of altruistic/cooperative behavior.
Conclusions
Natural selection in group structured populations is best analyzed by a combination of group-centric and gene-centric perspectives and methods. Both shed light, carry intuition and provide computational power, in different ways. For instance, in this paper the analysis of invasion under strong selection focused on groups, while that under weak selection focused on genes. Our computations indicate the advantage of a pluralistic group-centric/gene-centric approach over views that regard one of these two approaches as being redundant, less informative, or counterproductive. Contrast, on one hand, with, e.g., [14], which to us seems excessively negative about the contributions of kin selection ideas. And contrast, on the other hand, with, e.g., Section 6.5 of [17], or the subsection on group selection in Chapter 15 of [29], which to us seem excessively negative about the contributions of group selection ideas).
Rigorous mathematical analysis of models is sometimes possible even when fitness functions are non-linear, selection is possibly strong, and the migration rate is arbitrary. And it is needed for assessing the validity of non-rigorous approaches, showing for instance that regression coefficients may be poorely approximated by partial derivatives, even when selection is weak. (Contrast with, e.g., [19](from (1) to (2)), [51]((3.4), (3.5)), [16](Box 6), [7](condition (6.7)).)
Hamilton’s condition for the spread of altruism, , should be complemented with more general rules (like those obtained by setting in (2), (3), or (4)) that extend it to non-linear fitness functions. In Section 9 of File S1, we compare these extensions of Hamilton’s condition to other extensions, based on covariance-regression formulas, like display (5) in [16]. We observe there that for the fundamental purpose of computing critical levels of relatedness needed for invasion, in term of parameters of the models, like in (5) and (6) above, covariance-regression formulas are incomplete in that they must be complemented with computations of the appropriate distribution of alleles in groups. And once these distributions are computed, our methods are much more direct and simpler.
Contingent forms of group altruism that are discontinued when participation is low can proliferate under biologically realistic conditions even in large groups. Their role in the spread of altruism should be empirically investigated. (Contrast with [36].)
Natural selection can promote traits that (in net terms over a full life-cycle) are costly to the actors and beneficial to the other members of their group, under demographic conditions that are not stringent. This can happen in large groups and with realistically high levels of gene flow, through population viscosity, without the need for kin recognition or greenbeard effects. Excessive focus on one-shot linear public goods games in the literature has obscured this fact. (Contrast with, e.g., [13], [17], [20]–[29].)
Supporting Information
Acknowledgments
The authors thank Rob Boyd for many hours of stimulating and informative conversations on the subjects in this paper. Thanks are given to Marek Biskup for assistance in exploring the nature of the distribution π. We are also grateful to Clark Barrett, Maciek Chudek, Daniel Fessler, Kevin Foster, Willem Frankenhuis, Bailey House, Laurent Lehmann, Glauco Machado, Sarah Mathew, Diogo Meyer, Cristina Moya, Peter Nonacs, Karthik Panchanathan, Susan Perry, Joan Silk, Jennifer Smith and Ming Xue for nice conversations and feedback on various aspects of this project and related subjects. We thank also the referees and the editor for their comments.
Funding Statement
This project was partially supported by CNPq, under grant 480476/2009-8 and by the Center for Natural and Artificial Information Processing Systems of the University of São Paulo (NAP CNAIPS-USP). No additional external funding has been received for this study. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Maynard Smith J, Szathmáry E (1995) The Major Transitions in Evolution. Oxford, UK: Oxford University Press.
- 2.Frank SA (1998) Foundations of Social Evolution. Princeton, NJ, USA.: Princeton University Press.
- 3.Rousset F (2004) Genetic Structure and Selection in Subdivided Populations. Princeton, NJ, USA: Princeton University Press.
- 4. Lehmann L, Keller L (2006) The evolution of cooperation and altruism - a general framework and a classification of models. J Evolution Biol 19: 1365–1378. [DOI] [PubMed] [Google Scholar]
- 5.Okasha S (2006) Evolution and the Levels of Selection. Oxford, UK: Oxford University Press.
- 6. Nowak MA (2006) Five rules for the evolution of cooperation. Science 314: 1560–1563. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Wenseleers T, Gardner A, Foster KR (2010) Social evolution theory: a review of methods and approaches. In: Szekely T, Moore AJ, Komdeur J, editors, Social Behaviour: Genes, Ecology and Evolution, Cambridge, UK: Cambridge University Press, Cambridge, UK. 132–158.
- 8.Bourke AF (2011) Principles of Social Evolution. Oxford, UK: Oxford University Press.
- 9. Wilson DS, Wilson EO (2007) Rethinking the theoretical foundation of sociobiology. Q Rev Biol 82: 327–348. [DOI] [PubMed] [Google Scholar]
- 10. West SA, Griffn AS, Gardner A (2007) Evolutionary explanations of cooperation. Curr Biol 17: R661–R672. [DOI] [PubMed] [Google Scholar]
- 11.Foster K (2009) A defense of sociobiology. In: Cold Spring Harbor Symposia on Quantitative Biology 74. Cold Spring Harbor Press, 403–418. [DOI] [PubMed]
- 12. Traulsen A (2009) Mathematics of kin- and group-selection: Formally equivalent? Evolution 64: 316–323. [DOI] [PubMed] [Google Scholar]
- 13. Leigh EG (2010) The group selection controversy. J Evolution Biol 23: 6–19. [DOI] [PubMed] [Google Scholar]
- 14. Nowak MA, Tarnita CE, Wilson EO (2010) The evolution of eusociality. Nature 466: 1057–1062. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15. Abbot P, et al. (2011) Inclusive fitness theory and eusociality. Nature 471: E1–E4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16. Gardner A, West S, Wild G (2011) The genetical theory of kin selection. J Evolution Biol 24: 1020–1043. [DOI] [PubMed] [Google Scholar]
- 17. West SA, Mouden CE, Gardner A (2011) Sixteen common misconceptions about the evolution of cooperation in humans. Evol Hum Behav 32: 231–262. [Google Scholar]
- 18. Simon B, Fletcher JA, Doebeli M (2012) Hamilton’s rule in multi-level selection models. J Theor Biol 299: 55–63. [DOI] [PubMed] [Google Scholar]
- 19. Taylor PD, Frank SA (1996) How to make a kin selection model. J Evolution Biol 180: 27–37. [DOI] [PubMed] [Google Scholar]
- 20. Maynard-Smith J (1964) Group selection and kin selection. Nature 201: 1145–7. [Google Scholar]
- 21.Williams GC (1966) Adaptation and Natural Selection. Princeton, NJ, USA: Princeton University Press.
- 22.Hamilton W (1975) Innate social aptitude of man: An approach from evolutionary genetics. In: Fox R, editor, ASA Studies 4: Biosocial Anthropology, London, UK: Malaby Press. 133–153.
- 23. Maynard-Smith J (1976) Group selection. Q Rev Biol 51: 277–283. [Google Scholar]
- 24. Aoki K (1982) A condition for group selection to prevail over counteracting individual selection. Evolution 36: 832–842. [DOI] [PubMed] [Google Scholar]
- 25. Crow JF, Aoki K (1982) Group selection for a polygenic behavioral trait: differential proliferation model. Proc Natl Acad Sci U S A 79: 2628–2631. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26. Kimura M (1983) Diffusion models of intergroup selection, with special reference to the evolution of altruistic character. Proc Natl Acad Sci U S A 80: 6317–6321. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Leigh EG (1983) When does the good of the group override the advantage of the individual? Proc Natl Acad Sci U S A : 2985–2989. [DOI] [PMC free article] [PubMed]
- 28.Maynard Smith J (1998) Evolutionary Genetics. Oxford, UK: Oxford University Press, second edition.
- 29.Davies NB, Krebs JR, West SA (2012) An Introduction to Behavioral Ecology. Oxford, UK: Wiley-Blackwell, fourth edition.
- 30.Bowles S, Gintis H (2011) A Cooperative Species: Human Reciprocity and its Evolution. Princeton, NJ, USA: Princeton University Press.
- 31.Hartl DL, Clark AG (2007) Principles of Population Genetics. Sunderland, MA, USA: Sinauer Associates, fourth edition edition. Especially Table 6.4, Chapter 6.
- 32. Bowles S (2006) Group competition, reproductive leveling, and the evolution of human altruism. Science 314: 1569–1572. [DOI] [PubMed] [Google Scholar]
- 33.Hamilton M (2009) Population Genetics. Oxford, UK: Wiley-Blackwell.
- 34.Refard D, Bergmiller T, Kümmerli R (2013) Altruism can evolve when relatedness is low: evidence from bacteria commiting suicide upon phage infection. Proc R Soc B 280. [DOI] [PMC free article] [PubMed]
- 35. Joshi N (1987) Evolution of cooperation by reciprocation within structured demes. J Genet 66: 69–84. [Google Scholar]
- 36. Boyd R, Richerson PJ (1988) The evolution of reciprocity in sizable groups. J Theor Biol 132: 337–356. [DOI] [PubMed] [Google Scholar]
- 37.McElreath R, Boyd R (2007) Mathematical Models of Social Evolution. Chicago: University of Chicago Press.
- 38. Kerr B, Godfrey-Smith P, Feldman MW (2004) What is altruism? Trends Ecol Evol 19: 135–140. [DOI] [PubMed] [Google Scholar]
- 39. Kokko H, Johnstone R, Clutton-Brock T (2001) The evolution of cooperative breeding through group augmentation. Proc R Soc B 268: 187–196. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40. Avilés L (2002) Solving the freeloaders paradox: genetic associations and frequency-dependence selection in the evolution of cooperation among non-relatives. Proc Natl Acad Sci USA 99: 14268–14273. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41. Hauert C, et al. (2006) Synergy and discounting of cooperation in social dilemmas. J Theor Biol 239: 195–202. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42. van Veelen M (2009) Group selection, kin selection, altruism and cooperation: when inclusive fitness is right and when it can be wrong. J Theor Biol 259: 589–600. [DOI] [PubMed] [Google Scholar]
- 43. Archetti M, Scheuring I (2010) Coexistence of cooperation and defection in public goods games. Evolution 65: 1140–11148. [DOI] [PubMed] [Google Scholar]
- 44. Gore J, Youk H, van Oudenaarden A (2009) Snowdrift game dynamics and facultative cheating in yeast. Nature 459: 253–256. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45. Chuang JS, Rivoire O, Leibler S (2010) Cooperation and Hamilton’s rule in a simple microbial system. Mol Syst Biol 6: 1–7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46. Smith J, Dyken JDV, Zee PC (2010) Generalization of Hamilton’s rule for the evolution of microbial cooperation. Science 328: 1700–1703. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47. Heinsohn R, Packer C (1995) Complex cooperative strategies in group-territorial african lions. Science 269: 1260–1262. [DOI] [PubMed] [Google Scholar]
- 48. Bergmüller R, et al. (2007) Integrating cooperative breeding into theoretical concepts of cooperation. Behav Process 76: 61–72. [DOI] [PubMed] [Google Scholar]
- 49. Boyd R, Gintis H, Bowles S (2010) Coordinated punishment of defectors sustains cooperation and can proliferate when rare. Science 328: 617–620. [DOI] [PubMed] [Google Scholar]
- 50. Queller DC (1992) Quantitative genetics, inclusive fitness, and group selection. Am Nat 139: 540–558. [Google Scholar]
- 51. Lehmann L, Rousset F (2010) How life history and demography promote or inhibit the evolution of helping behaviors. Phil Trans R Soc B 365: 2599–2617. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52. Mayo D, Gilinsky N (1987) Models of group selection. Philosophy of Science 54: 515–538. [Google Scholar]
- 53. Matessi C, Jayakar S (1976) Conditions for the evolution of altruism under darwinian selection. Theor Popul Biol 9: 360–387. [DOI] [PubMed] [Google Scholar]
- 54. Gardner A, West S (2006) Demography, altruism and the benefits of budding. J Evolution Biol 19: 1707–1716. [DOI] [PubMed] [Google Scholar]
- 55. Goodnight KF (1992) The effect of stochastic variation on kin selection in a budding-viscous population. Am Nat 140: 1028–1040. [DOI] [PubMed] [Google Scholar]
- 56. Taylor PD (1992) Altruism in viscous populations - an inclusive fitness model. Evol Ecol 6: 352–356. [Google Scholar]
- 57. Lehmann L, et al. (2007) Group selection and kin selection: Two concepts but one process. Proc Natl Acad Sci U S A 104: 6736–6739. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58. Trausen A, Nowak MA (2006) Evolution of cooperation by multilevel selection. Proc Natl Acad Sci U S A 103: 10952–10955. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59. Trivers RL (1971) The evolution of reciprocal altruism. Q Rev Biol 46: 35–57. [Google Scholar]
- 60. Axelrod R, Hamilton WD (1981) The evolution of cooperation. Science 211: 1390–1396. [DOI] [PubMed] [Google Scholar]
- 61. Rutte C, Taborsky M (2007) Generalized reciprocity in rats. PLoS Biology 5: 1421–1425. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62. Rankin DJ, Taborsky M (2009) Assortment and the evolution of generalized reciprocity. Evolution 67: 1913–1922. [DOI] [PubMed] [Google Scholar]
- 63. Roze D, Rousset F (2008) Multilocus models in infinite island modedel of population structure. Theor Popul Biol 73: 529–542. [DOI] [PubMed] [Google Scholar]
- 64. Lessard S (2009) Diffusion approximations for one-locus multi-allele kin selection, mutation and random drift in group-structured populations: a unifying approach to selection models in population genetics. J Math Biol 59: 659–696. [DOI] [PubMed] [Google Scholar]
- 65. Ohtsuki H (2010) Evolutionary games in Wright’s Island model: kin selection meets evolutionary game theory. Evolution 64: 3344–3353. [DOI] [PubMed] [Google Scholar]
- 66. Gardner A, West S (2009) Greenbeards. Evolution 64: 25–38. [DOI] [PubMed] [Google Scholar]
- 67.Schonmann RH, Boyd R, Vicente R (2012) The Taylor-Frank method cannot be applied to some biologically important, continuous fitness functions. ArXiv:1209.1412v1.
- 68. Kerr B, Godfrey-Smith P (2002) Individual and multi-level perspective on selection in structured populations. Biol Phil 17: 477–517. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.