Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2013 Jul 22.
Published in final edited form as: Bull Math Biol. 2010 Dec 30;73(9):2201–2212. doi: 10.1007/s11538-010-9619-z

Bards, Poets, and Cliques: Frequency-Dependent Selection and the Evolution of Language Genes

Reed A Cartwright 1,
PMCID: PMC3718555  NIHMSID: NIHMS315516  PMID: 21191662

Abstract

The ability of humans to communicate via language is a complex, adapted phenotype, which undoubtedly has a recently evolved genetic component. However, the evolutionary dynamics of language-associated alleles are poorly understood. To improve our knowledge of such systems, a population-genetics model for language-associated genes is developed. (The model is general and applicable to social interactions other than communication.) When an allele arises that potentially improves the ability of individuals to communicate, it will experience positive frequency-dependent selection because its fitness will depend on how many other individuals communicate the same way. Consequently, new and rare alleles are selected against, posing a problem for the evolutionary origin of language. However, the model shows that if individuals form language-based cliques, then novel language-associated alleles can sweep through a population. Thus, the origin of language ability can be sufficiently explained by Darwinian processes operating on genetic diversity in a fi-nite population of human ancestors.

Keywords: Pairwise interaction, Language faculty, Assortative interaction, Dynamics, Stability, FOXP2

1 Introduction

Humans possess a unique and innate ability for complex communication called language, with several features of our anatomy appearing to have evolved for its use (Pinker and Jackendoff 2005; Nishimura et al. 2006; Corballis 2009). Given the large number of human features associated with language, it seems safe to assume that there are underlying genetic factors that have evolved at least in part for its production and use (Pinker and Bloom 1990). However, the literature concerning the evolutionary dynamics of such genes is surprisingly limited. Here I present a population-genetics model for the evolution of language-associated alleles and show how selection for improved communication can fix novel language-specific alleles. In contrast to previous predictions (Smith 2004; Smith and Kirby 2008), the model presented here finds that the obstacles posed by frequency-dependent selection can be overcome. I further argue that the interaction of selection with other evolutionary forces actually serves to increase the probability of fixation of language-specific genes. These results have broad implications for our understanding of the evolution of language ability, especially the demarcation problem: what aspects of language are specific to it and uniquely human (Hauser and Chomsky 2002; Pinker and Jackendoff 2005).

Complex language is a uniquely human ability. Although studies have found that certain species of animals possess the ability to learn rudimentary forms of language-like communication (Corballis 2009), their abilities are easily surpassed by very young children, who spontaneously acquire language through interacting with older humans (Darwin 1871; Hauser and Chomsky 2002; Pinker and Jackendoff 2005). Several parts of human anatomy appear to have evolved for the production and understanding of language. The most easily recognized are the face, mouth, and throat, which contain adaptations for the physical production of spoken language (Nishimura et al. 2006; Corballis 2009). Additionally, regions of the brain have been shown to be critical locations for cognition associated with language (Hagoort 2005; Corballis 2009). Given its complexity, there must be some genetic factors underlying language ability (Pinker and Bloom 1990), and these factors are likely to have evolved along the human lineage. FOXP2 is one such candidate because it is associated with deficits in language ability (Vargha-Khadem et al. 1995; Fisher et al. 1998; Lai et al. 2001; Fisher and Scharff 2009) and appears to have experienced adaptive selection in our lineage (Enard et al. 2002; Coop et al. 2008). Furthermore, several genes that are associated with auditory development and that show evidence of positive selection in the human lineage have been identified and are candidate genes for speech perception (Clark et al. 2003). Recently it was even proposed that specific ASPM and Microcephalin haplogroups were not only associated with tonal languages but actually biased populations to develop tonal languages (Dediu and Ladd 2007).

With little doubt, language—like all other complex adaptations—evolved gradually (Pinker and Bloom 1990; Pinker and Jackendoff 2005; Corballis 2009), and at various times during its evolutionary history, multiple language-ability phenotypes must have competed with one another in human populations. In these instances, some individuals communicated with an ancestral language ability and others with a derived language ability. Although each phenotype would acquire and use communication with different biases, communication between them would not necessarily be unintelligible.—Individuals with FOXP2-associated language disorders can still communicate with unaffected individuals (Vargha-Khadem et al. 1995).— During language acquisition, derived-phenotype individuals that are raised amongst ancestral-phenotype individuals would learn the existing ancestral language, but impose their own biases upon it, creating a distinctive idiolect. If there are other derived-phenotype individuals in the population, then their biases would be reinforced, and their idiolects could potentially merge into a new dialect.

Two well studied hypothesized transitions fall into this paradigm: gestures evolving into speech (Darwin 1871; Rizzolatti and Arbib 1998; Corballis 2009) and holistic communication into combinatorial language (Wray 1998; Nowak et al. 2000; Brighton 2002; Smith et al. 2003). In both instances, the ancestral form of communication was replaced by a derived—and fitter—form of communication.—Of course, “minor” transitions like improvements to the vocal tract or cognitive faculty would also fall into this paradigm.—Speech is more fit because it not only allows communication when individuals are not in direct sight of one another, but it also frees the hands for work (Corballis 2009). Likewise, combinatorial language (grammar, syntax, etc.) has been shown to offer at least two advantages over holistic language: it is cognitively more efficient when a wealth of ideas need to be communicated (von Humboldt 1836; Nowak and Krakauer 1999; Nowak et al. 2000) and produces more stable languages when children must acquire language from limited experience (poverty of input) (Brighton 2002; Smith et al. 2003). Unfortunately, this comes at a cost of increased risk for deception (Lachmann and Bergstrom 2004). Below I use a simple and intuitive population-genetics model to investigate the evolutionary dynamics of a novel language allele that produces a “fitter” language ability but arises in a population in which this fitness might not be able to be realized. While this model is described on the basis of language phenotypes, it can describe the evolutionary consequences of other types of social interaction.

2 Models and Analysis

Consider a population made up of “bards” and “poets,” where bards have the ancestral language-ability phenotype and poets have the derived phenotype. I do not propose that the phenotypic differences between bards and poets manifest in highly variable cultural aspects of language like verb-object versus object-verb orderings (Evans and Levinson 2009).—These have been shown to culturally evolve faster than biological adaptation can respond to (Christiansen and Chater 2008; Chater et al. 2009).—Rather they differ, perhaps slightly, in some fundamental cognitive or physical aspects of the language ability, e.g. in acquisition, production, or understanding of language.

Now consider that poets are able to communicate with one another more successfully than bards, such that a population of poets is likely to out compete a population of bards. In a mixed population, however, bards and poets interact with one another, and their successes also depend on how much their acquired languages overlap. In the most generous situation, poets that grow up amongst bards are able to completely learn the bardic language, while expanding upon it when communicating with other poets (complete overlap). Alternatively, the advantage of poets might come from thinking differently from bards, and although they might be able to learn the majority of the bardic language, their mastery of it could be limited, reducing the success of cross communication (partial overlap). If individuals' fitnesses are in part determined by their ability to communicate with other individuals, then bards and poets will experience positive frequency-dependent selection (Cavalli-Sforza and Feldman 1983).

While there is an enormous literature dealing with every variant of 2 × 2 interaction models in game theory and sociobiology, I will develop an explicitly genetic model in order to model directly the dynamics of language-associated alleles. I will assume that bards are homozygous for a recessive allele, BARD, and that poets have at least one copy of the correspondingly dominant allele, POET.—I also considered a model in which POET was recessive and a genic/haploid model that lacks dominance. They produce similar results, and I will note the differences where applicable.—The evolution of POET will be modeled via an extension of the dominant pairwise interaction model (Asmussen and Basnayake 1990; Asmussen et al. 2004). Pairwise interaction models derive an individual's fitness from its interactions with other individuals in the population. For each phenotype–phenotype interaction, there are partial-fitness coefficients for each of the actors. The total fitness of an individual is an average of their partial-fitness coefficients weighted by the frequency of their interaction with phenotypes in the population. The assumption of dominance in the model produces a two-phenotype system—ancestral and derived—that simplifies modeling. In the usage here, “phenotype” refers not to an individual's acquired language but rather the biases in its language ability. Poets are not born knowing a poetic language, but rather their biases result in them either acquiring it from other poets in the population or inventing it de novo, as described above. Additionally, I will not assume that poetic and bardic languages are mutually exclusive.

2.1 Complete Overlap Model

I will make the following assumptions: an infinitely large, unstructured population with random mating, no mutation, no migration, no genetic drift, and individuals interact randomly, such that the probability of each interaction type is proportional to phenotype frequencies. The impacts of violating the infinitely large and random mating assumptions will be discussed below. Let the partial fitness of two poets interacting be greater than two bards interacting, and let bards and poets receive equal fitnesses when interacting with one another—no phenotype can take advantage of the other phenotype. Let the partial fitness of poet–poet interactions be 1 + s, where s > 0 is the advantage that a poet receives from interacting with another poet. Sim-ilarly, let all other partial fitnesses be 1, i.e. complete overlap of language faculties. This produces the following partial-fitness matrix:

BardPoetBard11Poet11+s

Let p be the allele frequency of POET, and because it is dominant, poets occur with frequency p2 + 2p(1 − p) and bards with frequency (1 − p)2. The fitness of each phenotype is the average of its partial fitnesses weighted by its frequencies of interaction:

dpoet(p)=(1+s)[p2+2p(1p)]+1(1p)2dbard(p)=1

The marginal allele fitnesses are

wpoet(p)=dpoet(p)wbard(p)=pdpoet(p)+(1p)dbard(p)

The average fitness of the population is

w¯(p)=[p2+2p(1p)]dpoet(p)+(1p)2dbard(p)

and the allele frequency of the next generation is p′ = pwpoet(p)/(p).

An equilibrium exists whenever p′ = p. Solving this equation finds two trivial equilibria at p = 0 and p = 1 that always exist. An equilibrium point is considered locally stable if populations with allele frequencies close to it move toward it and unstable if they move away. To determine stability it is trivial to demonstrate that p′ > p if p > 0. Therefore, p = 0 is unstable and p = 1 is stable. According to these dynamics, when p0 > 0, POET will be driven to fixation. Thus complete overlap is always invadable by POET. However, since limp→ 0 p′/p = 1, the spread of POET alleles is slow when they are rare, and thus fixation will be heavily influenced by stochastic fluctuations (cf. Cavalli-Sforza and Feldman 1983).

2.2 Partial Overlap Model

To study the dynamics of partial overlap, the previous model is modified to add a coefficient, c ∈ (0,1/s], which creates a fitness cost that occurs when bards and poets interact with one another. The partial-fitness matrix now becomes

BardPoetBard11csPoet1cs1+s

Consequently, the fitnesses of poets and bards become

dpoet(p)=(1+s)[p2+2p(1p)]+(1cs)(1p)2dbard(p)=1(1p)2+(1cs)[p2+2p(1p)]

These fitnesses produce an unstable polymorphic equilibrium at

p^=11+c1+2c

and two stable trivial equilibria. According to these dynamics, when p0 < , POET will be driven to extinction, but when p0 > p̂, it will be driven to fixation. Thus partial overlap is not invadable by POET, unless its frequency grows above a threshold via genetic drift or some other evolutionary means, echoing previous results (e.g. Smith 2004; Smith and Kirby 2008).

2.3 Cliquing Model

Under partial overlap of language faculties, POET is selected against when rare. However, in similar systems, assortative interactions have been found to encourage the spread of social traits (e.g. Hamilton 1975; Cavalli-Sforza and Feldman 1983; Wilson and Dugatkin 1997; Gardner and West 2010). In this system, assortative interactions manifest themselves in language cliques. Cliques are a reasonable addition because individuals are likely to prefer to interact with individuals with whom they can communicate successfully. This behavior can be active or passive, e.g. frequent interactions among kin, who are also likely to share the same language phenotype. If cliquing behavior is active, then language phenotypes are an example of a “greenbeard” (Gardner and West 2010).

The previous two models are essentially the results of Asmussen et al. (2004) restated for bards and poets. The dominant PIM model will now be extended to include a cliquing behavior governed by a parameter, f ∈ [0, 1]. Individuals are now assumed to interact randomly at a rate of 1 − f and within their clique with a rate of f (cf. Gardner and West 2010). Note that the previous two models become edge cases of this model. Under this cliquing model, the fitnesses of poets and bards are

dpoet(p)=f(1+s)+(1f)[p2+2p(1p)](1+s)+(1f)(1p)2(1cs)=1+s(1+c)s(1f)(1p)2dbard(p)=f+(1f)(1p)2+(1f)[p2+2p(1p)](1cs)=1cs(1f)[p2+2p(1p)]

See Table 1.—Alternative formulations for recessive and haploid/genic POET models can be derived similarly.—From these fitnesses, a polymorphic equilibrium for bards and poets is found at a frequency of POET of

Table 1. Partial fitnesses and pairwise interaction rates with cliquing behavior.

Phenotype Partial fitnesses Interaction rates Total fitnesses


Bards Poets Bards Poets
Bards 1 1 − cs f + (1 − f)q2 (1 − f)(p2 + 2pq) 1 − cs(1 − f)(p2 + 2pq)
Poets 1 − cs 1 + s (1− f)q2 f + (1 − f)(p + 2pq) 1 + s − (1 + c)s(1 − f)q2

Bards are recessive for ancestral allele BARD. Parameter s ≥ 0 is the selective advantage of two poets communicating with one another, and 0 ≤ c ≤ 1/s is the relative penalty of cross communication. The strength of cliquing behavior is represented by 0 ≤ f ≤ 1. The alleles POET and BARD have frequencies p and q = 1 − p in the population, respectively

p^=11+(1f)c(1f)(1+2c) (1)

which exists if f < c/(1 + c) and is always unstable. Instability is trivially proved by showing that p′p if p. By combining these results, the dynamics for two different initial conditions can be determined.

  1. When c > 0 and 0 ≤ f < c/(1 + c), both p = 0 and p = 1 are stable, and p = is unstable. When p0 < , POET will be driven to extinction, and when p0 > , POET will be driven to fixation. Therefore, this situation is invadable by POET, only if its frequency is above the threshold of .

  2. When c ≥ 0 and c/(1 + c) ≤ f ≤ 1, p = 0 is unstable; p = 1 is stable; and the polymorphic equilibrium does not exist. When p0 > 0, POET will be driven to fixation. This situation is always invadable by POET.

The behaviors of other formulations are identical except that p^=c(1f)f(1f)(1+2c) if POET is recessive and p^=c(1f)f(1f)(1+2c) if POET is haploid/genic.

Poets will replace bards in the population, only if the frequency of POET is greater than . At this point, selection for more fit communication will drive poets to fixation. Clearly, if c = 0, ≤ 0 and there is no threshold value of p for poets; as c increases the threshold also increases. More importantly, as f increases, the threshold value of p decreases, and when fc/(1 + c), the threshold disappears completely, and selection will drive poets to fixation even if they cannot communicate perfectly with bards, i.e. poets can invade a population of bards. For example, if c = 1/4 and f = 0, the threshold is about 9%; if f = 0.1, the threshold is about 5%; and if f = 0.2, it disappears.

Clearly, slight increases in cliquing behavior can have drastic effects on the location of p̂, and thus the evolvability of language abilities. These results indicate that the threshold resulting from incomplete communication between bards and poets can be reduced or eliminated by a preference that increases interactions among members of the same language phenotype. Additionally, limp→0 p′/p = 1 + fs (1 + c) − cs, indicating that the spread of rare POET alleles happens faster as f increases. Thus stronger cliquing behaviors decrease the influence of stochastic fluctuations in the population (cf. Cavalli-Sforza and Feldman 1983). The shape of Δp = p′p is actually concave with respect to f. At low values of p, stronger cliquing behaviors increase Δp, while at high values of p, the stronger cliquing behaviors decrease Δp. More specifically,

dΔpdf>0ifp<1c(1+s)1+c(2+s)

Note that is less than this threshold. Above this threshold, poets are common and bards are rare, and thus increasing cliquing behavior rewards bards more than poets.

2.4 Interaction with Genetic Drift

Selection is not the only evolutionary force, and another force like random genetic drift could raise POET above the threshold frequency, after which selection will drive poets to fixation and maintain them in the population. The interaction of the cliquing model with drift was explored using simulations. A Wright–Fisher simulation of this model was implemented, starting with 1 copy of POET in a population of 10000 diploids and running until fixation or elimination of POET. For each parameter set in Fig. 1, simulations were repeated until either 1000 successful sweeps of POET or until (sweeps + 1)/(trials + 1) < 0.01/2N. If 1000 sweeps occurred, their probability was estimated from the unbiased negative-binomial estimator. Additionally, the 5% quantile was estimated from the observed durations of the selective sweeps.

Fig. 1.

Fig. 1

Selective sweeps of language-associated alleles occur for many different values of cliquing rates and interaction penalties. (a) Probability of occurrence of selective sweep. Each panel represents a different value for the selective coefficient (s) and contains 441 combinations of f and c. Points above the solid line are sets where > 0. For each set, the probability of a selective sweep occurring for POET in a population of size 10k was estimated in Simulation from the number of attempts it took to get 1000 selective sweeps. They are color-coded based on their ratio to the neutral expectation. Blank areas represent sets for which sweeps were too rare to study (2Nr < 0.01). (b) The 5% lower quantiles of the duration of selective sweeps for each set. Assuming the model is a reasonable description for the evolution of FOXP2 in humans, values below 4000 can be considered reasonable for its selective sweep

From the results (Fig. 1), I find that the probability of a selective sweep of POET occurring still depends on c, f, and s; however, selective sweeps occur regularly in some populations that have to surpass a threshold (above the solid line in Fig. 1a). The model presented here can also make predictions about the parameter values required to explain the selective sweep of FOXP2 in humans by using the duration of a sweep as a simple summary statistic. If we consider the result that the selective sweep of FOXP2 in humans began within the last 6000 generations (Enard et al. 2002) and assume that it finished before the out-of-Africa expansion of humans (roughly 2000 generations ago; Liu et al. 2006), then the duration of the sweep has an upper bound of 4000 generations. The majority of the points in Fig. 1b have lower 5% quantiles that are less than this bound and can be considered reasonable. It appears that s has to be at least 0.01, while b and f can vary as long as their relationship with drift allows for sweeps.

FOXP2 is just used an example of a language-associated gene that has undergone a selective sweep. The dynamics illustrated in this model could apply to any language-associated gene. For interpreting Fig. 1b, I have assumed that the modern allele is dominant over its ancestral state, but in reality, this is unknown in the case of FOXP2. We also do not know if the observed selective sweep was caused by the FOXP2 locus itself, or if there were interactions with genetic variation at other loci.

3 Discussion

I have developed a population-genetic model for the evolution of new language-associated alleles. In this model, “bards” have an ancestral language phenotype, and “poets” have a novel language phenotype that is potentially more fit. I find that if the poets' language abilities completely overlap bards', then selection will drive poets to fixation. However, partial overlap can select against poets when they are rare and can prevent them from invading the population. The ability of poets to spread can be greatly enhanced through assortative behaviors that create language-based cliques. Through simulations I explored the impact of genetic drift and estimated that the selective coefficient of FOXP2 would need to be at least 0.01 to explain a recent selective sweep.

In humans, genetic drift was probably more effective than predicted in the simulations since it only had to work in small tribes and not the entire population. After drift fixed POET in a few tribes, migration could spread the novel allele to other tribes (cf. Smith 2004). However, drift and migration alone are not sufficient to spread new language genes because migrants will be rare in a new tribe and selected against. Therefore, population-level events like rapid tribe growth and division, group migration, or war are needed for poets to efficiently spread outside their source tribes. Although this scenario is reminiscent of Wright's shifting-balance theory (Wright 1931) and suffers from the debate around it (Coyne et al. 1997; Whitlock and Phillips 2000), such events are common in human history, and their role in assisting language evolution cannot be entirely disregarded.

Although the cliquing model includes assortative interactions similar to assortative mating models, the results are simplified because mating still occurs randomly. Although it is not unreasonable to assume that there should be some level of assortative mating if there are cliques, the complexities of mate choice make it difficult to define what this level should be. For example, in a small tribe, language phenotypes will covary with relatedness, and inbreeding avoidance might prevent a language clique from becoming a mating clique. Assortative, within-clique mating should only reinforce the effects of language cliques.

Some researchers have argued that selection against language-specific alleles when they are rare indicates that language abilities are unlikely to evolve through selection for improved communication and that proposed language adaptations are thus neither specific to language nor adaptations for it (Smith 2004; Smith and Kirby 2008). They have further argued that language adaptations that do exist are likely to be cultural and not biological in origin. However, this hypothesis is called into question by the large number of universal anatomical features (physical and mental) of humans that are associated with language ability (Pinker and Bloom 1990; Vargha-Khadem et al. 1995; Fisher et al. 1998; Lai et al. 2001; Enard et al. 2002; Pinker and Jackendoff 2005; Nishimura et al. 2006; Coop et al. 2008; Corballis 2009). Not all of these can be explained by culture or as spandrels (Gould and Lewontin 1979). Because evolution is a process that results from many interacting forces, it would still be valid to call POET-like genes language-adapted even if drift or some other force had an important role in their evolutionary dynamics.

This model demonstrates that novel language alleles can be selected for even if they disrupt communication between ancestral and derived phenotypes. By forming cliques, the penalty of cross-communication is reduced or eliminated, making novel language alleles more likely to reach fixation. Furthermore, the interaction of selection with other evolutionary forces actually serves to increase the probability of fixation of language-specific alleles. However, this model is only the beginning, providing a framework for using classical population-genetic theory to study the evolution of the language ability and other social traits. Further research, utilizing more complex, frequency-dependent, population-genetic models and incorporating cultural transmission should allow for better understanding of the evolution of language and detection of language-associated genes.

Acknowledgments

This work was supported by an NSF Predoctoral Fellowship, NIH grant GM070806 to J.L. Thorne, and NLM grant LM010009-01 to D. Graur and G. Landan. The author would like to thank D. Graur, J. Ross-Ibarra, N. Sabath, H. Spencer, and two reviewers for thoughtful comments.

References

  1. Asmussen MA, Basnayake E. Frequency-dependent selection: the high potential for permanent genetic variation in the diallelic, pairwise interaction model. Genetics. 1990;125:215–230. doi: 10.1093/genetics/125.1.215. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Asmussen MA, Cartwright RA, Spencer HG. Frequency-dependent selection with dominance: a window onto the behavior of the mean fitness. Genetics. 2004;167:499–512. doi: 10.1534/genetics.167.1.499. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Brighton H. Compositional syntax from cultural transmission. Artif Life. 2002;8:25–54. doi: 10.1162/106454602753694756. [DOI] [PubMed] [Google Scholar]
  4. Cavalli-Sforza LL, Feldman MW. Paradox of the evolution of communication and of social interactivity. Proc Natl Acad Sci USA. 1983;80:2017–2021. doi: 10.1073/pnas.80.7.2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Chater N, Reali F, Christiansen MH. Restrictions on biological adaptation in language evolution. Proc Natl Acad Sci USA. 2009;106:1015–1020. doi: 10.1073/pnas.0807191106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Christiansen MH, Chater N. Language as shaped by the brain. Behav Brain Sci. 2008;31:489–508. doi: 10.1017/S0140525X08004998. discussion 509-558. [DOI] [PubMed] [Google Scholar]
  7. Clark AG, Glanowski S, Nielsen R, Thomas PD, Kejariwal A, Todd MA, Tanenbaum DM, Civello D, Lu F, Murphy B, Ferriera S, Wang G, Zheng X, White TJ, Sninsky JJ, Adams MD, Cargill M. Inferring nonneutral evolution from human-chimp-mouse orthologous gene trios. Science. 2003;302:1960–1963. doi: 10.1126/science.1088821. [DOI] [PubMed] [Google Scholar]
  8. Coop G, Bullaughey K, Luca F, Przeworski M. The timing of selection at the human FOXP2 gene. Mol Biol Evol. 2008;25:1257–1259. doi: 10.1093/molbev/msn091. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Corballis MC. The evolution of language. Ann N Y Acad Sci. 2009;1156:19–43. doi: 10.1111/j.1749-6632.2009.04423.x. [DOI] [PubMed] [Google Scholar]
  10. Coyne JA, Barton NH, Turelli M. Perspective: a critique of Sewall Wright's shifting balance theory of evolution. Evolution. 1997;51:643–671. doi: 10.1111/j.1558-5646.1997.tb03650.x. [DOI] [PubMed] [Google Scholar]
  11. Darwin CR. The descent of man and selection in relation to sex. London: Murray; 1871. [Google Scholar]
  12. Dediu D, Ladd DR. Linguistic tone is related to the population frequency of the adaptive haplogroups of two brain size genes, aspm and microcephalin. Proc Natl Acad Sci USA. 2007;104:10944–10949. doi: 10.1073/pnas.0610848104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Enard W, Przeworski M, Fisher SE, Lai CSL, Wiebe V, Kitano T, Monaco AP, Pääbo S. Molecular evolution of FOXP2, a gene involved in speech and language. Nature. 2002;418:869–872. doi: 10.1038/nature01025. [DOI] [PubMed] [Google Scholar]
  14. Evans N, Levinson SC. The myth of language universals: language diversity and its importance for cognitive science. Behav Brain Sci. 2009;32:429–448. doi: 10.1017/S0140525X0999094X. discussion 448-494. [DOI] [PubMed] [Google Scholar]
  15. Fisher SE, Scharff C. Foxp2 as a molecular window into speech and language. Trends Genet. 2009;25:166–177. doi: 10.1016/j.tig.2009.03.002. [DOI] [PubMed] [Google Scholar]
  16. Fisher SE, Vargha-Khadem F, Watkins KE, Monaco AP, Pembrey ME. Localisation of a gene implicated in a severe speech and language disorder. Nat Genet. 1998;18:168–170. doi: 10.1038/ng0298-168. [DOI] [PubMed] [Google Scholar]
  17. Gardner A, West SA. Greenbeards. Evolution. 2010;64:25–38. doi: 10.1111/j.1558-5646.2009.00842.x. [DOI] [PubMed] [Google Scholar]
  18. Gould S, Lewontin R. Spandrels of San-Marco and the Panglossian paradigm—a critique of the adaptationist program. Proc Biol Sci. 1979;205:581–598. doi: 10.1098/rspb.1979.0086. [DOI] [PubMed] [Google Scholar]
  19. Hagoort P. On broca, brain, and binding: a new framework. Trends Cogn Sci. 2005;9:416–423. doi: 10.1016/j.tics.2005.07.004. [DOI] [PubMed] [Google Scholar]
  20. Hamilton WD. Innate social aptitudes of man: an approach from evolutionary genetics. In: Fox R, editor. ASA studies 4: biosocial anthropology. London: Malaby; 1975. pp. 133–153. [Google Scholar]
  21. Hauser MD, Chomsky N, Fitch WT. The faculty of language: what is it, who has it, and how did it evolve? Science. 2002;298:1569–1579. doi: 10.1126/science.298.5598.1569. [DOI] [PubMed] [Google Scholar]
  22. Lachmann M, Bergstrom CT. The disadvantage of combinatorial communication. Proc Biol Sci. 2004;271:2337–2343. doi: 10.1098/rspb.2004.2844. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Lai CS, Fisher SE, Hurst JA, Vargha-Khadem F, Monaco AP. A forkhead-domain gene is mutated in a severe speech and language disorder. Nature. 2001;413:519–523. doi: 10.1038/35097076. [DOI] [PubMed] [Google Scholar]
  24. Liu H, Prugnolle F, Manica A, Balloux F. A geographically explicit genetic model of world-wide human-settlement history. Am J Hum Genet. 2006;79:230–237. doi: 10.1086/505436. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Nishimura T, Mikami A, Suzuki J, Matsuzawa T. Descent of the hyoid in chimpanzees: evolution of face flattening and speech. J Hum Evol. 2006;51:244–254. doi: 10.1016/j.jhevol.2006.03.005. [DOI] [PubMed] [Google Scholar]
  26. Nowak MA, Krakauer DC. The evolution of language. Proc Natl Acad Sci USA. 1999;96:8028–8033. doi: 10.1073/pnas.96.14.8028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Nowak MA, Plotkin JB, Jansen VA. The evolution of syntactic communication. Nature. 2000;404:495–498. doi: 10.1038/35006635. [DOI] [PubMed] [Google Scholar]
  28. Pinker S, Bloom P. Natural language and natural selection. Behav Brain Sci. 1990;13:707–784. [Google Scholar]
  29. Pinker S, Jackendoff R. The faculty of language: what's special about it? Cognition. 2005;95:201–236. doi: 10.1016/j.cognition.2004.08.004. [DOI] [PubMed] [Google Scholar]
  30. Rizzolatti G, Arbib MA. Language within our grasp. Trends Neurosci. 1998;21:188–194. doi: 10.1016/s0166-2236(98)01260-0. [DOI] [PubMed] [Google Scholar]
  31. Smith K. The evolution of vocabulary. J Theor Biol. 2004;228:127–142. doi: 10.1016/j.jtbi.2003.12.016. [DOI] [PubMed] [Google Scholar]
  32. Smith K, Kirby S. Cultural evolution: implications for understanding the human language faculty and its evolution. Philos Trans R Soc Lond B, Biol Sci. 2008;363:3591–3603. doi: 10.1098/rstb.2008.0145. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Smith K, Brighton H, Kirby S. Complex systems in language evolution: the cultural emergence of compositional structure. Adv Complex Syst. 2003;6:537–558. [Google Scholar]
  34. Vargha-Khadem F, Watkins K, Alcock K, Fletcher P, Passingham R. Praxic and nonverbal cognitive deficits in a large family with a genetically transmitted speech and language disorder. Proc Natl Acad Sci USA. 1995;92:930–933. doi: 10.1073/pnas.92.3.930. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. von Humboldt W. Über die Verschiedenheit des menschlichen Sprachbaus und ihren Einfluss auf die geistige Entwicklung des Menschengeschlechts. Bonn: Dummlers; 1836. [Google Scholar]
  36. Whitlock MC, Phillips PC. The exquisite corpse: a shifting view of the shifting balance. Trends Ecol Evol. 2000;15:347–348. doi: 10.1016/s0169-5347(00)01930-3. [DOI] [PubMed] [Google Scholar]
  37. Wilson DS, Dugatkin LA. Group selection and assortative interactions. Am Nat. 1997;149:336–351. [Google Scholar]
  38. Wray A. Protolanguage as a holistic system for social interaction. Lang Commun. 1998;18:47–67. [Google Scholar]
  39. Wright S. Evolution in Mendelian populations. Genetics. 1931;16:97–159. doi: 10.1093/genetics/16.2.97. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES