Short abstract
How did music evolve? We show that prevailing views on the evolution of music — that music is a byproduct of other evolved faculties, that music evolved for social bonding, and that music evolved to signal mate quality — are incomplete or wrong. We argue instead that music evolved as a credible signal in at least two contexts: coalitional interactions and infant care. We suggest that basic features of music, including melody and rhythm, result from adaptations in the proper domain of human music, providing a foundation that cultural evolution shapes into its actual domain.
Long abstract
Music comprises a diverse category of cognitive phenomena that likely represent both the effects of psychological adaptations that are specific to music (e.g., rhythmic entrainment) and the effects of adaptations for non-musical functions (e.g., auditory scene analysis). How did music evolve? Here, we show that prevailing views on the evolution of music — that music is a byproduct of other evolved faculties, evolved for social bonding, or evolved to signal mate quality — are incomplete or wrong. We argue instead that music evolved as a credible signal in at least two contexts: coalitional interactions and infant care. Specifically, we propose that (1) the production and reception of coordinated, entrained rhythmic displays is a co-evolved system for credibly signaling coalition strength, size, and coordination ability; and (2) the production and reception of infant-directed song is a co-evolved system for credibly signaling parental attention to secondarily altricial infants. These proposals, supported by interdisciplinary evidence, suggest that basic features of music, such as melody and rhythm, result from adaptations in the proper domain of human music. The adaptations provide a foundation for the cultural evolution of music in its actual domain, yielding the diversity of musical forms and musical behaviors found worldwide.
1. Introduction
Thirty years ago, Steven Pinker and Paul Bloom made the “incredibly boring” (1990 p. 708) argument that language is the product of natural selection, resulting from adaptations for communication. This was, in fact, controversial: despite the facts that language is universally used to communicate information essential to survival and reproduction; that all people typically acquire language easily in infancy; that languages have deep computational structure unrelated to technological or societal progress; that neural injuries cause specific language impairments; and that specialized neuroanatomy enables speech production — many believed that language arose from byproducts of adaptations for cognition, not communication (e.g., Chomsky 1968). The question of how language evolved is far from settled but it continues to generate testable hypotheses and productive results (e.g., Atkinson et al. 2008; Christiansen & Chater 2008; Fitch 2017; Searcy 2019).
Music shares many of the above facts with language but its contributions to survival and reproduction, if any, are less evident than those of language. As such, there is no consensus surrounding why humans make and listen to music; why music has its particular features and not others; or how music evolved. Three views on the evolution of music are prominent: a byproduct view, where music developed as a result of non-musical adaptations; an adaptationist view, where music evolved to create and maintain social bonds; and a second adaptationist view, where music evolved to signal mate quality.
We will argue that these views are incomplete or incorrect, proposing instead that the human psychology of music is built on adaptations for at least two categories of vocal signals common across species: territorial advertisements and contact calls. In these contexts, music can communicate overt information about covert properties of the human mind, functioning as a credible signal. This account explains some basic musical phenomena and the limited scope of music’s proper domain, laying a foundation for cultural-evolutionary processes that shape the diversity of music worldwide.
2. What constitutes evidence for adaptation by natural selection?
Since antiquity, it has been recognized that unlike abiotic natural phenomena, the existence and form of many biological traits must be explained in reference to their “purpose”. Rain does not fall in order to make corn grow, Aristotle wrote, but of necessity: “What is drawn up must cool, and what has been cooled must become water and descend, the result of this being that the corn grows.” (Physics II, part 8). Teeth, in contrast, are “admirably constructed for their general office, the front ones being sharp, so as to cut the food into bits, and the hinder ones broad and flat, so as to grind it to a pulp.” (Parts of Animals III, part 1). Human teeth universally grow this way, so this relation of means to ends cannot be due to chance, Aristotle argued; instead, these parts of animals can only be explained by their purpose, which benefits the animal itself (Ariew 2002).
Two millennia later, William Paley described the organism as an intricate machine, “a cluster of contrivances” whose physical structures are best comprehended in relation to the useful functions they provide the organism (Paley 1803 p. 185). He recognized that these contrivances must be understood in relation to their environments: “Can it be doubted, whether the wings of birds bear a relation to air, and the fins of fish to water?” (Paley 1803 p. 291). Whereas Paley, arguing by exclusion, took evidence of design to be evidence for God, Darwin instead proposed that design evolved via heritable variation and differential reproduction, i.e., adaptation by natural selection (Darwin 1859).
Hypotheses for adaptation can be evaluated using criteria not so different from those of Aristotle, Darwin, or even Paley. Adaptations are generally characteristics of an entire species recognizable from a tight relation of means to ends: a fit between the features of the proposed adaptation and the features of the adaptive problem that it putatively solved. This constitutes evidence of design (Williams 1966).
The human heart must be an adaptation to pump blood, for example, because it develops universally with properties that efficiently and reliably cause blood to circulate (e.g., muscles that compress chambers; valves; inlet and outlet ports; connections to the circulatory system), an outcome essential to survival and reproduction. Conversely, a pumping function best explains why the heart has the structure it has, instead of other tissues in other arrangements, reliably and efficiently solving an adaptive problem (Darwin 1859; Williams 1966).
There are important differences between pre- and post-Darwinian conceptions of design, however. Selection among heritable variants generally optimized traits to increase inclusive fitness, the reproduction of self or close relatives (Hamilton 1964), contra, e.g., “well-being” or “longevity”; and did so in ancestral environments but not necessarily modern ones. In Williams’s words, “the degree to which an organism actually achieves reproductive survival” is “rather trivial… The central biological problem is not survival as such, but design for survival” (Williams 1966 p. 159).
A key issue when investigating the evolution of a trait — one central to questions of the evolution of music, as we will discuss — is the distinction between proximate and ultimate-level explanations (Mayr 1961; Tinbergen 1963). Proximate-level questions ask how a trait develops over ontogeny and what causal relationships it has with other parts of the organism. Ultimate-level questions, on the other hand, ask why a trait came to be and require identifying the phylogenetic history of the trait across ancestral and extant species, and the causal role it played, if any, in the reproduction of genes coding for it (discussion: Dickins & Barton 2013; Laland et al. 2011; Scott-Phillips et al. 2011).
A proximate-level explanation for bitter taste, for instance, is that certain chemicals bind to bitter taste receptor proteins on the tongue, increasing intracellular calcium in the taste receptor cell, thereby stimulating a sensory afferent neuron, and so on. An ultimate-level explanation accounts for the presence of bitter taste receptor genes across vertebrates, and their expression in the oral cavity and other tissues, as part of a neurophysiological system to detect and avoid dietary toxins, which, if ingested, could reduce inclusive fitness (Roper & Chaudhari 2017). Proximate-level explanation can also be applied to dysfunctions (such as cancer) and non-functions (such as the beating sound of the heart); they do not imply that a trait is an adaptation.
Ultimate-level analyses also do not presuppose adaptation. Adaptationist claims are onerous; there are infinitely many ways a phenotype can be carved into traits, most of which are unrelated to a genetic lineage’s reproductive fitness. Supporting a claim of adaptation therefore requires evidence for design: evidence that a trait is improbably well-organized to efficiently, effectively, and reliably solve an adaptive problem (Williams 1966).
Here we evaluate claims about the evolution of music using the approach outlined above, with particular attention to the psychological design of music.
3. Two claims regarding the origins of music are unconvincing
A successful account of music must provide evidence for design of its principal features. Music is an auditory display built from melodies and rhythms. It can involve loud, elaborate, coordinated performances with voices and musical instruments, with many listeners; it also can involve quiet, simple lullabies between parents and infants. It appears in many behavioral contexts, across the sexes and across the lifespan, as a common element of daily life.
Before we proceed, readers should note a companion BBS target article, “Music as a coevolved system for social bonding” (Savage et al. 2020), which presents an alternate evolutionary scenario for the origins of music. Savage and colleagues propose that musicality arose fairly recently in human ancestry as a cultural invention to enhance social bonding, and was then elaborated via gene-culture evolution over tens of thousands of years (see also Podlipniak 2017).
We appreciate the focus on gene-culture co-evolution, a phenomenon we do not explore in detail in this paper, but which dovetails nicely with our concluding ideas concerning cultural evolution. Nevertheless, the two approaches differ substantially. The theoretical justification for music as a social bonding mechanism relies primarily on the work of Robin Dunbar and colleagues, who argued that grooming serves this function in smaller groups of non-human primates, but that larger human groups required more efficient mechanisms, namely laughter and music. On this idea, social bonds are created by the effects of joint musical performances on the neurobiology of the performers, rather than from information encoded in music. The costs of music production do not enter into this account, and Savage et al. (2020) mostly avoid theoretical or phylogenetic connections between human musicality and similar phenomena in other species (though they do offer some predictions concerning musicality in other species).
The theory we will describe differs substantially from this view. We propose that music has deep evolutionary roots in primate vocalizations, especially contact calls and territorial advertisements that were likely present in the last common ancestor of all primates, approximately 55 to 85 million years ago. We see music as a credible signal conveying information to listeners with whom signalers might have conflicts of interest, in a fashion similar to most work on non-human vocalizations. We draw theoretical and phylogenetic connections between human music and similar phenomena in other primate and non-primate species. We argue that unique aspects of human lifestyle, including multilevel social organization and high levels of parental investment (including from alloparents), selected for especially elaborate vocal signaling relative to most other species. Finally, we propose that the key features of musicality arising from adaptations in the proper domain of credible signaling serve as building blocks for cultural evolution, which shapes music into its actual domain.
To begin, we review two popular ideas about the origins of music, and ask whether they explain the core properties of music.
3.1. The byproduct hypothesis fails in light of six lines of evidence
The null hypothesis against which hypotheses for adaptation are tested claims that music has no evolved function, and instead is a byproduct of other adaptations that evolved for other functions unrelated to music. The byproduct hypothesis dates at least to William James, who wrote that music “is a pure incident of having a hearing organ” (James 1890 p. 627); this view echoed other scholars of his time and before (Darwin 1871; Monboddo 1774; Rousseau 1781; Spencer 1902), and is common in the literature. Music has been proposed to be a byproduct of linguistic or emotive communication (Bryant 2013; Cattell 1891; Cross & Woodruff 2009; Jackendoff 2009; Panksepp 2009; Patel 2008; Pinker 1997; Schulkin 2013; Sievers et al. 2013); auditory scene analysis and habitat selection (Pinker 1997; Trainor 2015); signaling vocalizations (Bryant 2013; Livingstone 1973; Mithen 2005; Pinker 1997; Richman 1993); mimicry of other animals’ vocalizations (Benzon 2001; Krause 2012); physical or motor abilities (Geist 1978; Larsson 2014; Panksepp 2009; Tierney et al. 2011); theory of mind (Livingstone & Thompson 2009); or general cognitive capacities (Cross 2012; Honing & Ploeger 2012; Jackendoff & Lerdahl 2006; Justus & Hutsler 2005; Marcus 2012).
Pinker’s (1997) framing is the best-known: “I suspect that music is auditory cheesecake, an exquisite confection crafted to tickle the sensitive spots of … our mental faculties” (p. 534). Six lines of evidence, taken together, call the byproduct hypothesis into question, however, and motivate theories of specific adaptations for music.
First, complex, song-like vocalizations have evolved convergently across distantly-related animals, including multiple clades of birds, marine mammals, primates, and insects; and provide important benefits related to mating and territorial defense (Coen et al. 2016). In many cases these are socially learned, like music (Schachner et al. 2009). Moreover, at least some explicitly musical behaviors, such as entrainment to a beat, appear in many species (Phillips-Silver et al. 2010; Wilson & Cook 2016). Music-like adaptations can therefore evolve, in principle.1
Second, music is a human universal: it appears throughout a representative sample of human societies (Mehr et al. 2019); plays an essential role in important activities, such as rituals and ceremonies (Nettl 2015); and demonstrates cross-cultural links between form and function (Bainbridge et al. 2020; Mehr et al. 2019, 2018; Trehub et al. 1993a). Music is not a byproduct of traits present in only some cultures.
Third, music shows evidence for complex design, including grammar-like structures analogous to those of language (Lerdahl & Jackendoff 1983), some of which may be universal (Jacoby et al. 2019; Mehr et al. 2019). Moreover, music perception is computationally complex, such that artificial intelligence is currently at pains to emulate it (Benetos et al. 2013). Music is unlikely to occur as a result of random chance.
Fourth, the motivation and ability to perceive music appear early in ontogeny: neonates are sensitive to rhythms (Winkler et al. 2009) and melodies (Granier-Deferre et al. 2011) and infant music cognition is precocial (e.g., infants have detailed long-term memory for music; Mehr et al. 2016; Mehr & Spelke 2017; Trainor et al. 2004; reviews: Trehub 2001; Hannon & Trainor 2007). Music perception develops naturally, does not require extensive training, and is not a byproduct of traits specific to adults.
Fifth, music perception displays evidence for neural specialization (Norman-Haignere et al. 2015, 2019) and is impaired in specific deficits, such as tone-deafness (Peretz et al. 2002; Peretz & Vuvan 2017). Music is unlikely to be a byproduct of other neural systems.
Last, music is ancient: flutes are at least 40,000 years old (Conard et al. 2009) and the human auditory and vocal production systems are far older (Fitch 2006; Martínez et al. 2004; Quam et al. 2013). Music is not a recent cultural invention.
While no one of these pieces of evidence is a sufficient condition for rejecting the byproduct hypothesis, taken together, they motivate a search for an alternative.
3.2. The social bonding hypothesis fails in light of three theoretical issues
The best-known evolutionary hypothesis for music is that it evolved to create and maintain “social bonds.” Juan Roederer (1984), for example, argued that music established “behavioral coherency in masses of people” to meet the demands of “coherent, collective actions on the part of groups of human society” (p. 356). Steven Brown (2000b) asserted that “music-making has all the hallmarks of a group adaptation and functions as a device for promoting group identity, coordination, action, cognition, and emotional expression” (p. 296). These and similar claims (Barrow 2005; Benzon 2001; Brown 2000a; Conard et al. 2009; Cross & Morley 2009; Dissanayake 2000, 2008, 2009; Dunbar 1998, 2012a; Freeman 2000; Fritz et al. 2013; Geissmann 2000; Huron 2001; Jourdain 1997; Kirschner & Tomasello 2009, 2010; Koelsch & Siebel 2005; Kogan 1994; Launay et al. 2016; Loersch & Arbuckle 2013; McNeill 1995; Merker et al. 2009; Morley 2012; Pearce et al. 2015; Reddish et al. 2013; Richman 1993; Schulkin 2013; Schulkin & Raglan 2014; Weinstein et al. 2016; Wiltermuth & Heath 2009) together form the social bonding hypothesis.
This view was popularized in part by Robin Dunbar’s proposal of a role for social bonding in the evolution of many human social traits (Dunbar 1991): he argued that in primates, manual grooming serves a social bonding function; as group size increased in the hominin lineage, manual grooming became prohibitively time-consuming, creating a selection pressure for a less costly bonding mechanism; and, as a consequence, new bonding mechanisms evolved. These mechanisms were first proposed to be language and gossiping, which could be broadcast to multiple individuals while doing other tasks, replacing grooming as the primary means of social bonding in humans (Dunbar 1998). Later, Dunbar and colleagues revised this position (Dunbar & Lehmann 2013), arguing that musical chorusing and laughter evolved instead (Dunbar 2012a; Dunbar & Dunbar 2004; Dunbar et al. 2012; Pearce et al. 2015, 2017, 2016; Tarr et al. 2015, 2014, 2016).
Most empirical tests of the hypothesis examine music’s impact on prosociality and its hormonal mediators in laboratory experiments: participants are randomized into groups that engage in synchronized musical behavior (treatment) or another activity (control). The general finding is greater levels of prosociality and cooperation in the music/dancing conditions relative to controls2 (Anshel & Kipper 1988; Cirelli et al. 2014; Kirschner & Tomasello 2009, 2010; Pearce et al. 2015, 2017, 2016; Reddish et al. 2013; Schellenberg & Habashi 2015; Tarr et al. 2015).
The social bonding hypothesis has at least three key issues, however.
3.2.1. A “stress-reducing” social bonding mechanism is superfluous
The ultimate-level problem of sociality is that it imposes difficult-to-overcome inclusive fitness costs: increased competition with conspecifics for essential, limited resources; inbreeding depression; and increased exposure to pathogens (Alexander 1974). Living with and cooperating with conspecifics requires that the inclusive fitness benefits of sociality outweigh its fitness costs. Often they do not: dispersal and solitary living are ubiquitous across species (Benton et al. 2017; Bowler & Benton 2005; Duputié & Massol 2013).
In primates, diurnal social living evolved about 52 million years ago (Shultz et al. 2011). Because diurnal foraging increases predation risk, the joint evolution of diurnality and sociality supports the long-standing idea that primate sociality evolved as a defense against predators (Silk & Kappeler 2017; Van Schaik 1983). Advocates of the social bonding hypothesis claim that social living creates psychological stresses that threaten the cohesion of the group, necessitating a “bonding mechanism” (in non-human primates, grooming; in humans, music) that reduces stress:
Since living in groups of any kind creates stresses that would normally result in the group disbanding, species that live in stable social groups have to circumvent this problem if they are to prevent group size collapsing.
(Dunbar 2012a p. 1838)
This idea is superficially appealing because it draws attention to the fitness costs of social living, presenting them as proximate-level stresses, and implying a need for a behavioral response to relieve the stress.
But an ultimate-level analysis must consider alternative strategies. The alternative to sociality is solitary living, seen in ~70% of mammal species (Wilson & Reeder 2005). On the hypothesis that sociality solves the adaptive problem of defense against predators, the net fitness benefits of sociality exceed those of solitary life (with its attendant high risk of predation). The stress-related benefits of a “social bonding mechanism” are superfluous.
For an analogy, consider a group of friends walking close together in a dangerous neighborhood at night. There are costs to this sociality: they bump into each other; they don’t fit on the sidewalk, forcing some to risk injury from oncoming cars; it’s harder for them to converse, and so on. An ultimate-level analysis recognizes that the benefits of their sociality — defense against getting mugged — outweigh the costs, and no bonding mechanism, such as grooming or singing, is required to keep them together.
Grooming does provide hygienic benefits to primates, such as removal of ectoparasites (Barton 1985); perhaps with social functions beyond hygiene (McKenna 1978; Seyfarth 1977; Seyfarth & Cheney 1984), because across species the proportion of time spent grooming is positively correlated with group size (Dunbar 1991). But this association, core empirical finding underlying the social bonding hypothesis, is poorly evidenced: its strength is modest, and, when adjusting for terrestriality and other ecological factors, is not distinguishable from zero (Jaeggi et al. 2017). This may because primate group size is confounded with terrestriality; if so, increased grooming time could instead be explained by some property of a terrestrial niche, such as increased parasite load (Grueter et al. 2013; Jaeggi et al. 2017; cf. Dunbar & Lehmann 2013).
Whatever evolved social functions grooming might have, it is unlikely that they include stress reduction. Predation risk, not grooming, is the ultimate-level “bonding force” that likely explains primate sociality, and the additional benefits of cooperative endeavors such as hunting, parenting, and territorial defense likely explain human sociality.
3.2.2. The social bonding hypothesis conflates proximate- and ultimate-level reasoning
Might grooming solve other problems of sociality? Defense against predators, territory defense, hunting, and parenting are compelling examples of cooperation whose benefits could offset sociality’s costs. They raise profound theoretical challenges, however, involving free-riders: agents that receive benefits from others but do not provide any. Without countermeasures, free-riding is favored by natural selection (Nowak 2006), so forming cooperative relationships with arbitrary individuals is untenable. Instead, these relationships must be targeted at specific categories of individuals, such as kin, neighbors, or those likely to reciprocate, and adjusted to local socio-ecological conditions (Markham et al. 2015) such that long-run benefits are provided only when they exceed long-run costs to the donor (Nowak 2006).
Proponents of the social bonding hypothesis offer a proximate-level explanation, wherein the neurohormonal effects of music are a solution to the impediments to sociality and cooperation described above:
My proposal is that music arose originally because it allows individuals to become more group-oriented. Music seems to achieve this through a capacity to produce endorphins which have a positive effect on our attitudes towards others.
(Dunbar 2012b p. 208)
We propose that synchrony might act as direct means to encourage group cohesion by causing the release of neurohormones that influence social bonding.
(Launay et al. 2016 p. 779)
There are two problems with these claims. First, evidence that X causes Y is weak evidence that X evolved to cause Y. Recall Aristotle: rain causes corn to grow without implying any “purpose” for rain. Rain shows little evidence of special design for solving corn’s hydration problem, it has many other, unrelated effects, and so on. By analogy, a proximate-level analysis shows that petting animals reduces human anxiety via hormonal and physiological effects (Beetz et al. 2012), but animal-petting did not evolve to reduce anxiety or the threats that trigger it, of course.
Second, proximate mechanisms, such as release of neurohormones, are themselves subject to selection, and therefore cannot serve as ultimate-level explanations for the genetic evolution of a social bonding strategy. In order for a social strategy to evolve, it must outperform conceivable mutant strategies (a well-recognized criterion for claims of adaptation, the evolutionarily stable strategy; Smith & Price 1973). A mutation that prevented music from increasing endorphins and/or reduced endorphins’ effects on prosociality would have allowed humans with that mutation to free-ride: they could gain from the prosocial behavior of others (becoming more bonded with the group) without being prosocial themselves. Such a free-rider mutation would be selected for (Nowak 2006).
How can an unconditional social bonding mechanism like music be stabilized against free-riders? Confusion between proximate- and ultimate-level analyses in the social bonding hypothesis leaves this question unanswered.
3.2.3. Music is poorly designed to coordinate groups
Another version of the social bonding hypothesis proposes that music evolved by genetic group selection to enable humans to act as coordinated superorganisms: music increased group fitness by promoting group identity, cognition, coordination, and catharsis. These within-group functions are proposed to increase the ability of groups to compete with other groups (Brown 2000a).
While music does play a universal role in rituals (e.g., shamanistic trance; Singh 2018; Mehr et al. 2019), the problem with this view is that it equates proximate social “functions” or “effects” with adaptations shaped by natural selection3. Because any behavior has effects, and some of those effects may be incidentally “useful” (e.g., animal-petting reduces anxiety), the proper criterion is that music be well-designed for the proposed within-group function.
The superorganism model is based on an explicit analogy with multicellular organisms, where energy and time are sharply constrained resources. Within-organism signaling, cognition, and coordination evolved to be as efficient as possible, to maximize between-organism competitiveness. In neural signaling, for example, time and energy trade off: higher information rates use more energy, so at all levels of neural organization, strategies evolved to reduce energy consumption by filtering out predictable inputs, reducing the amount of redundant encoding (Laughlin 2001; Niven 2016; Niven & Laughlin 2008).
But music takes considerable time and energy to produce. People who produce music incur opportunity costs (Mehr & Krasnow 2017) and expend energy that could be used for other activities that directly increase reproductive success, such as food production (Hagen & Bryant 2003). Music is also often loud, and could attract predators or allow competing groups to eavesdrop. These costs also accrue to the variants of the social bonding hypothesis discussed earlier. Indeed, while music and other synchronous, ritualistic behaviors are often argued to be unambiguously beneficial for groups, the “neglected dark side of synchrony” (p. 3) shows that synchrony increases conformity and groupthink while reducing creativity and productive dissent (Gelfand et al. 2020).
Because natural selection shapes traits to perform specific functions by selecting among alternatives, a criterion for claiming adaptation is that a trait is uniquely suited to causing certain effects, relative to feasible alternatives. In the case of the social bonding hypothesis, an obvious alternative to music that serves the same proposed within-group functions is language, a low-cost signaling system that efficiently facilitates the coordination of collective action and other social behaviors (Pinker & Bloom 1990). Consider that the coxswain, whose job is to maintain the coordination of rowers, does not sing, nor does the crew; the efficient vocalization “row!” minimizes the energy required for within-group coordination, while maximizing the rowers’ ability to win a race4. Moreover, in a sample of six small-scale human societies, conversation time was close to the expected grooming time for a terrestrial primate with recent ape ancestry (Jaeggi et al. 2017), suggesting that language adequately provides whatever social functions grooming may have. As a social coordination or bonding mechanism, music thus appears to have no advantages over language and many disadvantages.
The weak case for music as an adaptation for social bonding does not mean that music has no evolved social functions. In the rest of this paper, we outline an alternative social hypothesis for the origins of music.
4. Origins of music in credible signaling
The social bonding hypothesis proposes that the fitness benefit of music arises from the neurophysiological effects of music production on music-makers themselves. Signaling hypotheses, in contrast, propose that fitness benefits arise from the information communicated by music-makers, via their music, to various categories of listeners.
Acoustic communication has evolved repeatedly and independently in many clades of tetrapods. It appeared 200 million years ago in therian mammals and is found in ~95% of mammal species (Chen & Wiens 2020). If music is an adaptation, it likely evolved from ancestral vocalizations, an idea foreshadowed by Lucretius two millennia ago:
To imitate the liquid notes of birds Was earlier far ‘mongst men than power to make, By measured song, melodious verse and give Delight to ears.
(De Rerum Natura, Book V)5
In non-human animals, most vocal adaptations evolve to send signals, which are defined as “any act or structure which alters the behaviour of other organisms, which evolved because of that effect, and which is effective because the receiver’s response has also evolved” (Maynard Smith & Harper 2003 p. 3).6 On average, receivers benefit from responding to the signal, and signalers benefit from the receivers’ response.7 Cues, in contrast, convey information about one organism to another but did not evolve to do so (e.g., bleeding is a cue of injury but did not evolve to signal injury). Common functions of signals include species identification in mate choice; individual recognition in interactions among conspecifics, such as territoriality, dominance, and cooperation; and conveying information on formidability, health, or behavioral type (Tibbetts et al. 2017).
Why do animals believe the vocal signals they hear? What maintains their credibility? If the interests of signaler and receiver are aligned, as in cells in an organism or agents in a superorganism, then selection for dishonesty is absent and signals evolve to be as efficient as possible (Maynard Smith & Harper 2003). If not, then selection can drive signalers to deceive and receivers to be vigilant against manipulation.
Some signals are necessarily credible because they are causally related to the quality being signaled. A wolf howl credibly indicates that a wolf is present, for example, and the number of distinct, simultaneous wolf howls credibly indicates a lower bound on the size of the pack (an “index”; Maynard Smith & Harper 2003). Costly signals, in contrast, are credible because to send them imposes a fitness cost that is lower for individuals with the quality than those without it: faking the signal is more costly than it is worth8 (Spence 1973; Zahavi 1975).
In addition to credibility, multiple selection pressures can shape signals, including biases in the sensory systems of receivers; receiver abilities to discriminate signals; the structure of the environment; social challenges; and arms races between signalers and receivers (Krebs & Dawkins 1984), where signalers are selected to produce the signal at lower cost and receivers are selected to better discriminate the quality of signalers (Bradbury & Vehrencamp 1998; Cummings & Endler 2018; Doorn & Weissing 2006; Hill 1994; Lindsay et al. 2019; McCoy & Haig 2020).
Here, we emphasize the importance of conflicts of interest between music producers and the audience, private information, and the features of music that underlie its ability to overtly signal covert information about the minds of those producing it.
4.1. The mate quality hypothesis is poorly supported
An early theory of music, first proposed by Darwin (1871) and endorsed by many others (Barrow 2005; Charlton 2014; Dutton 2009; Merker 2000a; Miller 2000a, 2000b; Miranda et al. 2003; Orians 2014; Sluming & Manning 2000; Todd 2000; Todd & Werner 1999; van den Broek & Todd 2009) is that male musical abilities and female musical preferences coevolved, with music functioning as a credible signal of male mate quality.
If musical production requires a brain and body relatively unperturbed by genetic mutation, infection, or developmental instability, plus time to cultivate one’s talent (properties that are difficult to perceive directly), the mate quality hypothesis argues that mates who prefer music-producers will benefit. This increases selection for music-producers to generate more impressive, complex, or interesting music (so as to improve the chance of being chosen as a mate). Given the sex difference in the amount of investment required of human parents for an offspring to be reproductively viable (Trivers 1972), signal production should be accentuated in the sex with lower obligate parental investment (males) and choosiness should be accentuated in the sex with the higher obligate parental investment (females).
Sexually dimorphic signals of mate quality are common across species (e.g., coloration, ornaments) and they play key roles in mate attraction (Andersson 1994; Dale et al. 2015; O’Brien et al. 2018); for example, male birdsong functions in part to attract mates (Catchpole & Slater 2018). Some mammals show this pattern too. In sac-winged bats, males produce complex songs that females may use in mating decisions (Behr et al. 2006). Adult house mice produce sexually dimorphic ultrasonic vocalizations with song-like features in response to the presence of novel female urine, but not the scents of immature females or other males (Musolf et al. 2010). If human music evolved in the context of signaling mate quality, it should have retained similar, signature features of a sexually selected adaptation9.
Ironically, the mate quality hypothesis is easy to refute precisely because it is so well-specified. Music is tenuously linked to mate quality. While love/courtship songs are common across cultures, they are only weakly identifiable as such relative to other forms of song (Mehr et al. 2019, 2018). A large twin study found inconsistent relations between measures of reproductive success or sociosexuality and measures of music production or music perception abilities (Mosing et al. 2015); the few positive relations reported were weak, and no stronger in men than in women. In another study, music performance quality was positively associated with indices of mate quality and attractiveness, but the effects and sample size were small and did not differ by sex (Madison et al. 2018). Musical preferences can vary across the menstrual cycle (Charlton 2014), perhaps indicating a role in mating, but this effect has failed to replicate10 (Charlton et al. 2012).
Sexually-selected traits that function for display and choice in mating contexts are often developmentally and contextually calibrated to mating (Kokko 1997). In humans, for instance, puberty coordinates the developmental timing of physical and psychological traits that support mating: menarche and spermarche coincide with the development of secondary sexual characteristics and the relative onset of mating psychological systems (Kaplan & Gangestad 2005). But humans of all ages produce and listen to music; no part of the music faculty emerges at or around puberty11. Young children enjoy the music of sexually mature conspecifics, and vice versa, a pattern contrasting with that of sexual attraction, which begins in late childhood (Herdt & McClintock 2000). While mating-related behaviors tend to be produced only in mating-relevant contexts, music is produced and consumed in multifarious contexts, universally, that are completely unrelated to mating (e.g., work, healing, greeting visitors, mourning; Mehr et al. 2019).
Last, many mating-related traits in humans are sexually dimorphic, such as male biases for traits useful in physical competition (Puts 2010; Sell et al. 2008) and female biases for traits useful in discerning investment potential (Buss 1989; Conroy-Beam et al. 2015; Kenrick & Keefe 1992). If music evolved to signal mate quality, then adaptations for music production should be more developed in men and adaptations for music perception should be more developed in women.
Little evidence supports this pattern. Dimorphisms in human vocalizations and vocal anatomy — lower voices in males, signaling threat potential (Puts et al. 2011), and higher voices in females, signaling fecundity (Apicella & Feinberg 2009) — appear beginning at puberty (McDermott 2012), but are neither more exaggerated nor more honestly signaled via song rather than via speech (cf. Keller et al. 2017). Auditory perception skills are comparable in males and females, with only small and inconsistent sex differences (Müllensiefen et al. 2014; Shuter-Dyson & Gabriel 1981). Musical disorders, such as specific musical anhedonia and congenital amusia, are found just as frequently in males as in females (Mas-Herrero et al. 2014; Peretz & Vuvan 2017). A lone report of sex differences in the frequency of music performance across human societies (Savage et al. 2015) is likely the result of sampling bias (discussion: Mehr et al. 2018, 2019). If anything, female musicians produce more novel songs than their male peers (Askin et al. 2020).
The pervasiveness of music across the sexes is evident in daily life: both males and females seek out and enjoy the performances of both male and female musicians (Hagen & Bryant 2003), and some evidence suggests that musical preferences are biased toward performers of the same sex as the listener (Greenberg et al. 2020). Male and female performers are both well-represented, historically, on the Billboard Top 100, albeit with an advantage toward males (Lafrance et al. 2011). While many of the highest-grossing musical artists of all time are male, sex differences in success as a musician likely have little to do with biology — a half-century ago, virtually all professional orchestral musicians were male, for example, whereas now the world’s top orchestras are approaching gender parity (Sergeant & Himonides 2019).
This pattern of evidence has contributed to a growing consensus that links between music and mate quality are weak (Mosing et al. 2015; Ravignani 2018).
4.2. Music as a credible signal of cooperative intent
We agree with proponents of the mate quality hypothesis that music is a credible signal. But song-like vocalizations in non-human animals often signal much more than mate quality. Even in songbirds, the poster-species for the sexual selection of male song, singing can serve other functions, such as territorial advertisements (Tobias et al. 2016).
We also agree with proponents of the social bonding hypothesis that musical abilities evolved because musical performances played an important role in cooperative sociality. But given the issues described above, we find it more likely that music evolved to credibly signal decisions to cooperate that were already reached by other means, not to determine them. Cooperation often fails, making it useful to have a credible signal indicating that, by various (non-musical) means, one or more agents have decided to cooperate. Credible signals of cooperative intent, in turn, can produce decisions by signal receivers that benefit the signalers.
We will discuss two behavioral contexts where complex vocal signals have evolved in numerous other species; where unique characteristics of the human species created selection pressures for an elaborate credible signal; and where music universally appears.
First, in the context of territorial advertisements, we consider pressures of coordinated territorial defense across coalitions and in the context of cooperative alliances with other groups. We propose that music could function as a credible signal of coalition strength, size, and coordination ability.
Second, in the context of contact calls, we consider pressures of helpless infants requiring substantial parental investment, relative to other primates; and multiple dependent siblings competing for parental investment. We propose that music could function as a credible signal of parental attention.
4.2.1. Synchronous coordinated music as a credible signal of coalition strength, size, and cooperation ability
In mammals, loud auditory signals are frequently agonistic, and territorial advertisements are a prime example (Gustison & Townsend 2015). Territoriality is common in taxa ranging from bacteria to vertebrates (Maher & Lott 2000; Smith & Dworkin 1994), including primates (Willems & van Schaik 2015). Territory owners have a consistent advantage over intruders, often retaining their territory without a fight (Kokko et al. 2006). It is thus in the interest of owners to advertise their residence in a territory to deter intruders and avoid a fight.
Territorial calls, which credibly signal that a territory is occupied, are found in many species, including birds, primates, and other mammals (Bates 1970; Gustison & Townsend 2015; Ladich & Winkler 2017; Wich & Nunn 2002). Loud primate calls are a plausible evolutionary precursor to human music (Geissmann 2000) because they appear to have existed in the last common ancestor of all primates and are often produced by both sexes and directed at both sexes (Wich & Nunn 2002). Some African apes display drumming-like behaviors as part of territorial signals (Goodall 1986; Hagen & Hammerstein 2009). In humans, vocal and instrumental music are reliably associated with war, procession, and ritual across a representative sample of societies (Mehr et al. 2019, Table 1); appears in political and military contexts with analogues to territorial signaling (Hagen & Bryant 2003; Hagen & Hammerstein 2009); is generally not sexually differentiated (see above); and, of course, is often loud.
Social species that collectively defend territory, such as chimpanzees and several species of social carnivores (e.g., lions, wolves) produce coordinated vocal territorial advertisements (e.g., roars, howls), which credibly signal group size to potential intruders (Harrington 1989; Harrington & Mech 1979; Krebs 1977; McComb et al. 1994; Wilson et al. 2001). In a study of nearly 10,000 bird species, the presence of communal signaling was associated with territoriality, typically in conjunction with stable social bonds (Tobias et al. 2016). Moreover, the effect of territoriality was more than twice the size of that of social bonds, and territoriality was a crucial precursor to communal signaling, suggesting that long-term social bonds might evolve after communal signaling.
Some coordinated vocal signals, like bird duets, involve complex, temporally synchronized displays. A high level of synchronous coordination among signalers requires considerable effort to achieve, and thus credibly signals a willingness and ability to cooperate over time, thereby serving as an index of the quality of the coalition defending the territory, above and beyond coalition size (critical information otherwise not apparent to intruders; Hagen & Bryant 2003; Hall & Magrath 2007; Wiley & Wiley 1977). If synchronous coordination is a signal of coalition quality, selection should push receivers to better discriminate differences in degrees of coordination, and signalers to produce more complex coordinated signals, leading to signal elaboration.
Several primate species also produce highly synchronized song-like duets. As in birds, song-like calls are characteristic of species living in small, monogamous groups (Schruth et al. 2019). Although duetting and coordinated vocalizations might have some role in pair-bond formation and strengthening in a few monogamous species, such as gibbons and titi monkeys, most evidence suggests these calls primarily function to exclude intruders and maintain spacing: they are territorial advertisements (Snowdon 2017). Experimental evidence suggests that higher levels of coordination in such signals indicate higher coalition quality. Duetting magpie-larks that had been paired for a longer time were more likely to produce highly coordinated displays, and in an experimental loudspeaker study on natural territories, playbacks of highly coordinated duets, which simulated territorial intrusions, evoked significantly higher song rates by resident males than poorly coordinated duets (indicating that the highly coordinated duets were perceived as more threatening; Hall & Magrath 2007).
Humans are both primates and social hunters, so we expect human ancestors to have advertised territory ownership in a similar fashion: using loud, coordinated vocalizations, perhaps with drumming. We propose that such territorial vocalizations are an evolutionary precursor to music, especially rhythmic music (Hagen & Bryant 2003; Hagen & Hammerstein 2009; cf. Merker 2000b). Signatures of this function might persist in modern humans in coordinated group dances that are universal across cultures (Mehr et al. 2019; Nettl 2015; e.g., the Māori haka; Best 1924). The group music of Aka Congo Basin hunter-gatherers, for instance, is audible to groups living some distance away.12
Complex forms of social organization likely set the stage for the evolution of complex credible signals, including synchronized and coordinated vocalizations. Sometime after diverging from other apes, the human lineage underwent a major transition to a multilevel society. In multilevel societies, small family units regularly aggregate with other family units, forming a higher-level unit, which in some species aggregate to form an even higher-level unit. This societal structure occurs in some other primate species (e.g., hamadryas baboons; Swedell & Plummer 2019), and some evidence suggests that higher degrees of social complexity are correlated with increased vocal diversity and flexibility (e.g., in macaques; Rebout et al. 2020).
A notable attribute of some multilevel-society species is that, in addition to the agonism or tolerance exhibited between units, units also cooperate. Homo sapiens exhibits particularly rich cooperative behavior between units: cooperative families are nested within cooperative residential groups that often form cooperative alliances with other residential groups to obtain food, buffer resource variation, raise children, defend territory, and so on (Chapais 2013; Hamilton et al. 2007; Pisor & Surbeck 2019; Rodseth et al. 1991; Swedell & Plummer 2019).
Between-group cooperation likely created many new selection pressures. In particular, if human groups varied in the benefits they could provide other groups as allies, and the number of alliances a group could maintain was limited, a biological market would have arisen (Hammerstein & Noë 2016), wherein groups evaluated the coalition quality of potential allies by assessing their size, cooperation ability, and willingness to cooperate, and potential allies had incentives to exaggerate these qualities (Hagen & Bryant 2003; Hagen & Hammerstein 2009). Common properties of music, especially those found in rhythmic, coordinated performances, provide a close fit to the necessary criteria for a credible signal of such otherwise difficult-to-observe group-level features. The time needed to create and practice group complex musical performances and achieve complex synchrony necessarily corresponds to a dimension of the underlying quality of the coalition: the amount of time coalition members have cooperated with one another.
In summary, we propose that music evolved, in part, as a means for groups to credibly show off their qualities to other groups.
There is substantial ethnographic, historical, and archaeological evidence of credible signaling of coalition quality among human groups, typically in the context of feasting. In feasting, two or more individuals share special types or quantities of foods, for a special purpose or event (Hayden 2014). In addition to food, feasting often includes special clothing, ornaments and other artifacts — and music and dance. Feasting has been documented in societies of all levels of social complexity, ranging from band-level hunter-gatherers to nation-states, including at archaeological sites throughout the Holocene (reviewed in Hayden & Villeneuve 2011; Hayden 2014). While many functions of feasting have been proposed (Hayden 2014; Wiessner & Schiefenhövel 1998), there is widespread agreement that feasts play a critical role in the formation of alliances between groups (reviewed in Hayden & Villeneuve 2011; Hayden 2014). As Sosis (2000) observed, the goal of ritualized foraging and feasting and other forms of food distribution is often to enhance the reputation of an entire group by displaying its productivity.
It is notable, then, that music and dance co-occur with feasting frequently in the ethnographic record.13 For example, Congo Basin hunter-gatherers are renowned for their music, which they perform in many social contexts, including at spirit plays and large inter-community dances following big game kills (Fürniss 2017; Lewis 2013, 2017), as in the Mbendjele BaYaka:
Sharing [food] between camps is less frequent, but will occur when big game is killed and during massana forest spirit performances. When an elephant is killed, Mbendjele in the area go rapidly to where the carcass is lying. Large camps grow, and feasting and dancing go on until the elephant has been consumed.
(Lewis 2017 p. 227)
It is plausible that similar events regularly occurred during human evolution because there is archaeological evidence for domestic spaces, large game hunting, mass kills, cooking, large aggregations, burials, ornaments, use of pigments, and musical instruments throughout the Upper Pleistocene, with some evidence appearing earlier (Barham 2002; Conard et al. 2009; Kuhn 2014; Kuhn & Stiner 2019; Maher & Conkey 2019; Stiner 2019, 2013, 2017).
A function of music in the context of alliance formation may also help to explain why music is often produced for and enjoyed by strangers, as in modern recorded music and live concerts. The selective dynamics of “social foraging” in the hominin niche, where strangers have an uncertain but non-zero possibility of becoming lucrative social partners, frame strangers as the appropriate targets of social foraging tactics (Delton et al. 2011; Delton & Robertson 2012; Rand et al. 2014). Moreover, some data shows that observers can infer coalition quality and fighting ability from observations of musical performances and other coordinated behaviors. People who listened to a musical performance with instruments mixed either in-sync, consistently out-of-sync, or scrambled rated coalition quality higher in the in-sync vs. out-of-sync (but not scrambled) conditions (Hagen & Bryant 2003). When listening to people marching asynchronously or synchronously, judges rated the synchronous groups as more formidable, better able to coordinate a physical attack, and higher in social closeness; judgments of formidability were mediated by judgments of coordination, not bonding (Fessler & Holbrook 2016).
On this view, music is clearly rooted in sociality. In contrast to the social bonding hypothesis, however, we predict that music does not directly cause social cohesion: rather, it signals existing social cohesion that was obtained by other means (Hagen & Bryant 2003 p. 30).
We do not think this is the only social context in which music can act as a credible signal. Within groups, musical performances might also create common knowledge of decisions to cooperate, which could serve group coordination and cooperation (Chwe 2001; Freitas et al. 2019; see Hagen & Bryant 2003 for other possibilities); credibly signal qualities guiding same-sex partner choice in a biological market (Hammerstein & Noë 2016), and perhaps informing mate choice by both sexes; and as a group analog of emotional expression (Hagen & Bryant 2003; Hagen & Hammerstein 2009). Producing music that is specific to a group might also credibly signal membership in that group (Mehr et al. 2016; Mehr & Spelke 2017) in a fashion similar to food preferences and dialects (see Liberman et al. 2016; Kinzler et al. 2007).
Next, we examine a case where we believe within-group rather than between-group credible signaling has shaped music.
4.2.2. Infant-directed song as a co-evolved system for negotiating parental investment of attention
Contact calls are a common vocalization across many species, distinct from territorial signals. In primates, these include loud calls between separated group members, and frequent quiet calls during heightened risk of separation (e.g., in dense vegetation). Contact calls rank among the most diverse and complex call types across species (Bouchet et al. 2013; Leighton 2017), enabling individuals to recognize, estimate distance to, and maintain contact with their social partners (Kondo & Watanabe 2009; Rendall et al. 2000).
One important class of contact calls are those between parents and offspring. These serve functions of mutual interest to parents and offspring, for example, enabling parents to be available to solve problems their offspring are ill-suited to solve on their own. Chacma baboon barks, for example, range from tonal, harmonically rich variants that are used for contact calls, to barks with a noisier, harsher structure that are used for alarm calls. By the age of six months, infants learn to discriminate call types and to discriminate their mothers’ contact barks from those of unrelated females (Fischer et al. 2000); and mothers recognize their infants’ contact calls (Rendall et al. 2000).
We propose that in the human lineage, maternal contact calls evolved to encode credible information beyond identity and distance, namely attention to the infant. There are few relationships where inclusive fitness interests overlap as much as they do between parents and offspring — but even these are not perfectly aligned. Because of the mechanics of diploid sexual reproduction, a parent is equally related to all her offspring, whereas each offspring is twice and four times as related to itself as it is to each of its full and half siblings, respectively. A strategy that optimizes the parent’s inclusive fitness (e.g., equal food distribution across offspring) does not necessarily optimize an offspring’s inclusive fitness, and vice versa. This possibility, parent-offspring conflict (Trivers 1974), implies differences in the interests of supply genes in the parent and demand genes in the offspring (Bossan et al. 2013).
Some aspects of human reproduction suggest that selection pressures for complex contact calls have increased relative to those in apes. First, human brain size is about triple that of other apes (Schoenemann 2006) and most brain growth occurs postnatally, implying that human infants are born helpless and have a very long juvenile period. Second, human forager interbirth intervals are about half those of chimpanzees (Marlowe 2005; Thompson 2013), requiring ancestral human mothers to simultaneously care for multiple dependent offspring (in contrast to chimpanzee mothers, who typically care for a single dependent offspring). Third, unlike other great apes, humans rely heavily on alloparenting in a multilevel society, requiring ancestral human infants to establish relationships with multiple caregivers and vice versa (Hrdy 2009); unlike chimpanzee infants, ancestral human infants typically competed with multiple juveniles for the attention of multiple caregivers.
Human parents increase their offspring’s fitness by attending to them and protecting them from harm. Attention is a limited resource, however; many other challenges require attention, and solving those may benefit the parent more than the infant (relative to the provisioning of attention to maintain infant safety). The interests of infants and their parents conflict, in terms of the optimal provisioning of attention: infants often “prefer” more attention than a parent would “prefer” to provide.
How does this conflict of interests play out? Infants have bargaining power to extract parental investment (in the form of material investment, like nursing, or parental attention); they demand attention by crying (for discussion of evolutionary scenarios, see Soltis 2004 and commentaries). Parents lack perfect access to their infant’s internal state, so crying provides information about when investment can be provided. Care-eliciting infant vocalizations (e.g., distress calls, separation calls) are common across mammals (Newman 2007), including humans, and mothers reliably respond to these vocalizations by providing care (Bornstein et al. 2017).
Whereas infants can easily detect when material investment has been provisioned, attention is a covert property of the parent’s mind, with unreliable cues. Infants can infer that parents are attending to them from estimating the parent’s gaze direction, but this only provides partial information (the parent could be concentrating on something else). Touch is also a good cue that a parent is nearby; but the parent could be asleep, or attending to something else.
Better than these cues would be a credible signal from the parent, reliably indicating that the infant has their attention (Mehr & Krasnow 2017). A vocal signal is a good candidate because its acoustic properties allow the proximity of the producer of the signal to be reliably inferred by the target. To the extent that the signal monopolizes the vocal apparatus, producing it is incompatible with other activities (such as speaking to another adult) that could co-opt the parent’s attention. And aspects of the vocal signal can be modulated in real time, in response to the infant’s state and behavior, which cannot be done without attending to the infant.
Here again we expect an evolutionary arms race, driven by partially conflicting fitness interests between senders and receivers, producing an elaborated signal. We propose that this process could lead to key features of music: in particular, contrasting with the rhythmic features developing from territorial signals, we expect the rather more subdued context of soothing parent-infant contact calls to give rise to melodic features, tokens of which are the lullabies we sing to infants today (Mehr & Krasnow 2017).
Three sets of results support this idea. First, if adaptations support the production of song in parents and alloparents, and the appetite for and ability to perceive song in infants, then music should appear universally in the context of infant care and infant-directed songs should share features worldwide. These predictions, long discussed in the music cognition literature (Hannon & Trainor 2007; Peretz 2006; Trehub & Nakata 2001–2002), are well-evidenced. In an analysis of high-quality ethnography from a representative sample of human societies, text concerning vocal music was significantly associated with infant care and children, over and above base rates of reporting (this finding replicates both with expert annotations of the ethnography and automated text analysis; Mehr et al. 2019). Moreover, infant-directed songs are found in 100% of a pseudorandom sample of field recordings in mostly-small-scale societies; and naïve listeners, who are unfamiliar with the languages or cultures involved, reliably recognize them as infant-directed, with remarkable consistency (Mehr et al. 2019, 2018). This finding replicates prior cross-cultural work (Trehub et al. 1993b, 1993a).
Second, the genetic architecture of musical perception and motivation should be regulated, in part, by parent-of-origin epigenetic mechanisms, such as genomic imprinting. Humans are sexually reproducing but not obligately monogamous, which differentiates the conflict of interest between parents and offspring by parental sex: because maternity certainty is greater than paternity certainty, genes of maternal origin are more likely to be found in an offspring’s siblings than genes of paternal origin (Haig & Wilkins 2000). Genes of maternal origin are thus under selection to bias the tradeoff in demand for parental investment in the direction of the offspring’s siblings and away from the offspring; on average, maternally inherited genes should reduce investment demands on mothers, and vice versa. This prediction is confirmed by the fact that genes with parent-of-origin effects tend to affect demands for parental investment, such as intrauterine growth (Haig 1993).
Genomic imprinting disorders, where genetic dysregulation is differentiated by parent-of-origin, provide a unique test of the relation between a trait and its putative link to parental investment (Haig & Wharton 2003). Angelman and Prader-Willi syndromes result from opposing dysregulation at the same genetic region (15q11–13), with a loss of genes expressing maternal interest resulting in Angelman syndrome, and the reverse, a relative loss of genes expressing paternal interest resulting in Prader-Willi syndrome. The behavioral phenotypes reflect the different effects of maternally vs. paternally inherited genes: infants with Angelman syndrome have a voracious appetite while nursing, are awake for more hours of the day than typically developing infants, and attract more attention via smiling than do typically developing children (Ubeda 2008; Williams et al. 2006), increasing investment demands on the mother. Infants with Prader-Willi syndrome, in contrast, are born with low birth weight, sleep more than typically developing infants, and often lack a suckle reflex (Cassidy & Driscoll 2008; Holm et al. 1993; Peters 2014), with the opposite effect (decreasing investment demands on the mother).
Recent findings show that these effects extend to the domain of music, demonstrating a genetic link between music perception and parental investment. People with Angelman syndrome have a suppressed relaxation response to music (Kotler et al. 2019); while people with Prader-Willi syndrome have a potentiated relaxation response to music, along with pitch perception deficits (Mehr et al. 2017). These results support the idea that music signals attention: suppressed relaxation in Angelman syndrome implies increased maternal demands, while potentiated relaxation in Prader-Willi syndrome implies reduced maternal demands, in line with other findings concerning parental investment demands in genomic imprinting disorders.
Last, we also expect relationships between the acoustic features of non-human primate contact calls and human infant-directed song. While few data exist with which to test these relationships, preliminary findings suggest that similarities do exist. For example, baboon contact calls are harmonically rich, whereas alarm calls are harsh and noisy (Fischer et al. 2000); in a vocalization corpus from 21 human societies, infant-directed song was acoustically distinct from infant-directed speech across many pitch, rhythmic, phonetic, and timbral attributes (Moser et al. 2020), with a similar pattern of results to the acoustic differences between baboon contact calls and alarm calls. Moreover, several acoustic features driving these effects were related to vocal exertion (e.g., temporal modulation, pitch rate, vowel rate), perhaps honestly signaling additional costs incurred by the signaler.
5. Discussion
A comprehensive understanding of music requires that proximate-level explanations are distinguished from ultimate-level explanations uniquely linked to music; that proposed adaptations explain the core features of music that are putatively shaped by natural selection, and distinguish them from features that are byproducts of other adaptations; and finally, that the results of evolutionary analyses provide a foundation on which cultural-evolutionary processes can plausibly act.
The credible signaling account meets these criteria, whereas other accounts of the origins of music do not.
5.1. Credible signaling may explain some basic features of music
Early in this paper we noted some properties of human music that need explanation. While we find it implausible that any one theory can explain all of them, two core features of music are directly related to the ideas presented here.
An evolved system for quickly and reliably signaling coalition quality, which might otherwise be difficult to perceive, especially during territorial advertisements, agonistic intergroup encounters (e.g., war songs, dances), and alliance-forging feasts, provides a functional explanation for rhythm: selection pressures toward synchronized isochronous sounds, with complex internal design. An evolved system for credibly signaling parental attention to infants provides a functional explanation for melody: selection pressures toward manipulating affective prosody in vocalizations, constrained by the physics of the vocal production system and inherent features of the auditory world.
These “building blocks” appear universally in music (Mehr et al. 2019; Nettl 2015; Savage et al. 2015), like “building blocks” of language (e.g., Baker 2001). They provide a grammar-like, combinatorially generative interface through which musical content can be created, improvised, and elaborated upon, through hierarchical organization of meter and tonality14 (Krumhansl 2001; Lerdahl & Jackendoff 1983), in fashions that themselves have universal signatures (Jacoby & McDermott 2017; Jacoby et al. 2019; Mehr et al. 2019).
The importance of rhythm and pitch in human music perception — and the degree to which these features of music are unique to human vocalizations — may be directly tied to their evolutionary history.
5.2. Music is culturally evolved but cultural evolution has to start somewhere
We understand culture as information that affects individuals’ behavior and that is acquired from conspecifics through teaching, imitation, and other types of social transmission (Boyd & Richerson 2004; Tooby & Cosmides 1992). Because information is transmitted with some degree of fidelity through non-genetic means (e.g., memory, learning), information is cumulative. Some cultural information is passed on with greater frequency and higher regularity than other information. For example, social learners tend to pay attention to information sources that have established prestige more than sources that do not (Henrich & Boyd 2002). Similarly, some information is easier to learn than other information; children exhibit interest about information associated with danger and retain it with greater fidelity and over longer periods than related information unassociated with danger (Barrett et al. 2016; Wertz 2019).
One characteristic of cumulative culture is ritualization (Lorenz 1966), analogous to co-evolutionary processes underlying animal communication systems (Krebs & Dawkins 1984). Cultural signals can develop extravagant physical features resulting from arms race dynamics, particularly in cases when there is a conflict of interest between senders and receivers. Examples from modern environments include conspicuously branded luxury goods, which can signal wealth (Han et al. 2010); or businesses that engage in one-upmanship by incorporating exaggerated sensory features in competitive advertisements (Dunham 2011).
Such cultural ritualization is likely at play in the musical domain, especially given the increasingly important role of elaborate feasting in the cultural evolution of social complexity across the globe throughout the Holocene (Hayden 2014), and given the highly variable musical features that continually unfold over time across compositional styles, instrumentation and orchestration, improvisatory motifs, setting lyrics to music, and so on. Music must be shaped by culture in all contexts, however, not only those of coalition signaling and parental care.
We propose that the adaptations proposed here provide a foundation for cultural-evolutionary processes. These traits — particular grammar-like structures, for instance, such as tonalities and meters — gravitate towards certain forms, or “attractors”, and away from others (Sperber 1996; Sperber & Hirschfeld 2004). These attractors will interact with evolved capacities for nonmusical traits resulting in sensitivity and attraction to features in communicative acts that trigger them (e.g., musical phenomena that evoke the sound of an emotional voice), increasing variability in music.
As this process repeats within and across cultures, the diversity of music increases, while underlaid by universals that can be traced back to music’s adaptive functions in credible signaling. This pattern of universality and diversity is exactly what is observed in systematic analyses of music across cultures (Mehr et al. 2019) and, we believe, is what continues to shape music, worldwide, today.
Understanding this variability has been a longstanding interest of ethnomusicologists, who document musical traditions as they are shaped by social environments, politics, and ethnolinguistic history (Blacking 1973; Feld 1984; Nettl 2015), but it has strong parallels in the study of cultural evolution and social transmission. In particular, the prevalence of specific musical features (a particular scale, musical instrument, ornament, vocal practice, and so on) in a given society’s music is likely to be shaped by that society’s relation to other societies, just as the presence or absence of linguistic features is predictable by lineage (Dunn et al. 2011).
We expect that studying the cultural evolution of musical features will be a productive endeavor (with promising first steps already underway; e.g., Savage et al. 2015). We predict, however, that those features least likely to be shaped by culture are those core features predicted by the evolutionary account described here. For example, whereas we expect few musical systems worldwide to lack melody and rhythm as core features, we expect many to have rather different instantiations of those features. This is uncontroversial: while scales commonly used in music differ across cultures, they nevertheless are mutually intelligible, implying shared psychological mechanisms for music perception surrounding the interpretation of melodies (Castellano et al. 1984; Krumhansl et al. 2000; Mehr et al. 2019).
5.3. Auditory cheesecake: not wrong, but not right either
A key difficulty of studying the evolution of music, to which we alluded throughout this paper, is that the present environment has diverged from the environment in which humans evolved. In this context, Pinker’s (1997) “auditory cheesecake” analogy for a byproduct account of music is neither surprising nor controversial. We should expect many human behaviors to have cheesecake-like features. Just as the world’s great writers have stretched the bounds of human language far beyond language’s original adaptive functions, the boundless creativity of composers and performers have created an actual domain of music that, we believe, is quite far from its proper domain.
In this sense, we agree with Pinker that many musical inventions are byproducts, plain and simple: auditory cheesecake is not wrong. But in light of the adaptations proposed here, auditory cheesecake isn’t right either: as we have argued, in at least two contexts, music exhibits design features consistent with adaptations for credible signaling, which give rise to a universal human psychology of music.
6. Conclusions
Why study the origins of music, language, or any other human behavior? It’s unlikely that anyone will ever explain the full extent to which a particular behavior is accounted for by one or more adaptations because, given its complexity, human behavior cannot be exhaustively measured.
Nevertheless, we think that inching toward a functional understanding of complex behavior helps determine what the phenomena in question are, exactly, by isolating the core psychological representations and cultural processes underlying the phenomena from those that are merely associated with them. In the case of music, the analyses presented here lay out a roadmap for understanding the phenomenon of human musicality.
Music-like behaviors occur in a broad swath of species, including our ape relatives, and increasing evidence indicates that these serve important credible signaling functions among agents with conflicts of interest, such as territorial advertisements and mate attraction. In humans, across cultures, music is associated with social behaviors that directly involve credible signaling private information among agents with conflicts of interest, especially coalitional interactions and infant care, but perhaps others too. Accordingly, the psychological mechanisms for processing and producing features of music that are implied by those contexts, such as melody and rhythm, should also be universal; all of this is proposed to constitute music’s proper domain. In music’s actual domain, in contrast, we should expect the engine of cultural evolution to develop and expand these features, producing a diverse set of musical manifestations worldwide that retain some key features of their evolved functions.
Additional mechanisms likely interact with these core features. These may include psychological mechanisms that enable the perception of higher-level features of music, such as implied harmony or musical emotions; linguistic mechanisms that shape the ways in which language and music are intertwined; cultural mechanisms that drive musical traditions and are shaped historically as cultures mix and combine to form new cultures; technological mechanisms that directly alter the feature space of musicality, including musical inventions, such as instruments and music production software, or new musical forms, such as microtonal music; and, not least, aesthetic mechanisms that drive the preferences and interests of those who make and listen to music worldwide.
Understanding these mechanisms in isolation and as they interact with each other to produce the phenomenon of human musicality is a key challenge for the field — a challenge that will be served well by a clear explanation for the origins of music, which can then be built upon using the interdisciplinary toolkit of modern science.
Acknowledgments
We thank Paul Seabright, the Institute for Advanced Study in Toulouse, and the Fondation Royaumont, whose meeting “Origins of music in non-state societies” sparked the idea for this paper; and Mila Bertolo, Courtney Hilton, Cody Moser, and Manvir Singh for feedback on the manuscript.
Funding statement
The “Origins of music in non-state societies” meeting was supported by the Agence Nationale de la Recherche (Investissements d’Avenir ANR-17-EURE-0010). S.A.M. is supported by the Harvard Data Science Initiative and the National Institutes of Health Director’s Early Independence Award DP5OD024566.
Footnotes
Conflicts of interest
None.
The degree to which music-like behaviors in non-human species are homologous to music is up for debate (see Honing et al. 2018; Bertolo et al. 2020; McDermott & Hauser 2005), especially given surprising differences in auditory cognition and auditory preferences across species (Bregman et al. 2016; McDermott & Hauser 2004, 2007). For discussion, see Kotz et al. (2018) and Patel (2017).
We leave aside a serious issue: most studies of prosocial effects of music-making are vulnerable to participant expectancy effects, which may account for the literature’s poor reproducibility (Atwood et al. 2020).
We leave aside intense debates over whether or not genetic group selection is tenable; see Pinker (2012) and commentaries.
We thank anonymous Reviewer 5 for this example.
We thank Cody Moser for suggesting this quotation.
A fascinating exception is the phenomenon of echolocation, wherein the sender and receiver of a vocal signal are the same organism.
We leave aside deceptive signals, which benefit the signaler at the expense of the receiver.
For discussion of cues, indices, costly signals, and their relationships, see Biernaskie et al. (2018) and references therein.
This is true even of adaptations that subsequently change; bird feathers served as insulation before supporting flight but they retain features revealing their original function (see Persons & Currie 2019 and your duvet).
Cycle effects on mate preferences, in general, have been questioned by recent studies (Gangestad et al. 2016; Jones et al. 2018).
Musical preferences change modestly during middle childhood (e.g., Hargreaves et al. 1995) but whether the frequency of musical behaviors also changes is unknown.
This statement is supported by personal interviews in E.H.’s fieldwork.
Whereas evidence of feasting is abundant in agricultural and complex hunter-gatherer societies throughout the Holocene, it is less well-documented in simple hunter-gatherer societies, with some exceptions (Hayden 2014; Wallis & Blessing 2015).
Here, Temperley’s (2004) discussion of communicative structure in the evolution of musical style may have surprising parallels in the biological evolution of music.
References
- Alexander RD (1974). The evolution of social behavior. Annual Review of Ecology and Systematics, 5(1), 325–383. [Google Scholar]
- Andersson MB (1994). Sexual selection, Princeton, NJ: Princeton University Press. [Google Scholar]
- Anshel A, & Kipper DA (1988). The influence of group singing on trust and cooperation. Journal of Music Therapy, 25(3), 145–155. [Google Scholar]
- Apicella CL, & Feinberg DR (2009). Voice pitch alters mate-choice-relevant perception in hunter–gatherers. Proceedings of the Royal Society B: Biological Sciences, 276(1659), 1077–1082. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ariew A (2002). Platonic and Aristotelian roots of teleological arguments. In Ariew A, Cummins R, & Perlman M, eds., Functions: New essays in the philosophy of psychology and biology, Oxford University Press, pp. 7–32. [Google Scholar]
- Askin N, Mauskapf M, Koppman S, & Uzzi B (2020). Do women produce more novel work than men? Gender differences in musical creativity. Working Paper. [Google Scholar]
- Atkinson QD, Meade A, Venditti C, Greenhill SJ, & Pagel M (2008). Languages evolve in punctuational bursts. Science, 319(5863), 588. [DOI] [PubMed] [Google Scholar]
- Atwood S, Schachner A, & Mehr SA (2020). Expectancy effects threaten the inferential validity of synchrony-prosociality research. PsyArXiv. doi: 10.31234/osf.io/zjy8u [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bainbridge CM, Youngers J, Bertolo M, … Mehr SA (2020). Infants relax in response to unfamiliar foreign lullabies. PsyArXiv. Retrieved from https://osf.io/xcj52 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Baker MC (2001). The atoms of language, New York: Basic Books. [Google Scholar]
- Barham LS (2002). Systematic pigment use in the middle pleistocene of South‐Central Africa. Current Anthropology, 43(1), 181–190. [Google Scholar]
- Barrett HC, Peterson CD, & Frankenhuis WE (2016). Mapping the cultural learnability landscape of danger. Child Development, 87(3), 770–781. [DOI] [PubMed] [Google Scholar]
- Barrow JD (2005). The artful universe expanded, Oxford; New York: Oxford University Press. [Google Scholar]
- Barton R (1985). Grooming site preferences in primates and their functional implications. International Journal of Primatology, 6(5), 519–532. [Google Scholar]
- Bates BC (1970). Territorial behavior in primates: A review of recent field studies. Primates, 11(3), 271–284. [Google Scholar]
- Beetz A, Uvnäs-Moberg K, Julius H, & Kotrschal K (2012). Psychosocial and psychophysiological effects of human-animal interactions: The possible role of oxytocin. Frontiers in Psychology, 3, 234. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Behr O, von Helversen O, Heckel G, Nagy M, Voigt CC, & Mayer F (2006). Territorial songs indicate male quality in the sac-winged bat Saccopteryx bilineata (Chiroptera, Emballonuridae). Behavioral Ecology, 17(5), 810–817. [Google Scholar]
- Benetos E, Dixon S, Giannoulis D, Kirchhoff H, & Klapuri A (2013). Automatic music transcription: Challenges and future directions. Journal of Intelligent Information Systems, 41(3), 407–434. [Google Scholar]
- Benton TG, Baguette M, Clobert J, Bullock JM, & Oxford University Press. (2017). Dispersal ecology and evolution, Oxford: Oxford University Press. [Google Scholar]
- Benzon W (2001). Beethoven’s anvil: Music in mind and culture, New York: Basic Books. [Google Scholar]
- Bertolo M, Singh M, & Mehr SA (2020). Sound-induced motion in chimpanzees does not imply shared ancestry for music or dance (preprint), PsyArXiv. Retrieved from https://osf.io/mxdru [DOI] [PMC free article] [PubMed]
- Best E (1924). The Māori (Volume 2), Wellington, N.Z.: H.H. Tombs. [Google Scholar]
- Biernaskie JM, Perry JC, & Grafen A (2018). A general model of biological signals, from cues to handicaps. Evolution Letters, 2(3), 201–209. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blacking J (1973). How musical is man?, Seattle; London: University of Washington Press. [Google Scholar]
- Bornstein MH, Putnick DL, Rigo P, … Venuti P (2017). Neurobiology of culturally common maternal responses to infant cry. Proceedings of the National Academy of Sciences, 114(45), E9465–E9473. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bossan B, Hammerstein P, & Koehncke A (2013). We were all young once: An intragenomic perspective on parent–offspring conflict. Proceedings of the Royal Society B: Biological Sciences, 280(1754), 20122637. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bouchet H, Blois-Heulin C, & Lemasson A (2013). Social complexity parallels vocal complexity: A comparison of three non-human primate species. Frontiers in Psychology, 4, 390. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bowler DE, & Benton TG (2005). Causes and consequences of animal dispersal strategies: Relating individual behaviour to spatial dynamics. Biological Reviews, 80(2), 205–225. [DOI] [PubMed] [Google Scholar]
- Boyd R, & Richerson PJ (2004). The origin and evolution of cultures, New York: Oxford University Press. [Google Scholar]
- Bradbury JW, & Vehrencamp SL (1998). Principles of animal communication, Sunderland, MA: Sinauer Associates. [Google Scholar]
- Bregman MR, Patel AD, & Gentner TQ (2016). Songbirds use spectral shape, not pitch, for sound pattern recognition. Proceedings of the National Academy of Sciences, 113(6), 1666–1671. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brown S (2000a). Evolutionary models of music: From sexual selection to group selection. In Tonneau F & Thompson NS, eds., Perspectives in ethology, Vol. 13, Boston, MA: Springer, pp. 231–281. [Google Scholar]
- Brown S (2000b). The “musilanguage” model of music. In The origins of music, Cambridge, Mass.: MIT Press. [Google Scholar]
- Bryant GA (2013). Animal signals and emotion in music: Coordinating affect across groups. Frontiers in Psychology, 4, 990. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Buss DM (1989). Sex differences in human mate preferences: Evolutionary hypotheses tested in 37 cultures. Behavioral and Brain Sciences, 12(01), 1–14. [Google Scholar]
- Cassidy SB, & Driscoll DJ (2008). Prader-Willi syndrome. European Journal of Human Genetics, 17(1), 3–13. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Castellano MA, Bharucha JJ, & Krumhansl CL (1984). Tonal hierarchies in the music of north India. Journal of Experimental Psychology: General, 113(3), 394–412. [DOI] [PubMed] [Google Scholar]
- Catchpole C, & Slater PJB (2018). Bird song: Biological themes and variations, New York: Cambridge University Press. [Google Scholar]
- Cattell JM (1891). On the origin of music. Mind, 16(63), 375–388. [Google Scholar]
- Chapais B (2013). Monogamy, strongly bonded groups, and the evolution of human social structure. Evolutionary Anthropology: Issues, News, and Reviews, 22(2), 52–65. [DOI] [PubMed] [Google Scholar]
- Charlton BD (2014). Menstrual cycle phase alters women’s sexual preferences for composers of more complex music. Proceedings of the Royal Society B: Biological Sciences, 281(1784), 20140403. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Charlton BD, Filippi P, & Fitch WT (2012). Do women prefer more complex music around ovulation? PLoS ONE, 7(4), e35626. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chen Z, & Wiens JJ (2020). The origins of acoustic communication in vertebrates. Nature Communications, 11(1, 1), 369. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chomsky NA (1968). Language and mind, New York: Harcourt, Brace and World. [Google Scholar]
- Christiansen MH, & Chater N (2008). Language as shaped by the brain. Behavioral and Brain Sciences, 31(5), 489–509. [DOI] [PubMed] [Google Scholar]
- Chwe MS-Y (2001). Rational ritual: Culture, coordination, and common knowledge, Princeton, N.J.: Princeton University Press. [Google Scholar]
- Cirelli LK, Einarson KM, & Trainor LJ (2014). Interpersonal synchrony increases prosocial behavior in infants. Developmental Science, 17(6), 1003–1011. [DOI] [PubMed] [Google Scholar]
- Coen P, Xie M, Clemens J, & Murthy M (2016). Sensorimotor transformations underlying variability in song intensity during Drosophila courtship. Neuron, 89(3), 629–644. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Conard NJ, Malina M, & Münzel SC (2009). New flutes document the earliest musical tradition in southwestern Germany. Nature, 460(7256), 737–740. [DOI] [PubMed] [Google Scholar]
- Conroy-Beam D, Buss DM, Pham MN, & Shackelford TK (2015). How sexually dimorphic are human mate preferences? Personality and Social Psychology Bulletin, 41(8), 1082–1093. [DOI] [PubMed] [Google Scholar]
- Cross I (2012). Cognitive science and the cultural nature of music. Topics in Cognitive Science, 4(4), 668–677. [DOI] [PubMed] [Google Scholar]
- Cross I, & Morley I (2009). The evolution of music: Theories, definitions, and the nature of the evidence. In Malloch S & Trevarthen C, eds., Communicative musicality: Exploring the basis of human companionship, Oxford; New York: Oxford University Press, pp. 61–81. [Google Scholar]
- Cross I, & Woodruff GE (2009). Music as a communicative medium. In Botha R & Knight C, eds., The prehistory of language, Vol. 1, Oxford; New York: Oxford University Press, pp. 113–144. [Google Scholar]
- Cummings ME, & Endler JA (2018). 25 Years of sensory drive: The evidence and its watery bias. Current Zoology, 64(4), 471–484. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dale J, Dey CJ, Delhey K, Kempenaers B, & Valcu M (2015). The effects of life history and sexual selection on male and female plumage colouration. Nature, 527(7578), 367–370. [DOI] [PubMed] [Google Scholar]
- Darwin C (1859). On the origin of species by means of natural selection, London: J. Murray. [Google Scholar]
- Darwin C (1871). The descent of man, London: Watts & Co. [Google Scholar]
- Delton AW, Krasnow MM, Cosmides L, & Tooby J (2011). Evolution of direct reciprocity under uncertainty can explain human generosity in one-shot encounters. Proceedings of the National Academy of Sciences, 108(32), 13335–13340. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Delton AW, & Robertson TE (2012). The social cognition of social foraging: Partner selection by underlying valuation. Evolution and Human Behavior, 33(6), 715–725. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dickins TE, & Barton RA (2013). Reciprocal causation and the proximate–ultimate distinction. Biology & Philosophy, 28(5), 747–756. [Google Scholar]
- Dissanayake E (2000). Antecedents of the temporal arts in early mother-infant interaction. In Wallin NL, Merker B, & Brown S, eds., The origins of music, Cambridge, Mass.: MIT Press, pp. 389–410. [Google Scholar]
- Dissanayake E (2008). If music is the food of love, what about survival and reproductive success? Musicae Scientiae, 12(1 suppl), 169–195. [Google Scholar]
- Dissanayake E (2009). Root, leaf, blossom, or bole: Concerning the origin and adaptive function of music. In Malloch S & Trevarthen C, eds., Communicative musicality: Exploring the basis of human companionship, Oxford; New York: Oxford University Press, pp. 17–30. [Google Scholar]
- Doorn G. S. van, & Weissing FJ (2006). Sexual conflict and the evolution of female preferences for indicators of male quality. The American Naturalist, 168(6), 742–757. [DOI] [PubMed] [Google Scholar]
- Dunbar RIM (1991). Functional significance of social grooming in primates. Folia Primatologica, 57(3), 121–131. [Google Scholar]
- Dunbar RIM (1998). Grooming, gossip, and the evolution of language, Cambridge, Mass.: Harvard University Press. [Google Scholar]
- Dunbar RIM (2012a). Bridging the bonding gap: The transition from primates to humans. Philosophical Transactions of the Royal Society B: Biological Sciences, 367(1597), 1837–1846. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dunbar RIM (2012b). On the evolutionary function of song and dance. In Bannan N, ed., Music, language, and human evolution, Oxford: Oxford University Press, pp. 201–214. [Google Scholar]
- Dunbar RIM, & Dunbar RIM (2004). Language, music, and laughter in evolutionary perspective. In Oller DK & Griebel U, eds., Evolution of communication systems: A comparative approach, Cambridge, MA: MIT Press, pp. 257–274. [Google Scholar]
- Dunbar RIM, Kaskatis K, MacDonald I, & Barra V (2012). Performance of music elevates pain threshold and positive affect: Implications for the evolutionary function of music. Evolutionary Psychology, 10(4), 688–702. [PubMed] [Google Scholar]
- Dunbar RIM, & Lehmann J (2013). Grooming and social cohesion in primates: A comment on Grueter et al et al. Evolution and Human Behavior, 34(6), 453–455. [Google Scholar]
- Dunham B (2011). The role for signaling theory and receiver psychology in marketing. In Saad G, ed., Evolutionary Psychology in the Business Sciences, Berlin, Heidelberg: Springer Berlin Heidelberg, pp. 225–256. [Google Scholar]
- Dunn M, Greenhill SJ, Levinson SC, & Gray RD (2011). Evolved structure of language shows lineage-specific trends in word-order universals. Nature, 473(7345), 79–82. [DOI] [PubMed] [Google Scholar]
- Duputié A, & Massol F (2013). An empiricist’s guide to theoretical predictions on the evolution of dispersal. Interface Focus, 3(6), 20130028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dutton D (2009). The art instinct: Beauty, pleasure, & human evolution, New York: Bloomsbury Press. [Google Scholar]
- Feld S (1984). Sound structure as social structure. Ethnomusicology, 28(3), 383–409. [Google Scholar]
- Fessler DMT, & Holbrook C (2016). Synchronized behavior increases assessments of the formidability and cohesion of coalitions. Evolution and Human Behavior, 37(6), 502–509. [Google Scholar]
- Fischer J, Cheney DL, & Seyfarth RM (2000). Development of infant baboons’ responses to graded bark variants. Proceedings of the Royal Society B: Biological Sciences, 267(1459), 2317–2321. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fitch WT (2006). Production of vocalizations in mammals. Visual Communication, 3, 145. [Google Scholar]
- Fitch WT (2017). Preface to the special issue on the biology and evolution of language. Psychonomic Bulletin & Review, 24(1), 1–2. [DOI] [PubMed] [Google Scholar]
- Freeman WJ (2000). A neurobiological role of music in social bonding. In Wallin N, Merkur B, & Brown S, eds., The origins of music, Cambridge, Mass.: MIT Press, pp. 411–424. [Google Scholar]
- Freitas JD, Thomas K, DeScioli P, & Pinker S (2019). Common knowledge, coordination, and strategic mentalizing in human social life. Proceedings of the National Academy of Sciences, 116(28), 13751–13758. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fritz TH, Hardikar S, Demoucron M, … Leman M (2013). Musical agency reduces perceived exertion during strenuous physical performance. Proceedings of the National Academy of Sciences, 110(44), 17784–17789. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fürniss S (2017). Diversity in pygmy music: A family portrait. In Hewlett BS, ed., Hunter-Gatherers of the Congo Basin: Cultures, Histories, and Biology of African Pygmies, pp. 217–248. [Google Scholar]
- Gangestad SW, Haselton MG, Welling LLM, … Puts DA (2016). How valid are assessments of conception probability in ovulatory cycle research? Evaluations, recommendations, and theoretical implications. Evolution and Human Behavior, 37(2), 85–96. [Google Scholar]
- Geissmann T (2000). Gibbon songs and human music from an evolutionary perspective. In Wallin NL, Merker B, & Brown S, eds., The origins of music, Cambridge, Mass.: MIT Press, pp. 103–123. [Google Scholar]
- Geist V (1978). Life strategies, human evolution, environmental design: Toward a biological theory of health, New York: Springer-Verlag. [Google Scholar]
- Gelfand MJ, Caluori N, Jackson JC, & Taylor MK (2020). The cultural evolutionary trade-off of ritualistic synchrony. Philosophical Transactions of the Royal Society B: Biological Sciences, 375(1805), 20190432. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goodall J (1986). The chimpanzees of Gombe: Patterns of behavior, Cambridge, Mass.: Belknap Press of Harvard University Press. [Google Scholar]
- Granier-Deferre C, Bassereau S, Ribeiro A, Jacquet A-Y, & DeCasper AJ (2011). A melodic contour repeatedly experienced by human near-term fetuses elicits a profound cardiac reaction one month after birth. PLoS ONE, 6(2), e17304. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Greenberg DM, Matz SC, Schwartz HA, & Fricke KR (2020). The self-congruity effect of music. Journal of Personality and Social Psychology. doi: 10.1037/pspp0000293 [DOI] [PubMed] [Google Scholar]
- Grueter CC, Bissonnette A, Isler K, & van Schaik CP (2013). Grooming and group cohesion in primates: Implications for the evolution of language. Evolution and Human Behavior, 34(1), 61–68. [Google Scholar]
- Gustison ML, & Townsend SW (2015). A survey of the context and structure of high- and low-amplitude calls in mammals. Animal Behaviour, 105, 281–288. [Google Scholar]
- Hagen EH, & Bryant GA (2003). Music and dance as a coalition signaling system. Human Nature, 14(1), 21–51. [DOI] [PubMed] [Google Scholar]
- Hagen EH, & Hammerstein P (2009). Did Neanderthals and other early humans sing? Seeking the biological roots of music in the territorial advertisements of primates, lions, hyenas, and wolves. Musicae Scientiae, 13(2 suppl), 291–320. [Google Scholar]
- Haig D (1993). Genetic conflicts in human pregnancy. The Quarterly Review of Biology, 68(4), 495–532. [DOI] [PubMed] [Google Scholar]
- Haig D, & Wharton R (2003). Prader-Willi syndrome and the evolution of human childhood. American Journal of Human Biology, 15(3), 320–329. [DOI] [PubMed] [Google Scholar]
- Haig D, & Wilkins JF (2000). Genomic imprinting, sibling solidarity and the logic of collective action. Philosophical Transactions of the Royal Society B: Biological Sciences, 355(1403), 1593–1597. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hall ML, & Magrath RD (2007). Temporal coordination signals coalition quality. Current Biology, 17(11), R406–R407. [DOI] [PubMed] [Google Scholar]
- Hamilton MJ, Milne BT, Walker RS, Burger O, & Brown JH (2007). The complex structure of hunter–gatherer social networks. Proceedings of the Royal Society B: Biological Sciences, 274(1622), 2195–2203. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hamilton WD (1964). The genetical evolution of social behaviour. I & II. Journal of Theoretical Biology, 7(1), 1–16. [DOI] [PubMed] [Google Scholar]
- Hammerstein P, & Noë R (2016). Biological trade and markets. Philosophical Transactions of the Royal Society B: Biological Sciences, 371(1687), 20150101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Han YJ, Nunes JC, & Drèze X (2010). Signaling status with luxury goods: The role of brand prominence. Journal of Marketing, 74(4), 15–30. [Google Scholar]
- Hannon EE, & Trainor LJ (2007). Music acquisition: Effects of enculturation and formal training on development. Trends in Cognitive Sciences, 11(11), 466–72. [DOI] [PubMed] [Google Scholar]
- Hargreaves DJ, Comber C, & Colley A (1995). Effects of age, gender, and training on musical preferences of British secondary school students. Journal of Research in Music Education, 43(3), 242–250. [Google Scholar]
- Harrington FH (1989). Chorus howling by wolves: Acoustic structure, pack size and the Beau Geste effect. Bioacoustics, 2(2), 117–136. [Google Scholar]
- Harrington FH, & Mech LD (1979). Wolf howling and its role in territory maintenance. Behaviour, 68(3–4), 207–249. [Google Scholar]
- Hayden B (2014). The power of feasts, Cambridge University Press. [Google Scholar]
- Hayden B, & Villeneuve S (2011). A century of feasting studies. Annual Review of Anthropology, 40(1), 433–449. [Google Scholar]
- Henrich J, & Boyd R (2002). On modeling cognition and culture: Why cultural evolution does not require replication of representations. Journal of Cognition and Culture, 2(2), 87–112. [Google Scholar]
- Herdt G, & McClintock M (2000). The magical age of 10. Archives of Sexual Behavior, 29(6), 587–606. [DOI] [PubMed] [Google Scholar]
- Hill GE (1994). Trait elaboration via adaptive mate choice: Sexual conflict in the evolution of signals of male quality. Ethology Ecology & Evolution, 6(3), 351–370. [Google Scholar]
- Holm VA, Cassidy SB, Butler MG, … Greenberg F (1993). Prader-Willi syndrome: Consensus diagnostic criteria. Pediatrics, 91(2), 398–402. [PMC free article] [PubMed] [Google Scholar]
- Honing H, Bouwer FL, Prado L, & Merchant H (2018). Rhesus monkeys (Macaca mulatta) sense isochrony in rhythm, but not the beat: Additional support for the gradual audiomotor evolution hypothesis. Frontiers in Neuroscience, 12. doi: 10.3389/fnins.2018.00475 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Honing H, & Ploeger A (2012). Cognition and the evolution of music: Pitfalls and prospects. Topics in Cognitive Science, 4(4), 513–524. [DOI] [PubMed] [Google Scholar]
- Hrdy SB (2009). Mothers and others: The evolutionary origins of mutual understanding, Cambridge, Mass.: Harvard University Press. [Google Scholar]
- Huron D (2001). Is music an evolutionary adaptation? Annals of the New York Academy of Sciences, 930(1), 43–61. [DOI] [PubMed] [Google Scholar]
- Jackendoff R (2009). Parallels and nonparallels between language and music. Music Perception: An Interdisciplinary Journal, 26(3), 195–204. [Google Scholar]
- Jackendoff R, & Lerdahl F (2006). The capacity for music: What is it, and what’s special about it? Cognition, 100(1), 33–72. [DOI] [PubMed] [Google Scholar]
- Jacoby N, & McDermott JH (2017). Integer ratio priors on musical rhythm revealed cross-culturally by iterated reproduction. Current Biology, 27(3), 359–370. [DOI] [PubMed] [Google Scholar]
- Jacoby N, Undurraga EA, McPherson MJ, Valdés J, Ossandón T, & McDermott JH (2019). Universal and non-universal features of musical pitch perception revealed by singing. Current Biology, 29(19), 3229–3243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jaeggi AV, Kramer KL, Hames R, … Gurven M (2017). Human grooming in comparative perspective: People in six small-scale societies groom less but socialize just as much as expected for a typical primate. American Journal of Physical Anthropology, 162(4), 810–816. [DOI] [PMC free article] [PubMed] [Google Scholar]
- James W (1890). The principles of psychology, New York: H. Holt. [Google Scholar]
- Jones BC, Hahn AC, & DeBruine LM (2018). Ovulation, sex hormones, and women’s mating psychology. Trends in Cognitive Sciences, 23(1), 51–62. [DOI] [PubMed] [Google Scholar]
- Jourdain R (1997). Music, the brain, and ecstasy: How music captures our imagination, New York: W. Morrow. [Google Scholar]
- Justus T, & Hutsler JJ (2005). Fundamental issues in the evolutionary psychology of music: Assessing innateness and domain specificity. Music Perception: An Interdisciplinary Journal, 23(1), 1–27. [Google Scholar]
- Kaplan HS, & Gangestad SW (2005). Life history theory and evolutionary psychology. In Buss DM, ed., The handbook of evolutionary psychology, Hoboken, NJ, US: John Wiley & Sons Inc, pp. 68–95. [Google Scholar]
- Keller PE, König R, & Novembre G (2017). Simultaneous cooperation and competition in the evolution of musical behavior: Sex-related modulations of the singer’s formant in human chorusing. Frontiers in Psychology, 8, 1559. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kenrick DT, & Keefe RC (1992). Age preferences in mates reflect sex differences in human reproductive strategies. Behavioral and Brain Sciences, 15(1), 75–91. [Google Scholar]
- Kinzler KD, Dupoux E, & Spelke ES (2007). The native language of social cognition. Proceedings of the National Academy of Sciences, 104(30), 12577–12580. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kirschner S, & Tomasello M (2009). Joint drumming: Social context facilitates synchronization in preschool children. Journal of Experimental Child Psychology, 102(3), 299–314. [DOI] [PubMed] [Google Scholar]
- Kirschner S, & Tomasello M (2010). Joint music making promotes prosocial behavior in 4-year-old children. Evolution and Human Behavior, 31(5), 354–364. [Google Scholar]
- Koelsch S, & Siebel WA (2005). Towards a neural basis of music perception. Trends in Cognitive Sciences, 9(12), 578–584. [DOI] [PubMed] [Google Scholar]
- Kogan N (1994). On aesthetics and its origins: Some psychobiological and evolutionary considerations. Social Research, 61(1), 139–165. [Google Scholar]
- Kokko H (1997). Evolutionarily stable strategies of age-dependent sexual advertisement. Behavioral Ecology and Sociobiology, 41(2), 99–107. [Google Scholar]
- Kokko H, Jennions MD, & Brooks R (2006). Unifying and testing models of sexual selection. Annual Review of Ecology, Evolution, and Systematics, 37, 43–66. [Google Scholar]
- Kondo N, & Watanabe S (2009). Contact calls: Information and social function. Japanese Psychological Research, 51(3), 197–208. [Google Scholar]
- Kotler J, Mehr SA, Egner A, Haig D, & Krasnow MM (2019). Response to vocal music in Angelman syndrome contrasts with Prader-Willi syndrome. Evolution and Human Behavior, 40(5), 420–426. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kotz SA, Ravignani A, & Fitch WT (2018). The Evolution of Rhythm Processing. Trends in Cognitive Sciences, 22(10), 896–910. [DOI] [PubMed] [Google Scholar]
- Krause BL (2012). The great animal orchestra: Finding the origins of music in the world’s wild places, 1st ed., New York: Little, Brown. [Google Scholar]
- Krebs J, & Dawkins R (1984). Animal signals: Mind-reading and manipulation. In Krebs J& Davies N, eds., Behavioural Ecology: An Evolutionary Approach, Blackwell, pp. 380–402. [Google Scholar]
- Krebs JR (1977). The significance of song repertoires: The Beau Geste hypothesis. Animal Behaviour, 25, 475–478. [Google Scholar]
- Krumhansl CL (2001). Cognitive foundations of musical pitch, Oxford University Press. [Google Scholar]
- Krumhansl CL, Toivanen P, Eerola T, Toiviainen P, Järvinen T, & Louhivuori J (2000). Cross-cultural music cognition: Cognitive methodology applied to North Sami yoiks. Cognition, 76(1), 13–58. [DOI] [PubMed] [Google Scholar]
- Kuhn SL (2014). Signaling theory and technologies of communication in the Paleolithic. Biological Theory, 9(1), 42–50. [Google Scholar]
- Kuhn SL, & Stiner MC (2019). Hearth and home in the Middle Pleistocene. Journal of Anthropological Research, 75(3), 305–327. [Google Scholar]
- Ladich F, & Winkler H (2017). Acoustic communication in terrestrial and aquatic vertebrates. Journal of Experimental Biology, 220(13), 2306–2317. [DOI] [PubMed] [Google Scholar]
- Lafrance M, Worcester L, & Burns L (2011). Gender and the Billboard Top 40 charts between 1997 and 2007. Popular Music and Society, 34(5), 557–570. [Google Scholar]
- Laland KN, Sterelny K, Odling-Smee J, Hoppitt W, & Uller T (2011). Cause and effect in biology revisited: Is Mayr’s proximate-ultimate dichotomy still useful? Science, 334(6062), 1512–1516. [DOI] [PubMed] [Google Scholar]
- Larsson M (2014). Self-generated sounds of locomotion and ventilation and the evolution of human rhythmic abilities. Animal Cognition, 17(1), 1–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Laughlin SB (2001). Energy as a constraint on the coding and processing of sensory information. Current Opinion in Neurobiology, 11(4), 475–480. [DOI] [PubMed] [Google Scholar]
- Launay J, Tarr B, & Dunbar RIM (2016). Synchrony as an adaptive mechanism for large-scale human social bonding. Ethology, 122(10), 779–789. [Google Scholar]
- Leighton GM (2017). Cooperative breeding influences the number and type of vocalizations in avian lineages. Proceedings of the Royal Society B: Biological Sciences, 284(1868), 20171508. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lerdahl F, & Jackendoff R (1983). A generative theory of tonal music, Cambridge, MA: MIT Press. [Google Scholar]
- Lewis J (2013). A cross-cultural perspective on the significance of music and dance to culture and society. In Arbib MA, ed., Language, Music, and the Brain, The MIT Press, pp. 45–66. [Google Scholar]
- Lewis J (2017). Egalitarian social organization: The case of the Mbendjele BaYaka. In Hewlett BS, ed., Hunter-Gatherers of the Congo Basin, Routledge, pp. 249–274. [Google Scholar]
- Liberman Z, Woodward AL, Sullivan KR, & Kinzler KD (2016). Early emerging system for reasoning about the social nature of food. Proceedings of the National Academy of Sciences, 113(34), 9480–9485. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lindsay WR, Andersson S, Bererhi B, … Edwards SV (2019). Endless forms of sexual selection. PeerJ, 7, e7988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Livingstone FB (1973). Did the australopithecines sing? Current Anthropology, 14(1/2), 25–29. [Google Scholar]
- Livingstone RS, & Thompson WF (2009). The emergence of music from the theory of mind. Musicae Scientiae, 13(2 suppl), 83–115. [Google Scholar]
- Loersch C, & Arbuckle NL (2013). Unraveling the mystery of music: Music as an evolved group process. Journal of Personality and Social Psychology, 105(5), 777–798. [DOI] [PubMed] [Google Scholar]
- Lorenz KZ (1966). Evolution of ritualization in the biological and cultural spheres. Philosophical Transactions of the Royal Society B: Biological Sciences, 251(772), 273–284. [Google Scholar]
- Madison G, Holmquist J, & Vestin M (2018). Musical improvisation skill in a prospective partner is associated with mate value and preferences, consistent with sexual selection and parental investment theory: Implications for the origin of music. Evolution and Human Behavior, 39(1), 120–129. [Google Scholar]
- Maher CR, & Lott DF (2000). A review of ecological determinants of territoriality within vertebrate species. The American Midland Naturalist, 143(1), 1–29. [Google Scholar]
- Maher LA, & Conkey M (2019). Homes for hunters? Exploring the concept of home at hunter-gatherer sites in Upper Paleolithic Europe and Epipaleolithic Southwest Asia. Current Anthropology, 60(1), 91–137. [Google Scholar]
- Marcus GF (2012). Musicality: Instinct or acquired skill? Topics in Cognitive Science, 4(4), 498–512. [DOI] [PubMed] [Google Scholar]
- Markham AC, Gesquiere LR, Alberts SC, & Altmann J (2015). Optimal group size in a highly social mammal. Proceedings of the National Academy of Sciences, 112(48), 14882–14887. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marlowe FW (2005). Hunter-gatherers and human evolution. Evolutionary Anthropology: Issues, News, and Reviews, 14(2), 54–67. [Google Scholar]
- Martínez I, Rosa M, Arsuaga J-L, … Carbonell E (2004). Auditory capacities in Middle Pleistocene humans from the Sierra de Atapuerca in Spain. Proceedings of the National Academy of Sciences of the United States of America, 101(27), 9976–9981. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mas-Herrero E, Zatorre RJ, Rodriguez-Fornells A, & Marco-Pallarés J (2014). Dissociation between musical and monetary reward responses in specific musical anhedonia. Current Biology, 24(6), 699–704. [DOI] [PubMed] [Google Scholar]
- Maynard Smith J, & Harper D (2003). Animal signals, Oxford: Oxford University Press. [Google Scholar]
- Mayr E (1961). Cause and effect in biology: Kinds of causes, predictability, and teleology are viewed by a practicing biologist. Science, 134(3489), 1501–1506. [DOI] [PubMed] [Google Scholar]
- McComb K, Packer C, & Pusey A (1994). Roaring and numerical assessment in contests between groups of female lions, Panthera leo. Animal Behaviour, 47(2), 379–387. [Google Scholar]
- McCoy DE, & Haig D (2020). Embryo selection and mate choice: Can “honest signals” be trusted? Trends in Ecology & Evolution, 35(4), 308–318. [DOI] [PubMed] [Google Scholar]
- McDermott JH (2012). Auditory preferences and aesthetics: Music, voices and everyday sounds. In Neuroscience of Preference and Choice, Elsevier, pp. 227–256. [Google Scholar]
- McDermott J, & Hauser M (2004). Are consonant intervals music to their ears? Spontaneous acoustic preferences in a nonhuman primate. Cognition, 94(2), B11–B21. [DOI] [PubMed] [Google Scholar]
- McDermott J, & Hauser M (2005). The origins of music: Innateness, uniqueness, and evolution. Music Perception, 23(1), 29–59. [Google Scholar]
- McDermott J, & Hauser M (2007). Nonhuman primates prefer slow tempos but dislike music overall☆. Cognition, 104(3), 654–668. [DOI] [PubMed] [Google Scholar]
- McKenna JJ (1978). Biosocial functions of grooming behavior among the common Indian langur monkey (Presbytis entellus). American Journal of Physical Anthropology, 48(4), 503–509. [DOI] [PubMed] [Google Scholar]
- McNeill WH (1995). Keeping together in time: Dance and drill in human history, Cambridge, Mass.: Harvard University Press. [Google Scholar]
- Mehr SA, Kotler J, Howard RM, Haig D, & Krasnow MM (2017). Genomic imprinting is implicated in the psychology of music. Psychological Science, 28(10), 1455–1467. [DOI] [PubMed] [Google Scholar]
- Mehr SA, & Krasnow MM (2017). Parent-offspring conflict and the evolution of infant-directed song. Evolution and Human Behavior, 38(5), 674–684. [Google Scholar]
- Mehr SA, Singh M, Knox D, … Glowacki L (2019). Universality and diversity in human song. Science, 366(6468), 957–970. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mehr SA, Singh M, York H, Glowacki L, & Krasnow MM (2018). Form and function in human song. Current Biology, 28(3), 356–368. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mehr SA, Song LA, & Spelke ES (2016). For 5-month-old infants, melodies are social. Psychological Science, 27(4), 486–501. [DOI] [PubMed] [Google Scholar]
- Mehr SA, & Spelke ES (2017). Shared musical knowledge in 11‐month‐old infants. Developmental Science, 21(2). [DOI] [PubMed] [Google Scholar]
- Merker B (2000a). Synchronous chorusing and human origins. In Wallin NL, Merker B, & Brown S, eds., The origins of music, Cambridge, Mass.: MIT Press, pp. 315–327. [Google Scholar]
- Merker B (2000b). Synchronous chorusing and the origins of music. Musicae Scientiae, 3(1 suppl), 59–73. [Google Scholar]
- Merker B, Madison GS, & Eckerdal P (2009). On the role and origin of isochrony in human rhythmic entrainment. Cortex, 45(1), 4–17. [DOI] [PubMed] [Google Scholar]
- Miller G (2000a). Evolution of human music through sexual selection. In Wallin NL, Merker B, & Brown S, eds., The origins of music, Cambridge, Mass.: MIT Press, pp. 329–360. [Google Scholar]
- Miller GF (2000b). The mating mind: How sexual choice shaped the evolution of human nature, New York: Doubleday. [Google Scholar]
- Miranda ER, Kirby S, & Todd P (2003). On computational models of the evolution of music: From the origins of musical taste to the emergence of grammars. Contemporary Music Review, 22(3), 91–111. [Google Scholar]
- Mithen S (2005). The singing neanderthals: The origins of music, language, mind and body, London: Weidenfeld Nicolson. [Google Scholar]
- Monboddo JBL (1774). Of the origin and progress of language, 2nd ed., Vol. 1, Edinburgh: J. Balfour. [Google Scholar]
- Morley I (2012). Hominin physiological evolution and the emergence of musical capacities. In Bannan N, ed., Music, language, and human evolution, Oxford: Oxford University Press, pp. 109–141. [Google Scholar]
- Moser CJ, Lee-Rubin H, Bainbridge CM, … Mehr SA (2020). Acoustic regularities in infant-directed vocalizations across cultures. bioRxiv. doi: 10.1101/2020.04.09.032995v1 [DOI] [Google Scholar]
- Mosing MA, Verweij KJH, Madison G, Pedersen NL, Zietsch BP, & Ullén F (2015). Did sexual selection shape human music? Testing predictions from the sexual selection hypothesis of music evolution using a large genetically informative sample of over 10,000 twins. Evolution and Human Behavior, 36(5), 359–366. [Google Scholar]
- Musolf K, Hoffmann F, & Penn DJ (2010). Ultrasonic courtship vocalizations in wild house mice, Mus musculus musculus. Animal Behaviour, 79(3), 757–764. [Google Scholar]
- Müllensiefen D, Gingras B, Musil J, & Stewart L (2014). The musicality of non-musicians: An index for assessing musical sophistication in the general population. PLOS ONE, 9(2), e89642. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nettl B (2015). The study of ethnomusicology: Thirty-three discussions, Urbana, IL: University of Illinois Press. [Google Scholar]
- Newman JD (2007). Neural circuits underlying crying and cry responding in mammals. Behavioural Brain Research, 182(2), 155–165. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Niven JE (2016). Neuronal energy consumption: Biophysics, efficiency and evolution. Current Opinion in Neurobiology, 41, 129–135. [DOI] [PubMed] [Google Scholar]
- Niven JE, & Laughlin SB (2008). Energy limitation as a selective pressure on the evolution of sensory systems. Journal of Experimental Biology, 211(11), 1792–1804. [DOI] [PubMed] [Google Scholar]
- Norman-Haignere S, Kanwisher NG, & McDermott JH (2015). Distinct cortical pathways for music and speech revealed by hypothesis-free voxel decomposition. Neuron, 88(6), 1281–1296. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Norman-Haignere SV, Feather J, Brunner P, … Kanwisher N (2019). Intracranial recordings from human auditory cortex reveal a neural population selective for musical song. bioRxiv. Retrieved from https://www.biorxiv.org/content/10.1101/696161v1 [Google Scholar]
- Nowak MA (2006). Five rules for the evolution of cooperation. Science, 314(5805), 1560–1563. [DOI] [PMC free article] [PubMed] [Google Scholar]
- O’Brien DM, Allen CE, Van Kleeck MJ, … Emlen DJ (2018). On the evolution of extreme structures: Static scaling and the function of sexually selected signals. Animal Behaviour, 144, 95–108. [Google Scholar]
- Orians GH (2014). Snakes, sunrises, and Shakespeare: How evolution shapes our loves and fears, Chicago: The University of Chicago Press. [Google Scholar]
- Paley W (1803). Natural theology: Or, evidences of the existence and attributes of the deity, collected from the appearances of nature, London: R. Faulder. [Google Scholar]
- Panksepp J (2009). The emotional antecedents to the evolution of music and language. Musicae Scientiae, 13(2 suppl), 229–259. [Google Scholar]
- Patel AD (2008). Science & music: Talk of the tone. Nature, 453(7196), 726–727. [DOI] [PubMed] [Google Scholar]
- Patel AD (2017). Why doesn’t a songbird (the european starling) use pitch to recognize tone sequences? The informational independence hypothesis. Comparative Cognition & Behavior Reviews, 12. Retrieved from http://comparative-cognition-and-behavior-reviews.org/2017/vol12_patel/ [Google Scholar]
- Pearce E, Launay J, & Dunbar RIM (2015). The ice-breaker effect: Singing mediates fast social bonding. Royal Society Open Science, 2(10), 150221. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pearce E, Launay J, MacCarron P, & Dunbar RIM (2017). Tuning in to others: Exploring relational and collective bonding in singing and non-singing groups over time. Psychology of Music, 45(4), 496–512. [Google Scholar]
- Pearce E, Launay J, van Duijn M, Rotkirch A, David-Barrett T, & Dunbar RIM (2016). Singing together or apart: The effect of competitive and cooperative singing on social bonding within and between sub-groups of a university fraternity. Psychology of Music, 44(6), 1255–1273. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peretz I (2006). The nature of music from a biological perspective. Cognition, 100(1), 1–32. [DOI] [PubMed] [Google Scholar]
- Peretz I, Ayotte J, Zatorre RJ, … Jutras B (2002). Congenital amusia: A disorder of fine-grained pitch discrimination. Neuron, 33(2), 185–191. [DOI] [PubMed] [Google Scholar]
- Peretz I, & Vuvan DT (2017). Prevalence of congenital amusia. European Journal of Human Genetics, 25(5), 625–630. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Persons WS, & Currie PJ (2019). Feather evolution exemplifies sexually selected bridges across the adaptive landscape. Evolution, 73(9), 1686–1694. [DOI] [PubMed] [Google Scholar]
- Peters J (2014). The role of genomic imprinting in biology and disease: An expanding view. Nature Reviews Genetics, 15(8), 517–530. [DOI] [PubMed] [Google Scholar]
- Phillips-Silver J, Aktipis CA, & Bryant GA (2010). The Ecology of Entrainment: Foundations of Coordinated Rhythmic Movement. Music Perception, 28(1), 3–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pinker S (1997). How the mind works, New York: Norton. [DOI] [PubMed] [Google Scholar]
- Pinker S (2012). The false allure of group selection. Retrieved from http://edge.org/conversation/the-false-allure-of-group-selection
- Pinker S, & Bloom P (1990). Natural language and natural selection. Behavioral and Brain Sciences, 13(4), 707–727. [Google Scholar]
- Pisor AC, & Surbeck M (2019). The evolution of intergroup tolerance in nonhuman primates and humans. Evolutionary Anthropology, 28, 210–223. [DOI] [PubMed] [Google Scholar]
- Podlipniak P (2017). The Role of the Baldwin Effect in the Evolution of Human Musicality. Frontiers in Neuroscience, 11. doi: 10.3389/fnins.2017.00542 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Puts DA (2010). Beauty and the beast: Mechanisms of sexual selection in humans. Evolution and Human Behavior, 31(3), 157–175. [Google Scholar]
- Puts DA, Apicella CL, & Cárdenas RA (2011). Masculine voices signal men’s threat potential in forager and industrial societies. Proceedings of the Royal Society B: Biological Sciences, 279(1728), 601–609. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Quam RM, Ruiter D. J. de, Masali M, Arsuaga J-L, Martínez I, & Moggi-Cecchi J (2013). Early hominin auditory ossicles from South Africa. Proceedings of the National Academy of Sciences, 110(22), 8847–8851. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rand DG, Peysakhovich A, Kraft-Todd GT, … Greene JD (2014). Social heuristics shape intuitive cooperation. Nature Communications, 5(1, 1), 3677. [DOI] [PubMed] [Google Scholar]
- Ravignani A (2018). Darwin, sexual selection, and the origins of music. Trends in Ecology & Evolution, 33(10), 716–719. [DOI] [PubMed] [Google Scholar]
- Rebout N, De Marco A, Lone J-C, … Thierry B (2020). Tolerant and intolerant macaques show different levels of structural complexity in their vocal communication. Proceedings of the Royal Society B: Biological Sciences, 287(1928), 20200439. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reddish P, Fischer R, & Bulbulia J (2013). Let’s dance together: Synchrony, shared intentionality and cooperation. PLoS ONE, 8(8), e71182. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rendall D, Cheney DL, & Seyfarth RM (2000). Proximate factors mediating ‘contact’ calls in adult female baboons (Papio cynocephalus ursinus) and their infants. Journal of Comparative Psychology, 114(1), 36–46. [DOI] [PubMed] [Google Scholar]
- Richman B (1993). On the evolution of speech: Singing as the middle term. Current Anthropology, 34(5), 721–722. [Google Scholar]
- Rodseth L, Wrangham RW, Harrigan AM, … Wolpoff MH (1991). The human community as a primate society. Current Anthropology, 32(3), 221–254. [Google Scholar]
- Roederer JG (1984). The search for a survival value of music. Music Perception, 1(3), 350–356. [Google Scholar]
- Roper SD, & Chaudhari N (2017). Taste buds: Cells, signals and synapses. Nature Reviews. Neuroscience, 18(8), 485–497. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rousseau J-J (1781). Essai sur l’origine des langues. (Moran JH, Trans.), New York: F. Ungar. [Google Scholar]
- Savage PE, Brown S, Sakai E, & Currie TE (2015). Statistical universals reveal the structures and functions of human music. Proceedings of the National Academy of Sciences, 112(29), 8987–8992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Savage P, Loui P, Tarr B, … Fitch W (2020). Music as a coevolved system for social bonding. Behavioral and Brain Sciences. [DOI] [PubMed] [Google Scholar]
- Schachner A, Brady TF, Pepperberg IM, & Hauser MD (2009). Spontaneous motor entrainment to music in multiple vocal mimicking species. Current Biology, 19(10), 831–836. [DOI] [PubMed] [Google Scholar]
- Schellenberg EG, & Habashi P (2015). Remembering the melody and timbre, forgetting the key and tempo. Memory & Cognition, 1–11. [DOI] [PubMed] [Google Scholar]
- Schoenemann PT (2006). Evolution of the size and functional areas of the human brain. Annual Review of Anthropology, 35(1), 379–406. [Google Scholar]
- Schruth D, Templeton CN, & Holman DJ (2019). A definition of song from human music universals observed in primate calls. bioRxiv. Retrieved from https://www.biorxiv.org/content/10.1101/649459v1 [Google Scholar]
- Schulkin J (2013). Reflections on the musical mind: An evolutionary perspective, Princeton: Princeton University Press. [Google Scholar]
- Schulkin J, & Raglan GB (2014). The evolution of music and human social capability. Frontiers in Neuroscience, 8, 292. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Scott-Phillips TC, Dickins TE, & West SA (2011). Evolutionary theory and the ultimate–proximate distinction in the human behavioral sciences. Perspectives on Psychological Science, 6(1), 38–47. [DOI] [PubMed] [Google Scholar]
- Searcy WA (Ed.). (2019). Animal communication, cognition, and the evolution of language. Animal Behaviour, 151, 203–205. [Google Scholar]
- Sell A, Cosmides L, Tooby J, Sznycer D, Rueden C. von, & Gurven M (2008). Human adaptations for the visual assessment of strength and fighting ability from the body and face. Proceedings of the Royal Society B: Biological Sciences, 276(1656), 575–584. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sergeant DC, & Himonides E (2019). Orchestrated sex: The representation of male and female musicians in world-class symphony orchestras. Frontiers in Psychology, 10, 1760. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Seyfarth RM (1977). A model of social grooming among adult female monkeys. Journal of Theoretical Biology, 65(4), 671–698. [DOI] [PubMed] [Google Scholar]
- Seyfarth RM, & Cheney DL (1984). Grooming, alliances and reciprocal altruism in vervet monkeys. Nature, 308(5959), 541–543. [DOI] [PubMed] [Google Scholar]
- Shultz S, Opie C, & Atkinson QD (2011). Stepwise evolution of stable sociality in primates. Nature, 479(7372), 219–222. [DOI] [PubMed] [Google Scholar]
- Shuter-Dyson R, & Gabriel C (1981). The psychology of musical ability, 2nd edn, London; New York: Methuen. [Google Scholar]
- Sievers B, Polansky L, Casey M, & Wheatley T (2013). Music and movement share a dynamic structure that supports universal expressions of emotion. Proceedings of the National Academy of Sciences, 110(1), 70–75. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Silk JB, & Kappeler JM (2017). Sociality in primates. In Rubenstein DR & Abbot P, eds., Comparative social evolution, New York, N.Y.: Cambridge University Press. [Google Scholar]
- Singh M (2018). The cultural evolution of shamanism. Behavioral and Brain Sciences, 41, 1–62. [DOI] [PubMed] [Google Scholar]
- Sluming VA, & Manning JT (2000). Second to fourth digit ratio in elite musicians: Evidence for musical ability as an honest signal of male fitness. Evolution and Human Behavior, 21(1), 1–9. [Google Scholar]
- Smith DR, & Dworkin M (1994). Territorial interactions between two Myxococcus species. Journal of Bacteriology, 176(4), 1201–1205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith JM, & Price GR (1973). The logic of animal conflict. Nature, 246(5427, 5427), 15–18. [Google Scholar]
- Snowdon CT (2017). Vocal communication in family-living and pair-bonded primates. In Quam R, Ramsier M, Fay RR, & Popper AN, eds., Primate hearing and communication, 1st edn, Vol. 63, Springer International Publishing, pp. 141–174. [Google Scholar]
- Soltis J (2004). The signal functions of early infant crying. Behavioral and Brain Sciences, 27(04), 443–458. [PubMed] [Google Scholar]
- Sosis R (2000). Costly signaling and torch fishing on Ifaluk atoll. Evolution and Human Behavior, 21(4), 223–244. [DOI] [PubMed] [Google Scholar]
- Spence M (1973). Job market signaling. The Quarterly Journal of Economics, 87(3), 355–374. [Google Scholar]
- Spencer H (1902). Facts and comments (two essays), New York; London: D. Appleton and Co. [Google Scholar]
- Sperber D (1996). Explaining culture: A naturalistic approach, Oxford, UK; Cambridge, Mass.: Blackwell. [Google Scholar]
- Sperber D, & Hirschfeld LA (2004). The cognitive foundations of cultural stability and diversity. Trends in Cognitive Sciences, 8(1), 40–46. [DOI] [PubMed] [Google Scholar]
- Stiner M (2019). Human predators and prey mortality, Milton: Routledge. [Google Scholar]
- Stiner MC (2013). An unshakable middle paleolithic? Trends versus conservatism in the predatory niche and their social ramifications. Current Anthropology, 54(S8), S288–S304. [Google Scholar]
- Stiner MC (2017). Love and death in the stone age: What constitutes first evidence of mortuary treatment of the human body? Biological Theory, 12(4), 248–261. [Google Scholar]
- Swedell L, & Plummer T (2019). Social evolution in Plio-Pleistocene hominins: Insights from hamadryas baboons and paleoecology. Journal of Human Evolution, 137, 102667. [DOI] [PubMed] [Google Scholar]
- Tarr B, Launay J, Cohen E, & Dunbar RIM (2015). Synchrony and exertion during dance independently raise pain threshold and encourage social bonding. Biology Letters, 11(10), 20150767. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tarr B, Launay J, & Dunbar RIM (2014). Music and social bonding: “Self-other” merging and neurohormonal mechanisms. Frontiers in Psychology, 5(1096). [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tarr B, Launay J, & Dunbar RIM (2016). Silent disco: Dancing in synchrony leads to elevated pain thresholds and social closeness. Evolution and Human Behavior, 37(5), 343–349. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Temperley D (2004). Communicative pressure and the evolution of musical styles. Music Perception, 21(3), 313–337. [Google Scholar]
- Thompson ME (2013). Reproductive ecology of female chimpanzees. American Journal of Primatology, 75(3), 222–237. [DOI] [PubMed] [Google Scholar]
- Tibbetts EA, Mullen SP, & Dale J (2017). Signal function drives phenotypic and genetic diversity: The effects of signalling individual identity, quality or behavioural strategy. Philosophical Transactions of the Royal Society B: Biological Sciences, 372(1724), 20160347. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tierney AT, Russo FA, & Patel AD (2011). The motor origins of human and avian song structure. Proceedings of the National Academy of Sciences, 108(37), 15510–15515. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tinbergen N (1963). On aims and methods of ethology. Zeitschrift Für Tierpsychologie, 20(4), 410–433. [Google Scholar]
- Tobias JA, Sheard C, Seddon N, Meade A, Cotton AJ, & Nakagawa S (2016). Territoriality, social bonds, and the evolution of communal signaling in birds. Frontiers in Ecology and Evolution, 4. [Google Scholar]
- Todd P (2000). Simulating the evolution of musical behavior. In Wallin NL, Merker B, & Brown S, eds., The origins of music, Cambridge, Mass.: MIT Press, pp. 361–388. [Google Scholar]
- Todd PM, & Werner GM (1999). Frankensteinian methods for evolutionary music composition. In Griffith N & Todd PM, eds., Musical networks: Parallel distributed perception and performance, Cambridge, Mass.: MIT Press. [Google Scholar]
- Tooby J, & Cosmides L (1992). The psychological foundations of culture. In The adapted mind: Evolutionary psychology and the generation of culture, New York: Oxford University Press. [Google Scholar]
- Trainor LJ (2015). The origins of music in auditory scene analysis and the roles of evolution and culture in musical creation. Philosophical Transactions of the Royal Society B: Biological Sciences, 370(1664), 20140089. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Trainor LJ, Wu L, & Tsang CD (2004). Long-term memory for music: Infants remember tempo and timbre. Developmental Science, 7(3), 289–296. [DOI] [PubMed] [Google Scholar]
- Trehub SE (2001). Musical predispositions in infancy. Annals of the New York Academy of Sciences, 930(1), 1–16. [DOI] [PubMed] [Google Scholar]
- Trehub SE, Unyk AM, & Trainor LJ (1993a). Adults identify infant-directed music across cultures. Infant Behavior and Development, 16(2), 193–211. [Google Scholar]
- Trehub SE, Unyk AM, & Trainor LJ (1993b). Maternal singing in cross-cultural perspective. Infant Behavior and Development, 16(3), 285–295. [Google Scholar]
- Trehub S, & Nakata T (2001–2002). Emotion and music in infancy. Musicae Scientiae. [Google Scholar]
- Trivers RL (1972). Parental investment and sexual selection. In Campbell BG, ed., Sexual selection and the descent of man, Chicago: Aldine, pp. 136–179. [Google Scholar]
- Trivers RL (1974). Parent-offspring conflict. American Zoologist, 14(1), 249–264. [Google Scholar]
- Ubeda F (2008). Evolution of genomic imprinting with biparental care: Implications for Prader-Willi and Angelman syndromes. PLoS Biology, 6(8), 1678–1692. [DOI] [PMC free article] [PubMed] [Google Scholar]
- van den Broek EMF, & Todd PM (2009). Evolution of rhythm as an indicator of mate quality. Musicae Scientiae, 13(2 suppl), 369–386. [Google Scholar]
- Van Schaik CP (1983). Why are diurnal primates living in groups? Behaviour, 87(1/2), 120–144. [Google Scholar]
- Wallis NJ, & Blessing ME (2015). Big feasts and small scale foragers: Pit features as feast events in the American Southeast. Journal of Anthropological Archaeology, 39, 1–18. [Google Scholar]
- Weinstein D, Launay J, Pearce E, Dunbar RIM, & Stewart L (2016). Singing and social bonding: Changes in connectivity and pain threshold as a function of group size. Evolution and Human Behavior, 37(2), 152–158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wertz AE (2019). How plants shape the mind. Trends in Cognitive Sciences, 23(7), 528–531. [DOI] [PubMed] [Google Scholar]
- Wich SA, & Nunn CL (2002). Do male “long-distance calls” function in mate defense? A comparative study of long-distance calls in primates. Behavioral Ecology and Sociobiology, 52(6), 474–484. [Google Scholar]
- Wiessner P, & Schiefenhövel W (1998). Food and the status quest: An interdisciplinary perspective, Providence, R.I.: Berghahn Books. [Google Scholar]
- Wiley RH, & Wiley MS (1977). Recognition of neighbors’ duets by stripe-backed wrens Campylorhynchus nuchalis. Behaviour, 62(1/2), 10–34. [Google Scholar]
- Willems EP, & van Schaik CP (2015). Collective action and the intensity of between-group competition in nonhuman primates. Behavioral Ecology, 26(2), 625–631. [Google Scholar]
- Williams CA, Beaudet AL, Clayton-Smith J, … Wagstaff J (2006). Angelman syndrome 2005: Updated consensus for diagnostic criteria. American Journal of Medical Genetics, 140(5), 413–418. [DOI] [PubMed] [Google Scholar]
- Williams GC (1966). Adaptation and natural selection: A critique of some current evolutionary thought, Princeton, N.J.: Princeton University Press. [Google Scholar]
- Wilson DE, & Reeder DM (Eds.). (2005). Mammal species of the world: A taxonomic and geographic reference, 3rd edition, Vols. 1–2, Baltimore, MD: Johns Hopkins University Press. [Google Scholar]
- Wilson M, & Cook PF (2016). Rhythmic entrainment: Why humans want to, fireflies can’t help it, pet birds try, and sea lions have to be bribed. Psychonomic Bulletin & Review, 23(6), 1647–1659. [DOI] [PubMed] [Google Scholar]
- Wilson ML, Hauser MD, & Wrangham RW (2001). Does participation in intergroup conflict depend on numerical assessment, range location, or rank for wild chimpanzees? Animal Behaviour, 61(6), 1203–1216. [Google Scholar]
- Wiltermuth SS, & Heath C (2009). Synchrony and cooperation. Psychological Science, 20(1), 1–5. [DOI] [PubMed] [Google Scholar]
- Winkler I, Háden GP, Ladinig O, Sziller I, & Honing H (2009). Newborn infants detect the beat in music. Proceedings of the National Academy of Sciences, 106(7), 2468–2471. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zahavi A (1975). Mate selection—A selection for a handicap. Journal of Theoretical Biology, 53(1), 205–214. [DOI] [PubMed] [Google Scholar]