Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2023 Nov 13;120(47):e2218799120. doi: 10.1073/pnas.2218799120

Overcoming bias in the comparison of human language and animal communication

Erica A Cartmill a,b,1
PMCID: PMC10666095  PMID: 37956297

Abstract

Human language is a powerful communicative and cognitive tool. Scholars have long sought to characterize its uniqueness, but each time a property is proposed to set human language apart (e.g., reference, syntax), some (attenuated) version of that property is found in animals. Recently, the uniqueness argument has shifted from linguistic rules to cognitive capacities underlying them. Scholars argue that human language is unique because it relies on ostension and inference, while animal communication depends on simple associations and largely hardwired signals. Such characterizations are often borne out in published data, but these empirical findings are driven by radical differences in the ways animal and human communication are studied. The field of animal communication has been dramatically shaped by the “code model,” which imagines communication as involving information packets that are encoded, transmitted, decoded, and interpreted. This framework standardized methods for studying meaning in animal signals, but it does not allow for the nuance, ambiguity, or contextual variation seen in humans. The code model is insidious. It is rarely referenced directly, but it significantly shapes how we study animals. To compare animal communication and human language, we must acknowledge biases resulting from the different theoretical models used. By incorporating new approaches that break away from searching for codes, we may find that animal communication and human language are characterized by differences of degree rather than kind.

Keywords: ostension, inference, meaning, gesture, human uniqueness


Human communication draws on highly developed ostensive and inferential capacities (13). While much of language is codified and structured, humans can use virtually any behavior to communicate (e.g., a head tilt, a look, moving an object). The ostensive capacity to turn anything into a signal is seen as a powerful driver of human communication. The hypothesized mechanisms underlying this capacity have become one of the last bulwarks of human language’s uniqueness.

There is a long history of searching for the property that makes human language unique. Many properties once considered unique to humans (e.g., referential communication, syntax, cultural transmission, and intentionality) have subsequently been found in animal communication systems, at least in simple forms. I argue that recent claims that the cognitive foundations of human language are unique are not supported by the evidence cited, because the theories and methods used to study human language are not comparable to those used to study animal communication. These differences bias the conclusions we draw about the nature of the two systems.

Searching for Features Unique to Human Language

Human language was once thought to be a more formal, structured system, in contrast to the emotional, contextually bound communicative acts of animals. Even Darwin’s famous work comparing the expression of emotions in humans and other animals set “articulate language” to one side, considering it to be an altogether different type of system (4). These perceptions were influenced by a long tradition in Western philosophy privileging humanity’s capacity for rational thought, as well as by models of language that focused on our ability to convey meaning in predictable, decontextualized ways (5). These models were often developed by people studying written language and accordingly focused on the structural features and decontextualized, disembodied aspects of language.

For most of the 20th century, scholars argued that structural features of human language distinguished it from animal communication. The search for the structural “key” to language turned into an arms race between linguists and animal communication researchers. Every time a linguistic feature was proposed as the Rubicon separating human language and animal communication, animal researchers would look for evidence of that ability in their study species. Eventually, it would be found in one or more species (though in simpler or attenuated forms). In order to demonstrate the ability in nonhuman animals, researchers would often broaden its definition to make sense for nonlinguistic communication. This sometimes changed the definition considerably. Linguists would object. Animal researchers would defend their generalizations. This cycle was repeated many times.

Four of the primary candidates for features differentiating human language and animal communication—cultural transmission, reference, syntax, recursion—are reviewed below. Many other aspects of communication have been compared between animal and human systems (e.g., turn-taking, multimodality, intentionality), but these four illustrate how diverse aspects of language—learning, meaning, and structure—have been identified in animal systems. Note that finding a feature of human language in an animal system does not imply that language is not unique. Each feature has been found in one or more animal species, but never to the degree seen in human language. Language supercharges these features, and it is likely unique in its constellation thereof.

Cultural transmission was long assumed to be a defining feature of human language since language is acquired through convention and culture rather than biological inheritance. We now know that many other species, particularly birds, also acquire their vocalizations through learning from adult models. Some species even have a “critical period” in infancy, during which they learn from song models and rehearse their songs, and after which they cannot learn to produce these songs correctly (6). Evidence of vocal learning coupled with evidence of dialect differences across populations led scholars to conclude that there was cultural transmission of vocalizations in birds, whales, bats, and elephants (712)*.

Reference (using a signal to denote something in the external environment) was once argued to be unique to human language. Then, researchers demonstrated that the alarm calls of many types of animals differ systematically in response to different types of predators and elicit different escape behaviors in listeners (13, 14). This predictable association with an external event seemed to meet the broad definition of reference; however, animal researchers began to use the term “functional reference,” since no one knew whether the animals intended to refer to the predators when producing alarm calls. Once animals appeared to demonstrate a capacity for (at least functional) reference, displaced reference became the new bar. Humans not only refer to things in their immediate environment but also refer to things displaced in space or time (5, 15). In contrast, the animal alarm calls that demonstrated functional reference were always produced in the presence of a predator (the referent). Arguments for the uniqueness of displaced reference dovetailed with arguments that only humans were capable of mental time travel: the cognitive ability to remember the past or imagine the future from a first-person perspective (16). Experimental paradigms have since shown that great apes can point to things in their immediate environment to reference objects displaced in space and time (17). Other studies suggest that apes and other species are capable of mental time travel; they plan for future opportunities to retrieve food by selectively caching different types of food or storing tools where needed to retrieve future rewards (1820).

Many scholars have argued that syntax is the defining feature of human language (2124). Syntactic rules allow humans to communicate about an infinite number of things with a finite number of symbols. Having a set of rules that can be applied in predictable but open-ended ways to a set of symbols (e.g., words) enables humans to create sentences that they have never heard before. No other species has been found to use syntax as humans do, but evidence of syntax-like structural rules has been found in several animal communication systems. The songs produced by many bird species have “rules” that predict which sounds can be combined into motifs and which motifs can follow one another (25, 26). The same is true for whales (27) and the call sequences of some primates (28). When perception of song sequences is tested by playing back recordings to birds, most species recognize when components are presented in orders that break the rules (2931); there is little evidence, however, that rules about component order encode differences in meaning. Some argue that the “syntactic” patterns in bird and whale song are better described as phonological rules (or phonological syntax rather than lexical syntax), since they apply to sound combinations but not, as far as we can tell, to changes in meaning (25). Note that I use meaning throughout this paper to encompass both the rich, intended meanings of human language (and arguably ape gestures), and the information receivers extract from animal signals.

Animals can combine signals to amplify or modify one of the component parts (3237)—for example, by increasing intensity. However, there is little evidence that signal combinations produce meanings that differ from their parts. This type of combination (A + B = C) would suggest a syntax more reliant on the rule than on the identity of the components, one where the combination takes on an entirely new meaning. Such rules have been hinted at in the call sequences of a few monkey (3840) and bird species (41, 42), but not all scholars are convinced these systems require syntax to explain the observed patterns (43). Of course, animals might combine calls for reasons that have nothing to do with changing the meaning of the component parts. Song complexity has been linked to reproductive success across different taxa, suggesting that the ability to combine different sounds is itself attractive (44, 45).

One particular type of syntactic rule—recursion—has been proposed as the feature that distinguishes both human language and human minds (46). Recursion is the ability to apply a syntactic rule to the output of that same syntactic rule, ad infinitum. This enables humans to create infinitely long utterances with multiple embeddings. For example, the utterance “I read the sentence that Cartmill wrote that illustrated the property that Chomsky described that Fitch argued made human language unique” exhibits several instances of recursion. This type of rule is complicated to employ and difficult to test, but one study found evidence that European starlings are capable of distinguishing patterns that rely on recursion (47). Others have argued that these results may have been based on numerical cognition rather than understanding of recursion (48), and so the search continues.

In recent years, the focus on structure as the defining aspect of language has relaxed. Indeed, the broad characterization of human language as structured and of animal communication as lacking structure has largely reversed. Human language is now often characterized by its flexibility, openness, and ability to deviate from its “code,” while animal systems are depicted as inflexible and reflexive, restricted to operating within set rules (3, 49, 50). This shift from structural features to cognitive building blocks aligns with a broader shift towards studying the minds rather than just the behavior of animals (51). However, shifting from the features of communicative acts to the often invisible motivational states and cognitive abilities behind them makes it more difficult to draw comparisons across species. As we shall see, the shift toward meaning and cognition also gives theoretical models of communication—and their methodological implications—an outsized role in shaping the empirical evidence about animal abilities.

Human Language Relies on Understanding Others’ Minds.

Human language is an exceptionally rich communicative system, even when it appears in the impoverished form studied by most linguists (written down and stripped of its nonverbal components and interactional information). Sociolinguists and linguistic anthropologists argue that human language is characterized not by the formal features seen in disembodied and decontextualized examples but by the rich social cognition and pragmatic inferences that underlie face-to-face communication. Pragmatics is the branch of linguistics that focuses on how language is used in everyday life and how meaning is conveyed in situated, interactional contexts rather than through only words and syntax. The heart of pragmatics is the idea that language meaning is inflected by its physical, social, and conversational environments.

The flexibility of human language means that each language user has the potential to modify, embellish, or expand the system, either in the long term (by inventing new words or structures) or in the short term (by using things outside the linguistic system in communicative ways). Long-term modifications allow languages to change over time. Short-term modifications allow speakers to adapt particular communicative acts to their goals, interlocutors, and environments. This might involve weaving together conventional symbols and structures with nonverbal elements like gestures, facial expressions, intonations, or prosodic emphasis. It may simply involve rich inferences about the goals or knowledge of one’s interlocutor. Take, for example, the following exchange between two friends at a party:

  1. Susan: Is George here?

  2. Lila: I saw an orange truck when I came.

To understand this exchange, you do not need to know that George drives an orange truck; you can extract the meaning from the response. This is done through inference, rather than a literal reading of the sentence. Many human conversations are intelligible only because (A) the speaker makes clear that their actions are communicative and (B) the listener makes a series of inferences about the relevance of the speaker’s actions to the ongoing conversation. The shared goals and coordinated actions of speaker and listener are encapsulated in the cooperative principle of philosopher Paul Grice (52). It is debated whether animal communication in general—and great ape gestures in particular—ever display Gricean principles of communication (5357).

The cognitive abilities underlying our capacity to communicate in novel, nonliteral, and flexible ways are ostension and inference. Through ostension, a person draws attention to their actions and conveys that they are communicative (e.g., sliding their glass towards someone who is pouring wine). Through inference, others can perceive the communicative intent and respond accordingly (e.g., by filling the glass). The ostensive-inferential model of human language proposes that signalers are concerned with making their communicative intentions known (and may use a variety of behaviors to provide evidence of intention), and listeners are concerned with gathering this evidence and making inferences about the intended meaning (1, 3, 49, 58, 59). In line 2 of the example above, Lila answers Susan’s question by referring to an unusual type of car (not to George). Susan makes sense of this reply by assuming that Lila’s utterance was relevant to her question and infers that George is present. Such indirect reference seems complicated, but it is common in everyday language and demonstrates how easily humans employ ostensive-inferential abilities. Importantly, it relies heavily on understanding others’ minds and intentions: so-called theory of mind (2). There is evidence from cognitive studies that animals possess at least some aspects of theory of mind (e.g., understanding gaze), but the ways in which we typically study animal communication do not leave much room for questions about ostension and inference.

Socio-Cognitive Abilities in Animals.

Social animals have evolved sophisticated cognitive abilities that help them navigate their social worlds. These abilities include learning from others, tracking relationships across time, predicting others’ actions, making inferences about others’ goals, and responding to others’ behavior based on inferences about what they can or cannot see refs. (6062).

Nonhuman primates are renowned for complex social cognition, which they use to navigate social hierarchies, form friendships, recruit allies, and avoid rivals (6365). They understand not only their own relationships, but the nature of relationships between others (66). There is evidence that at least great apes understand what others can and can’t see ref. (60), as well as what others know and don’t know (67, 68), suggesting that they possess some type of theory of mind. Adult humans can understand long recursive chains of theory of mind (e.g., I know that Matt knows that Jacob believes that Gal hopes that I want to see the movie tomorrow), but the building blocks of understanding other minds develop slowly over the first few years of life (69, 70). Human toddlers and great apes both pass simple theory-of-mind tests; for example, by visually predicting where others will search for an item that the subject knows has been moved (65, 67, 71, 72). Note, however, that young children (but not apes) have at least some knowledge of language, which may give them an advantage in propositional thinking about others’ beliefs (73).

Some argue that socio-cognitive abilities like theory of mind provide the foundation underlying human language but only enrich (rather than underlie) animal communication (1, 3, 49, 50). This might seem like a semantic quibble, but the difference is significant. The claim is that human language is built on a fundamentally different type of cognition (one that requires ostension and inference), while all animal systems are exchanges of codes that do not require inferences about others (but may sometimes employ them). This model of language origins argues that language is not an animal-like system enriched with properties like syntax and complex inferences about others’ intentions. Instead, it claims that language is an entirely new type of system reliant on inferences about others’ minds at its most basic level. This claim is intriguing but it proposes that human language represents a discontinuity in the evolution of animal communication systems: Language involves sophisticated social inferences about signaler intentions and recipient knowledge purportedly lacking in animal systems. This, in turn, requires considerable evidence that rich social inferences about others’ minds are missing from nonhuman communicative acts. The difficulty is that data on human and animal communication have been heavily biased by differences in the way humans and animals are studied.

Animal Communication As Codes.

Animal communication research has been strongly shaped by the “code model” of communication, though it is rarely referenced directly (Fig. 1). In this model, a message is encoded by the signaler, transmitted across a communicative channel (where it can distorted by “noise”), and then decoded by the receiver. The goal is for the decoded and encoded messages to be the same, with little information lost during transmission. This model grew out of Shannon’s information theory (74) and the conduit model of information transfer (75). The code model reflects its origins—the transmission of electronic signals in broadcasting and engineering—through the metaphors of encoding, decoding, transmission, noise, and signal interference.

Fig. 1.

Fig. 1.

A schematic illustrating the code model of communication.

The code model provides a useful recipe for breaking communication into phases, each a potential source of miscommunication. For example, a signaler could choose the wrong signal, the signal could be difficult to perceive due to interference (noise), or the receiver could misinterpret the signal and respond inappropriately. In describing potential sources of miscommunication, I have avoided code model terms (encoding, decoding, error, transmission) because they equate communication with robotic transfer. Herein lies the main problem with the code model: It dramatically restricts the ways we conceptualize (and study) animal communication.

The code model has had a pervasive effect on the design and interpretation of research on animal communication. Studies are predominantly built around searching for replicable contingencies between a signal produced by one individual and a subsequent change of behavior in another—in other words, codes (76). This search for code-like systems is mostly unavoidable; it is the primary way researchers demonstrate that animal signals have meaning (i.e., that they are communicative). If a signal does not reliably relate to a feature of the environment and/or evoke a particular response, then it’s hard to argue that it means anything—at least if your starting assumption is that animal signals are meaningless until proven otherwise.

This approach differs profoundly from how meaning is studied in language. Many human words, sentences, and gestures do not evoke an immediate reaction in others. Consider the paper you are currently reading. Unless reading this paper led to a predictable response in readers (e.g., angrily throwing it across the room or immediately designing a new study), standard methods in animal communication would conclude that the paper had no meaning. Methods for determining meaning in animal signals are not good at detecting communicative acts with delayed reactions, acts that impact internal states rather than external behavior, or acts that are embedded within ongoing interactions and are difficult to isolate.

Methodological Bias in the Search for Meaning.

To establish that animal signals have meaning, researchers focus on isolated signals that elicit predictable, immediate behavioral reactions. This makes sense; humans are not fluent in the systems they are studying, and animals cannot provide translations. However, this focus on predictable contingencies between signals and responses tightly constrains how researchers study meaning in animal systems and which behaviors they study in the first place.

The gold standard for determining that an animal signal has meaning is the “playback” design. There are many forms of meaning-making that draw on social, contextual, or environmental features outside of the signal, but the playback method looks only at the meaning transmitted to the receiver by the information encoded in the signal itself (directly borrowing metaphors from the code model). In a playback design, a signal (almost always auditory) is recorded and later played back to one or more animals to observe their reactions. It is critical to play the recorded signal back outside of its original context so that the environmental conditions that elicited the signal are not present; any responses will then be due to the information in the signal.

Playback designs are effective at measuring particular types of signals (alarms), in particular modalities (auditory), with particular responses (immediate behaviors like fleeing). With careful experimental manipulation, the playback method allows researchers to test their attributions of signal meaning. This is a major advantage. Humans bias our ascription of meaning to animal signals by the ways we parse and label our worlds. Alarm calls are typically thought to reference the type of predator that elicits them (e.g., leopard, eagle), but animals might perceive and categorize the world differently. For example, domestic chickens were thought to have separate alarm calls for foxes and eagles. When chickens heard an eagle alarm call, they hunched down. When they heard a fox alarm call, they flew up to perches. However, when chickens were shown video of foxes on screens overhead, they gave “eagle” calls. And when they saw videos of eagles on the ground, they gave “fox” calls (77). This simple manipulation revealed that chickens do not have alarm calls referencing the type of animal [something humans, including infants, would consider a “natural kind” (78, 79)]; instead, they provide information about the location of the impending attack.

Studies of dynamic visual signals (like gestures) also look for contingencies between signals and responses, though it is usually not possible to play back visual signals. In animal gesture studies, researchers typically take all cases of a particular gesture form, filter for those with markers of intentional use, categorize and count receivers’ responses, and assign meaning to gesture types based on apparently satisfactory outcomes—situations where the gesturer ceases to signal rather than persisting (8082). After finding that signalers do not continue gesturing following Response X to Gesture A—but do persist following other responses to Gesture A—researchers conclude that Gesture A likely means “do X”. Such analyses of meaning focus on the signaler’s goal (what response from the receiver satisfies the signaler). This differs from playback studies, where analyses of meaning focus on how receivers perceive, interpret, and respond to the signal. This perspective shift is important because the goals of communication differ significantly between signalers and receivers (83). Analyses of gesture meaning and auditory playback designs are both successful at studying signals used in isolation with clear, immediate, behavioral responses, but this represents only a subset of all potentially communicative behaviors. What happens to other behaviors?

Animal signals not reliably associated with particular responses are often dismissed as “ambiguous” and excluded from analyses of meaning (or structure). In this way, studies present a highly curated subset of all potentially communicative behavior. This curation allows researchers to focus their efforts on frequent actions with immediate, predictable responses. Focusing on a subset of behavior has the added benefit of increasing the sample size for analyzed signals and standardizing methods across studies.

The use of playback to study alarm calls has become the paradigm for rigorously demonstrating that animal signals have meaning. Indeed, most studies of meaning in animal communication use variations of the playback design and draw directly on metaphors from the code model, like signaler and receiver. However, this limits the kinds of questions researchers can ask and the conclusions they can draw.

The Code Model Constrains Even Studies of Flexible Animal Systems.

Not all kinds of communication can be studied via playback, but methodological practices stemming from the code model skew conclusions about these systems as well. To illustrate this, I draw on examples from ape gesture. I do this for two reasons: First, because I have worked in this field for almost 20 y, so any criticisms I make necessarily include my own work; second, because ape gesture is arguably the least likely animal system to fall prey to the constraints of the code model. From the start, ape gesture has been recognized as both intentional and flexible—properties that make it more similar to human communication and significantly less code-like than other animal systems (8489). This focus on intentionality and flexibility helped to distinguish ape gestures from vocalizations, which were argued to be largely inflexible and contextually bound (90, 91). Ironically, it was only once ape gestures had been shown to be used flexibly that researchers began to systematically investigate gesture meaning by looking for tighter relationships between gestures and responses (i.e., codes). By highlighting the ways that the code model permeates study designs of even such extremely flexible and intentional animal systems as ape gesture, I hope to show how pervasive the search for codes is in studies of animal communication.

To study gesture, researchers start with video of ape groups. They cut these videos down to focus on nonfunctional movements produced in social situations, excluding movements produced when apes are alone and instrumental movements that achieve a physical goal (like picking up an infant). The remaining movements are categorized by their physical forms and contextual uses. They are often named by their similarity to human gestures (e.g., beckon, shoo, wave). Researchers may exclude gestures that do not meet criteria for intentionality, like waiting for a response or elaborating when there is no response (92). It is also common to exclude gestures that occur during play or multiparty interactions, since these are not likely to occur the same way multiple times. Most studies also exclude “rare” gestures observed only infrequently. Sample size is particularly important for analyses of meaning or function, where many examples of the same gesture are needed. In studies with smaller sample sizes, rarely observed gestures may be a sizable portion of the data. For example, Cartmill & Byrne (80) described 64 gestures in orangutans but included only 40 in analyses of meaning. Genty et al. (93) reported 102 gestures in gorillas but analyzed function for only the 10 most common. Finally, analyses of gesture typically focus on gestures that either occur alone or at the start of sequences, i.e., those that initiate interactions. This increases the likelihood that a gesture can be shown to elicit a particular response, critical for demonstrating that gestures have (code-like) meanings.

These curation and analysis practices have led to important discoveries about ape gesture. Nevertheless, it is important to recognize what these practices leave out and to recognize the ways in which they too have been shaped by assumptions of the code model. Table 1 presents examples of findings and methods from a range of ape gesture studies. This table is by no means exhaustive; it is meant to demonstrate how pervasive the search for codes is, even for a type of communication characterized by flexibility and intentionality. Some current studies of ape gesture attempt to move beyond searching for code-like contingencies by focusing on the pragmatics of interactions or socio-ecological contexts of communication (9499); I discuss these below. Despite recent work to widen our lens, the study of ape gesture continues to be blinkered by the code model, in both methods and research questions.

Table 1.

Comparison of methods and conclusions from published studies of ape gesture meaning (emphasis mine)

Study Species Conclusions Methods
Hobaiter & Byrne (85) Chimpanzee (Pan troglodytes) “…found [gestures] to be used intentionally to achieve 15 purposes...”

“We examined whether different gestures were associated with a specific pattern of outcomes..”

“… we therefore excluded data from play bouts to avoid masking the “real-world” meaning of gestures.”

Roberts et al. (104) Chimpanzee (Pan troglodytes) “...gestures were also strongly associated with specific responses and outcomes…”

“…consistent association between a given gesture type and particular behavioural change may be used to infer the meaning of different gestures”

“…this necessarily excluded both actions that were not clearly directed…towards a specific recipient that could visually perceive the signaller's behaviour…and actions that could feasibly be readily explained in noncommunicative terms”

Cartmill & Byrne (84) Orangutan (Pongo pygmaeus/abelii) “…more than half of the orangutan gestures we were able to analyse had predictable intentional meanings”

“The aim is to identify gestures that are used predictably to elicit specific reactions...”

“Only gestures that occurred singly or as the first gesture in a sequence were analysed for meaning. While this simplified the analysis by restricting it to a single signal and reaction in each case, it necessarily excluded some gestures from analysis.”

Graham et al. (105) Bonobo (Pan paniscus) “Bonobos intentionally deploy gestures to achieve at least 14 different intended outcomes”

“We are able to deduce the meaning of great ape gestures by looking at the ‘Apparently Satisfactory Outcome’ (ASO)…”

“…Fifteen gesture types were suitable for analysis, having been used by at least 3 individuals at least 3 times to achieve an ASO.”

These studies represent a range of authors and study species and all employ methods frequently used in the field. Each study found that gestures were used to communicate specific meanings. Each also explained in the methods that the researchers were explicitly looking for code-like contingencies between signal and response and curated their data in particular ways that made these codes easier to find.

Meaning beyond Codes.

Studies of animal communication (like those in Table 1) typically find that animals communicate using simple codes that elicit particular responses. However, this is unsurprising: These studies are looking for codes, often discarding communications that do not fit the code model.

Human language, in contrast, is already assumed to be both communicative and meaningful. This assumption frees up the ways it is studied. Flexibility in the way words and gestures are used is assumed to reveal properties of thought or interaction, rather than evidence that the observed behaviors are not communicative. A word doesn’t need to show a predictable response in order to be considered meaningful.

Humans communicate by integrating and interweaving culturally transmitted codes (spoken or signed language) with nonverbal conventions (e.g., gestures, prosody), nonverbal nonconventions (e.g., interacting with the physical environment), and assumptions about the knowledge and goals of their interlocutor. This integration occurs for both signaler and receiver, with information exchanged in both directions. Receivers provide feedback when they encounter difficulties in understanding, and signalers revise their communicative acts in real time based on this feedback (100, 101). Studies of human gesturing demonstrate that gesture is tightly integrated with speech, both semantically and temporally (102104). When one modality is disrupted (as in the case of stuttering), the other is as well (105, 106). Receivers also integrate information across gesture and speech when interpreting language (107, 108). Evidence from studies of gesture production and gesture perception supports a model of human language as a single, integrated, multimodal system—rather than two systems produced at the same time (109112). We have only begun to understand the full range of behavior and information humans use when communicating face-to-face. Fig. 2 gives an overview of some of the many possible sources of meaning humans draw on.

Fig. 2.

Fig. 2.

Schematic illustration of some of the sources of meaning humans integrate when communicating. Sources with code-like features (e.g., words) form only a small part of the semiotic landscape in natural conversation.

Fig. 2 is not a comprehensive model of human communication (113); it merely highlights the many ways humans convey meaning to one another. Note, however, how complex this simplified model of human communication is when compared to the code model used to study animal communication. Humans might ostensively signal that they are communicating by drawing on a wide range of sources: linguistic conventions, nonverbal conventions, or nonverbal nonconventions. For example, imagine that the people in Fig. 2 are at an event when the music switches to an obscure song. The signaler might smile and hold one finger up to draw attention to the song (nonverbal conventions), say “I remember digging through records at an estate sale and finding this waaay in the back,” and mime flipping through record albums (a nonverbal nonconvention). The receiver integrates information across the speaker’s words, actions, tone, expression—and the song itself—with inferences about the speaker’s intentions to interpret the communicative act.

It is unlikely that such a complex personal narrative would be found in a nonhuman animal. However, it seems possible (and even likely) that at least some animals use some of these ostensive and inferential abilities at least some of the time. This is especially true for great apes, who flexibly modify their gestures in response to the visual attention (114) and understanding of others (115), and who can (at least sometimes) make inferences based on others’ knowledge (67, 68) and expectations (116). However, the code model and the methods commonly used to study animals leave little room for discovering complex or infrequent interactions that involve integrating information across different sources. Recognizing differences in our implicit models of human and animal communication—and the ways in which these models bias our studies—is a necessary step toward credible comparisons between language and animal communication.

A Path Forward?

It is impossible to eliminate every difference in the way communication is studied in animals and humans, but it is critical to acknowledge and account for theoretical and methodological differences when comparing abilities between species. To draw accurate comparisons between the communication systems and underlying cognitive abilities of humans and animals, animal communication studies must expand beyond searching for code-like signals with predictable behavioral responses.

Scholars studying great ape communication have recently begun to break away from the code model by introducing a range of new methods, mainly adapted from studies of human interaction. Some have introduced analyses of pragmatics, focusing on the ways context can inflect meaning (94, 117); others allow for greater signal ambiguity, so that each signal does not need to have a single typical response to be included in analyses (82). Some have begun to apply analytic frames from the field of conversation analysis to primate gesture, shifting the focus from the signaling individual to the interacting dyad (9597). Others have argued that ape gesture researchers need to be bolder in their research questions and conclusions, urging scholars to more fully explore the types of communicative inferences apes can make rather than stopping at demonstrations of intentional signaling (118). Still others have begun developing computer-vision and machine-learning methods to circumvent human biases in the categorization of gesture forms or meanings (119). Finally, a few scholars have begun to use analytical frameworks from animal communication to study human interaction [e.g., to study gestures of human infants (120)]. These emerging lines of work have found much greater continuity between animal and human communication than traditional studies. However, most studies of animal communication remain constrained in both imagination and design by the limitations of the code model.

Current work on ape gesture provides some examples for how animal communication research might expand beyond the code model. This echoes an earlier shift in the field—also led by work on ape gesture—toward considering the intentionality of primate signals (e.g., refs. 121, 122). However, many of the more flexible approaches to ape gesture draw directly on methods from studies of human interaction (e.g., conversation analysis and pragmatics). Some of these methods would take significant work to apply to species more distantly related to humans; modifying them to fit different social structures, body plans, perceptual systems, and environments demands added care to avoid stretching definitions and categories past the point of useful comparison. Methods adapted from conversation analysis seem to be among the most directly applicable across a wide range of species (96, 123125). This is probably because conversation analysis began as a framework to study the underlying rules of human interaction from patterns of external behavior, much as is done with animals (126). Analyses of several aspects of interaction, including turn-taking and communicative repair, have been usefully applied to both animal and human data with only minor modifications (96, 127). The focus of conversation analysis on externally observable behavior helps it transfer between species, but its backgrounding of cognitive processes limits its utility in demonstrating abilities like ostension and inference (128). A wider sea change is needed in the ways researchers approach the communication and minds of animals. Here are two possibilities:

  1. Inclusion-by-association: One small but potentially impactful change would be for scholars to grant animals the same generous assumption that they do preverbal human children: that ambiguous behaviors directed toward others in the context of known communicative acts are probably also communicative. This assumption would greatly expand the types of exchanges that are included in studies of animal communication. Inclusion-by-association is common practice in interpretation of preverbal infant behavior, but it is not applied to animals. Instead, animal research typically engages in exclusion-by-association. For example, if an animal behavior occurs alongside signals or behaviors typically found in the context of play, it will often be glossed as play and excluded from further analyses of signal meaning (81, 129). Taking a page from studies of human language development, I propose inclusion-by-association rather than exclusion-by-association.

  2. Communicative play: Researchers could also ask whether animals engage in communicative play. Animal play can be viewed as an opportunity for rich inference and potentially for pretense, as individuals signal to others that their actions should not be taken literally (130132). Some types of play even display game-like qualities involving role reversal and hierarchical rules (116, 133). Despite considerable interest in animal play, communicative play is not often studied. Researchers could start with known signals and ask how animals use those signals in nonstandard ways. They could also conduct qualitative analyses of rare interactions. For example, a researcher might observe animals sometimes producing signals in nonsocial contexts; pairing signals with actions that are not obviously communicative; or using signals in nontypical ways during social play. These nonstandard uses would likely be missed or excluded by current protocols for studying animal communication, but they are similar to behaviors in human children that are known to be important in communicative development. One type of communicative play—babbling—has been documented in animal species that learn to produce their vocalizations. Many species of birds, some species of bats and marine mammals, and at least one species of marmosets go through periods of vocal rehearsal early in development characterized by playful exploration of the acoustic space (134139). This suggests that there may be other types of communicative play to be found outside of typical signaler-receiver information exchanges.

Opening up studies of animal communication to “messy” signaling would not require that researchers abandon the strict definitions and protocols that ensure interobserver reliability and replicability. Nor would it involve an “anything goes” approach to building a dataset. However, by systematically exploring instances where signals are used in ways that do not fit the expectations and assumptions of the code model, researchers may discover that animals can use a range of semiotic resources in indirect or novel ways.

Studies showing code-like contingencies between signals and responses have been crucial for demonstrating that animal signals have meanings and are not purely reflexes. The proposed methodological shifts toward considering animal communication outside of simple signal-response pairs would not invalidate or discount the contributions of studies following the code model. New, more flexible methods build on findings from code-model analyses, using known codes to anchor or contextualize ambiguous or rare behaviors. By acknowledging historic differences in the ways scholars study animals and humans—and incorporating new approaches that break away from primarily searching for codes—we may find that animal communication and human language are largely characterized by differences of degree rather than kind.

Acknowledgments

I am grateful to Jacob Foster for his encouragement and comments on the manuscript. I am indebted to Matt Cartmill and Kaye Brown for challenging me to think more deeply about these topics. I thank Thom Scott-Phillips, Heidi Lyn, Richard Moore, Lila Gleitman, and interlocutors from EVOLANG for spirited debates about the code model.

Author contributions

E.A.C. wrote the paper.

Competing interests

The author declares no competing interest.

Footnotes

This article is a PNAS Direct Submission.

*Cultural transmission does not play a substantial role in the acquisition of either vocal or gestural communication in non-human primates (140145). Primates, like many other species, do develop cultural traditions in other domains (146).

Data, Materials, and Software Availability

There are no data underlying this work.

References

  • 1.Sperber D., Wilson D., Relevance: Communication and Cognition (Blackwell, 1995). [Google Scholar]
  • 2.Origgi G., Sperber D., “Evolution, communication and the proper function of language” in Evolution and the Human Mind: Language, Modularity and Social Cognition, Carruthers P., Chamberlain A., Eds. (Cambridge University Press, 2000), pp. 140–169. [Google Scholar]
  • 3.Tomasello M., Origins of human Communication (MIT press, 2010). [Google Scholar]
  • 4.Darwin C., The Expression of the Emotions in Man and Animals (Cambridge University Press, 1872), pp. 364–393. The portable Darwin (1993). [Google Scholar]
  • 5.Hockett C. F., The origin of speech. Sci. Am. 203, 89–96 (1960). [PubMed] [Google Scholar]
  • 6.Catchpole C. K., Slater P. J. B., Bird Song: Biological Themes and Variations (Cambridge University Press, 2003). [Google Scholar]
  • 7.Garland E. C., McGregor P. K., Cultural transmission, evolution, and revolution in vocal displays: Insights from bird and whale song. Front. Psychol. 11, 544929 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.J. Hyland Bruno, E. D. Jarvis, M. Liberman, O. Tchernichovski, Birdsong learning and culture: Analogies with human spoken language. Annu. Rev. Linguist. 7, 449–472 (2021). [Google Scholar]
  • 9.Slater P. J., The cultural transmission of bird song. Trends Ecol. Evol. 1, 94–97 (1986). [DOI] [PubMed] [Google Scholar]
  • 10.Deecke V. B., Ford J. K., Spong P., Dialect change in resident killer whales: Implications for vocal learning and cultural transmission. Anim. Behav. 60, 629–638 (2000). [DOI] [PubMed] [Google Scholar]
  • 11.Janik V. M., Knörnschild M., Vocal production learning in mammals revisited. Philos. Trans. R. Soc. Lond. B Biol. Sci. 376, 20200244 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Poole J. H., Tyack P. L., Stoeger-Horwath A. S., Watwood S., Elephants are capable of vocal learning. Nature 434, 455–456 (2005). [DOI] [PubMed] [Google Scholar]
  • 13.Blumstein D. T., The evolution of functionally referential alarm communication: Multiple adaptations; multiple constraints. Evol. Commun. 3, 135–147 (1999). [Google Scholar]
  • 14.Gill S. A., Bierema A.M.-K., On the meaning of alarm calls: A review of functional reference in avian alarm calling. Ethology 119, 449–461 (2013). [Google Scholar]
  • 15.Morford J. P., Goldin-Meadow S., From here and now to there and then: The development of displaced reference in homesign and English. Child Dev. 68, 420–435 (1997). [PubMed] [Google Scholar]
  • 16.Suddendorf T., Corballis M. C., The evolution of foresight: What is mental time travel, and is it unique to humans? Behav. Brain Sci. 30, 299–313; discussion 313–351 (2007). [DOI] [PubMed] [Google Scholar]
  • 17.Lyn H., et al. , Apes communicate about absent and displaced objects: Methodology matters. Anim. Cogn. 17, 85–94 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Mulcahy N. J., Call J., Apes save tools for future use. Science 312, 1038–1040 (2006). [DOI] [PubMed] [Google Scholar]
  • 19.Raby C. R., Alexis D. M., Dickinson A., Clayton N. S., Planning for the future by western scrub-jays. Nature 445, 919–921 (2007). [DOI] [PubMed] [Google Scholar]
  • 20.Osvath M., Osvath H., Chimpanzee (Pan troglodytes) and orangutan (Pongo abelii) forethought: Self-control and pre-experience in the face of future tool use. Ani. Cogn. 11, 661–674 (2008). [DOI] [PubMed] [Google Scholar]
  • 21.Brown R., Bellugi U., Three processes in the child’s acquisition of syntax. Harv. Educ. Rev. 34, 133–151 (1964). [Google Scholar]
  • 22.Chomsky N., “Linguistic contributions to the study of mind: Future” in Language and Mind (Cambridge, 1968), pp. 323–364. [Google Scholar]
  • 23.Bickerton D., The supremacy of syntax. Behav. Brain Sci. 10, 658–659 (1987). [Google Scholar]
  • 24.Lieberman P., Human language and human uniqueness. Lang. Commun. 14, 87–95 (1994). [Google Scholar]
  • 25.Berwick R. C., Okanoya K., Beckers G. J. L., Bolhuis J. J., Songs to syntax: The linguistics of birdsong. Trends Cogn. Sci. 15, 113–121 (2011). [DOI] [PubMed] [Google Scholar]
  • 26.Searcy W. A., Soha J., Peters S., Nowicki S., Long-distance dependencies in birdsong syntax. Proc. Biol. Sci. 289, 20212473 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Payne K., Payne R., Large scale changes over 19 years in songs of humpback whales in Bermuda. Z. Tierpsychol. 68, 89–114 (2010). [Google Scholar]
  • 28.Mitani J. C., Marler P., A phonological analysis of male gibbon singing behavior. Behaviour 109, 20–45 (1989). [Google Scholar]
  • 29.Balaban E., Bird song syntax: Learned intraspecific variation is meaningful. Proc. Natl. Acad. Sci. U.S.A. 85, 3657–3660 (1988). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Emlen S. T., An experimental analysis of the parameters of bird song eliciting species recognition. Behaviour 41, 130–171 (1972). [Google Scholar]
  • 31.Taylor C. E., Brumley J. T., Hedley R. W., Cody M. L., Sensitivity of California Thrashers (Toxostoma redivivum) to song syntax. Bioacoustics 26, 259–270 (2017). [Google Scholar]
  • 32.Crockford C., C. Boesch, Call combinations in wild chimpanzees. Behaviour 142, 397–421 (2005). [Google Scholar]
  • 33.Zuberbühler K., A syntactic rule in forest monkey communication. Anim. Behav. 63, 293–299 (2002). [Google Scholar]
  • 34.Schassburger R. M., Vocal Communication in the Timber Wolf, Canis lupus: Structure, Motivation, and Ontogeny (Parey Scientific Publication, 1993). [Google Scholar]
  • 35.Robinson J. G., Syntactic structures in the vocalizations of wedge-capped capuchin monkeys, Cebus olivaceus. Behaviour 90, 46–78 (1984). [Google Scholar]
  • 36.Martin K., Horn A. G., Hannon S. J., The calls and associated behavior of breeding willow ptarmigan in Canada. Wilson Bull. 107, 496–509 (1995). [Google Scholar]
  • 37.Cleveland J., Snowdon C. T., The complex vocal repertoire of the adult cotton-top Tamarin (Saguinus oedipus oedipus)1). Z. Tierpsychol. 58, 231–270 (2010). [Google Scholar]
  • 38.Arnold K., Zuberbühler K., Semantic combinations in primate calls. Nature 441, 303–303 (2006). [DOI] [PubMed] [Google Scholar]
  • 39.Arnold K., Zuberbühler K., Meaningful call combinations in a non-human primate. Curr. Biol. 18, R202–R203 (2008). [DOI] [PubMed] [Google Scholar]
  • 40.Schlenker P., Chemla E., Arnold K., Zuberbühler K., Pyow-hack revisited: Two analyses of Putty-nosed monkey alarm calls. Lingua 171, 1–23 (2016). [Google Scholar]
  • 41.Suzuki T. N., Wheatcroft D., Griesser M., The syntax–semantics interface in animal vocal communication. Philos. Trans. R. Soc. Lond. B Biol. Sci. 375, 20180405 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Suzuki T. N., Semantic communication in birds: Evidence from field research over the past two decades. Ecol. Res. 31, 307–319 (2016). [Google Scholar]
  • 43.Schlenker P., Coye C., Leroux M., Chemla E., The ABC-D of animal linguistics: Are syntax and compositionality for real? Biol. Rev. Camb. Philos. Soc. 98, 1142–1159 (2023). [DOI] [PubMed] [Google Scholar]
  • 44.Kime N. M., Rand A. S., Kapfer M., Ryan M. J., Consistency of female choice in the túngara frog: A permissive preference for complex characters. Anim. Behav. 55, 641–649 (1998). [DOI] [PubMed] [Google Scholar]
  • 45.Zsebők S., et al. , Sequential organization of birdsong: Relationships with individual quality and fitness. Behav. Ecol. 32, 82–93 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Hauser M. D., Chomsky N., Fitch W. T., The faculty of language: What is it, who has it, and how did it evolve? Science 298, 1569–1579 (2002). [DOI] [PubMed] [Google Scholar]
  • 47.Gentner T. Q., Fenn K. M., Margoliash D., Nusbaum H. C., Recursive syntactic pattern learning by songbirds. Nature 440, 1204–1207 (2006). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.van Heijningen C. A. A., de Visser J., Zuidema W., ten Cate C., Simple rules can explain discrimination of putative recursive syntactic structures by a songbird species. Proc. Natl. Acad. Sci. U.S.A. 106, 20538–20543 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Scott-Phillips T., Speaking Our Minds: Why Human Communication is Different, and How Language Evolved to Make It Special (Bloomsbury Publishing, 2014). [Google Scholar]
  • 50.Scott-Phillips T. C., Nonhuman primate communication, pragmatics, and the origins of language. Curr. Anthropol. 56, 56–80 (2015). [Google Scholar]
  • 51.Andrews K., The Animal Mind: An Introduction to the Philosophy of Animal Cognition (Routledge, 2020). [Google Scholar]
  • 52.Grice H. P., “Logic and conversation” in Syntax and Semantics, Vol. 3, Speech Acts, Cole P., Morgan J. L., Eds. (Academic Press, 1975), pp. 41–58. [Google Scholar]
  • 53.Bar-On D., Origins of meaning: Must we “go Gricean”? Mind Lang. 28, 342–375 (2013). [Google Scholar]
  • 54.Scott-Phillips T. C., Meaning in great ape communication: Summarising the debate. Anim. Cogn. 19, 233–238 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Moore R., Meaning and ostension in great ape gestural communication. Anim. Cogn. 19, 223–231 (2016). [DOI] [PubMed] [Google Scholar]
  • 56.Graham K. E., Wilke C., Lahiff N. J., Slocombe K. E., Scratching beneath the surface: Intentionality in great ape signal production. Philos. Trans. R. Soc. Lond. B Biol. Sci. 375, 20180403 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Planer R. J., Going Dennettian about Gricean communication. Philos. Psychol. 1–24 (2023). [Google Scholar]
  • 58.Tomasello M., A Natural History of Human Thinking (Harvard University Press, 2018). [Google Scholar]
  • 59.Sperber D., Wilson D., Pragmatics, modularity and mind‐reading. Mind & Language 17, 3–23 (2002). [Google Scholar]
  • 60.Hare B., Call J., Agnetta B., Tomasello M., Chimpanzees know what conspecifics do and do not see. Anim. Behav. 59, 771–785 (2000). [DOI] [PubMed] [Google Scholar]
  • 61.Cheney D. L., Extent and limits of cooperation in animals. Proc. Natl. Acad. Sci. U.S.A. 108 (Suppl. 2), 10902–10909 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Davidson G. L., Clayton N. S., New perspectives in gaze sensitivity research. Learn. Behav. 44, 9–17 (2016). [DOI] [PubMed] [Google Scholar]
  • 63.Cheney D., Seyfarth R., Smuts B., Social relationships and social cognition in nonhuman primates. Science 234, 1361–1366 (1986). [DOI] [PubMed] [Google Scholar]
  • 64.Byrne R. W., Bates L. A., Primate social cognition: Uniquely primate, uniquely social, or just unique? Neuron 65, 815–830 (2010). [DOI] [PubMed] [Google Scholar]
  • 65.Rosati A. G., Santos L. R., Hare B., “Primate social cognition: Thirty years after Premack and Woodruff” in Primate Neuroethology, M. L. Platt, A. A. Ghazanfar, Eds. (Oxford University Press, 2010), pp. 117–143. [Google Scholar]
  • 66.Cheney D. L., Seyfarth R. M., How Monkeys See the World: Inside the Mind of Another Species (University of Chicago Press, 2018). [Google Scholar]
  • 67.Krupenye C., Kano F., Hirata S., Call J., Tomasello M., Great apes anticipate that other individuals will act according to false beliefs. Science 354, 110–114 (2016). [DOI] [PubMed] [Google Scholar]
  • 68.Kano F., Krupenye C., Hirata S., Tomonaga M., Call J., Great apes use self-experience to anticipate an agent’s action in a false-belief test. Proc. Natl. Acad. Sci. U.S.A. 116, 20904–20909 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Wellman H. M., Theory of mind: The state of the art. Eur. J. Dev. Psychol. 15, 728–755 (2018). [Google Scholar]
  • 70.Slaughter V., Theory of mind in infants and young children: A review. Aust. Psychol. 50, 169–172 (2015). [Google Scholar]
  • 71.Premack D., Woodruff G., Does the chimpanzee have a theory of mind? Behav. Brain Sci. 1, 515–526 (1978). [Google Scholar]
  • 72.R. M. Scott, Z. He, R. Baillargeon, D. Cummins, False-belief understanding in 2.5-year-olds: Evidence from two novel verbal spontaneous-response tasks. Dev. Sci. 15, 181–193 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.de Villiers J., The interface of language and theory of mind. Lingua 117, 1858–1878 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Shannon C. E., A mathematical theory of communication. Bell Syst. Techn. J. 27, 379–423 (1948). [Google Scholar]
  • 75.Reddy M. J., “The conduit metaphor” in Metaphor and Thought, Ortony A. Ed. (Cambridge, 1979), pp. 284–310. [Google Scholar]
  • 76.Townsend S. W., et al. , Exorcising Grice’s ghost: An empirical approach to studying intentional communication in animals. Biol. Rev. Camb. Philos. Soc. 92, 1427–1433 (2017). [DOI] [PubMed] [Google Scholar]
  • 77.Evans C. S., Evans L., Marler P., On the meaning of alarm calls: Functional reference in an avian vocal system. Anim. Behav. 46, 23–38 (1993). [Google Scholar]
  • 78.Gelman S. A., Markman E. M., Young children’s inductions from natural kinds: The role of categories and appearances. Child Dev. 58, 1532–1541 (1987). [PubMed] [Google Scholar]
  • 79.Quinn P. C., Category representation in young infants. Curr. Dir. Psychol. Sci. 11, 66–70 (2002). [Google Scholar]
  • 80.Cartmill E. A., Byrne R. W., Semantics of primate gestures: Intentional meanings of orangutan gestures. Anim. Cogn. 13, 793–804 (2010). [DOI] [PubMed] [Google Scholar]
  • 81.Hobaiter C., Byrne R. W., The meanings of chimpanzee gestures. Curr. Biol. 24, 1596–1600 (2014). [DOI] [PubMed] [Google Scholar]
  • 82.Hobaiter C., Graham K. E., Byrne R. W., Are ape gestures like words? Outstanding issues in detecting similarities and differences between human language and ape gesture. Philos. Trans. R. Soc. Lond. B Biol. Sci. 377, 20210301 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Seyfarth R. M., Cheney D. L., Signalers and receivers in animal communication. Annu. Rev. Psychol. 54, 145–173 (2003). [DOI] [PubMed] [Google Scholar]
  • 84.Tomasello M., Call J., Nagell K., Olguin R., Carpenter M., The learning and use of gestural signals by young chimpanzees: A trans-generational study. Primates 35, 137–154 (1994). [Google Scholar]
  • 85.Pika S., Liebal K., Tomasello M., Gestural communication in young gorillas (Gorilla gorilla): Gestural repertoire, learning, and use. Am. J. Primatol. 60, 95–111 (2003). [DOI] [PubMed] [Google Scholar]
  • 86.Liebal K., Pika S., Tomasello M., Gestural communication of orangutans (Pongo pygmaeus). Gesture (Amst.) 6, 1–38 (2006). [Google Scholar]
  • 87.Plooij F. X., “The behavioral development of free-living chimpanzee babies and infants” in Monographs on Infancy (Bloomsbury Academic Press, 1984), vol. 4, p. 207. [Google Scholar]
  • 88.Plooij F. X., “Some basic traits of language in wild chimpanzees?” In Action, Gesture and Symbol, A. Lock, Ed. (1978).
  • 89.Van Lawick-Goodall J., The behaviour of free-living chimpanzees in the Gombe stream reserve. Anim. Behav. Monogr. 1, 161–311, IN1–IN12 (1968). [Google Scholar]
  • 90.Pollick A. S., de Waal F. B. M., Ape gestures and language evolution. Proc. Natl. Acad. Sci. U. S. A. 104, 8184–8189 (2007). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Arbib M. A., Liebal K., Pika S., Primate vocalization, gesture, and the evolution of human language. Curr. Anthropol. 49, 1053–1076 (2008). [DOI] [PubMed] [Google Scholar]
  • 92.Byrne R. W., et al. , Great ape gestures: Intentional communication with a rich set of innate signals. Anim. Cogn. 20, 755–769 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Genty E., Breuer T., Hobaiter C., Byrne R. W., Gestural communication of the gorilla (Gorilla gorilla): Repertoire, intentionality and possible origins. Anim. Cogn. 12, 527–546 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Bohn M., Liebal K., Oña L., Tessler M. H., Great ape communication as contextual social inference: A computational modelling perspective. Philos. Trans. R. Soc. Lond. B Biol. Sci. 377, 20210096 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Genty E., et al. , How apes get into and out of joint actions: Shared intentionality as an interactional achievement. Interact. Stud. 21, 353–386 (2020). [Google Scholar]
  • 96.Heesen R., Fröhlich M., Sievers C., Woensdregt M., Dingemanse M., Coordinating social action: A primer for the cross-species investigation of communicative repair. Philos. Trans. R. Soc. Lond. B Biol. Sci. 377, 20210110 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Mondada L., Meguerditchian A., Sequence organization and embodied mutual orientations: Openings of social interactions between baboons. Philos. Trans. R. Soc. Lond. B Biol. Sci. 377, 20210101 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Graham K. E., Furuichi T., Byrne R. W., Context, not sequence order, affects the meaning of bonobo (Pan paniscus) gestures. Gesture 19, 335–364 (2020). [Google Scholar]
  • 99.Graham K. E., Badihi G., Safryghin A., Grund C., Hobaiter C., A socio-ecological perspective on the gestural communication of great ape species, individuals, and social units. Ethol. Ecol. Evol. 34, 235–259 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 100.Schegloff E. A., Jefferson G., Sacks H., The preference for self-correction in the organization of repair in conversation. Language 53, 361–382 (1977). [Google Scholar]
  • 101.Heritage J., Garfinkel and Ethnomethodology (John Wiley & Sons, 2013). [Google Scholar]
  • 102.Church R. B., Kelly S., Holcombe D., Temporal synchrony between speech, action and gesture during language production. Lang. Cogn. Neurosci. 29, 345–354 (2014). [Google Scholar]
  • 103.Esteve-Gibert N., Prieto P., Pons F., Nine-month-old infants are sensitive to the temporal alignment of prosodic and gesture prominences. Infant Behav. Dev. 38, 126–129 (2015). [DOI] [PubMed] [Google Scholar]
  • 104.Bosker H. R., Peeters D., Beat gestures influence which speech sounds you hear. Proc. Biol. Sci. 288, 20202419 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Mayberry R. I., Jaques J., “Gesture production during stuttered speech: Insights into the nature of gesture-speech integration” in Language and Gesture, D. McNeill, Ed. (Cambridge University Press, Cambridge, 2000), pp. 199–214. [Google Scholar]
  • 106.Graziano M., Gullberg M., When speech stops, gesture stops: Evidence from developmental and crosslinguistic comparisons. Front. Psychol. 9, 879 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Cassell J., McNeill D., McCullough K.-E., Speech-gesture mismatches: Evidence for one underlying representation of linguistic and nonlinguistic information. Pragmat. Cogn. 7, 1–34 (1999). [Google Scholar]
  • 108.Goldin-Meadow S., Singer M. A., From children’s hands to adults’ ears: Gesture’s role in the learning process. Dev. Psychol. 39, 509–520 (2003). [DOI] [PubMed] [Google Scholar]
  • 109.Kendon A., “Gesticulation and speech: Two aspects of the process of utterance” in The Relationship of Verbal and Nonverbal Communication, M. Key, Ed. (Mouton, 1980), pp. 207–227. [Google Scholar]
  • 110.McNeill D., Hand and Mind. What Gestures Reveal about Thought (University of Chicago Press, Chicago, 1992). [Google Scholar]
  • 111.McNeill D., Duncan S. D., "Growth points in thinking-for-speaking" in Language and Gesture (Cambridge University Press, Cambridge, 2000), Vol. 1987, pp. 141–161. [Google Scholar]
  • 112.Cartmill E. A., Gesture. Ann. Rev. Anthropol. 51, 455–73 (2022). [Google Scholar]
  • 113.Goodwin C., Co-Operative Action (Cambridge University Press, 2018). [Google Scholar]
  • 114.Liebal K., Call J., Tomasello M., Pika S., To move or not to move: How apes adjust to the attentional state of others. Interact. Stud. 5, 199–219 (2004). [Google Scholar]
  • 115.Cartmill E. A., Byrne R. W., Orangutans modify their gestural signaling according to their audience’s comprehension. Curr. Biol. 17, 1345–1348 (2007). [DOI] [PubMed] [Google Scholar]
  • 116.Eckert J., Winkler S. L., Cartmill E. A., Just kidding: The evolutionary roots of playful teasing. Biol. Lett. 16, 20200370 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 117.Sievers C., Gruber T., Reference in human and non-human primate communication: What does it take to refer? Anim. Cogn. 19, 759–768 (2016). [DOI] [PubMed] [Google Scholar]
  • 118.Warren E., Call J., Inferential communication: Bridging the gap between intentional and ostensive communication in non-human primates. Front. Psychol. 12, 718251 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 119.Wiltshire C., et al. , DeepWild: Application of the pose estimation tool DeepLabCut for behaviour tracking in wild chimpanzees and bonobos. J. Anim. Ecol. 92, 1560–1574 (2023), 10.1111/1365-2656.13932. [DOI] [PubMed] [Google Scholar]
  • 120.Kersken V., Gómez J.-C., Liszkowski U., Soldati A., Hobaiter C., A gestural repertoire of 1- to 2-year-old human children: In search of the ape gestures. Anim. Cogn. 22, 577–595 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121.Crockford C., Wittig R. M., Mundry R., Zuberbühler K., Wild chimpanzees inform ignorant group members of danger. Curr. Biol. 22, 142–146 (2012). [DOI] [PubMed] [Google Scholar]
  • 122.Schel A. M., Townsend S. W., Machanda Z., Zuberbühler K., Slocombe K. E., Chimpanzee alarm call production meets key criteria for intentionality. PLoS One 8, e76674 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 123.Logue D. M., Stivers T., Squawk in interaction: A primer of conversation analysis for students of animal communication. Behaviour 149, 1283–1298 (2012). [Google Scholar]
  • 124.Fröhlich M., Taking turns across channels: Conversation-analytic tools in animal communication. Neurosci. Biobehav. Rev. 80, 201–209 (2017). [DOI] [PubMed] [Google Scholar]
  • 125.Abreu F., Pika S., Turn-taking skills in mammals: A systematic review into development and acquisition. Front. Ecol. Evol. 10, 987253 (2022). [Google Scholar]
  • 126.Goodwin C., Heritage J., Conversation analysis. Annu. Rev. Anthropol. 19, 283–307 (1990). [Google Scholar]
  • 127.Rivera-Cáceres K. D., Quirós-Guerrero E., Araya-Salas M., Templeton C. N., Searcy W. A., Early development of vocal interaction rules in a duetting songbird. R. Soc. Open Sci. 5, 171791 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 128.Potter J., Edwards D., “Conversation analysis and psychology” in The Handbook of Conversation Analysis, Sidnell J., Stivers T., Eds. (John Wiley & Sons, 2012), pp. 701–725. [Google Scholar]
  • 129.Adang O. M. J., Teasing in young chimpanzees. Behaviour 88, 98–121 (1984). [Google Scholar]
  • 130.Bekoff M., Byers J. A., Animal Play: Evolutionary, Comparative and Ecological Perspectives (Cambridge University Press, 1998). [Google Scholar]
  • 131.Bekoff M., Allen C., “Intentional communication and social play: How and why animals negotiate and agree to play” in Animal Play: Evolutionary, Comparative and Ecological Perspectives, Bekoff M., Byers J. A., Eds. (Cambridge University Press, 1998), pp. 97–114. [Google Scholar]
  • 132.Palagi E., et al. , Rough-and-tumble play as a window on animal communication. Biol. Rev. Camb. Philos. Soc. 91, 311–327 (2016). [DOI] [PubMed] [Google Scholar]
  • 133.Mielke A., Carvalho S., Chimpanzee play sequences are structured hierarchically as games. PeerJ 10, e14294 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 134.Fernandez A. A., Burchardt L. S., Nagy M., Knörnschild M., Babbling in a vocal learning bat resembles human infant babbling. Science 373, 923–926 (2021). [DOI] [PubMed] [Google Scholar]
  • 135.Cazau D., Adam O., Laitman J. T., Reidenberg J. S., Understanding the intentional acoustic behavior of humpback whales: A production-based approach. J. Acoust. Soc. Am. 134, 2268–2273 (2013). [DOI] [PubMed] [Google Scholar]
  • 136.Ter Haar S. M., et al. , Cross-species parallels in babbling: Animals and algorithms. Philos. Trans. R. Soc. Lond. B Biol. Sci. 376, 20200239 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 137.A. Margaret Elowson, C. T. Snowdon, C. Lazaro-Perea, ‘Babbling’ and social context in infant monkeys: Parallels to human infants. Trends Cogn. Sci. 2, 31–37 (1998). [DOI] [PubMed] [Google Scholar]
  • 138.Pistorio A. L., Vintch B., Wang X., Acoustic analysis of vocal development in a New World primate, the common marmoset (Callithrix jacchus). J. Acoust. Soc. Am. 120, 1655–1670 (2006). [DOI] [PubMed] [Google Scholar]
  • 139.McCowan B., Reiss D., Whistle contour development in captive-born infant bottlenose dolphins (Tursiops truncatus): Role of learning. J. Comp. Psychol. 109, 242–260 (1995). [Google Scholar]
  • 140.Hayes K. J., Hayes C., The intellectual development of a home-raised chimpanzee. Proc. Am. Philos. Soc. 95, 105–109 (1951). [Google Scholar]
  • 141.Kellogg W. N., Communication and Language in the Home-Raised Chimpanzee. Science 162, 423–427 (1968). [DOI] [PubMed] [Google Scholar]
  • 142.Desai N. P., Fedurek P., Slocombe K. E., Wilson M. L., Chimpanzee pant-hoots encode individual information more reliably than group differences. Am. J. Primatol. 84, e23430 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 143.Owren M. J., Dieter J. A., Seyfarth R. M., Cheney D. L., “Food” calls produced by adult female rhesus (Macaca mulatta) and Japanese (M. fuscata) macaques, their normally-raised offspring, and offspring cross-fostered between species. Behaviour 120, 218–231 (1992). [Google Scholar]
  • 144.Badihi G., et al. , Dialects in leaf-clipping and other leaf-modifying gestures between neighbouring communities of East African chimpanzees. Sci. Rep. 13, 147 (2023). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 145.Cartmill E. A., Hobaiter C., Developmental perspectives on primate gesture: 100 years in the making. Anim. Cogn. 22, 453–459 (2019). [DOI] [PubMed] [Google Scholar]
  • 146.Whiten A., The burgeoning reach of animal culture. Science 372, eabe6514 (2021). [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

There are no data underlying this work.


Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES