Skip to main content
Proceedings of the Royal Society B: Biological Sciences logoLink to Proceedings of the Royal Society B: Biological Sciences
. 2019 Apr 3;286(1900):20190147. doi: 10.1098/rspb.2019.0147

Joint attention skills in wild Arabian babblers (Turdoides squamiceps): a consequence of cooperative breeding?

Yitzchak Ben Mocha 1,2,3,, Roger Mundry 2, Simone Pika 4,5,
PMCID: PMC6501685  PMID: 30940054

Abstract

Human cooperation strongly relies on the ability of interlocutors to coordinate each other's attentional state: joint attention. One predominant hypothesis postulates that this hallmark of the unique cognitive system of humans evolved due to the combination of an ape-like cognitive system and the prosocial motives that facilitate cooperative breeding. Here, we tested this hypothesis by investigating communicative interactions of a cooperatively breeding bird species, the Arabian babbler (Turdoides squamiceps). The behaviour of 12 wild social groups was observed focusing on two distinct communicative behaviours: object presentation and babbler walk. The results showed that both behaviours fulfilled the criteria for first-order intentional communication and involved co-orientation of recipients' attention. In turn, recipients responded with cooperative and communicative acts that resulted in coordinated joint travel between interlocutors. These findings provide the first evidence that another animal species shows several key criteria traditionally used to infer joint attention in prelinguistic human infants. Furthermore, they emphasize the influence of cooperative breeding on sophisticated socio-cognitive performances, while questioning the necessity of an ape-like cognitive system underlying joint attentional behaviour.

Keywords: Arabian babblers, Turdoides squamiceps, joint attention, cooperative breeding, human cognition, intentional communication

1. Introduction

The extraordinary degree of cooperation exhibited by humans seems unrivalled in the animal kingdom [1,2]. Theorists have implicated a specific cognitive capacity, joint attention, as one of the essential building blocks for the evolution of the cooperative abilities of humans [1,3]. Traditionally, joint attention has been defined as the ability to attract and coordinate the attention of a recipient towards a locus of mutual interest (e.g. an object/event in the environment [4]). Precursors of this cognitive capacity can already be found in human infants at the age of approximately six months, who respond to joint attention by following the direction of a caretaker's gesture [5]. At the age of 9–12 months, infants are capable of initiating joint attention with a social partner by gesturing towards a locus of mutual interest [4]. Consequently, the development of joint attention skills is seen as a fundamental milestone in the ontogeny of human cooperative communication [5,6]. Some scholars even see joint attention as the ‘small difference that made a big difference’ in the cognitive evolution of our species [3].

One predominant hypothesis about the evolution of human sophisticated social cognition, the ‘cooperative breeding’ hypothesis [2], postulates that cooperative breeding installed prosocial psychological functioning supporting systematic alloparental care. This, in turn, resulted in immediate consequences for socio-cognitive performance [7]: Caretakers of cooperative breeders evolved specific cognitive capacities for understanding the needs of others' offspring. In parallel, the offspring developed elaborate communicative signalling to attract attention and care from caretakers other than their mothers [8]. Concerning humans, it has been argued that the exceptional co-occurrence of an existing ape-like cognitive system together with humans’ systematic reliance on alloparental care [9,10] gave rise to unique socio-cognitive capacities such as joint attention (figure 1) [2,7].

Figure 1.

Figure 1.

The hypothesized role of cooperative breeding in the transition from ape-like to uniquely human cognition (after [2, p. 177]).

However, are joint attentional skills indeed uniquely human and is the combination of an ape-like cognitive system and cooperative breeding a crucial prerequisite?

To date, relatively little is known about joint attentional skills and the acting selection pressures in other animal species. Furthermore, whether specific behaviours of non-human animals towards human caretakers (e.g. gaze alternation in chimpanzees, Pan troglodytes [11]; pointing in bottlenose dolphins, Tursiops truncatus [12]; and vocal learning in grey parrots, Psittacus erithacus [13]) qualify as joint attentional skills in the human sense is subject to a contentious debate [3,14,15].

Here, we revisited the claim that joint attentional skills are a uniquely human ability [3,15] and tested whether the combination of an ape-like cognitive system and cooperative breeding represents a necessary requirement for joint attention to unfold [2]. We predicted that if an ape-like cognitive system and cooperative breeding are both necessary for joint attention [2,7], bird species will not exhibit key hallmarks of this trait. However, if an ape-like cognitive system is not necessary, but cooperative breeding does facilitate the performance of joint attention skills [16], cooperatively breeding species will demonstrate hallmarks of joint attention. To test these predictions, we investigated communicative interactions in a cooperatively breeding bird species, the Arabian babbler (Turdoides squamiceps), in the wild.

Arabian babblers live in stable social groups [17], consisting of 2–20 kin and non-kin from both sexes [17]. All members provide substantial allopaternal care and use elaborate communicative signalling [17]. We focused on two distinct signals, ‘object presentation’ [18] and ‘babbler walk’, which are frequently used to solicit following behaviour from conspecifics. object presentation involves the discreet presentation of an object to attract the recipient's attention without being seen by other group members. The goal is to lead the recipient towards a hidden location for copulation [18] (see electronic supplementary material, video S1). babbler walk is a multi-modal signal that involves conspicuous wing waving and vocalizations, and is used to solicit a conspecific to follow the signaller (figure 2).

Figure 2.

Figure 2.

Adult caretaker (right) performing babbler walk by (a) waving its wings and vocalizing in front of a fledgling; then (b) turning and hopping away while waving its wings, resulting in a following response. (Online version in colour.)

We paid special attention to two distinct issues that have hampered comparative research on joint attention. First, disagreement on an applicable definition of joint attention has so far prevented valid quantitative comparisons between human and other animal species [1921]. For example, some define joint attention as the ‘intentional co-orientation of two or more organisms to the same locus’ [21]. Others (e.g. [3]), by contrast, require complex mind reading and define it as ‘the mutual awareness of having attended to the same entity between two (or more) individuals. Mutual awareness is established through communication by at least one individual during mutual gaze’ [20]. To overcome this lack of consensus, we refrained from testing whether communicative episodes between Arabian babblers fit a specific definition. Instead, we investigated whether communicative interactions that were initiated by object presentation and babbler walk fulfilled distinct hallmarks that have traditionally been used to infer joint attention in prelinguistic human infants: (i) intentional communication (e.g. [21,22]), (ii) co-orientation of attention (e.g. [4,15]) and (iii) mutual awareness (e.g. [20,23]). By applying this approach, we aimed to instigate a constructive discussion on joint attentional skills that provides useful tools to pinpoint the different degrees of joint attention and cognitive capacities involved [19].

Second, research on joint attention in non-human species has been strongly biased towards interactions with objects (e.g. [12,20]). However, joint attention can revolve around any type of locus [15] such as, for instance, the interlocutors themselves, or the activity they are performing at the time [4,24]. Hence, we examined whether signallers acted to attract and co-orient recipients' attention to their joint travel.

We thus investigated whether communicative episodes initiated by object presentation and babbler walk fulfil the following three key hallmarks of joint attention (see table 1 and methods for detailed operational criteria).

Table 1.

Criteria used to infer join attention in interactions initiated by object presentation and babbler walk.

criterion operational criterion data coded
first-order intentional communication
voluntary signalling (social use) [22] selective production and withholding of signalling according to the presence of certain recipients and their behaviour audience composition and behaviour in which signalling started and stopped
recipient-directed signalling [22,25] the production of signalling behaviour—and its continuous adjustment—to ensure reception by the recipient's sensory perception (1) did the signaller position itself in front of the recipient before starting to signal (yes/no)
(2) was visual contact between interlocutors blocked (e.g. by vegetation) during the joint movement of the interlocutors (yes/no)
(2a) If yes, did the signaller re-position itself within sight of the recipient (yes/no, see electronic supplementary material, video S2)
signalling behaviour elicits responses that are conducive to realizing the signaller's goal [22,25] a change in the recipient's behaviour that (i) is repeatedly followed by the cessation of signalling [22,26] and (ii) does not elicit an aversive response from the signaller (1) did the recipient stop its pre-signalling behaviour and start a new behaviour during signalling without being physically forced by the signaller to do so (yes/no)
(2) the recipient's response type
(3) its behaviour when the signaller stopped signalling
goal-directed signalling [22,25] persistence: the signaller continues to re-signal and/or exaggerate the same signal used at the beginning of the communicative episode until its goal is realised or failure is clearly indicated (see electronic supplementary material, video S3)
elaboration: the signaller modifies the signal it used at the beginning of the communicative interaction (i.e. by adding and/or subtracting and/or modifying elements) or performs a different signal until its goal is realised or failure is clearly indicated (see electronic supplementary material, video S4)
(1) the duration of signalling behaviour
(2) was the recipient cooperative (i.e. a recipient that bent over in a copulation posture/followed the signaller within 30 s of receiving the signal) or uncooperative (i.e. a recipient that did not bend over for copulation/ did not follow the signaller within 30 s/stopped following it after few metres)
(3) did the signaller continue or stop signalling following the recipient's response
(4) did the signaller elaborate its signal during the communicative interaction (yes/no)
co-orientation of attention to the signaller's activity
monitoring the recipient's behaviour [26] the signaller turns its head from its direction of movement—at least 45°—towards the recipient (i.e. gaze alternation, see electronic supplementary material, videos S5 and S6) each time the signaller turned its head towards the recipient at least 45° or when it turned its entire body back towards the recipient
co-orientation of the recipient's attention [26] following its gaze alternation, the signaller continues to move forward if it was followed (electronic supplementary material, video S6) or returns to re-engage the recipient if not (see electronic supplementary material, video S7) (1) the recipient's behaviour when the signaller gazed towards it (bent over in a copulation posture/followed the signaller/did not follow the signaller)
(2) The signaller's behaviour following gaze alternation (continued to move forward/re-signalled from the same location/returned towards the recipient and re-signalled)
common ground
coordinated joint engagement [4,27] following signalling, the recipient stops its current behaviour and travels with the signaller for at least 3 s (electronic supplementary material, video S6) joint engagement was recoded from the moment the recipient started approaching the signaller until one of the participants stopped following the other or when body contact was established
understanding the signaller's goal [26] the recipient follows and moves ahead of the signaller in the same direction (electronic supplementary material, video S6)
for object presentation:
(1) if a signal is produced at an already hidden location, the recipient bends over for copulation
(2) upon arrival at a hidden location, the recipient bends over without further signal (electronic supplementary material, video S1)
changes in the recipient's behaviour throughout the communicative interaction
re-communication [23] following an object presentation, the recipient presents an object to the signaller as well (see electronic supplementary material, video S8)

(a). First-order intentional communication

To investigate the presence of first-order intentionality (in which the signaller intends that the signal produces a specific response in the recipient without necessarily requiring the recipient to recognize this intention [22]), we tested the following key markers [6,22,28]: (i) voluntary signalling, (ii) recipient-directed signalling, (iii) whether signalling elicits responses that are conducive to realizing the signaller's goal and (iv) goal-directedness of the behaviour. Since we have shown elsewhere [18,25] that object presentations qualify as intentionally produced signals, we examined these criteria here with regards to the babbler walk only.

(b). Co-orientation of attention

To examine whether signallers co-orient recipients’ attention to their own activity [4,15], we investigated whether signallers used gaze alternations to monitor recipients' behaviour and whether they tried to re-engage them in the case of non-cooperation [27].

(c). Common ground

This hallmark is controversial at theoretical and operational levels. At the theoretical level, some argue that awareness of the communication process is not required to achieve intentional co-orientation of attention [4,21]. By contrast, others see it as signature of truly intersubjective sharing [1]. Concerning the operational level, proponents of the mutual awareness criterion consider the communicative quality of sharing looks as the most convincing evidence for this hallmark in human infants [15] (defined as ‘looks directed to the [human] tester that could be seen to express a participant sharing experience’ [29]). However, such a qualitative evaluation of looks is not testable in animals, thereby resulting in an artificial human–animal divide [20,30]. Hence, we used traditional criteria that can be operationalized across a wide range of species and provide evidence for an establishment of common ground between interlocutors about their joint affair. We tested (i) whether successful communication resulted in coordinated joint engagement between interlocutors [4], (ii) whether recipients' responses indicated an understanding of the signaller's goal [27] and (iii) whether recipients responded communicatively to signallers [20,23].

2. Methods

(a). Study site and population

The study was conducted in the Shezaf Nature Reserve (30.718° N, 35.266° E) in Israel (permit number: 2014/40304, The Israel Nature and Parks Authority). The Arabian babbler population in the reserve has been studied since 1971. Individuals are well habituated to researchers and marked with a unique combination of coloured rings [17]. The majority of individuals' life histories and dominance ranks are known in detail [17,31].

(b). Behavioural observations

We carried out 86 observation sessions during the breeding season (February–June) of 2014, resulting in a total of 144.7 h of observation. Observations were conducted during the 4 h beginning when the group left its roosting tree at dawn and during the last 2 h of the day until the group roosted again. We observed a total of 80 birds (older than three months of age) from 12 different social groups. To document object presentations, we followed the dominant female during the days of egg-laying, which is the time period when most copulations occur [32]. object presentation was defined as an individual picking up an object with its beak and holding it in front of an opposite-sex conspecific [18]. babbler walks were documented by following fledglings during the three weeks after they had left the nest (recipients' age: 17–35 days). It was defined as an individual performing wing waving and/or beak gaping while vocalising and subsequently hopping or flying away (recording role: continuous recording [33]). Behaviours were recorded from a distance of 2–20 m using an HD camera (Canon Legria HFM 41). Visual recording was accompanied by detailed narration of the observed behaviours.

(c). Data coding and inter-observer reliability

The coding of videos was based on established coding schemes of bird [34] and great ape signalling [35] (see table 1 for the parameters coded). A second observer, blind to the hypotheses tested, coded 25% of object presentation and 17% of babbler walk episodes. Inter-observer reliability was tested by using Cohen's kappa for categorical variables [36] and Spearman's rank correlation for duration measurements. Due to tied observations, p-values of Spearman's rank correlations were estimated using 10 000 permutations. All inter-observer reliability tests were significant (p < 0.005). Cohen's kappa ranged from 0.80 to 1.00 and correlation coefficients from 0.86 to 0.93 (see electronic supplementary material, table S1 for results of inter-observer reliability tests).

(d). Statistical analyses

Statistical analyses were conducted using the program R v. 3.2.3 with the packages irr [37], parallel [38] and lme4 [39] (v. 1.1–11). All tests were two-tailed, and the significance level was set to α = 0.05. Due to vegetation cover and the mobile nature of the examined behaviours, data for some criteria were not always available. In these cases, we report a different sample size. Linear mixed models (LMMs) or generalized linear mixed models (GLMMs) [40] were used to account for pseudo-replication [41]. For each criterion tested, we used separate models for object presentation and babbler walk. Models included the signaller, recipient (both nested in a social group) and social group identities as random effects. The significance of each full model as compared to its corresponding null model (containing only the intercept and random effects) was examined using a likelihood ratio test (R function ‘anova’ with argument test ‘χ2’ [42]). We assessed the models' stability by excluding levels of the random effects one at a time from the dataset and comparing model estimates derived from these data with those derived from the full dataset. This did not reveal influential levels of random effects to exist. To keep type I error rate at the nominal level of 5%, we included random slopes which were likely to be identifiable [42,43] (see electronic supplementary material, table S2 for the random slopes included).

(i). Signalling elicits responses that are conducive to realizing the signaller's goal

To examine whether the behaviours of fledglings after the production of babbler walks were responses to these signals (rather than being fixed responses to the sight of a conspecific), they were compared to the behaviours of fledglings when an adult group member passed by without signalling. For this purpose, 10 adults were each observed for 20 min. We recorded the fledgling's response when the focal individual passed within 50 cm without performing a babbler walk (see electronic supplementary material, video S9). For a GLMM analysis, each episode was coded such that it included all six behaviours that were observed in the two conditions, and the response then indicated which was actually performed during an episode (zero for the behaviours not performed and one for the behaviour performed). The model included the fixed effect of condition, a random intercept for the particular episode, a random intercept for the behavioural responses (factor with six levels) and random intercepts for signaller and group identity. The key term in the model was the random slope [42,43] of condition within the response type, which accounts for the possibility that certain response types would be particularly likely in only one of the conditions. To account for the possible choices of the responses within an episode not being independent (i.e. only one could be performed), significance was established by means of a permutation test [44]. For this purpose, the chosen response type was randomized within episodes. A total of 1000 permutations in which we included the original data as one permutation were used. The standard deviation estimated for the random slope of condition-within-response-type was used as a test statistic. The p-value was finally estimated as the proportion of permutations that revealed a test statistic at least as large as that of the original data. Sample size was 1236 possible behaviours (i.e. six behaviours multiplied by 206 babbler walks and control events) involving 42 actors, 11 groups and six behaviours. The recipient's identity was not included as a random effect since it was not always available in the control sessions.

(ii). Persistence of signalling

Persistence of signalling was examined from two perspectives. First, the duration of signalling behaviour towards cooperative versus uncooperative recipients was compared. We set an LMM with the duration of signalling behaviour (seconds) as the response variable and the recipient's behaviour (cooperative/uncooperative) as fixed effect. Since the duration of signalling had a right-skewed distribution, it was log transformed to meet the assumptions of normally distributed and homogeneous residuals and to avoid influential cases. Sample size was a total of 114 babbler walks involving 38 signallers and 26 recipients from 11 groups.

Second, the probability that a signaller would re-signal or quit signalling was compared between episodes with cooperative versus uncooperative recipients. For this purpose, a GLMM with binomial error structure and logit link function [40,45] was used. The model included the signallers' behaviour (continue/quit signalling) as the response variable and the recipients’ behaviour (cooperative/uncooperative) as a fixed effect. babbler walks often involved several fledglings as recipients and in two out of 114 episodes not all recipients responded in the same way. We thus removed these two episodes from the data of this analysis to avoid ambiguity with regard to which behaviour the signaller responded. We then randomly selected one of the recipients from each of the remaining episodes to be included into the data. To obtain results unconditional on any particular random selection, we conducted 1000 random selections and report average model estimates. Sample size was a total of 112 babbler walks involving 28 signallers and 23 recipients from ten groups.

(iii). Elaboration of signalling

A GLMM with a binomial error structure and logit link function was used to compare the probabilities of signal elaboration in episodes with uncooperative versus cooperative recipients. It included the signallers' behaviour (elaborated/did not elaborate the signal) as the response variable and the recipients’ behaviour (cooperative/uncooperative) as a fixed effect. We excluded from the data a single episode in which recipients responded differently from one another. We randomly selected one recipient for each episode and ran 1000 selections as described above. In addition, as the response variable did not include episodes of elaboration towards a cooperative recipient, the model suffered from complete separation [46]. To cope with this, we adopted the approach of Goodale et al. [47]: we modified the data to make it include all possible results (i.e. also cases of elaboration towards cooperative recipients). Thus, for each of the 1000 selections described above, we altered one instance of the response. This was randomly chosen out of those 24 episodes in which the recipient was cooperative and the signaller did not elaborate its signalling behaviour to indicate a signaller that elaborated its signalling. We then fitted a model to this modified dataset and conducted a full-null model comparison. Note that this is a conservative approach since the difference between the two conditions (i.e. the number of cases in which the signaller elaborated its signal when communicating with a cooperative versus uncooperative recipient) is made slightly smaller than it actually was. We report average model estimates. Sample size was 100 babbler walks involving 23 signallers and 20 recipients from eight groups.

(iv). Co-orientation of attention

We examined whether gaze alternations were used to monitor the partner's behaviour and re-engage it when necessary. We used GLMMs that included the signaller's behaviour following its gaze alternation as the response variable (move further/return to signalling in front of the recipient or continue signalling from the same location) and the recipient's behaviour at the time of gaze (approached/did not approach the signaller) as a fixed effect. As communicative episodes usually involved several acts of gaze alternation, episode IDs were included as a random effect. A further random effect of signalling event identity was included in the babbler walk model to account for the fact that episodes often involved several recipients. Sample size for the object presentation model was a total of 157 gaze alternations during 50 episodes involving nine signallers, eight recipients and five groups. Sample size for the babbler walk model was a total of 299 gaze alternations during 116 episodes involving 40 signallers, 454 fledgling responses made by 26 recipients from ten groups.

3. Results

We documented a total of 55 object presentations. These were performed by six alpha males (n = 49 episodes), three subordinate males (n = 4 episodes) and one alpha female (n = 2) from six social groups.

We recorded 138 babbler walks that were performed by 42 caretakers from 11 social groups (21 males, 12 females and nine young birds that could not be sexed [17]). Signallers were older than two months of age and included dominant and subordinate group members.

(a). First-order intentional communication

(i). Voluntary signalling

babbler walks were systematically performed in the presence of dependent fledglings (less than 35 days old; 100%, n = 138). Our anecdotal observations suggest that the signal is occasionally used to solicit an adult conspecific to follow the signaller under ‘urgent’ conditions (for further information, see electronic supplementary material, table S3). Caretakers frequently moved close to fledglings without signalling (see control behaviour in table 2; electronic supplementary material, video S9). We could not identify any distinct behaviours performed by the fledglings that induced the production of this signal.

Table 2.

Responses to object presentation, babbler walk and their corresponding control behaviours. Data for object presentation are amended from [25].

number of responses for
recipient's response object presentation controla babbler walk control
follows the actor 27b 0 87b 2
follows and moves ahead of the actor in the same direction 10b,c 0 30b,c 0
bends over in a copulation posture (if object presentation produced at a hidden location) 3b,c 0
moves away from the actor 3 14 2 0
waits for actor to approach and opens beak 0 84 0 0
waits for actor to approach and refuses to open beak 0 10 0 0
other responses 2 0 6 0
no response (continues previous behaviour) 8 0 13 71
total 53 108 138 73
on arrival at a hidden location the recipient bends over for copulation posture without further signalling 15b,c
total 18

aAllofeeding attempt: individual standing and later moving towards another adult with a food item in its beak.

bResponses that were in line with the signaller's goal.

cResponses that promoted the signaller's goal beyond passive following.

(ii). Recipient-directed signalling

All babbler walks started when the signaller and recipient were already in sight of each other or after the signaller had actively repositioned itself to be within the recipient's sight (100%, n = 116). Moreover, whenever the signaller moved forward during signalling and left the recipients’ visual field due to vegetation, it returned to re-signal in front of the recipient (100%, n = 87; see electronic supplementary material, video S2).

(iii). Response to babbler walk

Ninety-one per cent of babbler walks elicited a voluntary response from the recipient (n = 138). The majority of first responses consisted of travelling with the signaller (94%). These responses were not fixed responses towards conspecifics as babbler walks elicited different responses compared to those elicited during the control condition in which a caretaker walked nearby a fledgling without signalling (permutation test: standard deviation of the random slope of condition within behaviour = 4.062, p = 0.001; see table 2; electronic supplementary material, video S9).

(iv). Inferring the signaller's goal

As object presentations stopped after copulation [18,25], we considered the signaller's goal as ‘to be followed for copulation’. As the majority of babbler walks stopped upon the recipient following the signaller to a new shelter (table 2), the signaller's goal was considered as ‘to be followed’.

(v). Goal-directedness

The duration of signalling was shorter towards a cooperative recipient than an uncooperative recipient (estimate ± s.e. = −0.62 ± 0.17, χ2 = 5.75, d.f. = 1, p < 0.02; figure 3a).

Figure 3.

Figure 3.

(a) Median, quartiles and quantiles (0.025 and 0.975) of the duration of babbler walks towards following and non-following recipients. (b) Number of episodes in which a signaller elaborated its babbler walk depending on whether the recipient followed it or not.

Signallers were less likely to persist with their signalling and also less likely to elaborate it when communicating with a cooperative than with an uncooperative recipient (persistence (means of 1000 GLMMs): estimate ± s.e. = −26.7 ± 27.1, χ2 = 106.17, d.f. = 1 all p-values < 0.001; elaboration (means of 1000 GLMMs): estimate ± s.e. = −6.38 ± 3.9, χ2 = 39.74, d.f. = 1, all p-values < 0.001 (see figure 3b; electronic supplementary material, tables S4 and S5 for the mean estimates of these models). Elaboration of babbler walks included one or more of the following modifications (n = 48): gaping the beak in front of the recipient (37 episodes; electronic supplementary material, video S4), producing long-lasting vocalizations (eight episodes), or adding wing movements to the vocalization (three episodes).

(b). Co-orientation of attention to the signaller's activity

(i). Monitoring the recipient's behaviour

While signalling, the signaller alternated its gaze from its own direction of movement to the recipient and back at least once during a communicative episode (object presentation: 98%, n = 51 episodes; babbler walk: 99%, n = 126 episodes; electronic supplementary material, video S5).

(ii). Co-orientation of the recipient's attention

Whether the recipient approached the signaller or not had a significant effect on the signaller's post-gaze-alternation behaviour (object presentation: estimate ± s.e. = −10.11 ± 4.84, χ2 = 12.82, d.f. = 1, p < 0.001; babbler walk: estimate ± s.e. = −27.43 ± 3.38, χ2 = 148.47, d.f. = 1, p < 0.001; electronic supplementary material, videos S6 and S7): if the signaller was approached, it was more likely to move forward than to re-signal from the same location or return to the recipient. However, if the signaller was not approached, it was more likely to re-signal from the same location or return to re-signal in front of the recipient than to move forward (figure 4).

Figure 4.

Figure 4.

Effects of the recipient's behaviour on the signaller's post-gaze-alternation behaviours. The number of the signaller's post-gaze-alternation responses according to the recipient's behaviour.

(c). Common ground

(i). Coordinated joint engagement

Eighty-two per cent of object presentations (n = 38) and 93% of babbler walks (n = 103) that elicited a voluntary response were followed by coordinated joint travel of interlocutors (see electronic supplementary material, figure S1). Median coordinated travel following object presentations lasted 22 s (14, 34.5; quartiles; n = 31) and 20 s following babbler walks (12, 38; quartiles; n = 96).

(ii). Understanding the signaller's apparently satisfactory outcome

In response to 40% of object presentations (n = 53) and 22% of babbler walks (n = 138), the recipient initiated at least one behaviour that contributed to the achievement of the signaller's goal beyond passive following (table 2).

(iii). Re-communication

In response to 16% of object presentations, the recipient picked up an object as well and moved together with the signaller (n = 55, electronic supplementary material, video S8).

4. Discussion

The present study revisited the claim that joint attentional skills are a uniquely human ability and tested whether these abilities require the unique combination of an ape-like cognitive system and intensive cooperative motives (figure 1) [2]. We investigated whether two distinct signals used by a cooperatively breeding bird species were characterized by key hallmarks traditionally used to infer joint attention skills in pre-linguistic human children: first-order intentional communication [21,22], co-orientation of attention [4,15] and common ground [20,23].

Concerning intentional communication, we found that signallers were able to use their signals voluntarily, took into consideration recipients' attentional states, and showed persistence and elaboration of signalling when appropriate. In response, recipients often followed the signallers, who consequently terminated their signalling. Arabian babblers, therefore, demonstrated convergent evidence for first-order intentional communication by fulfilling a number of key markers that have been widely applied to infer intentionality in human [6,27] and non-human species [22]. These results add to the growing evidence that first-order intentional communication is not restricted to the primate lineage [34]. Future studies should therefore develop testable markers to investigate higher-order intentionality in non-human species [48], a capacity that plays a crucial role in enabling the complexity of human communication [48].

Concerning co-orientation of attention, signallers regularly alternated their gaze towards receivers, adjusted their own behaviour in accordance with those of recipients and actively returned to re-signal in front of uncooperative recipients. Whether other avian species engage in active co-orientation of attention is currently unknown. Related findings stem however from captive great apes that co-orientate the attention of human caretakers to third entities [14,21]. A useful candidate to investigate co-orientation of attention is group hunting, which can frequently be observed in wild chimpanzees, spotted hyenas (Crocuta crocuta) and lions (Panthera leo) [49]. Though it has been argued that individuals may only monitor the behaviour of their group members via ‘checking looks’ [23], rather than actively coordinating each other's attention [3], the existing studies did not (in contrast to the present study) distinguish between checking looks and active co-orientation [50].

Concerning the hallmark of ‘common ground’, we found that following signalling, interlocutors often engaged in coordinated joint travel. Furthermore, several results show the recipients' understanding of the signallers' goal. First, recipients often responded by moving ahead of the signaller while continuing with the signaller's direction of movement (i.e. recipients not only understood the signaller's goal as to be followed but also the desired direction of joint travel). Second, in the majority of observed object presentations, it was the recipient who bent over for copulation upon arrival at a hidden location. This indicates an understanding of the signal as an invitation for a copulation in a hidden location. Third, in a sixth of all observed object presentations, recipients picked up an object and presented it back to the signaller. No further use was documented for these objects and coordinated joint engagements that followed these reciprocal presentations resulted in copulation three times faster than interactions involving presentation by the signaller only. This re-presentation of an object may thus serve as a communicative act used to communicate back [23] the recipient's willingness to engage in mating, thereby fulfilling the third criterion for common ground [23].

Cross-species comparisons of the common ground hallmark have been severely hampered by using markers that are not equally applicable to non-human species [15,20,23], resulting in lively debates [1,14,21]. The parameters applied in the present study may thus convince some scholars [30], while others may question whether interlocutors indeed experience the same thing at the same time and know together that they are doing this [1]. However, the use of criteria that are only testable in humans excludes the study of joint attention in non-human species from scientific inquiry [30,51]. Thus, in addition to the application of criteria that allow valid comparisons across species, we furthermore postulate that joint attention can be viewed as a layered system involving different cognitive skills. The integration of these approaches will not only enable us to gain a more comprehensive and quantitative view into joint attention skills, but may also shed light on the evolutionary trajectories of the different abilities involved.

In sum, our results show that the hallmarks commonly used to infer joint attention skills in humans also characterize communicative interactions of another cooperatively breeding animal species. These findings suggest that the ability to use joint attention—in its most traditional sense (i.e. the ability to intentionally attract one's attention and co-orient it to a common locus)—does not require an ‘ape-like cognitive system’ [2]. At the same time, these results do not imply that the processing abilities of an ape-sized brain do not additionally nurture the development of joint attention skills. Nevertheless, as long as complex mind reading, which may underlie human joint attention, is not testable in non-human species, the importance of an ape-like cognitive system for such a ‘rich’ notion of joint attention [30] will also remain non-testable.

One explanation for joint attention skills in distantly related species that rely on intensive alloparental care such as Arabian babblers and humans is convergent evolution. In both species, the evolution of joint attention may have been driven by selection pressures for systematic attendance to the needs of group members' offspring [2,8]. An alternative explanation may be that joint attention skills represent fundamental communicative capacities that frequently develop in social species. If the latter hypothesis is true, the suggestion that joint attention is uniquely human [1] might be the result of several biases in research: a strong focus on cognitive abilities of captive animals rather than individuals living under active selection pressures [5254], the paucity of studies on non-primate gestural communication [55], the use of different definitions [15,23] and the neglect of interactions that revolve around entities other than objects. Future research should tackle these alternative explanations by investigating closely related species which differ in their degree of cooperation while using a comparable methodology [7].

5. Conclusion

The present study provides the first evidence that crucial hallmarks characterizing joint attention behaviour in pre-linguistic human infants are not uniquely human. These results provide further support for the hypothesis that intensive cooperative motives affect socio-cognitive performance [2,34,56].

To better understand its evolution, we propose to view joint attention as a layered system involving different components rather than as a single capacity mastered only by a single species: humans. Such a view will enable us to tackle differences and similarities in cognitive skills that underlie co-orientation of attention across species [57].

Furthermore, to reconstruct the changes that paved the way for our sophisticated cognition and communication, we have to view the likely adaptations of early hominins in general, rather than with specific reference to other primates only [58]. In particular, examples of convergent evolution in distantly related species [59] may provide crucial clues to the types of problems that particular cognitive mechanisms were ‘designed’ to solve [60].

Supplementary Material

Tables S1-S5 and Figure S1
rspb20190147supp1.pdf (176.2KB, pdf)

Supplementary Material

Video S1
Download video file (87.3MB, mov)

Supplementary Material

Video S2
Download video file (108.4MB, mp4)

Supplementary Material

Video S3
Download video file (115.3MB, mp4)

Supplementary Material

Video S4
Download video file (78.1MB, mp4)

Supplementary Material

Video S5
Download video file (199.1MB, mp4)

Supplementary Material

Video S6
Download video file (109.4MB, mp4)

Supplementary Material

Video S7
Download video file (42.1MB, mp4)

Supplementary Material

Video S8
Download video file (104.6MB, mp4)

Supplementary Material

Video S9
Download video file (108.7MB, mp4)

Acknowledgements

We thank Amotz Zahavi and Avishag Zahavi for introducing the Arabian babblers to us and granting permission to carry out research at their study site. We are grateful to Eva Jablonka and Ehud Lamm for inspiring the early stages of this work. We thank Yael Alon, Arnon Datner, Nora Gottlieb, Dorit Narisna, Yonatan Narisna and Peter Santema for invaluable assistance in the field and for helpful discussion. For help with reliability coding, we thank Manuela Jäger. We are indebted to Christophe Boesch for providing us with stimulating working facilities at the Max Planck Institute for Evolutionary Anthropology. We thank Claudia Herf and her whole team for steady support during the study. For comments on earlier drafts and discussions, we thank Judith Burkart, Malinda Carpenter and Michael Tomasello.

Data accessibility

The data on which this study is based are available from Dryad Digital Repository: https://doi.org/10.5061/dryad.j3s5v66 [61].

Authors' contributions

Y.B.M. and S.P. designed the study. Y.B.M. collected the data. Y.B.M. and R.M. analyzed the data. Y.B.M., R.M. and S.P. wrote the paper.

Competing interests

We declare we have no competing interests.

Funding

This project was supported by grants from Tel-Aviv University, the DAAD and the IMPRS for Organismal Biology to Y.B.M., the Sofja Kovalevskaja- Award of the Alexander von Humboldt Foundation (www.humboldt-foundation.de) to S.P. and the DFG Centre of Excellence 2117 ‘Centre for the Advanced Study of Collective Behaviour’ (ID: 422037984).

References

  • 1.Tomasello M, Carpenter M. 2007. Shared intentionality. Dev. Sci. 10, 121–125. ( 10.1111/j.1467-7687.2007.00573.x) [DOI] [PubMed] [Google Scholar]
  • 2.Burkart JM, Hrdy SB, van Schaik CP. 2009. Cooperative breeding and human cognitive evolution. Evol. Anthropol. 18, 175–186. ( 10.1002/evan.20222) [DOI] [Google Scholar]
  • 3.Tomasello M, Carpenter M, Call J, Behne T, Moll H. 2005. Understanding and sharing intentions: the origins of cultural cognition. Behav. Brain Sci. 28, 675–735. ( 10.1017/S0140525X05000129) [DOI] [PubMed] [Google Scholar]
  • 4.Bakeman R, Adamson LB. 1984. Coordinating attention to people and objects in mother–infant and peer–infant interaction. Child Dev. 55, 1278–1289. ( 10.2307/1129997) [DOI] [PubMed] [Google Scholar]
  • 5.Morales M, et al. 2000. Responding to joint attention across the 6- through 24-month age period and early language acquisition. J. Appl. Dev. Psychol. 21, 283–298. ( 10.1016/S0193-3973(99)00040-4) [DOI] [Google Scholar]
  • 6.Bates E. 1979. The emergence of symbols: cognition and communication in infancy. New York, NY: Academic Press. [Google Scholar]
  • 7.Burkart JM, van Schaik CP. 2010. Cognitive consequences of cooperative breeding in primates? Anim. Cogn. 13, 1–19. ( 10.1007/s10071-009-0263-7) [DOI] [PubMed] [Google Scholar]
  • 8.Zuberbühler K. 2011. Cooperative breeding and the evolution of vocal flexibility. In The Oxford handbook of language evolution (eds Tallerman M, Gibson K), pp. 71–81. Oxford, UK: Oxford University Press. [Google Scholar]
  • 9.Hill K, Hurtado AM. 2009. Cooperative breeding in South American hunter–gatherers. Proc. R. Soc. B 276, 3863–3870. ( 10.1098/rspb.2009.1061) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Hrdy SB. 2007. Evolutionary context of human development: the cooperative breeding model. In Family relationships: an evolutionary perspective (eds Salmon CA, Shackelford TK), pp. 39–68. Oxford, UK: Oxford University Press. [Google Scholar]
  • 11.Pitman CA, Shumaker RW. 2009. Does early care affect joint attention in great apes (Pan troglodytes, Pan paniscus, Pongo abelii, Pongo pygmaeus, Gorilla gorilla)? J. Comp. Psychol. 123, 334–341. ( 10.1037/a0015840) [DOI] [PubMed] [Google Scholar]
  • 12.Pack AA, Herman LM. 2006. Dolphin social cognition and joint attention: our current understanding. Aquat. Mamm. 32, 443–460. ( 10.1578/AM.32.4.2006.443) [DOI] [Google Scholar]
  • 13.Pepperberg IM, McLaughlin MA. 1996. Effect of avian-human joint attention on allospecific vocal learning by grey parrots (Psittacus erithacus). J. Comp. Psychol. 110, 286–297. ( 10.1037/0735-7036.110.3.286) [DOI] [PubMed] [Google Scholar]
  • 14.Gómez J-C. 2005. Joint attention and the notion of subject: insight from apes, normal children, and children with autism. In Joint attention: communication and other minds, vol. 16, (eds Eilan N, Hoerl C, Teresa M, Johannes R), pp. 65–84. Oxford, UK: Oxford University Press. [Google Scholar]
  • 15.Carpenter M. 2012. Joint attention in humans and animals. In Encyclopedia of the sciences of learning (ed. NM Seel), pp. 1663–1664. Boston, MA: Springer; ( 10.1007/978-1-4419-1428-6_802) [DOI] [Google Scholar]
  • 16.Burkart JM, van Schaik CP. 2016. Revisiting the consequences of cooperative breeding. J. Zool. 299, 77–83. ( 10.1111/jzo.12322) [DOI] [Google Scholar]
  • 17.Zahavi A. 1990. Arabian babblers: the quest for social status in a cooperative breeder. In Cooperative breeding in birds long term studies of ecology and behaviour (eds Stacey PB, Koenig W), pp. 103–130. Cambridge, UK: Cambridge University Press. [Google Scholar]
  • 18.Ben Mocha Y, Mundry R, Pika S. 2018. Why hide? Concealed sex in dominant Arabian babblers (Turdoides squamiceps) in the wild. Evol. Hum. Behav. 39, 575–582. ( 10.1016/j.evolhumbehav.2018.05.009) [DOI] [Google Scholar]
  • 19.Milward SJ, Carpenter M. 2018. Joint action and joint attention: drawing parallels between the literatures. Soc. Pers. Psychol. Compass 12, e12377 ( 10.1111/spc3.12377) [DOI] [Google Scholar]
  • 20.Kaller T. 2012. Joint attention in wild chimpanzees and human infants: A comparative study. PhD thesis, University of York, York, UK. [Google Scholar]
  • 21.Leavens DA, Racine TP. 2009. Joint attention in apes and humans: are humans unique? J. Conscious. Stud. 16, 240–267. [Google Scholar]
  • 22.Townsend SW, et al. 2016. Exorcising Grice's ghost: an empirical approach to studying intentional communication in animals. Biol. Rev. 92, 1427–1433. ( 10.1111/brv.12289) [DOI] [PubMed] [Google Scholar]
  • 23.Carpenter M, Liebal K. 2011. Joint attention, communication, and knowing together in infancy. In Joint attention: new developments in psychology, philosophy of mind, and social neuroscience (ed. Seemann A.), pp. 159–181. Cambridge, MA: MIT Press. [Google Scholar]
  • 24.Bard KA, Keller H, Ross K, Butler L, Hewlett B. 2016. Coordinated joint engagement in human 1-year-olds sampled from different ecological-cultural settings. In 23rd Congr. Int. Assoc. Cross-Cultural Psychol. [Google Scholar]
  • 25.Ben Mocha Y, Pika S. In press Intentional presentation of objects in cooperatively breeding Arabian babblers (Turdoides squamiceps). Front. Ecol. Evol. [Google Scholar]
  • 26.Schel AM, Townsend SW, Machanda Z, Zuberbühler K, Slocombe KE. 2013. Chimpanzee alarm call production meets key criteria for intentionality. PLoS ONE 8, e76674 ( 10.1371/journal.pone.0076674) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Carpenter M, Nagell K, Tomasello M. 1998. Social cognition, joint attention and communicative competence from 9 to 15 months of age. Monogr. Soc. Res. Child Dev. 63 ( 10.2307/1166214) [DOI] [PubMed] [Google Scholar]
  • 28.Dennett DC. 1983. Intentional systems in cognitive ethology: the ‘Panglossian paradigm’ defended. Behav. Brain Sci. 6, 343–390. ( 10.1017/s0140525x00016393) [DOI] [Google Scholar]
  • 29.Hobson JA, Hobson RP. 2007. Identification: the missing link between joint attention and imitation? Dev. Psychopathol 19, 411–431. ( 10.1017/S0954579407070204) [DOI] [PubMed] [Google Scholar]
  • 30.Racine TP. 2011. Getting beyond rich and lean views of joint attention. In Joint attention: new developments in psychology, philosophy of mind, and social neuroscience (ed. Seemann A.), pp. 21–42. Cambridge, MA: Massachusetts Institute of Technology. [Google Scholar]
  • 31.Ben Mocha Y. 2014. The social and functional role of sentinel behavior in the Arabian babbler (Turdoides squamiceps). MSc thesis, Tel-Aviv University, Israel. [Google Scholar]
  • 32.Perel J. 1996. Competition for breeding between Arabian babbler males. MSc thesis, Tel-Aviv University, Israel. [Google Scholar]
  • 33.Martin P, Bateson P. 1994. Measuring behaviour: an introductory guide. New York, NY: Cambridge University Press. [Google Scholar]
  • 34.Pika S, Bugnyar T. 2011. The use of referential gestures in ravens (Corvus corax) in the wild. Nat. Commun. 2, 560 ( 10.1038/ncomms1567) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Pika S, Liebal K, Tomasello M. 2003. Gestural communication in young gorillas (Gorilla gorilla): gestural repertoire, learning, and use. Am. J. Primatol. 60, 95–111. ( 10.1002/ajp.10097) [DOI] [PubMed] [Google Scholar]
  • 36.Fleiss J, Levin B, Cho Paik M. 1981. Statistical methods for rates and proportions. New Jersey, NJ: John Wiley & Sons. [Google Scholar]
  • 37.Gamer M, Lemon J, Singh IFP. 2012. irr: Various coefficients of interrater reliability and agreement. See https://rdrr.io/cran/irr.
  • 38.R Core Team. 2017. R: a language and environment for statistical computing. Vienna, Austria: R Foundation for Statistical Computing. [Google Scholar]
  • 39.Bates D, Maechler M, Bolker B, Walker S. 2015. Fitting linear mixed-effects models using lme4. J. Stat. Softw. 67, 1–48. ( 10.18637/jss.v067.i01) [DOI] [Google Scholar]
  • 40.Baayen RH. 2008. Analyzing linguistic data: a practical introduction to statistics using R. Cambridge, UK: Cambridge University Press. [Google Scholar]
  • 41.Waller BM, Warmelink L, Liebal K, Micheletta J, Slocombe KE. 2013. Pseudoreplication: a widespread problem in primate communication. Anim. Behav. 86, 483–488. ( 10.1016/j.anbehav.2013.05.038) [DOI] [Google Scholar]
  • 42.Schielzeth H, Forstmeier W. 2009. Conclusions beyond support: overconfident estimates in mixed models. Behav. Ecol. 20, 416–420. ( 10.1093/beheco/arn145) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Barr DJ, Levy R, Scheepers C, Tily HJ. 2013. Random effects structure for confirmatory hypothesis testing: keep it maximal. J. Mem. Lang. 68, 255–278. ( 10.1016/j.jml.2012.11.001) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Adams DC, Anthony CD. 1996. Using randomization techniques to analyse behavioural data. Anim. Behav. 51, 733–738. ( 10.1006/anbe.1996.0077) [DOI] [Google Scholar]
  • 45.McCullagh P, Nelder JA. 1989. Generalized linear models. London, UK: Chapman & Hall. [Google Scholar]
  • 46.Field A. 2005. Discovering statistics using SPSS. London, UK: Sage Publications; ( 10.1016/j.landurbplan.2008.06.008) [DOI] [Google Scholar]
  • 47.Goodale E, Ratnayake CP, Kotagama SW. 2014. Vocal mimicry of alarm-associated sounds by a drongo elicits flee and mobbing responses from other species that participate in mixed-species bird flocks. Ethology 120, 266–274. ( 10.1111/eth.12202) [DOI] [Google Scholar]
  • 48.Scott-Phillips TC. 2015. Nonhuman primate communication, pragmatics, and the origins of language. Curr. Anthropol. 56, 56–80. ( 10.1086/679674) [DOI] [Google Scholar]
  • 49.Schuster R. 2005. Why not chimpanzees, lions, and hyenas too? Behav. Brain Sci. 28, 716–717. ( 10.1017/S0140525X05490123) [DOI] [Google Scholar]
  • 50.Boesch C, Boesch H. 1989. Hunting behavior of wild chimpanzees in the Tai’ National Park. Am. J. Phys. Anthropol. 78, 547–573. ( 10.1002/ajpa.1330780410) [DOI] [PubMed] [Google Scholar]
  • 51.Popper K. 1957. Philosophy of science: a personal report. London, UK: Allen and Unwin. [Google Scholar]
  • 52.Tomasello M, Call J, Hare B. 1998. Five primate species follow the visual gaze of conspecifics. Anim. Behav. 55, 1063–1069. ( 10.1006/anbe.1997.0636) [DOI] [PubMed] [Google Scholar]
  • 53.Leavens DA, Bard KA, Hopkins WD. 2017. The mismeasure of ape social cognition. Anim. Cogn. ( 10.1007/s10071-017-1119-1) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Boesch C. 2007. What makes us human (Homo sapiens)? The challenge of cognitive cross-species comparison. J. Comp. Psychol. 121, 227–240. ( 10.1037/0735-7036.121.3.227) [DOI] [PubMed] [Google Scholar]
  • 55.Pika S. 2012. The case of referential gestural signaling: where next? Commun. Integr. Biol. 5, 578–582. ( 10.4161/cib.22012) [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Vygotsky LS. 1978. Mind in society. Cambridge, MA: Harvard University Press. [Google Scholar]
  • 57.Pika S. 2017. Unpeeling the layers of communicative complexity. Anim. Behav. 134, 223–227. ( 10.1016/j.anbehav.2017.07.002) [DOI] [Google Scholar]
  • 58.Lovejoy CO. 2009. Reexamining human origins in light of Ardipithecus ramidus. Science 326, 74–74e8. ( 10.1126/science.1175834) [DOI] [PubMed] [Google Scholar]
  • 59.Hauser MD, Chomsky N, Fitch WT. 2002. The faculty of language: what is it, who has it, and how did it evolve? Science 298, 1569–1579. ( 10.1126/science.298.5598.1569) [DOI] [PubMed] [Google Scholar]
  • 60.Gould SJ. 1976. Grades and clades revisited. In evolution, brain, and behavior: persistent problems (eds Masterton RB, Hodos W, Jerison H), pp. 115–122. New York, NY: Wiley. [Google Scholar]
  • 61.Ben Mocha Y, Mundry R, Pika S. 2019. Data from: Joint attention skills in wild Arabian babblers (Turdoides squamiceps): a consequence of cooperative breeding? Dryad Digital Repository. ( 10.5061/dryad.j3s5v66) [DOI] [PMC free article] [PubMed]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Citations

  1. Ben Mocha Y, Mundry R, Pika S. 2019. Data from: Joint attention skills in wild Arabian babblers (Turdoides squamiceps): a consequence of cooperative breeding? Dryad Digital Repository. ( 10.5061/dryad.j3s5v66) [DOI] [PMC free article] [PubMed]

Supplementary Materials

Tables S1-S5 and Figure S1
rspb20190147supp1.pdf (176.2KB, pdf)
Video S1
Download video file (87.3MB, mov)
Video S2
Download video file (108.4MB, mp4)
Video S3
Download video file (115.3MB, mp4)
Video S4
Download video file (78.1MB, mp4)
Video S5
Download video file (199.1MB, mp4)
Video S6
Download video file (109.4MB, mp4)
Video S7
Download video file (42.1MB, mp4)
Video S8
Download video file (104.6MB, mp4)
Video S9
Download video file (108.7MB, mp4)

Data Availability Statement

The data on which this study is based are available from Dryad Digital Repository: https://doi.org/10.5061/dryad.j3s5v66 [61].


Articles from Proceedings of the Royal Society B: Biological Sciences are provided here courtesy of The Royal Society

RESOURCES