Skip to main content
UKPMC Funders Author Manuscripts logoLink to UKPMC Funders Author Manuscripts
. Author manuscript; available in PMC: 2015 Nov 6.
Published in final edited form as: Dev Psychol. 2011 Sep 26;47(6):1499–1503. doi: 10.1037/a0025659

Automated Gaze-Contingent Objects Elicit Orientation Following in 8-months-old Infants

Fani Deligianni 1,2, Atsushi Senju 1, György Gergely 3, Gergely Csibra 1,3
PMCID: PMC4636044  EMSID: EMS65884  PMID: 21942669

Abstract

The current study tested whether the purely amodal cue of contingency elicit orientation following behaviour in 8-months-old infants. We presented 8-month-old infants with automated objects without human features that did or did not react contingently to the infants' fixations recorded by an eye-tracker. We found that an object's occasional orientation towards peripheral targets was reciprocated by a congruent visual orientation following response by infants only when it had displayed gaze-contingent interactivity. Our finding demonstrates that infants' gaze following behaviour does not depend on the presence of a human being. The results are consistent with the idea that the detection of contingent reactivity, like other communicative signals, can itself elicit the illusion of being addressed in 8-months-old infants.

Keywords: Infants, Eye-tracking, Contingency, Gaze Following, Communicative Intentions


Gaze following is an extensively studied phenomenon of human infant behaviour. In naturalistic observations, infants spontaneously shift their gaze to the same object that their interactive partner is looking at from about 10 months of age (Carpenter, Nagell, & Tomasello, 1998). Such a behaviour has been suggested to indicate that infants (1) identified the other as an intentional agent (e.g., Johnson, 2003), (2) understand visual perception (e.g., Meltzoff & Brooks, 2008), (3) wish to share attention with the adult (e.g., Carpenter, Nagell, & Tomasello, 1998), (4) have learnt that gaze shifts predict interesting events (Moore & Corkum, 1994), or (5) interpret gaze as a communicative-referential signal (e.g., Csibra & Gergely, 2009). These explanations of gaze-following behaviour are not necessarily mutually exclusive. However, their perspective on its functional significance in early social cognitive development differs considerably.

To understand this phenomenon better, one has to study the early development of gaze following and the exact conditions that elicit such behaviour. Gaze following has been shown to occur in 3- to 6-month-old infants in real-life-like situations (D'Entremont, Hains, & Muir, 1997), can be demonstrated in the laboratory at 7 and 8 months of age (Woodward, 2003; Csibra & Volein, 2008), by eye-tracking techniques at 6 months (Gredebäck, Theuring, Hauf, & Kenward, 2008; Senju & Csibra, 2008), and is even detectable in newborns' response tendencies (Farroni, Massaccesi, Pividori, Simion, & Johnson, 2004). It has been shown that gaze following is facilitated by verbal and manual cues that accompany gaze shifts (Flom, Deák, Phill, & Pick, 2004), but there is no evidence that it would be influenced by emotional signals (Flom & Pick, 2005). Furthermore, young infants do not seem to follow gaze in the absence of preceding communicative cues, like eye contact or infant-directed speech (Senju & Csibra, 2008), which are normally employed in infants' interactions with a live partner.

The functional significance of this behaviour may be best exposed by situations in which gaze following occurs in the absence of human “gaze”, i.e., without the presence of a human face or its realistic or schematic representation in dynamic or static form. In particular, temporal contincency, which is not tied to any sensory modality, and hence could be considered as an amodal feature of the stimulus environment, has been shown to facilitate gaze following. Ten- and 12-month-old infants tend to follow the orientation of a robot or a non-human faceless puppet after it had responded contingently to the infants' behaviours (Johnson, Slaughter, & Carey, 1998; Movellan & Watson, 1987). First-person (R-S) contingent reactivity of an object can be conceived either as a cue of perceptual and attentional abilities (Movellan & Watson, 2002; Johnson, 2003) or as a cue of communicative intention (Csibra, 2010), or both. Either way, that such an abstract, amodal signal can act as a triggering condition of orientation following suggests that this response may play a functional role in the development of skills for social interactions that characteristically exhibit such temporal adjustments.

The present study aimed to establish whether infants even as young as 8 months of age are sensitive to the purely amodal cue of contingent reactivity that their responses induce in a stimulus object with no human features. Our study differs from earlier studies in a number of significant respects. We employed non-human objects (animated on a computer screen) with distinct motion and sound (Figure 1a). These objects acted as either contingent or non-contingent agents, but they did not possess any human-like features, and their behaviour was fully automated and could not be contaminated by experimenter-induced human biases. The automated eye-tracker methodology we employed allowed us to exert better control over the pattern and degree of contingent reactivity than what was possible in earlier studies with 10- and 12-month-olds (Movellan & Watson, 1987; Johnson et al., 1998). In these previous studies the pattern and degree of response-stimulus contingencies induced, while clearly high-but-imperfect overall, were nevertheless variable in degree over time, involved variable combinations of different responses by the infant (such as vocalizations and sudden movements detected by the experimenter) and variations in the types and combinations of stimulus effects that these responses induced in the stimulus object (beeping, flashing lights, or turning away). In contrast, by employing a high-precision interactive eye-tracking technique with a gaze-contingent display we could use young infants’ spontaneous gaze-fixations of the stimulus object as the single type of response that induced the pattern of contingent reactivity in the object. The induced stimulus effect in the target object on the computer screen always involved a distinctive animated motion pattern and a sound response, both with fixed durations. In the contingent condition, each time the infant fixated the agent in the middle of the screen the latter produced a specific contingent response. Following a number of such contingent events the agent produced a novel response: it changed its orientation to simulate turning towards another non-contingent target object positioned either at the lower left or right side of the screen (Figure 1b). For the control condition, the playback of this animation was shown to the next infant in a yoked design.

Figure 1.

Figure 1

Stimulus displays in the experiment. (a) During the familiarization phase all five objects were present. The objects in the four corners were animated unpredictably. The central object responded to the infant's gaze contingently in the Interactive/Contingent (IC) condition but was independent of the infant's gazing behavior in the Non-interactive/Playback (NP) condition. (b) Stimulus configuration during the last five seconds of a trial during the test phase. The region of interest (ROI) of the central object is shown as an ellipse of empty circles. In the IC condition, each time the infant fixated inside this ROI the central object animated. The additional ellipses made of asterisks and crosses cover the extra-space occupied by the central object when it turned to one side. The solid line around the corner objects defines their ROIs.

Methods

Participants

Thirty-six 8-month-old infants (16 female, 20 male) completed the study. Their mean age was 244.2 days (range: 228–261 days). Eighteen infants (7 female, 11 male, mean age: 242.1 days) were assigned for the interactive/contingent (IC) condition, and another eighteen infants (9 female, 9 male, mean age: 246.3 days) were assigned for the non-interactive/playback (NP) condition. A further 8 infants were excluded from the analyses because of inattentiveness and technical problems. All infants were born full term and were recruited by advertisements in local magazines for parents.

Apparatus

A Tobii 1750 Eye Tracker (Stockholm, Sweden) was used to collect gaze data. Stimulus was displayed by MATLAB, using the Psychophysics Toolbox extensions (Brainard, 1997; Pelli, 1997). A specially developed interface between MATLAB and the Tobii 1750's TET server (Talk2Tobii) allowed online collection and recording of the gaze direction during stimulus presentation.

Stimuli and Procedure

The stimulus-presentation started with five 3D-objects, one at the centre and four at each corner of a screen of black background (Figure 1a). The 3D objects were adapted from the Fribble stimulus set (Tarr, 2003). Each of these objects was associated with a distinctive animation scheme and a sound, with duration of 500 ms and 400 to 500 ms, respectively. The animation of each object was implemented based on linear morphing between two predefined shape instances. Each of the corner objects was animated independently from each other at a rate of once in every 10 s with uniform probability distribution. A distinctive colour was also associated with each of the four corner objects. In order to get more attention to the central object, its colour changed continuously, smoothly and cyclically from green to red to blue throughout the experiment.

The experiment consisted of a familiarization and a test phase. In the IC condition, the central object behaved contingently on the infant's looking pattern. When the infant’s gaze landed on the central ROI for at least 150 ms (an ellipse around the central object, see Figure 1b), it waited for another interval randomly chosen between 0 and 150 ms, and became animated if the infant was still looking at it at the end of this interval. If the infant looked at the central object without interruption for 2 s, it generated an additional response. After four responses, the two corner objects at the upper part of the screen disappeared, the familiarization phase ended and the test phase started immediately.

The test phase was divided into trials. Each trial included a training phase and a response phase. The training phase was identical to the familiarization phase with the exception that only three objects were present. The training phase ended when the infant's behaviour elicited three contingent animations of the central object. Thus, the actual duration of a trial varied according to infants' looking behaviour. The response phase started 200 ms later, when the central object “turned” either towards the left or to the right bottom object (Figure 1b). The duration of the turning was 1 s, and the object remained in its final position for an additional 5 s. The other two objects remained still during the response phase. In each trial, the shapes of the two corner objects were selected randomly from the four initial object shapes. The direction of the turning of the central object varied quasi-randomly between left and right, ensuring balanced occurrence of both sides. The exact parameters of each animation frame were recorded and saved along with an accurate presentation timestamp. At the end of the trial, all objects disappeared from the screen and colourful swirling geometric shapes were presented to attract infants' attention to the screen. The experimenter manually controlled the start of each trial.

In the IC condition, the experiment lasted as long as the infants were attentive. In the NP condition, each infant was yoked with an infant in the IC condition, and watched the exact same animation sequence that the yoked IC partner produced. In the NP condition, the experiment was terminated either when the playback was over or when the baby became inattentive.

Data Analysis

To analyze the eye-tracking data, saccades and fixations were calculated based on a velocity-based saccade extraction algorithm (Duchowski, 2007). Regions of interest (ROIs) were defined as showing on Figure 1b. Three measures of preferential looking were extracted automatically from the eye-tracking data based on the pre-defined ROIs during the last five seconds of each trial. The first measurement is based on the direction of the first saccade from the central ROI, which included the additional to cover the 'nose' of the turning object (see Figure 1b, to one of the side objects' ROI, showing either congruent or incongruent response with the central object's turning direction. The normalized difference between congruent and incongruent saccades was calculated for each infant by subtracting the number of trials with incongruent saccade from the number of trials with congruent saccades, and divided this value by the total number of trials that generated a central object-to-corner object saccade (Corkum & Moore, 1998; Moore & Corkum, 1998). As a second measure, the frequency of central-to-corner object saccades was estimated based on the difference between the number of saccades towards the congruent object and that towards the incongruent object. Finally, the total duration of the fixation on each corner object was calculated. The frequency and duration measures were normalized the same way as the first measure. All reported statistics are two-tailed with alpha = .05.

Results

In the IC condition, an average of 17.2 trials (min: 12, max: 31) were completed. In the NP condition 15 infants completed all trials, while three infants watched 10 out of 21, 14 out of 22 and 15 out of 31 trials. On average, the number of valid trials was 16.2 (min 9, max 31) in the IC condition, and 14.1 (min 7, max 22) in the NP condition. A trial was defined as valid if the infant looked at the central object at least once during its turning.

We measured the proportion of looking times towards the central object at various phases of the events to assess the amount of attention infants paid to this object. During the whole experiment, infants fixated at the central object for 52.34 % and 50.82 % of the time in the IC and NP condition, respectively. This difference is not statistically significant (t(17) = 1.24, p = .231). (When comparing the two groups, we used paired t-tests, treating the infant in the IC condition and his/her NP yoked partner as a single statistical unit.) During the animation of the central object, infants fixated this object more in the IC (75.57 % of time) than in the NP (58.53 % of time) condition (t(17) = 3.59, p = 002). This is not suprising, as animation started only when infants had already fixated this object in the IC condition, while in the NP condition infants most of the times had to make a saccade to the animated object. However, we found no statistically significant difference (t(17) = 0.17, p = .863) between the two groups in their fixation time at the central object during the turning phase (IC: 54.76 %, NP: 55.49 %). Thus, all infants had equal chance to observe and follow the orientation of the central object.

In the IC condition, preferential looking towards the contingent object was significantly above chance level for all three measures. Infants were significantly more likely to direct their first saccades towards the target object (t(17) = 4.07, p < .001, d = 1.11), look more often toward (t(17) = 3.86, p = .001, d = 0.91), and longer (t(17) = 4.06, p < .001, d = 0.96) at it (Figure 2). However, none of these measures differed from baseline in the NP condition (direction of first saccade: t(17) = 0.68, p = .506, d = 0.16; frequency of fixations: t(17) = 0.24, p = .813, d = 0.06; duration of fixations: t(17) = 0.41, p = .687, d = 0.10).

Figure 2.

Figure 2

Measures of preferential looking. Average normalized difference scores are depicted for direction of the first central-to-corner-object saccade (First), duration of object fixation (Duration), and frequency of object directed saccades (Frequency) in each condition. IC and NP stand for Interactive/Contingent and Non-interactive/Playback condition, respectively. **p < .01, *p < .05, p < .075. Error bars represent standard error.

We also compared the measures of orientation following between the conditions. These tests demonstrated a significant difference in the frequency of saccades towards (t(17) = 2.69, p = .016, d = 0.63), and in the duration of fixations on (t(17) = 2.51, p = .023, d = 0.59) the target object (Figure 2). For the direction of the first saccade, the test indicated a marginally significant difference (t(17) = 1.91, p = .073, d = 0.45). We also calculated non-parametric Fischer's exact tests to compare the number of infants who produced positive vs. negative difference scores on the three measures in the two conditions (here we ignored infants whose difference score was 0). In the IC condition, 15 infants produced more first saccades towards the target than to the other object, while only 1 infant showed the opposite pattern. In the NP condition the corresponding numbers are 10 and 7, respectively, and the difference between groups was statistically significant (p = 0.039). Similar calculations resulted in significant group difference for the duration of fixations (IC: 15 vs 3, NP: 8 vs 10; p = 0.035), but not for saccade frequencies (IC: 13 vs. 4, NP: 7 vs. 8; p = 0.127). These results generally confirm those of the parametric tests while suggesting substantial individual variability in various parameters of eye-movement patterns.

Discussion

We found that 8-months-old human infants followed the turning of an object towards a target stimulus, but they did so only if the object had repeatedly responded to them in a contingent fashion. Furthermore, the response-induced contingent reactivity of the object was itself a sufficient cue to induce the infants’ orientation-following response as the mechanically animated agent on the screen had no human features and was operated automatically without human intervention. In contrast, observing an accurate playback of what another infant had seen earlier in the contingent condition, did not elicit such orientation-following reactions in the non-contingent condition. Thus, the current results extend previous findings (Johnson et al., 1998; Movellan & Watson, 1987) by establishing that infants as young as 8 months of age are sensitive to the purely amodal cue of contingent reactivity that is sufficient in and of itself to elicit the orientation-following gaze response from infants.

The current results suggest that infants’ ability to detect contingency and consequent tendency to follow the direction of the responsive agent’s subsequent target-oriented behaviour are not dependent on the presence of any other social cues, such as faces or human voice, or on the communicative nature of the behaviour (such as vocalization) that themselves could trigger the infant’s orientation following response (see Senju & Csibra, 2008, for a similar orientation-following effect as a function of preceding direct eye-contact or infant-directed speech in 6-month-olds). In the present study the contingent reactivity of a 2D object without human features on a computer screen programmed to repeatedly react to being visually fixated with the production of an identical motion and acoustic response, proved in itself sufficient to elicit subsequent ‘gaze-following’ behaviour in the infants. This finding suggests that the 8-month-olds may have interpreted such an object as an agent with communicative and referential intention (Csibra & Gergely, 2006, 2009) towards them and/or attributed attentional and perceptual abilities (Johnson, 2003) and a ‘line of regard’ (Movellan and Watson, 2002) to it. In order to succeed, infants did not only have to pay attention to what the central object did but also had to be able to detect its direction of turning, and had to interpret the protubarence on the object as a potential device of perception (like an eye) or reference (like a pointing hand). For this purpose, 'Fribbles' offered an optimal stimulus class, as they have an unambiguously marked direction without exhibiting featural similarity to humans.

These results are consistent with the idea that the abstract temporal structure of turn-taking contingency (i.e., alternating and temporally self-adjusted pattern of activity of a stimulus source) functions as an amodal cue of ostensive communication that is not tied to any particular modality-specific instantiation, and could generate referential expectation in infants in the same way as do other ostensive signals (such as eye-contact or infant-directed speech) that function to communicatively address the infant (e. g., Csibra, 2010). Because young infants follow human gaze shifts only when they are preceded by an ostensive signal (Senju & Csibra, 2008), the current results support the idea that turn-taking contingent reactivity is perceived as functionally equivalent with other infant-directed communicative signals, and may generate an illusion of perceived communicative intention attributed to the non-human gaze-contingent object.

What is the benefit of relying on such an abstract and amodal structural cue as an ostensive signal? Different communicative signals usually occur together, especially when they are directed to infants. Such multi-modal stimuli support each other, and turn-taking contingency can disambiguate whether a signal in a sensory modality (e.g., speech) is directed to the infant. Indeed, contingency can serve as a training signal that helps the infant figure out, and eventually learn, whether an unusual and unfamiliar stimulus configuration is in fact an ostensive signal. For example, while newborns can detect direct gaze in a full frontal face (Farroni, et al., 2002), they are unable to do so when a face is turned 3/4 to the side (Farroni, Menon, & Johnson, 2006). Evidence shows that by 4 months of age, they can detect whether the eyes in a face that is oriented away look at them (Farroni, Johnson, & Csibra, 2004), and the neural responses to this configuration suggest that they treat it the same way as other ostensive stimuli, like smiles (Grossmann, et al., 2008). Turn-taking contingency, which normally accompany eye contact and infant-directed communication, may contribute to such fine-tuning of the recognition of modality-specific ostensive signals.

Acknowledgements

We thank John S. Watson for helpful comments. This work is supported from James S. McDonnell Foundation, an EU6 Framework Programme (NEUROCOM grant 12738) and an ERC Advanced Investigator Grant (OSTREFCOM).

References

  1. Brainard DH. The psychophysics toolbox. Spatial Vision. 1997;10:433–436. [PubMed] [Google Scholar]
  2. Carpenter M, Nagell K, Tomasello M. Social cognition, joint attention, and communicative competence from 9 to 15 months of age. Monographs of the Society for Research in Child Development. 1998;63(4) [PubMed] [Google Scholar]
  3. Cooper RP, Aslin R. Preference for infant-directed speech in the first month of birth. Child Development. 1990;61(5):1584–1595. [PubMed] [Google Scholar]
  4. Corkum V, Moore C. The origins of joint visual attention in infants. Developmental Psychology. 1998;34:23–38. doi: 10.1037/0012-1649.34.1.28. [DOI] [PubMed] [Google Scholar]
  5. Csibra G. Recognizing communicative intentions in infancy. Mind & Language. 2010;25:141–168. [Google Scholar]
  6. Csibra G, Gergely G. Social learning and social cognition: The case for pedagogy. In: Munakata Y, Johnson MH, editors. Processes of Change in Brain and Cognitive Development. Attention and Performance XXI. Oxford University Press; Oxford: 2006. pp. 249–274. [Google Scholar]
  7. Csibra G, Gergely G. Natural Pedagogy. Trends in Cognitive Sciences. 2009;13:148–153. doi: 10.1016/j.tics.2009.01.005. [DOI] [PubMed] [Google Scholar]
  8. Csibra G, Volein A. Infants can infer the presence of hidden objects from referential gaze information. British Journal of Developmental Psychology. 2008;26:1–11. [Google Scholar]
  9. D'Entermont B, Hains SMJ, Muir DW. A demonstration of gaze following in 3-to 6- months-olds. Infant Behavior and Development. 1997;20:569–572. [Google Scholar]
  10. Duchowski AT. Eye Movement Analysis. Eye Tracking Methodology: Theory & Practice. 2007.
  11. Farroni T, Csibra G, Simion F, Johnson MH. Eye contact detection in humans from birth. PNAS. 2002;99(14):9602–9605. doi: 10.1073/pnas.152159999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Farroni T, Johnson MH, Csibra G. Mechanisms of Eye Gaze Perception during Infancy. Journal of Cognitive Neuroscienc. 2004;16(8):1320–1326. doi: 10.1162/0898929042304787. [DOI] [PubMed] [Google Scholar]
  13. Farroni T, Massaccesi S, Pividori D, Simion F, Johnson MH. Gaze following in newborns. Infancy. 2004;5:39–60. [Google Scholar]
  14. Farroni T, Menon E, Johnson MH. Factors influencing newborns' preference for faces with eye contact. Journal of Experimental Child Psychology. 2006;95(4):298–308. doi: 10.1016/j.jecp.2006.08.001. [DOI] [PubMed] [Google Scholar]
  15. Flom R, Deák GO, Phill CG, Pick AD. Nine-month-olds’ shared visual attention as a function of gesture and object location. Infant Behavior and Development. 2004;27:181–194. [Google Scholar]
  16. Flom R, Pick AD. Experimenter affective expression and gaze following in 7-month-olds. Infancy. 2005;7:207–218. doi: 10.1207/s15327078in0702_5. [DOI] [PubMed] [Google Scholar]
  17. Grossmann T, Johnson MH, Lloyd-Fox S, Blasi A, Deligianni F, Elwell C. Early cortical specialization for face-to-face communication in human infants. Proceedings of the Royal Society B: Biological sciences. 2008;275(1653):2803–2811. doi: 10.1098/rspb.2008.0986. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Gredebäck G, Theuring C, Hauf P, Kenward B. The microstructure of infants' gaze as they view adult shifts in overt attention. Infancy. 2008;13:533–543. [Google Scholar]
  19. Johnson S. Detecting agents. Philosophical Transactions of the Royal Society B. 2003;358:549–559. doi: 10.1098/rstb.2002.1237. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Johnson S, Slaughter V, Carey S. Whose gaze will infants follow? The elicitation of gaze-following in 12-month-olds. Developmental Science. 1998;1(2):233–238. [Google Scholar]
  21. Meltzoff AN, Brooks R. Self-experience as a mechanism for learning about others: A training study in social cognition. Developmental Psychology. 2008;44:1257–1265. doi: 10.1037/a0012888. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Moore C, Corkum V. Infant gaze following based on eye direction. British Journal of Developmental Psychology. 1998;16:495–503. [Google Scholar]
  23. Movellan JR, Watson JS. Perception of directional attention. Paper presented at the Infant Behavior and Development: Abstracts of the Sixth International Conference of Infant Studies.1987. [Google Scholar]
  24. Movellan JR, Watson JS. The development of gaze following as a Bayesian systems identification problem. 2002. UCSD Machine Perception Laboratory Technical Reports 2002.01.
  25. Pelli DG. The VideoToolbox software for visual psychophysics: Transforming numbers into movies. Spatial Vision. 1997;10:437–442. [PubMed] [Google Scholar]
  26. Senju A, Csibra G. Gaze following in human infants depends on communicative signals. Current Biology. 2008;18(9):668–671. doi: 10.1016/j.cub.2008.03.059. [DOI] [PubMed] [Google Scholar]
  27. Tarr MJ. Perception of faces, objects, and scenes: Analytic and holistic processes. In: Peterson MA, Rhodes G, editors. Visual Object Recognition: Can a single mechanism suffice? Oxford University Press; Oxford, UK: 2003. pp. 177–211. [Google Scholar]
  28. Woodward AL. Infants’ developing understanding of the link between looker and object. Developmental Science. 2003;6:297–311. [Google Scholar]

RESOURCES