Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2022 Sep 1.
Published in final edited form as: Dev Sci. 2021 Feb 24;24(5):e13088. doi: 10.1111/desc.13088

All contexts are not created equal: Social stimuli win the competition for organizing reinforcement learning in 9-month-old infants

Denise Werchan 1, Dima Amso 2
PMCID: PMC8298591  NIHMSID: NIHMS1667225  PMID: 33484594

Abstract

Previous work has shown that infants as young as 8 months of age can use certain features of the environment, such as the shape or color of visual stimuli, as cues to organize simple inputs into hierarchical rule structures, a robust form of reinforcement learning that supports generalization of prior learning to new contexts. However, especially in cluttered naturalistic environments, there are an abundance of potential cues that can be used to structure learning into hierarchical rule structures. It is unclear how infants determine what features constitute a higher-order context to organize inputs into hierarchical rule structures. Here we examine whether 9-month-old infants are biased to use social stimuli, relative to non-social stimuli, as a higher-order context to organize learning of simple visuospatial inputs into hierarchical rule sets. Infants were presented with four face/color-target location pairings, which could be learned most simply as individual associations. Alternatively, infants could use the faces or colorful backgrounds as a higher-order context to organize the inputs into simpler color-location or face-location rules, respectively. Infants were then given a generalization test designed to probe how they learned the initial pairings. Results indicated that infants appeared to use the faces as a higher-order context to organize simpler color-location rules, which then supported generalization of learning to new face-contexts. These findings provide new evidence that infants are biased to organize reinforcement learning around social stimuli.


The ability to structure learning and action in ways that promote the procurement of favorable outcomes is essential for adaptive behavior across the lifespan. To support this ability, a learner can use relevant cues to structure learning into flexible representations that prevent interference and support generalization of prior knowledge. For instance, an infant raised in a bilingual environment may receive conflicting inputs when learning new information (e.g., sometimes a red round object is called “apple” and other times it is called “manzana”). To learn these conflicting inputs, the infant may use cues, such as the speaker’s identity, as a higher-order context to dissociate and organize inputs into flexible rule structures (Figure 1A). Yet, it is unclear how infants determine what stimuli constitute a higher-order context to guide learning. Here we examine whether infants are biased to use social stimuli to organize inputs into hierarchical rule sets.

Figure 1.

Figure 1.

Examples of a hierarchical structure in a real-world environment, where individuals serve as contexts to organize conflicting labels for objects (A) and in an experimental environment, where faces serve as contexts to organize color-location rules (B).

Previous work shows that humans spontaneously organize arbitrary inputs into hierarchical rule sets, even when these inputs have no inherent structure (Collins, Cavanagh, & Frank, 2014; Collins & Frank, 2013). This is accomplished by using certain features as higher-order contexts to organize the inputs into hierarchically-organized rule sets (Figure 1B). For example, when 8- and 9-month-old infants are presented with simple cues varying by shape and color in a visuospatial learning task, they are able to use specific stimulus features, such as the shape of the cues, as a higher-order context to organize inputs into simpler color-location rules (e.g., if the context is “square”, then “red” predicts a cartoon in “location 1” and “blue” predicts a cartoon in “location 2”; Werchan & Amso, 2020b; Werchan, Collins, Frank, & Amso, 2015). Importantly, organizing inputs in this way supported generalization to new contexts (e.g., to new shapes). Related work shows that 8-9-month-old infants can also use social information to organize object-label mappings and event sequences into dissociable rule sets (Werchan & Amso, 2020a; Werchan, Collins, Frank, & Amso, 2016). However, especially in cluttered naturalistic environments, there are an abundance of cues that can be used to structure learning. This raises the question of how an infant learner determines what cues to use to structure inputs into flexible rule structures?

Insight into this question may be gleaned from computational models of structure learning. In the computational literature, higher-order contexts are learned through reinforcement, where a hypothesized context is positively reinforced through dopaminergic reward signals when it correctly cues an appropriate rule set (Collins & Frank, 2013). Over time, agents can use these reward histories to determine which cues signify higher-order contexts that can be used to flexibly update rule sets into working memory. As such, one possibility is that infants may be biased to use stimuli that are intrinsically rewarding as a higher-order context to organize inputs into hierarchical rule structures.

In infancy, social stimuli such as caregivers are typically the main source of biological and emotional rewards, including food, warmth and comfort. Indeed, prior work has found that 7-month-olds have increased pupil dilation and eye blink rate, which are purported physiological indices of dopaminergic reward activity, in response to the primary caregiver, which in turn drives learning of spatiotemporal patterns (Tummeltshammer, Feldman, & Amso, 2019). At the behavioral level, there is ample evidence that infants use social information to guide attention and learning from early in infancy (Csibra & Gergely, 2006; Hood, Willen, & Driver, 1998; Meltzoff, Kuhl, Movellan, & Sejnowski, 2009; Scaife & Bruner, 1975). For instance, from early in postnatal life, infants show a preference for infant-directed over adult-directed speech (Cooper & Aslin, 1990) and to attend to faces over non-face patterns (Johnson, 2005). Infants also follow gaze (Farroni, Massaccesi, Pividori, Simion, & Johnson, 2004) and are motivated to imitate others’ actions (Meltzoff & Moore, 1977). Social cues, such as shared attention, support learning about objects and language (Baldwin, 1995; Scaife & Bruner, 1975; Tomasello & Farrar, 1986; Yu & Smith, 2013). In the second-half year of life, infants begin to use social referencing to regulate their own behavior and affect (Feinman, 1982; Gunnar & Stone, 1984; Sorce, Emde, Campos, & Klinnert, 1985). Infants are also selective when using others to guide learning (Poulin-Dubois & Brosseau-Liard, 2016). For example, infants selectively interact with new toys when they are presented by an expert, relative to a nonexpert informant (Stenberg, 2013). Other work shows 8-month-old infants are able to track the reliability of an adult’s gaze to make predictions about future events (Tummeltshammer, Mareschal, & Kirkham, 2014).

Social information also influences rule learning abilites in infancy and subsequent executive functions abilities in later childhood. When 8-month-olds are presented with auditory or visual stimuli that follow an ABA or AAB pattern, they show better extraction of the abstract pattern when presented with speech sounds relative to musical tones or visual shapes (Ferguson & Lew-Williams, 2016; Marcus, Fernandes, & Johnson, 2007). Similarly, 7-month-olds show superior abstraction of statistical regularities in syllable sequences when presented with infant-directed versus adult-direct speech (Thiessen, Hill, & Saffran, 2005). Other work indicates that 9-10-month-olds use experimenters to guide search behavior in the A-Not-B task (Topal et al., 2008; Werchan & Amso, 2020). More broadly, socioemotional features of infants’ environments, such as parenting practices, are predictive of executive functions ability in early childhood (Blair, Raver, & Berry, 2014; Fay-Stammbach, Hawes, & Meredith, 2014; Hammond, Müller, Carpendale, Bibok, & Liebermann-Finestone, 2012; Rosen, Amso, & Mclaughlin, 2019). Taken together, these findings indicate that social information may have a unique role in supporting early learning and rule-guided behavior.

Given these findings, we hypothesize that infants may be biased to use social stimuli to structure inputs into hierarchical rule sets. In this view, social information may be a foundational cue that infants exploit to structure learning and help make sense of the abundance of new, and often cluttered and ambiguous inputs. In turn, this may scaffold the development of more complex executive functions over ontogenetic development.

To examine whether infants are biased to use social stimuli to structure reinforcement learning, we adapted a task used to examine incidental structure learning in infants (Werchan et al., 2015). We tested 9-month-old infants given that prior work indicates that infants are capable of structure learning (Werchan et al., 2015, 2016) and show reward responses to social information (Tummeltshammer et al., 2019) by this age. Infants were presented with four cue/target-location pairings, in which the cues consisted of static faces presented on colorful circle backgrounds and the target locations consisted of a cartoon animation presented in one of four screen quadrants (Figure 2). Infants could learn these pairings as individual associations, or they could use the faces or colorful backgrounds as a higher-order context to organize simpler color-location or face-location rules, respectively (Figure 3). We then tested how infants learned the pairings in a subsequent generalization task. Given the ostensible value of social stimuli, we predicted that infants would be biased to use the faces as a higher-order context to structure learning. More specific hypotheses and predictions are detailed at the end of the following section.

Figure 2.

Figure 2.

Examples of four cue/target-location pairings during the initial learning task (A). The central cues consisted of female faces on colorful circle backgrounds and the target locations consisted of a cartoon animation played in one of four screen quadrants. Eye movement reaction times from the central cue to the target locations were measured (AOIs are illustrated by the dotted gray lines). Each trial began with a colorful attention getter, after which the central cue appeared followed by the animated toy (B).

Figure 3.

Figure 3.

Infants could learn the cue/target-location pairings as four separate individual associations (A). Alternatively, infants could apply a hierarchical structure to learn the pairings, using either the faces (B) or the colorful backgrounds (C) as a higher-order context to structure simpler color-location or face-location rules, respectively. Note that L1, L2, L3, and L4 refer to the target locations (i.e., screen quadrants in which the cartoon reward appeared).

Method

Participants

Thirty-nine 9-month-old infants (20 females, 19 males; M age = 9.4 months, SD = 0.46 months) were recruited via advertisements and through birth records from the state department of health. Infants were randomly assigned to a Social-Contexts condition (N = 19, 12 females, 7 males; M age = 9.4 months, SD = 0.46 months, 13 non-Hispanic white, 1 black, 2 Hispanic, 3 mixed-race) or a color-contexts condition (N = 20, 8 females, 12 males; M age = 9.4 months, SD = 0.5 months, 13 non-Hispanic white, 2 black, 5 Hispanic). Sample size was determined using an a priori power analysis at 90% power with a predicted medium effect size (f = .25) estimated from prior studies using similar paradigms (Werchan et al., 2015, 2016), which indicated that approximately 40 infants (20 per condition) would provide sufficient power. The final data point was not collected because of the onset of the COVID-19 pandemic. The study was conducted according to guidelines laid down in the Declaration of Helsinki, with written informed consent obtained from a parent or guardian prior to any data collection. All procedures involving human subjects were approved by the Institutional Review Board at Brown University. All infants were born full-term (within 4 weeks of due date) and had no history of serious health problems. One additional infant was tested, but their data were discarded because of fussiness and crying. All families were compensated for time and travel.

Materials

Apparatus.

Infants sat on a parent’s lap approximately 60 cm away from a 24” monitor in a dimly lit room. Stimuli were presented via SMI Experiment Center software, and infants’ eye movements were recorded at a rate of 60 Hz using remote eye tracking software (RED system; SensoMotoric Instruments, or SMI; Teltow, Germany). Prior to the task, each infant’s point of gaze (POG) was calibrated by presenting a looming stimulus in the center and each of the four corners of the screen until it was fixated by the infant. Calibration was validated by presenting the same stimulus in four locations on the screen. Infant’s estimated POG was compared with the stimulus location, and calibration was repeated if deviations were greater than 2°. Areas of interest (AOIs) were defined in the native SMI software-analysis package BeGaze (Figure 2).

Stimuli.

The stimuli consisted of central cues (~6.5° in size) and targets (~3.5°), which were presented on a black screen (Figure 2). The central cues consisted of four discriminable Caucasian female faces (taken from the NimStim Face Stimulus Set, Tottenham et al., 2009) that were overlaid on discriminable colorful circle backgrounds (red, green, blue, pink). The targets consisted of four animated toys (drawn from the Tobii eye tracking calibration stimuli) presented in one of the four screen quadrants, each of which made a unique, discriminable sound (high-pitched dinging, electronic crescendo, low-pitched ringing, horn crescendo), and bounced from side to side. The targets were all equidistant (~7°) from center.

Procedure

Task overview.

Infants participated in a learning task followed by a generalization task. All infants received the same learning task, but infants were randomly assigned to either a face-contexts (n = 19) or a color-contexts (n = 20) condition in the generalization task (Figure 4). During the learning task, infants were presented with cue/target-location pairings. The central cues consisted of static female faces overlaid on colorful circle backgrounds, and the targets consisted of rewarding cartoon animations presented in one of four different screen quadrants. These pairings could be learned simply as individual associations between the central cues and the target locations. Alternatively, infants could apply a hierarchical structure to learn the pairings, using either the faces or the colorful backgrounds as a higher-order context to organize simpler color-location or face-location rules, respectively. The subsequent generalization task was designed to probe how infants structured the inputs during the initial learning task, namely whether infants learned: (1) individual cue/target associations, (2) a hierarchical structure using the faces to organize simpler color/target-locations rules, or (3) a hierarchical structure using the colorful backgrounds to organize simpler face/target-location rules.

Figure 4.

Figure 4.

Schematic of the hierarchical structure that could be applied during the learning and generalization task using either faces (left panel) or the colored backgrounds (right panel) as a higher-order context. Note that L1, L2, L3, and L4 refer to the target locations (i.e., screen quadrants in which the cartoon reward appeared).

The mappings between rule sets, faces, colors, and target locations were counterbalanced across infants. Our measure of interest was infants’ average reaction time (RT) to the location of the target (animated toy), which was defined as the time between target onset and the time infants’ point of gaze (i.e., time to first fixation) arrived at the target location. All statistical analyses were carried out using SPSS Version 22.0.

Learning task.

We presented infants with four cue/target-location pairings, in which the centrally presented cues consisted of one of two different female faces presented on one of two colorful circle backgrounds. The four central cues were paired with an animated toy presented in one of the screen quadrants, each of which made a unique sound and bounced. The learning task was designed such that the cue/target-location pairings could be learned in multiple ways. Most simply, infants could learn the pairings as four separate face/color/target-location associations with no latent structure (Figure 3A). This would result in efficient initial learning of the pairings but would not support transfer of prior learning to new stimuli. Alternatively, infants could apply a latent hierarchical structure to learn the pairings, using one of the two dimensions of the central cues (the faces or background colors) as a higher-order context. For instance, infants could use the faces to organize simpler color/target-location rules (Figure 3B). Alternatively, infants could use the colorful backgrounds to organize simpler face/target-location rules (Figure 3C). Although applying a latent hierarchical structure might result in more effortful initial learning, it affords future generalization opportunities, thus allowing for efficient learning in novel contexts.

Infants received 8 trials of the four different cue/target-location pairings, for a total of 32 trials (16 in rule set 1 and 16 in rule set 2). Prior to each trial, infants’ point of gaze was centered by presenting a colorful central attention-getting stimulus (a looming square fractal making a “boing” sound). The trial was initiated once the experimenter judged that the infant was looking at the central attention-getter. During each trial, the central cue was shown for 1,000 ms, after which the animated toy appeared for 2,000 ms in the target location associated with the cue (Figure 2). The central cue remained on the screen while the animated toy was presented. Trial order was pseudo-randomized with the constraint that there were an equal number of trials in which the face changed from one trial to the next and trials in which the colorful background changed from one trial to the next.

Dependent measures.

Infants’ RTs to the target location from target onset were measured during the learning task. Trials with eye movement reaction times slower or faster than two standard deviations from the mean for each infant were excluded from analysis. Trials in which infants did not fixate on the central cue prior to attending to the target location were also excluded. To smooth over trial-by-trial variability and to account for missing trial data, we binned every two consecutive trials per rule set to create four learning bins for each rule set. Infants contributed an average of 13.56 (SD = 2.44) (of 16 possible) trials in rule set 1 and 13.13 (SD = 2.18) (of 16 possible) trials in rule set 2.

Generalization task.

Immediately after the learning task, infants were randomly assigned to a face-contexts generalization condition (n = 19) or a color-contexts generalization condition (n = 20), during which infants saw four new cue/target-location pairings (Figure 4). Infants again received 8 pseudorandomized trials of each cue/target-location pairing, for a total of 32 trials. The same timing parameters were used in the generalization task as in the learning task.

In the face-contexts condition, the new central cues consisted of two novel faces presented on the same colorful backgrounds as in the initial learning task. Infants could again use the novel faces to organize the inputs into simpler color/target-location rules (Figure 4). Critically, one novel face denoted a rule set (RS1-A) that had the same previously learned color/target-location rules as a rule set from the initial learning task (RS1). In contrast, the other novel face denoted a novel rule set (RS3) that had two color/target-location rules that had both been experienced individually before, but across different rule sets (RS1 and RS2).

The color-contexts condition had an analogous design, but the novel cues instead consisted of two novel colorful backgrounds that were paired with the same faces used in the initial learning task (Figure 4). One of these novel colorful backgrounds denoted a rule set (RS1-A) with the same face/target-location rules as a rule set from the learning task. The other colorful background denoted a novel rule set (RS3) that had two face/target-location rules that had been experienced across different rule sets (RS1 and RS2).

Dependent measures.

Infants RTs to the target locations from target onset were measured. We again binned every two consecutive trials to create four learning bins per rule set. Trials without a fixation to the central cue prior to fixating on the target location or trials with eye movement reaction times slower or faster than two standard deviations from the mean were excluded from analysis. Infants in the face-contexts condition contributed an average of 12.37 (SD = 2.59) (of 16 possible) trials in rule set 1A and 12.21 (SD = 2.64) (of 16 possible) trials in rule set 3, and infants in the color-contexts condition contributed an average of 12.70 (SD = 3.08) trials in rule set 1A and 12.05 (SD = 3.05) trials in rule set 3.

Specific Predictions.

Our predictions were as follows. First, we predicted that infants’ RTs should decrease with trial exposure (Canfield & Haith, 1991). Moreover, if infants simply learned individual associations between the central cues and target-locations during the learning task, then we expected to observe similar learning rates (i.e., reductions in RTs) between the two novel face or color-contexts in the generalization task, and no differences by condition assignment. However, if infants applied a hierarchical structure to learn the pairings using the faces as a higher-order context, then we expected to find transfer of prior learning in the Face contexts condition only (evidenced by faster RTs in RS1A relative to RS3), and no learning or transfer effects in the color-contexts condition. Additionally, we predicted to find a RT switch cost if infants were learning a hierarchical structure. That is, if infants used the faces as a higher-order context, then we expected to find slower average RTs on trials when the higher-order face rule changes from trial-to-trial relative to average RTs on trials when it remained the same. This prediction is based on prior work indicating that adults have slower reaction times when a higher-order rule changes on a trial-by-trial basis and thus has to be updated into working memory (Monsell, 2003; Collins & Frank, 2013) and infant work showing the same pattern in a hierarchical rule-learning tasks structured like the one used here (Werchan et al., 2015). Finally, if infants applied a hierarchical structure using the colors as a higher-order context, then we expected to find transfer of prior learning in the color-contexts condition only, and no learning or transfer effects in the face-contexts condition.

Results

Learning Task

We first conducted a mixed-effects ANOVA on infants’ averaged reaction times using rule set (RS1, RS2) and trial bin (bin 1, bin 2, bin 3, bin 4) as within-subjects factors and condition (face-contexts, color-contexts) as a between-subjects factor. This analysis revealed a significant main effect of trial bin, F(3, 111) = 7.38, p < .001, ηp2 = .17, indicating that infants’ reaction times became faster with trial exposure (Figure 5). There were no other significant effects or interactions, all ps > .13, all Fs < 1.91, all ηp2 < .05. These results indicate that infants learned the rule sets with trial exposure, and that there were no differences by rule set or condition assignment.

Figure 5.

Figure 5.

Infants’ eye movement reaction times over trial bins for RS1 and RS2 during the learning task. Error bars reflect SEM.

We next examined reaction time costs during face and color change trials in the learning task. We calculated two switch-cost values: (1) a face switch cost (average RTs on trials where the face changed from the previous trial – average RTs on trials where the face remained the same as the previous trial), which assumes a higher-order structure using the faces as a context; (2) a color switch cost (average RTs on trials where the color changed from the previous trial – average RTs on trials where the color remained the same as the previous trial), which assumes a higher-order structure using color as a context. We first examined whether there were differences in infants face switch costs relative to color switch costs during learning using a mixed-effects ANOVA with condition (face-contexts, color-contexts) as a between-subjects variable and switch cost type (face switch cost, color switch cost) as a within-subjects variable. This indicated no significant difference between face and color switch costs, F(1, 37) = 0.62, p = .44, ηp2 = .02. There was also no main effect of condition, F(1, 37) = 2.10, p = .16, ηp2 = .05, nor an interaction between switch cost type and condition, F(1, 37) = 0.06, p = .82, ηp2 = .001. However, planned comparisons to zero indicated that infants, as a group, had face switch costs significantly greater than zero, t(38) = 2.55, p = .01, but not for color switch costs, t(38) = 0.92, p = .36 (Figure 6). This indicates that infants had significantly slower reaction times on trials where the higher-order face changed from one trial to the next relative to when it remained the same, but that there were no differences in reaction times for trials where the color changed relative to trials where the color remained the same. Thus, the significant reaction time switch cost observed on trials when the face changes, requiring infants to update a new rule set into working memory, provides further evidence that infants were learning a hierarchical structure using the faces as a higher-order context.

Figure 6.

Figure 6.

Infants’ face and color switch costs (average RTs on trials where the face/color changes from the previous trial – average RTs on trials where it remains the same). Infants’ face switch costs were significantly greater than zero, indicating that their RTs were significantly slower when the face rule changed relative to when it remained the same. Color switch costs did not differ from zero, indicating that there were no differences in infants’ RTs when the color changed from trial to trial relative to when it remained the same. Error bars reflect SEM.

Generalization Task

Finally, we examined infants’ performance during the generalization task using a mixed-effects ANOVA with rule set (RS1A, RS3) and trial bin (bin 1, bin 2, bin 3, bin 4) as within-subjects factors and condition (face-contexts, color-contexts) as a between-subjects factor. This analysis revealed a significant main effect of rule set, F(1, 37) = 8.62, p = .006, ηp2= .20. Importantly, it also revealed a significant interaction between rule set and condition, F(1, 37) = 7.77, p = .008, ηp2= .17, and a trending interaction between rule set, trial bin, and condition, F(3, 111) = 2.46, p = .07, ηp2= .06. We followed up on this analysis by examining each condition separately (Bonferroni corrected alpha = .025). In the face-contexts condition, a repeated-measures ANOVA revealed a significant main effect of rule set, F(1, 18) = 26.67, p < .001, ηp2= .60. There was no significant effect of trial bin, F(3, 54) = 2.59, p = .06, ηp2 = .13, nor was there a significant interaction between rule set and trial bin, F(3, 54) = 1.44, p = .24, ηp2 = .07. In contrast, the color-contexts condition showed no main effect of rule set, F(1, 17) = 0.01, p = .93, ηp2 = 0, or trial bin, F(3, 57) = 0.47, p = .71, ηp2 = .02. There was also no significant rule set by trial bin interaction, F(3, 57) = 2.10, p = .11, ηp2= .10. In sum, these analyses indicated that infants had faster overall reaction times in RS1A relative to RS3 in the face-contexts generalization condition, but that there were no differences between RS1A and RS3 for infants assigned to the color-contexts generalization condition (Figure 7). This indicates transfer of prior learning in the face-contexts generalization condition, but not in the color-contexts generalization condition.

Figure 7.

Figure 7.

Infants’ eye movement reaction times over trial bins for RS1-A and RS3 during the generalization task for the Face contexts condition and the Color contexts condition. Error bars reflect SEM.

Discussion

The ability to structure learning into hierarchically-organized rule structures is essential for adaptive behavior. Yet, it is unclear what cues are most likely to be used to structure learning in infancy. In the current study, we examined whether infants are biased to use social stimuli (in this case faces) to organize inputs into hierarchically-organized structures. We first presented infants with cue/target-location pairings during an initial learning task. We found that all infants showed a reduction in reaction times with trial exposure, suggesting that they learned the pairings. We then examined infants’ performance in the generalization task to explore how infants learned these pairings. Critically, we found evidence that infants structured the inputs into hierarchical rule sets using the faces as a higher-order context. This is shown by the finding that infants in the face-contexts generalization condition showed positive transfer of an analogous rule set (as shown by decreased reaction times to RS1A) and negative transfer of a novel rule set (as shown by increased reaction times to RS3). In contrast, infants in the color-contexts generalization condition showed increased reaction times to both RS1A and RS3 with no differences in performance between rule sets, indicating no transfer of prior learning. Moreover, these results control for the possibility that the decline in reaction times over the learning task was related to a general eye movement speed decline. If this were the case, then we would not expect to observe increased reaction times in the color-contexts generalization condition, as well as in the novel rule set in the face-contexts generalization condition. As such, these findings indicate that, given the option to use either faces or colorful shapes to structure learning, infants structured learning into hierarchical rule sets using the faces as a higher-order context, which then supported generalization to novel face-contexts.

These findings show that infants are biased to use social stimuli to organize reinforcement learning when in competition with color information. Other work has found that when non-social stimuli are used, infants can learn to use color or shape information as a higher-order context to structure inputs (Werchan et al., 2015; Werchan & Amso, 2020). It is likely that infants can learn to use different information to structure learning as a function of changing task demands. For instance, this paradigm could be adapted to test whether infants are biased to use shape as a higher-order context if a label is heard during learning.

Interestingly, while we found that infants in the face-contexts generalization condition showed positive transfer (faster reaction times to RS1A relative to RS3), we also observed that infants’ reaction times did not show a further decrease over trials during the generalization task. The lack of a further increase in reaction times could suggest that this mechanism is not sufficiently mature to support proactive or anticipatory saccades. This explanation is consistent with computational approaches, which suggest that proactive control over action requires more complex hierarchical nesting of frontostriatal circuitry (Collins & Frank, 2013). It is also possible that the 1,000 ms delay between cue and target onset was too short to elicit anticipatory eye movements, or that the cue remaining on screen during the delay decreased the number of anticipatory eye movements due to the added cost of disengaging from a salient cue. Alternatively, the use of faces could have altered infants’ attentional patterns, potentially leading to asymptotic reaction times. For instance, prior work suggests that the use of salient faces in spatial cueing tasks can attenuate typical inhibition-of-return effects in both adult (Pérez-Dueñas, Acosta, & Lupiáñez, 2014) and infant studies (Markant, Oakes, & Amso, 2015). Future work is needed to address these questions.

We also measured infants’ reaction time switch costs during face and color change trials in the learning task, based on prior adult (Monsell, 2003) and infant (Werchan et al., 2015) studies showing that individuals have slower reaction times when updating a new higher-order rule into working memory. We found that infants’ face switch costs were significantly greater than zero. As an analytical control, we also examined color switch costs and saw no differences in reaction times. These data are consistent with infants’ generalization performance, and provide further evidence that infants used the faces to structure inputs into hierarchical rule sets.

Prior research argues that humans are evolutionarily adapted to transmit knowledge through pedagogical interactions with caregivers and other conspecifics (Csibra & Gergely, 2006). This idea is supported by findings showing that human children use social cues to guide attention and learning from early in infancy. For instance, social cues such as joint attention have been shown to guide object processing (Cleveland, Schug, & Striano, 2007; Reid & Striano, 2005; Striano, Chen, Cleveland, & Bradshaw, 2006; Theuring, Gredeba, & Hauf, 2007; Wu, Gopnik, Richardson, & Kirkham, 2011), multimodal learning (Wu & Kirkham, 2010), and event sequence learning (Topal, Gergely, Miklosi, Erdohegyi, & Csibra, 2008; Werchan & Amso, 2020a). We add to these findings by showing that 9-month-old infants can use static faces to organize arbitrary cue/target pairings into hierarchical rule sets.

Establishing the value of social information for structuring reinforcement learning beginning in early postnatal life also has implications for understanding the ontogenetic origins of more complex behavior and executive functions. For example, prior work indicates that young children monitor the reliability of informants to infer whether to trust individuals (Clement, Koenig, & Harris, 2004; Koenig & Harris, 2005; Luchkina, Sobel, & Morgan, 2018; Tummeltshammer, Wu, Sobel, & Kirkham, 2014; Xiao et al., 2018). Our findings suggest that this mechanism could scaffold learning of more abstract rules, such as rules related to the trustworthiness. This adds insight into the impact of early social environments on adaptive functioning more broadly. For example, if the quality of interactions with caregivers is poor or unreliable, this may alter infants’ bias to use social contexts to guide learning as an adaptive response to early caregiving environments, as predicted by an ecological account of executive functions development (Werchan & Amso, 2017). In turn, this may influence children’s capacity to engage with and use other children and adults in the learning process, potentially impacting academic achievement as the child begins formal schooling. This idea is supported by work indicating predictive relationships between the quality of early parent-child interactions on subsequent school readiness, which is mediated by the development of executive functions (Blair & Raver, 2015; Devine, Bignardi, & Hughes, 2016; Fay-Stammbach et al., 2014; Russell, Lee, & Oxford, 2016). Moreover, if infants use social information to structure learning, then it is possible that infants might use caregivers and related social stimuli to learn more complex rule and task sets, such as social interactions, cultural norms, and even parenting practices. This could lead to phenomena such as “ghosts in the nursery”, where individuals model caregiving practices after those they received during early childhood (Fraiberg, Adelson, & Shapiro, 1975). The present work adds new insights into the potential mechanistic origins of these observed relationships.

A limitation of the present study is that we are unable to tease apart the precise mechanism underlying infants’ bias to use social stimuli to structure learning. Social stimuli are both attention-capturing (Amso, Haas, & Markant, 2014; Frank, Vul, & Johnson, 2009; Gluckman & Johnson, 2013; Tummeltshammer, Wu, et al., 2014) and are associated with reward value for learning (Tummeltshammer et al., 2019). Additionally, it is possible that the figure-ground design of the face/color cues could have caused infants to bias attention to the face during learning. More work is needed to discern these possibilities. For instance, future research could directly measure individual differences in infants’ baseline attentional biases and physiological reward responses to both social and non-social stimuli and examine whether these individual differences predict infants’ bias to uses faces to structure learning. Nonetheless, the current findings provide an important advance in understanding how infants use simultaneously presented social and non-social information to structure learning of hierarchical rule structures.

In sum, here we found that infants are biased to use social stimuli to organize inputs into hierarchical rule sets that support generalization to new contexts. These findings provide new evidence that social information helps narrow the space of learning problems by acting as a relevant cue that infants leverage to organize inputs into flexible rule structures. While these findings add important new insights into the role of social information in structuring learning in infancy, the generalizability of our findings is limited by our sample demographic, which was predominately white and middle class. We designed the study to use social stimuli that matched the race and gender of the majority of primary caretakers in our infant sample, which primarily consisted of white female caretakers. We made this design choice based on prior work showing that mothers elicit a reward response (Tummeltshammer et al., 2019). However, given that our infant sample is not representative of the population at large, this limits the generalizability of our findings to non-white middle class infants. Future work is needed with more representative samples to determine the generalizability and relevance of the current findings for the population at large.

Research Highlights.

  • Examined whether infants are biased to use social stimuli, over non-social stimuli, to organize reinforcement learning (RL) using an incidental hierarchical rule learning task

  • Presented infants with visuospatial inputs that could be learned using social or non-social stimuli as a higher-order context to organize inputs into hierarchical rule sets

  • Results indicated that infants used social stimuli to organize inputs into hierarchical rule sets, which supported generalization or prior learning to new contexts

  • This work contributes new evidence that infants are biased to organize RL around social stimuli

Acknowledgements:

This work was supported by the National Institutes of Health (R21 MH113870 to DA) and NSF (2051819 to DA). We thank Jamie Poldracky and Lily Gordon for help with recruitment and data processing, and all of the infants and families who made this research possible.

Footnotes

Conflict of Interest Statement:

The authors report no conflicts of interest.

Data Availability Statement:

The data that support the findings of this study are available from the corresponding author upon reasonable request.

References

  1. Amso D, Haas S, & Markant J (2014). An Eye Tracking Investigation of Developmental Change in Bottom-up Attention Orienting to Faces in Cluttered Natural Scenes. PLoS ONE, 9(1), 1–7. 10.1371/journal.pone.0085701 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Baldwin DA (1995). Understanding the link between joint attention and language. In Moore C & Dunham PJ (Eds.), Joint attention: Its origins and role in development (pp. 131–158). Hillsdale, NJ: Erlbaum. [Google Scholar]
  3. Blair C, & Raver CC (2015). School Readiness and Self-Regulation: A Psychobiological Approach. Annual Review of Psychology, 66, 711–731. 10.1146/annurev-psych-010814-015221 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Blair C, Raver CC, & Berry DJ (2014). Two Approaches to Estimating the Effect of Parenting on the Development of Executive Function in Early Childhood. Developmental Psychology, 50(2), 554–565. 10.1037/a0033647.Two [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Clement F, Koenig M, & Harris P (2004). The Ontogenesis of Trust. Mind & Language, 19(4), 360–379. [Google Scholar]
  6. Cleveland A, Schug M, & Striano T (2007). Joint Attention and Object Learning in 5- and 7-Month-Old Infants. Infant and Child Development, 16, 295–306. 10.1002/icd [DOI] [Google Scholar]
  7. Collins AGE, Cavanagh JF, & Frank MJ (2014). Human EEG Uncovers Latent Generalizable Rule Structure during Learning. Journal of Neuroscience, 34(13), 4677–4685. 10.1523/JNEUROSCI.3900-13.2014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Collins AGE, & Frank MJ (2013). Cognitive control over learning: Creating, clustering, and generalizing task-set structure. Psychological Review, 120(1), 190–229. 10.1037/a0030852 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Cooper RP, & Aslin RN (1990). Preference for Infant-Directed Speech in the First Month after Birth Author ( s ): Robin Panneton Cooper and Richard N . Aslin Published by : Wiley on behalf of the Society for Research in Child Development Stable; URL :http://www.jstor.org/stable/1130766. Child Development, 61(5), 1584–1595. [PubMed] [Google Scholar]
  10. Csibra G, & Gergely G (2006). Social learning and social cognition : The case for pedagogy. In Munakata Y & Johnson MH (Eds.), Processes of Change in Brain and Cognitive Development (pp. 249–274). Oxford: Oxford University Press. [Google Scholar]
  11. Devine RT, Bignardi G, & Hughes C (2016). Executive Function Mediates the Relations between Parental Behaviors and Children’s Early Academic Ability. Frontiers in Psychology, 7, 1–15. 10.3389/fpsyg.2016.01902 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Farroni T, Massaccesi S, Pividori D, Simion F, & Johnson MH (2004). Gaze Following in Newborns. Infancy, 5(1), 39–60. [Google Scholar]
  13. Fay-Stammbach T, Hawes DJ, & Meredith P (2014). Parenting Influences on Executive Function in Early Childhood: A Review. Child Development Perspectives, 8(4), 258–264. 10.1111/cdep.12095 [DOI] [Google Scholar]
  14. Feinman S (1982). Social Referencing in Infancy. Merrill-Palmer Quarterly, 28(4), 445–470. [Google Scholar]
  15. Ferguson B, & Lew-Williams C (2016). Communicative signals support abstract rule learning by 7-month-old infants. Nature Scientific Reports, 6(25434), 1–7. 10.1038/srep25434 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Fraiberg S, Adelson E, & Shapiro V (1975). Ghosts in the nursery: A psychoanalytic approach to the problems of impaired infant-mother relationships. Journal of American Academy of Child Psychiatry, 14(3), 387–421. [DOI] [PubMed] [Google Scholar]
  17. Frank MC, Vul E, & Johnson SP (2009). Development of infants’ attention to faces during the first year. Cognition, 110(2), 160–170. 10.1016/j.cognition.2008.11.010.Development [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Gluckman M, & Johnson SP (2013). Attentional capture by social stimuli in young infants. Frontiers in Psychology, 4, 1–7. 10.3389/fpsyg.2013.00527 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Gunnar MR, & Stone C (1984). The Effects of Positive Maternal Affect on Infant Responses to Pleasant, Ambiguous, and Fear-Provoking Toys. Child Development, 55(4), 1231–1236. [Google Scholar]
  20. Hammond SI, Müller U, Carpendale JIM, Bibok MB, & Liebermann-Finestone DP (2012). The effects of parental scaffolding on preschoolers’ executive function. Developmental Psychology, 48(1), 271–281. [DOI] [PubMed] [Google Scholar]
  21. Hood BM, Willen JD, & Driver J (1998). Adult ’ s Eyes Trigger Shifts of Visual Attention in Human Infants. Psychological Science, 9(2), 131–134. [Google Scholar]
  22. Johnson MH (2005). Subcortical Face Processing. Nature Reviews Neuroscience, 6, 766–774. 10.1038/nrn1766 [DOI] [PubMed] [Google Scholar]
  23. Koenig MA, & Harris PL (2005). Preschoolers Mistrust Ignorant and Inaccurate Speakers. Child Development, 76(6), 1261–1277. [DOI] [PubMed] [Google Scholar]
  24. Luchkina E, Sobel DM, & Morgan JL (2018). Eighteen-month-olds selectively generalize words from accurate speakers to novel contexts. Developmental Science, 21(e12663), 1–11. 10.1111/desc.12663 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Marcus GF, Fernandes KJ, & Johnson SP (2007). Infant Rule Learning Facilitated by Speech. Psychological Science, 18(5), 387–391. [DOI] [PubMed] [Google Scholar]
  26. Markant J, Oakes LM, & Amso D (2015). Visual Selective Attention Biases Contribute to the Other-Race Effect Among 9-Month-Old Infants. Developmental Psychobiology, 355–365. 10.1002/dev.21375 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Meltzoff AN, Kuhl PK, Movellan J, & Sejnowski TJ (2009). Foundations for a New Science of Learning. Science, 325, 284–289. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Meltzoff AN, & Moore K (1977). Imitation of Facial and Manual Gestures by Human Neonates. Science, 198(4312), 75–78. [DOI] [PubMed] [Google Scholar]
  29. Monsell S (2003). Task switching. Trends in Cognitive Sciences, 7(3), 134–140. 10.1016/S1364-6613(03)00028-7 [DOI] [PubMed] [Google Scholar]
  30. Pérez-Dueñas C, Acosta A, & Lupiáñez J (2014). Reduced habituation to angry faces: increased attentional capture as to override inhibition of return. Psychological Research, 78, 196–208. 10.1007/s00426-013-0493-9 [DOI] [PubMed] [Google Scholar]
  31. Poulin-Dubois D, & Brosseau-Liard P (2016). The Developmental Origins of Selective Social Learning. Current Directions in Psychological Science, 25(1), 60–64. 10.1177/0963721415613962 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Reid VM, & Striano T (2005). Adult gaze influences infant attention and object processing: implications for cognitive neuroscience. European Journal of Neuroscience, 21, 1763–1766. 10.1111/j.1460-9568.2005.03986.x [DOI] [PubMed] [Google Scholar]
  33. Rosen ML, Amso D, & Mclaughlin KA (2019). The role of the visual association cortex in scaffolding prefrontal cortex development: A novel mechanism linking socioeconomic status and executive function. Developmental Cognitive Neuroscience, 39(July), 100699. 10.1016/j.dcn.2019.100699 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Russell BS, Lee JO, & Oxford ML (2016). Parenting and Preschool Self-Regulation as Predictors of Social Emotional Competence in 1st Grade, 30(2), 153–169. 10.1080/02568543.2016.1143414.Parenting [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Scaife M, & Bruner JS (1975). The capacity for joint visual attention in the infant. Nature, 253(5489), 265–266. [DOI] [PubMed] [Google Scholar]
  36. Sorce JF, Emde RN, Campos J, & Klinnert MD (1985). Maternal Emotional Signaling : Its Effect on the Visual Cliff Behavior of 1-Year-Olds. Developmental Psychology, 21(I), 195–200. [Google Scholar]
  37. Stenberg G (2013). Do 12-Month-Old Infants Trust a Competent Adult? Infancy, 18(5), 873–904. 10.1111/infa.12011 [DOI] [Google Scholar]
  38. Striano T, Chen X, Cleveland A, & Bradshaw S (2006). Joint attention social cues influence infant learning. European Journal of Developmental Psychology, 3(3), 289–299. 10.1080/17405620600879779 [DOI] [Google Scholar]
  39. Theuring C, Gredeba G, & Hauf P (2007). Object processing during a joint gaze following task. European Journal of Developmental Psychology, 4(1), 65–79. 10.1080/17405620601051246 [DOI] [Google Scholar]
  40. Thiessen ED, Hill EA, & Saffran JR (2005). Infant-Directed Speech Facilitates Word Segmentation. Inf, 7(1), 53–71. [DOI] [PubMed] [Google Scholar]
  41. Tomasello M, & Farrar MJ (1986). Joint Attention and Early Language Author. Child Development, 57(6), 1454–1463. [PubMed] [Google Scholar]
  42. Topal J, Gergely G, Miklosi A, Erdohegyi A, & Csibra G (2008). Infants ’ Perseverative Search Errors Are Induced by Pragmatic Misinterpretation. Science, 321(September), 1831–1835. [DOI] [PubMed] [Google Scholar]
  43. Tummeltshammer KS, Feldman ECH, & Amso D (2019). Using pupil dilation , eye-blink rate, and the value of mother to investigate reward learning mechanisms in infancy. Developmental Cognitive Neuroscience, 36. 10.1016/j.dcn.2018.12.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Tummeltshammer KS, Mareschal D, & Kirkham NZ (2014). Infants’ selective attention to reliable visual cues in the presence of salient distractors. Child Development, 85(5), 1981–1994. 10.1111/cdev.12239 [DOI] [PubMed] [Google Scholar]
  45. Tummeltshammer KS, Wu R, Sobel DM, & Kirkham NZ (2014). Infants Track the Reliability of Potential Informants. Psychological Science, 25(9), 1730–1738. 10.1177/0956797614540178 [DOI] [PubMed] [Google Scholar]
  46. Werchan DM, & Amso D (2017). A Novel Ecological Account of Prefrontal Cortex Functional Development PFC : The State of the Art. Psychological Review, 124(6), 720–739. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Werchan DM, & Amso D (2020a). Adaptive rule learning of event sequences during the A-not-B task in 9-month-old infants. Developmental Psychobiology, 1–14. 10.1002/dev.21999 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Werchan DM, & Amso D (2020b). Top-down knowledge rapidly acquired through abstract rule learning biases subsequent visual attention in 9-month-old infants. Developmental Cognitive Neuroscience, 42, 100761. 10.1016/j.dcn.2020.100761 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Werchan DM, Collins AGE, Frank MJ, & Amso D (2015). 8-Month-Old Infants Spontaneously Learn and Generalize Hierarchical Rules. Psychological Science, 26(6), 805–815. 10.1177/0956797615571442 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Werchan DM, Collins AGE, Frank MJ, & Amso D (2016). Role of Prefrontal Cortex in Learning and Generalizing Hierarchical Rules in 8-Month-Old Infants. Journal of Neuroscience, 36(40), 10314–10322. 10.1523/JNEUROSCI.1351-16.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Wu R, Gopnik A, Richardson DC, & Kirkham NZ (2011). Infants Learn About Objects From Statistics and People Infants Learn About Objects From Statistics and People. Developmental Psychology, 1–10. 10.1037/a0024023 [DOI] [PubMed] [Google Scholar]
  52. Wu R, & Kirkham NZ (2010). No two cues are alike: Depth of learning during infancy is dependent on what orients attention. Journal of Experimental Child Psychology, 107, 118–136. 10.1016/j.jecp.2010.04.014 [DOI] [PubMed] [Google Scholar]
  53. Xiao NG, Wu R, Quinn PC, Liu S, Tummeltshammer KS, Kirkham NZ, … Lee K (2018). Infants rely more on gaze cues from own-race than other-race adults for learning under uncertainty. Child Development, 89(3), 1–24. 10.1111/cdev.12798.Infants [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Yu C, & Smith LB (2013). Joint Attention without Gaze Following : Human Infants and Their Parents Coordinate Visual Attention to Objects through Eye-Hand Coordination. PLoS ONE, 8(11). 10.1371/journal.pone.0079659 [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES