Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Oct 1.
Published in final edited form as: J Exp Child Psychol. 2014 Jul 8;126:395–411. doi: 10.1016/j.jecp.2014.06.003

Cross-Situational Statistical Word Learning in Young Children

Sumarga H Suanda 1,2, Nassali Mugwanya 2, Laura L Namy 2
PMCID: PMC4116143  NIHMSID: NIHMS612234  PMID: 25015421

Abstract

Recent empirical work has highlighted the potential role of cross-situational statistical word learning in children's early vocabulary development. In the current study, we tested five-to seven-year-old children's cross-situational learning by presenting children with a series of ambiguous naming events containing multiple words and multiple referents. Children rapidly learned word-to-object mappings by attending to the co-occurrence regularities across these ambiguous naming events. The current study begins to address the mechanisms underlying children's learning by demonstrating that the diversity of learning contexts impacts performance. The implications of the current findings for the role of cross-situational word learning at different points in development are discussed along with the methodological implications of employing school-aged children to test hypotheses regarding the mechanisms supporting early word learning.


According to one estimate, by six years of age, children have amassed a vocabulary of 14,000 words (Carey, 1978). One central goal in the study of children's language acquisition is to better understand the processes that underlie such impressive word learning. Based on their influential investigation into these processes, Carey and Bartlett (1978) demonstrated one learning mechanism, fast mapping, a learning process that occurs after a single, or a few exposures to a novel word and involves the acquisition of an initial link between a word and its referent. Much of the ensuing research on children's word learning over the past three and a half decades has been devoted to understanding the basis for fast mapping (for discussion, see Swingley, 2010), including the nature of the constraints on the mapping process (see Woodward & Markman, 1998), whether fast mapping is specific to the domain of word learning (Markson & Bloom, 1997), and how the mapping processes change with development (Hollich et al., 2000).

Recently, a growing body of research has aimed to understand a different word learning process, one that extends over multiple encounters. In one such study, employing adults as model word learners, Yu and Smith (2007) presented subjects with a series of learning trials, each involving ambiguous reference. In each trial, subjects viewed multiple pictures of objects simultaneously on a computer screen and heard multiple spoken words played sequentially in a random order. In each trial, it was unclear which words referred to which objects. However, over trials, every time subjects heard a particular word, its corresponding referent object was present. Importantly, word-object pairs did not always appear with the same set of accompanying words and objects. Thus, subjects could learn the words if they attended to the cross-situational regularities with which particular words and objects co-occurred. Yu and Smith found that adult learners were remarkably sensitive to these cross-situational co-occurrence patterns between words and objects (i.e., their co-occurrence statistics) and could use this knowledge to acquire the word-to-object mappings. Yu and Smith's data, as well as other results (e.g., Akhtar & Montague, 1999; Gillette, Gleitman, Gleitman, & Lederer, 1999; Vouloumanos, 2008) have highlighted the likely role of processes other than fast mapping in children's lexical acquisition.

Two lines of research have been inspired by Yu and Smith's initial work on cross-situational learning. First, a number of investigations have been devoted to extending the empirical phenomenon of cross-situational word learning to developmental populations. Towards this end, Smith and Yu (2008) designed a version of their adult cross-situational learning paradigm suitable for testing infant learners. Employing a simplified looking-based version of the task, Smith and Yu found that 12- to 14-month-old infants successfully associated words and their corresponding objects in a task that, like its adult precursor, required them to attend to the co-occurrence statistics across situations (see also Scott & Fisher, 2012; Smith & Yu, 2013; Yu & Smith, 2011; Vlach & Johnson, 2013; Vouloumanos & Werker, 2009). This finding, that even young word learners possess the capacity for cross-situational word learning, is important because it is an existence proof for the claim that a process such as cross-situational word learning can facilitate early lexical acquisition.

A second line of research has been devoted to understanding the underlying mechanisms that make cross-situational word learning possible. Towards this broad goal, a number of researchers have employed adult learners, as Yu and Smith originally did, to test the nature of the underlying learning algorithms that could explain successful cross-situational word learning (e.g., Kachergis, Yu, & Shiffrin, 2012; K. Smith, Smith, & Blythe, 2011; Trueswell, Medina, Hafri, & Gleitman, 2013; Yu, Zong, & Fricker, 2012; Yurovsky, Yu, & Smith, 2013). Extant computational and experimental work suggests that multiple mechanisms could explain cross-situational word learning findings, including hypothesis testing (Medina, Snedeker, Trueswell, & Gleitman, 2011; Trueswell et al., 2013), associative learning (Kachergis et al., 2012; Yu, 2008), and single-trial learning (K. Smith, Smith, & Blythe, 2009). Other researchers have examined the constellation of factors that influence adults' cross-situational word learning performance, demonstrating that the diversity of learning environments (Kachergis, Yu, & Shiffrin, 2009; Suanda & Namy, 2012), attention to competitors (Fitneva & Christiansen, 2011), the adoption of exclusion constraints (Yoshida, Rhemtulla, & Vouloumanos, 2012; Yurovsky et al., 2013), and the presence of grammatical cues (Monaghan & Mattock, 2012) all help in cross-situational word learning. Finally, researchers have also begun to examine the extent to which adult learners can acquire word-to-referent mappings through cross-situational learning while also solving other language-related tasks to better mimic the multi-tasking likely involved in young children's early language learning. Experimental research along these lines has revealed that adult learners can successfully learn words cross-situationally even when the words are presented in continuous speech and thus required learners to segment speech into the relevant units (Cunillera, Laine, Camara, & Rodriguez-Fornells, 2010; Yurovsky, Yu, & Smith, 2012).

Together, this recent wealth of studies has contributed to a better understanding of the nature of cross-situational word learning and its potential role in lexical acquisition. The goal of the current experiment is to examine cross-situational word learning in school-aged children, between the ages of 5 and 7. There are two reasons that studying lexical development in this age group is important.

First, research with this age group may bridge the two sets of findings mentioned above. That is, research with adult learners that probe the mechanisms of cross-situational word learning employ research methodologies that would be difficult to implement with infants. These include paradigms that require adult learners to explicitly select mappings on a trial-by-trial basis (often from a large set of candidate objects; see K. Smith et al., 2011; Trueswell et al., 2013), paradigms that require adult learners to report confidence-levels in their word-to-referent mappings (Vlach & Sandhofer, 2014; Yurovsky et al., 2013), and paradigms that require adult learners to actively select the word-object pairings to be seen on each trial (Kachergis, Yu, & Shiffrin, 2013). Although these paradigms would be difficult to implement with young infants, they could be translated into paradigms suitable for school-aged children.

Across various aspects of language acquisition, researchers have adopted the approach of using children rather than adults as models of infant language learning. In some cases, researchers have found very little difference between adults' and children's performance (Saffran, 2001; 2002; Saffran, Newport, Aslin, Tunick, & Barrueco, 1997). Others, however, have identified conditions under which adults and children perform differently (Hudson Kam & Newport, 2005; 2009). Finally, some researchers have proposed that the differences between adults and children are mainly quantitative, rather than qualitative, in nature (e.g., Braine et al., 1990; Ferman & Karni, 2010; Janacsek, Fiser, & Nemeth, 2012; Piccin & Waxman, 2007). Thus, although early school-aged children are far from equivalent to novice language learners, their inherently more limited memory and attentional capacities may nonetheless lead to different patterns of performance compared to adult learners. This approach has the potential to inform whether the cross-situational learning findings gleaned from adult learning tasks are relevant for developmental accounts.

A second reason to study word learning in this population is that this is a period of development in which children are very much in the midst of building their vocabulary. In fact, the rate of vocabulary growth during middle childhood is greater than during late infancy and toddlerhood, the period typically emphasized in word learning research (for discussion, see Anglin, 1993; Bloom, 2000; Snedeker, 2009). Thus an understanding of cross-situational word learning during this period of vocabulary development has the potential to inform not only the role of cross-situational learning in earlier stages of lexical development, but also the constellation of learning processes that support prolific word learning more generally.

In what follows, we present two experiments that investigate cross-situational statistical word learning and its underpinnings in school-aged children. We had two specific goals. First, we investigated whether children's cross-situational word learning is, in fact, cross-situational in nature. K. Smith and colleagues (2009) recently demonstrated that success in a cross-situational learning paradigm does not necessarily entail aggregating information across situations, and that task success could, in some testing conditions, be the product of a single-trial learning strategy. In Experiment 1, we tested whether children's learning actually involved aggregating information across situations by examining the effect of context diversity on children's learning. Contextual diversity is indexed by the number of different sets of stimuli with which each word-object pairing co-occurs across learning trials. Because contextual diversity refers to an aspect of the learning environment that is, by definition, cross-situational in nature, it could only impact learning if children were aggregating information across situations.

The second goal of the current work was to investigate the robustness of the word mappings children form as a function of cross-situational word learning. Towards this end, in Experiment 2, we examined whether children could, following exposure to only a few ambiguous naming events, distinguish a word's referent from other distractor objects that had also co-occurred with the word at varying levels of frequency. The extent to which children can discriminate referents based on fine-grained differences in co-occurrence frequency speaks to the strength of the word-to-referent mappings obtained through cross-situational learning.

Experiment 1

To test cross-situational word learning capacities in school-aged children, we adapted Yu and Smith's adult cross-situational word learning paradigm (Yu & Smith, 2007) to render the task suitable for young children. As in the adult paradigm, children encountered ambiguous naming events in which they saw multiple pictures of objects and heard multiple words with no disambiguating information regarding which word referred to which picture. Across situations, words and their referents always co-occurred together while their accompanying word-referent pairings varied. Thus the logic behind the paradigm was that children could determine word reference only if they were able to utilize the cross-situational co-occurrence information.

Although success on this task has generally been interpreted as evidence for cross-situational learning, K. Smith and colleagues have recently proposed an alternative explanation for success in this task (K. Smith et al., 2009). They argue that a learner who simply remembers a snapshot of a single trial, keeping track of the set of objects present during one presentation of a given word could still perform above chance on the forced-choice task employed in this paradigm. That is, provided that the set of objects present at test includes only a subset of those present during the sole remembered learning trial, a learner could perform above chance simply by selecting at random from the test pictures that had also been presented during the single encoded learning trial for a particular word.

Given K. Smith and colleagues' critique, coupled with evidence suggesting that relative to adults, children may be less likely to aggregate information across trials (e.g., Piccin & Waxman, 2007), the current paradigm includes a manipulation that tests whether children employ a single-trial learning strategy versus a truly cross-situational one. Specifically, children participated in one of three learning conditions that differed in the contextual diversity of the learning environment. We defined contextual diversity as the degree of variability in the set of word-object pairings with which a particular word-object pair co-occurs across naming events. We argue that if, as others have documented in adult learners (Kachergis et al., 2009; Suanda & Namy, 2012), children are more successful at making word-referent mappings with higher than lower context diversity, then children must have employed a cross-situational learning strategy. This is because contextual diversity arises across multiple situations, and a single-trial learner would not encode information about variability across multiple situations. In contrast, if we do not observe an effect of contextual diversity in children's learning, we cannot rule out the possibility that children employed a single-trial learning strategy.

Participants

Eighty-four 5- to 7-year-olds (Mean age = 73.6 mos, Range = 57.3 – 94.9) participated. Forty-nine children were female. 77% of children were Caucasian, 19% were African-American, 2% were Asian, and 1% were of other racial categories. 9% of families identified as Hispanic or Latino. All children were native speakers of English. An additional 13 participants were excluded from data analysis due to exhibiting a position bias (see Coding section below).

Stimuli

Stimuli included eight recorded bi-syllabic novel words (tanzer, bemkin, japple, daxen, hiplex, foppick, corwit, renkle) spoken by the same female speaker using adult-directed speech and neutral (list-like) prosody. Each word was paired with a picture of an uncommon or artificially altered object (see Figure 1) to create eight to-be-learned word-object pairings. Initial pilot data revealed that children exhibited no bias towards learning any particular word-object mapping. Four additional novel word-object pairings were used for a familiarization phase of the experiment. Stimuli were incorporated into an in-house computer application used to control stimulus presentation on a 17-in. monitor connected to a Power Mac G5. An add-on touch screen was mounted onto the monitor to allow children to advance trials and make selections at test.

Figure 1.

Figure 1

Pictures of novel/altered objects employed in Experiments 1 and 2

Design

Children were randomly assigned to one of three conditions varying in the contextual diversity of the learning environment: High Contextual Diversity (High CD), Moderate Contextual Diversity (Moderate CD), and Low Contextual Diversity (Low CD). In all three conditions, a word co-occurred with its referent in a total of four trials. In each learning trial, a second word and picture was also presented, with the correspondence between words and pictures ambiguous on any given trial. Conditions differed in the number of different distractors with which a word co-occurred across the four trials, as well as the frequency with which a given distractor co-occurred with a word. In the High CD condition for any given word-picture pairing, the accompanying word-picture pair was different on each of the four trials. In the Moderate CD condition, word-picture pairings co-occurred with one word-picture pairing on two trials, and two other word-picture pairings on the other two trials, resulting in less diversity across trials. Finally, in the Low CD condition, word-picture pairings co-occurred with one word-picture pairing three times, and another word-picture pairing once, so the diversity across trials was low (for a more detailed depiction of the co-occurrence structure of each condition, see Appendix).

Procedure

Children sat in front of the touch-screen computer next to the experimenter with a video camera positioned over the children's shoulder. The experimenter employed a ladybug puppet named “Lulu the Ladybug” and introduced the experiment as a game with the goal of learning Lulu's names for her favorite toys. Children completed a familiarization phase followed by a learning phase and a test phase. The goal of the familiarization phase was to introduce children to the experimental setting and to the general goal of learning Lulu's names for her toys. This procedure was thoroughly piloted and adapted to optimize task comprehensibility for children. There were three parts to the familiarization phase. First, the twelve pictures (the 8 to-be learned pictures and 4 additional pictures) were displayed simultaneously on the computer screen. Then, the twelve novel words that corresponded to each of the pictures were played in a random order. The experimenter then told children, “We are going to learn which name goes with which picture”. In the second step of the familiarization phase, the experimenter explicitly taught children the name of two of the pictures (that were not going to be part of the experiment proper). One of these pictures was presented on the computer screen and its corresponding word was played. This was repeated for the second picture. Then, children's learning of those two words was tested in two four-alternative forced-choice (4AFC) trials. In these trials, the same four pictures (the two labeled pictures and two unlabeled pictures) appeared simultaneously on the computer screen. In the first trial, the first novel word was played and children were asked to make a choice by touching the picture they thought went with the word. A second 4AFC trial tested children's learning of the referent of the second novel word. Correct selections were reinforced by the experimenter's clapping and a rewarding audio clip (applause and cheering). For any incorrect selections, children were asked to make a different selection until they were correct. The two novel word-picture pairings taught during the familiarization phase as well as the two distractors present in the 4AFC test trials of the familiarization phase did not appear during the experiment proper. The goal of the familiarization phase was simply to familiarize children with the game of learning words. Given that there was no referential ambiguity in the learning of words during this phase, the familiarization phase was not likely to “train” children on how to learn words cross-situationally. However, it did familiarize children with the experimental setting, the touch screen, the 4AFC task and the task goal of learning new words1.

Following the familiarization phase, children proceeded immediately to the learning phase. At this time, the experimenter said, “Now, we are going to learn all of the names of Lulu's other toys”. In each trial of the learning phase, children saw two pictures, one on each side of the monitor. Children also heard two spoken words, played sequentially in random order, corresponding to the two pictures (see Figure 2A).

Figure 2.

Figure 2

Sample trial structure in this paradigm: a learning trial (A) and a test trial (B).

Each of the eight to-be-learned word-object pairings occurred on four trials throughout the learning phase. Given that these 32 instances of word-object pairings were presented two at a time on each trial, the learning phase consisted of 16 total learning trials. Two training lists were created with the order of the trials pseudo-randomized such that each of the eight word-object pairings appeared once before any given pairing was repeated and no word-object pairing appeared in back-to-back trials, consistent with Yu and Smith's original adult paradigm. Which training list was used was counterbalanced across participants.

The test phase immediately followed the learning phase and consisted of eight 4AFC test trials, one per target word. In each trial, four pictures appeared simultaneously, one in each quadrant, followed by the presentation of the target word (see Figure 2B). The child indicated which picture she thought went with the target word by touching the picture on the screen. Children received no feedback during the test phase. All test trials were constructed by selecting the target word's corresponding picture and 3 foils randomly selected from the set of objects that had never co-occurred with the target word during the learning phase. All pictures served as foils an equal number of times. Two test lists were created and were identical across conditions. Which test list was employed was counterbalanced across participants.

Since none of the foils had co-occurred with the target word during the learning phase, this testing regimen was not designed to detect whether children definitively mapped the target word to its correct referent. Instead, this procedure simply tested children's sensitivity to whether the word had co-occurred with the target picture at all during the learning phase. Although this testing regimen did not provide the most rigorous test of word mapping within each condition, it did provide a straightforward way to assess the effect of contextual diversity across conditions while keeping the structure of the test trials constant.

Coding

For each test trial, children's choices were automatically registered as correct or incorrect. Children were classified as exhibiting a position bias and excluded from the analysis if they selected the object located in the same quadrant on five or more of the eight trials, which is the point at which through a binomial test, the probability of selecting a single quadrant across 8 independent trials is statistically greater, p < .05, than would be predicted by chance responding.

Results

For each child, we computed the proportion of test trials answered correctly. We then derived a mean proportion correct for each contextual diversity condition. Initial analyses (2×2 ANOVAs) within each condition revealed no effects of training list, testing list, or interaction (smallest p = .09)2, on mean proportion correct. Thus all subsequent analyses were collapsed across training and testing lists. There were also no sex differences in performance, t(82) = −1.03, p = .30, and no correlation between age and mean proportion correct, r(82)= .09, p = .43.

Figure 3 shows the mean proportion correct across conditions of contextual diversity (MHigh = .48, SDHigh = .21; MMod = .39, SDMod = .20; MLow = .34, SDLow = .18). To test whether children demonstrated word learning, for each condition we conducted single-sample t-tests to examine whether mean proportion correct was above the proportion that would be expected from chance performance (.25) in each condition. These tests revealed that learning was significantly higher than chance performance in all three conditions (tHigh(27) = 5.80, dHigh = 1.10; tMod(27) = 3.57, dMod = .67; tLow(27) = 2.77, dLow(27) = .52, all p's <.01). This finding underscores the power of children's cross-situational learning: from only a handful of ambiguous naming events, children mapped words to their referents even when confronted with low levels of contextual diversity.

Figure 3.

Figure 3

Mean proportion correct across levels of contextual diversity. Note: *** p <.001, ** p < .01; Error Bars reflect 95% confidence intervals around the means

To investigate the effect of contextual diversity on learning, we conducted a one-way analysis of variance (ANOVA) on mean proportion correct with condition as a between-subjects factor. As depicted by the downward trend in mean proportion correct across conditions in Figure 3, contextual diversity had a significant effect on performance, F(2, 81) = 3.53, p = .03, η2 = .08. Planned comparisons with Bonferroni correction revealed that the only statistically significant pair-wise difference, was between the High CD and Low CD conditions, t(54)= 2.64 pbonferroni = .03. The difference between the High CD and Moderate CD conditions, t(54)= 1.68, pbonferroni = .25, or between the Moderate CD and Low CD conditions, t(54)=.87, pbonferroni = 1.00, did not reach statistical significance. The finding that increased contextual diversity improves cross-situational word learning is consistent with research with adult learners in a similar paradigm (Kachergis et al., 2009; Suanda & Namy, 2012).

To investigate how representative these group-level results were of individual children's performance, we examined individual patterns of performance, dichotomizing children in each condition as either performing numerically above chance or performing at/below chance (.25). Figure 4 illustrates the proportion of children across conditions that performed above chance. The main patterns of the group level analyses were upheld at the individual level. That is, chi-square goodness-of-fit tests revealed that the proportion of children performing above chance in each condition (PropHigh = .786, PropMod = .643, PropLow = .500) was statistically greater than the proportion that would be predicted from chance performance (.321)3, χ2High = 27.74, pHigh < .001, χ2Mod = 13.31, pMod < .001, χ2Low = 4.12, pLow = .04. Further, a chi-square test of independence revealed a marginally significant effect of contextual diversity condition on the distribution of individual performance, χ2 = 4.97, p = .08. Figure 4 illustrates that the higher the contextual diversity of the learning environment, the greater the proportion of children who performed above chance.

Figure 4.

Figure 4

Proportion of children performing above chance3 across levels of contextual diversity. Note: ***p < .001, *p < .05; Variability estimates of the data were obtained via 200 iterations of bootstrap resampling (error bars reflect 95% confidence intervals around the observed data).

Discussion

In the current experiment, 6-year-old children learned word-to-referent mappings from just a handful of ambiguous naming events, suggesting developmental continuity in cross-situational word learning mechanisms across infancy, childhood and adulthood. Although children demonstrated learning across all conditions of contextual diversity, children's learning patterns clearly indicated that the more diverse the learning contexts, the better the learning. These findings are consistent with previous work suggesting that infants, toddlers and adults are prodigious cross-situational word learners (Scott & Fisher, 2012; Smith & Yu, 2008; Yu & Smith, 2007) and that contextual diversity influences performance in adult learners (Kachergis et al., 2009; Suanda & Namy, 2012). The contextual diversity effect sheds light on the underlying learning strategy children employed in this task. Although the specific mechanism remains unclear, that contextual diversity impacts learning rules out a single-trial learning strategy (K. Smith et al., 2009) as a candidate process, and suggests that the process is one that involves combining co-occurrence information across situations.

The finding that greater diversity in learning contexts aided children's word-to-referent mapping is also consistent with evidence of college students' word learning from written texts (Bolger, Balass, Landen, & Perfetti, 2008), and observational and corpus analyses that connect early language environments to acquisition outcomes (Hills, Maouene, Riordan, & Smith, 2010; Hoff & Naigles, 2002). Further, this finding is in accord with a broader body of evidence suggesting that increasing variability of learning environments improves learning (Gomez, 2002; Hintzman & Stern, 1978; Postman & Knecht, 1983; Rost & McMurray, 2009, 2010; S. Smith, Glenberg, & Bjork, 1978; Verkoijen, Pikers, & Schmidt, 2004).

Although there is abundant evidence demonstrating that contextual diversity helps learning, the precise reason for why it helps is unclear, though at least three hypotheses have been put forward. First, some have argued that increasing variability of learning instances allows for more decontextualized representations (e.g., Apfelbaum & McMurray, 2011). Second, based on earlier memory research, some scholars have argued that contextually diverse learning environments allow for a greater number of potential cues at time of memory retrieval (Bower, 1972, Glenberg, 1979). Finally, Bjork and colleagues have offered an explanation based on the notion of “desirable difficulties” in learning. That is, contextually diverse learning opportunities initially create more difficult individual learning instances due to the mismatch between learning instances. This initial difficulty boosts the strength of learning in the long run, so long as the encoding of individual instances is successful (Bjork, 2011). Thus, a number of potential explanations exist to explain the current findings.

Adding to the puzzle of the basis for contextual diversity effects in learning is the large number of findings across the memory, learning, and language literatures that fail to find a benefit of contextual diversity on learning (Dempster, 1987; Postman & Knecht, 1983; Young & Bellezza, 1982), as well as those that find a benefit for context redundancy on learning outcomes (e.g., Benitez & Smith, 2012, Haryu, Imai, & Okada, 2011; Maguire, Hirsh-Pasek, Golinkoff, & Brandone, 2008; Vlach & Sandhofer, 2011). For example in one study of early verb learning, Maguire and colleagues (2008) presented two-year-olds with an actor performing a novel action coupled with a verb-naming event, “wow, watch her blicking!” Two-year-olds either saw four instances of the same actor blicking, or four instances of four different actors blicking. Maguire and colleagues found that children who were presented with the same actor on all learning instances were better able to extend the label to a novel instance of blicking (i.e., with a novel actor). Thus, although the current findings (see also Kachergis et al., 2009; Suanda & Namy, 2012) clearly demonstrate a positive effect of increased contextual diversity in cross-situational word learning, how they relate to other findings of contextual diversity is a topic for future research.

Experiment 2

Although the results of Experiment 1 demonstrated successful cross-situational word learning, they did not address the strength of the mappings children made from these few ambiguous naming events. Recall that the structure of the test trials in Experiment 1 involved the presentation of a target word, a target object, and three foil objects that had never occurred with the target word during the learning phase. Although this structure allowed for equating the structure of the test trials across conditions and thus served as a conservative test of the effects of contextual diversity on learning, the test structure can only tell us whether subjects had built up some level of word-to-referent association; it does not address the relative strength of this association. In this Experiment, we replicate the Moderate CD condition from Experiment 1 using a more stringent measure of word-referent mapping. Specifically, we examine whether children can determine a word's correct referent from among an array of foils that had also co-occurred with the target word with varying frequency during learning.

Participants

Twenty-eight 5- to 7-year-olds (Mean age = 74.7 mos, Range = 62.5 – 96.5) participated. Fourteen children were female. 78% of children were Caucasian, 13% were African-American, 3% were Asian, and 6% identified as members of other racial categories. 6% of families identified as Hispanic or Latino. All children were native speakers of English.

Stimuli, Design & Procedure

The stimuli, design, procedure, and coding were identical to those in Experiment 1 with the following exceptions. First, there was only one learning condition in the experiment. The contextual diversity of the learning environment was identical to that of the Moderate CD condition of Experiment 1 (see Appendix). The critical difference between Experiments 1 and 2 was the structure of the test trials. In Experiment 1, the foils presented on test trials had never co-occurred with the target word during learning. In Experiment 2, the foils had co-occurred with the target word during learning. Specifically, each of the eight 4AFC test trials was constructed such that the target referent for each word was paired with 3 pseudo-randomly selected foils that included one foil that co-occurred with the target word on two of the four learning trials, one that had co-occurred with the target word once during the learning phase, and a third foil that had never co-occurred with the target word during the learning phase. We were interested in whether children would demonstrate cross-situational word learning under these more difficult test conditions. We were also interested in whether, when children made errors, they were more likely to select foils that co-occurred most often during learning. This speaks to the precision with which children learned word-referent pairings across learning events.

Results & Discussion

As in Experiment 1, we computed the proportion of trials each child answered correctly. Preliminary analyses revealed a significant effect of training list, F(1,24) = 4.56, p = .043, qualified by a significant interaction between training list and test list on mean proportion correct, F(1,24) = 4.56, p = .043, indicating that performance in one training-test list combination was significantly higher than the others. An inspection of the data revealed that this effect was driven primarily by a single child who performed well above the group mean. When this child was removed from the analysis, the training and testing list effect was no longer statistically significant, F(1,23) = 3.28, p = .08). Importantly, the statistical significance of the primary analyses presented below was not markedly altered when this participant's data were removed from analyses. As in Experiment 1, we found no correlation between age and performance, r(26) = −.01, p = .97, and no sex differences, t(26) = −.37, p = .71 in performance.

To examine the extent to which participants learned word-to-referent pairings in this more challenging testing regimen, we performed a single-sample t-test on mean proportion correct against the learning rate that would be expected by chance performance (.25). Results revealed that the mean proportion correct (M = .335; SD = .19) was significantly higher than chance levels, t(27) = 2.41, p = .02, d = .45, suggesting that children successfully mapped words onto their referents. This group-level analysis was supported by an individual-level analysis of performance. A chi-square goodness-of-fit test revealed that the proportion of children performing above chance was significantly higher than what would be predicted by random performance, χ2 = 4.12, p = .04.

To examine the effects of the testing environment on performance, we conducted an independent samples t-test, comparing performance in Experiment 2 to that in the Moderate CD condition of Experiment 1. Results revealed that although the difference between the two samples was in the expected direction (i.e., performance in Experiment 2 where co-occurring foils were present at test was lower, MExp 2 = .33; MExp 1Mod CD = .39), the difference was not statistically significant, t(54) = 1.02, p = .31.

We also examined the nature of children's response patterns, investigating the extent to which the item selected on each trial reflected the co-occurrence frequency between the target word and each of the four pictures in that trial. That is, were children more likely to select objects that co-occurred more frequently with the target word than to select objects that co-occurred less frequently with the target word? Figure 5 illustrates the relation between the likelihood of selecting a particular picture and the word-to-picture co-occurrence frequency. As the figure shows, there is a clear relation between frequency of co-occurrence and item selection: the more frequently a word and picture co-occurred during learning, the more often that picture was selected at test. These findings suggest that, as in previous research with adult learners (Suanda & Namy, 2012; Vouloumanos, 2008), children's mappings reflect the statistical structure of the learning environment. The figure also illustrates that although children selected the target picture more often than predicted by chance, on average children were almost as likely to select the closest co-occurring competitor as they were the target picture. Indeed when we considered only selections to the target picture and the high-frequency co-occurring competitor (61% of all selections), children did not select the target picture (M = .56, SD = .28) more often than what would be predicted by chance (.50), t(27) = 1.19, p = .24. These findings suggest that although children were able to use relative frequency of co-occurrence with the target word as a basis for responding, there appeared to be imprecision in children's mappings.

Figure 5.

Figure 5

Distribution of answers to objects differing in co-occurrence frequency with target word. Error bars reflect 95% confidence intervals around the means.

There are two mapping patterns that could result in the imprecision we observed. First, children may have mapped some target words to the correct, most frequently co-occurring, picture but incorrectly mapped other words to the most frequently co-occurring foil. Alternatively, children may have associated target words (either explicitly or implicitly) with both the target picture and the frequently co-occurring foil. In such a case, these results would reflect the fact that children could not consistently discriminate between the target word's associative strength to the two different pictures. Based on the current data, we cannot distinguish between these two possibilities. We discuss the mechanistic issues raised by this particular finding in greater detail below.

General Discussion

In the current study, we investigated the extent to which young children could learn word-to-referent mappings from a small number of naming events involving ambiguous reference. The results of two experiments suggest that children, like infant (Smith & Yu, 2008, 2013) and adult learners (Yu & Smith, 2007), can learn words when the only clues to reference are the cross-situational word-to-reference co-occurrence statistics. The findings from Experiment 1 revealed that children's learning, like adults' learning (see Kachergis et al. 2009; Suanda & Namy, 2012), is affected by the contextual diversity of the learning environment. The findings from Experiment 2 revealed the robustness of children's cross-situational word learning and suggested that children, like adults (see Suanda & Namy, 2012; Vouloumanos, 2008), exhibited response patterns that reflected the co-occurrence statistics of the learning environment.

Much of the developmental research on cross-situational word learning has focused on infant and toddler populations (Akhtar & Montague, 1999; Scott & Fisher, 2012; Smith & Yu, 2008; 2013; Yu & Smith, 2011; but see Piccin & Waxman, 2007; Werner & Kaplan, 1950, for notable exceptions). This focus on young word learners is warranted: to the extent that cross-situational word learning is a viable candidate process that gets word learning off the ground, it is important to demonstrate the availability of this learning process in the youngest of word learners. But what about the role of cross-situational word learning in later vocabulary development? The current results highlight the availability of a powerful statistical word learning capacity in children and thus suggest the possibility that cross-situational learning may play an important role in word learning across development. Of course just because children can exploit the co-occurrence statistics in the service of word learning in the current task, it does not mean that children deploy this capacity in their everyday word learning. Interestingly, however, there is some evidence from the education and reading studies literatures that suggest the potential use for a cross-situational word learning mechanism during middle, as well as early childhood. That is, many words learned during this phase of development are acquired unintentionally in reading (Gordon, Schumm, Coffland, & Doucette, 1992; Nagy, Anderson, & Herman, 1987; Nagy, Herman, & Anderson 1985; Shu, Anderson, & Zhang, 1995) or listening (Elley, 1989; Robbins & Ehri, 1994) contexts. Research on this type of word learning, known as incidental word learning (Nagy et al., 1985), suggests that in these contexts, children benefit from multiple exposures to words, and that a single exposure is rarely sufficient for learning (Horst, Parsons, & Bryan, 2011; Jenkins, Stein, & Wysocki, 1984; Robbins & Ehri, 1994). Further research is needed to investigate the extent to which findings such as these on school-aged children's vocabulary acquisition is linked to the types of processes that underlie the cross-situational learning reported in the current study. This step is of particular importance considering that children's real-world learning environments are substantially more complex than the learning environments presented in artificial laboratory tasks, such as the ones described here.

That children's learning in the current study showed similar patterns to adults' learning in previous work (i.e., both are sensitive to contextual diversity, see Kachergis et al., 2009; Suanda & Namy, 2012; and both show response patterns that reflect the statistics in the input, see Suanda & Namy, 2012; Vouloumanos, 2008) suggests continuity across development in the underlying mechanisms of cross-situational word learning. This finding is in accord with a number of statistical language learning studies in other aspects of language acquisition, such as speech segmentation (Saffran et al., 1997) and sequence learning (Meulemans, Van der Linden, & Perruchet, 1998), that also find similarities between adults' and children's statistical learning. This does not imply, of course, that child learning patterns consistently conform to adult patterns across all aspects of language acquisition. Indeed, one hallmark of language in general is that it is more readily (and therefore differently) acquired by children than adults. For example, in their work on grammar regularization, Hudson Kam and Newport find that adults and children exhibit different patterns in their regularization of a probabilistic determiner system with children more likely to regularize than their adult counterparts (Hudson Kam & Newport, 2005; 2009).

Of course even when researchers do find behavioral similarities between adult and child learners, as we did here, it is not a given that they are driven by the same underlying processes. This issue is particularly pertinent in the context of cross-situational word learning given that researchers have argued and demonstrated computationally that multiple mechanisms can readily account for many of the existing cross-situational learning findings (see Trueswell et al., 2013; K. Smith et al., 2011; Yu & Smith, 2012). More specifically, the capacity to capitalize on cross-situational regularities in word learning can be explained by two distinct computational processes. According to an associative learning account of cross-situational learning, when children encounter a new word in an ambiguous naming event, they encode the connection between this word and multiple, if not all, co-occurring objects. As children encounter this word in subsequent naming events, they strengthen previous connections as well as create new connections. This process of aggregating cross-situational co-occurrence statistics yields successful learning because, on average, the strength of the associations between words and their referents will be stronger than incorrect mappings based on frequency of word-object co-occurrence (Yu, 2008; Yu & Smith, 2012).

Alternatively, according to a hypothesis testing account of learning, cross-situational word learning occurs because children pick out a specific object as a hypothesized referent of the word when they encounter a new word in an ambiguous naming event. As children encounter this word in a subsequent naming event, they either confirm the hypothesis if it is consistent with the event (i.e., the initially hypothesized referent is also present in the subsequent event) or reject and replace the hypothesis if it is inconsistent with the event (i.e., the initially hypothesized referent is not present in the subsequent event). This process of confirming or replacing hypothesized word-to-referent mappings should yield successful cross-situational word learning based on the logic of probability: more frequently co-occurring word-to-object pairs will be more likely selected as word-to-referent hypotheses than infrequently co-occurring word-to-object pairs (see Medina et al., 2011; Trueswell et al., 2013).

The current studies, modeled after the original adult paradigm (Yu & Smith, 2007), do not speak to which of these accounts best characterize children's learning. In a thoughtful discussion of these two accounts, Yu and Smith (2012) argued that distinguishing between them is not straightforward and that deciding which characterizes learning may not be a productive route to understanding the phenomenon. This is because each of these accounts can be decomposed into several processes (e.g., the amount of information learners select from the environment and the nature of the information retrieved from memory), and that depending on the configuration of the proposed processes, one account may look indistinguishable from the other (see also Smith, Suanda, & Yu, 2014).

Recent empirical work with adult learners also supports the rejection of a simple either-or approach to understanding cross-situational word learning. K. Smith and colleagues (2011), for example, have revealed that adults' cross-situational learning strategies shift as a function of the nature of the learning task (specifically the degree of referential uncertainty): when referential uncertainty is low, learning approximates associative learning; when referential uncertainty is high, learning approximates hypothesis testing. Additionally, Romberg and Yu (2014) have demonstrated, using a learning paradigm that encourages learners to engage in both hypothesis testing and associative learning (arguably a better model of real-world word learning), that there is mutual influence between testing hypotheses and aggregating associations. These recent studies thus underscore the likely complex interplay of multiple mechanisms behind cross-situational word learning. Future modifications to our current paradigm (e.g., varying task difficulty or adopting a continuous learning paradigm) should help shed light onto what cocktail of mechanisms underlie children's cross-situational word learning.

In conclusion, the current results provide the first empirical evidence that school-aged children can learn word-to-referent mappings across multiple ambiguous naming events. These results suggest statistical word learning as a candidate learning mechanism for later phases of vocabulary acquisition as well as earlier ones. The current endeavor not only demonstrates that children can learn words cross-situationally, but also provides important evidence regarding developmental consistency in the learning mechanism by demonstrating that some of the same factors previously shown to influence adults' learning also influences children's learning.

Highlights.

  • -

    Six-year-old children show rapid word learning across ambiguous situations

  • -

    Children's cross-situational learning show context diversity effects

  • -

    Children's cross-situational choice patterns reflect the statistical structure of the learning environment

  • -

    Children show learning patterns similar to those previously found with adult learners

Acknowledgments

We thank all the families who volunteered to make this research possible. We thank Lynne Nygaard, Rob Hampton, Phil Wolff, and Stella Lourenco for helpful comments on this project. We also thank Maggie Dancel, Pat Finan, Ann Marie Finley, Arielle Friehling, Katie Kimura, and Carlie Marks for their assistance with subject recruitment and coding. A portion of the stimulus images used is courtesy of Michael J. Tarr, Center for the Neural Basis of Cognition and Department of Psychology, Carnegie Mellon University, http://www.tarrlab.org/. This research was part of the first author’s doctoral dissertation submitted to Emory University and portions of this research were presented at the Biennial Meeting of the Society for Research in Child Development, 2011, and the Biennial Meeting of the Cognitive Development Society, 2011. This research was supported by an NSF GRF awarded to SHS, by SIRE grants from Emory College awarded to NM and LLN, and in part by NICHD grant number 5-R03-HD058777-02 awarded to LLN. The material in this article is based in part on work done while the third author was employed as a National Science Foundation Program Director. Any opinion, findings, and conclusions expressed in this article are those of the authors and do not necessarily reflect the views of the U.S. National Science Foundation

Appendix

Association Matrices representing word – picture co-occurrence frequencies across conditions

graphic file with name nihms-612234-f0001.jpg

The above table illustrates the total frequencies with which words (columns) co-occurred with different pictures (rows) in each condition. To illustrate co-occurrence patterns in the High CD condition, Word 1 (W1) co-occurred with its referent (Picture 1 – P1) on all four trials in which W1 occurred. W1–P1 was accompanied by W2–P2 on one of those trials, W3–P3 on a different trial, W4–P4 on another trial, and W5–P5 on yet another trial, resulting in maximal contextual diversity

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

1

This familiarization procedure constitutes a departure from the Yu & Smith (2007) adult paradigm. Given previous work suggesting that pre-exposure to objects improves word learning in two-year-olds (Kucker & Samuelson, 2012), it is possible that this procedure had similar effects on 5- to 7-year-old learners. Importantly for the current contexts though, there is no reason to suspect that the familiarization procedure would differentially impact some conditions of contextual diversity more than others.

2

This was observed in a test of the training list effect in the Moderate CD condition: F(1,24) = 3.154, p = .09. All other preliminary analyses had p-values greater than .25.

3

Chance for this analysis was defined as the proportion of times one would expect to see a participant perform above chance (.25) if participants randomly guessed on each trial. This chance value was derived from multiplying the likelihood of each response pattern given random guessing (i.e., the likelihood of getting exactly 3 trials correct, 4 trials correct, etc.) and the number of permutations of each response pattern that is above chance (e.g., there are exactly 56 different ways in which a random performer would get exactly 3 trials correct = trials 1,2,3 are correct, trials 1,2,4 are correct, trials 1, 2, 5 are correct, etc.).

References

  1. Akhtar N, Montague LJ. Lexical acquisition: The role of cross-situational learning. First Language. 1999;19:347–358. [Google Scholar]
  2. Anglin JM. Vocabulary development: A Morphological analysis. Monographs of the Society for Research in Child Development. 1993;58(10):v–238. [Google Scholar]
  3. Apfelbaum KS, McMurray B. Using variability to guide dimensional weighting: Associative mechanisms in early word learning. Cognitive Science. 2011;35:1105–1138. doi: 10.1111/j.1551-6709.2011.01181.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Benitez VL, Smith LB. Predictable locations aid early object name learning. Cognition. 2012;125:339–352. doi: 10.1016/j.cognition.2012.08.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bjork RA. On the symbiosis of learning, remembering, and forgetting. In: Benjamin AS, editor. Succesfull remembering and successful forgetting: A Festchrift in honor of Robert A. Bjork. Psychology Press; London, UK: 2011. pp. 1–22. [Google Scholar]
  6. Bloom P. How children learn the meanings of words. The MIT Press; Cambridge, MA: 2000. [Google Scholar]
  7. Bolger DJ, Balass M, Landen E, Perfetti CA. Context variation and definitions in learning the meanings of words: An Instance-based learning approach. Discourse Processes. 2008;45:122–159. [Google Scholar]
  8. Bower GH. Stimulus-sampling theory of encoding variability. In: Melton AW, Martin E, editors. Coding Processes in Human Memory. Winston & Sons; New York, NY: 1972. pp. 85–124. [Google Scholar]
  9. Braine MDS, Brody RE, Brooks PJ, Sudhalter V, Ross JA, Catalano L, Fisch SM. Exploring language acquisition in children with a miniature artificial language: Effects of item and pattern frequency, arbitrary subclasses, and correction. Journal of Memory and Language. 1990;29:591–610. [Google Scholar]
  10. Carey S. The child as word-learner. In: Halle M, Bresnan J, Miller GA, editors. Linguistic theory and psychological reality. MIT Press; Cambridge, MA: 1978. pp. 64–293. [Google Scholar]
  11. Carey S, Bartlett E. Acquiring a single new word. Papers and Reports on Child Language Development. 1978;15:17–29. [Google Scholar]
  12. Cunillera T, Laine M, Camara E, Rodriguez-Fornells A. Bridging the gap between speech segmentation and word-to-world mappings: Evidence from an audiovisual statistical learning task. Journal of Memory and Language. 2010;63:295–305. [Google Scholar]
  13. Dempster FN. Effects of variable encoding and spaced presentations on vocabulary learning. Journal of Educational Psychology. 1987;79:162–170. [Google Scholar]
  14. Elley WB. Vocabulary acquisition from listening to stories. Reading Research Quarterly. 1989;24:174–187. [Google Scholar]
  15. Ferman S, Karni A. No childhood advantage in the acquisition of skill using an artificial language rule. PLoS ONE. 2010;5:1–10. doi: 10.1371/journal.pone.0013648. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Fitneva SA, Christiansen MH. Looking in the wrong direction correlates with more accurate word learning. Cognitive Science. 2011;35:367–380. doi: 10.1111/j.1551-6709.2010.01156.x. [DOI] [PubMed] [Google Scholar]
  17. Gillette J, Gleitman H, Gleitman LR, Lederer A. Human simulations of vocabulary learning. Cognition. 1999;73:135–176. doi: 10.1016/s0010-0277(99)00036-0. [DOI] [PubMed] [Google Scholar]
  18. Glenberg AM. Component-levels theory of the effects of spacing of repetitions on recall and recognition. Memory & Cognition. 1979;7:95–112. doi: 10.3758/bf03197590. [DOI] [PubMed] [Google Scholar]
  19. Gomez RL. Variability and detection of invariant structure. Psychological Science. 2002;13:431–436. doi: 10.1111/1467-9280.00476. [DOI] [PubMed] [Google Scholar]
  20. Gordon J, Schumm JS, Coffland C, Doucette M. Effects of inconsiderate versus considerate text on elementary students' vocabulary learning. Reading Psychology. 1992;13:157–169. [Google Scholar]
  21. Haryu E, Imai M, Okada H. Object similarity bootstraps young children to action-based verb extension. Child Development. 2011;82:674–686. doi: 10.1111/j.1467-8624.2010.01567.x. [DOI] [PubMed] [Google Scholar]
  22. Hills TT, Maouene J, Riordan B, Smith LB. The associative structure of language: Contextual diversity in early word learning. Journal of Memory and Language. 2010;63:259–273. doi: 10.1016/j.jml.2010.06.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Hintzman DL, Stern LD. Contextual variability and memory for frequency. Journal of Experimental Psychology: Human Learning and Memory. 1978;4:539–549. [Google Scholar]
  24. Hoff E, Naigles L. How children use input to acquire a lexicon. Child Development. 2002;73:418–433. doi: 10.1111/1467-8624.00415. [DOI] [PubMed] [Google Scholar]
  25. Hollich GJ, Hirsh-Pasek K, Golinkoff RM, Brand RJ, Brown E, Chung H, L…Rocroi C. Breaking the language barrier: An Emergentist coalition model for the origins of word learning. Monographs of the Society for Research in Child Development. 2000;65(3):v–262. [PubMed] [Google Scholar]
  26. Horst JS, Parsons KL, Bryan NM. Get the story straight: contextual repetition promotes learning from storybooks. Frontiers in Psychology. 2011;2:17. doi: 10.3389/fpsyg.2011.00017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Hudson Kam CL, Newport EL. Regularizing unpredictable variation: The roles of adult and child learners in language formation and change. Language Learning and Development. 2005;1:151–195. [Google Scholar]
  28. Hudson Kam CL, Newport EL. Getting it right by getting it wrong: When learners change languages. Cognitive Psychology. 2009;59:30–66. doi: 10.1016/j.cogpsych.2009.01.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Janacsek K, Fiser J, Nemeth D. The best time to acquire new skills: age-related difference in implicit sequence learning across the human lifespan. Developmental Science. 2012;15:496–505. doi: 10.1111/j.1467-7687.2012.01150.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Jenkins JR, Stein ML, Wysocki K. Learning vocabulary through reading. American Educational Research Journal. 1984;21:767–787. [Google Scholar]
  31. Kachergis G, Yu C, Shiffrin RM. Frequency and contextual diversity effects in cross-situational word learning. In: Taatgen N, van Rijn H, editors. Proceedings of the 31st Annual Conference of the Cognitive Science Society; Austin, TX: Cognitive Science Society; 2009. pp. 2220–2225. [Google Scholar]
  32. Kachergis G, Yu C, Shiffrin RM. An associative model of adaptive inference for learning word-referent mappings. Psychonomic Bulletin & Review. 2012;19:317–324. doi: 10.3758/s13423-011-0194-6. [DOI] [PubMed] [Google Scholar]
  33. Kachergis G, Yu C, Shiffrin RM. Actively learning object names across ambiguous situations. Topics in Cognitive Science. 2013;5:200–213. doi: 10.1111/tops.12008. [DOI] [PubMed] [Google Scholar]
  34. Kucker SC, Samuelson LK. The first slow step: differential effects of object and word-form familiarization on retention of fast-mapped words. Infancy. 2012;17:295–323. doi: 10.1111/j.1532-7078.2011.00081.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Maguire MJ, Hirsh-Pasek K, Golinkoff RM, Brandone AC. Focusing on the relation: Fewer exemplars facilitate children's initial verb learning and extension. Developmental Science. 2008;11:628–634. doi: 10.1111/j.1467-7687.2008.00707.x. [DOI] [PubMed] [Google Scholar]
  36. Markson L, Bloom P. Evidence against a dedicated system for word learning in children. Nature. 1997;385:813–815. doi: 10.1038/385813a0. [DOI] [PubMed] [Google Scholar]
  37. Medina T, Snedeker J, Trueswell JC, Gleitman LR. How words can and cannot be learned by observation. Proceedings of the National Academy of Sciences of the United States of America. 2011;108:9014–9019. doi: 10.1073/pnas.1105040108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Meulemans T, Van der Linden M, Perruchet P. Implicit sequence learning in children. Journal of Experimental Child Psychology. 1998;69:199–221. doi: 10.1006/jecp.1998.2442. [DOI] [PubMed] [Google Scholar]
  39. Monaghan P, Mattock K. Integrating constraints for learning word-referent mappings. Cognition. 2012;123:133–143. doi: 10.1016/j.cognition.2011.12.010. [DOI] [PubMed] [Google Scholar]
  40. Nagy WE, Anderson RC, Herman PA. Learning word meanings from context during normal reading. American Educational Research Journal. 1987;24:237–270. [Google Scholar]
  41. Nagy WE, Herman PA, Anderson RC. Learning words from context. Reading Research Quarterly. 1985;20:233–253. [Google Scholar]
  42. Piccin TB, Waxman SR. Why nouns trump verbs in word learning: New evidence from children and adults in the human simulation paradigm. Language Learning and Development. 2007;3:295–323. [Google Scholar]
  43. Postman L, Knecht K. Encoding variability and retention. Journal of Verbal Learning and Verbal Behavior. 1983;22:133–152. [Google Scholar]
  44. Robbins C, Ehri LC. Reading storybooks to kindergartnets helps them learn new vocabulary words. Journal of Educational Psychology. 1994;86:54–64. [Google Scholar]
  45. Romberg AR, Yu C. Interactions between statistical aggregation and hypothesis testing mechanisms during word learning. In: Bello P, Guarini M, McShane M, Scassellati B, editors. Proceedings of the 36th Annual Conference of the Cognitive Science Society; Austin, TX: Cognitive Science Society; 2014. [Google Scholar]
  46. Rost GC, McMurray B. Speaker variability augments phonological processing in early word learning. Developmental Science. 2009;12:339–349. doi: 10.1111/j.1467-7687.2008.00786.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Rost GC, McMurray B. Finding the signal by adding noise: The role of noncontrastive phonetic variability in early word learning. Infancy. 2010;15:608–635. doi: 10.1111/j.1532-7078.2010.00033.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Saffran JR. The use of predictive dependencies in language learning. Journal of Memory and Language. 2001;44:493–515. [Google Scholar]
  49. Saffran JR. Constraints on statistical language learning. Journal of Memory and Language. 2002;47:177–196. [Google Scholar]
  50. Saffran JR, Newport EL, Aslin RN, Tunick RA, Barrueco S. Incidental language learning: Listening (and learning) out of the corner of your ear. Psychological Science. 1997;8:101–105. [Google Scholar]
  51. Scott RM, Fisher C. 2.5-Year-olds use cross-situational consistency to learn verbs under referential uncertainty. Cognition. 2012;122:163–180. doi: 10.1016/j.cognition.2011.10.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Shu H, Anderson RC, Zhang H. Incidental learning of word meanings while reading: A Chinese and American cross-cultural study. Reading Research Quarterly. 1995;30:76–95. [Google Scholar]
  53. Smith K, Smith ADM, Blythe RA. Reconsidering human cross-situational learning: A Revision to Yu & Smith's (2007) experimental paradigm. In: Taatgen N, van Rijn H, editors. Proceedings of the 31st Annual Conference of the Cognitive Science Society; Austin, TX: Cognitive Science Society; 2009. pp. 2711–2716. [Google Scholar]
  54. Smith K, Smith ADM, Blythe RA. Cross-situational learning: An Experimental study of word-learning mechanisms. Cognitive Science. 2011;3:480–498. [Google Scholar]
  55. Smith LB, Suanda SH, Yu C. The unrealized promise of infant statistical word-referent learning. Trends in Cognitive Sciences. 2014;18:251–258. doi: 10.1016/j.tics.2014.02.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Smith LB, Yu C. Infants rapidly learn word-referent mappings via cross-situational statistics. Cognition. 2008;106:1558–1568. doi: 10.1016/j.cognition.2007.06.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Smith LB, Yu C. Visual attention is not enough: Individual differences in statistical word-referent learning in infants. Language Learning and Development. 2013;9:25–49. doi: 10.1080/15475441.2012.707104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Smith SM, Glenberg AM, Bjork RA. Environmental context and human memory. Memory & Cognition. 1978;6:342–353. [Google Scholar]
  59. Snedeker J. Word learning. In: Squire LR, editor. Encyclopedia of neuroscience. Elsevier; Amsterdam, Netherlands: 2009. pp. 503–508. [Google Scholar]
  60. Suanda SH, Namy LL. Detailed behavioral analysis as a window into cross-situational word learning. Cognitive Science. 2012;36:545–559. doi: 10.1111/j.1551-6709.2011.01218.x. [DOI] [PubMed] [Google Scholar]
  61. Swingley D. Fast mapping and slow mapping in children's word learning. Language Learning and Development. 2010;6:179–183. [Google Scholar]
  62. Trueswell JC, Medina T, Hafri A, Gleitman LR. Propose but verify: Fast mapping meets cross-situational word learning. Cognitive Psychology. 2013;66:126–156. doi: 10.1016/j.cogpsych.2012.10.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Verkoeijen PPJL, Pikers RMJP, Schmidt HG. Detrimental influence of contextual change on spacing effects in free recall. Journal of Experimental Psychology: Learning, Memory, and Cognition. 2004;30:796–800. doi: 10.1037/0278-7393.30.4.796. [DOI] [PubMed] [Google Scholar]
  64. Vlach HA, Sandhofer CM. Retrieval dynamics and retention in cross-situational word learning. Cognitive Science. 2014;38:757–774. doi: 10.1111/cogs.12092. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Vlach HA, Sandhofer CM. Developmental differences in children's context dependent word learning. Journal of Experimental Child Psychology. 2011;108:394–401. doi: 10.1016/j.jecp.2010.09.011. [DOI] [PubMed] [Google Scholar]
  66. Vlach HA, Johnson SP. Memory constraints on infants' cross-situational statistical learning. Cognition. 2013;127:375–382. doi: 10.1016/j.cognition.2013.02.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Vouloumanos A. Fine-grained sensitivity to statistical information in adult word learning. Cognition. 2008;107:729–742. doi: 10.1016/j.cognition.2007.08.007. [DOI] [PubMed] [Google Scholar]
  68. Vouloumanos A, Werker JF. Infants' learning of novel words in a stochastic environment. Developmental Psychology. 2009;45:1611–1617. doi: 10.1037/a0016134. [DOI] [PubMed] [Google Scholar]
  69. Werner H, Kaplan E. Development of word meaning through verbal context: An experimental study. The Journal of Psychology. 1950;29:251–257. [Google Scholar]
  70. Woodward AL, Markman EM. Early word learning. In: Damon W, Kuhn D, Siegler RS, editors. Handbook of child psychology (Vol. 2): Cognition, perception, and language. John Wiley & Sons; New York, NY: 1998. pp. 371–420. [Google Scholar]
  71. Yoshida KA, Rhemtulla M, Vouloumanos A. Exclusion constraints facilitate statistical word learning. Cognitive Science. 2012;36:933–947. doi: 10.1111/j.1551-6709.2012.01260.x. [DOI] [PubMed] [Google Scholar]
  72. Young DR, Bellezza FS. Encoding variability, memory organization, and the repetition effect. Journal of Experimental Psychology: Learning, Memory, and Cognition. 1982;8:545–559. [Google Scholar]
  73. Yu C. A Statistical associative account of vocabulary growth in early word learning. Language Learning and Development. 2008;4:32–62. [Google Scholar]
  74. Yu C, Smith LB. Rapid word learning under uncertainty via cross-situational statistics. Psychological Science. 2007;18:414–420. doi: 10.1111/j.1467-9280.2007.01915.x. [DOI] [PubMed] [Google Scholar]
  75. Yu C, Smith LB. What you learn is what you see: using eye movements to study infant cross-situational word learning. Developmental Science. 2011;14:165–180. doi: 10.1111/j.1467-7687.2010.00958.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Yu C, Zong Y, Fricker D. Selective attention in cross-situational statistical learning: evidence from eye tracking. Frontiers in Psychology. 2012;3:148. doi: 10.3389/fpsyg.2012.00148. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Yurovsky D, Yu C, Smith LB. Statistical speech segmentation and word learning in parallel: Scaffolding from child-directed speech. Frontiers in Language Sciences. 2012;3:374. doi: 10.3389/fpsyg.2012.00374. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Yurovsky D, Yu C, Smith LB. Competitive processes in cross-situational word learning. Cognitive Science. 2013;37:891–921. doi: 10.1111/cogs.12035. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES