Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2019 Jan 9.
Published in final edited form as: Linguist Approaches Biling. 2015;5(4):494–498. doi: 10.1075/lab.5.4.09lew

Infants’ history of distributional learning in real time

Casey Lew-Williams 1
PMCID: PMC6326367  NIHMSID: NIHMS955189  PMID: 30636995

Phillips and Ehrenhofer (2015), henceforth P&E, are doing the field a favor. They link adult psycholinguistic research not only to developmental psycholinguistic research, but to research on language learning that emerges from the psychological tradition. A range of scientists — those who get out of bed in the morning to study language, to study development, and/or to study the cognitive science of learning — will find important insights embedded throughout the authors’ discussion of how language processing informs language acquisition. The title of the article broadly refers to language acquisition, but the content is focused primarily on the level of syntax and semantics. This spotlight is by no means problematic, but it invites my main response: for a complete account of the relation between language processing and language acquisition, we need elaboration on the role of infancy in staging the emergence of language processing.

In the first months of life, language processing happens as infants gradually accumulate information about distributions of sounds, syllables, and words. ‘Statistical learning’ is at best a meta-theoretic perspective, and at worst an atheoretical lab observation, but we have preliminary evidence for its relevance to semi-natural learning contexts (e.g., Frank, Tenenbaum, & Gibson, 2013; Hay, Pelucchi, Graf Estes, & Saffran, 2011; Kurumada, Meylan, & Frank, 2013; Lew-Williams, Pelucchi, & Saffran, 2011). The target article culminates in a discussion of why (typically developing) children are well-rounded and high-achieving language learners, and to do so, the authors turn to research on adult second language learners. Adult learners, the P&E appropriately suggest, have early successes in language processing that lock them into routines and block them from learning new information (see Arnon & Ramscar, 2012). This is followed by a consideration of children’s relatively weak memory abilities, inferior reanalysis abilities, and immature prediction abilities as likely culprits for the lack of a comparable trajectory in early development. While children’s limitations in revising initial interpretations represent a critical part of the story, another angle is needed to complement the narrative: infants’ and toddlers’ early successes in language processing may not block but instead open doors to learning new information. They may not have the same difficulties as adult learners because they get to exploit a history of co-occurrence information; what infants do for a living is engage in a temporally protracted process of input-based pattern extraction, both within and across the auditory and visual systems. And at some point on a currently unspecified timetable, they graduate beyond simple patterns and form generalizations over the input (see Marcus, Fernandes, & Johnson, 2007). For adult learners, complex sentences come in at a non-optimal pace (perhaps up to 5 words per second), and language learning in traditional classroom contexts hinges on the testimony of an instructor. But for child learners, the input is often friendly enough to guide them toward reasonably efficient interpretations of the signal. New information and new structures are not necessarily a burden; instead, novelty iteratively rests on a wealth of accumulated distributional learning.

In this sense, it seems inaccurate to conclude that “children’s language processing abilities give them no obvious advantage over adult learners” (p. 413). Young children do have severe limitations in their abilities to form predictions over complex sequences of words, for a variety of reasons: they have two instead of 20 years of language experience, they have immature nervous systems, and they lack an optimal incorporation of explicit strategizing into the task of learning. But their language processing abilities also have a historically based statistical advantage over adult learners. While they are only capable of using the most common adjectives (Fernald, Thorpe, & Marchman, 2010), verbs (Fernald, Zangl, Portillo, & Marchman, 2008), and articles (Lew-Williams & Fernald, 2007) to ‘get ahead of ’ the input (prediction abilities that may only be apparent in distilled lab simulations), their attention to local relations between sounds, syllables, and words gradually gives rise to parsing and adult-like prediction. P&E suggest that accurate sentence analysis is a first step in ensuring eventual proficiency (p. 431), but this statement skips over the foundational ‘skill-building’ that happens in infancy. More attention is needed to the fact that young children spend many months discovering relevant structure in patterns of sounds, syllables, and words in the input. Learning about the distributional properties of words is just the beginning of becoming a native listener, and over time, infants and toddlers apply their knowledge of distributions to tasks such as recognizing referents and processing words in combination. We now have growing evidence for this continuity (Conway, Bauernschmidt, Huang, & Pisoni, 2010; Graf Estes, Evans, Alibali, & Saffran, 2007; Lany, 2014; Misyak, Christiansen, & Tomblin, 2010).

Importantly, different caregivers provide different access to the units of language and the relations between them (Hart & Risley, 1995; Hoff & Naigles, 2002; Huttenlocher, Vasilyeva, Cymerman, & Levine, 2002; Huttenlocher, Waterfall, Vasilyeva, Vevea, & Hedges, 2010; Rowe, 2008; Weisleder & Fernald, 2013). But in considering why some linguistic phenomena are hard to learn and why children generally fare better than adults, P&E downplay the potential usefulness of scrutinizing variability in the input. They propose that delayed learning of a specific linguistic phenomenon probably does not result from variability in exposure to that phenomenon. Why? Because we see surprising consistency across children in age of mastery, which is problematic for input-based accounts and necessitates the idea that learner-internal changes “could allow them to confirm or disconfirm regularities that they were previously unable to test” (p. 438). This is an excellent insight about the dynamic interplay between cognition and experience with language, and there is certainly no one-to-one mapping between amount of input and timing of learning a specific linguistic phenomenon. But there should still be a correspondence between the two, and the deployment of advanced cognitive skills may hinge on having received rich input. Children from wealthy families get approximately 3.5 times more language experience per unit time, on average, relative to children from poor families (Hart & Risley, 1995). Moreover, according to my crude estimates of time-spent-in-the-classroom, adult second language learners may fare even worse. These input estimates do not necessarily speak to cases like anaphora, but dissociating global from specific input is odd, given that global input is comprised of diverse exposure to specific linguistic phenomena. And thus it is premature to conclude that variation in learning speed is less dependent on input variability than on learner-internal variability. Researchers interested in infant language, socioeconomic status, early intervention, and the achievement gap have barely begun to tackle the importance or irrelevance of overall input quantity. The field is currently preoccupied with debating the high-level issue of input quantity vs. quality as the engine behind children’s divergent learning trajectories, and very few studies have focused specifically on syntax. One exception is a study conducted by Huttenlocher and colleagues (2002), which showed that preschool teachers who use a greater proportion of multi-clause sentences and a greater number of noun phrases per sentence have students who — one year later — demonstrate greater syntactic growth. Dissecting the relations between specific input features and specific linguistic constructs is likely to be a fruitful direction for future work, not one that should be left behind.

The ‘less is eventually more’ account takes the contrast of mature/immature processing resources in a highly productive direction, but children’s limitations need to be considered alongside what we know about the origins of detecting language structure in infancy. Infants may not be better than adults at finding structure in patterned input, but they do receive overwhelmingly more practice. With this practice comes the opportunity to ‘miss’ information and learn from mistakes. Newborns miss everything, toddlers mis-predict, and children mis-analyze, because child-directed input is fast, sometimes imprecise, and often novel. While systematic mis-parsing may be destructive for the learner in some ways, erroneous processing — broadly construed — also helps create change to the system (Chang, Dell, & Bock, 2006). The ‘less is eventually more’ account speculates that these limitation-based errors help young children avoid damage and ultimately shine. But the extended time-scale (the ‘eventualness’) of the account will benefit by extending its reach back to infancy as the starting point of language learning. Infants and toddlers receive lots of language input, they digest its structure gradually over time, and they confront and learn from erroneous predictions at every turn.

Throughout their discussion, P&E suggest a need for more research that directly compares child and adult learners, more evaluations of the real-world relevance of lab findings, and more focus on the nature of young children’s predictive abilities. But they intentionally and wisely withhold suggestions for methodological outreach. Their main point is to understand how language is learned in real time, and to let the field deploy its machinery as it wishes.

References

  1. Arnon I, Ramscar M. Granularity and the acquisition of grammatical gender: How order-of-acquisition affects what gets learned. Cognition. 2012;122:292–305. doi: 10.1016/j.cognition.2011.10.009. [DOI] [PubMed] [Google Scholar]
  2. Chang F, Dell GS, Bock K. Becoming syntactic. Psychological Review. 2006;113:234–272. doi: 10.1037/0033-295X.113.2.234. [DOI] [PubMed] [Google Scholar]
  3. Conway CM, Bauernschmidt A, Huang SS, Pisoni DB. Implicit statistical learning in language processing: Word predictability is the key. Cognition. 2010;114:356–371. doi: 10.1016/j.cognition.2009.10.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Fernald A, Thorpe K, Marchman VA. Blue car, red car: Developing efficiency in online interpretation of adjective–noun phrases. Cognitive Psychology. 2010;60:190–217. doi: 10.1016/j.cogpsych.2009.12.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Fernald A, Zangl R, Portillo A, Marchman VA. Looking while listening: Using eye movements to monitor spoken language comprehension by infants and young children. In: Sekerina IA, Fernández EM, Clahsen H, editors. Developmental psycholinguistics: Online methods in children’s language processing. Amsterdam: John Benjamins; 2008. pp. 97–135. [DOI] [Google Scholar]
  6. Frank MC, Tenenbaum JB, Gibson E. Learning and long-term retention of large-scale artificial languages. PLoS One. 2013;8:e52500. doi: 10.1371/journal.pone.0052500. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Graf Estes KM, Evans J, Alibali MW, Saffran JR. Can infants map meaning to newly segmented words? Statistical segmentation and word learning. Psychological Science. 2007;18:254–260. doi: 10.1111/j.1467-9280.2007.01885.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Hart B, Risley TR. Meaningful differences in the everyday experience of young American children. Baltimore, MD: Brookes; 1995. [Google Scholar]
  9. Hay JF, Pelucchi B, Graf Estes K, Saffran JR. Linking sounds to meanings: Infant statistical learning in a natural language. Cognitive Psychology. 2011;63:93–106. doi: 10.1016/j.cogpsych.2011.06.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Hoff E, Naigles L. How children use input to acquire a lexicon. Child Development. 2002;73:418–433. doi: 10.1111/1467-8624.00415. [DOI] [PubMed] [Google Scholar]
  11. Huttenlocher J, Vasilyeva M, Cymerman E, Levine S. Language input and child syntax. Cognitive Psychology. 2002;45:337–374. doi: 10.1016/S0010-0285(02)00500-5. [DOI] [PubMed] [Google Scholar]
  12. Huttenlocher J, Waterfall H, Vasilyeva M, Vevea J, Hedges LV. Sources of variability in children’s language growth. Cognitive Psychology. 2010;61:343–365. doi: 10.1016/j.cogpsych.2010.08.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Kurumada C, Meylan SC, Frank MC. Zipfian frequency distributions facilitate word segmentation in context. Cognition. 2013;127:439–453. doi: 10.1016/j.cognition.2013.02.002. [DOI] [PubMed] [Google Scholar]
  14. Lany J. Judging words by their covers and the company they keep: Probabilistic cues support word learning. Child Development. 2014;85:1727–1739. doi: 10.1111/cdev.12199. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Lew-Williams C, Fernald A. Young children learning Spanish make rapid use of grammatical gender in spoken word recognition. Psychological Science. 2007;18:193–198. doi: 10.1111/j.1467-9280.2007.01871.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Lew-Williams C, Pelucchi B, Saffran JR. Isolated words enhance statistical language learning in infancy. Developmental Science. 2011;14:1323–1329. doi: 10.1111/j.1467-7687.2011.01079.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Marcus GF, Fernandes KJ, Johnson SP. Infant rule learning facilitated by speech. Psychological Science. 2007;18:387–391. doi: 10.1111/j.1467-9280.2007.01910.x. [DOI] [PubMed] [Google Scholar]
  18. Misyak JB, Christiansen MH, Tomblin JB. Sequential expectations: The role of prediction-based learning in language. Topics in Cognitive Science. 2010;2:138–153. doi: 10.1111/j.1756-8765.2009.01072.x. [DOI] [PubMed] [Google Scholar]
  19. Phillips C, Ehrenhofer L. The role of language processing in language acquisition. Linguistic Approaches to Bilingualism. 2015;5 [Google Scholar]
  20. Rowe ML. Child-directed speech: Relation to socioeconomic status, knowledge of child development and child vocabulary skill. Journal of Child Language. 2008;35:185–205. doi: 10.1017/S0305000907008343. [DOI] [PubMed] [Google Scholar]
  21. Weisleder A, Fernald A. Talking to children matters: Early language experience strengthens processing and builds vocabulary. Psychological Science. 2013;20:1–10. doi: 10.1177/0956797613488145. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES