Abstract
Infants have been described as ‘statistical learners’ capable of extracting structure (such as words) from patterned input (such as language). Here, we investigated whether prior knowledge influences how infants track transitional probabilities in word segmentation tasks. Are infants biased by prior experience when engaging in sequential statistical learning? In a laboratory simulation of learning across time, we exposed 9- and 10-month-old infants to a list of either bisyllabic or trisyllabic nonsense words, followed by a pause-free speech stream composed of a different set of bisyllabic or trisyllabic nonsense words. Listening times revealed successful segmentation of words from fluent speech only when words were uniformly bisyllabic or trisyllabic throughout both phases of the experiment. Hearing trisyllabic words during the pre-exposure phase derailed infants’ abilities to segment speech into bisyllabic words, and vice versa. We conclude that prior knowledge about word length equips infants with perceptual expectations that facilitate efficient processing of subsequent language input.
Keywords: statistical learning, infant language learning, word segmentation, transfer, prior experience
1. Introduction
Smith (2000) describes learning as a historical process. Learners do not simply make connections between stimulus events. Instead, “the formation of initially simple associations changes what is attended to and, in doing so, changes what will be learned in the future” (p. 172). There have been many demonstrations of how learning occurs against the backdrop of what has been learned previously, across diverse domains including categorization (Lin & Murphy, 1997), concept learning (Wisniewski & Medin, 1994), shape learning (Smith, Jones, Landau, Gershkoff-Stowe, & Samuelson, 2002), scene perception (Biederman, Mezzanotte, & Rabinowitz, 1982), visual pattern detection (Thiessen, 2011), musical pattern detection (Tillman & Poulin-Charronnat, 2010), syntax learning (Lany & Gómez, 2008), and classroom education (Ennis, 1992). This approach to learning has also dominated research on second language learning for decades, yielding a highly robust and convergent finding: an individual’s first language knowledge shapes and constrains acquisition of a second language (e.g., Bates & MacWhinney, 1981; Lado, 1957; Lew-Williams & Fernald, 2010; Nemser, 1971). Here we applied this dynamic, temporal perspective on learning to a basic issue in early language acquisition, in order to refine what is currently known about the operation of mechanisms that support learning in infancy.
In the last 15 years, developmental researchers have become interested in ‘statistical learning,’ a mechanism argued to drive learning from the earliest months of life. Statistical learning refers to the domain-general ability to extract structure from patterned input (Fiser & Aslin, 2002; Kirkham, Slemmer, & Johnson, 2002; Romberg & Saffran, 2010). Much of the research on statistical language learning has investigated how infants learn to segment word-like units from continuous speech. Using artificial-language (Aslin, Saffran, & Newport, 1998; Saffran, Aslin, & Newport, 1996) and natural-language paradigms (Lew-Williams, Pelucchi, & Saffran, 2011; Pelucchi, Hay, & Saffran, 2009), studies have demonstrated that infants are sensitive to the regularity with which one syllable will transition to another syllable, known as transitional probability (TP). It is argued that infants’ general ability to detect statistical dependencies helps them identify where words begin and end in the ambient language, in tandem with many other cues specific to that language (Sahni, Seidenberg, & Saffran, 2010).
Researchers are only just beginning to understand how experience shapes infant statistical learning. For example, highly frequent lexical items such as ‘Mommy’ are easy for infants to detect in continuous speech (Bortfeld, Morgan, Golinkoff, & Rathbun, 2005; Singh, Nestor, & Bortfeld, 2008). Previous learning of stress patterns also influences segmentation skills: infants more readily segment unfamiliar words if they match the predominant stress pattern in the native language (Houston, Santelmann, & Jusczyk, 2004; Johnson & Jusczyk, 2001; Thiessen & Saffran, 2003), and shift their segmentation strategy based on exposure to novel stress patterns (Thiessen & Saffran, 2007). Moreover, adults’ phonological knowledge shapes the detection of new statistical structure. For example, Catalan (Toro, Pons, Bion, & Sebastián-Gallés, 2011) and English speakers (Finn & Hudson Kam, 2008) have difficulty segmenting words that violate subtle, language-specific rules for word formation. Saffran and Thiessen (2003) illuminated how this perceptual specialization may take shape in infancy. Given brief exposure to specific phonological patterns (artificial words with syllable-initial voiceless consonants, e.g., todkad), infants successfully segmented words that fit the pattern, but failed to segment words that violated expectations (words with syllable-initial voiced consonants, e.g., bupgok). Together, this research suggests that learners are biased by prior experiences with word frequency, stress, and phonology when determining word boundaries in new linguistic input.
A recent study with English-speaking adults took this idea in an important direction, showing that such biases can actually override attention to sequential statistics. Frank, Tily, Arnon, and Goldwater (2010) designed an artificial language that could be segmented into bisyllabic, trisyllabic, or hexasyllabic units. Crucially, only the hexasyllabic lexicon was supported by segmenting at locations with low TPs, but few adults actually segmented the language into these larger chunks. Most participants preferred bisyllabic or trisyllabic segmentations, imposing an apparent bias for shorter units. Indeed, English words are short relative to some languages, including Spanish: the 680 words included on the MacArthur-Bates Inventario del Desarrollo de Habilidades Comunicativas are a mean 2.7 syllables in length (vs. 1.7 syllables in English). In contrast, words in Mandarin Chinese are predominantly monosyllabic. Across time, experience with shorter vs. longer words may lead infants to form perceptual expectations for future language input, and thereby impact statistical learning from fluent speech.
The current study was designed to determine whether we could change infants’ expectations about where words begin and end in a statistical learning task. Does prior experience mediate the deployment of language learning mechanisms in infancy? Rather than manipulating the infants’ native language, we used a laboratory simulation of learning across time, in order to avoid the introduction of biases related to phonology and stress inherent in cross-linguistic comparisons. Infants were first pre-exposed to a list of nonsense words that were deterministically either bisyllabic or trisyllabic. This between-subjects manipulation was intended to induce a specific expectation about word length. Infants were then exposed to fluent speech generated using new nonsense words that were either bisyllabic or trisyllabic, with only sequential statistical cues to word boundaries. Thus the length of words heard during the pre-exposure phase was either consistent or inconsistent with the length of words embedded in fluent speech during the segmentation phase (Figure 1). Some infants heard words of uniform length throughout the experiment (bisyllabic or trisyllabic in both phases), while other infants were led down a “garden-path” of sorts, where initial expectations about word length did not match the statistical structure of the fluent speech stream (Gebhart, Aslin, & Newport, 2009). Infants were then tested by assessing their preference for repetitions of target words (TP=1.0) vs. frequency-matched part-words that spanned word boundaries (TP=.5).
If pre-exposure to words of a particular length has no effect upon subsequent statistical learning, the pattern of listening times should not differ as a function of whether the pre-exposure was consistent or inconsistent with the structure of the fluent speech. That is, hearing a list of trisyllabic words should not impact infants’ ability to detect bisyllabic words in fluent speech, or vice versa. However, if word-length expectations do influence statistical learning, listening times to words and part-words should vary across the consistent and inconsistent conditions. Pre-exposure to trisyllabic words should facilitate infants’ detection of trisyllabic words relative to bisyllabic words in fluent speech. Similarly, pre-exposure to bisyllabic words should facilitate detection of bisyllabic words relative to trisyllabic words. This outcome would indicate that prior experience shapes statistical learning.
2. Method
2.1. Participants
Participants were 96 healthy, full-term infants (43 female) from monolingual English-speaking households. Infants ranged in age from 9.0 to 10.9 months (M=10.0 months), matching the approximate age of participants in previous studies on infant word segmentation using a similar three-stage task (Saffran & Thiessen, 2003; Thiessen & Saffran, 2007). Parents reported no pervasive developmental delays and no history of hearing or visual problems. Twenty-four infants were randomly assigned to one of four between-subjects conditions. Forty additional infants were tested but not included in analyses due to fussiness (29), inattentiveness (6), experimenter error (3), or equipment malfunction (2). This rate of participant exclusion was expected due to the length of the experimental session relative to past artificial-language studies.
2.2. Stimuli
In the Pre-exposure Phase, infants heard a list of 30 nonsense words, each separated by a 300-ms pause. Words were either bisyllabic (CVCV; e.g., boga…giku…) or trisyllabic (CVCVCV; e.g., bogapi…gikutu…), depending on condition. Pre-exposure words were bisyllabic in the Consistent2-2 and Inconsistent2-3 conditions, and trisyllabic in the Consistent3-3 and Inconsistent3-2 conditions; subscripts indicate the number of syllables in pre-exposure and target words (Figure 1). All speech stimuli were recorded in citation form by a female native English speaker and presented at approximately 67 dBSPL.
During the Segmentation Phase, infants heard pause-free speech consisting of 4 new nonsense words, repeated in random order. Two of the words occurred 35 times each (low-frequency words) and 2 occurred 70 times each (high-frequency words). This phase was modeled after Aslin et al. (1998), except that frequency of exposure to each target word was reduced by 22.2% to increase the potential impact of the pre-exposure manipulation. Target words were bisyllabic in the Consistent2-2 and Inconsistent3-2 conditions, and trisyllabic in the Consistent3-3 and Inconsistent2-3 conditions. Successive syllables within all target words had TPs of 1.0. There were 2 counterbalanced bisyllabic languages and 2 counterbalanced trisyllabic languages, such that each test item was a word for half of the infants and a part-word for the other half (see Appendix). To approximate the operations of a speech synthesizer and maintain natural coarticulation, we followed the speech recording procedures used in Graf Estes, Evans, Alibali, and Saffran (2007, Experiment 2). We recorded each target syllable in the middle of a 3-syllable sequence, in every possible coarticulation context. Middle syllables were spliced together into a fluent speech stream with consistent rate (3.1 syllables/sec) and pitch (F0=196 Hz); there were no acoustic cues to word boundaries.
In the Test Phase, infants heard repetitions of a single item on each trial, separated by 300-ms pauses. Test items for each infant were either all bisyllabic (Consistent2-2 and Inconsistent3-2 conditions) or all trisyllabic (Consistent3-3 and Inconsistent2-3 conditions), matching the segmentation materials. Test items consisted of 2 low-frequency target words (TP = 1.0), and 2 frequency-matched part-words, consisting of the last syllable of one high-frequency word and the beginning of the other high-frequency word (TP=.5 for bisyllabic items; TP=0.5 and 1.0 between successive syllable pairs for trisyllabic items).
2.3. Procedure
The experiment consisted of 3 phases: Pre-exposure (bisyllabic, 33 sec; trisyllabic, 40 sec), Segmentation (bisyllabic, 2:12; trisyllabic, 3:18), and Test (~3 min), all using the Headturn Preference Procedure. Infants were seated on parents’ laps; parents listened to unrelated speech over closed-ear headphones. During the pre-exposure phase, infants heard a list of nonsense words. During the segmentation phase, infants heard a stream of continuous speech; blinking side-lights were flashed contingent on looking behavior. The experimenter coded head-turns using custom-designed MATLAB software (R2010b, Mathworks, Inc.). During the test, a trial was initiated when infants looked to a flashing center light, at which point the experimenter extinguished the center light and one of the side-lights began flashing. When infants looked at the side-light, a token of a word or part-word was repeated until infants looked away for 2 s or 20 s elapsed. The experimenter was blind to test-item presentation. Each word and part-word was repeated on 3 trials, randomized by block, for a total of 12 test trials.
3. Results
Our initial analysis was designed to ensure that the length of words in the speech stream (bisyllabic vs. trisyllabic) did not influence ease of segmentation. Looking times were analyzed in a 2x2 mixed ANOVA with segmentation language (bisyllabic, triyllabic) as a between-subjects factor, and item (word, part-word) as a within-subjects factor. Neither the main effect of segmentation language [F(1,94)=2.9, p=.09, ηp2=.03] nor the interaction between segmentation language and item [F(1,94)=.07, p=.80, ηp2=.001] were significant (see Figure 2). This indicates that, independent of pre-exposure, the speech streams made up of trisyllabic words were not inherently harder to learn than the bisyllabic streams, and vice versa. We thus combined the two Consistent conditions and the two Inconsistent conditions in the following analysis.
The main analysis tested the hypothesis that infants would succeed in discriminating words from part-words only when the word lengths in the segmentation materials were consistent with the word lengths heard during pre-exposure. Mean looking times were analyzed in a 2x2 mixed ANOVA with consistency (Consistent, Inconsistent) as a between-subjects factor and item (word, part-word) as a within-subjects factor. Looking times revealed a main effect of item [F(1,94)=4.21, p=.043, ηp2=.043]: infants looked significantly longer on word trials than part-word trials. There was also a significant main effect of consistency [F(1,94)=5.40, p=.022, ηp2=.054], indicating that infants looked longer when word lengths matched across the pre-exposure and segmentation phases. Crucially, there was a significant interaction between test item and consistency: F(1,94)=4.69, p=.033, ηp2=.048. Bonferroni-corrected post-hoc comparisons (p<.025) revealed that infants in the Inconsistent conditions looked equally to words and part-words [M=7.03 sec for words; M=7.05 for part-words; t(47)=.09, p=.928, d=.01], while infants in the Consistent conditions looked significantly longer to words than to part-words [M=8.73 for words; M=7.69 for part-words; t(47)=2.70, p=.010, d=.39]. These results suggest that only infants in the Consistent conditions were able to discriminate words from part-words. Hearing a list of words during the pre-exposure phase facilitated the detection of TPs only when those words provided an accurate template for the analysis of subsequent language input.
4. Discussion
Across conditions, infants heard a continuous speech stream with statistical structure that supported the segmentation of target words. However, pre-exposure to a list of either bisyllabic or trisyllabic words changed the course of learning. Infants only showed successful discrimination between words and part-words when pre-exposure and target words were consistent in length. When word lengths were inconsistent across phases of the experiment, infants did not successfully use TPs to discriminate between test items. Our results thus provide evidence that expectations about word length shaped infants’ processing of statistical cues to word boundaries. These findings are revealing about the manner in which prior experience and new learning interact. Obviously, real language environments do not provide such overt and deterministic cues to word length, and infants do not have fixed expectations about word length. Instead, infants become gradually attuned to the structure inherent in the global language input. Prior experience with the distribution of word lengths in the native language should facilitate subsequent analyses of input, in tandem with knowledge of phoneme distributions (Christiansen, Onnis, & Hockema, 2009), lexical stress (Johnson & Jusczyk, 2001), and highly frequent words (Bortfeld et al., 2005).
Why did expectations about word length change infants’ sensitivity to statistical relations? One possibility is that top-down knowledge led infants to seek words of a particular length in fluent speech. Infants may have attempted, but ultimately failed, to divide a trisyllabic language into bisyllabic words, or vice versa. However, it is not necessary to posit that infants derive word structure so actively. Swingley (2005) questioned the idea that infants insert dividing points into unsegmented speech, suggesting instead that infants cluster syllables that co-occur reliably. By studying how prior word knowledge influences the detection of syllable clusters, our results support an integration of these two descriptions of segmentation: during the pre-exposure phase, infants may have internalized a two- or three-part rhythm, which interacted with their perception of transition statistics. When TP-defined word boundaries aligned with this rhythmic expectation, infants used convergent evidence from syllabic co-occurrence to accurately segment words. But a mismatch derailed successful detection of syllable clusters. Computational, adult, and developmental data indicate that both top-down expectations and bottom-up processes are inevitably involved in learning (Bortfeld et al., 2005; Lew-Williams et al., 2011; Reber, 1989; Sun & Zhang, 2004). Our experiment design illuminates how an interaction between past and present learning may operate as infants build knowledge of structure in their linguistic environments.
Our results converge with and advance recent findings in research on statistical learning. Johnson and Tyler (2010) showed that infants have difficulty using TPs to segment continuous speech when the component words vary in length. Similarly, infants in our study who heard words of inconsistent lengths across the two learning phases of the experiment were not successful in segmenting continuous speech. These findings could indicate that statistical learning mechanisms cannot account for the variability inherent in real language, leading to catastrophic interference when the structure of the input diverges across time. Alternatively, our findings suggest an efficient processing strategy: infants used prior language experience to quickly narrow in on the most accurate analysis. This perspective is compatible with a growing body of research suggesting that learners do not blindly attend to transition statistics (Frank & Tenenbaum, 2011; Giroux & Rey, 2009). Bayesian approaches to word segmentation posit that learning is constrained by prior assumptions about the nature of words (Goldwater, Griffiths, & Johnson, 2009). Our experiment provides a snapshot of this learning process in infant learners. Pre-exposure to a list of bisyllabic or trisyllabic words induced a prior, or learning bias, that infants adopted when processing subsequent language input. This process mirrors the fine-tuning of infants’ phonological perception to sounds in the ambient language over the first year (Werker & Tees, 1084), and supports an experimentally provocative description of learning by Smith (2000): “we learn about what we attend to, and we learn what to attend to” (p. 170). A complete theory of statistical learning will need to account for how infants accumulate language experience and update word knowledge.
Future research should aim to uncover how cross-linguistic differences in word length guide statistical learning in infancy. Multisyllabic words are more common in Spanish than in English (Cutler & Carter, 1997; Roark & Demuth, 2000), and the consequences of this difference emerge early in language production: infants exposed to Spanish produce more multisyllabic babbles than infants with no Spanish exposure (Ward, Sundara, Conboy, & Kuhl, 2009). Our findings suggest that similar effects will emerge early in language comprehension. Understanding how the distribution of word length and other types of variability in home-language experience shape subsequent learning will be important for analyzing the biases that infants bring to lab-based experiments, and for uncovering how learning unfolds over time.
Acknowledgments
This research was funded by grants from the Eunice Kennedy Shriver National Institute of Child Health and Human Development to CLW (F32HD069094), JRS (R01HD037466), and the Waisman Center (P30HD03352). Additional support was provided by a grant from the James F. McDonnell Foundation to JRS. Thanks to the participating families and the members of the Infant Learning Lab. We are also grateful to Jon Willits and Erik Thiessen for feedback on an earlier version of this manuscript.
Appendix
Segmentation Phase
| |||
---|---|---|---|
Bisyllabic
|
Trisyllabic
|
||
Language 1 | Language 2 | Language 1 | Language 2 |
baku* | pido* | pabiku* | tudaro* |
tiro* | lagu* | tibudo* | pigola * |
dola** | roba** | golatu** | bikuti** |
gupi** | kuti** | daropi** | budopa** |
=low-frequency
=high-frequency
Test Phase
| |||
---|---|---|---|
Bisyllabic
|
Trisyllabic
|
||
Language 1 | Language 2 | Language 1 | Language 2 |
baku* | baku** | pabiku* | pabiku** |
tiro* | tiro** | tibudo* | tibudo** |
pido** | pido* | tudaro** | tudaro* |
lagu** | lagu* | pigola** | pigola* |
=word
=part-word
Pronunciation: /a/=‘ah’; /i/=‘ee’; /o/=‘oh’; /u/=‘oo.’
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- Aslin RN, Saffran JR, Newport EL. Computation of conditional probability statistics by 8- month- old infants. Psychological Science. 1998;9:321–324. [Google Scholar]
- Bates E, MacWhinney B. Second language acquisition from a functionalist perspective: Pragmatic, semantic, and perceptual strategies. In: Winitz H, editor. Native language and foreign language acquisition (Annals of the New York Academy of Sciences, No. 379) New York, NY: New York Academy of Sciences; 1981. pp. 190–214. [Google Scholar]
- Biederman I, Mezzanotte RJ, Rabinowitz JC. Scene perception: Detection and judging objects undergoing relational violations. Cognitive Psychology. 1982;14:143–177. doi: 10.1016/0010-0285(82)90007-x. [DOI] [PubMed] [Google Scholar]
- Bortfeld H, Morgan JL, Golinkoff RM, Rathbun K. Mommy and me: Familiar names help launch babies into speech-stream segmentation. Psychological Science. 2005;16:298–304. doi: 10.1111/j.0956-7976.2005.01531.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Christiansen MH, Onnis L, Hockema SA. The secret is in the sound: From unsegmented speech to lexical categories. Developmental Science. 2009;12:388–395. doi: 10.1111/j.1467-7687.2009.00824.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cutler A, Carter D. The predominance of strong initial syllables in the English vocabulary. Computer Speech and Language. 1987;2:133–142. [Google Scholar]
- Ennis CD. Reconceptualizing learning as a dynamical system. Journal of Curriculum and Supervision. 1992;7:115–130. [Google Scholar]
- Finn AS, Hudson Kam CL. The curse of knowledge: First language knowledge impairs adult learners’ use of novel statistics for word segmentation. Cognition. 2008;108:477–499. doi: 10.1016/j.cognition.2008.04.002. [DOI] [PubMed] [Google Scholar]
- Fiser J, Aslin RN. Statistical learning of higher-order temporal structure from visual shape sequences. Journal of Experimental Psychology: Learning, Memory, and Cognition. 2002;28:458–467. doi: 10.1037//0278-7393.28.3.458. [DOI] [PubMed] [Google Scholar]
- Frank MC, Tenenbaum JB. Three ideal observer models for rule learning in simple languages. Cognition. 2011 doi: 10.1016/j.cognition.2010.10.005. Manuscript in press. [DOI] [PubMed] [Google Scholar]
- Frank MC, Tily H, Arnon I, Goldwater S. Beyond transitional probabilities: Human learners apply a parsimony bias in statistical word segmentation. Proceedings of the 32nd Annual Meeting of the Cognitive Science Society.2010. [Google Scholar]
- Gebhart AL, Aslin RN, Newport EL. Changing structures in midstream: Learning Along the Statistical Garden Path. Cognitive Science. 2009;33:1087–1116. doi: 10.1111/j.1551-6709.2009.01041.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Giroux I, Rey A. Lexical and sublexical units in speech perception. Cognitive Science. 2009;33:260–272. doi: 10.1111/j.1551-6709.2009.01012.x. [DOI] [PubMed] [Google Scholar]
- Goldwater S, Griffiths TL, Johnson M. A Bayesian framework for word segmentation: Exploring the effects of context. Cognition. 2009;112:21–54. doi: 10.1016/j.cognition.2009.03.008. [DOI] [PubMed] [Google Scholar]
- Graf Estes KM, Evans J, Alibali MW, Saffran JR. Can infants map meaning to newly segmented words? Statistical segmentation and word learning. Psychological Science. 2007;18:254–260. doi: 10.1111/j.1467-9280.2007.01885.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Houston DM, Santelmann LM, Jusczyk PW. English-learning infants’ segmentation of trisyllabic words from fluent speech. Language and Cognitive Processes. 2004;19:97–136. [Google Scholar]
- Johnson EK, Jusczyk PW. Word segmentation by 8-month-olds: When speech cues count more than statistics. Journal of Memory and Language. 2001;44:548–567. [Google Scholar]
- Johnson EK, Tyler MD. Testing the limits of statistical learning for word segmentation. Developmental Science. 2010;13:339–345. doi: 10.1111/j.1467-7687.2009.00886.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kirkham NZ, Slemmer JA, Johnson SP. Visual statistical learning in infancy: Evidence for a domain general learning mechanism. Cognition. 2002;83:B35–B42. doi: 10.1016/s0010-0277(02)00004-5. [DOI] [PubMed] [Google Scholar]
- Lado R. Linguistics across cultures: Applied linguistics for language teachers. University of Michigan Press; Ann Arbor: 1957. [Google Scholar]
- Lany J, Gómez RL. Twelve-month-old infants benefit from prior experience in statistical learning. Psychological Science. 2008;19:1247–1252. doi: 10.1111/j.1467-9280.2008.02233.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lew-Williams C, Fernald A. Real-time processing of gender-marked articles by native and non-native Spanish speakers. Journal of Memory and Language. 2010;63:447–464. doi: 10.1016/j.jml.2010.07.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lew-Williams C, Pelucchi B, Saffran JR. Isolated words enhance statistical language learning in infancy. Developmental Science. 2011 doi: 10.1111/j.1467-7687.2011.01079.x. Manuscript in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lin EL, Murphy GL. Effects of background knowledge on object categorization and part detection. Journal of Experimental Psychology: Human Perception and Performance. 1997;23:1153–1169. [Google Scholar]
- Nemser W. Approximative systems of foreign language learners. International Review of Applied Linguistics. 1971;9:115–124. [Google Scholar]
- Pelucchi B, Hay JF, Saffran JR. Statistical learning in a natural language by 8- month-old infants. Child Development. 2009;80:674–685. doi: 10.1111/j.1467-8624.2009.01290.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Reber A. Implicit learning and tacit knowledge. Journal of Experimental Psychology: General. 1989;118:219–235. [Google Scholar]
- Roark B, Demuth K. Prosodic constraints and the learner’s environment: A corpus study. In: Howell SC, Fish SA, Keith-Lucas T, editors. Proceedings of the 24th Annual Conference on Language Development. Somerville MA: Cascadilla Press; 2000. pp. 597–608. [Google Scholar]
- Romberg AR, Saffran JR. Statistical learning and language acquisition. Wiley Interdisciplinary Reviews: Cognitive Science. 2010;1:906–914. doi: 10.1002/wcs.78. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Saffran JR, Aslin RN, Newport EL. Statistical learning by eight-month old infants. Science. 1996;274:1926–1928. doi: 10.1126/science.274.5294.1926. [DOI] [PubMed] [Google Scholar]
- Saffran JR, Thiessen ED. Pattern induction by infant language learners. Developmental Psychology. 2003;39:484–494. doi: 10.1037/0012-1649.39.3.484. [DOI] [PubMed] [Google Scholar]
- Sahni SD, Seidenberg MS, Saffran JR. Connecting cues: Overlapping regularities support cue discovery in infancy. Child Development. 2010;81:727–736. doi: 10.1111/j.1467-8624.2010.01430.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Singh L, Nestor SS, Bortfeld H. Overcoming the effects of variation in infant speech segmentation: Influences of word familiarity. Infancy. 2008;13:57–74. doi: 10.1080/15250000701779386. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith LB. Avoiding association when it’s behaviorism you really hate. In: Golinkoff R, Hirsh-Pasek K, editors. Breaking the word learning barrier. Oxford: Oxford University Press; 2000. pp. 169–174. [Google Scholar]
- Smith LB, Jones SS, Landau B, Gershkoff-Stowe L, Samuelson L. Object name learning provides on-the-job training for attention. Psychological Science. 2002;13:13–19. doi: 10.1111/1467-9280.00403. [DOI] [PubMed] [Google Scholar]
- Sun R, Zhang X. Top-down versus bottom-up learning in cognitive skill acquisition. Cognitive Systems Research. 2004;5:63–89. [Google Scholar]
- Swingley D. Statistical clustering and the contents of early vocabulary. Cognitive Psychology. 2005;50:86–132. doi: 10.1016/j.cogpsych.2004.06.001. [DOI] [PubMed] [Google Scholar]
- Thiessen ED. Domain general constraints on statistical learning. Child Development. 2011;82:462–470. doi: 10.1111/j.1467-8624.2010.01522.x. [DOI] [PubMed] [Google Scholar]
- Thiessen ED, Saffran JR. When cues collide: Use of stress and statistical cues to word boundaries by 7- to 9-month-old infants. Developmental Psychology. 2003;39:706–716. doi: 10.1037/0012-1649.39.4.706. [DOI] [PubMed] [Google Scholar]
- Thiessen ED, Saffran JR. Learning to learn: Infants’ acquisition of stress-based strategies for word segmentation. Language Learning and Development. 2007;3:73–100. [Google Scholar]
- Tillman B, Poulin-Charronnat B. Auditory expectations for newly acquired structures. The Quarterly Journal of Experimental Psychology. 2010;63:1646–1664. doi: 10.1080/17470210903511228. [DOI] [PubMed] [Google Scholar]
- Toro JM, Pons F, Bion RAH, Sebastián-Gallés N. The contribution of language-specific knowledge in the selection of statistically-coherent word candidates. Journal of Memory and Languae. 2011;64:171–180. [Google Scholar]
- Ward N, Sundara M, Conboy B, Kuhl P. Consequences of short-term language exposure in infancy on babbling. The Journal of the Acoustical Society of America. 2009;126:2311. [Google Scholar]
- Werker JF, Tees RC. Cross-language speech perception: Evidence for perceptual reorganization during the first year of life. Infant Behavior and Development. 1984;7:49–63. [Google Scholar]
- Wisniewski EJ, Medin DL. On the interaction of theory and data in concept learning. Cognitive Science. 1994;18:221–281. [Google Scholar]