Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2016 Jan 1.
Published in final edited form as: Child Dev. 2014 Jul 14;86(1):10–22. doi: 10.1111/cdev.12269

From flexibility to constraint: The contrastive use of lexical tone in early word learning

Jessica F Hay 1, Katharine Graf Estes 2, Tianlin Wang 3, Jenny R Saffran 4
PMCID: PMC4295000  NIHMSID: NIHMS581103  PMID: 25041105

Abstract

Infants must develop both flexibility and constraint in their interpretation of acceptable word forms. The current experiments examined the development of infants’ lexical interpretation of non-native variations in pitch contour. Fourteen, 17-, and 19-month-olds (Experiments 1 and 2, N = 72) heard labels for two novel objects; labels contained the same syllable produced with distinct pitch contours (Mandarin lexical tones). The youngest infants learned the label-object mappings, but the older groups did not, despite being able to discriminate pitch differences in an object-free task (Experiment 3, N = 14). Results indicate that 14-month-olds remain flexible regarding what sounds make meaningful distinctions between words. By 17–19 months, experience with a non-tonal native language constrains infants’ interpretation of lexical tone.


A fundamental challenge facing novice language learners is to determine which sounds in the ambient environment carry meaning. The complexity of this problem is lost on most adult language users, who effortlessly reject a wide range of non-speech sounds as labels for objects. For example, the “vrrmm” of a vacuum cleaner, the “achoo” of a sneeze, and the “tsk tsk” in response to a naughty act would not be considered potential lexical items. Infants, on the other hand, must learn to identify which sounds in their environments are lexically relevant and which sounds are not and thus should be rejected as potential labels.

A great deal of research has suggested that infants start out open-minded about what makes an acceptable word form. They then display appropriate, language-specific narrowing regarding the kinds of units they are willing to treat as potential object labels. For example, early word learners, around 13 to 17 months of age, readily map words (e.g., toma) and non-speech sounds (e.g., a squeak, Woodward & Hoyne, 1999; a two-tone beep, Namy, 2001) to novel objects when they are presented during interactive labeling events. They also associate non-verbal mouth noises (e.g. psst, Hirsh-Pasek, Golinkoff, & Hollich, 2000) as well as gestures and pictograms (Namy, 2001) with objects. Early flexibility in what counts as a word may prove to be adaptive for subsequent word learning, as infants do not know a priori which sounds and symbols in their environment are lexically relevant.

As infants get older, they amass knowledge of the sound repertoire of their native language, and demonstrate increasing sensitivity to native word forms. For example, older infants (around 20 months and above) often fail to map non-speech symbols to objects (Namy & Waxman, 1998; Woodward & Hoyne, 1999). This change in what infants consider to be acceptable word forms is appropriate and useful, as these variations are not used contrastively in the infants’ native language.

The narrative about narrowing in early word learning is quite complex, however. In contrast to the literature reviewed above, recent research suggests that sophisticated constraints on what infants count as a word may emerge very early in development. MacKenzie, Graham, and Curtin (2011) demonstrated that 12-month-olds privilege words as object labels over other vocally produced non-lexical sounds in non-social associative learning tasks (see MacKenzie et al., 2011, for a discussion of methodological considerations). Infants successfully learned label-object mappings when the labels were two English nonsense CVC words (fep and wug), but not when they were consonant- or vowel-based communicative sounds (mmm and shhh; oooh and aaah), or non-communicative sounds (/l/ and /ʒ/). MacKenzie, Curtin, and Graham (2012) also demonstrated that 12-month-olds are constrained by phonotactic regularities in their native language. Infants mapped non-native words that were phonotactically legal in English (e.g., mido and hashi) to novel objects, but not words that violated English phonotactic regularities (e.g., ptak and svet, which contain word onset consonant clusters that never occur in English). These studies suggest that even the youngest word learners have some basic understanding of acceptable word forms.

Languages use many ways to create acceptable word forms and to mark distinctions between words. For example, speakers can change many word meanings by simply altering a single segmental element (e.g., ball → tall; bat → bit). Thus, one might expect infants to be particularly sensitive to this type of lexically contrastive distinction early on. Numerous studies, however, have demonstrated that 14-month-olds fail to map minimal pair nonsense words (e.g., /bI/-/dI/; /pIn/ - /bIn/; /pIn/ - /dIn) to novel objects (e.g., Stager & Werker, 1997; Pater, Stager, & Werker, 2004), even though both place of articulation (e.g., /b/-/d/) and voicing (e.g., /b/-/p/) distinctions are lexically contrastive in English, and infants can readily discriminate them when no objects are present (Stager & Werker, 1997). A number of theories have been proposed to account for 14-month-olds’ performance in these minimal pair label-object mapping tasks, including capacity limitations (e.g., Werker & Curtin, 2005; see also Fennell, 2012; Yoshida, Fennell, Swingley, & Werker, 2009), lack of social/referential support (e.g., Fennell & Waxman, 2010), and insufficiently robust phonological representations (Apfelbaum & McMurray, 2011; Galle, Apfelbaum, & McMurray, 2013; Rost & McMurray, 2009, 2010). Nevertheless, even without referential support, by around 17 months, infants succeed in learning label pairs that differ by a single phoneme (e.g., Werker, Fennell, Corcoran, & Stager, 2002). Thus, it appears that language experience helps infants appropriately interpret phoneme distinctions.

In addition to segmental contrasts, many languages use lexical tones or pitch contours, which are suprasegmental features, to form lexical contrasts. Languages that use pitch contour contrastively are called tone languages (e.g., Mandarin Chinese, Cantonese, Thai). They comprise 60–70% of the world’s languages (Yip, 2002), and are spoken by over 50% of the world’s population (Fromkin, 1978). Lexical tones typically reflect variations in the level or contour of the fundamental frequency (F0; pitch) within a single syllable (e.g., Burnham & Mattock, 2007; Liu & Samuel, 2004), as well as additional acoustic correlates of physical duration, F2, voice quality, and amplitude. Figure 1 illustrates lexical tones that occur in Mandarin, the most widely spoken tone language. The four citation tones in Mandarin can be categorized as high level (T1), high rising (T2), low dipping (T3), and high falling (T4). For instance, /ma/ can mean ‘mother’, ‘hemp’, ‘horse’, and ‘to scold’ when carrying T1, T2, T3, and T4, respectively. Precisely how languages instantiate lexical tone varies from one language to the next, but in most tone languages, lexical tone is crucial for determining the meaning of a word.

Figure 1.

Figure 1

Mean F0 contour of four Mandarin tones in the /ma/ monosyllable produced in isolation. The time course is normalized, with all tones plotted with their average duration proportional to the average duration of T3 (Xu, 1997, p. 67, reprinted with permission).

In non-tone languages, like English, pitch also varies in meaningful ways. In English, pitch is commonly used to communicate intonational and emotional meaning (e.g., Bolinger, 1989; Hirschberg & Ward, 1992; Pierrehumbert, 1980; Ward & Hirschberg, 1985), highlight aspects of grammatical structure (e.g., Gussenhoven, 2004) and lexical stress (e.g., Fry, 1958), and differentiate talker identity, talker gender, and talker register (e.g., infant- versus adult-directed speech; Fernald, 1992). Although pitch conveys information and is a salient acoustic characteristic of the speech signal in English, it is not used contrastively for word meaning. Still, young English-learning infants are highly familiar with pitch variation in their speech input and show sensitivity to many of these systematic pitch variations (e.g., Fernald & Kuhl, 1987; Fernald, 1989; Fernald, 1992; Katz, Cohn, & Moore, 1996; Moore, Spence, & Katz, 1997). Moreover, the presence of lexically-irrelevant pitch variation can actually help infants to detect lexically-relevant phonemic differences (such as voice-onset-time) in minimal-pair word learning tasks (Galle, et al., 2013; Rost & McMurray, 2009, 2010).

Recent work by Quam and Swingley (2010) suggests that English-learning children and adults are unable to exploit contrastive pitch contours when learning new words, even when pitch contour is the only relevant lexical cue. Adults and 2.5-year-olds were taught a novel object-label pairing, where the label (deebo) was produced with a consistent and salient pitch contour. When tested, both adults and toddlers failed to notice a salient non-phonemic change in the word’s pitch contour (e.g., deebo with rising then falling contour was changed to deebo with level then falling contour, or vice versa). In contrast, they noticed a phonemic change in the word’s vowel (e.g., deebo to dahbo). Quam and Swingley suggested that children may not have interpreted salient pitch contour changes as lexical changes because by 2.5 years of age children have accumulated a great deal of evidence indicating that pitch contour variation is not lexically contrastive in English.

Thus, by 2.5 years of age, English-learning children show a constrained interpretation of pitch contour, namely that different pitch contours do not mark distinct word meanings. How much English-language experience is necessary for this constraint to develop? Are very young word learners, who have less experience with the functions of pitch variation, more flexible in their interpretation of distinctive pitch contours than more experienced word learners?

In the current set of experiments, we investigated the developmental trajectory of infants’ flexibility in early word learning by assessing infants’ willingness to accept lexical tones as lexically contrastive. Specifically, the focus of the current set of experiments was infants’ ability to map labels that differ only in lexical tone to novel objects. As mentioned previously, lexical tones are used in tone languages to minimally contrast words and typically reflect pitch variations within a single syllable. Lexical tones provide an informative test case for asking about infants’ flexibility in determining which sound differences point to meaningful distinctions between words. While English-learning infants experience extensive pitch variations in their speech input, a significant amount of language experience may be necessary before they can assign pitch variations to their appropriate (non-lexical) linguistic functions (see also Dietrich, Swingley, & Werker, 2007). In addition, the pitch contour variations in lexical tones are particularly acoustically salient compared to phonemic differences between many consonants. Thus, early in word learning, as English-learning infants are trying to discover which acoustic properties of sounds in the environment make lexical contrasts, the salience of pitch may prove more useful than later on, when infants have access to numerous other cues (e.g., voice-onset-time) pointing out lexical contrasts. Nevertheless, pitch contours represent a cue that English-learning infants must learn to ignore when interpreting the meanings of words, as they are largely lexically irrelevant in English.

This series of experiments investigated whether very young word learners have a constrained interpretation of pitch contours, or whether they are open-minded about the lexical relevance of pitch contours. As learners gather experience about their non-tonal native language, pitch contour may become a less lexically relevant cue. To begin addressing these issues, in Experiment 1 we examined flexibility in early word learning by testing whether 14-month-old infants can map meanings to labels that contrast only in lexical tone. It would be adaptive for infants to remain flexible during the early stages of word learning, as they do not know a priori which language environment they will be brought up in. Thus, we predicted that young word learners should readily map labels that contrast in lexical tone to novel objects.

Experiment 1

In Experiment 1, we assessed flexibility in early word learning by testing whether 14-month-olds associate novel labels with novel objects when the labels differ only in pitch contour (/kʊ/ with rising contour versus /kʊ/ with falling contour). We used a modified version of the Switch Paradigm, which is an associative learning task without contextual or referential support, to ensure that learning would not be carried solely by the social context of the testing situation (see MacKenzie et al., 2011, for a discussion of methodological considerations). We selected rising versus falling contours because previous research has suggested that across the first year, both tonal language- and non-tonal language-learning infants maintain the ability to discriminate between rising versus falling pitch contours in a non-word learning context (Mattock & Burnham, 2006). Finally, we chose to test 14-month-old infants because previous research suggests that infants around this age show both flexibility in what counts as a word (Woodward & Hoyne, 1999) and constraint in what sounds they find meaningfully contrastive (MacKenzie et al., 2011, 2012). Additionally, it is one of the youngest ages at which infants successfully learn object-label mappings in experimental tasks (Werker, Cohen, Lloyd, Casasola, & Stager, 1998; but see Curtin, 2009; MacKenzie et al., 2011; 2012). Although previous studies have suggested that 14-month-old infants have significant difficulty learning phonetically similar object labels (e.g., Stager & Werker, 1997; but see Yoshida et al., 2009), this is an age at which infants may be willing to interpret an acoustically salient, but lexically irrelevant cue, such as differences in pitch contour, as meaningfully distinctive.

Method

Participants

Twenty-four 14-month-old (mean age 14.2 months, range: 13.1–15.1; 12 female) monolingual English-learning infants from a mid-sized Midwestern American city participated in Experiment 1. All infants were born full-term, had fewer than five prior ear infections, no history of hearing or vision impairments, and no exposure to any tone languages. Data from 6 additional infants were excluded from the analysis because of fussiness (4), experimenter error (1), or current ear infection (1).

Stimuli

To create the labels, an adult female native speaker of Mandarin Chinese produced the monosyllabic nonsense word /kʊ/ with a rising pitch contour and a falling pitch contour. Figure 2 shows the spectrograms for both rising and falling /kʊ/ overlaid with the fundamental frequency (F0). Tokens were selected to have similar overall durations (856 ms for rising /kʊ/ and 867 ms for falling /kʊ/). Rising /kʊ/ and falling /kʊ/ both started with 143 ms of aspiration followed voicing for the remainder of the syllable. During the voicing portion, rising /kʊ/ had a frequency of 245 Hz at the start of voicing, which fell to 200 Hz over the first 130 ms of voicing and then rose to 290 Hz over the remaining 570 ms of the syllable. Falling /kʊ/ had a frequency of 320 Hz at the onset of voicing, which fell to 190 Hz over the remainder of voiced portion of the syllable. Twelve tokens were repeated with an ISI of 750 ms to create a 20 ms auditory stream.

Figure 2.

Figure 2

Spectrogram and pitch contour of falling /kʊ/ (left) and rising /kʊ/ (right).

As shown in Figure 3, the novel objects consisted of two distinct, multi-colored, 3-D images. These objects have been used successfully in a number of other label-object mapping studies (e.g., Graf Estes, Evans, Alibali, & Saffran, 2007; Hay, Pelucchi, Graf Estes, & Saffran, 2011). Images were displayed in the center of the vertical axis of a 19-inch computer monitor and moved slowly back and forth in a linear path across the computer screen. The movement of the images continued uninterrupted at a rate of six seconds per cycle and was not synchronized with the presentation of the object labels.

Figure 3.

Figure 3

Objects used in label-object association task.

Infants were randomly assigned to view object A paired with rising /kʊ/ and object B paired with falling /kʊ/, or vice versa. A level /la/ produced by the same Mandarin speaker served as a pre-test stimulus and was paired with a third novel object.

Procedure

Infants were seated on a parent’s lap in a sound attenuated booth, approximately 1 meter from a flat screen monitor. The stimuli were presented using Habit X 1.0 (Cohen, Atkinson, & Chaput, 2004). An observer viewed infants’ responses on a monitor and indicated looking times by pressing a button on the computer running Habit. To avoid potential bias, the observer was blind to the identity of the materials being presented and the parent listened to masking music over headphones.

In order to familiarize infants with the task and to help avoid erroneous habituation as a result of very long looks on the first trial, infants were first presented with a pre-test label and object pair: /la/ paired with a novel object. They were then habituated to the two lexical tone label-object pairs, presented one at a time, in random order. Each label-object combination played continuously while the infant looked at the screen, and terminated when the infant looked away for 1s or after a maximum looking time of 20s. After the infant looked away, a cartoon pinwheel played to recapture the infant’s attention. Once the infant’s attention was focused on the screen, the experimenter initiated the next trial. The habituation criterion was satisfied when looking time across 3 consecutive trials decreased to 50% of the average looking time across the first 3 trials.

Test trials began immediately after the infant habituated or reached the maximum cutoff of 25 trials. During Same test trials, the infant viewed the label-object combinations that were presented during the habituation phase. During Switch trials, the labels for the two objects were switched; for example, Object B occurred with Label A. There were four trials per block (two Same and two Switch trials) presented in four counterbalanced testing orders. Each block of test trials was presented twice for a total of eight test trials. Across the test trials, infants were presented with all possible combinations of labels and objects. The dependent variable was the looking time on Same versus Switch test trials. If infants learned the original label-object pairings, they should look longer during the Switch test trials in which the pairings are violated (e.g., Werker et al., 1998).

Results and Discussion

All infants met the habituation criterion. Preliminary analyses revealed that there were no main effects or interactions involving infant Sex, Trial Order, or Test Block; all subsequent analyses were performed collapsed across these variables. To examine whether infants successfully mapped rising /kʊ/ and falling /kʊ/ to different novel objects, we compared mean looking times on Same and Switch trials using a paired samples t-test (all t-tests 2-tailed; effect sizes reported for t-tests are Cohen’s d). The infants looked significantly longer to Switch (M = 8.23 s; SD = 3.59) than to Same (M = 6.63 s; SD = 2.95) trials, t(23) = 2.11, p =.046, d = .43, indicating that they learned the mapping between the labels and the object (see Figure 4).

Figure 4.

Figure 4

Results of Experiments 1 and 2: Average looking times (± 1SE) on Same and Switch trials for 14-, 19-, and 17-month-olds. * p < .05.

Our results suggest that although lexical tones are not used contrastively in English, 14-month-old infants are willing to map labels that differ only in pitch contour to novel objects. These findings are particularly surprising in light of the difficulty that 14-month-olds typically have mapping minimal pair words to novel objects when the words differ by a single phoneme (e.g., Stager & Werker, 1997), and given that our testing paradigm provided no referential support. The present results support the hypothesis that young learners are open to a variety of potential contrastive cues to word meaning.

One question that arises from the current study is why did our English-learning infants succeed in using lexical tones contrastively, when Quam and Swingley’s (2010) participants failed to notice changes in pitch contour in newly learned words? There are a number of differences between the two studies that may account for the discrepancy. First, there were substantial methodological differences between the studies. By presenting falling /kʊ/ with one object and rising /kʊ/ with a second object, we provided infants with distributional information that may have facilitated the contrastive use of lexical tone. Support for this idea comes from recent work by Yeung and Werker (2009) who found that the presence of co-occurring contextual cues, such as consistently pairing objects with particular phoneme distributions, facilitates differentiation between phonetic categories. Thus, providing rising pitch contour with one object and falling pitch contour with a second object may have helped the infants differentiate between the two types of pitch contours, thereby facilitating the label-object associations. Participants in Quam and Swingley’s (2010) experiments were not provided with the same type of distributional information. Instead, they were taught the label for a single object and thus did not receive information about the potential lexical relevance of pitch contour.

A second, and potentially more notable, difference between the two studies is the large age difference between the infants in our study (14 months) and Quam and Swingley’s (2010) participants (adults and 2.5-year-olds). Between age one and age 2.5, children gain tremendous experience with the regularities of their native language. It is possible that across the second year, infants learn to ignore or down weight variations in pitch contour while acquiring and interpreting words, because pitch contours are not lexically relevant in English. Consistent with this view, Dietrich, Swingley, and Werker (2007) have demonstrated cross-linguistic differences in infants’ willingness to treat vowel duration as meaningfully contrastive. Vowel duration, like pitch contour, is an acoustically salient phonetic property that is used to minimally distinguish words in some languages, like Dutch and Japanese, but not in other languages, like English and French. At 18 months of age Dutch infants, but not English-learning infants, readily map labels that differ minimally in vowel duration to novel objects.

It is thus unclear whether the results of Experiment 1 diverge from prior studies due to differences in the methodologies, or to differences in the age of the participants (and thus their degree of native language experience). To address these alternative hypotheses, Experiment 2 examined the developmental trajectory of infants’ willingness to use lexical tone contrastively.

Experiment 2

As infants gain language experience, they show increasing sensitivity to the structures and word forms of their native language. In Experiment 2, we replicated Experiment 1 with 19-month-olds, who are typically more successful at mapping minimal pair labels to novel objects than their younger peers (e.g., Werker et al., 2002), and who are generally becoming quite skilled at word learning (Goldfield & Reznick, 1990; Werker et al., 1998). If the difference in performance between 14-month-old infants in Experiment 1 and the 2.5-year-olds examined by Quam and Swingley (2010) is due to methodological differences in the availability of distributional cues, then we would expect older infants to demonstrate contrastive mapping of pitch contours to novel objects in our task. However, if native language experience with a non-tonal language is showing infants that pitch contours are not lexically relevant, then older infants should ignore the pitch contour of new words in our task. If this is the case, then 19-month-olds should fail to learn the tone minimal pairs under the same circumstances in which the 14-month-olds in Experiment 1 succeeded.

Method

Participants

Twenty-four 19-month-old (mean age 19.75 months, range: 19.1–20.1; 13 female) monolingual English-learning infants from a mid-sized Midwestern American city participated in Experiment 2. Exclusion criteria were identical to Experiment 1. Data from 6 additional infants were excluded from the analysis because of fussiness (4), parental interference (1), or looking for the total duration of the trial on more than 5 test trials (1).

Stimuli and Procedure

The stimuli and procedure were identical to those used in Experiment 1.

Results and Discussion

Twenty-three out of 24 infants met the habituation criteria within 25 trials. Data from all 24 infants were included in the analysis because excluding the infant who failed to habituate did not alter the results. Preliminary analyses revealed that there were no main effects or interactions involving infant Sex, Trial Order or Test Block; all subsequent analyses were performed collapsed across these variables. To examine whether 19-month-olds successfully mapped rising /kʊ/ and falling /kʊ/ to novel objects, we compared mean looking time on Same and Switch trials using a paired samples t-test. Looking times on Switch (M = 6.91 s; SD = 3.43) and Same (M = 7.79 s; SD = 3.84) trials did not differ significantly, t(23) = 1.12, p = .274, d = .230 (see Figure 4). These results suggest that 19-month-olds did not notice the labeling violation in the Switch trials.

In order to verify that the performance of the 19-month-olds in Experiment 2 differed from the 14-month-olds tested in Experiment 1, we performed a 2 (Age: 14 months vs. 19 months; between subjects) X 2 (Trial Type: Same vs. Switch; within subjects) mixed analysis of variance (ANOVA). There was no main effect of Trial Type, F(1, 46) < 1, n.s., or Age, F(1, 46) < 1, n.s. Importantly, there was a significant Trial Type X Age interaction, F(1, 46) = 5.162, p = .028, ηp2 = .10, supporting the contention that the two groups differed in their performance.

These results suggest that there is a developmental shift in infants’ willingness to treat lexical tone contrastively. Fourteen-month-olds seem to be open to accepting labels that differ solely in pitch contour as different lexical items, despite the fact that pitch contour is not a lexically contrastive cue in their native language. However, by 19 months of age, infants ignored pitch contour as a contrastive cue to word meaning. Results from Experiment 2 support the hypothesis that differences between our findings in Experiment 1 and those of Quam and Swingley (2010) are likely developmental in nature, rather than distributional in nature. Thus, between 14 and 19 months of age there appears to be a shift in infants’ interpretation of the lexical relevance of pitch contour.

Infants’ performance using lexical tone contrastively appears to be the inverse of their performance mapping consonantal minimal pairs to novel objects. We found that 14-month-olds succeeded in mapping tone-based minimal pairs, whereas infants of the same age typically fail in mapping phoneme-based minimal pairs (Stager & Werker, 1997; Pater et al., 2004). Similarly, we found that 19-month-olds failed to use tone-based minimal pairs, whereas 17- to 20-month-olds typically succeed in mapping phoneme-based minimal pairs (Werker et al., 2002). Improved mapping of phoneme-based minimal pairs likely results, at least partially, from increased attention to phonemic detail. Thus, 17-months may be a pivotal age at which infants become more sensitive to the acoustic properties of object labels. Experiment 2b tested this possibility.

Experiment 2b

In Experiment 2b, we replicated Experiments 1 and 2 with infants intermediate in age (17 months) in order to examine the possibility that developmental change in the contrastive use of lexical tone coincides with when infants begin to display greater attention to phonemic detail in newly learned words.

Method

Participants

Twenty-four 17-month-old (mean age 17.5 months, range: 17.1–18.2 months; 13 female) monolingual English-learning infants from a mid-sized Southeastern American city participated in Experiment 2b. Exclusion criteria were identical to Experiments 1 and 2a. Data from 9 additional infants were excluded from the analysis because of fussiness (4), experimenter error (3), or looking for the total trial duration on more than 5 test trials (2).

Stimuli and Procedure

The stimuli and procedure were identical to those used in Experiments 1 and 2a.

Results and Discussion

Twenty-one out of 24 infants met the habituation criteria within 25 trials. Data from all 24 infants were included in the analysis because excluding the 3 infants who failed to habituate did not alter the results. Preliminary analyses revealed no main effects or interactions involving infant Sex, Trial Order, or Test Block; all subsequent analyses were performed collapsed across these variables. A paired samples t-test revealed that 17-month-olds showed no significant mean looking time difference in Switch (M = 8.44 s; SD = 2.96) and Same (M = 8.22 s; SD = 3.98) trials, t(23) < 1, p = .74, d = .07, indicating that they did not notice the labeling violation in the Switch trials (see Figure 4).

In order to compare performance across age, we preformed a 3 (Age: 14, 17, 19 months; between subjects) x 2 (Trial Type: Same versus Switch; within subjects) mixed ANOVA. There was no main effect of Trial Type, F(1, 69) < 1, n.s., or Age, F(1, 69) < 1, n.s. The Trial Type X Age interaction was only marginally significant, F(1, 69) = 2.895, p = .062, ηp2 = .077. It appears that the 17-month-old infants’ performance is intermediate between the 14-month-olds tested in Experiment 1, who succeeded in mapping lexical tones to novel objects and the 19-month-olds tested in Experiment 2a, who failed to make this mapping (for evidence of intermediate performance by 17-month-olds on phonemic-based minimal pair learning, see Werker, et al, 2002).

The findings suggest that while infants begin to show increasing attention to phonemic detail during word learning tasks, they also become constrained in their use of lexical tone as a contrastive cue. As English-learning infants accrue knowledge that pitch contours are not relevant to word meaning, they ignore the pitch contours of words, much in the same way that they come to ignore the speaker’s gender when interpreting word meaning (Houston & Jusczyk, 2000).

However, the same pattern of results could occur if infants lose the ability to discriminate rising versus falling pitch contours between 14 and 17–19 months of age. This would be consistent with the kind of perceptual narrowing that occurs for many non-native phoneme contrasts (e.g., Werker & Tees, 1984). Although perceptual narrowing is not a universally agreed upon theory of developmental speech perception (for examples of within-category discrimination see Carney, Widin, & Viemeister, 1977; Massaro & Cohen, 1983; Toscano, McMurray, Dennhardt, & Luck, 2010; for examples of phoneme contrasts that are gained, not lost see Eilers & Minifie, 1975; Eilers, Wilson, & Moore, 1977; Kuhl, Stevens, Deguchi, Kiritani, & Iverson, 2006; Narayan, Werker, & Beddor, 2010; for various alternative theories of the development of speech perception see Guenther & Gjaja, 1996; McMurray, Aslin, & Toscano, 2009; Vallabha, McClelland, Pons, Werker, & Amano, 2007), it has been widely documented that across the first year of life early broad sensitivity becoming fine-tuned as infants focus on the speech sound variants that are meaningfully distinct in their native language (e.g., Anderson, Morgan, & White, 2003; Best, McRoberts, LaFleur, & Silver-Isenstadt, 1995; Bosch & Sebastian-Galles, 2003; Cheour et al., 1998; Palmer, Fais, Golinkoff, & Werker, 2012; Pegg & Werker, 1997; Polka & Werker, 1994; Rivera-Gaxiola, Silva-Pereyra, & Kuhl, 2005; Werker & Lalonde, 1988; Werker & Tees, 1984).

Infants growing up in non-tone language environments display some perceptual narrowing for pitch between 4 and 9 months, but results are variable and depend on the salience of the contrast (e.g., Mattock & Burnham, 2006; Mattock, Molnar, Polka, & Burnham, 2008; Yeung, Chen, & Werker, 2013). For example, Mattock and Burnham (2006) found that between 6 and 9 months of age, English-learning infants lose the ability to discriminate rising vs. low level Thai tones, but continue to discriminate the more acoustically salient distinction between rising and falling Thai tones. Yeung and colleagues found evidence of perceptual narrowing for the Cantonese high rising and the mid-level tone contrast between 4 and 9 months (Yeung et al., 2013). However, they also suggested that language experience might affect the perception of lexical tone as early as 4 months. More research is needed to flesh out our understanding of perceptual narrowing of lexical tone.

To ensure that older English-learning infants continue to perceive the distinction between rising versus falling pitch contours in the second year, Experiment 3 tested 19-month-olds’ basic discrimination of the stimuli used in Experiments 1 and 2.

Experiment 3

In Experiment 3, we used a habituation/dishabituation procedure to test whether 19-month-olds can discriminate the tone contrasts used in Experiments 1 and 2. Previous research suggests that some pitch contours are subject to similar perceptual narrowing faced by segmental cues, such as vowels and consonants (e.g., Mattock et al., 2008; Mattock & Burnham, 2006; Yeung et al., 2013). However, we used rising versus falling pitch contours in the current set of experiments expressly because they appear to be less susceptible to perceptual narrowing (Mattock & Burnham, 2006), presumably because they are more acoustically salient (see General Discussion). Consequently, we expected 19-month-olds to readily discriminate rising versus falling /kʊ/ when they were not attempting to associate the sounds with novel objects. This prediction is consistent with Stager and Werker’s (1997) finding that although 14-month-olds cannot map minimal pairs such as /bIh/ and /dIh/ to novel objects, they readily discriminate the minimal pairs. On this account, discrimination is necessary, though not sufficient, for minimal pair mapping (see also Dietrich et al., 2007).

Method

Participants

Fourteen 19-month-old (mean age 19.5 months, range: 18.2–20.4 months; 7 female) monolingual English-learning infants participated in Experiment 3. Ten of the participants were from a small West coast American city and 4 of the infants were from a mid-sized Southeastern American city. Exclusion criteria were identical to Experiments 1 and 2. Data from 6 additional infants were excluded from the analysis because of fussiness (4), experimenter error (1), or excessive movement (1).

Stimuli

The auditory stimuli were identical to those used in Experiments 1 and 2. The visual stimulus consisted of a static black and multicolored 9″ × 14″ checkerboard.

Procedure

The experiment started with a pre-test trial (to familiarize infants with the task), followed by a habituation phase, a test phase, and a post-test trial (to ensure that performance on test trials was not a result of fatigue; see Werker et al., 1998). The pre- and post-test stimulus was a level pitch /la/ which was paired with a checkerboard. Following the pretest trial, infants were habituated to either rising /kʊ/ or falling /kʊ/ paired with the checkerboard. Assignment of habituation stimulus was counterbalanced across participants. As in Experiments 1 and 2, the words repeated until the infant looked away. The habituation criterion was also the same. Following habituation, infants were presented with two test trials. One trial was identical to the habituation trials (Same trial). The other trial was a Change trial where the auditory stimulus changed (i.e., from rising /kʊ/ to falling /kʊ/, or vice versa), but the visual display remained the same. The order of the test trials was counterbalanced such that half of the infants heard the Same trial first, and half heard the Change trial first.

Results and Discussion

All infants reached the habituation criteria. Preliminary analyses revealed that there were no main effects or interactions involving infant Sex, Habituation Stimulus (rising vs. falling), or Test Order; all subsequent analyses were performed collapsed across these variables. To examine whether 19-month-old infants successfully discriminate rising /kʊ/ from falling /kʊ/, we compared mean looking time on Same and Change trials using a paired samples t-test. Analyses revealed that 19-month-olds looked significantly longer on the Change trial (M = 5.17 s; SD = 1.38) than on the Same trial (M = 4.53 s; SD = 1.21), t(13) = 3.71, p = .003, d = .5, indicating that they discriminated rising from falling lexical tone.

Thus, the failure to map minimally contrastive pitch contours to novel objects at 17 and 19 months of age does not appear to be due to perceptual narrowing for rising versus falling lexical tones. Instead, we suggest that between 14 and 17–19 months, monolingual English-learning infants experience an interpretive shift whereby differences in lexical tone cease to be lexically contrastive.

General Discussion

In the current experiments, we investigated the developmental trajectory of English-learning infants’ use of pitch contour as lexically contrastive. The results suggest that infants become increasingly constrained in their interpretation of lexical tone across the second year of life. In Experiment 1, 14-month-olds readily associated novel objects with two novel words that differed only in pitch contour. In Experiment 2, both 17- and 19-month-olds failed to map the same target words to the objects. Finally, in Experiment 3 we confirmed that older infants’ performance on the label-object associations was not a result of perceptual narrowing; 19-month-olds readily discriminated rising from falling /kʊ/ when the labels were not presented with nameable objects.

These findings indicate that young English learners are not only sensitive to variations in pitch contour, but they display a remarkable ability to treat lexical tone contrastively during label-object mapping. Conversely, by 17 to 19 months of age, infants no longer interpret such variations as lexically relevant, although they continue to be sensitive to variations in pitch contour when the mapping problem is eliminated. Thus, the developmental shift in flexibility occurs sometime between 14 and 17–19 months, as infants gain experience with the functions of segmental and suprasegmental units in their language. More broadly, our results support the idea that infants begin the process of word learning flexible in their interpretation of which sound variants are lexically contrastive (e.g., Woodward & Hoyne, 1999), and show language-appropriate constraints as they gain linguistic knowledge (e.g., Graf Estes, Edwards, & Saffran, 2011; Woodward & Hoyne, 1999; Waxman & Booth, 2003; Fulkerson & Waxman, 2007).

Younger children do not typically outperform older children. Some of the domains in which this unusual developmental pattern occurs include face processing (e.g., Kelly et al., 2007; Scott, Pascalis, & Nelson, 2007) and speech perception (e.g., Werker & Tees, 1984). In face processing, infants at 3 months of age perceive distinctions between faces of unfamiliar races that 9-month-olds do not readily discriminate (Kelly et al., 2007). In speech perception, 6-month-olds detect differences between non-native phonemes that 10-month-olds do not readily detect (Werker & Tees, 1984). As infants gain experience in particular domains, their perceptual systems narrow to focus on the distinctions that are predominant in the ambient environment. This allows for more efficient processing of the information that is most likely to be relevant to the infants’ interactions with others, but results in less effective processing for non-native patterns.

The present experiments have a great deal in common with the findings regarding perceptual narrowing. Younger infants showed word learning superior to what we observed with older infants. This decline in performance is adaptive; in order to process English effectively, it is important to treat words that differ only in pitch contour as variations of the same word. In face processing, speech perception, and learning of labels that differ in lexical tone, the developmental shift occurs as infants become more specialized for processing the input present in their environments. However, the present experiments do not demonstrate perceptual narrowing per se. As Experiment 3 showed, infants can readily perceive the distinction between the rising and falling tones when no objects are present, at least as late as 19 months of age. The key is that English-learning infants do not treat these pitch contours as functionally significant in word learning. We have demonstrated an interpretive narrowing: with linguistic experience, infants become increasingly specific about the forms of words that they treat as lexically contrastive.

The present findings integrate with previous work addressing the connection between phoneme perception and word learning (Apfelbaum & McMurray, 2011; Dietrich et al., 2007; Galle et al, 2013; Pater et al., 2004; Rost & McMurray 2009; 2010; Stager & Werker, 1997; Werker et al., 2002; Werker & Curtin, 2005) to provide a rich picture of the effects of phonological development on word learning. Infants must learn which sound variants in the ambient environment signal meaningful contrasts between words. In some circumstances, infants must take sound distinctions that they perceive (e.g., /pIn/ vs /bIn/) and develop sufficiently robust representations of them to determine that the sounds make meaningful distinctions between words. In other cases, infants may perceive that sounds are distinct (e.g., different pitch patterns or vowel durations), but they must learn not to treat the perceptual distinction as functionally significant. This balancing of attention is essential for rapid and effective word learning and word recognition.

The discussion of interpretive narrowing that we have offered suggests that infants become more constrained in what they accept as possible words as they accrue language experience. Numerous studies, however, have suggested that infants begin word learning with strong constraints on the sound variants that they interpret as good object labels (e.g., MacKenzie et al., 2011; 2012). Why, then, did the 14-month-olds in Experiment 1 display flexibility in interpretation of pitch contours? The answer may be multi-layered. First, infants have considerable experience with the ways that pitch contour varies meaningfully in English. Thus, by 14 months of age, infants may have gleaned that pitch contour is meaningfully relevant. However, they may still have to develop the sophisticated knowledge that pitch is not used contrastively for word meaning in English and assign pitch variations to their appropriate linguistic functions (see also Dietrich et al., 2007).

Second, the labels used here contain rich acoustic properties that may facilitate learning. As mentioned previously, rising versus falling tones seem to resist perceptual narrowing during the first year of life (Mattock & Burnham, 2006), presumably due to their acoustic salience (for a discussion of the connection between acoustic salience and the timing of perceptual narrowing see Narayan, et al., 2010; Yeung, et al., 2013). Thus, these acoustically salient tone contours may support label-object mapping relative to less salient contrasts (Curtin, Fennell, & Escudero, 2009; Graf Estes & Hurley, 2012; Stager & Werker, 1997). Additionally, lexical tones unfold over several hundred milliseconds and have broad pitch ranges. In contrast, the segmental differences of minimal pairs such as /bIh/ and /dIh/ unfold over tens of milliseconds, thus there is significantly less information available to the listeners during the labeling task. Hence, the development of attention to phonemic-based minimal pairs may differ from pairs that differ in a variety of suprasegmental characteristics, including pitch contour and stress (for evidence of stress-based minimal pair mapping see Curtin, 2009). During the early process of discovering which acoustic properties of sounds in the environment make meaningful distinctions between words, infants may initially rely more heavily on acoustic salience. As infants gain linguistic experience, they may begin to weight native-language lexical properties more than acoustic salience (see also Dietrich et al., 2007; Galle et al., 2013; Rost & McMurray, 2009, 2010). Future research will be necessary to directly explore the role of acoustic salience in label-object mapping.

The current set of studies informs our understanding of flexibility and constraint in early word learning. They reveal the time course of interpretive narrowing of lexical tone across the second year of life. Even in the absence of social or referential support, young monolingual English-learning infants treat syllables with distinct tonal contours as labels for distinct objects. Their success is surprising given the difficulty that infants around this age typically experience mapping minimal pair words to novel objects (Stager & Werker, 1997; Pater et al., 2004; Werker et al., 2002) as well as a variety of communicative and non-communicative mouth sounds (MacKenzie et al., 2011), and sound sequences that are phonotactically illegal (MacKenzie et al., 2012). We propose that flexibility and constraint in early word learning emerge from a confluence of factors including acoustic salience and the quantity and nature of the linguistic experience. For example, recent work suggests that bilinguals may have a protracted period of flexibility during early label-object mapping compared to monolinguals (Graf Estes & Hay, 2014). Further research is needed to establish the relative roles of each of these factors in interpretive narrowing. Lexical tone provides a unique lens through which to study these relationships.

Acknowledgments

This research was funded by grants from NICHD to JFH (F32HD557032) and to JRS (R37HD037466), a grant from the National Science Foundation to KGE (BCS0847379), a grant from the James F. McDonnell Foundation to JRS, and by a NICHD core grant to the Waisman Center (P30HD03352). The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health or the National Science Foundation. We would like to thank the members of the Infant Learning Lab at the University of Wisconsin-Madison, the Infant Language and Perceptual Learning Lab at the University of Tennessee, Knoxville, and the Language Learning Lab at the University of California, Davis for their assistance in the conduct of this research. We would also like to thank the families who generously contributed their time, and three anonymous reviewers for helpful comments on a previous version of this manuscript.

Contributor Information

Jessica F. Hay, University of Tennessee, Knoxville

Katharine Graf Estes, University of California, Davis.

Tianlin Wang, University of Wisconsin – Madison.

Jenny R. Saffran, University of Wisconsin – Madison

References

  1. Anderson JL, Morgan JL, White KS. A statistical basis for speech sound discrimination. Language and Speech. 2003;46:155–182. doi: 10.1177/00238309030460020601. [DOI] [PubMed] [Google Scholar]
  2. Apfelbaum K, McMurray B. Using variability to guide dimensional weighting: Associative mechanisms in early word learning. Cognitive Science. 2011;35:1105–1138. doi: 10.1111/j.1551-6709.2011.01181.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Best CT, McRoberts GW, LaFleur R, Silver-Isenstadt J. Divergent developmental patterns for infants’ perception of two nonnative consonant contrasts. Infant Behavior & Development. 1995;18:339–350. doi: 10.1016/0163-6383(95)90022-5. [DOI] [Google Scholar]
  4. Bolinger D. Intonation and its uses: Melody in grammar and discourse. Stanford: Stanford University Press; 1989. [Google Scholar]
  5. Bosch L, Sebastián-Gallés N. Simultaneous bilingualism and the perception of a language-specific vowel contrast in the first year of life. Language and Speech. 2003;46:217–243. doi: 10.1177/00238309030460020801. [DOI] [PubMed] [Google Scholar]
  6. Burnham D, Mattock K. Language experience in second language speech learning: In honor of James Emil Flege. 2007. pp. 259–280. [Google Scholar]
  7. Cheour M, Ceponiene R, Lehtokoski A, Luuk A, Allik J, Alho K, Näätänen R. Development of language-specific phoneme representations in the infant brain. Nature Neuroscience. 1998;1:351–353. doi: 10.1038/1561. [DOI] [PubMed] [Google Scholar]
  8. Cohen LB, Atkinson DJ, Chaput HH. Habit X: A new program for obtaining and organizing data in infant perception and cognition studies (Version 1.0) Austin: University of Texas; 2004. [Google Scholar]
  9. Curtin S. Twelve-month-olds learn novel word-object pairings differing only in stress pattern. Journal of Child Language. 2009;36:1157–1165. doi: 10.1017/S0305000909009428. [DOI] [PubMed] [Google Scholar]
  10. Curtin S, Fennell C, Escudero P. Weighting of acoustic cues explains patterns of word-object associative learning. Developmental Science. 2009;12:725–731. doi: 10.1111/j.1467-7687.2009.00814.x. [DOI] [PubMed] [Google Scholar]
  11. Carney AE, Widin GP, Viemeister NF. Non categorical perception of stop consonants differing in VOT. Journal of the Acoustical Society of America. 1977;62:961–970. doi: 10.1121/1.381590. [DOI] [PubMed] [Google Scholar]
  12. Dietrich C, Swingley D, Werker JF. Native language governs interpretation of salient speech sound differences at 18 months. Proceedings of the National Academy of Sciences. 2007;104:16027–16031. doi: 10.1073/pnas.0705270104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Eilers R, Minifie F. Fricative discrimination in early infancy. Journal of Speech and Hearing Research. 1975;18:158–167. doi: 10.1044/jshr.1801.158. [DOI] [PubMed] [Google Scholar]
  14. Eilers R, Wilson W, Moore J. Developmental changes in speech discrimination in infants. Perception & Psychophysics. 1977;16:513–521. doi: 10.1044/jshr.2004.766. [DOI] [PubMed] [Google Scholar]
  15. Fennell CT. Object familiarity enhances infants’ use of phonetic detail in novel words. Infancy. 2012;17:339–353. doi: 10.1111/j.1532-7078.2011.00080.x. [DOI] [PubMed] [Google Scholar]
  16. Fennell C, Waxman SR. What paradox? Referential cues allow for infant use of phonetic detail in word learning. Child Development. 2010;81:1376–1383. doi: 10.1111/j.1467-8624.2010.01479.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Fernald A. Intonation and communicative intent in mothers’ speech to infants: Is the melody the message? Child Development. 1989;60:1497–1510. doi: 10.2307/1130938. [DOI] [PubMed] [Google Scholar]
  18. Fernald A. Human maternal vocalizations to infants as biologically relevant signals: An evolutionary perspective. New York, NY, US: Oxford University Press, New York, NY; 1992. pp. 391–428. [Google Scholar]
  19. Fernald A, Kuhl PK. Acoustic determinants of infant preference for motherese speech. Infant Behavior & Development. 1987;10:279–293. doi: 10.1016/0163-6383(87)90017-8. [DOI] [Google Scholar]
  20. Fromkin VA. Tone: A Linguistic Survey. Academic Press, London. 1978;1:292. [Google Scholar]
  21. Fry DB. Experiments in the perception of stress. Language and Speech. 1958;1:126–152. [Google Scholar]
  22. Fulkerson AL, Waxman SR. Words (but not tones) facilitate object categorization: Evidence from 6- and 12-month-olds. Cognition. 2007;105:218–228. doi: 10.1016/j.cognition.2006.09.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Galle M, Apfelbaum K, McMurray B. Within-speaker variability benefits phonological word learning. Language Learning and Development. 2013 Manuscript in press. [Google Scholar]
  24. Goldfield BA, Reznick JS. Early lexical acquisition: Rate, content, and the spurt. Journal of Child Language. 1990;17:171–183. doi: 10.1017/S0305000900013167. [DOI] [PubMed] [Google Scholar]
  25. Graf Estes K, Edwards J, Saffran JR. Phonotactic constraints on infant word learning. Infancy. 2011;16:180–197. doi: 10.1111/j.1532-7078.2010.00046.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Graf Estes K, Evans JL, Alibali MW, Saffran JR. Can infants map meaning to newly segmented words? Statistical segmentation and word learning. Psychological Science. 2007;18:254–260. doi: 10.1111/j.1467-9280.2007.01885.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Graf Estes K, Hay JF. Bilinguals are flexible word learners. 2014 Manuscript under editorial review. [Google Scholar]
  28. Graf Estes K, Hurley K. Infant-directed prosody helps infants map sounds to meanings. Infancy. 2013;18:797–824. doi: 10.1111/infa.12006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Guenther F, Gjaja M. The perceptual magnet effect as an emergent property of neural map formation. Journal of the Acoustical Society of America. 1996;100:1111–1112. doi: 10.1121/1.416296. [DOI] [PubMed] [Google Scholar]
  30. Gussenhoven C. The phonology of tone and intonation. Cambridge UP; 2004. [DOI] [Google Scholar]
  31. Hay JF, Pelucchi B, Graf Estes K, Saffran JR. Linking sounds to meanings: Infant statistical learning in a natural language. Cognitive Psychology. 2011;63:93–106. doi: 10.1016/j.cogpsych.2011.06.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Hirschberg J, Ward G. The influence of pitch range, duration, amplitude and spectral features on the interpretation of the rise-fall-rise intonation contour in English. Journal of Phonetics. 1992;20:241–251. [Google Scholar]
  33. Hirsh-Pasek K, Golinkoff RM, Hollich G. Becoming a word learner: A debate on lexical acquisition. New York, NY: Oxford University Press; 2000. An emergentist coalition model for word learning: Mapping words to objects is a product of the interaction of multiple cues; pp. 136–164. [DOI] [Google Scholar]
  34. Houston DM, Jusczyk PW. The role of talker-specific information in word segmentation by infants. Journal of Experimental Psychology: Human Perception and Performance. 2000;26:1570–1582. doi: 10.1037/0096-1523.26.5.1570. [DOI] [PubMed] [Google Scholar]
  35. Katz GS, Cohn JF, Moore CA. A combination of vocal f0 dynamic and summary features discriminates between three pragmatic categories of infant-directed speech. Child Development. 1996;67:205–217. doi: 10.1111/j.1467-8624.1996.tb01729.x. [DOI] [PubMed] [Google Scholar]
  36. Kelly DJ, Quinn PC, Slater AM, Lee K, Ge L, Pascalis O. The other-race effect develops during infancy: Evidence of perceptual narrowing. Psychological Science. 2007;18:1084–1089. doi: 10.1111/j.1467-9280.2007.02029.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Kuhl PK, Stevens EH, Deguchi AT, Kiritani S, Iverson P. Infants show a facilitation effect for native language phonetic perception between 6 and 12 months. Developmental Science. 2006;9:F13–F21. doi: 10.1111/j.1467-7687.2006.00468.x. [DOI] [PubMed] [Google Scholar]
  38. Liu S, Samuel AG. Perception of mandarin lexical tones when F0 information is neutralized. Language and Speech. 2004;47:109–138. doi: 10.1177/00238309040470020101. [DOI] [PubMed] [Google Scholar]
  39. MacKenzie H, Curtin S, Graham SA. 12-month-olds’ phonotactic knowledge guides their Word–object mappings. Child Development. 2012;83:1129–1136. doi: 10.1111/j.1467-8624.2012.01764.x. [DOI] [PubMed] [Google Scholar]
  40. MacKenzie H, Graham SA, Curtin S. Twelve-month-olds privilege words over other linguistic sounds in an associative learning task. Developmental Science. 2011;14:249–255. doi: 10.1111/j.1467-7687.2010.00975.x. [DOI] [PubMed] [Google Scholar]
  41. Massaro DW, Cohen MM. Categorical or continuous speech perception: a new test. Speech Communication. 1983;2:15–35. doi: 10.1016/0167-6393(83)90061-4. [DOI] [Google Scholar]
  42. Mattock K, Burnham D. Chinese and English infants’ tone perception: Evidence for perceptual reorganization. Infancy. 2006;10:241–265. doi: 10.1207/s15327078in1003_3. [DOI] [Google Scholar]
  43. Mattock K, Molnar M, Polka L, Burnham D. The developmental course of lexical tone perception in the first year of life. Cognition. 2008;106:1367–1381. doi: 10.1016/j.cognition.2007.07.002. [DOI] [PubMed] [Google Scholar]
  44. McMurray B, Aslin RN, Toscano JC. Statistical learning of phonetic categories: Insights from a computational approach. Developmental Science. 2009;12:369–378. doi: 10.1111/j.1467-7687.2009.00822.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Moore DS, Spence MJ, Katz GS. Six-month-olds’ categorization of natural infant-directed utterances. Developmental Psychology. 1997;33:980–89. doi: 10.1037/0012-1649.33.6.980. [DOI] [PubMed] [Google Scholar]
  46. Namy LL. What’s in a name when it isn’t a word? 17-month-olds’ mapping of nonverbal symbols to object categories. Infancy. 2001;2:73–86. doi: 10.1207/S15327078IN0201_5. [DOI] [PubMed] [Google Scholar]
  47. Namy LL, Waxman SR. Words and gestures: Infants’ interpretations of different forms of symbolic reference. Child Development. 1998;69:295–308. doi: 10.1111/j.1467-8624.1998.tb06189.x. [DOI] [PubMed] [Google Scholar]
  48. Narayan C, Werker JF, Beddor P. The interaction between acoustic salience and language experience in developmental speech perception: Evidence from nasal place discrimination. Developmental Science. 2010;13:407–420. doi: 10.1111/j.1467-7687.2009.00898.x. [DOI] [PubMed] [Google Scholar]
  49. Palmer SB, Fais L, Golinkoff RM, Werker JF. Perceptual narrowing of linguistic sign occurs in the 1st year of life. Child Development. 2012;83:543–553. doi: 10.1111/j.1467-8624.2011.01715.x. [DOI] [PubMed] [Google Scholar]
  50. Pater J, Stager CL, Werker JF. The lexical acquisition of phonological contrasts. Language. 2004;80:361–379. doi: 10.1353/lan.2004.0141. [DOI] [Google Scholar]
  51. Pegg JE, Werker JF. Adult and infant perception of two English phones. Journal of the Acoustical Society of America. 1997;102:3742–3753. doi: 10.1121/1.420137. [DOI] [PubMed] [Google Scholar]
  52. Pierrehumbert J. Doctoral dissertation. Massachusetts Institute of Technology; 1980. The Phonology and Phonetics of English Intonation. Retrieved from http://faculty.wcas.northwestern.edu/~jbp/publications/Pierrehumbert_PhD.pdf. [Google Scholar]
  53. Polka L, Werker JF. Developmental changes in perception of nonnative vowel contrasts. Journal of Experimental Psychology: Human Perception and Performance. 1994;20:421–435. doi: 10.1037/0096-1523.20.2.421. [DOI] [PubMed] [Google Scholar]
  54. Quam C, Swingley D. Phonological knowledge guides two-year-olds’ and adults’ interpretation of salient pitch contours in word learning. Journal of Memory and Language. 2010;62:135–150. doi: 10.1016/j.jml.2009.09.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Rivera-Gaxiola M, Silva-Pereyra J, Kuhl PK. Brain potentials to native and non-native speech contrasts in 7- and 11-month-old american infants. Developmental Science. 2005;8:162–172. doi: 10.1111/j.1467-7687.2005.00403.x. [DOI] [PubMed] [Google Scholar]
  56. Rost GC, McMurray B. Speaker variability augments phonological processing in early word learning. Developmental Science. 2009;12:339–349. doi: 10.1111/j.1467-7687.2008.00786.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Rost GC, McMurray B. Finding the signal by adding noise: The role of non-contrastive phonetic variability in early word learning. Infancy. 2010;15:608. doi: 10.1111/j.1532-7078.2010.00033.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Scott LS, Pascalis O, Nelson CA. A domain general theory of the development of perceptual discrimination. Current Directions in Psychological Science. 2007;16:197–201. doi: 10.1111/j.1467-8721.2007.00503.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Stager CL, Werker JF. Infants listen for more phonetic detail in speech perception than in word-learning tasks. Nature. 1997;388:381–382. doi: 10.1038/41102. [DOI] [PubMed] [Google Scholar]
  60. Toscano JC, McMurray B, Dennhardt J, Luck S. Continuous perception and graded categorization: Electrophysiological evidence for a linear relationship between the acoustic signal and perceptual encoding of speech. Psychological Science. 2010;21:1532–1540. doi: 10.1177/0956797610384142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Vallabha GK, McClelland JL, Pons F, Werker JF, Amano S. Unsupervised learning of vowel categories from infant-directed speech. Proceedings of the National Academy of Sciences. 2007;104:13273–13278. doi: 10.1073/pnas.0705369104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Ward G, Hirschberg J. Implicating uncertainty: The pragmatics of fall-rise intonation. Language. 1985;64:747–776. doi: 10.2307/414489. [DOI] [Google Scholar]
  63. Waxman S, Booth A. The origins and evolution of links between word learning and conceptual organization: New evidence from 11-month-olds. Developmental Science. 2003;6:128–135. doi: 10.1111/1467-7687.00262. [DOI] [Google Scholar]
  64. Werker JF, Cohen LB, Lloyd VL, Casasola M, Stager CL. Acquisition of word–object associations by 14-month-old infants. Developmental Psychology. 1998;34:1289–1309. doi: 10.1037/0012-1649.34.6.1289. [DOI] [PubMed] [Google Scholar]
  65. Werker JF, Curtin S. PRIMIR: A Developmental Framework of Infant Speech Processing. Language Learning and Development. 2005;1:197–234. doi: 10.1080/15475441.2005.9684216. [DOI] [Google Scholar]
  66. Werker JF, Fennell CT, Corcoran KM, Stager CL. Infants’ ability to learn phonetically similar words: Effects of age and vocabulary size. Infancy. 2002;3:1–30. doi: 10.1207/S15327078IN0301_1. [DOI] [Google Scholar]
  67. Werker JF, Lalonde CE. Cross-language speech perception: Initial capabilities and developmental change. Developmental Psychology. 1988;24:672–683. doi: 10.1037/0012-1649.24.5.672. [DOI] [Google Scholar]
  68. Werker JF, Tees RC. Cross-language speech perception: Evidence for perceptual reorganization during the first year of life. Infant Behavior & Development. 1984;7:49–63. doi: 10.1016/S0163-6383(84)80022-3. [DOI] [Google Scholar]
  69. Woodward AL, Hoyne KL. Infants’ learning about words and sounds in relation to objects. Child Development. 1999;70:65–77. doi: 10.1111/1467-8624.00006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Xu Y. Contextual tonal variations in Mandarin. Journal of Phonetics. 1997;25:61–83. doi: 10.1006/jpho.1996.0034. [DOI] [Google Scholar]
  71. Yeung HH, Chen KH, Werker JF. When does native language input reorganize phonetic perception? The precocious case of lexical tone. Journal of Memory and Language. 2013;68:123–139. doi: 10.1016/j.jml.2012.09.004. [DOI] [Google Scholar]
  72. Yeung HH, Werker JF. Learning words’ sounds before learning how words sound: 9-month-old infants use distinct objects as cues to categorize speech information. Cognition. 2009;113:234–243. doi: 10.1016/j.cognition.2009.08.010. [DOI] [PubMed] [Google Scholar]
  73. Yip M. Tone. Cambridge: Cambridge University Press; 2002. [DOI] [Google Scholar]
  74. Yoshida KA, Fennell CT, Swingley D, Werker JF. Fourteen-month-old infants learn similar-sounding words. Developmental Science. 2009;12:412–418. doi: 10.1111/j.1467-7687.2008.00789.x. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES