Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2022 Jul 11.
Published in final edited form as: Adv Methods Pract Psychol Sci. 2021 Mar 12;4(1):10.1177/2515245920974622. doi: 10.1177/2515245920974622

A multi-lab study of bilingual infants: Exploring the preference for infant-directed speech

Krista Byers-Heinlein 1, Angeline Sin Mei Tsui 2, Christina Bergmann 3, Alexis K Black 4, Anna Brown 5, Maria Julia Carbajal 6, Samantha Durrant 5, Christopher T Fennell 7, Anne-Caroline Fiévet 6, Michael C Frank 2, Anja Gampe 8, Judit Gervain 9, Nayeli Gonzalez-Gomez 10, J Kiley Hamlin 4, Naomi Havron 6, Mikołaj Hernik 11, Shila Kerr 12, Hilary Killam 1, Kelsey Klassen 13, Jessica E Kosie 14, Ágnes Melinda Kovács 15, Casey Lew-Williams 14, Liquan Liu 16, Nivedita Mani 17, Caterina Marino 9, Meghan Mastroberardino 1, Victoria Mateu 18, Claire Noble 5, Adriel John Orena 12, Linda Polka 12, Christine E Potter 14, Melanie Schreiner 17, Leher Singh 19, Melanie Soderstrom 13, Megha Sundara 18, Connor Waddell 16, Janet F Werker 4, Stephanie Wermelinger 8
PMCID: PMC9273003  NIHMSID: NIHMS1769134  PMID: 35821764

Abstract

From the earliest months of life, infants prefer listening to and learn better from infant-directed speech (IDS) than adult-directed speech (ADS). Yet, IDS differs within communities, across languages, and across cultures, both in form and in prevalence. This large-scale, multi-site study used the diversity of bilingual infant experiences to explore the impact of different types of linguistic experience on infants’ IDS preference. As part of the multi-lab ManyBabies 1 project, we compared lab-matched samples of 333 bilingual and 385 monolingual infants’ preference for North-American English IDS (cf. ManyBabies Consortium, 2020: ManyBabies 1), tested in 17 labs in 7 countries. Those infants were tested in two age groups: 6–9 months (the younger sample) and 12–15 months (the older sample). We found that bilingual and monolingual infants both preferred IDS to ADS, and did not differ in terms of the overall magnitude of this preference. However, amongst bilingual infants who were acquiring North-American English (NAE) as a native language, greater exposure to NAE was associated with a stronger IDS preference, extending the previous finding from ManyBabies 1 that monolinguals learning NAE as a native language showed a stronger preference than infants unexposed to NAE. Together, our findings indicate that IDS preference likely makes a similar contribution to monolingual and bilingual development, and that infants are exquisitely sensitive to the nature and frequency of different types of language input in their early environments.

Keywords: language acquisition, bilingualism, speech perception, infant-directed speech, reproducibility, experimental methods


When caregivers interact with their infants, their speech often takes on specific, distinguishing features in a speech register known as infant-directed speech (IDS; Fernald et al., 1989). IDS is produced by caregivers of most (although not all) linguistic and cultural backgrounds, and is typically characterized by a slow, melodic, high-pitched, and exaggerated cadence (Farran, Lee, Yoo, & Oller, 2016; Fernald et al., 1989; Kitamura, Thanavishuth, Burnham, & Luksaneeyanawin, 2001; Pye, 1986; Shute & Wheldall, 1999). From early in life, infants tune their attention to IDS, preferring to listen to IDS over adult-directed speech (ADS) both at birth (Cooper & Aslin, 1990), as well as later in infancy (Cooper, Abraham, Berman, & Staska, 1997; Cooper & Aslin, 1994; Fernald, 1985; Hayashi, Tamekawa, & Kiritani, 2001; Kitamura & Lam, 2009; Newman & Hussain, 2006; Pegg, Werker, & McLeod, 1992; Santesso, Schmidt, & Trainor, 2007; Singh, Morgan, & Best, 2002; Werker & McLeod, 1989; Werker, Pegg, & McLeod, 1994).

Infants’ preference for IDS may play a useful role in early language learning. For example, infants are better able to discriminate speech sounds in IDS than in ADS (Karzon, 1985; Trainor & Desjardins, 2002), more efficiently segment words from continuous speech in an IDS register (Thiessen, Hill, & Saffran, 2005), demonstrate better long-term memory for words spoken in IDS (Singh, Nestor, Parikh, & Yull, 2009) and learn new words more effectively from IDS than ADS (Graf Estes & Hurley, 2013; Ma, Golinkoff, Houston, & Hirsh-Pasek, 2011; but see Schreiner, Altvater-Mackensen, & Mani, 2016).

While most studies have confirmed a general, early preference for IDS, to date there is very little research aimed at understanding how different linguistic experiences affect infants’ preferences. For instance, although the use of IDS has been demonstrated in a large number of cultures (see above citations), the vast majority of the research on infants’ IDS preferences has been conducted in North America, using English speech typically directed at North American English-hearing infants (Dunst, Gorman, & Hamby, 2012). Most critically, past work has been limited to a particular kind of linguistic (and cultural) experience: that of the monolingual infant. Here, we present a large-scale, multi-site, pre-registered study on bilingual infants, a population that is particularly suited to explore the relationship between language experience and IDS preference. Moreover, this research provides important insight into the early development of bilingual infants, a large but understudied population.

Does experience tune infants’ preference for IDS?

What role might experience play in tuning infants’ attention to IDS? We aggregated results from a published meta-analysis (Dunst et al., 2012) with additional community-contributed data (MetaLab, 2017) to examine their combined results. When all 62 studies are considered, we found a moderately-sized average effect of Cohen’s d =.64. Focusing on the 22 studies most similar to ours (testing IDS preference using looking times collected in a laboratory, among typically-developing infants from 3–15 months, with naturally-produced English-spoken IDS from an unfamiliar female speaker), we found a slightly smaller effect size, d = .60. Although this meta-analysis focused on infants in the first year of life, other studies of infants aged 18–21 months have also reported a preference for IDS over ADS (Glenn & Cunningham, 1983; Robertson, von Hapsburg, & Hay, 2013). There is some evidence that older infants show a greater preference for IDS than younger infants (Dunst et al., 2012), although an age effect was not found in the subsample of 22 studies mentioned above. More evidence is needed to explore the possibility that increased language experience as children grow enhances their preference for IDS.

Another variable that would be important in understanding the role of experience in the preference for IDS is whether the speech stimuli were presented in a native or non-native language. Numerous studies in early perception find different developmental trajectories for perception of native versus non-native stimuli (e.g. discriminating human faces vs. discriminating monkey faces, Lewkowicz & Ghazanfar, 2006; discriminating native vs. discriminating non-native speech sound categories, Maurer & Werker, 2014; segmenting word forms from fluent speech, e.g., Polka & Sundara, 2012). Generally, whereas infants show increasing proficiency in discriminating the types of faces and sounds that are present in their environment, they lose sensitivity to the differences between non-native stimuli over time. This general pattern might lead us to predict that infants will initially be sensitive to differences between IDS and ADS in both the native and non-native languages, but that this initial cross-linguistic sensitivity will decline with age. In other words, at some ages, infants’ preference for IDS over ADS could be enhanced when hearing their native language. However, to date, there is very little data on this question. Importantly, this general trend, if it exists, may interact with differences across languages in the production of IDS. The exaggerated IDS of North American English might be either more interesting or less interesting to an infant whose native language is characterized by a less exaggerated form of IDS, than for an infant who regularly hears North American English IDS.

Only a handful of IDS preference studies have explicitly explored infants’ preference for IDS from infants’ native versus a non-native language. Werker et al. (1994) compared 4.5- and 9-month-old English and Cantonese-learning infants’ preference for videos of Cantonese mothers using IDS versus ADS. Both groups showed a preference for IDS; however, the magnitude of the preference between the two groups was not specifically compared (Werker et al., 1994). Hayashi et al. (2001) studied Japanese-learning infants’ (aged 4–14 months) preference for native (Japanese) and non-native (English) speech. Japanese-learning infants generally showed a preference for Japanese IDS over ADS, as well as an increasing preference for Japanese IDS over English IDS. The latter finding shows that infants tune into their native language with increased experience; however, as the study did not measure infants’ interest in English ADS, we do not know whether Japanese infants were equally sensitive to the difference between ADS and IDS in the non-native stimuli, or whether/how this might change over time.

Infants growing up bilingual are typically exposed to IDS in two languages. They provide a particularly useful wedge in understanding experiential influences on infants’ attention to IDS. Bilingual infants receive less exposure to each of their languages than monolingual infants, and the exact proportion of exposure to each of their two languages varies from infant to infant. This divided exposure does not appear to slow the overall rate of language acquisition: bilinguals pass their language milestones on approximately the same schedule as monolingual infants, such as the onset of babbling and the production of their first words (Werker & Byers-Heinlein, 2008). Nonetheless, children from different language backgrounds receive different types of input, and must ultimately acquire different language forms, which can alter some patterns of language acquisition (e.g., Choi & Bowerman, 1991; Slobin, 1985; Tardif, 1996; Tardif, Shatz, & Naigles, 1997; Werker & Tees, 1984). As a consequence, bilingual infants allow researchers to investigate how a given “dose” of experience with a specific language relates to phenomena in language acquisition, while holding infants’ age and total experience with language constant (Byers-Heinlein & Fennell, 2014).

Aside from the opportunity to study dose effects, it is important to examine the preference for IDS in bilingual infants for the sake of understanding bilingual development itself. Several lines of research suggest that early exposure to two languages changes some aspects of early development (Byers-Heinlein & Fennell, 2014), including bilinguals’ perception of non-native speech sounds (i.e., sounds that are in neither of their native languages). For example, a number of studies have reported that bilinguals maintain sensitivity to non-native consonant contrasts (García-Sierra, Ramírez-Esparza, & Kuhl, 2016; Petitto et al., 2012; Ramírez, Ramírez, Clarke, Taulu, & Kuhl, 2017), tone contrasts (Graf Estes & Hay, 2015; Liu & Kager, 2017a), and visual differences between languages (i.e., rhythmic and phonetic information available on talkers’ faces; Sebastián-Gallés, Albareda-Castellot, Weikum, & Werker, 2012) until a later age than monolinguals. Other studies have suggested that bilinguals’ early speech perception is linked to their language dominance (Liu & Kager, 2015; Molnar, Carreiras, & Gervain, 2016; Sebastián-Gallés & Bosch, 2002), whereby bilinguals’ perception most closely matches that of monolinguals in their dominant language. Bilingual infants also demonstrate some cognitive differences from monolinguals that are not specific to language, including faster visual habituation (Singh et al., 2015), better memory generalization (Brito & Barr, 2014; Brito, Sebastián-Gallés, & Barr, 2015), and greater cognitive flexibility (Kovács & Mehler, 2009a, 2009b). This might reflect an early-emerging difference in information processing between the two groups. Together, these lines of work raise the possibility that preference for IDS versus ADS could have a different developmental course for bilingual and monolingual infants, and that bilinguals’ distinct course could interact with factors such as language dominance.

Bilinguals’ exposure to and learning from IDS

Overall, there is very little research on whether bilinguals’ experience with IDS is comparable to monolinguals’ experience. Some research has compared English monolinguals and English-Spanish bilinguals in the United States (Ramírez-Esparza, García-Sierra, & Kuhl, 2014, 2017). Here, researchers reported that bilingual infants around 1 year of age received less exposure to IDS than monolingual infants on average. Moreover, in the bilingual families, input was more evenly distributed across infant- and adult-directed registers. It is difficult to know whether the results reported in these studies generalize to other populations of bilinguals, or whether it was specific to this language community. As acknowledged by the authors, the bilinguals in this study were of a lower SES than the monolinguals, which could have driven differences in the amount of IDS that infants heard. On the other hand, it might be the case that bilingual infants more rapidly lose their preference for the IDS register than do monolinguals, and that caregivers of bilinguals respond to this by reducing the amount of IDS input they provide.

Bilingual infants might also hear IDS that differs prosodically and phonetically from that heard by monolingual infants. Bilingual infants often have bilingual caregivers, and even when they are highly proficient speakers, their speech may vary from that of monolinguals. One study compared vowels produced in the IDS of monolingual English, monolingual French, and balanced French-English bilingual mothers living in Montreal (Danielson, Seidl, Onishi, Alamian, & Cristia, 2014). Bilingual mothers’ vowels were distinct in the two languages, and the magnitude of the difference between French and English vowels was similar to that shown by monolingual mothers. However, another study showed that, in a word-learning task, 17-month-old French-English bilinguals learned new words better from a bilingual speaker than a monolingual speaker, even though acoustic measurements did not reveal what dimension infants were attending to (Fennell & Byers-Heinlein, 2014; similar findings were found in Mattock, Polka, Rvachew, & Krehm, 2010). Finally, a study of Spanish-Catalan bilingual mothers living in Barcelona found that some mothers were more variable in their productions of a difficult Catalan vowel contrast than monolingual mothers (Bosch & Ramon-Casas, 2011). Thus, bilingual infants may not only differ in the amount of IDS they hear in a particular language relative to monolingual infants, but different populations of bilingual infants may also vary in how similar the IDS they hear is to monolingual-produced IDS in the same languages. This could, in turn, lead to greater variability across bilinguals in their preference for IDS over ADS when tested with any particular stimulus materials.

Regardless of bilingual infants’ specific experience with IDS, evidence suggests that bilinguals might enjoy the same learning benefits from IDS as monolinguals. For example, Ramírez-Esparza et al. (2017) found that greater exposure to IDS predicted larger vocabulary size in both monolingual and bilingual infants. Indeed, an untested possibility is that exposure to IDS might be of particular benefit to bilingual infants. Bilinguals face a more complex learning situation than monolinguals, as they acquire two sets of sounds, words, and grammars simultaneously (Werker & Byers-Heinlein, 2008). This raises the possibility that bilingual infants might have enhanced interest in IDS relative to monolinguals, or that they might maintain a preference for IDS until a later age than monolinguals, similar to the extended sensitivity observed in bilingual infants’ perception of non-native phonetic contrasts.

Replicability in research with bilingual infants

Working with bilingual infant populations engenders unique replicability issues above and beyond those common in the wider field of infant research (e.g., between-lab variability, methodological variation, etc.; see Frank et al., 2017). These issues begin with the nature of the population. Our discussion of bilingual infants thus far has used “bilingual” as a blanket term to describe infants growing up hearing two or more languages. However, this usage belies the large variability in groups of infants described as “bilingual”. First, some studies of bilinguals have included infants from a homogeneous language background (where all infants are exposed to the same language pair; e.g. English-Spanish in Ramírez-Esparza et al., 2017), while others have included infants from heterogeneous language backgrounds (where infants are exposed to different language pairs, e.g., English-Other, where “Other” might be Spanish, French, Mandarin, Punjabi, etc.; e.g., Fennell, Byers-Heinlein, & Werker, 2007). Second, some bilinguals learn two typologically closely related languages (e.g. Spanish-Catalan) while others learn two distant languages (e.g. English-Mandarin). Third, there is wide variability between bilingual infants in the amount of exposure to each language, which introduces an extra dimension of individual differences relative to studies with monolingual infants. Fourth, studies define bilingualism in different ways, ranging from a liberal criterion of at least 10% exposure to the non-dominant language to at least 40% exposure to the non-dominant language (Byers-Heinlein, 2015). Finally, bilingual and monolingual populations can be difficult to compare because of cultural, sociological, and socio-economic status differences that exist between samples.

All of the above difficulties have resulted in very few findings being replicated across different samples of bilinguals. The limited research that has compared different types of bilingual learners has indicated that the particular language pair being learned by bilingual infants influences perception of both native (Bialystok, Luk, & Kwan, 2005; Sundara & Scutellaro, 2011) and non-native (Patihis, Oh, & Mogilner, 2015) speech sounds. In contrast, other studies have not found differences between bilinguals learning different language pairs, for example in their ability to apply speech perception skills to a word-learning task (Fennell et al., 2007). Generally, we do not know how replicable most findings are across different groups of bilinguals, or how previously reported effects of bilingualism on learning and perception are impacted by the theoretically interesting moderators discussed above.

Research on bilingual infants also faces many of the same general concerns shared with other fields of infancy research, such as challenges recruiting sufficient participants to conduct well-powered studies (Frank et al., 2017). Finding an appropriate bilingual sample further limits the availability of research participants, even in locations with significant bilingual populations. Such issues are particularly relevant given the recent emphasis on the replicability and best practices in psychological science (Klein et al., 2014; Open Science Collaboration, 2015; Simmons, Nelson, & Simonsohn, 2011). Of particular interest is whether bilingual infants as a group show greater variability in their responses than monolingual infants, and how to characterize the variability of responses between the different types of samples of bilinguals that can be recruited by particular labs (i.e., homogeneous vs. heterogeneous samples). Understanding whether variability differs systematically across groups is vital for planning appropriately-powered studies.

Description of the current study

Here, we report a large-scale, multi-site, pre-registered study aimed at using data from bilingual infants to understand variability in infants’ preference for IDS over ADS. This study, “ManyBabies 1 Bilingual”, is a companion project to the “ManyBabies 1” project, published in a previous issue of this journal (ManyBabies Consortium, 2020). The two studies were conducted in parallel, using the same stimuli and experimental procedure. However, while ManyBabies 1 analyzed all data collected from monolingual infants (including those data from monolinguals reported here), the current study reports a subset of these data together with additional data from bilingual infants not reported in that paper. Our multi-site approach gives us precision in estimating the overall effect size of bilingual infants’ preference for IDS, while also allows us to investigate how different types of language experience moderate this effect.

Our primary approach was to compare bilinguals’ performance to the performance of monolinguals tested in the same lab. This approach has two notable advantages. First, within each lab, bilinguals shared one of their two languages with monolinguals (the language of the wider community). Second, testing procedures were held constant within each lab. Thus, this approach allowed us to minimize procedural confounds with infants’ bilingual status. However, a disadvantage of this approach is that it leaves out data from monolingual infants tested in other labs (since not all laboratories provided data from bilingual infants), which could potentially add precision to the measured effects. Thus, we performed additional analyses comparing all bilinguals to all monolinguals within the same age bins, regardless of the labs each had been tested in.

We tested bilinguals in one of two age windows: 6–9 months, and 12–15 months1. The specific age bins selected were based on apreliminary survey of access to participants of different age ranges across participating laboratories. The choice of non-adjacent age bins also increased the chances of observing developmental differences.

All infants were tested using the same stimuli, which consisted of recordings of North-American English (NAE) accented IDS and ADS. Because of the international nature of this multi-site project, these stimuli were native for some infants but non-native for other infants, both in terms of the language of the stimuli (English), and the variety of infant-directed speech (NAE-IDS is particularly exaggerated in its IDS characteristics relative to other varieties of IDS; see Soderstrom, 2007 for a review). Moreover, the stimuli were produced by monolingual mothers. Thus, infants’ exposure to the type of stimuli used varied from low (monolinguals and bilinguals not exposed to NAE), to moderate (bilinguals learning NAE as one of their two languages), to high (monolinguals learning NAE).

Infants were tested in one of three experimental setups regularly used to test infant auditory preference: central fixation, eye-tracking, and headturn preference procedure. The use of a particular setup was the choice of each lab, depending on their equipment and expertise. Labs that tested both monolinguals and bilinguals used the same setup for both groups. On all setups, infants heard a series of trials presenting either IDS or ADS, and their looking time to an unrelated visual stimulus (e.g., a checkerboard) was used as an index of their attention. In the central fixation setup, infants sat in front of a single screen that displayed a visual stimulus, and their looking times to this visual stimulus while an auditory stimulus was played was coded via button press using a centrally positioned camera. This was similar in the eyetracking setup, except that infants’ looking was coded automatically using a corneal-reflection eye-tracker. In the headturn preference procedure setup (HPP; see Kemler Nelson et al., 1995), infants sat in the middle of a room facing a central visual stimulus. Their attention was drawn to the left or right side of the room by a visual stimulus while the auditory stimulus played, and the duration of their looking to the visual stimulus was measured via button press using a centrally positioned camera.

Research questions

We identified three basic research questions addressed by this study. Note that it was not always possible to make specific predictions given the very limited data on infants’ cross-language preferences for IDS over ADS, and particularly the absence of data from bilingual infants. We also note that the ManyBabies 1 project, focusing on monolingual infants, addresses other more general questions such as the average magnitude of the IDS preference, changes in preference over age, and the effects of methodological variation on IDS preference (ManyBabies Consortium, 2020). The main questions addressed by data from bilingual infants are:

  1. How does bilingualism affect infants’ interest in IDS relative to ADS? As described above, monolingual infants display an early preference for IDS that grows in strength at least through the first year of life. We anticipated that the bilingual experience might result in a different pattern of IDS preference; however, the direction and potential source of any difference is difficult to predict. For example, the more challenging nature of early bilingual environments might induce an even greater preference for IDS over ADS relative to monolinguals. This enhanced preference could be shown across development, or might be observed only at certain ages. On the other hand, given some evidence that parents of bilingual infants produce relatively less IDS than parents of monolingual infants, it may be that bilinguals show less interest in IDS than monolinguals. We also explored the following questions as potential sources for an emerging difference between populations: If an overall difference between monolingual and bilingual infants’ preference for IDS is observed, can this be accounted for by systematic differences in socioeconomic status? Do bilinguals show greater variability in their preference for IDS than monolinguals?

  2. How does the amount of exposure to NAE-IDS affect bilingual infants’ listening preferences? While we expected infants across different language backgrounds to show greater interest in IDS over ADS, we investigated whether this was moderated by the amount of exposure to NAE. For monolinguals, this exposure would be either 100% (monolingual learners of NAE) or 0% (monolingual learners of other languages). For bilinguals, some infants would have 0% exposure to NAE-IDS (e.g., bilingual infants learning Spanish and Catalan) while others would have a range of different exposures (e.g., bilingual infants learning NAE and French). This allowed us to at least partially disentangle dose effects of exposure to NAE-IDS from infants’ bilingualism. An additional possibility is that infants’ exposure to NAE would predict overall attention to both infant-directed and adult-directed NAE, with no differential effects on interest to IDS versus ADS. Finally, it is possible that NAE-IDS is equally engaging to infants regardless of their experience with North American English.

  3. Finally, we had planned to ask how bilingual infants’ listening to NAE-IDS and ADS is impacted by the particular language pair being learned. We intended to ask this question at both the group and at the individual level. At the group level, we planned to investigate whether different patterns of results would be seen in homogeneous versus heterogeneous samples of bilinguals, in terms of overall preference for IDS and group-level variability. However, ultimately we had insufficient homogeneous samples to address this question. At the individual level, we were interested in how the particular language pair being learned modulated infants’ preference for IDS. As we did not know a priori what language pairs would have sufficient sample size for analysis, this was considered a potential exploratory analyses. Ultimately, due to the nature of our main results and the diverse language backgrounds of our final sample, we decided to leave this question open for future investigations.

Disclosures

Preregistration

The accepted Stage 1 version of this manuscript was preregistered at https://osf.io/wtfuq.

Data, materials, and online resources

Study instructions and other details are available at the ManyBabies 1 Bilingual Open Science framework site, https://osf.io/zauhq/, and materials are available via the ManyBabies 1 Open Science Framework site, https://osf.io/re95x/.

Labs submitted anonymized data for central analysis that identified participants by code only. Data and analytic code are available at https://github.com/manybabies/mb1b-analysis-public. Video recordings of individual participants were coded and stored locally at each lab, and where possible were uploaded to a central controlled-access databank accessible to other researchers (https://databrary.org).

Reporting

We report how we determined our sample size, all data exclusions, all manipulations, and all measures in our study.

Ethical approval

This research was carried out in accordance with the provisions of the World Medical Association Declaration of Helsinki. Each lab followed the ethical guidelines and ethics review board protocols of their own institution.

Methods

Participation Details

Our monolingual sample originated from the ManyBabies 1 project (ManyBabies Consortium, 2020). Here we report some basic information about that sample - the reader is referred to the original study for further details - and focus primarily on the bilingual sample.

Time-frame.

An open call for labs to participate was issued on February 2, 2017. Participant testing began on May 1, 2017. Testing for monolinguals ended on April 30, 2018. Because of the additional difficulty of recruiting bilingual samples, the end-date for collection of these data was extended by four months to August 31, 2018. Due to a miscommunication, one lab continued testing data beyond this deadline but prior to data analysis, and these data were included in the final sample.

Age distribution.

Labs contributing data from bilingual infants were asked to test participants in at least one of two (but preferably both) age bins: 6–9 month-olds (6;1 – 9;0) and 12–15 month-olds (12;1 – 15;0). Labs were asked to aim for a mean age at the centre of the bin, with distribution across the entire age window. Some labs chose to test additional infants outside the target age ranges for future exploratory analyses, which were excluded from the current study.

Lab participation criterion.

Considering the challenges associated with recruiting bilingual infants and the importance of counterbalancing in our experimental design, we asked labs to contribute a minimum of 16 infants per age and language group (note that infants who met inclusion criteria for age and language exposure but were ultimately excluded for other reasons counted towards this minimum N). We expected that requiring a relatively low minimum number of infants would encourage more labs to contribute a bilingual sample, and under our statistical approach a larger number of groups is more important than a larger number of individuals (Maas & Hox, 2005). However, labs were encouraged to contribute additional data provided that decisions about when to stop data collection were made ahead of time (e.g., by declaring an intended start and end date before data collection). A sensitivity analysis showed that, with a sample of 16 infants and assuming the average effect size of similar previous studies (Cohen’s d = .70; Dunst et al., 2012; MetaLab, 2017), individual labs would have 74% power to detect a preference for IDS in a paired-samples t-test (alpha = .05, one-tailed). Assuming a smaller effect size of Cohen’s d = 0.60, a conservative estimate of power based on the literature reviewed above, individual labs’ power would be 61%. The moderate statistical power that individual labs would have to detect this effect highlights the importance of our approach to combine data across labs. We note that some labs were unable to recruit their planned minimum sample of 16 bilingual infants that met our inclusion criteria in the timeframe available, a point we will return to later in the paper.

Labs were asked to screen infants ahead of time for inclusion criteria, typically by briefly asking about language exposure over the phone. Despite this screening process, some infants who arrived in the lab for testing fell between the criteria for monolingual and bilingual status based on the comprehensive questionnaire. In such cases, the decision whether to test the infant was left up to individual laboratories’ policy, but we asked that data from any babies who entered the testing room be submitted for data processing (even though some such data might be excluded from the main analyses).

Participants

Defining bilingualism.

Infants are typically categorized as bilingual as a function of their parent-reported relative exposure to their languages. However, studies vary considerably in terms of inclusion criteria for the minimum exposure to the non-dominant language, which in previous studies has ranged from 10% to 40% of infants’ exposure (Byers-Heinlein, 2015). Some bilingual infants may also have some exposure to a third or fourth additional language. Finally, infants can vary in terms of when the onset of exposure to their additional languages is, which can be as early as birth or anytime thereafter. We aimed to take a middle-of-the-road approach to defining bilingualism, attempting to balance a need for experimental power with interpretable data.

Thus, we asked each participating lab to recruit a group of simultaneous bilingual infants who were exposed to two languages between 25% and 75% of the time, with regular exposure to both languages beginning within the first month of life. There was no restriction as to whether infants were exposed to additional languages, thus some infants could be considered multilingual (although we continue to use the term bilingual throughout this manuscript). These criteria would include, for example, an infant with 40% English, 40% French, and 20% Spanish exposure, but would exclude an infant with 20% English, 70% French, and 10% Spanish exposure. We also asked labs to recruit a sample of bilingual infants who shared at least one language – the community language being learned by monolinguals tested in the same lab. For labs in bilingual communities (e.g., Barcelona, Ottawa, Montréal, Singapore), labs were free to decide which community language to select as the shared language. Within this constraint, most labs opted to test heterogeneous groups of bilinguals, for example, English-Other bilinguals where English was the community language, the other language might be French, Spanish, Mandarin, etc. Only one lab tested a homogeneous group of bilinguals (in this case, all infants were learning English and Mandarin), although we had expected that more labs would test homogeneous samples, given both heterogeneous and homogeneous samples are used regularly in research with bilingual infants. Because only one homogeneous sample was tested, we were not able to conduct planned analyses examining the impact of this type of sample on our results. Infants that were tested but did not meet inclusion criteria into the group (for example because they did not hear enough of their non-dominant language, or did not hear enough of the community language) were excluded from the main analyses, but retained for exploratory analyses where appropriate.

Assessing bilingualism.

Each lab was asked to use a detailed day-in-the-life parental interview questionnaire to quantify the percent of time that infants were exposed to each language. This approach has been shown to predict bilingual children’s language outcomes better than a one-off parental estimate (DeAnda, Bosch, Poulin-Dubois, Zesiger, & Friend, 2016). Moreover, recent findings based on day-long recordings gathered using LENA technology show that caregivers can reliably estimate their bilingual child’s relative exposure to each language (Orena, Byers-Heinlein, & Polka, 2020). Labs were also asked to pay special attention to whether infants had exposure to North American English (based on a parent report of the variety of English spoken to their infant), and if so which caregiver(s) this input came from. As most of the labs contributing bilingual data had extensive expertise in bilingual language background assessment, we encouraged each lab to use whatever version of measurement instrument was normally used in their lab (details of the assessment instruments are outlined below, including source references for most measures). Where possible, labs conducted the interview in the parents’ language of choice, and documented whether the parents’ preferred language was able to be used.

While standardization of measurement tools is often desirable, we reasoned that different questions and approaches might be best for eliciting information from parents in different communities and from different cultures. Indeed, many labs reported that their own instruments had undergone considerable refinement over the years as a function of their experience working with the families in their communities. However, in order to maximize the overall sample size and the diversity of bilingual groups tested, we encouraged participation from laboratories without extensive experience testing bilingual infants. Labs that did not have an established procedure were paired with more experienced labs working with similar communities to refine a language assessment procedure. Twelve of the labs administered a structured interview-style questionnaire based on the one developed by Bosch and Sebastián-Gallés (1997, 2001; for examples of the measure see the online supplementary materials of Byers-Heinlein et al., 2019; DeAnda et al., 2016), and the remaining 5 labs administered other questionnaires. We describe each of these approaches in detail below.

The Bosch and Sebastián-Gallés (1997, 2001) questionnaire is typically referred to in the literature as the Language Exposure Questionnaire (LEQ; e.g., Byers-Heinlein, Fennell, & Werker, 2013), or the Language Exposure Assessment Tool (LEAT; DeAnda et al., 2016). Administration of these questionnaires takes the form of a parental interview, where a trained experimenter systematically asks at least one of the infant’s primary caregivers detailed questions about the infant’s language environment. The interviewer obtains an exposure estimate for each person who is in regular contact with the infant, as defined by a minimum contact of once a week. For each of those people, the caregiver gives an estimate of how many hours per day they speak to the infant in each language for each of the days of the week (e.g., weekdays and weekends may differ depending on work commitments). Further, the caregiver is asked if the language input from each regular-contact person was similar across the infant’s life history. If not, such as in the case of a caregiver returning to work after parental leave, or an extended stay in another country, an estimate is derived for each different period of the infant’s lifespan. The interviewer also asks the caregiver about the language background of each person with regular contact with the infant (as defined above), asking the languages they speak and whether they are native speakers of those languages. The caregiver also gives an estimate of language exposure in the infant’s daycare, if applicable. Finally, the caregiver gives a global estimate of their infant’s percent exposure to the two languages, which includes input from those people in regular contact with the infant and other people with whom the infant has less regular contact (e.g., playgroups, friends of caregivers, etc.). Importantly, this global estimate does not include input from television or radio, as such sources have no known positive impact, and may even have a negative impact on monolingual and bilingual language development in infancy (see Hudon, Fennell, & Hoftyzer, 2013). The estimate of an infant’s percent exposure to their languages is derived from the average cumulative exposure based on the data from the primary individuals in the infant’s life. Some labs use the global estimate simply to confirm these percentages. Other labs average the primary and global exposure to take into account all language exposure, while still giving more weight to the primary individuals. Also, some labs asked additional questions, for example about videoconferencing with relatives, whether caregivers mix their languages when speaking to the infant, or caregivers’ cultural background. Finally, while the original form was pen-and-paper, there have been adaptations which include using a form-fillable Excel sheet (DeAnda et al., 2016).

For the other language exposure measures used by 5 of the labs, we will simply highlight the differences from the LEQ/LEAT measure described above, as there is much overlap between all the instruments used to measure infants’ exposure to their languages. Two labs used custom assessment measures designed within each lab. The major difference from the LEQ for the first of these custom measures is that parents provide percentage exposure estimates for each language from primary individuals in the infant’s life, rather than exposure estimates based on hours per day in each language. The other custom measures, unlike the LEQ, specify estimates of language exposure in settings where more than one speaker is present by weighting each speaker’s language contribution. A further two labs used other child language exposure measures present in the literature: one used the Multilingual Infant Language Questionnaire (MILQ; Liu & Kager, 2017b) and the other used an assessment measure designed by Cattani et al. (2014). For the MILQ, one major difference is that parents complete the assessment directly using an Excel sheet with clear instructions. The other major difference is that the MILQ is much more detailed than the LEQ/LEAT: breaking down language exposure to very specific activities (e.g., car time, book reading, meal time); asking more detail about the people in regular contact with the infant (e.g., accented speech, level of talkativeness); and obtaining estimates of media exposure (e.g., TV, music). The measure from Cattani et al. (2014) focuses on parental exposure and uses Likert scales to determine exposure from each parent. The ratings are converted to percentages and maternal exposure is weighted more in the final calculation based on data showing that mothers are more verbal than fathers. Finally, one lab did not use a detailed measure, but rather simply asked parents to give an estimate of the percentage exposure to each of the languages their infant was hearing.

For monolinguals, labs either did the same assessment as with bilinguals, or minimally checked participants’ monolingual status by asking parents a single question: estimate the percent of time that their infant was exposed to their native language. Under either approach, if that estimate exceeded 90% exposure to a single language, the infant was considered monolingual.

Demographics.

Each lab administered a questionnaire that gathered basic demographic data about infants, including age, health history, gestation, etc. Infants’ socioeconomic status (SES) was measured via parental report of years of maternal education. To standardize across different education systems where formal schooling may begin at different ages, we counted the number of years of education after kindergarten. For example, in the United States, mothers who had completed high school would be considered to have 12 years of education.

Final sample.

Our final sample of bilinguals who met our infant-level inclusion criteria included 333 infants tested in 17 labs; 148 were 6–9 months, and 185 were 12–15 months (a full account of exclusions is detailed in the results section). These 17 labs also collected data from monolingual infants (N = 384 who met infant-level inclusion criteria), of whom 181 were 6–9 months, and 203 were 12–15 months. While all analyses required that data meet the infant-level inclusion criteria, some analyses further required that the data met the lab-level inclusion criteria (lab-level inclusion criteria are discussed in the Results section where they were implemented for specific analyses). Data from monolingual infants in these age ranges were available from 59 additional labs (n = 574 6–9 month-olds; n = 463 12–15 month-olds) who did not contribute bilingual data. Bilingual infants and lab-matched monolingual samples tested by each lab are detailed in Table 1. For further description of our participants, please refer to the Appendix, where we list gender distributions across subsamples (Table A1) and the language pairs being learned by bilingual infants (Table A2).

Table 1.

Number of monolingual and bilingual infants in each age group that met infant-level inclusion criteria, average years of maternal education (SES), and average NAE exposure for blingual infants by lab. The table is ordered by NAE exposure. Note that because of lab-level inclusion criteria, cells with n < 10 were excluded from the meta-analyses, but were included in the mixed-effects regression analyses. Labs that only tested monolingual infants are not listed.

institution method 6–9 mo bilingual 6–9 mo monolingual 12–15 mo bilingual 12–15 mo monolingual average years of maternal education bilinguals’ average NAE
Oxford Brookes University singlescreen 17 15 17 16 16.79 0.00
Western Sydney University hpp 9 15 15 15 17.00 0.00
Universite Paris Descartes hpp 10 0 1 16 16.37 0.00
Central European University eyetracking 0 0 14 13 18.09 0.00
University of Liverpool eyetracking 7 18 6 15 17.59 0.00
Ecole Normale Superieure eyetracking 0 0 16 14 16.98 0.00
National University of Singapore eyetracking 26 10 12 10 14.98 0.00
University of Zurich eyetracking 0 0 28 30 15.17 0.00
University of Goettingen singlescreen 9 31 7 15 16.06 0.00
McGill University hpp 0 0 16 11 18.12 28.63
University of Manitoba hpp 7 26 8 16 15.58 47.16
University of British Columbia (Werker) hpp 15 20 0 0 16.33 48.88
Concordia University eyetracking 16 17 18 18 16.69 48.98
Princeton University hpp 15 1 0 0 18.02 49.07
University of California Los Angeles hpp 0 0 9 3 14.86 53.04
University of Ottawa singlescreen 7 17 18 11 18.31 54.79
University of British Columbia (Hamlin) eyetracking 10 11 0 0 16.60 55.69

Materials

Visual stimuli.

Labs using a central fixation or eye-tracking method presented infants with a brightly-coloured checkerboard as the main visual stimulus. A video of a laughing baby was used as an attention-getter between trials to reorient infants to the screen. Labs using the headturn preference procedure used the typical visual stimulus employed in their labs, which was sometimes light bulbs (consistent with the original development of the procedure in the 1980s) or sometimes colourful stimuli presented on LCD screens. All visual stimuli are available via the ManyBabies 1 Open Science Framework site at osf.io/re95x/.

Auditory stimuli.

Auditory stimuli consisted of semi-naturalistic recordings of mothers interacting with their infants (ranging in age from 122–250 days) in a laboratory setting. Mothers were asked to talk about a set of objects with their infant, and also separately with an experimenter. A set of 8 IDS and 8 ADS auditory stimuli of 18 s each were created from these recordings. Details regarding the recording and selection process, acoustic details and ratings from naive adult listeners can be found in the ManyBabies 1 study (ManyBabies Consortium, 2020) and the associated Open Science Framework project at osf.io/re95x.

Procedure

Basic Procedure.

Each lab used one of three common infant study procedures, according to their own expertise and the experimental setups available in the lab: central fixation (3 labs), eye-tracking (7 labs), or headturn preference procedure (7 labs). The testing procedure was identical to that used in the ManyBabies 1 project (ManyBabies Consortium (2020); deviations from the protocol are also described there), and we describe key aspects here.

Infants sat on their parents’ laps or in a high chair, and parents listened to masking music over headphones throughout the study. Infants saw 2 training trials that presented an unrelated auditory stimulus (piano music), followed by 16 test trials that presented either IDS or ADS speech. Trials were presented in one of four pseudo-random orders that counterbalanced the order of presentation of the two stimulus types. Note that within each order, specific IDS and ADS clips were presented adjacently in yoked pairs to facilitate analyses. On each trial, the auditory stimulus played until the infant looked away for 2 consecutive seconds (for labs that implemented an infant-controlled procedure) or until the entire stimulus played, up to 18 seconds (for labs that implemented a fixed trial-length procedure). The implementation of the procedure depended on the software that was available in each lab. Trials with less than 2 seconds of looking were excluded from analyses. Attention-grabbing stimuli were played centrally between trials to reorient infants to the task.

The main differences between the setups were the type and position of visual stimuli presented, and the onset of the auditory stimuli. For central fixation and eye-tracking procedures, infants saw a checkerboard on a central monitor, whose presentation coincided with the onset of the auditory stimuli on each trial. For the headturn preference procedure, the visual stimulus (either flashing light bulbs or a colourful stimulus) played silently on a monitor/bulb in the centre of the room and on one of two side monitors/bulbs, and the auditory stimulus began playing when the infant turned their head towards the side stimulus.

The dependent variable was infant looking time to the visual stimulus during each trial. For eye-tracking setups, looking time was measured automatically via corneal reflection. For central fixation and headturn preference procedure setups, looking time was measured by trained human coders who were blind to trial type, according to the lab’s standard procedures.

Parents completed questionnaires about participants’ demographic and language background either prior to or after the main experiment.

Results

Analysis overview

Data exclusion.

Labs were asked to submit all data collected as part of the bilingual study to the analysis team, and this section focuses on exclusions for infants collected as part of the bilingual sample. The initial dataset contained 501 infants, of which 333 met each of the inclusion criteria, which are detailed below. We note that exclusions were applied sequentially (i.e., percentages reflect exclusions among the remaining sample after previous criteria were applied).

  • Full term. We defined full term as gestation times greater than or equal to 37 weeks. There were 9 (1.80%) infants who were tested but excluded as they were pre-term.

  • No diagnosed developmental disorders. We excluded infants whose parents reported developmental disorders (e.g., chromosomal abnormalities, etc.) or were diagnosed with hearing impairments. There were 2 (0.41%) infants who were tested but excluded for these reasons. Due to concerns about the accuracy of parent reports, we did not plan exclusions based on self-reported ear infections unless parents reported medically-confirmed hearing loss.

  • Age. We included infants in two age groups: 6–9 and 12–15 month-olds. There were 58 (11.84%) infants who were tested in the paradigm, but who fell outside our target ages. Some labs chose to test such infants for future exploratory analyses, knowing they would be excluded from the current paper.

  • Bilingualism. We excluded infants from the sample whose language background did not meet our pre-defined criteria for bilingualism (see above for details). There were 70 (16.20%) infants whose exposure did not meet this criterion. We also excluded 7 (1.93%) additional infants who met this criterion, but who were not learning the community language as one of their languages.

  • Session-level errors. Participants were also excluded based on session-level errors, including 7 infants for equipment error, 3 infant for experimenter error, and 4 infants for outside interference.

  • Adequate trials for analysis. We excluded any individual trial that was reported to be invalid (e.g., fussiness, incorrect stimulus, single instance of parent or sibling interference). A total of 855 (13.98%) trials were affected by such errors. There was 1 infant who did not have any trials left for analysis once such trials were excluded. Next, we excluded any infant who did not have at least one IDS-ADS trial pair available for analysis (N = 7; 2.06%) infants were tested but did not meet these criteria. For infants with at least one good trial pair, we additionally excluded any trial with less than 2 s of looking (n = 876 trials; 16.92% of trials), which was set as a trial-level minimum so that infants had heard enough of the stimulus to discriminate IDS from ADS. As infants did not have to complete the entire experiment to be included, this meant that different infants contributed different numbers of trials. On average, infants contributed 15.70 trials to the analysis.

Data analysis framework.

Our primary dependent variable of interest was looking time (LT), which was defined as the time spent fixating on the visual stimulus during test trials. Given evidence that looking times are non-normally distributed, we log-transformed all looking times prior to statistical analysis in the mixed-effects model (Csibra, Hernik, Mascaro, Tatone, & Lengyel, 2016). We refer to this transformed variable as “log LT”. For the meta-analysis, we analyzed effect sizes computed from raw difference scores, which did not require log transformation. We pre-registered a set of analyses to examine whether monolinguals, heterogeneous samples of bilinguals, and homogeneous samples of bilinguals showed different levels of variability. Unexpectedly, only 1 lab (Table 1) tested a homogenous sample of bilinguals, thus we deviated from our original plan and did not analyze data as a function of whether our bilingual groups were homogenous versus heterogeneous. For the main analyses, we adopted two complementary data analytic frameworks parallel to the ManyBabies 1 project (ManyBabies Consortium, 2020): meta-analysis and mixed-effects regression.

Under the meta-analytic framework, data from each sample of infants (e.g., 6 to 9 month-old bilinguals from Lab 1) was characterized by a) its effect size (here Cohen’s d), and b) its standard deviation. Effect size analyses addressed questions about infants’ overall preference for IDS, while group-based standard deviation analyses addressed questions about whether some groups of infants show higher variability in their preference than others. Note that meta-analyses of intra-group variability are relatively rare (Nakagawa et al., 2015; Senior, Gosby, Lu, Simpson, & Raubenheimer, 2016). Unfortunately, our pre-registration did not account for the eventuality that several labs would contribute very small numbers of infants to certain groups, as each lab had committed to a minimum sample of 16 infants per group. In two cases where a lab contributed data with a single infant in a particular language group, it was impossible to compute an effect size. Thus, we implemented a lab-level inclusion criterion for the meta-analysis such that each effect size was computed only if the lab had contributed at least 10 infants in that particular language group and age. For example, if lab A had contributed 7 bilingual infants between 6- to 9-months and 15 monolingual infants between 6- to 9-months, we only computed the effect size for the monolingual group, but not for the bilingual group. This criterion ensured that each effect size was computed based on a reasonable sample size (i.e., a minimum of 10 infants) and also was consistent with the lab-level inclusion criteria in the ManyBabies 1 study. Because this exclusion criterion was not part of the pre-registration, we also ran a robustness analysis with a looser minimum contribution of 5 infants, which yielded very similar findings (analysis code and results can be found in our Github repository).

An advantage of the meta-analytic approach is that it is easy to visualize lab-to-lab differences. Further, the meta-analytic framework most closely mirrors the current approach for studying monolingual-bilingual differences, which typically compares groups of monolingual and bilingual infants tested within the same lab. We used this approach specifically to test the overall effect of bilingualism and its possible interactions with age on the magnitude of infants’ preference for IDS over ADS. We also compared standard deviations for the bilingual group and monolingual group in a meta-analytic approach. This analysis closely followed Nakagawa et al. (2015).

Under the mixed-effects regression model, trial-by-trial data from each infant were submitted for analysis. Further, independent variables of interest could be specified on an infant-by-infant basis. This approach had the advantage of potentially increasing statistical power, as data are analyzed at a more fine-grained level of detail. As with the meta-analytic approach, this analysis tested the effects of bilingualism and their potential interactions with age. We also investigated whether links between bilingualism and IDS preference were mediated by socio-economic status. Additionally, this approach allowed us to assess how the amount of exposure to NAE-IDS, measured as a continuous percentage, affected infants’ listening preferences. Note that unlike for the meta-analysis, we did not need to apply a lab-level inclusion criterion, which maximized our sample size. Thus, data from all infants who met the infant-level criteria were included in this analysis, resulting in slightly different sample sizes under the meta-analytic and mixed-effects approaches.

Under both frameworks, we used a dual analysis strategy to investigate how infants’ IDS preference is related to bilingualism. First, we examined the lab-matched subset of data from labs that contributed a monolingual and bilingual sample at a particular age. Second, we examined the complete set of data including data from labs that contributed both monolinguals and bilinguals, as well as additional data from labs that only tested monolinguals at the ages of interest as part of the larger ManyBabies 1 project.

Confirmatory analyses

Meta-analytic approach.

This approach focused on the analysis of group-level datasets. We defined a dataset as a group of at least 10 infants tested in the same lab, of the same age (either 6–9 or 12–15 months), and with the same language background (monolingual or bilingual). For analyses of within-group variability, we compared bilingual infants to monolingual infants.

To estimate an effect size for each dataset, we first computed individual infants’ preference for IDS over ADS by 1) subtracting looking time to the ADS stimulus from looking time to the IDS stimulus within each yoked trial pair, and 2) computing a mean difference score for each infant. Pairs that had a trial with missing data were excluded (34.53% pairs in lab-matched dataset, 25.41% pairs in the full dataset), which constituted a total of 20.58% of trials in lab-matched dataset, and 13.78% of trials in full dataset. Note that we expected many infants to have missing data particularly on later test trials, given the length of the study (16 test trials). Then, for each dataset (i.e., combination of lab, infant age group, and whether the group of participants was bilingual or monolingual), we calculated the mean of these difference scores (Md) and its associated standard deviation across participants (sd). Finally, we used the derived Md and sd to compute a within-subject Cohen’s d using the formula dz = Md/sd.

In the following meta-analyses, random effects meta-analysis models with a restricted maximum-likelihood estimator (REML) were fit with the metafor package (Viechtbauer, 2010). To account for the dependence between monolingual and bilingual datasets stemming from the same lab, we added laboratory as a random factor. As part of our pre-registered analyses, we planned to include method as a moderator in this analysis if it was found to be a statistically significant moderator in the larger ManyBabies 1 project - which it was (ManyBabies Consortium, 2020). However, because only 17 labs contributed bilingual data, we deviated from this plan because of the small number of labs per method (e.g., only three labs used a single-screen method).

Effect size-based meta-analysis.

Our first set of meta-analyses focused on effect sizes (dz): how our variables of interest contributed to effect size comparing looking time to IDS versus ADS trials. As a reminder, we ran the analyses in two ways: (i) the analysis was only restricted to the labs that contributed lab-matched data (lab-matched dataset), and (ii) the analysis included all available data labs that tested only monolinguals or only bilinguals at the ages of interest (full dataset).

We initially fit the following model to examine contributions of age and bilingualism to infants’ IDS preference, as well as potential interactions between these variables:

dz~1+bilingual+age+bilingual*age

Bilingualism was dummy coded (0 = monolingual, 1 = bilingual), and age (a continuous variable) was coded as the average age for each lab’s contributed sample for each language group (centered for ease of interpretation).

In the lab-matched dataset, we did not find any statistically significant effects of age (dz = 0.17, CI = [−1.01, 1.36], z = 0.29, p = .775), bilingualism (dz = −0.17, CI = [−0.44, 0.10], z = −1.22, p = .224), or interactions between age and bilingualism (dz = −0.19, CI = [−1.84, 1.46], z = −0.22, p = .822).

Similarly, in the full dataset, we did not find any significant main effects of age, (dz = 0.01, CI = [−0.66, 0.67], z = 0.02, p = .984), bilingualism (dz = −0.10, CI = [−0.29, 0.09], z = −1.03, p = .301), nor a significant interaction between age and bilingualism (dz = 0.01, CI = [−0.93, 0.95], z = 0.03, p = .979).

As bilingualism is the key moderator of research interest in the current paper, here we report the effect sizes of monolingual and bilingual infants separately. In the lab-matched dataset, the effect size for monolinguals was dz = 0.42 (CI = [0.21, 0.63], z = 3.94, p < .001), while for bilinguals the effect was dz = 0.24 (CI = [0.06, 0.42], z = 2.64, p = .008). In the full dataset, the effect size for monolinguals was dz = 0.36 (CI = [0.28, 0.44], z = 9.15, p < .001), while for bilinguals the effect was dz = 0.26 (CI = [0.09, 0.43], z = 2.97, p = .003). In sum, numerically, monolinguals showed a stronger preference for IDS than bilinguals, but this tendency was not statistically significant in the effect size-based meta-analyses. A forest plot for this meta-analysis is shown in Figure 1.

Figure 1.

Figure 1.

Forest plot for the lab-matched dataset, separated by age group. Standardized effect sizes are shown for each lab, with error bars showing 95% confidence intervals. Each lab reported two effect sizes: one for the monolingual group (red triangles) and the other one for the bilingual group (blue circles). Within each age group, points are ordered by the difference between the monolingual and bilingual effect sizes, and this effect size difference is indicated by a black X. Points are scaled by inverse variance (i.e., more precise estimates are denoted by larger shapes). The points in the bottom panel show the global meta-analytic estimate.

Within-group variability meta-analysis.

Our second set of pre-registered meta-analyses examined whether the variability in infants’ preference for IDS within a sample (within-study variability) was related to language background (monolingual vs. bilingual). Note that this question of within-sample heterogeneity is different from questions of between-sample heterogeneity that can also be addressed in meta-analysis (see Higgins & Thompson, 2002; Higgins, Thompson, Deeks, & Altman, 2003 for approaches to between-group variability in meta-analysis). Specifically, the within-group variability meta-analysis approach provides additional insights into how two groups differ in terms of their variances, not merely their mean effect sizes. This approach is useful when the language backgrounds of the infants influence not only the magnitude of infants’ IDS preference, but also the variability of infants’ IDS preference. In the following, the standard deviations measure looking time variability of infants’ preference for IDS over ADS in each language group (either monolingual or bilingual). Again, we report dz, an effect size that measures the magnitude of infants’ preference for IDS over ADS.

Our pre-registered plan was to follow Nakagawa et al. (2014) and Senior et al. (2015), and we further elaborate on this plan here. According to Nakagawa et al. (2015), there are two approaches to run within-group variability meta-analysis: one approach uses lnCV R, the natural logarithm of the ratio between the coefficients of variation, to compare the variability of two groups; a second approach enters lnSD (the natural logarithm of standard deviations) and lnX¯ (the log mean) into a mixed-effect model. When data meet the assumption that the standard deviation is proportional to the mean (i.e., the two are correlated), the first approach should be used, and otherwise, the second approach should be used. Our data did not meet the necessary assumption, therefore we used the second, mixed-effect approach. In the following meta-regression model, the natural logarithm of the standard deviations (lnSD) from each language group is the dependent variable. This dependent variable (group variance) is the log-transformed standard deviation of infants’ preference for IDS over ADS that corresponds to infants’ language group (either monolingual/bilingual). We note that this log transformation is entirely unrelated to the log transformation of raw looking times used in the linear mixed-effects models.

lnSD~1+bilingual+ln(dz)+(bilinguallab)

where dz is the absolute value of dz because we needed to ensure that values entered into the logarithm were positive, bilingual is the binary dummy variable that indicates whether the language group is monolingual or bilingual. Further, we entered a random intercept and a random slope for bilingualism, which were allowed to vary by lab.

In the lab-matched dataset, we did not find statistically significant evidence for bilingualism as a moderator of the differences in standard deviations across language groups, (dz = −0.08, p = .235). Similarly, we also did not find statistical significance for bilingualism in the full dataset, (dz = 0.02, p = .698). In short, we did not find support for the hypothesis that bilingual infants would show larger within-group variability than monolingual infants.

Mixed-effects approach.

Mixed-effects regression allows variables of interest to be specified on a trial-by-trial and infant-by-infant basis. We had anticipated that we would be able to include additional data from labs that aimed to test homogeneous samples (i.e., because we could include infants from these labs who were not learning this homogeneous language pair), but in practice this did not apply as only one lab contributed a homogeneous data set, and that lab did not test additional infants. We were also able to include data from all valid trials, rather than excluding data from yoked pairs with a missing data point as was necessary for the meta-analysis. As under the meta-analytic approach, we ran the models twice, once including only data from labs that contributed lab-matched samples of monolinguals and bilinguals, and once including all available data from 6–9 and 12–15 month-olds.

The mixed-effects model was specified as follows:

DV~IV1+IV2++(subject)+(item)+(lab)

The goal of this framework was to examine effects of the independent variables (IV) on the dependent variable (DV), while controlling for variation in both the DV (“random intercepts”) and the relationship of the IV to the DV (“random slopes”) based on relevant grouping units (subjects, items, and labs). Following recent recommendations (Barr, Levy, Scheepers, & Tily, 2013), we planned to initially fit a maximal random effects structure, such that all random effects appropriate for our design were included in the model. However, we also recognized that such a large random effects structure might be overly complex given our data, and would be unlikely to converge. After reviewer feedback during Stage 1 of the Registered Report review process, we pre-registered a plan to use a “Parsimonious mixed models” approach for pruning the random effects (Bates, Kliegl, Vasishth, & Baayen, 2018; Matuschek, Kliegl, Vasishth, Baayen, & Bates, 2017). However, we found that it was computationally difficult to first fit complex models (i.e., our models had multiple interactions and cross-levels grouping) under the maximal random effects structure and then prune the models using a parsimonious mixed models approach. Further, we note that this was not the approach used in ManyBabies 1, which would make a direct comparison between ManyBabies 1 and the current study difficult. As such, following ManyBabies 1, we fitted and pruned the following models using the maximal random effects structure only (Barr et al., 2013). We fit all models using the lme4 package (Bates, Mächler, Bolker, & Walker, 2015) and computed p values using the lmerTest package (Kuznetsova, Brockhoff, & Christensen, 2016). All steps of the pruning process we followed are detailed in the analytic code on our Github repository. Following a reviewer’s suggestion during Stage 2 review, we checked our models for potential issues with multicollinearity by examining variance inflation factors (VIF) for each model. Variables that have VIF values exceeding 10 are regarded as violating the multicollinearity assumption (Curto & Pinto, 2011). None of our models violated this assumption. Below is a description of our variables for the mixed-effects models:

  • log_lt: Dependent variable. Log-transformed looking time in seconds.

  • trial_type: A dummy coded variable with two levels, with ADS trials as the baseline, such that positive effects of trial type indicate longer looking to IDS.

  • bilingual: A dummy coded variable with two levels, with monolingual as the baseline, such that positive effects of bilingualism reflect longer looking by bilinguals.

  • language: A dummy coded variable with two levels (North American English-learners as the baseline), for whether infants were learning North American English as a native language (i.e., >= 90% exposure to NAE for monolinguals, or >= 25% exposure to NAE for bilinguals).

  • exp_nae: A continuous variable for the percent of time infants heard North-American English.

  • method: A dummy-coded variable to control for effects of different experimental setups, with single-screen central fixation as the reference level.

  • age_days: Centered for interpretability of main effects.

  • trial_number: The number of the trial pair, recoded such that the first trial pair is 0.

  • ses: The number of years of maternal education, centered for ease of interpretation.

Note that in this analysis plan, we have used a concise format for model specification, which is the form used in R. As such, lower-order effects subsumed by interactions are modeled even though they are not explicitly written. For example, the interaction trial_type * trial_num also assumes a global intercept, a main effect of trial type, and a main effect of trial number.

Homogeneity of variance.

We pre-registered a Levene’s test to examine whether monolinguals and bilinguals showed different amounts of variance in their IDS preference. Our analysis focused on the residual variance for monolinguals and bilinguals in the main linear mixed-effects models, in order to partition out variance associated with other factors (e.g., age, method, etc.). The Levene’s test revealed a statistically significant difference in variance between monolinguals and bilinguals for the full samples (p = 0.02) but not the lab-matched samples (p = 0.68). We note that the difference in residual variances between monolingual (variance = 0.24) and bilingual language groups (variance = 0.25) was small, suggesting that the statistically significant Levene’s test for the full samples was mainly driven by a larger sample size, rather than by meaningful differences between monolinguals and bilinguals.

Effects of bilingualism on IDS preference.

We planned a mixed-effects model which was based on the structure of the final model fit for the ManyBabies project, including bilingualism as an additional moderator. Note that because data collection for both projects was simultaneous, we did not know prior to registration what the final model structure for the monolingual-only sample would be (it was expected that pruning of this model would be necessary in the case of non-convergence). The original model proposed for the monolingual-only sample was designed to include simple effects of trial type, method, language (infants exposed vs. not exposed to NAE-IDS), age, and trial number, capturing the basic effects of each parameter on looking time (e.g., longer looking times for IDS, shorter looking times on later trials). Additionally, the model included two-way interactions of trial type with method and with trial number, a two-way interaction of age with trial number, as well as two- and three-way interactions between trial type, age, and language (see ManyBabies Consortium, 2020, for full justification). This model was specified to minimize higher-order interactions while preserving theoretically-important interactions. Note that to reduce model complexity, both developmental effects and trial effects are treated linearly. The planned initial model was:

log It~trial type*method+trial type*trial num+age*trial num+trial type*age*language+(trial type*trial numsubid)+(trial type*agelab)+(method+age*languageitem) (1)

Our analysis plan specified that we would add bilingualism to the fixed effects of the final pruned model that fitted to the monolingual sample. For higher-order interactions in the model, we ensured that we had at least 20 infants per group. For example, for a three-way interaction between bilingualism, language and age, we included at least 20 infants per group: at least 20 infants in the group of 6–9 month-old bilinguals who were not exposed to NAE. We applied the same rules to all other groups.

In our preregistration, we were uncertain as to whether our sample size would support a model with a four-way-interaction of trial type, age, bilingual status, and language. Given our final sample size, we elected to fit our main model without including the four-way interaction effect2. In our main model, we included two fixed three-way interactions: (i) the interaction between bilingualism, age and trial type, and (ii) the interaction between language, age and trial type, as well as other subsumed lower-order interactions.

Regardless of our fixed effect structure, the model included the random slope of bilingualism on lab and item, as well as appropriate interactions with other random factors. Our initial unpruned model was:

log It~trial type*method+trial type*trial num+age*trial num+trial type*age*language+trial type*age*bilingual+(trial type*trial numsubid)+(trial type*age*bilinguallab)+(method+age*language+age*bilingualitem) (2)

After pruning random effects for non-convergence and singularity, the final models for the lab-matched dataset and full dataset were different. The following was the final model for the lab-matched dataset:

log lt~trial type*method+trial type*trial num+age*trial num+trial type*age*language+trial type*age*bilingual+(1subid)+(bilinguallab)+(1item) (3)

In contrast, the final model for the full dataset was:

log lt~trial type*method+trial type*trial num+age*trial num+trial type*age*language+trial type*age*bilingual+(1subid)+(1lab)+(1item) (4)

Overall, the mixed-level analyses in both lab-matched and full datasets yielded similar results (Table 2 and 3). More coefficients were statistically significant in the full dataset, likely due to the larger sample size. Thus, in the following, we focus on the results of the mixed-level model for the full dataset. We found that infants showed a preference for IDS, as indicated by a positive coefficient on the IDS predictor (reflecting greater looking times to IDS stimuli). We did not find any effects of bilingualism on IDS preference nor any interaction effects between bilingualism and other moderators. This finding is consistent with the results of our meta-analysis above.

Table 2.

Linear Mixed Model 1 testing bilingualism effect on IDS in a matched dataset.

Estimate SE t p
Intercept 1.93 0.0744 26 4.01e-19
IDS 0.0932 0.0466 2 0.0503
HPP 0.103 0.0924 1.11 0.283
Single Screen 0.113 0.103 1.09 0.288
Age −0.0273 0.00801 −3.41 0.000675
Trial # −0.0361 0.0026 −13.9 9.58e-33
NAE −0.0594 0.075 −0.792 0.435
Bilingual 0.000324 0.0345 0.00938 0.993
IDS * HPP 0.0165 0.0292 0.566 0.571
IDS * Single Screen 0.00385 0.031 0.124 0.901
Age * Trial # 0.000977 0.00043 2.27 0.0232
IDS * Trial # 0.000636 0.00365 0.174 0.862
IDS * Age 0.0133 0.00608 2.18 0.0293
IDS * NAE 0.0507 0.0261 1.94 0.0523
Age * NAE 0.00651 0.0101 0.646 0.519
IDS * Bilingual −0.0123 0.0237 −0.518 0.604
Age * Bilingual −0.00613 0.00913 −0.671 0.503
IDS * Age * NAE 0.0156 0.00841 1.86 0.0629
IDS * Age * Bilingual −0.00945 0.00782 −1.21 0.227
R2 Conditional 0.317
R2 Marginal 0.0874
N 717
Table 3.

Linear Mixed Model 1 testing bilingualism effect on IDS in a full dataset.

Estimate SE t p
Intercept 1.9 0.0469 40.4 1.1e-60
IDS 0.105 0.0382 2.76 0.00948
HPP 0.19 0.0575 3.31 0.00162
Single Screen 0.243 0.0539 4.51 1.46e-05
Age −0.0292 0.00514 −5.68 1.5e-08
Trial # −0.0373 0.00176 −21.2 3.01e-87
NAE 0.00299 0.0483 0.0618 0.951
Bilingual −0.00587 0.0254 −0.231 0.817
IDS * HPP 0.029 0.0179 1.62 0.105
IDS * Single Screen −0.0204 0.0193 −1.05 0.292
Age * Trial # 0.00105 0.000268 3.9 9.48e-05
IDS * Trial # −0.00238 0.00246 −0.967 0.334
IDS * Age 0.0131 0.00343 3.8 0.000143
IDS * NAE 0.0372 0.0155 2.4 0.0163
Age * NAE 0.00161 0.00659 0.244 0.807
IDS * Bilingual 0.00294 0.0191 0.154 0.878
Age * Bilingual −0.00283 0.00768 −0.369 0.712
IDS * Age * NAE 0.00945 0.00484 1.95 0.0509
IDS * Age * Bilingual −0.00701 0.0063 −1.11 0.265
R2 Conditional 0.361
R2 Marginal 0.11
N 1754

Surprisingly, the fitted model did not show an interaction between infants’ IDS preference and the method used in the lab, a result that is different from the results in the ManyBabies 1 project. However, this finding is likely due to smaller sample sizes in the current paper, as we restricted the analysis to participants at particular ages. Apart from this, our findings were largely consistent with the ManyBabies 1 study. There was a significant and positive two-way interaction between IDS and NAE, suggesting greater IDS preferences for children in NAE contexts. The interaction between IDS and age was also significant and positive, suggesting that older children showed a stronger IDS preference. Finally, we found a marginally significant three-way interaction between IDS, age, and NAE, suggesting that older children in NAE contexts tended to show stronger IDS preference than those in the non-NAE contexts.

Dose effects of exposure to NAE-IDS in bilingual infants.

In this analysis, we tested whether we could observe a dose-response relationship between infants’ exposure to NAE-IDS (measured continuously) and their preference for IDS over ADS.

We decided to conduct this analysis only including data from bilinguals. Our reasoning was that bilingualism status and exposure to NAE-IDS are confounded, as monolinguals’ exposure to NAE will be either near 0% or 100%, while bilinguals’ NAE experience can be either 0% (since not all bilinguals are learning NAE as one of their two languages), or 25–75%. Because the monolingual sample is larger and their NAE exposures are more extreme, their effects would dominate that of the bilinguals in a merged analysis. Therefore, we reasoned that if there is a dose effect, it should be observable in the bilingual sample alone. Finally, although excluding monolingual infants reduced power overall, we decided that given the relatively large sample of bilingual infants, this disadvantage would be offset by the ease of interpretation afforded by restricting the analysis to bilinguals. On average, bilingual infants in our sample were exposed to 20.17% NAE (range: 0 to 75%).

Once again, we based this model on the final pruned monolingual model, substituting the binary measure of exposure to NAE-IDS (language) with the continuous measure of exposure(exp_nae), and including a random slope for exp_nae by item (which was ultimately pruned from the model). After pruning, our model was specified as follows:

log lt~trial type*method+trial type*trial num+age*trial num+tarial type*age*exp nae+(1subid)+(trial typelab)+(1item) (5)

Table 4 contains the details of the results in this model. The main effect of infants’ exposure to NAE (exp_nae) was not significant (β = −0.00067, SE = 0.0012, p = 0.57). This indicates that bilingual infants who were exposed to more NAE did not pay more attention to the NAE speech stimuli than those who were exposed to less NAE. However, the interaction between trial type and exp_nae was significant (β = 0.0023, SE = 0.00081, p = 0.011). That is, bilingual infants who were exposed to more NAE showed stronger IDS preferences, confirming a dose-response relationship between infants’ exposure to NAE and their preference for IDS over ADS (Figure 2) even among bilinguals who are learning NAE as one of their native languages.

Table 4.

Linear Mixed Model testing the effects of exposure to NAE-IDS in bilingual infants.

Estimate SE t p
Intercept 1.91 0.0736 25.9 6.58e-17
IDS −0.00859 0.0618 −0.139 0.89
HPP 0.0879 0.0913 0.963 0.353
Single Screen 0.168 0.111 1.51 0.16
Age −0.0235 0.0104 −2.27 0.0236
Trial # −0.0361 0.00356 −10.1 4.37e-18
EXP_NAE −0.000669 0.00118 −0.565 0.575
IDS * HPP 0.0537 0.0529 1.02 0.331
IDS * Single Screen 0.0278 0.0598 0.465 0.654
Age * Trial # 0.000195 0.00065 0.3 0.764
IDS * Trial # 0.00581 0.00504 1.15 0.251
IDS * Age 0.0062 0.00794 0.781 0.435
IDS * EXP_NAE 0.0023 0.000806 2.86 0.0107
Age * EXP_NAE −5.26e-05 0.000263 −0.2 0.842
IDS * Age * EXP_NAE 0.000205 0.00023 0.891 0.373
R2 Conditional 0.318
R2 Marginal 0.0891
N 333
Figure 2.

Figure 2.

Linear trend between infants’ IDS preference and their percentage of time exposed to North American English (NAE). Blue line indicates a regression model between infants’ IDS preference and their NAE exposure (starting from zero). Red line indicates another regression model of the same relationship with a focus of NAE exposure between 25 to 75%. We note that the y-axis was truncated to highlight the trend such that some individual points are not plotted.

Socio-economic status as a moderator of monolingual-bilingual differences.

Because socio-economic status can vary systematically between monolinguals and bilinguals in the same community, we were interested in whether relationships between bilingualism and IDS preference would hold when controlling for socio-economic status. It is possible that an observed effect of bilingualism on IDS preference could disappear once SES was controlled. Alternatively, it is possible that the effect of bilingualism on IDS preference could only be apparent once SES was controlled. Thus, this analysis was important regardless of an observed relationship between IDS preference and bilingualism in the previous model.

First, we computed descriptive statistics for the two groups. Mothers of the bilingual sample had an average of 16.71 years of education (SD = 2.47, range = 10–26), those of the lab-matched monolingual sample had an average of 16.33 years of education (SD = 2.83, range = 5–28), and those of the full monolingual sample had an average of 16.42 years of education (SD = 2.47, range = 8–25).

Our approach was to add SES as a moderator of our final model for bilinguals. We expected that any effects of socio-economic status could interact with age, thus this model included interactions of trial type, age, and socio-economic status as a fixed effect, as well as the corresponding random slope by item. Based on the potential model detailed above for the bilinguals, our expected ses-mediated model was:

log It~trial type*method+trial type*trial num+age*trial num+trial type*age*language+trial type*age*bilingual+trial type*age*ses+(trial type*trial numsubid)+(trial type*age*bilinguallab)+(method+age*language+age*bilingual+age*sesitem) (6)

After pruning for non-convergence, our final model specifications are listed below. For the lab-matched dataset, the final model was:

log lt~trial type*method+trial type*trial num+age*trial num+trial type*age*language+trial type*age*bilingual+trial type*age*ses+(1subid)+(bilinguallab) (7)

By contrast, the final model of the full dataset was:

log lt~trial type*method+trial type*trial num+age*trial num+trial type*age*language+trial type*age*bilingual+trial type*age*ses+(1subid)+(1lab)+(1item) (8)

In general, across the lab-matched and full datasets (Table 5 and 6), SES did not have a significant effect on infants’ looking time nor did it affect infants’ preference for IDS. However, for the lab-matched dataset only, we found a statistically significant three-way interaction between IDS, age, and SES. Specifically, infants from 6- to 9-month-olds showed stronger IDS preference when they were from higher SES families, but older infants from 12- to 15-month-olds showed similar IDS preference across families with different SES levels. However, this interaction was not observed in the full dataset, raising the possibility that it is a spurious, and arose only in the lab-matched dataset because it is substantially smaller than the full data set.

Table 5.

Linear Mixed Model examining socio-economic status as a moderator of monolingual-bilingual differences SES in the matached dataset.

Estimate SE t p
Intercept 1.91 0.0664 28.8 6.66e-18
IDS 0.132 0.0327 4.05 5.11e-05
HPP 0.12 0.0893 1.34 0.199
Single Screen 0.0943 0.1 0.939 0.359
Age −0.0294 0.00817 −3.59 0.000337
Trial # −0.0326 0.0019 −17.2 1.07e-64
NAE −0.089 0.072 −1.24 0.225
Bilingual 0.0223 0.0279 0.798 0.425
SES −0.00264 0.00516 −0.512 0.609
IDS * HPP 0.0192 0.0303 0.633 0.527
IDS * Single Screen 0.00648 0.0323 0.201 0.841
Age * Trial # 0.00104 0.000445 2.33 0.0199
IDS * Trial # −0.00464 0.00266 −1.74 0.0811
IDS * Age 0.012 0.00625 1.92 0.0551
IDS * NAE 0.0541 0.0277 1.95 0.0509
Age * NAE 0.0118 0.0105 1.13 0.26
IDS * Bilingual −0.0181 0.0248 −0.732 0.464
Age * Bilingual −0.0105 0.00904 −1.16 0.246
IDS * SES 0.00352 0.00453 0.777 0.437
Age * SES −0.000247 0.00169 −0.147 0.883
IDS * Age * NAE 0.0158 0.00874 1.81 0.0711
IDS * Age * Bilingual −0.00495 0.00817 −0.606 0.545
IDS * Age * SES −0.00351 0.00151 −2.33 0.0199
R2 Conditional 0.304
R2 Marginal 0.0879
N 717
Table 6.

Linear Mixed Model 3 examining socio-economic status as a moderator of monolingual-bilingual differences SES in the full dataset.

Estimate SE t p
Intercept 1.9 0.0485 39.3 1.27e-56
IDS 0.112 0.0402 2.79 0.00854
HPP 0.213 0.0602 3.55 0.000843
Single Screen 0.236 0.0544 4.34 3.42e-05
Age −0.0341 0.00546 −6.25 5.02e-10
Trial # −0.0373 0.00187 −20 7.61e-78
NAE −0.0189 0.0498 −0.38 0.705
Bilingual 0.0045 0.0263 0.171 0.864
SES −3.96e-06 0.00364 −0.00109 0.999
IDS * HPP 0.0325 0.0197 1.65 0.0983
IDS * Single Screen −0.0191 0.0206 −0.928 0.353
Age * Trial # 0.00124 0.000285 4.36 1.29e-05
IDS * Trial # −0.00264 0.00262 −1.01 0.313
IDS * Age 0.0115 0.00371 3.1 0.00192
IDS * NAE 0.0284 0.0169 1.68 0.0931
Age * NAE 0.0029 0.00691 0.42 0.675
IDS * Bilingual −0.00558 0.02 −0.279 0.78
Age * Bilingual −0.00326 0.00791 −0.413 0.68
IDS * SES 0.00408 0.00307 1.33 0.183
Age * SES −0.000883 0.00115 −0.766 0.444
IDS * Age * NAE 0.011 0.00513 2.15 0.0319
IDS * Age * Bilingual −0.00391 0.00656 −0.596 0.551
IDS * Age * SES −0.000375 0.000996 −0.376 0.707
R2 Conditional 0.355
R2 Marginal 0.114
N 1754

Exploratory analyses

The relationship between NAE and IDS for bilingual infants who have some exposure to NAE.

In our second confirmatory analysis model (linear mixed model 2), we found that bilingual infants with more exposure to NAE showed stronger IDS preference. However, this initial analysis included a number of bilingual infants who were not exposed to NAE at all (Figure 2). This raises the question of whether the relation between NAE and IDS preference was primarily driven by the infants who were not learning NAE. In the following analysis, we re-ran the pre-registered NAE-IDS model, this time restricting the model to infants who were exposed to NAE between 25% and 75% of the time. After pruning for non-convergence, the final model was:

log lt~trial type*method+trial type*trial num+age*trial num+trial type*age*exp nae+(1subid)+(1lab)+(1item) (9)

Based on 135 infants, the interaction between IDS and NAE exposure was still statistically significant (β = 0.01, SE = 0.00, p = 0.00). This result suggested that a dose-response relationship between infants’ exposure to NAE and their preference for IDS over ADS was not driven by infants living in non-NAE contexts alone (see Table 7 for details of the model).

Table 7.

Linear Mixed Model testing the effects of exposure to NAE-IDS (restricted to bilingual infants living in NAE contexts).

Estimate SE t p
Intercept 1.91 0.168 11.4 5.83e-09
IDS −0.211 0.132 −1.6 0.112
HPP 0.227 0.142 1.6 0.18
Single Screen 0.0942 0.2 0.472 0.663
Age −0.0094 0.0355 −0.265 0.791
Trial # −0.0413 0.00557 −7.42 8.05e-12
EXP_NAE −0.00158 0.00203 −0.781 0.436
IDS * HPP 0.0163 0.0627 0.26 0.795
IDS * Single Screen −0.115 0.0811 −1.42 0.156
Age * Trial # 0.0012 0.000973 1.23 0.219
IDS * Trial # 0.0158 0.00793 1.99 0.0483
IDS * Age 0.0219 0.0304 0.72 0.472
IDS * EXP_NAE 0.00528 0.00182 2.9 0.00384
Age * EXP_NAE −0.000426 0.000653 −0.653 0.515
IDS * Age * EXP_NAE 3.14e-05 0.000578 0.0543 0.957
R2 Conditional 0.362
R2 Marginal 0.119
N 135

General Discussion

The current study was designed to better understand the effects of experience on the tuning of infants’ preference for infant-directed speech (IDS) compared to adult-directed speech (ADS). Bilingual infants’ language experience is split across input in two different languages, which are being acquired simultaneously. Bilinguals and monolinguals may thus differ in their preference for IDS. To explore this question, we used a collaborative, multi-lab (N = 17 labs) approach to gather a large dataset of infants who were either 6–9- or 12–15-months old and growing up bilingual (N = 333 bilingual infants in the final sample, and a lab-matched sample of N = 384 monolingual infants from the same communities). Data were collected as a companion project to ManyBabies 1 (ManyBabies Consortium, 2020), which was limited to infants growing up monolingual. Overall, we found that bilingualism neither enhanced nor attenuated infants’ preference for IDS, with bilinguals showing a similar magnitude and developmental trajectory of IDS preference as monolinguals from age 6 to 15 months.

Although bilingual experience did not appear to moderate infants’ preference for IDS, we found striking evidence that experience hearing North-American English (NAE, the language of our stimuli) contributed to the magnitude of bilingual infants’ IDS preference. Bilinguals with greater exposure to NAE showed greater IDS preferences (when tested in NAE) than those who had less exposure to NAE. This relationship between NAE exposure and IDS preference was also observed in a subsample of bilingual infants all acquiring NAE, but who varied in how much they were exposed to NAE relative to their other native language. These results converge with those from the larger ManyBabies 1 study, which reported that monolinguals acquiring NAE had a stronger preference for IDS than monolinguals acquiring another language. Importantly, our approach provides a more nuanced view of the relationship between NAE and IDS preference, and suggests that there is a continuous dose effect of exposure on preference. Together, our findings have a number of implications for bilingual language acquisition during infancy. In the following, we will first discuss each of our research questions in turn, followed by limitations and implications of our study.

Our first research question asked whether bilingualism affects infants’ attention to IDS relative to ADS. We hypothesized that the complexity of the bilingual infant’s learning experience might lead to greater reliance on/preference for IDS, given that IDS may be viewed as an enhanced linguistic signal. However, this hypothesis was not confirmed. We observed a meta-analytic effect size in the full dataset for monolinguals of dz = 0.36 [CI = 0.28, 0.44] and for bilinguals of dz = 0.26 [CI: 0.09, 0.43]. While monolinguals showed a numerically larger effect size, this difference was not statistically significant in either the meta-analyses or the mixed-effects linear models. Although small differences are still possible, our data generally support the conclusion that bilingual and monolingual infants show a similar preference for IDS over ADS. Specifically, both groups show a preference for IDS at 6–9 months of age, which gets stronger by 12–15 months.

An additional part of our first research question asked whether bilinguals might show more variability than monolinguals in their IDS preference, beyond any differences in the magnitude of the preference. We reasoned that given their diversity of language experiences, bilingual groups may have a higher heterogeneity in terms of their IDS preference compared to monolingual groups (see also Orena & Polka, 2019, for a recent experiment that observed this pattern). Both monolingual and bilingual groups showed high variability. The magnitude of the observed difference in variability was very small. We carried three analyses to compare the variability between the monolinguals and bilinguals. Only one of the three variability analyses (i.e., the Levene’s test with the full dataset) was statistically significant. This statistical significance was mainly driven by the large sample size in the full dataset (N = 1754) because the difference in variability between the monolinguals and bilinguals remained negligible. Thus, our results did not support the idea that bilingual infants show meaningfully more variability in their IDS preference than their monolingual peers.

Given that monolinguals and bilinguals can systematically differ in their socio-economic status (SES), the third part of our first research question asked whether SES might moderate bilingualism effects. Using the years of maternal education as a proxy for SES, we found mixed support for the role of SES in our datasets. In our smaller lab-matched dataset, we found a statistically significant interaction between age, SES, and IDS preference: 6–9-month-olds from higher SES families showed stronger IDS preference than those from lower SES families, whereas 12–15-month-olds showed similar IDS preference regardless of SES. The direction of this effect aligns with other research reporting that children from higher SES families generally receive more language input and/or higher quality input (e.g., engaging in conversations with more lexical diversity, complexity, and structural variations) than children from lower SES families (Fernald, Marchman, & Weisleder, 2013; Hart & Risley, 1995; Hoff, 2006; Tal & Arnon, 2018). Thus, this could suggest that infants from higher SES families may show stronger IDS preference earlier in life as they hear more or higher quality IDS in their daily lives. Further, this positive SES impact may be most beneficial to younger infants whose IDS preference is still developing. However, given that in our larger (full) dataset SES was unrelated to IDS preference in either 6–9- or 12–15-month-olds, this result might be spurious and should be interpreted with caution. Further, it is important to note that our samples (both monolingual and bilingual group) were mainly from higher SES families. Indeed, in the lab-matched dataset, ‘67.79% of children whose mothers had earned at least a bachelor degree after kindergarten. Our samples, therefore, have low variability in infants’ SES, thus this question would be better tested with future studies that have participants from more diverse SES backgrounds.

Our second research question asked whether and how the amount of exposure to NAE would affect bilingual infants’ listening preferences. Given that our stimuli were produced in NAE, we expected that greater exposure to NAE would be linked to greater attention to NAE IDS relative to NAE ADS. Indeed, ManyBabies 1 (ManyBabies Consortium, 2020), which was conducted concurrently with the current study, found that monolinguals acquiring NAE showed a stronger IDS preference than monolinguals not acquiring NAE. However, in the ManyBabies 1 study, exposure to NAE-IDS was a binary variable – either the infants heard only NAE or heard only a different language in their environments. In the current paper, bilinguals provide a more nuanced way to address this question, as bilinguals’ exposure to NAE varied continuously between 25% and 75% (for infants learning NAE as one of their native languages) or was near 0% (for infants learning two non-NAE native languages). We found clear evidence for a positive dose-response relationship between exposure to NAE and infants’ preference for NAE-IDS. This evidence – that bilinguals with more exposure to NAE showed a stronger NAE-IDS preference – was also present when focusing only on bilinguals who were learning NAE as one of their native languages (i.e., those exposed to NAE 25–75% of the time). Importantly, we did not find a similar effect of exposure to NAE on infants’ overall looking. This suggests that the effect of NAE exposure on preference for IDS is a meaningful relationship, rather than an artefact due to the stimuli being presented in NAE. Further studies with stimuli in other languages would be necessary to solidify this conclusion.

Our analyses included both meta-analyses and linear mixed-effects models, which allowed us to compare these two approaches. As our field moves toward more large scale studies of this type, it will be important to determine appropriate standards for analysis. Our meta-analysis allows for better and more direct comparison with prior meta-analyses (e.g., Dunst et al., 2012). However, an important limitation of this approach is that infants’ data is collapsed to a single data point per group, thus obscuring potentially interesting variability. Moreover, because we could not model trial number directly, this average was based on valid adjacent trial pairs, which resulted in many trials being excluded from the analysis. In contrast, the mixed effect models analyzed data at the individual trial level, allowing us to examine how data variability can be explained by moderators at the trial and participant level, which increases statistical power. Our finding of a significant age effect in the mixed models, but not in the meta-analysis, can be attributed to this difference in statistical power. Moving forward, we believe that these complementary approaches each have their place, but that the mixed effect model is preferred as it improves statistical power.

As the first study to recruit and test bilingual infants at such a large scale and at so many sites, we encountered several challenges (see also Byers-Heinlein et al., 2020, for a fuller discussion of challenges in planning and conducting ManyBabies 1). First, several laboratories were not able to recruit the number of bilingual infants they had originally planned. All labs committed to collecting a minimum of 16 bilingual infants per age group. This ended up being unfeasible for some labs within the timeframe available (which was more than a year), in some cases due to a high number of participants not meeting our strict criterion for inclusion as bilingual. This undoubtedly highlights the challenges for labs in recruiting bilingual infant samples, and moreover raises questions about how bilingualism should be defined, and whether it should be treated as a continuous vs. categorical variable (Anderson, Mak, Chahi, & Bialystok, 2018; Bialystok, Luk, Peets, & Yang, 2018; Incera & McLennan, 2018). Second, we had planned to explore the effect of different language pairs on IDS preference. We had expected that some labs would be able to recruit relatively homogeneous samples of infants (i.e., all learning the same language pair), but in the end only one of 17 labs did so (another lab had planned to recruit a homogeneous sample but deviated from this plan when it appeared unfeasible). Thus, we leave the question of the effect of language pair on infants’ IDS preference an open issue to be followed up in future studies. By and large, we believe that our large-scale approach to data collection may in the future allow for the creation of homogeneous samples of infants tested at different laboratories around the world. As such, large-scale and multi-site bilingual research projects provide researchers with a powerful way to examine how the diversity and variability of bilinguals impact their language and cognitive development.

Overall, our finding that bilinguals show a similar preference for IDS as monolinguals reinforces theoretical views that emphasize the similarities in attentional and learning mechanisms across monolingual and bilingual infants (e.g., Curtin, Byers-Heinlein, & Werker, 2011). IDS appears to be a signal that enhances attention in infants from a variety of language backgrounds. Yet, bilingual infants appear to be exquisitely fine-tuned to the relative amount of input in each of their languages. It could have been the case that language exposure has a threshold effect with any regular exposure to NAE enhancing infants’ preference for NAE-IDS, marking it is a highly relevant speech signal. Instead, we observed a graded effect such that the magnitude of bilingual infants’ preference varied continuously with the amount of exposure to NAE. Just as bilingual infants’ relative vocabulary size and early grammar skills in each language are linked to the amount of input in that language (Hoff et al., 2012; Place & Hoff, 2011), the current study shows that the amount of language input may also play an important role in other language acquisition processes. Indeed, an intriguing but untested possibility is that different input-related attentional biases (i.e., IDS preference) across bilinguals’ two languages explain important variability in the early development of bilingual children’s vocabulary and grammar. Future bilingual work can investigate the above possibility to further delineate the interplay between infants’ language input, IDS preference, vocabulary, and grammar skills.

To conclude, the findings of the current study provide a more nuanced view of the development of infants’ preference for IDS than prior studies have allowed. IDS preference develops along a similar trajectory across infants from monolingual and bilingual backgrounds. Importantly, by testing bilingual infants, our results revealed that this IDS preference operates in a dose-response fashion, where the amount of exposure to NAE positively moderated infants’ (NAE-) IDS preference in a continuous way. Our experience highlights the challenges in recruiting and testing bilingual infants, but also reveals the promise of large-scale collaborations for increasing sample sizes, and thus improving the replicability and generalizability of key findings in infant research.

Funding

Individual participating labs acknowledge funding support from: the Natural Sciences and Engineering Research Council of Canada (2011-402470 and 2015-03967); the Social Sciences and Humanities Research Council of Canada Insight Grant (435-2015-1974 and 435-2015-0385); Agence Nationale de la Recherche (ANR-17-EURE-0017 and ANR-10-IDEX-0001-02); Western Sydney University Early Career Researcher Start-up Grant (20311.87608); European Commission (MSCA-IF-798658); a European Research Council Synergy Grant, SOMICS (609819); ERC Consolidator Grant “BabyRhythm” (773202); The Leverhulme Trust (ECF-2015-009); The UK Economic and Social Research Council (ES/L008955/1); Research Manitoba, Children’s Hospital Research Institute of Manitoba, University of Manitoba; ODPRT funds, National University of Singapore; and the National Institute of Child Health and Human Development (R01HD095912).

Appendix

Table A1 Number of monolingual and bilingual infants in each gender group by lab that met infant-level inclusion criteria.

lab monolingual
female
monolingual
male
bilingual
female
bilingual
male
babylabbrookes 18 12 14 20
babylabkingswood 11 19 9 15
babylabparisdescartes1 7 9 5 6
babylabprinceton 1 0 10 5
bllumanitoba 18 24 9 6
cdcceu 8 5 8 6
infantcogubc 8 3 7 3
infantstudiesubc 8 12 9 6
irlconcordia 15 20 16 18
isplabmcgill 5 6 8 8
langlabucla 1 2 5 4
ldlottawa 16 12 14 11
lllliv 17 16 4 9
lscppsl 7 7 7 9
nusinfantlanguagecentre 8 12 24 14
weltentdeckerzurich 14 16 16 12
wsigoettingen 17 29 5 11

Table A2 Number of bilingual infants per unique language pairs

language_pairs n
albanian ; non_nae_english 1
albanian ; swissgerman 1
arabic ; french 5
arabic ; german 1
arabic ; nae_english 2
arabic ; non_nae_english 2
armenian ; french 1
bahasa ; non_nae_english 1
belizean creole ; nae_english 1
bengali ; non_nae_english 1
bosnian ; non_nae_english 1
bulgarian ; german 1
cantonese ; german 1
cantonese ; nae_english 14
cantonese ; non_nae_english 2
dutch ; french 1
farsi ; non_nae_english 2
finnish ; german 1
finnish ; swissgerman 1
french ; georgian 1
french ; german 2
french ; hungarian 2
french ; italian 4
french ; korean 1
french ; lebanese 1
french ; mandarin 1
french ; nae_english 64
french ; non_nae_english 9
french ; persian 1
french ; polish 1
french ; portuguese 2
french ; romanian 1
french ; russian 1
french ; spanish 6
french ; swissgerman 5
french.; kabyle 1
german ; hungarian 1
german ; kurdish 1
german ; lithuanian 1
german ; nae_english 5
german ; non_nae_english 9
german ; polish 2
german ; russian 2
greek ; non_nae_english 2
greek ; swissgerman 1
hebrew ; hungarian 3
hebrew ; nae_english 3
hindi ; non_nae_english 1
hungarian ; italian 1
hungarian ; nae_english 1
hungarian ; non_nae_english 4
hungarian ; russian 2
hungarian ; spanish 1
indonesian ; nae_english 1
indonesian ; non_nae_english 1
italian ; nae_english 1
italian ; non_nae_english 2
italian ; swissgerman 3
japanese ; non_nae_english 3
khmer ; non_nae_english 1
korean ; nae_english 2
malayalam ; nae_english 1
mandarin ; nae_english 7
mandarin ; non_nae_english 44
nae_english ; persian 1
nae_english ; polish 1
nae_english ; punjabi 3
nae_english ; russian 3
nae_english ; spanish 17
nae_english ; swedish 2
nae_english ; swissgerman 1
nae_english ; tagalog 2
nae_english ; telugu 1
nae_english ; urdu 1
nepali ; non_nae_english 1
non_nae_english ; patois 1
non_nae_english ; polish 7
non_nae_english ; portuguese 7
non_nae_english ; punjabi 1
non_nae_english ; russian 1
non_nae_english ; slovenian 1
non_nae_english ; spanish 7
non_nae_english ; swissgerman 5
non_nae_english ; tagalog 2
non_nae_english ; tamil 1
non_nae_english ; turkish 1
non_nae_english ; ukrainean 1
non_nae_english ; urdu 1
non_nae_english ; vietnamese 1
non_nae_english ; welsh 2
non_nae_english ; wu 1
portuguese ; swissgerman 1
romansh ; swissgerman 1
serbian ; swissgerman 1
slowenian ; swissgerman 1
spanish ; swissgerman 6
swissgerman ; turkish 1

Footnotes

Conflicts of Interest

The authors declare that there were no conflicts of interest with respect to the authorship or the publication of this article.

1

Note that ManyBabies 1 also tested 3–6 month and 9–12 month monolingual groups.

2

We did not enter the above-mentioned four-way interaction into our main model, but note that in the more complex model, the four-way interaction was not statistically significant in the matched dataset (β = 0.00, SE = 0.02, p = 0.85) or the full dataset (β = 0.01, SE = 0.01, p = 0.63).

Prior versions

Preprint versions of this manuscript are posted at https://psyarxiv.com/sqh9d/.

References

  1. Anderson J, Mak L, Chahi AK, & Bialystok E (2018). The language and social background questionnaire: Assessing degree of bilingualism in a diverse population. Behavior Research Methods, 50(1), 250–263. 10.3758/s13428-017-0867-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Barr DJ, Levy R, Scheepers C, & Tily HJ (2013). Random effects structure for confirmatory hypothesis testing: Keep it maximal. Journal of Memory and Language, 68(3), 255–278. 10.1016/j.jml.2012.11.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bates D, Kliegl R, Vasishth S, & Baayen H (2018). Parsimonious mixed models. Preprint.
  4. Bates D, Mächler M, Bolker B, & Walker S (2015). Fitting linear mixed-effects models using lme4. Journal of Statistical Software, Articles, 67(1), 1–48. 10.18637/jss.v067.i01 [DOI] [Google Scholar]
  5. Bialystok E, Luk G, & Kwan E (2005). Bilingualism, biliteracy, and learning to read: Interactions among languages and writing systems. Scientific Studies of Reading, 9(1), 43–61. 10.1207/s1532799xssr0901_4 [DOI] [Google Scholar]
  6. Bialystok E, Luk G, Peets KF, & Yang S (2018). Receptive vocabulary differences in monolingual and bilingual children. Bilingualism: Language and Cognition, 13(4), 525–531. 10.1017/S1366728909990423 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bosch L, & Ramon-Casas M (2011). Variability in vowel production by bilingual speakers: Can input properties hinder the early stabilization of contrastive categories? Journal of Phonetics, 39(4), 514–526. 10.1016/j.wocn.2011.02.001 [DOI] [Google Scholar]
  8. Bosch L, & Sebastián-Gallés N (1997). Native-language recognition abilities in 4-month-old infants from monolingual and bilingual environments. Cognition, 65(1), 33–69. 10.1016/S0010-0277(97)00040-1 [DOI] [PubMed] [Google Scholar]
  9. Bosch L, & Sebastián-Gallés N (2001). Evidence of early language discrimination abilities in infants from bilingual environments. Infancy, 2(1), 29–49. 10.1207/S15327078IN0201_3 [DOI] [PubMed] [Google Scholar]
  10. Brito N, & Barr R (2014). Flexible memory retrieval in bilingual 6-month-old infants. Developmental Psychobiology, 56(5), 1156–1163. 10.1002/dev.21188 [DOI] [PubMed] [Google Scholar]
  11. Brito N, Sebastián-Gallés N, & Barr R (2015). Differences in language exposure and its effects on memory flexibility in monolingual, bilingual, and trilingual infants. Bilingualism: Language and Cognition, 18(4), 670–682. 10.1017/S1366728914000789 [DOI] [Google Scholar]
  12. Byers-Heinlein K (2015). Methods for studying infant bilingualism. In Schwieter JW (Ed.), The Cambridge Handbook of Bilingual Processing (pp. 133–154). Cambridge: Cambridge University Press. 10.1017/CBO9781107447257 [DOI] [Google Scholar]
  13. Byers-Heinlein K, Bergmann C, Davies C, Frank MC, Hamlin JK, Kline M, … Soderstrom M (2020). Building a collaborative psychological science: Lessons learned from ManyBabies 1. Canadian Psychology. 10.31234/osf.io/dmhk2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Byers-Heinlein K, & Fennell CT (2014). Perceptual narrowing in the context of increased variation: Insights from bilingual infants. Developmental Psychobiology, 56(2), 274–291. 10.1002/dev.21167 [DOI] [PubMed] [Google Scholar]
  15. Byers-Heinlein K, Fennell CT, & Werker JF (2013). The development of associative word learning in monolingual and bilingual infants. Bilingualism: Language and Cognition, 16(1), 198–205. 10.1017/S1366728912000417 [DOI] [Google Scholar]
  16. Byers-Heinlein K, Schott E, Gonzalez-Barrero AM, Brouillard M, Dubé D, Jardak A, … et al. (2019). MAPLE: A multilingual approach to parent language estimates. Bilingualism: Language and Cognition, 1–7. 10.1017/S1366728919000282 [DOI] [Google Scholar]
  17. Cattani A, Abbot-Smith K, Farag R, Krott A, Arreckx F, Dennis I, & Floccia C (2014). How much exposure to English is necessary for a bilingual toddler to perform like a monolingual peer in language tests? International Journal of Language & Communication Disorders, 49(6), 649–671. 10.1111/1460-6984.12082 [DOI] [PubMed] [Google Scholar]
  18. Choi S, & Bowerman M (1991). Learning to express motion events in English and Korean: The influence of language-specific lexicalization patterns. Cognition, 41(1), 83–121. 10.1016/0010-0277(91)90033-Z [DOI] [PubMed] [Google Scholar]
  19. Cooper RP, Abraham J, Berman S, & Staska M (1997). The development of infants’ preference for motherese. Infant Behavior and Development, 20(4), 477–488. 10.1016/S0163-6383(97)90037-0 [DOI] [Google Scholar]
  20. Cooper RP, & Aslin RN (1990). Preference for infant-directed speech in the first month after birth. Child Development, 61(5), 1584–1595. Retrieved from http://www.jstor.org/stable/1130766 [PubMed] [Google Scholar]
  21. Cooper RP, & Aslin RN (1994). Developmental differences in infant attention to the spectral properties of infant-directed speech. Child Development, 65(6), 1663–1677. 10.2307/1131286 [DOI] [PubMed] [Google Scholar]
  22. Csibra G, Hernik M, Mascaro O, Tatone D, & Lengyel M (2016). Statistical treatment of looking-time data. Developmental Psychology, 52(4), 521–536. 10.1037/dev0000083 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Curtin S, Byers-Heinlein K, & Werker JE (2011). Bilingual beginnings as a lens for theory development: PRIMIR in focus. Journal of Phonetics, 39(4), 492–504. 10.1016/j.wocn.2010.12.002 [DOI] [Google Scholar]
  24. Curto JD, & Pinto JC (2011). The corrected VIF (CVIF). Journal of Applied Statistics, 38(7), 1499–1507. 10.1080/02664763.2010.505956 [DOI] [Google Scholar]
  25. Danielson DK, Seidl A, Onishi KH, Alamian G, & Cristia A (2014). The acoustic properties of bilingual infant-directed speech. The Journal of the Acoustical Society of America, 135(2), EL95–EL101. 10.1121/1.4862881 [DOI] [PubMed] [Google Scholar]
  26. DeAnda S, Bosch L, Poulin-Dubois D, Zesiger P, & Friend M (2016). The Language Exposure Assessment Tool: Quantifying language exposure in infants and children. Journal of Speech, Language, and Hearing Research, 59(6), 1346–1356. 10.1044/2016_JSLHR-L-15-0234 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Dunst C, Gorman E, & Hamby D (2012). Preference for infant-directed speech in preverbal young children. Center for Early Literacy Learning, 5(1), 1–13. [Google Scholar]
  28. Farran LK, Lee C-C, Yoo H, & Oller DK (2016). Cross-cultural register differences in infant-directed speech: An initial study. PLOS ONE, 11(3), 1–14. 10.1371/journal.pone.0151518 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Fennell CT, & Byers-Heinlein K (2014). You sound like Mommy: Bilingual and monolingual infants learn words best from speakers typical of their language environments. International Journal of Behavioral Development, 38(4), 309–316. 10.1177/0165025414530631 [DOI] [Google Scholar]
  30. Fennell CT, Byers-Heinlein K, & Werker JF (2007). Using speech sounds to guide word learning: The case of bilingual infants. Child Development, 78(5), 1510–1525. 10.1111/j.1467-8624.2007.01080.x [DOI] [PubMed] [Google Scholar]
  31. Fernald A (1985). Four-month-old infants prefer to listen to motherese. Infant Behavior and Development, 8(2), 181–195. 10.1016/S0163-6383(85)80005-9 [DOI] [Google Scholar]
  32. Fernald A, Marchman VA, & Weisleder A (2013). SES differences in language processing skill and vocabulary are evident at 18 months. Developmental Science, 16(2), 234–248. 10.1111/desc.12019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Fernald A, Taeschner T, Dunn J, Papousek M, Boysson-Bardies B. de, & Fukui I (1989). A cross-language study of prosodic modifications in mothers’ and fathers’ speech to preverbal infants. Journal of Child Language, 16(3), 477–501. 10.1017/S0305000900010679 [DOI] [PubMed] [Google Scholar]
  34. Frank MC, Bergelson E, Bergmann C, Cristia A, Floccia C, Gervain J, … Yurovsky D (2017). A collaborative approach to infant research: Promoting reproducibility, best practices, and theory-building. Infancy, 22(4), 421–435. 10.1111/infa.12182 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. García-Sierra A, Ramírez-Esparza N, & Kuhl PK (2016). Relationships between quantity of language input and brain responses in bilingual and monolingual infants, 110, 1–17. 10.1016/j.ijpsycho.2016.10.004 [DOI] [PubMed] [Google Scholar]
  36. Glenn SM, & Cunningham CC (1983). What do babies listen to most? A developmental study of auditory preferences in nonhandicapped infants and infants with Down’s syndrome. Developmental Psychology, 19(3), 332–337. 10.1037/0012-1649.19.3.332 [DOI] [Google Scholar]
  37. Graf Estes K, & Hay JF (2015). Flexibility in bilingual infants’ word learning. Child Development, 86(5), 1371–1385. 10.1111/cdev.12392 [DOI] [PubMed] [Google Scholar]
  38. Graf Estes K, & Hurley K (2013). Infant-directed prosody helps infants map sounds to meanings. Infancy, 18(5), 797–824. 10.1111/infa.12006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Hart B, & Risley TR (1995). Meaningful differences in the everyday experience of young american children. Baltimore: Paul H. Brookes Publishing. [Google Scholar]
  40. Hayashi A, Tamekawa Y, & Kiritani S (2001). Developmental change in auditory preferences for speech stimuli in Japanese infants. Journal of Speech, Language, and Hearing Research, 44(6), 1189–1200. 10.1044/1092-4388(2001/092) [DOI] [PubMed] [Google Scholar]
  41. Higgins JPT, & Thompson SG (2002). Quantifying heterogeneity in a meta-analysis. Statistics in Medicine, 21(11), 1539–1558. 10.1002/sim.1186 [DOI] [PubMed] [Google Scholar]
  42. Higgins JPT, Thompson SG, Deeks JJ, & Altman DG (2003). Measuring inconsistency in meta-analyses. BMJ, 327(7414), 557–560. 10.1136/bmj.327.7414.557 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Hoff E (2006). How social contexts support and shape language development. Developmental Review, 26(1), 55–88. 10.1016/j.dr.2005.11.002 [DOI] [Google Scholar]
  44. Hoff E, Core C, Place S, Rumiche R, Señor M, & Parra M (2012). Dual language exposure and early bilingual development. Journal of Child Language, 39(1), 1–27. 10.1017/S0305000910000759 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Hudon TM, Fennell CT, & Hoftyzer M (2013). Quality not quantity of television viewing is associated with bilingual toddlers’ vocabulary scores. Infant Behavior and Development, 36(2), 245–254. 10.1016/j.infbeh.2013.01.010 [DOI] [PubMed] [Google Scholar]
  46. Incera S, & McLennan CT (2018). The time course of within and between-language interference in bilinguals. International Journal of Bilingualism, 22(1), 88–99. 10.1177/1367006916644688 [DOI] [Google Scholar]
  47. Karzon RG (1985). Discrimination of polysyllabic sequences by one- to four-month-old infants. Journal of Experimental Child Psychology, 39(2), 326–342. 10.1016/0022-0965(85)90044-X [DOI] [PubMed] [Google Scholar]
  48. Kemler Nelson DG, Jusczyk PW, Mandel DR, Myers J, Turk A, & Gerken L (1995). The head-turn preference procedure for testing auditory perception. Infant Behavior and Development, 18(1), 111–116. 10.1016/0163-6383(95)90012-8 [DOI] [Google Scholar]
  49. Kitamura C, & Lam C (2009). Age-specific preferences for infant-directed affective intent. Infancy, 14(1), 77–100. 10.1080/15250000802569777 [DOI] [PubMed] [Google Scholar]
  50. Kitamura C, Thanavishuth C, Burnham D, & Luksaneeyanawin S (2001). Universality and specificity in infant-directed speech: Pitch modifications as a function of infant age and sex in a tonal and non-tonal language. Infant Behavior and Development, 24(4), 372–392. 10.1016/S0163-6383(02)00086-3 [DOI] [Google Scholar]
  51. Klein RA, Ratliff KA, Vianello M, Adams RB, Bahník Š, Bernstein MJ, … Nosek BA (2014). Investigating variation in replicability. Social Psychology, 45(3), 142–152. 10.1027/1864-9335/a000178 [DOI] [Google Scholar]
  52. Kovács ÁM, & Mehler J (2009a). Cognitive gains in 7-month-old bilingual infants. Proceedings of the National Academy of Sciences, 106(16), 6556. 10.1073/pnas.0811323106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Kovács ÁM, & Mehler J (2009b). Flexible learning of multiple speech structures in bilingual infants. Science, 325(5940), 611. 10.1126/science.1173947 [DOI] [PubMed] [Google Scholar]
  54. Kuznetsova A, Brockhoff PB, & Christensen RH (2016). Package ‘ImerTest’.
  55. Lewkowicz DJ, & Ghazanfar AA (2006). The decline of cross-species intersensory perception in human infants. Proceedings of the National Academy of Sciences, 103(17), 6771–6774. 10.1073/pnas.0602027103 [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Liu L, & Kager R (2015). Bilingual exposure influences infant VOT perception. Infant Behavior and Development, 38, 27–36. 10.1016/j.infbeh.2014.12.004 [DOI] [PubMed] [Google Scholar]
  57. Liu L, & Kager R (2017a). Is mommy talking to daddy or to me? Exploring parental estimates of child language exposure using the Multilingual Infant Language Questionnaire. International Journal of Multilingualism, 14(4), 366–377. 10.1080/14790718.2016.1216120 [DOI] [Google Scholar]
  58. Liu L, & Kager R (2017b). Perception of tones by bilingual infants learning non-tone languages. Bilingualism: Language and Cognition, 20(3), 561–575. 10.1017/S1366728916000183 [DOI] [PubMed] [Google Scholar]
  59. Ma W, Golinkoff RM, Houston DM, & Hirsh-Pasek K (2011). Word learning in infant- and adult-directed speech. Language Learning and Development, 7(3), 185–201. 10.1080/15475441.2011.579839 [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Maas CJM, & Hox JJ (2005). Sufficient sample sizes for multilevel modeling. Methodology: European Journal of Research Methods for the Behavioral and Social Sciences, 1(3), 86–92. 10.1027/1614-2241.1.3.86 [DOI] [Google Scholar]
  61. ManyBabies Consortium. (2020). Quantifying sources of variability in infancy research using the infant-directed-speech preference. Advances in Methods and Practices in Psychological Science, 3(1), 24–52. 10.1177/2515245919900809 [DOI] [Google Scholar]
  62. Mattock K, Polka L, Rvachew S, & Krehm M (2010). The first steps in word learning are easier when the shoes fit: Comparing monolingual and bilingual infants. Developmental Science, 13, 229–243. 10.1111/j.1467-7687.2009.00891.x [DOI] [PubMed] [Google Scholar]
  63. Matuschek H, Kliegl R, Vasishth S, Baayen H, & Bates D (2017). Balancing Type I error and power in linear mixed models, 94, 305–315. 10.1016/j.jml.2017.01.001 [DOI] [Google Scholar]
  64. Maurer D, & Werker JF (2014). Perceptual narrowing during infancy: A comparison of language and faces. Developmental Psychobiology, 56(2), 154–178. 10.1002/dev.21177 [DOI] [PubMed] [Google Scholar]
  65. MetaLab. (2017). Interactive tools for community-augmented meta-analysis,, power analysis, and experimental planning in language acquisition research. Retrieved from http://metalab.stanford.edu
  66. Molnar M, Carreiras M, & Gervain J (2016). Language dominance shapes non-linguistic rhythmic grouping in bilinguals. Cognition, 152, 150–159. 10.1016/j.cognition.2016.03.023 [DOI] [PubMed] [Google Scholar]
  67. Nakagawa S, Poulin R, Mengersen K, Reinhold K, Engqvist L, Lagisz M, & Senior AM (2015). Meta-analysis of variation: Ecological and evolutionary applications and beyond. Methods in Ecology and Evolution, 6(2), 143–152. 10.1111/2041-210X.12309 [DOI] [Google Scholar]
  68. Newman RS, & Hussain I (2006). Changes in preference for infant-directed speech in low and moderate noise by 4.5- to 13-month-olds. Infancy, 10(1), 61–76. 10.1207/s15327078in1001_4 [DOI] [PubMed] [Google Scholar]
  69. Open Science Collaboration. (2015). Estimating the reproducibility of psychological science. Science, 349(6251). 10.1126/science.aac4716 [DOI] [PubMed] [Google Scholar]
  70. Orena AJ, Byers-Heinlein K, & Polka L (2020). What do bilingual infants actually hear? Evaluating measures of language input to bilingual-learning 10-month-olds. Developmental Science, n/a(n/a), e12901. 10.1111/desc.12901 [DOI] [PubMed] [Google Scholar]
  71. Orena AJ, & Polka L (2019). Monolingual and bilingual infants’ word segmentation abilities in an inter-mixed dual-language task. Infancy, 24(5), 718–737. 10.1111/infa.12296 [DOI] [PubMed] [Google Scholar]
  72. Patihis L, Oh JS, & Mogilner T (2015). Phoneme discrimination of an unrelated language: Evidence for a narrow transfer but not a broad-based bilingual advantage. International Journal of Bilingualism, 19(1), 3–16. 10.1177/1367006913476768 [DOI] [Google Scholar]
  73. Pegg JE, Werker JF, & McLeod PJ (1992). Preference for infant-directed over adult-directed speech: Evidence from 7-week-old infants. Infant Behavior and Development, 15(3), 325–345. 10.1016/0163-6383(92)80003-D [DOI] [Google Scholar]
  74. Petitto LA, Berens MS, Kovelman I, Dubins MH, Jasinska K, & Shalinsky M (2012). The “Perceptual Wedge Hypothesis” as the basis for bilingual babies’ phonetic processing advantage: New insights from fNIRS brain imaging. Brain and Language, 121(2), 130–143. 10.1016/j.bandl.2011.05.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Place S, & Hoff E (2011). Properties of dual language exposure that influence 2-year-olds’ bilingual proficiency. Child Development, 82(6), 1834–1849. 10.1111/j.1467-8624.2011.01660.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Polka L, & Sundara M (2012). Word segmentation in monolingual infants acquiring Canadian English and Canadian French: Native language, cross-dialect, and cross-language comparisons. Infancy, 17(2), 198–232. 10.1111/j.1532-7078.2011.00075.x [DOI] [PubMed] [Google Scholar]
  77. Pye C (1986). Quiché Mayan speech to children. Journal of Child Language, 13(1), 85–100. 10.1017/S0305000900000313 [DOI] [PubMed] [Google Scholar]
  78. Ramírez NF, Ramírez RR, Clarke M, Taulu S, & Kuhl PK (2017). Speech discrimination in 11-month-old bilingual and monolingual infants: A magnetoencephalography study. Developmental Science, 20(1), e12427. 10.1111/desc.12427 [DOI] [PubMed] [Google Scholar]
  79. Ramírez-Esparza N, García-Sierra A, & Kuhl PK (2014). Look who’s talking: Speech style and social context in language input to infants are linked to concurrent and future speech development. Developmental Science, 17(6), 880–891. 10.1111/desc.12172 [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Ramírez-Esparza N, García-Sierra A, & Kuhl PK (2017). The impact of early social interactions on later language development in spanish–english bilingual infants. Child Development, 88(4), 1216–1234. 10.1111/cdev.12648 [DOI] [PubMed] [Google Scholar]
  81. Robertson S, von Hapsburg D, & Hay JS (2013). The effect of hearing loss on the perception of infant- and adult-directed speech. Journal of Speech, Language, and Hearing Research, 56(4), 1108–1119. 10.1044/1092-4388(2012/12-0110) [DOI] [PubMed] [Google Scholar]
  82. Santesso DL, Schmidt LA, & Trainor LJ (2007). Frontal brain electrical activity (EEG) and heart rate in response to affective infant-directed (ID) speech in 9-month-old infants. Brain and Cognition, 65(1), 14–21. 10.1016/j.bandc.2007.02.008 [DOI] [PubMed] [Google Scholar]
  83. Schreiner MS, Altvater-Mackensen N, & Mani N (2016). Early word segmentation in naturalistic environments: Limited effects of speech register. Infancy, 21(5), 625–647. 10.1111/infa.12133 [DOI] [Google Scholar]
  84. Sebastián-Gallés N, Albareda-Castellot B, Weikum WM, & Werker JF (2012). A bilingual advantage in visual language discrimination in infancy. Psychological Science, 23(9), 994–999. 10.1177/0956797612436817 [DOI] [PubMed] [Google Scholar]
  85. Sebastián-Gallés N, & Bosch L (2002). Building phonotactic knowledge in bilinguals: Role of early exposure. Journal of Experimental Psychology: Human Perception and Performance, 28(4), 974–989. 10.1037/0096-1523.28.4.974 [DOI] [PubMed] [Google Scholar]
  86. Senior AM, Gosby AK, Lu J, Simpson SJ, & Raubenheimer D (2016). Meta-analysis of variance: An illustration comparing the effects of two dietary interventions on variability in weight. Evolution, Medicine, and Public Health, 2016(1), 244–255. 10.1093/emph/eow020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Shute B, & Wheldall K (1999). Fundamental frequency and temporal modifications in the speech of British fathers to their children. Educational Psychology, 19(2), 221–233. 10.1080/0144341990190208 [DOI] [Google Scholar]
  88. Simmons JP, Nelson LD, & Simonsohn U (2011). False-positive psychology: Undisclosed flexibility in data collection and analysis allows presenting anything as significant. Psychological Science, 22(11), 1359–1366. 10.1177/0956797611417632 [DOI] [PubMed] [Google Scholar]
  89. Singh L, Fu CSL, Rahman AA, Hameed WB, Sanmugam S, Agarwal P, … Team, the G. R. (2015). Back to basics: A bilingual advantage in infant visual habituation. Child Development, 86(1), 294–302. 10.1111/cdev.12271 [DOI] [PubMed] [Google Scholar]
  90. Singh L, Morgan JL, & Best CT (2002). Infants’ listening preferences: Baby talk or happy talk? Infancy, 3(3), 365–394. 10.1207/S15327078IN0303_5 [DOI] [PubMed] [Google Scholar]
  91. Singh L, Nestor S, Parikh C, & Yull A (2009). Influences of infant-directed speech on early word recognition. Infancy, 14(6), 654–666. 10.1080/15250000903263973 [DOI] [PubMed] [Google Scholar]
  92. Slobin DI (Ed.). (1985). The crosslinguistic study of language acquisition (theoretical issues) (1st ed., Vol. 2). New York: Psychology Press. 10.4324/9780203781890 [DOI] [Google Scholar]
  93. Soderstrom M (2007). Beyond babytalk: Re-evaluating the nature and content of speech input to preverbal infants. Developmental Review, 27(4), 501–532. 10.1016/j.dr.2007.06.002 [DOI] [Google Scholar]
  94. Sundara M, & Scutellaro A (2011). Rhythmic distance between languages affects the development of speech perception in bilingual infants. Journal of Phonetics, 39(4), 505–513. 10.1016/j.wocn.2010.08.006 [DOI] [Google Scholar]
  95. Tal S, & Arnon I (2018). SES effects on the use of variation sets in child-directed speech. Journal of Child Language, 45(6), 1423–1438. 10.1017/S0305000918000223 [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Tardif T (1996). Nouns are not always learned before verbs: Evidence from Mandarin speakers’ early vocabularies. Developmental Psychology, 32(3), 492–504. 10.1037/0012-1649.32.3.492 [DOI] [Google Scholar]
  97. Tardif T, Shatz M, & Naigles L (1997). Caregiver speech and children’s use of nouns versus verbs: A comparison of English, Italian, and Mandarin. Journal of Child Language, 24(3), 535–565. 10.1017/S030500099700319X [DOI] [PubMed] [Google Scholar]
  98. Thiessen ED, Hill EA, & Saffran JR (2005). Infant-directed speech facilitates word segmentation. Infancy, 7(1), 53–71. 10.1207/s15327078in0701_5 [DOI] [PubMed] [Google Scholar]
  99. Trainor LJ, & Desjardins RN (2002). Pitch characteristics of infant-directed speech affect infants’ ability to discriminate vowels. Psychonomic Bulletin & Review, 9(2), 335–340. 10.3758/BF03196290 [DOI] [PubMed] [Google Scholar]
  100. Viechtbauer W (2010). Conducting meta-analyses in r with the metafor package. Journal of Statistical Software, Articles, 36(3), 1–48. 10.18637/jss.v036.i03 [DOI] [Google Scholar]
  101. Werker JF, & Byers-Heinlein K (2008). Bilingualism in infancy: First steps in perception and comprehension. Trends in Cognitive Sciences, 12(4), 144–151. 10.1016/j.tics.2008.01.008 [DOI] [PubMed] [Google Scholar]
  102. Werker JF, & McLeod PJ (1989). Infant preference for both male and female infant-directed talk: A developmental study of attentional and affective responsiveness. Canadian Journal of Psychology/Revue Canadienne de Psychologie, 43(2), 230–246. 10.1037/h0084224 [DOI] [PubMed] [Google Scholar]
  103. Werker JF, Pegg JE, & McLeod PJ (1994). A cross-language investigation of infant preference for infant-directed communication. Infant Behavior and Development, 17(3), 323–333. 10.1016/0163-6383(94)90012-4 [DOI] [Google Scholar]
  104. Werker JF, & Tees RC (1984). Cross-language speech perception: Evidence for perceptual reorganization during the first year of life. Infant Behavior and Development, 7(1), 49–63. 10.1016/S0163-6383(84)80022-3 [DOI] [Google Scholar]

RESOURCES