Abstract
Some autistic children acquire foreign languages from exposure to screens. Such Unexpected Bilingualism (UB) is therefore not driven by social interaction; rather, language acquisition appears to rely on less socially mediated learning and other cognitive processes. We hypothesize that UB children may rely on other cues, such as acoustic cues, of the linguistic input. Previous research indicates enhanced pitch processing in some autistic children, often associated with language delays and difficulties in forming stable phonological categories due to sensitivity to subtle linguistic variations. We propose that repetitive screen-based input simplifies linguistic complexity, allowing focus on individual cues. This study hypothesizes that autistic UB children exhibit superior pitch discrimination compared to both autistic and non-autistic peers. From a sample of 46 autistic French-speaking children aged 9 to 16, 12 were considered as UB. These children, along with 45 non-autistic children, participated in a two-alternative forced-choice pitch discrimination task. They listened to pairs of pure tones, 50% of which differed by 3% (easy), 2% (medium), or 1% (hard). A stringent comparison of performance revealed that only the autistic UB group performed above chance for tone pairs that differed, across all conditions. This group demonstrated superior pitch discrimination relative to autistic and non-autistic peers. This study establishes the phenomenon of UB in autism and provides evidence for enhanced pitch discrimination in this group. Acute perception of auditory information, combined with repeated language content, may facilitate UB children’s focus on phonetic features, and help acquire a language with no communicative support or motivation.
Keywords: Autism, Unexpected Bilingualism, language acquisition, enhanced auditory perception, low-level processing, screen exposure
Lay summary
In this study, we tested pitch discrimination abilities in a group of French-speaking autistic children who have acquired English solely from exposure through screens (UB, unexpected Bilingualism). Compared to their autistic and non-autistic peers, UB-autistic children demonstrated enhanced pitch discrimination abilities.
1. Introduction
Autistic children frequently display language delays and difficulties, which are likely linked to early onset socio-communicative atypicalities inherent to the autism diagnosis (Su et al., 2021). Socio-communicative skills play a central role in early typical language development (e.g. Colonnesi et al., 2010), and, accordingly, autistic children with weaker joint attention abilities are more likely to remain minimally verbal (Luyster et al., 2008; Paul et al., 2008; Wodka et al., 2013; Yoder et al., 2015). However, there is also some evidence that socio-communicative abilities do not always predict language growth in autism (Anderson et al., 2007; Ellis Weismer & Kover, 2015; Kissine et al., 2023), and hence that language acquisition in autism is mediated by communicative interaction to a lesser extent than in typical development. Consistent with this hypothesis is the phenomenon of unexpected bilingualism (UB) in some autistic children. Described in multiple, independent case studies to date, autistic UB children learn foreign languages via non-interactive, self-selected exposure to and engagement with content on the internet, television and computer games (Abd El-Raziq et al., 2023; Kissine et al., 2019; Vulchanova et al., 2012; Zhukova et al., 2021). That is, autistic UB children acquire advanced morpho-syntactic and phonological knowledge through exposure to screen-based media. Ostrolenk et al. (2024) report that some autistic children with an intense interest in letters seem to acquire a foreign language by changing the subtitles of YouTube videos in order to read them. While further research is clearly needed, it is possible that Unexpected Bilingualism may also be based on non-socially mediated exposure to print materials. Strikingly, learning language from screens has repeatedly been documented as ineffective in typically developing children (Kuhl et al., 2003; Sachs et al., 1981). In fact, early exposure to screens may even have deleterious effects in typical development, with high levels of exposure to screens before the age of 3 related to poor language outcomes (Zimmerman & Christakis, 2005).
To date, UB (in autism) is drastically understudied; the empirical literature is limited to a handful of case studies that report advanced and productive morpho-syntactic skills in the language learned from screens. No study so far has investigated UB as a distinct sub-group on the autism spectrum and compared such children to non-autistic peers. The very existence of UB challenges the overspread idea that social interaction is a pre-requisite for language acquisition in autism and raises the question of which less socially mediated mechanisms enable these children to acquire language from screens. A better understanding of the specific characteristics of UB children is thus of paramount importance.
One straightforward hypothesis is that UB children rely heavily on the internal structure of their linguistic input, compensating for reduced socio-communicative bootstrapping. That is, these children acquire language by focusing on the structural properties of the linguistic input from screens to which they are repeatedly exposed. Language learners—typical and atypical alike—use statistical and prosodic cues to parse and segment their linguistic input. Suprasegmental cues such as pitch variation, lengthening and pausing are crucial for word segmentation (Goodsitt et al., 1993; Shukla et al., 2011) and syntactic parsing (de Carvalho et al., 2016). Furthermore, prosodic cues are preferred over purely statistical ones when the two do not overlap (Jusczyk et al., 1999; Soderstrom et al., 2005; Thiessen & Saffran, 2003). For instance, Thiessen and Saffran (2003) found that 9-month-olds used syllable stress for segmentation, ignoring distributional regularities.
There is extensive converging evidence that auditory processing in autism is enhanced, with more frequent absolute pitch, enhanced pitch discrimination, and enhanced pitch perception in autistic versus non-autistic individuals (Bonnel et al., 2010; Eigsti & Fein, 2013; Haesen et al., 2011; Heaton, 2003, 2005; Heaton et al., 2008; Kellerman et al., 2005). A recent meta-analysis of 22 studies (Chen Yu et al., 2022) found small to medium positive effect size for enhanced pitch perception in autism.
Under the revised version of the Weak Central Coherence (WCC) model (Happé & Frith, 2006), a cognitive processing style specific to autistic individuals is characterized by a preference for bottom-up processing of information, with a tendency to focus on local details—including fine-grained acoustic differences—at the expense of global, top-down processing, based on higher level structure and contextual knowledge. Vulchanova et al. (2012) have used WCC to explain the UB profile of EV, a 10-year-old Bulgarian girl who learned German only via television. Vulchanova and colleagues (2012) reported that EV demonstrated enhanced local processing on grammar tasks, block design and object assembly tasks, and argued that a local processing bias combined with the massive exposure to input from television may have contributed to EV’s unexpected bilingualism.
Likewise, according to the Enhanced Perceptual Functioning (EPF) model (Mottron et al., 2006), heightened perceptual functioning in autism results in a focus on local features rather than global configurations. Within the EPF model, the enhanced pitch perception observed in autism is explained by the overdevelopment of low-level perceptual operations(Germain et al., 2019; Mottron et al., 2006; Mottron & Burack, 2001). While WCC posits a deficit in top-down processing, the EPF model attributes local bias to superiority in low-level perceptual operations without global perception being necessarily impaired.
In the context of language processing, Chen et al (2022) argue that the local bias in autism may enhance the extraction of low-level perceptual information (e.g., acoustic cues) at the expense of higher-order information, such as linguistic meaning and social functioning. Heightened attention to low-level properties of the speech stream could then impact language acquisition, especially in social communication environments that require the integration of verbal and nonverbal cues and of the non-linguistic context. Acute auditory perception could have a detrimental effect on language development because the perception of wide range of non-meaningful auditory contrasts may compromise the emergence of stable phonological categories. (Jones et al., 2009) found that enhanced pitch discrimination was linked with a history of language delay; similarly, Eigsti and Fein (2013) reported that better pitch discrimination was associated with both current autism symptomatology and with delays in first word production (a critical language milestone). Additionally, studies on tone languages such as Mandarin Chinese, where pitch changes play a contrastive phonological role, further demonstrate that overly enhanced pitch discrimination may impact the creation of stable phonological categories. There is evidence that in autistic Chinese children, enhanced sensitivity to pitch variation within tone categories might hinder the formation of stable phonological representations (Wang et al., 2017; Yu et al., 2015).
Enhanced auditory perception may prevent autistic children from generalizing across from fine-grained variations in speech input, and hence negatively impact language acquisition. Much of the language content of cartoons, video games, and other screen-based media, occurs in the context of interactions between two or more virtual agents, and thus presents verbal and non-verbal information in context. However, repetitive exposure to this pre-recorded linguistic input, as is common in autistic UB children, may offset the complexity of communicative language. Caregivers and clinicians report that autistic children watch their favorite cartoons or videos—especially preferred segments—hundreds and hundreds of times. The media to which autistic UB children are exposed offer a means for replaying preferred excerpts, and to focus on a specific aspect of the linguistic input. We speculate that, in attending to screen-based linguistic input, some autistic children may abstract away from the socio-communicative aspects and predominantly focus on structural and phonetic properties. Any difficulty in building stable phonological categories, due to heightened attention to non-meaningful acoustic contrasts, could be neutralized by the possibility of replaying—and reproducing—a video, reducing the variability in the input. We propose that heightened auditory perception, combined with invariant and repeated screen-based linguistic input, is the mechanism underlying screen-based language acquisition in UB autism.
Consistent with anectodical caregiver reports of absolute pitch and related strengths in the domain of music in autistic UB children, the first author conducted a case study of three UB children. Findings indicated that each child developed distinct phonological categories for the language spoken around them and the language they learned autonomously and without familial or educational support from screen-based media (Dumont et al., 2022). Acute auditory perception could have facilitated the perception of fine contrasts in the foreign language they heard from screens, allowing them to acquire the corresponding phonological categories.
In sum, the unusual acquisition of language not used in the family context via screen-based exposure in some autistic children could be linked to enhanced perception of fine-grained auditory differences. In this study, we tested pitch discrimination abilities among autistic and non-autistic children, with the prediction of heightened pitch perception in UB children relative to children in the two other groups. We compared performance in a pitch discrimination task among Autistic-UB (AU-UB), Autistic (AU) and Non-Autistic (Non-AU) children raised in a French-speaking linguistic environment. The AU-UB children acquired English exclusively through exposure to screens.
2. Methods
2.1. Participants
We focused on verbal autistic individuals, given challenges in establishing the presence of UB in non-speaking individuals. Participants were 46 autistic (AU; 19 females) children, including 12 AU-UB (3 females), and 46 non-autistic children (Non-AU; 21 females), ages nine to 16 years; see Table 1 for Participant information. Inclusion criteria were a Nonverbal Index of cognitive abilities (NVI) above 70 and the use of French as primary language in the family. Four participants were included despite their NVI being not available. As their Full-Scale IQ was above 70, we could reasonably expect that their NVI would meet the inclusion criterion. All autistic participants received a clinical diagnosis of autism from multidisciplinary teams (composed of medical doctors, speech therapists, psychologists, and social workers) specialized and officially licensed in diagnosing autism by the Belgian State. Participants were recruited by first and second authors for a larger study on sex differences in autism and statistical learning in autism. While our primary focus was not explicitly directed towards identifying UB within the autistic cohort, it is noteworthy that a subset of participants exhibiting this unique profile spontaneously emerged within our sample.
Table 1.
Participant information
| AU-UB | AU | Non-AU | one-way ANOVAs | ||||||
|---|---|---|---|---|---|---|---|---|---|
|
| |||||||||
| N1 | Mean (SD) [Range] | N | Mean (SD) [Range] | N | Mean (SD) [Range] | AU-UB vs AU | AU-UB vs Non-AU | AU vs Non-AU | |
|
| |||||||||
| Age (months) | 12 | 151.36 (22.60) [118–187] | 34 | 151.05 (27.98) [109–203] | 45 | 146.86 (26.23) [110–203] | ns | <.001 | <.001 |
| SES (1–20) | 9 | 11.55(3.76) [6–17] | 30 | 10.19 (2.58) [6–17] | 37 | 12.94 (3.14) [7–18] | <.001 | <.001 | <.001 |
| SCQ | 8 | 20.25 (7.42) [8–33] | 33 | 21.40 (5.27) [11–33] | 37 | 3.13 (2.85) [0–10] | <.001 | <.001 | <.001 |
| WISC-V NVIQ score | 12 | 92 (14.93) [74–124] | 31 | 100 (17.15) [70–134] | 44 | 109 (13.20) [78–131] | <.001 | <.001 | <.001 |
| WISC-V FSIQ score | 12 | 86 (15.13) [66–121] | 33 | 98(18.23) [64–129] | 44 | 109(12.44) [73–129] | <.001 | <.001 | <.001 |
| French Core language score (CELF-V) | 12 | 74 (18.62) [46–114] | 33 | 82 (18.99) [44–125] | 44 | 101 (11.95) [70–127] | <.001 | <.001 | <.001 |
| English Core Language score (CELF-V)2 | 10 | 65 (18.21) [45–100] | |||||||
Given missing data from some participants, sample sizes vary from one measure to another. Data were missing because parents did not complete or did not return the questionnaires they received. NVI scores are missing for the participants who had a recent FS-IQ assessment that did not include the two subsets needed to calculate the NVI. Lastly, for one participant, the scores of the French version of the CELF were lost due to experimenter error.
Two AU-UB children refused to complete the CELF-V in English; mastery of English (advanced syntactical and lexical knowledge) was assessed by the first author in informal conversation with the child.
Note. UB=Unexpected Bilingualism; SES=socioeconomic status; SCQ=Social Communication Questionnaire; WISC-V NV = Non-Verbal Index of Weschler Intelligence Scale for Children-5th Edition; WISC-V FSIQ = Full Scale IQ of Weschler Intelligence Scale for Children-5th Edition; CELF = Communication of the Evaluation of Language Fundamentals-5th version.
2.2. Measures
Parents completed the French version of the Social Communication Questionnaire (SCQ; (Rutter, Bailey, et al., 2003) – Lifetime version, a 40-item questionnaire probing autism symptoms across the lifespan. Autism was ruled out in the non-autistic group using the SCQ, with scores below the threshold of 15. One participant in the Non-AU group was excluded because of an SCQ score above this cutoff.
Parents responded to a questionnaire regarding language development (age of first words and first phrases), language production (languages used by the child), language input (languages used by caregivers, siblings, exposure at school) and media exposure (amount of screen time, type of activities). Given the novelty of the phenomenon of UB, there is neither a commonly accepted definition nor established inclusion criteria. Participants were included in the AU-UB group if, in this questionnaire, parents reported the use of English in French-speaking households where English was not in daily use or taught at school. To further ensure that the knowledge of English was genuinely productive and not limited to echolalic productions of media input, we administered a formal language assessment in English, using the English version of the CELF-5 (Wiig et al., 2013). Twelve out of the 48 autistic participants were included in the UB group. For six of these AU-UB children, parents reported that their child produced first words in English only, or along with French words; the other six parents reported that the child’s first words were in French, with English emerging between one to six years of age. Note that many of these parents expressed some discomfort with their child’s production in English, because the parents themselves were not fluent or comfortable speaking in English; many asked for clinical guidance about whether they should discourage the child’s use of this less-useful communicative modality.
Core language abilities in French were also assessed using the Communication Evaluation of Language Fundamentals-5th version (Wiig et al., 2019), CELF-F). To avoid testing bias, the French and the English versions of the CELF were administered by different examiners (first author for English, and second author for French). A Nonverbal Index of cognitive abilities (NVI) was assessed using the Weschler Intelligence Scale for Children-5th Edition (WISC-V;Weschler, 2014), yielding age-based standardized scores.
Parents reported on sociodemographic characteristics on a questionnaire adapted and translated to French from the Family Affluence Scale (Torsheim et al., 2016). This measure, which served as a proxy of socioeconomic status, captures educational attainment on a 0- to 6-point scale, ranging from 0 (indicating no primary school achievement) to 6 (representing a doctoral degree), and economic status on a 0- to 13-point scale, based on indicators such as ownership of assets (e.g., car, dishwasher, frequency of vacations, etc.) with 0 indicating very low economic status, and 13 indicating very high economic status. The education and income (status) scores are added to form a composite measure of socioeconomic status (SES).
2.3. Pitch discrimination task
The pitch discrimination task, adapted from Eigsti & Fein (2013), was programmed in E-Prime 3 and presented on an ASUS ROG RIX laptop via Bose Quietcomfort 45 headphones. Participants were asked to listen to 112 pairs of tones and to make a same/different judgement in a two-alternative forced-choice task. If the participants judged that the pairs were identical, they had to press on the key of the keyboard with a green sticker; if the pairs were judged to be different, they had to press on the key of the keyboard with a red sticker. Tones were 100 ms long, with a 1000 ms inter-stimulus interval and a 500 ms inter-trial interval. Participants first completed a training block of 16 trials with feedback (at 4% and 1% difference between tones), and then completed three 36-trial blocks at increasing difficulty levels: Easy (in which pairs could differ by 3% Hz), Medium (with a 2% difference), and Hard (1% difference). Each block comprised 36 pairs of tones, 18 of which (50%) were identical. Tones were 500, 750, 1000 or 1500 Hz, chosen to be outside of the typical range for human speech. For additional details, see Eigsti and Fein (2013).
2.4. Data analytic plan
All analyses were implemented in R (R Core Team, 2024), using the psycho, lme4 and emmeans packages. Prior to analysis, anticipatory responses (RT < 70 ms) and trials with a response slower than 7s were removed; these trials comprised < 1% of the data.
Mean accuracy was computed for each group and type of tone (same vs. different) to assess general group differences. Subsequently, we examined participants’ discrimination abilities and response biases using Signal Detection Theory (Macmillan, 2002). The d’ and c-statistic were calculated using the dprime function of the psycho package in R. Initially, we determined Hits, False Alarms, Correct Rejections, and Misses. Hits were defined as correctly discriminating two different tones, while False Alarms referred to mistakenly identifying identical tones as different. Correct Rejections were instances of correctly identifying two identical tones, and Misses were instances of incorrectly rejecting two different tones. D’ serves as a valuable metric, indicating the separation between the signal and noise distributions. D’ values close to 0 suggest performance at chance levels, whereas values above 1 are considered evidence of reliable discrimination. Subsequently, we computed the c-statistic to explore any potential response bias. A c-statistic value of 0.5 suggests no bias, while values deviating from 0.5 indicate a bias towards responding “different” (c > 0.5) or “same” (c < 0.5).”
Accuracy was subsequently modeled using stepwise forward comparisons of multilevel logistic models. In contrast to previous studies where d’ was used as the dependent variable, we opted to utilize raw accuracy. This decision aimed to minimize loss of statistical power, as calculating d’ involves transformation and loss of information. This ensured a more direct representation of participants’ performance and maximized sensitivity to detect group differences and subtle variations in task performance. The baseline model included a random intercept for subject and age as fixed factors; age was centered given the wide age range of the sample and group differences in age. Subsequent models were iteratively expanded by adding predictor variables: Group (AU-UB, AU, Non-AU), Stimulus Type (same/different pairs) and Difficulty level (Easy, Medium, Hard), as well as their interactions. The selection of each model was guided by model fit indices, including AIC (Akaike information criterion) and BIC (Bayesian Information Criterion), as well as the significance of predictors. The AIC and BIC were used to balance goodness of fit with model complexity, aiming to identify the model that provided the best trade-off. Additionally, significance testing helped ensure that selected predictors contributed meaningfully to the model. The best-fitting model was used for subsequent pairwise Tukey comparisons, implemented with the emmeans package.
There were no differences in accuracy as a function of sex (female/male) or SES. As we had no specific predictions for these variables, they were not included in further analyses.
3. Results
3.1. Psychometric measures and autistic symptomatology
One-way ANOVAs showed significant group differences for age, Full-Scale IQ, Non-verbal IQ, and French language assessments. AU participants were significantly older than Non-AU participants. AU-UB participants had significantly lower Full-Scale IQ and non-verbal IQ scores and lower language performance in French than the Autistic (AU) and Non-Autistic (Non-AU) groups. Furthermore, AU-UB English CELF scores were significantly lower than French CELF scores. The AU-UB group’s Core Language scores in English were extremely heterogeneous, with scores ranging from 45 – 100. As expected, the Non-AU had lower SCQ scores than both AU groups. Furthermore, SCQ scores were significantly lower score in the AU-UB group compared to the AU group, suggesting weaker socio-communicative signs of autistic symptomatology in AU-UB participants.
3.2. Pitch discrimination results
Looking at descriptive statistics in Table 2, mean group accuracy revealed that AU-UB were more accurate in both “Same” and “Different” pairs of pure tones than AU and Non-AU. Higher d’ in the AU-UB indicates a relatively higher sensitivity in discriminating between signal and noise. In all group, the c-statistic indicate a bias toward answering “same”, with the AU-UB exhibiting a stronger “same” bias than AU and Non-AU. However, taking the d’ and c together, the results suggests that while all groups tend to perceive all pairs as the same, the UB group is better able to effectively discriminate between signal and noise, resulting in higher sensitivity.
Table 2.
Mean raw accuracy and standard error as a function of group and stimulus pairings (same, different), and d’ and c means and standard errors.
| Accuracy | Signal Detection Theory | |||
|---|---|---|---|---|
|
| ||||
| Same | Different | d’ | c | |
| AU-UB | 0.76 (0.18) | 0.69(0.19) | 1.52(1.31) | 0.14(0.36) |
| AU | 0.73(0.18) | 0.53(0.21) | 0.87 (1.06) | 0.35(0.48) |
| Non-AU | 0.74(0.14) | 0.54(0.18) | 0.83(0.81) | 0.30(0.38) |
For inferential statistics, stepwise forward model comparisons, revealed that the best fitting model included effects of Group, Type and Difficulty, as well as Type x Difficulty and Group x Type interactions, χ2(2) = 10.66, p = .004.; see Table 1 in Supplementary Material.
Looking at the significant Type x Difficulty interaction from the best fitting model (m5, Table 2), post-hoc pairwise Tukey-adjusted comparisons showed that, across groups, accuracy differed across each pairing of conditions: Accuracy of different pairs was higher in the Easy than in the Medium (β = .35; SE =.08; p <.001) and Hard (β =.88, SE =.08 p < .001) conditions, and in the Medium than in the Hard condition (β =.53, SE =.08, p < .001). For identical tone pairs, accuracy was significantly higher in the Easy condition than in the Medium (β = .30, SE = .09, p = .002) and Hard (β = .46, SE =0.09, p < .001) conditions; however, for identical pairs, accuracy in the Medium and Hard conditions did not differ (p = .14). In all conditions, participants responded with greater accuracy to identical tone-pairs relative to different tone-pairs (Easy: β = −.49, SE =.09; Medium: β =−.69, SE = .09; Hard: β =−1.06, SE = .09; all p’s < .001), reflecting a bias towards ‘same’ responses.
Turning to the significant Group x Type interaction: as shown by the fitted 95% confidence intervals in Figure 1, only the UB group performed above the 50% chance level with different tone-pairs. Post-hoc comparisons showed that, for different tone-pairs, children in the AU-UB group had higher accuracy than children in either the AU (β = .80, SE =.27, p = .009) or the Non-AU (β = .71, SE = 0.26, p = .01) groups. The AU and Non-AU groups did not differ in accuracy for different tone-pairs and there was no group difference in accuracy for same tone-pairs (all p’s > .05).
Figure 1.

Fitted accuracy and 95% CI by group and stimulus type.
3.3. Effects of language performance and bilingualism
We performed an a posteriori check to ensure that group effects were not due to confounding variables such as French (first language) abilities or “expected bilingual” status. We first replaced the group variable in our best-fitting model with French CELF-5 scores. Results indicated that French CELF-5 scores did not predict accuracy (p = .73). To test whether “expected bilingual” status accounted for differences in pitch performance, any autistic child who was consistently exposed to a second language via live human interaction (e.g., with a caregiver) before age three years was coded as bilingual. An analysis compared pitch discrimination by group for Autistic-Bilingual, Autistic-Monolingual and Non-Autistic groups. Post-hoc comparisons revealed no group differences in accuracy, all p’s > .60.
4. Discussion
This study is the first to compare a group of autistic children with Unexpected Bilingualism (UB) to their autistic and non-autistic peers. The very presence of the largest (to date) group of autistic children with un-taught second language abilities establishes that UB is not limited to isolated and exotic case studies, but is a genuine phenomenon, present in 25 % of our total sample of autistic participants. By simply recruiting French-speaking non-autistic (Non-AU) and autistic participants (AU), we identified 12 Autistic-UB (AU-UB) out of 48 autistic children. The selection was based on a parental questionnaire reporting some knowledge of English, without any formal learning or exposure at home, and further established via formal assessment of English language skills.
Understanding the mechanisms that drive UB is crucial; the capacity to acquire fluent language skills via screen-based exposure suggests that language acquisition could, in some cases, be less mediated by socio-communicative abilities. With the exception of the case study by Vulchanova et al. (2012), there is currently no study describing the mechanisms underlying such intriguing language acquisition profiles. Vulchanova and colleagues (2012) framed their study in terms of the Weak Central Coherence Hypothesis (Happé & Frith, 2006). Our hypothesis, aligning with both the Weak Central Coherence and the Enhanced Perceptual Functioning hypotheses, was that autistic UB children would exhibit enhanced performance in pitch discrimination. Both models predict that heightened perceptual processing, such as pitch discrimination, may be due to enhanced sensitivity and attention to low-level perceptual information.
In addition to establishing the phenomenon of UB in the largest sample to date of autistic children, the primary finding was of superior pitch discrimination abilities in the AU-UB children as compared to both AU and Non-AU participants. The AU-UB children exhibited better accuracy in detecting different pairs of tones across varying difficulty levels, while AU and Non-AU participants performance did not differ from chance. No group differences were found for the pairs of pure tones that were identical; accuracy was very high overall for identical pairs (Means = 0.76, 0.73, 0.74 for AU-UB, AU and Non-AU groups). In all groups, the c-statistic from signal detection theory showed a bias toward “same” responses, potentially accounting for the lack of group differences for same pairs of pure tones.
Enhanced pitch discrimination ability in autistic UB children carries significant implications for understanding their language acquisition profiles. Pitch discrimination has been linked to both language acquisition deficits and current autistic symptomatology. Jones et al. (2009) did not find any difference between autistic and non-autistic group at the group level but identified a subgroup of autistic participants with heightened auditory perception and a history of delayed language onset. Bonnel et al. (2010) found that only an autistic group, and not participants with the Asperger syndrome diagnosis who had no history of language or cognitive delays, showed enhanced pitch discrimination. Eigsti & Fein (2013) also reported enhanced pitch discrimination in autistic children, but not in those who were diagnosed as children but no longer met criteria for autism.
Similar to reports from Jones et al. (2009), Eigsti and Fein (2013), and Bonnel et al. (2010), the current study revealed enhanced pitch discrimination in a particular subgroup of autistic children. The subgrouping variable in our sample was not based on language history (as in Jones et al., 2009 and Bonnel et al., 2010) or current autistic symptomatology (as in Eigsti & Fein, 2013), but rather on a unique bilingual language profile. Here, results indicated that the autistic participants with self-taught bilingual language skills displayed significantly better auditory discrimination relative to autistic participants without UB or non-autistic comparison participants. Note, however, that these results are quite compatible with previously reported links between enhanced auditory processing and language delays; children in the AU-UB group, who displayed heightened pitch perception, also displayed overall lower scores in the language of their linguistic community (French) compared to their autistic peers without the unexpected bilingual profile. Note, however, that overall linguistic skills did not predict pitch discrimination in the totality of our sample; this non-replication of prior findings may reflect the choice of raw accuracy data instead of d’ as the dependent variable.
According to both the Enhanced Perceptual Functioning and Weak Central Coherence theories (Happé & Frith, 2006; Mottron et al., 2006), autistic individuals have a unique cognitive style that prioritizes lower-level perceptual processing. Enhanced auditory processing is plausibly one manifestation of this cognitive style, which, in variable communication environments, may lead to overly detailed phonological representations and thereby to delays in language acquisition. However, in the case of screen-based exposure, the variability of the linguistic input is likely offset by one’s ability to replay a video segment over and over again. Repeated exposure to a constrained linguistic input set may allow UB children to attend exclusively only the structural, phonetic properties, and reducing the stress of social interaction. The possibility of acquiring linguistic structures outside a direct interactional frame is further supported by evidence on language acquisition in culture where children tend to be not directly spoken to by adults (Foushee & Srinivasan, 2024) and raises crucial questions about language learnability (Kissine, 2020).
Acute perception in auditory information, such as pitch, vowel contrasts, or relative formant, could also drive a child’s interest in language learning in and of itself. Restricted and repetitive idiosyncratic interests are central to the autism diagnosis (American Psychiatric Association, 2013). Intense interests often incorporate a sensory dimension (Uljarević et al., 2022), with television and YouTube videos being one of the most common interests in autistic children (Nowell et al., 2021), including those with age-typical cognitive skills (Klin et al., 2007). The local bias observed in autism has also been linked to an advantage in fragmented, featural learning, and may result in a tendency to seek configural patterns, as found in calendar calculations, mathematics, memorization — and language (Klin et al., 2007; Mottron et al., 2021). Many parents of children in our AU-UB sample reported that their child watched content not only in English, but also in Russian, Spanish or Japanese. While it was not possible to assess their abilities in these languages, we speculate that enhanced performance in auditory discrimination and the bias towards low-level processing underlies the children with the AU-UB profile to seek out different languages in order to “crack the code.”
To expand our comprehension of Unexpected Bilingualism, future research should focus on higher-level auditory discrimination tasks to thoroughly investigate auditory processing profiles in Autistic-UB individuals. For example, more complex auditory stimuli and tasks evaluating higher-order auditory features such as speech intonation, prosodic processing, or phonemic contrasts, would provide a broader perspective on the local-processing bias. A clear limitation of our study is the absence of audiometric data: a baseline of hearing sensitivity would further allow comparing pitch discrimination relative to auditory threshold levels. In this exploratory study, we did not assess other relevant perceptual low-level auditory skills (such as temporal auditory processing). Future studies should recruit larger samples of Autistic-UB children to investigate potential links between pitch discrimination abilities and the age of language onset (see Eigsti & Fein, 2013). Relatedly, it would be important to include younger children to better understand how pitch discrimination may interact with language trajectories in this group. A longitudinal study tracking media use, preferred contents and the watching habits of children with the unexpected bilingualism profile, would illuminate the processes underlying this type of language acquisition. Studies utilizing long-format recording tools (e.g., LENA recorders) should also contrast language input in UB children via screens, communicative interactions, and overheard speech.
By investigating UB and understanding the underlying mechanisms and cognitive profiles, we can move closer to providing evidence-based advice to caregivers of autistic UB children, who do not know whether they should encourage the exposure to screens in foreign languages. More broadly, this research informs theoretical models of language acquisition.
Supplementary Material
Acknowledgements:
We are grateful to all the participating children, parents, and schools that collaborated with us.
Funding:
The project was supported by both Margueritte-Marie Delacroix and Fonds de La Recherche Nationale. Inge-Marie Eigsti was supported by NIH NIDCD 1R01DC021564–01 and NIMH R01MH112687–01A1.
Footnotes
Conflict of Interest: The authors declare that they have no conflict of interest.
Ethics approval: Ethical approval was received for the study from the Erasme-ULB ethics committee in accordance with the Declaration of Helsinki. Participants’ parents signed a written consent for their children to be enrolled in this study after being informed of their rights and all aspects of the experimental design. Written consent was obtained from children enrolled in this study.
Data availability statement:
De-identified data and R scripts are available from the corresponding author on request.
References
- Abd El-Raziq M, Meir N, & Saiegh-Haddad E (2023). Lexical skills in children with and without autism in the context of Arabic diglossia : Evidence from vocabulary and narrative tasks View supplementary material Lexical skills in children with and without autism in the context of Arabic diglossia : Evidence from vocabulary and narrative tasks. 1–25. 10.1080/10489223.2023.2268615 [DOI]
- American Psychiatric Association. (2013). Diagnostic and statistical manual of mental disorders : DSM-5. In DSM-5. American Psychiatric Association. [Google Scholar]
- Anderson DK, Lord C, Risi S, DiLavore PS, Shulman C, Thurm A, Welch K, & Pickles A (2007). Patterns of Growth in Verbal Abilities Among Children With Autism Spectrum Disorder. Journal of Consulting and Clinical Psychology, 75(4), 594. 10.1037/0022-006X.75.4.594 [DOI] [PubMed] [Google Scholar]
- Bonnel A, McAdams S, Smith B, Berthiaume C, Bertone A, Ciocca V, Burack JA, & Mottron L (2010). Enhanced pure-tone pitch discrimination among persons with autism but not Asperger syndrome. Neuropsychologia, 48(9), 2465–2475. 10.1016/j.neuropsychologia.2010.04.020 [DOI] [PubMed] [Google Scholar]
- Yu Chen, Enze Tang, Hongwei Ding, & Yang Zhang. (2022). Auditory Pitch Perception in Autism Spectrum Disorder : A Systematic Review and Meta-Analysis. Journal of Speech, Language, and Hearing Research, 65(12), 4866–4886. 10.1044/2022_JSLHR-22-00254 [DOI] [PubMed] [Google Scholar]
- Colonnesi C, Stams GJJM, Koster I, & Noom MJ (2010). The relation between pointing and language development : A meta-analysis. Developmental Review, 30(4), 352–366. 10.1016/j.dr.2010.10.001 [DOI] [Google Scholar]
- de Carvalho A, Lidz J, Tieu L, Bleam T, & Christophe A (2016). English-speaking preschoolers can use phrasal prosody for syntactic parsing. The Journal of the Acoustical Society of America, 139(6), EL216. 10.1121/1.4954385 [DOI] [PubMed] [Google Scholar]
- Dumont C, Biston A, Clin E, Wintgens A, & Kissine M (2022, mai). Case studies of non-interactive bilingualism in ASD [Poster Presentation]. 14th INSAR Annu. Sci. Meet. [Google Scholar]
- Eigsti IM, & Fein DA (2013). More is less : Pitch discrimination and language delays in children with optimal outcomes from autism. Autism research : official journal of the International Society for Autism Research, 6(6), 605–613. 10.1002/AUR.1324 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ellis Weismer S, & Kover ST (2015). Preschool language variation, growth, and predictors in children on the autism spectrum. Journal of Child Psychology and Psychiatry and Allied Disciplines, 56(12), 1327–1337. 10.1111/jcpp.12406 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Foushee R, & Srinivasan M (2024). Infants who are rarely spoken to nevertheless understand many words. Proceedings of the National Academy of Sciences, 121(23), e2311425121. 10.1073/pnas.2311425121 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Germain E, Foster NEV, Sharda M, Chowdhury R, Tryfon A, Doyle-Thomas KAR, Anagnostou E, & Hyde KL (2019). Pitch direction ability predicts melodic perception in autism. Child Neuropsychology, 25(4), 445–465. 10.1080/09297049.2018.1488954 [DOI] [PubMed] [Google Scholar]
- Goodsitt JV, Morgan JL, & Kuhl PK (1993). Perceptual strategies in prelingual speech segmentation. Journal of Child Language, 20(2), 229–252. 10.1017/S0305000900008266 [DOI] [PubMed] [Google Scholar]
- Haesen B, Boets B, & Wagemans J (2011). A review of behavioural and electrophysiological studies on auditory processing and speech perception in autism spectrum disorders. Research in Autism Spectrum Disorders, 5(2), 701–714. 10.1016/j.rasd.2010.11.006 [DOI] [Google Scholar]
- Happé F, & Frith U (2006). The Weak Coherence Account : Detail-focused Cognitive Style in Autism Spectrum Disorders. Journal of Autism and Developmental Disorders, 36(1), 5–25. 10.1007/s10803-005-0039-0 [DOI] [PubMed] [Google Scholar]
- Heaton P (2003). Pitch memory, labelling and disembedding in autism. Journal of Child Psychology and Psychiatry, 44(4), 543–551. 10.1111/1469-7610.00143 [DOI] [PubMed] [Google Scholar]
- Heaton P (2005). Interval and Contour Processing in Autism. Journal of Autism and Developmental Disorders, 35(6), 787–793. 10.1007/s10803-005-0024-7 [DOI] [PubMed] [Google Scholar]
- Heaton P, Hudry K, Ludlow A, & Hill E (2008). Superior discrimination of speech pitch and its relationship to verbal ability in autism spectrum disorders. Cognitive Neuropsychology, 25(6). 10.1080/02643290802336277 [DOI] [PubMed] [Google Scholar]
- Jones CRG, Happé F, Baird G, Simonoff E, Marsden AJS, Tregay J, Phillips RJ, Goswami U, Thomson JM, & Charman T (2009). Auditory discrimination and auditory sensory behaviours in autism spectrum disorders. Neuropsychologia, 47(13). 10.1016/j.neuropsychologia.2009.06.015 [DOI] [PubMed] [Google Scholar]
- Jusczyk PW, Houston DM, & Newsome M (1999). The Beginnings of Word Segmentation in English-Learning Infants. Cognitive Psychology, 39(3), 159–207. 10.1006/cogp.1999.0716 [DOI] [PubMed] [Google Scholar]
- Kellerman GR, Fan J, & Gorman JM (2005). Auditory Abnormalities in Autism : Toward Functional Distinctions Among Findings. CNS Spectrums, 10(9), 748–756. 10.1017/S1092852900019738 [DOI] [PubMed] [Google Scholar]
- Kissine M (2020). Autism, constructionism and nativism. Language, 1–35. [Google Scholar]
- Kissine M, Luffin X, Aiad F, Bourourou R, Deliens G, & Gaddour N (2019). Noncolloquial Arabic in Tunisian Children With Autism Spectrum Disorder : A Possible Instance of Language Acquisition in a Noninteractive Context. Language Learning, 69(1), 44–70. 10.1111/lang.12312 [DOI] [Google Scholar]
- Kissine M, Saint-Denis A, & Mottron L (2023). Language acquisition can be truly atypical in autism : Beyond joint attention. 105384. 10.1016/j.neubiorev.2023.105384 [DOI] [PubMed] [Google Scholar]
- Klin A, Danovitch JH, Merz AB, & Volkmar FR (2007). Circumscribed Interests in Higher Functioning Individuals with Autism Spectrum Disorders : An Exploratory Study. Research and Practice for Persons with Severe Disabilities, 32(2), 89–100. 10.2511/rpsd.32.2.89 [DOI] [Google Scholar]
- Kuhl PK, Tsao FM, & Liu HM (2003). Foreign-language experience in infancy : Effects of short-term exposure and social interaction on phonetic learning. Proceedings of the National Academy of Sciences of the United States of America, 100(15), 9096–9101. 10.1073/pnas.1532872100 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Luyster RJ, Kadlec MB, Carter A, & Tager-Flusberg H (2008). Language assessment and development in toddlers with autism spectrum disorders. Journal of Autism and Developmental Disorders, 38(8), 1426–1438. 10.1007/s10803-007-0510-1 [DOI] [PubMed] [Google Scholar]
- Macmillan NA (2002). Signal detection theory. Stevens’ handbook of experimental psychology, 4, 43–90. [Google Scholar]
- Mottron L, & Burack J (2001). Enhanced perceptual functioning in the development of autism The development of autism : Perspectives from theory and research (pp. 131–148). [Google Scholar]
- Mottron L, Dawson M, Soulières I, Hubert B, & Burack J (2006). Enhanced Perceptual Functioning in Autism : An Update, and Eight Principles of Autistic Perception. Journal of Autism and Developmental Disorders, 36(1), 27–43. 10.1007/s10803-005-0040-7 [DOI] [PubMed] [Google Scholar]
- Mottron L, Ostrolenk A, & Gagnon D (2021). In Prototypical Autism, the Genetic Ability to Learn Language Is Triggered by Structured Information, Not Only by Exposure to Oral Language. Genes, 12(8), Article 8. 10.3390/genes12081112 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nowell KP, Bernardin CJ, Brown C, & Kanne S (2021). Characterization of Special Interests in Autism Spectrum Disorder : A Brief Review and Pilot Study Using the Special Interests Survey. Journal of Autism and Developmental Disorders, 51(8), 2711–2724. 10.1007/s10803-020-04743-6 [DOI] [PubMed] [Google Scholar]
- Ostrolenk A, Gagnon D, Boisvert M, Lemire O, Dick S-C, Côté M-P, & Mottron L (2024). Enhanced interest in letters and numbers in autistic children. Molecular Autism, 15(1), 26. 10.1186/s13229-024-00606-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Paul R, Chawarska K, Cicchetti D, & Volkmar F (2008). Language outcomes of toddlers with autism spectrum disorders : A two year follow-up. Autism Research, 1(2), 97–107. 10.1002/aur.12 [DOI] [PMC free article] [PubMed] [Google Scholar]
- R Core Team. (2024). R: A Language and Environment for Statistical Computing [Logiciel]. https://www.R-project.org
- Rutter M, Bailey A, & Lord C (2003). The Social Communication Questionnaire : Manual. Adaptation française Kruck, Baduel, & Rogé, 2013 (Western Psychological Services; ). [Google Scholar]
- Rutter M, Le Couteur A, & Lord C (2003). ADI-R. Autism diagnostic interview revised. Manual. Los Angeles: Western Psychological Services. [Google Scholar]
- Sachs J, Bard B, & Johnson ML (1981). Language learning with restricted input : Case studies of two hearing children of deaf parents. Applied Psycholinguistics, 2(1), 33–54. 10.1017/S0142716400000643 [DOI] [Google Scholar]
- Shukla M, White KS, & Aslin RN (2011). Prosody guides the rapid mapping of auditory word forms onto visual objects in 6-mo-old infants. Proceedings of the National Academy of Sciences, 108(15), 6038–6043. 10.1073/pnas.1017617108 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Soderstrom M, Nelson DGK, & Jusczyk PW (2005). Six-month-olds recognize clauses embedded in different passages of fluent speech. Infant Behavior and Development, 28(1), 87–94. 10.1016/j.infbeh.2004.07.001 [DOI] [Google Scholar]
- Su PL, Rogers SJ, Estes A, & Yoder P (2021). The role of early social motivation in explaining variability in functional language in toddlers with autism spectrum disorder. Autism, 25(1), 244–257. 10.1177/1362361320953260 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thiessen ED, & Saffran JR (2003). When Cues Collide : Use of Stress and Statistical Cues to Word Boundaries by 7- to 9-Month-Old Infants. Developmental Psychology, 39(4), 706–716. 10.1037/0012-1649.39.4.706 [DOI] [PubMed] [Google Scholar]
- Torsheim T, Cavallo F, Levin KA, Schnohr C, Mazur J, Niclasen B, Currie C, & the FAS Development Study Group. (2016). Psychometric Validation of the Revised Family Affluence Scale : A Latent Variable Approach. Child Indicators Research, 9(3), 771–784. 10.1007/s12187-015-9339-x [DOI] [PMC free article] [PubMed] [Google Scholar]
- Uljarević M, Alvares GA, Steele M, Edwards J, Frazier TW, Hardan AY, & Whitehouse AJ (2022). Toward better characterization of restricted and unusual interests in youth with autism. Autism, 26(5), 1296–1304. 10.1177/13623613211056720 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vulchanova M, Talcott JB, Vulchanov V, & Stankova M (2012). Language against the odds, or rather not : The weak central coherence hypothesis and language. Journal of Neurolinguistics, 25(1), 13–30. 10.1016/j.jneuroling.2011.07.004 [DOI] [Google Scholar]
- Wang X, Wang S, Fan Y, Huang D, & Zhang Y (2017). Speech-specific categorical perception deficit in autism : An Event-Related Potential study of lexical tone processing in Mandarin-speaking children. Scientific Reports, 7(1), 43254. 10.1038/srep43254 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weschler D (2014). Weschler Intelligence Scale for Children—Fifth Edition (WISC-V) (Pearson; ). [Google Scholar]
- Wiig EH, Secord WA, & Semel E (2013). Clinical evaluation of language fundamentals : CELF-5. Pearson. [Google Scholar]
- Wiig EH, Semel E, & Secord WA (2019). CELF 5 : Évaluation des fonctions langagières et de communication (Éditions du Centre de psychologie appliquée).
- Wodka EL, Mathy P, & Kalb L (2013). Predictors of phrase and fluent speech in children with Autism and severe language delay. Pediatrics, 131(4), e1128–e1134. 10.1542/peds.2012-2221 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yoder P, Watson LR, & Lambert W (2015). Value-Added Predictors of Expressive and Receptive Language Growth in Initially Nonverbal Preschoolers with Autism Spectrum Disorders. Journal of Autism and Developmental Disorders, 45(5), 1254–1270. 10.1007/S10803-014-2286-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu L, Fan Y, Deng Z, Huang D, Wang S, & Zhang Y (2015). Pitch Processing in Tonal-Language-Speaking Children with Autism : An Event-Related Potential Study. Journal of Autism and Developmental Disorders, 45(11), 3656–3667. 10.1007/s10803-015-2510-x [DOI] [PubMed] [Google Scholar]
- Zhukova MA, Talantseva OI, An I, & Grigorenko EL (2021). Brief Report : Unexpected Bilingualism : A Case of a Russian Child With ASD. Journal of Autism and Developmental Disorders 2021, 1–8. 10.1007/S10803-021-05161-Y [DOI] [PubMed] [Google Scholar]
- Zimmerman FJ, & Christakis DA (2005). Children’s Television Viewing and Cognitive Outcomes : A Longitudinal Analysis of National Data. Archives of Pediatrics & Adolescent Medicine, 159(7), 619–625. 10.1001/archpedi.159.7.619 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
De-identified data and R scripts are available from the corresponding author on request.
