Speech Recognition Abilities in Normal-Hearing Children 4 to 12 Years of Age in Stationary and Interrupted Noise

Wiepke J A Koopmans; S Theo Goverts; Cas Smits

doi:10.1097/AUD.0000000000000569

. 2018 Oct 26;39(6):1091–1103. doi: 10.1097/AUD.0000000000000569

Speech Recognition Abilities in Normal-Hearing Children 4 to 12 Years of Age in Stationary and Interrupted Noise

Wiepke J A Koopmans ^1,², S Theo Goverts ¹, Cas Smits ^1,^✉

PMCID: PMC7664447 PMID: 29554035

Abstract

Objectives:

The main purpose of this study was to examine developmental effects for speech recognition in noise abilities for normal-hearing children in several listening conditions, relevant for daily life. Our aim was to study the auditory component in these listening abilities by using a test that was designed to minimize the dependency on nonauditory factors, the digits-in-noise (DIN) test. Secondary aims were to examine the feasibility of the DIN test for children, and to establish age-dependent normative data for diotic and dichotic listening conditions in both stationary and interrupted noise.

Design:

In experiment 1, a newly designed pediatric DIN (pDIN) test was compared with the standard DIN test. Major differences with the DIN test are that the pDIN test uses 79% correct instead of 50% correct as a target point, single digits (except 0) instead of triplets, and animations in the test procedure. In this experiment, 43 normal-hearing subjects between 4 and 12 years of age and 10 adult subjects participated. The authors measured the monaural speech reception threshold for both DIN test and pDIN test using headphones. Experiment 2 used the standard DIN test to measure speech reception thresholds in noise in 112 normal-hearing children between 4 and 12 years of age and 33 adults. The DIN test was applied using headphones in stationary and interrupted noise, and in diotic and dichotic conditions, to study also binaural unmasking and the benefit of listening in the gaps.

Results:

Most children could reliably do both pDIN test and DIN test, and measurement errors for the pDIN test were comparable between children and adults. There was no significant difference between the score for the pDIN test and that of the DIN test. Speech recognition scores increase with age for all conditions tested, and performance is adult-like by 10 to 12 years of age in stationary noise but not interrupted noise. The youngest, 4-year-old children have speech reception thresholds 3 to 7 dB less favorable than adults, depending on test conditions. The authors found significant age effects on binaural unmasking and fluctuating masker benefit, even after correction for the lower baseline speech reception threshold of adults in stationary noise.

Conclusions:

Speech recognition in noise abilities develop well into adolescence, and young children need a more favorable signal-to-noise ratio than adults for all listening conditions. Speech recognition abilities in children in stationary and interrupted noise can accurately and reliably be tested using the DIN test. A pediatric version of the test was shown to be unnecessary. Normative data were established for the DIN test in stationary and fluctuating maskers, and in diotic and dichotic conditions. The DIN test can thus be used to test speech recognition abilities for normal-hearing children from the age of 4 years and older.

Keywords: Age factors, Binaural unmasking, Child, Fluctuating masker benefit, Hearing tests, Speech intelligibility, Speech-in-noise recognition, Speech perception, Speech recognition abilities, Speech reception threshold test

INTRODUCTION

Young children spend many hours a day in complex acoustic environments with noise and reverberation such as kindergarten and school. In these demanding listening situations, they have to communicate with their parents, teachers, and other children. Previous studies have shown that children have more difficulty than adults with recognizing speech in noisy situations (Crandell 1993; Hall et al. 2002), and that speech recognition abilities in noise develop at least to the age of 10 to 12 years (Buss et al. 2006; Hall et al. 2004; Holder et al. 2016; Vaillancourt et al. 2008). Children’s reduced speech recognition abilities in noise may affect how well they learn in a noisy classroom, through both formal education and incidental learning. On top of this developmental effect, the ability to recognize speech in noise can be strongly reduced by hearing loss (Ching et al. 2017), which makes daily-life listening conditions often critical for children with hearing impairment. To quantify the consequences of hearing loss in children with hearing impairment, it is important to relate the outcomes of hearing assessment to those of their normal-hearing peers. Hence, it is important and clinically relevant to know how speech recognition abilities in noise of normal-hearing children develop with age.

Listening in an acoustically demanding situation involves combining the two different, but related noise-corrupted speech fragments from both ears. Children’s speech recognition abilities therefore depend on their ability to separate speech from noise, to benefit from fluctuations in the background noise, and to benefit from binaural cues. Previous studies have shown that children’s test performance on speech-in-noise tests improves with age (Corbin et al. 2016; Elliott 1979; Hall et al. 2002). In stationary speech-shaped noise, most children achieved adult-like performance by 10 years of age or later (Corbin et al. 2016; Elliott 1979; Hall et al. 2002; Holder et al. 2016; Neuman et al. 2010; Nishi et al. 2010; Wilson et al. 2010). Other masker types result in larger and more prolonged age effects. For example, Leibold and Buss (2013) found that the developmental trajectory depends on the masker type, and found a more prolonged developmental time course for consonant detection in two-talker babble than in speech-shaped noise masker. Corbin et al. (2016) found a similar prolonged developmental time course for word detection in two-talker babble compared with a speech-shaped masker. They hypothesized that masked speech recognition may rely on mature executive function to a greater extent in two-talker speech (informational masking) than in speech-shaped noise (energetic masking), and that this places a greater cognitive load on the child. This shows that the development of speech recognition in noise abilities has been explained in terms of both auditory and nonauditory factors, and that the exact developmental time course likely depends on test procedures and masker types.

The effects of fluctuations in the noise on the speech recognition abilities of children have been studied by Stuart (2005, 2008), Hall et al. (2012), and Buss et al. (2016). They found that the test performance improves with age, and that children under the age of 11 to 14 years old need a more favorable signal to noise ratio (SNR) to perform as well as adults. Stuart (2008) measured performance in five groups of children (6–7, 8–9, 10–11, 12–13, and 14–15 years) and in adults, and found that the fluctuating masker benefit (FMB; i.e., the release of masking because of the interruptions in the noise) for the children was not significantly different from that of adults. He suggested that school-age children have an inherent poorer central processing efficiency, rather than poorer temporal resolution. Thus, children benefit from listening in the “gaps’” but their ability to recognize speech in noise is limited by ongoing maturation of the auditory system and their developing language and attention skills. Hall et al. (2012) tested the effect of temporally modulated maskers (100% sinusoidal modulation at a rate of 10 Hz) on speech recognition scores of children 4.6 to 11.1 years of age. They found a significant developmental effect on masking release related to the temporal modulations in the noise. In a later study, Buss et al. (2016) found similar age effects on masking release by a modulated masker in a four-alternative forced-choice response context. In both studies, the authors speculated that young children are relatively poor in the ability to piece together sparse “glimpses” of speech. They also noted that the observed developmental effect might be related to the relatively high SNR associated with baseline-masked thresholds in younger children (Bernstein & Grant 2009; Smits & Festen 2013). The origin of the observed child–adult differences in FMB is therefore not entirely clear.

The development of binaural hearing abilities has been studied in various ways. Binaural hearing abilities of adults are often assessed using headphones with binaural intelligibility level difference tests (Johansson & Arlinger 2002; Licklider 1948). These tests use the masking level difference when the speech is phase shifted between the right and left ear compared with the homophasic condition. The binaural unmasking [BU; i.e., the difference in speech reception threshold (SRT) between diotic (N0S0) and dichotic (N0Sπ) presentation] can amount to 7 dB SNR for adults (Johansson & Arlinger 2002). BU has been explored in tests with children, but typically only in tone discrimination tasks. Moore et al. (2011) did not find a significant change in masking level difference with age, but they also state that the small number of children examined in their study may have contributed to the nonsignificant result. Several other groups have studied children’s ability to benefit from spatial and binaural hearing when target speech and competing noise are spatially separated in a sound field. There is no consensus on how this spatial release of masking (SRM) develops with age (as reviewed by Yuen & Yuan 2014). Some studies report that SRM does not improve with age and becomes adult-like at a young age (Ching et al. 2011; Garadat & Litovsky 2007; Litovsky 2005; Murphy et al. 2011). For example, Litovsky (2005) used target speech that was presented from the front, while speech or modulated speech-shaped noise competitors were either in front or on the right at 90°. She found that SRM was similar in the two age groups (children 4 to 7 years of age and adults), and even greater in children in one condition tested. Her findings suggested that young children are already able to utilize spatial and/or head shadow cues to segregate sounds in noisy environments. By contrast, other studies report that SRM improves with age and that it takes much longer to reach adult-like performance (Cameron et al. 2009; Cameron & Dillon 2007; Vaillancourt et al. 2008; Van Deun et al. 2010; Yuen & Yuan 2014). For example, Van Deun et al. (2010) used a speech test with digits in noise to measure speech perception benefits in normal-hearing children between 4 and 8 years of age and normal-hearing adults. They measured SRM, head shadow effects, summation effects, and squelch and found that only SRM was influenced by age. Yuen and Yuan (2014) revisited the research question on whether the development of SRM is completed early or late in children. They hypothesized that there is a much longer maturational time for SRM than suggested by other studies (Ching et al. 2011; Garadat & Litovsky 2007; Litovsky 2005; Murphy et al. 2011) because of the ongoing maturation of the auditory system. They performed SRM testing with children (4 to 9 years of age) and adults and found that SRM improves significantly with age. A robust regression of 0.1 to 0.15 dB SRM improvement per month was observed for two different test materials.

To summarize, a substantial body of literature demonstrates lower speech recognition scores in noise for children than for adults in various conditions that are relevant for everyday listening. The developmental time course depends on test materials and masker types used. The origin of the observed age dependencies for different speech recognition in noise abilities remains to be explained, but the literature suggests that auditory and nonauditory factors play a role.

Assessing and Interpreting Speech Recognition Abilities in Children

When assessing and interpreting speech recognition test scores in children, one has to consider various factors that can influence the conclusions based on the test: the test should measure the same speech recognition ability for children and adults; age-dependent normative data should be available; and the effect of the baseline SNR at which the masking release is estimated (Smits & Festen 2011) has to be taken into account.

First, standard speech-in-noise tests designed for adults may not be suitable for children because of the procedures and materials that are used. For example, the standard Dutch speech-in-noise tests [sentence speech-in-noise tests from Plomp & Mimpen (1979) and Versfeld et al. (2000)] are not suitable for children under 12 years of age because of the language competency required to complete the test. Mendel (2008) points out that test performance can be influenced by the child’s vocabulary, language competency, and cognitive abilities. These nonauditory (or top–down) processes are developing during childhood. It is therefore difficult to discriminate between the developmental aspects of purely auditory speech recognition abilities (or bottom–up) and top–down processes. Moore et al. (2011) point out that children’s reduced performance on auditory tasks may primarily be due to nonsensory factors. The poorer test performance of children is often explained in terms of “elevated internal noise” or “poor processing efficiency,” although these concepts are ill defined. Because both auditory (bottom–up) and nonauditory (top–down) factors are developing in children, age-appropriate tests are needed. Either target speech material, competitor noise, or the adaptive test procedure must be modified to meet the needs of children. Although Mendel (2008) gives a guide how to design test materials to be age appropriate, it is unclear to what extent the test scores and findings are still impacted by the child’s cognitive abilities, attention skills, and linguistic proficiency. It is important to minimize the effect of these nonsensory factors on the test from purely auditory factors to primarily test the auditory bottom–up component of speech recognition in noise (Smits et al. 2013).

Second, to relate the test results of a child to their peers, age-dependent normative data should be available for the speech-in-noise test. Establishing normative data is quite an effort because many children from different age groups have to be tested for a clear understanding of the age-dependent mean test score and confidence interval. For practical reasons, there are some advantages to acquiring these normative data by using headphones, rather than in a free field. Free-field tests have to be conducted in a sound booth at a clinic and are susceptible to the child’s head movements, and variation in position relative to the loudspeaker. When using acoustically isolated headphones, children’s head movements have little effect on the test result and the tests may be performed outside the test booth, such as at school. This greatly facilitates the recruitment of an adequate sample size for each age group involved.

Finally, when considering the effect of age on masking release (FMB, SRM, or BU), one must realize that the amount of unmasking may depend on the baseline SNR at which unmasking is estimated (Bernstein & Grant 2009; Oxenham & Simonson 2009; Smits & Festen 2011). The slope of the speech recognition function is, in general, shallower in fluctuating noise than in steady-state noise (Smits & Festen 2013). Because of slope differences between the speech recognition function for the baseline stationary noise condition and speech recognition functions for other listening conditions (e.g., fluctuating maskers or spatially separated maskers), the FMB, SRM, or BU (i.e., the difference between these functions expressed in dB SNR) may depend on the baseline SNR. Thus, the FMB, SRM, or BU may be lower for children than for adults because children need a higher SNR in the baseline condition than adults. This effect is often overlooked in studies reported in the literature, and could potentially explain part of the reported age dependence of FMB and SRM. Therefore, when studying the effect of age on FMB and SRM, it is important to take the baseline SNR into account, and ideally, performance should be measured across the psychometric function. In summary, the assessment of speech recognition abilities in children could be facilitated by a speech recognition test that is applicable to children and adults; for which age-dependent normative data are available; and for which the effect of baseline SNR on masking release can be taken into account.

Digits-in-Noise Test

Smits et al. (2013) developed a digits-in-noise (DIN) test that was designed to measure primarily the auditory, bottom–up, speech recognition abilities in noise. The DIN test measures the SRT (i.e., the SNR corresponding to 50% correct recognition) for digit–triplets in long-term average speech spectrum (LTASS) noise. Smits et al. (2013) validated the test for adults and found that after a practice run, there is no residual learning effect. There is a high correlation (r = 0.96) with SRT scores obtained with the standard sentences speech-in-noise test (Plomp & Mimpen 1979). Because of the steep speech recognition function, the DIN test has a small measurement error of only 0.7 dB (Smits et al. 2013). Because test scores on the DIN test hardly (<1 dB) depend on linguistic abilities (Kaandorp et al. 2016), the DIN test can be used in virtually the entire population of adults with hearing loss, from normal-hearing listeners to listeners with severe to profound hearing losses and cochlear implant recipients (Kaandorp et al. 2015). The DIN test has not been used in children before.

Aims of the Study

The primary purpose of this study was to examine developmental effects on speech recognition in noise abilities for normal-hearing children 4 to 12 years of age in stationary and interrupted noise. We used the DIN test to minimize the dependency on nonauditory factors. In Experiment 1, results on the DIN test are compared with test results on an adapted version of the DIN test, the pediatric DIN (pDIN) test that was designed to rule out contribution of specific nonauditory factors that might influence test performance for the youngest children. The factors are related to the test procedures and test materials used in the DIN test. For example, the DIN test uses a task that requires the reproduction of three digits. Normative data from the digit span tests in the Wechsler Intelligence Scale for Children show that the digits span increases with age and that 1.5% of the 6-year-old children cannot recall three digits in the forward direction (Wechsler 2004). This suggests that a small fraction of the 6-year-old and probably even larger fractions of 4- and 5-year-old children do not have the auditory memory to do the DIN test reliably. The pDIN test uses the same speech tokens as the DIN test, with the digit “0” omitted and in a single-digit paradigm, to circumvent this issue. Other modifications were made to simplify the test and make it appealing even to the youngest children (see Experiment 1 for details).

In Experiment 2, age-dependent normative data in normal-hearing children 4 to 12 years of age were established for the DIN test under headphones for N0S0 and N0Sπ listening conditions in both stationary and interrupted noise. We analyzed FMB and BU with respect to the baseline SNRs to determine “true” developmental effects of speech recognition in noise abilities, and describe the developmental time course for these effects.

EXPERIMENT I: A COMPARISON BETWEEN THE pDIN TEST AND THE STANDARD DIN TEST

A pDIN test was developed to simplify the test and make it appealing even to the youngest children. Results of the pDIN test were compared with those of the DIN test to find out which factors influence test performance.

Materials and Methods

Subjects

In this study, 43 native Dutch-speaking, normal-hearing children (22 male and 21 female) between 4 and 13 years of age participated. They were recruited from a local primary school. Their parents or caregivers, and children of 12 years of age and older gave their written informed consent. To determine adult reference data, 10 native Dutch-speaking (1 male, 9 female), normal-hearing adults between 18 and 33 years of age participated in the study. Normal hearing was defined as air conduction thresholds equal to or better than 20 dB HL for all octave frequencies from 0.25 to 8 kHz in the test ear. All subjects had normal (type A) tympanograms.

DIN Test Stimuli and Test Procedure

The speech material and masking noise used in the DIN test are described elsewhere in detail (Smits et al. 2013). Briefly, the DIN test uses a set of 120 unique digit–triplet combinations constructed from the digits 0 to 9 uttered by a male speaker, separated by short (150 msec), silent intervals. Each triplet stimulus started and ended with 500 msec of silence. All the silent intervals were enlarged or reduced with an interval chosen at random between +50 and −50 msec to add uncertainty to the listening task. The stimulus was mixed with LTASS masking noise to achieve the desired SNR. The noise started and ended with a 100 msec raised cosine ramp. The duration of the triplet-in-noise files ranged from 2.8 to 3.1 sec. Signal and noise were presented at a fixed overall level of 65 dBA. The SNR was varied adaptively following the standard one-up one-down procedure with a step size of 2 dB SNR. The first stimulus was presented at a favorable SNR of 6 to 8 dB above the expected SRT, and 24 triplets were presented. The SNR for triplet 25 was calculated but not presented. The DIN SRT was calculated by taking the average SNR of trial 5 to 25.

pDIN Test Stimuli and Test Procedure

The pDIN test uses the single digits 1 to 9 from the same speech material as the DIN test, but in a single-digit format. The digit 0 was omitted from the test because the concept of zero needs longer to develop in children (Wellman & Miller 1986). Each digit stimulus started and ended with 500 msec of silence, enlarged or reduced with an interval chosen at random between +50 and −50 msec to add uncertainty to the listening task. The stimulus was mixed with LTASS masking noise to achieve the desired SNR. Each presentation started and ended with a 100 msec raised cosine ramp. The test first presented the digits 1 to 9 in random order in quiet to find out if the child could reproduce each digit correctly. If the child did not repeat a particular digit correctly, it was presented a second time. The digit was automatically omitted from the test if the child responded incorrectly a second time. Then the noise was introduced with an animation, depicting a scientist that builds a “noise-machine,” Next, the adaptive test procedure started. Signal and noise were presented at a fixed overall level of 65 dBA. The SNR was varied adaptively, with a weighted up–down procedure (Kaernbach 1991). The step size for trial 1 to 4 was 3 dB, to approach the SRT quickly. Step sizes for trial 5 to 24 were 0.67 dB down and 2.57 dB up, such that the 79.4% point of the psychometric function was targeted. By choosing this target point, the pDIN SRT for the pDIN test and the DIN SRT for the DIN test correspond to the same SNR, and can theoretically be compared. The SRT for the DIN test is defined as the SNR where 50% of the triplets are reproduced correctly. The triplet consists of concatenated digits without prosody or coarticulation. This means that the probability of reproducing an individual digit ( Inline graphic ) correctly at this SNR is statistically independent from the other digits in the triplet (Smits & Houtgast 2006). The probability of reproducing a triplet correct () is then given by a simple product of the probabilities of the individual digits in this triplet: . Hence, , which means 79.4% of the single digits are reproduced correctly at this target point.

Lists of 24 digits were presented, such that each digit 1 to 9 was presented at least twice and at most three times, and that consecutive digits were never the same. The SNR for triplet 25 was calculated but not presented. The child repeated the digits, and the experimenter recorded the response in the computer program. The pDIN SRT was calculated by taking the average SNR of presentations 5 to 25. Dummy presentations of digits at a favorable SNR (+5 dB from the current estimated SRT) were presented after animations every six trials to keep the child motivated and alert. The response to the dummy presentation was not used for calculating the SRT.

Setup

All tests with children were carried out in a quiet office room at the local primary school. Air conduction pure-tone audiograms were measured with a portable clinical audiometer (Noordwijk, The Netherlands: Decos Technology) and Sennheiser HDA 200 headphones (Wedemark, Germany: Sennheiser electronic GmbH & Co. KG). Custom software (Austin, Texas: Delphi Embarcadero Technologies) was developed for the pDIN test and DIN test. It presents speech and noise stimuli at a defined SNR, records and judges the response, adjusts the SNR, and stores the results in a database. All stimuli were presented monaurally through Sennheiser HDA 200 headphones, connected to a digital sound card (Soundblaster Audigy; Dublin, Ireland: Creative Technology Ltd) and a laptop. All tests with adults were carried out with the same equipment, in a standard, quiet office room at VU University Medical Center, Amsterdam.

Overall Test Procedure

Each session started with a pDIN test practice run to familiarize the child with the task and eliminate procedural learning effects. Next, the child performed a pDIN test and retest, and finally, a DIN practice run followed by a single DIN test. In the initial test phase, only pDIN test measurements were performed because we expected that the DIN test would be too difficult for the younger children. However, during the experiment, it became apparent that testing with the DIN test was feasible for almost all of the children and the DIN test was administered in all children from then on. Therefore, the DIN test was not performed by all subjects. Finally, the experimenter determined the child’s pure-tone audiogram. Testing sessions took 20 to 30 min per subject. The procedure for testing adults was similar, except that animations and dummy trials were not presented in the pDIN test. The study was approved by the VU University Medical Centre Medical Ethical Committee.

Results

pDIN and DIN Test Feasibility

All children of 4 years of age and older could repeat the numbers 1 to 9 in quiet, thus no digits were omitted from the test. The administration of multiple tests in a single test session was possible for all children. All 43 children did a pDIN test practice run and a test, and 41 did a retest. For the children who were tested with the DIN test (N = 35), only 1 did not complete the test after the practice run. Thus, 34 from 35 children performed five SRT tests in one session and only 1 completed four SRT tests. A single test took approximately 3 min. The task of reproducing either single digits or triplets in noise could be performed even by the youngest children.

The mean SRT scores and the standard error of the mean (SEM; derived from the test–retest differences) for the pDIN test for different age groups are summarized in Table 1. The measurement error for the pDIN test, represented by the SEM, was approximately 1 dB for all age groups (range, 0.9 to 1.1 dB). A repeated-measures analysis of variance (ANOVA) was conducted to compare the effects of age group and learning (test versus retest) on the pDIN SRT. There was a significant effect of age group [F(3,47) = 22.2; p < 0.001]. There was neither a significant effect of learning [F(1,47) = 0.00] nor a significant interaction between age group and learning [F(3,47) = 0.55; p = 0.562]. These results suggest that SRTs change with age and that there is no residual learning effect between the first and second test after the practice run.

TABLE 1.

Group mean monaural SRT scores for the DIN and pDIN tests for different age groups, measurement error for the pDIN test, and difference between pDIN and DIN SRT

graphic file with name aud-39-1091-g001.jpg

Open in a new tab

When the digit 0 was presented as part of a triplet in the DIN test, children 4 to 5 years of age reproduced this digit correctly 82% of the trials across the different SNRs presented. Adults reproduced this digit correctly 74% of the trials across the different SNRs presented. This percentage was in the same range of percentages correct for the digits 1 to 9 (67 to 88% for the children 4 to 5 years of age and 64 to 91% for the adults). This finding shows that the digit 0 can be used in the DIN test for young children of 4 to 5 years of age and older.

Age Dependency of DIN SRT and pDIN SRT

Figure 1 shows the monaural SRT measured with the DIN test and pDIN test as a function of age. The individual test scores for each child are shown in a scatter plot (retest scores are not shown). The box plot represents the results for the adult group, with median group SRT (horizontal line), 25th and 75th percentile SRT (box ends), 10th and 90th percentile SRT (whiskers), and outliers (open circles). The thick line is an exponential fit to the data from the children. The regression equations are shown in Figure 1. The intersubject variance (spread in SRT values within the group) was different for both tests, for children and adults. After correction for the age-dependent group mean SRT, the standard deviation for the DIN test and pDIN test was 0.8 and 1.2 dB for children, and 0.34 and 1.1 dB for adults, respectively. For children, this observed reduction in variance is consistent with the three times greater number of presentations in the DIN test relative to the pDIN test, derived from model calculations by Smits and Houtgast (2006). For adults, the reduction in variance is larger than predicted.

Fig. 1. — Age Dependency of DIN SRT and pDIN SRT for monaural presentation. (A) pDIN SRT vs. age. (B) DIN SRT vs. age. The thick line represents an exponential fit to the data. DIN indicates digits-in-noise; pDIN, pediatric digits-in-noise; SNR, signal to noise ratio; SRT, speech reception threshold.

Equivalence of DIN SRT and pDIN SRT

Figure 2A shows the DIN SRT as a function of the pDIN SRT. Given the relatively small range in SRT values, there is still a reasonably strong, positive correlation between the two. Pearson correlation coefficient was 0.74 for a single pDIN test and DIN test, and 0.85 when the average SRT of test and retest for the pDIN test was used.

Fig. 2. — Equivalence of DIN SRT and pDIN SRT. A, DIN SRT vs. pDIN SRT. Pearson r = 0.74 for a single test and Pearson r = 0.80 for the average of test and retest. The thick line represents the equal-SRT line. B, The difference in pDIN SRT–DIN SRT as a function of age. The slope and offset of the linear fit (represented by the thick line) are not significantly different from zero. DIN indicates digits-in-noise; pDIN, pediatric digits-in-noise; SNR, signal to noise ratio; SRT, speech reception threshold.

Figure 2B shows the difference between DIN SRT and pDIN SRT as a function of age. Linear regression gave nonsignificant values for both offset (−0.42 dB; p = 0.22) and slope (0.06 dB/yr; p = 0.07). A paired t test of DIN SRT and pDIN SRT showed that there is no significant difference between pDIN and DIN (pDIN SRT–DIN SRT = 0.15 dB; p = 0.4). These results indicate that the pDIN test and DIN test do not yield a significantly different SRT value for children and adults.

EXPERIMENT II: CHILDREN’S SPEECH RECOGNITION ABILITIES IN STATIONARY AND INTERRUPTED NOISE

Experiment I showed that both the pDIN test and DIN test can reliably be performed by normal-hearing children between 4 and 12 years of age and that they result in similar SRTs. Because the DIN SRT showed a smaller intersubject variance than the pDIN SRT, it was decided to further use the DIN test in Experiment II. The aim of Experiment II was to investigate speech recognition abilities in stationary and interrupted noise, and to determine the developmental time course of the benefit from BU and fluctuating maskers in children. BU was investigated by comparing N0S0 with N0Sπ presentation, while the FMB was measured by comparing SRTs in stationary noise with SRTs in interrupted noise. Finally, the combined effect of N0Sπ presentation and interrupted noise was studied.