Significance
Understanding speech in noisy environments is an essential communication skill that varies widely between individuals and is poorly understood. We show here that extended high-frequency (EHF) hearing, beyond the currently tested range of clinical audiometry, contributes to speech perception in noise. EHF hearing loss is common in otherwise normally hearing young adults and predicts self-reported difficulty hearing speech in noise. The data suggest that EHF hearing is a long sought missing link between audiometry and speech perception and may be a sensitive predictor of age-related hearing loss much earlier in life when preventive measures can be effectively deployed.
Keywords: self-report, digits-in-noise test, pure-tone audiometry, listening in noise, high-frequency hearing
Abstract
Young healthy adults can hear tones up to at least 20 kHz. However, clinical audiometry, by which hearing loss is diagnosed, is limited at high frequencies to 8 kHz. Evidence suggests there is salient information at extended high frequencies (EHFs; 8 to 20 kHz) that may influence speech intelligibility, but whether that information is used in challenging listening conditions remains unknown. Difficulty understanding speech in noisy environments is the most common concern people have about their hearing and usually the first sign of age-related hearing loss. Digits-in-noise (DIN), a widely used test of speech-in-noise perception, can be sensitized for detection of high-frequency hearing loss by low-pass filtering the broadband masking noise. Here, we used standard and EHF audiometry, self-report, and successively higher cutoff frequency filters (2 to 8 kHz) in a DIN test to investigate contributions of higher-frequency hearing to speech-in-noise perception. Three surprising results were found. First, 74 of 116 “normally hearing,” mostly younger adults had some hearing loss at frequencies above 8 kHz. Early EHF hearing loss may thus be an easily measured, preventive warning to protect hearing. Second, EHF hearing loss correlated with self-reported difficulty hearing in noise. Finally, even with the broadest filtered noise (≤8 kHz), DIN hearing thresholds were significantly better (P < 0.0001) than those using broadband noise. Sound energy above 8 kHz thus contributes to speech perception in noise. People with “normal hearing” frequently report difficulty hearing in challenging environments. Our results suggest that one contribution to this difficulty is EHF hearing loss.
Normal pure-tone hearing thresholds as measured by the “gold standard” audiogram do not necessarily mean absence of pathology in the cochlea or central auditory nervous system (1–4). For example, there is evidence from animal and some human studies that noise exposure and aging are associated with loss of synapses between inner hair cells and cochlear nerve fibers before any depletion of hair cells or elevation of hearing thresholds occurs (3, 5–8). This cochlear “synaptopathy” occurs first at higher frequencies (3) and has been hypothesized to contribute to a range of difficulties known in human studies by various names including “obscure auditory dysfunction” (9) and “hidden hearing loss” (10). Following the failure of standard audiometry to detect such early signs of neurodegeneration, some studies have examined the diagnostic utility of extended high-frequency (EHF) audiometry in detection of hidden hearing loss caused by noise exposure, ototoxicity, and aging (11–13).
Although frequencies below 6 kHz provide the phonetic information required for speech perception in quiet, substantial evidence suggests there is salient information in the higher-frequency regions that may affect speech intelligibility (1, 2, 14–18). Shaw et al. (19) were among the first to propose that poor speech perception in noise may be a result of an EHF hearing loss. In 9 adults who complained of difficulty understanding speech in noise they found elevated mean high-frequency (10 to 20 kHz) thresholds.
More generally, standard pure-tone audiometry is unable to predict with precision the level of difficulty a person will have listening to speech in a challenging environment (17, 20–22). Speech signals have a spectrotemporal complexity that changes with limited predictability over time. Accurate speech coding and recognition requires multiple auditory discrimination skills (23). Pure-tone detection also uses minimal cognitive resources, in contrast to suprathreshold speech perception, especially in the presence of competing noise (22). Speech-in-noise tests allow us to objectively and reliably measure speech recognition abilities (24–28). This is typically achieved by presenting speech segments, whole words, or sentences to listeners against a simultaneous background of broadband (BB) random noise or multitalker “babble.” A “speech reception threshold” (SRT) is defined as the speech signal/noise ratio (SNR) at which speech is correctly recognized in 50% of presentations.
Digits (0 through 9) have been used as stimuli in several speech perception tests (29–31) and in clinical screening (5, 24, 32–34). Digits are highly overlearned stimuli that are easily recognized by a wide range of people, including young children (35) and nonnative language speakers (36). Randomized digit triplets have a low probability of all being correctly guessed and provide more accurate and reproducible SRT estimates than other speech test materials, including nondigit words, single digits, and sentences (5, 33, 37). Previous studies have reported a strong correlation between DIN-SRTs and audiometric “pure-tone average” (PTA) measures in adult listeners of mixed hearing ability (5, 23, 24, 33). DIN can be administered as a rapid, reliable, fully automated self-test accessible by telephone (including smartphone apps) or internet, and there is no requirement for sound-attenuating booths or other clinical equipment (5, 23, 38–40).
Results
Audiological Testing.
To investigate contributions of higher-frequency hearing to speech-in-noise perception we first recruited 116 adult listeners (18 to 65 years old [y/o], M = 29.5, SD = 9.1; 67 females) with normal hearing sensitivity in both ears across the standard range of frequencies (≤20-dB hearing level [HL]; 0.25 to 8 kHz; Fig. 1A). However, 74 of these listeners (64%) had elevated EHF thresholds (10 to 16 kHz; >20-dB HL, Fig. 1B) at one or more frequencies, in one or both ears, as defined by the International Organization for Standardization (ISO 389-5, 2006). Retesting of thresholds at 14 and 16 kHz in 11 of the 116 listeners showed high reliability of EHF thresholds for both ears (left: Pearson’s r = 0.96; right: r = 0.97) and frequencies (14 kHz: r = 0.99; 16 kHz: r = 0.93). For nearly all listeners, EHF hearing thresholds, indexed by standardized HL, were higher than those in the standard range.
Fig. 1.
Hearing loss and hearing difficulty. Mean hearing thresholds of both ears for (A) standard frequency (PTA-SF) and (B) extended high-frequency (PTA-EHF) ranges. Black lines show median thresholds of both ears for all 116 listeners. Colored lines show mean thresholds of both ears for the individuals with most and least sensitive hearing, and those closest to the 25th and 75th percentiles. (C) Mean EHF thresholds of both ears for 5 age groups: 18 to 22 y/o (n = 29), 23 to 26 (n = 25), 27 to 30 (n = 25), 31 to 40 (n = 24), and 41 to 65 (n = 13). (D) PTA-EHF thresholds (pure-tone average of 8 EHFs = 4 frequencies × 2 ears) in 4 categories: low, lower mid, higher mid, and high (n = 29 in each category). Violin plots show kernel probability density of thresholds, boxes are interquartile range (with median), and whiskers are 1.5 times the interquartile range. (E) Rated self-reported difficulty listening in noise by hearing category. (F) Correlation between PTA-SF and PTA-EHF thresholds for individual listeners with and without self-reported difficulty listening in noise. (G) ROC curves showing test characteristics of the PTA-SF and PTA-EHF based on self-reported difficulty hearing in noise. The filled black dots correspond to the optimal cutoff of each test. (H) Correlation between age and PTA-EHF thresholds for individual listeners with and without self-reported difficulty listening in noise.
Listeners were mostly younger adults, categorized by age into 5 groups (Fig. 1C). Nonparametric analysis of variance (ANOVA) showed no significant differences between the youngest 3 age groups for PTA-EHF hearing thresholds (PTA of 8 EHFs = 4 frequencies × 2 ears). The 31 to 40 y/o group had significantly higher thresholds than the 2 youngest groups (P < 0.05), and the thresholds of the oldest group (41 to 65 y/o) were significantly higher (P < 0.0001) than all 4 other groups. Nevertheless, 44 of 78 listeners (56%) who were less than 30 y/o had at least 1 elevated EHF threshold (>20-dB HL).
Despite responding to a recruitment advertisement that called for “normal hearing,” in a background questionnaire that asked: “Do you find it difficult to follow a conversation if there is a background noise (such as TV, radio, children playing)?” (22), 39 of the 116 listeners (34%) reported difficulty. To analyze the relationship between self-reported difficulty and EHF hearing loss, participants were divided into 4 categories (Fig. 1D) based on PTA-EHF thresholds. Listeners self-reporting difficulty hearing in noise were significantly more likely to be placed in the higher categories, indicating EHF hearing loss (Fig. 1E; χ2 analysis; P < 0.0001).
By selection, these 116 listeners had sensitive thresholds in the standard frequency (SF) range, with mean thresholds (PTA-SF from 0.25 to 8.0 kHz, both ears) all <16-dB HL, and most <10-dB HL (Fig. 1F), well below a currently accepted upper limit of normal hearing (20-dB HL; American Academy of Audiology [AAA] Clinical Practice Guidelines, 2015). Nevertheless, PTA-SF was correlated with PTA-EHF (r = 0.56, P < 0.0001), raising the possibility that standard frequency hearing might be driving apparent sensitivity to EHF hearing. Remarkably, however, 25 of the 39 listeners (64%) reporting difficulties had PTA-EHF >20-dB HL, but mean PTA-SF of just 9-dB HL, further suggesting it is their EHF hearing that accounts for their difficulty.
Receiver operating characteristic (ROC) curves were calculated for PTA-SF and PTA-EHF to quantify and contrast self-report in this sample (Fig. 1G). The cutoff value on the ROC curve was the PTA (in dB HL) with the highest sensitivity and specificity. PTA-EHF had greater area under the curve (ROC area = 0.81, cutoff = 22.8-dB HL), compared to PTA-SF (ROC area = 0.71, cutoff = 4.7-dB HL), showing the superior ability of EHF hearing thresholds to predict difficulty hearing in noise in this “normal” hearing sample. Both PTA-EHF threshold and prevalence of self-reported hearing difficulty increased with age (Fig. 1H). Note, however, the prevalence of both PTA-EHF hearing loss (>20-dB HL) and self-reported difficulty among the younger listeners.
DIN Testing.
In our implementation of the DIN, digits were recorded and “homogenized” by adjusting the level of each digit to the mean SRT of all digits at 50% intelligibility (5). Homogenized digits were presented as triplets in noise through headphones (Fig. 2A) in an audiological sound booth. Extended bandwidth spectrograms of the digits revealed a surprisingly rich representation of EHF energy (Fig. 2B and SI Appendix, Fig. S1). Noise maskers are used in DIN testing to simulate real-world interfering sounds. Typically, the noises are BB (Fig. 2C) and relatively unmodulated, made by summing the spectrum of individual digits. Low-pass filtering the noise makes higher-frequency energy in the digits more audible, thus sensitizing the DIN for detection of high-frequency hearing loss. For example, a previously used 1.5-kHz low-pass noise masker produced higher sensitivity and specificity (81 to 93%) for detection of high-frequency hearing loss among a clinical sample than a broadband noise (5). Here, we extended these methods using low-pass filtered noise with higher cutoff frequencies (2, 4, and 8 kHz) in addition to broadband noise (Fig. 2C). We predicted that low-pass filtering with successively higher-frequency cutoffs would further sensitize the DIN to successively higher-frequency information in the standard audiometric range. We also hypothesized that acoustic information above the 8-kHz limit of standard audiometry would contribute to speech perception in noise.
Fig. 2.
EHF hearing improves speech perception in noise. (A) DIN test. (B) Spectrogram of the digit “6” showing substantial energy above 8 kHz. Spectrograms of all digits (0 through 9) are shown in SI Appendix, Fig. S1. (C) Four masking noise power spectra were unfiltered (broadband, BB) or had upper frequency cuts at 2, 4, or 8 kHz. (D) Speech reception threshold (SRT) improved with lower frequency noise spectrum cut. (E and F) Correlation between PTA-EHF thresholds with BB and 8-kHz low-pass filtered noise conditions for listeners with and without self-reported difficulty hearing in noise. Each data point indicates mean SRTs of individual listeners. Regression equations and lines, and correlations fit to all data.
Sixty of the listeners (19 to 62 y/o, M = 29.6, SD = 10.7; 39 females) who provided audiometric data were tested using DIN with each of the 4 filters. As noise bandwidth broadened (Fig. 2C) mean SRT increased (became poorer, Fig. 2D) as predicted, due to masking the higher-frequency components of the digits. However, as hypothesized, even the broadest low-pass filter (8-kHz cutoff) produced an SRT that was significantly better (mean 3.2 dB more negative; F1, 118 = 187.7, P < 0.0001) than that resulting from the unfiltered, broadband noise. We therefore concluded that sound energy above the upper frequency limit of the standard audiogram contributed significantly to the intelligibility of the digits.
For the power spectra in Fig. 2C, Butterworth filters (41) that are 3 dB down at the cutoff frequency were used to filter the noise. It is possible that the better mean SRT we found for the 8-kHz cutoff was due to the reduced noise energy (at −3 dB) and just below this frequency. To check this possibility, we collected additional SRTs using another filter (a Chebyshev filter) (41) that does not allow additional energy to pass at 8 kHz. The results did not differ significantly (P = 0.9) from those obtained using the Butterworth filter (SI Appendix, Fig. S2). It is also possible that slight gaps between the low-frequency arms of the filtered noises and the BB noise, created by the addition of the attenuated, same BB noise to the filtered maskers, may have facilitated lower-frequency sound perception. One of these gaps, in the 3.5- to 6-kHz frequency region of the 8-kHz noise (Fig. 2C), had the potential of confounding interpretation. To check whether the better mean SRT found for the 8-kHz low-pass filter was due to this gap, we corrected for the differences in spectra by adding independent samples of the attenuated BB noise to each low-pass filtered noise (SI Appendix, Fig. S3) and collected additional SRTs using the updated noise spectra. The results confirmed the consistent benefit of the 8-kHz low-pass filter over BB noise (mean SRT 2.6 dB more negative, P < 0.0001).
Thirty-four of the 60 listeners (57%) in the DIN sample had elevated EHF hearing threshold (>20-dB HL) at one or more frequencies in one or both ears. Of those 34, 22 listeners had a PTA-EHF threshold >20-dB HL. PTA-EHF across all listeners was significantly related to broadband, BB-SRT (Fig. 2E); those with more sensitive EHF hearing had more sensitive BB-SRT. However, the relation between PTA-EHF and SRT-8 kHz was nonsignificant (Fig. 2F). We initially found this result perplexing, because we expected the 8-kHz noise filter to provide better differentiation between listeners with and without EHF hearing loss. It is important to note, however, that SRT-8 kHz performance was markedly better across PTA, with a mean SRT of −13.2 dB, relative to the BB-SRT performance (mean = −9.9 dB). There was no significant correlation between PTA-SF and either BB-SRT (r = 0.17) or 8 kHz-SRT (r = 0.09). As discussed in further detail below, we suggest the pattern of results in Fig. 2 E and F is due to lack of EHF masking by the 8-kHz filtered noise relative to the BB noise.
Listeners in the DIN sample who self-reported difficulty hearing in noise had higher mean EHF thresholds than those reporting no difficulty, as found in the larger sample (df = 59, P < 0.0001; cf. Fig. 1F). Twelve of 13 listeners with self-reported difficulty and elevated EHF thresholds (green triangles in Fig. 2 E and F) had bilateral EHF hearing loss (2 to 4 elevated hearing thresholds in each ear >20-dB HL).
Discussion
Listeners in this study all had normal audiometry (≤20-dB HL) in the standard frequency range (0.25 to 8 kHz) but around two-thirds had some evidence of EHF hearing loss (>20-dB HL) and about one-third reported difficulty hearing in everyday noisy situations. We found that the extent of the specific EHF hearing loss was related to the number of individuals self-reporting difficulty hearing and to the DIN-SRT. Our results also showed that EHF hearing loss and self-reported difficulty are already widespread among people in their 20s, suggesting that, even at this young age, speech perception in challenging conditions is reduced. Healthy people in their 20s are sometimes assumed to have the most sensitive hearing of any age group (42). However, hearing very high-frequency tones is most sensitive in young children and hearing becomes progressively less sensitive throughout the remainder of life (43). For example, a downward decline for hearing 20 kHz starts from 4 to 6 y of age (43, 44).
Given the apparent competence of communication in most young adults, how important are EHFs for hearing? Self-report is playing an increasingly prominent role in hearing assessment. The recently recognized complexity of contributions to hearing, both within and outside the conventionally defined auditory system (1, 45), reduces the likelihood that one, or even a small number of tests is likely to capture the full experience of hearing. We found here that EHF hearing loss is related to self-reported difficulty hearing in noisy environments. Other studies have found similar poor EHF hearing in young adults reporting high levels of music exposure and otherwise normal audiograms (3).
Our findings also revealed that providing supplementary EHF stimulation in the DIN speech-in-noise task improved SRTs in individuals both with and without EHF hearing loss. In a study investigating the relationship between SRT in noise and self-reported hearing disability, it was found that adults who self-reported “difficulty” following conversation in noise had 2.7-dB poorer mean DIN-SRT than adults with “good” ability listening in noise (25). These and other data (33, 37) suggest that the mean 3.2-dB improved intelligibility found here after allowing access to EHFs is likely to be of substantial functional significance. To our knowledge, this is direct, sample-based evidence that EHFs enhance speech hearing ability. It adds to previous studies that have combined EHF with lower-frequency stimuli (46), used only lower-frequency stimuli (4 to 8 kHz) (47, 48), or presented clinical reports on individual cases (16).
A potential confusion in Results was the finding of relative lack of SRT sensitivity, particularly 8 kHz-SRT, to EHF thresholds (Fig. 2 E and F). At first pass, this finding appears to contradict the major conclusion that EHF hearing enhances speech perception in noise. However, using the 8-kHz filtered noise, EHF information is still available to those with mild EHF hearing loss without the 20-dB shadow cast across EHFs by the BB masker. With the extra BB masking, those with better EHF thresholds retained a (modest) SRT advantage, because the masking was preventing access to EHF information for those with mild EHF hearing loss.
Another possibility is that the relation between PTA-EHF and BB-SRT is driven by responses of EHF neurons to energy in the standard frequency bands. This could occur if the tails or low-frequency rising slopes of tuning curves of EHF neurons normally contribute to the SRT for elements of the speech signals within the standard frequency range (49). When EHF neurons have elevated thresholds, it may effectively remove that contribution to hearing in the standard frequency range, leading to an increase in SRT that could then be incorrectly attributed to EHF hearing loss. We think the tails are unlikely to contribute because they would have too high thresholds relative to the measured SRTs (−8 to −18 dB relative to a 65-dB SPL noise). It is harder to rule out such a contribution for the low-frequency rising slopes of tuning curves of EHF neurons having a characteristic frequency (CF) near the 8-kHz cutoff. We suppose that human tuning curves of neurons with a CF around 10 kHz, the lowest EHF contributing to the PTA-EHF, might make a small contribution. However, most of the EHF hearing loss is occurring at higher frequencies. To examine further the contribution of EHF energy, a future experiment could filter EHF information from the digits, rather than the noise, and probe effects on SRT.
Benefits of the DIN include that it is procedurally very unchallenging, as discussed above, and it is available in self-administered forms, delivered on-line (33) or by smartphone (38). A screening test takes only about 3 min. However, there has also been concern that, since English digits may be largely distinguished based on their low-frequency vowel formants, the DIN could be insensitive to higher-frequency hearing loss (36). The results from this and a previous study (5) clearly show this concern is unfounded. Sensitizing the DIN to high-frequency hearing by using low-pass filtered noise and broad-spectrum digits improved performance of people with normal hearing.
Another concern about the digits is that they represent a “closed-set” (i.e., limited) word corpus (33). In some respects, this could make the digits less generalizable to real life situations, since “open-set” words or sentences are more representative of those situations (33). On the other hand, the digits are an extreme form of simple speech stimuli that were selected originally for an easy, portable, and self-administered screening test of hearing loss (26), a role this research was in part designed to enhance. More than any other speech-in-noise test of which we are aware, the DIN has been shown to correlate strongly with audiometry (33). This is likely because highly overlearned digits are stimuli that result in very sensitive and reproducible SRTs. They have a relative lack of cognitive (22) or linguistic influence, resulting in a test with, for example, minimal practice effects (33). Further research using less predictable target stimuli may be helpful to understand perceptual mechanisms underlying the self-report data presented here in addition to other aspects of typical listening such as context (50).
The sensitivity of the DIN to EHF cues found here was unexpected. One reason is that speech spectrograms are typically shown only to 4 to 8 kHz (51, 52), reflecting a view that higher frequencies are of low energy and unimportant for speech perception. When we extended the spectrogram of the digits used in this study beyond 8 kHz, we found considerable EHF energy. It appears that this energy is used by listeners with sufficient EHF hearing to extract useful cues to identify the digits in noise. Together with the finding of widespread EHF hearing loss, this may help explain why many people with normal standard audiograms have difficulty hearing in noisy places.
Clinical Implications.
Following on from the great success of “universal” neonatal hearing screening that has reduced dramatically the age at which hearing aids and cochlear implants are fitted to infants with hearing impairment (53), there has been considerable discussion among policy makers as to whether there should be a “universal hearing screen” in adulthood (54). This discussion has centered around the tendency for people in their 50s and older to delay by many years the age at which they start using hearing aids. If the criterion for hearing loss is extended to the EHF range of frequencies, many more people, perhaps the majority of adults, might be considered as candidates for some form of hearing intervention.
This raises the question of whether very early EHF “hearing loss” is a harbinger of more disabling hearing loss in the standard range of frequencies and later in life (“presbycusis”). It has long been known that hearing loss in older people starts at the higher frequencies within the standard range and moves steadily toward the lower frequencies, considered most vital for speech perception (55, 56). For EHF, thresholds are only measurable in about half the older population, presumably reflecting individual differences in sensitivity highlighted here as well as the high absolute sound levels needed to achieve ISO standards for hearing level in the average listener. Where it can be measured, the pattern of correlation between age, hearing level, and frequency is consistent with a downward trend of the upper limit of EHF hearing that precedes presbycusis (57, 58). It seems likely that EHF hearing earlier in life could therefore be a sensitive predictor of later disability. Further research is needed to test this prediction, but a universal EHF hearing screen (59) in late adolescence or early adulthood could lead to identification of vulnerable individuals and more timely advice regarding prevention and conservation among susceptible individuals.
The mechanisms of EHF hearing loss are presumably similar to those elsewhere in the frequency spectrum and may result from genetic effects, noise, ototoxicity, infections, and aging (60). However, the early appearance and high prevalence of EHF hearing loss suggest possible additional mechanisms. One such known mechanism is early childhood otitis media with effusion (OME) (61). OME is very common from birth to 5 to 6 y of age (62). It is associated with EHF hearing loss and with ototoxicity in the extreme basal end of the cochlea (63). Another possibility is that some form of hidden hearing loss, such as a widespread neuropathy throughout the cochlea, affects speech perception in noise. Recently, Wu et al. (64) investigated cochlear neuropathy in a temporal-bone study of 29 humans aged 0 to 89 y at autopsy. Their findings showed that neurodegeneration of auditory nerve peripheral axons in the aging ear outpaces loss of inner hair cells and spiral ganglion somata across the audiometric standard frequency range, likely contributing to age-related difficulty hearing in noise. Insufficient data were available to test predictions from this study concerning EHF hearing loss beginning in childhood, but it is notable that interpolation from the available data suggested possible neurodegeneration by the age of 20.
A possible reason for the historic neglect of EHF hearing in clinical audiology may be that hearing aids do not typically have the ability to provide amplification at higher frequencies. New technologies are helping correct this shortcoming (18, 65), but further research and wider recognition of the benefits of EHF hearing are also required.
Methods
Participants.
A female Midwestern American English speaker was recruited to record the digits. A total of 116 listeners of average age 29.5 y (SD = 9.1; 67 females) with normal hearing sensitivity (≤20-dB HL) in the conventional range of audiometric frequencies (0.25 to 8 kHz) were recruited via flyers distributed in the community and at Cincinnati Children’s Hospital Medical Center (CCHMC) and the University of Cincinnati. Seventy people (M = 29 y/o, SD = 10, 44 female) from the original 116 participants participated in DIN testing. Ten of these people performed only homogenization testing (see below), and 60 participated in the DIN part of the main study. All participants were paid and gave written informed consent. The consent form and the experimental procedures were approved by the CCHMC Institutional Review Board.
Audiological Testing.
Conventional pure-tone audiometry was performed using an Interacoustics Equinox 2.0 model audiometer calibrated to ANSI 3.6 2010. Participants were tested in a double-walled sound booth (Acoustic Systems, Austin, TX) meeting criteria of the American National Standards Institute (ANSI) S3.1–1999 for audiometric test rooms. Air conduction thresholds were obtained using Sennheiser HDA300 circumaural headphones.
DIN Testing.
The DIN test was developed based on the method described by Vlaming et al. (5), Smits et al. (33), and Potgieter et al. (38).
Speech Stimuli and Masking Noise.
A list of 20 triplets was made from 10 digits (0 through 9), where each digit occurred 2 times at each position (a, b, and c). A BB speech-shaped noise masker was developed by obtaining an average frequency spectrum across all digits. Low-pass noise versions of the masker were constructed using a 10th-order Butterworth low-pass filter with 3 different cutoff frequencies (2, 4, and 8 kHz), summed with a 15-dB attenuated version of the original broadband noise. Noise was started 100 ms before and ended 100 ms after each triplet presentation and was continuous between the test triplets. To achieve equal intelligibility, digits were homogenized with respect to SRT. For this purpose, the average of scores for each digit at each position was subtracted from the overall mean SRT to give the decibel difference required for homogenizing the digits. The difference in decibel was then transformed into a factor to multiply with the digit waveforms to ensure that each digit had a 50% chance of being recognized correctly at the same SNR (see SI Appendix for more details on development and homogenization of the DIN test).
DIN Procedure.
A random set of 25 triplets was made from the homogenized digits. The triplets were presented diotically over Sennheiser (Wedemark, Germany) HD 25-1 headphones via a Maya 22 USB sound card in a double-walled sound booth (Acoustic Systems) meeting criteria of ANSI S3.1–1999 for audiometric test rooms. A one-down, one-up adaptive procedure was used to obtain the SRT. In this procedure, following each correct response, the SNR level was reduced by 2 dB, and following each incorrect response, the SNR was increased by 2 dB. All 3 digits had to be correct for the trial to be a correct response. Total presentation duration was, again, 3.25 s The initial SNR level was −4 dB, about 8 to 10 dB above the expected SRTs for normal hearing listeners in each test condition. SRT was estimated as average SNR of the final 19 of 25 total trials.
Analysis.
R software (version 3.4.2) was used for statistical computing and graphics. One-way ANOVA and Tukey post hoc test were used for group comparisons. All P values were 2-sided, and a P value of <0.05 was accepted as the statistical significance level. ROC curves were calculated using the pROC package in R software.
Supplementary Material
Acknowledgments
This study was supported by NIH grant R21DC016241 and by the Cincinnati Children’s Hospital Research Foundation. D.R.M. receives support from the National Institute for Health Research (NIHR) Manchester Biomedical Research Center. We thank Drs. Kevin Munro and Harvey Dillon for commenting on drafts of the paper.
Footnotes
Conflict of interest statement: D.W.S. is cofounder, lead inventor, and shareholder, and D.R.M. is a scientific advisor and shareholder of the hearX Group. No author has any other financial or nonfinancial interest in relation to the work described.
This article is a PNAS Direct Submission.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1903315116/-/DCSupplemental.
References
- 1.Pienkowski M., On the etiology of listening difficulties in noise despite clinically normal audiograms. Ear Hear. 38, 135–148 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Badri R., Siegel J. H., Wright B. A., Auditory filter shapes and high-frequency hearing in adults who have impaired speech in noise performance despite clinically normal audiograms. J. Acoust. Soc. Am. 129, 852–863 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Liberman M. C., Epstein M. J., Cleveland S. S., Wang H., Maison S. F., Toward a differential diagnosis of hidden hearing loss in humans. PLoS One 11, e0162726 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Musiek F. E., Chermak G. D., Bamiou D., Shinn J., CAPD: The most common ‘hidden hearing loss’. ASHA Lead. 23, 6–9 (2018). [Google Scholar]
- 5.Vlaming M. S., MacKinnon R. C., Jansen M., Moore D. R., Automated screening for high-frequency hearing loss. Ear Hear. 35, 667–679 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Füllgrabe C., Moore B. C., Stone M. A., Age-group differences in speech identification despite matched audiometrically normal hearing: Contributions from auditory temporal processing and cognition. Front. Aging Neurosci. 6, 347 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Brattico E., et al. , Long-term exposure to occupational noise alters the cortical organization of sound processing. Clin. Neurophysiol. 116, 190–203 (2005). [DOI] [PubMed] [Google Scholar]
- 8.Liberman M. C., Noise-induced and age-related hearing loss: New perspectives and potential therapies. F1000 Res. 6, 927 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Saunders G. H., Haggard M. P., Field D., Clinical diagnosis and management of obscure auditory dysfunction. Br. J. Audiol. 23, 358 (1989). [DOI] [PubMed] [Google Scholar]
- 10.Schaette R., McAlpine D., Tinnitus with a normal audiogram: Physiological evidence for hidden hearing loss and computational model. J. Neurosci. 31, 13452–13457 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Rodríguez Valiente A., Roldán Fidalgo A., Villarreal I. M., García Berrocal J. R., Extended high-frequency audiometry (9,000–20,000 Hz). Usefulness in audiological diagnosis. Acta Otorrinolaringol. Esp. 67, 40–44 (2016). [DOI] [PubMed] [Google Scholar]
- 12.Knight K. R., Kraemer D. F., Winter C., Neuwelt E. A., Early changes in auditory function as a result of platinum chemotherapy: Use of extended high-frequency audiometry and evoked distortion product otoacoustic emissions. J. Clin. Oncol. 25, 1190–1195 (2007). [DOI] [PubMed] [Google Scholar]
- 13.Le Prell C. G., Spankovich C., Lobariñas E., Griffiths S. K., Extended high-frequency thresholds in college students: Effects of music player use and other recreational noise. J. Am. Acad. Audiol. 24, 725–739 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Vitela A. D., Monson B. B., Lotto A. J., Phoneme categorization relying solely on high-frequency energy. J. Acoust. Soc. Am. 137, EL65–EL70 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Apoux F., Bacon S. P., Relative importance of temporal information in various frequency regions for consonant identification in quiet and in noise. J. Acoust. Soc. Am. 116, 1671–1680 (2004). [DOI] [PubMed] [Google Scholar]
- 16.Collins M. J., Cullen J. K. Jr, Berlin C. I., Auditory signal processing in a hearing-impaired subject with residual ultra-audiometric hearing. Audiology 20, 347–361 (1981). [DOI] [PubMed] [Google Scholar]
- 17.Moore D., Hunter L., Munro K., Benefits of extended high-frequency audiometry for everyone. Hear. J. 70, 50–54 (2017). [Google Scholar]
- 18.Levy S. C., Freed D. J., Nilsson M., Moore B. C. J., Puria S., Extended high-frequency bandwidth improves speech reception in the presence of spatially separated masking speech. Ear Hear. 36, e214–e224 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Shaw G. M., Jardine C. A., Fridjhon P., A pilot investigation of high-frequency audiometry in obscure auditory dysfunction (OAD) patients. Br. J. Audiol. 30, 233–237 (1996). [DOI] [PubMed] [Google Scholar]
- 20.Killion M. C., Niquette P. A., What can the pure-tone audiogram tell us about a patient’s SNR loss? Hear. J. 53, 46–53 (2000). [Google Scholar]
- 21.Vermiglio A. J., Soli S. D., Freed D. J., Fisher L. M., The relationship between high-frequency pure-tone hearing loss, hearing in noise test (HINT) thresholds, and the articulation index. J. Am. Acad. Audiol. 23, 779–788 (2012). [DOI] [PubMed] [Google Scholar]
- 22.Moore D. R., et al. , Relation between speech-in-noise threshold, hearing loss and cognition from 40–69 years of age. PLoS One 9, e107720 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Summers V., Makashay M. J., Theodoroff S. M., Leek M. R., Suprathreshold auditory processing and speech perception in noise: Hearing-impaired and normal-hearing listeners. J. Am. Acad. Audiol. 24, 274–292 (2013). [DOI] [PubMed] [Google Scholar]
- 24.Jansen S., Luts H., Wagener K. C., Frachet B., Wouters J., The French digit triplet test: A hearing screening tool for speech intelligibility in noise. Int. J. Audiol. 49, 378–387 (2010). [DOI] [PubMed] [Google Scholar]
- 25.Smits C., Kramer S. E., Houtgast T., Speech reception thresholds in noise and self-reported hearing disability in a general adult population. Ear Hear. 27, 538–549 (2006). [DOI] [PubMed] [Google Scholar]
- 26.Smits C., Kapteyn T. S., Houtgast T., Development and validation of an automatic speech-in-noise screening test by telephone. Int. J. Audiol. 43, 15–28 (2004). [DOI] [PubMed] [Google Scholar]
- 27.Miller G. A., Heise G. A., Lichten W., The intelligibility of speech as a function of the context of the test materials. J. Exp. Psychol. 41, 329–335 (1951). [DOI] [PubMed] [Google Scholar]
- 28.Plomp R., A signal-to-noise ratio model for the speech-reception threshold of the hearing impaired. J. Speech Hear. Res. 29, 146–154 (1986). [DOI] [PubMed] [Google Scholar]
- 29.Ramkissoon I., Proctor A., Lansing C. R., Bilger R. C., Digit speech recognition thresholds (SRT) for non-native speakers of English. Am. J. Audiol. 11, 23–28 (2002). [DOI] [PubMed] [Google Scholar]
- 30.Rudmin F., Speech reception thresholds for digits. J. Aud. Res. 27, 15–21 (1987). [PubMed] [Google Scholar]
- 31.Wilson R. H., Burks C. A., Weakley D. G., A comparison of word-recognition abilities assessed with digit pairs and digit triplets in multitalker babble. J. Rehabil. Res. Dev. 42, 499–510 (2005). [DOI] [PubMed] [Google Scholar]
- 32.Smits C., Houtgast T., Recognition of digits in different types of noise by normal-hearing and hearing-impaired listeners. Int. J. Audiol. 46, 134–144 (2007). [DOI] [PubMed] [Google Scholar]
- 33.Smits C., Theo Goverts S., Festen J. M., The digits-in-noise test: Assessing auditory speech recognition abilities in noise. J. Acoust. Soc. Am. 133, 1693–1706 (2013). [DOI] [PubMed] [Google Scholar]
- 34.Ozimek E., Kutzner D., Sęk A., Wicher A., Development and evaluation of Polish digit triplet test for auditory screening. Speech Commun. 51, 307–316 (2009). [Google Scholar]
- 35.Koopmans W. J. A., Goverts S. T., Smits C., Speech recognition abilities in normal-hearing children 4 to 12 Years of age in stationary and interrupted noise. Ear Hear. 39, 1091–1103 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Smits C., Watson C. S., Kidd G. R., Moore D. R., Goverts S. T., A comparison between the Dutch and American-English digits-in-noise (DIN) tests in normal-hearing listeners. Int. J. Audiol. 55, 358–365 (2016). [DOI] [PubMed] [Google Scholar]
- 37.Jansen S., Luts H., Dejonckere P., van Wieringen A., Wouters J., Efficient hearing screening in noise-exposed listeners using the digit triplet test. Ear Hear. 34, 773–778 (2013). [DOI] [PubMed] [Google Scholar]
- 38.Potgieter J. M., Swanepoel W., Myburgh H. C., Hopper T. C., Smits C., Development and validation of a smartphone-based digits-in-noise hearing test in South African English. Int. J. Audiol. 55, 405–411 (2015). [DOI] [PubMed] [Google Scholar]
- 39.Swanepoel W., Myburgh H. C., Howe D. M., Mahomed F., Eikelboom R. H., Smartphone hearing screening with integrated quality control and data management. Int. J. Audiol. 53, 841–849 (2014). [DOI] [PubMed] [Google Scholar]
- 40.Mahomed-Asmail F., Swanepoel W., Eikelboom R. H., Myburgh H. C., Hall J. 3rd, Clinical validity of hearScreen™ smartphone hearing screening for school children. Ear Hear. 37, e11–e17 (2016). [DOI] [PubMed] [Google Scholar]
- 41.Malica T., Shekhar S., Ali Z., Design and comparison of butterworth and Chebyshev type-1 low pass filter using Matlab. Int. J. Eng. Sci. 4, 423–438 (2011). [Google Scholar]
- 42.International Organization for Standardization , Acoustics-statistical distribution of hearing thresholds of otologically normal persons in the age range from 18 years to 25 years under free-field listening conditions (ISO/DIS Standard No. 28961). https://www.iso.org/standard/45104.html. Accessed 17 October 2019.
- 43.Rodríguez Valiente A., Trinidad A., García Berrocal J. R., Górriz C., Ramírez Camacho R., Extended high-frequency (9–20 kHz) audiometry reference thresholds in 645 healthy subjects. Int. J. Audiol. 53, 531–545 (2014). [DOI] [PubMed] [Google Scholar]
- 44.Trehub S. E., Schneider B. A., Morrongiello B. A., Thorpe L. A., Auditory sensitivity in school-age children. J. Exp. Child Psychol. 46, 273–285 (1988). [DOI] [PubMed] [Google Scholar]
- 45.Moore D. R., Challenges in diagnosing auditory processing disorder. Hear. J. 71, 32–36 (2018). [Google Scholar]
- 46.Besser J., Festen J. M., Goverts S. T., Kramer S. E., Pichora-Fuller M. K., Speech-in-speech listening on the LiSN-S test by older adults with good audiograms depends on cognition and hearing acuity at high frequencies. Ear Hear. 36, 24–41 (2015). [DOI] [PubMed] [Google Scholar]
- 47.Moore B. C. J., A review of the perceptual effects of hearing loss for frequencies above 3 kHz. Int. J. Audiol. 55, 707–714 (2016). [DOI] [PubMed] [Google Scholar]
- 48.Stelmachowicz P. G., Pittman A. L., Hoover B. M., Lewis D. E., Moeller M. P., The importance of high-frequency audibility in the speech and language development of children with hearing loss. Arch. Otolaryngol. Head Neck Surg. 130, 556–562 (2004). [DOI] [PubMed] [Google Scholar]
- 49.Kiang N. Y. S., Moxon E. C., Tails of tuning curves of auditory-nerve fibers. J. Acoust. Soc. Am. 55, 620–630 (1974). [DOI] [PubMed] [Google Scholar]
- 50.Heinrich A., Henshaw H., Ferguson M. A., The relationship of speech intelligibility with hearing sensitivity, cognition, and perceived hearing difficulties varies for different speech perception tests. Front. Psychol. 6, 782 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Ladefoged P., A Course in Phonetics (Harcourt Brace Jovanovich College Publishers, Fort Worth, ed. 3, 1993). [Google Scholar]
- 52.Moore B. C. J., An Introduction to the Psychology of Hearing (Emerald, Bingley, ed. 6, 2012). [Google Scholar]
- 53.Yoshinaga-Itano C., Sedey A. L., Wiggin M., Chung W., Early hearing detection and vocabulary of children with hearing loss. Pediatrics 140, e20162964 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Smith P. A., et al. , Adult hearing screening: What comes next? Int. J. Audiol. 50, 610–612 (2011). [DOI] [PubMed] [Google Scholar]
- 55.Gates G. A., Cooper J. C. Jr, Kannel W. B., Miller N. J., Hearing in the elderly: The framingham cohort, 1983–1985. Part I. Basic audiometric test results. Ear Hear. 11, 247–256 (1990). [PubMed] [Google Scholar]
- 56.Gates G. A., Cooper J. C., Incidence of hearing decline in the elderly. Acta Otolaryngol. 111, 240–248 (1991). [DOI] [PubMed] [Google Scholar]
- 57.Matthews L. J., Lee F. S., Mills J. H., Dubno J. R., Extended high-frequency thresholds in older adults. J. Speech Lang. Hear. Res. 40, 208–214 (1997). [DOI] [PubMed] [Google Scholar]
- 58.Lee F. S., Matthews L. J., Dubno J. R., Mills J. H., Longitudinal study of pure-tone thresholds in older persons. Ear Hear. 26, 1–11 (2005). [DOI] [PubMed] [Google Scholar]
- 59.Rieke C. C., et al. , Fixed-level frequency threshold testing for ototoxicity monitoring. Ear Hear. 38, e369–e375 (2017). [DOI] [PubMed] [Google Scholar]
- 60.Moore B. C. J., Cochlear Hearing Loss: Physiological, Psychological and Technical Issues (John Wiley & Sons, Chichester, England; Hoboken, NJ, ed. 2, 2007). [Google Scholar]
- 61.Paradise J. L., et al. , Otitis media in 2253 pittsburgh-area infants: Prevalence and risk factors during the first two years of life. Pediatrics 99, 318–333 (1997). [DOI] [PubMed] [Google Scholar]
- 62.Hogan S. C. M., Moore D. R., Impaired binaural hearing in children produced by a threshold level of middle ear disease. J. Assoc. Res. Otolaryngol. 4, 123–129 (2003). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Hunter L. L., et al. , High frequency hearing loss associated with otitis media. Ear Hear. 17, 1–11 (1996). [DOI] [PubMed] [Google Scholar]
- 64.Wu P. Z., et al. , Primary neural degeneration in the human cochlea: Evidence for hidden hearing loss in the aging ear. Neuroscience 407, 8–20 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Puria S., Maria P. L. S., Perkins R., Temporal-bone measurements of the maximum equivalent pressure output and maximum stable gain of a light-driven hearing system that mechanically stimulates the umbo. Otol. Neurotol. 37, 160–166 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.