Perceptual Adaptation to Room Acoustics and Effects on Speech Intelligibility in Hearing-Impaired Populations

Pavel Zahorik; Eugene Brandewie

. Author manuscript; available in PMC: 2013 Feb 26.

Published in final edited form as: Proc. Forum Acust. 2011 Jun 27:2167–2172.

Perceptual Adaptation to Room Acoustics and Effects on Speech Intelligibility in Hearing-Impaired Populations

Pavel Zahorik ¹, Eugene Brandewie ¹

PMCID: PMC3582192 NIHMSID: NIHMS395059 PMID: 23455358

Summary

Recent evidence suggests that brief listening exposure to a reverberant room environment can improve closed-set speech intelligibility in that same environment. For normal-hearing populations, this room adaptation effect can result in improvements in intelligibility of as much as 20%, but depends strongly on the reverberation time of the room, and appears to require binaural input. Because poor speech intelligibility in reverberation is a common complaint for hearing-impaired listeners, it is important to determine how room adaptation might impact speech intelligibility for hearing-impaired populations. Here, room adaptation was quantified for a sample of listeners with sensorineural hearing loss that varied in severity and configuration. Speech reception thresholds (SRTs) were measured both with and without prior listening exposure to the room environment. Headphone-based auralization techniques were used to simulate the acoustics of various listening rooms, ranging from anechoic to highly reverberant space (broadband T₆₀ = 3 s). Although SRTs both with and without prior room exposure were found to be generally elevated relative to normal-hearing listeners, the room adaptation effect, as defined by the relative decrease in SRT with room exposure, was comparable on average to that observed for normal-hearing listeners. This result is consistent with the view that room adaptation effects result from central auditory processing mechanisms.

1. Introduction

Aspects of room acoustics have long been known to cause problems for speech communication. For example, increasing amounts of room reverberation are known to significantly degrade the speech signal, and these degradations can result in speech understanding deficits [1]. Recent results suggest, however, that some of this degradation may be perceptually offset when listeners are provided with prior auditory exposure to the room. Such room exposure has been shown to objectively improve speech intelligibility [2] and to modify speech perception [3, 4]. This phenomenon appears to be similar to the adaptive buildup of echo suppression observed in situations when only a single echo is present (see [5] for review). Because speech perception in reverberant rooms is known to be particularly problematic for listeners with hearing-impairment [6], it is important to determine whether hearing-impaired listeners might obtain similar benefits from prior room listening exposure as do normally-hearing listeners. This is the goal of the current study, which builds on previous work with normally-hearing listeners [2] using the same testing paradigm.

2. Methods

2.1. Listeners

A total of 26 listeners participated in this study. The hearing-impaired group was composed of 12 listeners (1 male, 11 female) ranging in age from 23 – 82 years. All had adult-onset bilaterally symmetrical (the median interaural difference in thresholds across all frequencies was 5 dB) sloping high-frequency sensorineural hearing loss. Table I displays age and pure-tone averages of airconductions thresholds at .5, 1, and 2 kHz for the left and right ear of each listener in this group. Five listeners in this group routinely used hearing aids, although all testing in this study was conducted un-aided. The normally-hearing group consisted of 14 young adults (6 male, 8 female) ranging in age from 17 – 24 years. All had pure-tone air-conduction thresholds of < 25 dB HL [7] from .25 – 8 kHz. Mean (±1 standard deviation) pure-tone air-conduction thresholds are shown in Figure 1 for both groups of listeners.

Table I.

Hearing-impaired listener information: Identification code, age (in years), pure-tone average (PTA) of air-conduction thresholds for left and right ears (see text for details), signal-to-noise ratio (SNR) testing range, supplemental signal+noise gain (see text for details), and matrix of completed (shaded) “No Carrier” (NC) and “Sentence Carrier” (SC) conditions in each test room.

graphic file with name nihms-395059-f0002.jpg

Open in a new tab

Mean pure-tone air-conduction audiograms for the left and right ears (filled and open symbols, respectively) for normally-hearing and hearing-impaired listeners. Shaded regions indicate 1 standard deviation about the mean.

2.2. Speech Corpus

Speech materials were taken from the Coordinate Response Measure (CRM) corpus [8].

2.3. Room Simulation

Virtual auditory space techniques were used to simulate listening conditions in five rooms, ranging from anechoic to highly-reverberant. The rooms had identical dimensions of 5.7 × 4.3 × 2.6 m, but differed in the absorptive properties of the simulated boundary surfaces. Table II displays broadband reverberation times, T₆₀, and clarity indices, C₅₀ [9], for each room. Each room was simulated using a simple model of a binaural room impulse response (BRIR), which was constructed using an image-model [10] to simulate early reflections and a statistical model of the late reverberant energy. The simulation techniques were identical to those used in a previous related study [2], and have been shown to produce results that are perceptually similar to those derived from measurements in a real room [11].

Table II.

Broadband (125 – 4000 Hz) reverberation time, T₆₀, and Clarity Index, C₅₀, measures for each test room.

Room	T₆₀ (s)	C₅₀ (dB)
0	<0.01	>60
1	0.31	25.8
2	0.42	13.4
3	1.2	3.5
4	3	-6.6

Open in a new tab

In all rooms, the target speech was simulated at a spatial location directly in front of the listener at a distance of 1.4 m, and a broadband noise masker was presented at a simulated location opposite the listener's right ear, also at 1.4 m. Signal-to-noise ration (SNR) varied over a range of 32 dB in 4 dB steps. For normally-hearing listeners, SNRs ranged from -28 to +4 dB. Pilot testing was used to determine an appropriate range of SNRs for each hearing-impaired listener. These ranges are displayed in Table I.

2.4. Design and Procedure

The experimental design and procedures were fundamentally identical to those used in previous work [2]. Speech reception thresholds (SRTs) were measured under two different listening exposure conditions. The “No Carrier” condition (NC) limited prior room listening exposure by presenting listeners with only the color/number targets from the CRM speech corpus and selecting the simulated room at random from trial-to-trial within a block of trials. The “Sentence Carrier” condition (SC) enhanced room exposure by presenting listeners with a two-sentence carrier phrase (~10 s duration) preceding the color/number targets and by holding the simulated room constant within a block of trials. Each block contained 54 trials (6 repetitions at each on 9 SNRs). Five trial blocks were completed for a given room/condition combination. It should be noted that not all listeners completed all room/condition combinations. Completed combinations for the hearing-impaired group are shown in Table I. The dataset is much more complete for the normally-hearing group, with the notable exception that only two listeners have completed conditions involving Room 3. Portions of the normally-hearing dataset (Room 0 and Room 2) appear in a previous publication [2].

All sounds were presented over equalized Beyerdynamic DT-990-Pro headphones using a Digital Audio Labs CardDeluxe for D/A conversion (24-bit, 44.1 kHz) within a double-walled sound isolation chamber (Acoustic Systems). For the normal hearing group, all sounds were presented at moderate levels (approximately 65 dB SPL). During pilot testing, hearing-impaired listeners were given the option to adjust the sound levels (in 5 dB steps) such that the speech signals were comfortably audible. Table I displays any supplemental gain applied to both signal and noise for individual listeners in this group relative to the nominal levels used for normally-hearing listeners. Listeners entered their responses from the CRM task on a graphical user interface. All signal processing and data collection applications were implemented using Matlab software (Mathworks, Inc.).

2.5. Data Analysis

The proportion of correct responses (PC) was computed separately for each listener in all combinations of exposure condition, room, and SNR. Logistic functions of the following form were fit to the data using a maximum likelihood procedure [12],

Est . P C (x) = (1 - δ) \times \frac{1}{1 + exp (- (α - x) ∕ β)} + δ

where α is the estimated threshold parameter, β is the estimated slope parameter, and δ is chance-performance level (1/32 in the CRM task). 95% confidence intervals were obtained for each fitted function's threshold value, PC = .516, using a bootstrapping procedure [13].

3. Results

Overall, the data were well-approximated by the logistic function fits (R² > .44 in all cases, with a median R² of .97). Figure 2 displays representative function fits for one listener (LMN) for each of the experimental conditions in Room 1. The room adaptation effect may be observed in Figure 2 as a decrease in threshold between the NC and SC conditions. This is presumably due to a buildup of echo suppression. Slopes of the fitted functions in Figure 2 are relatively homogeneous, and this is representative of the fits in general across listeners, conditions, and rooms. Estimated psychometric functions are therefore well-described by their threshold parameters alone.

Proportion of correct speech target identifications as a function of signal-to-noise ratio and estimated psychometric functions for a single listener (LMN) in Room 1 for both the No Carrier (NC) and Sentence Carrier (SC) experimental conditions. Speech reception threshold (SRT) estimates (with 95% confidence intervals) are displayed for each fitted function. Speech intelligibility enhancement with prior listening exposure (“buildup”) is indicated by decreased SC SRT relative to the NC SRT

Figure 3 displays speech reception thresholds in all rooms for the NC condition. Results for both normally-hearing and hearing-impaired listeners are shown. As expected, generally elevated SRTs are observed for the hearing-impaired listeners relative to normally hearing listeners. In addition, SRTs increase with increasing reverberation for both groups of listeners, consistent with previous work related to the effects of room reverberation on speech intelligibility [14].

Summary of SRTs in the NC condition in all rooms for all listeners. SRTs for the normally-hearing group are indicated by black symbols. SRTs for the hearing-impaired group are indicated by red symbols. Subject identification codes are also indicated for all hearing-impaired listeners. Bars show 95% confidence intervals about each SRT

Figure 4 plots the difference in SRT between the NC and SC conditions. Positive values indicate a reduction in SRT (improved performance) in the SC condition relative to the NC condition. Bars indicate 95% confidence limits for each difference, estimated using information from the bootstrapped confidence limits about individual-listener SRTs in each condition (see Figure 2 for examples). Determining these confidence limits is complicated by the fact that estimates of a given listener's thresholds are likely not independent across different experimental conditions. We therefore computed confidence limits using the following relationship for the variance of the difference between two random variables, a and b:

var (a - b) = var (a) + var (b) - 2 r \sqrt{var (a)} \sqrt{var (b)}

where r is the Pearson correlation between a and b. Confidence limits in Figure 4 were computed based on a value of r = 0.7.

Difference in SRT values between the NC and SC conditions in all rooms. Data for both normally-hearing (black symbols) and hearing-impaired (red symbols) are shown. Positive values indicate lower SRT (better performance) in the SC condition relative to the NC condition. Bars show 95% confidence intervals about each SRT difference (see text for details). Subject identification codes are indicated for all hearing-impaired listeners.

At least two noteworthy patterns are observable in the data displayed in Figure 4. First, the size of the adaptation effect appears to depend on the simulated room. In general, consistent adaptation effects are not observed in anechoic space (Room 0) or highly reverberant space (Room 4), but are increasingly observed as room reverberation is increased over an intermediate range (Rooms 1 – 3). These are important results, because they suggest that the effect is linked specifically to the acoustical properties of the room (since the effects are not observed without a room, i.e. in anechoic space) and that the effect is strongest in moderately reverberant rooms. Similar results have been reported in related studies from our laboratory [15]. For reference, the average SRT improvement in Room 2 was 2.7 dB for the normally-hearing listeners. This corresponds to a greater than 18% improvement in speech understanding in the SC condition relative to threshold SNR for the NC condition [2].

A second, and perhaps more practically important pattern observable in Figure 4 is that the effect sizes do not appear to be markedly different between the normally-hearing and hearing-impaired groups of listeners. This pattern was confirmed by pooling the data across rooms, and comparing mean effect sizes from each group of listeners. No statistically significant difference in effect sizes was found, t(95) = -.389, p = .698. Although increases in the effect size variability can be observed in certain rooms (e.g. Room 2) for the hearing-impaired group, overall the similarity in effect sizes is surprising, given the large differences in NC SRTs between the two groups (Figure 3).

This similarity in adaptation effects between normally-hearing and hearing-imparied groups is important, because it appears to minimize the role of the auditory periphery such effects. It is also consistent with other studies related aspects of reflective sound processing that are thought to be mediated by centralized brain mechanisms [16]. Although the specifics of these mechanisms have yet to be identified, it has been suggested that the mechanisms involve high-level perceptual calibration to particular aspects of the listening environment's acoustics [17, 18], perhaps related to its modulation transfer function [19]. Regardless of cause, the fact that hearing impaired listeners show similar improvements in speech understanding in reverberant rooms when allowed brief periods of prior listening exposure may have important practical implications for aural rehabilitation.

4. Conclusions

Hearing-impaired listeners appear to derive roughly similar benefits from prior listening exposure to a room as do normally-hearing listeners. Brief periods (on the order of seconds) of exposure result in improvements in speech reception threshold of 2 to 3 dB, which correspond to improvements in speech intelligibility of nearly 20%. These improvements appear to be specific to rooms (they are not consistently observed in anechoic space) and are strongest for rooms with moderate amounts of reverberation (0.3 < T₆₀ < 1.2 s).

Acknowledgments

Work supported by NIH R01DC008168.

References

1.Knudsen VO. The hearing of speech in auditoriums. J. Acoust. Soc. Am. 1929;1:56–82. [Google Scholar]
2.Brandewie E, Zahorik P. Prior listening in rooms improves speech intelligibility. J. Acoust. Soc. Am. 2010;128:291–299. doi: 10.1121/1.3436565. [DOI] [PMC free article] [PubMed] [Google Scholar]
3.Watkins AJ. Perceptual compensation for effects of reverberation in speech identification. J. Acoust. Soc. Am. 2005;118:249–262. doi: 10.1121/1.1923369. [DOI] [PubMed] [Google Scholar]
4.Watkins AJ. Perceptual compensation for effects of echo and of reverberation on speech identification. Acta Acust. united Ac. 2005;91:892–901. doi: 10.1121/1.1923369. [DOI] [PubMed] [Google Scholar]
5.Clifton RK, Freyman RL. The precedence effect: Beyond echo suppression. In: Gilkey RH, Anderson TR, editors. Binaural and Spatial Hearing in Real and Virtual Environments. Erlbaum; Mahwah, New Jersey: 1997. [Google Scholar]
6.Nabelek AK, Pickett JM. Monaural and binaural speech perception through hearing aids under noise and reverberation with normal and hearing-impaired listeners. J. Speech Hear. Res. 1974;17:724–739. doi: 10.1044/jshr.1704.724. [DOI] [PubMed] [Google Scholar]
7.ANSI-S3.9, American National Standard specification for audiometers. American National Standards Institute; New York: 1989. [Google Scholar]
8.Bolia RS, Nelson WT, Ericson MA, Simpson BD. A speech corpus for multitalker communications research. J. Acoust. Soc. Am. 2000;107:1065–1066. doi: 10.1121/1.428288. [DOI] [PubMed] [Google Scholar]
9.ISO-3382, Acoustics - Measurement of the reverberation time of rooms with reference to other acoustical parameters. International Organization for Standardization; Geneva: 1997. [Google Scholar]
10.Allen JB, Berkley DA. Image method for efficiently simulating small-room acoustics. J. Acoust. Soc. Am. 1979;65:943–950. [Google Scholar]
11.Zahorik P. Perceptually relevant parameters for virtual listening simulation of small room acoustics. J. Acoust. Soc. Am. 2009;126:776–791. doi: 10.1121/1.3167842. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Wichmann FA, Hill NJ. The psychometric function: I. Fitting, sampling, and goodness of fit. Percept. Psychophys. 2001;63:1293–1313. doi: 10.3758/bf03194544. [DOI] [PubMed] [Google Scholar]
13.Wichmann FA, Hill NJ. The psychometric function: II. Bootstrap-based confidence intervals and sampling. Percept. Psychophys. 2001;63:1314–1329. doi: 10.3758/bf03194545. [DOI] [PubMed] [Google Scholar]
14.Plomp R. Binaural and monaural speech-intelligibility of connected discourse in reverberation as a function of azimuth of a single competing sound source (speech or noise). Acustica. 1976;34:201–211. [Google Scholar]
15.Zahorik P, Brandewie E. Room adaptation effects on speech intelligibility as a function of room reverberation time. Abstr. Midwinter Res. Meet. Assoc. Res. Otolaryngol. 2009;32:145. [Google Scholar]
16.Grantham DW. Left-right asymmetry in the buildup of echo suppression in normal-hearing adults. J. Acoust. Soc. Am. 1996;99:1118–1123. doi: 10.1121/1.414596. [DOI] [PubMed] [Google Scholar]
17.Clifton RK, Freyman RL, Litovsky RY, McCall D. Listeners’ expectations about echoes can raise or lower echo threshold. J. Acoust. Soc. Am. 1994;95:1525–1533. doi: 10.1121/1.408540. [DOI] [PubMed] [Google Scholar]
18.Clifton RK, Freyman RL, Meo J. What the precedence effect tells us about room acoustics. Percept. Psychophys. 2002;64:180–188. doi: 10.3758/bf03195784. [DOI] [PubMed] [Google Scholar]
19.Nielsen JB, Dau T. Revisiting perceptual compensation for effects of reverberation in speech identification. J. Acoust. Soc. Am. 2010;128:3088–3094. doi: 10.1121/1.3494508. [DOI] [PubMed] [Google Scholar]

[R1] 1.Knudsen VO. The hearing of speech in auditoriums. J. Acoust. Soc. Am. 1929;1:56–82. [Google Scholar]

[R2] 2.Brandewie E, Zahorik P. Prior listening in rooms improves speech intelligibility. J. Acoust. Soc. Am. 2010;128:291–299. doi: 10.1121/1.3436565. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R3] 3.Watkins AJ. Perceptual compensation for effects of reverberation in speech identification. J. Acoust. Soc. Am. 2005;118:249–262. doi: 10.1121/1.1923369. [DOI] [PubMed] [Google Scholar]

[R4] 4.Watkins AJ. Perceptual compensation for effects of echo and of reverberation on speech identification. Acta Acust. united Ac. 2005;91:892–901. doi: 10.1121/1.1923369. [DOI] [PubMed] [Google Scholar]

[R5] 5.Clifton RK, Freyman RL. The precedence effect: Beyond echo suppression. In: Gilkey RH, Anderson TR, editors. Binaural and Spatial Hearing in Real and Virtual Environments. Erlbaum; Mahwah, New Jersey: 1997. [Google Scholar]

[R6] 6.Nabelek AK, Pickett JM. Monaural and binaural speech perception through hearing aids under noise and reverberation with normal and hearing-impaired listeners. J. Speech Hear. Res. 1974;17:724–739. doi: 10.1044/jshr.1704.724. [DOI] [PubMed] [Google Scholar]

[R7] 7.ANSI-S3.9, American National Standard specification for audiometers. American National Standards Institute; New York: 1989. [Google Scholar]

[R8] 8.Bolia RS, Nelson WT, Ericson MA, Simpson BD. A speech corpus for multitalker communications research. J. Acoust. Soc. Am. 2000;107:1065–1066. doi: 10.1121/1.428288. [DOI] [PubMed] [Google Scholar]

[R9] 9.ISO-3382, Acoustics - Measurement of the reverberation time of rooms with reference to other acoustical parameters. International Organization for Standardization; Geneva: 1997. [Google Scholar]

[R10] 10.Allen JB, Berkley DA. Image method for efficiently simulating small-room acoustics. J. Acoust. Soc. Am. 1979;65:943–950. [Google Scholar]

[R11] 11.Zahorik P. Perceptually relevant parameters for virtual listening simulation of small room acoustics. J. Acoust. Soc. Am. 2009;126:776–791. doi: 10.1121/1.3167842. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] 12.Wichmann FA, Hill NJ. The psychometric function: I. Fitting, sampling, and goodness of fit. Percept. Psychophys. 2001;63:1293–1313. doi: 10.3758/bf03194544. [DOI] [PubMed] [Google Scholar]

[R13] 13.Wichmann FA, Hill NJ. The psychometric function: II. Bootstrap-based confidence intervals and sampling. Percept. Psychophys. 2001;63:1314–1329. doi: 10.3758/bf03194545. [DOI] [PubMed] [Google Scholar]

[R14] 14.Plomp R. Binaural and monaural speech-intelligibility of connected discourse in reverberation as a function of azimuth of a single competing sound source (speech or noise). Acustica. 1976;34:201–211. [Google Scholar]

[R15] 15.Zahorik P, Brandewie E. Room adaptation effects on speech intelligibility as a function of room reverberation time. Abstr. Midwinter Res. Meet. Assoc. Res. Otolaryngol. 2009;32:145. [Google Scholar]

[R16] 16.Grantham DW. Left-right asymmetry in the buildup of echo suppression in normal-hearing adults. J. Acoust. Soc. Am. 1996;99:1118–1123. doi: 10.1121/1.414596. [DOI] [PubMed] [Google Scholar]

[R17] 17.Clifton RK, Freyman RL, Litovsky RY, McCall D. Listeners’ expectations about echoes can raise or lower echo threshold. J. Acoust. Soc. Am. 1994;95:1525–1533. doi: 10.1121/1.408540. [DOI] [PubMed] [Google Scholar]

[R18] 18.Clifton RK, Freyman RL, Meo J. What the precedence effect tells us about room acoustics. Percept. Psychophys. 2002;64:180–188. doi: 10.3758/bf03195784. [DOI] [PubMed] [Google Scholar]

[R19] 19.Nielsen JB, Dau T. Revisiting perceptual compensation for effects of reverberation in speech identification. J. Acoust. Soc. Am. 2010;128:3088–3094. doi: 10.1121/1.3494508. [DOI] [PubMed] [Google Scholar]

PERMALINK

Perceptual Adaptation to Room Acoustics and Effects on Speech Intelligibility in Hearing-Impaired Populations

Pavel Zahorik

Eugene Brandewie

Summary

1. Introduction