Abstract
Degradations in external, acoustic stimulation have long been suspected to increase the load on working memory (WM). One neural signature of WM load is enhanced power of alpha oscillations (6–12 Hz). However, it is unknown to what extent common internal, auditory degradation, that is, hearing impairment, affects the neural mechanisms of WM when audibility has been ensured via amplification. Using an adapted auditory Sternberg paradigm, we varied the orthogonal factors memory load and background noise level, while the electroencephalogram was recorded. In each trial, participants were presented with 2, 4, or 6 spoken digits embedded in one of three different levels of background noise. After a stimulus-free delay interval, participants indicated whether a probe digit had appeared in the sequence of digits. Participants were healthy older adults (62–86 years), with normal to moderately impaired hearing. Importantly, the background noise levels were individually adjusted and participants were wearing hearing aids to equalize audibility across participants. Irrespective of hearing loss (HL), behavioral performance improved with lower memory load and also with lower levels of background noise. Interestingly, the alpha power in the stimulus-free delay interval was dependent on the interplay between task demands (memory load and noise level) and HL; while alpha power increased with HL during low and intermediate levels of memory load and background noise, it dropped for participants with the relatively most severe HL under the highest memory load and background noise level. These findings suggest that adaptive neural mechanisms for coping with adverse listening conditions break down for higher degrees of HL, even when adequate hearing aid amplification is in place.
Keywords: alpha oscillations, hearing loss, hearing aid, cognition, working memory
INTRODUCTION
Adverse listening conditions are common in everyday life. Auditory distractions and signal degradations increase demands on attention and working memory (WM; Shinn-Cunningham and Best, 2008). WM describes the system for temporary storage and processing of information to perform a cognitive task (Baddeley, 1986). Any degradation of the sensory auditory input requires increased WM involvement to successfully interpret the stimuli (Rönnberg et al., 2008; Stenfelt and Rönnberg, 2009). Auditory stimuli can be degraded by external factors, often occurring in the form of background noise, in which case WM is engaged to extract useful information from the auditory input (Pichora-Fuller, 2003). However, auditory processing can also be disrupted by internal degradation, such as sensorineural hearing loss (HL). To alleviate this internal degradation of the auditory input, people suffering from HL are typically treated with hearing aids. The purpose of a hearing aid is to amplify the auditory input to make sounds audible and consequently reduce the internal auditory degradation, which theoretically should release WM resources (sometimes referred to as lowered cognitive load; Lunner, 2003). Here, we tested whether HL affects brain signatures of WM involvement in an adverse listening paradigm.
The power of neural oscillations in the alpha frequency band (liberally defined as 6–12 Hz) has been found to increase with WM load (Jensen et al., 2002). According to the functional inhibition framework (Klimesch et al., 2007; Jensen and Mazaheri, 2010), alpha oscillations indicate the inhibition of currently task-irrelevant brain regions and/or cognitive processes to prevent interference with task-relevant cognitive processing (Bonnefond and Jensen, 2012). Although alpha power modulations have been found for external degradation of auditory signals (van Dijk et al., 2010; Obleser and Weisz, 2012; Obleser et al., 2012; Becker et al., 2013; Scharinger et al., 2014; Wöstmann et al., 2015), it is currently unknown how the internal degradation of auditory input through HL affects neural alpha dynamics (Strauß et al., 2014). There is good evidence from behavioral studies that HL negatively affects cognitive operations on the speech signal (McCoy et al., 2005; Wingfield et al., 2005, 2006). These findings support the hypothesis put forward by Rabbitt (1991), stating that adverse listening conditions require the allocation of more cognitive resources, which could otherwise be used for more task-relevant cognitive processing, such as storing information. Thus, external (acoustic), and internal (auditory) degradations are assumed to trigger a higher degree of WM involvement during the encoding of task-relevant stimuli, leaving fewer cognitive resources for the storage, and processing of information in the WM (Lunner et al., 2009; Van Engen and Peelle, 2014). Here, we tested whether HL impacts behavioral performance and neural mechanisms even when it is treated with individually fitted hearing aids.
A well-established experimental paradigm to test WM demands is the Sternberg paradigm (Sternberg, 1966). Participant’s task is to encode and retain a number of items to compare them to a subsequent probe. Although the Sternberg paradigm was originally developed as a visual WM task, it has since been adapted to test auditory WM (e.g., Rojas et al., 2000; Leiberg et al., 2006). The test incorporates a short stimulus-free delay period between the encoding and the probe presentation, during which the participants are to retain the presented stimuli in memory. This stimulus-free delay period is of special interest in neuroimaging studies, because neural responses measured in this time period are thought to reflect WM processes independent of the sensory stimulation itself. During stimuli presentation, the processes of auditory encoding and memory storage are not easily separated, contrary to the delay period where there is no sensory input and the only task is to retain the stimuli in memory and restore inadequately encoded items. A number of studies have found that increased memory load (i.e., increasing the number of items to be remembered) was associated with enhanced alpha power over central and parietal recording sites during the delay period (Jensen et al., 2002; Leiberg et al., 2006; Obleser et al., 2012). Critically, Obleser et al. (2012) recently found that alpha power in the delay period was not only enhanced with an increasing number of to-be-remembered items, but with the acoustic degradation of the items.
In the present study, a version of the Sternberg test modified by Obleser et al. (2012) was applied to investigate the effects of varying memory load and the level of background noise on alpha oscillations measured by electroencephalogram (EEG) recording. We tested older participants with varying degrees of HL. In line with prior studies, we expected decreased task performance with higher memory load and higher levels of background noise. We hypothesized that alpha power would increase with the severity of HL, suggesting that internal auditory degradations increase the load on neural WM mechanisms in speech processing. Furthermore, it was of interest whether such increased expenditure of cognitive resources would reach a limit and break down (i.e., reminiscent of the CRUNCH hypothesis put forward by Reuter-Lorenz and Cappell, 2008) in listeners with the most severe HL and/or under highest task demands (i.e., highest memory load and most severe background noise).
MATERIALS AND METHODS
PARTICIPANTS
Twenty-nine native Swedish speaking participants (16 females, age range: 62–86 years, mean age 72.2 years), recruited from the audiology clinic at the University Hospital of Linköping in Sweden, participated in this study. Participants were recruited to show large inter-individual variability of auditory pure-tone thresholds. Participants were grouped according to their pure-tone average (PTA), across 0.5, 1, 2, 4, and 8 kHz into three groups of HL (no/mild/moderate HL). The hearing threshold at 8 kHz was included in the PTA since sensitivity loss at higher frequencies is known to accompany age-related HL (CHABA, 1988). Separate one-way ANOVAs showed no difference in age between groups (p = 0.114), but a significant difference in HL (p < 0.001), with Fisher’s Least Significant Difference (LSD) post hoc analysis showing significant differences between the three groups (all p < 0.001). Participant information is shown in Table 1 and Figures 1C,D.
Table 1.
Hearing threshold range [dB HL] | Pure-tone average (PTA) [dB HL] | Age [years] | No. of females | |
---|---|---|---|---|
No hearing loss (HL; n = 8) | 0–25 | 22.3 (7.1) | 68.8 (4.6) | 4 |
Mild HL (n = 11) | 25–50 | 42.1 (8.4) | 72.5 (5.5) | 7 |
Moderate HL (n = 10) | 50–80 | 63.7 (5.2) | 74.6 (6.4) | 5 |
Total (n = 29) | 44.1 (17.9) | 72.2 (5.8) | 16 |
Participants grouped according to their HL (no/mild/moderate HL; first column), defined based on three ranges of hearing thresholds (second column). Values in parentheses indicate one standard deviation. Average hearing threshold levels for the three groups across 0.5, 1, 2, 4, and 8 KHz are shown in the third column. Columns four and five list participants’ mean age and number of females in the three groups, respectively. The bottom row shows average data across the entire sample of participants.
Participants all gave informed consent and were given no financial compensation for their participation. The study was approved by the regional ethical review in Linköping, Sweden and conformed with the Helsinki Declaration of Ethical Principles for Medical Research Involving Human Subjects.
EXPERIMENTAL DESIGN
Speech materials
The stimuli consisted of the monosyllabic Swedish digits “0,” “1,” “2,” “3,” “5,” “6,” and “7,” spoken by a female talker and recoded in a soundproof booth at a sampling rate of 22.05 kHz. For a natural co-articulation, the digits were recorded as triplets. The triplets were adjusted to the same root-mean-square (RMS) level, and then the first digit was extracted without silent intervals before and after each waveform, resulting in an average digit duration of 677 ms (SD: 103 ms). The recordings were originally used for the Swedish digit triplets test (Drullman et al., 2005; Larsby et al., 2011).
The final audio files were generated by adding speech-shaped noise to the digits at the individualized SNR levels (see below). Due to the short duration of the spoken digits acceptable speech-shaped noise could not be generated based on the spectrum of the digits. The speech-shaped noise was taken from the Dantale II test, a standardized speech intelligibility test (Wagener et al., 2003). Speech-shaped noise is random stationary broadband noise, with the same long-term average frequency spectrum as natural speech.
Stimulus presentation
All participants were wearing Agil hearing aids (Oticon A/S, Smørum, Denmark) with individual quasi-linear amplification. The quasi-linear amplification accounts for the audibility of soft (inaudible speech) sounds by incorporating a fast-acting gain adjustment at the onset of the presented sounds and maintaining this gain throughout the presentation of the sounds with a very slow-acting gain adjustment (for details see Simonsen and Behrens, 2009). No changes were made to the time constant throughout the sound presentation, and the hearing aid amplification can be considered linear, meaning that the hearing aid output intensity increased at the same rate as the intensity of the acoustic input. The noise reduction algorithm and volume control normally available on these hearing aids were disabled during the entire experimental session.
All auditory stimuli were presented directly through the hearing aids using the Direct Audio Input (DAI). The experiment was conducted in an electrically shielded soundproof booth. Visual cues and instructions were presented on a 1280 by 1024 resolution screen, with the participants positioned 1 m from the screen.
Individual adjustments of SNR levels
To ensure equal intelligibility of the stimulus materials for all participants despite large inter-individual differences in hearing thresholds (see Figures 1C,D; Table 1), the background noise levels were individually adjusted. To this end, participants listened to and repeated 40 spoken sentences from the Swedish version of hearing in noise test (HINT; Hällgren et al., 2006). The output presentation level was 70 dB SPL, which was presented through the DAI of the hearing aids and amplified according to the individual audiograms. In an adaptive tracking procedure (Levitt, 1971), we determined the background noise level (measured as the signal to noise ratio between speech and background noise) at which each participant was able to repeat 80% of the words in a sentence. This value for an individual participant will be referred to as the Speech Reception Threshold (SRT) of 80% (denoted 0 dB SRT80). In the Sternberg test, the individual 0 dB SRT80 level was used as the intermediate background noise level for the participant in question. The lower and higher background noise levels were generated by raising or lowering the SNR by 4 dB from the obtained 0 dB SRT80, denoted 4 dB SRT80 and –4 dB SRT80, respectively. To maintain a constant overall intensity level of the stimuli played from the presentation computer at ∼70 dB SPL, both the level of the signal (i.e., the digits) and the level of the background noise were adjusted. For instance, for the 4 dB SRT80 condition, the noise level was lowered by 2 dB in intensity, and the signal level was raised by 2 dB relative to the 0 dB SRT80.
Experimental procedure
After the individual adjustment of SNR levels, the actual experiment was performed. An auditory version of the Sternberg paradigm (Sternberg, 1966), inspired by Obleser et al. (2012), was used, employing a 3 × 3 design of the orthogonal factors memory load (2, 4, or 6 digits to be remembered) and background noise level (4 dB, 0 dB, or –4 dB relative to the individual level at which 80% of the words were correctly recalled in noise). Each trial started with the presentation of a central fixation cross for 1–2 s (randomly varied duration), followed by the encoding phase, in which 2, 4, or 6 digits were presented in speech-shaped noise (Figures 1A,B). The noise onset always preceded the onset of the first digit by 50 ms to avoid masking of the first digit by the noise onset. In trials with two and four digits, flanking sounds of white noise, at the same intensity level as the spoken digits, were presented to always ensure the presentation of six sounds. The sounds (digits and flanking noises) were presented with an onset-to-onset stimulus interval of 0.8 s, resulting in a total encoding time of 4.85 s, after which the noise was also terminated.
The encoding was followed by a stimulus-free delay period, in which the participants were to retain the presented digits in their memory. The delay phase had a duration of 1–2 s (randomly varied). Lastly, a probe digit was presented in the same background noise level as during the encoding interval. Again, the noise started 50 ms prior to the probe digit. During this 50 ms interval, the fixation cross changed to a question mark, signaling that the participants were to indicate, via a button press on a response box, whether the probe digit appeared in the encoding phase (response window of 2 s). Participants were not instructed to use any particular finger(s) for pressing the response buttons, nor were the button positions varied between participants. If participants required more than 2 s to respond, they were instructed to be faster on the next trial and informed that no response was recorded. Feedback was given after each trial, consisting of either ‘correct,’ ‘incorrect,’ or ‘no answer registered, please answer faster.’ In half of the trials, the probe digit appeared during encoding.
Trials for the nine conditions in the 3 (memory load) × 3 (background noise level) design were presented in 10 blocks. Due to the length of the test, the 10 test blocks were separated into two recordings of five blocks. Each recording lasted ∼45 min with a break of 15 min between the two recordings. Each recording was initiated with a training block of 11–25 trials from all nine conditions. Each test block consisted of a minimum of 18 trials with 2 trials for each condition, presented in a randomized order. The actual number of trials per block was determined by the number of unanswered trials. That is, for each trial in which no answer was registered due to a response time longer than 2 s, an extra trial was added to the block. Overall, 20 trials with registered answers were recorded in each condition for each participant.
EEG RECORDING AND PREPROCESSING
The EEG was recorded using an EGI system (Electrical Geodesic Inc., Eugene, OR, USA) with 128 Ag/Ag-Cl channels. Six occipital and one central electrode were disconnected from the electrode net and used for other physiological measurements which will not be reported here. The EEG was recorded at a sampling rate of 250 Hz using Cz as the reference. All electrode impedances were maintained below 50 kOhm. The EGI system incorporates analog elliptical high- and low-pass with cut-off frequencies at 0.1 and 125 Hz (the Nyquist frequency), respectively. Filtering was performed before analog-to-digital conversion of the EEG.
Offline, the EEG data were analyzed using customized MATLAB scripts (R2011b, MathWorks Inc.) and the Fieldtrip toolbox (Oostenveld et al., 2011). Trials with response times longer than 2 s were excluded from all further analyses. The data were divided into epochs of sufficient length (–5 to +11 s around the onset of the first digit/flanking noise) to avoid data loss at the edges of the time-frequency representations due to windowing effects. The epoched data were bandpass filtered using an acausal sixth order IIR Butterworth filter between 0.5 and 45 Hz and re-referenced to the average of both mastoids. Before further analyses, 18 electrodes used for recording the electrooculogram (EOG) or positioned on the cheeks and jaw were removed for technical reasons.
Individual channels containing artifacts were identified through visual inspection and repaired by averaging over adjacent electrodes (according to the nearest neighbor approach implemented in the ft_channelrepair function in Fieldtrip). Data from one participant from the mild HL group were excluded from all further analyses due to a high number of artifact-contaminated channels. To remove further artifacts, an independent component analysis (ICA) was performed, and components containing eye blinks, saccadic eye movements, muscle activity, and heartbeats were identified by inspection of components’ topographies and time courses and rejected. On average, 22% (SD: 6%) of the components were removed.
The time-frequency representation of oscillatory power in each trial was obtained by convolution of single trial time domain data with a family of Morlet wavelets (width: seven cycles). This analysis was performed for frequencies from 0.5 to 30 Hz in steps of 0.5 Hz and from –5 to +11 s around the onset of the first digit/flanking noise in steps of 0.05 s. Note that this long time interval included the baseline period, encoding, delay, and probe (Figure 1A). The power of each time–frequency–electrode bin was calculated for each trial by taking the square norm of the complex wavelet coefficients. Adjustment for inter- and intra-individual variability in oscillatory power was performed by means of subtraction and division by the average power of the first 0.4 to 1 s of the baseline interval (relative change from baseline). For further analyses, each trial was split into the following periods: encoding, 0.4–4.8 s relative to first digit/flanking noise onset; delay, 0.4–1 s relative to the offset of the last digit/flanking noise; and probe, 0.4–1 s relative to probe-digit onset. All time intervals disregard the first 0.4 s as to not include evoked activity after stimulus on- or offset in the analysis.
STATISTICAL ANALYSES
A main motivation of the present study was to investigate the effect of HL on behavioral performance and alpha oscillations in the auditory Sternberg task. However, HL was confounded by age, as evidenced by a positive Pearson’s correlation between age and PTA (r = 0.44, p = 0.018). To obtain a measure of HL that was independent of age, we calculated the residualized PTA, quantifying the variation in PTA across participants that could not be explained by age. In detail, the residualized PTA was estimated as the residuals of the linear regression of PTA on age. For the remainder of this paper, we will refer to the z-scored residualized PTA as ‘rPTA.’ In all further analyses, rPTA was included as a continuous covariate. Moreover, we considered it likely that brain compensatory mechanisms involved in overcoming the adverse listening conditions would not increase linearly with HL, but drop with more severe HL, especially under high memory load/background noise (see Introduction). To model this negative quadratic (inverted u-shape) relationship between HL and behavioral and brain responses, we additionally included the quadratic term rPTA-squared as a second continuous covariate in all further analyses.
Statistical analysis of behavioral data
First, we analyzed to what extent the individual adjustments of SNR levels were dependent on participants’ HL. To evaluate whether individualization was needed, we calculated the Pearson’s correlation between the 0 dB SRT80 value from the HINT and the non-residualized PTA.
In the auditory Sternberg task, response times were measured from the onset of the probe digit until the button press by the participant to indicate whether the probe digit appeared in the encoding. Accuracy was calculated as the percentage of correctly answered trials. Changes in task accuracy and response times as a function of the within-subject factors (memory load and background noise level) and the continuous between-subjects covariates (rPTA and rPTA-squared), were investigated using two separate repeated-measures ANCOVAs. All ANCOVAs showed violation of the assumption of sphericity (Mauchly’s test, all p < 0.05), hence the Greenhouse–Geisser corrected p-values were calculated and reported for all results. Fisher’s LSD tests were used for all post hoc analyses.
To illustrate the quadratic relationship between rPTA and response times (Figure 2C), a quadratic function was fitted to the response time as a function of rPTA using the least-squares approach implemented in the MATLAB functions polyfit and polyval.
Statistical analysis of EEG data
In the analysis of the EEG data, alpha power was averaged across frequencies from 6–12 Hz in a subset of 31 electrodes (Figure 3A, topographic maps) and across three time intervals outlined in Figure 3A: encoding, 0.4–4.8 s relative to the onset of the first digit/flanking noise; delay, 0.4–1 s relative to the offset of the last digit/flanking noise; and probe, 0.4–1 s relative to the onset of the probe digit. The 31 electrodes were chosen to derive a centro-parietal scalp distribution, which has previously been identified as an important site for alpha activity generation during auditory processing (Krause et al., 1996). Average alpha power during encoding, delay, and probe were subjected to three repeated-measures ANCOVAs with memory load and background noise level as within-subject factors and with rPTA and rPTA-squared as continuous between-subject covariates. All ANCOVAs showed violation of the assumption of sphericity (Mauchly’s test, all p < 0.05), hence the Greenhouse–Geisser corrected p-values were calculated and reported for all results. All statistical analyses were performed using Statistica (version 12, StatSoft, Tulsa, OK, USA).
To illustrate the quadratic relationship between rPTA and alpha power (Figure 4B), the fitting procedure described in the section above was applied.
Studies have previously shown an interaction between response time and alpha activity (Klimesch, 2005). Relations between alpha activity during the probe period and response time were therefore evaluated using Pearson’s correlation.
RESULTS
INDIVIDUAL ADJUSTMENTS OF SNR LEVELS
The individual adjustments of SNR levels using the SRT80 measure resulted in an average 0 dB SRT80 value of 4.61 dB [standard error of the mean (SEM) = 0.86], meaning that participants on average required an SNR level of 4.61 dB to successfully repeat 80% of words from sentences presented in noise. The 0 dB SRT80 values correlated positively with participants’ non-residualized PTA (r = 0.76; p < 0.001). This indicates that participants with more severe HL required a higher SNR level of stimulus materials.
MEMORY LOAD, BACKGROUND NOISE LEVEL, AND HEARING LOSS IMPACT PERFORMANCE
Figure 2A shows the average accuracy for the three levels of memory load (2, 4, 6 digits) and the three background noise levels (–4 dB SRT80, 0 dB SRT80, 4 dB SRT80) in the auditory Sternberg task. The main effect of memory load on accuracy was significant [F(2,50) = 6.26, p = 0.005]. Post hoc tests revealed significantly increased accuracy for two compared with six items (p < 0.001) and for four compared with six items (p = 0.002) but not for two compared with four items (p = 0.718). Additionally, the main effect of background noise level on accuracy was significant [F(2,50) = 28.35, p < 0.001], with the post hoc analysis showing a significant decrease in accuracy with increasing noise level (all p < 0.01). There were no significant main effects of rPTA [F(1,25) = 1.86, p = 0.185) or rPTA-squared [F(1,25) = 1.94, p = 0.176], indicating that the degree of HL by itself did not significantly impact task accuracy. None of the interactions between background noise level, memory load, rPTA, and rPTA-squared were significant (all p > 0.195).
Figure 2B shows the average response times for the three memory loads and background noise levels. The main effect of memory load on response times was significant [F(2,50) = 24.73, p < 0.001]. Post hoc tests revealed significantly longer response times for six compared with four and two to-be-retained digits, as well as for four compared with two digits (all p < 0.001). The main effect of background noise on response times was significant as well [F(2,50) = 8.34, p = 0.001]. Post hoc tests revealed significantly longer response times for the highest background noise level (–4 dB SRT80) compared with the intermediate noise level (0 dB SRT80; p < 0.001) and the lowest background noise level (4 dB SRT80; p = 0.003). Response times in the four and 0 dB SRT80 conditions did not differ significantly (p = 0.328). Interestingly, the main effect of rPTA-squared on response times was significant [F(1,25) = 5.69, p = 0.025]. This indicated a significant quadratic relationship between response times and the degree of HL in such a way that response times increased from no to mild HL, while response times decreased again for participants with the most severe HL (see Figure 2C). Neither the main effect of rPTA [F(1,25) = 1.85, p = 0.185], nor any interaction between memory load, background noise, rPTA, and rPTA-squared (all p ≥ 0.13) reached significance.
TEMPORAL DYNAMICS OF ALPHA OSCILLATIONS
Figure 3A shows the grand-average baseline corrected time-frequency power representation (collapsed over all nine experimental conditions) for all participants throughout the encoding, delay, and probe periods of the auditory Sternberg task. The time course of alpha power (6–12 Hz; averaged over 31 scalp electrodes highlighted in topographic maps) for the three groups of HL are indicated in Figure 3B. Descriptively, alpha power decreased over the trial time course from encoding to delay and also during the probe interval. Normal hearing participants (no HL) exhibited the lowest alpha power in encoding, delay and probe, while the mild HL group showed the highest and the moderate HL group exhibited intermediate alpha power.
Hearing loss affects alpha oscillations under load
We analyzed whether alpha power during the stimulus-free delay interval was dependent on memory load, background noise level, and HL. To this end, the average alpha power (6–12 Hz) across 31 centro-parietal electrodes during the delay interval (0.4–1 s relative to the offset of the background noise) was submitted to a repeated-measures ANCOVA with the factors memory load and background noise level and the continuous covariates rPTA and rPTA-squared. None of the main effects including background noise level [F(2,50) = 1.23, p = 0.299], memory load [F(2,50) = 0.04, p = 0.598], or rPTA-squared [F(1,25) < 0.01, p = 0.989] were significant. Importantly, however, the main effect rPTA was significant [F(1,25) = 4.31, p = 0.0483], indicating that alpha power during the delay increased significantly with the degree of HL (Figure 4A).
Moreover, the two-way interaction background noise level × rPTA-squared [F(2,50) = 6.34, p = 0.004] as well as the three-way interaction background noise level × rPTA-squared × memory load were significant [F(4,100) = 2.86, p = 0.042]. The direction of the significant three-way interaction is illustrated in Figure 4B. For the two lower memory loads (two and four to-be-remembered items), alpha power during the delay period increased moderately with the degree of HL for all background noise levels. This pattern of results changed significantly under the highest memory load (six to-be-remembered digits); here, alpha power strongly increased with HL under the two more favorable background noise levels (4 and 0 dB SRT80), but under the most severe background noise level (–4 dB SRT 80), alpha power increased only for participants with mild HL, whereas it decreased again for participants with moderate HL. The significant interaction between background noise level and rPTA (p = 0.004) is not shown, but resembles the same behavior as observed for six items to be remembered shown in Figure 4B. None of the remaining interactions among rPTA, rPTA-squared, memory load, and background noise level were significant (all p > 0.15).
The main hypothesis of this experiment was focused on identifying condition and HL effects on alpha power during the delay. However, Obleser et al. (2012) also report smaller condition effects during the encoding and probe period. We therefore investigated alpha power during the encoding (0.4–4.8 s relative to the onset of the first digit/flanking noise) and probe (0.4–1 s relative to probe digit onset) interval as well. For the encoding interval, none of the main effects of memory load, background noise level, rPTA, and rPTA-squared, nor any interactions reached significance (all p > 0.14). During the presentation of the probe, a main effect of rPTA-squared was found [F(1,25) = 9.63, p = 0.004], while no other main effects or interactions were significant (all p > 0.12). Notably, an effect of rPTA-squared is also observed on the response time and the relationship between alpha activity during the probe, and the response time was investigated. A Pearson’s correlation showed a positive relationship (r = 0.35, p = 0.068) between alpha power during the probe and response times, meaning that participants with higher alpha power during the probe interval showed longer response times. A similar relationship was not observed between the alpha power during the delay period and the response times (r = 0.15, p = 0.42).
DISCUSSION
In this study, we tested whether HL in older participants had an impact on the neural mechanisms of WM under changing task demands implemented by varying degrees of memory load and background noise. Our main findings can be summarized as follows: first, irrespective of HL, increasing memory load and higher background noise levels led to performance decrements in the auditory Sternberg paradigm. Second, the effects of the increasing memory load and background noise level on alpha activity during the delay were co-determined by the degree of HL. That is, participants suffering from a higher degree of HL exhibited a breakdown in alpha activity with increasing task difficulty, which was not observed for the participants with mild or no HL. These findings show how an internal auditory degradation (i.e., HL) interacts with external acoustic challenges during adverse listening.
THE EFFECT OF RETAINING AUDITORY STIMULI
Effects of WM processing on alpha power have been often observed only during the retention of stimuli in both auditory (van Dijk et al., 2010; Obleser and Weisz, 2012; Obleser et al., 2012; Becker et al., 2013; Scharinger et al., 2014) and visual tasks (Jensen et al., 2002; Schack and Klimesch, 2002; Sander et al., 2012b). It was therefore not unexpected that modulations of alpha power in this study were also found in the delay period.
The linear main effect of rPTA on alpha power in the delay period (Figure 4A) showed that alpha power increases with more severe HL, independent of task difficulty. This linear effect occurs despite the quadratic tendency seen in Figure 3B. The linear relationship in Figure 4A arose from large individual differences in alpha power, especially in the mildly impaired group, and was also affected by the residualization performed to remove age effects: first, this dependence of alpha power on HL is observed during the retention of the to-be-remembered digits, where no active listening is involved. Second, all participants were wearing hearing aids to equalize audibility of the digits presented during the encoding across participants. Interpreting the alpha activity as a sign of WM involvement (Jensen et al., 2002), our study shows that a higher degree of WM involvement is needed to overcome more severe HL to successfully retain the auditory information. This view of increased WM involvement with increased HL has been put forward in a number of studies (Pichora-Fuller and Singh, 2006; Rönnberg et al., 2008; Shinn-Cunningham and Best, 2008). The Ease of Language Understanding (ELU) model developed by Rönnberg et al. (2008) explains the involvement of the WM in speech understanding under adverse conditions. In detail, the ELU model builds on the ability to match auditory stimuli with a preexisting long-term memory store of phonological representations. When suffering from a HL, this match cannot readily be made due to the internal degradation. Hence WM processes are required for extracting acoustical cues that can trigger a phonological match and ensure a successful understanding. In line with the ELU model, the linear relationship between HL and alpha power can be interpreted as the increased WM resources needed to perform successful phonological matching in listeners with HL. Interestingly, the effect of HL on alpha activity is observed for participants wearing hearing aids, which is thought to ensure equal audibility, but arguably cannot restore the WM resources needed to retain speech stimuli.
Hearing aids can indeed ensure audibility and restore intelligibility in quiet situations, while other aspects of listening, such as processing of temporal cues, are not alleviated by amplification (Ardoint et al., 2010). Furthermore, speech intelligibility in noisy situations also remains affected by HL and cannot be fully restored by amplification (Plomp, 1978; Dillon, 2001). This is indeed evident from the positive relationship between HL and the 0 dB SRT80 value. Peelle et al. (2011) found that increased HL was correlated with decreased gray matter volume of the auditory cortex, i.e., a structural change in the brain. If HL causes structural changes in the auditory cortex, this might explain why individual HL compensation via amplification does not nullify such structural deviation in the auditory system, and HL-dependent effects, such as the present ones, are observed despite hearing aids being employed.
The impact of the experimental conditions (memory load and background noise level) proved only to be significant in interactions with HL. Our results showed that when increasing the external degradation, i.e., the background noise level, an increase in alpha activity with HL was observed for the lower levels of background noise. However, for the highest background noise level, a breakdown in alpha activity was observed for the participants with the most severe degree of HL tested in this study (moderate HL). This breakdown in alpha power is only observed when participants have to remember six digits in the most difficult noise condition (Figure 4B). The almost linear increase in alpha power with HL severity observed at lower background noise levels (4 and 0 dB SRT80) suggests that although the noise levels are individualized, participants with increased HL require additional WM resources to be able to perform the task. Indeed, it has previously been suggested that people suffering from HL need to allocate additional resources to process auditory information (Rabbitt, 1991). The findings in this study lend neural support to this hypothesis.
The breakdown in alpha power with increased HL and background noise level further suggests that the participants suffering from reduced hearing reach a ceiling at which no further enhancement in alpha activity can be achieved, and alpha power begins to decrease. Such alpha power breakdown has been observed before when older participants, not considering HL, are subjected to a higher WM load in a visual Sternberg task, while no effect of age was observed on task accuracy (Sander et al., 2012b). Similar findings of neural activity breakdown with high WM loads for increasing age have been observed in fMRI studies (Reuter-Lorenz and Cappell, 2008; Schneider-Garces et al., 2010; Grady, 2012). Also here, the activity breakdown is not necessarily accompanied by changes in task accuracy. According to the “compensation-related utilization of neural circuits” (CRUNCH) hypothesis, the brain increases its activation to engage more neural resources as a result of aging, independent of WM involvement. However, with increasing WM demands, this recruitment reaches a ceiling, and the activity decreases, although no changes in task performance are observed (Reuter-Lorenz and Cappell, 2008). We suggest that, similar to increasing age, more severe HL can cause neural activity breakdown as a result of having to engage more WM resources than participants with better hearing. It is believed that the cause of the observed breakdown is a combination of the two observations that: participants with more severe HL experience generally higher WM involvement (independent of experimental conditions, Figure 4A) and during WM tasks they have increased WM involvement (Figure 4B). To our knowledge our results are the first to demonstrate a breakdown of neural activity with increased HL.
Alpha power during the delay was affected by memory load in a three-way interaction with background noise and rPTA-squared. Our experimental design was modified from the auditory Sternberg task applied by Obleser et al. (2012), who found main effects of both memory load and auditory degradation (obtained through noise-vocoding of the digits) on alpha activity. The lack of a main effect of memory load in the present study might be explained best by the differences in participants (older hearing impaired vs. younger normal hearing), rather than auditory degradation (background noise vs. noise-vocoding). Both of these changes were introduced to achieve some gain in external validity in the present study.
Although we corrected for the difference in age between participants in this study, we cannot account for the average differences between younger and older persons, which has been proven to affect both alpha activity and WM resources (Klimesch, 1999; Sander et al., 2012a). Although increased age might have resulted in participants having generally less WM resources available and thereby reaching alpha power breakdown, differences in cohort age between the studies cannot explain the non-significant main effect of memory load in the present study. We suggest that the lack of memory load effect can be explained by the fact that the hearing impaired participants are already performing at ceiling and cannot further increase their alpha activity when subjected to higher memory loads and/or background noise levels. This statement is supported by two observations: firstly, that the alpha power increased with HL, independent of the experimental condition. Secondly, that the conditions effects (rPTA × background noise level and rPTA × background noise level × memory load) showed a decrease in alpha power for the moderately impaired participants, c.f. Figure 4B.
NO EFFECTS OF HEARING LOSS ON TASK ACCURACY
To adjust for the differences in HL, the background noise levels were individualized using the SRT80 measure obtained from the HINT test (for details see Materials and Methods). The positive relation between HL and 0 dB SRT80 shows that for participants with more severe HL a lower background noise level (i.e., higher 0 dB SRT80) is needed. This relationship emphasizes the importance of individualizing the background noise level to ensure equal task accuracy across all participants, independent of HL. Indeed, the non-significant effect of HL on task accuracy confirms the success of applying individual noise levels.
As hypothesized, task accuracy significantly decreased both with increased memory load and background noise level. As Figure 2A shows, background noise levels showed stronger effects on task accuracy than changes in the memory load. In line with the modulations of alpha activity, this finding emphasizes that auditory degradation induces a larger WM involvement than changes in the memory load for the memory loads and background noise level tested in this study. Significant effects of the experimental conditions on task accuracy have sometimes been reported in auditory and visual Sternberg tasks (Rojas et al., 2000; Jensen et al., 2002; Sander et al., 2012b), but most studies aim at having no condition effects on accuracy (Sternberg, 1966; Lehtelä et al., 1997; Leiberg et al., 2006; Obleser et al., 2012). As noted by Rojas et al. (2000), the confounding effect of task accuracy on response time and alpha activity makes it impossible to determine whether WM processing is indeed involved in solving the task, especially for wrongly answered trials. In this study, effects of memory load and background noise level on task accuracy were found, which is a limitation of the study. However, obtaining task accuracies close to 100% correct for all conditions and participants would require troublesome and time consuming individualization. Alternatively, including only the correctly answered trials in the current analysis would result in an unfeasibly low number of trials per condition. However, as we observe effects of HL on the alpha power, we believe that WM processing was involved during task solving.
The response times were affected both by the experimental conditions (Figure 2B) and HL (Figure 2C), the latter showing a speed-up in response times with increased HL. As a sign of stimulus retrieval (Sternberg, 1966), it was expected that the response time would show effects of the experimental condition as well as HL. The increase in response times from normal to mildly impaired hearing suggests that increasing internal degradation of the auditory signal results in longer processing times of the probe digit. As HL increases from mild to moderate, participants’ strategy might change resulting in shorter response times (Figure 2C).
The effect of rPTA-squared on alpha activity during the probe also proved to be significant and although the correlation between the alpha activity during the probe and the response times only approached significance (p = 0.068), we believe that the changes in alpha power during the probe period arise from changes in the speed of information processing (Klimesch, 2005) and not WM processing as such.
In summary, the present findings suggest that despite being compensated for the loss of hearing through hearing aid amplification and by individually setting the administered signal-to-noise ratios, higher degrees of HL are detrimentally affecting a cardinal neural mechanism of overcoming adverse listening conditions, namely the increase in posterior alpha power. Apparently, participants with moderate HL reach a ceiling level at which no more WM resources can be recruited, and thus alpha power begins to decrease again. These findings not only reveal that hearing aid amplification by itself is not sufficient for restoring normal neural signatures of auditory processing, but also suggest that persons suffering from a higher degree of HL reach a WM limit at a lower task demand.
Conflict of Interest Statement
Eriksholm Research Centre is part of Oticon A/S and as such the salary of Eline Borch Petersen and Thomas Lunner were paid by Oticon A/S. Hearing aids were provided by Oticon A/S.
Acknowledgments
EBP is supported by a grant from the Oticon Foundation. We wish to thank all the participants in the experiment and Gunilla Wänström, Irene Slättengren, and Mathias Hällgren for their assistance during the experiment. The authors are grateful for the helpful discussions with researchers at Eriksholm Research Centre and members of the Max Planck Research Group “Auditory Cognition.”
REFERENCES
- Ardoint M., Sheft S., Fleuriot P., Garnier S., Lorenzi C. (2010). Perception of temporal fine-structure cues in speech with minimal envelope cues for listeners with mild-to-moderate hearing loss. Int. J. Audiol. 49 823–831 10.3109/14992027.2010.492402 [DOI] [PubMed] [Google Scholar]
- Baddeley A. (1986). Working Memory. Oxford: Oxford University Press. [Google Scholar]
- Becker R., Pefkou M., Michel C. M., Hervais-Adelman A. G. (2013). Left temporal alpha-band activity reflects single word intelligibility. Front. Syst. Neurosci. 7:121 10.3389/fnsys.2013.00121 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bonnefond M., Jensen O. (2012). Alpha oscillations serve to protect working memory maintenance against anticipated distracters. Curr. Biol. 22 1969–1974 10.1016/j.cub.2012.08.029 [DOI] [PubMed] [Google Scholar]
- CHABA. (1988). Speech understanding and aging. J. Acoust. Soc. Am. 83 859–895 10.1121/1.395965 [DOI] [PubMed] [Google Scholar]
- Dillon H. (2001). Hearing aids. Gac. Med. Mex. 126 417–418 10.1007/SpringerReference_115704 [DOI] [PubMed] [Google Scholar]
- Drullman R., Wagener K., Wouters J., Eneman K. (2005). Speech Recognition Tests for Different Languages. Report FP6-004171. Hearing in the communication society. Available at: http://www.hearcom.eu [Google Scholar]
- Grady C. (2012). The cognitive neuroscience of ageing. Nat. Rev. Neurosci. 13 491–505 10.1038/nrn3256 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hällgren M., Larsby B., Arlinger S. (2006). A swedish version of the Hearing In Noise Test (HINT) for measurement of speech recognition. Int. J. Audiol. 45 227–37 10.1080/14992020500429583 [DOI] [PubMed] [Google Scholar]
- Jensen O., Gelfand J., Kounios J., Lisman J. E. (2002). Oscillations in the alpha band (9-12 Hz) increase with memory load during retention in a short-term memory task. Cereb. Cortex 12 877–882 10.1093/cercor/12.8.877 [DOI] [PubMed] [Google Scholar]
- Jensen O., Mazaheri A. (2010). Shaping functional architecture by oscillatory alpha activity: gating by inhibition. Front. Hum. Neurosci. 4:186 10.3389/fnhum.2010.00186 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Klimesch W. (1999). EEG alpha and theta oscillations reflect cognitive and memory performance: a review and analysis. Brain Res. Brain Res. Rev. 29 169–95 10.1016/S0165-0173(98)00056-3 [DOI] [PubMed] [Google Scholar]
- Klimesch W. (2005). “EEG Alpha and Cognitive Processes,” in Time and the Brain ed. Miller R. (Amsterdam: Harwood Academic Publishers) 236–261. [Google Scholar]
- Klimesch W., Sauseng P., Hanslmayr S. (2007). EEG alpha oscillations: the inhibition-timing hypothesis. Brain Res. Rev. 53 63–88 10.1016/j.brainresrev.2006.06.003 [DOI] [PubMed] [Google Scholar]
- Krause C. M., Lang A. H., Laine M., Kuusisto M., Pörn B. (1996). Event-related EEG desynchronization and synchronization during an auditory memory task. Electroencephalogr. Clin. Neurophysiol. 98 319–326 10.1016/0013-4694(96)00283-0 [DOI] [PubMed] [Google Scholar]
- Larsby B., Hällgren M., Leandersson M. (2011). “The swedish digit triplet test for screening of hearing by telephone and internet,” in Proceedings of the First International Conference on Cognitive Hearing Science for communication Linköping. [Google Scholar]
- Lehtelä L., Salmelin R., Hari R. (1997). Evidence for reactive magnetic 10-Hz rhythm in the human auditory cortex. Neurosci. Lett. 222 111–114 10.1016/S0304-3940(97)13361-4 [DOI] [PubMed] [Google Scholar]
- Leiberg S., Lutzenberger W., Kaiser J. (2006). Effects of memory load on cortical oscillatory activity during auditory pattern working memory. Brain Res. 1120 131–140 10.1016/j.brainres.2006.08.066 [DOI] [PubMed] [Google Scholar]
- Levitt H. (1971). Transformed up-down methods in psychoacoustics. J. Acoust. Soc. Am. 49 467–477 10.1121/1.1912375 [DOI] [PubMed] [Google Scholar]
- Lunner T. (2003). Cognitive function in relation to hearing aid use. Int. J. Audiol. 42 S49–S58 10.3109/14992020309074624 [DOI] [PubMed] [Google Scholar]
- Lunner T., Rudner M., Rönnberg J. (2009). Cognition and hearing aids. Scand. J. Psychol. 50 395–403 10.1111/j.1467-9450.2009.00742.x [DOI] [PubMed] [Google Scholar]
- McCoy S. L., Tun P. A., Cox L. C., Colangelo M., Stewart R. a., Wingfield A. (2005). Hearing loss and perceptual effort: downstream effects on older adults’ memory for speech. Q. J. Exp. Psychol. A. 58 22–33 10.1080/02724980443000151 [DOI] [PubMed] [Google Scholar]
- Obleser J., Weisz N. (2012). Suppressed alpha oscillations predict intelligibility of speech and its acoustic details. Cereb. Cortex 22 2466–77 10.1093/cercor/bhr325 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Obleser J., Wöstmann M., Hellbernd N., Wilsch A., Maess B. (2012). Adverse listening conditions and memory load drive a common alpha oscillatory network. J. Neurosci. 32 12376–12383 10.1523/JNEUROSCI.4908-11.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oostenveld R., Fries P., Maris E., Schoffelen J.-M. (2011). FieldTrip: open source software for advanced analysis of MEG, EEG, and invasive electrophysiological data. Comput. Intell. Neurosci. 2011 156869 10.1155/2011/156869 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Peelle J. E., Troiani V., Grossman M., Wingfield A. (2011). Hearing loss in older adults affects neural systems supporting speech comprehension. J. Neurosci. 31 12638–12643 10.1523/JNEUROSCI.2559-11.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pichora-Fuller M. K. (2003). Cognitive aging and auditory information processing. Int. J. Audiol. 42(Suppl. 2) 2S26–2S32. [PubMed] [Google Scholar]
- Pichora-Fuller M. K., Singh G. (2006). Effects of age on auditory and cognitive processing: implications for hearing aid fitting and audiologic rehabilitation. Trends Amplif. 10 29–59 10.1177/108471380601000103 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Plomp R. (1978). Auditory handicap of hearing impairment and the limited benefit of hearing aids. J. Acoust. Soc. Am. 63 533–549 10.1121/1.381753 [DOI] [PubMed] [Google Scholar]
- Rabbitt P. (1991). Mild hearing loss can cause apparent memory failures which increase with age and reduce with IQ. Acta Otolaryngol. Suppl. 476 167–176 10.3109/00016489109127274 [DOI] [PubMed] [Google Scholar]
- Reuter-Lorenz P. A., Cappell K. A. (2008). Neurocognitive aging and the compensation hypothesis. Curr. Dir. Psychol. Sci. 17 177–182 10.1111/j.1467-8721.2008.00570.x [DOI] [Google Scholar]
- Rojas D. C., Teale P. D., Sheeder J. L., Reite M. L. (2000). Neuromagnetic alpha suppression during an auditory sternberg task. Evidence for a serial, self-terminating search of short-term memory. Brain Res. Cogn. Brain Res. 10 85–9 10.1016/S0926-6410(00)00026-4 [DOI] [PubMed] [Google Scholar]
- Rönnberg J., Rudner M., Foo C., Lunner T. (2008). Cognition counts: a working memory system for ease of language understanding (ELU). Int. J. Audiol. 47(Suppl. 2) S99–S105 10.1080/14992020802301167 [DOI] [PubMed] [Google Scholar]
- Sander M. C., Lindenberger U., Werkle-Bergner M. (2012a). Lifespan age differences in working memory: a two-component framework. Neurosci. Biobehav. Rev. 36 2007–2033 10.1016/j.neubiorev.2012.06.004 [DOI] [PubMed] [Google Scholar]
- Sander M. C., Werkle-Bergner M., Lindenberger U. (2012b). Amplitude modulations and inter-trial phase stability of alpha-oscillations differentially reflect working memory constraints across the lifespan. Neuroimage 59 646–54 10.1016/j.neuroimage.2011.06.092 [DOI] [PubMed] [Google Scholar]
- Schack B., Klimesch W. (2002). Frequency characteristics of evoked and oscillatory electroencephalic activity in a human memory scanning task. Neurosci. Lett. 331 107–110 10.1016/S0304-3940(02)00846-7 [DOI] [PubMed] [Google Scholar]
- Scharinger M., Herrmann B., Nierhaus T., Obleser J. (2014). Simultaneous EEG-fMRI brain signatures of auditory cue utilization. Front. Neurosci. 8:137 10.3389/fnins.2014.00137 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schneider-Garces N. J., Gordon B. a., Brumback-Peltz C. R., Shin E., Lee Y., Sutton B. P., et al. (2010). Span, CRUNCH, and beyond: working memory capacity and the aging brain. J. Cogn. Neurosci. 22 655–669 10.1162/jocn.2009.21230 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shinn-Cunningham B. G., Best V. (2008). Selective attention in normal and impaired hearing. Trends Amplif. 12 283–299 10.1177/1084713808325306 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Simonsen C. S., Behrens T. (2009). A new compression strategy based on a guided level estimator. Hear. Rev. 16 26–31. [Google Scholar]
- Stenfelt S., Rönnberg J. (2009). The signal-cognition interface: interactions between degraded auditory signals and cognitive processes. Scand. J. Psychol. 50 385–393 10.1111/j.1467-9450.2009.00748.x [DOI] [PubMed] [Google Scholar]
- Sternberg S. (1966). High-speed scanning in human memory. Science 153 652–654 10.1126/science.153.3736.652 [DOI] [PubMed] [Google Scholar]
- Strauß A., Wöstmann M., Obleser J. (2014). Cortical alpha oscillations as a tool for auditory selective inhibition. Front. Hum. Neurosci. 8:350 10.3389/fnhum.2014.00350 [DOI] [PMC free article] [PubMed] [Google Scholar]
- van Dijk H., Nieuwenhuis I. L. C., Jensen O. (2010). Left temporal alpha band activity increases during working memory retention of pitches. Eur. J. Neurosci. 31 1701–7 10.1111/j.1460-9568.2010.07227.x [DOI] [PubMed] [Google Scholar]
- Van Engen K. J., Peelle J. E. (2014). Listening effort and accented speech. Front. Hum. Neurosci. 8:577 10.3389/fnhum.2014.00577 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wagener K., Josvassen J. L., Ardenkjær R. (2003). Design, optimization and evaluation of a Danish sentence test in noise. Int. J. Audiol. 42 10–17 10.3109/14992020309056080 [DOI] [PubMed] [Google Scholar]
- Wingfield A., McCoy S. L., Peelle J. E., Tun P. A., Cox L. C. (2006). Effects of adult aging and hearing loss on comprehension of rapid speech varying in syntactic complexity. J. Am. Acad. Audiol. 17 487–497 10.3766/jaaa.17.7.4 [DOI] [PubMed] [Google Scholar]
- Wingfield A., Tun P. A., McCoy S. L. (2005). Hearing loss in older adulthood. what it is and how it interacts with cognitive performance. Curr. Dir. Psychol. Sci. 14 144–148 10.1111/j.0963-7214.2005.00356.x [DOI] [Google Scholar]
- Wöstmann M., Herrmann B., Wilsch A., Obleser J. (2015). Neural alpha dynamics in younger and older listeners reflect acoustic challenges and predictive benefits. J. Neurosci. 35 1458–1467 10.1523/JNEURDSCI.3250-14.2015 [DOI] [PMC free article] [PubMed] [Google Scholar]