Abstract
Many older listeners have difficulty understanding speech in noise, when cues to speech-sound identity are less redundant. The amplitude envelope of speech fluctuates dramatically over time, and features such as the rate of amplitude change at onsets (attack) and offsets (decay), signal critical information about the identity of speech sounds. Aging is also thought to be accompanied by increases in cortical excitability, which may differentially alter sensitivity to envelope dynamics. Here, we recorded electroencephalography in younger and older human adults (of both sexes) to investigate how aging affects neural synchronization to 4 Hz amplitude-modulated noises with different envelope shapes (ramped: slow attack and sharp decay; damped: sharp attack and slow decay). We observed that subcortical responses did not differ between age groups, whereas older compared with younger adults exhibited larger cortical responses to sound onsets, consistent with an increase in auditory cortical excitability. Neural activity in older adults synchronized more strongly to rapid-onset, slow-offset (damped) envelopes, was less sinusoidal, and was more peaked. Younger adults demonstrated the opposite pattern, showing stronger synchronization to slow-onset, rapid-offset (ramped) envelopes, as well as a more sinusoidal neural response shape. The current results suggest that age-related changes in the excitability of auditory cortex alter responses to envelope dynamics. This may be part of the reason why older adults experience difficulty understanding speech in noise.
SIGNIFICANCE STATEMENT Many middle-aged and older adults report difficulty understanding speech when there is background noise, which can trigger social withdrawal and negative psychosocial health outcomes. The difficulty may be related to age-related changes in how the brain processes temporal sound features. We tested younger and older people on their sensitivity to different envelope shapes, using EEG. Our results demonstrate that aging is associated with heightened sensitivity to sounds with a sharp attack and gradual decay, and sharper neural responses that deviate from the sinusoidal features of the stimulus, perhaps reflecting increased excitability in the aged auditory cortex. Altered responses to temporal sound features may be part of the reason why older adults often experience difficulty understanding speech in social situations.
Keywords: aging, envelope tracking, hearing, neural synchronization, nonsinusoidal signal shape
Introduction
Sensitivity to temporal features of sound, such as dynamic fluctuations in the amplitude envelope, is considered critical for speech intelligibility (Drullman et al., 1994; Shannon et al., 1995). Aging is associated with a decline in processing auditory temporal features (Gordon-Salant and Fitzgibbons, 1999) and speech intelligibility, particularly in the presence of background sounds (Gordon-Salant, 2006). In addition to peripheral hearing loss (presbycusis) (Frisina and Frisina, 1997) and cognitive decline (Wayne and Johnsrude, 2015; Griffiths et al., 2020), evidence increasingly suggests that poorer speech intelligibility in older individuals may be related to changes in how the cerebral cortex responds to amplitude envelopes (Millman et al., 2017; Goossens et al., 2018).
Neural activity readily synchronizes with lower-frequency (<20 Hz) sinusoidal amplitude envelopes of sounds (Aiken and Picton, 2008), but older adults often exhibit greater synchronization than younger to such amplitude modulations (AMs) (Goossens et al., 2016; Presacco et al., 2016a,b; Herrmann et al., 2019). Enhanced AM synchronization may be disadvantageous for speech-in-noise perception (Millman et al., 2017; Goossens et al., 2018, 2019) because it may distort envelope pattern and depth cues (Moore and Glasberg, 1993; Schlittenlacher and Moore, 2016). Exaggerated AM synchronization may be related to heightened excitability of the auditory cortex in older people (Snyder and Alain, 2005; Bidelman et al., 2014; Herrmann et al., 2016; Salvi et al., 2017), perhaps resulting from reduced neural inhibition (Caspary et al., 2008).
The shape of amplitude envelopes in speech varies, including in the shape of the attack (rise) and decay (fall) portions (Rosen, 1992). Envelope-shape cues are important for identifying and discriminating between consonants (e.g., /pa/ vs /ta/) (van der Horst et al., 1999). Envelope shape can also alter cochlear excitation (Carlyon, 1996) and neural synchronization patterns (Pressnitzer et al., 2000; Lu et al., 2001; Neuert et al., 2001). Inferior colliculus neurons synchronize more strongly with damped (sharp attack, gradual decay) compared with ramped (gradual attack, sharp decay) envelope shapes in aged rats, whereas the opposite occurs for young rats (Herrmann et al., 2017). Synchronization to ramped and damped envelopes may also differ between older and younger human listeners: increased synchronization to sharp attacks in sounds may explain why older individuals report difficulty suppressing distracting sounds (Parmentier and Andrés, 2010; Mishra et al., 2014), and with speech-in-noise perception when modulated background sound (containing sharp attacks) is present (Moore and Glasberg, 1993; Millman et al., 2017).
Studies in humans and animals almost exclusively focus on synchronization at the stimulation frequency (Purcell et al., 2004; Dimitrijevic et al., 2016; Henry et al., 2017; Herrmann et al., 2017, 2018). Yet, energy is also commonly observed at the harmonics (Lins et al., 1995; Zhu et al., 2013), indicating responses are not fully sinusoidal (Dallos, 1973; Mayoral et al., 2017). Analysis of nonsinusoidal response features, such as the harmonics, improves classification of neural synchronization in clinical settings (Cebulla et al., 2006) and predictions about AM coding using computational modeling (Vasilkov and Verhulst, 2019; Keshishzadeh et al., 2020). Further, nonsinusoidal signal-shape features, such as sharpness, can provide important information about neural dysfunction (Cole et al., 2017). Considering neural response features other than synchronization at the stimulation frequency may provide a better understanding of age-related neural synchronization changes.
In the present study, we examine neural synchronization to narrowband noise stimuli with ramped and damped envelope shapes in younger and older human adults. We use stimuli in two carrier frequency bands (0.9-1.8 kHz, 1.8-3.6 kHz), which are within the frequency range to which human hearing is most sensitive, and in which the articulatory resonances that indicate speech sound identity (i.e., formants) are found. We also expand on traditional Fourier-based analyses to characterize nonsinusoidal response features.
Materials and Methods
Participants
Forty-nine younger (25 younger: 9 males and 16 females aged 18-32 years, mean = 21.8 years, ± SD = 3.2 years) and older (24 older: 6 males and 18 females aged 50-83 years, mean = 66.1 years, ± SD = 8.0) individuals were recruited for this experiment from the Western University Psychology subject pool and the surrounding community of London, Ontario, Canada via Western's neuroscience research registry (OurBrainsCAN; https://ourbrainscan.uwo.ca/). All participants provided informed consent according to a protocol approved by Western's Research Ethics Board (REB #112015), and either received course credit or financial compensation of $10 CAD per hour. The 49 participants included in this study reported having no hearing loss, hearing aid usage, neurologic issues, or psychiatric disorders. Data from 3 additional individuals were not included because of a technical error during data recording (N = 1), a neurologic disorder (N = 1), or hearing aid usage (N = 1). Data from the younger participant group were also analyzed in the stimulus-selection phase of the experiment (see Fig. 2).
Acoustic stimuli
Stimuli were narrowband noises generated by adding 150 randomly sampled carrier frequency components with different onset phases from one of two possible carrier frequency bands (low: 0.9-1.8 kHz; high: 1.8-3.6 kHz). Frequency bands were chosen to span the range of highest human sensitivity, and much of the energy that contributes to discrimination of speech sounds. Narrowband noises were amplitude modulated at a rate of 4 Hz with either a ramped (gradual attack and sharp decay) or damped (sharp attack and gradual decay) envelope shape (Fig. 1) (Herrmann et al., 2017). Amplitude envelopes were created by varying parameters of the following equation:
(1) |
where t is a time vector representing one cycle (0.250 s), z determines the envelope shape, and b is the resulting function used to modulate the noise. A z parameter of 2 generates a symmetrical envelope shape, whereas a value closer to 1 generates an envelope with a damped shape (sharp attack and gradual decay). Varying the z parameter also impacts the sharpness and half-life of the oscillation. We used a z parameter of 1.4491 and 1.15 (based on Herrmann et al., 2017) to generate weakly and strongly modulated envelope shapes, respectively (Fig. 1). Strongly modulated damped envelopes have sharp onsets and a 168.4 ms half-life, whereas weakly modulated envelopes have more sloped onsets and a 195 ms half-life. Weakly and strongly modulated ramped envelope shapes were created by mirroring the vector b. Stimuli were normalized relative to peak amplitude and presented at ∼75 dB SPL (identical for all listeners).
In an effort to select either weakly or strongly modulated envelope shapes for the main experiment, which investigates response changes with age, we examined neural synchronization to either weakly or strongly ramped and damped 4 Hz amplitude-modulated noise stimuli in two groups of younger adults. One group of participants listened to noises with weakly ramped/damped envelope shapes, while the other group listened to noises with strongly ramped/damped envelope shapes. Additional details regarding the participants, procedure, and analyses are reported in the legend of Figure 2. We observed that synchronized neural activity was larger for strong compared with weak envelope shapes. Therefore, we use strongly shaped envelopes to investigate whether neural synchronization with ramped and damped AMs changes with age.
Task procedure
The experiment was conducted in a single-walled sound-attenuating booth (Eckel Industries). Sounds were delivered through Sennheiser (HD 25 Light) headphones, using an RME Fireface 400 external soundcard controlled by a PC (Windows 10) and Psychtoolbox (version 3) in MATLAB (R2017b). EEG was recorded while participants passively listened to a series of strongly modulated ramped and damped sounds while watching a muted captioned movie of their choice. Each stimulus had a duration of 4 s, and stimuli were presented at an onset-to-onset interval of 5.021 s. Participants heard 28 ramped and 28 damped stimuli in each of the two carrier frequency bands (low: 0.9-1.8 kHz; high: 1.8-3.6 kHz) during each of the 6 blocks, for a total of 168 trials per condition per person.
Experimental design and statistical analysis
Statistics were conducted using a combination of IBM SPSS Statistics for Windows (version 24) and MATLAB. Details of the specific variables and statistical tests for each analysis can be found in subsequent analysis subsections. In general, group differences were examined either using an ANOVA or independent-samples t tests. Nonparametric Mann–Whitney U tests were used to analyze behavioral ratings, as these data were ordinal, not continuous. Significant main effects and interactions were followed up using t tests, with multiple comparisons corrected using the false discovery rate (FDR) (Benjamini and Hochberg, 2016) correction. FDR-corrected p values are referred to as pFDR. Effect sizes are reported as partial η squared (η2p) for ANOVAs and requivalent (re) (Rosenthal and Rubin, 2003) for t tests. This experiment was not preregistered. Data are available at the project website on the Open Science Framework (OSF; https://osf.io/eq45x/).
Behavioral hearing assessment
Pure-tone thresholds were measured for all participants at octave frequencies between 0.25 and 8 kHz in the left and right ear (see Fig. 3a). Pure-tone thresholds were used to calculated pure-tone averages (PTA) across octave frequencies from 0.5 to 4 kHz (averaged across ears), to characterize the presence of hearing loss in a range of frequencies relevant to the stimuli from the main portion of the experiment (see Materials and Methods). Average PTA thresholds were submitted to an independent-samples t test with age group (younger, older) as the grouping variable.
Participants also answered questions taken from the Speech, Spatial, and Qualities of Hearing Scale (Gatehouse and Noble, 2004), asking them to use a Likert scale (0: 'not at all' to 10: 'perfectly') to rate their ability in listening situations requiring spatial hearing (N = 2), speech perception in noise (N = 2), and suppression of distracting background sounds (N = 1). Average scores were generated for listening-situation categories with multiple questions (i.e., spatial hearing, speech perception in noise). Given that SSQ scores are ordinal, not continuous, we used separate Mann–Whitney U tests (non-parametric) examine age group (younger, older) differences on each listening situation category (spatial hearing, speech perception in noise, distractor suppression).
Participants completed the Quick Speech in Noise test (QuickSIN) (Killion et al., 2004), a clinical measure used to assess speech understanding in noise. All target sentences and babble noise were taken from the QuickSIN database. During the test, a target sentence, spoken by a female talker, was presented with four-talker babble as background noise (overall 70 dB SPL). Participants were instructed to listen to each sentence and type the words that they heard. Sentences were presented in sets of 6, which began with a 25 dB signal-to-noise ratio (SNR) and reduced in 5 dB steps until the final sentence was completed. Participants were each asked to complete four sentence sets (24 total sentences) that were randomly selected from 12 possible sets. We calculated performance for each SNR separately (25, 20, 15, 10, 5, 0 dB), and report the total proportion of correct words for each SNR. Performance was at ceiling for all SNRs except 0 dB. We therefore examined age group (younger, older) differences on performance at the 0 dB SNR using an independent-samples t test.
EEG recording and preprocessing
EEG was recorded from 16 active electrodes (Ag/AgCl) placed on the scalp using an electrode cap with spacing according to the 10/20 system (Biosemi ActiveTwo system). We also recorded and averaged signals from both mastoids to rereference the data during offline analysis. During data recording, all electrodes were referenced to a feedback loop formed of two electrodes: a common mode sense active electrode and a driven passive electrode (see www.biosemi.com/faq/cms&drl.htm). EEG was recorded at 16,384 Hz to target peripheral and subcortical sources during auditory brainstem response (ABR) recording (online low-pass filter 3334 Hz) and at 1024 Hz to isolate primarily cortical sources during envelope tracking (online low-pass filter of 208 Hz). All preprocessing was conducted offline using MATLAB software and the Fieldtrip toolbox (Oostenveld et al., 2011).
For ABR recordings, data were rereferenced to the averaged signal from both mastoids, a notch filter was used to attenuate signal at line-noise frequencies (60 and 120 Hz), and then the EEG data were high-pass (80 Hz, 2743 points, Hann window) and low-pass filtered (2000 Hz, 101 points, Hann window). Continuous data were segmented into 12 ms epochs ranging from −2 ms to 10 ms time-locked to click onset. Epochs in which signal changed by >25 µV during the 0-10 ms time window in any channel were rejected (average rejection rate: 16%).
For cortical EEG, data were rereferenced to the averaged signal from both mastoids and then high-pass (0.7 Hz, 2449 points, Hann window) and low-pass filtered (30 Hz, 101 points, Hann window). The continuous EEG data were segmented into epochs ranging from −0.5 to 4 s, time-locked to the onset of each stimulus. Ocular artifacts were removed using independent components analysis (Makeig et al., 1996). Epochs in which the signal changed by >150 µV in any channel were rejected (average rejection rate: 5%).
EEG analysis: peripheral and subcortical neural responses
We recorded click-evoked ABRs to derive an objective physiological measure of auditory peripheral and subcortical function. Participants were asked to passively listen to a series of isochronous clicks presented monaurally to the right ear, while watching a muted captioned movie of their choice and EEG was recorded. Each click had a 0.1 ms duration (rectangular window) and was presented monaurally to the right ear with an 11.3 ms onset-to-onset interval and an approximate sound level of 88 dB SPL. A total of 4000 clicks were presented with click polarity inverted on half of the trials, resulting in an equal proportion of condensation and rarefaction clicks.
A small subset of electrodes were used for the analysis to approximate a vertical electrode montage (Cz referenced to mastoid ipsilateral to sound presentation); this subset was chosen because it is known to maximize appearance of both Wave I and Wave V (Picton, 2010a). Peak latency was identified as the time point corresponding to maximum amplitude within a time window specific to Wave I (1-3 ms) and Wave (5-7 ms). Peak amplitude was calculated by averaging the amplitude within a 0.5 ms window centered on Wave I or Wave latency. Resulting peaks were visually inspected to ensure the response peaks well-characterized Wave I and Wave responses.
No discernible Wave I or peak could be identified for two individuals in the younger age group, both of whom required that >75% trials be rejected because of excessive artifact. These individuals were excluded from ABR analysis. Wave I and amplitudes and latencies were calculated and then analyzed for the remaining participants (24 older and 23 younger) using four separate independent-samples t tests, each which had age group (younger, older) as the grouping variable.
EEG analysis
Cortical responses to sound onset
Single-trial time courses for each envelope shape (ramped, damped) and carrier frequency band (low, high) were averaged separately. We examined P1 and N1, both of which are sensory-evoked responses with primary sources originating in auditory cortex (Hari et al., 1982; Näätänen and Picton, 1987; Pantev et al., 1988; Liégeois-Chauvel et al., 1994; Yoshiura et al., 1995), by averaging responses across a fronto-central electrode cluster that is sensitive to both responses (Fz, F3, F4, Cz, C3, C4) (Näätänen and Picton, 1987; Picton, 2010b). Mean amplitude was calculated by finding the time point corresponding to either the maximum (P1) or minimum amplitude (N1 peak) within a time window specific to the onset response (P1: 0.045-0.065 s; N1: 0.085-0.115 s), and then averaging the amplitude values within a corresponding averaging window (P1: 0.02-s; N1: 0.03-s) centered on the response peak. Visual inspection was performed to ensure the response peaks were accurately found for the P1 and the N1, for both ramped and damped envelope shapes.
We used onset response amplitude (P1, N1) as a metric of neural responsiveness to sound (compare Snyder and Alain, 2005; Alain et al., 2014; Herrmann et al., 2016, 2019; Henry et al., 2017). P1 and N1 amplitudes were submitted to separate ANOVAs with envelope shape (ramped, damped) and carrier frequency (low, high) as within-subject factors and age group (younger, older) as a between-subjects factor.
Time course correlation similarity
In order to better understand whether, and to what extent, cortical time courses differ between age groups, we quantified the degree of similarity between the neural time courses. After excluding the first 0.5 s (onset response range), we averaged responses across electrodes (Fz, F3, F4, Cz, C3, C4) and the two carrier frequency band conditions (because no interactions with age group were observed when including carrier frequency as a factor, F < 0.44, p > 0.51, η2p < 0.009), resulting in one averaged time course for each envelope shape (ramped, damped) condition and participant. Correlations between the averaged time courses were calculated, separately for ramped and damped stimuli, such that each participant's time course was correlated with the time course of each participant within their “own” age group and with the time course of each participant from the “other” age group. For each envelope shape and each participant, the set of r values resulting from the correlations with time courses from other participants were categorized (“own” vs “other” group) and averaged separately, yielding four mean correlations for each participant: two for each envelope shape: one for “own” group and one for “other” group. Larger own group r values would indicate an individual's response time course was highly synchronous with others in their own age group, whereas larger other group r values would indicate an individual's response time course was more synchronous with individuals in the other age group.
To quantify the degree of similarity between the neural time courses for younger and older adults, we compared average r values using an ANOVA with envelope shape (ramped, damped) and correlation type (own group r, other group r) as within-subjects factors and age group (younger, older) as the between-subjects factor.
Neural synchronization strength
In order to characterize neural synchronization to AM in the narrowband noise stimuli, we calculated intertrial phase coherence (ITPC) (Lachaux et al., 1999). For each condition, a Fourier transform was calculated for the 0.5-4 s time window of single-trial time courses (Hann window; zero-padding). The first 0.5 s was excluded from the analysis to prevent onset responses from affecting the neural-synchronization analysis. Each complex number resulting from the Fourier transform was divided by its absolute value and averaged across trials. ITPC values were derived by calculating the absolute value of the resulting average. ITPC can take on values between 0 and 1, with larger values indicating greater coherence. For each condition, ITPC was averaged across electrodes (Fz, F3, F4, Cz, C3, C4). Average ITPC was extracted at the AM frequency (4 Hz; averaging window: ±0.05 Hz).
The Fourier transform decomposes a time domain signal into unique sines and cosines, regardless of whether the time domain signal has a dominant sinusoidal structure. As a result, any complex signals that contain periodic, but nonsinusoidal, structure in the time domain will yield peaks in the spectrum at the fundamental frequency as well as at the harmonics (Mayoral et al., 2017). Ignoring the response magnitude at the harmonics may leave out important information, since neural synchronization to amplitude-modulated sounds is often nonsinusoidal (Dallos, 1973; Lins et al., 1995; Cebulla et al., 2006; Zhu et al., 2013). Given that responses are clearly visible in the spectrum at the harmonics of the stimulation frequency (see Figs. 2, 4) we also averaged ITPC values at the stimulation frequency (4 Hz) and harmonics up to 20 Hz (i.e., 4, 8, 12, 16, and 20 Hz; abbreviated hereafter using 4:4:20 Hz; averaging window: ±0.05 Hz). By doing so, we can explore whether including nonsinusoidal response features, such as responses to the harmonics, offers additional information above and beyond what is observed at ITPC to the stimulation frequency (4 Hz).
To examine whether neural synchronization strength differed between age groups, ITPC at the AM frequency (4 Hz) and for the 4 Hz fundamental/harmonic series (4:4:20 Hz) were submitted to separate ANOVAs, each with envelope shape (ramped, damped) and carrier frequency (low, high) as within-subject factors and age group (younger, older) as a between-subjects factor.
Quantification of nonsinusoidal response patterns and signal shape
A growing body of evidence suggests that nonsinusoidal activity features can provide important information about underlying neural response properties (Cole and Voytek, 2017; Cole et al., 2017), including those that may indicate neural dysfunction (Sherman et al., 2016). We therefore investigated the degree to which synchronized neural responses diverge from a sinusoidal shape in two unique, but complementary, ways.
First, we investigated the harmonic structure of the ITPC frequency spectrum. High amplitude at the harmonics of the fundamental frequency would indicate that responses are less sinusoidal (Dallos, 1973; Mayoral et al., 2017). To approximate this, we extracted ITPC at the 4 Hz fundamental frequency (4 Hz, averaging window: ±0.05 Hz), and at the frequency of the harmonics from 8 to 20 Hz (i.e., 8, 12, 16, and 20 Hz; 8:4:20 Hz, averaging window: ±0.05 Hz) for each condition, and calculated the ratio between the two according to the following equation:
(2) |
where F0 refers to mean ITPC at 4 Hz, and F1:F4 refers to mean ITPC across harmonic frequencies: 8, 12, 16, and 20 Hz. Larger Q values indicate a more sinusoidal synchronization response, whereas smaller Q values indicate a more nonsinusoidal synchronization response. Q was submitted to an ANOVA with envelope shape (ramped, damped) and carrier frequency (0.9-1.8 kHz, 1.8-3.6 kHz) as within-subjects factors and age group (younger, older) as the between-subjects factor.
Using a second approach, we quantified specific nonsinusoidal signal shape features. For this analysis, the amplitude values of trial-averaged time courses (averaged across electrodes: Fz, F3, F4, Cz, C3, C4) were related to the 4 Hz stimulus phase. That is, the time course amplitude data were binned according to phase values assuming a 4 Hz sinusoid (number of bins: 100; window width: 0.063 radians), such that signal amplitude was represented as a function of phase (see Fig. 7a). An exponential cosine function was fit to the amplitude data using the following equation:
(3) |
where y is the vector of binned amplitudes as a function of phase, a is the parameter for amplitude, x is the starting phase value, p is a vector of the 100 linearly spaced phase values relating to amplitude values in y, b is the intercept, and e is the exponent parameter which determines the sharpness of the function (see Fig. 7b). An exponent of 1 reflects a sinusoid. An exponent >1 means the signal is nonsinusoidal, and the function increases in sharpness with increasing exponent. Here, we analyzed two parameters from each fit, amplitude, and exponent (sharpness), to directly quantify whether neural responses are hyper-responsive (amplitude) and contain nonsinusoidal response features (exponent) (b, the intercept, is not meaningful here as it is close to zero because of the high-pass filter). The estimated amplitude a and the estimated exponent e were submitted to an ANOVA with envelope shape (ramped, damped) and carrier frequency (low, high) as within-subjects factors and age group (younger, older) as the between-subjects factor. The absolute value of the fitted amplitude a was calculated before the ANOVA because the inclusion of e in the formula sometimes led to a sign inversion of a.
Results
Younger and older listeners differ in behavioral hearing assessment, but not in subcortical responses
Pure-tone thresholds for octave frequencies between 0.25 and 8 kHz (averaged across ears) are plotted in Figure 3a. All participants had PTA thresholds (0.5-4 kHz averaged across ears) ≤31 dB HL. Relative to younger participants, older adults had elevated PTA thresholds (±10.93 dB HL; t(47) = −6.96, p = 9.56 × 10−9, re = 0.71) and lower self-reported ratings for spatial hearing (U = 183.5, p = 0.018), sound distractor suppression (U = 154, p = 0.003), and understanding speech in the presence of background noise (U = 154.5, p = 0.003) (Gatehouse and Noble, 2004).
Older and younger adults performed at ceiling on the QuickSIN, except for the most difficult SNR level (0 dB). At 0 dB, both groups exhibited proportions of correctly reported words that were significantly lower than 1 (younger: mean = 0.34, SE = 0.03, 95% CI [0.28 0.40]; older: mean = 0.20, SE = 0.02, 95% CI [0.15 0.24]), and proportions were lower for older compared with younger adults (0 dB, t(47) = 3.66, p = 0.001, re = 0.47; Fig. 3b).
Despite these age-group differences in behavioral assessment metrics, at the neural level, there was no group difference in Wave I or Wave amplitude (Wave I: p = 0.823; Wave V: p = 0.295) or latency (Wave I: p = 0.105; Wave V: p = 0.574) in response to click stimulation (11.3 Hz, 88 dB SPL) (Fig. 3c). To examine whether hearing loss, instead of age, was associated with a reduction in subcortical function, we calculated the partial correlation (controlling for age) between average PTA thresholds and the ratio between Wave V and I amplitude (Wave V/I ratio), but did not find a significant relationship (r(44) = –0.1, p = 0.51). These findings do not suggest that auditory nerve and subcortical function differed across age groups.
Aged auditory cortex is hyperresponsive to sound
Aging and hearing loss are associated with maladaptive cortical plasticity that leads to greater responsivity to stimulation by neural populations in auditory cortex (Snyder and Alain, 2005; Alain et al., 2014; Auerbach et al., 2014; Herrmann et al., 2016, 2017, 2018; Henry et al., 2017; Salvi et al., 2017; Herrmann and Butler, 2020). Hyper-responsiveness may be an index of reduced inhibition in cortical circuits (Caspary et al., 2008; Knipper et al., 2013; Ng and Recanzone, 2018). In order to test whether the cortex in the sample of older adults tested here is hyper-responsive, we compared neural responses elicited by sound onset between age groups (Fig. 4a).
Response amplitudes for both P1 and N1 were larger for older compared with younger adults (effect of age group: [P1: F(1,47) = 13.18, p = 0.001, η2p = 0.22] [N1: F(1,47) = 20.68, p = 3.8 × 10−5, η2p = 0.31]; Fig. 4b), indicating hyper-responsiveness to sound. Stimuli with damped envelope shapes also elicited larger P1 and N1 amplitudes compared with noises with ramped envelope shapes (effect of envelope shape: (P1: F(1,47) = 35.73, p = 2.9 × 10−7, η2p = 0.43; N1: F(1,47) = 5.36, p = 0.025, η2p = 0.10). This is consistent with previous research showing that N1 amplitude is larger when the stimulus rise time is fast (Picton, 2010c), probably because sharp onsets drive more synchronous activity than slower onsets. Finally, there was an envelope shape × age group interaction for P1 (F(1,47) = 32.89, p = 6.89 × 10−7, η2p = 0.41). Older adults showed larger P1 amplitudes for damped compared with ramped envelopes (t(23) = 8.22, pFDR = 5.3 × 10−8, re = 0.86), while younger adults showed no such difference (pFDR = 0.864). None of the other effects or interactions were significant (F < 3.8, p > 0.05, η2p < 0.1).
Neural time courses differ between younger and older adults
In order to quantify differences in the response time courses between older and younger adults, we calculated correlations between individual participants' response time courses within and across age groups, separately for stimuli with ramped (Fig. 5a, top) and damped envelope shapes (Fig. 5a, bottom).
Participants had larger own group r compared to other group r scores (effect of correlation type: F(1,47) = 80.88, p = 8.8 × 10−12, η2p = 0.63): participant time courses were more similar within an age group than across age groups. This difference between own group and other group r was larger for younger compared with older subjects (correlation type × age group interaction: F(1,47) = 9.05, p = 004, η2p = 0.16). Similarity scores (r) did not differ as a function of envelope shape (p = 0.336), but a significant envelope shape × age group interaction (F(1,47) = 30.59, p = 1.4 × 10−6, η2p = 0.39), showed that younger subjects had larger similarity scores for ramped compared with damped (t(24) = 4.24, pFDR = 6 × 10−4, re = 0.65), while older subjects had larger similarity scores for damped compared with ramped (t(24) = −3.59, pFDR = 0.002, re = 0.65).
A significant envelope shape × correlation type × age group interaction (F(1,47) = 90.1, p = 1.7 × 10−12, η2p = 0.66) was analyzed with post hoc t tests. For ramped-envelope stimuli, younger participants had larger own group r compared with other group r, suggesting younger participants neural-response time courses to ramped envelopes were correlated more strongly among their peers than with time courses from older adults (t(24) = 14.34, pFDR = 2.9 × 10−13, re = 0.95; Fig. 5b, top left). Older participants showed no difference between own group and other group r for ramped-envelope stimuli (pFDR = 0.527; Fig. 5b, top right). For damped-envelope stimuli, older adults had larger own group r compared with other group r (t(23) = 9.20, pFDR = 3.6 × 10−9, re = 0.89; Fig. 5b, bottom right), whereas younger participants showed no difference between own and other group r for damped envelope shapes (pFDR = 0.369; Fig. 5b, bottom left). Together, these analyses show that younger participants exhibit more synchronous neural responses when listening to ramped envelopes (slow onset and rapid offset), whereas older participants produce more synchronous neural responses when listening to damped envelopes (rapid onset and slow offset).
Neural synchronization for different envelope shapes differs between younger and older adults
We quantified how envelope shape affects neural synchronization (ITPC) in older and younger adults (Fig. 6a). For ITPC at the 4 Hz stimulation frequency, there was no effect of age group (F(1,47) = 3.45, p = 0.07, η2p =0.07) nor envelope shape (p = 0.611), but the age group × envelope shape interaction was significant (F(1,47) = 17.82, p = 1.10 × 10−4, η2p = 0.28). Younger adults showed increased ITPC for sounds with ramped compared with damped envelope shapes (t(24) = −3.27, pFDR = 0.006, re = 0.56), whereas older adults showed the reverse pattern (t(23) = 2.69, pFDR = 0.013, re = 0.49; Fig. 6b). ITPC was also larger for sounds with high compared with low-carrier frequency bands for younger adults (t(24) = −2.60, pFDR =0.032, re = 0.47), but there was no difference for older adults (pFDR = 0.498; age group × carrier frequency interaction: F(1,47) = 6.08, p = 0.017, η2p = 0.12). There was a significant interaction between envelope shape and carrier frequency (F(1,47)= 8.19, p = 0.006, η2p = 0.15), but follow-up comparisons did not reveal any significant differences between ramped and damped low-carrier frequency sounds (pFDR = 0.446) or ramped and damped high-carrier frequency sounds (pFDR = 0.225). None of the other effects were significant (F < 2.67, p > 0.11, η2p < 0.05).
ITPC for the fundamental/harmonic series (Fig. 6b) did not differ between age groups (p = 0.678), but ITPC was larger for damped compared with ramped envelope shapes (effect of envelope shape: F(1,47) = 13.55, p = 0.001, η2p = 0.22). Critically, the envelope shape × age group interaction was significant (F(1,47) = 7.01, p = 0.011, η2p = 0.13; Fig. 6b): neural synchronization was larger for damped compared with ramped envelope shapes for older adults (t(23) = 5.35, pFDR = 3.9 × 10−5, re = 0.74) but did not differ between envelope shapes for younger participants (pFDR = 0.524). There was also an interaction between age group and carrier frequency (F(1,47) = 6.02, p = 0.018, η2p = 0.11). This was driven by reduced synchronization for sounds with a high compared with a low-carrier frequency band in older participants (t(23) = 3.17, pFDR = 0.046, re = 0.55), and a nonsignificant trend toward the opposite pattern in younger participants (pFDR = 0.269). None of the other effects were significant (F < 1.06, p > 0.31, η2p < 0.02).
Neural synchronization is less sinusoidal in older compared with younger adults
A growing body of evidence suggests that analyzing nonsinusoidal signal features can provide important information about underlying neural response properties (Cole and Voytek, 2017; Cole et al., 2017). To characterize the presence of nonsinusoidal signal features, we used the harmonic structure of the ITPC frequency spectrum to calculate Q, which indexes the degree to which the response is sinusoidal (spectral peak mainly at F0) versus nonsinusoidal (spectral peaks at the harmonics, F1 to F4). Q was smaller, indicating a less sinusoidal response, in older compared with younger participants (effect of age group: F(1,47) = 9.67, p = 0.003, η2p = 0.17) and was smaller (less sinusoidal response) for damped compared with ramped envelope shapes (effect of envelope shape: F(1,47) = 15.54, p = 2.7 × 10−4, η2p = 0.25). An interaction between age group and envelope shape (F(1,47) = 7.13, p = 0.01, η2p = 0.13) was due to younger participants having smaller Q for damped compared with ramped stimuli (t(24) = 4.99, pFDR = 9 × 10–5, re = 0.71), whereas Q was small for both damped and ramped stimuli in older individuals, with no reliable difference (t(23) = 0.85, pFDR = 0.406, re = 0.17). The synchronized response was therefore less sinusoidal for sounds with damped compared with ramped envelopes in younger adults, and nonsinusoidal for both envelope shapes in older adults (Fig. 6c). There was an effect of carrier frequency band: Q was smaller (less sinusoidal response) for sounds with a low- compared with a high-carrier frequency band (F(1,47) = 11, p = 0.002, η2p = 0.19). Finally, there was an interaction between envelope shape and frequency band (F(1,47) = 7.02, p = 0.011, η2p = 0.13), which was a result of smaller Q for damped compared with ramped envelopes with a high-carrier frequency band (t(48) = 4.68, pFDR = 5 × 10−5, re = 0.56), but no difference between damped and ramped envelopes with a low-frequency band (t(48) = 1.87, pFDR = 0.067, re = 0.26). None of the other interactions were significant (F < 1.2, p > 0.29, η2p < 0.02).
Neural synchronization reflects sharper responses in older compared with younger adults
Analysis of specific aspects of the neural signal shape, such as sharpness, that are not well captured using the Fourier transform may provide important information about neural response properties and dysfunction (Sherman et al., 2016; Cole et al., 2017). In order to capture differences in neural signal shape, we binned the amplitude data according to phase values of a 4 Hz sinusoid (Fig. 7a), and then fit an exponential cosine function to the binned amplitude data (Fig. 7b). We analyzed the estimated amplitude (a) and sharpness (exponent e) coefficients, with three-factor ANOVAs (envelope shape, carrier frequency band, age group).
The analysis of amplitude paralleled our ITPC findings (Fig. 6b): there was no age group difference (p = 0.508), but amplitude was larger for damped compared with ramped envelope shapes (effect of envelope shape: F(1,47) = 4.41, p = 0.041, η2p = 0.09; Fig. 7c). The interaction between age group and envelope shape was significant (F(1,47) = 20.90, p = 3.5 × 10−5, η2p = 0.31), revealing larger amplitudes for damped compared with ramped envelope shapes for older individuals (t(23) = 4.48, pFDR = 3 × 10−4, re = 0.68), whereas a nonsignificant pattern in the opposite direction was observed for younger adults (t(24) = −1.84, pFDR = 0.077, re = 0.35). There was also an interaction between age group and carrier frequency band (F(1,47) = 12.86, p = 0.001, η2p = 0.22), which indicated that responses in younger participants were larger for high compared with low-carrier frequency sounds (t(24) = 2.11, pFDR = 0.046, re = 0.40), whereas the reverse pattern was observed for older individuals (t(23) = −3.11, pFDR = 0.01, re = 0.54). None of the other interactions were significant (F < 2.8, p > 0.1, η2p < 0.06).
The exponent (indicating response sharpness) was larger for older compared with younger adults (effect of age group: F(1,47) = 25.95, p = 6 × 10−6, η2p = 0.36), and larger for damped compared with ramped envelopes (effect of envelope shape: F(1,47) = 12.71, p = 0.001, η2p = 0.21; Fig. 7d). No remaining effects or interactions were significant (F < 2.8, p > 0.15, η2p < 0.04). These results indicate that responses were sharper in older compared with younger adults, and for damped compared with ramped envelope shapes.
In sum, signal-shape analyses indicate that, in older adults, responses were larger for envelopes with rapid onsets (and slow offsets) than for those with slow onsets (and rapid offsets), whereas younger adults showed the reverse pattern. Synchronized neural activity was less sinusoidal, and sharper, in older participants compared with younger participants.
Relating neural sensitivity to different envelope shapes with measures of hearing loss
In order to quantify how neural sensitivity to different envelope shapes may be related to our measures of hearing loss, we calculated partial correlations (controlling for age) between PTA and the following neural measures: the difference in synchronization to damped versus ramped envelopes (4:4:20 Hz ITPC; averaged across carrier frequencies; r(46) = –0.09, pFDR = 0.556), the difference between damped and ramped Q scores (averaged across carrier frequencies; r(46) = –0.06, pFDR = 0.679), overall signal sharpness (averaged across envelope shapes and carrier frequencies; r(46) = −0.02, pFDR = 0.918), and overall signal amplitude a (averaged across envelope shapes and carrier frequencies; r(44) = –0.23, pFDR = 0.117), but did not find any significant relationships. We also made a similar comparison between QuickSIN performance (0 SNR) and the same neural measures: the difference in synchronization to damped versus ramped envelopes (4:4:20 Hz synchronization: r(46) = –0.09, pFDR = 0.67), the difference between damped and ramped Q scores (averaged across carrier frequencies; r(46) = –0.13, pFDR = 0.607), overall signal sharpness (r(46) = 0.02, pFDR = 0.916), and overall signal amplitude a (r(46) = –0.19, pFDR = 0.607), but no effects reached significance.
Discussion
We examined neural sensitivity to sound envelopes with a ramped (gradual attack, sharp decay) or damped (sharp attack, gradual decay) envelope shape in younger and older adults. The three main findings are as follows: (1) auditory cortex of older adults is hyper-responsive to sound compared with younger adults despite similar subcortical responses in both groups; (2) neural activity in older adults synchronizes more strongly with rapid-attack, slow-decay (damped) envelopes, whereas in younger adults it synchronizes more strongly with slow-attack, rapid-decay (ramped) envelopes; and (3) synchronized neural activity is less sinusoidal and sharper, appearing more burst-like, in older compared with younger adults. Our results demonstrate that older adults' sensitivity to the amplitude envelope of sounds differs fundamentally from that of younger adults.
Auditory cortex of older adults is hyper-responsive to sound
We observed larger cortical responses to sound onset in older compare to younger adults (Fig. 4b), despite higher pure-tone thresholds (lower sensitivity) in older individuals (Fig. 3a) and no difference in subcortical responses between age groups (Fig. 3c). Our results are consistent with a growing literature showing hyper-responsiveness to sound in the cortex of older compared with younger rats (Hughes et al., 2010) and humans (Amenedo and Díaz, 1999; Snyder and Alain, 2005; Sörös et al., 2009; Lister et al., 2011; Alain et al., 2012; Herrmann et al., 2016, 2018; Herrmann and Butler, 2020), as well as in rats following noise exposure (Popelár˘ et al., 1987; Syka et al., 1994; Manzoor et al., 2012) and adult humans with hearing loss compared with those without (Tremblay et al., 2003; Alain et al., 2014; Millman et al., 2017).
Hyper-responsiveness is thought to arise from damage to the auditory periphery, such that deprivation of inputs from peripheral structures to brain regions downstream leads to reduced neural inhibition and increased excitation throughout the auditory pathway (Caspary et al., 2008; Auerbach et al., 2014; Salvi et al., 2017). Consistent with the current results, studies comparing noise-exposed to control animals have repeatedly shown that hyper-responsiveness manifests most strongly in auditory cortex (Auerbach et al., 2014; Chambers et al., 2016; Asokan et al., 2018). Indeed, hyper-responsiveness in auditory cortex has been taken as an index of a loss of inhibition (Ng and Recanzone, 2018). The enhanced onset responses in older compared with younger adults could also be a result of reduced response variability, as older adults have been shown to exhibit less variable neural response profiles compared with younger adults (Garrett et al., 2010, 2011). More consistent single-trial responses would result in a larger response magnitude in the average. Decreased response variability would likely be secondary to a loss of neural inhibition.
Neural synchronization patterns differ between younger and older adults
By correlating each individual's neural time course with that of other participants, we were able to derive a metric of neural response similarity for ramped and damped sounds across participants. This analysis is conceptually similar to calculating intersubject correlation (Hasson et al., 2008; Cohen and Parra, 2016), and represents a measure of global synchrony with other participants. Time courses were more similar between individuals from the same age group than for individuals from different age groups, but this was only observed when participants listened to the envelope shape for which each age group showed heightened ITPC sensitivity. In younger individuals, neural-activity time courses were more synchronous with other younger participants than with older ones when listening to stimuli with a ramped envelope shape (Fig. 5c). In contrast, older individuals exhibited neural-activity time courses for damped envelopes that were more synchronous with other older participants than with those of younger ones. Thus, older and younger subjects preferentially synchronize to specific envelopes shapes, and this effect is highly consistent across subjects.
Previous work has demonstrated larger synchronization strength at the stimulation frequency for older compared with younger adults (Goossens et al., 2016, 2018, 2019; Presacco et al., 2016a,b). We did not observe an age-related increase in synchronization at the stimulation frequency, or when we additionally considered the energy at harmonic frequencies (4:4:20 Hz). Sound intensity has been shown to affect the magnitude of synchronized activity (Picton et al., 2003). Many studies control for audibility between normal-hearing and hearing-impaired participants by increasing the sound level for those with hearing impairment (Millman et al., 2017; Goossens et al., 2018, 2019). In some cases, a larger synchronization response in older compared with younger individuals is not observed if sounds are presented at the same level in both groups (Goossens et al., 2019). We used a sound level of ∼75 dB SPL for both age groups, and observed that older adults were hyper-responsive to sound onset, but did not show increased synchronization with the AM stimulus, compared with younger listeners.
Compatible with previous findings in spiking activity of the inferior colliculus of rats (Herrmann et al., 2017), older adults showed increased synchronization strength for damped compared with ramped envelopes, whereas younger adults showed the opposite pattern (Fig. 6b). The sizable response to damped compared with ramped envelopes across harmonics suggests older participants have hyper-sensitivity to sounds with sharp onsets. Further, we generally observed that the effects are consistent across both carrier frequency ranges, without consistent interactions between age group and carrier frequency. The age-related changes in envelope sensitivity we observed appear to generalize across the range of frequencies tested here, which covers the critical frequency range used to discriminate speech sounds.
The shape of synchronized neural activity differs fundamentally between age groups
Typically, neural synchronization with AM focuses on the response at the stimulation frequency (sinusoidal component) (Purcell et al., 2004; Zhu et al., 2013; Bharadwaj et al., 2015; Dimitrijevic et al., 2016; Henry et al., 2017; Herrmann et al., 2017, 2019), although synchronization patterns commonly include nonsinusoidal response features, such as responses to the harmonics (Dallos, 1973; Lins et al., 1995; Cebulla et al., 2006; Zhu et al., 2013). Further, there is evidence that analyzing nonsinusoidal signal shape features, such as sharpness, can provide important physiological information about neural signaling and system dysfunction (Cole et al., 2017). We analyzed the extent to which neural responses consisted of primarily sinusoidal or nonsinusoidal response patterns by studying the ratio of responses at the fundamental to an average of the signal at the harmonics (Q). This showed that neural responses were overall less sinusoidal for damped envelopes compared with ramped, and for older compared with younger adults.
We also analyzed specific features of the neural signal shape and showed both ramped and damped envelopes elicited sharper neural responses in older compared with younger adults (Fig. 7d), suggesting heightened sensitivity to both the sharp onset of damped envelopes and sharp offset of ramped envelopes. Sharp response features of neural activity indicate that the response is likely driven by short synchronous bursts of activity: the same neural responses spread out in time would create a smoother more sinusoidal signal shape (Sherman et al., 2016; Cole and Voytek, 2017). These findings suggest that how the auditory system responds to amplitude envelopes in sounds is fundamentally changed in older individuals, such that responses are more synchronous, appearing burst-like.
Relating envelope-shape sensitivity to hearing ability
Previous behavioral work has demonstrated that preserved envelope-shape cues are critical for speech intelligibility (Drullman et al., 1994; Shannon et al., 1995), and that poorer speech intelligibility in older individuals may be related to changes in envelope coding (Millman et al., 2017; Goossens et al., 2018). Despite observing fundamental changes in older adults' neural sensitivity to amplitude envelopes, no relation with speech-in-noise ability was observed. One limitation of using the QuickSIN to assess speech-in-noise here is that the babble noise masker has a nearly flat temporal profile, and may not reflect a reliance on temporal envelope cues as strongly as when using a speech-on-speech task, which has salient temporal fluctuations in the target and masker. Alternatively, observed changes in envelope sensitivity may be more related to other hearing difficulties that older adults experience, such as difficulty suppressing background sounds (Parmentier and Andrés, 2010; Mishra et al., 2014). Increased sensitivity to sharp stimulus features may increase the salience of background sounds with such features (e.g., stop consonants) (Repp and Lin, 1989). Increased sensitivity to sharp stimulus features may also contribute to the discomfort some individuals experience when using a hearing aid with fast compression time constants (compare Gatehouse et al., 2003), which distorts the temporal envelope, such that the attack is more rapid.
In conclusion, we examined how different envelope shapes (ramped, damped) affect neural synchronization in younger and older adults. Older participants demonstrated neural hyper-responsiveness to sound onsets, despite showing no major differences in neural responses at peripheral and subcortical levels. Older participants also showed increased sensitivity to damped compared with ramped envelope shapes, whereas the opposite pattern was observed in younger adults. Furthermore, synchronized neural activity appeared less sinusoidal and more burst-like in older, compared with younger, individuals. Our findings underscore the importance of characterizing sinusoidal and nonsinusoidal features of synchronized neural responses to stimuli, and suggest that aging is accompanied by major changes in the way that brain activity synchronizes with AMs in sounds.
Footnotes
This work was supported by Canadian Institutes of Health Research MOP133450 to I.S.J. B.H. was supported by a BrainsCAN Tier I postdoctoral fellowship (Canada First Research Excellence Fund) and the Canada Research Chair program.
The authors declare no competing financial interests.
References
- Aiken SJ, Picton TW (2008) Human cortical responses to the speech envelope. Ear Hear 29:139–157. 10.1097/AUD.0b013e31816453dc [DOI] [PubMed] [Google Scholar]
- Alain C, McDonald K, Van Roon P (2012) Effects of age and background noise on processing a mistuned harmonic in an otherwise periodic complex sound. Hear Res 283:126–135. 10.1016/j.heares.2011.10.007 [DOI] [PubMed] [Google Scholar]
- Alain C, Roye A, Salloum C (2014) Effects of age-related hearing loss and background noise on neuromagnetic activity from auditory cortex. Front Syst Neurosci 8:1–12. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Amenedo E, Díaz F (1999) Ageing-related changes in the processing of attended and unattended standard stimuli. Neuroreport 10:2383–2388. 10.1097/00001756-199908020-00030 [DOI] [PubMed] [Google Scholar]
- Asokan MM, Williamson RS, Hancock KE, Polley DB (2018) Sensory overamplification in layer 5 auditory corticofugal projection neurons following cochlear nerve synaptic damage. Nat Commun 9:10. 10.1038/s41467-018-05333-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- Auerbach BD, Rodrigues PV, Salvi RJ (2014) Central gain control in tinnitus and hyperacusis. Front Neurol 5:1–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Benjamini Y, Hochberg Y (2016) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Stat Soc Ser B 57:289–300. 10.1111/j.2517-6161.1995.tb02031.x [DOI] [Google Scholar]
- Bharadwaj HM, Masud S, Mehraei G, Verhulst S, Shinn-Cunningham BG (2015) Individual differences reveal correlates of hidden hearing deficits. J Neurosci 35:2161–2172. 10.1523/JNEUROSCI.3915-14.2015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bidelman GM, Villafuerte JW, Moreno S, Alain C (2014) Age-related changes in the subcortical-cortical encoding and categorical perception of speech. Neurobiol Aging 35:2526–2540. 10.1016/j.neurobiolaging.2014.05.006 [DOI] [PubMed] [Google Scholar]
- Carlyon RP (1996) Spread of excitation produced by maskers with damped and ramped envelopes. J Acoust Soc Am 99:3647–3655. 10.1121/1.414963 [DOI] [Google Scholar]
- Caspary DM, Ling L, Turner JG, Hughes LF (2008) Inhibitory neurotransmission, plasticity and aging in the mammalian central auditory system. J Exp Biol 211:1781–1791. 10.1242/jeb.013581 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cebulla M, Stürzebecher E, Elberling C (2006) Objective detection of auditory steady-state responses: comparison of one-sample and q-sample tests. J Am Acad Audiol 17:93–103. 10.3766/jaaa.17.2.3 [DOI] [PubMed] [Google Scholar]
- Chambers AR, Resnik J, Yuan Y, Whitton JP, Edge AS, Liberman MC, Polley DB (2016) Central gain restores auditory processing following near-complete cochlear denervation. Neuron 89:867–879. 10.1016/j.neuron.2015.12.041 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cohen SS, Parra LC (2016) Memorable audiovisual narratives synchronize sensory and supramodal neural responses. eNeuro 3:ENEURO.0203-16.2016–11. 10.1523/ENEURO.0203-16.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cole SR, van der Meij R, Peterson EJ, de Hemptinne C, Starr PA, Voytek B (2017) Nonsinusoidal beta oscillations reflect cortical pathophysiology in Parkinson's disease. J Neurosci 37:4830–4840. 10.1523/JNEUROSCI.2208-16.2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cole SR, Voytek B (2017) Brain oscillations and the importance of waveform shape. Trends Cogn Sci 21:137–149. 10.1016/j.tics.2016.12.008 [DOI] [PubMed] [Google Scholar]
- Dallos P (1973) Auditory periphery biophysics and physiology. In: The auditory periphery, pp 395–406. San Diego: Academic. [Google Scholar]
- Dimitrijevic A, Alsamri J, John MS, Purcell D, George S, Zeng FG (2016) Human envelope following responses to amplitude modulation: effects of aging and modulation depth. Ear Hear 37:e322–e335. 10.1097/AUD.0000000000000324 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Drullman R, Festen JM, Plomp R (1994) Effect of temporal envelope smearing on speech reception. J Acoust Soc Am 95:1053–1064. 10.1121/1.408467 [DOI] [PubMed] [Google Scholar]
- Frisina DR, Frisina RD (1997) Speech recognition in noise and presbycusis: relations to possible neural mechanisms. Hear Res 106:95–104. 10.1016/S0378-5955(97)00006-3 [DOI] [PubMed] [Google Scholar]
- Garrett DD, Kovacevic N, McIntosh AR, Grady CL (2010) Blood oxygen level-dependent signal variability is more than just noise. J Neurosci 30:4914–4921. 10.1523/JNEUROSCI.5166-09.2010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Garrett DD, Kovacevic N, McIntosh AR, Grady CL (2011) The importance of being variable. J Neurosci 31:4496–4503. 10.1523/JNEUROSCI.5641-10.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gatehouse S, Noble I (2004) The Speech, Spatial and Qualities of Hearing Scale (SSQ). Int J Audiol 43:85–99. 10.1080/14992020400050014 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gatehouse S, Naylor G, Elberling C (2003) Benefits from HAs in relation to the interaction between the user and the environment. Int J Audiol 42:77–85. 10.3109/14992020309074627 [DOI] [PubMed] [Google Scholar]
- Goossens T, Vercammen C, Wouters J, van Wieringen A (2016) Aging affects neural synchronization to speech-related acoustic modulations. Front Aging Neurosci 8:1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Goossens T, Vercammen C, Wouters J, van Wieringen A (2018) Neural envelope encoding predicts speech perception performance for normal-hearing and hearing-impaired adults. Hear Res 370:189–200. 10.1016/j.heares.2018.07.012 [DOI] [PubMed] [Google Scholar]
- Goossens T, Vercammen C, Wouters J, van Wieringen A (2019) The association between hearing impairment and neural envelope encoding at different ages. Neurobiol Aging 74:202–212. 10.1016/j.neurobiolaging.2018.10.008 [DOI] [PubMed] [Google Scholar]
- Gordon-Salant S (2006) Speech perception and auditory temporal processing performance by older listeners: implications for real-world communication. Semin Hear 27:264–268. 10.1055/s-2006-954852 [DOI] [Google Scholar]
- Gordon-Salant S, Fitzgibbons PJ (1999) Profile of auditory temporal processing in older listeners. J Speech Lang Hear Res 42:300–311. 10.1044/jslhr.4202.300 [DOI] [PubMed] [Google Scholar]
- Griffiths TD, Lad M, Kumar S, Holmes E, McMurray B, Maguire EA, Billig AJ, Sedley W (2020) How can hearing loss cause dementia? Neuron 108:401–412. 10.1016/j.neuron.2020.08.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hari R, Kaila K, Katila T, Tuomisto T, Varpula T (1982) Interstimulus interval dependence of the auditory vertex response and its magnetic counterpart: implications for their neural generation. Electroencephalogr Clin Neurophysiol 54:561–569. 10.1016/0013-4694(82)90041-4 [DOI] [PubMed] [Google Scholar]
- Hasson U, Furman O, Clark D, Dudai Y, Davachi L (2008) Enhanced intersubject correlations during movie viewing correlate with successful episodic encoding. Neuron 57:452–462. 10.1016/j.neuron.2007.12.009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Henry MJ, Herrmann B, Kunke D, Obleser J (2017) Aging affects the balance of neural entrainment and top-down neural modulation in the listening brain. Nat Commun 8:15801. 10.1038/ncomms15801 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Herrmann B, Buckland C, Johnsrude IS (2019) Neural signatures of temporal regularity processing in sounds differ between younger and older adults. Neurobiol Aging 83:73–85. 10.1016/j.neurobiolaging.2019.08.028 [DOI] [PubMed] [Google Scholar]
- Herrmann B, Butler BE (2020) Hearing loss and brain plasticity: The hyperexcitability phenomenon. Preprints: 2020090116. [DOI] [PubMed] [Google Scholar]
- Herrmann B, Henry MJ, Johnsrude IS, Obleser J (2016) Altered temporal dynamics of neural adaptation in the aging human auditory cortex. Neurobiol Aging 45:10–22. 10.1016/j.neurobiolaging.2016.05.006 [DOI] [PubMed] [Google Scholar]
- Herrmann B, Parthasarathy A, Bartlett EL (2017) Ageing affects dual encoding of periodicity and envelope shape in rat inferior colliculus neurons. Eur J Neurosci 45:299–311. 10.1111/ejn.13463 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Herrmann B, Maess B, Johnsrude IS (2018) Aging affects adaptation to sound-level statistics in human auditory cortex. J Neurosci 38:1989–1999. 10.1523/JNEUROSCI.1489-17.2018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hughes LF, Turner JG, Parrish JL, Caspary DM (2010) Processing of broadband stimuli across A1 layers in young and aged rats. Hear Res 264:79–85. 10.1016/j.heares.2009.09.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Keshishzadeh S, Garrett M, Vasilkov V, Verhulst S (2020) The derived-band envelope following response and its sensitivity to sensorineural hearing deficits. bioRxiv 820704. [DOI] [PubMed] [Google Scholar]
- Killion MC, Niquette PA, Gudmundsen GI, Revit LJ, Banerjee S (2004) Development of a quick speech-in-noise test for measuring signal-to-noise ratio loss in normal-hearing and hearing-impaired listeners. J Acoust Soc Am 116:2395–2405. 10.1121/1.1784440 [DOI] [PubMed] [Google Scholar]
- Knipper M, Van Dijk P, Nunes I, Rüttiger L, Zimmermann U (2013) Advances in the neurobiology of hearing disorders: recent developments regarding the basis of tinnitus and hyperacusis. Prog Neurobiol 111:17–33. 10.1016/j.pneurobio.2013.08.002 [DOI] [PubMed] [Google Scholar]
- Lachaux JP, Rodriguez E, Martinerie J, Varela FJ (1999) Measuring phase synchrony in brain signals. Hum Brain Mapp 8:194–208. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liégeois-Chauvel C, Musolino A, Badier JM, Marquis P, Chauvel P (1994) Evoked potentials recorded from the auditory cortex in man: evaluation and topography of the middle latency components. Electroencephalogr Clin Neurophysiol 92:204–214. 10.1016/0168-5597(94)90064-7 [DOI] [PubMed] [Google Scholar]
- Lins OG, Picton PE, Picton TW, Champagne SC, Durieux-Smith A (1995) Auditory steady-state responses to tones amplitude-modulated at 80-110 Hz. J Acoust Soc Am 97:3051–3063. 10.1121/1.411869 [DOI] [PubMed] [Google Scholar]
- Lister JJ, Maxfield ND, Pitt GJ, Gonzalez VB (2011) Auditory evoked response to gaps in noise: older adults. Int J Audiol 50:211–225. 10.3109/14992027.2010.526967 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lu T, Liang L, Wang X (2001) Neural representations of temporally asymmetric stimuli in the auditory cortex of awake primates. J Neurophysiol 85:2364–2380. 10.1152/jn.2001.85.6.2364 [DOI] [PubMed] [Google Scholar]
- Makeig S, Bell AJ, Jung TP, Sejnowski TJ (1996) Independent component analysis of electroencephalographic signals. In: Advances in neural information processing systems (Touretzky MM, ed), pp 145–151. Cambridge, MA: Massachusetts Institute of Technology. [Google Scholar]
- Manzoor NF, Licari FG, Klapchar M, Elkin RL, Gao Y, Chen G, Kaltenbach JA (2012) Noise-induced hyperactivity in the inferior colliculus: its relationship with hyperactivity in the dorsal cochlear nucleus. J Neurophysiol 108:976–988. 10.1152/jn.00833.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mayoral EH, López MA, Román-Hernández E, Cortina-Marrero HJ, Portela JR, Oliva VI (2017) Fourier analysis for harmonic signals in electrical power systems. In: Fourier transforms high-tech application and current trends (Nikolic G, Cvetkovic D, Cakic M, eds), pp 43–66. London: IntechOpen. [Google Scholar]
- Millman RE, Mattys SL, Gouws AD, Prendergast G (2017) Magnified neural envelope coding predicts deficits in speech perception in noise. J Neurosci 37:7727–7736. 10.1523/JNEUROSCI.2722-16.2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mishra J, de Villers-Sidani E, Merzenich M, Gazzaley A (2014) Adaptive training diminishes distractibility in aging across species. Neuron 84:1091–1103. 10.1016/j.neuron.2014.10.034 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moore BC, Glasberg BR (1993) Simulation of the effects of loudness recruitment and threshold elevation on the intelligibility of speech in quiet and in a background of speech. J Acoust Soc Am 94:2050–2062. 10.1121/1.407478 [DOI] [PubMed] [Google Scholar]
- Näätänen R, Picton TW (1987) The N1 wave of the human electric and magnetic response to sound: a review and an analysis of the component structure. Psychophysiology 24:375–425. 10.1111/j.1469-8986.1987.tb00311.x [DOI] [PubMed] [Google Scholar]
- Neuert V, Pressnitzer D, Patterson RD, Winter IM (2001) The responses of single units in the inferior colliculus of the guinea pig to damped and ramped sinusoids. Hear Res 159:36–52. 10.1016/s0378-5955(01)00318-5 [DOI] [PubMed] [Google Scholar]
- Ng CW, Recanzone GH (2018) Age-related changes in temporal processing of rapidly-presented sound sequences in the macaque auditory cortex. Cereb Cortex 28:3775–3796. 10.1093/cercor/bhx240 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Oostenveld R, Fries P, Maris E, Schoffelen JM (2011) FieldTrip: open source software for advanced analysis of MEG, EEG, and invasive electrophysiological data. Comput Intell Neurosci 2011:156869. 10.1155/2011/156869 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pantev C, Hoke M, Lehnertz K, Lütkenhöner B, Anogianakis G, Wittkowski W (1988) Tonotopic organization of the human auditory cortex revealed by transient auditory evoked magnetic fields. Electroencephalogr Clin Neurophysiol 69:160–170. 10.1016/0013-4694(88)90211-8 [DOI] [PubMed] [Google Scholar]
- Parmentier FB, Andrés P (2010) The involuntary capture of attention by sound: novelty and postnovelty distraction in young and older adults. Exp Psychol 57:68–76. 10.1027/1618-3169/a000009 [DOI] [PubMed] [Google Scholar]
- Picton TW (2010a) Auditory brainstem responses: peaks along the way. In: Human auditory evoked potentials, pp 213–246. San Diego: Plural. [Google Scholar]
- Picton TW (2010b) Middle-latency responses: the brain and the brawn. In: Human auditory evoked potentials, pp 262–269. San Diego: Plural. [Google Scholar]
- Picton TW (2010c) Late auditory evoked potentials: Changing the things which are. In: Human auditory evoked potentials, pp 335–398. San Diego: Plural. [Google Scholar]
- Picton TW, John MS, Dimitrijevic A, Purcell D (2003) Human auditory steady-state responses. Int J Audiol 42:177–219. 10.3109/14992020309101316 [DOI] [PubMed] [Google Scholar]
- Popelár˘ J, Syka J, Berndt H (1987) Effect of noise on auditory evoked responses in awake guinea pigs. Hear Res 26:239–247. 10.1016/0378-5955(87)90060-8 [DOI] [PubMed] [Google Scholar]
- Presacco A, Simon JZ, Anderson S (2016a) Evidence of degraded representation of speech in noise, in the aging midbrain and cortex. J Neurophysiol 116:2346–2355. 10.1152/jn.00372.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Presacco A, Simon JZ, Anderson S (2016b) Effect of informational content of noise on speech representation in the aging midbrain and cortex. J Neurophysiol 116:2356–2367. 10.1152/jn.00373.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pressnitzer D, Winter IM, Patterson RD (2000) The responses of single units in the ventral cochlear nucleus of the guinea pig to damped and ramped sinusoids. Hear Res 149:155–166. 10.1016/s0378-5955(00)00175-1 [DOI] [PubMed] [Google Scholar]
- Purcell DW, John SM, Schneider BA, Picton TW (2004) Human temporal auditory acuity as assessed by envelope following responses. J Acoust Soc Am 116:3581–3593. 10.1121/1.1798354 [DOI] [PubMed] [Google Scholar]
- Repp BH, Lin HB (1989) Acoustic properties and perception of stop consonant release transients. J Acoust Soc Am 85:379–396. 10.1121/1.397689 [DOI] [PubMed] [Google Scholar]
- Rosen S (1992) Temporal information in speech: acoustic, auditory and linguistic aspects. Philos Trans R Soc Lond B Biol Sci 336:367–373. 10.1098/rstb.1992.0070 [DOI] [PubMed] [Google Scholar]
- Rosenthal R, Rubin DB (2003) R equivalent: A simple effect size indicator. Psychol Methods 8:492–496. [DOI] [PubMed] [Google Scholar]
- Salvi R, Sun W, Ding D, Chen G, Di Lobarinas E, Wang J, Radziwon K, Auerbach BD (2017) Inner hair cell loss disrupts hearing and cochlear function leading to sensory deprivation and enhanced central auditory gain. Front Neurosci 10:1–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schlittenlacher J, Moore BC (2016) Discrimination of amplitude-modulation depth by subjects with normal and impaired hearing. J Acoust Soc Am 140:3487–3495. 10.1121/1.4966117 [DOI] [PubMed] [Google Scholar]
- Shannon RV, Zeng FG, Kamath V, Wygonski J, Ekelid M (1995) Speech recognition with primarily temporal cues. Science 270:303–304. 10.1126/science.270.5234.303 [DOI] [PubMed] [Google Scholar]
- Sherman MA, Lee S, Law R, Haegens S, Thorn CA, Hämäläinen MS, Moore CI, Jones SR (2016) Neural mechanisms of transient neocortical beta rhythms: converging evidence from humans, computational modeling, monkeys, and mice. Proc Natl Acad Sci USA 113:E4885–E4894. 10.1073/pnas.1604135113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Snyder JS, Alain C (2005) Age-related changes in neural activity associated with concurrent vowel segregation. Brain Res Cogn Brain Res 24:492–499. 10.1016/j.cogbrainres.2005.03.002 [DOI] [PubMed] [Google Scholar]
- Sörös P, Teismann IK, Manemann E, Lütkenhöner B (2009) Auditory temporal processing in healthy aging: a magnetoencephalographic study. BMC Neurosci 10:34. 10.1186/1471-2202-10-34 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Syka J, Rybalko N, Popelár J (1994) Enhancement of the auditory cortex evoked responses in awake guinea pigs after noise exposure. Hear Res 78:158–168. 10.1016/0378-5955(94)90021-3 [DOI] [PubMed] [Google Scholar]
- Tremblay KL, Piskosz M, Souza P (2003) Effects of age and age-related hearing loss on the neural representation of speech cues. Clin Neurophysiol 114:1332–1343. 10.1016/S1388-2457(03)00114-7 [DOI] [PubMed] [Google Scholar]
- van der Horst R, Leeuw AR, Dreschler WA (1999) Importance of temporal-envelope cues in consonant recognition. J Acoust Soc Am 105:1801–1809. 10.1121/1.426718 [DOI] [PubMed] [Google Scholar]
- Vasilkov V, Verhulst S (2019) Towards a differential diagnosis of cochlear synaptopathy and outer-hair-cell deficits in mixed sensorineural hearing loss pathologies. medRxiv 19008680. [Google Scholar]
- Wayne RV, Johnsrude IS (2015) A review of causal mechanisms underlying the link between age-related hearing loss and cognitive decline. Ageing Res Rev 23:154–166. 10.1016/j.arr.2015.06.002 [DOI] [PubMed] [Google Scholar]
- Yoshiura T, Ueno S, Iramina K, Masuda K (1995) Source localization of middle latency auditory evoked magnetic fields. Brain Res 703:139–144. 10.1016/0006-8993(95)01075-0 [DOI] [PubMed] [Google Scholar]
- Zhu L, Bharadwaj H, Xia J, Shinn-Cunningham B (2013) A comparison of spectral magnitude and phase-locking value analyses of the frequency-following response to complex tones. J Acoust Soc Am 134:384–395. 10.1121/1.4807498 [DOI] [PMC free article] [PubMed] [Google Scholar]