Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 Aug 29.
Published in final edited form as: Psychophysiology. 2022 Nov 13;60(4):e14217. doi: 10.1111/psyp.14217

Intensity and inter-stimulus-interval effects on human middle- and long-latency auditory evoked potentials in an unpredictable auditory context

Fran López-Caballero 1, Brian Coffman 1, Dylan Seebold 1, Tobias Teichert 2,3, Dean F Salisbury 1
PMCID: PMC10463565  NIHMSID: NIHMS1923217  PMID: 36371684

Abstract

It is not known how Auditory-Evoked Responses (AERs) comprising Middle Latency Responses (MLRs) and Long Latency Responses (LLRs) are modulated by stimulus intensity and inter-stimulus interval (ISI) in an unpredictable auditory context. Further, intensity and ISI effects on MLR and LLR have never been assessed simultaneously in the same humans. To address this important question, thirty participants passively listened to a random sequence of auditory clicks of three possible intensities (65, 75, and 85 dB) at five possible ISI ranges (0.25 to 0.5 s, 0.5 to 1 s, 1 to 2 s, 2 to 4 s, 4 to 8 s) over four to seven one-h our sessions while EEG was recorded. P0, Na, Pa, Nb, and Pb MLR peaks and N1 and P2 LLR peaks were measured. MLRs P0 (p=.005), Pa (p=.021), and Pb (p=<.001) were modulated by intensity, while only MLR Pb (p=<.001) was modulated by ISI. LLR N1 and P2 were modulated by both intensity and ISI (all p values < .001). Intensity and ISI interacted at Pb, N1, and P2 (all p values < .001), with greater intensity effects at longer ISIs and greater ISI effects at louder intensities. Together, these results provide a comprehensive picture of intensity and ISI effects on AER across the entire thalamocortical auditory pathway, while controlling for stimulus predictability. Moreover, they highlight P0 as the earliest MLR response sensitive to stimulus intensity and Pb (~50 ms) as the earliest cortical response coding for ISIs above 250 ms and showing an interdependence between intensity and ISI effects.

Keywords: ERPs, ISI, middle-latency response, neural adaptation, random auditory context, stimulus intensity

1 |. INTRODUCTION

Human Auditory evoked responses (AERs) comprise a series of waves recorded non-invasively with Electroencephalography(EEG)orMagnetoencephalography (MEG) that reflect neural responses to auditory stimuli. Within AERs, auditory brainstem responses (ABRs) or brainstem auditory evoked responses (BAERs) arise from brainstem, collicular, and thalamic subcortical auditory processing centers and are measured within the first 10 ms post-sound (Jewett, 1970). Mid-latency responses (MLRs) include five waves (P0, Na, Pa, Nb, and Pb) that range in latency from ~12 to 70 ms after sound onset (Picton et al., 1974; Yvert et al., 2001). MLRs arise in the medial geniculate nucleus of the thalamus (MGN: Deiber et al., 1988; Kraus et al., 1982; McGee et al., 1992) and areas of the auditory cortex, including Heschl’s gyrus (HG: Godey et al., 2001), Planum Temporale (PT), and the lateral aspect of the Superior Temporal Gyrus (STG: Kileny et al., 1987; Yvert et al., 2005). Long-latency responses (LLRs) include N1(/N100) and P2(/P200) AERs, ranging from ~80 to 250 ms from sound onset. LLRs are generated in HG, PT, and lateral STG (Godey et al., 2001; Yvert et al., 2005) and likely also include contributions from auditory-related executive areas in parietal and frontal cortices (Alcaini et al., 1994; Anderer et al., 1998), as well as cingulate, insular, and other extra-temporal auditory-associated areas (Giard et al., 1994).

The amplitude of both MLRs and LLRs depends on the physical properties of the sound such as intensity, but also on contextual factors, such as the inter-stimulus interval (ISI) between sounds. While the dependence on sound intensity is mechanically inherited from the inner ear, the dependence on temporal context is not, and instead reflects properties of neural processing along the ascending auditory processing hierarchy such as the duration of contextual memory. However, despite decades of research, fundamental questions remain about the basic phenomenology of the effects of intensity and ISI on auditory evoked potentials across the ascending auditory pathway. How early in the auditory processing hierarchy, as reflected in MLR and LLR, do intensity and ISI effects arise? How do intensity and ISI interact? How do intensity and ISI affect MLRs and LLRs without stimulus predictability confounds? Answering these questions would further characterize prominent response properties of AERs while allowing for a better interpretation of their responsiveness without predictive effects potentially generated by highly repetitive, unchanging (and therefore predictable) stimulus contexts.

Although the sensitivity of LLRs to stimulus intensity is well documented (i.e., increased amplitude with greater stimulus intensity), there is less work regarding the psychophysics of MLRs and extant results are inconsistent. For example, using auditory clicks as stimuli, a MEG study reported intensity effects at the Na (Borgmann et al., 2001), while another using scalp EEG did not (Tucker et al., 2001). The opposite results were found in these two studies for the Pa wave, while both reported intensity effects at Pb. Results from the older literature converge in finding modulation by stimulus intensity in later (Pa, Nb, Pb) versus earlier (P0, Na) MLR peaks (Kupperman & Mendel, 1974; Thornton et al., 1977). It is unclear then, whether intensity information is coded from initial auditory processing in AER amplitude, and precisely how MLRs react to intensity changes. In contrast with MLRs, LLR intensity effects have been consistently shown for N1 and P2 in both EEG sensor (Harris et al., 2007; Juckel et al., 2003; Keidel & Spreng, 1965) and source-resolved activity in the auditory cortex measured with EEG (Hagenmuller et al., 2016; Schadow et al., 2007). Equivalent results in N1 and P2 were obtained with MEG (Wyss et al., 2014), and N1-P 2 amplitudes at different intensities have also been used to determine pure-tone audiometry thresholds (Lightfoot & Kennedy, 2006; Mahdavi & Peyvandi, 2007).

Regarding ISI effects in AERs (i.e. larger responses to sounds preceded by longer periods of silence), in the MLR, consistent evidence still is scarce, but modulations of amplitude by ISI have been reported for Pa (Tucker et al., 2002) and Pb (Erwin & Buchwald, 1986; Tucker et al., 2002), as well as in their magnetic counterparts (Onitsuka et al., 2003). ISI effects on LLR amplitudes were first reported with EEG by Davis et al. (1966), and have since been consistently found in both N1 and P2 (Herrmann et al., 2014, 2016; Pereira et al., 2014; Sussman et al., 2008), in the trough-to-peak N1-P2 complex (Muller-Gass et al., 2008; Shucard & Callaway, 1974), as well as in MEG N1m (Okamoto & Kakigi, 2014; Sams et al., 1993; Zygierewicz et al., 2012) and P2m (Rosburg et al., 2010). Evidence points to maximal N1 and P2 amplitudes with ISIs at ~10 s (Nelson & Lassman, 1968; Ritter et al., 1968).

Despite the extensive literature, it remains unclear which evoked potential is the first to be reliably modulated by both intensity and ISI, with inconsistent and sometimes contradictory results for the MLRs. How these two factors interact in their modulation of AERs is also uncertain, but intensity and ISI effects were rarely studied together in the same subjects (with a few exceptions, Roth et al., 1976; Zhang et al., 2009), thus leaving them non-comparable. Moreover, regularity of the auditory stimulation sequence is a confounding element in the aforementioned studies of intensity and ISI. When auditory stimuli are predictable, AERs amplitudes are smaller (Coffman et al., 2017; Kaufmann et al., 1982; Lange, 2009; Schwartze et al., 2013). In human event-related designs, it is common to use predictable repetitive stimulation paradigms (i.e., one intensity or one ISI in a block) due to the large number of trials needed to obtain MLR or LLRs.

In the current study, we examined intensity and ISI effects in the entire middle-and long-latency hierarchy of the sensory auditory information processing stream by measuring these phenomena simultaneously in the same participants and within the same trials, which has not been previously done. Furthermore, the stimulation paradigm employed here used entirely unpredictable intensities and ISIs, resulting in 15 possible combinations for each stimulus (3 intensities by 5 exponential ISI groupings with ISIs sampled randomly within each range). This allowed us to quantify intensity and ISI effects and their interaction as independent effects, and prevent any potential confound of stimulus regularity.

2 |. METHOD

2.1 |. Participants

Thirty healthy participants (19 males) were included in the study, ranging in age from 19 to 38 years (mean 24,03; SD 5.27). The target sample size was determined based on previous authors work with MLR amplitude modulations (Lopez-Caballero et al., 2016) and on studies with similar experimental designs (Herrmann et al., 2016; Pereira et al., 2014). Exclusion criteria were history of concussion or head injury with sequelae, psychiatric or neurological disorder, the history of alcohol or drug addiction in the last 5 years, or the use of medication affecting brain function or structure. Participants completed pure-tone audiometry (1000–4000 Hz) before the experiment started, ensuring mean hearing thresholds below 30 dB nHL at each ear and less than 15 dB threshold differences between ears. Mean hearing thresholds averaged across frequencies were of 7.8 and 7.7 dB on the left and right ear, respectively (see Figure S1 for extended hearing thresholds results). All procedures were approved by the University of Pittsburgh IRB and in accordance with the WMA Declaration of Helsinki Ethical Principles for Medical Research Involving Human Subjects. All participants provided written informed consent and were paid for participation.

2.2 |. Stimuli and procedure

The stimulus was a 1 ms biphasic click, created using MATLAB (The Mathworks, Inc., Natick, MA) delivered through ER-3A insert earphones (Etymotic Research, Inc., Elk Grove Village, IL, USA) using the Presentation® software (Version 9.90).

The experiment was divided into four to seven identical one-hour sessions. Each session consisted of three blocks of 20 min each, with short (~2 min) breaks between blocks. During each session, participants sat in the MEG/EEG scanner in a magnetically shielded room while passively listening to a series of auditory clicks. They were instructed to watch a silent movie of their choice and ignore the sounds. Clicks were one of three intensities (65, 75, or 85 dB SPL). ISI on a given trial was randomly selected from a uniform distribution within one of five equally probable time ranges (0.25 to 0.5 s, 0.5 to 1 s, 1 to 2 s, 2 to 4 s or 4 to 8 s). Intensity and ISI were manipulated in a fully crossed 3-by-5 design, resulting in 15 intensity-ISI combinations (Figure 1a). The intended length of the study was of six sessions to retrieve, across all sessions, 600 trials of each intensity-ISI combination, ensuring enough remaining sweeps for visualization of MLRs after EEG cleaning procedures (which we set as 400 sweeps). However, eight participants completed an additional seventh session due to technical problems with the MEG acquisition system shortening previous sessions. Two other participants could only complete four and five sessions, respectively, due to MEG technical problems and the impossibility to record further sessions with them. Among them, only the one with five sessions was used for MLR analyses (sufficient surviving sweeps after EEG cleaning procedures), while both were used for LLR. The remaining twenty participants completed six sessions successfully, despite for three of them not enough sweeps survived to include them in MLR analyses, and no further sessions could be recorded. On average, 31.56 days (SD=49.24) passed between sessions and 156.96 days (SD=126.93) passed between the first and last session.

FIGURE 1.

FIGURE 1

(a) Example of the auditory sequence. Auditory clicks could be of three possible intensities (65, 75 and 85 dB), and the ISIs between them could fall within five possible time ranges (0.25 to 0.5 s, 0.5 to 1 s, 1 to 2 s, 2 to 4 s or 4 to 8 s). The intensity and ISI on each trial was random. (b) Example illustration of the subtraction procedure. Elongated averages (templates, green to red) for each click category (intensity and ISI) were computed to subtract from individual epochs (blue) in their overlapping time-window (red). In the example, the second click comes after 0.3 s from the first one.

2.3 |. EEG acquisition

Combined EEG and MEG acquisition was performed in a magnetically shielded room (Imedco AG, Hägendorf, Switzerland) using a low-impedance EEG cap (BrainCap MEG, Brain Vision, LLC, Morrisville, NC, USA) with 60 scalp electrode locations based on the 10–10 system and a whole-head MEG system (Elekta Neuromag, Helsinki, Finland). The acquisition system (Elekta Neuromag) was the same for EEG and MEG. Only scalp EEG activity is reported here. MEG results will be discussed in a separate report. Data were recorded with a sampling rate of 3000 Hz and an online bandpass filter of 0.1 to 1000 Hz. Additional EEG electrodes were placed in the left mastoid (online EEG reference), right mastoid, back of the neck (ground), below the left clavicle (ECG), and below the left eye (VEOG). Two bipolar electrodes were also placed on the outer canthi of both eyes (HEOG).

2.4 |. EEG analysis

EEG data were first down-sampled to 1500 Hz digitization rate and preprocessed using the EEGLAB toolbox (Delorme & Makeig, 2004) in MATLAB (The Mathworks, Inc., Natick, MA). The data were processed differently for MLRs and LLRs.

2.4.1 |. MLRs

For MLRs, a high-pass filter (10 Hz; 12 dB/oct) was applied (Kavanagh & Domico, 1987), and channels and data segments with excessive noise were removed by visual inspection. Next, Adaptive Mixture Independent Component Analysis (AMICA) was performed to remove artifacts associated with eye blinks/movements and heartbeat. Channels removed prior to AMICA were interpolated afterwards. After preprocessing, EEG data were analyzed using the Brainstorm software (Tadel et al., 2011). First, data were re-referenced to averaged mastoids. A low-pass filter (100 Hz; 24 dB/oct, see Figure S2a,b for comparison of 200 Hz and 100 Hz low-pass filtered data) was then applied, and epochs of −50 to 100 ms (baseline corrected) were extracted around click stimuli. Next, epochs with amplitudes larger than ±50 μV at any EEG channel were discarded. Finally, epochs from all sessions were averaged within each participant, separately for each of the 15 intensity-ISI conditions. Four participants with less than 400 surviving sweeps on average in these conditions were removed from further MLR analyses. Total number of sweeps per condition and participant, as well as signal-to-noise ratio values, can be found in Figures S3 and S4.

2.4.2 |. LLR

For LLR, a high-pass filter (0.5 Hz; 12 dB/oct) was applied (Rentzsch et al., 2008), and channels and data segments with excessive noise were removed by visual inspection. AMICA was then performed to remove eye movements and heartbeats from the signal, and channels removed prior to AMICA were interpolated afterwards. After preprocessing, AMICA-corrected data were analyzed using the Brainstorm software (Tadel et al., 2011). First, data were re-referenced to averaged mastoids and down-sampled to 500 Hz. Next, a low-pass filter (20 Hz; 24 dB/oct) was applied (Whitham et al., 2007), and epochs of −150 to 400 ms (baseline corrected) were extracted around click stimuli. Epochs with amplitudes larger than ±50 μV at any EEG channel were discarded. Finally, epochs from all sessions were averaged within each participant, separately for each of the 15 intensity-ISI conditions.

2.4.3 |. Subtraction of prior AERs in shortest ISI condition

Epochs preceded by the shortest ISI (0.25 to 0.5 s) for LLR analyses were distorted due to the continued response to the preceding click. This response overlap was corrected using a similar procedure as employed by Budd and Michie (1994) and Briley and Krumbholz (2013) (Figure 1b). Briefly, elongated epochs of 0.6 s post-stimulus were extracted and averaged for each of the 15 intensity-ISI conditions, following the same procedures as with original epochs and separately for each experimental session. We constructed a “template” of the response for each trial type, which was then subtracted from individual epochs based on the overlapping time-window and the preceding trial type. To ensure a smooth transition of the response to baseline in the template averages, we applied a half Hanning window in the last 0.2 s and assumed all activity past 0.6 s after click stimulus was equal to zero. For consistency, we performed this same procedure in epochs from all conditions, and in both MLR and LLR, despite only epochs from trials preceded by the shortest ISI in LLR appearing contaminated by the previous response.

2.5 |. Statistical analysis

Two-way repeated measures ANOVAs were performed separately for each MLR and LLR peak, measured at the FCz electrode (see EEG scalp topographies in Figure S5). The two factors were intensity (3 levels) and ISI (five levels), and the time windows were defined based on the grand-average waveforms (P0: 5 to 11 ms; Na: 15 to 21 ms; Pa: 27 to 33 ms; Nb: 37 to 43 ms; Pb: 51 to 61 ms; N1: 78 to 106 ms; P2: 150 to 182 ms). For each of the comparisons, degrees of freedom were corrected using the Huynh-Feldt epsilon. All analyses were carried out using IBM SPSS Statistics (Version 26).

3 |. RESULTS

3.1 |. Middle latency response

P0 (8ms), Na (18 ms), Pa (30 ms), Nb (40 ms) and Pb (56 ms) peaks were identified at typical MLR latencies (Borgmann et al., 2001; Picton et al., 1974; Yvert et al., 2001). Figure 2a displays MLR waveforms averaged across ISI (left) and click intensities (right), while Figure 2b shows MLRs for each intensity-ISI combination. Results of the two-way rmANOVA are summarized in Table 1.

FIGURE 2.

FIGURE 2

Grand-average MLR waveforms (n=26) at FCz, with the standard error of the mean. Gray areas show the time windows of peaks where conditions significantly differed in rmANOVA (p<.05). (a) MLRs averaged across ISI (left), or across intensities (right). (b) MLRs of each intensity-ISI combination.

TABLE 1.

Results from two-way repeated measures ANOVA on each MLR and LLR peak

dB
ISI
dB × ISI
F (2,50/58) p ε ηp2 F (4,100/116) p ε ηp2 F (8,200/232) p ε ηp2
P0 5.78 .005 1 .188 .40 .783 .89 .016 .67 .714 .82 .026
Na .42 .651 .98 .017 .69 .596 .65 .027 .74 .655 .88 .029
Pa 4.64 .021 .81 .157 1.60 .184 .93 .600 .77 .608 .86 .030
Nb .37 .683 .96 .015 4.19 .006 .84 .144 1.26 .267 .88 .048
Pb 41.45 <.001 .74 .624 33.64 <.001 .71 .574 4.05 <.001 .84 .139
N1 85.02 <.001 .62 .746 32.03 <.001 .85 .525 6.39 .001 .95 .181
P2 111.74 <.001 .67 .794 125.47 <.001 .66 .812 7.85 <.001 .95 .213

Note: Intensity (3 levels: 65, 75 and 85 dB) and ISI (five levels: 0.25 to 0.5 s, 0.5 to 1 s, 1 to 2 s, 2 to 4 s or 4 to 8 s) factors were used. Dependent variable was mean response amplitude over a time window around each peak. For each factor and their interaction, F and p values are presented along with epsilon sphericity index and effect sizes (ηp2). Degrees of freedom are displayed for MLR and LLR, respectively. p values below .05 are highlighted.

At P0 (8ms), amplitudes increased as click intensities became louder (p=.005), but not as ISI became longer (p=.783). At Na (18 ms), amplitudes were not affected by either click intensity (p=.651) or ISI (p=.596). At Pa (30 ms), amplitudes increased with louder clicks (p=.021) but not with longer ISI (p=.184). At Nb (40 ms), amplitudes were not modulated by click intensity (p=.683), but they were by ISI (p=.006). However, Nb amplitudes did not increase with longer ISIs. Instead, pairwise comparisons based on estimated marginal means revealed a significant Nb amplitude increase (more negative) in the 0.25–0.5 s ISI versus the 1–2 s ISI condition.

At Pb (56 ms), amplitudes progressively increased as clicks were louder (p<.001) and ISIs were longer (p<.001). The effects of intensity and ISI interacted (p<.001); the effect of intensity was enhanced for longer ISIs (Figure 4e). To parse the effects of the interaction, rmANOVAs were conducted to assess modulations by intensity within each ISI level, revealing louder clicks produced significantly larger Pb responses regardless of which ISI preceded the stimulus: 0.25 to 0.5 s F(2,50)=14.16;p<.001;ε=.92;ηp2=.362, 0.5 to 1 s F(2,50)=5.35;p=.013;ε=.81;ηp2=.176, 1 to 2 s F(2,50)=23.16;p<0.001;ε=.99;ηp2=.481, 2 to 4 s F(2,50)=29.66;p<0.001;ε=.91;ηp2=.543 or 4 to 8 s F(2,50)=26.65;p<.001;ε=.82;ηp2=.516. Similarly, rmANOVAs were conducted to assess modulations by ISI within each intensity level, which revealed that shorter ISIs resulted in smaller Pb responses for all click intensities: 65 dB F(4,100)=6.78;p<0.001;ε=.85;ηp2=.213, 75 dB F(4,100)=10.33;p<0.001;ε=1;ηp2=.292 or 85 dB F(4,100)=26.76;p<0.001;ε=.90;ηp2=.517. Pairwise comparisons (Bonferroni corrected) between each individual intensity and ISI level were then performed. Significant differences are highlighted in Figure 4e. Overall, the intensity of the click modulated Pb to a greater extent than ISI and the most prominent shift in response amplitude occurred when changing from the 0.5–1 s to the 1–2 s ISI.

FIGURE 4.

FIGURE 4

Peak amplitudes as a function of ISI, separated by click intensities, for MLR P0, Na, Pa, Nb and Pb (a–e) and LLR N1 and P2 (f and g). Significant pair-wise comparisons performed at Pb, N1 and P2 (p<.05, Bonferroni corrected) are highlighted with (*) when between intensities and with (▲) when between ISIs.

3.2 |. Long latency response

All click intensity and ISI combinations elicited measurable LLRs, with identifiable N1 (92ms) and P2 (166ms) peaks (Figure 3) at typical latencies (Crowley & Colrain, 2004; Näätänen & Picton, 1987). LLR waveforms averaged across ISIs (left) and click intensities (right) are shown in Figure 3a. LLRs for each intensity-ISI combination separately are displayed in Figure 3b. LLR results from the two-way rmANOVA are summarized in Table 1.

FIGURE 3.

FIGURE 3

Grand-average LLR waveforms (n=30) at FCz, with the standard error of the mean. Gray areas show the time windows of peaks where conditions significantly differed in rmANOVA (p<.05). (a) LLRs averaged across ISIs (left), or across intensities (right). (b) LLRs of each intensity-ISI combination.

At N1 (92ms), more intense clicks (p<.001) and longer ISIs (p<.001) elicited larger responses. The enhancement of N1 amplitude with more intense stimuli was more pronounced for long versus short ISI, and response reduction with shorter ISI was also more pronounced for stronger versus weaker stimuli (intensity × ISI interaction: p<.001; Figure 4f). To parse the effects of the interaction, rmANOVAs were conducted to assess modulations by intensity within each ISI level, revealing louder clicks increased N1 amplitude for all ISIs: 0.25 to 0.5 s F(2,58)=36.28;p<.001;ε=.95;ηp2=.556, 0.5 to 1 s F(2,58)=46.05;p=<.001;ε=.72;ηp2=.614, 1 to 2 s F(2,58)=68.27;p<0.001;ε=.98;ηp2=.702, 2 to 4 s F(2,58)=40.97;p<0.001;ε=.77;ηp2=.586 or 4 to 8 s F(2,58)=80.89;p<.001;ε=.78;ηp2=.736. Similarly, rmANOVAs were conducted to assess modulations by ISI within each intensity level, revealing shorter ISIs reduced N1 amplitude for each of the stimulus intensities: 65 dB F(4,116)=6.104;p=<0.001;ε=1;ηp2=.174, 75 dB F(4,116)=17.96;p<0.001;ε=.83;ηp2=.383 or 85 dB F(4,116)=26.91;p<0.001;ε=.87;ηp2=.481). Pairwise comparisons (Bonferroni corrected) between each individual intensity and ISI level were then performed. Significant differences are highlighted in Figure 4f. N1 responses were significantly larger at 0.25–5 s ISI compared to 0.5–1 s ISI, in both 65 and 75 dB click intensity conditions, contrary to the general reduction of response amplitudes with shorter ISIs.

At P2 (166ms), louder clicks (p<.001) and longer ISIs (p<.001) elicited larger responses. Intensity and ISI factors interacted (p<.001), such that the enhancement of P2 with louder clicks was more pronounced for long versus short ISI, and P2 response reduction with shorter ISI was more pronounced for louder versus quieter stimuli (Figure 4g). To parse the effects of the interaction, rmANOVAs were conducted to assess modulations by intensity within each ISI level, revealing louder clicks increased P2 amplitude for all ISIs: 0.25 to 0.5 s F(2,58)=54.61;p<.001;ε=1;ηp2=.653, 0.5 to 1 s F(2,58)=38.89;p<.001;ε=.79;ηp2=.573, 1 to 2 s F(2,58)=69.26;p<0.001;ε=.72;ηp2=.705, 2 to 4 s F(2,58)=133.98;p<0.001;ε=.83;ηp2=.822 or 4 to 8 s F(2,58)=72.71;p<.001;ε=.73;ηp2=.715. Similarly, rmANOVAs were conducted to assess modulations by ISI within each intensity level, which revealed shorter ISIs reduced P2 amplitude for each stimulus intensity: 65 dB F(4,116)=39.29;p<0.001;ε=.94;ηp2=.575, 75 dB F(4,116)=95.61;p<0.001;ε=.84;ηp2=.767 or 85 dB F(4,116)=102.66;p<0.001;ε=.71;ηp2=.780). Pairwise comparisons (Bonferroni corrected) between each individual intensity and ISI level were then performed. Significant differences are highlighted in Figure 4g. P2 amplitudes increased between all ISI levels at all click intensities, except between 2–4 s and 4–8 s, suggesting they stabilized at 2–4 s ISI (Figure 4g).

4 |. DISCUSSION

4.1 |. Summary of results

The present study examined the behavior of auditory middle- and long-latency evoked responses to changes in stimulus intensity and ISI while using an auditory stimulation sequence that did not contain any temporal or stimulus intensity predictability. Increased evoked response amplitudes with more intense stimuli were observed as early as the first MLR peak recorded (P0, 8 ms). Pb (~50 ms) was the earliest cortical response coding for ISIs above 250 ms and showing an interdependence between intensity and ISI effects. Greater amplitude enhancement with more intense clicks and longer ISIs, as well as an interaction between intensity and ISI factors, were observed at both N1 (92ms) and P2 (166ms) peaks. N1 increases in amplitude at the shortest ISI (0.25 to 0.5 s) and asymptotic P2 amplitudes after 2–4 s ISI were also observed.

4.2 |. Middle-latency response

Regarding effects of stimulus intensity at P0, our findings contrast with the general lack of findings for this EP in the literature. Most studies instead find intensity effects in MLR beginning at Na (Borgmann et al., 2001; Kupperman & Mendel, 1974; Thornton et al., 1977; Tucker et al., 2001). Only Madell and Goldstein (1972) reported a peak-to-peak amplitude increase between P0 and Na with more intense sounds. This could be due to earlier peaks being typically smaller, and thus varying over a more limited range of amplitudes (Zerlin & Naunton, 1974), as we found effect sizes of stimulus intensity more than three times larger at Pb than at Pa or P0. It could also be due to its subcortical origin (Yoshiura et al., 1996), requiring a larger amount of stimulus repetitions to reach similar signal-to-noise ratios than later MLR peaks. To that regard, evoked responses at earlier latencies than those of MLR, at the ABR, are shown to be sensitive to stimulus intensity (Dzulkarnain et al., 2020; Stapells & Oates, 1997; van den Honert & Stypulkowski, 1986). At Pa, we replicated the intensity effects observed with similar click intensities (Tucker et al., 2001) and other types of stimuli (Kupperman & Mendel, 1974). However, our results for Pa contrast with reports suggesting saturation of amplitude effects at 60 dB (Borgmann et al., 2001; Madell & Goldstein, 1972; Özdamar & Kraus, 1983). At Pb, we replicated findings of increased amplitudes with louder sounds from previous studies using clicks (Borgmann et al., 2001; Tucker et al., 2001) and other stimuli (Thornton et al., 1977). Despite stimulus intensity modulating the positive MLR peaks at P0, Pa and Pb, the peak negative amplitudes at Na and Nb were apparently unaffected by this stimulus feature. We speculate these results may be due to an extended underlying processing of stimulus intensity at P0 and Pa peaks, of positive polarity, overlapping, respectively, Na and Nb negative peaks.

Regarding effects of ISI at MLR, our results are consistent with previous literature showing ISI affecting Pb but not earlier peaks (Erwin & Buchwald, 1986; Goldstein et al., 1972; Mcfarland et al., 1975). Two other studies (Onitsuka et al., 2003; Tucker et al., 2002) found ISI modulations at Pa. In Onitsuka et al. (2003), ISIs used were between 0.4 s to 2 s, which are included among the ranges of ISIs investigated here. However, the different stimuli used (tone burst vs clicks), the signal recorded (MEG vs EEG), and the nature of the sound sequence (repetitive vs random) may account for the discrepancies. In Tucker et al. (2002), using clicks and measuring EEG, the differences found were between ~0.1 s and ~ 1 s ISI at 85 dB. Since the authors used ISIs below our shortest (0.25 s), our lack of ISI effects at Pa may indicate neural recovery times below 0.25 s at Pa′s neural generators (discussed in detail below).

4.3 |. Long-latency response

Our N1 and P2 amplitudes showed a clear dependence on stimulus intensity, with larger amplitudes with more intense stimuli. This is consistent with the previous EEG literature observing this phenomenon when using auditory clicks (Keidel & Spreng, 1965), as we did, but also with studies using pure tones (Harris et al., 2007).

Regarding ISI effects, both N1 and P2 amplitudes measured here increased with longer ISIs, as in previous studies (Davis et al., 1966; Herrmann et al., 2016). Previous literature reports maximal N1 and P2 amplitudes at ~6–10 s ISI (Nelson & Lassman, 1968; Ritter et al., 1968). Such results are in line with our findings at N1, with maximal amplitudes at the longest ISI condition (4–8 s). However, our P2 amplitudes did not grow past the 2–4 ISI. More recent studies manipulating ISI and measuring the P2 have not used ISIs beyond 6 s (e.g. Pereira et al., 2014; Rosburg et al., 2010; Sussman et al., 2008), which leaves the question open as to whether the ~6–10 s limit was replicated. Pereira et al. (2014) found different amplitudes at P2 between 3 and 6 s ISI, but Herrmann et al. (2016) found a similar stabilization of P2 after 2 s as the one we observed. Future studies on P2 response characteristics would benefit from revisiting this topic. N1 and P2 displaying different patterns of increase with longer ISI may reflect local circuit organizational differences between their different neural generators (Crowley & Colrain, 2004; Herrmann et al., 2016; Verkindt et al., 1994), but this would need further research to determine.

The N1 amplitude increase we observed at our shortest ISI (0.25–0.5 s) replicated those of previous studies using ISIs below 0.3 s (Budd & Michie, 1994; Heinemann et al., 2011; Loveless et al., 1989; McEvoy et al., 1997; Teder et al., 1993; Todd et al., 2000), even for other sensory modalities (Wang et al., 2008). This phenomenon, referred as N1 facilitation, has been hypothesized as caused by several factors: (1) an accelerated decay of inhibitory over excitatory postsynaptic potentials at high (<200 ms) stimulation rates (Loveless et al., 1989); (2) avoiding a latent inhibitory feedback from N1 generators if stimulus is presented before inhibition takes place (Sable et al., 2004); (3) a separate neural component with a time course overlapping N1 and P2 that is modulated exclusively by very short ISI (Wang et al., 2008). Some evidence supports this overlapping component may be Mismatch Negativity (Volosin et al., 2017, 2021). Our results best fit the third hypothesis, as P2 in addition to N1 followed a different pattern of results under our shortest ISI, reduced more prominently than for other ISIs.

4.4 |. Intensity effects precede ISI effects

In our study, the effects of Intensity emerged earlier than the effects of ISI. Specifically, while P0 (8ms) already showed increased peak amplitudes as a function of stimulus intensity, not until Pb (56 ms) did peak amplitudes increase as a function of ISI. Stimulus intensity is represented in the ear drum and affects the earliest neural responses in the auditory hierarchy. Studies with single unit recordings from cats (Rhode & Smith, 1986) and birds (MacLeod & Carr, 2007) point towards the cochlear nucleus as the first neural relay to process stimulus intensity. Thus it is reasonable to expect AERs of early latencies to be modulated by this feature, as seen at P0 in our study and in human ABRs (Dzulkarnain et al., 2020; Stapells & Oates, 1997; van den Honert & Stypulkowski, 1986). It is less clear how and at which stage within the auditory hierarchy ISI is encoded. ABR studies in humans demonstrate ISI modulates AER as early as 5 ms after sound onset (Burkard & Sims, 2001; Dzulkarnain et al., 2013, 2020; Scott & Harkins, 1978), which contrasts with our lack of effects until ~50 ms. However, such ABR studies used ISIs between 10 and 100 ms, far below our minimum ISI of 250 ms. A possible interpretation for our lack of ISI effects in the early (non-cortically generated) MLR waves could be neural adaptation. It is well documented that neurons from subcortical structures recover faster from adaptation than neurons from the auditory cortex (Fitzpatrick et al., 1999; Joris et al., 2004). For instance, single unit recordings from animal studies report cochlear nucleus neurons recovery times between 10 and 200 ms (Boettcher et al., 1990), while auditory cortical neurons have a much wider range of recovery times spanning from milliseconds to tens and possibly hundreds of seconds (Tsodyks & Markram, 1997; Ulanovsky et al., 2004). Since our shortest ISI was of 250 ms, it could be that neural generators of early MLR peaks had already recovered and thus did not show sensitivity to our ISI manipulations. In other words, this would suggest that neural generators of Pb are the first stage of the auditory hierarchy that has not yet recovered for ISIs above 250 ms and are thus capable of coding for longer ISIs.

Alternatively, our lack of ISI effects in the early MLR waves could be attributed to circuit complexity rather than single neuron response properties. As the cortex represents more complicated circuity responding to each sound, it is likely that the increased complexity of the circuit allows for prolonged effects (sensory memory) from prior stimuli. That sensory memory would thus be shorter at subcortical structures lacking such complexity, only allowing for the codification of very short ISIs (below 250 ms).

4.5 |. Interactions between effects of stimulus intensity and ISI

We observed a positive interaction between intensity and ISI for MLR Pb and LLRs N1 and P2, which was reported in a few previous studies (Roth et al., 1976; Zhang et al., 2009). Such positive interaction in a context free of confounds from stimulus predictability can be explained by a feedforward model of short-term presynaptic depression (see Teichert et al., 2016). In the model, a sound of a particular intensity will release a certain fraction of the readily releasable pool of vesicles, with more intense sounds releasing a larger fraction. In the model, the predicted AEP amplitude is proportional to the total number of vesicles released. For short ISIs, the readily releasable pool of vesicles is smaller, thus leading to a smaller overall effect size of intensity for short than for long ISIs.

4.6 |. Modulation by ISI: Habituation versus adaptation views

AER modulation by ISI can be interpreted with either the habituation or the neural adaptation hypotheses, which provide alternative but potentially complementary explanations (for a review, see Rosburg & Mager, 2020; Ruusuvirta, 2020). Habituation could be described as an active psychologically relevant process in living organisms that tends to reduce responses to innocuous predictable stimuli. Under this view, the reduced amplitude of evoked responses with shorter ISIs would be the result of a novelty-oriented loss (Barry et al., 1992; Budd et al., 1998; Thompson & Spencer, 1966). The habituation hypothesis would associate such response decrement with the building of a neuronal model or template of the stimulus due to its repeated exposure (Sokolov, 1963), an interpretation falling within the predictive coding framework (see Huang & Rao, 2011). According to the neural adaptation hypothesis (O’Shea, 2015), the phenomenon of AERs attenuation by ISI shortening would rather emerge from a passive and more basic neurophysiological process involving the inability of the neural population to respond repeatedly to the incoming stimuli (Ritter et al., 1968). Whether this inability is entirely explained by neuronal ‘fatigue’ or refractoriness at the single neuron level or mediated by inhibitory feedback of interneurons at the circuit level is of debate, we will refer to this concept as neural ‘adaptation’ rather than ‘refractoriness’.

A crucial feature of our stimulation paradigm was that it ensured no prediction of the incoming stimuli was possible and it is therefore free of confounds from a constant (utterly predictable) stimulus presentation rate. Pb, N1, and P2 suppression with shorter ISI can thus be explained here with adaptation alone (Rosburg & Mager, 2020) instead of top-down predictive processes. We also manipulated the intensity of the stimuli and found stronger modulations by ISI with louder stimuli, in contrast with the habituation hypothesis prediction that response reduction with repetitive stimuli is weaker with louder sounds (Rankin et al., 2009; Thompson & Spencer, 1966).

Previous research manipulating the predictability of the sound sequence addressed habituation versus adaptation hypotheses. Pereira et al. (2014) compared N1 and P2 attenuation with different ISIs when presenting stimuli repeatedly with and without occasional pitch deviant stimuli breaking the regularity. While habituation was theoretically stronger in the condition without deviant stimuli, ISI effects were identical in the two conditions, thus suggesting that habituation was not involved. Herrmann et al. (2016) found different patterns of attenuation of N1 and P2 between regular and irregular non-predictable sound sequences, both affected by the long temporal stimulation history. The authors fitted the data to a single-neuron computational model and concluded adaptation alone could explain their observed effects. More research replicating these findings and isolating pure adaptation from habituation is necessary to help disentangle which neural mechanisms are behind AER modulations by ISI, thus improving our interpretations on the significance of evoked responses.

5 |. CONCLUSIONS

This is the first study simultaneously measuring middle-and long-latency auditory evoked responses while manipulating stimulus intensity and ISI in an unpredictable auditory context. We proved both factors modulated AERs without stimulus predictability confounds, with stimulus intensity affecting evoked responses as early as 8 ms from stimulus onset. Moreover, we provided evidence of Pb (~50 ms) being the first stage of the auditory processing hierarchy that has not yet recovered for ISIs above 250 ms and that shows a positive interdependence between intensity and ISI modulations. Our findings also add into the discussion of the neurophysiological origin of AERs reduction with short ISIs, supporting the neural adaptation hypothesis.

Supplementary Material

sup fig 5

Figure S5 EEG scalp topographies for every MLR and LLR peak. The white dot represents the position of FCz electrode, where statistical analyses were conducted.

sup fig 4

Figure S4 (Top) LLR number of sweeps after amplitude threshold rejection for every experimental condition (numbers 11 to 93). First digit of the condition number indicates the ISI (1 = 0.25–0.5 s; 9 = 4–8 s) and the second digit indicates the intensity level (1 = 65 dB; 3 = 85 dB). (Bottom) LLR Signal-to-Noise Ratio calculated as the Log Root Mean Square (RMS) of the signal after the stimulus divided by the signal at the baseline. Values are provided for every experimental condition and the average.

sup fig 3

Figure S3 (Top) MLR number of sweeps after amplitude threshold rejection for every experimental condition (numbers 11 to 93). First digit of the condition number indicates the ISI (1 = 0.25–0.5 s; 9 = 4–8 s) and the second digit indicates the intensity level (1 = 65 dB; 3 = 85 dB). Participant’s under a minimum of 400 surviving (dashed line) sweeps on average across conditions were discarded from further analyses. (Bottom) MLR Signal-to-Noise Ratio calculated as the Log Root Mean Square (RMS) of the signal after the stimulus divided by the signal at the baseline. Values are provided for every experimental condition and the average.

sup fig 2

Figure S2 MLR waveforms with 200 Hz (left) or 100 Hz (right) low-pass filter for comparison. N reflects the number of subjects surviving amplitude threshold correction with more than 400 sweeps.

sup fig 1

Figure S1 Mean and standard deviation of hearing thresholds (dB) averaged across participants. Thresholds were measured with a pure-tone audiometry (1000–4000 Hz).

ACKNOWLEDGMENTS

We thank Vanessa Fishel, Natasha Torrence, Yiming Wang, and Rebekah Farris for their assistance in data collection.

FUNDING INFORMATION

This research was supported by funding from the National Institutes of Health (NIMH RF1 MH114223).

Footnotes

CONFLICT OF INTEREST

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

SUPPORTING INFORMATION

Additional supporting information can be found online in the Supporting Information section at the end of this article.

DATA AVAILABILITY STATEMENT

The data that support the findings of this study are available from the corresponding author upon reasonable request.

REFERENCES

  1. Alcaini M, Giard MH, Thévenet M, & Pernier J (1994). Two separate frontal components in the N1 wave of the human auditory evoked response. Psychophysiology, 31(6), 611–615. 10.1111/j.1469-8986.1994.tb02354.x [DOI] [PubMed] [Google Scholar]
  2. Anderer P, Saletu B, Semlitsch HV, & Pascual-Marqui RD (1998). Electrical sources of P300 event-related brain potentials revealed by low resolution electromagnetic tomography. 2. Effects of nootropic therapy in age-associated memory impairment. Neuropsychobiology, 37, 28–35. 10.1159/000026473 [DOI] [PubMed] [Google Scholar]
  3. Barry RJ, Cocker KI, Anderson JW, Gordon E, & Rennie C (1992). Does the N100 evoked potential really habituate? Evidence from a paradigm appropriate to a clinical setting. International Journal of Psychophysiology, 13, 9–16. 10.1016/0167-8760(92)90014-3 [DOI] [PubMed] [Google Scholar]
  4. Boettcher FA, Salvi RJ, & Saunders SS (1990). Recovery from short-term adaptation in single neurons in the cochlear nucleus. Hearing Research, 48, 125–144. 10.1016/0378-5955(90)90203-2 [DOI] [PubMed] [Google Scholar]
  5. Borgmann C, Roß B, Draganova R, & Pantev C (2001). Human auditory middle latency responses: Influence of stimulus type and intensity. Hearing Research, 158, 57–64. 10.1016/S0378-5955(01)00292-1 [DOI] [PubMed] [Google Scholar]
  6. Briley PM, & Krumbholz K (2013). The specificity of stimulus-specific adaptation in human auditory cortex increases with repeated exposure to the adapting stimulus. Journal of Neurophysiology, 110, 2679–2688. 10.1152/jn.01015.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Budd TW, Barry RJ, Gordon E, Rennie C, & Michie PT (1998). Decrement of the N1 auditory event-related potential with stimulus repetition: Habituation vs. refractoriness. International Journal of Psychophysiology, 31, 51–68. 10.1016/S0167-8760(98)00040-3 [DOI] [PubMed] [Google Scholar]
  8. Budd TW, & Michie PT (1994). Facilitation of the N1 peak of the auditory ERP at short stimulus intervals. NeuroReport, 5, 2513–2516. 10.1097/00001756-199412000-00027 [DOI] [PubMed] [Google Scholar]
  9. Burkard RF, & Sims D (2001). The human auditory brainstem response to high click rates: Aging effects. American Journal of Audiology, 10, 53–61. 10.1044/1059-0889(2001/008) [DOI] [PubMed] [Google Scholar]
  10. Coffman BA, Haigh SM, Murphy TK, & Salisbury DF (2017). Impairment in mismatch negativity but not repetition suppression in schizophrenia. Brain Topography, 30, 521–530. 10.1007/s10548-017-0571-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Crowley KE, & Colrain IM (2004). A review of the evidence for P2 being an independent component process: Age, sleep and modality. Clinical Neurophysiology, 115, 732–744. 10.1016/j.clinph.2003.11.021 [DOI] [PubMed] [Google Scholar]
  12. Davis H, Mast T, Yoshie N, & Zerlin S (1966). The slow response of the human cortex to auditory stimuli: Recovery process. Electroencephalography and Clinical Neurophysiology, 21, 105–113. 10.1016/0013-4694(66)90118-0 [DOI] [PubMed] [Google Scholar]
  13. Deiber MP, Ibañez V, Fischer C, Perrin F, & Mauguière F (1988). Sequential mapping favours the hypothesis of distinct generators for Na and Pa middle latency auditory evoked potentials. Electroencephalography and Clinical Neurophysiology/Evoked Potentials, 71, 187–197. 10.1016/0168-5597(88)90004-4 [DOI] [PubMed] [Google Scholar]
  14. Delorme A, & Makeig S (2004). EEGLAB: An open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. Journal of Neuroscience Methods, 134, 9–21. 10.1016/j.jneumeth.2003.10.009 [DOI] [PubMed] [Google Scholar]
  15. Dzulkarnain AAA, Hadi USA, & Zakaria NA (2013). The effects of stimulus rate and electrode montage on the auditory brainstem response in infants. Speech, Language and Hearing, 16, 221–226. 10.1179/2050572813Y.0000000017 [DOI] [Google Scholar]
  16. Dzulkarnain AAA, Shahrudin FA, Jamal FN, Marzuki MN, & Mazlan MNS (2020). Effects of stimulus repetition rates on the auditory brainstem response to level-specific ce-chirp in normal-hearing adults. American Journal of Audiology, 29, 838–850. 10.1044/2020_AJA-20-00049 [DOI] [PubMed] [Google Scholar]
  17. Erwin RJ, & Buchwald JS (1986). Midlatency auditory evoked responses: Differential recovery cycle characteristics. Electroencephalography and Clinical Neurophysiology, 64, 417–423. 10.1016/0013-4694(86)90075-1 [DOI] [PubMed] [Google Scholar]
  18. Fitzpatrick DC, Kuwada S, Kim DO, Parham K, & Batra R (1999). Responses of neurons to click-pairs as simulated echoes: Auditory nerve to auditory cortex. The Journal of the Acoustical Society of America, 106, 3460–3472. 10.1121/1.428199 [DOI] [PubMed] [Google Scholar]
  19. Giard MH, Perrin F, Echallier JF, Thevenet M, Froment JC, & Pernier J (1994). Dissociation of temporal and frontal components in the human auditory N1 wave: A scalp current density and dipole model analysis. Electroencephalography and Clinical Neurophysiology/Evoked Potentials Section, 92(3), 238–252. [DOI] [PubMed] [Google Scholar]
  20. Godey B, Schwartz D, De Graaf JB, Chauvel P, & Liégeois-Chauvel C (2001). Neuromagnetic source localization of auditory evoked fields and intracerebral evoked potentials: A comparison of data in the same patients. Clinical Neurophysiology, 112, 1850–1859. 10.1016/S1388-2457(01)00636-8 [DOI] [PubMed] [Google Scholar]
  21. Goldstein R, Rodman LB, & Karlovich RS (1972). Effects of stimulus rate and number on the early components of the averaged electroencephalic response. Journal of Speech and Hearing Research, 15, 559–566. 10.1044/jshr.1503.559 [DOI] [PubMed] [Google Scholar]
  22. Hagenmuller F, Heekeren K, Meier M, Theodoridou A, Walitza S, Haker H, Rössler W, & Kawohl W (2016). The loudness dependence of auditory evoked potentials (LDAEP) in individuals at risk for developing bipolar disorders and schizophrenia. Clinical Neurophysiology, 127, 1342–1350. 10.1016/j.clinph.2015.10.050 [DOI] [PubMed] [Google Scholar]
  23. Harris KC, Mills JH, & Dubno JR (2007). Electrophysiologic correlates of intensity discrimination in cortical evoked potentials of younger and older adults. Hearing Research, 228, 58–68. 10.1016/j.heares.2007.01.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Heinemann LV, Kaiser J, & Altmann CF (2011). Auditory repetition enhancement at short interstimulus intervals for frequency-modulated tones. Brain Research, 1411, 65–75. 10.1016/j.brainres.2011.07.009 [DOI] [PubMed] [Google Scholar]
  25. Herrmann B, Henry MJ, Johnsrude IS, & Obleser J (2016). Altered temporal dynamics of neural adaptation in the aging human auditory cortex. Neurobiology of Aging, 45, 10–22. 10.1016/j.neurobiolaging.2016.05.006 [DOI] [PubMed] [Google Scholar]
  26. Herrmann B, Schlichting N, & Obleser J (2014). Dynamic range adaptation to spectral stimulus statistics in human auditory cortex. Journal of Neuroscience, 34, 327–331. 10.1523/JNEUROSCI.3974-13.2014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Huang Y, & Rao RPN (2011). Predictive coding. Wiley Interdisciplinary Reviews: Cognitive Science, 2(5), 580–593. [DOI] [PubMed] [Google Scholar]
  28. Jewett DL (1970). Volume-conducted potentials in response to auditory stimuli as detected by averaging in the cat. Electroencephalography and Clinical Neurophysiology, 28, 609–618. 10.1016/0013-4694(70)90203-8 [DOI] [PubMed] [Google Scholar]
  29. Joris PX, Schreiner CE, & Rees A (2004). Neural processing of amplitude-modulated sounds. Physiological Reviews, 84, 541–577. 10.1152/physrev.00029.2003 [DOI] [PubMed] [Google Scholar]
  30. Juckel G, Gallinat J, Riedel M, Sokullu S, Schulz C, Möller HJ, Müller N, & Hegerl U (2003). Serotonergic dysfunction in schizophrenia assessed by the loudness dependence measure of primary auditory cortex evoked activity. Schizophrenia Research, 64, 115–124. 10.1016/S0920-9964(03)00016-1 [DOI] [PubMed] [Google Scholar]
  31. Kaufmann F, Kaufmann R, & Salapatek P (1982). The effect of change in a stimulus sequence on P300. Neuropsychologia, 20, 439–445. 10.1016/0028-3932(82)90042-2 [DOI] [PubMed] [Google Scholar]
  32. Kavanagh KT, & Domico WD (1987). High pass digital and analog filtering of the middle latency response. Ear and Hearing, 8, 101–109. 10.1097/00003446-198704000-00007 [DOI] [PubMed] [Google Scholar]
  33. Keidel WD, & Spreng M (1965). Neurophysiological evidence for the Stevens power function in man. The Journal of the Acoustical Society of America, 38, 191–195. 10.1121/1.1909629 [DOI] [PubMed] [Google Scholar]
  34. Kileny P, Paccioretti D, & Wilson AF (1987). Effects of cortical lesions on middle-latency auditory evoked responses (MLR). Electroencephalography and Clinical Neurophysiology, 66, 108–120. 10.1016/0013-4694(87)90180-5 [DOI] [PubMed] [Google Scholar]
  35. Kraus N, Özdamar Ö, Hier D, & Stein L (1982). Auditory middle latency responses (MLRs) in patients with cortical lesions. Electroencephalography and Clinical Neurophysiology, 54, 275–287. 10.1016/0013-4694(82)90177-8 [DOI] [PubMed] [Google Scholar]
  36. Kupperman GL, & Mendel MI (1974). Threshold of the early components of the averaged electroencephalic response determined with tone pips and clicks during drug-induced sleep. Audiology, 13(5), 379–390. [DOI] [PubMed] [Google Scholar]
  37. Lange K (2009). Brain correlates of early auditory processing are attenuated by expectations for time and pitch. Brain and Cognition, 69, 127–137. 10.1016/j.bandc.2008.06.004 [DOI] [PubMed] [Google Scholar]
  38. Lightfoot G, & Kennedy V (2006). Cortical electric response audiometry hearing threshold estimation: Accuracy, speed, and the effects of stimulus presentation features. Ear and Hearing, 27(5), 443–456. 10.1097/01.aud.0000233902.53432.48 [DOI] [PubMed] [Google Scholar]
  39. Lopez-Caballero F, Zarnowiec K, & Escera C (2016). Differential deviant probability effects on two hierarchical levels of the auditory novelty system. Biological Psychology, 120, 1–9. 10.1016/j.biopsycho.2016.08.001 [DOI] [PubMed] [Google Scholar]
  40. Loveless N, Hari R, Hämäläinen M, & Tiihonen J (1989). Evoked responses of human auditory cortex may be enhanced by preceding stimuli. Electroencephalography and Clinical Neurophysiology, 74, 217–227. 10.1016/0013-4694(89)90008-4 [DOI] [PubMed] [Google Scholar]
  41. MacLeod KM, & Carr CE (2007). Beyond timing in the auditory brainstem: Intensity coding in the avian cochlear nucleus angularis. Progress in Brain Research, 165, 123–133. 10.1016/S0079-6123(06)65008-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Madell JR, & Goldstein R (1972). Relation between loudness and the amplitude of the early components of the averaged electroencephalic response. Journal of Speech and Hearing Research, 15, 134–141. 10.1044/jshr.1501.134 [DOI] [PubMed] [Google Scholar]
  43. Mahdavi ME, & Peyvandi AA (2007). Accuracy of cortical evoked response audiometry in estimating normal hearing thresholds. Tehran University Medical Journal, 65, 17–22. [Google Scholar]
  44. McEvoy L, Levänen S, & Loveless N (1997). Temporal characteristics of auditory sensory memory: Neuromagnetic evidence. Psychophysiology, 34, 308–316. 10.1111/j.1469-8986.1997.tb02401.x [DOI] [PubMed] [Google Scholar]
  45. Mcfarland WH, Vivion MC, Wolf KE, & Goldstein R (1975). Reexamination of effects of stimulus rate and number on the middle components of the averaged electroencephalic response. International Journal of Audiology, 14, 456–465. 10.3109/00206097509071756 [DOI] [PubMed] [Google Scholar]
  46. McGee T, Kraus N, Littman T, & Nicol T (1992). Contributions of medial geniculate body subdivisions to the middle latency response. Hearing Research, 61, 147–154. 10.1016/0378-5955(92)90045-O [DOI] [PubMed] [Google Scholar]
  47. Muller-Gass A, Marcoux A, Jamshidi P, & Campbell K (2008). The effects of very slow rates of stimulus presentation on event-related potential estimates of hearing threshold. International Journal of Audiology, 47, 34–43. 10.1080/14992020701647934 [DOI] [PubMed] [Google Scholar]
  48. Näätänen R, & Picton T (1987). The N1 wave of the human electric and magnetic response to sound: A review and an analysis of the component structure. Psychophysiology, 24, 375–425. 10.1111/j.1469-8986.1987.tb00311.x [DOI] [PubMed] [Google Scholar]
  49. Nelson DA, & Lassman FM (1968). Effects of intersignal interval on the human auditory evoked response. The Journal of the Acoustical Society of America, 44, 1529–1532. 10.1121/1.1911292 [DOI] [PubMed] [Google Scholar]
  50. Okamoto H, & Kakigi R (2014). Neural adaptation to silence in the human auditory cortex: A magnetoencephalographic study. Brain and Behavior, 4, 858–866. 10.1002/brb3.290 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Onitsuka T, Ninomiya H, Sato E, Yamamoto T, & Tashiro N (2003). Differential characteristics of the middle latency auditory evoked magnetic responses to interstimulus intervals. Clinical Neurophysiology, 114, 1513–1520. 10.1016/S1388-2457(03)00110-X [DOI] [PubMed] [Google Scholar]
  52. O’Shea RP (2015). Refractoriness about adaptation. Frontiers in Human Neuroscience, 9, 38. 10.3389/fnhum.2015.00038 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Özdamar Ö, & Kraus N (1983). Auditory middle-latency responses in humans. International Journal of Audiology, 22, 34–49. 10.3109/00206098309072768 [DOI] [PubMed] [Google Scholar]
  54. Pereira DR, Cardoso S, Ferreira-Santos F, Fernandes C, Cunha-Reis C, Paiva TO, Almeida PR, Silveira C, Barbosa F, & Marques-Teixeira J (2014). Effects of inter-stimulus interval (ISI) duration on the N1 and P2 components of the auditory event-related potential. International Journal of Psychophysiology, 94, 311–318. 10.1016/j.ijpsycho.2014.09.012 [DOI] [PubMed] [Google Scholar]
  55. Picton TW, Hillyard SA, Krausz HI, & Galambos R (1974). Human auditory evoked potentials. I: Evaluation of components. Electroencephalography and Clinical Neurophysiology, 36, 179–190. 10.1016/0013-4694(74)90155-2 [DOI] [PubMed] [Google Scholar]
  56. Rankin CH, Abrams T, Barry RJ, Bhatnagar S, Clayton DF, Colombo J, Coppola G, Geyer MA, Glanzman DL, Marsland S, McSweeney F, Wilson DA, Wu CF, & Thompson RF (2009). Habituation revisited: An updated and revised description of the behavioral characteristics of habituation. Neurobiology of Learning and Memory, 92, 135–138. 10.1016/j.nlm.2008.09.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Rentzsch J, Jockers-Scherübl MC, Boutros NN, & Gallinat J (2008). Test-retest reliability of P 50, N100 and P200 auditory sensory gating in healthy subjects. International Journal of Psychophysiology, 67, 81–90. 10.1016/j.ijpsycho.2007.10.006 [DOI] [PubMed] [Google Scholar]
  58. Rhode WS, & Smith PH (1986). Encoding timing and intensity in the ventral cochlear nucleus of the cat. Journal of Neurophysiology, 56, 261–286. 10.1152/jn.1986.56.2.261 [DOI] [PubMed] [Google Scholar]
  59. Ritter W, Vaughan HG, & Costa LD (1968). Orienting and habituation to auditory stimuli: A study of short terms changes in average evoked responses. Electroencephalography and Clinical Neurophysiology, 25, 550–556. 10.1016/0013-4694(68)90234-4 [DOI] [PubMed] [Google Scholar]
  60. Rosburg T, & Mager R (2020). The reduced auditory evoked potential component N1 after repeated stimulation: Refractoriness hypothesis vs. habituation account. Hearing Research, 400, 108140. 10.1016/j.heares.2020.108140 [DOI] [PubMed] [Google Scholar]
  61. Rosburg T, Zimmerer K, & Huonker R (2010). Short-term habituation of auditory evoked potential and neuromagnetic field components in dependence of the interstimulus interval. Experimental Brain Research, 205, 559–570. 10.1007/s00221-010-2391-3 [DOI] [PubMed] [Google Scholar]
  62. Roth WT, Krainz PL, Ford JM, Tinklenberg JR, Rothbart RM, & Kopell BS (1976). Parameters of temporal recovery of the human auditory evoked potential. Electroencephalography and Clinical Neurophysiology, 40, 623–632. 10.1016/0013-4694(76)90137-1 [DOI] [PubMed] [Google Scholar]
  63. Ruusuvirta T (2020). The release from refractoriness hypothesis of N1 of event-related potentials needs reassessment. Hearing Research, 399, 107923. 10.1016/j.heares.2020.107923 [DOI] [PubMed] [Google Scholar]
  64. Sable JJ, Low KA, Maclin EL, Fabiani M, & Gratton G (2004). Latent inhibition mediates N1 attenuation to repeating sounds. Psychophysiology, 41, 636–642. 10.1111/j.1469-8986.2004.00192.x [DOI] [PubMed] [Google Scholar]
  65. Sams M, Hari R, Rif J, & Knuutila J (1993). The human auditory sensory memory trace persists about 10 sec: Neuromagnetic evidence. Journal of Cognitive Neuroscience, 5, 363–370. 10.1162/jocn.1993.5.3.363 [DOI] [PubMed] [Google Scholar]
  66. Schadow J, Lenz D, Thaerig S, Busch NA, Fründ I, & Herrmann CS (2007). Stimulus intensity affects early sensory processing: Sound intensity modulates auditory evoked gamma-band activity in human EEG. International Journal of Psychophysiology, 65, 152–161. 10.1016/j.ijpsycho.2007.04.006 [DOI] [PubMed] [Google Scholar]
  67. Schwartze M, Farrugia N, & Kotz SA (2013). Dissociation of formal and temporal predictability in early auditory evoked potentials. Neuropsychologia, 51, 320–325. 10.1016/j.neuropsychologia.2012.09.037 [DOI] [PubMed] [Google Scholar]
  68. Scott ML, & Harkins SW (1978). Amplitude of the brainstem auditory evoked response: The effect of interstimulus interval. International Journal of Neuroscience, 8, 147–152. 10.3109/00207457809150392 [DOI] [Google Scholar]
  69. Shucard DW, & Callaway E (1974). Auditory evoked potential amplitude and variability: Effects of task and intellectual ability. Journal of Comparative and Physiological Psychology, 87, 284–294. 10.1037/h0036855 [DOI] [PubMed] [Google Scholar]
  70. Sokolov EN (1963). Higher nervous functions: The orienting reflex. Annual Review of Physiology, 25(1), 545–580. [DOI] [PubMed] [Google Scholar]
  71. Stapells DR, & Oates P (1997). Estimation of the pure-tone audiogram by the auditory brainstem response: A review. Audiology and Neuro-Otology, 2, 257–280. 10.1159/000259252 [DOI] [PubMed] [Google Scholar]
  72. Sussman E, Steinschneider M, Gumenyuk V, Grushko J, & Lawson K (2008). The maturation of human evoked brain potentials to sounds presented at different stimulus rates. Hearing Research, 236, 61–79. 10.1016/j.heares.2007.12.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Tadel F, Baillet S, Mosher JC, Pantazis D, & Leahy RM (2011). Brainstorm: A user-friendly application for MEG/EEG analysis. Computational Intelligence and Neuroscience, 2011, 1–13. 10.1155/2011/879716 [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Teder W, Alho K, Reinikainen K, & Näätänenn R (1993). Interstimulus interval and the selective-attention effect on auditory ERPs: “N1 enhancement” versus processing negativity. Psychophysiology, 30, 71–81. 10.1111/j.1469-8986.1993.tb03206.x [DOI] [PubMed] [Google Scholar]
  75. Teichert T, Gurnsey K, Salisbury D, & Sweet RA (2016). Contextual processing in unpredictable auditory environments: The limited resource model of auditory refractoriness in the rhesus. Journal of Neurophysiology, 116, 2125–2139. 10.1152/jn.00419.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Thompson RF, & Spencer WA (1966). Habituation: A model phenomenon for the study of neuronal substrates of behavior. Psychological Review, 73, 16–43. 10.1037/h0022681 [DOI] [PubMed] [Google Scholar]
  77. Thornton AR, Mendel MI, & Anderson CV (1977). Effects of stimulus frequency and intensity on the middle components of the averaged auditory electroencephalic response. Journal of Speech and Hearing Research, 20, 81–94. 10.1044/jshr.2001.81 [DOI] [PubMed] [Google Scholar]
  78. Todd J, Michie PT, Budd TW, Rock D, & Jablensky AV (2000). Auditory sensory memory in schizophrenia: Inadequate trace formation? Psychiatry Research, 96, 99–115. 10.1016/S0165-1781(00)00205-5 [DOI] [PubMed] [Google Scholar]
  79. Tsodyks MV, & Markram H (1997). The neural code between neocortical pyramidal neurons depends on neurotransmitter release probability. Proceedings of the National Academy of Sciences of the United States of America, 94, 719–723. 10.1073/pnas.94.2.719 [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Tucker DA, Dietrich S, Harris S, & Pelletier S (2002). Effects of stimulus rate and gender on the auditory middle latency response. Journal of the American Academy of Audiology, 13, 146–153. 10.1055/s-0040-1715956 [DOI] [PubMed] [Google Scholar]
  81. Tucker DA, Dietrich S, McPherson DL, & Salamat MT (2001). Effect of stimulus intensity level on auditory middle latency response brain maps in human adults. Journal of the American Academy of Audiology, 12, 471–477. https://www.audiology.org/sites/default/files/journal/JAAA_12_05_01.pdf [PubMed] [Google Scholar]
  82. Ulanovsky N, Las L, Farkas D, & Nelken I (2004). Multiple time scales of adaptation in auditory cortex neurons. Journal of Neuroscience, 24, 10440–10453. 10.1523/JNEUROSCI.1905-04.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. van den Honert C, & Stypulkowski PH (1986). Characterization of the electrically evoked auditory brainstem response (ABR) in cats and humans. Hearing Research, 21, 109–126. 10.1016/0378-5955(86)90033-X [DOI] [PubMed] [Google Scholar]
  84. Verkindt C, Bertrand O, Thevenet M, & Pernier J (1994). Two auditory components in the 130–230 ms range disclosed by their stimulus frequency dependence. NeuroReport, 5, 1189–1192. 10.1097/00001756-199406020-00007 [DOI] [PubMed] [Google Scholar]
  85. Volosin M, Czigler I, & Horváth J (2021). Pre-attentive auditory change detection for rapid auditory transient combinations: Insight from age-related processing changes. Biological Psychology, 159, 108024. 10.1016/j.biopsycho.2021.108024 [DOI] [PubMed] [Google Scholar]
  86. Volosin M, Gaál ZA, & Horváth J (2017). Age-related processing delay reveals cause of apparent sensory excitability following auditory stimulation. Scientific Reports, 7(1), 1–12. 10.1038/s41598-017-10696-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Wang AL, Mouraux A, Liang M, & Iannetti GD (2008). The enhancement of the N1 wave elicited by sensory stimuli presented at very short inter-stimulus intervals is a general feature across sensory system. PLoS One, 3, e3929. 10.1371/journal.pone.0003929 [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Whitham EM, Pope KJ, Fitzgibbon SP, Lewis T, Clark CR, Loveless S, Broberg M, Wallace A, DeLosAngeles D, Lillie P, Hardy A, Fronsko R, Pulbrook A, & Willoughby JO (2007). Scalp electrical recording during paralysis: Quantitative evidence that EEG frequencies above 20 Hz are contaminated by EMG. Clinical Neurophysiology, 118, 1877–1888. 10.1016/j.clinph.2007.04.027 [DOI] [PubMed] [Google Scholar]
  89. Wyss C, Boers F, Kawohl W, Arrubla J, Vahedipour K, Dammers J, Neuner I, & Shah NJ (2014). Spatiotemporal properties of auditory intensity processing in multisensor MEG. NeuroImage, 102, 465–473. 10.1016/j.neuroimage.2014.08.012 [DOI] [PubMed] [Google Scholar]
  90. Yoshiura T, Ueno S, Iramina K, & Masuda K (1996). Human middle latency auditory evoked magnetic fields. Brain Topography, 8, 291–296. 10.1007/BF01184787 [DOI] [PubMed] [Google Scholar]
  91. Yvert B, Crouzeix A, Bertrand O, Seither-Preisler A, & Pantev C (2001). Multiple supratemporal sources of magnetic and electric auditory evoked middle latency components in humans. Cerebral Cortex, 11, 411–423. 10.1093/cercor/11.5.411 [DOI] [PubMed] [Google Scholar]
  92. Yvert B, Fischer C, Bertrand O, & Pernier J (2005). Localization of human supratemporal auditory areas from intracerebral auditory evoked potentials using distributed source models. NeuroImage, 28, 140–153. 10.1016/j.neuroimage.2005.05.056 [DOI] [PubMed] [Google Scholar]
  93. Zerlin S, & Naunton RF (1974). Early and late averaged electroencephalic responses at low sensation levels. International Journal of Audiology, 13, 366–378. 10.3109/00206097409071695 [DOI] [PubMed] [Google Scholar]
  94. Zhang F, Eliassen J, Anderson J, Scheifele P, & Brown D (2009). The time course of the amplitude and latency in the auditory late response evoked by repeated tone bursts. Journal of the American Academy of Audiology, 20, 239–250. 10.3766/jaaa.20.4.4 [DOI] [PubMed] [Google Scholar]
  95. Zygierewicz J, Sielużycki C, Zacharias N, Suffczyński P, Kordowski P, Scheich H, Durka PJ, & König R (2012). Estimation of the spatiotemporal structure of event-related desynchronization and synchronization in magnetoencephalography. Journal of Neuroscience Methods, 205, 148–158. 10.1016/j.jneumeth.2011.12.013 [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

sup fig 5

Figure S5 EEG scalp topographies for every MLR and LLR peak. The white dot represents the position of FCz electrode, where statistical analyses were conducted.

sup fig 4

Figure S4 (Top) LLR number of sweeps after amplitude threshold rejection for every experimental condition (numbers 11 to 93). First digit of the condition number indicates the ISI (1 = 0.25–0.5 s; 9 = 4–8 s) and the second digit indicates the intensity level (1 = 65 dB; 3 = 85 dB). (Bottom) LLR Signal-to-Noise Ratio calculated as the Log Root Mean Square (RMS) of the signal after the stimulus divided by the signal at the baseline. Values are provided for every experimental condition and the average.

sup fig 3

Figure S3 (Top) MLR number of sweeps after amplitude threshold rejection for every experimental condition (numbers 11 to 93). First digit of the condition number indicates the ISI (1 = 0.25–0.5 s; 9 = 4–8 s) and the second digit indicates the intensity level (1 = 65 dB; 3 = 85 dB). Participant’s under a minimum of 400 surviving (dashed line) sweeps on average across conditions were discarded from further analyses. (Bottom) MLR Signal-to-Noise Ratio calculated as the Log Root Mean Square (RMS) of the signal after the stimulus divided by the signal at the baseline. Values are provided for every experimental condition and the average.

sup fig 2

Figure S2 MLR waveforms with 200 Hz (left) or 100 Hz (right) low-pass filter for comparison. N reflects the number of subjects surviving amplitude threshold correction with more than 400 sweeps.

sup fig 1

Figure S1 Mean and standard deviation of hearing thresholds (dB) averaged across participants. Thresholds were measured with a pure-tone audiometry (1000–4000 Hz).

Data Availability Statement

The data that support the findings of this study are available from the corresponding author upon reasonable request.

RESOURCES