Skip to main content
Trends in Hearing logoLink to Trends in Hearing
. 2020 Jan 29;24:2331216519900595. doi: 10.1177/2331216519900595

Neural Representation of Loudness: Cortical Evoked Potentials in an Induced Loudness Reduction Experiment

Florian H Schmidt 1,, Manfred Mauermann 1, Birger Kollmeier 1
PMCID: PMC6990611  PMID: 31994456

Short abstract

Loudness context effects comprise differences in judgments of the loudness of a target stimulus depending on the presence of a preceding inducer tone. Interstimulus intervals (ISIs) between inducer tone and target tone of about 200 ms and above cause an induced loudness reduction (ILR) of the target tone. As the ILR increases, respectively, the perceived loudness of the target stimuli decreases with increasing ISI. This in turn means that identical stimuli in a different context have a differently perceived loudness. A correlation between specific characteristics in the electroencephalography responses and perceived loudness in an ILR experiment would therefore provide a neurophysiological indication of loudness processing beyond a neural representation of stimulus intensity only. To examine if such a correlation exists, we investigated cortical electroencephalography responses in a latency range from 75 to 510 ms during a psychoacoustical ILR experiment with different ISIs. With increasing ISI, the strength of the N1-P2 deflection of the respective electroencephalography response decreases similarly to the loudness perception of the target tone pulse. This indicates a representation based on loudness rather than on intensity at the corresponding processing stage.

Keywords: loudness context effect, neural correlate, cortical potentials, interstimulus interval, induced loudness reduction, loudness enhancement

Introduction

Loudness is an auditory measure of perception and can essentially be defined as the perceived intensity of a sound. However, it has been well known for decades that besides intensity, other physical parameters such as spectral content and bandwidth or temporal properties like duration contribute to loudness perception (Moore, 2013; Stevens, 1957; Zwicker, 1958; Zwicker & Fastl, 1999). Although many loudness effects are linked to the physical properties of the stimulus, there are experiments indicating loudness differences for identical stimuli that were caused by changing contexts (for a review, see, e.g., Arieh & Marks, 2011). In psychophysics, the established common idea was that these contextual effects reflect relatively late judgment processes, that is, a bias in response, rather than changes in internal representation of the stimulus underlying the perception of sound intensity (Anderson, 1975; Stevens, 1958). However, more recent psychoacoustical research on contextual effects of loudness suggests that there is as well evidence for explicit changes in the internal representation of the stimulus underlying the perception of loudness (e.g., Algom & Marks, 1990; Schneider & Parker, 1990). Correlates between psychoacoustical measured effects of contextual loudness and neurophysiological responses would support the findings of these psychoacoustical studies if the latencies of the respective neurophysiological responses are too short to assume a relation to late judgment processes, or even too short to assume that consciously accessible processing stages are involved.

From the perspective of neurophysiological electroencephalography (EEG) measurements, several studies have shown that the change of sound intensity correlates with changes in the amplitudes of auditory evoked potentials (AEPs) related to brainstem activity (Bauer, Elmasian, & Galambos, 1974; Dau, Wegner, Mellert, & Kollmeier, 2000; Fobel & Dau, 2004; Junius & Dau, 2005; Serpanos, O’Malley, & Gravel, 1997) as well as in AEP related to activity in the auditory cortex (Hegerl, Gallinat, & Mrowinski, 1994; Hoppe, Rosanowski, Iro, & Eysholdt, 2001; Mulert, Juckel, Augustin, & Hegerl, 2002; Mariam et al., 2012; Näätänen & Picton, 1987; Potter et al., 2017). Generally, those studies treated the analysis of AEP that can be generated simply, using clicks or tone pulses.

When focusing on the relation of absolute intensity and AEP, most authors agree on the close relation of sound intensity in AEPs, but it remains rather unclear whether the observations are indicating only a correlation to intensity or a link to the perceived loudness. Relevant studies are contradictory (Babkoff, Pratt, & Kempinski, 1984; Darling & Price, 1990; Hoppe et al., 2001; Näätänen & Picton, 1987; Pratt & Sohmer, 1977; Serpanos et al., 1997; Silva & Epstein, 2010, 2012). Sound intensity is the factor with the greatest influence on loudness. Therefore, intensity and loudness show a close covariation. This makes it rather difficult to distinguish whether a neural response is better correlated with one or the other. Previous studies typically tried to detect compression effects in the neural response, to distinguish loudness from sound intensity (Behler & Uppenkamp, 2016; Castro, de Prat, & Zabala, 2008; Eeckhoutte, Wouters, & Francart, 2016; Emara & Kolkaila, 2010; Ménard, Gallégo, Berger-Vachon, Collet, & Thai-Van, 2008). However, it is not clear if a more compressed representation of the stimulus in the auditory pathway is necessarily indicating an earlier loudness processing, in particular when considering (a) that it can be assumed that a major part of the compression of the auditory dynamic range takes place in the cochlea and (b) the possibility of central expansion in loudness processing (see, e.g., Zhang & Zeng, 1997). In an ILR paradigm, typically identical stimuli (e.g., tone pulses) are judged as differently loud according to different contexts given by an inducer tone pulse presented beforehand at different ISIs. The cochlear processing of two physically identical reference stimuli, especially their compression, can be assumed to be almost the same even in different contexts, that is, for different ISIs (if the ISI is not too short). Therefore, any correlation between specific AEPs and the differently perceived loudness of identical reference stimuli in different contexts could provide an AEP based loudness indicator that has a different quality compared to loudness indicators that provide essentially correlations to the representation of stimulus intensity at different levels after cochlear compression.

In many psychoacoustical studies, loudness context effects have been addressed as the level difference (LI–LB) between the level LB of a comparison tone matched to be as equally loud as a preceding target tone of a different frequency without any inducer tone (baseline task) and the level LI matched in the presence of an additional inducer tone (typically at a higher level) preceding the target tone with a specific interstimulus interval (ISI). The presence of a more intense inducer at close temporal proximity (ISI less than 100 ms) and close in frequency typically enhances the loudness of a target tone (Marks, 1988), whereas ISIs longer than 200 ms lead to a so-called induced loudness reduction (ILR; Scharf, Buus, & Nieder, 2002). ILR can reach 10 dB or more (Arieh & Marks, 2003a; Nieder, Buus, Florentine, & Scharf, 2003) and the recovery process from ILR has a time scale of a minute (Marks, 1993) or even longer (Arieh, Kelly, Marks, & Pierce, 2005). To explain the temporal properties of these loudness-context effects, Arieh and Marks (2003a) suggested that the inducer tone starts two processes: an initial enhancement (or decrement) process with fast onset and decay and an ILR adaptation process with fast onset but slower decay.

According to Oberfeld (2007), the initial process of loudness enhancement can be seen as a rather centrally located merging process of inducer and target tone (see also Elmasian, Galambos, & Bernheim, 1980). Besides, loudness enhancement ILR is as well interpreted as a centrally based adaptation-like process (Arieh & Marks, 2003b). Mechanisms on lower processing stages can be widely excluded as the most relevant explanations for these loudness context effects. Stimulus interactions on cochlear level can be neglected, since cochlear processing such as peripheral ringing is thought to last only a few cycles of the filter’s center frequency. Loudness changes with increasing ISI should show the opposite trend to the observed ILR effect when assuming the same mechanisms as for classical forward-masking. Nieder et al. (2003) argue that medial efferents affecting cochlear tuning are unlikely, due to the mismatch of the respective time constants in psychoacoustics and efferent mechanisms and the general insensitivity of medial efferents to short stimuli of the order of 20 ms, which nevertheless produce significant ILR. Wang, Kreft, and Oxenham (2015) showed that loudness context effects are also present in cochlear implant (CI) users for whom the cochlea, and respectively any medial efferent feedback to the cochlea, is bypassed. This indicates as well that medial efferent effects play not a major role as underlying mechanism for loudness context effects, although they may affect the absolute outcome to some extent as indicated by Wang, Kreft, and Oxenham (2016). They observed loudness enhancement in CI users not seen in normal hearing subjects at comparable stimulus conditions which might be due to an overall loudness reduction caused by a medial efferent reflex in the normal hearing subjects. Overall, for the stimulus settings in this study, it is reasonable to assume that most peripheral and medial efferent effects should play no or only a minor role.

Which effects can be expected in cortical AEP when using stimulus configurations similar to ILR experiments? Previous EEG studies investigated the change of the cortical AEP for a series of tone pulses using the same frequency, but with varying stimulus onset asynchrony and ISI (Davis, Mast, Yoshie, & Zerlin, 1966; Lanting, Briley, Sumner, & Krumbholz, 2013; Nelson & Lassman, 1968), that is, they used stimulus configurations that are comparable to ILR experiments. These studies found a strong decrease of the neural response strength to the second and later tone pulses with respect to the first tone. This decrease of strength is referred to as repetition suppression. It can be assumed that it is consequence of an overloading-related reduction of synchronously firing neurons or specific neural circuits. In a further step, Lanting et al. (2013) found (for a stimulus condition that is comparable to the experiment in this study—their Experiment 1) that with increasing ISI, the repetition suppression decreased, that is, the related cortical AEPs increased again. Several studies that investigated the relationship of loudness and cortical AEPs suggest a positive correlation between both (Hegerl et al., 1994; Hoppe et al., 2001; Mariam et al., 2012; Mulert et al., 2002; Näätänen & Picton, 1987; Potter et al., 2017). The combination of these findings suggests that a decrease in repetition suppression, that is, an increase in EEG amplitude (with increasing ISI) goes hand in hand with a respective increase in loudness perception. At first glance, this appears to contradict the psychoacoustical results of Arieh and Marks (2003a). However, Lanting et al. (2013) used the same level for inducer and target tone while ILR can only be expected if a certain level difference between inducer and target tone is exceeded. Furthermore, they showed that the different cortical components have different adaptation properties. They suggested that mainly cortical components at later stages, particularly the vertex-positive deflection around 200 ms (P2), are involved in the decrease of the repetition suppression. It could be that response components other than P2 (e.g., N1) may reflect different aspects of loudness processing and therefore are less, or even oppositely, affected. One of the rare studies combining context effects of loudness and neural responses found that the amplitude of the N1 deflection represents an electrophysiological correlate of the effects of nonsimultaneous masking on intensity resolution (Oberfeld, 2010), which possibly also indicates a direct relation between the AEP deflection amplitudes and loudness enhancement at short ISIs (Oberfeld, 2008). However, the temporal properties of loudness enhancement and ILR are different, suggesting possibly different underlying mechanisms affecting the two processes. For longer ISIs (0.8–8 s), Lu et al. (1992) conducted magnetoencephalography (MEG) and psychoacoustic measurements at the same time. They found that the individual time constants of loudness-context effects, that is, loudness increase or decrease with increasing ISI, were closely correlated to the respective individual time constant of an increasing N1m peak. They interpreted the increase of N1m as the recovery of a neuronal circuit representing a memory trace1 of the loudness of the first tone pulse. The memory trace initiated by the target tone (and serving as the reference for loudness comparisons) decays with increasing ISI and is more and more replaced by a long-term loudness reference given by the middle of the range of stimuli (central tendency). In summary, the literature provides clear evidence that differences in loudness due to context effects are mainly related to loudness processing on higher stages of the auditory pathway, while earlier processing stages—especially cochlear compression and medial efferent effects—are likely to play a minor role only. Furthermore, in comparison to purely intensity-related AEP correlates, neural responses that reflect those specific context-related differences in loudness would indicate a more complete, or at least a different, representation of loudness at the respective processing stages.

Within this context, the focus of this study lies on the relation between AEPs (including the N1 deflection) from EEG recordings and ILR at longer ISIs. In particular, this study looks for features in the cortical AEP responses that correlate directly with the loudness differences caused by ILR as a context effect. Therefore, we included the ILR paradigm directly in an EEG experiment, measuring a subset of the ISIs investigated by Arieh and Marks (2003a). The synchronous measurement of psychoacoustics and EEG provides two possible advantages: (a) The recorded EEG data are directly linked to the psychoacoustic outcome—no effects have to be considered due to different attention or physiological status of the subject, which may occur when performing EEG and psychoacoustic measurements at different times. (b) A possible enhancement of the neural activation, since recent studies recommended active listening tasks to enhance neural activation related to the investigated features (Bennington & Polich, 1999; Öhman & Lader, 1972; O’Sullivan, Shamma, & Lalor, 2015).

The features we investigated are the condition-related changes of strength and latency of the cortical components in the AEP. The cortical components that we consider are the vertex-negative deflection at around 100 ms (N1), the vertex-positive deflection around 200 ms (P2), and the vertex-negative deflection around 250 ms (N2). These selected components are popular candidates to represent cortical activation at different stages. We tested whether (a) the strength of cortical components increased with increasing loudness and (b) the latency of cortical components changed with loudness.

When the eliciting target stimuli in an ILR experiment are identical for the different ISI conditions, any loudness differences between conditions are independent of sound intensity. Therefore, a relationship between cortical AEPs and context-related loudness changes would provide an indicator covarying with perceived loudness and not intensity differences of the respective processing stage.

Later AEPs such as N2 may already also reflect cognitive control mechanisms (Folstein & Petten, 2008), whereas the N1-P2 deflection is assumed to be associated with sensory evoked potentials that are most probably not representing conscious processes such as attention or decision-making (Polich, 1993).

Therefore, a correlation between earlier cortical AEPs and the contextual loudness would provide some evidence that the neural representation of the stimulus loudness is adapted, rather than observing a bias in response only, whereas a correlation only with N2 or later AEPs would indicate the opposite.

Methods

Subjects

Twelve subjects, six males (S1, S3, S6, S7, S10, and S11) and six females (S2, S4, S5, S8, S9, and S12), with clinically normal hearing participated in the experiments. All had hearing thresholds ≤ 15 dB HL at standard audiometric frequencies between 125 and 8000 Hz. The subjects were right-handed, between 20 and 30 years old and were paid volunteers.2

All experimental procedures were approved by the ethics committee of the University of Oldenburg.

Stimulation and Recording

In line with the experiment by Arieh and Marks (2003a), the stimuli used in the experiment were different sequences of tone pulses with an overall duration of 50 ms, including 5 ms cosine rise and decay. A sequence generally consisted of one 2500 Hz inducer tone at 80 dB SPL, one 2500 Hz target tone at 60 dB SPL and one 500 Hz comparison tone with adjustable sound level (Figure 1). Due to the frequency specificity of ILR, the inducer and target tone on the one hand and comparison tone on the other were presented at different frequencies. According to Marks and Warner (1991), the comparison tone should differ by at least one critical band from the frequency of inducer and target tone, to ensure that the inducer is affecting the target tone alone, rather than also causing an ILR of the comparison tone (see, e.g., Arieh & Marks, (2003a), Experiment 2). Four conditions were presented in pseudorandomized order and differed in the length of the ISI between inducer and target tone: (a) without an inducer tone (henceforth referred to as baseline condition), and with an inducer tone to the target tone at (b) 150, (c) 525, and (d) 1,650 ms. For all conditions, the ISI between target and comparison tone was 1,000 ms. Signal generation and conditioning including attenuation was performed digitally on a PC by a MATLAB R2006b (the Mathworks) based custom software. The stimuli were digital-to-analog converted at a sampling frequency of 44.1 kHz using a Fireface UCX (RME) as an external sound device and were presented diotically via ER 2 insert earphones (Etymotic Research) driven by an HB7 headphone buffer (Tucker Davies Technologies). The ER 2 insert earphones were calibrated using an IEC711 compatible ear simulator for insert phones (Brüel & Kjaer, Typ 4157). During calibration, the earphones were connected to the ear simulator via a foam tip plugged into the external-ear simulator (Brüel & Kjaer, Typ DB2012).

Figure 1.

Figure 1.

The stimulus sequences used to measure the time course of ILR. The inducer tone creates the context of the target tone. The time course of ILR was measured by varying the ISI between the inducer tone and the target tone. The comparison tone level was adjusted in an adaptive procedure to determine the loudness of the target tone.

The subject’s response to the psychoacoustic task was via a button-response box, which sent a response-specific trigger signal to the EEG recording system to be stored in the EEG data, and gave a correlated input to the stimulation PC controlling the psychoacoustic measurement procedure.

Four conditions were presented in pseudorandomized order and differed in the length of the ISI between inducer and target tone: (a) without an inducer tone (henceforth referred to as baseline condition), and with an inducer tone to the target tone at (b) 150, (c) 525, and (d) 1,650 ms. For all conditions, the ISI between target and comparison tone was 1,000 ms.

The psychoacoustic measurement was realized using the AFC software package, a tool designed to run psychoacoustic measurements in Matlab (Ewert, 2013). Similar to the study of Arieh and Marks (2003a), the four conditions (baseline and ISIs between target and comparison tone of 150, 525, and 1,650 ms) were measured in a randomized order. The loudness of the 2500-Hz target tone fixed at 60 dB SPL was estimated from the results for the measurement of two interleaved tracks, referred to as the ascending and descending track. The order of the presentation from the two tracks was randomized from trial to trial. After the presentation of each stimulus sequence, the listener’s task was to judge whether the target or comparison tone was louder by pressing the appropriate key (two-alternative forced choice, AFC). The level of the comparison tone in each track was adjusted according to the listener’s responses following an adaptive two-up, two-down (2up-2down) procedure, that is, the level of the comparison tone was increased after it was judged as softer than the target in two successive trials within one track, or the level was decreased if the comparison tone was judged as louder in two successive trials. This up-down rule converges to the target probability of 50% on the psychometric function. The step size of the sound level of the comparison tone changed adaptively within each track. After three reversals,3 the step size decreased from 4 to 2 dB. The whole procedure ended after nine reversals in each track. Due to the adaptive procedure, the numbers of trials in each run varied. The average number of trials was 156, but varied between 77 and 210. Arieh and Marks (2003a) had measured at first the baseline condition in each subject, using start levels for the comparison tones of 40 and 80 dB SPL for the ascending and descending track, respectively. Then they measured the other conditions with start levels of 25 dB below and above the individual baselines of the test persons. In this study, we used start levels of 25 dB below and above the nominal target level of 60 dB SPL for the measurement of the various ISI conditions as well as for the baseline condition, in order to allow a randomized order for the measurement across all conditions (including the baseline condition).

Parallel to the loudness estimation procedure, EEG was recorded with a Biosemi-Active Two system using 64 channels, with the electrodes placed according to the international 10-20 system. Contact gel (Signa gel Electrode Gel, Parker) was used between electrodes and scalp. The electrode offset was not higher than 10 mV. The recordings were collected and digitalized on a second PC using the ActiView software (6.03, Biosemi) at 1024 Hz sampling frequency. Each session lasted 10 to 15 min. A 10-min break separated consecutive sessions.

Data Processing and Analysis

The psychoacoustic data from the adaptive loudness matches provided a series of loudness judgments that gradually converged to the value of equal loudness. The point of equal loudness between the target and reference was calculated by averaging the last six reversal points. Subsequently, the arithmetic mean was determined from the ascending and descending tracks. The listeners were instructed to ignore the inducer tone and judge whether the target or the comparison tone was louder.

All EEG data were processed offline using MATLAB. The average of all electrodes was used as the virtual reference electrode. For the evaluation of the cortical responses, a cluster of nine electrodes around the Fcz-electrode (Fcz, F1, F2, Fz, C1, C2, C3, C4, and Cz) was examined. Generally, diotical stimulation favors the recording at central electrodes. Furthermore, the findings of Lanting et al. (2013) regarding the topographic activation of the responses N1 and P2 to the target tone showed that some of the frontal electrodes were most appropriate to measure the AEP to the consecutive tone pulse. After applying high-pass filtering at 0.5 Hz to reduce the electrode drift and low-pass filtering at 8 Hz to reduce alpha-wave artifacts,4 the data were down-sampled to 64 Hz sampling frequency. Further reduction of artifacts caused by eye movement was carried out using independent component analysis (MATLAB toolbox eeglab 4.4 b). For the two shorter ISI conditions (150 ms and 525 ms), there was a temporal overlap between the AEPs of the inducer and the target tone. Therefore, the response to the inducer tone from the longest ISI condition (1,650 ms) was subtracted from these conditions. Afterwards, the data were separated into epochs. The epochs were averaged using an iterated weighted averaging procedure (Riedel, Granzow, & Kollmeier, 2001) with two iterations. This procedure provided weights for each epoch according to its estimated amount of noise contamination.

Our study focused on the differences in amplitudes and latencies of the cortical components N1, P2, and N2 of the test conditions to the baseline condition. We used the peak-to-peak amplitude between the P2 and N1 deflections and between the N2 and P2 deflections as well as the mean amplitude over different time windows as representations of these components. Peak-to-peak amplitudes of the components were extracted by selecting the minimum amplitude for the negative components (N1 and N2) or the maximum for the positive component (P2), each within physiologically reasonable time windows. The respective time windows were chosen for N1 from 75 to 170 ms, for P2 from 170 to 310 ms, and for N2 from 200 to 510 ms. The width of these windows was chosen based on the measured temporal occurrence of the components in the mean EEG response to the target tone. The position of each peak was used as a representation of the latency of the respective cortical component.

Statistical Analysis

We aimed to test whether the trend of the evoked responses of the target tone was driven by a decrease of the repetition suppression or by the ILR, which would show an opposite trend. Therefore, we compared the order of the averaged AFC judgments for the different ISI conditions with the order of the strength of the cortical components. Assuming independent and identically distributed observations and normal distribution for the data, an analysis of variance for repeated measures (rmANOVA), including the Mauchly test for sphericity and a corresponding Greenhouse–Geisser correction, was carried out to examine whether the means of the prospective neural correlates across subjects differed with the conditions. This was also carried out for the sound levels derived from the AFC judgments. Furthermore, we compared the changes in the loudness matches to the changes in the EEG amplitudes within subjects. We used the approach of Bland and Altman (1995) for the calculation of correlation coefficients with repeated observations. This method allows study of the direct relationship between individual loudness matches and individual EEG responses by calculating the correlation within subjects.

Results

Figure 2 illustrates the results of the AFC judgments for the three ISI conditions. The difference between the adjusted comparison tone of the baseline condition (51.4 ± 2.3 dB) and the ISI conditions was shown. By increasing the ISI between the inducer tone and the target tone, the level of the adjusted comparison tone decreased. This implies a decrease in loudness of the target tone. The level difference between the ISI conditions 150 ms and 525 ms is about 4 dB, whereas it amounts to only 2.5 dB between 525 ms and 1,650 ms. The decreasing effect of the inducer tone on the loudness perception of the target tone was highly significant—rmANOVA: F(2, 22) = 8.789, ε^ = 0.862, p < .004, η2 = 0.44.

Figure 2.

Figure 2.

The level of the comparison tones was adjusted in equal loudness to the 60 dB target tone for the baseline condition and several ISI conditions. ΔL represents the level difference between the adjusted comparison tone level under the baseline condition and the comparison tone levels under ISI conditions. The error bars indicate the 95% confidence intervals across subjects. If ΔL > 0, the target tone of this ISI condition was perceived as louder than in the baseline condition, and vice versa.

The inducer tone also had a considerable effect on the EEG response of the target tone. The strength of cortical components of the target tone response among the ISI conditions was clearly reduced for all conditions compared with the baseline response (Figure 3), but across the ISI conditions, the mean amplitudes relative to the baseline condition of the different cortical components showed disparate behavior (Figure 4(a) and (b)). The mean amplitude difference of N1-P2 decreased significantly with increasing ISI—rmANOVA: F(2, 22) = 4.9, ε^ = 0.862, p = .023, η2 = 0.078. On the other hand, the mean amplitude difference of P2-N2 increased proportionally with the ISI—rmANOVA: F(2, 22) = 4.68, ε^ = 1, p = .02, η2 = 0.12. However, these significant effects could only be shown for the mean amplitude, not for the peak-to-peak amplitude. The inducer tone had no significant effect on the latencies of the cortical components (Figure 4(c) and (d)). The results are summarized in more detail in Table 1, including the absolute amplitudes of the cortical components for peak-to-peak and mean-amplitude methods as well as their corresponding latencies.

Figure 3.

Figure 3.

The averaged EEG response to the target tone (starting at 0 ms) and to the loudness-adjusted comparison tone (starting at 1,050 ms) for the different ISI conditions. The averaged EEG response to a 60 dB tone pulse and to a corresponding, adjusted, comparison tone in the absence of an inducer tone is represented by the black line (baseline condition). Both tone pulses have a similar cortical response, characterized by a pronounced N1 with a peak at 100 ms, a pronounced P2 with a peak 190 ms, and an N2 with a peak at 320 ms with an extended reload. All ISIs showed a reduced strength of the target tone response. Furthermore, the strength of their N1-P2 deflections showed a decrease with increasing ISI. ISI = interstimulus interval.

Figure 4.

Figure 4.

(a–d) Boxplots of the investigated features of the EEG response to the target tone for the three different ISI conditions. In the top panels, ΔA represents the amplitude differences of the N1-P2 (a) and N2-P2 (b) deflections between the condition with an inducer tone and the baseline without inducer tone. The amplitudes were extracted by using the mean amplitude method. In the bottom panels, the latencies of the N1 (c) and of the P2 (d) are displayed. The mean values across the subjects are marked by black triangles. ISI = interstimulus interval.

Table 1.

Cortical Features of the EEG-Response to the Target Tone in the Baseline Condition and in the Different ISI Conditions With Their Corresponding ANOVA Results.

ISI-conditions
rmANOVA results
Cortical components Baseline 150 ms—Baseline 525 ms—Baseline 1,650 ms—Baseline F p
Amplitude
 Peak-to-peak N1-P2/µV 1.80 ± 0.36 −0.45 ± 0.25 −0.69 ± 0.21 −0.86 ± 0.20 2.43 .11
P2-N2/µV 2.35 ± 0.29 −1.41 ± 0.10 −1.14 ± 0.18 −1.10 ± 0.14 1.97 .16
 Mean amplitude N1-P2/µV 0.33 ± 0.22 0.26 ± 0.21 0.03 ± 0.20 0.11 ± 0.20 4.9 .023
P2-N2/µV 0.62 ± 0.14 −0.62 ± 0.061 −0.44 ± 0.074 −0.39 ± 0.095 4.68 .02
Latency N1/ms 106.8 ± 2.6 132.8 ± 8.9 136.7 ± 8.4 125 ± 7.9 1.24 .30
P2/ms 199.2 ± 6.4 252.6 ± 11.3 234.4 ± 9 229.2 ± 11.9 1.41 .26

Note. The N1-P2 and N2-P2 deflections were extracted by using two different approaches: peak-to-peak and mean amplitude. The amplitude differences were normalized by subtracting the baseline condition. Latencies were extracted at the position of the peaks of the N1 and P2 components. One-way ANOVA was performed across ISI conditions and 12 subjects. In the bottom part of the table, absolute latencies of the respective EEG responses are given without a baseline correction. rmANOVA = repeated measures analysis of variance; ISI = interstimulus interval.

Correlation analysis between individual loudness matches and individual EEG responses showed a significant correlation between N1-P2 amplitude and loudness matches (Rpeak-to-peak=0.31, p = .03; Rmean=0.38, p = .01; cf. Figure 5). There was no significant correlation between the P2-N2 amplitude and individual loudness. The latencies of the cortical components also did not correlate with the individual loudness matches.

Figure 5.

Figure 5.

Scatter plot of the individual EEG responses of the N1-P2 amplitude and the individual loudness matches with parallel lines fitted for each subject as suggested by Bland and Altman (1995). ΔL represents the level difference between the adjusted comparison tone level under the baseline condition and the comparison tone levels under ISI conditions. The different open symbols and colors represent the different subjects. The filled symbols in the middle of each colored (red, blue, gray) line are just markers indicating which line belongs to which data points (open symbols) with the same shape and color. The black line is the regression line for all data points after subtraction of the subjects’ individual offsets (as indicated by the colored lines).

Discussion

The ILR effect for loudness found by Arieh and Marks (2003a) was essentially reproduced for the three conditions in the psychoacoustical experiment. However, the measured level differences of the comparison tones between the conditions in our data are slightly lower than in Arieh and Marks’ study. Furthermore, the average data found here may even indicate a slight loudness enhancement for an ISI of 150 ms, which was not seen in Arieh and Marks (2003a) for the same signal configuration. These differences can have several reasons: One reason might be that the relative small number of subjects in this study and the high intersubject variability in this type of experiments especially for higher level inducer tones (e.g., Oberfeld, 2007) biasing the average outcome here. Another reason might be differences in signal presentation due to the use of different speakers. The strength of both ILR and loudness enhancement is clearly dependent both on the level difference as well as on the absolute level of inducer and target tone (Oberfeld, 2007). Differences across studies in the effective sound level at the eardrum can easily occur when using different headphones while presenting the sound at the same nominal sound pressure level (SPL).5 The effective level deviation from their specific level condition due to different headphones and a respective calibration mismatch can potentially be another source for the observed differences in the context related loudness effects in this study. Since Arieh and Marks (2003a) discussed that their specific level configuration of inducer and target level may lead to cancellation of loudness enhancement and ILR processes at short ISIs and that this might be the reason for almost no loudness enhancement at short ISIs in their data. An effective level deviation from their specific level condition due to a respective calibration mismatch between different head- or earphones may explain even the observed loudness enhancement at an ISI of 150 ms in this study. A further reason for the differences with Arieh and Marks (2003a) could be the different starting levels of the comparison tones for the ascending and descending tracks when measuring the ISI conditions. They used 25 dB above and below the listener’s individual baseline, whereas in this study, 25 dB above and below the nominal target level had been used. In addition, Lu et al. (1992, see Introduction section) assumed that with increasing ISI, the remembered loudness of the target tone tends toward a long-term loudness. They argue that the long-term loudness is essentially determined by a kind of average level during the running test session. Thinking in these terms, the differences in ILR between the studies might also be related to differences in remembered long-term loudness. A different number of stimulus conditions or different temporal settings, such as different pause durations between trials and so on, may differ across studies and possibly affect the integration process for both the respective long-term loudness and loudness matches at long ISIs.

The amplitude of the P2-N2 deflection—as part of the cortical response to the target tone—increased with increasing duration of the ISI. This decrease of repetition suppression is in line with the expectations of previous studies (Davis et al., 1966; Lanting et al., 2013; Nelson & Lassman, 1968). However, comparing the loudness matches with the amplitudes of the P2-N2 deflection within subjects, no correlation was found. On the contrary, the amplitude of the N1-P2 deflection decreased with increasing duration of the ISI, which is in agreement with the psychoacoustically measured decrease in loudness. This effect was most evident when examining the mean amplitude, since the peak-to-peak amplitude only showed significant correlation when compared with the individual loudness matches. The N1-P2 mean amplitude of the target tone of the ISI at 150 ms is higher than in the baseline condition (Figure 4). This gives some indication that the N1-P2 deflection also reflects the effect of loudness enhancement. In addition, the correlation coefficient for the peak-to-peak amplitude was slightly lower than for the mean amplitude. This is probably due to the fact that peak extraction cannot take temporally wider deflections into account. The reduction of the N1-P2 deflection with decreasing loudness is in agreement with the findings by Hoppe et al. (2001) in CI users. For the different ISI, the target stimuli were identical in this study. That means that the changes in loudness, as well as in the coinciding N1-P2 deflection, as observed here, are not related to the physical stimulus intensity.

Compared to the strength of a normal cortical response reported in literature (1–2 µV for the deflections of individual components; Picton, Hillyard, Krausz, & Galambos, 1974, e.g., the response to the comparison tone), a strong reduction in the strength of the cortical response was observed for both N1-P2 and P2-N2 deflections. This generally coincides with the results of previous studies, in which a decrease of repetition suppression for the cortical AEPs was observed (Davis et al., 1966; Lanting et al., 2013; Nelson & Lassman, 1968). The N1-P2 deflection also shows this overall reduction6 but no decrease of repetition suppression with increasing ISI. It might therefore be possible that the change in the underlying AEP component (related to the time course of ILR or loudness enhancement) is even larger than observed here, but that it is partially masked by the general trend of a decrease of repetition suppression. Nevertheless, the representation of the ILR effect in the N1-P2 deflection suggests that loudness is represented in the auditory cortex. This finding is in agreement with the results of some previous studies (Behler & Uppenkamp, 2016; Röhl & Uppenkamp, 2012; Thwaites et al., 2016). Behler and Uppenkamp (2016) showed in a functional magnetic resonance imaging study that the correlation between the blood oxygenation level dependent (BOLD) signal and loudness increases at higher stages in the auditory pathway. Maximum correlation was reached at the posterior medial Heschl’s gyrus. Similar observations were reported by Thwaites et al. (2016). They found in an MEG study several cortical components corresponding to the loudness of speech estimated by a loudness model. Using a cross-correlation analysis, they found four components with significant correlation at latencies of 45 ms, 100 ms, 165 ms, and 275 ms. The latest of these components provided a higher correlation with short-term loudness (i.e., low-pass filtered) than with the instantaneous loudness. Hence, they concluded that the latest component represented short-term loudness, which is seen as a more fully integrated representation of loudness in higher auditory stages in comparison to instantaneous loudness or purely sound level related correlates. For latencies between 75 ms and 310 ms, we found that N1-P2 deflection also correlates with contextual effects in loudness, suggesting that for these latencies and the associated neuronal processing stages, already a relatively largely integrated loudness representation (i.e., it already includes the representation of many different aspects of loudness) is given.

Overall, it can be assumed that loudness context effects are essentially processed at the cortical level, that is, at relatively high processing levels. However, it is not entirely clear whether the processing is essentially still part of the sensory stimulus processing or whether it is rather a shift of the response criteria and thus a processing on even higher levels in connection with conscious judgment processes (Algom & Marks, 1990; Arieh & Marks, 2003b; Schneider & Parker, 1990). Investigating the response times of neural processing can be useful to clarify this issue. For example, in Arieh and Marks (2003b), listeners had to detect weak tones in a classification task. They found that under conditions that produce loudness context effects, the listeners showed increased response times and higher error rates compared to control conditions. According to Luce (1986), a positive relation between response time and error rate is a strong indicator for a sensory, rather than for a decisional, change. That means, from the psychoacoustical point of view, that the effect of loudness context effects is—at least partially—a change in the sensory representation of the target tone rather than a response bias. From the neurophysiological aspect, the N1-P2 deflection is not expected to reflect processing stages that were already involved into decisional processes, but more in providing specific feature traces of the stimuli and perhaps to some extent complete stimulus representations (for a review, see Näätänen & Winkler, 1999). Under this premise, the relation found between N1-P2 deflection and the time course of loudness context effects provides neurophysiological evidence that loudness context effects causes a context-related change of the sensory-stimulus representation.

It should be noted that in the current EEG analysis, a low-pass filter was used with a relatively low cutoff frequency of 8 Hz, which reduced the amplitudes of the measured cortical components, especially affecting the P1 component. Similar to Hoppe et al. (2001), who used a cut off frequency at 10 Hz, this makes a quantitative analysis of the P1 amplitude almost impossible. Therefore, only the N1-P2 and P2-N2 deflections were examined here. This has two major drawbacks: (a) it significantly reduces the temporal resolution of the components; and (b) it impedes a separate consideration of the deflections. In particular, it would be interesting to investigate the N1 and P2 separately, since they represent different dipole sources (Crowley & Colrain, 2004). This filtering adjustment was necessary because of frequently observed alpha waves during the experiment, presumable induced by the effort of the AFC task processing. This is in agreement with the findings by Klimesch (1999), who showed that cognitive and memory performance was reflected by an increase of alpha activity. This has to be seen as a clear drawback of the synchronous measurement of AEP and psychoacoustic tasks.

Summary and Conclusion

Neural representation of loudness context effects in the EEG response was demonstrated by using an AFC paradigm during the EEG recordings. The strength of the N1-P2 deflection showed the same trend as the time course of loudness context effects, that is, a decrease with increasing ISI, whereas the P2-N2 deflection showed a recovery from repetition suppression indicated by a relative increase with increasing ISI. In the presence of an inducer tone at 80 dB SPL, both deflections for all ISI were clearly reduced compared to the baseline condition. The following conclusions are drawn:

  1. Since the target stimuli for all ISI were identical, the N1-P2 deflection provides a correlation to loudness, rather than only to physical sound intensity. The correlation to context effects provides an indication that a relatively far-reaching integration of loudness representation is already given for the corresponding latency range and the associated processing stages.

  2. The finding that the correlation to loudness context effects is already reflected in the N1-P2 deflection provides neurophysiological evidence that context effects cause at least partially an adaption of the neural loudness representation of the stimulus and not (only) a shift in the decisional processes.

Acknowledgments

The authors would like to thank all participants for volunteering. Language services on an earlier version of the manuscript were provided by www.stels-ol.de.

Notes

1

See, for example, Näätänen and Winkler (1999) for the concept of memory traces in auditory processing.

2

The subjects were asked if they were right-handed and in case of ambiguities with which hand they write.

3

Reversal means an alteration in the direction of the level changes of the comparison tone (from increase to decrease or vice versa).

4

For filtering, a digital implementation of a second-order Butterworth filter was used with the MATLAB function filtfilt. filtfilt performs zero-phase digital filtering by processing the input data in both the forward and reverse directions.

5

Calibrated headphones of different types typically provide different sound pressures at the eardrum for the same nominal SPL, as indicated, for example, by different Reference Equivalent Threshold Sound Pressure Level (RETSPL) as provided for audiometric headphones. For example, at 2.5 kHz, the RETSPL value for a HDA200 (Sennheiser) differs by as much as 7.5 dB from a TDH39 (Telephonics) on the same ear simulator (see, e.g., Champlin & Letowski, 2014). The TDH49 (Telephonics) used by Arieh and Marks has RETSPL corrections of 11 dB or 9.5 dB at 2 or 3 kHz, respectively (see, e.g., Champlin & Letowski, 2014), while the ER2 insert earphones as used in this study are in the order of 15 to 16 dB (Han & Poulsen, 1998) at the same frequencies.

6

With respect to the effect strength of the repetition suppression for tone pulses of either different or the same frequencies, the current results are in line with the considerations of Lanting et al. (2013). They argued that for consecutive tones with different features, for example, frequency, the subsequent tone will recruit unadapted neurons and thus show almost no repetition suppression, while this does not apply for similar tones. This agrees with psychoacoustic findings indicating the largest loudness reduction when the inducer tone and the target fall within the same critical band (Marks & Warner, 1991). However, Lanting et al. (2013) contradicts other AEP studies on prepulse inhibition (e.g., Schall, Schön, Zerbin, Eggers, & Oades, 1996) that measured frequency-independent repetition suppression. In our results, this is reflected by comparing the amplitude of the responses of the target and the comparison tone. It can be clearly seen that the strengths of the cortical responses to the comparison tone are higher than to the target tone. The target tone response is a subsequent tone pulse to the inducer tone and both have the same frequency. The comparison tone, however, is the subsequent tone pulse to the target tone with a different frequency than its predecessor. This observation is best illustrated by comparing the cortical responses for the baseline condition for which the strength was of the same order of magnitude.

Declaration of Conflicting Interests

The authors declared no potential conflicts of interest with respect to the research, authorship, and/or publication of this article.

Funding

The authors disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This study was funded by the DFG (Cluster of Excellence 1077 “Hearing4all”) at the University of Oldenburg.

ORCID iD

Florian H. Schmidt https://orcid.org/0000-0002-8344-0006

References

  1. Algom D., Marks L. E. (1990). Range and regression, loudness scales, and loudness processing: Toward a context-bound psychophysics. Journal of Experimental Psychology: Human Perception and Performance, 16, 706–727. doi:10.1037//0096-1523.16.4.706 [DOI] [PubMed] [Google Scholar]
  2. Anderson N. H. (1975). On the role of context effects in psychophysical judgment. Psychological Review, 82, 462–482. doi:10.1037/0033-295X.82.6.462 [PubMed] [Google Scholar]
  3. Arieh Y., Kelly K., Marks L. E., Pierce J.B. (2005). Tracking the time to recovery after induced loudness reduction (L). The Journal of the Acoustical Society of America, 117, 3381–3384. doi:10.1121/1.1898103 [DOI] [PubMed] [Google Scholar]
  4. Arieh Y., Marks L. E. (2003. a) Time course of loudness recalibration: Implications for loudness enhancement. The Journal of the Acoustical Society of America, 114(3), 1550–1556. doi:10.1121/1.1603768 [DOI] [PubMed] [Google Scholar]
  5. Arieh Y., Marks L. E. (2003. b). Recalibrating the auditory system: A speed-accuracy analysis of intensity perception. Journal of Experimental Psychology: Human Perception & Performance, 29, 523–536. doi:10.1037/0096-1523.29.3.523 [DOI] [PubMed] [Google Scholar]
  6. Arieh Y., Marks L. E. (2011). Measurement of loudness, Part II: Context effects. In Loudness, Springer handbook of auditory research (pp. 57–87). New York, NY: Springer.
  7. Babkoff H., Pratt H., Kempinski D. (1984). Auditory brainstem evoked potential latency-intensity functions: A corrective algorithm. Hearing Research, 16, 243–249. [DOI] [PubMed] [Google Scholar]
  8. Bauer J. W., Elmasian R. O., Galambos R. (1974). Loudness enhancement in man. I. Brainstem-evoked response correlates. The Journal of the Acoustical Society of America, 57(1), 165–171. doi:10.1121/1.380410 [DOI] [PubMed] [Google Scholar]
  9. Behler O., Uppenkamp S. (2016). The representation of level and loudness in the central auditory system for unilateral stimulation. NeuroImage, 139, 176–188. doi:10.1016/j.neuroimage.2016.06.025 [DOI] [PubMed] [Google Scholar]
  10. Bennington J. Y., Polich J. (1999). Comparison of P300 from passive and active tasks for auditory and visual stimuli. International Journal of Psychophysiology, 34, 171–177. [DOI] [PubMed] [Google Scholar]
  11. Bland J. M., Altman D. G. (1995). Calculating correlation coefficients with repeated observations: Part 1—Correlation within subjects. British Medical Journal, 310, 446. doi:10.1136/bmj.310.6973.170 [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Castro F. Z., de Prat J. J. B., Zabala E. L. (2008). Loudness and auditory steady-state responses in normal-hearing subjects. International Journal of Audiology, 47, 269–275. doi:10.1080/14992020801945501 [DOI] [PubMed] [Google Scholar]
  13. Champlin C. A., Letowski T. (2014). Audiometric calibration: Air conduction. Seminars in Hearing, 35(4), 312–328. [Google Scholar]
  14. Crowley K., Colrain I. (2004). A review of the evidence for P2 being an independent component process: Age, sleep, and modality. Clinical Neurophysiology, 115, 732–744. doi:10.1016/j.clinph.2003.11.021 [DOI] [PubMed] [Google Scholar]
  15. Darling R. M., Price L. L. (1990). Loudness and auditory brain stem evoked response. Ear & Hearing, 11(4), 289–295. [DOI] [PubMed] [Google Scholar]
  16. Dau T., Wegner O., Mellert V., Kollmeier B. (2000). Auditory brainstem responses with optimized chirp signals compensating basilar-membrane dispersion. The Journal of the Acoustical Society of America, 107(3), 1530–1540. doi:10.1121/1.428438 [DOI] [PubMed] [Google Scholar]
  17. Davis H., Mast T., Yoshie N., Zerlin S. (1966). The slow response of the human cortex to auditory stimuli: Recovery process. Electroencephalography and Clinical Neurophysiology, 21, 105–113. doi:10.1016/0013-4694(66)90118-0 [DOI] [PubMed] [Google Scholar]
  18. Eeckhoutte M. Van, Wouters J., Francart T. (2016). Auditory steady-state responses as neural correlates of loudness growth. Hearing Research, 342, 58–68. doi:10.1016/j.heares.2016.09.009 [DOI] [PubMed] [Google Scholar]
  19. Elmasian R., Galambos R., Bernheim A. (1980). Loudness enhancement and decrement in four paradigms. The Journal of the Acoustical Society of America, 67, 601–607. doi:10.1121/1.383937 [DOI] [PubMed] [Google Scholar]
  20. Emara A. A. Y., Kolkaila E. A. (2010). Prediction of loudness growth in subjects with sensorineural hearing loss using auditory steady state response. The Journal of International Advanced Otology, 6(3), 371–379. [Google Scholar]
  21. Ewert S. D. (2013). AFC – A modular framework for running psychoacoustic experiments and computational perception models. Proceedings of the International Conference on Acoustics AIA-DAGA 2013, Merano, Italy, 1326–1329.
  22. Fobel O., Dau T. (2004). Searching for the optimal stimulus eliciting auditory brainstem responses in humans. The Journal of the Acoustical Society of America, 116(4), 2213–2222. doi:10.1121/1.1787523 [DOI] [PubMed] [Google Scholar]
  23. Folstein J. R., Petten C. V. (2008). Influence of cognitive control and mismatch on the N2 component of the ERP: A review. Psychophysiology, 45, 152–170. doi:10.1111/j.1469-8986.2007.00602.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Han L. A., Poulsen T. (1998). Equivalent threshold sound pressure levels for Sennheiser HDA 200 earphone and Etymotic Research ER-2 insert earphone in the frequency range 125 Hz to 16 kHz. Scandinavian Audiology, 27(2), 105–112. [DOI] [PubMed] [Google Scholar]
  25. Hegerl U., Gallinat J., Mrowinski D. (1994). Intensity dependence of auditory evoked dipole source activity. International Journal of Psychophysiology, 17, 1–13. doi:10.1016/0167-8760(94)90050-7 [DOI] [PubMed] [Google Scholar]
  26. Hoppe U., Rosanowski F., Iro H., Eysholdt U. (2001). Loudness perception and late auditory evoked potentials in adult cochlear implant users. Scandinavian Audiology, 30, 119–125. doi:10.1080/010503901300112239 [DOI] [PubMed] [Google Scholar]
  27. Junius D., Dau T. (2005). Influence of cochlear traveling wave and neural adaptation on auditory brainstem responses. Hearing Research, 205, 53–67. doi:10.1016/j.heares.2005.03.001 [DOI] [PubMed] [Google Scholar]
  28. Klimesch W. (1999). EEG alpha and theta oscillations reflect cognitive and memory performance: A review and analysis. Brain Research Reviews, 29, 169–195. doi:10.1016/S0165-0173(98)00056-3 [DOI] [PubMed] [Google Scholar]
  29. Lanting C. P., Briley P. M., Sumner C. J., Krumbholz K. (2013). Mechanisms of adaptation in human auditory cortex. Journal of Neurophysiology, 110, 973–983. doi:10.1152/jn.00547.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Luce R. D. (1986). Response times: Their role in inferring elementary mental organization. New York, NY: Oxford University Press. doi:10.1093/acprof:oso/9780195070019.001.0001 [Google Scholar]
  31. Lu, Z. L., Williamson, S. J., Kaufman, L. (1992). Behavioral lifetime of human auditory sensory memory predicted by physiological measures. Science, 258, doi: 1668–1670. 10.1126/science.1455246 [DOI] [PubMed] [Google Scholar]
  32. Mariam, M., Mustaffa, I., Muzzammil, K., 2012. Feasibility of N1-P2 Habituation of Differentiate Loudness Levels. Proc. ICoBE, Penang. [Google Scholar]
  33. Marks L. E. (1988). Magnitude estimation and sensory matching. Perception & Psychophysics, 43(6), 511–525. doi:10.3758/BF03207739 [DOI] [PubMed] [Google Scholar]
  34. Marks L. E. (1993). Contextual processing of multidimensional and unidimensional auditory stimuli. Journal of Experimental Psychology: Human Perception and Performance, 19(2), 227–249. doi:10.1037/0096-1523.19.2.227 [DOI] [PubMed] [Google Scholar]
  35. Marks L. E., Warner E. (1991). Slippery context effect and critical bands. Journal of Experimental Psychology: Human Perception and Performance, 17, 986–996. doi:10.1037/0096-1523.17.4.986 [DOI] [PubMed] [Google Scholar]
  36. Ménard M., Gallégo S., Berger-Vachon C., Collet L., Thai-Van H. (2008). Relationship between loudness growth function and auditory steady-state response in normal-hearing subjects. Hearing Research, 235, 105–113. doi:10.1016/j.heares.2007.10.007 [DOI] [PubMed] [Google Scholar]
  37. Moore B. C. J. (2013). An introduction to the psychology of hearing (6th ed.). London, England: Academic Press. [Google Scholar]
  38. Mulert C., Juckel G., Augustin H., Hegerl U. (2002). Comparison between the analysis of the loudness dependency of the auditory N1/P2 component with LORETA and dipole source analysis in the prediction of treatment response to the selective serotonin reuptake inhibitor citalopram in major depression. Clinical Neurophysiology, 113, 1566–1572. doi:10.1016/S1388-2457(02)00252-3 [DOI] [PubMed] [Google Scholar]
  39. Näätänen R., Picton T. (1987). The N1 wave of the human electric and magnetic response to sound: A review and an analysis of the component structure. Psychophysiology, 24(4), 375–425. [DOI] [PubMed] [Google Scholar]
  40. Näätänen R., Winkler I. (1999). The concept of auditory stimulus representation in neuroscience. Psychological Bulletin, 125, 826–859. [DOI] [PubMed] [Google Scholar]
  41. Nelson D. A., Lassman F. M. (1968). Effects of intersignal interval on the human auditory evoked response. The Journal of the Acoustical Society of America, 44(6), 1529–1532. doi:10.1121/1.1911292 [DOI] [PubMed] [Google Scholar]
  42. Nieder B., Buus S., Florentine M., Scharf B. (2003). Interactions between test- and inducer-tone durations in induced loudness reduction. The Journal of the Acoustical Society of America, 114(5), 2846–2855. doi:10.1121/1.1616580 [DOI] [PubMed] [Google Scholar]
  43. Oberfeld D. (2007). Loudness changes induced by a proximal sound: Loudness enhancement, loudness recalibration, or both? Journal of the Acoustical Society of America, 121, 2137–2148. doi:10.1121/1.271043 [DOI] [PubMed] [Google Scholar]
  44. Oberfeld D. (2008). The mid-difference hump in forward-masked intensity discrimination. Journal of the Acoustical Society of America, 123(3), 1571–1581. doi:10.1121/1.2837284 [DOI] [PubMed] [Google Scholar]
  45. Oberfeld D. (2010). Electrophysiological correlates of intensity resolution under forward masking In Lopez-Poveda E. A., Palmer A. R., Meddis R. (Eds.), Advances in auditory research: Physiology, psychophysics and models (pp. 99–110). New York, NY: Springer. doi:10.1007/978-1-4419-5686-6_10 [Google Scholar]
  46. Öhman A., Lader M. (1972). Selective attention and ‘habituation’ of the auditory averaged evoked response in humans. Physiology & Behavior, 8, 79–85. doi:10.1016/0031-9384(72)90132-1 [DOI] [PubMed] [Google Scholar]
  47. O’Sullivan J. A., Shamma S. A., Lalor E. C. (2015). Evidence for neural computations of temporal coherence in an auditory scene and their enhancement during active listening. The Journal of Neuroscience, 35(18), 7256–7263. doi:10.1523/JNEUROSCI.4973-14.2015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Picton T. W., Hillyard S. A., Krausz H. I., Galambos R. (1974). Human auditory evoked potentials. I. Evaluation of components. Electroencephalography and Clinical Neurophysiology, 36, 179–190. doi:10.1016/0013-4694(74)90155-2 [DOI] [PubMed] [Google Scholar]
  49. Polich J. (1993). Cognitive brain potentials. Current Directions in Psychological Science, 2(6), 175–179. [Google Scholar]
  50. Potter T., Li S., Nguyen T., Nguyen T., Ince N., Zhang Y. (2017). Characterization of volume-based changes in cortical auditory evoked potentials and prepulse inhibition. Scientific Reports, 7, 1–9. doi:10.1038/s41598-017-11191-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Pratt H., Sohmer H. (1977). Correlations between psychophysical magnitude estimates and simultaneously obtained auditory nerve, brain stem and cortical responses to click stimuli in man. Electroencephalography and Clinical Neurophysiology, 43, 802–812. doi:10.1016/0013-4694(77)90003-7 [DOI] [PubMed] [Google Scholar]
  52. Riedel H., Granzow M., Kollmeier B. (2001). Single-sweep-based methods to improve the quality of auditory brain stem responses Part II: Averaging methods. Zeitschrift für Audiologie, 40(2), 62–85. [Google Scholar]
  53. Röhl M., Uppenkamp S., (2012). Neural coding of sound intensity and loudness in the human auditory system. Journal of Association for Research in Otolaryngology, 13, 369–379. doi:10.1007/s10162-012-0315-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Schall U., Schön A., Zerbin D., Eggers C., Oades R.D. (1996). Event-related potentials during an auditory discrimination with prepulse inhibition in patients with schizophrenia, obsessive-compulsive disorder and healthy subjects. International Journal of Neuroscience, 84, 15–33. doi:10.3109/00207459608987247 [DOI] [PubMed] [Google Scholar]
  55. Scharf B., Buus S., Nieder B. (2002). Loudness enhancement: Induced loudness reduction in disguise? The Journal of the Acoustical Society of America, 112(3), 807–810. doi:10.1121/1.1500755 [DOI] [PubMed] [Google Scholar]
  56. Schneider B., Parker S. (1990). Does stimulus context affect loudness or only loudness judgments? Perception & Psychophysics, 48, 409–418. doi:10.3758/BF03211584 [DOI] [PubMed] [Google Scholar]
  57. Serpanos Y. C., O’Malley H., Gravel J. S. (1997). The relationship between loudness intensity functions and the click-ABR wave V latency. Ear & Hearing, 18(5), 409–419. [DOI] [PubMed] [Google Scholar]
  58. Silva I., Epstein M. (2010). Estimating loudness growth from tone-burst evoked responses. The Journal of the Acoustical Society of America, 127(6), 3629–2642. doi:10.1121/1.3397457 [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Silva I., Epstein M. (2012). Objective estimation of loudness growth in hearing-impaired. The Journal of the Acoustical Society of America, 131(1), 353–362. doi:10.1121/1.3666024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Stevens S. S. (1957). On the psychophysical law. Psychological Review, 64, 153–181. doi:10.1037/h0046162 [DOI] [PubMed] [Google Scholar]
  61. Stevens S. S. (1958). Adaptation-level vs the relativity of judgment. The American Journal of Psychology, 71, 633–646. doi:10.2307/1420322 [PubMed] [Google Scholar]
  62. Thwaites A., Glasberg B. R., Nimmo-Smith I., Marslen-Wilson W. D., Moore B. C. J. (2016). Representation of instantaneous and short-term loudness in the human cortex. Frontiers in Neuroscience, 10(183), 1–11. doi:10.3389/fnins.2016.00183 [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Wang N., Kreft H. A., Oxenham A. J. (2015). Loudness context effects in normal-hearing listeners and cochlear-implant users. Journal of the Association for Research in Otolaryngology, 16(4), 535–545. doi:10.1007/s10162-015-0523-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Wang N., Kreft H. A., Oxenham A. J. (2016). Induced loudness reduction and enhancement in acoustic and electric hearing. Journal of the Association for Research in Otolaryngology, 17(4), 383–391. doi:10.1007/s10162-016-0563-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Zhang C., Zeng F.-G. (1997). Loudness of dynamic stimuli in acoustic and electric hearing. The Journal of the Acoustical Society of America, 102, 2925–2934. doi:10.1121/1.420347 [DOI] [PubMed] [Google Scholar]
  66. Zwicker E. (1958). Über psychologische und methodische Grundlagen der Lautheit [On the psychological and methodological basis of loudness]. Acustica, 8, 237–258. [Google Scholar]
  67. Zwicker E., Fastl H. (1999). Psychoacoustics: Facts and models, Bd. 3. Berlin, Germany: Springer. [Google Scholar]

Articles from Trends in Hearing are provided here courtesy of SAGE Publications

RESOURCES