Abstract
Sound localization is important for orienting and focusing attention and for segregating sounds from different sources in the environment. In humans, horizontal sound localization mainly relies on interaural differences in sound arrival time and sound level. Despite their perceptual importance, the neural processing of interaural time and level differences (ITDs and ILDs) remains poorly understood. Animal studies suggest that, in the brainstem, ITDs and ILDs are processed independently by different specialized circuits. The aim of the current study was to investigate whether, at higher processing levels, they remain independent or are integrated into a common code of sound laterality. For that, we measured late auditory cortical potentials in response to changes in sound lateralization elicited by perceptually matched changes in ITD and/or ILD. The responses to the ITD and ILD changes exhibited significant morphological differences. At the same time, however, they originated from overlapping areas of the cortex and showed clear evidence for functional coupling. These results suggest that the auditory cortex contains an integrated code of sound laterality, but also retains independent information about ITD and ILD cues. This cue-related information might be used to assess how consistent the cues are, and thus, how likely they would have arisen from the same source.
Keywords: electroencephalography (EEG), adaptation, horizontal sound localization, spatial hearing
INTRODUCTION
Sound localization is important for orienting and focusing attention (Darwin and Hukin 2000) and helps to segregate sounds from different sources in the environment (Moore and Gockel 2002). In humans, horizontal sound localization is based on interaural differences in sound arrival time and sound level. While there have been substantial advances in understanding the neural processing of interaural time and level differences (ITDs and ILDs) separately (see Grothe et al. 2010 for a review), the question as to whether, or at which stage, ITDs and ILDs are integrated into a common code of horizontal sound location still remains unanswered. Single neuron recordings in animals indicate that the initial processing of ITDs and ILDs is performed independently, involving different nuclei of the superior olivary complex (SOC) in the auditory brainstem (Tollin 2003). However, it has been proposed that an integrated code of ITDs and ILDs might emerge already at the subsequent processing level. Riedel and Kollmeier (2002) measured the prominent wave V deflection in the human auditory brainstem response (ABR), which is thought to be generated at the level of the lateral lemniscus or inferior colliculus (Burkard et al. 2007), and found that stimuli with ITDs and ILDs favoring the same ear elicited smaller wave V amplitudes than diotic stimuli (i.e. stimuli with zero ITD and ILD) or stimuli with ITDs and ILDs favoring opposite ears. Given that some listeners perceive stimuli with opposing ITDs and ILDs as central, like diotic stimuli, whereas stimuli with consistent ITDs and ILDs are perceived as lateral, Riedel and Kollmeier interpreted their findings as evidence that wave V reflects an integrated code of stimulus laterality. However, the wave V latency seems to tell a different story: it was longer for stimuli with opposing ITDs and ILDs than for either diotic stimuli or stimuli with consistent ITDs and ILDs. Given that stimuli with opposing ITDs and ILDs often elicit incoherent percepts, with multiple sound images relating to the individual ITD and ILD cues (Whitworth and Jeffress 1961; Hafter and Jeffress 1968), this result suggests that, contrary to Riedel and Kollmeier's conclusion, wave V may reflect not only the lateralization, but also the coherence of the sound image. This would suggest that at least some degree of separation of ITDs and ILDs remains at the level of wave V.
Three other attempts to investigate whether ITDs and ILDs are coded independently or integrated have looked at cortical responses (Schröger 1996; Tardif et al. 2006; Ungan et al. 2001). In contrast to Riedel and Kollmeier (2002), all of these studies concluded that ITDs and ILDs are still coded separately even at the level of the auditory cortex. Schröger measured the mismatch negativity (MMN; see Näätänen et al. 2011) to stimuli with an ITD only, an ILD only or with a combined ITD and ILD (both favoring the same ear) and found that the response to the combined condition was similar to the sum of the responses to the ITD- and ILD-only conditions. Schröger concluded that the response to the combined ITD and ILD constituted a linear superposition of responses from independent ITD- and ILD-sensitive generators. However, the ITD and ILD used by Schröger were very disparate in terms of their perceived laterality (82 ° versus 36 °, respectively). It is possible that the degree of integration of ITDs and ILDs depends on how consistent the cues are; it might be the case that disparate cues are processed independently, as if having arisen from different sound sources, while consistent cues are integrated. This might explain why stimuli with opposing ITDs and ILDs, like those used by Riedel and Kollmeier (2002), tend to elicit multiple sound images. Under this assumption, Schröger might have found a non-linear (super- or subadditive) superposition of the ITD- and ILD-related components in the combined ITD and ILD response, had the cues been perceptually more consistent. Super- or subadditive processing in neuroimaging data is commonly assumed to indicate functional integration (e.g. Calvert 2001). In contrast to Schröger, Ungan et al. (2001) and Tardif et al. (2006) investigated whether the cortical processing of ITDs and ILDs involves spatially separated or overlapping generators by comparing the scalp topography and estimating the sources of ITD- and ILD-evoked cortical responses. Like Schröger, Tardif et al. measured MMNs to ITDs and ILDs, whereas Ungan et al. measured responses to abrupt ITD or ILD changes in ongoing sounds. Both studies found differences in the hemispheric distribution of the ITD- and ILD-evoked responses, and both interpreted this finding as evidence for ITDs and ILDs being processed by separated generators. However, the hemispheric distribution of sound-evoked cortical responses has been shown to depend on the stimulus lateralization (e.g. McEvoy et al. 1994; Krumbholz et al. 2007). Neither Ungan et al. nor Tardif et al. took care to match the lateralization of their stimuli. The observed differences in hemispheric distribution could thus have been due to differences in stimulus lateralization, rather than separated processing of ITDs and ILDs.
The aim of the current study was to revisit the question of whether ITDs and ILDs are represented by separated or integrated codes in the human auditory cortex. The general paradigm was very similar to that used by Schröger (1996), in that we compared responses to an ITD and an ILD, presented either separately or combined. However, rather than using Schröger's MMN paradigm, we used the change response paradigm of Ungan et al. to isolate ITD- and ILD-related activity, because it yields a better response-to-noise ratio. In order to maximize the chances of detecting any integrated processing of ITDs and ILDs, we conducted a precursory psychophysical experiment to find ITDs and ILDs that were matched in terms of perceived laterality for each participant. If ITDs and ILDs are represented by independent codes in the auditory cortex, we would expect the response to the combined ITD and ILD to constitute a linear superposition of its ITD- and ILD-related components. The combined response should thus resemble the sum of the responses to the ITD- and ILD-only conditions. If, on the other hand, ITDs and ILDs are represented by an integrated code, the response to the combined condition might be expected to be either larger (superadditive) or smaller (subadditive) than the sum of the ITD- and ILD-only responses. In addition to comparing the responses to the combined and ITD- and ILD-only conditions, we also measured the scalp topographies and estimated the source distributions of the ITD- and ILD-only responses.
METHODS
In order to isolate cortical responses related to ITD and ILD processing, we measured the late auditory-evoked cortical potentials to a short probe stimulus (150 ms), which was preceded without gap by a longer adapter stimulus (see Fig. 1A, B). The adapter was long enough (1,500 ms) to ensure that the response to its onset had subsided before the probe onset (Fig. 1C). In different conditions, the probe had an ITD only, an ILD only or both. The adapter was always diotic (zero ITD and ILD). The experiment consisted of five conditions. In the “small ITD” condition, the probe had a fixed ITD of −250 μs (lateralized a little less than halfway towards the left ear; Toole and Sayers 1965) and no (zero) ILD. In the “small ILD” condition, the probe had an ILD and no ITD. The ILD was set individually to match the lateralization of the small (−250 μs) ITD (see below). In the “combined” condition, the probe was presented with both the small ITD and small ILD. Informal listening revealed that the lateralization of the combined ITD and ILD was approximately double that of either cue alone. In order to investigate the effect of this lateralization difference, we included two further ITD- and ILD-only conditions, referred to as “large ITD” and “large ILD”, where the ITD and ILD were individually matched to elicit the same lateralization as in the combined condition.
Stimuli
The stimuli consisted of noise, lowpass-filtered below 1 kHz to restrict the stimulus passband to the frequency range over which ITDs in the temporal fine structure can be perceived (Durlach and Colburn 1978). The adapter and probe had durations of 1,500 and 150 ms, respectively, and the silent gap between the end of the probe and the onset of the next adapter was 1,500 ms. They were gated on and off with 10-ms quarter-cosine ramps. At the transition between them, the ramps were cross-faded at their −3-dB points. ITDs were implemented by delaying the noise carrier in the right-ear stimulus and gating the left- and right-ear stimuli simultaneously to create ongoing ITDs. Onset ITDs could not be included, because the adapter and probe were presented without a gap. In the ITD-only conditions, the adapter and probe were presented at a constant level of 70 dB SPL (Fig. 1A). ILDs were implemented by decreasing the level of the right-ear stimulus by half of the relevant ILD relative to 70 dB SPL and increasing the level of the left-ear stimulus by the same amount. In the conditions that contained ILDs, the level of the adapter was switched randomly between the left- and right-ear levels of the probe. The time between the level switches was either 75 or 150 ms with equal probability. This was to minimize the confounding of any ILD-change response with a response to the monaural level change at the adapter–probe transition [Fig. 1B; see Ungan and Ozmen (1996) for a similar procedure]. Note that the average adapter level over time, which would be expected to determine the amount of adaptation (Seither-Preisler et al. 2004; Lanting et al. 2013), was still the same across conditions.
All stimuli were generated digitally on a trial-by-trial basis using Matlab (The Mathworks, Natick, MA, USA). They were digital-to-analogue converted with a 50-kHz sampling rate and 24-bit amplitude resolution and amplified using a Tucker Davis Technologies (Alachua, FL, USA) System 3 with an RP2.1 real-time processor and HB7 headphone amplifier. They were presented via K240 DF headphones (AKG, Vienna, Austria) in a double-walled sound-attenuated booth.
Electroencephalography (EEG) data acquisition
The EEG experiment consisted of four 20-min blocks. Within each block, each of the five stimulus conditions was presented 65 times in random order. This was a passive listening task; participants watched a silent, sub-titled movie to stay alert.
Late auditory-evoked cortical potentials were recorded with an EEG amplifier system (BrainAmp DC, Brain Products, Gilching, Germany) and an “infracerebral” cap fitted with 64 Ag/AgCl ring electrodes in a quasi-equidistant arrangement (Easycap, Herrsching, Germany). The infracerebral electrode arrangement is designed to cover a larger proportion of the head surface than the standard 10–20 arrangement to facilitate localization of sources in the region of auditory cortex. One additional electrode was placed below the right eye to monitor electro-ocular activity. Data were recorded continuously with a sampling rate of 500 Hz. They were filtered online between 0.1 and 250 Hz. Skin-to-electrode impedances were maintained below 5 kΩ. The ground electrode was placed on the midline just above the Fpz position, and the recording reference was Cz.
EEG data analysis
The EEG data were preprocessed using the BrainVision Analyzer software (Brain Products). They were (1) highpass-filtered at 0.25 Hz (−48 dB/oct roll-off), (2) corrected for eye-blink artefacts using a regression-based procedure (Gratton et al. 1983), (3) re-referenced to the average of all 64 channels (“average reference”), (4) lowpass-filtered at 35 Hz (−48 dB/oct roll-off) and (5) segmented into epochs covering the period from −200 to 2,000 ms relative to the adapter onset. Epochs that contained voltages exceeding ±75 μV were considered artifactual and excluded from further analysis. The remaining epochs were averaged for each participant and condition, baseline-corrected to the 100-ms period before the probe onset and then averaged across participants to create a grand average response for each condition.
Differences between conditions were tested by computing the bootstrap standard error (based on 1,000 bootstrap resamples; see Efron and Tibshirani 1994) of the root-mean-square (RMS) amplitude of the difference between the respective responses at each time point. The RMS amplitude of average-referenced EEG data is commonly referred to as global field power (GFP; e.g. Murray et al. 2008). A difference response was considered significant where the 99 % bootstrap confidence interval of its GFP did not contain zero for more than seven consecutive time points [corresponding to half the period of the highest frequency contained in the responses (35 Hz)]. The higher significance threshold level (p = 0.01) and the cluster threshold (seven time points) were used to minimize false positives as a result of conducting multiple tests over many time points.
The scalp topographies of selected responses were compared using a global dissimilarity (DISS) analysis (Lehman and Skrandies 1980). The DISS between two scalp maps is defined as the RMS difference between the maps after normalizing each map by its GFP. In the current study, the DISS was normalized to range from 0 to 1 (with the original definition, the DISS can range from 0 to 2), where 0 indicates topographic equality and 1 indicates topographic inversion. The DISS was computed at each time point within the time ranges of the N1 and P2 deflections. At each time point, the statistical significance of the DISS was tested by comparing the DISS between the actual scalp maps with an empirical null distribution, generated by randomly exchanging (“permuting”) the scalp maps within participants and calculating the DISS between these new, permuted maps (Murray et al. 2008). Each null distribution was based on 29 = 512 permutations, which is the maximum possible number of unique permutations for nine participants.
The cortical source distributions of the scalp maps were estimated using the iterative sSLOFO (standardized shrinking LORETA-FOCUSS) algorithm (Liu et al. 2005) as implemented in the Brain Electrical Source Analysis (BESA) software (v5.3; BESA, Gräfelfing, Germany). The sSLOFO distributions were calculated with a voxel size of 7 mm and using three iterations of the sSLOFO algorithm. Tikhonov regularization of 0.3 % was applied in each iteration. The threshold below which voxels are eliminated from the source space in each iteration was set to 10 %. Different sSLOFO distributions were compared by measuring the spatial overlap between the distributions after clipping each distribution above 1 % of its maximum. The overlap was measured by counting the number of the voxels contained in both distributions and expressing it as a proportion of the average total number of voxels in each distribution. Like the DISS, the sSLOFO overlap was calculated at each time point within the N1 and P2 time ranges. For statistical analysis, the actual overlap at a given time point was compared with an empirical null distribution, generated with the same permutation procedure as the null distributions for the DISS.
The hemispheric lateralization of the sSLOFO distributions was measured by calculating the lateralization index, LI = (S R − S L)/(S R + S L), of the maximum source strengths across the left and right hemispheres, S L and S R. The lateralization index can range from 0 to ±1, where 0 indicates no lateralization and −1 or +1 indicates total lateralization to the left or right hemisphere, respectively. It was computed at each time point within the N1 and P2 time ranges. At each time point, the difference between the lateralization indices for different conditions was significance-tested by comparing the actual difference with an empirical null distribution, generated with the same permutation procedure as the null distributions for the DISS and sSLOFO overlap.
Lateralization matching experiment
In the EEG experiment, the small ITD was fixed at −250 μs for all participants to create a left-lateralized percept. All other ITDs and ILDs used were set individually for each participant in order to match them in terms of lateralization: the small ILD was set to match the lateralization of the small ITD, and the large ITD and ILD were set to match the lateralization of the combined ITD and ILD. The matching was performed using an adaptive “doublet” procedure (Bode and Carhart 1973; Leek 2001) based on a two-alternative, forced-choice task. Each doublet run comprised two randomly interleaved tracks. Each trial within the two tracks consisted of first the target and then the matching stimulus. The participant's task was to indicate whether the matching stimulus was located to the left or right of the target stimulus. At the beginning of each track, the ITD or ILD of the matching stimulus was set such that it would be lateralized well to the left or right of the target stimulus (±750 μs for ITDs, ±30 dB for ILDs). In the “left” track, the ITD or ILD of the matching stimulus converged according to a two-up, one-down rule in order to track the point where the matching stimulus had a 70.7 % likelihood of being perceived to the left of the target stimulus (Levitt 1971). In the “right” track, the ITD or ILD of the matching stimulus converged according to a two-down, one-up rule to track the point where the matching stimulus had a 70.7 % likelihood of being perceived to the right of the target stimulus. Both tracks consisted of eight reversals in the ITD or ILD of the matching stimulus. ITDs were adjusted in steps of 62.5 μs, and ILDs in steps of 2 dB. The matching ITD or ILD was estimated by averaging the final six reversals in each track and then averaging across the left and right tracks. Three doublet runs were completed for each condition and the results averaged. The matching results are shown in Figure 2.
Participants
Five male and four female participants, aged between 18 and 35 years, were recruited through poster advertisements distributed about the Nottingham University campus. Participants gave written informed consent prior to the experiment and were right-handed according to the Edinburgh inventory (Oldfield 1971). None of the participants reported having any history of audiological or neurological disease. The experimental procedures conformed to the Code of Ethics of the World Medical Association (Declaration of Helsinki) and were approved by the Ethics Committee of the School of Psychology at the University of Nottingham.
RESULTS
The diotic adapter sound elicited a typical triphasic energy onset response (EOR in Fig. 1C), comprising P1, N1 and P2 deflections, as well as a sustained response (SR), upon which the response to the ITD and/or ILD change at the onset of the probe was superposed (change response, CR). The change responses were dominated by the N1 and P2 deflections, with no discernible P1 deflection (Fig. 3). The negative deflection following the P2 appears to be the offset response to the probe (OffR; compare Magezi and Krumbholz 2010). It showed many of the same effects as the N1 and P2 and will thus not be discussed separately.
The morphology of the change responses differed somewhat between the ITD and ILD conditions (Fig. 3). In particular, the N1 deflection was larger and occurred earlier, and the P2 deflection was smaller, in the response to the small ILD than small ITD change. Similarly, the N1 deflection occurred earlier, and the P2 deflection was smaller, in the large ILD than large ITD change response. The statistical significance of these differences was verified by bootstrapping the global field power (GFP) of the difference between the respective responses (Fig. 4; see “METHODS”).
Source analysis of ITD- and ILD-only responses
In order to test whether the ITD and ILD change responses were generated in different cortical areas, we compared the scalp voltage distributions (referred to as scalp maps) of the average small and large ITD and average small and large ILD responses using a DISS analysis (Lehman and Skrandies 1980). We also estimated the cortical source distributions of these average ITD- and ILD-only responses using the iterative sSLOFO algorithm (Liu et al. 2005; see “METHODS” for details).
Over the time range of the N1, the scalp maps exhibited negative and positive voltage maxima to the right of the vertex and over the right subtemporal cortex, respectively (blue and red highlight in Fig. 5A) and a polarity reversal over the right temporal cortex. For the P2, the voltage maximum at the vertex (which is positive in the case of the P2) was more focal and less lateralized than for the N1 (Fig. 5B). The sSLOFO analysis suggested that the predominant contribution to the N1 was from the right auditory cortex, whereas the P2 received an additional contribution from a more central, non-auditory source, possibly the cingulate cortex (Fig. 5C, D).
A permutation test (see “METHODS”) showed that, for the entire time range of the N1, and for the first part (65 %) of the time range of the P2, the scalp maps of the ITD- and ILD-only responses were not significantly different from each other (blue lines and shading in Fig. 5E). Towards the end of the P2, the difference reached significance at a level of p = 0.05, but not at the higher level of p = 0.01 used for all other statistical tests involving multiple comparisons over many time points.
The estimated source distributions of the ITD- and ILD-only responses overlapped by up to 87 % for the N1 and 81 % for the P2 (red lines and shading in Fig. 5E). A permutation test showed that, for the entire time range of the N1, and for the first part (41 %) of the P2 time range, the overlap was not significantly smaller than the overlap between two statistically identical distributions. Towards the end of the P2, the difference reached significance at p = 0.05, but not at p = 0.01.
Both the N1 and P2 showed a considerable degree of lateralization towards the right hemisphere (Fig. 5F). On average, the N1 was up to 2.8 and the P2 up to 2.1 times larger in the right than the left hemisphere. A permutation test showed that, for the entire time range of the N1, and for the first part (69 %) of the time range of the P2, the lateralization indices of the ITD- and ITD-only responses were not significantly different from each other; the difference reached significance at p = 0.05, but not at p = 0.01 towards the end of the P2.
Combined ITD and ILD change response
In order to test whether there was functional interaction between the generators of the ITD and ILD change responses, we compared the response to the combined ITD and ILD change with the responses to the respective ITD- and ILD-only conditions (i.e. small ITD and ILD). Figure 6A shows that the sum of the responses to the small ITD and ILD conditions was almost twice as large as the response to the combined condition. The significance of this difference was confirmed by bootstrapping the GFP of the difference between the two responses. The combined response could be best described as a weighted sum of the small ITD and ILD responses (Fig. 6B). The GFP of the difference between the combined and weighted-sum responses, referred to as “residual”, was minimized when the small ITD response was weighted by 0.63, and the small ILD response was weighted by 0.42. For these weights, the residual was comparable to the noise floor.
We also calculated the scalp maps and estimated the source distributions of the combined ITD and ILD response and compared them with the scalp maps and source distributions of the average response to the large ITD and ILD conditions (which matched the lateralization of the combined condition). Figure 7A shows that both the DISS and source overlap between the combined and average large ITD and ILD conditions were of a similar order as those between the ITD- and ILD-only responses in the previous section (no permutation test was carried out in this case); the source distributions of the combined and large ITD and ILD responses overlapped by up to 82 % for the N1 and 90 % for the P2. Figure 7B shows that there was no systematic difference in hemispheric lateralization between the responses to the combined and average large ITD and ILD conditions on the one hand, and the average small ITD and ILD conditions on the other hand, despite the almost twofold difference in stimulus lateralization (the combined and large ITD and ILD responses may have been expected to be more strongly lateralized than the small ITD and ILD response).
Large ITD and ILD responses
The lateralization of the combined ITD and ILD was about twice as large as that of the individual cues presented separately (i.e. small ITD and ILD). Nevertheless, the response to the combined cues was barely larger than the responses to the individual cues, indicating a high degree of compression of the combined response size with respect to the stimulus laterality. In order to investigate whether this compression was a specific property of the combined stimulus, or applies more generally to strongly lateralized sounds, we compared the responses to the small ITD and ILD conditions with the responses to the ITD- and ILD-only conditions that matched the lateralization of the combined stimulus (large ITD and ILD). We found that the large ITD and ILD responses also showed compression, in that they were considerably smaller than the responses that would have been expected if there were a linear relationship between stimulus lateralization and response size (Fig. 8). The large ITD response was compressed to 0.69 times the expected linear response, and the large ILD response was compressed to 0.47 times the expected response. The compression was significant in both cases as confirmed by bootstrapping the GFP of the difference between the expected and actual responses. Note that the compression factors for the large ITD and ILD responses (0.69 and 0.47) are remarkably similar to the weighting factors used to model the response to the combined ITD and ILD condition with the small ITD and ILD responses (0.63 and 0.42, respectively). This indicates that compression was not special to the combined response, but applies more generally to strongly lateralized sounds.
DISCUSSION
The aim of this study was to investigate whether the two cues for horizontal sound localization, ITDs and ILDs, are processed by separate or integrated codes in the human auditory cortex. For that, we measured the responses to a change in ITD or ILD between an adapter and a probe stimulus and compared them with the response to the combined change. Both the ITD- and ILD-only changes elicited large responses. For the most part of the responses, their scalp topographies were similar and their estimated source distributions were largely overlapping, suggesting that they were generated by overlapping populations of neurons. Both the ITD- and ILD-only responses were strongly lateralized to the right hemisphere, as would be expected, given that the evoking stimuli were lateralized to the left hemifield (e.g. McEvoy et al. 1994; Woldorff et al. 1999; Krumbholz et al. 2005, 2007). Importantly, the degree of hemispheric lateralization was similar, supporting the notion that the differences in hemispheric lateralization between ITD and ILD responses observed by Ungan et al. (2001) and Tardif et al. (2006) were due to differences in stimulus lateralization. There were some differences between the scalp maps and estimated source distributions of the ITD- and ILD-only responses over the falling flank of the P2 deflection. However, these differences have to be viewed with caution, because they resulted from multiple tests over many time points and only carried a low level of significance. Also, the differences coincided with relatively low amplitudes of one of the compared responses (ILD-only), which means that the comparison may have been influenced by noise.
The finding that the response to the combined ITD and ILD change was significantly different from the linear superposition of the ITD- and ILD-only responses suggests that the neuron populations that process ITDs and ILDs are not only overlapping, but also functionally coupled; a linear superposition of the ITD and ILD components would have been expected if ITDs and ILDs were processed by independent neurons. The combined ITD and ILD response was subadditive (i.e. smaller than the linear superposition of the ITD- and ILD-only responses). Similar subadditivity has been observed in the responses to multisensory compared to unisensory stimuli (e.g. Calvert et al. 2001; Calvert and Thesen 2004) and in the responses to binaural compared to monaural sounds (Gaumond and Psaltikidou 1991; Krumbholz et al. 2005; McPherson and Starr 1993; Riedel and Kollmeier 2002). The scalp maps and source distribution of the combined ITD and ILD response were similar to those of the ITD- and ILD-only responses.
Additional measurements with stimuli that matched the lateralization of the combined stimulus suggested that the subadditivity of the combined ITD and ILD response was due to a compressive relationship between stimulus lateralization and evoked response size; while the lateralization of the combined stimulus was almost double that of its ITD and ILD components, the size of the combined response was barely larger than that of each of the individual responses. Recent research suggests that, in humans, sound laterality is represented by a population rate code comprising two opponent populations broadly tuned to the left and right auditory hemifields (Salminen et al. 2009; Salminen et al. 2010; Magezi and Krumbholz 2010; Briley et al. 2013). Within the context of this opponent process model of sound lateralization, the compressive relationship between stimulus lateralization and response size would be assumed to reflect saturation of the opponent population responses towards larger sound lateralities. Neurophysiological recordings in animals suggest that each population response first increases steeply for lateralities close to the midline, but then reaches a broad maximum for larger lateralities (McAlpine et al. 2001; Stecker et al. 2003, 2005). Saturation of opponent population responses would also explain the finding that the combined and large ITD and ILD responses did not show a greater degree of hemispheric lateralization than the small ITD and ILD responses, despite the difference in stimulus lateralization.
The combined ITD and ILD response appeared to be more strongly influenced by the ITD than ILD component (weighting ratio = 0.63:0.42 = 1.5). The current stimuli were filtered to only contain frequencies below 1 kHz, where fine structure ITDs can be perceived. In natural sounds, the perceptual weighting of ILDs tends to increase, and the weighting of ITDs tends to decrease, towards higher frequencies (McPherson and Middlebrooks 2002). However, this is because natural low-frequency ILDs tend to be small (Whightman and Kistler 1992) and natural high-frequency ITDs tend to be less effective at eliciting lateralization than corresponding low-frequency ITDs (Bernstein and Trahiotis 2003), and does thus not mean that the weighting of ITDs and ILDs observed in the current study would necessarily have been different had the stimuli contained higher frequencies. When simulated over headphones, low- and high-frequency ITDs and ILDs can be made equally effective, and it has been shown that this also equalizes their perceptual weighting (Bernstein and Trahiotis 2004, 2005).
The similarity of the scalp topographies and source distributions of the ITD- and ILD-only responses, as well as the subadditivity of the response to the combined ITD and ILD change, suggests that the auditory cortex in humans contains an integrated code of ITDs and ILDs. However, the morphological differences between the ITD- and ILD-only responses, with later and smaller N1, and larger P2, deflections in the ITD-only responses, indicate that the auditory cortex retains at least some degree of independent information about ITDs and ILDs. It is possible, for instance, that the cortical processing of ITDs and ILDs is based on different, but interconnected populations of neurons, located within the same area. Alternatively, ITDs and ILDs may be processed by the same neurons, but these neurons receive input from separate ITD- and ILD-specific sources. The idea of an integrated code of ITDs and ILDs that retains some degree of cue-related information is consistent with findings from auditory psychophysics. For instance, Phillips et al. (2006) have shown that prolonged exposure to an adapting sound with a large ILD can shift the perceived lateral position of a probe sound with an ITD and vice versa. It would be difficult to conceive how such cross-adaptation between ITDs and ILDs could occur unless there were some integrated representations of the two cues. The notion of integrated processing is supported by the finding of Philips et al. (2002) that the auditory saltation illusion, whereby presentation of two sets of clicks at two lateral positions can lead to the illusory perception of a continuous movement of the clicks between the two positions, is immune to switching the lateralization cue (e.g. from ITD to ILD) between the two sets of clicks. The idea of integrated processing of ITDs and ILDs is also consistent with the fact that ITDs and ILDs can, to a certain extent, be offset, or traded, against one another, such that a sound lateralized with an ITD can be centered by applying an opposing ILD and vice versa. However, it has been shown that trading is often imperfect, in that listeners presented with stimuli having opposing interaural time and level differences may perceive a single sound image, but that image may be dominated by one or other cue (Hafter and Jeffress 1968), or they may perceive two images, one corresponding mainly to the ITD, and the other to the ILD (Whitworth and Jeffress 1961). This indicates that the auditory system retains independent information about ITDs and ILDs even up to the level of perception. Independent information about ITDs and ILDs would enable the brain to ascertain how consistent the cues are and thus how likely they would have arisen from the same source. Independent ITD and ILD information would also enable the brain to exploit the full benefit that each cue confers when listening to speech in noisy environments (Edmonds and Culling 2005). It is possible that the degree of integration of ITDs and ILDs is decided upon how consistent the cues are. The ITDs and ILDs used in the current study were highly consistent and that may be why they were processed in an integrated fashion. It is possible that using less consistent, or even opposing, cues would have led to less integration and thus a lesser degree of subadditivity in the EEG response to the combined stimulus.
Acknowledgments
This work was funded by the intramural programme of the Medical Research Council (MRC). Dr Edmonds received additional funding by the National Institute of Health Research (NIHR). The views expressed in this publication are those of the authors and not necessarily those of the NHS, the NIHR or the Department of Health.
References
- Bernstein LR, Trahiotis C. Enhancing interaural-delay-based extents of laterality at high frequencies by using “transposed stimuli”. J Acoust Soc Am. 2003;113:3335–3347. doi: 10.1121/1.1570431. [DOI] [PubMed] [Google Scholar]
- Bernstein LR, Trahiotis C. The apparent immunity of high-frequency “transposed” stimuli to low-frequency binaural interference. J Acoust Soc Am. 2004;116:3062–3069. doi: 10.1121/1.1791892. [DOI] [PubMed] [Google Scholar]
- Bernstein LR, Trahiotis C. Measures of extents of laterality for high-frequency “transposed” stimuli under conditions of binaural interference. J Acoust Soc Am. 2005;118:1626–1635. doi: 10.1121/1.1984827. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bode D, Carhart R. Measurement of articulation functions using adaptive test procedures. Audio and Electroacoustics, IEEE Transactions on. 1973;21:196–201. doi: 10.1109/TAU.1973.1162479. [DOI] [Google Scholar]
- Briley PM, Kitterick PT, Summerfield AQ. Evidence for opponent process analysis of sound source location in humans. J Assoc Res Otolaryngol. 2013;14:83–101. doi: 10.1007/s10162-012-0356-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burkard RF, Don M, Eggermont JJ. Auditory evoked potentials: basic principles and clinical application. Williams & Wilkins, Baltimore: Lippincott; 2007. [Google Scholar]
- Calvert GA, Hansen PC, Iversen SD, Brammer MJ. Detection of audio-visual integration sites in humans by application of electrophysiological criteria to the BOLD effect. Neuroimage. 2001;14:427–38. doi: 10.1006/nimg.2001.0812. [DOI] [PubMed] [Google Scholar]
- Calvert GA, Thesen T. Multisensory integration: methodological approaches and emerging principles in the human brain. J Physiol Paris. 2004;98:191–205. doi: 10.1016/j.jphysparis.2004.03.018. [DOI] [PubMed] [Google Scholar]
- Darwin CJ, Hukin RW. Effectiveness of spatial cues, prosody, and talker characteristics in selective attention. J Acoust Soc Am. 2000;107:970–7. doi: 10.1121/1.428278. [DOI] [PubMed] [Google Scholar]
- Durlach NI, Colburn HS (1978) Binaural phenomena. In Handbook of perception vol. IV, EC Carterette, MP Friedman (eds.) MP, Academic Press, New York.
- Edmonds BA, Culling JF. The role of head-related time and level cues in the unmasking of speech in noise and competing speech. Acta Acust United Ac. 2005;91:546–53. [Google Scholar]
- Efron B, Tibshirani RJ. An introduction to the bootstrap. New York: Chapman and Hall/CRC; 1994. [Google Scholar]
- Gaumond RP, Psaltikidou M. Models for the generation of the binaural difference response. J Acoust Soc Am. 1991;89:454–456. doi: 10.1121/1.400482. [DOI] [PubMed] [Google Scholar]
- Gratton G, Coles MGH, Donchin E. A new method for off-line removal of ocular artifact. Electroencephalogr Clin Neurophysiol. 1983;55:468–484. doi: 10.1016/0013-4694(83)90135-9. [DOI] [PubMed] [Google Scholar]
- Grothe B, Pecka M, McAlpine D (2010) Mechanisms of sound localization in mammals. Physiol Rev 90:983–1012 [DOI] [PubMed]
- Hafter ER, Jeffress LA. Two-image lateralization of tones and clicks. J Acoust Soc Am. 1968;44:563–569. doi: 10.1121/1.1911121. [DOI] [PubMed] [Google Scholar]
- Krumbholz K, Hewson-Stoate N, Schönwiesner M. Cortical response to auditory motion suggests an asymmetry in the reliance on inter-hemispheric connections between the left and right auditory cortices. J Neurophysiol. 2007;97:1649–1655. doi: 10.1152/jn.00560.2006. [DOI] [PubMed] [Google Scholar]
- Krumbholz K, Schönwiesner M, Rübsamen R, Zilles K, Fink GR, von Cramon DY. Hierarchical processing of sound location and motion in the human brainstem and planum temporale. Eur J Neurosci. 2005;21:230–238. doi: 10.1111/j.1460-9568.2004.03836.x. [DOI] [PubMed] [Google Scholar]
- Lanting CP, Briley PM, Sumner CJ, Krumbholz K. Mechanisms of adaptation in human auditory cortex. J Neurophysiol. 2013;110:973–983. doi: 10.1152/jn.00547.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Leek M. Adaptive procedures in psychophysical research. Atten, Percept, Psychophys. 2001;63:1279–1292. doi: 10.3758/BF03194543. [DOI] [PubMed] [Google Scholar]
- Lehmann D, Skrandies W. Reference-free identification of components of checkerboard-evoked multichannel potential fields. Electroenceph Clin Neurophysiol. 1980;48:609–621. doi: 10.1016/0013-4694(80)90419-8. [DOI] [PubMed] [Google Scholar]
- Levitt H. Transformed up-down methods in psychoacoustics. J Acoust Soc Am. 1971;49(2):467. doi: 10.1121/1.1912375. [DOI] [PubMed] [Google Scholar]
- Liu H, Schimpf PH, Dong G, Gao X, Yang F, Gao S. Standardized shrinking LORETA-FOCUSS (SSLOFO): a new algorithm for spatio-temporal EEG source reconstruction. IEEE Trans Biomed Eng. 2005;52:1681–1691. doi: 10.1109/TBME.2005.855720. [DOI] [PubMed] [Google Scholar]
- Magezi DA, Krumbholz K. Evidence for opponent-channel coding of interaural time differences in human auditory cortex. J Neurophysiol. 2010;104:1997–2007. doi: 10.1152/jn.00424.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mcalpine D, Jiang D, Palmer AR. A neural code for low-frequency sound localization in mammals. Nat Neurosci. 2001;4:396–401. doi: 10.1038/86049. [DOI] [PubMed] [Google Scholar]
- Mcevoy L, Makela JP, Hamalainen M, Hari R. Effect of interaural time differences on middle-latency and late auditory evoked magnetic fields. Hear Res. 1994;78:249–57. doi: 10.1016/0378-5955(94)90031-0. [DOI] [PubMed] [Google Scholar]
- Macpherson EA, Middlebrooks JC. Listener weighting of cues for lateral angle: the duplex theory of sound localization revisited. J Acoust Soc Am. 2002;111:2219–2236. doi: 10.1121/1.1471898. [DOI] [PubMed] [Google Scholar]
- Mcpherson DL, Starr A. Binaural interaction in auditory evoked potentials: brainstem, middle- and long-latency components. Hear Res. 1993;66:91–8. doi: 10.1016/0378-5955(93)90263-Z. [DOI] [PubMed] [Google Scholar]
- Moore BCJ, Gockel H. Factors influencing sequential stream segregation. Acta Acustica United with Acustica. 2002;88:320–333. [Google Scholar]
- Murray MM, Brunet D, Michel CM. Topographic ERP analyses: a step-by-step tutorial review. Brain Topogr. 2008;20:249–264. doi: 10.1007/s10548-008-0054-5. [DOI] [PubMed] [Google Scholar]
- Näätänen R, Kujala T, Winkler I. Auditory processing that leads to conscious perception: a unique window to central auditory processing opened by the mismatch negativity and related responses. Psychophysiology. 2011;48:4–22. doi: 10.1111/j.1469-8986.2010.01114.x. [DOI] [PubMed] [Google Scholar]
- Oldfield RC. Assessment and analysis of handedness—Edinburgh inventory. Neuropsychologia. 1971;9:97. doi: 10.1016/0028-3932(71)90067-4. [DOI] [PubMed] [Google Scholar]
- Phillips DP, Carmichael ME, Hall SE. Interaction in the perceptual processing of interaural time and level differences. Hear Res. 2006;211:96–102. doi: 10.1016/j.heares.2005.10.005. [DOI] [PubMed] [Google Scholar]
- Phillips DP, Hall SE, Boehnke SE. Central auditory onset responses, and temporal asymmetries in auditory perception. Hear Res. 2002;167:192–205. doi: 10.1016/S0378-5955(02)00393-3. [DOI] [PubMed] [Google Scholar]
- Riedel H, Kollmeier B. Auditory brain stem responses evoked by lateralized clicks: is lateralization extracted in the human brain stem? Hear Res. 2002;163:12–26. doi: 10.1016/S0378-5955(01)00362-8. [DOI] [PubMed] [Google Scholar]
- Salminen NH, May PJC, Alku P, Tiitinen H. A population rate code of auditory space in the human cortex. Plos One. 2009;4:e7600. doi: 10.1371/journal.pone.0007600. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Salminen NH, Tiitinen H, Yrttiaho S, May PJ. The neural code for interaural time difference in human auditory cortex. J Acoust Soc Am. 2010;127:EL60–5. doi: 10.1121/1.3290744. [DOI] [PubMed] [Google Scholar]
- Schröger E (1996) Interaural time and level differences: integrated or separated processing? Hear Res 96:191–198 [DOI] [PubMed]
- Seither-Preisler A, Krumbholz K, Patterson RD, Seither S, Lütkenhöner B. Interaction between the neuromagnetic responses to sound energy onset and pitch onset suggests common generators. Eur J Neurosci. 2004;19:3073–3080. doi: 10.1111/j.0953-816X.2004.03423.x. [DOI] [PubMed] [Google Scholar]
- Stecker GC, Harrington IA, Middlebrooks JC. Location coding by opponent neural populations in the auditory cortex. PLoS Biol. 2005;3:e78. doi: 10.1371/journal.pbio.0030078. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stecker GC, Mickey BJ, Macpherson EA, Middlebrooks JC. Spatial sensitivity in field PAF of cat auditory cortex. J Neurophysiol. 2003;89:2889–903. doi: 10.1152/jn.00980.2002. [DOI] [PubMed] [Google Scholar]
- Tardif E, Murray MM, Meylan R, Spierer L, Clarke S. The spatio-temporal brain dynamics of processing and integrating sound localization cues in humans. Brain Res. 2006;1092:161–176. doi: 10.1016/j.brainres.2006.03.095. [DOI] [PubMed] [Google Scholar]
- Tollin DJ. The lateral superior olive: a functional role in sound source localization. Neuroscientist. 2003;9:127–143. doi: 10.1177/1073858403252228. [DOI] [PubMed] [Google Scholar]
- Toole FE, Sayers BMCA. Lateralization judgements and the nature of binaural acoustic images. J Acoust Soc Am. 1965;37:319–324. doi: 10.1121/1.1909329. [DOI] [Google Scholar]
- Ungan P, Ozmen B. Human long-latency responses to brief interaural disparities of intensity. Electroencephalogr Clin Neurophysiol. 1996;99:479–490. doi: 10.1016/S0013-4694(96)96503-7. [DOI] [PubMed] [Google Scholar]
- Ungan P, Yagcioglu S, Goksoy C. Differences between the N1 waves of the responses to interaural time and intensity disparities: scalp topography and dipole sources. Clin Neurophysiol. 2001;112:485–498. doi: 10.1016/S1388-2457(00)00550-2. [DOI] [PubMed] [Google Scholar]
- Wightman FL, Kistler DJ. The dominant role of low-frequency interaural time differences in sound localization. J Acoust Soc Am. 1992;91:1648–1661. doi: 10.1121/1.402445. [DOI] [PubMed] [Google Scholar]
- Whitworth RH, Jeffress LA. Time vs intensity in the localization of tones. J Acoust Soc Am. 1961;33:925–929. doi: 10.1121/1.1908849. [DOI] [Google Scholar]
- Woldorff MG, Tempelmann C, Fell J, Tegeler C, Gaschler-Markefski B, Hinrichs H, Heinz HJ, Scheich H. Lateralized auditory spatial perception and the contralaterality of cortical processing as studied with functional magnetic resonance imaging and magnetoencephalography. Hum Brain Mapp. 1999;7:49–66. doi: 10.1002/(SICI)1097-0193(1999)7:1<49::AID-HBM5>3.0.CO;2-J. [DOI] [PMC free article] [PubMed] [Google Scholar]