Skip to main content
The Journal of Neuroscience logoLink to The Journal of Neuroscience
. 2010 Sep 1;30(35):11696–11702. doi: 10.1523/JNEUROSCI.0846-10.2010

Population Coding of Interaural Time Differences in Gerbils and Barn Owls

Nicholas A Lesica 1,2,, Andrea Lingner 1, Benedikt Grothe 1,2
PMCID: PMC6633420  PMID: 20810890

Abstract

Interaural time differences (ITDs) are the primary cue for the localization of low-frequency sound sources in the azimuthal plane. For decades, it was assumed that the coding of ITDs in the mammalian brain was similar to that in the avian brain, where information is sparsely distributed across individual neurons, but recent studies have suggested otherwise. In this study, we characterized the representation of ITDs in adult male and female gerbils. First, we performed behavioral experiments to determine the acuity with which gerbils can use ITDs to localize sounds. Next, we used different decoders to infer ITDs from the activity of a population of neurons in central nucleus of the inferior colliculus. These results show that ITDs are not represented in a distributed manner, but rather in the summed activity of the entire population. To contrast these results with those from a population where the representation of ITDs is known to be sparsely distributed, we performed the same analysis on activity from the external nucleus of the inferior colliculus of adult male and female barn owls. Together, our results support the idea that, unlike the avian brain, the mammalian brain represents ITDs in the overall activity of a homogenous population of neurons within each hemisphere.

Introduction

For both birds and mammals, interaural time differences (ITDs), the differences in the arrival time of sounds at the two ears, are an important cue for the localization of low-frequency sounds in the azimuthal plane. The first detailed model of how ITDs might be used to create a representation of space within the brain was presented by Jeffress (1948). The Jeffress model specifies the anatomical framework used to both create ITD sensitivity and form a representation of space. In the Jeffress model, an array of binaural coincidence detector cells tuned to different preferred ITDs via axonal “delay lines” form a topographic map of space. Because each cell is a sharply tuned filter, and the preferred ITDs span the range of ITDs that correspond to all possible azimuthal locations, the representation of space in the Jeffress model is sparse: for a single sound source, only a small subset of cells are active at any given time. In the decades since Jeffress presented his model, it has been shown to be a remarkably accurate description of the circuitry that processes ITDs in barn owls (and, subsequently, other birds); there is clear anatomical evidence for delay lines in the avian coincidence detector, the nucleus laminaris (NL), and physiological studies have revealed a topographic representation of azimuthal space in the auditory midbrain, comprised of a series of sharply tuned cells (for review, see Konishi, 2003).

The representation of azimuthal space in the mammalian brain is less clear. Numerous studies have shown that cells in the mammalian brain are sensitive to ITDs, beginning in the brainstem with the binaural coincidence detector, the medial superior olive (MSO), and continuing through the midbrain, thalamus and cortex. Until recently, it was assumed that mammals also process ITDs in a manner similar to that proposed by Jeffress. However, tympanic ears (and, thus, ITD sensitivity) evolved separately in birds and mammals (Clack, 1997), and while there is some evidence for anatomical delay lines in the MSO (Smith et al., 1993; Beckius et al., 1999), they do not appear to account for the observed physiological delays (Karino et al., 2010). Furthermore, apart from an apparent weak gradient of preferred ITDs in the MSO (Yin and Chan, 1990; Oliver et al., 2003), most of the evidence suggests that there is no ITD-based space map in the ascending auditory pathway (Middlebrooks et al., 2002; Grothe, 2003; McAlpine and Grothe, 2003; King and Campbell, 2005; McAlpine, 2005; Joris and Yin, 2007). However, even if the mammalian ascending auditory pathway does not contain delay lines and/or a space map, this suggests only that the anatomical framework for processing ITDs in mammals is different from that in birds; whether the fundamental nature of the representation in mammals is different from that in birds (i.e., whether the representation is sparsely distributed across the population) remains an open question.

In this study, we investigate the nature of the representation of ITDs in gerbils, a common mammalian model for the study of ITD tuning, by assessing whether cell identity is important for decoding responses, i.e., when using the responses of a population of neurons to localize a sound, is it necessary to know which spikes came from which cells, or can the responses of all cells simply be summed together? If the representation of ITDs is in fact distributed across the population, then summing together the responses of all cells will result in a population response that is relatively invariant to changes in ITD. In contrast, if ITD is represented by the overall activity within each hemisphere, then summing the responses from all cells within a given hemisphere should result in little or no loss of information. To provide a context in which to evaluate the results of our analysis of gerbil responses, we also perform a similar analysis on population responses from the barn owl, which is known to represent ITDs in a distributed manner.

Materials and Methods

Behavioral experiments

Experiments were performed on adult Mongolian gerbils (Meriones unguiculatus) in a sound-attenuated chamber in which the walls, ceiling and floor were covered with foam wedges to suppress echoes. The chamber contained a circular arena with a small ring in the middle. The ring contained a light barrier for detecting the presence of the animal's nose. Two movable arms were mounted on the outer rail of the arena. Each arm contained a loudspeaker (Aurasound, NSW1-205-8A), a custom-made foot-switch, and a feeder to deliver food pellets.

The animals were trained to lateralize sounds in a two-alternative forced-choice (2AFC) task. A trial began when an animal inserted its nose in the ring (ensuring that its head was oriented exactly in the middle of the two speakers), triggering a sound from one of the speakers. The animal then went to the foot-switch attached to the correct speaker to obtain a food pellet. After 15 s without a decision, the trial was aborted. Incorrect decisions were not rewarded. The animal was only allowed to begin a new trial after the previous trial was either completed or aborted, with an additional waiting period of 20 s after incorrect trials. The sounds were 125 ms bursts of low-frequency noise, with frequencies between 1 and 1200 Hz. The sounds were presented with a roving level of 72 ± 6 dB per trial. The speaker separation angle was systematically decreased from 105° to 5.5° (intermediate steps: 60°, 45°, 35°, 30°, 25°, 20°, 15°, 10°, 7°), with 5 trials at each angle. After 5 trials at 5.5°, the sequence was repeated. The entire experimental procedure was computer controlled. Each animal performed one testing session per day for at least 11 d, yielding a total of at least 64 trials per speaker angle.

To estimate the threshold separation angle (and approximate threshold ΔITD) for each animal, the set of points containing the mean percent correct at each angle tested were fit with a sigmoid. The threshold was defined as the angle at which the fit exceeded the significance level (62.5% for p < 0.05 in a 2AFC task with 64 trials; two-tailed binomial test).

Physiological experiments

Gerbils.

The surgical procedures for the physiological experiments on gerbils have been described in detail previously (Lesica and Grothe, 2008). All experiments were approved according to the German Tierschutzgesetz (AZ 211-2531-40/01 and AZ 211-2531-68/03). Briefly, adult gerbils were anesthetized for surgery with an initial intraperitoneal injection (0.5 ml/100 g of body weight) of a physiological NaCl solution containing ketamine (20%) and xylazine (2%). During recordings, the same solution was infused continuously at a rate of ∼0.1 ml/h. A small metal rod was mounted on the frontal part of the skull and used to secure the head of the animal in a stereotaxic device. The animal was positioned in a sound-attenuated chamber and a craniotomy was made over the inferior colliculus (IC), 1.3–2.6 mm lateral from the midline and 0–1 mm caudal from bregma. The dura mater overlying the cortex was removed, and a multielectrode microdrive (Thomas Recording) was used to advance 7 independently moveable tungsten microelectrodes (3–5 MΩ arranged concentrically with an interelectrode spacing of ∼100 μm) into the inferior colliculus (2–4 mm below the surface). Recordings were made in the low-frequency lamina of the rostrolateral quadrant of the IC, where inputs from the MSO are clustered (Cant and Benson, 2006) and cells are likely to be ITD sensitive. Extracellular voltage signals were amplified and fed into a computer via an A/D converter (RX5, Tucker Davis Technologies). Voltage signals were analyzed using an offline program MClust (Fraley and Raftery, 2002) to isolate action potentials from single units. Only those units with an “isolation distance” >10 were included in this study (Schmitzer-Torbert et al., 2005). Isolation distance assumes that the cluster of action potentials forms a multidimensional Gaussian in feature space and measures, in terms of the SD of the original cluster, the size of the ellipsoid in feature space that contains an equal number of action potentials and noise spikes. Our recordings yielded an average of 4.2 ± 1.4 single units per recording site (of 7 possible single units per site—multiple single units were never taken from a single electrode). Experiments typically lasted ∼12 h. At the end of each experiment, a lesion was made with the central electrode via current injection (20 kHz sinusoid with peak amplitude of 10 μA and duration of 90 s) to mark the recording site. All sites were verified to be in the central nucleus of the IC. Recording sites from two representative animals are shown in supplemental Figure 1 (available at www.jneurosci.org as supplemental material).

Sounds were generated with a 48 kHz sampling rate by TDT System III hardware (Tucker Davis Technologies). Digitally generated sounds were converted to analog signals (RP2-1), attenuated (PA5), and delivered to speakers (ER2, Etymotic Research) coupled to tubes which were inserted into the ear canals along with microphones. Speakers were calibrated to have a flat frequency response [±5 dB SPL (sound pressure level)from 0.1 to 10 kHz] at the beginning of each experiment. At each recording site, a sequence of sounds with various frequencies, intensities, and ITDs were presented to characterize basic response properties. First, 100 ms pure tones of various intensities and frequencies were presented, separated by 150 ms periods of silence, to determine the frequency response area (FRA) (see Fig. 2a). Tones were presented binaurally with zero ITDs, and for some cells, also monaurally in the ear contralateral to the recording site. The tones had a rise/fall time of 5 ms. Next, 8 repeated presentations of a 250 ms segment of “frozen” noise at ITDs ranging from −2 to 2 ms were presented, separated by 500 ms periods of silence, to compute noise delay functions (NDFs) (see Fig. 2b). Two sets of noise were presented, one in which the noise bursts in the two ears were identical and, thus, perfectly correlated, and the other in which the noise bursts in the ipsilateral ear were multiplied by −1, so that the inputs to the two ears were perfectly anti-correlated. The noise segments were filtered to contain only frequencies between 50 and 5000 Hz and had a rise/fall time of 5 ms. The intensity of the noise was 50 dB. Finally the stimulus used for the main decoding analysis was presented: 40 repeated presentations of a 250 ms segment of frozen noise at 9 different ITDs ranging from −135 to 135 μs, separated by 500 ms periods of silence. The noise was filtered as described above and its intensity was 50 dB.

Figure 2.

Figure 2.

Basic responses properties of cells in the gerbil IC. a, The left column shows the FRAs for two typical cells in the gerbil IC measured from responses to binaural tone bursts with different frequencies and intensities and zero ITDs. The BF for each cell is indicated. The right column shows the FRAs measured from responses to tone bursts presented in the contralateral ear only. Our measure of binaural response type for each cell is indicated. b, The NDFs for the same two cells as in a. Two NDFs are shown, one for which the noise bursts in the two ears were identical and, thus, perfectly correlated (black), and the other for which the noise bursts in the ipsilateral ear were multiplied by −1, so that the inputs to the two ears were perfectly anticorrelated (red). For each cell, the best ITD and our measure of ITD sensitivity type, the correlation coefficient between the two NDFs, are indicated. c, The distributions of basic response properties for our population of cells.

Barn owls.

The surgical procedures for the physiological experiments on owls have been described in detail previously and the data analyzed here have also been analyzed in previous work (Keller and Takahashi, 2000; Bala et al., 2003). Briefly, adult barn owls were anesthetized by intramuscular injections of ketamine (0.05–0.1 ml/h; 100 mg/ml) and diazepam (0.025–0.05 ml/h; 5 mg/ml) and given a prophylactic dosage of ampicillin (0.2 ml, i.m.; 250 mg/ml). The owl was placed into a stereotaxic device inside a sound-attenuated chamber that held its head tilted downwardly at a 45° angle. Microelectrodes were inserted into the exterior nucleus of the inferior colliculus (ICx) through a recording well on the skull. The ICx was identified on the basis of response characteristics. At the end of a recording session, the well was closed with dental cement, and the scalp was sutured and covered with a topical antibacterial cream. Typically, a recording session lasted ∼12–15 h. A bird was typically used for four sessions with a minimum of 10 d between sessions.

Sounds were presented over earphones (Sony MDR with custom-built cones that fit into the outer ear canal) after compensating for the filtering properties of the earphones in situ and filtering the signals with the individual bird's head-related transfer functions (HRTFs) (Tucker Davis Technologies PD1). At each recording site, 20 repeated presentations of a 100 ms segment of random noise were presented at different ITDs, separated by 500 ms periods of silence. The noise was filtered to contain only frequencies between 2 and 9 kHz. The intensity of the noise was 50 dB.

Physiological data analysis

The significance of each cell's ITD tuning was determined by comparing the decoder performance (see below) on the actual responses to the performance after randomly reassigning the ITD associated with each response. Decoding was performed on 100 different sets of randomized responses and the significance threshold was defined as 2 SDs above the mean percent correct for the randomized sets. The populations of cells analyzed were not restricted to any particular type of ITD tuning, i.e., any cell that exhibited significant tuning was included. For example, the population of gerbil cells contained cells that were sensitive to ITDs in both the envelope and fine structure of the sound waveform.

The similarity of tuning curves for each animal was quantified by computing the correlation coefficient between pairs of tuning curves. This analysis was also performed on two sets of simulated tuning curves, one in which all tuning curves were identical, and another in which tuning curves peaks were distributed uniformly. Both sets of tuning curves were defined by spike rates at 9 ITDs evenly spaced between −1 and 1 (arbitrary units). All tuning curves had a width at half max of 2.5. For the identical set, all peaks were at 1. For the other set, peaks were uniformly distributed between −1.5 and 1.5 (i.e., the peaks of some tuning curves were outside the range that was analyzed, as was the case in the physiological data from barn owls).

The representation of ITDs in each animal was characterized by decoding responses. Decoding was performed as follows. (1) A single response was removed from the full set of all repeated presentations at all ITDs. (2) The Euclidean distance between the removed response and each of the remaining responses in the set was computed. (3) The removed response was assigned to the ITD for which its average distance was smallest. This process was repeated for all responses in the full set to obtain an overall percent correct. For the “distributed” decoder, each response was an n-dimensional vector defined by the spike rates of each of the n cells in the population. For the “summed” decoder, each response was a scalar defined by the sum of the spike rates of all the cells in the population. The methods used to assess the role of spike timing and noise correlations on decoder performance are described in supplemental Figures 2 and 3 (available at www.jneurosci.org as supplemental material).

Results

Gerbils localize low-frequency sounds in the azimuthal plane with high acuity

Before investigating the nature of the representation of ITDs by decoding responses from a particular animal, it is necessary to know the acuity with which that animal can use ITDs to localize sounds, so that the decoding can be performed at an appropriate resolution. Because there have been relatively few studies on the ability of gerbils to localize sounds using ITDs (Heffner and Heffner, 1988; Maier and Klump, 2006; Maier et al., 2008), we tested their ability to localize sounds in the horizontal plane using a 2AFC task, the design of which is illustrated in Figure 1a. We trained gerbils to place their nose in a ring to disrupt a light beam and elicit a sound from one of two speakers, which were spaced symmetrically about the midline at various angles. The sounds were low-frequency noise bursts (<1.2 kHz) for which ITD is the only cue available for localization (Maki and Furukawa, 2005). The ability of individual gerbils to correctly identify the speaker that emitted the sound for a range of separation angles is shown in Figure 1b. Individual threshold separation angles ranged from 7° to 20°, corresponding to a difference in ITD of ∼11–30 μs, with a median value of 14°, or 21 μs (approximate ITDs were derived from the head-related transfer functions in Maki and Furukawa (2005)). These values are comparable to the threshold difference for ITD in humans (Mills, 1958).

Figure 1.

Figure 1.

Performance of gerbils in a sound localization task. a, A schematic illustration of the sound localization task. 1, The gerbil placed its nose in a ring and disrupted a light beam to ensure that its head was facing forward. 2, A sound was emitted from one of two speakers. 3, The gerbil went to the speaker from which the sound was emitted to collect a food reward. b, The performance of individual gerbils in the sound localization task. Each colored line shows the percentage correct at a range of speaker separation angles (and approximate difference in ITDs) for one gerbil. The inset shows the threshold speaker separation angles (and approximate difference in ITDs) for each gerbil, i.e., the smallest values at which each gerbil's performance was significantly above chance level.

Response properties of neurons in the gerbil inferior colliculus

To investigate the representation of ITDs in the gerbil auditory system, we made extracellular single-unit recordings from the central nucleus of the IC in anesthetized gerbils using a multielectrode array (all recording sites were in the same hemisphere). Recordings were made in the low-frequency lamina of the rostrolateral quadrant of the IC, where inputs from the MSO are clustered (Cant and Benson, 2006) and cells are likely to be ITD sensitive (see supplemental Fig. 1, available at www.jneurosci.org as supplemental material, for representative examples of recording sites). Only those cells with significant ITD tuning (see Materials and Methods) were analyzed (n = 112).

To characterize the basic response properties of our population of cells, we recorded responses to binaural (for all cells) and monaural contralateral (for n = 61 cells) tone bursts with different frequencies and intensities, and binaural noise bursts with different ITDs. The FRAs for two typical cells are shown in Figure 2a. From these FRAs, we extracted two values: the best frequency (BF), which is the frequency at which the cell responded to a binaural tone burst (with zero ITDs) at the lowest intensity, and a measure of binaural type for which we computed the strength of binaural responses relative to contralateral responses (binaural–contralateral)/contralateral, averaged over all frequencies and intensities). For neurons that received excitatory inputs from both ears, as was the case for both of the example cells, this value was greater than zero.

The NDFs for the same two cells are shown in Figure 2b. For each cell, we recorded two NDFs, one for which the noise bursts in the two ears were identical and, thus, perfectly correlated (black), and the other for which the noise bursts in the ipsilateral ear were multiplied by −1, so that the inputs to the two ears were perfectly anti-correlated (red). From these NDFs, we extracted two values: the best ITD, which is the ITD that evoked the highest spike rate for the correlated noise, and a measure of ITD sensitivity type for which we computed the correlation coefficient between the two NDFs (Joris, 2003). For neurons with low BFs that were sensitive to ITDs in the fine structure of a sound, such as cell 1, the correlation coefficient between the two NDFs was close to −1, while for neurons with relatively high BFs that were sensitive to ITDs in the envelope of a sound, such as cell 2, this value was close to 1.

The distributions of the basic response properties for our population of cells are shown in Figure 2c. The distribution of BFs ranged from 0.25 kHz to 4.2 kHz, with a peak around 1 kHz. For the vast majority of cells (53/61, 87%) for which we could compute our measure of binaural type, the value was greater than zero, suggesting that they received excitatory inputs from both ears. The distribution of best ITDs was tightly clustered around 200 ms (mean = 201 ms, SD = 37 ms) and a majority of cells (78/112, 70%) had best ITDs beyond the physiological limit for gerbils of 135 μs (Maki and Furukawa, 2005). For our measure of ITD sensitivity type, the distribution ranged from −0.86 to 0.91, but a majority of cells (83/112, 74%) had values less than zero, indicating that they were sensitive primarily to ITDs in the fine structure of a sound.

Decoding gerbil and barn owl population responses to a hypothetical localization task

Based on the behavioral results described above, we also recorded the responses of each cell to low-frequency frozen noise bursts at 9 different ITDs, centered on an ITD = 0 μs, spaced near the localization threshold at ∼34 μs. This stimulus was designed to test the ability of different decoders to use the population responses to perform a hypothetical task: to localize a sound to 1 of the 9 possible ITDs. The responses of the same two example cells to repeated presentations at different ITDs are shown in Figure 3a. Both cells responded reliably to features of the sound across repetitions, with an overall spike rate that increased with increasing ITD, as summarized in the tuning curves in Figure 3b.

Figure 3.

Figure 3.

Responses of cells in the gerbil IC to sounds with different ITDs. The top row shows raster plots of the responses of two cells in the gerbil IC (the same cells as in Fig. 2) to 20 repeated presentations of the same noise burst at different ITDs. Each mark on the plot represents one spike. The bottom row shows the spike rate versus ITD tuning curves for the same two cells. Circles and error bars indicate the mean and SD of the spike rate across repeated presentations.

The tuning curves for ITDs within the limited range tested for the decoding task are shown for a sample of individual cells in Figure 4a (colored lines; tuning curves normalized to range from 0 to 1). There was some variability in the tuning curves, but nearly all had a minimum at −135 μs and a maximum at 135 μs, the lowest and highest ITDs tested. As a basis for comparison with the gerbil, we also analyzed responses to noise bursts at different ITDs from the exterior nucleus of the ICx in anesthetized barn owls (n = 58 with significant ITD tuning). While there are many important anatomical and physiological differences between the central nucleus of the mammalian IC and the avian ICx, the barn owl ICx responses, in which the representation of ITDs is known to be sparsely distributed across the population (Knudsen and Konishi, 1978; Bala et al., 2003), provide a clear example with which to compare the gerbil responses. The ability of barn owls to localize sounds in the azimuthal plane has been well studied (Knudsen et al., 1979; Knudsen and Konishi, 1979), and it has been shown that this ability relies almost exclusively on sensitivity to ITDs (Poganiatz et al., 2001). In contrast to the gerbil, the tuning curve maxima for the barn owl spanned the entire range of ITDs tested, as shown in Figure 4b. Figure 4, a and b, also shows the sum of the tuning curves for all cells (black). For the gerbil, the summed tuning curve spanned the full dynamic range, indicating that the tuning curves for all cells were similar, but for the barn owl, the summed tuning curve was relatively flat. To facilitate comparison with the gerbil responses, further analysis of the owl responses was performed only on those responses to nine different ITDs, centered on ITD = 0 ms, spaced near the localization threshold at ∼5 μs (Knudsen et al., 1979; Knudsen and Konishi, 1979), corresponding to the range denoted by the gray band. Restricting our analysis of the owl responses to this set of ITDs ensured that the hypothetical task used for decoding responses was of equivalent difficulty for each animal.

Figure 4.

Figure 4.

Tuning curves and decoder performance for cells in the gerbil IC and barn owl ICx. a, Tuning curves for ITDs within the limited range tested for the decoding task for cells in the gerbil IC. Each colored line shows the mean spike rate across repeated presentations for one cell. Each tuning curve was normalized to have a minimum value of 0 and a maximum value of 1. For clarity, only 20 tuning curves are shown. The mean of all tuning curves is shown in black. All cells were recorded in the same hemisphere. b, ITD tuning curves for cells in the barn owl ICx, plotted as in a. The gray band denotes the range of responses that were used for further analysis. c, Histograms of the ITD at which spike rate was maximal (the best ITD within the range of ITDs tested) for cells in the gerbil IC and barn owl ICx. d, The distribution of correlation coefficients between pairs of tuning curves for 4 sets of tuning curves: a set of simulated identical tuning curves, a set of simulated tuning curves with peaks uniformly distributed across a range of ITDs, the gerbil tuning curves, and the barn owl tuning curves. e, Performance of distributed and summed decoders for population responses in the gerbil IC and barn owl ICx. Decoding began with a single cell (that which yielded the best performance) and an additional cell was added to each population on each iteration (that which yielded the best performance in combination with the current population).

Before decoding the gerbil and owl responses, we quantified the similarity in the tuning curves for each animal. First, we determined the ITD at which the spike rate was maximal (the best ITD within the limited range tested for the decoding task). As shown in Figure 4c, the tuning curve peaks were clustered near the largest ITD tested (+135 μs) for the gerbil, but distributed relatively uniformly across all ITDs tested for the barn owl. To further examine the similarity in the tuning curves for each animal, we computed the correlation coefficient between all possible pairs of tuning curves. As a basis for comparison with the experimental tuning curves, we created two sets of simulated tuning curves, one in which all tuning curves were identical (with the same peak ITD and width), and one in which the peaks were distributed uniformly across all ITDs (see Materials and Methods for details). As shown in Figure 4d, the correlation coefficient was 1 for all tuning curves in the identical set, while for the set in which the peaks were distributed uniformly, the correlation coefficients were broadly distributed. For the gerbil, the correlation coefficients for most pairs of tuning curves were clustered near 1, indicating that the tuning curves were nearly identical, while for the barn owl, the distribution of correlation coefficients was similar to that of the simulated population in which the peaks were distributed uniformly.

Finally, to explicitly compare the nature of the representation of ITDs in each animal, we measured the performance of two different decoders in localizing of a sound to 1 of 9 possible ITDs based on observation of the population responses. For a particular response (from one or more cells), the decoder computed its average Euclidean distance from the responses (across repeated presentations) evoked at each ITD, and chose the ITD corresponding to the smallest distance (see Materials and Methods for details). Before using the decoders on population responses, we determined the time scale at which decoder performance was maximal for the responses of single cells for the gerbil (see supplemental Fig. 2, available at www.jneurosci.org as supplemental material), and tested whether decoder performance was influenced by “noise correlations” between cells (i.e., correlations in the trial to trial variability of responses; see supplemental Fig. 3, available at www.jneurosci.org as supplemental material). The results of these analyses suggested that, as for the barn owl (Keller and Takahashi, 2000), information about ITD was confined to the overall spike rate (i.e., considering the timing of individual spikes did not improve decoder performance) and that noise correlations could be ignored (i.e., decoder performance for the joint responses of small groups of cells recorded simultaneously was the same before and after shuffling the order of repeated presentations). Thus, for the remainder of our analysis, we considered only spike rate and combined all cells from different recording sites into one large population.

To compare the nature of the representation of ITDs in each animal, we measured the performance of two different decoders. The first decoder (distributed) retained the identity of each cell, while the second (summed) ignored cell identity and summed the responses of all cells. The performance of the two decoders is shown in Figure 4e. To examine the effect of population size, the decoding was done iteratively, with one cell added to each population (that which provided the largest improvement in performance) on each iteration. For the gerbil, the performance of both decoders saturated near the maximum value, but for the barn owl, the performance of the distributed decoder saturated near the maximum value, while the performance of the summed decoder decreased with each additional cell. This result suggests that the representation of ITDs in the gerbil brain is indeed different from that in barn owl brain: when decoding the sparsely distributed population responses in the barn owl, ignoring which cell fired which spikes results in a loss of information, but when decoding the responses from the relatively homogenous population within a single hemisphere in the gerbil, the responses of all cells can be summed together into a single channel.

Discussion

Our decoding analysis explicitly shows that the representation of ITDs is not sparsely distributed across the responses of individual neurons in the central nucleus of the inferior colliculus of the gerbil. Because ignoring which cells in the population fired which spikes results in no loss of information about ITDs, we conclude that ITD is coded by the overall activity of the entire population. This result supports one side of the ongoing debate as to whether the mammalian ITD pathway is a “Jeffress-like” system or a “two-channel” system, with ITD represented by the difference in the overall activity in each hemisphere (for opposing views, see McAlpine and Grothe, 2003; Joris and Yin, 2007). The Jeffress-like nature of ITD processing in mammals has been in question for some time for several reasons. (1) The results of the anatomical studies that have looked for delay lines (Smith et al., 1993; Beckius et al., 1999) are ambiguous and recent analysis suggests that any anatomical delays that may be present cannot account for the observed physiological delays (Karino et al., 2010). (2) Apart from a possible weak topographic gradient of best ITDs in the MSO (Yin and Chan, 1990; Oliver et al., 2003), there is little evidence for a topographic map of space in the ascending auditory pathway (Middlebrooks et al., 2002; Grothe, 2003; McAlpine and Grothe, 2003; King and Campbell, 2005; McAlpine, 2005; Joris and Yin, 2007). (3) Recent studies have shown that some cells in the mammalian brain have broad ITD tuning curves, with peaks outside the physiological range (McAlpine et al., 2001; Brand et al., 2002; Hancock and Delgutte, 2004; Pecka et al., 2008). (4) Theoretical work (Harper and McAlpine, 2004) has shown that a Jeffress-like system is optimal for animals that use ITDs as a spatial cue for high-frequency sounds, such as barn owls, while a two-channel system is optimal for animals that use ITDs for low-frequency sounds, such as gerbils (this theory also states that barn owls should use a two-channel system for low-frequency sounds, suggesting that the differences between gerbils and barn owls in our results may be due to the differences in the frequency tuning of the populations studied, though recent experimental evidence indicates that barn owls use a Jeffress-like system for low frequencies as well (Wagner et al., 2007). Our results explicitly show that the representation of ITDs in gerbils is not Jeffress-like, i.e., it is not sparsely distributed across a population of cells with different preferences, and, thus, suggest that the search for anatomical features such as delay lines or space maps that are used to create or organize a Jeffress-like representation in the mammalian brain may be unnecessary.

Previous studies of ITD coding in the mammalian IC have shown that the responses of single neurons can be sufficient to explain behavioral performance (Skottun et al., 2001; Shackleton et al., 2003). This result is in apparent conflict with our analysis of responses in the gerbil IC in which a population of cells is required for the decoder to reach the desired level of performance. However, upon closer inspection, both sets of results are fully consistent. The studies cited above showed that for a two-alternative task, a decoder using the responses of a single cell could distinguish a change in ITD of ∼30 μs relative to a specific reference ITD. This is true of many of the cells in our sample as well. For example, a decoder using only the responses of cell 1 in Figure 3 can be used to distinguish between noise bursts with ITDs of −67 μs and −34 μs with near perfect performance. The task used in our decoding analysis is much more difficult, a 9-alternative task for ITDs that span a relatively wide range, and for this task, a decoder using only the responses of cell 1 in Figure 3 achieves a correct performance level of only 54%. Thus, for a two-alternative task within a specific narrow range of ITDs, a single cell may be sufficient, but for a more difficult task over a wide range of ITDs, the responses of a population of cells are required.

Footnotes

This work was supported by the Bernstein Center for Computational Neuroscience and the German Science Foundation (Deutsche Forschungsgemeinschaft; GR1205/12-1, LE2522/1-1).We thank Jose-Manuel Alonso for his expert advice on multielectrode recordings and Avinash Bala and Terry Takahashi for providing the barn owl data.

References

  1. Bala AD, Spitzer MW, Takahashi TT. Prediction of auditory spatial acuity from neural images on the owl's auditory space map. Nature. 2003;424:771–774. doi: 10.1038/nature01835. [DOI] [PubMed] [Google Scholar]
  2. Beckius GE, Batra R, Oliver DL. Axons from anteroventral cochlear nucleus that terminate in medial superior olive of cat: observations related to delay lines. J Neurosci. 1999;19:3146–3161. doi: 10.1523/JNEUROSCI.19-08-03146.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Brand A, Behrend O, Marquardt T, McAlpine D, Grothe B. Precise inhibition is essential for microsecond interaural time difference coding. Nature. 2002;417:543–547. doi: 10.1038/417543a. [DOI] [PubMed] [Google Scholar]
  4. Cant NB, Benson CG. Organization of the inferior colliculus of the gerbil (Meriones unguiculatus): differences in distribution of projections from the cochlear nuclei and the superior olivary complex. J Comp Neurol. 2006;495:511–528. doi: 10.1002/cne.20888. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Clack JA. The evolution of tetrapod ears and the fossil record. Brain Behav Evol. 1997;50:198–212. doi: 10.1159/000113334. [DOI] [PubMed] [Google Scholar]
  6. Fraley C, Raftery AE. Model-based clustering, discriminant analysis and density estimation. J Am Stat Assoc. 2002;97:611–631. [Google Scholar]
  7. Grothe B. New roles for synaptic inhibition in sound localization. Nat Rev Neurosci. 2003;4:540–550. doi: 10.1038/nrn1136. [DOI] [PubMed] [Google Scholar]
  8. Hancock KE, Delgutte B. A physiologically based model of interaural time difference discrimination. J Neurosci. 2004;24:7110–7117. doi: 10.1523/JNEUROSCI.0762-04.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Harper NS, McAlpine D. Optimal neural population coding of an auditory spatial cue. Nature. 2004;430:682–686. doi: 10.1038/nature02768. [DOI] [PubMed] [Google Scholar]
  10. Heffner RS, Heffner HE. Sound localization and use of binaural cues by the gerbil (Meriones unguiculatus) Behav Neurosci. 1988;102:422–428. doi: 10.1037//0735-7044.102.3.422. [DOI] [PubMed] [Google Scholar]
  11. Jeffress LA. A place theory of sound localization. J Comp Physiol Psychol. 1948;41:35–39. doi: 10.1037/h0061495. [DOI] [PubMed] [Google Scholar]
  12. Joris P, Yin TC. A matter of time: internal delays in binaural processing. Trends Neurosci. 2007;30:70–78. doi: 10.1016/j.tins.2006.12.004. [DOI] [PubMed] [Google Scholar]
  13. Joris PX. Interaural time sensitivity dominated by cochlea-induced envelope patterns. J Neurosci. 2003;23:6345–6350. doi: 10.1523/JNEUROSCI.23-15-06345.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Karino S, Smith PH, Yin TCT, Joris PX. Axonal delay lines from bushy cells to the medial superior olive: a reexamination. Assoc Res Otolaryngol Abstr. 2010:776. [Google Scholar]
  15. Keller CH, Takahashi TT. Representation of temporal features of complex sounds by the discharge patterns of neurons in the owl's inferior colliculus. J Neurophysiol. 2000;84:2638–2650. doi: 10.1152/jn.2000.84.5.2638. [DOI] [PubMed] [Google Scholar]
  16. King AJ, Campbell RA. Cortical processing of sound-source location. Acta Acustica. 2005;91:399–408. [Google Scholar]
  17. Knudsen EI, Konishi M. A neural map of auditory space in the owl. Science. 1978;200:795–797. doi: 10.1126/science.644324. [DOI] [PubMed] [Google Scholar]
  18. Knudsen EI, Konishi M. Mechanisms of sound localization in the barn owl (Tyto alba) J Comp Physiol. 1979;133:13–21. [Google Scholar]
  19. Knudsen EI, Blasdel GG, Konishi M. Sound localization by the barn owl (Tyto alba) measured with the search coil technique. J Comp Physiol. 1979;133:1–11. [Google Scholar]
  20. Konishi M. Coding of auditory space. Annu Rev Neurosci. 2003;26:31–55. doi: 10.1146/annurev.neuro.26.041002.131123. [DOI] [PubMed] [Google Scholar]
  21. Lesica NA, Grothe B. Dynamic spectrotemporal feature selectivity in the auditory midbrain. J Neurosci. 2008;28:5412–5421. doi: 10.1523/JNEUROSCI.0073-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Maier JK, Klump GM. Resolution in azimuth sound localization in the Mongolian gerbil (Meriones unguiculatus) J Acoust Soc Am. 2006;119:1029–1036. doi: 10.1121/1.2159429. [DOI] [PubMed] [Google Scholar]
  23. Maier JK, Kindermann T, Grothe B, Klump GM. Effects of omni-directional noise-exposure during hearing onset and age on auditory spatial resolution in the Mongolian gerbil (Meriones unguiculatus)—a behavioral approach. Brain Res. 2008;1220:47–57. doi: 10.1016/j.brainres.2008.01.083. [DOI] [PubMed] [Google Scholar]
  24. Maki K, Furukawa S. Acoustical cues for sound localization by the Mongolian gerbil, Meriones unguiculatus. J Acoust Soc Am. 2005;118:872–886. doi: 10.1121/1.1944647. [DOI] [PubMed] [Google Scholar]
  25. McAlpine D. Creating a sense of auditory space. J Physiol. 2005;566:21–28. doi: 10.1113/jphysiol.2005.083113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. McAlpine D, Grothe B. Sound localization and delay lines—do mammals fit the model? Trends Neurosci. 2003;26:347–350. doi: 10.1016/S0166-2236(03)00140-1. [DOI] [PubMed] [Google Scholar]
  27. McAlpine D, Jiang D, Palmer AR. A neural code for low-frequency sound localization in mammals. Nat Neurosci. 2001;4:396–401. doi: 10.1038/86049. [DOI] [PubMed] [Google Scholar]
  28. Middlebrooks JC, Xu L, Furukawa S, Macpherson EA. Cortical neurons that localize sounds. Neuroscientist. 2002;8:73–83. doi: 10.1177/107385840200800112. [DOI] [PubMed] [Google Scholar]
  29. Mills AW. On the minimum audible angle. J Acoust Soc Am. 1958;30:237–246. [Google Scholar]
  30. Oliver DL, Beckius GE, Bishop DC, Loftus WC, Batra R. Topography of interaural temporal disparity coding in projections of medial superior olive to inferior colliculus. J Neurosci. 2003;23:7438–7449. doi: 10.1523/JNEUROSCI.23-19-07438.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Pecka M, Brand A, Behrend O, Grothe B. Interaural time difference processing in the mammalian medial superior olive: the role of glycinergic inhibition. J Neurosci. 2008;28:6914–6925. doi: 10.1523/JNEUROSCI.1660-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Poganiatz I, Nelken I, Wagner H. Sound-localization experiments with barn owls in virtual space: influence of interaural time difference on head-turning behavior. J Assoc Res Otolaryngol. 2001;2:1–21. doi: 10.1007/s101620010039. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Schmitzer-Torbert N, Jackson J, Henze D, Harris K, Redish AD. Quantitative measures of cluster quality for use in extracellular recordings. Neuroscience. 2005;131:1–11. doi: 10.1016/j.neuroscience.2004.09.066. [DOI] [PubMed] [Google Scholar]
  34. Shackleton TM, Skottun BC, Arnott RH, Palmer AR. Interaural time difference discrimination thresholds for single neurons in the inferior colliculus of guinea pigs. J Neurosci. 2003;23:716–724. doi: 10.1523/JNEUROSCI.23-02-00716.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Skottun BC, Shackleton TM, Arnott RH, Palmer AR. The ability of inferior colliculus neurons to signal differences in interaural delay. Proc Natl Acad Sci U S A. 2001;98:14050–14054. doi: 10.1073/pnas.241513998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Smith PH, Joris PX, Yin TC. Projections of physiologically characterized spherical bushy cell axons from the cochlear nucleus of the cat: evidence for delay lines to the medial superior olive. J Comp Neurol. 1993;331:245–260. doi: 10.1002/cne.903310208. [DOI] [PubMed] [Google Scholar]
  37. Wagner H, Asadollahi A, Bremen P, Endler F, Vonderschen K, von Campenhausen M. Distribution of interaural time difference in the barn owl's inferior colliculus in the low- and high-frequency ranges. J Neurosci. 2007;27:4191–4200. doi: 10.1523/JNEUROSCI.5250-06.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Yin TC, Chan JC. Interaural time sensitivity in medial superior olive of cat. J Neurophysiol. 1990;64:465–488. doi: 10.1152/jn.1990.64.2.465. [DOI] [PubMed] [Google Scholar]

Articles from The Journal of Neuroscience are provided here courtesy of Society for Neuroscience

RESOURCES