Abstract
Sound localization in humans depends largely on interaural time delay (ITD). The ability to discriminate differences in ITD is highly accurate. ITD discrimination (Δ ITD) thresholds, under some circumstances, are as low as 10–20 μs. It has been assumed that thresholds this low could only be obtained if the outputs from many neurons were combined. Here we use Receiver Operating Characteristic analysis to compute neuronal Δ ITD thresholds from 53 cells in the inferior colliculus in guinea pigs. The Δ ITD thresholds of single neurons range from several hundreds of μs down to 20–30 μs. The lowest single-cell thresholds are comparable to human thresholds determined with similar stimuli. This finding suggests that the highly accurate sound localization of human observers is consistent with the resolution of single cells and need not reflect the combined activity of many neurons.
The question of whether the finest sensory discrimination thresholds reflect single-cell resolution or are accomplished by ensembles of neurons has wide implications for our understanding of the relationship between sensory functions and their neuronal substrates. In the study of vision it has become increasingly clear that psychophysical thresholds are not beyond the capabilities of single cells (1, 2), even in cases where spatial discrimination thresholds correspond to positional differences smaller than the diameter of single photoreceptors (3, 4). Somatosensory thresholds may also be accounted for on the basis of single neurons (5, 6). The accuracy of sound localization, on the other hand, has proved difficult to attribute to single neurons. In the case of sound localization, the time difference between the two ears [interaural time delay (ITD)] at discrimination threshold is about 10–20 μs (7–9). It has been widely assumed that discrimination thresholds this low could only be achieved if the responses from many cells were combined (10–15). This view has been challenged on the basis of computer simulations (16). These simulations, which made use of Receiver Operating Characteristic (ROC) analysis, were based on the assumption that the response variability conforms to a Poisson distribution. Although this may be an appropriate assumption for an average neuron, because of considerable intercell differences any given neuron may differ markedly from Poisson variability.
It has long been realized that the sensitivity with which a change in stimulus can be detected is as crucially dependent on the variability in the response to a constant stimulus as on the magnitude of the change in response when the stimulus is changed; this concept has been codified as Signal Detection Theory (17). Thus it is as important, when determining the sensitivity of neurons to changes in ITD (Δ ITD thresholds), that the variability in response as an individual stimulus is repeated is measured, as well as the change in response when stimulus ITD is changed. This, unfortunately, requires a large number of repeats and is, perhaps, one reason why this has not hitherto been attempted. If it is assumed that the spiking of neurons is a memory-less point-process, where the occurrence of a spike at any instant has no influence on the probability of a spike occurring at any other time, then the spike process can be termed a (nonhomogeneous) Poisson process and the probability of counting a given number of spikes in any time interval can be described by a Poisson distribution (18). This assumption is largely valid at the level of the auditory nerve (ignoring refractory effects), but becomes increasingly less certain as the auditory system is ascended and more complex processing occurs. It is therefore vital that, in addition to the mean rate, the distribution of neural firing counts to repeated stimulation is measured.
The earlier computer simulations (16) used published mean rate responses, assumed a Poisson distribution, and could thus be criticized for not using the true variability of the neurons being modeled; in this paper we aim to remedy that defect. Additionally, the responses modeled were obtained from binaural beat stimuli, which are pure tones with an interaural frequency difference of typically 1 Hz and duration of at least a second. The interaural phase difference slowly sweeps through 360° during the period of the beat so an estimate of the ITD tuning curve can be obtained. This estimate is considered to yield ITD tuning curves similar to those obtained by using static tones where interaural time delay is varied (19, 20). However, there are many instances of neural responses to the ITD of static tones that do not match the sensitivities revealed by dynamic stimuli like binaural beats (e.g., ref. 21); at the very least, binaural beat stimuli may cause the amplitude of the response to differ. When spike counts obey a Poisson distribution, the variance is proportional to the mean and so the absolute response levels may affect the estimated discrimination thresholds. Thus, using binaural beat responses may provide a different estimate of sensitivity than short, static tones.
To circumvent these problems, we here use actual neuronal responses to 100 repeats of static tones (i.e., constant ITD throughout each stimulus presentation) to measure Δ ITD thresholds in 53 neurons in the inferior colliculus (IC) of guinea pigs. ROC analysis using spike count distributions based on the actual neural responses are used so that no assumptions are necessary about the nature of the response variability in the IC. Our results demonstrate that the lowest neuronal thresholds are quite close to psychophysical thresholds. It is possible, therefore, that the psychophysical threshold may simply reflect the limit of the most accurate neuron without any pooling being involved.
Methods and Stimuli
Recordings were made in the right IC of 15 pigmented guinea pigs weighing 335–507 g. Animals were anaesthetized with urethane (1.3 g/kg i.p.) and Hypnorm (Janssen; 0.2 ml i.m.), premedicated with Atropine sulfate (0.06 mg/kg s.c.). Further doses of Hypnorm (0.2 ml i.m.) were given on indication by pedal withdrawal reflex. A tracheotomy was performed, and core temperature was maintained at 38°C. The animals were mounted in a stereotaxic frame inside a sound-attenuating room. Hollow plastic speculae with sealed-in loudspeakers replaced the ear bars. All experiments were carried out in accordance with the United Kingdom Animals (Scientific Procedures) Act of 1986.
Extracellular action potentials were recorded with tungsten electrodes (22). All stimuli were digitally synthesized at a 100 kHz sampling rate and were output through a waveform reconstruction filter set at 25 kHz. ITD functions were obtained by delaying, or advancing, the fine structure of the signal to the right ear while keeping the signal to the left ear fixed. Signals were gated on and off simultaneously in the two ears with rise/fall times of 2 ms. The signals were tone bursts (of 50 ms or 400 ms duration) at the neuron's best frequency (BF) and at 20 dB above rate threshold. ITD functions were obtained over ±1.5 cycles of BF in 0.1 cycle steps by using 50 repeats (Fig. 1A, open symbols). A fine-grained analysis (0.01 cycle steps) was performed from the trough to the peak of the slope through zero ITD with 100 repeats (Fig. 1 A, filled symbols, and B). A single repeat consisted of the full range of ITDs presented in pseudorandom order. The BFs of cells reported were approximately evenly distributed between 72 and 1185 Hz. No attempt was made to determine ITD sensitivity if the BF was much above 1000 Hz, because past experience strongly indicates that this would be unlikely to succeed.
Results
The average response as a function of ITD for a representative IC neuron is shown in Fig. 1A. The responses, which vary cyclically, undergo large changes over narrow portions of each cycle. The filled symbols show a detailed mapping, sampled at 0.01 cycle intervals, of the steeply sloping region of the ITD function that crosses zero ITD (midline azimuthal positions). This neuron was stimulated at 400 Hz; therefore, 0.01 of a cycle is equal to 25 μs. As can be seen from the expanded ITD function in Fig. 1B, ITD differences of 25 μs can elicit clear differences in average response.
The ability of a neuron to signal small changes in ITD is determined not only by the steepness of the ITD curve, however, but also by the variability of the response. The reliability (i.e., percent correct response) with which an ideal observer can detect ITD differences was estimated by using ROC analyses (17). ROC analyses determine the probability of differentiating two stimuli on the basis of the response distributions elicited by the two stimuli. This is comparable to using a t test [or d′ analysis (17)] to determine the statistical significance of the difference between two means. Unlike a t test (or d′), however, ROC analysis does not require any assumptions regarding the nature of the distributions. The application of ROC analysis to neuronal responses has been described (23). Fig. 1C shows percent correct ITD discrimination as a function of Δ ITD. From these data, Δ ITD thresholds were computed by determining the Δ ITD giving 75% correct performance.
Fig. 2A shows the distribution of Δ ITD thresholds for 53 neurons. The Δ ITD thresholds show a wide distribution, varying from 31 μs to well above 230 μs. The arrows indicate psychophysical Δ ITD thresholds for human observers determined by using 50-ms stimuli (replotted from ref. 24). Fig. 2B shows Δ ITD thresholds plotted against tone frequency (filled symbols). Clearly there is a tendency for thresholds to decrease with increasing frequency (slope of regression line = −0.84; r = −0.66). The best neural thresholds are only slightly greater than the corresponding human psychophysical thresholds, and the decrease in neural threshold with increasing frequency is in general agreement with psychophysical performance in human observers (indicated with open symbols; from ref. 24). It is clear from both figures that the most sensitive neurons are not abnormal outliers.
One factor that may affect the comparison between single cells and psychophysics is the presence of long-term variability. Psychophysical thresholds are determined, typically, by making comparisons between two stimuli presented close together in time. In the present experiments the stimuli were presented in a pseudorandom sequence extending over several minutes with each ITD presented 100 times. The ROC analysis assigns equal significance to responses that occur close together as to those that are separated by extended periods of time. The existence of slow response fluctuations, or long-term variability, is well documented (23, 25–27). To estimate the effect of these slow fluctuations on our neuronal thresholds, we carried out a pairwise comparison in which the response to the nth presentation of one stimulus was compared with the response to the nth presentation of another (see Fig. 3 legend for further details). For the vast majority of neurons, reducing the variability in this manner did not alter the threshold. In a few cells, however—including that with the lowest threshold—there was a threshold reduction of about 20%. The percent correct as a function of Δ ITD for our most sensitive neuron, responding to stimuli of 400 ms duration, is shown in Fig. 3C. Results obtained both with the conventional ROC method (filled circles), and those based on pairwise comparison (open circles), are shown. The Δ ITD thresholds under the two cases were 21 μs and 17 μs, respectively. By comparison, human Δ ITD thresholds for 400-ms stimuli, depending on rise-time and individual differences, range from 14 to 24 μs (from figure 4 of ref. 24). Thus, this neuron's threshold fell within the range of the variability of human observers.
Discussion
The goal of this study was to examine whether the extremely low Δ ITD thresholds of human observers is reflected in the behavior of single neurons, or whether it necessitates pooled responses from many neurons. If we make the reasonable assumption that the behavioral threshold reflects the performance of neurons having the greatest acuity then we would expect the psychophysical thresholds to lie in the lower tails of the distribution of neural thresholds. Fig. 2 shows that guinea pig neural thresholds are slightly higher than human psychophysical thresholds. This discrepancy may represent a species difference, as discussed below, or may be the result of our trying to estimate the tail of a distribution with insufficient observations (53 cells spread evenly over BFs between 72 and 1,185 Hz). In either case, we feel that the agreement is sufficiently close for us to conclude that neurons probably exist that have sufficiently low thresholds to account for behavioral thresholds, and, consequently, that pooling of the responses from many neurons may not be required. This conclusion is in agreement with earlier computer simulations based on responses to binaural beat stimuli (16).
Our conclusion involves an across-species comparison. Human observers show very low Δ ITD thresholds and thus, optimally, we would have examined ITD discriminability at the level of single neurons in humans; however, this is not possible. Instead, we might have compared our guinea pig neuronal thresholds to behavioral thresholds in the same species, although, unfortunately, we are not aware of any study in which behavioral Δ ITD thresholds have been determined in this animal. However, for guinea pig behavioral thresholds to invalidate our conclusions, these thresholds would have to be (markedly) lower than those of humans, which is unlikely because behavioral thresholds for free-field localization of broad-band stimuli in animals are mostly much larger than those of human observers (28). It seems more likely that behavioral thresholds in guinea pigs are higher than those in humans and thus are higher than the single-cell thresholds that we have measured. If so, it may be that the information carried by the lowest-threshold guinea pig IC neurons is lost or underused. In this case one might speculate that interspecies differences in behavioral Δ ITD thresholds may be determined more by the ability of different species to use the information carried by the most accurate neurons than by the thresholds of these neurons themselves.
If human neural sensitivity were better than that of guinea pigs, we would have to invoke a similar argument about loss or underutilization of information. If, on the other hand, human neural sensitivity were worse than that of guinea pigs, then we would have to invoke pooling to explain human psychophysical sensitivity. Consistent binaural physiology across a wide range of laboratory animals (29) suggests that neuronal Δ ITD thresholds in humans and guinea pigs may be similar. The shapes of ITD functions and the positions of their peaks have been obtained in the IC, in response to both tones and noise, in cat, rabbit, kangaroo rat, and barn owl, as well as guinea pig (29), and are very similar across these species. Additionally, although guinea pigs are small animals, they have large bullae, an adaptation that allows them to hear low frequencies. Thus the comparison between human and guinea pig is not invalidated by their having very different audiograms. These arguments, though, do not address the trial to trial variability in firing rate that, as we have here demonstrated, is an important determinant of ITD discriminability. There are no across-species comparisons that can be made here because there are no previous reports of these measurements for ITD discrimination. One component of the variability is likely to be the accuracy with which the auditory waveform is transmitted to the input of the ITD-sensitive coincidence-detector mechanism in the Medial Superior Olive (MSO). The closest we can get to this comparison is at the level of the auditory nerve (which is a single synapse away from the MSO): here the accuracy of phase locking is approximately the same for both cat (30) and guinea pig (31) up to about 800 Hz, where the guinea pig synchronization index begins to decline about an octave lower than that of cat. However, the guinea pig synchronization index does not become negligible until about 2 kHz, again an octave below that of cat. Although there are a number of ways in which the variability of ITD representation could be altered between auditory nerve and IC, at least in the auditory nerve the accuracy of representation is similar in two very different species over most of the range of frequencies that we have studied.
Because phase-locking at the auditory nerve is approximately constant across the range of frequencies that we have used, we can, with a couple of assumptions, make predictions about the shape of the Δ ITD threshold versus frequency curve (Fig. 2B). It is generally accepted that ITD responses at the IC are, to a large extent, a reflection of coincidence-detection between spike trains derived from the stimulus presented to each ear (11–15, 19–21, 29). If we assume that the inputs to the coincidence detectors have similar phase locking to that in the auditory nerve, and that the variability at the output of these coincidence detectors is related to the phase-locking at the input, then the variability (for a fixed firing rate) will be independent of stimulus frequency. However, the slope of the ITD function (and hence the change in firing rate for a given Δ ITD) will be inversely proportional to stimulus frequency. Because log–log plots show power relationships as straight lines, we would expect that the slope of a log–log plot of Δ ITD threshold as a function of frequency would be −1 up to the point at which phase-locking declines. The data shown in Fig. 2B come close to this. Because we did not attempt to measure ITD sensitivity for cells with BFs much above 1 kHz, because they generally show poor modulation of the ITD function, there is an implicit increase in Δ ITD threshold somewhere close to the maximum frequency shown, comparable with human psychophysics.
Low-frequency IC units show responses that demonstrate convergence from cells in both medial (MSO) and lateral superior olive (LSO; ref. 32). It would be interesting to know whether ITD sensitivity were different for responses deriving originally from different nuclei. It is likely that units with a peak near 0 IPD in the ITD function (peak units) receive afferent projections from MSO, whereas those with a trough near zero (trough units) receive projections from LSO. However, with only a single frequency ITD function it is not possible to determine unambiguously whether a unit is a peak or trough. To be completely unambiguous, ITD functions are needed at a range of frequencies. These data were not gathered because recording time was limited to the data specific to ITD thresholds. Given that, almost by definition, trough units will yield a large best interaural phase value compared with peak units, however, our sample population can be segregated accordingly. We have done this and looked at the ITD functions of all units with long best IPDs. Although a few are probably trough units, members of our sample more commonly showed intermediate properties (possibly reflecting influences such as convergence and divergence) in accordance with the results of previous studies of IC behavior (32). Thus, although we do not have the data to address the question of afferent input directly, it is likely that trough units form a minority in our sample and, hence, the majority of our units were most likely innervated from the MSO.
Although comparisons across species raise concern, given that previous studies have argued strongly that pooling of responses from many neurons is required to achieve the very low psychophysical Δ ITD thresholds in humans (10–15), the demonstration that any auditory neurons in any species have thresholds comparable to those of human psychophysics is significant. The demonstration refutes the notion that human Δ ITD thresholds are beyond the resolution of single neurons, irrespective of species.
Our findings should not be taken to indicate that ITD discrimination involves only a single cell. On the contrary, given that the stimuli are of suprathreshold intensity and that ambiguities need to be resolved (e.g., sound level differences need to be differentiated from ITD differences), it would be surprising if only a single neuron were involved in the task. Instead, these results show that it may not be necessary to combine the responses from many neurons to achieve the astonishingly high accuracy of human ITD discrimination. Whether psychophysical ITD discrimination actually reflects the limit of the most accurate neuron has yet to be determined, although our data lend credence to this possibility.
Acknowledgments
We are indebted to Carl Hansen of Pictopia.com of Emeryville, CA, for writing perl-scripts.
Abbreviations
- ITD
interaural time delay
- ROC
Receiver Operating Characteristic
- IC
inferior colliculus
- BF
best frequency
References
- 1.Bradley A, Skottun B C, Ohzawa I, Sclar G, Freeman R D. J Opt Soc Am A. 1985;2:1607–1610. doi: 10.1364/josaa.2.001607. [DOI] [PubMed] [Google Scholar]
- 2.Newsome W T, Britten K H, Movshon J A. Nature (London) 1989;341:52–54. doi: 10.1038/341052a0. [DOI] [PubMed] [Google Scholar]
- 3.Parker A J, Hawken M J. J Opt Soc Am A. 1985;2:1101–1114. doi: 10.1364/josaa.2.001101. [DOI] [PubMed] [Google Scholar]
- 4.Hawken M J, Parker A J. In: Vision: Coding and Efficiency. Blakemore C, editor. Cambridge, U.K.: Cambridge Univ. Press; 1990. pp. 103–116. [Google Scholar]
- 5.Johansson R S, Vallbo Å B. J Physiol (London) 1979;297:405–422. doi: 10.1113/jphysiol.1979.sp013048. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Vallbo Å B. In: The Cognitive Neurosciences. Gazzaniga M S, editor. Cambridge, MA: MIT Press; 1995. pp. 237–252. [Google Scholar]
- 7.Mills A W. J Acoust Soc Am. 1958;30:237–246. [Google Scholar]
- 8.Durlach N I, Colburn H S. In: Handbook of Perception, Vol IV, Hearing. Carterette E C, Friedman M P, editors. New York: Academic; 1978. pp. 365–466. [Google Scholar]
- 9.Dye R H, Hafter E R. J Acoust Soc Am. 1984;75:1593–1598. doi: 10.1121/1.390868. [DOI] [PubMed] [Google Scholar]
- 10.Hall J L. J Acoust Soc Am. 1965;37:814–823. doi: 10.1121/1.1909447. [DOI] [PubMed] [Google Scholar]
- 11.Yin T C T, Chan J C K. In: Auditory Function: Neurobiological Bases of Hearing. Edelman G M, Gall W E, Cowan W M, editors. New York: Wiley; 1988. pp. 385–430. [Google Scholar]
- 12.Carr C E. Annu Rev Neurosci. 1993;16:223–243. doi: 10.1146/annurev.ne.16.030193.001255. [DOI] [PubMed] [Google Scholar]
- 13.Gerstner W, Kempter R, van Hemmen J L, Wagner H. Nature (London) 1996;383:76–78. doi: 10.1038/383076a0. [DOI] [PubMed] [Google Scholar]
- 14.Yin T C T, Joris P X, Smith P H, Chan J C K. In: Binaural and Spatial: Hearing in Real and Virtual Environments. Gilkey R H, Anderson T R, editors. Mahwah, NJ: Erlbaum; 1997. pp. 427–445. [Google Scholar]
- 15.Fitzpatrick D C, Batra R, Stanford T R, Kuwada S. Nature (London) 1997;388:871–874. doi: 10.1038/42246. [DOI] [PubMed] [Google Scholar]
- 16.Skottun B C. Nature (London) 1998;393:531. doi: 10.1038/31134. [DOI] [PubMed] [Google Scholar]
- 17.Green D M, Swets J A. Signal Detection Theory and Psychophysics. Huntington, NY: Krieger; 1974. [Google Scholar]
- 18.Cox D R, Miller H D. The Theory of Stochastic Processes. London: Chapman & Hall; 1965. [Google Scholar]
- 19.Kuwada S, Stanford T R, Batra R. J Neurophysiol. 1987;57:1338–1360. doi: 10.1152/jn.1987.57.5.1338. [DOI] [PubMed] [Google Scholar]
- 20.Stanford T R, Kuwada S, Batra R. J Neurosci. 1992;12:3200–3216. doi: 10.1523/JNEUROSCI.12-08-03200.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Spitzer M W, Semple M N. J Neurophysiol. 1993;69:1245–1263. doi: 10.1152/jn.1993.69.4.1245. [DOI] [PubMed] [Google Scholar]
- 22.Bullock D C, Palmer A R, Rees A. Med Biol Eng Comput. 1988;26:669–672. doi: 10.1007/BF02447511. [DOI] [PubMed] [Google Scholar]
- 23.Bradley A, Skottun B C, Ohzawa I, Sclar G, Freeman R D. J Neurophysiol. 1987;57:755–772. doi: 10.1152/jn.1987.57.3.755. [DOI] [PubMed] [Google Scholar]
- 24.Hafter E R, Dye R H, Gilkey R H. J Acoust Soc Am. 1979;65:471–477. doi: 10.1121/1.382346. [DOI] [PubMed] [Google Scholar]
- 25.Tomko G J, Crapper D R. Brain Res. 1974;79:405–418. doi: 10.1016/0006-8993(74)90438-7. [DOI] [PubMed] [Google Scholar]
- 26.Dean A F. Exp Brain Res. 1981;44:437–440. doi: 10.1007/BF00238837. [DOI] [PubMed] [Google Scholar]
- 27.Tolhurst D J, Movshon J A, Dean A F. Vision Res. 1983;23:775–785. doi: 10.1016/0042-6989(83)90200-6. [DOI] [PubMed] [Google Scholar]
- 28.Heffner R S. Acta Otolaryngol (Stockholm) 1997;532,Suppl.:46–53. doi: 10.3109/00016489709126144. [DOI] [PubMed] [Google Scholar]
- 29.Palmer A R, Rees A, Caird D M. Hearing Res. 1990;50:71–86. doi: 10.1016/0378-5955(90)90034-m. [DOI] [PubMed] [Google Scholar]
- 30.Johnson D H. J Acoust Soc Am. 1980;68:1115–1122. doi: 10.1121/1.384982. [DOI] [PubMed] [Google Scholar]
- 31.Palmer A R, Russell I J. Hearing Res. 1986;24:1–15. doi: 10.1016/0378-5955(86)90002-x. [DOI] [PubMed] [Google Scholar]
- 32.McAlpine D, Jiang D, Shackleton T M, Palmer A R. J Neurosci. 1998;18:6026–6039. doi: 10.1523/JNEUROSCI.18-15-06026.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]