Abstract
Binaural auditory neurons exhibit “best delays” (BDs): They are maximally activated at certain acoustic delays between sounds at the two ears and thereby signal spatial sound location. BDs arise from delays internal to the auditory system, but their source is controversial. According to the classic Jeffress model, they reflect pure time delays generated by differences in axonal length between the inputs from the two ears to binaural neurons. However, a relationship has been reported between BDs and the frequency to which binaural neurons are most sensitive (the characteristic frequency), and this relationship is not predicted by the Jeffress model. An alternative hypothesis proposes that binaural neurons derive their input from slightly different places along the two cochleas, which induces BDs by virtue of the slowness of the cochlear traveling wave. To test this hypothesis, we performed a coincidence analysis on spiketrains of pairs of auditory nerve fibers originating from different cochlear locations. In effect, this analysis mimics the processing of phase-locked inputs from each ear by binaural neurons. We find that auditory nerve fibers that innervate different cochlear sites show a maximum number of coincidences when they are delayed relative to each other, and that the optimum delays decrease with characteristic frequency as in binaural neurons. These findings suggest that cochlear disparities make an important contribution to the internal delays observed in binaural neurons.
Keywords: auditory nerve, phase-locking, sound localization, stereo, temporal coding
A remarkable feat of the human auditory system is its extraordinary sensitivity to differences in the acoustic waveforms between the two ears. Interaural time differences (ITDs) arise when sound sources are offset from the midline toward one side of the head, and differences in ITD can be perceived to values of 10–20 μs. Most computational models of this ability are based on the qualitative scheme of Jeffress (1) in which a population of binaural cells effectively cross-correlates the two monaural signals by a process of coincidence detection and delay lines. In this scheme, the input to a binaural neuron from one ear is delayed relative to that of the other ear by a so-called “internal delay,” generated by differences in length of the axonal pathways from each side. The binaural neurons are coincidence detectors whose inputs are spiketrains that are time-locked to the ongoing features of the sound stimulus at each ear. Only spiketrains that arrive coincidentally activate the binaural cell: This activation is achieved at an ITD (“best delay” or BD) that exactly compensates for the internal delay. By arranging these axons in a delay-line pattern, a range of internal delays to different coincidence detectors is created.
There is physiological and anatomical support for the Jeffress model both in the medial superior olive (MSO) of mammals and in nucleus laminaris of the barn owl (2, 3). However, binaural data in the mammalian inferior colliculus (IC), which receives input from the MSO, show a systematic relationship between BDs and the frequency to which neurons are most sensitive (characteristic frequency or CF). BDs are, on average, small at high CFs and large at low CFs (4, 5). To fit the Jeffress model, these findings would imply longer axonal delay lines at low CFs than at high CFs.
Schroeder (6) suggested an alternative scheme, which we refer to as the cochlear disparity hypothesis. Sounds cause a propagating wave in the cochlea, so that high-frequency components are transduced earliest at the base of the cochlea, and low-frequency components are transduced several milliseconds later at its apex. If the monaural inputs to a binaural neuron were derived from slightly different places in the left cochlea vs. the right cochlea, an internal delay would result. Indeed, computational models (7, 8) suggest that small cochlear mismatches should have significant effects on BD. The only experimental test to date, in the barn owl, concluded that there was no need to invoke nonaxonal delays (9). Although there have been no experimental studies in the mammal that specifically address the cochlear disparity hypothesis, some features in binaural responses have been interpreted as indicative of such mismatches (10).
The effect of cochlear disparity is more complex than a simple time delay because it reflects differences in both the magnitude and phase spectra of two points in the cochlea. The place to measure these effects in pure form is not in the binaural system but rather in the output of the cochlea: the auditory nerve. Here, we use a coincidence analysis to measure the magnitude of the effective delays resulting from cochlear disparities, as a function of CF. We processed monaural (auditory nerve) spike trains, obtained within a single ear, through the simplest conceivable coincidence detector, and we compared this output with binaural (IC) responses obtained with the same stimuli.
Results
BDs in Binaural Neurons of the IC.
We first characterized the distribution of BDs as a function of CF in the IC. Noise bursts were delivered to the two ears while varying ITD in discrete steps and measuring neuronal firing rate: The resulting graphs are referred to as noise-delay functions. The thick lines in Fig. 1 Left show representative examples for four cells with different CF, spaced approximately an octave apart. A key property (4) is that the largest peak, indicated with a vertical line, is usually the peak closest to zero delay. Stated differently, the BD is usually smaller than half a characteristic period (1/2 CF). Because this period is obviously largest at low CFs, the BD at such CFs (Fig. 1D, 940 μs) is often much larger than at high CFs (Fig. 1A, −20 μs) and can fall outside the physiological range of ITDs (i.e., the range of ITDs naturally encountered by the species). Note the different scaling of the abscissa across rows.
Our main interest was to examine the effect of internal delays on the positioning of the “fine-structure” of the noise-delay functions with respect to ITD. The traditional BD measure is not always the best characterization of the internal delay. In some cases, e.g., Fig. 1C, the noise-delay function shows several peaks of comparable magnitude; moreover, envelope sensitivity can affect the overall shape (Fig. 1A) (11). We therefore routinely also collected noise-delay functions with the polarity of the noise waveform in one ear inverted (anticorrelated noise in Fig. 1 Left thin lines). Subtraction of the responses to anticorrelated and correlated noise results in “difcors” (Fig. 1 Right). Difcors have several advantages: The effect of envelope sensitivity is reduced; the distorting effects of thresholding and saturation are tempered; and they are based on more measurements. For each difcor, we not only determined the BD, being the ITD at which the response was maximal (Fig. 1, circles), but also the ITDs for the maxima of the neighboring secondary peaks (Fig. 1, triangles). We also quantified the periodicity of difcors with Fourier analysis and refer to the spectral maximum as the dominant frequency (DF).
Fig. 2 shows the peak delays against CF, for a population of 219 IC neurons. As in previous studies of responses to broadband noise (4, 5, 12), the distribution of BDs is strongly biased toward positive ITDs, at which the stimulus to the contralateral ear leads in time. More importantly, there is a wide range of BDs at low CFs and a small range at high CFs. Most BDs (circles) are sandwiched between the horizontal line at zero delay and the upper hyperbola that indicates a half-period at CF above this horizontal (“π-boundary”). For neurons with BD values that fall outside this region, one of the neighboring maxima (triangles) is usually within that region. The distribution of BDs is thus flanked by the distributions of secondary maxima but separated from them by zones that are relatively free of data points. It is consistent with previous reports for the IC (4, 5) in showing two main trends with increasing CF: a decrease in the average BD and a decrease in the spread of BDs. Our data extend to CFs almost an octave higher than these reports and show that these two trends continue to CFs at the limit of phase-locking.
Importantly, although the converging spacing of primary and secondary peaks is a trivial consequence of the increase in CF, there is no a priori reason why the range of BDs should taper with CF. If the same internal delays were available at all CFs, the distribution would not follow the π-boundary, and noise-delay functions at high CFs would contain multiple “slipped cycles” (i.e., multiple CF periods would separate the main peak from zero delay), but slipped cycles are rarely observed (Fig. 2).
Delays Resulting from Disparities in Cochlear Position.
The observed link between internal delay and CF is not trivial to achieve, particularly in view of the extremely small delays involved. It seems parsimonious to search for a basis in the cochlea, because it is the only site in the auditory pathway where CF and delay are intrinsically coupled. Low-CF fibers have longer latencies than high-CF fibers, and the dependence of latency on CF is steeper in the cochlear apex than in the cochlear base (13, 14). Thus, a given offset in cochlear position between two neurons will affect their temporal relationship more strongly at low CFs than at high CFs. On the other hand, the characteristic period (CF−1) is much shorter at the base than at the apex, so that small differences in delay will more readily produce slipped cycles in high-CF than in low-CF neurons. What then is the quantitative relationship between CF and the delay resulting from cochlear disparities, and can it provide the basis for the relationship between CF and BD observed in the IC (Fig. 2)?
Our approach to examine the effect of cochlear disparities on delays is based on a coincidence analysis of responses of auditory nerve fibers (Fig. 3) (15). We made successive recordings (Fig. 3A) from fibers in a single nerve. From each fiber, we obtained spiketrains to a standard (A+) and inverted (A−) noise. We then counted intervals (Fig. 3C) or, equivalently, coincidences at different delays, between pairs of spiketrains.
If the spiketrains that are compared are derived from different fibers, we refer to these distributions as cross-correlograms. Fig. 3B illustrates that such cross-correlograms predict the output of the simplest circuit with a cochlear disparity: a binaural coincidence detector, which receives a single input from the left and right cochlea but displaced in position along the cochlea. The underlying assumptions are that the output of left and right auditory nerve are interchangeable and that there is no correlation between auditory nerve fibers except the correlation induced by acoustic stimuli (16). We also computed single-fiber auto-correlograms, in which case sets x and y are spiketrains from the same auditory nerve fiber.
Fig. 3 D–F shows data for one pair of fibers. Fig. 3 D and E Upper are auto-correlograms for a fiber of 522 Hz and 617 Hz, respectively; Fig. 3 D and E Lower show their difcors. As shown in refs. 11 and 17, these auto-correlograms have a symmetrical, damped oscillatory shape, with a periodicity determined by the fiber CF. Again, we determined the DF with Fourier analysis of the difcor. For the two fibers illustrated, DFs were 513 and 625 Hz and agree well with CF.
Fig. 3F shows cross-correlation data (15). Like auto-correlograms, the cross-correlograms are also damped oscillations: The DF of the difcor was at an intermediate frequency of 552 Hz. However, whereas auto-correlograms are symmetric around zero delay, cross-correlograms are not. The main peak occurs at a positive delay of 450 μs. Thus, within the range of delays of interest (the physiological range of ± 400 μs), the net effect of the cochlear disparity is a delay in the response of the neuron with the lower CF relative to that with the higher CF.
We calculated auto- and cross-correlograms for all possible pairings of fibers within each auditory nerve. Fig. 4A shows auto-correlograms for 11 low-CF fibers from a single ear. The curves are arranged according to distance from the cochlear apex, calculated from the fiber DF (left ordinate) (18, 19). The circles and triangles indicate the delay of the main and secondary peaks. From these fibers, we arbitrarily selected one fiber (thick line) as a reference fiber: Its auto-correlogram is replotted in Fig. 4B (thick line). All of the other curves in Fig. 4B are cross-correlograms between the responses of this fiber and the ten other comparison fibers. The vertical position of these cross-correlograms is again according to the DF of the comparison fibers, as in Fig. 4A. The right ordinate indicates the difference in octaves between the DF of the reference fiber and that of the comparison fiber. Positive delays indicate a longer delay for the reference fiber. For these low-CF fibers, large and systematic shifts are observed in the maxima and secondary peaks of the cross-correlograms. The shifts are in the expected direction: increasing delay of the reference fiber relative to fibers tuned to higher CFs.
Fig. 4 C and D show similar data for eight fibers of higher CF from the same animal. The DF of the reference fiber was 1,792 Hz, and the comparison fibers span a similar range of cochlear distance as in Fig. 4 A and B. The pattern of shift in the cross-correlograms (Fig. 4D) is similar to that of Fig. 4A, with low-frequency fibers being delayed relative to high-frequency fibers, but the magnitude of the shifts is much smaller.
Delay values from 204 fibers in six animals are compiled in Fig. 5. These data represent ≈50% of the fibers recorded in these animals. Pairs of fibers were only included if their cochlear positions, calculated from their CF, differed between 0 and 0.5 mm. This distance corresponds to ≈1/3 octave, depending on CF (18). Within each pair, the fiber with the lowest DF was always used as the reference, and the correlogram peak delays are plotted at the arithmetic mean CF of the pair. Taking our conventions for the sign of delays into account, the distribution of Fig. 5 shows the distribution that would be obtained from coincidence detectors receiving an ipsilateral input that is systematically biased toward a more basal cochlear position (higher CF) than the contralateral input, by 0 to 0.5 mm. The resulting pattern shows a striking similarity to the BD distribution measured in the IC (Fig. 2).
Discussion
We developed an analysis to examine ITD-sensitivity of elementary coincidence detectors operating on auditory-nerve inputs from different cochlear positions. The main insight is the similarity in the pattern of the resulting ITD-sensitivity to the pattern we observed in binaural IC neurons. The activity of nerve fibers with mismatched CFs is maximally correlated at delays that depend on CF in a way similar to the BDs of binaural neurons. As in the binaural responses, cross-correlograms of monaural responses peak within the π-boundary and only rarely show slipped cycles.
Our study differs from previous studies by using actual neural responses rather than model predictions and by testing the entire range of CFs for its consistency with binaural responses to the same stimuli. The distribution of Fig. 5 is obtained with cochlear disparities between 0 and 0.5 mm. The upper limit translates to ≈50 hair cells and the average disparity to ≈1% of the entire length of the organ of Corti (19). There is convergence at several levels in both the excitatory and inhibitory input chains to the MSO, and it is implausible that the entire input to its neurons is ultimately derived from one inner hair cell on each side. More likely, binaural neurons effectively sample a restricted region of each cochlea. Our results show that if the mix of inputs from one side has a different bias in its ultimate cochlear position compared with the other side, shifts in BD will occur.
Importantly, “unsigned” cochlear disparities are not sufficient to generate the patterns of Fig. 2; they also need to be systematically biased. The effect of random cochlear disparities is to create a larger dispersion in BDs at low frequencies compared with the dispersion in BDs at high frequencies: The pattern obtained tapers with increasing CF but is symmetrical (Fig. 6A). Random mismatches thus can account for the decrease in spread of BD with increasing CF, but the decrease in average BD with CF requires a biased source of internal delays. For Fig. 5, delays were always measured with the reference fiber being the one with the more apical position. The distribution in Fig. 2 can therefore be generated by ipsilateral inputs that are systematically dominated by more basal cochlear parts than the contralateral inputs (Fig. 5B). There is no physiological or anatomical evidence to support or contradict this requirement, but it provides a clear prediction that can be tested experimentally. Alternatively, axonal delay lines, which have been documented anatomically within an iso-frequency plane (20, 21), also exist across such planes (Fig. 5C). The tonotopy of MSO is such that low-CF neurons are found most dorsally (22). The contralateral afferents to MSO arrive at this nucleus in a rostral position (giving rise to caudally directed delay lines) but also in a ventral position, so that the collaterals to high CF neurons are shorter than those to low CF neurons (20, 21). A similar bias does not seem to be present for the ipsilateral afferents, which may explain larger average BDs for low-CF, rather than for high-CF, neurons.
The existence of internal delays outside the physiological range in mammals and humans has invited several teleological speculations (4, 23–26). We surmise that a broad range of BDs cannot be avoided at low CFs, because cochlear phase delays at these frequencies are large and because exact matching of the cochlear position of monaural afferents to the binaural coincidence detectors requires an unattainable precision in wiring. Moreover, exact matching may be undesirable because it would restrain the strategies available to the central processor (27, 28).
Materials and Methods
Single unit recordings from the IC of 18 animals and the auditory nerve of six animals were obtained with the procedures described in refs. 11, 17, and 29. CF was determined with a threshold-tracking algorithm to binaural or monaural stimulation, whichever yielded the lowest threshold. Long-duration (1 or 5 s) bursts of pseudorandom broadband noise were presented at an average suprathreshold level of 30 dB through closed calibrated speakers. Noise tokens were presented in pairwise binaural (IC) or sequential monaural (auditory nerve) combinations of the original and inverted waveforms, allowing construction of difcors. Difcors allow a more refined analysis than regular noise-delay functions but are not critical to our results: Scatterplots of BDs of noise-delay functions show the same tapering distribution as in Fig. 2. Spikes were timed at their peak amplitude with 1-μs resolution.
To quantify the temporal relationships between nerve spiketrains, correlograms were constructed by comparing spike times. Such correlograms are a natural display to compare peripheral, monaural responses with binaural responses that are thought to result from a coincidence process (11, 15, 17, 29). The general scheme is illustrated in Fig. 3. Two sets of responses are available, consisting of multiple responses to conditions x and y. These conditions can differ in terms of the stimuli used (e.g., A+ vs. A−) or in being obtained from different fibers (fiber x vs. fiber y). The correlograms shown in Figs. 4 and 5 result strictly from the counting of coincidences, without any smoothing or fitting. All correlograms were normalized to factor out the effects of average spike rate (17). Pairs with weak correlation were excluded from further analysis by requiring that the peak of the cross-correlogram-based difcor was >1. The choice of cochlear disparities (here 0–0.5 mm) and cross-correlogram amplitude threshold are not critical. Widening of the cochlear disparity or lowering of the amplitude threshold result in more data points and more scatter, but the pattern of a decrease in the range of BDs with CF remains.
To the extent possible, we based the comparison of the delay distributions in auditory nerve and IC on CF (Figs. 2 and 5). However, tuning curves of low-frequency auditory nerve fibers are often shallow so that CF is not always well defined. This ambiguity is less of a problem with DF, which is based on much longer response samples. In the calculation of cross-correlograms, we therefore ranked each pair of neurons by DF rather than by CF, but similar distributions are obtained when delays are plotted as a function of DF rather than of CF.
Acknowledgments
We thank the editor and reviewers for their constructive comments. This work was supported by Fund for Scientific Research in Flanders (Belgium) Grants G.0083.02 and G.0392.05 and Research Fund of K.U. Leuven Grants OT/01/42 and OT/05/57.
Abbreviations
- BD
best delay
- CF
characteristic frequency
- DF
dominant frequency
- IC
inferior colliculus
- ITD
interaural time difference
- MSO
medial superior olive.
Footnotes
Conflict of interest statement: No conflicts declared.
This paper was submitted directly (Track II) to the PNAS office.
References
- 1.Jeffress L. A. J. Comp. Physiol. Psychol. 1948;41:35–39. doi: 10.1037/h0061495. [DOI] [PubMed] [Google Scholar]
- 2.Joris P. X., Smith P. H., Yin T. C. T. Neuron. 1998;21:1235–1238. doi: 10.1016/s0896-6273(00)80643-1. [DOI] [PubMed] [Google Scholar]
- 3.Konishi M. Annu. Rev. Neurosci. 2003;26:31–55. doi: 10.1146/annurev.neuro.26.041002.131123. [DOI] [PubMed] [Google Scholar]
- 4.McAlpine D., Jiang D., Palmer A. Nat. Neurosci. 2001;4:396–401. doi: 10.1038/86049. [DOI] [PubMed] [Google Scholar]
- 5.Hancock K. E., Delgutte B. J. Neurosci. 2004;24:7110–7117. doi: 10.1523/JNEUROSCI.0762-04.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Schroeder M. R. In: Psychophysics and Physiology of Hearing. Evans E. F., Wilson J. P., editors. New York: Academic; 1977. pp. 455–467. [Google Scholar]
- 7.Shamma S. A. J. Acoust. Soc. Am. 1989;86:989–1006. doi: 10.1121/1.398734. [DOI] [PubMed] [Google Scholar]
- 8.Bonham B. H., Lewis E. R. J. Acoust. Soc. Am. 1999;106:281–290. doi: 10.1121/1.427056. [DOI] [PubMed] [Google Scholar]
- 9.Peña J. L., Viete S., Funabiki K., Saberi K., Konishi M. J. Neurosci. 2001;21:9455–9459. doi: 10.1523/JNEUROSCI.21-23-09455.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Yin T. C. T., Kuwada S. J. Neurophysiol. 1983;50:1020–1042. doi: 10.1152/jn.1983.50.4.1020. [DOI] [PubMed] [Google Scholar]
- 11.Joris P. X. J. Neurosci. 2003;23:6345–6350. doi: 10.1523/JNEUROSCI.23-15-06345.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Yin T. C. T., Chan J. K., Irvine D. R. F. J. Neurophysiol. 1986;55:280–300. doi: 10.1152/jn.1986.55.2.280. [DOI] [PubMed] [Google Scholar]
- 13.Kiang N. Y. S., Watanabe T., Thomas E. C., Clark L. F. Discharge Patterns of Single Fibers in the Cat’s Auditory Nerve. Cambridge, MA: MIT Press; 1965. research monograph no. 35. [Google Scholar]
- 14.van der Heijden M., Joris P. X. J. Neurosci. 2003;23:9194–9198. doi: 10.1523/JNEUROSCI.23-27-09194.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Joris P. X., Louage D. H., Cardoen L., van der Heijden M. Hear. Res. 2006:216–217. doi: 10.1016/j.heares.2006.03.010. 19–30. [DOI] [PubMed] [Google Scholar]
- 16.Johnson D. H., Kiang N. Y. S. Biophys. J. 1976;16:719–734. doi: 10.1016/S0006-3495(76)85724-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Louage D. H., van der Heijden M., Joris P. X. J. Neurophysiol. 2004;91:2051–2065. doi: 10.1152/jn.00816.2003. [DOI] [PubMed] [Google Scholar]
- 18.Greenwood D. D. J. Acoust. Soc. Am. 1990;87:2592–2605. doi: 10.1121/1.399052. [DOI] [PubMed] [Google Scholar]
- 19.Liberman M. C. J. Acoust. Soc. Am. 1982;72:1441–1449. doi: 10.1121/1.388677. [DOI] [PubMed] [Google Scholar]
- 20.Smith P. H., Joris P. X., Yin T. C. T. J. Comp. Neurol. 1993;331:245–260. doi: 10.1002/cne.903310208. [DOI] [PubMed] [Google Scholar]
- 21.Beckius G. E., Batra R., Oliver D. L. J. Neurosci. 1999;19:3146–3161. doi: 10.1523/JNEUROSCI.19-08-03146.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Guinan J. J., Norris B. E., Guinan S. S. Int. J. Neurosci. 1972;4:147–166. [Google Scholar]
- 23.McAlpine D., Jiang D., Palmer A. Hear. Res. 1996;97:136–152. [PubMed] [Google Scholar]
- 24.van der Heijden M., Trahiotis C. J. Acoust. Soc. Am. 1999;105:388–399. doi: 10.1121/1.424628. [DOI] [PubMed] [Google Scholar]
- 25.McFadden D. J. Acoust. Soc. Am. 1972;54:528–530. doi: 10.1121/1.1913611. [DOI] [PubMed] [Google Scholar]
- 26.Louage D. H., Joris P. X., van der Heijden M. J. Neurosci. 2006;26:96–108. doi: 10.1523/JNEUROSCI.2339-05.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Loeb G. E., White M. W., Merzenich M. M. Biol. Cybern. 1983;47:149–163. doi: 10.1007/BF00337005. [DOI] [PubMed] [Google Scholar]
- 28.Carney L. H., Heinz M. G., Evilsizer M. E., Gilkey R. H., Colburn H. S. Acta Acustica United with Acustica. 2002;88:334–346. [Google Scholar]
- 29.Joris P. X., Van de Sande B., van der Heijden M. J. Neurophysiol. 2005;93:1857–1870. doi: 10.1152/jn.00962.2004. [DOI] [PubMed] [Google Scholar]