Abstract
Amplitude modulation is an important feature of communication sounds. A phenomenological model of the auditory pathway that reproduces amplitude modulation coding from the outer ear to the inferior colliculus is presented. It is based on Hewitt and Meddis’ work. To improve the temporal coding for high level stimuli, high spontaneous rate and low spontaneous rate auditory nerve fibers innervate chopper cells of the cochlear nucleus. Wideband inhibitory interneurons which limit high spontaneous rate fibers connected to chopper units are added in this nucleus. The realistic structure we propose gives results closer to physiological data in terms of synchronization.
I. Introduction
Complex sounds and especially communication sounds such as speech are characterized by amplitude modulation, also referred to as temporal envelope. Psychoacoustic studies (e.g. [18]) indicate that envelope information plays an important role in many perceptual phenomena including speech identification and auditory streaming. Physiological studies show that temporal coding of amplitude modulation takes place from the inner ear to the auditory cortex (for review, see [5]). In this paper, a model of amplitude modulation coding from the middle ear to the inferior colliculus (IC) is presented. At this level, modulation frequencies are extracted from the stimulus as some neurons respond to specific modulation frequencies.
The point neuron model of the auditory pathway is based on Hewitt and Meddis’ work [4]. This is a four-stage model including: the cochlea multichannel filterbank decomposition, the auditory nerve response via inner hair cells that act as rectifiers and compressors, the cochlear nucleus decomposition in terms of modulation frequency, and the coincidence detection in the inferior colliculus. This model reproduces physiological data except for high level stimuli where neuron synchronization decreases. So, we propose in this paper an update of Hewitt and Meddis’ structure to improve synchronization and find results closer to physiological data.
The main evolution is the use of high spontaneous rate (HSR) and low spontaneous rate (LSR) fibers to innervate ventral cochlear nucleus (VCN) cells. HSR fibers, which are used in [4], respond to low level stimuli and saturate for high level ones. LSR fibers begin to fire for medium level stimuli and saturate for very high ones. Figure 1 illustrates HSR and LSR behaviors. The chopper cell receives HSR and LSR fibers afferent activity. Those fibers are tuned to the same center frequency (CF). This is not enough to expand chopper unit dynamics because it saturates. That is why an onset inhibitory interneuron is added. It receives afferent from HSR auditory nerve fibers with a wide range of CFs which makes it a wideband inhibitory interneuron (WBII). When the HRS fibers, afferent of the chopper unit, saturate, they are inhibited by the WBII, then LSR fibers relay temporal information. Figure 2 shows the principle of cells connection in the VCN.
Fig. 1.

Raise of the firing rate of two auditory nerve fibers with the stimulus level. These auditory nerve fibers are characterized by their spontaneous firing rate. There are low and high spontaneous firing rate fibers (LSR and HSR fibers respectively). HSR is sensitive to low stimulus level and saturates for medium level stimulus. LSR is sensitive to medium level stimulus and saturates for high level stimulus.
Fig. 2.

Explicative scheme of cochlear nucleus neurons: innervations and connections. The chopper cell receives HSR and LSR auditory nerve afferents having the same CF. The onset inhibitory interneuron is connected with LSR fibers from a wide range of CFs. This interneuron inhibits HSR chopper cell excitatory afferent.
Physiological elements support this scheme. Firstly, Liberman [7] showed that each inner hair cell is connected with the three types of fibers (HSR, medium spontaneous rate and LSR). Secondly, Ferragamo [1] noticed that, in the VCN, stellate T cells are inhibited by stellate D ones. Stellate T cells correspond to multipolar cells in the VCN [12]. Chopper responses are associated with this cell [16]. Stellate D cells may correspond to giant multipolar cells and therefore have an onset response. Concerning afferent connections to those cells, Ryugio [17] found that LSR fibers give rise to greater collaterals in VCN than HSR fibers. Moreover, LSR fibers have a greater number of terminals in VCN [13]. Geometrical considerations lead us to suppose that LSR fibers innervate giant multipolar units.
II. Procedure for Paper Submission
The model we propose is composed of five stages: the outer and middle ear, the basilar membrane, the inner hair cells, the auditory nerve fibers, the cochlear nucleus and the inferior colliculus. The outer and middle ear filtering consists in four second order Butterworth filters in cascade. This is a bandpass function whose maximum is reached at 1.5 kHz.
The sound is then decomposed by the basilar membrane of the cochlea. Here the ‘dual resonance non linear’ model is used ([10], [8]). This model is interesting in its behavior because it reproduces intensity influence on basilar membrane filters selectivity and center frequencies.
Each point of the modeled basilar membrane stimulates an inner hair cell. These cells compress and adapt the signal to the auditory nerve. One inner hair cell is modeled by a three-stage model: receptor potential, synaptic calcium channel and transmitter release [19]. The first part transforms basilar membrane motion in hair cell membrane potential (receptor potential). The second part turns this potential into transmitter release probability which is used in the third part to determine the amount of neurotransmitters in the synaptic cleft. This corresponds to a probability for the connected fiber to fire. In this model, this probability is realized by a geometrical law parameterized by the number of fibers. The IHC-AN complex allows us to model two kinds of auditory nerve fibers: HSR ones and LSR ones. Those fibers are obtained by changing synaptic calcium channel and the transmitter release parameters. These two kinds of fiber innervate cells of the VCN. The WBII receives LSR fibers outputs with different CFs (from one octave lower and two higher than the CF of the chopper cell these fibers inhibit). Chopper cell receives two excitatory afferents from LSR and HSR fibers of same CF. These afferent fibers determine the chopper cell CF. Figure 3 gives a detailed schematic organization of the model for a single couple of WBI-chopper VCN cells. Results presented here are obtained with a five-channel basilar membrane (corresponding to five CFs; the WBII is connected with four auditory nerves).
Fig. 3.

Detailed scheme of the first stages of the model. After outer/middle ear (O/ME) filtering, the basilar membrane (BM) filterbank decomposes the signal in three nonlinearly filtered versions (characterized by their center frequency: CF1<CF2<CF3). They correspond to the motion of three points on the BM. The motion of each point is transformed in receptor potential (RP) of the corresponding inner hair cell (IHC). It influences synaptic calcium channels (SCC) which controls transmitter release (TR). The amount of neurotransmitters is converted in discharge in the auditory nerve (AN). All HSR auditory nerve fibers excite the onset WBI cell. Then three filters are used to model dendritic connections of the chopper cell (see text). The soma is a McGregor model.
Now the model of figure 3 has to be extended at the VCN level to produce inferior colliculus inputs. All IHC afferents are realized using a geometrical law and provide inputs to ten couples of WBI-chopper neurons. Outputs of these chopper cells converge in the inferior colliculus where they are averaged and filtered to excite coincidence detector soma. This is illustrated in figure 4.
Fig. 4.

Global scheme of the auditory pathway model from the outer ear to the inferior colliculus. First the stimulus is filtered by the outer/middle ear (O/ME), then the basilar membrane (BM) decomposes the sound in a tonotopic manner. Each center frequency (CF) of the BM filters activates an inner hair cell (IHC). Two kinds of IHC are used to model high spontaneous rate (HSR) auditory nerve fibers (white background) and low spontaneous rate (LSR) ones (grey background). Several fibers are connected to each IHC in order to connect 10 VCN groups of cells identically. Then the stellate cells outputs of those groups converge to the inferior colliculus (IC) coincidence detector unit.
III. Methods
A. Stimulus definition
Stimuli are defined to be as close as possible to experimental ones [15]. They consist in sinusoidal modulated tones. The carrier frequency corresponds to the CF of the channel with HSR and LSR fibers. The modulation depth is 50%. All stimuli last 0.6 s for VCN tests and 0.3 s for IC tests. They are shaped by rising and falling 25 ms cosine ramps.
B. Neuronal response measurement
Simulated responses are spike trains. To compare simulated data with physiological measurements, responses are evaluated in terms of average firing rate and envelope synchronization. Synchronization is evaluated using the modulation gain based on Goldberg and Brown index [2]. The relation between firing rate and modulation frequency is the rate modulation transfer function (r-MTF) and the relation between synchronization and modulation frequency is the temporal modulation transfer functions (t-MTF).
IV. Results
A. Cochlear nucleus responses
Figure 5 illustrates the difference in the modulation gain obtained with and without the WBII in the VCN. At low stimuli levels (10 dB and 30 dB), the synchronization of the two models is similar. At medium level (50 dB), the WBII improves VCN chopper response modulation gain of about 4 dB. Higher levels are not represented in figure 5 because physiological data to be reproduced correspond to 10, 30 and 50 dB stimuli levels. However, at high levels the new structure turns out to be more synchronized than the original one.
Fig. 5.

Response of a cochlear nucleus chopper unit. t-MTF of the model (A) without and (B) with the WBII. For low level stimulus, the two models are similar with a good synchronization for a wide band of modulation frequencies. The VCN cell becomes more and more selective as the stimulus level increases. For medium level stimulus, the model that includes WBII still has a good synchronization at the neuron best modulation frequency (100Hz here).
B. Cochlear nucleus responses
Figure 6 illustrates the impact of modeling WBII in the cochlear nucleus in terms of firing rate and synchronization. r-MTF curves are similar for both structures. They exhibit a peak at a specific modulation frequency at low intensities, and become flat at high intensities. At low intensity levels, t-MTF of the two models show a band-pass profile. As the intensity increases, it becomes more and more selective. At high level, the model without WBII has a noisy t-MTF contrary to the complete model whose t-MTF is still synchronized at the modulation frequency for which the neuron reaches its maximum firing rate (best modulation frequency).
Fig. 6.

Response of an inferior colliculus coincidence detector unit. (A) r-MTF and t-MTF curves are obtained without WBII in the cochlear nucleus. (B) r-MTF and t-MTF are obtained with the model including WBII According to physiological measurements [15], high level stimuli are synchronized at the best modulation frequency (see text) of the neuron with WBII included in the VCN.
V. Discussion
The model response without WBII structure is closer to physiological measurements [15] than Hewitt and Meddis first results. This is due to the new basilar membrane model [10] and corresponding inner hair cells. The addition of WBII improves high level stimuli response. This is explained as follows. For low level stimuli, HSR fibers of the right CF drive this unit. At medium stimuli level, when HSR fibers saturate, LSR fibers convey synchrony information. Of course WBI inhibition is active, but it just acts as a limiter. For high level stimuli carrier, when both HSR and LSR fibers saturate, inhibition gives chopper cell synchronization by the wideband afferent. This is an example of how inhibition could be an information vector and not just an information modulator.
These results may be improved considering the study by Guérin [3]. It shows that relaxing constraint on chopper cell time constant makes the model more selective.
VI. Conclusion
This study confirms the natural relation that exists between temporal coding and dynamics. Moreover this work is a complement to other studies about temporal processing in the early stage of the auditory pathway.
Each stage of the physiological model (BM, IHC, AN) was already validated in different aspects, reproducing post stimulus time histograms, t-MTF, r-MTF, and forward masking measurements. The structure we proposed here improves results produced by Meddis [4] for high level stimulus in terms of response synchronization. Given Pressnitzer’s work [14], one can suppose that this model will probably be able to reproduce comodulation masking results. Moreover, this structure generalizes Nelson and Carney’s “same frequency inhibition and excitation” [11]. This lets us hope to reproduce other types of inferior colliculus responses like those measured by Krishna and Semple [6] when varying WBII and chopper cell time constant.
VII. Model Setting
All notations used in this section correspond to those used in corresponding articles.
TABLE I.
Basilar Membrane Settings [10]
| CF(Hz) | 826 | 1297 | 1980 | 2970 | 4400 |
TABLE II.
IHC Receptor Potential Settings [19]
| M | GCA max (nS) | [Ca2+]thr(*10−11) | |
|---|---|---|---|
| HSR | 10 | 8 | 4.48 |
| LSR | 8 | 2.75 | 4 |
TABLE III.
| τM ms | τGK ms | C | B | Th0 mV | Ek mV | Er mV | |
|---|---|---|---|---|---|---|---|
| VCN chopper | 5 | 10 | 0.3 | 0.1 | 10 | −16 | 0 |
| VCN onset | 5 | 10 | 0.3 | 0.1 | 14 | −10 | 0 |
| IC | 1 | 0.1 | 0 | 0.1 | 14 | −10 | 0 |
Dendritic filtering time constant is 0.1 ms. AP is 50 mV
References
- 1.Ferragamo MJ, Golding NL, Oertel D. Synaptic inputs to stellate cells in the ventral cochlear nucleus. J Neurophysiol. 1998;79:51–63. doi: 10.1152/jn.1998.79.1.51. [DOI] [PubMed] [Google Scholar]
- 2.Goldberg JM, Brown PB. Responses of binaural neurons of dog superior olivary complex to dichotic tonal stimulation: some physiological mechanisms of sound localization. J Neurophysiol. 1969;32:940–958. doi: 10.1152/jn.1969.32.4.613. [DOI] [PubMed] [Google Scholar]
- 3.Guérin A, Le Bouquin Jeannès R, Bès J, Faucon G, Lorenzi C. Evaluation of two computational models of amplitude modulation coding in the inferior colliculus. Hear Res. 2006;211:54–62. doi: 10.1016/j.heares.2005.10.001. [DOI] [PubMed] [Google Scholar]
- 4.Hewitt MJ, Meddis R. A computer model of amplitude modulation sensitivity of single units in the inferior colliculus. J Acoust Soc Am. 1994;95:2145–2159. doi: 10.1121/1.408676. [DOI] [PubMed] [Google Scholar]
- 5.Joris PX, Schreiner CE, Rees A. Neural representation of amplitude-modulated sounds. Physiol Rev. 2004;84:541–577. doi: 10.1152/physrev.00029.2003. [DOI] [PubMed] [Google Scholar]
- 6.Krishna BS, Semple MN. Auditory temporal processing: response to sinusoidally amplitude modulated tone in the inferior colliculus. J Neurophysiol. 2000;22:255–273. doi: 10.1152/jn.2000.84.1.255. [DOI] [PubMed] [Google Scholar]
- 7.Liberman MC. Central projection of auditory-nerve fibers of differing spontaneous rate. I. Anteroventral cochlear nucleus. J Comp Neural. 1991;313:240–258. doi: 10.1002/cne.903130205. [DOI] [PubMed] [Google Scholar]
- 8.Lopez-Poveda EA. An approximate transfer function for the dual-resonance nonlinear filter model of auditory frequency selectivity. J Acoust Soc Am. 2003;114:2112–2117. doi: 10.1121/1.1605389. [DOI] [PubMed] [Google Scholar]
- 9.MacGregor RJ. Neural and Brain Modeling. Academic; San Diego, CA: 1987. [Google Scholar]
- 10.Meddis R, O’Mard LP, Lopez-Poveda EA. A computational algorithm for computing nonlinear auditory frequency selectivity. J Acous Soc Am. 2001;109:2852–2861. doi: 10.1121/1.1370357. [DOI] [PubMed] [Google Scholar]
- 11.Nelson PC, Carney LH. A phenomenological model of peripheral and central neural responses to amplitude modulated tones. J Acoust Soc Am. 2004;116:2173–2186. doi: 10.1121/1.1784442. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Oertel D, Wu SH, Garb MW, Dizack C. Morphology and physiology of cells in slice preparations of the posteroventral cochlear nucleus of mice. J Comp Neural. 1990;295:136–154. doi: 10.1002/cne.902950112. [DOI] [PubMed] [Google Scholar]
- 13.Oertel D, Fay RR, Popper AN. Integrative function in the mammalian auditory pathway. Springer-Verlag; New York: 2002. [Google Scholar]
- 14.Pressnitzer D, Meddis R, Winter IM. Physiological correlates of comodulation masking release in the mammalian ventral cochlear nucleus. J Neurosci. 2001;21:6377–6386. doi: 10.1523/JNEUROSCI.21-16-06377.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Rees A, Palmer AR. Neuronal response to amplitude modulated and pure tone stimuli in the guinea pig inferior colliculus, and their modification with broadband noise. J Acoust Soc Am. 1989;85:1978–1994. doi: 10.1121/1.397851. [DOI] [PubMed] [Google Scholar]
- 16.Rhode WS, Smith PH, Oertel D. Physiologicalresponse properties of cells labeled intracellulary with horseradish peroxidase in the ventral cochlear nucleus. J Comp Neurol. 1983;213:426–447. doi: 10.1002/cne.902130407. [DOI] [PubMed] [Google Scholar]
- 17.Ryugo DK, Parks TN. Primary innervation of the avian and mammalian cochlear nucleus. Brain Res Bulletin. 2003;60:435–456. doi: 10.1016/s0361-9230(03)00049-2. [DOI] [PubMed] [Google Scholar]
- 18.Smith ZM, Delgutte B, Oxenham AJ. Chimaeric sounds reveal dichotomies in auditory perception. Nature. 2002;416:87–90. doi: 10.1038/416087a. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Sumner CJ, Lopez-Poveda EA, O’Mard LP, Meddis R. A revised model of the inner-hair cell and auditory nerve complex. J Acoust Soc Am. 2002;111:2178–2188. doi: 10.1121/1.1453451. [DOI] [PubMed] [Google Scholar]
