Skip to main content
The Journal of the Acoustical Society of America logoLink to The Journal of the Acoustical Society of America
. 2010 Apr;127(4):2420–2432. doi: 10.1121/1.3337233

Distortion product emissions from a cochlear model with nonlinear mechanoelectrical transduction in outer hair cells

Yi-Wen Liu 1, Stephen T Neely 1
PMCID: PMC2865700  PMID: 20370025

Abstract

A model of cochlear mechanics is described in which force-producing outer hair cells (OHC) are embedded in a passive cochlear partition. The OHC mechanoelectrical transduction current is nonlinearly modulated by reticular-lamina (RL) motion, and the resulting change in OHC membrane voltage produces contraction between the RL and the basilar membrane (BM). Model parameters were chosen to produce a tonotopic map typical of a human cochlea. Time-domain simulations showed compressive BM displacement responses typical of mammalian cochleae. Distortion product (DP) otoacoustic emissions at 2f1f2 are plotted as isolevel contours against primary levels (L1,L2) for various primary frequencies f1 and f2 (f1<f2). The L1 at which the DP reaches its maximum level increases as L2 increases, and the slope of the “optimal” linear path decreases as f2f1 increases. When primary levels and f2 are fixed, DP level is band passed against f1. In the presence of a suppressor, DP level generally decreases as suppressor level increases and as suppressor frequency gets closer to f2; however, there are exceptions. These results, being similar to data from human ears, suggest that the model could be used for testing hypotheses regarding DP generation and propagation in human cochleae.

INTRODUCTION

Nonlinear growth is characteristic of healthy mammalian cochleae (Rhode, 1971). Introducing nonlinearity to the damping coefficient of cochlear models (Kim et al., 1973; Hall, 1974) enabled the simulation of distortion products (DPs) and other nonlinear responses. In several subsequent studies, the damping coefficient was allowed to be negative at low intensity so sounds were amplified in a frequency- and place-specific manner. As a result, model responses achieved high sensitivity and sharp tuning typical of mammalian hearing (Davis, 1983; Neely and Kim, 1983, 1986). Since then, understanding of the biophysical foundations for the putative “cochlear amplifier” has improved (see Dallos, 2008, for a review). Models proposed in recent years (Lu et al., 2006; Ramamoorthy et al., 2007; Liu and Neely, 2009; Rabbitt et al., 2009) indicated that outer hair cells (OHCs) likely provide cycle-by-cycle amplification to cochlear traveling waves because of their capabilities to convert mechanical and electrical energy in both directions (Brownell et al., 1985; Hudspeth, 1997). However, these newer results were obtained by linear analysis in the frequency domain; time-domain simulation of DP and other nonlinear responses from these models has not yet been reported.

The goals of the present study are (1) to construct a nonlinear cochlear model that incorporates recent findings in OHC biophysics and (2) to test the model’s time-domain response to single and multitone stimuli for a broad range of stimulus conditions. Of particular interest is the simulation of distortion-product otoacoustic emissions (DPOAEs). First observed in late 1970s (Kemp, 1978; Kim et al., 1980), DPOAEs have been measured for a wide range of stimulus frequencies and levels (e.g., Gaskill and Brown, 1990). Damage to the OHCs causes reduction of DPOAEs (Zurek et al., 1982), confirming OHCs’ contribution to cochlear nonlinearity. Computer simulation of DPOAE emerged in the 1990s. Using a variation of the Neely and Kim (1986) model, Kanis and de Boer (1993) calculated DP levels via a frequency-domain iterative quasilinear approach. Results suggested that the source of DP is distributed, and that DP travels backward to the middle ear via a slow transverse wave. A later revision of the Neely and Kim (1986) model included an explicit representation of the membrane potential in OHCs (Neely, 1993). Subsequently, simulation of DP was carried out using a time-domain approach (Neely and Stover, 1994); when the frequency of one primary tone (f2) was fixed while the other (f1,f1<f2) was swept, DP level had a band-pass filtering characteristic similar to that observed in humans and rodents (Brown et al., 1992). When the levels of the primaries were swept, the revised model also produced DPOAE input-output (I∕O) functions typical of humans (Neely et al., 2000). In the interim, theoretic treatment of DPOAE was conducted by Talmadge et al. (1998) using a frequency-domain, perturbative approach to explain fine structures and the band-pass filtering effect.

The present study is an extension to modeling efforts previously described in a few ways. First, the present model describes OHC biophysics in more detail (Liu and Neely, 2009), which makes it possible to place the nonlinearity specifically in the mechanoelectrical transduction (MET) channel of OHCs. The advantage of modeling OHCs biophysics is discussed in Sec. 5. Second, DPOAEs for a wide range of primary levels and primary frequencies were simulated so results can be compared to data recently reported by Johnson et al. (2006). In addition, model responses to three-tone stimuli in a DP suppression paradigm are compared to data from normal-hearing humans (Rodríguez et al., 2010).

It is noteworthy that, although DPOAE is a robust phenomenon routinely used for clinical purposes, there is still a debate regarding DP propagation. The existence of slow-traveling waves has been challenged (Nobili et al., 2003), and transmission-line modeling efforts (e.g., Neely and Kim, 1983) have been criticized. Nobili et al. (2003) proposed that the motion of the stapes is coupled to the basilar membrane (BM) via fluid compressional waves. This theory found supports from a laser-scanning measurement of BM motion (Ren, 2004), which seems to suggest reverse propagation of DPs via compressional (fast) waves. However, the interpretation was based on the phase velocity of DP and its flaws have been noted (Shera et al., 2007). By simultaneous measurements of DPOAE and intracochlear acoustic pressure, Dong and Olson (2008) found evidence for reverse-traveling waves in the phase response of DPOAE relative to the phase response of intracochlear DPs. As will be described in Sec. 2, the macromechanical model for the present study is a transmission-line model. Therefore, results presented in this paper are based on the slow-traveling wave theory.

Besides DPs, a plethora of nonlinear phenomena has been observed in cochlear mechanics. Some of them have been reproduced in computational models, including level-dependent latency of transient responses (Neely, 1988), instantaneous-frequency glide in transient responses (Hubbard et al., 2003), asymmetric onset and offset response time (Zhang et al., 2009), and an in-depth investigation (Ku et al., 2009) of spontaneous otoacoustic emissions (SOAEs). Transient responses are beyond the scope of this paper, while generation of SOAEs will be discussed in Sec. 5.

The rest of this paper is organized as follows. Models for OHC biophysics, cochlear mechanics, and middle-ear mechanics are described in Sec. 2. Parameter selection and methods for time-domain simulation are described in Sec. 3. Model responses to single and multitone stimuli are reported in Sec. 4. Discussion is given in Sec. 5, followed by conclusions in Sec. 6.

MODELS

Sections 2A, 2B, 2C describe the present model for cochlear mechanics, and Sec. 2D describes a simple model for the mechanics of the middle ear. Cochlear and middle-ear parameters are listed in Table 1, 2, respectively.

Table 1.

Parameters for cochlear mechanics.

  Meaning (unit) Base Mid Apex
  Organ of Corti mechanical parameters
M Mass in OHC load impedance (g) 2.8×10−8 5.0×10−7 2.8×10−5
R Resistance in OHC load impedance (g s−1) 9.4×10−4 9.2×10−4 2.7×10−3
K Stiffness in OHC load impedance (g s−2) 200 11 0.76
m BM mass per unit area (g cm−2) 3.8×10−5 2.8×10−4 2.1×10−3
r BM resistance per unit area (g s−1 cm−2) 1.5 3.2 8.6
k BM stiffness per unit area (g s−2 cm−2) 5.9×105 4.0×104 1.6×103
 
  Outer hair cell electromechanical properties
αd MET’s sensitivity to RL displacement (A∕m) 1.6×10−3 6.2×10−4 2.0×10−4
αv MET’s sensitivity to RL velocity (C∕m) 4.4×10−6 1.8×10−6 6.8×10−7
Imax Maximum range of OHC receptor current (pA) 670 320 83
T Piezoelectric transformer ratio (m∕C) 2.4×106 2.4×106 2.4×106
G Membrane conductance (nS) 91 51 33
C Membrane capacitance (pF) 14 32 79
Cg Gating capacitance (pF) 18 33 70
 
  Physical dimensions
A Cochlear cross-sectional area (cm2) 6.3×10−2 1.4×10−2 3.1×10−3
w BM width (cm) 0.031 0.040 0.051
L Length of cochlea (cm) 3.5    

Table 2.

Parameters for middle-ear mechanics.

  Meaning (unit) Value
Maleus-incus-eardrum parameters
Am Area of eardrum (cm2) 0.5
Mm Effective mass (g) 8.5×10−3
Rm Effective resistance (g s−1) 20
Km Effective stiffness (g s−2) 1.5×105
gm Maleus-incus lever ratio 0.7
 
Incudo-stapedial joint
Ri Resistance (g s−1) 400
Ki Stiffness (g s−2) 5.0×106
 
Stapes parameters
As Area of stapes footplate (cm2) 0.0625
Ms Effective mass (g) 5.0×10−3
Rs Effective resistance (g s−1) 80
Ks Effective stiffness (g s−2) 5.0×105
 
Round-window parameters
Ar Area of round window (cm2) 0.0625
Mr Effective mass (g) 5.0×10−3
Rr Effective resistance (g s−1) 20
Kr Effective stiffness (g s−2) 1.5×105

Outer hair cells: Mechanoelectrical transduction

The receptor current that flows into an OHC is modulated by deflection of its hair bundle (HB). We previously proposed to model this as follows (Liu and Neely, 2009):

ir=αvξ˙r+αdξr, (1)

where ir denotes the receptor current, and ξr and ξ˙r denote the reticular-lamina (RL) displacement and velocity, respectively; tectorial-membrane motion is not considered explicitly in favor for simplicity here.1 In the present study, Eq. 1 is generalized by introducing nonlinearity to it:

ir=I(αvξ˙r+αdξr), (2)

where I(⋅) denotes an arbitrary nonlinear function. For the present study, we defined I(⋅) as an antisymmetric function:

I(η)Imax(11+exp(4ηImax)12)=Imax2tanh2ηImax, (3)

where η=αvξ˙r+αdξr. Note that the full range of current output is Imax, and the slope of I(⋅) is unity at the origin:

|Iη|η=0=1.

Experiments have shown that the nonlinear function I(⋅) should be an asymmetric function because of rectification; ir is larger when the HB is deflected toward the tallest row of stereocilia than ir is negative when the HB is rarified (e.g., Corey and Hudspeth, 1983; Ricci et al., 2005). Nevertheless, the nonlinearity given by Eq. 3 was suitable for this study because the main concern was to simulate DP at 2f1f2, which is an odd-order distortion.2

Outer hair cells: Electromotility

The present study adopts a one-dimensional (1D) piezoelectric OHC model that we previously proposed (Liu and Neely, 2009). The underlying assumptions of the model are summarized below. First, the receptor current ir is the sum of capacitive, conductive, and gating components:

ir=CdVdt+GV+dQdt, (4)

where C and G are the capacitance and the conductance of the plasmic membrane, respectively, V is the transmembrane potential, and Q is the charge accumulation that accompanies electromotility. Second, the OHC contraction displacement ξo is linearly proportional to Q,

ξo=TQ, (5)

where T represents a piezoelectric constant. Finally, Q is a Boltzmann function of V˜VTfOHC, where fOHC is a contraction force generated by the OHC.

For the present study, the Boltzmann function Q(V˜) is linearized so the only source of nonlinearity is from the MET channel. Linearization of Q(V˜) is legitimate if V˜ is small in comparison to the voltage scale in the Boltzmann function—this appears to be the case at the stimulus levels of interest (SPL≤100 dB, see Sec. 5 for further details). Therefore, Eq. 4 can be rewritten as

ir=CdVdt+GV+CgdV˜dt, (6)

where CgQV˜ denotes a gating capacitance that is approximately constant in time.

In the present study, it is assumed that the OHC contracts and stretches against a simple mechanical load:3

fOHC=Mξ¨o+Rξ˙o+Kξo, (7)

where M, R, and K are the effective mass, resistance, and stiffness, respectively.

Cochlear macromechanics

The macromechanics of the cochlea are governed by Newton’s laws and the principle of continuity. In a one-dimensional nonviscous model (Dallos, 1973), Newton’s second law requires that

xP=ρAU˙, (8)

where P denotes the pressure difference between two cochlear chambers (scala vestibuli and scala tympani), x denotes the longitudinal direction from base to apex, ρ denotes the effective fluid mass density, A denotes the cross-sectional area of the fluid chamber, and U denotes the volume velocity along the x-direction. The present study assumes that the fluid is incompressible, and the principle of continuity is represented by the following equation4 (Neely and Liu, 2009):

xU=wξ˙r, (9)

where w is the width of cochlear partition. The displacement ξb of the BM, equal to the sum of ξr and ξo, is driven by the pressure P:

mξ¨b+rξ˙b+kξb=P, (10)

where m, r, and k are mass, resistance, and stiffness of BM per unit area.

The boundary condition at the apical end of the cochlea is

xP|x=L=ρAmhP, (11)

where mh=110 g∕cm4 is an acoustic inductance which represents the mass of the fluid at the helicotrema [Puria and Allen, 1991, Eq (11)]. The boundary condition at the basal end of the cochlea is

xP|x=0=ρv˙s, (12)

where vs denotes the velocity of the stapes.

Modeling the middle ear

The present middle-ear model, adapted from Matthews (1983), is aimed to reproduce adequately the pressure magnitude transfer functions measured from human cadavers (e.g., Puria, 2003; Nakajima et al., 2009) without pursuing other details in middle-ear mechanics. A schematic diagram of the middle ear is shown in Fig. 1. The malleus, the incus, and the eardrum are lumped into one system as suggested by Zwislocki (1962), while any motion on the eardrum that is not coupled to the ossicular chain is ignored. The malleus-incus-eardrum system is characterized by parameters {Mm,Rm,Km}. The malleus-incus lever ratio is denoted as g (g≤1). The incudo-stapedial joint (ISJ) is characterized by parameters {Ri,Ki}. The stapes and its surrounding structures are represented by parameters {Ms,Rs,Ks}.

Figure 1.

Figure 1

Schematic diagram of middle-ear mechanics. Pressure and displacement are coupled between the ear canal (to the left) and the cochlear fluid (to the right). Subscript m denotes the malleus-incus-eardrum system, i denotes incudo-stapedial joint, and s denotes stapes.

Given the diagram in Fig. 1, the acoustic pressure5PED at the eardrum and the cochlear fluid pressure PFL at the stapes are coupled to each other via the following equations:

Mmv˙m=KmxmRmvm+gfi+PEDAe, (13a)
(Ms+Mr)v˙s=(Ks+Kr)xs(Rs+Rr)vsfiPFLAs, (13b)

where fi=Ki(xsgxm)+Ri(vsgvm) is the force transferred through the ISJ. In the preceding equations, Ae and As are effective areas of the eardrum and the stapes footplate, respectively; xm and vm, respectively, denote the displacement and velocity of the malleal system, and xs and vs denote the displacement and the velocity of the stapes, respectively. Also, in Eq. 13b, parameters {Mr,Rr,Kr} represent the round window.6 The present middle-ear model is similar to that of Talmadge et al. (1998) except for a nonrigid ISJ here. Middle-ear frequency responses are reported in Appendix A.

SIMULATION METHODS

Fine-tuning cochlear parameters

Table 1 shows the parameters used in the present cochlear model: the values listed are for the base (x=0), the longitudinal midpoint (x=L∕2), and the apex (x=L). Parameter values intermediate to these three locations were determined by log-quadratic interpolation.7 These values were carefully chosen to produce (1) tonotopic mapping typical of humans, (2) cochlear excitation profiles and frequency responses typical of mammals, and (3) nonlinear compression of BM motion typical of mammals. Approximation formulas were derived from frequency-domain analysis to facilitate the parameter-tuning process; more details are described in Appendix B. Parameter values listed in Table 1 are a result of this fine-tuning process. Then, these parameters were used in simulation of DP and DP suppression.

Delivering the stimulus

To simulate experimental conditions in a typical DPOAE measurement paradigm, we assumed that the stimulus is delivered as a force f(t) that acts on a diaphragm, the diaphragm is latched on one end to a coupler inserted in the ear canal; at the other end of the coupler is the eardrum. Therefore, the dynamics of the diaphragm are described by the following equation:

Mdv˙d=f(t)RdvdKdxdPdAd, (14)

where vd and xd denote the velocity and the displacement of the diaphragm, respectively, Pd denotes the pressure in the enclosed space, and Ad is the area of the diaphragm. Two further assumptions about the coupler were made: first, the coupler is acoustically lossless; second, the physical dimension of the coupler is much smaller than the shortest wavelengths of interest. Therefore, Pd is approximately equal to the pressure PED at the eardrum in Eq. 13a, and the enclosed volume of air is acoustically compliant:

Pd=Kc(xdAdxmAe). (15)

Parameter values were as follows:8Kc=8.5×105 dyn∕cm5, Md=5×10−3 g, Rd=1.4×103 g∕s, Kd=4×108 g∕s2, and Ad=0.75 cm2.

State-space formulation and numerical integration in time

To conduct a time-domain simulation, a minimal set of variables (displacements, velocities, currents, or voltages) were selected as state variables so their rates of change could be determined instantaneously given their present state and the stimulus. Then, the state variables were integrated numerically with respect to time.

The following variables were chosen as state variables: diaphragm variables {xd,vd}, middle-ear variables {xm,vm,xs,vs}, and cochlear variables including RL displacement ξr(x), RL velocity ur(x), OHC contraction velocity uo(x), OHC membrane potential V(x), and the gating charge Q(x). With some algebra, Eqs. 5, 6, 7, 10 can be rearranged as follows:

ξ˙r(x)=ur, (16a)
Q˙(x)=T1uo, (16b)
u˙r(x)=Ruo+KTQMVCg1QTMr(ur+uo)+k(ξr+TQ)mP(x)m, (16c)
u˙o(x)=Ruo+KTQM+VCg1QTM, (16d)
V˙(x)=1C(ir(ur,ξr)GVT1uo). (16e)

In Eq. 16e, ir(urr) is the nonlinear function defined in Sec. 2A. Note that although P(x) on the right hand side of Eq. 16c is not a state variable, it could be solved instantaneously given the state variables. By combining Eqs. 8, 9, the following approximation was derived,

x2P=ρA(xtUtUxAA), (17a)
ρAwt2ξr. (17b)

The second term on the right hand side of Eq. 17a was neglected, assuming that A−1xA≈0 (i.e., the taper of the area is small compared to its overall size). Combining Eqs. 17b, 16c, we obtained the following relation between P(x) and state variables:

(x2ρwmA)P=l(ur(x),uo(x),ξr(x),Q(x),V(x)), (18)

where l(⋅) denotes a linear combination. Then, Eq. 18 with boundary conditions described in Sec. 2C was solved numerically by a finite-difference method with N=700 discrete cochlear segments. The computation load was O(N).9

To summarize, the state vector consisted of 3500 cochlear variables and six other variables, and their rates of change were determined by Eqs. 13, 14, 16, 18 given their present state and the stimulus f(t). Let us denote the state vector as x(t), and its time derivative as v(x). State-space equations were numerically integrated with respect to time in steps of Δt=6.25 μs. For each step in time t, x(t) was updated by a modified Sielecki method (Diependaal et al., 1987) in two steps; first, x(tt) was estimated by extrapolation:

x^=x+Δtv(x). (19)

Then, v was re-evaluated at x^, and the state variables were updated as follows:

x(t+Δt)=x(t)+Δtv(x(t))+v(x^)2. (20)

RESPONSES TO STIMULI

Cochlear tuning and latency for low-level stimuli

To calculate cochlear tuning properties at low intensity, a wide-band (0.32–12 kHz) click was delivered to the diaphragm. The level of the click was set sufficiently low (peak-equivalent SPL<32 dB at the eardrum) so that the OHC receptor currents ir(t) were no more than 2.5% of Imax in Eq. 3 anytime and anywhere in the cochlea. The simulation ran for 70 ms, long enough for the traveling wave to reach the apical region of the cochlea.

In Fig. 2, the stapes-to-RL displacement gain and its group delay are plotted against frequency for nine different locations in the cochlea.10 Characteristic frequency (CF) (i.e., frequency of maximum gain) decreases from base to apex. As shown in Fig. 2A, the displacement gain at CF increases from 23 dB at the most apical location to 62 dB near the base. The group delay, when expressed in number of cycles, also reaches its maximum Nmax near CF for every location.

Figure 2.

Figure 2

Frequency responses at selected locations in the cochlea. Curves represent responses at nine different locations in equal distances: x={0.9,0.8,…,0.1} times the length the cochlea, respectively. Characteristic frequency decreases as x increases (i.e., toward the apex). (A) RL-to-stapes displacement gain. (B) RL displacement group delay.

By inspection, the responses in Fig. 2A are more sharply tuned at basal locations than apically. Further analyses show that the quality factor in terms of the equivalent rectangular bandwidth11 (QERB) increases from 1.8 to 9.9 from apex to base (Fig. 3, top panel). Between 0.5 and 10 kHz, the present values of QERB are similar to those of cats and guinea pigs derived from auditory-nerve recordings (Shera et al., 2002). The values are also similar to human QERB values derived from DP suppression tuning data (Gorga et al., 2008); however, they are smaller than the QERB of humans derived from psychoacoustic experiments (Glasberg and Moore, 1990). Additionally, the model QERB seems to flatten between 1 and 10 kHz, in agreement with older psychoacoustic findings but different from more recent data that suggest that (a) QERB>10 for a wide range of frequencies, and (b) QERB continues to increase at higher frequencies (Shera et al., 2002).

Figure 3.

Figure 3

Model QERB (top panel) and latency Nmax (bottom panel) plotted against characteristic frequency. Results are compared with curves derived from experimental data across three species. Human QERB=9.26CF∕(CF+230 Hz) is given by Glasberg and Moore (1990). Linear fits of human Nmax and cat and guinea pig QERB and Nmax are given by Shera et al. (2002).

The maximum group delay Nmax for each location is plotted against CF in the bottom panel of Fig. 3, in comparison to cochlear latency estimated from stimulus-frequency otoacoustic emission (SFOAE) data across different species (Shera et al., 2002). The model latency Nmax is shorter than that of human SFOAE forward latency but longer than that of cats and guinea pigs. Note that the model Nmax also seem to flatten at above 1 kHz, a feature shared with the model QERB but not observed in SFOAE data. The model latency, when expressed in absolute time, ranges from 7.0 ms at the 0.5 kHz place to 0.8 ms at the 8 kHz place. Compared to estimates of human cochlear forward latency based on auditory brainstem responses (Neely et al., 1988; Harte et al., 2009), the model latency is similar at 0.5 kHz but shorter at 8 kHz.

Compression of single tones

Figure 4 shows BM (thin lines) and RL (thick lines) magnitude responses to a single tone at two different frequencies for input levels L0 from 0 to 100 dB sound pressure level (SPL) in 10 dB steps. The response plotted at every location is the magnitude of BM or RL displacement at the frequency of the stimulus f0; harmonic distortions are omitted.

Figure 4.

Figure 4

Excitation patterns for pure-tone stimuli. BM displacement (thin lines) and RL displacement (thick lines) along the cochlea are plotted as the input level varies from 0 to 100 dB SPL in 10 dB steps. (A) Stimulus frequency f0=4 kHz. (B) f0=500 Hz.

For each frequency, the magnitude response is compressed near its characteristic place (CP). For f0=4 kHz, RL displacement at CP (x=1.1 cm) grows nearly linearly from L0=0–30 dB (Fig. 5). When L0 increases from 30 to 90 dB, the RL displacement at the CP increases only by an order of magnitude and the best place shifts toward the base (Rhode and Robles, 1974). For f0=500 Hz, the response is nearly linear for L0≤40 dB while the compression is more prominent for L0=50–90 dB. The magnitude response is approximately linear near the base for f0=500 Hz; the excitation pattern forms parallel lines for the first 2.0 cm from the stapes [Fig. 4B]. The near-linear growth at low intensity is consistent with recent experimental findings (Rhode, 2007, Fig. 1), in contrast to a cubic-root growth of amplitude predicted by “essential nonlinearity” (Eguíluz et al., 2000) and Hopf bifurcation theory (Stoop and Kern, 2004). However, for f0=4 kHz, this presumably linear growth is confounded by a standing-wave pattern that is prominent for L0=30–70 dB. This standing-wave phenomenon indicates a reflection from the CP and was explained as a consequence of self-suppression of the forward-going waves for intermediate input levels (Kanis and de Boer, 1993, Fig. 2).

Figure 5.

Figure 5

Nonlinear growth of RL and BM displacements at CP, plotted against stimulus intensity. The stimulus is either a 4 kHz or a 500 Hz pure tone.

Compared to BM excitation patterns, the RL excitation patterns in Figs. 4A, 4B have higher tip-to-tail gains and are more sharply tuned. Since RL motion is more directly related to neural excitation, the present results suggest a distinction between tuning curves recorded from auditory nerves (e.g., Pfeiffer and Kim, 1975; van der Heijden and Joris, 2006) and BM tuning curves obtained by motion-sensing techniques (e.g., Rhode, 1971; Ruggero et al., 1997).

Figure 5 shows the RL and BM displacements at CP as a function of stimulus level; their rate of growth (ROG), defined as the slope of the I∕O function, is plotted in Fig. 6. As can be seen in Fig. 6, the response to 4 kHz stimuli is most compressive between 60 and 70 dB SPL, reaching a minimum ROG of 0.19 and 0.25 for RL and BM, respectively. The response to 500 Hz stimuli is linear for a more extended input range; the response is most compressive between 70 and 80 dB SPL, reaching minimum ROGs of 0.24 and 0.33 for RL and BM, respectively.

Figure 6.

Figure 6

RL and BM response rate of growth (ROG, see the text for definition), plotted against stimulus intensity.

The model results for the 4 kHz stimuli have comparable ROG to BM vibration data from the 9 kHz place (Ruggero et al., 1997, Fig. 3: ROG=0.2–0.5) and the 6 kHz place (Rhode, 2007, Fig. 3: ROG∼0.3) in chinchilla cochleae. However, experimental results showed individual variability, and a more compressive BM response has also been reported (Rhode, 2007, Fig. 4: ROG∼0.1).

Distortion-product otoacoustic emissions

To simulate DPOAE, the stimulus f(t) was comprised of two tones at frequencies {f1,f2} and levels {L1,L2}. The amplitude of the two tones was calibrated so that the SPL varied from 40 to 80 dB for L1 and from 20 to 70 dB for L2 in 2 dB steps, respectively. The range of L2 is similar to that of a typical experiment on human subjects (e.g., Johnson et al., 2006; Long et al., 2009). A 5 ms cosine-square ramp was used for the onset of the stimulus. For each combination of {L1,L2}, the simulation ran for a sufficiently long time (more than the duration of the ramp plus 40 cycles of f1) so that the DPOAE at the eardrum reached a steady level. Then, the magnitude Ld of DPOAE at fd=2f1f2 was calculated using discrete Fourier transform (DFT). The window length of DFT was set so that it contained minimal integer cycles of f1 and f2, respectively—for example, if f2 is 4 kHz and f2f1=1.2, the window length is 6 cycles of f2, or 1.5 ms.

Results for 16 different conditions of primary frequencies {f1,f2} are reported in Fig. 7: four f2’s (1, 2, 4, or 8 kHz) by four primary-frequency ratios (f2f1=1.4, 1.3, 1.2, or 1.1). Each column represents a fixed f2 and each row represents a fixed f2f1. In each panel, isolevel contours for Ld as a function of L1 and L2 are plotted in 4 dB steps. Note that the contours are well rounded for f2f1=1.40 across all f2, and become more oblique and narrower as f2f1 decreases. Therefore, for any fixed f2 and L2, the “optimal” L1 that yields the highest DPOAE level has a tendency to decrease as f2f1 decreases. This result agrees qualitatively with experimental data from normal-hearing humans (Johnson et al., 2006). In each panel, the straight line shows a linear regression of optimal L1 as a function of L2. Among all linear paths shown in Fig. 7, the path for f2=4 kHz and f2f1=1.20 is L1=0.43L2+45, which comes closest to paths previously recommended. To obtain maximum DPOAE, Kummer et al. (1998) recommended the path of L1=0.4L2+39 and Neely et al. (2005) recommended L1=0.45L2+44.

Figure 7.

Figure 7

DPOAE level Ld, plotted as a function of L1 and L2 for various combinations of {f1,f2}. Isolevel contours are plotted in 4 dB steps. As a reference, the contour corresponding to Ld=0 dB SPL is indicated with a thick black line. Otherwise, darker contours represent higher Ld. Only contours corresponding to Ld>−30 dB are shown. In each panel, the thick gray line represents linear regression of optimal L1 as a function of L2. Empirical optimal paths are shown on the panel for f2=4 kHz and f2f1=1.20 for comparison; dashed line was recommended by Neely et al. (2005), and dash-dotted line by Kummer et al. (1998).

Another way to visualize the DPOAE data is to plot Ld against primary frequencies. Following Kummer et al. (1998) recommendation, we set L1=0.4L2+39. DPOAE level Ld is plotted against f1 for fixed f2=1, 2, 4, or 8 kHz and fixed L2 from 20 to 70 dB in 10 dB steps. In Fig. 8, each curve consists of six points corresponding to f2f1=1.55, 1.4, 1.3, 1.2, 1.1, and 1.05, respectively. Results show that the optimal f1 for each curve ranges from f2∕1.4 (e.g., for f2=1 kHz, L2=60 or 70 dB) to f2∕1.1 (e.g., for f2=2 or 4 kHz, L2=20 dB). Also, the optimal f1 decreases as L2 increases for f2=1, 2, or 4 kHz.

Figure 8.

Figure 8

DP filtering effects. DP level Ld is plotted against f1 for a fixed f2=1, 2, 4, or 8 kHz. Curves marked with different symbols correspond to different L2: ▽=20 (dB), ◻=30, +=40, ○=50, ×=60, and △=70, while L1=39+0.4L2.

Brown et al. (1992) reported human DPOAE at a fixed f2 while f1 was swept, so f2f1 varied from 1.01 to 1.41. Primary levels were fixed at L2=40 dB and L1=55 dB. DP level was plotted against DP frequency fd, and results showed a band-pass profile similar to Fig. 8 with a plateau of approximately 0.8 kHz wide and 5–10 dB (SPL) high. The peak level Ld for the 2f1f2 component occurred at an f1 of approximately 3.3–3.4 kHz, which is higher than the best f1≈3.1 kHz of the present model. This discrepancy will be discussed further in Sec. 5.

Suppression of DPOAE

Model responses to three tones were tested in a DP suppression paradigm. The stimulus f(t) consisted of two primary tones with f2f1=1.22 and a suppressor tone at frequency fsup and level Lsup. As L2 varied, L1 was set as recommended by Kummer et al. (1998): L1=0.4L2+39. The DP I∕O function was measured in the same manner as described in Sec. 4C except that the window used for spectral analysis now contained minimal integer cycles of f1, f2, and fsup.

Figure 9 shows DP I∕O function for an “on-frequency” suppression condition [fsupf2, Fig. 9A] and for fsup at approximately one octave lower [Fig. 9B]. The thick line shows the DP I∕O function when Lsup=0 dB SPL; at this suppressor level, any change of DP level due to the presence of the suppressor should be negligible. On-frequency suppression becomes prominent for Lsup≥40 dB SPL, and the I∕O function generally shifts to the right as Lsup increases further. In contrast, the low-frequency suppressor does not cause Ld to reduce until Lsup=70 dB. Note that the low-frequency suppressor even causes Ld to increase for Lsup=50, 60, and 70 dB for different ranges of L2. Possible reasons for this DPOAE increment induced by a low-frequency “suppressor” will be discussed in Sec. 5.

Figure 9.

Figure 9

Suppression of DPOAE by a third tone. Frequencies of the primary tones are: f2=4000 Hz, and f1=f2∕1.22. (A) “On-frequency” suppression, fsupf2. (B) fsup≈0.5f2. In both panels, the solid line shows DP input-output (I∕O) function with negligible suppression (suppressor level Lsup=0 dB). Symbols represent the I∕O function for different Lsup: ◻=30, +=40, ○=50, ×=60, △=70, and ▽=80.

The rightward shift of DP I∕O function shown in Fig. 9 could be quantified by finding the input increment ΔL2 necessary for the DP level Ld to reach a given level when the suppressor is present. In Fig. 10, ΔL2 is plotted against Lsup for the on-frequency suppressor (filled symbols) and the low-frequency suppressor (open symbols) for Ld to reach −20 or −3 dB SPL. Results indicate that I∕O curves begins to shift to the right (ΔL2>0) at a lower Lsup for Ld=−20 dB than for Ld=−3 dB; this is true for both the on-frequency and the low-frequency suppressors. The shift ΔL2 produced by the present model is comparable to data from normal-hearing humans (Rodríguez et al., 2010) for most of the lower Lsup’s. For the highest Lsup tested (70 dB for fsupf2 and 75 or 80 dB for fsup≈0.5f2), ΔL2 produced by the model is at least one standard deviation higher than the mean in Rodríguez et al.’s data (in which ΔL2 was measured with Ld=−3 dB SPL). This discrepancy will be discussed in Sec. 5.

Figure 10.

Figure 10

DP input suppression. The increase ΔL2 in input level required for DPOAE to reach a given level (−20 or −3 dB SPL) is plotted against suppressor level Lsup. Filled symbols represent the “on-frequency” suppression (fsupf2). Open symbols represent fsup at about one octave lower. Squares and error bars show means and standard deviations, respectively, of DP suppression data from normal-hearing humans (Rodríguez et al., 2010).

DISCUSSION

Cochlear tuning and parameter selection

Employing a frequency-domain analysis, we previously predicted that the present OHC model could provide amplification to the traveling waves in a frequency-selective manner (Liu and Neely, 2009) if parameters αv (velocity sensitivity) and Cg (gating capacitance) are sufficiently large. During the parameter-selection process for the present study, we found that the tip-to-tail gain and the quality factor QERB were both reduced when smaller values of αv or Cg were used. In this sense, the present study serves as a time-domain confirmation of the frequency-domain analysis. Note that αv and Cg represent different components of the “cochlear amplifier:” αv represents the sensitivity of MET, which depends on the potential gradient across the apical membrane of the OHC; Cg depends on the density of motor molecules (prestin) on the OHC lateral membrane and is reduced when the cell is hyperpolarized (Santos-Sacchi, 1991). The present model makes it possible to investigate the relations between cochlear tuning properties and micromechanical parameters such as αv and Cg. Simulation can be conducted to predict responses from cochleae with different types of OHC pathology. This might be of clinical interest and provides directions for future research.

In spite of the effort devoted to the parameter-selection process in order to make a tonotopic map typical of humans, the sharpness of tuning QERB and the cochlear latency Nmax produced by the present model (Fig. 3) fall below experimental values previously derived from human ears. Several other lines of evidence also suggest that the tuning of the present model is not as sharp as in human cochleae: first, the ratio f2f1 that produces the highest DPOAE is higher in the present model than that obtained from normal-hearing human ears (Sec. 4C). This implies that the present model is not as sharply tuned as human cochleae because the cutoff frequency of the DP filter (Fig. 8) is an estimate of the bandwidth of cochlear responses. Second, in the present model, the low-frequency suppressor produces much more DPOAE suppression than empirically at high Lsup (Fig. 10). This indicates that the f2 characteristic place in the present model has a larger response to low-frequency stimuli at high level than it does in human cochleae. Therefore, either the present model is less sharply tuned than human cochleae or its excitation pattern shifts toward the base excessively at high intensities. Further investigation is needed to explore these two possibilities.

The reduced sharpness of tuning may be a limitation of one-dimensional (long-wave) cochlear modeling. In a two-dimensional (2D) model, the real and imaginary parts of the wave propagation function, which are related to cochlear gain and cochlear latency respectively, are both greater than those predicted by a 1D model (Shera et al., 2005). Kolston (2000) argued that it is necessary to model cochlear mechanics in three dimensions so as to match both the phase and the gain responses to experiment data. To increase the sharpness of tuning for the present 1D model, we could have set αv or Cg to higher values. However, values higher than currently used might start to deviate from physical reality. Additionally, we found that higher αv’s resulted in spontaneous vibration in the cochlear partition, which could propagate from the cochlea as SOAE. The frequency of SOAE is hard to predict (Ku et al., 2009), and its presence complicates spectral analysis since the present method assumed that the frequencies of all spectral components are of integer ratios. In the future, higher αv values might be preferred to produce higher cochlear gain; an improved spectral-analysis method is also warranted.

Cochlear nonlinearities

In the present study, the gating capacitance of OHCs was linearized to confine the source of nonlinearity to MET of OHCs (see Sec. 2B). Patuzzi (1996) argued that such simplification is legitimate at high frequencies because the response in OHC lateral-membrane potential is limited by membrane capacitance. However, at low frequencies and high intensities, Patuzzi (1996) stated that the membrane potential may change by tens of millivolts; in this case, the operating point shifts significantly, so the nonlinearity in OHC gating capacitance needs to be considered.

In light of Patuzzi’s (1996) argument, we examined the maximum potential change elicited by acoustical stimuli in the present model. Results showed that the change in OHC membrane potential was indeed larger for low-frequency than for high-frequency stimuli, but was no more than 4.0 mV for single tones at 100 dB SPL at any frequency between 0.125 and 8 kHz. This amplitude was too small to have caused a significant shift in the OHC operating point. Though the model’s findings deviated from Patuzzi’s (1996) prediction, our attempt to linearize the dynamics of piezoelectrical membrane is justified as far as numerical simulation is concerned. Improving the representation of electrical properties of the OHC is an area for future work.

The present model reproduced, at least qualitatively, many features in nonlinear responses that are typical of humans and other mammals. We have argued that some discrepancies could be accounted for by increasing the sharpness of tuning throughout the cochlear model. Another discrepancy is in the absolute level of DPOAE; in Fig. 7, the DP level Ld reached >30 dB SPL for the highest L1 and L2 tested for f2=1 or 2 kHz when f2f1≥1.30. In contrast, the mean of Ld across normal-hearing ears was never greater than 20 dB SPL as reported by Johnson et al. (2006). Nevertheless, the DP I∕O function in Fig. 9A is similar to experimental data at L2=20–50 dB SPL (Johnson et al., 2006; Long et al., 2009; Rodríguez et al., 2010). While discrepancy of the absolute levels of DPOAE could be partially explained by transmission through the middle ear (see discussion in Appendix A), discrepancy in the DP growth rate at high L2 indicates that the nonlinearity in the present model is not sufficiently compressive.

Note that the DP level discrepancy becomes prominent at L2≥50 dB, at which the medial olivocochlear (MOC) efferent inhibition would be activated (Guinan, 2006). A recent study also showed that middle-ear muscle reflex (MEMR) was elicited by a 75 dB activator at 1 kHz in 25% of normal-hearing ears (Keefe et al., 2010). It is possible that MOC efferents and MEMR both contribute to the suppression of DPOAE level when L1 or L2 is above their respective thresholds. Therefore, incorporating these negative feedback mechanisms to the present model may help reduce the excessive DPOAE it generates at high primary levels.

As stated previously, the one-dimensional treatment is known to affect tuning. Because tuning is related to compression in a nonlinear model, the absence of close quantitative agreement with experimental data could also be related to the simplified dimensionality of the present model.

Negative suppression

The negative suppression shown in Figs. 9B, 10 remains puzzling. It occurred only for a certain input range (50–70 dB). In fact, negative suppression of DPOAE has been observed empirically; for instance, the mean input suppression ΔL2 in Rodríguez et al. (2010) is −0.8 dB at Lsup=60 dB for the low-frequency suppressor. To explain Rodríguez et al.’s (2010) data, one can assume that the DP wavelets from the f2 characteristic place and those reflected coherently (Zweig and Shera, 1995) from the DP place interfere destructively at the stapes. When a low-frequency suppressor is presented at an intermediate level, it reduces the wavelets that originate from the DP place and releases DPOAE from wave cancellation. This causes the DPOAE level to increase.

An alternative explanation for negative suppression is that the suppressor causes DP to be reflected from its characteristic place in a nonlinear fashion, much similar to the reflection observed in Fig. 4 and by Kanis and de Boer (1993). If the reflected wavelets interfere with the wavelets from the f2 place constructively, there would be an increase in the level of DPOAE. In theory, waves are scattered wherever there is an impedance mismatch. If saturation nonlinearity causes the propagation-gain function to reduce locally at a high intensity, the wavenumber function should also be altered due to causality (Shera, 2007). Similarly, it may be possible to investigate how characteristic impedance is altered locally and waves scattered nonlinearly, perhaps using a quasilinear approach (Kanis and de Boer, 1993; Talmadge et al., 1998). In the future, a time-domain model such as presented in this paper can serve as a numerical test bench for theories developed in this direction.

CONCLUSIONS

Simulation of DPOAE at frequency 2f1f2 can be achieved by introducing an antisymmetric nonlinearity to the MET channel of OHCs. The present nonlinear OHC models embedded in a one-dimensional transmission-line cochlear model collectively produce dynamic-range compression, broadening of excitation patterns, and shifting of excitation patterns toward the base, similar to responses observed in live mammalian cochleae. The model also produces DPOAE isolevel contour plots that vary with f1 and f2, DP band-pass filtering effects across three octaves of f2, and a shift of DPOAE I∕O functions due to suppression by a third tone. Several lines of evidence suggest that the present model is, however, less sharply tuned than human cochleae. The present results might be improved by setting a higher sensitivity of OHC receptor current with respect to RL velocity or by modeling mechanics of the cochlear fluid in two or three dimensions. Nevertheless, qualitative similarities between the present results and experimental data from normal-hearing ears suggest that the model may also be useful for studying other nonlinear responses in healthy cochleae. In addition, because of an explicit representation of OHC electromechanics, it is possible to alter parameters for the present model and simulate responses in ears with different types of OHC pathology. These provide directions for future research.

ACKNOWLEDGMENTS

We gratefully acknowledge two anonymous reviewers for providing valuable comments. We also thank Joyce Rodríguez and Emery Ku for giving critiques on previous versions of this manuscript. This study was supported by a grant from NIH-NIDCD (Grant No. R01-DC8318).

APPENDIX A: MIDDLE-EAR TRANSFER FUNCTIONS

Figure 11 shows middle-ear transfer functions in both the forward and the reverse direction. Results were obtained from a finite-difference, frequency-domain version of the present cochlear model. First, cochlear input impedance Zc was calculated using a small-signal analysis (Puria and Allen, 1991), i.e., Eq. 1 was used instead of Eq. 2. Given Zc, we drew an equivalent circuit for the middle-ear mechanics (Fig. 1) and derived the forward pressure transfer function S12, defined as the acoustic pressure gain from the ear canal to the cochlear vestibule (Puria, 2003). To calculate transfer functions in the reverse direction, we assumed that the ear canal has an acoustic impedance of ρcA—ρ is the air density, c is the speed of sound, and A is the cross-sectional area of the ear canal; in other words, reflection of reverse-traveling sounds was ignored. The equivalent circuit was also used for calculation of the reverse pressure transfer function S21, defined as the ratio of the ear canal pressure to the cochlear fluid pressure for reverse transmission. Finally, the reverse middle-ear impedance M3, defined as the ratio of the vestibule pressure to the stapes volume velocity, was calculated. For comparison purposes, the symbols and definitions are borrowed from a previous study (Puria, 2003, Fig. 2).

Figure 11.

Figure 11

Middle-ear transfer functions. Top panels show magnitude responses, and bottom panels show phase responses. From left to right, the four columns show forward pressure transfer function (S12), reverse pressure transfer function (S21), reverse middle-ear impedance (M3), and the cochlear input impedance Zc, respectively.

The main discrepancy between the present results and human cadaver data is in the phase responses of pressure transfer functions. The phase accumulated to more than 300° for both S12 and S21 in Puria’s (2003) data, and in one case more than 540° for S12 in Nakajima et al. (2009). This indicates a middle-ear latency, in the order of tens of μs (Dong and Olson, 2006), that is not present in our model but has been attributed to tympanic membrane dynamics (Puria and Allen, 1998).

Compared to measurements from human cadavers (Puria, 2003; Nakajima et al., 2009), the magnitude plateau in S12 is about the same height (10–20 dB gain). The width of the plateau is similar to Nakajima et al. (2009) (their Fig. 6) but appears wider than in Puria’s data (his Fig. 2); the present S12 is as much as 10–15 dB higher at frequencies below 500 Hz and above 4 kHz. The magnitude of S21 are similar to Puria’s (2003) result, peaking at around 1 kHz and rolls off approximately to −50 and −60 dB at 0.1 and 10 kHz, respectively. However, the maximum |S21| is smaller than Puria’s (2003) but more similar to results from live gerbils (Dong and Olson, 2006, Fig. 5).

The magnitude of S12 and S21 influences the level of DPOAE directly (Keefe and Abdala, 2007). If the middle-ear parameters are adjusted to reduce S12 and match Puria’s (2003) data, it will require higher L1 and L2 to achieve the same levels of DP inside the cochlea. Consequently, we can expect the DP contours to shift toward the upper-right corner in Fig. 7, more so for f2=4 or 8 kHz than for 1 and 2 kHz. However, S12 and S21 are interdependent due to reciprocity; reducing S12 tends to increase S21. It remains to be studied how much the difference between model-predicted DPOAE levels and those measured from normal-hearing humans (Johnson et al., 2006, Fig. 1) can be resolved by jointly adjusting S12 and S21.

APPENDIX B: METHODS FOR ADJUSTING THE COCHLEAR PARAMETERS

For a stimulus at sufficiently low intensity, the response at any given location in the cochlea can be characterized by three salient frequencies: a shoulder frequencyfsh above which the magnitude increases rapidly, a pole frequencyfp at which the response reaches its peak, and a cutoff frequencyfc at which the response starts to roll off. Based on the present assumptions on fluid coupling [Eq. 9] and OHC electromechanics, these frequencies are approximately given by the following formulas:

2πfsh=KeqM, (B1a)
2πfp=(Keq+αvTC)M, (B1b)
2πfc=km, (B1c)

where Keq=K+(T2C)−1+(T2Cg)−1. For negative damping to occur, it is necessary that fsh<fp<fc (Liu and Neely, 2009). Further, large capacitances C and Cg and high sensitivity αv are preferred to achieve high amplification (Liu and Neely, 2009). All parameters listed in Table 1 were fine-tuned while maintaining that fp<fc but maximizing αvTC so as to ensure a broad negative damping region. In Fig. 12, salient frequencies are compared against Greenwood’s function (Greenwood, 1990) and the CF predicted by the present model. The model-predicted CF follows the pole frequency closely. Also, the CF does not deviate from Greenwood’s function (Greenwood, 1990) by more than 10% except at the most apical 7 mm. The discrepancy there reflects the limitation of fitting the parameters spatially by log-quadratic interpolation (Sec. 3A).

Figure 12.

Figure 12

Place-frequency functions. The shoulder frequency, the pole frequency, and the cutoff frequency were calculated as a function of distance from stapes. At selected places, the model-predicted CF is marked with a square. The Greenwood function was defined as CF(x)=F0⋅2xlF1, where x is the distance from stapes, l=5.0 mm, F0=20 kHz, and F1=0.165 kHz.

Finally, the parameter Imax which characterizes nonlinearity [see Eq. 3] was adjusted to produce reasonable dynamic-range compression and DP I∕O function. We found that a small Imax produces compression at low stimulus levels but limits the maximum DP level. Thus, Imax and middle-ear transfer functions S12 and S21 jointly affect the amount of DPOAEs produced by the present model.

Footnotes

1

That is, any phase advance of ir relative to ξr due to radial coupling between OHC hair bundles and the tectorial membrane was represented by the velocity-sensing parameter αv [see Sec. VC of Liu and Neely (2009) for more discussion].

2

Because Eq. 3 is antisymmetric, all the even-order terms vanish in its Taylor expansion around the origin. Consequently, even-order DPs vanish while odd-order products remain.

3

In our previous study (Liu and Neely, 2009), an OHC was assumed to stretch against RL on one side, BM on the other side, and the stiffness of the OHC itself. We then argued that, because the stiffness of BM is much higher than that of RL, the impedance is approximately that of RL for the frequency range of interest. Therefore, the effective mass M and resistance R in Eq. 7 can be regarded as equivalent to the RL mass and resistance, whereas the effective stiffness K is equivalent to OHC stiffness plus RL stiffness in Liu and Neely (2009).

4

Alternatively, Liu and Neely (2009) assumed that the volume velocity gradient couples to the velocity of both the RL and the BM; i.e., xU=w(ξ˙r+ξ˙b). Empirically, Eq. 9 gives higher displacement gain in the cochlea.

5

Hereafter, acoustic pressure is referred to as pressure for simplicity.

6

Equation 13b implicitly assumes that the displacement of the round window is equal to that of the stapes but in the opposite direction, and that the area of the round window is the same as the area of the stapes.

7

Let q0, q1, and q2 denote the value of a parameter q at the base, the midpoint, and the apex respectively. Then, q(x) was calculated using the following formula: q(x)=q0 exp[ax(Lx)+bx], where a=2 log(q12q0q2)L2 and b=log(q2q0)∕L.

8

Kc corresponds to the acoustic compliance of enclosed air of 1.7 cm3 at atmospheric pressure. Diaphragm parameters Kd, Rd, and Md were chosen arbitrarily to produce a sufficiently flat response in the frequency range of interest. Note that the resonance frequency of the diaphragm, KdMd2π, is above 40 kHz, and the Q-value (KdMdRd) is about 1.0.

9

Thus, the spatial resolution δ along the x-direction was 50 μm, and x2P was approximated by [P(x+δ)−2P(x)+P(x−δ)]∕δ2. So, Eq. 18 was solved by inverting a tridiagonal matrix via Gaussian elimination, which has a computation load of O(N).

10

To derive the gain and the group delay, the Fourier transform Xr(x,f) of RL displacement ξr(x,t) was calculated at every location x. Define a transfer function T(x,f)=Xr(x,f)∕Xs(f), where Xs(f) is the Fourier transform of stapes displacement xs(t). The displacement gain plotted in Fig. 2A is |T(x,f)| in dB, and the group delay plotted in Fig. 2B is defined as N=−ffϕ, where ϕ denotes the phase of T(x,f).

11

For an arbitrary tuning curve T(f), its ERB is defined as Δf=0|T(f)|dfTmax, where Tmax denotes the maximum of |T(f)|. Accordingly, QERB is defined as fmax∕Δf, where fmax denotes the best frequency.

References

  1. Brown, A. M., Gaskill, S. A., and Williams, D. M. (1992). “Mechanical filtering of sound in the inner ear,” Proc. R. Soc. London, Ser. B 250, 29–34. 10.1098/rspb.1992.0126 [DOI] [PubMed] [Google Scholar]
  2. Brownell, W. E., Bader, C. R., Bertrand, D., and de Ribaupierre, Y. (1985). “Evoked mechanical responses of isolated cochlear hair cells,” Science 227, 194–196. 10.1126/science.3966153 [DOI] [PubMed] [Google Scholar]
  3. Corey, D. P., and Hudspeth, A. J. (1983). “Kinetics of the receptor current in bullfrog saccular hair cells,” J. Neurosci. 3, 962–976. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Dallos, P. (1973). The Auditory Periphery: Biophysics and Physiology (Academic, New York: ). [Google Scholar]
  5. Dallos, P. (2008). “Cochlear amplification, outer hair cells and prestin,” Curr. Opin. Neurobiol. 18, 370–376. 10.1016/j.conb.2008.08.016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Davis, H. (1983). “An active process in cochlear mechanics,” Hear. Res. 9, 79–90. 10.1016/0378-5955(83)90136-3 [DOI] [PubMed] [Google Scholar]
  7. Diependaal, R. J., Duifhuis, H., Hoogstraten, H. W., and Viergever, M. A. (1987). “Numerical methods for solving one-dimensional cochlear models in the time domain,” J. Acoust. Soc. Am. 82, 1655–1666. 10.1121/1.395157 [DOI] [PubMed] [Google Scholar]
  8. Dong, W., and Olson, E. S. (2006). “Middle ear forward and reverse transmission in gerbil,” J. Neurophysiol. 95, 2951–2961. 10.1152/jn.01214.2005 [DOI] [PubMed] [Google Scholar]
  9. Dong, W., and Olson, E. S. (2008). “Supporting evidence for reverse cochlear traveling waves,” J. Acoust. Soc. Am. 123, 222–240. 10.1121/1.2816566 [DOI] [PubMed] [Google Scholar]
  10. Eguíluz, V. M., Ospeck, M., Choe, Y., Hudspeth, A. J., and Magnasco, M. O. (2000). “Essential nonlinearities in hearing,” Phys. Rev. Lett. 84, 5232–5235. 10.1103/PhysRevLett.84.5232 [DOI] [PubMed] [Google Scholar]
  11. Gaskill, S. A., and Brown, A. M. (1990). “The behaviour of the acoustic distortion product, 2f1-f2, from the human ear and its relation to auditory sensitivity,” J. Acoust. Soc. Am. 88, 821–839. 10.1121/1.399732 [DOI] [PubMed] [Google Scholar]
  12. Glasberg, B. R., and Moore, B. C. J. (1990). “Derivation of auditory filter shapes from notched-noise data,” Hear. Res. 47, 103–138. 10.1016/0378-5955(90)90170-T [DOI] [PubMed] [Google Scholar]
  13. Gorga, M. P., Neely, S. T., Dierking, D. M., Kopun, J., Jolkowski, K., Groenenboom, K., Tan, H., and Stiegemann, B. (2008). “Low-frequency and high-frequency distortion product otoacoustic emission suppression in humans,” J. Acoust. Soc. Am. 123, 2172–2190. 10.1121/1.2839138 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Greenwood, D. D. (1990). “A cochlear frequency-position function for several species—29 years later,” J. Acoust. Soc. Am. 87, 2592–2605. 10.1121/1.399052 [DOI] [PubMed] [Google Scholar]
  15. Guinan, J. J., Jr. (2006). “Olivocochlear efferents: Anatomy, physiology, function, and the measurement of efferent effects in humans,” Ear Hear. 27, 589–607. 10.1097/01.aud.0000240507.83072.e7 [DOI] [PubMed] [Google Scholar]
  16. Hall, J. L. (1974). “Two-tone distortion products in a nonlinear model of the basilar membrane,” J. Acoust. Soc. Am. 56, 1818–1828. 10.1121/1.1903519 [DOI] [PubMed] [Google Scholar]
  17. Harte, J. M., Pigasse, G., and Dau, T. (2009). “Comparison of cochlear delay estimates using otoacoustic emissions and auditory brainstem responses,” J. Acoust. Soc. Am. 126, 1291–1301. 10.1121/1.3168508 [DOI] [PubMed] [Google Scholar]
  18. Hubbard, A. E., Mountain, D. C., and Chen, F. (2003). “Time-domain responses from a nonlinear sandwich model of the cochlea,” in Biophysics of the Cochlea: From Molecules to Models, edited by Gummer A. W. (World Scientific, Singapore: ), pp. 351–358. [Google Scholar]
  19. Hudspeth, A. J. (1997). “Mechanical amplification of stimuli by hair cells,” Curr. Opin. Neurobiol. 7, 480–486. 10.1016/S0959-4388(97)80026-8 [DOI] [PubMed] [Google Scholar]
  20. Johnson, T. A., Neely, S. T., Garner, C. A., and Gorga, M. P. (2006). “Influence of primary-level and primary-frequency ratios on human distortion product otoacoustic emissions,” J. Acoust. Soc. Am. 119, 418–428. 10.1121/1.2133714 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Kanis, L. J., and de Boer, E. (1993). “Self-suppression in a locally active nonlinear model of the cochlea: A quasi-linear approach,” J. Acoust. Soc. Am. 94, 3199–3206. 10.1121/1.407225 [DOI] [PubMed] [Google Scholar]
  22. Keefe, D. H., and Abdala, C. (2007). “Theory of forward and reverse middle-ear transmission applied to otoacoustic emissions in infant and adult ears,” J. Acoust. Soc. Am. 121, 978–993. 10.1121/1.2427128 [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Keefe, D. H., Fitzpatrick, D., Liu, Y. -W., Sanford, C. A., and Gorga, M. P. (2010). “Wideband acoustic reflex test in a test battery to predict middle-ear dysfunction,” Hear. Res. In press. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Kemp, D. T. (1978). “Stimulated acoustic emissions from within the human auditory system,” J. Acoust. Soc. Am. 64, 1386–1391. 10.1121/1.382104 [DOI] [PubMed] [Google Scholar]
  25. Kim, D. O., Molnar, C. E., and Mathews, J. W. (1980). “Cochlear mechanics: Nonlinear behavior in two-tone responses as reflected in cochlear-nerve-fibre responses and in ear canal sound pressure,” J. Acoust. Soc. Am. 67, 1704–1721. 10.1121/1.384297 [DOI] [PubMed] [Google Scholar]
  26. Kim, D. O., Molnar, C. E., and Pfeiffer, R. R. (1973). “A system of nonlinear differential equations modeling basilar-membrane motion,” J. Acoust. Soc. Am. 54, 1517–1529. 10.1121/1.1914449 [DOI] [PubMed] [Google Scholar]
  27. Kolston, P. J. (2000). “The importance of phase data and model dimensionality to cochlear mechanics,” Hear. Res. 145, 25–36. 10.1016/S0378-5955(00)00067-8 [DOI] [PubMed] [Google Scholar]
  28. Ku, E. M., Elliott, S. J., and Lineton, B. (2009). “Limit cycle oscillations in a nonlinear state space model of the human cochlea,” J. Acoust. Soc. Am. 126, 739–750. 10.1121/1.3158861 [DOI] [PubMed] [Google Scholar]
  29. Kummer, P., Janssen, T., and Arnold, W. (1998). “The level and growth behavior of the 2f1-f2 distortion product otoacoustic emission and its relationship to auditory sensitivity in normal hearing and cochlear hearing loss,” J. Acoust. Soc. Am. 103, 3431–3444. 10.1121/1.423054 [DOI] [PubMed] [Google Scholar]
  30. Liu, Y. -W., and Neely, S. T. (2009). “Outer hair cell electromechanical properties in a nonlinear piezoelectric model,” J. Acoust. Soc. Am. 126, 751–761. 10.1121/1.3158919 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Long, G. R., Jeung, C., and Talmadge, C. L. (2009). “Dependence of distortion-product otoacoustic emission components on primary-level ratio,” in Concepts and Challenges in the Biophysics of Hearing, edited by Cooper N. P. and Kemp D. T. (World Scientific, Singapore: ), pp. 203–208. [Google Scholar]
  32. Lu, T. K., Zhak, S., Dallos, P., and Sarpeshkar, R. (2006). “Fast cochlear amplification with slow outer hair cells,” Hear. Res. 214, 45–67. 10.1016/j.heares.2006.01.018 [DOI] [PubMed] [Google Scholar]
  33. Matthews, J. W. (1983). “Modeling reverse middle ear transmission of acoustic distortion signals,” in Mechanics of Hearing, edited by de Boer E. and Viergever M. A. (Delft University Press, Delft: ), pp. 11–18. [Google Scholar]
  34. Nakajima, H. H., Dong, W., Olson, E. S., Merchant, S. N., Ravicz, M., and Rosowski, J. J. (2009). “Differential intracochlear sound pressure measurements in normal human temporal bones,” J. Assoc. Res. Otolaryngol. 10, 23–36. 10.1007/s10162-008-0150-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Neely, S. T. (1988). “Transient responses in an active, nonlinear model of cochlear mechanics,” in Basic Issues in Hearing, edited by Duifuis D., Horst J., and Wit H. (Academic, New York: ). [Google Scholar]
  36. Neely, S. T. (1993). “A model of cochlear mechanics with outer hair cell motility,” J. Acoust. Soc. Am. 94, 137–146. 10.1121/1.407091 [DOI] [PubMed] [Google Scholar]
  37. Neely, S. T., Gorga, M. P., and Dorn, P. A. (2000). “Distortion product and loudness growth in an active, nonlinear model of cochlear mechanics,” in Recent Developments in Auditory Mechanics, edited by Wada H., Takasaka T., Ikeda K., Ohyama K., and Koike T. (World-Scientific, Singapore: ), pp. 237–243. [Google Scholar]
  38. Neely, S. T., Johnson, T. A., and Gorga, M. P. (2005). “Distortion-product otoacoustic emission measured with continuously varying stimulus level,” J. Acoust. Soc. Am. 117, 1248–1259. 10.1121/1.1853253 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Neely, S. T., and Kim, D. O. (1983). “An active cochlear model showing sharp tuning and high sensitivity,” Hear. Res. 9, 123–130. 10.1016/0378-5955(83)90022-9 [DOI] [PubMed] [Google Scholar]
  40. Neely, S. T., and Kim, D. O. (1986). “A model for active elements in cochlear biomechanics,” J. Acoust. Soc. Am. 79, 1472–1480. 10.1121/1.393674 [DOI] [PubMed] [Google Scholar]
  41. Neely, S. T., and Liu, Y. -W. (2009). “Retrograde propagation of cochlear distortion,” in Concepts and Challenges in the Biophysics of Hearing, edited by Cooper N. P. and Kemp D. T. (World Scientific, Singapore: ), pp. 55–61. [Google Scholar]
  42. Neely, S. T., Norton, S. J., Gorga, M. P., and Jesteadt, W. (1988). “Latency of auditory brain-stem responses and otoacoustic emissions using tone-burst stimuli,” J. Acoust. Soc. Am. 83, 652–656. 10.1121/1.396542 [DOI] [PubMed] [Google Scholar]
  43. Neely, S. T., and Stover, L. J. (1994). “Otoacoustic emissions from a nonlinear, active model of cochlear mechanics,” in Biophysics of Hair Cell Sensory Systems, edited by Duifhuis H., Horst J. W., van Dijk P., and van Netten S. M. (World Scientific, Singapore: ), pp. 64–71. [Google Scholar]
  44. Nobili, R., Vetešník, A., Turicchia, L., and Mammano, F. (2003). “Otoacoustic emissions from residual oscillations of the cochlear basilar membrane in a human ear model,” J. Assoc. Res. Otolaryngol. 4, 478–494. 10.1007/s10162-002-3055-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Patuzzi, R. (1996). “Cochlear micromechanics and macromechanics,” in The Cochlea, edited by Dallos P., Popper A. N., and Fay R. R. (Springer, New York: ), pp. 186–257. [Google Scholar]
  46. Pfeiffer, R. R., and Kim, D. O. (1975). “Cochlear nerve fiber responses: Distribution along the cochlear partition,” J. Acoust. Soc. Am. 58, 867–869. 10.1121/1.380735 [DOI] [PubMed] [Google Scholar]
  47. Puria, S. (2003). “Measurements of human middle ear forward and reverse acoustics: Implications for otoacoustic emissions,” J. Acoust. Soc. Am. 113, 2773–2789. 10.1121/1.1564018 [DOI] [PubMed] [Google Scholar]
  48. Puria, S., and Allen, J. B. (1991). “A parametric study of cochlear input impedance,” J. Acoust. Soc. Am. 89, 287–309. 10.1121/1.400675 [DOI] [PubMed] [Google Scholar]
  49. Puria, S., and Allen, J. B. (1998). “Measurement and model of the cat middle ear: Evidence of tympanic membrane acoustic delay,” J. Acoust. Soc. Am. 104, 3463–3482. 10.1121/1.423930 [DOI] [PubMed] [Google Scholar]
  50. Rabbitt, R. D., Clifford, S., Breneman, K. D., Farrell, B., and Brownell, W. E. (2009). “Power efficiency of outer hair cell somatic electromotility,” PLOS Comput. Biol. 5, e1000444. 10.1371/journal.pcbi.1000444 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Ramamoorthy, S., Deo, N. V., and Grosh, K. (2007). “A mechano-electro-acoustical model for the cochlea: Response to acoustic stimuli,” J. Acoust. Soc. Am. 121, 2758–2773. 10.1121/1.2713725 [DOI] [PubMed] [Google Scholar]
  52. Ren, T. (2004). “Reverse propagation of sound in the gerbil cochlea,” Nat. Neurosci. 7, 333–334. 10.1038/nn1216 [DOI] [PubMed] [Google Scholar]
  53. Rhode, W. S. (1971). “Observations of the vibration of the basilar membrane in squirrel monkeys using the Mössbauer technique,” J. Acoust. Soc. Am. 49, 1218–1231. 10.1121/1.1912485 [DOI] [PubMed] [Google Scholar]
  54. Rhode, W. S. (2007). “Basilar membrane mechanics in the 6–9 kHz region of sensitive chinchilla cochleae,” J. Acoust. Soc. Am. 121, 2792–2804. 10.1121/1.2718397 [DOI] [PubMed] [Google Scholar]
  55. Rhode, W. S., and Robles, L. (1974). “Evidence from Mössbauer experiments for nonlinear vibration in the cochlea,” J. Acoust. Soc. Am. 55, 588–596. 10.1121/1.1914569 [DOI] [PubMed] [Google Scholar]
  56. Ricci, A. J., Kennedy, H. J., Crawford, A. C., and Fettiplace, R. (2005). “The transduction channel filter in auditory hair cells,” J. Neurosci. 25, 7831–7839. 10.1523/JNEUROSCI.1127-05.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Rodríguez, J., Neely, S. T., Patra, H., Kopun, J., Jesteadt, W., Tan, H., and Gorga, M. P. (2010). “The role of suppression in psychophysical tone-on-tone masking,” J. Acoust. Soc. Am. 127, 361–369. 10.1121/1.3257224 [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Ruggero, M. A., Rich, N. C., and Robles, L. (1997). “Basilar-membrane responses to tones at the base of the chinchilla cochlea,” J. Acoust. Soc. Am. 101, 2151–2163. 10.1121/1.418265 [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Santos-Sacchi, J. (1991). “Reversible inhibition of voltage-dependent outer hair cell motility and capacitance,” J. Neurosci. 11, 3096–3110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Shera, C. A. (2007). “Laser amplification with a twist: Traveling-wave propagation and gain functions from throughout the cochlea,” J. Acoust. Soc. Am. 122, 2738–2758. 10.1121/1.2783205 [DOI] [PubMed] [Google Scholar]
  61. Shera, C. A., Guinan, J. J., Jr., and Oxenham, A. J. (2002). “Revised estimates of human cochlear tuning from otoacoustic and behavioral measurements,” Proc. Natl. Acad. Sci. U.S.A. 99, 3318–3323. 10.1073/pnas.032675099 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Shera, C. A., Tubis, A., and Talmadge, C. L. (2005). “Coherent reflection in a two-dimensional cochlea: Short-wave versus long-wave scattering in the generation of reflection-source otoacoustic emissions,” J. Acoust. Soc. Am. 118, 287–313. 10.1121/1.1895025 [DOI] [PubMed] [Google Scholar]
  63. Shera, C. A., Tubis, A., Talmadge, C. L., de Boer, E., Fahey, P. F., and Guinan, J. J., Jr. (2007). “Allen-Fahey and related experiments support the predominance of cochlear slow-wave otoacoustic emissions,” J. Acoust. Soc. Am. 121, 1564–1575. 10.1121/1.2405891 [DOI] [PubMed] [Google Scholar]
  64. Stoop, R., and Kern, A. (2004). “Two-tone suppression and combination tone generation as computations performed by the Hopf cochlea,” Phys. Rev. Lett. 93, 268103. 10.1103/PhysRevLett.93.268103 [DOI] [PubMed] [Google Scholar]
  65. Talmadge, C. L., Tubis, A., Long, G. R., and Piskorski, P. (1998). “Modeling otoacoustic emission and hearing threshold fine structures,” J. Acoust. Soc. Am. 104, 1517–1543. 10.1121/1.424364 [DOI] [PubMed] [Google Scholar]
  66. van der Heijden, M., and Joris, P. X. (2006). “Panoramic measurements of the apex of the cochlea,” J. Neurosci. 26, 11462–11473. 10.1523/JNEUROSCI.1882-06.2006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Zhang, Y., Kim, C. K., Lee, K. J. B., and Park, Y. (2009). “A Brownian energy depot model of the basilar membrane oscillation with a braking mechanism,” Eur. Phys. J. E 29, 345–349. 10.1140/epje/i2009-10491-9 [DOI] [PubMed] [Google Scholar]
  68. Zurek, P. M., Clark, W. W., and Kim, D. O. (1982). “The behavior of acoustic distortion products in the ear canals of chinchillas with normal or damaged ears,” J. Acoust. Soc. Am. 72, 774–780. 10.1121/1.388258 [DOI] [PubMed] [Google Scholar]
  69. Zweig, G., and Shera, C. A. (1995). “The origin of periodicity in the spectrum of evoked otoacoustic emissions,” J. Acoust. Soc. Am. 98, 2018–2047. 10.1121/1.413320 [DOI] [PubMed] [Google Scholar]
  70. Zwislocki, J. (1962). “Analysis of the middle-ear function. Part I: Input impedance,” J. Acoust. Soc. Am. 34, 1514–1523. 10.1121/1.1918382 [DOI] [Google Scholar]

Articles from The Journal of the Acoustical Society of America are provided here courtesy of Acoustical Society of America

RESOURCES