Abstract
The cochlear microphonic (CM) results from the vector sum of outer hair cell transduction currents excited by a stimulus. The classical theory of CM generation—that the response measured at the round window is dominated by cellular sources located within the tail region of the basilar membrane (BM) excitation pattern—predicts that CM amplitude and phase vary little with stimulus frequency. Contrary to expectations, CM amplitude and phase-gradient delay measured in response to low-level tones in chinchillas demonstrate a striking, quasiperiodic pattern of spectral ripples, even at frequencies > 5 kHz, where interference with neurophonic potentials is unlikely. The spectral ripples were reduced in the presence of a moderate-level saturating tone at a nearby frequency. When converted to the time domain, only the delayed CM energy was diminished in the presence of the saturator. We hypothesize that the ripples represent an interference pattern produced by CM components with different phase gradients: an early-latency component originating within the tail region of the BM excitation and two delayed components that depend on active cochlear processing near the peak region of the traveling wave. Using time windowing, we show that the early, middle, and late components have delays corresponding to estimated middle-ear transmission, cochlear forward delays, and cochlear round-trip delays, respectively. By extending the classical model of CM generation to include mechanical and electrical irregularities, we propose that middle components are generated through a mechanism of “coherent summation” analogous to the production of reflection-source otoacoustic emissions (OAEs), while the late components arise through a process of internal cochlear reflection related to the generation of stimulus-frequency OAEs. Although early-latency components from the passive tail region typically dominate the round-window CM, at low stimulus levels, substantial contributions from components shaped by active cochlear processing provide a new avenue for improving CM measurements as assays of cochlear health.
Keywords: cochlear microphonic, cochlear microphonic model, cochlea
INTRODUCTION
Nearly 90 years ago, Wever and Bray (1930) observed that an electrode placed near the cochlea picks up an alternating current potential that faithfully mirrors the waveform of the acoustic stimulus. This potential, which came to be known as the cochlear microphonic (CM), is generated primarily by the summed receptor currents of the outer hair cells (OHCs) responding to basilar membrane (BM) displacement (e.g., Dallos and Cheatham 1976). Because the CM can be measured in an intact cochlea with far-field monopolar electrodes (e.g., by electrodes placed on the round window or eardrum), it provides a potentially important assay of OHC health and function (Russell 2008). Nevertheless, CMs are perhaps the least utilized indices of OHC performance due to difficulties interpreting the responses in terms of local (i.e., place-specific) cochlear function (e.g., Cheatham et al. 2011).
In the common view, the poor place specificity of the far-field CM derives from a combination of two factors. First, the voltage measured by the electrode depends on a weighted sum of OHC transducer currents whose effects decrease with distance from the electrode, as in a simple cable model where the decrease is exponential (Patuzzi 1987). Thus, for an electrode placed at the round window (RW), the CM primarily reflects the activity of the most proximal cells—those located near the basal end of the cochlea—even when one employs a low-frequency stimulus whose response peaks near the apical end (Patuzzi et al. 1989). Second, because the individual hair cells contributing to the CM evoked by a tonal stimulus are excited by the BM traveling wave, contributions from different cells add with different phases. In particular, the rapid phase variation near the peak of the traveling wave causes the currents from neighboring sources to largely cancel one another; consequently, hair cells in the peak region contribute little to the total CM. As a result, the measured CM is dominated by contributions from cells located in the passive tail region of the BM response, where the phase varies little with location and currents sum constructively (Dallos 1973). Although several studies have proposed ways of overcoming the limited place specificity of monopolar CM measurements, and thereby improving their utility as assays of OHC health (Ponton et al. 1992; Chertoff et al. 2014; Charaziak et al. 2017), the classical theory is widely believed to capture the mechanisms shaping CM responses at the round window (Cheatham et al. 2011; He et al. 2012). Here, we describe previously unreported features of the round-window CM and interpret the results as indicating that mechanisms unaccounted for in the classical theory also play an important role in generating the cochlear microphonic.
Specifically, we show that when measured in response to low-level probe tones in fine frequency steps, CM amplitude and phase-gradient group delays demonstrate a striking quasiperiodic pattern of local maxima and minima, referred to here as spectral ripples. When the probe tone is presented together with another tone of moderate level at a nearby frequency, the spectral ripples are almost completely abolished, confirming their cochlear origin. We describe these phenomena in chinchillas for probe frequencies above 5 kHz, where interference with delayed neurophonic responses is unlikely (Verschooten and Joris 2014). The presence of spectral ripples in CM responses appears inconsistent with the classical view of CM generation, where the response is dominated by contributions from cells located near the round window in the tail region of the BM excitation pattern. By analogy with mechanisms responsible for spectral ripples in distortion-product otoacoustic emission (DPOAE fine structure; Shera and Guinan 1999), we hypothesize that CM spectral ripples represent an interference pattern produced by CM components with different phase gradients: an early-latency (“classical”) component with a shallow phase gradient and one or more delayed components with rapidly rotating phases. We explore this possibility using a model of CM generation in which additional delayed components are created within the active, peak region of the BM displacement wave as a consequence of the existence of cochlear roughness (i.e., random spatial mechanical and/or electrical irregularities). When they are of sufficient magnitude, these delayed components interfere with the early-latency components, resulting in spectral ripples in the round-window CM. Our results suggest a promising new avenue for exploring the utility of the CM in extracting place-specific information on the active function of OHCs. Furthermore, our modeling efforts suggest a link between CM and otoacoustic emission responses evoked with pure tones (i.e., stimulus-frequency otoacoustic emissions, SFOAEs) that can help to unify our understanding of cochlear mechanics.
METHODS
Animal Preparation
The procedures for animal preparation are detailed elsewhere (Charaziak and Siegel 2015; Charaziak et al. 2017). In brief, adult chinchillas were anesthetized (induction: ketamine hydrochloride, 20 mg/kg; maintenance: diallylbarbituric acid, 50 mg/kg, in urethane, 200 mg/kg; supplemental maintenance doses of 20 % of the initial one were given to eliminate limb withdrawal reflex), tracheotomized, and secured in a head holder. The pinna and lateral portion of the ear canal were dissected so that the tip of the sound delivery system could be sealed within ~ 2 mm of the eardrum. The bulla was opened, the tendon of the tensor tympani was sectioned, and in some animals, the stapedius muscle was paralyzed through sectioning of the facial nerve (Songer and Rosowski 2005). A silver ball electrode was placed on the round window, the reference electrode was inserted in the skin of the contralateral ear, and the ground electrode was attached to the head holder. The rectal temperature was kept at ~ 37 °C. The status of the preparation was monitored via recordings of compound action potential (CAP) thresholds. The data collection involved experiments that were part of another study (Charaziak and Siegel 2015). Experimental protocols were approved by the Animal Care and Use Committee of Northwestern University.
Instrumentation
All measurements were carried out in an electrically shielded sound-attenuating booth. Stimulus waveforms were generated and responses acquired and averaged digitally using 24-bit sound card (Card Deluxe-Digital Audio Labs; sampling rate 44.1 kHz) controlled with EMAV software ver. 3.24 (Neely and Liu 2011). The RW electrode signal was differentially amplified (40 dB), band-pass filtered (0.1–30 kHz), and corrected for the acoustic delay of the sound delivery system, as well as for the delay of the preamplifier filter. The stimuli were presented via two modified Radio Shack RS-1377 Super Tweeters coupled via plastic tubing to the probe-microphone system (a modified ER-10A OAE probe, Etymōtic Research, Elk Grove, IL). The speakers were grounded and shielded with heavy steel boxes to minimize electrical and magnetic radiation. Any contamination of the CM signals from the speakers occurred at levels below the system noise floor for all stimulus conditions. Stimulus levels were calibrated in situ to maintain a constant sound-pressure level at the probe microphone, whose output voltages were corrected for the frequency-dependent sensitivity of the microphone (Siegel 2007).
Measurements
The RW signal was measured in response to pure tone(s) of ~ 1.57 s duration, including 10-ms onset/offset ramps. The stimuli consisted of 30-dB SPL probe tones (fp, varied from 0.5 to 16 kHz) presented together with an additional, 65-dB SPL saturating tone (fs = fp-43 Hz) on alternate runs. The probe frequency was varied in fine steps (86 Hz) to avoid phase unwrapping errors. The probe level was kept low to limit the spread of BM excitation toward the base that typically occurs at higher stimulus levels. Each condition was immediately repeated, and the responses stored in separate buffers (A and B). The two response buffers were averaged () and subtracted () to obtain estimates of either the CM or the noise amplitude at the frequency of the probe (obtained via fast Fourier transform), respectively. In both cases, the first and last 46.4 ms of the response buffer was skipped to prevent contamination from responses to onset and offset transients (e.g., CAP). The microphone response was measured simultaneously with the RW signal and was used to eliminate data where probe slippage was suspected (i.e., deviations in the measured probe SPL from its target value).
RESULTS
Data from 19 chinchillas (of 22 tested) showed no signs of probe slippage throughout the measurements (i.e., stable ear-canal pressure) and were included in the analyses. In all cases, CAP thresholds were considered normal (i.e., typically within 20 to 40 dB SPL in the mid frequencies, rising above 60 dB SPL above ~ 12–18 kHz), and both round-window CMs and robust OAE responses (distortion-product and stimulus-frequency OAEs) were obtained.
At low frequencies, the RW electrode signal contains phase-locked auditory nerve action potentials (auditory neurophonics) as well as hair-cell potentials (e.g., Henry 1995; He et al. 2012; Lichtenhan et al. 2013). Interference between the neurophonic and the CM might result in sharp peaks and notches in the CM levels. To avoid confusion between interference patterns that may include neural components, we analyze data only at higher frequencies (5–16 kHz) where contamination from neurophonic potentials is highly unlikely due to the low-pass nature of neural phase locking (Johnson 1980; Weiss and Rose 1988).
Spectral Ripples in the CM Responses
Figure 1 shows examples of the round-window CM responses to low-level tones (solid black) for three representative animals (columns). In all cases, the CM levels (a–c) exhibit a striking quasiperiodic pattern of peaks and notches (i.e., ripples). The spectral ripples in the CM levels can be better appreciated following detrending (d–f; the subtracted trend levels are shown in a–c with black dotted lines). Although the level rippling pattern varies idiosyncratically from animal to animal, the ripple spacing (e.g., horizontal arrows in Fig. 1) and ripple depth (vertical arrows) typically increase with probe frequency (on average from ~ 1 to ~ 2 kHz and from ~ 5 to ~ 20 dB over 5 to 16 kHz range, respectively; also see Fig. 7, black). The unwrapped CM phase demonstrates subtle ripples as well (g–i), although these are better visualized following calculation of the local phase slope (i.e., group delay; j–l). Within an animal, the patterns of spectral ripples observed in the CM levels and phase-gradient group delays are similar (e.g., f vs. l), with discrepancies most often observable near CM level minima, where lower signal-to-noise ratios could have compromised the phase estimate (e.g., deep level notches in d and e tend to line up with discontinuities in group delays in j and k, respectively). In some animals, the CM phase not only demonstrates subtle ripples but also exhibits regions with steeper slope that translate to longer group delays (~ 0.4 ms; e.g., H03 and E08 in Fig. 1g, h). Such steep phase slopes are inconsistent with the classical view of the round-window CM responses, which predicts that the CM phase gradient should approximate the sum of the stimulus delay (here ~ 6 μs) and the forward middle-ear delay (~ 0.05–0.08 ms as in Slama et al. 2010; Robles et al. 2015) due to dominance by potentials generated at the basal end of the BM excitation pattern, most proximal to the electrode (but see also the “DISCUSSION” for an alternative interpretation of the shallow phase of the classical CM component). The presence of the spectral ripples in the CM level and phase is suggestive of interference between early and delayed CM components with shallow and steep phase slopes, respectively. In some cases, such delayed components may even dominate the overall CM response, as exemplified by the regions of steep phase slopes (e.g., g, h).
When a moderate-level saturating tone is swept together with the probe at a nearby frequency, the CM response becomes smoother, and no ripples are evident in either the CM levels or phase gradients (Fig. 1, red). As a result, the CM phase has a consistently shallow and nearly constant slope across frequency, corresponding to group delays of ~ 0.05–0.1 ms. Evidently, the saturating tone removes contributions from delayed CM components with steep phase slopes that would otherwise interfere with CM components with shallow phase slopes, producing the pattern of spectral ripples clearly observed in the CM levels and phase-gradient group delays. Because the saturating tone is believed to act primarily on OHC currents arising near its CF place (Geisler et al. 1990; Charaziak et al. 2017)—which is close to that of the probe tone—the delayed CM components are likely to originate within the peak region of the BM response to the probe. In contrast, the saturating tone has little effect on the global trend in CM levels (compare solid red and dashed black lines in a–c), although small reductions or enhancements (typically within ± 5 dB) are sometimes observed (Nuttall and Dolan 1991). (Small increases or decreases in CM level can be explained by the classical view of the CM generation; see “DISCUSSION”). These patterns indicate that CM components having shallow phase slopes are largely unaffected by near-probe saturating tones, consistent with their domination by sources located basal to the CF place of the probe. In other words, the CM measured in the presence of the near-probe saturating tone displays characteristics expected from the classical theory of CM generation.
Unmixing CM Components in the Pseudo-Time Domain
We test for CM components having different phase gradients by trying to separate them using a time-windowing method. Similar signal processing strategies have been used previously to unmix OAE components having different phase-gradient group delays (Kalluri and Shera 2001; Knight and Kemp 2001; Shera et al. 2008). If the semiregular pattern of CM ripples arises due to interference between CM components with different phase slopes, then an inverse Fourier analysis of the CM signal should reveal components with different latencies. Complete or partial separation of such components can be achieved using time windows. However, due to cochlear frequency dispersion, the CM components may be smeared in time, rendering them more difficult to separate by time windowing. To compensate for this dispersion, we performed the Fourier analysis with respect to a transformed frequency coordinate derived from the empirical place map of chinchilla near-CF basilar membrane group delays, τBM (for details, see Shera et al. 2008). The CM measurements were interpolated to frequencies spaced at intervals inversely proportional to τBM, where , with parameters a, b, c, and f0 set to 0.496 ms, 1.863 ms, − 0.771, and 1 kHz, respectively (Siegel et al. 2005). The same strategy has been applied to unmix stimulus-frequency OAE components in chinchillas (Shera et al. 2008).
Figure 2 shows the latency-domain representation of the round-window CM signal for one chinchilla (b, black) together with the mean CM latency function obtained from 19 animals (a). Because of the quasi-power-law transformation applied before Fourier analysis, the conjugate “latency units” (lu) along the abscissa have no simple physical interpretation. On average, the latency demonstrates a single prominent peak centered at ~ 1 lu (Fig. 2a, black). In the individual data, however, several distinct peaks can be observed at longer latencies (b, black). Because the delayed peaks in the latency domain do not align perfectly for different animals, they tend to smear out through the averaging process. Nevertheless, in all cases, the energy at later latencies (> 3 lu) is considerably reduced when a near-probe saturating tone is presented concomitantly with the probe (Fig. 2, red). In contrast, the saturator has little effect on the amplitude of the early peak centered at ~ 1 lu. These results are consistent with the existence of delayed CM components with steep phase slopes.
To clarify how the units along the latency axis (lu) relate to cochlear travel times (i.e., to place of origin) and thereby facilitate the choice of unmixing windows, we created artificial signals with known phase-gradient delays (τAS) and transformed them to the latency domain so that the location of their respective peaks could be identified (see shaded areas in Fig. 2). The latency function of artificial signals that mimic pure middle-ear delays (τAS = τME = 0.05–0.1 ms) peaks within the first 2 lu (gray box). Thus, the first prominent peak in the data (at ~ 1 lu) presumably represents CM sources located near the basal end of the cochlea, as predicted by the classical theory. When τAS matches forward BM travel time (range from 0.8 to 1.0 times τBM), the latency function peaks between 5 and 10 lu (red box). [The τBM was set as the near-CF basilar membrane group delay in chinchillas derived by Siegel et al. (2005) and Shera et al. (2010).] In about half the animals, distinct peaks at longer latencies are also observed (Fig. 2b). Simulations indicate that latency functions with τAS in the range 1.3–2.0 times τBM peak from 11 to 20 lu (blue box). Thus, these long-latency CM components have latencies roughly corresponding to the round-trip travel time of the BM excitation to and from the CF place.
To separate putative CM components in the latency domain we used 10th-order recursive exponential windows (Shera and Zweig 1993). The windowed signals were then converted back to the frequency domain with a Fourier transform. The early components were separated from the delayed components using a window with half-power duration of 4 lu (Fig. 2, green solid). As shown in the examples in Fig. 3 (middle and right columns), the early-latency components (CMEL) have a shallow phase slope and a level that varies smoothly with frequency. The phase-gradient delays average 0.05–0.08 ms (d, black), values which agree well with the range of middle-ear delays in chinchillas. On average, CMEL levels decrease with frequency at a rate of 5–6 dB/oct. This trend may reflect stimulus attenuation due to middle-ear transmission, where a local decrease in sensitivity (by ~ 7 dB) is typically observed as frequency increases from ~ 6 to ~ 14 kHz despite the transfer functions having relatively broadband characteristics otherwise (also see Dallos et al. 1974, for middle-ear effects on CM measurements; Robles et al. 2015). Furthermore, the decrease in CMEL level with frequency may relate to cochlear “end effects” (e.g., for probe frequencies peaking near the basal end of the cochlea, the tail region of the BM excitation pattern shortens and currents through fewer OHCs can add in phase). In general, the overall characteristics of CMEL agree with the predictions of the classical theory of the CM generation.
In contrast, the delayed CM (> 4 lu) level does not vary smoothly with frequency and demonstrates idiosyncratic patterns of magnitude peaks and notches (Fig. 3b, c, green). Typically, the delayed CM is lower in level than the early CM (Fig. 3a), although exceptions occur at higher frequencies (b), because, unlike the CMEL, the delayed components tend to increase in level with increasing probe frequency. The delayed CM phase rotates rapidly with frequency, with slope changing abruptly from ~ 1.3τBM (mid gray area in d) to ~0.8τBM (dark gray area) below and above ~ 8 kHz, respectively, as if components with different phase gradients are dominating the response at low vs. high frequencies. To test that possibility, we further unmixed the delayed CM using a 10-lu window duration (Fig. 2, dashed green). The resulting middle-latency (4–10 lu) and late-latency (> 10 lu) components are shown in Fig. 3 (red and blue, respectively). The late components (CMLL) are small in magnitude (~ 20 dB below CMEL), exhibit an idiosyncratic pattern of magnitude microstructure (Fig. 3b, c, blue), and decrease in level with increasing frequency at a rate similar to the CMEL (Fig. 3a). Throughout the frequency range, the CMLL has phase-gradient delays corresponding to estimated otoacoustic round-trip travel times in chinchillas [~ 1.3τBM, (Shera et al. 2008)]. This suggests that the late-latency components arise through a process of internal reflections of the traveling pressure-difference wave between the CF place and stapes (Shera and Cooper 2013). In contrast, the levels of the middle-latency components (CMML) increase with frequency (Fig. 3a, red), so that CMML are lower/higher in magnitude compared to CMLL at frequencies, respectively, below/above 8 kHz. Although CMML levels also exhibit an animal-specific microstructure (Fig. 3b, c, red), the pattern of peaks and notches does not align with the microstructure of the late CM (blue). Furthermore, CMML phase-gradient group delays are slightly shorter than the cochlear forward travel time to the CF place (~0.8τBM, Fig. 3d). This suggests that the middle CM arises from regions slightly basal to the probe-tone CF place. The differences in CMML and CMLL magnitude microstructures, group delays, and global changes in level with frequency suggest that these two components arise via different mechanisms, rather than CMLL being simply a “delayed reflection” of CMML.
The results of the component unmixing are robust to reasonable variations in the analysis parameters, specifically to changes in the form, order, and duration of the unmixing windows (the latter by ± 1 lu), despite the limited resolution of the latency-domain function (note the proximity of the colored boxes in Fig. 2). Using recursive exponential windows of high order allows for relatively clean separation of the unmixed signals in the latency domain, although some energy leaking across windows cannot be avoided unless rectangular windows are used. Rectangular windows, however, introduce spectral artifacts when converting the unmixed signals back to the frequency domain. We tested the “leakage” of the recursive exponential windows by applying the unmixing strategy to signals synthesized from the data to include only the early- and late-latency components (i.e., the middle-latency component was absent). If the component separation were perfect, then no middle-latency component would be obtained from the synthesized signals. Although the middle-latency components unmixed from the synthesized signals were nonzero, their amplitudes were ~ 15 dB below those seen in the data, indicating that energy leakage between windows is relatively low. We selected the window durations based on average BM near-CF group delays, as estimated from auditory nerve fiber data in a different group of animals (Siegel et al. 2005; Shera et al. 2010). Although this strategy facilitates interpretation in terms of cochlear travel times, it neglects possible differences in cochlear travel times in individual animals, so that the chosen window durations may have not been optimal in each case (e.g., Fig. 2b). Finally, we tested our unmixing strategy on artificial signals that contained either all three components or only the early-latency component. If the delayed components were not present in the analyzed signal, the unmixed delayed signals were at least 35 dB lower in level compared to when all three components were present (i.e., unmixing “noise”). Importantly, the trends we describe do not depend sensitively on the details of the signal-processing strategy—we obtained similar results using an alternative unmixing method based on the continuous wavelet transform (Moleti et al. 2012).
CM Model Description
We explore the origin of the various CM components by modifying the classical theory of CM generation to include effects of possible cochlear irregularities (i.e., “roughness”). In general, we assume that the local complex-valued CM voltage produced by the ionic current flowing through a hair cell is proportional to the local BM displacement (DBM), and that the voltage spreads instantaneously throughout the cochlea, reaching the electrode with no additional delay (He et al. 2004; Cheatham et al. 2011; Charaziak et al. 2017). The CM at the round window is calculated as the vector sum of all the local sources, weighted exponentially with distance from the source to mimic the effects of electrical attenuation. Because we are concerned only with responses to low-level stimuli, and simulations show that inclusion of the nonlinear hair-cell transducer function has negligible effects on the results, we omit nonlinear elements in the model (see “DISCUSSION”).
According to the classical theory of CM generation, the round-window at probe frequency fp represents a vector sum of the individual voltage sources excited by the forward traveling wave on the BM, weighted according to their distance (x) from the round window:
1 |
where DBM(x, fp) is the normalized complex-valued spatial excitation pattern of BM displacement produced by the probe frequency fp, V0 represents the intrinsic strength of the local voltage sources (assumed independent of x), and w(x) is the weighting function that controls the attenuation with distance from the source (e.g., Cheatham et al. 2011; Charaziak et al. 2017). We used with attenuation rate A in decibels per millimeter. (All modeled CM components are marked with a hat to distinguish them from measured data.)
We now extend the classical model to include plausible effects of small, cochlear irregularities on the CM generation process. For generality, we distinguish two possible types of irregularity in the cochlea: “micromechanical” irregularities (rm) that represent spatial perturbations in the micromechanics of the organ of Corti and “microelectrical” irregularities (re) that represent spatial variations in the strength of the OHC voltage sources, although the two may be closely related. If one adds both irregularities to the model, two things happen. First, the traveling wave is scattered by the micromechanical roughness, creating a reverse traveling wave via a process described by the coherent reflection theory of OAE generation (Zweig and Shera 1995). Second, the constant V0 becomes V0[1 + γere(x)], where γe is a microelectrical scaling factor introduced for convenience. The assumption here is that the strength of the voltage sources, previously assumed constant (or at least smoothly varying), acquires an irregular or random component. So, to first order, the equation for the round-window CM becomes:
2 |
where Rs represents the reflection coefficient for reverse waves at the stapes and the cochlear reflectance R describes the reflection of the forward traveling wave near its CF place. The cochlear reflectance is as approximated by:
3 |
where γm is a micromechanical scaling factor (chosen so that |R| < 1). Equation (3) simplifies the physics by assuming that the cochlear reflection near the CF place is proportional to the square of the BM displacement wave, rather than to the square of the pressure-difference wave (Shera et al. 2008). Derivation of the pressure difference from the BM displacement requires estimates of the BM impedance (or, equivalently, the wavenumber). Since these are not available for our BM displacement patterns (see below), we adopted the simplified expression for R. The main consequence of this simplification is that the model predicts a round-trip OAE travel time of ~ 1.8τBM, somewhat larger than the value of ~ 1.3τBM obtained using the pressure-difference wave (Shera et al. 2008). The Shera et al. (2008) predictions match well the estimated cochlear round-trip travel times from stimulus-frequency (SF) OAE and BM measurements.
One can immediately see that our extension of the classical theory to include the two kinds of irregularity predicts that the CM has three components, as schematically depicted in Fig. 4. The first component is simply the classical CM, which is dominated by basal sources producing an early-latency response [; Eq. (1)]. The second component is due to the microelectrical roughness, and we show that it produces a “middle-latency” response due to coherent electrical summation at the peak [i.e., ]. The third component is due to internal reflection of the traveling wave and produces a long-latency response [i.e., ].
As in the classic theory, the early-latency component is dominated by the contributions from the tail region of the primary forward traveling wave (Fig. 4, black solid arrow), where the phase varies slowly and the BM moves nearly in unison (dashed black box), while contributions from the CF region are largely canceled out. In contrast, the middle-latency component arises predominantly within the CF region through an electrical analog of “coherent reflection” (Zweig and Shera 1995). Specifically, when the microelectrical irregularities include spatial-frequency components that match the wavelength of the traveling wave, the resulting voltage potentials combine coherently. Sources within the peak region that would normally tend to cancel one another then sum in phase (i.e., coherent summation; red dashed box in Fig. 4). Because this CM component originates within the peak region of the primary forward traveling wave, its latency should be close to the forward BM travel time to the CF place. (Recall that the reverse travel is electrical and therefore effectively instantaneous.) The long-latency component arises due to the internal reflection of cochlear traveling waves. The micromechanical irregularities scatter the primary forward traveling wave (black solid arrow, Fig. 4) and the resulting reflected wavelets combine coherently near the wave peak when the irregularities contain spatial-frequency components matching half the wavelength of the traveling wave (Zweig and Shera 1995). The process of coherent reflection creates the reverse traveling wave that propagates back toward the stapes (red solid arrow; Zweig and Shera 1995), where it is represented by the cochlear reflectance, R. At the stapes, part of the reverse wave energy is transmitted through the middle ear to the ear canal, where it can be measured as a reflection-source emission (SFOAE). The other part is reflected back into the cochlea (with reflection coefficient, Rs), giving rise to a secondary forward traveling wave (blue solid arrow, Fig. 4). Just as for the primary traveling wave, the current sources excited by the BM displacement resulting from the reverse wave and the secondary forward wave add constructively in the tail region of the combined excitation pattern (blue dashed box), giving rise to CM components with latencies equivalent to the round-trip travel time to and from the CF place. Although these long-latency components are dominated by CM sources located in the tail region of the BM excitation pattern, their amplitudes depend on the size of the reverse traveling wave and are therefore related to the OAE. For example, because the amplitude of the reverse wave depends on cochlear amplification at the CF place, a reduction in this amplification (e.g., resulting from high stimulus levels, damage, or a saturating tone) decreases the relative amplitude of the long-latency CM components. Unlike the classical, early-latency component, the new roughness-related middle- and long-latency components depend on cochlear activity near the CF place.
To compute CM components using the model, we derived estimates of longitudinal BM displacement profiles [DBM(x, fp)] from published chinchilla data (Rhode 2007b; Ruggero et al. 1997, 2000). We converted frequency responses at a single location (30-dB SPL tones at CFs ranging from 6.6 to 10 kHz) to spatial profiles at fixed probe frequency using scaling. All derived displacement profiles were normalized to their peak value and interpolated to a resolution of 2.4 μm over a BM length of 10 mm. (For comparison, the width of a single hair cell is about 10 μm.) To simulate responses over a range of frequencies, BM displacement profiles were translated using the chinchilla frequency-position map (Muller et al. 2010), with the CF varied from 5 to 16 kHz in 86-Hz steps. Although simulations were performed for each of four different BM displacement profiles obtained from different cochleae [(Rhode 2007b) for chinchillas N92 and N157; (Ruggero et al. 1997) for L113; and (Ruggero et al. 2000) for L208], we show only those results obtained for cochlea L208 for brevity; similar patterns were obtained in all cases.
Because the BM displacement and CM data were obtained from different groups of animals, and because irregularity patterns in individual animals remain unknown, only qualitative comparisons between model predictions and CM data are currently possible. To simulate data from individual “ears,” we computed CM components for 19 different realizations of the roughness. For computational convenience, we assumed that the micromechanical and microelectrical irregularities have the same spatial pattern (r = rm = re). The common roughness function, r(x), was obtained by drawing random numbers from a zero mean normal distribution with a standard deviation of 0.2. The unknown stapes reflectance (Rs) was assumed frequency-independent and set equal to 0.9. Adjusting the complex value of Rs modulates the amplitude and phase of the reflected wave but has no qualitative effect on the model conclusions. The value of V0 was set arbitrarily to 1 V. The remaining free parameters of the models (A, γm, and γe) were initially varied until the relative component levels visually mimicked major trends observed in the data (Fig. 3a) and were then fixed at these constant values. The electrical attenuation constant A was set to 2 dB/mm for all simulations; this value agrees with estimates of the attenuation rate suggested by Charaziak et al. (2017) and produces an increase in level with frequency that matches the trends observed in the data (Fig. 3a). Although the early and late CM components are insensitive to the value of A, the middle components are strongly attenuated and contribute little to the round-window response at larger values of A (A > 4 dB/mm). The scaling parameters γm and γe determine the relative amplitudes of the various CM components and were fixed at the values 24 and 5, respectively; these values provide good approximations to the trends in the data (i.e., and ).
CM Model Results
Model-Predicted CM Components
Figure 5 shows early, middle, and late CM components simulated for two roughness profiles (middle and right columns) together with mean results computed for 19 profiles (left). The component does not depend on roughness and therefore remains unchanged across the different “ears,” so long as the BM displacement profile used in the simulations remains fixed (as here).
The simulated has nearly zero phase-gradient delay (< 0.05 ms, Fig. 5d, black) and levels that vary little with frequency (a–c). Thus, resembles both the CM measured in the presence of the near-probe saturator (Fig. 1, red) and the unmixed early-latency component, CMEL (Fig. 3, black). However, level does not vary systematically with frequency and its group delays are shorter than those of CMEL. These discrepancies are likely due to the absence of a middle-ear model in the simulations. Furthermore, cochlear “end effects” may have been underestimated in the simulations, since different frequencies were simulated by simply translating the BM displacement profiles uniformly along the cochlear length with no accompanying change in shape. Stronger end effects are observed when the simulations are extended to frequencies approaching 20 kHz; in these cases, the level of the simulated CM decreases with increasing frequency due to the considerable shortening of the tail region of the BM excitation profile (in the model x = 0 mm was set to have a CF = ~ 30 kHz). Finally, because is dominated by contributions from sources located in the tail region of the BM excitation pattern, its level is not sensitive to changes in the attenuation constant, A [Eq. (1)], whose value primarily affects currents originating at more apical locations. However, a small increase in phase slope is observed for A > 0 dB/mm (see “DISCUSSION”).
Like the early-latency component, the late component in the model arises from basal sources. The key difference is the identity of the wave driving the OHCs: Whereas the early component is driven by the stimulus-induced forward traveling wave, the late component is driven by the reverse traveling (OAE) wave combined with its reflection at the stapes. Because both and arise from basal sources, levels generally parallel levels across frequency (Fig. 5a, blue vs. black), as seen in the data (Fig. 3a, blue vs. black). Unlike , however, the phase of the late component rotates rapidly with frequency—the delay equals the round-trip travel time derived from the model (~ 1.8; blue vs. light gray in Fig. 5d)—and its level exhibits idiosyncratic spectral structure (e.g., notches) similar to SFOAEs. Thus, the features of qualitatively match those of the long-latency components unmixed from the data (Fig. 3, blue).
The middle-latency component in the model arises from CM sources located near the peak of the BM excitation pattern (Fig. 4). Overall levels are therefore strongly influenced by the attenuation rate, particularly for lower frequency stimuli. As a result, when A is nonzero, levels tend to increase gradually with increasing frequency (Fig. 5a, red). group delay matches ~ 0.8 (Fig. 5d, red vs. dark gray), as if the region of coherent summation is centered slightly basal to the probe CF place [as also observed for coherent reflection, e.g., (Shera and Guinan 2003)]. Finally, displays idiosyncratic patterns of magnitude microstructure (b, c, red). All these features agree well with the properties of the unmixed middle-latency components seen in the data (Fig. 3, red).
Importantly, even though we used the same roughness profiles for the calculation of both and (r = rm = re), their spectral microstructures appear uncorrelated (Fig. 5b, c, red vs. blue). This occurs because the coherent summation of and OAE sources depends on different spatial-frequency components in the pattern of irregularities. Because depends on the product rDBM (i.e., on DBM raised to the first power), irregularity components whose spatial period matches the wavelength of the traveling wave at its peak contribute most to By contrast, the cochlear reflectance R depends on (i.e., on DBM raised to the second power), and therefore, the dominant spatial-frequency components for both SFOAEs and are those whose period is half the wavelength of the traveling wave.
In summary, the modeling results agree remarkably well with the data. The model correctly predicts that the overall levels of the early and late components change in parallel across frequency, while the middle-latency component level exhibits an opposing trend (compare Figs. 3a and 5a). Although inclusion of the electrical attenuation with distance from the RW explains the gradual increase of the middle-component level with frequency, the simulations do not reproduce the opposite trend seen in the data for the early and late components unless higher frequencies, peaking near the simulated cochlear base, are included. The phase-gradient delays of the model components match the trends observed in the unmixed data. [Note that the total phase accumulation of the delayed model components is larger than seen in the data (Figs. 3e, f and 5e, f); this is due to the perfect scaling symmetry assumed in the model, which results in longer BM group delays than observed experimentally across the range of frequencies. Thus, it is more meaningful to compare the component group delays relative to the underlying BM group delays (e.g., Figs. 3d and 5d) than to compare the unwrapped phase lags.] Although the ratio of τLL/τBM is closer to ~ 1.3 in the data (Fig. 3a), as opposed to the predicted ~ 1.8 ratio in the model (Fig. 5d), this discrepancy is the expected consequence of adopting a simplified form for the cochlear reflectance, R(fp) [Eq. (3)]. Our model results are consistent with the predictions of Shera et al. (2008), who used a more accurate form of Eq. (3) based on the traveling pressure-difference wave rather than DBM. According to Shera et al., the coherent reflection model predicts OAE round-trip travel times of ~ 1.3τBM, in agreement with the data, at least for the basal end of the cochlea (in chinchillas > 3–4 kHz). Thus, the delay of the unmixed CMLL component agrees well with the theory of its origin in the reverse OAE wave. Finally, the simulations predict highly individualized, uncorrelated patterns of spectral microstructure in the middle and late components (Fig. 5b, c), while the early components vary relatively little from ear to ear as long as the underlying BM excitation patterns are similar (as assessed for BM displacement profiles derived from four cochleae; data not shown). The unmixed data indicate similar trends (Fig. 3b, c).
Total CM at the Round Window
Figure 6 shows examples of modeled total round-window microphonics for various possible compositions of the (see legend). In the classical theory, only the early-latency components ( are included, and no spectral ripples in either CM magnitude (red, top rows) or group delay (bottom) are observed. When the middle-latency component is added to the early component (, an ear-specific rippling pattern emerges in both the total microphonic level and its group delay (Fig. 6, black). As observed in the data (Fig. 1, black), the ripple depth varies from ear to ear depending on the relative magnitudes of the two components. In ear no. 3, for example, the levels of and are similar (Fig. 5b, black and red), creating deep notches (~ 20 dB) in the total CM magnitude and regions of steep and shallow phase slopes (Fig. 6d, g, black). This example resembles the data for animal H03 (Figs. 1 and 3). When is lower in level than (no. 13, Fig. 5c), the rippling pattern in total CM is less pronounced (Fig. 6f, i, black), similar to the pattern observed for animal G29 (Figs. 1 and 3). Because the modeled late component is typically lower in level than the middle component (Fig. 5a), inclusion of in the total (Fig. 6, gray) introduces a small modulation to the ripple spacing but does not appreciably alter the ripple depth.
The spacing between ripples can be quantified by extracting it from the total microphonic using signal processing (Fig. 7). When the simulated includes all three components (green), the ripple spacing tends to be narrower than when is computed without the late component. Comparison with the ripple spacing obtained from the total measured CM (black) shows that the simulations nicely capture the change in ripple spacing across frequency. In the mid-frequency range, the data agree better with the spacing predicted for responses (red) than for (green), plausibly due to our overestimation of round-trip travel times in the model computations (see above). The ripple spacing is determined by the relative delay between the contributing components. This concept can be better visualized if the ripple spacing of the consisting of early- and late-latency components (blue) is compared to the spacing obtained for a mixture of early- and middle-latency components (red). Because the phase of the long-latency component rotates more rapidly with frequency than the phase of the middle-latency component, the is in and out of phase with at narrower frequency intervals, resulting in narrower ripple spacing (Fig. 7, blue).
DISCUSSION
Further Evidence for Origin of the Delayed CM Components in the Peak Region of the BM Excitation
The interference pattern observed in the round-window CM (Fig. 1) suggests contributions from components with different phase-gradient delays (Fig. 2) that may originate in different cochlear regions. The delayed CM components, but not the early-latency CM, are effectively diminished in magnitude in the presence of a near-probe saturating tone (Fig. 2). This result suggests that the delayed CM components originate in the active peak region of the BM response to the probe tone where nonlinear two-tone interactions are strong (Rhode 2007a). The additional data reproduced in Fig. 8, obtained as a part of another experiment (Charaziak et al. 2017), further support this conclusion. The data show that the spectral ripples observed in the CM responses to low-level tones (black) are diminished (1) at higher probe levels (dotted red; linearly scaled down), (2) in the presence of a near-probe saturating tone (solid red), and (3) following acoustic trauma that causes a mild threshold elevation (dashed gray; see caption for details). In each of these three experimental conditions, the cochlear gain at the CF place is known to be decreased (Robles and Ruggero 2001), and the reduced rippling patterns suggest that the delayed components, but not necessarily the early ones, are diminished in magnitude. In contrast, the spectral ripples are affected relatively little with a tendency for small increase in the ripple depth when the frequency of the saturating tone is set more than an octave above the probe tone (blue). Such remote saturating tones do not suppress cochlear gain at the CF place (Rhode 2007a) and are therefore unlikely to affect the CM components originating in that region. Instead, a decrease in the overall CM level is observed in the presence of the remote saturator, consistent with suppression of sources located in the tail region of the BM response to the probe tone, where large early-latency components originate. While these preliminary results reveal promising trends, a more systematic study of dependence of different latency CM components on active cochlear processing is warranted. Better understanding of how different CM components are shaped by active cochlear processing could improve interpretation of changes in CM seen due to activation of the efferent system, application of ototoxic drugs or genetic mutations (Gifford and Guinan 1987; Fitzgerald et al. 1993; Cheatham et al. 2011; Elgueda et al. 2011). Measurements with intracochlear electrodes placed near the CF place of the stimulus support the existence of “active” CM potentials demonstrating nonlinear growth and, at low levels, sharp tuning with rotating phase characteristics (Dallos et al. 1974; Fridberger et al. 2002, 2004; Dong and Olson 2013). Whether these intracochlearly measured nonlinear CM potentials correspond to the delayed CM components seen at the RW needs further investigation.
Spectral Ripples in CM Responses and in Ear-Canal Pressure
Are all three components necessary to explain the existence of spectral ripples in the round-window CM? In theory, the spectral ripples can emerge in the CM response as interference between the early components and either late- or middle-latency components (Fig. 9b, d, black). Additional insights into that problem can be provided by comparing the CM ripples with the rippling pattern observed in the ear-canal pressure. It is well known that the quasiperiodic ripples in the ear-canal pressure are due to interference between the evoking tone and the delayed SFOAE (e.g., Kemp and Chum 1980; Zwicker and Schloth 1984; Shera and Zweig 1993). Because middle-ear delays are small compared to cochlear round-trip travel times, the model predicts that the pattern of spectral ripples evident in ear-canal pressure should be correlated with the ripples observed in the round-window CM due to interference between early and late CM components, as simulated in Fig. 9b. This is because our model predicts that late CM originates through a process of internal cochlear reflection closely related to the generation of SFOAEs. In the data, we observed a good match between the CM and ear-canal pressure rippling patterns in about 50 % of the animals, but only at lower frequencies (Fig. 9, the gray boxes in the left-side plots). This observation agrees with the results of the component unmixing, where the late-latency components are typically larger than the middle-latency components at the lowest frequencies (Fig. 3a) as they are less susceptible to the effects of the electrical attenuation with distance. For higher frequencies, where the middle-latency components contribute more to the CM response, the CM rippling pattern does not correlate with the ear-canal pressure ripples, as predicted by the simulations (Fig. 9d, f). The rather abrupt change in the correlation between the CM and ear-canal rippling patterns with frequency provides independent evidence of the existence of at least two delayed CM components with different phase gradients that mix with the early-latency CM.
Species Differences
To the best of our knowledge, high-resolution CM measurements of the type used here to test models of CM generation are not available in other species, with the partial exception of gerbils. In contrast to the round-window CM in chinchillas, the CM in gerbils does not appear to exhibit strong rippling patterns, even at low stimulus levels (He et al. 2012). However, the CM measurements extend only to the relatively low (for gerbils) frequency of 10 kHz. Based on the chinchilla data, the largest contribution to the round-window CM from middle-latency components occurs for tones with CFs mapped to basal cochlear locations due to their proximity to the electrode. This agrees with observations in mice, where round-window CM suppression tuning curves are sharply tuned only at very high probe frequencies, as would be expected if the place-specific middle-latency component dominated the measured signal (Cheatham et al. 2011). Furthermore, CM rippling patterns resulting from interference with late-latency CM components are expected in species with strong intracochlear reflections such as chinchillas and humans (Dhar et al. 2002; Siegel et al. 2011; Shera and Cooper 2013). In contrast, relatively weak reflection-source OAEs in gerbils and mice (Kemp 1986; Siegel et al. 2011) suggest such reflections may be small, if present at all in these species. While high-resolution CM measurements in other species will undoubtedly provide additional insights into CM generation, species differences in cochlear frequency maps, in electrical attenuation constants, and in the length and shape of the cochlea (e.g., the number of turns) all complicate interspecies comparisons.
Interpretation of CM Phase: Place- and Wave-Fixed Components
In the classical theory of CM generation, the frequency invariance of round-window microphonic phase has been ascribed to the dominance of contributions from CM sources located in the basal region of the cochlea (He et al. 2012). We suggest, however, that the shallow phase gradient of the classical CM derives primarily from the presumed mechanism rather than from the place of CM generation. In other words, because the classical theory ignores the role of micromechanical and microelectrical irregularity, it predicts that CM phase gradients are shallow no matter where the sources are located. A useful analogy here is to “wave-fixed” OAE sources—the phase of wave-fixed OAEs is nearly independent of frequency in a scaling-symmetric cochlea (e.g., Kemp and Brown 1983; Shera and Guinan Jr 1999). When combined with an exponential cochlear frequency-position map, scaling symmetry implies that traveling wave excitation patterns are locally translation-invariant, with the number of wavelengths in the traveling wave remaining nearly independent of frequency. Thus, even though a low-frequency wave travels farther than a high-frequency wave to reach its CF place, both waves travel the same number of wavelengths, and their total phase accumulation is therefore the same (Shera and Guinan 2008). As a result, the phase of wave-fixed components—whether they be components of OAEs or components of the CM—cannot vary with frequency. Wave-fixed mechanisms, no matter where they operate in the cochlea, must give rise to early-latency components.
To validate this reasoning computationally, we use the classical model to evaluate contributions to arising from the CF region—such contributions can arise due to incomplete cancelation of sources within the peak region. According to the wave-fixed hypothesis, peak region contributions to the classical CM should have a shallow phase slope despite their more apical origin. In the model, these peak region components can be extracted by simply restricting the summation to locations near the CF place. Alternatively, the peak region components can be also extracted with a help of a near-probe saturating tone (Charaziak et al. 2017). Because this additional tone saturates the transducer currents of the hair cells located near its own CF place (Geisler et al. 1990), contributions from these cells to the total response can be gauged by deriving the complex difference between probe-tone responses obtained with and without the saturating tone (i.e., residual CM). In the limit that the saturating tone has no effect on the probe-frequency CM sources, the residual CM (rCM) is ideally zero (in the absence of noise). In the opposite limit, the saturating tone completely suppresses all sources, and rCM equals the probe-alone response. In practice, because of phase-interference effects, the round-window probe response can either decrease or increase in the presence of the saturating tone, even though the saturator always reduces the local source currents (see also Nuttall and Dolan 1991). In the model, the residual CM response closely resembles the features of the peak components extracted by simply restricting the summation to the CF region in Eq. (1). Because the rCM can also be obtained from experimental data, only the modeled results of extracting the peak region components with a saturating tone are discussed in the subsequent paragraphs.
To simulate two-tone interactions in the classical model, we modified Eq. (1) so that the local source excitation becomes proportional to the local BM displacement fed through a saturating transducer function. Specifically,
4 |
where the diacritical “~” indicates the nonlinear model and [∙] is an operator that converts the Fourier coefficient DBM(x, fp) into the corresponding time waveform dBM(x, t), combines that waveform with an optional saturating tone, passes the sum through a saturating transducer function [a second-order Boltzmann function; see Eq. (3) in (Charaziak et al. 2017) for details], and then extracts the modified Fourier component at frequency fp from the distorted waveform. We derived the residual microphonic () by calculating for BM displacement profiles obtained for single- (probe-alone) or two-tone (probe+saturating tone) stimuli. The saturating tone parameters were set to approximate those used in the measurements [Ls = Lp + 30 dB and fs = fp-86 Hz], with all other parameters set as in (Charaziak et al. 2017).
Figure 10a, c shows model results for magnitude and phase, respectively (black). The levels are 30–40 dB below the probe-alone level, indicating that the peak region of the BM excitation pattern contributes to the round-window CM, but that these contributions are relatively small (Fig. 10a, black), especially at low frequencies, due both to their attenuation with distance and the mutual phase cancelation that occurs among sources located near the CF region. Most importantly, even though components originate near the peak region of the BM excitation (Charaziak et al. 2017), phase is nearly independent of frequency, implying close to zero phase-gradient delay (Fig. 10c, black). Thus, because it assumes wave-fixed CM sources, the classical model of CM generation cannot produce CM components with long delays, even when the sources are located in the peak region of the traveling wave. Small but nonzero phase gradients for the early-latency CM component can be introduced by imperfect scaling at the level of the BM, by the effects of electrical attenuation with distance (which violate “electrical scaling”), and, perhaps most significantly, by delays associated with stimulus transmission though the ear canal and middle ear. In agreement with this latter view, He et al. (2012) showed in gerbils that round-window CM phase, when referenced to the stapes motion, is essentially independent of frequency across a wide range of frequencies. In summary, the classical model predicts that CM phase gradients are shallow no matter where the sources are located.
Contrasting with measured CM residuals (Fig. 10b, d) reinforces the failure of the classical (wave-fixed) model to fully account for the CM generation process. Unlike the small peak region components predicted by the classical model, those present in the data (rCM) can have magnitudes approaching the level of the probe-alone CM. In addition, rather than varying smoothly with frequency, as predicted by the classical model, the measured rCM manifests a quasiperiodic pattern of peaks and notches and shows considerable phase accumulation with frequency (from 2 to 9 cycles over the displayed range). To generate peak region components with the observed characteristics, the classical model must be extended to incorporate the effects of cochlear irregularities [Eq. (2)]—just as with OAEs, longer latency CM components require place-fixed mechanisms (e.g., micromechanical and/or microelectrical irregularity). Although the two roughness-dependent CM components (the middle- and late-latency components) arise by different mechanisms, both depend on BM responses near the peak region of the traveling wave, and their respective residuals ( and ) are therefore of substantial magnitude [Fig. 10a, c, red and blue; to simulate suppression, we modified Eq. (2) by analogy with Eq. (4)]. These two CM components behave in a fashion similar to “place-fixed” OAEs, for which the source phase is tightly linked to the intracochlear location of the source. For , the relationship between the component phase and the underlying place of origin is indirect. Just as with the early-latency CM components, the late-latency components are dominated by potentials produced by hair cells located in the tail region of the total BM excitation pattern. But the hair cells that generate the late potentials are responding not directly to the primary forward traveling wave but to a reverse traveling (OAE) wave that originates in the peak region of the BM excitation and its stapes reflection (Fig. 4). Thus, suppressing the reverse wave with a near-probe saturating tone diminishes the long-latency CM through this indirect route. Furthermore, the model predicts that the long-latency CM, but not the middle component, should be also suppressible by remote saturating tones, similar to the early-latency CM. Unfortunately, the data at hand obtained with both remote and near-probe saturating tones (Fig. 8) suffer from too poor resolution in the latency domain to clearly observe the predicted patterns.
Model Limitations
We developed our model of CM generation with the goal of explaining the various components seen in the data rather than providing a complete account of the many complexities involved in CM generation. The model ignores the complicated electroanatomy of the spiral cochlea and its possible implications for round-window responses (e.g., Dallos 1984; Chertoff et al. 2012; Ayat et al. 2015). For instance, the spiral shape of the cochlea may affect the effective electrical attenuation factor, which may not be an entirely monotonic function of longitudinal distance (Chertoff et al. 2012). Furthermore, the model assumes a simple proportionality between the local CM response and BM displacement. Although this proportionality is well documented in the tail region of the traveling wave, cochlear potentials near the CF place appear more in phase with BM velocity (Dong and Olson 2013). As a result, an abrupt ¼ cycle relative phase shift occurs in local CM responses near the boundary between the tip and tail. Our simulations are robust against inclusion of this phase shift, but we cannot exclude the possibility that abrupt phase shifts may have important effects on CM summation patterns in more realistic models. Lastly, the majority of our simulations have assumed a linear cochlea. Although inclusion of a saturating transducer function in the model allows qualitative simulation of nonlinear two-tone interactions in CM responses, our implementation does not capture the full, mutually dependent relationship between active BM responses and hair-cell transduction. For example, the nonlinearities measured in cochlear potentials do not simply mirror those seen in BM vibrations (Dong and Olson 2013); these differences may well affect the degree of suppression seen in the various CM components. Despite these limitations, our simple model of CM generation reproduces the trends in the data reasonably well; additional data that reveal discrepancies with model predictions may provide motivation for the development of more comprehensive models.
Summary
This study demonstrates that chinchilla round-window CM measured in response to low-level tones exhibits an idiosyncratic pattern of spectral ripples in magnitude and phase-gradient delay (Fig. 1, black). We show that the spectral ripples are reduced both in the presence of a near-probe saturating tone (Fig. 1, red) and after removing delayed CM components in the latency domain (Fig. 3, black). These results are consistent with the hypothesis that spectral ripples are caused by the interference between CM components with shallow and steep phase gradients. Further dissection of the delayed CM in the latency domain reveals middle- and long-latency components, CMML and CMLL(Fig. 3). Although both delayed components are diminished in the presence of the near-probe saturating tone (Fig. 2), consistent with their origin in the active peak region, they depend differently on probe frequency: whereas CMML levels increase with frequency, CMLL levels show the opposite pattern (Fig. 3a).
By extending the classical model of CM generation to include the effects of cochlear irregularities—both mechanical and electrical—we predict two additional mechanisms of CM generation. These two mechanisms give rise to middle- and late-latency CM components (Figs. 4 and 5) that mix with the early-latency CM at the RW, producing spectral ripples (Fig. 6). In the model, the middle-latency components originate via the novel mechanism of coherent summation of CM sources near the CF place, while the late-latency components arise through the internal reflection of cochlear traveling waves that produces SFOAEs. Both mechanisms require that the BM traveling wave interacts with some sort of cochlear roughness or irregularity distributed along the partition (Zweig and Shera 1995). The model suggests that, unlike the early-latency components, the delayed CM components depend on cochlear responses in the active near-CF region of the probe response and could therefore be utilized as place-specific assays of cochlear function. Despite its simplicity, the model replicates the trends in the data remarkably well (e.g., compare Figs. 1 and 3 and Figs. 5 and 6; also Fig. 7). While our results reveal the existence of additional, previously unnoticed CM components, our data do not contradict the notion that the round-window CM is usually dominated by cellular sources located within the tail region of the BM excitation pattern, particularly at high stimulus levels (Fig. 8).
Acknowledgments
This study was supported by NIH grants R01 DC00419 (to M. Ruggero) and R01 DC003687 (to C.A. Shera) and by Northwestern University.
Contributor Information
Karolina K. Charaziak, Phone: (323) 865 1791, Email: charazia@usc.edu
Jonathan H. Siegel, Email: j-siegel@northwestern.edu
Christopher A. Shera, Email: christopher.shera@gmail.com
References
- Ayat M, Teal PD, Searchfield GD, Razali N (2015) Cochlear microphonic broad tuning curves. AIP Conference Proceedings 1703:030010
- Charaziak KK, Siegel JH. Tuning of SFOAEs evoked by low-frequency tones is not compatible with localized emission generation. J Assoc Res Otolaryngol. 2015;16:317–329. doi: 10.1007/s10162-015-0513-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Charaziak KK, Shera CA, Siegel JH. Using cochlear microphonic potentials to localize peripheral hearing loss. Front Neurosci. 2017;11:169. doi: 10.3389/fnins.2017.00169. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheatham MA, Naik K, Dallos P. Using the cochlear microphonic as a tool to evaluate cochlear function in mouse models of hearing. J Assoc Res Otolaryngol. 2011;12:113–125. doi: 10.1007/s10162-010-0240-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chertoff ME, Earl BR, Diaz FJ, Sorensen JL. Analysis of the cochlear microphonic to a low-frequency tone embedded in filtered noise. J Acoust Soc Am. 2012;132:3351–3362. doi: 10.1121/1.4757746. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chertoff ME, Earl BR, Diaz FJ, Sorensen JL, Thomas ML, Kamerer AM, Peppi M. Predicting the location of missing outer hair cells using the electrical signal recorded at the round window. J Acoust Soc Am. 2014;136:1212–1224. doi: 10.1121/1.4890641. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dallos P. The auditory periphery: biophysics and physiology. New York: Academic; 1973. [Google Scholar]
- Dallos P. Some electrical circuit properties of the organ of Corti. II. Analysis including reactive elements. Hear Res. 1984;14:281–291. doi: 10.1016/0378-5955(84)90055-8. [DOI] [PubMed] [Google Scholar]
- Dallos P, Cheatham MA. Production of cochlear potentials by inner and outer hair cells. J Acoust Soc Am. 1976;60:510–512. doi: 10.1121/1.381086. [DOI] [PubMed] [Google Scholar]
- Dallos P, Cheatham MA, Ferraro J. Cochlear mechanics, nonlinearities, and cochlear potentials. J Acoust Soc Am. 1974;55:597–605. doi: 10.1121/1.1914570. [DOI] [PubMed] [Google Scholar]
- Dhar S, Talmadge CL, Long GR, Tubis A. Multiple internal reflections in the cochlea and their effect on DPOAE fine structure. J Acoust Soc Am. 2002;112:2882–2897. doi: 10.1121/1.1516757. [DOI] [PubMed] [Google Scholar]
- Dong W, Olson ES. Detection of cochlear amplification and its activation. Biophys J. 2013;105:1067–1078. doi: 10.1016/j.bpj.2013.06.049. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Elgueda D, Delano PH, Robles L. Effects of electrical stimulation of olivocochlear fibers in cochlear potentials in the chinchilla. J Assoc Res Otolaryngol. 2011;12:317–327. doi: 10.1007/s10162-011-0260-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fitzgerald JJ, Robertson D, Johnstone BM. Effects of intra-cochlear perfusion of salicylates on cochlear microphonic and other auditory responses in the guinea pig. Hear Res. 1993;67:147–156. doi: 10.1016/0378-5955(93)90242-s. [DOI] [PubMed] [Google Scholar]
- Fridberger A, Zheng J, Parthasarathi A, Ren T, Nuttall A. Loud sound-induced changes in cochlear mechanics. J Neurophys. 2002;88:2341–2348. doi: 10.1152/jn.00192.2002. [DOI] [PubMed] [Google Scholar]
- Fridberger A, de Monvel JB, Zheng J, Hu N, Zou Y, Ren T, Nuttall A. Organ of Corti potentials and the motion of the basilar membrane. J Neurosci. 2004;24:10057–10063. doi: 10.1523/JNEUROSCI.2711-04.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Geisler CD, Yates GK, Patuzzi RB, Johnstone BM. Saturation of outer hair cell receptor currents causes two-tone suppression. Hear Res. 1990;44:241–256. doi: 10.1016/0378-5955(90)90084-3. [DOI] [PubMed] [Google Scholar]
- Gifford ML, Guinan JJ., Jr Effects of electrical stimulation of medial olivocochlear neurons on ipsilateral and contralateral cochlear responses. Hear Res. 1987;29:179–194. doi: 10.1016/0378-5955(87)90166-3. [DOI] [PubMed] [Google Scholar]
- He DZ, Jia S, Dallos P. Mechanoelectrical transduction of adult outer hair cells studied in a gerbil hemicochlea. Nature. 2004;429:766–770. doi: 10.1038/nature02591. [DOI] [PubMed] [Google Scholar]
- He W, Porsov E, Kemp D, Nuttall AL, Ren T. The group delay and suppression pattern of the cochlear microphonic potential recorded at the round window. PLoS One. 2012;7:e34356. doi: 10.1371/journal.pone.0034356. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Henry KR. Auditory nerve neurophonic recorded from the round window of the Mongolian gerbil. Hear Res. 1995;90:176–184. doi: 10.1016/0378-5955(95)00162-6. [DOI] [PubMed] [Google Scholar]
- Johnson DH. The relationship between spike rate and synchrony in responses of auditory-nerve fibers to single tones. J Acoust Soc Am. 1980;68:1115–1122. doi: 10.1121/1.384982. [DOI] [PubMed] [Google Scholar]
- Kalluri R, Shera CA. Distortion-product source unmixing: a test of the two-mechanism model for DPOAE generation. J Acoust Soc Am. 2001;109:622–637. doi: 10.1121/1.1334597. [DOI] [PubMed] [Google Scholar]
- Kemp DT. Otoacoustic emissions, travelling waves and cochlear mechanisms. Hear Res. 1986;22:95–104. doi: 10.1016/0378-5955(86)90087-0. [DOI] [PubMed] [Google Scholar]
- Kemp DT, Brown AM. An integrated view of cochlear mechanical nonlinearities observable from the ear canal. In: de Boer E, Viergever MA, editors. Mechanics of hearing. Dordrecht: Springer; 1983. pp. 75–82. [Google Scholar]
- Kemp DT, Chum RA. Observations on the generator mechanism of stimulus frequency acoustic emissions—two tone suppression. In: de Boer E, Viergever MA, editors. Psychophysical, physiological and behavioral studies in hearing. Delft: Delft University Press; 1980. pp. 34–41. [Google Scholar]
- Knight RD, Kemp DT. Wave and place fixed DPOAE maps of the human ear. J Acoust Soc Am. 2001;109:1513–1525. doi: 10.1121/1.1354197. [DOI] [PubMed] [Google Scholar]
- Lichtenhan JT, Cooper NP, Guinan JJ., Jr A new auditory threshold estimation technique for low frequencies: proof of concept. Ear Hear. 2013;34:42–51. doi: 10.1097/AUD.0b013e31825f9bd3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Moleti A, Longo F, Sisto R. Time-frequency domain filtering of evoked otoacoustic emissions. J Acoust Soc Am. 2012;132:2455–2467. doi: 10.1121/1.4751537. [DOI] [PubMed] [Google Scholar]
- Muller M, Hoidis S, Smolders JW. A physiological frequency-position map of the chinchilla cochlea. Hear Res. 2010;268:184–193. doi: 10.1016/j.heares.2010.05.021. [DOI] [PubMed] [Google Scholar]
- Neely S, Liu Z (2011) EMAV: otoacoustic emission averager. In: Technical memorandum. Omaha: Boys Town National Research Hospital
- Nuttall AL, Dolan DF. Cochlear microphonic enhancement in two tone interactions. Hear Res. 1991;51:235–245. doi: 10.1016/0378-5955(91)90040-g. [DOI] [PubMed] [Google Scholar]
- Patuzzi RB. A model of the generation of the cochlear microphonic with nonlinear hair cell transduction and nonlinear basilar membrane mechanics. Hear Res. 1987;30:73–82. doi: 10.1016/0378-5955(87)90185-7. [DOI] [PubMed] [Google Scholar]
- Patuzzi RB, Yates GK, Johnstone BM. The origin of the low-frequency microphonic in the first cochlear turn of guinea-pig. Hear Res. 1989;39:177–188. doi: 10.1016/0378-5955(89)90089-0. [DOI] [PubMed] [Google Scholar]
- Ponton CW, Don M, Eggermont JJ. Place-specific derived cochlear microphonics from human ears. Scand Audiol. 1992;21:131–141. doi: 10.3109/01050399209045993. [DOI] [PubMed] [Google Scholar]
- Rhode WS. Mutual suppression in the 6 kHz region of sensitive chinchilla cochleae. J Acoust Soc Am. 2007;121:2805–2818. doi: 10.1121/1.2718398. [DOI] [PubMed] [Google Scholar]
- Rhode WS. Basilar membrane mechanics in the 6–9 kHz region of sensitive chinchilla cochleae. J Acoust Soc Am. 2007;121:2792–2804. doi: 10.1121/1.2718397. [DOI] [PubMed] [Google Scholar]
- Robles L, Ruggero MA. Mechanics of the mammalian cochlea. Physiol Rev. 2001;81:1305–1352. doi: 10.1152/physrev.2001.81.3.1305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robles L, Temchin AN, Fan Y-H, Ruggero MA. Stapes vibration in the chinchilla middle ear: relation to behavioral and auditory-nerve thresholds. J Assoc Res Otolaryngol. 2015;16:447–457. doi: 10.1007/s10162-015-0524-x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ruggero MA, Rich NC, Recio A, Narayan SS, Robles L. Basilar-membrane responses to tones at the base of the chinchilla cochlea. J Acoust Soc Am. 1997;101:2151–2163. doi: 10.1121/1.418265. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ruggero MA, Narayan SS, Temchin AN, Recio A. Mechanical bases of frequency tuning and neural excitation at the base of the cochlea: comparison of basilar-membrane vibrations and auditory-nerve-fiber responses in chinchilla. Proc Natl Acad Sci U S A. 2000;97:11744–11750. doi: 10.1073/pnas.97.22.11744. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Russell IJ. Cochlear receptor potentials. In: Albright TD, Albright TD, Masland RH, Dallos P, Oertel D, Firestein S, Beauchamp GK, Bushnell MC, Basbaum AI, Kaas JH, Gardner EP, editors. The senses: a comprehensive reference. New York: Academic; 2008. pp. 319–358. [Google Scholar]
- Shera CA, Cooper NP. Basilar-membrane interference patterns from multiple internal reflection of cochlear traveling waves. J Acoust Soc Am. 2013;133:2224–2239. doi: 10.1121/1.4792129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shera CA, Guinan JJ. Mechanisms of mammalian otoacoustic emission. In: Manley GA, Fay RR, Popper AN, editors. Active processes and otoacoustic emissions in hearing. New York: Springer; 2008. pp. 305–342. [Google Scholar]
- Shera CA, Guinan JJ., Jr Evoked otoacoustic emissions arise by two fundamentally different mechanisms: a taxonomy for mammalian OAEs. J Acoust Soc Am. 1999;105:782–798. doi: 10.1121/1.426948. [DOI] [PubMed] [Google Scholar]
- Shera CA, Guinan JJ., Jr Stimulus-frequency-emission group delay: a test of coherent reflection filtering and a window on cochlear tuning. J Acoust Soc Am. 2003;113:2762–2772. doi: 10.1121/1.1557211. [DOI] [PubMed] [Google Scholar]
- Shera CA, Zweig G. Noninvasive measurement of the cochlear traveling-wave ratio. J Acoust Soc Am. 1993;93:3333–3352. doi: 10.1121/1.405717. [DOI] [PubMed] [Google Scholar]
- Shera CA, Tubis A, Talmadge CL. Testing coherent reflection in chinchilla: auditory-nerve responses predict stimulus-frequency emissions. J Acoust Soc Am. 2008;124:381–395. doi: 10.1121/1.2917805. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shera CA, Guinan JJ, Jr, Oxenham AJ. Otoacoustic estimation of cochlear tuning: validation in the chinchilla. J Assoc Res Otolaryngol. 2010;11:343–365. doi: 10.1007/s10162-010-0217-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Siegel JH. Calibration of otoacoustic emission probes. In: Robinette MS, Glattke TJ, editors. Otoacoustic emissions: clinical applications. 3. New York: Thieme; 2007. pp. 403–429. [Google Scholar]
- Siegel JH, Cerka AJ, Recio-Spinoso A, Temchin AN, van Dijk P, Ruggero MA. Delays of stimulus-frequency otoacoustic emissions and cochlear vibrations contradict the theory of coherent reflection filtering. J Acoust Soc Am. 2005;118:2434–2443. doi: 10.1121/1.2005867. [DOI] [PubMed] [Google Scholar]
- Siegel JH, Charaziak K, Cheatham MA. Transient- and tone-evoked otoacoustic emissions in three species. In: Shera C, Olson E, editors. What fire is in mine ears: progress in auditory biomechanics. New York: Springer; 2011. pp. 307–314. [Google Scholar]
- Slama MC, Ravicz ME, Rosowski JJ. Middle ear function and cochlear input impedance in chinchilla. J Acoust Soc Am. 2010;127:1397–1410. doi: 10.1121/1.3279830. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Songer JE, Rosowski JJ. The effect of superior canal dehiscence on cochlear potential in response to air-conducted stimuli in chinchilla. Hear Res. 2005;210:53–62. doi: 10.1016/j.heares.2005.07.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Verschooten E, Joris PX. Estimation of neural phase locking from stimulus-evoked potentials. J Assoc Res Otolaryngol. 2014;15:767–787. doi: 10.1007/s10162-014-0465-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Weiss TF, Rose C. A comparison of synchronization filters in different auditory receptor organs. Hear Res. 1988;33:175–179. doi: 10.1016/0378-5955(88)90030-5. [DOI] [PubMed] [Google Scholar]
- Wever EG, Bray CW. Action currents in the auditory nerve in response to acoustic stimulation. Proc Natl Acad Sci U S A. 1930;16:344–350. doi: 10.1073/pnas.16.5.344. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zweig G, Shera CA. The origin of periodicity in the spectrum of evoked otoacoustic emissions. J Acoust Soc Am. 1995;98:2018–2047. doi: 10.1121/1.413320. [DOI] [PubMed] [Google Scholar]
- Zwicker E, Schloth E. Interrelation of different oto-acoustic emissions. J Acoust Soc Am. 1984;75:1148–1154. doi: 10.1121/1.390763. [DOI] [PubMed] [Google Scholar]