Skip to main content
JARO: Journal of the Association for Research in Otolaryngology logoLink to JARO: Journal of the Association for Research in Otolaryngology
. 2014 Sep 23;15(6):915–931. doi: 10.1007/s10162-014-0484-6

The Effect of Stimulus Bandwidth on the Nonlinear-Derived Tone-Burst-Evoked Otoacoustic Emission

James D Lewis 1,, Shawn S Goodman 2
PMCID: PMC4389961  PMID: 25245497

Abstract

Intermodulation distortion has been hypothesized as a mechanism contributing to the generation of short-latency (SL) components in the transient-evoked otoacoustic emission (TEOAE). Presumably, nonlinear interactions between the frequency components within the evoking stimulus induce cochlear distortion products, which mix in the cochlea and ear canal with reflected energy from each stimulus-frequency’s tonotopic place. The mixing of these different components is evidenced in the bandpass-filtered emission waveform as a series of different latency peaks. The current study tested the hypothesis that intermodulation distortion, induced within the spectral bandwidth of the evoking stimulus, is the primary mechanism through which the SL components are generated. The nonlinear-derived tone-burst-evoked OAE (TBOAEnl) was evoked using 2-kHz tone bursts with durations of 3, 6, 12, and 24 cycles. As tone burst duration doubled, the spectral bandwidth was halved. It was hypothesized that contributions to the TBOAEnl from SL components would decrease as tone burst duration increased and spectral bandwidth decreased, if the SL components were generated through intermodulation distortion. Despite differences in spectral bandwidth between the evoking stimuli, the latencies and magnitudes of the different latency components between the 3- and 6-cycle TBOAEnl were comparable. The 12- and 24-cycle TBOAEnl envelopes were characteristic of destructive phase interactions between different latency components overlapping in time. The different latency components in the 3- and 6-cycle TBOAEnl introduced a characteristic level dependency to TBOAEnl magnitude and latency when analyzed across a broad time window spanning the different components. A similar dependency described the 12- and 24-cycle TBOAEnl input/output and latency-intensity functions, suggesting that the SL components evident in the shorter-duration TBOAEnl equally contributed to the longer-duration TBOAEnl, despite reductions in spectral bandwidth. The similarity between the different TBOAEnl suggests that they share a common generation mechanism and casts doubt on intermodulation distortion as the generation mechanism of SL TEOAE components in humans.

Keywords: intermodulation distortion, different latency components, generation mechanism, nonlinear-derived, SFOAE, TEOAE

INTRODUCTION

The generation of transient-evoked otoacoustic emissions (TEOAEs) has been and continues to be a subject of debate. The broad bandwidth of the stimuli used to evoke these emissions (e.g., acoustic clicks) seemingly necessitates at least some contribution from intermodulation distortion (IMD). When presented to the cochlea, these stimuli cause broad excitation across the basilar membrane (BM; Don and Eggermont 1978; Recio et al. 1998). If nearby areas were excited simultaneously, they would be expected to induce distortion since the cochlea is a nonlinear system. However, the spatial time filtering imposed by the impedance gradient of the cochlea may limit such interactions.

In the guinea pig, IMD is thought to play a primary role in TEOAE generation (Avan et al. 1995; Withnell and Yates 1998; Yates and Withnell 1999; Withnell et al. 2000). Most recently, Withnell and McKinley (2005) reported that the nonlinear-derived TEOAE (TEOAEnl) includes both a short-latency (SL) portion with shallow phase gradient and a long-latency (LL) portion with steep phase gradient. The contribution of each portion to the total TEOAEnl changes with stimulus level, shifting primarily from the LL portion at low levels to the SL portion at higher levels. These observations led Withnell and McKinley to hypothesize that the dominant mechanism underlying TEOAEnl generation depended on stimulus level, with a place-fixed mechanism at low levels (i.e., linear coherent reflection, LCR; Zweig and Shera 1995; Shera and Guinan 1999) and a wave-fixed mechanism at higher levels (i.e., IMD).

In humans, multiple studies have demonstrated that the TEOAE (both linear- and nonlinear-derived) also includes SL and LL portions (or components), seemingly analogous to those in the guinea pig (Withnell et al. 2008; Goodman et al. 2009; Goodman et al. 2011; Moleti et al. 2012a). As such, IMD has been hypothesized as the underlying generation mechanism. However, SL components have also been observed in the stimulus-frequency (SF) OAE, with latencies and growth rates that approximate the SL TEOAE components (Sisto et al. 2013). The presence of these components in the SFOAE suggests that SL generation depends primarily on the stimulus energy at the frequency of the emission itself and not on interactions between stimulus frequencies remote from that of the emission. Thus, if the SL SFOAE and TEOAE components are analogous, generation through IMD due to nonlinear interactions between different frequencies within the transient stimulus’ bandwidth seems unlikely.

The current study examined the effect of tone burst duration and, therefore, spectral bandwidth on the nonlinear-derived 2-kHz band tone-burst-evoked OAE (TBOAEnl) to test the hypothesis that IMD is a primary contributor to SL OAEnl components evoked by stimuli with broad spectral bandwidths. The TBOAEnl was evoked by 2-kHz tone bursts with durations of 3, 6, 12, and 24 cycles (1.5, 3, 6, and 12 ms, respectively). Each doubling of tone burst duration resulted in a halving of spectral bandwidth, such that the tone burst changed from resembling a band-limited acoustic click at 3 cycles to a pure tone by 24 cycles. Each tone burst was presented across a range of levels. To facilitate comparison between the different TBOAEnl, tone burst stimuli were calibrated to account for level differences resulting from their different spectral bandwidths (Prieve et al. 1996; Kalluri and Shera 2007a). SL components were hypothesized to contribute to the 3-cycle TBOAEnl, possibly due to IMD resulting from the broad bandwidth of the 3-cycle tone burst. As tone burst duration increased and spectral bandwidth decreased, the contribution of SL components to the TBOAEnl was expected to decrease, if IMD was the primary generator.

METHODS

Subjects

Thirteen subjects (8 female, 5 male) between the ages of 18–29 years old participated in the study. All subjects had normal hearing thresholds (≤20 dB HL) at the octave frequencies between 0.25 and 8 kHz in the test ear (8 right ears, 5 left ears) and normal middle ear function (assessed through 226-Hz tympanometry). Data collection was completed over the course of a single 2-h visit. The University of Iowa Institutional Review Board approved the research protocol.

Signal Generation and Data Acquisition

Two-channel stimuli were digitally created at a 44.1-kHz sampling rate using custom-written MATLAB (The Mathworks, Inc.) software and a personal computer (PC) running the Windows (Microsoft, Inc.) operating system. Stimuli were routed from the PC to an external 24-bit soundcard (UltraLite-mk3 Hybrid, Mark of the Unicorn) through a USB 2.0 interface. The resulting two-channel electrical signal was directed through a pair of earphones (Sennheiser IE8) attached by silicone tubing to the receiver ports of an ER10B+ probe assembly (Etymotic Research, Inc.). The ER10B+ probe assembly was coupled to the ear canal by an ER10-14 foam eartip. The ear canal pressure responses were transduced into an electrical signal by a miniature microphone housed in the ER10B+ probe assembly. The microphone voltage was amplified 20 dB by the ER10B+ preamplifier, routed to the external soundcard, and digitized at a sampling rate of 44.1 kHz. The digital signal was stored on the PC hard drive for offline analysis. Signal presentation and data acquisition were controlled by MATLAB using custom-written software.

Measurement and Analysis of TBOAEnl

Stimuli

TBOAE was evoked using 2-kHz tone bursts with durations of 3, 6, 12, and 24 cycles (see Fig. 1). Tone bursts were generated in cosine phase and hann-windowed. Each doubling of duration resulted in a halving of the tone burst bandwidth. The bandwidths (defined by the low- and high-cut frequencies at −3 dB re 2 kHz) of the 3-, 6-, 12-, and 24-cycle tone bursts were 1.52–2.48 kHz (2 kHz ± 1/3 octave), 1.76–2.24 kHz (2 kHz ± 1/6 octave), 1.88–2.12 kHz (2 kHz ± 1/12 octave), and 1.94–2.06 kHz (2 kHz ± 1/24 octave), respectively.

FIG. 1.

FIG. 1

Time- and frequency-domain representations of the 2-kHz tone burst stimuli. The left panel shows the waveforms for the 3-, 6-, 12-, and 24-cycle tone bursts. The right panel shows the corresponding spectra. Line thickness is used to show increasing number of cycles in the tone burst.

Calibration

Tone bursts were presented at eight levels (6-dB steps). To equate the stimulus levels across the different duration tone bursts (Prieve et al. 1996; Kalluri and Shera 2007a), stimulus levels were specified in terms of the level-per-cycle magnitude (dB re 1-Hz bandwidth or dB/Hz) at 2 kHz. The different duration tone bursts were first designed to have peak amplitudes of 1 (electrical units). Tone bursts were then transformed to the frequency domain via the fast Fourier transform (FFT). The size of the FFT depended on the length of the tone burst; therefore, the number and width of the frequency bins were different for each tone burst stimulus. For a given tone burst duration, the magnitude within each frequency bin was divided by the bandwidth of the bin. The magnitude spectra of the different tone bursts were compared, and scaling factors were derived relating the 2-kHz magnitude (dB/Hz) of the 3-, 6-, and 12-cycle tone bursts to that of the 24-cycle tone burst. When this was done, the scaled tone bursts had peak sound pressure levels (pSPL) of 44–86 dB pSPL (3 cycles; 6-dB steps), 41–83 dB pSPL (6 cycles), 38–80 dB pSPL (12 cycles), and 35–77 dB pSPL (24 cycles), in a calibration cavity (see below). When expressed in terms of 2-kHz level-per-cycle, these dB pSPL levels corresponded to 7–49 dB/Hz (6-dB steps).

The amplitudes of the tone bursts were calibrated in terms of the pressure generated in a long, plastic tube (2-m length, 7-mm inner diameter) terminated by a steel bearing. The long-tube calibration technique was chosen because it is insensitive to standing wave effects that are present in traditional in situ dB SPL ear canal calibrations (Sachs and Burkhard 1972; Stinson et al. 1982; Gilman and Dirks 1986). Additionally, the maximum voltage drive to the transducer is strictly controlled, and the possibility of inadvertently overdriving the transducers is eliminated. This technique has been described previously (Goodman et al. 2009). Briefly, the probe tip is placed into the long tube, and stimuli are presented and recorded repeatedly. The pressure recorded for each stimulus presentation includes the incident pressure delivered by the probe and multiple reflections resulting from the stimulus pressure traveling back and forth between the two ends of the tube. With each round trip, the amplitude of the reflected stimulus pressure is attenuated and eventually falls into the noise floor. In order to isolate the incident pressure delivered by the probe to the tube, it is necessary to use a short-duration stimulus and a slow stimulus repetition rate.

Sixty-four repetitions of a 10-ms, hann-windowed 2-kHz tone burst were delivered to the calibration tube at a rate of 1 per second and recorded. Recordings were highpass filtered using a finite impulse response (FIR) digital filter (250 Hz cut, 256 order). Only the initial 10 ms of each filtered recording was retained. The pressure measured in this time window described the incident pressure delivered by the transducer to the tube and was thus free from internal reflections occurring at the distal end of the tube and propagating back toward the microphone. The amplitudes of the electrical tone burst stimuli were adjusted to yield output levels (as measured in the long tube) at the levels described above.

Presentation

Stimuli were presented using an equal-level, double-evoked, double-source paradigm (Keefe 1998; Keefe and Ling 1998; Schairer et al. 2003). A stimulus buffer was composed of three consecutive stimulus intervals (s1, s2, and s1,2). Each interval was 40 ms long resulting in a total buffer duration of 120 ms. Stimulus interval s1 contained the tone burst (of level L1 where L1 = 7–49 dB/Hz SPL; 6-dB steps) and was routed through channel 1 of the transducer. Stimulus interval s2 contained the same tone burst as s1 (L2 = L1), but routed through channel 2 of the transducer. Tone bursts were presented simultaneously through channels 1 and 2 of the transducer for s1,2. The number of stimulus buffers presented depended on the tone burst level, with more presentations for lower level tone bursts to improve the signal-to-noise ratio (SNR) of the evoked emission upon averaging. Two thousand buffers were presented for levels between 7 and 19 dB/Hz SPL, 1,600 were presented for levels between 25 and 37 dB/Hz SPL, and 1,200 were presented for levels between 43 and 49 dB/Hz SPL.

Analysis

Ear canal pressure recordings for each stimulus buffer were divided into three time segments (p1, p2, and p1,2) corresponding to the three stimulus intervals (s1, s2, and s1,2). Each recording was the sum of pressure waves associated with the stimulus, physiological and environmental noise, and the OAE. The nonlinear differential pressure (pD) was subsequently calculated

pD=p1+p2p1,2. 1

The primary advantage of the double-evoked extraction is that pD is free from stimulus contamination, assuming transducer linearity. Stimulus contamination is especially problematic in OAE measurements where (1) the frequency of the stimulus and emission is identical and (2) the duration of the stimulus is sufficiently long (whether by design or due to artifact) that the stimulus and emission overlap in time. Both of these conditions applied to the current study since (1) the evoking stimuli were 2-kHz tone bursts and the 2-kHz band of the emission was analyzed and (2) the durations of the tone bursts extended from 1.5 to 12 ms and overlapped with the expected latency range of the 2-kHz band TBOAE (approximately 3–12 ms, depending on stimulus level; Tognola et al. 1997; Sisto and Moleti 2007; Rasetshwane et al. 2013).

A consequence of the double-evoked extraction is that the extracted OAE pressure will not always accurately describe the total OAE pressure evoked by the s1 stimulus (p1OAE). In other words, only a portion of the evoked OAE is extracted (Schairer et al. 2003; Kalluri and Shera 2007b). Theoretically, specifying the level of the s2 stimulus (L2) so that it corresponds to the saturating portion of the OAE input-output (I/O) function will minimize inaccuracy in the extraction. However, specifying L2 to correspond to the saturating portion of the I/O function is not always feasible as a high L2 may result in excessive nonlinear transducer distortion and/or high physiological noise levels relative to the noise levels associated with the lower L1 (Schairer et al. 2003). Additionally, it is typically not immediately obvious as to the L2 associated with saturation of the I/O function. This is especially true for the different latency components in TEOAEs, as they exhibit different growth rates and level dependencies such that the L2 associated with saturation for an LL component may not necessarily result in saturation for an SL component (Moleti et al. 2012a).

Preliminary data for the study was collected with L2 set 12 dB higher than L1. However, this paradigm precluded measurement of the early time portion of 3-cycle TBOAE at high stimulus levels as noncanceling stimulus artifact generated by the s2 tone burst contaminated this part of the recording. Since IMD is thought to contribute to the early time portion of the emission at high stimulus levels, it was important to retain as much of this portion of the emission at the highest possible stimulus levels. Subsequent data collected using an equal-level paradigm (i.e., L2 = L1) permitted measurement of earliest portions of the TBOAEnl across a wider range of L1 that extended to higher stimulus levels. Therefore, the equal-level paradigm was used for the current study.

To reduce filter artifact, the initial and final 5 ms of pD were ramped on and off, respectively, using a 1/2-cycle Hann function. The windowed responses were then highpass filtered using an FIR digital filter (250 Hz cut, 128 order). The root-mean-square (RMS) levels of the filtered responses were calculated and subjected to an artifact rejection algorithm (Goodman et al. 2009; see Hoaglin et al. 1983) in order to identify recordings contaminated by high levels of intermittent noise. Briefly, the first and third quartiles of the RMS levels were calculated, and the interquartile range was derived. Recordings with an RMS exceeding the third quartile by more than 1.5 times the interquartile range were excluded from further analysis.

Retained responses were filtered using a 1/3 octave bandpass FIR digital filter (2-kHz center frequency, 512 order). The K filtered responses were synchronously averaged in the time domain to yield an estimate of the nonlinear (nl)-derived 2-kHz band TBOAE (p¯OAEnln, n indicates the sample number),

p¯OAEnln=1Kk=1KpDnk. 2

The analytic expression of p¯OAEnl (p¯^OAEnl) was calculated using the discrete Hilbert transform (denoted by ℋ),

p¯^OAEnln=p¯OAEnln+jHp¯OAEnln. 3

From the analytic signal, the instantaneous magnitude (p¯^OAEnl), or envelope, was calculated as

p¯^OAEnln=Rep¯^OAEnln2+Imp¯^OAEnln2. 4

An estimate of the time-domain noise was calculated as the standard error of the mean of the instantaneous magnitude. First, the analytic signal of each individual nonlinear differential response was computed,

p^Dnk=pDnk+jHpDnk. 5

The mean instantaneous magnitude (p¯^D) was subsequently calculated,

p¯^Dn=1Kk=1Kp^Dnk. 6

and the noise was computed as the standard error of the mean (SEM),

SEMn=1K1K1k=1Kp^Dnkp^¯Dn2 7

The TBOAEnl signal and noise estimates were constrained to an analysis window extending from stimulus onset to stimulus offset, plus an additional 10 ms. This resulted in different duration analysis windows for each tone burst stimulus (3 cycles: 11.5 ms, 6 cycles: 13 ms, 12 cycles: 16 ms, 24 cycles: 22 ms). The 10-ms post-stimulus offset was chosen as it approximates the longest expected latency for a low-level 2-kHz reflection source OAE (Neely et al. 1988; Tognola et al. 1997; Shera and Guinan 2003; Sisto et al. 2007). A time vector was mapped to each analysis window with time 0 corresponding to the peak of the tone burst stimulus.

TBOAEnl magnitude (LOAEnl) was calculated as the RMS of the weighted TBOAE envelope within the analysis window,

LOAEnl=20log101Nn=0N1wnp¯^OAEnln2/0.00002. 8

The weighting function (w) applied a binary weight to each sample (n) of the windowed TBOAEnl and depended on the instantaneous SNR,

SNRn=20log10p¯^OAEnlnSEMn/0.00002, 9

according to

wn=1forSNRn110forSNRn<11. 10

Additionally, in order for a sample with an SNR >11 to receive a weighting of 1, the adjacent 21 samples (corresponding to a full cycle of a 2-kHz wave) also had to have an SNR >11. This constraint minimized contributions from samples that were more likely noise than OAE energy. The 11-dB SNR criterion was established by calculating the SNR at which the probability of a false positive (classifying noise as signal) was less than 5 %. Criteria were initially established for each stimulus level as the number of averages (K) across levels were not constant (K = 1,200, 1,600, or 2,000). K, 1-s long, randomly generated number sequences were created using MATLAB’s normally distributed pseudorandom number generator function. Sequences were filtered using the same highpass and bandpass filters that were applied to the ear canal pressure recordings. The mean instantaneous magnitude and standard error of the mean were calculated (see Eqs. 4 and 7). The former represented an estimate of the signal and the later represented an estimate of the noise. The empirical distribution function (EDF) for each value of K was determined and fit using a cubic spline interpolant. The SNR corresponding to a false-positive rate of 5 % was similar across all values of K and occurred between 10.8 and 11.06 dB SNR. Consequently, a criterion of 11 dB SNR was chosen and used for all stimulus levels.

Compared to calculating TBOAEnl magnitude from the time-domain waveform (as is traditionally done), magnitude defined from the envelope is 3 dB larger. The origin of this difference is the inclusion of the imaginary part of the analytic signal when calculating the TBOAEnl envelope (see Eq. 4). Compared to the real part of the analytic signal, the imaginary part has the same amplitude but differs in phase by π/2 radians. Summing the squared real and imaginary parts of the analytic signal results in a doubling of intensity and, therefore, a 3-dB increase in the RMS of the TBOAEnl envelope compared to that of the TBOAEnl waveform.

TBOAEnl latency (τOAEnl) was also defined using the TBOAEnl envelope and weighting function,

τOAEnl=n=0N1wnp¯^OAEnln2tnn=0N1wnp¯^OAEnln2, 11

where t was the time vector for the analysis window (ms). This definition quantifies latency as the energy-weighted mean of the time vector and gives greatest weight to time indices corresponding to peaks in the TBOAEnl envelope, where the SNR is the largest. Attributing greatest weight to high-SNR regions of TBOAEnl reduces the variability of the OAE latency measure (Shera and Bergevin 2012).

RESULTS

TBOAEnl Envelopes

Figure 2 shows the TBOAEnl envelopes evoked by each tone burst stimulus for three representative subjects (arranged in columns). In each plot, the magnitudes of the envelopes have been normalized relative to the peak magnitude of the envelope evoked by the highest-level tone burst (49 dB/Hz SPL). Similarly, time has been specified relative to the peak of the evoking tone burst (resulting in negative time values). The TBOAEnl envelopes for the 3-cycle tone bursts (top row) typically exhibited three different latency peaks (P1, P2, and P3). At the lowest stimulus levels, only the longest-latency peak (P1) was present above the noise floor. As stimulus level increased, several earlier, less-compressive growing magnitude peaks emerged (e.g., P2 and P3). The differential growth rates between the peaks often resulted in the largest magnitude peak shifting from the longest-latency peak at low stimulus levels to the shorter-latency peaks at high stimulus levels. The temporal location of each magnitude peak remained nearly constant with stimulus level.

FIG. 2.

FIG. 2

TBOAEnl envelopes from 3 representative subjects. Columns show envelopes for an individual subject, and rows show envelopes for a given tone burst duration (row 1/top row—3 cycles, row 2–6 cycles, row 3–12 cycles; row 4/bottom row—24 cycles). Increasing line thickness shows increasing tone burst level. Arrows indicate the different latency peaks identified in the 3-cycle TBOAEnl. The shaded gray region indicates the noise floor.

The 6-cycle TBOAEnl envelopes (second row) resembled those for the 3-cycle tone burst, although they spanned a wider time range as a consequence of the increased duration of the evoking stimulus. These envelopes were also characterized by a series of different latency magnitude peaks. Three peaks were generally apparent in the 6-cycle TBOAEnl envelopes, with only the longest-latency peak present at low stimulus levels and less-compressive growing shorter-latency peaks emerging at higher stimulus levels. The less-compressive growth of the earlier peaks resulted in their magnitudes exceeding that of the longest-latency peak at the highest stimulus levels. The temporal locations of the peaks were similar to the peaks in the 3-cycle TBOAEnl envelopes. The 6-cycle TBOAEnl peaks were generally more poorly resolved in time than those of the 3-cycle tone burst. For instance, in subject 137R (middle column), the P2 and P3 peaks that were evident for the 3-cycle TBOAEnl merged into a single peak in the 6-cycle TBOAEnl. The envelopes for the other subjects showed a similar decrease in the resolution of the shorter-latency peaks.

Compared to the shorter-duration tone bursts, the 12-cycle TBOAEnl envelopes (third row) were more dispersed across time, consistent with the longer duration of the evoking stimulus. Despite the narrower bandwidth of the 12-cycle tone burst, the TBOAEnl exhibited characteristics similar to the 3- and 6-cycle TBOAEnl. First, only a longer-latency peak was observed at the lowest stimulus levels. Second, at least one shorter-latency peak emerged as stimulus level increased. Third, the shorter-latency peak(s) grew less compressively than the longer-latency peak. And fourth, the largest magnitude peak shifted from the longer-latency peak at low stimulus levels to the shorter-latency peak(s) at high stimulus levels. Multiple peaks occurring earlier in time than the longest-latency peak were not typically evident in the 12-cycle TBOAEnl envelopes. Rather, a single peak was observed across the same latency range where two seemingly distinct peaks were present in the 3- and 6-cycle TBOAEnl envelopes (e.g., subjects 161L and 137R).

When the tone burst duration was increased to 24 cycles (fourth row), the TBOAEnl still exhibited characteristics consistent with contributions from earlier- and later-occurring TBOAEnl energy. As was observed in the TBOAEnl envelopes evoked by the shorter-duration stimuli, the peak with the largest magnitude shifted to a shorter latency as stimulus level increased. Moreover, these envelopes were characteristic of destructive phase interactions between at least two components that partially overlapped in time (Talmadge et al. 1999). For instance, the envelopes for subject 161L exhibited a sharp magnitude null between 6 and 10 ms (depending on stimulus level).

Different Latency TBOAEnl Components

The temporal resolution of the different latency magnitude peaks in the 3- and 6-cycle TBOAEnl envelopes allowed for these TBOAEnl to be divided into different latency components. First, the 3- and 6-cycle TBOAEnl instantaneous magnitudes (i.e., envelopes) were transformed to instantaneous power (by squaring the envelopes) and the mean power envelopes (averaged across stimulus levels) were calculated for each subject. Second, a peak-picking algorithm was used to identify the time indices of magnitude peaks and nulls from the mean envelopes. A “component” was defined as the portion of the TBOAEnl within a time window that spanned adjacent magnitude nulls. The magnitude and latency of each component were calculated according to Eqs. 8 and 11, respectively. However, for each component, the weighting function (i.e., w[n], Eq. 10) was adjusted such that all samples of the envelope falling outside of the temporal window of the component were set to 0. The left panel of Figure 3 provides an illustration of the different latency components identified from the 3-cycle TBOAEnl for subject 116R. Four components were identified for this subject (C1, C2, C3, and C4).

FIG. 3.

FIG. 3

Different latency components in the 3- and 6-cycle TBOAEnl. The left panel illustrates how different latency components (i.e., C1, C2, C3, and C4) were defined (data is shown for the 3-cycle TBOAEnl from subject 116R). Gray lines are the TBOAEnl envelopes evoked for different stimulus levels (denoted by line thickness). The black line is the mean envelope from which the time boundaries of the different latency components were defined (indicated by the dashed lines). The right panel plots the mean latencies (±1 standard deviation) of the different latency components identified in the 3- (circles) and 6-cycle (asterisks) TBOAEnl, for all subjects. In some cases, the standard deviations were sufficiently small that error bars are not apparent. The horizontal dashed line is at a latency of 6.03 ms and was the latency boundary chosen for separating SL from LL components.

The right panel of Figure 3 shows the mean latencies (and standard deviations) of all components identified for each subject, for the 3- (circles) and 6-cycle (asterisks) TBOAEnl. Between 2 and 5 components were identified across subjects for both the 3- and 6-cycle TBOAEnl. Most of the components had latency falling within a range from 1 to 10 ms. The latency of each component exhibited only small changes with stimulus level, as indicated by the width of the error bars. The approximate level invariance of different latency components has previously been observed in the TE and SFOAE (e.g., Goodman et al. 2011; Sisto et al. 2013). On average, a factor of 1.76 (standard deviation = 0.42) and 1.86 (0.56) separated the latencies of adjacent components in the 3- and 6-cycle TBOAEnl, respectively. This is similar to the factor of approximately 1.6 observed by Goodman et al. (2009) and Moleti et al. (2012a) between the SL and LL component in the TEOAE. Within each subject, both the number of components and each component’s latency for the 6-cycle TBOAEnl were similar to those for the 3-cycle TBOAEnl. For instance, 4 components were identified in both the 3- and 6-cycle TBOAEnl for subject 122R; the 3-cycle TBOAEnl latencies were 1.51, 3.76, 5.73, and 8.22 compared to the 6-cycle TBOAEnl latencies of 1.36, 3.57, 5.73, and 8.24, respectively. In two subjects (113L and 137R), 3 components were identified in the 3-cycle TBOAEnl, but only 2 components were identified in the 6-cycle TBOAEnl. For these subjects, the two earliest-occurring components in the 3-cycle TBOAEnl have merged into a single component in the 6-cycle TBOAEnl (consistent with observations from Figure 2), as evidenced by the standard deviation of the 6-cycle component spanning the latencies of the two 3-cycle components.

For each subject, components with similar latencies between the 3- and 6-cycle TBOAEnl also exhibited similar I/O functions. Figure 4 plots the I/O functions of the different latency components in the 3- (top row of panels) and 6-cycle (bottom row of panels) TBOAEnl, measured in 3 representative subjects. Components are identified based on their relative latencies with C1 indicating the longest-latency component, C2 indicating the component earlier-in-time and adjacent to C1, and C3 indicating the component earlier-in-time and adjacent to C2. In all cases, the component with the largest magnitude shifted from C1 at low-to-moderate stimulus levels to C2 at higher levels and, in certain cases (subject 134R), to C3 at the highest levels. This observed shift was a consequence of the less-compressive growth of the earlier-occurring components.

FIG. 4.

FIG. 4

3- and 6-cycle TBOAEnl component I/O functions for three representative subjects (161L, 134R, and 166R). The top and bottom row of panels show I/O functions for the 3- and 6-cycle TBOAEnl, respectively. Each column shows data for a single subject. Line style denotes the component where C1 corresponds to the longest-latency component, C2 corresponds to the component earlier-in-time and adjacent to C1, and C3 (when present) corresponds to the component earlier-in-time and adjacent to C2.

To avoid a somewhat arbitrary grouping and across-subject analysis of the different latency components based on their label (i.e., C1 and C2), components were instead grouped according to their latency relative to that expected for a 2 kHz OAE generated through LCR near the tonotopic place. From low-level SFOAE latency data reported in Shera and Guinan (2003; Table 1), a 2-kHz emission latency of less than 6.03 ms is expected only 5 % of the time. This criterion was adopted to separate the different latency components into either SL or LL categories (indicated by the horizontal broken line in the right panel of Fig. 3). The magnitudes (μPa) and latencies of all components categorized as SL components were averaged across subjects at each stimulus level to quantify the level dependence of SL TBOAEnl magnitude and latency. The same was done with all components categorized as LL components to quantify the level dependence of the LL TBOAEnl. Group data (I/O, growth rate, and latency-intensity functions) for the 3- and 6-cycle SL and LL TBOAEnl are presented in the top and bottom rows of panels, respectively, in Figure 5.

FIG. 5.

FIG. 5

Mean 3- and 6-cycle SL, LL, and total TBOAEnl I/O, growth rate, and latency-intensity functions. The top and bottom row of panels show data for the 3- and 6-cycle TBOAEnl, respectively. I/O functions are plotted in the left column, growth rate functions are plotted in the middle column, and latency-intensity functions are plotted in the right column. Error bars indicate the standard error. Line style denotes the SL, LL, and total TBOAEnl.

The top left panel of Figure 5 plots the mean I/O functions (across subjects; error bars indicate the standard error) for the 3-cycle SL and LL TBOAEnl. At low stimulus levels, the LL TBOAEnl had the largest magnitude; however, as stimulus level increased, the magnitude of the SL TBOAEnl eventually exceeded that of the LL TBOAEnl. The top middle panel of Figure 5 plots the mean 3-cycle SL and LL TBOAEnl growth rates as a function of stimulus level. Growth rates were calculated by differentiating the I/O functions. Both the LL and SL magnitudes grew increasingly compressively as stimulus level increased: The growth rate of the LL TBOAEnl decreased from 0.60 to 0.06 dB/dB while the growth rate of the SL TBOAEnl decreased from 1.09 to 0.40 dB/dB.

The mean latencies of the 3-cycle SL and LL TBOAEnl are plotted as a function of stimulus level in the top right panel of Figure 5. The latency of the LL TBOAEnl was approximately independent of stimulus level. In contrast, the latency of the SL TBOAEnl decreased from 4.62 to 3.52 ms as stimulus level increased from 7 dB/Hz SPL to 49 dB/Hz SPL. The near level invariance of LL TBOAEnl latency mimics that observed in the individual components from Figure 3 and was presumably due to the fact that the LL TBOAEnl generally included only a single LL component for each subject (subject 16R is one exception to this as this subject had 3 LL components). On the other hand, 2–3 SL components were typically included in the calculation of SL TBOAEnl latency such that latency decreased as the distribution of TBOAEnl energy shifted to progressively earlier-occurring components with increasing stimulus level. The mean latency of the LL TBOAEnl across stimulus levels was 8.65 ms (standard deviation = 0.16) compared to 4.03 ms (±0.41) for the SL TBOAEnl.

The bottom row of panels in Figure 5 plots analogous data to those data plotted in the top row, but for the 6-cycle TBOAEnl. The I/O, growth rate, and latency-intensity functions for the 6-cycle SL and LL components resembled those for the 3-cycle SL and LL components. Growth rates for the 6-cycle LL TBOAEnl decreased from 0.30 to −0.04 dB/dB across stimulus levels while growth rates for the SL TBOAEnl decreased from 0.82 to 0.45 dB/dB. The mean latencies across stimulus levels of the LL and SL TBOAEnl were 8.34 (±0.13) and 3.92 ms (±0.39), respectively.

Total TBOAEnl I/O and Latency-Intensity Functions

Overlaid on the component-specific data in Figure 5 are the mean data for the 3- and 6-cycle TBOAEnl analyzed across a time window spanning both the SL and LL components (thick solid lines). The TBOAEnl analyzed across this time window is hereafter referred to as the total TBOAEnl (or simply the TBOAEnl). Comparing the level dependence of the magnitude and latency of the total TBOAEnl to the SL and LL TBOAEnl illustrates how the latter influences the former. Compared to the SL and LL TBOAEnl (whether 3 or 6 cycles), the magnitude of the total TBOAEnl was larger (left panels), which was to be expected since the total TBOAEnl included the contributions from both SL and LL components. Similar to the magnitudes of the SL and LL TBOAEnl, the magnitude of the total TBOAEnl increased with stimulus level. Reflecting the changing contributions from the SL and LL portions, the growth rate of the total TBOAEnl (middle panels) approximated that of the LL TBOAEnl at the lowest stimulus levels (where this portion of the TBOAEnl was dominant) but was closer to the growth rate of the SL TBOAEnl at the highest stimulus levels (where this portion of the TBOAEnl was dominant).

In contrast to the latencies of the SL and LL TBOAEnl, which were roughly constant, the latency of the total TBOAEnl depended on stimulus level (right panels of Fig. 5). Latency decreased from 8.35 to 3.63 ms for the 3-cycle TBOAEnl and from 8.29 to 3.44 ms for the 6-cycle TBOAEnl as stimulus level increased from 7 to 49 dB/Hz. The observed decrease in total TBOAEnl latency was expected, given the I/O functions and latencies of the SL and LL TBOAEnl. Specifically, the long latency of the total TBOAEnl at low stimulus levels was consistent with the LL TBOAEnl being the primary contributor to the TBOAEnl. As stimulus level increased and the contribution from the SL TBOAEnl increased, total TBOAEnl latency decreased to approximate the latency of the SL TBOAEnl. Stover et al. (1996) demonstrated a similar phenomenon underlying the level dependency of DPOAE latency.

The poorer temporal resolution of the 12- and 24-cycle TBOAEnl envelopes precluded examination of potential contributions from SL and LL components and direct comparison with those in the 3- and 6-cycle TBOAEnl. However, the data from Figure 5 suggested that contributions from SL and LL components to the TBOAEnl might be deduced by analyzing the total TBOAEnl. Specifically, if different latency components contributed to the 12- and 24-cycle TBOAEnl to the same extent that they contributed to the 3- and 6-cycle TBOAEnl, then, the magnitudes, growth rates, and latencies of the 12- and 24-cycle total TBOAEnl should approximate those of the 3- and 6-cycle total TBOAEnl.

Figure 6 plots the 3-, 6-, 12-, and 24-cycle total TBOAEnl I/O functions for 6 representative subjects. TBOAEnl magnitudes of 3 and 6 cycles increased monotonically with stimulus level. For some subjects, the increase in TBOAEnl magnitude for a given increase in stimulus level varied across the I/O function. For instance, both 3- and 6-cycle TBOAEnl magnitudes for subjects 137R and 153L exhibited increasingly compressive growth through 25 dB/Hz SPL but grew less compressively at higher stimulus levels. In contrast to the 3- and 6-cycle TBOAEnl I/O functions, the I/O functions for the 12- and 24-cycle TBOAEnl occasionally exhibited nonmonotonic growth across certain stimulus levels. For instance, magnitude notches occur in the 12- and 24-cycle TBOAEnl I/O functions for subject 116R at 25 dB/Hz SPL and 31 dB/Hz SPL, respectively. Magnitude notches have previously been observed in measured SFOAEnl I/O functions (Schairer et al. 2003) and have been hypothesized as being indicative of destructive phase interactions between different components overlapping in time and/or shifts in the fine structure. Verhulst et al. (2012) similarly noted notches in the modeled, linear-extracted SFOAE; however, differences in the extraction paradigms may preclude direct comparisons between the notches reported here and in Schairer et al. to those in Verhulst et al. Aside from the notches, 12- and 24-cycle TBOAEnl magnitudes generally increased with stimulus level, and the 3-, 6-, 12-, and 24-cycle TBOAEnl I/O functions were similar within each subject.

FIG. 6.

FIG. 6

I/O functions for the 3-, 6-, 12-, and 24-cycle total TBOAEnl for 6 representative subjects. Each panel shows the functions for a single subject. Marker style denotes tone burst duration.

Figure 7 plots the mean I/O (left panel) and growth rate functions (middle panel) across subjects for the different TBOAEnl. TBOAEnl magnitudes were similar at each stimulus level. The largest difference was approximately 5 dB and occurred at 49 dB/Hz SPL between the 3-cycle and 24-cycle TBOAEnl. Growth rate functions are shown in the middle panel and were derived by calculating the derivative of the I/O functions after first smoothing the I/O functions with a spline interpolant. The growth functions for the different TBOAEnl all exhibited a region of increasingly compressive growth with increasing stimulus level that extended to between 25 and 31 dB/Hz SPL. At higher stimulus levels, the trajectory of the growth rate functions changed, and growth rates either remained nearly constant with increasing stimulus level (e.g., 3-cycle TBOAEnl) or became less compressive. Recall that the increasing contribution from the SL component associated with increasing stimulus level was responsible for the change in the trajectory of the growth rate function for both the 3- and 6-cycle TBOAEnl (see Fig. 5). Both the change in growth rates and similar magnitudes between the 12-/24-cycle TBOAEnl and the 3-/6-cycle TBOAEnl suggests that SL and LL components analogous to those in the TBOAEnl evoked by the shorter-duration tone bursts also contributed to the TBOAEnl evoked by the longer-duration tone bursts.

FIG. 7.

FIG. 7

Mean I/O, growth rate, and latency-intensity functions for the 3-, 6-, 12-, and 24-cycle total TBOAEnl. The left panel plots the I/O functions, the middle panel plots the growth rate functions, and the right panel plots the latency-intensity functions. Marker style denotes tone burst duration.

Figure 8 plots the total TBOAEnl latency-intensity functions for the same subjects’ whose I/O functions were plotted in Figure 6. In most subjects and for the 3-, 6-, and 12-cycle tone bursts, total TBOAEnl latency decreased monotonically with stimulus level. Additionally, the latencies for the 3-, 6-, and 12-cycle TBOAEnl were comparable at each stimulus level. Similar to the latencies for the TBOAEnl evoked by the shorter-duration tone bursts, latency for the 24-cycle TBOAEnl generally decreased as stimulus level increased. However, in several subjects and across a portion of the latency-intensity function, latency either remained nearly constant or increased as stimulus level increased (e.g., subjects 161L and 145R, respectively), thereby causing these latencies to deviate from those for the TBOAEnl evoked by the shorter-duration tone bursts. Within these same subjects’ functions, latencies often exhibited a rapid decrease as stimulus level continued to increase and eventually approximated the latencies for the shorter-duration tone bursts at the highest stimulus levels.

FIG. 8.

FIG. 8

Latency-intensity functions for the 3-, 6-, 12-, and 24-cycle total TBOAEnl for 6 representative subjects. Subjects are the same as those presented in Figure 6. Each panel shows the functions for a single subject. Marker style denotes tone burst duration.

The irregular changes in 24-cycle TBOAEnl latency across certain stimulus levels were typically associated with the emergence of a shorter-latency magnitude peak in the TBOAEnl envelope. Figure 9 plots the 24-cycle TBOAEnl envelopes for subject 153L at stimulus levels from 25 to 43 dB/Hz SPL, where the latency-intensity function for the 24-cycle TBOAEnl deviated from those for the 3-cycle TBOAEnl. Overlaid on each envelope is the latency calculated from Eq. 11. As stimulus level increased from 25 to 31 dB/Hz, the dominant magnitude peak that was initially centered around 5 ms split into two peaks, which had the effect of slightly increasing TBOAEnl latency. It was this increase in latency that caused the latency-intensity function to deviate from those for the shortest-duration tone bursts. As stimulus level continued to increase, TBOAEnl latency approximated the time index of the shorter-latency peak and the latency-intensity function approached those for the TBOAEnl evoked by the shorter-duration tone bursts.

FIG. 9.

FIG. 9

Illustration of the association between an emerging SL peak and an increase in 24-cycle TBOAEnl latency (subject 153L). TBOAEnl envelopes are shown for stimulus levels between 25 and 43 dB/Hz SPL. Triangles correspond to the calculated latency of each envelope. As stimulus level increased, a SL peak emerged at 31 dB/Hz SPL. The emergence of this peak was associated with a slight increase in TBOAEnl latency.

The right panel of Figure 7 plots the mean latency-intensity functions for each tone burst stimulus. Latencies were similar between the different TBOAEnl at each stimulus level. As stimulus level increased, latencies decreased from 8.35 to 3.63, 8.29 to 3.44, 8.10 to 3.47, and 8.15 to 3.80 ms for the 3-, 6-, 12-, and 24-cycle TBOAEnl, respectively. To quantify the level dependency of TBOAEnl latency, each function was fit by a first-order polynomial. Latencies decreased with increasing stimulus levels at rates of 0.116 ms/dB (5th–95th % confidence bounds ± 0.0123), 0.119 ms/dB (±0.0116), 0.116 ms/dB (±0.013), and 0.11 ms/dB (±0.0195). These rates correspond to approximately 0.2 cycles/dB when expressed relative to the period of a 2-kHz wave. Similar rates are observed in TEOAEnl data from Sisto and Moleti (2007), TBOAEnl data from Rasetshwane et al. (2013), and cochlear reflectance data from Rasetshwane and Neely (2012). In the SFOAEnl, Schairer et al. (2006) reported approximately a 50 % decrease in latency across a 30-dB increase in stimulus level, which is similar to that observed in the TBOAEnl (e.g., the median 24-cycle TBOAEnl latency decreased from 7.16 to 3.80 ms—a 53 % decrease—between 19 dB/Hz SPL and 49 dB/Hz SPL). Recall that the level dependency of the 3- and 6-cycle total TBOAEnl was due to the largest contribution to the TBOAEnl shifting from the LL component at low stimulus levels to the SL component at higher stimulus levels. The similarity between the TBOAEnl evoked by the longer-duration tone bursts and the 3- and 6-cycle TBOAEnl thus suggests that all TBOAEnl included comparable contributions from SL and LL components.

The latency-intensity functions plotted in Figure 7 exhibited relatively little dependency on tone burst duration. This result is seemingly at odds with findings from Rasetshwane et al. (2013). Similar to the current study, these investigators measured TBOAE latency as a function of stimulus level for different duration 2-kHz tone bursts; however, their tone bursts spanned a much narrower time frame (2–4 ms) compared to the current study (1.5–12 ms). Their findings demonstrated that the latency of the TBOAE systematically increased as tone burst duration increased. It is hypothesized that the origin of Rasetshwane et al. findings was due to referencing latency to tone burst onset as opposed to the peak of the tone burst. As the rise time of the tone burst increases, the peak occurs at a later moment in time relative to stimulus onset, thereby causing the peak of the TBOAE to be increasingly delayed relative to stimulus onset. By referencing latency to the peaks of the tone bursts, as done in the current study, the influence of differences in rise times is reduced.

DISCUSSION

The Effect of Stimulus Bandwidth on the TBOAEnl

Previous studies have reported the presence of different latency components within the TEOAE (Withnell and McKinley 2005; Goodman et al. 2009, 2011; Moleti et al. 2012a). Common findings across these studies were the following: (1) Shorter-latency components grew less compressively than longer-latency components, (2) the dominant component contributing to the TEOAE shifted to shorter-latency components as stimulus level increased, and (3) the latency of each component was nearly level invariant (Carvalho et al. 2003). It is notable that these studies all used acoustic clicks to evoke the OAE, with spectral bandwidths wider than those for the tone bursts used in the current study. The broad bandwidth of click stimuli has previously been hypothesized to result in IMD and the generation of SL TEOAE components (Withnell and McKinley 2005; Withnell et al. 2008; Goodman et al. 2009; Moleti et al. 2012a).

In the current study, different latency components were measurable in both the 3- and 6-cycle TBOAEnl envelopes, despite their relatively narrow bandwidths compared to clicks. For most subjects, three or four components were evident, several of which (i.e., the SL components) had latency earlier than expected for generation through LCR at the tonotopic place (Shera and Guinan 2003). As with the different latency components in the aforementioned TEOAE studies, shorter-latency components grew less compressively than longer-latency components, and the latencies of all components were approximately level invariant. The I/O functions and latencies of the different components in the 6-cycle TBOAEnl approximated those in the 3-cycle TBOAEnl, demonstrating that the generation of these components was relatively independent of the stimulus energy at frequencies beyond ±1/6 octaves re: 2 kHz (i.e., the bandwidth of the 6-cycle tone burst). Therefore, any contribution to the TBOAEnl resulting from IMD would necessitate nonlinear interactions between stimulus frequencies within a narrow range of the TBOAEnl frequency.

The 2f1 − f2 distortion product (f2/f1 ≈ 1.2) has recently been implicated as the potential source of at least one of the SL components (Moleti et al. 2012a) as the latency factor separating LL and SL components in the TEOAE (∼1.6; Goodman et al. 2009; Moleti et al. 2012a) approximates that expected between an emission generated at the tonotopic place through LCR and an emission generation through low-side cubic distortion (f2/f1 ≈ 1.2) at the f2 place (Moleti et al. 2012a; see Konrad-Martin and Keefe 2003 for DPOAE vs SFOAE latency comparisons). The reduction in spectral bandwidth between the 3- and 6-cycle 2-kHz tone bursts does not support this hypothesis, however. In order to generate a 2-kHz-band 2f1 − f2 distortion source OAE (f2/f1 ≈ 1.2), stimulus energy around 2.5 kHz (f1) and 3.1 kHz (f2) is required, but the 6-cycle tone burst’s energy at and around these frequencies relative to that for the 3-cycle tone burst was attenuated by approximately 20 dB. Despite this attenuation, 6-cycle TBOAEnl SL components at comparable latencies and magnitudes to 3-cycle TBOAEnl SL components were observed. As such, 2f1 − f2 distortion (f2/f1 ≈ 1.2) is not supported as the likely source of SL components, although this does not preclude potential contributions from other distortion products or from 2f1 − f2 at f2 / f1 other than 1.2. Indeed, if IMD does contribute to the SL components, it likely does so through complex stimulus frequency interactions that cannot be reduced to a single distortion product.

The method used to unmix the different latency components from the 3- and 6-cycle TBOAEnl (i.e., time windowing) was not suitable for the 12- and 24-cycle TBOAEnl due to the poorer temporal resolution of these responses. However, the morphologies of these envelopes were consistent with contributions from different latency components. Specifically, at moderate-to-high stimulus levels, the 12- and 24-cycle TBOAEnl envelopes often exhibited a shorter- and longer-latency magnitude peak, separated by a magnitude null. Talmadge et al. (1999) demonstrated that magnitude nulls in the DPOAE envelope are associated with the offset and onset of the SL distortion and LL reflection source components, respectively, when the two components are in antiphase. In the case of the TBOAEnl envelopes for the longer-duration tone bursts, magnitude nulls are hypothesized to correspond to the offset of a shorter-latency component and onset of a longer-latency component in antiphase (or nearly antiphase). This is not to say that the shorter-latency component is a distortion source component but, rather, that the 12- and 24-cycle TBOAEnl envelopes are characteristic of wave interference between overlapping components with different latencies and phases.

Assuming that the same SL components contributed to the 3-, 6-, 12-, and 24-cycle 2 kHz TBOAEnl, their generation would depend primarily on stimulus energy across a frequency region constrained to ±1/24 octave re: 2 kHz (i.e., the bandwidth of the 24-cycle tone burst). Findings from Sisto et al. (2013) using time-frequency analysis to unmix the SFOAEnl into LL and SL components support this hypothesis. The latencies of these SFOAEnl components approximated those of TEOAEnl components and exhibited qualitatively similar magnitudes and growth rates. As mentioned by Sisto et al., closer agreement between the SF and TEOAEnl components may have been evidenced if the stimuli used to evoke the SF and TEOAE were calibrated to account for the effect of stimulus bandwidth, as was done in the current study (see Prieve et al. 1996; Kalluri and Shera 2007a).

To infer SL and LL contributions to the 12- and 24-cycle TBOAEnl, comparable to those for the 3- and 6-cycle TBOAEnl, the I/O and latency-intensity functions of the total TBOAEnl were compared across tone burst durations. Recall that the I/O and latency-intensity functions for the 3- and 6-cycle TBOAEnl were shaped by the relative contributions of the different latency components (see Fig. 5). The agreement between the 3-, 6-, 12-, and 24-cycle TBOAEnl magnitudes and latencies across stimulus levels is interpreted as evidence that the same LL and SL components contributed to the 3- and 6-cycle TBOAEnl as well as the 12- and 24-cycle TBOAEnl. Admittedly, comparing the level dependence of the 12- and 24-cycle TBOAEnl to that of the 3- and 6-cycle TBOAEnl provides only an indirect test of whether the same components contributed to the TBOAEnl, regardless of bandwidth. A more direct test would require unmixing the 12- and 24-cycle TBOAEnl into different latency components (e.g., evoking the TBOAEnl with additional tone bursts with center frequencies around 2 kHz and using time-frequency analysis; Moleti et al. 2012b; Sisto et al. 2013).

The similarity between the TBOAEnl evoked by tone bursts from 3- to 24-cycles demonstrates that the generation of each frequency band within the TBOAEnl depends primarily on stimulus energy within the same frequency band (Xu et al. 1994; Prieve et al. 1996). In other words, the generation of the TBOAEnl—regardless of the evoking stimulus’ bandwidth—occurs through relatively independent cochlear channels. This finding contrasts with the hypothesis that SL components in the TEOAE are primarily generated through IMD (Withnell et al. 2008; Goodman et al. 2009; Moleti et al. 2012a). It is important to remember that the data from the current study were obtained from humans. Interspecies differences in cochlear mechanics may preclude a generalization of the current findings. For instance, TEOAE measurements in guinea pigs implicate an IMD mechanism (Withnell and Yates 1998; Yates and Withnell 1999; Withnell and McKinley 2005). Even in humans, the cochlear channels through which the TEOAE is generated may not be entirely independent of one another. Konrad-Martin and Keefe (2003, 2005) observed that when a transient stimulus is used to evoke the OAE (e.g., a short tone burst), the spectrum of the OAE exhibits asymmetry with the magnitudes of higher-frequency OAE components being larger than lower-frequency components. They hypothesized that this asymmetry is a by-product of two-tone suppression where the higher-frequency components suppress the slightly lower-frequency components. In the current study, two-tone suppression between the different frequency components in the TBOAEnl may explain the more compressive growth of the 3- and 6-cycle TBOAEnl compared to the 12- and 24-cycle TBOAEnl, given the broader bandwidths of the tone bursts used to evoke the former. If two-tone suppression does influence the TEOAE, then IMD may as well; however, any such contributions appear to be small (e.g., Moleti et al. 2013).

Influence of TBOAEnl Extraction Paradigm

A known consequence of the double-evoked technique (or any nonlinear-derived technique, e.g., Kemp et al. 1990) that is the extracted OAE may include only a portion of the total OAE evoked by the s1 stimulus (Tognola et al. 2001; Schairer et al. 2003; Kalluri and Shera 2007b; Moleti et al. 2012a). Compared to nonequal-level variants of the double-evoked paradigm, an even greater portion of the OAE is lost in the extraction when using the equal-level variant. It is therefore important to consider whether the phenomenon described in the current study, observed using an equal-level, double-evoked extraction, also generalizes to the TBOAEnl evoked using other nonlinear paradigms and the TBOAE evoked using linear paradigms.

The first column of panels in Figure 10 compares the 3-cycle TBOAEnl envelopes evoked using equal-level (L2 = L1) and nonequal-level (L2 = L1 + 12 dB) double-evoked paradigms within the same subject (161L), at identical stimulus levels (7–31 dB/Hz SPL). Recall from the “METHODS” section that a nonequal-level paradigm was initially used for data collection but was abandoned in favor of the equal-level paradigm as the latter afforded extraction of the SL components at higher stimulus levels (note that these levels are not shown in Fig. 10). Subject 161L was one of three who participated in both data collection efforts. Different latency components characterized both data sets and components for L2 = L1 were smaller in magnitude than those for L2 = L1 + 12 dB (as expected, see “METHODS”). As such, the SL components were measureable at lower stimulus levels for L2 = L1 + 12 dB. Thus, although the 3-cycle TBOAEnl components are attenuated for L2 = L1, the presence of analogous components in the L2 = L1 + 12 dB data set implies that the same phenomenon underlies both (see Goodman et al. 2009; Moleti et al. 2012b for L2 = L1 + 15 dB). Unfortunately, emission data for the longer-duration tone bursts using L2 = L1 + 12 dB were not collected at the same L1 as were used for L2 = L1. However, as shown in the second column of panels in Figure 10, the 24-cycle TBOAEnl envelopes between the two data sets resembled each other and were characteristic of wave interference between different latency components.

FIG. 10.

FIG. 10

Comparison of 3- and 24-cycle TBOAEnl envelopes between equal- (L 2 = L 1) and nonequal-level (L 2 = L 1 + 12 dB) double-evoked paradigms (subject 161L). The left column shows data for the 3-cycle TBOAEnl (top row L 2 = L 1, bottom row L 2 = L 1 + 12 dB). The right column shows data for the 6-cycle TBOAEnl. The 3-cycle TBOAEnl envelopes were evoked at equal L 1 for both paradigms. L 1 for the 24-cycle TBOAEnl envelopes using the equal-level paradigm was 3 dB higher than L 1 using the nonequal-level paradigm. Line thickness increases with increasing stimulus level (i.e., L 1).

It remains possible that the TBOAE evoked using a linear paradigm depends more strongly on stimulus bandwidth than does that evoked using nonlinear paradigms. Unfortunately, temporal and spectral overlap between the OAE and stimuli precluded the use of a linear extraction paradigm and, therefore, comparisons with the nonlinear-evoked data. However, Goodman et al. (2011) and Moleti et al. (2012a) both presented evidence demonstrating that the different latency components in the TEOAEnl were also evident in the TEOAE extracted using a linear paradigm. Compared to the TEOAEnl, TEOAE components were larger in magnitude (see Fig. 13 in Moleti et al. 2012a) and had slightly more compressive growth rates (see Fig. 4 in Goodman et al.). Thus, while there are differences in the OAE measured across linear and nonlinear paradigms, the common finding that different latency components contribute to the evoked OAE demonstrates that the findings from the current study are not unique to the equal-level, double-evoked extraction paradigm.

Generation Mechanisms of SL Components

The conclusion that IMD in humans is not a primary contributor to TBOAEnl SL component generation does not preclude generation of these components at off-frequency places in the cochlea (i.e., frequency places away from the characteristic frequency place). Indeed, the different latencies of these components seem to necessitate generation at different cochlear locations. One hypothesis is that the SL components are generated basal to the tonotopic place through LCR (Goodman et al. 2011; Moleti et al. 2013). Using a nonlinear cochlear model simulation of TEOAEnl generation, Moleti et al. (2013) implicated SL component generation resulting from reflection of the traveling wave off impedance discontinuities approximately 0.3 octave basal to the tonotopic place. Even more recently, Moleti et al. (2014) provided experimental evidence implicating generation of the SL components between 0.5 and 1 mm basal to the TEOAEnl frequency’s tonotopic place (0.11–0.21 octave basal; Greenwood 1961, 1990), on average.

Other potential generation mechanisms of SL components (whether TE, TB, or SFOAE for linear or nonlinear evoking paradigms) include nonlinear coherent reflection (Talmadge et al. 2000), distributed sources of nonlinear distortion (Siegel et al. 2005), and reflection from the tail portion of the traveling wave (Guinan 1990; Siegel et al. 2004; Choi et al. 2008). SL components may also potentially arise through fast backward propagation of the emission through cochlear fluid compression waves (Ren 2004). Regardless of the exact mechanism, the similarity observed in the current study between the different TBOAEnl argues for a common generation mechanism underlying click-evoked OAE (approximated by the 3-cycle TBOAEnl) and SFOAE (approximated by the 24-cycle TBOAEnl) generation across a wide range of stimulus levels (Kalluri and Shera 2007a; Sisto et al. 2013).

Acknowledgments

Subject compensation was provided by a grant from the Executive Council for Graduate and Professional Students at the University of Iowa. The authors thank Rachel Stanziola and Brittany James who performed the data collection for this study. Portions of this work were presented at the 2013 Annual Meeting of the American Auditory Society and the 2014 Mid-winter Meeting of the Association for Research in Otolaryngology. This research served as part of the first author’s dissertation work.

Conflict of Interest

Neither of the authors have a commercial interest or other conflict of interest concerning the research detailed in the submitted manuscript. Funds for data collection (subject compensation) were provided by a grant from the Executive Council for Graduate and Professional Students at the University of Iowa.

Contributor Information

James D. Lewis, Phone: +1-402-4986361, Email: james.lewis@boystown.org

Shawn S. Goodman, Email: Shawn-goodman@uiowa.edu

References

  1. Avan P, Bonfils P, Loth D, Elbez M, Erminy M. Transient-evoked otoacoustic emissions and high-frequency acoustic trauma in the guinea pig. J Acoust Soc Am. 1995;97:3012–3020. doi: 10.1121/1.411866. [DOI] [PubMed] [Google Scholar]
  2. Carvalho S, Büki B, Bonfils P, Avan P. Effect of click intensity on click-evoked otoacoustic emission waveforms: implications for the origin of emissions. Hear Res. 2003;175:215–225. doi: 10.1016/S0378-5955(02)00745-1. [DOI] [PubMed] [Google Scholar]
  3. Choi Y-S, Lee S-Y, Parham K, Neely ST, Kim DO. Stimulus-frequency otoacoustic emission: measurements in humans and simulations with an active cochlear model. J Acoust Soc Am. 2008;123:2651–2669. doi: 10.1121/1.2902184. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Don M, Eggermont JJ. Analysis of the click-evoked brainstem potentials in man using high-pass noise masking. J Acoust Soc Am. 1978;63:1084–1092. doi: 10.1121/1.381816. [DOI] [PubMed] [Google Scholar]
  5. Gilman S, Dirks DD. Acoustics of ear canal measurement of eardrum SPL in simulators. J Acoust Soc Am. 1986;80:783–793. doi: 10.1121/1.393953. [DOI] [PubMed] [Google Scholar]
  6. Goodman SS, Fitzpatrick DF, Ellison JC, Jesteadt W, Keefe DH. High-frequency click-evoked otoacoustic emissions and behavioral thresholds in humans. J Acoust Soc Am. 2009;125:1014–1032. doi: 10.1121/1.3056566. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Goodman SS, Mertes IB, Scheperle RA (2011) Delays and growth rates of multiple TEOAE components. In: Shera CA, Olson ES (eds) What fire is in mine ears: progress in auditory biomechanics: Proceedings of the 11th International Mechanics of Hearing Workshop, American Institute of Physics, pp 279–285
  8. Greenwood DD. Critical bandwidth and frequency coordinates of the BM. J Acoust Soc Am. 1961;33:1344–1356. doi: 10.1121/1.1908437. [DOI] [Google Scholar]
  9. Greenwood DD. A cochlear frequency-position function for several species—29 years later. J Acoust Soc Am. 1990;87:2592–2605. doi: 10.1121/1.399052. [DOI] [PubMed] [Google Scholar]
  10. Guinan JJ., Jr . Changes in stimulus frequency otoacoustic emissions produced by two-tone suppression and efferent stimulation in cat. In: Dallos P, Geisler CD, Matthews J, Ruggero MA, Steele C, editors. The mechanics and biophysics of hearing. Madison: Springer; 1990. pp. 170–177. [Google Scholar]
  11. Hoaglin D, Mosteller F, Tukey JW. Understanding robust and exploratory data analysis. New York: Wiley; 1983. [Google Scholar]
  12. Kalluri R, Shera CA. Near equivalence of human click-evoked and stimulus-frequency otoacoustic emissions. J Acoust Soc Am. 2007;121:2097–2110. doi: 10.1121/1.2435981. [DOI] [PubMed] [Google Scholar]
  13. Kalluri R, Shera CA. Comparing stimulus-frequency otoacoustic emissions measured by compression, suppression, and spectral smoothing. J Acoust Soc Am. 2007;122:3562–3575. doi: 10.1121/1.2793604. [DOI] [PubMed] [Google Scholar]
  14. Keefe DH. Double-evoked otoacoustic emissions. I. Measurement theory and nonlinear coherence. J Acoust Soc Am. 1998;103:3489–3498. doi: 10.1121/1.423057. [DOI] [PubMed] [Google Scholar]
  15. Keefe DH, Ling R. Double-evoked otoacoustic emissions. II. Intermittent noise rejection, calibration and ear-canal measurements. J Acoust Soc Am. 1998;103:3499–3508. doi: 10.1121/1.423058. [DOI] [PubMed] [Google Scholar]
  16. Kemp DT, Ryan S, Bray P. A guide to the effective use of otoacoustic emissions. Ear Hear. 1990;11:93–105. doi: 10.1097/00003446-199004000-00004. [DOI] [PubMed] [Google Scholar]
  17. Konrad-Martin D, Keefe DH. Time-frequency analyses of transient-evoked stimulus-frequency and distortion-product otoacoustic emissions: testing cochlear model predictions. J Acoust Soc Am. 2003;114:2021–2043. doi: 10.1121/1.1596170. [DOI] [PubMed] [Google Scholar]
  18. Konrad-Martin D, Keefe DH. Transient-evoked stimulus-frequency and distortion-product otoacoustic emissions in normal and impaired ears. J Acoust Soc Am. 2005;117:3799–3815. doi: 10.1121/1.1904403. [DOI] [PubMed] [Google Scholar]
  19. Moleti A, Botti T, Sisto R. Transient-evoked otoacoustic emission generators in a nonlinear cochlea. J Acoust Soc Am. 2012;131:2891–2903. doi: 10.1121/1.3688474. [DOI] [PubMed] [Google Scholar]
  20. Moleti A, Longo F, Sisto R. Time-frequency domain filtering of evoked otoacoustic emissions. J Acoust Soc Am. 2012;132:2455–2467. doi: 10.1121/1.4751537. [DOI] [PubMed] [Google Scholar]
  21. Moleti A, Al-Maamury AM, Bertaccini D, Bott T, Sisto R. Generation place of the long- and short-latency components of transient-evoked otoacoustic emissions in a nonlinear cochlear model. J Acoust Soc Am. 2013;133:4098–4108. doi: 10.1121/1.4802940. [DOI] [PubMed] [Google Scholar]
  22. Moleti A, Sisto R, Lucertini M. Experimental evidence for the basal generation place of the short-latency transient-evoked otoacoustic emissions. J Acoust Soc Am. 2014;135:2862–2872. doi: 10.1121/1.4870699. [DOI] [PubMed] [Google Scholar]
  23. Neely ST, Norton SJ, Gorga MP, Jestaedt W. Latency of auditory brain-stem responses and otoacoustic emissions using tone-burst stimuli. J Acoust Soc Am. 1988;83:652–656. doi: 10.1121/1.396542. [DOI] [PubMed] [Google Scholar]
  24. Prieve BA, Gorga MP, Neely ST. Click- and tone-burst-evoked otoacoustic emissions in normal-hearing and hearing-impaired ears. J Acoust Soc Am. 1996;99:3077–3086. doi: 10.1121/1.414794. [DOI] [PubMed] [Google Scholar]
  25. Rasetshwane DM, Neely ST. Measurements of wide-band cochlear reflectance in humans. J Assoc Res Otolaryngol. 2012;13:591–607. doi: 10.1007/s10162-012-0336-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Rasetshwane DM, Argenyi M, Neely ST, Kopun JG, Gorga MP. Latency of tone-burst-evoked auditory brain stem responses and otoacoustic emissions: level, frequency, and rise-time effects. J Acoust Soc Am. 2013;133:2803–2817. doi: 10.1121/1.4798666. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Recio A, Rich NC, Narayan SS, Ruggero MA. Basilar-membrane responses to clicks at the base of the chinchilla cochlea. J Acoust Soc Am. 1998;103:1972–1989. doi: 10.1121/1.421377. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Ren T. Reverse propagation of sound in the gerbil cochlea. Nat Neurosci. 2004;7:333–334. doi: 10.1038/nn1216. [DOI] [PubMed] [Google Scholar]
  29. Sachs RM, Burkhard MD. Insert earphone pressure response in real ears and couplers. J Acoust Soc Am. 1972;52:183. doi: 10.1121/1.1982149. [DOI] [Google Scholar]
  30. Schairer KS, Fitzpatrick D, Keefe DH. Input-output functions for stimulus-frequency otoacoustic emissions in normal-hearing adult ears. J Acoust Soc Am. 2003;114:944–966. doi: 10.1121/1.1592799. [DOI] [PubMed] [Google Scholar]
  31. Schairer KS, Ellison JC, Fitzpatrick D, Keefe DH. Use of stimulus-frequency otoacoustic emission latency and level to investigate cochlear mechanics in human ears. J Acoust Soc Am. 2006;120:901–914. doi: 10.1121/1.2214147. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Shera CA, Bergevin C. Obtaining reliable phase-gradient delays from otoacoustic emission data. J Acoust Soc Am. 2012;132:927–943. doi: 10.1121/1.4730916. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Shera CA, Guinan JJ., Jr Evoked otoacoustic emissions arise by two fundamentally different mechanisms: a taxonomy for mammalian OAEs. J Acoust Soc Am. 1999;105:782–798. doi: 10.1121/1.426948. [DOI] [PubMed] [Google Scholar]
  34. Shera CA, Guinan JJ Jr (2003) Stimulus-frequency-emission group delay: A test of coherent reflection filtering and a window on cochlear tuning. J Acoust Soc Am 113:2762-2772 [DOI] [PubMed]
  35. Siegel JH, Cerka AJ, Temchin AN, Ruggero MA. Similar two-tone suppression patterns in SFOAEs and the cochlear microphonics indicate comparable spatial summation of underlying generators. Assoc Res Otolaryngol Mid-Winter Meeting Abstract. 2004;27:365. [Google Scholar]
  36. Siegel JH, Cerka AJ, Recio-Spinoso A, Temchin AN, van Dijk P, Ruggero MA. Delays of stimulus-frequency otoacoustic emissions and cochlear vibrations contradict the theory of coherent reflection filtering. J Acoust Soc Am. 2005;118:2434–2443. doi: 10.1121/1.2005867. [DOI] [PubMed] [Google Scholar]
  37. Sisto R, Moleti A. Transient evoked otoacoustic emission latency and cochlear tuning at different stimulus levels. J Acoust Soc Am. 2007;122:2183–2190. doi: 10.1121/1.2769981. [DOI] [PubMed] [Google Scholar]
  38. Sisto R, Moleti A, Shera CA. Cochlear reflectivity in transmission-line models and otoacoustic emission characteristic time delays. J Acoust Soc Am. 2007;122:3554–3561. doi: 10.1121/1.2799498. [DOI] [PubMed] [Google Scholar]
  39. Sisto R, Sanjust F, Moleti A. Input/output functions of different-latency components of transient-evoked and stimulus-frequency otoacoustic emissions. J Acoust Soc Am. 2013;133:2240–2253. doi: 10.1121/1.4794382. [DOI] [PubMed] [Google Scholar]
  40. Stinson MR, Shaw EAG, Lawton BW. Estimation of acoustical energy reflectance at the eardrum from measurements of pressure distribution in the human ear canal. J Acoust Soc Am. 1982;72:766–773. doi: 10.1121/1.388257. [DOI] [PubMed] [Google Scholar]
  41. Stover LJ, Neely ST, Gorga MP. Latency and multiple sources of distortion product otoacoustic emissions. J Acoust Soc Am. 1996;99:1016–1024. doi: 10.1121/1.414630. [DOI] [PubMed] [Google Scholar]
  42. Talmadge CL, Long GR, Tubis A, Dhar S. Experimental confirmation of the two-source interference model for the fine structure of distortion product otoacoustic emissions. J Acoust Soc Am. 1999;105:275–292. doi: 10.1121/1.424584. [DOI] [PubMed] [Google Scholar]
  43. Talmadge CL, Tubis A, Long GR, Tong C. Modeling the combined effects of BM nonlinearity and roughness on stimulus frequency otoacoustic emission fine structure. J Acoust Soc Am. 2000;108:2911–2932. doi: 10.1121/1.1321012. [DOI] [PubMed] [Google Scholar]
  44. Tognola G, Grandori F, Ravazzani P. Time-frequency distributions of click-evoked otoacoustic emissions. Hear Res. 1997;106:112–122. doi: 10.1016/S0378-5955(97)00007-5. [DOI] [PubMed] [Google Scholar]
  45. Tognola G, Ravazzani P, Molini E, Ricci G, Alunni N, Parazzini M, Grandori F. “Linear” and “derived” otoacoustic emissions in newborns: a comparative study. Ear Hear. 2001;22:182–190. doi: 10.1097/00003446-200106000-00002. [DOI] [PubMed] [Google Scholar]
  46. Verhulst S, Dau T, Shera CA. Nonlinear time-domain cochlear model for transient stimulation and human otoacoustic emission. J Acoust Soc Am. 2012;132:3842–3848. doi: 10.1121/1.4763989. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Withnell RH, McKinley S. Delay dependence for the origin of the nonlinear derived transient evoked otoacoustic emission. J Acoust Soc Am. 2005;117:281–291. doi: 10.1121/1.1798352. [DOI] [PubMed] [Google Scholar]
  48. Withnell RH, Yates GK. Enhancement of the transient-evoked otoacoustic emission produced by the addition of a pure tone in the guinea pig. J Acoust Soc Am. 1998;104:344–349. doi: 10.1121/1.423243. [DOI] [PubMed] [Google Scholar]
  49. Withnell RH, Yates GK, Kirk DL. Changes to low-frequency components of the TEOAE following acoustic trauma to the base of the cochlea. Hear Res. 2000;139:1–12. doi: 10.1016/S0378-5955(99)00132-X. [DOI] [PubMed] [Google Scholar]
  50. Withnell RH, Hazlewood C, Knowlton A. Reconciling the origin of the transient evoked otoacoustic emission in humans. J Acoust Soc Am. 2008;123:212–221. doi: 10.1121/1.2804635. [DOI] [PubMed] [Google Scholar]
  51. Xu L, Probst R, Harris FP, Roede J. Peripheral analysis of frequency in human ears revealed by tone burst evoked otoacoustic emissions. Hear Res. 1994;74:173–180. doi: 10.1016/0378-5955(94)90185-6. [DOI] [PubMed] [Google Scholar]
  52. Yates GK, Withnell RH. The role of intermodulation distortion in transient-evoked otoacoustic emissions. Hear Res. 1999;136:49–64. doi: 10.1016/S0378-5955(99)00108-2. [DOI] [PubMed] [Google Scholar]
  53. Zweig G, Shera CA. The origin of periodicity in the spectrum of evoked otoacoustic emissions. J Acoust Soc Am. 1995;98:2018–2047. doi: 10.1121/1.413320. [DOI] [PubMed] [Google Scholar]

Articles from JARO: Journal of the Association for Research in Otolaryngology are provided here courtesy of Association for Research in Otolaryngology

RESOURCES