Sensitivity of bilateral cochlear implant users to fine-structure and envelope interaural time differences

Victor A Noel; Donald K Eddington

doi:10.1121/1.4794372

. 2013 Apr;133(4):2314–2328. doi: 10.1121/1.4794372

Sensitivity of bilateral cochlear implant users to fine-structure and envelope interaural time differences^a

Victor A Noel ^1,^a), Donald K Eddington ^1,^b)

PMCID: PMC3631249 PMID: 23556598

Abstract

Bilateral cochlear implant users have poor sensitivity to interaural time differences (ITDs) of high-rate pulse trains, which precludes use of these stimuli to convey fine-structure ITD cues. However, previous reports of single-neuron recordings in cats demonstrated good ITD sensitivity to 1000 pulses-per-second (pps) pulses when the pulses were sinusoidally amplitude modulated. The ability of modulation to restore ITD sensitivity to high-rate pulses in humans was tested by measuring ITD thresholds for three conditions: ITD encoded in the modulated carrier pulses alone, in the envelope alone, and in the whole waveform. Five of six subjects were not sensitive to ITD in the 1000-pps carrier, even with modulation. One subject's 1000-pps carrier ITD sensitivity did significantly improve due to modulation. Sensitivity to ITD encoded in the envelope was also measured as a function of modulation frequency, including at frequencies from 4 to 16 Hz where much of the speech envelope's energy and information resides. Sensitivity was best at the modulation frequency of 100 Hz and degraded rapidly outside of a narrow range. These results provide little evidence to support encoding ITD in the carrier of current bilateral processors, and suggest envelope ITD sensitivity is poor for an important segment of the speech modulation spectrum.

INTRODUCTION

The use of two spatially-separated ears confers many advantages to normal-hearing (NH) listeners, including improved localization of sounds (Middlebrooks and Green, 1991; Blauert, 1997) and better understanding of speech in noisy environments (e.g., Bronkhorst and Plomp, 1988). Interaural time differences (ITDs) are created by the azimuth-dependent delay for sound arriving at the two ears, and are important cues for sound localization (Rayleigh, 1907; Wightman and Kistler, 1992; Macpherson and Middlebrooks, 2002) as well as understanding speech in the presence of noise (Zurek, 1993) or in the presence of competing speech (Carhart et al., 1967). ITDs are present in both the slowly varying amplitude envelope of a signal and in its more rapidly varying carrier, or fine structure. Early studies indicated that ITD cues were only used for low-frequency sounds (Rayleigh, 1907), and psychophysical studies confirmed that sensitivity to the fine-structure ITDs of pure tones is limited to frequencies of up to about 1400 Hz (Klumpp and Eady, 1956). Later studies showed that listeners can also lateralize on the basis of envelope ITDs of high-frequency sounds (Henning, 1974; Bernstein and Trahiotis, 1994).

Cochlear implant (CI) patients are increasingly choosing bilateral implantation, and numerous studies have demonstrated that these patients benefit from improved speech reception in noise (van Hoesel and Tyler, 2003; Schleich et al., 2004; Litovsky et al., 2006; Loizou et al., 2009) and improved localization (van Hoesel and Tyler, 2003; Nopp et al., 2004; Litovsky et al., 2009). However, the amount of benefit received by bilateral cochlear implant (BiCI) users is less than that achieved by NH listeners, and the benefits seem to be well accounted for by the level differences at the two ears for both localization (Grantham et al., 2007; Seeber and Fastl, 2008) and for speech reception in noise (e.g., Loizou et al., 2009; for a review see van Hoesel, 2011). Improving the ITD-based binaural benefit therefore seems an important step toward narrowing the gap in the overall benefit between BiCI and NH listeners.

In the speech coding strategies used with most modern CI processors, the input signal is divided into frequency bands by a bank of bandpass filters. The envelopes of the filter outputs are then extracted and used to amplitude modulate a set of interleaved pulse trains. As a natural result of these speech coding strategies, the envelope ITD of the input signal is represented in the envelopes of the stimulation signal. The fine structure of the signal is discarded completely and the interaural delay of the carrier pulses is almost always unrelated to the incoming sound signal.1

Even if fine-structure ITD was encoded in the carrier pulse trains of bilateral processors, benefit might be limited because BiCI patients have poor sensitivity to the ITD of un-modulated pulse trains at rates typical of modern CIs (van Hoesel and Tyler, 2003; Laback et al., 2007; van Hoesel, 2007; Poon et al., 2009; van Hoesel et al., 2009). The findings of these studies are largely consistent. The best ITD sensitivity is found with pulse trains of 50 to 100 pulses-per-second (pps), where good listeners have detection thresholds to ITDs of 100 to 200 μs, although variability among subjects is large. As pulse rates increase, ITD sensitivity drops off rapidly. For example, van Hoesel et al. (2009) measured thresholds for lateralization in six BiCI subjects pre-screened for their relatively good ITD sensitivity. For un-modulated pulse trains, with rates ranging from 100 to 1000 pps, the best ITD sensitivity was at 100 pps where the mean threshold for apical electrode pairs was about 165 μs. ITD sensitivity degraded as the pulse rate increased: The mean threshold of the six BiCI subjects increased to approximately 450 μs at 600 pps, and at 1000 pps many thresholds were above 1 ms.

Since the carrier pulse rates used in most contemporary commercial speech processors range from 800 to several thousand pps, the poor ITD sensitivity at these rates seems to preclude successfully coding relevant ITDs in the carriers. However, data from animal auditory-physiology experiments suggest amplitude modulation may improve the ITD sensitivity to high-rate pulse trains. In a pair of companion studies, Smith and Delgutte (2007, 2008) made single-neuron recordings in the inferior colliculus (IC) of bilaterally implanted cats while stimulating with pulse trains of varying interaural delay. When stimulating with constant-amplitude pulse trains of 40 pps, they recorded from units that responded to almost every pulse and were sharply tuned to ITD. As pulse rates were raised, the post-onset response fell, nearly ceasing at rates above 300 pps and resulting in an increase in neural ITD-thresholds for increasing pulse rates resembling that seen with human psychophysics. However, the authors also found that an ongoing response could be restored to a high-rate (1000-pps) pulse train by 100% amplitude modulation with a 40-Hz sinusoid, resulting in neural ITD thresholds improving to levels comparable to those for the 40-pps un-modulated pulses. This sensitivity was to carrier ITD of the 1000-pps pulse trains, not the envelopes, as it was observed even when the envelope ITD was fixed at zero. A model that successfully described the IC neurons' behavior included an interaural coincidence detector followed by a slow, inhibitory feedback with a time constant of several milliseconds. This model described how un-modulated 1000-pps pulse trains produced a buildup of inhibition which blocked the ongoing response, but the addition of modulation provided low amplitude periods during which the inhibition could dissipate, enabling an ongoing response to subsequent pulses. An alternative model predicts the observed improvement without invoking inhibition (Colburn et al., 2009), but the basic mechanism remains the same: Low amplitude intervals of a modulator restore an ongoing response.

Several psychophysical studies with human BiCI listeners have shown that ITD sensitivity to modulated high-rate pulse trains is much improved compared to un-modulated trains at the same high rates (van Hoesel and Tyler, 2003; Majdak et al., 2006; van Hoesel, 2007; van Hoesel et al., 2009). A reasonable assumption has been that this improved ITD sensitivity is due to the envelope itself, given that sensitivity to the un-modulated pulse trains is mostly nonexistent. However the physiology results also raise the possibility that at rates near 1000 pps, the modulated carrier itself is also contributing to the improved ITD sensitivity. The current study seeks to examine this possibility directly. Experiment 1 tested the hypothesis that sinusoidal amplitude modulation (SAM) improves carrier ITD sensitivity to high-rate pulse trains in human BiCI subjects. Psychophysical measures of ITD threshold were made using parameters similar to those from the animal physiological experiments, using waveforms with various combinations of envelope and carrier ITD. The results were analyzed for evidence of carrier ITD sensitivity. If amplitude modulation, which occurs naturally in CI processing, can improve carrier ITD sensitivity in human BiCI users, then modifying bilateral processors so that their interaural carrier delay agrees with the input signal's ITD may improve performance. If modulation does not improve carrier ITD sensitivity, encoding the signal's ITD in the carriers would not be indicated.

As previously mentioned, the envelope of natural sounds also carries ITD information, and most CI speech processors present this envelope ITD in their output signals (Laback et al., 2004). BiCI listeners are sensitive to this envelope ITD when acoustic click-trains are presented to their speech processors (Laback et al., 2004; van Hoesel, 2004; Senn et al., 2005), often showing thresholds between 100 and 300 μs. When the input signal is a speech token, however, ITD sensitivity in the same subjects can be very poor, with thresholds near 2 ms (Laback et al., 2004). Several factors may impact the sensitivity to the speech token. One may be the relatively low frequency of the speech envelope. For continuous speech, most of the energy in the envelope is at modulation frequencies below 30 Hz, with the range of 1 to 7 Hz being the most prominent (Elliott and Theunissen, 2009). Speech intelligibility is also mostly dependent on low modulation frequencies. Excellent speech intelligibility is maintained when envelopes are low-pass filtered to frequencies as low as 16 Hz (Drullman et al., 1994), and methods that analyze the modulation spectrum to predict speech reception in noise perform well even when restricting the analysis to frequencies below 30 Hz (Houtgast and Steeneken, 1985).

BiCI listeners have shown sensitivity to ITD presented in the envelope of high-rate pulse trains when stimuli are presented directly to a single electrode pair. However, to date, these studies have only measured ITD sensitivity for segments of the modulation spectrum above that of the speech envelope. For sinusoidal modulation, the lowest modulation frequencies studied are 50 Hz (van Hoesel and Tyler, 2003) or 100 Hz (van Hoesel, 2007; van Hoesel et al., 2009). Other studies measured ITD sensitivity for trapezoidal modulation using fixed frequencies of 12 Hz (Majdak et al., 2006) and 27 Hz (Laback et al., 2011), but the slopes of the trapezoid corresponded to higher sinusoidal frequencies. Little is known about ITD sensitivity for sinusoidal modulation frequencies below 25 Hz, even though this is the range most important for speech. In Experiment 2a we measured ITD sensitivity for sinusoidal modulation at frequencies (f_m) between 4 and 500 Hz. These measurements, especially for f_m between 4 and 16 Hz, are important when considering whether speech envelopes are able to deliver ITD information useful for binaural benefit. The results of Experiment 2a found a substantial dependence of ITD sensitivity on SAM f_m. Experiment 2b measured additional ITD thresholds using modified versions of the SAM waveform. The motivation was to investigate which characteristic of the SAM waveform correlated with the observed ITD sensitivity.

Finally, Experiment 2c examined the effect of carrier-rate on envelope ITD sensitivity. It has been argued that a pulse rate of 5 times the modulation frequency is required to adequately represent the envelope to CI listeners (Wilson, 1997), and modeling results indicate this ratio may need to be even higher to prevent the carrier from interfering in envelope ITD detection (Colburn et al., 2009). In tension with the evidence recommending higher pulse rates is the physiological evidence showing that envelope ITD sensitivity becomes poorer at higher carrier rates in the cat IC (Smith and Delgutte, 2008). Experiment 2c sought to determine if carrier-rate contributes to the rate-limit for envelope ITD sensitivity at higher modulation frequencies when the ratio of f_c /f_m drops to five or less. This was assessed by measuring envelope ITD thresholds using a 4640-pps carrier at the modulation frequencies of 50, 100, 200, and 500 Hz and comparing within-subject results to 1000-pps thresholds.

EXPERIMENT 1: ITD ENCODED IN THE FINE STRUCTURE OF SAM PULSE TRAINS

Methods

Subjects

Six subjects participated in Experiment 1. All lost hearing post-lingually. Relevant binaural information and the most recent single-syllable word score are shown in Table Table I.. There were no performance-based inclusion criteria. All subjects were implanted bilaterally with Advanced Bionics CII or 90 K implants. Subject C340 received both implants simultaneously during a single surgery. The other five received their second implant after at least 1 yr of monaural implant use. All subjects had at least 9 months of binaural CI experience at the beginning of data collection. Testing was conducted at a series of monthly sessions determined by the subjects' schedules. For a few of the subjects (C109 and C128), the sessions stretched over a period greater than 1 yr. When this occurred, conditions were repeated to check consistency and verify the absence of longitudinal changes.

Table I.

Subject information. Single syllable word scores are CNC or NU6.

	Age at onset of profound hearing loss (years)		Age at onset of CI use (years)		Binaural deprivation (years)	Bilateral CI experience (years)	Binaural single-syllable word score	ITD threshold Unmodulated, 100-pps train
Subject	L	R	L	R
C94	54	49	63	54	14	0.8	80%	67 μs
C109	44	44	50	48	6	6	86%	107 μs
C128	25	25	39	36	14	5	82%	74 μs
C278	34	34	44	46	12	3	84%	764 μs
C299	43	40	48	45	8	1	68%	106 μs
C340	63	63	65	65	2	2.5	86%	197 μs

Open in a new tab

Stimuli

Custom interface hardware [Clarion Research Interface 2 (CRI2), Advanced Bionics] was combined with custom software to control the implanted stimulators directly; clinical speech processors were not used. The CRI2 synchronizes stimuli between the two implants to within less than 1 μs, with an ITD resolution of 13.5 μs. During testing, signals were calculated on a personal computer and transmitted to the implant via the CRI2. SAM current waveforms were generated by multiplying a carrier pulse-train by a modulator of the form

S (t) = I_{C} \cdot (1 - \cos (2 π f_{m} t)),

(1)

where f_m is the modulation frequency and I_C is the current amplitude determined for each electrode of a binaural pair (see Sec. 2A3). Carrier pulses-trains were made up of 27-μs-per-phase biphasic pulses at a carrier frequency (pulse rate) f_c. In Experiment 1, carrier frequency f_c was 1000 pps and modulation frequency f_m was 50 Hz. These carrier and envelope rates were chosen to approximate the parameters used by Smith and Delgutte (2008) in the cat physiological experiment. The 1000-pps carrier rate is also typical of those used in commercial CIS processors. Modulation depth was 100% and all stimuli were presented on a linear current scale, i.e., without compression or logarithmic mapping. The stimulus duration was 300 ms and stimuli always contained an integer number of modulation periods. As shown in Fig. 1a, the modulator of Eq. 1 results in a gradual, non-abrupt onset. No other ramp or windowing was applied, so the onset and offset are determined solely by the shape of the modulator, which is dependent on f_m.

Schematic illustrations of the stimuli used in Experiment 1. (a) Raised cosine, 100% modulation as generated per Eq. 1. (b) Encoding interaural time delay. In Carrier-Only coding, the envelopes in the two ears are identical and the biphasic carrier-pulses are delayed (ITD_envelope = 0). In Envelope-Only coding, the pulses are coincident in time and only their amplitude modulation is delayed (ITD_carrier = 0). Whole-Wave encoding delays the entire lagging waveform. It is the only condition in which the envelope and the carrier ITD agree, provided the ITD does not exceed one-half the carrier period. (c) Subjects who showed sensitivity to the modulated Carrier-Only waveform were also tested with the un-modulated Carrier-Only condition. In this condition envelopes have constant amplitude, except for 50-ms onset and offset ramps. The envelope ITD = 0; ITD is encoded only in the carrier pulses.

Figure 1b illustrates interaural delays of the envelope (ITD_envelope) and of the carrier (ITD_carrier) for a SAM waveform. Experiment 1 measured sensitivity to three ITD coding conditions: Carrier-Only, Envelope-Only, and Whole-Wave. In the Carrier-Only condition, ITD_envelope is fixed at zero and the ITD is only encoded in the carrier pulses. This is one condition to which Smith and Delgutte (2008) found sensitivity to the carrier ITD. It provides a conservative measurement of carrier ITD sensitivity because there is no possibility of contribution by the envelope ITD. In fact, sensitivity to the carrier must overcome that of the zero-ITD envelope in order to be detected. Carrier-Only encoding may also present ambiguous ITD information when the delay approaches one-half the pulse period. For our 1000-pps carrier, ITDs between 500 and 1000 μs could appear as sub-500 μs ITDs leading in the opposite ear. To avoid presenting these ambiguous cues, the Carrier-Only tests limited the ITD magnitude to 450 μs (but see results for a further analysis of this issue). In the Envelope-Only condition, only the envelope is delayed. ITD_carrier is always fixed at zero, that is, the interaural carrier pulses are always coincident in time. The Whole-Wave condition delays the entire SAM waveform so that both the envelope and the carrier are delayed. Whole-Wave tests did not limit the ITD at 450 μs. When the ITD is less than 500 μs, the Whole-Wave envelope and carrier ITD are in agreement; however, for ITDs between 500 and 1000 μs, Whole-Wave envelope and carrier ITD may conflict, again because of the 1000-μs period of the carrier. For 5 of 6 subjects, Whole-Wave sensitivity was good enough that tests never presented an ITD greater than 500 μs. For subject C278, ITDs greater than 500 μs were presented.

Finally, because the hypothesis being tested is that modulation improves the ITD sensitivity to high-rate carrier pulses, thresholds were measured to un-modulated 1000-pps trains as a control condition against which to compare the modulated results. Figure 1c shows the un-modulated Carrier-Only control condition. All pulses were at constant amplitude except for linear, 50-ms onset and offset ramps meant to degrade any onset ITD cue, and the ramped-envelope ITD was fixed at zero. Only subjects who showed some sensitivity to modulated high-rate pulse trains were tested with the un-modulated signal.

Electrode pair and stimulation level

Stimuli were presented to a single bilateral electrode pair chosen as the most ITD-sensitive of all tested for the subject. The experimenter first chose one basal and one apical processor-matched pair. Processor-matched pairs are those that in daily use receive stimulation from the same frequency analysis-channel of their respective bilateral speech processors. ITD thresholds were then measured for the apical and basal pairs using a stimulus which produces the lowest ITD threshold in most subjects: 300-ms, un-modulated pulse-train of 50 or 100 pps. Processor matched pairs are often, but not always, the most sensitive for ITD (Long et al., 2003; Poon et al., 2009), therefore additional ITD thresholds were measured for candidate pairs comprising electrodes with slight interaural mismatches (up to four electrodes) relative to the processor matched locations. The electrode pair with the best ITD sensitivity of all pairs tested was selected as the subject's test pair. For most subjects, the pair with the best sensitivity was comprised of electrodes at the same position along the electrode array and corresponded to a processor-matched pair; however, mismatches of up to one electrode did occur. For three of the six subjects, the bilateral electrode pair with best ITD sensitivity was at the apical end of the electrode array; for three subjects this pair was near the base.

For each electrode pair a pair of current amplitudes (I_{C_}_left, I_C_{_right}) was determined using the following three step procedure. (1) The 50-Hz, 1000-pps SAM waveform was applied to each ear alone (monaurally) and the amplitudes adjusted until the subject judged the loudness comfortable in each ear and equal across ears. (2) Stimuli were then played simultaneously (binaurally) and amplitudes adjusted until the loudness was “most-comfortable,” the level the subject judged comfortable for listening during long periods of testing. (3) The subject reported the location of the sound image and adjustments were made to the amplitude in either ear until the image was centered in the horizontal dimension. Fusion was not used as a criterion for determining current amplitudes but subjects verified informally that all sounds were fused before the centering procedure began. Once determined, current amplitudes were fixed and used for all modulated conditions of Experiment 1. For the un-modulated condition, the current amplitude was initially set equal to the root-mean-square (rms) level of the subject's modulated stimuli and then adjusted if necessary to produce a centered auditory image.

Psychophysical procedure

ITD sensitivity was measured using a lateralization task to determine the psychophysical threshold for detection of an interaural time delay. Thresholds were measured by a 2-down/1-up adaptive procedure which converges to a 70.7% correct level (Levitt, 1971). A single trial consisted of two 300-ms sounds presented in sequence and separated by a 300-ms silence. The interaural delay of the first stimulus was always zero, and the interaural delay of the second stimulus was the magnitude of the current ITD test value. The ITD of the second stimulus was produced by delaying either the left or right ear signal, with the delayed side chosen at random with equal probability. The subjects were instructed to indicate whether they heard the second sound move to the left or right of the first. Subjects entered their responses by a keyboard and received correct-answer feedback after every trial. The instructions encouraged the subject to compare the second stimulus to the first stimulus; however, there is evidence that subjects learn to ignore the first reference trial and treat this task as a single-interval detection task (Hartmann and Raked, 1989). In this case reported thresholds correspond to a d′ value of 0.54. Threshold values were adjusted accordingly when compared with studies specifying sensitivity at other levels of d′.

Before each test, the subject was presented with multiple example trials of both left-ear leading and right-ear leading ITD. These trials familiarized the subjects with the condition being tested and were also used to determine a starting ITD magnitude for the adaptive task at which subjects could hear the second stimulus lateralized. For the Carrier-Only condition, subjects rarely reported hearing the second stimulus lateralized; in these cases a starting ITD of 400 μs was used. The adaptive procedure used a logarithmic step size which adjusted the ITD magnitude up or down by 2 dB at each reversal (Saberi, 1995). Each test consisted of 14 reversals and a threshold was calculated as the geometric mean of the last 8 reversals (a geometric mean is appropriate because of the logarithmic step size). For Experiment 1, conditions were tested in pseudo-random order. At least 4 and as many as 13 test runs were completed for each condition and the arithmetic mean of all runs were reported.

All subjects performed a minimum practice period of at least 2 h of ITD testing before any results were included as data. Subjects C128, C340, and C109 have participated in regular testing for 2 to 8 yrs and completed at least 20 h of ITD testing before this experiment.

Results

Carrier-Only ITD sensitivity

Figure 2 shows ITD thresholds for six subjects in three conditions. For the Carrier-Only condition, five of the six subjects were not able to complete the adaptive task, as depicted in Fig. 2 by an unfilled dashed bar labeled “no sensitivity.” This result was recorded if the adaptive procedure continued to track upward without reaching an ITD value at which the subject could lateralize at the 71% threshold level. When this occurred, the ITD test value eventually clipped at the 450-μs limit imposed for the Carrier-Only condition. The test was typically allowed to run until completion to allow the subject an opportunity to detect a lateralization cue using the provided feedback. Even for a condition with no sensitivity, subjects occasionally produced correct responses by chance. These caused the adaptive procedure's test ITD value to bounce between the 450-μs maximum and one or two 2-dB step-sizes below. Although the test eventually completed, the result was not recorded as evidence of sensitivity when several instances of clipping occurred. Therefore a result of “no sensitivity” for the Carrier-Only condition more precisely means that no threshold below about 400 μs could be measured.

Individual subject ITD thresholds for three methods of ITD encoding. Dashed, unfilled bars labeled “*no sensitivity*” signify that a subject was not able to complete any adaptive test for Carrier-Only encoding, which is the case for all subjects except C299. The height of these unfilled bars serves as a reminder that given the ITD magnitude constraint for the Carrier-Only condition, only ITD thresholds below 450 μs were detectable. Error bars are standard deviations of multiple test runs. C299 alone showed a significant difference (after *post hoc* correction) between thresholds for the Envelope-Only and Whole-Wave conditions.

One subject, C299, was able to complete the adaptive threshold measurement with ITD encoded only in the carrier. Each of C299's multiple Carrier-Only adaptive tracks reliably converged to an ITD value below 400 μs, resulting in a mean Carrier-Only threshold of 133 μs. This is comparable with C299's mean threshold for un-modulated, low rate (100-pps) pulse trains, which is 106 μs (see Table Table I.).

The adaptive procedure assumes monotonic performance as a function of ITD, which may not be the case as the ITD tested approaches one-half the pulse period. For example, Majdak et al. (2006) measured carrier ITD sensitivity as the ITD was varied throughout the entire pulse period. For conditions when their subjects could lateralize using the pulse-train ITD, the percentage of correct lateralizations increased with ITD until reaching a maximum at one-quarter of the pulse period and then declined as the ITD approached one-half the period, where the ear receiving the leading pulse becomes ambiguous. If our subjects were similarly sensitive to the carrier ITD, their performance would vary non-monotonically, thus violating one of the assumptions for use of an adaptive method and possibly obscuring ITD sensitivity. As an alternative means of searching for carrier ITD sensitivity, lateralization performance was plotted in the form of psychometric functions. These functions were then examined for patterns that would indicate sensitivity to carrier ITD not revealed by the adaptive tests.

In Fig. 3, percent correct for lateralization is shown as a function of ITD magnitude. Each column corresponds to one ITD encoding condition and each row contains the data for one subject. Data in Fig. 3 were obtained by pooling the trials from the multiple adaptive test runs completed by the subjects. Trials were grouped on the basis of ITD magnitude into 100-μs wide bins centered on integer multiples of 100 μs and percent correct was calculated for the pooled trials. In order to increase the number of trials at under-sampled ITDs, or at ITDs of interest but not visited by the adaptive procedure, subjects performed additional blocks of trials at constant ITD magnitudes. The results were added to the pooled trials from the adaptive runs. Gray shaded areas surrounding the psychometric functions indicate the 95% confidence intervals, whose widths are strongly influenced by the number of trials at each ITD value. Filled symbols represent ITD values at which the percent correct was significantly different (p < 0.05) than the chance performance level of 50%. Open symbols represent performance not significantly different from chance. Confidence intervals and significant differences for the psychometric function were computed from maximum likelihood estimates using a binomial distribution.

Psychometric functions for six subjects measured using each of three ITD encoding methods. Horizontal dashed lines in each panel indicate chance performance (50% correct). Filled circles represent ITD values where percent correct is significantly different than chance performance (p < 0.05). Performance at open circles is not significantly different than chance. Gray shaded regions are 95% confidence intervals. For C299 in the Carrier-Only condition, square symbols connected by a dotted line show performance when the stimulus was an un-modulated 1000-pps pulse train. In this same panel, asterisks indicate a significant improvement for modulated Carrier-Only versus un-modulated Carrier-Only conditions.

The Carrier-Only psychometric functions show differing patterns of sensitivity among users. The curves can be divided into three main sub-types. The first describes subjects with no Carrier-Only ITD sensitivity. In this group are subjects C94, C278, and C340. Curves for these subjects are mainly flat at approximately 50% correct (chance level) with no significant ability to lateralize at any ITD. (Subject C340 did show a single, dramatic jump in percent correct at 900 μs; however, as can be seen by the very wide confidence intervals for this subject's curve, the number of trials for this ITD is low, and the subject became unavailable before additional blocks could be measured.)

A single subject produced a psychometric function that clearly indicates sensitivity to the carrier. C299's function is roughly periodic with the period of the carrier pulses: Sensitivity grows to a level significantly above chance as ITD increases; then decreases to chance level as ITD reaches one-half the pulse period and the leading ear becomes ambiguous; and finally drops significantly below chance at ITD magnitudes greater than [1/2] the pulse period, because at these delays the nearest interaural pairs indicate a leading ear opposite that of the actual test ITD. Subject C299's periodic psychometric function is therefore consistent with his adaptive results showing significant sensitivity to the 1000-pps carrier.

To determine whether amplitude modulation played a role in producing Carrier-Only sensitivity, C299 was tested with an un-modulated 1000-pps pulse train [Fig. 1c] using blocks of trials with constant ITD at 250, 500, and 750 μs. Results are shown by the square symbols and dotted line in C299's Carrier-Only plot. At 250 μs the percent correct for the modulated Carrier-Only signals are significantly greater than for the un-modulated Carrier-Only trials (p < 0.05, indicated by asterisks in the C299 Carrier-Only plot). At 750 μs, modulated scores are significantly less than the un-modulated carrier, indicating better sensitivity to the reversed ITD cue. There is some sensitivity to the ITD of un-modulated pulses at 250 μs (62%, p < 0.05 different than chance) but not at 750 μs. The significantly greater sensitivity shown with the modulated carrier indicates that, for this subject, amplitude modulation is responsible for improving ITD sensitivity to high-rate pulses.

The psychometric functions of C109 and C128 fall into a third sub-type. Each shows lateralization significantly different than chance for a single ITD value, and the significance was confirmed by repeat testing with a block of 80 to 100 additional trials. C128's percent correct at 300 μs is 62%. C109's percent correct at 400 μs is 40%, i.e., below chance. While the data for these two subjects do indicate some carrier ITD sensitivity, it is isolated, not always consistent with the delivered cue, and insufficient to reach a modest threshold criterion for completion of an adaptive task.

Envelope-Only and Whole-Wave ITD sensitivity

Figure 2 shows that all subjects demonstrated some degree of sensitivity to ITD in the Envelope-Only and Whole-Wave conditions. The arithmetic group-mean for the Envelope-Only threshold was 277 μs (range 135 to 683 μs) and similar to the Whole-Wave group mean of 250 μs (range 60 to 571 μs). The largest individual difference between the Envelope-Only and Whole-Wave conditions is for subject C299, whose Envelope-Only ITD threshold of 250 μs is more than 4 times larger than his Whole-Wave ITD threshold of 61 μs.

The individual ITD test results were combined into an unbalanced, two-way analysis of variance (ANOVA) with subject and condition as factors. Because the within-subject variances were correlated with subject means, a log transformation was first applied to the test results to better satisfy the ANOVA requirement for homogeneous variances. Both main effects and interaction effects were analyzed. When subject interaction effects were taken into account, there was a significant difference between the Envelope-Only and the Whole-Wave condition (F[1,76] = 5.23, p < 0.025). A post hoc analysis using Tukey's honestly significant difference (HSD) criterion showed that C299 had a highly significant difference for Envelope-Only versus Whole-Wave ITD sensitivity (p < 0.001). No other subject showed a significant difference between the conditions. Accordingly, when C299 was removed from the group and the ANOVA re-computed, there was no significant effect of condition for the remainder of the group (F[1,65] = 0.23, p < 0.63).

Finally, the Whole-Wave and Envelope-Only psychometric functions plotted in Fig. 3 show a generally monotonic increase in performance with ITD magnitude and demonstrate broad agreement with the adaptive threshold results.

Discussion

Modulation did not improve high-rate pulse ITD sensitivity in most subjects

Experiment 1 showed that for five out of six subjects, amplitude-modulation did not produce sensitivity to the ITD of a high-rate carrier. None of these five subjects were able to complete the adaptive task using ITDs below 450 μs in the Carrier-Only condition. These subjects also performed no better in the Whole-Wave condition as compared to the Envelope-Only condition, which is further evidence that they did not benefit from the encoding of ITD in the carrier. The psychometric functions of these subjects generally confirmed the adaptive results: Lateralization of the 1000-pps pulse train in the Carrier-Only condition was generally at chance level. Two of these five subjects did show a small but significant change in lateralization at a single ITD magnitude, but the performance did not reach the 71% criterion for sensitivity.

The results from one of the six subjects differed distinctly from those of the others. C299 consistently completed the adaptive task for the Carrier-Only condition, displaying thresholds for modulated 1000-pps carriers nearly equal to those measured with un-modulated 100-pps pulses, and his sensitivity was confirmed by an analysis of his psychometric function. Further evidence that C299 benefitted from ITD encoded in the carrier was the dramatic improvement in ITD sensitivity from the Envelope-Only to the Whole-Wave condition, with threshold dropping by more than fourfold, from 250 to 61 μs. This large difference may be understood by recalling that the Envelope-Only condition does still present a carrier ITD cue; however, it is fixed at 0 μs. For a subject with carrier ITD sensitivity, this replaces an informative ITD cue with one likely to interfere with lateralization using the envelope ITD.

When the ITD was encoded only in the carrier pulses of un-modulated pulse trains, C299 showed little sensitivity to this signal. Since the modulated pulse trains produced ITD sensitivity that was significantly greater than the un-modulated condition, for this single subject it appears that amplitude modulation significantly improved ITD sensitivity to the high-rate carrier.

Several psychophysical studies with human BiCI listeners have shown that ITD sensitivity to modulated high-rate pulse trains is much improved compared to un-modulated trains at the same high rates (van Hoesel and Tyler, 2003; Majdak et al., 2006; van Hoesel, 2007; van Hoesel et al., 2009). The question asked in the current experiment is whether, for pulses near 1000 pps, this improved ITD sensitivity is due to the envelope, to improved carrier ITD sensitivity in the presence of modulation, or to a combination of the two. By explicitly isolating the carrier and envelope ITD sensitivity, this study demonstrated that for five of the six subjects at pulse rates of 1000 pps, SAM-ITD sensitivity was due to the envelope alone. For a single subject, sensitivity to the carrier ITD existed even when the envelope ITD was zero. The effect of the envelope was important, however, as modulation significantly improved the carrier ITD sensitivity. For this subject, ITD sensitivity was best when the carrier and envelope both encoded ITD, indicating the information was combined, and the subject's thresholds for the Whole-Wave condition fell to levels below those for 100-pps un-modulated pulse trains.

It would be of interest to understand why subject C299 alone demonstrated a dramatic improvement in carrier ITD sensitivity. Previous studies have reported exceptional BiCI subjects with ongoing ITD-sensitivity to pulse rates of 800 or 1000 pps (Laback et al., 2007; van Hoesel et al., 2009). These same subjects also had exceptional ITD sensitivity to pulse rates of 100 pps, where thresholds were often below 50 μs. C299, however, shows good but not exceptional ITD sensitivity at lower un-modulated (100 pps) or modulated (100 Hz) rates, with thresholds usually at the median of the six subjects studied here (Table Table I. and Fig. 4). It seems that C299 is exceptional in acquiring 1000-pps ITD-sensitivity as a result of modulation, and is not simply a subject displaying an outstanding overall ITD-sensitivity.

ITD threshold plotted as a function of modulation frequency for five subjects. Error bars are standard deviations for multiple runs. Shaded regions indicate ITDs greater than naturally experienced by humans.

Comparison with other human BiCI studies

No previous psychophysical studies directly address whether modulation improves the carrier ITD sensitivity for pulses near 1000 pps, but previous results do provide relevant evidence. van Hoesel and Tyler (2003) measured thresholds to ITD encoded in the whole waveform using an 800-pps carrier SAM modulated at 50 Hz. The 194-μs mean threshold for 3 subjects is somewhat higher than the mean of 136 μs measured for three subjects using a 6000-pps carrier modulated at 100 Hz (van Hoesel, 2007), but compares closely with the mean of 184 μs for 6 subjects also using carriers of 6000 pps (van Hoesel et al., 2009). (For these comparisons thresholds were scaled to equal levels of d′. It is acknowledged that comparisons of absolute thresholds across studies can be confounded by subject or methodological differences.) Since ITD sensitivity at rates near 6000 pps is presumed to be poor and to not contribute to BiCI lateralization, the similar thresholds in human BiCI users for 800 and 6000-pps carriers indicate the 800-pps carrier did not contribute to ITD sensitivity. Conversely, van Hoesel and Tyler (2003) also measured Envelope-Only ITD threshold with 1 of their subjects using the 800-pps carrier. The subject's Envelope-Only ITD threshold was more than twice their Whole-Wave threshold (290 vs 120 μs). The poorer Envelope-Only threshold indicates sensitivity to the ITD of the 800-pps carrier. Since the subject showed no sensitivity to ITD in an un-modulated, 800-pps pulse train, the improved Whole-Wave performance is also consistent with modulation improving sensitivity to ITD encoded in the carrier. In another study, lateralization was measured with pulse trains at rates between 100 and 800 pps modulated by a 12.5-Hz trapezoid (Majdak et al., 2006). Whole-Wave and Envelope-Only ITD sensitivity were measured and compared within the same four subjects. With 800-pps carriers, 2 of 4 subjects' lateralization with the Whole-Wave ITD condition were significantly better than with the Envelope-Only condition. Since sensitivity to the 800-pps carrier was not compared to an un-modulated condition, the possibility exists that the two subjects showing sensitivity at 800 pps were simply demonstrating an existing ITD sensitivity to high-rate pulses, irrespective of modulation. The existing data, like our results, is not consistent across subjects in terms of the impact of amplitude modulation on sensitivity to carrier ITD. Generally, modulation does not improve carrier ITD sensitivity, but like the one subject in the present study, exceptional individuals do show a substantial improvement when ITD is encoded in the carrier.

Comparison with physiological results

For five of the six subjects, sensitivity to the carrier ITD of high-rate pulse trains was not improved by amplitude modulation. This finding differs with the single neuron data from the cat IC. A number of differences between the two experiments may explain the discrepancy. In the cat physiological experiments, neurons often only displayed ITD sensitivity for a narrow range of stimulus level, and each unit was tested at the level producing optimal ITD tuning. In the human subjects, the stimulation level could not be optimized for ITD tuning on a fiber-by-fiber basis and may result in only a fraction of the neurons operating at the optimal level to encode ITD. More generally, even when individual neurons show sharp ITD tuning, at a later stage of auditory processing their responses must be combined (or pooled) to form the percept of lateral position. This process is not precisely understood in normal or electrical hearing. It may be that some aspect of the population ITD coding is abnormal in both the acutely deafened cats and in the human subjects, but its impact only becomes apparent in behavioral measures such as the human psychophysics.

BiCI physiological studies also differ from human behavioral data in that anesthetized cats show a lower rate-limit for ITD sensitivity to un-modulated pulses [10 to 200 pps (Smith and Delgutte, 2007)] as compared to awake human subjects [250 to 600 pps (van Hoesel, 2007)]. A preliminary study (Chung et al., 2012) has shown that anesthesia itself may be responsible for this difference in rate limit, as awake rabbits exhibit rate limits that are substantially higher than anesthetized cats or rabbits and more similar to those seen with human psychophysics. Since the modulator's ability to restore responses to the high-rate carrier is presumed to depend on an interaction between the modulator frequency and the carrier rate (Smith and Delgutte, 2008; Colburn et al., 2009), a different combination of these variables may be necessary to produce carrier sensitivity in awake, BiCI subjects than was found to be effective in anesthetized cats.

Finally, acutely deafened cats were used in the physiology study whereas it has been shown that congenitally deafened cats have poorer ITD tuning as compared to their acutely deafened counterparts (Hancock et al., 2010b). While the relatively late onset of hearing impairment in our human subjects suggests the development of normal binaural hearing, their 2 to 14 yrs of binaural deprivation may have resulted in deterioration of their binaural circuits as compared to the acutely deafened animals. This hypothesis is weakened by (1) more recent data which show that ITD tuning from long-term deafened cats is closer to that of acutely deafened cats than it is to that of congenitally deafened cats (Hancock et al., 2010a) and (2) results showing no indication that the length of binaural deprivation degrades ITD sensitivity provided it is preceded by a period of normal binaural development (Litovsky et al., 2010). While the effect of binaural deprivation cannot be ruled out, it appears unlikely to explain the difference between results from our human subjects and the physiological results measured in cat.

EXPERIMENT 2: FREQUENCY DEPENDENCE OF ENVELOPE ITD SENSITIVITY

In Experiment 1, most subjects showed no sensitivity to ITD encoded in a modulated, high-rate carrier, but all subjects showed sensitivity to the ITD encoded in the envelope. Experiment 2 measured the envelope sensitivity for SAM waveforms whose modulation frequencies spanned the modulation spectrum of speech. Several aspects of envelope ITD sensitivity were examined. After preliminary pilot studies, three experiments (Experiments 2a, 2b, and 2c) were designed. Threshold measurements for all three experiments were then performed concurrently with runs intermingled in pseudo-random order. At least three threshold measurements were made for each condition reported. Five of the research subjects from Experiment 1 participated in Experiment 2a. Of these five, four also completed Experiments 2b and 2c. Electrode pair selection and the adaptive threshold procedure were identical to those used in Experiment 1.