Skip to main content
JARO: Journal of the Association for Research in Otolaryngology logoLink to JARO: Journal of the Association for Research in Otolaryngology
. 2011 Jan 27;12(3):375–393. doi: 10.1007/s10162-011-0257-4

Relationship Between Behavioral and Physiological Spectral-Ripple Discrimination

Jong Ho Won 1,2,3,, Christopher G Clinard 3,4, Seeyoun Kwon 5,6, Vasant K Dasika 1, Kaibao Nie 1, Ward R Drennan 1, Kelly L Tremblay 3, Jay T Rubinstein 1,2
PMCID: PMC3085690  PMID: 21271274

Abstract

Previous studies have found a significant correlation between spectral-ripple discrimination and speech and music perception in cochlear implant (CI) users. This relationship could be of use to clinicians and scientists who are interested in using spectral-ripple stimuli in the assessment and habilitation of CI users. However, previous psychoacoustic tasks used to assess spectral discrimination are not suitable for all populations, and it would be beneficial to develop methods that could be used to test all age ranges, including pediatric implant users. Additionally, it is important to understand how ripple stimuli are processed in the central auditory system and how their neural representation contributes to behavioral performance. For this reason, we developed a single-interval, yes/no paradigm that could potentially be used both behaviorally and electrophysiologically to estimate spectral-ripple threshold. In experiment 1, behavioral thresholds obtained using the single-interval method were compared to thresholds obtained using a previously established three-alternative forced-choice method. A significant correlation was found (r = 0.84, p = 0.0002) in 14 adult CI users. The spectral-ripple threshold obtained using the new method also correlated with speech perception in quiet and noise. In experiment 2, the effect of the number of vocoder-processing channels on the behavioral and physiological threshold in normal-hearing listeners was determined. Behavioral thresholds, using the new single-interval method, as well as cortical P1-N1-P2 responses changed as a function of the number of channels. Better behavioral and physiological performance (i.e., better discrimination ability at higher ripple densities) was observed as more channels added. In experiment 3, the relationship between behavioral and physiological data was examined. Amplitudes of the P1-N1-P2 “change” responses were significantly correlated with d′ values from the single-interval behavioral procedure. Results suggest that the single-interval procedure with spectral-ripple phase inversion in ongoing stimuli is a valid approach for measuring behavioral or physiological spectral resolution.

Keywords: spectral-ripple discrimination, behavioral and physiological measure, cochlear implant, auditory change response, auditory cortex

Introduction

Spectral-ripple stimuli consist of rippled spectra noise signals in which the frequency positions of the spectral peaks and valleys alternate (Supin et al.1994). Over the past few years, spectral-ripple discrimination has been widely used in a variety of behavioral experiments with normal-hearing (NH), hearing-impaired (HI), and cochlear-implant (CI) listeners (e.g., Supin et al.1994; Henry et al. 2000, 2005; Won et al.2007; Litvak et al.2007; Saoji et al. 2009; Won et al.2010). For example, the ability of subjects to discriminate a reversal in the phase of the rippled shape (Henry et al. 2000, 2005; Won et al.2007), and the ability to differentiate between a spectral-ripple stimulus and white noise (Litvak et al.2007) have been evaluated. Furthermore, spectral-ripple discrimination ability has been shown to correlate significantly with vowel and consonant recognition in quiet (Henry et al.2005; Litvak et al.2007), word recognition in noise (Won et al.2007), and music perception abilities including complex-tone pitch direction discrimination, melody recognition, and timbre recognition (Won et al.2010) for CI users. Drennan et al. (2010) used spectral-ripple discrimination to evaluate different CI sound encoding strategies, and Faulkner et al. (2010) used spectral-ripple discrimination for auditory training in CI listeners to improve the perception of spectral details in speech and music sounds. Because Won et al. (2007) showed that test–retest reliability for the spectral-ripple test was good, and with minimal learning effects, the spectral-ripple test appears to be a stable measure that could be used in research as well as the clinic. These previous studies suggest that spectral-ripple discrimination is an efficient and valuable tool for assessing spectral envelope sensitivity in CI users in a way that can predict the hearing performance.

An additional benefit of spectral-ripple testing is that it is a non-linguistic psychophysical measure that could be helpful to researchers and clinicians when testing infants and children. Such information could potentially be used to improve CI signal processing or (re)habilitation strategies. However, the multiple-interval, adaptive forced-choice (AFC) discrimination procedures that have been typically used with adult listeners (e.g., three interval, three-AFC procedure) are poorly suited for behavioral studies with infants and toddlers. The unchanging stimuli of a multiple-interval procedure are also unsuitable for measuring spectral-ripple discrimination electrophysiologically via auditory evoked potentials. For these reasons, the goal of the present study was to develop a new method of measuring spectral-ripple threshold. The new method uses a single-interval, yes/no paradigm with the method of constant stimuli. On a trial, either a “change” or a “no-change” stimulus was presented. The stimulus paradigm and physiological analysis techniques were modeled after those used by Ross et al. (2007). This single-interval method can be extended with minimal changes to test preverbal children using a conditioned head turn or observer-based procedure (e.g., Eisenberg et al.2007; Dasika et al.2009). Moreover, this type of stimulus presentation is also useful because the same stimuli and presentation mode used for behavioral perceptual testing are also feasible for studying the neural detection of spectral changes.

Electrophysiological measures have long been used to assess the integrity of the auditory system following implantation (for review, see Abbas and Brown 2000; Martin et al.2008) as well as to assist with CI mapping (Brown et al.2000). Electrophysiological measures can be especially helpful in assessing the integrity of the auditory system in infants and children when behavioral testing is challenging or not possible. For these reasons, the present study attempted to determine if electrophysiological responses could be evoked by spectral-ripple stimuli, and estimated the physiological capacity of an individual’s auditory system in detecting spectral changes within spectral-ripple stimuli. The P1-N1-P2 complex, a cortical auditory evoked potential, is especially relevant for this purpose because the response is robust enough to reliably identify in individuals. The P1-N1-P2 response may be elicited by the onset or offset of, or changes within, an acoustic stimulus (Spoor et al. 1969; Jerger and Jerger 1970; McCandless and Rose 1970). These P1-N1-P2 responses provide information about sound processing up to the level of the cortex, and can closely approximate behavioral discrimination thresholds (for review: Martin et al.2008). For this reason, cortical auditory evoked potentials have been used extensively with CI listeners in response to pulse trains (Ponton et al.1996) or speech sounds (Friesen and Tremblay 2006), as well as vocoded speech in normal-hearing listeners (Friesen et al.2009).

The goals of the current study are (1) to determine if the previously used three-AFC procedure yielded results that were correlated with the new, single-interval, yes/no procedure; (2) to determine the feasibility of recording cortical P1-N1-P2 responses using the spectral-ripple stimuli; and (3) to determine if these physiological responses are related to behavioral spectral-ripple discrimination. In experiment 1, the proposed single-interval, yes/no procedure with the method of constant stimuli was conducted with adult CI listeners. Results were compared to data from the previously established spectral-ripple discrimination test which used a three-interval, three-AFC, adaptive procedure (e.g., Won et al.2007). In experiment 2, the single-interval, yes/no spectral-ripple discrimination procedure was used to obtain behavioral and physiological measures in NH listeners. The effect of the number of vocoder-processing channels on behavioral thresholds and the P1-N1-P2 responses were examined. In experiment 3, the correlation between d′ values from the single-interval behavioral performance and the amplitudes of P1-N1-P2 change responses were examined.

Methods

Experiment 1: the development and validation of single-interval spectral-ripple discrimination

Subjects

Fourteen post-lingually deafened, generally high performing, adult CI users participated in experiment 1. They were 25–77 years old (mean = 57 years, six females and eight males). Individual CI subject information is listed in Table 1. All subjects listened to the stimuli using their own sound processor set to a comfortable listening level. CI sound processor settings were fixed so that sound was processed identically throughout all test batteries. The use of human subjects in this study was reviewed and approved by the University of Washington Institutional Review Board.

TABLE 1.

CI subject characteristics

Subject Age (years) Duration of hearing loss (years)a Duration of implant use (years) Etiology Implant type Speech processor strategy CNC word score (% correct)
S03 61 5 11 Genetic Nucleus 22 SPEAK 72
S04 62 1 3 Unknown Nucleus 24 ACE 88
S12 49 0 2 Connexin 26 MedEl Combi40+ CIS 36
S34 55 1.5 Noise exposure HiRes90K HiRes 62
S41 52 7 5 Hereditary HiRes90K HiRes 86
S42 68 5 3 Genetic HiRes90K Fidelity 120 52
S48 67 10 0.5 Unknown HiRes90K HiRes 72
S49 64 4 0.75 Hereditary HiRes90K Fidelity 120 86
S50 40 15 6 Genetic Clarion CII Fidelity 120 100
S51 56 7 6 Hereditary Clarion CII HiRes 92
S52 77 0 0.5 Noise exposure HiRes90K Fidelity 120 66
S53 63 3 7 Unknown Clarion CII Fidelity 120 96
S54 25 0.5 2.5 Unknown HiRes90K HiRes 92
S55 65 40 (L)1.4; (R)0.5 (L)Genetic; (R)Genetic (L)HiRes90K; (R)HiRes90K (L)HiRes; (R)HiRes 96

aThe duration of their hearing loss before implantation

Stimuli and procedures

  1. The single-interval, yes/no paradigm spectral-ripple discrimination test

Four hundred pure-tone frequency components were summed to generate the rippled noise stimuli. Amplitudes of the components were determined by a full wave-rectified sinusoidal envelope on a logarithmic amplitude scale. The 400 tones were spaced equally on a logarithmic frequency scale. The ripple peaks were spaced equally on a logarithmic frequency scale. Bandwidth of the stimuli was 100–5,000 Hz. The peak-to-valley ratio of the ripple spectra was 30 dB. Standard and inverted (ripple phase reversed) ripple stimuli were generated. For standard ripples, the phase of the full wave-rectified sinusoidal spectral envelope was set to zero radians; and for inverted ripples, it was set to π/2. Using standard and inverted ripples, two types of test stimuli were generated: the “standard-standard” ripple and the “standard-inverted” ripple.

For the standard-inverted case (i.e., the change stimulus), the first 2 s consisted of a standard ripple and the last 2 s contained an inverted ripple. Therefore, there was a spectral change at 2-s point. Each half of each stimulus was individually created and then concatenated. To eliminate the possibility of a loudness change at the 2-s point of the standard-inverted stimuli, the root mean square of the first (standard ripple) and the last (inverted ripple) parts were matched. After amplitude matching, the whole 4-s stimulus was ramped with 150-ms rise/fall times, and filtered with a long-term, speech-shaped filter (Byrne et al.1994). Because the concatenation could produce frequency components beyond 5 kHz at the 2-s point, a 5 kHz low-pass filter was applied to the entire 4-s stimuli. To prevent a subject from perceiving a cue due to the temporal pattern of transition from standard to inverted ripple at the 2-s point, the phase of both the first and last 2 s was randomized for every presentation using random phases for the 400 individual frequency components.

For the standard-standard case (i.e., no-change stimulus), a 4-s duration standard ripple stimulus was created and ramped with 150-ms rise/fall times, then filtered with a long-term, speech-shaped filter (Byrne et al.1994) and with a 5 kHz low-pass filter. Therefore, there was no change within the stimulus.

Figure 1 shows waveforms (upper panels) and spectrograms (middle panels) for the standard-inverted and standard-standard stimuli for the 1 ripple/octave case. The bottom row of Fig. 1 shows example waveforms for the standard-inverted and standard-standard stimuli from 1.98 to 2.02 s, showing that there is no temporal discontinuity cue, such as a sudden amplitude change, for a subject to use to detect at the midpoint of the standard-inverted stimuli. There is therefore only a spectral cue that a subject can use to detect a change within the standard-inverted stimuli.

FIG. 1.

FIG. 1

Stimuli waveforms (upper panels), spectrograms (middle panels), and zoom-in of time–domain waveforms from 1.98 to 2.02 s (lower panels) for a standard-inverted stimulus (left side) and a standard-standard stimulus (right side). The ripple density was 1 ripple/octave for both stimuli.

Figure 2 shows the CI sound processor output in response to the standard-inverted ripple stimuli using HiResolution strategy. The upper two plots show how HiResolution strategy encodes the standard-inverted ripple stimuli over the duration of stimuli. The left panel shows the output for spectral-ripple density of 1.0 ripple/octave, and the right panel shows the output for ripple density of 6.0 ripples/octave. The lower panel shows average outputs over the first 2 s (i.e., standard phase) and the last 2 s (i.e., inverted phase) of the standard-inverted stimuli. The upper left panel (1 ripple/octave) shows that there is a distinct change at the 2-s point for all electrodes, but the upper right panel (6 ripples/octave) does not show visible change at the 2-s point in the outputs for all 16 electrodes. The lower panel plots also confirm this trend. Multiple peaks and valleys are well-represented across the electrodes when ripple density is low. But at higher ripple density, the distance between the peaks and valleys is decreased, so the sound processor presents reduced spectral contrast for higher ripple density.

FIG. 2.

FIG. 2

The sound processor outputs for spectral-ripple density of 1 ripple/octave (left panel) and 6 ripples/octave (right panel) are shown. The upper panel plots show electrodograms for the standard-mixed ripple stimuli, which represent the biphasic pulses (in μA) computed by HiResolution strategy of Advanced Bionics devices. The lower panel plots show average outputs (in μA) over the duration of the standard phase (first 2 s, filled circles) and the inverted phase (last 2 s, open circles) ripple stimuli for 16 electrodes. Electrode 16 represents the highest frequency channel.

A single administration of the single-interval, yes/no paradigm spectral-ripple discrimination task involved four blocks of test trials. Sounds were presented at 65 dBA in the free field in a sound-treated booth via a single loudspeaker, positioned 1 m from the subject. Ripple stimuli were generated with 15 different densities: 0.25, 0.5, 0.707, 1, 1.414, 2, 2.828, 4, 5.657, 8, 11.314, 16, 22.627, 32, and 40 ripples/octave. The method of constant stimuli was used. Ten consecutive ripple densities from 0.25 to 8 ripples/octave were tested; however, if a subject could discriminate 8 ripples/octave stimuli above chance levels, higher ripple densities were tested. Each ripple density was presented 10 times in random order within a block: five times for standard-standard ripple stimuli and another five times for standard-inverted ripple stimuli. Therefore, a total of 20 standard-standard and standard-inverted ripple stimuli were presented across the four test blocks. Subjects performed the task using a mouse and computer screen placed in front of and to the right of them. Subjects were instructed to listen to the sound and click on an on-screen button labeled “Yes” if they thought there was a change within the stimulus; or “No” if they thought there was not a change. Percent correct and hit rate were calculated for each block. Feedback was not provided. The testing required 1 h for each subject.

Threshold was defined as the ripple density corresponding to d′ = 1. Data from both Won et al. (2007) and the present study suggest that psychometric functions for spectral-ripple discrimination generally follow an expected reverse-sigmoid shape as a function of increasing ripple density. It was desirable to accurately estimate threshold from points statistically shown to lie on the slope of the psychometric function, rather than points on either the upper or lower asymptote. Like several previous studies, confidence interval analyses were used to identify points lying on the upper asymptote, slope, and lower asymptote of the psychometric function (e.g., Buss et al.1986; Bargones et al.1995; Dasika et al.2009). At each ripple density, d′ and 80% confidence intervals for d′ were computed from aggregate hit and false alarm rates obtained from the four blocks. To avoid undefined values of d′, hit rates of 0 were converted to 0.5/Ns and hit rates of 1 were converted to 1–0.5/Ns,, where Ns was the number of standard-inverted stimuli presented at a given ripple density (i.e., 20; cf. Macmillan and Creelman 2005). Hit rate proportions of 1 or 0 were thus converted to 0.975 or 0.025. Similarly, false alarm rates of 0 or 1 were converted to 0.025 or 0.975. Eighty percent confidence intervals for d′ were estimated by using the approximation described by Gourevitch and Galanter (1967; cf. Macmillan and Creelman 2005). Eighty percent as opposed to 95% confidence intervals were used to identify a greater number of data points on the slope.

The upper asymptote was taken as the maximum performance obtained at any ripple density. If the upper limit of the 80% confidence interval of d′ of a given data point equaled or exceeded asymptotic d′, the data point was generally assumed to lie on the upper asymptote. If the lower limit of the 80% confidence interval of d′ for a given data point was lower than d′ = 0 (chance performance), the point was generally assumed to lie on the lower asymptote. Slope was estimated using all successive data points with upper and lower confidence intervals contained entirely within the interval between the upper and lower asymptotes. Because few points were sometimes identified on the slope, the upper asymptotic point corresponding to the highest ripple density and the lower asymptotic point corresponding to the lowest ripple density were also included in the slope calculation. Sensitivity (d′) versus ripple density (ripples/octave) data was fit by linear regression. Threshold was defined as the ripple density at which the regression line equaled 1 (i.e., d′ = 1).

  1. The three-interval, three adaptive forced-choice (3-AFC) spectral-ripple discrimination test

The three-interval, 3-AFC procedure used the same stimuli described by Won et al. (2007). Stimuli of 500 ms duration were generated by summing 200 pure-tone frequency components. Each 500-ms stimulus was either a standard or inverted ripple. The ripple densities included 0.125, 0.176, 0.250, 0.354, 0.500, 0.707, 1.000, 1.414, 2.000, 2.828, 4.000, 5.657, 8.000, and 11.314 ripples/octave. The spectral-ripple resolution threshold was determined using a two-up, one-down adaptive procedure, converging on 70.7% correct (Levitt 1971). Each test run began with 0.176 ripples/octave and moved in equal ratio steps of 1.414. The presentation level was roved within trials (7 dB range in 1 dB steps). Subjects were asked to click on an on-screen button that was labeled 1, 2, and 3 after the stimuli were presented. One stimulus (i.e., inverted ripple sound, test stimulus) was different from two others (i.e., standard ripple sound, reference stimulus). The subject’s task was to identify the test stimulus. Threshold for a single test run was estimated by averaging the final eight of 13 reversals. The primary dependent variable was determined by averaging the threshold from six different test runs. Feedback was not provided. The testing took 30 min for each subject.

  1. Speech reception threshold in noise test

The procedure for administering the speech reception threshold (SRT) test was the same as that previously described by Won et al. (2007), and similar to Turner et al. (2004). Subjects were asked to identify one randomly chosen spondee word out of a closed-set of 12 equally difficult spondees (Harris 1991). The spondees, two-syllable words with equal emphasis on each syllable, were recorded by a female talker. Two background noises were used: two-talker babble (one male and one female) and steady-state, speech-shaped noise. The female talker for the babble was different from the female talker for the spondees. The same background noise was used on every trial in order to eliminate variance that might arise from variability in the background stimulus. The onset of the spondees was 500 ms after the onset of the background noise. The two-talker babble and steady-state noise had duration of 2.0 s. The level of the speech was held constant at 65 dBA, while the level of the noise was tracked using a one-down, one-up procedure and 2-dB steps. Feedback was not provided. The threshold for a single test run was estimated by averaging the signal-to-noise ratio for the final 10 of 14 reversals. The primary dependent variable was the mean SRT of the six test runs.

  1. Word recognition in quiet test

Fifty consonant–nucleus–consonant (CNC) monosyllabic words (e.g., “Home, June, Pad, Sun”) recorded by a male talker were presented, from an open set, in sound-field at 65 dBA (Peterson and Lehiste 1962). Each CNC word list was randomly chosen out of 10 lists for each subject. A total percent correct score was calculated as the percent of words correctly repeated. Feedback was not provided.

Results

  1. Spectral-ripple threshold

For the single-interval procedure, the mean threshold for the 14 CI subjects was 6.16 ripples/octave with a 95% confidence interval of 1.70 ripples/octave. For the three-interval procedure, the mean threshold was 2.12 ripples/octave with a 95% confidence interval of 0.56 ripples/octave.

  1. Characteristics of psychometric function

For the three-interval procedure, Won et al. (2007) showed a psychometric function in which there was a monotonic decrease as a function of ripple density with an upper asymptote in the ability to discriminate the standard and inverted ripple stimuli. The psychometric functions constructed from the single-interval procedure showed a similar pattern. Figure 3 shows the psychometric functions obtained from the single-interval procedure method, plotting d′ as a function of ripple density for each of the 14 CI subjects. In the slope region, d′ monotonically decreased as ripple density increased. Calculated slope values are indicated in each panel (e.g., −0.89 d′/ripple for S04).

  1. Comparing thresholds obtained from the single- and three-interval procedures

FIG. 3.

FIG. 3

Psychometric functions for single-interval spectral-ripple discrimination for 14 CI subjects. Each panel represents data for an individual subject. Data points used for linear regression fits are shown as filled circles. Linear fits are shown as solid lines. The second number shown in upper right corner in each panel shows threshold. The third number shown in upper right corner shows psychometric-function slope (in units of d′/ripple). Error bars show 80% confidence intervals. Open circles represent points other than those used to estimate the slope (e.g., upper and lower asymptotic points).

Figure 4 shows the threshold obtained from the single-interval procedure was correlated with the threshold obtained from the three-interval procedure for the 14 subjects. A highly significant correlation was found between the two methods (r = 0.84, p = 0.0002).

  1. Correlation between the single-interval spectral-ripple test and speech measures

FIG. 4.

FIG. 4

Relationship between spectral-ripple thresholds determined using the three-interval procedure and those derived from the single-interval procedure in 14 CI subjects. Linear regression is represented by the solid line.

Figure 5 shows the single-interval spectral-ripple thresholds significantly correlated with speech perception performance. Significant negative correlations were found between the single-interval spectral-ripple thresholds and SRTs in two-talker babble (r = −0.64, p = 0.01) and in steady-state noise (r = −0.68, p = 0.008). A significant correlation was also found between the single-interval spectral-ripple thresholds and CNC word recognition scores in quiet (r = 0.72, p = 0.004). These results were consistent with previously reported data from the three-interval procedure (Won et al.2007; Henry et al.2005). Table 2 shows correlations between d′ values at each ripple density and speech perception scores.

FIG. 5.

FIG. 5

Relationship between single-interval spectral-ripple thresholds and speech reception thresholds in noise (left panel) and CNC scores (right panel). Linear regressions are represented by the dashed line for two-talker babble and the solid line for steady-state noise in the left panel.

TABLE 2.

Correlations of d′ values at each ripple density with speech perception tests and 3-AFC ripple test

RIPPLE DENSITY 0.25 0.5 0.707 1 1.414 2 2.828 4 5.657 8
Speech in babble
Speech in steady noise −0.57 −0.54
CNC word 0.57 0.54 0.61 0.59 0.63 0.68 0.70
3-AFC ripple 0.55 0.65 0.64 0.67 0.65

Correlation values with p < 0.05 are only shown. Nonsignificant correlations are not shown and left blank

Discussion

In experiment 1, a significant correlation (r = 0.84) was found between the thresholds obtained using the two procedures. Most importantly, consistent with previous reports by Won et al. (2007), there were significant correlations between the single-interval spectral-ripple test and speech perception in the presence of two-talker babble (r = −0.64, p = 0.01), steady-state noise (r = −0.68, p = 0.0008), and with CNC word recognition scores (r = 0.72, p = 0.004). With that said, caution needs to be taken when interpreting correlation results because they do not necessarily mean that the same hearing mechanisms were involved during the single-interval ripple test, three-interval ripple test, and speech perception tasks. The integrity of the auditory systems of CI users having excellent three-interval spectral-ripple thresholds might be quite good and thus show overall good performance. Even so, our results suggest that the single-interval, yes/no paradigm is a comparable method for evaluating spectral-ripple discrimination in CI users.

The mean spectral-ripple threshold was 6.16 ripples/octave for the single-interval test and 2.12 ripples/octave for the three-interval test. The higher thresholds obtained using the single-interval test compared to the three-interval test might, in part, be explained by procedural differences between the two methods. In the single-interval method, listeners were presented with either a single standard-standard ripple or standard-inverted ripple stimulus that was 4 s in duration. Thus, listeners could use the ripple phase inversion at the 2-s midpoint in the single-interval procedure, thereby providing an immediate comparison between the two portions of the same stimulus (see Figs. 1 and 2). This immediate comparison might involve a sensory-trace comparison as described by Durlach and Braida (1969) as well as Kidd et al. (1988) during spectral shape discrimination. In contrast, the three-interval procedure included three stimuli whereby an inverted ripple stimulus was presented along with two other standard ripple stimuli. The three-interval procedure required that listeners select the interval that sounded different from the other two, a task that likely requires more demand on short-term memory and other cognitive abilities to make the comparison. We speculate that this additional cognitive load contributed to poorer ripple thresholds. In addition, the duration of the single-interval stimulus for each ripple phase is longer (2 s) than the interval duration of the 3-AFC stimuli (500 ms), so the subjects had a longer duration for comparison with the single-interval test. Another difference is that, the single-interval procedure used the method of constant stimuli and estimated a threshold that corresponded to a d′ = 1, whereas the three-interval procedure used a two-up, one-down adaptive procedure, converging on 70.7% correct point (i.e., d′ = 1.28) on the psychometric function. Although the differences between the two procedures could contribute to the elevated spectral-ripple threshold in the single-interval procedure, it should be emphasized that the high, significant correlations between the results obtained with the two methods and the speech recognition scores suggest that the single-interval test can be used to evaluate spectral-ripple discrimination in CI listeners.

Table 2 shows correlations between d′ values at each ripple density and speech perception performance and spectral-ripple thresholds obtained with 3-AFC paradigm. Generally, d′ at the slopes showed significant correlations with speech perception, whereas d′ at the coarsest density did not. At lower ripple densities, listeners could have used intensity cue, but the d′ values at lower ripple densities did not correlate with speech perception or with spectral-ripple thresholds obtained with the 3-AFC ripple test. On the contrary, the d′ values at higher ripple densities, where spectral sensitivity is required for discrimination, were significantly correlated with speech perception and the 3-AFC spectral-ripple thresholds. These results further suggest that the single-interval ripple test evaluates spectral sensitivity which correlates with speech perception in CI users.

Experiment 2: the effect of number of channels on behavioral and physiological spectral-ripple threshold

In experiment 2, behavioral and physiological measures of spectral-ripple resolution were done with NH listeners. The single-interval procedure stimuli from experiment 1 were used for cochlear implant vocoder simulations. The effect of the number of vocoder processing channels on both behavioral and physiological spectral-ripple resolution was also investigated. The CI vocoder simulations serve as a model of CI users’ performance. We chose to test normal-hearing subjects listening to CI vocoder simulations to avoid confounding artifacts when recording cortical responses from CI users (e.g., Friesen and Picton 2010). Because cortical P1-N1-P2 “change” responses can be recorded using the spectral-ripple stimuli processed with the vocoder simulation, it suggests that it is feasible to use the single-interval spectral-ripple stimuli to explore the behavioral–physiological relationship of spectral-ripple discrimination in CI users.

Subjects

Three NH listeners participated. Each had audiometric thresholds less than 15 dB HL at all octave frequencies tested (from 0.25 to 8 kHz). The ages of the subjects were 27, 28, and 30.

Stimuli and procedures

  1. Vocoder simulation of single-interval spectral-ripple discrimination

To assess the effect of the number of channels on the single-interval spectral-ripple test, vocoder simulations of the stimuli were presented to the NH listeners. The same stimuli used in experiment 1 were used in this experiment. The number of channels tested included 1, 4, 8, 16, and 24. For vocoder stimuli, ripple densities of 0.25, 0.5, 1, 2, 4, and 8 ripples/octave were used. The input stimuli were divided into sub-band signals between 80 and 17,640 Hz by using a variable number of band-pass finite impulse response filters (e.g., Rubinstein and Turner 2003; Drennan et al.2007; Won et al.2008). For each number of channels condition, the center frequencies of the analysis filters are equally spaced along the human cochlear frequency map (Greenwood 1990). The envelope and the fine structure were extracted using a Hilbert transform. The Hilbert phases of the sub-band signals were then randomized, which creates a noise-band vocoder (Shannon et al.1995). The randomized fine-structure of each sub-band was multiplied by the Hilbert envelope of each sub-band signal. Finally, the same band-pass filter was applied again to the sub-band output and all filtered sub-band outputs were summed together. The single-interval, yes/no paradigm spectral-ripple discrimination test was administered as previously described in experiment 1. The 1-, 4-, 8-, 16-, and 24-channel conditions were run in random order.

  1. Electrophysiological methods

In the electrophysiological procedure, the same single-interval stimuli and vocoder simulation described above were used (ripple densities: 0.25, 1, 4, and 8 ripples/octave). To minimize recording time, the one-channel condition was not tested but the 4-, 8-, 16-, and 24-channel conditions were randomly presented. Each ripple density condition involved 200 presentations of standard-inverted ripple stimuli. During behavioral and electrophysiological testing, stimuli were presented at 65 dBA in the free field in a sound-treated booth via a single loudspeaker, positioned 1 m from the subject. Stimulus onset asynchrony (the time from the onset of one stimulus to the onset of the next stimulus) was randomly jittered (8,000 ms ± 500 ms) to reduce neural adaptation. Evoked potentials were recorded with a 64-channel electrode cap using a PC-based Neuroscan™ system. Analog-to-digital sampling rate was 1,000 Hz. The reference electrode was Cz (vertex) during recording and was re-referenced offline to an average reference. The recording window was 5,100 ms in duration, including a 100 ms pre-stimulus baseline. Online EEG filters were 0.05–100 Hz. Data were low-pass filtered offline at 40 Hz. Ocular artifact was monitored with electrodes at the lateral canthus of each eye and with electrodes inferior and superior to each eye. Sweeps including artifacts exceeding ±70 μV were not included in the average.

All recordings were obtained in a sound-treated booth while the subject sat in a comfortable chair attending to a muted, close-captioned movie of their choice. Each subject completed data collection over the course of two sessions on different days, lasting approximately 10 h. There were 5-min breaks between each condition. The order of conditions was randomized. Evoked potentials were recorded on separate days following the behavioral test.

P1-N1-P2 responses, signaling stimulus onset, change, and stimulus offset were detected using visual analysis, as well as a statistical test applied to the phase coefficients of a wavelet transform (Ross et al. 1999, 2007). Individual accepted sweeps from the average-referenced Cz electrode were submitted to a Morlet wavelet transform, resulting in amplitude and phase coefficients for each point in time and frequency. The Rayleigh test (Fisher 1993) was performed on the phase coefficients from the wavelet analysis. If the phase of a time–frequency point was uniformly distributed across the epochs, then the null hypothesis would be accepted, indicating that there was no response to the stimulus change in that particular condition.

Results

  1. Behavioral threshold as a function of the number of channels

The filled circles in Fig. 6 show the behavioral spectral-ripple thresholds as a function of the number of channels. A one-way repeated measure of ANOVA was performed to compare threshold with the number of channels, and as expected, there was a significant effect of the number of channels (F4,9 = 19.95, p < 0.001). Post hoc tests (Bonferroni) showed a statistically significant difference between the 24 channel and all other channel conditions. This is consistent with a previous observation in NH listeners that the spectral-ripple discrimination ability increases as the number of channels increases (Henry and Turner 2003).

  1. Physiological response as a function of the number of channels

FIG. 6.

FIG. 6

Behavioral and physiological thresholds from experiment 2 as a function of number of vocoder processing channels. Error bars show standard error across subjects.

Individual evoked cortical responses are shown in Fig. 7A (subject N1, left panel) and Fig. 7B (subject N2 and N3). Onset P1-N1-P2 responses were seen in all subjects in all conditions. P1-N1-P2 “change” responses, elicited by spectral-ripple phase inversions, were also seen in all subjects, and the presence or absence of the “change” response approximates behavioral thresholds (shown as a θ in Fig. 7A and B).

FIG. 7.

FIG. 7

FIG. 7

Individual data from experiment 2. A shows physiologic data for subject N01. Left time–domain cortical auditory evoked potential waveforms are plotted from electrode Cz. Each row represents a different vocoder channel condition (e.g., 24 channel, bottom row). Within each panel, responses to standard-inverted ripple stimuli are shown. The corresponding ripple density is displayed next to each waveform (e.g., 0.25, top trace in each row). For comparison, the behavioral vocoder spectral-ripple discrimination thresholds are plotted as theta symbols. Right for each cortical auditory evoked potential waveform, results of the Rayleigh test applied to coefficients of a time-frequency analysis of the cortical auditory evoked potential waveforms are plotted. P values less than 0.001 from the Rayleigh test are indicated by the shaded areas. Statistically significant “onset” responses are present prior to 0.5 s, while the “change” responses are represented by low p values in the 2.1–2.4 s time range. B shows time–domain cortical auditory evoked potential waveforms for two subjects, N02 and N03.

P values resulting from the Rayleigh test are shown for one subject in the right panel of Fig. 7A. The Rayleigh test was used as a statistical means of detecting the presence of the “change” response; results from the Rayleigh test were consistent with behavioral thresholds. The presence of a physiological “change” response was represented by low p values in the 2.1–2.4 s range. When the “change” responses were present in time domain waveforms (left panel for each subject), the Rayleigh test had low p values, less than 0.001, in the same latency region where the “change” response was observed (right panel for each subject). The presence of the “change” response, when analyzed visually using the time–domain waveform, as well as by the objective Rayleigh test, approximated behavioral thresholds. Similar results were obtained when electrodes over temporal lobes were examined.

The open triangles in Fig. 6 show the estimated physiological thresholds as a function of the number of channels. Physiological thresholds were estimated as the geometric mean between the highest ripple density at which the “change” response was elicited (confirmed by the Rayleigh test) and the next ripple density. The physiological thresholds were lower than the behavioral thresholds for all conditions, but the physiological thresholds also improved as a function of number of channels.

Discussion

In experiment 2, the vocoder behavioral data showed that NH listeners used the additional spectral information provided by higher numbers of channels to resolve more closely spaced spectral peaks to detect the change within a stimulus. Our observation is consistent with previous reports such as Henry and Turner (2003), which showed significant increases in performance on spectral-ripple discrimination up to 16 vocoder processing channels in NH listeners. Friesen et al. (2001, 2009) also showed that performance of NH listeners on vowel and consonant recognition increased up to 20 vocoder processing channels.

The physiological spectral-ripple data show that the presence of physiological “change” responses approximated the behavioral psychometric functions. As the number of vocoder channels increased, cortical “change” responses were affected, suggesting that the physiological sensitivity to spectral-ripple phase inversion increased with additional spectral information. These findings are in agreement with the behavioral discrimination data obtained from the same subjects, using the same stimuli. This is the first documentation that physiological “change” responses can be recorded in humans in response to the phase inversion of spectral-ripple stimuli. Further, our results show good agreement between physiological and behavioral thresholds showing a similar growth function as the number of channels increased (Fig. 6). In this respect, the physiology data show that it is feasible to further determine how close the brain–behavior relationship can be established using these methods.

What is less clear is the comparison between the CI subjects and NH vocoder simulations. The mean threshold of the CI subjects is comparable to that of the NH subjects when the number of processing channels ranged between 16 and 24. This result is not consistent with our initial prediction, as well as previous reports that CI listeners have only about eight functional channels (e.g., Shannon et al.1995; Dorman and Loizou 1997, 1998; Henry and Turner 2003). The vocoder simulation in experiment 2 used the low and high cutoff frequencies of 80 and 17,640 Hz, respectively. This frequency range is much wider than a typical CI processor (e.g., 250–8,700 Hz). This frequency range is also much wider than the 5,000 Hz bandwidth of the stimuli, thus several higher channels carried no information. If this discrepancy is considered, the mean threshold of CI subjects would correspond to the NH vocoder performance obtained with 11–17 simulated channels. It may be that, if the NH subjects were tested with a vocoder simulation covering 250–8,700 Hz, the mean threshold of the CI subjects would correspond to the NH vocoder simulation threshold at about eight channels. In addition, tone vocoders would also present better spectral contrast between standard and inverted ripple for the 3-AFC and the single-interval tests. These ideas were tested in experiment 3. Electrophysiology data were also collected using vocoder stimuli with these new filter settings, and utilized smaller steps in ripple density to better define physiological thresholds and the brain–behavior relationship.

Experiment 3: the correlation between behavioral performance and change response amplitude

In the behavioral measures for experiment 3, noise and tone vocoder simulations were tested to determine if vocoder carrier affects spectral-ripple thresholds. Two channel conditions were tested including eight and 16 channels. The vocoder frequency range was 250 and 8,700 Hz. Electrophysiology data were also collected using smaller steps in ripple density to more precisely estimate electrophysiological thresholds so they could be compared to behavioral data from the same subjects.

Subjects

The same three NH listeners in experiment 2 participated. The subjects completed data collection over the course of multiple sessions on different days, lasting approximately 10 h.

Stimuli and procedures

  1. Vocoder simulation and behavioral tests

A total of four vocoder simulations were created, including eight-channel noise, eight-channel tone, 16-channel noise, and 16-channel tone vocoders. Ripple densities of 0.25, 0.5, 1, 2, 4, and 8 ripples/octave were used. These ripple densities were also used in the behavioral single-interval test in experiment 2 and were wide enough to estimate the behavioral thresholds with eight- and 16-channel conditions. The noise vocoders used the same signal processing as experiment 2 except for different low cutoff (250 Hz) and high cutoff (8,700 Hz) frequencies. The tone vocoders also used the same signal processing as the noise vocoders except that the Hilbert envelope was modulated with a tone carrier at the center frequency of the band. Subjects were tested with a total of eight testing conditions in random order (i.e., four vocoder simulations for both 3-AFC and single-interval tests).

  1. Electrophysiology methods

The same recording and analysis procedures were used as in experiment 2. For the electrophysiological data collection, only the eight-channel noise vocoder simulation was used. The electrophysiological data with the eight-channel condition in experiment 2 (see Fig. 4) were collected with 0.25, 1, 4, and 8 ripples/octave stimuli; “change” responses were not seen at densities greater than 0.25 ripples/octave for all subjects, suggesting that the physiological threshold was likely between a ripple density of 0.25 and 1 ripple/octave. In this experiment, each subject was tested at six ripple densities that were logarithmically equally spaced between 0.25 and 2 ripples/octave. These ripple densities were chosen to more closely approximate the physiological threshold by including ripple densities above and below the behavioral threshold.

Results

  1. Behavioral thresholds

Figure 8 shows the mean thresholds of the three NH subjects from the 3-AFC test (left panel) and the single-interval test (right panel). For comparison, the mean spectral-ripple thresholds for the 14 CI subjects are indicated as filled triangles. A 2 × 2 × 2 repeated measures ANOVA was performed on the thresholds (two types of tests, two carriers, and two channel numbers). It showed a significant effect of channel numbers (F1,2 = 55.8, p = 0.017), but the type of test (3-AFC or one interval) and carrier did not have any effect. Unlike the CI subjects, the NH subjects showed very similar performance on the 3-AFC and single-interval tests. When the NH vocoder thresholds were compared to the mean threshold of the 14 CI subjects, the eight-channel vocoder thresholds were comparable to the mean of the CI subjects for the 3-AFC test.

  1. Electrophysiological thresholds

FIG. 8.

FIG. 8

Average spectral-ripple thresholds for four different vocoder conditions in NH listeners (left 3-AFC test, right one-interval test). For comparison, the mean spectral-ripple threshold for the 14 CI subjects is indicated as filled triangles. Error bars represent 95% confidence intervals. Data points are slightly horizontally displaced for clarity.

Figure 9 shows individual waveforms of cortical responses. The Rayleigh test was also performed to determine if the “change” responses were present in the waveforms. The Rayleigh test showed that there were “change” responses up to 1.0, 0.7079, and 1.4125 ripples/octave for N01, N02, and N03, respectively. Correspondingly, physiological thresholds, defined as the geometric mean between the highest ripple density at which the “change” response was elicited and the next ripple density, were 1.188, 0.841, and 1.681 ripples/octave for N01, N02, and N03, respectively, whereas the behavioral thresholds with the same vocoder were 3.0, 2.83, and 1.74 ripples/octave for N01, N02, and N03, respectively. Physiological thresholds were slightly lower than behavioral thresholds for all subjects.

  1. Correlation between behavioral and physiological data

FIG. 9.

FIG. 9

Time–domain cortical auditory evoked potential waveforms are shown. Each column shows individual normal-hearing subject’s data at different ripple densities (e.g., 0.25 ripples/octave). Stimuli were processed with the eight-channel noise vocoder simulation. The corresponding ripple density is displayed next to each waveform. Amplitude of “change” response decreases as ripple density increases.

To better determine the relationship between behavioral discrimination and physiological responses, the peak-to-peak amplitudes of N1-P2 “change” responses for all ripple densities were obtained. Response amplitudes were measured only for those conditions where the Rayleigh test identified a “change” response as being present; amplitudes of zero were recorded for conditions where the Rayleigh test indicated an absent “change” response. For each individual subject, N1-P2 “change” response amplitudes were normalized by the subject’s largest “change” response amplitude. Amplitudes and d′ declined as ripple density increased. Figure 10 illustrates the within-subject brain–behavior relationship and shows normalized N1-P2 response amplitudes and d′ from the behavioral procedure. At low ripple densities, where d′ was higher (i.e., better behavioral detection), N1-P2 “change” response amplitudes were also higher (i.e., better physiological detection).

FIG. 10.

FIG. 10

Relationships between behavioral and physiological measures are illustrated in scatterplots of normalized amplitude of N1-P2 “change” responses (filled symbols plotted in μV on the left y-axis) and behavioral discrimination performance (open symbols plotted in d′ on the right y-axis) as a function of ripple density. Data for individual subject N01 (circles), N02 (triangles), and N03 (squares) are represented by different symbols.

Figure 11 illustrates the across-subjects brain–behavior relationship and shows the correlations between the normalized N1-P2 “change” response amplitudes and the d′ values for all subjects at ripple densities in which both behavior and physiology were tested. There were strong correlations (r ≥ 0.86) observed for two subjects (N01 and N03). When all data points for the three subjects were pooled together to determine the correlation, about 54% of the variance (r = 0.73, p = 0.007) in behavioral performance was accounted for by amplitudes of the N1-P2 “change” responses.

FIG. 11.

FIG. 11

The brain–behavior relationship is represented in scatterplots of the normalized amplitude of N1-P2 “change” responses and behavioral d′ data. Data for individual subject N01 (circles), N02 (triangles), and N03 (squares) are represented by different symbols. Significant correlations were found for each subject; linear fits between the two measures are shown as lines for each subject.

Discussion

When the new filter setting was used (250–8,700 Hz) to simulate the 3-AFC test, the mean threshold of CI subjects corresponded to the NH, simulated eight-channel thresholds obtained with both noise and tone carrier vocoders. This observation suggests that the vocoder simulation signal processing is adequate and these data are consistent with previous reports of 8 functional channels in CI listeners (e.g., Shannon et al.1995; Dorman and Loizou 1997, 1998; Henry and Turner 2003). For the single-interval test, the 16-channel vocoder threshold corresponded to the mean of the CI subjects. This result is not consistent with our prediction that the narrow filter range should result in the eight-channel single-interval vocoder threshold would correspond to the mean threshold of CI subjects. The NH vocoder simulation data in Fig. 8 showed no effect of test type (3-AFC vs. single-interval), whereas the CI subjects in experiment 1 showed significantly better threshold with the single-interval test.

It appears that in the single-interval test, some cues were only available to CI listeners and those cues were not preserved in the vocoder outputs. It might be that the output differences are attributed to the differences between the actual CI sound processing and the vocoder signal processing. To evaluate the output differences, the outputs of the actual CI sound processing and the noise and tone vocoder processing were compared. Fidelity120 strategy was chosen for this analysis. The absolute value of the difference between output levels for standard and inverted ripple stimuli were estimated for individual channels. These values were summed across channels and the resulting value was used as an index to compare Fidelity120 and the vocoder outputs. This analysis shows that the vocoder processing resulted in a much smaller index than Fidelity120, especially when the eight-channel vocoder was used. The vocoder simulations in this study did not implement pre-emphasis filters or compression functions. When they were added, a much larger index was observed. Other differences include frequency allocation for each channel, envelope detection method, input dynamic range, and automatic gain control. In addition, five of the 14 CI subjects used Fidelity120 as their clinical sound encoding strategy. Drennan et al. (2010) demonstrated that CI listeners could achieve significantly better spectral-ripple discrimination with Fidelity120 than with HiResolution, because current steering implemented in Fidelity120 could provide improved spectral detail of the sound. This pattern is also observed in the present study. The five users of Fidelity120 showed 7.4 ripples/octave, whereas the other nine subjects (i.e., non-Fidelity120 users) showed 5.5 ripples/octave for the one-interval test. Note that the vocoder simulation in the present study did not implement current steering. It seems that the vocoder simulated single-interval test was more affected by these differences. A future direction will be to define optimal recording parameters in case the ability to detect changes in ongoing acoustic vs. electric stimuli is differentially affected by neural adaptation (e.g., Wojtczak et al.2003). Neural adaptation might help CI subjects detect changes at the 2-s point of the standard-inverted stimuli, whereas this effect might not be large enough for NH subjects and would be absent in the 3-AFC test. Nevertheless, the strong correlations of the single-interval spectral-ripple test in CI listeners with the 3-AFC spectral-ripple test and speech perception tests do not detract from the single-interval procedure as a potentially valuable measure of spectral-ripple discrimination.

The main finding from experiment 3 was that amplitudes of the “change” responses were correlated with behavioral performance. Electrophysiology data have also shown that the amplitudes of P1-N1-P2 “change” responses elicited by spectral-ripple phase inversion are related to the ripple density. That is, as ripple density increased, smaller frequency differences resulting from spectral-ripple phase inversion in the ongoing stimuli evoked lower amplitude “change” responses. The presence of “change” responses in the present data reflects the physiological detection of the acoustic cue in the stimuli, namely the physiological detection of spectral-ripple phase inversion at the level of the auditory cortex. The acoustic cue could also include a temporal component which disrupts an ongoing signal. Nevertheless, significant correlations between the behavioral and physiological data suggest that the neural encoding of spectral features in the acoustic sounds and its processing at the cortical level is an important factor in determining behavioral spectral-ripple discrimination. Although these correlations were obtained across conditions for individual subjects, a strong correlation was also found across subjects (r = 0.73, p = 0.007) when all subjects’ data points were included for the correlation analysis.

When physiological thresholds were defined as the geometric mean between the highest ripple density at which the “change” response was elicited and the next ripple density; physiological thresholds were slightly lower than behavioral thresholds for all subjects. Different testing paradigms used in the behavioral and physiological conditions may have contributed to differences between the perceptual and physiological thresholds. For example, a passive listening paradigm was employed to record the physiology data, where cognitive processing and demands were different than during the behavioral listening task. To be of use clinically, it will be important to determine if the physiological detection of spectral-ripple change falls within a clinically acceptable range of perceptual thresholds, just as there is a clinically defined acceptable brain–behavior relationship when using auditory brainstem thresholds to estimate hearing sensitivity. Future directions will aim to optimize recording parameters, so clinically relevant relationships between physiological and perceptual “change” thresholds can be established using spectral-ripple stimuli.

General discussion

In the present study, a single-interval, yes/no paradigm spectral-ripple discrimination test was developed. When adult CI users were tested in experiment 1, a highly significant correlation was found between the spectral-ripple threshold obtained using the proposed single-interval method and the threshold obtained using a previously established three-interval, three-AFC method (Won et al.2007). The single-interval spectral-ripple threshold was also significantly correlated with speech recognition in quiet and noise. In experiment 2, the behavioral vocoder experiment showed that, as the number of channels increased, the NH listeners were able to detect and discriminate a higher density of spectral-ripple stimuli. The physiological test showed that the single-interval spectral-ripple stimuli can evoke the “change” response, signaling the physiological detection of spectral-ripple phase inversion, and the presence of this “change” response increased as a function of the number of vocoder processing channels. The presence of the change response qualitatively agreed with behavioral performance. In experiment 3, where smaller step sizes in ripple density were used, physiological thresholds were more accurately estimated. The amplitudes of the “change” responses significantly correlated with behavioral performance. Taken together, these findings confirm that the single-interval spectral-ripple discrimination is a feasible approach for studying spectral-ripple resolution in the human auditory system.

Several studies have compared different types of psychophysical procedures and threshold estimation methods such as single-, two-, or three-AFC procedures using either the adaptive staircase rule, maximum-likelihood stepping rule, or the method of constant stimuli (e.g., Shelton et al.1982; Shelton and Scarrow 1984; Saberi and Green 1997; Leek 2001; Buss et al.2001; Marvit et al.2003). In general, different efficiency and reliability were found according to the type of hearing task. Likewise, it appears that both single- and three-interval spectral-ripple tests in the current study have their own distinct advantages.

From the perspective of testing time, the three-interval procedure is preferred because it requires significantly shorter time to obtain a threshold than the single-interval procedure. However, the single-interval procedure is more appropriate for naïve listeners (e.g., young children) who are less able to execute a task that requires comparing two or more separate sounds (Green 1995). Single-interval procedures are readily adaptable for testing preverbal children using observer-based psychophysical procedures, which have been used to investigate several hearing abilities in NH infants (e.g., Olsho et al.1987; Bargones et al.1995; Leibold and Werner 2006) as well as in infant CI users (Dasika et al.2009; Grieco-Calub et al.2008).

The current study found a significant correlation of the single-interval spectral-ripple test with speech recognition in quiet as well as in noise, which suggests the intriguing possibility that if the spectral-ripple threshold can be obtained from prelingual CI users, their speech perception capabilities could be predicted. This clinical information could lead to improved habilitation/rehabilitation and educational interventions, as well as outcomes. In principle, such measures could also allow testing and optimization of sound processing strategies in children whose auditory nervous systems are still maximally plastic.

In addition, the spectral change stimuli used in the single-interval approach have also been demonstrated to be appropriate to conduct electrophysiological studies to explore spectral information processing in the auditory cortex. If cortical auditory evoked potentials can be measured in response to spectral-ripple phase inversion in the stimulus in CI listeners, it may be possible to evaluate various mapping strategies and possibly determine which strategy leads to the best spectral resolution. Once optimal stimulus presentation rates and recording montages for maturing auditory systems are defined, such approaches may facilitate the tracking of longitudinal and developmental changes involving spectral processing in CI users following implantation. All of these possibilities have motivated the present study, and the feasibility data obtained in this study justify expanding this area of research to include the single-interval approach when studying auditory spectral processing in NH, HI, and CI listeners.

Acknowledgments

We greatly appreciate the dedicated efforts of our subjects. This study was supported by NIH-NIDCD grants R01-DC007525, P30-DC04661, R01-DC007705, T32-DC000033, F31-DC009755, F31-DC010553, F32-DC008238, L30-DC008490, and Korea Research Foundation grant KRF-2006-612-D00106, and an educational fellowship from Advanced Bionics Corporation. Thanks to Bernhard Ross for his programming assistance with the Rayleigh test as well as his consultation.

Footnotes

This paper was a part of the first author’s PhD dissertation. Portions of this data were presented at the 2007 Northwest Auditory and Vestibular Meeting at Portland, OR, USA at the 2008 ARO Midwinter Meeting at Phoenix, AZ, USA at the 2008 American Auditory Society Annual Meeting at Scottsdale, AZ, USA, and at the 2011 ARO Midwinter Meeting at Baltimore, MD, USA.

References

  1. Abbas PJ, Brown CJ (2000) Electrophysiology and device telemetry. In: Waltzman SB, Cohen NL (eds) Cochlear implants. Thieme
  2. Bargones JY, Werner LA, Marean GC. Infant psychometric functions for detection: mechanisms of immature sensitivity. J Acoust Soc Am. 1995;98:99–111. doi: 10.1121/1.414446. [DOI] [PubMed] [Google Scholar]
  3. Brown CJ, Hughes ML, Luk B, Abbas PJ, et al. The relationship between EAP and EABR thresholds and levels used to program the nucleus 24 speech processor: data from adults. Ear Hear. 2000;21:151–163. doi: 10.1097/00003446-200004000-00009. [DOI] [PubMed] [Google Scholar]
  4. Buss S, Schorer E, Florentine M, Zwicker E. Decision rules in detection of simple and complex tones. J Acoust Soc Am. 1986;80:1646–1657. doi: 10.1121/1.394329. [DOI] [PubMed] [Google Scholar]
  5. Buss E, Hall JW, Grose JH, Dev MB. A comparison of threshold estimation methods in children 6–11 years of age. J Acoust Soc Am. 2001;109:727–731. doi: 10.1121/1.1337960. [DOI] [PubMed] [Google Scholar]
  6. Byrne D, Dillon H, Tran K, Arlinger S, et al. An international comparison of long-term average speech spectra. J Acoust Soc Am. 1994;96:2108–2120. doi: 10.1121/1.410152. [DOI] [Google Scholar]
  7. Dasika VK, Werner LA, Norton SJ, Nie K, et al. Measuring sound detection and reaction time in infant and toddler cochlear implant recipients using an observer-based procedure: a first report. Ear Hear. 2009;30:250–261. doi: 10.1097/AUD.0b013e3181986dfe. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Dorman MF, Loizou PC. Speech intelligibility as a function of the number of channels of stimulation for normal-hearing listeners and patients with cochlear implants. Am J Otol. 1997;18:S113–S114. [PubMed] [Google Scholar]
  9. Dorman MF, Loizou PC. The identification of consonants and vowels by cochlear implant patients using a 6-channel continuous interleaved sampling processor and by normal-hearing subjects using simulations of processors with two to nine channels. Ear Hear. 1998;19:162–166. doi: 10.1097/00003446-199804000-00008. [DOI] [PubMed] [Google Scholar]
  10. Drennan WR, Won JH, Dasika VK, Rubinstein JT. Effects of temporal fine structure on the lateralization of speech and on speech understanding in noise. J Assoc Res Otolaryngol. 2007;8:373–383. doi: 10.1007/s10162-007-0074-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Drennan WR, Won JH, Nie K, Jameyson EM, Rubinstein JT. Sensitivity of psychophysical measures to signal processor modifications in cochlear implant users. Hear Res. 2010;262:1–8. doi: 10.1016/j.heares.2010.02.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Durlach NI, Braida LD (1969) Intensity perception. I. Preliminary theory of intensity resolution. J Acoust Soc Am 46:372–383 [DOI] [PubMed]
  13. Eisenberg LS, Martinez AS, Boothroyd A. Assessing auditory capabilities in young children. Int J Pediatr Otorhinolaryngol. 2007;71:1339–1350. doi: 10.1016/j.ijporl.2007.05.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Faulkner KF, Tremblay KL, Rubinstein JT, Werner LA, Nie K (2010) Auditory training in adult cochlear implant listeners using spectrally-rippled noise stimuli in an adaptive, single-interval paradigm. Presented at 33rd annual meeting of the association research in otolaryngology. Abstract #363
  15. Fisher NI. Statistical analysis of circular data. Cambridge: Cambridge University Press; 1993. [Google Scholar]
  16. Friesen LM, Picton TW. A method for removing cochlear implant artifact. Hear Res. 2010;259:95–106. doi: 10.1016/j.heares.2009.10.012. [DOI] [PubMed] [Google Scholar]
  17. Friesen LM, Tremblay KL. Acoustic change complexes recorded in adult cochlear implant listeners. Ear Hear. 2006;27:678–685. doi: 10.1097/01.aud.0000240620.63453.c3. [DOI] [PubMed] [Google Scholar]
  18. Friesen LM, Shannon RV, Baskent D, Wang X. Speech recognition in noise as a function of the number of spectral channels: comparison of acoustic hearing and cochlear implants. J Acoust Soc Am. 2001;110:1150–1163. doi: 10.1121/1.1381538. [DOI] [PubMed] [Google Scholar]
  19. Friesen LM, Tremblay KL, Rohila N, et al. Evoked cortical activity and speech recognition as a function of the number of simulated cochlear implant channels. Clin Neurophysiol. 2009;120(4):776–782. doi: 10.1016/j.clinph.2009.01.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Gourevitch V, Galanter E. A significance test for one parameter isosensitivity functions. Psychometrika. 1967;32:25–33. doi: 10.1007/BF02289402. [DOI] [PubMed] [Google Scholar]
  21. Green DM. Maximum-likelihood procedures and the inattentive observer. J Acoust Soc Am. 1995;97:3749–3760. doi: 10.1121/1.412390. [DOI] [PubMed] [Google Scholar]
  22. Greenwood DD. A cochlear frequency-position function for several species—29 years later. J Acoust Soc Am. 1990;87:2592–2605. doi: 10.1121/1.399052. [DOI] [PubMed] [Google Scholar]
  23. Grieco-Calub TM, Litovsky RY, Werner LA. Using the observer-based psychophysical procedure to assess localization acuity in toddlers who use bilateral cochlear implants. Otol Neurotol. 2008;29:235–239. doi: 10.1097/mao.0b013e31816250fe. [DOI] [PubMed] [Google Scholar]
  24. Harris RW. Speech audiometry materials compact disk. Provo: Brigham Young University; 1991. [Google Scholar]
  25. Henry BA, Turner CW. The resolution of complex spectral patterns by cochlear implant and normal-hearing listeners. J Acoust Soc Am. 2003;113:2861–2873. doi: 10.1121/1.1561900. [DOI] [PubMed] [Google Scholar]
  26. Henry BA, McKay CM, McDermott HJ, Clark GM. The relationship between speech perception and electrode discrimination in cochlear implantees. J Acoust Soc Am. 2000;108:1269–1280. doi: 10.1121/1.1287711. [DOI] [PubMed] [Google Scholar]
  27. Henry BA, Turner CW, Behrens A. Spectral peak resolution and speech recognition in quiet: normal hearing, hearing impaired, and cochlear implant listeners. J Acoust Soc Am. 2005;118:1111–1121. doi: 10.1121/1.1944567. [DOI] [PubMed] [Google Scholar]
  28. Jerger J, Jerger S. Evoked response to intensity and frequency change. Arch Otolaryngol. 1970;91:433–436. doi: 10.1001/archotol.1970.00770040627007. [DOI] [PubMed] [Google Scholar]
  29. Kidd G, Mason CR, Hanna TE (1988) Evidence for sensory-trace comparisons in spectral shape discrimination. J Acoust Soc Am 84:144–149 [DOI] [PubMed]
  30. Leek MR. Adaptive procedures in psychophysical research. Percept Psychophys. 2001;63:1279–1292. doi: 10.3758/BF03194543. [DOI] [PubMed] [Google Scholar]
  31. Leibold L, Werner L. Effect of masker-frequency variability on the detection performance of infants and adults. J Acoust Soc Am. 2006;119:3960–3970. doi: 10.1121/1.2200150. [DOI] [PubMed] [Google Scholar]
  32. Levitt H. Transformed up–down methods in psychoacoustics. J Acoust Soc Am. 1971;49(Suppl 2):467–477. doi: 10.1121/1.1912375. [DOI] [PubMed] [Google Scholar]
  33. Litvak LM, Spahr AJ, Saoji AA, Fridman GY. Relationship between perception of spectral ripple and speech recognition in cochlear implant and vocoder listeners. J Acoust Soc Am. 2007;122:982–991. doi: 10.1121/1.2749413. [DOI] [PubMed] [Google Scholar]
  34. MacMillan NA, Creelman CD. Detection theory: a user’s guide. Hillsdale: Erlbaum; 2005. [Google Scholar]
  35. Martin BA, Tremblay KL, Korczak P. Speech evoked potentials: from the laboratory to the clinic. Ear Hear. 2008;29:285–313. doi: 10.1097/AUD.0b013e3181662c0e. [DOI] [PubMed] [Google Scholar]
  36. Marvit P, Florentine M, Buus S. A comparison of psychophysical procedures for level–discrimination thresholds. J Acoust Soc Am. 2003;113:3348–3361. doi: 10.1121/1.1570445. [DOI] [PubMed] [Google Scholar]
  37. McCandless GA, Rose DE. Evoked cortical responses to stimulus change. J Speech Hear Res. 1970;13:624–634. doi: 10.1044/jshr.1303.624. [DOI] [PubMed] [Google Scholar]
  38. Olsho LW, Koch EG, Halpin CF. Level and age effects in infant frequency discrimination. J Acoust Soc Am. 1987;82:454–464. doi: 10.1121/1.395446. [DOI] [PubMed] [Google Scholar]
  39. Peterson GE, Lehiste I. Revised CNC lists for auditory tests. J Speech Hear Disord. 1962;27:62–70. doi: 10.1044/jshd.2701.62. [DOI] [PubMed] [Google Scholar]
  40. Ponton CW, Don M, Eggermont JJ, Waring MD, et al. Auditory system plasticity in children after long periods of complete deafness. NeuroReport. 1996;8:61–65. doi: 10.1097/00001756-199612200-00013. [DOI] [PubMed] [Google Scholar]
  41. Ross B, Lutkenhoner B, Pantev C, Hoke M. Frequency-specific threshold determination with the ceragram method: basic principle and retrospective evaluation of data. Audiol Neurootol. 1999;4:12–27. doi: 10.1159/000013816. [DOI] [PubMed] [Google Scholar]
  42. Ross B, Tremblay KL, Picton TW. Physiological detection of interaural phase differences. J Acoust Soc Am. 2007;121:1017–1027. doi: 10.1121/1.2404915. [DOI] [PubMed] [Google Scholar]
  43. Rubinstein JT, Turner CW (2003) A novel acoustic simulation of cochlear implant hearing: effects of temporal fine structure. First. Int. IEEE. EMBS. Conf. Neural. Eng. IEEE Press, 142–145
  44. Saberi K, Green D. Evaluation of maximum-likelihood estimators in nonintensive auditory psychophysics. Percept Psychophys. 1997;59:867–876. doi: 10.3758/BF03205504. [DOI] [PubMed] [Google Scholar]
  45. Saoji AA, Litvak LM, Spahr AJ, Eddins DA (2009) Spectral modulation detection and vowel and consonant identifications in cochlear implant listeners. J Acoust Soc Am 126:955–958 [DOI] [PubMed]
  46. Shannon RV, Zeng FG, Kamath V, Wygonski J, et al. Speech recognition with primarily temporal cues. Science. 1995;270:303–304. doi: 10.1126/science.270.5234.303. [DOI] [PubMed] [Google Scholar]
  47. Shelton B, Scarrow I. 2-Alternative versus 3-alternative procedures for threshold estimation. Percept Psychophys. 1984;35:385–392. doi: 10.3758/BF03206343. [DOI] [PubMed] [Google Scholar]
  48. Shelton B, Picardi M, Green D. Comparison of 3 adaptive psychophysical procedures. J Acoust Soc Am. 1982;71:1527–1533. doi: 10.1121/1.387806. [DOI] [Google Scholar]
  49. Spoor A, Timmer F, Odenthal DW. The evoked auditory response to intensity modulated and frequency modulated tones and tone bursts. Int Audiol. 1969;8:410–415. doi: 10.3109/05384916909079086. [DOI] [Google Scholar]
  50. Supin A, Popov VV, Milekhina ON, Tarakanov MB. Frequency resolving power measured by rippled noise. Hear Res. 1994;78:31–40. doi: 10.1016/0378-5955(94)90041-8. [DOI] [PubMed] [Google Scholar]
  51. Turner CW, Gantz BJ, Vidal C, Behrens A, et al. Speech recognition in noise for cochlear implant listeners: benefits of residual acoustic hearing. J Acoust Soc Am. 2004;115:1729–1735. doi: 10.1121/1.1687425. [DOI] [PubMed] [Google Scholar]
  52. Wojtczak M, Donaldson GS, Viemeister NF. Intensity discrimination and increment detection in cochlear-implant users. J Acoust Soc Am. 2003;114(1):396–407. doi: 10.1121/1.1579007. [DOI] [PubMed] [Google Scholar]
  53. Won JH, Drennan WR, Rubinstein JT. Spectral-ripple resolution correlates with speech reception in noise in cochlear implant users. J Assoc Res Otolaryngol. 2007;8:384–392. doi: 10.1007/s10162-007-0085-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Won JH, Schimmel SM, Drennan WR, Souza PE, et al. Improving performance in noise for hearing aids and cochlear implants using coherent modulation filtering. Hear Res. 2008;239:1–11. doi: 10.1016/j.heares.2008.01.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Won JH, Drennan WR, Kang RS, Rubinstein JT. Psychoacoustic abilities associated with music perception in cochlear implant users. Ear Hear. 2010;31:796–805. doi: 10.1097/AUD.0b013e3181e8b7bd. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from JARO: Journal of the Association for Research in Otolaryngology are provided here courtesy of Association for Research in Otolaryngology

RESOURCES