Relationship Between Behavioral and Physiological Spectral-Ripple Discrimination

Jong Ho Won; Christopher G Clinard; Seeyoun Kwon; Vasant K Dasika; Kaibao Nie; Ward R Drennan; Kelly L Tremblay; Jay T Rubinstein

doi:10.1007/s10162-011-0257-4

2011 Jan 27;12(3):375–393. doi: 10.1007/s10162-011-0257-4

PMCID: PMC3085690 PMID: 21271274

Abstract

Previous studies have found a significant correlation between spectral-ripple discrimination and speech and music perception in cochlear implant (CI) users. This relationship could be of use to clinicians and scientists who are interested in using spectral-ripple stimuli in the assessment and habilitation of CI users. However, previous psychoacoustic tasks used to assess spectral discrimination are not suitable for all populations, and it would be beneficial to develop methods that could be used to test all age ranges, including pediatric implant users. Additionally, it is important to understand how ripple stimuli are processed in the central auditory system and how their neural representation contributes to behavioral performance. For this reason, we developed a single-interval, yes/no paradigm that could potentially be used both behaviorally and electrophysiologically to estimate spectral-ripple threshold. In experiment 1, behavioral thresholds obtained using the single-interval method were compared to thresholds obtained using a previously established three-alternative forced-choice method. A significant correlation was found (r = 0.84, p = 0.0002) in 14 adult CI users. The spectral-ripple threshold obtained using the new method also correlated with speech perception in quiet and noise. In experiment 2, the effect of the number of vocoder-processing channels on the behavioral and physiological threshold in normal-hearing listeners was determined. Behavioral thresholds, using the new single-interval method, as well as cortical P1-N1-P2 responses changed as a function of the number of channels. Better behavioral and physiological performance (i.e., better discrimination ability at higher ripple densities) was observed as more channels were added. In experiment 3, the relationship between behavioral and physiological data was examined. Amplitudes of the P1-N1-P2 “change” responses were significantly correlated with d′ values from the single-interval behavioral procedure. Results suggest that the single-interval procedure with spectral-ripple phase inversion in ongoing stimuli is a valid approach for measuring behavioral or physiological spectral resolution.

Keywords: spectral-ripple discrimination, behavioral and physiological measure, cochlear implant, auditory change response, auditory cortex

Introduction

Spectral-ripple stimuli consist of noise signals with rippled spectra in which the frequency positions of the spectral peaks and valleys alternate (Supin et al. 1994). Over the past few years, spectral-ripple discrimination has been widely used in a variety of behavioral experiments with normal-hearing (NH), hearing-impaired (HI), and cochlear-implant (CI) listeners (e.g., Supin et al. 1994; Henry et al. 2000, 2005; Won et al. 2007; Litvak et al. 2007; Saoji et al. 2009; Won et al. 2010). For example, the ability of subjects to discriminate a reversal in the phase of the rippled shape (Henry et al. 2000, 2005; Won et al. 2007), and the ability to differentiate between a spectral-ripple stimulus and white noise (Litvak et al. 2007) have been evaluated. Furthermore, spectral-ripple discrimination ability has been shown to correlate significantly with vowel and consonant recognition in quiet (Henry et al. 2005; Litvak et al. 2007), word recognition in noise (Won et al. 2007), and music perception abilities including complex-tone pitch direction discrimination, melody recognition, and timbre recognition (Won et al. 2010) for CI users. Drennan et al. (2010) used spectral-ripple discrimination to evaluate different CI sound encoding strategies, and Faulkner et al. (2010) used spectral-ripple discrimination for auditory training in CI listeners to improve the perception of spectral details in speech and music sounds. Because Won et al. (2007) showed that test–retest reliability for the spectral-ripple test was good, with minimal learning effects, the spectral-ripple test appears to be a stable measure that could be used in research as well as in the clinic. These previous studies suggest that spectral-ripple discrimination is an efficient and valuable tool for assessing spectral envelope sensitivity in CI users in a way that can predict hearing performance.

An additional benefit of spectral-ripple testing is that it is a non-linguistic psychophysical measure that could be helpful to researchers and clinicians when testing infants and children. Such information could potentially be used to improve CI signal processing or (re)habilitation strategies. However, the multiple-interval, alternative forced-choice (AFC) discrimination procedures that have been typically used with adult listeners (e.g., the three-interval, three-AFC procedure) are poorly suited for behavioral studies with infants and toddlers. The unchanging stimuli of a multiple-interval procedure are also unsuitable for measuring spectral-ripple discrimination electrophysiologically via auditory evoked potentials. For these reasons, the goal of the present study was to develop a new method of measuring spectral-ripple threshold. The new method uses a single-interval, yes/no paradigm with the method of constant stimuli. On each trial, either a “change” or a “no-change” stimulus was presented. The stimulus paradigm and physiological analysis techniques were modeled after those used by Ross et al. (2007). This single-interval method can be extended with minimal changes to test preverbal children using a conditioned head turn or observer-based procedure (e.g., Eisenberg et al. 2007; Dasika et al. 2009). Moreover, this type of stimulus presentation is also useful because the same stimuli and presentation mode used for behavioral perceptual testing are also feasible for studying the neural detection of spectral changes.

Electrophysiological measures have long been used to assess the integrity of the auditory system following implantation (for review, see Abbas and Brown 2000; Martin et al. 2008) as well as to assist with CI mapping (Brown et al. 2000). Electrophysiological measures can be especially helpful in assessing the integrity of the auditory system in infants and children when behavioral testing is challenging or not possible. For these reasons, the present study attempted to determine if electrophysiological responses could be evoked by spectral-ripple stimuli, and to estimate the physiological capacity of an individual’s auditory system in detecting spectral changes within spectral-ripple stimuli. The P1-N1-P2 complex, a cortical auditory evoked potential, is especially relevant for this purpose because the response is robust enough to be reliably identified in individuals. The P1-N1-P2 response may be elicited by the onset or offset of, or changes within, an acoustic stimulus (Spoor et al. 1969; Jerger and Jerger 1970; McCandless and Rose 1970). These P1-N1-P2 responses provide information about sound processing up to the level of the cortex, and can closely approximate behavioral discrimination thresholds (for review: Martin et al. 2008). For this reason, cortical auditory evoked potentials have been used extensively with CI listeners in response to pulse trains (Ponton et al. 1996) or speech sounds (Friesen and Tremblay 2006), as well as vocoded speech in normal-hearing listeners (Friesen et al. 2009).

The goals of the current study are (1) to determine if the previously used three-AFC procedure yielded results that were correlated with the new, single-interval, yes/no procedure; (2) to determine the feasibility of recording cortical P1-N1-P2 responses using the spectral-ripple stimuli; and (3) to determine if these physiological responses are related to behavioral spectral-ripple discrimination. In experiment 1, the proposed single-interval, yes/no procedure with the method of constant stimuli was conducted with adult CI listeners. Results were compared to data from the previously established spectral-ripple discrimination test which used a three-interval, three-AFC, adaptive procedure (e.g., Won et al. 2007). In experiment 2, the single-interval, yes/no spectral-ripple discrimination procedure was used to obtain behavioral and physiological measures in NH listeners. The effect of the number of vocoder-processing channels on behavioral thresholds and the P1-N1-P2 responses was examined. In experiment 3, the correlation between d′ values from the single-interval behavioral performance and the amplitudes of P1-N1-P2 change responses was examined.

Methods

Experiment 1: the development and validation of single-interval spectral-ripple discrimination

Subjects

Fourteen post-lingually deafened, generally high performing, adult CI users participated in experiment 1. They were 25–77 years old (mean = 57 years, six females and eight males). Individual CI subject information is listed in Table 1. All subjects listened to the stimuli using their own sound processor set to a comfortable listening level. CI sound processor settings were fixed so that sound was processed identically throughout all test batteries. The use of human subjects in this study was reviewed and approved by the University of Washington Institutional Review Board.

TABLE 1.

CI subject characteristics

Subject	Age (years)	Duration of hearing loss (years)^a	Duration of implant use (years)	Etiology	Implant type	Speech processor strategy	CNC word score (% correct)
S03	61	5	11	Genetic	Nucleus 22	SPEAK	72
S04	62	1	3	Unknown	Nucleus 24	ACE	88
S12	49	0	2	Connexin 26	MedEl Combi40+	CIS	36
S34	55		1.5	Noise exposure	HiRes90K	HiRes	62
S41	52	7	5	Hereditary	HiRes90K	HiRes	86
S42	68	5	3	Genetic	HiRes90K	Fidelity 120	52
S48	67	10	0.5	Unknown	HiRes90K	HiRes	72
S49	64	4	0.75	Hereditary	HiRes90K	Fidelity 120	86
S50	40	15	6	Genetic	Clarion CII	Fidelity 120	100
S51	56	7	6	Hereditary	Clarion CII	HiRes	92
S52	77	0	0.5	Noise exposure	HiRes90K	Fidelity 120	66
S53	63	3	7	Unknown	Clarion CII	Fidelity 120	96
S54	25	0.5	2.5	Unknown	HiRes90K	HiRes	92
S55	65	40	(L)1.4; (R)0.5	(L)Genetic; (R)Genetic	(L)HiRes90K; (R)HiRes90K	(L)HiRes; (R)HiRes	96

^aThe duration of their hearing loss before implantation

Stimuli and procedures

The single-interval, yes/no paradigm spectral-ripple discrimination test

Four hundred pure-tone frequency components were summed to generate the rippled noise stimuli. Amplitudes of the components were determined by a full wave-rectified sinusoidal envelope on a logarithmic amplitude scale. The 400 tones were spaced equally on a logarithmic frequency scale. The ripple peaks were spaced equally on a logarithmic frequency scale. Bandwidth of the stimuli was 100–5,000 Hz. The peak-to-valley ratio of the ripple spectra was 30 dB. Standard and inverted (ripple phase reversed) ripple stimuli were generated. For standard ripples, the phase of the full wave-rectified sinusoidal spectral envelope was set to zero radians; and for inverted ripples, it was set to π/2. Using standard and inverted ripples, two types of test stimuli were generated: the “standard-standard” ripple and the “standard-inverted” ripple.
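
The construction described above can be sketched in a few lines of Python. This is a minimal illustration, not the authors' code: the envelope formula `depth_db * |sin(pi * density * octave + phase)|` is an assumed reading of "full wave-rectified sinusoidal envelope on a logarithmic amplitude scale," and the function names, the sample rate, and the use of uniformly random starting phases are hypothetical choices.

```python
import math
import random


def ripple_component_levels_db(density, inverted=False, n_tones=400,
                               f_low=100.0, f_high=5000.0, depth_db=30.0):
    """Return (frequencies_hz, levels_db) for one rippled spectrum.

    density  : ripples per octave
    inverted : False -> envelope phase 0 rad, True -> pi/2 rad
    """
    phase = math.pi / 2 if inverted else 0.0
    n_octaves = math.log2(f_high / f_low)
    freqs, levels = [], []
    for i in range(n_tones):
        # Tones are equally spaced on a logarithmic frequency axis.
        octave = n_octaves * i / (n_tones - 1)
        freqs.append(f_low * 2.0 ** octave)
        # Full-wave rectified sinusoid: `density` peaks per octave (assumed form).
        envelope = abs(math.sin(math.pi * density * octave + phase))
        # 0 dB at spectral peaks, -depth_db at valleys (30 dB peak-to-valley).
        levels.append(depth_db * envelope - depth_db)
    return freqs, levels


def synthesize(freqs, levels_db, duration_s, fs, rng):
    """Sum the pure tones with random starting phases (slow, illustrative)."""
    amps = [10.0 ** (lv / 20.0) for lv in levels_db]
    phases = [rng.uniform(0.0, 2.0 * math.pi) for _ in freqs]
    n_samples = int(round(duration_s * fs))
    return [sum(a * math.sin(2.0 * math.pi * f * t / fs + p)
                for a, f, p in zip(amps, freqs, phases))
            for t in range(n_samples)]
```

Under this assumed formula, a phase shift of π/2 moves every spectral peak to the position of a former valley, which is why π/2 rather than π produces the inverted ripple. At 1 ripple/octave, for instance, the 100-Hz component sits in a valley for the standard ripple and on a peak for the inverted ripple.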

For the standard-inverted case (i.e., the change stimulus), the first 2 s consisted of a standard ripple and the last 2 s contained an inverted ripple. Therefore, there was a spectral change at the 2-s point. Each half of each stimulus was individually created and then concatenated. To eliminate the possibility of a loudness change at the 2-s point of the standard-inverted stimuli, the root mean square of the first (standard ripple) and the last (inverted ripple) parts were matched. After amplitude matching, the whole 4-s stimulus was ramped with 150-ms rise/fall times, and filtered with a long-term, speech-shaped filter (Byrne et al. 1994). Because the concatenation could produce frequency components beyond 5 kHz at the 2-s point, a 5 kHz low-pass filter was applied to the entire 4-s stimulus. To prevent a subject from perceiving a cue due to the temporal pattern of transition from standard to inverted ripple at the 2-s point, the phase of both the first and last 2 s was randomized for every presentation using random phases for the 400 individual frequency components.

For the standard-standard case (i.e., no-change stimulus), a 4-s duration standard ripple stimulus was created and ramped with 150-ms rise/fall times, then filtered with a long-term, speech-shaped filter (Byrne et al. 1994) and with a 5 kHz low-pass filter. Therefore, there was no change within the stimulus.
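
The assembly of a change stimulus (RMS matching, concatenation, and onset/offset ramping) can be sketched as follows. This is an illustrative sketch only: the helper names are hypothetical, the raised-cosine ramp shape is an assumption (the text specifies only the 150-ms duration), and the speech-shaped and 5-kHz low-pass filters are deliberately omitted.

```python
import math


def rms(samples):
    """Root mean square of a list of samples."""
    return math.sqrt(sum(v * v for v in samples) / len(samples))


def match_rms(reference, target):
    """Scale `target` so that its RMS equals the RMS of `reference`."""
    gain = rms(reference) / rms(target)
    return [v * gain for v in target]


def apply_ramps(samples, fs, ramp_s=0.150):
    """Apply onset and offset ramps (raised-cosine shape is an assumption)."""
    n_ramp = int(round(ramp_s * fs))
    out = list(samples)
    for i in range(n_ramp):
        gain = 0.5 * (1.0 - math.cos(math.pi * i / n_ramp))
        out[i] *= gain          # rise
        out[-1 - i] *= gain     # fall
    return out


def build_change_stimulus(standard_half, inverted_half, fs):
    """Concatenate RMS-matched halves, then ramp the whole stimulus."""
    inverted_matched = match_rms(standard_half, inverted_half)
    return apply_ramps(standard_half + inverted_matched, fs)
```

For the no-change stimulus, a single 4-s standard ripple would be passed directly to `apply_ramps`. Matching RMS before concatenation is what removes an overall level step at the 2-s point, so that only the spectral change remains as a cue.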

Figure 1 shows waveforms (upper panels) and spectrograms (middle panels) for the standard-inverted and standard-standard stimuli for the 1 ripple/octave case. The bottom row of Fig. 1 shows example waveforms for the standard-inverted and standard-standard stimuli from 1.98 to 2.02 s, showing that there is no temporal discontinuity cue, such as a sudden amplitude change, that a subject could use to detect a change at the midpoint of the standard-inverted stimuli. There is therefore only a spectral cue that a subject can use to detect a change within the standard-inverted stimuli.

FIG. 1 — Stimuli waveforms (*upper panels*), spectrograms (*middle panels*), and zoom-in of time–domain waveforms from 1.98 to 2.02 s (*lower panels*) for a standard-inverted stimulus (*left side*) and a standard-standard stimulus (*right side*). The ripple density was 1 ripple/octave for both stimuli.

Figure 2 shows the CI sound processor output in response to the standard-inverted ripple stimuli using the HiResolution strategy. The upper two plots show how the HiResolution strategy encodes the standard-inverted ripple stimuli over the duration of the stimuli. The left panel shows the output for a spectral-ripple density of 1.0 ripple/octave, and the right panel shows the output for a ripple density of 6.0 ripples/octave. The lower panels show average outputs over the first 2 s (i.e., standard phase) and the last 2 s (i.e., inverted phase) of the standard-inverted stimuli. The upper left panel (1 ripple/octave) shows a distinct change at the 2-s point for all electrodes, whereas the upper right panel (6 ripples/octave) shows no visible change at the 2-s point in the outputs of any of the 16 electrodes. The lower panel plots confirm this trend. Multiple peaks and valleys are well represented across the electrodes when ripple density is low. At higher ripple density, however, the distance between the peaks and valleys is decreased, so the sound processor presents reduced spectral contrast.

FIG. 2 — The sound processor outputs for spectral-ripple density of 1 ripple/octave (*left panel*) and 6 ripples/octave (*right panel*) are shown. The *upper panel plots* show electrodograms for the standard-inverted ripple stimuli, which represent the biphasic pulses (in μA) computed by the HiResolution strategy of Advanced Bionics devices. The *lower panel plots* show average outputs (in μA) over the duration of the standard phase (first 2 s, *filled circles*) and the inverted phase (last 2 s, *open circles*) ripple stimuli for 16 electrodes. Electrode 16 represents the highest frequency channel.

A single administration of the single-interval, yes/no paradigm spectral-ripple discrimination task involved four blocks of test trials. Sounds were presented at 65 dBA in the free field in a sound-treated booth via a single loudspeaker, positioned 1 m from the subject. Ripple stimuli were generated with 15 different densities: 0.25, 0.5, 0.707, 1, 1.414, 2, 2.828, 4, 5.657, 8, 11.314, 16, 22.627, 32, and 40 ripples/octave. The method of constant stimuli was used. Ten consecutive ripple densities from 0.25 to 8 ripples/octave were tested; however, if a subject could discriminate 8 ripples/octave stimuli above chance levels, higher ripple densities were tested. Each ripple density was presented 10 times in random order within a block: five times for standard-standard ripple stimuli and another five times for standard-inverted ripple stimuli. Therefore, at each ripple density, a total of 20 standard-standard and 20 standard-inverted ripple stimuli were presented across the four test blocks. Subjects performed the task using a mouse and computer screen placed in front of and to the right of them. Subjects were instructed to listen to the sound and click on an on-screen button labeled “Yes” if they thought there was a change within the stimulus, or “No” if they thought there was not a change. Percent correct and hit rate were calculated for each block. Feedback was not provided. The testing required 1 h for each subject.

Threshold was defined as the ripple density corresponding to d′ = 1. Data from both Won et al. (2007) and the present study suggest that psychometric functions for spectral-ripple discrimination generally follow an expected reverse-sigmoid shape as a function of increasing ripple density. It was desirable to accurately estimate threshold from points statistically shown to lie on the slope of the psychometric function, rather than points on either the upper or lower asymptote. As in several previous studies, confidence interval analyses were used to identify points lying on the upper asymptote, slope, and lower asymptote of the psychometric function (e.g., Buss et al. 1986; Bargones et al. 1995; Dasika et al. 2009). At each ripple density, d′ and 80% confidence intervals for d′ were computed from aggregate hit and false alarm rates obtained from the four blocks. To avoid undefined values of d′, hit rates of 0 were converted to 0.5/N_s and hit rates of 1 were converted to 1 − 0.5/N_s, where N_s was the number of standard-inverted stimuli presented at a given ripple density (i.e., 20; cf. Macmillan and Creelman 2005). Hit rate proportions of 1 or 0 were thus converted to 0.975 or 0.025. Similarly, false alarm rates of 0 or 1 were converted to 0.025 or 0.975. Eighty percent confidence intervals for d′ were estimated by using the approximation described by Gourevitch and Galanter (1967; cf. Macmillan and Creelman 2005). Eighty percent as opposed to 95% confidence intervals were used to identify a greater number of data points on the slope.
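
As a worked illustration of this computation, the sketch below derives d′ and its confidence interval from hit and false-alarm counts. It assumes the standard yes/no definition d′ = z(H) − z(F) and the variance approximation commonly attributed to Gourevitch and Galanter (1967), var(d′) ≈ H(1 − H)/(N_s φ(z_H)²) + F(1 − F)/(N_n φ(z_F)²); the function names are hypothetical and the exact form used by the authors is not stated in the text.

```python
from statistics import NormalDist

_STD = NormalDist()  # standard normal distribution


def adjust_rate(count, n):
    """Convert proportions of 0 or 1 to 0.5/n or 1 - 0.5/n."""
    rate = count / n
    if rate == 0.0:
        return 0.5 / n
    if rate == 1.0:
        return 1.0 - 0.5 / n
    return rate


def dprime_with_ci(hits, n_signal, false_alarms, n_noise, confidence=0.80):
    """Return (d', lower limit, upper limit) for a yes/no task."""
    h = adjust_rate(hits, n_signal)
    f = adjust_rate(false_alarms, n_noise)
    z_h, z_f = _STD.inv_cdf(h), _STD.inv_cdf(f)
    d = z_h - z_f
    # Assumed variance approximation (see lead-in).
    var = (h * (1.0 - h) / (n_signal * _STD.pdf(z_h) ** 2)
           + f * (1.0 - f) / (n_noise * _STD.pdf(z_f) ** 2))
    z_crit = _STD.inv_cdf(0.5 + confidence / 2.0)  # about 1.28 for 80%
    half_width = z_crit * var ** 0.5
    return d, d - half_width, d + half_width
```

With 20 change and 20 no-change trials per density, perfect performance (20 hits, 0 false alarms) is converted to rates of 0.975 and 0.025, so the largest d′ obtainable at any single density under this rule is about 3.92.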

The upper asymptote was taken as the maximum performance obtained at any ripple density. If the upper limit of the 80% confidence interval of d′ of a given data point equaled or exceeded asymptotic d′, the data point was generally assumed to lie on the upper asymptote. If the lower limit of the 80% confidence interval of d′ for a given data point was lower than d′ = 0 (chance performance), the point was generally assumed to lie on the lower asymptote. Slope was estimated using all successive data points with upper and lower confidence intervals contained entirely within the interval between the upper and lower asymptotes. Because few points were sometimes identified on the slope, the upper asymptotic point corresponding to the highest ripple density and the lower asymptotic point corresponding to the lowest ripple density were also included in the slope calculation. Sensitivity (d′) versus ripple density (ripples/octave) data was fit by linear regression. Threshold was defined as the ripple density at which the regression line equaled 1 (i.e., d′ = 1).
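
The point-selection and regression steps can be sketched as below. This is a simplified reading of the rules above: the text says points were "generally" assigned to an asymptote, so the strict comparisons here are an assumption, and the function names and tuple layout `(density, d', lower limit, upper limit)` are hypothetical.

```python
def classify(points):
    """Label each (density, d, lo, hi) point as 'upper', 'lower', or 'slope'."""
    d_max = max(p[1] for p in points)  # upper asymptote = best performance
    labels = []
    for _, _, lo, hi in points:
        if hi >= d_max:
            labels.append("upper")
        elif lo < 0.0:                 # interval includes chance (d' = 0)
            labels.append("lower")
        else:
            labels.append("slope")
    return labels


def select_fit_points(points):
    """Slope points plus the bordering upper and lower asymptotic points."""
    labels = classify(points)
    upper = [p for p, lab in zip(points, labels) if lab == "upper"]
    lower = [p for p, lab in zip(points, labels) if lab == "lower"]
    chosen = [p for p, lab in zip(points, labels) if lab == "slope"]
    if upper:
        chosen.append(max(upper, key=lambda p: p[0]))  # highest-density upper point
    if lower:
        chosen.append(min(lower, key=lambda p: p[0]))  # lowest-density lower point
    return sorted(chosen, key=lambda p: p[0])


def fit_line(xs, ys):
    """Ordinary least-squares line; returns (slope, intercept)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def threshold_from_points(points, criterion=1.0):
    """Ripple density at which the fitted line crosses d' = criterion."""
    fit = select_fit_points(points)
    slope, intercept = fit_line([p[0] for p in fit], [p[1] for p in fit])
    return (criterion - intercept) / slope, slope
```

The returned slope is in units of d′ per ripple/octave, so a negative value reflects the decline in sensitivity as ripple density increases.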

The three-interval, three-alternative forced-choice (3-AFC) spectral-ripple discrimination test

The three-interval, 3-AFC procedure used the same stimuli described by Won et al. (2007). Stimuli of 500 ms duration were generated by summing 200 pure-tone frequency components. Each 500-ms stimulus was either a standard or inverted ripple. The ripple densities included 0.125, 0.176, 0.250, 0.354, 0.500, 0.707, 1.000, 1.414, 2.000, 2.828, 4.000, 5.657, 8.000, and 11.314 ripples/octave. The spectral-ripple resolution threshold was determined using a two-up, one-down adaptive procedure, converging on 70.7% correct (Levitt 1971). Each test run began with 0.176 ripples/octave and moved in equal ratio steps of 1.414. The presentation level was roved within trials (7 dB range in 1 dB steps). Subjects were asked to click on one of three on-screen buttons, labeled 1, 2, and 3, after the stimuli were presented. One stimulus (i.e., inverted ripple sound, test stimulus) was different from the two others (i.e., standard ripple sound, reference stimulus). The subject’s task was to identify the test stimulus. Threshold for a single test run was estimated by averaging the final eight of 13 reversals. The primary dependent variable was determined by averaging the threshold from six different test runs. Feedback was not provided. The testing took 30 min for each subject.
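
The adaptive track can be sketched as follows. The sketch assumes that "two-up, one-down" means ripple density increases one step after two consecutive correct responses and decreases one step after any incorrect response, and that the run threshold is the arithmetic mean of the ripple densities at the final eight reversals. The `is_correct` callback, the `max_trials` guard, and the omission of level roving are illustrative simplifications.

```python
DENSITIES = [0.125, 0.176, 0.250, 0.354, 0.500, 0.707, 1.000,
             1.414, 2.000, 2.828, 4.000, 5.657, 8.000, 11.314]


def run_track(is_correct, start_index=1, n_reversals=13, n_average=8,
              max_trials=1000):
    """Run one adaptive track and return the estimated threshold.

    is_correct : callable taking a ripple density and returning True/False
                 (stands in for one 3-AFC trial with a listener).
    """
    index = start_index            # 0.176 ripples/octave
    streak = 0                     # consecutive correct responses
    last_direction = 0             # +1 = harder, -1 = easier, 0 = none yet
    reversals = []
    trials = 0
    while len(reversals) < n_reversals:
        trials += 1
        if trials > max_trials:
            raise RuntimeError("track did not reach the required reversals")
        if is_correct(DENSITIES[index]):
            streak += 1
            if streak < 2:
                continue           # need two in a row before stepping up
            streak = 0
            direction = +1
        else:
            streak = 0
            direction = -1
        if last_direction and direction != last_direction:
            reversals.append(DENSITIES[index])
        last_direction = direction
        index = min(max(index + direction, 0), len(DENSITIES) - 1)
    tail = reversals[-n_average:]
    return sum(tail) / len(tail)
```

For an idealized listener who is always correct at or below 2 ripples/octave and always incorrect above it, the track alternates between 2.000 and 2.828 ripples/octave and the estimate settles midway between them.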

Speech reception threshold in noise test

The procedure for administering the speech reception threshold (SRT) test was the same as that previously described by Won et al. (2007), and similar to Turner et al. (2004). Subjects were asked to identify one randomly chosen spondee word out of a closed set of 12 equally difficult spondees (Harris 1991). The spondees, two-syllable words with equal emphasis on each syllable, were recorded by a female talker. Two background noises were used: two-talker babble (one male and one female) and steady-state, speech-shaped noise. The female talker for the babble was different from the female talker for the spondees. The same background noise was used on every trial in order to eliminate variance that might arise from variability in the background stimulus. The onset of the spondees was 500 ms after the onset of the background noise. The two-talker babble and steady-state noise had a duration of 2.0 s. The level of the speech was held constant at 65 dBA, while the level of the noise was tracked using a one-down, one-up procedure and 2-dB steps. Feedback was not provided. The threshold for a single test run was estimated by averaging the signal-to-noise ratio for the final 10 of 14 reversals. The primary dependent variable was the mean SRT of the six test runs.

Word recognition in quiet test

Fifty consonant–nucleus–consonant (CNC) monosyllabic words (e.g., “Home, June, Pad, Sun”) recorded by a male talker were presented, from an open set, in sound-field at 65 dBA (Peterson and Lehiste 1962). Each CNC word list was randomly chosen out of 10 lists for each subject. A total percent correct score was calculated as the percent of words correctly repeated. Feedback was not provided.

Results

Spectral-ripple threshold

For the single-interval procedure, the mean threshold for the 14 CI subjects was 6.16 ripples/octave with a 95% confidence interval of 1.70 ripples/octave. For the three-interval procedure, the mean threshold was 2.12 ripples/octave with a 95% confidence interval of 0.56 ripples/octave.

Characteristics of psychometric function

For the three-interval procedure, Won et al. (2007) showed a psychometric function in which there was a monotonic decrease as a function of ripple density with an upper asymptote in the ability to discriminate the standard and inverted ripple stimuli. The psychometric functions constructed from the single-interval procedure showed a similar pattern. Figure 3 shows the psychometric functions obtained from the single-interval procedure, plotting d′ as a function of ripple density for each of the 14 CI subjects. In the slope region, d′ monotonically decreased as ripple density increased. Calculated slope values are indicated in each panel (e.g., −0.89 d′/ripple for S04).

FIG. 3 — Psychometric functions for single-interval spectral-ripple discrimination for 14 CI subjects. Each panel represents data for an individual subject. Data points used for linear regression fits are shown as *filled circles*. Linear fits are shown as *solid lines*. The *second number shown in upper right corner in each panel* shows threshold. The *third number shown in upper right corner* shows psychometric-function slope (in units of d′/ripple). *Error bars* show 80% confidence intervals. *Open circles* represent points other than those used to estimate the slope (e.g., upper and lower asymptotic points).

Comparing thresholds obtained from the single- and three-interval procedures

Figure 4 shows that the thresholds obtained from the single-interval procedure were correlated with the thresholds obtained from the three-interval procedure for the 14 subjects. A highly significant correlation was found between the two methods (r = 0.84, p = 0.0002).

FIG. 4 — Relationship between spectral-ripple thresholds determined using the three-interval procedure and those derived from the single-interval procedure in 14 CI subjects. Linear regression is represented by the *solid line*.

Correlation between the single-interval spectral-ripple test and speech measures

Figure 5 shows that the single-interval spectral-ripple thresholds were significantly correlated with speech perception performance. Significant negative correlations were found between the single-interval spectral-ripple thresholds and SRTs in two-talker babble (r = −0.64, p = 0.01) and in steady-state noise (r = −0.68, p = 0.008). A significant correlation was also found between the single-interval spectral-ripple thresholds and CNC word recognition scores in quiet (r = 0.72, p = 0.004). These results were consistent with previously reported data from the three-interval procedure (Won et al. 2007; Henry et al. 2005). Table 2 shows correlations between d′ values at each ripple density and speech perception scores.
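
The reported p values can be checked against the reported r values with a short calculation. The sketch below assumes that the p values come from the conventional two-tailed t test of a Pearson correlation, t = |r|·sqrt((n − 2)/(1 − r²)) with n − 2 degrees of freedom, which the text does not state explicitly. The function names are hypothetical, and the tail area is obtained by simple numerical integration so that only the standard library is needed.

```python
import math


def t_pdf(t, df):
    """Probability density of Student's t distribution."""
    norm = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return norm * (1.0 + t * t / df) ** (-(df + 1) / 2)


def correlation_p_value(r, n, upper=60.0, steps=60000):
    """Return (t statistic, two-tailed p) for a Pearson r from n subjects."""
    df = n - 2
    t = abs(r) * math.sqrt(df / (1.0 - r * r))
    # Trapezoidal integration of the upper tail from t to `upper`;
    # the density beyond `upper` is negligible for these df.
    h = (upper - t) / steps
    area = 0.5 * (t_pdf(t, df) + t_pdf(upper, df))
    area += sum(t_pdf(t + i * h, df) for i in range(1, steps))
    return t, 2.0 * area * h


for r in (0.84, -0.64, -0.68, 0.72):
    t_stat, p = correlation_p_value(r, 14)
    print(f"r = {r:+.2f}, t(12) = {t_stat:.2f}, p = {p:.4f}")
```

Under this assumption, r = 0.84 with 14 subjects gives t(12) of about 5.36, and r = −0.68 gives a p value between 0.005 and 0.01, consistent with the p = 0.008 reported above for steady-state noise.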

FIG. 5 — Relationship between single-interval spectral-ripple thresholds and speech reception thresholds in noise (*left panel*) and CNC scores (*right panel*). Linear regressions are represented by the *dashed line* for two-talker babble and the *solid line* for steady-state noise in the *left panel*.

TABLE 2.

Correlations of d′ values at each ripple density with speech perception tests and 3-AFC ripple test

RIPPLE DENSITY	0.25	0.5	0.707	1	1.414	2	2.828	4	5.657	8
Speech in babble
Speech in steady noise						−0.57	−0.54
CNC word				0.57	0.54	0.61	0.59	0.63	0.68	0.70
3-AFC ripple					0.55		0.65	0.64	0.67	0.65

Only correlation values with p < 0.05 are shown; nonsignificant correlations are left blank

Discussion

In experiment 1, a significant correlation (r = 0.84) was found between the thresholds obtained using the two procedures. Most importantly, consistent with previous reports by Won et al. (2007), there were significant correlations between the single-interval spectral-ripple test and speech perception in the presence of two-talker babble (r = −0.64, p = 0.01), steady-state noise (r = −0.68, p = 0.008), and with CNC word recognition scores (r = 0.72, p = 0.004). With that said, caution needs to be taken when interpreting correlation results because they do not necessarily mean that the same hearing mechanisms were involved during the single-interval ripple test, three-interval ripple test, and speech perception tasks. The integrity of the auditory systems of CI users having excellent three-interval spectral-ripple thresholds might be quite good and thus show overall good performance. Even so, our results suggest that the single-interval, yes/no paradigm is a comparable method for evaluating spectral-ripple discrimination in CI users.

The mean spectral-ripple threshold was 6.16 ripples/octave for the single-interval test and 2.12 ripples/octave for the three-interval test. The higher thresholds obtained using the single-interval test compared to the three-interval test might, in part, be explained by procedural differences between the two methods. In the single-interval method, listeners were presented with either a single standard-standard ripple or standard-inverted ripple stimulus that was 4 s in duration. Thus, listeners could use the ripple phase inversion at the 2-s midpoint in the single-interval procedure, thereby providing an immediate comparison between the two portions of the same stimulus (see Figs. 1 and 2). This immediate comparison might involve a sensory-trace comparison as described by Durlach and Braida (1969) as well as Kidd et al. (1988) during spectral shape discrimination. In contrast, the three-interval procedure included three stimuli whereby an inverted ripple stimulus was presented along with two other standard ripple stimuli. The three-interval procedure required that listeners select the interval that sounded different from the other two, a task that likely places more demand on short-term memory and other cognitive abilities to make the comparison. We speculate that this additional cognitive load contributed to poorer ripple thresholds. In addition, the duration of the single-interval stimulus for each ripple phase is longer (2 s) than the interval duration of the 3-AFC stimuli (500 ms), so the subjects had a longer duration for comparison with the single-interval test. Another difference is that the single-interval procedure used the method of constant stimuli and estimated a threshold that corresponded to d′ = 1, whereas the three-interval procedure used a two-up, one-down adaptive procedure, converging on the 70.7% correct point (i.e., d′ = 1.28) on the psychometric function. Although the differences between the two procedures could contribute to the elevated spectral-ripple threshold in the single-interval procedure, it should be emphasized that the high, significant correlations between the results obtained with the two methods and the speech recognition scores suggest that the single-interval test can be used to evaluate spectral-ripple discrimination in CI listeners.
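
The correspondence between 70.7% correct and d′ ≈ 1.28 in a 3-AFC task can be illustrated numerically. The sketch below assumes the standard equal-variance Gaussian signal detection model with an unbiased observer, for which the proportion correct in an m-alternative task is P_c = ∫ φ(x − d′) Φ(x)^(m−1) dx; the function names and the integration settings are hypothetical.

```python
from statistics import NormalDist

_STD = NormalDist()  # standard normal distribution


def pc_mafc(dprime, m=3, lo=-8.0, hi=12.0, steps=4000):
    """Proportion correct in an m-AFC task for a given d' (trapezoid rule)."""
    h = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        x = lo + i * h
        weight = 0.5 if i in (0, steps) else 1.0
        # Signal interval yields x; the other m - 1 intervals must fall below x.
        total += weight * _STD.pdf(x - dprime) * _STD.cdf(x) ** (m - 1)
    return total * h


def dprime_for_pc(target_pc, m=3):
    """Invert pc_mafc by bisection (P_c increases monotonically with d')."""
    lo, hi = 0.0, 6.0
    for _ in range(50):
        mid = 0.5 * (lo + hi)
        if pc_mafc(mid, m) < target_pc:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


print(round(pc_mafc(0.0), 3))          # chance performance with three intervals
print(round(dprime_for_pc(0.707), 2))  # d' at the 70.7% correct point
```

Under this model the adaptive 3-AFC track therefore targets a somewhat stricter sensitivity criterion than the d′ = 1 criterion of the single-interval test, which is in the direction of the lower three-interval thresholds noted above.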

Table 2 shows correlations between d′ values at each ripple density and speech perception performance and spectral-ripple thresholds obtained with the 3-AFC paradigm. Generally, d′ at the slopes showed significant correlations with speech perception, whereas d′ at the coarsest density did not. At lower ripple densities, listeners could have used an intensity cue, but the d′ values at lower ripple densities did not correlate with speech perception or with spectral-ripple thresholds obtained with the 3-AFC ripple test. In contrast, the d′ values at higher ripple densities, where spectral sensitivity is required for discrimination, were significantly correlated with speech perception and the 3-AFC spectral-ripple thresholds. These results further suggest that the single-interval ripple test evaluates spectral sensitivity, which correlates with speech perception in CI users.

Experiment 2: the effect of number of channels on behavioral and physiological spectral-ripple threshold

In experiment 2, behavioral and physiological measures of spectral-ripple resolution were obtained from NH listeners. The single-interval procedure stimuli from experiment 1 were used for cochlear implant vocoder simulations. The effect of the number of vocoder processing channels on both behavioral and physiological spectral-ripple resolution was also investigated. The CI vocoder simulations serve as a model of CI users’ performance. We chose to test normal-hearing subjects listening to CI vocoder simulations to avoid confounding artifacts when recording cortical responses from CI users (e.g., Friesen and Picton 2010). The fact that cortical P1-N1-P2 “change” responses can be recorded using the spectral-ripple stimuli processed with the vocoder simulation suggests that it is feasible to use the single-interval spectral-ripple stimuli to explore the behavioral–physiological relationship of spectral-ripple discrimination in CI users.