Author manuscript; available in PMC 2014 Nov 15.
Published in final edited form as: Neuroimage. 2013 Jun 7;82:500–509. doi: 10.1016/j.neuroimage.2013.06.001

NEURAL CORRELATES OF ADAPTATION IN NORMAL HEARING SUBJECTS TO FREE LEARNING WITH COCHLEAR IMPLANT ACOUSTIC SIMULATIONS

Christopher J Smalt 1, Javier Gonzalez-Castillo 2, Thomas M Talavage 1, David B Pisoni 3, Mario A Svirsky 4
PMCID: PMC3764992  NIHMSID: NIHMS504172  PMID: 23751864

Abstract

Neurobiological correlates of adaptation to spectrally degraded speech were investigated with fMRI before and after exposure to a portable real-time speech processor that implements an acoustic simulation model of a cochlear implant (CI). The speech processor, in conjunction with isolating insert earphones and a microphone to capture environmental sounds, was worn by participants over a two-week chronic exposure period. fMRI and behavioral speech comprehension testing were conducted before and after this two-week period. After using the simulator for 2 hours each day, participants significantly improved in word and sentence recognition scores. fMRI showed that these improvements were accompanied by changes in patterns of neuronal activation; in particular, we found additional recruitment of visual, motor, and working memory areas after the perceptual training period. These findings suggest that the human brain can adapt in a short period of time to a degraded auditory signal in a natural learning environment, and they give insight into how a CI might interact with the central nervous system. This paradigm can be extended to investigate the neural correlates of new rehabilitation, training, and signal processing strategies non-invasively in normal hearing listeners to improve CI patient outcomes.

1. INTRODUCTION

Cochlear implants (CIs) can restore hearing to profoundly deaf individuals through direct stimulation of auditory neurons in the VIII cranial nerve. Patient performance on language tasks after implantation varies substantially, both in terms of asymptotic performance level and in terms of the speed with which this level is achieved (Bassim et al., 2005). In the immediate postoperative period, some patients exhibit poor speech comprehension and are sometimes only capable of detecting the presence of sounds. Improvement of CI user performance typically involves months of rehabilitation (sometimes as long as one year) that culminates in asymptotic performance levels that vary among patients, even those who share a common implant model and electrical stimulation strategy (Dorman, 1993).

Cochlear implants use a microelectrode array surgically implanted into the inner ear, which results in a fixed number of sites where the nerve population can be effectively stimulated; see Wilson (2000) for a description of the basic functioning principles of CIs. This results in limited spectral resolution for CI users, with typical performance shown to be equivalent to having 8 spectral bands or channels of information (Shannon et al., 2004). Neuroimaging studies have demonstrated that the processing of acoustic signals in the brain differs between CI users and normal hearing subjects. Some of the first positron emission tomography (PET) studies showed that auditory stimulation with CIs produces bilateral activations in the primary and secondary auditory cortex (Herzog et al., 1991). Subsequent studies found that activations in these areas differed between normal hearing subjects and different groups of CI users (Giraud et al., 2001a; Giraud et al., 2000; Naito et al., 1997; Wong et al., 1999). Moreover, differences were also found in the inferior frontal cortex (Wong et al., 1999), as well as in regions outside classical language areas such as dorsal occipital cortex (Giraud et al., 2001a), precuneus (Giraud et al., 2001a), and subcortical auditory regions (Giraud et al., 2000). Additionally, studies of speech perception with CI users have reported recruitment of cortical regions such as visual cortex (Giraud et al., 2001b; Green et al., 2008; Kang et al., 2004) or the posterior cingulate (Kang et al., 2004), regions not usually reported for normal hearing subjects. These combined data provide evidence that CI users and normal hearing subjects use differing neural strategies to achieve speech comprehension. However, in these studies it is difficult to distinguish whether the neuronal differences result only from adaptation to processing the degraded signal or from other factors such as recovery from long periods of deafness or maturation—the latter being a particular confound in studies involving young children. In addition, each CI user has a unique deafness history, with the onset, mechanism, and nature of deafness being patient-specific, so it is impractical to control the perception of external stimuli across individual CI users. A second limitation is that almost all CI devices are incompatible with functional magnetic resonance imaging (fMRI), the most powerful of the present ensemble of functional neuroimaging technologies with respect to localization and tracking of changes over time. Therefore, it is impractical to study this intuitively desirable population for the purposes envisioned.

Rather, the larger population of normal hearing listeners represents the ideal target population. Acoustic simulation models of CIs have become a common means to test processing strategies (Shannon et al., 2004) and evaluate training (Harnsberger et al., 2001; Li and Fu, 2007). These models manipulate the acoustic signal in the same way as a typical CI, simulating for a normal hearing listener what a real CI user might hear. Davis and Johnsrude (2003) and Narain et al. (2003) observed a left-lateralized response to vocoded speech, primarily in the superior temporal gyrus and in inferior frontal and premotor cortices, as compared to normal speech. Other studies looked at adaptation specifically (e.g., Eisner et al. 2010, Hervais-Adelman et al. 2012, Erb et al. 2012). Eisner et al. (2010) observed the effect of short-term training in the scanner by interleaving blocks of training and testing with vocoded sentences, finding that the left inferior frontal gyrus (IFG) and left superior temporal sulcus (STS) were sensitive to the learnability of the sentences, while the left angular gyrus (AG) reflected the most significant activation directly related to the learning process. Hervais-Adelman et al. (2012) attempted to exclude the confound of sentence-level processing by focusing on single words, and reported differences between clear speech and vocoded speech in the left insula and in prefrontal and motor cortices, but found no interaction between test sessions (before and after training) and speech type. Instead of training inside the scanner, Erb et al. (2012) sought to predict individual differences in vocoded speech adaptation ability by correlating it with various metrics, including amplitude modulation rate discrimination and gray matter volume (left thalamus).

Typically, acoustic models of CI speech are used for offline processing in a laboratory, a drawback that prevents interaction with the environment and with scenarios truly representative of what CI users face in the real world. Further, offline processing makes it impossible for participants to hear their own voice in real time or during conversation. The goal of this study was to investigate the neural correlates of adaptation to degraded speech using a free-learning paradigm that permits subjects to adapt in natural everyday environments, linking perception and sensorimotor systems. In addition to measuring changes in speech recognition performance, functional magnetic resonance imaging (fMRI) was conducted for each subject before and after a two-week exposure period, which involved wearing a real-time portable CI simulator for 2 hours each day. The portable CI simulator, implemented on the iPod Touch (Smalt et al., 2011), was based on work by Kaiser and Svirsky (2000) and can be used to evaluate different CI training protocols.

2. METHODS

2.1 Subjects

Fifteen right-handed, self-reported normal hearing subjects (9 male, 6 female) were recruited from the Purdue University student body to participate in the experiment involving a two-week exposure period with vocoded speech. In addition, eight control subjects (5 male, 3 female) were recruited who participated in the behavioral testing only, with no training in between. Participants were closely matched in age (mean age: 24 ± 3 years) and had no history of auditory disorders. Only two subjects (exposure group) had ever been exposed to noise-vocoded speech; none of the other participants had any experience with that type of auditory stimulus. All subjects gave informed consent in compliance with a protocol approved by the Institutional Review Board of Purdue University.

2.2 Experimental Paradigm

This experiment consisted of three components: (1) an unsupervised learning period with a real-time portable cochlear implant simulator to improve performance in degraded speech comprehension; (2) behavioral testing sessions conducted in a sound treated room before and after the learning period to measure any changes in word comprehension of degraded speech; and (3) two fMRI imaging sessions performed prior and subsequent to each behavioral session. Fig. 1 provides a schematic of the experimental protocol.

FIGURE 1.

Schematic showing relationship between two-week exposure period, functional imaging and behavioral testing sessions.

2.3 Portable CI Simulator

A CI speech processor uses an array of non-overlapping filters to divide the acoustic signal into frequency bands corresponding to the number of electrodes in the CI. The amplitude and rate of electrical stimulation in each channel is determined from the information in the corresponding frequency band. In a noise-vocoded simulation of a CI, a band-pass noise carrier in each band is modulated by the envelope extracted from that frequency band, with the resulting acoustic signal being the sum of all channels (Shannon et al., 1995).

A portable implementation of CI simulation on the iPod Touch (Smalt et al., 2011) was developed based on the vocoder of Kaiser and Svirsky (2000), and was used in this experiment to degrade naturally produced speech in real time. The program runs on the Apple iPod Touch and iPhone, devices that carry an ARM processor equipped with a vector floating point unit. The Apple RemoteIO audio libraries enabled the real-time simulation to run at a latency of 10 ms from audio input to output. The portability of the iPod provided a number of benefits for training purposes, allowing participants to hear not only sounds in their environment but also their own speech. Butterworth infinite impulse response (IIR) filters of order N=4 were used in each of eight un-shifted (i.e., ideally inserted, no basalward shift) noise band channels, with the filter bank spaced in the range from 250 Hz to 8 kHz according to the Greenwood (1990) frequency-position function.
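For illustration, the following is a minimal offline sketch of such an eight-channel noise vocoder in Python/SciPy. It is not the real-time iPod implementation: the Hilbert envelope stands in for the envelope extraction stage, all function names are ours, and the input is assumed to be a float array.

```python
import numpy as np
from scipy.signal import butter, sosfilt, hilbert

def greenwood_edges(n_bands=8, f_lo=250.0, f_hi=8000.0):
    """Band edges from Greenwood's (1990) frequency-position function,
    f(x) = A*(10**(a*x) - k), with human parameters A=165.4, a=2.1, k=0.88
    and x the normalized cochlear position in [0, 1]."""
    A, a, k = 165.4, 2.1, 0.88
    x_lo = np.log10(f_lo / A + k) / a
    x_hi = np.log10(f_hi / A + k) / a
    x = np.linspace(x_lo, x_hi, n_bands + 1)
    return A * (10.0 ** (a * x) - k)

def noise_vocode(signal, fs, n_bands=8, order=4):
    """Offline noise vocoder: band-pass each channel, extract its envelope,
    and use the envelope to modulate band-limited noise. Assumes float input."""
    edges = greenwood_edges(n_bands)
    out = np.zeros_like(signal)
    for lo, hi in zip(edges[:-1], edges[1:]):
        sos = butter(order, [lo, hi], btype="bandpass", fs=fs, output="sos")
        band = sosfilt(sos, signal)
        envelope = np.abs(hilbert(band))   # stand-in for rectify + low-pass
        carrier = sosfilt(sos, np.random.randn(len(signal)))
        out += envelope * carrier
    return out
```

A real-time processor would apply the same filter bank to short audio buffers rather than the whole signal at once, which is what keeps the input-to-output latency near 10 ms.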

2.4 Behavioral Training and Testing Procedures

To assess perceptual learning as a result of exposure to the real-time CI simulator, the device was worn by all 15 normal-hearing participants for two hours per day over a two-week chronic exposure period. Participants were instructed to actively listen to speech or music throughout each two-hour period. The device recorded the total run time of the program to verify that participants used it for the required duration. The apparatus consisted of a lapel microphone and insert earphones rated at 35–42 dB noise isolation (Etymotic, 2010).

Testing consisted of word and sentence recognition tasks carried out in a sound treated room before and after exposure under free field conditions (Smalt et al., 2011); participants wore their portable simulator during the entire testing session. Three tests of degraded speech performance were used: phonetically balanced (PB) words, PRESTO multi-talker sentences, and semantically anomalous sentences (Park et al., 2010). Although all stimuli were presented in their original form through the loudspeaker, participants only heard degraded speech through their simulator. The number of presentations of each stimulus type was 50 (PB words), 90 (PRESTO sentences), and 50 (anomalous sentences), with no stimuli repeated across sessions. Subjects were asked to repeat out loud the word or sentence they heard while wearing the portable CI simulator. Each stimulus was presented twice, yielding two corresponding responses. Scoring was based on the percent correct of key words, as obtained from a recording of each participant's verbal responses, and the overall score for a given test was taken as the mean score across all of that subject's trials.
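As an illustration of the scoring rule, a hypothetical key-word scorer might look like the sketch below. The function name and the simple word-match tokenization are our assumptions; the actual scoring was performed by the experimenters from recorded responses.

```python
def keyword_score(response, keywords):
    """Percent of target key words present in the (transcribed) spoken response.

    Hypothetical helper: assumes a case-insensitive exact word match is an
    acceptable stand-in for manual scoring of recorded responses.
    """
    resp_words = set(response.lower().split())
    hits = sum(1 for w in keywords if w.lower() in resp_words)
    return 100.0 * hits / len(keywords)

# e.g., keyword_score("the boy ran home", ["boy", "walked", "home"]) -> 66.7
```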

Overall differences in speech perception before and after exposure were evaluated with a three-way ANOVA for each test type (PB, PRESTO, anomalous sentences). Factors in the analysis were group (exposure, control), session (pre/post), and subject, with planned comparisons across sessions to look for a learning effect. Only the first response to each stimulus was used in the ANOVA; a sketch of this analysis follows.
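A minimal sketch of the within-subject part of this analysis (session × test type, with subject as the random factor) using statsmodels; the file name and column names are our assumptions, and the group factor would be handled by running the model per group or with a mixed model.

```python
import pandas as pd
from statsmodels.stats.anova import AnovaRM

# Hypothetical long-format table: one row per subject x session x test trial,
# with 'score' holding percent key words correct (first responses only).
df = pd.read_csv("scores.csv")  # columns: subject, session, test, score

res = AnovaRM(df, depvar="score", subject="subject",
              within=["session", "test"], aggregate_func="mean").fit()
print(res)  # F-statistics for session, test, and their interaction
```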

2.5 fMRI Experimental Paradigm

Patterns of neural activation as measured by the fMRI BOLD effect were evaluated before and after chronic exposure to the portable CI simulator using an auditory sentence comprehension blocked experimental paradigm previously demonstrated to be highly reliable across imaging sessions at the group level (Gonzalez-Castillo and Talavage, 2011). In the current study, two functional runs (duration=380 s) of the above-mentioned paradigm were conducted for each of two run types: normal and degraded (CI simulated) speech. Each functional run had the same structure of blocks (Fig. 2), comprising 5 repetitions of the following sequence: visual instructions (5 s); auditory stimulus block (25 s); visual instructions (5 s); response/attention control task (15 s); and rest (20 s). During the visual instruction and response/attention control task blocks, no auditory stimulus was presented. In the auditory stimulus blocks, participants listened to six sentences from the City University of New York (CUNY) corpus (Boothroyd et al., 1988), presented binaurally via pneumatic headphones while looking at a fixation crosshair. Following this listening period, subjects responded to 3 fill-in-the-blank multiple choice questions displayed on the screen regarding one of the previously presented sentences, selected at random. Subjects were familiarized with the task by a practice run before entering the scanner during each session. This experimental paradigm was performed twice for each participant, approximately two weeks apart (sentences were never repeated within or across sessions). Comparing across sessions (pre/post exposure), changes in brain activation exhibited by the CI sentences that differ from the normal condition would likely result from changes in processing strategies due to adaptation to the transformed inputs.
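To make the block structure concrete, the sketch below builds the Listen-block regressor for one run by convolving a boxcar with a canonical double-gamma HRF. This is our own illustration, not the SPM8 code used in the study; the 30 s difference between 5 × 70 s of blocks and the published 380 s run duration is assumed to be lead-in/lead-out time.

```python
import numpy as np
from scipy.stats import gamma

TR = 2.5                                     # seconds, from the EPI protocol
blocks = [("instructions", 5), ("listen", 25), ("instructions", 5),
          ("response", 15), ("rest", 20)]    # one 70 s cycle, repeated 5 times
cycle = sum(dur for _, dur in blocks)

dt = 0.1                                     # fine time grid (s)
t = np.arange(0, 5 * cycle, dt)
boxcar = np.zeros_like(t)
onset = 0.0
for _ in range(5):
    for name, dur in blocks:
        if name == "listen":
            boxcar[(t >= onset) & (t < onset + dur)] = 1.0
        onset += dur

# Canonical double-gamma HRF (SPM-like shape parameters 6 and 16)
h_t = np.arange(0, 32, dt)
hrf = gamma.pdf(h_t, 6) - gamma.pdf(h_t, 16) / 6.0

regressor = np.convolve(boxcar, hrf)[: len(t)] * dt
regressor_at_tr = regressor[:: round(TR / dt)]   # sample on the TR grid
```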

FIGURE 2.

fMRI auditory comprehension block paradigm, previously shown to be highly reliable across imaging sessions at the group level. Subjects listen to 6 sentences in each block, followed by multiple choice questions presented visually. Each block is repeated 5 times per run (figure reproduced from Gonzalez-Castillo and Talavage, 2011).

2.6 fMRI Data Acquisition

Imaging was conducted at the Purdue MRI Facility (West Lafayette, IN) on a General Electric (Waukesha, WI) 3T Signa HDx scanner, using a 16-channel brain-array coil (Invivo), with scan parameters matched to those of Gonzalez-Castillo and Talavage (2011). During the two imaging sessions, a single high resolution axial fast spoiled gradient-echo acquisition (38 slices, slice thickness=3.8 mm, spacing=0 mm, FOV=24 cm, in-plane matrix=256×256) was followed by four functional runs. Functional runs were obtained using a multi-slice echo planar imaging sequence (TR=2.5 s, TE=22 ms, 38 slices, slice thickness=3.8 mm, spacing=0 mm, in-plane matrix=64×64, FOV=24 cm, flip angle=77°). Additionally, at the end of each imaging session, sagittal high resolution T1-weighted images (number of slices=190; slice thickness=1.0 mm; FOV=24 cm; in-plane matrix=256×224) were acquired for alignment and presentation purposes. In addition to imaging data, behavioral data were collected while subjects were in the scanner. Subjects indicated their selection of the missing word during Response blocks via a 4-button response box (CURDES Fiber Optic Response Box Model No: HH-2×4-C). Both responses and response times were recorded.

2.7 fMRI Data Analysis

Data analysis was conducted using SPM8 software (http://www.fil.ion.ucl.ac.uk/spm/). Pre-processing for each subject included: (1) discarding three initial volumes to account for T1 saturation; (2) motion correction; (3) registration of each functional run to the subject's segmented gray matter maps from the T1-weighted anatomical scan; (4) segmentation and transformation of the anatomy to standard MNI space; (5) application of the MNI transformation to the EPI data; and (6) spatial smoothing with a Gaussian kernel (full width at half-maximum = 6 mm). Individual subject analyses were first performed in all subjects to compare fMRI activation in the Listen block against the Rest block (Listen > Rest) using the canonical HRF. This analysis was performed for both CI simulated and normal sentences. Statistical maps of activation from the (Listen > Rest) condition were compared across subjects using a paired t-test across sessions (pre/post exposure) and across sentence types (normal/vocoded speech). Two thresholds were used when analyzing fMRI results: one for significance with multiple comparison correction (pFDR<0.05) and one slightly relaxed to look for trends (p<0.0005).
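The group comparison can be sketched as follows with nilearn (our illustration, not the SPM8 batch actually used): the paired t-test across sessions is expressed as a one-sample t-test on per-subject post-minus-pre contrast maps, which is statistically equivalent, and the file names are hypothetical.

```python
import pandas as pd
from nilearn.glm.second_level import SecondLevelModel
from nilearn.glm import threshold_stats_img

# Hypothetical per-subject difference maps (post - pre) for the CI simulated
# Listen > Rest contrast; a paired t-test reduces to a one-sample t-test on
# these differences.
diff_maps = [f"sub-{i:02d}_CI_listen_post-minus-pre.nii.gz" for i in range(1, 16)]

design = pd.DataFrame({"intercept": [1] * len(diff_maps)})
model = SecondLevelModel().fit(diff_maps, design_matrix=design)
z_map = model.compute_contrast("intercept", output_type="z_score")

# FDR-corrected threshold, mirroring the paper's pFDR < 0.05 criterion
thr_map, threshold = threshold_stats_img(z_map, alpha=0.05, height_control="fdr")
```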

2.8 Correlation of fMRI and Behavioral Analysis

A stepwise regression model (entrance threshold p=0.05, exit threshold p=0.10) was used to ascertain whether any correlation existed between the mean BOLD t-statistic over 116 predefined MarsBaR (Brett et al., 2002) regions of interest (ROIs) and behavioral measurements (Breedlove et al., 2012). The algorithm finds a model that best captures the variation in the data, limiting the over-fitting problem of simple multiple regression models. The stepwise regression uses the change in average fMRI t-statistic for each ROI to predict the out-of-scanner percent words correct scores combined across both sessions. The goal of this analysis is to investigate correlations between BOLD fMRI data and the behavioral metrics. The five variables (from Table 1) collected from out-of-scanner behavioral data were chosen as the prediction variables because they had the greatest number of trials and are most likely to give an accurate assessment of behavioral performance (Foster and Zatorre, 2010), whereas the constraints of fMRI limit the accuracy of in-scanner estimates (multiple choice vs. % words correct). The stepwise regression was run for both types of functional runs, with the hypothesis that the normal functional runs should show weaker correlations with performance measured using the portable CI simulator than the CI simulated runs. Data from the two sessions (pre/post) were combined into the same regression, while each fMRI sentence type was processed separately (normal, CI simulated).
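A minimal sketch of such a p-value-driven stepwise selection, assuming X is a subjects-by-ROI table of mean t-statistics and y holds the out-of-scanner percent-correct scores (forward entry at p<0.05, backward exit at p>0.10; all names are ours):

```python
import pandas as pd
import statsmodels.api as sm

def stepwise_select(X, y, p_enter=0.05, p_exit=0.10):
    """Forward selection with backward elimination on OLS p-values.

    X: DataFrame of mean ROI t-statistics (116 columns, one row per subject
    and session); y: percent key words correct, both sessions pooled.
    """
    selected = []
    while True:
        changed = False
        remaining = [c for c in X.columns if c not in selected]
        # Forward step: add the candidate with the smallest p-value below p_enter.
        pvals = pd.Series({
            c: sm.OLS(y, sm.add_constant(X[selected + [c]])).fit().pvalues[c]
            for c in remaining
        }, dtype=float)
        if not pvals.empty and pvals.min() < p_enter:
            selected.append(pvals.idxmin())
            changed = True
        # Backward step: drop any retained predictor whose p-value exceeds p_exit.
        if selected:
            pv = sm.OLS(y, sm.add_constant(X[selected])).fit().pvalues.drop("const")
            if pv.max() > p_exit:
                selected.remove(pv.idxmax())
                changed = True
        if not changed:
            return selected
```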

Table 1.

R-value correlations between in-scanner and out-of-scanner behavioral metrics.

Out of Scanner (All CI Simulated)

                  PB     PRESTO Resp 1   PRESTO Resp 2   Anom Resp 1   Anom Resp 2
In Scanner
Normal Sentences  0.00   0.19            0.18            0.24          0.16
CI Sentences      0.26   0.30*           0.35            0.31          0.28

Statistically significant correlations (p<0.05) are indicated with a *.

3. RESULTS

3.1 Behavioral Results

Figure 3 summarizes speech recognition performance pre/post training during the testing sessions in a sound treated booth while subjects wore their portable CI simulator (Fig. 3A), as well as during the fMRI experiment (Fig. 3B). Using a linear regression on behavioral performance (% key words correct) for degraded speech combined across sessions, the out-of-scanner PRESTO sentence scores were found to be correlated with the in-scanner responses to CUNY CI simulated sentences (% responses correct) (p < 0.05), but not with the normal sentences (p = 0.35). Table 1 reports the correlations between the in-scanner and out-of-scanner behavioral metrics, with the PRESTO sentences showing the highest degree of correlation.

FIGURE 3.

Behavioral results across the two sessions, conducted both in the sound booth with participants wearing their portable CI simulator (left panel) and during the fMRI sessions (right panel). Behavioral tasks included phonetically balanced (PB) single words, PRESTO sentences, and anomalous sentences (while wearing the portable device), and both normal and CI simulated CUNY sentences during the fMRI sessions. Significant training effects are observed both in and out of the scanner for CI simulated sentences after 2 weeks of exposure to the portable CI simulator. Behavioral performance on normal sentences in-scanner remained consistent across sessions.

3.1.1 Out of Scanner Results

Fifteen subjects participated in the 2-week exposure period and fMRI study, and at the group level they showed a significant improvement in percentage words correct on all of the tests conducted when using their portable CI simulator. In stark contrast to the controls, only 2 of the 15 subjects scored lower in the follow-up session for the PB words, 0/15 for the PRESTO sentences, and 0/15 for the anomalous sentences. Improvement on the PRESTO sentences across sessions ranged from 6.5% to 35%.

An omnibus 3-way ANOVA (A×B×C; A=Test Type [PB, PRESTO, Anomalous], fixed; B=session [Pre, Post], fixed; C=subject [1..15], random) on percent key words correct revealed a significant main effect of session (F1,78 = 17.51, p < 0.0001) and test type (F2,78 = 60.37, p < 0.0001), with no interaction (F2,78 = 0.97, p = 0.38). A priori Tukey-Kramer adjusted multiple comparisons between sessions for each test type (pre vs. post training) were all significant (p < 0.01). Based on the ANOVA, it can be concluded that the group of subjects who underwent the exposure period improved significantly between sessions on all tests with the portable CI simulator.

3.1.2 In Scanner Results

An omnibus 3-way ANOVA (A×B×C; A=Sentence Type [Normal, CI Simulated], fixed; B=session [Pre, Post], fixed; C=subject [1..15], random) demonstrated a significant main effect of session (F1,28 = 4.93, p < 0.03) and sentence type (F1,28 = 33.02, p < 0.0001), with no interaction (F1,28 = 2.29, p = 0.14). A priori Tukey-Kramer adjusted multiple comparisons between sessions for each sentence type (pre vs. post training) showed a significant increase in performance in response to CI simulated sentences (p < 0.01), but not for normal sentences (p = 0.62).

3.1.3 Behavioral Controls

A second group of eight control subjects did not undergo the exposure period and showed no statistically significant change in percentage words correct for any of the tests conducted across sessions while using the portable CI simulator (Fig. 4). Furthermore, 5 of the 8 subjects scored lower in the follow-up session for the PB words, 2/8 for the PRESTO sentences, and 3/8 for the anomalous sentences. Improvement on the PRESTO sentences across sessions ranged from −8.5% to 6.3%, far less than that of the group who underwent the free-learning period.

FIGURE 4.

Histogram of behavioral improvement across the two sessions for the control group (gray) and learning group (black). Behavioral performance did not significantly improve for control subjects.

An omnibus 3-way ANOVA (A×B×C; A=Test Type [PB, PRESTO, Anomalous], fixed; B=session [Pre, Post], fixed; C=subject [1..8], random) on percent key words correct revealed no main effect of session (F1,36 = 0.1, p = 0.75). A significant effect of test type was observed (F2,36 = 45.44, p < 0.0001), with no interaction (F2,36 = 0.18, p = 0.84). A priori Tukey-Kramer adjusted multiple comparisons between sessions for each test type (pre vs. post training) found no statistically significant differences (p = 0.65). Based on the ANOVA results, it can be concluded that the control group did not improve significantly from merely repeating the same behavioral testing protocol.

3.2 fMRI Results

3.2.1 Reproducibility

A paired t-test at the group level was used to compare activation across the two sessions for the normal sentences during the Listen block. If the task is consistent, the amount of significant change in activation across sessions should be negligible. This statistical analysis was performed on data collected for this study, as well as on the first 2 runs of 15 additional subjects previously acquired on the same scanner using an identical paradigm and processing steps (Gonzalez-Castillo and Talavage, 2011). No activation (pFDR<0.05) was found for the separate paired t-tests between sessions 1 and 2 or sessions 1 and 5 for the Gonzalez-Castillo and Talavage (2011) data. Similarly, no activation changes (pFDR<0.05) were found in the current study for normal sentences.

3.2.2 Group Comparisons across Session

To adequately understand the changes in activation related to the learning period with the CI simulator, it is necessary to make comparisons across sessions for both sentence types (normal, CI simulated). Unlike the normal sentences, CI simulated sentences showed statistically significant (pFDR<0.05) changes in activation across sessions; activated regions included the superior temporal gyrus (STG), insula, cuneus, precentral gyrus, and cingulate gyrus, as shown in Fig. 5. A summary of the significantly activated clusters across sessions for CI simulated sentences can be found in Table 2. The relaxed threshold (p<0.0005) revealed no new clusters of activation for this condition.

FIGURE 5.

Activation maps contrasting the two sessions (pre/post) for the CI simulated sentences at pFDR<0.05. Normal sentences produced no activation at this significance level. Yellow areas in the figure indicate regions where activation to the degraded speech significantly increased after exposure.

Table 2.

Clusters of significant activation for the contrast across the two fMRI sessions for CI sentences.

Side   x     y     z    Z-score   Volume [mm3]   Anatomical Label
R      39    −25   7    4.80      1080           Heschl's Gyrus (BA 41, 22)
L&R    3     17    40   4.03      837            Cingulate Gyrus (BA 32)
L      −9    −85   4    4.45      648            Cuneus (BA 17)
L      −30   26    1    4.19      324            Insula (BA 47, 13)
L&R    −6    8     52   3.65      270            Supp. Motor Area (BA 32)
L      −42   −34   7    3.79      162            Sup. Temporal Sulcus (BA 41)
R      33    20    4    3.71      108            Insula (BA 13)
L      −39   −58   31   −3.46     108            Angular Gyrus
L      −15   −43   28   −3.43     108            Cingulate Gyrus (BA 31)
R      54    2     −5   4.01      81             Sup. Temporal Pole

Significant activations for the pre/post contrast for CI sentences at pFDR<0.05 and a minimum cluster size of 81 mm3. Abbreviations: x=Right-Left; y=Anterior-Posterior; z=Inferior-Superior.

3.2.3 Group Comparisons across Sentence Type

Comparing the normal and CI sentence conditions in the session following the exposure period using a paired t-test revealed activation differences in the bilateral superior and middle temporal gyri (BA 21, 22) at a significance level of pFDR<0.05. At the more relaxed significance level of p<0.0005, the two sentence types showed activation differences in several regions after the free learning period, as shown in Fig. 6. Activated regions included the STG, insula, precuneus, occipital gyrus, and SFG. A summary of the significantly activated clusters when comparing sentence types after the learning period can be found in Table 3.

FIGURE 6.

Activation maps contrasting normal and CI simulated sentences for each session (p<0.0005). Red areas in the figure indicate regions where normal sentences produced significantly more activation than degraded sentences, while blue areas indicate regions where degraded sentences produced significantly more activation than normal sentences.

Table 3.

Clusters of significant activation for the contrast between normal and CI simulated sentences after the free learning period.

Side   x     y     z     Z-score   Volume [mm3]   Anatomical Label
L      −60   −16   −14   4.64      1674           *Mid. Temporal (BA 21)
R      57    −16   4     4.23      1620           *Heschl (BA 21, 22)
L      −45   −64   16    4.06      1188           *Sup. Temporal Sulcus (BA 39, 19)
L      −21   29    46    3.71      513            Sup. Frontal Gyrus (BA 8)
L      −3    −52   16    3.60      162            Precuneus (BA 30)
L      −30   17    7     −3.81     135            Insula (BA 13)
R      45    −73   −17   3.52      108            Inf. Occipital (BA 19)
R      33    20    4     −3.69     108            Insula (BA 13)

Significant activations for the normal/CI simulated contrast for the post session at pFDR<0.05 (indicated with a *) and p<0.0005, with a minimum cluster size of 81 mm3.

Abbreviations: x=Right-Left; y=Anterior-Posterior; z=Inferior-Superior.

3.3 Correlation of fMRI and Behavioral Analysis

Table 4 summarizes the correlations between the PRESTO behavioral data and the 116 predefined regions of interest (ROIs) for the functional runs using both CI simulated and normal sentences. For behavioral data collected outside the scanner, the CI simulated runs yielded a much larger number of retained ROIs as well as higher R² values, indicating that the CI functional runs are a good predictor of behavioral performance in these regions. The regions implicated in the statistical analysis also match the functional maps comparing across sessions for the CI simulated runs, including the right Heschl's gyrus, precentral gyrus, postcentral gyrus, supplemental motor area, cingulate gyrus, angular gyrus, and parietal lobe. Fig. 7 highlights each region for both the normal (Fig. 7A) and CI simulated (Fig. 7B) sentences, where the color scale indicates the resulting R² value after the stepwise regression.

Table 4.

Results of stepwise regressions relating the change in average t-statistic of the BOLD signal for the set of AAL ROIs (Tzourio-Mazoyer et al., 2002) to behavioral data across the two training sessions. Stepwise regressions were performed separately for each type of fMRI run (Normal, CI Simulated).

CI Simulated Sentences       R²       Normal Sentences       R²
Heschl_R                     .2569    Cerebelum_Crus1_L      .1739
Parietal_Inf_L               .2274    Cerebelum_6_R          .1596
Parietal_Inf_R               .2162    Calcarine_L            .1332
Parietal_Sup_R               .2085    Frontal_Mid_Orb_R      .1169
Parietal_Sup_L               .1996    Heschl_R               .1063
Paracentral_Lobule_L         .1911    Frontal_Mid_Orb_L      .1000
Temporal_Sup_R               .1779
Postcentral_R                .1740
Supp_Motor_Area_L            .1467
Supp_Motor_Area_R            .1396
Precentral_R                 .1371
Postcentral_L                .1327
Paracentral_Lobule_R         .1312
Cingulum_Mid_R               .1299
Angular_R                    .1265
Cingulum_Mid_L               .1258
Temporal_Pole_Sup_R          .1256
Precentral_L                 .1218
Temporal_Sup_L               .1208
Frontal_Inf_Oper_R           .1183
SupraMarginal_R              .1144
Frontal_Sup_R                .1022

FIGURE 7.

Map of the stepwise regression correlations between the BOLD t-statistic over a predefined list of ROIs and behavioral measurements, for both the CI simulated (top) and normal (bottom) sentences, using behavioral data collected outside the scanner with the portable CI simulator. Yellow indicates areas of highest correlation.

4. DISCUSSION

After chronic exposure to a real-time acoustic cochlear implant simulation, our results show a correlation between changes in behavior and functional measures, suggesting that short-term adaptation takes place and is physiologically measurable. In addition to improved sentence and word comprehension, we observed increased brain activation using fMRI in traditional and nontraditional speech areas, including working memory (insula, inferior frontal cortex), motor (supplemental motor area), and visual (cuneus) regions. These changes suggest that training translated into widespread recruitment of areas beyond auditory cortex.

The paradigm used in this study allows participants to hear their own voice while engaging in speech activities and brings the learning process to the more naturalistic environment of everyday life. This translated into significant changes within the dorsal stream of speech processing, believed to rely on motoric representations of speech to aid in speech comprehension. An additional advantage of the current setup is that utilizing degraded speech in normal hearing listeners avoids confounds such as differences in surgical implantation and in length of deafness prior to implantation.

4.1 Behavioral Adaptation to Degraded speech

Behavioral performance was assessed outside the scanner before and after the exposure period, while users wore the portable CI simulator, for both sentence tests (PRESTO, anomalous) and an open set word test (PB). Participants showed improvement in percent correct of key words after two weeks of exposure to the portable CI simulator. In addition, a statistically significant increase in comprehension of CUNY sentences was observed in the scanner after the learning period for CI simulated sentences but not for normal sentences. Across both sessions, the PRESTO sentence scores obtained with the portable CI simulator outside the scanner were found to be correlated with the CUNY CI simulated task scores collected during the fMRI scan. Normal CUNY sentences presented during the fMRI sessions were not correlated with behavioral measures collected with the portable CI simulator, an expected result since performance improved only for the CI simulated tasks. Taken together with the behavioral control data, these results provide strong evidence that the performance increase was actually due to the exposure to degraded speech, not merely familiarity with the task or other factors related to the experimental environment (Loubinoux et al., 2001; Specht et al., 2003). The same previously used in-scanner testing strategy (Gonzalez-Castillo and Talavage, 2011) also revealed no performance change across sessions for normal sentences. Our behavioral results demonstrate that exposure to degraded speech can transfer to novel stimuli in a short time period, even when “training” occurs in an uncontrolled environment, and are consistent with previous perceptual learning studies with vocoded speech (Davis et al., 2005; Hervais-Adelman et al., 2008) and even non-words (Hervais-Adelman et al., 2011).

4.2 Neural Correlates of Adaptation to Degraded speech

In addition to studying behavioral change due to the exposure to degraded speech in normal hearing individuals, we sought to investigate the neural correlates of this processing using fMRI. By comparing brain activations that occur across session for CI simulated sentences but not normal speech, we aimed to isolate changes related to adaptation of the degraded speech. We also compared our fMRI results with behavioral measures to examine which changes in activation are correlated with changes in performance in predefined regions of interest.

4.2.1 Reproducibility

Our fMRI protocol was identical to that used previously for data collected under the same paradigm, scanner, and analysis (Gonzalez-Castillo and Talavage, 2011). One difference between studies is that sessions S01, S02 and S05 were collected, on average, within 7.8 days in the Gonzalez-Castillo data set, as opposed to just over two weeks apart in the current study. Also, in the original study five runs per condition were available, while here there were only two. The original study reported 86% voxel consistency over the five sessions at the group level, and no statistically significant activation change across sessions for normal speech. Our reproducibility analysis based on two sessions confirmed the lack of statistically significant activation change for normal speech under the current modified experimental setup. This result permits us to presume that changes exhibited across sessions for the CI simulated sentences, but not for the normal sentences, are likely due to changes in neural processing caused by the exposure period.

4.2.2 Changes in Degraded Speech Processing across Session

When comparing across sessions, activation patterns show significant changes in a set of distributed brain regions, including non-auditory areas, for CI simulated sentences as compared to normal sentences (see Table 2). Our hypothesis is that after a learning period, in addition to changes in primary and secondary auditory areas, new functional pathways are also established in both the dorsal (mapping sound onto articulatory-based representations) and ventral (mapping sound onto meaning) streams of speech processing (Hickok and Poeppel, 2004, 2007) to accommodate the changes in behavioral performance.

Primary and secondary auditory areas

Activation changes across sessions were found for spectrally degraded speech in both the left and right superior temporal gyrus and sulcus (STG/STS). More activation after training was observed in the right hemisphere near Heschl’s gyrus for CI sentences, perhaps indicating that more spectral information is being recovered from the signal. This is consistent with previous studies suggesting that right auditory cortex is involved in fine spectral analysis of auditory input (Schönwiesner et al., 2005; Zatorre et al., 2002). Additionally, Wernicke's area (BA 22 and BA 39) also showed strong increases in activation with training. This area is associated with a post-sensory interface that accesses long-term semantic knowledge, and is not typically activated by degraded speech when an abstract representation is not available (Obleser and Eisner, 2009; Obleser et al., 2007). Activation of this region only after training suggests that subjects were able to construct these associations during the training.

Ventral Speech Processing Stream

Our results show strong activation changes after the learning period with degraded speech in portions of the ventral stream, namely the bilateral anterior and posterior middle temporal gyrus (MTG, BA 21–22). Consistent with the Hickok and Poeppel model, left hemisphere activations were slightly dominant in this stream and most likely indicate an improvement in lexical comprehension. Since the stimuli for the fMRI portion of the experiment were sentences, it is likely that increased intelligibility upstream or elsewhere in the system led to improved lexical processing.

Dorsal Speech Processing Stream

Training with degraded speech was also accompanied by changes in areas belonging to the dorsal processing stream, namely the insula and left inferior frontal cortex (BA 47). In particular, changes in the insular region (BA 47, BA 13) suggest that working memory may be enhanced after training and that stored information regarding the degraded speech is being utilized to improve performance. Beyond the insula activation reported for vocoded speech (Hervais-Adelman et al., 2012), the left inferior frontal cortex (BA 47) has been implicated in music (Levitin and Menon, 2003) and in speech processing associated with rehearsal and maintenance of verbal information (Petrides et al., 1993). Our results could suggest that after two weeks subjects have a better feature representation of the CI simulated speech to access and aid in comprehension. Previous studies have suggested that both normal and degraded speech stimuli can recruit the anterior insula when contrasting any active task with rest, in both normal hearing subjects (Binder et al., 1999; Binder et al., 1997) and CI users (Giraud et al., 2001a). Activations in anterior insula could then be a result of task demands, not necessarily a result of exposure to acoustic stimuli. Binder et al. (2004) suggested that the anterior insula may be recruited when additional decision-making processing is required of participants for some auditory signals. Our results could also support this hypothesis, suggesting that training may serve to enable this additional recruitment and that additional decision-making machinery is required for degraded speech as compared to normal speech.

Areas outside the streams of speech processing

Other areas whose activation changed after the learning period included the cuneus and the cingulate gyrus, both of which have been observed in PET imaging studies of CI users (Giraud et al., 2001a). Activation in the cuneus (secondary visual cortex) is not unexpected, as it has previously been observed in response to auditory stimulation in CI users (Giraud et al., 2001a; Kang et al., 2004), and it is well established that auditory and visual cortices receive polysensory information (Benevento et al., 1977). Secondary visual cortex is often thought of as an association area, and its recruitment in both CI users and the normal hearing listeners in this study indicates additional processing capabilities that develop after exposure, possibly enabling an improved abstract representation of degraded speech. Also, activation was found to be stronger in the precuneus for CI simulated speech as compared to regular speech after training. Kang et al. (2004) offered a compatible interpretation, suggesting that a visual representation of acoustic sources might be possible and could improve task performance. Alternatively, the change in activation may simply be related to task demand. Sensorimotor and visual posterior regions of the precuneus have demonstrated resting-state connections to both working memory and premotor areas (Margulies et al., 2009), suggesting that better integration of sensory systems with stored representations may be occurring after exposure to the CI simulator. Finally, activation in the cingulate cortex has been identified in studies involving attention (Maddock, 1999; Maddock et al., 2003), memory (Maddock et al., 2001), and both normal (Xu et al., 2005) and degraded speech. Obleser et al. (2007) also found activation in this region while comparing high and low predictability vocoded sentences. Together, these findings suggest that the cingulate cortex may be involved in the semantic processing of speech recognition.

Future Work

One caveat of the current study is that the frequency shift that occurs in CI users due to incomplete electrode insertion was not modeled here, and it could affect which regions of activation are implicated. However, the iPod Touch application does have this capability implemented and could be used in future studies. Other potential studies might vary the number of frequency bands in the simulator to study adaptation across degrees of degradation, both with and without upward shifts to model CI insertion – an important consideration as the technology and surgical procedures continue to improve. More detailed knowledge about the adaptation process following cochlear implantation may help improve training protocols and speech perception clinical outcomes in the future.

5. CONCLUSION

Utilizing a portable acoustic simulation of a cochlear implant, we were able to identify neural correlates of adaptation to a distorted input using fMRI. After two weeks of exposure to the degraded speech for two hours each day, increased behavioral performance was accompanied by changes in patterns of neural activation measured with fMRI that suggest re-organization in both the auditory cortex and non-traditional speech areas. This adaptation could be necessitated by the increased processing demands of achieving a successful level of comprehension. Since the study was conducted in normal hearing listeners, specific confounds related to studies of CI users (i.e., the unique nature of each deafness) were not a factor.

The findings of this study suggest that shifts in behavioral performance are associated with several changes in brain function. First, increased activation in the auditory cortex, most noticeably in the right hemisphere, suggests that more information is being extracted from the spectral content of the signal. Second, the observation of increased supplemental motor area and visual cortex activity indicates increased utilization of phonological and abstract representations of the vocoded speech. These changes are accompanied by increased activation in brain areas related to the motor system, speech, and working memory, suggesting an enhanced integration of knowledge acquired during the learning period.

Highlights.

  • Participants were exposed to a portable real-time acoustic cochlear implant simulation.

  • After two-weeks of free-learning, participants improved in word recognition scores.

  • Activation increases with learning in auditory, motor, and working memory areas.

  • Behavioral performance changes were correlated with changes in brain function.

ACKNOWLEDGEMENTS

This work was partially supported by funding from NIH R01 (D.B.P.), NIH R01 DC03937 (M.A.S.), and NIDCD T32 DC00030 (C.J.S.).


REFERENCES

  1. Bassim MK, Buss E, Clark MS, Kolln KA, Pillsbury CH, Pillsbury HC 3rd, Buchman CA. MED-EL Combi40+ cochlear implantation in adults. Laryngoscope. 2005;115:1568–1573. doi:10.1097/01.mlg.0000171023.72680.95.
  2. Benevento LA, Fallon J, Davis B, Rezak M. Auditory-visual interaction in single cells in the cortex of the superior temporal sulcus and the orbital frontal cortex of the macaque monkey. Experimental Neurology. 1977;57:849–872. doi:10.1016/0014-4886(77)90112-1.
  3. Binder JR, Frost JA, Hammeke TA, Bellgowan PS, Rao SM, Cox RW. Conceptual processing during the conscious resting state. A functional MRI study. J Cogn Neurosci. 1999;11:80–95. doi:10.1162/089892999563265.
  4. Binder JR, Frost JA, Hammeke TA, Cox RW, Rao SM, Prieto T. Human brain language areas identified by functional magnetic resonance imaging. Journal of Neuroscience. 1997;17:353–362. doi:10.1523/JNEUROSCI.17-01-00353.1997.
  5. Binder JR, Liebenthal E, Possing ET, Medler DA, Ward BD. Neural correlates of sensory and decision processes in auditory object identification. Nature Neuroscience. 2004;7:295–301. doi:10.1038/nn1198.
  6. Boothroyd A, Hnath-Chisolm T, Hanin L, Kishon-Rabin L. Voice fundamental frequency as an auditory supplement to the speechreading of sentences. Ear and Hearing. 1988;9:306–312. doi:10.1097/00003446-198812000-00006.
  7. Breedlove EL, Robinson M, Talavage TM, Morigaki KE, Yoruk U, O'Keefe K, King J, Leverenz LJ, Gilger JW, Nauman EA. Biomechanical correlates of symptomatic and asymptomatic neurophysiological impairment in high school football. Journal of Biomechanics. 2012. doi:10.1016/j.jbiomech.2012.01.034.
  8. Brett M, Anton J, Valabregue R, Poline J. Region of interest analysis using an SPM toolbox [abstract]. Presented at the 8th International Conference on Functional Mapping of the Human Brain, June 2–6, 2002, Sendai, Japan. Available on CD-ROM in NeuroImage, Vol. 16.
  9. Davis MH, Johnsrude IS. Hierarchical processing in spoken language comprehension. Journal of Neuroscience. 2003;23:3423–3431. doi:10.1523/JNEUROSCI.23-08-03423.2003.
  10. Davis MH, Johnsrude IS, Hervais-Adelman A, Taylor K, McGettigan C. Lexical information drives perceptual learning of distorted speech: Evidence from the comprehension of noise-vocoded sentences. Journal of Experimental Psychology: General. 2005;134:222. doi:10.1037/0096-3445.134.2.222.
  11. Dorman MF. Speech perception by adults. In: Tyler RS, editor. Cochlear Implants. San Diego: Singular Publishing; 1993. pp. 145–190.
  12. Eisner F, McGettigan C, Faulkner A, Rosen S, Scott SK. Inferior frontal gyrus activation predicts individual differences in perceptual learning of cochlear-implant simulations. Journal of Neuroscience. 2010;30:7179–7186. doi:10.1523/JNEUROSCI.4040-09.2010.
  13. Erb J, Henry MJ, Eisner F, Obleser J. Auditory skills and brain morphology predict individual differences in adaptation to degraded speech. Neuropsychologia. 2012;50:2154–2164. doi:10.1016/j.neuropsychologia.2012.05.013.
  14. Etymotic. Earphone Comparison. 2010.
  15. Foster NE, Zatorre RJ. A role for the intraparietal sulcus in transforming musical pitch information. Cerebral Cortex. 2010;20:1350–1359. doi:10.1093/cercor/bhp199.
  16. Giraud AL, Price CJ, Graham JM, Frackowiak RS. Functional plasticity of language-related brain areas after cochlear implantation. Brain. 2001a;124:1307–1316. doi:10.1093/brain/124.7.1307.
  17. Giraud AL, Price CJ, Graham JM, Truy E, Frackowiak RS. Cross-modal plasticity underpins language recovery after cochlear implantation. Neuron. 2001b;30:657–663. doi:10.1016/s0896-6273(01)00318-x.
  18. Giraud AL, Truy E, Frackowiak RS, Gregoire MC, Pujol JF, Collet L. Differential recruitment of the speech processing system in healthy subjects and rehabilitated cochlear implant patients. Brain. 2000;123(Pt 7):1391–1402. doi:10.1093/brain/123.7.1391.
  19. Gonzalez-Castillo J, Talavage TM. Reproducibility of fMRI activations associated with auditory sentence comprehension. Neuroimage. 2011;54:2138–2155. doi:10.1016/j.neuroimage.2010.09.082.
  20. Green KM, Ramsden RT, Julyan PJ, Hastings DL. Neural plasticity in blind cochlear implant users. Cochlear Implants Int. 2008;9:177–185. doi:10.1179/cim.2008.9.4.177.
  21. Greenwood DD. A cochlear frequency-position function for several species—29 years later. Journal of the Acoustical Society of America. 1990;87:2592–2605. doi:10.1121/1.399052.
  22. Harnsberger JD, Svirsky MA, Kaiser AR, Pisoni DB, Wright R, Meyer TA. Perceptual "vowel spaces" of cochlear implant users: implications for the study of auditory adaptation to spectral shift. J Acoust Soc Am. 2001;109:2135–2145. doi:10.1121/1.1350403.
  23. Hervais-Adelman A, Davis MH, Johnsrude IS, Carlyon RP. Perceptual learning of noise vocoded words: Effects of feedback and lexicality. Journal of Experimental Psychology: Human Perception and Performance. 2008;34:460–474. doi:10.1037/0096-1523.34.2.460.
  24. Hervais-Adelman AG, Carlyon RP, Johnsrude IS, Davis MH. Brain regions recruited for the effortful comprehension of noise-vocoded words. Language and Cognitive Processes. 2012;27:1145–1166.
  25. Hervais-Adelman AG, Davis MH, Johnsrude IS, Taylor KJ, Carlyon RP. Generalization of perceptual learning of vocoded speech. Journal of Experimental Psychology: Human Perception and Performance. 2011;37:283. doi:10.1037/a0020772.
  26. Herzog H, Lamprecht A, Kuhn A, Roden W, Vosteen KH, Feinendegen LE. Cortical activation in profoundly deaf patients during cochlear implant stimulation demonstrated by H2(15)O PET. J Comput Assist Tomogr. 1991;15:369–375. doi:10.1097/00004728-199105000-00005.
  27. Hickok G, Poeppel D. Dorsal and ventral streams: a framework for understanding aspects of the functional anatomy of language. Cognition. 2004;92:67–99. doi:10.1016/j.cognition.2003.10.011.
  28. Hickok G, Poeppel D. The cortical organization of speech processing. Nature Reviews Neuroscience. 2007;8:393–402. doi:10.1038/nrn2113.
  29. Kaiser AR, Svirsky MA. Using a personal computer to perform real-time signal processing in cochlear implant research. Proceedings of the Ninth Digital Signal Processing Workshop, Hunt, Texas; 2000.
  30. Kang E, Lee DS, Kang H, Lee JS, Oh SH, Lee MC, Kim CS. Neural changes associated with speech learning in deaf children following cochlear implantation. Neuroimage. 2004;22:1173–1181. doi:10.1016/j.neuroimage.2004.02.036.
  31. Levitin DJ, Menon V. Musical structure is processed in "language" areas of the brain: a possible role for Brodmann Area 47 in temporal coherence. Neuroimage. 2003;20:2142–2152. doi:10.1016/j.neuroimage.2003.08.016.
  32. Li T, Fu QJ. Perceptual adaptation to spectrally shifted vowels: training with nonlexical labels. J Assoc Res Otolaryngol. 2007;8:32–41. doi:10.1007/s10162-006-0059-2.
  33. Loubinoux I, Carel C, Alary F, Boulanouar K, Viallard G, Manelfe C, Rascol O, Celsis P, Chollet F. Within-session and between-session reproducibility of cerebral sensorimotor activation: a test–retest effect evidenced with functional magnetic resonance imaging. J Cereb Blood Flow Metab. 2001;21:592–607. doi:10.1097/00004647-200105000-00014.
  34. Maddock RJ. The retrosplenial cortex and emotion: new insights from functional neuroimaging of the human brain. Trends Neurosci. 1999;22:310–316. doi:10.1016/s0166-2236(98)01374-5.
  35. Maddock RJ, Garrett AS, Buonocore MH. Remembering familiar people: the posterior cingulate cortex and autobiographical memory retrieval. Neuroscience. 2001;104:667–676. doi:10.1016/s0306-4522(01)00108-7.
  36. Maddock RJ, Garrett AS, Buonocore MH. Posterior cingulate cortex activation by emotional words: fMRI evidence from a valence decision task. Hum Brain Mapp. 2003;18:30–41. doi:10.1002/hbm.10075.
  37. Margulies DS, Vincent JL, Kelly C, Lohmann G, Uddin LQ, Biswal BB, Villringer A, Castellanos FX, Milham MP, Petrides M. Precuneus shares intrinsic functional architecture in humans and monkeys. Proceedings of the National Academy of Sciences of the United States of America. 2009;106:20069–20074. doi:10.1073/pnas.0905314106.
  38. Naito Y, Hirano S, Honjo I, Okazawa H, Ishizu K, Takahashi H, Fujiki N, Shiomi Y, Yonekura Y, Konishi J. Sound-induced activation of auditory cortices in cochlear implant users with post- and prelingual deafness demonstrated by positron emission tomography. Acta Otolaryngol. 1997;117:490–496. doi:10.3109/00016489709113426.
  39. Obleser J, Eisner F. Pre-lexical abstraction of speech in the auditory cortex. Trends in Cognitive Sciences. 2009;13:14–19. doi:10.1016/j.tics.2008.09.005.
  40. Obleser J, Wise RJS, Dresner MA, Scott SK. Functional integration across brain regions improves speech perception under adverse listening conditions. Journal of Neuroscience. 2007;27:2283–2289. doi:10.1523/JNEUROSCI.4663-06.2007.
  41. Park H, Felty R, Lormore K, Pisoni D. PRESTO: Perceptually robust English sentence test: Open set—Design, philosophy, and preliminary findings. The Journal of the Acoustical Society of America. 2010;127:1958.
  42. Petrides M, Alivisatos B, Meyer E, Evans AC. Functional activation of the human frontal cortex during the performance of verbal working memory tasks. Proceedings of the National Academy of Sciences of the United States of America. 1993;90:878–882. doi:10.1073/pnas.90.3.878.
  43. Schönwiesner M, Rübsamen R, von Cramon DY. Hemispheric asymmetry for spectral and temporal processing in the human antero-lateral auditory belt cortex. European Journal of Neuroscience. 2005;22:1521–1528. doi:10.1111/j.1460-9568.2005.04315.x.
  44. Shannon RV, Fu Q-J, Galvin JJ III. The number of spectral channels required for speech recognition depends on the difficulty of the listening situation. Acta Otolaryngol Suppl. 2004;552:50–54. doi:10.1080/03655230410017562.
  45. Shannon RV, Zeng FG, Kamath V, Wygonski J, Ekelid M. Speech recognition with primarily temporal cues. Science. 1995;270:303–304. doi:10.1126/science.270.5234.303.
  46. Smalt CJ, Talavage TM, Pisoni DB, Svirsky MA. Neural adaptation and perceptual learning using a portable real-time cochlear implant simulator in natural environments. Conf Proc IEEE Eng Med Biol Soc; Boston, MA, USA; 2011.
  47. Specht K, Willmes K, Shah NJ, Jancke L. Assessment of reliability in functional imaging studies. J Magn Reson Imaging. 2003;17:463–471. doi:10.1002/jmri.10277.
  48. Wilson BS. Cochlear implant technology. In: Niparko JK, Kirk KI, Mellon NK, Robbins AM, Tucci DL, Wilson BS, editors. Cochlear Implants: Principles & Practices. Philadelphia: Lippincott Williams & Wilkins; 2000. pp. 109–127.
  49. Wong D, Miyamoto RT, Pisoni DB, Sehgal M, Hutchins GD. PET imaging of cochlear-implant and normal-hearing subjects listening to speech and nonspeech. Hear Res. 1999;132:34–42. doi:10.1016/s0378-5955(99)00028-3.
  50. Xu J, Kemeny S, Park G, Frattali C, Braun A. Language in context: emergent features of word, sentence, and narrative comprehension. Neuroimage. 2005;25:1002–1015. doi:10.1016/j.neuroimage.2004.12.013.
  51. Zatorre RJ, Belin P, Penhune VB. Structure and function of auditory cortex: music and speech. Trends in Cognitive Sciences. 2002;6:37–46. doi:10.1016/s1364-6613(00)01816-7.