Skip to main content
The Journal of Neuroscience logoLink to The Journal of Neuroscience
. 2013 Oct 2;33(40):15868–15878. doi: 10.1523/JNEUROSCI.5466-12.2013

Facilitation of Inferior Frontal Cortex by Transcranial Direct Current Stimulation Induces Perceptual Learning of Severely Degraded Speech

Bernhard Sehm 1,3,, Tim Schnitzler 1, Jonas Obleser 2, Agnes Groba 1,4, Patrick Ragert 1, Arno Villringer 1,3, Hellmuth Obrig 1,3
PMCID: PMC6618468  PMID: 24089493

Abstract

Perceptual learning requires the generalization of categorical perceptual sensitivity from trained to untrained items. For degraded speech, perceptual learning modulates activation in a left-lateralized network, including inferior frontal gyrus (IFG) and inferior parietal cortex (IPC). Here we demonstrate that facilitatory anodal transcranial direct current stimulation (tDCSanodal) can induce perceptual learning in healthy humans. In a sham-controlled, parallel design study, 36 volunteers were allocated to the three following intervention groups: tDCSanodal over left IFG, IPC, or sham. Participants decided on the match between an acoustically degraded and an undegraded written word by forced same-different choice. Acoustic degradation varied in four noise-vocoding levels (2, 3, 4, and 6 bands). Participants were trained to discriminate between minimal (/Tisch/-FISCH) and identical word pairs (/Tisch/-TISCH) over a period of 3 d, and tDCSanodal was applied during the first 20 min of training. Perceptual sensitivity (d′) for trained word pairs, and an equal number of untrained word pairs, was tested before and after training. Increases in d′ indicate perceptual learning for untrained word pairs, and a combination of item-specific and perceptual learning for trained word pairs. Most notably for the lowest intelligibility level, perceptual learning occurred only when tDCSanodal was applied over left IFG. For trained pairs, improved d′ was seen on all intelligibility levels regardless of tDCS intervention. Over left IPC, tDCSanodal did not modulate learning but instead introduced a response bias during training. Volunteers were more likely to respond “same,” potentially indicating enhanced perceptual fusion of degraded auditory with undegraded written input. Our results supply first evidence that neural facilitation of higher-order language areas can induce perceptual learning of severely degraded speech.

Introduction

Speech comprehension requires the rapid mapping of complex sounds to meaning. Remarkably, the human brain affords this task almost without effort, although natural listening environments regularly provide highly variable, often degraded and distorted input. Moreover, listeners readily adapt even to highly adverse listening conditions, suggesting rapid plasticity in the speech comprehension network. Behavioral evidence indicates that linguistic competence at various levels supports adaptation to degraded speech: lexical information augments learning (Davis et al., 2005), but is not mandatory because training in an unknown language generalizes to the (untrained) native language depending on their phonological similarity (Bent et al., 2011). Adaptation profits from sentential semantics (Obleser et al., 2007) and phonological information supplied by nonwords (Hervais-Adelman et al., 2008), and even basic durational and spectral cues may support learning (Winn et al., 2012). Similarly, generalization to untrained material, a crucial trait of perceptual learning (Wohlwill, 1958), may apply to domains beyond speech, such as identification of environmental sounds (Loebach et al., 2009).

Supporting the notion of a tight interplay between linguistic top-down influences and auditory bottom-up processing, neuroimaging studies have delineated a left-lateralized network including superior temporal [superior temporal gyrus (STG)/superior temporal sulcus (STS)], inferior frontal [inferior frontal gyrus (IFG)], and inferior parietal [inferior parietal cortex (IPC)] cortices. It is relatively uncontroversial that STG/STS activation correlates with intelligibility (Scott et al., 2000). Activation may extend more anteriorly as a function of lexical predictability (Obleser and Kotz, 2010) and more posteriorly depending on the syntactic information supplied (Friederici et al., 2010). The IFG shows more complex behavior: activation increases with semantic integration demands, but this effect is “gated” by intelligibility (Obleser and Kotz, 2010). As shown in an electrophysiological study on single words, the frontotemporal loop impacts on degraded speech perception at a very early stage (Sohoglu et al., 2012). Most relevant here, left IFG activation has been shown to correlate with interindividual learning success in a short degraded speech learning paradigm performed in the scanner (Eisner et al., 2010).

In the same study, individual learning progress correlated with left IPC activation [angular gyrus (AG)], identifying IPC as another key area for top-down control during perceptual learning. More generally, IPC may afford meta-integration (Sharp et al., 2010a), converging pre-established semantic and contextual information. In this vein, a recent imaging study elegantly showed its role in “repair processes” during degraded speech comprehension, when auditory information is elusive (Shahin et al., 2009). On a supramodal level, the IPC is involved in decision-making processes particularly under uncertainty (Vickery and Jiang, 2009) and may modulate response bias (Eickhoff et al., 2011).

Proceeding from both behavioral and neuroimaging evidence, here we investigate whether facilitation of two higher-order nodes in the speech comprehension network modulates perceptual learning. Given its role during degraded speech comprehension, we hypothesize that facilitating left IFG during training enhances perceptual learning. This effect should be most prominent when severe degradation requires strong “top-down” predictions. For left IPC, the existing literature provides less concise predictions. Facilitation can be expected to modulate integration and decision processes during training, most likely increasing the tendency to converge auditory and written input. Such enhanced integration of multimodal percepts will be reflected in a shift in the criterion (C). Since learning requires optimization of C (i.e., reduction in potentially pre-existing response bias), the effect of IPC facilitation can be expected to depend on several factors. In case of a response bias before training, minimization of the latter should support learning, while induction or augmentation of a response bias should not support learning.

Materials and Methods

Participants

Thirty-six healthy volunteers, who were native speakers of German, without a history of hearing impairment or any other neurological or medical condition, participated in the study (mean age, 26.6 years; age range, 21–31 years; 18 females). None of the participants took any CNS-active medication during the experiments. Before participation, all subjects underwent a comprehensive neurological examination to screen for potential exclusion criteria. Participants who did not meet the protocol criteria and/or had contraindications for transcranial direct current stimulation (tDCS) were not included (Nitsche et al., 2008). According to the Oldfield questionnaire (Oldfield, 1971), all participants were strongly right handed. They gave written informed consent according to the Declaration of Helsinki and were financially compensated for participation according to the standard practice at the Institute. The Ethics Committee of the University of Leipzig approved of the study.

Stimuli and experimental conditions

We studied 200 minimal pairs (MPs) from a large corpus of monosyllabic and bisyllabic monomorphemic German nouns, which differed in the initial consonant and were phonological and graphematic neighbors without elision or addition of any phoneme or grapheme [e.g., Fisch–Tisch (English: fish–table)]. An equal number of corresponding identical pairs (IPs) of words were also constructed (e.g., Fisch–Fisch). These target stimuli were presented intermixed with 64 distractor stimuli, which were MPs that differed in the final consonant [e.g., Maus–Maul (English: mouse–mouth)] and their corresponding identical word pairs (e.g., Maus–Maus). These distractor stimuli did not enter the analysis, but were introduced to prevent participants from selectively attending to the initial phoneme during the task. All stimuli were controlled for word frequency (according to the Wortschatz Lexikon of the University of Leipzig; http://wortschatz.uni-leipzig.de) and for the total number of competing MP neighbors differing in the initial phoneme [e.g., Mutter (English: mother) has three minimal competitors, which are Butter, Kutter, and Futter (English: butter, boat, fodder)]. Target stimuli were also controlled for phonetic distance of the initial phonemes of both words of an MP regarding the features voicing coding, place coding, and manner coding.

Stimuli spoken by a female trained speaker were recorded in a sound-attenuated room at a sampling rate of 44.1 kHz. Postediting included downsampling to 22.5 kHz; cutting at zero crossings before and after each spoken word, including a fade-in and fade-out of 2 ms; and root mean square normalization. To vary intelligibility, each word was noise vocoded at 2, 3, 4, and 6 bands. Bands were chosen according to a pilot study (n = 16, different but demographically similar volunteers), which indicated error scores to provide a sufficient dynamic range for learning. Noise vocoding preserves the temporal detail but can parametrically vary the spectral detail of the auditory signal (Shannon et al., 1995). We followed a procedure that has been previously outlined (Rosen et al., 1999) and often applied in neuroimaging and behavioral studies (Obleser et al., 2008; Erb et al., 2012); all spectral information between 0.07 and 9 kHz entered the vocoding routine and was divided into filter bands according to the Greenwood formula (Greenwood, 1990), yielding approximately logarithmic spacing. A 400 Hz envelope low-pass filter was applied to each band. Since two-band noise vocoding conveys the spectral information only in two bands, this is the least intelligible version of the stimuli, while six-band vocoded speech can usually be decoded rather well after a short training session (Dahan and Mead, 2010).

All target stimuli (200 MPs and 200 IPs) and distractor stimuli (64 MPs and 64 IPs) were allocated to different conditions: training to discriminate between word pairs of half of the target stimuli (100 MPs and 100 IPs) and half of the distractor stimuli (32 MPs and 32 IPs) took place over 3 consecutive days, whereas participants were not trained on the other half. Instead, these were only presented at pretesting and post-testing (henceforth, these stimuli are referred to as “trained/untrained”; see training procedure below). Both trained and untrained stimuli as well as the distractors were allocated to the four different noise-vocoding levels (2, 3, 4, and 6 bands), yielding a total amount of 25 MPs, 25 IPs, and 8 distractor MPs and IPs per experimental condition. Across participants the allocation of the specific stimuli to the eight experimental conditions (2, 3, 4, and 6 bands; trained and untrained stimuli) was changed, thus further attenuating potential item-specific differences (see Fig. 2 for a graphical display of the stimulus categories).

Figure 2.

Figure 2.

Illustration of the material used in the present study. This altogether included 200 MPs, 200 IPs, and 128 distractors. The latter were either of type I (identical) or type II (minimal pairs) and were not analyzed. They solely served as a goal to enforce judgments on the complete lexical item and prevent judgments based selectively on the first phoneme.

Experimental procedure and trial design

Our behavioral measures were changes in perceptual sensitivity (d′) and C in a discrimination task between MPs and IPs. Each stimulus consisted of the auditory presentation of a word parametrically degraded by noise vocoding followed by the same (IP) or a different (MP) word presented in an undegraded written form. To compare item-specific learning (trained stimuli) and generalization (untrained stimuli), participants were trained on half of the stimuli over 3 days. Generalization (untrained stimuli) can be considered an operationalization of perceptual learning in the present task. In this article, “perceptual learning” is operationalized in that an increase in perceptual sensitivity and the resulting increase in “same-different” discrimination for untrained items is considered to signal generalization. Notably, this transfer is bound to impact at a sublexical level, while training intentionally relied on lexical items. We consider this transfer effect to allow one to probe perceptual learning for the case of degraded speech on the single-word level [for a more general discussion of perceptual learning, see Goldstone (1998)].

To investigate whether IFG or IPC facilitation modulates learning compared with sham stimulation, a between-subject (3 groups of 12 participants) was required, because our primary interest in perceptual learning excludes a within-subject design. To sum up, we used a mixed factorial design including the within-subject factors of stimulus type (IP/MP), auditory degradation (2, 3, 4, and 6 bands), training status (trained/untrained), time (pretraining/post-training), and the across-subjects factor stimulation type (IFG/IPC/sham stimulation). For the latter, participants were randomly allocated to three intervention groups that differed in the mode of brain stimulation during learning, as follows: anodal tDCS (tDCSanodal) over stimulated left IFG (IFGSTIM); tDCSanodal over stimulated left IPC (IPCSTIM), or sham stimulation (SHAMSTIM; see section on tDCS below).

The experiment proceeded over 4 consecutive days with a pretest on the first day, a post-test on the fourth day, and a total of three training sessions, one each on days 1, 2, and 3 (Fig. 1A). During both the training and pretest/post-test periods, a same-different forced-choice paradigm was used to train participants and to test their perceptual sensitivity of word discrimination. Stimulus presentation and behavioral response recording were controlled by Presentation software (version 14.7, Neurobehavioral Systems). Participants were seated in front of the computer screen with the right index and middle finger placed on response buttons.

Figure 1.

Figure 1.

Experimental setup and design. A, Baseline perceptual sensitivity to discriminate between MPs and IPs was assessed on day 1, followed by a training over 3 days where participants were trained intensively on half of the stimuli (n = 100). During the first 20 min of each training session, either tDCS over IFG, IPC, or sham stimulation was applied. After training (day 4), perceptual sensitivity of all stimuli (trained and untrained) were tested again. B, Single trial design of the same-different forced-choice paradigm during both pretest and post-test (top row) and training (bottom row). C, For tDCS, neuronavigation was used to target brain areas (left IFG and IPC) based on the subjects' individual structural MRI scans.

Pretest and post-test—trial design and stimuli.

During the pretest, each participant's baseline perceptual sensitivity to discriminate between MPs and IPs was assessed (e.g., MP /Fisch/ − TISCH = “different” vs IP /Fisch/ − FISCH = same; please note that throughout the manuscript “/Xxx/” indicates the acoustic, degraded presentation, while “XXX” indicates the written, undegraded presentation of the respective word). The first word in each word pair was presented acoustically, degraded by noise vocoding in one of the four different intelligibility levels (2, 3, 4, and 6 bands; see above). After a variable interstimulus interval of 600–900 ms, the second word was visually presented for 500 ms (capital letters, font size 72, black on white background). After the presentation of the second word, participants had to decide by button press whether both words were identical or different. They were encouraged to do so as fast as possible. After the button press or after time out (2000 ms, if the participant failed to respond; a missed trial), there was a break of 500 ms before the next trial started (Fig. 1B). Across participants, the allocation of the index or middle finger for the response of same or different was counterbalanced.

All target stimuli (200 MPs and 200 IPs) and distractor stimuli (64 MPs and 64 IPs) were tested during the pretest. To prevent order recognition effects, each auditorily presented word was presented a third time followed by the visual presentation of either the identical or the minimal pair word (hence, a total of 300 MPs, 300 IPs, and 192 distractors in pseudorandomized order). This third presentation was not included in the data analysis. On day 4, a post-test assessed perceptual sensitivity for both trained and untrained stimuli. The procedure of the post-test was identical to the pretest, except that stimuli were presented in a different randomization.

Training—trial design and stimuli.

The training proceeded over 3 days and included five training blocks (one block on day 1; two blocks each on days 2 and 3). As sketched above (Fig. 2), participants were trained to discriminate only half of the stimuli tested in the pretest, yielding a total amount of 25 MPs, 25 IPs, 8 distractor MPs, and IPs per degradation level. Each block comprised the threefold auditory presentation of each stimulus, once followed by the visual presentation of the identical word, once followed by the visual presentation of the minimal word pair, and once randomly followed by either of both to prevent order recognition effects. In analogy, 32 distractor MPs and 32 IPs (8 per degradation level) were also presented auditorily three times, resulting in a total of 300 training and 96 distractor trials per training block. To allow perceptual learning, two additional steps were added to the procedure otherwise identical to the pretesting: (1) after the response, feedback (correct/incorrect) was given by icons; and (2) thereafter the first word was simultaneously presented in its acoustically degraded and in the (undegraded) written forms (Fig. 1B). The latter procedure was chosen because perceptual learning of degraded speech has been shown to be most robust when acoustically degraded verbal material and its written transcription are presented simultaneously (Loebach et al., 2010). One training block (∼30 min) followed the pretest on day 1, while two training blocks (∼60 min) were performed on day 2 and day 3. On day 4 (post-test), there was no further training. To investigate its potential to induce facilitation of perceptual learning, tDCS or sham stimulation was applied during the first 20 min of each training session (for tDCS methods, see below).

Neuro-navigated transcranial direct current stimulation

T1-weighted high-resolution structural images of each participant were available to individually identify the target areas for tDCS. Target coordinates for left IFG and left IPC/AG were chosen according to a recent fMRI study on perceptual learning of degraded speech (Eisner et al., 2010): x, y, z MNI space, −46, 26, 20 for left IFG; −36, −58, 50 for left IPC (Fig. 1C). For the sham group (SHAMSTIM), one of each of the target areas was randomly chosen. A battery-driven DC stimulator delivered tDCS (Neuroconn GmbH) using a pair of electrodes in a 5 × 5 cm saline-soaked sponge. Before training, the electrodes were attached to the participant's head using elastic bands. The anodal electrode was centered over the respective target coordinate, while the cathode was attached to the contralateral supraorbital region (Flöel et al., 2008). The application of tDCS was single blinded. For all experimental conditions (anodal tDCS over IFG, anodal tDCS over IPC, and SHAM), the current was increased in a ramp-like fashion over 30 s to a maximum of 1 mA eliciting a transient tingling sensation on the scalp. In the verum groups, tDCS was delivered for 20 min (IFGSTIM, IPCSTIM), but in the sham group stimulation was faded out after 30 s (SHAMSTIM). The current density at the stimulation electrodes amounts to 0.04 mA/cm2 for our 1 mA anodal tDCS, and the total charge (current density × total stimulation duration, in seconds) was 0.048 C/cm2 on each day for the verum conditions. Currents were turned off slowly after a few seconds precluding sensory differences between conditions (Nitsche et al., 2003), which has been shown to be efficient in the blinding of the procedure (Gandiga et al., 2006; Ragert et al., 2008).

Data analysis

The analysis of the same-different forced-choice discrimination task followed signal detection methodology rendering measures of d′ and the underlying C. The former measure is independent of response bias, and the latter renders a numerical value that indicates response bias when different from zero. The analytic framework takes into account that individuals are not merely passive receivers but use an internal decision criterion for response selection. Stimulus degradation induces an increase in perceptual uncertainty rendering response bias more likely to occur. In the following, an increase of d′ indicates an increase in perceptual sensitivity for the discrimination of MPs versus IPs. Successful learning thus increases d′. This measure is calculated by d′ = z (hits) − z (false alarms).

Since d′ is a measure independent of response bias, C renders additional information on the individual's response behavior. C was calculated using the following formula: C = −0.5 * (z (hits) + z (false alarms)) (McMillan, 2005). When C = 0, there is no response bias. In the present study C > 0 indicates a more “conservative” response behavior, meaning that the participant showed a tendency to respond differently. Conversely, C < 0 (more liberal) indicates the tendency to respond the same.

Four different statistical models were calculated separately for both d′ and C. These models combined the following: one across-subject factor, STIM (3 groups, i.e., IFGSTIM, IPCSTIM, SHAMSTIM); four within-subject factors, DEGRAD (four levels, i.e., 2, 3, 4, 6-band vocoding); BLOCK (five levels, i.e., training blocks only available for trained items); TIME (two levels, i.e., pretest, post-test); and TRAIN (two levels, i.e., trained, untrained).

Model 1.

At baseline, we assessed potential influences of degradation and/or tDCS group on d′ and C in two separate 4 × 3 repeated-measures mixed-factorial ANOVAs (RMANOVAs; DEGRAD × STIM). These were performed on the pretest data.

Model 2.

Changes in d′ and C during training, were assessed by two separate 5 × 4 × 3 RMANOVAs (BLOCK × DEGRAD × STIM). These obviously only included the trained items.

Model 3.

The influence of the full training program on d′ and C was assessed by two 2 × 2 × 4 × 3 RMANOVAs (TIME × TRAIN × DEGRAD × STIM).

Model 4.

The influence of tDCS on perceptual learning (i.e., the change in perceptual sensitivity for untrained stimuli) was assessed by its influence on changes in d′ (Δd′ = dpost-testdpretest; Zaehle et al., 2011). Likewise, changes in C were analyzed to assess potential changes in response behavior associated with perceptual learning (ΔC = Cpost-testCpretest). Both parameters were tested by using two separate 4 × 3 RMANOVAs (DEGRAD × STIM). In a second step, univariate between-subject ANOVAs for the factor STIM were calculated separately for each degradation level.

The level of statistical significance was set at a threshold of p < 0.05. When ANOVAs yielded significant main effects, post hoc tests were calculated according to Fisher's least significant difference.

Results

Our study targets the improvement in perceptual sensitivity to discriminate between MPs and IPs under acoustically degraded conditions. The measure to assess changes in perceptual sensitivity was d′ in a forced-choice paradigm. Here, our main interest was the effect of tDCS on changes in d′ (i.e., pretest vs post-test) for the untrained items, because this transfer of perceptual sensitivity from trained to untrained stimuli can be considered an operationalization of perceptual learning in the present study. Additionally, we looked into the changes in C, potentially disclosing influences on learning and influences of tDCS on response bias during stimulation.

d′ and C

As described in the Materials and Methods section, four models were tested with regard to d′ and C: (1) the influence of degradation level and stimulation group at baseline; (2) the effect of training for IP, and MP respecting the stimulation group (essentially this only refers to the trained items); (3) changes in both parameters from baseline to post-test after the full training program regarding differential effects on trained and untrained items; and (4) the effect of the three different tDCS stimulation conditions on the improvement of perceptual sensitivity after training (determined by Δd′ = dpost-testdpretest), and changes in response criterion (ΔC = Cpost-testCpretest).

Influence of degradation on d′ and C at baseline

d′

The parametric degradation of the acoustically presented words significantly reduced perceptual sensitivity (Fig. 3A). This steady increase in d′ with intelligibility was statistically confirmed by a main effect of DEGRAD (F(3,99) = 56.6, p < 0.001; post hoc t tests between all vocoding levels at p < 0.015). The factor STIM did not influence perceptual sensitivity at baseline, precluding a pretraining bias due to uncontrolled group differences (main effect for STIM: F(2,33) = 0.6, p = 0.9; interaction of DEGRAD × STIM: F(6,99) = 0.71, p = 0.64). In detail, baseline of d′ for all stimulation groups and degradation levels were as follows (mean ± SEM): SHAMSTIM: 2 bands, d′ = 0.89 ± 0.12; 3 bands, d′ = 1.27 ± 0.11; 4 bands, d′ = 1.44 ± 0.15; 6 bands, d′ = 1.82 ± 0.13; IFGSTIM: 2 bands, d′ = 0.77 ± 0.11; 3 bands, d′ = 1.2 ± 0.11; 4 bands, d′ = 1.58 ± 0.14; 6 bands, d′ = 1.85 ± 0.21; IPCSTIM: 2 bands, d′ = 0.71 ± 0.1; 3 bands, d′ = 0.93 ± 0.15; 4 bands, d′ = 1.2 ± 0.11; 6 bands, d′ = 1.6 ± 0.15.

Figure 3.

Figure 3.

Behavioral data at baseline (pretest). A, d′ decreased as a function of signal degradation (number of bands) in a roughly linear fashion. B, The amount of degradation influenced the participant's criterion, C: severely degraded speech (2 and 3 bands) led to a positive criterion; for example, subjects showed a response bias to judge pairs differently for severely degraded speech, but not for lesser degraded speech (4 and 6 bands). Wide columns, Mean of all participants (n = 36); narrow columns, mean per stimulation group (n = 12 each). Error bars indicate ±SE.

C

Higher degradation levels led to more conservative values for C (Fig. 3B), indicating that participants showed a bias to judge word pairs differently for severely degraded speech, not present for lesser degradation levels as follows (mean ± SEM): SHAMSTIM: 2 bands, C = 0.38 ± 0.11; 3 bands, C = 0.15 ± 0.14; 4 bands, C = 0.04 ± 0.09; 6 bands, C = −0.11 ± 0.12; IFGSTIM: 2 bands, C = 0.32 ± 0.11; 3 bands, C = 0.14 ± 0.11; 4 bands, C=-0.02 ± 0.11; 6 bands, C = −0.02 ± 0.08; IPCSTIM: 2 bands, C = 0.54 ± 0.12; 3 bands, C = 0.20 ± 0.12; 4 bands, C = −0.00 ± 0.10; 6 bands, C = −0.12 ± 0.1. Statistics confirmed a main effect of DEGRAD (F(3,99) = 55.75, p < 0.001) and confirmed a significant response bias only for the two severe degradation levels (2 bands, p < 0.001; 3 bands, p = 0.03, 4 bands, p = 0.92; 6 bands, p = 0.15). At baseline no influence of tDCS group on C was found (factor STIM: F(2,33) = 0.6, p = 0.9; interaction STIM × DEGRAD: F(6,99) = 2.13, p = 0.06).

To sum up, before training, the three stimulation groups showed no differences with regard to d′ and C. The within-subject factor speech degradation level (DEGRAD) showed the expected increase in d′ with intelligibility and a bias to judge pairs as different for severe degradation.

Influence of training and tDCS on d′ and C

d′

In the course of the training, perceptual sensitivity as indexed by d′ improved for the trained items (Fig. 4A). Visual inspection suggests steeper learning curves for the more strongly degraded items when compared with the less degraded items. The increase in d′ levels off toward the later training blocks only for the least degraded stimuli (6 bands). These findings were confirmed by the multifactorial RMANOVA, showing main effects for BLOCK (F(132,4) = 132.08, p < 0.001) and DEGRAD (F(99,3) = 89.11, p < 0.001), and for the interaction BLOCK × DEGRAD (F(396,12) = 1.99, p < 0.024). Post hoc testing of consecutive blocks revealed the following relations in magnitude: 2 band, block1 < 2 < 3, and block 4 < 5 (all p < 0.045); 3 band, block 1 < 2 < 3 (all p < 0.007); 4 band, block1 < 2 < 3 (all p < 0.042); and 6 band, block 1 < 2 (p < 0.001). All other comparisons were not significant (p > 0.05). Importantly, we did not find a significant main effect of STIM (F(33,2) = 2.093, p < 0.14) or significant interactions with STIM (BLOCK × STIM, F(132,8) = 1.47, p = 1.74; BLOCK × STIM × DEGRAD, F(396,24) = 0.92, p < 0.574) during training.

Figure 4.

Figure 4.

Behavioral data of all subjects during the training. A, Graph shows improvements in d′ during five blocks of training (days 1–3) in the three different tDCS groups. More strongly degraded stimuli showed a steeper learning curve compared with the less degraded stimuli, as supported by a significant interaction of signal degradation and training block. B, Application of tDCS during training led to a modulation of C toward a more liberal response tendency in the IPC group only. No response bias (e.g., significant difference of C from zero) was found in the IFG or sham group. Error bars indicate ±SE.

C

During training, tDCS over the IPC yielded a response bias (Fig. 4B), while C was not significantly different from 0 (i.e., no response bias) for IFG or sham stimulation. IPC stimulation led to a decrease in C for all degradation levels, with the response bias shifting toward the judgment same as indicated by the negative, that is, more liberal, C. A multifactorial RMANOVA revealed a strong trend for a significant main effect of STIM (IFGSTIM, IPCSTIM, SHAMSTIM: F(33,1) = 3.26, p = 0.051). Post hoc testing revealed differences in criterion between IPCSTIM versus SHAMSTIM (p = 0.042) and IPCSTIM versus IFGSTIM (p = 0.028), respectively. No difference in criterion was found between IFGSTIM and SHAMSTIM (p = 0.862). Furthermore, a significant interaction of the factors BLOCK and DEGRAD was found (F(280,56,12) = 2.53, p = 0.01; Greenhouse–Geisser corrected). This interaction was driven by the influence of the factor DEGRAD. No significant main effect was found for DEGRAD (F(99,3) = 0.7; p = 0.55) and BLOCK (F(132,4) = 0.7, p = 0.59), or the interactions DEGRAD × STIM (F(99,6) = 1.16; p = 0.34), BLOCK × STIM (F(132,8) = 1.11; p = 0.36), or DEGRAD × BLOCK × STIM (F(396,24) = 0.91; p = 0.6).

In summary, perceptual sensitivity for the trained items increased during the training. For less degraded stimuli, this increase leveled off for the later training blocks. Regarding the criterion, a response bias was induced only by IPC stimulation during training, which showed no changes over the course of the training.

Item-specific learning versus generalization

Changes in d′ from pretest to post-test

As expected, and according to the results in Model 2, the comparison between pretest and post-test confirms the improvement for the trained items (Fig. 5A). More relevant to our research question, item-specific learning elicited generalization because d′ also increased for the untrained items (Fig. 5B). Comparison between the different degradation levels indicates better performance for lesser degradation (as shown above) but also suggests an interaction with training: there is a parallel increase on all degradation levels for the trained items, whereas learning curves for the untrained items fan out as a function of degradation level. Stronger degradation leads to a lesser generalization. For the 2-band degradation, no increase in d′ is seen for the untrained items, suggesting no perceptual learning for this degradation level. These findings were confirmed by the results of the RMANOVA yielding significant main effects for TIME (F(33,1) = 202.9, p < 0.001), TRAIN (F(33,1) = 119.47, p < 0.001), and DEGRAD (F(99,3) = 129.0, p < 0.001). The main effect STIM did not reach significance (F(33,2) = 3.23; p = 0.052). Regarding interactions involving the factor of interest (TIME), we observed significant interactions of TIME × DEGRAD (F(99,3) = 4.56, p < 0.005) and TIME × TRAIN × DEGRAD (F(99,3) = 3.46, p < 0.02), which confirmed differential learning-related changes for trained versus untrained items at different degradation levels. Post hoc testing revealed significant increases in d′ from pretest to post-test on all degradation levels (2, 3, 4, and 6 bands) for the trained stimuli (p < 0.001), and for 3, 4, and 6-bands degradation condition for the untrained stimuli (p < 0.01). No difference from pretest to post-test was found for the 2-band condition in the untrained stimuli (not significant at p > 0.05).

Figure 5.

Figure 5.

Perceptual sensitivity of all subjects (n = 36) before and after training (pretest and post-test). A, Graph shows an improvement in perceptual sensitivity of trained stimuli in all degradation levels from pretest to post-test, indicating item-specific learning. B, Improvement in untrained stimuli, i.e., generalization, revealed perceptual learning for the lesser degraded conditions (3, 4, and 6 bands). No perceptual learning was observed for the most severely degraded stimuli (2 bands). Perceptual and item-specific learning show differential learning patterns as a function of degradation: while equal discrimination improvements were observed across degradation levels for item-specific learning, during perceptual learning the slopes fan out as a function of degradation level, i.e. stronger degradation leads to a lesser generalization. Error bars indicate ±SE. Asterisks indicate significant differences (p < 0.05).

The other significant interactions did not involve the factor TIME (TRAIN × DEGRAD, F(99,3) = 5.4, p < 0.002; and TRAIN × DEGRAD × STIM, F(99,6) = 2.54, p < 0.03). The interactions involving the factors TIME and STIM were not significant, as follows: TIME × STIM, F(29,2) = 1.05, p = 0.362; TIME × STIM × TRAIN, F(33,2) = 1.56, p = 0.226; TIME × STIM × DEGRAD, F(99,6) = 0.79, p = 0.583.

Changes in C from pretest to post-test

The analysis of C confirms the above significant influence of degradation on C as indexed by the main effect DEGRAD (F(99,3) = 11.34, p < 0.001). Furthermore, we observed significant interactions of TIME × DEGRAD (F(99,3) = 8.3, p < 0.001) and TIME × DEGRAD × TRAIN (F(99,3) = 3.21, p < 0.026). Post hoc tests comparing C from pretest to post-test revealed significant differences in 2-band condition for both trained (p < 0.002) and untrained stimuli (p < 0.008) and in the 6-band condition for the untrained stimuli only (p < 0.023). In detail, at baseline for 2-band vocoding, trained and untrained stimuli showed a response bias significantly different from zero as follows (mean ± SEM): trained, 0.43 ± 0.08, p < 0.001; untrained, 0.39 ± 0.06, p < 0.001. At post-testing, this response bias decreased for the untrained stimuli (0.19 ± 0.04, p < 0.001) and was abolished for the trained stimuli (trained −0.01 ± 0.11, n.s.). For 6-band vocoding, no significant response bias, that is, C different from zero, was detected at pretesting (trained, −0.07 ± 0.07, n.s.; untrained, −0.1 ± 0.05, n.s). and post-testing (trained, −0.19 ± 0.17, n.s.; untrained, −0.06 ± 0.04, n.s.). No influence was found for between-subject factor STIM (F(33,2) = 0.35, p = 0.71) or the interactions involving the factors STIM and TIME: TIME × STIM, F(33,2) = 0.59, p = 0.556; TIME × STIM × TRAIN, F(33,2) = 0.32, p = 0.727; TIME × STIM × DEGRAD, F(99,6) = 0.77, p = 0.592.

To sum up, training reduced (untrained stimuli) or abolished (trained stimuli) a pre-existing response bias to judge stimuli as different for the most severely degraded condition. No change in response bias was seen for 3- and 4-band vocoding. The increase in C for the 6-band condition was purely numerical, since neither pretesting nor post-testing showed a significant response bias (C ≈ 0 at all times).

Effect of tDCS on perceptual learning and related changes in response behavior

Δd′

The omnibus ANOVA (DEGRAD × STIM) of Δd′ revealed a significant influence of degradation on perceptual learning as indexed by the main effect DEGRAD (F(99,3) = 8.73, p < 0.001). In this statistical model, the factor STIM or the interaction STIM × DEGRAD did not reach significance (p > 0.05). Since we expected the effect of top-down influences to be more relevant in severely degraded speech, we additionally tested the four degradation levels separately by using univariate between-subject ANOVAs (factor STIM) for each degradation level. tDCS over IFG led to an increase in perceptual learning of degraded speech only in the condition with severely degraded speech (2 bands, Δd′ = 0.6 ± 0.52; Fig. 6). At this degradation level, no perceptual learning was seen for IPCSTIM or SHAMSTIM (mean ± SD: IPCSTIM, 0.0 ± 0.54; SHAMSTIM, 0.1 ± 0.53; F(33,2) = 3.88; p < 0.031; η2 = 0.19; post hoc t-tests: IFG vs sham, p < 0.043; IFG vs IPC, p < 0.013; sham vs IPC, n.s., p = 0.605), while the differences for the other degradation levels (3, 4, and 6 bands) did not reach statistical significance (3 bands, F(33,2) = 0.06, p = 0.94; 4 bands, F(33,2) = 1.974, p = 0.155; 6 bands, F(33,2) = 0.69; p = 0.51). No stimulation group effects were seen for the trained stimuli (2 bands, F(33,2) = 1.82, p = 0.177; 3 bands, (F(33,2) = 0.26; p = 0.78; 4 bands, F(33,2) = 1.484, p = 0.242; 6 bands, F(33,2) = 0.64; p = 0.541).

Figure 6.

Figure 6.

Effect of tDCS on perceptual learning (Δd′ of untrained items; lower graph displaying group averages, upper graph displaying single subject data). tDCS over IFG (n = 12) induced perceptual learning in the condition with the most severely degraded items (2 bands), while here no perceptual learning was observed for tDCS over IPC or SHAM (n = 12 each). No significant differences between groups were observed in the other degradation levels (3, 4, and 6 bands). Error bars indicate ± SE. Asterisks indicate significant differences (p < 0.05).

ΔC

The omnibus ANOVA (DEGRAD × STIM) of ΔC revealed a significant influence of degradation on changes in response behavior for perceptual learning as indexed by the main effect DEGRAD (F(99,3) = 10.28, p < 0.001). The factor STIM or the interaction STIM × DEGRAD did not reach significance (p > 0.05). Likewise, the analyses using univariate ANOVAs performed separately for each degradation level did not show an effect of STIM on ΔC (p ≥ 0.19 for all comparisons).

Discussion

Using a sham-controlled design, our results demonstrate that facilitatory tDCSanodal impacts on perceptual learning when applied during training over two key areas involved in the top-down processing of noise-vocoded speech. Facilitation of the left IFG induced a transfer from trained to untrained stimuli for the increase in d′. This applied only for the most severely degraded speech. Such transfer can be considered evidence for perceptual learning. Notably, no perceptual learning occurred when IPC was facilitated by tDCSanodal. On the contrary, left IPC facilitation induced a change in the decision criterion (C), in that participants showed a robust bias to judge stimuli as same during training. Additionally, the response bias (C > 0) before training for the most severely degraded stimuli was attenuated for the untrained, and abolished for the trained stimuli after the training.

The major findings can be conceived in the framework of signal detection theory. As sketched in Figure 7, two effects must be considered. (1) Forced choice between two categories (i.e., same vs different) will improve when stimuli (i.e., IP vs MP) elicit less overlapping neuronal response patterns (Fig. 7A). Our data support the hypothesis that for severely degraded speech IFG supports such a formation of more distinct response patterns during training, as evidenced by an increase in d′ for untrained stimuli. We suggest that in our paradigm a linguistic feature (here phonetic categories) is “sharpened” by training. (2) More general cognitive control processes such as C impact on forced choice in a more complex way. If no response bias is present at baseline (i.e., C ≈ 0), the induction of a response bias will not enhance performance. Only if an intervention reduces a pre-existing response bias (C ≠ 0) will this increase performance in a balanced forced-choice task. The latter effect is seen for more severely degraded speech in the comparison between pretesting and post-testing. On the contrary, the induction of a response bias during training by IPC facilitation does not increase perceptual sensitivity for the untrained items (Fig. 7B).

Figure 7.

Figure 7.

Sketch illustrating roles of IFG and IPC during perceptual learning of degraded speech. A, Conceptualization of the effect of tDCS over IFG for the most severely degraded items. Before training (dashed curves), the neuronal response to the auditory stimuli is noisy, as illustrated by broad overlapping response distributions and a comparatively small d′ (dashed gray line). Induced by training under IFG stimulation, the response distributions drift apart (solid lines), which signals enhanced categorical phoneme discrimination. Note that for the 2-band-vocoded words this effect relies on IFG facilitation. B sketches the effect of a change in the internal C as seen only during training with IPC facilitation for all vocoding levels. IPC facilitation induced liberalization of the criterion, yielding a near-optimal hit rate for identical pairs, at the expense of a grossly enhanced false alarm rate. The bias to integrate the acoustically degraded and the undegraded written word is a bias toward a unified percept. Such a bias may enhance perceptual learning when a larger-scale linguistic context (semantic, syntactic, or pragmatic) is supplied, but may not enhance the phonetic nucleus of perceptual learning of degraded speech.

Clearly, for perceptual learning during natural connected speech perception, these processes interact. However, our design, selectively providing lexical information without any syntactic, pragmatic, or contextual cues, allows us to discuss more linguistic and more general cognitive aspects during perceptual learning of vocoded speech.

Linguistic and general cognitive top-down processes

During speech perception, linguistic competence is incrementally recruited when the auditory input is degraded. At the same time, decision bias is likely to occur under challenging perceptual conditions, leading to the ambiguity of a stimulus (Vickery and Jiang, 2009). Using facilitatory anodal tDCS, we here supply experimental evidence that left IFG and IPC are important hubs differentially impacting on these two processes pertaining to perceptual learning of degraded speech. This noninvasive brain stimulation technique has been previously shown to modulate language functions and language learning processes (Flöel et al., 2008; Sparing et al., 2008; de Vries et al., 2010; Cattaneo et al., 2011). In line with our hypothesis anodal tDCS over left IFG supports perceptual learning of degraded speech in a same-different forced-choice paradigm. Consistent with the recruitment of linguistic knowledge for adverse listening environments, left IFG indeed showed a facilitatory effect for perceptual learning of highly degraded speech: Only when IFG was facilitated during training did perceptual learning occur for the most severely degraded stimuli. Note that these stimuli provide almost no spectral information to allow for the phonemic discrimination required by the task, and, accordingly, participants showed no perceptual learning for these items under natural, that is, non-tDCS, conditions. On the contrary, IPC facilitation did not affect perceptual learning but instead modulated participants' decision criterion during training, eliciting a robust liberalization of the response bias toward judging pairs as identical.

On a theoretical account, our data support the notion that integration of linguistic knowledge becomes essential especially when the auditory input is severely degraded (Obleser et al., 2007). The data provide the first evidence that top-down influences can be efficiently augmented by facilitatory tDCS to allow for learning even when learning is absent under “natural,” that is, non-tDCS learning conditions. Since noise vocoding simulates the degraded auditory input supplied by cochlear implants (Shannon et al., 1995), our findings are encouraging to further explore the potential clinical benefits of noninvasive brain stimulation in hearing impairment and restorative hearing therapy.

Operationalization of perceptual learning

Our results are based on an operationalization because transfer of the increase in d′ from trained to untrained items is thought to signal perceptual learning. Principally, perceptual adaptation and learning processes ensure the relatively stable processing of sensory information despite a highly variable and/or distorted signal. Depending on the experimental operationalization, the generalization of a previously established differential response to a new stimulus can be a mandatory feature for perceptual learning (Wohlwill, 1958; Goldstone, 1998; Loebach et al., 2009). In our study, participants were intensively trained on a same-different forced-choice paradigm to discriminate between MPs and IPs. Importantly, after the training on day 4, the generalization to untrained items was tested. Because improvements for the untrained stimuli indicate perceptual learning, our study design allows disentangling of perceptual learning from item-specific stimulus–response associations.

It should be noted that our design targets perceptual learning, which has been differentiated from within-session adaptation, in that multiple sessions and overnight consolidation may be required to stabilize the augmented perceptual skill. Both perceptual adaptation and learning have been suggested to require an interaction of primary sensory with higher-order, more cognitive processes in different brain areas (Censor et al., 2012).

Implications for the neuronal network affording perceptual learning of degraded speech

With regard to the adaptation to degraded speech, a recent fMRI study provides support for the differential role of the two higher-order hubs of the network (IFG and IPC) to interact with bottom-up processes (mostly taxing STG/STS). Participants were trained on moderately degraded 8-band-vocoded (and frequency-shifted) sentences in a comparatively short single training session in the scanner (Eisner et al., 2010). BOLD contrast changes over the course of training disclosed that vocoded speech, when compared with nonlearnable material, elicited IFG and STS activations. However, only left IFG activation correlated with the learning success, suggesting a pivotal role of the IFG for perceptual learning. Interestingly, inferior parietal regions [AG and supramarginal gyrus (SMG)] correlated with the individual progress over the course of the training. Based on a functional connectivity analysis, the authors proposed that IFG is the key structure linking bottom-up (STS/STG) and multimodal inferior parietal integration areas (AG and SMG) during perceptual learning.

Advancing from such correlational imaging findings, our study provides (1) novel evidence for a causal relationship between IFG facilitation and perceptual learning of severely degraded speech, and (2) highlights novel aspects with regard to the interplay between IFG and IPC, affording the modulation of phonemic categorical decisions during the learning process, as follows.

First, we show that learning is not only enhanced, but also enabled by tDCS in the case of severely degraded, highly unintelligible speech. No learning took place for 2-band degraded speech in the absence of IFG tDCS. To our knowledge, this is the first demonstration of a cognitive process that depends on external stimulation. How might tDCS enable perceptual learning? Anodal stimulation causes a tonic depolarization of the neuronal membrane potential, increasing the spontaneous firing rate and the excitability of cortical neurons in the stimulated area. Importantly, anodal stimulation acts upon a modulation of the synaptic strength and thus resembles long-term potentiation-like mechanisms (Stagg and Nitsche, 2011). We suggest that tDCS-enhanced plasticity in the left IFG allowed for the modulation and sharpening of phonemic categories, even when the auditory input supplied minimally discernible features (Fig. 7A). This effect is constitutive for perceptual learning under severely degraded conditions where stronger contributions of linguistic knowledge are required.

Second, previous studies on the neural correlates of perceptual learning of degraded speech used training paradigms providing a high level of semantic, syntactic, and even pragmatic context. When subjects are trained on sentences (Eisner et al., 2010; Erb et al., 2012) or in a natural listening environment using a portable real-time cochlear-implant simulator (Smalt et al., 2011), IFG involvement cannot be attributed to a certain level of linguistic processing. In contrast, our task targets phoneme discrimination and minimizes further contextual (e.g., semantic, syntactic, or pragmatic) information. Since our data show that IFG stimulation enables perceptual learning of highly degraded single words, we propose that in this context left IFG affords the critical formation and adaptation of phonological categories. Our findings support a more linguistic role of the IFG, while the more general cognitive processes of perceptual integration and decision bias may rely on the highly multimodal, parietal parts of the network. Such a view converges with previous studies showing an involvement of the left IFG in different aspects of phonological processing (Poldrack et al., 1999; Nixon et al., 2004; Tremblay et al., 2004; Hartwigsen et al., 2010). Future studies should target the question of the exact locus of facilitation from a neurolinguistic perspective, since the importance of lexical information of the training items has been controversially discussed in the light of behavioral findings (Davis et al., 2005; Hervais-Adelman et al., 2008). Based on our present findings, we propose that IFG is the key area to readjust sublexical, more specifically phonemic, categories during learning (Fig. 7A). In turn, this categorical readjustment modulates intelligibility (Obleser and Eisner, 2009).

It is important to note that tDCS does not selectively increase local cortical activity but rather modulates neuronal networks to change functional connectivity between both local and interconnected brain areas (Sehm et al., 2012). We assume that the behavioral effect does not exclusively rely on a modulation of the left IFG, but is supported by the modulation of a fronto-temporo-parietal network, with the left IFG as the key node. This assumption needs to be put under scrutiny by investigating approaches that combine facilitatory brain stimulation and functional connectivity imaging.

Beyond IFG, left IPC has been considered a key node within the network affording perception and/or perceptual learning of degraded speech (Shahin et al., 2009; Eisner et al., 2010; Obleser and Kotz, 2010; Sharp et al., 2010a; Clos et al., 2012). We do not find an IPC facilitation in perceptual learning, which is most likely due to the use of single words and a phonological decision task, which neither requires nor allows higher-order semantic or sentential integration. We suggest that the IPC comes into play when a higher-order semantic context is provided, or when the temporo-frontal loop is compromised by disease (Sharp et al., 2010b). In the present data, IPC stimulation had a more subtle, indirect effect on performance and modulated the decision criterion during learning. Only when IPC was facilitated by anodal tDCS, did participants show a bias to identify identical pairs (Fig. 7B). The results raise the possibility that facilitation of the IPC leads to an increased integration of the degraded acoustic word and the undegraded written word, resulting in a response bias toward identical pairs. On a more general level, our data support the putatively supramodal role of the IPC during discrimination tasks, where this area is a hub of a network involved in perceptual decision making (Kühn et al., 2011).

Potential limitations

A principal limitation of studies targeting perceptual learning is the necessity to test interventions between groups. Since perceptual sensitivity, training efficiency, and tDCS effects will vary between individuals, the resulting variance may elicit substantial type II errors. In that vein, interactions involving the factor STIM in the critical comparisons failed to reach significance (e.g., STIM × TIME). With regard to the network affording processing of degraded speech, one key area (STS/STG) was not targeted in the present study. Here, the major limitation is that tDCS cannot selectively target these areas without affecting primary auditory processing. Hence, a different training schedule including tests for primary auditory processing would be required, which we considered to overly complicate the already complex design of the study. Another technical issue regarding tDCS is bipolar electrode arrangement. Based on our experimental design, we cannot entirely rule out an effect of the cathodal “reference” electrode over the right supraorbital region. We consider it unlikely, however, that prefrontal inhibitory tDCS can explain our findings because (1) we find differential effects for anodal stimulation of the IFG and IPC, though the reference electrode position was kept the same in all tDCS conditions; (2) inhibitory effects by cathodal stimulation in cognitive and language tasks must be considered small (Jacobson et al., 2012), especially at the intensity of 1 mA used here; and (3) a modulatory effect on perceptual learning by potential inhibition of right supraorbital region would be a very unexpected finding given the putative key areas in this task, as discussed in previous work (Eisner et al., 2010).

Summary

To sum up, our results allow us to disentangle the contributions of two key structures of the speech perception network during perceptual learning of degraded speech: IFG is the crucial brain area to adjust and sharpen the mapping of degraded input to phonemic categories. Upregulation in IFG allows perceptual learning even under most severely reduced auditory bottom-up information. On the contrary, IPC plays a modulatory role in the same-different discrimination, as indicated by a clear influence on the decision criterion during learning. Such an influence on the criterion may be relevant in perceptual learning, but may only influence learning success when the categories to be differentiated have grossly different occurrence probabilities.

Footnotes

The authors declare no competing financial interests.

References

  1. Bent T, Loebach JL, Phillips L, Pisoni DB. Perceptual adaptation to sinewave-vocoded speech across languages. J Exp Psychol Hum Percept Perform. 2011;37:1607–1616. doi: 10.1037/a0024281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Cattaneo Z, Pisoni A, Papagno C. Transcranial direct current stimulation over Broca's region improves phonemic and semantic fluency in healthy individuals. Neuroscience. 2011;183:64–70. doi: 10.1016/j.neuroscience.2011.03.058. [DOI] [PubMed] [Google Scholar]
  3. Censor N, Sagi D, Cohen LG. Common mechanisms of human perceptual and motor learning. Nat Rev Neurosci. 2012;13:658–664. doi: 10.1038/nrn3315. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Clos M, Langner R, Meyer M, Oechslin MS, Zilles K, Eickhoff SB. Effects of prior information on decoding degraded speech: an fMRI study. Hum Brain Mapp. 2012 doi: 10.1002/hbm.22151. doi: 10.1002/hbm.22151. Advance online publication. Retrieved August 29, 2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Dahan D, Mead RL. Context-conditioned generalization in adaptation to distorted speech. J Exp Psychol Hum Percept Perform. 2010;36:704–728. doi: 10.1037/a0017449. [DOI] [PubMed] [Google Scholar]
  6. Davis MH, Johnsrude IS, Hervais-Adelman A, Taylor K, McGettigan C. Lexical information drives perceptual learning of distorted speech: evidence from the comprehension of noise-vocoded sentences. J Exp Psychol Gen. 2005;134:222–241. doi: 10.1037/0096-3445.134.2.222. [DOI] [PubMed] [Google Scholar]
  7. de Vries MH, Barth AC, Maiworm S, Knecht S, Zwitserlood P, Flöel A. Electrical stimulation of Broca's area enhances implicit learning of an artificial grammar. J Cogn Neurosci. 2010;22:2427–2436. doi: 10.1162/jocn.2009.21385. [DOI] [PubMed] [Google Scholar]
  8. Eickhoff SB, Pomjanski W, Jakobs O, Zilles K, Langner R. Neural correlates of developing and adapting behavioral biases in speeded choice reactions—an fMRI study on predictive motor coding. Cereb Cortex. 2011;21:1178–1191. doi: 10.1093/cercor/bhq188. [DOI] [PubMed] [Google Scholar]
  9. Eisner F, McGettigan C, Faulkner A, Rosen S, Scott SK. Inferior frontal gyrus activation predicts individual differences in perceptual learning of cochlear-implant simulations. J Neurosci. 2010;30:7179–7186. doi: 10.1523/JNEUROSCI.4040-09.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Erb J, Henry MJ, Eisner F, Obleser J. Auditory skills and brain morphology predict individual differences in adaptation to degraded speech. Neuropsychologia. 2012;50:2154–2164. doi: 10.1016/j.neuropsychologia.2012.05.013. [DOI] [PubMed] [Google Scholar]
  11. Flöel A, Rösser N, Michka O, Knecht S, Breitenstein C. Noninvasive brain stimulation improves language learning. J Cogn Neurosci. 2008;20:1415–1422. doi: 10.1162/jocn.2008.20098. [DOI] [PubMed] [Google Scholar]
  12. Friederici AD, Kotz SA, Scott SK, Obleser J. Disentangling syntax and intelligibility in auditory language comprehension. Hum Brain Mapp. 2010;31:448–457. doi: 10.1002/hbm.20878. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Gandiga PC, Hummel FC, Cohen LG. Transcranial DC stimulation (tDCS): a tool for double-blind sham-controlled clinical studies in brain stimulation. Clin Neurophysiol. 2006;117:845–850. doi: 10.1016/j.clinph.2005.12.003. [DOI] [PubMed] [Google Scholar]
  14. Goldstone RL. Perceptual learning. Annu Rev Psychol. 1998;49:585–612. doi: 10.1146/annurev.psych.49.1.585. [DOI] [PubMed] [Google Scholar]
  15. Greenwood DD. A cochlear frequency-position function for several species—29 years later. J Acoust Soc Am. 1990;87:2592–2605. doi: 10.1121/1.399052. [DOI] [PubMed] [Google Scholar]
  16. Hartwigsen G, Price CJ, Baumgaertner A, Geiss G, Koehnke M, Ulmer S, Siebner HR. The right posterior inferior frontal gyrus contributes to phonological word decisions in the healthy brain: evidence from dual-site TMS. Neuropsychologia. 2010;48:3155–3163. doi: 10.1016/j.neuropsychologia.2010.06.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Hervais-Adelman A, Davis MH, Johnsrude IS, Carlyon RP. Perceptual learning of noise vocoded words: effects of feedback and lexicality. J Exp Psychol Hum Percept Perform. 2008;34:460–474. doi: 10.1037/0096-1523.34.2.460. [DOI] [PubMed] [Google Scholar]
  18. Jacobson L, Koslowsky M, Lavidor M. tDCS polarity effects in motor and cognitive domains: a meta-analytical review. Exp Brain Res. 2012;216:1–10. doi: 10.1007/s00221-011-2891-9. [DOI] [PubMed] [Google Scholar]
  19. Kühn S, Schmiedek F, Schott B, Ratcliff R, Heinze HJ, Düzel E, Lindenberger U, Lövden M. Brain areas consistently linked to individual differences in perceptual decision-making in younger as well as older adults before and after training. J Cogn Neurosci. 2011;23:2147–2158. doi: 10.1162/jocn.2010.21564. [DOI] [PubMed] [Google Scholar]
  20. Loebach JL, Pisoni DB, Svirsky MA. Transfer of auditory perceptual learning with spectrally reduced speech to speech and nonspeech tasks: implications for cochlear implants. Ear Hear. 2009;30:662–674. doi: 10.1097/AUD.0b013e3181b9c92d. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Loebach JL, Pisoni DB, Svirsky MA. Effects of semantic context and feedback on perceptual learning of speech processed through an acoustic simulation of a cochlear implant. J Exp Psychol Hum Percept Perform. 2010;36:224–234. doi: 10.1037/a0017609. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. McMillan NA. Detection theory. Mahwah, NJ: Erlbaum; 2005. [Google Scholar]
  23. Nitsche MA, Liebetanz D, Lang N, Antal A, Tergau F, Paulus W. Safety criteria for transcranial direct current stimulation (tDCS) in humans. Clin Neurophysiol. 2003;114:2220–2222. doi: 10.1016/S1388-2457(03)00235-9. [DOI] [PubMed] [Google Scholar]
  24. Nitsche MA, Cohen LG, Wassermann EM, Priori A, Lang N, Antal A, Paulus W, Hummel F, Boggio PS, Fregni F, Pascual-Leone A. Transcranial direct current stimulation: state of the art 2008. Brain Stimul. 2008;1:206–223. doi: 10.1016/j.brs.2008.06.004. [DOI] [PubMed] [Google Scholar]
  25. Nixon P, Lazarova J, Hodinott-Hill I, Gough P, Passingham R. The inferior frontal gyrus and phonological processing: an investigation using rTMS. J Cogn Neurosci. 2004;16:289–300. doi: 10.1162/089892904322984571. [DOI] [PubMed] [Google Scholar]
  26. Obleser J, Eisner F. Pre-lexical abstraction of speech in the auditory cortex. Trends Cogn Sci. 2009;13:14–19. doi: 10.1016/j.tics.2008.09.005. [DOI] [PubMed] [Google Scholar]
  27. Obleser J, Kotz SA. Expectancy constraints in degraded speech modulate the language comprehension network. Cereb Cortex. 2010;20:633–640. doi: 10.1093/cercor/bhp128. [DOI] [PubMed] [Google Scholar]
  28. Obleser J, Wise RJ, Alex Dresner M, Scott SK. Functional integration across brain regions improves speech perception under adverse listening conditions. J Neurosci. 2007;27:2283–2289. doi: 10.1523/JNEUROSCI.4663-06.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Obleser J, Eisner F, Kotz SA. Bilateral speech comprehension reflects differential sensitivity to spectral and temporal features. J Neurosci. 2008;28:8116–8123. doi: 10.1523/JNEUROSCI.1290-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Oldfield RC. The assessment and analysis of handedness: the Edinburgh inventory. Neuropsychologia. 1971;9:97–113. doi: 10.1016/0028-3932(71)90067-4. [DOI] [PubMed] [Google Scholar]
  31. Poldrack RA, Wagner AD, Prull MW, Desmond JE, Glover GH, Gabrieli JD. Functional specialization for semantic and phonological processing in the left inferior prefrontal cortex. Neuroimage. 1999;10:15–35. doi: 10.1006/nimg.1999.0441. [DOI] [PubMed] [Google Scholar]
  32. Ragert P, Vandermeeren Y, Camus M, Cohen LG. Improvement of spatial tactile acuity by transcranial direct current stimulation. Clin Neurophysiol. 2008;119:805–811. doi: 10.1016/j.clinph.2007.12.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Rosen S, Faulkner A, Wilkinson L. Adaptation by normal listeners to upward spectral shifts of speech: implications for cochlear implants. J Acoust Soc Am. 1999;106:3629–3636. doi: 10.1121/1.428215. [DOI] [PubMed] [Google Scholar]
  34. Scott SK, Blank CC, Rosen S, Wise RJ. Identification of a pathway for intelligible speech in the left temporal lobe. Brain. 2000;123:2400–2406. doi: 10.1093/brain/123.12.2400. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Sehm B, Schäfer A, Kipping J, Margulies D, Conde V, Taubert M, Villringer A, Ragert P. Dynamic modulation of intrinsic functional connectivity by transcranial direct current stimulation. J Neurophysiol. 2012;108:3253–3263. doi: 10.1152/jn.00606.2012. [DOI] [PubMed] [Google Scholar]
  36. Shahin AJ, Bishop CW, Miller LM. Neural mechanisms for illusory filling-in of degraded speech. Neuroimage. 2009;44:1133–1143. doi: 10.1016/j.neuroimage.2008.09.045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Shannon RV, Zeng FG, Kamath V, Wygonski J, Ekelid M. Speech recognition with primarily temporal cues. Science. 1995;270:303–304. doi: 10.1126/science.270.5234.303. [DOI] [PubMed] [Google Scholar]
  38. Sharp DJ, Awad M, Warren JE, Wise RJ, Vigliocco G, Scott SK. The neural response to changing semantic and perceptual complexity during language processing. Hum Brain Mapp. 2010a;31:365–377. doi: 10.1002/hbm.20871. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Sharp DJ, Turkheimer FE, Bose SK, Scott SK, Wise RJ. Increased frontoparietal integration after stroke and cognitive recovery. Ann Neurol. 2010b;68:753–756. doi: 10.1002/ana.21866. [DOI] [PubMed] [Google Scholar]
  40. Smalt CJ, Talavage TM, Pisoni DB, Svirsky MA. Neural adaptation and perceptual learning using a portable real-time cochlear implant simulator in natural environments. Conf Proc IEEE Eng Med Biol Soc. 2011;2011:1145–1148. doi: 10.1109/IEMBS.2011.6090268. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Sohoglu E, Peelle JE, Carlyon RP, Davis MH. Predictive top-down integration of prior knowledge during speech perception. J Neurosci. 2012;32:8443–8453. doi: 10.1523/JNEUROSCI.5069-11.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Sparing R, Dafotakis M, Meister IG, Thirugnanasambandam N, Fink GR. Enhancing language performance with non-invasive brain stimulation—a transcranial direct current stimulation study in healthy humans. Neuropsychologia. 2008;46:261–268. doi: 10.1016/j.neuropsychologia.2007.07.009. [DOI] [PubMed] [Google Scholar]
  43. Stagg CJ, Nitsche MA. Physiological basis of transcranial direct current stimulation. Neuroscientist. 2011;17:37–53. doi: 10.1177/1073858410386614. [DOI] [PubMed] [Google Scholar]
  44. Tremblay T, Monetta L, Joanette Y. Phonological processing of words in right- and left-handers. Brain Cogn. 2004;55:427–432. doi: 10.1016/j.bandc.2004.02.068. [DOI] [PubMed] [Google Scholar]
  45. Vickery TJ, Jiang YV. Inferior parietal lobule supports decision making under uncertainty in humans. Cereb Cortex. 2009;19:916–925. doi: 10.1093/cercor/bhn140. [DOI] [PubMed] [Google Scholar]
  46. Winn MB, Chatterjee M, Idsardi WJ. The use of acoustic cues for phonetic identification: effects of spectral degradation and electric hearing. J Acoust Soc Am. 2012;131:1465–1479. doi: 10.1121/1.3672705. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Wohlwill JF. The definition and analysis of perceptual learning. Psychol Rev. 1958;65:283–295. doi: 10.1037/h0041722. [DOI] [PubMed] [Google Scholar]
  48. Zaehle T, Sandmann P, Thorne JD, Jäncke L, Herrmann CS. Transcranial direct current stimulation of the prefrontal cortex modulates working memory performance: combined behavioural and electrophysiological evidence. BMC Neurosci. 2011;12:2. doi: 10.1186/1471-2202-12-2. [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from The Journal of Neuroscience are provided here courtesy of Society for Neuroscience

RESOURCES