Skip to main content
The Journal of Neuroscience logoLink to The Journal of Neuroscience
. 2011 Dec 14;31(50):18590–18597. doi: 10.1523/JNEUROSCI.2599-11.2011

Interactions between “What” and “When” in the Auditory System: Temporal Predictability Enhances Repetition Suppression

Jordi Costa-Faidella 1,2, Torsten Baldeweg 3, Sabine Grimm 1,2, Carles Escera 1,2,
PMCID: PMC6623902  PMID: 22171057

Abstract

Neural activity in the auditory system decreases with repeated stimulation, matching stimulus probability in multiple timescales. This phenomenon, known as stimulus-specific adaptation, is interpreted as a neural mechanism of regularity encoding aiding auditory object formation. However, despite the overwhelming literature covering recordings from single-cell to scalp auditory-evoked potential (AEP), stimulation timing has received little interest. Here we investigated whether timing predictability enhances the experience-dependent modulation of neural activity associated with stimulus probability encoding. We used human electrophysiological recordings in healthy participants who were exposed to passive listening of sound sequences. Pure tones of different frequencies were delivered in successive trains of a variable number of repetitions, enabling the study of sequential repetition effects in the AEP. In the predictable timing condition, tones were delivered with isochronous interstimulus intervals; in the unpredictable timing condition, interstimulus intervals varied randomly. Our results show that unpredictable stimulus timing abolishes the early part of the repetition positivity, an AEP indexing auditory sensory memory trace formation, while leaving the later part (∼>200 ms) unaffected. This suggests that timing predictability aids the propagation of repetition effects upstream the auditory pathway, most likely from association auditory cortex (including the planum temporale) toward primary auditory cortex (Heschl's gyrus) and beyond, as judged by the timing of AEP latencies. This outcome calls for attention to stimulation timing in future experiments regarding sensory memory trace formation in AEP measures and stimulus probability encoding in animal models.

Introduction

The auditory system extracts stimulus probabilities from the acoustic scene to predict future events (Winkler et al., 2009): what do we expect on the basis of what we have heard before? Yet, guiding our actions in changing environments also involves anticipating when events will occur. Although the neural mechanisms of stimulus probability encoding have been the focus of much research, their interaction with stimulus timing is little understood.

Several studies showed that activity along sensory pathways is reduced with stimulus probability, a phenomenon supporting the neural representation of stimuli known as “repetition suppression” (Desimone, 1996). In the auditory system, repetition suppression spans multiple spatial and time scales, as revealed by animal single-cell recordings exhibiting stimulus-specific adaptation (SSA) in cortical and subcortical structures (Ulanovsky et al., 2003, 2004; Pérez-González et al., 2005; Reches and Gutfreund, 2008; Anderson et al., 2009; Malmierca et al., 2009; Antunes et al., 2010; Farley et al., 2010; Zhao et al., 2011), human long- and middle-latency auditory-evoked potentials (AEP) (Haenschel et al., 2005; Slabu et al., 2010; Costa-Faidella et al., 2011; Grimm et al., 2011), and fMRI studies (Mutschler et al., 2010). However, none of the abovementioned studies explored the influence of timing regularity on repetition suppression, a subject lightly tapped in human electrophysiology literature, leading to contradictory findings. For example, whether single repeated tones in periodic versus aperiodic sequences elicit a smaller N1–P2 complex of the AEP is still unclear (Nelson et al., 1969; Rothman et al., 1970; Nelson and Lassman, 1977). Similarly, periodicity in an AEP oddball paradigm can yield N1 decrements (Harada et al., 2005), P50 decrements (Moberget et al., 2008), and contradictory effects on preattentive deviance detection (Takegata and Morotomi, 1999; Schwartze et al., 2011). Divergences might arise from using inappropriate stimulation paradigms; because AEP repetition effects occur rapidly, using different stimuli with a variable number of repetitions might prove more instructive (e.g., roving standard paradigm) (Baldeweg et al., 2004; Cowan et al., 1993).

Considering that recent perception theories contemplate repetition suppression as a neural correlate of the precision with which future sensory events can be predicted (i.e., suppression of prediction error) based on past stimulation history (Friston, 2005; Winkler et al., 2009), it is of high interest to assess the relevance of timing predictability in neural adaptation. The present study seeks to provide human electrophysiological evidence showing that temporal predictability enhances the experience-dependent modulation of neural activity associated with probabilistic stimuli.

Here we recorded human AEP to pure tones arranged in a passive roving standard paradigm delivered at isochronous or random time intervals. We aimed to obtain a combined modulation of AEPs generated along the auditory pathway (P50, N1, P2), conforming the repetition positivity (RP), an AEP-reflecting auditory sensory memory trace formation (Haenschel et al., 2005). If repetition suppression is enhanced by stimulus predictability, repetition effects should be greater in isochronous sound sequences. Furthermore, violating probability-based expectancies involving predictable time information should elicit stronger error signals, as indexed by the mismatch negativity (MMN) (Näätänen, 2007).

Materials and Methods

Participants.

Seventeen healthy volunteers (6 male; age, 23–49 years; mean age, 29.18 years; all right-handed) with no history of neurological, psychiatric, or hearing impairment and with normal or corrected-to-normal visual acuity participated in the experiment. All volunteers gave informed consent before their participation. Data from two participants had to be excluded from the analysis; one due to a poor signal-to-noise ratio (<50% of artifact-free epochs in one block), and one due to a muscle artifact time-locked to the onset of the acoustic stimuli (picked up by the electrodes located at the mastoid positions). Ethical approval was obtained from the local ethical committee and all experiments were in accordance with the Declaration of Helsinki.

Stimuli and procedure.

The auditory stimuli consisted of pure sinusoidal tones of 50 ms duration, including 5 ms rise and fall times, generated with Audacity free software (version 1.3; http://audacity.sourceforge.net/). The tones were delivered binaurally through headphones by Presentation software (version 0.70; www.neurobs.com). Each subject adjusted the loudness of the tones to a comfortable level, which was maintained throughout the recording session (average loudness, ∼80 dB SPL). This experiment used a modified version of the roving standard frequency paradigm described by Baldeweg et al. (2004). Here, trains of three, six, or 12 equal tones were randomly delivered without intertrain pauses, with tone frequency varying across trains (Fig. 1). In such a stimulus arrangement, the first tone of a train acts as a low-probability stimulus compared with those of the previous train (deviant stimulus), whereas the last tone of a train acts as a high-probability stimulus inside that train (standard stimulus). This paradigm allowed us to derive two types of measures on the amount of change in the AEPs as a function of stimulus repetition: a direct measure of the adaptation to repetition, indexing sensory-memory trace formation (Haenschel et al., 2005) by comparing the AEPs to the last tone in a train of three, six, or 12 stimuli; and a measure of the neural activity related to deviance detection by means of the MMN, as obtained by subtracting the activity evoked by the standard stimulus from that evoked by the deviant stimulus (Näätänen et al., 1978). The reason for choosing trains of three rather than two stimulus presentations as used in previous studies with roving standard paradigms (Baldeweg et al., 2004, 2006; Haenschel et al., 2005) was to avoid the possibility of a residual MMN to either the preceding train of standards or to the deviant stimulus in the neural response to the repeated tone (Sams et al., 1983). Twenty-five different frequencies were used, ranging from 880 to 2921 Hz, with a frequency ratio between adjacent tones of 0.05 according to the following formula: Δf = (f2 − f1)/(f2 × f1)1/2 (Ulanovsky et al., 2003). To avoid N1 refractoriness effects across trains, the tone frequency of a particular train did not appear in any of the 10 subsequent trains.

Figure 1.

Figure 1.

Schematic diagram of the roving standard frequency paradigm used in this study. Trains of three, six, or 12 equal tones were randomly delivered without intertrain pauses, with tone frequency varying from 880 to 2921 Hz across trains. In this arrangement, the first tone of a train acts as a low-probability stimulus compared with the previous train [deviant stimulus (DEV); black hexagons], whereas the last tone of a train acts as a high-probability stimulus inside that train [standard stimulus (STD); white hexagons]. A, Predictable timing condition. The SOA and the ITI were set constant at 708 ms. B, Unpredictable timing condition. The SOA varied pseudorandomly between 364 and 1062 ms in seven steps of 118 ms, with the constraint that the SOA previous to the last stimulus in a train as well as the ITI were always 708 ms (asterisks).

In the predictable timing condition, the stimulus onset asynchrony (SOA) and the intertrain interval (ITI) were set to 708 ms. In the unpredictable timing condition, the SOA varied pseudorandomly between 364 and 1062 ms in seven steps of 118 ms, equiprobably presented with the constraint that the SOA previous to the last stimulus in a train and the ITI were always 708 ms (Fig. 1B, asterisks). This constraint was adopted to avoid potential baseline confounds in the AEP analysis due to carryover effects from the AEP to the previous stimulus. In total, 150 trains of three, six, and 12 tone repetitions were delivered per condition (900 trains overall). Participants sat in a comfortable chair in a sound-attenuated and electrically shielded room. They were instructed to ignore the sounds and watch a silent movie with subtitles. The auditory stimuli were arranged in three blocks of predictable timing and three of unpredictable timing, delivered at random, with resting pauses in between. The total duration of the experiment was ∼80 min, plus 1 h of EEG recording preparation.

Auditory-evoked potentials recording and analysis.

The electroencephalogram (EEG) was continuously recorded with frequency limits of 0.05–100 Hz and digitized at a sampling rate of 1000 Hz by a SynAmps amplifier (NeuroScan). Ag/AgCl electrodes were used for the EEG acquisition, 18 of which were mounted in a nylon cap (Quik-Cap; Compumedics) at the standard locations Fp1, Fp2, F7, F3, Fz, F4, F8, T7, C3, Cz, C4, T8, P7, P3, Pz, P4, P8, and Oz according to the international 10–20 system. Additionally, two electrodes were positioned over the left and the right mastoids (M1 and M2, respectively). Vertical and horizontal electrooculogram (VEOG and HEOG, respectively) were measured from monopolar electrodes placed below (VEOG) and lateral (HEOG) to the right eye. The ground electrode was placed at Fpz and the common reference electrode was placed at Cpz. All impedances were kept <5 kΩ during the whole recording session.

Data analysis was performed offline using EEGlab v.7 software (Delorme and Makeig, 2004) running under Matlab v7.6 (Mathworks). Continuous EEG data were resampled to 250 Hz and periods contaminated by nonstereotyped muscle artifacts were rejected by visual inspection. Independent component analysis decomposition was applied using the Infomax algorithm, removing blink-related independent components on the basis of their scalp topography and continuous activity (Jung et al., 2000). EOG artifact-corrected data were re-referenced to linked mastoids and filtered from 0.2 to 30 Hz. Epochs of 600 ms, starting 100 ms before stimulus onset and baseline corrected from −100 to 0 ms, were extracted and averaged for each experimental condition separately (12 conditions; standard and deviant stimuli × predictable/unpredictable timing × 3-, 6-, or 12-tone repetition trains; 150 epochs per condition). Before averaging, epochs exceeding an amplitude threshold of ±100 μV were rejected (mean of overall rejected epochs, 22; SD, 14.9). MMN difference waveforms were obtained by subtracting the activity elicited by the last stimulus in a train (i.e., standard stimulus: third, sixth, or 12th) from that elicited by the first stimulus of the subsequent train (i.e., deviant stimulus). For illustration purposes, RP difference waveforms were obtained by subtracting the activity elicited by the third standard stimulus from that elicited by the 12th standard stimulus. To examine the early onset of AEP repetition effects, we computed the mean amplitude in the 60–80 ms time window around the P50 peak at Fz electrode for all standard and deviant AEPs. N1 peak amplitudes were retrieved from all standard and deviant AEPs by detecting the minimum amplitude values in the 80–180 ms time window at Fz. Similarly, P2 peak amplitudes were retrieved from all standard AEPs by detecting the maximum amplitude values in the 120–280 ms time window at Fz. P2 values were not retrieved from deviant stimuli due to a possible overlap with the ongoing MMN and P300 AEP components taking place in the response to improbable stimuli (deviants). Finally, to compare the differential activity between standard and deviant stimuli around the MMN range, we retrieved the mean amplitudes for both stimulus types in a 30 ms time window centered at the individual MMN peak at the Fz electrode (detected as the minimum value in the difference waveforms in the 80–250 ms time window).

Statistical analysis.

Timing predictability and repetition effects and their interactions were assessed by means of repeated-measures ANOVAs with the factors Stimulus (standard vs deviant), Timing predictability (predictable vs unpredictable), and Repetition (3, 6, 12) for all P50, N1, and MMN (time range) measures specified above. Effects on P2 values were computed with repeated-measures ANOVAs with the factors Timing predictability (predictable vs unpredictable) and Repetition (3, 6, 12). Subsequent repeated-measures ANOVAs were performed to assess interaction effects. Linear trends are reported when applicable. The Greenhouse–Geisser correction was applied when the assumption of sphericity was violated. Significant ANOVA effects (alpha level = 0.05) are reported with the partial η2 effect size measure.

To ascertain that stimulus probability and timing predictability affected the studied AEP (P50, N1, P2) and that the expected modulations were not due to other neural processes summing up at the scalp with those components, we computed a topographical measure of global dissimilarity (DISS) (Murray et al., 2008). DISS is an index of configuration differences between two electric fields, independent of their strength. This test provides a statistical means of determining whether the brain networks activated by two conditions differ. DISS equals the square root of the mean of the squared differences between the potentials measured at each electrode (average re-referenced; note that the relative form of the scalp topography is reference-independent), each of which is first scaled to unitary strength by dividing by the instantaneous global field power (the root mean square across the average-referenced electrode values at a given instant in time). DISS can range from 0 (topographic homogeneity) to 2 (topographic inversion). Because DISS is a single measure of the distance between two vectors, a nonparametric statistical test has to be conducted and only pairwise comparisons are allowed with this method. We computed all possible pairwise comparisons between the scalp topographies obtained with the abovementioned measures for the P50, N1, and P2 AEPs, as follows: (1) data were average-referenced, (2) single-subject maps were reassigned to different experimental conditions (i.e., permutations of the data), (3) grand-average AEPs were recalculated, (4) and a DISS value was recalculated for the new grand-average AEP. Five thousand permutations of the data were used to obtain the empirical distribution, and the alpha level to determine whether the observed DISS between two conditions was significantly different compared with the distribution was set to 0.05.

Results

Grand average waveforms evoked to standard and deviant stimuli after three, six, and 12 stimulus presentations for both predictable and unpredictable timing conditions are illustrated in Figure 2A, together with deviant minus standard difference waveforms (Fz electrode). As expected, tones in both conditions elicited the typical AEP waveforms with distinct P50, N1, and P2 components, and the subtraction of the waveforms to low from high probability tones revealed a prominent MMN. Below, we describe in detail the influence of stimulus probability and timing predictability in these AEP components.

Figure 2.

Figure 2.

A, Grand-average waveforms for standard (STD), deviant (DEV), and deviant minus standard differences (DEV-STD DW) in predictable (top) and unpredictable (bottom) timing conditions, separately for trains of three (blue trace), six (red trace), and 12 (green trace) tone presentations, as recorded from Fz electrode. Shaded areas indicate SEM. B, P50 amplitudes in predictable (PT) and unpredictable (UT) timing conditions elicited to standard (white circles) and deviant (black circles) stimuli separately for trains of three, six, and 12 tones (amplitudes in microvolts; error bars denote SEM). P50 amplitude increased with repetition only in the predictable timing condition regardless of stimulus type. C, Same as B, but for N1 amplitudes, which were overall larger for deviant than standard stimuli but decreased with further repetition only for standard stimuli in the predictable timing condition. D, Same as B, but for P2 amplitudes elicited to the standard stimulus. P2 amplitudes increased with tone repetition regardless of timing predictability. E, Same as B, but for amplitudes retrieved in a time window around the MMN. Deviant stimuli elicited more negative amplitudes in the MMN time window (MMNwin) than standard stimuli, but only the latter were affected by repetition, an effect manifested as an increase of positivity, larger in the predictable than the unpredictable timing condition.

Interactions between timing predictability and stimulus repetition effects on brain potentials

Early effects at the P50 range (∼70 ms)

The P50 component of the AEP to standard and deviant stimuli indexed changes of early brain activity to tone repetition that depended on timing predictability (Timing predictability × Repetition interaction: F(2,28) = 3.536, p = 0.043, η2 = 0.202). In the predictable timing condition, the P50 mean amplitude evoked to standard and deviant stimuli was similar and increased as a function of repetition (Stimulus type: F(1,14) = 0.640, p = 0.437; Stimulus type × Repetition interaction: F(2,28) = 0.184, p = 0.833; Repetition effect for standard and deviant stimuli; F(2,28) = 8.685, p = 0.001; η2 = 0.383; linear trend: F(1,14) = 22.601, p = 0.0003; η2 = 0.617), but it showed no significant changes in the unpredictable timing condition (Stimulus type: F(1,14) = 0.954, p = 0.345; Stimulus type × Repetition interaction: F(2,28) = 1.901, p = 0.168; Repetition effect for standard and deviant stimuli: F(2,28) = 0.066, p = 0.936). The topographical distribution of the P50 AEP remained stable across stimulus types and repetitions and did not depend on timing predictability, according to DISS (Murray et al., 2008) on every possible pairwise comparison. Stimulus repetition-related changes in P50 mean amplitude elicited to deviant and standard stimuli in both predictable and unpredictable conditions are illustrated in Figure 2B.

Effects at the N1 range (∼110 ms)

When analyzing the peak amplitudes of the N1 component of the AEP, larger amplitudes were found for deviant than standard stimuli (Stimulus type main effect: F(1,14) = 18.308, p = 0.001, η2 = 0.567). This main effect interacted with repetition (Stimulus type × Repetition interaction: F(2,28) = 5.066, p = 0.009, η2 = 0.285). Thus, when analyzing the N1 peak amplitude separately for deviant stimuli, no significant effects were found (Timing predictability: F(1,14) = 0.916, p = 0.355; Repetition: F(2,28) = 0.940, p = 0.403; Timing predictability × Repetition interaction: F(2,28) = 0.027, p = 0.974), but the repetition effect interacted with timing predictability for standard stimuli (Timing predictability × Repetition interaction: F(2,28) = 4.786, p = 0.016, η2 = 0.255). This repetition effect was present in the predictable timing condition (Repetition main effect: F(2,28) = 11.123, ε = 0.706, p = 0.001, η2 = 0.443; linear trend: F(1,14) = 14.633, p = 0.002, η2 = 0.511) but not in the unpredictable timing condition (F(2,28) = 0.846, p = 0.440). The topographical distribution of the N1 AEP remained stable across stimulus types and repetitions and did not depend on timing predictability according to a DISS analysis on every possible pairwise comparison. N1 peak amplitudes retrieved from standard and deviant stimuli in both predictable and unpredictable timing conditions are plotted as a function of stimulus repetition in Figure 2C.

Effects at the P2 range (∼170 ms)

The analysis of the peak amplitude of the P2 component of the auditory AEP to standard stimuli yielded an increase with repetition regardless of timing predictability, as shown by a significant effect of stimulus repetition without an interaction with timing predictability (Interaction: F(2,28) = 2.334, p = 0.116; Repetition main effect: F(2,28) = 4.899, ε = 0.706, p = 0.028, η2 = 0.259; linear trend: F(1,14) = 6.108, p = 0.027, η2 = 0.304). The topographical distribution of the P2 AEP remained stable across stimulus repetitions and did not depend on timing predictability according to a DISS analysis on every possible pairwise comparison. These changes in P2 peak amplitude with stimulus repetition in both predictable and unpredictable timing conditions are illustrated in Figure 2D.

In addition to the above mentioned analyses, the effect of stimulus repetition on the standard stimuli AEPs and its interaction with stimulus timing can be better seen in Figure 3A where, for illustration purposes, we subtracted the activity elicited by the third from that elicited by the 12th tone in a train, a procedure commonly used to obtain the RP (Haenschel et al., 2005). The figure shows that the RP develops with stimulus repetition at much earlier latencies in the predictable timing (Fig. 3A, blue trace) than in the unpredictable timing (Fig. 3A, red trace) condition, an effect highlighted by the color arrows in the figure, marking the time windows of the P50 (∼70 ms; Fig. 3A, blue arrow) and P2 (∼170 ms; Fig. 3A, red arrow) components found in the standard stimuli AEPs. The typically frontocentral scalp potential distribution of the RP (Haenschel et al., 2005) is plotted in Figure 3B at the abovementioned time windows. This figure shows the lack of repetition effects in the AEP to standard stimuli over the whole scalp, until the P2 time range, in the unpredictable timing condition.

Figure 3.

Figure 3.

A, RP grand-average difference waveforms [AEP to the 12th minus the third standard (STD) stimulus] for predictable (PT; blue trace) and unpredictable (UT; red trace) timing conditions at Fz electrode. Whereas the shape of the RP waveform is similar in both traces, the onset of the significant repetition-related positivity is ∼100 ms earlier in the predictable (blue arrow at the P50 time window, 70 ms) versus the unpredictable (red arrow at the P2 time window, 170 ms) timing condition. B, RP scalp potential distributions at 70, 110, and 170 ms for predictable and unpredictable timing conditions. Note how the development of the frontocentral repetition-related positivity takes place at an earlier latency when stimulation timing is predictable. C, MMN scalp potential distribution after three, six, and 12 tone presentations in predictable and unpredictable timing conditions. Note the repetition-related increase in amplitude and the typical frontocentral distribution of the MMN (using linked mastoids reference).

Interactions between timing predictability and stimulus repetition effects in auditory deviance detection

Auditory deviance detection was influenced both by stimulus repetition and timing predictability, as shown by an analysis of the separate contributions of deviant and standard stimuli to the MMN AEP (Fig. 2E). Repetition effects were modulated by stimulus type and timing predictability (triple interaction between Stimulus type × Timing predictability × Repetition: F(2,28) = 3.396, p = 0.048, η2 = 0.195), being present mainly for standard stimuli and larger in the predictable than in the unpredictable timing conditions. Subsequent analyses showed that in the predictable timing condition, as expected, deviant stimuli elicited more negative amplitudes in the MMN time window overall than standard stimuli (Stimulus type main effect: F(1,14) = 83.758, p = 2.8 × 10−7, η2 = 0.857), although repetition effects were only present in the responses to the latter (Stimulus type × Repetition interaction: F(2,28) = 10.875, p = 0.0003, η2 = 0.437; Repetition effects on deviant stimuli: F(2,28) = 0.159, p = 0.853; Repetition effects on standard stimuli: F(2,28) = 9.109, p = 0.001, η2 = 0.394; linear trend: F(1,14) = 15.093, p = 0.002, η2 = 0.519). In the unpredictable timing condition, deviant stimuli elicited more negative amplitudes in the MMN time window overall than standard stimuli (Stimulus type main effect: F(1,14) = 79.480, p = 3.8 × 10−7, η2 = 0.850) and an interaction between Stimulus type and Repetition was also present (F(2,28) = 3.977, p = 0.030, η2 = 0.221), indicating that repetition effects had opposite directions for deviant (increasing negativity) than standard (increasing positivity) stimuli. However, repetition effects, per se, did not reach significance in standard nor deviant stimuli (Repetition effects on deviant stimuli: F(2,28) = 0.244, p = 0.785; Repetition effects on standard stimuli: F(2,28) = 1.963, p = 0.159). Additionally, the typically frontocentral scalp potential distribution of the MMN (Alho, 1995; Näätänen and Alho, 1995) can be seen in Figure 3C as a function of stimulus probability and timing predictability.

Discussion

The present study constitutes the first demonstration that timing predictability enhances the experience-dependent modulation of neural activity associated with stimulus probability encoding. Specifically, we have shown that isochronous stimulus repetition enhances the early part of the repetition positivity (∼<200 ms), an AEP indexing auditory sensory-memory trace formation. This suggests that predictable timing aids the propagation of repetition effects upstream the auditory pathway, as judged by the timing of AEP latencies. Furthermore, violating probability-based stimulus expectancies involving regular time relations elicited a greater error signal, as reflected by larger MMN amplitudes in the predictable compared with the unpredictable timing condition.

Using roving standard stimulation, enabling the observation of repetition-related effects, Baldeweg and colleagues showed in several studies that tone repetition modulates the AEP components in a conjoined and reliable way: an increase of the P50, decrease of the N1, and increase of the P2 AEPs, which they called the repetition positivity (Baldeweg et al., 1999, 2006; Haenschel et al., 2005). Our data argue for the view of the RP as a nonunitary phenomenon, as the modulation of the underlying AEPs, supported by their stable scalp topographies across the different experimental conditions, might reflect different processing stages of regularity encoding in the auditory pathway.

Particularly, we showed that P2 amplitude increased with repetition regardless of the timing predictability of the sound sequence. P2 amplitude increases with stimulus repetition in timescales of minutes (Baldeweg et al., 1999) and days (Atienza et al., 2002), and correlates with stimulus expectancy driven by local and global stimulus probabilities (for a review of the P2 AEP, see Crowley and Colrain, 2004; Costa-Faidella et al., 2011). This suggests that P2, with neural generators localized to planum temporale (PT) as well as in area 22 (auditory association cortex) (Godey et al., 2001), might reflect the encoding of the “what” aspect of auditory stimulation, in line with the idea that the PT is a crucial structure in the generation of auditory objects (Griffiths and Warren, 2002).

However, the N1 behaved differently, increasing to tone changes regardless of timing predictability, but showing decrements with an increased number of stimulus repetitions only in isochronous sequences. Thus, the N1 evoked by a repeated tone is affected by the “when” aspect of auditory stimulation. Our data seem at odds with a study by Budd and colleagues (1998) in which trains of isochronous tones, including tone repetitions at the first five positions of each train, were delivered to healthy participants. The authors found that after the second repeated tone, N1 amplitude does not reveal a further decrease with repetition. Yet differences might arise from the use of a higher number of repeated stimuli in our paradigm, leading to a stronger memory-trace effect. The reason why this further N1 amplitude decrement only took place in our predictable timing condition needs more consideration. For example, N1 amplitude decreases with temporal and pitch expectations (Lange, 2009), with previous knowledge of the sequence of stimulation (Clementz et al., 2002), and to self-generated tones (Baess et al., 2011). The common aspect in these different studies is that they support the involvement of predictive mechanisms in N1 amplitude attenuation. Following this reasoning, our results show that increasing the predictability of the auditory stimulation both in stimulus probability and stimulus timing leads to a pronounced N1 attenuation (but see Nelson et al., 1969, 1977; Rothman et al., 1970; Harada et al., 2005).

As with the N1, repetition effects of the P50 were only observed in isochronous sequences. The increase of P50 amplitude extends findings from previous studies using roving standard paradigms (Haenschel et al., 2005; Baldeweg et al., 2006) or embedding stimulus repetitions in changing acoustic backgrounds (Dyson et al., 2005). Here we showed that temporal regularity is a necessary requirement to elicit P50 repetition-related amplitude increments, strongly suggesting the existence of an inference generation mechanism involving the encoding of precise temporal contingencies (Clementz et al., 2002; Bendixen et al., 2009).

The fact that the P50 amplitude evoked by a deviant tone was not different from that evoked by its preceding standard supports the view that early AEPs reflect the attempt of the auditory system to predict the sound input in the immediate future (Bendixen et al., 2009), and that deviations from the predicted input are detected at later stages of stimulus processing, possibly reflected by the N1 (∼110 ms) and the MMN (∼150 ms) (Näätänen and Winkler, 1999). Although the correlates of auditory deviance detection found in our study appear at these relatively long latencies (at ∼110 ms after deviance onset and onwards), recent studies have shown mismatch responses at the middle-latency AEPs (∼40 ms) (Slabu et al., 2010; Grimm et al., 2011), supportive of a multistage deviance detection system in the auditory modality (Grimm and Escera, 2011). The disagreement with our results might arise from using shorter interstimulus intervals (<300 ms compared with 708 ms used here), leading to stronger memory-trace effects on deviance detection.

The onset of RP in the latency of P50 (∼70 ms; isochronous condition) implicates the primary auditory cortex in its generation, based on a latency comparison with intracranial generators of human AEP (Liegeois-Chauvel et al., 1991). This, with all caution in comparing different neural scales, makes the RP a possible human electrophysiological counterpart of SSA, with which it shares many properties: both occur without overt attention to sounds, are stimulus-specific, and develop rapidly over multiple timescales (Baldeweg, 2007; Nelken and Ulanovsky, 2007; Costa-Faidella et al., 2011). Although SSA literature is overwhelming (Ulanovsky et al., 2003, 2004; Pérez-González et al., 2005; Reches and Gutfreund, 2008; Anderson et al., 2009; Malmierca et al., 2009; Antunes et al., 2010; Farley et al., 2010; Zhao et al., 2011), to date, no study has attempted to explore SSA–timing interactions. To confirm that an irregular timing dampens the repetition effects on a neuronal scale, further research in animal models tapping the influence of timing predictability in the generation of SSA should prove instructive.

In summary, our study shows that the more regular, and thus predictable, the pattern of incoming sounds is, the shorter the latency of the AEP components exhibiting repetition suppression. Because human AEPs show a systematic hierarchy with latencies up to 70 ms generated along Heschl's gyrus, and later peaks generated in belt and parabelt (planum temporale) areas (Godey et al., 2001), our data suggest that the degree of predictability aids the propagation of repetition suppression upstream the auditory pathway. This idea is not new: Baldeweg (2006) raised it under the term “back-propagation hypothesis,” stating that, with increasing number of repetitions, a stimulus-specific memory trace could be detected at earlier auditory processing stages in a top-down fashion in line with a predictive coding account (Friston, 2005). We extend this notion by including timing as an important variable in the formation of stimulus-specific memory traces at the level of the primary auditory cortex and perhaps further upstream.

It is important to note that sensory-memory trace formation is dependent on short-term synaptic plasticity, mainly mediated via NMDA receptor function, which is also essential in the generation of MMN/RP (Javitt et al., 1996; Umbricht et al., 2000; Näätänen et al., 2011). The lack of RP associated with timing uncertainty in our study might thus suggest the existence of a beat-based mechanism promoting a fine temporal adjustment of active top-down predictive signals (Nobre et al., 2007). A plausible candidate for such a mechanism is the entrainment of brain oscillations to stimulus presentation rate (Lakatos et al., 2008). Rhythmical deflections in the membrane potential could shift the excitability (i.e., depolarization) in local neuronal ensembles (Lakatos et al., 2005), aiding stimulus processing and memory trace formation via NMDA receptor activation. Future research using time-frequency decompositions of the electroencephalographic data may shed light on the interplay between entrained neural oscillations to rhythmic stimulation and repetition suppression.

Footnotes

This work was supported by Grant PSI2009-08063 and the Consolider Ingenio2010 Program (CDS2007-00012) from the Spanish Ministry of Science and Innovation, Grant 2009SGR11 from the Generalitat de Catalunya, and the ICREA Academia Distinguished Professorship awarded to C.E.

References

  1. Alho K. Cerebral generators of mismatch negativity (MMN) and its magnetic counterpart (MMNm) elicited by sound changes. Ear Hear. 1995;16:38–51. doi: 10.1097/00003446-199502000-00004. [DOI] [PubMed] [Google Scholar]
  2. Anderson LA, Christianson GB, Linden JF. Stimulus-specific adaptation occurs in the auditory thalamus. J Neurosci. 2009;29:7359–7363. doi: 10.1523/JNEUROSCI.0793-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Antunes FM, Nelken I, Covey E, Malmierca MS. Stimulus-specific adaptation in the auditory thalamus of the anesthetized rat. PloS One. 2010;5:e14071. doi: 10.1371/journal.pone.0014071. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Atienza M, Cantero JL, Dominguez-Marin E. The time course of neural changes underlying auditory perceptual learning. Learn Mem. 2002;9:138–150. doi: 10.1101/lm.46502. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Baess P, Horváth J, Jacobsen T, Schröger E. Selective suppression of self-initiated sounds in an auditory stream: an ERP study. Psychophysiology. 2011;48:1276–1283. doi: 10.1111/j.1469-8986.2011.01196.x. [DOI] [PubMed] [Google Scholar]
  6. Baldeweg T. Repetition effects to sounds: evidence for predictive coding in the auditory system. Trends Cogn Sci. 2006;10:93–94. doi: 10.1016/j.tics.2006.01.010. [DOI] [PubMed] [Google Scholar]
  7. Baldeweg T. ERP repetition effects and mismatch negativity generation: A predictive coding perspective. J Psychophysiol. 2007;21:204–213. [Google Scholar]
  8. Baldeweg T, Williams JD, Gruzelier JH. Differential changes in frontal and sub-temporal components of mismatch negativity. Int J Psychophysiol. 1999;33:143–148. doi: 10.1016/s0167-8760(99)00026-4. [DOI] [PubMed] [Google Scholar]
  9. Baldeweg T, Klugman A, Gruzelier J, Hirsch SR. Mismatch negativity potentials and cognitive impairment in schizophrenia. Schizophr Res. 2004;69:203–217. doi: 10.1016/j.schres.2003.09.009. [DOI] [PubMed] [Google Scholar]
  10. Baldeweg T, Wong D, Stephan KE. Nicotinic modulation of human auditory sensory memory: evidence from mismatch negativity potentials. Int J Psychophysiol. 2006;59:49–58. doi: 10.1016/j.ijpsycho.2005.07.014. [DOI] [PubMed] [Google Scholar]
  11. Bendixen A, Schröger E, Winkler I. I heard that coming: event-related potential evidence for stimulus-driven prediction in the auditory system. J Neurosci. 2009;29:8447–8451. doi: 10.1523/JNEUROSCI.1493-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Budd TW, Barry RJ, Gordon E, Rennie C, Michie PT. Decrement of the N1 auditory event-related potential with stimulus repetition: habituation vs. refractoriness. Int J Psychophysiol. 1998;31:51–68. doi: 10.1016/s0167-8760(98)00040-3. [DOI] [PubMed] [Google Scholar]
  13. Clementz BA, Barber SK, Dzau JR. Knowledge of stimulus repetition affects the magnitude and spatial distribution of low-frequency event-related brain potentials. Audiol NeuroOtol. 2002;7:303–314. doi: 10.1159/000064444. [DOI] [PubMed] [Google Scholar]
  14. Costa-Faidella J, Grimm S, Slabu L, Díaz-Santaella F, Escera C. Multiple time scales of adaptation in the auditory system as revealed by human evoked potentials. Psychophysiology. 2011;48:774–783. doi: 10.1111/j.1469-8986.2010.01144.x. [DOI] [PubMed] [Google Scholar]
  15. Cowan N, Winkler I, Teder W, Näätänen R. Memory prerequisites of mismatch negativity in the auditory event-related potential (ERP) J Exp Psychol Learn Mem Cogn. 1993;19:909–921. doi: 10.1037//0278-7393.19.4.909. [DOI] [PubMed] [Google Scholar]
  16. Crowley KE, Colrain IM. A review of the evidence for P2 being an independent component process: age, sleep and modality. Clin Neurophysiol. 2004;115:732–744. doi: 10.1016/j.clinph.2003.11.021. [DOI] [PubMed] [Google Scholar]
  17. Delorme A, Makeig S. EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J Neurosci Methods. 2004;134:9–21. doi: 10.1016/j.jneumeth.2003.10.009. [DOI] [PubMed] [Google Scholar]
  18. Desimone R. Neural mechanisms for visual memory and their role in attention. Proc Natl Acad Sci U S A. 1996;93:13494–13499. doi: 10.1073/pnas.93.24.13494. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Dyson BJ, Alain C, He Y. I've heard it all before: perceptual invariance represented by early cortical auditory-evoked responses. Brain Res Cogn Brain Res. 2005;23:457–460. doi: 10.1016/j.cogbrainres.2004.11.012. [DOI] [PubMed] [Google Scholar]
  20. Farley BJ, Quirk MC, Doherty JJ, Christian EP. Stimulus-specific adaptation in auditory cortex is an NMDA-independent process distinct from the sensory novelty encoded by the mismatch negativity. J Neurosci. 2010;30:16475–16484. doi: 10.1523/JNEUROSCI.2793-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Friston K. A theory of cortical responses. Philos Trans R Soc Lond B Biol Sci. 2005;360:815–836. doi: 10.1098/rstb.2005.1622. [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Godey B, Schwartz D, de Graaf JB, Chauvel P, Liégeois-Chauvel C. Neuromagnetic source localization of auditory evoked fields and intracerebral evoked potentials: a comparison of data in the same patients. Clin Neurophysiol. 2001;112:1850–1859. doi: 10.1016/s1388-2457(01)00636-8. [DOI] [PubMed] [Google Scholar]
  23. Griffiths TD, Warren JD. The planum temporale as a computational hub. Trends Neurosci. 2002;25:348–353. doi: 10.1016/s0166-2236(02)02191-4. [DOI] [PubMed] [Google Scholar]
  24. Grimm S, Escera C. Auditory deviance detection revisited: evidence for a hierarchical novelty system. Int J Psychophysiology. 2011 doi: 10.1016/j.ijpsycho.2011.05.012. [DOI] [PubMed] [Google Scholar]
  25. Grimm S, Escera C, Slabu L, Costa-Faidella J. Electrophysiological evidence for the hierarchical organization of auditory change detection in the human brain. Psychophysiology. 2011;48:377–384. doi: 10.1111/j.1469-8986.2010.01073.x. [DOI] [PubMed] [Google Scholar]
  26. Haenschel C, Vernon DJ, Dwivedi P, Gruzelier JH, Baldeweg T. Event-related brain potential correlates of human auditory sensory memory-trace formation. J Neurosci. 2005;25:10494–10501. doi: 10.1523/JNEUROSCI.1227-05.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Harada N, Masuda T, Endo H, Nakamura Y, Takeda T, Tonoike M. The effect of 1/f fluctuation in inter-stimulus intervals on auditory evoked mismatch field. Neurosci Lett. 2005;379:223–228. doi: 10.1016/j.neulet.2005.01.012. [DOI] [PubMed] [Google Scholar]
  28. Javitt DC, Steinschneider M, Schroeder CE, Arezzo JC. Role of cortical N-methyl-D-aspartate receptors in auditory sensory memory and mismatch negativity generation: implications for schizophrenia. Proc Natl Acad Sci U S A. 1996;93:11962–11967. doi: 10.1073/pnas.93.21.11962. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Jung TP, Makeig S, Westerfield M, Townsend J, Courchesne E, Sejnowski TJ. Removal of eye activity artifacts from visual event-related potentials in normal and clinical subjects. Clin Neurophysiol. 2000;111:1745–1758. doi: 10.1016/s1388-2457(00)00386-2. [DOI] [PubMed] [Google Scholar]
  30. Lakatos P, Shah AS, Knuth KH, Ulbert I, Karmos G, Schroeder CE. An oscillatory hierarchy controlling neuronal excitability and stimulus processing in the auditory cortex. J Neurophysiol. 2005;94:1904–1911. doi: 10.1152/jn.00263.2005. [DOI] [PubMed] [Google Scholar]
  31. Lakatos P, Karmos G, Mehta AD, Ulbert I, Schroeder CE. Entrainment of neuronal oscillations as a mechanism of attentional selection. Science. 2008;320:110–113. doi: 10.1126/science.1154735. [DOI] [PubMed] [Google Scholar]
  32. Lange K. Brain correlates of early auditory processing are attenuated by expectations for time and pitch. Brain Cogn. 2009;69:127–137. doi: 10.1016/j.bandc.2008.06.004. [DOI] [PubMed] [Google Scholar]
  33. Liegeois-Chauvel C, Musolino A, Chauvel P. Localization of the primary auditory area in man. Brain. 1991;114:139–151. [PubMed] [Google Scholar]
  34. Malmierca MS, Cristaudo S, Pérez-González D, Covey E. Stimulus-specific adaptation in the inferior colliculus of the anesthetized rat. J Neurosci. 2009;29:5483–5493. doi: 10.1523/JNEUROSCI.4153-08.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Moberget T, Karns CM, Deouell LY, Lindgren M, Knight RT, Ivry RB. Detecting violations of sensory expectancies following cerebellar degeneration: a mismatch negativity study. Neuropsychologia. 2008;46:2569–2579. doi: 10.1016/j.neuropsychologia.2008.03.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Murray MM, Brunet D, Michel CM. Topographic ERP analyses: a step-by-step tutorial review. Brain Topogr. 2008;20:249–264. doi: 10.1007/s10548-008-0054-5. [DOI] [PubMed] [Google Scholar]
  37. Mutschler I, Wieckhorst B, Speck O, Schulze-Bonhage A, Hennig J, Seifritz E, Ball T. Time scales of auditory habituation in the amygdala and cerebral cortex. Cereb Cortex. 2010;20:2531–2539. doi: 10.1093/cercor/bhq001. [DOI] [PubMed] [Google Scholar]
  38. Näätänen R. The mismatch negativity: where is the big fish? J Psychophysiol. 2007;21:133–137. [Google Scholar]
  39. Näätänen R, Alho K. Generators of electrical and magnetic mismatch responses in humans. Brain Topogr. 1995;7:315–320. doi: 10.1007/BF01195257. [DOI] [PubMed] [Google Scholar]
  40. Näätänen R, Winkler I. The concept of auditory stimulus representation in cognitive neuroscience. Psychol Bull. 1999;125:826–859. doi: 10.1037/0033-2909.125.6.826. [DOI] [PubMed] [Google Scholar]
  41. Näätänen R, Gaillard AW, Mäntysalo S. Early selective-attention effect on evoked potential reinterpreted. Acta Psychologica. 1978;42:313–329. doi: 10.1016/0001-6918(78)90006-9. [DOI] [PubMed] [Google Scholar]
  42. Näätänen R, Kujala T, Kreegippu K, Carlson S, Escera C, Baldeweg T, Ponton C. The mismatch negativity: an index of cognitive decline in neuropsychiatric and neurological diseases and in aging. Brain. 2011 doi: 10.1093/brain/awr064. [DOI] [PubMed] [Google Scholar]
  43. Nelken I, Ulanovsky N. Mismatch negativity and stimulus-specific adaptation in animal models. J Psychophysiol. 2007;21:214–223. [Google Scholar]
  44. Nelson DA, Lassman FM. Re-examination of the effects of periodic and aperiodic stimulation on the auditory-evoked vertex response. Audiology. 1977;16:409–418. doi: 10.3109/00206097709071854. [DOI] [PubMed] [Google Scholar]
  45. Nelson DA, Lassman FM, Hoel RL. The effects of variable-interval and fixed-interval signal presentation schedules on the auditory evoked response. J Speech Hear Res. 1969;12:199–209. doi: 10.1044/jshr.1201.199. [DOI] [PubMed] [Google Scholar]
  46. Nobre A, Correa A, Coull J. The hazards of time. Curr Opin Neurobiol. 2007;17:465–470. doi: 10.1016/j.conb.2007.07.006. [DOI] [PubMed] [Google Scholar]
  47. Pérez-González D, Malmierca MS, Covey E. Novelty detector neurons in the mammalian auditory midbrain. Eur J Neurosci. 2005;22:2879–2885. doi: 10.1111/j.1460-9568.2005.04472.x. [DOI] [PubMed] [Google Scholar]
  48. Reches A, Gutfreund Y. Stimulus-specific adaptations in the gaze control system of the barn owl. J Neurosci. 2008;28:1523–1533. doi: 10.1523/JNEUROSCI.3785-07.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Rothman HH, Davis H, Hay IS. Slow evoked cortical potentials and temporal features of stimulation. Electroencephalogr Clin Neurophysiol. 1970;29:225–232. doi: 10.1016/0013-4694(70)90135-5. [DOI] [PubMed] [Google Scholar]
  50. Sams M, Alho K, Näätänen R. Sequential effects on the ERP in discriminating two stimuli. Biol Psychol. 1983;17:41–58. doi: 10.1016/0301-0511(83)90065-0. [DOI] [PubMed] [Google Scholar]
  51. Schwartze M, Rothermich K, Schmidt-Kassow M, Kotz SA. Temporal regularity effects on pre-attentive and attentive processing of deviance. Biol Psychol. 2011;87:146–151. doi: 10.1016/j.biopsycho.2011.02.021. [DOI] [PubMed] [Google Scholar]
  52. Slabu L, Escera C, Grimm S, Costa-Faidella J. Early change detection in humans as revealed by auditory brainstem and middle-latency evoked potentials. Eur J Neurosci. 2010;32:859–865. doi: 10.1111/j.1460-9568.2010.07324.x. [DOI] [PubMed] [Google Scholar]
  53. Takegata R, Morotomi T. Integrated neural representation of sound and temporal features in human auditory sensory memory: an event-related potential study. Neurosci Lett. 1999;274:207–210. doi: 10.1016/s0304-3940(99)00711-9. [DOI] [PubMed] [Google Scholar]
  54. Ulanovsky N, Las L, Nelken I. Processing of low-probability sounds by cortical neurons. Nat Neurosci. 2003;6:391–398. doi: 10.1038/nn1032. [DOI] [PubMed] [Google Scholar]
  55. Ulanovsky N, Las L, Farkas D, Nelken I. Multiple time scales of adaptation in auditory cortex neurons. J Neurosci. 2004;24:10440–10453. doi: 10.1523/JNEUROSCI.1905-04.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Umbricht D, Schmid L, Koller R, Vollenweider FX, Hell D, Javitt DC. Ketamine-induced deficits in auditory and visual context-dependent processing in healthy volunteers: implications for models of cognitive deficits in schizophrenia. Arch Gen Psychiatry. 2000;57:1139–1147. doi: 10.1001/archpsyc.57.12.1139. [DOI] [PubMed] [Google Scholar]
  57. Winkler I, Denham SL, Nelken I. Modeling the auditory scene: predictive regularity representations and perceptual objects. Trends Cogn Sci. 2009;13:532–540. doi: 10.1016/j.tics.2009.09.003. [DOI] [PubMed] [Google Scholar]
  58. Zhao L, Liu Y, Shen L, Feng L, Hong B. Stimulus-specific adaptation and its dynamics in the inferior colliculus of rat. Neuroscience. 2011;181:163–174. doi: 10.1016/j.neuroscience.2011.01.060. [DOI] [PubMed] [Google Scholar]

Articles from The Journal of Neuroscience are provided here courtesy of Society for Neuroscience

RESOURCES