Abstract
The predictive coding model of perception proposes that neuronal responses are modulated by the amount of sensory input that the internal prediction cannot account for (i.e., prediction error). However, there is little consensus on what constitutes nonpredicted stimuli. Conceptually, whereas mispredicted stimuli may induce both prediction error generated by prediction that is not perceived and prediction error generated by sensory input that is not anticipated, unpredicted stimuli involves no top-down, only bottom-up, propagation of information in the system. Here, we examined the possibility that the processing of mispredicted and unpredicted stimuli are dissociable at the neurophysiological level using human electroencephalography. We presented participants with sets of five tones in which the frequency of the fifth tones was predicted, mispredicted, or unpredicted. Participants were required to press a key when they detected a softer fifth tone to maintain their attention. We found that mispredicted and unpredicted stimuli are associated with different amount of cortical activity, probably reflecting differences in prediction error. Moreover, relative to predicted stimuli, the mispredicted prediction error manifested as neuronal enhancement and the unpredicted prediction error manifested as neuronal attenuation on the N1 event-related potential component. These results highlight the importance of differentiating between the two nonpredicted stimuli in theoretical work on predictive coding.
SIGNIFICANCE STATEMENT The current research seeks to dissociate the neurophysiological processing of two types of “nonpredicted” stimuli that have long been considered interchangeable: mispredicted and unpredicted stimuli. We found that mispredicted stimuli, which violate predictions, and unpredicted stimuli, which lack predictions, are represented distinctively in the brain. The results will influence the design of experiments on the predictive coding mechanism, in which the contrast between predicted and “nonpredicted” conditions should be specifically defined to reveal the prediction error proper. This is of general interest because it concerns the logic of research investigating all levels of processing (including perceptual, motor, and cognitive processing) in many neuroscientific domains.
Introduction
The predictive coding model of perception postulates that perception entails two distinct neurocomputational components, the top-down propagation of prediction and the bottom-up propagation of prediction error (Rao and Ballard, 1999; Friston, 2005, 2009). Specifically, the flow of information takes place between multiple hierarchical levels harboring both representational units and error units (Egner et al., 2010; Arnal and Giraud, 2012). While the representational units encode prediction about the causal structure of the environment and feed it backward to the next lower level, the error units encode sensory input that prediction fails to explain as prediction error and communicate it forward to the next higher level. Such prediction error is believed to modulate a range of neuronal responses (den Ouden et al., 2012).
Within this framework, as recently pointed out by Arnal and Giraud (2012), there is a conceptual difference between mispredicted and unpredicted stimuli (Fig. 1A). In particular, they proposed that mispredicted stimuli are associated with larger prediction error because prediction fails to match with sensory input, inducing both prediction error generated by prediction that is not perceived and prediction error generated by sensory input that is not anticipated. Conversely, unpredicted stimuli are associated with smaller prediction error because there is only the latter type of prediction error: sensory input that is not anticipated. Meanwhile, predicted stimuli are associated with minimum prediction error because prediction explains away sensory input.
Note that this reasoning presupposes that the representation of prediction is discrete—that is, precisely defined. In this case, a correct prediction perfectly matches with the sensory input, canceling out the bottom-up flow of information. However, it is possible that prediction is not represented in an all-or-none manner. Instead of predicting exclusively the most likely upcoming event, the brain is more likely to represent prediction as a probability distribution (Friston, 2009) in one or both of the following ways. First, the representation of prediction might be based on codes that are to a certain degree imprecise in the sense that the activation in each representational unit is distributed over a set of a continuum of predictions. This would be similar to the population coding scheme involving neurons with a Gaussian tuning curve in representing sensory input (Bitterman et al., 2008). Here, predicted stimuli trigger prediction error to the extent that the probability function representing the prediction is wider than the probability function representing the sensory input (Fig. 1B). Second, the representation of prediction might mimic the regularities of the environment, which not only codes the most likely upcoming event, but also other events to a lesser degree. Therefore, when a “predicted” stimulus is presented, there is always some amount of prediction error resulting from the unfulfilled part of the prediction (Fig. 1C).
In both of the aforementioned scenarios, predicted stimuli should trigger medium prediction error because prediction is represented as a probability distribution that inevitably induces some amount of prediction error. Therefore, although both mispredicted and unpredicted stimuli trigger prediction error, their relation to predicted stimuli may be distinct. Compared with predicted stimuli, mispredicted prediction error would manifest as neuronal enhancement, whereas unpredicted prediction error would manifest as neuronal attenuation given the absence of prediction in this case.
Here, we presented participants with sets of five tones in which the frequency of the fifth tones was predicted, mispredicted, or unpredicted and assessed the ensuing prediction error response with electroencephalography (EEG). If the processing of mispredicted and unpredicted stimuli are indeed different, they should be associated with different amount of cortical activity, particularly in relation to the N1 event-related potential (ERP) component, which is typically considered an electrophysiological indicator for automatic predictive processing (for review, see Bendixen et al., 2012). Moreover, if predictions are represented in a probabilistic manner, the amount of cortical activity triggered by predicted stimuli should be in between the cortical activity to mispredicted and unpredicted stimuli.
Materials and Methods
Participants
Twenty-one healthy volunteers (average age 24; seven males; 15 right-handed) with no history of neurological, psychiatric, or visual/hearing impairments as indicated by self-report participated in the experiment. Participants gave written informed consent and were paid for participation. Ethical approval was granted by the Comité de Protection des Personnes Ile de France II. One participant was excluded from data analysis for having <50% artifact-free trials in any of the conditions of interest, leaving 20 participants in the final sample (average age 25; seven males; 14 right-handed).
Stimuli
Seven sinusoidal tones with a loudness of 80 phons (i.e., 80 dB for tones of 1000 Hz) were generated using MATLAB. The duration of each tone was 50 ms (including 5 ms rise/fall times). The frequency of each tone was within the range of 261.626–493.883 Hz, matching the absolute frequency of a series of seven natural keys on a modern piano (i.e., C4 D4 E4 F4 G4 A4 B4).
From the pool of seven tones, a total of 800 sets of five tones were created for the main experiment. Figure 2 shows a schematic illustration of the design. In the predicted condition, which comprised 600 sets of tones, the frequency of the first four tones was ascending in steps of one natural key and the frequency of the fifth tone was one natural key higher than the fourth tone in the series (e.g., D4-E4-F4-G4-A4). That is, the fifth tone followed the ascending pattern. In the mispredicted condition, which comprised 100 sets of tones, the frequency of the first four tones was ascending in steps of one natural key and the frequency of the fifth tone was one natural key lower than the first tone in the series (e.g., D4-E4-F4-G4-C4); that is, the fifth tone violated the ascending pattern. In the unpredicted condition, which comprised 100 sets of tones, stimuli were created by randomly scrambling the order of the five tones in the predicted and mispredicted conditions. Here, the fifth tone cannot be predicted because the five tones in each sequence were selected from a population of seven tones.
To exclude the possibility that the findings in the main experiment simply reflect how much the tones of interest (i.e., the fifth tones) differ from their preceding tones (i.e., the fourth tones) in terms of frequency, a total of 160 sets of two tones were created for a control experiment. Half of the two-tone sets resemble the fourth and the fifth tones in the predicted condition in the main experiment; that is, the second tones were always one natural key higher than the first tones (e.g., G4–A4). The other half of the two-tone sets resemble the fourth and the fifth tones in the mispredicted condition in the main experiment; that is, the second tones were always four natural key lower than the first tones (e.g., G4–C4). In other words, the frequency difference between the two tones in the control experiment was identical to the frequency difference between the fourth and the fifth tones in the main experiment. However, in the control experiment, the second tones being the one-natural-key-higher tones and the four-natural-key-lower tones are of equal probability and thus equal predictability.
E-prime version 2.0 (Psychology Software Tools) was used for stimulus presentation. The stimulation was randomized individually for each participant and delivered binaurally via headphones (Sennheiser PX200).
Procedures
In the main experiment, a total of 10 blocks of 80 trials were presented. A trial started with the presentation of a fixation cross, which remained on the screen for the duration of the trial. After 500 ms, a set of five tones were presented and each tone was followed by a 500 ms stimulus onset asynchrony (SOA). Ten percent of the fifth tones were of attenuated loudness by 20 dB in each condition. To maintain their attention, participants were required to press a key when they detected a softer fifth tone within a 700 ms long time window after the offset of the fifth tone. Each trial was followed by a jittered intertrial interval (ITI) of 700–800 ms.
In the control experiment, a total of two blocks of 80 trials were presented. A trial started with the presentation of a fixation cross, which remained on the screen for the duration of the trial. After 500 ms, a set of two tones were presented in which each tone was followed by a 500 ms SOA. Ten percent of the second tones were of attenuated loudness by 20 dB in each condition. To maintain their attention, participants were required to press a key when they detected a softer second tone within a 700 ms long time window after the offset of the second tone. Each trial was followed by a jittered ITI of 700–800 ms.
To prevent the main experiment from influencing the control experiment, participants were presented with the control experiment first. The whole study took ∼60 min (i.e., 160 trials × 2500 ms + 800 trials × 4000 ms).
Data recording and analysis
EEG recording and preprocessing.
EEG was recorded with 64 active electrodes (actiCAP; Brain Products) conforming to the international 10–10 system. The sampling rate was 500 Hz. No online/offline filter was used. The Cz served as the reference electrode online. The data were recomputed to average reference offline. Target stimuli of attenuated loudness were removed. Epochs extended from −100 to 500 ms relative to the onset of the last stimulus in each trial (i.e., the fifth stimulus in the main experiment and the second stimulus in the control experiment) using a 100 ms prestimulus baseline. Ocular artifact correction was conducted with independent component analysis in EEGlab (Delorme and Makeig, 2004). Epochs containing voltage deviations exceeding ±100 μV relative to baseline at any of the electrodes were rejected. The trial numbers after artifact rejection in each condition are listed in Table 1.
Table 1.
Main experiment |
Control experiment |
||||
---|---|---|---|---|---|
Predicted | Mispredicted | Unpredicted | One-natural-key-higher | Four-natural-key-lower | |
Mean (range) | 519.60 (485–537) | 86.95 (81–90) | 86.20 (80–90) | 68.70 (54–72) | 68.40 (54–72) |
ERP analysis.
ERP analysis was based on a temporal principal component analysis (PCA) in SPSS 20. Since it was first introduced (Ruchkin et al., 1964; Donchin, 1966), PCA has been considered an effective linear reduction method for multivariate ERP data (Möcks, 1988a, 1988b; Duffy et al., 1992; Chapman and McCrary, 1995; Dien, 1998; Picton et al., 2000; Dien and Frishkoff, 2005; for review, see Kayser and Tenke, 2003, Dien, 2012). PCA statistically decomposes the ERP waveforms into constituent building blocks, which affords objective, data-driven ERP component measures compared with other conventional methods (Kayser et al., 1998; Beauducel et al., 2000; Kayser and Tenke, 2006). Moreover, it is not as susceptible to the influences of high-frequency noises and low-frequency drifts in the data as other conventional methods (Luck, 2005). Covariance matrix and Promax rotation were used here. All components accounting for a total of 99% of the variance (maximum iterations for convergence = 500) were included in the rotation (Promax kappa = 4). The decomposition provided a set of time-variant component loadings reflecting the contribution of each temporal component to the voltage at each time point and a set of time-invariant component scores (calculated using Bartlett method) representing the contribution of each temporal component to the ERP waveforms which can be subject to inferential statistics (Van Boxtel, 1998). The components corresponding to the N1 (i.e., principal component 2 accounting for 15.59% of the variance), the N1/P2 transition in which a mismatch negativity (MMN) was seen in mispredicted condition but not in unpredicted condition (i.e., principal component 3 accounting for 6.01% of the variance), and the P2 (i.e., principal component 4 accounting for 2.87% of the variance) were identified on the basis of the component loading latencies and the component score topographies.
The component scores were averaged across three electrodes showing the largest responses across all conditions independent of experimental manipulation (i.e., N1: F1, F4, FC3; N1/P2 transition: F3, F5, F4; P2: Pz, P2, CPz). Therefore, the averages of these electrodes can be considered as objective representatives of the components to serve as inputs for further statistical analyses. The advantage of averaging three maximum electrodes was twofold. First, it increased the signal-to-noise ratio of the components. Second, it avoided the problems inherited in the analysis of predefined areas that took an average of multiple electrodes over predefined regions, which might not correspond to the true topography in the experiment. Moreover, these electrodes representing the locations of maximum signals for each component corresponded to those reported in the literature (Näätänen and Picton, 1987).
To test for the prediction effects in the main experiment, a one-way repeated-measures ANOVA comparing the ERPs among the predicted, mispredicted, and unpredicted conditions was performed. To test whether the prediction effects in the main experiment simply reflect how much the tones of interest (i.e., the fifth tones) differ from their preceding tones (i.e., the fourth tones) in terms of frequency, a 2 (frequency distance: one-natural-key-higher/four-natural-key-lower conditions) × 2 (predictive setting: main/control experiments) repeated-measures ANOVA was performed.
Descriptive source analyses of the N1 were performed in BESA Research 6.0 using equivalent current dipoles (ECDs) in a 4-shell ellipsoidal head model (Scherg and Von Cramon, 1986). For each condition, a time window at 92–112 ms (i.e., ± 10 ms around the peak of the N1) that showed stable topography was chosen for analysis. Two ECDs were fitted without constraints separately for the grand average ERPs of each condition, resulting in a good fit (residual variance for predicted: 1.67%, mispredicted: 2.31%, and unpredicted: 3.52% conditions).
Results
Behavioral results
Participants' target detection performance was close to ceiling in the main experiment [hit rate: mean = 0.97, SD = 0.04; false alarm rate: mean < 0.001, SD ≤ 0.001; reaction time (RT): mean = 419.29, SD = 55.19] and in the control experiment (hit rate: mean = 0.94, SD = 0.12; false alarm rate: mean = 0.05, SD = 0.07; RT: mean = 404.49, SD = 71.70), indicating that they concentrated on the stimuli as instructed.
ERP results
Figure 3A shows the grand average ERPs on three midline electrodes (i.e., Fz, Cz, Pz). The waveforms were low-pass filtered at 30 Hz for visual presentation purposes. There are evident N1 at ∼100 ms with a frontocentral distribution and P2 at ∼250 ms with a centroparietal distribution. Figure 3B shows the component loadings of 122 components in the temporal PCA. The components corresponding to the N1, the N1/P2 transition, and the P2 in the grand average ERPs were identified on the basis of the component loading latencies and the component score topographies.
Comparisons among the predicted, mispredicted, and unpredicted conditions
On the N1, there was a significant difference among the predicted, mispredicted, and unpredicted conditions (F(2,38) = 36.77, p < 0.001, partial η2 = 0.66). Post hoc pairwise comparisons with Sidak adjustment showed that the N1 was significantly larger in the mispredicted than in the predicted condition (mean difference = −0.42; SE = 0.08; p < 0.001; 95% CI = −0.62–−0.22) and significantly smaller in the unpredicted than in the predicted condition (mean difference = 0.20; SE = 0.07; p < 0.025; 95% CI = 0.02–0.37) (Fig. 4A, left column).
Descriptive source analysis using ECDs showed that the largest contribution to the N1 response in each condition originated near the left and right auditory cortices (Fig. 5). No additional cortical sources were identified with stable solutions.
On the N1/P2 transition, there was a significant difference among the predicted, mispredicted, and unpredicted conditions (F(2,38) = 15.37, p < 0.001, partial η2 = 0.45). Post hoc pairwise comparisons with Sidak adjustment showed that the N1/P2 transition was significantly larger in the mispredicted than in the predicted condition (mean difference = −0.54; SE = 0.14; p < 0.01; 95% CI = −0.92–−0.17), but did not differ between the unpredicted and the predicted conditions (mean difference = 0.15; SE = 0.11; p = 0.46; 95% CI = −0.14–0.43; Fig. 4A, middle column).
On the P2, there was a significant difference among the predicted, mispredicted, and unpredicted conditions (F(2,38) = 4.51, p < 0.05, partial η2 = 0.19). Post hoc pairwise comparisons with Sidak adjustment showed that the P2 was marginally larger in the mispredicted than in the predicted condition (mean difference = 0.36; SE = 0.12; p = 0.026; 95% CI = 0.04–0.67), but did not differ between the unpredicted and predicted conditions (mean difference = 0.05; SE = 0.12; p = 0.96; 95% CI = −0.25–0.36; Fig. 4A, right column).
Interactions between frequency distance and predictive setting
To test whether the prediction effects in the main experiment simply reflect how much the tones of interest (i.e., the fifth tones) differ from their preceding tones (i.e., the fourth tones) in terms of frequency, data from the main experiment (Fig. 4A) and those from the control experiment (Fig. 4B) were conjointly submitted to a 2 (frequency distance: one-natural-key-higher/four-natural-key-lower conditions) × 2 (predictive setting: main/control experiments) repeated-measures ANOVA.
On the N1, there was a significant interaction between frequency distance and predictive setting (F(1,19) = 6.26, p < 0.05, partial η2 = 0.25). Post hoc t tests showed that there was a significant difference between frequency distance in the main experiment (t(19) = 5.42, p < 0.001), but not in the control experiment (t(19) = 1.74, p = 0.10).
On the N1/P2 transition, there was a significant interaction between frequency distance and predictive setting (F(1,19) = 6.35, p < 0.05, partial η2 = 0.25). Post hoc t tests showed that there was a significant difference between frequency distance in the main experiment (t(19) = 3.82, p ≤ 0.001), but not in the control experiment (t(19) = 0.85, p = 0.41).
On the P2, there was no significant interaction between frequency distance and predictive setting (F(1,19) = 1.64, p = 0.22, partial η2 = 0.08). Conversely, there was a significant main effect of frequency distance in which the P2 was smaller in the small-frequency-distance condition than in the large-frequency-distance condition (F(1,19) = 4.95, p < 0.05, partial η2 = 0.21) and a significant main effect of predictive setting in which the P2 was smaller in the main experiment than in the control experiment (F(1,19) = 6.90, p < 0.05, partial η2 = 0.27).
Overall, the significant interactions between frequency distance and predictive setting on the N1 (and even the N1/P2 transition but not the P2) excludes the possibility that the prediction effects in the main experiment simply reflect how much the tones of interest (i.e., the fifth tones) differ from their preceding tones (i.e., the fourth tones) in terms of frequency.
Discussion
We presented participants with sets of five tones, the last of which was predicted, mispredicted, or unpredicted in terms of frequency. We found that mispredicted and unpredicted stimuli are associated with different amounts of cortical activity, probably reflecting differences in prediction error (Friston, 2005; den Ouden et al., 2012). In particular, predicted stimuli did not trigger minimum, but rather medium cortical activity. Relative to predicted stimuli, mispredicted stimuli are associated with neuronal enhancement and unpredicted stimuli are associated with neuronal attenuation on the N1 ERP component. Source localization suggested that the predictive model is generated near the auditory cortices.
First of all, our results demonstrate that the processing of mispredicted and unpredicted stimuli are distinctive. The dissociation conforms to previous research on the MMN reporting a significant “surprise response” by contrasting between a deviant embedded in a standard sequence (i.e., a mispredicted tone) and a deviant embedded in an equiprobable sequence (i.e., an unpredicted tone) (for review, see Jacobsen and Schröger, 2001; Näätänen et al., 2005; but see Ahmed et al., 2011; Astikainen et al., 2011; Nakamura et al., 2011 vs Farley et al., 2010; Fishman and Steinschneider, 2012; Kaliukhovich and Vogels, 2014 for an ongoing debate in animal research). Importantly, in addition, our results demonstrate how mispredicted and unpredicted responses are related to predicted response, which can help to explain the seemingly contradictory observations of prediction-related effects in the literature. For example, although stimulus repetition is believed to induce prediction (Summerfield et al., 2008), the nonpredicted alternation of the familiar stimuli was associated with neuronal enhancement, whereas the nonpredicted alternation of the unfamiliar stimuli was associated with neuronal attenuation (Henson et al., 2000; Fiebach et al., 2005; Gagnepain et al., 2008; Soldan et al., 2008; Müller et al., 2013; Subramaniam et al., 2012). Notably, familiar and unfamiliar stimuli differ in whether there is a preexisting representation (Turk-Browne et al., 2008), which might determine whether a prediction can be encoded in the representational units (Grotheer and Kovács, 2014). It might be that the nonpredicted alternation of the familiar and unfamiliar stimuli were like the mispredicted and unpredicted case, respectively. More closely related to the current research, Hsu et al. (2014a, 2014b) also found seemingly contradictory patterns of activation in two auditory studies using very similar designs. When pairs of two tones were contrasted with each other, more enhanced N1 responses were found for the alternated tones (i.e., the nonpredicted tones) than the repeated tones (i.e., the predicted tones) (Hsu et al., 2014a). Conversely, when pairs of random tones and pairs of rising tones were contrasted, more attenuated N1 responses were found for the random pairs (i.e., the nonpredicted tones) than the rising pairs (i.e., the predicted tones) (Hsu et al., 2014b). The two results seem incompatible when both alternated and random tones are simply considered as “nonpredicted” stimuli. However, the pattern can be explained if alternated and random tones are, respectively, considered as mispredicted and unpredicted stimuli. Specifically, in the former study, alternated tones were embedded in pairs of two identical tones in which there was perceptual regularity. This might facilitate the brain's attempt to form prediction. In this case, alternated tones would be more like mispredicted stimuli, which trigger larger cortical activity relative to predicted stimuli. Conversely, in the latter study, random tones were embedded in a stimulus stream without perceptual regularity and there was little if any prediction involved. In this case, random tones would be more like unpredicted stimuli, which trigger smaller cortical activity relative to predicted stimuli.
The pattern of results is straightforward as concerns the mispredicted stimuli. Relative to predicted stimuli, mispredicted stimuli are associated with enhanced prediction error. This is consistent with a range of findings showing that violation of prediction triggers stronger cortical activity (for review, see Bendixen et al., 2012). The common explanation for this is that, for mispredicted stimuli, prediction fails to match with sensory input. This mismatch therefore induces both prediction error generated by prediction that is not perceived and prediction error generated by sensory input that is not anticipated (Arnal and Giraud, 2012). This idea was also used to explain why a decrease of stimulus repetition probability reduces the size of adaptation effect on hemodynamic responses (visual: Summerfield et al., 2008; Kovács et al., 2012; Grotheer and Kovács, 2014; Mayrhauser et al., 2014; auditory: Andics et al., 2013) and electrophysiological responses (visual: Summerfield et al., 2011; auditory: Todorovic et al., 2011). Specifically, it was proposed that our perceptual apparatus generally predicts stimulation to be consistent from moment to moment because our perceptual context tends to remain stable across time scales (Summerfield et al., 2008). In this manner, a decrease of stimulus repetition probability is considered a violation of prediction, which in turn elicits stronger cortical activity.
A different pattern of results was observed for the unpredicted stimuli. Relative to predicted stimuli, unpredicted stimuli are associated with attenuated prediction error. This is incompatible with the idea that predicted stimuli are associated with minimum prediction error because a fulfilled prediction can explain away all the prediction error (Arnal and Giraud, 2012). Rather, predicted stimuli are associated with some amount of prediction error. This finding provides empirical support for the neurocomputational theory that, instead of encoding prediction in an all-or-none manner, the brain seems to represent prediction as a probability distribution (Friston, 2009) in one or both of the following ways.
First, the representation of prediction might be based on codes that are to a certain degree imprecise in the sense that the activation in each representational unit is distributed over a set of a continuum of predictions. With regard to continuous variables such as frequency, the prediction of the “preferred” frequency would be coded most strongly. This is done similarly to the population coding scheme involving neurons with a Gaussian tuning curve in representing sensory input (Bitterman et al., 2008). Here, predicted stimuli trigger prediction error to the extent that the probability function representing the prediction is wider than the probability function representing the sensory input. Second, the representation of prediction might mimic the regularities of the environment. In the current research, participants might learn that, after four ascending tones, the chance that the step inertia will continue is 86% and the chance that the step inertia will change is 14%. Although the former case is considered the most likely upcoming event, the latter case is also coded in the representational units, only to a lesser degree. Therefore, when a “predicted” stimulus is presented, there is always some amount of prediction error resulting from the unfulfilled part of the prediction. In both of the aforementioned scenarios, which are not mutually exclusive, predicted stimuli trigger some amount of prediction error that is larger or at least equivalent to the unpredicted prediction error.
Note that, in the current research, all stimuli were attended. Can our result pattern be attributed to attention effect? It was suggested the prediction mechanism dynamically interacts with other cognitive operations, particularly attention (Summerfield and Egner, 2009; Segaert et al., 2013). Specifically, the engagement of attention may increase the weighting of sensory information according to the precision of prediction, resulting in heightened cortical activity to predicted relative to nonpredicted stimuli (Rao, 2005; Feldman and Friston, 2010; Kok et al., 2012). However, this idea cannot explain the distinct pattern of cortical activity to mispredicted and unpredicted stimuli reported here, both of which are supposed to be nonpredicted stimuli of low precision of prediction. Conversely, the result pattern might be interpreted in terms of how prediction error can be adjusted depending on the predictability of the input (Friston, 2009). Specifically, prediction error evoked by stimuli occurring in more predictable contexts can be weighted more strongly (Schröger et al., 2015). Here, because the preceding contextual stimuli were more predictable for mispredicted stimuli than unpredicted stimuli, neuronal responses were stronger in the former than the latter. Altogether, the current research suggests that it is necessary to differentiate between the two nonpredicted stimuli in theoretical work on predictive coding.
To summarize, the predictive coding model of perception proposes that neuronal responses reflect the sensory input that the prediction cannot account for (Rao and Ballard, 1999; Friston, 2005, 2009). However, it remains unclear what constitutes the nonpredicted stimuli. Although a conceptual difference between mispredicted and unpredicted stimuli was proposed (Arnal and Giraud, 2012), there is a lack of empirical investigation on this issue. Here, we examined this possibility that the processing of mispredicted and unpredicted stimuli are dissociable at the neurophysiological level. We found that, compared with predicted stimuli, mispredicted stimuli elicited neuronal enhancement, whereas unpredicted stimuli elicited neuronal attenuation. This pattern of results demonstrates the essentialness to differentiate between mispredicted and unpredicted stimuli. Moreover, it shows that predicted stimuli are not necessarily associated with minimum cortical activity. We suggest that prediction is rather represented as a probability distribution, resulting in partially explained and unexplained sensory input and thus medium prediction error.
Footnotes
This work was supported by the European Research Council under the European Union's Seventh Framework Programme (Grant FP7/2007–2013/ERC Grant 263067). We thank the Paris Descartes Platform for Sensorimotor Studies (Université Paris Descartes, Centre National de la Recherche Scientifique, INSERM, Région Ile-de-France) for supporting the experimental work presented here.
The authors declare no competing financial interests.
References
- Ahmed et al., 2011.Ahmed M, Mällo T, Leppänen PH, Hämäläinen J, Ayräväinen L, Ruusuvirta T, Astikainen P. Mismatch brain response to speech sound changes in rats. Front Psychol. 2011;2:283. doi: 10.3389/fpsyg.2011.00283. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Andics et al., 2013.Andics A, Gál V, Vicsi K, Rudas G, Vidnyánszky Z. fMRI repetition suppression for voices is modulated by stimulus expectations. Neuroimage. 2013;69:277–283. doi: 10.1016/j.neuroimage.2012.12.033. [DOI] [PubMed] [Google Scholar]
- Arnal and Giraud, 2012.Arnal LH, Giraud AL. Cortical oscillations and sensory predictions. Trends Cogn Sci. 2012;16:390–398. doi: 10.1016/j.tics.2012.05.003. [DOI] [PubMed] [Google Scholar]
- Astikainen et al., 2011.Astikainen P, Stefanics G, Nokia M, Lipponen A, Cong F, Penttonen M, Ruusuvirta T. Memory-based mismatch response to frequency changes in rats. PLoS One. 2011;6(9) doi: 10.1371/journal.pone.0024208. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Beauducel et al., 2000.Beauducel A, Debener S, Brocke B, Kayser J. On the reliability of augmenting/reducing: peak amplitudes and principal components analysis of auditory evoked potentials. Journal of Psychophysiology. 2000;14:226–240. doi: 10.1027//0269-8803.14.4.226. [DOI] [Google Scholar]
- Bendixen et al., 2012.Bendixen A, SanMiguel I, Schröger E. Early electrophysiological indicators for predictive processing in audition: a review. Int J Psychophysiol. 2012;83:120–131. doi: 10.1016/j.ijpsycho.2011.08.003. [DOI] [PubMed] [Google Scholar]
- Bitterman et al., 2008.Bitterman Y, Mukamel R, Malach R, Fried I, Nelken I. Ultra-fine frequency tuning revealed in single neurons of human auditory cortex. Nature. 2008;451:197–201. doi: 10.1038/nature06476. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Chapman and McCrary, 1995.Chapman RM, McCrary JW. EP component identification and measurement by principal components analysis. Brain Cogn. 1995;27:288–310. doi: 10.1006/brcg.1995.1024. [DOI] [PubMed] [Google Scholar]
- Delorme and Makeig, 2004.Delorme A, Makeig S. EEGLAB: an open source toolbox for analysis of single-trial EEG dynamics including independent component analysis. J Neurosci Methods. 2004;134:9–21. doi: 10.1016/j.jneumeth.2003.10.009. [DOI] [PubMed] [Google Scholar]
- den Ouden et al., 2012.den Ouden HE, Kok P, de Lange FP. How prediction errors shape perception, attention, and motivation. Front Psychol. 2012;3:548. doi: 10.3389/fpsyg.2012.00548. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dien, 1998.Dien J. Addressing misallocation of variance in principal components analysis of event-related potentials. Brain Topography. 1998;11:43–55. doi: 10.1023/A:1022218503558. [DOI] [PubMed] [Google Scholar]
- Dien, 2012.Dien J. Applying principal components analysis to event-related potentials: a tutorial. Dev Neuropsychol. 2012;37:497–517. doi: 10.1080/87565641.2012.697503. [DOI] [PubMed] [Google Scholar]
- Dien and Frishkoff, 2005.Dien J, Frishkoff GA. Principal components analysis of event-related potential datasets. In: Handy T, editor. Event-related potentials: a methods handbook. Cambridge, MA: MIT; 2005. pp. 189–208. [Google Scholar]
- Donchin, 1966.Donchin E. A multivariate approach to the analysis of average evoked potentials. IEEE Trans Biomed Eng. 1966;13:131–139. doi: 10.1109/tbme.1966.4502423. [DOI] [PubMed] [Google Scholar]
- Duffy et al., 1992.Duffy FH, Jones K, Bartels P, McAnulty G, Albert M. Unrestricted principal components analysis of brain electrical activity: issues of data dimensionality, artifact, and utility. Brain Topography. 1992;4:291–307. doi: 10.1007/BF01135567. [DOI] [PubMed] [Google Scholar]
- Egner et al., 2010.Egner T, Monti JM, Summerfield C. Expectation and surprise determine neural population responses in the ventral visual stream. J Neurosci. 2010;30:16601–16608. doi: 10.1523/JNEUROSCI.2770-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Farley et al., 2010.Farley BJ, Quirk MC, Doherty JJ, Christian EP. Stimulus-specific adaptation in auditory cortex is an NMDA-independent process distinct from the sensory novelty encoded by the mismatch negativity. J Neurosci. 2010;30:16475–16484. doi: 10.1523/JNEUROSCI.2793-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Feldman and Friston, 2010.Feldman H, Friston KJ. Attention, uncertainty, and free-energy. Front Hum Neurosci. 2010;4:215. doi: 10.3389/fnhum.2010.00215. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fiebach et al., 2005.Fiebach CJ, Gruber T, Supp GG. Neuronal mechanisms of repetition priming in occipitotemporal cortex: spatiotemporal evidence from functional magnetic resonance imaging and electroencephalography. J Neurosci. 2005;25:3414–3422. doi: 10.1523/JNEUROSCI.4107-04.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fishman and Steinschneider, 2012.Fishman YI, Steinschneider M. Searching for the mismatch negativity in primary auditory cortex of the awake monkey: deviance detection or stimulus specific adaptation? J Neurosci. 2012;32:15747–15758. doi: 10.1523/JNEUROSCI.2835-12.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Friston, 2005.Friston K. A theory of cortical responses. Philos Trans R Soc Lond B Biol Sci. 2005;360:815–836. doi: 10.1098/rstb.2005.1622. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Friston, 2009.Friston K. The free-energy principle: a rough guide to the brain? Trends Cogn Sci. 2009;13:293–301. doi: 10.1016/j.tics.2009.04.005. [DOI] [PubMed] [Google Scholar]
- Gagnepain et al., 2008.Gagnepain P, Chételat G, Landeau B, Dayan J, Eustache F, Lebreton K. Spoken word memory traces within the human auditory cortex revealed by repetition priming and functional magnetic resonance imaging. J Neurosci. 2008;28:5281–5289. doi: 10.1523/JNEUROSCI.0565-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Grotheer and Kovács, 2014.Grotheer M, Kovács G. Repetition probability effects depend on prior experiences. J Neurosci. 2014;34:6640–6646. doi: 10.1523/JNEUROSCI.5326-13.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Henson et al., 2000.Henson R, Shallice T, Dolan R. Neuroimaging evidence for dissociable forms of repetition priming. Science. 2000;287:1269–1272. doi: 10.1126/science.287.5456.1269. [DOI] [PubMed] [Google Scholar]
- Hsu et al., 2014a.Hsu YF, Hämäläinen JA, Waszak F. Repetition suppression comprises both attention-independent and attention-dependent processes. Neuroimage. 2014a;98:168–175. doi: 10.1016/j.neuroimage.2014.04.084. [DOI] [PubMed] [Google Scholar]
- Hsu et al., 2014b.Hsu YF, Hämäläinen JA, Waszak F. Both attention and prediction are necessary for adaptive neuronal tuning in sensory processing. Front Hum Neurosci. 2014b;8:152. doi: 10.3389/fnhum.2014.00152. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jacobsen and Schröger, 2001.Jacobsen T, Schröger E. Is there pre-attentive memory-based comparison of pitch? Psychophysiology. 2001;38:723–727. doi: 10.1111/1469-8986.3840723. [DOI] [PubMed] [Google Scholar]
- Kaliukhovich and Vogels, 2014.Kaliukhovich DA, Vogels R. Neurons in macaque inferior temporal cortex show no surprise response to deviants in visual oddball sequences. J Neurosci. 2014;34:12801–12815. doi: 10.1523/JNEUROSCI.2154-14.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kayser and Tenke, 2003.Kayser J, Tenke CE. Optimizing PCA methodology for ERP component identification and measurement: theoretical rationale and empirical evaluation. Clin Neurophysiol. 2003;114:2307–2325. doi: 10.1016/S1388-2457(03)00241-4. [DOI] [PubMed] [Google Scholar]
- Kayser and Tenke, 2006.Kayser J, Tenke CE. Consensus on PCA for ERP data, and sensibility of unrestricted solutions. Clin Neurophysiol. 2006;117:695–707. [Google Scholar]
- Kayser et al., 1998.Kayser J, Tenke CE, Bruder GE. Dissociation of brain ERP topographies for tonal and phonetic oddball tasks. Psychophysiology. 1998;35:576–590. doi: 10.1017/S0048577298970214. [DOI] [PubMed] [Google Scholar]
- Kok et al., 2012.Kok P, Rahnev D, Jehee JF, Lau HC, de Lange FP. Attention reverses the effect of prediction in silencing sensory signals. Cereb Cortex. 2012;22:2197–2206. doi: 10.1093/cercor/bhr310. [DOI] [PubMed] [Google Scholar]
- Kovács et al., 2012.Kovács G, Iffland L, Vidnyánszky Z, Greenlee MW. Stimulus repetition probability effects on repetition suppression are position invariant for faces. Neuroimage. 2012;60:2128–2135. doi: 10.1016/j.neuroimage.2012.02.038. [DOI] [PubMed] [Google Scholar]
- Luck, 2005.Luck SJ. An introduction to the event-related potential technique. Cambridge, MA: MIT; 2005. [Google Scholar]
- Mayrhauser et al., 2014.Mayrhauser L, Bergmann J, Crone J, Kronbichler M. Neural repetition suppression: evidence for perceptual expectation in object-selective regions. Front Hum Neurosci. 2014;8(225) doi: 10.3389/fnhum.2014.00225. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Möcks, 1988a.Möcks J. Decomposing event-related potentials: a new topographic components model. Biol Psychol. 1988a;26:199–215. doi: 10.1016/0301-0511(88)90020-8. [DOI] [PubMed] [Google Scholar]
- Möcks, 1988b.Möcks J. Topographic components model for event-related potentials and some biophysical considerations. IEEE Trans Biomed Eng. 1988b;35:482–484. doi: 10.1109/10.2119. [DOI] [PubMed] [Google Scholar]
- Müller et al., 2013.Müller NG, Strumpf H, Scholz M, Baier B, Melloni L. Repetition suppression versus enhancement: it's quantity that matters. Cereb Cortex. 2013;23:315–322. doi: 10.1093/cercor/bhs009. [DOI] [PubMed] [Google Scholar]
- Näätänen and Picton, 1987.Näätänen R, Picton T. The N1 wave of the human electric and magnetic response to sound: A review and an analysis of the component structure. Psychophysiology. 1987;24:375–425. doi: 10.1111/j.1469-8986.1987.tb00311.x. [DOI] [PubMed] [Google Scholar]
- Näätänen et al., 2005.Näätänen R, Jacobsen T, Winkler I. Memory-based or afferent processes in mismatch negativity (MMN): a review of the evidence. Psychophysiology. 2005;42:25–32. doi: 10.1111/j.1469-8986.2005.00256.x. [DOI] [PubMed] [Google Scholar]
- Nakamura et al., 2011.Nakamura T, Michie PT, Fulham WR, Todd J, Budd TW, Schall U, Hunter M, Hodgson DM. Front Psychol. 2011;2:367. doi: 10.3389/fpsyg.2011.00367. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Picton et al., 2000.Picton TW, Bentin S, Berg P, Donchin E, Hillyard SA, Johnson R, Jr, Miller GA, Ritter W, Ruchkin DS, Rugg MD, Taylor MJ. Guidelines for using human event-related potentials to study cognition: recording standards and publication criteria. Psychophysiology. 2000;37:127–152. doi: 10.1111/1469-8986.3720127. [DOI] [PubMed] [Google Scholar]
- Rao, 2005.Rao RP. Bayesian inference and attentional modulation in the visual cortex. Neuroreport. 2005;16:1843–1848. doi: 10.1097/01.wnr.0000183900.92901.fc. [DOI] [PubMed] [Google Scholar]
- Rao and Ballard, 1999.Rao RP, Ballard DH. Predictive coding in the visual cortex: a functional interpretation of some extra-classical receptive-field effects. Nat Neurosci. 1999;2:79–87. doi: 10.1038/4580. [DOI] [PubMed] [Google Scholar]
- Ruchkin et al., 1964.Ruchkin DS, Villegas J, John ER. An analysis of average evoked potentials making use of least mean square techniques. Ann N Y Acad Sci. 1964;115:799–826. doi: 10.1111/j.1749-6632.1964.tb50665.x. [DOI] [PubMed] [Google Scholar]
- Scherg and Von Cramon, 1986.Scherg M, Von Cramon D. Evoked dipole source potentials of the human auditory cortex. Electroencephalogr Clin Neurophysiol. 1986;65:344–360. doi: 10.1016/0168-5597(86)90014-6. [DOI] [PubMed] [Google Scholar]
- Schröger et al., 2015.Schröger E, Marzecová A, SanMiguel I. Attention and prediction in human audition: a lesson from cognitive psychophysiology. Eur J Neurosci. 2015;41:641–664. doi: 10.1111/ejn.12816. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Segaert et al., 2013.Segaert K, Weber K, de Lange FP, Petersson KM, Hagoort P. The suppression of repetition enhancement: a review of fMRI studies. Neuropsychologia. 2013;51:59–66. doi: 10.1016/j.neuropsychologia.2012.11.006. [DOI] [PubMed] [Google Scholar]
- Soldan et al., 2008.Soldan A, Zarahn E, Hilton HJ, Stern Y. Global familiarity of visual stimuli affects repetition-related neural plasticity but not repetition priming. Neuroimage. 2008;39:515–526. doi: 10.1016/j.neuroimage.2007.08.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Subramaniam et al., 2012.Subramaniam K, Faust M, Beeman M, Mashal N. The repetition paradigm: enhancement of novel metaphors and suppression of conventional metaphors in the left inferior parietal lobe. Neuropsychologia. 2012;50:2705–2719. doi: 10.1016/j.neuropsychologia.2012.07.020. [DOI] [PubMed] [Google Scholar]
- Summerfield and Egner, 2009.Summerfield C, Egner T. Expectation (and attention) in visual cognition. Trends Cogn Sci. 2009;13:403–409. doi: 10.1016/j.tics.2009.06.003. [DOI] [PubMed] [Google Scholar]
- Summerfield et al., 2008.Summerfield C, Trittschuh EH, Monti JM, Mesulam MM, Egner T. Neural repetition suppression reflects fulfilled perceptual expectations. Nat Neurosci. 2008;11:1004–1006. doi: 10.1038/nn.2163. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Summerfield et al., 2011.Summerfield C, Wyart V, Johnen VM, de Gardelle V. Human scalp electroencephalography reveals that repetition suppression varies with expectation. Front Hum Neurosci. 2011;5(67) doi: 10.3389/fnhum.2011.00067. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Todorovic et al., 2011.Todorovic A, van Ede F, Maris E, de Lange FP. Prior expectation mediates neural adaptation to repeated sounds in the auditory cortex: an MEG study. J Neurosci. 2011;31:9118–9123. doi: 10.1523/JNEUROSCI.1425-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Turk-Browne et al., 2008.Turk-Browne NB, Scholl BJ, Chun MM. Babies and brains: habituation in infant cognition and functional neuroimaging. Front Hum Neurosci. 2008;2:16. doi: 10.3389/neuro.09.016.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Van Boxtel, 1998.Van Boxtel GJ. Computational and statistical methods for analyzing event-related potential data. Behave Res Methods Instr Comp. 1998;30:87–102. doi: 10.3758/BF03209419. [DOI] [Google Scholar]