Advances in neuroscience have led to the emergence of two complementary theories of neural information processing. First, the “Bayesian brain hypothesis” proposes that the brain actively predicts and represents incoming sensory information as probabilities that are updated in a near-optimal Bayesian manner (Friston, 2010; Aitchison and Lengyel, 2017). Second, “predictive coding” accounts posit that these predictions are generated from higher-level cortical regions and passed down toward early sensory regions. The difference between predicted and actual stimulus information results in a prediction error signal that is propagated bottom-up along the cortical hierarchy to update future predictions (Spratling, 2017). Importantly, predictive coding assumes that top-down predictions relate not only to the content of the sensory input, but also to the precision of such predictions (Heilbron and Chait, 2018). Predictions about precision modulate the gain of prediction errors so that surprises are attenuated in high-uncertainty environments.
Predictive processing in the auditory domain has traditionally been investigated using oddball paradigms. Typically, subjects are presented with a continuous stream of repeated sounds (“standards”). With a given probability, some sounds (“deviants”) are randomly modified by changing the pitch, timbre, timing, or volume. In neural electrophysiological activity recorded using electroencephalography (EEG) or magnetoencephalography (MEG), deviants evoke an increased negativity at ∼100–250 ms after stimulus onset compared with standards (Heilbron and Chait, 2018). This is known as a mismatch negativity (MMN) and is thought to largely reflect precision-weighted prediction errors (Lieder et al., 2013), which are driven by neural adaptation in lower auditory regions and short-term synaptic changes between frontal and temporal cortices (Garrido et al., 2009).
Because the MMN is a function of both prediction error and precision, separating the neural mechanisms of precision estimation from prediction errors remains challenging. Lecaignard et al. (2022) sought to address this via computational modeling of neural responses evoked by auditory stimulation, recorded using simultaneous EEG-MEG. To identify mechanisms underlying precision, the predictability of deviants was manipulated in a passive auditory oddball paradigm. In the low-uncertainty condition, a block consisted of two standards followed by a deviant tone, then three standards followed by a deviant tone, and so forth until eight standards were followed by a deviant. In the high-uncertainty condition, the number of standards preceding a deviant was randomly shuffled in each block. This manipulation ensured that the deviant probability remained constant, despite the two conditions affording different levels of predictability.
On the computational level, the authors used Bayesian model comparison to test the relative extent to which three model classes reconstructed the observed neural responses. Baseline models assume every tone elicits the same neural response. Change detector models compute the difference between incoming and preceding tones. Learning models predict deviant probability, recursively update predictions in a Bayesian manner, and introduce a learning constant, τ, to govern the weight previous trials have on current predictions. Lecaignard et al. (2022) found that learning models were best able to reconstruct evoked responses at the MMN time window. In particular, although the MMN did not vary for the two conditions given the same values of τ, the estimated value of τ was significantly larger when uncertainty was low.
On the neurophysiological level, dynamic causal modeling (DCM) was used to examine effective connectivity during the perception of standards and deviants in the two conditions. The following two types of connections are distinguished in DCM: extrinsic connections between cortical regions; and intrinsic connections within a cortical region. Extrinsic connections originate from excitatory pyramidal cells in infragranular layers of a source region and drive activity in a target region, while intrinsic connections modulate synaptic gain of infragranular pyramidal cells via local interneurons connecting infragranular, granular, and supragranular layers (Kiebel et al., 2008).
DCM allows one to infer connectivity dynamics by selecting the network architecture that most plausibly generates the observed neural activity and examining the coupling strength of extrinsic and intrinsic connections. In line with previous work (Auksztulewicz et al., 2017; Quiroga-Martinez et al., 2021), the most plausible network found by Lecaignard et al. (2022) comprised bidirectional connections between Heschl's gyrus (primary auditory cortex) and the planum polare, the planum polare and the inferior frontal gyrus, and the inferior frontal gyrus and the superior frontal area. Notably, they found significantly enhanced self-inhibition in all nodes of the network in the high-uncertainty condition. Furthermore, echoing previous findings, deviants elicited increased top-down and bottom-up connectivity, as well as reduced self-inhibition, compared with standards.
Results in the study by Lecaignard et al. (2022) demonstrate the influence of uncertainty on predictive mechanisms during auditory perception at the cognitive and neural levels. Cognitively, that learning models best explain neural activity in the MMN time window confirms previous work (Ostwald et al., 2012; Lieder et al., 2013) and supports the MMN as a neural signature reflecting the ability of the brain to infer and learn statistically. Importantly, their key contribution is showing that the learning constant τ in learning models is larger when uncertainty is low. This demonstrates that information from previous stimuli—regardless of whether they were standards or deviants—is upweighted in a more predictable context. This implies that the brain flexibly modulates its temporal integration window—depending on the uncertainty of the context—to most efficiently exploit information in its environment to generate future predictions. However, because τ is estimated from the data and not explained by the model, the observed difference indicates only a consequence of and not evidence for predictions of precision, as hypothesized in predictive coding. The precise mechanisms describing the computation of precision remains open.
Nevertheless, Lecaignard et al. (2022) show that, at the neural level, precision weighting is manifested as changes in local synaptic gain. In particular, self-inhibition is increased when the context is not conducive to making reliable predictions. Integrating this result with previous findings on the MMN (for review, see Garrido et al., 2009; Heilbron and Chait, 2018), a plausible mechanism could be as follows: upon hearing an auditory deviant, infragranular pyramidal cells along the cortical hierarchy between Heschl's gyrus and frontal regions are disinhibited by local interneurons, thereby enhancing the bottom-up propagation of predictions errors and top-down predictions. However, the extent of disinhibition is modulated by the uncertainty of the context, with increased disinhibition in more predictable contexts. This could facilitate statistical learning via changes in frontal–temporal coupling strength, which could result in an altered temporal integration time window as estimated by the learning parameter τ.
Separating mechanisms behind the generation of surprise from the uncertainty of the context is not only relevant to low-level auditory perception, but also to understanding how complex stimuli such as music can evoke pleasure in the listener (Koelsch et al., 2019). Recent work has shown that chords, melodies, and rhythms are most pleasing when they strike an optimal balance between the uncertainty of a prediction and the surprise from what is actually heard (Cheung et al., 2019; Gold et al., 2019; Matthews et al., 2020). Furthermore, current evidence reveals that the interaction between uncertainty and surprise in music engages not only the auditory cortex, but also regions in the mesolimbic reward network including the nucleus accumbens, amygdala, and hippocampus (Cheung et al., 2019; Matthews et al., 2020). Although the dynamic coupling between these brain regions during music listening have already been established via correlational approaches (Salimpoor et al., 2013; Shany et al., 2019), future studies could use directed approaches such as DCM to examine the flow of information within each region and within the network. Following results from the study by Lecaignard et al. (2022), we would expect auditory and reward-related regions to show increased intrinsic connectivity because of enhanced local inhibition during uncertain musical contexts.
There are nevertheless two key distinctions between music and auditory deviants. First, music structure is thought to be syntactically organized, with hierarchical relations spanning multiple timescales and beyond the local context (Rohrmeier and Pearce, 2018). Second, these relations are assumed to be established not only on hearing the stimulus, but also in long-term memory after extended exposure to a musical style (Pearce, 2018). Consequently, surprises in musical syntax elicit an early right anterior negativity, which is thought to be distinct from the MMN (Koelsch, 2009; Koelsch et al., 2019). However, the extent to which predictive processes integrating short-term acoustic information (e.g., timbre and pitch) and long-term abstract relationships (e.g., musical syntax) overlap remains unclear (Koelsch, 2009). In particular, neurocomputational mechanisms distinguishing the processing of uncertainty and surprise in musical syntax remains largely unexplored. A model-comparison approach similar to that of Lecaignard et al. (2022) may prove helpful in this investigation.
In summary, Lecaignard et al. (2022) provide a holistic account of how uncertainty modulates surprises in auditory perception at the neural and cognitive levels. Using neurophysiological and computational modeling of the MMN, they show that uncertainty influences local inhibitory synaptic dynamics and integration of past information when forming probabilistic predictions. These findings not only further support Bayesian inference and predictive coding as key neural processing mechanisms, but also provide helpful insights toward understanding the role of predictive processing in our appreciation of complex auditory stimuli that we call “music.”
Footnotes
Editor's Note: These short reviews of recent JNeurosci articles, written exclusively by students or postdoctoral fellows, summarize the important findings of the paper and provide additional insight and commentary. If the authors of the highlighted article have written a response to the Journal Club, the response can be found by viewing the Journal Club at www.jneurosci.org. For more information on the format, review process, and purpose of Journal Club articles, please see http://jneurosci.org/content/jneurosci-journal-club.
This work was supported by the Japan Science and Technology Agency (JSP CREST Grant JPMJCR20D4). Publication fee support was provided under Grant 1022089 from the Burroughs Wellcome Fund to SfN. We thank Dr. Shinichi Furuya for comments on an earlier version of this work.
The authors declare no competing financial interests.
References
- Aitchison L, Lengyel M (2017) With or without you: predictive coding and Bayesian inference in the brain. Curr Opin Neurobiol 46:219–227. 10.1016/j.conb.2017.08.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Auksztulewicz R, Barascud N, Cooray G, Nobre AC, Chait M, Friston K (2017) The cumulative effects of predictability on synaptic gain in the auditory processing stream. J Neurosci 37:6751–6760. 10.1523/JNEUROSCI.0291-17.2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheung VKM, Harrison PMC, Meyer L, Pearce MT, Haynes J-D, Koelsch S (2019) Uncertainty and surprise jointly predict musical pleasure and amygdala, hippocampus, and auditory cortex activity. Curr Biol 29:4084–4092.e4. [DOI] [PubMed] [Google Scholar]
- Friston K (2010) The free-energy principle: a unified brain theory? Nat Rev Neurosci 11:127–138. 10.1038/nrn2787 [DOI] [PubMed] [Google Scholar]
- Garrido MI, Kilner JM, Stephan KE, Friston KJ (2009) The mismatch negativity: a review of underlying mechanisms. Clin Neurophysiol 120:453–463. 10.1016/j.clinph.2008.11.029 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gold BP, Pearce MT, Mas-Herrero E, Dagher A, Zatorre RJ (2019) Predictability and uncertainty in the pleasure of music: a reward for learning? J Neurosci 39:9397–9409. 10.1523/JNEUROSCI.0428-19.2019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heilbron M, Chait M (2018) Great expectations: is there evidence for predictive coding in auditory cortex? Neuroscience 389:54–73. 10.1016/j.neuroscience.2017.07.061 [DOI] [PubMed] [Google Scholar]
- Kiebel SJ, Garrido MI, Moran RJ, Friston KJ (2008) Dynamic causal modelling for EEG and MEG. Cogn Neurodyn 2:121–136. 10.1007/s11571-008-9038-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Koelsch S (2009) Music-syntactic processing and auditory memory: similarities and differences between ERAN and MMN. Psychophysiology 46:179–190. 10.1111/j.1469-8986.2008.00752.x [DOI] [PubMed] [Google Scholar]
- Koelsch S, Vuust P, Friston K (2019) Predictive processes and the peculiar case of music. Trends Cogn Sci 23:63–77. 10.1016/j.tics.2018.10.006 [DOI] [PubMed] [Google Scholar]
- Lecaignard F, Bertrand O, Caclin A, Mattout J (2022) Neurocomputational underpinnings of expected surprise. J Neurosci 42:474–486. 10.1523/JNEUROSCI.0601-21.2021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lieder F, Daunizeau J, Garrido MI, Friston KJ, Stephan KE (2013) Modelling trial-by-trial changes in the mismatch negativity. PLoS Comput Biol 9:e1002911. 10.1371/journal.pcbi.1002911 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Matthews TE, Witek MAG, Lund T, Vuust P, Penhune VB (2020) The sensation of groove engages motor and reward networks. Neuroimage 214:116768. 10.1016/j.neuroimage.2020.116768 [DOI] [PubMed] [Google Scholar]
- Ostwald D, Spitzer B, Guggenmos M, Schmidt TT, Kiebel SJ, Blankenburg F (2012) Evidence for neural encoding of Bayesian surprise in human somatosensation. Neuroimage 62:177–188. 10.1016/j.neuroimage.2012.04.050 [DOI] [PubMed] [Google Scholar]
- Pearce MT (2018) Statistical learning and probabilistic prediction in music cognition: mechanisms of stylistic enculturation. Ann N|Y Acad Sci 1423:378–395. 10.1111/nyas.13654 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Quiroga-Martinez DR, Hansen NC, Højlund A, Pearce M, Brattico E, Holmes E, Friston K, Vuust P (2021) Musicianship and melodic predictability enhance neural gain in auditory cortex during pitch deviance detection. Hum Brain Mapp 42:5595–5608. 10.1002/hbm.25638 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rohrmeier M, Pearce M (2018) Musical syntax I: theoretical perspectives. In: Springer handbook of systematic musicology (Bader R, ed), pp 473–486. Berlin: Springer. [Google Scholar]
- Salimpoor VN, van den Bosch I, Kovacevic N, McIntosh AR, Dagher A, Zatorre RJ (2013) Interactions between the nucleus accumbens and auditory cortices predict music reward value. Science 340:216–219. 10.1126/science.1231059 [DOI] [PubMed] [Google Scholar]
- Shany O, Singer N, Gold BP, Jacoby N, Tarrasch R, Hendler T, Granot R (2019) Surprise-related activation in the nucleus accumbens interacts with music-induced pleasantness. Soc Cogn Affect Neurosci 14:459–470. 10.1093/scan/nsz019 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Spratling MW (2017) A review of predictive coding algorithms. Brain Cogn 112:92–97. 10.1016/j.bandc.2015.11.003 [DOI] [PubMed] [Google Scholar]