Converging research efforts have proposed that musical sounds become rewarding through predictive processes in the brain's pleasure networks, including dopamine release in the midbrain (Blood and Zatorre, 2001; Gebauer et al., 2012). In this commentary we address the subtle, yet important distinction between two types of “prediction error” that are sometimes conflated in the music neuroscience literature: (i) reward prediction error (RPE) pertaining to (psychological) expectations of how emotionally rewarding a piece of music will be and (ii) prediction error (PE) pertaining to neuronal computation of sensory input relating to the brain's predictions about music itself. Ultimately, “What is the next chord?” (PE) and “How much will I like the next chord?” (RPE) are distinct—potentially orthogonal—questions. While many sources of fundamental pleasure like food, sex, and drugs are readily quantifiable and show a largely monotonic relationship between stimulus amount and pleasure magnitude (until a given saturation point), sources of higher-order pleasures like music cannot be unambiguously quantified (Berridge and Kringelbach, 2008). More music does not in itself imply greater pleasure. Rather, the pleasure potential of music relies on the interplay of prior learning and dynamic changes in stimulus structure over time (Huron, 2006). We propose that predictive coding under the free-energy principle (Friston, 2009)—under which the brain continuously minimizes PE in the interaction with its environment—has the potential to bridge PE and RPE, thus elucidating domain-specific aspects of musical appreciation.
Salimpoor et al. (2015) take noteworthy first steps toward synthesizing the two research literatures on the neurobiology of reward (e.g., their references 2–9) and on musical expectations (e.g., their references 12–42). From a computational perspective, the former relies on reinforcement learning, which sets up computational principles for maximizing reward value, irrespective of music-structural specifics (Schultz, 2013). The latter deals with predictions concerning musical structure and has been modeled using statistical learning and predictive coding (Vuust et al., 2009; Hansen and Pearce, 2014; Vuust and Witek, 2014; Hansen et al., 2016). In predictive coding, PE is neither “positive” nor “negative” per se, but rather strong/weak on a single continuum (Friston and Stephan, 2007). Positive and negative RPE thus seems inconsistent with the mathematical formulation of predictive coding. Friston and colleagues propose that RPE represents mere surface manifestations of more fundamental computations in the brain (Friston et al., 2009). Specifically, rewarding actions are those that minimize the brain's free energy, thus building a stronger and more accurate model of the world. In other words, many types of reinforcement and procedural learning can be reinterpreted as predictive coding and may in fact render the very notion of value redundant (Friston, 2009).
A key claim of Salimpoor et al. (2015) is that “[w]hen listening to previously unheard music, similar-sounding auditory templates may be ‘activated’ to generate expectations of how the new sounds will unfold [i.e., PE]. If the new sounds were better than expected [i.e., RPE], positive PE would result.” Assessing whether music is “structurally-better-than-expected” requires a clear definition of “structurally good.” In music listening, however, expectations more likely pertain to the structure of music (PE) than to its reward value (RPE) (Huron, 2006; Miranda and Ullman, 2007; Hansen and Pearce, 2014; Vuust and Witek, 2014; Hansen et al., 2016). Accordingly, Salimpoor et al.'s notion of valenced PE with respect to structural continuation is problematic because it conflates expectations about experienced pleasure and perceived sounds. Yet, the authors resort to this in their account of the inverted U-shaped relationship between exposure and musical appreciation, claiming that time between hearings increases the leeway for positive PE (Salimpoor et al., 2015, Box 2). In Figure 1 we provide an alternative explanation, without reference to RPE, emphasizing instead how the certainty of the brain's predictions influences the salience of the ensuing PE (Ross and Hansen, 2016) which may in turn affect the level of experienced pleasure. This is thought to be mediated by the sensitivity or gain of neuronal populations in inferior frontal gyrus to feedforward connections from superior temporal gyrus that mediate PE (Dietz et al., 2014).
Previous studies have found a relationship between reward value and activity in brain structures implicated in positive RPE (Salimpoor et al., 2013). This does, however, not provide causal evidence that musical appreciation is mediated by positive RPE, rather than PE. Moreover, a general focus on RPE circumvents the question of how music evokes pleasure and is assigned reward value in the first place.
So how does dopamine fit into this? Single-cell studies have shown bidirectional coding where changes in dopaminergic activity reflect positive RPE when a reward is greater than expected and negative RPE when it is smaller than expected (Schultz, 2010, 2013). However, empirical evidence indicates that dopamine neurons not only code for expected reward value, but also for the magnitude, timing, probability, and uncertainty of rewards, as well as perceptual salience (Schultz, 2010; Vuust and Kringelbach, 2010). The last possibility, in particular, may relate to PE rather than RPE. Rather than encoding the “prediction error on value,” predictive coding posits that dopamine may encode the “value of prediction error” where value corresponds to precision or incentive salience (Friston and Stephan, 2007; Friston, 2009). For example, one may hypothesize that participants are willing to pay more money for music that they have a strong predictive model for (cf. Salimpoor et al., 2013).
In conclusion, we propose that predictive coding offers a useful framework for understanding the mechanisms that determine when and why music is rewarding. However, it is crucial not to conflate RPE and PE. We regard this as an important distinction that still remains to be adequately studied. To this end, although not the only relevant theory, predictive coding could provide an alternative account of musical reward encapsulated in a general theory of brain function where PE and RPE are treated in a unified manner. In other words, predictive coding of musical structure and its rewarding qualities may be different manifestations of the same underlying computational principles.
Author contributions
NH and MD: conceived of the initial idea for this commentary. NH: wrote the first draft. NH, MD, and PV: all made substantial contributions to the content, critical revision, and final approval of this work and agree to be held accountable for this.
Funding
Center for Music in the Brain is funded by the Danish National Research Foundation (DNRF117). During part of this work, NH was supported by The Ministry of Culture Denmark and an EliteForsk Travel Grant from The Ministry of Higher Education and Science Denmark. MD is supported by the VELUX Foundation.
Conflict of interest statement
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Acknowledgments
The authors would like to acknowledge financial support from the Danish National Research Foundation (PV, NH), The Ministry of Culture Denmark (NH), an EliteForsk Travel Grant from The Ministry of Higher Education and Science Denmark (NH), and the VELUX Foundation (MD).
References
- Berridge K. C., Kringelbach M. L. (2008). Affective neuroscience of pleasure: reward in humans and animals. Psychopharmacology 199, 457–480. 10.1007/s00213-008-1099-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Blood A. J., Zatorre R. J. (2001). Intensely pleasurable responses to music correlate with activity in brain regions implicated in reward and emotion. Proc. Natl. Acad. Sci. U.S.A. 98, 11818–11823. 10.1073/pnas.191355898 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dietz M. J., Friston K. J., Mattingley J. B., Roepstorff A., Garrido M. I. (2014). Effective connectivity reveals right-hemisphere dominance in audiospatial perception: implications for models of spatial neglect. J. Neurosci. 34:5003. 10.1523/jneurosci.3765-13.2014 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Friston K. (2009). The free-energy principle: a rough guide to the brain? Trends Cogn. Sci. 13, 293–301. 10.1016/j.tics.2009.04.005 [DOI] [PubMed] [Google Scholar]
- Friston K. J., Daunizeau J., Kiebel S. J. (2009). Reinforcement learning or active inference? PLoS ONE 4:e6421. 10.1371/journal.pone.0006421 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Friston K. J., Stephan K. E. (2007). Free-energy and the brain. Synthese 159, 417–458. 10.1007/s11229-007-9237-y [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gebauer L., Kringelbach M. L., Vuust P. (2012). Ever-changing cycles of musical pleasure. Psychomusicol. Music Mind Brain 22, 152–167. 10.1037/a0031126 [DOI] [Google Scholar]
- Hansen N. C., Pearce M. (2014). Predictive uncertainty in auditory sequence processing. Front. Psychol. 5:1052. 10.3389/fpsyg.2014.01052 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hansen N. C., Vuust P., Pearce M. (2016). “If you have to ask, you'll never know”: effects of specialised stylistic expertise on predictive processing of music. PLoS ONE 11:e0163584. 10.1371/journal.pone.0163584 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huron D. (2006). Sweet Anticipation: Music and the Psychology of Expectation. Cambridge, MA: MIT Press. [Google Scholar]
- Miranda R. A., Ullman M. T. (2007). Double dissociation between rules and memory in music: an event-related potential study. Neuroimage 38, 331–345. 10.1016/j.neuroimage.2007.07.034 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ross S., Hansen N. C. (2016). Dissociating prediction failure: considerations from music perception. J. Neurosci. 36, 3103–3105. 10.1523/JNEUROSCI.0053-16.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Salimpoor V. N., van den Bosch I., Kovacevic N., McIntosh A. R., Dagher A., Zatorre R. J. (2013). Interactions between the nucleus accumbens and auditory cortices predict music reward value. Science 340, 216–219. 10.1126/science.1231059 [DOI] [PubMed] [Google Scholar]
- Salimpoor V. N., Zald D. H., Zatorre R. J., Dagher A., McIntosh A. R. (2015). Predictions and the brain: how musical sounds become rewarding. Trends Cogn. Sci. 19, 86–91. 10.1016/j.tics.2014.12.001 [DOI] [PubMed] [Google Scholar]
- Schultz W. (2010). Dopamine signals for reward value and risk: basic and recent data. Behav. Brain Funct. 6:24. 10.1186/1744-9081-6-24 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schultz W. (2013). Updating dopamine reward signals. Curr. Opin. Neurobiol. 23, 229–238. 10.1016/j.conb.2012.11.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vuust P., Kringelbach M. L. (2010). The pleasure of making sense of music. Interdiscipl. Sci. Rev. 35, 166–182. 10.1179/030801810X12723585301192 [DOI] [Google Scholar]
- Vuust P., Ostergaard L., Pallesen K. J., Bailey C., Roepstorff A. (2009). Predictive coding of music-brain responses to rhythmic incongruity. Cortex 45, 80–92. 10.1016/j.cortex.2008.05.014 [DOI] [PubMed] [Google Scholar]
- Vuust P., Witek M. A. (2014). Rhythmic complexity and predictive coding: a novel approach to modeling rhythm and meter perception in music. Front. Psychol. 5:1111. 10.3389/fpsyg.2014.01111 [DOI] [PMC free article] [PubMed] [Google Scholar]