Dopamine, Updated: Reward Prediction Error and Beyond

Talia N Lerner; Ashley L Holloway; Jillian L Seiler

doi:10.1016/j.conb.2020.10.012

. Author manuscript; available in PMC: 2022 Apr 1.

Published in final edited form as: Curr Opin Neurobiol. 2020 Nov 14;67:123–130. doi: 10.1016/j.conb.2020.10.012

Dopamine, Updated: Reward Prediction Error and Beyond

Talia N Lerner ^1,^2,^*, Ashley L Holloway ^1,², Jillian L Seiler ^1,³

PMCID: PMC8116345 NIHMSID: NIHMS1639715 PMID: 33197709

Abstract

Dopamine neurons have been intensely studied for their roles in reinforcement learning. A dominant theory of how these neurons contribute to learning is through the encoding of a reward prediction error (RPE) signal. Recent advances in dopamine research have added nuance to RPE theory by incorporating the ideas of sensory prediction error, distributional encoding, and belief states. Further nuance is likely to be added shortly by convergent lines of research on dopamine neuron diversity. Finally, a major challenge is to reconcile RPE theory with other current theories of dopamine function to account for dopamine’s role in movement, motivation, and goal-directed planning.

Introduction

Dopamine neurons of the ventral tegmental area (VTA) have been intensely studied for their role in reinforcement learning. These dopamine neurons encode reward prediction error (RPE) – the difference between predicted and received rewards. RPE encoding in dopamine neurons was first suggested by Schultz et al. in 1997 [1]. The hypothesis that dopamine neurons function to encode RPE gripped the attention of many neuroscientists due to its excellent coherence with the predictions of temporal difference reinforcement models of learning [2,3]. Now, more than 20 years after the initial suggestion, there is little doubt that RPE is an essential part of what dopamine does in the brain. But is it everything? Recent progress in dopamine research has led to stronger formulations of the RPE hypothesis, which have refined the theory while largely confirming the initial observation. However, it remains a challenge to incorporate other hypothesized functions of dopamine, for example in motivation and motor behavior, into the RPE hypothesis. Whether dopamine’s functions can indeed be united under one theory is not clear: many recent studies have also brought attention to previously underappreciated dopamine neuron diversity. This diversity – in gene expression, intrinsic currents, synaptic connectivity, and neurotransmitter co-release – could give rise to a diversity of functions not amenable to description under one theory. In this review, we provide a brief synopsis of exciting advancements in understanding how dopamine neurons encode RPE, as well as an overview of emerging evidence for dopamine’s other functions in reward-related, motor, and cognitive behaviors and their relationship to the theme of cell type diversity.

Advancements in Understanding Dopamine’s Encoding of Reward Prediction Error

The advent of optogenetics in the early 2000s brought a revolution to our investigations of dopamine function. In particular, an important early study by Steinberg et al. [4] formally demonstrated causality in RPE by testing a strong, formal version of the RPE hypothesis using a behavioral procedure known as “blocking.” In the blocking procedure, an animal first learns that a single cue (A) fully predicts a reward (R). Once the association is learned, it “blocks” new learning about redundant cues. For example, if animals are trained that A →R and then that AX →R, they will not learn that X→R. Since A is fully predictive of R, the additional presentation of X does not add any predictive value. Therefore, there is no RPE generated by dopamine neurons at the reward delivery time and no new learning about X occurs. Steinberg et al. found that optogenetically imposing an RPE by stimulating VTA dopamine neurons at the time of reward during compound (AX→R) training could unblock learning. This experiment showed that VTA dopamine neurons were not only computing a signed, quantitative RPE, but that this RPE signal was sufficient to update the animal’s value learning system and change behavior.

Recently, Maes et al. [5] have built on these findings using a similar blocking paradigm in conjunction with optogenetic inhibition of VTA dopamine neurons, in this case during the time of cue presentation. By manipulating cue-evoked dopamine signals, they tested whether cue-evoked dopamine transmission is an RPE signal. They trained animals that A→R, and then that both AX →R and AY→ R. Normally, learning about X and Y would be blocked. The experiment was then to optogenetically inhibit dopamine neurons during presentations of AX (but not AY) and ask whether this inhibition unblocks learning about X compared to Y. This clever design can distinguish between two possibilities for what cue-evoked dopamine might be encoding: if cue-evoked dopamine encodes a value prediction, suppressing it should unblock learning about X (X did not predict reward; therefore, reward was unexpected; therefore, one should learn about X). On the other hand, if cue-evoked dopamine encodes reward prediction error, then the lack of an error should not generate any new learning. The authors found that learning about X and Y was equally blocked. This result provides evidence that dopamine encodes a strictly-defined RPE that is not to be conflated with the prediction itself.

Other recent optogenetic stimulation experiments also support the same conclusion: optogenetic stimulation of VTA dopamine neurons can promote associations between cues and rewards or between cues and other valueless cues, without causing any cues to acquire value [6,7]. There is evidence that predicted value is encoded by inputs to dopamine neurons – e.g., the prefrontal cortex – but NOT by dopamine neurons themselves [8,9]. Together, these experiments help to rule out the direct encoding of value or value prediction by dopamine neurons, solidifying RPE theory.

Additionally, Morrens et al. [10] showed that novel, but not familiar, cues evoke dopamine release and that if dopamine release is inhibited during a novel cue, learning about that cue is impaired. These findings help explain why animals learn more quickly about novel cues. They also fit well with another important, recently developed idea that dopamine acts not only as a prediction error for reward values but also as a prediction error for sensory experiences (“sensory prediction errors”; [11–14]). The idea that cue-evoked dopamine occurs before associations are learned, and helps guide that learning, also fits with ideas about how novelty responses can help mark events with an unknown potential for reward association [15].

How do dopamine neurons encode prediction errors? An exciting new idea, that of a distributional RPE code, is inspired by artificial intelligence research. The idea is that instead of all dopamine neurons encoding a similar mean prediction error, each neuron might encode either a more “optimistic” or more “pessimistic” prediction, distributed around the mean. Using many cells to represent a range of predictions allows the brain to capture a full probability distribution for future rewards and in theory improves reinforcement learning. Indeed, Dabney et al. [16] found strong evidence for distributional RPE encoding in single-unit recordings from VTA dopamine neurons in mice performing probabilistic learning tasks. These findings fit with previous recordings by the same group, which showed that individual VTA dopamine neurons compute complete RPEs (via subtraction), albeit with heterogeneity in scaling [17–19]. Distributional RPE encoding is likely achieved through variations in intrinsic excitability as well as in the complement of inputs each individual dopamine neuron receives from a variety of brain areas [20].

A final, important recent addition to our understanding of how dopamine neurons encode RPE is the proposal that RPEs depend on hidden belief states. A “state” is a neural representation of environmental conditions at a given time. However, deterministic external signals about the animal’s current state are not always available. Therefore, belief states are created in cases where an animal must rely on some probabilistic assessment of what state it is likely to be in. To test whether belief states contribute to RPE encoding by dopamine neurons, several different tests have been designed in which the external cues indicating state are ambiguous. One test used perceptual ambiguity in the cues predicting reward [21], another created ambiguity by using the same cue to predict both large and small rewards in randomly alternating (unsignaled) blocks [22], another used variable cue-reward intervals combined with probabilistically-delivered rewards to cause ambiguity about whether particular trials were long-interval trials vs. omission trials [23]. In all of these manipulations, the responses of VTA dopamine neurons were well accounted for by RPE models incorporating belief states. Therefore, RPE signals are not necessarily accurate representations of error, but reflect an animal’s internal understanding of its environment based on previous experiences in that environment.

Going Beyond Reward Prediction Error

In the studies discussed thus far, researchers focused on VTA dopamine neurons. Even more specifically, these studies focused primarily on dopamine neurons in the lateral VTA, which have a specific set of cellular, molecular, and synaptic properties compared to dopamine neurons as a whole, and which primarily project to the nucleus accumbens lateral shell [24,25]. Not all dopamine neurons are alike. For example, using the blocking paradigm described above, Keiflin et al. [14] showed that while optogenetic stimulation of VTA dopamine neurons at the time of reward can unblock learning, optogenetic stimulation of nearby substantia nigra pars compacta (SNc) dopamine neurons cannot. These results, and others, suggest that SNc dopamine neurons do not encode RPE, despite being capable of acting as a primary reinforcer [14,26–28]. Another set of dopamine neurons in the far lateral portion of the substantia nigra (SNL), which project to the caudal tail of the striatum, receive a distinct array of inputs compared to VTA and SNc, and respond to salient, novel stimuli [29–32]. Other recent studies additionally emphasize heterogeneity in the encoding properties of VTA dopamine neurons [33,34], adding to a story of heterogeneity from earlier studies of projection-defined VTA dopamine neurons [25,35]. Thus, there is a growing need in the field to add specificity in our methods of identifying dopamine neurons and to carefully report which subsets of dopamine neurons are under scrutiny in any given experiment.

Diverse Molecular Phenotypes of Midbrain Dopamine Neurons

Studies identifying heterogeneity in dopamine neurons based on their locations and projection targets have emphasized the need to answer the question: just how heterogenous is the midbrain dopamine system? How do we define dopamine cell types? Molecular approaches are offering some insight. Using single-cell gene expression analyses, researchers have proposed the existence of ~4–7 distinct groups of midbrain dopamine neurons defined by the expression of genes such as Aldh1a1, Sox6, and Vglut2 [36–41]. These exciting studies offer promise in explaining the heterogeneity of dopamine neuron responses in vivo. Still, work remains to be done to map molecularly-defined groups of neurons onto anatomically-defined groups. Poulin et al. [42] took a step in this direction when they used intersectional genetic strategies to create mouse lines for selecting out molecularly-defined dopamine neuron subtypes for study. They mapped the projection patterns of various subtypes and noted different (but overlapping) patterns of forebrain innervation. In terms of input connectivity, previous studies have indicated that projection-defined dopamine neurons receive distinct patterns of input [29,43–45], but it is as yet unclear how this aspect of connectivity interfaces with the molecular phenotypes of dopamine neurons.

How do molecularly-defined populations of dopamine neurons contribute to behavior? One recent study addressed this question for aldehyde dehydrogenase-positive (Aldh1a1+) dopamine neurons in the SNc and found that this population is crucial for the acquisition of motor skills in the rotarod task [46]. However, the question of what Aldh1a1+ dopamine neurons are encoding during motor skill acquisition remains. Another recent study defined VTA dopamine neuron subtypes by their expression of the neuropeptidergic markers Crhr1 (corticotropin-releasing hormone receptor 1) and Cck (cholecystokinin) [47]. They found that Crhr1- and Cck-expressing VTA dopamine neurons project to the core and medial shell of the nucleus accumbens, respectively. These two groups of dopamine neurons were critical for distinct parts of behavioral reinforcement: Crhr1-expressing VTA dopamine neurons were critical for establishing instrumental action-outcome and Pavlovian cue-reward associations, while Cck-expressing VTA dopamine neurons helped motivate responding for a reward after an action-outcome relationship was established. These findings suggest that while most dopamine neurons increase their activity in response to rewards, different subtypes are critical for individual features of reward processing. Future studies examining the interface between neuropeptidergic phenotype and the gene expression patterns defined by single-cell sequencing studies, however, are still necessary to fully define dopaminergic heterogeneity at the molecular level and to align these definitions with behavioral observations.

Dopamine Neurons Use More than Dopamine to Communicate

One important consequence of variations in gene expression in midbrain dopamine neurons is variations in neurotransmitter co-release. It has become increasingly evident that dopamine neurons can release more than their namesake: subsets of dopamine neurons co-release classical fast neurotransmitters such as glutamate and GABA [48–54]. The relevance of co-release for behavior is an area of active investigation and debate. For example, concerning glutamate co-transmission, Wang et al. demonstrated that co-release of glutamate from dopamine terminals in the ventral striatum does not contribute to acquisition of intracranial self-stimulation (ICSS) or real-time place preference (RTPP) for optogenetic stimulation [55]. Meanwhile, Zell et al. found that conditionally ablating tyrosine hydroxylase from neurons that co-release glutamate and dopamine also did not affect ICSS acquisition and RTPP, despite a loss of dopamine release in the ventral striatum [56]. Together, these studies suggest redundant roles of glutamate and dopamine release in “dopamine” neurons, at least in the specific behavioral paradigms examined. Future studies are necessary to ascertain the exact timing, signaling dynamics, and downstream effects of co-transmitter release by dopamine neurons, and elucidate their functions across all of striatum. Notably, the possibility of co-release means that studies of dopamine cell body firing in the midbrain cannot necessarily claim that the downstream circuit effects of this firing are due to dopamine release alone. To determine the contributions of dopamine release per se to downstream circuit function in future experiments, new tools such as fluorescent dopamine sensors [57–60] and sensors allowing researchers to track the biochemical responses of dopamine receptor-expressing neurons during behavior [61] will be instrumental.

Dopamine and Movement

Parkinson’s disease – in which SNc dopamine neurons slowly degenerate – has long made evident the importance of dopamine in spontaneous movement. Yet, it remains unclear how dopamine neurons participate in motor control. Recent studies have deepened our understanding, demonstrating that dopamine neurons – primarily in the SNc but also in the VTA – show phasic activity related to different components of movement such as action choice, initiation, vigor, and velocity [62–70]. These signals have not been explained by RPE theory and, using excitatory optogenetics, have been shown to be sufficient to trigger movement. Still, the physiological role of movement-related dopamine neurons is under debate. Coddington and Dudman have suggested that the portion of dopamine neurons that are naturally active during movement initiation is small and that optogenetic stimulation of dopamine neurons must be extremely supraphysiological to evoke movement directly [71]. Notably, when Wu et al. ablated Aldh1a1+ SNc dopamine neurons (a sizeable portion of the SNc dopamine population, and the dopamine neurons most vulnerable in Parkinson’s Disease), they did not observe immediate parkinsonism, but merely a slight decrease in high-speed movements combined with a profound motor learning deficit [46]. This result suggests that SNc dopamine neurons are not primarily responsible for movement initiation and that parkinsonism arises mostly from slower basal ganglia-wide adaptations to progressive dopamine loss.

Movement-related dopamine neurons are largely located in SNc, where they encode contralateral actions [62,63,72]. Yet, there is also evidence for the representation of movement in VTA dopamine neurons. Using a ‘Go-NoGo’ task, Syed et al. [68] demonstrated that if an animal did not need to initiate movement to obtain a reward, dopamine release in the NAc core in response to a cue predicting the reward was attenuated. Engelhard et al. [33] have also argued for the representation of kinematic variables by at least a subset of VTA dopamine neurons, alongside RPE representation. Furthermore, Hughes et al. used sensitive measurements of force generated by mice in a head-fixed fixed-time interval task to observe three populations of VTA dopamine neurons representing different aspects of forward and backward movement relative to a reward lick spout [70]. Although the firing rates of dopamine neurons they recorded were tightly linked to movement, these movement relationships were only observed in highly trained, not naïve, mice. Based on these results, it seems likely that dopamine neurons track both movement and RPE, often simultaneously in the same cells. There may be good reason for dopamine neurons to track both signals together: to assign credit to our actions accurately, we need to detect when rewards are missed not because of misunderstandings in cue-outcome or action-outcome relationships, but because of errors in motor execution [73; see also 74]. This is an enticing idea, but more work is still required to understand how movement signals interact with RPE to drive motivated behavior and motor learning.

Dopamine, Motivation, and Planning: What Are Dopamine Ramps For?

Dopamine “ramps,” in which dopamine activity and release increases as an animal gets closer to reward, have been proposed as a mechanism of maintaining motivation to work for distal rewards and properly linking credit back to the initial cue or action that resulted in a positive outcome [69,75,76]. Notably, ramping occurs more readily when mice are participating in instrumental rather than Pavlovian tasks [77–79]. The phenomenon has been observed robustly in the VTA and ventral striatum [69,75,76,78–82], as well as in the dorsomedial striatum [77,83]. There is less evidence for strong ramping in the dorsolateral striatum [75,77,83].

The origin of ramping activity remains controversial. Ramping is reliably observed when measuring the postsynaptic release of dopamine (using microdialysis, voltammetry, and dopamine sensor imaging) [69,75–78,81]. It has also been observed when measuring the activity of dopamine axons in the striatum using calcium sensors [77,78,83]. However, controversy exists as to the observation of ramps in cell body activity. Mohebi et al. [81] report an absence of ramping in VTA cell bodies (measured using electrophysiology) even when ramps are observed in the NAc (measured using voltammetry and dopamine sensor imaging), suggesting ramps could be locally generated in axons. However, others have observed ramps when measuring cell body activity, which are more obvious when population measures such as fiber photometry are used instead of electrophysiological recordings of single units [33,78–80,82]. Thus, there is an argument about whether ramps are generated in cell bodies, and, if they are, whether axonal mechanisms could still accentuate them.

What is the computational role of ramps? Again, controversy exists, and the answer may vary depending on the subpopulations of dopamine neurons being examined. To date, investigations of dopamine heterogeneity have largely not been brought to bear on the question of dopamine ramping. One school of thought is that dopamine ramps can be successfully incorporated into RPE theory [2,69,77,84,85]. Ramping could be explained by the back propagation of RPEs during learning and thus used to reinforce action choices [2,69,85]. Mikhael et al. [84] argue that sensory feedback plays an important role in generating ramps from RPE by reducing the uncertainty about time to reward.

Another school of thought is that dopamine ramps are better explained as motivational signals, or as a means of coordinated goal-directed action planning [75,76,79,81]. Song and Lee [82] suggest a model in which a ramping-to-phasic transition in dopamine signaling is related to a reduction in task dimensionality as subjects narrow their focus to relevant stimuli. They suggest that ramps should fade as instrumental learning asymptotes and habits begin to form [76,82]. In contrast, Guru et al [79] observe that ramps generally fade with repetitive training, and provide evidence that ramps are only maintained under conditions where internal goal representations are necessary to support behavior. These explanations of the roles of dopamine ramping in behavior, as well as a fuller characterization of when dopamine ramps occur and how they are generated, await further investigation. At present, the wide range of tasks and recording methods used to detect ramps, as well as the lack of an agreed-upon definition for ramping, are hindering progress in this burgeoning area of interest.

Conclusion

Much exciting progress has been made in dopamine research in recent years, but challenges lie ahead. The field must begin to map studies of computational function onto molecularly-defined groups of dopamine neurons. Such studies would help to clarify whether or not molecularly-defined groups of dopamine neurons all serve to encode RPE, and whether each group of neurons serves single or multiple computational functions. A comprehensive mapping of computational function onto molecularly-defined dopamine cell types will add richness to both RPE theory and other theories on dopamine’s role in supporting reward- and motor-related behaviors.

Going further, we must also better understand how dopamine signals in distinct subsets of neurons are being generated by upstream circuits and interpreted by downstream circuits. On the former problem, input mapping studies have led the way [20,29,43,44,86] but a good deal of work is left to describe the detailed mechanisms by which these inputs contribute to the encoding properties of dopamine neurons, as well as to understand how such connectivity might be altered by sex, across development and aging, and in conditions (e.g., stress) that are related to the emergence of psychiatric conditions. On the latter problem, we are only beginning to understand the transformations that take place between dopamine cell body activity, axon activity, and neurotransmitter release [81]. The astoundingly complex axonal arbors of midbrain dopamine neurons [87] provoke many questions about how action potential propagation is regulated. Complex neuromodulatory mechanisms are known to regulate dopamine release at terminals [88,89]. Different release machinery can regulate dopamine release on different time scales [90,91], and yet further mechanisms may regulate the differential co-release of other neurotransmitters [50,52]. Thus, even when we observe RPE signals while recording dopamine neuron action potentials at cell bodies, we may question whether and in what form downstream circuits are receiving these signals. As researchers tackle these difficult questions in the coming years, theories of dopamine function will, no doubt, again need to be updated.

Highlights.

Reward Prediction Error (RPE) remains a compelling theory accounting for VTA dopamine neuron activity
RPE theory now incorporates sensory prediction errors, distributional encoding, and belief states
Molecular subtyping of dopamine neurons provides a scaffold for probing additional computational functions
Whether dopamine neuron activity related to movement, motivation, and goal-directed planning can be explained by RPE is under debate

Acknowledgments

This work was supported by the National Institutes of Health [DP2MH122401 to TNL]; and the National Science Foundation [DGE-1842165 to ALH].

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Conflict of Interest Statement

The authors have no conflicts of interest to declare.

References

*of special interest

**of outstanding interest

1.Schultz W, Dayan P, Montague PR: A Neural Substrate of Prediction and Reward. Science 1997, 275:1593–1599. [DOI] [PubMed] [Google Scholar]
2.Niv Y, Schoenbaum G: Dialogues on prediction errors. Trends in Cognitive Sciences 2008, 12:265–272. [DOI] [PubMed] [Google Scholar]
3.Glimcher PW: Understanding dopamine and reinforcement learning: The dopamine reward prediction error hypothesis. PNAS 2011, 108:15647–15654. [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Steinberg EE, Keiflin R, Boivin JR, Witten IB, Deisseroth K, Janak PH: A causal link between prediction errors, dopamine neurons and learning. Nat Neurosci 2013, 16:966–973. [DOI] [PMC free article] [PubMed] [Google Scholar]
**5.Maes EJP, Sharpe MJ, Usypchuk AA, Lozzi M, Chang CY, Gardner MPH, Schoenbaum G, Iordanova MD: Causal evidence supporting the proposal that dopamine transients function as temporal difference prediction errors. Nat Neurosci 2020, 23:176–178. [DOI] [PMC free article] [PubMed] [Google Scholar]; This brief but elegant study is well-designed to distinguish between prediction error and predicted value. The distinction is important and often not made clearly, muddying the interpretation of experimental results. Using inhibitory optogenetics to prevent cue-evoked dopamine signals, Maes et al find that the dopamine neuron activity observed in response to a reward-predictive cue is a prediction error, not a signal about the value of the cue.
*6.Sharpe MJ, Batchelor HM, Mueller LE, Yun Chang C, Maes EJP, Niv Y, Schoenbaum G: Dopamine transients do not act as model-free prediction errors during associative learning. Nat Commun 2020, 11:106. [DOI] [PMC free article] [PubMed] [Google Scholar]; This study is complementary to the study by Maes et al, showing that dopamine neuron activity does not add value to cues as cue-outcome associations are learned. Three separate experiments using excitatory optogenetics to stimulate dopamine neuron activity confirm that dopamine stimulation supports associative learning about antecedent cues without evidence of a cached value being ascribed to the cue.
7.Sharpe MJ, Chang CY, Liu MA, Batchelor HM, Mueller LE, Jones JL, Niv Y, Schoenbaum G: Dopamine transients are sufficient and necessary for acquisition of model-based associations. Nature Neuroscience 2017, 20:735–742. [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Bari BA, Grossman CD, Lubin EE, Rajagopalan AE, Cressy JI, Cohen JY: Stable Representations of Decision Variables for Flexible Behavior. Neuron 2019, 103:922–933.e7. [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Mendoza JA, Lafferty CK, Yang AK, Britt JP: Cue-Evoked Dopamine Neuron Activity Helps Maintain but Does Not Encode Expected Value. Cell Rep 2019, 29:1429–1437.e3. [DOI] [PubMed] [Google Scholar]
10.Morrens J, Aydin Ç, Janse van Rensburg A, Esquivelzeta Rabell J, Haesler S: Cue-Evoked Dopamine Promotes Conditioned Responding during Learning. Neuron 2020, 106:142–153.e7. [DOI] [PubMed] [Google Scholar]
11.Takahashi YK, Batchelor HM, Liu B, Khanna A, Morales M, Schoenbaum G: Dopamine Neurons Respond to Errors in the Prediction of Sensory Features of Expected Rewards. Neuron 2017, 95:1395–1405.e3. [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Stalnaker TA, Howard JD, Takahashi YK, Gershman SJ, Kahnt T, Schoenbaum G: Dopamine neuron ensembles signal the content of sensory prediction errors. Elife 2019, 8. [DOI] [PMC free article] [PubMed] [Google Scholar]
13.Gardner MPH, Schoenbaum G, Gershman SJ: Rethinking dopamine as generalized prediction error. Proc Biol Sci 2018, 285. [DOI] [PMC free article] [PubMed] [Google Scholar]
*14.Keiflin R, Pribut HJ, Shah NB, Janak PH: Ventral Tegmental Dopamine Neurons Participate in Reward Identity Predictions. Curr Biol 2019, 29:93–103.e3. [DOI] [PMC free article] [PubMed] [Google Scholar]; This study demonstrated an important distinction between VTA and SNc dopamine neurons’ functions. While previous studies had suggested that both VTA and SNc stimulation could support primary reinforcement in self-stimulation paradigms, and Schultz’s original work on RPE included observations from both VTA and SNc dopamine neurons, this study showed that VTA dopamine stimulation can unblock learning, while SNc dopamine stimulation cannot. The result suggests that SNc dopamine neurons do not encode strictly-defined RPE, despite having reward-related responses. This study also serves as an important to reminder that most of studies of dopamine and RPE are really examining a specific subset of anatomically-defined dopamine neurons in the VTA, and that we have not yet fully squared findings of dopamine neuron heterogeneity with RPE theory.
15.Kakade S, Dayan P: Dopamine: generalization and bonuses. Neural Networks 2002, 15:549–559. [DOI] [PubMed] [Google Scholar]
**16.Dabney W, Kurth-Nelson Z, Uchida N, Starkweather CK, Hassabis D, Munos R, Botvinick M: A distributional code for value in dopamine-based reinforcement learning. Nature 2020, 577:671–675. [DOI] [PMC free article] [PubMed] [Google Scholar]; This innovative study shows that RPEs are represented in the VTA as a distributional code, meaning that although individual VTA dopamine neurons fully encode RPE, they each do so slightly differently, making an array of more optimistic and pessimistic predictions. The distributional encoding of RPE helps explains the apparent redundancy of RPE encoding in the system. Artificial intelligence research has shown that the distributional encoding of RPE improves reinforcement learning.
17.Eshel N, Bukwich M, Rao V, Hemmelder V, Tian J, Uchida N: Arithmetic and local circuitry underlying dopamine prediction errors. Nature 2015, 525:243–246. [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Eshel N, Tian J, Bukwich M, Uchida N: Dopamine neurons share common response function for reward prediction error. Nature Neuroscience 2016, 19:479–486. [DOI] [PMC free article] [PubMed] [Google Scholar]
19.Cohen JY, Haesler S, Vong L, Lowell BB, Uchida N: Neuron-type-specific signals for reward and punishment in the ventral tegmental area. Nature 2012, 482:85–88. [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Tian J, Huang R, Cohen JY, Osakada F, Kobak D, Machens CK, Callaway EM, Uchida N, Watabe-Uchida M: Distributed and Mixed Information in Monosynaptic Inputs to Dopamine Neurons. Neuron 2016, 91:1374–1389. [DOI] [PMC free article] [PubMed] [Google Scholar]
21.Lak A, Nomoto K, Keramati M, Sakagami M, Kepecs A: Midbrain Dopamine Neurons Signal Belief in Choice Accuracy during a Perceptual Decision. Current Biology 2017, 27:821–832. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Babayan BM, Uchida N, Gershman SJ: Belief state representation in the dopamine system. Nat Commun 2018, 9:1891. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Starkweather CK, Babayan BM, Uchida N, Gershman SJ: Dopamine reward prediction errors reflect hidden-state inference across time. Nature Neuroscience 2017, 20:581–589. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Lammel S, Hetzel A, Häckel O, Jones I, Liss B, Roeper J: Unique Properties of Mesoprefrontal Neurons within a Dual Mesocorticolimbic Dopamine System. Neuron 2008, 57:760–773. [DOI] [PubMed] [Google Scholar]
25.Lammel S, Ion DI, Roeper J, Malenka RC: Projection-Specific Modulation of Dopamine Neuron Synapses by Aversive and Rewarding Stimuli. Neuron 2011, 70:855–862. [DOI] [PMC free article] [PubMed] [Google Scholar]
26.Saunders BT, Richard JM, Margolis EB, Janak PH: Dopamine neurons create Pavlovian conditioned stimuli with circuit-defined motivational properties. Nat Neurosci 2018, 21:1072–1083. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Rossi MA, Sukharnikova T, Hayrapetyan VY, Yang L, Yin HH: Operant self-stimulation of dopamine neurons in the substantia nigra. PLoS ONE 2013, 8:e65799. [DOI] [PMC free article] [PubMed] [Google Scholar]
28.Ilango A, Kesner AJ, Keller KL, Stuber GD, Bonci A, Ikemoto S: Similar Roles of Substantia Nigra and Ventral Tegmental Dopamine Neurons in Reward and Aversion. J Neurosci 2014, 34:817–822. [DOI] [PMC free article] [PubMed] [Google Scholar]
29.Menegas W, Bergan JF, Ogawa SK, Isogai Y, Umadevi Venkataraju K, Osten P, Uchida N, Watabe-Uchida M: Dopamine neurons projecting to the posterior striatum form an anatomically distinct subclass. eLife 2015, 4:e10032. [DOI] [PMC free article] [PubMed] [Google Scholar]
30.Menegas W, Babayan BM, Uchida N, Watabe-Uchida M: Opposite initialization to novel cues in dopamine signaling in ventral and posterior striatum in mice. Elife 2017, 6. [DOI] [PMC free article] [PubMed] [Google Scholar]
31.Menegas W, Akiti K, Amo R, Uchida N, Watabe-Uchida M: Dopamine neurons projecting to the posterior striatum reinforce avoidance of threatening stimuli. Nature Neuroscience 2018, 21:1421–1430. [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Steinberg EE, Gore F, Heifets BD, Taylor MD, Norville ZC, Beier KT, Földy C, Lerner TN, Luo L, Deisseroth K, et al. : Amygdala-Midbrain Connections Modulate Appetitive and Aversive Learning. Neuron 2020, 106:1026–1043.e9. [DOI] [PubMed] [Google Scholar]
33.Engelhard B, Finkelstein J, Cox J, Fleming W, Jang HJ, Ornelas S, Koay SA, Thiberge SY, Daw ND, Tank DW, et al. : Specialized coding of sensory, motor and cognitive variables in VTA dopamine neurons. Nature 2019, 570:509–513. [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Cai LX, Pizano K, Gundersen GW, Hayes CL, Fleming WT, Holt S, Cox JM, Witten IB: Distinct signals in medial and lateral VTA dopamine neurons modulate fear extinction at different times. Elife 2020, 9. [DOI] [PMC free article] [PubMed] [Google Scholar]
35.Lammel S, Lim BK, Ran C, Huang KW, Betley MJ, Tye KM, Deisseroth K, Malenka RC: Input-specific control of reward and aversion in the ventral tegmental area. Nature 2012, 491:212–217. [DOI] [PMC free article] [PubMed] [Google Scholar]
36.Poulin J-F, Zou J, Drouin-Ouellet J, Kim K-YA, Cicchetti F, Awatramani RB: Defining Midbrain Dopaminergic Neuron Diversity by Single-Cell Gene Expression Profiling. Cell Reports 2014, 9:930–943. [DOI] [PMC free article] [PubMed] [Google Scholar]
37.Tiklová K, Björklund ÅK, Lahti L, Fiorenzano A, Nolbrant S, Gillberg L, Volakakis N, Yokota C, Hilscher MM, Hauling T, et al. : Single-cell RNA sequencing reveals midbrain dopamine neuron diversity emerging during mouse brain development. Nature Communications 2019, 10:581. [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Poulin J-F, Gaertner Z, Moreno-Ramos OA, Awatramani R: Classification of Midbrain Dopamine Neurons Using Single-Cell Gene Expression Profiling Approaches. Trends in Neurosciences 2020, 43:155–169. [DOI] [PMC free article] [PubMed] [Google Scholar]
39.Hook PW, McClymont SA, Cannon GH, Law WD, Morton AJ, Goff LA, McCallion AS: Single-Cell RNA-Seq of Mouse Dopaminergic Neurons Informs Candidate Gene Selection for Sporadic Parkinson Disease. The American Journal of Human Genetics 2018, 102:427–446. [DOI] [PMC free article] [PubMed] [Google Scholar]
40.La Manno G, Gyllborg D, Codeluppi S, Nishimura K, Salto C, Zeisel A, Borm LE, Stott SRW, Toledo EM, Villaescusa JC, et al. : Molecular Diversity of Midbrain Development in Mouse, Human, and Stem Cells. Cell 2016, 167:566–580.e19. [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Saunders A, Macosko EZ, Wysoker A, Goldman M, Krienen FM, de Rivera H, Bien E, Baum M, Bortolin L, Wang S, et al. : Molecular Diversity and Specializations among the Cells of the Adult Mouse Brain. Cell 2018, 174:1015–1030.e16. [DOI] [PMC free article] [PubMed] [Google Scholar]
**42.Poulin J-F, Caronia G, Hofer C, Cui Q, Helm B, Ramakrishnan C, Chan CS, Dombeck DA, Deisseroth K, Awatramani R: Mapping projections of molecularly defined dopamine neuron subtypes using intersectional genetic approaches. Nat Neurosci 2018, 21:1260–1271. [DOI] [PMC free article] [PubMed] [Google Scholar]; Initial studies by Poulin et al and others have established a landscape of molecular heterogeneity in midbrain dopamine neurons. In this study, Poulin et al. introduce valuable genetic mouse lines for distinguishing between dopamine neuron subpopulations based on their molecular identity. These mouse lines will enable many important future studies of how molecular heterogeneity could contribute to diverse encoding beyond RPE. This study then specifically examined the projection patterns of molecular-defined dopamine neurons, data that will serve as a framework for aligning future studies of dopamine neuron molecular heterogeneity with anatomical features of the system.
43.Beier KT, Steinberg EE, DeLoach KE, Xie S, Miyamichi K, Schwarz L, Gao XJ, Kremer EJ, Malenka RC, Luo L: Circuit Architecture of VTA Dopamine Neurons Revealed by Systematic Input-Output Mapping. Cell 2015, 162:622–634. [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Lerner TN, Shilyansky C, Davidson TJ, Evans KE, Beier KT, Zalocusky KA, Crow AK, Malenka RC, Luo L, Tomer R, et al. : Intact-Brain Analyses Reveal Distinct Information Carried by SNc Dopamine Subcircuits. Cell 2015, 162:635–647. [DOI] [PMC free article] [PubMed] [Google Scholar]
45.de Jong JW, Afjei SA, Pollak Dorocic I, Peck JR, Liu C, Kim CK, Tian L, Deisseroth K, Lammel S: A Neural Circuit Mechanism for Encoding Aversive Stimuli in the Mesolimbic Dopamine System. Neuron 2019, 101:133–151.e7. [DOI] [PMC free article] [PubMed] [Google Scholar]
*46.Wu J, Kung J, Dong J, Chang L, Xie C, Habib A, Hawes S, Yang N, Chen V, Liu Z, et al. : Distinct Connectivity and Functionality of Aldehyde Dehydrogenase 1a1-Positive Nigrostriatal Dopaminergic Neurons in Motor Learning. Cell Reports 2019, 28:1167–1181.e7. [DOI] [PMC free article] [PubMed] [Google Scholar]; This study is one of the first designed to determine the behavioral significance of a molecularly-defined subset of midbrain dopamine neurons, namely Aldh1a1+ SNc dopamine neurons. Aldh1a1+ SNc dopamine neurons are particularly vulnerable in Parkinson’s disease, and project primarily to the dorsal striatum. Interestingly, when Aldh1a1+ SNc dopamine neurons are ablated with caspase-3, the authors find only minimal gross motor impairments, but a profound change motor skill acquisition. The reasons for the selectivity of this effect in comparison to more general dopamine lesions (e.g. using 6-OHDA) remain unclear.
*47.Heymann G, Jo YS, Reichard KL, McFarland N, Chavkin C, Palmiter RD, Soden ME, Zweifel LS: Synergy of Distinct Dopamine Projection Populations in Behavioral Reinforcement. Neuron 2020, 105:909–920.e5. [DOI] [PMC free article] [PubMed] [Google Scholar]; This study is of particular interest due to the authors’ definitions of dopamine neuron subtypes by their peptidergic markers. Previous studies of molecularly-defined dopamine neurons have included Cck as a genetic marker, but not Crhr1. This study demonstrated that peptide-defined populations of dopamine neurons have distinct projections and roles in behavioral reinforcement. Considering the variety of peptidergic inputs to midbrain dopamine neurons, this study inspires others to elucidate how the identities of peptide-defined dopamine neurons intersect with molecular phenotypes, as defined by gene expression analyses, and further probe the behavioral contributions of peptide-defined dopamine neurons to reward-related behaviors.
48.Bimpisidis Z, König N, Stagkourakis S, Zell V, Vlcek B, Dumas S, Giros B, Broberger C, Hnasko TS, Wallén-Mackenzie Å: The NeuroD6 Subtype of VTA Neurons Contributes to Psychostimulant Sensitization and Behavioral Reinforcement. eNeuro 2019, 6. [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Mongia S, Yamaguchi T, Liu B, Zhang S, Wang H, Morales M: The Ventral Tegmental Area has calbindin neurons with the capability to co-release glutamate and dopamine into the nucleus accumbens. European Journal of Neuroscience 2019, 50:3968–3984. [DOI] [PMC free article] [PubMed] [Google Scholar]
50.Fortin GM, Ducrot C, Giguère N, Kouwenhoven WM, Bourque M-J, Pacelli C, Varaschin RK, Brill M, Singh S, Wiseman PW, et al. : Segregation of dopamine and glutamate release sites in dopamine neuron axons: regulation by striatal target cells. The FASEB Journal 2018, 33:400–417. [DOI] [PubMed] [Google Scholar]
51.Kim J-I, Ganesan S, Luo SX, Wu Y-W, Park E, Huang EJ, Chen L, Ding JB: Aldehyde dehydrogenase 1a1 mediates a GABA synthesis pathway in midbrain dopaminergic neurons. Science 2015, 350:102–106. [DOI] [PMC free article] [PubMed] [Google Scholar]
52.Silm K, Yang J, Marcott PF, Asensio CS, Eriksen J, Guthrie DA, Newman AH, Ford CP, Edwards RH: Synaptic Vesicle Recycling Pathway Determines Neurotransmitter Content and Release Properties. Neuron 2019, 102:786–800.e5. [DOI] [PMC free article] [PubMed] [Google Scholar]
53.Tritsch NX, Oh W-J, Gu C, Sabatini BL: Midbrain dopamine neurons sustain inhibitory transmission using plasma membrane uptake of GABA, not synthesis. eLife 2014, 3:e01936. [DOI] [PMC free article] [PubMed] [Google Scholar]
54.Tritsch NX, Ding JB, Sabatini BL: Dopaminergic neurons inhibit striatal output through non-canonical release of GABA. Nature 2012, 490:262–266. [DOI] [PMC free article] [PubMed] [Google Scholar]
55.Wang DV, Viereckel T, Zell V, Konradsson-Geuken Å, Broker CJ, Talishinsky A, Yoo JH, Galinato MH, Arvidsson E, Kesner AJ, et al. : Disrupting Glutamate Co-transmission Does Not Affect Acquisition of Conditioned Behavior Reinforced by Dopamine Neuron Activation. Cell Reports 2017, 18:2584–2591. [DOI] [PMC free article] [PubMed] [Google Scholar]
56.Zell V, Steinkellner T, Hollon NG, Warlow SM, Souter E, Faget L, Hunker AC, Jin X, Zweifel LS, Hnasko TS: VTA Glutamate Neuron Activity Drives Positive Reinforcement Absent Dopamine Co-release. Neuron 2020, doi: 10.1016/j.neuron.2020.06.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
57.Patriarchi T, Cho JR, Merten K, Howe MW, Marley A, Xiong W-H, Folk RW, Broussard GJ, Liang R, Jang MJ, et al. : Ultrafast neuronal imaging of dopamine dynamics with designed genetically encoded sensors. Science 2018, 360:eaat4422. [DOI] [PMC free article] [PubMed] [Google Scholar]
58.Patriarchi T, Mohebi A, Sun J, Marley A, Liang R, Dong C, Puhger K, Mizuno GO, Davis CM, Wiltgen B, et al. : An expanded palette of dopamine sensors for multiplex imaging in vivo. Nat Methods 2020, doi: 10.1038/s41592-020-0936-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
59.Sun F, Zeng J, Jing M, Zhou J, Feng J, Owen SF, Luo Y, Li F, Wang H, Yamaguchi T, et al. : A Genetically Encoded Fluorescent Sensor Enables Rapid and Specific Detection of Dopamine in Flies, Fish, and Mice. Cell 2018, 174:481–496.e19. [DOI] [PMC free article] [PubMed] [Google Scholar]
60.Sun F, Zhou J, Dai B, Qian T, Zeng J, Li X, Zhuo Y, Zhang Y, Tan K, Feng J, et al. : New and improved GRAB fluorescent sensors for monitoring dopaminergic activity in vivo. bioRxiv 2020, doi: 10.1101/2020.03.28.013722. [DOI] [PMC free article] [PubMed] [Google Scholar]
61.Lee SJ, Lodder B, Chen Y, Patriarchi T, Tian L, Sabatini BL: Cell-type specific asynchronous modulation of PKA by dopamine during reward based learning. bioRxiv 2019, doi: 10.1101/839035. [DOI] [PMC free article] [PubMed] [Google Scholar]
62.Parker NF, Cameron CM, Taliaferro JP, Lee J, Choi JY, Davidson TJ, Daw ND, Witten IB: Reward and choice encoding in terminals of midbrain dopamine neurons depends on striatal target. Nat Neurosci 2016, 19:845–854. [DOI] [PMC free article] [PubMed] [Google Scholar]
63.Lee RS, Mattar MG, Parker NF, Witten IB, Daw ND: Reward prediction error does not explain movement selectivity in DMS-projecting dopamine neurons. Elife 2019, 8. [DOI] [PMC free article] [PubMed] [Google Scholar]
64.da Silva JA, Tecuapetla F, Paixão V, Costa RM: Dopamine neuron activity before action initiation gates and invigorates future movements. Nature 2018, 554:244–248. [DOI] [PubMed] [Google Scholar]
65.Dodson PD, Dreyer JK, Jennings KA, Syed ECJ, Wade-Martins R, Cragg SJ, Bolam JP, Magill PJ: Representation of spontaneous movement by dopaminergic neurons is cell-type selective and disrupted in parkinsonism. Proc Natl Acad Sci U S A 2016, 113:E2180–E2188. [DOI] [PMC free article] [PubMed] [Google Scholar]
66.Howe MW, Dombeck DA: Rapid signalling in distinct dopaminergic axons during locomotion and reward. Nature 2016, 535:505–510. [DOI] [PMC free article] [PubMed] [Google Scholar]
67.Barter JW, Li S, Lu D, Bartholomew RA, Rossi MA, Shoemaker CT, Salas-Meza D, Gaidis E, Yin HH: Beyond reward prediction errors: the role of dopamine in movement kinematics. Front Integr Neurosci 2015, 9. [DOI] [PMC free article] [PubMed] [Google Scholar]
68.Syed ECJ, Grima LL, Magill PJ, Bogacz R, Brown P, Walton ME: Action initiation shapes mesolimbic dopamine encoding of future rewards. Nat Neurosci 2016, 19:34–36. [DOI] [PMC free article] [PubMed] [Google Scholar]
69.Hamid AA, Pettibone JR, Mabrouk OS, Hetrick VL, Schmidt R, Vander Weele CM, Kennedy RT, Aragona BJ, Berke JD: Mesolimbic dopamine signals the value of work. Nat Neurosci 2016, 19:117–126. [DOI] [PMC free article] [PubMed] [Google Scholar]
*70.Hughes RN, Bakhurin KI, Petter EA, Watson GDR, Kim N, Friedman AD, Yin HH: Ventral Tegmental Dopamine Neurons Control the Impulse Vector during Motivated Behavior. Current Biology 2020, 30:2681–2694.e5. [DOI] [PMC free article] [PubMed] [Google Scholar]; This provocative study challenges the dogma that VTA dopamine neurons encode RPE, arguing instead for movement encoding. The authors use in vivo electrophysiology to record the firing patterns of optogenetically-tagged VTA dopamine neurons in mice. Sensitive measurements of force generation in a head-fixation apparatus reveal subtle head movements as mice collect rewards during a fixed-time interval task. No explicit cues preceded reward delivery in the task, but mice learned to perform anticipatory licking – along with stereotyped movement sequences associated with this licking – based on timing. Three different populations of dopamine neurons were identified based on their correlations with force exerted over time, i.e. the “impulse vector.” (Notably, only one population displayed ramping). Further optogenetic experiments showed that, in highly trained mice, VTA dopamine neuron stimulation could cause forward force generation. Additional work is clearly required to determine whether the hypothesis of impulse vector encoding will hold and/or be integrated with RPE theory. It remains a critical question in the field to better understand when dopamine controls movement and how motor execution and error signals drive reward learning.
*71.Coddington LT, Dudman JT: The timing of action determines reward prediction signals in identified midbrain dopamine neurons. Nat Neurosci 2018, 21:1563–1573. [DOI] [PMC free article] [PubMed] [Google Scholar]; This study used in vivo electrophysiology to record the activity of optogenetically-tagged VTA and SNc dopamine neurons during learning of an auditory trace conditioning task. It is unique and innovative in part because the authors record from mice as they first begin to learn, instead of only examining dopamine activity in well-trained mice. Thus, the paper offers insight into how RPE encoding is slowly created over many trials. The authors find that dopamine responses to rewards and predictive cues emerge independently (not as a gradual transfer from reward to cue), and that reward expectation signals reflect a summation of emergent sensory cue signals and action initiation signals, that latter of which is responsible for the timing of RPE signals. Furthermore, they make important observations about the relationship of dopamine to movement initiation. They find that dopamine signals related to movement initiation are altered by learning and reward context, and that optogenetic stimulation that is calibrated to match physiological dopamine signals is not sufficient produce movement initiation, despite being capable of inducing conditioned place preference.
72.Moss MM, Zatka-Haas P, Harris KD, Carandini M, Lak A: Dopamine axons to dorsal striatum encode contralateral stimuli and actions. bioRxiv 2020, doi: 10.1101/2020.07.16.207316. [DOI] [PMC free article] [PubMed] [Google Scholar]
*73.McDougle SD, Butcher PA, Parvin DE, Mushtaq F, Niv Y, Ivry RB, Taylor JA: Neural Signatures of Prediction Errors in a Decision-Making Task Are Modulated by Action Execution Failures. Current Biology 2019, 29:1606–1613.e5. [DOI] [PMC free article] [PubMed] [Google Scholar]; This human fMRI study proposes an interesting hypothesis regarding the interface between RPE encoding and motor control. The authors used a classical reinforcement learning task in which participants make short reaching movements to indicate choice. The participants were given feedback about whether errors leading to reward omission were associated with wrong choices vs motor execution errors. Although both wrong choices and motor execution errors lead to the same outcome (no reward), participants do not display the same lose-switch behavior. In the fMRI data, ventral striatal RPEs are attenuated following execution errors, providing a potential neurological explanation for the reduction in lose-switch behavior. These results lead to the suggestion that motor feedback influences RPE encoding to allow us to distinguish internal vs external reasons for trial-by-trial variability during reinforcement learning.
74.Coddington LT, Dudman JT: Learning from Action: Reconsidering Movement Signaling in Midbrain Dopamine Neuron Activity. Neuron 2019, 104:63–77. [DOI] [PubMed] [Google Scholar]
75.Howe MW, Tierney PL, Sandberg SG, Phillips PEM, Graybiel AM: Prolonged dopamine signalling in striatum signals proximity and value of distant rewards. Nature 2013, 500:575–579. [DOI] [PMC free article] [PubMed] [Google Scholar]
76.Collins AL, Greenfield VY, Bye JK, Linker KE, Wang AS, Wassum KM: Dynamic mesolimbic dopamine signaling during action sequence learning and expectation violation. Scientific Reports 2016, 6:20231. [DOI] [PMC free article] [PubMed] [Google Scholar]
77.Hamid AA, Frank MJ, Moore CI: Dopamine waves as a mechanism for spatiotemporal credit assignment. bioRxiv 2019, doi: 10.1101/729640. [DOI] [PMC free article] [PubMed] [Google Scholar]
*78.Kim HR, Malik AN, Mikhael JG, Bech P, Tsutsui-Kimura I, Sun F, Zhang Y, Li Y, Watabe-Uchida M, Gershman SJ, et al. : A unified framework for dopamine signals across timescales. bioRxiv 2019, doi: 10.1101/803437. [DOI] [PMC free article] [PubMed] [Google Scholar]; In contrast to Mohebi et al (2019), this pre-print suggests that ramping activity can be detected in the firing patterns of VTA cell bodies, and that such activity is sufficient to explain ramping signals seen in dopamine axons (using calcium sensors) and when measuring dopamine release (using the dopamine sensor GRABDA). Differences between dopamine firing patterns and downstream release are suggested to be limited to minute-level timescales (“tonic” dopamine) and the reasons for the divergence are left to future studies to investigate.
**79.Guru A, Seo C, Post RJ, Kullakanda DS, Schaffer JA, Warden MR: Ramping activity in midbrain dopamine neurons signifies the use of a cognitive map. bioRxiv 2020, doi: 10.1101/2020.05.21.108886. [DOI] [Google Scholar]; While still published only as a pre-print, this careful study of dopamine ramping is an exemplary attempt to systematically examine how different tasks do or not do evoke ramping. The authors perform fiber photometry of VTA dopamine cell bodies as mice perform a number of tasks. They do not observe ramping in a classical Pavlovian task, but they do observe ramps when mice move towards a reward or a reward moves towards the mouse. Although this ramping is robust at first, it fades with training. In contrast, ramping towards reward does not fade when mice are asked to run a fixed distance on a running wheel in the absence of feedback cues indicating progress. The authors therefore suggest that ramps allow the generation of an internal model of progress towards reward important for goal-directed behavior.
80.Fiorillo CD, Tobler PN, Schultz W: Discrete coding of reward probability and uncertainty by dopamine neurons. Science 2003, 299:1898–1902. [DOI] [PubMed] [Google Scholar]
*81.Mohebi A, Pettibone JR, Hamid AA, Wong J-MT, Vinson LT, Patriarchi T, Tian L, Kennedy RT, Berke JD: Dissociable dopamine dynamics for learning and motivation. Nature 2019, 570:65–70. [DOI] [PMC free article] [PubMed] [Google Scholar]; The authors use microdialysis, voltammetry, and dopamine sensor imaging to monitor dopamine release on different timescales. They find that NAc core dopamine release in particular correlates with reward rate on a slow time scale, independent of changes in VTA dopamine cell firing rates. Based on their results, the authors make two provocative suggestions: (1) that tonic firing observed in VTA cell bodies is relatively constant and does not contribute to motivational changes and (2) that local modulation at dopamine terminals is responsible for producing “ramps” that encode motivation. The latter suggestion in particular is one that deserves attention and investigation in future studies – how are axonal regulatory or reuptake mechanisms in dopamine neurons transforming the encoding functions observed at cell bodies?
82.Song MR, Lee SW: Dynamic resource allocation during reinforcement learning accounts for ramping and phasic dopamine activity. Neural Networks 2020, 126:95–107. [DOI] [PubMed] [Google Scholar]
83.Seiler JL, Cosme CV, Sherathiya VN, Bianco JM, Lerner TN: Dopamine Signaling in the Dorsomedial Striatum Promotes Compulsive Behavior. bioRxiv 2020, doi: 10.1101/2020.03.30.016238. [DOI] [PMC free article] [PubMed] [Google Scholar]
84.Mikhael JG, Kim HR, Uchida N, Gershman SJ: Ramping and State Uncertainty in the Dopamine Signal. bioRxiv 2019, doi: 10.1101/805366. [DOI] [Google Scholar]
85.Gershman SJ: Dopamine Ramps Are a Consequence of Reward Prediction Errors. Neural Computation 2013, 26:467–471. [DOI] [PubMed] [Google Scholar]
86.Watabe-Uchida M, Zhu L, Ogawa SK, Vamanrao A, Uchida N: Whole-Brain Mapping of Direct Inputs to Midbrain Dopamine Neurons. Neuron 2012, 74:858–873. [DOI] [PubMed] [Google Scholar]
87.Matsuda W, Furuta T, Nakamura KC, Hioki H, Fujiyama F, Arai R, Kaneko T: Single Nigrostriatal Dopaminergic Neurons Form Widely Spread and Highly Dense Axonal Arborizations in the Neostriatum. J Neurosci 2009, 29:444–453. [DOI] [PMC free article] [PubMed] [Google Scholar]
88.Rice ME, Patel JC, Cragg SJ: Dopamine release in the basal ganglia. Neuroscience 2011, 198:112–137. [DOI] [PMC free article] [PubMed] [Google Scholar]
89.Sulzer D, Cragg SJ, Rice ME: Striatal dopamine neurotransmission: Regulation of release and uptake. Basal Ganglia 2016, 6:123–148. [DOI] [PMC free article] [PubMed] [Google Scholar]
*90.Banerjee A, Lee J, Nemcova P, Liu C, Kaeser PS: Synaptotagmin-1 is the Ca2+ sensor for fast striatal dopamine release. Elife 2020, 9. [DOI] [PMC free article] [PubMed] [Google Scholar]; This molecular study examines the mechanisms regulating dopamine release. They find that synaptotagmin 1 (Syt1) is the presynaptic calcium sensor responsible for fast, synchronous dopamine release, but that another mechanism must be responsible for slower, asynchronous release. The remaining asynchronous release in the absence of Syt1 does appear to contribute substantially to the regulation of ongoing extracellular dopamine levels. The study implies that there are distinct control mechanisms regulating dopamine release on different time scales. Molecular tools may be of use, then, in dissecting the behavioral roles of such distinct forms of release.
91.Liu C, Kaeser PS: Mechanisms and regulation of dopamine release. Current Opinion in Neurobiology 2019, 57:46–53. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R1] 1.Schultz W, Dayan P, Montague PR: A Neural Substrate of Prediction and Reward. Science 1997, 275:1593–1599. [DOI] [PubMed] [Google Scholar]

[R2] 2.Niv Y, Schoenbaum G: Dialogues on prediction errors. Trends in Cognitive Sciences 2008, 12:265–272. [DOI] [PubMed] [Google Scholar]

[R3] 3.Glimcher PW: Understanding dopamine and reinforcement learning: The dopamine reward prediction error hypothesis. PNAS 2011, 108:15647–15654. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R4] 4.Steinberg EE, Keiflin R, Boivin JR, Witten IB, Deisseroth K, Janak PH: A causal link between prediction errors, dopamine neurons and learning. Nat Neurosci 2013, 16:966–973. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R5] **5.Maes EJP, Sharpe MJ, Usypchuk AA, Lozzi M, Chang CY, Gardner MPH, Schoenbaum G, Iordanova MD: Causal evidence supporting the proposal that dopamine transients function as temporal difference prediction errors. Nat Neurosci 2020, 23:176–178. [DOI] [PMC free article] [PubMed] [Google Scholar]; This brief but elegant study is well-designed to distinguish between prediction error and predicted value. The distinction is important and often not made clearly, muddying the interpretation of experimental results. Using inhibitory optogenetics to prevent cue-evoked dopamine signals, Maes et al find that the dopamine neuron activity observed in response to a reward-predictive cue is a prediction error, not a signal about the value of the cue.

[R6] *6.Sharpe MJ, Batchelor HM, Mueller LE, Yun Chang C, Maes EJP, Niv Y, Schoenbaum G: Dopamine transients do not act as model-free prediction errors during associative learning. Nat Commun 2020, 11:106. [DOI] [PMC free article] [PubMed] [Google Scholar]; This study is complementary to the study by Maes et al, showing that dopamine neuron activity does not add value to cues as cue-outcome associations are learned. Three separate experiments using excitatory optogenetics to stimulate dopamine neuron activity confirm that dopamine stimulation supports associative learning about antecedent cues without evidence of a cached value being ascribed to the cue.

[R7] 7.Sharpe MJ, Chang CY, Liu MA, Batchelor HM, Mueller LE, Jones JL, Niv Y, Schoenbaum G: Dopamine transients are sufficient and necessary for acquisition of model-based associations. Nature Neuroscience 2017, 20:735–742. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R8] 8.Bari BA, Grossman CD, Lubin EE, Rajagopalan AE, Cressy JI, Cohen JY: Stable Representations of Decision Variables for Flexible Behavior. Neuron 2019, 103:922–933.e7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R9] 9.Mendoza JA, Lafferty CK, Yang AK, Britt JP: Cue-Evoked Dopamine Neuron Activity Helps Maintain but Does Not Encode Expected Value. Cell Rep 2019, 29:1429–1437.e3. [DOI] [PubMed] [Google Scholar]

[R10] 10.Morrens J, Aydin Ç, Janse van Rensburg A, Esquivelzeta Rabell J, Haesler S: Cue-Evoked Dopamine Promotes Conditioned Responding during Learning. Neuron 2020, 106:142–153.e7. [DOI] [PubMed] [Google Scholar]

[R11] 11.Takahashi YK, Batchelor HM, Liu B, Khanna A, Morales M, Schoenbaum G: Dopamine Neurons Respond to Errors in the Prediction of Sensory Features of Expected Rewards. Neuron 2017, 95:1395–1405.e3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R12] 12.Stalnaker TA, Howard JD, Takahashi YK, Gershman SJ, Kahnt T, Schoenbaum G: Dopamine neuron ensembles signal the content of sensory prediction errors. Elife 2019, 8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R13] 13.Gardner MPH, Schoenbaum G, Gershman SJ: Rethinking dopamine as generalized prediction error. Proc Biol Sci 2018, 285. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R14] *14.Keiflin R, Pribut HJ, Shah NB, Janak PH: Ventral Tegmental Dopamine Neurons Participate in Reward Identity Predictions. Curr Biol 2019, 29:93–103.e3. [DOI] [PMC free article] [PubMed] [Google Scholar]; This study demonstrated an important distinction between VTA and SNc dopamine neurons’ functions. While previous studies had suggested that both VTA and SNc stimulation could support primary reinforcement in self-stimulation paradigms, and Schultz’s original work on RPE included observations from both VTA and SNc dopamine neurons, this study showed that VTA dopamine stimulation can unblock learning, while SNc dopamine stimulation cannot. The result suggests that SNc dopamine neurons do not encode strictly-defined RPE, despite having reward-related responses. This study also serves as an important to reminder that most of studies of dopamine and RPE are really examining a specific subset of anatomically-defined dopamine neurons in the VTA, and that we have not yet fully squared findings of dopamine neuron heterogeneity with RPE theory.

[R15] 15.Kakade S, Dayan P: Dopamine: generalization and bonuses. Neural Networks 2002, 15:549–559. [DOI] [PubMed] [Google Scholar]

[R16] **16.Dabney W, Kurth-Nelson Z, Uchida N, Starkweather CK, Hassabis D, Munos R, Botvinick M: A distributional code for value in dopamine-based reinforcement learning. Nature 2020, 577:671–675. [DOI] [PMC free article] [PubMed] [Google Scholar]; This innovative study shows that RPEs are represented in the VTA as a distributional code, meaning that although individual VTA dopamine neurons fully encode RPE, they each do so slightly differently, making an array of more optimistic and pessimistic predictions. The distributional encoding of RPE helps explains the apparent redundancy of RPE encoding in the system. Artificial intelligence research has shown that the distributional encoding of RPE improves reinforcement learning.

[R17] 17.Eshel N, Bukwich M, Rao V, Hemmelder V, Tian J, Uchida N: Arithmetic and local circuitry underlying dopamine prediction errors. Nature 2015, 525:243–246. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R18] 18.Eshel N, Tian J, Bukwich M, Uchida N: Dopamine neurons share common response function for reward prediction error. Nature Neuroscience 2016, 19:479–486. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R19] 19.Cohen JY, Haesler S, Vong L, Lowell BB, Uchida N: Neuron-type-specific signals for reward and punishment in the ventral tegmental area. Nature 2012, 482:85–88. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R20] 20.Tian J, Huang R, Cohen JY, Osakada F, Kobak D, Machens CK, Callaway EM, Uchida N, Watabe-Uchida M: Distributed and Mixed Information in Monosynaptic Inputs to Dopamine Neurons. Neuron 2016, 91:1374–1389. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R21] 21.Lak A, Nomoto K, Keramati M, Sakagami M, Kepecs A: Midbrain Dopamine Neurons Signal Belief in Choice Accuracy during a Perceptual Decision. Current Biology 2017, 27:821–832. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R22] 22.Babayan BM, Uchida N, Gershman SJ: Belief state representation in the dopamine system. Nat Commun 2018, 9:1891. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R23] 23.Starkweather CK, Babayan BM, Uchida N, Gershman SJ: Dopamine reward prediction errors reflect hidden-state inference across time. Nature Neuroscience 2017, 20:581–589. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R24] 24.Lammel S, Hetzel A, Häckel O, Jones I, Liss B, Roeper J: Unique Properties of Mesoprefrontal Neurons within a Dual Mesocorticolimbic Dopamine System. Neuron 2008, 57:760–773. [DOI] [PubMed] [Google Scholar]

[R25] 25.Lammel S, Ion DI, Roeper J, Malenka RC: Projection-Specific Modulation of Dopamine Neuron Synapses by Aversive and Rewarding Stimuli. Neuron 2011, 70:855–862. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R26] 26.Saunders BT, Richard JM, Margolis EB, Janak PH: Dopamine neurons create Pavlovian conditioned stimuli with circuit-defined motivational properties. Nat Neurosci 2018, 21:1072–1083. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R27] 27.Rossi MA, Sukharnikova T, Hayrapetyan VY, Yang L, Yin HH: Operant self-stimulation of dopamine neurons in the substantia nigra. PLoS ONE 2013, 8:e65799. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R28] 28.Ilango A, Kesner AJ, Keller KL, Stuber GD, Bonci A, Ikemoto S: Similar Roles of Substantia Nigra and Ventral Tegmental Dopamine Neurons in Reward and Aversion. J Neurosci 2014, 34:817–822. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R29] 29.Menegas W, Bergan JF, Ogawa SK, Isogai Y, Umadevi Venkataraju K, Osten P, Uchida N, Watabe-Uchida M: Dopamine neurons projecting to the posterior striatum form an anatomically distinct subclass. eLife 2015, 4:e10032. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R30] 30.Menegas W, Babayan BM, Uchida N, Watabe-Uchida M: Opposite initialization to novel cues in dopamine signaling in ventral and posterior striatum in mice. Elife 2017, 6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R31] 31.Menegas W, Akiti K, Amo R, Uchida N, Watabe-Uchida M: Dopamine neurons projecting to the posterior striatum reinforce avoidance of threatening stimuli. Nature Neuroscience 2018, 21:1421–1430. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R32] 32.Steinberg EE, Gore F, Heifets BD, Taylor MD, Norville ZC, Beier KT, Földy C, Lerner TN, Luo L, Deisseroth K, et al. : Amygdala-Midbrain Connections Modulate Appetitive and Aversive Learning. Neuron 2020, 106:1026–1043.e9. [DOI] [PubMed] [Google Scholar]

[R33] 33.Engelhard B, Finkelstein J, Cox J, Fleming W, Jang HJ, Ornelas S, Koay SA, Thiberge SY, Daw ND, Tank DW, et al. : Specialized coding of sensory, motor and cognitive variables in VTA dopamine neurons. Nature 2019, 570:509–513. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R34] 34.Cai LX, Pizano K, Gundersen GW, Hayes CL, Fleming WT, Holt S, Cox JM, Witten IB: Distinct signals in medial and lateral VTA dopamine neurons modulate fear extinction at different times. Elife 2020, 9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R35] 35.Lammel S, Lim BK, Ran C, Huang KW, Betley MJ, Tye KM, Deisseroth K, Malenka RC: Input-specific control of reward and aversion in the ventral tegmental area. Nature 2012, 491:212–217. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R36] 36.Poulin J-F, Zou J, Drouin-Ouellet J, Kim K-YA, Cicchetti F, Awatramani RB: Defining Midbrain Dopaminergic Neuron Diversity by Single-Cell Gene Expression Profiling. Cell Reports 2014, 9:930–943. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R37] 37.Tiklová K, Björklund ÅK, Lahti L, Fiorenzano A, Nolbrant S, Gillberg L, Volakakis N, Yokota C, Hilscher MM, Hauling T, et al. : Single-cell RNA sequencing reveals midbrain dopamine neuron diversity emerging during mouse brain development. Nature Communications 2019, 10:581. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R38] 38.Poulin J-F, Gaertner Z, Moreno-Ramos OA, Awatramani R: Classification of Midbrain Dopamine Neurons Using Single-Cell Gene Expression Profiling Approaches. Trends in Neurosciences 2020, 43:155–169. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R39] 39.Hook PW, McClymont SA, Cannon GH, Law WD, Morton AJ, Goff LA, McCallion AS: Single-Cell RNA-Seq of Mouse Dopaminergic Neurons Informs Candidate Gene Selection for Sporadic Parkinson Disease. The American Journal of Human Genetics 2018, 102:427–446. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R40] 40.La Manno G, Gyllborg D, Codeluppi S, Nishimura K, Salto C, Zeisel A, Borm LE, Stott SRW, Toledo EM, Villaescusa JC, et al. : Molecular Diversity of Midbrain Development in Mouse, Human, and Stem Cells. Cell 2016, 167:566–580.e19. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R41] 41.Saunders A, Macosko EZ, Wysoker A, Goldman M, Krienen FM, de Rivera H, Bien E, Baum M, Bortolin L, Wang S, et al. : Molecular Diversity and Specializations among the Cells of the Adult Mouse Brain. Cell 2018, 174:1015–1030.e16. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R42] **42.Poulin J-F, Caronia G, Hofer C, Cui Q, Helm B, Ramakrishnan C, Chan CS, Dombeck DA, Deisseroth K, Awatramani R: Mapping projections of molecularly defined dopamine neuron subtypes using intersectional genetic approaches. Nat Neurosci 2018, 21:1260–1271. [DOI] [PMC free article] [PubMed] [Google Scholar]; Initial studies by Poulin et al and others have established a landscape of molecular heterogeneity in midbrain dopamine neurons. In this study, Poulin et al. introduce valuable genetic mouse lines for distinguishing between dopamine neuron subpopulations based on their molecular identity. These mouse lines will enable many important future studies of how molecular heterogeneity could contribute to diverse encoding beyond RPE. This study then specifically examined the projection patterns of molecular-defined dopamine neurons, data that will serve as a framework for aligning future studies of dopamine neuron molecular heterogeneity with anatomical features of the system.

[R43] 43.Beier KT, Steinberg EE, DeLoach KE, Xie S, Miyamichi K, Schwarz L, Gao XJ, Kremer EJ, Malenka RC, Luo L: Circuit Architecture of VTA Dopamine Neurons Revealed by Systematic Input-Output Mapping. Cell 2015, 162:622–634. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R44] 44.Lerner TN, Shilyansky C, Davidson TJ, Evans KE, Beier KT, Zalocusky KA, Crow AK, Malenka RC, Luo L, Tomer R, et al. : Intact-Brain Analyses Reveal Distinct Information Carried by SNc Dopamine Subcircuits. Cell 2015, 162:635–647. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R45] 45.de Jong JW, Afjei SA, Pollak Dorocic I, Peck JR, Liu C, Kim CK, Tian L, Deisseroth K, Lammel S: A Neural Circuit Mechanism for Encoding Aversive Stimuli in the Mesolimbic Dopamine System. Neuron 2019, 101:133–151.e7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R46] *46.Wu J, Kung J, Dong J, Chang L, Xie C, Habib A, Hawes S, Yang N, Chen V, Liu Z, et al. : Distinct Connectivity and Functionality of Aldehyde Dehydrogenase 1a1-Positive Nigrostriatal Dopaminergic Neurons in Motor Learning. Cell Reports 2019, 28:1167–1181.e7. [DOI] [PMC free article] [PubMed] [Google Scholar]; This study is one of the first designed to determine the behavioral significance of a molecularly-defined subset of midbrain dopamine neurons, namely Aldh1a1+ SNc dopamine neurons. Aldh1a1+ SNc dopamine neurons are particularly vulnerable in Parkinson’s disease, and project primarily to the dorsal striatum. Interestingly, when Aldh1a1+ SNc dopamine neurons are ablated with caspase-3, the authors find only minimal gross motor impairments, but a profound change motor skill acquisition. The reasons for the selectivity of this effect in comparison to more general dopamine lesions (e.g. using 6-OHDA) remain unclear.

[R47] *47.Heymann G, Jo YS, Reichard KL, McFarland N, Chavkin C, Palmiter RD, Soden ME, Zweifel LS: Synergy of Distinct Dopamine Projection Populations in Behavioral Reinforcement. Neuron 2020, 105:909–920.e5. [DOI] [PMC free article] [PubMed] [Google Scholar]; This study is of particular interest due to the authors’ definitions of dopamine neuron subtypes by their peptidergic markers. Previous studies of molecularly-defined dopamine neurons have included Cck as a genetic marker, but not Crhr1. This study demonstrated that peptide-defined populations of dopamine neurons have distinct projections and roles in behavioral reinforcement. Considering the variety of peptidergic inputs to midbrain dopamine neurons, this study inspires others to elucidate how the identities of peptide-defined dopamine neurons intersect with molecular phenotypes, as defined by gene expression analyses, and further probe the behavioral contributions of peptide-defined dopamine neurons to reward-related behaviors.

[R48] 48.Bimpisidis Z, König N, Stagkourakis S, Zell V, Vlcek B, Dumas S, Giros B, Broberger C, Hnasko TS, Wallén-Mackenzie Å: The NeuroD6 Subtype of VTA Neurons Contributes to Psychostimulant Sensitization and Behavioral Reinforcement. eNeuro 2019, 6. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R49] 49.Mongia S, Yamaguchi T, Liu B, Zhang S, Wang H, Morales M: The Ventral Tegmental Area has calbindin neurons with the capability to co-release glutamate and dopamine into the nucleus accumbens. European Journal of Neuroscience 2019, 50:3968–3984. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R50] 50.Fortin GM, Ducrot C, Giguère N, Kouwenhoven WM, Bourque M-J, Pacelli C, Varaschin RK, Brill M, Singh S, Wiseman PW, et al. : Segregation of dopamine and glutamate release sites in dopamine neuron axons: regulation by striatal target cells. The FASEB Journal 2018, 33:400–417. [DOI] [PubMed] [Google Scholar]

[R51] 51.Kim J-I, Ganesan S, Luo SX, Wu Y-W, Park E, Huang EJ, Chen L, Ding JB: Aldehyde dehydrogenase 1a1 mediates a GABA synthesis pathway in midbrain dopaminergic neurons. Science 2015, 350:102–106. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R52] 52.Silm K, Yang J, Marcott PF, Asensio CS, Eriksen J, Guthrie DA, Newman AH, Ford CP, Edwards RH: Synaptic Vesicle Recycling Pathway Determines Neurotransmitter Content and Release Properties. Neuron 2019, 102:786–800.e5. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R53] 53.Tritsch NX, Oh W-J, Gu C, Sabatini BL: Midbrain dopamine neurons sustain inhibitory transmission using plasma membrane uptake of GABA, not synthesis. eLife 2014, 3:e01936. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R54] 54.Tritsch NX, Ding JB, Sabatini BL: Dopaminergic neurons inhibit striatal output through non-canonical release of GABA. Nature 2012, 490:262–266. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R55] 55.Wang DV, Viereckel T, Zell V, Konradsson-Geuken Å, Broker CJ, Talishinsky A, Yoo JH, Galinato MH, Arvidsson E, Kesner AJ, et al. : Disrupting Glutamate Co-transmission Does Not Affect Acquisition of Conditioned Behavior Reinforced by Dopamine Neuron Activation. Cell Reports 2017, 18:2584–2591. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R56] 56.Zell V, Steinkellner T, Hollon NG, Warlow SM, Souter E, Faget L, Hunker AC, Jin X, Zweifel LS, Hnasko TS: VTA Glutamate Neuron Activity Drives Positive Reinforcement Absent Dopamine Co-release. Neuron 2020, doi: 10.1016/j.neuron.2020.06.011. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R57] 57.Patriarchi T, Cho JR, Merten K, Howe MW, Marley A, Xiong W-H, Folk RW, Broussard GJ, Liang R, Jang MJ, et al. : Ultrafast neuronal imaging of dopamine dynamics with designed genetically encoded sensors. Science 2018, 360:eaat4422. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R58] 58.Patriarchi T, Mohebi A, Sun J, Marley A, Liang R, Dong C, Puhger K, Mizuno GO, Davis CM, Wiltgen B, et al. : An expanded palette of dopamine sensors for multiplex imaging in vivo. Nat Methods 2020, doi: 10.1038/s41592-020-0936-3. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R59] 59.Sun F, Zeng J, Jing M, Zhou J, Feng J, Owen SF, Luo Y, Li F, Wang H, Yamaguchi T, et al. : A Genetically Encoded Fluorescent Sensor Enables Rapid and Specific Detection of Dopamine in Flies, Fish, and Mice. Cell 2018, 174:481–496.e19. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R60] 60.Sun F, Zhou J, Dai B, Qian T, Zeng J, Li X, Zhuo Y, Zhang Y, Tan K, Feng J, et al. : New and improved GRAB fluorescent sensors for monitoring dopaminergic activity in vivo. bioRxiv 2020, doi: 10.1101/2020.03.28.013722. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R61] 61.Lee SJ, Lodder B, Chen Y, Patriarchi T, Tian L, Sabatini BL: Cell-type specific asynchronous modulation of PKA by dopamine during reward based learning. bioRxiv 2019, doi: 10.1101/839035. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R62] 62.Parker NF, Cameron CM, Taliaferro JP, Lee J, Choi JY, Davidson TJ, Daw ND, Witten IB: Reward and choice encoding in terminals of midbrain dopamine neurons depends on striatal target. Nat Neurosci 2016, 19:845–854. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R63] 63.Lee RS, Mattar MG, Parker NF, Witten IB, Daw ND: Reward prediction error does not explain movement selectivity in DMS-projecting dopamine neurons. Elife 2019, 8. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R64] 64.da Silva JA, Tecuapetla F, Paixão V, Costa RM: Dopamine neuron activity before action initiation gates and invigorates future movements. Nature 2018, 554:244–248. [DOI] [PubMed] [Google Scholar]

[R65] 65.Dodson PD, Dreyer JK, Jennings KA, Syed ECJ, Wade-Martins R, Cragg SJ, Bolam JP, Magill PJ: Representation of spontaneous movement by dopaminergic neurons is cell-type selective and disrupted in parkinsonism. Proc Natl Acad Sci U S A 2016, 113:E2180–E2188. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R66] 66.Howe MW, Dombeck DA: Rapid signalling in distinct dopaminergic axons during locomotion and reward. Nature 2016, 535:505–510. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R67] 67.Barter JW, Li S, Lu D, Bartholomew RA, Rossi MA, Shoemaker CT, Salas-Meza D, Gaidis E, Yin HH: Beyond reward prediction errors: the role of dopamine in movement kinematics. Front Integr Neurosci 2015, 9. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R68] 68.Syed ECJ, Grima LL, Magill PJ, Bogacz R, Brown P, Walton ME: Action initiation shapes mesolimbic dopamine encoding of future rewards. Nat Neurosci 2016, 19:34–36. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R69] 69.Hamid AA, Pettibone JR, Mabrouk OS, Hetrick VL, Schmidt R, Vander Weele CM, Kennedy RT, Aragona BJ, Berke JD: Mesolimbic dopamine signals the value of work. Nat Neurosci 2016, 19:117–126. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R70] *70.Hughes RN, Bakhurin KI, Petter EA, Watson GDR, Kim N, Friedman AD, Yin HH: Ventral Tegmental Dopamine Neurons Control the Impulse Vector during Motivated Behavior. Current Biology 2020, 30:2681–2694.e5. [DOI] [PMC free article] [PubMed] [Google Scholar]; This provocative study challenges the dogma that VTA dopamine neurons encode RPE, arguing instead for movement encoding. The authors use in vivo electrophysiology to record the firing patterns of optogenetically-tagged VTA dopamine neurons in mice. Sensitive measurements of force generation in a head-fixation apparatus reveal subtle head movements as mice collect rewards during a fixed-time interval task. No explicit cues preceded reward delivery in the task, but mice learned to perform anticipatory licking – along with stereotyped movement sequences associated with this licking – based on timing. Three different populations of dopamine neurons were identified based on their correlations with force exerted over time, i.e. the “impulse vector.” (Notably, only one population displayed ramping). Further optogenetic experiments showed that, in highly trained mice, VTA dopamine neuron stimulation could cause forward force generation. Additional work is clearly required to determine whether the hypothesis of impulse vector encoding will hold and/or be integrated with RPE theory. It remains a critical question in the field to better understand when dopamine controls movement and how motor execution and error signals drive reward learning.

[R71] *71.Coddington LT, Dudman JT: The timing of action determines reward prediction signals in identified midbrain dopamine neurons. Nat Neurosci 2018, 21:1563–1573. [DOI] [PMC free article] [PubMed] [Google Scholar]; This study used in vivo electrophysiology to record the activity of optogenetically-tagged VTA and SNc dopamine neurons during learning of an auditory trace conditioning task. It is unique and innovative in part because the authors record from mice as they first begin to learn, instead of only examining dopamine activity in well-trained mice. Thus, the paper offers insight into how RPE encoding is slowly created over many trials. The authors find that dopamine responses to rewards and predictive cues emerge independently (not as a gradual transfer from reward to cue), and that reward expectation signals reflect a summation of emergent sensory cue signals and action initiation signals, that latter of which is responsible for the timing of RPE signals. Furthermore, they make important observations about the relationship of dopamine to movement initiation. They find that dopamine signals related to movement initiation are altered by learning and reward context, and that optogenetic stimulation that is calibrated to match physiological dopamine signals is not sufficient produce movement initiation, despite being capable of inducing conditioned place preference.

[R72] 72.Moss MM, Zatka-Haas P, Harris KD, Carandini M, Lak A: Dopamine axons to dorsal striatum encode contralateral stimuli and actions. bioRxiv 2020, doi: 10.1101/2020.07.16.207316. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R73] *73.McDougle SD, Butcher PA, Parvin DE, Mushtaq F, Niv Y, Ivry RB, Taylor JA: Neural Signatures of Prediction Errors in a Decision-Making Task Are Modulated by Action Execution Failures. Current Biology 2019, 29:1606–1613.e5. [DOI] [PMC free article] [PubMed] [Google Scholar]; This human fMRI study proposes an interesting hypothesis regarding the interface between RPE encoding and motor control. The authors used a classical reinforcement learning task in which participants make short reaching movements to indicate choice. The participants were given feedback about whether errors leading to reward omission were associated with wrong choices vs motor execution errors. Although both wrong choices and motor execution errors lead to the same outcome (no reward), participants do not display the same lose-switch behavior. In the fMRI data, ventral striatal RPEs are attenuated following execution errors, providing a potential neurological explanation for the reduction in lose-switch behavior. These results lead to the suggestion that motor feedback influences RPE encoding to allow us to distinguish internal vs external reasons for trial-by-trial variability during reinforcement learning.

[R74] 74.Coddington LT, Dudman JT: Learning from Action: Reconsidering Movement Signaling in Midbrain Dopamine Neuron Activity. Neuron 2019, 104:63–77. [DOI] [PubMed] [Google Scholar]

[R75] 75.Howe MW, Tierney PL, Sandberg SG, Phillips PEM, Graybiel AM: Prolonged dopamine signalling in striatum signals proximity and value of distant rewards. Nature 2013, 500:575–579. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R76] 76.Collins AL, Greenfield VY, Bye JK, Linker KE, Wang AS, Wassum KM: Dynamic mesolimbic dopamine signaling during action sequence learning and expectation violation. Scientific Reports 2016, 6:20231. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R77] 77.Hamid AA, Frank MJ, Moore CI: Dopamine waves as a mechanism for spatiotemporal credit assignment. bioRxiv 2019, doi: 10.1101/729640. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R78] *78.Kim HR, Malik AN, Mikhael JG, Bech P, Tsutsui-Kimura I, Sun F, Zhang Y, Li Y, Watabe-Uchida M, Gershman SJ, et al. : A unified framework for dopamine signals across timescales. bioRxiv 2019, doi: 10.1101/803437. [DOI] [PMC free article] [PubMed] [Google Scholar]; In contrast to Mohebi et al (2019), this pre-print suggests that ramping activity can be detected in the firing patterns of VTA cell bodies, and that such activity is sufficient to explain ramping signals seen in dopamine axons (using calcium sensors) and when measuring dopamine release (using the dopamine sensor GRABDA). Differences between dopamine firing patterns and downstream release are suggested to be limited to minute-level timescales (“tonic” dopamine) and the reasons for the divergence are left to future studies to investigate.

[R79] **79.Guru A, Seo C, Post RJ, Kullakanda DS, Schaffer JA, Warden MR: Ramping activity in midbrain dopamine neurons signifies the use of a cognitive map. bioRxiv 2020, doi: 10.1101/2020.05.21.108886. [DOI] [Google Scholar]; While still published only as a pre-print, this careful study of dopamine ramping is an exemplary attempt to systematically examine how different tasks do or not do evoke ramping. The authors perform fiber photometry of VTA dopamine cell bodies as mice perform a number of tasks. They do not observe ramping in a classical Pavlovian task, but they do observe ramps when mice move towards a reward or a reward moves towards the mouse. Although this ramping is robust at first, it fades with training. In contrast, ramping towards reward does not fade when mice are asked to run a fixed distance on a running wheel in the absence of feedback cues indicating progress. The authors therefore suggest that ramps allow the generation of an internal model of progress towards reward important for goal-directed behavior.

[R80] 80.Fiorillo CD, Tobler PN, Schultz W: Discrete coding of reward probability and uncertainty by dopamine neurons. Science 2003, 299:1898–1902. [DOI] [PubMed] [Google Scholar]

[R81] *81.Mohebi A, Pettibone JR, Hamid AA, Wong J-MT, Vinson LT, Patriarchi T, Tian L, Kennedy RT, Berke JD: Dissociable dopamine dynamics for learning and motivation. Nature 2019, 570:65–70. [DOI] [PMC free article] [PubMed] [Google Scholar]; The authors use microdialysis, voltammetry, and dopamine sensor imaging to monitor dopamine release on different timescales. They find that NAc core dopamine release in particular correlates with reward rate on a slow time scale, independent of changes in VTA dopamine cell firing rates. Based on their results, the authors make two provocative suggestions: (1) that tonic firing observed in VTA cell bodies is relatively constant and does not contribute to motivational changes and (2) that local modulation at dopamine terminals is responsible for producing “ramps” that encode motivation. The latter suggestion in particular is one that deserves attention and investigation in future studies – how are axonal regulatory or reuptake mechanisms in dopamine neurons transforming the encoding functions observed at cell bodies?

[R82] 82.Song MR, Lee SW: Dynamic resource allocation during reinforcement learning accounts for ramping and phasic dopamine activity. Neural Networks 2020, 126:95–107. [DOI] [PubMed] [Google Scholar]

[R83] 83.Seiler JL, Cosme CV, Sherathiya VN, Bianco JM, Lerner TN: Dopamine Signaling in the Dorsomedial Striatum Promotes Compulsive Behavior. bioRxiv 2020, doi: 10.1101/2020.03.30.016238. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R84] 84.Mikhael JG, Kim HR, Uchida N, Gershman SJ: Ramping and State Uncertainty in the Dopamine Signal. bioRxiv 2019, doi: 10.1101/805366. [DOI] [Google Scholar]

[R85] 85.Gershman SJ: Dopamine Ramps Are a Consequence of Reward Prediction Errors. Neural Computation 2013, 26:467–471. [DOI] [PubMed] [Google Scholar]

[R86] 86.Watabe-Uchida M, Zhu L, Ogawa SK, Vamanrao A, Uchida N: Whole-Brain Mapping of Direct Inputs to Midbrain Dopamine Neurons. Neuron 2012, 74:858–873. [DOI] [PubMed] [Google Scholar]

[R87] 87.Matsuda W, Furuta T, Nakamura KC, Hioki H, Fujiyama F, Arai R, Kaneko T: Single Nigrostriatal Dopaminergic Neurons Form Widely Spread and Highly Dense Axonal Arborizations in the Neostriatum. J Neurosci 2009, 29:444–453. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R88] 88.Rice ME, Patel JC, Cragg SJ: Dopamine release in the basal ganglia. Neuroscience 2011, 198:112–137. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R89] 89.Sulzer D, Cragg SJ, Rice ME: Striatal dopamine neurotransmission: Regulation of release and uptake. Basal Ganglia 2016, 6:123–148. [DOI] [PMC free article] [PubMed] [Google Scholar]

[R90] *90.Banerjee A, Lee J, Nemcova P, Liu C, Kaeser PS: Synaptotagmin-1 is the Ca2+ sensor for fast striatal dopamine release. Elife 2020, 9. [DOI] [PMC free article] [PubMed] [Google Scholar]; This molecular study examines the mechanisms regulating dopamine release. They find that synaptotagmin 1 (Syt1) is the presynaptic calcium sensor responsible for fast, synchronous dopamine release, but that another mechanism must be responsible for slower, asynchronous release. The remaining asynchronous release in the absence of Syt1 does appear to contribute substantially to the regulation of ongoing extracellular dopamine levels. The study implies that there are distinct control mechanisms regulating dopamine release on different time scales. Molecular tools may be of use, then, in dissecting the behavioral roles of such distinct forms of release.

[R91] 91.Liu C, Kaeser PS: Mechanisms and regulation of dopamine release. Current Opinion in Neurobiology 2019, 57:46–53. [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Dopamine, Updated: Reward Prediction Error and Beyond

Talia N Lerner

Ashley L Holloway

Jillian L Seiler

Abstract

Introduction

Advancements in Understanding Dopamine’s Encoding of Reward Prediction Error

Going Beyond Reward Prediction Error

Diverse Molecular Phenotypes of Midbrain Dopamine Neurons

Dopamine Neurons Use More than Dopamine to Communicate

Dopamine and Movement

Dopamine, Motivation, and Planning: What Are Dopamine Ramps For?

Conclusion

Highlights.

Acknowledgments

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Cite

Add to Collections

PERMALINK

Dopamine, Updated: Reward Prediction Error and Beyond

Talia N Lerner

Ashley L Holloway

Jillian L Seiler

Abstract

Introduction

Advancements in Understanding Dopamine’s Encoding of Reward Prediction Error

Going Beyond Reward Prediction Error

Diverse Molecular Phenotypes of Midbrain Dopamine Neurons

Dopamine Neurons Use More than Dopamine to Communicate

Dopamine and Movement

Dopamine, Motivation, and Planning: What Are Dopamine Ramps For?

Conclusion

Highlights.

Acknowledgments

Footnotes

References

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases