Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2022 May 20;119(21):e2117270119. doi: 10.1073/pnas.2117270119

A unidirectional but not uniform striatal landscape of dopamine signaling for motivational stimuli

Wouter van Elzelingen a,b,1, Jessica Goedhoop a,b,1, Pascal Warnaar a,b, Damiaan Denys a,b, Tara Arbab a,b, Ingo Willuhn a,b,2
PMCID: PMC9171911  PMID: 35594399

Significance

Although it is undisputed that striatal dopamine plays a prominent role in motivated behavior and learning, the precise information conveyed by dopamine signals as such is under active debate. For a long time, the idea dominated that dopamine encodes a reward prediction error and that this signal is broadcast uniformly throughout the brain. However, here, we capture dopamine dynamics across many striatal regions and demonstrate that dopamine release is, regionally, extremely heterogeneous and that a reward prediction error–like signal is predominantly found in the relatively small limbic domain of the striatum. Another striking organizing principle is that stimulus valence directs dopamine concentration homogeneously across all regions (i.e., appetitive stimuli increase dopamine and aversive stimuli decrease dopamine).

Keywords: dopamine, striatum, motivation, Pavlovian conditioning, behavior

Abstract

Dopamine signals in the striatum are critical for motivated behavior. However, their regional specificity and precise information content are actively debated. Dopaminergic projections to the striatum are topographically organized. Thus, we quantified dopamine release in response to motivational stimuli and associated predictive cues in six principal striatal regions of unrestrained, behaving rats. Absolute signal size and its modulation by stimulus value and by subjective state of the animal were interregionally heterogeneous on a medial to lateral gradient. In contrast, dopamine-concentration direction of change was homogeneous across all regions: appetitive stimuli increased and aversive stimuli decreased dopamine concentration. Although cues predictive of such motivational stimuli acquired the same influence over dopamine homogeneously across all regions, dopamine-mediated prediction-error signals were restricted to the ventromedial, limbic striatum. Together, our findings demonstrate a nuanced striatal landscape of unidirectional but not uniform dopamine signals, topographically encoding distinct aspects of motivational stimuli and their prediction.


Dopamine neurotransmission is pivotal for neuronal processing of, and behavioral responding to, appetitive and aversive stimuli (13). The largest releasable pool of dopamine is found in the striatum, a large brain nucleus that is the main input structure of the basal ganglia and the primary projection target of midbrain dopamine neurons (4, 5). The striatum crucially mediates dopamine’s role in motivated behavior (6, 7) and, vice versa, the function of the striatum is critically dependent on dopamine innervation (8, 9). Delineated by topographically organized afferents from cortex, thalamus, and the dopaminergic midbrain (substantia nigra pars compacta and ventral tegmental area), the striatum can be divided into smaller, functionally distinct domains: the ventromedial striatum (VMS; which consists of the nucleus accumbens core [NAC] and nucleus accumbens shell [NAS]), the dorsomedial striatum (DMS), the dorsolateral striatum (DLS), the ventrolateral striatum (VLS), and the tail of striatum (TS). Inputs from cortex, thalamus, and midbrain distribute across the entire striatum, whereas afferents from amygdala and hippocampus are more regionally restricted (1015) (Fig. 1A). The striatal regions researched most over the past decades are the NAC and NAS, followed by the DMS and DLS; both of these regional pairs are assumed to support dichotomous or even antagonistic functions (16, 17). Other regions, such as the VLS and the TS, have moved into focus more recently (1820).

Fig. 1.

Fig. 1.

Targeting dopamine release in six topographically defined domains of the striatum. (A) Schematic depicting the rodent striatum and its main topographic afferents from the cortex on a series of coronal sections. Dopaminergic inputs to the striatum are indicated in color, delineating the six target regions of the striatum in which dopamine signals were recorded. (B) Representative electrolytic lesions circled on Cresyl violet–stained coronal brain sections for histologic verification of electrode-tip placement. From right to left: NAS (purple circles), NAC (blue circles), DLS (red circles), VLS (orange circles), DMS (green circles), and TS (pink circles), and corresponding electrode tracks (arrows). (C) Summary schematic of all dopamine recording sites in NAS (purple circles), NAC (blue circles), DLS (red circles), VLS (orange circles), DMS (green circles), and TS (pink circles). Coronal sections range from +2.04 mm to 0.0 mm, and −1.80 to −2.40 mm anterior to bregma (88).

Although it is undisputed that striatal dopamine plays a prominent role in motivated behavior and reward learning, the precise information conveyed by dopamine signals is under active debate (21). For example, dopamine-neuron activity has been shown to relate to movement (2229), incentive salience (6), reward value (21, 30), reward prediction, and a so-called temporal-difference reward-prediction error (RPE) (3136). An important question is how the dopamine system integrates the aforementioned (and other) functions in its output, or whether any of these functions dominate dopamine-release dynamics. Relatedly, given the existence of the striatal regions we just described, another prominent question is whether dopamine signals broadcasted to these striatal subregions are uniform or diverse; in other words, does each region receive the same dopamine signals or are there region-specific differences? For instance, for a long time, the idea dominated that the RPE encoded in midbrain dopamine-neuron firing [signaling the difference between expected and obtained reward (3135)] is broadcast uniformly throughout the striatum (29, 3739). However, since more recent findings demonstrate that dopamine release is regionally heterogeneous (19, 22, 4050), one emerging view is that different striatal regions receive distinct dopamine signals, and that discrepant findings and the varying functions assigned to dopamine may be explained by this variance of signal location within the striatum (45). The answers to these two questions would help explain the breadth and regional specificity of neural computations performed in the striatum and basal-ganglia function as a whole.

An additional question is how the dopamine system is implicated in the processing of and responding to aversive stimuli. Fewer studies have been conducted on this topic, and the outcomes are less consistent than for appetitive stimuli. For example, whether dopamine activity increases or decreases in response to aversive stimuli is still debated and may depend on several factors, such as the striatal subregion sampled, the type of stimulus, and the subject’s ability to escape or avoid it (19, 20, 46, 5059). Reports that the dopamine system also encodes an aversive prediction error exist, but results vary widely, ranging from no APE encoding (60), to partial encoding (59), to full encoding (3).

Thus, we set out to systematically explore which information is contained in dopamine signals related to prediction error–based reinforcement learning (i.e., stimulus valence [reward and aversion] and value [magnitude], motivational state, and outcome prediction), and how this maps onto the striatum topographically. Most previous studies have sampled from a single mesostriatal connected network and, thus, comparisons across literature are often challenged by inconsistent experimental conditions (e.g., species, stimulus identity, study design, behavioral paradigm, anatomic targeting, dopamine-sensing method). To warrant consistent conditions throughout, we characterized dopamine dynamics in all six of the aforementioned striatal domains (i.e., VMS consisting of NAS and NAC, DMS, DLS, VLS, and TS) of unrestricted, behaving rats using chronically implanted microelectrodes for fast-scan cyclic voltammetry (FSCV) with subsecond detection of dopamine. Using FSCV to monitor terminal release of dopamine provides several advantages over inferring dopamine release based on dopamine-neuron activity for this type of topographical study: First, dopamine cell bodies in the midbrain are tightly packed and their identity and projection targets require additional verification. Second, some midbrain dopamine neurons co-release other neurotransmitters, rendering the interpretation of effects of dopamine cell-body activity more difficult. Third, activity at the level of dopamine-neuron cell bodies, and even on the axonal level, does not necessarily translate to dopamine release in their projection targets (61), as axonal-terminal release undergoes local modulation in the striatum (62). Fourth, the effects of interindividual differences can be minimized by monitoring multiple regions simultaneously in the same animal. Fifth, FSCV electrodes are very small and, thus, enable precise anatomic targeting combined with minimal disruption of brain tissue (63).

Since the sampled striatal regions have been assigned different behavioral functions, we hypothesized that local dopamine signals must exhibit regional differences. Our findings provide substantiation for similarities as well as differences in striatal dopamine signaling across the VMS, DMS, DLS, VLS, and TS, where the anatomic organization of striatal dopaminergic innervation determines a variety of dopamine-signal properties, including absolute size, value and subjective-state coding, valence prediction and valence-prediction error, while the animals’ movement was not encoded in any region.

Results

We characterized the regional release of dopamine throughout rat striatum in response to appetitive and aversive stimuli and their predictors, using chronically implanted FSCV microelectrodes to record dopamine concentration with subsecond resolution in NAS, NAC, DMS, DLS, VLS, and TS. FSCV-electrode placement was verified postmortem via electrolytic lesion (Fig. 1 B and C). As a control for recording quality, delivery of an unpredicted food pellet prior to the start of the behavioral experiment provoked reliable and stable increases of extracellular dopamine concentration in all sampled striatal regions (SI Appendix, Fig. S1A) and, conversely, unpredicted exposure to aversive white noise (WN) reliably decreased extracellular concentrations of dopamine in all regions (SI Appendix, Fig. S1B). We explored dissimilarities in dopamine signaling between striatal regions utilizing several behavioral paradigms designed to identify how motivational stimuli and their predictors are reflected across them.

Dopamine Encodes Subjective Reward Value Region Specifically: Dose–Response Experiment.

Using a probabilistic reward dose–response paradigm (Fig. 2A) in which food pellet rewards of varying magnitude (one, three, or nine pellets) were delivered at random time intervals (unpredicted) to freely moving rats (n = 39), we characterized real-time dopamine responses to determine the precise reward-related information conveyed by striatal dopamine signals and their regional specificity. Absolute changes in extracellular dopamine concentration between regions (Fig. 2B) as well as relative changes within regions (Fig. 2C) varied substantially: Reward itself is reflected homogeneously throughout striatum, as food-reward delivery significantly increased dopamine release (compared with respective pre-reward baseline dopamine concentrations) in all striatal regions (NAS: n = 10, Z = −2.803, P = 0.005; NAC: n = 7, Z = −2.366, P = 0.018; DMS: n = 14, Z = −3.296, P < 0.001; DLS: n = 10, Z = −2.803, P = 0.005; VLS: n = 10, Z = −2.803, P = 0.005; TS: n = 9, Z = −2.666, P = 0.008; Fig. 2D). Reward magnitude, however, was reflected in NAS, NAC, DMS, and DLS, but not in VLS and TS (main effect of reward magnitude: NAS, F(1.083, 9.743) = 12.85, P = 0.005; NAC, F(1.414, 8.485) = 18.98, P = 0.001; DMS, F(1.304, 16.95) = 42.55, P < 0.001; DLS, F(1.933, 17.39) = 3.825, P = 0.043; VLS, F(1.342, 12.08) = 0.6611, P = 0.476; TS, F(1.096, 8.769) = 0.931, P = 0.370; Fig. 2D); the subjective values of one, three, and nine pellets were significantly discernible in NAS, NAC, and DMS, but not in DLS, VMS, and TS (post hoc analysis: NAS: 1 vs. 3, P = 0.035; 1 vs. 9, P = 0.006, 3 vs. 9: P = 0.027. NAC: 1 vs. 3, P = 0.034; 1 vs. 9, P = 0.008; 3 vs. 9, P = 0.015. DMS: 1 vs. 3, P < 0.001; 1 vs. 9, P < 0.001; 3 vs. 9, P < 0.001. DLS: 1 vs. 3, P = 0.108; 1 vs. 9, P = 0.108; 3 vs. 9, P = 0.754. VLS: 1 vs. 3, P = 0.842; 1 vs. 9, P = 0.842; 3 vs. 9, P = 0.265. TS: 1 vs. 3, P = 0.589; 1 vs. 9, P = 0.589; 3 vs. 9, P = 0.589; Fig. 2D). The DMS is frequently subdivided into the anterior DMS (aDMS) and posterior (pDMS) subdivisions (SI Appendix, Fig. S2A) that are thought to serve differing behavioral functions, but dopamine-encoding of reward value in aDMS and pDMS did not differ significantly in our paradigm (one pellet: U(naDMS = 6; npDMS = 8) = 10, Z = −1.807, P = 0.243; three pellets: U(naDMS = 6; npDMS = 8) = 13, Z = −1.420, P = 0.181; nine pellets: U(naDMS = 6; npDMS = 8) = 20, Z = −0.516, P = 0.662; SI Appendix, Fig. S2B). Together, these data show that regional dopamine signals encode reward uniformly throughout the striatum, but that the overall size of dopamine signals and the encoding of reward magnitude are region specific.

Fig. 2.

Fig. 2.

Varying reward magnitudes elicit distinct regional dopamine (DA)-signal profiles. (A) Food-pellet rewards of different magnitudes were delivered into the food magazine (Top) in semirandom order on a variable ITI of 30 s. Rats received a reward of either nine food pellets (15% of trials), three food pellets (15% of trials), or one food pellet (70% of trials) (Bottom). (B) Changes in extracellular dopamine concentration in NAS, NAC, DMS, DLS, VLS, and TS in response to the unpredicted delivery of one, three, or nine food pellets (traces are aligned to reward delivery). (C) Data from B displayed as a percentage of maximum (average) dopamine release. (D) Dopamine release in response to reward was increased in all striatal regions when compared with baseline (B). Dopamine release in NAS, NAC, DMS, and DLS scaled with reward magnitude, but not in VLS and TS. Bar graphs depict (positive) peak concentrations of dopamine averaged across rats for baseline (−2 to 0 s) and reward (0 to 10 s) epochs. +, a main effect of reward magnitude, but without post hoc difference. (B and D) Data are reported as mean + SEM. Comparison to baseline (B): #P < 0.05, ##P < 0.01, ###P < 0.001; Comparison 1 vs. 3 vs. 9: *P < 0.05, **P < 0.01, ***P < 0.001.

Dopamine Reflects Subjective State Region Specifically: Satiety Experiment.

How dopamine signaling is affected by subjective state is debated (64). Thus, we tested whether changes in subjective state (e.g., hunger) alter dopamine-mediated valuation of reward immediately or only after additional experience with the reward. To this end, we evaluated the effects of satiety and hunger as states on (unpredicted) food-induced release of regional dopamine, comparing within-subject dopamine responses in rats between food-restricted and sated states (where rats were given access to a large amount of food pellets immediately prior to the recording session; Fig. 3A). As in the previous experiment (Fig. 2), absolute changes between regions (Fig. 3B) as well as relative changes within regions (Fig. 3C) in extracellular dopamine concentration varied substantially. Food-reward delivery significantly increased dopamine release in all striatal regions compared with respective pre-reward baseline dopamine concentrations (NAS: n = 11, Z = −2.934, P = 0.003; NAC: n = 17, Z = −3.621, P < 0.001; DMS: n = 14, Z = −3.296, P < 0.001; DLS: n = 18, Z = −3.724, P < 0.001; VLS: n = 9, Z = −2.666, P = 0.008; TS: n = 10, Z = −2.395, P = 0.017; Fig. 3D), corroborating our results described in Fig. 2D. Satiety state, however, is only reflected in medial regions of the striatum: dopamine responses in NAS, NAC, and DMS differed significantly between hungry and sated sessions, whereas those in in DLS, VLS, and TS did not (NAS: n = 11, Z = −2.845, P = 0.004; NAC: n = 17, Z = −3.479, P < 0.001; DMS: n = 14, Z = −2.040, P = 0.041; DLS: n = 18, Z = −1.198, P = 0.231; VLS: n = 9, Z = −0.296, P = 0.767; TS: n = 10, Z = −0.663, P = 0.508; Fig. 3D). Again, no significant differences were found between DMS subdivisions (post hoc analysis: hungry: U(naDMS = 6; npDMS = 8) = 29, Z = 0.645, P = 1.000; sated: U(naDMS = 6; npDMS = 8) = 26, Z = 0.258, P = 0.852; SI Appendix, Fig. S2C). Rats were given two single-food-pellet trials in each session to compare dopamine responses between hungry and sated states. Dopamine responses to the first and the second pellet did not differ, except for a minor difference in the NAC (sated state), but hungry and sated responses vastly differed from one another (SI Appendix, Fig. S3). In summary, medial but not lateral regions of the striatum exhibit immediate changes in subjective state.

Fig. 3.

Fig. 3.

Satiety state–dependent regional differences in reward-induced dopamine (DA) release. (A) On two days, rats received unpredicted food pellets separated by a variable ITI. On 1 d, rats were hungry (H; food restricted to 85% of their free-feeding weight). On the other day, rats were sated (S; prefed via 1-h ad libitum access to food pellets immediately prior to testing). (B) Changes in dopamine concentration in NAS, NAC, DMS, DLS, VLS, and TS in response to an unpredicted food pellet in hungry and sated rats (traces are aligned to food-pellet delivery). (C) Data from B are displayed as a percentage of maximum (average) dopamine release. (D) Dopamine release in response to reward was increased in all striatal regions when compared with baseline (B). Dopamine release in NAS, NAC, and DMS was lower in sated than in hungry rats, but not in DLS, VLS, and TS. Bar graphs depict (positive) peak concentrations of dopamine averaged across rats for baseline (−2 to 0 s) and reward (0 to 5 s) epochs. (B and D) Data are reported as mean + SEM. Comparison to baseline (B): #P < 0.05, ##P < 0.01, ###P < 0.001; Comparison H vs S: *P < 0.05, **P < 0.01, ***P < 0.001.

Appetitive and Aversive Pavlovian-Cue Conditioning Produces Region-Specific Dopamine Signatures.

Although dopamine is implicated in signaling aversion (3, 19, 20, 46, 5159), the information content and regional specificity of dopamine signals as they relate to aversive stimuli are even less well understood than those of rewards. Thus, we explored regionally specific encoding of appetitive and aversive outcomes (unconditioned stimuli [US]: respectively food pellets and aversive WN), as well as their discriminative cues (conditioned stimuli [CS]: a light and a tone, predicting delivery of, respectively, the appetitive and aversive US), while recording dopamine transients during probabilistic Pavlovian conditioning (Fig. 4 A and B).

Fig. 4.

Fig. 4.

Dopamine (DA) encoding of appetitive and aversive Pavlovian conditioning is region specific and unrelated to locomotor behavior. (A) Schematic of the Pavlovian task with appetitive and aversive stimuli. After two conditioning sessions on previous days, rats received 30 pairings of cue 1 with food-pellet delivery and 30 pairings of cue 2 with 3-s WN presentation. (B) Trial structure of the probabilistic Pavlovian conditioning task. (C) Locomotion speed during appetitive trials (Top) was decreased following food-pellet delivery or its prediction (cue 1). During aversive trials (Bottom), speed was increased during the presentation WN and its prediction (cue 2). The bar graphs depict average speed for baseline (B −5 to 0 s), 5-s cue (C; 0 to 5 s), or the same period for no-cue conditions, food pellet (5 to 10 s), and 3-s WN (5 to 8 s) epochs. Traces are aligned to food-pellet delivery or WN presentation, or to cue offset when outcome was omitted. Data are reported as mean ± SEM. (D) Regional dopamine release in response to cue-1 presentation (predicting food-pellet delivery) was increased compared with baseline. The bar graph insets depict averaged dopamine concentrations for baseline (−5 to 0 s) and cue (0 to 5 s) epochs. Different proportions of temporal shift from US (pellet) to CS (cue 1) across regions are indicated by black arrows. (E) Dopamine release in NAS, NAC, DLS, VLS, and TS in response to a predicted food pellet was diminished compared with an unpredicted food pellet, and no differences were observed in DMS. The bar graph insets depict averaged dopamine concentrations for unpredicted food-pellet (U; 5 to 20 s) and predicted food-pellet (P; 5 to 20 s) epochs. (F) Dopamine release in NAS and NAC in response to an omitted food pellet was decreased compared with a predicted food pellet, and no differences were observed in DMS, DLS, VLS, and TS. The bar graph insets depict averaged dopamine concentrations for predicted food-pellet (5 to 20 s) and omitted food-pellet (O; 5 to 20 s) epochs. (G) Dopamine release in NAS, NAC, DMS, DLS, and VLS in response to cue-2 presentation (predicting WN) was decreased compared with baseline, but no difference was observed in TS, although WN, itself, did decrease TS dopamine (extra bar graph inset). Bar graph insets depict averaged dopamine concentrations for baseline (−5 to 0 s), cue (0 to 5 s), and WN (5 to 12 s) epochs. Different proportions of temporal shift from US (WN) to CS (cue 2) across regions are indicated by black arrows. (H) Dopamine release in DMS in response to predicted WN was diminished compared with unpredicted WN, and no differences were observed in NAS, NAC, DLS, VLS, and TS. The bar graph insets depict averaged dopamine concentrations for unpredicted WN (5 to 20 s) and predicted WN (5 to 20 s) epochs. (I) Dopamine release in NAS, NAC, and DMS in response to omitted WN was less decreased compared with predicted WN, and no differences were observed in DLS, VLS, and TS. The bar graph insets depict averaged dopamine concentrations for predicted WN (5 to 20 s) and omitted WN (5 to 20 s) epochs. (J) Locomotion speed (black) superimposed on dopamine traces during appetitive trials (Left) and table of correlations between locomotion speed and dopamine concentration during cue-1 presentation and food-pellet delivery during these trials (Right). No correlations were significant. (K) Locomotion speed (black) superimposed on dopamine traces during aversive trials (Left) and table of correlations between locomotion speed and dopamine concentration during cue-2 presentation and WN presentation during these trials (Right). No correlations were significant. (DI) Data are reported as mean + SEM. (J and K) Speed data are reported as mean + SEM. *P < 0.05, **P < 0.01, ***P < 0.001.

Appetitive and aversive US provoke opposing behavioral responses in rat.

Rat behavioral response (derived from DeepLabCut video analysis) to presentation of the appetitive (food pellets) and aversive (WN) US was dichotomous: In appetitive trials, we observed a significant, transient decrease in locomotion speed, whereas in aversive trials locomotion speed was transiently increased [unpredicted pellet: χ2 (2, 48) = 32.042, P < 0.001; unpredicted WN: χ2 (2, 48) = 52.792, P < 0.001; Fig. 4 C, Left Column]. Pavlovian conditioning was successfully induced, as this effect extended to the CS associated with the US [predicted pellet: χ2 (2, 48) = 74.667, P < 0.001; omitted pellet: χ2 (2, 48) = 66.500, P < 0.001; predicted WN: χ2 (2, 48) = 90.125, P < 0.001; omitted WN: χ2 (2, 48) = 24.125, P < 0.001; Fig. 4 C, Middle and Right Columns].

In appetitive conditioning trials, animal speed always decreased during US delivery (post hoc analysis; Fig. 4 C, Top Left: baseline vs. reward, P < 0.001; Fig. 4 C, Top Middle: baseline vs. reward, P < 0.001), and animal speed always decreased following (but not during) appetitive CS presentation regardless of whether US was delivered (post hoc analysis; Fig. 4 C, Top Middle: baseline vs. cue, P = 0.205; baseline vs. reward, P < 0.001) or omitted (post hoc analysis; Fig. 4 C, Top Right: baseline vs. cue, P = 1.000; baseline vs. no reward, P < 0.001), suggesting that the cue prompts animals to slow down and inspect the food magazine for delivered pellets, which are consumed in case of delivery. In contrast, during aversive conditioning trials, US presentation increased animal locomotion speed (post hoc analysis; Fig. 4 C, Bottom Left: baseline vs. WN, P < 0.001; Fig. 4 C, Bottom Middle: baseline vs. WN, P < 0.001), and animal speed always increased both during and following CS presentation regardless of whether US was presented (post hoc analysis; Fig. 4 C, Bottom Middle: baseline vs. cue, P < 0.001; baseline vs. WN, P < 0.001) or omitted (post hoc analysis; Fig. 4 C, Bottom Right: baseline vs. cue, P < 0.001; baseline vs. no WN, P = 0.012), corroborating our previously reported observation that WN induces mild behavioral activation in rats, which may reflect an increased motivation to escape the operant box (59).

Dopamine transients reflect CS prediction of appetitive and aversive US (predicted US).

We mapped out regional differences in outcome-prediction encoding across the striatum by quantifying regional dopamine release in response to cues predicting appetitive and aversive outcomes. Dopamine release significantly increased in all striatal regions in response to the appetitive CS (cue 1: NAS: n = 10, Z = −2.803, P = 0.003; NAC: n = 13, Z = −3.180, P < 0.001; DMS: n = 14, Z = −3.296, P < 0.001; DLS: n = 19, Z = −3.219, P < 0.001; VLS: n = 10, Z = −2.803, P = 0.003; TS: n = 10, Z = −1.988, P = 0.023; Fig. 4D) and significantly decreased in response to the aversive CS (cue 2) in all regions except TS (NAS: n = 10, Z = −2.803, P = 0.003; NAC: n = 13, Z = −3.180, P < 0.001; DMS: n = 14, Z = −3.296, P < 0.001; DLS: n = 21, Z = −3.980, P < 0.001; VLS: n = 12, Z = −2.275, P = 0.011; TS: n = 11, Z = −0.533, P = 0.297; Fig. 4G); TS dopamine did, however, significantly decrease during WN presentation (TS: n = 11, Z = −2.667, P = 0.004; Fig. 4G). Thus, NAS, NAC, DMS, DLS, and VLS dopamine release predict both appetitive and aversive stimuli, whereas in TS, it predicts only appetitive stimuli. By definition, if dopamine acts as an RPE signal, its moment of release should gradually shift from the US to the CS; an incomplete shift implies dopamine signals reward prediction but not its error. We quantified this by calculating the share of overall dopamine increase (CS + US) attributable to the appetitive CS throughout different striatal regions. The CS acquired greatest weight in NAC, VLS, and TS (proportion of dopamine released during cue presentation: NAS, 42%; NAC, 80%; DMS, 59%; DLS, 47%; VLS, 80%; and TS, 89%; Fig. 4 D, see arrows). In contrast, the share of aversive CS was significantly smaller compared with the appetitive CS (n = 6, Z = −2.201, P = 0.028) and more homogenous between regions, acquiring its smallest weight in TS (proportion of dopamine released during cue presentation: NAS, 19%; NAC, 21%; DMS, 18%; DLS, 15%; VLS, 19%; and TS, 6%; Fig. 4 G, arrows). Taken together, the appetitive CS evokes reward-prediction dopamine signals throughout the striatum, but only in a subset of regions is this shift from US to CS (a requirement of RPE signals) substantial. The aversive CS receives only a small percentage of the total dopamine signal, indicating the absence of APE-like signaling throughout the striatum.

Appetitive and aversive CS evoke heterogenous dopamine responses to US prediction (predicted vs. unpredicted US).

By definition of a prediction-error signal, the size of the dopamine signal in response to a US should decrease when predicted by a CS. We evaluated to what extent the CS shaped dopamine responses throughout the striatum by comparing predicted outcomes (CS + US) with unpredicted outcomes (US only, in 8% of trials: 4% appetitive, 4% aversive; Fig. 4B). The appetitive CS reduced US-triggered dopamine-release in NAS, NAC, DLS, VLS, and TS, but not DMS (NAS: n = 10, Z = −2.293, P = 0.011; NAC: n = 13, Z = −3.110, P = 0.002; DMS: n = 14, Z = −0.534, P = 0.297; DLS: n = 19, Z = −2.133, P = 0.033; VLS: n = 10, Z = −2.803, P = 0.005; TS: n = 10, Z = −2.803, P = 0.005; Fig. 4E). In contrast, the aversive CS did not affect dopamine release, except in DMS, where it increased the size of the dopamine response to the US (i.e., the opposite effect of a prediction-error response; NAS: n = 10, Z = −0.866, P = 0.386; NAC: n = 13, Z = −0.314, P = 0.753; DMS: n = 14, Z = −2.354, P = 0.019; DLS: n = 21, Z = −2.033, P = 0.084; VLS: n = 12, Z = 0.000, P = 1.000; TS: n = 11, Z = −1.423, P = 0.155; Fig. 4H). Thus, prediction of appetitive and aversive stimuli impacts dopamine release heterogeneously throughout striatum, where DMS dopamine is the only US signal both not impacted by prediction of reward and, vice versa, the only US signal that is (mildly) impacted by prediction during aversive trials.

Appetitive and aversive CS evoke heterogenous dopamine responses to US prediction error across striatal regions (predicted vs. omitted US).

By definition, prediction-error signals are learning signals generated by a discrepancy between actual and predicted aversive outcomes and come in two varieties: positive and negative. Positive and negative refer to the direction of the discrepancy, not the value of the outcome (65). There is no consensus on the manner in which prediction errors are represented by dopamine transients. We compared dopamine responses with predicted outcomes (CS + US) with those that were omitted (CS only, in 8% of trials: 4% appetitive, 4% aversive; Fig. 4B) throughout the striatum. Dopamine release decreased significantly when a predicted food pellet was omitted (i.e., a worse-than-expected outcome, in NAS and NAC, but not in DMS, DLS, VLS, and TS; NAS: n = 10, Z = 2.497, P = 0.013; NAC: n = 13, Z = 2.551, P = 0.006; DMS: n = 14, Z = −1.601, P = 0.109; DLS: n = 19, Z = 1.006, P = 0.157; VLS: n = 10, Z = 0.357, P = 0.361; TS: n = 10, Z = −0.051, P = 0.480; Fig. 4F). In contrast, dopamine release increased significantly when predicted WN was omitted (i.e., a better-than-expected outcome, in NAS, NAC, and DMS, but not in DLS, VMS, and TS; NAS: n = 10, Z = −2.701, P = 0.007; NAC: n = 13, Z = −2.062, P = 0.039; DMS: n = 14, Z = −2.417, P = 0.016; DLS: n = 21, Z = 0.226, P = 0.411; VLS: n = 12, Z = −1.334, P = 0.182; TS: n = 11, Z = −1.423, P = 0.155; Fig. 4I). These data demonstrate that negative prediction-error coding of dopamine is region specific rather than a striatum-wide uniform signal. Only in NAC and NAS did dopamine release resemble a negative prediction-error signal for both appetitive and aversive stimuli, whereas dopamine in VLS, DLS, and TS did not show any negative prediction-error coding, neither for appetitive nor aversive stimuli (DMS showed a negative APE response). This conclusion was further supported by a post hoc analysis for VLS and TS, intended to increase sensitivity for the fast dopamine fluctuations observed in Fig. 4 F and I by comparing dopamine-response between predicted and omitted stimuli in the 15-s postcue epoch using a second-by-second statistical analysis (SI Appendix, Fig. S4).

A second post hoc analysis revealed no significant differences between aDMS and pDMS dopamine responses [appetitive CS: U(naDMS = 6; npDMS = 8) = 12, Z = −1.549, P = 0.142; unpredicted pellet-delivery: U(naDMS = 6; npDMS = 8) = 15, Z = −1.162, P = 0.282; omitted pellet-delivery: U(naDMS = 6; npDMS = 8) = 11, Z = −1.678, P = 0.108; aversive CS: U(naDMS = 6; npDMS = 8) = 34, Z = 1.291, P = 0.228; unpredicted WN presentation: U(naDMS = 6; npDMS = 8) = 20, Z = −0.516, P = 0.662; omitted WN presentation: U(naDMS = 6; npDMS = 8) = 22, Z = −0.258, P = 0.852; SI Appendix, Fig. S2 D and E], indicating that the reported DMS results apply to both aDMS and pDMS.

Striatal input from the dopaminergic midbrain is topographically organized along the anterior–posterior (A-P) axis and thus, variability in anatomic placement of our electrodes along the A-P axis might possibly introduce sufficient variance in our dopamine recordings to conceal potential prediction errors, such as (most notably) in VLS (Fig. 4I) and TS (Fig. 4 H and I). Thus, we performed a third post hoc analysis to investigate whether variance in these dopamine responses could be explained by electrode placement (SI Appendix, Fig. S5). Exclusion of the most caudal electrodes improved VLS-signal variance, but not for the TS. This indicates that in the TS, dopamine-response variance is not organized along the A-P axis, whereas in the VLS, there is an A-P gradient in representation of aversive stimuli. In either case, although negative data are more difficult to interpret, the variance of VLS and TS data is, in itself, valuable, implying that these regions are less involved in prediction-error signaling than is the VMS.

Locomotion speed does not influence dopamine transients reflecting appetitive and aversive stimuli.

As dopamine-neuron activity often relates to movement, we controlled for the influence of locomotion on CS- and US-triggered dopamine release by correlating speed of locomotion with dopamine concentrations during epochs of cue presentation (0 to 5 s) and outcome delivery (5 to 10 s) during appetitive (SI Appendix, Fig. S6A) and aversive (SI Appendix, Fig. S6B) trials. We found no correlation between dopamine release and animal locomotion speed in any striatal region regardless of trial type (appetitive or aversive) during cue presentation or outcome delivery (Fig. 4 J and K); thus, we may rule out locomotion as a substantial linear contributor to the magnitude and direction of dopamine transients in this study.

Discussion

To reveal behaviorally relevant organizational principles of dopamine neurotransmission in its primary projection target, the striatum, we conducted an investigation that spans its different functional domains. Our findings demonstrate several general organizational principles, some of which are expressed homogeneously throughout the striatum, and others subject to regional heterogeneity (summarized in Tables 1 and 2). The most striking principle is that stimulus valence directs dopamine concentration homogeneously across all regions (i.e., reward increases dopamine and aversive stimuli decrease dopamine). CS, predictive of these US, acquired this effect on dopamine release homogeneously throughout the striatum. Furthermore, dopamine signals are homogeneously unaffected by the animals’ movement; in other words, striatal dopamine-population signaling (volume transmission) is not linked to general locomotion, in contrast to individual dopamine neurons that are known to participate in locomotor function or movement (2325, 66).

Table 1.

Summary of regional functional features of striatal dopamine: Reward

Reward Reward magnitude DA signal size Subjective state (satiety) RP during CS Impact of CS prediction on US (reward) signal size Signal transfer from US to CS, % Positive RPE (bigger reward) Negative RPE (reward omission) Encoding summary
NAS X •••• X X 42 X X Partial RPE
NAC X •••• X X 80 X X Full RPE
DMS X •• X X 59 X Quantitative RP + reward
DLS (x) X 47 RP + reward
VLS •• X 80 RP
TS X 89 RP

Dopamine signals across the six sampled striatal domains encode information related to rewarding stimuli in a multifaceted, region-specific manner, where signals differ qualitatively and quantitatively, yet adhere to unifying organizing principles. Dopamine dynamics in all striatal domains responded to US presentation (food pellet) in a consistent manner, where dopamine increased to the reward, in some regions even relative to reward magnitude and subjective internal state. Dopamine responded to the CS (predictive of the US; reward prediction) in almost all regions, but the impact of and signal transfer to CS (from US) differed substantially between regions, and only some regions exhibited dopamine dynamics that resembled an RPE. DA, dopamine; RP, reward prediction; RPE, reward-prediction error.

Table 2.

Summary of regional functional features of striatal dopamine: Aversive stimulation

Aversive stimulus DA signal size AP during CS Impact of CS prediction on US (aversive) signal size Signal transfer from US to CS, % Negative APE (WN omission) Encoding summary
NAS •• X 19 X AP + aversive
NAC •• X 21 X AP + aversive
DMS •• X 18 X AP + aversive
DLS X 15 AP + aversive
VLS X 19 AP + aversive
TS 6 Aversive

Dopamine signals across the six sampled striatal domains encode information related to aversive stimuli in a relatively consistent manner, where dopamine concentration decreased in response to US presentation (aversive WN). The CS (predictive of the US; aversive-stimulus prediction) decreased dopamine in almost all regions, but with little impact on US-induced dopamine and only limited signal transfer from US to CS. Only some regions responded to WN omission. DA, dopamine; AP, aversive-stimulus prediction; APE, aversive-stimulus prediction error.

Heterogeneous features are topographically expressed in a gradient from medial to lateral striatum, with more pronounced encoding of subjective state and reward magnitude in medial regions. VMS exhibits the greatest dopamine release and scales most linearly to reward magnitude (interval scale), DMS displays qualitatively scaled dopamine (ordinal scale), and the three lateral regions show dopamine responses to reward but are insensitive to reward magnitude (nominal scale). The absolute size of dopamine signals was significantly heterogeneous across the striatum. More specifically, absolute signal magnitude was consistently greatest in VMS, where dopamine concentration increased 2 to 4 times more than in other regions (in response to reward). This is remarkable because the dopamine tissue content does not differ substantially between striatal regions; if at all, slightly less content is found in the VMS (67, 68). Smaller dopamine signals may be due to recruitment of fewer dopamine-releasing axons. Consistently, reward-related signaling has been found more frequently in dopaminergic axons projecting to the ventral striatum than in axons projecting to the dorsal striatum (24). However, our recordings also reveal a marked difference in signal size between VMS and VLS, indicating that reward-related signal size is also stratified on a medial to lateral gradient (in addition to a ventral-to-dorsal axis). Taken together, organizational principles of dopamine signaling are governed by binary valence-encoding and independence from locomotion throughout the striatum, where signal size underlies a “VMS-centric” profile.

We contrasted our investigation of reward processing by the dopamine system with aversive-stimulus processing. We used WN, a well-controllable, aversive, auditory stimulus (69, 70) that provokes mild behavioral activation and depresses NAC dopamine release, as validated in our previous work (59). Consistent with previous studies aimed at the NAC (46, 51, 57, 58), we found that striatal dopamine release encodes rewarding and aversive events with opposing directionality across striatal regions. The degree of aversive-stimulus–induced dopamine decrease manifested similarly to rewards (medial regions show greater signals [dips] compared with lateral regions), albeit with less pronounced regional disparity in signal magnitude. However, despite similarities, dopamine in all regions encodes negative-valence stimuli qualitatively different from positive-valence ones, as we demonstrated previously for the NAC (59), a finding that is inconsistent with the framework of APE signaling. First, aversive-related changes in extracellular dopamine concentration are substantially smaller compared with appetitive-related ones, potentially due to hard-wired differences between dopamine release (signal induction) and dopamine-signal termination: Positive signals have a bigger dynamic range as the number of neurons firing and their firing frequency can be increased substantially (71) compared with negative signals that are limited to neurons ceasing to fire, tied to relatively slow dopamine reuptake and diffusion (72). Second, there is little relative temporal shift in dopamine signal from the aversive US to its predicting CS, a prominent feature of dopamine signals to appetitive stimuli. Third, consistently, signal magnitude is not decreased by stimulus prediction. Fourth, decreased dopamine concentration in response to an aversive stimulus does not immediately return to baseline after stimulus termination, evoking temporally imprecise stimulus encoding. Fifth, besides aversion-induced dopamine dips, we also observed dips below prestimulus baseline subsequent to a reward-induced increase, as reported previously (6, 73), whereby such dips are bigger (relative to positive signal size) and occur more frequently in dorsal and lateral regions of the striatum (VLS, DMS, DLS, TS). Such bipolar responses may accentuate the termination of a reward signal, signify the setting of a new baseline for ambient dopamine concentration (30), or reflect the ceasing of an appetitive event; as our experimental design does not evaluate these postreward epochs, we can merely speculate about the nature of these dips. Taken together, we report that aversive stimuli induce negative dopamine signals that vary in size regionally and mirror positive-reward dopamine signals but encode qualitatively different information than reward signals and do not encode APEs in any of the striatal regions, inconsistent with findings that support the idea that TS dopamine functions as an APE (19, 20).

Dopamine neuron firing (32), dopamine axon activity (39), and dopamine release in the striatum (74, 75) have been found to reflect RPE signals. To qualify as an RPE signal, several criteria need to be satisfied (75, 76), some of which we investigated. Specifically, we tested whether dopamine release encoded reward, reward value, reward prediction, and positive or negative deviation from expected reward value. A strikingly heterogeneous feature was that, although all sampled regions encoded reward and reward prediction and half the regions encoded reward value, pure and robust encoding of a canonical RPE was exclusive to the VMS. Importantly, this VMS RPE signal was shaped by the subjective state of the animal (i.e., prefeeding-induced, sensory-specific satiety [toward the food reward]) and had an immediate effect on dopamine release in the experimental context (i.e., dopamine signals were affected already during the first pellet delivery after prefeeding), instead of a slower, experience-based adjustment (64, 77, 78). Within the VMS, the NAC qualified more strictly as an RPE, as reported by others (79), because more dopamine release transferred from the US to the CS than in NAS; however, behavioral performance had already plateaued before our dopamine measurement, consistent with previous studies that used comparable training durations (6, 80). Notably, a recent study (78) elegantly demonstrated how task uncertainty can alter the shape of NAC-dopamine RPE signals, including residual dopamine signals in response to the reward US (81, 82). However, due to a relatively short CS of 5 s in our task, that was presented with maximal contingency and contiguity during extensive training, we believe our animals experienced, at most, very limited uncertainty (evidenced by the fact that NAC did not exhibit such residual US dopamine signals). In summary, the dopaminergic RPE signal, previously assumed ubiquitous and global, was most prominent in the NAC, underlining the exceptional functional status of this limbic brain hub, thought to translate motivation to action (83) and, thus, revealing substantial regional differences in striatal dopamine signaling.

The directionality of dopamine concentration changes in NAS, DLS, and TS were consistent with one another and with the other regions in that appetitive and aversive stimuli increased and decreased dopamine, respectively, in contrast to reports demonstrating increased dopamine activity after aversive stimuli in these regions (19, 46, 52). Notably, this discrepancy is probably not due to sampling of functionally different striatal domains, since anatomic targeting for these regions largely overlaps between studies (19, 46, 52). Specifically, TS dopamine was decreased by the aversive stimulus but not its prediction (which was not due to insufficient behavioral conditioning, since other regions did encode the cue association), and NAS and DLS dopamine decreased strongly to both the aversive stimulus and its prediction. Similarly, we found categorical regional homogeneity between NAS and NAC, as well as DMS and DLS, where others have reported opposite dopamine signals (48, 52). Perhaps, these DMS and DLS differences are attributable to biased or specialized RPEs (39), as DMS dopamine displays the only regional US signal not impacted by reward prediction and DLS dopamine is not modulated by subjective state. Overall, some of the discrepancies we have described between studies could be explained by differences in animal species, stimulus identity, study design, or dopamine-measuring technique. Alternatively, discrepant findings may be explained by whether experimental conditioning was instrumental or Pavlovian, since recent evidence indicates differential response profiles for the dopamine system to each (84, 85). Furthermore, measuring dopamine axon-terminal activity with calcium indicators versus dopamine release from terminals may capture physiological processes that are not identical, as dopamine-terminal activity is thought to be modulated locally in the striatum (61, 62). Together, discrepant with recent reports, our data points at striatum-wide unidirectionality of dopamine signals, where appetitive and aversive stimuli induce increased and decreased activity of the local dopamine projections, respectively.

Dopamine signals we measured in DMS, a region strongly associated with goal-directed, outcome-dependent behavior, displayed properties more fitting with a region associated with stimulus-driven, outcome-independent behavior (e.g., DLS): The DMS dopamine signal was unaffected by outcome (i.e., reward omission, predicted reward, and unpredicted reward all evoked DMS signals of equal size). This supports our previous attribution of an unsuspected role of DMS dopamine in habitual behavior (86). Intriguingly, DMS dopamine was sensitive to reward value and subjective state, placing it in a transitional position between VMS and the lateral striatal regions. Thus, these findings update actor-critic reinforcement-learning models, where the critic (influenced by motivational state, e.g., hunger) is the ventral striatum (composed of VMS and VLS), and the actor is the dorsal striatum (87), Based on our findings, we may speculate that VMS dopamine signaling functions as critic but that VLS dopamine does not. Thus, we move the line that divides actor and critic functions into ventral and dorsal striatum, instead into VMS and DMS on one hand, and VMS and VLS on the other.

In summary, we demonstrate that dopamine signals across the striatum homogeneously encode stimulus valence and prediction of motivational stimuli (of both positive and negative valence) and are homogeneously unrelated to locomotion and APE. This is in contrast to heterogeneous encoding of reward magnitude, subjective state, and RPE, all of which are VMS centric, where NAC most accurately reflects a formal temporal-difference RPE signal. Thus, considering the anatomic size of the striatum, only a relatively small limbic portion, tightly associated with motivation, is exposed to a dopaminergic, RPE-like, volume-transmission signal (pooling signals of hundreds of different dopamine neurons), whereas dopamine in the remaining regions predominantly tracks both the presence of appetitive and aversive stimuli and their predictors. Some of these findings are strongly inconsistent with the literature on recordings from individual dopamine neurons (32, 42), which requires further investigation; however, a mismatch between activity of individual dopamine neurons and the striatal-dopamine population signal (which is based on release from striatal terminals) may be due to dopamine-terminal modulation that divorces dopamine release from cell-body activity. Regardless, the dopamine population signal, which is what modulates the activity of postsynaptic neurons in the striatum, acts substantially less as an RPE than individual dopamine neurons do. Together, our findings contribute to unraveling the long-standing question of how regional dopamine in the striatum realizes its many functions.

Materials and Methods

Animals.

Adult male Long Evans rats (300 to 380 g; Janvier Labs) were individually housed and kept on a reversed 12-h light/dark cycle (lights off from 08:00 to 20:00) with controlled temperature and humidity. Rats were food restricted to 85% of their free-feeding body weight, and water was provided ad libitum. Rats were fed regular laboratory chow in their home cage 2 h after the end of the daily behavioral training, to supplement food intake during training. All cohorts of rats were trained consistently at the same time of day, between 08:00 and 17:00, during the dark phase. All animal procedures were conducted in accordance with Dutch and European laws and were approved by the Animal Experimentation Committee of the Royal Netherlands Academy of Arts and Sciences. A total of 58 rats had at least one functional and histologically verified recording electrode and were used for dopamine recordings. Subgroups of this total number of rats were used in the different experiments described later in this section.

Stereotactic Surgery.

Stereotaxic surgery was performed as described previously (43, 44). Rats were anesthetized with 1 to 3% isoflurane and placed in a stereotactic frame, analgesic care was delivered as subcutaneous injection of Metacam (0.2 mg meloxicam/100 g). Body temperature was monitored and maintained with a heating pad. The scalp was shaved and disinfected with 70% alcohol, and an incision (treated with lidocaine, 100 mg/mL) exposed the cranium at the midline of the scalp. Subsequently, holes were drilled for three anchor surgical screws, an Ag/AgcCl reference electrode targeting the forebrain, and two custom-made carbon-fiber microelectrodes (63) unilaterally targeting two of the following regions: NAS [1.2 mm rostral, 1.0 mm lateral, and −7.7 mm ventral from bregma (88)], NAC (1.2 mm rostral, 1.5 mm lateral, and −7.1 mm ventral from bregma), DMS (−0.2 mm rostral, 2.5 mm lateral, and −4.5 mm ventral from bregma), DLS (1.2 mm rostral, 3.6 mm lateral, and −4.5 mm ventral from bregma), VLS (0.36 mm rostral, 3.8 mm lateral, and −6.8 mm ventral from bregma), or TS (−2.1 mm rostral, 4.8 mm lateral, and −5.8 mm ventral from bregma). Electrodes were secured and anchored to the surgical screws with dental acrylic cement. Dopamine release was quantified by 11 NAS electrodes, 17 NAC electrodes, 14 DMS electrodes, 21 DLS electrodes, 12 VLS electrodes, and 11 TS electrodes.

Behavioral Procedures.

All behavioral experiments were conducted in modified modular operant chambers (32 × 30 × 29 cm; Med Associates Inc.) equipped with a food magazine with an integrated cue light (connected to an automated food-pellet dispenser), a house light, a tone generator, a WN generator, and a metal grid floor. Each operant chamber was surveilled with a video camera. The boxes were housed in metal Faraday cages that were insulated with sound-absorbing polyurethane foam and ventilated by a fan.

Experiment 1: Dopamine recording during food-pellet dose response.

Rats (n = 39) were placed in an operant chamber (Fig. 2) and tethered to the FSCV recording setup to record fluctuations in extracellular dopamine concentration in response to different magnitudes of (unpredicted) food-pellet rewards (“purified” pellets; Bio.Serv Inc.). During the first three trials of the session, a single food pellet was delivered per trial. Then, rats received a food reward consisting of either one, three, or nine pellets per trial in a semirandom order. The probability of receiving one pellet was 0.7, and for three or nine pellets, the probability was 0.15. Trials were separated by a variable intertrial interval (ITI) of 30 s (range: 20 to 40 s). Pellets were delivered sequentially, one by one, with a time interval of 0.4 s between deliveries. Thus, it took approximately 8 × 0.4 s = 3.2 s in total for nine pellets to be delivered.

To determine the effect of a single food pellet on dopamine, the first three trials were analyzed. For all regions, we compared the (positive) peak dopamine concentration in the first 10 s after pellet delivery (approximate duration of the observed phasic signal) among trials of one, three, and nine pellets. To test whether dopamine increased in response to a reward, the peak concentration in the first 10 s after pellet delivery was compared with the (positive) peak concentration in the 2 s before pellet delivery (baseline).

Experiment 2: Dopamine recording during food-pellet delivery at different subjective states.

On two separate days, rats (n = 51) were placed in an operant chamber and tethered to the FSCV recording setup (Fig. 3) to determine the effect of satiety (subjective state) on regional dopamine release. On 1 d, rats were tested in a food-restricted state. On the other day, rats were given 1-h access to 20 g of food pellets (approximately 450 pellets; ad libitum) prior to the start of the session (pellets were placed in a custom-made metal cup in the operant chamber). Sessions on both days consisted of two trials; during each, a single unpredicted pellet was delivered. Trials were separated by a variable ITI of 30 s (range: 20 to 40 s).

The positive peak concentration of dopamine over 5 s following food-reward delivery was compared between hungry and sated conditions. To determine a regional reward response, the average across conditions was compared with the positive peak concentration over 2 s before the presentation of the food reward (baseline) that was also averaged across both conditions.

Experiment 3: Dopamine recording during a Pavlovian task with appetitive and aversive stimuli.

Rats (n = 50) were conditioned in two sessions to the delivery of food pellets (appetitive outcome) and the presentation of loud WN [90 dB, an intensity we previously showed to be aversive to rats (59); aversive outcome] (Fig. 4). Sessions consisted of 30 pairings (trials) of 5 s of predictive cue-light illumination followed by a food-pellet delivery (delivered immediately after cue offset) and 30 pairings (trials) of 5 s of predictive cue tone (1.5 kHz, 75 dB) followed by a 3-s WN presentation (presented immediately after cue offset). Trials were presented in a semirandom order and separated by a variable ITI of 30 s (range: 20 to 40 s).

In the third session, rats were placed in the operant chamber and tethered to the FSCV recording equipment. The probability for each trial type (appetitive or aversive) was 50%. During most appetitive trials, food-pellet delivery was preceded by a predictive 5-s CS (cue 1; 42% of trials); in probe trials, the predictive 5-s cue 1 was presented alone (4% of trials) or the food pellet was delivered alone ( 4% of trials). During most aversive trials, 3-s WN was preceded by predictive 5-s CS (cue 2; 42% of trials); in probe trials, the predictive 5-s cue 2 was presented alone (4% of trials) or the 3-s WN was presented alone (4% of trials). Probe trials helped identify whether dopamine release was related to reward outcomes (R), reward prediction (RP), or RPE, as well as aversive outcomes (A), aversive prediction (AP), or APE. All trials were presented in a semirandom order and separated by a variable ITI of 30 s (range: 20 to 40 s).

We compared the average dopamine concentration during cue presentation (0 to 5 s; after cue onset) to baseline concentration (−5 to 0 s; before cue onset); for TS, we additionally compared dopamine during the WN epoch (5 to 12 s after cue onset) to baseline. In trials with different contingencies (unpredicted or omitted outcome), we compared dopamine transients during relevant epochs (5 to 20 s after cue onset). To assess regional differences in the temporal dopamine-signal shift toward the earliest predictor of the respective appetitive and aversive stimulus, we determined the relative contribution of the CS to the combined positive deviation from baseline for both CS and US epochs by dividing the positive values of the area under the curve during CS epoch (0 to 5 s) by the positive values of the area under the curve during both CS and US epochs (0 to 20 s; CS plus US combined).

FSCV Measurements and Analysis.

We used FSCV incorporating chronically implanted carbon-fiber microelectrodes (63) to record rapid changes in extracellular dopamine concentration in different striatal domains. Before the start of a recording session, microelectrodes were manually connected to the head-mounted voltammetric amplifier. The amplifier was interfaced through an electrical swivel (Med Associates) with a personal computer (PC)–driven data-acquisition and analysis system (National Instruments) (63).

The voltammetric scans were repeated every 100 ms (sampling rate of 10 Hz). The alternating potential at the carbon-fiber electrode tip was ramped linearly during each voltammetric scan from −0.4 V versus Ag/AgCl to +1.3 V (anodic sweep) and back to −0.4 V (cathodic sweep) at 400 V/s (total scan time of 8.5 ms) and held at −0.4 V between scans. Waveform generation and data acquisition and analysis were carried out on a PC-based system using two peripheral component interconnect multifunction data acquisition cards and software written in LabVIEW (National Instruments). When present at the carbon-fiber surface, dopamine is oxidized to form dopamine-o-quinone during the anodic sweep (peak reaction at approximately +0.7 V), which is reduced back again to dopamine during the cathodic sweep (peak reaction at approximately −0.3 V). The ensuing flux of electrons is measured as current and is directly proportional to the number of molecules that undergo electrolysis. For each scan, the obtained background-subtracted, time-resolved current provides a chemical signature characteristic of the analyte, allowing resolution of dopamine from other substances (89). Dopamine traces were isolated from the voltammetric signal by chemometric analysis using a standard training set, based on electrically stimulated dopamine release detected with chronically implanted electrodes; resulting dopamine concentration was estimated on the basis of the average postimplantation sensitivity of electrodes (63). Individual dopamine traces were smoothed with a moving 10-point median filter prior to analysis of average concentration.

Analysis of Operant-Box Behavior.

DeepLabCut software (90) was used to track rat position in the operant chamber using video data recorded during FSCV measurements. These tracking data were analyzed in MATLAB (version 2019a; The Mathworks, Inc.) to determine speed of movement (cm/s). Analyses were performed using the average traveled distance or locomotion speed during the cue (5 s) or WN (3 s) epochs.

Statistical Analysis.

Data collection and analysis were not performed blind to the conditions of the experiments. Statistical analysis was carried out using Prism (GraphPad Software), SPSS (version 25; IBM Corp.) and MATLAB R2018b (The Mathworks, Inc.). Individual electrochemical signals were averaged across trials within a session (within animals) and then across animals. Statistical significance was set to P < 0.05. P values were adjusted according to the Holm-Bonferroni correction method when multiple comparisons were carried out. FSCV and behavioral data were analyzed using two-tailed paired or unpaired Student’s t tests. Equivalent nonparametric tests were applied when data were not normally distributed. Graphical representations were made using Prism and Matlab. All data are presented as mean plus SEM.

Supplementary Material

Supplementary File

Acknowledgments

We thank Ralph Hamelink and Nicole Yee for their technical support, and Matthijs Feenstra for his input on the manuscript. This research was funded by the European Research Council (Grant ERC-2014-STG 638013 to I.W.) and the Netherlands Organization for Scientific Research (Grants 864.14.010, 2015/06367/ALW and BRAINSCAPES 024.004.012 to I.W.).

Footnotes

The authors declare no competing interest.

This article is a PNAS Direct Submission.

This article contains supporting information online at https://www.pnas.org/lookup/suppl/doi:10.1073/pnas.2117270119/-/DCSupplemental.

Data Availability

Data and code have been deposited in the Open Science Framework (OSF) https://osf.io/zj9rk/ (91).

Change History

May 25, 2022: The SI Appendix has been updated.

References

  • 1.Guitart-Masip M., Duzel E., Dolan R., Dayan P., Action versus valence in decision making. Trends Cogn. Sci. 18, 194–202 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Bromberg-Martin E. S., Matsumoto M., Hikosaka O., Dopamine in motivational control: Rewarding, aversive, and alerting. Neuron 68, 815–834 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Matsumoto H., Tian J., Uchida N., Watabe-Uchida M., Midbrain dopamine neurons signal aversion in a reward-context-dependent manner. eLife 5, e17328 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Björklund A., Dunnett S. B., Dopamine neuron systems in the brain: An update. Trends Neurosci. 30, 194–202 (2007). [DOI] [PubMed] [Google Scholar]
  • 5.Björklund A., Dunnett S. B., Fifty years of dopamine research. Trends Neurosci. 30, 185–187 (2007). [DOI] [PubMed] [Google Scholar]
  • 6.Flagel S. B., et al. , A selective role for dopamine in stimulus-reward learning. Nature 469, 53–57 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Athalye V. R., Carmena J. M., Costa R. M., Neural reinforcement: Re-entering and refining neural dynamics leading to desirable outcomes. Curr. Opin. Neurobiol. 60, 145–154 (2020). [DOI] [PubMed] [Google Scholar]
  • 8.Schwarting R. K., Huston J. P., Unilateral 6-hydroxydopamine lesions of meso-striatal dopamine neurons and their physiological sequelae. Prog. Neurobiol. 49, 215–266 (1996). [DOI] [PubMed] [Google Scholar]
  • 9.Bernheimer H., Birkmayer W., Hornykiewicz O., Jellinger K., Seitelberger F., Brain dopamine and the syndromes of Parkinson and Huntington. Clinical, morphological and neurochemical correlations. J. Neurol. Sci. 20, 415–455 (1973). [DOI] [PubMed] [Google Scholar]
  • 10.Fallon J. H., Moore R. Y., Catecholamine innervation of the basal forebrain. IV. Topography of the dopamine projection to the basal forebrain and neostriatum. J. Comp. Neurol. 180, 545–580 (1978). [DOI] [PubMed] [Google Scholar]
  • 11.Berendse H. W., Galis-de Graaf Y., Groenewegen H. J., Topographical organization and relationship with ventral striatal compartments of prefrontal corticostriatal projections in the rat. J. Comp. Neurol. 316, 314–347 (1992). [DOI] [PubMed] [Google Scholar]
  • 12.Mailly P., Aliane V., Groenewegen H. J., Haber S. N., Deniau J.-M., The rat prefrontostriatal system analyzed in 3D: Evidence for multiple interacting functional units. J. Neurosci. 33, 5718–5727 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Hintiryan H., et al. , The mouse cortico-striatal projectome. Nat. Neurosci. 19, 1100–1114 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Zahm D. S., Brog J. S., On the significance of subterritories in the “accumbens” part of the rat ventral striatum. Neuroscience 50, 751–767 (1992). [DOI] [PubMed] [Google Scholar]
  • 15.Smith Y., Raju D. V., Pare J.-F., Sidibe M., The thalamostriatal system: A highly specific network of the basal ganglia circuitry. Trends Neurosci. 27, 520–527 (2004). [DOI] [PubMed] [Google Scholar]
  • 16.Floresco S. B., The nucleus accumbens: An interface between cognition, emotion, and action. Annu. Rev. Psychol. 66, 25–52 (2015). [DOI] [PubMed] [Google Scholar]
  • 17.Yin H. H., Knowlton B. J., The role of the basal ganglia in habit formation. Nat. Rev. Neurosci. 7, 464–476 (2006). [DOI] [PubMed] [Google Scholar]
  • 18.Natsubori A., et al. , Ventrolateral striatal medium spiny neurons positively regulate food-incentive, goal-directed behavior independently of D1 and D2 selectivity. J. Neurosci. 37, 2723–2733 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Menegas W., Babayan B. M., Uchida N., Watabe-Uchida M., Opposite initialization to novel cues in dopamine signaling in ventral and posterior striatum in mice. eLife 6, e21886 (2017). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Menegas W., Akiti K., Amo R., Uchida N., Watabe-Uchida M., Dopamine neurons projecting to the posterior striatum reinforce avoidance of threatening stimuli. Nat. Neurosci. 21, 1421–1430 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Berke J. D., What does dopamine mean? Nat. Neurosci. 21, 787–793 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Parker N. F., et al. , Reward and choice encoding in terminals of midbrain dopamine neurons depends on striatal target. Nat. Neurosci. 19, 845–854 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Lee R. S., Mattar M. G., Parker N. F., Witten I. B., Daw N. D., Reward prediction error does not explain movement selectivity in DMS-projecting dopamine neurons. eLife 8, e42992 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 24.Howe M. W., Dombeck D. A., Rapid signalling in distinct dopaminergic axons during locomotion and reward. Nature 535, 505–510 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.da Silva J. A., Tecuapetla F., Paixão V., Costa R. M., Dopamine neuron activity before action initiation gates and invigorates future movements. Nature 554, 244–248 (2018). [DOI] [PubMed] [Google Scholar]
  • 26.Jin X., Costa R. M., Start/stop signals emerge in nigrostriatal circuits during sequence learning. Nature 466, 457–462 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Schultz W., Ruffieux A., Aebischer P., The activity of pars compacta neurons of the monkey substantia nigra in relation to motor activation. Exp. Brain Res. 51, 377–387 (1983). [Google Scholar]
  • 28.Syed E. C. J., et al. , Action initiation shapes mesolimbic dopamine encoding of future rewards. Nat. Neurosci. 19, 34–36 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Samejima K., Doya K., Multiple representations of belief states and action values in corticobasal ganglia loops. Ann. N. Y. Acad. Sci. 1104, 213–228 (2007). [DOI] [PubMed] [Google Scholar]
  • 30.Hamid A. A., et al. , Mesolimbic dopamine signals the value of work. Nat. Neurosci. 19, 117–126 (2016). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Montague P. R., Dayan P., Sejnowski T. J., A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J. Neurosci. 16, 1936–1947 (1996). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Schultz W., Dayan P., Montague P. R., A neural substrate of prediction and reward. Science 275, 1593–1599 (1997). [DOI] [PubMed] [Google Scholar]
  • 33.Hart A. S., Rutledge R. B., Glimcher P. W., Phillips P. E. M., Phasic dopamine release in the rat nucleus accumbens symmetrically encodes a reward prediction error term. J. Neurosci. 34, 698–704 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.Bayer H. M., Glimcher P. W., Midbrain dopamine neurons encode a quantitative reward prediction error signal. Neuron 47, 129–141 (2005). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Sutton R. S., Barto A. G., “A temporal-difference model of classical conditioning” in Proceedings of the Ninth Annual Conference of the Cognitive Science Society, Seattle, WA (ISBN 9780805801668; Published by Psychology Press; California Digital Library, 1987), pp. 355–378.
  • 36.Engelhard B., et al. , Specialized coding of sensory, motor and cognitive variables in VTA dopamine neurons. Nature 570, 509–513 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Cox J., Witten I. B., Striatal circuits for reward learning and decision-making. Nat. Rev. Neurosci. 20, 482–494 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Rangel A., Camerer C., Montague P. R., A framework for studying the neurobiology of value-based decision making. Nat. Rev. Neurosci. 9, 545–556 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Tsutsui-Kimura I., et al. , Distinct temporal difference error signals in dopamine axons in three regions of the striatum in a decision-making task. eLife 9, e62390 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Kim H. F., Ghazizadeh A., Hikosaka O., Dopamine neurons encoding long-term memory of object value for habitual behavior. Cell 163, 1165–1175 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Brown H. D., McCutcheon J. E., Cone J. J., Ragozzino M. E., Roitman M. F., Primary food reward and reward-predictive stimuli evoke different patterns of phasic dopamine signaling throughout the striatum. Eur. J. Neurosci. 34, 1997–2006 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Matsumoto M., Hikosaka O., Two types of dopamine neuron distinctly convey positive and negative motivational signals. Nature 459, 837–841 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Willuhn I., Burgeno L. M., Everitt B. J., Phillips P. E. M., Hierarchical recruitment of phasic dopamine signaling in the striatum during the progression of cocaine use. Proc. Natl. Acad. Sci. U.S.A. 109, 20703–20708 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Willuhn I., Burgeno L. M., Groblewski P. A., Phillips P. E. M., Excessive cocaine use results from decreased phasic dopamine signaling in the striatum. Nat. Neurosci. 17, 704–709 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Lammel S., Ion D. I., Roeper J., Malenka R. C., Projection-specific modulation of dopamine neuron synapses by aversive and rewarding stimuli. Neuron 70, 855–862 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.de Jong J. W., et al. , A neural circuit mechanism for encoding aversive stimuli in the mesolimbic dopamine system. Neuron 101, 133–151.e7 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Klanker M., Fellinger L., Feenstra M., Willuhn I., Denys D., Regionally distinct phasic dopamine release patterns in the striatum during reversal learning. Neuroscience 345, 110–123 (2017). [DOI] [PubMed] [Google Scholar]
  • 48.Aragona B. J., et al. , Regional specificity in the real-time development of phasic dopamine transmission patterns during acquisition of a cue-cocaine association in rats. Eur. J. Neurosci. 30, 1889–1899 (2009). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Bassareo V., Musio P., Di Chiara G., Reciprocal responsiveness of nucleus accumbens shell and core dopamine to food- and drug-conditioned stimuli. Psychopharmacology (Berl.) 214, 687–697 (2011). [DOI] [PubMed] [Google Scholar]
  • 50.Yuan L., Dou Y.-N., Sun Y.-G., Topography of reward and aversion encoding in the mesolimbic dopaminergic system. J. Neurosci. 39, 6472–6481 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Badrinarayan A., et al. , Aversive stimuli differentially modulate real-time dopamine transmission dynamics within the nucleus accumbens core and shell. J. Neurosci. 32, 15779–15790 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Lerner T. N., et al. , Intact-brain analyses reveal distinct information carried by SNc dopamine subcircuits. Cell 162, 635–647 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Roitman M. F., Wheeler R. A., Wightman R. M., Carelli R. M., Real-time chemical responses in the nucleus accumbens differentiate rewarding and aversive stimuli. Nat. Neurosci. 11, 1376–1377 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Wheeler R. A., et al. , Cocaine cues drive opposing context-dependent shifts in reward processing and emotional state. Biol. Psychiatry 69, 1067–1074 (2011). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.McCutcheon J. E., Ebner S. R., Loriaux A. L., Roitman M. F., Encoding of aversion by dopamine and the nucleus accumbens. Front. Neurosci. 6, 137 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Twining R. C., et al. , Aversive stimuli drive drug seeking in a state of low dopamine tone. Biol. Psychiatry 77, 895–902 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 57.Oleson E. B., Gentry R. N., Chioma V. C., Cheer J. F., Subsecond dopamine release in the nucleus accumbens predicts conditioned punishment and its successful avoidance. J. Neurosci. 32, 14804–14808 (2012). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 58.Stelly C. E., et al. , Pattern of dopamine signaling during aversive events predicts active avoidance learning. Proc. Natl. Acad. Sci. U.S.A. 116, 13641–13650 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Goedhoop J. N., van den Boom B. J. G., Arbab T., Willuhn I., Nucleus-accumbens dopamine tracks aversive stimulus duration and prediction but not value or prediction error. bioRxiv [Preprint] (2021). 10.1101/2021.01.16.426967. Accessed 10 April 2022. [DOI] [PMC free article] [PubMed]
  • 60.Fiorillo C. D., Two dimensions of value: Dopamine neurons represent reward but not aversiveness. Science 341, 546–549 (2013). [DOI] [PubMed] [Google Scholar]
  • 61.Mohebi A., et al. , Dissociable dopamine dynamics for learning and motivation. Nature 570, 65–70 (2019). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Threlfell S., et al. , Striatal dopamine release is triggered by synchronized activity in cholinergic interneurons. Neuron 75, 58–64 (2012). [DOI] [PubMed] [Google Scholar]
  • 63.Clark J. J., et al. , Chronic microsensors for longitudinal, subsecond dopamine detection in behaving animals. Nat. Methods 7, 126–129 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Gardner M. P. H., Schoenbaum G., Gershman S. J., Rethinking dopamine as generalized prediction error. Proc. Biol. Sci. 285, 20181645 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Wright K. M., DiLeo A., McDannald M. A., Early adversity disrupts the adult use of aversive prediction errors to reduce fear in uncertainty. Front. Behav. Neurosci. 9, 227 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Coddington L. T., Dudman J. T., The timing of action determines reward prediction signals in identified midbrain dopamine neurons. Nat. Neurosci. 21, 1563–1573 (2018). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Fitoussi A., Dellu-Hagedorn F., De Deurwaerdère P., Monoamines tissue content analysis reveals restricted and site-specific correlations in brain regions involved in cognition. Neuroscience 255, 233–245 (2013). [DOI] [PubMed] [Google Scholar]
  • 68.Holdiness M. R., Rosen M. T., Justice J. B., Neill D. B., Gas chromatographic-mass spectrometric determination of dopamine in subregions of rat brain. J. Chromatogr. A 198, 329–336 (1980). [DOI] [PubMed] [Google Scholar]
  • 69.Campbell B. A., Bloom J. M., Relative aversiveness of noise and shock. J. Comp. Physiol. Psychol. 60, 440–442 (1965). [DOI] [PubMed] [Google Scholar]
  • 70.Hughes R. A., Bardo M. T., Shuttlebox avoidance by rats using white noise intensities from 90–120 db SPL as the UCS. J. Aud. Res. 21, 109–118 (1981). [PubMed] [Google Scholar]
  • 71.Grace A. A., Bunney B. S., Nigral dopamine neurons: Intracellular recording and identification with L-dopa injection and histofluorescence. Science 210, 654–656 (1980). [DOI] [PubMed] [Google Scholar]
  • 72.Rice M. E., Cragg S. J., Dopamine spillover after quantal release: Rethinking dopamine transmission in the nigrostriatal pathway. Brain Res. Brain Res. Rev. 58, 303–313 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Hollon N. G., Arnold M. M., Gan J. O., Walton M. E., Phillips P. E. M., Dopamine-associated cached values are not sufficient as the basis for action selection. Proc. Natl. Acad. Sci. U.S.A. 111, 18357–18362 (2014). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Day J. J., Roitman M. F., Wightman R. M., Carelli R. M., Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens. Nat. Neurosci. 10, 1020–1028 (2007). [DOI] [PubMed] [Google Scholar]
  • 75.Hart A. S., Clark J. J., Phillips P. E. M., Dynamic shaping of dopamine signals during probabilistic Pavlovian conditioning. Neurobiol. Learn. Mem. 117, 84–92 (2015). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Rutledge R. B., Dean M., Caplin A., Glimcher P. W., Testing the reward prediction error hypothesis with an axiomatic model. J. Neurosci. 30, 13525–13536 (2010). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Sharpe M. J., et al. , Dopamine transients do not act as model-free prediction errors during associative learning. Nat. Commun. 11, 106 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Mikhael J. G., Kim H. R., Uchida N., Gershman S. J., The role of state uncertainty in the dynamics of dopamine. Curr. Biol. 32, 1077–1087.e9 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Kim H. R., et al. , A unified framework for dopamine signals across timescales. Cell 183, 1600–1616.e25 (2020). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Clark J. J., Collins A. L., Sanford C. A., Phillips P. E. M., Dopamine encoding of Pavlovian incentive stimuli diminishes with extended training. J. Neurosci. 33, 3526–3532 (2013). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Fiorillo C. D., Newsome W. T., Schultz W., The temporal precision of reward prediction in dopamine neurons. Nat. Neurosci. 11, 966–973 (2008). [DOI] [PubMed] [Google Scholar]
  • 82.Kobayashi S., Schultz W., Influence of reward delays on responses of dopamine neurons. J. Neurosci. 28, 7837–7846 (2008). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Mogenson G. J., Jones D. L., Yim C. Y., From motivation to action: Functional interface between the limbic system and the motor system. Prog. Neurobiol. 14, 69–97 (1980). [DOI] [PubMed] [Google Scholar]
  • 84.Hamid A. A., Frank M. J., Moore C. I., Wave-like dopamine dynamics as a mechanism for spatiotemporal credit assignment. Cell 184, 2733–2749.e16 (2021). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Hollon N. G., et al. , Nigrostriatal dopamine signals sequence-specific action-outcome prediction errors. bioRxiv [Preprint] (2021). 10.1101/2021.01.25.428032. Accessed 10 April 2022. [DOI] [PMC free article] [PubMed]
  • 86.van Elzelingen W., et al. , Striatal dopamine signals are region specific and temporally stable across action-sequence habit formation. Curr. Biol. 32, 1163–1174.e6 (2022). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Dayan P., Balleine B. W., Reward, motivation, and reinforcement learning. Neuron 36, 285–298 (2002). [DOI] [PubMed] [Google Scholar]
  • 88.Paxinos G., Watson C., The Rat Brain in Stereotaxic Coordinates (Academic Press, 1998). [DOI] [PubMed] [Google Scholar]
  • 89.Phillips P. E. M., Wightman R. M., Critical guidelines for validation of the selectivity of in-vivo chemical microsensors. Trends Analyt. Chem. 22, 509–514 (2003). [Google Scholar]
  • 90.Mathis A., et al. , DeepLabCut: Markerless pose estimation of user-defined body parts with deep learning. Nat. Neurosci. 21, 1281–1289 (2018). [DOI] [PubMed] [Google Scholar]
  • 91.J. Goedhoop, I. Willuhn, W. van Elzelingen, A unidirectional but not uniform striatal landscape of dopamine signaling for motivational stimuli. Open Science Framework (OSF). https://osf.io/zj9rk/. Deposited 28 December 2021. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary File

Data Availability Statement

Data and code have been deposited in the Open Science Framework (OSF) https://osf.io/zj9rk/ (91).


Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES