Abstract
Multiple signals for reward—hedonic impact, motivation, and learned associative prediction—are funneled through brain mesocorticolimbic circuits involving the nucleus accumbens and ventral pallidum. Here, we show how the hedonic “liking” and motivation “wanting” signals for a sweet reward are distinctly modulated and tracked in this circuit separately from signals for Pavlovian predictions (learning). Animals first learned to associate a fixed sequence of Pavlovian cues with sucrose reward. Subsequent intraaccumbens microinjections of an opioid-stimulating drug increased the hedonic liking impact of sucrose in behavior and firing signals of ventral pallidum neurons, and likewise, they increased incentive salience signals in firing to the reward-proximal incentive cue (but did not alter firing signals to the learned prediction value of a reward-distal cue). Microinjection of a dopamine-stimulating drug instead enhanced only the motivation component but did not alter hedonic impact or learned prediction signals. Different dedicated neuronal subpopulations in the ventral pallidum tracked signal enhancements for hedonic impact vs. incentive salience, and a faster firing pattern also distinguished incentive signals from slower hedonic signals, even for a third overlapping population. These results reveal separate neural representations of wanting, liking, and prediction components of the same reward within the nucleus accumbens to ventral pallidum segment of mesocorticolimbic circuitry.
Keywords: addiction, obesity, limbic, reinforcement, striatum
Pleasant rewards are often predicted and wanted as well as liked, especially when preceded by learned cues. Attraction to food in the refrigerator when hungry, for example, involves learned predictions of tasty treats, motivation to eat, and finally, pleasure enjoyed upon eating. How do neural circuits separately control and track these components of Pavlovian reward as distinct signals? An answer to this question is crucial to unraveling addictions, binge eating, depression, and related motivation or hedonic disorders.
Diverse information about rewards is funneled through pathways from the nucleus accumbens (NAc) to the ventral pallidum (VP) embedded in larger mesocorticolimbic circuitry (1–13). Food, sex, drugs, winning money, and other rewards or predictive cues all activate these pathways in humans and other animals (11, 14–29). Recent evidence indicates a surprising multiplicity of control systems within the NAc and VP, including multiple parallel and segregated loops that carry point to point signals through restricted subregions of prefrontal cortex-NAc-VP-thalamus and back again (7). Modulation by diverse neurochemicals also contributes to mediating separable hedonic and motivation functions (1, 10, 25, 30, 31).
Once thought of mainly as a limbic motor translation zone (13), it is now clear that the VP as well as the NAc controls reward motivation and hedonics (1, 3, 6, 31–34). Both structures contain subregional hotspots where μ-opioid agonist microinjections induce a doubling of the hedonic impact or palatability of a sweet taste of sucrose (35, 36). Hedonic enhancement is reflected in positive patterns of affective orofacial expressions that are homologous between humans and rodents, which can be used to operationalize palatability (referred to in shorthand here as “liking” reactions). Such microinjections simultaneously induce increases in incentive salience or Pavlovian-triggered motivation (here called “wanting” in shorthand) (1, 31, 35–37). A neurochemical distinction has been made between opioid stimulation of the NAc hotspot, which amplifies hedonic impact (liking) as well as motivation (wanting) for reward, and mesolimbic dopamine stimulation, which only amplifies motivation (1, 38–44).
Firing patterns of neurons within the VP encode Pavlovian conditioned stimulus (CS) features, such as prediction and incentive salience, and unconditioned stimulus (UCS) reward features, such as hedonic impact (45, 46). Problematically, however, it has never been clear how NAc-VP circuitry could distinguish various reward signals from each other. Existence of multiple anatomical loops connecting the NAc and VP provides a way to segregate the mixture of reward components, but some NAc and VP subregions and single neurons may also need to represent more than one signal (6, 35, 47). Here, we find that disentangling of signals can still be faithfully achieved. They are revealed by distinct effects of activating NAc neuromodulatory transmitters on different VP reward-related firing patterns in a manner that preserves separate wanting, liking, and Pavlovian prediction information for the same reward.
Results
Serial Separation of Maximal Reward Signals.
Firing of VP neurons was recorded in rats exposed to a learned Pavlovian reward sequence after microinjection in the NAc of vehicle (control condition), [D-Ala2, N-MePhe4, Gly-ol]-enkephalin (DAMGO) (μ-opioid stimulation), or amphetamine (dopamine stimulation; n = 115 neurons, mean = 12.8 neurons recorded per test session, n = 8 rats for neural recording, n = 10 rats for Fos functional localization analysis). In this sequence, an initial CS+1 (auditory tone; 1–5 s) preceded a second CS+2 (different tone; 10–15 s) and finally, a pleasant UCS reward (10-s infusion of sucrose solution into the mouth by oral cannulae; 13–23 s), all in a fixed interval (Fig. 1). The use of a sequence of distinct Pavlovian cues allowed us to temporally separate moments of maximum intensity for each reward component (48, 49). Concerning prediction signals, in information theory (50), the predictive contingency between CS+1, CS+2, and UCS in these time intervals meant that CS+1 presentation potentially added 3.8 log units of information, designated by the term H, to create a nearly 100% certainty (i.e., minimal entropy) that CS+2 and reward UCS would both follow. Given that CS+1 temporarily reduces uncertainty about impending reward to near zero, because it predicts the entire remaining sequence of CS+2 and UCS events, the CS+2 presentation adds little or no further information value or reduction in uncertainty about UCS occurrence (quantitative explanation in Materials and Methods) (48, 50, 51). When CS+1 and CS+2 are separated by several seconds, a gradual rise in incentive salience begins soon after the CS+1 onset, motivating anticipatory behavioral approach (e.g., head entries into a sucrose dish) which reaches maximum levels around the moment of CS+2 presentation and just before the UCS (48, 49). The final event in the sequence, the UCS sucrose infusion (i.e., the reward), carried the strongest hedonic impact, which was confirmed by taste reactivity results (45). Separately from this CS+1 → CS+2 → UCS trial sequence, a distinct CS− tone was randomly interleaved during the session as a control stimulus that predicted nothing and thus, carried little predictive, motivation, or hedonic value. For additional comparison of reward modulation of VP activity during appetitive and consummatory behaviors, the rats were also exposed on the same day to learned traditional Pavlovian trials in which a single CS+ (feeder click; combining predictive and incentive value) predicted sucrose pellet delivery into a dish and evoked approach and consumption.
Tests occurred over 3 d for within-subject comparison of vehicle, DAMGO, and amphetamine microinjection conditions in an order pseudorandomly assigned across rats. On each test day, VP neural activity was recorded after NAc microinjection first to serial CS+ cues presented in extinction (i.e., without sucrose to prevent new learning about UCS), then to sucrose UCS infusions, and finally, to the reinforced single CS+ Pavlovian approach trials (Fig. 1). Last, motivated food consumption was measured in a free intake test of M&M candies in the home cage. The order of testing was designed to avoid timing confounds in comparing across drug conditions by equating the time points for each stimulus after drug injections. For example, CS+1 vs. CS+2 data were always compared first each day (under vehicle, amphetamine, or DAMGO microinjection conditions), UCS data were always compared next, and chocolate consumption was always compared in the final test each day. The order design also isolates the crucial extinction tests of serial CS+1 and CS+2 signals in the very first test and provides data for motivation signals at early (CS+2) and late (single CS+) time points after injections. Previous studies have suggested that DAMGO/amphetamine microinjection effects persist long enough each day to influence all test blocks (52–54), which we show they do here.
NAc Opioids and Dopamine Accentuate VP Signals for CS+ Wanting Without Distorting Prediction Signals.
Two-thirds of recorded VP neurons (67%; 77/115) changed firing after presentation of the CS+1, CS+2, UCS, or single-approach CS+ stimuli in trained rats. These neurons were either integrative, responding to multiple stimuli (47%), or belonged to dedicated subpopulations that fired to only one stimulus (53%).
In the cue-only extinction block of trials, 45% of VP neurons (52/115) fired phasically at the onset of the CS+1 and/or CS+2 cues (23% of these fired to the CS+1, 21% of these fired to the CS+2, and 56% of these fired to both) (Figs. 2 and 4 and Fig. S1). The phasic response consisted of a rapid climb in firing rate to reach a peak within 200 ms of each CS+ tone onset. The peak firing duration was 200–300 ms, at which point firing either rapidly returned to baseline levels (34/52; 65%) or remained slightly elevated above baseline for the duration of the tone cues (18/52; 35%) (examples shown in Fig. 4 and Fig. S1). A few neurons exhibited more highly phasic responses with a rapid firing increase at cue onset that persisted for only 100 ms or less (example in Fig. S1). There were no recorded neurons with only tonic activity changes to these cues.
On control tests after vehicle microinjections, the increase in phasic firing to the CS+1 was greater than any other stimulus: more than the CS+2, more than the UCS (in separate trials), and much more than the control cue, CS−. In other words, with this learned sequence of cues, the CS+1 prediction-related signal dominated VP firing under normal (vehicle) conditions (49) (compare peak amplitudes of CS+1 with CS+2 and CS− under vehicle in Fig. 2 A, B, and D). A computational profile analysis (49) and inspection of individual neurons confirmed the dominance of CS+1-related activity (Fig. S2).
By contrast, when either amphetamine or DAMGO microinjections were made in the NAc before testing to, respectively, increase either dopamine or opioid neurotransmission, VP firing still occurred to the CS+1, but peak dominance switched to the CS+2 (Fig. 2 and Fig. S2). This switch from CS+1 to CS+2 dominance was always caused by a selective elevation of VP firing to the CS+2 onset by >150% compared with vehicle levels (main drug effect = 0–0.5 s, F2,382 = 6.72, P = 0.001; DAMGO/vehicle P < 0.05 and amphetamine/vehicle P < 0.01). The DAMGO- or amphetamine-enhanced CS+2 firing peak was rapid, phasic, and anchored to the CS+2 onset, appearing within the first 200 ms of CS+2 presentation and typically decaying back to vehicle control levels by 400 ms (0.1–0.2 s main drug, F2,382 = 8.41, P < 0.001; DAMGO/vehicle P < 0.05 and amphetamine/vehicle P < 0.05). The amplitudes of CS+1-associated firing peaks were not altered by DAMGO or amphetamine microinjection [no main drug effect for either drug during 0–0.5 s bin: F2,391 = 1.90, not significant (NS); 0.1–0.2 s: F2,391 = 1.421, NS], which indicates that the incentive salience signal was selectively enhanced, whereas the CS+1 prediction signal was unchanged (Fig. 2 and Fig. S2). Firing to the control CS− tone, which predicted nothing, was never enhanced by either drug microinjection and instead, declined below its original low level (main drug effect = 0–0.5 s, F2,181 = 4.74, P = 0.01; DAMGO/vehicle P < 0.05 and amphetamine/vehicle P < 0.05) (Fig. 2).
Crucially, the enhancements of CS+2 firing by NAc stimulation were dynamic and required no new learning about the UCS under drug, because they were detectable even on the five initial extinction presentations of CS+2 in firing rasters of individual neurons and in population activity (population firing above vehicle levels—DAMGO: F1,116 = 3.95, P < 0.05; amphetamine: F1,98 = 22.76, P < 0.001). Also, the CS+2 firing enhancement was not a result of baseline firing changes, which were opposite for the drugs. DAMGO microinjections suppressed, whereas amphetamine microinjections enhanced baseline firing (pre-CS baseline firing drug effect: F2,774 = 19.90, P < 0.001; DAMGO/amphetamine: P < 0.001 and vehicle/amphetamine: P < 0.05) (Fig. S3), although both drugs produced the same enhancement of CS+2-evoked firing.
Enhancement of CS+ Incentive Salience Replicates During Approach Trials and Promotes Consumption.
We next confirmed that the enhanced representation of CS+ incentive salience in VP neuronal activity occurred similarly in the simpler Pavlovian active approach task later in the same day. These trials represented a more naturalistic situation where a single CS+ was paired with delivery of a sugar pellet UCS that rats must actively approach and consume (rather than receive sucrose passively by intraoral infusions). In this paradigm, the moment of the single CS+ sound represents a composite blend of maximal prediction and maximal incentive salience. As one might expect from this conjunction, phasic firing in the VP was enhanced by both DAMGO and amphetamine microinjections in the NAc. Both drugs amplified the ∼250-ms CS+ firing peak to levels 50–100% over normal peak levels in the vehicle condition and up to 200–350% over pre-CS baseline firing (main drug effect: F2,128 = 15.29, P < 0.001; DAMGO/vehicle P < 0.001 and amphetamine/vehicle P < 0.05) (Fig. S4).
We also confirmed, with a behavioral index of wanting (30-min free intake test), that both DAMGO and amphetamine microinjections amplified actual voluntary consumption of chocolate candies (M&Ms). Rats consumed 33% more chocolate on average after they had received either DAMGO or amphetamine than when they received vehicle microinjections (equal to ∼2 g more chocolate after DAMGO or amphetamine microinjection; DAMGO: t = 2.60, P < 0.05; amphetamine: t = 2.91, P < 0.05 without outlier) (Fig. 2).
NAc Opioid Stimulation but Not Dopamine Accentuates Palatability and Hedonic Signals.
Intraoral UCS taste infusions evoked variable patterns of activity from VP neurons, but the most predominant response was a slow rise in spiking above baseline to peak during the first 1.5 s of sucrose infusion. The elevation typically persisted as a moderate and sustained plateau of firing throughout the remainder of the 10-s infusion (Figs. 3 and 4 and Fig. S1). Only DAMGO microinjection in the NAc magnified the elevation in VP firing to the UCS, raising sucrose-elicited firing to a peak that was >50% above firing recorded after vehicle control injections (first 1.5 s of drug main effect: F1,381 = 8.90, P < 0.001; DAMGO/vehicle posthoc P < 0.05 during seconds 1.0–1.5 of UCS infusion) (Figs. 3 and 4 and Figs. S1 and S2). A simultaneous DAMGO-evoked enhancement in behavioral responses was observed as a >30% increase in the number of positive orofacial affective reactions elicited by the taste of sucrose infusion compared with vehicle levels (F1,152 = 35.62, P < 0.001). This behavioral response pattern confirms enhanced sucrose palatability or liking (Fig. 3) with NAc DAMGO treatment. Negative disgust reactions (e.g., gapes) remained very low and unchanged.
In stark contrast, amphetamine microinjections in the NAc never elevated VP firing to the UCS (instead, it produced a trend to suppression; posthoc from above, NS) (Figs. 3 and 4 and Figs. S1 and S2) and likewise, never enhanced positive behavioral orofacial reactions to the palatability of sucrose (F1,154 = 1.27, NS) (Fig. 3). Both negative results indicated that NAc dopamine failed to accentuate either hedonic-related UCS firing activity in the VP or behavioral indices of hedonic impact. Baseline firing rates before the UCS continued to be slightly suppressed by DAMGO and slightly enhanced by amphetamine (baseline of 5 s before UCS infusion; main drug effect: F2,390 = 53.28, P < 0.001; DAMGO/vehicle P < 0.001 and amphetamine/vehicle P < 0.001) (Fig. S3). We note that the failure of amphetamine to enhance UCS firing was not likely because of the higher baseline rate, because (i) the highest baseline under amphetamine (20 spikes/s) never reached heights of stimulus-evoked firing (>60 spikes/s), (ii) even UCS-responsive neurons with low baselines under the amphetamine condition still failed to show enhanced UCS-evoked firing, and (iii) amphetamine baseline rates continued to rise in the single CS+ approach block of trials that were conducted after these UCS trials (below) when both amphetamine and DAMGO increased firing to that single CS+ (Fig. S3).
We assessed hedonic impact further by measuring orofacial reactions to the Pavlovian CS+ cues, which in principle, can acquire conditioned hedonic value as well as incentive value from their paired UCS (55–60). The CS+1 tone occasionally and the CS+2 more frequently elicited a few positive hedonic reactions (on average, 1 reaction per 10 s under vehicle condition; this weak hedonic conditioned response was only 12% of the UCS unconditioned level of 8.3 per 10 s, but tone-elicited reactions were still mildly elevated over near-zero pre-CS baseline levels; CS+2: F1,157 = 8.30, P < 0.01; CS+1: F1,157 = 3.82, P = 0.052) (Fig. S5). DAMGO microinjection in the NAc approximately doubled the number of positive hedonic conditioned reactions such as lip licking (to two reactions per tone) during both the CS+1 and CS+2 above vehicle numbers (CS+2: F1,130 = 4.77, P < 0.05; CS+1: F1,130 = 4.33, P < 0.05). By contrast, amphetamine microinjection never enhanced conditioned orofacial reactions elicited by tone CSs compared with vehicle (CS+2: F1,145 = 1.05, NS; CS+1: F1,145 = 0.67, NS) (Fig. S5) just as it had failed to enhance hedonic reactions elicited by the UCS. This illustrates that the UCS carried much higher hedonic impact than CSs and confirms our hypothesis that DAMGO but not amphetamine in the NAc amplifies hedonic impact of stimuli, whether conditioned or unconditioned.
Distinct VP Ensembles and Firing Signatures Potentially Disentangle Liking vs. Wanting Enhancements.
Can NAc-VP circuits distinctly parse reward components even when multiple ones are enhanced together in the same neurons? How conjoint increases in wanting vs. liking are distinguished cannot be fully resolved by current data, but indications were found both for a segregated subpopulation code that discriminated CS+2 from UCS enhancements by DAMGO and for a temporal pattern code that distinguished CS+2 vs. UCS signals even when both were enhanced in the same neurons (Fig. 4). In a subset of 14 VP neurons that fired to the CS+2 or UCS in the DAMGO condition at levels clearly exceeding vehicle (i.e., those that dominated the above ensemble firing enhancement to those stimuli), eight (57%) fired only to the CS+2 and not to UCS sucrose. Conversely, two other neurons (14%; 2/14) fired above normal vehicle levels to the UCS after DAMGO but not to the CS+2. Thus, most of these stimulus-preferring neurons (10/14; 71%) fired robustly to one stimulus but not at all to the other stimulus.
This population coding mechanism for separating CS+2 from UCS would not apply, however, to a third remaining subpopulation of four neurons (29%) that showed similarly elevated activations above vehicle levels after NAc DAMGO to both the hedonic UCS and the incentive CS+2. The enhanced peak responses to the CS+2, however, were phasic with shorter latencies and shorter durations compared with UCS peak responses, which in contrast, had slower latencies of rise (1 s or more to peak after sucrose entry into the mouth; estimated to occur 0.5 s after pump onset) and more sustained duration of activation during UCS infusion (Fig. 4). Thus, on average, DAMGO elevation of firing above vehicle levels took approximately more than three times longer to manifest after UCS compared with the rapid enhancement after CS+2 onset. Notably, this temporal pattern was shared over the entire population of responsive neurons after DAMGO (compare Fig. 2 with Fig. 3), indicating a potential temporal way of discriminating CS+2 and UCS events in addition to subpopulations.
Stimulus Focus of VP Populations.
A related potent decoding in responsive VP populations was observed in a sharpened stimulus focus after NAc microinjections of DAMGO or amphetamine. Compared with vehicle, both drugs caused a rise in the proportion of responsive neurons that were activated uniquely (that is, responsive to only CS+1, only CS+2, only UCS, or only the single CS+). Specifically, under normal vehicle conditions, 68.4% of VP neurons fired to at least two stimuli in the task, but after dopamine or opioid stimulation of the NAc, there was a significant increase in the proportion of neurons responsive to only one stimulus (each ∼50% of responsive neurons; χ2, P = 0.0001). Alongside this was a consequent fall in the proportion that responded to two or more stimuli. Thus, after NAc stimulation, VP neurons became more selective for a particular type of reward stimulus, conceivably adding clarity to the discrimination among reward signals for downstream targets (Fig. S6).
Test Requirements Modulate VP Bursting and Baseline Activity.
VP bursting patterns and baseline firing levels were modulated by the behavioral requirements of the tests. Temporal bursting patterns increased by about 70% as the test paradigm was switched from the pure Pavlovian test (where CS and intraoral UCS stimuli were presented no matter what the rat did; 20–60 min postdrug injection) to the active Pavlovian approach test (where rats had to approach the dish and actively retrieve to consume a sucrose pellet UCS; 60–75 min postinjection; ANOVA: F4, 91 = 14.979, P < 0.001) (Fig. S7). Higher bursting during the active approach task was observed during both tonic baseline firing and phasic CS and UCS peaks (F2,233 = 17.95, P < 0.001). We also observed a concomitant increase in baseline rate in prestimulus firing at the task switch compared with earlier test blocks (Fig. S3). This was most pronounced in the vehicle-control condition but also present after amphetamine or DAMGO microinjections (all conditions overall: F2,233 = 17.95, P < 0.001; vehicle/amphetamine P < 0.05 and vehicle/DAMGO P < 0.001). The rise in VP baseline (and bursting) activity could reflect the switch to active-approach conditions, contextual conditioning to the chamber, or a relative clamping of baseline firing by NAc drugs that contributed to the steady stimulus-evoked firing increases across test epochs.
Anatomical Distribution of NAc Manipulations.
Finally, we mapped the NAc microinjection effects on VP firing based on anatomical identification of microinjection sites and additional Fos plume data on the spread of drug impact around a microinjection (35, 36) (Fig. S1). As outlined in Materials and Methods, the local spread of NAc neuronal activation caused by DAMGO or amphetamine microinjection was measured under conditions of maximal spread (i.e., first drug microinjection; n = 10 separate rats). This measured diameter was used to construct symbol size in Fig. S1 plume maps representing drug spread (plume center volumes [i.e., 10× normal or 5× vehicle Fos elevation] = 0.0037–0.10 mm3; surrounding outer halo volumes [i.e., 5× normal or 3× vehicle elevation] = 0.84–1.40mm3). It is important to note that all other data depicted in the maps show functional consequences of microinjections and were obtained from rats tested for VP firing and behavior (integration of Fos/function data discussed in Materials and Methods). Microinjection locations in the NAc from the VP recording experiment were plotted based on histologically confirmed locations, and functional maps were constructed representing the effect of each drug microinjection at a particular NAc location on firing in all VP neurons recorded from that particular animal. These maps revealed that the enhancements of incentive salience and hedonic impact were anatomically consistent across our microinjection sites focused on the previously identified hotspot in the rostrodorsal quadrant of medial NAc shell. At most microinjection sites in the NAc medial shell, DAMGO and amphetamine raised phasic VP firing to the CS+2. By contrast, only DAMGO placements in and around the hotspot in midrostral medial shell additionally enhanced phasic VP firing to the UCS (35, 37).
Discussion
Presentation of cues for food, sex, drugs, or money to humans can evoke an increase in VP blood oxygen level dependence (BOLD) signals (14, 18, 19, 61), and in rodents, robust firing in VP neurons is evoked by cues for sensory rewards (46, 62). It has usually been difficult to determine which component of Pavlovian reward (prediction, incentive salience, or hedonic impact) is represented by a neural activation in VP and larger circuitry, because all these reward components covary together in most cases. However, they can be dissociated experimentally under special conditions, such as here, and also may dissociate clinically to cause particular disorders, such as addiction (42, 44). Functional dissociations imply that brain circuits are able to disentangle and track the reward component signals, but how that task is accomplished has remained unclear. Our results help solve this logical puzzle. We showed that dopamine or opioid neuromodulators in the NAc differentially fine-tuned the ratio of reward components represented in VP firing in a manner that may serve to separately track even jointly enhanced incentive salience vs. hedonic impact signals separately.
Associative Prediction of Future Reward.
Incentive- and hedonic-related firing patterns were also distinguishable from prediction signals that, in contrast, did not change during dopamine or opioid stimulation of the NAc. Prediction signal firing rate profiles normally dominated (i.e., under vehicle-control conditions), with highest firing occurring as a consistently rapid and phasic burst that peaked at the onset of CS+1. Prediction dominance by CS+1 here may be related to previous reports of maximal firing to the first predictor in a sequence of several cues by midbrain dopamine neurons and neurons in target structures (49, 51, 63–65) in studies not incorporating drugs or other neural manipulations. It is consistent with computational temporal difference algorithms that increment a cached value of expected future reward, and with learning, that move the prediction forward in time to the initial cue (at least when mesocorticolimbic systems are not in a stimulated state) (50, 51, 66).
Incentive Salience or Wanting Component.
Dopamine stimulation of the NAc enhanced only incentive salience-related firing that peaked at the CS+2. Opioid stimulation similarly enhancedCS+2 firing (and additionally, enhanced hedonic UCS firing as discussed below). These incentive amplifications made CS+2-evoked firing consistently greater than CS+1-evoked firing after both drugs, shifting dominant representations in VP activity from prediction to incentive motivation. Accentuated firing related to the incentive cue was similar across variations in testing procedure (serial cues vs. single-approach cue) and independent of baseline firing rates. Behavioral responses were consistent with this shift reflecting higher wanting for sweet reward: rats ate more chocolate after receiving either amphetamine or DAMGO microinjections in the NAc shell. Enhancement of eating by NAc opioid stimulation is well-known (1, 31, 35, 53), and eating enhancement by NAc amphetamine, although more fragile, is also consistent with prior reports (67, 68) as well as our hypothesis that dopamine mediates incentive salience in a manner sufficient to drive motivated behavior like eating (1, 42, 69) (note that NAc amphetamine-stimulated eating can be contrasted to anorectic effects of systemic amphetamine that occur primarily through norepinephrine or dopamine in medial hypothalamus [and in other structures at high doses]) (70–73).
To distinguish prediction and incentive salience computationally, Zhang et al. (48) proposed a model that quantitatively disentangles the two components in a way that fits our results. In that model, incentive salience, , is computed as (Eq. 1)
Inputs to incentive salience on the right side of the equation distinguish a stable, well-learned prediction value (rt) that is equivalent to predictions of future reward derived from contemporary temporal difference models of reinforcement learning and accumulated based on previous pairings of the Pavlovian CS with reward UCS (51, 66, 74, 75) from a more dynamic gain-control factor (k) that can shift quickly with changes in physiological state. This gain-control factor, we suggest, evolved to allow natural appetites to modulate wanting independently from associative predictions, and here, it was dynamically elevated by NAc dopamine or opioid stimulation. Consequently, CS+1 firing reflecting prediction of future reward based on the associative memory cache (rt) was unchanged here. By contrast, only firing evoked by the CS+2 (or CS+ in the approach task) was enhanced by DAMGO and amphetamine microinjections that amplified the NAc-VP gain factor (k).
Hedonic Impact or Liking Component.
Identifying the hedonic component of reward required a different approach, which was aided by effects of NAc opioid stimulation. DAMGO microinjection in the NAc was the only manipulation to enhance liking reactions to the sucrose UCS, which was assessed behaviorally by orofacial taste reactivity (and to enhance the much weaker conditioned hedonic orofacial reactions evoked by CS tones). Amphetamine microinjection failed to enhance liking reactions to any stimulus. Accordingly, only opioid stimulation enhanced the slow-latency rise in VP firing to the UCS that seemed to track the hedonic impact of sucrose taste. These findings show that μ-opioid stimulation of the NAc hotspot magnifies neural signals evoked by a sweet sensation in a manner that may inform neural target structures about the enhancement of hedonic impact.
Reward Component Separation by Population Segregation and Firing Pattern.
All of the neural responses discussed above occur closely together in time, making it a potential challenge for downstream neural circuits to disentangle reward components. Our results, however, suggest that liking, wanting, and learned prediction of reward are still distinguishable in NAc-VP circuits by neuronal population and firing pattern codes. First, somewhat different VP neuronal subpopulations seemed to carry incentive salience (CS+2) vs. hedonic impact (UCS) signals, even when both components were enhanced by the same NAc opioid stimulation. Neuronal segregation seems compatible with anatomical reports that multiple segregated paths exist in parallel within NAc-VP circuits, which are embedded within larger corticolimbic circuits and pallidal-thalamocortical reentry loops (7). Second, even for VP neurons that represented combined wanting and liking enhancements, CS+2 effects were potentially distinguishable by a faster-latency rise in activation than sucrose UCS effects. These population and temporal firing mechanisms would allow liking, wanting, and prediction signals to be told apart within ventral striatopallidal circuits during natural appetite, drug intoxication or withdrawal, or stress states known to elevate reward measures (1, 2, 4, 22, 25, 27, 30, 31, 46, 76).
We suggest that these VP neuronal signals for reward components could be related to reports of activity in human posterior VP positively correlated with pleasant food images (17) and mechanisms by which opioid and related stimulation in a VP hotspot can modulate the hedonic impact of sensory rewards (33, 36, 77). Conversely, one could speculate that impairment of hedonic signals in NAc-VP pathways could contribute to clinical manifestations of anhedonia or incentive motivation impairment in depression and related disorders (78) or to dysphoria after lesions encroaching on the VP hotspot (79–81).
NAc stimulation also focused firing activation on a particular stimulus, individually tailored for particular neurons so that more VP neurons responded to only one stimulus and not any of the others (e.g., CS+1 only, CS+2 only, or UCS only) rather than to multiple stimuli. The net effect of this was a relative increase in the size of the activated neural population with labeled line-like qualities (i.e., responses to a single stimulus entity). More focused channeling could facilitate tracking of separate reward components and improve clarity for downstream neural recipients. This idea is similar to the notion of dynamic focused activation in basal ganglia as a mechanism used to select appropriate actions (82–86).
Concerning NAc-VP interaction, many have noted that reciprocal inhibitory GABAergic projections tend to enforce opposite polarity of events within NAc vs. VP so that hyperpolarization in neurons of the NAc accompanies depolarization in neurons of the VP and vice versa (5, 87, 88). An expectation from that reciprocal inhibition view would be that excitatory peaks of VP firing observed here might have corresponded to inhibitory pauses of NAc neurons, which have been suggested to signal reward (20, 24, 88, 89). However, simultaneous excitations in NAc and VP (or simultaneous inhibitions) may also be possible (29, 37, 62, 87, 90), perhaps enabled by corelease of peptides such as dynorphin, enkephalin, or substance P to modulate the impact of GABA on postsynaptic neurons (5, 6, 32, 87). Additionally, neurons of the NAc hotspot in the rostrodorsal medial shell may not project directly to neurons in the posterior VP hotspot recorded here but rather, to a more anterior site in VP and to lateral hypothalamus (7) from where interneurons might convey signals to posterior VP to contribute to functional interactions (37). A series of multiple GABAergic synapses would open the possibility for disinhibition to create simultaneous excitations in NAc-VP hotspots. Future studies would be needed to assess these issues. Furthermore, regarding circuitry, we emphasize that VP neurons also are likely to encode many other signals beyond reward, including ones related to the originally hypothesized function of limbic to motor translation. Also, of course, the NAc-VP path is only one segment within larger mesocorticolimbic circuits involving cortical reentry loops, parallel segregation, and other features important for both reward impact and translation into behavior (1, 5–7, 11, 27, 34, 91, 92).
Finally, we suggest that our results have relevance to addiction and other compulsive disorders. For example, addiction-related cues can often trigger relapse and consumption, but their ability to do so fluctuates. We suggest that the motivation power of cues is amplified by states of mesolimbic reactivity that magnify incentive salience as observed here, such as drug intoxication or stress states (41, 42, 49, 93). Likewise, the UCS proximity of peak incentive salience signals may be related to why it is easier to resist cues that are temporally distant from rewards (e.g., sight of a crack house; here comparable with encountering CS+1) than to resist other cues that are temporally closer to reward (e.g., sight of crack in one's own hand; comparable with CS+2). In short, the observations described above suggest a possible role for NAc-VP motivational signals in controlling the ebb and flow of vulnerability to cues as triggers of addictive behaviors.
Materials and Methods
Male Sprague-Dawley rats were housed individually in tub cages on a reverse light–dark cycle (total n = 8 rats, 115 neurons from nine test sessions). Additional rats (n = 10) were used to measure Fos reactivity at NAc microinjection sites to estimate drug functional spread (see below). Experiments were conducted during late morning to afternoon hours, which coincided with the rats’ active (dark) period after acclimating to housing conditions for at least 1 wk. Food and water were available ad libitum throughout testing, except for food restriction (20 g/d) during habituation only for the main recording experiment. All procedures were approved by the University Committee on the Use and Care of Animals at the University of Michigan.
Habituation and Surgery.
Rats were initially habituated to the testing chamber for 3 d, given preparatory magazine training for the Pavlovian approach task (free sucrose pellets in the magazine chamber), and given M&Ms in their home cage overnight for familiarization. For surgery, rats were anesthetized with ketamine (100 mg/kg) and xylazine (10 mg/kg) and placed in a stereotaxic apparatus. Bilateral oral cannulae (PE-100 tubing) entered the mouth in the upper cheek lateral to the first maxillary molar, were threaded beneath the zygomatic arch, and exited the dorsal head near the skull (94). Rats also received intracranial cannulae and electrode implantation (on the same day for five rats or after the fourth training day for three other rats). Bilateral stainless steel guide cannulae (23 gauge) were implanted 2.5 mm above the rostral and dorsal quadrant of the NAc medial shell (histologically identified placements spanned anteroposterior (AP) = 0.9–2.2 mm, mediolateral (ML) ± 1.0 mm, dorsoventral (DV) = −6.6 to 7.7 mm) (Fig. S1). A stainless steel obturator was inserted in the NAc cannulae to prevent occlusion. A recording electrode was implanted unilaterally in the posterior VP (histologically shown to span ML = 2.4–3.0 mm, AP = −0.5 to 1.2 mm, and DV = −7.6 to 8.0 mm) (Fig. S1). Each electrode consisted of eight wires (50-μm tungsten). One-half of the electrodes had a screw-driven brass microdrive for lowering before testing, and one-half were lowered during surgery and permanently fixed. On test days, one wire with no spike activity was selected during testing sessions to serve as a reference channel for differential recording. The implant was anchored to the skull with bone screws and acrylic cement. Animals were allowed to recover for at least 7 d.
Training and Serial Cue Design.
A serial Pavlovian reward task was used to separate moments of maximal occurrence for predictive, incentive, and hedonic signals: a CS+1 followed by a CS+2 and an intraoral sucrose UCS, all at fixed intervals. In information theory (50) in the trial overall, the sucrose UCS has an objective probability of occurrence P = 0.07 (e.g., 1 of every 14.7 bins of 10-s duration). That is, in an information theory sense, the surprisal value (h) of a UCS event generally occurring is h = log2 (1/p). For UCS, h = 3.88 [h = log2 (14.7)] if sequential dependencies are not considered (50). This general probability might also correspond psychologically to the strength of general contextual association between the training chamber and sucrose if contextual learning matches the objective probability of reward. However, during the CS+1 → CS+2 → UCS sequence, the conditional probability of reward after CS+1 temporarily rises to 100% or P = 1.0 (conditional probability of CS+2 likewise becomes certain; P = 1.0). After it is learned, when a CS+1 is encountered, the momentary surprisal value of the 100% certain UCS that follows reduces to h = 0 [log2 (1/1.0)] as does the surprisal value of the intervening CS+2. That is, the surprisal value of the CS+1 remains h = 3.88, but after it occurs, the CS+1 fully predicts (i.e., reduces uncertainty) both the CS+2 and UCS. The subsequent CS+2 within the sequence adds no additional predictive information or uncertainty reduction concerning the already-predicted UCS. This is seen by noting that, after CS+1, CS+2 h = 0 [log2 (1/1.0)]. The h = 0 zero value of CS+2 and UCS means that the CS+2 occurrence cannot reduce sequential uncertainty about UCS any more (i.e., h cannot go below zero). In other words, the 100% stereotypy of the CS+1 → CS+2 → UCS sequence transfers all information surprise and prediction value forward to the CS+1 that initiates the sequence (similar redundancy is often used to compact data in encryption systems) (48, 50, 51). After a sequence is over, the general probability of reward reverts to 0.07 for some time until the next CS+1 is encountered when the conditional probabilities of the sequence again take over.
However, the CS+2 still carries a high degree of incentive salience, potentially even more than the CS+1 (48, 49). For example, CS+2 presentations are temporally associated with higher levels of appetitive Pavlovian conditioned approach responses than CS+1 presentations (49, 95, 96). Similarly, late-phase behaviors occurring near reward may be more potentiated by a CS+ than earlier-phase behaviors in Pavlovian-instrumental transfer and related procedures to assess cue-triggered incentive motivation (97, 98), which is consistent with a temporal distinction between prediction and incentive salience. Also, hyperbolic discounting or impulsive choice for more immediate rewards in humans is related to dopamine activation (99). Most relevant, systemic amphetamine administration or drug sensitization, which amplifies incentive salience through enhanced dopamine-related activation (93), specifically amplify VP neuron firing to a CS+2 but not to an earlier CS+1 (49). This phenomenon was exploited here to identify firing signals in the VP related to incentive salience that occurred at CS+2 presentation, which is distinct from prediction signals maximal to the CS+1.
Finally, the UCS sucrose taste itself is the only stimulus to carry sensory sweetness, and it elicits maximal facial affective reactions (even when sucrose is predicted by CS+1 and CS+2). Thus, in this stimulus sequence, the UCS is assumed to carry the strongest hedonic impact (45), and this assumption was confirmed by taste reactivity results in which UCS evoked 10 times higher levels of positive orofacial reactions than CS+1 or CS+2.
Training took place on each of 5 consecutive d (Fig. 1) and consisted of two blocks conducted sequentially on each day. Rats were placed in a transparent Plexiglas chamber (28 × 35 cm) with an open top. During the serial cue trials, rats were confined to a transparent cylinder (25-cm diameter) placed within the chamber. A mirror under the glass bottom of the cylinder allowed video recording of bodily and orofacial movements. A computer program controlled stimulus presentations and oral sucrose infusions (Mtask; J. Wayne Aldridge, University of Michigan, Ann Arbor, MI). A separate program controlled neural recording (Recorder; Plexon Inc). The time-stamped clocks were synchronized for taste infusions, stimulus events, neural recordings, and videotape recordings. In the first trial block, rats were presented with 10 serial CS+ trials consisting of the two sequential tones (5-s CS+1 followed by 5-s intertrial interval (ITI) and 5-s CS+2) that predicted an intraoral 0.1-mL infusion of 9% sucrose solution (0.26 M). The infusion was 10 s in duration and began 3 s after CS+2 onset. Ten CS− trials, consisting of a distinct tone predicting nothing, were interspersed randomly with serial CS+ trials using a variable 1-min ITI. CS+1 and CS+2 tone identities were counterbalanced between rats and were a high- (3,800 Hz) or low-frequency continuous tone (400 Hz). CS+ tone assignment did not affect the rate of VP firing to stimuli, and it was not a factor in the effects of NAc microinjection on VP firing (two-way ANOVA on drug X tone assignment; tone assignment: F2,775 = 2.05, NS; drug and tone interaction: F2,775 = 1.79, NS); therefore, groups were combined for analyses. The CS− was a low-frequency 0.75-s on/off pulsed tone (400 Hz). A white noise generator was used to mask the noise of the sucrose infusion pump located exterior to the testing chamber. In the second training block conducted immediately after the first block on the same days, the Plexiglas cylinder was removed, and animals were given 10 Pavlovian approach trials in which a single CS+ (feeder click) was followed immediately by the delivery of a sucrose pellet into a hopper (variable 1-min ITI).
Testing.
Tests were then conducted on 3 repeated d spaced 48 h apart and separated by 1 d retraining (conducted as above) (Fig. 1). On each test day, rats first received an intra-NAc microinjection. Drugs were 0.05 μg/0.2 μL DAMGO and 10 μg/0.2 μL d-amphetamine sulfate (Sigma) dissolved in artificial cerebral spinal fluid (ACSF) as vehicle (Harvard Apparatus). Microinjections were made using a stainless steel injector cannula (29 gauge) that extended 2.5 mm beyond the ventral end of the guide cannulae and connected to PE-20 tubing and a syringe pump. Drug microinjections were made bilaterally at a rate of 0.030 μL/min. Microinjector tips were left in place for an additional 1 min after each infusion, and then, obturators were reinserted. Animals received a mock injection of ACSF vehicle after the fifth day of training for habituation. Each rat received each of the three drugs (amphetamine, DAMGO, and ACSF) pseudorandomly assigned across test days.
Twenty minutes after drug microinjection, rats were presented with 10 serial CS+ trials (CS+1 followed by CS+2) randomized with 10 CS− trials in an extinction setting without sucrose reward delivery (although the empty syringe pump was activated as usual). This extinction condition isolated direct shifts in CS signals and prevented CS assessment from being confounded by new learning about the UCS during the test under the influence of drug (e.g., preventing positive UCS prediction error or hedonic signals that might secondarily elevate CS signals). Immediately after the test, hedonic impact of sucrose infusions was tested in subsequent UCS-only trials to isolate sucrose hedonic signals and protect UCS impact from being dampened by prediction. Animals were presented with 10 free oral infusions of the sucrose UCS (10-s infusions and variable 1-min ITI) (Fig. 1). Afterward, the Plexiglas cylinder was removed, and animals were given 10 Pavlovian approach trials that were identical to training in which a feeder click preceded the delivery of one sucrose pellet into a hopper. Finally, animals were then removed from the testing chamber and placed in their home cage with M&Ms. Intake (grams and number consumed) was recorded at 30 and 60 min.
Neural Analysis.
Single neurons were identified using principle components or peak-width analysis of waveforms using Offline Sorter (Plexon Inc). Neurons were verified by distinct spike waveforms and clear refractory periods in an autocorrelation histogram, and cross-correlation analysis was performed to ensure that neurons were counted only one time (NeuroExplorer; Nex Techologies).
A normalized firing response to a stimulus event for each neuron was obtained by dividing the neuron's absolute firing response in a predetermined 500-ms time window at stimulus onset by its baseline during a 5-s period before each trial's onset. A neuron was considered responsive if its absolute firing rate was different from the preceding baseline period for a stimulus event of interest (determined by Tukey-corrected t tests). A few slow-firing neurons that showed phasic excitatory responses to stimuli and crossed a 90% confidence interval but did not reach significance in t tests were also considered responsive. Stimuli events were also compared for neuronal firing at successive 100- or 500-ms epochs to characterize response properties. For comparison of NAc microinjection effects on VP firing, normalized firing rate to each stimulus event in the three testing blocks was compared across drug conditions using ANOVA with trial as a covariate, and individual drug comparisons were made using Tukey posthoc tests. Except where noted, all tests of normalized firing responses to events were conducted on cells firing differently from baseline during the stimulus of interest (exceptions were profile analysis and Fos plume maps that compared across all recorded neurons). Drug-evoked changes in basal firing rate of VP neurons were also compared across microinjection conditions using one-way ANOVA for each of the three testing periods (normalized firing magnitude in a −5- to 0-s window before CS+1, UCS, and feeder click). Additional profile and bursting analysis methods are described in SI Materials and Methods and SI Results.
Behavioral Analyses.
Hedonic, aversive, and neutral taste reactivity patterns during the sucrose infusion in the UCS-only test (stimulus duration plus 5 s = 15 s) (Fig. 3) and during presentation of CS tones in the CS-only test (stimulus duration plus 5 s = 10 s) (Fig. S5) were videotaped and scored off-line in slow-motion using established procedures (94). Hedonic responses included rhythmic midline tongue protrusions, lateral tongue protrusions, and paw licks. Aversive responses included gapes, head shakes, face washes, forelimb flails, and chin rubs. Neutral responses included passive dripping of solution out of the mouth, ordinary grooming, and rhythmic mouth movements. Individual reaction totals were calculated for hedonic vs. aversive categories by adding all response scores within an affective category for that rat (hedonic, aversive, and neutral). These scores were statistically examined for drug vs. vehicle effects using ANOVA with trial as a covariate. Additionally, alerting reactions to cue presentation (head turn, step, or rear) were videoscored offline during the first 2 s of CS tones and were compared statistically between drug and vehicle (orient vs. not orient; one-way ANOVA per CS stimulus and trial covariate) (SI Materials and Methods and SI Results). M&M intake under drug was analyzed against intake under vehicle for statistical testing (one-way ANOVA).
Histology.
Cannulae tracks were marked with ink, and rats were overdosed with sodium pentobarbital at the end of the experiment. Brains were removed, fixed in 10% paraformaldehyde, cryoprotected with buffered 20% sucrose solution, sectioned coronally (60 μm), and stained for nissl substance. Maps illustrating the location of microinjection sites and electrode recording sites (Fig. S1) were constructed by identifying the spread of ink from the center of the microinjector tip placement on tissue sections and by identifying the electrode track and tip across sections (100). Separate animals (n = 10) were used to calculate spread of microinjection using Fos plume measurements used to guide functional mapping (SI Materials and Methods and SI Results).
Supplementary Material
Acknowledgments
We thank Ryan Grant, Andrew Klein, Jonathon Lee, and Steve Zekany for assistance. This work was supported by National Institutes of Health Grants T32 DC00011 (to K.S.S.), MH63649 (to K.C.B.), DA015188 (to K.C.B.), and DA017752 (to J.W.A.).
Footnotes
The authors declare no conflict of interest.
*This Direct Submission article had a prearranged editor.
See Author Summary on page 10935.
This article contains supporting information online at www.pnas.org/lookup/suppl/doi:10.1073/pnas.1101920108/-/DCSupplemental.
References
- 1.Baldo BA, Kelley AE. Discrete neurochemical coding of distinguishable motivational processes: Insights from nucleus accumbens control of feeding. Psychopharmacology (Berl) 2007;191:439–459. doi: 10.1007/s00213-007-0741-z. [DOI] [PubMed] [Google Scholar]
- 2.Robbins TW, Ersche KD, Everitt BJ. Drug addiction and the memory systems of the brain. Ann N Y Acad Sci. 2008;1141:1–21. doi: 10.1196/annals.1441.020. [DOI] [PubMed] [Google Scholar]
- 3.McGraw LA, Young LJ. The prairie vole: An emerging model organism for understanding the social brain. Trends Neurosci. 2010;33:103–109. doi: 10.1016/j.tins.2009.11.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Kalivas PW, Volkow ND. The neural basis of addiction: A pathology of motivation and choice. Am J Psychiatry. 2005;162:1403–1413. doi: 10.1176/appi.ajp.162.8.1403. [DOI] [PubMed] [Google Scholar]
- 5.Zahm DS. An integrative neuroanatomical perspective on some subcortical substrates of adaptive responding with emphasis on the nucleus accumbens. Neurosci Biobehav Rev. 2000;24:85–105. doi: 10.1016/s0149-7634(99)00065-2. [DOI] [PubMed] [Google Scholar]
- 6.Smith KS, Tindell AJ, Aldridge JW, Berridge KC. Ventral pallidum roles in reward and motivation. Behav Brain Res. 2009;196:155–167. doi: 10.1016/j.bbr.2008.09.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Thompson RH, Swanson LW. Hypothesis-driven structural connectivity analysis supports network over hierarchical model of brain architecture. Proc Natl Acad Sci USA. 2010;107:15235–15239. doi: 10.1073/pnas.1009112107. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Wang Z, Aragona BJ. Neurochemical regulation of pair bonding in male prairie voles. Physiol Behav. 2004;83:319–328. doi: 10.1016/j.physbeh.2004.08.024. [DOI] [PubMed] [Google Scholar]
- 9.Wise RA. Forebrain substrates of reward and motivation. J Comp Neurol. 2005;493:115–121. doi: 10.1002/cne.20689. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Watts AG, Swanson LW. Anatomy of motivational systems. In: Gallistel CR, editor. Stevens’ Handbook of Experimental Psychology. 3rd Ed. Vol. 3. New York: Wiley; 2002. pp. 563–632. [Google Scholar]
- 11.Haber SN, Knutson B. The reward circuit: Linking primate anatomy and human imaging. Neuropsychopharmacology. 2010;35:4–26. doi: 10.1038/npp.2009.129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Torregrossa MM, Tang XC, Kalivas PW. The glutamatergic projection from the prefrontal cortex to the nucleus accumbens core is required for cocaine-induced decreases in ventral pallidal GABA. Neurosci Lett. 2008;438:142–145. doi: 10.1016/j.neulet.2008.04.016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Mogenson GJ, Jones DL, Yim CY. From motivation to action: Functional interface between the limbic system and the motor system. Prog Neurobiol. 1980;14:69–97. doi: 10.1016/0301-0082(80)90018-0. [DOI] [PubMed] [Google Scholar]
- 14.Stoeckel LE, et al. Widespread reward-system activation in obese women in response to pictures of high-calorie foods. Neuroimage. 2008;41:636–647. doi: 10.1016/j.neuroimage.2008.02.031. [DOI] [PubMed] [Google Scholar]
- 15.McClure SM, York MK, Montague PR. The neural substrates of reward processing in humans: The modern role of FMRI. Neuroscientist. 2004;10:260–268. doi: 10.1177/1073858404263526. [DOI] [PubMed] [Google Scholar]
- 16.O'Doherty JP. Reward representations and reward-related learning in the human brain: Insights from neuroimaging. Curr Opin Neurobiol. 2004;14:769–776. doi: 10.1016/j.conb.2004.10.016. [DOI] [PubMed] [Google Scholar]
- 17.Beaver JD, et al. Individual differences in reward drive predict neural responses to images of food. J Neurosci. 2006;26:5160–5166. doi: 10.1523/JNEUROSCI.0350-06.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Pessiglione M, et al. How the brain translates money into force: A neuroimaging study of subliminal motivation. Science. 2007;316:904–906. doi: 10.1126/science.1140459. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Small DM, Veldhuizen MG, Felsted J, Mak YE, McGlone F. Separable substrates for anticipatory and consummatory food chemosensation. Neuron. 2008;57:786–797. doi: 10.1016/j.neuron.2008.01.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Setlow B, Schoenbaum G, Gallagher M. Neural encoding in ventral striatum during olfactory discrimination learning. Neuron. 2003;38:625–636. doi: 10.1016/s0896-6273(03)00264-2. [DOI] [PubMed] [Google Scholar]
- 21.Day JJ, Carelli RM. The nucleus accumbens and Pavlovian reward learning. Neuroscientist. 2007;13:148–159. doi: 10.1177/1073858406295854. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Carlezon WA, Jr., Thomas MJ. Biological substrates of reward and aversion: A nucleus accumbens activity hypothesis. Neuropharmacology. 2009;56(Suppl 1):122–132. doi: 10.1016/j.neuropharm.2008.06.075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Nicola SM, Yun IA, Wakabayashi KT, Fields HL. Cue-evoked firing of nucleus accumbens neurons encodes motivational significance during a discriminative stimulus task. J Neurophysiol. 2004;91:1840–1865. doi: 10.1152/jn.00657.2003. [DOI] [PubMed] [Google Scholar]
- 24.Taha SA, Fields HL. Encoding of palatability and appetitive behaviors by distinct neuronal populations in the nucleus accumbens. J Neurosci. 2005;25:1193–1202. doi: 10.1523/JNEUROSCI.3975-04.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Dallman MF. Stress-induced obesity and the emotional nervous system. Trends Endocrinol Metab. 2010;21:159–165. doi: 10.1016/j.tem.2009.10.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Kringelbach ML. The human orbitofrontal cortex: Linking reward to hedonic experience. Nat Rev Neurosci. 2005;6:691–702. doi: 10.1038/nrn1747. [DOI] [PubMed] [Google Scholar]
- 27.Kringelbach ML. The hedonic brain: A functional neuroanatomy of human pleasure. In: Kringelbach ML, Berridge KC, editors. Pleasures of the Brain. Oxford: Oxford University Press; 2009. pp. 202–221. [Google Scholar]
- 28.Georgiadis JR, Kortekaas R. The sweetest taboo: Functional neurobiology of human sexuality in relation to pleasure. In: Kringelbach ML, Berridge KC, editors. Pleasures of the Brain. Oxford: Oxford University Press; 2009. pp. 178–201. [Google Scholar]
- 29.Root DH, Fabbricatore AT, Ma S, Barker DJ, West MO. Rapid phasic activity of ventral pallidal neurons during cocaine self-administration. Synapse. 2010;64:704–713. doi: 10.1002/syn.20792. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Smith KS, Mahler SM, Peciña S, Berridge KC. Hedonic hotspots: generating sensory pleasure in the brain. In: Kringelbach ML, Berridge KC, editors. Pleasures of the Brain. Oxford, UK: Oxford University Press; 2009. pp. 27–49. [Google Scholar]
- 31.Berthoud HR. Multiple neural systems controlling food intake and body weight. Neurosci Biobehav Rev. 2002;26:393–428. doi: 10.1016/s0149-7634(02)00014-3. [DOI] [PubMed] [Google Scholar]
- 32.Chrobak JJ, Napier TC. Opioid and GABA modulation of accumbens-evoked ventral pallidal activity. J Neural Transm. 1993;93:123–143. doi: 10.1007/BF01245342. [DOI] [PubMed] [Google Scholar]
- 33.Wassum KM, Ostlund SB, Maidment NT, Balleine BW. Distinct opioid circuits determine the palatability and the desirability of rewarding events. Proc Natl Acad Sci USA. 2009;106:12512–12517. doi: 10.1073/pnas.0905874106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Kalivas PW, Nakamura M. Neural systems for behavioral activation and reward. Curr Opin Neurobiol. 1999;9:223–227. doi: 10.1016/s0959-4388(99)80031-2. [DOI] [PubMed] [Google Scholar]
- 35.Peciña S, Berridge KC. Hedonic hot spot in nucleus accumbens shell: Where do mu-opioids cause increased hedonic impact of sweetness? J Neurosci. 2005;25:11777–11786. doi: 10.1523/JNEUROSCI.2329-05.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Smith KS, Berridge KC. The ventral pallidum and hedonic reward: Neurochemical maps of sucrose “liking” and food intake. J Neurosci. 2005;25:8637–8649. doi: 10.1523/JNEUROSCI.1902-05.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Smith KS, Berridge KC. Opioid limbic circuit for reward: Interaction between hedonic hotspots of nucleus accumbens and ventral pallidum. J Neurosci. 2007;27:1594–1605. doi: 10.1523/JNEUROSCI.4205-06.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Leyton M. The neurobiology of desire: Dopamine and the regulation of food and motivational states in humans. In: Kringelbach ML, Berridge KC, editors. Pleasures of the Brain. Oxford, UK: Oxford University Press; 2009. pp. 222–243. [Google Scholar]
- 39.Barbano MF, Cador M. Opioids for hedonic experience and dopamine to get ready for it. Psychopharmacology (Berl) 2007;191:497–506. doi: 10.1007/s00213-006-0521-1. [DOI] [PubMed] [Google Scholar]
- 40.Robinson S, Sandstrom SM, Denenberg VH, Palmiter RD. Distinguishing whether dopamine regulates liking, wanting, and/or learning about rewards. Behav Neurosci. 2005;119:5–15. doi: 10.1037/0735-7044.119.1.5. [DOI] [PubMed] [Google Scholar]
- 41.Davis CA, et al. Dopamine for “wanting” and opioids for “liking”: A comparison of obese adults with and without binge eating. Obesity (Silver Spring) 2009;17:1220–1225. doi: 10.1038/oby.2009.52. [DOI] [PubMed] [Google Scholar]
- 42.Robinson TE, Berridge KC. The neural basis of drug craving: An incentive-sensitization theory of addiction. Brain Res Brain Res Rev. 1993;18:247–291. doi: 10.1016/0165-0173(93)90013-p. [DOI] [PubMed] [Google Scholar]
- 43.Peciña S, Cagniard B, Berridge KC, Aldridge JW, Zhuang X. Hyperdopaminergic mutant mice have higher “wanting” but not “liking” for sweet rewards. J Neurosci. 2003;23:9395–9402. doi: 10.1523/JNEUROSCI.23-28-09395.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Flagel SB, et al. A selective role for dopamine in stimulus-reward learning. Nature. 2011;469:53–57. doi: 10.1038/nature09588. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Tindell AJ, Smith KS, Peciña S, Berridge KC, Aldridge JW. Ventral pallidum firing codes hedonic reward: When a bad taste turns good. J Neurophysiol. 2006;96:2399–2409. doi: 10.1152/jn.00576.2006. [DOI] [PubMed] [Google Scholar]
- 46.Tindell AJ, Smith KS, Berridge KC, Aldridge JW. Dynamic computation of incentive salience: “Wanting” what was never “liked.”. J Neurosci. 2009;29:12220–12228. doi: 10.1523/JNEUROSCI.2499-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Tindell AJ, Berridge KC, Aldridge JW. Ventral pallidal representation of pavlovian cues and reward: Population and rate codes. J Neurosci. 2004;24:1058–1069. doi: 10.1523/JNEUROSCI.1437-03.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Zhang J, Berridge KC, Tindell AJ, Smith KS, Aldridge JW. A neural computational model of incentive salience. PLoS Comput Biol. 2009;5:e1000437. doi: 10.1371/journal.pcbi.1000437. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Tindell AJ, Berridge KC, Zhang J, Peciña S, Aldridge JW. Ventral pallidal neurons code incentive motivation: Amplification by mesolimbic sensitization and amphetamine. Eur J Neurosci. 2005;22:2617–2634. doi: 10.1111/j.1460-9568.2005.04411.x. [DOI] [PubMed] [Google Scholar]
- 50.Attneave F. Applications of Information Theory to Psychology: A Summary of Basic Concepts, Methods, and Results. New York: Holt, Reinhart and Winston; 1959. [Google Scholar]
- 51.Schultz W, Dayan P, Montague PR. A neural substrate of prediction and reward. Science. 1997;275:1593–1599. doi: 10.1126/science.275.5306.1593. [DOI] [PubMed] [Google Scholar]
- 52.Zhang M, Balmadrid C, Kelley AE. Nucleus accumbens opioid, GABaergic, and dopaminergic modulation of palatable food motivation: Contrasting effects revealed by a progressive ratio study in the rat. Behav Neurosci. 2003;117:202–211. doi: 10.1037/0735-7044.117.2.202. [DOI] [PubMed] [Google Scholar]
- 53.Woolley JD, Lee BS, Fields HL. Nucleus accumbens opioids regulate flavor-based preferences in food consumption. Neuroscience. 2006;143:309–317. doi: 10.1016/j.neuroscience.2006.06.067. [DOI] [PubMed] [Google Scholar]
- 54.Peciña S, Berridge KC. Opioid site in nucleus accumbens shell mediates eating and hedonic ‘liking’ for food: Map based on microinjection Fos plumes. Brain Res. 2000;863:71–86. doi: 10.1016/s0006-8993(00)02102-8. [DOI] [PubMed] [Google Scholar]
- 55.Delamater AR, LoLordo VM, Berridge KC. Control of fluid palatability by exteroceptive Pavlovian signals. J Exp Psychol Anim Behav Process. 1986;12:143–152. [PubMed] [Google Scholar]
- 56.Bolles RC. Reinforcement, expectancy, & learning. Psychol Rev. 1972;79:394–409. [Google Scholar]
- 57.Kerfoot EC, Agarwal I, Lee HJ, Holland PC. Control of appetitive and aversive taste-reactivity responses by an auditory conditioned stimulus in a devaluation task: A FOS and behavioral analysis. Learn Mem. 2007;14:581–589. doi: 10.1101/lm.627007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Berridge KC, Schulkin J. Palatability shift of a salt-associated incentive during sodium depletion. Q J Exp Psychol B. 1989;41:121–138. [PubMed] [Google Scholar]
- 59.Toates FM. Motivational Systems. Cambridge, UK: Cambridge University Press; 1986. [Google Scholar]
- 60.Bindra D. A motivational view of learning, performance, and behavior modification. Psychol Rev. 1974;81:199–213. doi: 10.1037/h0036330. [DOI] [PubMed] [Google Scholar]
- 61.Childress AR, et al. Prelude to passion: Limbic activation by “unseen” drug and sexual cues. PLoS One. 2008;3:e1506. doi: 10.1371/journal.pone.0001506. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 62.Ito M, Doya K. Validation of decision-making models and analysis of decision variables in the rat basal ganglia. J Neurosci. 2009;29:9861–9874. doi: 10.1523/JNEUROSCI.6157-08.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Schultz W, Apicella P, Ljungberg T. Responses of monkey dopamine neurons to reward and conditioned stimuli during successive steps of learning a delayed response task. J Neurosci. 1993;13:900–913. doi: 10.1523/JNEUROSCI.13-03-00900.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.O'Doherty JP, Dayan P, Friston K, Critchley H, Dolan RJ. Temporal difference models and reward-related learning in the human brain. Neuron. 2003;38:329–337. doi: 10.1016/s0896-6273(03)00169-7. [DOI] [PubMed] [Google Scholar]
- 65.McClure SM, Berns GS, Montague PR. Temporal prediction errors in a passive learning task activate human striatum. Neuron. 2003;38:339–346. doi: 10.1016/s0896-6273(03)00154-5. [DOI] [PubMed] [Google Scholar]
- 66.Daw ND, Niv Y, Dayan P. Actions, policies, values, and the basal ganglia. In: Bezard E, editor. Recent Breakthroughs in Basal Ganglia Research. Hauppauge, NY: Nova Science Publishers; 2005. pp. 91–106. [Google Scholar]
- 67.Wise RA, Fotuhi M, Colle LM. Facilitation of feeding by nucleus accumbens amphetamine injections: Latency and speed measures. Pharmacol Biochem Behav. 1989;32:769–772. doi: 10.1016/0091-3057(89)90031-2. [DOI] [PubMed] [Google Scholar]
- 68.Pal GK, Thombre DP. Modulation of feeding and drinking by dopamine in caudate and accumbens nuclei in rats. Indian J Exp Biol. 1993;31:750–754. [PubMed] [Google Scholar]
- 69.Berridge KC, Valenstein ES. What psychological process mediates feeding evoked by electrical stimulation of the lateral hypothalamus? Behav Neurosci. 1991;105:3–14. doi: 10.1037//0735-7044.105.1.3. [DOI] [PubMed] [Google Scholar]
- 70.Adan RA, Vanderschuren LJ, la Fleur SE. Anti-obesity drugs and neural circuits of feeding. Trends Pharmacol Sci. 2008;29:208–217. doi: 10.1016/j.tips.2008.01.008. [DOI] [PubMed] [Google Scholar]
- 71.Wellman PJ, Davies BT, Morien A, McMahon L. Modulation of feeding by hypothalamic paraventricular nucleus alpha 1- and alpha 2-adrenergic receptors. Life Sci. 1993;53:669–679. doi: 10.1016/0024-3205(93)90243-v. [DOI] [PubMed] [Google Scholar]
- 72.Cannon CM, Abdallah L, Tecott LH, During MJ, Palmiter RD. Dysregulation of striatal dopamine signaling by amphetamine inhibits feeding by hungry mice. Neuron. 2004;44:509–520. doi: 10.1016/j.neuron.2004.10.009. [DOI] [PubMed] [Google Scholar]
- 73.Kuo DY. Further evidence for the mediation of both subtypes of dopamine D1/D2 receptors and cerebral neuropeptide Y (NPY) in amphetamine-induced appetite suppression. Behav Brain Res. 2003;147:149–155. doi: 10.1016/j.bbr.2003.04.001. [DOI] [PubMed] [Google Scholar]
- 74.Rescorla RA, Wagner AR, editors. A Theory of Pavlovian Conditioning: Variations in the Effectiveness of Reinforcement and Nonreinforcement. New York: Appleton Century Crofts; 1972. pp. 64–99. [Google Scholar]
- 75.Redish AD, Jensen S, Johnson A. A unified framework for addiction: Vulnerabilities in the decision process. Behav Brain Sci. 2008;31:415–437. doi: 10.1017/S0140525X0800472X. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 76.Marinelli M, Piazza PV. Interaction between glucocorticoid hormones, stress and psychostimulant drugs. Eur J Neurosci. 2002;16:387–394. doi: 10.1046/j.1460-9568.2002.02089.x. [DOI] [PubMed] [Google Scholar]
- 77.Johnson PI, Stellar JR, Paul AD. Regional reward differences within the ventral pallidum are revealed by microinjections of a mu opiate receptor agonist. Neuropharmacology. 1993;32:1305–1314. doi: 10.1016/0028-3908(93)90025-x. [DOI] [PubMed] [Google Scholar]
- 78.Treadway MT, Zald DH. Reconsidering anhedonia in depression: Lessons from translational neuroscience. Neurosci Biobehav Rev. 2011;35:537–555. doi: 10.1016/j.neubiorev.2010.06.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Miller JM, et al. Anhedonia after a selective bilateral lesion of the globus pallidus. Am J Psychiatry. 2006;163:786–788. doi: 10.1176/ajp.2006.163.5.786. [DOI] [PubMed] [Google Scholar]
- 80.Vijayaraghavan L, Vaidya JG, Humphreys CT, Beglinger LJ, Paradiso S. Emotional and motivational changes after bilateral lesions of the globus pallidus. Neuropsychology. 2008;22:412–418. doi: 10.1037/0894-4105.22.3.412. [DOI] [PubMed] [Google Scholar]
- 81.Cromwell HC, Berridge KC. Where does damage lead to enhanced food aversion: The ventral pallidum/substantia innominata or lateral hypothalamus? Brain Res. 1993;624:1–10. doi: 10.1016/0006-8993(93)90053-p. [DOI] [PubMed] [Google Scholar]
- 82.Mink JW. The basal ganglia: Focused selection and inhibition of competing motor programs. Prog Neurobiol. 1996;50:381–425. doi: 10.1016/s0301-0082(96)00042-1. [DOI] [PubMed] [Google Scholar]
- 83.Mink JW. The basal ganglia and involuntary movements: Impaired inhibition of competing motor patterns. Arch Neurol. 2003;60:1365–1368. doi: 10.1001/archneur.60.10.1365. [DOI] [PubMed] [Google Scholar]
- 84.Gurney K, Prescott TJ, Redgrave P. A computational model of action selection in the basal ganglia. I. A new functional anatomy. Biol Cybern. 2001;84:401–410. doi: 10.1007/PL00007984. [DOI] [PubMed] [Google Scholar]
- 85.Berke JD. Uncoordinated firing rate changes of striatal fast-spiking interneurons during behavioral task performance. J Neurosci. 2008;28:10075–10080. doi: 10.1523/JNEUROSCI.2192-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 86.Smith Y, Bevan MD, Shink E, Bolam JP. Microcircuitry of the direct and indirect pathways of the basal ganglia. Neuroscience. 1998;86:353–387. doi: 10.1016/s0306-4522(98)00004-9. [DOI] [PubMed] [Google Scholar]
- 87.Mogenson GJ, Yang CR. The contribution of basal forebrain to limbic-motor integration and the mediation of motivation to action. Adv Exp Med Biol. 1991;295:267–290. doi: 10.1007/978-1-4757-0145-6_14. [DOI] [PubMed] [Google Scholar]
- 88.Nicola SM. The nucleus accumbens as part of a basal ganglia action selection circuit. Psychopharmacology (Berl) 2007;191:521–550. doi: 10.1007/s00213-006-0510-4. [DOI] [PubMed] [Google Scholar]
- 89.Roitman MF, Wheeler RA, Carelli RM. Nucleus accumbens neurons are innately tuned for rewarding and aversive taste stimuli, encode their predictors, and are linked to motor output. Neuron. 2005;45:587–597. doi: 10.1016/j.neuron.2004.12.055. [DOI] [PubMed] [Google Scholar]
- 90.Wan X, Peoples LL. Amphetamine exposure enhances accumbal responses to reward-predictive stimuli in a pavlovian conditioned approach task. J Neurosci. 2008;28:7501–7512. doi: 10.1523/JNEUROSCI.1071-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 91.Haber SN. The primate basal ganglia: Parallel and integrative networks. J Chem Neuroanat. 2003;26:317–330. doi: 10.1016/j.jchemneu.2003.10.003. [DOI] [PubMed] [Google Scholar]
- 92.Yin HH, Ostlund SB, Balleine BW. Reward-guided learning beyond dopamine in the nucleus accumbens: The integrative functions of cortico-basal ganglia networks. Eur J Neurosci. 2008;28:1437–1448. doi: 10.1111/j.1460-9568.2008.06422.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 93.Wyvell CL, Berridge KC. Incentive sensitization by previous amphetamine exposure: Increased cue-triggered “wanting” for sucrose reward. J Neurosci. 2001;21:7831–7840. doi: 10.1523/JNEUROSCI.21-19-07831.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 94.Berridge KC. Measuring hedonic impact in animals and infants: Microstructure of affective taste reactivity patterns. Neurosci Biobehav Rev. 2000;24:173–198. doi: 10.1016/s0149-7634(99)00072-x. [DOI] [PubMed] [Google Scholar]
- 95.Timberlake W, Wahl G, King D. Stimulus and response contingencies in the misbehavior of rats. J Exp Psychol Anim Behav Process. 1982;8:62–85. [PubMed] [Google Scholar]
- 96.Matthews TJ, Lerer BE. Behavior patterns in pigeons during autoshaping with an incremental conditioned-stimulus. Anim Learn Behav. 1987;15:69–75. [Google Scholar]
- 97.Corbit LH, Balleine BW. Instrumental and Pavlovian incentive processes have dissociable effects on components of a heterogeneous instrumental chain. J Exp Psychol Anim Behav Process. 2003;29:99–106. doi: 10.1037/0097-7403.29.2.99. [DOI] [PubMed] [Google Scholar]
- 98.Balleine BW, Garner C, Gonzalez F, Dickinson A. Motivational control of heterogeneous instrumental chains. J Exp Psychol Anim Behav Process. 1995;21:203–217. [Google Scholar]
- 99.Pine A, Shiner T, Seymour B, Dolan RJ. Dopamine, time, and impulsivity in humans. J Neurosci. 2010;30:8888–8896. doi: 10.1523/JNEUROSCI.6028-09.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 100.Paxinos G, Watson C. The Rat Brain in Stereotaxic Coordinates. San Diego: Academic; 1998. [DOI] [PubMed] [Google Scholar]