Skip to main content
The Journal of Neuroscience logoLink to The Journal of Neuroscience
. 2019 Mar 6;39(10):1842–1854. doi: 10.1523/JNEUROSCI.3477-17.2018

Prior Exposure to Salient Win-Paired Cues in a Rat Gambling Task Increases Sensitivity to Cocaine Self-Administration and Suppresses Dopamine Efflux in Nucleus Accumbens: Support for the Reward Deficiency Hypothesis of Addiction

Jacqueline-Marie N Ferland 1,, Tristan J Hynes 1, Celine D Hounjet 1, David Lindenbach 2, Cole Vonder Haar 1, Wendy K Adams 1, Anthony G Phillips 2, Catharine A Winstanley 1,
PMCID: PMC6407298  PMID: 30626700

Abstract

Rats trained to perform a version of the rat gambling task (rGT) in which salient audiovisual cues accompany reward delivery, similar to commercial gambling products, show greater preference for risky options. Given previous demonstrations that probabilistic reinforcement schedules can enhance psychostimulant-induced increases in accumbal DA and locomotor activity, we theorized that performing this cued task could perpetuate a proaddiction phenotype. Significantly more rats developed a preference for the risky options in the cued versus uncued rGT at baseline, and this bias was further exacerbated by cocaine self-administration, whereas the choice pattern of optimal decision-makers was unaffected. The addition of reward-paired cues therefore increased the proportion of rats exhibiting a maladaptive cognitive response to cocaine self-administration. Risky choice was not associated with responding for conditioned reinforcement or a marker of goal/sign-tracking, suggesting that reward-concurrent cues precipitate maladaptive choice via a unique mechanism unrelated to simple approach toward, or responding for, conditioned stimuli. Although “protected” from any resulting decision-making impairment, optimal decision-makers trained on the cued rGT nevertheless self-administered more cocaine than those trained on the uncued task. Collectively, these data suggest that repeated engagement with heavily cued probabilistic reward schedules can drive addiction vulnerability through multiple behavioral mechanisms. Rats trained on the cued rGT also exhibited blunted locomotor sensitization and lower basal accumbal DA levels, yet greater cocaine-induced increases in accumbal DA efflux. Gambling in the presence of salient cues may therefore result in an adaptive downregulation of the mesolimbic DA system, rendering individuals more sensitive to the deleterious effects of taking cocaine.

SIGNIFICANCE STATEMENT Impaired cost/benefit decision making, exemplified by preference for the risky, disadvantageous options on the Iowa Gambling Task, is associated with greater risk of relapse and treatment failure in substance use disorder. Understanding factors that enhance preference for risk may help elucidate the neurobiological mechanisms underlying maladaptive decision making in addiction, thereby improving treatment outcomes. Problem gambling is also highly comorbid with substance use disorder, and many commercial gambling products incorporate salient win-paired cues. Here we show that adding reward-concurrent cues to a rat analog of the IGT precipitates a hypodopaminergic state, characterized by blunted accumbal DA efflux and attenuated locomotor sensitization, which may contribute to the enhanced responsivity to uncertain rewards or the reinforcing effects of cocaine we observed.

Keywords: conditioned reinforcement, dopamine, locomotor sensitization, microdialysis, rat gambling task, self-administration

Introduction

Substance use disorder (SUD) is associated with maladaptive cost/benefit decision making, as exemplified by impaired performance of the Iowa Gambling Task (IGT) (Rogers et al., 1999; Bolla et al., 2005; Verdejo-García et al., 2007; Verdejo-García and Bechara, 2009), a probabilistic decision-making task. Risky choice on the IGT predicts relapse, suggesting that decision-making deficits directly contribute to addiction maintenance (Bechara, 2003; Stevens et al., 2013; Wang et al., 2013). Animals that prefer the “high-risk, high-reward” options on a rat gambling task (rGT) loosely based on the IGT are uniquely susceptible to decision-making deficits induced by cocaine self-administration (Ferland and Winstanley, 2017), suggesting that maladaptive choice may both foster and result from drug abuse. Understanding the neurocognitive processes that promote risky decision making may therefore provide significant insight into the etiology of addiction and how to treat it.

Individuals who are more sensitive to the behavioral impact of reward-paired cues may be most at risk for addiction development (Phillips and Fibiger, 1990; Robinson and Berridge, 1993, 2008; Tomie, 1996; Tomie et al., 2008; Flagel et al., 2009, 2010). Considerable supporting evidence has been obtained using basic behavioral procedures, such as sign-tracking, which measures approach to reward-paired cues (Brown and Jenkins, 1968), or conditioned reinforcement (CRf), which reflects willingness to work for conditioned stimuli (Williams, 1994). However, the impact of cue sensitivity on cost/benefit decision making itself is poorly understood.

Win-paired audiovisual cues are heavily used in electronic gambling machines (EGMs), arguably the most addictive form of gambling (Dowling et al., 2005; Livingstone and Adams, 2011). Problem gambling is highly comorbid with SUD (Chou and Afifi, 2011; Lorains et al., 2011), indicating a potential synergy between risky choice, cue sensitivity, and addiction vulnerability. Adding win-paired, audiovisual cues augments choice of the risky, disadvantageous options on the rGT while rendering choice sensitive to D3 receptor modulators (Barrus and Winstanley, 2016). D3-selective drugs generally have limited effect on behaviors reinforced by nutritive rewards but significantly influence responding for addictive substances, particularly under densely cued schedules of reinforcement (Le Foll et al., 2005; Beninger and Banasikowski, 2008). As such, the ability of reward-concurrent cues to promote risky decision making on the rGT may tap into the same neurocognitive mechanisms by which cues facilitate engagement in addictive behaviors.

The D3 receptor is densely expressed within the NAc (Sokoloff et al., 2006), a brain area strongly implicated in the development of addiction, the response to conditioned cues, and risky decision making (Sugam et al., 2012, 2014; Bissonette and Roesch, 2016). Specifically, manipulation of dopamine (DA) within the NAc can bias rats toward either risk-averse or risk-seeking decisions (Saddoris et al., 2015; Zalocusky et al., 2016). Therefore, the influence of reward-concurrent cues on the rGT may drive risky choice by altering dopaminergic signaling in this region. Repeated experience with probabilistic reward delivery can itself be sufficient to sensitize dopaminergic neurons in a similar way to drugs of abuse, increasing amphetamine-induced locomotion and DA release in the NAc, and also potentiating amphetamine self-administration (Singer et al., 2012; Zack et al., 2014; Mascia et al., 2019). We reasoned that preference for the risky options during the rGT may similarly sensitize the response to drugs, such as cocaine, and this may be potentiated in the cued variant of the task, such that cued rGT training encourages a proaddiction phenotype.

We therefore investigated whether prior experience with the cued versus uncued rGT altered (1) self-administration of cocaine, (2) the ability of cocaine self-administration to promote risky choice, (3) locomotor sensitization to cocaine, and (4) accumbal DA release. We hypothesized that experience with win-paired cues and the disadvantageous, uncertain options of the task would sensitize the motoric and reinforcing effects of cocaine, an effect mediated by increased NAc DA tone. Given that cue sensitivity has been associated with addiction risk, we also hypothesized that the reward-paired cues in the rGT may amplify risky choice through acting as conditioned reinforcers and thereby motivate operant responding. We therefore expected cue-biased risky choice to correlate positively with responding for CRf.

Materials and Methods

Subjects

Experiment 1 used male Long–Evans rats from three separate cohorts (n = 48; Charles River Laboratories) weighing 275–300 g at the start of the experiment. Experiment 2 used 32 male transgenic rats, bred in house against a Long–Evans background, that express cre recombinase (Cre) in neurons containing tyrosine hydroxylase (TH:Cre rats from; Long–Evans-Tg(TH-Cre)3.1Deis, RRRC #00659; Rat Resource and Research Centre; WT rats obtained from Charles River Laboratories). However, transgene (Tg) status (n = 16 per Tg, Tg+, Tg) was not used for any experimental manipulation. Rats bred in-house were weaned at postnatal day 21, housed in groups of 2 or 3 animals per cage, and had access to ad libitum standard rat chow until an average weight of ∼300 g was reached. Rats were then transferred to the main vivarium. In all experiments, animals were food-restricted to 85% of their free-feeding weight relative to a standard growth curve. Water was available ad libitum for both experiments. Rats were housed under a reverse 12 h light/dark cycle (lights off at 8:00 A.M.) in a temperature-controlled colony room maintained at 21°C. Testing and housing were in accordance with the Canadian Council of Animal Care, and all experimental protocols were approved by the Animal Care Committee of the University of British Columbia.

Apparatus

The rGT, CRf, and self-administration behavioral protocols were conducted in standard 5-hole operant chambers enclosed within ventilated sound-attenuating cabinets (Med Associates). To minimize task interference, cocaine self-administration and CRf were run in separate operant boxes kept in a different room within the facility. An array of five response holes was positioned on one wall of the operant chambers. The food magazine, positioned 2 cm above the rod-floor located opposite the aperture array, was attached to an external food dispenser equipped to deliver sucrose pellets (45 mg, Bioserv) to the magazine. Two retractable levers were situated on either side of the food magazine for use during self-administration or CRf. A light stimulus was situated at the back of each response hole as well as within the food magazine, and nose-poke responses into these apertures were detected by a horizontal infrared beam. Boxes were equipped with variable tone generators, and chambers could be illuminated by a house light. All manipulanda were controlled by software written in Med PC by CAW and CVH running on an IBM-compatible computer.

Self-administration boxes were fitted with an infusion apparatus consisting of a variable rate infusion pump (Med Associates), a 10 ml plastic syringe used to administer drug or vehicle, PE/PVC tubing (Instech Solomon) connected to a 22 gauge single-channel plastic swivel (Instech Solomon), and a 40 cm spring-covered tubing connector assembly (Plastics One). Locomotor testing was completed in 40 cm2 Plexiglas boxes fitted with video cameras, and ambulatory activity was counted using behavioral tracking software (Ethovision 3.1, Noldus).

Experimental timeline

An overview of experimental designs is provided in Figure 1A, B. For Experiment 1, Cohorts 1 and 2 were evenly divided into experimental groups matched for cued rGT performance and trained to self-administer cocaine or saline. Rats from Cohort 3 trained on rGT or cued rGT underwent cocaine self-administration only. Animals underwent consecutive daily rGT and cocaine self-administration sessions, such that the progressive effects of voluntary cocaine ingestion on decision-making behavior could be tracked. rGT sessions were run in the mornings, before self-administration in the afternoons, to ensure rGT performance was not acutely impacted by cocaine dosing. In Experiment 2, locomotor testing was first performed before the start of cued/uncued rGT training, and then repeated immediately after behavioral stability was reached on the decision-making task, followed by microdialysis.

Figure 1.

Figure 1.

Timelines for Experiments 1 (A) and 2 (B).

Behavioral testing

rGT.

Animals were first habituated to the operant chambers over two daily 30 min sessions followed by nose-poke and forced-choice training as described in a previous report (Zeeb et al., 2009). Following training, animals proceeded to the free-choice version of the program. A task schematic is provided in Figure 2. During each 30 min behavioral session, the animal initiated each trial by making a nose-poke response at the illuminated food magazine. This response turned off the magazine light and triggered the start of a 5 s intertrial interval (ITI), after which nose-poke apertures 1, 2, 4, and 5 were illuminated. A nose-poke response at one of the illuminated holes resulted in either delivery of a sugar reward or a variable duration punishment time-out period during which the light in the chosen aperture flashed at 0.5 Hz. Each hole was associated with a different amount of reward (1–4 sugar pellets, P1-P4), length of penalty time-out (5–40 s), and probability of winning a reward over punishment (0.9–0.4). The cued rGT was identical to the uncued task, except that reward delivery was paired with compound light/tone cues that scaled in complexity with the size of the win (Barrus and Winstanley, 2016; Adams et al., 2017). The reinforcement contingencies were designed such that consistent choice of options that yielded larger per trial gains (P3, P4) ultimately resulted in fewer sugar pellets over the course of a session due to the longer and more frequent time-out penalties incurred. Hence, the optimal strategy was to favor the options paired with smaller incremental gains, particularly P2. The locations of choice options were counterbalanced between animals to control for potential side bias (Version A: left to right, P1, P4, P2, P3; Version B: P4, P1, P3, P2).

Figure 2.

Figure 2.

Schematic diagram for the rGTs. The task began with illumination of the tray light. A nose-poke response in the food tray extinguished the tray light and initiated a new trial. After an ITI of 5 s, four stimulus lights were turned on in holes 1, 2, 4, and 5, and the animal was required to respond in one of these holes within 10 s. This response was then rewarded or punished depending on the reinforcement schedule for that option (indicated by the probability of a win or loss in brackets for each option, hypothetical maximum number of pellets presented underneath each option). If the animal was rewarded, the stimulus lights were extinguished and the animal received the corresponding number of pellets in the now-illuminated food tray. Cued rGT animals also received concurrent win-paired audiovisual stimuli. The six tones used were as follows: 4, 8, 10, 12, 15, and 20 kHz. Using the letters A–F to represent a different tone, the patterns for P3 were as follows: CDEDCDEDCD; CECEDEDECE. Similarly, the patterns for P4 were as follows: ABCDEFEDCB; BCDCDEDEFE; CEDFCEBDAC; FEDCBAFEDC. With respect to the visual cues, the first light to flash was the hole associated with that response, shown here for version A. For P3 and P4, the visual stimuli then become more varied in the last second of the cue, using sequences of multiple lights that change in sync with the tones. Lights could illuminate together, as indicated by numbers in brackets, or independently. The patterns for P3 were as follows: 5434543454; (5 + 3)4 (5 + 3)4(5 + 3)4(5 + 3)4(5 + 3)4. The patterns for P4 were as follows: 1234543212; (2 + 4)(1 + 3 + 5)(2 + 4)(1 + 3 + 5)(2 + 4)(1 + 3 + 5)(2 + 4)(1 + 3 + 5)(2 + 4)(1 + 3 + 5); 1324354231; (2 + 4)(1 + 5)(2 + 4)3(2 + 4)(1 + 5)(2 + 4)3(2 + 4). Which tone/light pattern would play was determined randomly, but the same pattern was never presented on sequential P4 win trials. A response at the food tray then started a new trial. If the animal was punished, the stimulus light in the corresponding hole flashed at a frequency of 0.5 Hz for the duration of the punishing timeout and all other lights were extinguished. At the end of the punishment period, the tray light was turned on and the animal could initiate a new trial. Failure to respond at the illuminated holes resulted in an omission, whereas a response during the ITI was classified as a premature response and punished by a 5 s timeout during which the house light was turned on (schematic based upon Zeeb et al., 2009).

Nose-poke responses made at the array during the ITI were recorded as premature responses, a measure of impulsive action similar to that obtained from the five-choice serial reaction time task (Robbins, 2002). Such responses were punished by a 5 s time-out penalty, signaled by illumination of the house light, after which the tray light was turned on and the animal could initiate a new trial. If rats failed to make a response at the illuminated apertures within 10 s, the trial would be registered as an omission, and the food tray again illuminated indicating that a new trial could be initiated. Animals received five or six daily sessions per week until statistically stable patterns of behavior across all measures were observed over three sessions. This took ∼35 sessions to achieve.

Cocaine self-administration

Animals were trained to lever press for cocaine hydrochloride (0.75 mg/kg/infusion, dose calculated as the salt and dissolved in sterile 0.9% saline; Medisca Pharmaceuticals) or saline vehicle over 10 daily 3 h sessions (Calu et al., 2007; Ferland and Winstanley, 2017). At the start of each self-administration session, two free infusions of solution were given to fill catheters and indicate that drug was available. Rats were presented with two levers, one active and one inactive, with an illuminated cue-light situated over the active lever. Using a fixed ratio (FR1) schedule, responses on the active lever would result in a single 4.5 s infusion in concert with the cue light flashing (50 Hz) and a 20 kHz tone. Following the infusion, animals underwent a 40 s time-out during which drug would not be delivered, the cue light and tone were extinguished, but levers would remain extended and responses monitored. Responses on the active lever during infusions and timeouts were recorded and interpreted as preliminary cocaine “seeking” behaviors. Inactive lever presses, while monitored, had no programmed consequences. Animals were limited to 30 infusions per hour to prevent overdose.

Locomotor activity assay

As in previous studies (Singer et al., 2012; Zack et al., 2014), animals were allowed a 1 h habituation period to the locomotor chamber, after which a 1 ml/kg intraperitoneal injection of saline was administered. One hour later, animals then received a 10 mg/kg intraperitoneal injection of cocaine. Recording continued for a further hour. Total distance traveled (centimeters) was calculated and parsed into 5 min bins for analyses.

CRf

CRf testing was completed after the final locomotor sessions had concluded. In 10 daily 1 h sessions, subjects were first trained to associate a 5 s conditioned stimulus (CS+), which consisted of illumination of a circular disc cue light, paired with delivery of a food pellet into the central magazine. Animals had equal experience with an identical circular cue light control stimulus (CS), the presentation of which had no programmed consequences. The CS+ differed from the CS only in its spatial location (i.e., whether it appeared on the left or right of the food magazine), and this was counterbalanced across subjects. Stimulus presentation occurred on a random interval 60 s schedule in a pseudorandom sequence such that the same stimulus was not presented on more than two consecutive trials. The offset of the CS+ was contingent with the delivery of a food pellet in the central magazine, whereas the CS was never associated with food reward. The latency to nose-poke at the food tray after illumination of the CS+ or CS was recorded to provide an index of goal-tracking: that is, the tendency of animals to approach the location of reward (“goal”) following presentation of the CS+ (Flagel et al., 2007). Whether the CS+ had become sufficiently reinforcing as to support the acquisition of a novel operant response was then assessed on day 11. At the start of a single 1 h session, the two retractable levers were inserted into the operant chamber, each located on either side of the food tray, underneath the cue lights used as conditioned stimuli. Responding on each lever resulted in illumination of the cue light located immediately above it (the CS+ or CS) on an FR1 schedule. No sugar pellets were delivered at any time; lever press responses were reinforced solely by CS presentation. The number of responses made for the CS+ and CS were recorded and calculated as a ratio of the total number of responses for the CS+/total number of lever presses. The total number of nose-pokes made at the food tray during presentation of the conditioned stimuli, as well as the latency to the first nose-poke after CS illumination, were also recorded.

Surgeries

For all surgeries, rats were anesthetized with isoflurane (5% induction, 2%–3% maintenance), given 5 mg/kg ketaprofen as analgesic, and bupivacaine was administered at the surgical site as a local anesthetic.

Experiment 1: jugular vein catheterization.

Animals were aseptically implanted with catheters constructed of Silastic silicone tubing (Dow Corning via VWR International) attached to back-mounted cannulae (Plastics One) into the right jugular vein. Catheters were passed through the skin subcutaneously and positioned, such that the cannulae exited between the shoulder blades. To prevent blockages, catheters were flushed daily with 0.1 ml of 50% heparinized saline. Behavioral testing on the cued and uncued rGT resumed after 5–7 d of recovery.

Experiment 2: cannulae implantation for microdialysis.

Twenty-eight animals underwent aseptic stereotaxic surgery. As per previous work (Vacca et al., 2007), animals were bilaterally implanted with 15 mm 19 gauge nitric acid passivated stainless-steel guide cannulae above the NAc (from bregma 1.7 mm anterior and ±1.1 mm lateral; from dura −1.0 mm ventral) (Paxinos and Watson, 1998), secured via skull screws and dental cement. Stainless-steel obdurators (15 mm) maintained patency of the guides until probe implantation. Remaining animals (n = 4, Tg+) were unexpectedly required in the breeding colony and were removed from the experiment.

In vivo microdialysis

Microdialysis took place in 40 cm3 Plexiglas chambers. Microdialysis probes were constructed from Filtral 12 AN69HF semipermeable hollow fibers (2 mm long, 340 μm OD, 65 kDa molecular weight cutoff) and silica inlet-outlet lines (75/150 μm ID/OD). The day before microdialysis experiments, probes were flushed with aCSF (10.0 mm sodium phosphate buffer with 147.0 mm NaCl, 3.0 mm KCl, 1.0 mm MgCl2, and 1.2 mm CaCl2, pH 7.4) and inserted into the guide cannulae (dialysis membrane spanned −4.8 to −6.8 mm ventral from the guide). Rats remained in the microdialysis chamber overnight (14–16 h) with freely available food and water, and the probes were perfused continuously with aCSF at a rate of 1.1 μl/min. The following morning, DA levels were determined in dialysates collected at 10 min intervals. Once a stable baseline was established (<10% fluctuation over four consecutive samples, ∼8 baseline samples taken per rat), animals were administered a saline injection (1 ml/kg i.p.). One hour later, animals received an injection of cocaine (10 mg/kg i.p.). Dialysates were collected for a further 1 h period. To minimize sensitization to cocaine, 2–4 d elapsed before this process was then repeated for the opposite hemisphere. The first hemisphere sampled (left/right) was counterbalanced across animals.

HPLC

Samples were analyzed via HPLC with electrochemical detection. HPLC systems were composed of the following: an ESA 582 pump (Bedford), a pulse damper (Scientific Systems), an inert manual injector (Rheodyne), a Super ODS TSK column (Tosoh Bioscience), and an Intro Electrochemical detector (Antec Leyden). The mobile phase [70 mm sodium acetate buffer, 40 mg/L EDTA, and 6 mg/L SDS (adjustable), pH 4.0, 10% methanol] flowed through the system at 0.15 ml/min. EZChrome Elite software (Scientific Software) was used to acquire and analyze chromatographic data. After the experiment, animals were killed by live decapitation, brains were sectioned at −20°C on a cryostat, and sections were stained with cresyl violet. Probe placements were verified with reference to a stereotaxic atlas (Paxinos and Watson, 1998).

Experimental design and statistical analysis

All statistical analyses were completed using SPSS Statistics 24.0 software (IBM). As per previous reports, the following rGT variables were analyzed at baseline (Experiments 1 and 2) and during self-administration (Experiment 1 only): percentage choice of each option (number of times option chosen/total number of choices × 100), score (calculated as choice of [(P1 + P2) − (P3 + P4)]), percentage of premature responses (number of premature responses/total number of trials initiated × 100), sum of omitted responses, sum of trials completed, and average latencies to choose an option and collect reward. Variables that were expressed as a percentage were subjected to an arcsine transformation to limit the effect of an artificially imposed ceiling (i.e., 100%) (McDonald, 2009). A statistically stable baseline was determined by a repeated-measures ANOVA across data from three consecutive sessions, in which both the session factor and session × choice interaction were not significant. To verify that Tg status did not alter any behavioral measure, stand-alone analyses with Tg (two levels: Tg+, Tg) as a between-subjects factor were conducted for all rGT and cued rGT variables.

For the self-administration phase, we analyzed infusions achieved, active lever responses, and inactive lever responses via mixed factorial repeated-measures ANOVA. We entered session as the within-subjects factor and, unless noted otherwise, set the between-subjects factors as task and risk-preference. Significant effects from the omnibus tests were followed up with a combination of single factorial repeated-measures ANOVAs, Fisher's Least Significant Difference (LSD) post hoc test, and/or independent samples t tests. The total responses on each lever during CRf sessions in Experiment 2 were subject to repeated-measures ANOVA (two levels: active/ CS+, inactive/ CS). CRf ratio data were also subjected to a univariate ANOVA. For the locomotor activity assay, the total distance traveled (centimeters) was analyzed by repeated-measures ANOVA with treatment (two levels: saline, cocaine) and time bin (6 levels: 6 × 5 min bins) as within-subjects factors. Only the first 30 min of behavior were analyzed to capture locomotor counts while cocaine was maximally effective onboard (Ciccarone, 2011). Averaged raw DA dialysate concentrations collected from each hemisphere were subjected to a repeated-measures ANOVA (treatment: 3 levels: baseline, saline, cocaine; bin: 6 levels: 6 ×10 min bins). Four rats had a substituted baseline value from an earlier measurement for one hemisphere due to unusual deviation from an otherwise stable baseline (i.e., 1 of 4 baseline values had ∼> ±10% variability, n = 2/task). Measurement of DA following saline and cocaine administration was also transformed into percentage change from baseline, and again analyzed with a repeated-measures ANOVA (5 levels, 5 × 10 min bins after injection).

In the first experiment, we excluded 16 rats from the experimental cohort due to occlusion of catheters resulting in a cued group of n = 37 and an uncued group of n = 11. In the cocaine group, animals with a mean positive baseline score were designated as “optimal decision-makers” (n = 20), whereas rats with negative scores were classified as “risk-preferring” (n = 16). The control (i.e., saline self-administering) cohort consisted of 12 rats (risk-preferring: n = 6; optimal: n = 6). For Experiment 2, all variables from the rGT, locomotor data, and responding for CRf were analyzed with rGT variant (uncued/cued, n = 16 per task) and risk-preference (optimal, risk-preferring, n = 16 per group; uncued optimal, n = 12; uncued risk-preferring, n = 4; cued optimal, n = 12; risk-preferring, n = 12) as between-subjects factors. For both experiments, the frequency of risk-preferring rats trained on each task was evaluated using a χ2 test followed by a Fisher Exact test (to correct the p value due to low number of animals in a specific group; i.e., n = 4 optimal rats in the cued rGT in Experiment 2). Six animals were excluded from microdialysis analyses due to inaccurate probe placement, and 1 animal's data were excluded due to illness. Analysis of DA efflux included rGT variant as a between-subjects factor (n = 11 per cued/uncued task) but not risk-preference due to lack of individual differences seen in locomotor data (see Results).

For rGT, cued rGT, and CRf behavior, data were excluded from any animal that performed <20 trials (Winstanley et al., 2011). For all analyses, if sphericity was violated as determined by Mauchley's test, a Huynh–Feldt correction was applied, and corrected p values' degrees of freedom rounded to the next integer. Results were deemed to be significant if p values were less than or equal to an α of 0.05. Analyses yielding a p value between 0.05 and 0.08 were reported as a trend.

Results

Experiment 1: effects of cocaine self-administration on performance of the cued versus uncued rGT

Of the 16 rats trained on the uncued rGT, 100% developed an optimal decision-making strategy. In contrast, of the 48 rats trained on the cued variant of the task, 22 were optimal decision-makers and 26 were risk-preferring (χ2 test: χ2(1) = 14.60, p < 0.001). As per results from previous studies (Barrus and Winstanley, 2016; Adams et al., 2017), animals trained on the cued rGT showed greater preference for the risky options (Table 1; cued vs uncued: choice × task: F(3,119) = 9.626, p < 0.0001; individual options: P1: t(46) = 2.270, p = 0.028; P2: t(46) = 3.475, p = 0.001; P3: t(46) = −3.460, p = 0.001; t(46) = −2.134, p = 0.038; score: task: F(1,46) = 21.422, p < 0.0001). Similarly, as expected, risk-preferring rats showed a statistically significant increase in choice of the risky options (choice × risk preference: F(3,124) = 33.657, p < 0.0001; optimal versus risk preferring: P1: t(46) = 2.186, p = 0.035; P2: t(46) = 8.632, p < 0.0001; P3: t(46) = −5.914, p < 0.0001; P4: t(46) = −3.963, p = 0.0004; score: F(1,46) = 171.198, p < 0.0001)

Table 1.

Cued rGT performance prior to self-administrationa

Behavioral measure Cued saline optimal Cued saline risk-preferring Cued cocaine optimal Cued cocaine risk-preferring Uncued cocaine optimal
P1 14.85 ± 11.72 4.45 ± 2.16 7.71 ± 6.31 3.85 ± 1.41 15.26 ± 4.33
P2 68.49 ± 15.07 23.13 ± 5.28 63.50 ± 8.42 12.76 ± 2.87 73.90 ± 6.60
P3 13.40 ± 9.64 42.03 ± 14.06 12.09 ± 4.77 50.41 ± 7.30 2.72 ± 0.95
P4 3.26 ± 2.40 30.39 ± 9.20 16.70 ± 6.16 32.98 ± 7.03 8.12 ± 3.78
Premature responses 11.09 ± 2.68 30.04 ± 6.42 30.55 ± 5.24 28.64 ± 3.15 14.64 ± 1.73
Trials completed 129.34 ± 11.49 61.37 ± 6.54 89.89 ± 9.96 55.55 ± 2.62 99.93 ± 6.39
Omitted responses 0.22 ± 0.22 0.56 ± 0.39 0.22 ± 0.12 0.73 ± 0.20 2.09 ± 0.49
Choice latency 0.93 ± 0.34 0.95 ± 0.16 1.55 ± 0.30 1.01 ± 0.11 1.39 ± 0.19
Collect latency 0.80 ± 0.11 0.44 ± 0.08 0.94 ± 0.15 0.71 ± 0.08 0.60 ± 0.05

aData are mean ± SEM.

Risk-preferring rats also completed significantly fewer trials overall (risk preference: F(1,45) = 38.943, p < 0.0001), and cued animals omitted significantly fewer trials, although no animals omitted more than two trials on average per session (task: F(1,45) = 22.868, p < 0.0001). Risk-preferring rats and animals trained on the cued task had significantly faster collection latencies (risk preference: F(1,45) = 5.640, p = 0.022; task: F(1,45) = 5.228, p = 0.027), but exhibited no differences in choice latencies (F values < 1.961, p > 0.168). Before self-administration, rats assigned to the cocaine group did not differ from saline rats on any behavioral measure (F values < 2.748, p > 0.104).

Self-administration training

Over 10 sessions of training, animals self-administering cocaine took increasing numbers of cocaine infusions (session: F(9,234) = 5.900, p < 0.001). We then collapsed across groups and followed up with a linear contrast to show that all rats took more cocaine with each progressive self-administration session (F(1,27) = 29.641, p < 0.001; Fig. 3A). In contrast, animals responded steadily less on the inactive lever over time (session: F(9,234) = 3.057, p = 0.021; Fig. 3B; linear contrast: F(1,26) = 12.090, p = 0.002). Critically, we noted a between-subjects effect of task (F(1,27) = 4.410, p = 0.045) and of risk preference (F(1,27) = 11.875, p = 0.002). The task × risk preference interaction term could not be computed, however, as no risk-preferring rats were present in the uncued group. We therefore parsed the animals into three groups for follow-up analyses [uncued optimal (n = 11), cued optimal (n = 9), and cued risk-preferring (n = 16)]. We then used Fisher's LSD to show that optimal decision-makers on the cued task took more cocaine than both their risk-preferring counterparts (Fig. 3; MD = 19.664, p = 0.002), and optimal decision-makers performing the uncued rGT (MD = 12.997, p = 0.045), neither of which differed from each other (MD = 6.668, p = 0.215). A similar pattern of effects was observed when the number of responses on the active lever was analyzed (group: F(2,26) = 4.175, p = 0.027; Fig. 3B), in that optimal decision-makers trained on the cued rGT made significantly more responses than risk-preferring rats (MD = 37.967, p = 0.009), or optimal decision-makers trained on the uncued task (MD = 32.850, p = 0.032) whereas these latter two groups did not differ from each other (MD = 5.117, p = 0.666).

Figure 3.

Figure 3.

Cocaine self-administration. A–C, Number of infusions and lever presses over 10 d of self-administration. Cued optimal animals achieved more infusions and made more response for cocaine than other groups. All rats decreased responding on the inactive lever over the 10 sessions. D–F, Total number of infusions and lever presses for intravenous saline. Responding for saline decreased over time. Data are mean ± SEM.

Control animals self-administering saline took steadily fewer infusions over time (session: F(9,81) = 6.383, p = 0.032; linear contrast: F(1,9) = 25.114, p = 0.001). Similarly, animals made fewer responses on the active lever with continued training (session: F(9,81) = 2.648, p = 0.037; linear contrast: F(1,9) = 16.795, p = 0.003). Responding on the inactive lever did not change over the duration of self-administration (session: F(9,81) = 1.158, p = 0.333).

rGT performance

rGT performance was significantly affected by cocaine versus saline self-administration (drug: F(1,34) = 4.571, p = 0.040). The score of saline self-administering animals did not change, regardless of basal levels of risky choice (session × risk preference: F(9,81) = 1.376, p = 0.213). However, the score variable decreased significantly and selectively in risk-preferring rats self-administering cocaine, across both the cued and uncued task (Fig. 4: session × risk preference: F(9,225) = 2.285, p = 0.037; session: risk-preferring only: F(9,117) = 2.666, p = 0.012; optimal decision-makers only: F(9,117) = 1.188, p = 0.325). Importantly, this effect was seen in all cohorts tested (session × cohort: F(10,324) = 1.411, p = 0.181; session × drug × cohort: F(7,306) = 1.636, p = 0.129). Premature responding also declined during the self-administration epoch in all rats, an effect that tended to be more pronounced in rats trained on the uncued task (Table 2; session × task: F(9,225) = 2.693, p = 0.006; session: cued only: F(9,198) = 4.642, p < 0.001; uncued only: F(9,45) = 2.540, p = 0.019). Animals performing the uncued task also completed fewer trials during this phase of the experiment (Table 2; session × task: F(9,252) = 6.775, p < 0.001; session: cued only: F(9,189) = 1.055, p = 0.384; uncued: F(9,72) = 6.664, p < 0.001). We observed no differences in omissions or latency to collect reward (i.e., all terms of both repeated-measures ANOVAs were nonsignificant), although choice latency increased across all rats (session: F(7,190) = 6.690, p < 0.001; session × task: F(7,190) = 0.74, p = 0.641; session × risk-preference: F(7,190) = 1.764, p = 0.096). In contrast, we detected no change in score, premature responding, trials completed, omissions, collect latency, or choice latency (i.e., all terms of omnibus ANOVAs were nonsignificant) in animals self-administering saline (Table 2).

Figure 4.

Figure 4.

Decision making during cocaine self-administration as measured by score (a positive score reflects advantageous decision making). A, The score of optimal animals did not change, regardless of whether cocaine or saline was self-administered. B, In contrast, risk-preferring animals exhibited significant decreases in score, reflecting an increase in disadvantageous choice after cocaine exposure. This stabilized from Session 5 onwards (t tests vs Session 1: Session 2: t(15) = 0.498, p = 0.625; Session 3: t(15) = 3.074, p = 0.008; Session 4: t(15) = 0.910, p = 0.377; Session 5: t(15) = 2.604, p = 0.020; Session 6: t(15) = 2.399, p = 0.030; Session 7: t(15) = 2.720, p = 0.016; Session 8: t(13) = 2.314, p = 0.038; Session 9: t(13) = 3.569, p = 0.003; Session 10: t(13) = 2.897, p = 0.012). C–G, Changes in individual rat score from day 1 to day 10 of self-administration in optimal and risk-preferring subgroups. Only cocaine-exposed, risk-preferring animals exhibit decreases in score. A, B, Data are mean ± SEM. C–G, Data represent individual rat scores. The asterisks reflect a significant effect of session p < 0.05.

Table 2.

Cued rGT performance during 10 d self-administration period: decision making by individual choice option at day 1 and 10 of cocaine self-administration and additional task measures at day 1 and 10 of cocaine self-administrationa

Behavioral measure Time point Cued saline optimal Cued saline risk-preferring Cued cocaine optimal Cued cocaine risk-preferring Uncued cocaine optimal
P1 Day 1 12.21 ± 9.82 4.28 ± 2.41 10.67 ± 8.93 3.98 ± 1.54 15.85 ± 4.07
P2 76.66 ± 11.58 24.89 ± 9.12 61.07 ± 9.71 15.78 ± 3.13b 73.08 ± 7.42
P3 7.99 ± 1.66 35.16 ± 11.82 11.02 ± 4.08 45.35 ± 6.15 2.37 ± 1.40
P4 3.15 ± 2.48 35.67 ± 8.89 17.23 ± 7.43 34.89 ± 6.97 8.70 ± 4.69
P1 Day 10 8.54 ± 7.50 0.89 ± 0.81 2.74 ± 1.87 3.45 ± 1.42 9.33 ± 3.94
P2 81.88 ± 9.30 24.36 ± 9.71 70.66 ± 8.62 5.66 ± 2.36b 80.71 ± 5.58
P3 7.11 ± 0.79 50.18 ± 12.96 13.69 ± 8.44 62.39 ± 7.72 2.86 ± 1.46
P4 2.47 ± 2.18 24.57 ± 7.37 13.42 ± 7.95 28.50 ± 7.18 7.09 ± 4.22
Premature responses Day 1 6.04 ± 1.48 22.82 ± 7.47 24.04 ± 5.50b 22.42 ± 4.15b 18.89 ± 2.29b
Day 10 10.18 ± 3.69 20.85 ± 5.33 5.94 ± 1.73b 16.46 ± 3.67b 14.30 ± 2.80b
Trials completed Day 1 132.20 ± 8.75 67.74 ± 10.25 93.80 ± 10.38 57.34 ± 4.92 90.45 ± 4.76b
Day 10 128.80 ± 13.07 69.60 ± 7.06 95.26 ± 13.58 53.09 ± 2.78 52.46 ± 7.21b
Omitted responses Day 1 0.40 ± 0.24 0.86 ± 0.46 0.44 ± 0.24 1.56 ± 0.72 2.82 ± 0.70
Day 10 0.20 ± 0.20 2.57 ± 1.95 2.00 ± 1.02 2.29 ± 0.75 2.22 ± 0.70
Choice latency Day 1 1.25 ± 0.30 1.93 ± 0.30 1.05 ± 0.22b 1.45 ± 0.21b 1.43 ± 0.23b
Day 10 1.74 ± 0.45 1.97 ± 0.33 1.80 ± 0.32b 1.97 ± 0.29b 2.52 ± 0.34b
Collect latency Day 1 1.04 ± 0.06 0.81 ± 0.21 0.69 ± 0.07 0.78 ± 0.11 1.18 ± 0.17
Day 10 1.03 ± 0.05 0.60 ± 0.08 0.69 ± 0.06 0.62 ± 0.06 1.25 ± 0.21

aData are mean ± SEM. We detected a significant main effect of session in P2 for cocaine risk-preferring rats (F(9,117) = 3.693, p < 0.001), which we followed up with a linear contrast (F(1,13) = 12.010, p = 0.004) to show that responding for the safe option P2 decreased in a linear fashion as cocaine self-administration progressed. No other options displayed significant within-subjects changes over session (i.e., all terms of individual repeated-measures ANOVAs were nonsignificant.

bSignificant effect across sessions as determined by post hoc tests.

Rats weighed between 375 and 450 g during this phase of the experiment. However, weight did not predict any change in score over self-administration training (r = −0.082, p = 0.709). Although rats tended to lose weight after self-administering cocaine, the degree of weight loss also did not correlate with change in score (r = 0.295, p = 0.172).

Experiment 2: locomotor activity, CRf, and DA efflux within the NAc of rats trained to perform the cued or uncued rGT

Similar to Experiment 1, the majority of rats trained on the uncued rGT were optimal decision-makers (12 of 16), whereas the opposite pattern was observed for the cued task (4 of 16; χ2 test: χ2(1) = 8.00, p = 0.012). As in Experiment 1, rats trained on the cued rGT task showed higher levels of risky choice that could be largely attributed to a greater preference for P3 (Table 3; score: task: F(1,28) = 8.928, p = 0.006; choice: option × task: F(3,72) = 2.619, p = 0.066). Rats trained on the cued rGT were also faster to collect reward (task: F(1,28) = 7.643, p = 0.010), made more premature responses (task: F(1,28) = 5.649, p = 0.025), and tended to complete fewer trials (task: F(1,28) = 4.114, p = 0.052). Performance of either task variant was indistinguishable between TH:Cre Tg+ and Tg rats (F < 1.912, p > 0.178); therefore, Tg status was excluded as a between-subjects measure for all remaining analyses.

Table 3.

rGT and cued rGT performance at baseline for Experiment 2a

Behavioral measure rGT Cued rGT Optimal Risk-preferring
P1 7.61 ± 2.10 9.76 ± 2.91 10.42 ± 3.29 6.95 ± 1.39
P2 62.72 ± 9.34 23.91 ± 5.04 66.33 ± 9.69 20.30 ± 3.69
P3 13.81 ± 3.29 39.09 ± 7.43 8.10 ± 2.22 44.79 ± 7.45
P4 15.86 ± 3.22 27.24 ± 5.48 15.14 ± 3.03 27.95 ± 5.57
Score 40.67 ± 9.30 −32.66 ± 9.15 53.50 ± 8.61 −45.49 ± 7.78
Premature Responses 12.09 ± 1.99 22.40 ± 3.31 14.85 ± 2.45 19.64 ± 3.10
Trials completed 89.80 ± 12.39 64.44 ± 8.69 93.99 ± 12.76 60.25 ± 7.89
Omitted responses 3.21 ± 0.66 1.21 ± 0.34 2.48 ± 0.52 1.94 ± 0.56
Choice latency 1.64 ± 0.23 1.29 ± 0.19 1.44 ± 0.21 1.49 ± 0.22
Collect latency 0.94 ± 0.13 0.61 ± 0.09 0.90 ± 0.12 0.65 ± 0.09

aData are mean ± SEM.

Locomotor testing

Comparing locomotor activity before versus after rGT training, we observed that uncued rGT animals increased locomotor activity after cocaine, whereas this effect was absent in rats trained on the cued rGT (Fig. 5A: day × bin × task: F(5,140) = 2.794, p = 0.019; rGT: day: F(1,14) = 4.781, p = 0.046; cued rGT: day: F(1,14) = 0.370, p = 0.553). Neither group showed changes in locomotor activity after saline (Fig. 5D–F; day: F values < 0.529, p > 0.479).

Figure 5.

Figure 5.

Locomotor activity pretask and post-task training after acute cocaine or saline administration. A, D, Before task training, animals exhibited robust ambulation after cocaine exposure compared with saline. However, after training, only rGT-experienced rats (A,B) showed significantly potentiated cocaine-induced locomotor activity. A, C, In contrast, cued rGT-trained animals maintained comparable locomotor activity as before training, indicating a blunted response to cocaine. B, C, Risk-preference did not mitigate the response to cocaine after training, although risk-preferring animals showed blunted cocaine-induced ambulation before task experience. D–F, Saline-induced locomotion remained relatively stable before and after task experience, indicating that the potentiated response seen in rGT-trained animals was unique to cocaine. Data are mean ± SEM.

The omnibus ANOVA also revealed subtle differences in risk-preference groups' ambulatory activity before and after training, most prominently in cued rGT rats (Fig. 5C; day × bin × task × risk-preference: F(5,140) = 2.270, p = 0.051; cued rGT only: day × bin × risk-preference: F(5,70) = 2.992, p = 0.017; treatment × bin × risk-preference: F(5,70) = 2.517, p = 0.04). Specifically, at baseline, cued rGT risk-preferring animals were less active than optimal decision-makers after cocaine (Fig. 5C; bin × risk-preference: F(5,70) = 2.21, p = 0.06; day × bin: F(5,70) = 2.682, p = 0.028) and saline (Fig. 5F; day × bin × risk-preference: F(5,70) = 2.609, p = 0.03). As such, animals that went on to become risk-preferring showed some signs of reduced locomotor activity in response to an injection. However, risk-preference did not account for the sensitization effect seen in these rats (Fig. 5B; day × risk-preference, day × bin × risk-preference, risk-preference F values < 1.094, p > 0.372). For all analyses, no other significant interactions were present by day, treatment, task, or risk-preference (F values < 2.856, p > 0.102).

Responding for CRf

There were no significant differences by task or risk-preference group during the conditioning period for number of nose-pokes after CS+ versus CS presentation, latency to nose-poke, or collection latencies (data not shown, F values < 1.642, p > 0.211). During the operant test, animals responded more on the CS+ associated lever, suggesting sensitivity to CRf (Fig. 6A; lever: F(1,21) = 9.409, p = 0.006). Although optimal decision-makers appear to show greater responding for the CS+, there were no significant differences by task or by risk-preference in the number of lever presses made, the ratio of responding for the CS+ vs CS, or the latency to nose-poke at the food tray after illumination of the CS+ or CS, a proxy of “goal-tracking” behavior (Fig. 6B,C; latency data not shown; all F values < 2.111, p > 0.161). These data suggest neither risk-preference nor task experience influenced responding for CRf.

Figure 6.

Figure 6.

Responding during CRf test. A, All animals exhibited a significant preference for the CS+ lever compared with the CS (p < 0.05). However, there were no significant differences by task experience or risk-preference, either for lever preference (B) or ratio for CS+ (C). These data suggest that, although rats acquired the CRf response, experience with salient reward-paired cues or individual risk-preference did not mitigate operant responding for cues. Data are presented by task and risk-preference subgroups (collapsed across rGT and cued rGT). Data are mean ± SEM.

DA microdialysis

Histological analyses excluded 7 animals for incorrect placement of microdialysis probes (i.e., exclusively in the shell or the core; see Fig. 7A for placements, n = 22 included). As expected, cocaine significantly increased DA levels within the NAc compared with baseline and saline (Fig. 7B,C; treatment: F(1,24) = 76.839, p < 0.001; bin: F(1,38) = 10.547, p < 0.001; treatment × bin: F(2,36) = 6.909, p = 0.004). However, rats trained on the cued rGT exhibited significantly less DA efflux compared with uncued-rGT rats (Fig. 7B; task: F(1,20) = 4.598, p = 0.044; within-subjects interactions: F values < 1.064, p > 0.350). Given the group differences in basal DA levels, we also analyzed cocaine-induced increases in DA efflux as percentage change from baseline. Although basal levels of DA were relatively lower in cued rGT animals, these rats tended to show a greater proportional DA efflux compared with uncued-rGT animals, most visibly after cocaine (Fig. 7C; task: F(1,19) = 3.513, p = 0.076; treatment: F(1,19) = 48.288, p < 0.001; bin: F(2,33) = 8.909, p = 0.001; treatment × bin: F(2,34) = 6.916, p = 0.004; all interactions by task: F < 2.947, p > 0.102). Therefore, although rats trained on the cued rGT appeared to have lower levels of basal DA in the NAc, they may be more sensitive to the ability of cocaine to increase DA efflux.

Figure 7.

Figure 7.

Microdialysis results. A, Placement of probes on the border of the NAc shell and core at specific anterior–posterior coordinates. B, Raw NAc DA dialysate. Compared with rGT-trained rats, cued rGT-experienced animals showed significantly less basal efflux at baseline and after saline exposure, but comparable levels after cocaine administration. C, Change in DA efflux over baseline. All animals exhibited significant potentiation in DA release after cocaine exposure, but this effect was slightly greater in cued rGT-trained rats, indicating that, although these animals have a hypodopaminergic state at baseline, they may be slightly more reactive to cocaine. Data are mean of both left and right hemispheres ± SEM.

Discussion

Here we show that training rats on a gambling-like schedule of reinforcement in which wins are accompanied by salient audiovisual cues results in behavioral and neurophysiological changes indicative of a proaddiction phenotype. Interestingly, the behavioral response to cocaine self-administration manifests differently in animals able to persist in making good choices despite the presence of the cues, versus those that prefer the risky options. Similar to the results observed previously with the uncued version of the rGT (Ferland and Winstanley, 2017), the decision making of risk-preferring rats became significantly and progressively more risky as cocaine self-administration was acquired. In addition to exhibiting greater levels of risky choice as a group, the proportion of risk-preferring rats is dramatically elevated in the cued rGT, with ∼59.3% of animals exhibiting this profile as opposed to 12.5% in the uncued task. We have shown previously that the degree to which cocaine self-administration potentiates risky choice predicts elevated drug-seeking in the incubation of craving model of relapse (Ferland and Winstanley, 2017). As such, training on the cued rGT enriches the population of animals putatively vulnerable to addiction. Cocaine self-administration did not increase preference for the risky options in optimal decision-makers, regardless of which task they were trained on. However, despite this apparent resilience to the negative sequelae of drug-taking on the choice process, optimal decision-makers trained on the cued rGT self-administered significantly more cocaine, again potentially indicative of greater addiction risk.

We originally hypothesized that persistent risky choice, particularly on the cued rGT, may exacerbate addiction vulnerability through sensitizing the DA system, in keeping with reports that both exposure to cues that predict uncertain rewards, and responding for probabilistic reinforcement, can enhance locomotor sensitization to amphetamine (Singer et al., 2012; Zack et al., 2014; Mascia et al., 2019). Our data instead suggest that risk-preferring rats have a hyposensitive DA system at the outset, in that these rats show a blunted locomotor response to an injection of cocaine. Although training on the uncued rGT resulted in robust locomotor sensitization to cocaine in all rats, this was absent in animals trained on the cued rGT. Furthermore, in vivo microdialysis confirmed that basal DA efflux within the NAc was lower in rats trained on the cued compared with the uncued rGT, whereas the proportional increase in DA detected after cocaine administration tended to be greater. Collectively, these data suggest that hypoactivity within the mesolimbic DA system may promote susceptibility to the addictive properties of psychostimulants, in keeping with the reward deficiency hypothesis of addiction (Blum et al., 2000), and that repeated engagement with heavily cued probabilistic reinforcement schedules can exacerbate this vulnerability. What remains unclear from the current data is whether this hypoactive state is specific to the NAc core or shell. DA tone within these subregions may differentially affect decision-making preferences (Sugam et al., 2012, 2014), and future studies should address this issue.

Both optimal decision-makers and risk-preferring rats trained on the cued rGT made more premature responses than their uncued counterparts, indicative of greater motor impulsivity, another cognitive behavioral trait linked to addiction risk. Animals that exhibit more premature responses at baseline self-administer cocaine in a more addiction-like manner, and accumbal D2/3 receptor expression is markedly lower in these rats (Dalley et al., 2007; Belin et al., 2008). Again, currently available data would predict that elevated premature responding would be associated with a hyperactive mesolimbic DA system. For example, potentiating DA release in the NAc increases this form of motor impulsivity, and the ability of amphetamine to drive premature responding can be attenuated by direct infusions of DA antagonists into the NAc (Pattij et al., 2007; Moreno et al., 2013). However, our data may not be entirely inconsistent with these observations. The addition of reward-concurrent cues to the rGT would be expected to potentiate the increase in DA release likely caused by responding for reward on probabilistic schedules of reinforcement. Greater phasic DA release on a background of low tonic DA levels is thought to have greater neurophysiological and behavioral effects (Grace, 1991), and this may result in elevated premature responding in the cued task.

The predicted ability of reward-concurrent cues to amplify task-induced DA release may also explain why accumbal DA levels are so much lower in animals trained on the cued rGT. Specifically, repeated exposure to heavily cued probabilistic reinforcement schedules on the cued rGT, which should drive dopaminergic activity, may lead to an adaptive downregulation of basal NAc DA release in a form of homeostatic regulation. This in turn may result in sensitization of postsynaptic receptors, such that the response to events that potentiate DA signaling is disproportionately amplified. Certainly, attenuating dopaminergic signaling through administration of DA receptor antagonists can sensitize or upregulate D2-family receptors in the striatum (Seeman, 2008; Varela et al., 2014), including the D3 receptors that uniquely mediate choice on the cued rGT (Sokoloff et al., 2006; Barrus and Winstanley, 2016). As such, rats trained on the cued rGT may be more motivated to respond for certain reinforcers as predicted by reward deficiency: animals are therefore primed to self-administer cocaine or to respond for larger rewards despite the potential negative consequences as smaller rewards are insufficient to motivate behavior (Blum et al., 2000, 2012). Indeed, recent data suggest that cocaine-dependent human subjects show blunted DA release within the ventral striatum, encompassing the NAc, after amphetamine administration, and have lower ratings of euphoria after amphetamine, yet are clearly more willing to self-administer cocaine (Trifilieff et al., 2017). Previous data also indicate that changes in DA signaling within the NAc can also influence preference for uncertain outcomes (St Onge et al., 2012; Sugam et al., 2012; Zalocusky et al., 2016), further emphasizing the potential synergy between the neurobiological mechanism impacted by addictive drugs and responding for uncertain outcomes.

The self-administration and CRf sessions were run in different operant chambers from the rGT and used distinct operant manipulanda and stimuli, but some features of the contexts and reward-paired cues were similar across paradigms, raising questions over whether learning generalized across tasks. However, responding for CRf was comparable regardless of risk-preference or whether animals performed the cued or uncued task variant, making it unlikely that learning from the context or set of cues used in the rGT impacted development of this behavior. Similarly, although a 20 kHz tone was used to signal delivery of cocaine and also featured in the reward-concurrent cues paired with one of the high-risk options (P4), there was no sign that preference for P4 affected cocaine-taking; levels of cocaine self-administration were comparable in risk-preferring rats trained on the cued rGT and rats trained on the uncued rGT. Similarly, optimal decision-makers performing the cued rGT took more cocaine but did not then shift their preference toward the riskier options.

The question remains as to why training on the cued task results in distinct behavioral responses to cocaine self-administration depending on baseline levels of risk preference: optimal decision-makers take considerably more cocaine, whereas risk-preferring rats make increasing numbers of risky decisions. If experience of cocaine exacerbates the subjective or behavioral response to low basal DA, as would be expected from the opponent process theory of addiction and supporting work, this may lead to increased drive to reach an optimal level of reward (Koob and Le Moal, 2008; Koob and Volkow, 2010; Der-Avakian and Markou, 2012; Belujon et al., 2016). Optimal decision-makers are immune to the allure of the risky, more heavily cued options on the rGT, and may therefore overcome this aversive state through taking more drug. Risk-preferring rats may instead seek to boost accumbal DA levels through making more risky choices. As such, the pursuit of large, risky rewards may almost “substitute” for cocaine self-administration in these animals. Alternatively, greater risky choice could result from insensitivity to punishment, similar to the “myopia for the future” previously hypothesized by others to hallmark decision making in addiction-vulnerable individuals (Bechara, 2005). These hypotheses remain speculative but may prove heuristically useful.

Poor cognitive performance after cocaine experience was limited to the decision-making domain in risk-preferring animals. All rats actually showed an improvement in impulse control, as indicated by a decrease in premature responding while self-administering cocaine. This is not the first time that self-administration of psychostimulants has been shown to decrease this form of motor impulsivity (Dalley et al., 2007; Caprioli et al., 2013), and may reflect the therapeutic benefit of stimulant medications in reducing impulsive responding. While clinical data clearly show that levels of impulsive choice (selection of smaller-sooner over larger-later rewards) and impulsive action are elevated in SUD (Dalley and Robbins, 2017), data from animal models may help to track variation in impulsivity across phases of drug use. Determining which behavioral trait is best predictive of problematic features of drug use, and at which stage of the addiction cycle, could be enormously beneficial in targeting appropriate interventions.

The null relationship between responding for CRf and task experience or risk-preference indicates that risky choice on either task is unlikely to simply reflect greater willingness to respond for conditioned reinforcing stimuli. In an incubation of craving paradigm, we previously observed that rats identified as risk-preferring on the uncued rGT exhibited greater responding for drug-paired cues after withdrawal from cocaine self-administration, suggesting a potential link between risky choice and cue responsivity (Ferland and Winstanley, 2017). Furthermore, a large body of literature has indicated an important relationship between cue sensitivity and/or sign-tracking and the development of an addictive-like behavioral pattern (Phillips and Fibiger, 1990; Robinson and Berridge, 1993, 2008; Tomie, 1996; Tomie et al., 2008; Flagel et al., 2009, 2010). The current results, however, suggest that there may be something unique about the incentive motivation elicited by drug-paired, rather than food-paired, cues with regards to the relationship with risky choice. The exact cognitive process through which win-paired cues drive risky choice remains unclear and is the focus of ongoing research efforts.

These data may have implications for our understanding of disordered gambling and its relationship to SUD. GD is epitomized by risky decision making, as the subject continues to play despite the danger of overwhelming financial loss (Clark et al., 2013; Petry et al., 2014). Similar to patterns of brain activation in response to drug-paired cues in SUD, cues associated with monetary rewards elicit ventral striatum and insula activity in problem gamblers (Sescousse et al., 2013; Limbrick-Oldfield et al., 2017), suggesting a common mechanism between craving for drugs and gambling. EGMs use salient audiovisual cues to encourage play, and may be particularly addictive compared with other forms of gambling (Clark et al., 2013; Murch and Clark, 2016). Given the ability of experience with reward-paired cues to both exacerbate risky decision making and to promote cocaine self-administration, even in those that are not biased toward risky choices, experience with EGMs may potentially sensitize those with GD to other forms of addiction through multiple insidious mechanisms. Indeed, in a representative survey of pathological gamblers, 73% had comorbid alcohol use disorder and nearly 40% exhibited other drug abuse (Petry et al., 2005). The role played by striatal DA signaling in GD, however, seems more complex than in SUD. While a lower density of D2/3 receptors within the ventral striatum has been repeatedly observed in SUD populations (Volkow et al., 2009), data from GD subjects have been more equivocal (Clark et al., 2012; Potenza and Brody, 2013; Sescousse et al., 2016). Although data suggest those with GD have elevated DA synthesis capacity within the ventral striatum (van Holst et al., 2018), this population also shows blunted ventral striatum activation during reward anticipation (Luijten et al., 2017). Nevertheless, the possibility that aspects of gambling products may facilitate cross-sensitization to psychostimulant drugs is an important consideration with respect to the regulation of EGM design and should be explored further.

In conclusion, these data demonstrate that pairing uncertain rewards with salient sensory cues not only increases risky choice, but fosters a hypodopaminergic state within the NAc. Understanding the mechanism through which appetitive conditioned stimuli can drive both risky decision making and addiction vulnerability will hopefully contribute to novel therapeutic interventions for both chemical and behavioral dependency. Addictions are often highly comorbid with other psychiatric disorders in which decision making is compromised, such as schizophrenia, major depression, and anxiety (Pasche, 2012; Thoma and Daum, 2013; McCabe et al., 2017; Gómez-Coronado et al., 2018), all of which have similarly been associated with aberrant DA signaling (Grace, 2016; Faivre et al., 2018; Hellberg et al., 2018). These findings may further clarify the neurocognitive processes underlying the synergy between these psychopathologies.

Footnotes

This work was supported by Canadian Institutes for Health Research Open Operating Grant to C.A.W. We thank Sukhbir Kaur for genotyping animals used in Experiment 2; Mason Silveira for assisting with breeding animals used in this work; and Giada Vacca for assistance with HPLC.

In the last 3 years, C.A.W. has been retained as an expert witness by Hogan Lovells LLP, and received due compensation. A.G.P. declares a patent related to glutamate receptor function (A Peptide that Specifically Blocks Regulated AMPA Receptor Endocytosis and Hippocampal CA1 Long-term Depression; European 04789721.0, and United States 13/066,700). A.G.P. also declares a pending patent for the use of d-Govadine in treatment of cognitive deficits. The remaining authors declare no competing financial interests.

References

  1. Adams WK, Barkus C, Ferland JN, Sharp T, Winstanley CA (2017) Pharmacological evidence that 5-HT2C receptor blockade selectively improves decision making when rewards are paired with audiovisual cues in a rat gambling task. Psychopharmacology (Berl) 234:3091–3104. 10.1007/s00213-017-4696-4 [DOI] [PubMed] [Google Scholar]
  2. Barrus MM, Winstanley CA (2016) Dopamine D3 receptors modulate the ability of win-paired cues to increase risky choice in a rat gambling task. J Neurosci 36:785–794. 10.1523/JNEUROSCI.2225-15.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Bechara A. (2003) Risky business: emotion, decision-making, and addiction. J Gambl Stud 19:23–51. 10.1023/A:1021223113233 [DOI] [PubMed] [Google Scholar]
  4. Bechara A. (2005) Decision making, impulse control and loss of willpower to resist drugs: a neurocognitive perspective. Nat Neurosci 8:1458–1463. 10.1038/nn1584 [DOI] [PubMed] [Google Scholar]
  5. Belin D, Mar AC, Dalley JW, Robbins TW, Everitt BJ (2008) High impulsivity predicts the switch to compulsive cocaine-taking. Science 320:1352–1355. 10.1126/science.1158136 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Belujon P, Jakobowski NL, Dollish HK, Grace AA (2016) Withdrawal from acute amphetamine induces an amygdala-driven attenuation of dopamine neuron activity: reversal by ketamine. Neuropsychopharmacology 41:619–627. 10.1038/npp.2015.191 [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Beninger RJ, Banasikowski TJ (2008) Dopaminergic mechanism of reward-related incentive learning: focus on the dopamine D(3) receptor. Neurotox Res 14:57–70. 10.1007/BF03033575 [DOI] [PubMed] [Google Scholar]
  8. Bissonette GB, Roesch MR (2016) Neurophysiology of reward-guided behavior: correlates related to predictions, value, motivation, errors, attention, and action. Curr Top Behav Neurosci 27:199–230. 10.1007/7854_2015_382 [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Blum K, Braverman ER, Holder JM, Lubar JF, Monastra VJ, Miller D, Lubar JO, Chen TJ, Comings DE (2000) Reward deficiency syndrome: a biogenetic model for the diagnosis and treatment of impulsive, addictive, and compulsive behaviors. J Psychoactive Drugs 32 [Suppl i-iv]:1–112. [DOI] [PubMed] [Google Scholar]
  10. Blum K, Gardner E, Oscar-Berman M, Gold M (2012) “Liking” and “wanting” linked to Reward Deficiency Syndrome (RDS): hypothesizing differential responsivity in brain reward circuitry. Curr Pharm Des 18: 113–118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bolla KI, Eldreth DA, Matochik JA, Cadet JL (2005) Neural substrates of faulty decision-making in abstinent marijuana users. Neuroimage 26:480–492. 10.1016/j.neuroimage.2005.02.012 [DOI] [PubMed] [Google Scholar]
  12. Brown PL, Jenkins HM (1968) Auto-shaping of the pigeon's keypeck. J Exp Anal Behav 11:1–8. 10.1901/jeab.1968.11-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Calu DJ, Stalnaker TA, Franz TM, Singh T, Shaham Y, Schoenbaum G (2007) Withdrawal from cocaine self-administration produces long-lasting deficits in orbitofrontal-dependent reversal learning in rats. Learn Mem 14:325–328. 10.1101/lm.534807 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Caprioli D, Hong YT, Sawiak SJ, Ferrari V, Williamson DJ, Jupp B, Adrian Carpenter T, Aigbirhio FI, Everitt BJ, Robbins TW, Fryer TD, Dalley JW (2013) Baseline-dependent effects of cocaine pre-exposure on impulsivity and D2/3 receptor availability in the rat striatum: possible relevance to the attention-deficit hyperactivity syndrome. Neuropsychopharmacology 38:1460–1471. 10.1038/npp.2013.44 [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Chou KL, Afifi TO (2011) Disordered (pathologic or problem) gambling and axis I psychiatric disorders: results from the national epidemiologic survey on alcohol and related conditions. Am J Epidemiol 173:1289–1297. 10.1093/aje/kwr017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Ciccarone D. (2011) Stimulant abuse: pharmacology, cocaine, methamphetamine, treatment, attempts at pharmacotherapy. Prim Care 38:41–58. 10.1016/j.pop.2010.11.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Clark L, Stokes PR, Wu K, Michalczuk R, Benecke A, Watson BJ, Egerton A, Piccini P, Nutt DJ, Bowden-Jones H, Lingford-Hughes AR (2012) Striatal dopamine D(2)/D(3) receptor binding in pathological gambling is correlated with mood-related impulsivity. Neuroimage 63:40–46. 10.1016/j.neuroimage.2012.06.067 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Clark L, Averbeck B, Payer D, Sescousse G, Winstanley CA, Xue G (2013) Pathological choice: the neuroscience of gambling and gambling addiction. J Neurosci 33:17617–17623. 10.1523/JNEUROSCI.3231-13.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Dalley JW, Robbins TW (2017) Fractionating impulsivity: neuropsychiatric implications. Nat Rev Neurosci 18:158–171. 10.1038/nrn.2017.8 [DOI] [PubMed] [Google Scholar]
  20. Dalley JW, Fryer TD, Brichard L, Robinson ES, Theobald DE, Lääne K, Peña Y, Murphy ER, Shah Y, Probst K, Abakumova I, Aigbirhio FI, Richards HK, Hong Y, Baron JC, Everitt BJ, Robbins TW (2007) Nucleus accumbens D2/3 receptors predict trait impulsivity and cocaine reinforcement. Science 315:1267–1270. 10.1126/science.1137073 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Der-Avakian A, Markou A (2012) The neurobiology of anhedonia and other reward-related deficits. Trends Neurosci 35:68–77. 10.1016/j.tins.2011.11.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Dowling N, Smith D, Thomas T (2005) Electronic gaming machines: are they the 'crack-cocaine' of gambling? Addiction 100:33–45. 10.1111/j.1360-0443.2005.00962.x [DOI] [PubMed] [Google Scholar]
  23. Faivre F, Joshi A, Bezard E, Barrot M (2018) The hidden side of Parkinson's disease: studying pain, anxiety and depression in animal models. Neurosci Biobehav Rev 96:335–352. 10.1016/j.neubiorev.2018.10.004 [DOI] [PubMed] [Google Scholar]
  24. Ferland JN, Winstanley CA (2017) Risk-preferring rats make worse decisions and show increased incubation of craving after cocaine self-administration. Addict Biol 22:991–1001. 10.1111/adb.12388 [DOI] [PubMed] [Google Scholar]
  25. Flagel SB, Watson SJ, Robinson TE, Akil H (2007) Individual differences in the propensity to approach signals vs goals promote different adaptations in the dopamine system of rats. Psychopharmacology (Berl) 191:599–607. 10.1007/s00213-006-0535-8 [DOI] [PubMed] [Google Scholar]
  26. Flagel SB, Akil H, Robinson TE (2009) Individual differences in the attribution of incentive salience to reward-related cues: implications for addiction. Neuropharmacology 56 [Suppl 1]:139–148. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Flagel SB, Robinson TE, Clark JJ, Clinton SM, Watson SJ, Seeman P, Phillips PE, Akil H (2010) An animal model of genetic vulnerability to behavioral disinhibition and responsiveness to reward-related cues: implications for addiction. Neuropsychopharmacology 35:388–400. 10.1038/npp.2009.142 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Gómez-Coronado N, Sethi R, Bortolasci CC, Arancini L, Berk M, Dodd S (2018) A review of the neurobiological underpinning of comorbid substance use and mood disorders. J Affect Disord 241:388–401. 10.1016/j.jad.2018.08.041 [DOI] [PubMed] [Google Scholar]
  29. Grace AA. (1991) Phasic versus tonic dopamine release and the modulation of dopamine system responsivity: a hypothesis for the etiology of schizophrenia. Neuroscience 41:1–24. 10.1016/0306-4522(91)90196-U [DOI] [PubMed] [Google Scholar]
  30. Grace AA. (2016) Dysregulation of the dopamine system in the pathophysiology of schizophrenia and depression. Nat Rev Neurosci 17:524–532. 10.1038/nrn.2016.57 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Hellberg SN, Russell TI, Robinson MJ (2018) Cued for risk: evidence for an incentive sensitization framework to explain the interplay between stress and anxiety, substance abuse, and reward uncertainty in disordered gambling behavior. Cogn Affect Behav Neurosci. Advance online publication. Retrieved October 24, 2018. doi: 10.3758/s13415-018-00662-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Koob GF, Le Moal M (2008) Review: neurobiological mechanisms for opponent motivational processes in addiction. Philos Trans R Soc Lond B Biol Sci 363:3113–3123. 10.1098/rstb.2008.0094 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Koob GF, Volkow ND (2010) Neurocircuitry of addiction. Neuropsychopharmacology 35:217–238. 10.1038/npp.2009.110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Le Foll B, Goldberg SR, Sokoloff P (2005) The dopamine D3 receptor and drug dependence: effects on reward or beyond? Neuropharmacology 49:525–541. 10.1016/j.neuropharm.2005.04.022 [DOI] [PubMed] [Google Scholar]
  35. Limbrick-Oldfield EH, Mick I, Cocks RE, McGonigle J, Sharman SP, Goldstone AP, Stokes PR, Waldman A, Erritzoe D, Bowden-Jones H, Nutt D, Lingford-Hughes A, Clark L (2017) Neural substrates of cue reactivity and craving in gambling disorder. Transl Psychiatry 7:e992. 10.1038/tp.2016.256 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Livingstone C, Adams PJ (2011) Harm promotion: observations on the symbiosis between government and private industries in Australasia for the development of highly accessible gambling markets. Addiction 106:3–8. 10.1111/j.1360-0443.2010.03137.x [DOI] [PubMed] [Google Scholar]
  37. Lorains FK, Cowlishaw S, Thomas SA (2011) Prevalence of comorbid disorders in problem and pathological gambling: systematic review and meta-analysis of population surveys. Addiction 106:490–498. 10.1111/j.1360-0443.2010.03300.x [DOI] [PubMed] [Google Scholar]
  38. Luijten M, Schellekens AF, Kühn S, Machielse MW, Sescousse G (2017) Disruption of reward processing in addiction: an image-based meta-analysis of functional magnetic resonance imaging studies. JAMA Psychiatry 74:387–398. 10.1001/jamapsychiatry.2016.3084 [DOI] [PubMed] [Google Scholar]
  39. Mascia P, Neugebauer NM, Brown J, Bubula N, Nesbitt KM, Kennedy RT, Vezina P (2019) Exposure to conditions of uncertainty promotes the pursuit of amphetamine. Neuropsychopharmacology 44:274–280. 10.1038/s41386-018-0099-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. McCabe SE, West BT, Jutkiewicz EM, Boyd CJ (2017) Multiple DSM-5 substance use disorders: a national study of US adults. Hum Psychopharmacol 32. Advance online publication. Retrieved July 27, 2017. doi: 10.1002/hup.2625 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. McDonald JH. (2009) Handbook of biological statistics. Baltimore: Sparky House. [Google Scholar]
  42. Moreno M, Economidou D, Mar AC, Lopez-Granero C, Caprioli D, Theobald DE, Fernando A, Newman AH, Robbins TW, Dalley JW (2013) Divergent effects of D(2)/(3) receptor activation in the nucleus accumbens core and shell on impulsivity and locomotor activity in high and low impulsive rats. Psychopharmacology (Berl) 228:19–30. 10.1007/s00213-013-3010-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Murch WS, Clark L (2016) Games in the brain: neural substrates of gambling addiction. Neuroscientist 22:534–545. 10.1177/1073858415591474 [DOI] [PubMed] [Google Scholar]
  44. Pasche S. (2012) Exploring the comorbidity of anxiety and substance use disorders. Curr Psychiatry Rep 14:176–181. 10.1007/s11920-012-0264-0 [DOI] [PubMed] [Google Scholar]
  45. Pattij T, Janssen MC, Vanderschuren LJ, Schoffelmeer AN, van Gaalen MM (2007) Involvement of dopamine D1 and D2 receptors in the nucleus accumbens core and shell in inhibitory response control. Psychopharmacology (Berl) 191:587–598. 10.1007/s00213-006-0533-x [DOI] [PubMed] [Google Scholar]
  46. Paxinos G, Watson C (1998) The rat brain in stereotaxic coordinates. San Diego: Academic. [DOI] [PubMed] [Google Scholar]
  47. Petry NM, Stinson FS, Grant BF (2005) Comorbidity of DSM-IV pathological gambling and other psychiatric disorders: results from the national epidemiologic survey on alcohol and related conditions. J Clin Psychiatry 66:564–574. 10.4088/JCP.v66n0504 [DOI] [PubMed] [Google Scholar]
  48. Petry NM, Blanco C, Auriacombe M, Borges G, Bucholz K, Crowley TJ, Grant BF, Hasin DS, O'Brien C (2014) An overview of and rationale for changes proposed for pathological gambling in DSM-5. J Gambl Stud 30:493–502. 10.1007/s10899-013-9370-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Phillips AG, Fibiger HC (1990) Role of reward and enhancement of conditioned reward in persistence of responding for cocaine. Behav Pharmacol 1:269–282. 10.1097/00008877-199000140-00002 [DOI] [PubMed] [Google Scholar]
  50. Potenza MN, Brody AL (2013) Commentary on Boileau et al. (2013): distinguishing D2/D3 dopaminergic contributions to addictions. Addiction 108:964–965. 10.1111/add.12119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Robbins TW. (2002) The 5-choice serial reaction time task: behavioural pharmacology and functional neurochemistry. Psychopharmacology (Berl) 163:362–380. 10.1007/s00213-002-1154-7 [DOI] [PubMed] [Google Scholar]
  52. Robinson TE, Berridge KC (1993) The neural basis of drug craving: an incentive-sensitization theory of addiction. Brain Res Brain Res Rev 18:247–291. 10.1016/0165-0173(93)90013-P [DOI] [PubMed] [Google Scholar]
  53. Robinson TE, Berridge KC (2008) Review: the incentive sensitization theory of addiction: some current issues. Philos Trans R Soc Lond B Biol Sci 363:3137–3146. 10.1098/rstb.2008.0093 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Rogers RD, Everitt BJ, Baldacchino A, Blackshaw AJ, Swainson R, Wynne K, Baker NB, Hunter J, Carthy T, Booker E, London M, Deakin JF, Sahakian BJ, Robbins TW (1999) Dissociable deficits in the decision-making cognition of chronic amphetamine abusers, opiate abusers, patients with focal damage to prefrontal cortex, and tryptophan-depleted normal volunteers: evidence for monoaminergic mechanisms. Neuropsychopharmacology 20:322–339. 10.1016/S0893-133X(98)00091-8 [DOI] [PubMed] [Google Scholar]
  55. Saddoris MP, Sugam JA, Stuber GD, Witten IB, Deisseroth K, Carelli RM (2015) Mesolimbic dopamine dynamically tracks, and is causally linked to, discrete aspects of value-based decision making. Biol Psychiatry 77:903–911. 10.1016/j.biopsych.2014.10.024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Seeman P. (2008) Dopamine D2(High) receptors moderately elevated by bifeprunox and aripiprazole. Synapse 62:902–908. 10.1002/syn.20557 [DOI] [PubMed] [Google Scholar]
  57. Sescousse G, Barbalat G, Domenech P, Dreher JC (2013) Imbalance in the sensitivity to different types of rewards in pathological gambling. Brain 136:2527–2538. 10.1093/brain/awt126 [DOI] [PubMed] [Google Scholar]
  58. Sescousse G, Janssen LK, Hashemi MM, Timmer MH, Geurts DE, Ter Huurne NP, Clark L, Cools R (2016) Amplified striatal responses to near-miss outcomes in pathological gamblers. Neuropsychopharmacology 41:2614–2623. 10.1038/npp.2016.43 [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Singer BF, Scott-Railton J, Vezina P (2012) Unpredictable saccharin reinforcement enhances locomotor responding to amphetamine. Behav Brain Res 226:340–344. 10.1016/j.bbr.2011.09.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Sokoloff P, Diaz J, Le Foll B, Guillin O, Leriche L, Bezard E, Gross C (2006) The dopamine D3 receptor: a therapeutic target for the treatment of neuropsychiatric disorders. CNS Neurol Disord Drug Targets 5:25–43. 10.2174/187152706784111551 [DOI] [PubMed] [Google Scholar]
  61. St Onge JR, Stopper CM, Zahm DS, Floresco SB (2012) Separate prefrontal-subcortical circuits mediate different components of risk-based decision making. J Neurosci 32:2886–2899. 10.1523/JNEUROSCI.5625-11.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Stevens L, Betanzos-Espinosa P, Crunelle CL, Vergara-Moragues E, Roeyers H, Lozano O, Dom G, Gonzalez-Saiz F, Vanderplasschen W, Verdejo-García A, Pérez-García M (2013) Disadvantageous decision-making as a predictor of drop-out among cocaine-dependent individuals in long-term residential treatment. Front Psychiatry 4:149. 10.3389/fpsyt.2013.00149 [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Sugam JA, Day JJ, Wightman RM, Carelli RM (2012) Phasic nucleus accumbens dopamine encodes risk-based decision-making behavior. Biol Psychiatry 71:199–205. 10.1016/j.biopsych.2011.09.029 [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Sugam JA, Saddoris MP, Carelli RM (2014) Nucleus accumbens neurons track behavioral preferences and reward outcomes during risky decision making. Biol Psychiatry 75:807–816. 10.1016/j.biopsych.2013.09.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Thoma P, Daum I (2013) Comorbid substance use disorder in schizophrenia: a selective overview of neurobiological and cognitive underpinnings. Psychiatry Clin Neurosci 67:367–383. 10.1111/pcn.12072 [DOI] [PubMed] [Google Scholar]
  66. Tomie A. (1996) Locating reward cue at response manipulandum (CAM) induces symptoms of drug abuse. Neurosci Biobehav Rev 20:505–535. 10.1016/0149-7634(95)00023-2 [DOI] [PubMed] [Google Scholar]
  67. Tomie A, Grimes KL, Pohorecky LA (2008) Behavioral characteristics and neurobiological substrates shared by Pavlovian sign-tracking and drug abuse. Brain Res Rev 58:121–135. 10.1016/j.brainresrev.2007.12.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Trifilieff P, Ducrocq F, van der Veldt S, Martinez D (2017) Blunted dopamine transmission in addiction: potential mechanisms and implications for behavior. Semin Nucl Med 47:64–74. 10.1053/j.semnuclmed.2016.09.003 [DOI] [PubMed] [Google Scholar]
  69. Vacca G, Ahn S, Phillips AG (2007) Effects of short-term abstinence from escalating doses of D-amphetamine on drug and sucrose-evoked dopamine efflux in the rat nucleus accumbens. Neuropsychopharmacology 32:932–939. 10.1038/sj.npp.1301173 [DOI] [PubMed] [Google Scholar]
  70. van Holst RJ, Sescousse G, Janssen LK, Janssen M, Berry AS, Jagust WJ, Cools R (2018) Increased striatal dopamine synthesis capacity in gambling addiction. Biol Psychiatry 83:1036–1043. 10.1016/j.biopsych.2017.06.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Varela FA, Der-Ghazarian T, Lee RJ, Charntikov S, Crawford CA, McDougall SA (2014) Repeated aripiprazole treatment causes dopamine D2 receptor up-regulation and dopamine supersensitivity in young rats. J Psychopharmacol 28:376–386. 10.1177/0269881113504016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Verdejo-García A, Bechara A (2009) A somatic marker theory of addiction. Neuropharmacology 56 [Suppl 1]:48–62. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Verdejo-García AJ, Perales JC, Pérez-García M (2007) Cognitive impulsivity in cocaine and heroin polysubstance abusers. Addict Behav 32:950–966. 10.1016/j.addbeh.2006.06.032 [DOI] [PubMed] [Google Scholar]
  74. Volkow ND, Fowler JS, Wang GJ, Baler R, Telang F (2009) Imaging dopamine's role in drug abuse and addiction. Neuropharmacology 56 [Suppl 1]:3–8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Wang G, Shi J, Chen N, Xu L, Li J, Li P, Sun Y, Lu L (2013) Effects of length of abstinence on decision-making and craving in methamphetamine abusers. PLoS One 8:e68791. 10.1371/journal.pone.0068791 [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Williams BA. (1994) Conditioned reinforcement: neglected or outmoded explanatory construct? Psychonom Bull Rev 1:457–475. 10.3758/BF03210950 [DOI] [PubMed] [Google Scholar]
  77. Winstanley CA, Cocker PJ, Rogers RD (2011) Dopamine modulates reward expectancy during performance of a slot machine task in rats: evidence for a 'near-miss' effect. Neuropsychopharmacology 36:913–925. 10.1038/npp.2010.230 [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Zack M, Featherstone RE, Mathewson S, Fletcher PJ (2014) Chronic exposure to a gambling-like schedule of reward predictive stimuli can promote sensitization to amphetamine in rats. Front Behav Neurosci 8:36. 10.3389/fnbeh.2014.00036 [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Zalocusky KA, Ramakrishnan C, Lerner TN, Davidson TJ, Knutson B, Deisseroth K (2016) Nucleus accumbens D2R cells signal prior outcomes and control risky decision-making. Nature 531:642–646. 10.1038/nature17400 [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Zeeb FD, Robbins TW, Winstanley CA (2009) Serotonergic and dopaminergic modulation of gambling behavior as assessed using a novel rat gambling task. Neuropsychopharmacology 34:2329–2343. 10.1038/npp.2009.62 [DOI] [PubMed] [Google Scholar]

Articles from The Journal of Neuroscience are provided here courtesy of Society for Neuroscience

RESOURCES