Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Mar 15.
Published in final edited form as: Biol Psychiatry. 2013 Oct 17;75(6):487–498. doi: 10.1016/j.biopsych.2013.09.005

Endocannabinoids promote cocaine-induced impulsivity and its rapid dopaminergic correlates

Giovanni Hernandez 1, Erik B Oleson 2, Ronny N Gentry 2, Zarish Abbas 3, David L Bernstein 2, A Arvanitogiannis 3, Joseph F Cheer 2,4,*
PMCID: PMC3943889  NIHMSID: NIHMS525241  PMID: 24138924

Abstract

Background

Impaired decision-making, a hallmark of addiction, is hypothesized to arise from maladaptive plasticity in the mesolimbic dopamine pathway. The endocannabinoid (eCB) system modulates dopamine activity through activation of CB1 receptors (CB1R). Here, we investigated whether impulsive behavior observed following cocaine exposure requires CB1R activation.

Methods

We trained rats in a delay-discounting task. Following acquisition of stable performance, rats were exposed to cocaine (10 mg/kg, i.p.) every-other-day for 14 days and locomotor activity was measured. One week later, delay-discounting performance was reevaluated. To assess reversal of impulsivity, injections of a CB1R antagonist (1.5 mg/kg, i.p.) or vehicle were given 30-min before the task. During the second experiment, aimed at preventing impulsivity, rather than reversing it, CB1R were antagonized prior to each cocaine injection. In this experiment, sub-second dopamine release was measured in the nucleus accumbens (NAc) during delay-discounting sessions before and after cocaine treatment.

Results

Blockade of CB1R reversed and prevented cocaine-induced impulsivity. Electrochemical results showed that during baseline, and following disruption of eCB signaling, there was a robust increase in dopamine for immediate large rewards compared to immediate small rewards; but this effect reversed when the delay for the large reward was 10-s. In contrast, dopamine release always increased for one-pellet options at minimal or moderate delays in vehicle-treated rats.

Conclusion

eCBs play a critical role in changes associated with cocaine exposure. CB1R blockade may thus counteract maladaptive alterations in afferents to dopamine neurons; thereby, preventing changes in dopaminergic activity underlying a loss of self-control.

Keywords: Decision-making, self-control, cocaine, CB1 receptors, dopamine, Fast-scan cyclic voltammetry

Introduction

Human and non–human species discount delayed rewarding outcomes; the subjective value of a rewarding outcome depends on how distant in the future it is. In general, when questioned about future options, we tend to choose and prefer large distant future rewards to smaller less distant ones. However as time elapses, and depending on the value of the delayed reward, this preference switches (1); this occurs because immediate rewarding outcomes have a greater subjective value than delayed ones. When the larger delayed reward is chosen over the smaller immediate reward, self-control has been exerted. When the opposite occurs, impulsivity has taken place (2; 3).

Drug addiction can be seen as an impairment of decision-making processes, in which the weight of the delayed reward has very little repercussion on the preference of the individual. Therefore, immediate rewards and the instant gratification that drugs of abuse produce is disproportionally chosen over the long-term gratification of a healthy life style (4; 5). When tested under laboratory settings, individuals with substance abuse problems (4; 68) as well as non-human subjects that had been sensitized to the effects of drugs of abuse, perform impulsively in tasks that assess self-control (3; 9; 10; 11). A key neural substrate involved in decision-making and an important mediator of rewarding stimuli is the mesolimbic dopaminergic system (1214). Dopamine (DA) neurons in the ventral tegmental area (VTA) encode the subjective value of reward (1517). When measured during a delay-discounting task (DDt), in which the size and delay of reward delivery is varied, putative DA neurons fire at a higher rate for cues that predict larger rewards and for rewards with smaller delays (18). When phasic accumbal DA release is measured in a task that manipulates effort and delay to obtain a reward, release is higher for the cue that predicts lesser effort as well as for the cue that predicts an immediate reward (19). However, it is presently unclear whether different aspects of a DDt are encoded by phasic DA release in the NAc, and whether this changes following a sensitizing cocaine exposure.

Most drugs of abuse, directly or indirectly, alter DA neurotransmission (20) and chronic exposure to them produces plastic changes in multiple brain areas that are believed to underlie addiction (21). Rodents sensitized to the effects of stimulant drugs—and in particular cocaine—show increased DA availability in the NAc (2223). This increase has been hypothesized to arise from different neural adaptations. Some researchers have emphasized changes in transduction at D1 and/or D2 receptors (13; 2528), whereas others stress the interaction between DA neurons and inhibitory (2935) and excitatory neurotransmitters (3644).

The endocannabinoid (eCB) system is an important component of the reward circuitry. eCBs are retrograde lipid messengers that bind to CB1R localized presynaptically, mainly on excitatory (glutamatergic) and inhibitory (GABAergic) synapses (45; 46) where their main effect is to decrease the probability of neurotransmitter release (47; 48). These molecules participate in synaptic plasticity in the VTA (49; 50); where they modulate DA neuron excitability (48; 51) and they are also necessary for drug- and reward-predictive cues to evoke DA release (52; 53). eCB involvement in different forms of plasticity and regulation of DA release makes this system critical in fine-tuning goal-directed behavior (54), particularly when it is compromised, as it is in the case of drug addiction. eCB signaling is involved in conditioned drug seeking and relapse (51; 55; 56), as well as in cue-induced reinstatement (5658). Importantly, eCBs play a key role in cocaine sensitization (5962 but see 59), by inducing the cocaine psychomotor response and increasing the reliability of sensitization (62).

Given the pivotal role eCBs play in neural plasticity at the circuit level, and their involvement in modulating psychostimulant-induced changes in DA release, the present study investigates the role of CB1R activation in the reversal and blockade of impulsive behavior in a DDt following repeated cocaine administration. In addition, we provide the first demonstration that the eCB system modulates phasic DA release during the development of cocaine-induced impulsivity.

Methods and Materials

Subjects and surgery

Thirty-six male Long-Evans (Charles-River, St. Constant, QC, or Wilmington, MA) rats weighing 300–350 g at the time of arrival, served as subjects. Rats were individually housed in a temperature and humidity controlled room with a 12-h light-dark cycle (lights on at 07:00 h). Animals were divided as follows: eighteen rats were used in the reversal experiment; twelve of those received cocaine and six received saline. The remaining eighteen were used in the blockade experiment. Of those rats, twelve were anesthetized with isoflurane and stereotaxically implanted with chronic carbon fiber electrodes (63) aimed at the NAc shell (+1.7 AP, +0.8 ML, −7.0 DV), ipsilateral bipolar stimulating electrodes (Plastics One, Roanoke, VA) aimed at the VTA (−5.4 AP, +0.5 ML, −8.7 DV) and contralateral Ag/AgCl reference electrodes. A triangular input waveform (initial ramp, −0.4–1.3 V, 400 V/s)(64) was applied to the recording electrode at 60 Hz for 30-min and then reduced to 10 Hz. At this point electrically evoked DA release was monitored. If needed, the recording electrode was lowered by 0.1 mm from the initial coordinates until release was observed. Our electrode placement in the dorso-ventral direction was guided by the kinetics of uptake, which are slower in the shell compared to the core of the NAc. For a detailed description see (65). Electrodes were secured with dental acrylic and skull-screw anchors. Rats were allowed to recover for at least 10 days, during which time they received food and water ad libitum. Before starting behavioral sessions rats were food restricted at 85% ± 5% of their free-feeding weight and maintained around this weight throughout the experiment. The final recordings were carried-out in eight rats due to poor electrode sensitivity. The remaining six rats received a combination of CB1 antagonist and saline. All procedures were carried out in accordance with established practices as described in the NIH Guide for Care and Use of Laboratory Animals as well as by the Canadian Council on Animal Care. In addition, all procedures were reviewed and approved by the Animal Care and Use Committee of University of Maryland School of Medicine and by the Animal Research Ethics Committee of Concordia University.

Drugs

Two structurally similar CB1R inverse agonists SR141716A (Rimonabant) (National Institute on Drug Abuse Drug Supply Program, Raleigh, NC) or AM251 (Tocris Bioscience, Bristol UK) that produce comparable effects in several behavioral paradigms (see (66) for a review) as well as molecular changes (61) were used. They were dissolved in a solution of (1:1:18) ethanol, Emulphor (Rhodia, Cranbury, NJ), and saline and injected intraperitoneally (i.p.) at 0 and 1.5 mg/kg; a dose selected on its inability to alter locomotor activity (67). Cocaine (National Institute on Drug Abuse Drug Supply Program, Raleigh, NC or Medisca Pharmaceutique Montreal, QC) was dissolved in saline and injected at 10 mg/kg. All drugs were given in an injection volume of 1 ml/kg.

Behavioral training

Experiments were conducted in rat operant conditioning chambers (12.5″ L × 13.5″ W × 13.5 H; Med Associates, Georgia, VT) located within ventilated sound attenuation chambers. The operant boxes were equipped with a white noise amplifier, a house light located at the rear of the chamber, three cue lights above two retractable levers (Coulbourn Instruments, Whitehall, PA) and a pellet receptacle and modular pellet dispenser. The pellet receptacle was centrally located between the two levers and was fitted with infrared photobeams located horizontally across the entrance, this allowed for detecting head entrances into the receptacle.

Rats were initially trained under a fixed-ratio 1 reinforcement schedule to lever press for a 45 mg chocolate flavored sucrose pellet (Bio-Serv, Frenchtown, NJ). Only one lever was presented at random on any of the two lever locations. Once animals pressed for minimum of 100 times in an hour they were trained to nose-poke in the pellet receptacle in order to trigger the presentation of the levers. This ensured that at the beginning of each trial, rats were positioned centrally between the levers. These trials began with the house and cue light in the pellet receptacle turned on; once the rat nose-poked, the pellet receptacle light went off and one cue light selected at random was turned on for 2-s before the presentation of its associated lever. When the rat pressed the lever the house light and cue light extinguished, the lever retracted and the light of the pellet receptacle went on. After the rat harvested the pellet all lights in the box were off for the duration of the 15-s inter-trial interval. After two consecutive sessions, rats started the delayed reward task.

The DDt was modified from Evenden and Ryan (68). Each session consisted of 6 blocks of 14 trials each (Figure S1). The first block was considered a warm-up phase. Each block began with a pair of forced trials in which both the left or the right cue was turned on and 2-s later its associated lever was presented. These trials were followed by 12 free-choice trials in which both cues and levers were presented. Between blocks the assignment of the lever associated with the reward delivery delay was changed randomly in order to minimize potential selection bias. The cue on top of the lever assigned with the immediate delivery of a single pellet was always non-flashing; whereas the cue on top of the lever assigned with the delivery of four-pellet after a delay was flashing, the frequency at which the cue flashed changed with each delay and the selection of flashing frequencies for each delay were counterbalance between subjects. The reward delay of the four pellets option was increased between blocks in a sequential manner from 0-s in blocks 1 and 2, to 4, 10, 25 and 63-s in each of the remaining blocks. The intertrial interval during this part of the training was of variable length depending on the choice made, so that the time between trials was always 75-s. The latency to press the levers and response omissions were registered. Rats were trained under this schedule until performance was stable; e.g., a non-significant difference across 5 consecutive days as per repeated-measures ANOVA. Rats received at least 21 days of training before moving to the other phases of the experiment.

Locomotion measurements

Locomotor activity was assessed in individual activity boxes containing infrared sensors located at the bottom of the longitudinal axis (Med Associates, Inc., St. Albans, VT cages). Activity for both experiments was monitored over 1-h at approximately the same time each day. On the first two days rats were injected with saline and habituated to the locomotor boxes. Following habituation, rats used for the reversal experiment were injected with cocaine on alternate days for 14 days and locomotor activity was assessed. For the blockade experiment either rimonabant or vehicle was injected 30-min prior to the injection of cocaine or saline and subsequent measurement of locomotor activity. 2 days after the last injection rats returned to the delay-discounting task.

For the reversal and blockade experiment, cocaine pre-exposure effects on the delay-discounting task were assessed for the next 5 consecutive days; but for the reversal experiment performance was evaluated for another 5 days following a daily injection of AM251 or vehicle 30-min before the beginning of the DDt

Fast-Scan Cyclic Voltammetry

During the DDt DA was detected from fast-scan cyclic voltammograms collected at the carbon fiber electrode every 100-ms (initial waveform: −0.4 V to 1.3 V, 400 V/s) (63). Principal component regression (PCR) was used as previously described to extract the DA component from the raw voltammetric data (69). DA concentration was estimated based on the average post-implantation sensitivity of electrodes (63; 70).

Data Analysis

Behavioral performance across the different phases of the task as well as phasic DA release were evaluated using repeated-measures ANOVA to the arcsine transformation in order to limit the effect of an artificially imposed ceiling (71). When significant interactions or main effects were obtained (p < 0.05), differences between groups were tested using Dunnett’s multiple comparison test. Omissions, response latencies and locomotor counts were analyzed using paired t-tests. All analyses were carried-out using Statistica (Statsoft, Tulsa, OK).

Results

Blockade of CB1 receptors reverses cocaine-induced changes in preference for the large reward

The initial injection of cocaine produced similar increases in locomotor activity in both groups, the group that would be injected with the CB1R antagonist and the one injected with vehicle (t(10)= −1.17, p= 0.26). Rats of both groups were sensitized to the locomotor effects of cocaine (t(5) Vehicle-group = 2.86; p= 0.035; t(5) CB1Blockade-group = 3.32, p= 0.020) (Figure 1A–B; Figure S2). Saline injected rats showed no statistical difference between the first and last saline injection (t(5) saline-group= 0.14, p= 0.88) (Figure 1C; Figure S2). Cocaine pretreatment produced a significant change in self-control (Figure 2A–B), as a significant delay (Fvehicle-group(4,348)= 141.57, p=0.000; FCB1Blockade-group(4,348)= 128.23, p<0.001) or pretreatment effect (Fvehicle-group(2,87)=7.41, p=0.000; FCB1Blockade-group(2,87)= 10.75, p<0.001), and a significant interaction (Fvehicle-group(8,348)= 2.35, p=0.017; FCB1Blockade-group(8,348)= 4.17, p<0.001) were observed for both groups. Post-hoc analysis confirms that behavior observed across the first four delays (0, 4, 10, 25-s) was significantly different, showing an orderly decrease in the preference for the large reward as the delay to obtain it increased. However, behavior observed for the last two delays (25 and 63-s) was statistically similar. Pre-treatment with cocaine decreased the preference for the large reward, in particular during the first 3 delays (0, 4, 10-s) (Figure 2A, S6A); this change in preference was stable (Figure S3C–D) and was not reversed by administration of vehicle. Interestingly, rats whose CB1R were blocked reversed their behavior to a level similar to that observed during baseline (Figure 2B, S6B). Visual inspection of the different sessions revealed that the effect of CB1R blockade on performance was not immediate (Figure S3E–F). Temporal factors cannot explain the changes in performance observed following cocaine pretreatment since performance under saline was not different from baseline (Fsaline-group(1,58)=0.33, p=0.562) (Figure 2C, S6C). When contrasted against baseline none of the treatments had an effect on other performance measures; latency to press (t(5) Vehicle-group= 0.84, p= 0.217; t(5) CB1Blockade-group = 0.60, p= 0.286; t(5) saline-group= 0.51, p= 0.314 nor omissions, which were extremely rare (t(5) Vehicle-group= 0.99, p= 0.183; t(5) CB1Blockade-group = 0.20, p=0.423; t(5) saline-group= 0.25, p= 0.404).

Figure 1.

Figure 1

Locomotor activity for the different groups in the reversal experiment. When the activity between the first cocaine injection and the last cocaine injection are contrasted; repeated cocaine administration produces an increase in locomotor activity (p<0.05) in the groups that received AM251 (A) or vehicle (B) before the delay discounting task. In contrast, the locomotor activity of rats injected with saline (C) remained unaltered (p>0.05).

Figure 2.

Figure 2

AM251 (1.5 mg/kg/i.p.) reverses effects of cocaine pre-exposure on choice of the delayed reward. (A) Shows how cocaine pre-exposure (red circles) consistently reduces the selection of the delayed large reward. Vehicle administration (green triangles) prior to the delay discounting task does not have any effect on the cocaine induced decrease. Both performances are statistically similar (p >0.05). The reductions in the selection of the large delayed reward observed after cocaine and vehicle are statistically significant when contrasted against the baseline performance (p<0.05 denoted by #) during the first three delays (0, 4, 10-s.). (B) As before, cocaine pre-exposure (red circles) consistently reduces the selection of the delayed large reward. However, such reduction is reversed by AM251 administration (light blue triangles) prior to the delay discounting task. When performance is contrasted against baseline, only cocaine’s effects are statistically different; this difference is particularly salient during the first three delays (0, 4, 10-s) (p<0.05 denoted by *). (C) Pre-exposure to saline instead of cocaine has no effect on the selection of the delayed large reward (p >0.05).

Blockade of CB1 receptors curtails cocaine-induced changes in preference for the large reward

Blockade of CB1R prevented the effects on cocaine-induced locomotion (Figure 3A, Figure S4) (t(5) CB1 Blockade+Coc= 1.63, p= 0.16). In contrast, a significant increase in cocaine-induced locomotion was observed in vehicle-treated animals (Figure 3B, Figure S4) (t(5) Vehicle+Coc= 2.62, p= 0.046). Blockade of CB1R along with saline did not produce a discernible effect in locomotion (Figure 3C, Figure S4) (t(5)CB1 blockade+saline= 0.23, p= 0.41). Locomotor differences between pharmacological treatments were a reliable predictor of subsequent self-control. Indeed, vehicle-pretreated rats showed a decrease in preference for the large reward (Figure 4A; S7A). As observed before, there was a significant delay effect between baseline and following treatment (F(4,232) = 51.97, p<0.001), the pretreatment (F(1, 58) =14.74, p<0.001), and for the interaction between delay and pretreatment (F(4,232) = 8.84, p<0.001). Behavior observed across the first four delays (0, 4, 10, 25-s), was significantly different. In this group cocaine reduced the preference for the large reward, during the first 3 delays (0, 4, 10-s). Importantly, CB1R blockade prevented all changes in self-control associated with cocaine pre-exposure (Figure 4B; S7B) as no significant differences were observed during baseline or after treatment (F(1, 58) =0.124, p=0.725) or for the interaction between delay and treatment (F(4,232) = 1.36, p=0.247). Similarly blockade of CB1R alone before saline injections did not affect self-control (Figure 4C; S7C) (F(1, 58) =0.04, p=0.950) and there was no effect for the interaction between delay and treatment (F(4,232) = 0.64, p=0.627). None of the treatments altered the latency to press (t(5) CB1 Blockade+Coc= 0.72, p= 0.251; t(5) Vehicle+Coc= 1.28, p= 0.128; t(5)CB1 blockade+saline= 0.57, p= 0.296) nor the omissions (t(5) CB1 Blockade+Coc= 0.97, p= 0.186; t(5) Vehicle+Coc=0.70, p= 0.255; t(5) CB1 blockade+saline= 0.47, p= 0.326).

Figure 3.

Figure 3

Locomotor activity for the different groups in the blockade experiment. There is an increase in the average locomotor counts between the first and the last cocaine injection in the CB1 blockade + cocaine group (A) as well as in the vehicle + cocaine group (B). However, the increase in locomotor activity between the first cocaine injection and the last cocaine injection is only statistically reliable (p<0.05) for the latter. Locomotor activity of the CB1 blockade + saline group (C) is similar for the first and last saline injection (p>0.05).

Figure 4.

Figure 4

Rimonabant (1.5 mg/kg/i.p.) prevents the effects of cocaine pre-exposure on choice of the delayed reward. (A) When contrasted against baseline (blue triangles) vehicle administration 30-min before cocaine injections (light red circles) consistently reduces the selection of the delayed large reward. The reductions in the selection of the large delayed reward observed after vehicle and cocaine are statistically significant (p<0.05 denoted by *). These differences are particularly pronounced for the first three delays (0, 4, 10-s). These observations contrast to the results obtained when rimonabant (red circles) is administered 30-min before the cocaine injections (B). Performance during baseline and following treatment with rimonabant and cocaine are not statistically different (p>0.05). Similar results are obtained when rimonabant is administered 30-min before saline (magenta circles) (C).

Phasic DA associated with reward delivery recapitulates the behavioral effects

DA release measurements showed that, prior to cocaine exposure, release was greater for the 4-pellet option when the delay was minimal. However, when the delay reached 10-s, this pattern reversed; i.e., there was greater release for the immediate 1-pellet option compared to the delayed 4-pellet alternative (F(4,28) = 7.15, p<0.001) (Figure 5). This pattern of DA release was significantly disrupted by cocaine in vehicle-treated rats (Figure 6AA–D′). DA release for the 4-pellet option was consistently lower at small and moderate delays compared to baseline and this difference was particularly salient when the delay between the two options was identical (F(4,12)=3.36 p=0.045) (Figure 7A–D). Importantly, blockade of CB1R prior to cocaine injections maintained DA release patterns observed prior to cocaine exposure (F(4,12) =0.412, p=0.796) (Figure 8A–D′ and Figure 9A–D).

Figure 5.

Figure 5

Phasic DA release for both reward magnitudes at different delays. Release for the 4 reward pellets (dark blue) is higher for the 2 initial delays (0 and 4-s) when compared to the 1 pellet immediate option (dark red). However, it was only statistically higher (p<0.05 denoted by *) at 0-s delay. There is a peak of phasic DA release for the immediate 1 pellet option when the delay for the 4 pellets reaches 10-s, this difference in release between the options is statistically significant (p<0.05).

Figure 6.

Figure 6

Representative example of DA release to reward delivery at different delays for vehicle pretreatment. DA release for the 1-pellet (A) and 4-pellet (B) options at 0-s and following pre-exposure to vehicle and cocaine (A′ & B′). Current at the peak oxidation potential of DA is plotted as function of time; the insets showing the cyclic voltammogram identifying the detected peak current, denoted by the arrow head, as DA. Below are two-dimensional pseudocolor plots of cyclic voltammograms over time. Dashed lines denote reward delivery. During baseline, release for the 4-pellet option is higher at 0-s (B) when contrasted against the 1-pellet option (A). However, following cocaine and vehicle this pattern reverses (A′& B′). At 10-s, during baseline, the release for the immediate 1-pellet option (C) is higher than that of the 4-pellet delay option (D). This pattern remains following vehicle and cocaine pre-exposure (C′ & D′).

Figure 7.

Figure 7

Bar graphs of pooled DA release to reward delivery at different delays for vehicle-treated rats. (A) Shows the DA profile obtained during baseline. Release observed for the immediate 1-pellet option (blue bar) is paired with release obtained for the 4-pellet option (red bar) across different delays. Release for the 4-pellet option is statistically higher (p<0.05 denoted by *) at 0-s delay compared to its immediate counterpart. When the delay for the 4-pellet option reaches 10-s, there is a significant peak (p<0.05) of phasic DA release associated with the immediate 1-pellet alternative. (B) Injections of vehicle and cocaine produced a change in the DA release profile; following treatment, release for the immediate option (cyan bar) is significantly higher than that observed for the 4-pellet option (light red bar) at zero-delay, whereas release for the 4-pellet option remains stable across delays. Directly contrasting the effect of pre-exposure to vehicle and cocaine on DA release against baseline for the immediate (C) and delay option (D) shows that highest differences are observed when the delay is 0-s. At this delay there is a significant increase in release for the 1-pellet option following pre-exposure to vehicle and cocaine; whereas there is a significant decrease for the 4-pellet option (p<0.05).

Figure 8.

Figure 8

Example of DA release to reward delivery at different delays in rimonabant-treated rats. Release for the 1-pellet (A) and 4-pellet (B) options at 0-s and following rimonabant pretreatment and cocaine (A′ & B′). Current at the peak oxidation potential of DA is plotted as function of time; the insets showing the cyclic voltammogram identifying the detected peak current, denoted by the arrow head, as DA. Below are two-dimensional pseudocolor plots of cyclic voltammograms over time. Dashed lines denote reward delivery. During baseline recording, DA for the 4-pellet option is higher at 0-s (B) when compared to the 1-pellet option (A). Following rimonabant and cocaine this pattern persists (A′& B′). At 10-s, release for the immediate 1-pellet option at baseline (C) is higher than that of the 4-pellet delay option (D). Following rimonabant and cocaine pre-exposure this pattern remains (C′ & D′).

Figure 9.

Figure 9

Bar graphs of pooled DA release to reward delivery at different delays for rimonabant-treated animals. (A) Shows the DA profile obtained during baseline. Release observed for the immediate 1-pellet option (blue bar) is paired with release obtained for the 4-pellet option (red bar) across different delays. Release for the 4-pellet option is statistically higher (p<0.05 denoted by *) at 0-s delay compared to its immediate counterpart. When the delay for the 4-pellet option reaches 10-s, there is a significant peak (p<0.05) in release associated for the immediate 1-pellet alternative. (B) Injections of rimonabant prior to cocaine treatment maintained the release pattern observed during baseline; at 0-delay release is higher for the 4-pellet option (wine bars) than for the immediate option (light blue bars). Similar to baseline, when the delay for the 4-pellet option reaches 10-s, there is a significant peak (p<0.05) of phasic release associated for the immediate 1-pellet alternative. Directly contrasting the effect of pre-exposure to vehicle and cocaine on release to baseline for the immediate (C) and across delay options (D) shows that rimonabant preserves the baseline release pattern since no significant difference (p>0.05) in release at any of the delays for any of the options are observed.

Phasic DA associated with cue presentation does not encode delay or reward magnitude

Cue presentation during forced trials produced phasic DA release; however its amplitude did not differ between different reward magnitudes (F= (1, 7) =1.450, p=0.267) or delays (F= (4, 28) =1.26, p=0.308) (Figure S5A). Likewise, cocaine exposure did not produce changes in release in vehicle-treated rats (Fimmediate=(1,3)=0.831, p= 0.429; Fdelay =(1,3)=0.697, p= 0.465) (Figure S5B–C) or in rats in which CB1R were blocked (Fimmediate=(1,3)=1.369, p= 0.326; Fdelay =(1,3)=2.083, p= 0.244) (Figure S5D–E) compared to baseline.

Discussion

The current study documents the role of eCBs in the development and maintenance of changes in impulsive choice that arise from cocaine exposure, and adds to a growing body of evidence related to the modulatory role that eCBs play in self-control—in particular when it is altered by psychostimulants (7274). We demonstrate that cocaine produces a decrease in self-control that is reversed and blocked by interfering with CB1R signaling. Rats exposed to cocaine show a preference for immediate rewards over delayed ones even when the imposed delay is minimal. Since eCBs also modulate the reinforcing properties of natural rewards and the motivation to obtain food (54; 7577), it could be argued that the results in the present study are the product of a generalized reduction in motivation. However, this seems unlikely, as we did not observe differences between any of the groups in indirect measures of motivation; neither the latencies to respond nor omissions. Others (72; 73) have shown, using higher doses to the one used here, that CB1R antagonists fail to change the subject’s sensitivity towards the delayed large reward. Furthermore, CB1R blockade had no long-term effects on motivation, since we did not observe changes in performance in the blockade experiment when rats were treated with the antagonist along with saline. The indiscriminate preference for the immediate reward in the present study is in contrast to previous studies (10; 11) in which the preference for the smaller immediate reward is evident only after the 10-s delay. A factor that may explain this difference is that in our protocol the location of the levers changed randomly across trial blocks. This modification possibly reduced the presentation of stereotyped lever selection; and it revealed that in addition to impulsivity, cocaine produces a generalized aversion to temporal delays in the pursuit of reward. It is also possible that reduced constitutive activity of cannabinoid CB1 receptors due to inverse agonism might have contributed to the effects reported herein.

Increases in impulsivity following cocaine treatment have been reported by others (3; 9; 10; 11; 78; 79) and have been linked to heightened sensitivity to DA (9; 18), which is theorized to bias responding towards the subjectively more valuable reward (18) or immediate reward (80; 81). Our electrochemical results are the first to report patterned DA release during a DDt; where evidence of temporal bias can be observed. Phasic DA release associated with reward delivery is higher when the delay is below 10-s and decreases to a level that is below that observed for the immediate option. However, following cocaine treatment, phasic release for the immediate option is above that observed for the delay option regardless of the delay. This observation correlates with behavioral performance, in which cocaine exposure increases the preference for immediate rewards and confirms electrophysiological recordings suggesting that phasic DA firing encodes subjective reward value (18). Importantly, blockade of CB1R reverted and prevented this change in self-control. Therefore blockade of eCB signaling may play a previously unrecognized role in facilitating and maintaining long-term changes in reward valuation arising from cocaine exposure. It is possible that the changes observed at the behavioral level as well as the neurochemical correlates result from preventing the binding of eCBs to CB1Rs present on glutamatergic terminals located in several nuclei that send projections to the VTA (82) or by interfering with eCB binding to presynaptic CB1Rs on GABA terminals in the VTA (48; 83; 84). These changes may in turn prevent cocaine-induced increases in DA transients (52; 53) in brain areas related to decision-making and therefore exert the observed changes in self-control. However, as drugs were delivered systemically other targets for these effects could underlie the pharmacological responses.

In striking contrast to the changes in phasic DA release observed following reward delivery, release associated with cue onset at different delays did not encode subjective reward value and was not altered by cocaine exposure. A possible explanation for the lack of cue encoding by phasic DA release in our experimental conditions is that what is encoded in this signal is the effort threshold that has to be overcome in order to obtain a reward (85). Since the cost for the delay and immediate options was the same, the effort threshold and the DA signal that accompanies it should theoretically remain unaltered. Our results agree with Wanat et al. (70), who reported that DA release at cues is unaltered across different delays. However, other neurochemical studies (8689) have found that reward value is encoded by phasic DA release at the presentation of cues. It is likely that methodological differences between the tasks account for some of the discrepancies. Our results were obtained using a task in which the only variable that changed across the set of trials was the time at which the large reward was delivered following a response at the corresponding lever; whereas in prior studies the access to different response requirements (86; 88), a fixed delay (88) or the probability to obtain a reward (87) have been compared. In addition, most of the above-mentioned studies recorded from the NAc core (but see (70)). Our electrodes were aimed at the shell; so it is possible, as previously proposed, that NAc core and shell mediate different aspects of the reward-cue encoding (90).

Here, phasic DA release observed during the reward did not shift towards cues. During learning, phasic DA neuron firing that occurs at reward delivery decreases as rewards become fully predicted and firing activity shifts toward the first predictor of the reward in Pavlovian (89) as and operant paradigms (90; 91). However phasic firing at the first predictors critically depends on the duration of the stimulus-reward interval. As the stimulus-reward interval increases beyond 2-s, the phasic response to the conditioned stimulus decreases and the phasic response to the reward increases (17; 92). This observation is confirmed by our data since in our procedure, the cue was presented 2 s before lever extension and rats had an opportunity to press it and obtain a reward.

In conclusion, our results show that the eCB system plays an important role in decision-making processes, in particular when these have been modified as a result to exposure to cocaine. eCBs are well-known synaptic modulators of different limbic and motor inputs required to organize goal-directed behaviors. Here we show that this system is also crucially involved in the development and maintenance of enduring DA adaptations that result from cocaine exposure. The present data advance our understanding of the role of eCbs in aberrant decision-making in cocaine addiction.

Supplementary Material

01

Acknowledgments

This research was supported by NIH grants DA022340 and DA025890 to JFC; by Fonds de recherche du Québec - Nature et technologies (FQRNT) fellowship to GH and by Natural Science and Engineering Research Council of Canada (#249848-2007) to AA; and by a group grant from the “Fonds de Recherche du Québec–Santé” to the “Groupe de recherche en neurobiologie comportementale”/Center for Studies in Behavioral Neurobiology.

The authors would like to thank Dr. Rose-Marie Karlsson, David Munro and Steve Cabilio for expert technical assistance and Dr. Geoffrey Schoenbaum for valuable comments.

Preliminary data presented in the present article was published previously during the 2011–2012 Society for Neuroscience Meetings; reference 511.06/XX75 and 499.16/DDD68

Footnotes

Disclosure of biomedical financial interests and potential conflicts of interest

The authors report no biomedical financial interests or potential conflicts of interest.

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

References

  • 1.Ainslie G. Specious reward: A behavioral theory of impulsiveness and impulse control. Psychological bulletin. 1975;82:463–496. doi: 10.1037/h0076860. [DOI] [PubMed] [Google Scholar]
  • 2.Rachlin H, Green L, Vi AD. Commitment, choice and self-control. 1972. pp. 15–22. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Loguea W, Tobin H, Chelonis JJ, Wang RY, Geary N, Schachter S. Cocaine decreases self-control in rats: a preliminary report. Psychopharmacology. 1992;109:245–7. doi: 10.1007/BF02245509. [DOI] [PubMed] [Google Scholar]
  • 4.Kirby KN, Petry NM, Bickel WK. Heroin addicts have higher discount rates for delayed rewards than non-drug-using controls. J Exp Psychol Gen. 1999;128:78–87. doi: 10.1037//0096-3445.128.1.78. [DOI] [PubMed] [Google Scholar]
  • 5.Bechara A, Damasio H. Decision-making and addiction (part I): impaired activation of somatic states in substance dependent individuals when pondering decisions with negative future consequences. Neuropsychologia. 2002;40:1675–89. doi: 10.1016/s0028-3932(02)00015-5. [DOI] [PubMed] [Google Scholar]
  • 6.Vuchinich RE, Simpson CA. Hyperbolic temporal discounting in social drinkers and problem drinkers. Exp Clin Psychopharmacol. 1998;6:292–305. doi: 10.1037//1064-1297.6.3.292. [DOI] [PubMed] [Google Scholar]
  • 7.Carroll ME, Anker JJ, Mach JL, Newman JL, Perry JL. Delay discounting as a predictor of drug abuse. In: Bickel GJMWK, editor. Impulsivity: The behavioral and neurological science of discounting. Washington, DC, US: American Psychological Association; 2010. pp. 243–271. [Google Scholar]
  • 8.Bickel WK, Odum AL, Madden GJ. Impulsivity and cigarette smoking: delay discounting in current, never, and ex-smokers. Psychopharmacology. 1999;146:447–454. doi: 10.1007/pl00005490. [DOI] [PubMed] [Google Scholar]
  • 9.Roesch MR, Takahashi Y, Gugsa N, Bissonette GB, Schoenbaum G. Previous cocaine exposure makes rats hypersensitive to both delay and reward magnitude. The Journal of neuroscience. 2007;27:245–250. doi: 10.1523/JNEUROSCI.4080-06.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Simon NW, Mendez Ia, Setlow B. Cocaine exposure causes long-term increases in impulsive choice. Behavioral neuroscience. 2007;121:543–549. doi: 10.1037/0735-7044.121.3.543. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Mendez Ia, Simon NW, Hart N, Mitchell MR, Nation JR, Wellman PJ, et al. Self-administered cocaine causes long-lasting increases in impulsive choice in a delay discounting task. Behavioral neuroscience. 2010;124:470–7. doi: 10.1037/a0020458. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Chao J, Nestler EJ. Molecular neurobiology of drug addiction. Annual review of medicine. 2004;55:113–132. doi: 10.1146/annurev.med.55.091902.103730. [DOI] [PubMed] [Google Scholar]
  • 13.Wise Ra, Rompre PP. Brain dopamine and reward. Annual review of psychology. 1989;40:191–225. doi: 10.1146/annurev.ps.40.020189.001203. [DOI] [PubMed] [Google Scholar]
  • 14.Koob GF. Drugs of abuse: anatomy, pharmacology and function of reward pathways. Trends in Pharmacological Sciences. 1992;13:177–184. doi: 10.1016/0165-6147(92)90060-j. [DOI] [PubMed] [Google Scholar]
  • 15.Roesch MR, Bryden DW. Impact of size and delay on neural activity in the rat limbic corticostriatal system. Frontiers in neuroscience. 2011;5:130. doi: 10.3389/fnins.2011.00130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Tobler PN, Fiorillo CD, Schultz W. Adaptive coding of reward value by dopamine neurons. Science. 2005;307:1642–1645. doi: 10.1126/science.1105370. [DOI] [PubMed] [Google Scholar]
  • 17.Kobayashi S, Schultz W. Influence of reward delays on responses of dopamine neurons. The Journal of neuroscience. 2008;28:7837–7846. doi: 10.1523/JNEUROSCI.1600-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Roesch MR, Calu DJ, Schoenbaum G. Dopamine neurons encode the better option in rats deciding between differently delayed or sized rewards. Nature Neuroscience. 2007;10:1615–1624. doi: 10.1038/nn2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Day JJ, Jones JL, Carelli RM. Nucleus accumbens neurons encode predicted and ongoing reward costs in rats. The European journal of neuroscience. 2011;33:308–321. doi: 10.1111/j.1460-9568.2010.07531.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Di Chiara G, Imperato a. Drugs abused by humans preferentially increase synaptic dopamine concentrations in the mesolimbic system of freely moving rats. Proceedings of the National Academy of Sciences of the United States of America. 1988;85:5274–5278. doi: 10.1073/pnas.85.14.5274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Everitt BJ, Robbins TW. Neural systems of reinforcement for drug addiction: from actions to habits to compulsion. Nature neuroscience. 2005;8:1481–1489. doi: 10.1038/nn1579. [DOI] [PubMed] [Google Scholar]
  • 22.Vezina P, Sprague-dawley H. D1, Dopamine Sensitization Receptor Activation Is Necessary for the Induction by Amphetamine in the Ventral Tegmental Area. The Journal of neuroscience. 1996;16:2411–2420. doi: 10.1523/JNEUROSCI.16-07-02411.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Parsons LH, Justice JB. Serotonin and dopamine sensitization in the nucleus accumbens, ventral tegmental area, and dorsal raphe nucleus following repeated cocaine administration. Journal of neurochemistry. 1993;61:1611–1619. doi: 10.1111/j.1471-4159.1993.tb09794.x. [DOI] [PubMed] [Google Scholar]
  • 24.Kalivas PW, Duffy P. Time Course of Extracellular Dopamine and Behavioral to Cocaine. I. Dopamine Axon Terminals Sensitization. The Journal of Neuroscience. 1993;13:266–275. doi: 10.1523/JNEUROSCI.13-01-00266.1993. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Xu M, Guo Y, Vorhees CV, Zhang J. Behavioral responses to cocaine and amphetamine administration in mice lacking the dopamine D1 receptor. Brain research. 2000;852:198–207. doi: 10.1016/s0006-8993(99)02258-1. [DOI] [PubMed] [Google Scholar]
  • 26.Thompson D, Martini L, Whistler JL. Altered ratio of D1 and D2 dopamine receptors in mouse striatum is associated with behavioral sensitization to cocaine. PloS one. 2010;5:e11038. doi: 10.1371/journal.pone.0011038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Capper-Loup C, Canales JJ, Kadaba N, Graybiel AM. Concurrent activation of dopamine D1 and D2 receptors is required to evoke neural and behavioral phenotypes of cocaine sensitization. The Journal of neuroscience. 2002;22:6218–6227. doi: 10.1523/JNEUROSCI.22-14-06218.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Henry DJ, White FJ. Repeated cocaine administration causes persistent enhancement of D1 dopamine receptor sensitivity within the rat nucleus accumbens. Journal of Pharmacology and Experimental Therapeutics. 1991;258:882–890. [PubMed] [Google Scholar]
  • 29.Xi Z-X, Ramamoorthy S, Shen H, Lake R, Samuvel DJ, Kalivas PW. GABA transmission in the nucleus accumbens is altered after withdrawal from repeated cocaine. The Journal of neuroscience. 2003;23:3498–505. doi: 10.1523/JNEUROSCI.23-08-03498.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Kushner SA, Unterwald EM. Chronic cocaine administration decreases the functional coupling of GABAB receptors in the rat ventral tegmental area as measured by baclofen-stimulated 35S-GTPγS binding. Life Sciences. 2001;69:1093–1102. doi: 10.1016/s0024-3205(01)01203-6. [DOI] [PubMed] [Google Scholar]
  • 31.Jung BJ, Dawson R, Sealey Sa, Peris J. Endogenous GABA release is reduced in the striatum of cocaine-sensitized rats. Synapse. 1999;34:103–110. doi: 10.1002/(SICI)1098-2396(199911)34:2<103::AID-SYN3>3.0.CO;2-Z. [DOI] [PubMed] [Google Scholar]
  • 32.Suzuki T, Abe S, Yamaguchi M, Baba a, Hori T, Shiraishi H, et al. Effects of cocaine administration on receptor binding and subunits mRNA of GABA(A)-benzodiazepine receptor complexes. Synapse. 2000;38:198–215. doi: 10.1002/1098-2396(200011)38:2<198::AID-SYN11>3.0.CO;2-K. [DOI] [PubMed] [Google Scholar]
  • 33.Frankowska M, Nowak E, Filip M. Effects of GABA B receptor agonists on cocaine hyperlocomotor and sensitizing effects in rats. Pharmacological reports. 2009;61:1042–1049. doi: 10.1016/s1734-1140(09)70166-5. [DOI] [PubMed] [Google Scholar]
  • 34.Lhuillier L, Mombereau C, Cryan JF, Kaupmann K. GABA(B) receptor-positive modulation decreases selective molecular and behavioral effects of cocaine. Neuropsychopharmacology. 2007;32:388–398. doi: 10.1038/sj.npp.1301102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Kalivas PW, Stewart J. Dopamine transmission in the initiation and expression of drug- and stress-induced sensitization of motor activity. Brain research. Brain research reviews. 1991;16:223–244. doi: 10.1016/0165-0173(91)90007-u. [DOI] [PubMed] [Google Scholar]
  • 36.Kalivas PW. The glutamate homeostasis hypothesis of addiction. Nature reviews. Neuroscience. 2009;10:561–572. doi: 10.1038/nrn2515. [DOI] [PubMed] [Google Scholar]
  • 37.Kalivas PW, Duffy P. Repeated cocaine administration alters extracellular glutamate in the ventral tegmental area. Journal of neurochemistry. 1998;70:1497–1502. doi: 10.1046/j.1471-4159.1998.70041497.x. [DOI] [PubMed] [Google Scholar]
  • 38.You Z-B, Wang B, Zitzman D, Azari S, Wise Ra. A role for conditioned ventral tegmental glutamate release in cocaine seeking. The Journal of neuroscience. 2007;27:10546–10555. doi: 10.1523/JNEUROSCI.2967-07.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Churchill L, Swanson CJ, Urbina M, Kalivas PW. Repeated cocaine alters glutamate receptor subunit levels in the nucleus accumbens and ventral tegmental area of rats that develop behavioral sensitization. Journal of neurochemistry. 1999;72:2397–2403. doi: 10.1046/j.1471-4159.1999.0722397.x. [DOI] [PubMed] [Google Scholar]
  • 40.Moran MM, McFarland K, Melendez RI, Kalivas PW, Seamans JK. Cystine/glutamate exchange regulates metabotropic glutamate receptor presynaptic inhibition of excitatory transmission and vulnerability to cocaine seeking. The Journal of neuroscience. 2005;25:6389–6393. doi: 10.1523/JNEUROSCI.1007-05.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Fitzgerald LW, Ortiz J, Hamedania G, Nestler EJ. Drugs of abuse and stress increase the expression of GluR1 and NMDAR1 glutamate receptor subunits in the rat ventral tegmental area: common adaptations among cross-sensitizing agents. The Journal of neuroscience. 1996;16:274–282. doi: 10.1523/JNEUROSCI.16-01-00274.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Carlezon Wa, Nestler EJ. Elevated levels of GluR1 in the midbrain: a trigger for sensitization to drugs of abuse? Trends in neurosciences. 2002;25:610–615. doi: 10.1016/s0166-2236(02)02289-0. [DOI] [PubMed] [Google Scholar]
  • 43.Grignaschi G, Burbassi S, Zennaro E, Bendotti C, Cervo L. A single high dose of cocaine induces behavioural sensitization and modifies mRNA encoding GluR1 and GAP-43 in rats. The European journal of neuroscience. 2004;20:2833–2837. doi: 10.1111/j.1460-9568.2004.03712.x. [DOI] [PubMed] [Google Scholar]
  • 44.Ungless Ma, Whistler JL, Malenka RC, Bonci a. Single cocaine exposure in vivo induces long-term potentiation in dopamine neurons. Nature. 2001;411:583–587. doi: 10.1038/35079077. [DOI] [PubMed] [Google Scholar]
  • 45.Alger BE. Retrograde signaling in the regulation of synaptic transmission: focus on endocannabinoids. Progress in neurobiology. 2002;68:247–86. doi: 10.1016/s0301-0082(02)00080-1. [DOI] [PubMed] [Google Scholar]
  • 46.Wilson RI, Nicoll RA. Endogenous cannabinoids mediate retrograde signalling at hippocampal synapses. Nature. 2001;410:588–592. doi: 10.1038/35069076. [DOI] [PubMed] [Google Scholar]
  • 47.Maejima T, Hashimoto K, Yoshida T, Aiba A, Kano M. Presynaptic inhibition caused by retrograde signal from metabotropic glutamate to cannabinoid receptors. Neuron. 2001;31:463–475. doi: 10.1016/s0896-6273(01)00375-0. [DOI] [PubMed] [Google Scholar]
  • 48.Lupica CR, Riegel AC. Endocannabinoid release from midbrain dopamine neurons: a potential substrate for cannabinoid receptor antagonist treatment of addiction. Neuropharmacology. 2005;48:1105–1116. doi: 10.1016/j.neuropharm.2005.03.016. [DOI] [PubMed] [Google Scholar]
  • 49.Haj-Dahmane S, Shen R-Y. Regulation of plasticity of glutamate synapses by endocannabinoids and the cyclic-AMP/protein kinase A pathway in midbrain dopamine neurons. The Journal of physiology. 2010;588:2589–2604. doi: 10.1113/jphysiol.2010.190066. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Melis M, Pistis M, Perra S, Muntoni AL, Pillolla G, Gessa GL. Endocannabinoids mediate presynaptic inhibition of glutamatergic transmission in rat ventral tegmental area dopamine neurons through activation of CB1 receptors. The Journal of neuroscience. 2004;24:53–62. doi: 10.1523/JNEUROSCI.4503-03.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Maldonado R, Valverde O, Berrendero F. Involvement of the endocannabinoid system in drug addiction. Trends in neurosciences. 2006;29:225–232. doi: 10.1016/j.tins.2006.01.008. [DOI] [PubMed] [Google Scholar]
  • 52.Oleson EB, Beckert MV, Morra JT, Lansink CS, Cachope R, Abdullah Ra, et al. Endocannabinoids shape accumbal encoding of cue-motivated behavior via CB1 receptor activation in the ventral tegmentum. Neuron. 2012;73:360–373. doi: 10.1016/j.neuron.2011.11.018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Cheer JF, Wassum KM, Sombers LA, Heien ML, Ariansen JL, Aragona BJ, et al. Phasic dopamine release evoked by abused substances requires cannabinoid receptor activation. J Neurosci. 2007;27:791–795. doi: 10.1523/JNEUROSCI.4152-06.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Hernandez G, Cheer JF. Effect of CB1 receptor blockade on food-reinforced responding and associated nucleus accumbens neuronal activity in rats. The Journal of neuroscience. 2012;32:11467–11477. doi: 10.1523/JNEUROSCI.1833-12.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.De Vries TJ, Schoffelmeer ANM. Cannabinoid CB1 receptors control conditioned drug seeking. Trends in pharmacological sciences. 2005;26:420–426. doi: 10.1016/j.tips.2005.06.002. [DOI] [PubMed] [Google Scholar]
  • 56.De Vries TJ, Shaham Y, Homberg JR, Crombag H, Schuurman K, Dieben J, et al. A cannabinoid mechanism in relapse to cocaine seeking. Nature medicine. 2001;7:1151–1154. doi: 10.1038/nm1001-1151. [DOI] [PubMed] [Google Scholar]
  • 57.De Vries TJ, Homberg JR, Binnekade R, Raasø H, Schoffelmeer ANM. Cannabinoid modulation of the reinforcing and motivational properties of heroin and heroin-associated cues in rats. Psychopharmacology. 2003;168:164–169. doi: 10.1007/s00213-003-1422-1. [DOI] [PubMed] [Google Scholar]
  • 58.Xi Z-X, Gilbert JG, Peng X-Q, Pak AC, Li X, Gardner EL. Cannabinoid CB1 receptor antagonist AM251 inhibits cocaine-primed relapse in rats: role of glutamate in the nucleus accumbens. The Journal of neuroscience. 2006;26:8531–8536. doi: 10.1523/JNEUROSCI.0726-06.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Lesscher HMB, Hoogveld E, Burbach JPH, Van Ree JM, Gerrits MaFM. Endogenous cannabinoids are not involved in cocaine reinforcement and development of cocaine-induced behavioural sensitization. European neuropsychopharmacology. 2005;15:31–37. doi: 10.1016/j.euroneuro.2004.04.003. [DOI] [PubMed] [Google Scholar]
  • 60.Gerdeman GL, Schechter JB, French ED. Context-specific reversal of cocaine sensitization by the CB1 cannabinoid receptor antagonist rimonabant. Neuropsychopharmacology. 2008;33:2747–2759. doi: 10.1038/sj.npp.1301648. [DOI] [PubMed] [Google Scholar]
  • 61.Corbillé A-G, Valjent E, Marsicano G, Ledent C, Lutz B, Hervé D, et al. Role of cannabinoid type 1 receptors in locomotor activity and striatal signaling in response to psychostimulants. The Journal of neuroscience. 2007;27:6937–6947. doi: 10.1523/JNEUROSCI.3936-06.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Li X, Hoffman AF, Peng X-Q, Lupica CR, Gardner EL, Xi Z-X. Attenuation of basal and cocaine-enhanced locomotion and nucleus accumbens dopamine in cannabinoid CB1-receptor-knockout mice. Psychopharmacology. 2009;204:1–11. doi: 10.1007/s00213-008-1432-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Clark JJ, Sandberg SG, Wanat MJ, Gan JO, Horne EA, Hart AS, et al. Chronic microsensors for longitudinal, subsecond dopamine detection in behaving animals. Nat Meth. 2010;7:126–129. doi: 10.1038/nmeth.1412. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Heien MLAV, Phillips PEM, Stuber GD, Seipel AT, Wightman RM. Overoxidation of carbon-fiber microelectrodes enhances dopamine adsorption and increases sensitivity. The Analyst. 2003;128:1413–1419. doi: 10.1039/b307024g. [DOI] [PubMed] [Google Scholar]
  • 65.Oleson EB, Gentry RN, Chioma VC, Cheer JF. Subsecond dopamine release in the nucleus accumbens predicts conditioned punishment and its successful avoidance. The Journal of neuroscience. 2012;32:14804–8. doi: 10.1523/JNEUROSCI.3087-12.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Serrano A, Parsons LH. Pharmacology & therapeutics. Vol. 132. Elsevier Inc; 2011. Endocannabinoid influence in drug reinforcement, dependence and addiction-related behaviors; pp. 215–41. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Cippitelli A, Bilbao A, Hansson AC, Del Arco I, Sommer W, Heilig M, et al. Cannabinoid CB1 receptor antagonism reduces conditioned reinstatement of ethanol-seeking behavior in rats. The European journal of neuroscience. 2005;21:2243–2251. doi: 10.1111/j.1460-9568.2005.04056.x. [DOI] [PubMed] [Google Scholar]
  • 68.Evenden JL, Ryan CN. The pharmacology of impulsive behaviour in rats: the effects of drugs on response choice with varying delays of reinforcement. Psychopharmacology. 1996;128:161–170. doi: 10.1007/s002130050121. [DOI] [PubMed] [Google Scholar]
  • 69.Heien MLAV, Khan AS, Ariansen JL, Cheer JF, Phillips PEM, Wassum KM, et al. Real-time measurement of dopamine fluctuations after cocaine in the brain of behaving rats. Proceedings of the National Academy of Sciences. 2005;102:10023–10028. doi: 10.1073/pnas.0504657102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Wanat MJ, Kuhnen CM, Phillips PEM. Delays conferred by escalating costs modulate dopamine release to rewards but not their predictors. The Journal of neuroscience. 2010;30:12020–12027. doi: 10.1523/JNEUROSCI.2691-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Winstanley C, Dalley JW, Theobald DEH, Robbins TW. Global 5-HT depletion attenuates the ability of amphetamine to decrease impulsive choice on a delay-discounting task in rats. Psychopharmacology. 2003;170:320–331. doi: 10.1007/s00213-003-1546-3. [DOI] [PubMed] [Google Scholar]
  • 72.Pattij T, Janssen MCW, Schepers I, González-Cuevas G, de Vries TJ, Schoffelmeer ANM. Effects of the cannabinoid CB1 receptor antagonist rimonabant on distinct measures of impulsive behavior in rats. Psychopharmacology. 2007;193:85–96. doi: 10.1007/s00213-007-0773-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Wiskerke J, Stoop N, Schetters D, Schoffelmeer ANM, Pattij T. Cannabinoid CB1 receptor activation mediates the opposing effects of amphetamine on impulsive action and impulsive choice. PloS one. 2011;6:e25856. doi: 10.1371/journal.pone.0025856. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 74.Kleijn J, Wiskerke J, Cremers TIFH, Schoffelmeera NM, Westerink BHC, Pattij T. Effects of amphetamine on dopamine release in the rat nucleus accumbens shell region depend on cannabinoid CB1 receptor activation. Neurochemistry international. 2012;60:791–798. doi: 10.1016/j.neuint.2012.03.002. [DOI] [PubMed] [Google Scholar]
  • 75.Arnone M, Maruani J, Chaperon F, Thiébot MH, Poncelet M, Soubrié P, et al. Selective inhibition of sucrose and ethanol intake by SR 141716, an antagonist of central cannabinoid (CB1) receptors. Psychopharmacology. 1997;132:104–6. doi: 10.1007/s002130050326. [DOI] [PubMed] [Google Scholar]
  • 76.Freedland CS, Poston JS, Porrino LJ. Effects of SR141716A, a central cannabinoid receptor antagonist, on food-maintained responding. Pharmacology, biochemistry, and behavior. 2000;67:265–70. doi: 10.1016/s0091-3057(00)00359-2. [DOI] [PubMed] [Google Scholar]
  • 77.Thornton-Jones ZD, Kennett Ga, Benwell KR, Revell DF, Misra A, Sellwood DM, et al. The cannabinoid CB1 receptor inverse agonist, rimonabant, modifies body weight and adiponectin function in diet-induced obese rats as a consequence of reduced food intake. Pharmacology, biochemistry, and behavior. 2006;84:353–359. doi: 10.1016/j.pbb.2006.06.001. [DOI] [PubMed] [Google Scholar]
  • 78.Monterosso J, Ehrman R, Napier KL, O’Brien CP, Childressa R. Three decision-making tasks in cocaine-dependent patients: do they measure the same construct? Addiction. 2001;96:1825–1837. doi: 10.1046/j.1360-0443.2001.9612182512.x. [DOI] [PubMed] [Google Scholar]
  • 79.Coffey SF, Gudleski GD, Saladin ME, Brady KT. Impulsivity and rapid discounting of delayed hypothetical rewards in cocaine-dependent individuals. Experimental and Clinical Psychopharmacology. 2003;11:18–25. doi: 10.1037//1064-1297.11.1.18. [DOI] [PubMed] [Google Scholar]
  • 80.Niv Y, Daw ND, Joel D, Dayan P. Tonic dopamine: opportunity costs and the control of response vigor. Psychopharmacology. 2007;191:507–520. doi: 10.1007/s00213-006-0502-4. [DOI] [PubMed] [Google Scholar]
  • 81.Niv Y, Duff MO, Dayan P. Dopamine, uncertainty and TD learning. Behavioral and brain functions. 2005:6. doi: 10.1186/1744-9081-1-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Geisler S, Derst C, Veh RW, Zahm DS. Glutamatergic afferents of the ventral tegmental area in the rat. The Journal of neuroscience. 2007;27:5730–5743. doi: 10.1523/JNEUROSCI.0012-07.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Cohen C, Perrault G, Voltz C, Steinberg R, Soubrie P. SR141716, a central cannabinoid (CB(1)) receptor antagonist, blocks the motivational and dopamine-releasing effects of nicotine in rats. Behav Pharmacol. 2002;13:451–463. doi: 10.1097/00008877-200209000-00018. [DOI] [PubMed] [Google Scholar]
  • 84.Szabo B, Siemes S, Wallmichrath I. SHORT COMMUNICATION Inhibition of GABAergic neurotransmission in the ventral tegmental area by cannabinoids. European Journal of Neuroscience. 2002;15:2057–2061. doi: 10.1046/j.1460-9568.2002.02041.x. [DOI] [PubMed] [Google Scholar]
  • 85.Phillips PEM, Walton ME, Jhou TC. Calculating utility: preclinical evidence for cost-benefit analysis by mesolimbic dopamine. Psychopharmacology. 2007;191:483–495. doi: 10.1007/s00213-006-0626-6. [DOI] [PubMed] [Google Scholar]
  • 86.Gan JO, Walton ME, Phillips PEM. Dissociable cost and benefit encoding of future rewards by mesolimbic dopamine. Nature neuroscience. 2010;13:25–27. doi: 10.1038/nn.2460. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Sugam Ja, Day JJ, Wightman RM, Carelli RM. Phasic nucleus accumbens dopamine encodes risk-based decision-making behavior. Biological psychiatry. 2012;71:199–205. doi: 10.1016/j.biopsych.2011.09.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Day JJ, Jones JL, Wightman RM, Carelli RM. Phasic nucleus accumbens dopamine release encodes effort- and delay-related costs. Biological psychiatry. 2010;68:306–309. doi: 10.1016/j.biopsych.2010.03.026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Schultz W. Dopamine signals for reward value and risk: basic and recent data. Behavioral and brain functions. 2010;6:24. doi: 10.1186/1744-9081-6-24. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Day JJ, Roitman MF, Wightman RM, Carelli RM. Associative learning mediates dynamic shifts in dopamine signaling in the nucleus accumbens. Nature neuroscience. 2007;10:1020–1028. doi: 10.1038/nn1923. [DOI] [PubMed] [Google Scholar]
  • 91.Waelti P, Dickinson A, Schultz W. Dopamine responses comply with basic assumptions of formal learning theory. Nature. 2001;412:43–48. doi: 10.1038/35083500. [DOI] [PubMed] [Google Scholar]
  • 92.Fiorillo CD, Newsome WT, Schultz W. The temporal precision of reward prediction in dopamine neurons. Nature neuroscience. 2008;11:966–973. doi: 10.1038/nn.2159. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

01

RESOURCES