Abstract
There is ample evidence that blockade of CB1 receptors reduces reward seeking. However, the reported effects of CB1 blockade on performance for rewarding electrical brain stimulation stand out as an exception. By applying a novel method for conceptualizing and measuring reward seeking, we show that AM-251, a CB1 receptor antagonist, does indeed decrease performance for rewarding electrical stimulation of the medial forebrain bundle in rats. Reward seeking depends on multiple sets of variables, including the intensity of the reward, its cost, and the value of competing rewards. In turn, reward intensity depends both on the sensitivity and gain of brain reward circuitry. We show that drug-induced changes in sensitivity cannot account for the suppressive effect of AM-251 on reward seeking. Therefore, the role of CB1 receptors must be sought among the remaining determinants of performance. Our analysis provides an explanation of the inconsistencies between prior reports, which likely arose from the following: (1) the averaging of data across subjects showing heterogeneous effects and (2) the use of methods that cannot distinguish between the different determinants of reward pursuit. By means of microdialysis, we demonstrate that blockade of CB1 receptors attenuates nucleus accumbens dopamine release in response to rewarding medial forebrain bundle stimulation, and we propose that this action is responsible for the ability of the drug to decrease performance for the electrical reward.
Introduction
Rats work vigorously for electrical stimulation of the medial forebrain bundle (MFB) (Olds and Milner, 1954), a phenomenon known as intracranial self-stimulation (ICSS). The effect that the rat seeks to reinitiate is called “brain stimulation reward” (BSR) (Table 1). Like pharmacological and natural rewards, rewarding MFB stimulation causes dopamine (DA) release in the nucleus accumbens (NAc) (Hernandez and Hoebel, 1988; You et al., 2001; Hernandez et al., 2006).
Table 1.
Glossary of technical terms
| Term | Definition |
|---|---|
| BSR | The effect that leads the subject to seek additional stimulation. |
| Curve-shift scaling | A method for scaling the effect of a manipulation in terms of the offsetting change in the stimulation strength required to hold operant performance for BSR constant. Performance (e.g., time allocation or response rate) is plotted on the y-axis and is measured as a function of stimulation strength, which is plotted on the x-axis; lateral displacement of the resulting psychometric curve is measured. |
| Fhm | The pulse frequency at which reward intensity is half maximal. |
| Fm50 | The pulse frequency along a frequency-sweep curve at which time allocation lies halfway between the lower (TAmin) and upper (TAmax) asymptotes. (Fm50 = Fhm only when the price is one half of Pe.) |
| ICSS | Intracranial self-stimulation |
| Location parameters | (Fhm, Pe), the parameters of the reward mountain that determine its location in the plane defined by the common logarithms of the pulse frequency and the price. |
| Opportunity cost | Price, the work time required to earn a reward, scaled in terms of the value of alternate activities forgone. |
| Pe | The price at which time allocation for a maximal reward lies halfway between the lower (TAmin) and upper (TAmax) asymptotes. |
| Price | Opportunity cost, the cumulative time the lever must be held down in order to earn a train of rewarding stimulation. |
| Psychometric curve | A curve expressing a dependent behavioral variable (e.g., time allocation) as a function of an independent physical variable (e.g., pulse frequency). |
| Psychometric surface | A surface expressing a dependent behavioral variable (e.g., time allocation) as a function of two independent physical variables (e.g., pulse frequency and price) |
| Reward intensity | The subjective strength of the reward, analogous to the subjective variable that makes a highly concentrated sucrose solution more rewarding than a more dilute one. |
| Reward mountain | A psychometric surface in a 3D space describing how time allocation (z-axis) varies as a function of the strength (y-axis) and cost (x-axis) of BSR. |
| TAmax | Maximal time allocation; the upper asymptote of 2D psychometric curves or 3D psychometric surfaces. |
| TAmin | Minimal time allocation; the lower asymptote of 2D psychometric curves or 3D psychometric surfaces. |
| TA | The proportion of trial time spent working for BSR. |
Curve-shift scaling (Edmonds and Gallistel, 1974, 1977; Miliaressis et al., 1986) is used widely to infer effects of drugs on BSR from displacement of psychometric curves linking stimulation strength to instrumental performance. CB1 receptor (CB1R) ligands have produced inconsistent effects in the curve-shift paradigm (Arnold et al., 2001; Deroche-Gamonet et al., 2001; Vlachou et al., 2003; De Vry et al., 2004; Vlachou et al., 2005; Xi et al., 2008). This contrasts sharply with the consistent effects of CB1R ligands on performance for food and drugs (Solinas et al., 2008).
Hernandez et al. (2010) have extended curve-shift scaling by measuring and modeling performance for BSR as a joint function of the strength of the stimulation, as determined by the pulse frequency, and its opportunity cost (“price”): the cumulative time required to earn a reward (Fig. 1). The proportion of a subject's time devoted to reward seeking [time allocation (TA)] increases as a function of pulse frequency and decreases as a function of price (Fig. 1B,C). The resulting three-dimensional (3D) structure (Eq. 1) is dubbed the “reward mountain.”
Figure 1.
Graphical representation of the mountain model. A, In the initial stages of processing, an intensity-growth function transforms the aggregate spike rate induced by the stimulation train in the directly stimulated neurons into a reward-intensity signal. Following rescaling, the peak reward intensity is transferred to memory. The payoff from BSR (UB) is computed by discounting the stored reward-intensity value by the probability that a reward will be delivered when the work requirement has been met and by the effort and opportunity cost of the reward. The proportion of time the animal invests in working for BSR is determined by a comparison of UB, suitably transformed (Hernandez et al., 2010), to the sum of the suitably transformed values of both BSR and the payoff from competing activities (UE). B, Increasing the value of Fhm, the location parameter of the intensity-growth function, shifts the reward mountain rightward along the frequency axis of the 3D space. C, Reducing the value of the Pe parameter by downward rescaling of the output of the intensity-growth function, reduced reward probability, increased reward costs, or increased competition from alternative activities shifts the reward mountain leftwards along the price axis (Eq. 1; see Notes).
The pulse frequency that produces a half-maximal reward, Fhm, sets the position of the mountain along the pulse-frequency axis, whereas the price at which the rat spends half its time working for a maximal BSR, Pe, sets the position along the price axis. These location parameters reflect different stages in the translation of stimulation-induced firings into reward-seeking behavior (Fig. 1A). Drug action before the output of the “intensity-growth” function that translates the firing rate of the directly activated neurons into a subjective reward intensity (Gallistel and Leon, 1991; Leon and Gallistel, 1992; Simmons and Gallistel, 1994) alters Fhm (Fig. 1B), whereas drug action at later stages alters Pe (Fig. 1A,C) (Hernandez et al., 2010). The sensitivity of the reward substrate determines Fhm (Fig. 1B) and is analogous to the affinity of a ligand for a receptor. The gain of the substrate determines the maximal reward intensity attainable; it is analogous to receptor density and is reflected in Pe, as are alterations in perceived costs or in the value of competing activities (Fig. 1C).
Arvanitogiannis and Shizgal (2008) and Hernandez et al. (2010) have shown that displacements of the 3D reward mountain along the axes representing the strength or cost of reward cannot be distinguished on the basis of conventional two-dimensional (2D) measurements, such as curve shifts or progressive-ratio break points (Hodos, 1961; Keesey and Goldstein, 1968). Thus, we used the novel 3D measurement method to determine whether CB1Rs modulate BSR, and, if so, to constrain the stage(s) of processing to which these receptors contribute. We also show that CB1R blockade attenuates the ability of rewarding MFB stimulation to boost extracellular DA concentrations in the NAc, which could explain the decrease produced by this treatment in the opportunity cost at which rats maintain performance for BSR.
Materials and Methods
Subjects.
Subjects were 19 male Long–Evans rats from Charles River Breeding Farms. Thirteen of these animals took part in the intracranial self-stimulation (ICSS) experiment, and the rest took part in the microdialysis experiment. The rats were housed in Plexiglas cages in a vivarium with controlled temperature and reversed 12 h dark/light cycle. Food and water were available ad libitum. The behavioral procedures were conducted during the dark phase of the cycle, between 7:30 A.M. and 2:00 P.M. All procedures complied with the principles of the Canadian Council on Animal Care.
Implantation of electrodes and cannulas.
Rats weighed 400–550 g at the time of surgery. We administered atropine sulfate (0.05 mg/kg, s.c.) to reduce bronchial secretions. Anesthesia was induced with ketamine–xylazine (10–100 mg/kg, i.p.) and maintained with isoflurane vapor. Penicillin (0.3 ml/kg, i.m.) was administered to prevent infections. Before the rat was mounted in the stereotaxic frame, xylocaine jelly was applied to the external auditory meatus to reduce discomfort from the ear bars. Monopolar stainless-steel electrodes were constructed from 000 insect pins and insulated with Formvar to within 0.5 mm of the tip. The electrodes were aimed bilaterally at the lateral hypothalamic level of the MFB [anteroposterior (AP): −2.8, mediolateral (ML): ±1.7, dorsoventral (DV): 8.7–8.9 from the skull]. Four stainless-steel jeweler screws were threaded into pilot holes drilled in the skull; the electrodes were anchored to these screws with dental acrylic. A length of wire wrapped around two of the screws served as the current return. Gold-plated Amphenol connectors, attached via a short length of wire to each of the electrodes and the skull-screw return, were inserted into a McIntyre Miniature Connector (Scientific Technology Centre, Carleton University, Ottawa, ON, Canada), which was attached to the skull screws with dental acrylic to form a head cap. In the rats destined for the microdialysis experiment, 20 gauge guide cannulas were aimed bilaterally at the NAc (1.5 AP, 2.8 ML, and −5.4 DV from skull at a 10° angle), in addition to the MFB stimulation electrodes. Buprenorphine (0.05 mg/kg, s.c.) was administered immediately following surgery to reduce subsequent pain. Rats were allowed 5–7 d of recovery before behavioral training began.
Apparatus.
Behavioral testing was performed in four plastic operant boxes (30 × 21 × 51 cm) with a mesh floor and a clear Plexiglas front. Each box was equipped with a flashing light, located 10 cm above the floor mesh, and a retractable lever (ENV–112B, MED Associates) mounted on the right side wall. A 1 cm light was located 2 cm above the lever and was activated when the rat depressed the lever.
The temporal parameters of the electrical stimulation were set by a computer-controlled digital pulse generator, and pulse amplitude was determined by a computer-controlled constant-current amplifier. Stimulation consisted of 0.5 s trains of cathodal pulses, 0.1 ms in duration. The stimulation current was routed to the rat through a multichannel slip ring that allowed the rat to circle without tangling the leads. Experimental control and data acquisition were handled by a personal computer running a custom-written program (“PREF”) developed by Steve Cabilio (Concordia University, Montreal, QC, Canada). The stimulation was monitored on an oscilloscope by displaying the potential drop across a 1% precision resistor in series with the rat.
The behavioral phase of the microdialysis experiment was conducted in the previously described setup. In the neurochemical-sampling phase, the rats were transferred to similar operant chambers from which the levers had been removed, and dialysate samples were collected. Stimulation trains were programmed by a Master-8 pulse generator (A.M.P.I.), controlled by LabView software (National Instruments), and delivered by a constant current amplifier (Mundl, 1980). An infusion pump (Harvard Instruments) was connected by polyethylene tubing (PE-20) to a fluid swivel located at the top of the chamber. The second port of the swivel was connected to one end of a 50 cm length of polyethylene tubing, and a microdialysis probe was connected to the other end. A small diameter silica tube, extending into the tip of the microdialysis probe, completed the fluid circuit. The probes were described in detail previously (Hernandez et al., 2006, 2007).
Self-stimulation training.
For each rat, we determined the stimulating electrode and the current-frequency combination that supported vigorous lever pressing with minimum aversive side effects. From that point onwards the current and stimulating electrode were held constant. Rats were then trained to keep the lever depressed for a cumulative time of 4 s to receive the stimulation. Once this task had been mastered, training commenced on the “frequency-sweep” procedure. Each sweep consisted of a set of trials during which the stimulation parameters were held constant, and the rat had the opportunity to harvest as many as 20 rewards. Following delivery of each reward, the lever was disarmed and retracted for 2 or 3 s. The pulse frequency during the first three trials was set to the highest value the rat could tolerate without signs of aversion or forced movement. Over the subsequent eight trials, the pulse frequency was decreased systematically from trial to trial in equal proportional steps. The dependent variable was a corrected measure of the proportion of trial time that the lever was depressed (time allocation) (Breton et al., 2009). The range of pulse frequencies was selected to drive time allocation from its maximal to its minimal values, in sigmoidal fashion. Every trial was preceded by a 10 s intertrial interval signaled by a flashing light. During the last 2 s of this period rats received priming stimulation consisting of two stimulation trains at the maximum pulse frequency that the rat could tolerate, delivered at 1 train s−1.
After the subject showed consistently high asymptotic values of time allocation (not lower than 0.8) in at least the first two trials and low asymptotic values (<0.2) in at least the last two trials of each determination, we introduced two new types of sweeps. During “price sweeps,” the rats had to hold down the lever for increasing cumulative periods (i.e., prices) to obtain a stimulation train of maximal strength. The duration of each trial was adjusted to allow the rat to harvest a maximum of 20 rewards. After consistently high and low asymptotic time-allocation values (≥0.8 and ≤0.2, respectively) were observed in price-sweep data, a new “radial” sweep was added. In a radial sweep, the required price increased, and the stimulation strength decreased simultaneously across sequential trials. The stimulation-price combinations and the spacing between the trials were calculated so that the vector described by the radial sweep in the parameter space [log10(P) vs log10(F)] passed through, or very near, the point defined by the fitted values of the location parameters [log10(Pe), log10(Fhm)] (see Fig. 3A,B). This was achieved using the data from the frequency and price sweeps and a simulator developed by Yannick Breton and implemented in MATLAB (The MathWorks).
Figure 3.

Fit of the mountain model to the time-allocation data obtained following treatment with AM-251 and vehicle. A, B, Wire-mesh surfaces fitted to the vehicle (A) and drug (B) data from rat C19. The red, green, and blue dots represent mean time-allocation values for the frequency, radial, and price sweeps respectively, and the solid vertical red and blue lines represent the location parameters, Fhm and Pe, respectively.
Two sweeps of each type were run during every session. We use the term “survey” to refer to the combination of a frequency, a price, and a radial sweep; these provide the minimal dataset required to fit the mountain model. The sequence of sweeps was random within session for subjects C8–C14 and random within survey for C17–C20. In the latter case, the rats had to complete a full survey before any of the sweeps were repeated; this adjustment was made to increase the power of the resampling-based surface-fitting approach (see below). Each rat performed under these conditions for four sessions, and then the model was fitted (see below, Self-stimulation data: model fitting and comparisons). If the radial sweep deviated excessively from the fitted values of [log10(Pe), log10(Fhm)] or if the upper or lower asymptotic time-allocation values were insufficiently well defined, the sequence of prices and pulse frequencies was readjusted. Each rat was considered ready for behavioral or in vivo microdialysis drug testing when its responding was consistent throughout sessions and the trajectory of the radial sweep passed sufficiently close to [log10(Pe), log10(Fhm)]. Rats required 5 weeks of training, on average, to reach the drug-testing phase. Rats that failed to meet the criteria described above were excluded from the experiment.
Self-stimulation testing under the influence of AM-251 and its vehicle.
Each session consisted of a warm-up frequency sweep, followed by two price, two radial, and two frequency sweeps, either randomized within sessions (rats C8–C14) or randomized within surveys (rats C17–C20).
AM-251 (3 mg/kg; Tocris Bioscience) was diluted in 90% ethanol (90 μl/mg), cremophor (90 μl/mg), and 0.9% saline (900 μl/mg). The drug or its vehicle was administered at a volume of 3 ml/kg, i.p., 30 min before each behavioral test. This dose was chosen in accordance with previous studies (Xi et al., 2006, 2008). The stimulation frequencies and prices in the vehicle sessions were the same as those determined in the training phase. During the drug sessions, the price values tested in the price sweep were decreased by 0.1–0.2 log10 units on the basis of leftward shifts along the price axis observed during pilot testing (data not shown).
At least one “washout” day followed each drug session to allow for elimination of the drug. Rats received vehicle injections on Mondays and Thursdays, drug injections on Tuesdays and Fridays; Wednesdays, Saturdays, and Sundays were washout days. Eight to 12 test sessions, 5–6 h in duration, were run in both the drug and vehicle conditions. Approximately 3 months were required, following the initial surgery, to complete testing of each subject.
Following behavioral testing, rats were overdosed with ketamine–xylazine. As described previously (Hernandez et al., 2007), stimulation sites were marked by means of the Prussian Blue method and located by microscopic inspection of formol–thionine stained brain sections, with reference to an atlas of the rat brain (Paxinos and Watson, 2007).
Self-stimulation data: model fitting and comparisons.
Equation 1 describes the mountain model, as follows:
![]() |
where a is the constant determining the abruptness with which TA grows as the payoff from BSR increases; F is the pulse frequency; Fhm is the pulse frequency that produces a half-maximal reward; g is the constant determining the abruptness with which reward intensity grows as F is increased; P is the price (opportunity cost) of a stimulation train, the cumulative time the lever must be depressed in order for delivery of a stimulation train to be triggered; and Pe is the price at which the payoff from a maximally intense BSR equals the payoff from competing activities.
Among the objectives of the model-fitting approach were unbiased estimates of location-parameter (Fhm, Pe) values and their dispersions for each subject. This was accomplished by means of a MATLAB (The MathWorks) procedure developed by Kent Conover, based on the nonlinear least-squares routine in the MATLAB Optimization Toolbox and re sampling methods (Efron and Tibshirani, 1994). A primary fit of the six-parameter model presented in Equation 1 was performed independently to the data from each session (subject C8–C14) or survey (C17–C20) in each condition; this was done using the “location-specific approach” (Hernandez et al., 2010). This approach entails fitting individual values of the two location parameters to the data for each session or survey while using common values of the four remaining parameters. The reason for this procedure is to protect the values of the two slope parameters, a and g (Eq. 1), from the degradation that would ensue from fitting common values of all parameters to datasets that shift in the parameter space from session to session (Hernandez et al., 2010); such shifts would be expected to arise from unavoidable variation in drug administration, absorption, etc. Following the primary fit, the data were then resampled with replacement by session or survey, 1000 times; the model was fitted to each resampled dataset as described above. Estimates of the mean value of each parameter and the corresponding 95% confidence interval were computed over the 1000 fits; in the case of the location parameters, the session-specific values were averaged within each set of fits to a given resampled dataset. The 95% confidence intervals were percentile-based: they exclude the lowest and highest 25 of the 1000 values (see Fig. 5).
Figure 5.
AM-251-induced shifts in the location of the mountain. A, D, The contour graph of the surface fitted to the vehicle data (Fig. 3, Rat C19) is shown twice. This representation provides reference points for visualizing any AM-251-induced shifts of the mountain along the price and pulse-frequency axes. C, The contour graph of the surface fitted to the data obtained from the same subject in the drug condition. As in Figure 3, the red, blue, and green dots represent the values of the independent variables used to obtain the frequency, price, and radial sweeps, respectively. The solid red and dashed red horizontal lines represent the values of the Fhm parameter for the vehicle and AM-251 condition, respectively. Note the near overlap of these two lines. The solid blue and dashed blue vertical lines represent the values of the Pe parameter for the vehicle and drug condition, respectively. The accompanying blue arrow indicates a statistically reliable decrease in Pe. B, The changes in both location parameters are contrasted in the bar graph; these changes are the difference between the common logarithmic value of each parameter for the drug and vehicle conditions. Error bars denote 95% confidence intervals. In this subject, AM-251 shifted the mountain along the price, but not the pulse-frequency, axis (*p < 0.05).
The seven-parameter model described previously (Hernandez et al., 2010) allowed us to account for the exceptionally high time allocation observed at the lower pulse frequencies during the frequency sweeps for subject C17; according to the Akaike information criterion (Akaike, 1974), this model provided a better fit (data not shown) than the standard model, but only in the case of this one rat.
A difference vector was constructed for each location parameter in each subject by subtracting, element by element, the 1000 estimates for the AM-251 condition from the 1000 estimates for the vehicle condition. The mean changes in parameter values reported here represent the mean of this difference vector, whereas the 95% confidence intervals are simply its 2.5th and 97.5th percentiles (see Fig. 5). If the confidence interval did not include zero, the difference between conditions was considered statistically reliable, with an α level of 0.05.
Quantification of NAc DA release produced by rewarding MFB stimulation: in vivo microdialysis.
The rats in this phase of the study also underwent ICSS training, and the 3D model was fitted to each rat's data, as described above. The obtained parameters were used to estimate the pulse frequency that drove reward intensity to 95% of its maximum value (Eq. 2).
where F95 is the pulse frequency that produces a subjective reward intensity equal to 95% of the maximal attainable value, Fhm is the pulse frequency that produces half-maximal reward, and g is the parameter that determines the rate at which subjective reward intensity grows as a function of pulse frequency.
Rats were transferred to the microdialysis testing room 14 h before dialysate collection commenced. They were lightly anesthetized with isoflurane, and the microdialysis probes were inserted bilaterally into the NAc through the guide cannulae. Once the probes were in place, artificial CSF (145 mm Na+, 2.7 mm K+, 1.22 mm Ca2+, 1.0 mm Mg2+, 150 mm Cl−, 0.2 mm ascorbate, 2 mm Na2HPO4, pH = 7.4 ± 0.1) was pumped through them continuously, at a rate of 0.3 μl/h, to prevent the membrane from occluding. Food and water were available ad libitum. Two hours before sampling began, food was removed from the chamber and the flow was increased to 1.0 μl/h. Samples were then collected every 20 min. Baseline values for the DA concentration in the dialysate were obtained over the first 60 min of sampling (three samples). Animals then received either an injection of AM-251 (3 mg/kg) or its vehicle. Three dialysate samples were collected following the injection. This provided sufficient time for absorption and distribution of the drug and sufficient information to measure the effect of the drug on basal levels of NAc DA. Following collection of these samples, electrical stimulation was delivered for 2 h (six samples) at unpredictable intervals, according to a VT12 schedule. The stimulation pulse frequency was set to F95 for each rat. Six additional samples were collected after delivery of the stimulation ceased.
All animals received both AM-251 and its vehicle, in counterbalanced order, on different days. Drug administration sessions were always followed by a washout day during which the flow rate was reduced to 0.3 μl/h, and no samples were collected.
DA and its metabolites were quantified by means of electrochemical detection, using high performance liquid chromatography, as described in detail previously (Hernandez et al., 2006, 2007). Neurochemical data were analyzed by means of a two-way, repeated-measures ANOVA, using the “treatment” (drug/vehicle) and “time” (time of sampling, 18 samples per each treatment per rat) as factors. The effects of the drug on basal DA levels, the effects of stimulation on DA tone, and the differences between drug and vehicle during stimulation were then assessed by means of planned comparisons.
Simulation of “2D curve-shifts.”
On the basis of the mountain model and the fitted parameter values for each rat, we estimated the frequency required to support half-maximal performance (Fm50), the value that would have been obtained in a conventional curve-shift experiment (Eq. 3). To account for the low price paid for reward when the commonly used, continuous-reinforcement schedule is in force, we set the price to 0.1 s. In accordance with the practice in most prior studies linking CB1Rs with BSR (Arnold et al., 2001; Deroche-Gamonet et al., 2001; Vlachou et al., 2003; De Vry et al., 2004; Vlachou et al., 2005; Xi et al., 2008), the simulated Fm50 values were averaged within condition (drug or vehicle) for each subject. The paired means for all subjects were then compared across conditions using a paired-sample t test.
![]() |
where Fm50 is the pulse frequency that produces half-maximal time allocation, Fhm is the pulse frequency that produces half-maximal reward intensity, g is the exponent (growth constant) of the intensity-growth function, P is the price (opportunity cost) of the stimulation train, and Pe is the price at which the rat devotes half of its time to harvesting a reward of maximal intensity.
Results
The tip of the stimulating electrode in all eight subjects was within the MFB, at the level of the lateral hypothalamus (Fig. 2, top). The probes for the microdialysis subjects were located within the NAc (Fig. 2, bottom).
Figure 2.
Electrode and cannula placement. Top, The location of electrode tips. All electrode placements fell within the boundaries of the MFB at the level of the lateral hypothalamus, as determined by the Paxinos and Watson atlas (2007). Bottom, The location of cannulas for rats in the microdialysis experiment. Tips of all probes fell within the NAc.
The dependent measure was the proportion of trial time that the lever was depressed as a function of the pulse frequency and the price. The mountain model was fitted to these data to determine the Fhm and Pe parameter values and their associated confidence intervals, for each rat under each condition. As an example, Figure 3 shows the fit to the drug and vehicle data from subject C19, the location-parameter estimates, and their confidence intervals for each condition. Two-dimensional representations of the fitted sweeps from subject C19 are shown in Figure 4.
Figure 4.
Two-dimensional representations of results from Rat C19. A, The frequency-sweep data for each condition along with the 2D projections of the fitted surfaces. B, The price-sweep data for each condition along with the 2D projections of the fitted surfaces. C, The radial-sweep data and corresponding 2D projections, shown against the pulse-frequency axis. D, The radial-sweep data and corresponding 2D projections, shown against the price axis.
Changes in the values of the location parameters produced by AM-251 were assessed independently for each rat. Figure 5 shows contour-graph representations of the fits to the data from subject C19 along with the drug-induced changes in the location parameters. The contour graph for the drug condition (Fig. 5C) is displaced leftward with respect to the contour graph for the vehicle condition (Fig. 5A), whereas the vertical positions of the two contour graphs (Fig. 5C,D) are similar. Thus, AM-251 failed to alter the Fhm parameter but produced a substantial (nearly 0.2 log10 unit) decrease in the value of the Pe parameter (Fig. 5B).
The rows of blue diamonds in Figure 5, A, C, and D, denote the prices tested in rat C19 along the price sweeps. Note that the orientation of the contour lines is almost vertical at their intersection with the price-sweep vectors. Each contour line plots the combinations of price and pulse frequency that support a given level of behavior (time allocation). Thus, the contour lines trace out the intensity-growth function for BSR (Fig. 1A, red curve in the 3D graph on the left). The diagonal portions of the contour lines span ranges of pulse frequency over which reward intensity rises; as a result, the effect of a price increase can be offset by an increase in pulse frequency. In contrast, where the contour lines run vertically, reward intensity has leveled off at its maximal value, and increases in price can no longer be offset by further increases in pulse frequency. An estimate of the Pe parameter can be obtained by visual inspection of price-sweep data that intersect the vertically oriented portions of the contour lines: it is the price at which time allocation for the maximally intense reward lies halfway between the lower and upper asymptotes of the sigmoid psychometric curve. For example, the prices corresponding to the vertical midpoints of the two price-sweep curves in Figure 4B, which were obtained at near-maximal reward intensities, provide rough estimates of the Pe parameter, and the decrease in the value of this parameter produced by AM-251 is approximated by the leftward displacement of the solid, dark-blue curve from the dashed, light-blue curve.
Figure 6 shows the drug-induced changes in location-parameter estimates for all subjects. The changes in the value of the Fhm parameter met the criterion for statistical reliability in the data from only three of eight rats and ranged from −0.119 to .0194 common logarithmic units. The direction of these changes was inconsistent; in the case of Rat C8, Fhm decreased in the drug condition whereas in the cases of Rats C11 and C14, the same treatment increased it (Fig. 6). In contrast, we found a reliable decrease in the value of Pe following drug administration in seven of the eight rats. Figure 6 shows that the size of these changes ranged from −0.084 to −0.242 common logarithmic units (17.6–42.7% decreases in Pe).
Figure 6.
AM-251-induced changes in the location parameters for all subjects. Drug-induced changes in Fhm are shown on the left and those in Peon the right. Error bars denote 95% confidence intervals. Note that in five of eight cases, Fhm did not change reliably and that the changes that met the statistical criterion are inconsistent in sign. In contrast, a consistent decrease in Pe was found in seven of eight subjects (*p < 0.05).
We quantified the levels of DA in the NAc at various time points before, during, and after electrical stimulation following an injection of AM-251 or its vehicle (Fig. 7). We found a significant main effect of time of sampling (F(17,119) = 8.8032, p < 0.01), the treatment (F(1,119) = 9.1776, p < 0.05), and their interaction (F(17,119) = 4.0021, p < 0.01). Planned comparisons showed that electrical stimulation produced a significant increase in NAc DA levels (Fig. 7B). This increase was significantly attenuated by CB1R blockade, without affecting basal levels (Fig. 7).
Figure 7.
AM-251 attenuated the ability of MFB stimulation to boost DA tone in the NAc. A, Changes in NAc DA levels following vehicle or AM-251 injections. B, Planned comparison of DA concentrations in dialysate samples collected during baseline testing and during electrical stimulation of the MFB. The stimulation-induced increase in DA concentration was significantly attenuated by AM-251, but baseline levels were not affected (*p < 0.05 as compared with corresponding baseline, **p < 0.05 as compared with the vehicle group).
We used the mountain model to derive a widely used location parameter for psychometric curves obtained in 2D curve-shift experiments: Fm50, the pulse frequency that supports a half-maximal level of performance (Table 1, Fig. 8). In accord with conventional practice, we compared the Fm50 estimates for the drug and vehicle conditions by means of a paired sample t test (Arnold et al., 2001; Deroche-Gamonet et al., 2001; Vlachou et al., 2003; De Vry et al., 2004; Vlachou et al., 2005; Xi et al., 2008). Whereas the 3D methodology allowed us to detect reliable drug-induced changes in the Pe parameter in 7/8 rats, the effects of AM-251 on the derived Fm50 values failed to cross the statistical threshold (t(7) = 1.885, p > 0.05) (Figs. 8, 9).
Figure 8.

Simulation of 2D curve-shifts following CB1 receptor blockade. A, The contour lines mid-way between the fitted estimates of maximal (TAmax) and minimal (TAmin) time allocation. The data are drawn from the surfaces fitted to the data from Rat C19. Dashed lines represent the values of the location parameters. Note the decrease in the Pe parameter caused by the drug, and the near-absence of such a change in the Fhm parameter. Due to the gentle slope of the diagonal portion of the contour line, a substantial displacement along the price axis is translated into a much smaller shift along the pulse-frequency axis. B, Simulated curves showing how the frequency-sweep data from subject C19 would appear had they been obtained using the standard curve-shift method. Such a small shift would almost certainly have been lost in the noise. Values of the pulse frequency corresponding to half-maximal behavioral allocation (Fm50 values) were derived from the mountain model and the fitted parameter values (Eq. 2). The simulated Fm50 values were averaged within condition and subject and compared across conditions using a paired sample t test. The resulting difference failed to meet the criterion for statistical significance.
Figure 9.
The mountain model can reveal effects of cannabinoid receptor blockade that cannot be discerned with the conventional curve-shift method. The effects of AM-251 administration on Fhm, Pe, and the simulated Fm50 values are shown. Each black diamond represents the estimated change of the corresponding parameter for a given subject. Whiskers represent the maximum and minimum values, and the upper and lower borders of the boxes denote the 25th and 75th percentiles; the mean and the median are represented by the small inner square and the horizontal line, respectively. Note that the Fhm and Fm50 changes tend to cluster around zero whereas the Pe changes are clustered around a mean of −0.13 common logarithmic units.
Discussion
CB1Rs modulate the behavioral impact of rewards. Rodents pretreated with CB1R antagonists show decreased break points in progressive-ratio tests of performance for food (Rasmussen and Huskinson, 2008), blunted appetitive responses in the taste-reactivity test (Jarrett et al., 2007), impaired acquisition of conditioned place preferences to drugs (Singh et al., 2004; Forget et al., 2005; Yu et al., 2009), and reduced drug self-administration (Filip et al., 2006; Shoaib, 2008; Xi et al., 2008). Conversely, CB1 receptor agonists increase operant responding for food (Solinas and Goldberg, 2005) and induce place preference (Valjent and Maldonado, 2000).
Dopamine release in the NAc has been implicated in reward and motivation (Wise, 2008). Mice lacking CB1Rs show decreased DA release in the NAc in response to drug rewards (Mascia et al., 1999; Hungund et al., 2003; Li et al., 2009). The release of dopamine in the NAc by rewarding drugs is inhibited by CB1R blockade (Cheer et al., 2007) and enhanced by pharmacological activation of these receptors (Cheer et al., 2004; Solinas et al., 2006). Given the vast and consistent evidence linking CB1Rs with reward modulation, it is striking that the effects of CB1R blockade on ICSS, one of the most widely used procedures for the quantitative study of reward, have heretofore yielded contradictory results (Solinas et al., 2008). As discussed below, our results offer an explanation for this inconsistency and provide a way to reconcile the effects of CB1R blockade on ICSS with the rest of the literature implicating CB1Rs in reward modulation.
We found consistent effects of CB1R blockade on the pursuit of BSR by manipulating both the strength and cost of rewarding stimulation and by applying a 3D analysis appropriate for testing the influence of drugs on the performance of individual subjects. Application of the 3D model distinguishes between changes induced by CB1R blockade in the sensitivity of brain reward circuitry and changes induced by the multiple factors that alter the price at which rats maintain a given level of performance for stimulation of a given strength (Fig. 1A). Changes in sensitivity alter the stimulation strength required to produce a half-maximal reward (Fhm), which governs the position of the reward mountain along the pulse-frequency axis. Changes in the value of the Fhm parameter met the criterion for statistical reliability in the data from only three of eight rats, and the direction of these changes was inconsistent. In contrast, we found a reliable decrease in the value of Pe following drug administration in seven of the eight rats. Thus, CB1Rs play their principal role at or beyond the output of neural circuitry that determines reward sensitivity. Such actions could include downward rescaling of integrator output (i.e., decreased gain) or increases in subjective costs (i.e., subjective valuation of the time or effort required to earn a reward), and the value of competing activities such as grooming, resting, and exploring (Herrnstein, 1970, 1974; Killeen, 1972; Heyman, 1988).
The decrease in the prices at which a given level of performance is sustained (↓Pe) under the influence of AM-251 may reflect an interaction of CB1R blockade with neurotransmitter systems implicated in reward pursuit. The fact that boosting DA tone in the NAc is accompanied by an increase in the prices at which performance for BSR is sustained (Hernandez et al., 2010), an effect opposite in sign to the one reported here, suggests that the present effect could be due to a decrease of DA signaling in the NAc.
As in prior studies (Hernandez et al., 2006), rewarding MFB stimulation produced a significant increase in NAc DA levels. This increase was attenuated significantly by CB1R blockade, without affecting basal levels. Thus, AM-251 may decrease the prices at which a given level of performance is sustained (↓Pe) by blunting the ability of MFB stimulation to boost DA tone in the ventral striatum. The observed behavioral and neurochemical effects are likely due to attenuated endocannabinoid-mediated disinhibition of DA neurons (Sperlágh et al., 2009). That AM-251 failed to alter basal levels of DA but did reduce the stimulation-induced enhancement of DA tone suggests that endocannabinoids are released in response to rewarding MFB stimulation and that their disinhibitory influence on DA neurons is reduced by AM-251.
CB1 receptors are the target of at least two endogenous ligands: anandamide and 2-arachidonoylglycerol. It has been suggested that these two lipids play different behavioral roles (Long et al., 2009). Given that blockade of the CB1R interferes with the binding of both endocannabinoids, we cannot, at present, partition the observed effects between them. This might be achieved in future work through the use of novel pharmacological tools that selectively and differentially prevent the degradation of these compounds (Fegley et al., 2005; King et al., 2007).
Our behavioral results illustrate an important methodological point: restricting the collection and analysis of ICSS data to two dimensions and averaging results across subjects can obscure effects that are discernable clearly when performance is measured as a function of both the strength and cost of BSR and in a manner that supports single-subject analysis. This point is illustrated by deriving from our data a measure analogous to the 2D group curve-shifts that have typically been measured. We used the mountain model to derive a widely used location parameter for psychometric curves obtained in curve-shift experiments, Fm50. Despite the reliable decreases in the value of the Pe parameter in 7/8 rats, the difference in Fm50 values failed to cross the statistical threshold. This shows that the 3D methodology permits the detection of differences that may not be readily distinguished with the usual BSR methodology (Figs. 8, 9).
A decrease in Pe can arise in multiple ways (Fig. 1). Although the decrease in the prices at which a given level of performance was sustained could reflect increased subjective costs, it may also be explained otherwise, e.g., by a decrease in reward-system gain (Hernandez et al., 2010). Further methodological progress will be required to distinguish between the currently tenable explanations. In a manner analogous to the method used here, this task can be pursued profitably by taking advantage of nonlinearities in psychophysical functions that translate objective variables (e.g., physical work required to earn a reward) into their psychological equivalents (e.g., subjective effort costs).
Depression has been linked to dopaminergic dysfunction and to a blunted reaction to rewards (Martin-Soelch, 2009). The latter symptom is consistent with a reduction in the gain of brain reward circuitry. Reduced gain in the BSR substrate is a tenable explanation of the results reported here. In this regard, it is noteworthy that an increase in the incidence of depressed mood has been noted in clinical trials of rimonabant (Van Gaal et al., 2008; Moreira et al., 2009), a CB1R antagonist.
The present findings offer an explanation for the inconsistency of prior reports. The traditional rate-frequency curves can be portrayed as 2D projections of a 3D structure. The face of the structure is diagonally oriented. Thus, when the mountain is displaced along an axis representing either pulse frequency or price, the 2D silhouette is displaced along the orthogonal axis (see Notes). If the data are 2D, this can produce the illusion of motion in the plane in which the data are acquired when the actual movement was orthogonal to that plane. In other words, a shift along the price axis (ΔPe) can create the illusion of a shift along the pulse-frequency axis (ΔFhm). However, this relationship is asymmetrical. The low slope of the diagonal portion of the contour lines in Figures 5 and 8A implies that a given change in Pe will produce a substantially smaller displacement in the silhouette of the mountain along the pulse-frequency axis, which is the sole independent-variable axis considered in traditional curve-shift experiments. Such shifts may not be discernible. Thus, it is not surprising that significant effects of CB1R blockade on ICSS have not been found in several prior studies (Vlachou et al., 2003, 2005; Xi et al., 2008). The detection problem is compounded by small changes in Fhm, which can counteract the displacement of the 2D silhouette due to the shift of the 3D structure along the price axis. Moreover, the three reliable Fhm changes observed here were inconsistent in sign. This reduces the likelihood of finding a significant effect when changes in Fm50 are averaged across subjects and group comparisons are carried out. In contrast, the 3D representation of single-subject results (Fig. 5) renders the changes in the location parameters and their statistical reliability unambiguously and clearly.
The allocation of behavior to the pursuit of reward necessarily depends on multiple variables, including reward strength, cost, probability, delay, and risk (Shizgal, 1997). Methods that can distinguish and quantify the contributions of these different variables will be required to determine the roles in reward seeking played by different neural systems. The findings reported here constitute one step toward understanding the contribution(s) of the endogenous cannabinoid system in the evaluation, selection, and pursuit of appetitive goals. The combination of quantitative modeling and multidimensional measurement of behavior promises future advances toward this goal.
Notes
Supplemental material for this article can be found at http://spectrum.library.concordia.ca/7084/. This material has not been peer reviewed.
Footnotes
This research was supported by a grant to P.S. from the Canadian Institutes of Health Research (#MOP-74577), by a group grant from the Fonds de la recherche en santé du Québec to the Groupe de recherche en neurobiologie comportementale/Center for Studies in Behavioral Neurobiology (Barbara Woodside, P.I.), and by scholarships to I.T.-P from Consejo Nacional de Ciencia y Tecnologia (CONACYT, #209314) and from le Ministère de l'Éducation, du Loisir et du Sport du Québec (PBEEE-1M, #140498). David Munro built and maintained the computer-controlled equipment for experimental control and data acquisition. Software for experimental control and data acquisition was written and maintained by Steve Cabilio.
References
- Akaike H. A new look at the statistical model identification. IEEE Trans Automat Contr. 1974;19:716–723. [Google Scholar]
- Arnold JC, Hunt GE, McGregor IS. Effects of the cannabinoid receptor agonist CP 55,940 and the cannabinoid receptor antagonist SR 141716 on intracranial self-stimulation in Lewis rats. Life Sci. 2001;70:97–108. doi: 10.1016/s0024-3205(01)01366-2. [DOI] [PubMed] [Google Scholar]
- Arvanitogiannis A, Shizgal P. The reinforcement mountain: Allocation of behavior as a function of the rate and intensity of rewarding brain stimulation. Behavioral Neuroscience. 2008;122:1126–1138. doi: 10.1037/a0012679. [DOI] [PubMed] [Google Scholar]
- Breton YA, Marcus JC, Shizgal P. Rattus psychologicus: construction of preferences by self-stimulating rats. Behav Brain Res. 2009;202:77–91. doi: 10.1016/j.bbr.2009.03.019. [DOI] [PubMed] [Google Scholar]
- Cheer JF, Wassum KM, Heien ML, Phillips PE, Wightman RM. Cannabinoids enhance subsecond dopamine release in the nucleus accumbens of awake rats. J Neurosci. 2004;24:4393–4400. doi: 10.1523/JNEUROSCI.0529-04.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheer JF, Wassum KM, Sombers LA, Heien ML, Ariansen JL, Aragona BJ, Phillips PE, Wightman RM. Phasic dopamine release evoked by abused substances requires cannabinoid receptor activation. J Neurosci. 2007;27:791–795. doi: 10.1523/JNEUROSCI.4152-06.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Deroche-Gamonet V, Le Moal M, Piazza PV, Soubrié P. SR141716, a CB1 receptor antagonist, decreases the sensitivity to the reinforcing effects of electrical brain stimulation in rats. Psychopharmacology. 2001;157:254–259. doi: 10.1007/s002130100804. [DOI] [PubMed] [Google Scholar]
- De Vry J, Schreiber R, Eckel G, Jentzsch KR. Behavioral mechanisms underlying inhibition of food-maintained responding by the cannabinoid receptor antagonist/inverse agonist SR141716A. Eur J Pharmacol. 2004;483:55–63. doi: 10.1016/j.ejphar.2003.10.012. [DOI] [PubMed] [Google Scholar]
- Edmonds DE, Gallistel CR. Parametric analysis of brain stimulation reward in the rat: III. Effect of performance variables on the reward summation function. J Comp Physiol Psychol. 1974;87:876–883. doi: 10.1037/h0037217. [DOI] [PubMed] [Google Scholar]
- Edmonds DE, Gallistel CR. Reward versus performance in self-stimulation: electrode-specific effects of alpha-methyl-p-tyrosine on reward in the rat. J Comp Physiol Psychol. 1977;91:962–974. doi: 10.1037/h0077391. [DOI] [PubMed] [Google Scholar]
- Efron B, Tibshirani RJ. An introduction to the bootstrap. New York: Chapman and Hall; 1994. [Google Scholar]
- Fegley D, Gaetani S, Duranti A, Tontini A, Mor M, Tarzia G, Piomelli D. Characterization of the fatty acid amide hydrolase inhibitor cyclohexyl carbamic acid 3′-carbamoyl-biphenyl-3-yl ester (URB597): effects on anandamide and oleoylethanolamide deactivation. J Pharmacol Exp Ther. 2005;313:352–358. doi: 10.1124/jpet.104.078980. [DOI] [PubMed] [Google Scholar]
- Filip M, Gołda A, Zaniewska M, McCreary AC, Nowak E, Kolasiewicz W, Przegaliński E. Involvement of cannabinoid CB1 receptors in drug addiction: effects of rimonabant on behavioral responses induced by cocaine. Pharmacol Rep. 2006;58:806–819. [PubMed] [Google Scholar]
- Forget B, Hamon M, Thiébot MH. Cannabinoid CB1 receptors are involved in motivational effects of nicotine in rats. Psychopharmacology. 2005;181:722–734. doi: 10.1007/s00213-005-0015-6. [DOI] [PubMed] [Google Scholar]
- Gallistel CR, Leon M. Measuring the subjective magnitude of brain stimulation reward by titration with rate of reward. Behav Neurosci. 1991;105:913–925. [PubMed] [Google Scholar]
- Hernandez G, Hamdani S, Rajabi H, Conover K, Stewart J, Arvanitogiannis A, Shizgal P. Prolonged rewarding stimulation of the rat medial forebrain bundle: neurochemical and behavioral consequences. Behav Neurosci. 2006;120:888–904. doi: 10.1037/0735-7044.120.4.888. [DOI] [PubMed] [Google Scholar]
- Hernandez G, Haines E, Rajabi H, Stewart J, Arvanitogiannis A, Shizgal P. Predictable and unpredictable rewards produce similar changes in dopamine tone. Behav Neurosci. 2007;121:887–895. doi: 10.1037/0735-7044.121.5.887. [DOI] [PubMed] [Google Scholar]
- Hernandez G, Breton YA, Conover K, Shizgal P. At what stage of neural processing does cocaine act to boost pursuit of rewards? PLoS ONE. 2010;5:e15081. doi: 10.1371/journal.pone.0015081. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hernandez L, Hoebel BG. Feeding and hypothalamic stimulation increase dopamine turnover in the accumbens. Physiol Behav. 1988;44:599–606. doi: 10.1016/0031-9384(88)90324-1. [DOI] [PubMed] [Google Scholar]
- Herrnstein RJ. On the law of effect. J Exp Anal Behav. 1970;13:243–266. doi: 10.1901/jeab.1970.13-243. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Herrnstein RJ. Formal properties of the matching law. J Exp Anal Behav. 1974;21:159–164. doi: 10.1901/jeab.1974.21-159. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Heyman G. How drugs affect cells and reinforcement affects behavior: formal analogies. Quantitative analyses of behavior. In: Commons ML, Church RM, Stellar JR, Wagner AR, editors. Biological determinants of behavior. Hillsdale, New Jersey: Lawrence Erlbaum; 1988. pp. 157–182. [Google Scholar]
- Hodos W. Progressive ratio as a measure of reward strength. Science. 1961;134:943–944. doi: 10.1126/science.134.3483.943. [DOI] [PubMed] [Google Scholar]
- Hungund BL, Szakall I, Adam A, Basavarajappa BS, Vadasz C. Cannabinoid CB1 receptor knockout mice exhibit markedly reduced voluntary alcohol consumption and lack alcohol-induced dopamine release in the nucleus accumbens. J Neurochem. 2003;84:698–704. doi: 10.1046/j.1471-4159.2003.01576.x. [DOI] [PubMed] [Google Scholar]
- Jarrett MM, Scantlebury J, Parker LA. Effect of delta9-tetrahydro cannabinol on quinine palatability and AM251 on sucrose and quinine palatability using the taste reactivity test. Physiol Behav. 2007;90:425–430. doi: 10.1016/j.physbeh.2006.10.003. [DOI] [PubMed] [Google Scholar]
- Keesey RE, Goldstein MD. Use of progressive fixed-ratio procedures in the assessment of intracranial reinforcement. J Exp Anal Behav. 1968;11:293–301. doi: 10.1901/jeab.1968.11-293. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Killeen P. The matching law. J Exp Anal Behav. 1972;17:489–495. doi: 10.1901/jeab.1972.17-489. [DOI] [PMC free article] [PubMed] [Google Scholar]
- King AR, Duranti A, Tontini A, Rivara S, Rosengarth A, Clapper JR, Astarita G, Geaga JA, Luecke H, Mor M, Tarzia G, Piomelli D. URB602 inhibits monoacylglycerol lipase and selectively blocks 2-arachidonoylglycerol degradation in intact brain slices. Chem Biol. 2007;14:1357–1365. doi: 10.1016/j.chembiol.2007.10.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Leon M, Gallistel CR. The function relating the subjective magnitude of brain stimulation reward to stimulation strength varies with site of stimulation. Behav Brain Res. 1992;52:183–193. doi: 10.1016/s0166-4328(05)80229-3. [DOI] [PubMed] [Google Scholar]
- Li X, Hoffman AF, Peng XQ, Lupica CR, Gardner EL, Xi ZX. Attenuation of basal and cocaine-enhanced locomotion and nucleus accumbens dopamine in cannabinoid CB1-receptor-knockout mice. Psychopharmacology. 2009;204:1–11. doi: 10.1007/s00213-008-1432-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Long JZ, Li W, Booker L, Burston JJ, Kinsey SG, Schlosburg JE, Pavón FJ, Serrano AM, Selley DE, Parsons LH, Lichtman AH, Cravatt BF. Selective blockade of 2-arachidonoylglycerol hydrolysis produces cannabinoid behavioral effects. Nat Chem Biol. 2009;5:37–44. doi: 10.1038/nchembio.129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Martin-Soelch C. Is depression associated with dysfunction of the central reward system? Biochem Soc Trans. 2009;37:313–317. doi: 10.1042/BST0370313. [DOI] [PubMed] [Google Scholar]
- Mascia MS, Obinu MC, Ledent C, Parmentier M, Böhme GA, Imperato A, Fratta W. Lack of morphine-induced dopamine release in the nucleus accumbens of cannabinoid CB(1) receptor knockout mice. Eur J Pharmacol. 1999;383:R1–R2. doi: 10.1016/s0014-2999(99)00656-1. [DOI] [PubMed] [Google Scholar]
- Miliaressis E, Rompre PP, Laviolette P, Philippe L, Coulombe D. The curve-shift paradigm in self-stimulation. Physiol Behav. 1986;37:85–91. doi: 10.1016/0031-9384(86)90388-4. [DOI] [PubMed] [Google Scholar]
- Moreira FA, Grieb M, Lutz B. Central side-effects of therapies based on CB1 cannabinoid receptor agonists and antagonists: focus on anxiety and depression. Best Pract Res Clin Endocrinol Metab. 2009;23:133–144. doi: 10.1016/j.beem.2008.09.003. [DOI] [PubMed] [Google Scholar]
- Mundl WJ. A constant-current stimulator. Physiol Behav. 1980;24:991–993. doi: 10.1016/0031-9384(80)90162-6. [DOI] [PubMed] [Google Scholar]
- Olds J, Milner P. Positive reinforcement produced by electrical stimulation of septal area and other regions of rat brain. J Comp Physiol Psychol. 1954;47:419–427. doi: 10.1037/h0058775. [DOI] [PubMed] [Google Scholar]
- Paxinos G, Watson C. The rat brain in stereotaxic coordinates. Ed 7. Amsterdam: Academic/Elsevier; 2007. [Google Scholar]
- Rasmussen EB, Huskinson SL. Effects of rimonabant on behavior maintained by progressive ratio schedules of sucrose reinforcement in obese Zucker (fa/fa) rats. Behav Pharmacol. 2008;19:735–742. doi: 10.1097/FBP.0b013e3283123cc2. [DOI] [PubMed] [Google Scholar]
- Shizgal P. Neural basis of utility estimation. Curr Opin Neurobiol. 1997;7:198–208. doi: 10.1016/s0959-4388(97)80008-6. [DOI] [PubMed] [Google Scholar]
- Shoaib M. The cannabinoid antagonist AM251 attenuates nicotine self-administration and nicotine-seeking behaviour in rats. Neuropharmacology. 2008;54:438–444. doi: 10.1016/j.neuropharm.2007.10.011. [DOI] [PubMed] [Google Scholar]
- Simmons JM, Gallistel CR. Saturation of subjective reward magnitude as a function of current and pulse frequency. Behavioral Neuroscience. 1994;108:151–160. doi: 10.1037//0735-7044.108.1.151. [DOI] [PubMed] [Google Scholar]
- Singh ME, Verty AN, McGregor IS, Mallet PE. A cannabinoid receptor antagonist attenuates conditioned place preference but not behavioural sensitization to morphine. Brain Res. 2004;1026:244–253. doi: 10.1016/j.brainres.2004.08.027. [DOI] [PubMed] [Google Scholar]
- Solinas M, Goldberg SR. Motivational effects of cannabinoids and opioids on food reinforcement depend on simultaneous activation of cannabinoid and opioid systems. Neuropsychopharmacology. 2005;30:2035–2045. doi: 10.1038/sj.npp.1300720. [DOI] [PubMed] [Google Scholar]
- Solinas M, Justinova Z, Goldberg SR, Tanda G. Anandamide administration alone and after inhibition of fatty acid amide hydrolase (FAAH) increases dopamine levels in the nucleus accumbens shell in rats. J Neurochem. 2006;98:408–419. doi: 10.1111/j.1471-4159.2006.03880.x. [DOI] [PubMed] [Google Scholar]
- Solinas M, Goldberg SR, Piomelli D. The endocannabinoid system in brain reward processes. Br J Pharmacol. 2008;154:369–383. doi: 10.1038/bjp.2008.130. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sperlágh B, Windisch K, Andó RD, Sylvester Vizi E. Neurochemical evidence that stimulation of CB1 cannabinoid receptors on GABAergic nerve terminals activates the dopaminergic reward system by increasing dopamine release in the rat nucleus accumbens. Neurochem Int. 2009;54:452–457. doi: 10.1016/j.neuint.2009.01.017. [DOI] [PubMed] [Google Scholar]
- Valjent E, Maldonado R. A behavioural model to reveal place preference to delta 9-tetrahydrocannabinol in mice. Psychopharmacology. 2000;147:436–438. doi: 10.1007/s002130050013. [DOI] [PubMed] [Google Scholar]
- Van Gaal L, Pi-Sunyer X, Després JP, McCarthy C, Scheen A. Efficacy and Safety of Rimonabant for Improvement of Multiple Cardiometabolic Risk Factors in Overweight/Obese patients Pooled 1-year data from the Rimonabant in Obesity (RIO) program. Diabetes Care. 2008;31:S229–S240. doi: 10.2337/dc08-s258. [DOI] [PubMed] [Google Scholar]
- Vlachou S, Nomikos GG, Panagis G. WIN 55,212–2 decreases the reinforcing actions of cocaine through CB1 cannabinoid receptor stimulation. Behav Brain Res. 2003;141:215–222. doi: 10.1016/s0166-4328(02)00370-4. [DOI] [PubMed] [Google Scholar]
- Vlachou S, Nomikos GG, Panagis G. CB1 cannabinoid receptor agonists increase intracranial self-stimulation thresholds in the rat. Psychopharmacology. 2005;179:498–508. doi: 10.1007/s00213-004-2050-0. [DOI] [PubMed] [Google Scholar]
- Wise RA. Dopamine and reward: the anhedonia hypothesis 30 years on. Neurotox Res. 2008;14:169–183. doi: 10.1007/BF03033808. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xi ZX, Gilbert JG, Peng XQ, Pak AC, Li X, Gardner EL. Cannabinoid CB1 receptor antagonist AM251 inhibits cocaine-primed relapse in rats: role of glutamate in the nucleus accumbens. J Neurosci. 2006;26:8531–8536. doi: 10.1523/JNEUROSCI.0726-06.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Xi ZX, Spiller K, Pak AC, Gilbert J, Dillon C, Li X, Peng XQ, Gardner EL. Cannabinoid CB1 receptor antagonists attenuate cocaine's rewarding effects: experiments with self-administration and brain-stimulation reward in rats. Neuropsychopharmacology. 2008;33:1735–1745. doi: 10.1038/sj.npp.1301552. [DOI] [PubMed] [Google Scholar]
- You ZB, Chen YQ, Wise RA. Dopamine and glutamate release in the nucleus accumbens and ventral tegmental area of rat following lateral hypothalamic self-stimulation. Neuroscience. 2001;107:629–639. doi: 10.1016/s0306-4522(01)00379-7. [DOI] [PubMed] [Google Scholar]
- Yu LL, Wang XY, Zhao M, Liu Y, Li YQ, Li FQ, Wang X, Xue YX, Lu L. Effects of cannabinoid CB1 receptor antagonist rimonabant in consolidation and reconsolidation of methamphetamine reward memory in mice. Psychopharmacology. 2009;204:203–211. doi: 10.1007/s00213-008-1450-y. [DOI] [PubMed] [Google Scholar]









