Skip to main content
eLife logoLink to eLife
. 2021 Mar 17;10:e67065. doi: 10.7554/eLife.67065

Mechanism for differential recruitment of orbitostriatal transmission during actions and outcomes following chronic alcohol exposure

Rafael Renteria 1, Christian Cazares 2, Emily T Baltz 2, Drew C Schreiner 1, Ege A Yalcinbas 2, Thomas Steinkellner 3, Thomas S Hnasko 2,3,4, Christina M Gremel 1,2,
Editors: Laura A Bradfield5, Kate M Wassum6
PMCID: PMC8016477  PMID: 33729155

Abstract

Psychiatric disease often produces symptoms that have divergent effects on neural activity. For example, in drug dependence, dysfunctional value-based decision-making and compulsive-like actions have been linked to hypo- and hyperactivity of orbital frontal cortex (OFC)-basal ganglia circuits, respectively; however, the underlying mechanisms are unknown. Here we show that alcohol-exposed mice have enhanced activity in OFC terminals in dorsal striatum (OFC-DS) associated with actions, but reduced activity of the same terminals during periods of outcome retrieval, corresponding with a loss of outcome control over decision-making. Disrupted OFC-DS terminal activity was due to a dysfunction of dopamine-type 1 receptors on spiny projection neurons (D1R SPNs) that resulted in increased retrograde endocannabinoid signaling at OFC-D1R SPN synapses reducing OFC-DS transmission. Blocking CB1 receptors restored OFC-DS activity in vivo and rescued outcome-based control over decision-making. These findings demonstrate a circuit-, synapse-, and computation-specific mechanism gating OFC activity in alcohol-exposed mice.

Research organism: Mouse

Introduction

The DSM-5 categorizes alcohol as one of the few drugs whose dependence can produce neurocognitive disorders such as alcohol-related dementia (American Psychiatric Association, 2013). Thus, treatment of alcohol dependence would benefit from understanding mechanisms producing impaired cognition. Value-based decision-making is one such cognitive process, with its disruption contributing to poor daily function, that when combined with the emergence of compulsive control contributes to excessive drug-seeking and relapse (Belin et al., 2013; Everitt and Robbins, 2016; Gremel and Lovinger, 2017; Hogarth et al., 2013; Lüscher et al., 2020). Recently, there has been renewed focus on orbital frontal cortex (OFC) circuits in substance use disorder and related behaviors (Everitt and Robbins, 2016; Lüscher et al., 2020; Moorman, 2018; Schoenbaum et al., 2016; Schoenbaum et al., 2006), as they play a key role in computations related to decision-making and the evaluation of outcomes (Fellows, 2007; Stalnaker et al., 2015; Wallis, 2007). However, a mechanistic understanding of how drug dependence disrupts the ability of OFC and its downstream circuits to represent information necessary for appropriate control over decision-making is lacking.

Indeed, drug dependence in general has been widely associated with reduced OFC activity and output (Catafau et al., 1999; Lüscher et al., 2020; Schoenbaum et al., 2016; Schoenbaum et al., 2006; Volkow et al., 1997; Volkow and Fowler, 1994). In alcohol-dependent individuals, reduced OFC activity correlates with aberrant value-based decision-making (Boettiger et al., 2007; Duka et al., 2011; Forbes et al., 2014). However, OFC hyperactivity has been observed in alcohol dependence in response to predictive information (Hermann et al., 2006; Reinhard et al., 2015; Tapert et al., 2003) and during approach behaviors (Ernst et al., 2014). In concert, both rodent and non-human primate models of alcohol dependence have identified long-lasting changes to OFC neuron excitability, structure, and transmission (Nimitvilai et al., 2017; Nimitvilai et al., 2016; Renteria et al., 2018). Thus, it seems likely that alcohol dependence differentially alters the contribution of OFC circuits and their downstream controllers to varied decision-making processes, thereby enhancing some computations while reducing others, leading to alterations in behavioral control.

One such circuit is OFC excitatory projections into basal ganglia, with synapses onto direct and indirect pathways in the dorsal striatum (DS) (Renteria et al., 2018; Wall et al., 2013). Excitatory transmission from OFC-DS projections carries information that may support value-based decision-making, as inhibition of transmission results in habitual control (Gremel et al., 2016). We recently found that chronic alcohol exposure selectively attenuated output of this circuit, with a long-lasting reduction in OFC-DS transmission onto dopamine-type 1 (D1) spiny projection neurons (SPNs) of the direct pathway (Renteria et al., 2018). In addition, chronic alcohol exposure produces a loss of value-based decision-making (Corbit et al., 2012; Dickinson et al., 2002; Renteria et al., 2018) and reductions in associated OFC activity (Catafau et al., 1999; Lüscher et al., 2020; Schoenbaum et al., 2016; Schoenbaum et al., 2006; Volkow et al., 1997; Volkow and Fowler, 1994). Indeed, this behavioral phenotype was rescued by increasing OFC activity (Renteria et al., 2018). Together, these findings suggest that alcohol dependence may result in a reduced communication of value-related information from OFC into basal ganglia circuits to support decision-making.

However, dependence has also been associated with compulsive phenotypes (Everitt and Robbins, 2016; Everitt and Robbins, 2005; Lüscher et al., 2020), and hyperactivity of OFC-DS circuits has been implicated in obsessive compulsive disorder (OCD) (Lüscher et al., 2020; Milad and Rauch, 2012; Pauls et al., 2014; Robbins et al., 2019), as well as animal models of compulsive (Ahmari et al., 2013) and compulsive behaviors (Pascoli et al., 2018; Pascoli et al., 2015). Indeed, increased OFC terminal activity was observed in mice when they self-stimulated ventral tegmental area dopamine neurons, despite the presence of a punishing foot-shock (Pascoli et al., 2018). This corresponded to postsynaptic potentiation of OFC-DS transmission (Pascoli et al., 2018), suggesting that increased transmission from OFC into basal ganglia circuits contributes to the emergence of compulsive actions. Thus, OFC-DS circuits are poised to regulate both compulsive action and value information, computations that are both implicated in substance use disorder and may be differentially altered in psychiatric disease.

Here we examine mechanisms underlying how drug dependence affects OFC-DS circuit information representation and its control over decision-making. Using a well-established model of alcohol dependence in mice, we find that chronic alcohol exposure selectively enhances OFC-DS activity associated with action while reducing OFC-DS activity during periods associated with outcome retrieval through downregulation of D1 receptor function in SPNs and enhanced endocannabinoid (eCB) signaling at OFC-D1 SPN synapses in the DS. Restoring OFC-DS activity rescues value-based decision-making. Our data has important implications for hypotheses regarding compulsive and habitual phenotypes observed in substance use disorder.

Results

Altered encoding of OFC-DS terminal activity following chronic alcohol exposure

To examine whether alcohol dependence alters information sent to dorsal striatum, we used fiber photometry to measure calcium activity of OFC terminals in DS during self-initiated value-based decision-making. We utilized a well-validated and commonly used mouse model of alcohol dependence, chronic intermittent ethanol (CIE) vapor exposure (Becker, 1994; Becker and Lopez, 2004; Griffin et al., 2009; Lopez and Becker, 2005; Figure 1A). Performed in three cohort replications, mice (Air n = 10, CIE n = 12) were exposed to ethanol vapor for 16 hr each day for four consecutive days followed by a 3 day withdrawal. This cycle of ethanol vapor and withdrawal was repeated for four consecutive weeks and resulted in blood ethanol concentrations of 33.70 ± 2.24 mM (mean ± SEM) (see Materials and methods). Prior to CIE procedures, mice were injected with AAV-hSynapsin1-axon-GCaMP6s-P2A-mRuby3 in the OFC and implanted with a fiber targeted to the medial DS to record OFC terminal activity (see Materials and methods) (Figure 1B–D). Alcohol withdrawal has been delineated into two differing stages: an acute withdrawal period lasting 2–3 days and longer protracted withdrawal period that can last three or more months (American Psychiatric Association, 2013; Heilig et al., 2010). To avoid effects of acute withdrawal and examine potential alterations in the protracted withdrawal period, mice were food restricted and 5 days after the last vapor exposure were trained to lever press for a food reward under a random ratio schedule of reinforcement to bias value-based decision-making, measured as a sensitivity to outcome devaluation (Dickinson et al., 1983; Gremel and Costa, 2013; Figure 1E).

Figure 1. CIE-induced alterations in OFC terminal activity in the DS during instrumental responding.

(A) Experimental timeline that includes surgeries followed by four cycles of CIE exposure, operant training, and outcome devaluation. (B) Schematic of viral injections in the OFC and fiber implant in the DS. (C) Example viral expression in OFC. (D) Example OFC terminal expression and fiber placement in the DS. (E) Lever press training for a food pellet under random ratio schedule of reinforcement. (F) Lever presses, (G) response rate, (H) head entries, and (I) reinforcers earned during training in Air (n = 10 mice, three vapor cohorts) and CIE (n = 12 mice, three vapor cohorts) mice. (J) Average number of lever press bouts in Air and CIE mice. (K) Average number of lever presses per bout. (L) Average inter-press interval within lever press bouts. (M) Average bout duration. (N) Z-score of ΔF/F GCaMP6s traces recorded in OFC terminals during lever press bouts including all event trials and net area under the curve (AUC) of Z-score GCaMP6s signal during lever press bouts. (O) Z-score of ΔF/F GCaMP6s traces during lever press bouts including session averages and net area under the curve (AUC) of Z-score GCaMP6s signal during lever press bouts. (P) Z-score of ΔF/F GCaMP6s trace during the first head entry after outcome delivery including all event trials and net AUC of Z-score GCaMP6s signal during first head entry after outcome delivery. (Q) Z-score of ΔF/F GCaMP6s trace during the first head entry after outcome delivery including session averages and net AUC of Z-score GCaMP6s signal during first head entry after outcome delivery. (R) Schematic of outcome devaluation procedure. (S) Consumption of sucrose and pellets in Air (n = 5 mice) and CIE (n = 7 mice) mice during 1 hr free feed period. (T) Schematic of extinction test following free feed period. (U) Distribution of lever presses across valued (V) and devalued (DV) days. Data points and bar graphs represent mean ± SEM. Bonferroni-corrected post hoc test of repeated-measures ANOVA, two-sided FDR-corrected permutation test, or Kolmogorov–Smirnov test, *p≤0.05, ****p≤0.0001.

Figure 1—source data 1. CIE-induced alteractions in OFC terminal activity in the DS during instrumental responding source data.

Figure 1.

Figure 1—figure supplement 1. CIE-induced alterations in OFC terminal activity in the DS during instrumental responding, devaluation testing, and histological placements.

Figure 1—figure supplement 1.

(A) Lever presses on valued (V) and devalued (DV) days (repeated-measures ANOVA: No interaction or main effect of Time or Treatment, Fs < 2.64). (B) Schematic representation of fiber placement in DS. (C) Z-score of ΔF/F GCaMP6s trace including all event trials during lever presses (two-sided FDR-corrected permutation test, Air: n = 12673; CIE: n = 14871), (D) outcome delivery (two-sided FDR-corrected permutation test, Air: n = 569; CIE: n = 705), and (E) rewarded and non-rewarded head entries in Air mice (two-sided FDR-corrected permutation test, Rewarded: n = 563; Non-rewarded: n = 8034), and (F) rewarded and non-rewarded head entries in CIE mice (two-sided FDR-corrected permutation test, Rewarded: n = 703; Non-rewarded: n = 8291). (G) Z-score of ΔF/F GCaMP6s trace including the session average for each mouse during lever presses (two-sided FDR-corrected permutation test, Air: n = 36; CIE: n = 34), (H) outcome delivery (two-sided permutation test, Air: n = 32; CIE: n = 36), (I) rewarded and non-rewarded head entries in Air mice (two-sided FDR-corrected permutation test, Rewarded: n = 34; Non-rewarded: n = 34), and (J) rewarded and non-rewarded head entries in CIE mice (two-sided FDR-corrected permutation test, Rewarded: n = 38; Non-rewarded: n = 29). Two-sided FDR-corrected permutation test, *p≤0.05.
Figure 1—figure supplement 1—source data 1. Figure 1—figure supplement 1 CIE-induced alterations in OFC terminal activity in the DS during instrumental responding, devaluation testing, and histological placements source data 1.
Figure 1—figure supplement 1—source data 2. Figure 1—figure supplement 1 CIE-induced alterations in OFC terminal activity in the DS during instrumental responding, devaluation testing, and histological placements source data 2.

Chronic alcohol exposure did not prevent mice from learning to lever press for food. Air control mice and CIE-exposed mice made the same number of lever presses (repeated-measures ANOVA: main effect of Training day: F(8, 160)=43.10, p<0.0001; no main effect of CIE exposure or interaction: Fs < 0.16) (Figure 1F) and pressed the lever at roughly equal rates (repeated-measures ANOVA: main effect of Training day: F(8, 160)=27.42, p<0.0001; no main effect of treatment or interaction: Fs < 1.22) (Figure 1G). The two groups made similar number of head entries across training (repeated-measures ANOVA: main effect of Training day: F(8, 160)=5.92, p<0.0001; no main effect of CIE exposure or interaction: Fs < 1.29) (Figure 1H) and earned similar numbers of rewards (repeated-measures ANOVA: main effect of Training day: F(8, 160)=19.42, p<0.0001; no main effect of treatment or interaction: Fs < 0.60) (Figure 1I). As mice self-initiate, self-terminate their own behavior, it appeared they often made multiple lever presses in close succession. We used bout analyses to examine whether mice organized their lever presses into distinguishable patterns (See Materials and methods). Mice grouped their lever presses into distinct bouts, and Air and CIE mice had similar numbers of bouts (Air: 40.92 ± 4.15, n = 37; CIE: 49.78 ± 4.17, n = 37; non-parametric two-sample Kolmogorov–Smirnov [KS] test: D = 0.30, p=0.07) (Figure 1J) with similar numbers of lever presses per bout (Air: 8.19 ± 0.18, n = 1514; CIE: 8.032 ± 0.16, n = 1842; KS test: D = 0.04, p=0.14) (Figure 1K). However, Air and CIE mice differentially executed these bouts, with CIE mice having shorter intervals between lever presses within a bout (inter-press interval, Air: 1.84 s ± 0.05, n = 1514; CIE: 1.31 s ± 0.04, n = 1842; KS test: D = 0.17, p<0.0001) (Figure 1L), resulting in shorter bout durations (Air: 19.15 s ± 0.59, n = 1514; CIE: 13.59 s ± 0.41, n = 1842; KS test: D = 0.14, p<0.0001) (Figure 1M). Thus, chronic alcohol exposure led to a change in action execution, manifested as faster completion of lever press bouts.

When we examined changes in z-scored (z-score ∆F/F0) OFC-DS terminal Ca2+ activity during decision-making, we observed modulation of calcium transients during action epochs (−2 s to 2 s after the onset of a lever press bout compared to −5 s to −2 s baseline period) and outcome retrieval epochs (−2 s to 2 s around head entry into food receptacle following reward delivery compared to −5 s to −2 s baseline period). As mice can readily see the food reward prior to making a head entry, this outcome retrieval period encompasses reward perception, retrieval, and potentially consumption. We examined whether modulations of calcium transients were differentially affected by chronic alcohol exposure. Photometry data was analyzed two ways. First, z-scored ∆F/F0 traces were combined across all mice within a group (all event trials). This was done to preserve the variance seen within a subject. Second, we averaged these z-scored ∆F/F0 traces for a given animal session and then averaged these traces across mice within a group (session average). This examines between-mouse variability but does not preserve within-subject variability. When we examined all lever presses, we saw CIE led to a slight yet significant increase in OFC-DS Ca2+ activity prior to the onset of a lever press, both when we examined lever presses collapsed across animals as well as when we examined session averages per mouse (see Materials and methods) (Figure 1—figure supplement 1C,G). Given mice organized their lever presses into bouts in a self-paced nature and can make multiple lever presses within the analysis window prior to lever press onset, we next examined whether this CIE-induced increase in OFC-DS modulation was evident in the first lever press of a bout. When all event trials where collapsed across mice, we found increased OFC-DS activity at the onset of lever press bout (false discovery rate [FDR]-corrected two-sided permutation test, all event trials: Air: n = 1363, CIE n = 1650, p<0.05), and net area under the curve (AUC) analyses showed a significant difference between Air and CIE mice (Air: −0.40 ± 0.08, n = 1363; CIE: −0.05 ± 0.06, n = 1650; unpaired t-test: t3011 = 3.70, p<0.001) (Figure 1N). However, analyses of average session per mouse activity at the onset of a lever press bout did not show a significant increase (session average: Air: n = 35; CIE: n = 36, p>0.05) or difference in AUC (Air: −0.35 ± 0.18, n = 35; CIE: −0.06 ± 0.24, n = 36; unpaired t-test: t69 = 0.94, p=0.35) (Figure 1O). Together this suggests that CIE may lead to slight increases in OFC terminal activity during lever-press-related behavior.

In contrast, when we examined OFC-DS activity during the period associated with outcome retrieval following reward delivery (first head entry after outcome delivery), we found the opposite pattern. There, in both all event and session average analyses, Air controls showed a large significant increase in OFC-DS terminal activity, whereas in CIE mice, this was significantly blunted (FDR-corrected two-sided permutation test, all event trials: Air: n = 521, CIE: n = 682; session average: Air: n = 34; CIE: n = 38) (Figure 1P,Q). Furthermore, there was an overall decrease in the net AUC in CIE mice compared to Air mice (all event trials: Air: 1.00 ± 0.16, n = 521; CIE: 0.24 ± 0.11, n = 682; unpaired t-test: t1201 = 4.12, p<0.0001; session average: Air: 1.22 ± 0.4, n = 34; CIE: 0.04 ± 0.37, n = 38; unpaired t-test: t70 = 2.18, p<0.05) (Figure 1P,Q). Since the sound associated with pellet delivery could potentially contribute to activity modulation, we examined OFC-DS terminal activity in relation to outcome delivery (independent of any checking behavior). Again, we found greater modulation within 1 s of outcome delivery in Air mice than in CIE mice (Figure 1—figure supplement 1D,H). Furthermore, the increase in OFC-DS activity in Air, and to a lesser extent CIE mice, was selective to head entries following outcome delivery when compared to non-rewarded head entries (Figure 1—figure supplement 1E,F). Together, these findings suggest that CIE exposure may lead to slightly enhanced OFC-DS terminal activity associated with actions, but substantially less OFC-DS terminal activity during the period associated with outcome perception and/or retrieval.

Given the necessity of OFC-DS pathway function for successful outcome devaluation (Gremel et al., 2016), the substantial reduction in OFC-DS activity observed in CIE during periods associated with outcome retrieval suggests value-based decision-making may be impaired in these CIE mice. Thus, we used sensory-specific satiety to induce outcome devaluation (Figure 1R–U, see Materials and methods), where reductions in lever pressing on the devalued compared to valued day are indicative of outcome devaluation. Pre-planned comparisons on normalized data to account for differences in response rates between individual mice showed that Air reduced responding following outcome devaluation, while CIE mice did less so (repeated-measures ANOVA: main effect of valuation state: F(1, 10)=6.36, p<0.05; no main effect of CIE exposure or interaction: Fs < 2.98; pre-planned corrected comparisons between Valued and Devalued days: Air p<0.02; CIE p>0.6) (Figure 1U, Figure 1—figure supplement 1). Indeed, Air mice differentially distributed lever pressing between devalued and valued days, and CIE mice did not (one-sample paired t-test vs. 0.5; Air-t4 = 3.68, p<0.05; CIE t6 = 0.54, p=0.30). As previous data found enhancing OFC activity was sufficient to rescue the deficits in outcome devaluation in CIE mice (Renteria et al., 2018), the current data suggest that their inability to update outcome value following devaluation may be a result of the dampened OFC-DS activity during outcome retrieval epochs (Figure 1P,Q).

Chronic alcohol exposure does not alter CB1 receptor function or eCB long-term plasticity

The reduced calcium activity in OFC-DS terminals is suggestive of a decrease in presynaptic transmission. In dorsal striatum, OFC terminal synapse onto both direct and indirect output pathways of the basal ganglia (Wall et al., 2013). However, we previously found CIE results in a selective and long-lasting decrease in presynaptic OFC transmission onto D1 SPNs of the direct pathway, but not dopamine type-2 (D2) SPNs of the indirect pathway (Renteria et al., 2018). This reduction in OFC-D1 SPN transmission was stable and persistent across 3 weeks of chronic withdrawal (Renteria et al., 2018). As an OFC-DS projection neuron most likely sends collaterals to both D1 SPNs and D2 SPNs (Wall et al., 2013), the cell-type-specific decrease in presynaptic transmission may be mediated by a CIE-induced alteration in eCB signaling (Henderson-Redmond et al., 2016; Parsons and Hurd, 2015; Pava and Woodward, 2012). eCBs are produced and released from the postsynaptic cell and act as a retrograde signal to activate presynaptic cannabinoid type 1 (CB1) receptors, resulting in an inhibition of presynaptic calcium and a subsequent decrease in neurotransmitter release (Castillo et al., 2012; Gerdeman et al., 2002; Heifets and Castillo, 2009; Kano et al., 2009; Kreitzer and Regehr, 2001; Lovinger, 2008). In the dorsal striatum, CB1 receptors have been implicated in habitual action control (Hilário et al., 2007; Nazzaro et al., 2012), and activation of CB1 receptors on OFC terminals in DS has been shown to disrupt sensitivity to changes in expected outcome value (Gremel et al., 2016).

To identify CIE-induced mechanisms by which OFC transmission is selectively decreased to D1 SPNs, nine cohort replicates of mice were exposed to four rounds of CIE as previously described (Figure 1A), and electrophysiological recordings were performed within 3–21 days of withdrawal (Figure 2A). We have previously shown that the decrease in OFC transmission to D1 SPNs persists in protracted withdrawal and is present for up to 21 days post-CIE procedures (see Supplementary Figure 2 in Renteria et al., 2018). Prior to CIE, D1-tdTomato or D1DR-Cre mice were injected with AAV5-CamKIIa-GFP-Cre and a Cre-dependent channelrhodopsin (ChR2) (AAV5-Ef1a-DIO-ChR2-eYFP; UNC viral vector core) in OFC to limit ChR2 expression to CamKIIa expressing excitatory projection neurons (Gremel et al., 2016; Gremel and Costa, 2013; Renteria et al., 2018; Tye et al., 2011; Figure 2B,C). D1DR-Cre mice received an additional injection of AAV5-Flex-tdTomato in the DS to identify D1-expressing SPNs (Figure 2D). Whole-cell patch clamp recordings were performed on tdTomato expressing D1 SPNs in the DS and optically evoked EPSCs were recorded by activating ChR2 expressing OFC terminals with 470 nm light in the DS (Figure 2B).

Figure 2. Endocannabinoid-mediated plasticity of OFC transmission to D1 SPNs in Air and CIE mice.

(A) Experimental timeline that includes surgeries followed by four cycles of CIE exposure and whole-cell recordings 3–21 days in withdrawal. (B) Schematic of viral injections in OFC and optical stimulation of OFC terminals during whole-cell recordings of D1 SPNs in the DS. (C) Representative viral expression of ChR2 in OFC. (D) Example ChR2 expression of OFC terminals and expression of tdTomato in D1 SPNs in the DS. (E) Bath application of a CB1 receptor agonist, WIN55212-2 (1 µM), during optical stimulation of OFC terminals to D1 SPNs in Air (n = 6 cells, four mice) and CIE (n = 6 cells, three mice). (F) Bar graphs representing the percentage change from baseline (min 0–10) after bath application of WIN55212-2 (min 30–40). (G) Endocannabinoid mediated long-term depression induced by bath application of a group 1 mGluR agonist, DHPG (50 µM) paired with depolarization to 0 mV in Air (n = 6 cells, three mice) and CIE (n = 5 cells, three mice). (H) Bar graphs representing the percentage change from baseline after bath application of DHPG. (I) DHPG-LTD of OFC transmission is blocked by a CB1 receptor antagonist, SR141716 (1 µM) in both Air (n = 4 cells, three mice) and CIE (n = 6 cells, three mice). (J) Bar graphs representing the percentage change from baseline after bath application of DHPG in the presence of SR141716. (K) D1 agonist (3 µM SKF 81297) blocks the expression of DHPG-LTD in Air mice (n = 6 cells, four mice) but has no effect in mice exposed to CIE (n = 7 cells, five mice; w/ SR141716 n = 5 cells, two mice). (L) Bar graphs representing the percentage change from baseline after bath application of DHPG in the presence of SKF 81297 or SKF 81297 with SR141716. Scale bars represent 10 ms (horizontal) and 50 pA (vertical). Data points and bar graphs represent mean ± SEM. Bonferroni-corrected or paired (vs. baseline) two-tailed t-test, **p≤0.01, ***p≤0.001, ****p≤0.0001.

Figure 2—source data 1. Endocannabinoid-mediated plasticity of OFC transmission to D1 SPNs in Air and CIE mice source data.

Figure 2.

Figure 2—figure supplement 1. No difference between Air and CIE in the expression of DHPG-LTD of electrically evoked EPSCs.

Figure 2—figure supplement 1.

(A) Schematic of local electrical stimulation and recording site in the DS. (B) DHPG-LTD of electrically evoked EPSCs expressed in both Air and CIE mice. (C) Bar graph representing the percentage change from baseline (min 0–10) after bath application of DHPG (min 30–40) (Air: 61.62 ± 8.03% of baseline at 30–40 min, n = 6, paired t-test vs. baseline: t5 = 4.79, p<0.01; CIE: 47.69 ± 4.95% of baseline at 30–40 min, n = 6, paired t-test vs. baseline: t5 = 10.57, p=0.0001; unpaired t-test Air vs. CIE: t10 = 1.48, p=0.17) and (D) DHPG-LTD is blocked by a CB1 receptor antagonist, SR141716 (1 µM). (E) Bar graphs representing the percentage change from baseline after bath application of DHPG in the presence of SR141716 (Air: 100.3 ± 8.27% of baseline at 30–40 min, n = 5, paired t-test vs. baseline: t4 = 0.04, p=0.97; CIE: 97.25 ± 6.87% of baseline at 30–40 min, n = 8, paired t-test vs. baseline: t7 = 0.40, p=0.70). (F) D1 agonist (3 µM SKF 81297) blocks the expression of DHPG-LTD in both Air and CIE. (G) Bar graphs representing the percentage change from baseline after bath application of DHPG in the presence of SKF 81297 or SKF 81297 with SR141716 (Air: 103.6 ± 8.45% of baseline at 30–40 min, n = 4, paired t-test vs. baseline: t3 = 0.43, p=0.70; CIE: 95.11 ± 11.97% of baseline at 30–40 min, n = 8, paired t-test vs. baseline: t7 = 0.41, p=0.70). Short-term depression in CIE mice (74.04 ± 9.88% of baseline at 10–20 min, n = 8, paired t-test vs. baseline: t7 = 2.63, p=0.03) is CB1 receptor dependent (107.10 ± 10.13% of baseline at 10–20 min, n = 4, paired t-test vs. baseline: t3 = 0.70, p=0.53; 99.61 ± 10.58% of baseline at 30–40 min, n = 4, paired t-test vs. baseline: t3 = 0.04, p=0.97). Data points and bar graphs represent the mean ± SEM.
Figure 2—figure supplement 1—source data 1. No difference betwen Air and CIE in the expression of DHPG-LTD of electrically evoked EPSCs.

We first hypothesized that CIE procedures induced a change in CB1 receptor function. We applied a CB1 receptor agonist (1 µM WIN 55,212–2) (Figure 2E,F), but found a similar decrease in EPSC amplitude between Air (44.72 ± 1.98% of baseline at 30–40 min, n = 6, paired t-test vs. baseline: t5 = 27.96, p<0.0001) and CIE-exposed mice (47.08 ± 9.03% of baseline at 30–40 min, n = 6, paired t-test vs. baseline: t5 = 5.86, p<0.01) (unpaired t-test comparing Air and CIE mice: t10 = 0.25, p=0.80).

Another possibility for the observed decrease in OFC transmission may involve a CIE-induced change in the expression of eCB-mediated plasticity (DePoy et al., 2013). Given the decreased OFC transmission induced by CIE, we hypothesized that eCB-mediated long-term depression (LTD) would be occluded in CIE exposed mice. eCB-LTD has been extensively studied and described using a high-frequency stimulation (HFS) induction protocol. However, we needed to probe input-specific contributions, and conventional ChR2 shows a strong inactivation and desensitization to HFS (Lin, 2011). Thus, we opted to use a group 1 mGluR agonist, DHPG, to induce LTD (Kreitzer and Malenka, 2007; Kreitzer and Malenka, 2005; Wu et al., 2015; Yin et al., 2006). A 10 min bath application of DHPG (50 µM) paired with postsynaptic depolarization to 0 mV, resulted in LTD of optically evoked EPSCs (Air: 54.60 ± 8.40% of baseline at 30–40 min, n = 6, paired t-test vs. baseline: t5 = 5.41, p<0.01; CIE: 46.68 ± 4.97% of baseline at 30–40 min, n = 5, paired t-test vs. baseline: t4 = 10.74, p<0.001) (Figure 2G). However, this DHPG-induced LTD was of similar magnitude between Air and CIE-exposed mice (unpaired t-test: t9 = 0.77, p=0.46) (Figure 2H). Bath application of a CB1 receptor antagonist, SR141716 (1 µM) showed the observed LTD to be CB1 receptor dependent in both groups (Air: 103.2 ± 5.85% of baseline at 30–40 min, n = 4, paired t-test vs. baseline: t3 = 0.54, p=0.63; CIE: 108.3 ± 5.85% of baseline at 30–40 min, n = 6 paired t-test vs. baseline: t5 = 0.96, p=0.38) (Figure 2I,J). Thus, CB1 receptor-dependent eCB-LTD at OFC-D1 SPNs synapses also remains intact after CIE exposure.

While CB1 receptor function and eCB-mediated plasticity at OFC-D1 SPN synapses was still intact in CIE mice, we wondered whether regulation of eCB signaling may be disrupted following CIE. Prior work in striatum has shown that activation of D1 receptors prevents the expression of eCB-LTD (Trusel et al., 2015; Wu et al., 2015). Additionally, positive-timing induction protocols in striatum require D1 receptor blockade for the expression of CB1 receptor dependent LTD (Shen et al., 2008). Based on these previous findings, we reasoned that D1 receptor activation may negatively regulate eCB-LTD. We found that activation of D1 receptors by bath application of SKF 81297 (3 µM) in Air controls blocked DHPG-LTD (96.32 ± 10.41% of baseline at 30–40 min, n = 6, paired t-test vs. baseline: t5 = 0.35, p=0.74) but had no effect in CIE-exposed mice (56.12 ± 4.85% of baseline at 30–40 min, n = 7, paired t-test vs. baseline: t6 = 9.06, p<0.001). The LTD that CIE mice continued to express was still CB1 receptor dependent, as bath application of SR141716 (1 µM) blocked the DHPG- LTD (104.2 ± 11.55% baseline at 30–40 min, n = 5, paired t-test vs. baseline: t4 = 0.37, p=0.73) (Figure 2K,L).

This effect may not be specific to OFC-D1 SPNs and could also be present at any glutamatergic input onto D1 SPNs. However, evoked EPSCs by electrical stimulation in a subset of mice where OFC-DS transmission was optically assessed, showed a similar magnitude of DHPG-LTD in both Air and CIE-exposed mice that was CB1 receptor dependent (Figure 2—figure supplement 1A–C). Furthermore, DHPG-LTD of electrically evoked EPSCs was blocked by SKF 81297 in Air mice (Figure 2—figure supplement 1F,G). While activation of D1 receptors with bath application of DHPG did result in a short-term depression in CIE, electrically evoked EPSC amplitudes returned to baseline at 30–40 min (Figure 2—figure supplement 1F,G) in contrast to the long-lasting effect observed when optically evoking EPSCs (Figure 2K,L). Additionally, we found that this short-term depression was sensitive to a CB1 receptor antagonist, as we observed no short- or long-term changes in EPSC amplitude in the presence of SR141716 (1 µM) (Figure 2—figure supplement 1F,G). Thus, the disrupted D1 receptor regulation of long-term eCB plasticity appears to be at least somewhat selective to OFC-D1 SPNs, as we do not see similar changes when examining excitatory input generally. These results raise the hypothesis that D1 receptor signaling may be altered in CIE-exposed mice.

Chronic alcohol exposure disrupts D1 receptor function

D1 receptors may negatively modulate eCB signaling as activation of D1 receptors increases spontaneous presynaptic neurotransmitter release (André et al., 2010), decreases eCB content (Patel et al., 2003), and prevents the expression of eCB-LTD (Wu et al., 2015). To determine whether CIE alters D1 receptor activity, we used both an in vitro and in vivo assay of D1 receptor function. First, D1 receptor activation in vitro has been shown to increase excitability of D1 SPNs (Hernández-López et al., 1997; Planert et al., 2013; Surmeier et al., 2007). Similar to previous findings, we found that washing on D1 agonist, SKF 81297 (10 µM), enhanced D1 SPN action potential firing in response to current injections in the DS of Air controls (repeated-measures ANOVA: interaction (Current × SKF 81297): F(10, 90)=5.72, p<0.0001; main effect of Current: F(10, 90)=22.46, p<0.0001; main effect of SKF 81297: F(1, 9)=32.37, p=0.0003) (Figure 3A). In contrast, bath application of SKF 81297 had no effect on D1 SPN excitability in CIE mice (repeated-measures ANOVA: no interaction [Current × SKF 81297]: F(10, 80)=0.28, p=0.98; main effect of Current: F(10, 80)=29.87, p<0.0001, but no main effect of SKF 81297: F(1, 8)=0.09, p=0.77) (Figure 3B). This suggests that D1 receptor signaling is disrupted in CIE mice and is in line with our previous data in which a D1 agonist failed to modulate DHPG-LTD in CIE mice (Figure 2K,L).

Figure 3. D1 receptor function is disrupted in CIE-exposed mice.

(A) Number of action potentials plotted against current injected under baseline conditions and in the presence of D1 agonist, SKF81297 (10 µM) (left) and sample traces (right) in D1 SPNs of Air (n = 10 cells, three mice) and (B) CIE (n = 9 cells, four mice). Scale bars represent 100 ms (horizontal) and 20 mV (vertical). (C) Schematic of unilateral microinjections (300 nL) of SKF 81297 (5 µg/µl) in the left hemisphere in the DS and predicted bias toward clockwise (CW) rotations. (D) Number of CW rotations and counterclockwise (CCW) rotations in Air (n = 8 mice) and CIE (n = 6 mice) (two vapor cohorts) mice counted over an hour session. (E) Total rotations (CW+CCW) in Air and CIE mice. (F) Percentage of CW turns in Air and CIE mice. Data points and bar graphs represent mean ± SEM. Bonferroni-corrected post hoc test of repeated-measures ANOVA or unpaired two-tailed t-test, *p≤0.05, **p≤0.01, ***p≤0.001, ****p≤0.0001.

Figure 3—source data 1. D1 receptor function is disrupted in CIE-exposed mice source data.

Figure 3.

Figure 3—figure supplement 1. Unilateral D1 agonist injection and distance traveled.

Figure 3—figure supplement 1.

(A) Schematic representation of cannula tip placement in DS for unilateral injections. (B) Total distance traveled in Air and CIE mice. Bar graphs represent the mean ± SEM.
Figure 3—figure supplement 1—source data 1. Unilateral D1 agonist injection and distance traveled source data.
Figure 3—figure supplement 2. No change in DS D1 or D2 receptor expression in CIE mice compared to Air controls.

Figure 3—figure supplement 2.

(A) Example immunoblots for D1 and (B) D2 dopamine receptors (Air = 4, 5, 8, 9; CIE = 1, 2, 3, 6, 7). (C) Band density for D1 and (D) D2 receptors normalized to coflin averaged across two runs. Bar graphs represent the mean ± SEM.
Figure 3—figure supplement 2—source data 1. No change in D2 D1 or D2 receptor expression in CIE mice compared to Air controls source data.

To test whether CIE alters the function of DS D1 receptors in vivo, we relied on previous findings where an imbalance in striatal dopamine induced rotational behavior (Glick et al., 1976; Martín et al., 2008). We gave Air and CIE mice a unilateral microinjection of the D1 agonist, SKF 81297 (300 nL, 5 µg/µl), into the left hemisphere of the DS (Figure 3C, Figure 3—figure supplement 1), and we measured the subsequent number of clockwise (CW) and counterclockwise (CCW) rotations. The prediction is that an increase in unilateral dopamine signaling should normally lead to an increase in CW rotational behavior (Figure 3C). While Air mice showed the predicted increase in CW rotations, CIE mice showed reduced D1-agonist induced rotational behavior. A one-way ANOVA (F(3, 24)=9.31, p<0.001) showed that Air controls had significantly more CW rotations than CCW rotations (Bonferroni corrected, p<0.001) and had more CW rotations compared to CIE mice (Bonferroni corrected, p<0.05) (Figure 3D). Importantly, there was no difference in the total number of rotations between Air and CIE mice (Air: 62.13 ± 4.54, n = 8; CIE: 57.67 ± 7.76, n = 6; unpaired t-test: t12 = 0.53, p=0.61) (Figure 3E). However, the percentage of CW rotations out of total rotations in CIE mice was significantly lower compared to Air controls (Air: 73.72 ± 4.08, n = 8; CIE: 47.72 ± 4.21, n = 6; unpaired t-test: t12 = 4.36, p<0.001) (Figure 3F). The blunted D1 receptor agonist response in CIE mice could be due to alterations in D1 receptor expression; however, Western blot analysis showed no change in D1 receptor expression in the DS between CIE and Air mice (Figure 3—figure supplement 2). Together, these data provide evidence that CIE results in disruption of D1 receptor function downstream of D1 receptor activation.

Chronic alcohol exposure enhances depolarization-induced suppression of excitation of OFC transmission to D1 SPNs

Given the chronic alcohol exposure-induced long-lasting disruption of D1 receptor function and the role of D1 receptors in the negative modulation of eCB signaling (André et al., 2010; Patel et al., 2003), we hypothesized that we may see an increase in eCB signaling in CIE mice. To test for CIE-induced changes in eCB signaling in OFC-D1 SPN transmission, we used depolarization-induced suppression of excitation (DSE), a short-term form of eCB plasticity (Diana and Marty, 2004). To induce DSE of optically evoked EPSCs from OFC terminals (Figure 4A), D1 SPNs were depolarized to +50 mV for 10 s which resulted in a short-term decrease of EPSC amplitudes in both Air (DSE Air: 25.78 ± 3.82, n = 8, three mice, paired t-test vs. baseline: t7 = 6.75, p<0.001) and CIE mice (DSE CIE: 42.10 ± 8.25, n = 9, three mice, paired t-test vs. baseline: t8 = 15.31, p<0.0001) that was CB1 receptor dependent (bath application of SR141716 [1 µM], DSE Air: 2.71 ± 3.09, n = 5, paired t-test vs. baseline: t4 = 0.88, p=0.43; DSE CIE: 1.77 ± 5.20, n = 6, paired t-test vs. baseline: t5 = 0.34, p=0.75) (Figure 4B,C). We found that CIE mice showed a larger magnitude of DSE than did Air controls (unpaired t-test Air vs. CIE: t15 = 3.53, p<0.01), indicating a CIE-induced increase in eCB signaling in OFC-D1 SPN transmission (Figure 4B,C). To again determine whether this effect is selective for OFC-D1 input, we examined DSE in electrically evoked EPSCs (Figure 4D) and found DSE in both Air (DSE Air: 32.96 ± 2.50, n = 6, three mice, paired t-test vs. baseline: t5 = 13.19, p<0.0001) and CIE mice (DSE CIE: 36.84 ± 7.00, n = 6, three mice, paired t-test vs. baseline: t5 = 5.27, p<0.01), that was also sensitive to CB1 receptor blockade (DSE Air: 2.40 ± 4.88, n = 6, paired t-test vs. baseline: t5 = 0.49, p=0.64; DSE CIE: 3.97 ± 1.94, n = 5, paired t-test vs. baseline: t4 = 2.05, p=0.11). In contrast to optically evoked EPSCs at OFC-D1 SPNs synapses (Figure 4A–C), Air and CIE mice showed a similar magnitude of DSE when EPSCs were electrically evoked (unpaired t-test Air vs. CIE: t10 = 0.52, p=0.61) (Figure 4D–F). Furthermore, the CIE-induced increase in DSE was cell-type selective as we observed no difference in DSE of D2 SPNs of Air and CIE mice with either optical stimulation of OFC terminals or local electrical stimulation (Figure 4—figure supplement 1). The CIE-induced increase in DSE was seen throughout the protracted withdrawal period (Figure 4—figure supplement 2). These results suggest that the increase in DSE of OFC transmission onto D1 SPNs following chronic alcohol exposure is at least partially selective to the OFC input.

Figure 4. CIE-induced enhancement of DSE at OFC terminals to D1 SPNs in the DS.

(A) Schematic of viral injections in OFC and optical stimulation of OFC terminals during whole-cell recordings of D1 SPNs in the DS. (B) Depolarization-induced suppression of excitation (DSE) of OFC inputs to D1 SPNs in Air (n = 8 cells, three mice) and CIE (n = 9 cells, three mice). (C) Bar graphs representing the percentage change from baseline immediately after depolarization. (D) Schematic of local electrical stimulation during whole-cell recordings of D1 SPNs in the DS. (E) DSE of excitatory inputs to D1 SPNs using electrical stimulation in Air (n = 6 cells, two mice) and CIE (n = 6 cells, three mice). (F) Bar graphs representing the percentage change from baseline immediately after depolarization. Scale bars represent 10 ms (horizontal) and 50 pA (vertical). Data points and bar graphs represent mean ± SEM. Scale bars represent 10 ms (horizontal) and 50 pA (vertical). Unpaired (Air vs. CIE) or paired (vs. baseline) two-tailed t-test, **p≤0.01, ***p≤0.001, ****p≤0.0001.

Figure 4—source data 1. CIE-induced enhancement of DSE at OFC terminals to D1 SPNs in the DS source data.

Figure 4.

Figure 4—figure supplement 1. DSE in D2 SPNs and effect of SR141716 on DSE in D1 SPNs.

Figure 4—figure supplement 1.

(A) Depolarization-induced suppression of excitation (DSE) of OFC inputs to D2 SPNs in Air control and CIE mice. (B) Bar graphs representing the percentage change from baseline immediately after depolarization. (C) DSE of excitatory inputs to D2 SPNs using electrical stimulation in Air and CIE mice. (D) Bar graphs representing the percentage change from baseline immediately after depolarization. (E) DSE of OFC inputs to D1 SPNs in the presence of a CB1 receptor antagonist, SR141716 (1 µM) in Air and CIE mice. (F) Bar graphs representing the percentage change from baseline immediately after depolarization. (G) DSE of excitatory inputs to D1 SPNs using electrical stimulation in the presence of a CB1 receptor antagonist, SR141716 (1 µM) in Air and CIE mice. (H) Bar graphs representing the percentage change from baseline immediately after depolarization. Data points and bar graphs represent the mean ± SEM. Paired (vs. baseline) two-tailed t-test, **p≤0.01, ***p≤0.001.
Figure 4—figure supplement 1—source data 1. DSE in D2 SPNs and effect of SR141716 on DSE in D1 SPNs source data.
Figure 4—figure supplement 2. CIE-induced effects on D1 receptor function and DSE across recording days.

Figure 4—figure supplement 2.

(A) Experimental timeline that includes surgeries followed by four cycles of CIE exposure and whole-cell recordings 3–21 days in withdrawal. (B) Expression of DHPG-LTD in the presence of SKF 81297 was persistent across the withdrawal period. (C) SKF 81297 had no effect on D1 SPN firing of CIE mice across the withdrawal period. (D) DSE was increased in CIE mice across the withdrawal period.
Figure 4—figure supplement 2—source data 1. CIE-induced effects on D1 receptor function and DSE across recording days source data.

CB1 receptor antagonist restores OFC transmission and goal-directed control

The above data suggests an enhancement of eCB signaling at OFC-D1 SPN synapses in CIE mice. Given this, we reasoned that CB1 receptor antagonism may alter the aberrant in vivo activity seen in OFC-DS terminals and restore sensitivity to outcome devaluation. However, the prevalence of CB1 receptors in cortico-basal ganglia circuits makes indirect actions on circuit activity also likely, which could obscure or counter effects at OFC-DS terminals. To examine this, we gave systemic injections of Vehicle or CB1 antagonist, SR141716 (3 mg/kg), prior to operant sessions and measured subsequent calcium activity in OFC-DS terminals during behavior (Figure 5A). Similar to prior findings that blocking CB1 receptors does not interfere with the ability to lever press (Hilário et al., 2007), we also found that systemic administration of SR141716 had no obvious effect on gross lever press-related behavior (Figure 5—figure supplement 1).

Figure 5. CB1 receptor antagonist restores OFC terminal activity for outcome encoding.

(A) Schematic that depicts injections with vehicle or SR141716 followed by an operant session under a random ratio 20 (RR20) schedule of reinforcement in Air (n = 9 mice, three vapor cohorts) and CIE (n = 11 mice, three vapor cohorts). (B) Z-score of ΔF/F GCaMP6s trace recorded in OFC terminals during lever press bouts in mice injected with vehicle or SR141716 including all event trials and (C) net area under the curve (AUC) in Air mice. (D) Z-score of ΔF/F GCaMP6s traces during lever press bouts including session averages and (E) net AUC of Z-score GCaMP6s signal during lever press bouts in Air mice. (F) Z-score of ΔF/F GCaMP6s trace recorded in OFC terminals during lever press bouts in mice injected with vehicle or SR141716 including all event trials and (G) net area under the curve (AUC) in CIE mice. (H) Z-score of ΔF/F GCaMP6s traces during lever press bouts including session averages and (I) net AUC of Z-score GCaMP6s signal during lever press bouts in CIE mice. (J) Z-score of ΔF/F GCaMP6s trace recorded in OFC terminals during the first head entry after outcome delivery in mice injected with vehicle or SR141716 including all event trials and (K) net AUC for each signal in Air mice. (L) Z-score of ΔF/F GCaMP6s trace recorded in OFC terminals during the first head entry after outcome delivery in mice injected with vehicle or SR141716 including session averages and (M) net AUC for each signal in Air mice. (N) Z-score of ΔF/F GCaMP6s trace recorded in OFC terminals during the first head entry after outcome delivery in mice injected with vehicle or SR141716 including all event trials and (O) net AUC for each signal in CIE mice. (P) Z-score of ΔF/F GCaMP6s trace recorded in OFC terminals during the first head entry after outcome delivery in mice injected with vehicle or SR141716 including session averages and (Q) net AUC for each signal in CIE mice. Data points and bar graphs represent mean ± SEM. Two-sided FDR-corrected permutation test or paired two-tailed t-test, *p≤0.05, **p≤0.01, ****p≤0.0001.

Figure 5—source data 1. CB1 receptor antagonist restores OFC terminal activity for outcome encoding source data.

Figure 5.

Figure 5—figure supplement 1. CB1 antagonist did not alter lever press behavior between training days.

Figure 5—figure supplement 1.

(A) Experimental timeline that includes training after outcome devaluation and injections with vehicle or SR141716. (B) Lever presses, (C) response rate, (D) reinforcers earned, and (E) head entries across all training days and retraining and testing days following outcome devaluation. (F) The number of lever presses, (G) response rate, (H) reinforcers earned, and (I) head entries during vehicle or SR141716 injection test sessions in Air and CIE mice. Data points across training days represent the mean ± SEM.
Figure 5—figure supplement 1—source data 1. CB1 antagonist did not alter lever press behavior between training days source data.
Figure 5—figure supplement 2. OFC terminal activity in DS with vehicle or CB1 antagonist.

Figure 5—figure supplement 2.

(A) Z-score of ΔF/F GCaMP6s trace including all event trials in vehicle or SR141716 test sessions during lever presses (two-sided FDR-corrected permutation test, Vehicle: n = 3858; SR141716: n = 3773) and (B) outcome delivery (two-sided FDR-corrected permutation test, Vehicle: n = 177; SR141716: n = 160) in Air mice. (C) Z-score of ΔF/F GCaMP6s trace including all event trials in vehicle or SR141716 test sessions during lever presses (two-sided FDR-corrected permutation test, Vehicle: n = 5420; SR141716: n = 4721) and (D) outcome delivery (two-sided FDR-corrected permutation test, Vehicle: n = 274; SR141716: n = 217) in CIE mice. (E) Z-score of ΔF/F GCaMP6s trace including the session average for each Air mouse in vehicle or SR141716 test sessions during lever presses (two-sided FDR-corrected permutation test, Vehicle: n = 8; SR141716: n = 10) and (F) outcome delivery (two-sided FDR-corrected permutation test, Vehicle: n = 8; SR141716: n = 10). (G) Z-score of ΔF/F GCaMP6s trace including the session average for each CIE mouse in vehicle or SR141716 test sessions during lever presses (two-sided FDR-corrected permutation test, Vehicle: n = 11; SR141716: n = 10) and (H) outcome delivery (two-sided FDR-corrected permutation test, Vehicle: n = 12; SR141716: n = 9). Two-sided permutation test, *p≤0.05.
Figure 5—figure supplement 2—source data 1. OFC terminal activity in DS with vehicle or CB1 antagonist source data.

We first looked at terminal activity during lever press bouts, where we previously saw slightly enhanced activity in CIE mice (Figure 1N,O). In Air control mice, systemically blocking CB1 receptors slightly decreased OFC terminal activity during lever press bouts (FDR-corrected two-sided permutation test, all event trials: Vehicle: n = 462; SR141716: n = 359; session average: Vehicle: n = 7; SR141716: n = 7) (Figure 5B,D). This was a significant decrease in Net AUC with all event trials (Vehicle: −0.21 ± 0.12, n = 462; SR141716: −1.42 ± 0.16, n = 359; unpaired t-test: t819 = 6.27, p<0.0001) (Figure 5C), but not session average (paired t-test: t6 = 1.64, p=0.15) (Figure 5E). Similarly, CIE mice also showed a slight decrease in OFC terminal activity prior to lever press bout onset with CB1 receptor blockade (FDR-corrected two-sided permutation test, all event trials: Vehicle: n = 536; SR141716: n = 385; session average: Vehicle: n = 7; SR141716: n = 7) (Figure 5F,H). However, this did not lead to a significant difference in Net AUC analyses with all event trials (Vehicle: 0.04 ± 0.09, n = 536; SR141716: −0.17 ± 0.14, n = 385; unpaired t-test: t919 = 1.50, p=0.14) (Figure 5G), although there was a trend when examining session average per mouse (paired t-test: t6 = 2.24, p=0.06) (Figure 5I). As blocking CB1 receptors at OFC-DS terminals should prevent any eCB-related decrease in calcium activity (Kreitzer and Regehr, 2001), this suggests the slightly reduced lever-press-related OFC-DS terminal activity is due to eCB actions on broader circuit activity.

However, eCBs are thought to be released at glutamatergic synapses in DS in an activity-dependent manner (Adermark et al., 2009; Adermark and Lovinger, 2009), and we saw larger increases in OFC-DS terminal activity during periods of associated with outcome retrieval. Hence, we also examined effects of blocking CB1 receptors on the reduced outcome retrieval activity observed in Air and CIE mice (Figure 1P,Q). Systemic administration of the CB1 antagonist in Air controls resulted in a brief, slight increase OFC terminal activity prior to the first head entry after reinforcer delivery (FDR-corrected two-sided permutation test, all event trials: Vehicle: n = 177; SR141716: n = 162; session average: Vehicle: n = 8; SR141716: n = 8) (Figure 5J,L), but this was not sufficient to induce a difference in Net AUC examining all event trials (Vehicle: 1.05 ± 0.28, n = 177; SR141716: 1.15 ± 0.32, n = 162; unpaired t-test: t337 = 0.24, p=0.81) (Figure 5K) or session average per mouse (paired t-test: t7 = 0.64, p=0.54) (Figure 5M). In contrast, systemic administration of the CB1 antagonist in CIE mice produced a significant and sustained increase in OFC terminal activity during this period of outcome retrieval (FDR-corrected two-sided permutation test, all event trials: Vehicle: n = 275; SR141716: n = 219; session average: Vehicle: n = 10; SR141716: n = 10) (Figure 5N,P). This was also reflected in the significant increase in net AUC with all event trials (Vehicle: 0.09 ± 0.18, n = 275; SR141716: 0.83 ± 0.16, n = 219; unpaired t-test: t492 = 3.10, p<0.01) (Figure 5O) and with session average per mouse (paired t-test: t9 = 2.22, p=0.054) (Figure 5Q). Thus, it appears that systemic administration of a CB1 antagonist produces different effects on OFC-DS terminal activity dependent on the behavior. Notably, we see a large CB1 receptor antagonist-induced increase in OFC-DS terminal activity during periods associated with outcome retrieval in CIE mice, in which OFC-DS terminal is normally blunted and endocannabinoid signaling is enhanced.

Given this, we hypothesized that blocking eCB signaling in DS would restore sensitivity to outcome devaluation in chronic alcohol-exposed mice. Two separate cohort replicates of mice were implanted with bilateral cannula targeting DS (Figure 6—figure supplement 1) and then exposed to Air or CIE followed by lever press training for a food outcome (Figure 6A,B) as previously described (Figure 1). Air and CIE mice pressed the lever at similar levels (repeated-measures ANOVA: main effect of Training day: F(8, 240)=161.5, p<0.0001; no main effect of CIE exposure or interaction: Fs < 0.13) (Figure 6C) and earned a similar number of outcomes across training (repeated-measures ANOVA: main effect of Training day: F(8, 240)=71.45, p<0.0001; no main effect of CIE exposure or interaction: Fs < 0.18) (Figure 6D). As described above, lever presses were organized into bouts (Figure 1). Similar to the previous cohorts, we found that lever press bouts were differentially executed in Air and CIE mice in that CIE mice had slightly less presses per bout, shorter inter-press intervals, and, consequently, shorter bout durations (Figure 6—figure supplement 1).

Figure 6. Blockade of CB1 receptors in the DS restores value-based decision-making in CIE mice.

(A) Experimental timeline that includes surgeries followed by four cycles of CIE exposure, operant training and outcome devaluation. (B) Schematic of mouse lever press training in which Air (Vehicle: n = 7 mice, SR141716: n = 8 mice, three vapor cohorts) and CIE mice (Vehicle: n = 8 mice, SR141716: n = 9 mice, three vapor cohorts) were trained under a random ratio schedule of reinforcement for a food outcome. (C) Lever presses during operant training in Air and CIE mice. (D) Outcome earned during operant training in Air and CIE mice. (E) Schematic of outcome devaluation procedure that includes 1 hr access to sucrose (valued) or the outcome earned during training (devalued) followed by a 5 min extinction test. (F) Normalized lever presses showing the distribution of lever pressing between the valued and devalued day in Air and CIE mice that received microinjections (300 nL) of vehicle or SR141716 (2 µM). Data points and bar graphs represent mean ± SEM. Bonferroni-corrected post hoc test of repeated-measures ANOVA, *p≤0.05, ***p<0.001.

Figure 6—source data 1. Blockade of CB1 receptors in the DS restores value-based decision-making in CIE mice source data.

Figure 6.

Figure 6—figure supplement 1. Behavioral data during lever press training and outcome devaluation in treatment and control groups of Air and CIE mice.

Figure 6—figure supplement 1.

(A) Schematic representation of bilateral cannula placement in DS. (B) Lever presses across training days for each treatment group (repeated-measures ANOVA: main effect of Time: F(8, 224)=157.0, p<0.0001; no main effect of Treatment or interaction, Fs < 1.08). (C) Response rate of lever pressing during acquisition (repeated-measures ANOVA: main effect of Time: F(8, 224)=86.80, p<0.0001; no main effect of Treatment or interaction, Fs < 1.10). (D) Reinforcers earned across training days (repeated-measures ANOVA: main effect of Time: F(8, 224)=70.03, p<0.0001; no main effect of Treatment or interaction, Fs < 1.40). (E) Head entries across training days (repeated-measures ANOVA: main effect of Time: F(8, 224)=10.68, p<0.0001; no main effect of Treatment or interaction, Fs < 1.02). (F) Average number of lever press bouts (Air: 54.58 ± 2.83, n = 60; CIE: 55.54 ± 2.61, n = 68; unpaired t-test: t126 = 0.25, p=0.80), (G) lever presses per bout (Air: 8.52 ± 0.12, n = 3275; CIE: 8.10 ± 0.10, n = 3777; unpaired t-test: t7050 = 2.72, p<0.01), (H) inter-press interval per lever press bout (Air: 1.56 ± 0.03, n = 3275; CIE: 1.37 ± 0.05, n = 3777; unpaired t-test: t7050 = 3.33, p<0.001), and (I) bout duration in Air and CIE mice (Air: 10.72 ± 0.23, n = 3275; CIE: 9.34 ± 0.27, n = 3777; unpaired t-test: t7050 = 3.85, p<0.001). (J) Consumption of sucrose and pellets on valued and devalued days respectively (repeated-measures ANOVA: no interaction or main effects of Time or Treatment, Fs < 0.41). (K) Lever presses on Valued and Devalued days for each treatment group (repeated-measures ANOVA: Valuation state × Treatment interaction: F(3, 28)=3.28, p<0.05; main effect of Valuation state: F(1, 28)=23.71, p<0.0001; no main effect of Treatment F < 0.49). Data points and bar graphs represent the mean ± SEM. Bonferroni corrected post hoc test of repeated-measures ANOVA or unpaired two-tailed t-test, *p<0.05, **p<0.01, ***p<0.001.
Figure 6—figure supplement 1—source data 1. Behavioral data during lever press training and outcome devaluation in treatment and control groups of Air and CIE mice source data.

We then examined the sensitivity of lever pressing to changes in expected outcome value using outcome devaluation procedures (Figure 6E). Just prior to outcome devaluation procedures, mice were injected with either vehicle or CB1 antagonist, SR141716, directly in the DS. We saw that in Air mice, intra-DS SR141716 had no effect on outcome devaluation responding. Once again, CIE-exposed mice treated with vehicle did not show sensitivity to outcome devaluation. However, CIE mice that were given an injection of SR141716 in the DS significantly decreased lever press responding on the devalued day relative to the value day (Figure 6F). A repeated-measures ANOVA showed a significant interaction (Training day × Treatment: F(3, 28)=4.41, p=0.01) and main effect of Devaluation day (F(1, 28)=33.28, p<0.0001) but no main effect of Treatment (F < 0.1). Thus, locally blocking CB1 receptors in the DS of CIE mice was sufficient to restore value-based decision-making.

Discussion

Often differing symptoms in psychiatric conditions produce opposing changes in the activity of a brain area. As there is increasing potential for cortical activity modulation to be used in the treatment of psychiatric disorders, we need a greater understanding of how such cortical modulation will affect downstream circuit engagement and information representation. Here we find that chronic alcohol exposure enhances activity associated with actions in OFC terminals in striatum, while also reducing activity associated with outcome retrieval. We identify one mechanism potentially responsible for the reduced OFC-DS activity specific to epochs of outcome retrieval: a disruption in D1 receptor function in SPNs and an enhancement of eCB signaling that reduces OFC-D1 SPN transmission in the DS (Figure 7). Restoration of OFC-DS transmission through CB1 receptor blockade restored use of reward value to control decision-making. Thus, CIE induces long-lasting, computational-specific changes to cortical transmission in part through cell-type, synapse-specific changes in postsynaptic modulation of this cortical terminal release.

Figure 7. Summary of hypothesized alterations in D1 receptor function and endocannabinoid signaling in D1 SPNs of CIE-exposed mice.

Figure 7.

In Air controls (left), D1 receptor activity regulates endocannabinoid production and/or release. In CIE mice (right), the loss of D1 receptor function leads to unregulated endocannabinoid signaling and the long-term reduction in glutamate release from OFC inputs.

We observed OFC-DS terminal activity profiles reminiscent of common OFC observations in humans: OCD-related hyperactivity (Milad and Rauch, 2012; Pauls et al., 2014; Robbins et al., 2019), the hallmark representation of reward evaluation (Fellows, 2007; Wallis, 2007), and dependence-induced hypoactivity (Catafau et al., 1999; Lüscher et al., 2020; Schoenbaum et al., 2016; Schoenbaum et al., 2006; Volkow et al., 1997; Volkow and Fowler, 1994). CIE mice showed slightly increased OFC terminal activity during lever press behavior (Figure 1), similar to the increased OFC-DS activity reported in compulsive phenotypes (Pascoli et al., 2018). While we did not directly measure markers of compulsivity in the present studies, we have previously reported that CIE procedures can increase response rates for both food outcomes (Renteria et al., 2018) and alcohol self-administration (Renteria et al., 2020). This raises the hypothesis that OFC-DS transmission has a role in alcohol-induced compulsive seeking. However, while this increased OFC terminal activity may drive the increase in response rate in CIE mice, it may not influence reward evaluation processes. We previously observed no correlation between response rate and the sensitivity to outcome devaluation following CIE (Renteria et al., 2018). Hence action control and the evaluation of outcome value may be differentially affected by alcohol dependence. That we also found reduced OFC-DS terminal activity in CIE mice during periods associated with outcome retrieval shows how population output activity may be differentially modulated depending on the behavior measured. The observations that chronic alcohol exposure significantly decreased reward-related OFC-DS terminal activity, but slightly increased action-related activity, raise one hypothesis that CIE differentially alters afferent activity into OFC thereby altering the capability of OFC-DS projections to send action and outcome information into basal ganglia circuits. Furthermore, the restoration of OFC terminal activity with a CB1 antagonist during outcome retrieval (Figure 5) and rescue of goal-directed control when injected directly in DS (Figure 6), suggests there also is additional local synaptic modulation of OFC-DS terminal activity during behavior. Interestingly, we found that OFC-DS terminal activity in Air controls, but not CIE mice, ramps up prior to head entries made after outcome delivery (Figure 1P,Q) and following outcome delivery itself (Figure 1—figure supplement 1). It is difficult to disentangle whether this is a predictive response, as mice can readily see into the head entry receptacle within the time frame where this rise occurs. It does suggest future works examining whether OFC-DS activity carries predictive information are warranted.

Our data suggests that CIE induces an increase in eCB signaling at OFC-D1 SPN synapses. While multiple aspects of eCB downstream signaling and plasticity were still intact, including CB1 receptor function and the ability to induce an eCB dependent LTD (Figure 2), upstream regulation of eCB signaling by D1 receptors was not. Using ex vivo brain slice preparations from CIE mice, we observed no effect of the D1 agonist SKF 81297 on DHPG-LTD or the excitability of D1 SPNs as previously reported (Hernández-López et al., 1997; Planert et al., 2013; Surmeier et al., 2007) and observed in our controls. In vivo unilateral microinjections of SKF 81297 resulted in rotation behavior that was significantly attenuated in CIE mice and there was no change in D1 receptor expression (Figure 3—figure supplement 2). These results suggest that CIE disrupts the signaling cascade downstream of D1 receptor activation. Indeed, acute ethanol has been shown to modulate downstream targets of D1 receptor activation (Ron and Barak, 2016; Ron and Messing, 2013), including cellular inhibition of specific protein kinase C (PKC) isoforms that constitutively phosphorylate D1 receptors to dampen D1 signaling (Rex et al., 2008). Withdrawal was found to decrease PKC isoforms in the BLA (Christian et al., 2012), although whether similar mechanisms occur in dorsal striatum is not known. As our measures of D1 receptor function were done 3–21 days in withdrawal from chronic ethanol exposure, this evidence raises one hypothesis that the present disruption in D1 receptor function observed may reflect neuroadaptive compensatory changes in PKC activity. Others have shown, in postmortem brain sections taken from human alcoholics, a downregulation of D1 receptor binding sites in the striatum (Hirth et al., 2016). In addition, rats exposed to CIE had an increase of dopamine release in the nucleus accumbens shell, but showed a blunted response to D1 receptor activation 21 days in withdrawal (Hirth et al., 2016). While plenty of data suggest that chronic alcohol exposure alters dopamine systems, the changes likely depend on the synapse, cell-type, and circuit.

The loss of D1 receptor function in SPNs of CIE exposed mice may be directly related to the enhancement of eCB signaling (Figures 4 and 7). Previous works have shown dopamine and eCB signaling interact in which D1 receptor activation may reduce anandamide (AEA) signaling (André et al., 2010; Patel et al., 2003) (but see Giuffrida et al., 1999), although at what point D1 signaling could interfere with AEA synthesis or release is not clear. However, DSE has widely been shown to be mediated by 2-arachidonoyl glycerol (2-AG), and a CIE-induced increase in 2-AG may contribute to the present effects. Others have shown that habitual alcohol seeking may be mediated by an increase in 2-AG, as an inhibitor, DO34, was found to reduce habitual responding (Gianessi et al., 2020). Thus, future work examining the contribution of different eCBs is warranted.

Conversely, the eCB system has been extensively studied in the modulation of dopamine transmission (Covey et al., 2017), further underscoring the complex interactions of these signaling systems. For example, as eCBs can reduce dopamine release in nucleus accumbens through reducing cortical drive (Mateo et al., 2017), there may also be a CIE-induced decrease in endogenous D1R activation at OFC-D1 SPN synapses. Given the large number of synaptic and molecular targets that chronic ethanol exposure can modulate (Abrahao et al., 2017; Lovinger and Roberto, 2013; Roberto and Varodayan, 2017), the synapse and cell-type-specific alterations in orbitostriatal transmission that we have observed are likely the result of ethanol-induced changes in many neuromodulatory and synaptic signaling systems.

The CIE-induced alterations in eCB-mediated signaling and plasticity presented here were found to be at least, in part, specific to the OFC input. Local electrical stimulation (Figure 2—figure supplement 1) likely includes OFC inputs in addition to excitatory inputs from other cortical and limbic regions as well as thalamic nuclei (Wall et al., 2013). It is possible that other excitatory inputs to D1 SPNs are differentially modulated by chronic alcohol exposure such that changes at individual inputs may be masked with electrical stimulation. However, enhanced eCB signaling is not expected to apply to thalamostriatal inputs to D1 SPNs, as they lack CB1 receptors at presynaptic terminals (Wu et al., 2015). Evidence for a CIE-induced change in other excitatory inputs to D1 SPNs may be supported by the DHPG-induced short-term depression observed with electrical stimulation in the presence of a D1 agonist (Figure 2—figure supplement 1). As the observed short-term depression was found to be sensitive to a CB1 receptor antagonist, it likely involves a change in eCB signaling, although the mechanism may differ from that observed with long-term depression (Castillo et al., 2012; Heifets and Castillo, 2009; Kano et al., 2009; Lovinger, 2008).

Interestingly, we found the effects of CB1 antagonist on OFC-DS terminal activity to be dependent on the specific component of decision-making represented. Systemic injection of CB1 receptor antagonist was found to decrease OFC terminal activity in DS during initiation of lever press bouts in both Air and CIE mice (Figure 5), suggesting the effect the antagonist exerted was not through activation of CB1 receptors on OFC terminals. One explanation could be a disinhibition of inhibitory signaling onto OFC cell bodies or terminals in DS. CB1 receptors are expressed on GABAergic terminals in cortex which, when blocked with a CB1 antagonist, would allow for inhibition of cortical projection neurons. Additionally, in the striatum, CB1 receptors are expressed not only on cortical terminals, but also on SPNs and GABAergic interneurons (Hu and Mackie, 2015), which may provide local regulation of excitatory transmission. In contrast, blocking CB1 receptors had little effect on OFC-DS transmission during outcome retrieval epochs in Air controls but significantly increased activity in CIE mice. These observations suggest that differences in instrumental responding and outcome encoding may result from differentially engaged local regulation of OFC-DS transmission. As prior work showed a dependency of outcome devaluation on the activity of OFC-DS terminals (Gremel et al., 2016), it is important to note that our current findings suggest that OFC may pass into striatum information pertaining to more than one type of computation employed during outcome devaluation.

The present mechanistic findings, which are not only synapse, cell-type, and projection specific, but likely also dependent on input engagement or computation performed, demonstrate the complexity of how circuits represent information. A circuit informational approach (Lovinger and Gremel, 2021,Benjamini and Hochberg, 1995), aimed at identifying the strength and pattern of incoming transmission, recruited plasticity mechanisms, and capabilities of local circuit modulation to affect transmission, increasingly appears necessary in order to understand disease-induced disruptions to behavior. Here we further our understanding of how OFC-DS circuitry regulates decision-making, building off of previous works (Gremel and Costa, 2013; Gremel et al., 2016) to show this pathway carries both action and reward information. Importantly, we now show that action and reward information can recruit different downstream mechanisms depending on the pattern of activation. However, much work remains to be done to understand the neuroadaptations induced by CIE that result in aberrant decision-making. For example, what does the reduction in transmission from one cortical area mean for SPN output in the DS? Results in the present study focus on the effects of CIE on OFC-D1 SPN transmission in the DS, but the loss of value-based decision-making likely also involves changes to habitual processes for which the neurocircuitry is less understood. Work performed in non-human primates suggests chronic heavy alcohol consumption increases excitatory drive and decreases local inhibition within the putamen or dorsolateral striatum (DLS), an area supporting habitual action control (Cuzon Carlson et al., 2011). Thus, it may be that alcohol exposure results in a decreased ability of associative cortical input to control DS output as well as disinhibition of DLS. Furthermore, future work could investigate the time course and potential duration of or change in such effects, as the present study was limited to a 3–21 day window in withdrawal. Elucidating the long-term effects of alcohol dependence on cortico-striatal plasticity will further our understanding of the dependence-induced disruptions to decision-making processes that contribute to continual drug-seeking and taking behaviors.

Materials and methods

Mice

B6.Cg-Tg(Drd1a-tdTomato)6Calak/J (Shuen et al., 2008) and B6.FVB(Cg)-Tg(Drd1-cre)EY266Gsat/Mmucd mice were used for electrophysiological recordings, and C57BL/6J mice were used for behavioral experiments and fiber photometry. All mouse lines were obtained from Jackson Laboratory, and transgenic mouse lines were bred with C57Bl/6J mice (Jackson Laboratory) for one generation, in-house. Adult (>8 weeks) male and female mice were housed in groups of 2–4, with mouse chow (Purina 5015) and water ad libitum and were kept on a 14 hr light/10 hr dark cycle. Behavioral and physiology experiments were conducted during the light phase of the light cycle. All experiments were approved by the Institutional Animal Care and Use Committees of the University of California San Diego, and experiments were conducted according to NIH guidelines.

Viral injections

Mice were anesthetized with isoflurane and were given stereotaxically guided bilateral injections into the OFC (coordinates from Bregma in mm: anterior [A], 2.70; medial [M], ±1.65; ventral [V]: 2.65). D1-tdTomato mice, used for patch clamp recordings, were co-injected with 100 nL AAV5-CamKIIa-GFP-Cre and 100 nL AAV5-Ef1a-DIO-ChR2-eYFP in the OFC. D1-Cre mice received additional bilateral injections of AAV5-Flp-tdTomato in the DS (coordinates from Bregma in mm: anterior [A], 0.5; medial [M],±1.5; ventral [V]: 3.25). The coordinates target the medial DS from the perspective of the dorsal–ventral axis, and more medial DS with respect to the medial versus lateral striatal distinctions. For fiber photometry, C57BL/6J mice received unilateral injections with 500 nL AAV-hSyn-axon-GCaMP6s in the OFC. Viral spread and expression in OFC terminals were assessed by imaging the extent of fluorescence in brain slices (Olympus MVX10).

Chronic intermittent ethanol exposure and repeated withdrawal

Mice were exposed to four rounds of ethanol vapor or air and repeated withdrawal (Becker, 1994; Becker and Lopez, 2004; Griffin et al., 2009; Lopez and Becker, 2005). Each round consisted of 16 hr of vapor exposure followed by an 8 hr withdrawal, repeated for four consecutive days. Ethanol was volatilized by bubbling air through a flask containing 95% ethanol at a rate of 2–3 L/min. The resulting ethanol vapor was combined with a separate air stream to give a total flow rate of approximately 10 L/min, which was delivered to the mice housed in Plexiglas chambers (Plas Labs Inc). Blood ethanol concentrations (BEC) were collected at the end of each round from sentinel mice (mean BEC = 34.7 ± 2.0 mM). No pyrazole or loading ethanol injections were given prior to placement in vapor chambers (Renteria et al., 2018).

Operant behavior

Training was conducted as previously described (Gremel et al., 2016; Gremel and Costa, 2013; Renteria et al., 2018). Two days prior to training, mice were food restricted and maintained at ~ 90% of their baseline body weight throughout training and testing. Mice were placed in sound attenuating operant boxes (Med-Associates) and were trained to press a single lever (left or right) for a food reinforcer (chow pellet). Mice were first trained to retrieve the outcome, in the absence of levers, under a random time schedule (RT60) in which the outcome was delivered on average every 60 s. Mice were then trained on a continuous reinforcement schedule (CRF) in each context in which each lever press produced a single outcome, and the maximum number of reinforcers earned in three daily sessions being 5, 15, and 30, respectively. Following CRF training, mice were trained under a random ratio (RR) schedule of reinforcement. Mice received two days of training in RR10 (on average the 10th lever press produces the outcome), followed by 4 days under RR20. Sessions ended after 30 reinforcers were earned or after 60 min had elapsed. Lever press bouts were analyzed using custom scripts on Matlab (Renteria, 2021a, https://github.com/gremellab/Renteria-et-al.-2021-eLifeRenteria, 2021b; copy archived at swh:1:rev:1dc7163f31ac46ddfa75d6a9c9fb8841f25500a2). Briefly, we found the inter-press intervals for all lever presses within a session and calculated the mean. Bouts were separated by inter-press intervals that were greater than the mean inter-press interval. Only bouts of at least three lever presses were included for analysis.

Devaluation testing through sensory-specific satiation was conducted across 2 days: a valued day and a devalued day. Mice were allowed to prefeed ad libitum for 1 hr on the food pellet outcome previously earned by lever pressing (devalued day) or a 20% sucrose solution to control for satiety (valued day). Each day immediately following prefeeding, mice underwent a 5 min extinction test in which the number of lever presses made were recorded, but no outcome was delivered. Lever presses on the valued or devalued day were normalized to total lever presses (valued + devalued) across devaluation testing to account for differences in response rates across individual groups. Mice with optical fiber implants for OFC terminal recordings were retrained after outcome devaluation using RR20 schedule of reinforcement and showed response rates during retraining similar to responding prior to outcome devaluation (Figure 5—figure supplement 1).

Fiber photometry

Mice were injected with AAV-hSynapsin-axon-GCaMP6s in the OFC, and fibers were implanted in the DS (coordinates from Bregma in mm: anterior [A], 0.5; medial [M], −1.5; ventral [V]: 3.25). Fibers were attached for the last 5 days of operant training, beginning on the second day of random ratio training. A blue LED (470 nm, Thorlabs) was used for excitation of OFC terminals in the DS. Fluorescence emissions were collected with a bifurcated fiber (Thorlabs), which allowed for simultaneous recording of two mice. The dual-fiber core was focused through a 10× objective (Olympus) onto a CMOS camera (FLIR Systems). Fluorescence intensity and analog signals for lever press, head entries, and outcome delivery were acquired simultaneously, thresholded, and timestamped for later analyses using Bonsai software (Lopes et al., 2015). Photometry and behavioral data were imported to Matlab (Mathworks Inc, Natick, MA) and were analyzed based on prior works (Markowitz et al., 2018) using custom scripts (Renteria, 2021a, https://github.com/gremellab/Renteria-et-al.-2021-eLife). We did not employ a reference channel in the 405–410 nm wavelength range, as GCaMP6s has an excitation peak within that range. To account for photobleaching and signal decay across a session, we fit a double exponential line on the raw calcium fluorescence signal and normalized the signal to this fit. We then estimated baseline fluorescence (F0), by calculating the running 10th percentile of the fluorescence signal intensity range using a 15 s sliding window for the entire session. ∆F was calculated by subtracting F0 from the fitted signal, and then ∆F was normalized by ∆F/F0. Only sessions in which the 97.5% of ∆F/F0, across the entire session trace (independent of any behavioral response), exceeded a 1% change from baseline fluorescence were included (Markowitz et al., 2018). For each session, ‘trials’ were composed of peri-event signals from −5 to 5 s around event onset. In each ‘trial’, 50 ms bins were z-scored to the session’s mean pre-event baseline (−5 to −2 s). We analyzed this photometry data two ways. First, these z-scored ∆F/F0 traces were combined across all mice within a group. This was done to preserve the variance seen within a subject. Second, we averaged these z-scored ∆F/F0 traces for a given animal session and then averaged these traces across mice within a group. This examines between-mouse variability but does not preserve within-subject variability. Outliers, defined as peak amplitude Z-scores more than three scaled median absolute deviations (above and below) from the median, were removed from group data.

Brain slice preparation

Mice were at least 12 weeks of age at the time of slice preparation. Coronal slices (250 μm thick) containing the DS were prepared using a Pelco easiSlicer (Ted Pella Inc, Redding, CA). Mice were anesthetized by inhalation of isoflurane and brains were rapidly removed and placed in 4°C oxygenated ACSF containing the following (in mM): 210 sucrose, 26.2 NaHCO3, 1 NaH2PO4, 2.5 KCl, 11 dextrose, bubbled with 95% O2/5% CO2. Slices were transferred to an ACSF solution for incubation containing the following (in mM): 120 NaCl, 25 NaHCO3, 1.23 NaH2PO4, 3.3 KCl, 2.4 MgCl2, 1.8 CaCl2, 10 dextrose. Slices were continuously bubbled with 95% O2/5% CO2 at pH 7.4, 32°C and were maintained in this solution for at least 60 min prior to recording.

Patch clamp electrophysiology

Whole-cell patch clamp recordings were made in identified SPNs in the medial DS. Cells were identified using an Olympus BX51WI microscope mounted on a vibration isolation table. Prior to patching onto a cell, the presence of td-Tomato expression was used to verify cell-type (D1+ or D1−) as well as eYFP expression for terminal expression of ChR2. eYFP expression was never observed in SPN cell bodies. Recordings were made in ACSF containing (in mM): 120 NaCl, 25 NaHCO3, 1.23 NaH2PO4, 3.3 KCl, 0.9 MgCl2, 2.0 CaCl2, and 10 dextrose, bubbled with 95% O2/5% CO2. ACSF was continuously perfused at a rate of 2.0 ml/min and maintained at a temperature of 32°C. Picrotoxin (50 µM) was included in the recording ACSF to block GABAA receptor-mediated synaptic currents. Recording electrodes (thin-wall glass, WPI Instruments) were made using a PC-10 puller (Narishige International, Amityville, NY) to yield resistances of 3–6 MΩ. For current-clamp experiments, electrodes were filled with (in mM): 135 KMeSO4, 12 NaCl, 0.5 EGTA, 10 HEPES, 2 Mg-ATP, 0.3 Tris–GTP, 260–270 mOsm (pH 7.3). For voltage clamp experiments, electrodes were filled with (in nM): 120 CsMeSO4, 15 CsCl, 8 NaCl, 10 HEPES, 0.2 EGTA, 10 TEA-Cl, 4 Mg-ATP, 0.3 Na-GTP, 0.1 spermine, and 5 QX-314-Cl. Access resistance was monitored throughout the experiments. Cells in which access resistance varied more than 20% were not included in the analysis.

Glutamatergic afferents were stimulated either electrically or optically. For electrical stimulation, a stainless steel bipolar stimulating electrode (FHC, Inc) was placed dorsal to the recording electrode, about 150–300 μm from the cell body. Optical stimulation was done using 470 nm blue light (4 ms pulse width) delivered via field illumination using a high-power LED (LED4D067, Thor Labs). Light intensity was adjusted to produce optically evoked excitatory postsynaptic currents (EPSCs) with a magnitude of 100–300 pA. Recordings were made using a MultiClamp 700B amplifier (Molecular Devices, Union City, CA), filtered at 2 kHz, digitized at 10 kHz with Instrutech ITC-18 (HEKA Instruments, Bellmore, NY), and displayed and saved using AxographX (Axograph, Sydney, Australia). For long-term depression (LTD) experiments, EPSCs were evoked at 0.05 Hz for a 10 min baseline, followed by a 10 min bath application of (S)−3,5-dihydroxy-phenylglycine (S-DHPG) (50 µM) paired with depolarization to 0 mV. After S-DHPG application, EPSCs were monitored for 20 min at a rate of 0.05 Hz. Data were combined in 2 min bins, and EPSC amplitudes were normalized to the 10 min baseline period. The magnitude of LTD was calculated by averaging normalized EPSC values from 30 to 40 min. For current-clamp recordings, a fixed current was injected for 800 ms in 50 pA steps from −400 pA to +500 pA, and the number of action potentials was counted at each step. Current injections were conducted prior to (baseline) and after a 10–15 min wash on of the D1 receptor agonist SKF 81297 (10 µM). For depolarization-induced suppression of excitation (DSE), EPSCs were evoked at a rate of 0.2 Hz for a 30 s baseline followed by a 10 s depolarization to +50 mV. After depolarization, EPSCs were monitored for 120 s at 0.2 Hz. Data were averaged over three trials for each neuron. EPSC amplitudes were normalized to the average of the baseline period, and the magnitude of DSE was measured as the first point after depolarization. Data from each neuron within a treatment group was combined and presented as mean ± SEM.

Rotational behavior

To probe for changes in D1 receptor function in vivo, mice were tested over a 3-week period following the conclusion CIE exposure. After cannula-guided injections, mice were placed in a round container, and Bonsai software (Lopes et al., 2015) was used for video recording and tracking orientation (radians). The number of rotations was counted using custom code in MATLAB (Mathworks Inc, Natick, MA) in which the difference of orientation at each frame was added until the sum was greater than or equal to 6.28 radians or one full CW rotation. The sum total was reset to 0 with the count of each rotation. CCW turns were counted when the sum was less than or equal to −6.28 radians. Each mouse was tested once for 1 hr and then immediately sacrificed for brain extraction and confirmation of cannula placement.

Immunoblotting

Tissue punches from the medial DS were prepared using a disposable biopsy puncher (Integra, #3331-A) from fresh-frozen brains of CIE or air-exposed mice and kept at −80°C until further processing. Proteins were extracted in ice-cold lysis buffer containing 150 mM NaCl, 50 mM Tris–HCl, and 1% Triton X-100, pH 7.4 and supplemented with protease inhibitors (Sigma, Complete Mini Protease Inhibitor Cocktail, #11836153001) using a probe tip sonicator (Sonics, Vibra Cell). Homogenates were further incubated on a rotator for 1 hr at 4°C and centrifuged at 12,000 × g for 15 min at 4°C. Supernatant was kept, and protein concentration was measured using the BCA method (Pierce BCA Protein Assay, Thermo Fisher, #23225). Samples were incubated in 2× Laemmli Buffer (120 mM Tris–HCl, pH = 6.8, 20% [w/v] glycerol, 4% sodium dodecyl sulfate, 10% 2-mercaptoethanol, 0.02% bromphenol blue) for 15 min at 50°C, and 5 μg of protein was separated on a 4–20% sodium dodecyl sulfate polyacrylamide gel electrophoresis (Bio-Rad, #4561096) and blotted onto nitrocellulose membranes by wet transfer. Membranes were stained with Ponceau S solution to determine equal protein loading before blocking in 0.1% PBST (phosphate-buffered saline [PBS], 0.1% Tween-20) containing 5% non-fat dry milk for 1 hr at room temperature. Incubation with primary antibodies (guinea pig Anti-D1R 1:1000, Frontier Institute, #2571594; rabbit Anti-D2R 1:1000; Frontier Institute, #2571596; rabbit Anti-cofilin [D3F9] 1:10,000, Cell Signaling Technology, #5175) in blocking buffer was performed overnight at 4°C. Next day, membranes were washed three times in 0.1% PBST and incubated with secondary antibodies (donkey Anti-guinea pig-Alexa-488 and/or donkey Anti-rabbit-Alexa-647; 1:1000; Jackson Immuno Research) in blocking solution for 2 hr at room temperature before another three 15 min washes in 0.1% PBST. Membranes were briefly rinsed in PBS before imaging using the Bio-Rad ChemiDoc MP imaging system. Band densities (bands at ca. 75–100 kD for D1R or D2R) were quantified using Image J (NIH), normalized to cofilin (loading control; ca. 19 kD), and are expressed as relative intensities. Antibodies for D1 and D2 were used in recent reports and showed decreases in D1R/D2R after CRISPR-mediated knockouts (Cui et al., 2020) and showed lack of D1R signal by immunohistochemistry in D1R-KO mice (Nagatomo et al., 2017). In addition, the D1R antibody was tested in D1-Cre mice hypomorhpic for D1R and showed considerably less bands at the 70–100 kD range (data not shown).

Cannula-guided drug injections

Mice were implanted with bilateral cannula (Plastics One) targeting the medial DS ([A], 0.5; [M],±1.5; [V], 3.25) under stereotaxic guidance and were given 1 week of recovery prior to CIE procedures. For rotation behaviors, mice received a unilateral injection (left hemisphere) of 300 nL of SKF81297 (13.49 mM) at a rate of 100 nL/min 20 min prior to testing. For CB1 receptor antagonist effects on devaluation, 300 nL of an 8 µM solution of SR141716A made up of 75% DMSO and 25% saline was injected into each hemisphere 30 min prior to prefeeding for outcome devaluation testing.

Statistical analysis

Statistical significance was defined as an alpha of p≤0.05. Statistical analysis was performed using GraphPad Prism 6 (GraphPad Software) and MatLab (Mathworks Inc, Natick, MA). To determine whether there were significant differences in photometry data between groups, permutation testing was performed (Jean-Richard-Dit-Bressel et al., 2020; Pascoli et al., 2018) in which trials of event-related activity from each treatment group were combined and randomly partitioned 1000 times to compute Z-score means and standard deviations across 50 ms bins. Analysis was limited to −2 to 2 s compared to the −5 to −2 s baseline period. To account for the potential false-discovery rate associated with doing a high number of tests, we applied the Benjamini and Hochberg false discovery rate correction (Benjamini and Hochberg, 1995). We also put a significance threshold that required at least four consecutive time points meet the threshold of p<0.05. Behavioral and electrophysiology data were analyzed using one-way ANOVA, repeated-measures ANOVA with pre-planned Bonferroni-corrected post hoc analyses or a one- or two-tailed t-test.

Funding Statement

The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.

Contributor Information

Christina M Gremel, Email: cgremel@ucsd.edu.

Laura A Bradfield, University of Technology Sydney, Australia.

Kate M Wassum, University of California, Los Angeles, United States.

Funding Information

This paper was supported by the following grants:

  • National Institutes of Health AA026077-01A1 to Christina M Gremel.

  • National Institutes of Health F32 AA026776 to Rafael Renteria.

  • National Science Foundation NSF-GRFP DGE-2038238 to Emily T Baltz.

  • National Science Foundation NSF-GRFP DGE-1650112 to Christian Cazares.

  • National Institutes of Health F31 AA027439 to Drew C Schreiner.

  • National Institutes of Health R01DA036612 to Thomas S Hnasko.

  • National Institutes of Health R01NS106822 to Thomas S Hnasko.

  • National Institutes of Health T01BX003759 to Thomas S Hnasko.

Additional information

Competing interests

No competing interests declared.

Author contributions

Conceptualization, Formal analysis, Funding acquisition, Investigation, Writing - original draft, Writing - review and editing.

Formal analysis, Writing - review and editing.

Investigation, Writing - review and editing.

Investigation, Writing - review and editing.

Formal analysis, Writing - review and editing.

Investigation, Writing - review and editing.

Resources, Methodology, Writing - review and editing.

Conceptualization, Supervision, Funding acquisition, Writing - original draft, Project administration, Writing - review and editing.

Ethics

Animal experimentation: This study was performed in strict accordance with the recommendations in the Guide for the Care and Use of Laboratory Animals of the National Institutes of Health. All experiments were approved by the Institutional Animal Care and Use Committees of the University of California San Diego and experiments were conducted according to NIH guidelines.

Additional files

Transparent reporting form

Data availability

All data generated or analyzed during this study are included in the manuscript and supporting files. Source data for all figures has been provided.

References

  1. Abrahao KP, Salinas AG, Lovinger DM. Alcohol and the brain: neuronal molecular targets, synapses, and circuits. Neuron. 2017;96:1223–1238. doi: 10.1016/j.neuron.2017.10.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Adermark L, Talani G, Lovinger DM. Endocannabinoid-dependent plasticity at GABAergic and glutamatergic synapses in the striatum is regulated by synaptic activity. European Journal of Neuroscience. 2009;29:32–41. doi: 10.1111/j.1460-9568.2008.06551.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Adermark L, Lovinger DM. Frequency-dependent inversion of net striatal output by endocannabinoid-dependent plasticity at different synaptic inputs. Journal of Neuroscience. 2009;29:1375–1380. doi: 10.1523/JNEUROSCI.3842-08.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Ahmari SE, Spellman T, Douglass NL, Kheirbek MA, Simpson HB, Deisseroth K, Gordon JA, Hen R. Repeated cortico-striatal stimulation generates persistent OCD-like behavior. Science. 2013;340:1234–1239. doi: 10.1126/science.1234733. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. American Psychiatric Association . Diagnostic and Statistical Manual of Mental Disorders (DSM-5) The American Psychiatric Association; 2013. [Google Scholar]
  6. André VM, Cepeda C, Cummings DM, Jocoy EL, Fisher YE, William Yang X, Levine MS. Dopamine modulation of excitatory currents in the striatum is dictated by the expression of D1 or D2 receptors and modified by endocannabinoids. European Journal of Neuroscience. 2010;31:14–28. doi: 10.1111/j.1460-9568.2009.07047.x. [DOI] [PubMed] [Google Scholar]
  7. Becker HC. Positive relationship between the number of prior ethanol withdrawal episodes and the severity of subsequent withdrawal seizures. Psychopharmacology. 1994;116:26–32. doi: 10.1007/BF02244867. [DOI] [PubMed] [Google Scholar]
  8. Becker HC, Lopez MF. Increased ethanol drinking after repeated chronic ethanol exposure and withdrawal experience in C57BL/6 mice. Alcoholism: Clinical & Experimental Research. 2004;28:1829–1838. doi: 10.1097/01.ALC.0000149977.95306.3A. [DOI] [PubMed] [Google Scholar]
  9. Belin D, Belin-Rauscent A, Murray JE, Everitt BJ. Addiction: failure of control over maladaptive incentive habits. Current Opinion in Neurobiology. 2013;23:564–572. doi: 10.1016/j.conb.2013.01.025. [DOI] [PubMed] [Google Scholar]
  10. Benjamini Y, Hochberg Y. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society: Series B. 1995;57:289–300. doi: 10.1111/j.2517-6161.1995.tb02031.x. [DOI] [Google Scholar]
  11. Boettiger CA, Mitchell JM, Tavares VC, Robertson M, Joslyn G, D'Esposito M, Fields HL. Immediate reward Bias in humans: fronto-parietal networks and a role for the catechol-O-methyltransferase 158(Val/Val) genotype. Journal of Neuroscience. 2007;27:14383–14391. doi: 10.1523/JNEUROSCI.2551-07.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Castillo PE, Younts TJ, Chávez AE, Hashimotodani Y. Endocannabinoid signaling and synaptic function. Neuron. 2012;76:70–81. doi: 10.1016/j.neuron.2012.09.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Catafau AM, Etcheberrigaray A, Perez de los Cobos J, Estorch M, Guardia J, Flotats A, Bernà L, Marí C, Casas M, Carrió I. Regional cerebral blood flow changes in chronic alcoholic patients induced by naltrexone challenge during detoxification. Journal of Nuclear Medicine : Official Publication, Society of Nuclear Medicine. 1999;40:19–24. [PubMed] [Google Scholar]
  14. Christian DT, Alexander NJ, Diaz MR, Robinson S, McCool BA. Chronic intermittent ethanol and withdrawal differentially modulate basolateral amygdala AMPA-type glutamate receptor function and trafficking. Neuropharmacology. 2012;62:2430–2439. doi: 10.1016/j.neuropharm.2012.02.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Corbit LH, Nie H, Janak PH. Habitual alcohol seeking: time course and the contribution of subregions of the dorsal striatum. Biological Psychiatry. 2012;72:389–395. doi: 10.1016/j.biopsych.2012.02.024. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Covey DP, Mateo Y, Sulzer D, Cheer JF, Lovinger DM. Endocannabinoid modulation of dopamine neurotransmission. Neuropharmacology. 2017;124:52–61. doi: 10.1016/j.neuropharm.2017.04.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Cui W, Aida T, Ito H, Kobayashi K, Wada Y, Kato S, Nakano T, Zhu M, Isa K, Kobayashi K, Isa T, Tanaka K, Aizawa H. Dopaminergic signaling in the nucleus accumbens modulates Stress-Coping strategies during inescapable stress. The Journal of Neuroscience. 2020;40:7241–7254. doi: 10.1523/JNEUROSCI.0444-20.2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Cuzon Carlson VC, Seabold GK, Helms CM, Garg N, Odagiri M, Rau AR, Daunais J, Alvarez VA, Lovinger DM, Grant KA. Synaptic and morphological neuroadaptations in the putamen associated with long-term, relapsing alcohol drinking in primates. Neuropsychopharmacology. 2011;36:2513–2528. doi: 10.1038/npp.2011.140. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. DePoy L, Daut R, Brigman JL, MacPherson K, Crowley N, Gunduz-Cinar O, Pickens CL, Cinar R, Saksida LM, Kunos G, Lovinger DM, Bussey TJ, Camp MC, Holmes A. Chronic alcohol produces neuroadaptations to prime dorsal striatal learning. PNAS. 2013;110:14783–14788. doi: 10.1073/pnas.1308198110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Diana MA, Marty A. Endocannabinoid-mediated short-term synaptic plasticity: depolarization-induced suppression of inhibition (DSI) and depolarization-induced suppression of excitation (DSE) British Journal of Pharmacology. 2004;142:9–19. doi: 10.1038/sj.bjp.0705726. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Dickinson A, Nicholas DJ, Adams CD. The effect of the instrumental training contingency on susceptibility to reinforcer devaluation. The Quarterly Journal of Experimental Psychology Section B. 1983;35:35–51. doi: 10.1080/14640748308400912. [DOI] [Google Scholar]
  22. Dickinson A, Wood N, Smith JW. Alcohol seeking by rats: action or habit? The Quarterly Journal of Experimental Psychology Section B. 2002;55:331–348. doi: 10.1080/0272499024400016. [DOI] [PubMed] [Google Scholar]
  23. Duka T, Trick L, Nikolaou K, Gray MA, Kempton MJ, Williams H, Williams SC, Critchley HD, Stephens DN. Unique brain Areas associated with abstinence control are damaged in multiply detoxified alcoholics. Biological Psychiatry. 2011;70:545–552. doi: 10.1016/j.biopsych.2011.04.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Ernst LH, Plichta MM, Dresler T, Zesewitz AK, Tupak SV, Haeussinger FB, Fischer M, Polak T, Fallgatter AJ, Ehlis AC. Prefrontal correlates of approach preferences for alcohol stimuli in alcohol dependence. Addiction Biology. 2014;19:497–508. doi: 10.1111/adb.12005. [DOI] [PubMed] [Google Scholar]
  25. Everitt BJ, Robbins TW. Neural systems of reinforcement for drug addiction: from actions to habits to compulsion. Nature Neuroscience. 2005;8:1481–1489. doi: 10.1038/nn1579. [DOI] [PubMed] [Google Scholar]
  26. Everitt BJ, Robbins TW. Drug addiction: updating actions to habits to compulsions ten years on. Annual Review of Psychology. 2016;67:23–50. doi: 10.1146/annurev-psych-122414-033457. [DOI] [PubMed] [Google Scholar]
  27. Fellows LK. The role of orbitofrontal cortex in decision making: a component process account. Annals of the New York Academy of Sciences. 2007;1121:421–430. doi: 10.1196/annals.1401.023. [DOI] [PubMed] [Google Scholar]
  28. Forbes EE, Rodriguez EE, Musselman S, Narendran R. Prefrontal response and frontostriatal functional connectivity to monetary reward in abstinent alcohol-dependent young adults. PLOS ONE. 2014;9:e94640. doi: 10.1371/journal.pone.0094640. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Gerdeman GL, Ronesi J, Lovinger DM. Postsynaptic endocannabinoid release is critical to long-term depression in the striatum. Nature Neuroscience. 2002;5:446–451. doi: 10.1038/nn832. [DOI] [PubMed] [Google Scholar]
  30. Gianessi CA, Groman SM, Thompson SL, Jiang M, van der Stelt M, Taylor JR. Endocannabinoid contributions to alcohol habits and motivation: relevance to treatment. Addiction Biology. 2020;25:e12768. doi: 10.1111/adb.12768. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Giuffrida A, Parsons LH, Kerr TM, Rodríguez de Fonseca F, Navarro M, Piomelli D. Dopamine activation of endogenous cannabinoid signaling in dorsal striatum. Nature Neuroscience. 1999;2:358–363. doi: 10.1038/7268. [DOI] [PubMed] [Google Scholar]
  32. Glick SD, Jerussi TP, Fleisher LN. Turning in circles: the neuropharmacology of rotation. Life Sciences. 1976;18:889–896. doi: 10.1016/0024-3205(76)90405-7. [DOI] [PubMed] [Google Scholar]
  33. Gremel CM, Chancey JH, Atwood BK, Luo G, Neve R, Ramakrishnan C, Deisseroth K, Lovinger DM, Costa RM. Endocannabinoid modulation of orbitostriatal circuits gates habit formation. Neuron. 2016;90:1312–1324. doi: 10.1016/j.neuron.2016.04.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Gremel CM, Costa RM. Orbitofrontal and striatal circuits dynamically encode the shift between goal-directed and habitual actions. Nature Communications. 2013;4:2264. doi: 10.1038/ncomms3264. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Gremel CM, Lovinger DM. Associative and sensorimotor cortico-basal ganglia circuit roles in effects of abused drugs. Genes, Brain and Behavior. 2017;16:71–85. doi: 10.1111/gbb.12309. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Griffin WC, Lopez MF, Becker HC. Intensity and duration of chronic ethanol exposure is critical for subsequent escalation of voluntary ethanol drinking in mice. Alcoholism: Clinical and Experimental Research. 2009;33:1893–1900. doi: 10.1111/j.1530-0277.2009.01027.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Heifets BD, Castillo PE. Endocannabinoid signaling and long-term synaptic plasticity. Annual Review of Physiology. 2009;71:283–306. doi: 10.1146/annurev.physiol.010908.163149. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Heilig M, Egli M, Crabbe JC, Becker HC. Acute withdrawal, protracted abstinence and negative affect in alcoholism: are they linked? Addiction Biology. 2010;15:169–184. doi: 10.1111/j.1369-1600.2009.00194.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Henderson-Redmond AN, Guindon J, Morgan DJ. Roles for the endocannabinoid system in ethanol-motivated behavior. Progress in Neuro-Psychopharmacology and Biological Psychiatry. 2016;65:330–339. doi: 10.1016/j.pnpbp.2015.06.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Hermann D, Smolka MN, Wrase J, Klein S, Nikitopoulos J, Georgi A, Braus DF, Flor H, Mann K, Heinz A. Blockade of cue-induced brain activation of abstinent alcoholics by a single administration of amisulpride as measured with fMRI. Alcoholism: Clinical and Experimental Research. 2006;30:1349–1354. doi: 10.1111/j.1530-0277.2006.00174.x. [DOI] [PubMed] [Google Scholar]
  41. Hernández-López S, Bargas J, Surmeier DJ, Reyes A, Galarraga E. D1 receptor activation enhances evoked discharge in neostriatal medium spiny neurons by modulating an L-type Ca2+ conductance. The Journal of Neuroscience. 1997;17:3334–3342. doi: 10.1523/JNEUROSCI.17-09-03334.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Hilário MR, Clouse E, Yin HH, Costa RM. Endocannabinoid signaling is critical for habit formation. Frontiers in Integrative Neuroscience. 2007;1:6. doi: 10.3389/neuro.07.006.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Hirth N, Meinhardt MW, Noori HR, Salgado H, Torres-Ramirez O, Uhrig S, Broccoli L, Vengeliene V, Roßmanith M, Perreau-Lenz S, Köhr G, Sommer WH, Spanagel R, Hansson AC. Convergent evidence from alcohol-dependent humans and rats for a hyperdopaminergic state in protracted abstinence. PNAS. 2016;113:3024–3029. doi: 10.1073/pnas.1506012113. [DOI] [PMC free article] [PubMed] [Google Scholar]
  44. Hogarth L, Balleine BW, Corbit LH, Killcross S. Associative learning mechanisms underpinning the transition from recreational drug use to addiction. Annals of the New York Academy of Sciences. 2013;1282:12–24. doi: 10.1111/j.1749-6632.2012.06768.x. [DOI] [PubMed] [Google Scholar]
  45. Hu SS, Mackie K. Distribution of the endocannabinoid system in the central nervous system. In: Pertwee R, editor. Handbook of Experimental Pharmacology. Springer; 2015. pp. 59–93. [DOI] [PubMed] [Google Scholar]
  46. Jean-Richard-Dit-Bressel P, Clifford CWG, McNally GP. Analyzing Event-Related transients: confidence intervals, permutation tests, and consecutive thresholds. Frontiers in Molecular Neuroscience. 2020;13:14. doi: 10.3389/fnmol.2020.00014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Kano M, Ohno-Shosaku T, Hashimotodani Y, Uchigashima M, Watanabe M. Endocannabinoid-mediated control of synaptic transmission. Physiological Reviews. 2009;89:309–380. doi: 10.1152/physrev.00019.2008. [DOI] [PubMed] [Google Scholar]
  48. Kreitzer AC, Malenka RC. Dopamine modulation of state-dependent endocannabinoid release and long-term depression in the striatum. Journal of Neuroscience. 2005;25:10537–10545. doi: 10.1523/JNEUROSCI.2959-05.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Kreitzer AC, Malenka RC. Endocannabinoid-mediated rescue of striatal LTD and motor deficits in Parkinson's disease models. Nature. 2007;445:643–647. doi: 10.1038/nature05506. [DOI] [PubMed] [Google Scholar]
  50. Kreitzer AC, Regehr WG. Retrograde inhibition of presynaptic calcium influx by endogenous cannabinoids at excitatory synapses onto purkinje cells. Neuron. 2001;29:717–727. doi: 10.1016/S0896-6273(01)00246-X. [DOI] [PubMed] [Google Scholar]
  51. Lin JY. A user's guide to channelrhodopsin variants: features, limitations and future developments. Experimental Physiology. 2011;96:19–25. doi: 10.1113/expphysiol.2009.051961. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Lopes G, Bonacchi N, Frazão J, Neto JP, Atallah BV, Soares S, Moreira L, Matias S, Itskov PM, Correia PA, Medina RE, Calcaterra L, Dreosti E, Paton JJ, Kampff AR. Bonsai: an event-based framework for processing and controlling data streams. Frontiers in Neuroinformatics. 2015;9:7. doi: 10.3389/fninf.2015.00007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Lopez MF, Becker HC. Effect of pattern and number of chronic ethanol exposures on subsequent voluntary ethanol intake in C57BL/6J mice. Psychopharmacology. 2005;181:688–696. doi: 10.1007/s00213-005-0026-3. [DOI] [PubMed] [Google Scholar]
  54. Lovinger DM. Presynaptic modulation by endocannabinoids. In: Südhof TC, editor. Handbook of Experimental Pharmacology. Springer; 2008. pp. 435–477. [DOI] [PubMed] [Google Scholar]
  55. Lovinger DM, Gremel CM. A Circuit-Based information approach to substance abuse research. Trends in Neurosciences. 2021;44:122–135. doi: 10.1016/j.tins.2020.10.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Lovinger DM, Roberto M. Synaptic effects induced by alcohol. Current Topics in Behavioral Neurosciences. 2013;13:31–86. doi: 10.1007/7854_2011_143. [DOI] [PMC free article] [PubMed] [Google Scholar]
  57. Lüscher C, Robbins TW, Everitt BJ. The transition to compulsion in addiction. Nature Reviews Neuroscience. 2020;21:247–263. doi: 10.1038/s41583-020-0289-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Markowitz JE, Gillis WF, Beron CC, Neufeld SQ, Robertson K, Bhagat ND, Peterson RE, Peterson E, Hyun M, Linderman SW, Sabatini BL, Datta SR. The striatum organizes 3D behavior via Moment-to-Moment action selection. Cell. 2018;174:44–58. doi: 10.1016/j.cell.2018.04.019. [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Martín AB, Fernandez-Espejo E, Ferrer B, Gorriti MA, Bilbao A, Navarro M, Rodriguez de Fonseca F, Moratalla R. Expression and function of CB1 receptor in the rat striatum: localization and effects on D1 and D2 dopamine receptor-mediated motor behaviors. Neuropsychopharmacology. 2008;33:1667–1679. doi: 10.1038/sj.npp.1301558. [DOI] [PubMed] [Google Scholar]
  60. Mateo Y, Johnson KA, Covey DP, Atwood BK, Wang HL, Zhang S, Gildish I, Cachope R, Bellocchio L, Guzmán M, Morales M, Cheer JF, Lovinger DM. Endocannabinoid actions on cortical terminals orchestrate local modulation of dopamine release in the nucleus accumbens. Neuron. 2017;96:1112–1126. doi: 10.1016/j.neuron.2017.11.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Milad MR, Rauch SL. Obsessive-compulsive disorder: beyond segregated cortico-striatal pathways. Trends in Cognitive Sciences. 2012;16:43–51. doi: 10.1016/j.tics.2011.11.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Moorman DE. The role of the orbitofrontal cortex in alcohol use, abuse, and dependence. Progress in Neuro-Psychopharmacology and Biological Psychiatry. 2018;87:85–107. doi: 10.1016/j.pnpbp.2018.01.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Nagatomo K, Suga S, Saitoh M, Kogawa M, Kobayashi K, Yamamoto Y, Yamada K. Dopamine D1 receptor immunoreactivity on fine processes of GFAP-Positive astrocytes in the substantia nigra pars reticulata of adult mouse. Frontiers in Neuroanatomy. 2017;11:3. doi: 10.3389/fnana.2017.00003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Nazzaro C, Greco B, Cerovic M, Baxter P, Rubino T, Trusel M, Parolaro D, Tkatch T, Benfenati F, Pedarzani P, Tonini R. SK channel modulation rescues striatal plasticity and control over habit in cannabinoid tolerance. Nature Neuroscience. 2012;15:284–293. doi: 10.1038/nn.3022. [DOI] [PubMed] [Google Scholar]
  65. Nimitvilai S, Lopez MF, Mulholland PJ, Woodward JJ. Chronic intermittent ethanol exposure enhances the excitability and synaptic plasticity of lateral orbitofrontal cortex neurons and induces a tolerance to the acute inhibitory actions of ethanol. Neuropsychopharmacology. 2016;41:1112–1127. doi: 10.1038/npp.2015.250. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Nimitvilai S, Uys JD, Woodward JJ, Randall PK, Ball LE, Williams RW, Jones BC, Lu L, Grant KA, Mulholland PJ. Orbitofrontal neuroadaptations and Cross-Species synaptic biomarkers in Heavy-Drinking macaques. The Journal of Neuroscience. 2017;37:3646–3660. doi: 10.1523/JNEUROSCI.0133-17.2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Parsons LH, Hurd YL. Endocannabinoid signalling in reward and addiction. Nature Reviews Neuroscience. 2015;16:579–594. doi: 10.1038/nrn4004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Pascoli V, Terrier J, Hiver A, Lüscher C. Sufficiency of mesolimbic dopamine neuron stimulation for the progression to addiction. Neuron. 2015;88:1054–1066. doi: 10.1016/j.neuron.2015.10.017. [DOI] [PubMed] [Google Scholar]
  69. Pascoli V, Hiver A, Van Zessen R, Loureiro M, Achargui R, Harada M, Flakowski J, Lüscher C. Stochastic synaptic plasticity underlying compulsion in a model of addiction. Nature. 2018;564:366–371. doi: 10.1038/s41586-018-0789-4. [DOI] [PubMed] [Google Scholar]
  70. Patel S, Rademacher DJ, Hillard CJ. Differential regulation of the endocannabinoids anandamide and 2-arachidonylglycerol within the limbic forebrain by dopamine receptor activity. Journal of Pharmacology and Experimental Therapeutics. 2003;306:880–888. doi: 10.1124/jpet.103.054270. [DOI] [PubMed] [Google Scholar]
  71. Pauls DL, Abramovitch A, Rauch SL, Geller DA. Obsessive-compulsive disorder: an integrative genetic and neurobiological perspective. Nature Reviews Neuroscience. 2014;15:410–424. doi: 10.1038/nrn3746. [DOI] [PubMed] [Google Scholar]
  72. Pava MJ, Woodward JJ. A review of the interactions between alcohol and the endocannabinoid system: implications for alcohol dependence and future directions for research. Alcohol. 2012;46:185–204. doi: 10.1016/j.alcohol.2012.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Planert H, Berger TK, Silberberg G. Membrane properties of striatal direct and indirect pathway neurons in mouse and rat slices and their modulation by dopamine. PLOS ONE. 2013;8:e57054. doi: 10.1371/journal.pone.0057054. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Reinhard I, Leménager T, Fauth-Bühler M, Hermann D, Hoffmann S, Heinz A, Kiefer F, Smolka MN, Wellek S, Mann K, Vollstädt-Klein S. A comparison of region-of-interest measures for extracting whole brain data using survival analysis in alcoholism as an example. Journal of Neuroscience Methods. 2015;242:58–64. doi: 10.1016/j.jneumeth.2015.01.001. [DOI] [PubMed] [Google Scholar]
  75. Renteria R, Baltz ET, Gremel CM. Chronic alcohol exposure disrupts top-down control over basal ganglia action selection to produce habits. Nature Communications. 2018;9:211. doi: 10.1038/s41467-017-02615-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Renteria R, Cazares C, Gremel CM. Habitual ethanol seeking and licking microstructure of enhanced ethanol Self-Administration in Ethanol-Dependent mice. Alcoholism: Clinical and Experimental Research. 2020;44:880–891. doi: 10.1111/acer.14302. [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Renteria R. Renteria-Et-Al.-2021-eLife. 1Github. 2021a https://github.com/gremellab/Renteria-et-al.-2021-eLife
  78. Renteria R. Renteria-et-al.-2021-eLife. swh:1:rev:1dc7163f31ac46ddfa75d6a9c9fb8841f25500a2Software Heritage. 2021b https://archive.softwareheritage.org/swh:1:dir:497f63dc5154376ebb3ab14a22aa76be24394a9b;origin=https://github.com/gremellab/Renteria-et-al.-2021-eLife;visit=swh:1:snp:4fa087dd4c917498fd1e7cc0b6d9d2aac0896979;anchor=swh:1:rev:1dc7163f31ac46ddfa75d6a9c9fb8841f25500a2/
  79. Rex EB, Rankin ML, Ariano MA, Sibley DR. Ethanol regulation of D(1) dopamine receptor signaling is mediated by protein kinase C in an isozyme-specific manner. Neuropsychopharmacology. 2008;33:2900–2911. doi: 10.1038/npp.2008.16. [DOI] [PubMed] [Google Scholar]
  80. Robbins TW, Vaghi MM, Banca P. Obsessive-Compulsive disorder: puzzles and prospects. Neuron. 2019;102:27–47. doi: 10.1016/j.neuron.2019.01.046. [DOI] [PubMed] [Google Scholar]
  81. Roberto M, Varodayan FP. Synaptic targets: chronic alcohol actions. Neuropharmacology. 2017;122:85–99. doi: 10.1016/j.neuropharm.2017.01.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Ron D, Barak S. Molecular mechanisms underlying alcohol-drinking behaviours. Nature Reviews Neuroscience. 2016;17:576–591. doi: 10.1038/nrn.2016.85. [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Ron D, Messing RO. Signaling pathways mediating alcohol effects. Current Topics in Behavioral Neurosciences. 2013;13:87–126. doi: 10.1007/7854_2011_161. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Schoenbaum G, Roesch MR, Stalnaker TA. Orbitofrontal cortex, decision-making and drug addiction. Trends in Neurosciences. 2006;29:116–124. doi: 10.1016/j.tins.2005.12.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Schoenbaum G, Chang CY, Lucantonio F, Takahashi YK. Thinking outside the box: orbitofrontal cortex, imagination, and how we can treat addiction. Neuropsychopharmacology. 2016;41:2966–2976. doi: 10.1038/npp.2016.147. [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Shen W, Flajolet M, Greengard P, Surmeier DJ. Dichotomous dopaminergic control of striatal synaptic plasticity. Science. 2008;321:848–851. doi: 10.1126/science.1160575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Shuen JA, Chen M, Gloss B, Calakos N. Drd1a-tdTomato BAC transgenic mice for simultaneous visualization of medium spiny neurons in the direct and indirect pathways of the basal ganglia. Journal of Neuroscience. 2008;28:2681–2685. doi: 10.1523/JNEUROSCI.5492-07.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Stalnaker TA, Cooch NK, Schoenbaum G. What the orbitofrontal cortex does not do. Nature Neuroscience. 2015;18:620–627. doi: 10.1038/nn.3982. [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Surmeier DJ, Ding J, Day M, Wang Z, Shen W. D1 and D2 dopamine-receptor modulation of striatal glutamatergic signaling in striatal medium spiny neurons. Trends in Neurosciences. 2007;30:228–235. doi: 10.1016/j.tins.2007.03.008. [DOI] [PubMed] [Google Scholar]
  90. Tapert SF, Cheung EH, Brown GG, Frank LR, Paulus MP, Schweinsburg AD, Meloy MJ, Brown SA. Neural response to alcohol stimuli in adolescents with alcohol use disorder. Archives of General Psychiatry. 2003;60:727–735. doi: 10.1001/archpsyc.60.7.727. [DOI] [PubMed] [Google Scholar]
  91. Trusel M, Cavaccini A, Gritti M, Greco B, Saintot PP, Nazzaro C, Cerovic M, Morella I, Brambilla R, Tonini R. Coordinated regulation of synaptic plasticity at Striatopallidal and striatonigral neurons orchestrates motor control. Cell Reports. 2015;13:1353–1365. doi: 10.1016/j.celrep.2015.10.009. [DOI] [PubMed] [Google Scholar]
  92. Tye KM, Prakash R, Kim SY, Fenno LE, Grosenick L, Zarabi H, Thompson KR, Gradinaru V, Ramakrishnan C, Deisseroth K. Amygdala circuitry mediating reversible and bidirectional control of anxiety. Nature. 2011;471:358–362. doi: 10.1038/nature09820. [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Volkow ND, Wang GJ, Fowler JS. Imaging studies of cocaine in the human brain and studies of the cocaine addict. Annals of the New York Academy of Sciences. 1997;820:41–55. doi: 10.1111/j.1749-6632.1997.tb46188.x. [DOI] [PubMed] [Google Scholar]
  94. Volkow ND, Fowler JS. Brain-imaging studies of the combined use of cocaine and alcohol and of the pharmacokinetics of cocaethylene. NIDA Research Monograph. 1994;138:41–56. [PubMed] [Google Scholar]
  95. Wall NR, De La Parra M, Callaway EM, Kreitzer AC. Differential innervation of direct- and indirect-pathway striatal projection neurons. Neuron. 2013;79:347–360. doi: 10.1016/j.neuron.2013.05.014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Wallis JD. Orbitofrontal cortex and its contribution to decision-making. Annual Review of Neuroscience. 2007;30:31–56. doi: 10.1146/annurev.neuro.30.051606.094334. [DOI] [PubMed] [Google Scholar]
  97. Wu YW, Kim JI, Tawfik VL, Lalchandani RR, Scherrer G, Ding JB. Input- and cell-type-specific endocannabinoid-dependent LTD in the striatum. Cell Reports. 2015;10:75–87. doi: 10.1016/j.celrep.2014.12.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  98. Yin HH, Davis MI, Ronesi JA, Lovinger DM. The role of protein synthesis in striatal long-term depression. Journal of Neuroscience. 2006;26:11811–11820. doi: 10.1523/JNEUROSCI.3196-06.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]

Decision letter

Editor: Laura A Bradfield1
Reviewed by: Laura A Bradfield2

In the interests of transparency, eLife publishes the most substantive revision requests and the accompanying author responses.

Acceptance summary:

This study examines the mechanisms underlying withdrawal from alcohol dependence and the resulting changes in value-based decision-making, with particular focus on changes to orbitofrontal cortical-dorsal striatal circuitry in the brain. Using fiber photometry, electrophysiology, operant behavior, optogenetics, and pharmacology, the results reveal that these changes arise specifically within spiny projection neurons that express dopamine-type 1 receptors in the striatum in a manner that is mediated by endocannabinoid signaling. Blocking the activity of endocannabinoid receptors was sufficient to rescue value-based decision-making in the alcohol-exposed mice. Overall this work reveals novel information about the mechanisms that lead to cognitive and behavioral dysfunction following withdrawal from alcohol exposure and is likely to be of interest to those interested in alcohol and substance use disorder, reward-seeking behavior, cortico-striatal circuitry, and/or cannabinoid function.

Decision letter after peer review:

[Editors’ note: the authors submitted for reconsideration following the decision after peer review. What follows is the decision letter after the first round of review.]

Thank you for submitting your work entitled "Mechanism for differential recruitment of orbitostriatal transmission during outcomes and actions in alcohol dependence" for consideration by eLife. Your article has been reviewed by three peer reviewers, including Laura A Bradfield as the Reviewing Editor and Reviewer #1, and the evaluation has been overseen by a Senior Editor.

Our decision has been reached after consultation between the reviewers. Based on these discussions and the individual reviews below, we regret to inform you that your work will currently not be considered for publication in eLife. Among other concerns, the reviewers shared considerable concern about the robustness of the photometry data and the long time period over which electrophysiological recordings took place. However, we do feel that there is merit to the manuscript, and would be willing to reconsider a revised version of the manuscript as a new submission if it thoroughly and adequately addresses each reviewer concern as well as the specific points outlined below. Please note that failure to sufficiently address any of the following points upon resubmission will result in rejection without further peer review, so your manuscript may be promptly submitted elsewhere. We certainly understand that you may wish at this point to submit your manuscript elsewhere.

Essential revisions:

1) Please revise language regarding “addiction” and “alcohol dependence” of rats in line with the requests of the reviewers below.

2) The fiber photometry data needs revision and the addition of further analyses. In particular, the issues that must be addressed are:

a) Owing to the concerns described by reviewer 3 and shared by all reviewers, we think it is important to include and discuss the subject averaged data. Please include the subject-averaged data in the main manuscript, along with appropriate quantification, analysis, and Discussion.

b) Please extend the traces to include baseline data for the per trial and per mouse data.

c) Please include a clearer description of how the photometry data were processed in accordance with the requests of reviewer 2.

d) Please include a clearer description of normalisation procedures that were or were not employed.

3) Additional analyses of why fiber photometry activity increased prior to head entry is necessary. Specifically, a comparison of this activity on trials that were unrewarded with trials that were rewarded is necessary to demonstrate that the activity is specific to reward delivery, as claimed by the authors. The authors may wish also to consider whether the sound of the pellet dispenser influenced this signal and/or whether differential behavioral patterns between groups contributed to the discrepant signal.

4) There are still some remaining issues with the length of time over which recordings were taken. To address this, two actions must be taken:

a) In the rebuttal letter, the authors refer to a supplemental figure in their previous publication that demonstrates a lack of evidence for changes over time, please insert a reference to this figure into the manuscript also so that the reader can find it if they wish to do so.

b) Because it is possible that there are differences that occur over time that the current and previous studies may not have detected, please insert a statement into the Discussion noting this possibility, and that future studies may uncover such differences.

5) To ensure transparency and the robustness of the behavioral data, please include the non-normalised (raw) lever press data for the outcome devaluation experiment shown in Figure 1U in the supplement for this figure.Reviewer #1 (Recommendations for the authors (required)):

Renteria et al., in this manuscript "mechanism for differential recruitment of orbitostriatal transmission during value and action computations in drug dependence" examine the mechanisms underlying alcohol dependence-induced changes in value-based decision-making with particular focus on orbitofrontal cortical-dorsal striatal circuits. First, they demonstrate that pre-exposing mice to a chronic intermittent ethanol procedure (CIE) leads to differential execution of lever press bouts as well as outcome devaluation performance (which is impaired relative to Air control mice). They then show that CIE induces alterations in OFC terminal activity in the DS, which is in opposite directions during lever press bouts versus during head entries after reinforcer deliveries. They spend the rest of the manuscript unpacking these effects, using both in vivo and in vitro methodologies, demonstrating that: (a) CB1 receptor function and eCB plasticity is not affected by CIE, (b) CIE disrupts D1 receptor function, (c) CIE increases eCB signalling, and this appears to be at least partially selective to the OFC input, and (d) CB1 receptor antagonist restores OFC transmission and outcome devaluation performance, indicating a rescue of goal-directed control.

I have reviewed this manuscript previously, and my reaction to it both then and now is positive. It is a nicely written paper describing an interesting and cohesive body of work. For the most part, my initial concerns have been adequately addressed. I will expand on my remaining concerns below:

1) I suggest that the word “addiction” is replaced with the less stigmatising “substance use disorder”.

2) I still have some concerns that the electrophysiological recordings took place over 3-21 days of withdrawal, which is a relatively long period in which a lot can change. However, I take note of, and appreciate, the author's rebuttal that this is a valid replication of a protracted withdrawal period in adults with alcohol use disorder.

3) I am not sure that all of the other reviewer's concerns have been completely addressed. For example, reviewer #1 previously noted that the increased calcium signal during head entry after reinforcer delivery occur prior to the head entry itself. The authors note that the animal can see into the head entry receptacle and may begin to make its approach prior to the head entry itself being detected, therefore, it is difficult to disentangle whether this is a predictive or reactive response. However, this possibility is not mentioned in the manuscript itself and I think it should be.

With regards to point 2 above I think it would be good to refer to the specific supplementary figures of the authors' prior Nature comms paper in this manuscript as well, however, in order to address the concern that OFC transmission to D1 SPNs may change over this period of time. As multiple reviewers shared this concern, it is probably one that will also occur to readers and should therefore be thoroughly addressed.

I further note that in the rebuttal letter it says that “outcome evaluation” has been changed to “reward evaluation” but this appears to have been changed to “outcome delivery” in the manuscript. This is a change I have previously requested in my review and I fully support, but could be a bit misleading for the other reviewers who were not been involved in the process that led to this change.

Finally, I previously asked the authors to give examples of what neurocognitive disorders could be produced by alcohol use and the authors responded in the rebuttal letter. I should have been clearer that I was asking for this example to be inserted into the text of the manuscript.

Reviewer #2:

The manuscript by Renteria et al., is one I previously reviewed for Biological Psychiatry investigating the role of OFC-dorsal striatal circuits in habitual responding in mice following chronic ethanol inhalation exposure (CIE). The authors have made attempts to address the critiques raised by the previous reviewers and have significantly improved the quality of the manuscript. I do, however, have several additional comments for the authors.

1) Abstract and Introduction should explicit state that experiments were done in alcohol exposed mice rather than the current description of "alcohol dependence" as this suggests the studies were done in humans.

2) The inhalation exposure it thought to model alcohol dependence not to be alcohol dependence (e.g., alcohol dependence in humans does not develop from inhalation exposure). I suggest that the authors revise statements such as "we find that dependence selectively…" to "we find that chronic inhalation exposure to alcohol selectively…". Similar to my first point, just saying alcohol dependence is misleading.

3) The authors argue in their response to reviewers that understanding what the increase in fiber photometry activity prior to head entry reflects is difficult to parse apart in their current paradigm which is not a very satisfying response. Although that may be true, I think the authors need to conduct additional analyses to provide some insight into why the increase is occurring before head entry. For example, if the increase is specific to reward can they show traces for head entries that were not rewarded?

4) I'm still having a difficult time understanding how the fiber photometry data were processed. For example, what does "we then estimated baseline fluorescence by calculating the running 10th percentile of the fluorescence signal using a 15 second sliding window" mean? Why was this used to determine which sessions to include or not? Was the signal normalized to the reference channel? If not, please describe why this was not done as it could be useful for readers and others doing fiber photometry analysis.

Reviewer #3:

The reviewers have responded with additional analyses and set changes to the paper. While the manuscript is improved, there are still some issues that remain unresolved.

1) CIE is not a model of alcohol dependence; mice cannot be diagnosed with mental illness.

2) Photometry data: I am still not convinced the use of each trial in analysis Figure 1 N and P for example are appropriate and truly represent independent samples. Moreover, the per mouse data does not always replicate the trial data, contrary to the authors suggestion in the text. The methods indicate "For each session, "trials" were composed of peri-event signals from -5 to 5 seconds around event onset. In each "trial", 50 ms bins were z-scored to the session's mean pre-event baseline (-5 to -2 seconds)." If this is true, the data for the baseline period should be shown in the figures. This is especially important since the Z score changes are very small in magnitude and examining the noise within the -5 to -2 time frame will allow the readers to more clearly see the changes are event related and not simply noise.

3) The time course was not evaluated as suggested. From the scatter plot shown it clearly looks like DHPG LTD is strongest in late withdrawal days 18-21. While the time window is not an issue per se, the lack of insight into the time dependence of these phenomenon becomes a major limitation with so few data points. More samples at different points would allow for linear regression analysis to show any time-dependence effects.

[Editors’ note: further revisions were suggested prior to acceptance, as described below.]

Thank you for resubmitting your work entitled "Mechanism for differential recruitment of orbitostriatal transmission during actions and outcomes post alcohol exposure" for further consideration by eLife. Your revised article has been evaluated by Kate Wassum (Senior Editor) and a Reviewing Editor.

Summary:

This work provides exciting new evidence that chronically exposing mice to alcohol alters their orbitostriatal transmission during the performance of actions and after outcome delivery in decision-making tasks. The authors employ a mixture of techniques including behavioural assessment, fiber photometry, electrophysiological recordings, and pharmacological injections, to comprehensively assess the alterations in activity at orbitostriatal terminals, and to describe the potential mechanism for this disruption as well as how it might be restored. These results provide important insights into the brain mechanisms that underlie disrupted decision-making in alcohol use disorder, a step that is critical for the future development of potential treatments.

The manuscript has been greatly improved and the editors and reviewers appreciate the hard work on the revisions to improve this manuscript. There are a few remaining minor issues that need to be addressed before acceptance, as outlined below:

1) There is some confusion between the "all event trials" and "session average" values (e.g., Figure 1N and 1O). Is the lack of consistency in effect between the "all event trials" and "session average" because a subset of mice had a greater change in dF/F and/or had more bouts than others that when using session averages for each mouse disappear (e.g., all mice were equally represented in the analysis)?

a) To avoid confusion, the following statement in the methods should be moved to or recapitulated in the main text and an explanation of why these measures might differ should be included (e.g., variance within subjects, more events for individual animals, etc) : "We analyzed this photometry data two ways. First, these 675 z-scored ∆F/F0 traces were combined across all mice within a group. This was done to preserve 676 the variance seen within a subject. Second, we averaged these z-scored ∆F/F0 traces for a given 677 animal session, and then averaged these traces across mice within a group. This examines 678 between-mouse variability but does not preserve within-subject variability."

2) In the Results – change alcohol dependence to following CIE or following alcohol inhalation exposure.

3) The antibodies for western blot analysis of D1 and D2 receptors are known to be very problematic (and likely inaccurate measures of protein density). Without confirming the validity of these specific antibodies for assessing D1 and D2 receptor density in mice, including this data in the paper and drawing any conclusions from could be misleading. Since it is not critical to the conclusions, we recommend removing it from the paper.

eLife. 2021 Mar 17;10:e67065. doi: 10.7554/eLife.67065.sa2

Author response


[Editors’ note: the authors resubmitted a revised version of the paper for consideration. What follows is the authors’ response to the first round of review.]

Essential revisions:

Reviewer #1 (Recommendations for the authors (required)):

Renteria et al., in this manuscript "mechanism for differential recruitment of orbitostriatal transmission during value and action computations in drug dependence" examine the mechanisms underlying alcohol dependence-induced changes in value-based decision-making with particular focus on orbitofrontal cortical-dorsal striatal circuits. First, they demonstrate that pre-exposing mice to a chronic intermittent ethanol procedure (CIE) leads to differential execution of lever press bouts as well as outcome devaluation performance (which is impaired relative to Air control mice). They then show that CIE induces alterations in OFC terminal activity in the DS, which is in opposite directions during lever press bouts versus during head entries after reinforcer deliveries. They spend the rest of the manuscript unpacking these effects, using both in vivo and in vitro methodologies, demonstrating that: (a) CB1 receptor function and eCB plasticity is not affected by CIE, (b) CIE disrupts D1 receptor function, (c) CIE increases eCB signalling, and this appears to be at least partially selective to the OFC input, and (d) CB1 receptor antagonist restores OFC transmission and outcome devaluation performance, indicating a rescue of goal-directed control.

I have reviewed this manuscript previously, and my reaction to it both then and now is positive. It is a nicely written paper describing an interesting and cohesive body of work. For the most part, my initial concerns have been adequately addressed. I will expand on my remaining concerns below:

1) I suggest that the word “addiction” is replaced with the less stigmatising “substance use disorder”.

We have changed “Addiction” to “substance use disorder”.

2) I still have some concerns that the electrophysiological recordings took place over 3-21 days of withdrawal, which is a relatively long period in which a lot can change. However, I take note of, and appreciate, the author's rebuttal that this is a valid replication of a protracted withdrawal period in adults with alcohol use disorder.

We thank the reviewer for that recognition. In regards to including reference to past works and limitations of the current time frame we have added the following to the Results:

“We have previously shown that the decrease in OFC transmission to D1 SPNs persists for up to 21 days in withdrawal (see Supplementary Figure 2 in Renteria et al., 2018).”

The following has been added to the Discussion: “Further, future work could investigate the time course of such effects, as the present study was limited to a 3-21 day window in withdrawal.”

3) I am not sure that all of the other reviewer's concerns have been completely addressed. For example, reviewer #1 previously noted that the increased calcium signal during head entry after reinforcer delivery occur prior to the head entry itself. The authors note that the animal can see into the head entry receptacle and may begin to make its approach prior to the head entry itself being detected, therefore, it is difficult to disentangle whether this is a predictive or reactive response. However, this possibility is not mentioned in the manuscript itself and I think it should be.

We have added the following to the Results presentation as well as to the Discussion:

In the Results, we have added the following: “Since the sound associated with pellet delivery could potentially contribute to activity modulation, we examined OFC-DS terminal activity in relation to outcome delivery (independent of any checking behavior). Again, we found greater modulation within 1 sec of outcome delivery in Air than CIE mice (Figure 1—figure supplement 1D, H). Further, the increase in OFC-DS activity in Air, and to a lesser extent CIE mice, was selective to head entries following outcome delivery when compared to non-rewarded head entries (Figure 1—figure supplement 1E, F). Together, these findings suggest that CIE exposure may lead to slightly enhanced OFC-DS terminal activity associated with actions, but substantially less OFCDS terminal activity during the period associated with outcome perception and/or retrieval. “

In the Discussion, we have added the following: “Interestingly, we found that OFC-DS terminal activity in Air controls, but not CIE mice, ramps up prior to head entries made after outcome delivery (Figure 1P, Q) and following outcome delivery itself (Figure 1—figure supplement 1). It is difficult to disentangle whether this is a predictive response, as mice can readily see into the head entry receptacle within the time frame where this rise occurs. It does suggest future works examining whether OFC-DS activity carries predictive information are warranted.”

With regards to point 2 above I think it would be good to refer to the specific supplementary figures of the authors' prior Nature comms paper in this manuscript as well, however, in order to address the concern that OFC transmission to D1 SPNs may change over this period of time. As multiple reviewers shared this concern, it is probably one that will also occur to readers and should therefore be thoroughly addressed.

The reference has been added, with reference to the specific supplementary figure.

I further note that in the rebuttal letter it says that “outcome evaluation” has been changed to “reward evaluation” but this appears to have been changed to “outcome delivery” in the manuscript. This is a change I have previously requested in my review and I fully support, but could be a bit misleading for the other reviewers who were not been involved in the process that led to this change.

To confirm, we have changed it to outcome delivery.

Finally, I previously asked the authors to give examples of what neurocognitive disorders could be produced by alcohol use and the authors responded in the rebuttal letter. I should have been clearer that I was asking for this example to be inserted into the text of the manuscript.

Our apologies, the text has been changed to: “The DSM-5 categorizes alcohol as one of the few drugs whose dependence can produce neurocognitive disorders such as alcohol-related dementia”.

Reviewer #2:

The manuscript by Renteria et al., is one I previously reviewed for Biological Psychiatry investigating the role of OFC-dorsal striatal circuits in habitual responding in mice following chronic ethanol inhalation exposure (CIE). The authors have made attempts to address the critiques raised by the previous reviewers and have significantly improved the quality of the manuscript. I do, however, have several additional comments for the authors.

1) Abstract and Introduction should explicit state that experiments were done in alcohol exposed mice rather than the current description of "alcohol dependence" as this suggests the studies were done in humans.

We have changed the text to describe mice that have undergone CIE procedures as “chronic alcohol exposed mice” or “CIE mice” In addition, we have changed the title to “Mechanism for differential recruitment of orbitostriatal transmission during actions and outcomes following chronic alcohol exposure.”

2) The inhalation exposure it thought to model alcohol dependence not to be alcohol dependence (e.g., alcohol dependence in humans does not develop from inhalation exposure). I suggest that the authors revise statements such as "we find that dependence selectively…" to "we find that chronic inhalation exposure to alcohol selectively…". Similar to my first point, just saying alcohol dependence is misleading.

We have changed the text throughout to describe mice that have undergone CIE procedures as “chronic alcohol exposed mice” or “CIE mice”. When we introduce the CIE procedure, we describe it as a model of alcohol dependence.

3) The authors argue in their response to reviewers that understanding what the increase in fiber photometry activity prior to head entry reflects is difficult to parse apart in their current paradigm which is not a very satisfying response. Although that may be true, I think the authors need to conduct additional analyses to provide some insight into why the increase is occurring before head entry. For example, if the increase is specific to reward can they show traces for head entries that were not rewarded?

The photometry graphs labeled “All head entries” included trials of both the first head entry after outcome delivery and all other head entries that were not rewarded. To show that the increase in OFC terminal activity in Air controls is specific to the first head entry after outcome delivery we have conducted the suggested analyses and removed those first head entry after outcome delivery from “All head entries” (now labeled “Non-rewarded head entries”) and find that the increase in terminal activity is absent in both Air and CIE mice (Figure 11—figure supplement E, H).

In addition, we have additional text in Results and Discussion relating to this point.

In the Results, we have added the following: “Even when we examined OFC-DS terminal activity in relation to outcome delivery (independent of any checking behavior), since the sound associated with pellet delivery could contribute to activity modulation, we saw greater modulation within 1 sec of delivery in Air than CIE mice (Figure 1—figure supplement 1D and G). Further, the pattern of OFC-DS activity in Air and CIE mice was absent when we examined head entries excluding ones following outcome delivery (Figure 1—figure supplement 1E and H).”

In the Discussion, we have added the following: “Interestingly, we found that OFC-DS terminal activity in Air controls, but not CIE mice, ramps up prior to head entries made after outcome delivery (Figure 1P, Q) and following outcome delivery itself (Figure 1—figure supplement 1). It is difficult to disentangle whether it is a predictive response, as mice can readily see into the head entry receptacle within the time frame where this rise occurs. It does suggest future works examining whether OFC-DS activity carries predictive information are warranted.”

4) I'm still having a difficult time understanding how the fiber photometry data were processed. For example, what does "we then estimated baseline fluorescence by calculating the running 10th percentile of the fluorescence signal using a 15 second sliding window" mean? Why was this used to determine which sessions to include or not? Was the signal normalized to the reference channel? If not, please describe why this was not done as it could be useful for readers and others doing fiber photometry analysis.

Our apologies. Hopefully we have clarified the text surrounding analyses in both the methods and Results section of our manuscript.

We do not use the commonly used reference channel of 405 nm or 410nm. This is largely because GCaMP6s has an excitation peak within that wavelength range (see example Cheong et al., 2018 and figure from that paper) and does not serve as a true isobestic control. In addition, as lower wavelength light is damaging to neural tissue, we wanted to reduce light exposure over the longer time course of our sessions (1 hour).

Instead, we used the same procedure used by the Sabatini and Datta labs (e.g., Markowitz et al., 2018). To account for photobleaching and signal decay across a session, we first fit a double exponential line on the raw calcium fluorescence signal and normalized the signal to this fit. Then the Baseline fluorescence (F0) was determined from this signal by calculating the running 10th percentile of the fluorescence signal intensity range using a 15 second sliding estimate. This means that the baseline F0 signal is determined by taking a running average (the 15 sec) of the 10% intensity level of the signal (this includes the entire intensity range of trace including peaks, so it is quite a conservative measure). This F0 trace was then subtracted from the fitted signal to get the ∆F trace across the session, and then normalized by ∆F/F0. This is then z-scored. In addition to clarification in the methods, we have added text to the results and figure axis clarifying that the signal is z-scored ∆F/F0.

We also use code from Markowitz et al., in which only sessions where the 97.5% point of the ∆F/F0 exceeded a 1% change from baseline fluorescence (independent of any behavioral response, just across the entire trace for that session) were included. The use of a 97.5% of ∆F/F0 exceeding a 1% change from baseline fluorescence acts as a signal detection mechanism used to determine whether a calcium signal exists anywhere in the entire session trace. In other words, at a minimum, the top 97.5 percentile of your range across your entire session trace has to be at least 1% different than your baseline trace.

Reviewer #3:

The reviewers have responded with additional analyses and set changes to the paper. While the manuscript is improved, there are still some issues that remain unresolved.

1) CIE is not a model of alcohol dependence; mice cannot be diagnosed with mental illness.

While of course CIE does not capture all aspects of human alcohol use disorder, CIE is a well-established and long-standing rodent model of alcohol dependence in rodents that results in the development of tolerance, withdrawal, and robust increases in voluntary ethanol consumption. Further, CIE was chosen as a model to induce alcohol dependence in rodents by the NIAAA/NIH led Integrative Neuroscience Initiative on Alcoholism (INIA). This NIH organized consortium includes multiple investigators and funded with the goal of understanding brain mechanisms associated with alcohol use disorder RFA-AA-16-005. We have changed our description of mice that have undergone CIE procedures to “chronic alcohol exposed mice” or “CIE mice” so as not to suggest that mice can be diagnosed with mental illness.

2) Photometry data: I am still not convinced the use of each trial in analysis Figure 1 N and P for example are appropriate and truly represent independent samples. Moreover, the per mouse data does not always replicate the trial data, contrary to the authors suggestion in the text. The methods indicate "For each session, "trials" were composed of peri-event signals from -5 to 5 seconds around event onset. In each "trial", 50 ms bins were z-scored to the session's mean pre-event baseline (-5 to -2 seconds)." If this is true, the data for the baseline period should be shown in the figures. This is especially important since the Z score changes are very small in magnitude and examining the noise within the -5 to -2 time frame will allow the readers to more clearly see the changes are event related and not simply noise.

The baseline period of -5 to -2 has been added to the photometry graphs. We also present all events and session average per mouse data side by side in Figure 1 and Figure 5, and associated stats side by side in main text. In Figure 1 we report a slight increase calcium modulation in CIE mice during lever pressing (significant in all events, not in session average), and a much larger calcium modulation associated with head entry after outcome delivery in Air than CIE mice (present in both all events and session average per mouse).

For Figure 5, where we are examining what happens to OFC-DS terminal activity when you give a systemic CB1 antagonist, we find antagonist administration results in a consistent increase in OFC-DS terminal activity during head entry following outcome delivery in CIE but not Air mice.

3) The time course was not evaluated as suggested. From the scatter plot shown it clearly looks like DHPG LTD is strongest in late withdrawal days 18-21. While the time window is not an issue per se, the lack of insight into the time dependence of these phenomenon becomes a major limitation with so few data points. More samples at different points would allow for linear regression analysis to show any time-dependence effects.

Sampling across multiple points in withdrawal to perform linear regression analysis is beyond the scope of this paper. Our goal is not to investigate the effects and time course of protracted withdrawal on OFC-DS transmission but instead to understand how this circuit is altered after the induction of stable and long-lasting alterations in CIE exposed mice. Further, the strong persistence of DHPG-LTD in the presence of a D1 agonist at 21 days, as noted above, suggests that the observed changes are likely stable beyond 21 days. If the observed changes were time dependent and fade across withdrawal, we would expect D1 activation to attenuate DHPG-LTD during late withdrawal, as observed in Air controls. We do think it would be of future interest to map out the emergence of such effect during both exposure and withdrawal timelines and have added discussion to this point.

[Editors’ note: what follows is the authors’ response to the second round of review.]

1) There is some confusion between the "all event trials" and "session average" values (e.g., Figure 1N and 1O). Is the lack of consistency in effect between the "all event trials" and "session average" because a subset of mice had a greater change in dF/F and/or had more bouts than others that when using session averages for each mouse disappear (e.g., all mice were equally represented in the analysis)?

For event types such as lever press bouts, each mouse can have hundreds of bouts per session. For session averages, the variability among those hundreds of bouts is lost. Although some session averages are not significantly different between Air and CIE due to loss of statistical power, such as Figure 1N and 1O, the pattern of the response and net AUC is generally consistent with all event trial data. Differences are not due to different numbers of lever presses between animals, as we see similar distributions of lever press levels/animal within groups.

a) To avoid confusion, the following statement in the methods should be moved to or recapitulated in the main text and an explanation of why these measures might differ should be included (e.g., variance within subjects, more events for individual animals, etc) : "We analyzed this photometry data two ways. First, these 675 z-scored ∆F/F0 traces were combined across all mice within a group. This was done to preserve 676 the variance seen within a subject. Second, we averaged these z-scored ∆F/F0 traces for a given 677 animal session, and then averaged these traces across mice within a group. This examines 678 between-mouse variability but does not preserve within-subject variability."

We have added the following to the main text “Photometry data was analyzed two ways. First, z-scored ΔF/F0 traces were combined across all mice within a group (all event trials). This was done to preserve the variance seen within a subject. Second, we averaged these z-scored ΔF/F0 traces for a given animal session, and then averaged these traces across mice within a group (session average). This examines between-mouse variability but does not preserve within-subject variability.”

2) In the Results – change alcohol dependence to following CIE or following alcohol inhalation exposure.

Alcohol dependence has been changed to “following CIE” and “CIE” respectively.

3) The antibodies for western blot analysis of D1 and D2 receptors are known to be very problematic (and likely inaccurate measures of protein density). Without confirming the validity of these specific antibodies for assessing D1 and D2 receptor density in mice, including this data in the paper and drawing any conclusions from could be misleading. Since it is not critical to the conclusions, we recommend removing it from the paper.

We have chosen to keep the western blot analyses based on the following reasons. The antibodies used for western blot analysis have been knock-out verified by the supplier, Frontiers. Recent papers also used both antibodies for D1 and D2 and showed decreases in D1R/D2R after CRISPR-mediated knockouts (Cui et al., 2020) and show lack of D1R signal by IHC in D1R-KO mice (Nagatomo et al., 2017). In addition, we tested the D1R antibody in D1-Cre mice hypomorhpic for D1R and they showed considerably less bands at the 70-100kD range (data not shown). If the editors disagree after our reasoning above, please let us know and we can then remove it. As stated in communication to the editors, we are hesitant to remove it based on non-specific concerns.

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    Figure 1—source data 1. CIE-induced alteractions in OFC terminal activity in the DS during instrumental responding source data.
    Figure 1—figure supplement 1—source data 1. Figure 1—figure supplement 1 CIE-induced alterations in OFC terminal activity in the DS during instrumental responding, devaluation testing, and histological placements source data 1.
    Figure 1—figure supplement 1—source data 2. Figure 1—figure supplement 1 CIE-induced alterations in OFC terminal activity in the DS during instrumental responding, devaluation testing, and histological placements source data 2.
    Figure 2—source data 1. Endocannabinoid-mediated plasticity of OFC transmission to D1 SPNs in Air and CIE mice source data.
    Figure 2—figure supplement 1—source data 1. No difference betwen Air and CIE in the expression of DHPG-LTD of electrically evoked EPSCs.
    Figure 3—source data 1. D1 receptor function is disrupted in CIE-exposed mice source data.
    Figure 3—figure supplement 1—source data 1. Unilateral D1 agonist injection and distance traveled source data.
    Figure 3—figure supplement 2—source data 1. No change in D2 D1 or D2 receptor expression in CIE mice compared to Air controls source data.
    Figure 4—source data 1. CIE-induced enhancement of DSE at OFC terminals to D1 SPNs in the DS source data.
    Figure 4—figure supplement 1—source data 1. DSE in D2 SPNs and effect of SR141716 on DSE in D1 SPNs source data.
    Figure 4—figure supplement 2—source data 1. CIE-induced effects on D1 receptor function and DSE across recording days source data.
    Figure 5—source data 1. CB1 receptor antagonist restores OFC terminal activity for outcome encoding source data.
    Figure 5—figure supplement 1—source data 1. CB1 antagonist did not alter lever press behavior between training days source data.
    Figure 5—figure supplement 2—source data 1. OFC terminal activity in DS with vehicle or CB1 antagonist source data.
    Figure 6—source data 1. Blockade of CB1 receptors in the DS restores value-based decision-making in CIE mice source data.
    Figure 6—figure supplement 1—source data 1. Behavioral data during lever press training and outcome devaluation in treatment and control groups of Air and CIE mice source data.
    Transparent reporting form

    Data Availability Statement

    All data generated or analyzed during this study are included in the manuscript and supporting files. Source data for all figures has been provided.


    Articles from eLife are provided here courtesy of eLife Sciences Publications, Ltd

    RESOURCES