Skip to main content
The Journal of Neuroscience logoLink to The Journal of Neuroscience
. 2021 Aug 25;41(34):7267–7277. doi: 10.1523/JNEUROSCI.0901-21.2021

The Medial Orbitofrontal Cortex–Basolateral Amygdala Circuit Regulates the Influence of Reward Cues on Adaptive Behavior and Choice

Nina T Lichtenberg 1, Linnea Sepe-Forrest 1, Zachary T Pennington 1, Alexander C Lamparelli 1, Venuz Y Greenfield 1, Kate M Wassum 1,2,3,4,
PMCID: PMC8387114  PMID: 34272313

Abstract

Adaptive reward-related decision making requires accurate prospective consideration of the specific outcome of each option and its current desirability. Often this information must be inferred based on the presence of predictive environmental events. The basolateral amygdala (BLA) and medial orbitofrontal cortex (mOFC) are two key nodes in the circuitry supporting such outcome expectations, but very little is known about the function of direct connections between these regions. Here, in male rats, we first anatomically confirmed the existence of bidirectional, direct projections between the mOFC and BLA and found that BLA projections to mOFC are largely distinct from those to lateral OFC (lOFC). Next, using pathway-specific chemogenetic inhibition and the outcome-selective Pavlovian-to-instrumental transfer and devaluation tests, we interrogated the function of the bidirectional mOFC–BLA connections in reward-directed behavior. We found evidence that the mOFC→BLA pathway mediates the use of environmental cues to understand which specific reward is predicted, information needed to infer which action to choose, and how desirable that reward is to ensure adaptive responses to the cue. By contrast, the BLA→mOFC pathway is not needed to use the identity of an expected reward to guide choice but does mediate adaptive responses to cues based on the current desirability of the reward they predict. These functions differ from those we previously identified for the lOFC–BLA circuit. Collectively, the data reveal the mOFC–BLA circuit as critical for the cue-dependent reward outcome expectations that influence adaptive behavior and decision making.

SIGNIFICANCE STATEMENT To make good decisions we evaluate how advantageous a particular course of action would be. This requires understanding what rewarding outcomes can be expected and how desirable they currently are. Such prospective considerations are critical for adaptive decision making but disrupted in many psychiatric diseases. Here, we reveal that direct connections between the medial orbitofrontal cortex and basolateral amygdala mediate these functions. These findings are especially important in light of evidence of dysfunction in this circuit in substance use disorder and mental illnesses marked by poor decision making.

Keywords: appetitive behavior, decision making, devalaution, memory, Pavlovian conditioning, Pavlovian-to-instrumental transfer

Introduction

To make good decisions we evaluate how advantageous a particular course of action would be, given our circumstances, or state. This includes consideration of what outcomes (e.g., specific rewarding events) might occur as well as the current desirability of those potential outcomes. Often the actual outcome we could receive is not readily apparent in the immediate environment. In these cases, we infer the most advantageous option by mentally representing possible outcomes and the current values of these outcomes. These representations are facilitated by predictive environmental events. Stored stimulus-outcome and action-outcome associative memories enable the outcome expectations that influence decision making (Balleine and Dickinson, 1998; Delamater, 2012; Fanselow and Wassum, 2015). For example, a restaurant logo on your favorite food-delivery app lets you know that a specific type of food (e.g., tacos) is available, and based on your current state (e.g., Did you just have Mexican for lunch?), you can determine that food's value and decide whether it is a suitable dinner option. Such prospective considerations are critical for adaptive decision making and are disrupted in many psychiatric diseases; yet, much is unknown of the underlying neural circuits of these considerations.

The orbitofrontal cortex (OFC) and basolateral amygdala (BLA) are two key nodes in the circuitry supporting reward outcome representations and their influence over decision making. Damage to either region causes performance deficits when adaptive behavior requires one to represent possible rewarding events (Hatfield et al., 1996; Gallagher et al., 1999; Blundell et al., 2001; Pickens et al., 2003, 2005; Izquierdo et al., 2004; Corbit and Balleine, 2005; Wellman et al., 2005; Machado and Bachevalier, 2007; Ostlund and Balleine, 2007, 2008; Johnson et al., 2009; West et al., 2011; Rhodes and Murray, 2013; Malvaez et al., 2015; Lichtenberg and Wassum, 2017; Panayi and Killcross, 2018). The lateral OFC (lOFC) has been particularly implicated, but the anatomically and functionally distinct medial OFC (mOFC; Wallis, 2011; Izquierdo, 2017; Woon et al., 2020) also participates in appetitive decision making (Noonan et al., 2010, 2017; Stopper et al., 2014; Bradfield et al., 2015, 2018; Dalton et al., 2016; Münster and Hauber, 2018; Malvaez et al., 2018a), especially when the value of rewarding options must be mentally simulated (Bray et al., 2010; Bradfield et al., 2015) and/or compared (Noonan et al., 2010; Rudebeck and Murray, 2011; Stopper et al., 2014; Gourley et al., 2016; Yamada et al., 2018). The mOFC both sends and receives direct excitatory projections to and from the BLA, positioning this as a potential crucial circuit for the influence of reward representations over decision making and adaptive behavior. But little is known of the function of direct mOFC→BLA projections and nothing of BLA→mOFC projections. This is limiting for both a basic understanding of appetitive behavior as well as for the understanding and treatment of the myriad psychiatric illnesses marked by dysfunction in the mOFC-BLA circuit (Tanabe et al., 2009; Goldstein and Volkow, 2011; Hahn et al., 2011; Linke et al., 2012; Sladky et al., 2015; Xie et al., 2021).

To address this gap in knowledge, we used a pathway-specific chemogenetic inhibition approach in male rats to uncover the function of mOFC→BLA and BLA→mOFC projections in the prospective consideration of specific predicted rewards and their current desirability required for adaptive reward-related behavior. We used Pavlovian and instrumental conditioning procedures that engender the encoding of two distinct, appetitive, sensory-specific stimulus-outcome memories and two distinct action-outcome memories. We followed this with an outcome-specific Pavlovian-to-instrumental transfer (PIT) test, which assesses the use of these memories to allow reward cues to influence decision making, and an outcome-selective devaluation test to evaluate the ability to use the value of an anticipated reward to guide adaptive behavior. Procedures were identical to our prior investigation of the function of the lOFC–BLA circuit (Lichtenberg et al., 2017) to facilitate comparison of function of the pathways between the BLA and lOFC versus mOFC.

Materials and Methods

Subjects

Male Long Evans rats aged 10–12 weeks at the start of the experiment (Charles River Laboratories) were pair housed before surgery and then subsequently housed individually in a vivarium with regulated temperature (68–79°F) and humidity (30–70%). Rats were provided with filtered tap water ad libitum in the home cage and were maintained on a food-restricted 12–14 g daily diet (LabDiet) to maintain ∼85–90% free-feeding body weight. Rats were handled for ∼3 d before experiment onset. Separate groups of naive rats were used for each experiment. Experiments were performed during the dark phase of a 12:12 h reverse dark/light cycle (lights off at 7 A.M.). All procedures were conducted in accordance with the National Institutes of Health Guide for the Care and Use of Laboratory Animals and were approved by the University of California, Los Angeles Institutional Animal Care and Use Committee.

Surgery

Standard surgical procedures, described previously (Malvaez et al., 2015, 2019; Lichtenberg et al., 2017), were used for all surgeries. Rats were anesthetized with isoflurane (4–5% induction, 1–2% maintenance), and a nonsteroidal anti-inflammatory agent was administered before and after the operation to minimize pain and discomfort. Rats also received postoperative antibiotics.

Anatomical tracing of mOFC-BLA circuit

Rats (N = 3) were infused unilaterally into the BLA with an anterograde and retrograde viral tracer cocktail. The anterograde adeno-associated virus (AAV) expressing the mCherry fluorophore [AAV8-hsyn-mCherry, 4.6 × 1012 viral genomes (vg)/ml; University of North Carolina Vector Core] was mixed 1:1 with the retrogradely trafficked AAV encoding eGFP (AAVrg-hsyn-EGFP, 7.4 × 1012 vg/ml; Addgene) and infused into the BLA [anteroposterior (AP): −3.0, ML: ±5.1, DV: −8.6 mm from bregma; 0.5 µL] at a rate of 0.1 µL/min using a 33 gauge injector. Injectors were left in place for an additional 10 min to ensure adequate diffusion and to minimize off-target spread along the injector tract. Tissue was collected 8 weeks postsurgery to ensure anterograde and retrograde transport for robust expression in BLA axons and terminals in the mOFC and BLA-projecting mOFC cell bodies.

Anatomical tracing of BLA projections to mOFC and lOFC

Rats (N = 4) were infused unilaterally with AlexaFluor-594 conjugated cholera toxin B (CTb; 5 µg/µL; Life Technologies) into the mOFC (AP +4.1, ML ±0.7, DV −5.0 mm from bregma; 0.4 µL) and AlexaFluor-488 conjugated CTb into the lOFC (AP +3.0, ML ±3.2, DV −5.8 mm; 0.6 µL) at a rate of 0.1 µL/min using a 33 gauge injector. Injectors were left in place for an additional 10 min. Tissue was collected 2 weeks postsurgery.

Chemogenetic inhibition of mOFC→BLA projections

Surgery occurred before onset of behavioral training. Rats (N = 11, final) were infused bilaterally with AAV expressing the inhibitory designer receptor human M4 muscarinic receptor (hM4Di; AAV8-hSyn-hM4Di-mCherry, 4.8 × 1012 or 3.7 × 1012 vg/ml; Addgene; 0.3 µL) at a rate of 0.1 µL/min into the mOFC (AP: +4.1, ML: ±0.5, DV: −5.2 mm from bregma) using a 33 gauge injector. Injectors were left in place for an additional 10 min to ensure adequate diffusion and to minimize off-target spread along the injector tract. Twenty-two gauge stainless-steel guide cannulas were implanted bilaterally above the BLA (AP: −3.0, ML: ±5.1, DV: −7.0 mm from bregma). Rats recovered for ∼16 d before the onset of behavioral training. Behavioral testing began 6–7 weeks following surgery to ensure anterograde transport and robust expression in mOFC axons and terminals in the BLA. One rat from this group became ill during behavioral training and was humanely euthanized before testing.

Chemogenetic inhibition of BLA→mOFC projections

Surgery occurred before onset of behavioral training. Rats (N = 9, final) were infused bilaterally with AAV expressing the inhibitory designer receptor hM4Di (AAV8-hSyn-hM4Di-mCherry, 4.8 × 1012 or 3.7 × 1012 vg/ml; Addgene; 0.4 µL) at a rate of 0.1 µL/min into the BLA (AP: −3.0, ML: ±5.1, DV: −8.6 mm from bregma) using a 33 gauge injector. Injectors were left in place for an additional 10 min. Twenty-three gauge stainless-steel guide cannulas were implanted bilaterally above the mOFC (AP: +4.1, ML: ±0.7, DV: −2.8 mm from dura). Rats recovered for ∼16 d before the onset of behavioral training. Behavioral testing began 6–7 weeks following surgery to ensure anterograde transport and robust expression in BLA axons and terminals in the mOFC.

Behavioral procedures

Apparatus

Training took place in conditioning chambers (Med Associates) housed within sound- and light-attenuated boxes, described previously (Malvaez et al., 2015, 2019; Lichtenberg and Wassum, 2017; Wassum et al., 2016; Lichtenberg et al., 2017; Collins et al., 2019). Each chamber contained two retractable levers that could be inserted to the left and right of a recessed food-delivery port in the front wall. A photobeam entry detector was positioned at the entry to the food port. Each chamber was equipped with a syringe pump to deliver 20% sucrose solution in 0.1 ml increments through a stainless-steel tube into one well of the food port and a pellet dispenser to deliver single 45 mg grain food pellets (Bio-Serv) into another well of the same food port. Both a tone and white noise generator were attached to individual speakers on the wall opposite the levers and food port. A 3 W, 24 V house light mounted on the top of the back wall opposite the food-delivery port provided illumination, and a fan mounted on the outer chamber provided ventilation and external noise reduction. Behavioral procedures were similar to those we described previously (Malvaez et al., 2015; Lichtenberg and Wassum, 2017; Lichtenberg et al., 2017).

Pavlovian conditioning

Rats first received eight sessions of Pavlovian training (one session/day) to learn to associate each of two auditory conditional stimuli (CS; 80–82 dB, 2 min duration), tone (1.5 kHz) or white noise, with a specific food reward, sucrose (20%, 0.1 ml/delivery) or grain pellets (one 45 mg pellet/delivery; Bio-Serv). CS-reward pairings were counterbalanced at the start of each experiment. For half the subjects, tone was paired with sucrose and noise with pellets, with the other half receiving the opposite arrangement. Each session consisted of eight tone and eight white noise presentations. During each 2 min CS the associated reward was delivered on a 30 s random-time schedule, resulting in an average of four CS–reward pairings per trial. CSs were delivered in pseudorandom order with a variable 2–4 min (mean = 3 min) intertrial interval.

Instrumental conditioning

Rats were then given 11 d minimum of instrumental training. They received two separate training sessions per day, one with the left lever and one with the right lever, separated by at least 1 h. Each action was reinforced with a different outcome (e.g., left press→pellets; right press→sucrose; counterbalanced with respect to the Pavlovian contingencies). Each session terminated after 30 outcomes had been earned or 30 min had elapsed. The reinforcement schedule was escalated ultimately to random ratio (RR)-20 (on average 20 presses required to obtain reward delivery). Rats received 1 training day (minimum) in which their actions were continuously reinforced, then a minimum of 2 training days on RR-2, 2 d on RR-5, and 3 d on RR-10, before being moved to the final RR-20 schedule for at least 3 d. To be escalated to the next schedule, rats had to earn at least 24 outcomes within 30 min. This requirement had to be met on the final RR-20 schedule for at least 2 training days before subjects were advanced to testing. The maximum number of days required to reach criterion for instrumental acquisition was 15.

Outcome-selective Pavlovian-to-instrumental transfer test

Following Pavlovian and instrumental conditioning, rats received a set of outcome-selective PIT tests. On the day before each PIT test, rats were given a single 30 min extinction session during which both levers were available, but pressing was not reinforced to establish a low level of responding. During the PIT test, both levers were continuously present, but pressing was not reinforced. After 5 min of lever-pressing extinction, each 2 min CS was presented separately four times in pseudorandom order, separated by a fixed 4 min intertrial interval. No rewards were delivered during CS presentation. Rats received two of each test, one following infusion of vehicle and one following infusion of Clozapine-n-oxide (CNO), counterbalanced for order. Rats were given two retraining sessions for each instrumental association (two sessions/day for 2 d) and one Pavlovian retraining session in between tests.

Outcome-selective devaluation test

Rats were given a set of two outcome-specific devaluation tests. Immediately before each test, rats were given 1 h of unlimited access to either the sucrose solution or food pellets in preexposed feeding chambers to establish a sensory-specific satiety and thus selectively devalue the prefed reward. The test consisted of two phases. In the first, both levers were available, and nonreinforced lever pressing was assessed for 5 min. The levers were then retracted, which started the second Pavlovian test phase in which each 2 min CS was presented, without accompanying reward, separately 2 times each in alternating order and separated by a fixed 4 min intertrial interval. Rats received two of each test, one in which food pellets were devalued and one in which the sucrose solution was devalued. After prefeeding and immediately before the test, rats were infused with either vehicle or CNO. Test order and the drug/prefed outcome relationship were counterbalanced across subjects and with respect to the CS–reward relationships. For both the mOFC→BLA and BLA→mOFC groups, there were no significant differences in the amount consumed during prefeeding between the future drug conditions (mOFC→BLA, vehicle: 21.14 grams ± 2.60 SEM; CNO: 22.35 ± 3.06; t(9) = 0.24, p = 0.81; BLA→mOFC, vehicle: 19.02 ± 2.28; CNO: 16.58 ± 2.96; t(8) = 0.58, p = 0.57). Successful devaluation was confirmed by post-test consumption of each food reward. For both groups, the mOFC→BLA [vehicle, valued: 8.26 grams ± 1.35 SEM, devalued: 0.92 ± 0.31; CNO, valued: 7.45 ± 1.26, devalued: 1.81 ± 0.78; outcome (valued v. devalued): F(1,9) = 76.06, p < 0.0001; drug: F(1,9) = 0.0009, p = 0.98; drug × outcome: F(1,9) = 0.47, p = 0.51] and BLA→mOFC (vehicle, valued: 4.81 ± 0.75, devalued: 3.82 ± 1.24; CNO, valued: 9.36 ± 2.07, devalued: 1.49 ± 0.56; outcome: F(1,8) = 17.86, p = 0.003; drug: F(1,8) = 3.21, p = 0.11; drug × outcome: F(1,8) = 3.95, p = 0.08), across both drug conditions, rats ate significantly less of the devalued food than the valued food. We excluded 1 mOFC→BLA rat from the devaluation test dataset because it did not eat during one of the prefeeding sessions. Rats were given 1–2 d without behavioral training or testing after each devaluation test to reestablish hunger and then received two retraining sessions for each instrumental association (two sessions/day for 2 d) and one Pavlovian retraining session in between tests.

Data collection

Lever presses and/or discrete entries into the food-delivery port were recorded continuously for each session. For the Pavlovian training and test sessions, the 2 min periods before each CS (preCS) onset served as the baseline for comparison of CS-induced elevations in lever pressing and/or food-port entries.

Chemogenetic inhibition of mOFC→BLA projections

In the mOFC→BLA group, chemogenetic inhibition was used to inactivate hM4Di-expressing mOFC axons and terminals in the BLA before one PIT test and after prefeeding before one of the devaluation tests. We selected chemogenetic inhibition to allow inhibition throughout the duration of each test. CNO (Tocris Bioscience) was dissolved in artificial cerebral spinal fluid (aCSF) to 1 mm, and 0.5 µL was intracranially infused over 1 min bilaterally into the BLA as previously described (Lichtenberg et al., 2017; Malvaez et al., 2019). Injectors were left in place for at least 1 additional minute to allow for drug diffusion. Testing commenced within 5–10 min following infusion. Using these procedures, we have previously demonstrated effective inactivation of OFC axons and terminals in the BLA both in vivo and ex vivo (Lichtenberg et al., 2017; Malvaez et al., 2019). We have also demonstrated that this dose of CNO when infused into the BLA has no effect on reward-related behavior, BLA activity, or OFC terminal activity in the BLA in the absence of the hM4Di transgene (Lichtenberg et al., 2017; Malvaez et al., 2019). Thus, for the control condition, aCSF was infused into the BLA using identical procedures.

Chemogenetic inhibition of BLA→mOFC projections

For the BLA→mOFC group, chemogenetic inhibition was used to inactivate hM4Di-expressing BLA axons and terminals in the mOFC before one PIT test and after prefeeding before one of the devaluation tests. Procedures were identical to those above with the exception that CNO or aCSF was infused bilaterally into the mOFC (0.3 µL). We have previously shown these procedures to effectively attenuate the activity of BLA terminals in the OFC and have found that intra-OFC infusion of this dose of CNO does not affect reward-related behavior, OFC activity, or BLA terminal activity in the OFC in the absence of hM4Di (Lichtenberg et al., 2017). Three rats were excluded from this group because of cannula clog and thus the inability to infuse CNO.

Histology

Rats were deeply anesthetized with Nembutal and transcardially perfused with PBS followed by 4% paraformaldehyde. Brains were removed and postfixed in 4% paraformaldehyde overnight, placed into 30% sucrose solution, then sectioned into 30–40 µm slices using a cryostat and stored in PBS or cryoprotectant. For the tracing experiments, fluorescent microscopy was used to confirm expression of eYFP in BLA-projecting mOFC cell bodies. Free-floating coronal sections were mounted onto slides and coverslipped with ProLong Gold mounting medium with DAPI (Invitrogen). The signal for BLA axonal expression of mCherry in the mOFC was immunohistochemically amplified using antibodies directed against mCherry. Floating coronal sections were washed two times in 1 × PBS for 10 min and then blocked in a solution of 5% normal goat serum (NGS) and 1% Triton X-100 dissolved in PBS for 1–2 h at room temperature. Sections were then washed three times in PBS for 15 min and then incubated in blocking solution containing rabbit anti-DsRed antibody (1:1000; EMD Millipore) with gentle agitation at 4°C for 18–22 h. Florescence imaging was used to visualize AlexaFluor-488 and 594-conjugated CTb in lOFC- and mOFC-projecting BLA cells.

To visualize hM4Di-mCherry expression in BLA or mOFC cell bodies, free-floating coronal sections were mounted onto slides and coverslipped with ProLong Gold mounting medium with DAPI (Invitrogen). Axonal expression of hM4Di-mCherry in terminal regions was immunohistochemically amplified. Floating coronal sections were washed two times in 1 × PBS for 10 min and then blocked in a solution of 5% NGS and 1% Triton X-100 dissolved in PBS for 1–2 h at room temperature. Sections were then washed three times in PBS for 15 min and then incubated in blocking solution containing rabbit anti-DsRed antibody (1:1000; EMD Millipore) with gentle agitation at 4°C for 18–22 h. On the second day, sections were rinsed three times in the blocking solution and incubated in Alexa Fluor 594-conjugated (red) goat secondary antibody (1:500; Invitrogen) for 2 h.

Images were acquired using a Keyence BZ-X710 microscope with a 4×,10×, and 20× objective (CFI Plan Apo), CCD camera, and BZ-X Analyzer software. Subjects with off-target viral and/or cannula placements were removed from the dataset (mOFC→BLA, N = 7; BLA→mOFC, N = 7). One subject from the mOFC→BLA group had extensive tissue damage and was also removed. We also excluded N = 2 from the mOFC→BLA group and N = 1 from the BLA→mOFC group because we were unable to verify expression.

Data analysis

Behavioral analysis

Behavioral data were processed with Microsoft Excel. Left and/or right lever presses and/or entries into the food delivery port were collected continuously for each training and test session. For the last day of Pavlovian training, we compared the rate of food-port entries between the CS-probe (after CS onset, before reward delivery) and the preCS baseline periods. Data were averaged across trials for each CS and then averaged across the two CSs. Press rates on the last day of instrumental training were averaged across levers. For the PIT test, the baseline lever-press rate (presses/min) averaged across both levers during the 2 min periods immediately before the onset of each CS was compared with that during the CS periods. During the CS periods, lever pressing was separated for presses on the lever that during training, earned the same outcome as the presented cue (CS-Same presses) versus those on the other available lever (CS-Different presses). Data were averaged across trials for each CS and then averaged across the two CSs. Rate of entries into the food delivery port were also compared between the baseline and CS periods, averaged across both CSs. For the devaluation test instrumental phase, the lever-press rate was compared between the lever that in training earned the prefed food outcome (i.e., the devalued outcome) and that which had previously earned the nonprefed (valued) outcome. During the Pavlovian phase the rate of entries into the food-delivery port were compared between the 2 min preCS baseline periods and the CS periods, which were separated by the CS that predicted the devalued outcome versus the CS that predicted the valued outcome.

Image Analysis

For the CTb tracing experiment, for each subject we analyzed two slices each of the anterior (−2.3 to −2.56 mm posterior to bregma), middle (−2.8 to −3.14 mm posterior to bregma), and posterior (−3.3 to −3.6) BLA. We used 20× images from the center of the lateral amygdala (LA) and BLA for each analysis. Single- and double-labeled neurons for each fluorophore were counted using a custom-written script in ImageJ (version 1.50i; National Institutes of Health). Cells counts were totaled across the LA and BLA and across slices.

Statistical analysis

Datasets were analyzed by two-tailed, paired Student's t tests, one- or two-way repeated-measures ANOVA, as appropriate (GraphPad Prism, GraphPad; IBM SPSS). post hoc tests used the Bonferroni correction. All data were tested for normality before analysis with ANOVA, and the Greenhouse-Geisser correction was applied to mitigate the influence of unequal variance between conditions if the assumption of sphericity was not met. Alpha levels were set at p < 0.05.

Rigor and reproducibility

Group sizes were estimated a priori based on prior work using male Long Evans rats in this behavioral task (Malvaez et al., 2015; Lichtenberg and Wassum, 2017; Lichtenberg et al., 2017) and to ensure counterbalancing of CS-reward and lever-reward pairings. Investigators were not blinded to condition because they were required to administer drugs. All behaviors were scored using automated software (MedPC). Each behavioral experiment included at least one replication cohort, and cohorts were balanced by viral group, CS-reward and lever-reward pairings, and drug test order, before the start of the experiment.

Data and Code Availability

All data supporting the findings of this study are available from the corresponding author on request.

Results

The BLA and mOFC are bidirectionally connected

Using anterograde and retrograde viral-mediated tracing, we first confirmed the existence of direct connections between the BLA and mOFC, as identified previously (Kita and Kitai, 1990; Hoover and Vertes, 2011; Heilbronner et al., 2016; Reppucci and Petrovich, 2016; Malvaez et al., 2019; Barreiros et al., 2021). An anterograde and retrograde tracer was infused into the BLA of male rats (Fig. 1A; N = 3). We detected robust expression of the retrograde tracer fluorophore in BLA-projecting mOFC cell bodies, confirming a direct mOFC→BLA projection. We also detected robust axonal expression of the anterograde tracer fluorophore in the mOFC, indicating the presence of BLA→mOFC projections. Thus, the mOFC and BLA share direct, bidirectional connections.

Figure 1.

Figure 1.

The BLA and mOFC are bidirectionally connected. A, Representative images of the anterograde (mCherry) and retrograde (eGFP) tracer expression in the BLA infusion site (bottom left) and of BLA axons and terminals (mCherry) and BLA-projecting cell bodies (eGFP) throughout the medial and lateral OFC. Insets, A 20× image of cell body and axonal expression. B, Schematic representation of the spread of the retrograde tracers AlexaFluor-594 CTb (red, mOFC) and AlexaFluor-488 CTb (green, lOFC) in the OFC infusion sites. C, Representative images of AlexaFluor-594 CTb and AlexaFluor-488 CTb spread in the mOFC and lOFC infusion sides, and of mOFC- and lOFC-projecting cell bodies in the BLA. D, Quantification of AlexaFluor-594 CTb (mOFC projecting), AlexaFluor-488 CTb-labeled (lOFC projecting), and colabeled cells across the AP axis of the LA and BLA. E, Quantification of AlexaFluor-594 CTb (mOFC projecting), AlexaFluor-488 CTb-labeled (lOFC projecting), and co-labeled cells totaled across LA and BLA for all slices.

The BLA also projects to the lOFC (Morecraft et al., 1992; Lichtenberg et al., 2017; Barreiros et al., 2021), and we previously found that activity in these projections participates in decision making and adaptive behavior (Lichtenberg et al., 2017). Thus, we next asked whether the BLA pathway to the mOFC is distinct or overlapping with that to the lOFC, that is, whether the BLA sends collateral projections to both mOFC and lOFC. We injected retrograde tracers into the lOFC and mOFC of male rats (Fig. 1B–C; N = 4). We detected expression of both tracer fluorophores in BLA cell bodies across the AP extent of the BLA, confirming the presence of BLA projections to both lOFC and mOFC (Fig. 1D; lateral amygdala, AP × projection: F(10,30) = 4.95, p = 0.0003; AP: F(5,15) = 4.10, p = 0.02; projection: F(2,6) = 8.23, p = 0.02; BLA, AP × projection: F(10,30) = 4.21, p = 0.001; AP: F(5,15) = 1.31, p = 0.31; projection: F(2,6) = 8.78, p = 0.001). lOFC-projecting cells were more prominent than mOFC projectors in the anterior BLA. We detected more labeling of BLA→lOFC projecting cells than BLA→mOFC projecting cells, likely because of the larger size, and thus tracer infusion volume, of the lOFC relative to mOFC. Critically, we detected very few colabeled cells, indicating that for the most part, the BLA pathways to the mOFC and lOFC are distinct (Fig. 1E; F(2,6) = 10.11, p = 0.01).

The mOFC→BLA pathway regulates the influence of stimulus-outcome memories over decision making and adaptive conditional responding

We next used chemogenetic inhibition to interrogate the function of mOFC→BLA projections in the influence of stimulus-outcome and action-outcome memories over decision making and adaptive conditional behavior (Fig. 2A). We expressed the inhibitory designer receptor hM4Di in the mOFC and placed guide cannula bilaterally over the BLA near hM4Di-expressing mOFC axons and terminals (Fig. 2B,C; N = 11). This allowed us to later infuse CNO (1 mm) to inactivate mOFC axons and terminals in the BLA, as we have previously validated both in vivo and ex vivo (Lichtenberg et al., 2017; Malvaez et al., 2019).

Figure 2.

Figure 2.

The mOFC→BLA pathway regulates the influence of stimulus-outcome memories over decision making and adaptive conditional responding. A, Schematic of behavioral training, testing, and chemogenetic inactivation strategy. CS, conditional stimulus (white noise or tone); O, outcome (sucrose solution or food pellet); A, action (left or right lever press). B, Representative fluorescent image of hM4Di-mCherry expression in mOFC cell bodies and immunofluorescent image of mOFC axons and terminals in the BLA near the implanted guide cannula. C, Schematic representation of hM4Di-mCherry expression in mOFC and axonal expression and injector tip placements in BLA for all subjects. D, Lever-press rate (lever presses/min) during the PIT test averaged across levers during the preCS baseline periods and during the CS periods on the lever earning the same outcome as the presented CS and the alternate lever (Different). Data are averaged across trials and CSs. E, Rate of entries into the food-delivery port (food-port entries/min) during the PIT test for the preCS baseline periods and during CS presentation (averaged across trials and CSs). F, Average lever-press rate during the instrumental choice phase of the outcome-specific devaluation test. Presses separated for those on the action that in training earned the valued versus devalued (prefed) reward. G, Rate of entries into the food-delivery port during the outcome-specific devaluation test, averaged across trials, during the preCS baseline period, and during presentation of the CS predicting the valued and devalued (prefed) rewards. Lines represent individual subjects. *p < 0.05, **p < 0.01, ***p < 0.001.

Rats were food deprived and given Pavlovian and instrumental conditioning using procedures that engender the encoding of rich, sensory-specific stimulus-outcome and action-outcome memories (Ostlund and Balleine, 2008; Malvaez et al., 2015; Lichtenberg et al., 2017; Lichtenberg and Wassum, 2017; Sias et al., 2021). For Pavlovian conditioning, each of two 2 min auditory CSs (white noise and tone) were associated with intermittent delivery of one of two distinct food rewards (sucrose solution or food pellets, e.g., white noise-sucrose/tone-pellet). Thus, each cue sets the state in which a specific reward can be expected. During each session, each cue was presented eight times in pseudorandom order (variable average 3 min intertrial interval) for 2 min, during which its associated reward was delivered on average every 30 s. Evidence of simple Pavlovian conditioning was detected in the goal-approach response (entries into the food delivery port) to the cue before reward delivery (baseline: 8.23 entries/min ± 0.89 SEM; CS probe period: 18.72 ± 1.58; t(10) = 12.57, p < 0.0001). Rats then received instrumental conditioning to learn that two different actions (left or right lever press) each earned one of the two food rewards (e.g., left press→sucrose/right press→pellets), ultimately on a random-ratio 20 schedule of reinforcement, thus establishing two action-outcome memories. Rats reached a final average press rate of 35.18 ± 3.13 presses/min.

We first asked whether activity in mOFC→BLA projections is necessary for the influence of stimulus-outcome and action-outcome memories over decision making using the outcome-specific PIT test. Rats received two PIT tests, counterbalanced for order, one following intra-BLA infusion of CNO to inactivate hM4Di-expressing mOFC axons and terminals in the BLA and one following infusion of aCSF vehicle. We chose a vehicle-infused control to provide a within-subject comparison, based on evidence that CNO when infused at this dose into the BLA has no effect on the expression of PIT, similar reward-related behaviors, BLA activity, or OFC terminal activity in the BLA in the absence of the hM4Di transgene (Lichtenberg et al., 2017; Malvaez et al., 2019). During each test, both levers were continuously present, but lever pressing was not rewarded. Each CS was presented four times (also without accompanying reward), with intervening CS-free baseline periods, to assess its influence on action performance and selection in the novel choice scenario. The test was unrewarded to force subjects to rely on memory of the predicted rewards. Because the cues are never associated with the instrumental actions, this test assesses the ability, on cue presentation, to represent the specific predicted reward and use this to motivate choice of the action known to earn the same unique reward (Kruse et al., 1983; Colwill and Motzkin, 1994; Gilroy et al., 2014; Corbit and Balleine, 2016). If subjects are able use both the stimulus-outcome and action-outcome memories to enable accurate reward representation, then cue presentation should cause them to increase their lever presses selectively on the action that earned the same outcome as predicted by that cue. This is consistent with the interpretation that rats use the cues to know which reward is predicted and, thus, infer which action would be most advantageous. Rats showed this outcome-specific PIT effect in the control condition (Fig. 2D). Cue presentation increased presses selectively on the lever that in training earned the same outcome as the presented cue, relative to the lever that earned the different outcome. Conversely, the influence of the cues over lever-press choice was greatly attenuated following intra-BLA CNO infusion [Fig. 2D; drug × CS/lever: F(1.4,14) = 4.20, p = 0.049; drug (vehicle vs CNO): F(1,10) = 9.51, p = 0.01; CS/lever (preCS, CS-Same vs CS-Different): F(1.5, 14.7) = 10.87, p = 0.002]. This effect was selective to cue-influenced decision making. Inactivation of mOFC→BLA projections neither affected baseline lever-pressing activity (p > 0.9999, Bonferroni-corrected post hoc comparison), nor conditional approach responses to the shared food-delivery port during the cues [Fig. 2E; CS (baseline vs CS): F(1,10) = 71.66, p < 0.0001; drug: F(1,10) = 4.14, p = 0.07; drug × CS: F(1,10) = 1.70, p = 0.22]. Thus, activity in mOFC→BLA projections is not needed to support general reward pursuit activity or conditional approach responses but is necessary to use appetitive cues to know which reward is predicted and, thus, bias decision making.

PIT requires that the subjects both know which outcome is predicted by the presented cue and which action earns that same outcome. Thus, the disrupted expression of PIT by inactivation of mOFC→BLA projections could reflect a function for this pathway in stimulus-outcome memory, action-outcome memory, or both. To arbitrate among these possibilities, we retrained rats on both the Pavlovian and instrumental contingencies and gave them a set of outcome-specific devaluation tests. This also provided an opportunity to assess the contribution of mOFC→BLA projection activity to the ability to use the value of an anticipated reward to guide adaptive behavior. Before each test, one of the food rewards was devalued by sensory-specific satiety (1 h prefeeding). No manipulation was made during this prefeeding to allow devaluation learning to proceed undisrupted. Immediately following the prefeeding, rats received an infusion of vehicle or CNO into the BLA and were then given a two-phase test. In the first phase, rats received 5 min access to both levers in a choice. Lever pressing was not reinforced to force subjects to rely on their action-outcome memories. If rats can use their action-outcome memories to represent the expected outcome of each action and its value, then they should select the action that earns the valued reward, downshifting responding on the action that earns the devalued reward. Rats showed this sensitivity of instrumental choice to outcome-selective devaluation in the control condition and when mOFC→BLA projections were inactivated [Fig. 2F; lever (valued vs devalued): F(1,9) = 15.32, p = 0.004; drug: F(1,9) = 0.20, p = 0.67; drug × lever: F(1,9) = 0.47, p = 0.51). Thus, activity in mOFC→BLA projections is not necessary to retrieve action-outcome memories or use them to guide choice behavior. In the second phase, the levers were retracted, and each cue was presented twice in alternating order. Again, the subjects were forced to rely on memory, in this case the stimulus-outcome memories, because the cues were not accompanied by the associated reward. If rats can use the cues to know which reward is predicted and how valuable that reward currently is, then they should be able to infer that it is advantageous to respond to the cue signaling the valued outcome by checking the food-delivery port, but it is not advantageous to respond during the cue signaling the devalued outcome. This was apparent in the control condition. Following intra-BLA vehicle infusion, rats showed robust conditional food-port approach responses to the cue signaling the valued reward but attenuated responses to the cue signaling the devalued reward. Inactivation of mOFC→BLA projections prevented subjects from adapting the conditional goal-approach responses based on the value of the predicted reward [Fig. 2G; drug × CS: F(1.3, 11.9) = 8.07, p = 0.01; drug: F(1,9) = 1.32, p = 0.28; CS (valued vs devalued): F(1.5,13.9) = 20.47, p = 0.0001]. Thus, activity in mOFC→BLA projections is necessary for adaptive responses to a cue based on the current value of the predicted reward. Together with the PIT results, this suggests that mOFC→BLA projection activity mediates the use of environmental cues to represent which specific reward is predicted and the desirability of that reward, both of which are critical for adaptive decision making.

The BLA→mOFC pathway mediates adaptive cue responses based on the value of the predicted reward

We next asked whether BLA projections back to the mOFC are similarly involved appetitive behavior (Fig. 3A; N = 9). The experiment was identical, except this time we expressed the inhibitory designer receptor hM4Di in the BLA and placed guide cannula bilaterally over the mOFC near hM4Di-expressing BLA axons and terminals (Fig. 3B,C). This allowed a later CNO (1 mm) infusion to inactivate BLA axons and terminals in the mOFC as we have validated previously (Lichtenberg et al., 2017). We have also previously shown that in the absence of the hM4Di transgene, CNO infusion at this dose into the OFC does not affect OFC activity, BLA terminal activity in the OFC, or reward-related behavior (Lichtenberg et al., 2017), and thus used a within-subject, vehicle-infused control condition. Rats provided evidence of successful Pavlovian conditioning in their final average goal-approach responses to the cues (baseline: 6.60 entries/min ± 0.69 SEM; CS probe period: 14.64 ± 1.71; t(8) = 6.56, p = 0.0002) and also acquired performance of the instrumental actions, reaching a final average press rate of 39.70 ± 2.51 presses/min.

Figure 3.

Figure 3.

The BLA→mOFC pathway regulates adaptive cue responses based on predicted reward value. A, Schematic of behavioral training, testing, and chemogenetic inactivation strategy. CS, conditional stimulus (white noise or tone); O, outcome (sucrose solution or food pellet); A, action (left or right lever press). B, Representative fluorescent image of hM4Di-mCherry expression in cell bodies of the BLA and immunofluorescent image of BLA axons and terminals in the mOFC near the implanted guide cannula. C, Schematic representation of hM4Di-mCherry expression in BLA and axonal expression and injector tip placements in mOFC for all subjects. D, Lever-press rate (lever presses/min) during the PIT test averaged across levers during the preCS baseline periods and during the CS periods on the lever earning the same outcome as the presented CS and the alternate lever (Different). Data are averaged across trials and CSs. E, Rate of entries into the food-delivery port (food-port entries/min) during the PIT test for the preCS baseline periods and during CS presentation (averaged across trials and CSs). F, Average lever-press rate during the instrumental phase of the outcome-specific devaluation test. Presses separated for those on the action that in training earned the valued versus devalued (prefed) reward. G, Rate of entries into the food-delivery port during the outcome-specific devaluation test, averaged across trials, during the preCS baseline period, and during presentation of the CS predicting the valued and devalued (prefed) rewards. Lines represent individual subjects. *p < 0.05, **p < 0.01.

In contrast to the mOFC→BLA pathway, activity of BLA→mOFC projections was found to be unnecessary for the expression of PIT. Following intra-mOFC infusion of either vehicle or CNO, the reward-predictive cues were capable of biasing lever-pressing choices toward the lever that during training earned the same outcome as the presented cue relative to the lever that earned the different outcome (Fig. 3D; CS/lever: F(1.5, 12) = 63.71, p < 0.0001; drug: F(1,8) = 0.26, p = 0.62; drug × CS/lever: F(1.2, 9.5) = 0.22, p = 0.69). During this PIT test, the conditional goal-approach response was also similar between groups (Fig. 3E; CS: F(1,8) = 83.04, p < 0.0001; drug: F(1,8) = 0.04, p = 0.86; drug × CS: F(1,8) = 0.06, p = 0.81). Thus, activity in BLA→mOFC projections is neither needed to retrieve action-outcome or stimulus-outcome memories nor to use this information to generate representations of the specific predicted reward to influence decision making.

Activity in BLA→mOFC projections was, however, critical to adapt approach responses to the cues based on the value of the predicted reward. During the outcome-specific devaluation test, in both the intra-mOFC vehicle and CNO conditions, rats showed sensitivity of their instrumental choice behavior to outcome-specific devaluation, downshifting responding on the action that in training earned the devalued reward (Fig. 3F; lever: F(1,8) = 20.60, p = 0.002; drug: F(1,8) = 0.57, p = 0.47; drug × lever: F(1,8) = 0.22, p = 0.65). This confirmed that BLA→mOFC projection activity is not needed to retrieve action-outcome memories and further demonstrated that these projections are not needed for general value-based decision making. However, inactivation of BLA→mOFC projections did prevent subjects from adapting Pavlovian conditional food-port approach responses based on the value of the predicted reward. Following intra-mOFC vehicle infusion, rats showed robust conditional food-port approach responses to the cue signaling the valued reward and attenuated responses to the cue signaling the devalued reward. This sensitivity of the conditional goal-approach response to devaluation was abolished following intra-mOFC CNO infusion [Fig. 3G; drug × CS: F(2,16) = 5.01, p = 0.02; drug: F(1,8) = 2.98, p = 0.12; CS (valued vs devalued): F(2,16) = 9.86, p = 0.002]. Thus, based on the PIT results, activity in the BLA→mOFC pathway is not needed to use the cues to know the identity of predicted reward, but the devaluation results indicate it is required to use the current value of that reward to infer how advantageous it would be to respond to the cue.

Discussion

Here, we interrogated the function of mOFC-BLA circuitry in the prospective considerations that underlie adaptive reward-related behavior. Using anatomical tracing, we confirmed the existence of bidirectional, direct pathways between mOFC and BLA and found that BLA projections to the mOFC and lOFC are largely distinct. Using pathway-specific chemogenetic inhibition, we found that mOFC→BLA projection activity is critical for using stimulus-outcome memories to guide decision making based on which specific reward is expected and adaptive cue responses based on the desirability of that predicted outcome. BLA→mOFC projection activity, however, is only needed to adapt cue responses based on the desirability of the predicted reward.

We found that mOFC→BLA pathway activity is critical for using predictive environmental cues to know which specific reward is predicted and the current value of that option. mOFC neuronal activity can represent a cue-reward memory (Namboodiri et al., 2019) and is necessary, across species, for the use of such memories to inform adaptive behavior (Noonan et al., 2010, 2017; Bradfield et al., 2015, 2018). Our data indicate this function is, at least in part, achieved via direct projections to the BLA. We have previously found that mOFC→BLA pathway activity mediates the retrieval of the incentive value of an expected food reward to ensure its adaptive pursuit (Malvaez et al., 2019). Incentive value is another form of state-dependent reward memory (food has high value when hungry, but low when sated). Thus, mOFC→BLA projections may be responsible for using the current state, defined both by external and internal physiological cues, to make a judgment about how advantageous a certain course of action might be.

This function of mOFC→BLA projections is distinct from that of lOFC→BLA projections. Using the same procedures, we previously found that lOFC→BLA projections are not needed for reward cues to bias choice toward the predicted reward (Lichtenberg et al., 2017). They are also unnecessary for retrieving the incentive value of an expected reward (Malvaez et al., 2019). Rather, lOFC→BLA pathway activity drives the learning of these state-dependent reward memories (Malvaez et al., 2019; Sias et al., 2021), a function that requires the BLA itself (Wassum et al., 2009, 2011, 2016; Parkes and Balleine, 2013) but not mOFC→BLA projections (Malvaez et al., 2019). Thus, the lOFC→BLA pathway mediates the formation of state-dependent reward memories, and the mOFC→BLA pathway facilitates the use of this information to guide adaptive reward-related behavior. Interestingly, however, under more dynamic decision scenarios the function of individual components of the OFC-BLA circuit can appear more overlapping. For example, during reversal learning, subjects must learn, integrate, and use information about reward availability and option value, and this form of decision making is influenced by lesion of lOFC→BLA projections (Groman et al., 2019).

The BLA→mOFC pathway was found to mediate adaptive responses to cues based on the desirability of the predicted reward. This is consistent with prior evidence that the BLA is needed for sensitivity of Pavlovian responses to outcome-specific devaluation (Hatfield et al., 1996; Johnson et al., 2009) and evidence that BLA neuronal responses to reward-predictive cues can reflect the value of the predicted reward (Schoenbaum et al., 1998; Saddoris et al., 2005; Paton et al., 2006; Belova et al., 2007, 2008). The data here reveal this function to be achieved, at least in part, via projections to the mOFC. BLA→mOFC projection activity was not necessary for reward cues to bias choice toward the predicted reward. Expression of such outcome-selective PIT depends on the sensory-specific identity of the predicted reward, but not its value (Holland, 2004). Thus, BLA→mOFC pathway activity mediates cue-driven behaviors based on a representation of the predicted reward's value but not its identity. Surprisingly, this indicates that a reward's identity can be decoupled from its value, that one can represent which specific reward is predicted but not the current value of that reward.

The function of the BLA→mOFC pathway identified here is distinct from that previously identified for the BLA→lOFC pathway. Using the same procedures, we previously found that activity in the anatomically distinct BLA→lOFC pathway is necessary for reward cues to both bias action choice toward the predicted reward and to adapt conditional approach responses based on the value of the predicted reward. Thus, the BLA→lOFC pathway allows one to use cues to know which specific reward is predicted, whereas BLA→mOFC pathway activity promotes behavior based on the current desirability of that predicted reward. Whether BLA→lOFC function in value is secondary to representing reward identity (if you do not know which reward is predicted, then you cannot represent its value) is a critical open question.

Neither mOFC→BLA nor BLA→mOFC projections were necessary for general conditional goal-approach responses, consistent with evidence from BLA or mOFC lesions (Hatfield et al., 1996; Everitt et al., 2000; Parkinson et al., 2000; Corbit and Balleine, 2005; Bradfield et al., 2015, 2018; Malvaez et al., 2015; Morse et al., 2020). This well-learned cue response does not require a representation of the specific predicted reward or on-the-fly use of this information for prospective consideration of the most advantageous option. Instead, it can rely on a previously learned policy. Thus, mOFC-BLA circuitry is not needed for cue responses generally but is needed when one must use stimulus-outcome memories to infer what to do based on which outcomes are expected and the current desirability of the outcomes.

The mOFC-BLA circuit was also not needed for rats to adapt their instrumental choice behavior based on the value of the predicted reward. Rather, it is selectively needed for cues to inform the considerations guiding adaptive appetitive behavior. Both the BLA and mOFC are needed for the sensitivity of instrumental choice to devaluation (Balleine et al., 2003; Ostlund and Balleine, 2008; Johnson et al., 2009; Bradfield et al., 2015, 2018; Gourley et al., 2016). This function may, therefore, be achieved via alternate pathways, perhaps those to the striatum (Hoover and Vertes, 2011; Corbit et al., 2013; van Holstein et al., 2020), a region heavily implicated in action-outcome memory (Malvaez and Wassum, 2018; Malvaez et al., 2018b; Malvaez, 2020).

Collectively, these data reveal the mOFC-BLA circuit as critical for the cue-dependent reward outcome expectations that influence decision making and adaptive cue responses. mOFC neuronal activity can represent partially unobservable states (Lopatina et al., 2017; Elliott Wimmer and Büchel, 2019) and is needed when such states must be used to make adaptive choices (Bradfield et al., 2015, 2018). Activity in the mOFC→BLA pathway may, therefore, be involved in signaling the state predicted by the reward cues. The BLA pathways to the lOFC and mOFC might then facilitate the use of this information to guide adaptive behavior, with BLA→lOFC projections facilitating detailed, sensory-specific representations of expected rewards (Lichtenberg et al., 2017) and BLA→mOFC projections facilitating the more general representations needed to compute option desirability. Indeed, lOFC neuronal activity can encode high-dimensional representations and the identity of predicted rewards (Klein-Flügge et al., 2013; McDannald et al., 2014; Rudebeck and Murray, 2014; Wilson et al., 2014; Howard et al., 2015; Rich and Wallis, 2016; Suzuki et al., 2017; Rudebeck and Rich, 2018), whereas mOFC represents more general information about expected events that is used to make decisions based on value estimations or comparisons (Pritchard et al., 2005; Padoa-Schioppa and Assad, 2006; Plassmann et al., 2010; Kennerley et al., 2011; Levy and Glimcher, 2011; Rudebeck and Murray, 2011; Burton et al., 2014; Lopatina et al., 2016, 2017; Suzuki et al., 2017). The precise information content conveyed by each component of the OFC-BLA circuit and how it is used in the receiving structure is a critical question for follow-up investigation. Because the staged scenarios used here allow isolation of psychological processes that typically co-occur and interact to influence decision making, such activity analysis will also be important for understanding how the functions identified here relate to more dynamic and complex decision-making scenarios. Another essential question is whether mOFC-BLA circuit function is similar in females, who do show similar performance in the tasks used here and require the BLA and mOFC for their performance (Ostlund and Balleine, 2008; Bradfield et al., 2018).

An inability to use reward cues to inform prospective considerations of which specific rewards can be expected and their current desirability can lead to maladaptive choices. Maladaptive choices could also arise if one is able to know what rewards are predicted, but not to consider their current value, as we showed here is neurobiologically possible. This is characteristic of the cognitive symptoms underlying many psychiatric diseases, including substance use disorder (Kalivas and Volkow, 2005; Verdejo-Garcia et al., 2018). Moreover, both the mOFC and BLA, as well as their connectivity, can be dysfunctional in substance use disorder and other psychiatric illnesses marked by disrupted appetitive decision making (Tanabe et al., 2009; Goldstein and Volkow, 2011; Hahn et al., 2011; Linke et al., 2012; Sladky et al., 2015; Shields and Gremel, 2020; Xie et al., 2021). Thus, these data may aid our understanding and treatment of these conditions.

Footnotes

This work was supported by National Institute of Drug Abuse Grants DA035443 (K.M.W.), F31DA044734 and T32DA024635 (N.T.L.), and T32DA024635 (A.C.L.), and the Staglin Center for Behavior and Brain Sciences. We thank Drs. Alicia Izquierdo, Melissa Sharpe, and Melissa Malvaez for feedback.

The authors declare no competing financial interests.

References

  1. Balleine BW, Dickinson A (1998) Goal-directed instrumental action: contingency and incentive learning and their cortical substrates. Neuropharmacology 37:407–419. 10.1016/s0028-3908(98)00033-1 [DOI] [PubMed] [Google Scholar]
  2. Balleine BW, Killcross AS, Dickinson A (2003) The effect of lesions of the basolateral amygdala on instrumental conditioning. J Neurosci 23:666–675. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Barreiros IV, Panayi MC, Walton ME (2021) Organization of afferents along the anterior-posterior and medial-lateral axes of the rat orbitofrontal cortex. Neuroscience 460:53–68. 10.1016/j.neuroscience.2021.02.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Belova MA, Paton JJ, Morrison SE, Salzman CD (2007) Expectation modulates neural responses to pleasant and aversive stimuli in primate amygdala. Neuron 55:970–984. 10.1016/j.neuron.2007.08.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Belova MA, Paton JJ, Salzman CD (2008) Moment-to-moment tracking of state value in the amygdala. J Neurosci 28:10023–10030. 10.1523/JNEUROSCI.1400-08.2008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Blundell P, Hall G, Killcross S (2001) Lesions of the basolateral amygdala disrupt selective aspects of reinforcer representation in rats. J Neurosci 21:9018–9026. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Bradfield LA, Dezfouli A, van Holstein M, Chieng B, Balleine BW (2015) Medial orbitofrontal cortex mediates outcome retrieval in partially observable task situations. Neuron 88:1268–1280. 10.1016/j.neuron.2015.10.044 [DOI] [PubMed] [Google Scholar]
  8. Bradfield LA, Hart G, Balleine BW (2018) Inferring action-dependent outcome representations depends on anterior but not posterior medial orbitofrontal cortex. Neurobiol Learn Mem 155:463–473. 10.1016/j.nlm.2018.09.008 [DOI] [PubMed] [Google Scholar]
  9. Bray S, Shimojo S, O'Doherty JP (2010) Human medial orbitofrontal cortex is recruited during experience of imagined and real rewards. J Neurophysiol 103:2506–2512. 10.1152/jn.01030.2009 [DOI] [PubMed] [Google Scholar]
  10. Burton AC, Kashtelyan V, Bryden DW, Roesch MR (2014) Increased firing to cues that predict low-value reward in the medial orbitofrontal cortex. Cereb Cortex 24:3310–3321. 10.1093/cercor/bht189 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Collins AL, Aitken TJ, Huang IW, Shieh C, Greenfield VY, Monbouquette HG, Ostlund SB, Wassum KM (2019) Nucleus accumbens cholinergic interneurons oppose cue-motivated behavior. Biol Psychiatry 86:388–396. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Colwill RM, Motzkin DK (1994) Encoding of the unconditioned stimulus in pavlovian conditioning. Animal Learning and Behavior 22:384–394. 10.3758/BF03209158 [DOI] [Google Scholar]
  13. Corbit LH, Balleine BW (2005) Double dissociation of basolateral and central amygdala lesions on the general and outcome-specific forms of pavlovian-instrumental transfer. J Neurosci 25:962–970. 10.1523/JNEUROSCI.4507-04.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Corbit LH, Balleine BW (2016) Learning and motivational processes contributing to pavlovian-instrumental transfer and their neural bases: dopamine and beyond. Curr Top Behav Neurosci 27:259–289. [DOI] [PubMed] [Google Scholar]
  15. Corbit LH, Leung BK, Balleine BW (2013) The role of the amygdala-striatal pathway in the acquisition and performance of goal-directed instrumental actions. J Neurosci 33:17682–17690. 10.1523/JNEUROSCI.3271-13.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Dalton GL, Wang NY, Phillips AG, Floresco SB (2016) Multifaceted contributions by different regions of the orbitofrontal and medial prefrontal cortex to probabilistic reversal learning. J Neurosci 36:1996–2006. 10.1523/JNEUROSCI.3366-15.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Delamater AR (2012) On the nature of CS and US representations in pavlovian learning. Learn Behav 40:1–23. 10.3758/s13420-011-0036-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Elliott Wimmer G, Büchel C (2019) Learning of distant state predictions by the orbitofrontal cortex in humans. Nat Commun 10:2554. 10.1038/s41467-019-10597-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Everitt BJ, Cardinal RN, Hall J, Parkinson JA, Robbins TW (2000) Differential involvement of amygdala subsystems in appetitive conditioning and drug addiction. In: The amygdala: A functional analysis (Aggleton JP, ed), pp 353–390. Oxford, England: Oxford UP. [Google Scholar]
  20. Fanselow MS, Wassum KM (2015) The origins and organization of vertebrate pavlovian conditioning. Cold Spring Harb Perspect Biol 8:a021717. 10.1101/cshperspect.a021717 [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Gallagher M, McMahan RW, Schoenbaum G (1999) Orbitofrontal cortex and representation of incentive value in associative learning. J Neurosci 19:6610–6614. 10.1523/JNEUROSCI.19-15-06610.1999 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Gilroy KE, Everett EM, Delamater AR (2014) Response-outcome versus outcome-response associations in pavlovian-to-instrumental transfer: effects of instrumental training context. Int J Comp Psychol 27:585–597. [PMC free article] [PubMed] [Google Scholar]
  23. Goldstein RZ, Volkow ND (2011) Dysfunction of the prefrontal cortex in addiction: neuroimaging findings and clinical implications. Nat Rev Neurosci 12:652–669. 10.1038/nrn3119 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Gourley SL, Zimmermann KS, Allen AG, Taylor JR (2016) The medial orbitofrontal cortex regulates sensitivity to outcome value. J Neurosci 36:4600–4613. 10.1523/JNEUROSCI.4253-15.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Groman SM, Keistler C, Keip AJ, Hammarlund E, DiLeone RJ, Pittenger C, Lee D, Taylor JR (2019) Orbitofrontal circuits control multiple reinforcement-learning processes. Neuron 103:734–746.e3. 10.1016/j.neuron.2019.05.042 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Hahn A, Stein P, Windischberger C, Weissenbacher A, Spindelegger C, Moser E, Kasper S, Lanzenberger R (2011) Reduced resting-state functional connectivity between amygdala and orbitofrontal cortex in social anxiety disorder. Neuroimage 56:881–889. 10.1016/j.neuroimage.2011.02.064 [DOI] [PubMed] [Google Scholar]
  27. Hatfield T, Han JS, Conley M, Gallagher M, Holland P (1996) Neurotoxic lesions of basolateral, but not central, amygdala interfere with pavlovian second-order conditioning and reinforcer devaluation effects. J Neurosci 16:5256–5265. 10.1523/JNEUROSCI.16-16-05256.1996 [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Heilbronner SR, Rodriguez-Romaguera J, Quirk GJ, Groenewegen HJ, Haber SN (2016) Circuit-based corticostriatal homologies between rat and primate. Biol Psychiatry 80:509–521. 10.1016/j.biopsych.2016.05.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Holland PC (2004) Relations between pavlovian-instrumental transfer and reinforcer devaluation. J Exp Psychol Anim Behav Process 30:104–117. 10.1037/0097-7403.30.2.104 [DOI] [PubMed] [Google Scholar]
  30. Hoover WB, Vertes RP (2011) Projections of the medial orbital and ventral orbital cortex in the rat. J Comp Neurol 519:3766–3801. 10.1002/cne.22733 [DOI] [PubMed] [Google Scholar]
  31. Howard JD, Gottfried JA, Tobler PN, Kahnt T (2015) Identity-specific coding of future rewards in the human orbitofrontal cortex. Proc Natl Acad Sci U S A 112:5195–5200. 10.1073/pnas.1503550112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Izquierdo A (2017) Functional Heterogeneity within Rat Orbitofrontal Cortex in Reward Learning and Decision Making. J Neurosci 37:10529–10540. 10.1523/JNEUROSCI.1678-17.2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Izquierdo A, Suda RK, Murray EA (2004) Bilateral orbital prefrontal cortex lesions in rhesus monkeys disrupt choices guided by both reward value and reward contingency. J Neurosci 24:7540–7548. 10.1523/JNEUROSCI.1921-04.2004 [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Johnson AW, Gallagher M, Holland PC (2009) The basolateral amygdala is critical to the expression of pavlovian and instrumental outcome-specific reinforcer devaluation effects. J Neurosci 29:696–704. 10.1523/JNEUROSCI.3758-08.2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Kalivas PW, Volkow ND (2005) The neural basis of addiction: a pathology of motivation and choice. Am J Psychiatry 162:1403–1413. 10.1176/appi.ajp.162.8.1403 [DOI] [PubMed] [Google Scholar]
  36. Kennerley SW, Behrens TE, Wallis JD (2011) Double dissociation of value computations in orbitofrontal and anterior cingulate neurons. Nat Neurosci 14:1581–1589. 10.1038/nn.2961 [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Kita H, Kitai ST (1990) Amygdaloid projections to the frontal cortex and the striatum in the rat. J Comp Neurol 298:40–49. 10.1002/cne.902980104 [DOI] [PubMed] [Google Scholar]
  38. Klein-Flügge MC, Barron HC, Brodersen KH, Dolan RJ, Behrens TE (2013) Segregated encoding of reward-identity and stimulus-reward associations in human orbitofrontal cortex. J Neurosci 33:3202–3211. 10.1523/JNEUROSCI.2532-12.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Kruse H, Overmier J, Konz W, Rokke E (1983) Pavlovian conditioned stimulus effects upon instrumental choice behavior are reinforcer specific. Learn Motiv 14:165–181. 10.1016/0023-9690(83)90004-8 [DOI] [Google Scholar]
  40. Levy DJ, Glimcher PW (2011) Comparing apples and oranges: using reward-specific and reward-general subjective value representation in the brain. J Neurosci 31:14693–14707. 10.1523/JNEUROSCI.2218-11.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Lichtenberg NT, Wassum KM (2017) Amygdala mu-opioid receptors mediate the motivating influence of cue-triggered reward expectations. Eur J Neurosci 45:381–387. 10.1111/ejn.13477 [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Lichtenberg NT, Pennington ZT, Holley SM, Greenfield VY, Cepeda C, Levine MS, Wassum KM (2017) Basolateral amygdala to orbitofrontal cortex projections enable cue-triggered reward expectations. J Neurosci 37:8374–8384. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Linke J, King AV, Rietschel M, Strohmaier J, Hennerici M, Gass A, Meyer-Lindenberg A, Wessa M (2012) Increased medial orbitofrontal and amygdala activation: evidence for a systems-level endophenotype of bipolar I disorder. Am J Psychiatry 169:316–325. 10.1176/appi.ajp.2011.11050711 [DOI] [PubMed] [Google Scholar]
  44. Lopatina N, McDannald MA, Styer CV, Peterson JF, Sadacca BF, Cheer JF, Schoenbaum G (2016) Medial orbitofrontal neurons preferentially signal cues predicting changes in reward during unblocking. J Neurosci 36:8416–8424. 10.1523/JNEUROSCI.1101-16.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Lopatina N, Sadacca BF, McDannald MA, Styer CV, Peterson JF, Cheer JF, Schoenbaum G (2017) Ensembles in medial and lateral orbitofrontal cortex construct cognitive maps emphasizing different features of the behavioral landscape. Behav Neurosci 131:201–212. 10.1037/bne0000195 [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Machado CJ, Bachevalier J (2007) The effects of selective amygdala, orbital frontal cortex or hippocampal formation lesions on reward assessment in nonhuman primates. Eur J Neurosci 25:2885–2904. 10.1111/j.1460-9568.2007.05525.x [DOI] [PubMed] [Google Scholar]
  47. Malvaez M (2020) Neural substrates of habit. J Neurosci Res 98:986–997. 10.1002/jnr.24552 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Malvaez M, Wassum K (2018) Regulation of habit formation in the dorsal striatum. Curr Opin Behav Sci 20:67–74. 10.1016/j.cobeha.2017.11.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Malvaez M, Greenfield VY, Wang AS, Yorita AM, Feng L, Linker KE, Monbouquette HG, Wassum KM (2015) Basolateral amygdala rapid glutamate release encodes an outcome-specific representation vital for reward-predictive cues to selectively invigorate reward-seeking actions. Sci Rep 5:12511. 10.1038/srep12511 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Malvaez M, Shieh C, Murphy M, Greenfield V, Wassum K (2018a) Distinct cortical-amygdala projections drive reward value encoding and retrieval. BioRxiv 299958. 10.1038/s41593-019-0374-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Malvaez M, Greenfield VY, Matheos DP, Angelillis NA, Murphy MD, Kennedy PJ, Wood MA, Wassum KM (2018b) Habits are negatively regulated by histone deacetylase 3 in the dorsal striatum. Biol Psychiatry 84:383–392. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Malvaez M, Shieh C, Murphy MD, Greenfield VY, Wassum KM (2019) Distinct cortical-amygdala projections drive reward value encoding and retrieval. Nat Neurosci 22:762–769. 10.1038/s41593-019-0374-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. McDannald MA, Esber GR, Wegener MA, Wied HM, Liu TL, Stalnaker TA, Jones JL, Trageser J, Schoenbaum G (2014) Orbitofrontal neurons acquire responses to “valueless” pavlovian cues during unblocking. Elife 3:e02653. 10.7554/eLife.02653 [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Morecraft RJ, Geula C, Mesulam MM (1992) Cytoarchitecture and neural afferents of orbitofrontal cortex in the brain of the monkey. J Comp Neurol 323:341–358. 10.1002/cne.903230304 [DOI] [PubMed] [Google Scholar]
  55. Morse AK, Leung BK, Heath E, Bertran-Gonzalez J, Pepin E, Chieng BC, Balleine BW, Laurent V (2020) Basolateral amygdala drives a GPCR-mediated striatal memory necessary for predictive learning to influence choice. Neuron 106:855–869.e8. 10.1016/j.neuron.2020.03.007 [DOI] [PubMed] [Google Scholar]
  56. Münster A, Hauber W (2018) Medial orbitofrontal cortex mediates effort-related responding in rats. Cereb Cortex 28:4379–4389. 10.1093/cercor/bhx293 [DOI] [PubMed] [Google Scholar]
  57. Namboodiri VMK, Otis JM, van Heeswijk K, Voets ES, Alghorazi RA, Rodriguez-Romaguera J, Mihalas S, Stuber GD (2019) Single-cell activity tracking reveals that orbitofrontal neurons acquire and maintain a long-term memory to guide behavioral adaptation. Nat Neurosci 22:1110–1121. 10.1038/s41593-019-0408-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Noonan MP, Walton ME, Behrens TE, Sallet J, Buckley MJ, Rushworth MF (2010) Separate value comparison and learning mechanisms in macaque medial and lateral orbitofrontal cortex. Proc Natl Acad Sci U S A 107:20547–20552. 10.1073/pnas.1012246107 [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Noonan MP, Chau B, Rushworth MF, Fellows LK (2017) Contrasting effects of medial and lateral orbitofrontal cortex lesions on credit assignment and decision making in humans. J Neurosci 37:7023–7035. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Ostlund SB, Balleine BW (2007) Orbitofrontal cortex mediates outcome encoding in pavlovian but not instrumental conditioning. J Neurosci 27:4819–4825. 10.1523/JNEUROSCI.5443-06.2007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Ostlund SB, Balleine BW (2008) Differential involvement of the basolateral amygdala and mediodorsal thalamus in instrumental action selection. J Neurosci 28:4398–4405. 10.1523/JNEUROSCI.5472-07.2008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Padoa-Schioppa C, Assad JA (2006) Neurons in the orbitofrontal cortex encode economic value. Nature 441:223–226. 10.1038/nature04676 [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Panayi MC, Killcross S (2018) Functional heterogeneity within the rodent lateral orbitofrontal cortex dissociates outcome devaluation and reversal learning deficits. Elife 7:e37357. 10.7554/eLife.37357 [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Parkes SL, Balleine BW (2013) Incentive memory: evidence the basolateral amygdala encodes and the insular cortex retrieves outcome values to guide choice between goal-directed actions. J Neurosci 33:8753–8763. 10.1523/JNEUROSCI.5071-12.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Parkinson JA, Robbins TW, Everitt BJ (2000) Dissociable roles of the central and basolateral amygdala in appetitive emotional learning. Eur J Neurosci 12:405–413. 10.1046/j.1460-9568.2000.00960.x [DOI] [PubMed] [Google Scholar]
  66. Paton JJ, Belova MA, Morrison SE, Salzman CD (2006) The primate amygdala represents the positive and negative value of visual stimuli during learning. Nature 439:865–870. 10.1038/nature04490 [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Pickens CL, Saddoris MP, Setlow B, Gallagher M, Holland PC, Schoenbaum G (2003) Different roles for orbitofrontal cortex and basolateral amygdala in a reinforcer devaluation task. J Neurosci 23:11078–11084. 10.1523/JNEUROSCI.23-35-11078.2003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Pickens CL, Saddoris MP, Gallagher M, Holland PC (2005) Orbitofrontal lesions impair use of cue-outcome associations in a devaluation task. Behav Neurosci 119:317–322. 10.1037/0735-7044.119.1.317 [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Plassmann H, O'Doherty JP, Rangel A (2010) Appetitive and aversive goal values are encoded in the medial orbitofrontal cortex at the time of decision making. J Neurosci 30:10799–10808. 10.1523/JNEUROSCI.0788-10.2010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Pritchard TC, Edwards EM, Smith CA, Hilgert KG, Gavlick AM, Maryniak TD, Schwartz GJ, Scott TR (2005) Gustatory neural responses in the medial orbitofrontal cortex of the old world monkey. J Neurosci 25:6047–6056. 10.1523/JNEUROSCI.0430-05.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Reppucci CJ, Petrovich GD (2016) Organization of connections between the amygdala, medial prefrontal cortex, and lateral hypothalamus: a single and double retrograde tracing study in rats. Brain Struct Funct 221:2937–2962. 10.1007/s00429-015-1081-0 [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Rhodes SE, Murray EA (2013) Differential effects of amygdala, orbital prefrontal cortex, and prelimbic cortex lesions on goal-directed behavior in rhesus macaques. J Neurosci 33:3380–3389. 10.1523/JNEUROSCI.4374-12.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Rich EL, Wallis JD (2016) Decoding subjective decisions from orbitofrontal cortex. Nat Neurosci 19:973–980. 10.1038/nn.4320 [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Rudebeck PH, Murray EA (2011) Balkanizing the primate orbitofrontal cortex: distinct subregions for comparing and contrasting values. Ann N Y Acad Sci 1239:1–13. 10.1111/j.1749-6632.2011.06267.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Rudebeck PH, Murray EA (2014) The orbitofrontal oracle: cortical mechanisms for the prediction and evaluation of specific behavioral outcomes. Neuron 84:1143–1156. 10.1016/j.neuron.2014.10.049 [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Rudebeck P, Rich E (2018) Orbitofrontal cortex. Curr Biol 28:R1083–R1088. 10.1016/j.cub.2018.07.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Saddoris MP, Gallagher M, Schoenbaum G (2005) Rapid associative encoding in basolateral amygdala depends on connections with orbitofrontal cortex. Neuron 46:321–331. 10.1016/j.neuron.2005.02.018 [DOI] [PubMed] [Google Scholar]
  78. Schoenbaum G, Chiba AA, Gallagher M (1998) Orbitofrontal cortex and basolateral amygdala encode expected outcomes during learning. Nat Neurosci 1:155–159. 10.1038/407 [DOI] [PubMed] [Google Scholar]
  79. Shields CN, Gremel CM (2020) Review of orbitofrontal cortex in alcohol dependence: a disrupted cognitive map? Alcohol Clin Exp Res 44:1952–1964. 10.1111/acer.14441 [DOI] [PMC free article] [PubMed] [Google Scholar]
  80. Sias A, Morse A, Wang S, Greenfield V, Goodpaster C, Wrenn T, Wikenheiser A, Holley S, Cepeda C, Levine M, Wassum K (2021) A bidirectional corticoamygdala circuit for the encoding and retrieval of detailed reward memories. eLife 10:e68617. 10.7554/eLife.68617 [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Sladky R, Höflich A, Küblböck M, Kraus C, Baldinger P, Moser E, Lanzenberger R, Windischberger C (2015) Disrupted effective connectivity between the amygdala and orbitofrontal cortex in social anxiety disorder during emotion discrimination revealed by dynamic causal modeling for FMRI. Cereb Cortex 25:895–903. 10.1093/cercor/bht279 [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Stopper CM, Green EB, Floresco SB (2014) Selective involvement by the medial orbitofrontal cortex in biasing risky, but not impulsive, choice. Cereb Cortex 24:154–162. 10.1093/cercor/bhs297 [DOI] [PubMed] [Google Scholar]
  83. Suzuki S, Cross L, O'Doherty JP (2017) Elucidating the underlying components of food valuation in the human orbitofrontal cortex. Nat Neurosci 20:1780–1786. 10.1038/s41593-017-0008-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Tanabe J, Tregellas JR, Dalwani M, Thompson L, Owens E, Crowley T, Banich M (2009) Medial orbitofrontal cortex gray matter is reduced in abstinent substance-dependent individuals. Biol Psychiatry 65:160–164. 10.1016/j.biopsych.2008.07.030 [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. van Holstein M, MacLeod PE, Floresco SB (2020) Basolateral amygdala—nucleus accumbens circuitry regulates optimal cue-guided risk/reward decision making. Prog Neuropsychopharmacol Biol Psychiatry 98:109830. 10.1016/j.pnpbp.2019.109830 [DOI] [PubMed] [Google Scholar]
  86. Verdejo-Garcia A, Chong TT, Stout JC, Yücel M, London ED (2018) Stages of dysfunctional decision-making in addiction. Pharmacol Biochem Behav 164:99–105. 10.1016/j.pbb.2017.02.003 [DOI] [PubMed] [Google Scholar]
  87. Wallis JD (2011) Cross-species studies of orbitofrontal cortex and value-based decision-making. Nat Neurosci 15:13–19. 10.1038/nn.2956 [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Wassum KM, Ostlund SB, Maidment NT, Balleine BW (2009) Distinct opioid circuits determine the palatability and the desirability of rewarding events. Proc Natl Acad Sci U S A 106:12512–12517. 10.1073/pnas.0905874106 [DOI] [PMC free article] [PubMed] [Google Scholar]
  89. Wassum KM, Cely IC, Balleine BW, Maidment NT (2011) µ-Opioid receptor activation in the basolateral amygdala mediates the learning of increases but not decreases in the incentive value of a food reward. J Neurosci 31:1591–1599. 10.1523/JNEUROSCI.3102-10.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Wassum KM, Greenfield VY, Linker KE, Maidment NT, Ostlund SB (2016) Inflated reward value in early opiate withdrawal. Addict Biol 21:221–233. 10.1111/adb.12172 [DOI] [PMC free article] [PubMed] [Google Scholar]
  91. Wellman LL, Gale K, Malkova L (2005) GABAA-mediated inhibition of basolateral amygdala blocks reward devaluation in macaques. J Neurosci 25:4577–4586. 10.1523/JNEUROSCI.2257-04.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. West EA, DesJardin JT, Gale K, Malkova L (2011) Transient inactivation of orbitofrontal cortex blocks reinforcer devaluation in macaques. J Neurosci 31:15128–15135. 10.1523/JNEUROSCI.3295-11.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  93. Wilson RC, Takahashi YK, Schoenbaum G, Niv Y (2014) Orbitofrontal cortex as a cognitive map of task space. Neuron 81:267–279. 10.1016/j.neuron.2013.11.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. Woon EP, Sequeira MK, Barbee BR, Gourley SL (2020) Involvement of the rodent prelimbic and medial orbitofrontal cortices in goal-directed action: a brief review. J Neurosci Res 98:1020–1030. 10.1002/jnr.24567 [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Xie C, Jia T, Rolls ET, Robbins TW, Sahakian BJ, Zhang J, Liu Z, Cheng W, Luo Q, Zac Lo C-Y, Wang H, Banaschewski T, Barker GJ, Bokde ALW, Büchel C, Quinlan EB, Desrivières S, Flor H, Grigis A, Garavan H, et al. (2021) Reward versus nonreward sensitivity of the medial versus lateral orbitofrontal cortex relates to the severity of depressive symptoms. Biol Psychiatry Cogn Neurosci Neuroimaging 6:259–269. 10.1016/j.bpsc.2020.08.017 [DOI] [PubMed] [Google Scholar]
  96. Yamada H, Louie K, Tymula A, Glimcher PW (2018) Free choice shapes normalized value signals in medial orbitofrontal cortex. Nat Commun 9:162. 10.1038/s41467-017-02614-w [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

All data supporting the findings of this study are available from the corresponding author on request.


Articles from The Journal of Neuroscience are provided here courtesy of Society for Neuroscience

RESOURCES