Abstract
Adapting behavior to a dynamic environment requires both steadiness when the environment is stable and behavioral flexibility in response to changes. Much evidence suggests that cognitive flexibility, which can be operationalized in reversal learning tasks, is mediated by cortico-striatal circuitries, with the orbitofrontal cortex (OFC) playing a prominent role. The OFC is a functionally heterogeneous region, and we have previously reported differential roles of lateral (lOFC) and medial (mOFC) regions in a touchscreen serial visual reversal learning task for rats using pharmacological inactivation. Here, we investigated the effects of pharmacological overactivation of these regions using a glutamate transporter 1 (GLT-1) inhibitor, dihydrokainate (DHK), which increases extracellular glutamate by blocking its reuptake. We also tested the impact of antagonism of the serotonin 2A receptor (5-HT2AR), which modulates glutamate action, in the mOFC and lOFC on the same task. Overactivation induced by DHK produced dissociable effects in the mOFC and lOFC, with more prominent effects in the mOFC, specifically improving performance in the early, perseveration phase. Intra-lOFC DHK increased the number of omitted responses without affecting errors. In contrast, blocking the 5-HT2AR in the lOFC impaired reversal learning overall, while mOFC 5-HT2AR blockade had no effect. These results further support dissociable roles of the rodent mOFC and lOFC in deterministic visual reversal learning and indicate that modulating glutamate transmission through blocking the GLT-1 and the 5-HT2AR have different roles in these two structures.
Keywords: serotonin, glutamate, orbitofrontal cortex, prefrontal cortex, reversal learning
Public Significance Statement
This study further supports dissociable roles of specific orbitofrontal subregions, as well as glutamatergic and serotonergic transmission in these subregions, in cognitive flexibility. This knowledge will add to the understanding of specific neural mechanisms underlying inflexible behaviour across psychiatric disorders.
Cognitive flexibility, the ability to adapt behavior in response to a changing environment, is disrupted in several psychiatric and developmental disorders including obsessive-compulsive disorder (OCD), schizophrenia, and autism (Chamberlain et al., 2008; D’Cruz et al., 2013; Leeson et al., 2009; Waltz & Gold, 2007). In OCD patients, inflexible behavior is typically treated with selective serotonin reuptake inhibitors, though typically not with full remission and with a large subgroup of nonresponders (Robbins, Vaghi, & Banca, 2019). Thus, more recently, drugs modulating cortical glutamate neurotransmission have gained attention, appearing beneficial in improving cognitive flexibility in OCD patients (Marinova, Chuang, & Fineberg, 2017). However, the underlying neural mechanisms of glutamatergic and serotonergic modulation of flexible behavior are not yet understood and need to be further investigated.
Flexible responding can be assessed in reversal-learning paradigms in humans (Fellows & Farah, 2003; Murphy, Smith, Cowen, Robbins, & Sahakian, 2002), monkeys (Butter, 1964; Dias, Robbins, & Roberts, 1996; Groman et al., 2013), and rodents (Chudasama & Robbins, 2003; McAlonan & Brown, 2003). In reversal learning tasks, initially learned reward contingencies change, and the subject needs to update behavior accordingly. This requires the subject to suppress prepotent responses, explore alternative options, learn new contingencies, and choose the previously unrewarded (but now rewarded) option. If the subject fails to adapt behavior, increased perseverative responding can occur (Iversen & Mishkin, 1970)—that is, “exploitation” of an outdated rule irrespective of diminished reward.
Reversal learning is thought to be mediated by cortico-striatal networks, with the orbitofrontal cortex (OFC) playing a prominent role, as supported by studies in rodents (Izquierdo, Brigman, Radke, Rudebeck, & Holmes, 2017), monkeys (Chau et al., 2015; Clarke, Robbins, & Roberts, 2008; Dias et al., 1996), and humans (Fellows & Farah, 2003; Ghahremani, Monterosso, Jentsch, Bilder, & Poldrack, 2010; Hampshire & Owen, 2006; Hornak et al., 2004; O’Doherty, Kringelbach, Rolls, Hornak, & Andrews, 2001; Rahman, Sahakian, Hodges, Rogers, & Robbins, 1999), including in OCD patients (Remijnse et al., 2006). In rodents, mostly the lateral OFC (lOFC) has been implicated in reversal learning (Alsiö et al., 2015; Bohn, Giertler, & Hauber, 2003; Burke, Takahashi, Correll, Brown, & Schoenbaum, 2009; Graybeal et al., 2011; Kim & Ragozzino, 2005; McAlonan & Brown, 2003; Schoenbaum, Chiba, & Gallagher, 1999, 2000; Schoenbaum & Setlow, 2003; Takahashi et al., 2009), but a functional medial-lateral dissociation in reversal learning is emerging (Dalton, Wang, Phillips, & Floresco, 2016; Hervig et al., 2020; Mar, Walker, Theobald, Eagle, & Robbins, 2011). In Hervig et al., we found that pharmacological inactivation of the lOFC impaired the early phase of visual reversal learning performance, while mOFC inactivation improved it, possibly reflecting a functional lOFC-mOFC balance in controlling exploration-exploitation strategies. Dissociable functions of lOFC versus mOFC have also been found in humans (Cheng et al., 2016; Dias et al., 1996; Elliott, Dolan, & Frith, 2000; Hampshire & Owen, 2006; Kahnt, Chang, Park, Heinzle, & Haynes, 2012; Kringelbach & Rolls, 2004; Milad & Rauch, 2012; Noonan, Chau, Rushworth, & Fellows, 2017; O’Doherty et al., 2001; Zald et al., 2014), including OCD patients (Apergis-Schoute et al., 2017; Gillan et al., 2015), and other primates (Noonan et al., 2010; Walton, Behrens, Noonan, & Rushworth, 2011).
Such dissociable roles of certain OFC subregions may be modulated by glutamate, the principal excitatory transmitter that regulates neuronal plasticity, learning, and memory (Gasbarri & Pompili, 2014). Prefrontal glutamate levels predict reversal learning performance in the marmoset monkey (Lacreuse, Moore, LaClair, Payne, & King, 2018). In rodents, aberrant function of the glutamatergic N-methyl-D-aspartate receptor (NMDAR) has been associated with impaired reversal learning (Brigman et al., 2008; Dalton, Ma, Phillips, & Floresco, 2011; Kumar, Olley, Steckler, & Talpos, 2015; Lobellova et al., 2013; Marquardt, Saha, Mishina, Young, & Brigman, 2014) and set-shifting (Svoboda, Stankova, Entlerova, & Stuchlik, 2015). Reversal learning deficits are also produced by aberrant NMDAR function in the lOFC (Brigman et al., 2013; Marquardt et al., 2019; Thompson, Josey, Holmes, & Brigman, 2015). The astrocytic glutamate transporter 1 (GLT-1) is responsible for more than 80% of total glutamate removal at the synapse, and it plays a key role in regulating cortical glutamatergic homeostasis (Danbolt, Storm-Mathisen, & Kanner, 1992; Gibbs, Hutchinson, & Hertz, 2008; Nieoullon, 2009; Petr et al., 2015). Reduced GLT-1 function increases extracellular glutamate levels and synaptic glutamatergic transmission (Roberts-Wolfe & Kalivas, 2015). Little work has been done to investigate the effects of increased glutamate availability following reduced GLT-1 function in reversal learning, but excessive glutamate impairs visual discrimination learning in GLT-1-deficient mice (Karlsson et al., 2009), as well as the initial stages of visual (Granseth, Andersson, & Lindström, 2015) and spatial (Balschun et al., 2010) reversal learning in vesicular glutamate transporter (VGluT1)-deficient mice. Such impairments are supported by excessive glutamate induced by global dihydrokainate (DHK) treatment impairing spatial memory (Bechtholt-Gompf et al., 2010) and memory retrieval (Tian, Yu, Cen, & Xiao, 2019).
Serotonin stimulates glutamate release in the prefrontal cortex, evoking excitatory postsynaptic potentials—a modulation mediated by serotonin 2A receptors (5-HT2ARs; Aghajanian & Marek, 1999) that are robustly expressed in pyramidal neurons in the prefrontal cortex, including the OFC (Amargós-Bosch et al., 2004; Santana & Artigas, 2017; Santana, Bortolozzi, Serrats, Mengod, & Artigas, 2004), and predominantly in glutamatergic cells (Amargós-Bosch et al., 2004; Santana et al., 2004). Growing evidence suggests that serotonin promotes reversal learning behavior in rodents (Brigman et al., 2010; Brown, Amodeo, Sweeney, & Ragozzino, 2012; Zhukovsky et al., 2017), while reduced serotonin is associated with impaired reversal learning (Bari et al., 2010; Izquierdo et al., 2012; Lapiz-Bluhm, Soto-Piña, Hensler, & Morilak, 2009; Masaki et al., 2006; Matias, Lottem, Dugué, & Mainen, 2017), an effect also found following serotonin depletion in the OFC in monkeys (Clarke, Dalley, Crofts, Robbins, & Roberts, 2004, Clarke et al., 2005, Clarke, Walker, Dalley, Robbins, & Roberts, 2007; Rygula et al., 2015). Thus, in general, a vast amount of work suggests that reduced cortical (OFC) serotonin signaling increases perseveration and impairs reversal learning, whereas increasing serotonin generally facilitates reversal learning (Clarke et al., 2004), but serotonin exerts its effect through a variety of different receptors exerting both inhibitory and excitatory transmission depending on receptor subtype and localization.
In particular, the excitatory 5-HT2ARs primarily localized on pyramidal neurons (Amargós-Bosch et al., 2004; Santana et al., 2004) and inhibitory 5-HT2CRs primarily localized on inhibitory parvalbumin neurons (Liu, Bubar, Lanfranco, Hillman, & Cunningham, 2007) seem to be involved in reversal learning as systemic 5-HT2AR blockade impairs reversal learning performance, while systemic blockade of 5-HT2CRs improves performance (Boulougouris, Glennon, & Robbins, 2008). While local 5-HT2CR antagonism in the lOFC reproduces this improvement, probably through inhibition of parvalbumin neurons leading to increased excitatory lOFC activity, intra-lOFC 5-HT2AR blockade does not affect spatial reversal learning (Boulougouris & Robbins, 2010). However, 5-HT2AR blockade with M100907 in the lOFC does impair odor-based reversal learning (Furr, Lapiz-Bluhm, & Morilak, 2012), and high levels of perseveration in rats are associated with decreased levels of 5-HT2AR in the lOFC and mOFC (Barlow et al., 2015), consistent with decreased levels of OFC 5-HT2AR predicting clinical severity in OCD patients (Perani et al., 2008). In the visual serial reversal learning paradigm used in the present study, intra-lOFC blockade of 5-HT2CR improves performance in the early phase of reversal learning (Alsiö et al., 2015), but the role of 5-HT2ARs in the OFC still remains to be investigated on this task.
In the present study, we compared the effects of modulating glutamatergic transmission by DHK treatment and 5-HT2AR blockade in the lOFC and mOFC on a deterministic visual serial reversal learning in rats that we have previously shown to be dissociably affected by lOFC and mOFC inactivation (Hervig et al., 2020). We hypothesized that DHK-induced activation of the lOFC and mOFC would produce effects opposite to those of inactivating the lOFC and mOFC (Hervig et al., 2020). We further hypothesized that blocking 5-HT2ARs would produce dissociable effects in the mOFC and lOFC, with hypothetical early reversal learning impairments in the lOFC as blocking the inhibitory 5-HT2CR in the lOFC produces early reversal learning improvements (Alsiö et al., 2015).
Method
Animals
Subjects were male Lister hooded rats (N = 42; Charles River, United Kingdom; Supplementary Table S1) housed in groups of three or four during behavioral pretraining testing and single-housed following guide cannulas implantation to protect the implant. The rats were housed under a reverse 12-hr light/dark cycle with lights off at 7:00 a.m. All training and testing were performed during the dark phase. To ensure sufficient motivation for task performance, the animals were food restricted with ad libitum access to water and fed once daily at random times after testing. Their body weights were maintained at 85% of their free-feeding weight. All experiments were subject to regulation by the United Kingdom Home Office (PPL 70/7548) in accordance with the Animals (Scientific Procedures) Act 1986.
Drugs
M100907, or R-(+)-α-(2,3-dimethoxyphenyl)-1-[2-(4-fluorophenylethyl)]-4-piperidinemethanol (Sigma Aldrich, #M3324), a highly selective 5-HT2A receptor antagonist (Kehne et al., 1996), was dissolved in 0.01 M phosphate-buffered saline (PBS) and 0.1 M hydrochloride and pH adjusted with NaOH to pH 6. M100907 was administered at 0 (vehicle), 1 μg/side, or 3 μg/side. DHK (Tocris Bioscience, United Kingdom, #0111), a potent blocker of GLT-1 (Arriza et al., 1994; Muñoz et al., 1987), was dissolved in saline and administered at 0 (vehicle/saline) and 1 μg/side.
Drugs were aliquoted in the quantities required for each test day and frozen at −80°C. For the intracranial microinfusions, the drugs were administered at 0.5 μl/side 10 min prior to testing.
Behavioral Training (Touchscreen Serial Visual Reversal Learning)
Behavioral training was performed as previously described in Hervig et al., 2020. For the experimental timeline and design, see Figure 1.
Apparatus
We trained and tested the animals on a touchscreen serial visual reversal learning task using 16 operant chambers (Med Associates, Georgia, VT) placed in sound- and light-attenuating wooden cabinets equipped with a fan for ventilation and masking of external noise. The chambers measured 30 cm × 39 cm × 29 cm and consisted of a clear Perspex ceiling, front door, and back panel and metal paneling on the sides of the chamber. A metal grid with a removable metal tray below made up the floor of the chamber. A central food magazine coupled to an external pellet dispenser was located on one side of the chamber. It was equipped with light and infrared beam sensors to detect magazine entry, allowing delivery of one 45-mg sucrose pellet (TestDiet 5TUL; Sandown Scientific, Middlesex, United Kingdom) upon correct responses. A house light (∼3 W) was located near the ceiling directly above the magazine. A touch-sensitive screen (29 × 32 cm) presenting visual stimuli was located on the opposite side to that of the magazine. Task schedules were developed and implemented by A. C. Mar (Mar et al., 2013) using Visual Basic 2010 and have been published previously (Alsiö et al., 2015; Hervig et al., 2020).
Pretraining: Touchscreen Serial Visual Reversal Learning
A five-stage pretraining phase began after the rats were food restricted, involving Pavlovian and instrumental conditioning prior to visual discrimination and serial reversal learning, and lasted until a stable baseline was reached. In Stages 1 to 3, rats were trained to respond to a single white box at the bottom center of the touchscreen for sucrose reward pellets during 60-min daily sessions until criterion of receiving the maximum 100 pellets in one session. The box decreased in size across the three stages until a final size of 3 × 4 cm (“start box”) in Stage 3. In pretraining Stages 4 and 5, two additional stimuli were introduced (horizontal and vertical bars). The first was at the bottom of the screen to ease touch (Stage 4); then, the stimulus was raised 5 cm to the final location on the screen to avoid accidental touches (Stage 5). At this point, touching the white start box was no longer reinforced but instead led to the presentation of one of these novel stimuli to the left or right (pseudorandomized location). Responding to the presented stimulus was reinforced with a sugar pellet, whereas responding to the blank side was signaled as incorrect by the illumination of the house light for a 5-s time-out period. Eighty percent or more correct touches on one stimulus in a session led to training sessions with the other stimulus. When criterion of > 80% correct touches was reached also on this stimulus, the rat moved on from Stage 4 to Stage 5, and after ≥ 80% correct touches were reached on both stimuli on Stage 5, visual discrimination training ensued.
Visual Discrimination Training
In visual discrimination, the rats were presented with both stimuli simultaneously, of which one was reinforced. For session initiation, the rats would collect a free reward delivery, which led to presentation of the start box. The rat initiated a trial by responding to the start box, which initiated a simultaneous presentation of the stimuli pair. Responding to the correct stimulus (conditional stimulus; [CS]+) was reinforced with a sugar pellet, while responding to the incorrect nonreinforced stimulus (CS−) triggered a house-light-signaled 5-s time-out period. Failure to make a choice of either stimulus within 10 s of trial initiation was recorded as an omission. A 5-s intertrial-interval period preceded the next trial. To prevent the rats from developing a side bias, the stimuli were presented on the screen (left or right side) in a pseudorandom order (maximum three consecutive trials to the same side). The daily session ended after either 60 min, 150 rewards, or 250 trials, whichever was the first to occur. The rats reached criterion by 24 correct out of a running window of 30 trials. Prior to serial reversal learning training, a retention session with the same reward contingencies was given, as well as on the day following attainment of the learning criterion, to ensure that the rat had acquired the discrimination.
Serial Visual Reversal Learning Training
Following the retention session during visual discrimination, the contingencies reversed so the rats then had to respond to the previous nonrewarded CS− stimulus (now CS+) for reinforcement until they reached the reversal learning criterion (24/30). A retention session both preceded and followed a reversal block. A stable serial reversal performance was achieved once the rat reached criterion within three consecutive daily sessions, with more than 200 trials completed on the first reversal day. The rats underwent surgery after they acquired a stable reversal learning performance.
Stereotaxic Surgery
Rats were initially anesthetized with 5% isoflurane gas, which for the duration of the aseptic surgical procedure was reduced and maintained between 1% and 3%. We secured the rats in a stereotaxic frame (KOPF, Tujunga, CA) with atraumatic ear bars, set the tooth bar to −3.3 mm, and adjusted for flat skull position. Bilateral guide cannulas (22-GA; PlasticsOne, Roanoke, VA) were implanted in the lOFC (anteroposterior [AP] +3.5, mediolateral [ML] ±2.5, dorsoventral [DV] −1.7) and the mOFC (AP +4.0, ML ±0.6, DV −1.4) and secured with four screws and dental cement. Removable obdurators were inserted into the guide cannulas to prevent occlusion and protected with a dust cap. We obtained the surgical coordinates by using a stereotaxic atlas and made adjustments according to pilot surgeries. AP and ML coordinates were referenced to bregma, and DV was referenced to dura.
Intracerebral Microinfusions and Reversal Learning Testing
Following the surgery recovery week, the rats were rebaselined on the serial reversal learning task to ensure a continued stable performance after the surgery. Following the baseline reversal week, which also included microinfusion habituation with sham infusions, we started the bilateral drug infusions of either M100907 or DHK across reversals according to a within-subject, crossover/Latin square design. The procedure was as follows: Prior to testing, the rats were gently restrained, and injectors (PlasticsOne; 28-GA) extending 2 mm below the guide were inserted into the guide cannulas. The injectors were left in place for 1 min before and after infusion, and drug was infused in a volume of 0.5 μl over 2 min. The rats were allowed to move freely around in the experimenter’s lap during infusion. Ten minutes after drug infusion, the rats were tested on the reversal task. Infusions were administered each day of reversal—that is, from the session when contingencies first shifted to the day criterion was reached, followed by a retention session with no infusion. Thus, an animal that reached criterion on the third day received three infusions across 3 consecutive days. On the day before the next reversal, another retention session was given in which the rats received saline infusion to ensure habituation to the infusion procedure as the rats typically had 2 days without testing between these retention sessions. Thus, a complete reversal with retention sessions and break took 7 days, during which the rats typically received three drug infusions.
Histology
To confirm cannulas and injector-tip placements, we performed cresyl violet staining. Briefly, after the experiments, the rats were given a lethal dose of sodium pentobarbitone (Euthatal) and transcardially perfused with 0.01 M PBS followed by 4% (vol/vol) paraformaldehyde solution. The brains were removed, postfixed in 4% paraformaldehyde for 24 hr at room temperature, and dehydrated and preserved in 30% (wt/vol) sucrose in 0.01 M PBS for at least 2 days until sectioning. For sectioning, the brains were fast-frozen, embedded in optimal cutting temperature compound (O.C.T, VWR Chemicals, #361603E), and sectioned into 60-μm coronal sections using a cryostat (Leica, CM3050 S). The sections were stored in cryoprotectant at −20°C until cresyl violet staining.
Experimental Design and Statistical Analyses
Only animals with intact cannulas during the course of the experiments and with correct regional placement of injector tips (see Figure 1) were included in the analyses (Supplementary Table S1). All experiments employed a within-subject complete crossover/Latin square design with separate cohorts for each region and drug. Data across days within one reversal were collapsed, and trial outcomes were coded as perseverative, random, or late learning depending on performance over bins of 30 trials in a rolling window, as described in detail and illustrated previously (Hervig et al., 2020), following binomial distribution probabilities (Jones & Mishkin, 1972). Postcriterion data (>24 correct) were excluded from analysis.
Behavioral data were subjected to analysis of variance (ANOVA) using a general linear model with significance at α = .05. Data were initially tested for normality with the Shapiro-Wilk test, and data that did not pass the Shapiro-Wilk test were appropriately transformed to obtain normal distribution before analysis (as described in further detail next). Outliers were tested by inspection of studentized residuals and would only be excluded from the analyses if the subject was consistently an outlier across all drug doses and behavioral phases; no animals were excluded. Homogeneity of variance was verified using Levene’s test; for repeated-measures analyses, Mauchly’s test of sphericity was applied to assure the sphericity assumption was not violated.
The dependent variables were trials, errors, reward collection and response latencies, omissions, as well as win-stay and lose-shift probabilities. Errors were square root transformed and analyzed to learning criterion and in each phase across regions. Lose-shift and win-stay probabilities were arcsine transformed and analyzed to criterion. Nonparametric testing was applied to analyze omissions in each phase and to criterion (Wilcoxon’s; note that omissions only occurred if the animals actively initiated a trial by touching the start box). Latencies to respond to the stimuli (after initiating a trial) and to collect earned reward pellets were analyzed to criterion.
To investigate whether treatment had an impact on the overall learning strategy, we analyzed the win-stay and lose-shift behavior as a proxy for learning from positive and negative feedback, respectively. We calculated the win-stay strategy as the probability of making a correct choice after a correct trial (P[stay|win]) and the lose-shift strategy as the probability of making a correct choice after an incorrect trial (P[shift|loss]; Clarke et al., 2008; Riceberg & Shapiro, 2012). Thus, P(shift|win) + P(stay|win) = 1 and P(shift|loss) + P(stay|loss) = 1.
The “criterion of learning” and “behavioral phase” data analyses across regions were performed with two-way mixed ANOVAs in a within-subject (Treatment) × between-subjects (Region) design for regional inactivation. Data were analyzed within each region using planned pairwise comparisons with Student’s t tests and repeated-measures one-way ANOVAs as appropriate.
All statistical analyses were performed using SPSS Version 25.0.0.1, and graphs were generated using GraphPad Prism 8 (San Diego, CA). Data are presented as mean ± standard error of mean. Significant effects will be p < .05, while p > .1 will be reported as noneffects. Effect sizes are indicated with partial eta squared (ηp2; Cohen, 1988).
Results
Histological Assessment of Regional Infusion Sites
Of the 42 animals entering the reversal learning experiment, 38 rats were included in the analysis based on histological assessment of regional infusion sites. These comprised eight mOFC DHK, eight lOFC DHK, nine mOFC M100907, and 13 lOFC M100907 rats with correct regional injector placements (Figure 1D).
Effects of DHK Infusion in the mOFC and lOFC on Reversal Learning
Figure 2 shows that intra-OFC DHK produced dissociable effects on errors, with DHK in mOFC significantly reducing perseverative responses and intra-lOFC DHK not affecting errors, though significantly increasing omissions in the perseveration phase. For perseverative errors, ANOVA showed a significant DHK × Region interaction, F(1, 14) = 7.107, p = .018, ηp2 = 0.34, and main effect of DHK, F(1, 14) = 4.69, p = .048, ηp2 = 0.25, while there was no main effect of region, F(1, 14) = 0.054, p = .82, ηp2 = 0.004 (Figure 2A). Planned pairwise comparisons within each region showed that intra-mOFC DHK significantly decreased the number of errors in the perseveration phase, t7 = 2.78, p = .027, ηp2 = 0.524, while intra-lOFC DHK did not affect perseverative errors, t7 = −0.51, p = .628, ηp2 = 0.35.
For the early learning (“random”) phase, ANOVA showed a main effect of region, F(1, 14) = 5.133, p = .040, ηp2 = 0.27, but no DHK × Region interaction, F(1, 14) = 2.440, p = .141, ηp2 = 0.148, and no main effect of DHK, F(1, 14) = 0.176, p = .681, ηp2 = 0.012 (Figure 2A). For the late learning phase, ANOVA showed a significant main effect of DHK treatment, F(1, 14) = 4.869, p = .045, ηp2 = 0.26, but no DHK × Region interaction, F(1, 14) = 0.001, p = .978, ηp2 = 0.00006, or main effect of region, F(1, 14) = 2.317, p = .15, ηp2 = 0.142. Planned pairwise comparisons within each region revealed no effects of DHK treatment on the late learning phase.
For trials to criterion, there was a trend toward a main effect of region, F(1, 14) = 4.53, p = .052, ηp2 = 0.24, but no DHK × Region interaction, F(1, 14) = 0.58, p = .81, ηp2 = 0.0041, or main effect of DHK, F(1, 14) = 0.74, p = .41, ηp2 = 0.050 (Figure 2B). For omissions in the perseveration phase, a pairwise Wilcoxon signed-rank test showed that intra-lOFC DHK increased omissions, p = .043, while DHK in mOFC had no effect, p = .18 (Supplementary Figure S1). There were no effects in the random and late learning phases. We found no effects on response latencies (Figure 2C), collection latencies, or feedback sensitivity (Supplementary Figure S1).
In sum, DHK infused into the mOFC selectively reduced perseveration without affecting later learning phases. By contrast, DHK in the lOFC did not affect errors committed but increased omissions selectively in the perseveration phase.
Effects of 5-HT2AR Blockade in the mOFC and lOFC on Reversal Learning
Intra-OFC 5-HT2A blockade with M100907 produced dissociable effects on trials to criterion; intra-lOFC M100907 significantly increased overall trials required for learning, likely driven by increases across all phases, while intra-mOFC M100907 had no effect on trials (see Figure 3) but increased omissions overall (Supplementary Figure S2). For perseverative errors, ANOVA showed no M100907 × Region interaction, F(2, 40) = 0.24, p = .79, ηp2 = 0.012, and no main effect of M100907, F(2, 40) = 0.34, p = .72, ηp2 = 0.017, or region, F(1, 20) = 0.002, p = .97, ηp2 = 0.000086, and there were no effects of M100907 when analyzing within each region (Figure 3A).
For the early learning (random) phase, ANOVA showed no M100907 × Region interaction, F(2, 40) = 1.41, p = .26, ηp2 = 0.066, and no main effect of M100907, F(2, 40) = 1.54, p = .23, ηp2 = 0.072, or region, F(1, 20) = 0.177, p = .68, ηp2 = 0.009, and there were no effects of M100907 when analyzing within each region (Figure 3A). For the late learning phase, ANOVA across regions showed no M100907 × Region interaction, F(2, 40) = 0.71, p = .50, ηp2 = 0.034, and no main effect of M100907, F(2, 40) = 1.66, p = .20, ηp2 = 0.077, or region, F(2, 40) = 1.63, p = .22, ηp2 = 0.075 (Figure 3A), although within each region, ANOVA showed a significant effect of M100907 in the lOFC, F(2, 24) = 4.48, p = .022, ηp2 = 0.27, not the mOFC, F(1.9, 15.22) = 0.92, p = .90, ηp2 = 0.011. Planned pairwise comparisons showed that 1 μg M100907 in lOFC significantly increased late learning errors, t12 = −2.65, p = .021, ηp2 = 0.37, while 3 μg M100907 had no effect, t12 = −0.53, p = .61, ηp2 = 0.023.
For trials to criterion, ANOVA across regions showed no significant effects. However, when analyzing each region, ANOVA showed a significant main effect of M100907 in lOFC, F(2, 24) = 3.96, p = .033, ηp2 = 0.25, but not in mOFC, F(2, 24) = 0.24, p = .79, ηp2 = 0.030. Planned pairwise comparisons within each region showed that 1 μg M100907 in lOFC significantly increased trials to criterion, t12 = −3.43, p = .005, ηp2 = 0.49, while 1 μg M100907 in mOFC had no effect, t8 = −0.55, p = .60, ηp2 = 0.037. There was no effect of 3 μg M100907 in lOFC, t12 = −1.48, p = .17, ηp2 = 0.15, or mOFC, t8 = −0.14, p = .89, ηp2 = 0.003.
For response latencies, ANOVA showed a trend toward a main effect of M100907, F(2, 40) = 2.58, p = .088, ηp2 = 0.11, but no M100907 × Region interaction, F(2, 40) = 0.30, p = .75, ηp2 = 0.015, or main effect of region, F(1, 20) = 0.23, p = .63, ηp2 = 0.012 (see Figure 3). When analyzing each region, ANOVA showed a significant main effect of M100907 in lOFC, F(2, 24) = 3.58, p = .043, ηp2 = 0.23, but not in mOFC, F(2, 16) = 0.49, p = .62, ηp2 = 0.057. Planned pairwise comparisons showed that lOFC M100907 (1 μg) decreased response latencies, t12 = 2.92, p = .013, ηp2 = 0.42, while there was no effect of the 3 μg M100907, t12 = 1.34, p = .21, ηp2 = 0.13.
For omissions to criterion (Supplementary Figure S2), a pairwise Wilcoxon signed-rank test showed that 1 μg M100907 in mOFC significantly increased omissions, p = .031, while the lOFC M100907 did not affect omissions, p = .94. There was no effect of infusing 3 μg M100907 into the mOFC, p = .71, and lOFC, p = .22. There were no significant effects within the specific behavioral phases, although in the random phase, 1 μg M100907 in mOFC trended toward increasing omissions, p = .059. We found no effects on collection latencies or feedback sensitivity (Supplementary Figure S2).
In sum, 1 μg M100907 induced more effects than did 3 μg M100907, with these effects being found mainly in the lOFC. Intra-lOFC M100907 (1 μg) reduced reversal learning performance overall by increasing trials to criterion, probably driven by an increase in errors committed in the late learning phase. This reversal learning impairment was associated with faster response latencies. By contrast, 1 μg M100907 infused into the mOFC mainly increased omissions.
Discussion
We observed dissociable effects of intra-OFC blockade of the GLT-1 following DHK (presumably resulting in increased extracellular glutamate) and of the 5-HT2AR with M100907 (presumably resulting in diminished 5-HT2AR-mediated glutamatergic transmission) on deterministic serial visual reversal learning. Intra-mOFC DHK reduced perseverative errors, while intra-lOFC DHK had no effect on errors committed. By contrast, intra-lOFC M100907 impaired overall reversal learning as reflected by increased trials required to reach the learning criterion—presumably driven by errors increasing cumulatively at each stage reaching significance during late learning. This impairment was also associated with faster response latencies. These results add to our previous finding of dissociable roles of the rodent mOFC and lOFC in visual reversal learning (Hervig et al., 2020), which has also been reported across other tasks such as probabilistic reversal learning (Dalton et al., 2016), delay discounting (Mar et al., 2011), and instrumental action (Gourley, Lee, Howell, Pittenger, & Taylor, 2010).
Effects of Intra-OFC Blockade of GLT-1 on Serial Visual Reversal Learning
The present study shows that blockade of the astrocytic glutamate transporter GLT-1 with DHK in mOFC and lOFC affected reversal learning in a dissociable manner, though not in the direction that we expected. Based on our previous study (Hervig et al., 2020) showing that inactivating the lOFC impaired reversal learning, while inactivating the mOFC improved it, we expected to see somewhat opposite, and still dissociable, effects with DHK microinfusions. This is because DHK increases prefrontal extracellular glutamate levels (Pintor et al., 2004) and neuronal metabolic activity after local administration (at a comparable dose to the dose used in the present study) in the prefrontal cortex (PFC), while also decreasing related subcortical activity (Gasull-Camós et al., 2017). Thus, we predicted that microinfusion of DHK into lOFC would improve reversal learning, while it would impair reversal in the mOFC. Apparently paradoxically, intra-mOFC DHK improved reversal learning performance selectively in the early phase, as also occurred following inactivation of this structure. However, this improvement occurred in the absence of decreased collection latencies and enhanced negative feedback sensitivity produced by inactivation of the mOFC (Hervig et al., 2020). In contrast, intra-lOFC DHK had no effect on reversal learning performance.
Little work has been done on the role of the astrocytic GLT-1 in reversal learning, but dysfunctional astrocytes in the PFC impair reversal learning (Lima et al., 2014), and excessive glutamate impairs visual discrimination learning in GLT-1-deficient mice (Karlsson et al., 2009), as well as the initial stages of visual reversal learning (Granseth et al., 2015) and spatial reversal (Balschun et al., 2010) in VGluT1-deficient mice. Such impairments are supported by excessive glutamate induced by global DHK treatment impairing spatial memory (Bechtholt-Gompf et al., 2010) and memory retrieval (Tian et al., 2019). NMDAR antagonism, which results in excess glutamate and neuronal activation in the PFC (including the OFC; Hervig, Thomsen, Kalló, & Mikkelsen, 2016; Moghaddam, Adams, Verma, & Daly, 1997; Suzuki, Jodo, Takeuchi, Niwa, & Kayama, 2002), has also been reported to impair reversal learning in monkeys (Harder, Aboobaker, Hodgetts, & Ridley, 1998) and rodents (Abdul-Monim, Neill, & Reynolds, 2007; Dalton et al., 2011; Kumar et al., 2015; Li et al., 2016; Lobellova et al., 2013; Thonnard, Dreesen, Callaerts-Vegh, & D’Hooge, 2019; van der Meulen, Bilbija, Joosten, de Bruin, & Feenstra, 2003; Watson & Stanton, 2009)—an effect also seen in NMDA subunit-deficient mice (Brigman et al., 2008; Marquardt et al., 2014). Aberrant NMDAR function in the lOFC can also produce reversal learning deficits (Brigman et al., 2013; Marquardt et al., 2019; Thompson et al., 2015). However, there is some discrepancy as other studies with NMDAR antagonists either fail to show effects on reversal learning (Brigman, Ihne, Saksida, Bussey, & Holmes, 2009; Janhunen et al., 2015; Svoboda et al., 2015) or even show improvement in late-stage reversal learning (Dix, Gilmour, Potts, Smith, & Tricklebank, 2010; Fellini, Kumar, Gibbs, Steckler, & Talpos, 2014; McAllister, Mar, Theobald, Saksida, & Bussey, 2015). These contrasting findings may be due to methodological differences, for example, in terms of diverse pharmacological/genetic manipulations and the use of various reversal learning paradigms being dependent on different sensory modalities and, consequently, different brain structures. To understand these discrepancies, investigating effects of regional glutamate imbalance is necessary. In the present study, we show that excessive glutamate in the mOFC, and not the lOFC, improves visual serial reversal learning selectively in the early phase by reducing perseverative errors, without affecting response and reward collection latencies or feedback sensitivity. This observation supports a previous finding that prefrontal glutamate levels predict reversal learning performance in the marmoset, with enhanced glutamate availability being associated with better reversal learning performance (Lacreuse et al., 2018), suggesting that enhanced glutamate in certain PFC areas—for example, the mOFC—is beneficial for flexible responding.
We have previously suggested that the mOFC facilitates exploitative behavior (Hervig et al., 2020). DHK-induced excess glutamate in the mOFC likely disturbs the finely tuned glutamate homeostasis required for optimal neuronal functioning in learning and plasticity (Kalivas, 2009), in turn disrupting synchronized neuronal firing (Gray, 1994). This could hypothetically lead to inefficient cortico-striatal control over behavior and consequently enhanced exploration. This account may explain why intra-mOFC DHK to some degree mimics part of the effects from pharmacological mOFC inactivation on reversal learning observed previously (Hervig et al., 2020), while not fully reproducing those effects as the neural mechanisms are fundamentally different. As the mOFC, in contrast to lOFC, is the area most affected by DHK application in this study, it may be the region reflecting the brain circuitry responsible for the beneficial role of glutamate in reversal learning.
While it has been shown that optogenetic stimulation of lOFC-striatal projections suppresses compulsive grooming behavior (Burguière, Monteiro, Feng, & Graybiel, 2013), another study has shown that deep brain stimulation of the lOFC impairs spatial reversal learning, although not initial acquisition, in rats (Klanker, Post, Joosten, Feenstra, & Denys, 2013). Thus, the functional effect of lOFC activation on compulsive behavior is not straightforward, a conclusion further supported by the lack of effect of intra-lOFC DHK on visual reversal learning in the present study. As impaired reversal learning after lOFC inactivation or lesioning is well established across species, we expected to see some effect of “overactivating” the lOFC, but, at least in this paradigm, excessive glutamate in the lOFC does not seem to affect the lOFC’s control over dorsostriatal regions thought to be responsible for adapting behavior to altered response-reward contingencies in humans (Balleine & O’Doherty, 2010; Gillan et al., 2015; Morris et al., 2016), monkeys (Groman et al., 2013), and mice (Gremel & Costa, 2013). Alternatively, it is possible that overall glutamate excess does not affect lOFC neurons overall, but only subpopulations, due to the presence of functionally different individual neurons that exhibit different activational profiles depending on task after optogenetic stimulation (Jennings et al., 2019). Thus, variations in DHK infusion placements could in theory mask any specific effects mediated by individual lOFC neurons. This is further supported by studies showing that subpopulations of lOFC neurons exhibit task-dependent firing patterns during reversal learning (Gremel & Costa, 2013; Marquardt, Sigdel, & Brigman, 2017). At least, we can conclude that a hypothetical subpopulation effect in the lOFC is not transmitted to subcortical regions, such as the dorsolateral striatum, which is part of the neural circuitry mediating habitual learning (Gremel & Costa, 2013; Groman et al., 2013).
While intra-lOFC DHK did not affect primary measures of reversal learning performance, it did increase omissions specifically in the perseveration phase. Although this result should be interpreted with caution as it only encompasses few omissions in total, it does indicate some impairment in the early phase, possibly due to an attentional deficit resulting from hallucinatory-type actions (Jardri et al., 2016) or possibly due to some degree of anhedonia as shown for global and PFC DHK treatment in rats (Bechtholt-Gompf et al., 2010; John et al., 2012). Overall, our observations support a role for the GLT-1-mediated regulation of glutamate availability in the mOFC, not in the lOFC, in controlling reversal learning.
Effects of Intra-OFC Blockade of the 5-HT2AR on Serial Visual Reversal Learning
We found that selective blockade of 5-HT2ARs (by M100907) in the lOFC, not the mOFC, impaired reversal learning overall, as reflected by the increased number of trials required to reach learning criterion—an effect that presumably arose from increased errors committed cumulatively at each stage, reaching significance during late learning. This impairment was associated with faster response latencies, which could reflect overconfidence or impulsivity affecting decision-making. Blocking 5-HT2ARs in the mOFC had no effect on reversal learning but increased omissions.
This finding is consistent with a role of orbitofrontal serotonin in reversal learning as previous studies have shown that serotonin and serotonin transporter levels/polymorphisms predict individual variation in reversal learning performance in rodents (Barlow et al., 2015; Lapiz-Bluhm et al., 2009; Stolyarova, O’Dell, Marshall, & Izquierdo, 2014) and monkeys (Groman et al., 2013; Vallender, Lynch, Novak, & Miller, 2009), that orbitofrontal serotonin depletion selectively impairs visual reversal learning in monkeys (Clarke et al., 2004, 2005, 2007; Rygula et al., 2015), associated with poor response suppression (Rygula et al., 2015), and that OFC serotonin is important for reinforcer devaluation (West, Forcelli, McCue, & Malkova, 2013). Our result is also consistent with previous systemic administration of M100907 impairing reversal learning performance on an operant two-choice spatial reversal learning task, whereas systemic blockade of 5-HT2CRs had the opposite effect, improving performance (Boulougouris et al., 2008). While local 5-HT2CR antagonism in the lOFC reproduced this impairment, intra-lOFC 5-HT2AR blockade had no effect on spatial reversal learning (Boulougouris & Robbins, 2010). However, 5-HT2AR blockade with M100907 in the lOFC does impair odor-based reversal learning (Furr et al., 2012). Also, low reversal learning performance in rats is associated with decreased levels of 5-HT2AR, and serotonin, in the lOFC (Barlow et al., 2015), supporting our result that lOFC 5-HT2AR blockade impairs reversal learning performance.
In the visual serial reversal learning task used in the present study, intra-lOFC blockade of 5-HT2CR (primarily localized to inhibitory parvalbumin interneurons) improves performance in the early phase of reversal learning (Alsiö et al., 2015), which together with a reversal learning impairment after blockade of 5-HT2AR (primarily localized to glutamatergic pyramidal neurons) in the lOFC in the present study is consistent with 5-HT2CRs controlling and 5-HT2ARs facilitating reversal learning. However, it is important to note that as we did see an impairment, as expected, this impairment was not due to increased perseverative errors specifically, but rather an increase in errors committed across all reversal learning phases.
The discrepancy in effects of intra-lOFC M100907 administration on reversal learning is likely due to differences in reversal learning task design and sensory modalities involved. Boulougouris et al. used a spatial reversal learning task and only saw effects on the first, not the second or third, reversal the rats experienced, suggesting that novelty was also an important factor (Boulougouris et al., 2008; Boulougouris & Robbins, 2010). In the present study, we used a visual task, where the rats were trained in serial reversals to obtain stable reversal performance, allowing for within-subject analysis across reversals. Thus, our task is less dependent on circuitries involved in spatial cognition and excludes novelty as a possible factor.
It is important to note that only the lowest dose of 1 μg M100907 affected reversal learning significantly. This dose has been used in lOFC in a previous reversal learning study with no effects (Boulougouris & Robbins, 2010) and in the mPFC with effects on compulsivity (Mora et al., 2018). Both studies showed dissociable effects from intra-lOFC 5-HT2CR antagonist treatment—thus, this dose is presumably not targeting 5-HT2cR receptors. Also, in Furr et al. (2012), a dose comparable to the lowest dose in the present study impaired reversal learning when infused into the lOFC. However, no previous studies have used 3 μg M100907 in the OFC. Our results indicate that 3 μg M100907 had different effects from 1 μg, probably reflecting an inverted U-curve effect, which has also been reported for 5-HT2AR antagonists previously (Marek, Martin-Ruiz, Abo, & Artigas, 2005; Roth, 2011). Thus, our high dose may have induced receptor internalization, which is a known mechanism for the 5-HT2AR (Roth, 2011), supported by dose-response studies showing that systemic moderate doses of M100907 are more effective than low and high doses on a response inhibition task (Marek et al., 2005) and that intra-lOFC infusions with moderate M100907 doses induce the strongest detrimental effects on reversal learning compared with the low and high doses (Furr et al., 2012).
It is also worth speculating if the differential behavioral effects from the low and high doses of M100907 could be due to other factors. One factor is that the high M100907 dose could be targeting other receptors, in addition to the 5-HT2AR, to which M100907 has lower affinity, such as the 5-HT2CR. Since blocking the 5-HT2CR has opposite effects on reversal learning (Alsiö et al., 2015), this could in theory mask/counteract potential M100907 effects. However, this seems unlikely both because no opposite effects of low and high M100907 doses are observed in the early phase and because M100907 has subnanomolar affinity for 5-HT2ARs, at least 100-fold lower affinity for 5-HT2CRs, and negligible affinity for other receptors (Johnson, Siegel, & Carr, 1996; Kehne et al., 1996; Pehek, Nocjar, Roth, Byrd, & Mabrouk, 2006). It is also worth noting that, while the majority of prefrontal 5-HT2ARs are postsynaptic, a small proportion are presynaptic (Cornea-Hébert, Riad, Wu, Singh, & Descarries, 1999; González-Maeso et al., 2007; Jakab & Goldman-Rakic, 1998; Miner, Backstrom, Sanders-Bush, & Sesack, 2003). The high dose of M100907 may, in theory, affect presynaptic, in addition to postsynaptic, receptors to a greater extent than the low dose and may thus modulate not only downstream neuronal excitation and long-term potentiation (Aznar & Klein, 2013) but also afferent neurotransmission (Barre et al., 2016). However, more investigations are needed to elucidate these potential underlying mechanisms. Moreover, although the findings were statistically significant with a high effect size within the lOFC, the lack of statistical significance and low to moderate effect sizes in the overall ANOVA indicate that this experiment eventually will require replication.
We hypothesize that the lOFC promotes exploration, and our present study suggests that 5-HT2AR in the lOFC may be responsible not so much for the initial switch from exploitation to exploration strategies that occurs at the time of reversal but for implementing the information acquired through exploration. Our study shows that the rats are able to initiate exploration (as perseverative errors are not statistically altered), but they commit increasingly more errors in the random (early learning) and late learning phases. Thus, the information acquired initially is not properly implemented in the existing task set encoded by the lOFC as more trials/errors to update, or create new, task sets were required following intra-lOFC M100907 treatment, consistent with the well-established role for the 5-HT2AR in learning and memory (Harvey, 2003; Zhang & Stackman, 2015). This impairment is not due to deficient feedback sensitivity as win-stay/lose-shift parameters were not affected by intra-lOFC M100907, but the observed speeding of response latency could reflect altered decision-making as a result of increased “guessing” before actually having made the correct decision. The OFC is involved in decision confidence and decision-making processes (Izquierdo, 2017; Kepecs & Mainen, 2012; Kepecs, Uchida, Zariwala, & Mainen, 2008) and required for optimal waiting based on decision confidence (Lak et al., 2014). Moreover, orbitofrontal serotonin depletion is associated with poor response suppression (Rygula et al., 2015), and prefrontal 5-HT2ARs are thought to play a role in decision-making (Aznar & Klein, 2013).
That the reversal learning impairment is associated with faster responding may seem paradoxical, but this could be due to a potential role for 5-HT2AR and OFC in impulsivity. Although systemic 5-HT2AR antagonism decreases impulsive responding (Winstanley, Theobald, Dalley, Glennon, & Robbins, 2004), an effect presumably mediated through the nucleus accumbens (Robinson et al., 2008), an opposing role for 5-HT2AR in the lOFC is plausible.
Concluding Summary
We found that increasing glutamate availability in the mOFC, not lOFC, improved early reversal learning, while blocking 5-HT2ARs in the lOFC (presumably resulting in diminished glutamatergic transmission), not mOFC, lead to an overall impairment in visual reversal learning. These results further support dissociable roles of the rodent mOFC and lOFC in deterministic visual reversal learning and indicate that glutamate transmission and 5-HT2AR have different roles in these two structures.
Supplementary Material
References
- Abdul-Monim Z., Neill J. C., & Reynolds G. P. (2007). Sub-chronic psychotomimetic phencyclidine induces deficits in reversal learning and alterations in parvalbumin-immunoreactive expression in the rat. Journal of Psychopharmacology, 21, 198–205. 10.1177/0269881107067097 [DOI] [PubMed] [Google Scholar]
- Aghajanian G. K., & Marek G. J. (1999). Serotonin, via 5-HT 2A receptors, increases EPSCs in layer V pyramidal cells of prefrontal cortex by an asynchronous mode of glutamate release. Brain Research, 825, 161–171. 10.1016/s0006-8993(99)01224-x [DOI] [PubMed] [Google Scholar]
- Alsiö J., Nilsson S. R. O., Gastambide F., Wang R. A. H., Dam S. A., Mar A. C., et al. Robbins T. W. (2015). The role of 5-HT2C receptors in touchscreen visual reversal learning in the rat: A cross-site study. Psychopharmacology, 232, 4017–4031. 10.1007/s00213-015-3963-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Amargós-Bosch M., Bortolozzi A., Puig M. V., Serrats J., Adell A., Celada P., et al. Artigas F. (2004). Co-expression and in vivo interaction of serotonin1A and serotonin2A receptors in pyramidal neurons of prefrontal cortex. Cerebral Cortex, 14, 281–299. 10.1093/cercor/bhg128 [DOI] [PubMed] [Google Scholar]
- Apergis-Schoute A. M., Gillan C. M., Fineberg N. A., Fernandez-Egea E., Sahakian B. J., & Robbins T. W. (2017). Neural basis of impaired safety signaling in obsessive compulsive disorder. Proceedings of the National Academy of Sciences of the United States of America, 114, 3216–3221. 10.1073/pnas.1609194114 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Arriza J. L., Fairman W. A., Amara S. G., Kavanaugh P., Wadiche I., & Murdoch H. (1994). Functional comparisons of three glutamate transporter subtypes cloned from human motor cortex. The Journal of Neuroscience, 14, 5559–5569. 10.1523/JNEUROSCI.14-09-05559.1994 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Aznar S., & Klein A. B. (2013). Regulating prefrontal cortex activation: An emerging role for the 5-HT2A serotonin receptor in the modulation of emotion-based actions? Molecular Neurobiology, 48, 841–853. 10.1007/s12035-013-8472-0 [DOI] [PubMed] [Google Scholar]
- Balleine B. W., & O’Doherty J. P. (2010). Human and rodent homologies in action control: Corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology, 35, 48–69. 10.1038/npp.2009.131 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Balschun D., Moechars D., Callaerts-Vegh Z., Vermaercke B., Van Acker N., Andries L., & D’Hooge R. (2010). Vesicular glutamate transporter VGLUT1 has a role in hippocampal long-term potentiation and spatial reversal learning. Cerebral Cortex, 20, 684–693. 10.1093/cercor/bhp133 [DOI] [PubMed] [Google Scholar]
- Bari A., Theobald D. E., Caprioli D., Mar A. C., Aidoo-Micah A., Dalley J. W., & Robbins T. W. (2010). Serotonin modulates sensitivity to reward and negative feedback in a probabilistic reversal learning task in rats. Neuropsychopharmacology, 35, 1290–1301. 10.1038/npp.2009.233 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barlow R. L., Alsiö J., Jupp B., Rabinovich R., Shrestha S., Roberts A. C., et al. Dalley J. W. (2015). Markers of serotonergic function in the orbitofrontal cortex and dorsal raphé nucleus predict individual variation in spatial-discrimination serial reversal learning. Neuropsychopharmacology, 40, 1619–1630. 10.1038/npp.2014.335 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barre A., Berthoux C., De Bundel D., Valjent E., Bockaert J., Marin P., & Bécamel C. (2016). Presynaptic serotonin 2A receptors modulate thalamocortical plasticity and associative learning. Proceedings of the National Academy of Sciences of the United States of America, 113, E1382–E1391. 10.1073/pnas.1525586113 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bechtholt-Gompf A. J., Walther H. V., Adams M. A., Carlezon W. A. Jr., Ongür D., & Cohen B. M. (2010). Blockade of astrocytic glutamate uptake in rats induces signs of anhedonia and impaired spatial memory. Neuropsychopharmacology, 35, 2049–2059. 10.1038/npp.2010.74 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bohn I., Giertler C., & Hauber W. (2003). Orbital prefrontal cortex and guidance of instrumental behaviour in rats under reversal conditions. Behavioural Brain Research, 143, 49–56. 10.1016/S0166-4328(03)00008-1 [DOI] [PubMed] [Google Scholar]
- Boulougouris V., Glennon J. C., & Robbins T. W. (2008). Dissociable effects of selective 5-HT2A and 5-HT2C receptor antagonists on serial spatial reversal learning in rats. Neuropsychopharmacology, 33, 2007–2019. 10.1038/sj.npp.1301584 [DOI] [PubMed] [Google Scholar]
- Boulougouris V., & Robbins T. W. (2010). Enhancement of spatial reversal learning by 5-HT2C receptor antagonism is neuroanatomically specific. The Journal of Neuroscience, 30, 930–938. 10.1523/JNEUROSCI.4312-09.2010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brigman J. L., Daut R. A., Wright T., Gunduz-Cinar O., Graybeal C., Davis M. I., et al. Holmes A. (2013). GluN2B in corticostriatal circuits governs choice learning and choice shifting. Nature Neuroscience, 16, 1101–1110. 10.1038/nn.3457 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brigman J. L., Feyder M., Saksida L. M., Bussey T. J., Mishina M., & Holmes A. (2008). Impaired discrimination learning in mice lacking the NMDA receptor NR2A subunit. Learning & Memory, 15, 50–54. 10.1101/lm.777308 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brigman J. L., Ihne J., Saksida L. M., Bussey T. J., & Holmes A. (2009). Effects of subchronic phencyclidine (PCP) treatment on social behaviors, and operant discrimination and reversal learning in C57BL/6J mice. Frontiers in Behavioral Neuroscience, 3, 2 10.3389/neuro.08.002.2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brigman J. L., Mathur P., Harvey-White J., Izquierdo A., Saksida L. M., Bussey T. J., et al. Holmes A. (2010). Pharmacological or genetic inactivation of the serotonin transporter improves reversal learning in mice. Cerebral Cortex, 20, 1955–1963. 10.1093/cercor/bhp266 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Brown H. D., Amodeo D. A., Sweeney J. A., & Ragozzino M. E. (2012). The selective serotonin reuptake inhibitor, escitalopram, enhances inhibition of prepotent responding and spatial reversal learning. Journal of Psychopharmacology, 26, 1443–1455. 10.1177/0269881111430749 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burguière E., Monteiro P., Feng G., & Graybiel A. M. (2013). Optogenetic stimulation of lateral orbitofronto-striatal pathway suppresses compulsive behaviors. Science, 340, 1243–1246. 10.1126/science.1232380 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burke K. A., Takahashi Y. K., Correll J., Brown P. L., & Schoenbaum G. (2009). Orbitofrontal inactivation impairs reversal of Pavlovian learning by interfering with ‘disinhibition’ of responding for previously unrewarded cues. European Journal of Neuroscience, 30, 1941–1946. 10.1111/j.1460-9568.2009.06992.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- Butter C. M. (1964). Habituation of responses to novel stimuli in monkeys with selective frontal lesions. Science, 144, 313–315. 10.1126/science.144.3616.313 [DOI] [PubMed] [Google Scholar]
- Chamberlain S. R., Menzies L., Hampshire A., Suckling J., Fineberg N. A., del Campo N., et al. Sahakian B. J. (2008). Orbitofrontal dysfunction in patients with obsessive-compulsive disorder and their unaffected relatives. Science, 321, 421–422. 10.1126/science.1154433 [DOI] [PubMed] [Google Scholar]
- Chau B. K. H., Sallet J., Papageorgiou G. K., Noonan M. P., Bell A. H., Walton M. E., & Rushworth M. F. S. (2015). Contrasting roles for orbitofrontal cortex and amygdala in credit assignment and learning in macaques. Neuron, 87, 1106–1118. 10.1016/j.neuron.2015.08.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cheng W., Rolls E. T., Qiu J., Liu W., Tang Y., Huang C. C., et al. Feng J. (2016). Medial reward and lateral non-reward orbitofrontal cortex circuits change in opposite directions in depression. Brain: A Journal of Neurology, 139, 3296–3309. 10.1093/brain/aww255 [DOI] [PubMed] [Google Scholar]
- Chudasama Y., & Robbins T. W. (2003). Dissociable contributions of the orbitofrontal and infralimbic cortex to Pavlovian autoshaping and discrimination reversal learning: Further evidence for the functional heterogeneity of the rodent frontal cortex. The Journal of Neuroscience, 23, 8771–8780. 10.1523/JNEUROSCI.23-25-08771.2003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Clarke H. F., Dalley J. W., Crofts H. S., Robbins T. W., & Roberts A. C. (2004). Cognitive inflexibility after prefrontal serotonin depletion. Science, 304, 878–880. 10.1126/science.1094987 [DOI] [PubMed] [Google Scholar]
- Clarke H. F., Robbins T. W., & Roberts A. C. (2008). Lesions of the medial striatum in monkeys produce perseverative impairments during reversal learning similar to those produced by lesions of the orbitofrontal cortex. The Journal of Neuroscience, 28, 10972–10982. 10.1523/JNEUROSCI.1521-08.2008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Clarke H. F., Walker S. C., Crofts H. S., Dalley J. W., Robbins T. W., & Roberts A. C. (2005). Prefrontal serotonin depletion affects reversal learning but not attentional set shifting. The Journal of Neuroscience, 25, 532–538. 10.1523/JNEUROSCI.3690-04.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Clarke H. F., Walker S. C., Dalley J. W., Robbins T. W., & Roberts A. C. (2007). Cognitive inflexibility after prefrontal serotonin depletion is behaviorally and neurochemically specific. Cerebral Cortex, 17, 18–27. 10.1093/cercor/bhj120 [DOI] [PubMed] [Google Scholar]
- Cohen J. (1988). Statistical power analysis for the behavioral sciences (2nd ed.). New York, NY: Taylor and Francis. [Google Scholar]
- Cornea-Hébert V., Riad M., Wu C., Singh S. K., & Descarries L. (1999). Cellular and subcellular distribution of the serotonin 5-HT2A receptor in the central nervous system of adult rat. The Journal of Comparative Neurology, 409, 187–209. [DOI] [PubMed] [Google Scholar]
- Dalton G. L., Ma L. M., Phillips A. G., & Floresco S. B. (2011). Blockade of NMDA GluN2B receptors selectively impairs behavioral flexibility but not initial discrimination learning. Psychopharmacology, 216, 525–535. 10.1007/s00213-011-2246-z [DOI] [PubMed] [Google Scholar]
- Dalton G. L., Wang N. Y., Phillips A. G., & Floresco S. B. (2016). Multifaceted contributions by different regions of the orbitofrontal and medial prefrontal cortex to probabilistic reversal learning. The Journal of Neuroscience, 36, 1996–2006. 10.1523/JNEUROSCI.3366-15.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Danbolt N. C., Storm-Mathisen J., & Kanner B. I. (1992). An [Na+ + K+] coupled L-glutamate transporter purified from rat brain is located in glial cell processes. Neuroscience, 51, 295–310. 10.1016/0306-4522(92)90316-T [DOI] [PubMed] [Google Scholar]
- D’Cruz A. M., Ragozzino M. E., Mosconi M. W., Shrestha S., Cook E. H., & Sweeney J. A. (2013). Reduced behavioral flexibility in autism spectrum disorders. Neuropsychology, 27, 152–160. 10.1037/a0031721 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dias R., Robbins T. W., & Roberts A. C. (1996). Dissociation in prefrontal cortex of affective and attentional shifts. Nature, 380, 69–72. 10.1038/380069a0 [DOI] [PubMed] [Google Scholar]
- Dix S., Gilmour G., Potts S., Smith J. W., & Tricklebank M. (2010). A within-subject cognitive battery in the rat: Differential effects of NMDA receptor antagonists. Psychopharmacology, 212, 227–242. 10.1007/s00213-010-1945-1 [DOI] [PubMed] [Google Scholar]
- Elliott R., Dolan R. J., & Frith C. D. (2000). Dissociable functions in the medial and lateral orbitofrontal cortex: Evidence from human neuroimaging studies. Cerebral Cortex, 10, 308–317. 10.1093/cercor/10.3.308 [DOI] [PubMed] [Google Scholar]
- Fellini L., Kumar G., Gibbs S., Steckler T., & Talpos J. (2014). Re-evaluating the PCP challenge as a pre-clinical model of impaired cognitive flexibility in schizophrenia. European Neuropsychopharmacology, 24, 1836–1849. 10.1016/j.euroneuro.2014.08.012 [DOI] [PubMed] [Google Scholar]
- Fellows L. K., & Farah M. J. (2003). Ventromedial frontal cortex mediates affective shifting in humans: Evidence from a reversal learning paradigm. Brain: A Journal of Neurology, 126, 1830–1837. 10.1093/brain/awg180 [DOI] [PubMed] [Google Scholar]
- Furr A., Lapiz-Bluhm M. D., & Morilak D. A. (2012). 5-HT2A receptors in the orbitofrontal cortex facilitate reversal learning and contribute to the beneficial cognitive effects of chronic citalopram treatment in rats. International Journal of Neuropsychopharmacology, 15, 1295–1305. 10.1017/S1461145711001441 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gasbarri A., & Pompili A. (2014). Involvement of glutamate in learning and memory In Meneses A. (Ed.), Identification of neural markers accompanying memory (pp. 63–77). London, UK: Elsevier. [Google Scholar]
- Gasull-Camós J., Soto-Montenegro M. L., Casquero-Veiga M., Desco M., Artigas F., & Castañé A. (2017). Differential patterns of subcortical activity evoked by glial glt-1 blockade in prelimbic and infralimbic cortex: Relationship to antidepressant-like effects in rats. International Journal of Neuropsychopharmacology, 20, 988–993. 10.1093/ijnp/pyx067 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ghahremani D. G., Monterosso J., Jentsch J. D., Bilder R. M., & Poldrack R. A. (2010). Neural components underlying behavioral flexibility in human reversal learning. Cerebral Cortex, 20, 1843–1852. 10.1093/cercor/bhp247 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gibbs M. E., Hutchinson D., & Hertz L. (2008). Astrocytic involvement in learning and memory consolidation. Neuroscience and Biobehavioral Reviews, 32, 927–944. 10.1016/j.neubiorev.2008.02.001 [DOI] [PubMed] [Google Scholar]
- Gillan C. M., Apergis-Schoute A. M., Morein-Zamir S., Urcelay G. P., Sule A., Fineberg N. A., et al. Robbins T. W. (2015). Functional neuroimaging of avoidance habits in obsessive-compulsive disorder. The American Journal of Psychiatry, 172, 284–293. 10.1176/appi.ajp.2014.14040525 [DOI] [PMC free article] [PubMed] [Google Scholar]
- González-Maeso J., Weisstaub N. V., Zhou M., Chan P., Ivic L., Ang R., et al. Gingrich J. A. (2007). Hallucinogens recruit specific cortical 5-HT(2A) receptor-mediated signaling pathways to affect behavior. Neuron, 53, 439–452. 10.1016/j.neuron.2007.01.008 [DOI] [PubMed] [Google Scholar]
- Gourley S. L., Lee A. S., Howell J. L., Pittenger C., & Taylor J. R. (2010). Dissociable regulation of instrumental action within mouse prefrontal cortex. European Journal of Neuroscience, 32, 1726–1734. 10.1111/j.1460-9568.2010.07438.x [DOI] [PMC free article] [PubMed] [Google Scholar]
- Granseth B., Andersson F. K., & Lindström S. H. (2015). The initial stage of reversal learning is impaired in mice hemizygous for the vesicular glutamate transporter (VGluT1). Genes, Brain & Behavior, 14, 477–485. 10.1111/gbb.12230 [DOI] [PubMed] [Google Scholar]
- Gray C. M. (1994). Synchronous oscillations in neuronal systems: Mechanisms and functions. Journal of Computational Neuroscience, 1, 11–38. 10.1007/BF00962716 [DOI] [PubMed] [Google Scholar]
- Graybeal C., Feyder M., Schulman E., Saksida L. M., Bussey T. J., Brigman J. L., & Holmes A. (2011). Paradoxical reversal learning enhancement by stress or prefrontal cortical damage: Rescue with BDNF. Nature Neuroscience, 14, 1507–1509. 10.1038/nn.2954 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gremel C. M., & Costa R. M. (2013). Orbitofrontal and striatal circuits dynamically encode the shift between goal-directed and habitual actions. Nature Communications, 4, 2264 10.1038/ncomms3264 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Groman S. M., James A. S., Seu E., Crawford M. A., Harpster S. N., & Jentsch J. D. (2013). Monoamine levels within the orbitofrontal cortex and putamen interact to predict reversal learning performance. Biological Psychiatry, 73, 756–762. 10.1016/j.biopsych.2012.12.002 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hampshire A., & Owen A. M. (2006). Fractionating attentional control using event-related fMRI. Cerebral Cortex, 16, 1679–1689. 10.1093/cercor/bhj116 [DOI] [PubMed] [Google Scholar]
- Harder J. A., Aboobaker A. A., Hodgetts T. C., & Ridley R. M. (1998). Learning impairments induced by glutamate blockade using dizocilpine (MK-801) in monkeys. British Journal of Pharmacology, 125, 1013–1018. 10.1038/sj.bjp.0702178 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Harvey J. A. (2003). Role of the serotonin 5-HT 2A receptor in learning. Learning & Memory, 10, 355–362. 10.1101/lm.60803 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hervig M. E., Fiddian L., Piilgaard L., Božič T., Blanco-Pozo M., Knudsen C., et al. Robbins T. W. (2020). Dissociable and paradoxical roles of rat medial and lateral orbitofrontal cortex in visual serial reversal learning. Cerebral Cortex, 30, 1016–1029. 10.1093/cercor/bhz144 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hervig M. E., Thomsen M. S., Kalló I., & Mikkelsen J. D. (2016). Acute phencyclidine administration induces c-Fos-immunoreactivity in interneurons in cortical and subcortical regions. Neuroscience, 334, 13–25. 10.1016/j.neuroscience.2016.07.028 [DOI] [PubMed] [Google Scholar]
- Hornak J., O’Doherty J., Bramham J., Rolls E. T., Morris R. G., Bullock P. R., & Polkey C. E. (2004). Reward-related reversal learning after surgical excisions in orbito-frontal or dorsolateral prefrontal cortex in humans. Journal of Cognitive Neuroscience, 16, 463–478. 10.1162/089892904322926791 [DOI] [PubMed] [Google Scholar]
- Iversen S. D., & Mishkin M. (1970). Perseverative interference in monkeys following selective lesions of the inferior prefrontal convexity. Experimental Brain Research, 11, 376–386. 10.1007/BF00237911 [DOI] [PubMed] [Google Scholar]
- Izquierdo A. (2017). Functional heterogeneity within rat orbitofrontal cortex in reward learning and decision making. The Journal of Neuroscience, 37, 10529–10540. 10.1523/JNEUROSCI.1678-17.2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Izquierdo A., Brigman J. L., Radke A. K., Rudebeck P. H., & Holmes A. (2017). The neural basis of reversal learning: An updated perspective. Neuroscience, 345, 12–26. 10.1016/j.neuroscience.2016.03.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Izquierdo A., Carlos K., Ostrander S., Rodriguez D., McCall-Craddolph A., Yagnik G., & Zhou F. (2012). Impaired reward learning and intact motivation after serotonin depletion in rats. Behavioural Brain Research, 233, 494–499. 10.1016/j.bbr.2012.05.032 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jakab R. L., & Goldman-Rakic P. S. (1998). 5-Hydroxytryptamine2A serotonin receptors in the primate cerebral cortex: Possible site of action of hallucinogenic and antipsychotic drugs in pyramidal cell apical dendrites. Proceedings of the National Academy of Sciences of the United States of America, 95, 735–740. 10.1073/pnas.95.2.735 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Janhunen S. K., Svärd H., Talpos J., Kumar G., Steckler T., Plath N., et al. Ballard T. M. (2015). The subchronic phencyclidine rat model: Relevance for the assessment of novel therapeutics for cognitive impairment associated with schizophrenia. Psychopharmacology, 232, 4059–4083. 10.1007/s00213-015-3954-6 [DOI] [PubMed] [Google Scholar]
- Jardri R., Hugdahl K., Hughes M., Brunelin J., Waters F., Alderson-Day B., et al. Denève S. (2016). Are hallucinations due to an imbalance between excitatory and inhibitory influences on the brain? Schizophrenia Bulletin, 42, 1124–1134. 10.1093/schbul/sbw075 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jennings J. H., Kim C. K., Marshel J. H., Raffiee M., Ye L., Quirin S., et al. Deisseroth K. (2019). Interacting neural ensembles in orbitofrontal cortex for social and feeding behaviour. Nature, 565, 645–649. 10.1038/s41586-018-0866-8 [DOI] [PMC free article] [PubMed] [Google Scholar]
- John C. S., Smith K. L., Van’t Veer A., Gompf H. S., Carlezon W. A. Jr., Cohen B. M., et al. Bechtholt-Gompf A. J. (2012). Blockade of astrocytic glutamate uptake in the prefrontal cortex induces anhedonia. Neuropsychopharmacology, 37, 2467–2475. 10.1038/npp.2012.105 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Johnson M. P., Siegel B. W., & Carr A. A. (1996). [3H]MDL 100,907: A novel selective 5-HT2A receptor ligand. Naunyn-Schmiedeberg’s Archives of Pharmacology, 354, 205–209. 10.1007/BF00178722 [DOI] [PubMed] [Google Scholar]
- Jones B., & Mishkin M. (1972). Limbic lesions and the problem of stimulus—Reinforcement associations. Experimental Neurology, 36, 362–377. 10.1016/0014-4886(72)90030-1 [DOI] [PubMed] [Google Scholar]
- Kahnt T., Chang L. J., Park S. Q., Heinzle J., & Haynes J. D. (2012). Connectivity-based parcellation of the human orbitofrontal cortex. The Journal of Neuroscience, 32, 6240–6250. 10.1523/JNEUROSCI.0257-12.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kalivas P. W. (2009). The glutamate homeostasis hypothesis of addiction. Nature Reviews Neuroscience, 10, 561–572. 10.1038/nrn2515 [DOI] [PubMed] [Google Scholar]
- Karlsson R.-M., Tanaka K., Saksida L. M., Bussey T. J., Heilig M., & Holmes A. (2009). Assessment of glutamate transporter GLAST (EAAT1)-deficient mice for phenotypes relevant to the negative and executive/cognitive symptoms of schizophrenia. Neuropsychopharmacology, 34, 1578–1589. 10.1038/npp.2008.215 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kehne J. H., Baron B. M., Carr A. A., Chaney S. F., Elands J., Feldman D. J., et al. Widmaier C. (1996). Preclinical characterization of the potential of the putative atypical antipsychotic MDL 100,907 as a potent 5-HT2A antagonist with a favorable CNS safety profile. The Journal of Pharmacology and Experimental Therapeutics, 277, 968–981. http://jpet.aspetjournals.org/content/277/2/968.long [PubMed] [Google Scholar]
- Kepecs A., & Mainen Z. F. (2012). A computational framework for the study of confidence in humans and animals. Philosophical Transactions of the Royal Society B: Biological Sciences, 367, 1322–1337. 10.1098/rstb.2012.0037 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kepecs A., Uchida N., Zariwala H. A., & Mainen Z. F. (2008). Neural correlates, computation and behavioural impact of decision confidence. Nature, 455, 227–231. 10.1038/nature07200 [DOI] [PubMed] [Google Scholar]
- Kim J., & Ragozzino M. E. (2005). The involvement of the orbitofrontal cortex in learning under changing task contingencies. Neurobiology of Learning and Memory, 83, 125–133. 10.1016/j.nlm.2004.10.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Klanker M., Post G., Joosten R., Feenstra M., & Denys D. (2013). Deep brain stimulation in the lateral orbitofrontal cortex impairs spatial reversal learning. Behavioural Brain Research, 245, 7–12. 10.1016/j.bbr.2013.01.043 [DOI] [PubMed] [Google Scholar]
- Kringelbach M. L., & Rolls E. T. (2004). The functional neuroanatomy of the human orbitofrontal cortex: Evidence from neuroimaging and neuropsychology. Progress in Neurobiology, 72, 341–372. 10.1016/j.pneurobio.2004.03.006 [DOI] [PubMed] [Google Scholar]
- Kumar G., Olley J., Steckler T., & Talpos J. (2015). Dissociable effects of NR2A and NR2B NMDA receptor antagonism on cognitive flexibility but not pattern separation. Psychopharmacology, 232, 3991–4003. 10.1007/s00213-015-4008-9 [DOI] [PubMed] [Google Scholar]
- Lacreuse A., Moore C. M., LaClair M., Payne L., & King J. A. (2018). Glutamine/glutamate (Glx) concentration in prefrontal cortex predicts reversal learning performance in the marmoset. Behavioural Brain Research, 346, 11–15. 10.1016/j.bbr.2018.01.025 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lak A., Costa G. M., Romberg E., Koulakov A. A., Mainen Z. F., & Kepecs A. (2014). Orbitofrontal cortex is required for optimal waiting based on decision confidence. Neuron, 84, 190–201. 10.1016/j.neuron.2014.08.039 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lapiz-Bluhm M. D. S., Soto-Piña A. E., Hensler J. G., & Morilak D. A. (2009). Chronic intermittent cold stress and serotonin depletion induce deficits of reversal learning in an attentional set-shifting test in rats. Psychopharmacology, 202, 329–341. 10.1007/s00213-008-1224-6 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Leeson V. C., Robbins T. W., Matheson E., Hutton S. B., Ron M. A., Barnes T. R. E., & Joyce E. M. (2009). Discrimination learning, reversal, and set-shifting in first-episode schizophrenia: Stability over six years and specific associations with medication type and disorganization syndrome. Biological Psychiatry, 66, 586–593. 10.1016/j.biopsych.2009.05.016 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li J. T., Su Y. A., Wang H. L., Zhao Y. Y., Liao X. M., Wang X. D., & Si T. M. (2016). Repeated blockade of NMDA receptors during adolescence impairs reversal learning and disrupts GABAergic interneurons in rat medial prefrontal cortex. Frontiers in Molecular Neuroscience, 9, 17 10.3389/fnmol.2016.00017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lima A., Sardinha V. M., Oliveira A. F., Reis M., Mota C., Silva M. A., et al. Oliveira J. F. (2014). Astrocyte pathology in the prefrontal cortex impairs the cognitive function of rats. Molecular Psychiatry, 19, 834–841. 10.1038/mp.2013.182 [DOI] [PubMed] [Google Scholar]
- Liu S., Bubar M. J., Lanfranco M. F., Hillman G. R., & Cunningham K. A. (2007). Serotonin2C receptor localization in GABA neurons of the rat medial prefrontal cortex: Implications for understanding the neurobiology of addiction. Neuroscience, 146, 1677–1688. 10.1016/j.neuroscience.2007.02.064 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Lobellova V., Entlerova M., Svojanovska B., Hatalova H., Prokopova I., Petrasek T., et al. Stuchlik A. (2013). Two learning tasks provide evidence for disrupted behavioural flexibility in an animal model of schizophrenia-like behaviour induced by acute MK-801: A dose-response study. Behavioural Brain Research, 246, 55–62. 10.1016/j.bbr.2013.03.006 [DOI] [PubMed] [Google Scholar]
- Mar A. C., Horner A. E., Nilsson S. R. O., Alsiö J., Kent B. A., Kim C. H., et al. Bussey T. J. (2013). The touchscreen operant platform for assessing executive function in rats and mice. Nature Protocols, 8, 1985–2005. 10.1038/nprot.2013.123 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mar A. C., Walker A. L. J., Theobald D. E., Eagle D. M., & Robbins T. W. (2011). Dissociable effects of lesions to orbitofrontal cortex subregions on impulsive choice in the rat. The Journal of Neuroscience, 31, 6398–6404. 10.1523/JNEUROSCI.6620-10.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marek G. J., Martin-Ruiz R., Abo A., & Artigas F. (2005). The selective 5-HT2A receptor antagonist M100907 enhances antidepressant-like behavioral effects of the SSRI fluoxetine. Neuropsychopharmacology, 30, 2205–2215. 10.1038/sj.npp.1300762 [DOI] [PubMed] [Google Scholar]
- Marinova Z., Chuang D.-M., & Fineberg N. (2017). Glutamate-modulating drugs as a potential therapeutic strategy in obsessive-compulsive disorder. Current Neuropharmacology, 15, 977–995. 10.2174/1570159X15666170320104237 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marquardt K., Josey M., Kenton J. A., Cavanagh J. F., Holmes A., & Brigman J. L. (2019). Impaired cognitive flexibility following NMDAR-GluN2B deletion is associated with altered orbitofrontal-striatal function. Neuroscience, 404, 338–352. 10.1016/j.neuroscience.2019.01.066 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marquardt K., Saha M., Mishina M., Young J. W., & Brigman J. L. (2014). Loss of GluN2A-containing NMDA receptors impairs extra-dimensional set-shifting. Genes, Brain & Behavior, 13, 611–617. 10.1111/gbb.12156 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Marquardt K., Sigdel R., & Brigman J. L. (2017). Touch-screen visual reversal learning is mediated by value encoding and signal propagation in the orbitofrontal cortex. Neurobiology of Learning and Memory, 139, 179–188. 10.1016/j.nlm.2017.01.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Masaki D., Yokoyama C., Kinoshita S., Tsuchida H., Nakatomi Y., Yoshimoto K., & Fukui K. (2006). Relationship between limbic and cortical 5-HT neurotransmission and acquisition and reversal learning in a go/no-go task in rats. Psychopharmacology, 189, 249–258. 10.1007/s00213-006-0559-0 [DOI] [PubMed] [Google Scholar]
- Matias S., Lottem E., Dugué G. P., & Mainen Z. F. (2017). Activity patterns of serotonin neurons underlying cognitive flexibility. eLife, 6, 1–24. 10.7554/eLife.20552 [DOI] [PMC free article] [PubMed] [Google Scholar]
- McAllister K. A. L., Mar A. C., Theobald D. E., Saksida L. M., & Bussey T. J. (2015). Comparing the effects of subchronic phencyclidine and medial prefrontal cortex dysfunction on cognitive tests relevant to schizophrenia. Psychopharmacology, 232, 3883–3897. 10.1007/s00213-015-4018-7 [DOI] [PubMed] [Google Scholar]
- McAlonan K., & Brown V. J. (2003). Orbital prefrontal cortex mediates reversal learning and not attentional set shifting in the rat. Behavioural Brain Research, 146, 97–103. 10.1016/j.bbr.2003.09.019 [DOI] [PubMed] [Google Scholar]
- Milad M. R., & Rauch S. L. (2012). Obsessive-compulsive disorder: Beyond segregated cortico-striatal pathways. Trends in Cognitive Sciences, 16, 43–51. 10.1016/j.tics.2011.11.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miner L. A. H., Backstrom J. R., Sanders-Bush E., & Sesack S. R. (2003). Ultrastructural localization of serotonin2A receptors in the middle layers of the rat prelimbic prefrontal cortex. Neuroscience, 116, 107–117. 10.1016/S0306-4522(02)00580-8 [DOI] [PubMed] [Google Scholar]
- Moghaddam B., Adams B., Verma A., & Daly D. (1997). Activation of glutamatergic neurotransmission by ketamine: A novel step in the pathway from NMDA receptor blockade to dopaminergic and cognitive disruptions associated with the prefrontal cortex. The Journal of Neuroscience, 17, 2921–2927. 10.1523/JNEUROSCI.17-08-02921.1997 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mora S., Merchán A., Vilchez O., Aznar S., Klein A. B., Ultved L., et al. Moreno M. (2018). Reduced cortical serotonin 5-HT2A receptor binding and glutamate activity in high compulsive drinker rats. Neuropharmacology, 143, 10–19. 10.1016/j.neuropharm.2018.09.004 [DOI] [PubMed] [Google Scholar]
- Morris L. S., Kundu P., Dowell N., Mechelmans D. J., Favre P., Irvine M. A., et al. Voon V. (2016). Fronto-striatal organization: Defining functional and microstructural substrates of behavioural flexibility. Cortex: A Journal Devoted to the Study of the Nervous System and Behavior, 74, 118–133. 10.1016/j.cortex.2015.11.004 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Muñoz M. D., Herreras O., Herranz A. S., Soís J. M., Martín del Río R., & Lerma J. (1987). Effects of dihydrokainic acid on extracellular amino acids and neuronal excitability in the in vivo rat hippocampus. Neuropharmacology, 26, 1–8. 10.1016/0028-3908(87)90037-2 [DOI] [PubMed] [Google Scholar]
- Murphy F. C., Smith K. A., Cowen P. J., Robbins T. W., & Sahakian B. J. (2002). The effects of tryptophan depletion on cognitive and affective processing in healthy volunteers. Psychopharmacology, 163, 42–53. 10.1007/s00213-002-1128-9 [DOI] [PubMed] [Google Scholar]
- Nieoullon A. (2009). Glial glutamate transporters: Electrophysiology In Squire L. (Ed.), Encyclopedia of neuroscience (pp. 793–804). London, UK: Elsevier. [Google Scholar]
- Noonan M. P., Chau B. K. H., Rushworth M. F. S., & Fellows L. K. (2017). Contrasting effects of medial and lateral orbitofrontal cortex lesions on credit assignment and decision-making in humans. The Journal of Neuroscience, 37, 7023–7035. 10.1523/JNEUROSCI.0692-17.2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Noonan M. P., Walton M. E., Behrens T. E. J., Sallet J., Buckley M. J., & Rushworth M. F. S. (2010). Separate value comparison and learning mechanisms in macaque medial and lateral orbitofrontal cortex. Proceedings of the National Academy of Sciences of the United States of America, 107, 20547–20552. 10.1073/pnas.1012246107 [DOI] [PMC free article] [PubMed] [Google Scholar]
- O’Doherty J., Kringelbach M. L., Rolls E. T., Hornak J., & Andrews C. (2001). Abstract reward and punishment representations in the human orbitofrontal cortex. Nature Neuroscience, 4, 95–102. 10.1038/82959 [DOI] [PubMed] [Google Scholar]
- Pehek E. A., Nocjar C., Roth B. L., Byrd T. A., & Mabrouk O. S. (2006). Evidence for the preferential involvement of 5-HT2A serotonin receptors in stress- and drug-induced dopamine release in the rat medial prefrontal cortex. Neuropsychopharmacology, 31, 265–277. 10.1038/sj.npp.1300819 [DOI] [PubMed] [Google Scholar]
- Perani D., Garibotto V., Gorini A., Moresco R. M., Henin M., Panzacchi A., et al. Fazio F. (2008). In Vivo PET study of 5HT(2A) serotonin and D(2) dopamine dysfunction in drug-naive obsessive-compulsive disorder. NeuroImage, 42, 306–314. 10.1016/j.neuroimage.2008.04.233 [DOI] [PubMed] [Google Scholar]
- Petr G. T., Sun Y., Frederick N. M., Zhou Y., Dhamne S. C., Hameed M. Q., et al. Rosenberg P. A. (2015). Conditional deletion of the glutamate transporter GLT-1 reveals that astrocytic GLT-1 protects against fatal epilepsy while neuronal GLT-1 contributes significantly to glutamate uptake into synaptosomes. The Journal of Neuroscience, 35, 5187–5201. 10.1523/JNEUROSCI.4255-14.2015 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pintor A., Galluzzo M., Grieco R., Pèzzola A., Reggio R., & Popoli P. (2004). Adenosine A 2A receptor antagonists prevent the increase in striatal glutamate levels induced by glutamate uptake inhibitors. Journal of Neurochemistry, 89, 152–156. 10.1111/j.1471-4159.2003.02306.x [DOI] [PubMed] [Google Scholar]
- Rahman S., Sahakian B. J., Hodges J. R., Rogers R. D., & Robbins T. W. (1999). Specific cognitive deficits in mild frontal variant frontotemporal dementia. Brain: A Journal of Neurology, 122, 1469–1493. 10.1093/brain/122.8.1469 [DOI] [PubMed] [Google Scholar]
- Remijnse P. L., Nielen M. M. A., van Balkom A. J. L. M., Cath D. C., van Oppen P., Uylings H. B. M., & Veltman D. J. (2006). Reduced orbitofrontal-striatal activity on a reversal learning task in obsessive-compulsive disorder. Archives of General Psychiatry, 63, 1225–1236. 10.1001/archpsyc.63.11.1225 [DOI] [PubMed] [Google Scholar]
- Riceberg J. S., & Shapiro M. L. (2012). Reward stability determines the contribution of orbitofrontal cortex to adaptive behavior. The Journal of Neuroscience, 32, 16402–16409. 10.1523/JNEUROSCI.0776-12.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robbins T. W., Vaghi M. M., & Banca P. (2019). Obsessive-compulsive disorder: Puzzles and prospects. Neuron, 102, 27–47. 10.1016/j.neuron.2019.01.046 [DOI] [PubMed] [Google Scholar]
- Roberts-Wolfe D. J., & Kalivas P. W. (2015). Glutamate transporter GLT-1 as a therapeutic target for substance use disorders. CNS & Neurological Disorders - Drug Targets, 14, 745–756. 10.2174/1871527314666150529144655 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robinson E. S. J., Dalley J. W., Theobald D. E. H., Glennon J. C., Pezze M. A., Murphy E. R., & Robbins T. W. (2008). Opposing roles for 5-HT2A and 5-HT2C receptors in the nucleus accumbens on inhibitory response control in the 5-choice serial reaction time task. Neuropsychopharmacology, 33, 2398–2406. 10.1038/sj.npp.1301636 [DOI] [PubMed] [Google Scholar]
- Roth B. L. (2011). Irving Page lecture: 5-HT(2A) serotonin receptor biology: Interacting proteins, kinases and paradoxical regulation. Neuropharmacology, 61, 348–354. 10.1016/j.neuropharm.2011.01.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rygula R., Clarke H. F., Cardinal R. N., Cockcroft G. J., Xia J., Dalley J. W., et al. Roberts A. C. (2015). Role of central serotonin in anticipation of rewarding and punishing outcomes: Effects of selective amygdala or orbitofrontal 5-HT depletion. Cerebral Cortex, 25, 3064–3076. 10.1093/cercor/bhu102 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Santana N., & Artigas F. (2017). Laminar and cellular distribution of monoamine receptors in rat medial prefrontal cortex. Frontiers in Neuroanatomy, 11, 87 10.3389/fnana.2017.00087 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Santana N., Bortolozzi A., Serrats J., Mengod G., & Artigas F. (2004). Expression of serotonin1A and serotonin2A receptors in pyramidal and GABAergic neurons of the rat prefrontal cortex. Cerebral Cortex, 14, 1100–1109. 10.1093/cercor/bhh070 [DOI] [PubMed] [Google Scholar]
- Schoenbaum G., Chiba A. A., & Gallagher M. (1999). Neural encoding in orbitofrontal cortex and basolateral amygdala during olfactory discrimination learning. The Journal of Neuroscience, 19, 1876–1884. 10.1523/JNEUROSCI.19-05-01876.1999 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schoenbaum G., Chiba A. A., & Gallagher M. (2000). Changes in functional connectivity in orbitofrontal cortex and basolateral amygdala during learning and reversal training. The Journal of Neuroscience, 20, 5179–5189. 10.1523/JNEUROSCI.20-13-05179.2000 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Schoenbaum G., & Setlow B. (2003). Lesions of nucleus accumbens disrupt learning about aversive outcomes. The Journal of Neuroscience, 23, 9833–9841. 10.1523/JNEUROSCI.23-30-09833.2003 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Stolyarova A., O’Dell S. J., Marshall J. F., & Izquierdo A. (2014). Positive and negative feedback learning and associated dopamine and serotonin transporter binding after methamphetamine. Behavioural Brain Research, 271, 195–202. 10.1016/j.bbr.2014.06.031 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Suzuki Y., Jodo E., Takeuchi S., Niwa S., & Kayama Y. (2002). Acute administration of phencyclidine induces tonic activation of medial prefrontal cortex neurons in freely moving rats. Neuroscience, 114, 769–779. 10.1016/S0306-4522(02)00298-1 [DOI] [PubMed] [Google Scholar]
- Svoboda J., Stankova A., Entlerova M., & Stuchlik A. (2015). Acute administration of MK-801 in an animal model of psychosis in rats interferes with cognitively demanding forms of behavioral flexibility on a rotating arena. Frontiers in Behavioral Neuroscience, 9, 75 10.3389/fnbeh.2015.00075 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Takahashi Y. K., Roesch M. R., Stalnaker T. A., Haney R. Z., Calu D. J., Taylor A. R., et al. Schoenbaum G. (2009). The orbitofrontal cortex and ventral tegmental area are necessary for learning from unexpected outcomes. Neuron, 62, 269–280. 10.1016/j.neuron.2009.03.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thompson S. M., Josey M., Holmes A., & Brigman J. L. (2015). Conditional loss of GluN2B in cortex and hippocampus impairs attentional set formation. Behavioral Neuroscience, 129, 105–112. 10.1037/bne0000045 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Thonnard D., Dreesen E., Callaerts-Vegh Z., & D’Hooge R. (2019). NMDA receptor dependence of reversal learning and the flexible use of cognitively demanding search strategies in mice. Progress in Neuro-Psychopharmacology & Biological Psychiatry, 90, 235–244. 10.1016/j.pnpbp.2018.12.003 [DOI] [PubMed] [Google Scholar]
- Tian S. W., Yu X. D., Cen L., & Xiao Z. Y. (2019). Glutamate transporter GLT1 inhibitor dihydrokainic acid impairs novel object recognition memory performance in mice. Physiology & Behavior, 199, 28–32. 10.1016/j.physbeh.2018.10.019 [DOI] [PubMed] [Google Scholar]
- Vallender E. J., Lynch L., Novak M. A., & Miller G. M. (2009). Polymorphisms in the 3′ UTR of the serotonin transporter are associated with cognitive flexibility in rhesus macaques. American Journal of Medical Genetics Part B: Neuropsychiatric Genetics, 150B, 467–475. 10.1002/ajmg.b.30835 [DOI] [PMC free article] [PubMed] [Google Scholar]
- van der Meulen J. A. J., Bilbija L., Joosten R. N. J. M. A., de Bruin J. P. C., & Feenstra M. G. P. (2003). The NMDA-receptor antagonist MK-801 selectively disrupts reversal learning in rats. NeuroReport, 14, 2225–2228. 10.1097/00001756-200312020-00018 [DOI] [PubMed] [Google Scholar]
- Walton M. E., Behrens T. E. J., Noonan M. P., & Rushworth M. F. S. (2011). Giving credit where credit is due: Orbitofrontal cortex and valuation in an uncertain world. Annals of the New York Academy of Sciences, 1239, 14–24. 10.1111/j.1749-6632.2011.06257.x [DOI] [PubMed] [Google Scholar]
- Waltz J. A., & Gold J. M. (2007). Probabilistic reversal learning impairments in schizophrenia: Further evidence of orbitofrontal dysfunction. Schizophrenia Research, 93, 296–303. 10.1016/j.schres.2007.03.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Watson D. J., & Stanton M. E. (2009). Medial prefrontal administration of MK-801 impairs T-maze discrimination reversal learning in weanling rats. Behavioural Brain Research, 205, 57–66. 10.1016/j.bbr.2009.07.026 [DOI] [PMC free article] [PubMed] [Google Scholar]
- West E. A., Forcelli P. A., McCue D. L., & Malkova L. (2013). Differential effects of serotonin-specific and excitotoxic lesions of OFC on conditioned reinforcer devaluation and extinction in rats. Behavioural Brain Research, 246, 10–14. 10.1016/j.bbr.2013.02.027 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Winstanley C. A., Theobald D. E. H., Dalley J. W., Glennon J. C., & Robbins T. W. (2004). 5-HT2A and 5-HT2C receptor antagonists have opposing effects on a measure of impulsivity: Interactions with global 5-HT depletion. Psychopharmacology, 176, 376–385. 10.1007/s00213-004-1884-9 [DOI] [PubMed] [Google Scholar]
- Zald D. H., Mchugo M., Ray K. L., Glahn D. C., Eickhoff S. B., & Laird A. R. (2014). Meta-analytic connectivity modeling reveals differential functional connectivity of the medial and lateral orbitofrontal cortex. Cerebral Cortex, 24, 232–248. 10.1093/cercor/bhs308 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhang G., & Stackman R. W. Jr. (2015). The role of serotonin 5-HT2A receptors in memory and cognition. Frontiers in Pharmacology, 6, 225 10.3389/fphar.2015.00225 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zhukovsky P., Alsiö J., Jupp B., Xia J., Giuliano C., Jenner L., et al. Dalley J. W. (2017). Perseveration in a spatial-discrimination serial reversal learning task is differentially affected by MAO-A and MAO-B inhibition and associated with reduced anxiety and peripheral serotonin levels. Psychopharmacology, 234, 1557–1571. 10.1007/s00213-017-4569-x [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.