Skip to main content
Nature Portfolio logoLink to Nature Portfolio
. 2021 Jun 30;26(12):7188–7199. doi: 10.1038/s41380-021-01194-y

Distinct roles for dopamine clearance mechanisms in regulating behavioral flexibility

Clio Korn 1, Thomas Akam 2, Kristian H R Jensen 1, Cristiana Vagnoni 2, Anna Huber 1, Elizabeth M Tunbridge 1,#, Mark E Walton 2,3,✉,#
PMCID: PMC8872990  EMSID: EMS127432  PMID: 34193974

Abstract

Dopamine plays a crucial role in adaptive behavior, and dysfunctional dopamine is implicated in multiple psychiatric conditions characterized by inflexible or inconsistent choices. However, the precise relationship between dopamine and flexible decision making remains unclear. One reason is that, while many studies have focused on the activity of dopamine neurons, efficient dopamine signaling also relies on clearance mechanisms, notably the dopamine transporter (DAT), which predominates in striatum, and catechol-O-methyltransferase (COMT), which predominates in cortex. The exact locus, extent, and timescale of the effects of DAT and COMT are uncertain. Moreover, there is limited data on how acute disruption of either mechanism affects flexible decision making strategies mediated by cortico-striatal networks. To address these issues, we combined pharmacological modulation of DAT and COMT with electrochemistry and behavior in mice. DAT blockade, but not COMT inhibition, regulated sub-second dopamine release in the nucleus accumbens core, but surprisingly neither clearance mechanism affected evoked release in prelimbic cortex. This was not due to a lack of sensitivity, as both amphetamine and atomoxetine changed the kinetics of sub-second release. In a multi-step decision making task where mice had to respond to reversals in either reward probabilities or the choice sequence to reach the goal, DAT blockade selectively impaired, and COMT inhibition improved, performance after reward reversals, but neither manipulation affected the adaptation of choices after action-state transition reversals. Together, our data suggest that DAT and COMT shape specific aspects of behavioral flexibility by regulating different aspects of the kinetics of striatal and cortical dopamine, respectively.

Subject terms: Neuroscience, Psychology

Introduction

Changes in behavioral flexibility are a central component of multiple psychiatric and neurological conditions, such as schizophrenia and substance use disorders, that involve dysfunctional dopamine transmission [1, 2]. However, the precise relationship between flexible behavior and patterns of dopamine signaling in different brain regions remains a matter of contention. Understanding how dopamine signaling regulates behavioral flexibility at different timescales and in different brain regions has important implications for the development of the more selective dopaminergic agents that are required. For example, in the case of schizophrenia, although antipsychotic drugs successfully antagonize the elevated striatal presynaptic dopamine function observed in patients [3], and thereby successfully treat psychosis, they have little or no effect on the cognitive dysfunction that patients experience, which is thought to result at least in part from insufficient dopaminergic transmission in the prefrontal cortex [4]. Similarly, although L-DOPA successfully remediates the motor symptoms of Parkinson’s disease, it has limited influence on—or may even impair—behavioral flexibility in this patient group [5].

Rapid and precisely-timed dopamine activity plays a crucial role in adaptive behavior, signaling how much better or worse an event is than predicted (a “reward prediction error”) [68]. Boosting or blocking these phasic signals can change the likelihood of repeating or switching away from a recent choice [9, 10]. However, effective dopamine signaling depends not just on dopamine neuron activity but also on clearance mechanisms, which are critical for the temporal and spatial regulation of dopamine’s actions. Crucially, clearance mechanisms vary across cortico-striatal networks implicated in flexible decision making [2]. Recycling via the dopamine transporter (DAT) predominates in nucleus accumbens (NAc) and other striatal regions [11], where it regulates the size and kinetics of dopamine transients as well as background dopamine tone [1113], while enzymatic degradation, particularly by catechol-O-methyltransferase (COMT), is more prominent in prefrontal cortex (PFC), where it has been shown to modulate evoked dopamine release measured over minutes [14, 15]. These clearance mechanisms are ideally placed to shape mesolimbic and mesocortical dopamine control over flexible decision making. Indeed, recent evidence suggests that manipulations of either the mesolimbic pathway to NAc or mesocortical pathway to medial PFC can have distinct effects on behavioral flexibility [1618]. Moreover, several lines of evidence indicate that cortical and striatal dopamine play synergistic and even opponent roles in shaping appropriate levels of flexibility and focus [1921].

While there have been a number of studies investigating how DAT and COMT shape reward pursuit and cognitive functions like working memory, respectively [2226], their role in regulating behavioral flexibility has been comparatively neglected. Individual variation in DAT or COMT function in humans has been shown to correlate with differences in reinforcement learning and flexible decision making [2733]. However, it is not clear precisely how markers of individual variation in transporter or enzyme function in humans relate to dopamine signaling in vivo. Notably, when the effects of targeted pharmacological or genetic manipulations of DAT have been investigated in animal models, deficits in reward learning or behavioral flexibility have been notably absent despite changes in motivation [22, 23, 34, 35], while evidence from animal models of COMT’s involvement in behavioral flexibility is limited [36]. Moreover, there is still much uncertainty over the locus, extent, and timescale of the effects of DAT and COMT, particularly concerning their regulation of phasic fluctuations in dopamine that produce reward prediction error-like signals in both striatum and cortex [17, 37]. Understanding these relationships is particularly critical given that novel COMT inhibitors are currently under development as putative novel agents for treating cognitive impairments [38].

To address these issues, we combined pharmacological modulation of DAT and COMT function with electrochemistry and behavior (see Fig. 1 for a schematic of the experiments). We first determined the effects of DAT blockade and COMT inhibition on fast fluctuations in dopamine in NAc using fast-scan cyclic voltammetry (FCV) and extended this approach for the first time to understand the effects of DAT and COMT on stimulated release in PFC. We then studied the effects of each clearance mechanism on motivation-to-work and behavioral flexibility using, first, an effort-based operant task and, second, a multi-step decision-making task that dissociated changes in the optimal response strategy to reach a goal from changes in reward probabilities at the goal location. We predicted that changes in motivation-to-work would reflect DAT’s and COMT’s capacity to modulate NAc dopamine. However, given the interplay of cortico-striatal systems and dopamine in behavioral flexibility, we anticipated that both DAT blockade and COMT inhibition would influence how quickly animals responded to changes on the multi-step decision making task, potentially either by changing the strategies the animals use or by modulating how they update their behavior in response based on their recent choices and outcomes. Moreover, based on the evidence that dopamine is predominantly activated by reward prediction errors over sensorimotor surprise [39], we anticipated that any influence of DAT or COMT might be selective to situations where there is a need to update choice strategies following a change in reward value, but not, for example, when needing to update information about the appropriate action sequence to reach the high value reward option.

Fig. 1.

Fig. 1

Schematic of the study design and details of the primary hypotheses to be tested.

Methods

See Supplementary Information for detailed methods.

Animals

For details of animal numbers, see figures and Supplementary Information. Adult male C57BL/6 mice were used because COMT exhibits sexually dimorphic effects [40]. Mice were maintained at >85% free feeding weight for all behavioral experiments, except for the sucrose preference test where they were water deprived for 3 h prior to test sessions. Food and water were otherwise provided ad libitum. All experiments were conducted in accordance with the UK Home Office Animals (Scientific Procedures) Act 1986 and the local ethical review board at the University of Oxford.

Drugs

In all experiments mice were administered the selective COMT inhibitor tolcapone (30 mg/kg; TRC Inc) [25, 41] and/or the selective DAT blocker GBR-12909 dihydrochloride (6 mg/kg, Tocris) [42]. These doses are effective at altering behavior without causing motor stereotypies (see Supplementary Information). D-amphetamine (in sulfate formulation; Tocris) was used at 4 mg/kg [43], and the NET blocker atomoxetine (in hydrochloride formulation; Tocris) at 1 mg/kg [44] in the voltammetry experiment only. All drugs were dissolved in 20% hydroxypropyl-beta-cyclodextrin (Acros Organics) in 0.9% saline (AquPharm), which served as a vehicle control in all experiments. All drugs were delivered by intraperitoneal injection.

Fast-scan cyclic voltammetry (FCV)

Carbon fiber electrodes targeting NAc core or prelimbic PFC were implanted and voltammetric recordings were made as previously described [45]. Catecholamine release was induced by electrical stimulation of the ventral tegmental area (VTA) (60 biphasic 2 ms pulses, 50 Hz, 300 μA), based on literature [46, 47] and on pilot experiments. Following a 30 min pre-drug baseline period, tolcapone or vehicle was administered. GBR-12909 or vehicle was then given after a further 90 min of recording. To determine whether it was possible to detect pharmacologically-induced changes in VTA-evoked transients in medial PFC using FCV, amphetamine was tested either after the last injection or in naïve animals, and atomoxetine was administered to drug naïve animals as a positive control. Data were obtained from 57 mice (n = 11–14 per drug group in each brain region for DAT blockade and COMT inhibition experiments, n = 5–10 for positive control experiments, see Supplementary Table 1 for numbers in each treatment group).

Behavioral tasks

Progressive ratio (PR) task

To investigate the influence of DAT blockade or COMT inhibition on motivation-to-work and response vigor, mice were trained on a PR task. During PR sessions, the number of active lever presses required to obtain each subsequent reward (60 µL of 10% sucrose solution) was increased according to the following equation: required lever presses = 5*ei*0.16–5 (“i”: trial number). Drug effects were assessed by giving mice two systemic injections prior to PR test sessions: tolcapone or vehicle (either 105 or 120 min before testing), followed by GBR-12909 or vehicle (15 or 60 min before testing). Each mouse received all possible drug combinations, including combined DAT blockade and COMT inhibition, over four PR test sessions according to a counterbalanced within-subjects design.

Multi-step decision making task

To investigate the influence of DAT blockade or COMT inhibition on reinforcement learning and behavioral flexibility, mice were trained on a multi-step decision making task adapted from the two-step task developed by Daw et al. [48] for dissociating model-based and model-free reinforcement learning in humans, as reported in [49, 50] (see Supplementary Table 2 for training steps). On each trial, mice chose between a top and a bottom nose poke, causing either the left or right reward port to light up, selection of which caused reward (20% sucrose) to be delivered on a probabilistic schedule. At any point in time, a particular first-step action (top or bottom) usually led to a particular second-step state (left or right port active) (“common” transitions, 80% of trials), though sometimes led to the opposite state (“rare” transitions, 20% of trials). Similarly, one reward port had a high probability of giving reward (0.8) and the other a low reward probability (0.2).

Unlike other recent rodent adaptations of multi-step decision tasks [5153], both the reward probabilities in the second-step states and the transition probabilities linking the first-step actions to the second-step states reversed in blocks (20 trials after an exponential moving average of choices [tau = 8 trials] crossed a 75% correct threshold, leading to either a reward or transition reversal).

Once animals had acquired the task, pharmacological manipulations were performed. Tolcapone or its vehicle were administered 90 min prior to sessions; GBR-12909 or its vehicle were administered 15 min prior to sessions. Each subject received a total of eight separate sessions of each drug and five sessions of each vehicle injection, order counterbalanced across animals.

Data analysis

Detailed analysis methodologies are reported in Supplementary Information. Full statistical results are presented in figure legends and Supplementary Information. The datasets are available from the corresponding author on reasonable request.

In brief, for the FCV experiments, we investigated whether DAT blockade or COMT inhibition affected dopamine release and kinetics of uptake in NAc or PFC by examining: (i) peak height; (ii) latency to peak; and (iii) rate of decay to half peak height (T50) separately in each region. For the primary experiments, the significance of drug effects on each parameter was determined using repeated-measures ANOVAs, with time as the within-subjects factor and drug 1 (tolcapone or vehicle) and drug 2 (GBR-12909 or vehicle) as between-subjects factors. In addition, Bayesian independent samples t tests were performed to compare evidence that the vehicle and drug groups were different post-administration. For the positive control experiments, the drug group (amphetamine or atomoxetine) was directly compared to vehicle using independent samples t tests. Full statistics are presented in Supplementary Tables 35.

For the PR task, the main outcome measure was cumulative active lever presses over the session as a measure of motivation, though we also examined inactive lever presses and average reward collection and re-engagement latencies. The significance of drug effects on each of these measures was assessed using a repeated-measures ANOVA, with drug 1 (tolcapone or vehicle) and drug 2 (GBR-12909 or vehicle) as within-subjects factors.

For the multi-step task, we focused on the effect of DAT blockade or COMT inhibition on motivational engagement and flexible updating. We examined drug effects on (i) trial rate, (ii) second-step reaction times (divided by common/rare), and (iii) choices pre-/post-reversal (separated by reward- or transition-reversal) using repeated-measures ANOVAs with within-subjects’ factors of vehicle/drug and GBR-12909/tolcapone. To get a more fine-grained picture of how adaptation to reversals was affected by the drugs, we also calculated post-reversal choice probability trajectories (see Supplementary Materials for details) and used permutation testing to evaluate differences between each drug and its corresponding vehicle. Finally, the statistical significance of trial-to-trial learning, and its modulation by drug treatment, was assessed using a logistic regression model and a hierarchical regression was fit using a mixed-effects package (lme4).

Results

DAT, but not COMT, influences fast dopamine fluctuations in NAc

We first wanted to extend our understanding of how DAT and COMT shape dynamics of catecholamine transmission in striatum and cortex by using FCV to assess the size and timing of release events at sub-second resolution. This is important because the precision of dopamine transients is thought to be critical for appropriate reward learning and behavioral flexibility and because most previous studies have assessed the roles of DAT and COMT on dopamine transmission at relatively slow (minute-level) timescales [5458].

We initially focused on VTA-evoked release in NAc core (Fig. 2, Figure S1), both to validate the effects of our stimulation parameters and chosen dose of GBR-12909 on striatal dopamine and to examine the influence of COMT on these signals. As expected, DAT blockade increased the magnitude and duration of dopamine transients: GBR-12909 increased the peak size of evoked release and slowed both the latency to peak and the rate of decay from peak (GBR-12909 × time interaction: all F > 6.5, p < 0.021, Fig. 2F, H, J). In contrast, COMT inhibition had no effect (all F < 2.4, p > 0.14) (Fig. 2E, G, I). When considered with previous microdialysis studies [54, 56, 58], this implies that only DAT recycling, and not COMT breakdown, plays a direct role in shaping the dynamics of fast dopamine transmission in NAc.

Fig. 2. Effects of COMT inhibition and DAT blockade on evoked dopamine release in the NAc.

Fig. 2

A Schematic of recording and stimulating electrodes in NAc core and VTA, respectively (see Figure S1 for precise locations). B Illustration of features of dopamine transients quantified during analysis. C Schematic of experiment structure. Drug injections are indicated below the timeline and time points of interest above it. D An example recording made in the NAc core. The pseudocolour plot shows the recorded current as a function of the applied potential of the triangular waveform (y axis) over an 11 s period (x axis); timing and duration of stimulation indicated by thick black bar. The trace above the pseudocolour plot shows the extracted current attributable to dopamine release as a function of time. The cyclic voltammogram (current versus applied voltage; boxed inset) during the peak of the signal, 1.5 s after the time of stimulation, is consistent with dopamine release. E Evoked dopamine release in NAc in animals given the COMT inhibitor as drug 1 (red) compared to release in those given vehicle as drug 1 (black) 85 min after the first injection. Release is normalized to the average pre-drug baseline peak height (equivalent to 100% on the y axis), binned over 15 min centered on the time point of interest, and presented as mean ± SEM across animals within each drug group. Timing and duration of stimulation indicated by thick black bar. F As in E, but comparing release in animals given the DAT blocker as drug 2 (blue) with release in those given vehicle as drug 2 (black) 30 min after the second injection. GJ Influence of COMT inhibition or DAT blockade on dopamine release kinetics, measured as a change from pre-drug baseline. G Quantification of the peak height of evoked dopamine release in NAc following administration of the COMT inhibitor. Left: peak height at the same time point shown in E. Each animal’s data is shown individually and is normalized to its average pre-drug baseline peak height (equivalent to 100% on the y axis). Box plots show median and 25th and 75th percentiles; whiskers extend from the minimum to maximum value. Right: normalized peak height (mean ± SEM for each drug group) of dopamine transients induced by VTA stimulation, recorded every 5 min over the 90 min following the first injection in animals that received the COMT inhibitor compared with those that received vehicle as drug 1. H As in G, but comparing data from animals that received the DAT blocker with data from those that received vehicle as drug 2; right-hand plot shows peak height over the 90 min following the second injection. I As in the left-hand plot of G, but showing the quantification of the latency from stimulation to peak (left) and of the decay from the peak to T50 (right). Decay constant data are shown on a log10 scale for clarity. J As in I, but for the DAT blocker. Statistical analysis: COMT inhibition did not alter the size or kinetics of evoked release (all F < 2.4, p > 0.14; 95% CI for difference between tolcapone and vehicle: release: −27.22 to 23.84; latency: −15.99 to 8.45; decay: −18.3 to 45.38). DAT blockade increased the size of evoked release compared to vehicle (GBR-12909 x time interaction: F1,19 = 18.1, p = 0.0004; main effect of GBR-12909: F1,19 = 7.8, p = 0.011; main effect of time: F1,19 = 11.5, p = 0.003). Post-hoc tests confirmed significantly greater evoked release following administration of the DAT blocker than vehicle (p = 0.002, 95% CI for difference: 26.32 to 104.66). DAT blockade also slowed the latency to peak compared to vehicle (GBR-12909 × time interaction: F1,19 = 12.6, p = 0.002; main effect of GBR-12909: F1,19 = 7.8, p = 0.011; main effect of time: F1,19 = 6.3, p = 0.021; 95% CI for difference between GBR-12909 and vehicle: 12.59 to 48.58). In addition, there was an interactive effect of the DAT blocker and time on the rate of decay from peak (F1,19 = 6.5, p = 0.020). Post-hoc tests confirmed the selective effect of the DAT blocker on the latency to peak (p = 0.002) and showed a trend level effect of DAT blockade on post-peak kinetics compared to vehicle (p = 0.078, 95% CI for difference −82.15 to 4.78), with a difference in the rate of decay in the DAT blockade group before and after receiving the drug (p = 0.007) that was not seen in the vehicle group (p = 0.565).

Neither DAT nor COMT affect fast catecholamine fluctuations in PFC

We then examined VTA-evoked release in prelimbic PFC (Fig. 3, Fig. S2). Transients here were in general considerably smaller than in NAc (pre-drug: NAc 692.2 ± 234.87 nM; PFC 112.05 ± 13.71 nM, mean ± SEM: Fig. 3B). Unexpectedly, we found little reliable influence of either DAT blockade (Fig. 3D, F, H) or COMT inhibition (Fig. 3C, E, G) on the size or kinetics of evoked release in PFC (all main effects or interaction with drug: F < 2.67, p > 0.12, except GBR-12909 × time interaction F1,21 = 6.0, p = 0.023). The lack of observed effect was not because our approach was insufficiently sensitive to detect drug-induced changes in PFC: both amphetamine, an indirect sympathomimetic known to potentiate dopamine (Fig. 3I), and atomoxetine, a norepinephrine transporter (NET) blocker (Fig. S3), changed VTA-evoked release kinetics (all p < 0.031) (see Fig. S4 for comparison of the effect of each pharmacological treatment on stimulation-evoked dopamine levels). Therefore, although COMT is implicated in shaping minute-by-minute dopamine tone in frontal cortex [36, 57, 59], our data indicate that neither it nor DAT regulates the release and kinetics of fast catecholamine transmission, induced here by periodic brief VTA stimulation over a 90 min period, in this region.

Fig. 3. Effects of COMT inhibition and DAT blockade on evoked dopamine release in the PFC.

Fig. 3

A Schematic of recording and stimulating electrodes in prelimbic PFC and the VTA, respectively (see Fig. S2 for precise histology). B An example recording made in the prelimbic PFC. The pseudocolour plot shows the recorded current as a function of the applied potential of the triangular waveform (y axis) over a 6 s period (x axis); timing and duration of stimulation indicated by thick black bar. The trace above the pseudocolour plot shows the extracted current attributable to dopamine release as a function of time. The cyclic voltammogram (current versus applied voltage; boxed inset) during the peak of the signal, 1.4 s after the time of stimulation, is consistent with catecholamine release. C Evoked dopamine release in the PFC in animals given the COMT inhibitor as drug 1 (red) compared to release in those given vehicle as drug 1 (black) 85 min after the first injection. Release is normalized to the average pre-drug baseline peak height (equivalent to 100% on the y axis), binned over 15 min centered on the time point of interest, and presented as mean ± SEM across animals within each drug group. Timing and duration of stimulation indicated by thick black bar. D As in C, but comparing release in animals given the DAT blocker as drug 2 (blue) with release in those given vehicle as drug 2 (black) 30 min after the second injection. EH Influence of COMT inhibition or DAT blockade on dopamine release kinetics, measured as a change from pre-drug baseline. E Quantification of the peak height of evoked dopamine release in PFC following administration of the COMT inhibitor. Left: peak height at the same time point shown in C. Each animal’s data is shown individually and is normalized to its average pre-drug baseline peak height (equivalent to 100% on the y axis). Box plots show median and 25th and 75th percentiles; whiskers extend from the minimum to maximum value. Right: normalized peak height (mean ± SEM for each drug group) of dopamine transients induced by VTA stimulation, recorded every 5 min over the 90 min following the first injection in animals that received the COMT inhibitor compared with those that received vehicle as drug 1. F As in E, but comparing data from animals that received the DAT blocker with data from those that received vehicle as drug 2; right-hand plot shows peak height over the 90 min following the second injection. G As in the left-hand plot of E, but showing the quantification of the latency from stimulation to peak (left) and of the decay from the peak to T50 (right). H As in G, but for the DAT blocker. l The effect of amphetamine (magenta) on evoked dopamine release in PFC. Left: schematic of experiment structure. Middle: evoked dopamine release normalized to the average pre-drug baseline peak height, binned over 15 min centered on the time point of interest (30 min after drug injection), and presented as mean ± SEM across animals within each group. Right: quantification of the signal decay, with each animal’s data shown individually and normalized to its average pre-drug baseline decay constant. Statistical analysis: Neither COMT inhibition nor DAT blockade altered any measure of evoked release in PFC (all main effects or interactions with drug: F < 2.67, p > 0.12, with the exception of a GBR-12909 × time interaction on post-peak signal decay (F1,21 = 6.0, p = 0.023), caused by small changes in the slope pre- and post-drug in the two groups, which likely reflects the difficulty of performing an exponential fit on small cortical transients rather than a drug effect, as there was no difference between vehicle and GBR-12909 after drug administration (p = 0.70); 95% CI for difference between tolcapone v vehicle: release: −21.55 to 12.59; latency: −13.75 to 12.85; decay: −65.85 to 56.25; for GBR12909 v vehicle: release: −6.36 to 28.21; latency: −18.10 to 36.91; decay: −84.48 to 58.01). Subsequent follow up analyses using Bayesian independent samples t tests comparing effects of each drug v vehicle also showed no evidence in support of a difference between the groups on any measure (tolcapone: all BF10 < 0.49; GBR-12909: all BF10 < 0.71). By contrast, amphetamine both increased the peak height (independent samples t test: t15 = −4.6, p = 0.0003; 95% CI for difference: 37.33 to 101.16; Bayesian independent samples t test: BF10 = 70.79) and decreased the decay constant value (t15 = 2.4, p = 0.030; 95% CI for difference: −8.12 to −140.09; Bayesian independent samples t test: BF10 = 2.42) compared to vehicle.

DAT blockade, but not COMT inhibition, increases motivation to work for reward

We investigated how differential regulation of dopamine clearance by DAT and COMT influences motivation to work for reward. While striatal dopamine and DAT have been linked to motivational drive by numerous previous studies [2224, 60], much less is known about COMT’s influence [54, 58].

We therefore first examined the separate and interactive effects of DAT and COMT on motivation to work for reward on a progressive ratio (PR) task (Fig. 4). DAT blockade increased motivational drive indexed by willingness to work for reward and sustained engagement. It selectively increased active lever presses (main effect of GBR-12909: F1,19 = 26.4, p < 0.001, Fig. 4C, D) and speeded re-engagement latencies (main effect of GBR-12909: F1,19 = 15.4, p = 0.001, Fig. 4E, F). In contrast, there were no reliable effects of COMT inhibition, either in isolation or in interaction with DAT blockade, on any measures in the PR task (all F < 3.1, p > 0.09, Fig. 4C–F).

Fig. 4. Effects of DAT blockade and COMT inhibition on the progressive ratio (PR) task.

Fig. 4

A within-subjects design was used to test drug effects; n = 21. A Depiction of the progression across trials of lever press ratios required to obtain reward during PR sessions. B Schematic of experiment structure. Drug injections are indicated below the timeline. Timing of drug administration shown is for cohort 2 (see Supplementary Methods). C Number of responses on the active lever during the PR session under the four drug 1 & drug 2 treatment conditions: vehicle & vehicle (black), COMT inhibitor & vehicle (red), vehicle & DAT blocker (blue), and COMT inhibitor & DAT blocker (purple). Each data point shows the lever press session total for one animal. Boxplots show median and 25th and 75th percentiles; whiskers extend from the minimum to maximum value. Lever press data are shown on a log10 scale for clarity. D As in C, but for responses on the inactive lever. Data points again show lever press session totals for individual animals and are displayed on a log10 scale. E As in C, but for the latency to collect reward following its delivery. Each data point shows the cross-trial average latency for one animal. F As in C, but for the latency to re-engage with the task by recommencing lever pressing following the consumption of reward. Data points show cross-trial average latencies for individual animals. Statistical analysis: DAT blockade increased active lever presses (main effect of GBR-12909: F1,19 = 26.4, p = 6 × 10−5) and speeded task performance (main effect of GBR-12909 on re-engagement latency: F1,19 = 15.4, p = 0.001; main effect of GBR-12909 on reward collection latency: F1,19 = 4.1, p = 0.056) but did not affect inactive lever presses (F1,19 = 2.7, p = 0.118). COMT inhibition did not alter PR task performance, either on its own or when combined with DAT blockade (all F < 3.1, p > 0.09).

DAT and COMT have distinct effects on value updating during multi-step decision making

The results of our voltammetry and PR experiments largely reinforce traditional views of DAT’s and COMT’s roles in dopamine transmission and behavior, emphasizing DAT’s role in shaping striatal dopamine transmission and aspects of motivational drive. However, given the importance of precisely-regulated dopamine transmission for appropriate reward-guided learning and flexible decision making, we hypothesized that DAT and COMT might affect these processes in the context of a more complex decision-making task likely to recruit both striatal and cortical circuitry (Fig. 5). Unlike other similar rodent multi-step paradigms [5153], our task included reversals in both reward and action-state transition probabilities (Fig. 5C). Crucially, this allowed us to examine the influence of DAT and COMT over different aspects of behavioral flexibility: how efficiently mice respond either when the location of high probability reward changes (reward reversals) or when the initial choice required to reach the high probability reward changes (transition reversals).

Fig. 5. Effects of DAT blockade and COMT inhibition on performance during a multi-step decision making task.

Fig. 5

A within-subjects design was used to test drug effects; n = 8, 26 vehicle/drug sessions per mouse (8 × GBR-12909, 8 × tolcapone, 5 × each drug’s vehicle), average range = 434–799 trials per session per mouse. A Diagram of two-step task apparatus and trial events. B Diagram of the task state space; reward and transition probabilities are shown for one block type. C Example session. Top panel: black line shows exponential moving average of subjects’ choices (tau = 8 trials). The correct choice (high or low) for each block is indicated by the gray bars. Middle panel: reward probabilities for each block for the left (red) and right (blue) sides. Bottom panel: transition probabilities linking first step actions to second step states; yellow line shows probability of top→left and bottom→right transitions, purple line shows probability of top→right and bottom→left transitions. Colors in middle and bottom panel of C match those in B. D, F Trial rate as a function of session time (D DAT blockade; F COMT inhibition). Trial rates were smoothed with a Gaussian of 5 min standard deviation. Shaded areas show cross-subject SEM. E, G Median second-step reaction times following common and rare transitions (E DAT blockade; G COMT inhibition). The second-step reaction time is the time from choosing high or low to entering the active side port at the second step. Error bars show cross subject SEM. HK Choice probability trajectories around reversals in reward probabilities (H DAT blockade, J COMT inhibition) and transition probabilities (I DAT blockade, K COMT inhibition). Pale lines show cross-subject mean choice probability trajectory and dark lines show double exponential fit. L, M Probability of repeating the first step choice on the next trial as a function of the trial outcome (rewarded or not) and experience state transition (common or rare) (L DAT blockade, M COMT inhibition). Statistical analysis: DAT blockade, but not COMT inhibition, increased trial rate (main effect of veh/drug: F1,7 = 9.79, p = 0.016; veh/drug × DAT/COMT interaction: F1,7 = 6.52, p = 0.038; post-hoc paired t tests, DAT: p = 0.012, COMT: p = 0.98). DAT blockade, but not COMT inhibition, also speeded second-step reaction times (main effect of veh/drug: F1,7 = 8.20, p = 0.024; veh/drug × DAT/COMT interaction: F1,7 = 7.76, p = 0.027), particularly following rare transitions (veh/drug × common/rare interaction: F1,7 = 9.26, p = 0.019; 3-way interaction did not quite reach significance: veh/drug × common/rare × DAT/COMT interaction: F1,7 = 4.35, p = 0.076). Neither DAT blockade nor COMT inhibition changed the choice rates at the end of each block prior to a reversal (all F < 1.59, p > 0.24). DAT blockade and COMT inhibition both modulated the speed of reward reversals, assessed by fitting a double exponential to the choice probability trajectory and using permutation testing to evaluate whether differences between drug and corresponding vehicle condition were significant (reward reversals: GBR-12909: p = 0.0012; tolcapone: p = 0.0496; transition reversals: all p > 0.23). In addition, direct comparison of post-reversal choices highlighted that the effect depended on whether the reward or transition probabilities reversed (veh/drug × DAT/COMT × reward/transition-reversal interaction: F1,7 = 6.88, p = 0.034; reversals in the reward probabilities: veh/drug × DAT/COMT interaction: F1,7 = 17.85, p = 0.004; reversals in transition probabilities: all F < 0.36, p > 0.56). A mixed effects logistic regression showed that the mice’s choices were sensitive to both the outcome (reward/no reward) and the transition structure (common/rare) (both p < 0.001). However, neither of these were reliably affected by drug condition (p > 0.086 for all interactions of outcome and transition with drug condition). Instead, the drug manipulations—and particularly DAT blockade—affected the regression model’s “correct” predictor, which captures a general tendency to repeat choices commonly associated with the high reward probability option (correct × veh/drug × DAT/COMT interaction: p = 0.016; DAT blockade, p = 0.056; COMT inhibition: p = 0.28). *, p < 0.05; **, p < 0.01.

Mice became proficient at the task, triggering reversals on average in <30 trials prior to drug administration, and performing on average 434–799 trials during each of 26 experimental sessions. We first examined the influence of DAT or COMT on willingness-to-work given that the actions incurred negligible effort-related cost. Consistent with our PR experiment, DAT blockade but not COMT inhibition increased the rate at which subjects performed trials (veh/drug × DAT/COMT interaction: F1,7 = 6.52, p = 0.038, Fig. 5D, F). There was also a selective speeding of second-step reaction times following DAT blockade, particularly following rare transitions (veh/drug × DAT/COMT interaction: F1,7 = 7.76, p = 0.027, veh/drug × common/rare interaction: F1,7 = 9.26, p = 0.019, Fig. 5E, G). These findings further support a selective role for DAT, but not COMT, in invigorating reward-guided responding.

We next assessed how DAT blockade and COMT inhibition affected choice performance, focusing on reward and transition reversals. Neither drug affected pre-reversal choice rates at the end of blocks (all F1,7 < 1.59, p > 0.24). Following reward reversals, DAT blockade significantly slowed (p = 0.0012) and COMT inhibition significant improved (p = 0.0496) adaptation, as measured by the choice probability trajectories on and off drug (Fig. 5H, J). Importantly, however, neither drug affected the speed of adaptation following reversals in transition probabilities (all p > 0.22) (Fig. 5I, J; Supplementary Tables 6, 7). To further examine the specificity of this reversal effect, we directly compared whether choices immediately after each type of reversal were selectively influenced by drug administration. This again highlighted that the effect of DAT blockade and COMT inhibition depended on whether the reward or transition probabilities reversed (veh/drug × DAT/COMT × reward/transition-reversal interaction: F1,7 = 6.88, p = 0.034). Specifically, the drugs selectively influenced adaptation after reversals in the reward probabilities (veh/drug × DAT/COMT interaction: F1,7 = 17.85, p = 0.004) but had no influence after reversals in transition probabilities (all F < 0.36, p > 0.56). Taken together, these data demonstrate that dopamine clearance mechanisms shape adaptive behaviors only when there is a change in reward location and not when the action sequence required to reach that reward reverses.

To gain insight into what might be causing these changes in behavioral flexibility, we investigated the specific influence of DAT and COMT on decision-making strategies and trial-to-trial learning. Consistent with previous work using the same task [50], mice were sensitive to the transition structure of the task, not simply being more likely to repeat any choice that was rewarded, as would be expected if they were exclusively using a “model-free” reinforcement learning strategy, but instead being more likely to repeat a rewarded choice after a common than a rare transition (Fig. 5L, M). A comparison of reinforcement learning models showed that mice’s choices were best explained by a model that employed both model-free and model-based control (see Supplementary Information for discussion of the mice’s strategies, Fig. S5, Supplementary Tables 8, 9). Moreover, a logistic regression analysis of the factors influencing choice repetition demonstrated a significant effect of both outcome and transition (p < 0.001 mixed effects logistic regression).

Importantly, and more unexpectedly, neither DAT nor COMT manipulations reliably modified these measures, as might be expected if either mechanism were simply disrupting trial-by-trial reward learning (p > 0.086 for all interactions of outcome and transition with drug condition). There was also no effect on choice or motor biases (p > 0.21). Instead, the drug manipulations—and particularly DAT blockade—affected the regression model’s “correct” predictor, which models a tendency to repeat choices to the first step option that has a higher probability of leading to reward at the end of the trial (correct × veh/drug x DAT/COMT interaction: p = 0.016). This predictor captures the influence of the extended history prior to the previous trial [49]. Together, these results imply that DAT blockade impairs reward-guided behavioral flexibility not by biasing decision making strategies or through a direct effect on reward learning, but instead by disrupting the cumulative effect of past choices and outcomes on reward-driven choices.

Discussion

Here we demonstrate distinct roles for DAT and COMT in flexible reward-guided decision making. FCV recordings confirmed a role for DAT recycling, but not COMT degradation, in regulating fast fluctuations in NAc dopamine transmission but showed that neither affected evoked transients in PFC, indicating that clearance mechanisms other than DAT and COMT—potentially including NET—contribute to regulation of cortical dopamine at sub-second timescales. Behaviorally, we confirmed that DAT blockade, but not COMT inhibition, increased motivational drive and promoted task engagement, as previously observed [22, 23]. Importantly, in line with evidence indicating distinct but interconnected relationships between striatal and cortical dopamine to enable appropriate levels of flexibility and focus [19], we additionally demonstrated specific and bidirectional roles for dopamine clearance mechanisms in behavioral flexibility, with DAT impairing and COMT causing a relative improvement in performance selectively following reversals in reward probabilities.

Previous studies have indicated that DAT can affect motivation but has limited, if any, influence over reward learning [2224, 29, 34, 60]. Here, we demonstrated that DAT not only modulates motivational performance and task engagement but, notably, can also influence behavioral flexibility. Specifically, DAT blockade selectively disrupted the ability to adapt following reversals in reward probabilities. Strikingly, the effect of DAT blockade was not observed following reversals in the transition probabilities linking first-step choices to second-step reward states, even though these also required animals to adapt their behavior (in this case, at the first-step choice) in order to maximize rewards obtained. Therefore, the effect cannot be attributed to a general behavioral inflexibility, a finding also supported by the lack of influence on any motor level biases in the logistic regression model. Instead, our data are consistent with DAT influencing reward-driven alternations in behavioral policies in addition to shaping motivational components of reward-guided behavior.

In agreement with previous studies [54, 56, 58], we found that DAT blockade both increased and extended evoked NAc dopamine transients. The effects were similar to those seen on spontaneous transients in freely-moving animals following administration of a catecholamine transporter blocker or stimulant drugs [13, 43]. In addition, we found no effect of DAT blockade on evoked release in prelimbic PFC, consistent with the sparse DAT expression in this region [61]. Therefore, DAT blockade predominantly exerts its effects through regulation of striatal dopamine.

Given that fast fluctuations in striatal dopamine correlate with reward prediction error signals, which are strongly linked to animals’ ability to form certain reward-related associations [39, 62], it may seem surprising that there is such limited evidence in animal models linking DAT to regulation of adaptive behavior. However, although the reward reversal impairment might initially seem to imply a change in reward learning, there was in fact no reliable evidence that DAT blockade caused alterations to specific trial-by-trial reinforcement learning or a change in reinforcement learning strategies. Instead, it appears that DAT more prominently helps to regulate how the history of past choices and outcomes influences behavioral policies, which can shape how quickly animals detect a change in reward value. Such a conclusion aligns with studies showing effects of dopamine on flexible responding that go beyond influences on trial-by-trial reinforcement learning. For instance, DAT knockdown mice, which have increased tonic dopamine (though, unlike in this study, reduced phasic transients), have reduced flexibility in response to changes in effort cost that has been linked to a reduced sensitivity to recent outcome history [63]. Similarly, a human DAT polymorphism specifically influenced perseverative errors during a probabilistic reward reversal task [30]. These findings imply that the behavioral consequences of DAT blockade on reward learning and behavioral flexibility will depend strongly on how changeable and uncertain rewards are in any context.

Research on COMT’s role in behavior has largely focused on cognitive functions; its significance for reward-guided behavior remains relatively unstudied [33]. We found that COMT inhibition had no effect on motivational aspects of behavior, consistent with its lack of effect on NAc dopamine [54, 64, 65]. More surprisingly, given robust associations between COMT activity and dopamine levels recorded over longer (min) timescales by microdialysis, there was no evidence that COMT inhibition had an effect on the size or kinetics of VTA-evoked transients in prelimbic PFC, at least within the sensitivity limits afforded by FCV for dopamine (~10 nM level of detection: [66]) and sample sizes used for these comparisons (n = 11–14) (although it is important to note that our data also only provide anecdotal evidence in favor of the null hypothesis that vehicle and tolcapone had equal effects on release, see Supplementary Table 4). This was not due to a lack of sensitivity to sub-second changes in cortex, as amphetamine increased VTA-evoked transient size and slowed the decay of cortical signals, while the NET blocker atomoxetine slowed signal decay in PFC. Taken together with previous studies, this points towards COMT having a selective influence in regulating cortical dopamine transmission over longer timescales than are required for precise reinforcement learning [36, 57, 59]. These results, however, contrast with a report of differences in dopamine overflow in COMT knockout mice [46]. This discrepancy may reflect different methodological and analytical approaches between this previous study and the one reported here (medial forebrain bundle stimulation combined with constant potential amperometry interspersed with brief switches to FCV in [34] compared with VTA stimulation combined with FCV and chemometric analysis in the current study). However, it may also speak to differences between the acute changes in COMT in adult animals induced by acute and reversible pharmacological inhibition, compared to the constitutive changes present in the knockout mouse, in which compensatory changes are possible [25]. Consistent with this proposal, human COMT genotype is associated with changes in midbrain dopamine function [67, 68], suggesting that the prolonged changes in dopaminergic transmission that presumably result from genetically-encoded differences in COMT activity may induce subtle functional changes elsewhere in the dopamine system. This highlights the need for a greater understanding of the timescales over which compensatory changes across cortical and subcortical dopamine systems occur.

Although COMT does not appear to regulate the kinetics of evoked dopamine transients, COMT inhibition nonetheless selectively speeded adaptation to reward reversals relative to vehicle injections. The precise mechanism by which COMT regulates reward-driven behavioral flexibility remains unclear, as there were no consistent changes in behavioral strategy or reinforcement learning that emerged from the analyses. Although acute pharmacological inhibition is different than the chronic changes in enzymatic activity arising from the human COMT Val/Met polymorphism [69], our findings are concordant with reports of increased sensitivity to rewards and faster reinforcement learning in Met allele carriers, who have lower COMT activity than Val allele homozygotes [70, 71]. Moreover, the absence of any detectable influence of COMT inhibition on cortical or striatal dopamine transients, similar to those that would be generated by reward prediction error-driven dopamine activity, implies that its role is not specifically to modulate how individual experiences shape learning but instead to regulate dopamine-driven cortical states over minute-by-minute timescales. Given that our data indicate that NET may be a key regulator of cortical dopamine signaling at fast (second-by-second) timescales, it will be interesting in future studies to investigate how NET blockers influence rapid reinforcement learning.

Together, our data suggest that DAT and COMT differentially regulate specific aspects of reward-based behavioral flexibility. Surprisingly, however, we found no evidence that either do this by influencing the balance of reinforcement learning strategies or shaping trial-by-trial learning. These findings demonstrate the importance of different regulators of dopamine signaling, operating over distinct timescales and in different brain regions, in shaping multiple aspects of adaptive reward-guided behavior beyond trial-by-trial reinforcement learning. In addition, they reinforce the idea that appropriate levels of behavioral flexibility, motivation, and engagement rely, at least in part, on the balance of striatal and cortical dopamine. Such data, therefore, highlight both the promise and the complexities associated with the development of novel pharmacological agents to remedy dopamine-dependent behavioral changes seen in patients with psychiatric disorders.

Supplementary information

Supplementary Material (7.5MB, pdf)

Acknowledgements

We thank Greg Daubney for performing histology; Richard Keithley for advice and assistance with chemometric analysis; and Katie Jennings, Paul Harrison, and Jaime McCutcheon for useful discussions about the data. This research was funded in whole, or in part, by the Wellcome Trust (4-year studentship WT096586 to CK, fellowships WT090051MA and 202831/Z/16/Z to MEW), as well as the Royal Society (EMT) and the BBSRC (project grant: BB/M024148/1). For the purpose of open access, the author has applied a CC BY public copyright licence to any Author Accepted Manuscript version arising from this submission.

Author contributions

CK, EMT, and MEW conceived the project and, with TA, designed the experiments. CK, KHJ, CV, and AH collected the data. CK and TA analyzed the data, and CK, TA, EMT, and MEW wrote the manuscript.

Competing interests

The authors have no competing interests to declare.

Footnotes

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

These authors contributed equally: Elizabeth M. Tunbridge, Mark E. Walton.

Supplementary information

The online version contains supplementary material available at 10.1038/s41380-021-01194-y.

References

  • 1.Izquierdo A, Jentsch JD. Reversal learning as a measure of impulsive and compulsive behavior in addictions. Psychopharmacology. 2012;219:607–20. doi: 10.1007/s00213-011-2579-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 2.Izquierdo A, Brigman JL, Radke AK, Rudebeck PH, Holmes A. The neural basis of reversal learning: an updated perspective. Neuroscience. 2017;345:12–26. doi: 10.1016/j.neuroscience.2016.03.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Howes OD, Kambeitz J, Kim E, Stahl D, Slifstein M, Abi-Dargham A, et al. The nature of dopamine dysfunction in schizophrenia and what this means for treatment. Arch Gen Psychiatry. 2012;69:776–86. doi: 10.1001/archgenpsychiatry.2012.169. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Goldman-Rakic PS, Castner SA, Svensson TH, Siever LJ, Williams GV. Targeting the dopamine D1 receptor in schizophrenia: insights for cognitive dysfunction. Psychopharmacology. 2004;174:3–16. doi: 10.1007/s00213-004-1793-y. [DOI] [PubMed] [Google Scholar]
  • 5.Swainson R, Rogers RD, Sahakian BJ, Summers BA, Polkey CE, Robbins TW. Probabilistic learning and reversal deficits in patients with Parkinson’s disease or frontal or temporal lobe lesions: possible adverse effects of dopaminergic medication. Neuropsychologia. 2000;38:596–612. doi: 10.1016/s0028-3932(99)00103-7. [DOI] [PubMed] [Google Scholar]
  • 6.Montague PR, Dayan P, Sejnowski TJ. A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J Neurosci. 1996;16:1936–47. doi: 10.1523/JNEUROSCI.16-05-01936.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Schultz W, Dayan P, Montague PR. A neural substrate of prediction and reward. Science. 1997;275:1593–9. doi: 10.1126/science.275.5306.1593. [DOI] [PubMed] [Google Scholar]
  • 8.Eshel N, Tian J, Bukwich M, Uchida N. Dopamine neurons share common response function for reward prediction error. Nat Neurosci. 2016;19:479–86. doi: 10.1038/nn.4239. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Steinberg EE, Keiflin R, Boivin JR, Witten IB, Deisseroth K, Janak PH. A causal link between prediction errors, dopamine neurons and learning. Nat Neurosci. 2013;16:966–73. doi: 10.1038/nn.3413. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Hamid AA, Pettibone JR, Mabrouk OS, Hetrick VL, Schmidt R, Vander Weele CM, et al. Mesolimbic dopamine signals the value of work. Nat Neurosci. 2016;19:117–26. doi: 10.1038/nn.4173. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Sulzer D, Cragg SJ, Rice ME. Striatal dopamine neurotransmission: regulation of release and uptake. Basal Ganglia. 2016;6:123–48. doi: 10.1016/j.baga.2016.02.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Budygin EA, Kilpatrick MR, Gainetdinov RR, Wightman RM. Correlation between behavior and extracellular dopamine levels in rat striatum: comparison of microdialysis and fast-scan cyclic voltammetry. Neurosci Lett. 2000;281:9–12. doi: 10.1016/s0304-3940(00)00813-2. [DOI] [PubMed] [Google Scholar]
  • 13.Robinson DL, Wightman RM. Nomifensine amplifies subsecond dopamine signals in the ventral striatum of freely-moving rats. J Neurochem. 2004;90:894–903. doi: 10.1111/j.1471-4159.2004.02559.x. [DOI] [PubMed] [Google Scholar]
  • 14.Karoum F, Chrapusta SJ, Egan MF. 3-Methoxytyramine is the major metabolite of released dopamine in the rat frontal cortex: reassessment of the effects of antipsychotics on the dynamics of dopamine release and metabolism in the frontal cortex, nucleus accumbens, and striatum by a simple two pool model. J Neurochem. 1994;63:972–9. doi: 10.1046/j.1471-4159.1994.63030972.x. [DOI] [PubMed] [Google Scholar]
  • 15.Tunbridge EM, Harrison PJ, Weinberger DR. Catechol-o-methyltransferase, cognition, and psychosis: val158met and beyond. Biol Psychiatry. 2006;60:141–51. doi: 10.1016/j.biopsych.2005.10.024. [DOI] [PubMed] [Google Scholar]
  • 16.Radke AK, Kocharian A, Covey DP, Lovinger DM, Cheer JF, Mateo Y, et al. Contributions of nucleus accumbens dopamine to cognitive flexibility. Eur J Neurosci. 2019;50:2023–35. doi: 10.1111/ejn.14152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Ellwood IT, Patel T, Wadia V, Lee AT, Liptak AT, Bender KJ, et al. Tonic or phasic stimulation of dopaminergic projections to prefrontal cortex causes mice to maintain or deviate from previously learned behavioral strategies. J Neurosci. 2017;37:8315–29. doi: 10.1523/JNEUROSCI.1221-17.2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Verharen JPH, de Jong JW, Roelofs TJM, Huffels CFM, van Zessen R, Luijendijk MCM, et al. A neuronal mechanism underlying decision-making deficits during hyperdopaminergic states. Nat Commun. 2018;9:731. doi: 10.1038/s41467-018-03087-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Cools R, D’Esposito M. Inverted-U shaped dopamine actions on human working memory and cognitive control. Biol Psychiatry. 2011;69:e113–25. doi: 10.1016/j.biopsych.2011.03.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Clarke HF, Cardinal RN, Rygula R, Hong YT, Fryer TD, Sawiak SJ, et al. Orbitofrontal dopamine depletion upregulates caudate dopamine and alters behavior via changes in reinforcement sensitivity. J Neurosci. 2014;34:7663–76. doi: 10.1523/JNEUROSCI.0718-14.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Roberts AC, Salvia MD, Wilkinson LS, Collins P, Muir JL, Everitt BJ, et al. 6-Hydroxydopamine lesions of the prefrontal cortex in monkeys enhance performance on an analog of the Wisconsin Card Sort Test: possible interactions with subcortical dopamine. J Neurosci. 1994;14:2531–44. doi: 10.1523/JNEUROSCI.14-05-02531.1994. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Cagniard B, Balsam PD, Brunner D, Zhuang X. Mice with chronically elevated dopamine exhibit enhanced motivation, but not learning, for a food reward. Neuropsychopharmacology. 2006;31:1362–70. doi: 10.1038/sj.npp.1300966. [DOI] [PubMed] [Google Scholar]
  • 23.Cagniard B, Beeler JA, Britt JP, McGehee DS, Marinelli M, Zhuang X. Dopamine scales performance in the absence of new learning. Neuron. 2006;51:541–7. doi: 10.1016/j.neuron.2006.07.026. [DOI] [PubMed] [Google Scholar]
  • 24.Peciña S, Cagniard B, Berridge KC, Aldridge JW, Zhuang X. Hyperdopaminergic mutant mice have higher “wanting” but not “liking” for sweet rewards. J Neurosci. 2003;23:9395–402. doi: 10.1523/JNEUROSCI.23-28-09395.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 25.Barkus C, Korn C, Stumpenhorst K, Laatikainen LM, Ballard D, Lee S, et al. Genotype-dependent effects of comt inhibition on cognitive function in a highly specific, novel mouse model of altered COMT activity. Neuropsychopharmacology. 2016;41:3060–9. doi: 10.1038/npp.2016.119. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Farrell SM, Tunbridge EM, Braeutigam S, Harrison PJ. COMT Val158Met genotype determines the direction of cognitive effects produced by catechol-O-methyltransferase inhibition. Biol Psychiatry. 2012;71:538–44. doi: 10.1016/j.biopsych.2011.12.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Colzato LS, Waszak F, Nieuwenhuis S, Posthuma D, Hommel B. The flexible mind is associated with the catechol-O-methyltransferase (COMT) Val158Met polymorphism: evidence for a role of dopamine in the control of task-switching. Neuropsychologia. 2010;48:2764–8. doi: 10.1016/j.neuropsychologia.2010.04.023. [DOI] [PubMed] [Google Scholar]
  • 28.Aarts E, Roelofs A, Franke B, Rijpkema M, Fernández G, Helmich RC, et al. Striatal dopamine mediates the interface between motivational and cognitive control in humans: evidence from genetic imaging. Neuropsychopharmacology. 2010;35:1943–51. doi: 10.1038/npp.2010.68. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Kaiser RH, Treadway MT, Wooten DW, Kumar P, Goer F, Murray L, et al. Frontostriatal and dopamine markers of individual differences in reinforcement learning: a multi-modal investigation. Cereb Cortex. 2018;28:4281–90. doi: 10.1093/cercor/bhx281. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.den Ouden HEM, Daw ND, Fernandez G, Elshout JA, Rijpkema M, Hoogman M, et al. Dissociable effects of dopamine and serotonin on reversal learning. Neuron. 2013;80:1090–100. doi: 10.1016/j.neuron.2013.08.030. [DOI] [PubMed] [Google Scholar]
  • 31.Krugel LK, Biele G, Mohr PNC, Li S-C, Heekeren HR. Genetic variation in dopaminergic neuromodulation influences the ability to rapidly and flexibly adapt decisions. Proc Natl Acad Sci USA. 2009;106:17951–6. doi: 10.1073/pnas.0905191106. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Doll BB, Bath KG, Daw ND, Frank MJ. Variability in dopamine genes dissociates model-based and model-free reinforcement learning. J Neurosci. 2016;36:1211–22. doi: 10.1523/JNEUROSCI.1901-15.2016. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Tunbridge EM, Huber A, Farrell MS, Stumpenhorst K, Harrison PJ, Walton ME. The role of catechol-O-methyltransferase in reward processing and addiction. CNS Neurol Disord-Drug Targets. 2012;11:306–23. doi: 10.2174/187152712800672409. [DOI] [PubMed] [Google Scholar]
  • 34.Costa VD, Tran VL, Turchi J, Averbeck BB. Dopamine modulates novelty seeking behavior during decision making. Behav Neurosci. 2014;128:556–66. doi: 10.1037/a0037128. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Seu E, Lang A, Rivera RJ, Jentsch JD. Inhibition of the norepinephrine transporter improves behavioral flexibility in rats and monkeys. Psychopharmacology. 2009;202:505–19. doi: 10.1007/s00213-008-1250-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Tunbridge EM, Bannerman DM, Sharp T, Harrison PJ. Catechol-O-methyltransferase inhibition improves set-shifting performance and elevates stimulated dopamine release in the rat prefrontal cortex. J Neurosci. 2004;24:5331–5. doi: 10.1523/JNEUROSCI.1124-04.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Hart AS, Rutledge RB, Glimcher PW, Phillips PEM. Phasic dopamine release in the rat nucleus accumbens symmetrically encodes a reward prediction error term. J Neurosci. 2014;34:698–704. doi: 10.1523/JNEUROSCI.2489-13.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Byers S, Buchler IP, DePasquale M, Rowley HL, Kulkarni RS, Pinder L, et al. Novel, non-nitrocatechol catechol-O-methyltransferase inhibitors modulate dopamine neurotransmission in the frontal cortex and improve cognitive flexibility. Psychopharmacology. 2020;237:2695–707. doi: 10.1007/s00213-020-05566-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Schultz W. Dopamine reward prediction-error signalling: a two-component response. Nat Rev Neurosci. 2016;17:183–95. doi: 10.1038/nrn.2015.26. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Harrison PJ, Tunbridge EM. Catechol-O-methyltransferase (COMT): a gene contributing to sex differences in brain function, and to sexual dimorphism in the predisposition to psychiatric disorders. Neuropsychopharmacology. 2008;33:3037–45. doi: 10.1038/sj.npp.1301543. [DOI] [PubMed] [Google Scholar]
  • 41.Männistö PT, Kaakkola S. Catechol-O-methyltransferase (COMT): Biochemistry, Molecular Biology, Pharmacology, and Clinical Efficacy of the New Selective COMT Inhibitors. Pharm Rev. 1999;51:593–628. [PubMed] [Google Scholar]
  • 42.Izenwasser S, Werling LL, Cox BM. Comparison of the effects of cocaine and other inhibitors of dopamine uptake in rat striatum, nucleus accumbens, olfactory tubercle, and medial prefrontal cortex. Brain Res. 1990;520:303–9. doi: 10.1016/0006-8993(90)91719-w. [DOI] [PubMed] [Google Scholar]
  • 43.Daberkow DP, Brown HD, Bunner KD, Kraniotis SA, Doellman MA, Ragozzino ME, et al. Amphetamine paradoxically augments exocytotic dopamine release and phasic dopamine signals. J Neurosci. 2013;33:452–63. doi: 10.1523/JNEUROSCI.2136-12.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Bymaster FP, Katner JS, Nelson DL, Hemrick-Luecke SK, Threlkeld PG, Heiligenstein JH, et al. Atomoxetine increases extracellular levels of norepinephrine and dopamine in prefrontal cortex of rat: a potential mechanism for efficacy in attention deficit/hyperactivity disorder. Neuropsychopharmacology. 2002;27:699–711. doi: 10.1016/S0893-133X(02)00346-9. [DOI] [PubMed] [Google Scholar]
  • 45.Syed ECJ, Grima LL, Magill PJ, Bogacz R, Brown P, Walton ME. Action initiation shapes mesolimbic dopamine encoding of future rewards. Nat Neurosci. 2016;19:34–6. doi: 10.1038/nn.4187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Yavich L, Forsberg MM, Karayiorgou M, Gogos JA, Mannisto PT. Site-specific role of catechol-o-methyltransferase in dopamine overflow within prefrontal cortex and dorsal striatum. J Neurosci. 2007;27:10196–209. doi: 10.1523/JNEUROSCI.0665-07.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Yorgason JT, Jones SR, España RA. Low and high affinity dopamine transporter inhibitors block dopamine uptake within 5 sec of intravenous injection. Neuroscience. 2011;182:125–32. doi: 10.1016/j.neuroscience.2011.03.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Daw ND, Gershman SJ, Seymour B, Dayan P, Dolan RJ. Model-based influences on humans’ choices and striatal prediction errors. Neuron. 2011;69:1204–15. doi: 10.1016/j.neuron.2011.02.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Akam T, Costa R, Dayan P. Simple plans or sophisticated habits? State, transition and learning interactions in the two-step task. PLOS Comput Biol. 2015;11:e1004648. doi: 10.1371/journal.pcbi.1004648. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Akam T, Rodrigues-Vaz I, Marcelo I, Zhang X, Pereira M, Oliveira RF, et al. The anterior cingulate cortex predicts future states to mediate model-based action selection. Neuron. 2021;109:1–15. doi: 10.1016/j.neuron.2020.10.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Groman SM, Massi B, Mathias SR, Curry DW, Lee D, Taylor JR. Neurochemical and behavioral dissections of decision-making in a rodent multi-stage task. J Neurosci. 2018;2219–18. [DOI] [PMC free article] [PubMed]
  • 52.Hasz BM, Redish AD. Deliberation and procedural automation on a two-step task for rats. Front Integr Neurosci. 2018;12:30. doi: 10.3389/fnint.2018.00030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Miller KJ, Botvinick MM, Brody CD. Dorsal hippocampus contributes to model-based planning. Nat Neurosci. 2017;20:1269–76. doi: 10.1038/nn.4613. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Budygin EA, Gainetdinov RR, Kilpatrick MR, Rayevsky KS, Männistö PT, Wightman RM. Effect of tolcapone, a catechol-O-methyltransferase inhibitor, on striatal dopaminergic transmission during blockade of dopamine uptake. Eur J Pharm. 1999;370:125–31. doi: 10.1016/s0014-2999(99)00084-9. [DOI] [PubMed] [Google Scholar]
  • 55.Carboni E, Silvagni A, Vacca C, Di, Chiara G. Cumulative effect of norepinephrine and dopamine carrier blockade on extracellular dopamine increase in the nucleus accumbens shell, bed nucleus of stria terminalis and prefrontal cortex. J Neurochem. 2006;96:473–81. doi: 10.1111/j.1471-4159.2005.03556.x. [DOI] [PubMed] [Google Scholar]
  • 56.Huotari M, Gainetdinov R, Männistö PT. Microdialysis studies on the action of tolcapone on pharmacologically-elevated extracellular dopamine levels in conscious rats. Pharm Toxicol. 1999;85:233–8. doi: 10.1111/j.1600-0773.1999.tb02014.x. [DOI] [PubMed] [Google Scholar]
  • 57.Lapish CC, Ahn S, Evangelista LM, So K, Seamans JK, Phillips AG. Tolcapone enhances food-evoked dopamine efflux and executive memory processes mediated by the rat prefrontal cortex. Psychopharmacology. 2009;202:521–30. doi: 10.1007/s00213-008-1342-1. [DOI] [PubMed] [Google Scholar]
  • 58.Raevskii KS, Gainetdinov RR, Budygin EA, Mannisto P, Wightman M. Dopaminergic transmission in the rat striatum in vivo in conditions of pharmacological modulation. Neurosci Behav Physiol. 2002;32:183–8. doi: 10.1023/a:1013931609942. [DOI] [PubMed] [Google Scholar]
  • 59.Käenmäki M, Tammimäki A, Myöhänen T, Pakarinen K, Amberg C, Karayiorgou M, et al. Quantitative role of COMT in dopamine clearance in the prefrontal cortex of freely moving mice: quantitative role of COMT in the prefrontal cortex. J Neurochem. 2010;114:1745–55. doi: 10.1111/j.1471-4159.2010.06889.x. [DOI] [PubMed] [Google Scholar]
  • 60.Yin HH, Zhuang X, Balleine BW. Instrumental learning in hyperdopaminergic mice. Neurobiol Learn Mem. 2006;85:283–8. doi: 10.1016/j.nlm.2005.12.001. [DOI] [PubMed] [Google Scholar]
  • 61.Sesack SR, Hawrylak VA, Matus C, Guido MA, Levey AI. Dopamine axon varicosities in the prelimbic division of the rat prefrontal cortex exhibit sparse immunoreactivity for the dopamine transporter. J Neurosci. 1998;18:2697–708. doi: 10.1523/JNEUROSCI.18-07-02697.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Flagel SB, Clark JJ, Robinson TE, Mayo L, Czuj A, Willuhn I, et al. A selective role for dopamine in stimulus–reward learning. Nature. 2011;469:53–7. doi: 10.1038/nature09588. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Beeler JA, Daw N, Frazier CRM, Zhuang X. Tonic dopamine modulates exploitation of reward learning. Front Behav Neurosci. 2010;4:170. doi: 10.3389/fnbeh.2010.00170. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 64.Acquas E, Carboni E, Ree RHA, Prada M, Chiara G. Extracellular concentrations of dopamine and metabolites in the rat caudate after oral administration of a novel catechol-o-methyltransferase inhibitor Ro 40–7592. J Neurochem. 1992;59:326–30. doi: 10.1111/j.1471-4159.1992.tb08907.x. [DOI] [PubMed] [Google Scholar]
  • 65.Garris PA, Wightman RM. Distinct pharmacological regulation of evoked dopamine efflux in the amygdala and striatum of the rat in vivo. Synapse. 1995;20:269–79. doi: 10.1002/syn.890200311. [DOI] [PubMed] [Google Scholar]
  • 66.Puthongkham P, Venton BJ. Recent advances in fast-scan cyclic voltammetry. Analyst. 2020;145:1087–102. doi: 10.1039/c9an01925a. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Akil M, Kolachana BS, Rothmond DA, Hyde TM, Weinberger DR, Kleinman JE. Catechol-O-methyltransferase genotype and dopamine regulation in the human brain. J Neurosci. 2003;23:2008–13. doi: 10.1523/JNEUROSCI.23-06-02008.2003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Meyer-Lindenberg A, Kohn PD, Kolachana B, Kippenhan S, McInerney-Leo A, Nussbaum R, et al. Midbrain dopamine and prefrontal function in humans: interaction and modulation by COMT genotype. Nat Neurosci. 2005;8:594–6. doi: 10.1038/nn1438. [DOI] [PubMed] [Google Scholar]
  • 69.Tunbridge EM, Narajos M, Harrison CH, Beresford C, Cipriani A, Harrison PJ. Which dopamine polymorphisms are functional? Systematic review and meta-analysis of COMT, DAT, DBH, DDC, DRD1–5, MAOA, MAOB, TH, VMAT1, and VMAT2. Biol Psychiatry. 2019;86:608–20. doi: 10.1016/j.biopsych.2019.05.014. [DOI] [PubMed] [Google Scholar]
  • 70.Corral‐Frías NS, Pizzagalli DA, Carré JM, Michalski LJ, Nikolova YS, Perlis RH, et al. COMT Val158Met genotype is associated with reward learning: a replication study and meta-analysis. Genes Brain Behav. 2016;15:503–13. doi: 10.1111/gbb.12296. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Lancaster TM, Heerey EA, Mantripragada K, Linden DEJ. Replication study implicates COMT val158met polymorphism as a modulator of probabilistic reward learning. Genes Brain Behav. 2015;14:486–92. doi: 10.1111/gbb.12228. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Material (7.5MB, pdf)

Articles from Molecular Psychiatry are provided here courtesy of Nature Publishing Group

RESOURCES