Abstract
Active reward pursuit is supported by the balance between the cognitive and habitual control of behavior. The cognitive, goal-directed strategy relies on prospective evaluation of anticipated consequences, which allows behavior to readily adapt when circumstances change. Repetition of successful actions promotes less cognitively taxing habits, in which behavior is automatically executed without prospective consideration. Disruption in either of these behavioral regulatory systems contributes to the symptoms that underlie many psychiatric disorders. Here, I review recently identified neural substrates, at multiple neural levels, that contribute to habits and outline gaps in knowledge that must be addressed to fully understand the neural mechanisms of behavioral control.
Two behavioral control systems contribute to active reward pursuit and decision making, one cognitive and one habitual. The cognitive, goal-directed system mediates behavior that relies on consideration of the learned relationship between actions and their consequences (Adams and Dickinson, 1981), allowing for flexible decision making. The habitual system allows behavior to be relatively automatically executed on the basis of prior success. Thus habits are elicited by antecedent events and do not involve prospective consideration of expected outcomes (Adams and Dickinson, 1981; Dickinson, 1985). Failure in either of these behavioral regulatory systems is characteristic of myriad psychiatric and neurodegenerative diseases. An overreliance on habit is especially associated with the compulsivity that manifests in several conditions (Voon et al., 2015; Gillan et al., 2016), including obsessive-compulsive disorder (Gillan et al., 2011; Gillan et al., 2014), schizophrenia (Griffiths et al., 2014; Morris et al., 2015), addiction (Belin et al., 2013), alcoholism (Corbit and Janak, 2016), and compulsive overeating (Corbit, 2016). On the other hand, failures in the acquisition and execution of habits is characteristic of Huntington’s (Heindel et al., 1988) and Parkinson’s disease (Agostino et al., 1996; Knowlton et al., 1996; Cohen and Pourcher, 2007). Therefore, it is necessary to understand the neural substrates of these behavioral regulatory systems and how these two, seemingly opposing, systems coordinate behavioral control.
The aim of this review is to highlight recently identified neural substrates, from the circuit level to the cellular level and down to the epigenetic level, that contribute to habits and to identify the gaps in knowledge that must be addressed to understand how behavioral control arises. Understanding how, for example, cortical input to the striatum alters not only striatal cell activity, but also the molecular and epigenetic mechanisms within those cells, will reveal how neuronal function is altered to give rise to the network activity that may direct behavioral output.
Identifying goal-directed actions and habits
Although instrumental behavior, in which voluntary actions are performed to cause a particular outcome, may start out goal directed, with repeated practice behavioral control can shift to the more automatic, habitual system to perform the exact same behavior (e.g., pressing a lever for a food pellet) more efficiently. Consequently, simply observing instrumental responding is insufficient to determine whether performance is being achieved via a goal-directed or habit strategy because the instrumental response itself is not indicative of whether prospective consideration of the outcome is taking place. A common approach to untangle these behavioral control systems is to devalue the outcome and subsequently examine behavioral responding (Dickinson, 1985). Behavioral responding is assessed in a non-reinforced probe test to ensure instrumental performance relies on the anticipated, rather than experienced, outcome value to guide behavior. This will demonstrate whether the goal (i.e. the outcome) is part of the representational structure guiding behavior. Goal-directed behaviors rely on the learned association between an action and the anticipated outcome. Therefore, responding will diminish if the anticipated outcome is no longer valued. Conversely, habits, which are defined in terms of their autonomy from goal representation, will continue regardless of whether the outcome is devalued. The presence of goal-directed control can also be revealed by degrading the contingency between an action and its outcome by freely delivering the outcome whether or not a response is performed, or by explicitly omitting the outcome following a response, both of which will decrease responding if behavior is goal-directed. Behaviors such as learned motor responses, motor-skills, procedural, and action-sequence learning have the overlapping feature of automaticity and share neural circuitry with habits as defined above, and thus may be related to, and sometimes defined as habits. Goal-directed behavior shares features with model-based reinforcement learning, where prospective consideration of potential outcomes drives flexible behavior, while habits have been proposed to be akin to model-free reinforcement learning, in which behavior is driven by retrospective events (Daw et al., 2005; Daw et al., 2011; Voon et al., 2015; Groman et al., 2019).
Neural circuits supporting goal-directed and habit behavior
Early studies demonstrating that animals, under some conditions (e.g., limited training), showed knowledge of anticipated outcomes, and under other conditions (e.g., extended training) showed evidence of habit (e.g., (Tolman and Gleitman, 1949)), led to the idea that these two forms of learning might be mediated in parallel by distinct neural mechanisms. Packard and McGaugh (1996) showed that following extended training on a cross-maze to reach a food outcome, in which a response habit strategy (i.e., body turn) eventually dominates over the goal-oriented, place-strategy (e.g., learned location within the room), inactivating the dorsolateral striatum at test prevented expression of the response habit strategy and exposed the conservation of the goal-oriented, place-strategy. These findings were among the first to establish that habits are learned concurrently with other types of goal-directed behaviors and mediated by different neural systems. This dissociability between the behavioral control systems has since been extended specifically to goal-directed and habitual instrumental behavior. There is now considerable converging evidence (reviewed previously (Balleine and O’Doherty, 2009; Burton et al., 2015; Malvaez and Wassum, 2018)) demonstrating the dorsolateral striatum (DLS) is engaged in and necessary for habit (Yin et al., 2004; Smith and Graybiel, 2013), while the dorsomedial striatum (DMS) is engaged in and necessary for goal-directed behavior (Yin et al., 2005a; Yin et al., 2005b; Thorn et al., 2010). Notably, the data supporting this dissociability has been collected across a variety of assays—lesion (Yin et al., 2004; Yin et al., 2005b), inactivation (Yin et al., 2005a, 2006; Corbit and Janak, 2010) and neural recording studies (Jog et al., 1999; Barnes et al., 2005; Jin and Costa, 2010; Thorn et al., 2010; Smith and Graybiel, 2013; Jin et al., 2014; Regier et al., 2015; Rueda-Orozco and Robbe, 2015)—and across species— mice (Hilario et al., 2012), rats (Yin et al., 2004, 2005a; Yin et al., 2005b; Corbit and Janak, 2010), non-human primates (Miyachi et al., 1997; Miyachi et al., 2002), and humans (Tricomi et al., 2009; Liljeholm et al., 2011; McNamee et al., 2015), but see (de Wit et al., 2018). This body of literature has, generally, set up a dichotomy between the DLS and DMS in the competition for behavioral control. However, more recent evidence suggests these systems may be engaged simultaneously throughout instrumental learning, suggesting more cooperative function (Li et al., 2016; Kupferschmidt et al., 2017; Malvaez et al., 2018). This section highlights neuroanatomical structures recently identified to play a crucial role in goal-directed and habit behavior.
Evidence suggests several cortical regions may influence behavioral control strategy. The DMS receives several inputs from limbic and prefrontal cortical regions (McGeorge and Faull, 1989) that have themselves been implicated in goal-directed behavior (Corbit and Balleine, 2003; Murray and Izquierdo, 2007; Ostlund and Balleine, 2007a, b; Corbit et al., 2013; Wilson et al., 2014; Bradfield et al., 2015; Schuck et al., 2016; Wikenheiser and Schoenbaum, 2016). Often, the neuronal activity or gray matter volumes of these regions are compromised in diseases characterized by an overreliance on habit (Voon et al., 2015; Fettes et al., 2017). The orbitofrontal cortex (OFC) displays the greatest change in neuronal firing rate during goal-directed control of behavior, but not habits (Gremel and Costa, 2013). Furthermore, new data has revealed that updating goal expectations triggers plasticity of OFC dendritic spines, the principle sites of excitatory synapses, and this plasticity is necessary for goal-directed behavior (Whyte et al., 2019). Inhibiting OFC excitatory activity (Gremel and Costa, 2013) or dendritic spine plasticity (Whyte et al., 2019) disrupts goal-directed behavior, together indicating the OFC is important for maintaining goal-related information.
Using a within-subject task that promotes the use of either habits or goal-directed behavior, depending on the training context, Gremel and colleagues (2016) showed that OFC projections specifically to the DMS are vital for goal-directed control. Inhibiting the activity of OFC terminals in the DMS prevents goal-directed learning and forces reliance on the habit strategy, even in the training context where the goal-directed strategy should dominate behavioral control (Gremel et al., 2016). Consistent with the disruption of goal-directed behavior as a consequence of reduced OFC output activity, chronic ethanol exposure decreases the excitability of OFC projection neurons, specifically reduces neurotransmitter release onto DMS neurons, and disrupts goal directed control of behavior (Renteria et al., 2018). That excitatory OFC output activity is essential for goal-directed behavior is in accordance with findings implicating the OFC in representing potential outcomes (Ostlund and Balleine, 2007a, b), including those needed for goal-directed instrumental behavior (Zimmermann et al., 2017; Zimmermann et al., 2018; Malvaez et al., 2019), and indicates that the OFC must communicate with the DMS for successful goal-directed behavior, otherwise habits emerge. Moreover, human subjects with lower gray matter volume in the medial OFC and striatum are biased toward model-free (habit) learning in the two-step sequential learning task, which measures whether choices are made using the predicted outcome of each choice (model-based, goal-directed learning) or the retrospective success of the preceding choice (model-free, habit learning) (Voon et al., 2015). However, this is not to imply that goal-directed behavior only needs OFC-DMS interactions. Indeed, DMS inputs from the basolateral amygdala (Corbit et al., 2013) and thalamus (Bradfield et al., 2013) are also necessary for the acquisition and expression of goal-directed behavior.
Relative to all we know about the various neural substrates of goal-directed action (for additional review see (Balleine and O’Doherty, 2010; Griffiths et al., 2014)), there are far fewer defined neural substrates of habits. This is largely due to the fact that neural substrates contributing to behavioral control have been typically identified through their necessity for goal-directed behavior; when a brain structure necessary for goal-directed behavior is compromised, habit strategies usurp behavioral control. Moreover, very little is known whether or how goal-directed circuit activity changes to allow adaptive habits to emerge.
Only a handful of studies have identified cortical structures that contribute to habit behavior. Some evidence suggests the infralimbic cortex (IL) is required not only for the acquisition of habit responses (Smith and Graybiel, 2013), but also for preserving habitual behavior. When habits are established, inhibiting the IL restores goal-directed behavior (Coutureau and Killcross, 2003; Smith et al., 2012). Using a within-session dual reinforcement schedule, one promoting goal-directed behavior and another promoting habit responding, it was demonstrated that, following unreinforced lever presses during the outcome devaluation test, IL firing activity is suppressed during goal-directed actions, but is sustained while responding on the habit-promoting reinforcement schedule (Barker et al., 2017). Selectively inhibiting IL activity immediately after each lever press when animals are using a habit strategy yields sensitivity to contingency degradation, essentially reverting animals back to goal-directed control of behavior (Barker et al., 2017). These findings suggest the IL is involved in the expression of habits, but it is unclear whether IL activity is achieving this role by suppressing goal-directed behavior, sustaining habitual responding, or some other strategy-selection process. Indeed, the IL is thought to suppress previously learned contingencies to promote habits (Coutureau and Killcross, 2003; Killcross and Coutureau, 2003; Smith et al., 2012; Barker et al., 2014). Considering the IL does not project to the DLS (Mailly et al., 2013), the canonical brain region supporting habits, it will be interesting to explore how the IL interacts with other striatal and/or cortical regions to direct behavioral control. It is unknown, for example, if IL projections to limbic regions implicated in behavioral inhibition are involved in the maintenance of habit behavior by suppressing goal-directed behaviors or if IL projections to the ventral striatum are influencing the incentive processes that modulate instrumental performance (Corbit et al., 2001).
Cell-type specific contributions to goal-directed and habit behavior
In addition to identifying the neuroanatomical structures contributing to behavioral control, some effort has been made to identify the cell-type specific contributions to instrumental control. Within each striatal subregion there are two classes of spiny projection neurons (SPNs), which are a type of GABAergic inhibitory cell that comprise ~95% of striatal neurons, that directly (dSPNs) or indirectly (iSPNs) project to basal ganglia output nuclei (Freeze et al., 2013). These pathways have traditionally been thought to work in opposition (Albin et al., 1989; Graybiel, 2000). Indeed, post-synaptic strength is augmented in dSPNs, but attenuated in iSPNs, in the DMS following the acquisition of goal-directed behavior (Shan et al., 2014). Furthermore, habit learning induces post-synaptic depression of iSPNs in the DLS (Shan et al., 2015). However, there is also evidence of coordinated pathway-specific activity. During the execution of well-trained actions, DLS dSPNs and iSPNs are both active (Cui et al., 2013) and necessary for the initiation and execution of well-trained action sequences (Tecuapetla et al., 2016). Moreover, both DLS pathways display strengthened responses to cortical input following habit training (O’Hare et al., 2016). These conflicting findings on pathway-specific activity (for review, see (Malvaez and Wassum, 2018)) may be reconciled by more comprehensive studies examining the subregion-specific contributions of each striatal pathway in habits.
Only recently, have investigations focused on the role of discrete striatal interneurons in the context of goal-directed versus habitual behavior (Figure 1). Striatal fast-spiking interneurons (FSIs), most prominent in the DLS (Gerfen, 1985), display increased excitability following habit training, firing more readily in response to excitatory input, and FSI activity in the DLS is necessary for the expression of habits (O’Hare et al., 2017). Moreover, DLS FSI activity was found to be necessary for establishing the DLS SPN properties that predict habit behavior (O’Hare et al., 2017). In the DMS, a reduction in FSIs has been implicated in the development of pathological repetitive behaviors via dysregulation of SPN baseline firing rates (Burguière et al., 2013). In a mouse model of compulsive behavior, OFC mediated activation of DMS FSIs restores inhibition of DMS SPN activity and alleviates repetitive behavior (Burguière et al., 2013). Together, these studies indicate FSIs may differentially regulate striatal microcircuits in a subregion-specific manner.
Figure 1. Striatal cell-types and potential interactions regulating behavioral control.
The dorsomedial (DMS) and dorsolateral striatum (DLS) both contain spiny projection neurons (SPNs), that directly (dSPNs) or indirectly (iSPNs) project to basal ganglia output nuclei, as well as various interneurons such as fast-spiking interneurons (FSIs), tyrosine-hydroxylase expressing interneurons (THINs), and cholinergic interneurons (CINs). These interneurons can modulate the activity of SPNs to differentially affect goal-directed and habitual behavior, however, several of the interactions between the specific cell-types have not been delineated in the context of goal-directed and habit behaviors.
Striatal microcircuits are also regulated by tyrosine-hydroxylase expressing interneurons (THINs) that receive excitatory thalamic and cortical input, as well as integrate dopaminergic and cholinergic signaling (Ibáñez-Sandoval et al., 2010; Ibáñez-Sandoval et al., 2015). Kaminer et al., (2019) found that THIN lesions throughout the dorsal striatum do not impair motivation to lever press for a food reward in the progressive ratio test, in which the response requirement increases for each successive reward, but do produce insensitivity to outcome devaluation (Kaminer et al., 2019). This impairment in goal-directed behavior was found to correlate with THIN lesions restricted to the more posterior striatum (Kaminer et al., 2019), which corresponds with the necessity of the posterior DMS, but not the anterior DMS, for goal-directed behavior (Yin et al., 2005b). A potential means by which THINs affect behavioral control is via integration of cholinergic signaling. Indeed, the activity of cholinergic interneurons in the DMS has previously been found to be necessary for updating action-outcome contingencies for effective goal-directed behavior (Bradfield et al., 2013). However, it is still unclear how striatal THIN activity is modulated by cortical and/or thalamic inputs during instrumental learning and whether this differs between dorsomedial and dorsolateral striatal subregions to coordinate behavioral control. Furthermore, given the known role of dopamine signaling in habit learning (Faure et al., 2005; Nelson and Killcross, 2006; Hernandez et al., 2013), it will be useful to investigate whether the effects of dopaminergic input to the striatum are mediated via striatal THINs.
Cellular signaling mechanisms of habits
Long-lasting changes in neuronal excitability are required to support instrumental learning and likely also behavioral control strategies. Recent studies have begun to elucidate the intracellular signaling mechanisms that contribute to such plasticity. Both dSPNs and iSPNs activity is altered by the activation of G-protein coupled receptors (GPCRs) (Wang et al., 2006; Shen et al., 2008), which relay incoming signals by either stimulating or inhibiting the activity of intracellular targets (Figure 2). Dopamine D1 and D2 receptors, located on dSPNs and iSPNS, respectively, are GPCRs that have opposing effects on the activity of the catalytic enzyme adenylyl cyclase and have both been implicated in habits. Antagonism of D1 receptors, which effectively reduces dSPN output, prevents the acceleration of habits, while antagonism of D2 receptors, which facilitates iSPN output, exacerbates habit formation (Nelson and Killcross, 2013). Adenosine A2A receptors (A2ARs) are GPCRs exclusively expressed on iSPNs that have also been implicated in behavioral control (Yu et al., 2009; Li et al., 2016). A2ARs contain the activating Gs α subunit, therefore, A2AR activation will lead to an increase of iSPN GABAergic output (Shen et al., 2008). In the DMS, activating A2AR signaling, effectively increasing iSPN output, during outcome delivery renders animals habitual (Li et al., 2016), while deleting A2ARs, effectively decreasing iSPN output, specifically in the DMS enhances goal-directed behavior (Li et al., 2016). This suggests A2AR-mediated signaling during the time of reward exerts control of instrumental behavior (Figure 2). Correspondingly, loss of goal-directed behavior induced by psychostimulant exposure can be restored by suppressing A2AR-mediated signaling in the DMS during goal-updating (Furlong et al., 2015).
Figure 2. Striatal output activity is regulated by G-protein coupled receptors and contributes to behavioral control.
In the dorsomedial striatum (DMS), G-protein coupled receptors are known to modulate spiny projection neuron (SPN) activity. (a) Adenosine A2A receptors (A2ARs), exclusively expressed in indirect output pathway spiny projection neurons (iSPNs), contain the activating G-protein (Gs). A2AR activation in the DMS increases the inhibitory activity of the indirect output pathway (iSPN), ultimately inhibiting goal-directed behavior. Conversely, inhibiting A2ARs in the DMS prevents the inhibitory iSPN output activity, thereby enhancing goal-directed behavior. (b) Excitatory input from the orbitofrontal cortex (OFC) increases DMS direct pathway spiny projection neuron (dSPN) output activity and is necessary for goal-directed behavior. (c) OFC pre-synaptic activity in the DMS is negatively regulated by the cannabinoid type 1 (CB1) that is coupled to inhibitory G-proteins (Gi), which causes reduced dSPN activity and suppresses goal-directed behavior, rendering animals habitual.
Beyond GPCRs located directly on SPNs, the cannabinoid type 1 (CB1) GPCR has also been implicated in behavioral control (Figure 2). CB1 receptors are located pre-synaptically and contain the Gi α subunit, therefore, CB1 receptor activation reduces the probability of neurotransmitter release onto post-synaptic cells (Gerdeman and Lovinger, 2001). Genetic deletion of the cannabinoid type 1 (CB1) GPCR impairs habit learning (Hilário et al., 2007) and activation of CB1 on DMS-projecting OFC neurons is necessary to suppress goal-directed behavior (Gremel et al., 2016). Interestingly, chronic activation of CB1 by THC accelerates the transition to habits and impairs iSPN long-term depression and synaptic depotentiation (Nazzaro et al., 2012)— a potential pathway whereby chronic cannabinoid use could lead to maladaptive habitual behavior. A crucial open question is how the intracellular signaling cascades that are activated by these receptors contribute to the plasticity underlying habits.
Striatal neuroplasticity depends on the activation of intracellular signaling cascades such as the extracellular signal-relate kinase (ERK) pathway (Mazzucchelli et al., 2002; Shiflett and Balleine, 2011; Hutton et al., 2017). Indeed, striatal ERK signaling has been implicated in motor-skill learning (Mazzucchelli et al., 2002; Bureau et al., 2010; Hutton et al., 2017) and has been proposed as a candidate molecular regulator of habit (Shiflett et al., 2010; Shiflett and Balleine, 2011). Recent studies have revealed mechanisms upstream of ERK that may be involved in regulating the shift in behavioral control (Figure 3). Activation of the tyrosine receptor kinase B (TrkB), the receptor for Brain-derived Neurotrophic Factor (BDNF), leads to the activation of several small G-proteins that ultimately activate ERK (Huang and Reichardt, 2003). Pitts et al., (2018) reported that TrkB mediates plasticity necessary for both actions and habits (Pitts et al., 2018). Overexpressing a truncated form of TrkB, which cannot activate intracellular signaling pathways, in the DMS prevented goal-directed behavior. In contrast, expressing the inactive TrkB in the DLS impaired the acquisition of habit learning (Pitts et al., 2018). It is noteworthy that the stress hormone corticosterone reduces TrkB levels in the brain (Gourley et al., 2012), causes a shift from full length TrkB to the truncated, inactive from of TrkB (Barfield et al., 2017), and biases behavioral control toward habits (Gourley et al., 2012; Barfield et al., 2017), similar to the ability of stress to promote habit formation (Dias-Ferreira et al., 2009).
Figure 3. Striatal subregion-specific activation of intracellular signaling mechanisms differentially regulates behavioral control.
The extracellular signal-related kinase (ERK) pathway is activated in the dorsal striatum during instrumental learning and may potentially contribute to the transcription necessary for synaptic plasticity. (a,b) The tyrosine receptor kinase B (TrkB), the receptor for brain-derived neurotrophic factor (BDNF), is a potential upstream mechanism of ERK activation in both the dorsomedial striatum (DMS) and dorsolateral striatum (DLS). (a) In the DMS, TrkB activation is necessary for goal-directed behavior, (b) whereas in the DLS TrkB activation contributes to habits. In the DLS, (b) the proton-gated acid-sensing ion channel isoform ASIC1a activates ERK signaling and is necessary for motor learning. It is unknown whether ASIC1a contributes specifically to instrumental habit learning.
Another potential component regulating habits, recently gleaned from motor learning studies, is the regulation of ERK by proton-gated acid-sensing ion channels (ASICs), in particular isoform ASIC1a. Genetic deletion or acute inhibition of ASIC1a in the dorsal striatum impairs activation of ERK signaling and impairs motor learning, without affecting initial motor performance (Yu et al., 2018). Reintroducing ASIC1a in the DLS of Asic1a knockout mice is sufficient to restore motor learning (Yu et al., 2018). Although high-frequency stimulation can trigger substantial release of neurotransmitter along with sufficient protons to activate ASICs (Du et al., 2014), the endogenous striatal inputs that activate striatal ASIC1a during instrumental learning are unknown. Additional studies are also necessary to clarify the contribution of ASICs in a subregion- and pathway-specific manner.
Establishing long-term changes in habit-related plasticity
The long-lasting neuroplasticity— from changes in synaptic strength to altered dendritic structure— that has been reported in striatal SPNs following habit learning suggest stable changes in gene expression occur to alter neuronal function. Epigenetic mechanisms alter accessibility to DNA for the transcriptional machinery to coordinate gene expression and are, therefore, fundamental regulators of the transcriptional processes mediating stable changes in neuronal function and memory (Kouzarides, 2007; Day and Sweatt, 2011; Peixoto and Abel, 2013). One of the best studied epigenetic regulators are histone deacetylases (HDACs), which remove acetyl groups from histone tails, creating a repressive chromatin state that prevents active gene transcription (Renthal and Nestler, 2009; McQuown and Wood, 2011; Robison and Nestler, 2011; Vogel-Ciernia and Wood, 2012). With sufficient activity-dependent signaling, such as that induced by a salient learning event, HDACs, particularly HDAC3 (McQuown et al., 2011; McQuown and Wood, 2011; Malvaez et al., 2013; Rogge et al., 2013; Alaghband et al., 2017; Kwapis et al., 2017), can be removed temporarily to allow for histone acetyltransferases (HATs) to acetylate histones. Histone acetylation neutralizes the positive charge of the lysine residues on the histone’s N-terminal tails, relaxing the chromatin structure to, generally, promote the active gene expression that subserves long-lasting synaptic plasticity and learning (Renthal and Nestler, 2009; McQuown and Wood, 2011; Robison and Nestler, 2011; Vogel-Ciernia and Wood, 2012).
Recently, the role of this epigenetic mechanism in the development of habit learning was examined. Using a multifaceted approach, including pharmacological HDAC inhibition, novel viral constructs to bi-directionally manipulate HDAC3 in the dorsal striatum, and chromatin immunoprecipitation, it was revealed that in both the DLS or DMS the transcriptional repressor HDAC3 is removed from the promoters of key neuronal activity-related genes (Figure 4) only following extended training, when conditions are ripe for habits to form (Malvaez et al., 2018). Furthermore, compromised HDAC3 activity in either the DLS or DMS promotes habits at a point when behavior should be goal-directed (Malvaez et al., 2018), indicating HDAC3 functions as a critical negative regulator of the gene transcription underlying habit formation in both the DLS and the DMS. It remains unknown what cortical or thalamic inputs are involved in activating the appropriate intracellular signaling mechanisms to modulate the repressive function of HDAC3 as behavior shifts toward habit. Furthermore, future studies must examine how HDAC3 regulates the transcription necessary for establishing changes in the signaling dynamics of the individual cell-types in the DMS and DLS to coordinate the development of habits.
Figure 4. HDAC3 negatively regulates habits in the DMS and DLS.
Histone modifications regulate the access to DNA for the transcriptional machinery and are fundamental in neuronal function and memory. (a) Early in instrumental training, when behavior is goal-directed, the transcriptional repressor histone deacetylase 3 (HDAC3) is engaged near gene promoters, removing acetyl groups from the histone’s N-terminal tails, creating a transcriptionally repressive chromatin state where DNA is less accessible to the transcriptional machinery. (b) With sufficient training, when animals become habitual, HDAC3 is disengaged from gene promoters by allowing the addition of acetyl groups (Ac) to the histone tails, which neutralized the interaction between the DNA and histone tails and leads to a relaxed chromatin state. This allows for the transcription of plasticity-related genes. It is unknown which histone acetyltransferase (HAT) contributes to this process and how these mechanisms are activated.
Exposure to cocaine causes an abnormally rapid development of habits and dysregulates striatal synaptic plasticity (Corbit et al., 2014). Interestingly, cocaine experience also induces a specific histone acetylation profile in the striatum (Kumar et al., 2005), possibly establishing an epigenetic landscape ripe for potentiating maladaptive habit learning. Indeed, very recent work has demonstrated that acetylation of histone H3, along with the expression of the plasticity-related gene Bdnf, was upregulated in the DLS following cocaine experience, and this was associated with a bias toward use of a response habit spatial navigation strategy (Harvey et al., 2019). This was also associated with decreased acetylation of histone H3 in the hippocampus (Harvey et al., 2019), possibly disrupting the acquisition of the goal-oriented strategy. These findings demonstrate that histone acetylation is differentially regulated in distinct brain regions, possibly altering the neuronal function of these two systems in an opposing manner to contribute to behavioral control. It remains unknown how the activity of the HATs and HDACs that regulate histone acetylation are altered as habits develop in the cortical structures highlighted above that contribute to behavioral control.
Beyond chromatin modifications (e.g., histone acetylation, methylation, phosphorylation), transcription is also epigenetically regulated via chromatin remodeling, which refers to the restructuring of nucleosomes (Saha et al., 2006), and via DNA methylation (Day and Sweatt, 2010), mechanisms that are both pivotal for memory formation (Day and Sweatt, 2010; Vogel-Ciernia et al., 2013). Interestingly, mutations in components of the chromatin remodeling complex have been implicated in autism (Neale et al., 2012; O’Roak et al., 2012) and schizophrenia (Loe-Mie et al., 2010), both of which have symptoms of maladaptive habits (Griffiths et al., 2014; Alvares et al., 2016). DNA methylation is also interesting in the context of habit learning given that it works in concert with histone acetylation to regulate memory processes (Miller et al., 2008). It is currently unknown whether these mechanisms are also involved in instrumental learning, and if so, what the gene targets are, and do these mechanism constrain the acquisition and/or expression of goal-directed or habitual behavior.
Significant open questions
An overreliance on habit is associated with the various forms of compulsivity that manifest across a range of conditions (Voon et al., 2015; Gillan et al., 2016) where maladaptive reward-seeking is pursued at the expense of goal-directed action strategies, including addiction (Belin et al., 2013; Sjoerds et al., 2013; Everitt and Robbins, 2016; Vandaele and Janak, 2017), obsessive-compulsive disorder (Gillan et al., 2011; Gillan et al., 2014), and schizophrenia (Griffiths et al., 2014; Morris et al., 2015). Although significant progress has been made in identifying the neural mechanisms involved in goal-directed and habit behavior, several gaps in knowledge remain. A lack of goal-directed behavior often manifests as habitual control of behavior, but this can emerge due to various deficits in specific elements of goal-directed behavior, such as the inability to update the value of the prospective outcome, failures in retrieving the current outcome value, or the inability to decipher distinct features of an outcome, and/or the direct strengthening of habits. Although several putative habit substrates have been identified, as discussed above, the unique function of these substrates in behavioral control, as well as the mechanisms involved in arbitrating these regulatory systems, have not been clearly defined. It remains to be understood how habits are encoded in the ensemble activity of direct and indirect pathway projection neurons. How does the cell-type specific ensemble activity change in each striatal subregion as habits develop? And importantly, how do these pathways interact? From the studies highlighted here, it is clear that behavioral output is regulated at multiple levels—from molecular, to cellular, to brain network interactions. Adaptive behavioral control does not arise from any single brain region or epigenetic mechanism in isolation, but from the coordinated interactions of all of these regulatory mechanisms. To understand how behavior transitions between the goal-directed and habit systems, it is critical to understand how each neural level influences the other. Changes at the molecular level will give rise to functional changes in cell activity, which in turn may alter neural responses to incoming signals (e.g., environmental stimuli), and ultimately influence behavioral output. This approach may facilitate our understanding of how the engagement of one cortical-striatal network versus another is controlled.
Supplementary Material
Significance.
Habits are an adaptive component of our daily lives that allow us to efficiently perform routine tasks, and their disruption contributes to the symptoms that underlie many psychiatric disorders. This review highlights the neural substrates, from brain networks, to cellular and epigenetic mechanisms, recently identified to play a crucial role in goal-directed and habit behavior.
Acknowledgements
The author would like to thank Dr. Kate M. Wassum for helpful comments on this manuscript. This work was funded by DA046679.
Footnotes
Conflict of interest statement: The author declares no competing financial interests.
References
- Adams CD, Dickinson A (1981) Instrumental responding following reinforcer devaluation. The Quarterly Journal of Experimental Psychology 33:109–121. [Google Scholar]
- Agostino R, Sanes JN, Hallett M (1996) Motor skill learning in Parkinson’s disease. J Neurol Sci 139:218–226. [PubMed] [Google Scholar]
- Alaghband Y, Kwapis JL, López AJ, White AO, Aimiuwu OV, Al-Kachak A, Bodinayake KK, Oparaugo NC, Dang R, Astarabadi M, Matheos DP, Wood MA (2017) Distinct roles for the deacetylase domain of HDAC3 in the hippocampus and medial prefrontal cortex in the formation and extinction of memory. Neurobiol Learn Mem 145:94–104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Albin RL, Young AB, Penney JB (1989) The functional anatomy of basal ganglia disorders. Trends Neurosci 12:366–375. [DOI] [PubMed] [Google Scholar]
- Alvares GA, Balleine BW, Whittle L, Guastella AJ (2016) Reduced goal-directed action control in autism spectrum disorder. Autism Res. [DOI] [PubMed] [Google Scholar]
- Balleine BW, O’Doherty JP (2009) Human and Rodent Homologies in Action Control: Corticostriatal Determinants of Goal-Directed and Habitual Action. Neuropsychopharmacology. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Balleine BW, O’Doherty JP (2010) Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology 35:48–69. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barfield ET, Gerber KJ, Zimmermann KS, Ressler KJ, Parsons RG, Gourley SL (2017) Regulation of actions and habits by ventral hippocampal trkB and adolescent corticosteroid exposure. PLoS Biol 15. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barker JM, Taylor JR, Chandler LJ (2014) A unifying model of the role of the infralimbic cortex in extinction and habits. Learn Mem 21:441–448. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barker JM, Glen WB, Linsenbardt DN, Lapish CC, Chandler LJ (2017) Habitual Behavior Is Mediated by a Shift in Response-Outcome Encoding by Infralimbic Cortex. eNeuro 4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Barnes TD, Kubota Y, Hu D, Jin DZ, Graybiel AM (2005) Activity of striatal neurons reflects dynamic encoding and recoding of procedural memories. Nature 437:1158–1161. [DOI] [PubMed] [Google Scholar]
- Belin D, Belin-Rauscent A, Murray JE, Everitt BJ (2013) Addiction: failure of control over maladaptive incentive habits. Curr Opin Neurobiol. [DOI] [PubMed] [Google Scholar]
- Bradfield LA, Bertran-Gonzalez J, Chieng B, Balleine BW (2013) The thalamostriatal pathway and cholinergic control of goal-directed action: interlacing new with existing learning in the striatum. Neuron 79:153–166. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Bradfield LA, Dezfouli A, van Holstein M, Chieng B, Balleine BW (2015) Medial Orbitofrontal Cortex Mediates Outcome Retrieval in Partially Observable Task Situations. Neuron 88:1268–1280. [DOI] [PubMed] [Google Scholar]
- Bureau G, Carrier M, Lebel M, Cyr M (2010) Intrastriatal inhibition of extracellular signal-regulated kinases impaired the consolidation phase of motor skill learning. Neurobiol Learn Mem 94:107–115. [DOI] [PubMed] [Google Scholar]
- Burguière E, Monteiro P, Feng G, Graybiel AM (2013) Optogenetic stimulation of lateral orbitofronto-striatal pathway suppresses compulsive behaviors. Science 340:1243–1246. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Burton AC, Nakamura K, Roesch MR (2015) From ventral-medial to dorsal-lateral striatum: neural correlates of reward-guided decision-making. Neurobiol Learn Mem 117:51–59. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Cohen H, Pourcher E (2007) Intact encoding, impaired consolidation in procedural learning in Parkinson’s disease. Exp Brain Res 179:703–708. [DOI] [PubMed] [Google Scholar]
- Corbit LH (2016) Effects of obesogenic diets on learning and habitual responding. Current Opinion in Behavioral Sciences 9:84–90. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Corbit LH, Balleine BW (2003) The role of prelimbic cortex in instrumental conditioning. Behav Brain Res 146:145–157. [DOI] [PubMed] [Google Scholar]
- Corbit LH, Janak PH (2010) Posterior dorsomedial striatum is critical for both selective instrumental and Pavlovian reward learning. Eur J Neurosci 31:1312–1321. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Corbit LH, Janak PH (2016) Habitual Alcohol Seeking: Neural Bases and Possible Relations to Alcohol Use Disorders. Alcohol Clin Exp Res. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Corbit LH, Muir JL, Balleine BW (2001) The role of the nucleus accumbens in instrumental conditioning: Evidence of a functional dissociation between accumbens core and shell. J Neurosci 21:3251–3260. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Corbit LH, Leung BK, Balleine BW (2013) The role of the amygdala-striatal pathway in the acquisition and performance of goal-directed instrumental actions. J Neurosci 33:17682–17690. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Corbit LH, Chieng BC, Balleine BW (2014) Effects of repeated cocaine exposure on habit learning and reversal by N-acetylcysteine. Neuropsychopharmacology 39:1893–1901. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Coutureau E, Killcross S (2003) Inactivation of the infralimbic prefrontal cortex reinstates goal-directed responding in overtrained rats. Behav Brain Res 146:167–174. [DOI] [PubMed] [Google Scholar]
- Cui G, Jun SB, Jin X, Pham MD, Vogel SS, Lovinger DM, Costa RM (2013) Concurrent activation of striatal direct and indirect pathways during action initiation. Nature 494:238–242. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Daw ND, Niv Y, Dayan P (2005) Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat Neurosci 8:1704–1711. [DOI] [PubMed] [Google Scholar]
- Daw ND, Gershman SJ, Seymour B, Dayan P, Dolan RJ (2011) Model-based influences on humans’ choices and striatal prediction errors. Neuron 69:1204–1215. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Day JJ, Sweatt JD (2010) DNA methylation and memory formation. Nature Neuroscience 13:1319. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Day JJ, Sweatt JD (2011) Epigenetic mechanisms in cognition. Neuron 70:813–829. [DOI] [PMC free article] [PubMed] [Google Scholar]
- de Wit S, Kindt M, Knot SL, Verhoeven AAC, Robbins TW, Gasull-Camos J, Evans M, Mirza H, Gillan CM (2018) Shifting the balance between goals and habits: Five failures in experimental habit induction. J Exp Psychol Gen 147:1043–1065. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Dias-Ferreira E, Sousa JC, Melo I, Morgado P, Mesquita AR, Cerqueira JJ, Costa RM, Sousa N (2009) Chronic stress causes frontostriatal reorganization and affects decision-making. Science 325:621–625. [DOI] [PubMed] [Google Scholar]
- Dickinson A (1985) Actions and Habits: the development of behavioural autonomy. Philosphical Transactions of the Royal Society of London B308:67–78. [Google Scholar]
- Du J, Reznikov LR, Price MP, Zha X-m, Lu Y, Moninger TO, Wemmie JA, Welsh MJ (2014) Protons are a neurotransmitter that regulates synaptic plasticity in the lateral amygdala. [DOI] [PMC free article] [PubMed]
- Everitt BJ, Robbins TW (2016) Drug Addiction: Updating Actions to Habits to Compulsions Ten Years On. Annu Rev Psychol 67:23–50. [DOI] [PubMed] [Google Scholar]
- Faure A, Haberland U, Conde F, El Massioui N (2005) Lesion to the nigrostriatal dopamine system disrupts stimulus-response habit formation. J Neurosci 25:2771–2780. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Fettes P, Schulze L, Downar J (2017) Cortico-Striatal-Thalamic Loop Circuits of the Orbitofrontal Cortex: Promising Therapeutic Targets in Psychiatric Illness. Front Syst Neurosci 11:25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Freeze BS, Kravitz AV, Hammack N, Berke JD, Kreitzer AC (2013) Control of basal ganglia output by direct and indirect pathway projection neurons. J Neurosci 33:18531–18539. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Furlong TM, Supit AS, Corbit LH, Killcross S, Balleine BW (2015) Pulling habits out of rats: adenosine 2A receptor antagonism in dorsomedial striatum rescues meth-amphetamine-induced deficits in goal-directed action. Addict Biol. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gerdeman G, Lovinger DM (2001) CB1 cannabinoid receptor inhibits synaptic release of glutamate in rat dorsolateral striatum. J Neurophysiol 85:468–471. [DOI] [PubMed] [Google Scholar]
- Gerfen CR (1985) The neostriatal mosaic. I. Compartmental organization of projections from the striatum to the substantia nigra in the rat. J Comp Neurol 236:454–476. [DOI] [PubMed] [Google Scholar]
- Gillan CM, Robbins TW, Sahakian BJ, van den Heuvel OA, van Wingen G (2016) The role of habit in compulsivity. Eur Neuropsychopharmacol 26:828–840. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gillan CM, Papmeyer M, Morein-Zamir S, Sahakian BJ, Fineberg NA, Robbins TW, de Wit S (2011) Disruption in the balance between goal-directed behavior and habit learning in obsessive-compulsive disorder. Am J Psychiatry 168:718–726. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gillan CM, Morein-Zamir S, Urcelay GP, Sule A, Voon V, Apergis-Schoute AM, Fineberg NA, Sahakian BJ, Robbins TW (2014) Enhanced avoidance habits in obsessive-compulsive disorder. Biol Psychiatry 75:631–638. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gourley SL, Swanson AM, Jacobs AM, Howell JL, Mo M, Dileone RJ, Koleske AJ, Taylor JR (2012) Action control is mediated by prefrontal BDNF and glucocorticoid receptor binding. Proc Natl Acad Sci U S A 109:20714–20719. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Graybiel AM (2000) The basal ganglia. Curr Biol 10:R509–511. [DOI] [PubMed] [Google Scholar]
- Gremel CM, Costa RM (2013) Orbitofrontal and striatal circuits dynamically encode the shift between goal-directed and habitual actions. Nat Commun 4:2264. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Gremel CM, Chancey JH, Atwood BK, Luo G, Neve R, Ramakrishnan C, Deisseroth K, Lovinger DM, Costa RM (2016) Endocannabinoid Modulation of Orbitostriatal Circuits Gates Habit Formation. Neuron 90:1312–1324. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Griffiths KR, Morris RW, Balleine BW (2014) Translational studies of goal-directed action as a framework for classifying deficits across psychiatric disorders. Front Syst Neurosci 8:101. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Groman SM, Massi B, Mathias SR, Curry DW, Lee D, Taylor JR (2019) Neurochemical and Behavioral Dissections of Decision-Making in a Rodent Multistage Task. J Neurosci 39:295–306. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Harvey E, Blurton-Jones M, Kennedy PJ (2019) Hippocampal BDNF regulates a shift from flexible, goal-directed to habit memory system function following cocaine abstinence. Hippocampus. [DOI] [PMC free article] [PubMed] [Google Scholar] [Retracted]
- Heindel WC, Butters N, Salmon DP (1988) Impaired learning of a motor skill in patients with Huntington’s disease. Behav Neurosci 102:141–147. [DOI] [PubMed] [Google Scholar]
- Hernandez LF, Kubota Y, Hu D, Howe MW, Lemaire N, Graybiel AM (2013) Selective effects of dopamine depletion and L-DOPA therapy on learning-related firing dynamics of striatal neurons. J Neurosci 33:4782–4795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hilario M, Holloway T, Jin X, Costa RM (2012) Different dorsal striatum circuits mediate action discrimination and action generalization. Eur J Neurosci 35:1105–1114. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Hilário MR, Clouse E, Yin HH, Costa RM (2007) Endocannabinoid signaling is critical for habit formation. Front Integr Neurosci 1:6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Huang EJ, Reichardt LF (2003) Trk receptors: roles in neuronal signal transduction. Annu Rev Biochem 72:609–642. [DOI] [PubMed] [Google Scholar]
- Hutton SR, Otis JM, Kim EM, Lamsal Y, Stuber GD, Snider WD (2017) ERK/MAPK signaling is required for pathway-specific striatal motor functions. J Neurosci. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ibáñez-Sandoval O, Xenias HS, Tepper JM, Koós T (2015) Dopaminergic and cholinergic modulation of striatal tyrosine hydroxylase interneurons. Neuropharmacology 95:468–476. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ibáñez-Sandoval O, Tecuapetla F, Unal B, Shah F, Koós T, Tepper JM (2010) Electrophysiological and morphological characteristics and synaptic connectivity of tyrosine hydroxylase-expressing neurons in adult mouse striatum. J Neurosci 30:6999–7016. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jin X, Costa RM (2010) Start/stop signals emerge in nigrostriatal circuits during sequence learning. Nature 466:457–462. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jin X, Tecuapetla F, Costa RM (2014) Basal ganglia subcircuits distinctively encode the parsing and concatenation of action sequences. Nat Neurosci 17:423–430. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Jog MS, Kubota Y, Connolly CI, Hillegaart V, Graybiel AM (1999) Building neural representations of habits. Science 286:1745–1749. [DOI] [PubMed] [Google Scholar]
- Kaminer J, Espinoza D, Bhimani S, Tepper JM, Koos T, Shiflett MW (2019) Loss of striatal tyrosine-hydroxylase interneurons impairs instrumental goal-directed behavior. Eur J Neurosci. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Killcross S, Coutureau E (2003) Coordination of actions and habits in the medial prefrontal cortex of rats. Cereb Cortex 13:400–408. [DOI] [PubMed] [Google Scholar]
- Knowlton BJ, Mangels JA, Squire LR (1996) A neostriatal habit learning system in humans. Science 273:1399–1402. [DOI] [PubMed] [Google Scholar]
- Kouzarides T (2007) Chromatin modifications and their function. Cell 128:693–705. [DOI] [PubMed] [Google Scholar]
- Kumar A, Choi KH, Renthal W, Tsankova NM, Theobald DE, Truong HT, Russo SJ, Laplant Q, Sasaki TS, Whistler KN, Neve RL, Self DW, Nestler EJ (2005) Chromatin remodeling is a key mechanism underlying cocaine-induced plasticity in striatum. Neuron 48:303–314. [DOI] [PubMed] [Google Scholar]
- Kupferschmidt DA, Juczewski K, Cui G, Johnson KA, Lovinger DM (2017) Parallel, but Dissociable, Processing in Discrete Corticostriatal Inputs Encodes Skill Learning. Neuron 96:476–489.e475. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Kwapis JL, Alaghband Y, López AJ, White AO, Campbell RR, Dang RT, Rhee D, Tran AV, Carl AE, Matheos DP, Wood MA (2017) Context and Auditory Fear are Differentially Regulated by HDAC3 Activity in the Lateral and Basal Subnuclei of the Amygdala. Neuropsychopharmacology 42:1284–1294. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Li Y, He Y, Chen M, Pu Z, Chen L, Li P, Li B, Li H, Huang ZL, Li Z, Chen JF (2016) Optogenetic Activation of Adenosine A2A Receptor Signaling in the Dorsomedial Striatopallidal Neurons Suppresses Goal-Directed Behavior. Neuropsychopharmacology 41:1003–1013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Liljeholm M, Tricomi E, O’Doherty JP, Balleine BW (2011) Neural correlates of instrumental contingency learning: differential effects of action-reward conjunction and disjunction. J Neurosci 31:2474–2480. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Loe-Mie Y, Lepagnol-Bestel AM, Maussion G, Doron-Faigenboim A, Imbeaud S, Delacroix H, Aggerbeck L, Pupko T, Gorwood P, Simonneau M, Moalic JM (2010) SMARCA2 and other genome-wide supported schizophrenia-associated genes: regulation by REST/NRSF, network organization and primate-specific evolution. Hum Mol Genet 19:2841–2857. [DOI] [PubMed] [Google Scholar]
- Mailly P, Aliane V, Groenewegen HJ, Haber SN, Deniau JM (2013) The rat prefrontostriatal system analyzed in 3D: evidence for multiple interacting functional units. J Neurosci 33:5718–5727. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Malvaez M, Wassum KM (2018) Regulation of habit formation in the dorsal striatum. Curr Opin Behav Sci 20:67–74. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Malvaez M, Shieh C, Murphy MD, Greenfield VY, Wassum KM (2019) Distinct cortical-amygdala projections drive reward value encoding and retrieval. Nat Neurosci 22:762–769. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Malvaez M, McQuown SC, Rogge GA, Astarabadi M, Jacques V, Carreiro S, Rusche JR, Wood MA (2013) HDAC3-selective inhibitor enhances extinction of cocaine-seeking behavior in a persistent manner. Proc Natl Acad Sci U S A 110:2647–2652. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Malvaez M, Greenfield VY, Matheos DP, Angelillis NA, Murphy MD, Kennedy PJ, Wood MA, Wassum KM (2018) Habits Are Negatively Regulated by Histone Deacetylase 3 in the Dorsal Striatum. Biol Psychiatry. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Mazzucchelli C, Vantaggiato C, Ciamei A, Fasano S, Pakhotin P, Krezel W, Welzl H, Wolfer DP, Pagès G, Valverde O, Marowsky A, Porrazzo A, Orban PC, Maldonado R, Ehrengruber MU, Cestari V, Lipp HP, Chapman PF, Pouysségur J, Brambilla R (2002) Knockout of ERK1 MAP kinase enhances synaptic plasticity in the striatum and facilitates striatal-mediated learning and memory. Neuron 34:807–820. [DOI] [PubMed] [Google Scholar]
- McGeorge AJ, Faull RL (1989) The organization of the projection from the cerebral cortex to the striatum in the rat. Neuroscience 29:503–537. [DOI] [PubMed] [Google Scholar]
- McNamee D, Liljeholm M, Zika O, O’Doherty JP (2015) Characterizing the associative content of brain structures involved in habitual and goal-directed actions in humans: a multivariate FMRI study. J Neurosci 35:3764–3771. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McQuown SC, Wood MA (2011) HDAC3 and the molecular brake pad hypothesis. Neurobiol Learn Mem 96:27–34. [DOI] [PMC free article] [PubMed] [Google Scholar]
- McQuown SC, Barrett RM, Matheos DP, Post RJ, Rogge GA, Alenghat T, Mullican SE, Jones S, Rusche JR, Lazar MA, Wood MA (2011) HDAC3 is a critical negative regulator of long-term memory formation. J Neurosci 31:764–774. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miller CA, Campbell SL, Sweatt JD (2008) DNA methylation and histone acetylation work in concert to regulate memory formation and synaptic plasticity. Neurobiol Learn Mem 89:599–603. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Miyachi S, Hikosaka O, Lu X (2002) Differential activation of monkey striatal neurons in the early and late stages of procedural learning. Exp Brain Res 146:122–126. [DOI] [PubMed] [Google Scholar]
- Miyachi S, Hikosaka O, Miyashita K, Kárádi Z, Rand MK (1997) Differential roles of monkey striatum in learning of sequential hand movement. Exp Brain Res 115:1–5. [DOI] [PubMed] [Google Scholar]
- Morris RW, Quail S, Griffiths KR, Green MJ, Balleine BW (2015) Corticostriatal control of goal-directed action is impaired in schizophrenia. Biol Psychiatry 77:187–195. [DOI] [PubMed] [Google Scholar]
- Murray EA, Izquierdo A (2007) Orbitofrontal cortex and amygdala contributions to affect and action in primates. Ann N Y Acad Sci 1121:273–296. [DOI] [PubMed] [Google Scholar]
- Nazzaro C, Greco B, Cerovic M, Baxter P, Rubino T, Trusel M, Parolaro D, Tkatch T, Benfenati F, Pedarzani P, Tonini R (2012) SK channel modulation rescues striatal plasticity and control over habit in cannabinoid tolerance. Nat Neurosci 15:284–293. [DOI] [PubMed] [Google Scholar]
- Neale BM et al. (2012) Patterns and rates of exonic de novo mutations in autism spectrum disorders. Nature 485:242–245. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nelson A, Killcross S (2006) Amphetamine exposure enhances habit formation. J Neurosci 26:3805–3812. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Nelson AJ, Killcross S (2013) Accelerated habit formation following amphetamine exposure is reversed by D1, but enhanced by D2, receptor antagonists. Front Neurosci 7:76. [DOI] [PMC free article] [PubMed] [Google Scholar]
- O’Hare JK, Ade KK, Sukharnikova T, Van Hooser SD, Palmeri ML, Yin HH, Calakos N (2016) Pathway-Specific Striatal Substrates for Habitual Behavior. Neuron 89:472–479. [DOI] [PMC free article] [PubMed] [Google Scholar]
- O’Hare JK, Li H, Kim N, Gaidis E, Ade K, Beck J, Yin H, Calakos N (2017) Striatal fast-spiking interneurons selectively modulate circuit output and are required for habitual behavior. Elife 6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- O’Roak BJ et al. (2012) Sporadic autism exomes reveal a highly interconnected protein network of de novo mutations. Nature 485:246–250. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ostlund SB, Balleine BW (2007a) Orbitofrontal cortex mediates outcome encoding in Pavlovian but not instrumental conditioning. J Neurosci 27:4819–4825. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Ostlund SB, Balleine BW (2007b) The contribution of orbitofrontal cortex to action selection. Ann N Y Acad Sci 1121:174–192. [DOI] [PubMed] [Google Scholar]
- Packard MG, McGaugh JL (1996) Inactivation of hippocampus or caudate nucleus with lidocaine differentially affects expression of place and response learning. Neurobiol Learn Mem 65:65–72. [DOI] [PubMed] [Google Scholar]
- Peixoto L, Abel T (2013) The role of histone acetylation in memory formation and cognitive impairments. Neuropsychopharmacology 38:62–76. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Pitts EG, Li DC, Gourley SL (2018) Bidirectional coordination of actions and habits by TrkB in mice. Sci Rep 8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Regier PS, Amemiya S, Redish AD (2015) Hippocampus and subregions of the dorsal striatum respond differently to a behavioral strategy change on a spatial navigation task. J Neurophysiol 114:1399–1416. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Renteria R, Baltz ET, Gremel CM (2018) Chronic alcohol exposure disrupts top-down control over basal ganglia action selection to produce habits. Nat Commun 9:211. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Renthal W, Nestler EJ (2009) Histone acetylation in drug addiction. Semin Cell Dev Biol 20:387–394. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Robison AJ, Nestler EJ (2011) Transcriptional and epigenetic mechanisms of addiction. Nat Rev Neurosci 12:623–637. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rogge GA, Singh H, Dang R, Wood MA (2013) HDAC3 is a negative regulator of cocaine-context-associated memory formation. J Neurosci 33:6623–6632. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Rueda-Orozco PE, Robbe D (2015) The striatum multiplexes contextual and kinematic information to constrain motor habits execution. Nat Neurosci 18:453–460. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Saha A, Wittmeyer J, Cairns BR (2006) Chromatin remodelling: the industrial revolution of DNA around histones. Nat Rev Mol Cell Biol 7:437–447. [DOI] [PubMed] [Google Scholar]
- Schuck NW, Cai MB, Wilson RC, Niv Y (2016) Human Orbitofrontal Cortex Represents a Cognitive Map of State Space. Neuron 91:1402–1412. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shan Q, Christie MJ, Balleine BW (2015) Plasticity in striatopallidal projection neurons mediates the acquisition of habitual actions. Eur J Neurosci 42:2097–2104. [DOI] [PubMed] [Google Scholar]
- Shan Q, Ge M, Christie MJ, Balleine BW (2014) The acquisition of goal-directed actions generates opposing plasticity in direct and indirect pathways in dorsomedial striatum. J Neurosci 34:9196–9201. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shen W, Flajolet M, Greengard P, Surmeier DJ (2008) Dichotomous dopaminergic control of striatal synaptic plasticity. Science 321:848–851. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shiflett MW, Balleine BW (2011) Contributions of ERK signaling in the striatum to instrumental learning and performance. Behav Brain Res 218:240–247. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Shiflett MW, Brown RA, Balleine BW (2010) Acquisition and performance of goal-directed instrumental actions depends on ERK signaling in distinct regions of dorsal striatum in rats. J Neurosci 30:2951–2959. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Sjoerds Z, de Wit S, van den Brink W, Robbins TW, Beekman AT, Penninx BW, Veltman DJ (2013) Behavioral and neuroimaging evidence for overreliance on habit learning in alcohol-dependent patients. Transl Psychiatry 3:e337. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith KS, Graybiel AM (2013) A dual operator view of habitual behavior reflecting cortical and striatal dynamics. Neuron 79:361–374. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Smith KS, Virkud A, Deisseroth K, Graybiel AM (2012) Reversible online control of habitual behavior by optogenetic perturbation of medial prefrontal cortex. Proc Natl Acad Sci U S A 109:18932–18937. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tecuapetla F, Jin X, Lima SQ, Costa RM (2016) Complementary Contributions of Striatal Projection Pathways to Action Initiation and Execution. Cell. [DOI] [PubMed] [Google Scholar]
- Thorn CA, Atallah H, Howe M, Graybiel AM (2010) Differential dynamics of activity changes in dorsolateral and dorsomedial striatal loops during learning. Neuron 66:781–795. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Tolman EC, Gleitman H (1949) Studies in spatial learning; place and response learning under different degrees of motivation. J Exp Psychol 39:653–659. [DOI] [PubMed] [Google Scholar]
- Tricomi E, Balleine BW, O’Doherty JP (2009) A specific role for posterior dorsolateral striatum in human habit learning. Eur J Neurosci 29:2225–2232. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vandaele Y, Janak PH (2017) Defining the place of habit in substance use disorders. Prog Neuropsychopharmacol Biol Psychiatry. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vogel-Ciernia A, Wood MA (2012) Molecular brake pad hypothesis: pulling off the brakes for emotional memory. Rev Neurosci 23:607–626. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Vogel-Ciernia A et al. (2013) The neuron-specific chromatin regulatory subunit BAF53b is necessary for synaptic plasticity and memory. Nat Neurosci 16:552–561. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Voon V, Derbyshire K, Rück C, Irvine MA, Worbe Y, Enander J, Schreiber LR, Gillan C, Fineberg NA, Sahakian BJ, Robbins TW, Harrison NA, Wood J, Daw ND, Dayan P, Grant JE, Bullmore ET (2015) Disorders of compulsivity: a common bias towards learning habits. Mol Psychiatry 20:345–352. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wang Z, Kai L, Day M, Ronesi J, Yin HH, Ding J, Tkatch T, Lovinger DM, Surmeier DJ (2006) Dopaminergic control of corticostriatal long-term synaptic depression in medium spiny neurons is mediated by cholinergic interneurons. Neuron 50:443–452. [DOI] [PubMed] [Google Scholar]
- Whyte AJ, Kietzman HW, Swanson AM, Butkovich LM, Barbee BR, Bassell GJ, Gross C, Gourley SL (2019) Reward-Related Expectations Trigger Dendritic Spine Plasticity in the Mouse Ventrolateral Orbitofrontal Cortex. J Neurosci 39:4595–4605. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wikenheiser AM, Schoenbaum G (2016) Over the river, through the woods: cognitive maps in the hippocampus and orbitofrontal cortex. Nat Rev Neurosci 17:513–523. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Wilson RC, Takahashi YK, Schoenbaum G, Niv Y (2014) Orbitofrontal cortex as a cognitive map of task space. Neuron 81:267–279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yin HH, Knowlton BJ, Balleine BW (2004) Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning. Eur J Neurosci 19:181–189. [DOI] [PubMed] [Google Scholar]
- Yin HH, Knowlton BJ, Balleine BW (2005a) Blockade of NMDA receptors in the dorsomedial striatum prevents action-outcome learning in instrumental conditioning. Eur J Neurosci 22:505–512. [DOI] [PubMed] [Google Scholar]
- Yin HH, Knowlton BJ, Balleine BW (2006) Inactivation of dorsolateral striatum enhances sensitivity to changes in the action-outcome contingency in instrumental conditioning. Behav Brain Res 166:189–196. [DOI] [PubMed] [Google Scholar]
- Yin HH, Ostlund SB, Knowlton BJ, Balleine BW (2005b) The role of the dorsomedial striatum in instrumental conditioning. Eur J Neurosci 22:513–523. [DOI] [PubMed] [Google Scholar]
- Yu C, Gupta J, Chen JF, Yin HH (2009) Genetic deletion of A2A adenosine receptors in the striatum selectively impairs habit formation. J Neurosci 29:15100–15103. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Yu Z, Wu Y-J, Wang Y-Z, Liu D-S, Song X-L, Jiang Q, Li Y, Zhang S, Xu N-J, Zhu MX, Li W-G, Xu T-L (2018) The acid-sensing ion channel ASIC1a mediates striatal synapse remodeling and procedural motor learning. [DOI] [PMC free article] [PubMed]
- Zimmermann KS, Yamin JA, Rainnie DG, Ressler KJ, Gourley SL (2017) Connections of the Mouse Orbitofrontal Cortex and Regulation of Goal-Directed Action Selection by Brain-Derived Neurotrophic Factor. Biol Psychiatry 81:366–377. [DOI] [PMC free article] [PubMed] [Google Scholar]
- Zimmermann KS, Li CC, Rainnie DG, Ressler KJ, Gourley SL (2018) Memory Retention Involves the Ventrolateral Orbitofrontal Cortex: Comparison with the Basolateral Amygdala. Neuropsychopharmacology 43:674. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.




