Skip to main content
UKPMC Funders Author Manuscripts logoLink to UKPMC Funders Author Manuscripts
. Author manuscript; available in PMC: 2021 Apr 7.
Published in final edited form as: Nat Rev Neurosci. 2020 Mar 30;21(5):247–263. doi: 10.1038/s41583-020-0289-z

The transition to compulsion in addiction

Christian Lüscher 1,2,, Trevor W Robbins 3,, Barry J Everitt 3,
PMCID: PMC7610550  EMSID: EMS120704  PMID: 32231315

Abstract

Compulsion is a cardinal symptom of drug addiction (severe substance use disorder). However, compulsion is observed in only a small proportion of individuals who repeatedly seek and use addictive substances. Here, we integrate accounts of the neuropharmacological mechanisms that underlie the transition to compulsion with overarching learning theories, to outline how compulsion develops in addiction. Importantly, we emphasize the conceptual distinctions between compulsive drug-seeking behaviour and compulsive drug-taking behaviour (that is, use). In the latter, an individual cannot stop using a drug despite major negative consequences, possibly reflecting an imbalance in frontostriatal circuits that encode reward and aversion. By contrast, an individual may compulsively seek drugs (that is, persist in seeking drugs despite the negative consequences of doing so) when the neural systems that underlie habitual behaviour dominate goal-directed behavioural systems, and when executive control over this maladaptive behaviour is diminished. This distinction between different aspects of addiction may help to identify its neural substrates and new treatment strategies.


In severe addiction, individuals spend much time and effort in procuring drugs for subsequent use that may not be in rich supply or immediately available, despite considerable risk of personal, physical and social harm. Such compulsive drug seeking, despite adverse or negative consequences, occurs in the absence of the drug. Drug seeking therefore cannot be simply the pharmacological effect of the taken substance, although the drug may further promote such behaviour. Drug taking, by contrast, is profoundly affected by the self-administered drug from the first ingestion. This distinction between two types of compulsive behaviour — compulsive drug taking and compulsive drug seeking — is made in this Review, as these behaviours may be governed by different psychological processes and dissociable neural circuits. The distinction also has implications for the way animal models of addiction are developed and used.

Indeed, the fifth edition of the American Psychiatric Association’s Diagnostic and Statistical Manual of Mental Disorders (DSM-5)1 has also subtly changed the way in which psychiatrists might consider addiction — now referred to as ‘substance use disorder’ — by abandoning some criteria that focused mostly on behaviours related to drug dependence, and placing more diagnostic emphasis on the compulsive quality of drug seeking. Thus, the notion that compulsivity is central to addiction is supported by the listing in the DSM-5 of symptoms that include an excessive time taken to search for drugs, the neglect of other goal-directed behaviours (such as employment and family activities) and even a failure to avoid physical self-harm, as well as subjective correlates of drug-seeking behaviour, such as craving. In this Review, we mostly use the term ‘addiction’ rather than the more cumbersome ‘severe substance use disorder’ as defined in the DSM-5.

Here, we describe addiction as a manifestation of contrasting behaviours of compulsive drug seeking and taking (BOXES 1,2) within a learning theory framework (BOX 3). We integrate this approach with recent findings from circuit neuroscience and discuss how distinct and dissociable neural and psychological mechanisms may underlie these distinct facets of addictive behaviour. We outline how the transition from controlled to compulsive drug seeking may involve an imbalance in the activity of frontostriatal circuits underlying goal-directed behaviour. We further discuss an optogenetic model involving direct self-stimulation of the mesolimbic dopamine (DA) system that provides evidence for a gain of function in cortical ‘top-down’ control. We contrast this model with evidence from human and animal studies of addiction that indicates a predominance of habitual over goal-directed behaviour associated with a loss of executive control by the prefrontal cortex (PFC). Finally, we suggest ways of resolving these discrepant views in the context of the distinction between drug seeking and drug taking with important theoretical and methodological implications for the field.

Box 1. Animal models of drug taking.

Self-administration

Laboratory animals will readily self-administer drugs that are addictive in humans by, for example, learning to press a lever for an intravenous infusion140. This ‘controlled drug taking’ is usually performed under continuous or low-ratio schedules of reinforcement (that is, all or most responses deliver an infusion) and the rate of infusions is titrated. Drinking alcohol is a consummatory response that may be preceded by a nose-poke or lever-press taking response (see BOX 2 for more on seeking–taking chains141). For intravenous drugs, the lever press is effectively both the taking response and the consummatory response, as it is followed only by the drug’s effect in the brain. Drug self-administration or taking has been used to define the reinforcing or rewarding mechanisms of addictive drugs and the neuroadaptations that follow extended use, particularly in the dopamine reward circuitry142,143.

Optogenetic dopamine neuron self-stimulation

Optogenetic dopamine neuron self-stimulation (oDASS) has been proposed to facilitate mechanistic investigations of drug reward focusing on the mesolimbic dopamine circuit. Mice will vigorously self-stimulate dopamine neurons in the ventral tegmental area (VTA) through oDASS25. Such self-stimulation is occluded by injections of addictive drugs, readily supports adaptive behaviours such as cue-associated reward seeking and yields synaptic adaptations in the VTA and ventral striatum typically seen with addictive drugs. oDASS may also occlude drug self-administration, suggesting overlapping underlying neural circuits.

Loss of control over use: escalation of intake

Loss of control over drug taking can be reflected in escalation of drug intake that occurs after long-term, but not short-term, daily self-administration sessions for intravenous drugs, especially cocaine81,144, or, in the case of alcohol, persistent drinking over several weeks145. Brain adaptations to long-term drug intake may raise reward thresholds (hedonic set points)146,147 and the efficacy of the reinforcing effects of cocaine148 (reviewed elsewhere149).

However, escalation of intake neither predicts nor is necessary for compulsive drug taking (for example, as in the three-criteria model (see below)), but rather it is a consequence of developing the three behavioural criteria of addiction47. Escalation of cocaine intake also does not predict development of compulsive cocaine seeking118. Intermittent access to cocaine or alcohol, rather than extended access, might favour the transition to compulsive cocaine intake150 and alcohol intake151,152, as well as compulsive alcohol seeking82.

Another model of loss of control over taking is the altered inter-infusion interval of cocaine seen during periods of unlimited ‘binge’ access153. In addition, the progressive disruption of titration, as characterized by a burst-like pattern of infusions, is a predictive behavioural marker of the future transition to compulsivity in vulnerable individuals154.

Addiction-like behaviour in rats

Compulsive drug taking can be defined as uncontrolled use despite negative consequences. This implies a ready drug supply and focuses on behaviour most proximal to the drug, such as drinking, smoking or self-injecting. Compulsive alcohol taking is measured in animals that persist in drinking despite quinine adulteration, emerges only after a long drinking history and is evident even after 9 months of abstinence, is specific to alcohol and is independent of physical dependence145,151,152.

For intravenous drugs, punishment-resistant instrumental taking responses can be taken as a measure of compulsion47,155,156. In the three-criteria model of addiction-like behaviour47,157, rats are defined as compulsive if they meet three criteria: persistent responding for the drug despite receiving mild footshock punishment; persistent responding when the drug is signalled to be unavailable (but not under extinction conditions); and increased motivation for the drug under a progressive ratio. This model captures individual vulnerability to addiction: only ~20% of rats initially exposed to cocaine develop punishment resistance after 100 days of drug use. About 50% of animals persist in oDASS responding punished with mild footshock, demonstrating another form of compulsion48.

Box 2. Animal models of drug-seeking behaviour.

Drug seeking in instrumental chained schedules

The most important feature of the seeking–taking chained schedule of drug reinforcement is that seeking responses on one lever (usually scheduled under random intervals) are never reinforced and instead only give access to a second, ‘taking’ lever, responses which deliver a drug intravenously. Such schedules have made possible investigation of the associative (goal-directed or habitual) structure underlying drug-seeking behaviour80,158,159.

For alcohol, responses on a single lever can be used to measure seeking behaviour, as they are followed by a nose-poke taking response that gives access to alcohol and the consummatory response of drinking51,70,82,99.

Cue-controlled drug seeking

Other procedures emphasize the impact of drug-associated conditioned stimuli (CSs) on drug seeking, particularly those in which CSs act as conditioned reinforcers because they are presented response contingently. In second-order schedules, seeking responses are reinforced by drug-associated CSs that bridge the delay to receiving an intravenous infusion of a drug or access to alcohol. Thus, seeking responses less tightly or less reliably deliver the drug and may thus be more susceptible to habitual control than are taking responses. Second-order schedules have been used to study cocaine, heroin and alcohol seeking160162, and to probe the neural circuitry underlying CS-controlled drug seeking and the transition from goal-directed behaviour to seeking habits163.

Incubation of craving

Cue-controlled seeking after abstinence has revealed the phenomenon called the ‘incubation of craving’, whereby a drug-conditioned reinforcer (a drug-associated CS delivered response contingently) supports seeking behaviour that progressively increases the longer the period of abstinence following long-access self-administration sessions164. This procedure models the considerable influence of drug cues on relapse after abstinence and, in some sense, models an aspect of the loss of control over drug seeking, and has been suggested to reflect enhanced cue-induced craving during abstinence in humans. A complex neural circuitry is involved in the incubation effect that may also be drug specific. Incubation of craving has been reviewed extensively165,166 and is not considered in detail here.

Extinction–reinstatement

This is a procedure that models relapse. After a limited period of drug taking, the instrumental lever press and the drug-associated CS are extinguished (that is, the drug is no longer made available). The reinstatement of responding is then assessed in a relapse test, during which lever presses result in CS presentation alone (which acts as a conditioned reinforcer), or following stress or a non-contingent injection of the drug. This procedure has revealed a rich circuitry involving the nucleus accumbens and the medial prefrontal cortex, as well as molecular mechanisms underlying relapse167169. However, we do not cover this topic in this Review, as the procedure is not designed to examine the neural basis of drug taking or seeking, except in the complex postinstrumental-extinction state. Moreover, instrumental extinction is not generally a feature of voluntary abstinence in addiction in humans.

Compulsive drug seeking

Compulsive drug seeking has been operationalized as persistent instrumental seeking behaviour despite the risk of, or actual, punishment of the seeking, but not the taking, response118,170. In seeking–taking schedules for cocaine or alcohol, mild footshock punishment of the seeking response occurs randomly on completion of 50% of the seeking intervals, instead of access to the taking response. Taking responses are never punished, and drug delivery is never associated with shock, thereby contrasting with models of compulsive drug taking in BOX 1. Only 20% of individuals develop compulsive drug seeking, thus capturing individual differences in vulnerability to addiction. In a new procedure based on optogenetic dopamine neuron self-stimulation in a seeking– taking chain, lever pressing during a random interval gives access to a taking lever that delivers optogenetic dopamine neuron self-stimulation, which is then punished. When seeking responses are punished, they continue in a fraction of a population of mice, indicating differential vulnerability to compulsivity48.

Box 3. Actions and habits, learning theory and circuits.

Three major aspects of learning may contribute to drug seeking and drug taking: Pavlovian conditioning, instrumental goal-directed learning and stimulus–response (habit) learning.

In Pavlovian conditioning, previously neutral stimuli (such as drug paraphernalia or drug-associated contexts) are conditioned to the effects of a drug because of their predictive relationship. Pavlovian conditioned stimuli (CSs) can motivate seeking and taking through Pavlovian–instrumental transfer (PIT), and can reinforce and sustain seeking responses by acting as conditioned reinforcers. PIT, conditioned reinforcement and their neural bases are not considered here but are reviewed elsewhere91,98,171,172. Notably, drug-associated CSs also capture attention and can elicit sign tracking90,173.

Goal-directed (instrumental) behaviours depend on the value of the goal (so-called action–outcome associations). Drug self-administration might be preceded by ‘foraging’ for the drug (or drug seeking), especially if it is only occasionally available. Entire sequences of behaviour may be maintained by drug-predictive stimuli; such stimuli become goals or subgoals in themselves and are termed ‘conditioned reinforcers’42.

The great majority of neural data come from studies with food reinforcement. Plasticity in the posterior dorsomedial striatum is required for acquiring and performing goal-directed action, as is the basolateral amygdala, which updates goal-value information81,171,174. By contrast, prelimbic projections to the posterior dorsomedial striatum are required to acquire, but not to perform, goal-directed action175. The lateral–ventral orbitofrontal cortex (OFC) controls the flexibility of instrumental behaviour when action–outcome or stimulus–outcome contingencies are devalued, or when Pavlovian stimuli guide choice176,177, whereas in rats the medial OFC is necessary for retrieving representations of the outcome in the absence of the expected outcome to guide goal-directed action178. In primates, certain OFC regions are required for revaluing rewards, whereas others re-evaluate rewards in behavioural choice and compare stimulus values in choice settings179 (that is, in goal-directed behaviour that requires decision-making).

Habitual behaviour is underpinned by stimulus–response associations and often occurs after extensive training, when behaviour becomes autonomous; that is, independent of outcome value and elicited by particular stimuli or contexts180. An instrumental behaviour can be diagnosed as habitual when it is performed even after goal devaluation (as with drug tolerance) or with contingency degradation. Habits place lower demand on limited cognitive resources and thus free them up for flexible adaptation to changing environmental contingencies, which requires effort and goal-directed control181.

Plasticity in the anterior dorsolateral striatum is required for acquiring and performing food-reinforced stimulus– response habits, as impairing its function reinstates seeking behaviour sensitive to outcome devaluation or contingency degradation171,182. The infralimbic cortex is also involved in habit learning183; optogenetic inhibition of the infralimbic cortex prevents habit acquisition184,185. However, no direct projections connect the infralimbic cortex to the anterior dorsolateral striatum, so how these areas interact remains to be established.

Notably, the nucleus accumbens (NAc) is not required for the acquisition or performance of instrumental behaviour. However, the core and shell of the NAc are differentially required for the impact of CSs on behaviour and for the influence of outcome value on responding172. Lesion or inactivation of the NAc does not affect instrumental learning or sensitivity to contingency degradation57,81,185,186. By contrast, the NAc core and shell are dissociably required for conditioned reinforcement187, PIT91 and sign tracking172,187.

Recent findings obtained with projection-specific transgenic mouse lines suggest that neurons projecting to the NAc core mediate association, whereas shell projectors encode motivation188. As a result, self-stimulation of shell projectors is fully supported only after conditioning has occurred. In the case of optogenetic dopamine neuron self-stimulation, both streams are activated together, explaining the very strong reinforcement.

Everyday behaviour probably consists of a mix of goal-directed and habitual behaviour. Goal-directed and habitual behaviour are often well coordinated, but may also compete. People who abuse stimulants may rely more on habitual control than on goal-directed control for laboratory-based tasks, suggesting an imbalance of these learning systems37.

Drug taking and reinforcement

Animal models of self-administration (BOX 1) show that all addictive drugs are positively reinforcing2. Acquisition of this instrumental behaviour requires DA signalling in the mesolimbic pathway, particularly by neurons of the ventral tegmental area (VTA) that project to the medial shell of the nucleus accumbens (NAc; part of the ventral striatum)35 (FIG. 1). This mesolimbic pathway has also long been implicated in the mediation of both the anticipation of and the reinforcing effects of many natural reinforcers including, notably, food6. Whereas both food and alcohol share a common consummatory response of ingestion, there is no equivalent consummatory response for intravenous drugs other than the self-administrating lever press. The clear separation between instrumental and consummatory behaviour for food has made possible a detailed analysis of the separate circuits involved in eating and instrumental responding for food7,8. However, the confounding of instrumental and consummatory (that is, instrumental self-administration) behaviour for intravenous drugs has made the analysis of these components both difficult and more ambiguous. It has been addressed by the introduction of so-called seeking–taking instrumental chained procedures (FIG. 2; BOX 2), which have largely illuminated the instrumental aspects of drug seeking, rather than drug taking, as is apparent in this Review.

Titrated.

Adjusted in intensity, to measure the effect of a stimulus on behaviour.

Fig. 1. Neural circuits engaged in drug seeking, drug taking and the transition to compulsion in addiction.

Fig. 1

a| Addictive drugs of different pharmacological classes have a common initial effect of increasing levels of dopamine in the nucleus accumbens (NAc) — particularly dopamine released by neurons projecting from the ventral tegmental area (VTA). This effect is viewed as crucial for initial drug reinforcement. Drug taking depends on plasticity of projections from the medial prefrontal cortex (mPFC) and orbitofrontal cortex (OFC) to the dorsomedial striatum (DMS). b | Initially, drug seeking is goal-directed and depends on the DMS and afferents from the mPFC and OFC. The NAc is not required for instrumental drug-taking behaviour but has a major role in mediating the reinforcing effects of drug-associated conditioned stimuli (CS) on seeking responses. The NAc is functionally related to the DMS via the serially looping circuitry that involves the VTA and substantia nigra par compacta (SNc). c | When drug seeking is well established, it is under the dominant control of the dorsolateral striatum (DLS; putamen in primates), which receives its major cortical afferents from the motor cortex (MC). The DLS may be recruited through the recurrent circuitry that links the NAc with the VTA and, progressively, via the substantia nigra, with the DLS. Note that back-projecting medium spiny neurons (MSNs) express dopamine D1 receptors and preferentially synapse onto GABA interneurons in the midbrain (green). The acquisition of CS-controlled drug seeking depends on the basolateral amygdala (BLA) and its projection to the NAc (shown in part b). The maintenance of established drug seeking habits does not depend on the BLA (shown in faded purple in part c) but instead depends on the central amygdala (CeA; purple in part c). The CeA has direct projections to the SNc and can therefore influence the dopaminergic innervation of the anterior DLS (aDLS). d | Compulsive drug seeking depends on the loss of prefrontal cortical ‘top-down’ control over the striatal mechanisms underlying drug-seeking habits (denoted by shading of the DLS and grey shading of the mPFC and OFC). This model may be contrasted with the ‘gain-of-function’ model derived from optogenetic dopamine neuron self-stimulation studies (FIG. 2). Thus, two models are discussed in the main text: one that sees compulsion as an excessively goaldirected action, possibly mediated by the OFC, and a contrasting model that builds on the role of habits in compulsion and the failure to disengage the DLS (see also FIG. 4). D2R, dopamine D2 receptor.

Fig. 2. Assessing compulsive drug-taking and drug-seeking behaviours in animal models.

Fig. 2

a| In an operant chamber with an active lever and an inactive lever, responding on the active lever results in drug infusion (drug taking), and a presented light stimulus becomes a drug conditioned stimulus through Pavlovian conditioning (left panel). Here, compulsive drug taking is defined as persistent responding when the lever press is punished at the same time as drug infusion (right panel). b | In a seeking–taking chained schedule of reinforcement, the animal learns to press a seeking lever under a random interval (for example, lasting 60 s on average, but ranging from 45 to 120 s). The lever press is never reinforced, but gives access to a second, taking lever, pressing on which results in drug infusion (usually after each press). During the test for compulsive seeking, pressing the seeking lever results in either access to the taking lever or mild footshock punishment delivered randomly on completion of half of the trials. Compulsive seeking is measured as persistent responding on the seeking lever under probabilistic punishment118.

In the case of drug taking, each substance has its own range of molecular targets, potentially creating a complex interoceptive experience mediated by a neural network including the insula9. Moreover, the increase in the level of mesolimbic DA that is elicited by most addictive drugs mediates reinforcement through distinct cellular mechanisms. For example, cocaine, a potent stimulant, blocks DA reuptake by neuronal terminals, whereas nicotine can directly depolarize DA neurons10. Alcohol has complex actions, including at GABA receptors and glutamate receptors, that cause DA transients in the NAc11. Opioids, by contrast, inhibit inhibitory interneurons that express the µ-opioid receptor and thereby disinhibit the firing of the VTA DA neurons12. There are transient increases in DA release in the NAc of mice injected with heroin (visualized using genetically encoded DA sensors)13 as a result of enhanced DA neuron activity14. Whether opioid drug reinforcement depends on mesolimbic DA has been questioned, however, because heroin self-administration persists after DA depletion and during blockade of DA receptors in the NAc1517. Furthermore, in individuals addicted to heroin who are receiving methadone substitution, intravenously administered heroin results in a marked euphoric ‘high’ but does not increase the level of striatal DA18 (reviewed in REF.19).

The initial mesolimbic DA level increase caused by a single dose of cocaine (and to varying extents by other addictive drugs3) potentiates excitatory afferents onto DA neurons in the VTA, thus leaving a trace that outlasts the drug’s presence in the brain20,21. This trace can be mimicked by optogenetically activating VTA DA neurons22. Repeated exposure to a drug can also induce synaptic transmission and plasticity in the NAc23. Excitatory synapses between neurons in the medial PFC (mPFC) or ventral hippocampus and DA D1 receptor (D1R)-expressing medium spiny neurons (MSNs) in the NAc are strengthened24. This form of plasticity has been linked to locomotor sensitization and cue-elicited reinstatement of taking responses and is also observed after optogenetic DA neuron self-stimulation (oDASS)25 (FIG. 3; BOX 1). Afferents onto DA D2 receptor (D2R)-expressing MSNs, particularly those from the basolateral amygdala (BLA), are also potentiated after extended access to high doses of cocaine26.

Escalation of drug intake.

Increase of drug intake during extended (long-access) periods of self-administration.

Fig. 3. Circuits undergoing gain of function with oDASS.

Fig. 3

a| Mice expressing channel rhodopsin in dopamine transporter-expressing neurons learn to self-stimulate dopamine neurons in the ventral tegmental area (VTA) — known as optogenetic dopamine neuron self-stimulation (oDASS). b | Once acquired, these adaptive behaviours are observed in all animals and depend on a potentiation of medial prefrontal cortex (mPFC) and ventral hippocampus (vHipp) to nucleus accumbens (NAc) afferents, particularly onto medium spiny neurons (MSNs) expressing dopamine D1 receptors (D1Rs). Synapses between basolateral amygdala (BLA) neurons and their dopamine D2 receptor-expressing targets in the NAc may also undergo potentiation, akin to observations with extended-access cocaine self-administration. c | In the last stage of the test, every third lever press is punished, yielding two distinct groups: one that perseveres with oDASS and the other that ceases oDASS. Ex vivo quantification of the synaptic strength of the orbitofrontal cortex (OFC)-to-dorsomedial striatum (DMS) projection shows enhanced connectivity in compulsive mice. Similar neural changes were observed in a seek–take version of oDASS. SNc, substantia nigra pars compacta.

Many drugs, particularly opioids and alcohol, cause an aversive withdrawal syndrome on the abrupt termination of long-term use that the animal learns to avoid (that is, through negative reinforcement)27. The circuit adaptations that mediate this negative reinforcement are triggered by stress mediators, such as corticotropin-releasing factor and noradrenaline in the extended amygdala, as well as by adaptations in the NAc itself28. For example, an upregulation of the dynorphin–κ-opioid receptor system in the brains of individuals addicted to alcohol or opioids has been argued to underlie dysphoria associated with withdrawal29. Notably, stress mediators eventually lead to a potentiation of excitatory projections onto D2R-expressing MSNs, such as those from the paraventricular thalamus, from the lateral or medial habenula or from the BLA30,31.

These initial positively and negatively reinforcing adaptations are prevalent as a result of drug self-administration and withdrawal, but do not necessarily reflect compulsion. However, they may contribute to the loss of control over drug taking (BOX 1) and the transition to compulsive drug taking or compulsive drug seeking.

Compulsion is central to addiction

Learning theory and the dopamine system.

Focus on a behavioural manifestation of addiction makes possible a useful theoretical analysis in terms of learning theory, which has helped to bridge computational accounts of behaviour and their implementation by discrete neural systems in the brain (BOX 3). Much of this learning theory was originally established in studies with experimental animals, but it has translated remarkably well to behavioural and functional neuroimaging studies of addiction in humans.

This learning theory perspective posits that addiction is a consequence of alterations in the activity of limbic– corticostriatal circuits (FIG. 1) that arise in part from the excessive DA modulation described above. Studies using positron emission tomography show that, whereas recreational stimulant use (without addiction) increases the release of DA in the ventral striatum32, addiction is associated with increased DA release in response to drug-associated cues in the putamen (which together with the caudate makes up the dorsal striatum)33. This cue-elicited DA release occurs despite the evidence that striatal dopaminergic transmission is blunted overall in people who are addicted34,35.

Burgeoning evidence suggests that addiction to drugs is associated with a general bias to a habitual (also known as ‘model-free’) mode of behaviour, as distinct from goal-directed (or ‘model-based’) behaviour. Habitual behaviour is generally associated with activity in the dorsolateral striatum (DLS; or the putamen in primates), whereas goal-directed behaviour is associated with activity in the dorsomedial striatum (DMS; or the caudate in primates) and the ventral striatum36 (FIG. 1). This is true of addiction to stimulants37,38, nicotine39 and alcohol40. In individuals addicted to alcohol40, brain areas implicated in goal-directed action (namely the ventromedial PFC and anterior striatum) were less active than in controls, whereas the posterior putamen, which is implicated in habit learning, was more engaged. These findings may be consistent with evidence of larger putamen volume in stimulant drug abusers and their relatives, although the functional link of such differences with habit learning has yet to be demonstrated41.

Reward thresholds.

The minimal stimulation intensities required to produce a reinforcing effect.

Natural reinforcers.

Rewards such as food and sex that motivate behaviour in animals and humans. They may be distinguished from artificial rewards such as addictive drugs that may nevertheless depend on the same neural systems in the brain.

Interoceptive.

Relating to internal, or bodily, states that are interpreted by the brain through a process called ‘interoception’. In the case of drugs, bodily changes (such as increases in heart rate caused by stimulants) are an important component of the subjective effects of drugs.

Stimulant.

A drug that increases arousal and activity. Amphetamine is a typical example of a stimulant, sometimes called a ‘psychomotor stimulant’.

Locomotor sensitization.

Enhanced motor responses to the same does of a stimulant drug that follows intermittent, repeated dosing.

Cue-elicited reinstatement.

Reinstatement of performance of previously extinguished drug-taking responses, supported by drug cues acting as conditioned reinforcers.

Extended amygdala.

Neuroanatomical term that includes the centromedial amygdala, bed nucleus of the stria terminalis and, according to some, the shell of the nucleus accumbens and a group of neurons in the basal forebrain that links these structures.

Dysphoria.

A state of unhappiness or suboptimal mood in humans.

Pavlovian–instrumental transfer.

(PIT). Transfer of learning whereby conditioned stimuli associated with a reward can increase a separately trained instrumental response for that reward (specific transfer) or for other rewards (general transfer).

Compulsive behaviour can be understood as a narrowing of the range of goals that persists in the face of adverse consequences. This apparent goal devaluation, including tolerance to the euphoric effects of the drug, further suggests that addictive behaviour itself indeed has habitual qualities. A major question arising from the learning theory perspective is therefore whether compulsive drug taking (which is synonymous with compulsive drug use) and compulsive drug seeking (or ‘foraging’, which often occurs when resources are scarce and at personal risk) are goal-directed or habitual. Moreover, habit-like behaviour can also be perseverative to the extent that it can be said to be ‘out of control’. Thus, to relate habits to compulsion, a second factor — that of a lack of top-down executive control over behaviour — probably also has to be postulated42.

Loss of top-down control.

Such a lack of top-down control can readily be postulated in addiction (but see the ‘gain-of-function’ scenario discussed later). Impairments in frontal functioning in addiction may result either from effects of the drugs themselves or constitutional, predisposing factors43,44. Thus, for example, loss of grey matter in the orbitofrontal cortex (OFC) has been observed as a function of the duration of drug use in addicted individuals, whereas a loss of white matter in the right inferior frontal gyrus, associated with impairments in response inhibition, has been observed not only in stimulant-addicted individuals but also in their non-drug-taking siblings44,45.

Recreational drug taking does not inevitably lead to fronto-executive impairments44 nor to addiction; only a relatively small proportion of people who use stimulant drugs, for example, become addicted46, and individual differences must therefore also be taken into account. However, for those individuals who do become addicted, there must presumably be some progressive change in the functioning of neural circuitries that mediates learning processes and their executive control.

Sign tracking.

Behaviour whereby the animal approaches a conditioned stimulus predictive of reward — as opposed to approaching a reward (or goal) directly (‘goal tracking’).

Contingency degradation.

Degradation of the predictive relationship between responses and outcomes; for example, by presentation of ‘free’ (that is, response-independent) outcomes or extinction.

Quinine.

A bitter crystalline compound present in cinchona bark that is used to adulterate an otherwise readily ingested liquid.

Compulsive drug taking

Compulsive drug taking refers to self-administration in the face of punishment or aversive consequences; for example, persistent drug taking despite mild footshock or, in the case of an orally consumed drug such as alcohol, adulteration (for example, with quinine).

In the three-criteria model of addiction to stimulants47 (BOX 1), rats were considered to show ‘addiction-like behaviour’ when they exhibited each of the three main behavioural features of addiction: first, a reduced ability to inhibit taking responses when cocaine was signalled as available; second, a heightened motivation for cocaine as assessed under a progressive ratio schedule of reinforcement; and third, persistent cocaine-taking responses when cocaine infusions were punished by footshock (compulsion). Rats showing all three criteria were classed as showing addiction-like behaviour. Rats showing the first and second criteria were not necessarily compulsive. Hence, rats showing none, one or two of the criteria made up the majority of the population and could be described as resilient47. Unbiased clustering algorithms for several behavioural parameters after the introduction of punishment have arrived at similar classifications in oDASS studies48.

There are large individual differences in the three-criteria model; only about 20% of rats that have an extended history of cocaine self-administration are compulsive in this sense47, matching the human data46. Furthermore, in rats showing addiction-like behaviour, long-term depression in the NAc was permanently impaired, whereas in non-addicted rats that maintained controlled drug intake, long-term depression progressively reduced49. Further studies will be required to identify the circuitry and mechanisms involved in these behavioural and plasticity changes. Notably, measuring compulsive opioid taking in the three-criteria model is not possible, because opioid drugs exert analgesic effects that confound any interpretation of perseverance in the face of punishment.

There has been surprisingly little analysis of the goal-directed or habitual nature of this drug-taking behaviour or its neural basis (FIG. 1). Nevertheless, some evidence suggests that in rats showing compulsive methamphetamine taking (as defined by the three-criteria model), there is an imbalance in the activity of different corticostriatal circuits, with more activity in OFC–DMS connections and less engagement of the mPFC– ventrolateral striatum circuitry50. Moreover, rats that initially choose alcohol over saccharin go on to display compulsive alcohol drinking, as assessed by resistance to footshock punishment or by perseverance despite quinine adulteration of alcohol51. In the latter study, expression of the GABA transporter GAT3 was selectively decreased in the brains of rats that preferred alcohol, and GAT3 expression is also decreased in the central amygdala (CeA) of alcohol-dependent humans, suggesting that pre-existing differences in GABAergic mechanisms are involved in the development of compulsive alcohol use. By contrast, a study of alcohol drinking in mice showed that a decrease in the activity of an mPFC–dorsal periaqueductal grey pathway during initial alcohol exposure predicted greater tolerance for quinine adulteration and compulsion (although the distribution of alcohol drinking remained unimodal)52. Further studies may reveal how this PFC–brainstem projection interacts with the circuits described above in the emergence of compulsive alcohol taking, perhaps implicating the CeA, which projects to similar brainstem regions as does the PFC53.

Progressive ratio schedule.

A behavioural procedure whereby the number of required responses increases after each reward delivery. The number of responses at which the animal ceases to respond is called the ‘break point’.

Analgesic.

Relating to the effects of a drug that relieves pain (for example, morphine).

Value updating.

The perception of a change in value of a reinforcer after an antecedent manipulation; for example, the value of food is decreased following ingestion of the food to satiety.

Incubation of cocaine craving.

The increase in instrumental responding for a drugassociated conditioned stimulus that occurs the longer the period of abstinence from drug taking.

Another tool for studying compulsive responding with primary reinforcement is that of mice trained in an oDASS protocol in which a mouse’s attempts to self-stimulate VTA neurons are punished on a certain proportion of occasions with footshocks25,48 (FIG. 3). In a yoked punishment design in which the total number of electric shocks received by each group was matched (so that quantification of shock-related neuronal expression of the immediate early gene Fos was unbiased), the activity of OFC projections to the dorsal striatum was enhanced in compulsive mice compared with non-compulsive mice. As in the three-criteria stimulant-addiction model, there were individual differences in punishment resistance, such that a bimodal distribution emerged in which about 50% of mice persisted in responding when oDASS was associated with footshock. Thus, strong activation of mesolimbic DA neurons can overcome the suppressant effect of punishment, and this effect may be related to compulsive stimulant taking.

Indeed, the enhanced activity of OFC–dorsal striatum projections in animals that compulsively selfadministered methamphetamine50 also emerged in mice exposed to a sensitizing regimen of non-contingent, systemically administered cocaine54. Furthermore, optogenetic inhibition of OFC neurons that project to the dorsal striatum in compulsive rats decreased responding, mirroring the permanently potentiated OFC–dorsal striatum transmission in punishment-resistant mice. Artificial potentiation of OFC–dorsal striatum synapses (through high-frequency optogenetic stimulation) also caused compulsive responding in punishment-sensitive mice, whereas depotentiation of these synapses in compulsive mice had the converse effect48.

Given the likely role of the OFC in goal-directed behaviour and value updating, the oDASS model might represent a form of compulsive responding that has a goal-directed basis. A recent study argues that OFC neurons specifically projecting to the dorsal striatum encode the integrated value of reward55. Thus, in the oDASS compulsion model, a gain of function in the OFC–striatal pathway might promote overestimation of the value of the drug experience relative to punishment, biasing instrumental behaviour towards drug consumption.

Controlled and compulsive drug seeking

Controlled drug seeking.

The acquisition (that is, early stages) of cocaine seeking that explicitly relies on a drugconditioned stimulus to reinforce responding depends on circuitry that links the BLA and the NAc core56–58 (FIG. 1). Hence, selective lesion or inactivation of either the BLA or the NAc, or their disconnection, has no effect on cocaine self-administration, but impairs or abolishes the acquisition of cocaine seeking under a second-order schedule of reinforcement5658. This difference emphasizes the distinction between drug taking (or the reinforcement of taking) and drug seeking at a neural circuit level. It is also consistent with the established function of this circuitry to mediate conditioned reinforcement59, which is the process that also underlies cued reinstatement of extinguished responding (a relapse procedure60) and the incubation of cocaine craving61 (BOX 1).

Synaptic potentiation of excitatory afferents selectively onto D1R-expressing MSNs may represent a cellular mechanism of cue-associated seeking and locomotor sensitization24,62. Daily cocaine exposure during a short interval (less than 3 hours) potentiated afferents from the mPFC and ventral hippocampus, whereas BLA afferents, especially those that projected to D2R-expressing MSNs, underwent plasticity only with extended cocaine access26. These drug-adaptive responses may relate to the physiological role of these projections. Neurons in the BLA-to-NAc projection that express the marker cholecystokinin (CCK) synapse preferentially onto D2R-expressing MSNs to encode aversive stimuli63. Similarly, the expression of the genes Ppp1r1b and Rspo2 marks neurons preferentially activated by reward-related or aversion-related stimuli, respectively63,64. It may be of interest to investigate whether these projections that encode aversive stimuli are involved in negative reinforcement.

Taken together, these findings indicate a circuit and synaptic basis of drug-seeking behaviour that is dissociable from the neural mechanisms of drug reinforcement that depend on activity in the NAc and underlie drug-taking behaviour.

From controlled to compulsive drug seeking.

To understand the distinction between controlled and compulsive drug seeking, it is important to consider two theoretical points. First, there may be an imbalance between the control of goal-directed actions versus habits over drug-seeking behaviour. Second, such an imbalance might represent enhanced habit formation together with decreased executive control over the performance of drug-seeking habits, or enhanced motivated performance of, and loss of executive control over, goal-directed actions. These potential theoretical explanations of compulsion were previously more vividly conceptualized as a distinction between the ‘must do’ and ‘must have’ nature of drug seeking65.

The neural circuitry that underlies goal-directed actions and stimulus–response habits has been identified through detailed analyses of instrumental responding for food rewards. In such analyses, devaluing the reinforcer or degrading the contingency does not prevent food-seeking behaviour in some animals, signifying a shift to habitual behaviour. This identified circuitry therefore provides the backdrop for studies of drug seeking6668 (FIG. 1; BOX 3).

Direct pathway.

Projection of dopamine D1 receptor-expressing medium spiny neurons in the striatum to the midbrain. The indirect pathway involves a striatal projection of dopamine D2 receptor-expressing medium spiny neurons to the pallidum.

Consistent with its goal-directed function, the posterior DMS (pDMS) in rats is required for the acquisition of cocaine seeking69 and for alcohol seeking at a time when responding is still sensitive to reinforcer devaluation70. Optogenetic inhibition of direct pathway MSNs (which express D1R) in the DMS decreased responding for access to alcohol71, whereas optogenetic stimulation of corticostriatal inputs in vivo at frequencies that induced long-term depression in DMS slices increased responding for alcohol — an effect that was blocked by a D2R antagonist, indicating a link between neuronal plasticity in the DMS and goal-directed drug seeking71.

Nevertheless, much converging evidence suggests a prominent role of the anterior DLS (aDLS; putamen in primates) in the transition from drug taking to compulsive drug seeking in the absence of the drug42,65,72. This role is especially supported in studies of drug cue-related reactivity in functional MRI studies of addiction in humans. Whereas people who drink alcohol or take cannabis on a recreational basis show increased activation in the ventral striatum on seeing drug-related cues, individuals who are addicted to alcohol, cannabis or stimulants exhibit enhanced cue reactivity in the dorsal striatum7375. Furthermore, in primates, long-term alcohol exposure resulted in increased dendritic spine density and enhanced glutamatergic transmission in the putamen, but not in the caudate (DMS)76. The excitability of striatal neurons was also enhanced in monkeys following heavy drinking, whereas GABAergic transmission was selectively suppressed in the putamen of these monkeys. These changes together indicate that prolonged heavy drinking results in a shift in the balance of inhibitory and excitatory transmission that promotes synaptic activation of putamen output neurons76.

Similarly, when rats learn over the course of several weeks to seek cocaine maintained by presentations of a cocaine-related cue, aDLS circuitry has a dominant role in controlling performance. Extracellular DA levels in the aDLS, but not in the NAc, are correlated with well-established cocaine-seeking behaviour77. Moreover, DA receptor blockade in the aDLS, but not in the NAc or pDMS, reduces cocaine seeking, but not taking, behaviour in animals that had been performing this behaviour over several weeks69,78. These observations imply that seeking eventually becomes reliant on the habit system. In the same task, heroin-seeking behaviour, over an extended course of training, similarly becomes dependent on dopaminergic transmission in the aDLS79.

Seeking–taking chained procedures are tasks in which animals must make one instrumental response (such as a lever press) to ‘seek’ the drug that enables them to then perform another distinct action, such as a nose poke or a different lever press to ‘take’ the drug (resulting in an intravenous drug infusion or the opportunity to drink alcohol or another addictive drug). One study trained animals to perform a seeking–taking chained procedure for alcohol and observed the same shift from pDMS to aDLS control in 4–7 weeks. This change was associated with a loss of sensitivity to reinforcer devaluation, indicating a transition from goal-directed to habit control70. Similarly, with the same behavioural paradigm, cocaine seeking became dependent on the aDLS and insensitive to reinforcer devaluation. When the aDLS was then inactivated, taking the habit system offline, goal-directed cocaine seeking sensitive to devaluation was reinstated80. Together, these observations imply that DMS-dependent goal-directed seeking behaviour is progressively dominated by habitual seeking behaviour that relies on DLS activity.

It is neither necessarily aberrant nor maladaptive for the instrumental seeking of drugs, or of any reward, to become habitual. In normal circumstances, habitual seeking of a food reward can return rapidly to goal-directed control in the presence of negative outcomes or negative feedback72,81. As Gremel et al.68 emphasize, it is not an individual who is either goal-directed or habitual; rather, it is the nature of control over an individual’s instrumental actions within a particular context or in the presence of particular conditioned stimuli. Although drug seeking may initially become habitual, it is the transition to compulsive drug seeking that establishes it as maladaptive, and this transition may entail a loss of executive control over its performance and a loss of goal-directed behaviour that also ensures its dominance. It is perhaps the difficulty in relinquishing drug-seeking habits and making possible a return to goal-directed control, as is readily the case for food-seeking habits, that further characterizes the maladaptive nature of such drug-seeking habits82.

Underlying the ventral-to-dorsal striatal transition in the control over habitual cocaine seeking65,83 is a spiralling pattern of connectivity between the midbrain and the striatum in primates84 and in rats85 (FIG. 1c). DA neurons of the medial VTA project to the medial NAc shell, from where GABA D1R-expressing MSNs project back to GABA interneurons in the midbrain, which can inhibit more laterally located DA neurons. These DA neurons, in turn, project to the core and eventually to the lateral shell, of the NAc. After several loops, this disinhibitory motif reaches the substantia nigra pars compacta (SNc), which sends its dopaminergic axons to the dorsal striatum. The mPFC exerts potent top-down control on the early loops of the spiral, whereas the OFC controls loops that originate in the SNc, suggesting the sequential involvement of the mPFC and OFC during the emergence of compulsion84,85.

This model of functional interaction between the ventral striatum and the dorsal striatum is supported by the finding that disconnecting the NAc core and the aDLS (by disabling each structure unilaterally but in opposite hemispheres) disrupts well-established cocaine seeking as effectively as does bilateral blockade of DA receptors in the aDLS86. Moreover, DA release in the aDLS, as measured by in vivo voltammetry during conditioned-stimulus-elicited cocaine seeking, was shown to depend on the integrity of the ipsilateral NAc core87. Electrophysiological stimulation of the medial ventral striatum was also demonstrated to alter the activity of VTA and SNc neurons projecting to the DLS88,89. However, the specific synaptic connectivity underlying the operation of this circuitry has yet to be fully investigated.

Thus, although aDLS circuitry becomes dominant in controlling well-established drug seeking, its engagement depends on connectivity with the ventral striatum. This provides a mechanism by which incentive processes, including Pavlovian association and conditioned reinforcement, dependent on activity in the NAc, can influence instrumental behaviours that are controlled by the dorsal striatum90,91. Recruitment of DA-dependent aDLS control over cocaine seeking depends on the BLA, whereas maintenance of the cocaine-seeking habit depends on the CeA and its DA-dependent functional interaction with the aDLS92, paralleling similar CeA–aDLS circuitry involved in habitual responding for food93. The amygdala and aDLS are not directly connected, however, so the circuitry mediating this functional interaction must be polysynaptic. Indeed, this has been demonstrated in vivo: stimulation of the BLA bidirectionally modulated aDLS MSN activity via glutamatergic mechanisms in the NAc core92. The pathways functionally linking the CeA to the DLS may involve direct projections from the CeA to the SNc that are also required for Pavlovian conditioned orienting towards food-conditioned stimuli94.

An imbalance between goal-directed and habitual drug seeking, with eventual dominance of the habit system, is much less evident in the control over drug taking, where, as we have seen, the NAc and DMS may retain a dominant role. This difference may reflect the relative predictability of the drug outcomes for taking versus seeking: whereas the relationship between drug-taking actions and outcomes is highly predictive of the drug effect, drug seeking less reliably predicts what are often delayed drug experiences95.

One cause of the progressive imbalance of systems controlling drug seeking may be that addictive drugs enhance the consolidation of habit learning in the DLS and its associated circuitry42,81. This is supported by findings that amphetamine, alcohol and nicotine can each accelerate the development of habitual control of seeking of natural rewards in animals70,96,97 and humans98. Moreover, habitual seeking responses for cocaine, alcohol and nicotine become resistant to reinforcer devaluation (implicating the habitual system) more rapidly than does responding for a food reward70,99101.

At a neural circuit level, this shift from goal-directed to habitual responding has been suggested to reflect an imbalance between the DMS and the DLS102. Consistent with this, inhibition of OFC projections to the DMS results in rats being entirely reliant on habit circuitry when goal-directed circuits would usually be dominant, and selective attenuation of glutamatergic transmission at OFC terminals in the DMS prevents the use of goal-directed behaviour11,68. Cannabinoid receptors (CB1 receptors) play a key gatekeeper role on OFC terminals in the DMS, as their deletion completely prevents the transition from goal-directed to habitual control over instrumental seeking behaviour68.

Drug-seeking habits might be a prerequisite sub-strate for compulsive behaviour to manifest, rather than — or in addition to — any bottom-up modulation of goal-directed behaviour by the motivational impact of processes such as sensitization or withdrawal, which may also modulate habit-based control. However, another likely mechanism for habit dysregulation is a loss of top-down executive control.

Loss of goal and control in compulsive drug seeking.

Impairments to top-down control over drug-seeking habits and impairments to the goal-directed system are emergent dysfunctional factors that, we hypothesize, jointly result in compulsive drug seeking65,68. Evidence of frontal lobe involvement in such processes is available from studies in humans41 and in experimental animal models103. For example, chronic stimulant selfadministration in rhesus monkeys causes substantial reductions in cortical glucose metabolism104, and amphetamine self-administration reduces spine density in the OFC of rats105. In humans addicted to cocaine, there is grey matter loss in widespread regions of the anterior cortex — including the OFC, inferior frontal cortex, cingulate, temporal gyrus and insula — as a function of stimulant use41, as well as hypometabolism in the frontal and cingulate cortices43.

Although the cellular correlates remain to be determined, such anatomically diverse frontal deficits can produce various distinct impairments in executive control. These include loss of goal representations following ventromedial PFC impairments (which may competitively advantage habits) and impairment of inhibitory control over both drug-seeking and drug-taking behaviour as a consequence of right inferior frontal cortex dysfunctional involvement, regardless of whether such behaviour is predominantly goal-directed or habitual45,106.

These impairments in executive control probably contribute to major decision-making deficits107,108, which may in turn exacerbate the drive to compulsive drug seeking and addiction. The loss of inhibitory control over habits may be especially important in this regard. Recent evidence109 from motor control theory has shown that, for simple actions, competing habitual and goal-directed responses occur in parallel, but that habits are automatically prepared at short latency. In the normal situation, a prepared habit can be held in check, to allow the slower, more reflective, goal-directed process to override it and occur instead. However, if this top-down inhibitory control is lost, the habitual tendency would ‘win’ the control over response output.

One challenge for investigating the neural circuit basis of compulsive drug seeking in animal models is that only a relatively small proportion of individuals develop this behaviour and, even then, only after an extended history of drug taking110. This strongly suggests that compulsivity is also a consequence of the effects of drugs administered over the long term on neural circuits that only manifests itself in vulnerable individuals47,51,111113. Consistent with this view are findings from a study114 in which the function of frontal structures implicated in the top-down control over drug seeking was experimentally impaired before initial drug self-administration. Lesions of the OFC or the anterior cingulate, prelimbic, infralimbic or anterior insular cortex did not result in either more rapid development of compulsive drug seeking or larger numbers of individuals displaying compulsion compared with non-lesioned controls. This strongly suggests that any impairments in cortical top-down control are emergent, as is also evident in human imaging studies41. Functional impairment of the lateral habenula was also found to have no effect on compulsive drug seeking115, despite its purported role in processing negative feedback116.

The most direct experimental evidence demonstrating that impaired cortical function can be a result of long-term drug exposure and causal in the development of compulsive drug seeking comes from studies in rats deploying a cocaine seeking–taking chained procedure with probabilistic punishment of seeking responses117. In confirming the original observation118 that compulsivity develops only in a subgroup of rats, it was shown that prelimbic neurons become markedly hypoactive after a history of long-term cocaine taking. Moreover, optogenetic stimulation of the quiescent prelimbic cortex in compulsive, punishment-resistant rats reduced their compulsive cocaine seeking, whereas optogenetic inhibition of the prelimbic cortex in non-compulsive rats increased seeking responses in the face of punishment117. However, the prelimbic cortex is implicated in pain perception119, and changes in sensitivity to footshock following optogenetic manipulation of the prelimbic cortex were not assessed or controlled for in this study; thus, it is possible that such changes could have contributed to punishment-resistant responding.

These data strongly suggest that hypoactivity in the prelimbic cortex at a time when cocaine seeking is insensitive to reinforcer devaluation (and is therefore habitual and aDLS dependent80) is causally involved in compulsive drug seeking. Moreover, in rats, seeking under punishment in the seeking–taking task was shown to depend specifically on a discrete zone of the aDLS120. Prelimbic neurons project to several subcortical sites, including the NAc core and the DMS, and so it is plausible that decreasing compulsive cocaine seeking by optogenetic stimulation of the prelimbic cortex is associated with re-establishment of goal-directed control over seeking behaviour, whereas optogenetic inhibition of this structure impairs the goal-directed system, leaving the habit system to dominate seeking behaviour. Supporting this view, long-term intermittent ethanol exposure and withdrawal was shown to disrupt top-down control over goal-directed action-selection processes, producing habits121, and to lead to selectively reduced OFC output to the direct output pathway in the DMS. By contrast, increasing the activity of OFC circuits restored goal-directed behaviour in these animals.

An alternative hypothesis to the model in which changes in prelimbic deficits promote compulsive drug seeking is that, by analogy with the models of compulsive drug taking outlined above, a ‘gain of function’ in the OFC leads to a revaluation of the reinforcing value of the drug versus the shock, in favour of the drug, leading to compulsive drug seeking. Preliminary supporting evidence comes not from drug seeking but from a modification of the oDASS mouse model25 in which seeking responses in a seeking–taking chain were punished122. OFC–dorsal striatum synapses were potentiated in compulsive mice (that is, mice whose seeking responses were resistant to punishment) compared with OFC–dorsal striatum synapses in non-compulsive or naive mice. By contrast, mPFC–medial dorsal striatum synapses in the compulsive mice remained unchanged. These observations suggest a pathological gain of function in parts of the PFC — including the OFC — contrasting with the results involving long-term drug self-administration summarized above. This gain-of-function model may represent an early phase of drug use as it does not capture the end state of stimulant addiction, which is characterized by hypometabolism of the OFC43, loss of grey matter throughout the PFC41 and presumed loss of PFC function, as exemplified by profound deficits in cognitive and executive functioning107,108,123.

The hypothesis that compulsive drug seeking is associated with an impaired ability to relinquish, rather than to engage, aDLS control over the seeking of alcohol was tested in a seeking–taking procedure with probabilistic punishment82 (FIG. 4a,b). In most rats, seeking but not taking behaviours became sensitive to (that is, were reduced by) DA receptor blockade in the aDLS over time. However, only a small proportion of rats — those in which the aDLS controlled seeking responses — were compulsive, persisting in alcohol seeking in the face of punishment. The non-compulsive individuals, which could inhibit alcohol seeking under punishment, were also able to disengage aDLS control over responding. By contrast, compulsive rats were unable to disengage the aDLS strongly. These results suggest that engaging the aDLS habit system is required for compulsive alcohol seeking to develop, and that an inability to disengage the aDLS might define the maladaptive nature of drug-seeking habits in addiction81,82,124. A key hypothesis to be tested is that disengagement of the aDLS in individuals resistant to developing compulsive alcohol seeking is accompanied by re-engagement of the DMS-dependent goal-directed system72,82.

Fig. 4. Two examples of the emergence of compulsion in rodents.

Fig. 4

a | Failure to disengage the dorsolateral striatum (DLS) reflecting compulsive alcohol consumption. Rats are trained to seek alcohol under a seeking–taking chained schedule of reinforcement (similar to that in FIG. 2b but in which, during training, pressing the taking lever leads to presentation of ethanol from a drinking port). Alcohol reinforcement depends on dopamine (DA) in the nucleus accumbens (NAc), and the acquisition of seeking responses depends on DA in the dorsomedial striatum (DMS) (both shaded blue in the striatal schematic on the left). The maintenance of well-established alcohol seeking loses its dependence on the DMS and becomes dependent on dopaminergic transmission in the anterior DLS (aDLS) (shaded dark blue in the striatal schematic on the right). b | A vulnerable subgroup of rats persists in seeking alcohol even when the seeking lever is probabilistically punished; that is, they are compulsive. Rats that seek alcohol compulsively are unable to disengage the aDLS; in these rats, seeking remains sensitive to DA receptor blockade in the aDLS. By contrast, rats that refrain from seeking lose their sensitivity to aDLS DA receptor blockade; that is, their seeking responses are no longer decreased by this manipulation as was the case earlier in training, when drug seeking was goal-directed. The development of compulsive alcohol seeking is therefore predicted both by engagement of the aDLS and by an inability to disengage it when compulsive, suggesting the maladaptive nature of compulsive alcohol-seeking habits. c | Optogenetic DA neuron self-stimulation (oDASS) results in the emergence of a bimodal distribution of compulsive seeking behaviour. Light stimulation leads to strong activation of the mesolimbic pathway and saturating DA transients in the NAc. Here, a seeking–taking chained schedule is illustrated during which medial prefrontal cortex (mPFC) control of the NAc undergoes synaptic plasticity through DA modulation of glutamatergic afferents. d | During the test phase, pressing the seeking lever is punished, which results in some animals renouncing pressing the taking lever, whereas others persist in pressing it. Strengthened orbitofrontal cortex (OFC)-to-DMS projections are the neural correlate of compulsive oDASS. DAT, dopamine transporter. Schematic in part b inspired by REF.82.

Addressing possible critiques

Although the notion that habitual processes have a role in compulsive drug seeking is consonant with neurobehavioural evidence in animals and many anecdotal accounts of drug addiction ‘habits’, there are important critiques of this position — some arising from misunderstandings of the original proposal and other, alternative, perspectives — that have to be taken into account.

The most basic misunderstanding is that drug seeking is incontrovertibly compulsive at the individual level in all situations and contexts. Although both constitutional factors and exposure to drugs may bias behaviour towards habits rather than top-down goal-directed control, such control is not always absent, and behaviour can be ameliorated by environmental contingencies. Thus, providing alternative goals or reinforcers and engaging choice processes is likely to reduce compulsive seeking, although we suggest it would hardly eradicate it. Some evidence in rats125 indicates that drug choices can be combated by making palatable food available as an alternative reward in a discrete trial, concurrent choice procedure. However, although it is certainly true that most rats may prefer the food reinforcer, several studies have shown that a small proportion of animals nevertheless prefer, for example, cocaine126 or alcohol51 — and we argue that this corresponds to the (probably small) proportion of animals destined to exhibit compulsive-seeking tendencies characteristic of addiction. In general, this is congruent with our own evidence that a relatively small proportion of cocaine abusers become addicted41. A similar point may be made with respect to so-called contingency management of addicted individuals127,128, which has shown some success. However, these points do not represent evidence against habitual compulsive drug seeking; habitual behaviour would be predicted to be combated by arranging a broader choice of goals.

Alternative accounts of compulsive drug seeking consider a radically distinct notion; namely, that it results from enhanced goal-directed tendencies associated specifically with drugs. This position is perhaps most compatible with the ‘gain-of-function’ oDASS model also reviewed here (FIG. 4c,d). This hypothesis was recently arrived at by Hogarth et al.129, mainly on the basis of the suggestion that individuals who use multiple drugs, including individuals who are dependent on both alcohol and nicotine, show control-like goal-directed behaviour, including that for drugs. It is difficult to test this hypothesis adequately in individuals diagnosed with substance use disorders, as shown by the recent study129 in which a heterogeneous group of individuals each abusing multiple drugs exhibited reduced motivation for water and food reinforcers, and yet showed normal devaluation when satiated. One possible, parsimonious interpretation of this result is that goal-directed motivation is impaired in these individuals, perhaps as a consequence of a narrowed choice of goals, consistent with behavioural and neural findings of apparent deficits in goal-directed behaviour and enhanced appetitive habitual behaviour in addiction37. Nevertheless, the theoretical possibility that goal-directed behaviour is heightened specifically for drug reinforcers remains to be tested experimentally. Enhanced goal-directed behaviours towards drugs could also represent a transition between drug abuse and addiction, before compulsive drug seeking assumes its habitual qualities.

Another expression of the goal-directed nature of compulsive drug seeking has been suggested to be its apparent flexibility. Achieving some variability in the expression of instrumental behaviour could be taken as evidence against the autonomous (often mistaken for automatic) nature of habits. However, mere variability in motoric expression does not necessarily provide evidence against perseverative tendencies; impaired reversal learning, for example, may involve overdue focus on cues formerly related to reward, regardless of their location. Furthermore, it is clear that ‘motor habits’ can be complex sequences of behaviour, as ‘slips of action’, such as the well-known dramatic example of arriving in one’s office at work rather than another, intended destination, owing to one’s habitual tendency. Moreover, new developments in learning theory postulate so-called successor representations that are intermediate between representations of action–outcome and stimulus– response associations, and that may reflect ‘cognitive habits’130 relevant to addiction. ‘Cognitive habits’ would not necessarily require invariant motor outputs and may even be internalized (for example, as obsessive thoughts about drugs).

One study reported an apparent invalidation of the notion that habits are central to compulsive drug seeking using a combination of the seeking–taking paradigm and the three-criteria addiction model in rats. However, this study tested the somewhat different proposition that habits are necessary for ‘the development of addiction-like behaviour in rats’131. In an elegant puzzle-solving seeking–taking task, rats learned to perform three distinct seeking–taking chains (involving different, heterogeneous sequences of lever pressing, nose poking and wheel turning) to self-administer cocaine. Under these conditions, seeking remained goal-directed (as previously shown132134) and was not reduced by DLS DA receptor blockade. Indeed, such DA receptor blockade, rather than reducing the motivation to seek cocaine, actually increased seeking, thereby indicating that the aDLS had in fact been engaged even in this situation, perhaps by removing competition between the habit and goal-directed systems. However, as this study did not go on to test the effects of probabilistic punishment of one or more of the seeking responses in the three chains, it remains unclear whether cocaine seeking can become compulsive when it is goal-directed82. Instead, the rats were trained to perform three instrumental chains over a long period, and then the resistance to punishment of a taking response was assessed in the three-criteria addiction model47. Although some individuals showed resistance to punishment, the nose-poke taking response tested was completely different to the previously trained instrumental chains. The authors of the study concluded that habits are not required for addiction-like behaviour, although only as assessed in the three-criteria model, which is most relevant to drug taking — consistent with the literature reviewed above suggesting that cocaine-taking responses may persist as goal-directed. However, they failed to test the hypothesis concerning the habitual qualities of compulsive drug seeking.

Contingency management.

A behavioural modification intervention that reinforces desired behaviour (such as abstinence) with incentives.

Reversal learning.

Learning of a reversal of the reward contingencies of two options, reflecting behavioural adaptation to environmental change.

Concluding summary

We have identified neurobehavioural mechanisms that contribute to the transition to compulsive behaviour in addiction to several classes of drugs of abuse, including stimulants such as cocaine and alcohol. We distinguish between compulsive drug-seeking and compulsive drug-taking behaviour, which are frequently, perhaps mistakenly, conflated in the published literature. Both may contribute to the key symptoms of addiction specified in the DSM-5, although drug-seeking behaviour is obviously a defining element. We have further made a distinction between controlled and compulsive drug seeking: the former being in the early stages of drug abuse and goal-directed, and the latter, during the course of addiction and involving progressive habitual control. Compulsive drug seeking hypothetically depends both on increasing involvement of the dorsal striatum and also on the progressive loss of top-down, executive control resulting from a loss of PFC and cingulate cortex function. This transition consists not only of a loss of goal representations, which leads to preferential habitual control, but also of a loss of inhibitory control over habits, complementing a possible strengthening of their bottom-up striatal mediation.

We have taken the opportunity in this Review to incorporate the latest data obtained using the novel approach of direct optogenetic stimulation of VTA DA neurons (oDASS), which, through two different procedures, can result in compulsive seeking and taking behaviours. Such compulsive responding is associated with a gain of function in OFC neurons that drive activity in the dorsal striatum. At first sight, this apparently contradicts the main hypothesis presented here of distinct mechanisms underlying drug seeking and drug use (see also BOX 4 for open questions). For human addiction9 and animal models of drug addiction, it is currently unclear how this gain of function relates to the reduction of striatal DA activity34,135,136 and OFC activity41,103,105,137 repeatedly demonstrated in earlier studies of animals and humans — and therefore to the loss of top-down control of striatal function exhibited that results from long-term drug taking. However, there are two possible reasons for the apparent discrepancy. First, it may be that the oDASS procedure reflects an early stage in the transition from compulsive use to drug addiction, after which OFC–striatal gain of function precedes OFC–striatal loss of function. Future studies with addictive drugs will be able to test directly whether a phase of gain of function also occurs in the transition to compulsive drug seeking. Second, drugs (such as amphetamine) might exert less-specific neurochemical actions and even neurotoxic effects at diverse neural loci compared with the highly cell-specific optogenetic stimulation.

Box 4. Outstanding questions.

Although the application of learning theory to our understanding of drug addiction, along with the latest generation of circuit investigations, has advanced our understanding of the emergence of compulsion in addiction, many questions remain unanswered. Here we list some that we feel are within reach of being experimentally addressed soon.

  • Is activation of the mesolimbic dopamine system a necessary and sufficient mechanism of reinforcement for all addictive drugs, and what is the role of the dopamine system in the addicted state?

  • Can optogenetic self- stimulation of mesolimbic dopamine neurons serve as a tool to reveal the mechanisms of addiction shared by all drugs of abuse?

  • How can the subjective experience of drug consumption in humans and its underlying neural circuitry best be studied using operational measures in animals while avoiding anthropomorphic inferences?

  • Is there a transition from goal- directed to habitual drug taking in addiction, or does drug- taking behaviour persist as goal- directed? Is there a transition from a gain of function of orbitofrontal cortex to dorsomedial striatum projections to a loss of function between the pre- addicted state and end- stage addiction?

  • Does the transition to compulsive drug seeking involve a loss of ‘top- down’ control from prefrontal circuits to striatal circuits? Alternatively, is compulsive behaviour a manifestation of a goal- directed choice of a hypervalued drug over negative consequences?

  • What is the utility of concurrent choice procedures in models and the treatment of addiction and in understanding the hypothetical narrowing of goals that occurs in the addiction process?

The latter important point is crucial for evaluating how optogenetic methods may be best utilized in addiction research. They may be especially important for testing predictions based on neuropsychological models (such as in REF.65) about likely neural circuitry that can be mechanistically characterized using this approach48. The use of optogenetics and similar circuit interrogation methods may also pave the way to possible new treatments. For example, as drug seeking occurs by definition before obtaining a drug and in order to obtain a drug, it is a key element of the process of relapse after abstinence, as well as of the addicted state. Treatments designed to reduce the drug-seeking phase, rather than drug use or taking, therefore have great potential. A circuit model is emerging that explains addiction as altered strength of synaptic connections, opening the possibility for refining circuit therapies such as deep brain stimulation138 or transcranial magnetic stimulation139 currently used for other indications to be used to reduce drug-seeking behaviour and enhance clinical success.

Acknowledgements

The authors acknowledge the Swiss National Science Foundation and the European Research Council (C.L.) as well as the UK Medical Research Council (grant MR/N02530X/1) and the Wellcome Trust (Investigator Award WT 104631/Z/14/Z/) for financial support (B.J.E., T.W.R.). The authors thank D. Belin and M. Loureiro for help with the figures.

Footnotes

Author contributions

All authors researched data for the article, contributed substantially to discussion of the article’s content, wrote the article, and reviewed and edited the manuscript before submission.

Competing interests

C.L. has no competing interests. T.W.R. consults for Cambridge Cognition, Unilever, Cassava and Greenfield BioVentures. He holds research grants from Shionogi & Co., Ltd and GlaxoSmithKline plc., and receives royalties from Cambridge Cognition for CANTAB. He also receives editorial honoraria from Springer and Elsevier. B.J.E. has no competing interests.

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  • 1.American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders: DSM-5. American Psychiatric Association; 2013. [Google Scholar]
  • 2.Sanchis-Segura C, Spanagel R. Behavioural assessment of drug reinforcement and addictive features in rodents: an overview. Addict Biol. 2006;11:2–38. doi: 10.1111/j.1369-1600.2006.00012.x. [DOI] [PubMed] [Google Scholar]
  • 3.Di Chiara G, Imperato A. Drugs abused by humans preferentially increase synaptic dopamine concentrations in the mesolimbic system of freely moving rats. Proc Natl Acad Sci USA. 1988;85:5274–5278. doi: 10.1073/pnas.85.14.5274. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Lammel S, et al. Input-specific control of reward and aversion in the ventral tegmental area. Nature. 2012;491:212–217. doi: 10.1038/nature11527. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Yang H, et al. Nucleus accumbens subnuclei regulate motivated behavior via direct inhibition and disinhibition of VTA dopamine subpopulations. Neuron. 2018;97:434–449 e434. doi: 10.1016/j.neuron.2017.12.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Kelley A, Berridge K. The neuroscience of natural rewards: relevance to addictive drugs. J Neurosci. 2002;22:3306–3311. doi: 10.1523/JNEUROSCI.22-09-03306.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Stuber GD, Wise RA. Lateral hypothalamic circuits for feeding and reward. Nat Neurosci. 2016;19:198–205. doi: 10.1038/nn.4220. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Balleine B. Neural bases of food-seeking: affect, arousal and reward in corticostriatolimbic circuits. Phys Behav. 2005;86:717–730. doi: 10.1016/j.physbeh.2005.08.061. [DOI] [PubMed] [Google Scholar]
  • 9.Naqvi N, Rudrauf D, Damasio H, Bechara A. Damage to the insula disrupts addiction to cigarette smoking. Science. 2007;315:531–534. doi: 10.1126/science.1135926. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Lüscher C, Ungless M. The mechanistic classification of addictive drugs. PLoS Med. 2006;3:e437. doi: 10.1371/journal.pmed.0030437. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Abrahao KP, Salinas AG, Lovinger DM. Alcohol and the brain: neuronal molecular targets, synapses, and circuits. Neuron. 2017;96:1223–1238. doi: 10.1016/j.neuron.2017.10.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12.Johnson S, North R. Opioids excite dopamine neurons by hyperpolarization of local interneurons. J Neurosci. 1992;12:483–488. doi: 10.1523/JNEUROSCI.12-02-00483.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Patriarchi T, et al. Ultrafast neuronal imaging of dopamine dynamics with designed genetically encoded sensors. Science. 2018;360:eaat4422. doi: 10.1126/science.aat4422. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14.Corre J, et al. Dopamine neurons projecting to medial shell of the nucleus accumbens drive heroin reinforcement. eLife. 2018 doi: 10.7554/eLife.39945. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Ettenberg A, Pettit HO, Bloom FE, Koob GF. Heroin and cocaine intravenous self-administration in rats: mediation by separate neural systems. Psychopharmacology. 1982;78:204–209. doi: 10.1007/BF00428151. [DOI] [PubMed] [Google Scholar]
  • 16.Pettit H, Ettenberg A, Bloom F, Koob G. Destruction of dopamine in the nucleus accumbens selectively attenuates cocaine but not heroin self-administration in rats. Psychopharmacology. 1984;84:167–173. doi: 10.1007/BF00427441. [DOI] [PubMed] [Google Scholar]
  • 17.Ting AKR, van der Kooy D. The neurobiology of opiate motivation. Cold Spring Harb Perspect Med. 2012 doi: 10.1101/cshperspect.a012096. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Daglish MR, et al. Brain dopamine response in human opioid addiction. Br J Psychiatry. 2008;193:65–72. doi: 10.1192/bjp.bp.107.041228. [DOI] [PubMed] [Google Scholar]
  • 19.Nutt DJ, Lingford-Hughes A, Erritzoe D, Stokes PR. The dopamine theory of addiction: 40 years of highs and lows. Nat Rev Neurosci. 2015;16:305–312. doi: 10.1038/nrn3939. [DOI] [PubMed] [Google Scholar]
  • 20.Saal D, Dong Y, Bonci A, Malenka RC. Drugs of abuse and stress trigger a common synaptic adaptation in dopamine neurons. Neuron. 2003;37:577–582. doi: 10.1016/s0896-6273(03)00021-7. [DOI] [PubMed] [Google Scholar]
  • 21.Ungless MA, Whistler JL, Malenka RC, Bonci A. Single cocaine exposure in vivo induces long-term potentiation in dopamine neurons. Nature. 2001;411:583–587. doi: 10.1038/35079077. Original demonstration of drug-evoked synaptic plasticity in VTA dopamine neurons after a single injection of cocaine. [DOI] [PubMed] [Google Scholar]
  • 22.Brown MT, et al. Drug-driven AMPA receptor redistribution mimicked by selective dopamine neuron stimulation. PLoS One. 2010;5:e15870. doi: 10.1371/journal.pone.0015870. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Mameli M, et al. Cocaine-evoked synaptic plasticity: persistence in the VTA triggers adaptations in the NAc. Nat Neurosci. 2009;12:1036–1041. doi: 10.1038/nn.2367. [DOI] [PubMed] [Google Scholar]
  • 24.Pascoli V, et al. Contrasting forms of cocaine-evoked plasticity control components of relapse. Nature. 2014;509:459–464. doi: 10.1038/nature13257. [DOI] [PubMed] [Google Scholar]
  • 25.Pascoli V, Terrier J, Hiver A, Lüscher C. Sufficiency of mesolimbic dopamine neuron stimulation for the progression to addiction. Neuron. 2015;88:1054–1066. doi: 10.1016/j.neuron.2015.10.017. [DOI] [PubMed] [Google Scholar]
  • 26.Terrier J, Lüscher C, Pascoli V. Cell-type specific insertion of GluA2-lacking AMPARs with cocaine exposure leading to sensitization, cue-induced seeking, and incubation of craving. Neuropsychopharmacology. 2016;41:1779–1789. doi: 10.1038/npp.2015.345. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27.Koob G, Le Moal M. Drug addiction, dysregulation of reward, and allostasis. Neuropsychopharmacology. 2001;24:97–129. doi: 10.1016/S0893-133X(00)00195-0. [DOI] [PubMed] [Google Scholar]
  • 28.Koob GF, Volkow ND. Neurobiology of addiction: a neurocircuitry analysis. Lancet Psychiatry. 2016;3:760–773. doi: 10.1016/S2215-0366(16)00104-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Koob G. Neurobiological substrates for the dark side of compulsivity in addiction. Neuropharmacology. 2009;56(1):18–31. doi: 10.1016/j.neuropharm.2008.07.043. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Zhu Y, Wienecke CF, Nachtrab G, Chen X. A thalamic input to the nucleus accumbens mediates opiate dependence. Nature. 2016;530:219–222. doi: 10.1038/nature16954. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Boulos LJ, et al. Mu opioid receptors in the medial habenula contribute to naloxone aversion. Neuropsychopharmacology. 2020;45:247–255. doi: 10.1038/s41386-019-0395-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Boileau I, et al. Modeling sensitization to stimulants in humans: an [11C]raclopride/positron emission tomography study in healthy men. Arch Gen Psychiatry. 2006;63:1386–1395. doi: 10.1001/archpsyc.63.12.1386. [DOI] [PubMed] [Google Scholar]
  • 33.Wong D, et al. Increased occupancy of dopamine receptors in human striatum during cue-elicited cocaine craving. Neuropsychopharmacology. 2006;31:2716–2727. doi: 10.1038/sj.npp.1301194. [DOI] [PubMed] [Google Scholar]
  • 34.Trifilieff P, Ducrocq F, van der Veldt S, Martinez D. Blunted dopamine transmission in addiction: potential mechanisms and implications for behavior. Semin Nucl Med. 2017;47:64–74. doi: 10.1053/j.semnuclmed.2016.09.003. [DOI] [PubMed] [Google Scholar]
  • 35.Volkow ND, et al. Stimulant-induced dopamine increases are markedly blunted in active cocaine abusers. Mol Psychiatry. 2014;19:1037–1043. doi: 10.1038/mp.2014.58. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Balleine BW, O’Doherty JP. Human and rodent homologies in action control: corticostriatal determinants of goal-directed and habitual action. Neuropsychopharmacology. 2010;35:48–69. doi: 10.1038/npp.2009.131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Ersche KD, et al. Carrots and sticks fail to change behavior in cocaine addiction. Science. 2016;352:1468–1471. doi: 10.1126/science.aaf3700. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Voon V, et al. Disorders of compulsivity: a common bias towards learning habits. Mol Psychiatry. 2014;20:345–352. doi: 10.1038/mp.2014.44. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Luijten M, et al. Goal-directed and habitual control in smokers. Nicotine Tob Res. 2019;22:188–195. doi: 10.1093/ntr/ntz001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Sjoerds Z, et al. Behavioral and neuroimaging evidence for overreliance on habit learning in alcohol-dependent patients. Transl Psychiatry. 2013;3:e337. doi: 10.1038/tp.2013.107. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Ersche KD, Williams GB, Robbins TW, Bullmore ET. Meta-analysis of structural brain abnormalities associated with stimulant drug dependence and neuroimaging of addiction vulnerability and resilience. Curr Opin Neurobiol. 2013;23:615–624. doi: 10.1016/j.conb.2013.02.017. [DOI] [PubMed] [Google Scholar]
  • 42.Everitt BJ, Robbins TW. Drug addiction: updating actions to habits to compulsions ten years on. Annu Rev Psychol. 2016;67:23–50. doi: 10.1146/annurev-psych-122414-033457. [DOI] [PubMed] [Google Scholar]
  • 43.Goldstein R, Volkow N. Drug addiction and its underlying neurobiological basis: neuroimaging evidence for the involvement of the frontal cortex. Am J Psychiatry. 2002;159:1642–1652. doi: 10.1176/appi.ajp.159.10.1642. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Ersche K, et al. Abnormal structure of frontostriatal brain systems is associated with aspects of impulsivity and compulsivity in cocaine dependence. Brain. 2011;134:2013–2024. doi: 10.1093/brain/awr138. Demonstration that neural mechanisms of impaired top-down inhibitory control in human stimulant addiction are pre-existing rather than caused by drug exposure. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Ersche K, et al. Abnormal brain structure implicated in stimulant drug addiction. Science. 2012;335:601–604. doi: 10.1126/science.1214463. [DOI] [PubMed] [Google Scholar]
  • 46.Anthony JC, Warner LA, Kessler RC. Comparative epidemiology of dependence on tobacco, alcohol, controlled substances, and inhalants: basic findings from the National Comorbidity Survey. Exp Clin Psychopharmacol. 1994;2:244–268. [Google Scholar]
  • 47.Deroche-Gamonet V, Belin D, Piazza PV. Evidence for addiction-like behavior in the rat. Science. 2004;305:1014–1017. doi: 10.1126/science.1099020. First demonstration of individual differences using an addiction model in rats. [DOI] [PubMed] [Google Scholar]
  • 48.Pascoli V, et al. Stochastic synaptic plasticity underlying compulsion in a model of addiction. Nature. 2018;564:366–371. doi: 10.1038/s41586-018-0789-4. Demonstration of apparent ‘gain of function’ in orbitofrontal–striatal circuit mediating compulsive reward-related behaviour caused by selective optogenetic stimulation of VTA dopamine neurons. [DOI] [PubMed] [Google Scholar]
  • 49.Kasanetz F, et al. Transition to addiction is associated with a persistent impairment in synaptic plasticity. Science. 2010;328:1709–1712. doi: 10.1126/science.1187801. [DOI] [PubMed] [Google Scholar]
  • 50.Hu YZ, et al. Compulsive drug use is associated with imbalance of orbitofrontal- and prelimbic-striatal circuits in punishment-resistant individuals. Proc Natl Acad Sci USA. 2019;116:9066–9071. doi: 10.1073/pnas.1819978116. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Augier E, et al. A molecular mechanism for choosing alcohol over an alternative reward. Science. 2018;360:1321–1326. doi: 10.1126/science.aao1157. Individual differences in choice preference for adulterated alcohol over saccharin predict subsequent compulsive drinking in rats and correlate with downregulation of GABAergic function in the central amygdala, relevant to parallel human studies of alcohol dependence. [DOI] [PubMed] [Google Scholar]
  • 52.Siciliano CA, et al. A cortical-brainstem circuit predicts and governs compulsive alcohol drinking. Science. 2019;366:1008–1012. doi: 10.1126/science.aay1186. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Lee H, Gallagher M, Holland P. The central amygdala projection to the substantia nigra reflects prediction error information in appetitive conditioning. Learn Mem. 2010;17:531–538. doi: 10.1101/lm.1889510. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Wall NR, et al. Complementary genetic targeting and monosynaptic input mapping reveal recruitment and refinement of distributed corticostriatal ensembles by cocaine. Neuron. 2019;104:916–930 e915. doi: 10.1016/j.neuron.2019.10.032. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 55.Hirokawa J, Vaughan A, Masset P, Ott T, Kepecs A. Frontal cortex neuron types categorically encode single decision variables. Nature. 2019;576:446–451. doi: 10.1038/s41586-019-1816-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 56.Whitelaw RB, Markou A, Robbins TW, Everitt BJ. Excitotoxic lesions of the basolateral amygdala impair the acquisition of cocaine-seeking behaviour under a second-order schedule of reinforcement. Psychopharmacology. 1996;127:213–224. [PubMed] [Google Scholar]
  • 57.Ito R, Robbins T, Everitt B. Differential control over cocaine-seeking behavior by nucleus accumbens core and shell. Nat Neurosci. 2004;7:389–397. doi: 10.1038/nn1217. [DOI] [PubMed] [Google Scholar]
  • 58.Di Ciano P, Everitt BJ. Direct interactions between the basolateral amygdala and nucleus accumbens core underlie cocaine-seeking behavior by rats. J Neurosci. 2004;24:7167–7173. doi: 10.1523/JNEUROSCI.1581-04.2004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Cador M, Robbins TW, Everitt BJ. Involvement of the amygdala in stimulus-reward associations: interaction with the ventral striatum. Neuroscience. 1989;30:77–86. doi: 10.1016/0306-4522(89)90354-0. [DOI] [PubMed] [Google Scholar]
  • 60.Mcfarland K, Kalivas P. The circuitry mediating cocaine-induced reinstatement of drug-seeking behavior. J Neurosci. 2001;21:8655–8663. doi: 10.1523/JNEUROSCI.21-21-08655.2001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 61.Pickens C, et al. Neurobiology of the incubation of drug craving. Trends Neurosci. 2011;34:411–420. doi: 10.1016/j.tins.2011.06.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 62.Pascoli V, Turiault M, Lüscher C. Reversal of cocaine-evoked synaptic potentiation resets drug-induced adaptive behaviour. Nature. 2011;481:71–75. doi: 10.1038/nature10709. First demonstration that the reversal of drug-evoked synaptic potentiation abolishes adaptive behaviour. [DOI] [PubMed] [Google Scholar]
  • 63.Shen CJ, et al. Cannabinoid CB1 receptors in the amygdalar cholecystokinin glutamatergic afferents to nucleus accumbens modulate depressive-like behavior. Nat Med. 2019;25:337–349. doi: 10.1038/s41591-018-0299-9. [DOI] [PubMed] [Google Scholar]
  • 64.Kim J, Pignatelli M, Xu S, Itohara S, Tonegawa S. Antagonistic negative and positive neurons of the basolateral amygdala. Nat Neurosci. 2016;19:1636–1646. doi: 10.1038/nn.4414. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 65.Everitt BJ, Robbins TW. Neural systems of reinforcement for drug addiction: from actions to habits to compulsion. Nat Neurosci. 2005;8:1481–1489. doi: 10.1038/nn1579. [DOI] [PubMed] [Google Scholar]
  • 66.Balleine B, Liljeholm M, Ostlund S. The integrative function of the basal ganglia in instrumental conditioning. Behav Brain Res. 2009;199:43–52. doi: 10.1016/j.bbr.2008.10.034. [DOI] [PubMed] [Google Scholar]
  • 67.Hilario MR, Costa RM. High on habits. Front Neurosci. 2008;2:208–217. doi: 10.3389/neuro.01.030.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Gremel CM, et al. Endocannabinoid modulation of orbitostriatal circuits gates habit formation. Neuron. 2016;90:1312–1324. doi: 10.1016/j.neuron.2016.04.043. Molecular basis of possible top-down prefrontal circuitry regulating habit formation. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 69.Murray JE, Belin D, Everitt BJ. Double dissociation of the dorsomedial and dorsolateral striatal control over the acquisition and performance of cocaine seeking. Neuropsychopharmacology. 2012;37:2456–2466. doi: 10.1038/npp.2012.104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Corbit L, Nie H, Janak P. Habitual alcohol seeking: time course and the contribution of subregions of the dorsal striatum. Biol Psychiatry. 2012;72:389–395. doi: 10.1016/j.biopsych.2012.02.024. Experimental evidence for the involvement of the dorsal striatum in alcohol-seeking behaviour. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Hellard ER, et al. Optogenetic control of alcohol-seeking behavior via the dorsomedial striatal circuit. Neuropharmacology. 2019;155:89–97. doi: 10.1016/j.neuropharm.2019.05.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 72.Ostlund S, Balleine B. On habits and addiction: an associative analysis of compulsive drug seeking. Drug Discov Today Dis Model. 2008;5:235–245. doi: 10.1016/j.ddmod.2009.07.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Vollstadt-Klein S, et al. Initial, habitual and compulsive alcohol use is characterized by a shift of cue processing from ventral to dorsal striatum. Addiction. 2010;105:1741–1749. doi: 10.1111/j.1360-0443.2010.03022.x. [DOI] [PubMed] [Google Scholar]
  • 74.Zhou A, et al. Cue reactivity in the ventral striatum characterizes heavy cannabis use, whereas reactivity in the dorsal striatum mediates dependent use. Biol Psychiatry Cogn Neurosci Neuroimaging. 2019;4:751–762. doi: 10.1016/j.bpsc.2019.04.006. [DOI] [PubMed] [Google Scholar]
  • 75.Volkow ND, et al. Cocaine cues and dopamine in dorsal striatum: mechanism of craving in cocaine addiction. J Neurosci. 2006;26:6583–6588. doi: 10.1523/JNEUROSCI.1544-06.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Cuzon Carlson VC, et al. Synaptic and morphological neuroadaptations in the putamen associated with long-term, relapsing alcohol drinking in primates. Neuropsychopharmacology. 2011;36:2513–2528. doi: 10.1038/npp.2011.140. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Ito R, Dalley J, Robbins T, Everitt B. Dopamine release in the dorsal striatum during cocaine-seeking behavior under the control of a drug-associated cue. J Neurosci. 2002;22:6247–6253. doi: 10.1523/JNEUROSCI.22-14-06247.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Vanderschuren L, Di Ciano P, Everitt B. Involvement of the dorsal striatum in cue-controlled cocaine seeking. J Neurosci. 2005;25:8665–8670. doi: 10.1523/JNEUROSCI.0925-05.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Hodebourg R, et al. Heroin seeking becomes dependent on dorsal striatal dopaminergic mechanisms and can be decreased by N-acetylcysteine. Eur J Neurosci. 2019;50:2036–2044. doi: 10.1111/ejn.13894. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 80.Zapata A, Minney V, Shippenberg T. Shift from goal-directed to habitual cocaine seeking after prolonged experience in rats. J Neurosci. 2010;30:15457–15463. doi: 10.1523/JNEUROSCI.4072-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 81.Smith RJ, Laiks LS. Behavioral and neural mechanisms underlying habitual and compulsive drug seeking. Prog Neuropsychopharmacol Biol Psychiatry. 2018;87:11–21. doi: 10.1016/j.pnpbp.2017.09.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 82.Giuliano C, Belin D, Everitt BJ. Compulsive alcohol seeking results from a failure to disengage dorsolateral striatal control over behavior. J Neurosci. 2019;39:1744–1754. doi: 10.1523/JNEUROSCI.2615-18.2018. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 83.Everitt BJ, Robbins TW. From the ventral to the dorsal striatum: devolving views of their roles in drug addiction. Neurosci Biobehav Rev. 2013 doi: 10.1016/j.neubiorev.2013.02.010. [DOI] [PubMed] [Google Scholar]
  • 84.Haber S, Fudge J, McFarland N. Striatonigrostriatal pathways in primates form an ascending spiral from the shell to the dorsolateral striatum. J Neurosci. 2000;20:2369–2382. doi: 10.1523/JNEUROSCI.20-06-02369.2000. Groundbreaking neuroanatomical study providing an anatomical substrate for the transition to compulsive drug-seeking behaviour, based on discovery of a spiralling organization of striatal feed-forward circuitry. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Ikemoto S. Dopamine reward circuitry: two projection systems from the ventral midbrain to the nucleus accumbens-olfactory tubercle complex. Brain Res Rev. 2007;56:27–78. doi: 10.1016/j.brainresrev.2007.05.004. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Belin D, Everitt BJ. Cocaine seeking habits depend upon dopamine-dependent serial connectivity linking the ventral with the dorsal striatum. Neuron. 2008;57:432–441. doi: 10.1016/j.neuron.2007.12.019. A critical test of the relative roles of the ventral striatum and the dorsal striatum in the transition to compulsive drug-seeking behaviour. [DOI] [PubMed] [Google Scholar]
  • 87.Willuhn I, Burgeno LM, Everitt BJ, Phillips PE. Hierarchical recruitment of phasic dopamine signaling in the striatum during the progression of cocaine use. Proc Natl Acad Sci USA. 2012;109:20703–20708. doi: 10.1073/pnas.1213460109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Wouterlood FG, et al. Mesencephalic dopamine neurons interfacing the shell of nucleus accumbens and the dorsolateral striatum in the rat. J Neurosci Res. 2018;96:1518–1542. doi: 10.1002/jnr.24242. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Bocklisch C, et al. Cocaine disinhibits dopamine neurons by potentiation of GABA transmission in the ventral tegmental area. Science. 2013;341:1521–1525. doi: 10.1126/science.1237059. [DOI] [PubMed] [Google Scholar]
  • 90.Cardinal R. The contribution of the amygdala, nucleus accumbens, and prefrontal cortex to emotion and motivated behaviour. Int Congr Ser. 2003;1250:347–370. [Google Scholar]
  • 91.Corbit L, Balleine B. The general and outcome-specific forms of Pavlovian-instrumental transfer are differentially mediated by the nucleus accumbens core and shell. J Neurosci. 2011;31:11786–11794. doi: 10.1523/JNEUROSCI.2711-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Murray JE, et al. Basolateral and central amygdala differentially recruit and maintain dorsolateral striatum-dependent cocaine-seeking habits. Nat Commun. 2015;6 doi: 10.1038/ncomms10088. 10088. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Lingawi NW, Balleine BW. Amygdala central nucleus interacts with dorsolateral striatum to regulate the acquisition of habits. J Neurosci. 2012;32:1073–1081. doi: 10.1523/JNEUROSCI.4806-11.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.El-Amamy H, Holland P. Dissociable effects of disconnecting amygdala central nucleus from the ventral tegmental area or substantia nigra on learned orienting and incentive motivation. Eur J Neurosci. 2007;25:1557–1567. doi: 10.1111/j.1460-9568.2007.05402.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Urcelay GP, Jonkman S. Delayed rewards facilitate habit formation. J Exp Psychol Anim Learn Cogn. 2019;45:413–421. doi: 10.1037/xan0000221. [DOI] [PubMed] [Google Scholar]
  • 96.Nelson A, Killcross S. Amphetamine exposure enhances habit formation. J Neurosci. 2006;26:3805–3812. doi: 10.1523/JNEUROSCI.4305-05.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Schmitzer-Torbert N, et al. Post-training cocaine administration facilitates habit learning and requires the infralimbic cortex and dorsolateral striatum. Neurobiol Learn Mem. 2015;118:105–112. doi: 10.1016/j.nlm.2014.11.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 98.Hogarth L, Balleine B, Corbit L, Killcross S. Associative learning mechanisms underpinning the transition from recreational drug use to addiction. Ann N Y Acad Sci. 2013;1282:12–24. doi: 10.1111/j.1749-6632.2012.06768.x. [DOI] [PubMed] [Google Scholar]
  • 99.Dickinson A, Wood N, Smith J. Alcohol seeking by rats: action or habit? Q J Exp Psychol B. 2002;55:331–348. doi: 10.1080/0272499024400016. [DOI] [PubMed] [Google Scholar]
  • 100.Miles F, Everitt B, Dickinson A. Oral cocaine seeking by rats: action or habit? Behav Neurosci. 2003;117:927–938. doi: 10.1037/0735-7044.117.5.927. [DOI] [PubMed] [Google Scholar]
  • 101.Clemens KJ, Castino MR, Cornish JL, Goodchild AK, Holmes NM. Behavioral and neural substrates of habit formation in rats intravenously self-administering nicotine. Neuropsychopharmacology. 2014;39:2584–2593. doi: 10.1038/npp.2014.111. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 102.Malvaez M, Wassum KM. Regulation of habit formation in the dorsal striatum. Opin Behav Sci. 2018;20:67–74. doi: 10.1016/j.cobeha.2017.11.005. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 103.Schoenbaum G, Shaham Y. The role of orbitofrontal cortex in drug addiction: a review of preclinical studies. Biol Psychiatry. 2008;63:256–262. doi: 10.1016/j.biopsych.2007.06.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 104.Hampson R, Porrino L, Opris I, Stanford T, Deadwyler S. Effects of cocaine rewards on neural representations of cognitive demand in nonhuman primates. Psychopharmacology. 2010;213:105–118. doi: 10.1007/s00213-010-2017-2. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 105.Crombag H, Gorny G, Li Y, Kolb B, Robinson T. Opposite effects of amphetamine self-administration experience on dendritic spines in the medial and orbital prefrontal cortex. Cereb Cortex. 2005;15:341–348. doi: 10.1093/cercor/bhh136. [DOI] [PubMed] [Google Scholar]
  • 106.Morein-Zamir S, Robbins TW. Fronto-striatal circuits in response-inhibition: relevance to addiction. Brain Res. 2014;21:488–497. doi: 10.1016/j.brainres.2014.09.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 107.Rogers RD, et al. Dissociable deficits in the decision-making cognition of chronic amphetamine abusers, opiate abusers, patients with focal damage to prefrontal cortex, and tryptophan-depleted normal volunteers: evidence for monoaminergic mechanisms. Neuropsychopharmacology. 1999;20:322–339. doi: 10.1016/S0893-133X(98)00091-8. [DOI] [PubMed] [Google Scholar]
  • 108.Bechara A, et al. Decision-making deficits, linked to a dysfunctional ventromedial prefrontal cortex, revealed in alcohol and stimulant abusers. Neuropsychologia. 2001;39:376–389. doi: 10.1016/s0028-3932(00)00136-6. [DOI] [PubMed] [Google Scholar]
  • 109.Hardwick RM, Forrence AD, Krakauer JW, Haith AM. Time-dependent competition between goal-directed and habitual response preparation. Nat Hum Behav. 2019;3:1252–1262. doi: 10.1038/s41562-019-0725-0. [DOI] [PubMed] [Google Scholar]
  • 110.Jonkman S, Pelloux Y, Everitt BJ. Drug intake is sufficient, but conditioning is not necessary for the emergence of compulsive cocaine seeking after extended self-administration. Neuropsychopharmacology. 2012;37:1612–1619. doi: 10.1038/npp.2012.6. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 111.Belin D, Mar AC, Dalley JW, Robbins TW, Everitt BJ. High impulsivity predicts the switch to compulsive cocaine-taking. Science. 2008;320:1352–1355. doi: 10.1126/science.1158136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 112.Belin D, Belin-Rauscent A, Everitt BJ, Dalley JW. In search of predictive endophenotypes in addiction: insights from preclinical research. Genes Brain Behav. 2016;15:74–88. doi: 10.1111/gbb.12265. [DOI] [PubMed] [Google Scholar]
  • 113.Marchant NJ, Campbell EJ, Kaganovsky K. Punishment of alcohol-reinforced responding in alcohol preferring P rats reveals a bimodal population: Implications for models of compulsive drug seeking. Prog Neuropsychopharmacol Biol Psychiatry. 2018;87:68–77. doi: 10.1016/j.pnpbp.2017.07.020. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 114.Pelloux Y, Murray JE, Everitt BJ. Differential roles of the prefrontal cortical subregions and basolateral amygdala in compulsive cocaine seeking and relapse after voluntary abstinence in rats. Eur J Neurosci. 2013;38:3018–3026. doi: 10.1111/ejn.12289. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115.Zapata A, Hwang EK, Lupica CR. lateral habenula involvement in impulsive cocaine seeking. Neuropsychopharmacology. 2017;42:1103–1112. doi: 10.1038/npp.2016.286. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116.Matsumoto M, Hikosaka O. Lateral habenula as a source of negative reward signals in dopamine neurons. Nature. 2007;447:1111–1115. doi: 10.1038/nature05860. [DOI] [PubMed] [Google Scholar]
  • 117.Chen B, et al. Rescuing cocaine-induced prefrontal cortex hypoactivity prevents compulsive cocaine seeking. Nature. 2013;496:359–362. doi: 10.1038/nature12024. [DOI] [PubMed] [Google Scholar]
  • 118.Pelloux Y, Everitt BJ, Dickinson A. Compulsive drug seeking by rats under punishment: effects of drug taking history. Psychopharmacology. 2007;194:127–137. doi: 10.1007/s00213-007-0805-0. [DOI] [PubMed] [Google Scholar]
  • 119.Fan XC, et al. Hypersensitivity of prelimbic cortex neurons contributes to aggravated nociceptive responses in rats with experience of chronic inflammatory pain. Front Mol Neurosci. 2018;11:85. doi: 10.3389/fnmol.2018.00085. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 120.Jonkman S, Pelloux Y, Everitt BJ. Differential roles of the dorsolateral and midlateral striatum in punished cocaine seeking. J Neurosci. 2012;32:4645–4650. doi: 10.1523/JNEUROSCI.0348-12.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 121.Renteria R, Baltz ET, Gremel CM. Chronic alcohol exposure disrupts top-down control over basal ganglia action selection to produce habits. Nat Commun. 2018;9:211. doi: 10.1038/s41467-017-02615-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 122.Harada M, Hiver A, Pascoli V, Lüscher C. Cortico-striatal synaptic plasticity underlying compulsive reward seeking. bioRxiv. 2019 doi: 10.1101/789495. [DOI] [Google Scholar]
  • 123.Rogers RD, Robbins T. Investigating the neurocognitive deficits associated with chronic drug misuse. Curr Opin Neurobiol. 2001;11:250–257. doi: 10.1016/s0959-4388(00)00204-x. [DOI] [PubMed] [Google Scholar]
  • 124.Belin D, Belin-Rauscent A, Murray JE, Everitt BJ. Addiction: failure of control over maladaptive habits. Curr Opin Biol. 2013;23:564–572. doi: 10.1016/j.conb.2013.01.025. [DOI] [PubMed] [Google Scholar]
  • 125.Ahmed SH. Trying to make sense of rodents’ drug choice behavior. Neuropsychopharmacol Biol Psychiatry. 2018;87:3–10. doi: 10.1016/j.pnpbp.2017.09.027. [DOI] [PubMed] [Google Scholar]
  • 126.Cantin L, et al. Cocaine is low on the value ladder of rats: possible evidence for resilience to addiction. PLoS One. 2010;5:e11592. doi: 10.1371/journal.pone.0011592. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 127.Higgins ST, Heil SH, Lussier JP. Clinical implications of reinforcement as a determinant of substance use disorders. Annu Rev Psychol. 2004;55:431–461. doi: 10.1146/annurev.psych.55.090902.142033. [DOI] [PubMed] [Google Scholar]
  • 128.Davis DR, et al. A review of the literature on contingency management in the treatment of substance use disorders, 2009-2014. Prev Med. 2016;92:36–46. doi: 10.1016/j.ypmed.2016.08.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 129.Hogarth L, et al. Intact goal-directed control in treatment-seeking drug users indexed by outcome-devaluation and Pavlovian to instrumental transfer: critique of habit theory. Eur J Neurosci. 2019;50:2513–2525. doi: 10.1111/ejn.13961. [DOI] [PubMed] [Google Scholar]
  • 130.Momennejad I, et al. The successor representation in human reinforcement learning. Nat Hum Behav. 2017;1:680–692. doi: 10.1038/s41562-017-0180-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 131.Singer BF, Fadanelli M, Kawa AB, Robinson TE. Are cocaine-seeking “habits” necessary for the development of addiction-like behavior in rats? J Neurosci. 2018;38:60–73. doi: 10.1523/JNEUROSCI.2458-17.2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 132.Colwill RM, Rescorla RA. Associative structures in instrumental learning. Psychol Learn Motiv. 1986;20:55–104. [Google Scholar]
  • 133.Holland P. Relations between Pavlovian-instrumental transfer and reinforcer devaluation. J Exp Psychol Anim Behav Process. 2004;30:104–117. doi: 10.1037/0097-7403.30.2.104. [DOI] [PubMed] [Google Scholar]
  • 134.Kosaki Y, Dickinson A. Choice and contingency in the development of behavioral autonomy during instrumental conditioning. J Exp Psychol Anim Behav Process. 2010;36:334–342. doi: 10.1037/a0016887. [DOI] [PubMed] [Google Scholar]
  • 135.Weiss F, Markou A, Lorang MT, Koob G. Basal extracellular dopamine levels in the nucleus accumbens are decreased during cocaine withdrawal after unlimited-access self-administration. Brain Res. 1992;593:314–318. doi: 10.1016/0006-8993(92)91327-b. [DOI] [PubMed] [Google Scholar]
  • 136.Mateo Y, Lack C, Morgan D, Roberts D, Jones S. Reduced dopamine terminal function and insensitivity to cocaine following cocaine binge self-administration and deprivation. Neuropsychopharmacology. 2005;30:1455–1463. doi: 10.1038/sj.npp.1300687. [DOI] [PubMed] [Google Scholar]
  • 137.Volkow ND, Fowler JS. Addiction, a disease of compulsion and drive: involvement of the orbitofrontal cortex. Cereb Cortex. 2000;10:318–325. doi: 10.1093/cercor/10.3.318. [DOI] [PubMed] [Google Scholar]
  • 138.Pelloux Y, Baunez C. Deep brain stimulation for addiction: why the subthalamic nucleus should be favored. Curr Opin Neurobiol. 2013;23:713–720. doi: 10.1016/j.conb.2013.02.016. [DOI] [PubMed] [Google Scholar]
  • 139.Terraneo A, et al. Transcranial magnetic stimulation of dorsolateral prefrontal cortex reduces cocaine use: a pilot study. Eur Neuropsychopharmacol. 2016;26:37–44. doi: 10.1016/j.euroneuro.2015.11.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 140.Weeks JR. Experimental morphine addiction: method for automatic intravenous injections in unrestrained rats. Science. 1962;138:143–144. doi: 10.1126/science.138.3537.143. [DOI] [PubMed] [Google Scholar]
  • 141.Dickinson A. Conditioning and associative learning. Br Med Bull. 1981;37:165–168. doi: 10.1093/oxfordjournals.bmb.a071695. [DOI] [PubMed] [Google Scholar]
  • 142.Wise RA. Role of brain dopamine in food reward and reinforcement. Philos Trans R Soc Lond B Biol Sci. 2006;361:1149–1158. doi: 10.1098/rstb.2006.1854. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 143.Koob GF. New dimensions in human laboratory models of addiction. Addict Biol. 2009;14:1–8. doi: 10.1111/j.1369-1600.2008.00127.x. [DOI] [PubMed] [Google Scholar]
  • 144.Ahmed SH, Koob G. Transition from moderate to excessive drug intake: change in hedonic set point. Science. 1998;282:298–300. doi: 10.1126/science.282.5387.298. [DOI] [PubMed] [Google Scholar]
  • 145.Wolffgramm J. An ethopharmacological approach to the development of drug addiction. Neurosci Biobehav Rev. 1991;15:515–519. doi: 10.1016/s0149-7634(05)80142-3. [DOI] [PubMed] [Google Scholar]
  • 146.Ahmed SH, Koob G. Long-lasting increase in the set point for cocaine self-administration after escalation in rats. Psychopharmacology. 1999;146:303–312. doi: 10.1007/s002130051121. [DOI] [PubMed] [Google Scholar]
  • 147.O’Dell LE, Roberts AJ, Smith RT, Koob GF. Enhanced alcohol self-administration after intermittent versus continuous alcohol vapor exposure. Alcohol Clin Exp Res. 2004;28:1676–1682. doi: 10.1097/01.alc.0000145781.11923.4e. [DOI] [PubMed] [Google Scholar]
  • 148.Paterson NE, Markou A. Increased motivation for self-administered cocaine after escalated cocaine intake. Neuroreport. 2003;14:2229–2232. doi: 10.1097/00001756-200312020-00019. [DOI] [PubMed] [Google Scholar]
  • 149.Koob G. Negative reinforcement in drug addiction: the darkness within. Curr Opin Neurobiol. 2013;23:559–563. doi: 10.1016/j.conb.2013.03.011. [DOI] [PubMed] [Google Scholar]
  • 150.Kawa AB, Bentzley BS, Robinson TE. Less is more: prolonged intermittent access cocaine self-administration produces incentive-sensitization and addiction-like behavior. Psychopharmacology. 2016;233:3587–3602. doi: 10.1007/s00213-016-4393-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 151.Lesscher H, Vanderschuren L. Compulsive drug use and its neural substrates. Rev Neurosci. 2012;23:731–745. doi: 10.1515/revneuro-2012-0066. [DOI] [PubMed] [Google Scholar]
  • 152.Vengeliene V, Celerier E, Chaskiel L, Penzo F, Spanagel R. Compulsive alcohol drinking in rodents. Addict Biol. 2009;14:384–396. doi: 10.1111/j.1369-1600.2009.00177.x. [DOI] [PubMed] [Google Scholar]
  • 153.Tornatzky W, Miczek KA. Cocaine self-administration “binges”: transition from behavioral and autonomic regulation toward homeostatic dysregulation in rats. Psychopharmacology. 2000;148:289–298. doi: 10.1007/s002130050053. [DOI] [PubMed] [Google Scholar]
  • 154.Belin D, Balado E, Piazza P, Deroche-Gamonet V. Pattern of intake and drug craving predict the development of cocaine addiction-like behavior in rats. Biol Psychiatry. 2009;65:863–868. doi: 10.1016/j.biopsych.2008.05.031. [DOI] [PubMed] [Google Scholar]
  • 155.Seif T, et al. Cortical activation of accumbens hyperpolarization-active NMDARs mediates aversion-resistant alcohol intake. Nat Neurosci. 2013;16:1094–1100. doi: 10.1038/nn.3445. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 156.Torres OV, et al. Compulsive methamphetamine taking under punishment is associated with greater cue-induced drug seeking in rats. Behav Brain Res. 2017;326:265–271. doi: 10.1016/j.bbr.2017.03.009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 157.Belin D, Deroche-Gamonet V, Jaber M. Cocaine-induced sensitization is associated with altered dynamics of transcriptional responses of the dopamine transporter, tyrosine hydroxylase, and dopamine D2 receptors in C57Bl/6J mice. Psychopharmacology. 2007;193:567–578. doi: 10.1007/s00213-007-0790-3. [DOI] [PubMed] [Google Scholar]
  • 158.Olmstead M, Lafond M, Everitt BJ, Dickinson A. Cocaine seeking by rats is a goal-directed action. Behav Neurosci. 2001;115:394–402. [PubMed] [Google Scholar]
  • 159.Olmstead M, Parkinson J, Miles F, Everitt BJ, Dickinson A. Cocaine-seeking by rats: regulation, reinforcement and activation. Psychopharmacology. 2000;152:123–131. doi: 10.1007/s002130000498. [DOI] [PubMed] [Google Scholar]
  • 160.Goldberg SR. Comparable behavior maintained under fixed-ratio and second-order schedules of food presentation, cocaine injection or d-amphetamine injection in the squirrel monkey. J Pharmacol Exp Ther. 1973;186:18–30. [PubMed] [Google Scholar]
  • 161.Everitt BJ, Robbins TW. Second-order schedules of drug reinforcement in rats and monkeys: measurement of reinforcing efficacy and drug-seeking behaviour. Psychopharmacology. 2000;153:17–30. doi: 10.1007/s002130000566. [DOI] [PubMed] [Google Scholar]
  • 162.Giuliano C, et al. Evidence for a long-lasting compulsive alcohol seeking phenotype in rats. Neuropsychopharmacology. 2018;43:728–738. doi: 10.1038/npp.2017.105. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 163.Everitt BJ. Neural and psychological mechanisms underlying compulsive drug seeking habits and drug memories-indications for novel treatments of addiction. Eur J Neurosci. 2014;40:2163–2182. doi: 10.1111/ejn.12644. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 164.Grimm JW, Hope BT, Wise RA, Shaham Y. Incubation of cocaine craving after withdrawal. Nature. 2001;412:141–142. doi: 10.1038/35084134. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 165.Venniro M, Caprioli D, Shaham Y. Animal models of drug relapse and craving: from drug priming-induced reinstatement to incubation of craving after voluntary abstinence. Prog Brain Res. 2016;224:25–52. doi: 10.1016/bs.pbr.2015.08.004. [DOI] [PubMed] [Google Scholar]
  • 166.Wolf ME. Synaptic mechanisms underlying persistent cocaine craving. Nat Rev Neurosci. 2016;17:351–365. doi: 10.1038/nrn.2016.39. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 167.Kalivas PW, McFarland K. Brain circuitry and the reinstatement of cocaine-seeking behavior. Psychopharmacology. 2003;168:44–56. doi: 10.1007/s00213-003-1393-2. [DOI] [PubMed] [Google Scholar]
  • 168.Szumlinski K, et al. Homer isoforms differentially regulate cocaine-induced neuroplasticity. Neuropsychopharmacology. 2006;31:768–777. doi: 10.1038/sj.npp.1300890. [DOI] [PubMed] [Google Scholar]
  • 169.Kalivas PW. The glutamate homeostasis hypothesis of addiction. Nat Rev Neurosci. 2009;10:561–572. doi: 10.1038/nrn2515. [DOI] [PubMed] [Google Scholar]
  • 170.Vanderschuren LJ, Everitt BJ. Drug seeking becomes compulsive after prolonged cocaine self-administration. Science. 2004;305:1017–1019. doi: 10.1126/science.1098975. [DOI] [PubMed] [Google Scholar]
  • 171.Ostlund S, Balleine B. Differential involvement of the basolateral amygdala and mediodorsal thalamus in instrumental action selection. J Neurosci. 2008;28:4398–4405. doi: 10.1523/JNEUROSCI.5472-07.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 172.Cardinal R, Parkinson JA, Hall J, Everitt BJ. Emotion and motivation: the role of the amygdala, ventral striatum, and prefrontal cortex. Neurosci Biobehav Rev. 2002;26:321–352. doi: 10.1016/s0149-7634(02)00007-6. [DOI] [PubMed] [Google Scholar]
  • 173.Kuhn BN, Campus P, Flagel SB. The Neurobiological Mechanisms Underlying Sign-Tracking Behavior. Michigan Publishing; 2018. [Google Scholar]
  • 174.Yin HH, Ostlund SB, Knowlton BJ, Balleine BW. The role of the dorsomedial striatum in instrumental conditioning. Eur J Neurosci. 2005;22:513–523. doi: 10.1111/j.1460-9568.2005.04218.x. [DOI] [PubMed] [Google Scholar]
  • 175.Hart G, Bradfield LA, Fok SY, Chieng B, Balleine BW. The bilateral prefronto-striatal pathway is necessary for learning new goal-directed actions. Curr Biol. 2018;28:2218–2229 e7. doi: 10.1016/j.cub.2018.05.028. [DOI] [PubMed] [Google Scholar]
  • 176.Balleine B, Leung B, Ostlund S. The orbitofrontal cortex, predicted value, and choice. Ann N Y Acad Sci. 2011;1239:43–50. doi: 10.1111/j.1749-6632.2011.06270.x. [DOI] [PubMed] [Google Scholar]
  • 177.Parkes SL, et al. Insular and ventrolateral orbitofrontal cortices differentially contribute to goal-directed behavior in rodents. Cereb Cortex. 2018;28:2313–2325. doi: 10.1093/cercor/bhx132. [DOI] [PubMed] [Google Scholar]
  • 178.Bradfield LA, Dezfouli A, van Holstein M, Chieng B, Balleine BW. Medial orbitofrontal cortex mediates outcome retrieval in partially observable task situations. Neuron. 2015;88:1268–1280. doi: 10.1016/j.neuron.2015.10.044. [DOI] [PubMed] [Google Scholar]
  • 179.Murray EA, Rudebeck PH. Specializations for reward-guided decision-making in the primate ventral prefrontal cortex. Nat Rev Neurosci. 2018;19:404–417. doi: 10.1038/s41583-018-0013-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 180.Dickinson A. Actions and habits: the development of behavioural autonomy. Philos Trans R Soc Lond B. 1985;308:67–78. [Google Scholar]
  • 181.Wood W, Runger D. Psychology of habit. Annu Rev Psychol. 2016;67:289–314. doi: 10.1146/annurev-psych-122414-033417. [DOI] [PubMed] [Google Scholar]
  • 182.Yin HH, Knowlton BJ, Balleine BW. Lesions of dorsolateral striatum preserve outcome expectancy but disrupt habit formation in instrumental learning. Eur J Neurosci. 2004;19:181–189. doi: 10.1111/j.1460-9568.2004.03095.x. [DOI] [PubMed] [Google Scholar]
  • 183.Killcross S, Coutureau E. Coordination of actions and habits in the medial prefrontal cortex of rats. Cereb Cortex. 2003;13:400–408. doi: 10.1093/cercor/13.4.400. [DOI] [PubMed] [Google Scholar]
  • 184.Smith KS, Virkud A, Deisseroth K, Graybiel AM. Reversible online control of habitual behavior by optogenetic perturbation of medial prefrontal cortex. Proc Natl Acad Sci USA. 2012;109:18932–18937. doi: 10.1073/pnas.1216264109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 185.Smith KS, Graybiel AM. A dual operator view of habitual behavior reflecting cortical and striatal dynamics. Neuron. 2013;79:361–374. doi: 10.1016/j.neuron.2013.05.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 186.Cardinal RN, Cheung TH. Nucleus accumbens core lesions retard instrumental learning and performance with delayed reinforcement in the rat. BMC Neurosci. 2005;6:9. doi: 10.1186/1471-2202-6-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 187.Parkinson JA, Olmstead MC, Burns LH, Robbins TW, Everitt BJ. Dissociation in effects of lesions of the nucleus accumbens core and shell on appetitive pavlovian approach behavior and the potentiation of conditioned reinforcement and locomotor activity by D-amphetamine. J Neurosci. 1999;19:2401–2411. doi: 10.1523/JNEUROSCI.19-06-02401.1999. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 188.Heymann G, et al. Synergy of distinct dopamine projection populations in behavioral reinforcement. Neuron. 2019;105:909–920 e5. doi: 10.1016/j.neuron.2019.11.024. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES