Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 May 26.
Published in final edited form as: Addict Behav. 2022 Dec 22;140:107595. doi: 10.1016/j.addbeh.2022.107595

Computational models of behavioral addictions: state of the art and future directions

Ayaka Kato 1,2, Kanji Shimomura 3, Dimitri Ognibene 4,5, Muhammad A Parvaz 6, Laura A Berner 7,8, Kenji Morita 3,9, Vincenzo G Fiore 8,*
PMCID: PMC10214928  NIHMSID: NIHMS1898474  PMID: 36621045

Abstract

Non-pharmacological behavioral addictions, such as pathological gambling, videogaming, social networking, or internet use, are becoming major public health concerns. It is not yet clear how behavioral addictions could share many major neurobiological and behavioral characteristics with substance use disorders, despite the absence of direct pharmacological influences. A deeper understanding of the neurocognitive mechanisms of addictive behavior is needed, and computational modeling could be one promising approach to explain intricately entwined cognitive and neural dynamics. This review describes computational models of addiction based on reinforcement learning algorithms, Bayesian inference, and biophysical neural simulations. We discuss whether computational frameworks originally conceived to explain maladaptive behavior in substance use disorders can be effectively extended to non-substance-related behavioral addictions. Moreover, we introduce recent studies on behavioral addictions that exemplify the possibility of such extension and propose future directions.

Keywords: computational modeling, model-based, model-free, reinforcement learning, Bayesian active inference, neural models, neural simulations

Introduction

Psychobiological and neurocomputational investigations in addictive disorders have largely focused on the effects of substances of abuse on neural dynamics, cognitive processes and behavior (cf. reviews: Everitt & Robbins, 2016; Koob & Volkow, 2016; Mollick & Kober, 2020; Redish, Jensen, & Johnson, 2008; Smith, Taylor, & Bilek, 2021). However, recent studies strongly suggest that non-pharmacological behavioral addictions share with substance use disorders key neurobiological (Antons, Brand, & Potenza, 2020; Potenza, 2013), computational (Lindstrom et al., 2021; Ognibene, Fiore, & Gu, 2019; Redish, Jensen, Johnson, & Kurth-Nelson, 2007; Shimomura, Kato, & Morita, 2021), and behavioral features (Grant & Chamberlain, 2014; Grant, Potenza, Weinstein, & Gorelick, 2010). These include widely accepted behavioral addictions such as pathological gambling (el-Guebaly, Mudry, Zohar, Tavares, & Potenza, 2012), as well as others on which the consensus is still forming, such as videogaming (Petry & O'Brien, 2013; Yao, Potenza, & Zhang, 2017), social network or internet addiction (Jorgenson, Hsiao, & Yen, 2016; Veisani, Jalilian, & Mohamadian, 2020), compulsive buying (Granero et al., 2016; Grant et al., 2010), compulsive sexual behavior or pornography addiction (Griffiths, 2016; Love, Laier, Brand, Hatch, & Hajela, 2015) and finally, more controversial, disordered eating behaviors such as binge eating (Wiss, Avena, & Gold, 2020; Wiss, Avena, & Rada, 2018; Wiss, Criscitelli, Gold, & Avena, 2017).

In this review, we cast a wide net relying on an inclusive definition of addictions: a relapsing, chronic disorder characterized by an initial pursuit of a desired outcome that leads to the inflexible repetition of maladaptive behaviors, despite the harmful consequences (Everitt & Robbins, 2016; Koob & Volkow, 2016). This definition highlights two complementary elements of behavioral and cognitive control in addictions. First, it emphasizes a transition from reinforcing action-outcome associations to compulsive stimulus-responses, i.e., from goal-oriented to habitual behavior (Ersche et al., 2016; Everitt & Robbins, 2013; Volkow & Morales, 2015). In other words, an ‘urge’ to respond to a reinforced cue is triggered irrespective of an actual desire for the outcome (cf. 'need' vs 'want', Berridge & Robinson, 2016) or any assessment about desired future environment or body states (cf. 'model-free control', Dolan & Dayan, 2013). Second, the chronic and relapsing elements of the definition assign an important role to an underperforming goal-oriented behavior and forward planning (or 'model-based control', cf. Dolan & Dayan, 2013), possibly due to an incomplete, incorrect, or otherwise impaired belief structure or internal model of both environment and body states. For instance, incorrect representations of future positive and negative interoceptive outcomes can lead to craving (Grimm, Hope, Wise, & Shaham, 2001; Gu & Filbey, 2017), often followed by the reinstatement of the addictive behavior (relapse), even after prolonged periods of abstinence.

Here we consider computational models of addiction based on reinforcement learning algorithms, Bayesian inference and biophysical neural simulations, with a focus on ‘model-free’ and ‘model-based’ aberrant control. We discuss whether computational models originally conceived to describe substance use disorders could be validly extended to behavioral addictions and we present computational models that have been specifically developed to describe maladaptive behaviors in behavioral addictions.

Reinforcement learning models

Reinforcement learning (RL, Sutton & Barto, 1998) is the dominant approach for modeling addictive behaviors (CPSYMAP, Kato, Kunisato, Katahira, Okimura, & Yamashita, 2020). In RL, a behavioral policy determines one’s actions at each state, resulting in state transitions that can yield positive or negative outcomes, so allowing an agent (e.g., a person or an animal) to pursue total reward maximization with temporal discounting. In model-based RL, agents use an internal model of the environment (i.e., representations of transitions and rewards) to estimate the values of behavioral policies and plan a course of action-state transitions. By contrast, in model-free RL, agents estimate immediately available state/action values, typically through updating them by using reward prediction errors (RPEs, Watkins & Dayan, 1992). As a result, model-based RL is computationally costly yet flexible to changes in the environment, once these are represented in the internal model, whereas model-free RL is computationally parsimonious yet characterized by low flexibility (Strehl, Li, & Littman, 2009). Within this perspective, addictions are described as a dysfunction of value-based behavior that affects both model-free and model-based control modalities (Redish et al., 2008).

Based on the understanding that substances of abuse interfere with dopamine signals, which have been suggested to encode RPEs in biological agents (Schultz, Dayan, & Montague, 1997; see section: Neural models), a seminal model (Redish, 2004) proposed that drugs of abuse could act as fictitious RPEs. Due to their exogenous cause, these RPEs cannot be canceled out by reward predictions, so that the estimated value of drug indefinitely increases. Other models have proposed that enhanced RPEs may cause a decrement in the reward system sensitivity (Dezfouli et al., 2009), resistance to habituation (Di Chiara, 1999), or sensitization to reinforced cues (Bernheim & Rangel, 2004), and might accumulate through hierarchical decision-making processes (Keramati & Gutkin, 2013). Any of these dysfunctions would promote overwhelming biases towards the model-free control, driving the compulsive and inflexible selection of addictive behaviors, irrespective of the negative outcomes (e.g., detrimental effects for one’s health or social relations). These models assume that over-reliance on model-free control is caused by dopamine-related, drug-induced, alterations in healthy neural circuit dynamics (Koob & Volkow, 2016; Korpi et al., 2015; Luscher & Malenka, 2011). However, aberrant functioning in the neural regions involved in reward processing have been also described in association with compulsive use of pornography (Hilton, 2013), compulsive sexual behavior (Gola et al., 2017; Golec, Draps, Stark, Pluta, & Gola, 2021), compulsive buying behavior (Granero et al., 2016), eating disorders (Baik, 2013; G. K. W. Frank, Shott, Stoddard, Swindle, & Pryor, 2021; Wiss et al., 2018), problematic videogaming (Palaus, Marron, Viejo-Sobera, & Redolar-Ripoll, 2017), and internet addiction (Love et al., 2015). These findings indicate that aberrant RPE signals putatively responsible for the over-reliance on model-free, at the expenses of model-based, control can be elicited in the absence of pharmacological manipulation, e.g., due to predisposing factors (Antons et al., 2020), suggesting the computational models based on these mechanisms can be used to describe behavioral addictions, as well. For instance, a bias towards model-free control has been reported in binge eating disorder using a task designed to highlight model-free vs model-based arbitration (Voon et al., 2015). In another RL model tackling social network use, participants showed high sensitivity to social rewards and reliance on RPE updates to determine their post sharing policies (Lindstrom et al., 2021). However, formal testing with computational modeling is still very sparce across behavioral addictions.

Interestingly, RL models focusing only on aberrant model-free control underperform when trying to account for those behaviors in addiction that are not cue-induced, e.g., because they are novel, complex, or context-dependent. To fill this gap, several models have proposed to include addiction-related dysfunctions in the model-based control component, so focusing on the generation, update, and recall of state-action-state transitions. Crucially, although dopamine is suggested to be also involved in model-based control (Deserno et al., 2015; Wunderlich, Smittenaar, & Dolan, 2012), aberrant RPEs are not directly considered in the computational models focusing on the dysfunctions of model-based control, rendering irrelevant the issue of whether behavioral addictions can develop in the absence of the drug-based manipulation of the brain reward system. Model-based control dysfunctions fall into a few categories: forward planning malfunctions (Redish & Johnson, 2007), incomplete representations in the internal model (Redish et al., 2007), or incomplete access to the internal model during recall (Simon & Daw, 2012) . In other words, this new class of RL models focuses on one’s internal representation of the environment, showing that incomplete or incorrect representations of state transitions (or mental forward explorations of these transitions) can drive addiction-like suboptimal goal selections, planning and ultimately behaviors. This approach, which changes the focus of investigation from the generation of a habitual response to an impaired ability to plan and select goals, was used in several models to account for behavioral addictions.

One study (Redish et al., 2007) simulated state misrepresentation in pathological gambling. While gambling, one can experience big wins and subsequent losses, developing a (mis)recognition that there is a state associated with wins and a different state associated with losses, thus misrepresenting the same state as two different states. Then, negative RPEs caused by losses would not attenuate a large positive value of the state associated with wins, and this misassignment of credit due to the discrepancy between the actual environment and its internal representation could lead to pathological gambling and relapse. Another proposal addresses the issue of forward planning malfunctions by focusing on the relation between environment complexity and cognitive resources available to the agent (Fiore, Ognibene, Adinoff, & Gu, 2018; Ognibene et al., 2019). In this case, a mismatch between resources available and those required results in repetitive suboptimal behavioral policies, reducing the sampling of contingencies in the environment and escalating the exploration cost across phenotypes characterized by different model-based and model-free control balance. Thus, addiction-like behaviors can emerge in agents with bounded model-based resources, also inducing an inadequate representation of the environment and irrespective of RPE malfunctions. Another study (Shimomura et al., 2021) developed a model that relied on the "successor representation" (SR) of states (Dayan, 1993), a process suggested to be used by humans (Momennejad et al., 2017; Russek, Momennejad, Botvinick, Gershman, & Daw, 2017). In SR encoding for a given policy, states are similar if they give access to similar sets of states, e.g., two doors leading to the same reward room would have similar SR representations. Formally, a state is represented by a matrix of expected cumulative discounted future state occupancies under a certain policy. The matrix that describes the relationship among states enables partially model-based behavior through model-free RL-like RPE-based value update. Shimomura et al. (2021) proposed that: 1) through a long-standing reward-obtaining behavior, one potentially establishes “dimension-reduced SR”, and (2) the reduced SR can become rigid. Under such a rigid and dimension-reduced SR, a sustained large positive RPE is generated at the state with reward due to the inaccurate value approximations caused by inadequate state representations, irrespective of any pharmacological manipulation, potentially enhancing reward-obtaining behavior. Moreover, negative outcomes occurring after the addictive positive reward cannot induce changes in behaviors, under the rigid reduced SR.

All these models (Ognibene et al., 2019; Redish et al., 2007; Shimomura et al., 2021) are theoretically applicable across behavioral addictions, irrespective of the type of reward. Potentially related to the environment exploration and representation dysfunctions, studies have indicated that a key factor in the development of behavioral addictions such as problematic gambling, videogaming, shopping, or social network use can be found in the complex (i.e., difficult to compute and predict) organization of rewards experienced on a variable ratio reinforcement schedule (Cash, Rae, Steel, & Winkler, 2012; Greenberg, Zhai, Hoff, Krishnan-Sarin, & Potenza, 2022; Young & Abreu, 2011). Another study has shown that gamblers are characterized by reduction in directed (uncertainty-based) exploration and not in random exploration compared with healthy controls (Wiehler, Chakroun, & Peters, 2021). Finally, a deficit in the exploration-exploitation balance has been also suggested for binge-eating disorder (Reiter, Heinze, Schlagenhauf, & Deserno, 2017). However, the mechanisms proposed in these models have not yet been directly tested in ad hoc experiments.

Bayesian and active inference models

Computational models based on Bayesian inference suggest that the brain computes probability distributions associating states, actions and events or outcomes (whether value-based or not). These probability distributions, termed as prior beliefs or priors, are updated into posterior beliefs or posteriors, relying on a signal of precision in prediction error, i.e., the dopamine-encoded discrepancy between one’s priors and actual state-action-outcome observations (Friston et al., 2012). This is a relatively new approach in comparison with RL, with a comparably smaller literature in terms of models of addictive behaviors. Current analyses carried out in relation with substance use disorders have highlighted slower belief updating and related behavioral adaptation, as in perseverative habitual responses (Ide, Hu, Zhang, Yu, & Li, 2015) and reduced ability to use environment representations to guide choice selections, as in over-reliance on model-free control (Harle, Zhang, et al., 2015). Other studies have highlighted the relation between neural responses evoked by non-value based prediction errors and likelihood to relapse (Harle, Stewart, et al., 2015; Harle, Yu, & Paulus, 2019), suggesting that the aberrant RPEs described within the RL framework might be part of a more generalized dysfunction across all prediction errors. One further study (Schwöbel, Markovic, Smolka, & Kiebel, 2021) has proposed that context inference may play a key role in substance use disoders, in a mechanism analogous to context-based RL (Redish et al., 2007). An advantage that Bayesian inference models have on RL algorithms is that they estimate the computational processes underlying belief updates, irrespective of rewards, therefore allowing their seamless use across addictive behaviors. One example is provided by a recent investigation into instrumental learning in bulimia nervosa (Berner et al., in press), in which a Bayesian observer model highlighted slow belief updates and associated behavioral rigidity, consistent with similar investigations in substance use disorders (Ide et al., 2015). Further investigations using Bayesian inference models are required to reveal whether these mechanisms can be found across behavioral addictions and highlight shared computational mechanisms with substance use disorders.

Among the theories based on Bayesian inference, active inference (Friston, 2013; Friston et al., 2015) has emerged to describe behaviors as the result of the minimization of dopamine-encoded prediction error (Friston et al., 2012). This theory has been successful in accounting for a wide range of physiological and behavioral phenomena, including substance use disorders (Smith, Taylor, et al., 2021). Similar to the described effects of over-reinforcement of addiction-related cues, resulting in an over-reliance on model-free control, active inference explains compulsive behavior in addiction in terms of excessive prediction error signaling. This in turn, results in excessive belief confidence (precision), characterized as narrow distributions for the priors (Kinley, Amlung, & Becker, 2022). Such distributions make it more likely to repeat the choice associated with the addictive behaviors (Miller, Kiverstein, & Rietveld, 2020; Schwartenbeck et al., 2015) and at the same time prevent further updates, e.g., to include negative outcomes or interoceptive signals, contributing to craving (Gu, 2018; Gu & Filbey, 2017). Consistent with this hypothesis, increased belief updating in association with drug related positive values, and a reduced sensitivity to negative outcomes has been described across substance use disorders (Smith, Kirlic, Stewart, Touthang, Kuplicki, Khalsa, et al., 2021; Smith et al., 2020). This hypothesis has not been formally tested in behavioral addictions, yet. However, the already discussed ubiquitous presence of both reward processing dysfunctions and aberrant rewards in association with behavioral addictions (Baik, 2013; G. K. W. Frank et al., 2021; Gola et al., 2017; Golec et al., 2021; Granero et al., 2016; Hilton, 2013; Love et al., 2015; Palaus et al., 2017; Wiss et al., 2018) once again suggests that a process triggered by excessive prediction error signaling will be found also across compulsive behaviors associated with behavioral addictions.

Active inference also describes model-based dysfunctions as affecting the formation or recall of the structure of priors characterizing an internal model. Different models have investigated aberrant forward planning in terms of low confidence in future outcomes (Schwartenbeck et al., 2015), reduced precision in the state transition matrix (Fradkin, Adams, Parr, Roiser, & Huppert, 2020), reduced confidence in the generated internal model of the environment (Smith, Kirlic, Stewart, Touthang, Kuplicki, Khalsa, et al., 2021), or a reduced ability to generate deep policies (Mirza, Adams, Parr, & Friston, 2019). These deficits in turn generate the belief that events projected in the future, when computable, are characterized by uncertainty and unpredictability, and this bias seems to remain stable in the long term (Smith, Kirlic, Stewart, Touthang, Kuplicki, McDermott, et al., 2021). Therefore, immediate, precise, and easy to compute (and to predict) rewards are, once again, preferred, in a process analogous to the dysfunctions associated with model-based control for the RL framework. We previously discussed that a common feature across several behavioral addictions is the presence of environments characterized by complex reward schedules that are difficult to compute (Cash et al., 2012; Greenberg et al., 2022; Young & Abreu, 2011). The active inference framework generalizes this principle beyond the need to focus on rewards, entailing that behavioral addictions can emerge in the intersection between model-based dysfunctions and any sufficiently complex environment (e.g., due to ramified or variable state-action-outcome contingencies). This would include behavioral addictions characterized by complex reward schedules (such as gambling or videogaming), as well as others characterized by difficult to compute and variable health related, economic, or social outcomes (e.g., social network use, eating disorders, compulsive buying, compulsive sexual behavior, or pornography addiction).

Neural models

Both RL and active inference perspectives rely on dopamine signals to trigger behavioral plasticity. The RL paradigm interprets dopamine burst firings as encoding RPEs, responsible for value-based updates (Montague, Dayan, & Sejnowski, 1996; Schultz et al., 1997; Watabe-Uchida, Eshel, & Uchida, 2017). Instead, the active inference approach postulates that dopamine signals represent precision in event distribution predictions and trigger the update of beliefs (Friston et al., 2016; Friston et al., 2015). Whether in terms of rewards or precision, the ubiquity and robustness of the neurocomputational mechanisms underlying dopamine signals (Fiore, Dolan, Strausfeld, & Hirth, 2015) has led to the early belief that dopamine release would be significantly affected only by extreme events, such as pharmacological manipulations. Indeed, substances of abuse interfere with dopaminergic signals, albeit neither homogenously nor linearly (Nutt, Lingford-Hughes, Erritzoe, & Stokes, 2015), triggering significant and long-lasting synaptic alterations across several brain regions (Korpi et al., 2015; Luscher & Malenka, 2011). However, as mentioned above, several investigations have now revealed that a similar, prediction error-based (cf. Shimomura et al., 2021), multifaceted role is played by dopamine in behavioral addictions (Antons et al., 2020; Baik, 2013), including pathological gambling (Clark, Boileau, & Zack, 2019; Potenza, 2013) , videogaming (Liu et al., 2017; Palaus et al., 2017; Weinstein, 2010), compulsive sexual behavior (Kraus, Voon, & Potenza, 2016; Voon et al., 2014), compulsive use of pornography (Gola et al., 2017; Hilton, 2013), and binge eating (Bello & Hajnal, 2010; Volkow et al., 2002; Wang et al., 2011). These investigations further suggest that the neural plasticity triggered by drug-induced dopamine signals is comparable with the same dopamine-mediated process triggered by the consumption of palatable food, variable reward schedules characterizing gambling or videogaming, use of pornography and so forth.

A key target of dopamine-mediated neuroplasticity are the cortico-striatal synapses (Everitt & Robbins, 2016; Koob & Volkow, 2016; Luscher, Robbins, & Everitt, 2020). In biophysical neural models simulating cortico-striatal circuit dynamics, mesolimbic dopamine bursts trigger cortico-striatal long-term synaptic potentiation (Montague et al., 1996; Nelson & Kreitzer, 2014; Redgrave, Prescott, & Gurney, 1999). These alterations in turn bias future choice selections, favoring the repetition of the stimulus-response combination that led to dopamine signals, resulting in instrumental conditioning, thus providing the neural mechanisms underlying the described increased reliance on model-free control (Barto, 1995; M. J. Frank, Seeberger, & O'Reilly R, 2004; Gurney, Prescott, & Redgrave, 2001a, 2001b). Furthermore, the presence of multiple parallel cortico-striatal circuits characterized by different functions, but similar architectures (Haber, 2016; Jahanshahi, Obeso, Rothwell, & Obeso, 2015; Obeso, Rodriguez-Oroz, Stamelou, Bhatia, & Burn, 2014) led to the hypothesis that dopamine signals could affect the neural dynamics of multiple circuits at the same time (Fiore et al., 2018). In particular, the neural dynamics of dorsal (sensorimotor selections) and ventral (value processing and goal selections) cortico-striatal circuits are usually associated with model-free and model-based control (Dolan & Dayan, 2013; O'Doherty, Cockburn, & Pauli, 2017). Therefore, in the dorsal circuit dopamine signals are assumed to bias sensorimotor selections and model-free behavior. Conversely, in the prefrontal circuit, these signals are hypothesized to bias the selections of goals or future values, affecting forward planning and goal selection plasticity, or the model-based control system (Fiore et al., 2018).

In terms of neural circuit transient dynamics (Durstewitz, Huys, & Koppe, 2021) cortico-striatal long-term potentiation triggered by mesolimbic dopamine signals results in increased circuit stability (Fiore et al., 2018). Neural models indicated that drug-induced mesocortical dopamine signals have a similar effect on prefrontal cortico-cortical connectivity and dynamics (Lapish, Balaguer-Ballester, Seamans, Phillips, & Durstewitz, 2015), further deteriorating state-transition flexibility, and strengthening attractor-like dynamics. Although the effects of mesocortical dopamine release in substance use disorders are multifaceted (Ceceli, Bradberry, & Goldstein, 2022), attractor-like dynamics in the prefrontal cortex are consistent with the reported rigid representation of future state-action values (cf. 'incentive salience', Ceceli et al., 2022) and interoceptive states (Gu et al., 2015; Naqvi & Bechara, 2009), both key elements in the phenomena of craving and relapse. As discussed for the mesolimbic dopaminergic signals, these phenomena associated with mesocortical dopamine-induced alterations are not restricted to pharmacological manipulations, since behavioral addictions are based on the same dopamine dynamics (Antons et al., 2020; Baik, 2013). Further studies into the specific neural mechanisms underlying behavioral addictions will be needed to confirm or disprove this hypothesized similarity.

Conclusions and future directions

The objective of computational psychiatry (Huys, Maia, & Frank, 2016; Montague, Dolan, Friston, & Dayan, 2012) is to develop neurocomputational measures of disease- and subject-specific neural and cognitive mechanisms underlying decision-making, with the ultimate goal to inform precision diagnosis and treatment. Models relying on RL algorithms, Bayesian inference and neural dynamics that focus on vulnerabilities related to model-free and model-based control can explain the emergence and rigidity of maladaptive choices, despite the adverse consequences. Although most of the models here discussed were developed to describe substance use disorders, they can explain hallmark features of a wide range of behavioral addictions, with RL and active inference models more suitable to investigate (aberrant) structures of rewards and beliefs, respectively, and neural models dedicated to the investigation of (aberrant) attractor dynamics in neural activity. A key challenge in relation with behavioral addictions is to determine which of these compulsive behaviors qualifies as a legitimate form of addiction. We propose that the multifaceted neurocomputational representations of substance use disorders here described across modelling frameworks can be used as a benchmark to formally define a cluster of alterations that characterizes addictions. Those behavioral addictions found to meet these criteria -e.g., behavioral rigidity due to over-reliance on model-free control, impaired forward planning due to inadequate representations in model-based control, or increased stability in transient neural dynamics- should be included as a form of addiction. Some data suggested that behavioral and brain activity measures estimated relying on computational models can outperform traditional clinical measures in predicting clinical status, likelihood of relapse or vulnerability in substance use disorders (e.g., see: Harle, Stewart, et al., 2015; Yu et al., 2020). Thus, we expect the discussed computational models could provide a guide for behavioral addiction classification, in the near future.

Finally, it has been suggested (Heilig, Epstein, Nader, & Shaham, 2016; van den Ende et al., 2022) that these models have so far neglected the complex interplay of social (e.g., peer influence or isolation, societal stigma or tolerance etc.) and psychobiological factors. As many behavioral addictions are clearly affected by social interactions, it will be crucial for future investigations to include these social components (cf. Frolichs, Rosenblau, & Korn, 2022), using a new generation of tasks and neuro-computational models.

Acknowledgements

AK was supported by the RIKEN JRA fellowship and a Grant-in-Aid for JSPS Research Fellow (19J12156). DO is supported by the European Union's Horizon 2020 research and innovation programme under grant agreement (No. 824153 POTION) and by the project COURAGE - A social media companion safeguarding and educating students (no. 95567), funded by the Volkswagen Foundation inside the initiative Artificial Intelligence and the Society of the Future. MAP is supported by a grant from National Institute of Drug Abuse (K01DA043615). LAB is supported by grants from the National Institute of Mental Health (K23MH118418; R21MH124352; R01MH126448), a NARSAD Young Investigator Grant from the Brain & Behavior Research Foundation, and a Feeding Hope Fund Research Grant from the National Eating Disorders Association. KM was supported by Grant-in-Aid for Scientific Research No. 20H05049 and No. 19K21809 of the Japan Society for the Promotion of Science (JSPS) and the Ministry of Education, Culture, Sports, Science and Technology in Japan. VGF is funded by the Mental Illness Research, Education, and Clinical Center (MIRECC VISN 2) at the James J. Peter Veterans Affairs Medical Center, Bronx, NY.

References

  1. Antons S, Brand M, & Potenza MN (2020). Neurobiology of cue-reactivity, craving, and inhibitory control in non-substance addictive behaviors. J Neurol Sci, 415, 116952. [DOI] [PubMed] [Google Scholar]
  2. Baik JH (2013). Dopamine signaling in reward-related behaviors. Front Neural Circuits, 7, 152. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Barto AG (1995). Adaptive critics and the basal ganglia. In Houk JC, Davis JL & B. DG (Eds.), Models of Information Processing in the Basal Ganglia. Cambridge, MA: : MIT Press; [Google Scholar]
  4. Bello NT, & Hajnal A (2010). Dopamine and binge eating behaviors. Pharmacol Biochem Behav, 97(1), 25–33. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Berner LA, Fiore VG, Chen J, Krueger A, Kaye WH, Viranda T, & de Wit S (in press). Impaired belief updating and devaluation in adult women with bulimia nervosa. Translational Psychiatry. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Bernheim BD, & Rangel A (2004). Addiction and Cue-Triggered Decision Processes. Am Econ Rev, 94(5), 1558–1590. [DOI] [PubMed] [Google Scholar]
  7. Berridge KC, & Robinson TE (2016). Liking, wanting, and the incentive-sensitization theory of addiction. Am Psychol, 71(8), 670–679. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Cash H, Rae CD, Steel AH, & Winkler A (2012). Internet Addiction: A Brief Summary of Research and Practice. Curr Psychiatry Rev, 8(4), 292–298. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Ceceli AO, Bradberry CW, & Goldstein RZ (2022). The neurobiology of drug addiction: cross-species insights into the dysfunction and recovery of the prefrontal cortex. Neuropsychopharmacology, 47(1), 276–291. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Clark L, Boileau I, & Zack M (2019). Neuroimaging of reward mechanisms in Gambling disorder: an integrative review. Mol Psychiatry, 24(5), 674–693. [DOI] [PubMed] [Google Scholar]
  11. Dayan P (1993). Improving Generalization for Temporal Difference Learning: The Successor Representation. Neural Computation, 5(4), 613–624. [Google Scholar]
  12. Deserno L, Huys QJ, Boehme R, Buchert R, Heinze HJ, Grace AA, … Schlagenhauf F (2015). Ventral striatal dopamine reflects behavioral and neural signatures of model-based control during sequential decision making. Proc Natl Acad Sci U S A, 112(5), 1595–1600. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Dezfouli A, Piray P, Keramati MM, Ekhtiari H, Lucas C, & Mokri A (2009). A neurocomputational model for cocaine addiction. Neural Comput, 21(10), 2869–2893. [DOI] [PubMed] [Google Scholar]
  14. Di Chiara G (1999). Drug addiction as dopamine-dependent associative learning disorder. Eur J Pharmacol, 375(1-3), 13–30. [DOI] [PubMed] [Google Scholar]
  15. Dolan RJ, & Dayan P (2013). Goals and habits in the brain. Neuron, 80(2), 312–325. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Durstewitz D, Huys QJM, & Koppe G (2021). Psychiatric Illnesses as Disorders of Network Dynamics. Biol Psychiatry Cogn Neurosci Neuroimaging, 6(9), 865–876. [DOI] [PubMed] [Google Scholar]
  17. el-Guebaly N, Mudry T, Zohar J, Tavares H, & Potenza MN (2012). Compulsive features in behavioural addictions: the case of pathological gambling. Addiction, 107(10), 1726–1734. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Ersche KD, Gillan CM, Jones PS, Williams GB, Ward LH, Luijten M, … Robbins TW (2016). Carrots and sticks fail to change behavior in cocaine addiction. Science, 352(6292), 1468–1471. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Everitt BJ, & Robbins TW (2013). From the ventral to the dorsal striatum: devolving views of their roles in drug addiction. Neurosci Biobehav Rev, 37(9 Pt A), 1946–1954. [DOI] [PubMed] [Google Scholar]
  20. Everitt BJ, & Robbins TW (2016). Drug Addiction: Updating Actions to Habits to Compulsions Ten Years On. Annu Rev Psychol, 67, 23–50. [DOI] [PubMed] [Google Scholar]
  21. Fiore VG, Dolan RJ, Strausfeld NJ, & Hirth F (2015). Evolutionarily conserved mechanisms for the selection and maintenance of behavioural activity. Philos Trans R Soc Lond B Biol Sci, 370(1684). [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Fiore VG, Ognibene D, Adinoff B, & Gu X (2018). A Multilevel Computational Characterization of Endophenotypes in Addiction. eNeuro, 5(4). [DOI] [PMC free article] [PubMed] [Google Scholar]
  23. Fradkin I, Adams RA, Parr T, Roiser JP, & Huppert JD (2020). Searching for an anchor in an unpredictable world: A computational model of obsessive compulsive disorder. Psychol Rev, 127(5), 672–699. [DOI] [PubMed] [Google Scholar]
  24. Frank GKW, Shott ME, Stoddard J, Swindle S, & Pryor TL (2021). Association of Brain Reward Response With Body Mass Index and Ventral Striatal-Hypothalamic Circuitry Among Young Women With Eating Disorders. JAMA Psychiatry, 78(10), 1123–1133. [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Frank MJ, Seeberger LC, & O'Reilly R C (2004). By carrot or by stick: cognitive reinforcement learning in parkinsonism. Science, 306(5703), 1940–1943. [DOI] [PubMed] [Google Scholar]
  26. Friston K (2013). Active inference and free energy. Behav Brain Sci, 36(3), 212–213. [DOI] [PubMed] [Google Scholar]
  27. Friston K, FitzGerald T, Rigoli F, Schwartenbeck P, J OD, & Pezzulo G (2016). Active inference and learning. Neurosci Biobehav Rev, 68, 862–879. [DOI] [PMC free article] [PubMed] [Google Scholar]
  28. Friston K, Rigoli F, Ognibene D, Mathys C, Fitzgerald T, & Pezzulo G (2015). Active inference and epistemic value. Cogn Neurosci, 6(4), 187–214. [DOI] [PubMed] [Google Scholar]
  29. Friston K, Shiner T, FitzGerald T, Galea JM, Adams R, Brown H, … Bestmann S (2012). Dopamine, affordance and active inference. PLoS Comput Biol, 8(1), e1002327. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Frolichs KMM, Rosenblau G, & Korn CW (2022). Incorporating social knowledge structures into computational models. Nat Commun, 13(1), 6205. [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Gola M, Wordecha M, Sescousse G, Lew-Starowicz M, Kossowski B, Wypych M, … Marchewka A (2017). Can Pornography be Addictive? An fMRI Study of Men Seeking Treatment for Problematic Pornography Use. Neuropsychopharmacology, 42(10), 2021–2031. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Golec K, Draps M, Stark R, Pluta A, & Gola M (2021). Aberrant orbitofrontal cortex reactivity to erotic cues in Compulsive Sexual Behavior Disorder. J Behav Addict, 10(3), 646–656. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Granero R, Fernandez-Aranda F, Mestre-Bach G, Steward T, Bano M, Del Pino-Gutierrez A, … Jimenez-Murcia S (2016). Compulsive Buying Behavior: Clinical Comparison with Other Behavioral Addictions. Front Psychol, 7, 914. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Grant JE, & Chamberlain SR (2014). Impulsive action and impulsive choice across substance and behavioral addictions: cause or consequence? Addict Behav, 39(11), 1632–1639. [DOI] [PubMed] [Google Scholar]
  35. Grant JE, Potenza MN, Weinstein A, & Gorelick DA (2010). Introduction to behavioral addictions. Am J Drug Alcohol Abuse, 36(5), 233–241. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Greenberg NR, Zhai ZW, Hoff RA, Krishnan-Sarin S, & Potenza MN (2022). An exploratory study of problematic shopping and problematic video gaming in adolescents. PLoS One, 17(8), e0272228. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Griffiths MD (2016). Compulsive sexual behaviour as a behavioural addiction: the impact of the internet and other issues. Addiction, 111(12), 2107–2108. [DOI] [PubMed] [Google Scholar]
  38. Grimm JW, Hope BT, Wise RA, & Shaham Y (2001). Neuroadaptation. Incubation of cocaine craving after withdrawal. Nature, 412(6843), 141–142. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Gu X (2018). Incubation of craving: a Bayesian account. Neuropsychopharmacology, 43(12), 2337–2339. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Gu X, & Filbey F (2017). A Bayesian Observer Model of Drug Craving. JAMA Psychiatry, 74(4), 419–420. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Gu X, Lohrenz T, Salas R, Baldwin PR, Soltani A, Kirk U, … Montague PR (2015). Belief about nicotine selectively modulates value and reward prediction error signals in smokers. Proc Natl Acad Sci U S A, 112(8), 2539–2544. [DOI] [PMC free article] [PubMed] [Google Scholar]
  42. Gurney K, Prescott TJ, & Redgrave P (2001a). A computational model of action selection in the basal ganglia. I. A new functional anatomy. Biol Cybern, 84(6), 401–410. [DOI] [PubMed] [Google Scholar]
  43. Gurney K, Prescott TJ, & Redgrave P (2001b). A computational model of action selection in the basal ganglia. II. Analysis and simulation of behaviour. Biol Cybern, 84(6), 411–423. [DOI] [PubMed] [Google Scholar]
  44. Haber SN (2016). Corticostriatal circuitry. Dialogues Clin Neurosci, 18(1), 7–21. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Harle KM, Stewart JL, Zhang S, Tapert SF, Yu AJ, & Paulus MP (2015). Bayesian neural adjustment of inhibitory control predicts emergence of problem stimulant use. Brain, 138(Pt 11), 3413–3426. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Harle KM, Yu AJ, & Paulus MP (2019). Bayesian computational markers of relapse in methamphetamine dependence. Neuroimage Clin, 22, 101794. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Harle KM, Zhang S, Schiff M, Mackey S, Paulus MP, & Yu AJ (2015). Altered Statistical Learning and Decision-Making in Methamphetamine Dependence: Evidence from a Two-Armed Bandit Task. Front Psychol, 6, 1910. [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. Heilig M, Epstein DH, Nader MA, & Shaham Y (2016). Time to connect: bringing social context into addiction neuroscience. Nat Rev Neurosci, 17(9), 592–599. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Hilton DL Jr. (2013). Pornography addiction - a supranormal stimulus considered in the context of neuroplasticity. Socioaffect Neurosci Psychol, 3, 20767. [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Huys QJ, Maia TV, & Frank MJ (2016). Computational psychiatry as a bridge from neuroscience to clinical applications. Nat Neurosci, 19(3), 404–413. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Ide JS, Hu S, Zhang S, Yu AJ, & Li CS (2015). Impaired Bayesian learning for cognitive control in cocaine dependence. Drug Alcohol Depend, 151, 220–227. [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Jahanshahi M, Obeso I, Rothwell JC, & Obeso JA (2015). A fronto-striato-subthalamic-pallidal network for goal-directed and habitual inhibition. Nat Rev Neurosci, 16(12), 719–732. [DOI] [PubMed] [Google Scholar]
  53. Jorgenson AG, Hsiao RC, & Yen CF (2016). Internet Addiction and Other Behavioral Addictions. Child Adolesc Psychiatr Clin N Am, 25(3), 509–520. [DOI] [PubMed] [Google Scholar]
  54. Kato A, Kunisato Y, Katahira K, Okimura T, & Yamashita Y (2020). Computational Psychiatry Research Map (CPSYMAP): A New Database for Visualizing Research Papers. Front Psychiatry, 11, 578706. [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Keramati M, & Gutkin B (2013). Imbalanced decision hierarchy in addicts emerging from drug-hijacked dopamine spiraling circuit. PLoS One, 8(4), e61489. [DOI] [PMC free article] [PubMed] [Google Scholar]
  56. Kinley I, Amlung M, & Becker S (2022). Pathologies of precision: A Bayesian account of goals, habits, and episodic foresight in addiction. Brain Cogn, 158, 105843. [DOI] [PubMed] [Google Scholar]
  57. Koob GF, & Volkow ND (2016). Neurobiology of addiction: a neurocircuitry analysis. Lancet Psychiatry, 3(8), 760–773. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Korpi ER, den Hollander B, Farooq U, Vashchinkina E, Rajkumar R, Nutt DJ, … Dawe GS (2015). Mechanisms of Action and Persistent Neuroplasticity by Drugs of Abuse. Pharmacol Rev, 67(4), 872–1004. [DOI] [PubMed] [Google Scholar]
  59. Kraus SW, Voon V, & Potenza MN (2016). Neurobiology of Compulsive Sexual Behavior: Emerging Science. Neuropsychopharmacology, 41(1), 385–386. [DOI] [PMC free article] [PubMed] [Google Scholar]
  60. Lapish CC, Balaguer-Ballester E, Seamans JK, Phillips AG, & Durstewitz D (2015). Amphetamine Exerts Dose-Dependent Changes in Prefrontal Cortex Attractor Dynamics during Working Memory. J Neurosci, 35(28), 10172–10187. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Lindstrom B, Bellander M, Schultner DT, Chang A, Tobler PN, & Amodio DM (2021). A computational reward learning account of social media engagement. Nat Commun, 12(1), 1311. [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Liu L, Yip SW, Zhang JT, Wang LJ, Shen ZJ, Liu B, … Fang XY (2017). Activation of the ventral and dorsal striatum during cue reactivity in Internet gaming disorder. Addict Biol, 22(3), 791–801. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Love T, Laier C, Brand M, Hatch L, & Hajela R (2015). Neuroscience of Internet Pornography Addiction: A Review and Update. Behav Sci (Basel), 5(3), 388–433. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Luscher C, & Malenka RC (2011). Drug-evoked synaptic plasticity in addiction: from molecular changes to circuit remodeling. Neuron, 69(4), 650–663. [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Luscher C, Robbins TW, & Everitt BJ (2020). The transition to compulsion in addiction. Nat Rev Neurosci, 21(5), 247–263. [DOI] [PMC free article] [PubMed] [Google Scholar]
  66. Miller M, Kiverstein J, & Rietveld E (2020). Embodying addiction: A predictive processing account. Brain Cogn, 138, 105495. [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Mirza MB, Adams RA, Parr T, & Friston K (2019). Impulsivity and Active Inference. J Cogn Neurosci, 31(2), 202–220. [DOI] [PubMed] [Google Scholar]
  68. Mollick JA, & Kober H (2020). Computational models of drug use and addiction: A review. J Abnorm Psychol., 129(6), 544–555. [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Momennejad I, Russek EM, Cheong JH, Botvinick MM, Daw ND, & Gershman SJ (2017). The successor representation in human reinforcement learning. Nat Hum Behav, 1(9), 680–692. [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Montague PR, Dayan P, & Sejnowski TJ (1996). A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J Neurosci, 16(5), 1936–1947. [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Montague PR, Dolan RJ, Friston KJ, & Dayan P (2012). Computational psychiatry. Trends Cogn Sci, 16(1), 72–80. [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Naqvi NH, & Bechara A (2009). The hidden island of addiction: the insula. Trends Neurosci, 32(1), 56–67. [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Nelson AB, & Kreitzer AC (2014). Reassessing models of basal ganglia function and dysfunction. Annu Rev Neurosci, 37, 117–135. [DOI] [PMC free article] [PubMed] [Google Scholar]
  74. Nutt DJ, Lingford-Hughes A, Erritzoe D, & Stokes PR (2015). The dopamine theory of addiction: 40 years of highs and lows. Nat Rev Neurosci, 16(5), 305–312. [DOI] [PubMed] [Google Scholar]
  75. O'Doherty JP, Cockburn J, & Pauli WM (2017). Learning, Reward, and Decision Making. Annu Rev Psychol, 68, 73–100. [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Obeso JA, Rodriguez-Oroz MC, Stamelou M, Bhatia KP, & Burn DJ (2014). The expanding universe of disorders of the basal ganglia. Lancet, 384(9942), 523–531. [DOI] [PubMed] [Google Scholar]
  77. Ognibene D, Fiore VG, & Gu X (2019). Addiction beyond pharmacological effects: The role of environment complexity and bounded rationality. Neural Netw, 116, 269–278. [DOI] [PMC free article] [PubMed] [Google Scholar]
  78. Palaus M, Marron EM, Viejo-Sobera R, & Redolar-Ripoll D (2017). Neural Basis of Video Gaming: A Systematic Review. Front Hum Neurosci, 11, 248. [DOI] [PMC free article] [PubMed] [Google Scholar]
  79. Petry NM, & O'Brien CP (2013). Internet gaming disorder and the DSM-5. Addiction, 108(7), 1186–1187. [DOI] [PubMed] [Google Scholar]
  80. Potenza MN (2013). Neurobiology of gambling behaviors. Curr Opin Neurobiol, 23(4), 660–667. [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Redgrave P, Prescott TJ, & Gurney K (1999). The basal ganglia: a vertebrate solution to the selection problem? Neuroscience, 89(4), 1009–1023. [DOI] [PubMed] [Google Scholar]
  82. Redish AD (2004). Addiction as a computational process gone awry. Science, 306(5703), 1944–1947. [DOI] [PubMed] [Google Scholar]
  83. Redish AD, Jensen S, & Johnson A (2008). A unified framework for addiction: vulnerabilities in the decision process. Behav Brain Sci, 31(4), 415–437; discussion 437-487. [DOI] [PMC free article] [PubMed] [Google Scholar]
  84. Redish AD, Jensen S, Johnson A, & Kurth-Nelson Z (2007). Reconciling reinforcement learning models with behavioral extinction and renewal: implications for addiction, relapse, and problem gambling. Psychol Rev, 114(3), 784–805. [DOI] [PubMed] [Google Scholar]
  85. Redish AD, & Johnson A (2007). A computational model of craving and obsession. Ann N Y Acad Sci, 1104, 324–339. [DOI] [PubMed] [Google Scholar]
  86. Reiter AM, Heinze HJ, Schlagenhauf F, & Deserno L (2017). Impaired Flexible Reward-Based Decision-Making in Binge Eating Disorder: Evidence from Computational Modeling and Functional Neuroimaging. Neuropsychopharmacology, 42(3), 628–637. [DOI] [PMC free article] [PubMed] [Google Scholar]
  87. Russek EM, Momennejad I, Botvinick MM, Gershman SJ, & Daw ND (2017). Predictive representations can link model-based reinforcement learning to model-free mechanisms. PLoS Comput Biol, 13(9), e1005768. [DOI] [PMC free article] [PubMed] [Google Scholar]
  88. Schultz W, Dayan P, & Montague PR (1997). A neural substrate of prediction and reward. Science, 275(5306), 1593–1599. [DOI] [PubMed] [Google Scholar]
  89. Schwartenbeck P, FitzGerald TH, Mathys C, Dolan R, Wurst F, Kronbichler M, & Friston K (2015). Optimal inference with suboptimal models: addiction and active Bayesian inference. Med Hypotheses, 84(2), 109–117. [DOI] [PMC free article] [PubMed] [Google Scholar]
  90. Schwöbel S, Marković D, Smolka MN, & Kiebel SJ (2021). Balancing control: A Bayesian interpretation of habitual and goal-directed behavior. Journal of Mathematical Psychology, 100(102472). [Google Scholar]
  91. Shimomura K, Kato A, & Morita K (2021). Rigid reduced successor representation as a potential mechanism for addiction. EurJ Neurosci, 53(11), 3768–3790. [DOI] [PMC free article] [PubMed] [Google Scholar]
  92. Simon DA, & Daw ND (2012). Dual-system learning models and drugs of abuse. In Gutkin B & Ahmed SH (Eds.), Computational Neuroscience of Drug Addiction (pp. 145–161). New York: Springer-Verlag [Google Scholar]
  93. Smith R, Kirlic N, Stewart JL, Touthang J, Kuplicki R, Khalsa SS, … Aupperle RL (2021). Greater decision uncertainty characterizes a transdiagnostic patient sample during approach-avoidance conflict: a computational modelling approach. J Psychiatry Neurosci, 46(1), E74–E87. [DOI] [PMC free article] [PubMed] [Google Scholar]
  94. Smith R, Kirlic N, Stewart JL, Touthang J, Kuplicki R, McDermott TJ, … Aupperle RL (2021). Long-term stability of computational parameters during approach-avoidance conflict in a transdiagnostic psychiatric patient sample. Sci Rep, 11(1), 11783. [DOI] [PMC free article] [PubMed] [Google Scholar]
  95. Smith R, Schwartenbeck P, Stewart JL, Kuplicki R, Ekhtiari H, Paulus MP, & Tulsa I (2020). Imprecise action selection in substance use disorder: Evidence for active learning impairments when solving the explore-exploit dilemma. Drug Alcohol Depend, 215, 108208. [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Smith R, Taylor S, & Bilek E (2021). Computational Mechanisms of Addiction: Recent Evidence and Its Relevance to Addiction Medicine. Current Addiction Reports(8), 509–519. [Google Scholar]
  97. Strehl AL, Li L, & Littman ML (2009). Reinforcement Learning in Finite MDPs: PAC Analysis. Journal of Machine Learning Research, 10(11), 2413–2444. [Google Scholar]
  98. Sutton RS, & Barto AG (1998). Reinforcement Learning: An Introduction. Cambridge, MA: MIT Press. [Google Scholar]
  99. van den Ende MWJ, Epskamp S, Lees MH, van der Maas HLJ, Wiers RW, & Sloot PMA (2022). A review of mathematical modeling of addiction regarding both (neuro-) psychological processes and the social contagion perspectives. Addict Behav, 127, 107201. [DOI] [PubMed] [Google Scholar]
  100. Veisani Y, Jalilian Z, & Mohamadian F (2020). Relationship between internet addiction and mental health in adolescents. J Educ Health Promot, 9, 303. [DOI] [PMC free article] [PubMed] [Google Scholar]
  101. Volkow ND, & Morales M (2015). The Brain on Drugs: From Reward to Addiction. Cell, 162(4), 712–725. [DOI] [PubMed] [Google Scholar]
  102. Volkow ND, Wang GJ, Fowler JS, Logan J, Jayne M, Franceschi D, … Pappas N (2002). "Nonhedonic" food motivation in humans involves dopamine in the dorsal striatum and methylphenidate amplifies this effect. Synapse, 44(3), 175–180. [DOI] [PubMed] [Google Scholar]
  103. Voon V, Derbyshire K, Ruck C, Irvine MA, Worbe Y, Enander J, … Bullmore ET (2015). Disorders of compulsivity: a common bias towards learning habits. Mol Psychiatry, 20(3), 345–352. [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Voon V, Mole TB, Banca P, Porter L, Morris L, Mitchell S, … Irvine M (2014). Neural correlates of sexual cue reactivity in individuals with and without compulsive sexual behaviours. PLoS One, 9(7), e102419. [DOI] [PMC free article] [PubMed] [Google Scholar]
  105. Wang GJ, Geliebter A, Volkow ND, Telang FW, Logan J, Jayne MC, … Fowler JS (2011). Enhanced striatal dopamine release during food stimulation in binge eating disorder. Obesity (Silver Spring), 19(8), 1601–1608. [DOI] [PMC free article] [PubMed] [Google Scholar]
  106. Watabe-Uchida M, Eshel N, & Uchida N (2017). Neural Circuitry of Reward Prediction Error. Annu Rev Neurosci, 40, 373–394. [DOI] [PMC free article] [PubMed] [Google Scholar]
  107. Watkins CJ, & Dayan P (1992). Q-Learning. Machine Learning, 8(3-4), 279–292. [Google Scholar]
  108. Weinstein AM (2010). Computer and video game addiction-a comparison between game users and non-game users. Am J Drug Alcohol Abuse, 36(5), 268–276. [DOI] [PubMed] [Google Scholar]
  109. Wiehler A, Chakroun K, & Peters J (2021). Attenuated Directed Exploration during Reinforcement Learning in Gambling Disorder. J Neurosci, 41(11), 2512–2522. [DOI] [PMC free article] [PubMed] [Google Scholar]
  110. Wiss DA, Avena N, & Gold M (2020). Food Addiction and Psychosocial Adversity: Biological Embedding, Contextual Factors, and Public Health Implications. Nutrients, 12(11). [DOI] [PMC free article] [PubMed] [Google Scholar]
  111. Wiss DA, Avena N, & Rada P (2018). Sugar Addiction: From Evolution to Revolution. Front Psychiatry, 9, 545. [DOI] [PMC free article] [PubMed] [Google Scholar]
  112. Wiss DA, Criscitelli K, Gold M, & Avena N (2017). Preclinical evidence for the addiction potential of highly palatable foods: Current developments related to maternal influence. Appetite, 115, 19–27. [DOI] [PubMed] [Google Scholar]
  113. Wunderlich K, Smittenaar P, & Dolan RJ (2012). Dopamine enhances model-based over model-free choice behavior. Neuron, 75(3), 418–424. [DOI] [PMC free article] [PubMed] [Google Scholar]
  114. Yao YW, Potenza MN, & Zhang JT (2017). Internet Gaming Disorder Within the DSM-5 Framework and With an Eye Toward ICD-11. Am J Psychiatry, 174(5), 486–487. [DOI] [PubMed] [Google Scholar]
  115. Young KS, & Abreu C. N. d. (2011). Internet addiction : a handbook and guide to evaluation and treatment. Hoboken, NJ: John Wiley & Sons. [Google Scholar]
  116. Yu J-C, Fiore VG, Briggs RW, Braud J, Rubia K, Adinoff B, & Gu X (2020). An insula - driven network computes decision uncertainty and promotes abstinence in chronic cocaine users. European Journal of Neuroscience, 52(12), 4923–4936. [DOI] [PMC free article] [PubMed] [Google Scholar]

RESOURCES