Abstract
Background
Human decision making exhibits a mixture of model-based and model-free control. Recent evidence indicates arbitration between these two modes of control (“metacontrol”) is based on their relative costs and benefits. While model-based control may increase accuracy, it requires greater computational resources, so people only invoke model-based control when potential rewards exceed those of model-free control. We used a sequential decision task, while concurrently manipulating performance incentives, to ask if symptoms and traits of psychopathology decrease or increase model-based control in response to incentives.
Methods
We recruited a non-patient population of 839 online participants using Amazon Mechanical Turk, who completed transdiagnostic self-report measures encompassing symptoms, traits, and factors. We fit a dual controller reinforcement learning model and obtained a computational measure of model-based control separately for small incentives and large incentives.
Results
None of the constructs were related to a failure of large incentives to boost model-based control. In fact, for the sensation seeking trait and anxious depression factor, higher scores were associated with a larger incentive effect, whereby greater levels of these constructs were associated with larger increases in model-based control. Many constructs showed decreases in model-based control as a function of severity, but a social withdrawal factor was positively correlated; alcohol use and social anxiety were unrelated to model-based control.
Conclusions
Our results demonstrate that model-based control can reliably be improved independent of construct severity for most measures. This suggests incentives may be a useful intervention for boosting model-based control across a range of symptom and trait severity.
Keywords: computational psychiatry, reinforcement learning, model-based control, psychiatric constructs, incentives, habits and goals
Introduction
Decisions are sometimes the product of habit, and sometimes the product of planning. The two forms of decision making embody complementary strengths and weaknesses: habits are inflexible (and hence sometimes inaccurate) but require minimal cognitive effort, whereas plans support flexible goal pursuit but require greater cognitive effort. This dichotomy has significant clinical implications, because some psychopathological behaviors can be understood as arising from the hegemony of habits over plans. For example, drug addiction and obsessive-compulsive disorder are associated with an inability to overcome maladaptive habits (1).
Computational models have led to a formal description of the habit-planning dichotomy by specifying precise algorithmic hypotheses (2–4), with wide-ranging ramifications for the neurobiology of decision making and its breakdown in psychopathology. In particular, reinforcement learning (RL) models have operationalized habits in terms of “model-free” control and planning in terms of “model-based” control. Model-free control selects actions based on the degree to which they have been rewarded, using cached reward predictions that are updated by trial and error. Because these cached predictions can only be updated by interaction with the environment (though see (5)), they will be insensitive to changes in the environment that have not been directly experienced, leading to the brittleness characteristic of habits. Model-based control selects actions based on an internal model of the environment, which specifies how actions affect the state of the environment. By using mental simulation of the internal model, model-based control can generate sequential plans that adapt flexibly to changes in the environment, even without direct experience. However, mental simulation is cognitively costly, and thus people will prefer model-free control when cognitive resources are scarce (6), accuracy is poorly incentivized (7), or mental simulation produces unreliable reward predictions (2, 8).
In the current study, we examine the balance of model-free and model-based control using an RL paradigm that allows us to quantify the degree to which model-based control can be incentivized. This allows us to ask a critical question for the treatment of certain psychiatric symptoms and traits: can incentives ameliorate model-based deficits associated with certain clinical symptoms and traits? We approach this question by collecting a broad range of commonly used self-report measures and well-defined transdiagnostic traits in a diverse online population, measuring the relationship between these measures and a computationally-derived estimate of model-based control under different incentive conditions.
Model-based Control and Psychopathology
Empirical studies indicate that model-based control is disrupted across an array of disorders and psychopathological constructs. In particular, model-based impairments have been revealed in schizophrenia (9), binge eating disorder (10), OCD (10), and methamphetamine dependence (10). However, studies of alcohol dependence have been equivocal, with research finding behavioral deficits (11) and reduced prefrontal signatures of model-based control (12) in detoxified alcohol dependent patients, but no association between model-based control and problematic drinking in adolescents (13).
While these studies focused on DSM diagnoses, a recent study by Gillan and colleagues (14) examined model-based control among several transdiagnostic traits. They found that a factor containing compulsive behavior and intrusive thought items was negatively associated with model-based control, whereas an anxious depression factor showed no relationship, and a social withdrawal factor had a small positive association with model-based control (14). Comparing diagnostic groups has several problems, including high levels of comorbidity and shared symptoms among people in different diagnostic groups (15–17). On the other hand using continuous outcomes allows researchers to understand how cognitive processes, such as model-based control, co-vary with increasing symptom and trait severity, across several clinically-related constructs.
Gillan and colleagues measured model-based control using a sequential decision task developed by Daw et al. (2011), which we will refer to as the “Daw two-step task.” This task has been the standard assessment tool for measuring model-based control in clinical studies. However, recent research has shown that the Daw two-step task does not incentivize model-based control, because model-free and model-based control lead to roughly equivalent performance on the task (18). Moreover, increasing incentives in the Daw two-step task does not increase model-based control (7). For this reason, the task is not useful for addressing the question of whether psychiatric symptoms and traits are associated with an inability to boost model-based control in response to incentives. Fortunately, Kool and colleagues (18) have developed a novel sequential decision task in which model-based control does lead to improved performance, and incentives are effective at amplifying model-based control on the task (7). In particular, informing participants that their earnings would be multiplied by 5 on particular trials (the “high stakes” condition) was effective at increasing their reliance on model-based control relative to a baseline (“low stakes”) condition. Our goal in this paper is to revisit the approach of Gillan and colleagues using the Kool two-step task, which allows us to measure the effect of incentives on model-based control across a range of severity on several psychiatric constructs.
Principles of Metacontrol
On the basis of behavioral data from the Kool two-step task, we have argued that arbitration between model-based and model-free control is implemented by a cost-benefit analysis (see also (19)). According to this view, a metacontroller approximates the relative costs and benefits of using each controller and chooses one based on the optimal cost-benefit ratio (Figure 1A). In the Daw two-step task, model-based control confers no benefit, and hence the metacontroller will prefer the cognitively cheaper model-free controller. In the Kool two-step task, by contrast, model-based control does confer a benefit, so it will tend to be preferred by the metacontroller. This explains why people are overall more model-based on the Kool task than on the Daw task, and why incentives increase reliance on model-based control only on the Kool task.
We note that this is not the only way to understand the tension between habits and planning; for example, theories based on the free energy principle frame this tension in terms of model comparison (20). However, since the model-based/model-free distinction is currently the most prominent and well-characterized framework, we will focus on it in this paper.
We will use this framework to interpret the results of our study, by investigating three patterns (schematized in Figure 1B): (1) the effect of psychiatric constructs on model-based control; (2) the effect of incentives on model-based control; and (3) the interaction between these two effects. We expect, based on the work of Gillan et al. (2016), that many psychiatric constructs will be associated with lower model-based control. Furthermore, we expect, based on the work of Kool et al. (2017), that incentives will increase model-based control overall. The critical question is whether incentives manifest as a fixed boost in model-based control regardless of symptom or trait severity, or whether severity modulates the boost. The answer to this question will have implications for the effectiveness of incentive-based interventions. If high symptom or trait severity is associated with low sensitivity to incentives, then it is unlikely that such interventions will be effective for clinically relevant psychiatric symptoms and traits. If, on the other hand, the incentive effect observed by Kool and colleagues is independent of symptom or trait severity, then such interventions may hold promise.
Methods and Materials
Participants
We recruited 941 participants using Amazon Mechanical Turk. Participants gave informed consent and the study was approved by the Harvard Committee on the Use of Human Subjects.
Participants had to reside in the United States, have a 90% approval rating and 100 completed HITs. Participants completed a computer adaptive IQ test, the novel two step paradigm, 19 clinical scales, a clinical demographic questionnaire (Table 1), and an additional cognitive task. Compensation was $20 with a performance bonus ($0.48 – $1.11). Following the exclusion criteria (Supplement), the final sample included 839 participants that were 48.75% female and 51.25% male, ranged from 18–69 years old (M = 34.95, SD = 10.1) with IQ (M = 99.1, SD = 9.71).
Table 1.
Self-reported Clinical Characteristics | n=839 | |
---|---|---|
Endorsed Specific Diagnosis | 36.83% | (309) |
2 Diagnoses | 13.59% | (114) |
>= 3 Diagnoses | 6.80% | (56) |
| ||
Past Treatment: Any Treatment | 30.99% | (260) |
Partial, Inpatient, Residential | 8.46% | (71) |
| ||
Current Treatment: Any Treatment | 19.79% | (166) |
Partial, Inpatient, Residential | 4.29% | (36) |
Psychiatric Medication | 9.77% | (81) |
Self-Report Measures
Participants completed several self-report measures (descriptions in Supplementary Table S1, M (SD) in S2) including the Apathy Evaluation Scale (21), trait portion of the State-Trait Anxiety Inventory (22), Alcohol Use Disorders Identification Test (23), Zung Depression Scale (24), Short Schizotypy Scale (25), Obsessive-Compulsive Inventory-Revised (26), Social Anxiety Scale (27), Eating Attitudes Test (28), Intolerance of Uncertainty Scale (29), Anxiety Sensitivity Index-3 (30), Ruminative Response Scale (31), Difficulties with Emotion Regulation Scale (32), Distress Tolerance Scale (33), Barratt Impulsiveness Scale-11 (34), and the UPPS-P Impulsivity Scale (35–37) (the UPPS-P is comprised of positive & negative urgency, sensation seeking, lack of premeditation, and lack of perseverance). We summarized nine of these self-report measures (Supplementary Table S1) with three latent constructs (anxious depression, compulsive behavior and intrusive thought, and social withdrawal) that were generated using the factor loadings from Gillan and colleagues 2016 (14).
Sequential Decision Task
Participants performed 200 trials of the two-step task developed by Kool and colleagues (7), which allowed us to measure adaptive increases in model-based control by comparing the degree of model-based control across high and low stakes (Figure 2A/B). In this task, the participant randomly starts in one of two first-stage states, choosing one of two rocket ships (which are randomly mapped to response keys). Conditional on this choice, the participant deterministically transitions to the purple or red alien planet. On this alien planet, the participant then receives a reward indicated by alien treasure. The amount of reward obtained at each planet changes randomly and gradually over the task, independently for each planet (a Gaussian random walk between 0 and +9, additional details in Supplement).
If the participant fails to make a response, no reward is delivered and the task proceeds to the next trial. Each trial was randomly assigned to one of two conditions. In the “high stakes” condition, the participant received 5 times the alien treasure reward (5x multiplier; Figure 2A). In the “low stakes” condition, the participant received the displayed alien treasure reward.
Analyses
Computational Model
We estimated model-based control for high stakes (large incentives) and low stakes (small incentives) with a dual-system reinforcement learning model (38, 39). In this framework, model-free control learns the value of actions through direct experience and is insensitive to sudden environmental changes. Alternatively, model-based control uses a model of the environment to instantaneously update which first stage choice leads to each alien planet. The balance between model-free and model-based control is governed by a free parameter (w) in the computational model. We fit the parameter “w” for low stakes (small incentives) and high stakes (large incentives) separately (additional model fitting details in Supplement).
Predictors of Model-based Control (Bayesian Regression)
Bayesian linear regression (40, 41) was used to quantify the relationship between metacontrol, average reward rate, and self-report measures, while controlling for age, IQ, and gender. A classic general linear model results in a single beta value point estimate and confidence intervals around that beta value that represent the percentage of the time the interval would contain the population beta value if the study was conducted many times. In contrast, rather than providing a single point estimate, a Bayesian regression provides a posterior distribution of beta values given the data, quantifying the uncertainty about the betas. We summarized the posterior using a 95% highest posterior density (HPD) credible interval around the mode. We also report the posterior probabilities that the beta value is greater or less zero.
We used the brms package (42) with the default prior σ ~ student-t(3,0,10) and separately regressed each self-report measure and average reward rate onto model-based control (w) while controlling for age, IQ, and gender. We included a regressor expressing the interaction between the self-report measure and high or low stakes. Thus, we examined changes in model-based control across stakes as a function of each construct separately. This is consistent with Gillan and colleagues (14) and also retains the construct validity of these measures, as the constructs have been widely examined independently in prior research. We also ran a separate analysis entering all scales into a single regression, though it is possible the correlations across measures diminishes the construct validity of any individual measure. This is because it is unknown which portion of a single measures’ variance accounts for the effects while controlling for other measures. In addition, we ran separate regressions adding the inverse temperature parameter as a confound.
Factor Scores
The three transdiagnostic factors were generated using the published factor loadings from Gillan and colleagues (14). The authors factor analyzed nine self-report scales reducing the data to three dimensions subsequently termed anxious depression, compulsive behavior and intrusive thought, and social withdrawal. Using the scales overlapping with the Gillan 2016 study (14) (see Supplement Table S1), we set their published factor loadings as independent variables in a regression and set the item level scores for our participants as the dependent variable, thereby generating factor scores for our participants. Using Bayesian regression, we entered factor scores for all three factors concurrently as independent variables and the interaction with stakes while controlling for age, IQ, and gender, with model-based control (w) as the dependent variable. Thus, we examined changes in model-based control (w) as a function of the interaction between stakes and the psychiatric factors. In a separate analysis, we also added inverse temperature as a covariate to mirror the self-report regressions.
Results
Stakes, Reward Rate, and Model-based Control
Consistent with the findings of Kool and colleagues 2017 (7), a paired-sample t-test indicated participants engaged in significantly more model-based control (w) in the high stakes compared to the low stakes condition (Figure 2C), t(838) = 10.21, p < .001; Cohen’s d = 0.35. Using Bayesian regression, the HPD for high and low stakes indicated that greater average reward rate predicted greater model-based control with high certainty (narrow credible intervals); low stakes condition (Figure 2D), (βmean = 0.473, CI = [0.417, 0.526]), high stakes condition (βmean = 0.357, CI = [0.290, 0.404]). These replicate the key findings from Kool and colleagues (7, 18).
Self-report Measures and Model-based Control
Figure 3A displays the mean beta and credible interval for the association between self-report measures and model-based control. For the majority of constructs measured, Bayesian regressions revealed high certainty of the presence of relationships with model-based control (w; Table 2). For example, the posterior placed nearly all its mass on the beta value for distress intolerance (Figure 3B) being less than zero. This means we can have high certainty that the relationship between distress intolerance and model-based control (w) is negative.
Table 2.
Stakes: | High | Low | High-Low | |
---|---|---|---|---|
|
||||
Construct | Neg | Neg | Neg | |
Obsessive-Compulsive | 100% | 100% | 73% | |
Distress Intolerance | 100% | 100% | 26% | |
Positive Urgency | 100% | 100% | 24% | |
Negative Urgency | 99% | 100% | 27% | |
Anxiety Sensitivity | 97% | 99% | 32% | |
Depression | 96% | 100% | 17% | |
Uncertainty Intolerance | 100% | 95% | 79% | |
Emotion Dysregulation | 99% | 95% | 65% | |
Apathy | 95% | 98% | 40% | |
Disordered Eating | 95% | 98% | 38% | |
Perseverance (lack of) | 91% | 98% | 29% | |
Barratt Impulsiveness | 90% | 99% | 27% | |
Schizotypy | 92% | 93% | 46% | |
Premeditation (lack of) | 97% | 80% | 79% | |
Trait Anxiety | 75% | 99% | 14% | |
Rumination | 94% | 78% | 72% | |
Alcohol Use | 65% | 82% | 36% | |
Sensation Seeking | 38% | 100% | 1% | |
Social Anxiety | 59% | 37% | 66% |
Of note, sensation seeking (Figure 3C) was the only measure showing an interaction with stakes, such that there was a large negative relationship between sensation seeking and model-based control (100% of distribution is highly negative) that was present during low stakes was not present during high stakes, with the posterior distribution roughly centered around 0 (38% negative, 62% positive). Corresponding to this difference, 99% of the posterior distribution was positive for the high-low stakes interaction. Scatter plots showing original data points are in Supplementary Figure S1.
Additionally, the results of entering all measures into a single regression are in supplemental figure S4, but should be interpreted with caution because the correlations between measures (i.e., correlation heatmap; S3) diminishes their construct validity. Including inverse temperature largely reiterated the same pattern of results, though slightly weaker (Supplemental Figure S5). Nevertheless, inverse temperature and model-based control are highly correlated (r(1676) = −0.4478, p < .001) and suffer from non-identifiability in the computational model. Therefore, including both builds unnecessary redundancy into the regression model.
Factors and Model-based Control
Table 2 and Figure 4A show the mean beta values and credible intervals for the three transdiagnostic factors (i.e. compulsive behavior and intrusive thought, anxious depression, social withdrawal) in predicting model-based control across stakes. The anxious depression factor (Figure 4B) showed a similar pattern of results to sensation seeking, with 99% of the posterior probability for low stakes below 0, indicating a strong anti-correlation between anxious depression and model-based control; this effect was eliminated for high stakes, with the posterior probability closely centered around 0 (42% negative, 58% positive). For the high-low stakes interaction, 96% of the posterior was positive. The compulsive behavior and intrusive thought factor (Figure 4C) posterior distribution was nearly 100% negative across stakes. Moreover, the mean beta and credible interval demonstrated the largest effect in predicting reductions in model-based control relative to the other two factors. Using the 95% cutoff, the social withdrawal factor (Figure 4D) predicted increases in model-based control for low stakes but this effect was slightly weakened for high stakes (93%). In a Bayesian framework, the high posterior probability for beta values below zero (e.g., compulsive behavior and intrusive thought factor) and large probabilities for beta values above zero (i.e., stakes interaction in anxious depression) provides relatively high certainty about the presence of three different relationships between the factors and model-based control. Scatter plots with original data points in Supplement S2. After adding inverse temperature to the model, the results remained consistent, though they were slightly weaker (Supplemental Figure S6).
Discussion
The primary goal of the current study was to ask the question, do people higher in symptom or trait severity boost model-based control in response to incentives? We found that 1) incentives boost model-based control across a range of severity on several psychiatric constructs, 2) sensation seeking and the anxious-depression factor showed a larger incentive effect, whereby higher severity was associated with greater boosts in model-based control. Finally, 3) most constructs were associated with model-based deficits but there were some exceptions, including social anxiety and alcohol use, which showed no relationship with model-based control, and the social withdrawal factor, which was related to increased model-based control.
In contrast to prior research on model-based control deficits in psychopathology, which has conceptualized these deficits as fixed individual traits (9–14), our metacontrol framework conceptualizes them as dynamic and adaptive. The results of the current study suggest that model-based control can be flexibly deployed depending on incentives in the environment and increased clinical symptoms and traits do not diminish this flexibility. This has important clinical implications because interventions that use incentives have shown a range of positive psychiatric and health-related outcomes. For example, contingency management is an incentive based intervention, whereby individuals diagnosed with substance use disorders are provided incentives in exchange for evidence of behavioral change (43). Moreover, incentives have been widely efficacious in fostering health-related goals such as smoking cessation, increased physical activity, healthy eating, and reduced alcohol consumption (44). Incentives also consistently improve response inhibition in patient populations diagnosed with ADHD (45). Potentially one reason these incentives have been found to be efficacious, is that incentives may enhance processes underlying model-based control. Future studies could extend the current results in diagnosed patient samples by providing incentives for increases in model-based control, and testing if these changes correspond to treatment outcomes.
Second, our results showed an incentive interaction with both sensation seeking and the anxious depression factor, where increasing scores on these measures predicted larger boosts in model-based control in response to incentives. Although, these two constructs show the same pattern of results, it may be due to different mechanisms. Sensation seeking is putatively characterized by increased appetitive drive and decreased sensitivity to punishment (46). One possibility is that the high stakes condition may promote increased model-based control through hypersensitivity to reward, and this is consistent with theoretical models (46). For the anxious depression factor, although it is unclear to us why this factor was related to larger boosts in model-based control, it seems unlikely that it is related to hypersensitivity to reward.
Third, the self-report measures generally showed a negative relationship with model-based control, with the exception of social anxiety and alcohol use, which were unrelated to model-based control, and sensation seeking, which showed an interaction with incentives. Some of the self-report measures had strong correlations (27% of the correlations were r > .5 but only 3% were r > .7; see S3 for a correlation matrix). Thus, the negative relationship to model-based control across most constructs may be due to shared variance of a common illness factor (47, 48). However, this is unlikely to be the sole explanation for our findings, because the majority of the correlations between the constructs were weak or moderate and, furthermore, the psychiatric factors show three separate relationships to model-based control. (i.e., compulsive behavior shows a negative relationship, social withdrawal shows a positive relationship, anxious depression shows an interaction with incentives).
Limitations
There are several limitations to the current study. First, our analyses were unable to tease apart whether deficit arise from impairment of the model-based controller itself, a dysfunctional metacontroller, or misrepresentation of the costs and benefits of choices. Future experimental and modeling work will be needed to tease apart these possibilities. Second, some constructs were sparsely sampled at the higher ranges, and given the current sample, we cannot make claims about whether incentives lead to similar boosts in model-based control among those at the highest end of severity (i.e., formally diagnosed and severe psychiatric patients). However, many of the participants here reported diagnoses and intensive psychiatric treatment (i.e. inpatient or residential care; Table 1). Nevertheless, our data show the incentive effect is largely uniform across the observed range for most constructs (Supplement S1, S2). Third, based on our questions of interest, we did not test whether certain self-report measures are associated with model-based control while simultaneously accounting for the effect of the other self-report measures, though we did take this approach with the factors. Future research could examine which constructs have unique relationships with the effects of incentives on model-based control.
Fourth, model-based control depends on a domain-general cognitive mechanisms, such as working memory (6, 49). If a particular symptom is associated with a deficit in working memory, then this will manifest as reduced model-based control, but it will also manifest as a deficit in many other tasks. Future studies would have to measure (in the same individual) multiple tasks with overlapping cognitive demands to understand whether this is a major confound when examining model-based control and clinical constructs.
Finally, one might be concerned that the task used here is “degenerate” as a sequential decision task, since there is only a single choice that results in deterministic transitions. Understanding whether our results generalize to the more complex sequential decision tasks characteristic of real-world environments remains an important unanswered question.
Conclusions
In this study, we found evidence for a cost-benefit metacontrol process that boosts model-based control under high incentive conditions regardless of symptom or trait severity for many psychiatric constructs. Exceptions included sensation seeking and the anxious depression factor, for which higher construct severity was associated with larger boosts in model-based control for high relative to low incentive conditions. Alcohol use and social anxiety showed no relationship, while social withdrawal showed a positive relationship. The psychiatric factors revealed three separate relationships to model-based control, with the anxious depression factor showing an interaction with incentives, thereby demonstrating the advantage of using psychiatric dimensions to detect specific relationships between self-report measures and computational constructs. Future work may seek to test the efficacy of using incentives to boost model-based control in patient populations and the relationship between these effects and treatment outcomes.
Supplementary Material
Table 3.
Stakes: | High | Low | High-Low | |
---|---|---|---|---|
|
||||
Factor | Neg | Neg | Neg | |
Anxious Depression | 42% | 99% | 4% | |
Compulsive-Intrusive | 100% | 100% | 60% | |
Social Withdrawal | 7% | 0% | 88% |
Acknowledgments
This project was partially supported by a training grant from the NIH Blueprint for Neuroscience Research (T90DA022759/R90DA023427). Its’ contents are solely the responsibility of the authors and do not necessarily represent the official views of the NIH. We would like to thank Catherine Hartley for sharing her stimuli, Michal Kosinksi for use of the IQ test, Drs. Bruce Rosen and Maria Mody for their work with the AMNTP, Ista Zahn for his helpful feedback during the revision process, and the members of the Computational Cognitive Neuroscience Laboratory for their support and feedback.
Footnotes
Financial Disclosures
The authors report no biomedical financial interests or potential conflicts of interest.
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- 1.Everitt BJ, Robbins TW. Neural systems of reinforcement for drug addiction: from actions to habits to compulsion. Nat Neurosci. 2005;8:1481–1489. doi: 10.1038/nn1579. [DOI] [PubMed] [Google Scholar]
- 2.Daw ND, Niv Y, Dayan P. Uncertainty-based competition between prefrontal and dorsolateral striatal systems for behavioral control. Nat Neurosci. 2005;8:1704–11. doi: 10.1038/nn1560. [DOI] [PubMed] [Google Scholar]
- 3.Dolan RJ, Dayan P. Goals and habits in the brain. Neuron. 2013;80:312–25. doi: 10.1016/j.neuron.2013.09.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Kool W, Cushman FA, Gershman SJ. Competition and Cooperation Between Multiple Reinforcement Learning Systems. In: Morris RW, Bornstein AM, Shenhav A, editors. Understanding Goal-Directed Decision Making: Computations and Circuits. New York: Elsevier; 2017. [Google Scholar]
- 5.Gershman SJ, Markman AB, Otto aR. Retrospective revaluation in sequential decision making: a tale of two systems. J Exp Psychol Gen. 2014;143:182–94. doi: 10.1037/a0030844. [DOI] [PubMed] [Google Scholar]
- 6.Otto AR, Gershman SJ, Markman AB, Daw ND. The Curse of Planning: Dissecting Multiple Reinforcement-Learning Systems by Taxing the Central Executive. Psychol Sci. 2013;24:751–761. doi: 10.1177/0956797612463080. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Kool W, Gershman SJ, Cushman F. Cost-benefit arbitration between multiple reinforcement learning systems. Psychol Sci. 2017;28:1321–1333. doi: 10.1177/0956797617708288. [DOI] [PubMed] [Google Scholar]
- 8.Wan Lee S, Shimojo S, O’Doherty JP. Neural Computations Underlying Arbitration between Model-Based and Model-free Learning. Neuron. 2014;81:687–699. doi: 10.1016/j.neuron.2013.11.028. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Culbreth AJ, Westbrook A, Daw ND, Botvinick M, Barch DM. Reduced model-based decision-making in schizophrenia. J Abnorm Psychol. 2016;125:777–787. doi: 10.1037/abn0000164. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Voon V, Derbyshire K, Rück C, Irvine MA, Worbe Y, Enander J, et al. Disorders of compulsivity: a common bias towards learning habits. Mol Psychiatry. 2015;20:345–352. doi: 10.1038/mp.2014.44. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Sebold M, Deserno L, Nebe S, Schad DJ, Garbusow M, Hägele C, et al. Model-based and model-free decisions in alcohol dependence. Neuropsychobiology. 2014;70:122–131. doi: 10.1159/000362840. [DOI] [PubMed] [Google Scholar]
- 12.Sebold M, Nebe S, Garbusow M, Guggenmos M, Schad DJ, Beck A, et al. When Habits Are Dangerous: Alcohol Expectancies and Habitual Decision Making Predict Relapse in Alcohol Dependence. Biol Psychiatry. 2017;82:847–856. doi: 10.1016/j.biopsych.2017.04.019. [DOI] [PubMed] [Google Scholar]
- 13.Nebe S, Kroemer NB, Schad DJ, Bernhardt N, Sebold M, Müller DK, et al. No association of goal-directed and habitual control with alcohol consumption in young adults. Addict Biol. 2017;23:379–393. doi: 10.1111/adb.12490. [DOI] [PubMed] [Google Scholar]
- 14.Gillan CM, Kosinski M, Whelan R, Phelps EA, Daw ND. Characterizing a psychiatric symptom dimension related to deficits in goal-directed control. Elife. 2016;5:1–24. doi: 10.7554/eLife.11305. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Cuthbert BN, Insel TR. Toward the future of psychiatric diagnosis: the seven pillars of RDoC. BMC Med. 2013;11:126. doi: 10.1186/1741-7015-11-126. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Insel T, Cuthbert B, Garvie M, Heinssen R, Pine DS, Quinn K, et al. Research Domain Criteria (RDoC): Toward a New Classification Framework for Research on Mental Disorders. Am J Psychiatry. 2010;167:748–751. doi: 10.1176/appi.ajp.2010.09091379. [DOI] [PubMed] [Google Scholar]
- 17.Clark LA, Cuthbert B, Lewis-Fernández R, Narrow WE, Reed GM. Three Approaches to Understanding and Classifying Mental Disorder: ICD-11, DSM-5, and the National Institute of Mental Health’s Research Domain Criteria (RDoC) Psychol Sci Public Interes. 2017;18:72–145. doi: 10.1177/1529100617727266. [DOI] [PubMed] [Google Scholar]
- 18.Kool W, Cushman FA, Gershman SJ. When Does Model-Based Control Pay Off? PLOS Comput Biol. 2016;12:1–34. doi: 10.1371/journal.pcbi.1005090. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Boureau YL, Sokol-Hessner P, Daw ND. Deciding How To Decide: Self-Control and Meta-Decision Making. Trends Cogn Sci. 2015;19:700–710. doi: 10.1016/j.tics.2015.08.013. [DOI] [PubMed] [Google Scholar]
- 20.FitzGerald THB, Dolan RJ, Friston KJ. Model averaging, optimal inference, and habit formation. Front Hum Neurosci. 2014;8:1–11. doi: 10.3389/fnhum.2014.00457. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Marin R, Biedrzycki R, Firinciogullari S. Reliability and validity of the apathy evaluation scale. Psychiatry Res. 1991;38:143–162. doi: 10.1016/0165-1781(91)90040-v. [DOI] [PubMed] [Google Scholar]
- 22.Spielberger CD, Gorsuch RL, Lushene PR, Vagg PR, Jacobs AG. Manual for the State-Trait Anxiety Inventory. Consulting Psychologists Press; 1983. [Google Scholar]
- 23.Saunders JB, Aasland OG, Babor TF, De La Fuente JR, Grant M. Development of the Alcohol Use Disorders Identification Test (AUDIT): WHO Collaborative Project on Early Detection of Persons with Harmful Alcohol Consumption. II. Addiction. 1993;88:791–804. doi: 10.1111/j.1360-0443.1993.tb02093.x. [DOI] [PubMed] [Google Scholar]
- 24.Zung W, Durham N. A Self-Rating Depression Scale. Arch Gen Psychiatry. 1965;12:63–70. doi: 10.1001/archpsyc.1965.01720310065008. [DOI] [PubMed] [Google Scholar]
- 25.Mason O, Linney Y, Claridge G. Short scales for measuring schizotypy. Schizophr Res. 2005;78:293–296. doi: 10.1016/j.schres.2005.06.020. [DOI] [PubMed] [Google Scholar]
- 26.Foa EB, Huppert JD, Leiberg S, Langner R, Kichic R, Hajcak G, Salkovskis PM. The obsessive-compulsive inventory: Development and validation of a short version. Psychol Assess. 2002;14:485–496. [PubMed] [Google Scholar]
- 27.Liebowitz MR. Social phobia. Mod Probl Pharmacopsychiatry. 1987;22:141–173. doi: 10.1159/000414022. [DOI] [PubMed] [Google Scholar]
- 28.Garner D, Olmsted MP, Bohr Y, Garfinkel P. The eating attitudes test: psychometric features and clinical correlates. Psychol Med. 1982;12:871–878. doi: 10.1017/s0033291700049163. [DOI] [PubMed] [Google Scholar]
- 29.Carleton RN, Norton MAPJ, Asmundson JG. Fearing the unknown: A short version of the Intolerance of Uncertainty Scale. J Anxiety Disord. 2007;21:105–117. doi: 10.1016/j.janxdis.2006.03.014. [DOI] [PubMed] [Google Scholar]
- 30.Taylor S, Zvolensky MJ, Cox BJ, Deacon B, Heimberg RG, Ledley DR, et al. Robust dimensions of anxiety sensitivity: Development and initial validation of the Anxiety Sensitivity Index-3. Psychol Assess. 2007;19:176–188. doi: 10.1037/1040-3590.19.2.176. [DOI] [PubMed] [Google Scholar]
- 31.Nolen-Hoeksema S, Morrow J. A prospective study of depression and posttraumatic stress symptoms after a natural disaster: The 1989 Loma Prieta earthquake. J Pers Soc Psychol. 1991;61:115–121. doi: 10.1037//0022-3514.61.1.115. [DOI] [PubMed] [Google Scholar]
- 32.Gratz KL, Roemer L. Multidimensional assessment of emotion regulation and dysregulation. J Psychopathol Behav Assess. 2004;26:41–54. [Google Scholar]
- 33.Simons JS, Gaher RM. The distress tolerance scale: Development and validation of a self-report measure. Motiv Emot. 2005;29:83–102. [Google Scholar]
- 34.Patton JH, Stanford MS, Barratt ES. Factor structure of the Barratt impulsiveness scale. J Clin Psychol. 1995;51:768–74. doi: 10.1002/1097-4679(199511)51:6<768::aid-jclp2270510607>3.0.co;2-1. [DOI] [PubMed] [Google Scholar]
- 35.Lynam DR, Smith GT, Cyders MA, Fischer S, Whiteside SP. The UPPS-P questionnaire measure of five dispositions to rash action (unpublished technical report) Purdue University; West Lafayette, IN: 2007. [Google Scholar]
- 36.Cyders MA, Smith GT, Spillane NS, Fischer S, Annus AM, Peterson C. Integration of impulsivity and positive mood to predict risky behavior: Development and validation of a measure of positive urgency. Psychol Assess. 2007;19:107–118. doi: 10.1037/1040-3590.19.1.107. [DOI] [PubMed] [Google Scholar]
- 37.Whiteside SP, Lynam DR. The five factor model and impulsivity: Using a structural model of personality to understand impulsivity. Pers Individ Dif. 2001;30:669–689. [Google Scholar]
- 38.Sutton RS, Barto AG, Book AB. Reinforcement learning: An Introduction. Cambridge, MA: MIT Press; 1998. [Google Scholar]
- 39.Daw ND, Gershman SJ, Seymour B, Dayan P, Dolan RJ. Model-based influences on humans’ choices and striatal prediction errors. Neuron. 2011;69:1204–1215. doi: 10.1016/j.neuron.2011.02.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Baldwin SA, Larson MJ. An introduction to using Bayesian linear regression with clinical data. Behav Res Ther. 2016;98:58–75. doi: 10.1016/j.brat.2016.12.016. [DOI] [PubMed] [Google Scholar]
- 41.Gelman A, Carlin JB, Stern HS, Dunson DB, Vehtari A, Rubin DB. Bayesian Data Analysis, Third Edition. Chapman Hall/CRC Texts Statistical Science. Boca Raton: CRC Press; 2013. [Google Scholar]
- 42.Burkner PC. brms: An R Package for Bayesian Multilevel Models using Stan. J Stat Softw. 2013;80:1–28. [Google Scholar]
- 43.Davis DR, Kurti AN, Skelly JM, Redner R, White TJ, Higgins ST. A review of the literature on contingency management in the treatment of substance use disorders, 2009–2014. Prev Med. 2016;92:36–46. doi: 10.1016/j.ypmed.2016.08.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Mantzari E, Vogt F, Shemilt I, Wei Y, Higgins JPT, Marteau TM. Personal financial incentives for changing habitual health-related behaviors: A systematic review and meta-analysis. Prev Med. 2015;75:75–85. doi: 10.1016/j.ypmed.2015.03.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Ma I, van Duijvenvoorde A, Scheres A. The interaction between reinforcement and inhibitory control in ADHD: A review and research guidelines. Clin Psychol Rev. 2016;44:94–111. doi: 10.1016/j.cpr.2016.01.001. [DOI] [PubMed] [Google Scholar]
- 46.Rochat L, Billieux J, Gagnon J, Van der Linden M. A multifactorial and integrative approach to impulsivity in neuropsychology: insights from the UPPS model of impulsivity. J Clin Exp Neuropsychol. 2018;40:45–61. doi: 10.1080/13803395.2017.1313393. [DOI] [PubMed] [Google Scholar]
- 47.Caspi A, Moffitt TE. All for One and One for All: Mental Disorders in One Dimension. Am J Psychiatry. 2018:1–14. doi: 10.1176/appi.ajp.2018.17121383. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Caspi A, Houts RM, Belsky DW, Goldman-Mellor SJ, Harrington H, Israel S, et al. The p Factor: One General Psychopathology Factor in the Structure of Psychiatric Disorders? Clin Psychol Sci. 2014;2:119–137. doi: 10.1177/2167702613497473. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Otto AR, Raio CM, Chiang A, Phelps EA, Daw ND. Working-memory capacity protects model-based learning from stress. Proc Natl Acad Sci U S A. 2013;110:20941–20946. doi: 10.1073/pnas.1312011110. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.