Abstract
Delay discounting paradigms have gained widespread popularity across clinical research. Given this prevalence, researchers have set lofty expectations for the importance of delay discounting as a key transdiagnostic process and a ‘core’ process underlying specific domains of dysfunction (e.g. addiction). We believe delay discounting has been prematurely reified as, in and of itself, a core process underlying psychological dysfunction, despite significant concerns with the construct validity of discounting rates. Specifically, high delay discounting rates are only modestly related to measures of psychological dysfunction and therefore are not ‘core’ to these more complex behavioral problems. Furthermore, discounting rates do not appear to be specifically related to any disorder(s) or dimension(s) of psychopathology. This raises fundamental concerns about the utility of discounting, if the measure is only loosely associated with most forms of psychopathology. This stands in striking contrast to claims that discounting can serve as a ‘marker’ for specific disorders, even though discounting rates have never, to our knowledge, demonstrated adequate sensitivity or specificity for any disorder. Finally, empirical evidence does not support the generalizability of discounting rates to other decisions made either in the lab or in the real world, and therefore discounting rates cannot and should not serve as a summary measure of an individual's decision-making patterns. We provide recommendations for improving future delay discounting research, but also strongly encourage researchers to consider whether the empirical evidence supports the field's hyper-focus on discounting.
Key words: Alcohol, behavioral economics, construct validity, decision-making, delay discounting, impulsivity, RDoC, substance use, transdiagnostic
Introduction
Delay discounting is a staple for examining intertemporal choice (ITC) in clinical research. In fact, a Google Scholar search for ‘delay discounting’ gives hundreds of results in the past 5 years alone. Delay discounting rates (of rewards) are intended to measure the extent to which a future reward (or incentive) is reduced in value relative to an immediate reward as a function of the temporal delay of the future reward. Delay discounting paradigms have enjoyed widespread popularity in the field. For example, there are meta-analyses examining the association between performance on delay discounting tasks in healthy controls compared to those with a range of clinical disorders, such as addictive disorders (MacKillop et al., 2011), attention deficit hyperactivity disorder (Jackson & MacKillop, 2016), and other disorders including depression, disordered eating, and psychotic disorders (Amlung et al., 2019). Steeper delay discounting rates have been associated with so many different disorders that delay discounting has increasingly been discussed as a possible transdiagnostic process underlying a variety of common mental health problems (Amlung et al., 2019; Bickel et al., 2019; Finn, Gunn, & Gerst, 2015; Lempert, Steinglass, Pinto, Kable, & Simpson, 2019). Although delay discounting is discussed as a potential key transdiagnostic process in psychopathology, we believe, given the available research, it remains difficult to even describe what process these rates capture and how central it might be in psychopathology.
The premise of this paper is that the large body of current, as well as future, research on the relationships between decision-making, ITC, and psychopathology will have more value if there is a greater understanding of the significant problems and limitations in delay discounting research up to this point. We posit that there has been a premature theoretical acceptance of delay discounting as, in and of itself, a core process underlying psychological dysfunction. Furthermore, we believe there is a growing disconnect between the empirical evidence of the utility of delay discounting in clinical science and both the incredible popularity of the task and the lofty goals for its usage in clinical research.
For example, researchers continue to promote the importance and centrality of delay discounting in clinical disorders, including labeling discounting as a core trans-disease (Bickel & Mueller, 2009; Bickel, Jarmolowicz, Mueller, Koffarnus, & Gatchalian, 2012) and/or transdiagnostic (Amlung et al., 2019) process, or claiming that delay discounting would fulfill the promises of the Research Domain Criteria (RDoC; Insel et al., 2010) initiative (Lempert et al., 2019). We certainly applaud research that aims to study processes across multiple disorders and embraces the dimensional approaches championed by the RDoC. However, our primary concern is that delay discounting, and subsequently the discounting rates obtained from the tasks, have been conflated with the actual underlying latent construct of interest (i.e. impulsive choice). Delay discounting is at best a candidate paradigm at one level of analysis to examine some, but certainly not all, processes that influence ITC patterns (a view shared by Dai & Busemeyer, 2014; Read, Frederick, & Scholten, 2013). In stark contrast, ITC, in our view, is a broad label to describe the complex and multifaceted processes that contribute to how individuals make real-world decisions related to maximizing benefits over time. Although we certainly concede it is unreasonable to expect any measure to capture ITC processes entirely, we do believe it is vital to stringently examine whether a popular measure provides enough information generalizable to the actual construct of interest. This is not merely a semantic argument; this premature reification has led to a drastic hyper-focus on a particular task that has yet, despite its popularity, to show substantial utility in clinical science. This paper will describe three key issues:
Discounting research has not provided adequate evidence of convergent validity to provide confidence in how to characterize discounting rates using other validated constructs.
Discounting rates also have not shown evidence of divergent validity when examining the association between discounting rates and other well-validated psychological measures, which presents another fundamental theoretical concern for how to interpret these rates.
The generalizability of delay discounting rates to other types of decisions, laboratory or real-world, is extremely limited. Therefore, discounting rates should not be considered a generalizable summary of an individual's decision-making or ITC patterns.
Convergent validity concerns
Despite hundreds of studies, discounting rates are poorly understood in terms of basic convergent validity with well-validated psychological measures. For example, delay discounting tasks have enjoyed widespread use in the study of addictive behaviors with meta-analyses finding that groups with addictive behaviors tend to discount at higher rates than healthy controls (MacKillop et al., 2011) and that discounting rates are related to continuous measures of addiction severity (Amlung, Vedelago, Acker, Balodis, & MacKillop, 2017). Differences in delay discounting rates are hypothesized to reflect variations in self-control, where higher discounting rates are thought to reflect deficits in self-control (or impulsivity) that lead individuals to choose smaller immediate options (e.g. intoxication) over long-term larger rewards (e.g. gainful employment). Although this explanation certainly has face validity in its relationship to substance use pathology, the empirical findings have struggled to support this interpretation. Discounting rates are only modestly related to addiction severity based on meta-analysis (r = 0.14; Amlung et al., 2017), which must call into question how ‘core’ this process can be if it accounts for ~2% of the variance in symptom severity. Moreover, discounting rates are largely uncorrelated with other measures of impulsivity, which calls into question the hypothesized relationship between discounting and addiction (Amlung et al., 2017; Kvam, Romeu, Turner, Vassileva, & Busemeyer, 2021; MacKillop et al., 2016; Sharma, Markon, & Clark, 2014). Delay discounting rates are not synonymous with impulsive decision-making, although they are sometimes used that way in the literature. In fact, it does not appear that the constructs are even closely related. Rather, impulsivity and poor self-control in the context of decision-making reflect numerous processes, which clearly are not captured by delay discounting tasks.
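The ‘~2% of the variance’ figure above follows directly from squaring the meta-analytic correlation. As a minimal sketch of that arithmetic (the function name is ours and purely illustrative):

```python
def variance_explained(r: float) -> float:
    """Proportion of variance shared by two variables with Pearson correlation r."""
    return r ** 2


# Meta-analytic correlation between discounting and addiction severity
# reported in the text (Amlung et al., 2017).
r_severity = 0.14

share = variance_explained(r_severity)
print(f"r = {r_severity:.2f} corresponds to {share:.1%} of the variance")
```

Put differently, on this estimate roughly 98% of the variability in symptom severity is unrelated to discounting rates.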
Furthermore, discounting rates, to our knowledge, have not shown strong and replicable associations with any relevant psychological phenomena to provide a compelling explanation of what these rates characterize. For example, a recent large sample study found that discounting rates were uncorrelated or only modestly correlated (r values <0.20) with all tested cognitive abilities and personality measures, and that these correlations became even lower when controlling for income and education (Yeh, Myerson, & Green, 2020). This is consistent with previous literature that has shown only modest associations between discounting rates and measures of executive function (Bobova, Finn, Rickert, & Lucas, 2009; Weatherly & Ferraro, 2011) and personality (Bobova et al., 2009; Hirsh, Morisano, & Peterson, 2008). We believe Yeh et al. (2020) provided a very well thought-out and insightful study; however, we take issue with some of the broader conclusions given the presented results, specifically:
The current findings suggest that steep discounting, a behavior strongly related to behavioral problems, is not simply an indicator of generally poor cognitive functioning or a measure of impulsiveness in healthy young adults as assessed by personality tests, but is an important individual difference characteristic in its own right. (p. 8)
As previously stated, we do not believe the evidence supports the strength of a relationship between discounting and behavior problems; we believe it is more accurate to say there is simply a modest reliable association. Furthermore, and most importantly, we are unsure what makes discounting an ‘important’ individual difference until the measure demonstrates its importance above and beyond existing measures (incremental validity). We agree it is positive that discounting is not simply a redundant measure of a construct with already well-established measures (e.g. general intelligence). However, we believe discounting is so poorly characterized that it is essentially impossible to even describe what discounting rates mean in terms of well-established constructs, given its poor relationship to other impulsive decision tasks, impulsive personality, and executive functioning measures. We do not want to overstate our case and claim that the signal being detected through discounting tasks is useless; however, we believe researchers must be aware of how little we know about what performance on this task means theoretically. Moreover, the burden of proof must be on the researchers who claim the centrality and usefulness of discounting to provide concrete and empirical examples of its utility.
In the same vein, although delay discounting rates have been shown to be significantly influenced by experimental manipulations (Read et al., 2013; Wilson & Daly, 2004), the processes responsible for these changes are unknown. Rung and Madden (2018) provided a review and meta-analysis of 92 published studies that examined methods to reduce discounting rates and reported that although many techniques succeed in reducing discounting rates (with substantial variability), there is no clear picture of how these manipulations influence discounting rates, or whether these changes coincide with reductions in impulsive decision-making more broadly. Importantly, research has demonstrated that discounting rates can be effectively influenced by a plethora of superficial task characteristics (Read et al., 2013) and therefore any observed changes in discounting rates must be closely examined. Therefore, although task manipulations can be valuable to probe a task to gain a better understanding of the underlying processes, after nearly 100 studies about reducing discounting rates, we still have not gained much general knowledge about how to characterize the signal being picked up through the task. Taken together, discounting rates stand on shaky theoretical ground, and subsequently, studies that attempt to manipulate discounting rates have struggled to illuminate the processes captured by the task.
In summary, we believe, given the enormous volume of discounting data, we know discouragingly little about the processes underlying the task or even how to characterize the rates in terms of validated constructs. If modest correlations are somewhat expected between laboratory tasks and complex behaviors (e.g. real-world substance use), then our theories must match this theoretical complexity. Discounting cannot be both too ‘basic’ an assessment to be associated strongly with the measures of complex behaviors (i.e. substance use), but also be a ‘core’ process underlying multiple disorders. Furthermore, we cannot let the simplicity and face validity of the task distract us from rigorously testing the task. For example, perhaps discounting taps into a certain basic cognitive process that serves as an underlying risk factor for impulsive ITCs and then consequently substance use risk. Then research should aim to find a measure, or more likely measures, that illuminate impulsive ITC patterns more broadly. This would return focus to the actual construct of interest (i.e. generalizable processes in impulsive choice) that serve as more direct risk factors for psychopathology. In this vein, we agree with recommendations in Sharma et al. (2014) that researchers should aim to connect their laboratory studies as much as possible to real-world decisions and behaviors. We cannot simply infer a face-valid cascade from a very basic assessment to complex behaviors.
Divergent validity concerns
Despite its face-valid, hypothesized connection with problematic substance use, research has demonstrated that steeper discounting of rewards compared to controls is associated with depression, bipolar disorder, schizophrenia, borderline personality disorder, bulimia nervosa, binge-eating disorder (Amlung et al., 2019), and lower intelligence (Bailey, Gerst, & Finn, 2020; Shamosh & Gray, 2008). Notably, effect sizes are comparable when contrasting controls to clinical populations, although effects appear slightly larger in more severe clinical populations such as those with psychotic-spectrum disorders or illicit substance use disorders (Amlung et al., 2019; MacKillop et al., 2011). This lack of divergent validity is cause for significant concern for interpreting these abundant group differences. Although self-control deficits are a common interpretation for the relationship between steeper delay discounting rates and externalizing behavior, this interpretation seems unlikely to apply to all, or even most, disorders associated with high discounting rates (e.g. depression). To be clear, it is plausible that disparate pathological processes could result in steeper discounting rates in different disorders (i.e. ‘equifinality’; Cicchetti & Rogosch, 1996). However, this is an empirical question that requires more research into the different processes, factors, and mechanisms that contribute to variations in delay discounting rates across clinical samples (Story, Moutoussis, & Dolan, 2016). Until theories about specific mechanisms are formally tested, researchers should be wary of untested, usually ad-hoc explanations of the observed group differences.
Perhaps steeper discounting rates are simply associated with the general psychopathology factor (Caspi et al., 2014) and are an underlying risk factor for most psychological disorders. As reviewed above, discounting rates appear to have a mostly nonspecific relationship to overall psychological severity. This drastically changes the interpretations provided in the literature, which tend to have diagnosis- or dimension-specific explanations with almost no empirical backing. This lack of divergent validity leads to possibly sobering questions about the utility of delay discounting rates. For example, if assessed in a group of individuals with unclear diagnostic status, delay discounting rates would be essentially useless in predicting diagnostic status [e.g. alcohol use disorder (AUD) v. depression]. This stands in striking contrast to claims that discounting can and does serve as a ‘biomarker’ (Kwako, Bickel, & Goldman, 2018) or ‘behavioral marker’ (Athamneh et al., 2020; Bickel et al., 2012; Bickel, Koffarnus, Moody, & Wilson, 2014; Turner, Athamneh, Basso, & Bickel, 2021), given that it wholly fails to be either adequately sensitive or specific to any psychological phenomenon to warrant such status. However, in a highly cited review, Bickel et al. (2014) come to drastically different conclusions, stating: ‘Our review suggests that temporal discounting (1) identifies individuals who are drug-dependent, (2) identifies those at risk of developing drug dependence, (3) acts as a gauge of addiction severity, (4) correlates with all stages of addiction development…’ (abstract). We agree discounting is modestly associated with many aspects of addiction; however, this in no way indicates that discounting can reliably identify any clinical population.
Furthermore, commonly cited studies that make such strong claims of the utility of discounting rates to predict future substance use only reported modest to very modest associations (Audrain-McGovern et al., 2009; Fernie et al., 2013; Khurana et al., 2013). Importantly, for discounting rates to be valuable in terms of identification of clinical populations, they would need to show incremental validity over already existing measures. Framed this way, it should be obvious that one would never choose to screen participants or patients for AUD using a discounting task instead of, for example, the Alcohol Use Disorder Identification Test (AUDIT), a brief, freely available, self-report measure, which across studies has shown a median sensitivity of 0.86 and specificity of 0.89 for identifying AUD (Reinert & Allen, 2002). We certainly understand the tremendous value of laboratory tasks to provide information that self-report measures cannot; however, it is important to be realistic about the utility of each in different situations. In summary, modest associations with criteria of interest (e.g. addiction severity) do not qualify as strong evidence for the importance of that measure. Discounting rates must demonstrate that they are highly predictive of criteria of interest or that they outperform existing measures to have substantial predictive value. Moving forward, we believe the field must be much more stringent in examining claims of the usefulness of discounting rates in the face of mounting evidence to the contrary.
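To make concrete what a ‘marker’ would have to deliver, the sketch below converts the AUDIT's median sensitivity and specificity quoted above into positive and negative predictive values using Bayes' rule. The prevalence values are hypothetical and chosen only for illustration; they do not come from Reinert and Allen (2002) or any other cited study.

```python
def predictive_values(sensitivity: float, specificity: float, prevalence: float):
    """Return (PPV, NPV) for a screening test at a given base rate."""
    true_pos = sensitivity * prevalence
    false_neg = (1.0 - sensitivity) * prevalence
    true_neg = specificity * (1.0 - prevalence)
    false_pos = (1.0 - specificity) * (1.0 - prevalence)
    ppv = true_pos / (true_pos + false_pos)   # P(disorder | positive screen)
    npv = true_neg / (true_neg + false_neg)   # P(no disorder | negative screen)
    return ppv, npv


# Median AUDIT values quoted in the text (Reinert & Allen, 2002).
AUDIT_SENSITIVITY = 0.86
AUDIT_SPECIFICITY = 0.89

# Hypothetical base rates of AUD, for illustration only.
for prevalence in (0.05, 0.10, 0.30):
    ppv, npv = predictive_values(AUDIT_SENSITIVITY, AUDIT_SPECIFICITY, prevalence)
    print(f"prevalence={prevalence:.2f}  PPV={ppv:.2f}  NPV={npv:.2f}")
```

Any claim that discounting rates ‘identify’ a clinical population would need to be evaluated with the same quantities, at a stated cut-off and base rate, and compared against figures like these.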
We believe the delay discounting literature has failed to adequately examine delay discounting rates from a classic construct validity standpoint (Cronbach & Meehl, 1955). Despite face-valid explanations for discounting rates and the observed group differences in clinical populations, the empirical data are simply not there to provide confidence in these explanations. We again concur with Sharma et al. (2014) in stressing that researchers apply the same psychometric and construct validity considerations to behavioral tasks as they do self-report measures. Face-valid behavioral tasks should not be exempt from empirically demonstrating construct validity.
The generalizability (or lack thereof) of discounting rates
Evidence for claims of delay discounting serving as a generalizable measure of ITC is scant. Although researchers have demonstrated discounting rates of rewards are relatively stable over time (Odum, 2011; Ohmura, Takahashi, Kitamura, & Wehr, 2006), the empirical evidence does not suggest that discounting rates are highly informative about other decisions. Research has shown that a discounting rate from a certain discounting task is not highly informative even of performance on other discounting tasks. Weatherly and colleagues (Weatherly & Terrell, 2010; Weatherly, Terrell, & Derenne, 2010) performed exploratory and confirmatory factor analyses to show that discounting rates across five commodities are not best explained by a single discounting factor, a result congruent with the modeling results in Kvam et al. (2021). Furthermore, there is evidence that discounting rates can be heavily influenced by experimental manipulations (Rung & Madden, 2018) and task framing (Read et al., 2013). Therefore, it is possible that individuals have a trait-like baseline discounting rate that can be influenced by manipulations/circumstances (Peters, Miedl, & Büchel, 2012); however, it is unclear how one would identify this baseline or whether this baseline value has significant predictive value. Most importantly, this means that even within the rather limited scope of delay discounting of rewards, a single discounting rate provides only modest information about performance on very similar tasks. Therefore, delay discounting cannot serve as a summary measure for general ITC or decision-making patterns, which includes discounting across and between different commodities (Story et al., 2016), contexts, probabilistic assessment, and discounting of losses (Bailey, Gerst, & Finn, 2018), among other processes. 
We believe the generalizability and value of discounting rates have been drastically overstated given their inability to robustly predict other decisions made either in the lab or in real life.
Finally, despite limited evidence on the generalizability of discounting rates, some researchers have called for interventions to reduce steepness of delay discounting rates as a prevention or intervention for those at risk for addiction (e.g. Bickel et al., 2017; Gray & MacKillop, 2015; Mahalingam, Stillwell, Kosinski, Rust, & Kogan, 2014; Volkow & Baler, 2015), whereas other researchers have used decreased discounting rates as the primary outcome measure in an intervention study (e.g. working memory training; Bickel, Yi, Landes, Hill, & Baxter, 2011). In clinical disorders, the problem is that impulsive choices increase the likelihood of maladaptive behavior (like problematic substance use), or behavior that does not optimize outcomes (e.g. low achievement), not that individuals have higher rates on a delay discounting task. We hope we have made the case that these admirable endeavors are overly focused on the singular task at the expense of the more important construct(s). Designing interventions to address performance on a single task is similar to instructors teaching the skills of a standardized test at the expense of the knowledge base the test was meant to assess. In this case, the assessment (discounting rates) is not even robustly related to the criteria of interest (real-world behavior or symptomology) and therefore, in our estimation, does not appear to be a logical target of intervention.
Future directions and recommendations
Given the fundamental issues with delay discounting, we believe there is a clear need to improve and innovate our research programs related to ITC processes in clinical populations. We currently have hundreds of delay discounting studies in clinical science and seemingly little generalizable knowledge beyond a disorganized collection of unexplained group differences. We will conclude with a brief description of some suggestions for improvement in the field. We will discuss:
Improving the measurement of discounting rates;
Suggestions to improve our understanding of discounting rates through mechanism-focused research; and
Innovating new paradigms to assess processes related to ITC beyond discounting rates.
Importantly, these suggestions will encompass only a small set of possible improvements.
Improving measurement of discounting
Although our primary concerns with delay discounting practices are theoretical, improving the measurement of delay discounting rates may provide a fruitful avenue to improve our understanding of the task. Although a specific review and explanation of measurement concerns is beyond the scope of the current paper, we have several broad recommendations. First, given the significant concerns raised in the current paper, we have strong reservations about attempts to shorten existing discounting measures (e.g. Koffarnus & Bickel, 2014). Given that discounting rates are poorly understood in essentially all aspects of construct validity and show modest associations at best with external criteria, we do not understand why researchers would embrace a less reliable version of the task. For example, Koffarnus and Bickel (2014) reported a correlation of 0.67 between their five-trial adjusting discounting task (i.e. short-form measure) and a longer adjusting amount discounting task (i.e. original long-form). This means the short-form of the measure shares only about 45% of its variance with the original form, which based on meta-analysis is itself only expected to correlate with most criteria of interest at r < 0.20. We believe this could decrease reproducibility and increase spurious findings that will not help the field wrestle with the challenges reviewed in the current paper. These concepts are discussed in detail by Smith, McCarthy, and Anderson (2000) in relation to self-report measures, specifically the dangers of developing a short-form of a measure that itself is insufficiently validated.
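The arithmetic behind this concern can be sketched as follows. The 0.67 correlation is the value quoted above; the 0.20 value is the upper end of the ‘r < 0.20’ range. The final step rests on a simplifying assumption of ours, namely that the short form relates to an external criterion only through what it shares with the long form (a zero partial correlation), in which case the two correlations simply multiply.

```python
# Correlation between short-form and long-form tasks (Koffarnus & Bickel, 2014).
r_short_long = 0.67

# Upper end of the typical long-form correlation with external criteria.
r_long_criterion = 0.20

# Proportion of long-form variance captured by the short form.
shared_variance = r_short_long ** 2

# ASSUMPTION (ours): the short form carries no criterion-relevant information
# beyond the long form, so its expected criterion correlation is the product.
implied_r_short_criterion = r_short_long * r_long_criterion

print(f"shared variance: {shared_variance:.2f}")
print(f"implied short-form criterion correlation: {implied_r_short_criterion:.3f}")
print(f"implied criterion variance explained: {implied_r_short_criterion ** 2:.3f}")
```

Under that assumption, an already modest association shrinks further when the shortened task is substituted for the original.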
In fact, we would suggest an opposite course of action. We believe researchers should look to embrace assessment and scoring methods that collect sufficient amounts of data and then model all the collected trial-level data (Dai & Busemeyer, 2014; Dai, Gunn, Gerst, Busemeyer, & Finn, 2016; Kvam et al., 2021; Molloy et al., 2020). This is in contrast to the majority of discounting scoring practices that rely on indifference points for each time-delay (e.g. 1 week, 1 month). Indeed, Kvam et al. (2021) provide code that researchers can use or adapt for their own purposes that implements the ‘direct difference’ model (Dai & Busemeyer, 2014). Interestingly, the ‘direct difference’ model not only models all collected trial-level decisions, but there are also versions that can incorporate decision reaction time to further elucidate decision-making processes such as difficulty of deliberation (Dai & Busemeyer, 2014). If the researcher still wishes to use the standard hyperbolic model, there are estimation procedures that are not solely reliant on indifference points and model all collected data (Molloy et al., 2020; Vincent, 2016). Molloy et al. (2020) also provide usable code for interested researchers.
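To illustrate what ‘modeling all collected trial-level data’ can look like, the sketch below fits a hyperbolic value function, V = A / (1 + kD), with a logistic choice rule by maximum likelihood over every individual choice. This is a generic illustration only: it is not the ‘direct difference’ model, nor the procedures of Molloy et al. (2020) or Vincent (2016). The logistic choice rule, the coarse grid search, the parameter grids, and the simulated choices are all assumptions made for the example.

```python
import math
import random


def p_choose_delayed(k, beta, immediate, delayed, delay):
    """Choice probability under hyperbolic value V = A / (1 + k * D)
    and a logistic choice rule with sensitivity beta (an assumption)."""
    value_delayed = delayed / (1.0 + k * delay)
    return 1.0 / (1.0 + math.exp(-beta * (value_delayed - immediate)))


def neg_log_likelihood(k, beta, trials):
    """Sum of negative log-likelihoods over every observed choice."""
    eps = 1e-9
    total = 0.0
    for immediate, delayed, delay, chose_delayed in trials:
        p = p_choose_delayed(k, beta, immediate, delayed, delay)
        p = min(max(p, eps), 1.0 - eps)  # guard against log(0)
        total -= math.log(p if chose_delayed else 1.0 - p)
    return total


def fit_grid(trials):
    """Coarse grid search; a real analysis would use a proper optimizer
    or a hierarchical Bayesian approach."""
    best = None
    for step in range(-80, 1):            # log10(k) from -4 to 0
        k = 10 ** (step / 20)
        for beta in (0.05, 0.1, 0.2, 0.5, 1.0):
            nll = neg_log_likelihood(k, beta, trials)
            if best is None or nll < best[0]:
                best = (nll, k, beta)
    return best[1], best[2]


def simulate(k, beta, n_trials, seed=0):
    """Generate hypothetical choices from known parameters (not real data)."""
    rng = random.Random(seed)
    trials = []
    for _ in range(n_trials):
        immediate = rng.uniform(5.0, 95.0)
        delay = rng.choice([7, 30, 90, 180, 365])  # days
        p = p_choose_delayed(k, beta, immediate, 100.0, delay)
        trials.append((immediate, 100.0, delay, rng.random() < p))
    return trials


trials = simulate(k=0.01, beta=0.2, n_trials=400)
k_hat, beta_hat = fit_grid(trials)
print(f"estimated k = {k_hat:.4f}, estimated beta = {beta_hat:.2f}")
```

Because every choice contributes to the likelihood, no responses are collapsed into indifference points before fitting, and response noise is estimated alongside the discounting rate rather than ignored.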
We also warn that overly focusing on modeling the data, without regard to the theoretical concerns attached to those data, comes with significant drawbacks, and can even compound the issues discussed thus far. For instance, Johnson and Bickel (2008) recommend that researchers exclude data which do not have sufficiently decreasing indifference points. These criteria were suggested to improve fitting procedures when using the hyperbolic model. However, the danger is that these criteria lead to researchers throwing out data which do not conform to the hyperbolic model, and thus the model is tested only on data that are chosen to conform to it. Smith, Lawyer, and Swift (2018) found that close to a fifth of discounting data is discarded per study using the Johnson and Bickel (2008) criteria. Although these criteria are touted as suggestions, their widespread use in the literature suggests they are closer to conventions. Most importantly, they create a vicious circle in which the hyperbolic model has been lauded as the proper model for discounting, using only evidence that happens to favor the hyperbolic model. In other words, the excuse of having a ‘viable’ model is used to justify unscientific practices in modeling discounting data, specifically attempting to change the phenomena to fit the preferred model. This is especially problematic when studying clinical populations, whose decisions may not seem immediately ‘rational’ and where response ‘abnormality’ is the rule, not the exception. Furthermore, studies have shown that the hyperbolic model is not the most appropriate model for all participants and therefore it is not appropriate to assume all participants' performance must conform to an a priori model (Franck, Koffarnus, House, & Bickel, 2015; Gilroy, Franck, & Hantula, 2017). 
Similarly, Cheng and González-Vallejo (2016) demonstrated that the hyperbolic model may have significant performance concerns when discounting tasks are not presented in the traditional ‘titration’ procedure. Finally, acquiring the ‘correct’ or ‘best’ model for a given task is a goal that is secondary to making sure the task is actually a valid one for the construct at hand; indeed, a model's usefulness is always bounded above by the data's validity. We thus urge researchers to reconsider using the Johnson and Bickel (2008) criteria in the future and instead to return their focus to providing models that elucidate meaningful and generalizable psychological processes.
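For readers unfamiliar with how such exclusion rules operate, the sketch below flags a set of indifference points that does not decrease ‘sufficiently’ across delays. The two checks and their default thresholds (a rise of more than 20% of the larger later reward between adjacent delays, and an overall drop of less than 10% from the first to the last delay) reflect the form in which the Johnson and Bickel (2008) criteria are commonly described; they are treated here as adjustable assumptions and should be verified against the original source. The example response patterns are hypothetical.

```python
def flags_nonsystematic(indifference_points, larger_later,
                        jump_fraction=0.20, drop_fraction=0.10):
    """Return (jump_flag, no_drop_flag) for indifference points ordered
    from the shortest to the longest delay.

    jump_flag:    some point exceeds the preceding one by more than
                  jump_fraction * larger_later.
    no_drop_flag: the last point is not lower than the first by at least
                  drop_fraction * larger_later.
    Thresholds are assumptions; check them against the original criteria.
    """
    pts = list(indifference_points)
    jump_flag = any(
        later - earlier > jump_fraction * larger_later
        for earlier, later in zip(pts, pts[1:])
    )
    no_drop_flag = (pts[0] - pts[-1]) < drop_fraction * larger_later
    return jump_flag, no_drop_flag


# Hypothetical response patterns for a delayed reward of 100 units.
patterns = {
    "monotone decrease": [90, 80, 60, 40, 20],
    "large reversal": [90, 60, 85, 40, 20],
    "nearly flat": [50, 50, 48, 47, 45],
}
for label, points in patterns.items():
    print(label, flags_nonsystematic(points, larger_later=100))
```

Note that the ‘nearly flat’ pattern, which could reflect a genuine and clinically meaningful insensitivity to delay, is exactly the kind of response such rules would discard.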
In summary, given the theoretical concerns described in the current paper, we believe researchers should be actively concerned with improving the quality of their measurement and not embracing practices that could increase measurement concerns.
Improving delay discounting construct validity
Researchers have not provided adequate evidence to properly characterize discounting rates to justify the majority of theoretical explanations. This should lead to an increase in scrutiny over studies providing group differences that are not further explained by empirical analyses. For example, a group difference in discounting between those with and without AUD should not conclude with an ad-hoc explanation of self-control deficits; this should be empirically corroborated with established measures of self-control (see Sharma et al., 2014). Furthermore, discounting must show robust, not just statistically significant, associations to claim strong relationships with constructs of interest. Relatedly, the use of extended task batteries and multivariate approaches would certainly yield a better understanding of how delay discounting performance relates to other established constructs. Snyder, Miyake, and Hankin (2015) provide a useful roadmap for ways of improving construct validity in the assessment of executive functioning, and many of the suggestions are germane to the current discussion. Just like executive functioning, ITC will never be captured by a single task. However, multivariate approaches can assist in elucidating the structure underlying many tasks and related psychological measures. Furthermore, these multivariate approaches can mitigate concerns from the ‘task impurity’ problem (Miyake & Friedman, 2012; Snyder et al., 2015), or the concern that any individual task score contains systematic variance not related to the construct of interest, but related to the task. Multivariate approaches, such as MacKillop et al. (2016) and Weatherly and Terrell (2010), especially when combined with cognitive modeling approaches (Dai & Busemeyer, 2014; Kvam et al., 2021; Molloy et al., 2020), can help researchers elucidate common processes underlying delay discounting and other relevant psychosocial measures.
Relatedly, studies with longitudinal data, especially intervention studies, must provide more rigorous support for interpretations related to delay discounting. For example, Bickel et al. (2011) showed that discounting rates were lowered significantly in stimulant abusers who received a working memory training protocol compared to those who received a control condition. Beyond methodological concerns with this and other working memory studies (see Gunn, Gerst, Wiemers, Redick, & Finn, 2018), a major concern is that many studies provide limited corroborating analyses to contextualize these findings. Specifically, if steeper delay discounting is related to executive functioning and consequently self-control, then it stands to reason that working memory training could improve task performance. However, providing group differences across conditions (i.e. working memory training v. control) does not provide strong evidence of the proposed mechanism. If the above hypothesis about the benefits of working memory training is true, then individuals who benefit the most from working memory training should be the same individuals who show the most improvement in delay discounting tasks. That is, researchers should focus on specifying the degree of change, if any, rather than a simple ‘present–absent’ assessment. Moreover, the same criticism applies to linking changes in delay discounting rates to changes in behavior such as drinking patterns. For example, studies that observe changes in discounting and changes in other types of decisions/behaviors after an intervention (Athamneh, Stein, & Bickel, 2019; Mellis et al., 2018; Snider, LaConte, & Bickel, 2016; Stein et al., 2017) are not adequate evidence to conclude a causal relationship of the kind Stein et al. (2017) suggested: ‘Accumulating laboratory-based evidence indicates that reducing delay discounting (devaluation of delayed outcomes) with the use of episodic future thinking (EFT; mental simulation of future events) improves dietary decision-making and other maladaptive behaviors’ (abstract). We must be more stringent about implying or reporting causal mechanisms that have not been empirically established, as mounting evidence indicates we do not have a strong grasp of the processes underlying discounting tasks or how these processes relate to other decisions.
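The individual-level prediction described above can be examined with a simple change-on-change analysis within the training condition. The following is a minimal sketch, not the analysis used in any cited study: all scores are invented for illustration, the variable names are hypothetical, and discounting is assumed to be summarized as a log-transformed rate, ln(k), measured before and after training.

```python
from math import sqrt


def pearson_r(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 3:
        raise ValueError("need two sequences of equal length (n >= 3)")
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("correlation is undefined for a constant sequence")
    return sxy / sqrt(sxx * syy)


def change_scores(pre, post):
    """Post minus pre for each participant."""
    return [b - a for a, b in zip(pre, post)]


# Invented example data for six participants in the training condition.
wm_pre = [10, 12, 9, 11, 13, 8]      # working memory score before training
wm_post = [14, 13, 12, 16, 14, 9]    # working memory score after training
lnk_pre = [-3.0, -3.5, -2.8, -3.2, -4.0, -2.5]    # ln(k) before training
lnk_post = [-4.1, -3.6, -3.5, -4.6, -4.2, -2.6]   # ln(k) after training

wm_gain = change_scores(wm_pre, wm_post)
lnk_change = change_scores(lnk_pre, lnk_post)

# The proposed mechanism predicts a negative correlation: larger working
# memory gains should accompany larger reductions in ln(k).
r = pearson_r(wm_gain, lnk_change)
print(f"r(WM gain, change in ln k) = {r:.2f}")
```

A group-level difference between conditions could arise with or without such a correlation, which is why it is weak evidence for the mechanism on its own. Raw change scores are used here only for simplicity; they can be unreliable, and approaches such as latent change models may be more appropriate with real data.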
Finally, there remains a muddled picture of how delay discounting relates to dysfunction, despite the abundance of studies. As described above, the lack of diagnostic/dimension specificity of delay discounting findings is of fundamental concern. Observing a relationship across many disorders is not convincing evidence of an important trans-disease process when those relationships are uniformly weak from a statistical perspective and poorly understood from a theory-development perspective. Despite efforts to characterize discounting rates as a transdiagnostic process, there remains minimal evidence as to what the process is and exactly how it unifies the breadth of dysfunctions associated with it. Hierarchical multivariate approaches to modeling and conceptualizing psychological dysfunction, such as the Hierarchical Taxonomy of Psychopathology (HiTOP; Kotov et al., 2017), have shown tremendous benefits in empirically examining transdiagnostic processes. Previous research, for example, has shown that steeper delay discounting rates were associated with the general externalizing dimension of psychopathology, and not differentially related to specific externalizing disorders (Finn et al., 2015). This research must be continued and broadened substantially; indeed, current findings suggest steep delay discounting relates to processes that contribute to a large array of dysfunction even broader than the externalizing dimension. Practically speaking, delay discounting studies analyzing only a minimally diverse diagnostic sample (e.g. diagnostic group ‘X’ v. controls) will continue to have extreme difficulties in illuminating how to understand this task in clinical science at large.
Moving beyond discounting and embracing ITC
We have described some methods to improve research related to delay discounting and clinical populations. However, our primary recommendation is to seriously consider embracing alternative and creative assessments of ITC beyond delay discounting tasks. Traditional delay discounting tasks have significant limitations even when following all recommendations in the current paper. As discussed in Sharma et al. (2014), researchers should aim to connect their laboratory tasks to real-world decisions/behaviors as much as possible. For example, Finn, Gerst, Lake, and Bogg (2017) asked a high-externalizing sample of students to make decisions related to attending/drinking at certain events that varied in terms of incentives (e.g. friends at the party) and disincentives (e.g. you have a test the next day), and found that individuals with antisocial personality traits were more likely to be uninfluenced by disincentive levels when making decisions. This paradigm has many similarities to traditional delay discounting tasks, but provides added complexity to examine specific processes related to externalizing psychopathology and gives the decisions made in the task more external validity. It is possible that more complex and ecologically valid tasks are needed to ‘bridge’ the gap between very basic tasks such as traditional monetary discounting and complex behaviors such as substance use. Moreover, we urge the field to focus on providing tasks and models that we can empirically demonstrate are strongly related to clinical phenomena. We cannot be overly enamored with one face-valid task we believe will solve all these problems for us. We strongly encourage researchers to more carefully examine how well narratives around the utility of discounting rates are backed by strong empirical support. Despite centering on delay discounting in the current paper, we believe these principles apply to the use of decision-making paradigms in clinical science at large.
Acknowledgements
We are grateful to Jerome Busemeyer (Indiana University-Bloomington) for his insightful comments on an early version of this paper.
Financial support
This research was supported by the National Institute on Alcohol Abuse and Alcoholism grant (R01AA13650) to Peter Finn, the National Institute on Drug Abuse (NIDA) grant (T32 DA24628) and National Institutes of Health grant (T32 MH103213) to Allen Bailey, and NIDA grant (T32 DA24628) to Ricardo J. Romeu.
Ethical standards
The authors assert that all procedures contributing to this work comply with the ethical standards of the relevant national and institutional committees on human experimentation and with the Helsinki Declaration of 1975, as revised in 2008.
Conflict of interest
None.
References
- Amlung, M., Marsden, E., Holshausen, K., Morris, V., Patel, H., Vedelago, L., … McCabe, R. E. (2019). Delay discounting as a transdiagnostic process in psychiatric disorders: A meta-analysis. JAMA Psychiatry, 76(11), 1176. doi: 10.1001/jamapsychiatry.2019.2102.
- Amlung, M., Vedelago, L., Acker, J., Balodis, I., & MacKillop, J. (2017). Steep delay discounting and addictive behavior: A meta-analysis of continuous associations. Addiction, 112(1), 51–62. doi: 10.1111/add.13535.
- Athamneh, L. N., Freitas Lemos, R., Basso, J. C., Tomlinson, D. C., Craft, W. H., Stein, M. D., … Bickel, W. K. (2020). The phenotype of recovery II: The association between delay discounting, self-reported quality of life, and remission status among individuals in recovery from substance use disorders. Experimental and Clinical Psychopharmacology. Advance online publication. doi: 10.1037/pha0000389.
- Athamneh, L. N., Stein, J. S., & Bickel, W. K. (2019). Narrative theory III: Evolutionary narratives addressing mating motives change discounting and tobacco valuation. Experimental and Clinical Psychopharmacology, 28(3), 276–290. doi: 10.1037/pha0000315.
- Audrain-McGovern, J., Rodriguez, D., Epstein, L. H., Cuevas, J., Rodgers, K., & Wileyto, E. P. (2009). Does delay discounting play an etiological role in smoking or is it a consequence of smoking? Drug and Alcohol Dependence, 103(3), 99–106.
- Bailey, A. J., Gerst, K., & Finn, P. R. (2018). Delay discounting of losses and rewards in alcohol use disorder: The effect of working memory load. Psychology of Addictive Behaviors, 32(2), 197.
- Bailey, A. J., Gerst, K., & Finn, P. R. (2020). Intelligence moderates the relationship between delay discounting rate and problematic alcohol use. Psychology of Addictive Behaviors, 34(1), 175.
- Bickel, W. K., Athamneh, L. N., Basso, J. C., Mellis, A. M., DeHart, W. B., Craft, W. H., … Pope, D. (2019). Excessive discounting of delayed reinforcers as a trans-disease process: Update on the state of the science. Current Opinion in Psychology, 30, 59–64. doi: 10.1016/j.copsyc.2019.01.005.
- Bickel, W. K., Jarmolowicz, D. P., Mueller, E. T., Koffarnus, M. N., & Gatchalian, K. M. (2012). Excessive discounting of delayed reinforcers as a trans-disease process contributing to addiction and other disease-related vulnerabilities: Emerging evidence. Pharmacology & Therapeutics, 134(3), 287–297.
- Bickel, W. K., Koffarnus, M. N., Moody, L., & Wilson, A. G. (2014). The behavioral- and neuro-economic process of temporal discounting: A candidate behavioral marker of addiction. Neuropharmacology, 76, 518–527.
- Bickel, W. K., & Mueller, E. T. (2009). Toward the study of trans-disease processes: A novel approach with special reference to the study of co-morbidity. Journal of Dual Diagnosis, 5(2), 131–138.
- Bickel, W. K., Stein, J. S., Moody, L. N., Snider, S. E., Mellis, A. M., & Quisenberry, A. J. (2017). Toward narrative theory: Interventions for reinforcer pathology in health behavior. In J. R. Stevens (Ed.), Impulsivity (Nebraska Symposium on Motivation, Vol. 64, pp. 227–267). Springer International Publishing AG. doi: 10.1007/978-3-319-51721-6_8.
- Bickel, W. K., Yi, R., Landes, R. D., Hill, P. F., & Baxter, C. (2011). Remember the future: Working memory training decreases delay discounting among stimulant addicts. Biological Psychiatry, 69(3), 260–265.
- Bobova, L., Finn, P. R., Rickert, M. E., & Lucas, J. (2009). Disinhibitory psychopathology and delay discounting in alcohol dependence: Personality and cognitive correlates. Experimental and Clinical Psychopharmacology, 17(1), 51.
- Caspi, A., Houts, R. M., Belsky, D. W., Goldman-Mellor, S. J., Harrington, H., Israel, S., … Poulton, R. (2014). The p factor: One general psychopathology factor in the structure of psychiatric disorders? Clinical Psychological Science, 2(2), 119–137.
- Cheng, J., & González-Vallejo, C. (2016). Attribute-wise vs. alternative-wise mechanism in intertemporal choice: Testing the proportional difference, trade-off, and hyperbolic models. Decision, 3(3), 190.
- Cicchetti, D., & Rogosch, F. A. (1996). Equifinality and multifinality in developmental psychopathology. Development and Psychopathology, 8(4), 597–600.
- Cronbach, L. J., & Meehl, P. E. (1955). Construct validity in psychological tests. Psychological Bulletin, 52(4), 281.
- Dai, J., & Busemeyer, J. R. (2014). A probabilistic, dynamic, and attribute-wise model of intertemporal choice. Journal of Experimental Psychology: General, 143(4), 1489.
- Dai, J., Gunn, R. L., Gerst, K. R., Busemeyer, J. R., & Finn, P. R. (2016). A random utility model of delay discounting and its application to people with externalizing psychopathology. Psychological Assessment, 28(10), 1198.
- Fernie, G., Peeters, M., Gullo, M. J., Christiansen, P., Cole, J. C., Sumnall, H., & Field, M. (2013). Multiple behavioural impulsivity tasks predict prospective alcohol involvement in adolescents. Addiction, 108(11), 1916–1923.
- Finn, P. R., Gerst, K., Lake, A., & Bogg, T. (2017). Decisions to attend and drink at party events: The effects of incentives and disincentives and lifetime alcohol and antisocial problems. Alcoholism: Clinical and Experimental Research, 41(9), 1622–1629.
- Finn, P. R., Gunn, R. L., & Gerst, K. R. (2015). The effects of a working memory load on delay discounting in those with externalizing psychopathology. Clinical Psychological Science, 3(2), 202–214. doi: 10.1177/2167702614542279.
- Franck, C. T., Koffarnus, M. N., House, L. L., & Bickel, W. K. (2015). Accurate characterization of delay discounting: A multiple model approach using approximate Bayesian model selection and a unified discounting measure. Journal of the Experimental Analysis of Behavior, 103(1), 218–233.
- Gilroy, S. P., Franck, C. T., & Hantula, D. A. (2017). The discounting model selector: Statistical software for delay discounting applications. Journal of the Experimental Analysis of Behavior, 107(3), 388–401.
- Gray, J. C., & MacKillop, J. (2015). Impulsive delayed reward discounting as a genetically-influenced target for drug abuse prevention: A critical evaluation. Frontiers in Psychology, 6, 1104.
- Gunn, R. L., Gerst, K. R., Wiemers, E. A., Redick, T. S., & Finn, P. R. (2018). Predictors of effective working memory training in individuals with alcohol use disorders. Alcoholism: Clinical and Experimental Research, 42(12), 2432–2441.
- Hirsh, J. B., Morisano, D., & Peterson, J. B. (2008). Delay discounting: Interactions between personality and cognitive ability. Journal of Research in Personality, 42(6), 1646–1650.
- Insel, T., Cuthbert, B., Garvey, M., Heinssen, R., Pine, D. S., Quinn, K., … Wang, P. (2010). Research domain criteria (RDoC): Toward a new classification framework for research on mental disorders. The American Journal of Psychiatry, 167(7), 748–751. doi: 10.1176/appi.ajp.2010.09091379.
- Jackson, J. N., & MacKillop, J. (2016). Attention-deficit/hyperactivity disorder and monetary delay discounting: A meta-analysis of case-control studies. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging, 1(4), 316–325. doi: 10.1016/j.bpsc.2016.01.007.
- Johnson, M. W., & Bickel, W. K. (2008). An algorithm for identifying nonsystematic delay-discounting data. Experimental and Clinical Psychopharmacology, 16(3), 264.
- Khurana, A., Romer, D., Betancourt, L. M., Brodsky, N. L., Giannetta, J. M., & Hurt, H. (2013). Working memory ability predicts trajectories of early alcohol use in adolescents: The mediational role of impulsivity. Addiction, 108(3), 506–515.
- Koffarnus, M. N., & Bickel, W. K. (2014). A 5-trial adjusting delay discounting task: Accurate discount rates in less than one minute. Experimental and Clinical Psychopharmacology, 22(3), 222.
- Kotov, R., Krueger, R. F., Watson, D., Achenbach, T. M., Althoff, R. R., Bagby, R. M., … Clark, L. A. (2017). The hierarchical taxonomy of psychopathology (HiTOP): A dimensional alternative to traditional nosologies. Journal of Abnormal Psychology, 126(4), 454.
- Kvam, P. D., Romeu, R. J., Turner, B. M., Vassileva, J., & Busemeyer, J. R. (2021). Testing the factor structure underlying behavior using joint cognitive models: Impulsivity in delay discounting and Cambridge gambling tasks. Psychological Methods, 26(1), 18–37. doi: 10.1037/met0000264.
- Kwako, L. E., Bickel, W. K., & Goldman, D. (2018). Addiction biomarkers: Dimensional approaches to understanding addiction. Trends in Molecular Medicine, 24(2), 121–128.
- Lempert, K. M., Steinglass, J. E., Pinto, A., Kable, J. W., & Simpson, H. B. (2019). Can delay discounting deliver on the promise of RDoC? Psychological Medicine, 49(2), 190–199. doi: 10.1017/s0033291718001770.
- MacKillop, J., Amlung, M. T., Few, L. R., Ray, L. A., Sweet, L. H., & Munafo, M. R. (2011). Delayed reward discounting and addictive behavior: A meta-analysis. Psychopharmacology (Berlin), 216(3), 305–321. doi: 10.1007/s00213-011-2229-0.
- MacKillop, J., Weafer, J., Gray, J. C., Oshri, A., Palmer, A., & de Wit, H. (2016). The latent structure of impulsivity: Impulsive choice, impulsive action, and impulsive personality traits. Psychopharmacology, 233(18), 3361–3370.
- Mahalingam, V., Stillwell, D., Kosinski, M., Rust, J., & Kogan, A. (2014). Who can wait for the future? A personality perspective. Social Psychological and Personality Science, 5(5), 573–583.
- Mellis, A. M., Athamneh, L. N., Stein, J. S., Sze, Y. Y., Epstein, L. H., & Bickel, W. K. (2018). Less is more: Negative income shock increases immediate preference in cross commodity discounting and food demand. Appetite, 129, 155–161.
- Miyake, A., & Friedman, N. P. (2012). The nature and organization of individual differences in executive functions: Four general conclusions. Current Directions in Psychological Science, 21(1), 8–14.
- Molloy, M. F., Romeu, R. J., Kvam, P. D., Finn, P. R., Busemeyer, J., & Turner, B. M. (2020). Hierarchies improve individual assessment of temporal discounting behavior. Decision, 7(3), 212–224. doi: 10.1037/dec0000121.
- Odum, A. L. (2011). Delay discounting: Trait variable? Behavioural Processes, 87(1), 1–9.
- Ohmura, Y., Takahashi, T., Kitamura, N., & Wehr, P. (2006). Three-month stability of delay and probability discounting measures. Experimental and Clinical Psychopharmacology, 14(3), 318.
- Peters, J., Miedl, S. F., & Büchel, C. (2012). Formal comparison of dual-parameter temporal discounting models in controls and pathological gamblers. PLoS One, 7(11), e47225.
- Read, D., Frederick, S., & Scholten, M. (2013). DRIFT: An analysis of outcome framing in intertemporal choice. Journal of Experimental Psychology: Learning, Memory, and Cognition, 39(2), 573.
- Reinert, D. F., & Allen, J. P. (2002). The alcohol use disorders identification test (AUDIT): A review of recent research. Alcoholism: Clinical and Experimental Research, 26(2), 272–279.
- Rung, J. M., & Madden, G. J. (2018). Experimental reductions of delay discounting and impulsive choice: A systematic review and meta-analysis. Journal of Experimental Psychology: General, 147(9), 1349–1381. doi: 10.1037/xge0000462.
- Shamosh, N. A., & Gray, J. R. (2008). Delay discounting and intelligence: A meta-analysis. Intelligence, 36(4), 289–305.
- Sharma, L., Markon, K. E., & Clark, L. A. (2014). Toward a theory of distinct types of ‘impulsive’ behaviors: A meta-analysis of self-report and behavioral measures. Psychological Bulletin, 140(2), 374.
- Smith, K. R., Lawyer, S. R., & Swift, J. K. (2018). A meta-analysis of nonsystematic responding in delay and probability reward discounting. Experimental and Clinical Psychopharmacology, 26(1), 94.
- Smith, G. T., McCarthy, D. M., & Anderson, K. G. (2000). On the sins of short-form development. Psychological Assessment, 12(1), 102.
- Snider, S. E., LaConte, S. M., & Bickel, W. K. (2016). Episodic future thinking: Expansion of the temporal window in individuals with alcohol dependence. Alcoholism: Clinical and Experimental Research, 40(7), 1558–1566.
- Snyder, H. R., Miyake, A., & Hankin, B. L. (2015). Advancing understanding of executive function impairments and psychopathology: Bridging the gap between clinical and cognitive approaches. Frontiers in Psychology, 6, 328.
- Stein, J. S., Sze, Y. Y., Athamneh, L., Koffarnus, M. N., Epstein, L. H., & Bickel, W. K. (2017). Think fast: Rapid assessment of the effects of episodic future thinking on delay discounting in overweight/obese participants. Journal of Behavioral Medicine, 40(5), 832–838.
- Story, G. W., Moutoussis, M., & Dolan, R. J. (2016). A computational analysis of aberrant delay discounting in psychiatric disorders. Frontiers in Psychology, 6, 1948.
- Turner, J. K., Athamneh, L. N., Basso, J. C., & Bickel, W. K. (2021). The phenotype of recovery V: Does delay discounting predict the perceived risk of relapse among individuals in recovery from alcohol and drug use disorders. Alcoholism: Clinical and Experimental Research, 45(5), 1100–1108. doi: 10.1111/acer.14600.
- Vincent, B. T. (2016). Hierarchical Bayesian estimation and hypothesis testing for delay discounting tasks. Behavior Research Methods, 48(4), 1608–1620.
- Volkow, N. D., & Baler, R. D. (2015). Now vs later brain circuits: Implications for obesity and addiction. Trends in Neurosciences, 38(6), 345–352.
- Weatherly, J. N., & Ferraro, F. R. (2011). Executive functioning and delay discounting of four different outcomes in university students. Personality and Individual Differences, 51(2), 183–187.
- Weatherly, J. N., & Terrell, H. K. (2010). Delay discounting of different commodities II: Confirmatory analyses. The Journal of General Psychology, 138(1), 35–48.
- Weatherly, J. N., Terrell, H. K., & Derenne, A. (2010). Delay discounting of different commodities. The Journal of General Psychology: Experimental, Psychological, and Comparative Psychology, 137(3), 273–286.
- Wilson, M., & Daly, M. (2004). Do pretty women inspire men to discount the future? Proceedings of the Royal Society of London. Series B: Biological Sciences, 271(suppl_4), S177–S179.
- Yeh, Y.-H., Myerson, J., & Green, L. (2020). Delay discounting, cognitive ability, and personality: What matters? Psychonomic Bulletin & Review, 28(2), 686–694.