Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2024 Jun 1.
Published in final edited form as: Cogn Affect Behav Neurosci. 2023 Mar 3;23(3):844–868. doi: 10.3758/s13415-023-01080-w

Anxiety as a disorder of uncertainty: implications for understanding maladaptive anxiety, anxious avoidance, and exposure therapy

Vanessa M Brown 1, Rebecca Price 1, Alexandre Y Dombrovski 1
PMCID: PMC10475148  NIHMSID: NIHMS1901333  PMID: 36869259

Abstract

In cognitive-behavioral conceptualizations of anxiety, exaggerated threat expectancies underlie maladaptive anxiety. This view has led to successful treatments, notably exposure therapy, but is not consistent with the empirical literature on learning and choice alterations in anxiety. Empirically, anxiety is better described as a disorder of uncertainty learning. How disruptions in uncertainty lead to impairing avoidance and are treated with exposure-based methods, however, is unclear. Here, we integrate concepts from neurocomputational learning models with clinical literature on exposure therapy to propose a new framework for understanding maladaptive uncertainty functioning in anxiety. Specifically, we propose that anxiety disorders are fundamentally disorders of uncertainty learning and that successful treatments, particularly exposure therapy, work by remediating maladaptive avoidance from dysfunctional explore/exploit decisions in uncertain, potentially aversive situations. This framework reconciles several inconsistencies in the literature and provides a path forward to better understand and treat anxiety.

Keywords: anxiety, uncertainty, avoidance, reinforcement learning, computational modeling

Introduction

Pathological anxiety results from excessive fear of specific situations, memories, and thoughts and causes impairing distress and avoidance of these feared situations (American Psychiatric Association, 2013; Yang et al., 2021). Fear-based symptom clusters in anxiety are characterized by excessive threat responses and include somatic hyperarousal symptoms prominent in panic disorder, social phobia, specific phobia, obsessive compulsive disorder, and posttraumatic stress disorder (Craske et al., 2009; Levin-Aspenson et al., 2021). These clinical presentations differ from distress disorders, such as generalized anxiety disorder, that feature worry and rumination symptoms and show more overlap with depression (Watson, 2005).

Anxiety has long been conceptualized as a disorder of maladaptive learning (Mowrer, 1939; Rachman, 1977; Zinbarg et al., 2022), in part due to the relative effectiveness of learning-based treatments, such as exposure therapy. Yet, exposure therapy and other psycho- and pharmacotherapies are only moderately effective in treating anxiety, and many people do not respond to or are not able to access treatment (Bandelow et al., 2017; Eddy et al., 2004; Watts et al., 2013). Recent advances in computational learning theory and neuroscience of learning (Craske et al., 2014; Levy & Schiller, 2021; Pine, 2017; Pulcu & Browning, 2019) have the potential to greatly improve our understanding and ultimately treatment and prevention of anxiety disorders. To benefit from these advances, we need to fully understand how treatments, such as exposure therapy, change learning, how learning is altered in pathological anxiety, and how contemporary computational learning theory can inform treatment. In particular, we need to understand how uncertainty-related behavior and neural function are disrupted in anxiety and how these disruptions can be remediated with treatment.

Current, cognitive-behavioral conceptualizations view anxiety as due to excessive learned threat expectancy, maintained by avoidance of feared outcomes and remediated by deliberately encountering anxiety-producing situations (Craske et al., 2008; Foa & Kozak, 1986). Yet, laboratory studies of learning in anxiety have failed to consistently find enhanced aversive conditioning or excessive valuation of aversive stimuli (Duits et al., 2015), and many techniques used in exposure therapy are inconsistent with excessive learned threat as the root of anxiety. Instead, uncertainty-related disruptions often are seen in anxiety (Carleton, 2016; Grupe & Nitschke, 2013; Piray & Daw, 2021; Pulcu & Browning, 2019). However, existing theoretical frameworks do not explain how dysfunctional learning and decision-making under uncertainty create avoidance and are remediated with exposure therapy. We focus on exposure therapy, because it is hypothesized to causally manipulate learning in anxiety; however, a number of other effective treatments for anxiety exist, including antidepressants. While we discuss human studies, they need to be considered in the context of the large animal literature on threat conditioning, avoidance, and their pharmacological manipulations (for reviews see Borsini et al., 2002; Harro, 2018; LeDoux et al., 2017; Maren, 2001, 2011; Quirk & Mueller, 2008).

Here, we consider the clinical observations of exposure therapy alongside neurocomputational theories of uncertainty and advance a view of maladaptive learning in anxiety based on uncertainty-sensitive learning and exploration. We propose 1) that maladaptive uncertainty learning is central to fear-based anxiety disorders, 2) that this disrupted uncertainty learning leads to dysfunctional explore/exploit decisions in aversive environments to cause impairing avoidance, and 3) that successful treatment, particularly exposure therapy, remediates these dysfunctions. Reconceptualizing treatment in this way will improve our understanding of how therapies for anxiety disorders work, rendering them more effective.

Current state of exposure therapy

Learning principles and development of exposure therapy

Mary Cover Jones first applied learning theory to reduce anxiety in the 1920s (Jones, 1924a, 1924b). Building on the studies of conditioning of John Watson and others, she showed that some learning techniques successfully reduced fear in children with various phobias. Exposure therapy techniques began to be systematically developed decades later, including systematic desensitization (Wolpe, 1961) and flooding/implosive therapy (Ayer, 1972; Cooper & Clum, 1989; Keane et al., 1989), informed by the laboratory work of Mowrer and others (Mowrer, 1951). The theoretical principles behind exposure therapy were then elaborated as emotional processing theory by Edna Foa and colleagues (Foa & Kozak, 1986). Emotional processing theory proposed that exposure therapy updated the maladaptive emotional content of memory structures through repeated experience with situations activating these structures. Across all of these approaches, the core steps of exposure therapy are clear: first, feared situations are identified, and then patients are guided through structured exposures. In these exposures, they engage in feared situations until a (often predetermined) stopping point without engaging in avoidance, escape, or safety behaviors.

Two points stand out from this foundational work: first, early theories of exposure used a behaviorist black box approach to fear and anxiety. In this framework, any fears result from conditioning by the external environment: anxiety, whether adaptive or not, reflects the presence of learned threat, maladaptive anxiety results from excessive learned threat, and so repeated exposures correct this excessive threat. This view does not account for individual differences or internal, cognitive factors bearing on how information from the environment is processed. Although theories of anxiety and exposure may include internal, cognitive processes (Foa & Kozak, 1986; Lovibond et al., 2008), acknowledge the role of proximal and distal risk factors conferring individual differences (Zinbarg et al., 2022), and note that these individual differences likely influence learning (Craske et al., 2014), the assumption remains that exposure therapy works through normative-learning processes. Based on this assumption, reducing maladaptive fear and avoidance in pathological anxiety is equivalent to reducing experimentally conditioned fear and avoidance in the laboratory.

Second, despite the clear and ongoing inspiration from learning principles, developing exposure as an effective treatment required much trial and error. Jones’ work reviews several approaches that she tried with different children, all well informed by learning principles, and notes several that were unsuccessful and even harmful (Jones, 1924a). Similarly, Wolpe’s description of systematic desensitization is clearly informed by testing different approaches with his patients (Wolpe, 1961). Exposure therapy, therefore, did not neatly derive from learning theory; rather, it is the result of extensive experimentation and refinement. Why was behaviorist learning theory alone insufficient for developing an effective treatment? As we will review in the next paragraphs, this trial-and-error development revealed that many approaches that should work according to basic conditioning theories alone are not effective, while the approaches that proved to be effective require explanation beyond that provided by basic learning explanations.

What is missing from our understanding of exposure?

Exposure therapy techniques underwent several empirical modifications after their initial development. For example, relaxation training was central to systematic desensitization and other early exposure formulations (Wolpe, 1961); relaxation was thought to serve as a competing response to inhibit and extinguish learned associations between anxiogenic stimuli and autonomic arousal. Yet, empirical benefits of relaxation training are limited and experts now recommend against this component (Abramowitz, 2013; Rothbaum & Schwartz, 2002). Another major challenge to emotional processing theory in particular involved the time course of fear reduction. Emotional processing theory makes a specific prediction that incorporating corrective information into fear structures leads to fear reduction both within and between each exposure session. However, only between-session fear reduction predicts treatment outcome, necessitating a reconceptualization of when and how symptom change relates to learning (reviewed in Asnaani et al., 2016; Craske et al., 2008).

Attempts to augment normative learning processes during exposure therapy with learning-informed pharmacological, somatic, or behavioral interventions have not been particularly successful. D-cycloserine, a partial NMDA receptor antagonist with effects on fear conditioning, shows mixed results when used during exposure sessions; the number and timing of doses may moderate response (Hofmann, 2014; Mataix-Cols et al., 2017; Norberg et al., 2008), but targeted, preregistered tests of proposed moderators failed to demonstrate an augmenting effect (Smits et al., 2020). Transcranial magnetic stimulation (TMS) or transcranial direct current stimulation (tDCS) to increase or decrease activity in learning-related cortical regions during exposure have not been consistently effective (Bulteau et al., 2022; Cobb et al., 2021; Isserles et al., 2021; Zaizar et al., 2021). Behavioral approaches that normatively enhance extinction, such as novelty-facilitated extinction (Dunsmoor et al., 2015), do not improve exposure therapy outcomes in people with anxiety (Steinman et al., 2022). These translational failures occurred despite robust preclinical findings: these approaches consistently change normative learning, yet do not improve treatment outcomes. The disappointing outcomes of these exposure augmentation strategies have several possible explanations, but they suggest that the learning processes responsible for symptom improvement are not successfully modified by strategies that augment normative learning.

Current theories of exposure also do not account for important treatment components. The inability to overcome avoidance and engage in exposures, especially during initial exposure sessions, is a major barrier to treatment response. Empirically, higher pretreatment avoidance predicts worse response to treatment over and above symptom severity, reductions in avoidance predate other symptom changes during exposure therapy, and avoidance is significantly reduced with successful treatment (Cottraux et al., 1993, 2001; Hansmeier et al., 2021; Salcioğlu et al., 2007; Wheaton et al., 2018). In practice, initial exposure sessions focus more on reducing avoidance than on changing expectations of feared outcomes. Yet, current theories view avoidance as a byproduct of excessive fear, not a target of treatment itself. Avoidance is instead targeted outside of exposure through psychoeducation or motivational techniques. Manuals for exposure therapy practitioners (Foa et al., 2012) emphasize the need to target the patient’s idiographic “core fear,” a notion found in current theories of exposure. For example, two people with cleanliness-related obsessive-compulsive disorder (OCD) will require different approaches to exposure if one person’s core fear is harm coming to their loved ones from germs and the other’s is losing control of their environment. Successful exposures targeting a core fear will show generalization to other feared situations not directly targeted (such as refraining from cleaning at home generalizing to being able to tolerate messes in one’s child’s classroom, to use the first example). In contrast, exposures that do not target the core fear, despite similar content and patient fear ratings (such as refraining from cleaning a break room at work that the person’s family will never visit), show less generalization and overall symptom reduction. Yet, the necessity of targeting the core fear—over and above tailoring exposures based on patients’ fear ratings—is not addressed by current theories of exposure.

More recently, researchers have begun to tackle some of these problems and have proposed new accounts of learning processes in exposure that account for the need to focus on core fears, as well as a renewed focus on avoidance. Inhibitory learning theory moves beyond a basic focus on extinction and proposes that learning deficits—specifically, failure to inhibit fear memories and forming new, safe contexts—underlie the development of anxiety and poor response to exposure therapy (Craske et al., 2014). Problems separating dangerous and safe contexts also support the need to focus on core fears that share a context, rather than unrelated fears that span contexts. The central role of avoidance, and possible pathways to understand avoidance in anxiety, also have been the focus of recent reviews and theoretical papers (Krypotos, 2015; Pittig et al., 2018, 2020). However, these inconsistencies are yet to be fully integrated into theories of what goes awry in anxiety and how treatments, such as exposure therapy, work.

Uncertainty and anxiety

Current ideas about the role of uncertainty in anxiety

If, as proposed in earlier models, anxiety disorders are disorders of excessive learned threat, people with pathological anxiety should show increased threat conditioning and valuation of negative stimuli in the laboratory. People with anxiety do show behavioral, psychophysiological, and neural differences with paradigms, such as fear conditioning, safety learning, and extinction learning; however, these learning differences are not consistent with the idea of greater aversive conditioning in anxiety. Rather, in threat learning, people with anxiety disorders show elevated responding to safety signals (CS−) during conditioning, greater responses to the cue previously associated with threat (CS+E) during extinction (Duits et al., 2015), and greater generalization of learned threat (Cooper et al., 2022; Fraunfelter et al., 2022), accompanied by reduced ventromedial prefrontal cortex (vmPFC) activation (Marin et al., 2017, 2020). Outside of learning paradigms, people with fear-based disorders, such as panic disorder and posttraumatic stress disorder, show elevated psychophysiological responses to uncertain threat (Gorka et al., 2017; Grillon et al., 2008, 2009; McTeague & Lang, 2012a).

To account for these findings, Grupe and Nitschke (2013) proposed the uncertainty and anticipation model of anxiety (UAMA), where anxiety is characterized by excessive anticipation of uncertain threat. UAMA delineates five areas of dysfunction: inflated estimates of threat costs and probability, hypervigilance, deficient safety learning, behavioral and cognitive avoidance, and heightened reactivity to uncertainty; these dysfunctions relate (in both overlapping and distinct ways) to altered functioning of anterior midcingulate cortex, vmPFC, orbitofrontal cortex (OFC), insula, amygdala, dorsomedial and dorsolateral prefrontal cortex, bed nucleus of the stria terminalis (BNST), and periaqueductal gray (PAG), among other areas. Similar to UAMA’s emphasis on uncertain threat, Lissek, Pine, and Grillon’s (Lissek et al., 2006) analysis of ‘”strong” versus “weak” situations (Mischel, 1977) in anxiety emphasizes the role of uncertainty. People with clinical anxiety show intact responding to strong situations with clear threats but show elevated physiological responses to weak situations with greater uncertainty about negative outcomes. According to another theory of intolerance of uncertainty in anxiety, anxiety is due to difficulties tolerating the distress of uncertainty or insufficient information (Carleton, 2016). Crucially, these theories do not propose that threat processing overall is disrupted in anxiety nor that anxiety results from representing all negative outcomes as worse or more likely. Instead, they propose that anxiety is specifically related to difficulties with threat only when it is uncertain.

Precisely defining uncertainty using computational process models

Extant theories converge on uncertainty-related dysfunctions underlying behavioral, physiological, neural, and self-report impairments in pathological anxiety. Still unclear, though, is how these different dysfunctions relate to each other and what generates them: for example, is deficient safety learning due to the same underlying dysfunction as excessive behavioral and cognitive avoidance, or are these separate dysfunctions that co-occur in anxiety? Whether separate or shared, what processes lead to these observed disruptions? In addition, commonly used self-report measures, such as the Intolerance of Uncertainty Scale (Buhr & Dugas, 2002) are vague about the source of uncertainty-related issues. If someone endorses an item, such as “uncertainty makes me uneasy, anxious, or stressed,” does that mean that uncertain threat probabilities cause them anxiety, that they value negative outcomes as more aversive, that they require greater control over their environment, that they have less emotion-regulation abilities, or some combination of the above? Similarly, fear conditioning and extinction studies can demonstrate that people with anxiety differ in responses to safe and threat-related stimuli but cannot explain whether these findings are from elevated responding to all cues in a threat-associated environment, difficulty distinguishing safe from threat-related stimuli, or other learning differences. To better understand how uncertainty relates to anxiety pathology, these constructs must be precisely defined and distinguished from each other. Computational models can mathematically define learning and choices processes affected by uncertainty and disambiguate among disruptions in components of these processes (Krypotos et al., 2020; Levy & Schiller, 2021; Montague et al., 2012).

When navigating the world, organisms must integrate and update information about relationships between stimuli and outcomes (learning) and select stimuli based on their possible outcomes (choice). Three primary families of computational models describe these learning and choice processes: descriptive models of choice under ambiguity and risk (Platt & Huettel, 2008), generative sequential sampling models of choice that represent evidence accumulation during decision processes (Forstmann et al., 2016; Ratcliff & McKoon, 2008), and reinforcement learning and similar error-driven value update models of learning (Niv, 2009; Sutton & Barto, 1998; Vilares & Kording, 2011). The first set of models describes choice during “decisions from description” (Hertwig et al., 2004), or how people choose when values and probabilities are explicitly presented to participants. These models are descriptive rather than generative: they can decompose choices into components but cannot describe how these choices are generated. The second set of models, sequential sampling models, are generative models of choice and can describe decisions from either described or learned values and probabilities; however, as these models have not been applied to decisions involving uncertainty in people with anxiety (see Aylward et al., 2020; Price et al., 2019; White et al., 2010, 2016 for nonuncertainty-related studies in anxiety), we will not focus on these models in this review. The third set of models, reinforcement learning and other error-driven value update models, are generative and describe learning and decision-making processes when information about value and probabilities must be learned (“decisions from experience”) rather than described.

These learning and choice models allow us to quantify different types of uncertainty (Bach & Dolan, 2012; Bland & Schaefer, 2012; Nassar et al., 2012; Payzan-LeNestour & Bossaerts, 2011; Yu & Dayan, 2005; Table 1; Fig. 1A). Definitions and distinctions among types of uncertainty vary, but one consistent distinction is between irreducible and reducible uncertainty. Irreducible uncertainty (also called known uncertainty or risk) arises from noisy relationships between stimuli and outcomes (e.g., from a coin flip); it can be precisely described but not reduced. In contrast, reducible uncertainty arises from incomplete knowledge of contingencies between stimuli and outcomes and, as the name implies, can be reduced when learning is possible. Reducible uncertainty can be divided into two types, depending on whether the uncertainty is about contingencies given a stable relationship between stimuli and outcomes (estimation uncertainty) or if it stems from uncertainty about which environment is currently governing stimulus-outcome contingencies or the stability of those contingencies (unexpected uncertainty; also termed changepoint probability or volatility; Nassar et al., 2012; Payzan-LeNestour & Bossaerts, 2011; Piray & Daw, 2021). See Fig. 1B for a representation of irreducible uncertainty, estimation uncertainty, and unexpected uncertainty during a reversal learning task. With high volatility or frequent changepoints causing high unexpected uncertainty, normative learning should show high learning rates, particularly after poorly predicted outcomes, as represented by the Pearce-Hall model of learning (Pearce & Hall, 1980). Meanwhile, with high noise causing high irreducible uncertainty, normative learning should slowly incorporate poorly predicted outcomes with a low learning rate, as represented by the Mackintosh model of learning (Mackintosh, 1975; Piray & Daw, 2021).

Table 1.

Key terms

Term Definition Related terms
Learning The process of integrating and updating information about relationships between stimuli and outcomes
Choice In instrumental learning, the process of selecting stimuli based on their possible outcomes Policy
Irreducible uncertainty Uncertainty from probabilistic relationships between stimuli and outcomes Known uncertainty; noise; risk
Reducible uncertainty Uncertainty from incomplete knowledge of relationships between stimuli and outcomes Unknown uncertainty; ambiguity
Estimation uncertainty Form of reducible uncertainty related to lack of knowledge about current relationships between stimuli and outcomes
Unexpected uncertainty Form of reducible uncertainty related to possibility of changes in contingencies Changepoint probability; volatility; unexpected uncertainty
Bandit tasks Reinforcement learning tasks with a single step where chosen stimuli lead directly to rewards
Markov Decision Process (MDP) Description of multi-step learning based on states, rewards, and actions
State Stage in MDP arrived at from an action made at another state; terminal states provide rewards
Transition probability Likelihood of a state leading to another state based on an action in an MDP
Options Decision policy specifying a set sequence of actions between several linked states in an MDP

Fig. 1.

Fig. 1

Types of uncertainty. A. Relationship among uncertainty types. B. Schematic of levels of types of uncertainty during a reversal learning task. Initially, outcome 1 is reinforced more than outcome 2 (reinforcement represented by dots in top part of figure, where x-axis indicates trials). Halfway through, these contingencies reverse, and a second reversal occurs ~75% of the way through. Approximate values of each type of uncertainty at different points in the task are represented in the lower part of the figure. Irreducible uncertainty is initially high before converging on a value close to the true noise level; this form of uncertainty only increases slightly after the changepoints. Estimation uncertainty is initially high, reflecting uncertainty about contingencies. This uncertainty reduces while contingencies are learned but then increases after changepoints, reflecting the need to relearn contingencies once they have changed. As changepoints become more frequent and unexpected uncertainty increases, corresponding increases in learning rate lead to faster reductions in estimation uncertainty between changepoints. Unexpected uncertainty is also initially high and decreases while contingencies are stable. After each changepoint, this uncertainty increases and stays higher, reflecting the increased possibility of additional changepoints

The relationship between anxiety and these types of uncertainty can be illustrated with an example of someone who fears health problems during a panic attack (outcome) when navigating a crowded mall (stimulus). Irreducible uncertainty refers to how reliably the crowded mall causes a panic attack; it will be high if being in a mall is a poor predictor of a panic attack and low if being in a mall is a definite predictor of having (or not having) a panic attack. Estimation uncertainty represents what is yet to be learned about the probability of the mall causing a panic attack, given the current set of contingencies; it will be high if more experiences being in the mall will lead to a better ability to predict the likelihood of panic attacks, and low if this contingency is already learned as accurately as possible. Unexpected uncertainty refers to the probability that the relationship between being in the mall and having a panic attack has changed. It will be high if the relationship between being in a mall and having a panic attack may have changed (e.g., after a recent panic attack in a similarly crowded restaurant, which may indicate that panic attacks in crowded areas are now more likely) and lower if the relationship between being in a mall and having a panic attack is stable.

Uncertainty engages brain regions involved in salience, value, and cognitive control (for in-depth reviews of neural correlates of uncertainty processing, see Bach & Dolan, 2012; Soltani & Izquierdo, 2019). The dorsal anterior cingulate cortex (dACC) forms the hub of the salience network (Seeley et al., 2007) and is required to adaptively adjust behavior after changepoints (Behrens et al., 2007; Hayden et al., 2011; Kennerley et al., 2006; Nassar et al., 2019; O’Reilly et al., 2013). The dACC and other noradrenergically innervated components of the salience network, including amygdala and insula, respond to surprising information, although whether they selectively signal one form of uncertainty over another is unclear (Bach & Dolan, 2012; Kuhnen & Knutson, 2005; Li et al., 2011; Soltani & Izquierdo, 2019). Serotonin signals from the dorsal raphe nucleus also communicate information about uncertainty (Grossman et al., 2022; Matias et al., 2017). Meanwhile, orbitofrontal, ventromedial, and ventrolateral prefrontal cortex and hippocampus interact with these uncertainty-activated areas to communicate information about context and expected value (Bartra et al., 2013; Schuck et al., 2016; Schuck & Niv, 2019; Walton et al., 2011)..

A key advantage of these computational models is that these types of uncertainty have precise and explicit mathematical definitions. For example, the irreducible uncertainty of a stimulus with a perfectly learned 75%/25% contingency can be represented by the variance of a Bernoulli distribution with a 75% probability of an outcome. This value can be used to quantitatively assess the effects of different levels of uncertainty on behavioral or physiological signals.

Choice models of decisions from description and findings in anxiety

Choice models of decisions from description are based on prospect theory and related behavioral economic theories (Kahneman & Tversky, 1979). These models measure whether, independent of value, the effects that risk (irreducible uncertainty), ambiguity (similar to reducible uncertainty, but pertains to situations where uncertainty cannot be reduced through learning), and valence (loss versus gain) have on whether an option is chosen. In these models, the probability of choosing each option is based on its expected value (magnitude of each outcome times its probability); risk aversion indicates a reduced likelihood of choosing options with greater risk, accounting for expected value; similarly, ambiguity aversion indicates a reduced likelihood of choosing options with greater ambiguity accounting for expected value, while loss aversion indicates a reduced likelihood to choose options with negative values, accounting for overall expected value. Choices involving risk, ambiguity, and loss engage value-sensitive brain areas such as OFC, vmPFC, and nucleus accumbens as well as salience-related areas like amygdala and insula (Bartra et al., 2013; De Martino et al., 2006; Hsu et al., 2005; Kuhnen & Knutson, 2005; Tom et al., 2007). These paradigms and models do not involve learning. As a result, they do not represent many real-life decisions but are useful for determining alterations in choice behavior when values and probabilities are explicitly presented.

Studies of risk, ambiguity, and loss aversion in anxiety point to increased risk and ambiguity aversion, particularly with losses, but no differences in loss aversion (Charpentier et al., 2017; Giorgetta et al., 2012; Hartley & Phelps, 2012; Maner et al., 2007, but see Sip et al., 2018). This pattern of decision-making differences indicates that people with high anxiety do not avoid choices with potential negative outcomes more than people with less anxiety, but they avoid choices where such outcomes are less predictable due to greater risk or ambiguity. These findings are consistent with intact valuation and decision-making with negative outcomes accompanied by difficulties deciding among uncertain, potentially negative outcomes. Risk and ambiguity aversion may also relate to greater psychophysiological responses for uncertain, but not certain, threat in anxiety (Gorka et al., 2017; Grillon et al., 2008, 2009). These findings suggest that anxiety is related to problems in choice processes. However, most real-world decisions require learning about values and probabilities, which in turn affect choices based on these experiences (Hertwig et al., 2004). Understanding how choice processes interact with potential learning differences in anxiety requires a fuller understanding of how learning and choice interact and go awry in anxiety.

Reinforcement learning studies of anxiety

Process-based models of learning can fill the explanation gap left by descriptive models of choice by illustrating how value and uncertainty are learned and used in decisions. Reinforcement learning conceptualizes value learning as driven by prediction errors from comparing received to expected values (Rescorla & Wagner, 1972; Schultz et al., 1997). Prediction errors are then used to update future expectations of value; a learning rate parameter in the model controls how rapidly these updates occur. In the simplest neural account of reinforcement learning, dopaminergic projections from the ventral tegmental area to striatum, particularly ventral striatum, communicate prediction error signals while ventromedial prefrontal cortex tracks value (Bartra et al., 2013; Diederen et al., 2016; Montague et al., 1996; Rangel et al., 2008). To track complex environments more representative of the real world, basic reinforcement learning models require modification. Specifically, they must track dynamic changes in task contingencies as well as different types of uncertainty that can help discern change points from other sources of variability. Behaviorally, these models incorporate effects of uncertainty on dynamic changes in learning rate.

Models tracking uncertainty during learning have shown the most consistent learning alterations in anxiety. Findings using these models are summarized in Table 2 and show several commonalities (of note, the included studies were based on the authors’ knowledge of the literature and augmented by PubMed and Google Scholar searches using the terms “reinforcement learning AND (anxiety OR PTSD OR OCD)” but do not represent a comprehensive, structured literature review following PRISMA guidelines).

Table 2.

Computational studies of disrupted learning from uncertainty in anxiety

Study Valence Contingencies Changepoint frequency Sample Effect of anxiety Other
Browning et al., 2015 Aversive and appetitive blocks Stable: 75%/25% Volatile: 80%/20% Stable: none Volatile: reversal every 20 trials High and low trait anxiety, n = 31 prescreened “to achieve an approximately even number of participants with scores in the ranges 20–30, 30–40, 40–50 and > 50” on the STAI No overall LR difference but reduced adjustment of LR between volatile and stable blocks with high trait anxiety in aversive valence only Inability to adjust LR related to reduced pupil dilation to trial-level volatility (unexpected uncertainty)
Brown et al., 2018 Aversive and appetitive blocks 75%/25% Signaled once learning occurred (~15 trials) Trauma-exposed with (n = 39) and without (n = 29) PTSD; MDD without anxiety (n = 20) Greater associability (LR adjustment based on past surprise) in aversive valence only Not present in MDD-only participants
Homan et al., 2019 Aversive 33%/67% for CS+, 0%/100% for CS− Reversal after 30 trials Trauma-exposed with (n = 24) and without (n = 30) PTSD Greater associability
Gagne et al., 2020 Aversive and appetitive blocks Same as Browning et al., 2015 Same as Browning et al., 2015 With or without GAD and/or MDD (GAD n = 12; MDD n = 20; healthy controls n = 26), unselected university (n = 30), and online participants (n = 147) Reduced LR adjustment across valences More related to internalizing symptoms than GAD/MDD symptoms specifically
Norbury et al., 2021 Aversive 33%/67% (one context) followed by 0%/100% in two contexts 30 trials per context Trauma-exposed with and without PTSD (n = 56) Greater tendency to form single latent state across contexts Not related to depression symptoms
Abend et al., 2022 Aversive Acquisition: 80%/20% for CS+, 0%/100% for CS−; Extinction: 0%/100% for both CS+ and CS− 10 trials acquisition; 8 trials extinction Youth and adults with (n = 111) and without (n = 104) generalized, social, and/or separation anxiety disorder Slower learning rate for CS− on CS+ trials Anxiety measured with STAI in adults and SCARED in youth
Zika et al., preprint Aversive Three conditions: 90%/10%, 75%/25%, 60%/40% About every 15 trials High and low trait anxiety (n = 89) In low irreducible uncertainty (90%/10%) condition, more likely to form multiple states

LR = learning rate.

Included studies were based on literature searches and were required to: focus on adults, have some proportion of participants with clinical levels of fear-based anxiety (e.g., not healthy undergraduates or only experimentally induced stress/anxiety or primarily MDD/GAD samples; based on inspecting each study’s cutoffs for high anxiety), include computational modeling of behavior (rather than only including neural findings), assess some measure of uncertainty or context, and including sufficient information to assess quality of modeling methods based on Brown et al. (2020) (e.g., not maximum likelihood estimation with no parameter recovery)

First, these studies find differences in parameters representing learning rate adjustments (including associability, or the amount learning rates adjust based on past absolute prediction errors associated with a stimulus; Le Pelley, 2004; Pearce & Hall, 1980), latent state formation (tendency to attribute outcomes to one cause versus another), and other learning measures dealing with uncertainty. Other parameters measuring overall learning rate, choice consistency, and outcome valuation show little relationship with anxiety. Second, studies with reward and loss learning conditions have generally found differences during loss learning only (Brown et al., 2018; Browning et al., 2015), although some have found differences in both conditions (Gagne et al., 2020).

Lastly, there may be some diagnostic specificity among internalizing disorders. Internalizing disorders comprise depression, anxiety, and related disorders (including PTSD and OCD) and can be further divided into fear- versus distress-based disorders and symptoms (Craske et al., 2009; Watson, 2005). Specifically, some studies have found learning differences that were related to fear, and not distress-based, disorders and symptoms (Brown et al., 2018; Norbury et al., 2021). In contrast, a study comparing GAD and MDD did not find specificity to symptoms of one of these disorders (Gagne et al., 2020). One explanation may be that fear-related symptoms and disorders are more related to these learning differences than distress-related disorders (Clark & Watson, 1991; McTeague & Lang, 2012b).

Other learning differences in the studies reviewed in Table 2 indicate differences in learning from good versus bad outcomes, but this learning impairment also appears in studies of depression (Abend et al., 2022; Brown et al., 2021; Gagne et al., 2020; Lamba et al., 2020; Wise & Dolan, 2020). The direction of results in individual studies varies; a recent computational meta-analysis suggested that anxiety and depression show greater punishment learning rates (Pike & Robinson, 2022). However, investigating the specificity of uncertainty-related learning disruptions to fear-based symptoms and disorders requires further studies in samples with a broad range of fear and distress-related symptoms and more precise measures of these symptom dimensions, rather than relying on measures such trait anxiety that measure internalizing (spanning depression and anxiety) symptoms more generally (Knowles & Olatunji, 2020).

Anxiety as maladaptive aversive uncertainty learning

Given the task design and model formulation differences in the studies reviewed above, what consistent learning differences are present in anxiety? Most of the above studies manipulated unexpected or irreducible uncertainty, but not both. Adaptive learning requires simultaneously assessing the level of different types of uncertainty in one’s environment and adjusting learning accordingly. In tasks or blocks with high unexpected uncertainty (20 or fewer trials per block; Brown et al., 2018; Zika et al., 2022 and volatile blocks in Browning et al., 2015; Gagne et al., 2020), people with anxiety show slightly slower learning overall, but accelerate learning more after very surprising outcomes that indicate obvious changes. Meanwhile, in tasks or conditions with high irreducible uncertainty (75%/25% contingency or less), people with anxiety show a generally higher learning rate, particularly after surprising losses (Homan et al., 2019; stable blocks in Browning et al., 2015; Gagne et al., 2020)—although this effect is less consistent (Norbury et al., 2021; Zika et al., 2022). These patterns suggest that people with anxiety have an impaired ability to discern whether prediction errors result from irreducible uncertainty, requiring a slower learning rate, or true changes in contingencies, requiring a higher learning rate. As a result, they show lower learning rates when learning rates should be high from high unexpected uncertainty and higher learning rates when learning should be slow from noisy outcomes. Other proposed explanations for this pattern of results include that people with anxiety may infer more information content from negative outcomes (Pulcu & Browning, 2019) or show specific reductions in estimated irreducible uncertainty (Piray & Daw, 2021). Crucially, despite inconsistent results from individual studies, using a common theory and family of generative models enables synthesis across studies to propose and test a shared impairment.

Maladaptive uncertainty learning can explain many non-computational findings. Failure to discern between irreducible and unexpected uncertainty can lead to difficulties determining when contexts have changed, potentially explaining fear extinction and retention impairments commonly seen in anxiety (Duits et al., 2015; Marin et al., 2017). Within a context, difficulties estimating the amount of irreducible uncertainty present may impair detection of consistently safe stimuli, as is found with reduced safety signal learning (Duits et al., 2015), and discerning differences in contingencies between similar stimuli, as seen with overgeneralization of conditioned fear (Cooper et al., 2022; Fraunfelter et al., 2022). In terms of neural function, anxiety, and particularly panic and other fear-based disorders, is linked to impaired noradrenergic function (Bremner et al., 1996; Charney, 1984; Hendrickson & Raskind, 2016; Kalk et al., 2011; Naegeli et al., 2018). Disrupted noradrenergic signaling leads to excessive activation of the salience network (Etkin & Wager, 2007; Sylvester et al., 2012), impairing learning about uncertainty. Altered activation of regions that control uncertainty signals, including prefrontal cortex and hippocampus, in anxiety (Aupperle & Paulus, 2010; Marin et al., 2017, 2020; Sun et al., 2020) lead to impaired regulation of these uncertainty signals.

Avoidance and anxiety

Current theories of avoidance in anxiety

Avoidance of feared situations is a disabling aspect of anxiety and the primary target in exposure therapy and other treatments. Why anxious people show persistent avoidance in the absence of outcomes is difficult to explain from the traditional behaviorist perspective (Richter et al., 2017). How is a (lack of) behavior maintained in the absence of any reinforcement? One influential early account of avoidance was Mowrer’s two-factor theory (Mowrer, 1951). It proposes that in the first, Pavlovian stage, aversive unconditioned stimuli cause the conditioned stimulus to be associated with fear; avoidance behavior is then reinforced by reductions in learned fear through instrumental learning. For example, if a person is bitten by a dog in a dog park, the dog park becomes associated with fear (stage one: Pavlovian conditioning). If the person then later approaches the dog park, their fear increases, and leaving the area of the dog park—avoidance—is reinforced by a reduction in this fear (stage two: instrumental learning). Despite providing an intuitive account of avoidance that is still employed clinically, two-factor theory had several inconsistencies (Rachman, 1976; Solomon & Wynne, 1954): among them, conditioned avoidance is remarkably resistant to extinction, continues even if it does not terminate the conditioned stimulus, and persists in the absence of fear.

After the demise of two-factor theory, research on instrumental avoidance learning waned and studies of learning in anxiety shifted to simpler Pavlovian fear conditioning paradigms (LeDoux et al., 2017). Some work on instrumental theories continued, notably resulting in Lovibond’s cognitive expectancy model (Lovibond et al., 2008). This model, like two-factor theory, proposes that the relationship between an aversive outcome and a conditioned stimulus is acquired through Pavlovian conditioning while avoidance of the conditioned stimulus is learned through instrumental learning. However, what is learned in the first, Pavlovian, stage is the expectancy of a negative outcome, not fear. During the second stage the organism learns which actions lead to the conditioned stimulus and which actions lead to safer outcomes with a lower expectation of the negative outcome. This avoidance of the expectancy of negative outcomes is processed as the removal of an aversive stimulus and so is reinforcing. Reframing learning in terms of expectancies, rather than drive reduction, addresses many of the difficulties that two-factor theory encountered. For example, expectancies maintain responses during extinction and in the absence of fear. However, the abstract concept of expectancy, with its roots in cognitive models, has been difficult to integrate into behavioral models of threat learning and avoidance.

Other recent theories of avoidance abandoned explanations of threat avoidance as an instrumental phenomenon. Drawing on nonhuman studies, LeDoux and colleagues (LeDoux et al., 2017) proposed that avoidance behavior can be broken down into three, potentially sequential, processes: Pavlovian conditioning leading to specifies-specific defensive responses (Bolles, 1970), such as withdrawal or freezing controlled by a lateral amygdala—central amygdala—periaqueductal gray circuit; goal-directed avoidance responses triggered by infralimbic prefrontal cortex (analogue of the human vmPFC) and controlled by a lateral amygdala—basal amygdala—nucleus accumbens circuit; and dorsal striatum-based habitual avoidance responses. In contrast to goal-directed avoidance behaviors, which are outcome-sensitive (e.g., “I avoid the mall, because I tend to get panic attacks there”), habitual responses are stimulus-triggered and outcome-insensitive (e.g., “I avoid large crowds”). LeDoux and colleagues refer to the concept of active versus passive coping to explain how avoidance goes awry in anxiety: passive coping, which they relate to maladaptive avoidance, is characterized by maladaptive Pavlovian defensive responses like withdrawal. Active coping can be more adaptive but can become pathological when habitual responses predominate. Therefore, LeDoux and colleagues propose that maladaptive avoidance in anxiety is caused by either excessive Pavlovian defensive responses or excessive habitual responses rather than adaptive goal-directed responses.

In a more clinically focused review, Arnaudova et al. (2017) suggested several other reasons for maladaptive avoidance in anxiety outside of instrumental learning: increased threat appraisal, increased automatic avoidance tendencies, decreased regulation of avoidance in the service of other goals, habitual avoidance responding from overtraining, and increasing psychological distance through experiential avoidance. As with models of uncertainty in anxiety, though, the authors do not offer a unifying framework explaining how these processes relate to each other and disruptions in anxiety. In addition, some of these explanations (increased threat appraisal, experiential avoidance, and decreased regulation of avoidance in service of other goals) can indeed be conceptualized with updated instrumental learning models (Huys & Renz, 2017; Lovibond et al., 2008). The other explanations, increased automatic avoidance tendencies and habitual avoidance responding, are similar to the processes proposed by LeDoux and colleagues; these are reviewed in the next paragraph.

Excessive Pavlovian withdrawal in the presence of aversive stimuli in anxiety has some evidence in the literature (Grillon et al., 2017; Loijen et al., 2020; Mkrtchian et al., 2017; but see Struijs et al., 2017). These studies do not show specificity to anxiety; instead, increased inhibition may be a feature of all internalizing disorders. Therefore, people with anxiety and other internalizing disorders may engage in excessive withdrawal with negative stimuli, manifesting as increased passive avoidance, as theorized by Arnaudova and colleagues (Arnaudova et al., 2017; increased automatic avoidance tendencies) and LeDoux and colleagues (LeDoux et al., 2017; maladaptive Pavlovian defensive responses). The evidence for habit learning abnormalities in anxiety is less clear. Studies with large, transdiagnostic samples show that increased habit learning, as measured by reduced model-based planning, is due to compulsive symptoms, not anxiety (Gillan et al., 2016, 2020, 2021; Heller et al., 2018), and people with anxiety actually show greater model-based planning in some contexts (Hunter et al., 2021). In addition, true habitual avoidance in humans may be much more limited than in animal studies of active avoidance (de Wit et al., 2018); studies finding greater habitual learning may instead be picking up on reduced model-based learning in compulsivity. Regardless, maladaptive avoidance in anxiety does not seem to stem from an increased tendency to form habits.

Other relevant theories of defensive behaviors, including avoidance, come from ethology and have not been empirically tested in anxiety. One idea is that behavior depends on the imminence of predator threat (Fanselow, 1994; Levy & Schiller, 2021; Mobbs et al., 2015, 2020): animals behave very differently when safe (predator threat unlikely), under pre-encounter threat (threat not present but more likely), post-encounter threat (threat present but not attacking), and circa-strike threat (predator attacking). Active avoidance behaviors are most likely during pre-encounter threat, where model-based planning to avoid greater threat is helpful, whereas passive avoidance and stereotyped defensive behaviors become more likely as the threat becomes imminent, during post-encounter or circa-strike threat (Mobbs et al., 2020). Threat imminence theories have not incorporated effects of uncertainty on defensive behavior; related work on safety proposes that certainty about safety, the opposite of uncertainty about threat, reduces defensive behaviors (Tashjian et al., 2021), whereas greater uncertainty about the source of threat in anxiety may increase perceived threat imminence (Raymond et al., 2017).

In summary, although Mowrer’s two-factor theory informed exposure therapy, it fails to explain many behaviors. Subsequent writers offered explanations for a broader range of avoidance behaviors, but these explanations required a departure from behaviorist specifications, impeding integration with current reinforcement learning theories. Accounts of avoidance that do not consider instrumental learning cannot explain instrumental behaviors that comprise active avoidance or else postulate behavioral tendencies, such as increased reliance on habitual behavior, that are inconsistent with empirical findings in anxiety. Ethological ideas about threat imminence help to understand situations where avoidance behavior arises but do not address the role of uncertainty and have yet to be applied to anxiety. In the following sections, we consider how reinforcement learning can formalize some of these ideas, such as cognitive expectancies and a spectrum of defensive behaviors, and advance a process account of uncertainty learning to explain avoidance in anxiety.

The explore-exploit dilemma and avoidance

Avoidance involves choices or actions, which are absent in Pavlovian conditioning paradigms primarily used to study anxiety. Choices an organism or agent makes about which stimuli they encounter and learn from affects what they know about their world. Uncertainty about the outcomes of available choices gives rise to the explore-exploit dilemma (Kaelbling et al., 1996; Sutton & Barto, 1998). This dilemma is whether one should choose certain, high value options to maximize short-term rewards (exploitation) or sample alternatives with lower or unknown values in the hope of discovering superior options and maximizing long-term rewards (exploration). Neurally, components of the frontoparietal control network as well as striatum and amygdala show activation to uncertainty when explore/exploit decisions are required (Blanchard & Gershman, 2018; Hogeveen et al., 2022).

The relative benefits of exploitation versus exploration depend on the type of uncertainty (Cohen et al., 2007). Exploring to reduce uncertainty can maximize value in the long run if estimation uncertainty predominates, such as at the beginning of a block of trials (Rich & Gureckis, 2018; Wilson et al., 2014, but see Payzan-LeNestour & Bossaerts, 2012). With high estimation uncertainty and several chances to maximize value, initial exploration reduces uncertainty, leading in turn to better knowledge of contingencies and choices resulting in higher-valued outcomes on later trials. Exploration also increases with high unexpected uncertainty if it helps detect possible change points (Navarro et al., 2016). In contrast, if uncertainty is primarily irreducible, exploration will not reduce uncertainty and so will not improve contingency knowledge for later choices. Exploration also is unhelpful in single-shot learning or brief episodes where too few future trials exist to benefit from reducing uncertainty (Rich & Gureckis, 2018; Wilson et al., 2014). This effect of differences in uncertainty on exploration versus exploitation is similar to uncertainty’s effect on learning rates: high relative estimation uncertainty leads to high learning rates and exploration, whereas high relative irreducible uncertainty leads to low learning rates and exploitation. The effect of unexpected uncertainty is unclear; because it leads to high learning rates, it may cause more exploration, or it may reduce exploration if uncertainty about contingencies possibly changing makes learning more about the current environment unhelpful.

These relationships between uncertainty and exploration apply to environments with rewarding outcomes. In aversive contexts, people show a different pattern of behavior: when choosing among loss outcomes, humans and other organisms seem to shift their goal from maximizing long-term value to avoiding negative outcomes as much as possible in the short term. In environments where all outcomes are negative, more uncertain outcomes offer the prospect of potential safety. This search for safety causes people to become more uncertainty seeking overall, even during single-shot learning (Krueger et al., 2017; Lejarraga & Hertwig, 2017). People also explore high estimation uncertainty options less when some options lead to catastrophic failure (Schulz et al., 2018). People may overexplore when uncertainty cannot be reduced or controlled, as with irreducible uncertainty, and underexplore to stay in safe areas when uncertainty can be controlled, as with estimation uncertainty. The effect of unexpected uncertainty on exploration in aversive contexts has yet to be investigated, but, like estimation uncertainty, it may decrease exploration, especially if the option to avoid making any potentially dangerous choice is available.

This myopic focus on minimizing short-term losses in aversive environments appears counterproductive but could improve survival in real-world environments where a small number of negative outcomes could cause injury, illness, or death and foreclose the chance of any future choices (Bateson, 2002; Korn & Bach, 2015; Mehlhorn et al., 2015). If anxiety is related to maladaptive aversive uncertainty learning, avoidance would result from increased exploitation and decreased exploration from misperception of uncertainty. However, fully formulating this effect requires an understanding of how instrumental avoidance relates to reinforcement learning.

Temporal difference Markov decision process models of avoidance

Lovibond’s cognitive expectancy model was formulated as “cognitive,” because a behaviorist framework could not accommodate the concept of internal expectancies as a driver of behavior. However, computational theories of temporal difference reinforcement learning (Sutton & Barto, 1987, 1998) are able to capture similar processes fully within a learning framework (Maia, 2010; Moutoussis et al., 2008; Seymour et al., 2004). To explain avoidance, these models must explain how previous actions and values influence later outcomes. The RL paradigms and models reviewed earlier can explain only single-step learning, where learning and choice behavior is influenced by immediate rewards and punishments. In RL, these are known as bandit problems. A more extended formulation of RL, known as Markov decision processes or MDPs, can explain how actions influence future outcomes as well.

MDP models of reinforcement learning describe learning as a multistep process, where a person (or more generally, an agent) travels through several states, some of which are associated with rewards (or losses; Fig. 2; Sutton & Barto, 1998). Often, an outcome is only experienced at the end of a trip, or episode, through the MDP. States with outcomes are termed terminal states, as the episode ends in one of these states and results in reward or punishment. The values of earlier actions and states are learned based their relationships to the outcomes obtained in terminal states. In an MDP, transitions between each set of states acquire values that depend on the probabilities of transitioning to future states in the MDP (transition probabilities) and the outcomes received when transitioning to a terminal state. Therefore, an MDP consists of states, rewards (in MDP terminology, rewards encompass all outcomes, including negative ones), actions describing what is done at each state, and transition probabilities between states.

Fig. 2.

Fig. 2

Markov decision processes. A. MDP formulation of an instrumental learning paradigm or bandit task. Each trial has two stimuli: one with a 75% chance of leading to reward and a 25% chance of leading to no reward; the other stimulus has the opposite probabilities. This MDP has three states: the state prior to choice (s3), where the agent is presented with the two stimuli; two outcome, or terminal, states, s2 and s1; and two actions, choosing one stimulus (a1) and choosing the other (a2). Transitioning from state s3 to the two terminal states s1 and s2 have rewards of 1 and 0, respectively. There is a transition probability for each action-transition pair: Pa1(s3, s1) (the probability of transitioning from state s3 to state s1 given action a1) is 0.75, Pa1(s3, s2) is 0.25, Pa2(s3, s1) is 0.25, and Pa2(s3, s2) is 0.75; these probabilities re-express the 75%/25% and 25%/75% outcome contingencies for the stimuli. In this MDP, there is one choice: select either a1 or a2. B. MDP formulation of an instrumental learning paradigm with two steps to the outcome. Compared to the MDP in panel A, this MDP has another state (s4) with another set of stimuli to choose between that lead to reward, and a state (s5) where one chooses between stimuli that probabilistically lead to states s4 or s3. C. MDP of a set of possible states and actions for a person with social anxiety going to a party. This example illustrates a subset of potential states and actions (e.g., another possible state after the “make a joke” action is that the group laughs at your joke, but then you accidentally sneeze on someone, which also leads to the core fear of negative social evaluation). Note that there are three terminal states: staying in a conversation (person’s goal, small positive reward), returning home without engaging in conversation (total avoidance, no reward), and being excluded (feared outcome, large negative reward). D. Abstraction of the MDP in Panel C. E. First stage of Pavlovian learning as proposed by (Moutoussis et al., 2008) and (Maia, 2010). Values propagate from the terminal states to other states and actions through learning. Values inside each state (represented by boxes) represent the learned value of each state, whereas values next to arrows represent the change in value when transitioning between those states. F. Second stage of instrumental learning as proposed by (Moutoussis et al., 2008) and (Maia, 2010). Actions are taken with frequencies, denoted by line thickness, based on the values acquired through Pavlovian learning in the first stage

As an example of a very simple MDP, we can reconceptualize a trial from a basic instrumental learning task (Fig. 2A). In the simple MDP described in Fig. 2A, all state transition rewards are immediately experienced. In a larger MDP, some state transitions lack immediate reward and must be learned (Fig. 2B, which is similar to the design of the popular two-step task; Daw et al., 2011). How choices are made is called a policy in MDP terminology, which can be the choice processes discussed earlier or learned over time through reinforcement learning. In the (Moutoussis et al., 2008) and (Maia, 2010) models, the tendency to choose each action is slowly increased or decreased based on whether the resulting reward is better or worse than expected using an actor-critic architecture. In the two-step MDP in Fig. 2B, the transitions from state s5 to s4 and s5 to s3 have no immediate reward; instead, the values of these state transitions are learned through experience and depend on the combination of policy, actions, and other states’ immediate rewards.

According to temporal-difference theories of avoidance learning using MDPs, each component of avoidance behaviors is represented by states and actions (Figs. 2C and D). Initially, the unconditioned stimulus is encountered, resulting in a negatively valued reward. Through Pavlovian conditioning, the state transitions more likely to lead to the unconditioned stimulus acquire more negative values than state transitions with a low probability of doing so (Fig. 2E). Then, when choosing actions, the agent preferentially chooses higher-valued state transitions less likely to result in the negative outcome (Fig. 2F). In the model of (Moutoussis et al., 2008), comparing the outcome after successful avoidance to other possible outcomes (i.e., encountering the unconditioned stimulus) using advantage learning (Dayan & Balleine, 2002) results in a positive value for avoidance behaviors and serves as the instrumental reinforcer. In (Maia, 2010) the reinforcement comes from a reduction in negative values when transitioning into safer states. In both models, avoidance continues to be reinforcing even if the negative outcome is not experienced after initial conditioning.

It is difficult, however, for these temporal difference models of avoidance to explain avoidance in anxiety. One feature of anxious avoidance is that the feared outcome (in the example in Fig. 2, being socially shunned) is rarely or never encountered. The proposed models require this outcome to be experienced during initial Pavlovian conditioning. Human learning, however, can happen without direct experience. Internal simulations of possible outcomes, observational learning, and instructed learning all cause states to acquire negative values and to be avoided (Askew & Field, 2007; Dymond et al., 2012; Muris & Field, 2010; Olsson & Phelps, 2004, 2007; Rachman, 1977). For example, a person who fears a panic attack in a public place, because it may lead to a heart attack and death may have learned this feared outcome from: simulating a possible negative outcome of death when feeling faint; hearing about a person who felt faint and then had a heart attack; observing a parent avoiding feeling faint and inferring a negative outcome from the parent’s avoidance; or being told that chest tightness is a sign of a heart attack.

Another component of MDPs relevant to clinical avoidance, but not yet addressed in temporal difference MDP models of avoidance, is the role of cached sequences of states and actions, or options (Sutton et al., 1999). Options allow sequences of states and actions to be grouped and executed together. In scenarios where certain sequences occur often, such as the steps of cooking a meal, options simplify complex MDPs by reducing them into sets of sequences rather than evaluating each state and action individually. Humans show frequent and flexible use of options and similar forms of cached sequences (Botvinick et al., 2009; Huys et al., 2015; Xia & Collins, 2021). People may rely on options more with greater threat imminence, as increased threat imminence causes use of less cognitively demanding and more model-free forms of decision-making (Mobbs et al., 2020). Notably, greater use of options leads to inflexible behavior that disregards outcomes of intermediate steps, resembling habitual behavior. Unlike habitual behavior, options are goal-directed behaviors that can be interrupted if indicated. The tendency of people with anxiety to fall back on avoidance behavior when uncertainty is higher or in situations with greater stress may reflect a greater reliance on options with greater threat imminence.

Viewing avoidance as the outcome of temporal difference learning allows us to model differences in learning and choice explicitly and examine their relationships with avoidance behavior. The few existing studies on temporal difference MDP-like learning in anxiety provide some insight into how avoidance behavior may arise in anxiety. (Vervliet et al., 2017) measured self-reported positive emotion as a proxy for model-derived positive prediction errors during an avoidance task. Participants’ reported emotions showed similar patterns as prediction errors inferred from a temporal difference account of avoidance, but participants with less distress tolerance had less specific relief signals. They proposed that intolerance of distress, common to all internalizing disorders, may be linked to overgeneralized avoidance learning. Initial modeling simulations also show that modified policies representing pessimistic or catastrophizing choice processes lead to excessively negative valuation and avoidance as well as risk aversion (Gagne & Dayan, 2022; Zorowitz et al., 2020). This form of choice process, where avoiding negative outcomes is favored over maximizing reward, may better represent normative, negative outcome-avoiding behavior in aversive environments than anxiety, however. Other theoretical work has suggested that excessive active avoidance results from differences in policy, where even slightly negative values are not chosen, or from learning, where state transitions would take on more negative values (Raymond et al., 2017); however, these learning and choice alterations have not been empirically demonstrated in anxiety. To our knowledge, no work has yet connected the maladaptive uncertainty learning seen experimentally in anxiety to these models of avoidance.

Uncertainty-sensitive markov decision-process models and anxious avoidance

Examining how uncertainty shapes explore/exploit decisions with temporal difference MDP models can help us understand how anxious avoidance may result from maladaptive uncertainty learning in anxiety. Adding uncertainty to temporal difference MDP models means that choices depend not only on learned values but also, through explore/exploit effects, on uncertainty (Fig. 3). Increases in exploitation will increase selection of known, safe states, while increased exploration will decrease reducible uncertainty and increase certainty about transition probabilities, but also lead to terminal states with negative outcomes more often. Meanwhile, in an aversive environment, policies shift to minimize negative outcomes in the short term. This shift can be represented by discarding choices whose range of potential outcomes include more negative values, as in the risk avoiding policies reviewed above (Gagne & Dayan, 2022; Zorowitz et al., 2020). Therefore, normative behavior in uncertain, aversive MDPs will show avoidance of uncertain states from increased exploitation of safer states. This avoidance maintains high uncertainty and perpetuates avoidance by preventing exploration of more uncertain states. Additionally, increases in any form of uncertainty will reduce the distinction between early safe and threat-related stages, causing previously safe stages to be perceived as more dangerous.

Fig. 3.

Fig. 3

Effects of uncertainty on explore/exploit decisions within a Markov decision process. Top, effect of each type of uncertainty on exploration in appetitive and aversive environments. In rewarding environments, irreducible uncertainty decreases exploration, while estimation uncertainty increases; effects are opposite in aversive environments. Effects of unexpected uncertainty have been less studied, so are qualified with ?, but appear to increase exploration in appetitive contexts and so may decrease exploration in aversive contexts. Bottom, possible levels of uncertainty under different circumstances and effects on exploration in aversive environments. As anxiety is hypothesized to primarily affect uncertainty estimation in aversive environments, explore-exploit behavior should show few differences with anxiety when learning about reward.

We propose that in anxiety, impaired learning about aversive uncertainty will increase the amount of estimation uncertainty (from impairments in detecting true change points) and unexpected uncertainty (from problems learning about irreducible uncertainty that cause change points to be over-inferred). Because these types of uncertainty normatively increase exploitation, greater estimation and unexpected uncertainty in anxiety will further increase exploitation of safer states and avoidance of uncertain states. Continued avoidance of uncertain states will maintain inappropriately elevated uncertainty estimates, ensuring continued avoidance.

Uncertainty about threat elicits distress and arousal above what is caused by threat itself (Bechara et al., 1997; de Berker et al., 2016) and causes uncertain states to be perceived as distressing. This distress increases with maladaptive increases in uncertainty, as commonly experienced by people with anxiety and may explain why states with higher uncertainty are perceived as aversive. Ethologically, uncertainty can modulate perceived threat imminence: one cannot rule out a worst-case scenario where the threat is near. Perceptions of greater threat imminence resulting from high uncertainty could shift defensive behaviors along the threat imminence spectrum toward more model-free or cached actions (Fanselow, 1994; Mobbs et al., 2015, 2020). This shift would promote passive avoidance behaviors, such as withdrawal from or freezing in states with uncertain threats. It also may lead to greater reliance on options and similar decision-making heuristics, resulting in inflexible avoidance behaviors. Neurally, reduced or altered frontoparietal network activation in the presence of altered uncertainty signals in anxiety (Grupe & Nitschke, 2013; Hauner et al., 2012; Sylvester et al., 2012) may underlie changes in explore/exploit decision-making.

Integrating maladaptive aversive uncertainty learning, avoidance, and exposure therapy

Exposure as remediating maladaptive uncertainty learning and resulting avoidance in anxiety

States high in uncertainty, particularly estimation and unexpected uncertainty, will be avoided in aversive environments. Exposure therapy encourages exploration of these avoided states. Through repeated exploration of states initially estimated to have high uncertainty, people with anxiety update uncertainty estimates closer to normative values (Fig. 4)—in contrast to the traditional account of exposure as extinction, where repeated encounters with feared stimuli normalize exaggerated learned threat. These reductions in uncertainty have further effects. First, states acquire more certainty about their likelihood of leading to feared versus safe outcomes. This certainty increases the value of states as their probability of leading to the feared outcome decreases. Second, the reduction in uncertainty itself increases exploration relative to exploitation, encouraging further experiences with previously avoided states. This increase in exploration is the opposite of the feedback cycle caused by avoidance—while avoidance maintains incorrect uncertainty estimates, in turn perpetuating avoidance, increased exploration remediates and reduces uncertainty estimates which in turn favors more exploration. Meanwhile, decreases in uncertainty reduce negative emotions, by reducing uncertainty-related distress, and enable use of more flexible decision-making strategies through decreased perceptions of threat imminence.

Fig. 4.

Fig. 4

Illustration of changes in value and types of uncertainty with anxiety, avoidance, and successful exposure therapy, and effects on changes in value and uncertainty on avoidance. Example scenario, from the social anxiety MDP in Fig. 2, illustrates a choice between an avoidance behavior, leaving the group to stand by the snack table, and a nonavoidance behavior, making a joke during the conversation. In normative behavior, the nonavoidance behavior has higher value but also greater irreducible uncertainty (e.g., there is a greater risk that making a joke will go poorly and lead to negative outcomes compared to leaving to get a snack). With high anxiety, before developing avoidance behaviors, uncertainty is miscalculated as greater estimation and/or unexpected uncertainty rather than irreducible uncertainty, leading to greater tendency to avoid the more uncertain stimulus and choose the safe avoidance behavior. Engaging in avoidance behaviors increases the value of the avoidance vs. nonavoidance behavior (due the temporal difference learning processes illustrated in Fig. 2E and F). Additionally, greater experience with outcomes stemming from the avoidance behavior reduces uncertainty associated with the chosen behavior while the uncertainty associated with the unchosen, nonavoidance behavior is not reduced. Both of these processes increase the tendency to avoid. Initial exposure sessions begin to correct uncertainty associated with the nonavoidance behavior through experience with that choice’s outcomes. Avoidance is decreased relative to pre-exposure behavior but is still higher than normative behavior. After repeated exposures, the relative values of avoidance vs. nonavoidance behavior normalize along with further corrections in uncertainty calculations. As a result, the tendency to avoid becomes similar to normative behavior

Conceptualizing exposure as reducing avoidance and uncertainty in this framework can account for empirical findings inconsistent with some previous theories, notably the different effects of between and within session fear reduction on treatment outcomes. In exposure therapy, each individual exposure exercise would serve as a single episode of an MDP. In MDPs, values and uncertainties are generally only updated once the episode is over; therefore, uncertainty estimates will not be updated until the patient completes the exposure and fails to encounter the feared outcome. After completing the exposure, maladaptive uncertainty estimates are updated for the next episode. Therefore, though some habituation to uncertainty-related fear and discomfort may occur during each exposure session, particularly if a single session contains repeated exposure experiences, the primary driver of response is the between-session update in uncertainty values. This update will manifest primarily in changes in between-session differences in reported fear, consistent with clinical findings.

Reductions in perceived uncertainty could affect other aspects of avoidance. By reducing threat imminence, lower uncertainty can diminish passive avoidance behavior and reliance on inflexible options. Exposure also may directly target the use of options and other forms of cached sequences. Instead of engaging in stereotyped sequences of avoidance behaviors (e.g., always using a remote entrance in a mall and taking a longer path to the intended store to avoid crowds), exposures encourage alternate behaviors incompatible with these sequences (e.g., taking the most direct path to the store) and induce more flexible behavior.

One open question is whether exposure therapy reduces learned uncertainty estimates or uncertainty learning itself. It is possible that, given multiple exposure sessions, patients become more accurate at updating values based on irreducible and unexpected uncertainty and show an improvement in underlying learning mechanisms. However, learning itself may not change during exposure, especially if exposure is not extensively practiced. Corrections in uncertainty estimates, but not vulnerabilities in uncertainty learning itself, may explain relapse after response to exposure therapy (Vervliet et al., 2013) and serve as a treatment target for novel therapeutic approaches.

Practical advantages of incorporating uncertainty into anxiety treatment

Reconceptualizing clinical anxiety in our proposed framework resolves many of the issues with exposure therapy and other forms of anxiety treatment. This framework addresses components of exposure that are not explained well by current theories of exposure therapy and explains how nonbehavioral treatments can also remediate maladaptive uncertainty learning. It also facilitates applications of basic research into learning processes and synergistic somatic treatments to exposure therapy.

First of all, remediating maladaptive uncertainty learning explains how components like avoidance and the need to target core fears relate to exposure therapy. This framework places avoidance as a central target of exposure: avoidance results from miscalculations of uncertainty and is reduced through exploration during exposure. Maladaptive avoidance, therefore, represents the extent of uncertainty miscalculations and decreases as uncertainty calculations are remediated. The importance of ensuring a focus on a patient’s core fear also can be explained within this framework: a core fear is represented by a specific terminal state in a specific MDP. Targeting portions of a MDP that are related to the core fear will lead to changes in uncertainty in that part of the MDP, rather than in other parts of the MDP (or other MDPs altogether) that do not affect that core fear.

Second, changes in uncertainty can also be accounted for by non-behavioral approaches. Although they target maladaptive uncertainty learning in different ways, shared effects on uncertainty may explain similar processes and outcomes across treatment approaches (Arch & Craske, 2008; Carpenter et al., 2018). Cognitive therapy techniques such as cognitive challenges and behavioral experiments also test relationships among components of an MDP to remediate maladaptive uncertainty calculations. “Downward arrow,” where patients make concrete connections between each layer of fear until reaching their core fear, and similar cognitive techniques make relationships between sequences of actions and states explicit, breaking up options. Third-wave mindfulness approaches target the distress resulting from increased uncertainty by embracing the inability to predict and control outcomes; these and other distress tolerance-focused approaches may reduce the effect of uncertainty on explore-exploit decisions and so reduce the effect of maladaptive uncertainty calculations on avoidance.

Next steps for understanding uncertainty dysfunctions in anxiety and treatment mechanisms

Conceptualizing disrupted uncertainty learning as the basis for avoidance and other impairments in anxiety provides several avenues to test this hypothesis and its predictions. First, although anxiety is related to greater physiological responses to uncertain threat in Pavlovian paradigms, the hypothesized relationship between impaired uncertainty learning, miscalculated uncertainty estimates, and avoidance needs to be tested using instrumental learning paradigms. These learning paradigms should use computational process models to derive precise measures of uncertainty and behavioral avoidance. Follow-up studies should investigate specifics about this relationship. Specifically, it is currently unclear whether uncertainty learning impairments result in avoidance only for fear-based disorders, versus for internalizing disorders more generally, and whether using disorder-specific stimuli (e.g., angry faces for people with social anxiety) has different effects than using generally aversive stimuli (e.g., losing points or receiving an electric shock).

Another area to test is whether differences in neural responses to threat in anxiety are due to impaired uncertainty processing. Overlapping brain regions and networks are implicated both in anxiety and uncertainty processing, particularly the salience and frontoparietal networks and regions including dorsal anterior cingulate cortex, vmPFC, insula, amygdala, BNST, and PAG. Neuromodulators, such as norepinephrine, serotonin, and dopamine, also are hypothesized to play roles in uncertainty processing and anxiety.

Clinically, alterations in uncertainty, and reductions in these alterations with successful exposure therapy, should correspond with several observations in anxiety. If anxious avoidance is due to altered uncertainty learning, greater uncertainty learning dysfunctions should increase with greater anxious avoidance as well as more severe anxiety symptoms overall. As a readout of impaired uncertainty estimates, changes in avoidance should predict and correlate with overall symptom improvement and treatment success. Additionally, if changes in maladaptive uncertainty learning with treatment reduce disruptions in uncertainty estimates, the extent of changes in uncertainty learning with treatment should predict reduced risk of relapse after treatment. One issue with these proposed relationships is the difficulty of accurately measuring behavioral avoidance; future work should test if ecological momentary assessment, passive sensing, or other novel approaches can provide valid measurement of avoidance behavior (Craske & Tsao, 1999; Rashid et al., 2020).

A full theory of maladaptive aversive learning and avoidance in anxiety does not just allow a better understanding of current approaches; this theoretical framework can be used to test augmentations and novel treatments and to tailor treatment components to individual patients. Basic human and non-human neurobehavioral learning research can be used to 1) test components of normative learning and decision-making to better understand how normative learning occurs in MDPs with uncertain, aversive outcomes, and 2) given a specified learning difference in clinical anxiety, to test how this difference can be remediated. Interventions incorporating insights from basic research on potential learning targets would then be tested in people with clinical anxiety. Possible treatments could affect several targets: they could remediate disrupted uncertainty learning, correct miscalculated uncertainty, or encourage exploration of uncertain states in negative environments. Such treatments could include new psychotherapy techniques, including those that draw on basic learning research, as well as somatic and pharmacological approaches targeting these processes.

Maladaptive aversive uncertainty learning and avoidance also should be connected to other impairments in anxiety. Fear-based disorders show hyperarousal and attention bias to threat. These impairments could be due to altered calculations of uncertainty that cause the world to be less well predicted and, by moving along the threat imminence spectrum, appear more threatening. Normative studies show that uncertainty during learning tasks affects attention (Stojić et al., 2020; Walker et al., 2019; but see Wise et al., 2019), but this has not been studied in anxiety. The use of psychometrically valid measurements will be particularly important for assessing this relationship (Price et al., 2019; Rodebaugh et al., 2016; Woody et al., 2017). Understanding these relationships will lead to a fuller explanation of impairments in anxiety and how they can be jointly targeted.

This theoretical framework also enables links between dysfunctional uncertainty learning and risk factors for anxiety to be studied. Understanding how early life stress, perceptions of control, anxiety sensitivity, and other risk factors for anxiety (Zinbarg et al., 2022) potentiate, maintain, and are caused by disrupted uncertainty learning can show how these risk factors can be modified or buffered. Longitudinal studies will be especially important to understand the relationships between these risk factors and uncertainty learning and avoidance over time (Struijs et al., 2018; Wise et al., 2022).

These research directions illustrate the limits of our proposed framework: many components require further testing before this framework can influence clinical practice. In addition, we make assumptions based on the current state of the literature (e.g., the distinction between fear- and distress-based disorders), which will require updates as our knowledge in these areas evolve. We also do not account for aspects of anxiety symptoms and exposure therapy processes that are more transdiagnostic, such as the role that increasing distress tolerance has on the ability to stay in anxiety-provoking situations. Additionally, the function of exposure therapy in non-fear based disorders, like generalized anxiety disorder or in eating disorders, is not explained by this framework.

More generally, our proposed framework exemplifies the benefits of drawing from both clinical knowledge and neurocomputational findings in computational psychiatry research. By integrating clinical knowledge from trial-and-error and verbal theories via clinical experience, lived experience, case reports, and clinical trials with basic knowledge from theoretical work and studies of basic behavior and neural function in humans and other organisms, we can identify gaps in knowledge and work to fill them, with the goal of improving treatment and prevention of psychiatric illness.

Conclusions

Exposure therapy and other treatments for anxiety show moderate success, but exposure therapy is based on theoretical assumptions (i.e., that pathological anxiety results from excessive learned threat) inconsistent with the empirical literature. Instead, uncertainty-related learning disruptions predominate in anxiety. Computational process models, particularly reinforcement learning and other error-driven learning models, precisely define different types of uncertainty and their effects on behavior. Research using these models suggests that anxiety is related to difficulty discerning irreducible uncertainty (uncertainty from noisy outcomes) from unexpected uncertainty (uncertainty from changes in the relationships between stimuli and outcomes). In instrumental learning, uncertainty affects learning and decision-making through effects on decisions to explore versus exploit. Uncertainty-sensitive, temporal difference, Markov decision processes can explain normative avoidance behavior, the relationship between disrupted uncertainty learning and avoidance in anxiety, and how treatments, such as exposure therapy, reduce avoidance through remediating uncertainty calculations. In turn, this framework provides avenues for further research to better understand anxiety and its treatment.

Funding

This work was supported by National Institutes of Health grants K23 MH122626 to VMB and R01 MH100095 and R01 MH048463 to AYD.

Conflicts of interest

VMB has received consulting fees from Aya Technologies. The other authors have no conflicts to disclose.

Footnotes

Code availability Not applicable.

Open practices statement There are no data or materials to make available and no experiments to preregister.

Ethics approval not applicable

Consent to participate/for publication not applicable

Data availability

Not applicable

References

  1. Abend R, Burk D, Ruiz SG, Gold AL, Napoli JL, Britton JC, Michalska KJ, Shechner T, Winkler AM, Leibenluft E, Pine DS, & Averbeck BB (2022). Computational modeling of threat learning reveals links with anxiety and neuroanatomy in humans. ELife, 11, e66169. 10.7554/eLife.66169 [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Abramowitz JS (2013). The Practice of Exposure Therapy: Relevance of Cognitive-Behavioral Theory and Extinction Theory. Behavior Therapy, 44(4), 548–558. 10.1016/j.beth.2013.03.003 [DOI] [PubMed] [Google Scholar]
  3. American Psychiatric Association (Ed.). (2013). Diagnostic and statistical manual of mental disorders: DSM-5 (5th ed). American Psychiatric Association. [Google Scholar]
  4. Arch JJ, & Craske MG (2008). Acceptance and commitment therapy and cognitive behavioral therapy for anxiety disorders: Different treatments, similar mechanisms? Clinical Psychology: Science and Practice, 15(4), 263–279. 10.1111/j.1468-2850.2008.00137.x [DOI] [Google Scholar]
  5. Arnaudova I, Kindt M, Fanselow M, & Beckers T (2017). Pathways towards the proliferation of avoidance in anxiety and implications for treatment. Behaviour Research and Therapy, 96, 3–13. 10.1016/j.brat.2017.04.004 [DOI] [PubMed] [Google Scholar]
  6. Askew C, & Field AP (2007). Vicarious learning and the development of fears in childhood. Behaviour Research and Therapy, 45(11), 2616–2627. 10.1016/j.brat.2007.06.008 [DOI] [PubMed] [Google Scholar]
  7. Asnaani A, McLean CP, & Foa EB (2016). Updating Watson & Marks (1971): How Has Our Understanding of the Mechanisms of Extinction Learning Evolved and Where Is Our Field Going Next? Behavior Therapy, 47(5), 654–668. 10.1016/j.beth.2016.02.003 [DOI] [PubMed] [Google Scholar]
  8. Aupperle RL, & Paulus MP (2010). Neural systems underlying approach and avoidance in anxiety disorders. Dialogues in Clinical Neuroscience, 12(4), 517–531. 10.31887/DCNS.2010.12.4/raupperle [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Ayer WA (1972). Implosive therapy: A review. Psychotherapy: Theory, Research, and Practice, 9(3), 242–250. [Google Scholar]
  10. Aylward J, Hales C, Robinson E, & Robinson OJ (2020). Translating a rodent measure of negative bias into humans: The impact of induced anxiety and unmedicated mood and anxiety disorders. Psychological Medicine, 50(2), 237–246. 10.1017/S0033291718004117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Bach DR, & Dolan RJ (2012). Knowing how much you don’t know: A neural organization of uncertainty estimates. Nature Reviews Neuroscience, 13(8), 572–586. 10.1038/nrn3289 [DOI] [PubMed] [Google Scholar]
  12. Bandelow B, Michaelis S, & Wedekind D (2017). Treatment of anxiety disorders. Dialogues in Clinical Neuroscience, 19(2), 93–107. 10.31887/DCNS.2017.19.2/bbandelow [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Bartra O, McGuire JT, & Kable JW (2013). The valuation system: A coordinate-based meta-analysis of BOLD fMRI experiments examining neural correlates of subjective value. NeuroImage, 76, 412–427. 10.1016/j.neuroimage.2013.02.063 [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Bateson M (2002). Recent advances in our understanding of risk-sensitive foraging preferences. Proceedings of the Nutrition Society, 61(4), 509–516. 10.1079/PNS2002181 [DOI] [PubMed] [Google Scholar]
  15. Bechara A, Damasio H, Tranel D, & Damasio AR (1997). Deciding Advantageously Before Knowing the Advantageous Strategy. Science, 275(5304), 1293–1295. 10.1126/science.275.5304.1293 [DOI] [PubMed] [Google Scholar]
  16. Behrens TEJ, Woolrich MW, Walton ME, & Rushworth MFS (2007). Learning the value of information in an uncertain world. Nature Neuroscience, 10(9), 1214–1221. 10.1038/nn1954 [DOI] [PubMed] [Google Scholar]
  17. Blanchard TC, & Gershman SJ (2018). Pure correlates of exploration and exploitation in the human brain. Cognitive, Affective, & Behavioral Neuroscience, 18(1), 117–126. 10.3758/s13415-017-0556-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Bland AR, & Schaefer A (2012). Different Varieties of Uncertainty in Human Decision-Making. Frontiers in Neuroscience, 6. 10.3389/fnins.2012.00085 [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Bolles RC (1970). Species-specific defense reactions and avoidance learning. Psychological Review, 77(1), 32–48. 10.1037/h0028589 [DOI] [Google Scholar]
  20. Borsini F, Podhorna J, & Marazziti D (2002). Do animal models of anxiety predict anxiolytic-like effects of antidepressants? Psychopharmacology, 163(2), 121–141. 10.1007/s00213-002-1155-6 [DOI] [PubMed] [Google Scholar]
  21. Botvinick MM, Niv Y, & Barto AG (2009). Hierarchically organized behavior and its neural foundations: A reinforcement learning perspective. Cognition, 113(3), 262–280. 10.1016/j.cognition.2008.08.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  22. Bremner JD, Krystal H, Southwick SM, & Charney DS (1996). Noradrenergic Mechanisms in Stress and Anxiety: 11. Clinical Studies. Synapse, 23, 39–51. [DOI] [PubMed] [Google Scholar]
  23. Brown VM, Chen J, Gillan CM, & Price RB (2020). Improving the reliability of computational analyses: Model-based planning and its relationship with compulsivity. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging, 5(6), 601–609. ISSN 2451–9022. 10.1016/j.bpsc.2019.12.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Brown VM, Zhu L, Solway A, Wang JM, McCurry KL, King-Casas B, & Chiu PH (2021). Reinforcement Learning Disruptions in Individuals With Depression and Sensitivity to Symptom Change Following Cognitive Behavioral Therapy. JAMA Psychiatry. 10.1001/jamapsychiatry.2021.1844 [DOI] [PMC free article] [PubMed] [Google Scholar]
  25. Brown VM, Zhu L, Wang JM, Frueh BC, King-Casas B, & Chiu PH (2018). Associability-modulated loss learning is increased in posttraumatic stress disorder. ELife, 7, e30150. 10.7554/eLife.30150 [DOI] [PMC free article] [PubMed] [Google Scholar]
  26. Browning M, Behrens TE, Jocham G, O’Reilly JX, & Bishop SJ (2015). Anxious individuals have difficulty learning the causal statistics of aversive environments. Nature Neuroscience, 18(4), 590–596. 10.1038/nn.3961 [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. Buhr K, & Dugas MJ (2002). The intolerance of uncertainty scale: Psychometric properties of the English version. Behaviour Research and Therapy, 40(8), 931–945. 10.1016/S0005-7967(01)00092-4 [DOI] [PubMed] [Google Scholar]
  28. Bulteau S, Laurin A, Bach-Ngohou K, Péré M, Vibet M-A, Hardouin J-B, Sebille V, Lagalice L, Faurel-Paul É, Acier D, Rabeyron T, Riche V-P, Sauvaget A, Melki F, Vigier T, Perreira Da Silva M, Charlet O, & Prié Y (2022). Feasibility of Combining Transcranial Direct Current Stimulation and Active Fully Embodied Virtual Reality for Visual Height Intolerance: A Double-Blind Randomized Controlled Study. Journal of Clinical Medicine, 11(2), 345. 10.3390/jcm11020345 [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Carleton RN (2016). Into the unknown: A review and synthesis of contemporary models involving uncertainty. Journal of Anxiety Disorders, 39, 30–43. 10.1016/j.janxdis.2016.02.007 [DOI] [PubMed] [Google Scholar]
  30. Carpenter JK, Andrews LA, Witcraft SM, Powers MB, Smits JAJ, & Hofmann SG (2018). Cognitive behavioral therapy for anxiety and related disorders: A meta-analysis of randomized placebo-controlled trials. Depression and Anxiety, 35(6), 502–514. 10.1002/da.22728 [DOI] [PMC free article] [PubMed] [Google Scholar]
  31. Charney DS (1984). Noradrenergic Function in Panic Anxiety: Effects of Yohimbine in Healthy Subjects and Patients With Agoraphobia and Panic Disorder. Archives of General Psychiatry, 41(8), 751. 10.1001/archpsyc.1984.01790190025003 [DOI] [PubMed] [Google Scholar]
  32. Charpentier CJ, Aylward J, Roiser JP, & Robinson OJ (2017). Enhanced Risk Aversion, But Not Loss Aversion, in Unmedicated Pathological Anxiety. Biological Psychiatry, 81(12), 1014–1022. 10.1016/j.biopsych.2016.12.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Clark LA, & Watson D (1991). Tripartite model of anxiety and depression: Psychometric evidence and taxonomic implications. Journal of Abnormal Psychology, 100(3), 316–336. [DOI] [PubMed] [Google Scholar]
  34. Cobb AR, O’Connor P, Zaizar E, Caulfield K, Gonzalez-Lima F, & Telch MJ (2021). tDCS-Augmented in vivo exposure therapy for specific fears: A randomized clinical trial. Journal of Anxiety Disorders, 78, 102344. 10.1016/j.janxdis.2020.102344 [DOI] [PubMed] [Google Scholar]
  35. Cohen JD, McClure SM, & Yu AJ (2007). Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration. Philosophical Transactions of the Royal Society B: Biological Sciences, 362(1481), 933–942. 10.1098/rstb.2007.2098 [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Cooper NA, & Clum GA (1989). Imaginal flooding as a supplementary treatment for PTSD in combat veterans: A controlled study. Behavior Therapy, 20(3), 381–391. 10.1016/S0005-7894(89)80057-7 [DOI] [Google Scholar]
  37. Cooper SE, van Dis EAM, Hagenaars MA, Krypotos A-M, Nemeroff CB, Lissek S, Engelhard IM, & Dunsmoor JE (2022). A meta-analysis of conditioned fear generalization in anxiety-related disorders. Neuropsychopharmacology. 10.1038/s41386-022-01332-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Cottraux J, Messy P, Marks IM, Mollard E, & Bouvard M (1993). Predictive factors in the treatment of obsessive-compulsive disorders with fluvoxamine and/or behaviour therapy. Behavioural Psychotherapy, 21, 45–50. [Google Scholar]
  39. Cottraux J, Note I, Yao SN, Lafont S, Note B, Mollard E, Bouvard M, Sauteraud A, Bourgeois M, & Dartigues J-F (2001). A Randomized Controlled Trial of Cognitive Therapy versus Intensive Behavior Therapy in Obsessive Compulsive Disorder. Psychotherapy and Psychosomatics, 70(6), 288–297. 10.1159/000056269 [DOI] [PubMed] [Google Scholar]
  40. Craske MG, Kircanski K, Zelikowsky M, Mystkowski J, Chowdhury N, & Baker A (2008). Optimizing inhibitory learning during exposure therapy. Behaviour Research and Therapy, 46(1), 5–27. 10.1016/j.brat.2007.10.003 [DOI] [PubMed] [Google Scholar]
  41. Craske MG, Rauch SL, Ursano R, Prenoveau J, Pine DS, & Zinbarg RE (2009). What is an anxiety disorder? Depression and Anxiety, 26(12), 1066–1085. 10.1002/da.20633 [DOI] [PubMed] [Google Scholar]
  42. Craske MG, Treanor M, Conway CC, Zbozinek T, & Vervliet B (2014). Maximizing exposure therapy: An inhibitory learning approach. Behaviour Research and Therapy, 58, 10–23. 10.1016/j.brat.2014.04.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Craske MG, & Tsao JCI (1999). Self-Monitoring With Panic and Anxiety Disorders. Psychological Assessment, 11(4), 466–479. [Google Scholar]
  44. Daw ND, Gershman SJ, Seymour B, Dayan P, & Dolan RJ (2011). Model-Based Influences on Humans’ Choices and Striatal Prediction Errors. Neuron, 69(6), 1204–1215. 10.1016/j.neuron.2011.02.027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Dayan P, & Balleine BW (2002). Reward, Motivation, and Reinforcement Learning. Neuron, 36(2), 285–298. 10.1016/S0896-6273(02)00963-7 [DOI] [PubMed] [Google Scholar]
  46. de Berker AO, Rutledge RB, Mathys C, Marshall L, Cross GF, Dolan RJ, & Bestmann S (2016). Computations of uncertainty mediate acute stress responses in humans. Nature Communications, 7(1), 10996. 10.1038/ncomms10996 [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. De Martino B, Kumaran D, Seymour B, & Dolan RJ (2006). Frames, Biases, and Rational Decision-Making in the Human Brain. Science, 313(5787), 684–687. 10.1126/science.1128356 [DOI] [PMC free article] [PubMed] [Google Scholar]
  48. de Wit S, Kindt M, Knot SL, Verhoeven AAC, Robbins TW, Gasull-Camos J, Evans M, Mirza H, & Gillan CM (2018). Shifting the balance between goals and habits: Five failures in experimental habit induction. Journal of Experimental Psychology: General, 147(7), 1043–1065. 10.1037/xge0000402 [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Diederen KMJ, Spencer T, Vestergaard MD, Fletcher PC, & Schultz W (2016). Adaptive Prediction Error Coding in the Human Midbrain and Striatum Facilitates Behavioral Adaptation and Learning Efficiency. Neuron, 90(5), 1127–1138. 10.1016/j.neuron.2016.04.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  50. Duits P, Cath DC, Lissek S, Hox JJ, Hamm AO, Engelhard IM, van den Hout MA, & Baas JMP (2015). Updated meta-analysis of classical fear conditioning in the anxiety disorders: review: updated meta-analysis of fear conditioning in anxiety disorders. Depression and Anxiety, 32(4), 239–253. 10.1002/da.22353 [DOI] [PubMed] [Google Scholar]
  51. Dunsmoor JE, Campese VD, Ceceli AO, LeDoux JE, & Phelps EA (2015). Novelty-Facilitated Extinction: Providing a Novel Outcome in Place of an Expected Threat Diminishes Recovery of Defensive Responses. Biological Psychiatry, 78(3), 203–209. 10.1016/j.biopsych.2014.12.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  52. Dymond S, Schlund MW, Roche B, De Houwer J, & Freegard GP (2012). Safe From Harm: Learned, Instructed, and Symbolic Generalization Pathways of Human Threat-Avoidance. PLoS ONE, 7(10), e47539. 10.1371/journal.pone.0047539 [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Eddy KT, Dutra L, Bradley R, & Westen D (2004). A multidimensional meta-analysis of psychotherapy and pharmacotherapy for obsessive-compulsive disorder. Clinical Psychology Review, 24(8), 1011–1030. 10.1016/j.cpr.2004.08.004 [DOI] [PubMed] [Google Scholar]
  54. Etkin A, & Wager TD (2007). Functional Neuroimaging of Anxiety: A Meta-Analysis of Emotional Processing in PTSD, Social Anxiety Disorder, and Specific Phobia. American Journal of Psychiatry, 164(10), 1476–1488. 10.1176/appi.ajp.2007.07030504 [DOI] [PMC free article] [PubMed] [Google Scholar]
  55. Fanselow MS (1994). Neural organization of the defensive behavior system responsible for fear. Psychonomic Bulletin & Review, 1(4), 429–438. 10.3758/BF03210947 [DOI] [PubMed] [Google Scholar]
  56. Foa EB, & Kozak MJ (1986). Emotional Processing of Fear: Exposure to Corrective Information. Psychological Bulletin, 99(1), 20–35. [PubMed] [Google Scholar]
  57. Foa EB, Yadin E, & Lichner TK (2012). Exposure and response (ritual) prevention for obsessive-compulsive disorder: Therapist guide (2nd ed). Oxford University Press. [Google Scholar]
  58. Forstmann BU, Ratcliff R, & Wagenmakers E-J (2016). Sequential Sampling Models in Cognitive Neuroscience: Advantages, Applications, and Extensions. Annual Review of Psychology, 67(1), 641–666. 10.1146/annurev-psych-122414-033645 [DOI] [PMC free article] [PubMed] [Google Scholar]
  59. Fraunfelter L, Gerdes ABM, & Alpers GW (2022). Fear one, fear them all: A Systematic Review and Meta-Analysis of Fear Generalization in Pathological Anxiety. Neuroscience & Biobehavioral Reviews, 104707. 10.1016/j.neubiorev.2022.104707 [DOI] [PubMed] [Google Scholar]
  60. Gagne C, & Dayan P (2022). Peril, prudence and planning as risk, avoidance and worry. Journal of Mathematical Psychology, 106, 102617. 10.1016/j.jmp.2021.102617 [DOI] [Google Scholar]
  61. Gagne C, Zika O, Dayan P, & Bishop SJ (2020). Impaired adaptation of learning to contingency volatility in internalizing psychopathology. ELife, 9, e61387. 10.7554/eLife.61387 [DOI] [PMC free article] [PubMed] [Google Scholar]
  62. Gillan CM, Kalanthroff E, Evans M, Weingarden HM, Jacoby RJ, Gershkovich M, Snorrason I, Campeas R, Cervoni C, Crimarco NC, Sokol Y, Garnaat SL, McLaughlin NCR, Phelps EA, Pinto A, Boisseau CL, Wilhelm S, Daw ND, & Simpson HB (2020). Comparison of the Association Between Goal-Directed Planning and Self-reported Compulsivity vs Obsessive-Compulsive Disorder Diagnosis. JAMA Psychiatry, 77(1), 77. 10.1001/jamapsychiatry.2019.2998 [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Gillan CM, Kosinski M, Whelan R, Phelps EA, & Daw ND (2016). Characterizing a psychiatric symptom dimension related to deficits in goal-directed control. ELife, 5, e11305. 10.7554/eLife.11305 [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Gillan CM, Vaghi MM, Hezemans FH, van Ghesel Grothe S, Dafflon J, Brühl AB, Savulich G, & Robbins TW (2021). Experimentally induced and real-world anxiety have no demonstrable effect on goal-directed behaviour. Psychological Medicine, 51(9), 1467–1478. 10.1017/S0033291720000203 [DOI] [PMC free article] [PubMed] [Google Scholar]
  65. Giorgetta C, Grecucci A, Zuanon S, Perini L, Balestrieri M, Bonini N, Sanfey AG, & Brambilla P (2012). Reduced risk-taking behavior as a trait feature of anxiety. Emotion, 12(6), 1373–1383. 10.1037/a0029119 [DOI] [PubMed] [Google Scholar]
  66. Gorka SM, Lieberman L, Shankman SA, & Phan KL (2017). Startle potentiation to uncertain threat as a psychophysiological indicator of fear-based psychopathology: An examination across multiple internalizing disorders. Journal of Abnormal Psychology, 126(1), 8–18. 10.1037/abn0000233 [DOI] [PMC free article] [PubMed] [Google Scholar]
  67. Grillon C, Lissek S, Rabin S, McDowell D, Dvir S, & Pine DS (2008). Increased Anxiety During Anticipation of Unpredictable But Not Predictable Aversive Stimuli as a Psychophysiologic Marker of Panic Disorder. American Journal of Psychiatry, 165(7), 898–904. 10.1176/appi.ajp.2007.07101581 [DOI] [PMC free article] [PubMed] [Google Scholar]
  68. Grillon C, Pine DS, Lissek S, Rabin S, Bonne O, & Vythilingam M (2009). Increased Anxiety During Anticipation of Unpredictable Aversive Stimuli in Posttraumatic Stress Disorder but not in Generalized Anxiety Disorder. Biological Psychiatry, 66(1), 47–53. 10.1016/j.biopsych.2008.12.028 [DOI] [PMC free article] [PubMed] [Google Scholar]
  69. Grillon C, Robinson OJ, Krimsky M, O’Connell K, Alvarez G, & Ernst M (2017). Anxiety-mediated facilitation of behavioral inhibition: Threat processing and defensive reactivity during a go/no-go task. Emotion, 17(2), 259–266. 10.1037/emo0000214 [DOI] [PMC free article] [PubMed] [Google Scholar]
  70. Grossman CD, Bari BA, & Cohen JY (2022). Serotonin neurons modulate learning rate through uncertainty. Current Biology, 32(3), 586–599.e7. 10.1016/jxub.2021.12.006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  71. Grupe DW, & Nitschke JB (2013). Uncertainty and anticipation in anxiety: An integrated neurobiological and psychological perspective. Nature Reviews Neuroscience, 14(7), 488–501. 10.1038/nrn3524 [DOI] [PMC free article] [PubMed] [Google Scholar]
  72. Hansmeier J, Haberkamp A, Glombiewski JA, & Exner C (2021). The Behavior Avoidance Test: Association With Symptom Severity and Treatment Outcome in Obsessive-Compulsive Disorder. Frontiers in Psychiatry, 12, 781972. 10.3389/fpsyt.2021.781972 [DOI] [PMC free article] [PubMed] [Google Scholar]
  73. Harro J (2018). Animals, anxiety, and anxiety disorders: How to measure anxiety in rodents and why. Behavioural Brain Research, 352, 81–93. 10.1016/j.bbr.2017.10.016 [DOI] [PubMed] [Google Scholar]
  74. Hartley CA, & Phelps EA (2012). Anxiety and Decision-Making. Biological Psychiatry, 72(2), 113–118. 10.1016/j.biopsych.2011.12.027 [DOI] [PMC free article] [PubMed] [Google Scholar]
  75. Hauner KK, Mineka S, Voss JL, & Paller KA (2012). Exposure therapy triggers lasting reorganization of neural fear processing. Proceedings of the National Academy of Sciences, 109(23), 9203–9208. 10.1073/pnas.1205242109 [DOI] [PMC free article] [PubMed] [Google Scholar]
  76. Hayden BY, Heilbronner SR, Pearson JM, & Platt ML (2011). Surprise Signals in Anterior Cingulate Cortex: Neuronal Encoding of Unsigned Reward Prediction Errors Driving Adjustment in Behavior. Journal of Neuroscience, 31(11), 4178–4187. 10.1523/JNEUROSCI.4652-10.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  77. Heller AS, Ezie CEC, Otto AR, & Timpano KR (2018). Model-based learning and individual differences in depression: The moderating role of stress. Behaviour Research and Therapy, 111, 19–26. 10.1016/j.brat.2018.09.007 [DOI] [PubMed] [Google Scholar]
  78. Hendrickson RC, & Raskind MA (2016). Noradrenergic dysregulation in the pathophysiology of PTSD. Experimental Neurology, 284, 181–195. 10.1016/jxxpneurol.2016.05.014 [DOI] [PubMed] [Google Scholar]
  79. Hertwig R, Barron G, Weber EU, & Erev I (2004). Decisions from Experience and the Effect of Rare Events in Risky Choice. Psychological Science, 15(8), 6. [DOI] [PubMed] [Google Scholar]
  80. Hofmann SG (2014). d-cycloserine for treating anxiety disorders: making good exposures better and bad exposures worse: d-cycloserine for anxiety disorders. Depression and Anxiety, 31(3), 175–177. 10.1002/da.22257 [DOI] [PMC free article] [PubMed] [Google Scholar]
  81. Hogeveen J, Mullins TS, Romero JD, Eversole E, Rogge-Obando K, Mayer AR, & Costa VD (2022). The neurocomputational bases of explore-exploit decision-making. Neuron, S0896627322002501. 10.1016/j.neuron.2022.03.014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  82. Homan P, Levy I, Feltham E, Gordon C, Hu J, Li J, Pietrzak RH, Southwick S, Krystal JH, Harpaz-Rotem I, & Schiller D (2019). Neural computations of threat in the aftermath of combat trauma. Nature Neuroscience, 22(3), 470–476. 10.1038/s41593-018-0315-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  83. Hsu M, Bhatt M, Adolphs R, Tranel D, & Camerer CF (2005). Neural Systems Responding to Degrees of Uncertainty in Human Decision-Making. Science, 310(5754), 1680–1683. 10.1126/science.1115327 [DOI] [PubMed] [Google Scholar]
  84. Hunter LE, Meer EA, Gillan CM, Hsu M, & Daw ND (2021). Increased and biased deliberation in social anxiety. Nature Human Behaviour. 10.1038/s41562-021-01180-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  85. Huys QJM, Lally N, Faulkner P, Eshel N, Seifritz E, Gershman SJ, Dayan P, & Roiser JP (2015). Interplay of approximate planning strategies. Proceedings of the National Academy of Sciences, 112(10), 3098–3103. 10.1073/pnas.1414219112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  86. Huys QJM, & Renz D (2017). A Formal Valuation Framework for Emotions and Their Control. Biological Psychiatry, 82(6), 413–420. 10.1016/j.biopsych.2017.07.003 [DOI] [PubMed] [Google Scholar]
  87. Isserles M, Tendler A, Roth Y, Bystritsky A, Blumberger DM, Ward H, Feifel D, Viner L, Duffy W, Zohar J, Keller CJ, Bhati MT, Etkin A, George MS, Filipcic I, Lapidus K, Casuto L, Vaishnavi S, Stein A, et al. (2021). Deep Transcranial Magnetic Stimulation Combined With Brief Exposure for Posttraumatic Stress Disorder: A Prospective Multisite Randomized Trial. Biological Psychiatry, S0006322321012646. 10.1016/j.biopsych.2021.04.019 [DOI] [PubMed] [Google Scholar]
  88. Jones MC (1924a). The Elimination of Children’s Fears. Journal of Experimental Psychology, 7(5), 382–390. 10.1037/h0072283 [DOI] [Google Scholar]
  89. Jones MC (1924b). A Laboratory Study of Fear: The Case Of Peter. The Pedagogical Seminary and Journal of Genetic Psychology, 31(4), 308–315. 10.1080/08856559.1924.9944851 [DOI] [PubMed] [Google Scholar]
  90. Kaelbling LP, Littman ML, & Moore AW (1996). Reinforcement Learning: A Survey. Journal of Artificial Intelligence Research, 4, 237–285. 10.1613/jair.301 [DOI] [Google Scholar]
  91. Kahneman D, & Tversky A (1979). Prospect Theory: An Analysis of Decision under Risk. Econometrica, 47(2), 263–292. [Google Scholar]
  92. Kalk N, Nutt D, & Lingford-Hughes A (2011). The role of central noradrenergic dysregulation in anxiety disorders: Evidence from clinical studies. Journal of Psychopharmacology, 25(1), 3–16. 10.1177/0269881110367448 [DOI] [PubMed] [Google Scholar]
  93. Keane TM, Fairbank JA, Caddell JM, & Zimfring RT (1989). Implosive (Flooding) Therapy Reduces Symptoms of P’ISD in Vietnam Combat Veterans. Behavior Therapy, 20, 245–260. [Google Scholar]
  94. Kennerley SW, Walton ME, Behrens TEJ, Buckley MJ, & Rushworth MFS (2006). Optimal decision making and the anterior cingulate cortex. Nature Neuroscience, 9(7), 940–947. 10.1038/nn1724 [DOI] [PubMed] [Google Scholar]
  95. Knowles KA, & Olatunji BO (2020). Specificity of trait anxiety in anxiety and depression: Meta-analysis of the State-Trait Anxiety Inventory. Clinical Psychology Review, 82, 101928. 10.1016/j.cpr.2020.101928 [DOI] [PMC free article] [PubMed] [Google Scholar]
  96. Korn CW, & Bach DR (2015). Maintaining Homeostasis by Decision-Making. PLOS Computational Biology, 11(5), e1004301. 10.1371/journal.pcbi.1004301 [DOI] [PMC free article] [PubMed] [Google Scholar]
  97. Krueger PM, Wilson RC, & Cohen JD (2017). Strategies for exploration in the domain of losses. Judgment and Decision Making, 12(2), 104–117. [Google Scholar]
  98. Krypotos A-M (2015). Avoidance learning: A review of theoretical models and recent developments. Frontiers in Behavioral Neuroscience, 9. 10.3389/fnbeh.2015.00189 [DOI] [PMC free article] [PubMed] [Google Scholar]
  99. Krypotos A-M, Crombez G, Meulders A, Claes N, & Vlaeyen JWS (2020). Decomposing conditioned avoidance performance with computational models. Behaviour Research and Therapy, 133, 103712. 10.1016/j.brat.2020.103712 [DOI] [PubMed] [Google Scholar]
  100. Kuhnen CM, & Knutson B (2005). The Neural Basis of Financial Risk Taking. Neuron, 47(5), 763–770. 10.1016/j.neuron.2005.08.008 [DOI] [PubMed] [Google Scholar]
  101. Lamba A, Frank MJ, & FeldmanHall O (2020). Anxiety Impedes Adaptive Social Learning Under Uncertainty. Psychological Science, 31(5), 592–603. 10.1177/0956797620910993 [DOI] [PubMed] [Google Scholar]
  102. Le Pelley ME (2004). The Role of Associative History in Models of Associative Learning: A Selective Review and a Hybrid Model. The Quarterly Journal of Experimental Psychology Section B, 57(3b), 193–243. 10.1080/02724990344000141 [DOI] [PubMed] [Google Scholar]
  103. LeDoux JE, Moscarello J, Sears R, & Campese V (2017). The birth, death and resurrection of avoidance: A reconceptualization of a troubled paradigm. Molecular Psychiatry, 22(1), 24–36. 10.1038/mp.2016.166 [DOI] [PMC free article] [PubMed] [Google Scholar]
  104. Lejarraga T, & Hertwig R (2017). How the threat of losses makes people explore more than the promise of gains. Psychonomic Bulletin & Review, 24(3), 708–720. 10.3758/s13423-016-1158-7 [DOI] [PMC free article] [PubMed] [Google Scholar]
  105. Levin-Aspenson HF, Watson D, Ellickson-Larew S, Stanton K, & Stasik-O’Brien SM (2021). Beyond Distress and Fear: Differential Psychopathology Correlates of PTSD Symptom Clusters. Journal of Affective Disorders, 284, 9–17. 10.1016/j.jad.2021.01.090 [DOI] [PubMed] [Google Scholar]
  106. Levy I, & Schiller D (2021). Neural Computations of Threat. Trends in Cognitive Sciences, 25(2), 151–171. 10.1016/j.tics.2020.11.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  107. Li J, Schiller D, Schoenbaum G, Phelps EA, & Daw ND (2011). Differential roles of human striatum and amygdala in associative learning. Nature Neuroscience, 14(10), 1250–1252. 10.1038/nn.2904 [DOI] [PMC free article] [PubMed] [Google Scholar]
  108. Lissek S, Pine DS, & Grillon C (2006). The strong situation: A potential impediment to studying the psychobiology and pharmacology of anxiety disorders. Biological Psychology, 72(3), 265–270. 10.1016/j.biopsycho.2005.11.004 [DOI] [PubMed] [Google Scholar]
  109. Loijen A, Vrijsen JN, Egger JIM, Becker ES, & Rinck M (2020). Biased approach-avoidance tendencies in psychopathology: A systematic review of their assessment and modification. Clinical Psychology Review, 77, 101825. 10.1016/j.cpr.2020.101825 [DOI] [PubMed] [Google Scholar]
  110. Lovibond PF, Saunders JC, Weidemann G, & Mitchell CJ (2008). Evidence for expectancy as a mediator of avoidance and anxiety in a laboratory model of human avoidance learning. Quarterly Journal of Experimental Psychology, 61(8), 1199–1216. 10.1080/17470210701503229 [DOI] [PubMed] [Google Scholar]
  111. Mackintosh NJ (1975). A theory of attention: Variations in the associability of stimuli with reinforcement. Psychological Review, 82(4), 276–298. 10.1037/h0076778 [DOI] [Google Scholar]
  112. Maia TV (2010). Two-factor theory, the actor—Critic model, and conditioned avoidance. Learning & Behavior, 38(1), 50–67. 10.3758/LB.38.1.50 [DOI] [PubMed] [Google Scholar]
  113. Maner JK, Richey JA, Cromer K, Mallott M, Lejuez CW, Joiner TE, & Schmidt NB (2007). Dispositional anxiety and risk-avoidant decision-making. Personality and Individual Differences, 42(4), 665–675. 10.1016/j.paid.2006.08.016 [DOI] [Google Scholar]
  114. Maren S (2001). Neurobiology of Pavlovian Fear Conditioning. Annual Review of Neuroscience, 24(1), 897–931. 10.1146/annurev.neuro.24.1.897 [DOI] [PubMed] [Google Scholar]
  115. Maren S (2011). Seeking a Spotless Mind: Extinction, Deconsolidation, and Erasure of Fear Memory. Neuron, 70(5), 830–845. 10.1016/j.neuron.2011.04.023 [DOI] [PMC free article] [PubMed] [Google Scholar]
  116. Marin M-F, Hammoud MZ, Klumpp H, Simon NM, & Milad MR (2020). Multimodal Categorical and Dimensional Approaches to Understanding Threat Conditioning and Its Extinction in Individuals With Anxiety Disorders. JAMA Psychiatry, 77(6), 618. 10.1001/jamapsychiatry.2019.4833 [DOI] [PMC free article] [PubMed] [Google Scholar]
  117. Marin M-F, Zsido RG, Song H, Lasko NB, Killgore WDS, Rauch SL, Simon NM, & Milad MR (2017). Skin Conductance Responses and Neural Activations During Fear Conditioning and Extinction Recall Across Anxiety Disorders. JAMA Psychiatry, 74(6), 622. 10.1001/jamapsychiatry.2017.0329 [DOI] [PMC free article] [PubMed] [Google Scholar]
  118. Mataix-Cols D, Fernández de la Cruz L, Monzani B, Rosenfield D, Andersson E, Pérez-Vigil A, Frumento P, de Kleine RA, Difede J, Dunlop BW, Farrell LJ, Geller D, Gerardi M, Guastella AJ, Hofmann SG, Hendriks G-J, Kushner MG, Lee FS, Lenze EJ, et al. (2017). D-Cycloserine Augmentation of Exposure-Based Cognitive Behavior Therapy for Anxiety, Obsessive-Compulsive, and Posttraumatic Stress Disorders: A Systematic Review and Meta-analysis of Individual Participant Data. JAMA Psychiatry, 74(5), 501. 10.1001/jamapsychiatry.2016.3955 [DOI] [PubMed] [Google Scholar]
  119. Matias S, Lottem E, Dugué GP, & Mainen ZF (2017). Activity patterns of serotonin neurons underlying cognitive flexibility. ELife, 6, e20552. 10.7554/eLife.20552 [DOI] [PMC free article] [PubMed] [Google Scholar]
  120. McTeague LM, & Lang PJ (2012a). The anxiety spectrum and the reflex physiology of defense: from circumscribed fear to broad distress: Special Article: Defensive Reactivity & the Anxiety Spectrum. Depression and Anxiety, 29(4), 264–281. 10.1002/da.21891 [DOI] [PMC free article] [PubMed] [Google Scholar]
  121. McTeague LM, & Lang PJ (2012b). The anxiety spectrum and the reflex physiology of defense: from circumscribed fear to broad distress: special article: defensive reactivity & the anxiety spectrum. Depression and Anxiety, 29(4), 264–281. 10.1002/da.21891 [DOI] [PMC free article] [PubMed] [Google Scholar]
  122. Mehlhorn K, Newell BR, Todd PM, Lee MD, Morgan K, Braithwaite VA, Hausmann D, Fiedler K, & Gonzalez C (2015). Unpacking the exploration–exploitation tradeoff: A synthesis of human and animal literatures. Decision, 2(3), 191–215. 10.1037/dec0000033 [DOI] [Google Scholar]
  123. Mischel W (1977). The interaction of person and situation. In Personality at the crossroads: Current issues in interactional psychology. [Google Scholar]
  124. Mkrtchian A, Aylward J, Dayan P, Roiser JP, & Robinson OJ (2017). Modeling Avoidance in Mood and Anxiety Disorders Using Reinforcement Learning. Biological Psychiatry, 82(7), 532–539. 10.1016/j.biopsych.2017.01.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  125. Mobbs D, Hagan CC, Dalgleish T, Silston B, & Prévost C (2015). The ecology of human fear: Survival optimization and the nervous system. Frontiers in Neuroscience, 9. 10.3389/fnins.2015.00055 [DOI] [PMC free article] [PubMed] [Google Scholar]
  126. Mobbs D, Headley DB, Ding W, & Dayan P (2020). Space, Time, and Fear: Survival Computations along Defensive Circuits. Trends in Cognitive Sciences, 24(3), 228–241. 10.1016/j.tics.2019.12.016 [DOI] [PubMed] [Google Scholar]
  127. Montague P, Dayan P, & Sejnowski T (1996). A framework for mesencephalic dopamine systems based on predictive Hebbian learning. The Journal of Neuroscience, 16(5), 1936–1947. 10.1523/JNEUROSCI.16-05-01936.1996 [DOI] [PMC free article] [PubMed] [Google Scholar]
  128. Montague PR, Dolan RJ, Friston KJ, & Dayan P (2012). Computational psychiatry. Trends in Cognitive Sciences, 16(1), 72–80. 10.1016/j.tics.2011.11.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  129. Moutoussis M, Bentall RP, Williams J, & Dayan P (2008). A temporal difference account of avoidance learning. Network: Computation in Neural Systems, 19(2), 137–160. 10.1080/09548980802192784 [DOI] [PubMed] [Google Scholar]
  130. Mowrer OH (1939). A stimulus-response analysis of anxiety and its role as a reinforcing agent. Psychological Review, 46(6), 553–565. 10.1037/h0054288 [DOI] [Google Scholar]
  131. Mowrer OH (1951). Two-factor learning theory: Summary and comment. Psychological Review, 58(5), 350–354. 10.1037/h0058956 [DOI] [PubMed] [Google Scholar]
  132. Muris P, & Field AP (2010). The Role of Verbal Threat Information in the Development of Childhood Fear. “Beware the Jabberwock!”. Clinical Child and Family Psychology Review, 13(2), 129–150. 10.1007/s10567-010-0064-1 [DOI] [PMC free article] [PubMed] [Google Scholar]
  133. Naegeli C, Zeffiro T, Piccirelli M, Jaillard A, Weilenmann A, Hassanpour K, Schick M, Rufer M, Orr SP, & Mueller-Pfeiffer C (2018). Locus Coeruleus Activity Mediates Hyper-responsiveness in Posttraumatic Stress Disorder. Biological Psychiatry, 83(3), 254–262. 10.1016/j.biopsych.2017.08.021 [DOI] [PubMed] [Google Scholar]
  134. Nassar MR, Bruckner R, & Frank MJ (2019). Statistical context dictates the relationship between feedback-related EEG signals and learning. ELife, 8, e46975. 10.7554/eLife.46975 [DOI] [PMC free article] [PubMed] [Google Scholar]
  135. Nassar MR, Rumsey KM, Wilson RC, Parikh K, Heasly B, & Gold JI (2012). Rational regulation of learning dynamics by pupil-linked arousal systems. Nature Neuroscience, 15(7), 1040–1046. 10.1038/nn.3130 [DOI] [PMC free article] [PubMed] [Google Scholar]
  136. Navarro DJ, Newell BR, & Schulze C (2016). Learning and choosing in an uncertain world: An investigation of the explore–exploit dilemma in static and dynamic environments. Cognitive Psychology, 85, 43–77. 10.1016/j.cogpsych.2016.01.001 [DOI] [PubMed] [Google Scholar]
  137. Niv Y (2009). Reinforcement learning in the brain. Journal of Mathematical Psychology, 53(3), 139–154. 10.1016/j.jmp.2008.12.005 [DOI] [Google Scholar]
  138. Norberg MM, Krystal JH, & Tolin DF (2008). A Meta-Analysis of D-Cycloserine and the Facilitation of Fear Extinction and Exposure Therapy. Biological Psychiatry, 63(12), 1118–1126. 10.1016/j.biopsych.2008.01.012 [DOI] [PubMed] [Google Scholar]
  139. Norbury A, Brinkman H, Kowalchyk M, Monti E, Pietrzak RH, Schiller D, & Feder A (2021). Latent cause inference during extinction learning in trauma-exposed individuals with and without PTSD. Psychological Medicine, 1–12. 10.1017/S0033291721000647 [DOI] [PubMed] [Google Scholar]
  140. Olsson A, & Phelps EA (2004). Learned Fear of “Unseen” Faces after Pavlovian, Observational, and Instructed Fear. Psychological Science, 15(12), 822–828. 10.1111/j.0956-7976.2004.00762.x [DOI] [PubMed] [Google Scholar]
  141. Olsson A, & Phelps EA (2007). Social learning of fear. Nature Neuroscience, 10(9), 1095–1102. 10.1038/nn1968 [DOI] [PubMed] [Google Scholar]
  142. O’Reilly JX, Schuffelgen U, Cuell SF, Behrens TEJ, Mars RB, & Rushworth MFS (2013). Dissociable effects of surprise and model update in parietal and anterior cingulate cortex. Proceedings of the National Academy of Sciences, 110(38), E3660–E3669. 10.1073/pnas.1305373110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  143. Payzan-LeNestour E, & Bossaerts P (2011). Risk, Unexpected Uncertainty, and Estimation Uncertainty: Bayesian Learning in Unstable Settings. PLoS Computational Biology, 7(1), e1001048. 10.1371/journal.pcbi.1001048 [DOI] [PMC free article] [PubMed] [Google Scholar]
  144. Payzan-LeNestour É, & Bossaerts P (2012). Do not Bet on the Unknown Versus Try to Find Out More: Estimation Uncertainty and “Unexpected Uncertainty” Both Modulate Exploration. Frontiers in Neuroscience, 6. 10.3389/fnins.2012.00150 [DOI] [PMC free article] [PubMed] [Google Scholar]
  145. Pearce JM, & Hall G (1980). A Model for Pavlovian Learning: Variations in the Effectiveness of Conditioned But Not of Unconditioned Stimuli. Psychological Review, 87(6), 532–552. [PubMed] [Google Scholar]
  146. Pike AC, & Robinson OJ (2022). Reinforcement Learning in Patients With Mood and Anxiety Disorders vs Control Individuals: A Systematic Review and Meta-analysis. JAMA Psychiatry, 79(4), 313. 10.1001/jamapsychiatry.2022.0051 [DOI] [PMC free article] [PubMed] [Google Scholar]
  147. Pine DS (2017). Clinical Advances From a Computational Approach to Anxiety. Biological Psychiatry, 82(6), 385–387. 10.1016/j.biopsych.2016.09.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  148. Piray P, & Daw ND (2021). A model for learning based on the joint estimation of stochasticity and volatility. Nature Communications, 12(1), 6587. 10.1038/s41467-021-26731-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  149. Pittig A, Treanor M, LeBeau RT, & Craske MG (2018). The role of associative fear and avoidance learning in anxiety disorders: Gaps and directions for future research. Neuroscience & Biobehavioral Reviews, 88, 117–140. 10.1016/j.neubiorev.2018.03.015 [DOI] [PubMed] [Google Scholar]
  150. Pittig A, Wong AHK, Glück VM, & Boschet JM (2020). Avoidance and its bi-directional relationship with conditioned fear: Mechanisms, moderators, and clinical implications. Behaviour Research and Therapy, 126, 103550. 10.1016/j.brat.2020.103550 [DOI] [PubMed] [Google Scholar]
  151. Platt ML, & Huettel SA (2008). Risky business: The neuroeconomics of decision making under uncertainty. Nature Neuroscience, 11(4), 398–403. 10.1038/nn2062 [DOI] [PMC free article] [PubMed] [Google Scholar]
  152. Price RB, Brown V, & Siegle GJ (2019). Computational Modeling Applied to the Dot-Probe Task Yields Improved Reliability and Mechanistic Insights. Biological Psychiatry, 85(7), 606–612. 10.1016/j.biopsych.2018.09.022 [DOI] [PMC free article] [PubMed] [Google Scholar]
  153. Pulcu E, & Browning M (2019). The Misestimation of Uncertainty in Affective Disorders. Trends in Cognitive Sciences, 23(10), 865–875. 10.1016/j.tics.2019.07.007 [DOI] [PubMed] [Google Scholar]
  154. Quirk GJ, & Mueller D (2008). Neural Mechanisms of Extinction Learning and Retrieval. Neuropsychopharmacology, 33, 56–72. [DOI] [PMC free article] [PubMed] [Google Scholar]
  155. Rachman S (1976). The passing of the two-stage theory of fear and avoidance: Fresh possibilities. Behaviour Research and Therapy, 14(2), 125–131. 10.1016/0005-7967(76)90066-8 [DOI] [PubMed] [Google Scholar]
  156. Rachman S (1977). The conditioning theory of fearacquisition: A critical examination. Behaviour Research and Therapy, 15(5), 375–387. 10.1016/0005-7967(77)90041-9 [DOI] [PubMed] [Google Scholar]
  157. Rangel A, Camerer C, & Montague PR (2008). A framework for studying the neurobiology of value-based decision making. Nature Reviews Neuroscience, 9(7), 545–556. 10.1038/nrn2357 [DOI] [PMC free article] [PubMed] [Google Scholar]
  158. Rashid H, Mendu S, Daniel KE, Beltzer ML, Teachman BA, Boukhechba M, & Barnes LE (2020). Predicting Subjective Measures of Social Anxiety from Sparsely Collected Mobile Sensor Data. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 4(3), 1–24. 10.1145/341182335846237 [DOI] [Google Scholar]
  159. Ratcliff R, & McKoon G (2008). The Diffusion Decision Model: Theory and Data for Two-Choice Decision Tasks. Neural Computation, 20(4), 873–922. 10.1162/neco.2008.12-06-420 [DOI] [PMC free article] [PubMed] [Google Scholar]
  160. Raymond JG, Steele JD, & Seriès P (2017). Modeling Trait Anxiety: From Computational Processes to Personality. Frontiers in Psychiatry, 8. 10.3389/fpsyt.2017.00001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  161. Rescorla RA, & Wagner AR (1972). A theory of Pavlovian conditioning: Variations in the effectiveness of reinforcement and nonreinforcement. In Black AH & Prokasy WF (Eds.), Classical Conditioning II (pp. 64–99). Appleton-Century-Crofts. [Google Scholar]
  162. Rich AS, & Gureckis TM (2018). Exploratory choice reflects the future value of information. Decision, 5(3), 177–192. 10.1037/dec0000074 [DOI] [Google Scholar]
  163. Richter J, Pittig A, Hollandt M, & Lueken U (2017). Bridging the Gaps Between Basic Science and Cognitive-Behavioral Treatments for Anxiety Disorders in Routine Care: Current Status and Future Demands. Zeitschrift Für Psychologie, 225(3), 252–267. 10.1027/2151-2604/a000309 [DOI] [Google Scholar]
  164. Rodebaugh TL, Scullin RB, Langer JK, Dixon DJ, Huppert JD, Bernstein A, Zvielli A, & Lenze EJ (2016). Unreliability as a threat to understanding psychopathology: The cautionary tale of attentional bias. Journal of Abnormal Psychology, 125(6), 840–851. 10.1037/abn0000184 [DOI] [PMC free article] [PubMed] [Google Scholar]
  165. Rothbaum BO, & Schwartz AC (2002). Exposure therapy for posttraumatic stress disorder. American Journal of Psychotherapy, 56(1), 59–75. 10.1176/appi.psychotherapy.2002.56.1.59 [DOI] [PubMed] [Google Scholar]
  166. Salcioğlu E, Başoğlu M, & Livanou M (2007). Effects of live exposure on symptoms of posttraumatic stress disorder: The role of reduced behavioral avoidance in improvement. Behaviour Research and Therapy, 45(10), 2268–2279. 10.1016/j.brat.2007.04.012 [DOI] [PubMed] [Google Scholar]
  167. Schuck NW, Cai MB, Wilson RC, & Niv Y (2016). Human Orbitofrontal Cortex Represents a Cognitive Map of State Space. Neuron, 91(6), 1402–1412. 10.1016/j.neuron.2016.08.019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  168. Schuck NW, & Niv Y (2019). Sequential replay of nonspatial task states in the human hippocampus. Science, 364(6447), eaaw5181. 10.1126/science.aaw5181 [DOI] [PMC free article] [PubMed] [Google Scholar]
  169. Schultz W, Dayan P, & Montague PR (1997). A Neural Substrate of Prediction and Reward. Science, 275(5306), 1593–1599. [DOI] [PubMed] [Google Scholar]
  170. Schulz E, Wu CM, Huys QJM, Krause A, & Speekenbrink M (2018). Generalization and Search in Risky Environments. Cognitive Science, 42(8), 2592–2620. 10.1111/cogs.12695 [DOI] [PubMed] [Google Scholar]
  171. Seeley WW, Menon V, Schatzberg AF, Keller J, Glover GH, Kenna H, Reiss AL, & Greicius MD (2007). Dissociable Intrinsic Connectivity Networks for Salience Processing and Executive Control. Journal of Neuroscience, 27(9), 2349–2356. 10.1523/JNEUROSCI.5587-06.2007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  172. Seymour B, O’Doherty JP, Dayan P, Koltzenburg M, Jones AK, Dolan RJ, Friston KJ, & Frackowiak RS (2004). Temporal difference models describe higher-order learning in humans. Neuron, 429, 4. [DOI] [PubMed] [Google Scholar]
  173. Sip KE, Gonzalez R, Taylor SF, & Stern ER (2018). Increased Loss Aversion in Unmedicated Patients with Obsessive–Compulsive Disorder. Frontiers in Psychiatry, 8, 309. 10.3389/fpsyt.2017.00309 [DOI] [PMC free article] [PubMed] [Google Scholar]
  174. Smits JAJ, Pollack MH, Rosenfield D, Otto MW, Dowd S, Carpenter J, Dutcher CD, Lewis EM, Witcraft SM, Papini S, Curtiss J, Andrews L, Kind S, Conroy K, & Hofmann SG (2020). Dose Timing of D-Cycloserine to Augment Exposure Therapy for Social Anxiety Disorder: A Randomized Clinical Trial. JAMA Network Open, 3(6), e206777. 10.1001/jamanetworkopen.2020.6777 [DOI] [PMC free article] [PubMed] [Google Scholar]
  175. Solomon RL, & Wynne LC (1954). Traumatic avoidance learning: The principles of anxiety conservation and partial irreversibility. Psychological Review, 61(6), 353–385. 10.1037/h0054540 [DOI] [PubMed] [Google Scholar]
  176. Soltani A, & Izquierdo A (2019). Adaptive learning under expected and unexpected uncertainty. Nature Reviews Neuroscience, 20(10), 635–644. 10.1038/s41583-019-0180-y [DOI] [PMC free article] [PubMed] [Google Scholar]
  177. Steinman SA, Dunsmoor JE, Gazman Z, Stovezky Y, Pascucci O, Pomerenke J, Phelps EA, Fyer A, & Simpson HB (2022). A Preliminary Test of Novelty-Facilitated Extinction in Individuals With Pathological Anxiety. Frontiers in Behavioral Neuroscience, 16, 873489. 10.3389/fnbeh.2022.873489 [DOI] [PMC free article] [PubMed] [Google Scholar]
  178. Stojić H, Orquin JL, Dayan P, Dolan RJ, & Speekenbrink M (2020). Uncertainty in learning, choice, and visual fixation. Proceedings of the National Academy of Sciences, 117(6), 3291–3300. 10.1073/pnas.1911348117 [DOI] [PMC free article] [PubMed] [Google Scholar]
  179. Struijs SY, Lamers F, Rinck M, Roelofs K, Spinhoven P, & Penninx BWJH (2018). The predictive value of Approach and Avoidance tendencies on the onset and course of depression and anxiety disorders. Depression and Anxiety, 35(6), 551–559. 10.1002/da.22760 [DOI] [PMC free article] [PubMed] [Google Scholar]
  180. Struijs SY, Lamers F, Vroling MS, Roelofs K, Spinhoven P, & Penninx BWJH (2017). Approach and avoidance tendencies in depression and anxiety disorders. Psychiatry Research, 256, 475–481. 10.1016/j.psychres.2017.07.010 [DOI] [PubMed] [Google Scholar]
  181. Sun D, Gold AL, Swanson CA, Haswell CC, Brown VM, Stjepanovic D, LaBar KS, & Morey RA (2020). Threat-induced anxiety during goal pursuit disrupts amygdala–prefrontal cortex connectivity in posttraumatic stress disorder. Translational Psychiatry, 10(1), 61. 10.1038/s41398-020-0739-4 [DOI] [PMC free article] [PubMed] [Google Scholar]
  182. Sutton RS, & Barto A (1987). A temporal-difference model of classical conditioning. Proceedings of the Ninth Annual Conference of the Cognitive Science Society. [Google Scholar]
  183. Sutton RS, & Barto AG (1998). Introduction to reinforcement learning.
  184. Sutton RS, Precup D, & Singh S (1999). Between MDPs and semi-MDPs: A framework for temporal abstraction in reinforcement learning. Artificial Intelligence, 112(1–2), 181–211. 10.1016/S0004-3702(99)00052-1 [DOI] [Google Scholar]
  185. Sylvester CM, Corbetta M, Raichle ME, Rodebaugh TL, Schlaggar BL, Sheline YI, Zorumski CF, & Lenze EJ (2012). Functional network dysfunction in anxiety and anxiety disorders. Trends in Neurosciences, 35(9), 527–535. 10.1016/j.tins.2012.04.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  186. Tashjian SM, Zbozinek TD, & Mobbs D (2021). A Decision Architecture for Safety Computations. Trends in Cognitive Sciences, 25(5), 342–354. 10.1016/j.tics.2021.01.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  187. Tom SM, Fox CR, Trepel C, & Poldrack RA (2007). The Neural Basis of Loss Aversion in Decision-Making Under Risk. Science, 315(5811), 515–518. 10.1126/science.1134239 [DOI] [PubMed] [Google Scholar]
  188. Vervliet B, Craske MG, & Hermans D (2013). Fear Extinction and Relapse: State of the Art. Annual Review of Clinical Psychology, 9(1), 215–248. 10.1146/annurev-clinpsy-050212-185542 [DOI] [PubMed] [Google Scholar]
  189. Vervliet B, Lange I, & Milad MR (2017). Temporal dynamics of relief in avoidance conditioning and fear extinction: Experimental validation and clinical relevance. Behaviour Research and Therapy, 96, 66–78. 10.1016/j.brat.2017.04.011 [DOI] [PubMed] [Google Scholar]
  190. Vilares I, & Kording K (2011). Bayesian models: The structure of the world, uncertainty, behavior, and the brain: Bayesian models and the world. Annals of the New York Academy of Sciences, 1224(1), 22–39. 10.1111/j.1749-6632.2011.05965.x [DOI] [PMC free article] [PubMed] [Google Scholar]
  191. Walker AR, Luque D, Le Pelley ME, & Beesley T (2019). The role of uncertainty in attentional and choice exploration. Psychonomic Bulletin & Review, 26(6), 1911–1916. 10.3758/s13423-019-01653-2 [DOI] [PubMed] [Google Scholar]
  192. Walton ME, Behrens TEJ, Noonan MP, & Rushworth MFS (2011). Giving credit where credit is due: Orbitofrontal cortex and valuation in an uncertain world: Orbitofrontal cortex: value assignment and comparison. Annals of the New York Academy of Sciences, 1239(1), 14–24. 10.1111/j.1749-6632.2011.06257.x [DOI] [PubMed] [Google Scholar]
  193. Watson D (2005). Rethinking the mood and anxiety disorders: A quantitative hierarchical model for DSM-V. Journal of Abnormal Psychology, 114(4), 522–536. 10.1037/0021-843X.114.4.522 [DOI] [PubMed] [Google Scholar]
  194. Watts BV, Schnurr PP, Mayo L, Young-Xu Y, Weeks WB, & Friedman MJ (2013). Meta-Analysis of the Efficacy of Treatments for Posttraumatic Stress Disorder. The Journal of Clinical Psychiatry, 74(06), e541–e550. 10.4088/JCP.12r08225 [DOI] [PubMed] [Google Scholar]
  195. Wheaton MG, Gershkovich M, Gallagher T, Foa EB, & Simpson HB (2018). Behavioral avoidance predicts treatment outcome with exposure and response prevention for obsessive-compulsive disorder. Depression and Anxiety, 35(3), 256–263. 10.1002/da.22720 [DOI] [PMC free article] [PubMed] [Google Scholar]
  196. White CN, Ratcliff R, & Vasey MW (2016). Anxiety-related threat bias in recognition memory: The moderating effect of list composition and semantic-similarity effects. Cognition and Emotion, 30(8), 1446–1460. 10.1080/02699931.2015.1070120 [DOI] [PubMed] [Google Scholar]
  197. White CN, Ratcliff R, Vasey MW, & McKoon G (2010). Anxiety enhances threat processing without competition among multiple inputs: A diffusion model analysis. Emotion, 10(5), 662–677. 10.1037/a0019474 [DOI] [PubMed] [Google Scholar]
  198. Wilson RC, Geana A, White JM, Ludvig EA, & Cohen JD (2014). Humans use directed and random exploration to solve the explore–exploit dilemma. Journal of Experimental Psychology: General, 143(6), 2074–2081. 10.1037/a0038199 [DOI] [PMC free article] [PubMed] [Google Scholar]
  199. Wise T, & Dolan RJ (2020). Associations between aversive learning processes and transdiagnostic psychiatric symptoms in a general population sample. Nature Communications, 11(1), 4179. 10.1038/s41467-020-17977-w [DOI] [PMC free article] [PubMed] [Google Scholar]
  200. Wise T, Michely J, Dayan P, & Dolan RJ (2019). A computational account of threat-related attentional bias. PLOS Computational Biology, 15(10), e1007341. 10.1371/journal.pcbi.1007341 [DOI] [PMC free article] [PubMed] [Google Scholar]
  201. Wise T, Zbozinek TD, Charpentier CJ, Michelini G, Hagan CC, & Mobbs D (2022). Computationally-defined markers of uncertainty aversion predict emotional responses during a global pandemic. Emotion. 10.1037/emo0001088 [DOI] [PMC free article] [PubMed] [Google Scholar]
  202. Wolpe J (1961). The systematic desensitization treatment of neuroses. The Journal of Nervous and Mental Disease, 132, 189–203. 10.1097/00005053-196103000-00001 [DOI] [PubMed] [Google Scholar]
  203. Woody ML, Miskovic V, Owens M, James KM, Feurer C, Sosoo EE, & Gibb BE (2017). Competition Effects in Visual Cortex Between Emotional Distractors and a Primary Task in Remitted Depression. Biological Psychiatry: Cognitive Neuroscience and Neuroimaging, 2(5), 396–403. 10.1016/j.bpsc.2016.12.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  204. Xia L, & Collins AGE (2021). Temporal and State Abstractions for Efficient Learning, Transfer, and Composition in Humans. Psychological Review, 128(4), 643–666. [DOI] [PMC free article] [PubMed] [Google Scholar]
  205. Yang X, Fang Y, Chen H, Zhang T, Yin X, Man J, Yang L, & Lu M (2021). Global, regional and national burden of anxiety disorders from 1990 to 2019: Results from the Global Burden of Disease Study 2019. Epidemiology and Psychiatric Sciences, 30, e36. 10.1017/S2045796021000275 [DOI] [PMC free article] [PubMed] [Google Scholar]
  206. Yu AJ, & Dayan P (2005). Uncertainty, Neuromodulation, and Attention. Neuron, 46(4), 681–692. 10.1016/j.neuron.2005.04.026 [DOI] [PubMed] [Google Scholar]
  207. Zaizar ED, Papini S, Gonzalez-Lima F, & Telch MJ (2021). Singular and combined effects of transcranial infrared laser stimulation and exposure therapy on pathological fear: A randomized clinical trial. Psychological Medicine, 1–10. 10.1017/S0033291721002270 [DOI] [PMC free article] [PubMed] [Google Scholar]
  208. Zika O, Wiech K, Reinecke A, Browning M, & Schuck NW (2022). Trait anxiety is associated with hidden state inference during aversive reversal learning [Preprint]. Neuroscience. 10.1101/2022.04.01.483303 [DOI] [PMC free article] [PubMed] [Google Scholar]
  209. Zinbarg RE, Williams AL, & Mineka S (2022). A Current Learning Theory Approach to the Etiology and Course of Anxiety and Related Disorders. Annual Review of Clinical Psychology, 18(1), 233–258. 10.1146/annurev-clinpsy-072220-021010 [DOI] [PubMed] [Google Scholar]
  210. Zorowitz S, Momennejad I, & Daw ND (2020). Anxiety, Avoidance, and Sequential Evaluation. Computational Psychiatry, 4(0), 1. 10.1162/CPSY_a_00026 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Not applicable

RESOURCES