SUMMARY
Beliefs—attitudes toward some state of the environment—guide action selection and should be robust to variability but sensitive to meaningful change. Beliefs about volatility (expectation of change) are associated with paranoia in humans, but the brain regions responsible for volatility beliefs remain unknown. The orbitofrontal cortex (OFC) is central to adaptive behavior, whereas the magnocellular mediodorsal thalamus (MDmc) is essential for arbitrating between perceptions and action policies. We assessed belief updating in a three-choice probabilistic reversal learning task following excitotoxic lesions of the MDmc (n = 3) or OFC (n = 3) and compared performance with that of unoperated monkeys (n = 14). Computational analyses indicated a double dissociation: MDmc, but not OFC, lesions were associated with erratic switching behavior and heightened volatility belief (as in paranoia in humans), whereas OFC, but not MDmc, lesions were associated with increased lose-stay behavior and reward learning rates. Given the consilience across species and models, these results have implications for understanding paranoia.
Graphical abstract
In brief
Suthaharan et al. demonstrate a causal role of the primate mediodorsal thalamus (MD) in beliefs about environmental volatility. Applying a behavioral paradigm and a computational model, they establish that belief volatility increases in paranoid people and monkeys with lesions to the MD. This suggests a role for the MD in paranoia.
INTRODUCTION
The ability to form and update beliefs about our actions and their consequences, especially in volatile environments, is at the core of advanced cognition.1 Indeed, disruptions in volatility beliefs are associated with maladaptive cognitive and behavioral outcomes, such as paranoia, the belief that others intend to exert harm.2 Volatility beliefs and manifestations of their disruption, like paranoia, can be captured in decision-making paradigms that incorporate environmental volatility, such as probabilistic reversal learning (PRL) tasks.3,4 In PRL tasks, contingencies between choices and outcomes reverse so that previously richly reinforced options become lean and vice versa. These tasks incentivize individuals to form a set of beliefs that are robust to probabilistic noise but flexible enough to track true change in action-outcome contingencies to maximize reward (Figure 1A). In prior work with human participants, key behavioral metrics of belief connote flexible3 and perseverative behavior.5 “Win switching” occurs when the subject selects a different option on the subsequent trial following a reward, or “win,” which, when frequent, marks excessive flexibility. On the other hand, “lose staying” happens when the same choice is repeated following an unrewarded trial, or “lose,” signifying persistence, which becomes perseverative if sustained. Therefore, paranoia is associated with an increase in win switching, a dearth of lose staying, and an elevated volatility belief.3,4
We can discern such volatility beliefs quantitatively by fitting Bayesian inference (BI) models to participants’ behavior and estimating the parameter values to account for individual patterns of choices.3 This computational modeling approach to study brain and behavior has been employed to better understand various aspects of human psychopathology, including paranoia. While traditional reinforcement learning models have provided insight into choice behavior in PRL tasks, there is evidence to suggest that BI models may be better equipped to capture rapid behavioral change in response to volatility (e.g., a reversal event).6,7 Indeed, BI models seem necessary to capture the elevated win-switching behavior observed with paranoia.3 It has been proposed that BI models might detect these nuances of behavior in PRL tasks by tracking latent (i.e., hidden) states, the unobservable features that dictate an environment’s underlying dynamics, which may allow more rapid recognition of state changes and more efficient behavioral adaptation.8,9 Two components are critical in such a model: (1) participants’ assumptions about how the task works and (2) how those assumptions lead to their decisions. One such BI model that is designed to capture volatility beliefs is the hierarchical Gaussian filter (HGF; Figure 1B).6,10 There are two model parameters that are particularly central to volatility processing: (1) an equilibrium value that attracts and stabilizes the agent’s estimate of volatility (m3, volatility belief; higher values indicate greater expectation of change) and (2) the rate of adjustment of beliefs about the values of each option (ω2, value learning rate; higher values indicate more rapid learning). In recent work, m3 has been shown to relate to risk for psychosis.6
It is of course possible that other model structures and types may yield different insights to paranoia in humans and reversal behavior in animals. The exercise here is to use a computational model whose parameters relate to paranoia in humans to model behavior in nonhuman primates to connect those literatures. Other connections may become apparent in future work.
Several brain regions and circuits have been implicated in the formation and updating of such volatility processing11 and/or paranoia in PRL tasks (reviewed in Soltani and Izquierdo12). fMRI, particularly in patients with paranoid persecutory delusional beliefs, has suggested that this volatility belief parameter (m3) correlates with the engagement of the dorsolateral prefrontal cortex (DLPFC) during PRL tasks.6 Data in nonhuman primates suggest that lesions of the magnocellular mediodorsal thalamus (MDmc) may also recapitulate the associated winswitching pattern.13 We note that MDmc receives inputs from the DLPFC, in addition to inputs from the ventral PFC,14,15 which may support the increased win-switching behavior after MDmc lesions. Animal work has shown that the thalamus, in partnership with the cortex, is a key nexus of perception and action16,17 and a mediator of behavioral change in the face of evolving contingencies.18 Yet, the MDmc has not been extensively implicated in human paranoia and thus warrants further investigation (but seeCrail-Melendez et al.19). Aspiration lesions of the orbitofrontal cortex (OFC) in nonhuman primates have been shown to disrupt PRL performance,20,21 though these effects may be more attributable to parts of the ventrolateral prefrontal cortex (VLPFC).22 Resting-state functional connectivity studies suggest that the OFC also plays a role in human paranoia.23 However, the HGF model has yet to be applied to choice behavior in PRL tasks performed by nonhuman primates, which could elucidate the neural underpinnings of volatility beliefs relevant to human paranoia.
In the present work, we sought to infer the potential role of the MDmc and OFC in paranoia by estimating volatility belief in a PRL task with the HGF model in monkeys with lesions to these regions. We hypothesized that damage to brain regions involved in action and perception would lead to a pattern of changes in PRL behavior and volatility beliefs in monkeys that are associated with paranoia in human participants. By comparing the effects of MDmc and OFC lesions on actions that can be attributed to volatility belief, we can begin to elucidate the functional contributions of different brain regions, and possibly neural circuits, to belief updating and, transitively, to paranoia in human participants. Ultimately, the identification of human participants with similar patterns of behavior as lesioned monkeys, and examination of how those patterns relate to self-reported paranoia symptoms (Figure 1C), can afford us the ability to draw inferences about the specificity of differential relationships between task behavior, computational model parameters, brain lesions, and psychopathology.
RESULTS
To understand the contributions of the MDmc and OFC to beliefs about volatility in the primate brain, we analyzed existing datasets from multiple labs that administered the same three-choice PRL task to rhesus macaques with selective, excitotoxic lesions of the MDmc or the OFC (STAR Methods). We focused particularly on win-switch and lose-stay behavior and beliefs about volatility using a BI model.
Flexibility and perseveration following brain lesions
We examined win-switch and lose-stay rates in monkeys with excitotoxic lesions to either the MDmc (n = 3) or OFC (n = 3) compared with non-lesioned controls (n = 14) (Figure 2A). We observed a significant two-way interaction between the lesion group (OFC, MDmc, or control) and reversal phase (before or after reversal) on win-switch behavior (Figure 2B, left; χ2 = 89.67, p < 0.001; generalized linear mixed model [GLMM]). To resolve this interaction, we first assessed group effects within each reversal phase. We identified an effect of lesion group on win switching before the reversal (χ2 = 61.17, p < 0.001) and after the reversal (χ2 = 580.6, p < 0.001). Pairwise comparisons within the pre-reversal phase revealed that, whereas MDmc-lesioned monkeys exhibited elevated win-switching behavior compared to non-lesioned controls (χ2 = 57.6, p < 0.001) and OFC-lesioned monkeys (χ2 = 32.8, p < 0.001), OFC-lesioned monkeys did not differ from non-lesioned controls (χ2 = 0.14, p = 0.712). Pairwise comparisons within the post-reversal phase revealed differences between all groups; MDmc-lesioned monkeys again showed greater win-switching behavior compared to non-lesioned controls (χ2 = 410.22, p < 0.001) and OFC-lesioned monkeys (χ2 = 486.63, p < 0.001), and here we observed lower win switching in OFC-lesioned monkeys versus non-lesioned controls (χ2 = 96.06, p < 0.001). We observed an effect of reversal phase on win switching in MDmc-lesioned monkeys (χ2 = 272.47, p < 0.001) but not OFC-lesioned monkeys (χ2 = 1.58, p = 0.209), which was characterized by a more dramatic increase in win switching (mean difference = 0.169, p < 0.001) from pre to post reversal than in non-lesioned controls (χ2 = 324.19, p < 0.001).
We also observed a similar two-way interaction between lesion group and reversal phase on lose-stay behavior (Figure 2B, right; χ2 = 41.45, p < 0.001). An effect of lesion group on lose-stay behavior was observed both before reversal (χ2 = 78.13, p < 0.001) and after reversal (χ2 = 96.78, p < 0.001). Compared to non-lesioned controls, lose-stay rates were altered both pre and post reversal in MDmc-lesioned (pre reversal: χ2 = 56.9, p < 0.001; post reversal: χ2 = 79.92, p < 0.001) and OFC-lesioned monkeys (pre reversal: χ2 = 37.19, p < 0.001; post reversal: χ2 = 6.17, p = 0.013). Lose-stay behavior did not differ between the two lesion groups before the reversal (χ2 = 0.85, p = 0.356), while we did see differences in lose-stay behavior after the reversal (χ2 = 68.08, p < 0.001). The effect of reversal within each group revealed a marked decrease in lose-stay behavior in both MDmc-lesioned monkeys (χ2 = 49.76, β = −0.58, p < 0.001) and non-lesioned controls (χ2 = 115.75, β = −0.48, p < 0.001), whereas a trend-level increase in lose-stay behavior was observed in OFC-lesioned monkeys (χ2 = 3.16, β = 0.18, p = 0.076). These data, collectively, indicate that lesions to the MDmc were associated with erratic switching behaviors regardless of reversal phase. Lesions of the OFC, however, were associated with inflexible choice behavior that is reversal phase dependent.
A double dissociation of volatility and value
To investigate the impact of brain lesions on how beliefs are updated under uncertainty, we applied the HGF model to trial-by-trial choice data from the MDmc lesion, OFC lesion, and non-lesion control monkeys (Figure S1). Computational modeling of behavior revealed distinct effects of lesion group on volatility belief and value learning (Figure 3). First, reversal enhanced volatility beliefs in MDmc-lesioned (m3; χ2 = 7736.09, p < 0.001; Figure 3A) and non-lesioned control monkeys (χ2 = 37.9 p < 0.001; Figure 3A), whereas lesions to the OFC blocked the increase in volatility beliefs associated with reversal (χ2 = 0.38, p = 0.535; Figure 3A). Furthermore, we observed a marginally greater increase in volatility beliefs in MDmc-lesioned compared to non-lesioned control monkeys (mean difference = 1.33, p = 0.081). With regard to learning about value, monkeys with OFC lesions and non-lesioned controls exhibited increased value learning post reversal relative to pre reversal (ω2; OFC: χ2 = 5.70, p = 0.017; non-lesion: χ2 = 21.50, p < 0.001; Figure 3B), whereas lesions in MDmc blocked the increase in value learning rate that accompanied reversal in the other groups (χ2 = 1.09, p = 0.296; Figure 3B).
Pairwise comparisons of effects of lesion group on volatility beliefs (Figure 3A) within the pre-reversal phase revealed that, whereas MDmc-lesioned monkeys exhibited elevated volatility belief compared to non-lesioned controls (χ2 = 3.99, β = 0.99, p = 0.046), OFC-lesioned monkeys did not differ from non-lesioned controls (χ2 = 0.40, p = 0.526) or from MDmc-lesioned monkeys (χ2 = 0.47, p = 0.495). Pairwise comparisons within the post-reversal phase revealed differences between all groups; MDmc-lesioned monkeys showed enhanced volatility beliefs compared to non-lesioned controls (χ2 = 15.43, β = 2.33, p < 0.001) and OFC-lesioned monkeys (χ2 = 61.42, β = 3.72, p < 0.001), and here we observed a decrease in volatility beliefs in OFC-lesioned monkeys compared to non-lesioned controls (χ2 = 5.29, β = −1.40, p = 0.021).
Pairwise comparisons of effects of lesion group on value learning (Figure 3B) within the pre-reversal phase revealed that, whereas OFC-lesioned monkeys exhibited an increase in value learning compared to non-lesioned controls (χ2 = 4.68, β = 1.26, p = 0.030) and MDmc-lesioned monkeys (χ2 = 8.98, β = 1.72, p = 0.003), MDmc-lesioned monkeys did not differ from non-lesioned controls (χ2 = 0.75, p = 0.386). Within the post-reversal phase, MDmc-lesioned monkeys exhibited reduced value learning relative to non-lesioned controls (χ2 = 18.74, p < 0.001) and OFC-lesioned monkeys (χ2 = 19.86, p < 0.001), whereas OFC-lesioned monkeys did not differ in value learning from non-lesioned controls (χ2 = 1.54, p = 0.214). This dissociation implies distinct neural and computational mechanisms underlying actions attributable to volatility and value beliefs; MDmc lesions disinhibited volatility beliefs but blunted sensitivity to the reversal for updating value learning, whereas OFC lesions disinhibited value learning but blunted sensitivity to the reversal for updating volatility beliefs. Given that lesions in these areas lead to opposing effects on these belief-updating parameters, this established a double dissociation between the function of MDmc and OFC regions.
In summary, MDmc-lesioned monkeys demonstrated increased win-switching and reduced lose-stay behavior after reversal; OFC-lesioned monkeys showed the opposite behavior. In terms of belief updating, MDmc lesions increased volatility beliefs, notably post reversal, while OFC lesions blocked changes in volatility beliefs; however, OFC lesions elevated value learning, while MDmc lesions blocked changes in value learning. These findings confirm a double dissociation. See Figure S2 for more fine-grained, within-monkey analyses.
Translational insights from monkey behavioral neuroscience to human psychopathology
In order to gain clinically relevant insights from the changes in beliefs about volatility and value learning parameters due to MDmc and OFC lesions, we compared PRL behavior of lesioned monkeys to that of humans with paranoia (Figure 4), as human paranoia has been associated with erratic switching behavior and an elevated sense of uncertainty about the environment.3 We analyzed existing datasets in human participants who completed either of two types of PRL tasks: a single-reversal task whose structure matches the monkey version24 and a multi-reversal task.3 The shift-induced increase in unexpected volatility, in the multi-reversal task, has been shown to increase sensitivity in detecting paranoia group differences in task performance.3
Paranoid participants evinced higher win switching pre and post reversal, a pattern also observed in the monkeys with MDmc lesions, in both versions of the task (Figures 4A and 4B). In the single-reversal task, we found a significant interaction between paranoia group and reversal phase for win-switch behavior (Figure 4A; χ2 = 18.11, p < 0.001, GLMM). In particular, we observed significantly more post-reversal win switching in paranoid compared to non-paranoid human participants (χ2 = 10.43, p = 0.001) but no pre-reversal, win-switching difference between paranoia groups (χ2 = 0.05, p = 0.818). On the other hand, in the multi-reversal task, we found a greater separation than for the single-reversal task in win-switching behavior between paranoid and non-paranoid human participants (Figure 4B; pre-shift: χ2 = 112.23, p < 0.001; post-shift: χ2 = 74.74, p < 0.001; Figure S3). Presumably, this difference occurs because participants are experiencing the first contingency reversal just like the monkeys completing the single-reversal task. However, in the multi-reversal version, reversals continued prior to and following the contingency shift, which we believe would increase expected and unexpected volatility.
Similarly, we estimated volatility beliefs and value learning in paranoid and non-paranoid human participants completing both versions of the PRL task. The pattern of elevated volatility beliefs and reduced value learning, observed in the monkeys, was reflected in both versions of the PRL task in human participants but more clearly in the multi-reversal version. In the single-reversal task, no effects of paranoia were identified for volatility belief (Figure 4C, top). In contrast, in the multi-reversal task, high paranoia substantially augmented overall volatility belief (Figure 4D, top; m3: χ2 = 61.16, p < 0.001), while shift marginally decreased it (m3: χ2 = 3.02, β = −0.09, p = 0.08). Thus, the multi-reversal task was more sensitive to differences in volatility belief between paranoia groups, regardless of shift, which mirrored the effects seen in MDmc-lesioned monkeys.
For value learning, an interaction between paranoia group and reversal phase was found in the single-reversal task (Figure 4C, bottom; ω2 : χ2 = 4.41, p = 0.036; GLMM). Lower value learning was observed in high-paranoia individuals within post but not pre reversal (ω2 : χ2 = 7.02, p = 0.008), whereas the reversal increased value learning within low- but not high-paranoia individuals (ω2 : χ2 = 46.05, p < 0.001). In the multi-reversal task, an interaction between paranoia group and reversal phase was also observed (Figure 4D, bottom; ω2 : χ2 = 6.43, p = 0.011; GLMM). However, unlike the single-reversal version, high-paranoia individuals had lower value learning than their low-paranoia counterparts within both pre (ω2 : χ2 = 27.34, p < 0.001) and post shift (ω2 : χ2 = 7.22, p = 0.007). The shift increased value learning only within the high-paranoia group (ω2 : χ2 = 4.62, p = 0.032). Overall, lower value learning was observed in the high-paranoia groups, much like MDmc-lesioned monkeys.
DISCUSSION
It is controversial whether it is feasible to model psychiatric symptoms like paranoia in nonhuman animals. Presently, we met that challenge and conclude that nonhuman primates do indeed display choice responses akin to participants with high paranoia during a PRL task, and, thus, they may serve as models for the exploration of psychiatric symptoms hitherto considered outside the realm of translational neuroscience. We describe a series of analyses of PRL datasets from monkeys and human participants. We found that, in a single-reversal PRL task, monkeys with MDmc lesions but not OFC lesions exhibited increased win-switch behavior, increased volatility beliefs, and decreased value learning rates, especially after the reversal in reward contingencies. By contrast, OFC but not MDmc lesions increased lose-stay behavior and led to increased value learning rates while blunting updating of volatility belief. This pattern of behavioral responses and altered neurocomputations observed in monkeys with MDmc lesions (i.e., elevated volatility belief) was similar to that observed in human participants with high levels of paranoia. Indeed, the pattern of win-switching and volatility belief change was consistent across two independent datasets that employed different reinforcement contingency schemes. In cognitive neuropsychology, double dissociations are essential for the identification of independent functions. We report a double dissociation in the behavioral effects and computational consequences of MDmc versus OFC lesions on belief updating.
The challenge with PRL is to harbor a set of beliefs that is robust to noise but sensitive to real underlying contingency change. It appears that lesions of the MDmc and OFC have doubly dissociable effects on these processes. Specifically, in a single-reversal task, win-switching behavior and volatility beliefs increased in response to the contingency reversal in control animals, whereas lose-stay rates decreased. This represents sensitivity to contingency change and the concomitant increase in volatility of the environment. Monkeys with OFC lesions fail to show this increase. This manifests as greater post-reversal lose-stay behavior. Conversely, monkeys with MDmc lesions make choices that are suggestive of elevated volatility beliefs both prior to reversal and a greater increase post reversal compared to control and OFC-lesioned animals, which is paralleled by the higher rates of win switching in the MDmc-lesioned animals. Furthermore, value learning rates increased post reversal in control animals, an effect mirrored in monkeys with OFC-lesions too. In contrast, monkeys with MDmc lesions fail to show this effect. It is possible that a higher-level task structure (level 3, HGF; Figure 1B), rather than value (level 2, HGF; Figure 1B), is more salient in MDmc-lesioned animals. In addition, in the absence of appropriate value learning, optimal choice responses may become noisier in MDmc-lesioned animals.25,26
Taken together, these data highlight the differential roles of MDmc and OFC nodes in belief processing. MDmc-lesioned animals appear to confuse stochasticity for real change (win switching more pre reversal, and even more post reversal), whereas OFC-lesioned animals do not update their beliefs in response to the volatility occasioned by reversal. Such maladaptive behavior in MDmc-lesioned monkeys manifests functionally as a failure to optimally learn complex reward associations and, thus, exploit the most rewarding option; distorted volatility beliefs could disrupt that ability to more frequently choose the highest probability reinforced option. Alternatively, maladaptive behavior manifests in OFC-lesioned monkeys as perseverative responding. Such perseveration is not driven by sticking to outdated value (indeed, changes in value learning rates due to reversal are consistent with non-lesioned controls) but, rather, by a lack of adaptive change in volatility belief in the face of real change. Finally, these data further highlight the overlap between MDmc-lesioned monkey behavior and that of people who are paranoid, who also confuse stochasticity for volatility prior to reversal and have higher m3 (i.e., greater expectation of change) and lower ω2 (i.e., slower value learning).
In addition to similarities, there were also some differences across datasets. While the single-reversal task in humans has superficial structural similarities to the monkey task, the fact that the monkeys underwent multiple reversal sessions (Figure S4) across days (and weeks) suggests that the beliefs that they build and update are more similar to the beliefs that human participants formed and updated during the multi-reversal task. This suggestion is supported by the computational analyses, which showed greater parallels between the MDmc-lesioned monkeys and humans with paranoia on the multi-reversal task; volatility beliefs were increased, and value learning rates were decreased. Hence, we demonstrate the utility of administering similar tasks and models across species and argue that computation may be a lingua franca27 that could connect disparate fields of research and model organisms. However, close inspection of the pre- and post-reversal data shows that monkey win switching increases post reversal. Human win-switching does not increase. We do not know precisely why the monkey and human behavioral data differ in this way. We can speculate that monkeys are more sensitive to rewards than humans are to abstract points. After reversal, this sensitivity could drive monkeys to switch more often to seek more rewarding options. Alternatively, monkeys and humans may be building differentially complex models of the task. In humans, this added complexity would accommodate reversals and shifts and contingencies, whereas monkeys may be switching erratically in search of an explanatory model.28,29 This would actually be consistent with something we observed in humans. Paranoid participants tend to behave more randomly than non-paranoid participants,3 although not differentially pre and post reversal.
Our data implicate MDmc contributing a role in volatility belief updating. Previous work has linked volatility belief updating to paranoia and persecutory delusions.3,4,6 What is the role (if any) of MD in paranoia and psychosis? There are isolated cases where circumscribed lesions cause new-onset paranoia in humans19,30 and frightening distortions of social stimuli,31 but there are also more diffuse (often mnemonic) problems associated with MD damage, which can, as a result of impaired interpersonal behavior, lead indirectly to paranoia. Alternatively, genetic changes and the atrophy in frontotemporal dementia (FTD) are often quite specific to MD,32 and psychotic symptoms (specifically paranoia) are common in FTD.25,33 We suggest that our data are more aligned with a direct impact of thalamic damage on belief updating rather than a secondary impairment in relational cognition, which, although possible in monkeys, would not explain the specific pattern of findings we report presently because relational cognition and social interaction were not part of the present task. Furthermore, our human data suggest that the differences in high vs. low paranoia are not more readily attributable to confounding differences in mood or general intellectual function (Figure S5).
In prior work we argued that, because our PRL task was relatively non-social, and rats are a relatively asocial species (but not completely asocial),34–36 the similarity in behavior between paranoid humans and rats treated with methamphetamine was consistent with a non-social explanation of paranoia. Our non-social result did not differ when we made the PRL more social,4,37 although participants did imbue the non-social stimuli with harmful intentions, so social cognition may have been in play. Nonhuman primates are more social than rodents. Sociality is often related to theory of mind (ToM)–the cognitive capacity to attribute mental states to others.38 This mentalizing process has been demonstrated in macaques39 and apes.40 However, ToM in humans and nonhuman primates may differ substantially; macaques do not attribute mental states to the Heider and Simmel animations.41 Furthermore, putative ToM-like capacities in primates seem to fall short of human abilities.42 ToM is associated with dorsomedial prefrontal cortex (DMPFC) activity in humans43 and macaques.39 The MDmc projects directly to the DMPFC.44 Disrupting the MDmc may perturb social valuation and inference in the DMPFC. However, the DMPFC may also be less social specific than previously believed.45–47 Indeed, a recent nonhuman primate study suggested that the DMPFC is involved in tracking and weighting the reliability of social and non-social information,48 and in humans, the DMPFC may be tracking uncertainty rather than ToM.49 Furthermore, human fMRI studies of the PRL suggest DMPFC responses during non-social reversal learning.5,50 It is possible, on one hand, that mediodorsal thalamus (MD) disruption could lead to abnormal ToM processing in the DMPFC via spurious inputs to the DMPFC. On the other hand, social belief volatility may reside in an encapsulated module independent from MD-mediated computations. Non-social environmental volatility is often attributed to social agents. Domain specificity may manifest at the level of algorithm (beliefs about environmental volatility vs. beliefs about agents’ intentions) or at the level of implementation (domain-general vs. domain-specific nodes and circuits).51 The involvement of the MD in the animism bias bears further investigation, although the bias seems absent in macaques.41 Even if the underlying mechanisms of paranoia are not inherently social, the ramifications of their perturbation often bear social implications.51 When people perceive or expect excess volatility, they typically find that anxiogenic and seek to explain it away. Typically, volatility is ascribed to other agents52 and their intentions. When the world feels unpredictable, paranoid beliefs arise.4 Consequently, our findings on the MDmc open an avenue to further explore the potential intersections and divergences between belief volatility and paranoia. Future work ought to leverage these tasks and models as well as other computational approaches to further elucidate the domain generality or specificity of paranoia.
Our findings have important implications for understanding the underlying mechanisms of paranoia and, relatedly, psychosis and schizophrenia. While prior work has implicated the OFC23,53 and the DLPFC6 in paranoia as well as the link between paranoia and reversal learning, our lesion findings highlight that the MDmc is a necessary node for controlling volatility belief. Future work ought to explore the human MDmc in the context of paranoia and its treatment. Furthermore, the widespread projection targets of MDmc within the prefrontal cortex may well prove relevant to belief updating; for example, the VLPFC. In the case of thalamo-prefrontal dynamics, it has been posited that damage to the thalamus often recapitulates the effects of prefrontal damage.54 This was not what we observed with OFC damage. Even though the OFC projects directly to the MDmc, we found a double dissociation of the effects of OFC and MDmc damage on behavior and computational model parameters, suggesting that they underwrite independent, dissociable functions. We further explored the effects of VLPFC lesions on PRL task behavior (Figure S6). VLPFC lesions had effects redolent of MDmc lesions on behavior, elevating win switching and volatility beliefs. This is consistent with the hypothesized relay function between the thalamus and PFC in the thalamo-prefrontal-cortical circuits. What, then, are we to conclude about OFC and MDmc function? Rodent work suggests that the direction of coupling between brain regions is relevant to the circuit processing of reversal learning–damage to projections from the OFC has different effects than damage to projections to the OFC.55 Similar manipulations of the projections between the MDmc and OFC will be revelatory in this regard. Furthermore, it is possible that the effects of circumscribed lesions have differential effects on some other brain region(s) with which both the OFC and MDmc interact. One possibility is the nucleus accumbens.55
More broadly, the current results suggest that not all PRL tasks are equally sensitive to paranoia in humans. While the overall patterns of behavioral response and model parameter change are consistent across tasks and species, the multi-reversal version seemed to separate paranoid from non-paranoid participants more clearly and to cleave more closely to the patterns observed in monkeys. This task type specificity implies that future studies intended to predict paranoia or track its prognosis might result in most predictive data when applying the multi-reversal version of the task. Based on the evidence provided here from MDmc-lesioned monkeys, we predict that human neuroimaging studies would detect an MDmc involvement in the more complex, multi-reversal version, which may be more difficult to detect using the single-reversal task. Furthermore, human participants with MDmc damage are predicted to be particularly challenged by a multi-reversal version of the PRL task.
Previous work has established an association between belief updating and paranoia through computational modeling.3,4,56 Here, we map computational markers of belief updating to specific brain regions in monkeys, where the MDmc and OFC play dissociable roles. Parallels between MDmc-lesioned monkeys and paranoid human participants suggest an influence of the MDmc in paranoia that warrants further investigation. Our study represents an initial cross-species effort to use computational modeling and common behavioral tasks to translate the neural mechanisms of belief updating from monkeys to humans and, ultimately, into the clinic.
Limitations
While we observed similarities between lesioned monkeys and humans with paranoia performing PRL, there were differences in behavioral patterns and in the tasks to which participants were exposed. Future work might present participants of both species with a multi-reversal task featuring a contingency shift. Furthermore, though our aim was to extend existing human computational psychiatry into the nonhuman primate realm, the possibility exists that a better-fitting model exists that spans these datasets. That will be a target for future investigation. Finally, the human imaging and neuropsychology literatures implicate the mediodorsal thalamus in paranoia and schizophreniform psychosis; however, this association bears investigation using PRL tasks, functional imaging, and computational modeling in human participants and patients in particular.
STAR★METHODS
RESOURCE AVAILABILITY
Lead contact
Further information and requests for resources and reagents should be directed to and will be fulfilled by either of the co-senior authors, Philip R. Corlett (philip.corlett@yale.edu) and Steve W. C. Chang (steve.chang@yale.edu).
Materials availability
This study did not generate new unique reagents.
Data and code availability
Monkey behavioral data presented in this paper will be available upon request. Human behavioral data from the single-reversal PRL task24 and the multi-reversal with contingency shift PRL task data4 are publically available. The analysis scripts for this paper can be found at https://github.com/psuthaharan/belief-update-monkeys.
EXPERIMENTAL MODEL AND SUBJECT DETAILS
All monkey and human experiments in this study were in accordance with local and national acts and committees for the use of animals in scientific research and behavioral research in humans. Experiments from Study 121 and Study 213 were performed in compliance with the United Kingdom Animals (Scientific Procedures) Act of 1986. A Home Office (UK) Project License (PPL 30/2678) obtained after review by the University of Oxford Animal Care and Ethical Review Committee licensed all procedures. The housing and husbandry were in compliance with the guidelines of the European Directive (2010/63/EU) for the care and use of laboratory animals.
All procedures from Study 322 were reviewed and approved by the National Institute of Mental Health (NIMH) Animal Care and Use Committee.
For one dataset,4 all human behavioral experiments were conducted at the Connecticut Mental Health Center in strict accordance with Yale University’s Human Investigation Committee who provided ethical review and exemption approval (no. 2000026290). Written informed consent was provided by all research participants. For the other dataset,24 all human behavioral experiments were internally reviewed and approved by the Research Ethics Committee at King’s College London, UK (ref: RESCM-19/20–0603). Human participants gave consent by ticking checkboxes online following the information sheet, and prior to the administration of questionnaires or tasks.
Monkeys and humans
A total of twenty rhesus macaque monkeys (Macaca mulatta, all males) across three laboratories and a total of 1,225 online human participants from two laboratories were included in the study. Each animal was individually-, pair-, or group-housed, and was kept on a 12-h light dark cycle and had access to water 24 h a day. All experiments were conducted during the light phase.
Monkeys
Fourteen monkeys – three from Study 1, three from Study 2, and eight from Study 3 – served as unoperated controls. Six monkeys – three from Study 2 and three from Study 3 – received excitotoxic lesions of different parts of the brain.
Humans
We had 692 online human participants (collected via Prolific), of whom 84 had high paranoia, who completed the single-reversal task of the task most similar to the monkey task.24 We had 533 online human participants (collected via CloudResearch), of whom 140 had high paranoia, who completed a multi-reversal version of the task.4
METHOD DETAILS
Surgical and lesion procedures for monkeys
Surgical procedures for the monkeys have been previously described in detail.13,22 Aseptic neurosurgery was conducted under general anesthesia (isoflurane, 1-2% to effect), in a dedicated operating theater and with pre- and post-operative analgesia and antibiotics. In each animal that received a lesion, during the surgery the skin and fascia were opened, the muscles retracted, and a bilateral bone flap was taken in the cranium over the target region. The dura over the posterior part of the hemisphere was cut and retracted to the midline. The splenium of the corpus callosum was then sectioned and the choroid plexus cut to provide access to and enable visualization of the posterior medial thalamus. Injections were then made as described below. A semi-circular dural flap was reflected toward the orbit to allow access to the ventral surface of the frontal lobe in each hemisphere. Injections were then made into the OFC on the basis of sulcal landmarks as described below.
Magnocellular mediodorsal thalamus (MDmc)
In brief, the procedure focused on the dorsal thalamic nuclei, specifically the magnocellular mediodorsal thalamus (MDmc) in the monkey brain. The coordinates of the intended lesion site (see Figure S7) within the mediodorsal thalamus were taken from a comprehensive monkey brain atlas.58 Moreover, evidence from prior studies13,59 on the use of ibotenic acid and NMDA for successful mediodorsal thalamic lesions in monkeys supported this surgical procedure. Ten x 1.0ul injections (5 per hemisphere) of ibotenic acid/NMDA, spaced at least 1 mm apart were made into the thalamus after visualizing the anterior-posterior extent of the MD in monkeys’ MD1, MD2, and MD3. MD lesions were confirmed histologically.13
Orbitofrontal cortex (OFC)
The procedure targeted Walker’s areas 11, 13, and 14 in each hemisphere of the granular orbitofrontal cortex of the monkey brain. Injections of ibotenic acid were made into each hemisphere in separate surgeries with at least two weeks recovery time between surgeries. In each surgery, multiple 0.1 ul ibotenic acid injections were made into the cortex on the orbital surface, specifically between the fundus of the lateral orbital sulcus and the rostral sulcus on the medial surface of the hemisphere. The anterior-posterior boundary of the lesion was the anterior and posterior ends of the medial and lateral orbital sulci. In each hemisphere, between 71 and 119 injections were made to cover the entirety of the OFC. The extent of the bilateral excitotoxic lesions in each monkey (see Figure S7) were verified by observing white hypersignal associated with edema in T2-weighted MRI scans taken within 5 days of the surgery and T1-weighted scans taken >1 year after the initial surgery.22
In sum, the monkeys we chose to study had bilateral excitotoxic lesions to their magnocellular mediodorsal thalamus (MDmc) and orbitofrontal cortex (OFC). Because we focused on win-switch and lose-stay behavior for estimating belief volatility, one animal from Study 3 was excluded from the present analysis as he was a formal outlier with regards to his win-switch behavior (see Figure S8).
Questionnaire for human participants
Human participants completed a revised Green et al. paranoid thought scale (R-GPTS) as a means to collect self-report paranoia levels.60 The paranoia group labels were based on subscale B of the R-GPTS; that is, high paranoia labels were based on a clinical cut-off of ≥ 11 (this threshold discriminates persecutory delusions from non-clinical paranoia, see Figure S9 for link between paranoia and PRL task behavior). Self-report depression levels were measured using the Beck Depression Inventory II (BDI-II) scale,61 excluding ‘suicidal thoughts’ item as per protocol; a clinical cut-off of ≥ 17 for clinical depression (i.e., high depression).
Three-choice PRL task
Single-reversal
A PRL task – with three options to choose from, and a single reversal of the reward probabilities of the options – was used in this study for the behavior of both monkeys and human participants.
Monkeys
This task has been described extensively elsewhere.13,20–22,62 In brief, while inside a wheeled transport cage, monkeys were positioned in front of a touch sensitive monitor. On each trial they were presented with three stimuli that were novel at the beginning of each 300-trial session. When monkeys touched one of the stimuli on the screen, a reward (190 mg food pellets, Noyes) was delivered based on a predetermined probabilistic reward schedule. Initially, one stimulus was associated with the highest probability of reward delivery. At around trial 150, there was a reversal in reward contingencies such that the stimulus associated with highest probability of reward became the lowest and the stimulus associated with the lowest changed to being associated with the highest probability of reward. For this report we analyzed one of four reward schedules that was similar to the human task: ‘stable’. This corresponded to Schedule 1 from a previous study22; pre-reversal reward contingencies followed a 0.61-0.2-0 pattern and changed to a post-reversal reward contingency of 0.19-0.44-0.76 pattern such that the most rewarding stimuli became the least rewarding stimuli and vice versa. Each monkey completed the stable version of the task for a total of 300 trials per session for five sessions (monkeys in the MDmc study completed five sessions so we selected the first five sessions from the OFC study). After each trial there was either a 2 s (Studies 1 and 2) or a 5 s (Study 3) intertrial interval before the next trial began. For Studies 1 and 2, a lunch box containing the monkey’s food for the day was opened at the conclusion of the 300-trial session whereas in Study 3, monkeys were given their daily food amount in their home cage.
Humans
The performance of the monkeys was compared to that of human participants who completed a similar version of the PRL task.24 This non-social PRL task involved presenting participants with a series of symbols across 60 trials. Each symbol yielded either a positive outcome (+10 points) or a negative outcome (−5 points). At the beginning of the task, participants were informed that among the three symbols, one had a high likelihood (80%) of providing +10 points, another had an even chance (50%), and the remaining one had a low probability (20%) of yielding +10 points. Crucially, participants were made aware that the probabilities associated with each symbol could change unpredictably during the course of the task. At trial 30, participants were explicitly asked to indicate which symbol they believed offered the highest probability of delivering points. Following this judgment phase, the contingencies of the symbols were altered for the final 30 trials – the symbol previously associated with the lowest probability of gaining points became the one with the highest probability, the symbol with the highest probability became the even probability symbol, and the symbol with an even probability became the one with the lowest probability. Upon task completion, participants were once again prompted to identify the symbol they thought had provided the most points throughout the task.
Multi-reversal
This version of the task is similar in theory but different in structure.3 Human participants completed this PRL task in two different scenarios – one with a non-social aspect using a deck of cards another with a social aspect using avatars representing partners. In the card deck scenario, participants were asked to choose from three decks – a positive (+100) or negative (−50) outcome – with the goal of earning the most points. It was also mentioned that the deck with the highest reinforcement probability could change. In the partner scenario, participants were asked to choose from three avatars and to imagine working on a group project with them. The avatars could represent helpful (+100) or hurtful (−50) partners and the partner that would give them the best chance to succeed on a project could also change. Data between these two versions were collapsed for the present analysis based on prior findings of no task performance differences.4 However, where things differ in this version from the single-reversal PRL, is 2-fold. The first difference is this idea of multiple reversals – every 40 trials participants were provided a break, following which probabilities automatically reassigned. In addition, there were performance-based reversals – after every 9 out of 10 consecutive selections of the highest reinforcement probability deck, unbeknownst to the participants, the underlying probabilities of receiving rewards were altered. The second difference is this idea of a shift (that occurs halfway through the experiment) – the contingencies of rewards initially follow a 90-50-10 pattern for the first 80 trials but switch to a 80-40-20 for the last 80 trials, making it more difficult to distinguish whether a loss was due to chance or a change in the best deck (or partner). Thus, at this halfway mark, individuals experience both a reversal and a shift.
Quantification and statistical analysis
Behavioral choice analysis
We quantified strategies of behavior on the task similar to prior work.3 The likelihood that participants would choose alternative options after positive feedback (win-switch) and select the same option after negative feedback (lose-stay) was measured. The rate of win-switch behavior was calculated as the total number of trials in which participants switched after positive feedback divided by the number of trials in which they received positive feedback. Similarly, the rate of lose-stay behavior was calculated as the total number of trials in which a participant persisted after negative feedback divided by the total negative feedback trials. The rates of win-switch and lose-stay behavior on each trial was calculated using a lagged moving average approach with a 20-trial window. This means that for each binned trial, we computed the average rate based on the previous 20 trials. This lagged moving average is depicted in Figure 2B (and 4A, 4B), where reversal (and shift in the multi-reversal version) is marked at the mid-way period of the experiment.
Computational modeling
We used a mean-reverting HGF model.6 In this model (Figure 1B), the component that incorporates task beliefs is called ‘the perceptual model’. The component that governs how beliefs are converted into choices is called ‘the response model’. The perceptual model has three hierarchical layers of belief about the task. The layers interact and influence one another through learning rate parameters. At the highest level (i.e., level 3), the model captures beliefs about changes in the task environment (i.e., how are values of the choices changing over time?). Level 2 characterizes beliefs on reward probabilities (i.e., the tendency of a choice to be rewarding). Level 1 characterizes task reward feedback (i.e., win or loss). These three levels of belief are then integrated and fed through a sigmoid response function to produce a decision (i.e., whether to either stay with the same option or switch to a different one). To calculate subject-specific estimates of belief about volatility (m3) and belief about reward value learning (ω2), we fit the Hierarchical Gaussian Filter (HGF) to the three-choice PRL data.57 We estimated these perceptual parameters for the first (monkeys: trials 1–150; single-reversal humans: trials 1–30, multi-reversal humans: trials 1–80) and second (monkeys: trials 151–300; single-reversal humans: trials 31–60, multi-reversal humans: trials 81–160) halves of the task. It is important to note that we did not model these two-halves separately; we used the estimated priors from the first half of the task as priors for the second half, thereby accounting for the mid-way contingency shift in a unified model approach (see Figure S10). Each of the agent’s choice (i.e., option 1, 2, or 3) and outcomes (reward or no reward) were entered as separate column vectors with rows corresponding to trials. Reward was encoded as ‘1’, no reward as ‘0’, and choices as ‘1’, ’2′, or ‘3’. We modified the perceptual model configuration file (tapas_hgf_ar1_binary_mab_config.m) to reflect the ‘winning’ model which uses a mean-reverting HGF perceptual model (refer to Table S1a in6) and a soft-max-mu03 decision model (tapas_softmax_mu3.m). Parameter recovery. We fit our HGF model to the observed PRL task data to estimate parameters that describe how an agent’s (monkey or human) beliefs update throughout the experiment. Our approach to demonstrating parameter recovery involved attempting to recover the parameter estimates that were originally determined from the true choice data using simulated task data. For the simulation, task performance for each agent was generated using the true model parameter estimates, the perceptual model, and the decision model. The values applied for the perceptual and decision configuration files were drawn from a prior study (specifically, the M6 – winning model; refer to their Table S1a).6 A dummy ‘zeroth’ trial served as our starting point to simulate the initial response and outcome. The agent’s responses (choice 1,2,or 3) were simulated based on the softmax function, considering the true model parameters and the experienced, simulated outcomes. Meanwhile, the outcomes (whether a reward of 1 or no reward of 0) were generated through a Bernoulli distribution, reflective of the agent’s reward schedules (for instance, the stable schedule for monkeys). For every agent, we created a set of simulated responses and outcomes. This entire process was repeated for i = 10 iterations, yielding 10 unique sets of simulated response-outcome data per agent. Subsequently, we refitted the HGF to the simulated data, which produced a set of belief parameters for each agent during every iteration. These simulated (or recovered) model parameters were correlated with the observed (or estimated) model parameters (see Figure S11) for the equilibrium value of belief volatility (m3) and beliefs about value learning (ω2) parameters, to assess performance of parameter recovery across the different variations of the PRL task for both monkeys and humans. Furthermore, plotting the simulated win-switch and lose-stay rates, at the group level, recapitulated the observed differences between lesion groups (see Figure S12). The HGF toolbox v5.3.1 is freely available for download in the TAPAS package at https://translationalneuromodeling.github.io/tapas. We installed and ran the package in MATLAB and Statistics Toolbox Release 2016a (MathWorks*, Natick, MA).
STATISTICAL ANALYSIS
General
Statistical analyses and effect size calculations were performed with an alpha of 0.05 and two-tailed p-values in RStudio: Integrated Development Environment for R, Version 4.3.0.
GLMMs
To combat non-normality and random effects, we employed generalized linear mixed models (GLMMs) to identify significant group differences (lesion group or paranoia group) in behavior and beliefs. We used a binomial GLMM for behavior, incorporating the counts of win-switch and win-stay for win-switch rate and lose-stay and lose-switch for lose-stay rate, and a Gaussian GLMM for the HGF belief parameters. Significant interaction effects in omnibus models were resolved by generating lower-order models.
Permutation tests
We shuffled the lesion groups to create random permutations of the data. For each permutation, we calculated the test statistic (i.e., difference of means) between the win-switching rates of the MDmc-lesioned monkeys and the non-lesioned control monkeys for post-reversal and pre-reversal trials; mean difference = (μpost,mdmc – μpre,mdmc) – (μpost,control – μpre,control). We repeated this process for n = 100 permutation to build the empirical null distribution. We calculated the p-value as the proportion of permuted test statistics that are greater than or equal to the observed test statistic (in our case, this was 0.169). The p-value represents the probability of obtaining a test statistic as extreme as the observed one, assuming the null hypothesis (H0: no difference between groups). If p-value <0.05, we reject the null hypothesis and conclude that there is a significant difference in win-switching between the lesion groups after the reversal. Lastly, Interquartile Range (IQR) tests were used to label data as outliers.
Supplementary Material
REAGENT or RESOURCE | SOURCE | IDENTIFIER |
---|---|---|
Other | ||
Control lesion data | Walton et al., 201021 | https://doi.org/10.1016/j.neuron.2010.02.027 |
MDmc lesion and control data | Chakraborty et al., 201613 | https://doi.org/10.7554/eLife.13588 |
OFC lesion and control data | Rudebeck et al., 201722 | https://doi.org/10.1016/j.neuron.2017.07.042 |
Multi-reversal PRL human data | Suthaharan et al., 20214 | https://doi.org/10.1038/s41562-021-01176-8 |
Single-reversal PRL human data | Barnby et al., 202224 | https://doi.org/10.1371/journal.pcbi.1010326 |
Software and algorithms | ||
MATLAB (Data modeling) | Mathworks | 2019a |
RStudio (Data visualization) | RStudio | 4.3.0 |
HGF toolbox v5.3.1 | Frässle et al., 202157 | https://translationalneuromodeling.github.io/tapas |
Codes | This paper | https://github.com/psuthaharan/belief-update-monkeys |
Highlights.
Lesions to the mediodorsal thalamus (MD) in monkeys heighten belief volatility
Orbitofrontal cortex (OFC) lesions impair value learning
This is a double dissociation of belief learning between brain regions
Shared computational modeling across species implicates MD in paranoia
ACKNOWLEDGMENTS
This work was supported by the Yale University Department of Psychiatry, the Connecticut Mental Health Center (CMHC) and Connecticut State Department of Mental Health and Addiction Services (DMHAS), and the Kavli Foundation (to P.S.). It was funded by NIMH R01MH12887 (to P.R.C.), NIMH R21MH120799-01 (to P.R.C.), NIMH R01MH128190 (to S.W.C.C.), and NIMH R01MH120081 (to S.W.C.C.). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
Footnotes
SUPPLEMENTAL INFORMATION
Supplemental information can be found online at https://doi.org/10.1016/j.celrep.2024.114355.
DECLARATION OF INTERESTS
P.R.C. is a cofounder of Tetricus Labs Inc., which did not fund this work.
REFERENCES
- 1.Walker EY, Pohl S, Denison RN, Barack DL, Lee J, Block N, Ma WJ, and Meyniel F. (2023). Studying the neural representations of uncertainty. Nat. Neurosci 26, 1857–1867. 10.1038/s41593-023-01444-y. [DOI] [PubMed] [Google Scholar]
- 2.Feeney EJ, Groman SM, Taylor JR, and Corlett PR (2017). Explaining Delusions: Reducing Uncertainty Through Basic and Computational Neuroscience. Schizophr. Bull 43, 263–272. 10.1093/schbul/sbw194. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Reed EJ, Uddenberg S, Suthaharan P, Mathys CD, Taylor JR, Groman SM, and Corlett PR (2020). Paranoia as a deficit in non-social belief updating. Elife 9, e56345. 10.7554/eLife.56345. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Suthaharan P, Reed EJ, Leptourgos P, Kenney JG, Uddenberg S, Mathys CD, Litman L, Robinson J, Moss AJ, Taylor JR, et al. (2021). Paranoia and belief updating during the COVID-19 crisis. Nat. Hum. Behav 5, 1190–1202. 10.1038/s41562-021-01176-8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Albein-Urios N, Chase H, Clark L, Kirkovski M, Davies C, and Enticott PG (2019). Increased perseverative errors following high-definition transcranial direct current stimulation over the ventrolateral cortex during probabilistic reversal learning. Brain Stimul. 12, 959–966. 10.1016/j.brs.2019.02.013. [DOI] [PubMed] [Google Scholar]
- 6.Cole DM, Diaconescu AO, Pfeiffer UJ, Brodersen KH, Mathys CD, Julkowski D, Ruhrmann S, Schilbach L, Tittgemeyer M, and Vogeley K. (2020). Atypical processing of uncertainty in individuals at risk for psychosis. Neuroimage. Clin 26, 102239. 10.1016/j.nicl.2020.102239. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Costa VD, Tran VL, Turchi J, and Averbeck BB (2015). Reversal Learning and Dopamine: A Bayesian Perspective. J. Neurosci 35, 2407–2416. 10.1523/JNEUROSCI.1989-14.2015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Gershman SJ, and Uchida N. (2019). Believing in dopamine. Nat. Rev. Neurosci 20, 703–714. 10.1038/s41583-019-0220-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Eckstein MK, Master SL, Dahl RE, Wilbrecht L, and Collins AGE (2022). Reinforcement learning and Bayesian inference provide complementary models for the unique advantage of adolescents in stochastic reversal. Dev. Cogn. Neurosci 55, 101106. 10.1016/j.dcn.2022.101106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Mathys CD, Lomakina EI, Daunizeau J, Iglesias S, Brodersen KH, Friston KJ, and Stephan KE (2014). Uncertainty in perception and the Hierarchical Gaussian Filter. Front. Hum. Neurosci 8, 825. 10.3389/fnhum.2014.00825. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Massi B, Donahue CH, and Lee D. (2018). Volatility Facilitates Value Updating in the Prefrontal Cortex. Neuron 99, 598–608.e4. 10.1016/j.neuron.2018.06.033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Soltani A, and Izquierdo A. (2019). Adaptive learning under expected and unexpected uncertainty. Nat. Rev. Neurosci 20, 635–644. 10.1038/s41583-019-0180-y. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Chakraborty S, Kolling N, Walton ME, and Mitchell AS (2016). Critical role for the mediodorsal thalamus in permitting rapid reward-guided updating in stochastic reward environments. Elife 5, e13588. 10.7554/eLife.13588. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Mitchell AS (2015). The mediodorsal thalamus as a higher orderthalamic relay nucleus important for learning and decision-making. Neurosci. Biobehav. Rev. 54, 76–88. 10.1016/j.neubiorev.2015.03.001. [DOI] [PubMed] [Google Scholar]
- 15.Xiao D, Zikopoulos B, and Barbas H. (2009). Laminar and modular organization of prefrontal projections to multiple thalamic nuclei. Neuroscience 161, 1067–1081. 10.1016/j.neuroscience.2009.04.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Phillips JM, Kambi NA, Redinbaugh MJ, Mohanta S, and Saalmann YB (2021). Disentangling the influences of multiple thalamic nuclei on prefrontal cortex and cognitive control. Neurosci. Biobehav. Rev 128, 487–510. 10.1016/j.neubiorev.2021.06.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Wolff M, and Halassa MM (2024). The mediodorsal thalamus in executive control. Neuron 112, 893–908. 10.1016/j.neuron.2024.01.002. [DOI] [PubMed] [Google Scholar]
- 18.Mukherjee A, Lam NH, Wimmer RD, and Halassa MM (2021). Thalamic circuits for independent control of prefrontal signal and noise. Nature 600, 100–104. 10.1038/s41586-021-04056-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Crail-Melendez D, Atriano-Mendieta C, Carrillo-Meza R, and Ramirez-Bermudez J. (2013). Schizophrenia-like psychosis associated with right lacunar thalamic infarct. Neurocase 19, 22–26. 10.1080/13554794.2011.654211. [DOI] [PubMed] [Google Scholar]
- 20.Rudebeck PH, Behrens TE, Kennerley SW, Baxter MG, Buckley MJ, Walton ME, and Rushworth MFS (2008). Frontal Cortex Subregions Play Distinct Roles in Choices between Actions and Stimuli. J. Neurosci 28, 13775–13785. 10.1523/JNEUROSCI.3541-08.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Walton ME, Behrens TEJ, Buckley MJ, Rudebeck PH, and Rushworth MFS (2010). Separable learning systems in the macaque brain and the role of orbitofrontal cortex in contingent learning. Neuron 65, 927–939. 10.1016/j.neuron.2010.02.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Rudebeck PH, Saunders RC, Lundgren DA, and Murray EA (2017). Specialized Representations of Value in the Orbital and Ventrolateral Prefrontal Cortex: Desirability versus Availability of Outcomes. Neuron 95, 1208–1220.e5. 10.1016/j.neuron.2017.07.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Walther S, Lefebvre S, Conring F, Gangl N, Nadesalingam N, Alexaki D, Wüthrich F, Rüter M, Viher PV, Federspiel A, et al. (2022). Limbic links to paranoia: increased resting-state functional connectivity between amygdala, hippocampus and orbitofrontal cortex in schizophrenia patients with paranoia. Eur. Arch. Psychiatry Clin. Neurosci 272, 1021–1032. 10.1007/s00406-021-01337-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Barnby JM, Mehta MA, and Moutoussis M. (2022). The computational relationship between reinforcement learning, social inference, and paranoia. PLoS Comput. Biol 18, e1010326. 10.1371/journal.pcbi.1010326. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Perry BAL, Lomi E, and Mitchell AS (2021). Thalamocortical interactions in cognition and disease: The mediodorsal and anterior thalamic nuclei. Neurosci. Biobehav. Rev 130, 162–177. 10.1016/j.neubiorev.2021.05.032. [DOI] [PubMed] [Google Scholar]
- 26.Ouhaz Z, Fleming H, and Mitchell AS (2018). Cognitive Functions and Neurodevelopmental Disorders Involving the Prefrontal Cortex and Mediodorsal Thalamus. Front. Neurosci 12, 33. 10.3389/fnins.2018.00033. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Corlett PR, and Schoenbaum G. (2021). Leveraging Basic Science for the Clinic-From Bench to Bedside. JAMA Psychiatr. 78, 331–334. 10.1001/jamapsychiatry.2020.3656. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Tervo DGR, Kuleshova E, Manakov M, Proskurin M, Karlsson M, Lustig A, Behnam R, and Karpova AY (2021). The anterior cingulate cortex directs exploration of alternative strategies. Neuron 109, 1876–1887.e6. 10.1016/j.neuron.2021.03.028. [DOI] [PubMed] [Google Scholar]
- 29.Mitchell AS, Baxter MG, and Gaffan D. (2007). Dissociable Performance on Scene Learning and Strategy Implementation after Lesions to Magnocellular Mediodorsal Thalamic Nucleus. J. Neurosci 27, 11888–11895. 10.1523/JNEUROSCI.1835-07.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Loh A, Germann J, Qazi S, Husain R, Boutet A, Lozano AM, and Mansouri A. (2022). Lesion network mapping of ectopic craniopharyngioma identifies potential cause of psychosis: a case report. Acta Neurochir. 164, 3285–3289. 10.1007/s00701-022-05355-y. [DOI] [PubMed] [Google Scholar]
- 31.Delgado MG, and Bogousslavsky J (2013). “Distorteidolias” - fantastic perceptive distortion. A new, pure dorsomedial thalamic syndrome. Eur. Neurol 70, 6–9. 10.1159/000348361. [DOI] [PubMed] [Google Scholar]
- 32.Bocchetta M, Iglesias JE, Neason M, Cash DM, Warren JD, and Rohrer JD (2020). Thalamic nuclei in frontotemporal dementia: Medio-dorsal nucleus involvement is universal but pulvinar atrophy is unique to C9orf72. Hum. Brain Mapp 41, 1006–1016. 10.1002/hbm.24856. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.LandqvistWaldö M, Gustafson L, Passant U, and Englund E(2015). Psychotic symptoms in frontotemporal dementia: a diagnostic dilemma? Int. Psychogeriatr 27, 531–539. 10.1017/S1041610214002580. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Donaldson TN, Barto D, Bird CW, Magcalas CM, Rodriguez CI, Fink BC, and Hamilton DA (2018). Social order: Using the sequential structure of social interaction to discriminate abnormal social behavior in the rat. Learn. Motiv 61, 41–51. 10.1016/j.lmot.2017.03.003. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Kondrakiewicz K, Kostecki M, Szadzihska W, and Knapska E (2019). Ecological validity of social interaction tests in rats and mice. Genes Brain Behav. 18, e12525. 10.1111/gbb.12525. [DOI] [PubMed] [Google Scholar]
- 36.Urbach YK, Bode FJ, Nguyen HP, Riess O, and von Hörsten S (2010). Neurobehavioral Tests in Rat Models of Degenerative Brain Diseases. In Rat Genomics: Methods and Protocols Methods in Molecular Biology, Anegon I, ed. (Humana Press; ), pp. 333–356. 10.1007/978-1-60327-389-3_24. [DOI] [PubMed] [Google Scholar]
- 37.Suthaharan P, and Corlett PR (2023). Assumed shared belief about conspiracy theories in social networks protects paranoid individuals against distress. Sci. Rep 13, 6084. 10.1038/s41598-023-33305-w. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Barnby JM, Dayan P, and Bell V (2023). Formalising social representation to explain psychiatric symptoms. Trends Cogn. Sci 27, 317–332. 10.1016/j.tics.2022.12.004. [DOI] [PubMed] [Google Scholar]
- 39.Hayashi T, Akikawa R, Kawasaki K, Egawa J, Minamimoto T, Kobayashi K, Kato S, Hori Y, Nagai Y, Iijima A, et al. (2020). Macaques Exhibit Implicit Gaze Bias Anticipating Others’ False-Belief-Driven Actions via Medial Prefrontal Cortex. Cell Rep. 30, 4433–4444.e5. 10.1016/j.celrep.2020.03.013. [DOI] [PubMed] [Google Scholar]
- 40.Krupenye C, Kano F, Hirata S, Call J, and Tomasello M (2016). Great apes anticipate that other individuals will act according to false beliefs. Science 354, 110–114. 10.1126/science.aaf8110. [DOI] [PubMed] [Google Scholar]
- 41.Schafroth JL, Basile BM, Martin A, and Murray EA (2021). No evidence that monkeys attribute mental states to animated shapes in the Heider–Simmel videos. Sci. Rep 11, 3050. 10.1038/s41598-021-82702-6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Devaine M, San-Galli A, Trapanese C, Bardino G, Hano C, Saint Jalme M, Bouret S, Masi S, and Daunizeau J (2017). Reading wild minds: A computational assay of Theory of Mind sophistication across seven primate species. PLoS Comput. Biol 13, e1005833. 10.1371/journal.pcbi.1005833. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Schurz M, Radua J, Aichhorn M, Richlan F, and Perner J (2014). Fractionating theory of mind: A meta-analysis of functional brain imaging studies. Neurosci. Biobehav. Rev 42, 9–34. 10.1016/j.neubiorev.2014.01.009. [DOI] [PubMed] [Google Scholar]
- 44.Pergola G, Danet L, Pitel A-L, Carlesimo GA, Segobin S, Pariente J, Suchan B, Mitchell AS, and Barbeau EJ (2018). The Regulatory Role of the Human Mediodorsal Thalamus. Trends Cogn. Sci 22, 1011–1025. 10.1016/j.tics.2018.08.006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Chen LL, and Wise SP (1996). Evolution of Directional Preferences in the Supplementary Eye Field during Acquisition of Conditional Oculomotor Associations. J. Neurosci 16, 3067–3081. 10.1523/JNEUROSCI.16-09-03067.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Jamali M, Grannan BL, Fedorenko E, Saxe R, Báez-Mendoza R, and Williams ZM (2021). Single-neuronal predictions of others’ beliefs in humans. Nature 591, 610–614. 10.1038/s41586-021-03184-0. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Corlett PR, Mollick JA, and Kober H (2022). Meta-analysis of human prediction error for incentives, perception, cognition, and action. Neuropsychopharmacology 47, 1339–1349. 10.1038/s41386-021-01264-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Mahmoodi A, Harbison C, Bongioanni A, Emberton A, Roumazeilles L, Sallet J, Khalighinejad N, and Rushworth MFS (2024).A frontopolar-temporal circuit determines the impact of social information in macaque decision making. Neuron 112, 84–92.e6. 10.1016/j.neuron.2023.09.035. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Berkay D, and Jenkins AC (2023). A Role for Uncertainty in the Neural Distinction Between Social and Nonsocial Thought. Perspect. Psychol. Sci 18, 491–502. 10.1177/17456916221112077. [DOI] [PubMed] [Google Scholar]
- 50.Cools R, Clark L, Owen AM, and Robbins TW (2002). Defining the Neural Mechanisms of Probabilistic Reversal Learning Using Event-Related Functional Magnetic Resonance Imaging. J. Neurosci 22, 4563–4567. 10.1523/JNEUROSCI.22-11-04563.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Lockwood PL, Apps MAJ, and Chang SWC (2020). Is There a ‘Social’ Brain? Implementations and Algorithms. Trends Cogn. Sci 24, 802–813. 10.1016/j.tics.2020.06.011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Sullivan D, Landau MJ, and Rothschild ZK (2010). An existential function of enemyship: evidence that people attribute influence to personal and political enemies to compensate for threats to control. J. Pers. Soc. Psychol 98, 434–449. 10.1037/a0017457. [DOI] [PubMed] [Google Scholar]
- 53.Premkumar P, Fannon D, Sapara A, Peters ER, Anilkumar AP, Simmons A, Kuipers E, and Kumari V (2015). Orbitofrontal cortex, emotional decision-making and response to cognitive behavioural therapy for psychosis. Psychiatry Res. 231, 298–307. 10.1016/j.pscychresns.2015.01.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Parnaudeau S, Bolkan SS, and Kellendonk C (2018). The Mediodorsal Thalamus: An Essential Partner of the Prefrontal Cortex for Cognition. Biol. Psychiatry 83, 648–656. 10.1016/j.biopsych.2017.11.008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Groman SM, Keistler C, Keip AJ, Hammarlund E, DiLeone RJ, Pittenger C, Lee D, and Taylor JR (2019). Orbitofrontal Circuits Control Multiple Reinforcement-Learning Processes. Neuron 103, 734–746.e3. 10.1016/j.neuron.2019.05.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Sheffield JM, Suthaharan P, Leptourgos P, and Corlett PR (2022). Belief Updating and Paranoia in Individuals With Schizophrenia. Biol. Psychiatry. Cogn. Neurosci. Neuroimaging 7, 1149–1157. 10.1016/j.bpsc.2022.03.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Frässle S, Aponte EA, Bollmann S, Brodersen KH, Do CT, Harrison OK, Harrison SJ, Heinzle J, Iglesias S, Kasper L, et al. (2021). TAPAS: An Open-Source Software Package for Translational Neuromodeling and Computational Psychiatry. Front. Psychiatry 12, 680811. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Ilinsky IA, and Kultas-Ilinsky K (1987). Sagittal cytoarchitectonic maps of theMacaca mulatta thalamus with a revised nomenclature of the motor-related nuclei validated by observations on their connectivity. J. Comp. Neurol 262, 331–364. 10.1002/cne.902620303. [DOI] [PubMed] [Google Scholar]
- 59.Browning PGF, Chakraborty S, and Mitchell AS (2015). Evidence for Mediodorsal Thalamus and Prefrontal Cortex Interactions during Cognition in Macaques. Cereb. Cortex 25, 4519–4534. 10.1093/cercor/bhv093. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Freeman D, Loe BS, Kingdon D, Startup H, Molodynski A, Rosebrock L, Brown P, Sheaves B, Waite F, and Bird JC (2021). The revised Green et al. , Paranoid Thoughts Scale (R-GPTS): psychometric properties, severity ranges, and clinical cut-offs. Psychol. Med 51, 244–253. 10.1017/S0033291719003155. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Beck AT, Steer RA, and Brown G (1996). Beck Depression Inventory–II. 10.1037/t00742-000. [DOI] [Google Scholar]
- 62.Noonan MP, Walton ME, Behrens TEJ, Sallet J, Buckley MJ, and Rushworth MFS (2010). Separate value comparison and learning mechanisms in macaque medial and lateral orbitofrontal cortex. Proc. Natl. Acad. Sci 107, 20547–20552. 10.1073/pnas.1012246107. [DOI] [PMC free article] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Supplementary Materials
Data Availability Statement
Monkey behavioral data presented in this paper will be available upon request. Human behavioral data from the single-reversal PRL task24 and the multi-reversal with contingency shift PRL task data4 are publically available. The analysis scripts for this paper can be found at https://github.com/psuthaharan/belief-update-monkeys.