Abstract
Anterior cingulate cortex (ACC) and striatum (STR) contain neurons encoding not only the expected values of actions, but also the value of stimulus features irrespective of actions. Feature-value signals in ACC or STR might contribute to adaptive behavior by guiding fixational information sampling and biasing choices toward relevant objects, but they might also serve an indirect motivational function by enabling subjects to estimate the value of putting effort into choosing objects. Here, we tested these possibilities by modulating neuronal activity in ACC and STR of nonhuman primates using transcranial ultrasound stimulation while subjects learned the relevance of objects in situations with varying motivational and cognitive demands. Motivational demand was varied through the gains and losses experienced during learning, while cognitive demand was varied by increasing the uncertainty about which object features could be relevant during learning. We found that ultrasound stimulation of the ACC, but not the STR, reduced learning efficiency and prolonged information sampling when the task required averting losses and motivational demands were high. The reduction in learning efficiency was particularly evident at higher cognitive demands and when subjects experienced the loss of already attained tokens. These results suggest that the ACC supports flexible learning of feature values when loss experiences impose a motivational challenge and when uncertainty about the relevance of objects is high. Taken together, these findings provide causal evidence that the ACC facilitates resource allocation and improves visual information sampling during adaptive behavior.
Introduction
It is well established that the anterior cingulate cortex (ACC) and the anterior striatum contribute to flexible learning [1–4]. Widespread lesions of either structure can lead to nonadaptive behavior. When tasks require subjects to adjust choice strategies, lesions in the ACC cause subjects to shift away from a rewarding strategy even after having obtained reward for a choice [5] and reduce the ability to use error feedback to improve behavior [6–8]. Lesions in the striatum likewise reduce the ability to use negative error feedback to adjust choice strategies, which leads to perseveration of non-rewarded choices [9], or inconsistent switching to alternative options after errors [10]. A common interpretation of these lesion effects is that both structures are necessary for integrating the outcome of recent choices to update the expected values of possible choice options. According to this view, the ACC and striatum keep track of reward outcomes of available choice options in a given task environment.
However, it has remained elusive what type of reward outcome information is tracked in these structures and whether outcome information in the ACC or the striatum affects learning even when it is not associated with specific actions. In many learning tasks used to study ACC or striatum functions, subjects learned to associate reward outcomes with the direction of a saccadic eye movement or the direction of a manual movement [11–14]. Succeeding in these tasks requires computing probabilities of action-reward associations. However, recent studies have documented neurons in ACC and striatum that tracked not only action-reward association probabilities, but also the expected reward value of specific features of chosen objects [11,15–19]. Neuronal information about the value of object features emerged slightly earlier in ACC than in striatum [15,16,19] and rapidly synchronized between both structures, suggesting that this information is available across the frontostriatal network at similar times [20].
These findings suggest that the ACC or striatum may be functionally important for learning the expected values of objects’ specific visual features and thereby mediate information-seeking behavior and visual attention [17,21–25]. There are at least 3 possible ways in which feature-specific value information in these areas may support flexible learning. A first possibility is that either of these brain areas uses feature-specific value information to assign credit for reward outcomes to goal-relevant object features. Such a credit assignment process is necessary during learning to reduce uncertainty about which features are most relevant in a given environment. Support for this suggestion comes from studies reporting neurons in ACC that show stronger encoding of task variables in situations with higher uncertainty [26], respond to cues reducing uncertainty about outcomes [27], and form subpopulations encoding uncertain outcomes [16]. These prior studies predict that ACC or striatum will be important for learning values of object features when there is high uncertainty about the reward value of object features.
A second possibility is that the ACC or striatum uses feature-specific value information to determine whether subjects should continue with a current choice strategy or switch to a new strategy [28,29]. According to this framework, the ACC’s major role is to compute, track, and compare an ongoing choice value for continuing with similar choices versus switching to alternative choice options. This view predicts that when errors accumulate during learning, the estimated choice value for continuing similar choices is reduced relative to alternatives, and ACC or striatum will activate to either directly modify choice behavior [28,29] or indirectly affect choices by guiding attention and information sampling away from recently chosen objects and toward other, potentially more rewarding objects [25].
A third possible route for value information in ACC and striatum to affect behavior assumes that information about the value of object features is not used to compute a choice value, but rather a motivational value that indicates whether it is worthwhile for the subject to put continued effort into finding the most valuable object in a given environment [4,30]. According to such an effort-control framework, value signals across the ACC-striatum axis are used to compute the value of controlling task performance [31]. Similar to the choice-value framework, this motivational value framework predicts that ACC and striatum should become more important for learning when the number of conflicting features increases. While in the choice-value framework an increase in conflicting features will evoke more ACC activity by increasing the number of comparisons between an ongoing choice value and the value of the other available options, in the effort-control framework more feature conflict with the same motivational payoff requires more ACC activity to decide whether it is worthwhile to put effort into the task [29,30]. Key motivational factors determining whether subjects put more effort into learning even difficult problems are the amount of reward that can be gained or the amount of punishment or loss that can be avoided by putting effort into the task. Support for an important role of the ACC in mediating how incentives and disincentives affect learning comes from studies showing that ACC neurons respond vigorously to negative outcomes such as errors [32], respond to negatively valenced stimuli or events irrespective of error outcomes [33–35], and fire more strongly when the subject anticipates aversive events [27]. These insights suggest that the ACC, and possibly the striatum, will be relevant for learning particularly when it involves averting punishment or loss.
Here, we set up an experiment to test the relative cognitive and motivational contributions of the ACC and striatum to the flexible learning of feature values. We adopted a transcranial ultrasound stimulation (TUS) protocol to temporarily modulate neuronal activity in the ACC or the striatum of rhesus monkeys while they learned values of object features through trial-and-error. The task independently varied cognitive load, by changing the number of unrewarded features of the objects, and motivational context, by changing the amount of gains or losses subjects could receive for successful and erroneous task performance (Fig 1A and 1B). Subjects learned a feature-reward rule by choosing 1 of 3 objects that varied in features of only 1 dimension (low cognitive load, e.g., different shapes), or in features of 2 or 3 dimensions (high cognitive load, e.g., varying shapes, surface patterns, and arm types) (Fig 1D). Independent of cognitive load, we varied how motivationally challenging task performance was by altering whether the learning context was a pure gain-only context or a mixed gain-loss context. In the gain-only contexts, subjects received 3 tokens for correct choices, while in the gain-loss contexts, subjects received 2 tokens for correct choices and lost 1 already attained token when choosing objects with non-rewarded features (Fig 1A–1C). Such a loss experience has been reported in previous studies to impose a motivational conflict [36], inferred also from vigilance responses triggered by experiencing losses [37]. The task required monkeys to collect 5 visual tokens before the tokens were cashed out for fluid rewards.
With this design, we found that sonication of the ACC, but not the anterior striatum, with TUS led to a learning deficit when subjects experienced losses. This loss-triggered deficit was accompanied by inefficient information sampling and was most pronounced at high cognitive load.
Results
We applied transcranial focused ultrasound (TUS) to modulate neural activity in the ACC (area 24) or the anterior striatum (STR, head of the caudate nucleus) of 2 monkeys in separate learning sessions, adopting the same TUS protocol as in [38,39]. The sonication protocol imposed an approximately 6-mm wide/40-mm tall sonication region that has previously been shown to alter behavior in foraging tasks [40], to reduce functional connectivity of the sonicated area in macaques [38], and, in in vitro preparations, to modulate neuronal excitability to external inputs [41]. We provide detailed acoustic simulations of the ultrasound pressure dispersion around the target brain areas, the anatomical sub-millimeter targeting precision of TUS, and the validation of the applied ultrasound power through real-time power monitoring during the experiment in S3 Fig. We bilaterally sonicated or sham-sonicated the ACC or the STR in individual sessions immediately after monkeys had completed the first 6 learning blocks. We performed 12 experimental sessions for each TUS or sham condition in each of the 2 brain areas, ACC or STR (a total of 48 sessions), with each of the 2 monkeys. Following the sonication procedure, monkeys resumed the task and proceeded on average for 23.6 (±4 SE) learning blocks (monkey W: 20.5 ± 4; monkey I: 26.5 ± 4) (Fig 1E).
Across learning blocks, monkeys reached the learning criterion (≥80% correct choices over 12 trials) systematically later in blocks with high cognitive load (linear mixed effects (LME) model, main effect of cognitive load, p < 0.001; Figs 1F and S1A–S1C). Both monkeys also showed longer foveation durations on the objects prior to making a choice when the cognitive load was high (LME main effect of cognitive load, p < 0.001; Figs 1G and S1D and S1E). Longer foveation durations index more extensive information sampling of object features at higher cognitive load. We defined information sampling as the duration for which monkeys fixated objects prior to the last fixation of a trial, which was the fixation used to choose an object. This metric indexes how long information about the feature values of the objects was processed before committing to a choice. Monkeys also increased information sampling, learned significantly more slowly, and showed reduced plateau performance (S2A–S2C Fig) in blocks with gains and losses (gain-loss contexts) compared to blocks with only gains (gain-only contexts) (Figs 1G and S2D and S2E; LME main effect of motivational context, p < 0.001).
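As a concrete illustration of the learning-speed metric, the sketch below (a minimal Python illustration, not the authors' analysis code; variable names are ours) computes the trial at which a block reaches the criterion of at least 80% correct choices over the preceding 12 trials.

```python
import numpy as np

def trials_to_criterion(correct, window=12, criterion=0.8):
    """First trial (1-based) at which the proportion of correct choices over the
    preceding `window` trials reaches `criterion`; None if never reached."""
    correct = np.asarray(correct, dtype=float)
    for t in range(window, len(correct) + 1):
        if correct[t - window:t].mean() >= criterion:
            return t
    return None

# Hypothetical block: performance rises from chance (1 of 3 objects) toward ~95%.
rng = np.random.default_rng(0)
block = (rng.random(40) < np.linspace(0.33, 0.95, 40)).astype(int)
print(trials_to_criterion(block))
```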
TUS in the ACC (ACC-TUS), but not sham-TUS in the ACC (ACC-Sham), TUS in the striatum (STR-TUS), or sham-TUS in the striatum (STR-Sham) (Fig 2A), changed this behavioral pattern. TUS condition showed a significant interaction with motivational context, with ACC-TUS selectively slowing learning in the gain-loss contexts compared to the gain-only contexts (LME interaction of TUS condition and motivational context, t = 2.67, p = 0.007) (Figs 2B and S4A and S4B and S1 Table). ACC-TUS increased the number of trials needed to reach the learning criterion of 80% performance to 14.7 ± 0.8 trials (monkey W/I: 14.3 ± 1.2/15 ± 1.2) relative to the pre-TUS baseline (trials to criterion: 10.7 ± 1.4; monkey W/I: 9.9 ± 2.3/11.4 ± 1.8) (Wilcoxon test, p = 0.049; Figs 2B and S4C and S2D). Learning with ACC-TUS in the gain-loss context was significantly slower than in the other TUS conditions (Kruskal–Wallis test, p = 0.003), both compared to ACC-Sham (pairwise Wilcoxon test, FDR multiple comparison corrected for dependent samples, p = 0.019) and compared to STR-TUS and STR-Sham (pairwise Wilcoxon tests, FDR multiple comparison corrected for dependent samples, p = 0.019 and p = 0.003) (Fig 2B). This effect interacted with cognitive load. The slower learning after ACC-TUS in the gain-loss condition was stronger when cognitive load was intermediate or high, i.e., in conditions with 2 or 3 distracting feature dimensions (random permutation, p < 0.05; LME 3-way interaction of TUS condition, cognitive load, and motivational context, t = −2.8, p = 0.004) (Figs 3C and 3D and S5A and S2 Table). TUS did not affect learning in the gain-only contexts even at high cognitive load (Kruskal–Wallis test, p = 0.933) (Figs 3C and 3D and S5B).
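The kind of nonparametric comparison reported above can be sketched as follows (a minimal illustration with simulated values; it assumes unpaired rank-sum tests and Benjamini–Yekutieli FDR correction, whereas the authors' exact pairing and implementation may differ).

```python
import numpy as np
from scipy.stats import kruskal, ranksums
from statsmodels.stats.multitest import multipletests

# Simulated trials-to-criterion per block for each TUS condition (hypothetical values).
rng = np.random.default_rng(1)
groups = {
    "ACC-TUS":  rng.normal(14.7, 3.0, 40),
    "ACC-Sham": rng.normal(10.5, 3.0, 40),
    "STR-TUS":  rng.normal(10.8, 3.0, 40),
    "STR-Sham": rng.normal(10.2, 3.0, 40),
}

H, p_omnibus = kruskal(*groups.values())          # omnibus test across the four conditions

# Pairwise comparisons of ACC-TUS against the control conditions, FDR-corrected
# with Benjamini-Yekutieli (valid under dependence).
controls = ["ACC-Sham", "STR-TUS", "STR-Sham"]
pvals = [ranksums(groups["ACC-TUS"], groups[c]).pvalue for c in controls]
reject, p_corrected, _, _ = multipletests(pvals, method="fdr_by")
print(p_omnibus, dict(zip(controls, p_corrected)))
```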
TUS condition showed a significant interaction with motivational context on information sampling (LME, t = −4.03, p < 0.001). The slower learning in the gain-loss context after ACC-TUS was accompanied by prolonged information sampling compared to the pre-TUS baseline (ACC-TUS information sampling: 234 ± 6 ms, monkey W/I: 230 ± 6/237 ± 7 ms; pre-TUS information sampling: 209 ± 11 ms, monkey W/I: 197 ± 16/221 ± 11 ms; Wilcoxon test, p = 0.016) and compared to STR-TUS (Kruskal–Wallis test, p = 0.036; pairwise Wilcoxon test, FDR multiple comparison corrected for dependent samples, p = 0.03) (Figs 3A and S6A and S6C). Fixational information sampling in the gain-only context did not vary between TUS conditions (Kruskal–Wallis test, p = 0.55) (Figs 3B and S6B and S6D; for detailed information about the distribution of fixational information sampling and its bootstrap sampling distributions for different motivational contexts, cognitive loads, and TUS conditions, see S7 Fig). Once subjects reached the learning criterion in a learning context, they could exploit the learned feature rule until the block changed to a new feature rule after approximately 30 to 55 trials. During this period, they showed overall high plateau performance, which was significantly lower in the gain-loss contexts (87% ± 0.07) than in the gain-only contexts (90% ± 0.05; LME main effect of motivational context, t = −3.95, p = 0.001) (S2C Fig). ACC-TUS exacerbated this performance drop in the gain-loss blocks, leading to significantly lower plateau accuracy compared to the pre-TUS baseline condition and compared to the other TUS conditions in the gain-loss learning context (LME interaction of TUS condition, motivational context, and pre/post sonication, t = −2.05, p = 0.04; Figs 3C and S6E), but not in the gain-only learning context (Kruskal–Wallis test, p = 0.8) (Figs 3D and S6F). The mean plateau accuracy was reduced to 84.5% ± 1.4 (monkey W/I: 85% ± 1.4/84% ± 1.4) relative to the pre-TUS baseline (plateau accuracy: 88 ± 1.8; monkey W/I: 88 ± 2/88 ± 1.6). We confirmed these results using a second metric to estimate learning and plateau performance by fitting logistic general linear models (GLMs) to performance accuracy in each block (all previously significant results for trials-to-criterion and plateau accuracy remained significant when instead comparing inflection point and asymptote, S8 Fig).
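The second learning metric mentioned above can be illustrated with a simple curve fit (a sketch only: the authors fit logistic GLMs per block, whereas here a three-parameter logistic learning curve is fit by least squares to recover an inflection point and asymptote from simulated data).

```python
import numpy as np
from scipy.optimize import curve_fit

def learning_curve(trial, asymptote, inflection, slope):
    """Logistic curve rising from chance (1/3 with three objects) to `asymptote`."""
    chance = 1.0 / 3.0
    return chance + (asymptote - chance) / (1.0 + np.exp(-slope * (trial - inflection)))

# Hypothetical block of 45 binary outcomes drawn from an underlying learning curve.
rng = np.random.default_rng(2)
trials = np.arange(1, 46)
p_true = learning_curve(trials, 0.90, 12.0, 0.5)
correct = (rng.random(trials.size) < p_true).astype(float)

params, _ = curve_fit(learning_curve, trials, correct,
                      p0=[0.8, 10.0, 0.3],
                      bounds=([1/3, 1.0, 0.01], [1.0, 45.0, 5.0]))
asymptote, inflection, slope = params
print(f"asymptote = {asymptote:.2f}, inflection point = {inflection:.1f} trials")
```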
We validated that the behavioral impairments emerged shortly after the sonication and lasted until the end of the ≤120-min long session (S9 Fig). We also confirmed that the observed behavioral effects were not only evident when considering individual blocks (the block-level analysis) (S1 and S2 Figs), but also when averaging the learning performance across blocks per session and applying session-level statistics (S10 Fig and S1 Table).
So far, we found that ACC-TUS slowed learning, increased information sampling durations, and reduced plateau accuracy in the gain-loss contexts. These results could be due to motivational difficulties in adjusting to the experience of losing an already attained token. To analyze this adjustment to losses, we examined performance in trials following choices that led to losses. We found that experiencing a loss led to overall poorer performance in subsequent trials after ACC-TUS, but not after ACC-Sham, STR-TUS, or STR-Sham (random permutation test, p < 0.05, Figs 3E and S11A–S11D). Importantly, this overall performance decrement depended on the recent history of losses. ACC-TUS reduced performance accuracy on trials in the gain-loss context specifically when subjects had lost 2 or 3 tokens in the preceding 4 trials, but not when their net token gain in the past 4 trials was ≥0 tokens, or for trials in the gain-only context (random permutation test, p < 0.05, Figs 3F and S11). This dependence of the ACC-TUS effect on the recent gross token income (GTI) was evident in both monkeys (S11E Fig) and could be a main reason for the slower learning.
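The loss-history analysis described here can be sketched as follows (illustration only; function and variable names are ours, not the authors' code): each trial's accuracy is grouped by the summed token income over the preceding 4 trials.

```python
import numpy as np

def accuracy_by_token_income(tokens_per_trial, correct, window=4):
    """Group each trial's accuracy by the summed token income over the preceding
    `window` trials (gains of 2-3 tokens per correct choice, -1 per loss trial)."""
    tokens = np.asarray(tokens_per_trial, dtype=float)
    correct = np.asarray(correct, dtype=float)
    grouped = {}
    for t in range(window, len(correct)):
        income = int(tokens[t - window:t].sum())
        grouped.setdefault(income, []).append(correct[t])
    return {income: float(np.mean(acc)) for income, acc in sorted(grouped.items())}

# Hypothetical gain-loss block: +2 tokens for correct choices, -1 for incorrect ones.
rng = np.random.default_rng(3)
correct = (rng.random(60) < 0.75).astype(int)
tokens = np.where(correct == 1, 2, -1)
print(accuracy_by_token_income(tokens, correct))
```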
Discussion
We found that sonicating the ACC, but not the anterior striatum (head of the caudate), slowed down learning, prolonged visual information sampling of objects prior to choosing an object, and reduced overall performance when learning took place in a context with gains and losses and with high cognitive load. These behavioral impairments were specific to the ACC-TUS condition when comparing behavior to baseline performance prior to TUS and to sessions with sham-controlled TUS and STR-TUS. Moreover, the changes in behavioral adjustment were found when comparing average performance between individual sessions (S10 Fig), in performance variations across blocks of different sessions (Fig 2), and, at the trial-by-trial level, in impaired behavior when token losses accumulated over trials (Fig 3).
Taken together, these findings provide evidence that the primate ACC supports the guidance of attention and information sampling in contexts that are motivationally challenging and cognitively demanding. Prior to discussing the implications of these findings, it should be noted that the effects of sonication on neuronal activity in vivo are not well investigated. The sonication protocol we applied may involve changes in the excitability of neural tissue within the hot spot of the sonicated area [41] or effects on fibers of passage through the sonicated region. Our discussion assumes a putative disruptive effect of TUS on neural activity within the ACC. This assumption is based on a prior study that used the same TUS protocol and reported reduced functional connectivity of the sonicated brain area [38]. A further caveat is that the sonication effects could arise indirectly from altered activity in other areas, via modulation of their connectivity with the sonicated region.
Extending existing functional accounts of ACC functions
Our main findings suggest extensions to theoretical accounts of the overarching function of the ACC for adaptive behavior. First, the finding of prolonged information sampling after ACC-TUS supports recent electrophysiological and imaging studies showing that activity in the ACC predicts information sampling of visual objects and attention to maximize reward outcomes [17,18,25,42]. However, we found a functional deficit of information sampling only in the gain-loss learning context in which subjects expected lower payoff, suggesting that the ACC contributes to information sampling, particularly in motivationally challenging conditions.
Secondly, our core finding of compromised learning in motivationally challenging conditions partly supports and extends the view that the ACC is essential for controlling effort [30]. The functional impairment in the gain-loss motivational context was most pronounced when subjects faced high cognitive load, i.e., in the condition that was overall most difficult. While this result pattern suggests that overall difficulty may be the primary driver of the ACC-TUS effects, we outline below that the specific impairments after loss experiences (irrespective of cognitive load) and the overall reduced plateau performance with ACC-TUS in the gain-loss context indicate a predominant role of motivational processes over cognitive processes in driving the behavioral ACC-TUS effects. At a psychological level, the pronounced deficit in the loss context at high cognitive load is consistent with a neuroeconomic view of ACC function based on prospect theory, which suggests that losses induce a particularly strong internal demand for adjusting actions, a demand that is intensified when the adjustment of the action itself becomes more demanding through an increase in cognitive load [37,43,44]. A particular role of the ACC in mediating the motivational consequences of negative loss experiences is consistent with a wealth of studies on affective processing, mostly from human imaging, that show systematic ACC activation in the face of aversive or threatening experiences [33,34,45].
Our finding that experiencing losses was necessary to observe behavioral deficits from ACC-TUS shows that the presence of a cognitive conflict due to an increased number of features in the high cognitive load condition was not sufficient to alter performance. This result might seem at odds with literature implicating the ACC as particularly relevant for resolving conflict [46,47]. However, increasing cognitive load in our task entailed increased perceptual interference from object features; it did not entail a conflict of sensory-motor mappings, which is the hallmark of the conflict-inducing flanker, Stroop, or Simon tasks used to document a role of the ACC in mediating the resolution of conflict [26,46,48]. Therefore, our results do not oppose studies using those paradigms, which gave rise to the view that neuronal signaling in ACC contributes to resolving sensorimotor conflicts [2]. Our results instead point to the importance of the ACC in overcoming situations in which motivational challenges require enhanced encoding of task-relevant variables.
Taken together, our observed result pattern supports theories suggesting the ACC contributes to information sampling as well as to controlling motivational effort during adaptive behavior. The results suggest that these functions of the ACC are recruited when the task requires overcoming motivational challenges from anticipating losses and when learning takes place in a cognitively demanding situation. We discuss the specific rationale for this conclusion in the following.
ACC modulates the efficiency of information sampling
We found that ACC-TUS increased the duration of fixating objects prior to choosing an object in the loss contexts. Longer fixation durations are typically considered to reflect longer sampling of information from the foveated object. This has been inferred from the longer fixational sampling of objects that are associated with higher outcome uncertainty [49] or that explicitly carry information that reduces uncertainty about outcomes [50,51]. We, therefore, consider fixational sampling of visual objects to index information sampling. Consistent with this view, we found that ACC-TUS prolonged information sampling at high cognitive load, when objects varied in more features from trial to trial and hence carried higher uncertainty about which feature was linked to reward (Figs 2D and 2E and S7). These results suggest that TUS might disrupt or modulate the activity of neuronal circuitries in the ACC that would otherwise have responded to the demand for information about the feature values by controlling the duration of gaze fixations on available objects. Such activity has been documented in the ACC [16–19]. These studies found that neurons in the ACC encoded the pre-learned value of objects when they were fixated [18] or peripherally attended [16,19]. Moreover, subpopulations of ACC neurons also encode the value of objects that are not yet fixated but are possible future targets for fixations [17]. Disrupting these signals with TUS in our study is a plausible cause of the uncertainty about the value of objects and of the prolonged information sampling durations.
In our study, ACC-TUS prolonged information sampling most prominently in the gain-loss context, when subjects were uncertain not only about the possible gains following correct choices, but additionally about the possible losses following incorrect choices (Fig 3D and 3E). This is consistent with recent findings of neurons in the intact ACC that fire more strongly when subjects fixate cues that reduce uncertainty about anticipated losses [27]. In that study, trial-by-trial variations of ACC activity during the fixation of punishment-predicting cues correlated with trial-by-trial variations in seeking information about how aversive a pending outcome would be [27]. In our study, disrupting this type of activity with transcranial ultrasound may thus have reduced the efficacy of acquiring information about the loss association of features while fixating objects. According to this interpretation, the TUS-induced prolongation of fixation durations indicates that an intact ACC critically enhances the efficiency of visual information sampling to reduce uncertainty about aversive outcomes.
ACC mediates feature-specific credit assignment for aversive outcomes
The altered information sampling after ACC-TUS was only evident in the gain-loss context and depended on the experience of losing tokens. We found that losing 1, 2, or 3 tokens significantly impaired performance after ACC-TUS compared to sham or striatum sonication (Fig 3F). This valence-specific finding might be linked to the relevance of the ACC for processing negative events and adjusting behavior after negative experiences. Imaging studies have consistently shown ACC activation in response to negatively valenced stimuli or events [33–35]. Behaviorally, the experience of loss outcomes is known to trigger autonomic vigilance responses that reorient attention and increase explorative behaviors [37,52,53]. Such loss-induced exploration can help avoid threatening stimuli, but it comes at the cost of reducing the depth of processing of the loss-inducing stimulus [36]. Studies in humans have shown that stimuli associated with loss of monetary rewards or with aversive outcomes (aversive images, odors, or electrical shocks) are less precisely memorized than stimulus features associated with positive outcomes [54–56]. Such a reduced influence of loss-associated stimuli on future behavior may partly be due to loss experiences curtailing the engagement with those stimuli and reducing the evaluation of their role for future choices, as demonstrated, e.g., in a sequential decision-making task [57]. One behavioral consequence of a reduced evaluation or memorization of aversive or loss-associated stimuli is an over-generalization of aversive outcomes to stimuli that only share some resemblance with the specific stimulus that caused the loss experience. Such over-generalization is evolutionarily meaningful because it can support the faster recognition of similar stimuli as potentially threatening even if these stimuli are not a precise instance of a previously encountered, loss-inducing stimulus [36,58]. For our task, a precise recognition of the features of a chosen object was pivotal for learning from feature-specific outcomes. Our finding of reduced learning from loss outcomes, therefore, indicates that ACC-TUS exacerbated the difficulty of assigning negative outcomes to features of the chosen object.
Neurophysiological support for this interpretation of the ACC as mediating negative credit assignment comes from studies showing that a substantial proportion of neurons in the ACC encode feature-specific prediction errors [15]. These neurons responded to an error outcome most strongly when it was unexpected and when the chosen object contained a specific visual feature. This is precisely the information that is needed to learn which visual features should be avoided in future trials. Consistent with this importance for learning, the same study reported that feature-specific prediction error signals in the ACC predicted when neurons updated value expectations about specific features [15,59]. Thus, neurons in the ACC signal feature-specific negative prediction errors and the updating of feature-specific value predictions. It seems likely that disrupting these signals with TUS would lead to impaired feature-specific credit assignment in our task. This scenario is supported by the finding that ACC-TUS impaired flexible learning most prominently at high load, i.e., when there was a high degree of uncertainty about which stimulus feature was associated with gains and which features were associated with losses (Fig 2C). This finding suggests that outcome processes such as credit assignment in an intact ACC critically contribute to learning about aversive distractors.
A role of the ACC to determine learning rates for aversive outcomes
So far, we have discussed that ACC-TUS likely reduced the efficiency of information sampling and that this reduction could originate from a disruption of feature-specific credit assignment processes. These phenomena were exclusively observed in the loss context; ACC-TUS did not change learning or information sampling in the gain-only context. Based on this finding, we propose that the ACC plays an important role in determining the learning rate for negative or aversive outcomes and thereby controls how fast subjects learn which object features in their environments have aversive consequences. Such learning from negative outcomes was particularly important in our task at higher cognitive load [60]. At high load, a single loss outcome provided unequivocal information that all features of the chosen object were non-rewarded features in the ongoing learning block. This was different for positive outcomes. Receiving a positive outcome was linked to only 1 feature out of the 2 or 3 features of the chosen object, making it more difficult to discern the precise cause of the outcome. The higher informativeness of negative than positive outcomes can explain why ACC-TUS caused a selective learning impairment in the loss context, assuming that TUS introduced uncertainty in the neural information about the cause of the outcome. This interpretation is consistent with the established role of the ACC in encoding different types of errors [32,61,62] and with computational evidence that learning from errors and other negative outcomes depends on a dedicated learning mechanism separate from learning from positive outcomes [60,63–65].
The suggested role of the ACC in determining the rate of learning from negative outcomes describes a mechanism for establishing which visual objects are distractors that should be actively avoided and suppressed in a given learning context [66]. When subjects experience aversive outcomes, the ACC may use these experiences to bias attention and information sampling away from the negatively associated stimuli. This suggestion is consistent with the fast neural responses in the ACC to attention cues that trigger inhibition of distracting information and enhancement of target information [16,19]. In these studies, the fast cue-onset activity reflected a push-pull attention effect that occurred during covert attentional orienting and was independent of actual motor actions. This observation supports the general notion that ACC circuits are critical for guiding covert and overt information sampling during adaptive behaviors [18,25].
Versatility of the focused-ultrasound protocol for transcranial neuromodulation
Our conclusions were made possible by a TUS protocol that was developed to interfere with local neural activity in deep neural structures and that likely imposes a temporary functional disconnection of the sonicated area from its network [38,40]. We implemented an enhanced protocol that entailed quantifying the anatomical targeting precision (S3A and S3B Fig) and confirmed by computer simulations that sonication power reached the targets (S3C–S3E Fig). We also showed that the TUS pressure in the ACC was comparable to the maximum pressure in the brain and noticeably higher than in other nearby brain structures such as the orbitofrontal cortex (see Materials and methods and S3F Fig). We also documented that the main behavioral effect (impaired learning) of ACC-TUS was evident relative to a within-task pre-sonication baseline and throughout the experimental behavioral session (S9 Fig), and that the main ACC-TUS effects were qualitatively consistent across monkeys (for monkey-specific results, see S1–S10 Figs; there were no significant random effects of the factor monkey in the LME models). Importantly, the main behavioral effects of ACC-TUS in our task are consistent with the effects of more widespread, invasive lesions of the ACC in nonhuman primates. Widespread ACC lesions reduce affective responses to harmful stimuli [67], increase response perseveration [68], cause failures to use reward history to guide choices [7,8], and reduce the control needed to inhibit prepotent motor programs [69,70]. Our study, therefore, illustrates the versatility of the TUS approach for modulating deeper structures such as the ACC that have so far been out of reach for noninvasive neuromodulation techniques such as transcranial magnetic stimulation (TMS) or transcranial direct current stimulation (tDCS) [71]. Despite these suggestions, it should be made explicit that there is so far a scarcity of insights into the specific effects of the TUS protocol on neural circuits. Future studies will therefore need to investigate the neural mechanisms underlying the immediate and longer-lasting TUS effects on ACC neural circuits.
In summary, our results suggest that the ACC multiplexes motivational effort control and attentional control functions by tracking the costs of incorrect performance, optimizing feature-specific credit assignment for aversive outcomes, and actively guiding information sampling to visual objects during adaptive behaviors.
Materials and methods
Ethics statement
All procedures were in accordance with the National Institutes of Health Guide for the Care and Use of Laboratory Animals, the Society for Neuroscience Guidelines and Policies, and approved by the Vanderbilt University Institutional Animal Care and Use Committee (M1700198-01).
Experimental procedures
Two male macaque monkeys (monkey I, 13.6 kg, and monkey W, 14.6 kg; 8 to 9 years of age) contributed to the experiments. They sat in primate chairs in a sound-proof booth with their head position fixed, facing a 21” LCD screen at a distance of 63 cm from their eyes to the screen center. Behavior, visual display, stimulus timing, and reward delivery were controlled by the Unified Suite for Experiments (USE), which integrates an IO-controller board with a Unity3D video-engine-based control for displaying visual stimuli, registering behavioral responses, and triggering reward delivery [72]. Prior to the ultrasound experiment, the animals were trained on the feature learning task in a kiosk training station [73]. Monkeys first learned to choose objects for an immediate fluid reward before transitioning to a token system in which each correct choice provided green circles that symbolized tokens later cashed out for fluid reward. Tokens were presented above the chosen object and traveled to a token bar, where they accumulated with successive correct performance. When 5 tokens were collected, the token bar blinked red/white, fluid was delivered through a sipper tube, and the token bar reset to 5 empty token placeholders (Fig 1A). The animals effortlessly adopted the token reward system, as documented in [36]. Here, we used, in separate blocks of 35 to 50 trials, a condition with “gains-only” (3 tokens for correct choices, no penalties) and a condition with “gains-and-losses” (2 tokens for correct choices and 1 token lost, i.e., removed from the token bar, for incorrect choices). The introduction of gains-and-losses effectively changed behavior: animals learned more slowly, showed reduced plateau accuracy, enhanced exploratory sampling, and more checking of the token bar (S2B–S2H Fig).
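A minimal sketch of the token economy described above (our own illustration, not the USE task code; whether surplus tokens carry over after a cash-out is not specified here and is treated as a full reset):

```python
def update_token_bar(tokens, outcome_tokens, cashout_at=5):
    """Apply one trial's outcome to the token bar. Gains add tokens, a loss removes
    one already attained token (never below zero). Collecting `cashout_at` tokens
    triggers fluid reward and resets the bar. Returns (tokens, reward_delivered)."""
    tokens = max(0, tokens + outcome_tokens)
    if tokens >= cashout_at:
        return 0, True      # cash out; a full reset of the token bar is assumed here
    return tokens, False

# Gain-loss context: +2 tokens for a correct choice, -1 for an incorrect choice.
bar = 0
for outcome in (+2, +2, -1, +2, +2):
    bar, rewarded = update_token_bar(bar, outcome)
    print(bar, rewarded)
```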
Task paradigm
The task required monkeys to learn feature-reward rules in blocks of 35 to 60 trials through trial-and-error by choosing 1 of 3 objects. Objects were composed of multiple features, but only 1 feature was associated with reward (Fig 1A–1C). A trial started with the appearance of a black circle with a diameter of 1 cm (0.5° radius wide) on a uniform gray background. Monkeys fixated the black circle for 150 ms to start a trial. Within 500 ms after the central gaze fixation was registered, 3 objects appeared on the screen at 3 randomly selected locations out of 4 possible locations with equal distance from the screen center (10.5 cm, 5° eccentricity). Each stimulus had a diameter of 3 cm (approximately 1.5° radius wide). To choose an object, monkeys had to maintain fixation on the object for at least 700 ms. Monkeys had 5 s to choose 1 of the 3 objects or the trial was aborted. Choosing the correct object was followed by a yellow halo around the stimulus as visual feedback (500 ms), an auditory tone, and either 2 or 3 tokens (green circles) added to the token bar (Fig 1A). Choosing an object without the rewarded target feature was followed by a blue halo around the selected object, a low-pitched auditory feedback, and, in the loss conditions, the presentation of a gray “loss” token that traveled to the token bar, where one already attained token was removed. The timing of the feedback was identical for all types of feedback. In each session, monkeys were presented with up to 36 separate learning blocks, each with a unique feature-reward rule. Across all 48 experimental sessions, monkeys completed on average 23.6 (±4 SE) learning blocks per session (monkey W: 20.5 ± 4, monkey I: 26.5 ± 4). For each experimental session, a unique set of objects was defined by randomly selecting 3 dimensions and 3 feature values per dimension (e.g., 3 body shapes: oblong, pyramidal, and ellipsoid; 3 arm types: upward pointy, straight blunt, and downward flared; 3 patterns: checkerboard, horizontal striped, and vertical sinusoidal; Watson and colleagues, 2019 [74] have documented the complete library of features). From this feature set, 3 different task conditions were defined: one task condition contained objects that varied in only 1 feature dimension while all other features were identical, i.e., the object body shapes were oblong, pyramidal, and ellipsoid, but all objects had blunt straight arms and uniform gray color (“1D” condition). A second task condition defined objects that varied in 2 feature dimensions (“2D” condition), and a third task condition defined objects that varied in 3 feature dimensions (“3D” condition). Learning is systematically more demanding with an increasing number of feature dimensions that could contain the rewarded feature (for a computational analysis of the task, see Womelsdorf and colleagues, 2021). We refer to the variation of object feature dimensionality as cognitive load because it corresponds to the size of the feature space subjects have to search to find the rewarded feature (Figs 1E and S1); the 1D, 2D, and 3D conditions varied randomly across blocks.
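To make the cognitive-load manipulation concrete, the sketch below builds one block's three objects from a hypothetical feature library (feature names follow the examples above; the full library is documented in Watson and colleagues, 2019, and the generation code here is our own illustration, not the experiment software).

```python
import random

# Hypothetical feature library with three dimensions and three features per dimension.
FEATURE_LIBRARY = {
    "body shape": ["oblong", "pyramidal", "ellipsoid"],
    "arm type":   ["upward pointy", "straight blunt", "downward flared"],
    "pattern":    ["checkerboard", "horizontal striped", "vertical sinusoidal"],
}

def make_block_objects(cognitive_load, rng):
    """Return three objects and the feature-reward rule for one learning block.
    `cognitive_load` (1, 2, or 3) is the number of dimensions whose features vary
    across the three objects; the remaining dimensions are held constant."""
    dims = list(FEATURE_LIBRARY)
    varying = rng.sample(dims, cognitive_load)
    objects = []
    for i in range(3):
        objects.append({d: FEATURE_LIBRARY[d][i] if d in varying else FEATURE_LIBRARY[d][0]
                        for d in dims})
    # Exactly one feature of one varying dimension is associated with reward.
    target_dim = rng.choice(varying)
    rewarded_feature = rng.choice([obj[target_dim] for obj in objects])
    return objects, (target_dim, rewarded_feature)

objects, rule = make_block_objects(cognitive_load=2, rng=random.Random(0))
print("rewarded feature:", rule)
for obj in objects:
    print(obj)
```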
Experimental design
Each session randomly varied, across blocks, 2 motivational conditions (gain/loss and gains-only) and 3 cognitive load conditions (1D, 2D, and 3D). Randomization ensured that all 6 combinations of conditions were present in the first 6 blocks prior to sonication and that all combinations of conditions were shown equally often in the 24 blocks after the first 6 blocks. After monkeys completed the first 6 learning blocks (on average 12 min after starting the experiment), the task was paused to apply TUS. After bilateral placement of the transducer and sonication (or sham sonication) of the brain areas, the task resumed for 18 to 24 more blocks (Fig 1D). Experimental sessions lasted 90 to 120 min.
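A sketch of a block schedule satisfying the stated constraints (our own illustration, not the experiment-control code; the post-sonication block count defaults to 24):

```python
import random

def block_schedule(n_post_blocks=24, seed=0):
    """All 6 combinations of motivational context (gain-only, gain-loss) and cognitive
    load (1D, 2D, 3D) appear once, in random order, in the 6 pre-sonication blocks
    and equally often in the post-sonication blocks."""
    rng = random.Random(seed)
    combos = [(ctx, load) for ctx in ("gain-only", "gain-loss") for load in (1, 2, 3)]
    pre = rng.sample(combos, len(combos))      # all 6 combinations in random order
    post = combos * (n_post_blocks // len(combos))
    rng.shuffle(post)
    return pre + post

print(block_schedule()[:8])
```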
Transcranial ultrasound stimulation (TUS)
For transcranial stimulation, we used a single-element transducer with a curvature of 63.2 mm and an active diameter of 64 mm (H115MR, Sonic Concepts, Bothell, Washington, United States of America). The transducer was attached to a cone with a custom-made trackable arm. Before each session, we filled the transducer cone with warm water and sealed the cone with a latex membrane. Conductive ultrasound gel was used for coupling between the transducer cone and the monkey’s shaved head. A digital function generator (Keysight 33500B series, Santa Rosa, California, USA) was used to generate periodic bursts of 30-ms stimulation at a resonant frequency of 250 kHz with an interval of 100 ms, for a total duration of 40 s and 1.2 MPa pressure (similar to [39,75]) (Fig 1D). The function generator was connected to a 150-watt amplifier with a gain of 55 dB in the continuous range to deliver the input voltage to the transducer (E&I, Rochester, New York, USA). We measured the transducer output in water using a calibrated ceramic needle hydrophone (HNC 0400, Onda Corp., Sunnyvale, California, USA) and established a linear relationship between the input voltage and peak pressure. To avoid hydrophone damage, only pressures below a mechanical index (MI) of 1.0 were measured, and amplitudes above this were extrapolated. We have previously measured transducer output at MI > 1.0 with a calibrated optical hydrophone (Precision Acoustics, Dorchester, United Kingdom) to validate the linearity of this relationship at higher MI, but this calibrated device was not available during these studies. During stimulation, the bi-directionally coupled (ZABDC50-150HP+, Mini Circuits, Brooklyn, New York, USA) feedforward and feedback voltages were monitored and logged using a PicoScope 5000 series (A-API; Pico Technology, Tyler, Texas, USA) and a custom-written Python script.
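For reference, the stimulation timing can be summarized numerically, and the voltage-to-pressure calibration can be illustrated with a linear fit (only the protocol parameters are taken from the text; the calibration points below are invented for illustration and do not reproduce the actual hydrophone measurements).

```python
import numpy as np

# Protocol parameters as reported: 30-ms bursts every 100 ms for 40 s at 250 kHz.
burst_ms, period_ms, total_s, f0_hz = 30.0, 100.0, 40.0, 250e3
n_bursts = int(total_s * 1000 / period_ms)        # 400 bursts per 40-s sonication
duty_cycle = burst_ms / period_ms                 # 0.3 (30%)
cycles_per_burst = f0_hz * burst_ms / 1000        # 7,500 acoustic cycles per burst
print(n_bursts, duty_cycle, cycles_per_burst)

# Hypothetical hydrophone calibration points (input voltage vs. peak pressure in MPa),
# fit linearly and extrapolated to the 1.2 MPa used in the experiment.
volts = np.array([10.0, 20.0, 30.0, 40.0, 50.0])
pressure_mpa = np.array([0.12, 0.25, 0.36, 0.49, 0.61])
slope, intercept = np.polyfit(volts, pressure_mpa, 1)
print(round((1.2 - intercept) / slope, 1), "V input needed for 1.2 MPa (extrapolated)")
```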
Four different sonication conditions were pseudo-randomly assigned to the experimental days of each week over a 12-week experimental protocol per monkey. These 4 conditions consisted of high-energy TUS in the anterior striatum (TUS-STR), high-energy TUS in the anterior cingulate cortex (TUS-ACC), sham anterior striatum (Sham-STR), and sham anterior cingulate cortex (Sham-ACC) (Fig 1D). We sequentially targeted an area in both hemispheres (each hemisphere for a 40-s duration) with real-time monitoring of the distance of the transducer to the targeted area (S3A Fig) and monitoring of the feedforward power (S3B Fig). Sham conditions were identical to TUS conditions, except that no power was transmitted to the transducer.
Data analysis
Trial-level statistical analysis
We tested TUS effects on behavior at the trial level using linear mixed effects (LME) models [10] with the following fixed-effect factors: cognitive load (CogLoad) with 3 levels (1D, 2D, and 3D distractor feature dimensions; ratio scale with values 1, 2, and 3), trial in block (TIB), previous trial outcome (PrevOutc), which is the number of tokens gained or lost in the previous trial, the motivational token condition, which we call the motivational gain/loss context (MCtxGain/Loss), with 2 levels (1 for the loss condition and 2 for the gain condition; nominal variable), TUS condition (TUSCnd) with 4 levels (Sham-ACC, TUS-ACC, Sham-STR, and TUS-STR), and time relative to stimulation (T2Stim) with 2 levels (before versus after stimulation). We used 3 other factors as random effects: target feature (Feat) with 4 levels (color, pattern, arm, and shape), weekday of the experiment (Day) with 4 levels (Tuesday, Wednesday, Thursday, and Friday), and monkey with 2 levels (W and I). We used these factors to predict 3 metrics (Metric): accuracy (Accuracy), reaction time (RT), and information sampling (SampleExplr). The LME is formalized in Eq 1.
(1) Metric ~ CogLoad + TIB + PrevOutc + MCtxGain/Loss + TUSCnd + T2Stim + (1 | Feat) + (1 | Day) + (1 | Monkey)
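A minimal Python sketch of a model with this structure (the authors' analyses were implemented in MATLAB; here, statsmodels is used with simulated data, only the monkey factor enters as the grouping variable, and the Feat and Day random effects are omitted for brevity):

```python
import numpy as np
import pandas as pd
import statsmodels.formula.api as smf

# Simulated trial table whose columns mirror the factors described above.
rng = np.random.default_rng(0)
n = 2000
df = pd.DataFrame({
    "CogLoad":  rng.choice([1, 2, 3], n),
    "TIB":      rng.integers(1, 40, n),
    "PrevOutc": rng.choice([-1, 2, 3], n),
    "MCtx":     rng.choice(["gain-only", "gain-loss"], n),
    "TUSCnd":   rng.choice(["ACC-Sham", "ACC-TUS", "STR-Sham", "STR-TUS"], n),
    "T2Stim":   rng.choice(["pre", "post"], n),
    "Monkey":   rng.choice(["W", "I"], n),
})
df["Accuracy"] = (rng.random(n) < 0.8).astype(float)

# Fixed effects follow Eq 1; Monkey enters as the random (grouping) effect.
model = smf.mixedlm(
    "Accuracy ~ CogLoad + TIB + PrevOutc + C(MCtx) + C(TUSCnd) + C(T2Stim)",
    data=df, groups=df["Monkey"])
print(model.fit().summary())
```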
Block-level analysis of behavioral metrics
We used LME models to analyze across blocks how learning speed (indexed as the “learning trial” (LT) at which criterion performance was reached), post-learning accuracy (Accuracy), and metrics for choice and information sampling were affected by TUS. In addition to the fixed and random effects factors used for the LME in Eq 1, we also included the factor block switching condition (SwitchCnd) with 2 levels (intra- and extra-dimensional switch). The LME had the form of Eq 2:
(2) Metric ~ CogLoad + MCtxGain/Loss + TUSCnd + T2Stim + SwitchCnd + (1 | Feat) + (1 | Day) + (1 | Monkey)
We extended the model to test for interactions of TUSCnd, MCtxGain/Loss, and T2Stim:
(3) Metric ~ CogLoad + SwitchCnd + TUSCnd × MCtxGain/Loss × T2Stim + (1 | Feat) + (1 | Day) + (1 | Monkey)
for interactions of T2Stim, CogLoad, and TUSCnd:
(4) Metric ~ MCtxGain/Loss + SwitchCnd + T2Stim × CogLoad × TUSCnd + (1 | Feat) + (1 | Day) + (1 | Monkey)
and for interactions of T2Stim, CogLoad, MCtxGain/Loss, and TUSCnd:
(5) Metric ~ SwitchCnd + T2Stim × CogLoad × MCtxGain/Loss × TUSCnd + (1 | Feat) + (1 | Day) + (1 | Monkey)
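Continuing the sketch above, the interaction terms of Eqs 3–5 can be expressed by replacing the corresponding main effects with crossed terms (illustrative formulas only; the block-level data frame `blocks` and its column names are hypothetical).

```python
import statsmodels.formula.api as smf

# Block-level interaction models sketched in formula notation (not the authors' code).
EQ3 = "LT ~ CogLoad + C(SwitchCnd) + C(TUSCnd) * C(MCtx) * C(T2Stim)"
EQ4 = "LT ~ C(MCtx) + C(SwitchCnd) + C(T2Stim) * CogLoad * C(TUSCnd)"
EQ5 = "LT ~ C(SwitchCnd) + C(T2Stim) * CogLoad * C(MCtx) * C(TUSCnd)"

def fit_block_model(blocks, formula=EQ3):
    """Fit one of the block-level mixed models with Monkey as the grouping factor."""
    return smf.mixedlm(formula, data=blocks, groups=blocks["Monkey"]).fit()
```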
Supporting information
Acknowledgments
The authors thank Dr. Marcus Watson for help with the experimental software, Huiwen Luo for assistance with calibrating optically tracked devices using MRI, and Adrienne Hawkes for technical assistance with voltage monitoring. The content is solely the responsibility of the authors and does not necessarily represent the official views of the National Institutes of Health.
Abbreviations
- ACC: anterior cingulate cortex
- GLM: generalized linear model
- GTI: gross token income
- LME: linear mixed effect
- MI: mechanical index
- STR: striatum
- tDCS: transcranial direct current stimulation
- TMS: transcranial magnetic stimulation
- TUS: transcranial ultrasound stimulation
Data Availability
All data supporting this study and its findings can be found at: https://figshare.com/projects/TUS_PlosBiology/144330. Custom MATLAB code generated for the analyses is available at: https://github.com/banaiek/TUS_PlosBiology_2022.git.
Funding Statement
This work was supported by the National Institute of Mental Health of the National Institutes of Health under Award Numbers R01MH123687 (TW) and 1UF1NS107666 (CC). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Averbeck BB. Amygdala and ventral striatum population codes implement multiple learning rates for reinforcement learning. 2017 IEEE Symposium Series on Computational Intelligence (SSCI). 2017. pp. 1–5. doi: 10.1109/SSCI.2017.8285354 [DOI] [Google Scholar]
- 2.Heilbronner SR, Hayden BY. Dorsal Anterior Cingulate Cortex: A Bottom-Up View. Annu Rev Neurosci. 2016;39:149–170. doi: 10.1146/annurev-neuro-070815-013952 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Hikosaka O, Yasuda M, Nakamura K, Isoda M, Kim HF, Terao Y, et al. Multiple neuronal circuits for variable object–action choices based on short- and long-term memories. Proc Natl Acad Sci U S A. 2019;116:26313–26320. doi: 10.1073/pnas.1902283116 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Shenhav A, Cohen JD, Botvinick MM. Dorsal anterior cingulate cortex and the value of control. Nat Neurosci. 2016;19:1286–1291. doi: 10.1038/nn.4384 [DOI] [PubMed] [Google Scholar]
- 5.Camille N, Tsuchida A, Fellows LK. Double Dissociation of Stimulus-Value and Action-Value Learning in Humans with Orbitofrontal or Anterior Cingulate Cortex Damage. J Neurosci. 2011;31:15048–15052. doi: 10.1523/JNEUROSCI.3164-11.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Buckley MJ, Mansouri FA, Hoda H, Mahboubi M, Browning PGF, Kwok SC, et al. Dissociable Components of Rule-Guided Behavior Depend on Distinct Medial and Prefrontal Regions. Science. 2009;325:52–58. doi: 10.1126/science.1172377 [DOI] [PubMed] [Google Scholar]
- 7.Kennerley SW, Walton ME, Behrens TEJ, Buckley MJ, Rushworth MFS. Optimal decision making and the anterior cingulate cortex. Nat Neurosci. 2006;9:940–947. doi: 10.1038/nn1724 [DOI] [PubMed] [Google Scholar]
- 8.Rudebeck PH, Behrens TE, Kennerley SW, Baxter MG, Buckley MJ, Walton ME, et al. Frontal Cortex Subregions Play Distinct Roles in Choices between Actions and Stimuli. J Neurosci. 2008;28:13775–13785. doi: 10.1523/JNEUROSCI.3541-08.2008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Clarke HF, Robbins TW, Roberts AC. Lesions of the Medial Striatum in Monkeys Produce Perseverative Impairments during Reversal Learning Similar to Those Produced by Lesions of the Orbitofrontal Cortex. J Neurosci. 2008;28:10972–10982. doi: 10.1523/JNEUROSCI.1521-08.2008 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Rothenhoefer KM, Costa VD, Bartolo R, Vicario-Feliciano R, Murray EA, Averbeck BB. Effects of Ventral Striatum Lesions on Stimulus-Based versus Action-Based Reinforcement Learning. J Neurosci. 2017;37:6902–6914. doi: 10.1523/JNEUROSCI.0631-17.2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Hayden BY, Pearson JM, Platt ML. Fictive Reward Signals in the Anterior Cingulate Cortex. Science. 2009;324:948–950. doi: 10.1126/science.1168488 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Kennerley SW, Dahmubed AF, Lara AH, Wallis JD. Neurons in the Frontal Lobe Encode the Value of Multiple Decision Variables. J Cogn Neurosci. 2009;21:1162–1178. doi: 10.1162/jocn.2009.21100 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Lau B, Glimcher PW. Value Representations in the Primate Striatum during Matching Behavior. Neuron. 2008;58:451–463. doi: 10.1016/j.neuron.2008.02.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Kawai T, Yamada H, Sato N, Takada M, Matsumoto M. Roles of the Lateral Habenula and Anterior Cingulate Cortex in Negative Outcome Monitoring and Behavioral Adjustment in Nonhuman Primates. Neuron. 2015;88:792–804. doi: 10.1016/j.neuron.2015.09.030 [DOI] [PubMed] [Google Scholar]
- 15.Oemisch M, Westendorff S, Azimi M, Hassani SA, Ardid S, Tiesinga P, et al. Feature-specific prediction errors and surprise across macaque fronto-striatal circuits. Nat Commun. 2019;10:1–15. doi: 10.1038/s41467-018-08184-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Banaie Boroujeni K, Tiesinga P, Womelsdorf T. Interneuron specific gamma synchronization indexes cue uncertainty and prediction errors in lateral prefrontal and anterior cingulate cortex. Elife. 2021;10:e69111. doi: 10.7554/eLife.69111 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Butler JL, Muller TH, Veselic S, Malalasekera WMN, Hunt LT, Behrens TEJ, et al. Covert valuation for information sampling and choice. bioRxiv. 2021. p. 2021.10.08.463476. doi: 10.1101/2021.10.08.463476 [DOI] [Google Scholar]
- 18.Hunt LT, Malalasekera WMN, de Berker AO, Miranda B, Farmer SF, Behrens TEJ, et al. Triple dissociation of attention and decision computations across prefrontal cortex. Nat Neurosci. 2018;21:1471–1481. doi: 10.1038/s41593-018-0239-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Kaping D, Vinck M, Hutchison RM, Everling S, Womelsdorf T. Specific Contributions of Ventromedial, Anterior Cingulate, and Lateral Prefrontal Cortex for Attentional Selection and Stimulus Valuation. PLoS Biol. 2011;9:e1001224. doi: 10.1371/journal.pbio.1001224 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Voloh B, Oemisch M, Womelsdorf T. Phase of firing coding of learning variables across the fronto-striatal network during feature-based learning. Nat Commun. 2020;11:4669. doi: 10.1038/s41467-020-18435-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Arcizet F, Krauzlis RJ. Covert spatial selection in primate basal ganglia. PLoS Biol. 2018;16:e2005930. doi: 10.1371/journal.pbio.2005930 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Boroujeni KB, Oemisch M, Hassani SA, Womelsdorf T. Fast spiking interneuron activity in primate striatum tracks learning of attention cues. Proc Natl Acad Sci U S A. 2020;117:18049–18058. doi: 10.1073/pnas.2001348117 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Mesulam M-M. A cortical network for directed attention and unilateral neglect. Ann Neurol. 1981;10:309–325. doi: 10.1002/ana.410100402 [DOI] [PubMed] [Google Scholar]
- 24.Womelsdorf T, Everling S. Long-Range Attention Networks: Circuit Motifs Underlying Endogenously Controlled Stimulus Selection. Trends Neurosci. 2015;38:682–700. doi: 10.1016/j.tins.2015.08.009 [DOI] [PubMed] [Google Scholar]
- 25.Monosov IE, Rushworth MFS. Interactions between ventrolateral prefrontal and anterior cingulate cortex during learning and behavioural change. Neuropsychopharmacology. 2021;1–15. doi: 10.1038/s41386-021-01079-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Ebitz RB, Smith EH, Horga G, Schevon CA, Yates MJ, McKhann GM, et al. Human dorsal anterior cingulate neurons signal conflict by amplifying task-relevant information. bioRxiv. 2020. p. 2020.03.14.991745. doi: 10.1101/2020.03.14.991745 [DOI] [Google Scholar]
- 27.Jezzini A, Bromberg-Martin ES, Trambaiolli LR, Haber SN, Monosov IE. A prefrontal network integrates preferences for advance information about uncertain rewards and punishments. Neuron. 2021;109:2339–2352.e5. doi: 10.1016/j.neuron.2021.05.013 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Kolling N, Behrens TEJ, Mars RB, Rushworth MFS. Neural Mechanisms of Foraging. Science. 2012. [cited 2021 Sep 4]. Available from: https://www.science.org/doi/abs/10.1126/science.1216930. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Kolling N, Wittmann MK, Behrens TEJ, Boorman ED, Mars RB, Rushworth MFS. Value, search, persistence and model updating in anterior cingulate cortex. Nat Neurosci. 2016;19:1280–1285. doi: 10.1038/nn.4382
- 30.Shenhav A, Musslick S, Lieder F, Kool W, Griffiths TL, Cohen JD, et al. Toward a Rational and Mechanistic Account of Mental Effort. Annu Rev Neurosci. 2017;40:99–124. doi: 10.1146/annurev-neuro-072116-031526
- 31.Shenhav A, Botvinick MM, Cohen JD. The Expected Value of Control: An Integrative Theory of Anterior Cingulate Cortex Function. Neuron. 2013;79:217–240. doi: 10.1016/j.neuron.2013.07.007
- 32.Shen C, Ardid S, Kaping D, Westendorff S, Everling S, Womelsdorf T. Anterior Cingulate Cortex Cells Identify Process-Specific Errors of Attentional Control Prior to Transient Prefrontal-Cingulate Inhibition. Cereb Cortex. 2015;25:2213–2228. doi: 10.1093/cercor/bhu028
- 33.Etkin A, Egner T, Kalisch R. Emotional processing in anterior cingulate and medial prefrontal cortex. Trends Cogn Sci. 2011;15:85–93. doi: 10.1016/j.tics.2010.11.004
- 34.Laufer O, Israeli D, Paz R. Behavioral and Neural Mechanisms of Overgeneralization in Anxiety. Curr Biol. 2016;26:713–722. doi: 10.1016/j.cub.2016.01.023
- 35.Dugré JR, Dumais A, Bitar N, Potvin S. Loss anticipation and outcome during the Monetary Incentive Delay Task: a neuroimaging systematic review and meta-analysis. PeerJ. 2018;6:e4749. doi: 10.7717/peerj.4749
- 36.Boroujeni KB, Watson M, Womelsdorf T. Gains and Losses Differentially Regulate Attentional Efficacy at Low and High Attentional Load. J Cogn Neurosci. 2022. doi: 10.1162/jocn_a_01885
- 37.Yechiam E, Hochman G. Losses as modulators of attention: Review and analysis of the unique effects of losses over gains. Psychol Bull. 2013;139:497–518. doi: 10.1037/a0029383
- 38.Fouragnan EF, Chau BKH, Folloni D, Kolling N, Verhagen L, Klein-Flügge M, et al. The macaque anterior cingulate cortex translates counterfactual choice value into actual behavioral change. Nat Neurosci. 2019;22:797–808. doi: 10.1038/s41593-019-0375-6
- 39.Verhagen L, Gallea C, Folloni D, Constans C, Jensen DE, Ahnine H, et al. Offline impact of transcranial focused ultrasound on cortical activation in primates. Elife. 2019;8:e40541. doi: 10.7554/eLife.40541
- 40.Bongioanni A, Folloni D, Verhagen L, Sallet J, Klein-Flügge MC, Rushworth MFS. Activation and disruption of a neural mechanism for novel choice in monkeys. Nature. 2021;591:270–274. doi: 10.1038/s41586-020-03115-5
- 41.Clennell B, Steward TGJ, Elley M, Shin E, Weston M, Drinkwater BW, et al. Transient ultrasound stimulation has lasting effects on neuronal excitability. Brain Stimul. 2021;14:217–225. doi: 10.1016/j.brs.2021.01.003
- 42.Kaanders P, Nili H, O’Reilly JX, Hunt L. Medial Frontal Cortex Activity Predicts Information Sampling in Economic Choice. J Neurosci. 2021;41:8403–8413. doi: 10.1523/JNEUROSCI.0392-21.2021
- 43.Kahneman D, Tversky A. Prospect Theory: An Analysis of Decision under Risk. Econometrica. 1979;47:263–291. doi: 10.2307/1914185
- 44.Tversky A, Kahneman D. The Framing of Decisions and the Psychology of Choice. Science. 1981;211:453–458. doi: 10.1126/science.7455683
- 45.Gehring WJ, Willoughby AR. The Medial Frontal Cortex and the Rapid Processing of Monetary Gains and Losses. Science. 2002;295:2279–2282. doi: 10.1126/science.1066893
- 46.Botvinick MM, Braver TS, Barch DM, Carter CS, Cohen JD. Conflict monitoring and cognitive control. Psychol Rev. 2001;108:624–652. doi: 10.1037/0033-295x.108.3.624
- 47.Cole MW, Bagic A, Kass R, Schneider W. Prefrontal Dynamics Underlying Rapid Instructed Task Learning Reverse with Practice. J Neurosci. 2010;30:14245–14254. doi: 10.1523/JNEUROSCI.1662-10.2010
- 48.Mansouri FA, Tanaka K, Buckley MJ. Conflict-induced behavioural adjustment: a clue to the executive functions of the prefrontal cortex. Nat Rev Neurosci. 2009;10:141–152. doi: 10.1038/nrn2538
- 49.Ghazizadeh A, Griggs W, Hikosaka O. Ecological Origins of Object Salience: Reward, Uncertainty, Aversiveness, and Novelty. Front Neurosci. 2016;10:378. doi: 10.3389/fnins.2016.00378
- 50.Monosov IE. Anterior cingulate is a source of valence-specific information about value and uncertainty. Nat Commun. 2017;8:134. doi: 10.1038/s41467-017-00072-y
- 51.White JK, Bromberg-Martin ES, Heilbronner SR, Zhang K, Pai J, Haber SN, et al. A neural network for information seeking. Nat Commun. 2019;10:5168. doi: 10.1038/s41467-019-13135-z
- 52.Lejarraga T, Hertwig R. How the threat of losses makes people explore more than the promise of gains. Psychon Bull Rev. 2017;24:708–720. doi: 10.3758/s13423-016-1158-7
- 53.Lejarraga T, Schulte-Mecklenbeck M, Pachur T, Hertwig R. The attention–aversion gap: how allocation of attention relates to loss aversion. Evol Hum Behav. 2019;40:457–469. doi: 10.1016/j.evolhumbehav.2019.05.008
- 54.Resnik J, Sobel N, Paz R. Auditory aversive learning increases discrimination thresholds. Nat Neurosci. 2011;14:791–796. doi: 10.1038/nn.2802
- 55.Schechtman E, Laufer O, Paz R. Negative Valence Widens Generalization of Learning. J Neurosci. 2010;30:10460–10464. doi: 10.1523/JNEUROSCI.2377-10.2010
- 56.Shalev L, Paz R, Avidan G. Visual Aversive Learning Compromises Sensory Discrimination. J Neurosci. 2018;38:2766–2779. doi: 10.1523/JNEUROSCI.0889-17.2017
- 57.Huys QJM, Eshel N, O’Nions E, Sheridan L, Dayan P, Roiser JP. Bonsai Trees in Your Head: How the Pavlovian System Sculpts Goal-Directed Choices by Pruning Decision Trees. PLoS Comput Biol. 2012;8:e1002410. doi: 10.1371/journal.pcbi.1002410
- 58.Moscarello JM, Hartley CA. Agency and the calibration of motivated behavior. Trends Cogn Sci. 2017;21:725–735. doi: 10.1016/j.tics.2017.06.008
- 59.Quilodran R, Rothé M, Procyk E. Behavioral Shifts and Action Valuation in the Anterior Cingulate Cortex. Neuron. 2008;57:314–325. doi: 10.1016/j.neuron.2007.11.031
- 60.Womelsdorf T, Watson MR, Tiesinga P. Learning at Variable Attentional Load Requires Cooperation of Working Memory, Meta-learning and Attention-augmented Reinforcement Learning. J Cogn Neurosci. 2021;1–29. doi: 10.1162/jocn_a_01780
- 61.Holroyd CB, Nieuwenhuis S, Yeung N, Nystrom L, Mars RB, Coles MGH, et al. Dorsal anterior cingulate cortex shows fMRI response to internal and external error signals. Nat Neurosci. 2004;7:497–498. doi: 10.1038/nn1238
- 62.Kennerley SW, Behrens TEJ, Wallis JD. Double dissociation of value computations in orbitofrontal and anterior cingulate neurons. Nat Neurosci. 2011;14:1581–1589. doi: 10.1038/nn.2961
- 63.Cazé RD, van der Meer MAA. Adaptive properties of differential learning rates for positive and negative outcomes. Biol Cybern. 2013;107:711–719. doi: 10.1007/s00422-013-0571-5
- 64.Frank MJ, Seeberger LC, O’Reilly RC. By Carrot or by Stick: Cognitive Reinforcement Learning in Parkinsonism. Science. 2004;306:1940–1943. doi: 10.1126/science.1102941
- 65.Taswell CA, Costa VD, Murray EA, Averbeck BB. Ventral striatum’s role in learning from gains and losses. Proc Natl Acad Sci U S A. 2018;115:E12398–E12406. doi: 10.1073/pnas.1809833115
- 66.Noonan MP, Crittenden BM, Jensen O, Stokes MG. Selective inhibition of distracting input. Behav Brain Res. 2018;355:36–47. doi: 10.1016/j.bbr.2017.10.010
- 67.Bliss-Moreau E, Santistevan AC, Bennett J, Moadab G, Amaral DG. Anterior Cingulate Cortex Ablation Disrupts Affective Vigor and Vigilance. J Neurosci. 2021;41:8075–8087. doi: 10.1523/JNEUROSCI.0673-21.2021
- 68.Amiez C, Joseph JP, Procyk E. Reward Encoding in the Monkey Anterior Cingulate Cortex. Cereb Cortex. 2006;16:1040–1055. doi: 10.1093/cercor/bhj046
- 69.Ma L, Chan JL, Johnston K, Lomber SG, Everling S. Macaque anterior cingulate cortex deactivation impairs performance and alters lateral prefrontal oscillatory activities in a rule-switching task. PLoS Biol. 2019;17:e3000045. doi: 10.1371/journal.pbio.3000045
- 70.Shima K, Tanji J. Role for Cingulate Motor Area Cells in Voluntary Movement Selection Based on Reward. Science. 1998;282:1335–1338. doi: 10.1126/science.282.5392.1335
- 71.Polanía R, Nitsche MA, Ruff CC. Studying and modifying brain function with non-invasive brain stimulation. Nat Neurosci. 2018;21:174–187. doi: 10.1038/s41593-017-0054-4
- 72.Watson MR, Voloh B, Thomas C, Hasan A, Womelsdorf T. USE: An integrative suite for temporally-precise psychophysical experiments in virtual environments for human, nonhuman, and artificially intelligent agents. J Neurosci Methods. 2019;326:108374. doi: 10.1016/j.jneumeth.2019.108374
- 73.Womelsdorf T, Thomas C, Neumann A, Watson MR, Banaie Boroujeni K, Hassani SA, et al. A Kiosk Station for the Assessment of Multiple Cognitive Domains and Cognitive Enrichment of Monkeys. Front Behav Neurosci. 2021. doi: 10.3389/fnbeh.2021.721069
- 74.Watson MR, Voloh B, Naghizadeh M, Womelsdorf T. Quaddles: A multidimensional 3-D object set with parametrically controlled and customizable features. Behav Res Methods. 2019;51:2522–2532. doi: 10.3758/s13428-018-1097-5
- 75.Khalighinejad N, Bongioanni A, Verhagen L, Folloni D, Attali D, Aubry J-F, et al. A Basal Forebrain-Cingulate Circuit in Macaques Decides It Is Time to Act. Neuron. 2020;105:370–384.e8. doi: 10.1016/j.neuron.2019.10.030