Abstract
One dominant hypothesis about the function of the orbitofrontal cortex (OFC) is that the OFC signals the subjective values of possible outcomes to other brain areas for learning and decision making. This popular view generally neglects the fact that OFC is not necessary for simple value-based behavior (i.e., when values have been directly experienced). An alternative, emerging view suggests that OFC plays a more general role in representing structural information about the task or environment, derived from prior experience, and relevant to predicting behavioral outcomes, such as value. From this perspective, value signaling is simply one derivative of the core underlying function of OFC. New data in favor of both views have been accumulating rapidly. Here we review these new data in discussing the relative merits of these two ideas.
Introduction
In everyday life, both humans and animals choose between different alternative outcomes or goods (e.g., what to eat for dinner); this has been described as value-based decision making, since it is presumed to reflect subjective valuation of the different options [1]. The rubric underlying such value-based decision making assumes two consecutive steps: valuation of available options on a common scale and action selection based on these valuations. With the finding of neural correlates of this scalar value in the brain, especially in the OFC, this view has gained popularity [1,2]. OFC hence has become widely seen as a dedicated neural substrate for calculating subjective value, which other brain areas then use to guide choices among available outcomes or goods [3]. This is despite the relative lack of studies showing that OFC is fundamentally necessary for deciding between goods based on value, the widespread finding of value correlates in many other brain regions, and the lack of any definition of value that is independent of behavior [4,5]. With an emphasis on new data, below we will consider each of these problems, and then explore another emerging view suggesting that OFC plays a critical role in representing task structure and is only necessary for value-based behavior when such a representation – a cognitive model or map – is required for calculating the value underlying normal choice.
Neural correlates of value in the OFC
Perhaps the most influential evidence for the idea that the OFC’s core function is to signal value comes from primate recordings on an economic choice task [3,6], in which subjective value is measured though choice behavior. Although this idea certainly has a long history prior to this work, earlier studies generally emphasized the representation of associative information or associative conjunctions [7–10]. Work using the economic choice task was arguably the first to clearly dissociate neural correlates of “economic” value, or what is called revealed preference, from sensory and motor aspects of the underlying associations. That is, some OFC neurons appeared to respond only based upon the scalar value of the goods on offer and not about any of the unique information comprising the choice (cues, outcome features or quantities, direction or type of response). Of course, such correlates can only be interpreted within the narrow parametric space of the task (i.e., the specific cues used and particular outcomes on offer), but nevertheless the activity of this category of neurons, along with subsequent results in humans using fMRI [11,12], have been taken as strong evidence that OFC is fundamental to determining the value underlying our choices.
And correlative evidence supporting this role continues to accumulate; in various choice tasks, neuronal activity in the OFC correlates with value in fMRI imaging, electrophysiological, and calcium imaging studies [13–20]. These value representations are adaptive to outcome and context changes [15,21,22], modulated by gaze fixation [23] and covert shift of attention [24], and supported by a mixture of temporally dynamic and stable coding schemes [25]. OFC neurons projecting to striatum predominantly encode integrated value among several other decision variables [26], and even theta oscillations in the OFC are reported to track the values of new reward-predictive cues during learning [27].
When interpreting these data, however, it is important to bear in mind that what can be found in a correlative study is constrained by particular task designs. While each of these studies found value encoding in the OFC, each was designed for this purpose. And many do not employ the multidimensional approach that characterizes the economic choice studies. Instead, for practical reasons, these designs assume that neural correlates tracking just a single dimension of reward, such as likelihood, amount, or caloric content, necessarily represent value. Neural correlates tracking value across only one reward dimension may not be abstractly tuned ‘value’ neurons. Indeed even two dimensions provide a relatively weak test of the null hypothesis, given the variability of single-unit activity and the amount of training the subjects typically receive, which may drive changes in how specific outcomes within a narrow range (i.e., juices) are represented. As a result, tasks using few reward dimensions to determine value likely overestimate the number of neurons that represent a multi-dimensional, truly abstract value. Consistent with this idea, this permissive definition of value has led to observations of value coding being prevalent across the entire brain [4], even in the cerebellum [28] and hypothalamus [29]. Against this interpretation, while some OFC neurons do appear to strongly encode information that can be interpreted as value, nearly all neural recording studies have found that OFC neurons also respond to a variety of other task variables.
For instance, individual OFC neurons can be found with activity tuned to cue and outcome identity [16,30], spatial locations [31,32], action selection [14,33,34], integration of prior and current information [14,18], decision confidence [26,35], and task rules [36]. Even in now-classic studies of neuroeconomic value, in which value is determined from integration of multiple reward features, many OFC neurons represent the value of only some options, and still more respond based on sensory features of the goods [6]. Although not necessarily contradictory to the idea that the OFC signals value, this heterogeneity of findings does raise the possibility that value representation is not the core function of OFC, since it fails to explain so many neural correlates found there, even when the area is faced with what is a very narrow range of outcomes compared to what we experience in our daily lives.
Possibility of value as part of task structure representation
How can we account for such diverse neural correlates? An alternative hypothesis, which would predict such diversity, proposes that OFC represents states or “locations” and their transitions in the task space (i.e., the task structure or cognitive map) [37,38]. If this is the case, value can be seen as one important aspect or derivative of such a representation, particularly in the experimental settings typically employed in behavioral neuroscience studies. This hypothesis has considerable support from both older work as well as recent correlative studies in humans [39–44], non-human primates [45], and rodents [31,46–51].
For example, in perhaps the first human imaging study to focus on this specific question [40], subjects were trained to judge the age (young or old) of a face or a house that were spatially superimposed in the same image (Figure 1a). Whether to switch to judge a face or a house on a given trial was determined by a change of age on the previous trial. This created 16 unique trial types or states, the identification of which had to be inferred through trial transitions (Figure 1b). These “hidden” states, when necessary for behavioral learning and decision making, were found to be uniquely decodable from BOLD signals in the OFC (Figure 1c). The data provide direct evidence that the OFC represents task states that are crucial for cognitive mapping.
Critically, such hidden states are not the same as value, however their appropriate identification can clearly be useful for predicting value. For instance, performing properly in reversal tasks – something that is classically dependent on the OFC – can be facilitated by recognizing the hidden “reversal” state as distinct from initial learning [37]. This allows a new behavior to be acquired more quickly because it does not require first unlearning or modifying the prior learning, since it is not relevant in the new reversal state.
In an effort to show the interaction between the representation of value and states, we recently recorded single-unit activity from OFC while rats were performing a simple odor discrimination task in which performance reflected a knowledge of the odor sequence across trials [48]. Like the work above, proper responding in this task required tracking of prior events in order to discriminate hidden states predictive of reward (Figure 2a). In this context, we found that current value was a prominent feature of both the single-unit and neural-ensemble activity, but that the task states – positions in the sequence – were also decodable independent of this value information (Figure 2b,c). This included the hidden states, which were only discriminable based on prior events. The dissociation of value from structural information about the task suggests that the two are multiplexed in the OFC but separable at the neural-ensemble level, which argues against the idea that value is the ultimate output of the OFC and in favor of the proposal that encoding of structure is one fundamental function of the OFC.
The representation of hidden states independent of reward or other confounding factors is also evident in another odor sequence task, in which two alternating sequences of odor-guided trials were used to mimic a continuous T-maze [46]. Like the continuous T-maze, the two sequences have a common central path (the same odors at each position) with unique entry and exit paths (different odors at the same positions) (Figure 2d). Critically, there is no difference in value between the two sequences at any position – from a value-based perspective, there is simply a single loop in the task structure.
Recording single unit activity in OFC in this task, we found that neurons nevertheless differentiated the sequences at all positions; successful decoding was even observed at positions where both odor identities and values were exactly the same (i.e., hidden states) (Figure 2e). Thus, these data demonstrate that OFC neurons are able to disambiguate hidden states, even in the absence of differences in value or observable sensory inputs.
Encoding of structural information about the environment is also evident in OFC during sensory preconditioning. In this setting, when the reward is completely absent, OFC has been found to acquire representations of sensory-sensory associations in both rats [49] and humans [52]. Specifically, during the initial phase of the task – when two value-neutral sensory cues are paired – activities of both single units in rats and BOLD signals in humans change to reflect the associations between the two cues by exhibiting similar response patterns.
OFC is required for model-based value computation
Correlative studies discussed above collectively suggest that the value signaling is not something peculiar in the OFC but can be seen as part of the task structure representation. One could argue, however, that the task structure is represented for the sole purpose of computing value within the OFC, and thus the value signaling is still the major functional output of OFC. If this is the case, disrupting OFC function should always disrupt behavioral performance in value-based decision-making tasks, without affecting behavior when value is not at issue. And yet this is not what causal data shows. Instead, it is easy to find value-based behavior – even in explicit-choice tasks – in which inactivating OFC has no effect, and there are examples of behavior that do not seem to involve value at all that still require the OFC.
Value-based behavior that is independent of the OFC is easy to find and ranges from Pavlovian conditioning to operant responding to discrimination learning [4]. Indeed most behaviors in experimental settings, the vast majority of which are value-based, do not require OFC. However the most striking such evidence comes directly from a rodent version of the economic choice task first developed in monkeys to search for value representations [6]. In this task, rats were trained to make choices between pairs of food pellets [53] (Figure 3a). On each trial, the shape and quantity of each pellet on offer was indicated by the shape and number of segments of two symbols presented on touch screens in front of the rat. By touching the appropriate screen, the rat gained access to the type and quantity of the pellet associated with the symbol. Optogenetically inactivating either lateral [53] or medial [54] OFC had no effect on the choice behavior of well-trained rats in this setting, as indexed by the subjective preference, steepness of the choice curve, or transitivity between different pairs of goods (Figure 3b). Similar negative results were also reported in a choice task requiring information about probability and amount to be integrated; value-based choices on individual trials were not affected by lateral OFC inactivation [55]. While there are examples to counter these reports, in subjects that have less experience in the task or in which the OFC is artificially over-activated instead of being inhibited [13,56], the above work demonstrates that in very well-controlled settings, the OFC is not strictly necessary for value-based behavior, even when current value must be integrated across multiple dimensions of information. This should not be possible if the OFC’s core function is to calculate economic value.
Notably, at the same time that OFC is not necessary for value-based choices in the above studies, it is critical for value updating that relies on model-based inference. This was first evident in Pavlovian reinforcer devaluation, an iconic behavioral task to assess model-based inference. In this task, subjects are initially trained to associate a cue and a reward. After the outcome is paired with an unpleasant experience such as LiCl injection or satiety, normal subjects update value of the cue through its model-based association with the outcome, which is reflected in reduced responding to the cue during the probe test. Work using this procedure has shown repeatedly that OFC manipulations disrupt normal changes in behavior after devaluation [57–64]. This conclusion is consistent with many other studies using different behavioral tasks, such as overexpectation [65] and sensory preconditioning [66,67], in which model-based inference is needed for value updating. It is worth noting that testing the necessity of OFC in classic instrumental devaluation tasks has led to negative results, suggesting a dissociation in the support of strictly instrumental versus Pavlovian representations [58,68,69], although effects of OFC manipulations on instrumental devaluation have been reported when prior learning of the operanda has occurred, perhaps due to the increased complexity of the setting that may introduce Pavlovian information regarding hidden states [70].
If representing the task structure underlying model-based behavior is the core function of the OFC, then it predicts that OFC should be necessary for economic choice when the underlying value depends on a model of the task for its construction. In well trained subjects, it is unclear when this is true; however one way to force this would be to devalue one of the goods on offer prior to the critical testing. As in the Pavlovian devaluation task described above, this manipulation would force the use of a model to determine the value underlying normal behavior. We have recently done exactly this experiment, inactivating the lateral OFC in rats during economic choice when one of the outcomes was pre-fed prior to test (Figure 3c). In this setting, we found that inactivation again had no effect on the established choice, while at the same time it disrupted the normal effects of pre-feeding on the choice [53,71] (Figure 3d).
Of course, deficits in devaluation tasks after OFC manipulations, while consistent with a wider role, do not necessarily require one; instead they may simply reflect a more limited role in value-based decision-making [3]. To argue that behavioral deficits caused by OFC dysfunction actually reveal a failure in the general use of model-based information, outside the realm of value signaling per se, requires evidence of impairments in behavior that is orthogonal or even independent of value. Though this at first seems far-fetched, it turns out that such evidence is actually relatively abundant. For instance, OFC is required for outcome-specific Pavlovian-to-instrumental transfer [58,68], outcome-identity unblocking [72], facilitated learning caused by differential outcomes [73], and for proper sensory learning in the first phase of sensory preconditioning [59]. In each of these tasks, the critical role of the OFC is difficult to explain as signaling of scalar value, while being easy to explain as related to representing task structure. This is particularly true in a recent study showing that optogenetically inactivating OFC during the initial cue-cue learning during sensory preconditioning abolishes value inference in the later the probe test [59]. The initial cue-cue learning occurs in the absence of any overt value or reward; there is no behavioral response. However learning of the valueless sensory associations is critical to the later inference; thus these results demonstrate that the OFC is required for building associative models, even in the absence of any reward. Together with the electrophysiological recording data on the sensory preconditioning task [49] showing that OFC keeps track of these sensory-sensory associations even before rewards are introduced, these results provide clear evidence that OFC is critical for learning task structure independent of value.
Why would this be? One key aspect of the sensory preconditioning task is that it involves the acquisition of new information about the world – the cue pairs have never been encountered before. This is also true in economic choice, when subjects first encounter new pairs of goods; even if they have previously encountered the two options separately, relating them and accurately determining subjective preferences when options are compared for the first time requires new mapping. Interestingly, we have recently found that the OFC is necessary for economic choice under these conditions. That is, while inactivation of OFC has no effect on economic choice in our hands once the rats have experienced a pair of goods together, if we inactivate the lateral OFC in the very first session with a new pair, then rats show impairments at establishing their initial subjective preference [74] (Figure 3e, f). This is true even if the rats have experienced the pellets many times before as part of other pairs. This finding suggests that the OFC is necessary for establishing the relationships between pellets in the “goods space”, rather than for using the goods space to read out the economic value.
Conclusions
There is still a strong belief in the field that the core function of the OFC is to represent subjective or economic value, and that this value signal is the main output from OFC used by other brain areas to form behavioral plans and instruct action selection. Although this idea seems to be supported by an abundance of correlative studies, we argue against the proposal that value signaling is the core function of the OFC based on the poor definition of value, the widespread finding of similar value correlates in most of the brain, as well as causal data showing that the OFC is often not necessary for value-based choice. Conversely, OFC is often required for behaviors for which value is either not critical or is entirely absent during periods when OFC is required. Overall, these data are more consistent with the proposal that the primary function of OFC is to represent task structure, from which value is partly derived.
Acknowledgements
This work was supported by a grant from the NIDA (K99DA049888 to J.Z.) and the Intramural Research Program at NIDA (ZIA-DA000587 to G.S.).
Footnotes
Conflict of interest statement
The authors declare no competing interests.
We have no conflicts of interest.
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References and recommended reading
- 1.Rangel A, Camerer C, Montague PR: A framework for studying the neurobiology of value-based decision making. Nat Rev Neurosci 2008, 9:545–556. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Wallis JD: Cross-species studies of orbitofrontal cortex and value-based decision-making. Nat Neurosci 2011, 15:13–19. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Padoa-Schioppa C, Conen KE: Orbitofrontal cortex: a neural circuit for economic decisions. Neuron 2017, 96:736–754. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Stalnaker TA, Cooch NK, Schoenbaum G: What the orbitofrontal cortex does not do. Nat Neurosci 2015, 18:620–627. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Hayden B, Niv Y: The case against economic values in the brain. PsyArXiv 2020.* A perspective paper that argues against the representation of economic value in the brain.
- 6.Padoa-Schioppa C, Assad JA: Neurons in the orbitofrontal cortex encode economic value. Nature 2006, 441:223–226. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Thorpe SJ, Rolls ET, Maddison S: The orbitofrontal cortex: neuronal activity in the behaving monkey. Exp Brain Res 1983, 49:93–115. [DOI] [PubMed] [Google Scholar]
- 8.Critchley HD, Rolls ET: Olfactory neuronal responses in the primate orbitofrontal cortex: analysis in an olfactory discrimination task. J Neurophysiol 1996, 75:1659–1672. [DOI] [PubMed] [Google Scholar]
- 9.Schoenbaum G, Eichenbaum H: Information coding in the rodent prefrontal cortex. I. Single-neuron activity in orbitofrontal cortex compared with that in pyriform cortex. J Neurophysiol 1995, 74:733–750. [DOI] [PubMed] [Google Scholar]
- 10.Tremblay L, Schultz W: Relative reward preference in primate orbitofrontal cortex. Nature 1999, 398:704–708. [DOI] [PubMed] [Google Scholar]
- 11.Plassmann H, O’Doherty J, Rangel A: Orbitofrontal cortex encodes willingness to pay in everyday economic transactions. J Neurosci 2007, 27:9984–9988. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Levy DJ, Glimcher PW: Comparing apples and oranges: using reward-specific and reward-general subjective value representation in the brain. J Neurosci 2011, 31:14693–14707. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Kuwabara M, Kang N, Holy TE, Padoa-Schioppa C: Neural mechanisms of economic choices in mice. Elife 2020, 9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Nogueira R, Abolafia JM, Drugowitsch J, Balaguer-Ballester E, Sanchez-Vives MV, Moreno-Bote R: Lateral orbitofrontal cortex anticipates choices and integrates prior with current information. Nat Commun 2017, 8:14823. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Banerjee A, Parente G, Teutsch J, Lewis C, Voigt FF, Helmchen F: Value-guided remapping of sensory cortex by lateral orbitofrontal cortex. Nature 2020, 585:245–250. [DOI] [PubMed] [Google Scholar]
- 16.Wang PY, Boboila C, Chin M, Higashi-Howard A, Shamash P, Wu Z, Stein NP, Abbott LF, Axel R: Transient and persistent representations of odor Value in prefrontal cortex. Neuron 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Rich EL, Wallis JD: Decoding subjective decisions from orbitofrontal cortex. Nat Neurosci 2016, 19:973–980. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Saez RA, Saez A, Paton JJ, Lau B, Salzman CD: Distinct roles for the amygdala and orbitofrontal cortex in representing the relative amount of expected reward. Neuron 2017, 95:70–77 e73. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Yan C, Su L, Wang Y, Xu T, Yin DZ, Fan MX, Deng CP, Hu Y, Wang ZX, Cheung EF, et al. : Multivariate neural representations of value during reward anticipation and consummation in the human orbitofrontal cortex. Sci Rep 2016, 6:29079. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Klein S, Kruse O, Markert C, Leon IT, Strahler J, Stark R: Subjective reward value of visual sexual stimuli is coded in human striatum and orbitofrontal cortex. Behav Brain Res 2020, 393. [DOI] [PubMed] [Google Scholar]
- 21.Xie J, Padoa-Schioppa C: Neuronal remapping and circuit persistence in economic decisions. Nat Neurosci 2016, 19:855–861. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Rustichini A, Conen KE, Cai X, Padoa-Schioppa C: Optimal coding and neuronal adaptation in economic decisions. Nat Commun 2017, 8:1208. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.McGinty VB, Rangel A, Newsome WT: Orbitofrontal cortex value signals depend on fixation location during free viewing. Neuron 2016, 90:1299–1311. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Xie Y, Nie C, Yang T: Covert shift of attention modulates the value encoding in the orbitofrontal cortex. Elife 2018, 7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Enel P, Wallis JD, Rich EL: Stable and dynamic representations of value in the prefrontal cortex. Elife 2020, 9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Hirokawa J, Vaughan A, Masset P, Ott T, Kepecs A: Frontal cortex neuron types categorically encode single decision variables. Nature 2019, 576:446–451. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Knudsen EB, Wallis JD: Closed-loop theta stimulation in the orbitofrontal cortex prevents reward-based learning. Neuron 2020, 106:537–547 e534. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Wagner MJ, Kim TH, Savall J, Schnitzer MJ, Luo L: Cerebellar granule cells encode the expectation of reward. Nature 2017, 544:96–100. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Stuber GD, Wise RA: Lateral hypothalamic circuits for feeding and reward. Nat Neurosci 2016, 19:198–205. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Namboodiri VMK, Otis JM, van Heeswijk K, Voets ES, Alghorazi RA, Rodriguez-Romaguera J, Mihalas S, Stuber GD: Single-cell activity tracking reveals that orbitofrontal neurons acquire and maintain a long-term memory to guide behavioral adaptation. Nat Neurosci 2019, 22:1110-+. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Wikenheiser AM, Marrero-Garcia Y, Schoenbaum G: Suppression of ventral hippocampal output impairs integrated orbitofrontal encoding of task structure. Neuron 2017, 95:1197–1207 e1193. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Yoo SBM, Sleezer BJ, Hayden BY: Robust encoding of spatial information in orbitofrontal cortex and striatum. J Cogn Neurosci 2018, 30:898–913. [DOI] [PubMed] [Google Scholar]
- 33.Yang L, Masmanidis SC: Differential encoding of action selection by orbitofrontal and striatal population dynamics. J Neurophysiol 2020, 124:634–644. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Yun M, Kawai T, Nejime M, Yamada H, Matsumoto M: Signal dynamics of midbrain dopamine neurons during economic decision-making in monkeys. Sci Adv 2020, 6. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Masset P, Ott T, Lak A, Hirokawa J, Kepecs A: Behavior- and modality-general representation of confidence in orbitofrontal cortex. Cell 2020, 182:112–126 e118. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Sleezer BJ, Castagno MD, Hayden BY: Rule encoding in orbitofrontal cortex and striatum guides selection. J Neurosci 2016, 36:11223–11237. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Wilson RC, Takahashi YK, Schoenbaum G, Niv Y: Orbitofrontal cortex as a cognitive map of task space. Neuron 2014, 81:267–279. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Niv Y: Learning task-state representations. Nat Neurosci 2019, 22:1544–1553.**A summary of recent research into the computational and neural bases of how task states are learned and represented.
- 39.Elliott Wimmer G, Buchel C: Learning of distant state predictions by the orbitofrontal cortex in humans. Nat Commun 2019, 10:2554. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Schuck NW, Cai MB, Wilson RC, Niv Y: Human orbitofrontal cortex represents a cognitive map of state space. Neuron 2016, 91:1402–1412.** Using fMRI imaging, this study provides evidence that human orbitofrontal cortex represents task-revevant information that defines task states during a decision-making task.
- 41.Vaidya AR, Jones HM, Castillo J, Badre D: Neural representation of abstract task structure during generalization. bioRxiv 2020:213009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Mack ML, Preston AR, Love BC: Ventromedial prefrontal cortex compression during concept learning. Nat Commun 2020, 11:46. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Baram AB, Muller TH, Nili H, Garvert M, Behrens TEJ: Entorhinal and ventromedial prefrontal cortices abstract and generalise the structure of reinforcement learning problems. bioRxiv 2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Howard JD, Kahnt T: Identity-specific reward representations in orbitofrontal cortex are modulated by selective devaluation. J Neurosci 2017, 37:2627–2638. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Wang MZ, Hayden BY: Reactivation of associative structure specific outcome responses during prospective evaluation in reward-based choices. Nat Commun 2017, 8:15821. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Zhou J, Zong W, Jia C, Gardner MPH, Schoenbaum G: Prospective representations in rat orbitofrontal ensembles. bioRxiv 2020:268391. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 47.Zhou J, Montesinos-Cartagena M, Wikenheiser AM, Gardner MPH, Niv Y, Schoenbaum G: Complementary task structure representations in hippocampus and orbitofrontal cortex during an odor sequence task. Curr Biol 2019, 29:3402–3409 e3403. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 48.Zhou J, Gardner MPH, Stalnaker TA, Ramus SJ, Wikenheiser AM, Niv Y, Schoenbaum G: Rat orbitofrontal ensemble activity contains multiplexed but dissociable representations of value and task structure in an odor sequence task. Curr Biol 2019, 29:897–907 e893.** In this study, we recorded single units from the OFC of rats performing an odor sequence task, in which 24 task states were created. Through ensemble analyses, we found that the value can be dissociated from task state representation in the OFC.
- 49.Sadacca BF, Wied HM, Lopatina N, Saini GK, Nemirovsky D, Schoenbaum G: Orbitofrontal neurons signal sensory associations underlying model-based inference in a sensory preconditioning task. Elife 2018, 7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Shobe JL, Bakhurin KI, Claar LD, Masmanidis SC: Selective modulation of orbitofrontal network activity during negative occasion Setting. J Neurosci 2017, 37:9415–9423. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Stalnaker TA, Berg B, Aujla N, Schoenbaum G: Cholinergic interneurons use orbitofrontal input to track beliefs about current state. J Neurosci 2016, 36:6242–6257. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Wang F, Schoenbaum G, Kahnt T: Interactions between human orbitofrontal cortex and hippocampus support model-based inference. PLoS Biology 2020, 18:e3000578. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Gardner MPH, Conroy JS, Shaham MH, Styer CV, Schoenbaum G: Lateral orbitofrontal inactivation dissociates devaluation-sensitive behavior and economic choice. Neuron 2017, 96:1192–1203 e1194.** In this study, the rats were trained with a rodent version of the economic task. Optogenetic inactivating lateral OFC neurons did not affect rats’ choice behavior in this task but disrupted normal behavior in an outcome devaluation task.
- 54.Gardner MP, Conroy JC, Styer CV, Huynh T, Whitaker LR, Schoenbaum G: Medial orbitofrontal inactivation does not affect economic choice. Elife 2018, 7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Constantinople CM, Piet AT, Bibawi P, Akrami A, Kopec C, Brody CD: Lateral orbitofrontal cortex promotes trial-by-trial learning of risky, but not spatial, biases. Elife 2019, 8. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Ballesta S, Shi W, Conen KE, Padoa-Schioppa C: Values encoded in orbitofrontal cortex are causally related to economic choices. Nature 2020.** This study perturbated the monkey OFC neuronal activity by using electric stimulation while the subjects were performing an economic choice task. Low-current stimulation caused biased choice behavior while high-current stimulation increased choice variability.
- 57.Bradfield LA, Hart G, Balleine BW: Inferring action-dependent outcome representations depends on anterior but not posterior medial orbitofrontal cortex. Neurobiol Learn Mem 2018, 155:463–473. [DOI] [PubMed] [Google Scholar]
- 58.Panayi MC, Killcross S: Functional heterogeneity within the rodent lateral orbitofrontal cortex dissociates outcome devaluation and reversal learning deficits. Elife 2018, 7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Hart EE, Sharpe MJ, Gardner MP, Schoenbaum G: Responding to preconditioned cues is devaluation sensitive and requires orbitofrontal cortex during cue-cue learning. Elife 2020, 9.** Using a preconditioning task, this study provides causal evidence that OFC is critical for cue-cue associations even without apparent reward.
- 60.Reber J, Feinstein JS, O’Doherty JP, Liljeholm M, Adolphs R, Tranel D: Selective impairment of goal-directed decision-making following lesions to the human ventromedial prefrontal cortex. Brain 2017, 140:1743–1756. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Howard JD, Reynolds R, Smith DE, Voss JL, Schoenbaum G, Kahnt T: Targeted stimulation of human orbitofrontal networks disrupts outcome-guided behavior. Curr Biol 2020, 30:490–498 e494.** This study shows evidence that the human OFC is required for outcome-guided behavior by using a reinforcer devaluation task.
- 62.Gallagher M, McMahan RW, Schoenbaum G: Orbitofrontal cortex and representation of incentive value in associative learning. J Neurosci 1999, 19:6610–6614. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.West EA, DesJardin JT, Gale K, Malkova L: Transient inactivation of orbitofrontal cortex blocks reinforcer devaluation in macaques. J Neurosci 2011, 31:15128–15135. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 64.Izquierdo A, Suda RK, Murray EA: Bilateral orbital prefrontal cortex lesions in rhesus monkeys disrupt choices guided by both reward value and reward contingency. J Neurosci 2004, 24:7540–7548. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Takahashi YK, Chang CY, Lucantonio F, Haney RZ, Berg BA, Yau HJ, Bonci A, Schoenbaum G: Neural estimates of imagined outcomes in the orbitofrontal cortex drive behavior and learning. Neuron 2013, 80:507–518. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Jones JL, Esber GR, McDannald MA, Gruber AJ, Hernandez A, Mirenzi A, Schoenbaum G: Orbitofrontal cortex supports behavior and learning using inferred but not cached values. Science 2012, 338:953–956. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 67.Wang F, Howard JD, Voss JL, Schoenbaum G, Kahnt T: Targeted Stimulation of an Orbitofrontal Network Disrupts Decisions Based on Inferred, Not Experienced Outcomes. J Neurosci 2020, 40:8726–8733. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Ostlund SB, Balleine BW: Orbitofrontal cortex mediates outcome encoding in Pavlovian but not instrumental conditioning. J Neurosci 2007, 27:4819–4825. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 69.Lichtenberg NT, Pennington ZT, Holley SM, Greenfield VY, Cepeda C, Levine MS, Wassum KM: Basolateral Amygdala to Orbitofrontal Cortex Projections Enable Cue-Triggered Reward Expectations. J Neurosci 2017, 37:8374–8384. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 70.Parkes SL, Ravassard PM, Cerpa JC, Wolff M, Ferreira G, Coutureau E: Insular and ventrolateral orbitofrontal cortices differentially contribute to goal-directed behavior in rodents. Cereb Cortex 2018, 28:2313–2325. [DOI] [PubMed] [Google Scholar]
- 71.Gardner MPH, Conroy JC, Sanchez DC, Zhou J, Schoenbaum G: Real-Time Value Integration during Economic Choice Is Regulated by Orbitofrontal Cortex. Curr Biol 2019, 29:4315–4322 e4314. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 72.McDannald MA, Lucantonio F, Burke KA, Niv Y, Schoenbaum G: Ventral striatum and orbitofrontal cortex are both required for model-based, but not model-free, reinforcement learning. J Neurosci 2011, 31:2700–2705. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 73.McDannald MA, Saddoris MP, Gallagher M, Holland PC: Lesions of orbitofrontal cortex impair rats’ differential outcome expectancy learning but not conditioned stimulus-potentiated feeding. J Neurosci 2005, 25:4626–4632. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 74.Gardner MPH, Sanchez D, Conroy JC, Wikenheiser AM, Zhou J, Schoenbaum G: Processing in lateral orbitofrontal cortex Is required to estimate subjective preference during initial, but not established, economic choice. Neuron 2020, 108:526–537.** By introducing novel offers and novel pairs in the rodent economic choice task, this study shows that optogenetic inhibiting the OFC slowed down the speed of rats to initially establish or modify a “good space” that rats use to make comparisons between possible outcomes.