Skip to main content
F1000Research logoLink to F1000Research
. 2020 Feb 28;9:F1000 Faculty Rev-158. [Version 1] doi: 10.12688/f1000research.21946.1

Under construction: ventral and lateral frontal lobe contributions to value-based decision-making and learning

Avinash R Vaidya 1,a, Lesley K Fellows 2,b
PMCID: PMC7050269  PMID: 32161644

Abstract

Even apparently simple choices, like selecting a dessert in a pastry shop, involve options characterized by multiple motivationally relevant attributes. Neuroeconomic research suggests that the human brain may track the subjective value of such options, allowing disparate reward-predictive information to be compared in a common currency. However, the brain mechanisms involved in identifying value-predictive features and combining these to assess the value of each decision option remain unclear. Here, we review recent evidence from studies of multi-attribute decision-making in people with focal frontal lobe damage and in healthy people undergoing functional magnetic resonance imaging. This work suggests that ventromedial and lateral prefrontal cortex and orbitofrontal cortex are important for forming value judgments under conditions of complexity. We discuss studies supporting the involvement of these regions in selecting among and evaluating option attributes during value judgment and decision-making and when learning from reward feedback. These findings are consistent with roles for these regions in guiding value construction. They argue for a more nuanced understanding of how ventral and lateral prefrontal cortex contribute to discovering and recognizing value, processes that are required under the complex conditions typical of many everyday decisions.

Keywords: Neuroeconomics, judgment, decision-making, reward, learning, brain, executive function, neuroimaging, neuropsychology

Introduction

While we easily solve the problem of choosing between cakes lined up in a baker’s display case, it is not clear how our brains make these open-ended choices between complex stimuli. One solution is to find a “value” for each option by integrating various motivationally relevant aspects of each cake into an overall estimate of its subjective rewardingness. These values then can be used to compare all options on a single scale (that is, in a common currency). In the lab, such values typically are quantified in dollars in bidding tasks or as Likert scale ratings. This relatively parsimonious model has been very successful in explaining behavior in many different settings. It has even been applied in algorithms that can learn to perform complex tasks like Atari video games that require maximizing an objective reward (points) 1. Certain regions of the frontal lobes—notably the ventromedial prefrontal cortex (vmPFC) and orbitofrontal cortex (OFC)—respond to the values of many different kinds of stimuli ( Figure 1). Such findings have led to the proposal that a common currency value scale might not only be useful for describing behavior but also be implemented as a neural code during decision-making 2, 3.

Figure 1. Schematic view of major divisions of the frontal lobes.

Figure 1.

PFC, prefrontal cortex.

Though elegant, this solution skips over how a buttercream-frosted dessert is converted to a discrete value. Recent work has begun to address the brain processes involved in arriving at subjective value estimates. These studies have examined how the brain notices value-predictive features and learns about and constructs values for the multi-feature objects that are the typical targets of our everyday choices. This line of investigation suggests that values of complex options are not so much carefully calculated as cobbled together or, under some conditions, inferred holistically from a rich slew of perceptual, mnemonic, and hedonic evidence. In this brief review, we will highlight recent functional magnetic resonance imaging (fMRI) and focal lesion studies (in humans) addressing the formation of values for multi-attribute options, whether based on subjective assessment or external goals or learned from reward feedback.

Reverse-engineering subjective value construction

Much as the history of the Roman Empire can be deciphered from the disparate sources of the limestone and marble that make up its ruins, so too “value” can be deconstructed. Breaking complex options down into their component, value-predictive parts can provide insight into how the brain represents information about the individual elements that contribute to value and how (and where) that information is brought together. This reverse-engineering approach has been applied to diverse decision options: photographs of politicians, familiar and novel foods, artwork, and pseudo-objects.

Functional imaging has found that information about the features or identity of stimuli such as foods, odors, and trinkets is encoded in OFC but that the vmPFC represents a more general value code 46. This literature has broadly indicated that vmPFC encodes values independent of stimulus category 2, 7. In contrast, OFC value signals appear to be bound more to the lower-level features or attributes of options 8, 9. Together, these data are consistent with the notion that vmPFC is involved in integrating information from diverse sources to construct an overall representation of option values.

If vmPFC is critical to developing a common currency value representation from multiple attribute-values, damage to this region should disrupt the ability to combine the values of individual attributes. We observed that patients with damage to the ventromedial frontal lobe (VMF) (subsuming the vmPFC and OFC) made reliable value estimates of complex stimuli but to arrive at those estimates drew on information that was different from that of healthy and frontal lobe–damaged controls. For example, those with VMF damage under-weighted “competence” but relied on “attractiveness” to a similar degree when making decisions about which political candidates to vote for 10. Likewise, in a separate study, participants with VMF damage systematically under-weighted some aspects of artwork, such as emotionality, but relied to a similar degree on simpler, visually perceptible attributes such as concreteness and visual balance in comparison with control groups 11.

These observations suggest a more nuanced perspective on the role of vmPFC in value judgment. Why do VMF-damaged patients neglect some attribute-values while reliably drawing on others? One possibility is that these attributes depend on latent information built up from conjunctions of simpler, more directly observable features. For example, determining the emotionality of artwork could depend on putting together information about the expressions of the characters with the circumstances in the scene. Both the hippocampus and medial frontal cortex have been implicated in inferring values for novel conjunctions of food ingredients 12, implying a role in such a constructive process. We recently found evidence that VMF damage specifically disrupts the ability to choose between multi-attribute pseudo-objects (“fribbles” 13) when their values were predicted by the conjunction of two attributes. In contrast, such patients were able to make correct choices when values could be “summed up” on the basis of the values of individual attributes 14.

In many cases, value may be built up from information that is not directly observable but rather recalled from memory, such as sensory experiences with food items. Several findings from the fMRI literature are consistent with this perspective, demonstrating that reactivation of prior reward associations can influence future decision-making, even in independent tasks 1517. In such cases, decision-making might involve prospective simulation of outcomes that depends on the hippocampus 1820. Indeed, lesions to the medial temporal lobe and hippocampus were recently shown to lead to slower, more stochastic choices between foods, consistent with a noisier decision process 21. Notably, we found that VMF damage did not affect choice stochasticity or the reliability of reported preferences in a similar choice task using artwork 22. One possible explanation of the lack of effect of VMF damage in this task, compared with food choices, is that the value of art is assessed on directly observable (that is, visual) information and does not require prospection or construction of values from information in memory.

The effects of VMF damage on preference-based choice may depend less on the importance of this region for forming values per se than on its role in organizing preferences on the basis of recalled outcomes from past choices. Although VMF damage has been found to consistently increase transitivity errors during preference-based choice in several studies (that is, failing to choose A over C when A is preferred over B and B over C) 2325, this region is not necessary for the formation of coherent, transitive preferences (Yu et al. 26, 2018, preprint). Damage to the hippocampus or VMF was recently shown to affect transitive inference of paired associates in humans, implying a shared role in inferring relations between options on the basis of episodic memory more generally 27. Thus, the hippocampus and vmPFC may work together to stabilize decision-making based on an inferred transitive ordering of options.

Building values toward a goal

Above, we focused on how value judgments may be constructed in relatively unconstrained settings from mnemonic, conceptual, and perceptual evidence. In this section, we discuss how contextual demands act to shape this process. More often than not, values must be adaptively tailored in response to current goals. For example, the values of potential options in a store might change if you were choosing something for yourself or for your grandmother. Identifying relevant attributes on the basis of such goals and selectively using these during value construction reduce the complexity of the decision space to make such choices tractable. Recent imaging work has uncovered putative mechanisms for selecting attributes on the basis of task demands and examined how neural value signals adapt in response to changing contexts.

Value signals in vmPFC are sensitive to the weighting of stimulus attributes on the basis of current goals. The attributes that correlate with vmPFC activity change depending on external contextual demands, such as instruction to focus on taste or health attributes of foods 28, 29 or to choose options with either positive or negative subjective value 30. The flexibility of this signal in response to arbitrary task instructions argues that this region can be tuned to the values of options as they relate to higher-order goals rather than their hedonic value independent of task.

Functional connectivity studies have shed light on how value representations may be altered by shifting demands. This work has shown that interactions between vmPFC and several other regions depend on demands for integrating information from different stimuli during valuation and decision-making. Lim et al. 31 (2013) showed that connectivity between vmPFC and the fusiform and posterior superior temporal gyri reflected the contributions of perceptual and semantic stimulus attributes to value judgments, respectively. Changes in task goals during value judgment appear to engage lateral PFC and increase connectivity between lateral PFC and vmPFC 29, 32. Thus, this circuit seems to be involved in shifting attention to relevant attributes for value judgment, as externally defined within a task.

A 2018 study by Tusche and Hutcherson 33 took a slightly different approach. They tested whether external demands to focus on specific attributes of social or dietary decisions altered the decodability of these attributes from brain activity measured by using fMRI (that is, asking whether it is possible to predict attributes of the current stimulus from the pattern of activity within a region). While the decodability of these attributes varied with task demands in lateral PFC, all stimulus attributes were decodable within an overlapping area of vmPFC across conditions. This analysis suggests that vmPFC has access to a rich representation of stimulus attributes even as task demands shift, possibly enabling a more dynamic and adaptive representation of stimulus values. In contrast, lateral PFC is more narrowly involved in specifying goal-relevant attributes.

Learning what matters

The first section of this review focused on how values are built from attributes in unconstrained settings and the second discussed how weights on these attributes are modified depending on goals. A third (related) stream of work in the realm of reinforcement learning addresses how the value-relevance of attributes is discovered through experience.

The large space of potentially value-relevant features in the natural environment poses a challenge to making appropriate inferences about the relationships between stimuli and outcomes in the real world 34. In many cases, relationships may be inferred from conspicuous but incidental short-term correlations that have no bearing on the underlying causal structure of these associations. Think of the problem gambler who has come to believe that a rattling sound at the base of a slot machine predicts a jackpot rather than a loose screw. Indeed, low-level features, such as brightness, volume, or contrast, can give certain attributes greater assumed relevance when inferences are drawn about such relationships. In conditioning tasks, perceptually salient cues are more easily trained and can “overshadow” other predictors 35. Visual saliency biases decision-making independent of expected value 22, 36, 37, indicating that these low-level features can exert a powerful influence on value construction.

Directing attention to stimulus features that are predictive of rewards can effectively bootstrap the learning process by narrowing the space of features to consider in constructing option values 38, 39. Recent work has indicated that VMF is important for guiding attention to reward-predictive features on the basis of outcome history. We found that people with VMF damage had less of a bias to attend to a feature that was incidentally paired with reward in a visual search task, compared with healthy and frontal-damaged controls 40. Functional imaging has also implicated vmPFC in boosting credit for reward-predictive cues, similarly implying a role in boosting the gain on learning feature–reward associations based on their predictive history 41.

Examining how participants’ choices track the outcome history of attributes within a given dimension can reveal what participants have inferred about which aspects of the task predict reward. Recent neuroimaging studies have used multidimensional learning tasks to identify the regions engaged in learning attribute relevance. These studies have found that regions within the frontoparietal attention network, including lateral PFC, are engaged when identifying predictive, relevant dimensions based on feedback history 42. In an analogous task, vmPFC was found to encode the attentionally weighted average of the reward value of option features while switches of attention between dimensions were associated with increased activity within the frontoparietal attention network 43. In a similar multidimensional learning task, we tested how frontal lobe damage affected learning when participants were informed (accurately) that one low-salience stimulus dimension was predictive of reward but that two other dimensions were not. VMF-damaged participants showed worse learning about the rewardingness of the relevant dimension compared with healthy controls but did not show any increased tendency to credit rewarding outcomes to the irrelevant dimensions. In contrast, left lateral PFC damage caused an increase in the misattribution of outcomes to two (more salient) stimulus dimensions 44.

Just as faulty inferences might be drawn about the associations of outcomes with visual stimulus features, so too can outcomes be attributed to events that were temporally proximate but not causally related 45. This “spread-of-effect” phenomenon was shown to be exaggerated in non-human primates with OFC lesions 46, evident in a blurring of the influence of recent choices and outcomes. More recently, convergent evidence from studies of human and non-human primates with frontal lesions has indicated that damage to anterior ventrolateral PFC or lateral OFC particularly increases this failure to assign credit to the appropriate choices 47, 48. Functional imaging in humans similarly has pointed to lateral OFC or ventrolateral PFC as being particularly important for credit assignment when choice–outcome associations must be maintained over a delay or intervening trials 49. These data suggest that lateral PFC or OFC may play a similar role in specifying the features relevant to predicting outcomes in the temporal domain and the visual domain.

Conclusions

Recent human lesion and functional imaging work has identified frontal lobe regions and related circuits that are involved in dynamically constructing the values of complex stimuli in preference-based choice, whether based on current goals or in the course of learning. The lesion results help to constrain interpretation of fMRI in healthy people, arguing for a more nuanced and specific role of VMF and lateral PFC in these processes. VMF damage leads to under-weighting specific higher-order stimulus attributes during value judgment and disrupts attention to relevant features that are predictive of rewards in the long-term during learning. In contrast, lateral PFC is involved in arbitrating between stimulus dimensions, in particular when task demands and contextual factors specify a subset of attributes relevant to a choice. Table 1 summarizes the main findings supporting this view.

Table 1. Overview of findings supporting the main conclusions of the article.

Frontal lobe regions Judgment and decision-making Reinforcement learning
Ventromedial frontal lobe
(ventromedial prefrontal cortex and
orbitofrontal cortex)
     •    Representing value-relevant
attributes 4, 6, 33
     •    Forming values from attribute
conjunctions 12, 14
     •    Representation of goal-congruent
values 2830
     •    Relevance weighted value
representation 43, 50
     •    Directing attention to predictive
attributes 40, 41, 44
Lateral frontal lobe (ventral and dorsal
lateral prefrontal cortex)
     •    Selection of goal-relevant attributes 28, 29, 32      •    Selection of predictive dimensions 4244
     •    Credit assignment 47, 48

These findings open up new questions. For example, what is the role of VMF in incorporating complex attributes during decision-making and value judgment? One possibility is that this region is necessary for forming values out of non-linear conjunctions between attributes (that is, for interpreting the interactions between individual features that are more than the sum of their parts). Another is that VMF damage disrupts the ability to construct a latent, holistic representation of conceptual information about stimuli, leading these patients to rely instead on attribute-value representations at the level of discrete elements. The latter explanation would be broadly in line with ideas that OFC is representing information about latent task states to inform decision-making and learning 51, 52.

What is the nature of the interaction between lateral PFC and vmPFC during value construction? Communication between these regions appears to depend on decisions to switch or maintain attention on particular stimulus dimensions 32, 43. Whereas some data argue that lateral PFC acts on vmPFC to impose contextual demands on the attributes used in making value judgments 29, other findings support a role for vmPFC in representing “schematic” information about a task that is passed to lateral PFC to define the template for cognitive control 53. The direction of information flow between regions may depend on the observability of contextual cues, where directly observed contextual information may engage lateral PFC 54 whereas hidden changes in task state that must be inferred from schematic knowledge may rely on vmPFC or OFC 55.

We have focused here on the neural implementation of value construction and the particular contributions of regions of the frontal lobes to solving this problem to allow choices between the complex objects that we face in our everyday choices. However, a full account of how the brain builds holistic value representations will require a better understanding of the dynamics between frontal lobe regions and the brain areas that carry the perceptual, conceptual, and mnemonic evidence that are the building blocks of this process. Such work will have broad importance in better understanding economic and social decision behaviors and addressing how those decision processes may go awry in neurological and psychiatric illnesses.

Acknowledgments

We thank Apoorva Bhandari (Department of Cognitive, Linguistic and Psychological Sciences at Brown University) for constructive discussions.

Editorial Note on the Review Process

F1000 Faculty Reviews are commissioned from members of the prestigious F1000 Faculty and are edited as a service to readers. In order to make these reviews as comprehensive and accessible as possible, the referees provide input before publication and only the final, revised version is published. The referees who approved the final version are listed with their names and affiliations but without their reports on earlier versions (any comments will already have been addressed in the published version).

The referees who approved this article are:

  • Markus Ullsperger, Institute of Psychology and Center for Behavioral Brain Sciences, Otto-von-Guericke-Universität Magdeburg, Magdeburg, Germany

  • Thorsten Kahnt, Department of Neurology, Northwestern University Feinberg School of Medicine, Chicago, USA

Funding Statement

The authors acknowledge funding support from the National Institutes of Health (NIMH F32 MH116592-01A1), the Natural Sciences and Engineering Research Council of Canada (RGPIN 2016-06066), and the Canadian Institutes of Health Research (MOP-1192).

The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

[version 1; peer review: 2 approved]

References

  • 1. Mnih V, Kavukcuoglu K, Silver D, et al. : Human-level control through deep reinforcement learning. Nature. 2015;518(7540):529–33. 10.1038/nature14236 [DOI] [PubMed] [Google Scholar]; F1000 Recommendation
  • 2. Chib VS, Rangel A, Shimojo S, et al. : Evidence for a common representation of decision values for dissimilar goods in human ventromedial prefrontal cortex. J Neurosci. 2009;29(39):12315–20. 10.1523/JNEUROSCI.2575-09.2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3. Levy DJ, Glimcher PW: The root of all value: a neural common currency for choice. Curr Opin Neurobiol. 2012;22(6):1027–38. 10.1016/j.conb.2012.06.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4. Suzuki S, Cross L, O'Doherty JP: Elucidating the underlying components of food valuation in the human orbitofrontal cortex. Nat Neurosci. 2017;20(12):1780–6. 10.1038/s41593-017-0008-x [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 5. Howard JD, Gottfried JA, Tobler PN, et al. : Identity-specific coding of future rewards in the human orbitofrontal cortex. Proc Natl Acad Sci U S A. 2015;112(16):5195–200. 10.1073/pnas.1503550112 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6. McNamee D, Rangel A, O'Doherty JP: Category-dependent and category-independent goal-value codes in human ventromedial prefrontal cortex. Nat Neurosci. 2013;16(4):479–85. 10.1038/nn.3337 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7. Bartra O, McGuire JT, Kable JW: The valuation system: a coordinate-based meta-analysis of BOLD fMRI experiments examining neural correlates of subjective value. Neuroimage. 2013;76:412–27. 10.1016/j.neuroimage.2013.02.063 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8. Pegors TK, Kable JW, Chatterjee A, et al. : Common and unique representations in pFC for face and place attractiveness. J Cogn Neurosci. 2015;27(5):959–73. 10.1162/jocn_a_00777 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9. Howard JD, Kahnt T: Identity-Specific Reward Representations in Orbitofrontal Cortex Are Modulated by Selective Devaluation. J Neurosci. 2017;37(10):2627–38. 10.1523/JNEUROSCI.3473-16.2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10. Xia C, Stolle D, Gidengil E, et al. : Lateral orbitofrontal cortex links social impressions to political choices. J Neurosci. 2015;35(22):8507–14. 10.1523/JNEUROSCI.0526-15.2015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11. Vaidya AR, Sefranek M, Fellows LK: Ventromedial Frontal Lobe Damage Alters how Specific Attributes are Weighed in Subjective Valuation. Cerebral Cortex. 2018;28(11):3857–67. 10.1093/cercor/bhx246 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 12. Barron HC, Dolan RJ, Behrens TE: Online evaluation of novel choices by simultaneous representation of multiple memories. Nat Neurosci. 2013;16(10):1492–8. 10.1038/nn.3515 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13. Barry TJ, Griffith JW, de Rossi S, et al. : Meet the Fribbles: novel stimuli for use within behavioural research. Front Psychol. 2014;5:103. 10.3389/fpsyg.2014.00103 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 14. Pelletier G, Fellows LK: A Critical Role for Human Ventromedial Frontal Lobe in Value Comparison of Complex Objects Based on Attribute Configuration. J Neurosci. 2019;39(21):4124–32. 10.1523/JNEUROSCI.2969-18.2019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15. Wimmer GE, Büchel C: Reactivation of Reward-Related Patterns from Single Past Episodes Supports Memory-Based Decision Making. J Neurosci. 2016;36(10):2868–80. 10.1523/JNEUROSCI.3433-15.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16. Wimmer GE, Shohamy D: Preference by association: how memory mechanisms in the hippocampus bias decisions. Science. 2012;338(6104):270–3. 10.1126/science.1223252 [DOI] [PubMed] [Google Scholar]
  • 17. Bornstein AM, Norman KA: Reinstated episodic context guides sampling-based decisions for reward. Nat Neurosci. 2017;20(7):997–1003. 10.1038/nn.4573 [DOI] [PubMed] [Google Scholar]; F1000 Recommendation
  • 18. Weber EU, Johnson EJ: Constructing Preferences from Memory. In: Lichtenstein S, Slovic P, eds. The Construction of Preference.New York, NY: Cambridge University Press;2006. 10.2139/ssrn.1301075 [DOI] [Google Scholar]
  • 19. Shadlen MN, Shohamy D: Decision Making and Sequential Sampling from Memory. Neuron. 2016;90(5):927–39. 10.1016/j.neuron.2016.04.036 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20. Gluth S, Sommer T, Rieskamp J, et al. : Effective Connectivity between Hippocampus and Ventromedial Prefrontal Cortex Controls Preferential Choices from Memory. Neuron. 2015;86(4):1078–90. 10.1016/j.neuron.2015.04.023 [DOI] [PubMed] [Google Scholar]
  • 21. Bakkour A, Palombo DJ, Zylberberg A, et al. : The hippocampus supports deliberation during value-based decisions. eLife. 2019;8: pii: e46080. 10.7554/eLife.46080 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 22. Vaidya AR, Fellows LK: Testing necessary regional frontal contributions to value assessment and fixation-based updating. Nat Commun. 2015;6:10120. 10.1038/ncomms10120 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23. Fellows LK, Farah MJ: The role of ventromedial prefrontal cortex in decision making: judgment under uncertainty or judgment per se? Cerebral Cortex. 2007;17(11):2669–74. 10.1093/cercor/bhl176 [DOI] [PubMed] [Google Scholar]
  • 24. Henri-Bhargava A, Simioni A, Fellows LK: Ventromedial frontal lobe damage disrupts the accuracy, but not the speed, of value-based preference judgments. Neuropsychologia. 2012;50(7):1536–42. 10.1016/j.neuropsychologia.2012.03.006 [DOI] [PubMed] [Google Scholar]
  • 25. Camille N, Griffiths CA, Vo K, et al. : Ventromedial frontal lobe damage disrupts value maximization in humans. J Neurosci. 2011;31(20):7527–32. 10.1523/JNEUROSCI.6527-10.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26. Yu LQ, Dana J, Kable JW: Individuals with ventromedial frontal damage have more unstable but still fundamentally transitive preferences. bioRxiv. 2018; 384024. 10.1101/384024 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 27. Spalding KN, Schlichting ML, Zeithamova D, et al. : Ventromedial Prefrontal Cortex Is Necessary for Normal Associative Inference and Memory Integration. J Neurosci. 2018;38(15):3767–75. 10.1523/JNEUROSCI.2501-17.2018 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 28. Hare TA, Camerer CF, Rangel A: Self-control in decision-making involves modulation of the vmPFC valuation system. Science. 2009;324(5927):646–8. 10.1126/science.1168450 [DOI] [PubMed] [Google Scholar]; F1000 Recommendation
  • 29. Hare TA, Malmaud J, Rangel A: Focusing attention on the health aspects of foods changes value signals in vmPFC and improves dietary choice. J Neurosci. 2011;31(30):11077–87. 10.1523/JNEUROSCI.6383-10.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30. Frömer R, Dean Wolf CK, Shenhav A: Goal congruency dominates reward value in accounting for behavioral and neural correlates of value-based decision-making. Nat Commun. 2019;10(1):4926. 10.1038/s41467-019-12931-x [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 31. Lim SL, O’Doherty JP, Rangel A: Stimulus Value Signals in Ventromedial PFC Reflect the Integration of Attribute Value Signals Computed in Fusiform Gyrus and Posterior Superior Temporal Gyrus. J Neurosci. 2013;33(20):8729–41. 10.1523/JNEUROSCI.4809-12.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32. Rudorf S, Hare TA: Interactions between dorsolateral and ventromedial prefrontal cortex underlie context-dependent stimulus valuation in goal-directed choice. J Neurosci. 2014;34(48):15988–96. 10.1523/JNEUROSCI.3192-14.2014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33. Tusche A, Hutcherson CA: Cognitive regulation alters social and dietary choice by changing attribute representations in domain-general and domain-specific brain circuits. eLife. 2018;7:pii: e31185. 10.7554/eLife.31185 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 34. Sutton RS, Barto AG: Reinforcement Learning: An Introduction. Cambridge, MA, USA: The MIT Press;1998. Reference Source [Google Scholar]
  • 35. Pavlov IP: Conditioned Reflexes. London: Oxford University Press;1927. Reference Source [Google Scholar]
  • 36. Navalpakkam V, Koch C, Rangel A, et al. : Optimal reward harvesting in complex perceptual environments. Proc Natl Acad Sci U S A. 2010;107(11):5232–7. 10.1073/pnas.0911972107 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37. Towal RB, Mormann M, Koch C: Simultaneous modeling of visual saliency and value computation improves predictions of economic choice. Proc Natl Acad Sci U S A. 2013;110(40):E3858–E3867. 10.1073/pnas.1304429110 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38. Pearce JM, Mackintosh NJ: Two theories of attention: a review and a possible integration. In: Mitchell CJ, Le Pelley ME, eds. Attention and Associative Learning: From Brain to Behaviour.Oxford: Oxford University Press;2010. Reference Source [Google Scholar]
  • 39. Wilson RC, Niv Y: Inferring Relevance in a Changing World. Front Hum Neurosci. 2012;5:189. 10.3389/fnhum.2011.00189 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40. Vaidya AR, Fellows LK: Ventromedial Frontal Cortex Is Critical for Guiding Attention to Reward-Predictive Visual Features in Humans. J Neurosci. 2015;35(37):12813–23. 10.1523/JNEUROSCI.1607-15.2015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41. Akaishi R, Kolling N, Brown JW, et al. : Neural Mechanisms of Credit Assignment in a Multicue Environment. J Neurosci. 2016;36(4):1096–112. 10.1523/JNEUROSCI.3159-15.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42. Niv Y, Daniel R, Geana A, et al. : Reinforcement learning in multidimensional environments relies on attention mechanisms. J Neurosci. 2015;35(21):8145–57. 10.1523/JNEUROSCI.2978-14.2015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43. Leong YC, Radulescu A, Daniel R, et al. : Dynamic Interaction between Reinforcement Learning and Attention in Multidimensional Environments. Neuron. 2017;93(2):451–63. 10.1016/j.neuron.2016.12.040 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 44. Vaidya AR, Fellows LK: Necessary Contributions of Human Frontal Lobe Subregions to Reward Learning in a Dynamic, Multidimensional Environment. J Neurosci. 2016;36(38):9843–58. 10.1523/JNEUROSCI.1337-16.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45. Thorndike EL: The "Spread" or "Scatter" of the Influence from a Reward, in Relation to Gestalt Doctrines. Science. 1933;77(1998):368. 10.1126/science.77.1998.368 [DOI] [PubMed] [Google Scholar]
  • 46. Walton ME, Behrens TEJ, Buckley MJ, et al. : Separable learning systems in the macaque brain and the role of orbitofrontal cortex in contingent learning. Neuron. 2010;65(6):927–39. 10.1016/j.neuron.2010.02.027 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 47. Noonan MP, Chau BKH, Rushworth MFS, et al. : Contrasting Effects of Medial and Lateral Orbitofrontal Cortex Lesions on Credit Assignment and Decision-Making in Humans. J Neurosci. 2017;37(29):7023–35. 10.1523/JNEUROSCI.0692-17.2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48. Rudebeck PH, Saunders RC, Lundgren DA, et al. : Specialized Representations of Value in the Orbital and Ventrolateral Prefrontal Cortex: Desirability versus Availability of Outcomes. Neuron. 2017;95(5):1208-1220.e5. 10.1016/j.neuron.2017.07.042 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 49. Jocham G, Brodersen KH, Constantinescu AO, et al. : Reward-Guided Learning with and without Causal Attribution. Neuron. 2016;90(1):177–90. 10.1016/j.neuron.2016.02.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50. Hunt LT, Dolan RJ, Behrens TE: Hierarchical competitions subserving multi-attribute choice. Nat Neurosci. 2014;17(11):1613–22. 10.1038/nn.3836 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51. Wilson RC, Takahashi YK, Schoenbaum G, et al. : Orbitofrontal cortex as a cognitive map of task space. Neuron. 2014;81(2):267–79. 10.1016/j.neuron.2013.11.005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52. Niv Y: Learning task-state representations. Nat Neurosci. 2019;22(10):1544–53. 10.1038/s41593-019-0470-8 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 53. Badre D, Nee DE: Frontal Cortex and the Hierarchical Control of Behavior. Trends Cogn Sci. 2018;22(2):170–88. 10.1016/j.tics.2017.11.005 [DOI] [PMC free article] [PubMed] [Google Scholar]; F1000 Recommendation
  • 54. Badre D, D'Esposito M: Functional magnetic resonance imaging evidence for a hierarchical organization of the prefrontal cortex. J Cogn Neurosci. 2007;19(12):2082–99. 10.1162/jocn.2007.19.12.2082 [DOI] [PubMed] [Google Scholar]
  • 55. Schuck NW, Cai MB, Wilson RC, et al. : Human Orbitofrontal Cortex Represents a Cognitive Map of State Space. Neuron. 2016;91(6):1402–12. 10.1016/j.neuron.2016.08.019 [DOI] [PMC free article] [PubMed] [Google Scholar]

Articles from F1000Research are provided here courtesy of F1000 Research Ltd

RESOURCES