Abstract
Empirical decision-making in diverse species deviates from the predictions of normative choice theory, but why such suboptimal behavior occurs is unknown. Here, we propose that deviations from optimality arise from biological decision mechanisms that have evolved to maximize choice performance within intrinsic biophysical constraints. Sensory processing utilizes specific computations such as divisive normalization to maximize information coding in constrained neural circuits, and recent evidence suggests that analogous computations operate in decision-related brain areas. These adaptive computations implement a relative value code that may explain the characteristic context-dependent nature of behavioral violations of classical normative theory. Examining decision-making at the computational level thus provides a crucial link between the architecture of biological decision circuits and the form of empirical choice behavior.
Introduction
Normative choice theories are the foundation of many modern approaches to decision-making, describing how the ideal or optimal chooser should make choices. In economics and psychology, rational choice models assume that choosers act to maximize a subjective measure of satisfaction termed expected utility [1]. In ecology, optimal foraging theory similarly assumes that organisms act to maximize an internal currency ultimately related to reproductive fitness [2]. Despite the rigorous mathematical framework and intuitive appeal of standard choice theories, empirical choice behavior violates the predictions of these optimality models in a wide range of species [3-9]. In particular, biological choosers demonstrate context-dependent preferences, where decisions depend on additional (often irrelevant) information beyond the values of the given alternatives. Here, we review how recent work on the neural representation of value information offers a biological rationale for these apparent violations of rationality. Consideration of such computational principles suggests that choice behavior reflects a utility optimization process operating under intrinsic biological constraints.
Context-dependent violations of rationality
A key principle of nearly all optimal decision-making theories is that preferences rely on a stable, independent valuation of each choice alternative. Given the assumption that individual alternatives are evaluated independently, decisions should be unaffected by uninformative contextual factors such as the previous history or the structure of the choice set. However, many of the documented behavioral deviations from optimality suggest that value is determined in a relative rather than absolute manner, with a characteristic dependence on choice context (Figure 1).
The most prominent violations of rationality involve modulation by factors present in the choice set at the time of decision (which we term spatial context, drawing an analogy between the choice set in decision studies and visual space in sensory studies). Under spatial context-dependence, the relative preference between two given alternatives changes with the quantity or quality (attributes) of other alternatives. A number of spatial context effects have been described in human choosers, including the attraction [4], similarity [8], and compromise [7] effects, each requiring a relationship between attributes defined in a multi-dimensional space. In the attraction effect, for example, the introduction of a third decoy alternative that is similar, but inferior in quality, to one of the two original alternatives can selectively shift relative preference towards the closest original alternative. Interestingly, analogous phenomena exist in diverse and evolutionarily distant species including insects, birds, and monkeys, suggesting context-dependent choice behavior may be intrinsic to biological decision-making mechanisms [3,5,6].
Context-dependent preferences also vary with the past history of choices and outcomes, or temporal context. While less studied in the literature, both human and animal choice can exhibit a dependence on previously experienced alternatives [7,10]. For example, in [7], a choice between consumer items (e.g. car tires) differing in two attributes (e.g. warranty and price) depends on the distribution of attributes seen in a previous choice. Temporal context-dependence has been identified in various decision behaviors observed in the field, including speed dating, commuting behavior, and housing choice [11-13]. Most spatial and temporal context effects involve multi-attribute choice between alternatives differing in multiple dimensions, with the pattern of context-dependence depending on the specific relationship of the alternatives in attribute space. However, alternatives varying in a single dimension or varying in integrated value can also drive context-dependence, arguing that context-dependence is a general feature of the biological decision process [5,14,15].
Both heuristic and mathematical models have been proposed that can reproduce empirically observed spatial and temporal context-dependent preferences [7-9,16,17]. However, why empirical choosers exhibit such patterns of suboptimal behavior remains unknown. In this review, we focus on the underlying neural systems that implement value representation and decision-making. An understanding of information processing in neural systems, and the constraints under which they operate, can provide a crucial link between neurobiological substrate and resulting choice behavior [5,18,19].
The neural representation of value
A critical step in understanding context-dependent choice behavior is identifying how neural circuits represent value-related information. While many brain areas are modulated by rewards, electrophysiological and neuroimaging studies have identified specific neural circuits that represent the subjective values of choice alternatives (see [19,20] for recent reviews). Value coding is prominent in a network of areas linked to action selection, including sensorimotor circuits in prefrontal cortex, posterior parietal cortex, and the dorsal striatum [21-25]. In these areas, the activity of action-specific neurons varies with action value, a representation believed to be critical for selecting between competing motor actions. In addition, value also modulates neural activity in brain areas without an explicit role in action selection. For example, despite exhibiting little motor selectivity, neurons in the monkey orbitofrontal cortex represent the value of stimuli and choice alternatives [26,27]. Similarly, ventromedial prefrontal cortex and ventral striatum display a consistent valuation signal generalizing over different types of reward in human subjects [20,28]. These brain areas may function as a global valuation system, determining and assigning alternative values in a given choice scenario prior to the decision process.
Contextual modulation in value coding
Context-dependent choice behavior implies that value is constructed in a comparative manner, relative to the spatial or temporal background. Guided by these behavioral effects, recent experiments have begun to examine how contextual factors influence neural activity in identified value coding circuits (Figure 2).
The effect of spatial context on value coding has been primarily examined in sensorimotor regions of frontal and parietal cortex underlying action selection and execution. In these areas, single neurons represent specific actions, allowing observation of alternative-specific neural activity. For example, in the monkey lateral intraparietal area (LIP), individual neurons fire selectively to both visual stimulus presentation in, and saccadic eye movements towards, specific subregions of visual space (the neural response field, or RF). Across a variety of paradigms, sensorimotor neural activity in LIP correlates closely with the integrated decision information necessary for choice. In perceptual discrimination tasks, individual LIP neuron activity closely matches the accumulated evidence for the saccade toward the RF [29-31]. In economic manipulations, LIP activity consistently represents the subjective value of individual saccades, regardless of whether value is determined by reward magnitude and probability, delay discounting, foraging requirements, or strategic valuation in a competitive game [23,25,32,33].
Rather than an absolute representation of individual action values, however, new studies suggest that LIP and other similar regions employ a relative form of valuation [22,34-36]. In a recent study, monkeys were presented varying saccade target arrays, with different rewards associated with specific target alternatives [34]. Despite a constant reward associated with a saccade to the neural RF, LIP activity is nevertheless suppressed by the value of extra-RF target alternatives (Figure 2A). Coupled with excitation in response to RF alternative value, this suppression by spatial context value instantiates a relative reward representation. Notably, this form of value modulation mirrors extensively studied spatial contextual phenomena in sensory processing [37], pointing towards potential unifying neural mechanisms we discuss below in Neural computation and biological constraints.
Similar to behavioral context-dependent phenomena, neural value coding is also sensitive to the recent history of rewards. While thus far unexamined in parietal brain areas, temporal contextual modulation has been observed in other neural circuits in reward-guided decision-making. For example, reward-related responses are prominent in the monkey orbitofrontal cortex (OFC), a prefrontal region with extensive connections to higher-order sensory cortices and limbic system areas. Neurons in OFC encode a number of decision-related variables, including the subjective value of specific alternative classes (e.g. type of fruit juice) and the value of the chosen alternatives [27].
Instead of a fixed relationship between value and neural activity, OFC value coding adapts to the local statistics of recently experienced rewards [38-40]. For example, in monkeys experiencing different ranges of reward magnitudes, narrow reward ranges produce steep firing rate functions of value while wide ranges elicit shallow ones ([40]; Figure 2B). Similar adaptive value coding has been observed in other monkey brain regions including the anterior cingulate cortex [38] and midbrain dopaminergic nuclei [41], as well as in reward-related circuits in the human brain [42]. Such adaptive changes in neural response functions are widespread in sensory processing, and may be crucial for neurons with limited dynamic ranges facing changing contexts in a non-stationary world [43].
Neural computation and biological constraints
The widespread prevalence of both spatial and temporal forms of contextual modulation suggests a conservation of function across different circuits and systems [37]. In particular, contextual processing is thought to be critical for the maximization of information coded in spiking activity. Neural systems face a number of constraints, imposed by factors such as the energetic cost of spiking activity, irreducible noise, and maximum firing rates. To explain how neural systems can maximize information coding in the face of such constraints, Barlow's efficient coding hypothesis proposed that sensory systems exploit widespread statistical regularities in the distribution of the sensory environment [44]. Under efficient coding, regularity-induced redundancies in the incoming information stream are removed by sensory systems, increasing the independence of neural responses to different stimuli (thus maximizing information and increasing efficiency). Contextual modulation may play a key role in the efficiency of neural responses to such spatiotemporal regularities [45-48].
A critical question then is how neural circuits implement the relative information coding evident in contextual modulation. At the neural level, spatial and temporal context effects are evident as a form of gain control, modifying the input-output function that relates value information to firing rates. We focus here on divisive normalization, a prominent form of nonlinear gain control widely observed in early sensory systems [49] and recently characterized in higher order processes such as attention, multisensory integration, and decision-making [34,50,51].
The normalization algorithm computes a ratio between a given neuron's response and the summed activity of a larger pool of neurons [49,52,53]. Consider a pool of n neurons with each neuron i having its own driving input di, with the vector d denoting a set of inputs to the neural pool (e.g. luminance at different points in visual space, or the values of alternatives in a choice set). A general form of normalization can be expressed by the function:
where the response of a given neuron i depends on both the direct driving input di to that neuron and the summed driving input from the entire pool (typically including neuron i). The parameters rmax and β govern the maximal response level and the effect of spontaneous activity, respectively, while the exponent α represents the exponentiation of inputs. The semisaturation term σ determines the how quickly the function approaches saturation with increasing input, governing the range of inputs over which the response function is most sensitive.
In the normalization algorithm, division by the pooled activity of other neurons drives spatial contextual modulation. For example, normalization explains phenomena such as cross-orientation suppression and surround suppression in visual cortex [54,55], where the neural response to a stimulus is decreased when superimposed with other stimuli that are ineffective in driving a response on their own. In a recent study, normalization also characterizes how LIP neurons represent relative value in different choice arrays [34]. In particular, LIP value coding under different target and reward conditions were closely predicted by a simplified normalization model:
In this formulation, the value of an action towards the neural response field (vi) is normalized by a term summing over the values of all available actions (v), incorporating reward characteristics of the current choice set into the neural representation of value. Thus the firing rate Ri representing the value of option i is a function of all alternative values v rather than vi alone. Notably, divisive normalization outperformed a number of alternative models in explaining the observed electrophysiology data, including models based on value difference or simple fractional value [34]. In the framework of efficient coding, spatial normalization implements a relative representation that preserves discriminability between alternative values while obeying constraints on neural activity (Figure 3A).
Normalization can also incorporate temporal contextual modulation via a semisaturation function σ(), which can introduce a divisive term that depends on the recent history of input. For example, a purely temporal form of context-dependent value coding can be written as:
where the firing rate representing the value of alternative i at time t is a function of both current and past values (extending back to some t0 , where t0 can go to −∞). In this form of temporal normalization, the effect of past value is mediated through the function σ().
In order to adapt to local statistics in a dynamic environment, σ() typically is structured to emphasize recent over distant past values (e.g. an exponentially decaying weighted average of past rewards). Intuitively, σ() can be viewed as an expectation of future inputs, based on samples from past history; because this term governs horizontal shifts of the response function, normalization aligns the widest range of spiking output to the most likely range of input (Figure 3B). In sensory processing, history-dependent normalization explains how neural responses adapt to changing stimulus properties such as luminance and contrast [49,53,56].
The divisive normalization algorithm has yet to be explicitly tested in cases of adaptive value coding, where reward-related activity varies with the range or variance of recent rewards [38-40]. These data suggest that value coding can adapt to higher-order moments of the recent reward distribution beyond the mean. The normalization algorithm can adapt to distributional statistics given the proper choice of the parameters, but the precise role of normalization in adaptive value coding remains a key target of future research.
Both value context effects described above can be combined in a single spatiotemporal normalization:
but no clear empirical evidence yet exists for such an integrated contextual value representation. In the monkey, orbitofrontal value coding shows strong temporal contextual modulation [39,40], but appears to be unaffected by aspects of the current choice set [57]. On the other hand, prefrontal [22] and parietal [34-36] value coding shows strong spatial contextual modulation, but the effect of temporal context has not been examined. The potential integration of spatial and temporal context effects in both neural processing and behavior thus remains an important open question.
Linking computation and choice behavior
Despite extensive documentation in the ecology and psychology literature, little is known about the neural mechanisms underlying context-dependent preferences. In sensory processing, both spatial and temporal forms of contextual modulation are linked to corresponding perceptual phenomena. Recent evidence suggests a similar linkage may exist between adaptive value coding computations and context-dependent decision-making.
A recent study quantified spatial context effects predicted by the normalization model and compared them to empirical choice behavior in monkey and human subjects [5]. Under value normalization, the overall activity signaling the subjective value of a choice alternative is normalized by a term including the summed value of available alternatives. Because this normalization is simultaneously applied to all alternatives, the composition of the choice set should not alter the relative ranking of alternatives by average firing rate. However, because division scales the difference between firing rates, normalization predicts that the discriminability between alternatives will vary with the choice set. Specifically, the relative preference between two alternatives should be diminished by increases in the value of a third alternative, even if it is never chosen. In value-guided choice experiments, both monkey and human choosers showed this novel form of context-dependent behavior predicted by the value normalization model.
While normalization provides a clear link between neural and behavioral spatial context effects, additional mechanisms can influence the decision process. In a choice task involving a presented, but subsequently unavailable, third alternative, human subjects show diminishing choice performance as context value decreases – an opposite effect from that predicted by value normalization [58]. These differing results suggest that specific aspects of the behavioral task are likely to play an important role in determining the influence of context. In [58], the value of alternatives were determined by subjects from visually-cued attributes (reward magnitude, probability), raising the possibility of additional interactions in the value integration process prior to choice ([59]; see Attributes and the valuation process). Given the brief initial availability of the third alternative, the timing of biophysical processes may also play a crucial role; examination of dynamical models suggest that normalization is a time-varying computation with a stronger influence late in the decision process [60-62].
The temporal context effects observed in electrophysiological experiments on value coding have not yet been directly linked to behavior, but the relationship between the internal representation of decision information and choice has been shown in perceptual decision-making tasks. In a recent study, human subjects viewed a rapid stream of sinusoidal gratings of varying orientations and judged whether the average orientation was closer to the cardinal or diagonal axis [14]. Analogous to economic decisions based on value, observed choices depended on the integrated decision variable (i.e. weighted sum of orientations). When the authors examined the impact of individual samples on choice, they found two effects: higher weights were applied to later samples (a recency bias) and to samples consistent with the average decision information at the time of viewing. Critically, these effects are explained by an adaptive gain process that adjusts the transfer function relating decision information to behavior; moreover, such adjustments aligns the maximal sensitivity of the function with the distribution of samples seen in the recent past. While not modeled explicitly with normalization, this work indicates that adaptive tuning of internal representations of decision information may be a critical feature of temporal context effects.
Attributes and the valuation process
A critical issue for future research concerns how overall decision values are constructed from separate alternative attributes and how contextual modulation affects this process. The electrophysiological and behavioral evidence discussed above focuses on the integrated value of choice alternatives, but the primary examples of context-dependence in the decision literature involve multi-attribute choice, where alternatives differ in multiple dimensions [3,4,6,8-10,63]. However, given the existence of adaptive gain control and contextual modulation at the level of integrated value, similar mechanisms may govern attribute processing prior to integration. Recent computational work suggests that a range normalization process over attributes can reproduce many of the documented context-dependent phenomena [18]. Furthermore, neuroimaging data in human subjects performing a multi-attribute reward-guided decision task are consistent with a hierarchical competition process involving normalization at the attribute level [59]. Because a standard divisive normalization algorithm operating at the attribute level cannot generate the full range of empirically observed phenomena, classic multi-attribute effects likely involve additional neural processes. Thus, at its most complex, choice behavior may involve different adaptive coding mechanisms operating at multiple stages of decision-making.
Conclusion
Understanding how the brain represents behaviorally relevant variables is a key step in linking behavior to the underlying neural mechanisms. Classic approaches to decision-making rely on a characterization of choice behavior, but recent interest has turned to identifying the neural basis of valuation and choice. Emerging evidence for a context-dependent neural representation of value, and its relevance for context-dependent violations of rationality, underscores the role of information processing in neural circuits. Fundamental aspects of information processing – such as the use of canonical gain control algorithms like normalization – occur in both sensory and decision-related systems, suggesting common principles of neural computation. In particular, context-dependent value coding may reflect an adaptive response to the intrinsic constraints of computing with biological circuitry.
Highlights.
Empirical decision-making often violates normative choice theory
Suboptimal choice can arise from contextual value coding in neural activity
Adaptive computations can maximize information coding in constrained neural circuits
Biological choice reflects a tradeoff between performance and the cost of computation
Acknowledgements
PWG is supported by the National Institute on Drug Abuse through grant R01-DA038063.
Footnotes
Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final citable form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.
References
- 1.Von Neumann J, Morgenstern O. Theory of Games and Economic Behavior. Princeton University Press; Princeton, NJ: 1944. [Google Scholar]
- 2.Stephens DW, Krebs JR. Foraging Theory. Princeton University Press; Princeton, NJ: 1986. [Google Scholar]
- 3.Bateson M. Context-dependent foraging choices in risk-sensitive starlings. Animal Behaviour. 2002;64:251–260. [Google Scholar]
- 4.Huber J, Payne JW, Puto C. Adding asymmetrically dominated alternatives: Violations of regularity and the similarity hypothesis. J Consum Res. 1982;9:90–98. [Google Scholar]
- 5.Louie K, Khaw MW, Glimcher PW. Normalization is a general neural mechanism for context-dependent decision making. Proc Natl Acad Sci U S A. 2013;110:6139–6144. doi: 10.1073/pnas.1217854110. [This paper shows that a normalization-based computational model predicts a novel pattern of context-dependent choice behavior, dependent on the summed value of integrated alternative values. The authors demonstrate that these normalization model predictions are evident in the behavior of both monkey and human choosers making value-guided decisions.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Shafir S, Waite TA, Smith BH. Context-dependent violations of rational choice in honeybees (Apis mellifera) and gray jays (Perisoreus canadensis). Behav Ecol Sociobiol. 2002;51:180–187. [Google Scholar]
- 7.Simonson I, Tversky A. Choice in context - tradeoff contrast and extremeness aversion. Journal of Marketing Research. 1992;29:281–295. [Google Scholar]
- 8.Tversky A. Elimination by aspects - a theory of choice. Psychol Rev. 1972;79:281. [Google Scholar]
- 9.Tversky A, Simonson I. Context-dependent preferences. Management Science. 1993;39:1179–1189. [Google Scholar]
- 10.Waite TA. Background context and decision making in hoarding gray gays. Behavioral Ecology. 2001;12:318–324. [Google Scholar]
- 11.Bhargava S, Fisman R. Contrast effects in sequential decisions: evidence from speed dating. Review of Economics and Statistics. 2014;96:444–457. [Google Scholar]
- 12.Simonsohn U. New Yorkers commute more everywhere: Contrast effects in the field. Review of Economics and Statistics. 2006;88:1–9. [Google Scholar]
- 13.Simonsohn U, Loewenstein G. Mistake #37: The effect of previously encountered prices on current housing demand. Economic Journal. 2006;116:175–199. [Google Scholar]
- 14.Cheadle S, Wyart V, Tsetsos K, Myers N, de Gardelle V, Herce Castanon S, Summerfield C. Adaptive gain control during human perceptual choice. Neuron. 2014;81:1429–1441. doi: 10.1016/j.neuron.2014.01.020. [This paper demonstrates adaptive information processing in human subjects performing a perceptual decision task, which required choice based on an integrated decision variable (average stimulus orientation) rather than simple sensory information. The authors find that both recent and consistent sensory evidence are more strongly weighted, consistent with an adaptive change in the internal function encoding decision-relevant information. Of special interest (*).] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Morgan KV, Hurly TA, Bateson M, Asher L, Healy SD. Context-dependent decisions among options varying in a single dimension. Behav Processes. 2012;89:115–120. doi: 10.1016/j.beproc.2011.08.017. [DOI] [PubMed] [Google Scholar]
- 16.Usher M, McClelland JL. Loss aversion and inhibition in dynamical models of multialternative choice. Psychol Rev. 2004;111:757–769. doi: 10.1037/0033-295X.111.3.757. [DOI] [PubMed] [Google Scholar]
- 17.Roe RM, Busemeyer JR, Townsend JT. Multialternative decision field theory: a dynamic connectionist model of decision making. Psychol Rev. 2001;108:370–392. doi: 10.1037/0033-295x.108.2.370. [DOI] [PubMed] [Google Scholar]
- 18.Soltani A, De Martino B, Camerer C. A range-normalization model of context-dependent choice: a new model and evidence. PLoS Comput Biol. 2012;8:e1002607. doi: 10.1371/journal.pcbi.1002607. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Louie K, Glimcher PW. Efficient coding and the neural representation of value. Ann N Y Acad Sci. 2012;1251:13–32. doi: 10.1111/j.1749-6632.2012.06496.x. [DOI] [PubMed] [Google Scholar]
- 20.Bartra O, McGuire JT, Kable JW. The valuation system: a coordinate-based meta-analysis of BOLD fMRI experiments examining neural correlates of subjective value. Neuroimage. 2013;76:412–427. doi: 10.1016/j.neuroimage.2013.02.063. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Ding L, Hikosaka O. Comparison of reward modulation in the frontal eye field and caudate of the macaque. J Neurosci. 2006;26:6695–6703. doi: 10.1523/JNEUROSCI.0836-06.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Pastor-Bernier A, Cisek P. Neural correlates of biased competition in premotor cortex. J Neurosci. 2011;31:7083–7088. doi: 10.1523/JNEUROSCI.5681-10.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Platt ML, Glimcher PW. Neural correlates of decision variables in parietal cortex. Nature. 1999;400:233–238. doi: 10.1038/22268. [DOI] [PubMed] [Google Scholar]
- 24.Samejima K, Ueda Y, Doya K, Kimura M. Representation of action-specific reward values in the striatum. Science. 2005;310:1337–1340. doi: 10.1126/science.1115270. [DOI] [PubMed] [Google Scholar]
- 25.Sugrue LP, Corrado GS, Newsome WT. Matching behavior and the representation of value in the parietal cortex. Science. 2004;304:1782–1787. doi: 10.1126/science.1094765. [DOI] [PubMed] [Google Scholar]
- 26.Morrison SE, Salzman CD. The convergence of information about rewarding and aversive stimuli in single neurons. J Neurosci. 2009;29:11471–11483. doi: 10.1523/JNEUROSCI.1815-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Padoa-Schioppa C, Assad JA. Neurons in the orbitofrontal cortex encode economic value. Nature. 2006;441:223–226. doi: 10.1038/nature04676. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Levy DJ, Glimcher PW. The root of all value: a neural common currency for choice. Curr Opin Neurobiol. 2012;22:1027–1038. doi: 10.1016/j.conb.2012.06.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Roitman JD, Shadlen MN. Response of neurons in the lateral intraparietal area during a combined visual discrimination reaction time task. Journal of Neuroscience. 2002;22:9475–9489. doi: 10.1523/JNEUROSCI.22-21-09475.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Shadlen MN, Newsome WT. Neural basis of a perceptual decision in the parietal cortex (area LIP) of the rhesus monkey. J Neurophysiol. 2001;86:1916–1936. doi: 10.1152/jn.2001.86.4.1916. [DOI] [PubMed] [Google Scholar]
- 31.Yang T, Shadlen MN. Probabilistic reasoning by neurons. Nature. 2007;447:1075–1080. doi: 10.1038/nature05852. [DOI] [PubMed] [Google Scholar]
- 32.Louie K, Glimcher PW. Separating value from choice: delay discounting activity in the lateral intraparietal area. J Neurosci. 2010;30:5498–5507. doi: 10.1523/JNEUROSCI.5742-09.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Seo H, Barraclough DJ, Lee D. Lateral intraparietal cortex and reinforcement learning during a mixed- strategy game. J Neurosci. 2009;29:7278–7289. doi: 10.1523/JNEUROSCI.1479-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Louie K, Grattan LE, Glimcher PW. Reward value-based gain control: divisive normalization in parietal cortex. J Neurosci. 2011;31:10627–10639. doi: 10.1523/JNEUROSCI.1237-11.2011. [This paper shows that saccade-selective neurons in monkey parietal cortex represent action value information in a relative manner, dependent on the summed values of available actions. The authors demonstrate that this relative value code is best characterized by a divisive normalization computation; this finding is the first demonstration of normalization beyond strictly sensory brain regions, emphasizing the idea this computation may be a canonical feature of information processing in neural circuits.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Rorie AE, Gao J, McClelland JL, Newsome WT. Integration of sensory and reward information during perceptual decision-making in lateral intraparietal cortex (LIP) of the macaque monkey. PLoS One. 2010;5:e9308. doi: 10.1371/journal.pone.0009308. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Falkner AL, Krishna BS, Goldberg ME. Surround suppression sharpens the priority map in the lateral intraparietal area. J Neurosci. 2010;30:12787–12797. doi: 10.1523/JNEUROSCI.2327-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Schwartz O, Hsu A, Dayan P. Space and time in visual context. Nat Rev Neurosci. 2007;8:522–535. doi: 10.1038/nrn2155. [DOI] [PubMed] [Google Scholar]
- 38.Cai X, Padoa-Schioppa C. Neuronal encoding of subjective value in dorsal and ventral anterior cingulate cortex. J Neurosci. 2012;32:3791–3808. doi: 10.1523/JNEUROSCI.3864-11.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Kobayashi S, Pinto de Carvalho O, Schultz W. Adaptation of reward sensitivity in orbitofrontal neurons. J Neurosci. 2010;30:534–544. doi: 10.1523/JNEUROSCI.4009-09.2010. [This paper (along with [35]) demonstrates a temporal context effect in monkey orbitofrontal value coding, using reward distributions with constant means but differing standard deviations. The authors also find evidence for adaptation to distributional structure in behavioral measures such as error rates and saccadic reaction time, emphasizing the behavioral relevance of neural adaptation of neural reward sensitivity.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 40.Padoa-Schioppa C. Range-adapting representation of economic value in the orbitofrontal cortex. J Neurosci. 2009;29:14004–14014. doi: 10.1523/JNEUROSCI.3751-09.2009. [This paper (along with [34]) shows that value coding in monkey orbitofrontal cortex depends on the temporal history of rewards. The author shows that a simple range normalzation model, where the dynamic range of neural activity is scaled to the range of recent rewards, explains this temporal context effect on value coding.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41.Tobler PN, Fiorillo CD, Schultz W. Adaptive coding of reward value by dopamine neurons. Science. 2005;307:1642–1645. doi: 10.1126/science.1105370. [DOI] [PubMed] [Google Scholar]
- 42.Cox KM, Kable JW. BOLD subjective value signals exhibit robust range adaptation. J Neurosci. 2014;34:16533–16543. doi: 10.1523/JNEUROSCI.3927-14.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 43.Wark B, Lundstrom BN, Fairhall A. Sensory adaptation. Curr Opin Neurobiol. 2007;17:423–429. doi: 10.1016/j.conb.2007.07.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Barlow HB. Possible principles underlying the transformation of sensory messages. In: Rosenblith WA, editor. Sensory Communication. MIT Press; 1961. [Google Scholar]
- 45.Brenner N, Bialek W, de Ruyter van Steveninck R. Adaptive rescaling maximizes information transmission. Neuron. 2000;26:695–702. doi: 10.1016/s0896-6273(00)81205-2. [DOI] [PubMed] [Google Scholar]
- 46.Fairhall AL, Lewen GD, Bialek W, de Ruyter Van Steveninck RR. Efficiency and ambiguity in an adaptive neural code. Nature. 2001;412:787–792. doi: 10.1038/35090500. [DOI] [PubMed] [Google Scholar]
- 47.Simoncelli EP, Olshausen BA. Natural image statistics and neural representation. Annu Rev Neurosci. 2001;24:1193–1216. doi: 10.1146/annurev.neuro.24.1.1193. [DOI] [PubMed] [Google Scholar]
- 48.Vinje WE, Gallant JL. Natural stimulation of the nonclassical receptive field increases information transmission efficiency in V1. J Neurosci. 2002;22:2904–2915. doi: 10.1523/JNEUROSCI.22-07-02904.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Carandini M, Heeger DJ. Normalization as a canonical neural computation. Nat Rev Neurosci. 2012;13:51–62. doi: 10.1038/nrn3136. [This comprehensive review paper summarizes the extensive empirical evidence for divisive normalization in neural processing. Based on the ubiquity of this algorithm across sensory modalities, brain regions, and species, the authors suggest that normalization serves as a canonical neural computation.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Ohshiro T, Angelaki DE, DeAngelis GC. A normalization model of multisensory integration. Nat Neurosci. 2011;14:775–782. doi: 10.1038/nn.2815. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Reynolds JH, Heeger DJ. The normalization model of attention. Neuron. 2009;61:168–185. doi: 10.1016/j.neuron.2009.01.002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Albrecht DG, Geisler WS. Motion selectivity and the contrast-response function of simple cells in the visual cortex. Vis Neurosci. 1991;7:531–546. doi: 10.1017/s0952523800010336. [DOI] [PubMed] [Google Scholar]
- 53.Heeger DJ. Normalization of cell responses in cat striate cortex. Vis Neurosci. 1992;9:181–197. doi: 10.1017/s0952523800009640. [DOI] [PubMed] [Google Scholar]
- 54.Carandini M, Heeger DJ, Movshon JA. Linearity and normalization in simple cells of the macaque primary visual cortex. J Neurosci. 1997;17:8621–8644. doi: 10.1523/JNEUROSCI.17-21-08621.1997. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 55.Cavanaugh JR, Bair W, Movshon JA. Nature and interaction of signals from the receptive field center and surround in macaque V1 neurons. J Neurophysiol. 2002;88:2530–2546. doi: 10.1152/jn.00692.2001. [DOI] [PubMed] [Google Scholar]
- 56.Sinz F, Bethge M. Temporal adaptation enhances efficient contrast gain control on natural images. PLoS Comput Biol. 2013;9:e1002889. doi: 10.1371/journal.pcbi.1002889. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Padoa-Schioppa C, Assad JA. The representation of economic value in the orbitofrontal cortex is invariant for changes of menu. Nat Neurosci. 2008;11:95–102. doi: 10.1038/nn2020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Chau BK, Kolling N, Hunt LT, Walton ME, Rushworth MF. A neural mechanism underlying failure of optimal choice with multiple alternatives. Nat Neurosci. 2014;17:463–470. doi: 10.1038/nn.3649. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59.Hunt LT, Dolan RJ, Behrens TE. Hierarchical competitions subserving multi-attribute choice. Nat Neurosci. 2014;17:1613–1622. doi: 10.1038/nn.3836. [This paper examines the behavioral neural correlates of multi-attribute choice in human subjects. The authors find that choice behavior and neuroimaging data are consistent with a hierarchy of information processing including competition at the level of both attributes and integrated values. These findings suggest that adaptive computations likely extend to multiple stages of the valuation and decision process.] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60.Mikaelian S, Simoncelli EP. Modeling temporal response characteristics of V1 neurons with a dynamic normalization model. Neurocomputing. 2001;38-40:1461–1467. [Google Scholar]
- 61.LoFaro T, Louie K, Webb R, Glimcher P. The temporal dynamics of cortical normalization models of decision-making. Lett Biomath. 2014;1:209–220. [Google Scholar]
- 62.Louie K, LoFaro T, Webb R, Glimcher PW. Dynamic divisive normalization predicts time-varying value coding in decision-related circuits. J Neurosci. 2014;34:16046–16057. doi: 10.1523/JNEUROSCI.2851-14.2014. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 63.Trueblood JS, Brown SD, Heathcote A, Busemeyer JR. Not just for consumers: context effects are fundamental to decision making. Psychol Sci. 2013;24:901–908. doi: 10.1177/0956797612464241. [DOI] [PubMed] [Google Scholar]