Abstract
In this review, I briefly summarize current neurobiological studies of decision-making that bear on two general themes. The first focuses on the nature of neural representation and dynamics in a decision circuit. Experimental and computational results suggest that ramping-to-threshold in the temporal domain and trajectory of population activity in the state space represent a duality of perspectives on a decision process. Moreover, a decision circuit can display several different dynamical regimes, such as the ramping mode and the jumping mode with distinct defining properties. The second is concerned with the relationship between biologically-based mechanistic models and normative-type models. A fruitful interplay between experiments and these models at different levels of abstraction have enabled investigators to pose increasingly refined questions and gain new insights into the neural basis of decision-making. In particular, recent work on multi-alternative decisions suggests that deviations from rational models of choice behavior can be explained by established neural mechanisms.
Introduction
David Marr, a pioneer in Computational Neuroscience who died at the young age of 35 in 1980, is mostly remembered today for his three-step recipe of brain modeling [49]: first, formulate the problem and identify its normative solution (the way it should be, optimally). Second, search for computational algorithms that accomplish the optimal solution and, third, elucidate implementations of such algorithm(s) in the brain. Yet, it would do Marr injustice to forget his deep roots in neurobiology (and his seminal trilogy of papers on the theories of cerebellum, hippocampus and neocortex). Francis Crick, who interacted extensively with Marr, reminisced in 1991: ‘David’s work clearly falls into two phases: Marr I was concerned with neural circuitry and what it might compute. Marr II (the AI phase) was more functional. The emphasis was on the theory of the process and possible algorithms, with much less attention to realistic implementations. I believe that if he had lived he would have moved to a synthesis of these two approaches.’ [19]. The relationship and interplay between the question of ‘how’ (Marr I) versus the question of ‘why’ (Marr II) of brain functions continues to be a subject of epistemological discussions in Neuroscience today [11].
With the tremendous advances in neuroscience, time is ripe to go back and forth between different levels of Marr’s hierarchy: behavior, computational algorithm and neural circuit mechanism. Recent research on the neural basis of decision-making offers an illustration of this perspective par excellence, as reviewed here.
Interplay between normative theory and neural circuit mechanism
Modern neurobiological studies of decision-making took off around the turn of this century. On perceptual decision-making, pioneer work was done using a random-dots motion (RDM) direction discrimination task. In this task, subjects are trained to make a judgment about the direction of motion (e.g. left or right) in a near-threshold random dot display, and to report the perceived direction with a saccadic eye movement. Neurophysiological studies of behaving monkeys showed stochastic activity of single neurons in the posterior parietal cortex [67, 68, 65, 13] and prefrontal cortex [39] that were correlated with the subject’s judgment. On the other hand, neuroscientists began to examine valuation underlying reward-based choice behavior [57, 63, 71, 3, 24]. Around the same time, in a seemingly unrelated effort, computational neuroscientists were developing increasingly realistic models of neural persistent activity as a brain mechanism for working memory (active short-term memory). Biologically realistic synaptic circuit modeling revealed that a working memory system should not operate as fast switches (between a resting state and memory states); instead, recurrent synaptic excitation underlying self-sustained persistent activity needs to be slow [75]. It was soon recognized that this slow reverberation mechanism is precisely what is needed for decision-making computations, because a deliberate decision requires a temporal accumulation of evidence for or against different choice options (via slow transients), ultimately leading to a categorical choice (through attractor dynamics) [76]. The proposal of a common mechanism for decision-making and working memory is supported by physiological observations that single-neuron activity signals correlated with both processes are found in the same brain regions such as the prefrontal and parietal cortices [30, 77] and by a recent human study showing that disruption (using transcranial magnetic stimulation) of the prefrontal cortex (known to be important for working memory) causally affected decision making [62]. Over the last ten years, recurrent circuit models (RNMs) have been elaborated and extensively applied by computational and cognitive neuroscientists to perceptual decisions [76, 48, 78, 42, 20, 22, 79, 27••, 41•, 23, 77, 81, 9, 53•, 25•], action selection [44] and value-based choice behavior [70, 36•, 5 ••].
The idea of temporal accumulation of evidence through sequential analysis in decision-making has a long history in cognitive psychology. According to the drift diffusion model (DDM) [47,69], a ‘decision variable’ integrates relative evidence for two alternative choices (A and B), a decision is made when the decision variable reaches either a positive (for alternative A) or negative (for alternative B) threshold. DDM has proven to be quite successful with accounting for behavioral data [47, 69], and more recent monkey’s accuracy and reaction time in the RDM task [65, 30]. DDM can be considered as a normative theory, since it is the continuous-time equivalent of the sequential probability-ratio test (SPRT), which is the optimal procedure for making binary choices under uncertainty in the sense that it minimizes the mean decision time among all tests for a given error rate [10, 6].
How does RNM compare with DDM? First, they are at two quite different levels of abstraction. DDM assumes an infinite integration time; whereas RNM strives to identify biological mechanisms for a long but finite integration time. A plausible neural basis of a long integration time is the NMDA receptor dependent recurrent synaptic excitation. This model prediction remains to be tested experimentally. It has also been proposed that neural integrators could be realized with an appropriate network architecture devoid of recurrent connections [31]. Second, the functional benefit of time integration was demonstrated in the model by showing that performance improves when the system is allowed to integrate inputs over a longer time, but eventually plateaus with sufficiently long integration as the system reaches an attractor state representing a categorical choice [76]. This prediction was confirmed in a recent monkey experiment [38]. Third, whereas in DDM evidence shown at different time points has equal weight, RNM asserts that evidence available early on has a larger impact on the ultimate choice than evidence presented later and immediately before a decision is made. This RNM prediction was supported in an experiment where a brief pulse of sensory information was introduced at different time points [35, 79]. However, in more general situations when sensory evidence or attention varies continuously in time, information provided a long time ago may be forgotten, and a commitment may be reversed in the face of newly presented evidence [64]. The biological basis and possible fundamental limitation of integration time in decision-making remain an outstanding subject of future research.
In RNM, categorical formation is realized by attractor dynamics that depend on feedback inhibition. This differs from an implementation of DDM in which feedforward inhibition implements an explicit neural computation of the difference in the sensory inputs [50]. Monkey experiments provided evidence for both feedback inhibition [32] and feedforward inhibition [8]. Note that it is empirically unknown whether accumulation of evidence and formation of a categorical choice take place in a single neural circuit. Alternatively, these two computations may proceed sequentially in distinct brain areas, or simultaneously through cooperative and distributed dynamics across multiple brain areas [17].
Accumulation-to-threshold in time and population dynamics in state space
RNM can be viewed in two different ways. One can plot neural activity of neural pools (each selective for a particular choice) as a function of time (Fig. 1A–B), and a threshold can be introduced for the termination of a decision process (Fig. 1A). On the other hand, alternatively, one can portray the dynamics of a decision circuit in the state space, where the firing rates of neural pools selective for different options are plotted against each other (Fig. 1C). According to this view, different choices are represented by distinct ‘attractor’ states. The mathematical term ‘attractor’ simply means a dynamical system state is stable against small perturbations. An attractor does not have to be a steady state but can be a complex spatiotemporal pattern. Furthermore, a system’s attractor landscape is not rigidly fixed; any relatively sustained input (an external stimulus or top-down cognitive control signal) readily alters the attractor landscape in the state space, as shown in Fig. 1C (middle and right panels).
Figure 1.
Ramping-to-threshold and population dynamics of decision-making. (A) Stochastic ramping activity to a threshold (dashed line) in a RNM (three sample trials are shown). (B) Two neural populations selective for different choices display graded ramping followed by winner-take-all competition, in a simulation of motion direction discrimination task where the task difficulty is quantified by motion coherence c′. (C) The population dynamics of a RNM is displayed in the state space of firing rates rA and rB. without external input (left panel), in the presence of a motion stimulus with a low (middle panel) or high (right panel) coherence. Note that the attractor landscape sensitively depends on the input (middle versus right panel). (D) Population dynamics of ~ 65 cells recorded from the posterior parietal cortex in mice performing a virtual-navigation decision task. Trajectories are choice specific (red: right choice trials, blue: left choice trials). Left panel: Sample trial trajectories in correct trials. Middle and Right panels: individual trial trajectories (grey and black) on erroneous right choice and left choice trials, plotted with the mean trajectories for correct right (red) and left (blue) choice trials. Adapted from [76] for (B), from [77] for (C) and from [33] for (D).
The state space framework puts the emphasis on population dynamics rather than single neurons. It offers natural explanations for observations that cannot be readily explained by the pure DDM: time integration has a limit because eventually the system reaches an attractor state when ramping stops; a brief pulse of evidence has a stronger impact on the ultimate choice early on when the system is still in the vicinity of the boundary between the basins of choice attractor states (Fig. 1C, middle and right panels); reaction times are longer in error trials because a wrong choice means that the system has to pass by a saddle point where the dynamics is dramatically slower (Fig. 1C, middle panel). Finally, according to this population dynamics view, the decision is made as long as the system has gone into the basin of a choice attractor state far enough, even though the neural firing rate may not have yet attained a threshold level. Biologically, the notion of decision threshold should be understood as the firing level of decision neurons that is required for triggering a switch-like response in downstream premotor neurons [42]. Consistent with this perspective is the common observation that, in a visual search task, behavioral reaction time co-varies with the time it takes for the firing rates of competing visually responsive neurons to significantly diverge from each other, rather than to reach a fixed threshold level, in FEF [66, 80, 18, 34], SC [52] and LIP [37, 73, 2].
A recent paper provided remarkable support for the population dynamics perspective. Harvey et al. [33 ••] recorded ~65 (ranging 37–94) single neurons in the parietal cortex (using calcium imaging) from behaving mice. In a virtual-navigation decision task, animals were shown a cue indicating, after a delay period, which of two turns (left or right) was correct choice and yielded a reward on a given trial. It was reported that dynamical trajectory in the recorded population’s state space is well separated for two alternative choices (Fig. 1D, left panel). Moreover, in error trials, the animal’s choice decoded from the population activity often started along the path of the correct choice but later somehow veered to be aligned with the path of the erroneous choice (Fig. 1D, grey traces in middle and right panels). Another line of recent work, using simultaneous electrical recording from multiple single neurons, suggests that the population dynamics framework also applies to the motor system [15, 16].
Ramping mode versus jumping mode
In most experiments a spike train of a single neuron is recorded at a time, which is often too irregular for investigators to ascertain that its underlying firing rate actually climbs smoothly and quasi-linearly over time (the ramping mode). Alternatively, a neuron may display a discrete jump of firing rate at a time that varies from trial to trial, so that the trial-averaged firing rate shows a graded time course (the jumping mode). In one realization of the jumping mode, single neurons are endowed with bistability, for instance thanks to the presence of a calcium-activated inward current [75, 72, 26, 61]. If neurons switch from one state to the other randomly, approximately at a constant rate, then the trial-average of a single neuron (or the population average in a single trial) would exhibit a graded rate change [61, 60 •]. Another scenario does not require single neurons to be bistable, but a recurrent network may switch from a low spontaneous state to a high choice state [21, 23, 28, 43, 53 •]. In other words, the resting state of RNM (in the left panel of Fig. 1C) may be still present in the presence of stimulation (Fig. 1C, middle and right panels). The system remains at that state for sometime until noise eventually induces a transition from it to one of the choice attractor states, which is reflected in the temporal domain as a jump in the firing rate of decision neurons. This noise-induced stochastic event takes place at random times in different trials, resulting in a trial-average that may appear to a graded ramping time course [55, 61, 53 •].
Notably, in the jumping mode, without noise the system would remain in the resting state, therefore fluctuations are required for decision making. A poorly understood issue is the sources of noise or stochasticity in a decision process. Perceptual decisions (identification, discrimination, etc) are often hard because sensory information is noisy, and integration of sensory data over time is computationally desirable because it improves signal-to-noise ratio [47, 30]. However, there is also stochasticity intrinsic to a decision circuit, and the Fano factor (the ratio of the variance versus mean of spike counts) of neural integrators may itself increase over time [54, 14 •]. This is likely to be generally true for neural circuits involved in both perceptual decisions and value-based choices, and stochastic neural dynamics of decision systems may play a critical role in indeterminacy of decision behavior [29, 77].
One possible way to differentiate contrasting models is to analyze not only the mean firing rate but also the time course of the variance of firing rates [14 •]; future theoretical work is needed to quantitatively assess differential dynamics of neural variability for the ramping mode and the jumping mode. Furthermore, new methods to simultaneous recording from many neurons [15, 33 ••] will enable investigators to assess whether multiple cells display any synchronous switch, as predicted by the jumping mode of RNM. On the theory side, how the distribution of jumping times is determined by network properties remains to be clarified.
Neural mechanistic explanation of deviations from rational behavior
Often times what constitutes optimality for a given decision task is unclear. Whereas SPRT is optimal for two-alternative forced choice tasks, optimal tests for three or more options are not known. Several ways have been proposed to generalize DDM to multi-alternatives [51, 58, 13•, 40 •], one of them is shown in Fig. 2A. RNM generalized to multi-alternative decision-making [74, 27 ••, 1] typically assumes competition between neural pools with shared inhibition (Fig. 2B). This model fits well with human’s performance data from a 3-choice RDM direction discrimination experiment [58] (Fig. 1C, left panel). The decision behavior of a three-choice version of the attractor network model can be well described by a softmax function, P(1) = exp(σV1)/(exp(σV1) + exp(σV2) + exp(σV3)), where V1, V2 and V3 are the values or strengths of evidence for the three options, and σ is a parameter that quantifies the amount of stochasticity (Fig. 1C, left and right panels).
Figure 2.
Three-choice decision-making. (A) A proposed generalization of DDM to three-choice. Adapted from [58] with permission. (B) RNM for multiple choice. (C) Left panel: Probability of choosing option 1 as a function of the motion coherence in the direction 1. Filled circles: data from three human subjects of the experiment [58] (filled circles); open circles: smulation results of a 3-choice RNM (B); solid curves: best fit of model simulations with a softmax function. Different colors correspond to different pairs of motion coherence levels for options 2 and 3 (blue: 0 and 0, green: 10/10, yellow: 15/5, red: 20/20, black: 25/15). Right panel: Performance data from RNM simulations are plotted against those predicted by the softmax function. Model simulations were carried out by Nathaniel Smith. These results need to be confirmed in future studies. (D) In a value-based choice task [45 •], 3 options are offered in the order of values (1: best, 2: second best, 3: worst). According to normative decision theory, option 3 should be irrelevant and changing its value should not influence the relative probability of choosing option 1 among the first two options P(1)/(P(1)+P(2)). In contrast to this ideal optimality, in the monkey experiment a higher value for option 3 reduces the relative probability for choosing the best of the two better options, which is inconsistent with the softmax decision criterion. Figure kindly provided by K. Louie and P. Glimcher. (E) Similar finding as in (D) in another monkey experiment, when medial orbitofrontal cortex was lesioned. Adapted from [59 •] with permission.
One prediction of the softmax decision criterion is that the relative probability of choosing one of two options (say 1 and 2), P(1 | 1+2)=P(1)/(P(1)+P(2)), is independent of the strength of the third option V3. This prediction has been shown to be contradicted in economic choice behavior in a surprising way. In an experiment using both monkeys and humans, three choice options were associated with different reward values (1: best, 2: second best, 3: worst) [45 •, K Louie and P Glimcher, personal communication]. The third option has a lower value than both the first and second options, thus is irrelevant and should be ignored. Yet, when the value for the worst option 3 was increased (while remaining lower than those for options 1 and 2), subjects reduced the relative probability for choosing the best of the two better options, contrary to normative models of rational behavior (Fig. 2D). Similar findings were reported in another monkey experiment, but only when the medial orbitofrontal cortex was lesioned (Fig. 2E) [59 •]. These findings highlight the context dependence of neural representation of values [46].
Interestingly, deviations from rational behavior in the monkey experiment of [45 •, K Louie and P Glimcher, personal communication] can be concisely accounted for with the assumption that neural circuit is endowed with divisive normalization, namely the activity of a neuron is divided by the sum of activities of all neurons in the circuit; by virtue of normalization neural firing rates are correlated with the relative rather than absolute action values action values (K Louie and P Glimcher, personal communication). Divisive normalization has been widely observed in a number of cortical circuits [12], and has recently been suggested to play a useful role in decision-making [4, 46, 56]. Therefore, this combined approach using monkey behavior, physiology and model demonstrated how a neural circuit mechanism predicts robust behavioral trends that are not anticipated nor easily explained by optimality-based normative theories.
Further studies will clarify several important issues. First, it remains to be seen why this deviation from rational behavior was found in normal subjects in one experiment, yet only in animals with damaged brain in another experiment. Second, the above reasoning assumes that the parameter σ controlling stochasticity remains the same when the value of the third option is varied. However, σ may not be fixed, because an increased value for options other than the best ones renders the situation less certain, as a result the decision process might become more stochastic (characterized by a larger σ). It would be interesting to design new experimental manipulations to test directly these two contrasting scenarios. Third, although RNM simulations are consistent with behavioral data from a 3-choice perceptual decision experiment (Fig. 2C), additional data are needed to assess whether the third option with the lowest motion coherence does not affect the relative decisions between the other two options, as predicted by the softmax decision rule. New experiments will probe into the possibility that perceptual decisions conform to rational theory of decision-making (at least in well trained animals) whereas value-based choices are more prone to seemingly irrational behavioral effects which nevertheless can be explained by established neural computations such as normalization.
Conclusions
In this short review I focused on two themes of recent neurobiological studies of decision-making. One is the nature of neural decision dynamics. There are two complementary perspectives of a decision process: ramping-to-threshold in the temporal domain and population dynamics in the state space. Moreover, according to RNM, the same model (with modest variations of parameters) can display several different dynamical regimes such as the ramping mode and the jumping mode, suggesting that they could occur in different local circuits of the brain or under different conditions in a single brain area. New experimental approaches such as measurements and analysis of simultaneously recorded single neurons will shed new insights into these issues in the coming years. The second theme is the relationship between normative theories and neural circuit mechanisms. The fruitful interplay between RNM and DDM has shown that research in the field would greatly benefit from cross-talk between theories of models at different levels of abstraction, in close interaction with experiments. Epistemologically, neural circuit mechanisms should be not viewed as mere implementations of normative principles, but can provide a principled explanation of irrational choice effects observed in humans and nonhuman animals, as illustrated by recent findings with 3-choice experiments. Future research that integrates across cognitive, computational, and circuit levels will be especially promising in our quest to understand the neurobiology of decision behavior.
Acknowledgments
I would like to thank Paul Miller, Alireza Soltani, Chung-Chuan Lo, Kong-Fatt Wong, Moran Furman, Alberto Bernacchia, Tatiana Engel, Nathaniel Smith, Mike Shadlen, Jeff Schall, Daeyoel Lee for their contributions to the research described there; Kong-Fatt Wong, Nathaniel Smith, Kenway Louie and Paul Glimcher for their contributions to figures. This work was supported by the NIH grant MH062349 and the Kavli Foundation.
References
- 1.Albantakis L, Deco G. The encoding of alternatives in multiple-choice decision making. Proc Natl Acad Sci USA. 2009;106:10308–10313. doi: 10.1073/pnas.0901621106. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Balan PF, Oristaglio J, Schneider DM, Gottlieb J. Neuronal correlates of the set-size effect in monkey lateral intraparietal area. PLoS Biol. 2008;6:e158. doi: 10.1371/journal.pbio.0060158. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 3.Barraclough DJ, Conroy ML, Lee D. Prefrontal cortex and decision making in a mixed-strategy game. Nat Neuroscience. 2004;7:404–410. doi: 10.1038/nn1209. [DOI] [PubMed] [Google Scholar]
- 4.Beck JM, Latham PE, Pouget A. Marginalization in neural circuits with divisive normalization. J Neurosci. 2011;31:15310–15319. doi: 10.1523/JNEUROSCI.1706-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5••.Bernacchia A, Seo H, Lee D, Wang XJ. A reservoir of time constants for memory traces in cortical neurons. Nat Neurosci. 2011;14:366–372. doi: 10.1038/nn.2752. The authors found that different neurons in three parietal and prefrontal areas display widely disparate time constants of reward memory trace, and the distribution of these time constants exhibits a power-law. Computational modeling was used to probe candidate mechanisms for generating such a broad range of timescales of reward signals and suggests that short or long time constants may be selectively deplored in flexible decision-making according to behavioral demands. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Bogacz R, Brown E, Moehlis J, Holmes P, Cohen JD. The physics of optimal decision making: a formal analysis of models of performance in two-alternative forced-choice tasks. Psychol Rev. 2006;113:700–765. doi: 10.1037/0033-295X.113.4.700. [DOI] [PubMed] [Google Scholar]
- 7.Bogacz R, Usher M, Zhang J, McClelland JL. Extending a biologically inspired model of choice: multi-alternatives, nonlinearity and value-based multidimensional choice. Philos Trans R Soc Lond, B, Biol Sci. 2007;362:1655–1670. doi: 10.1098/rstb.2007.2059. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Bollimunta A, Ditterich J. Local computation of decision-relevant net sensory evidence in parietal cortex. Cereb Cortex. 2012;22:903–917. doi: 10.1093/cercor/bhr165. [DOI] [PubMed] [Google Scholar]
- 9.Braun J, Mattia M. Attractors and noise: twin drivers of decisions and multistability. Neuroimage. 2010;52:740–751. doi: 10.1016/j.neuroimage.2009.12.126. [DOI] [PubMed] [Google Scholar]
- 10.Brown E, Gao J, Holmes P, Bogacz R, Gilzenrat M, Cohen JD. Simple neural networks that optimize decisions. Int J Bifurcation and Chaos. 2005;15:803–826. [Google Scholar]
- 11.Carandini M. From circuits to behavior: a bridge too far? Nat Neurosci. 2012;15:507–509. doi: 10.1038/nn.3043. [DOI] [PubMed] [Google Scholar]
- 12.Carandini M, Heeger DJ. Normalization as a canonical neural computation. Nat Rev Neurosci. 2011;13:51–62. doi: 10.1038/nrn3136. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13•.Churchland A, Kiani R, Shadlen M. Decision-making with multiple alternatives. Nat Neurosci. 2008;11:693–702. doi: 10.1038/nn.2123. This is the first single-neuron recording from monkeys performing a 3-choice perceptual decision task. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14•.Churchland AK, Kiani R, Chaudhuri R, Wang XJ, Pouget A, Shadlen MN. Variance as a signature of neural computations during decision making. Neuron. 2011;69:818–831. doi: 10.1016/j.neuron.2010.12.037. This work tackles the challenge of quantifying variability of neural firing rate that is differentiated from variability inherent in a stochastic point process. The proposed measure called “variance of conditional expectation” was shown to differentiate different classes of models for decision-making. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Churchland MM, Cunningham JP, Kaufman MT, Ryu SI, Shenoy KV. Cortical preparatory activity: representation of movement or first cog in a dynamical machine? Neuron. 2010;68:387–400. doi: 10.1016/j.neuron.2010.09.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Churchland MM, Cunningham JP, Kaufman MT, Foster JD, Nuyujukian P, Ryu SI, Shenoy KV. Neural population dynamics during reaching. Nature. 2012;487:51–56. doi: 10.1038/nature11129. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Cisek P, Kalaska JF. Neural mechanisms for interacting with a world full of action choices (2010) Annu Rev Neurosci. 33:269–298. doi: 10.1146/annurev.neuro.051508.135409. [DOI] [PubMed] [Google Scholar]
- 18.Cohen JY, Crowder EA, Heitz RP, Subraveti CR, Thompson KG, Woodman GF, Schall JD. Cooperation and competition among frontal eye field neurons during visual target selection. J Neurosci. 2010;30:3227–3238. doi: 10.1523/JNEUROSCI.4600-09.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Crick FH. In: From the Retina to the Neocortex: Selected Papers of David Marr. Vaina LM, editor. Boston, MA: Birkhäuser; 1991. pp. 314–315. [Google Scholar]
- 20.Deco G, Rolls E. Decision-making and Weber’s law: a neurophysiological model. Eur J Neurosci. 2006;24:901–916. doi: 10.1111/j.1460-9568.2006.04940.x. [DOI] [PubMed] [Google Scholar]
- 21.Deco G, Pérez-Sanagustín M, de Lafuente V, Romo R. Perceptual detection as a dynamical bistability phenomenon: a neurocomputational correlate of sensation. Proc Natl Acad Sci USA. 2007a;104:20073–20077. doi: 10.1073/pnas.0709794104. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Deco G, Scarano L, Soto-Faraco S. Weber’s law in decision making: integrating behavioral data in humans with a neurophysiological model. J Neurosci. 2007b;27:11192–11200. doi: 10.1523/JNEUROSCI.1072-07.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Deco G, Rolls ET, Romo R. Stochastic dynamics as a principle of brain function. Prog Neurobiol. 2009;88:1–16. doi: 10.1016/j.pneurobio.2009.01.006. [DOI] [PubMed] [Google Scholar]
- 24.Dorris MC, Glimcher PW. Activity in posterior parietal cortex is correlated with the relative subjective desirability of action. Neuron. 2004;44:365–378. doi: 10.1016/j.neuron.2004.09.009. [DOI] [PubMed] [Google Scholar]
- 25•.Engel T, Wang X-J. Same or different? A neural circuit mechanism of similarity based pattern-match decision making. J Neurosci. 2011;31:6982–6996. doi: 10.1523/JNEUROSCI.6150-10.2011. This work proposes a neural circuit model of delayed match-to-sample. At the core of the model is a module of neurons with mixed-selectivity that are segregated into two groups (those that show match enhancement or match suppression, respectively) [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Fransén E, Tahvildari B, Egorov AV, Hasselmo ME, Alonso AA. Mechanism of graded persistent cellular activity of entorhinal cortex layer v neurons. Neuron. 2006;49:735–746. doi: 10.1016/j.neuron.2006.01.036. [DOI] [PubMed] [Google Scholar]
- 27••.Furman M, Wang XJ. Similarity effect and optimal control of multiple-choice decision making. Neuron. 2008;60:1153–1168. doi: 10.1016/j.neuron.2008.12.003. In this paper the authors showed that a continuous neural circuit model reproduces single-neuron activity and behavioral performance in the 4-choice perceptual decision experiment of [13]. Furthermore, it demonstrates that a simple constant top-down control signal can instantiate speed-accuracy tradeoff in multi-alternative decisions, provided that the strength of this signal varies non-monotonically with the number of choice options. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Gigante G, Mattia M, Braun J, Del Giudice P. Bistable perception modeled as competing stochastic integrations at two levels. PLoS Comput Biol. 2009;5:e1000430. doi: 10.1371/journal.pcbi.1000430. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Glimcher PW. Indeterminacy in brain and behavior. Annu Rev Psychol. 2005;56:25–56. doi: 10.1146/annurev.psych.55.090902.141429. [DOI] [PubMed] [Google Scholar]
- 30.Gold JI, Shadlen MN. The neural basis of decision making. Annu Rev Neurosci. 2007;30:535–574. doi: 10.1146/annurev.neuro.29.051605.113038. [DOI] [PubMed] [Google Scholar]
- 31.Goldman MS. Memory without feedback in a neural network. Neuron. 2009;61:621–634. doi: 10.1016/j.neuron.2008.12.012. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Hanks TD, Ditterich J, Shadlen MN. Microstimulation of macaque area LIP affects decision-making in a motion discrimination task. Nat Neurosci. 2006;9:682–689. doi: 10.1038/nn1683. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33••.Harvey CD, Coen P, Tank DW. Choice-specific sequences in parietal cortex during a virtual-navigation decision task. Nature. 2012;484:62–68. doi: 10.1038/nature10918. In this remarkable study, the authors carried out calcium imaging of many neurons in rat’s posterior parietal cortex in a cued choice task. Decoding from population dynamics in the state space revealed distinct trajectories for the two options both in correct and erroneous trials. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Heitz RP, Cohen JY, Woodman GF, Schall JD. Neural correlates of correct and errant attentional selection revealed through N2pc and frontal eye field activity. J Neurophysiol. 2010;104:2433–2441. doi: 10.1152/jn.00604.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Huk AC, Shadlen MN. Neural activity in macaque parietal cortex reflects temporal integration of visual motion signals during perceptual decision making. J Neurosci. 2005;25:10420–10436. doi: 10.1523/JNEUROSCI.4684-04.2005. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36•.Hunt LT, Kolling N, Soltani A, Woolrich MW, Rushworth MF, Behrens TE. Mechanisms underlying cortical activity during value-guided choice. Nat Neurosci. 2012;15:470–476. doi: 10.1038/nn.3017. Magnetoencephalography was applied to human subjects performing a value-based choice task. A set of time-varying signals correlated with value comparison in a decision task were found and reproduced by the neural circuit model of [76,78,79] [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Ipata AE, Gee AL, Goldberg ME, Bisley JW. Activity in the lateral intraparietal area predicts the goal and latency of saccades in a free-viewing visual search task. J Neurosci. 2006;26:3656–3661. doi: 10.1523/JNEUROSCI.5074-05.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Kiani R, Hanks T, Shadlen M. Bounded integration in parietal cortex underlies decisions even when viewing duration is dictated by the environment. J Neurosci. 2008;28:3017–3029. doi: 10.1523/JNEUROSCI.4761-07.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 39.Kim JN, Shadlen MN. Neural correlates of a decision in the dorsolateral prefrontal cortex of the macaque. Nature Neurosci. 1999;2:176–183. doi: 10.1038/5739. [DOI] [PubMed] [Google Scholar]
- 40•.Krajbich I, Rangel A. Multialternative drift-diffusion model predicts the relationship between visual fixations and choice in value-based decisions. Proc Natl Acad Sci USA. 2011;108:13852–13857. doi: 10.1073/pnas.1101328108. The authors propose a generalization of the drift diffusion model to 3-choices, that requires three racers, each of which computes the strength of evidence for a particular option minus the maximum of the strengths of evidence for the other two options. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 41•.Liu F, Wang XJ. A common cortical circuit mechanism for perceptual categorical discrimination and veridical judgment. PLoS Comput Biol. 2008;4:e1000253. doi: 10.1371/journal.pcbi.1000253. This work demonstrates that the same model can account for behavioral and single-neuron physiological data in both veridical identification and categorical discrimination perceptual tasks. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 42.Lo CC, Wang XJ. Cortico-basal ganglia circuit mechanism for a decision threshold in reaction time tasks. Nature Neurosci. 2006;9:956–963. doi: 10.1038/nn1722. [DOI] [PubMed] [Google Scholar]
- 43.Lo C, Wang X-J. Speed-accuracy tradeoff by a control signal with balanced excitation and inhibition. Cereb Cortex. 2012 doi: 10.1152/jn.00845.2013. in press. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Lo CC, Boucher L, Paré M, Schall JD, Wang XJ. Proactive inhibitory control and attractor dynamics in countermanding action: a spiking neural circuit model. J Neurosci. 2009;29:9059–9071. doi: 10.1523/JNEUROSCI.6164-08.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45•.Louie K, Grattan LE, Glimcher PW. Reward value-based gain control: divisive normalization in parietal cortex. J Neurosci. 2011;31:10627–10639. doi: 10.1523/JNEUROSCI.1237-11.2011. The authors provided substantive evidence that valuation signals of single neurons in the lateral intraparietal area can be accounted for with divisive normalization. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Louie K, Glimcher PW. Efficient coding and the neural representation of value. Ann N Y Acad Sci. 2012;1251:13–32. doi: 10.1111/j.1749-6632.2012.06496.x. [DOI] [PubMed] [Google Scholar]
- 47.Luce RD. Response Time: their role in inferring elementary mental organization. Oxford University Press; New York: 1986. [Google Scholar]
- 48.Machens CK, Romo R, Brody CD. Flexible control of mutual inhibition: a neural model of two-interval discrimination. Science. 2005;18:1121–1124. doi: 10.1126/science.1104171. [DOI] [PubMed] [Google Scholar]
- 49.Marr D. Vision. New York: W.H. Freeman & Co; 1982. [Google Scholar]
- 50.Mazurek ME, Roitman JD, Ditterich J, Shadlen MN. A role for neural integrators in perceptual decision making. Cereb Cortex. 2003;13:1257–1269. doi: 10.1093/cercor/bhg097. [DOI] [PubMed] [Google Scholar]
- 51.McMillen T, Holmes P. The dynamics of choice among multiple alternatives. J Math Psych. 2006;50:30–57. [Google Scholar]
- 52.McPeek R, Keller E. Saccade target selection in the superior colliculus during a visual search task. J Neurophysiol. 2002;88:2019–2034. doi: 10.1152/jn.2002.88.4.2019. [DOI] [PubMed] [Google Scholar]
- 53•.Miller P, Katz DB. Stochastic transitions between neural states in taste processing and decision-making. J Neurosci. 2010;30:2559–2570. doi: 10.1523/JNEUROSCI.3047-09.2010. This modeling work shows that a sequence of quasi-stable states can be reproduced by a neural circuit model in the jumping mode. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 54.Miller P, Wang XJ. Power-law neuronal fluctuations in a recurrent network model of parametric working memory. J Neurophysiol. 2006a;95:1099–1114. doi: 10.1152/jn.00491.2005. [DOI] [PubMed] [Google Scholar]
- 55.Miller P, Wang XJ. Stability of discrete memory states to stochastic fluctuations in neuronal systems. Chaos. 2006b;16:026109. doi: 10.1063/1.2208923. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 56.Mirpour K, Bisley JW. Dissociating activity in the lateral intraparietal area from value using a visual foraging task. Proc Natl Acad Sci USA. 2012;109(25):10083–8. doi: 10.1073/pnas.1120763109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 57.Montague PR, Dayan P, Sejnowski TJ. A framework for mesencephalic dopamine systems based on predictive Hebbian learning. J Neurosci. 1996;16(5):1936–47. doi: 10.1523/JNEUROSCI.16-05-01936.1996. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Niwa M, Ditterich J. Perceptual decisions between multiple directions of visual motion. J Neurosci. 2008;28:4435–4445. doi: 10.1523/JNEUROSCI.5564-07.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 59•.Noonan MP, Walton ME, Behrens TE, Sallet J, Buckley MJ, Rushworth MF. Separate value comparison and learning mechanisms in macaque medial and lateral orbitofrontal cortex. Proc Natl Acad Sci USA. 2010;107:20547–20552. doi: 10.1073/pnas.1012246107. This was the first report of violation of rational choice in a monkey 3-alternative value-based decision task. Contrary to [45], the violation was seen only in subjects with lesion of the lateral orbitofrontal cortex. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 60•.Okamoto H, Fukai T. Recurrent network models for perfect temporal integration of fluctuating correlated inputs. PLoS Comput Biol. 2009;5:e1000404. doi: 10.1371/journal.pcbi.1000404. This is a nice modeling work investigating the conditions under which neural activity that jumps suddenly in time is consistent with a smooth time course of neural activity across trials. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Okamoto H, Isomura Y, Takada M, Fukai T. Temporal integration by stochastic recurrent network dynamics with bimodal neurons. J Neurophysiol. 2007;97:3859–3867. doi: 10.1152/jn.01100.2006. [DOI] [PubMed] [Google Scholar]
- 62.Philiastides MG, Auksztulewicz R, Heekeren HR, Blankenburg F. Causal role of dorsolateral prefrontal cortex in human perceptual decision making. Curr Biol. 2011;21:980–983. doi: 10.1016/j.cub.2011.04.034. [DOI] [PubMed] [Google Scholar]
- 63.Platt ML, Glimcher PW. Neural correlates of decision variables in parietal cortex. Nature. 1999;400:233–238. doi: 10.1038/22268. [DOI] [PubMed] [Google Scholar]
- 64.Resulaj A, Kiani R, Wolpert DM, Shadlen MN. Changes of mind in decision-making. Nature. 2009;461:263–266. doi: 10.1038/nature08275. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 65.Roitman JD, Shadlen MN. Response of neurons in the lateral intraparietal area during a combined visual discrimination reaction time task. J Neurosci. 2002;22:9475–9489. doi: 10.1523/JNEUROSCI.22-21-09475.2002. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 66.Sato TR, Murthy A, Thompson KG, Schall JD. Search efficiency but not response interference affects visual selection in frontal eye field. Neuron. 2001;30:583–591. doi: 10.1016/s0896-6273(01)00304-x. [DOI] [PubMed] [Google Scholar]
- 67.Shadlen MN, Newsome WT. Motion perception: seeing and deciding. Proc Natl Acad Sci USA. 1996;93:628–633. doi: 10.1073/pnas.93.2.628. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 68.Shadlen MN, Newsome WT. Neural basis of a perceptual decision in the parietal cortex (area LIP) of the rhesus monkey. J Neurophysiol. 2001;86:1916–1936. doi: 10.1152/jn.2001.86.4.1916. [DOI] [PubMed] [Google Scholar]
- 69.Smith PL, Ratcliff R. Psychology and neurobiology of simple decisions. Trends Neurosci. 2004;27:161–168. doi: 10.1016/j.tins.2004.01.006. [DOI] [PubMed] [Google Scholar]
- 70.Soltani A, Wang XJ. A biophysically based neural model of matching law behavior: melioration by stochastic synapses. J Neurosci. 2006;26:3731–3744. doi: 10.1523/JNEUROSCI.5159-05.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 71.Sugrue LP, Corrado GC, Newsome WT. Matching behavior and representation of value in parietal cortex. Science. 2004;304:1782–1787. doi: 10.1126/science.1094765. [DOI] [PubMed] [Google Scholar]
- 72.Tegnér J, Compte A, Wang XJ. The dynamical stability of reverberatory neural circuits. Biol Cybern. 2002;87:471–481. doi: 10.1007/s00422-002-0363-9. [DOI] [PubMed] [Google Scholar]
- 73.Thomas NW, Paré M. Temporal processing of saccade targets in parietal cortex area LIP during visual search. J Neurophysiol. 2007;97:942–947. doi: 10.1152/jn.00413.2006. [DOI] [PubMed] [Google Scholar]
- 74.Usher M, McClelland J. On the time course of perceptual choice: the leaky competing accumulator model. Psychol Rev. 2001;108:550–592. doi: 10.1037/0033-295x.108.3.550. [DOI] [PubMed] [Google Scholar]
- 75.Wang XJ. Synaptic reverberation underlying mnemonic persistent activity. Trends in Neurosci. 2001;24:455–463. doi: 10.1016/s0166-2236(00)01868-3. [DOI] [PubMed] [Google Scholar]
- 76.Wang XJ. Probabilistic decision making by slow reverberation in cortical circuits. Neuron. 2002;36:955–968. doi: 10.1016/s0896-6273(02)01092-9. [DOI] [PubMed] [Google Scholar]
- 77.Wang XJ. Decision making in recurrent neuronal circuits. Neuron. 2008;60:215–234. doi: 10.1016/j.neuron.2008.09.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 78.Wong KF, Wang XJ. A recurrent network mechanism of time integration in perceptual decisions. J Neurosci. 2006;26:1314–1328. doi: 10.1523/JNEUROSCI.3733-05.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 79.Wong KF, Huk AC, Shadlen MN, Wang XJ. Neural circuit dynamics underlying accumulation of time-varying evidence during perceptual decision-making. Frontiers in Comput Neurosci. 2007;1:6. doi: 10.3389/neuro.10/006.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 80.Woodman GF, Kang MS, Thompson K, Schall JD. The effect of visual search efficiency on response preparation: neurophysiological evidence for discrete flow. Psychol Sci. 2008;19:128–136. doi: 10.1111/j.1467-9280.2008.02058.x. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 81.Zylberberg A, Fernandez Slezak D, Roelfsema PR, Dehaene S, Sigman M. The brain’s router: a cortical network model of serial processing in the primate brain. PLoS Comput Biol. 2010;6:e1000765. doi: 10.1371/journal.pcbi.1000765. [DOI] [PMC free article] [PubMed] [Google Scholar]