Skip to main content
UKPMC Funders Author Manuscripts logoLink to UKPMC Funders Author Manuscripts
. Author manuscript; available in PMC: 2015 Apr 28.
Published in final edited form as: Neuron. 2014 Mar 19;81(6):1429–1441. doi: 10.1016/j.neuron.2014.01.020

Adaptive gain control during human perceptual choice

Samuel Cheadle 1, Valentin Wyart 2, Konstantinos Tsetsos 1, Nicholas Myers 1, Vincent de Gardelle 3, Santiago Herce Castañón 1, Christopher Summerfield 1
PMCID: PMC4411568  EMSID: EMS63128  PMID: 24656259

Abstract

Neural systems adapt to background levels of stimulation. Adaptive gain control has been extensively studied in sensory systems, but overlooked in decision-theoretic models. Here, we describe evidence for adaptive gain control during the serial integration of decision-relevant information. Human observers judged the average information provided by a rapid stream of visual events (samples). The impact that each sample wielded over choices depended on its consistency with the previous sample, with more consistent or expected samples wielding the greatest influence over choice. This bias was also visible in the encoding of decision information in pupillometric signals, and in cortical responses measured with functional neuroimaging. These data can be accounted for with a new serial sampling model in which the gain of information processing adapts rapidly to reflect the average of the available evidence.

INTRODUCTION

Optimal choices require information to be evaluated appropriately and combined without loss (Ernst and Banks, 2002; Gold and Shadlen, 2001; Wald and Wolfowitz, 1949). For example, an accurate medical diagnosis is made by considering all of the relevant symptoms, weighting each by the reliability with which it predicts a suspected condition. This intuition forms a mainstay of philosophical reflections on decision-making (Peirce, 1878) and is formalised in mathematically optimal models of choice behaviour, such as those based on serial integration of likelihoods (Ma et al., 2006; Wald and Wolfowitz, 1949; Yang and Shadlen, 2007). Empirically, this ideal observer framework accounts for choices, response latencies and neural activity observed during psychophysical tasks involving integrating noisy information over time, such as the random dot motion (RDM) paradigm (Beck et al., 2008; Gold and Shadlen, 2007).

However, unlike psychophysical judgments of stationary sensory signals, real-world choices (such as those faced by medical practitioners) often require the integration of variable, heterogenous or otherwise incommensurable information. Linear integration of highly variable information on an absolute scale poses a challenge for the nervous system, because it requires coding units to represent information across a broad dynamic range. One efficient solution is to encode information relative to the local context, by adapting the gain of neuronal firing to the range of information available over the short or long term (Barlow, 1961; Carandini and Heeger, 2012; Webb et al., 2003). For example, the visual system adapts to light levels over the diurnal cycle (Bartlett, 1965) or to stimulus contrast over the recent trial history (Greenlee and Heitger, 1988). In cortical neurons, adaptation is mediated both by habituation (Carandini and Ferster, 1997) and by normalisation mechanisms that scale the response of a neuron by that of its neighbours (Carandini and Heeger, 2012; Heeger, 1992). Beyond the sensory cortices, adaptive mechanisms may allow relative coding of utility, leaving economic choices vulnerable to the influence of local context (Louie and Glimcher, 2012; Padoa-Schioppa, 2009; Tremblay and Schultz, 1999), and provoking preference reversals and other violations of axiomatic rationality (Louie et al., 2013; Soltani et al., 2012). However, adaptive gain control has been overlooked in canonical models of information integration during perceptual choice (Bogacz et al., 2006; Ratcliff and McKoon, 2008).

Here, we describe a new serial sampling model of perceptual choice in which the gain of information processing adapts rapidly to the changing statistics of environment. We used this adaptive gain model to understand the behaviour of humans categorising a rapid stream of visual gratings on the basis of their angle of orientation. Humans performing this task exhibited two suboptimal biases in choice behaviour, visible in the impact that each grating (‘sample’) wielded over the eventual decision. Firstly, samples that carried similar decision information to their predecessor wielded greater influence on choice, even independent of their physical appearance (consistency bias). Secondly, samples that occurred later in the stream were also more diagnostic of choice (recency bias) (Tsetsos et al., 2012b). Simulations reveal that these two biases are both predicted by the adaptive gain model. Moreover, the gain of encoding of decision information in pupil diameter and in cortical brain responses followed tightly the predictions of the model. These findings provide evidence for remarkably rapid and flexible gain control during decision-making, define limits on the optimality of human judgment, and place a strong new constraint on computational models of perceptual choice.

RESULTS

Adaptive gain model

Consider a task in which observers view a fixed number of discrete samples of evidence occurring in succession (see Fig. 1b), with each sample k characterised by decision update DUk that can vary continuously between −1 and +1. The observers’ task is to judge whether the aggregate information ∑DUk is greater or less than zero. This ‘expanded judgment’ task is sometimes known as the Weather Prediction task (Poldrack et al., 2001; Yang and Shadlen, 2007). In this task, the optimal policy is to integrate the log-likelihood ratio associated with each sample, which is equivalent to basing choices on the sum of decision values ∑DUk. The adaptive gain model proposes that each decision update passes through a nonlinear (sigmoidal) transfer function f:

DU^k=f(DUkxk1,σ) [eq.1]

where xk corresponds to the point of maximal gain during processing of sample k, and σ corresponds to the shallowness of the non-linearity (step-like for σ => 0, linear for σ => ∞). A formal description of the transfer function is given in the Supplemental Information (S2).

Figure 1.

Figure 1

Experimental Design and Model. A. Category-level averaging task. Rapid visual streams of eight oriented Gabor patterns were presented at 4 Hz. Participants reported whether, on average, the tilt of the eight elements fell closer to the cardinal or diagonal axis. B. Schematic illustration of the adaptive transfer function for an example trial. Left panel: samples (blue dots, shaded by their order of occurrence from light to dark) are drawn from a distribution (inverted blue Gaussian) and characterised by a decision update value (position on x-axis). With each new sample, the transfer function (grey lines) shifts fractionally in the direction of the latest sample. Right panel: illustrative gain (difference on y-axis) from two samples (red lines) under two different transfer functions (light and dark grey lines). Gain is maximal when the inflection point of the transfer function coincides with the expectation of the sampling distribution (inverted blue Gaussian), i.e. given the dark grey curve. See also Supplemental Information, Fig. S1.

In the adaptive gain model, after each sample k the point of maximal gain xk is updated according to the feature information provided by sample k using a delta rule with learning rate α:

xk=xk1+α(DUkxk1) [eq.2]

Thus, after each sample k, the inflection point xk of the transfer function f is progressively adjusted towards the current expectation of the sampling distribution (Fig. 1b). Because the gradient of the transfer function is steepest at this point, samples arriving at, or close to, this point will be most diagnostic of choice. One important corollary of this model is that across samples, observers become most influenced by information with the highest probability of occurrence. Supplementary figure S1a displays how the shape of the transfer function adapts for changes in inflection point xk, and σ.

Finally, the output decision updates are subjected to a noisy sum with signal-to-noise ratio β and compared to a fixed category boundary falling at -β0 (zero for the ideal observer):

P)(respond>0)(=Φ(βk=18DU^k+β0) [eq.3]

where P(respond > 0) corresponds to the model-predicted probability of responding that the sequence is greater than zero, and Φ( ) corresponds to the probit decision rule, i.e., the Gaussian cumulative density function. The choice of this decision rule is motivated by the idea that each sample of evidence DUk is corrupted by additive Gaussian noise whose standard deviation relates inversely to the corresponding β parameter.

Adaptive gain model: simulations

We generated simulated streams of 8 samples of evidence (range: −1:1) and asked the model to classify them. For illustration, we began with an arbitrary set of parameters (β = 1, σ = 0.5, α = 0.5). The point of maximal gain xk was initialised to zero and updated according to equation 2.

When decisions are made on the basis of multiple samples of evidence, decisions may rely more heavily on the information provided by some samples than others. We thus used logistic regression to characterise the impact that each sample had on binary choices made by the model (and subsequently, humans). Our regression model comprised a total of 15 predictors in addition to an intercept term. The first eight predictors encoded the decision information for each of the eight samples (DUk) in order of occurrence, with the resulting coefficients (we call these ‘sample’ coefficients) wk reflecting the weight that each of the 8 successive samples wielded over the choice. The subsequent predictors and resulting coefficients wkδ were aimed at identifying how information in samples 2-8 was up-weighted or down-weighted as a function of its disparity with previous information. To this end we defined a new quantity δ(DU)k that encoded the absolute dissimilarity in decision information between each sample (excluding the first) and its predecessor (we set δ(DU)0 = 0):

δ(DU)k=DUkDUk1 [eq.4]

Predictors 9-15 encoded the interaction between DUk and δ(DU)k. The coefficients associated with these predictors (we call these ‘consistency’ coefficients) would be positive if dissimilarity between successive samples led to up-weighting and negative if dissimilarity led to down-weighting.

Thus, the full regression model, including the two per-sample predictors described above, was defined as follows

P)(respond>0)(=Φ(k=18(wk+wkδδ(DU)k)DUk+β0) [eq.5]

For the simulated data, the estimated sample coefficients wk and consistency coefficients wδk, are plotted against their corresponding sample k in Fig. 2a (left panel). These simulations reveal two clear predictions made by the adaptive gain model. First, although ‘sample’ coefficients are all positive, indicating that each contributed to the decision, those for later samples are more positive than those for earlier samples – a ‘recency’ bias (Fig. 2a, left panel; blue circles). The adaptive gain model thus predicts that later-occurring evidence will carry more weight in the decision. In the model, this occurs because the inflection point of the transfer function takes some time to converge to the mean of the sampling distribution, and so later-occurring samples will, on average, be more likely to fall in zone of maximum gain, where samples are more diagnostic of choice. Secondly, ‘consistency’ coefficients were all negative (Fig. 2a, left panel; green circles). Thus, the model predicts that observers will tend to disregard samples that differ strongly from their predecessor, because on average these will fall further from the inflection point of the adapting likelihood function. We call this a ‘consistency bias’, as it reflects the difficulty of evaluating successive samples with respect to a common standard.

Figure 2.

Figure 2

A. Model predictions using a simulation with arbitrary parameters. Estimated sample coefficients (blue), and consistency coefficients (green), for both an adaptive (top panel / circles) and static (bottom panel / triangles) version of the model. B. Strength of recency (top panel) and consistency (bottom panel) biases, shown for both the adaptive (α = 0.5, filled circles) and static (α = 0, filled triangles) version of the model, for a range of σ values controlling the gain of the nonlinear transfer function. C. Heatmaps showing predicted biases under a broad range of transfer function slope (σ) and learning rate (α) values. Red/yellow colouring shows positive values, blue/cyan shows negative values. Left panel: recency bias (DU5-8-DU1-4) is positive for learning rates greater than 1. Right panel: consistency bias is mostly negative for learning rates greater than 1.

For comparison, in Fig. 2a (right panel) we include the predictions of a ‘static’ model, which is identical in all respects except that the learning rate α is set to zero (β = 1, σ = 0.5, α = 0). This model reduces to a standard psychophysical model with a nonlinear transfer function (Naka and Rushton, 1966). As can be seen, the coefficients wk associated with DUk (encoding the weight given to each sample; blue triangles) are positive but do not increase over time, yielding no recency bias. Moreover, the coefficients wδk associated with DUk · δ(DU)k (encoding the up- or down-weighting of samples 2-8; green triangles) are close to zero, revealing no consistency bias. Thus, the effects described in Fig. 2a (left panel) depend on the adaptation of the transfer function across samples.

Next, we verified that these predictions are not specific to the parameterisation chosen above. In Fig. 2c, we summarise these two biases for values of σ and α in the range −1:1, setting the signal-to-noise level (β) to 1 (adding different levels of noise had no qualitative effect on these predictions). In the left panel, we calculate the recency bias as the difference between sample coefficients for late (DU5-8) and early (DU1-4) samples, and plot this summary measure for differing values of σ and α. Of note, these are positive for values of α > 0 (red/yellow shading), showing that the recency bias holds whenever the transfer function is updated towards the current expectation. By contrast, they are negative (showing a primacy effect) for values of α < 0, i.e. when the transfer function is updated away from the current expectation. This ability to predict primacy and recency shows the generality of the model and potential to capture a range of different weighting profiles, under distinct parameterisations. In the right panel we show the average of the 7 consistency coefficients under different parameterizations. Similarly, these are all negative for values of α > 0 (blue/cyan shading). The adaptive model displays both recency and consistency biases for a range of σ values, maximal as σ approaches 0 (i.e. when the transfer function is most step-like), whereas the static model fails to display either of these biases (Fig 2b). Together, these simulations show that the recency and consistency biases do not depend on specific values of σ or α (α > 0), and that both biases can be reversed via the introduction of a negative learning rate, which shifts the point of maximum gain away from the expectation of the sampling distribution.

Experiment 1: human behavioural data

We tested the predictions of the adaptive gain model by asking 23 healthy human observers to classify a stream of eight visual gratings (presented at 4 Hz) as more ‘cardinal’ or ‘diagonal’, providing feedback according to their aggregate decision update value (Fig. 1a). The decision update for each sample k (the quantity that we term DUk) depended nonlinearly on angle of orientation, with gratings whose orientation was closer to 0° or 90° providing evidence for one response, and those closer to 45° or −45° contributing evidence in favour of the opposite response (Fig. S1b and Supplemental Information). This task has the appealing property that decision updates are orthogonal to perceptual information, because very different angles (e.g. 10° and 80°) offer the same decision information, allowing us to distinguish any sequential effects in decision-making from low-level perceptual priming (Wyart et al., 2012a) (the independence of DUk from perceptual information is shown in Fig. S1b). Participants responded with a key press following presentation of all eight samples, allowing us to test (using the regression-based approach described in eq 5) for the presence of recency and consistency biases predicted by the adaptive gain model.

Using the regression approach described above to assess the impact that each sample had on human decisions, coefficients were less positive for early (k < 5) than late (k > 4) samples (Fig. 3a, left panel, blue circles; t(22) = 5.38, p < 0.001). Consistency coefficients for items 2-8 were on average negative-going (Fig.3a, left panel, green circles; t(22) = 2.05, p < 0.05). Thus human participants showed both the recency and consistency biases predicted by the model. Furthermore, the human data also reveal the existence of a ‘belief compatibility’ bias, reflecting the compatibility with the running average of the sequence, which displays the same qualitative pattern as the consistency bias, and which is also predicted by the adaptive gain model (see Supplemental Information for details and Fig. S3). Additionally, the recency and consistency biases displayed by human participants were present in both early and late periods of the experiment (Fig. S3), and did not depend on knowledge of the sequence length, as demonstrated in an additional behavioural experiment (Fig. S3).

Figure 3.

Figure 3

Behavioural results and model predictions. A. Experiment 1 B. Experiment 2. Regression coefficients from behavioural data (filled colour circles), together with adaptive model predictions (lines), estimated using a multivariate logistic regression of choice against a linear combination of factors. Blue dots and lines: sample coefficients wk for the eight sample decision updates (wk). Green dots and lines: consistency coefficients wδk indicating the influence of disparity between samples. Solid lines plot model predictions based on analytically derived parameter values, fitting against choice data. Dashed lines plot model predictions based on an exhaustive search over parameter space, minimising the mean squared error between behavioural and model parameter estimates. Error bars indicate +-1 S.E. See also Supplemental Information, Fig. S3.

We fit the adaptive gain model to the data by searching for the parameterisation that best predicted human choices given the input sequence (values of DUk), i.e. maximising the model log-likelihood corresponding to the probability of a model using a set of generative parameters β, σ, α to have produced the observed behavioural data. Details of this Bayesian model fitting approach are described in full in the Supplemental Information. Model choices obtained using the resulting maximum likelihood parameters (β = 0.63, σ = 0.67, and α = 0.26) were then subjected to the same regression analyses as human data, and are overlaid on Fig. 3a (solid blue and green lines). As can be seen, the model is able to capture the recency and consistency biases both qualitatively and quantitatively. The regression analysis yields 15 parameter estimates (Fig. 3a) and model predictions fall within the 95% confidence intervals produced by the human behavioural data for all but four of them. Additionally, we searched exhaustively over different values of the three parameters σ, β and α for the model parameterisation that minimised the difference with the human regression coefficients (dashed blue and green lines in Fig. 3a). The resulting parameters (β = 0.7, σ = 0.63, and α = 0.2) provided an equally good fit to the human data.

The statistically significant recency and consistency biases in human behaviour militate in favour of the adaptive gain model over its static counterparts. However, to compare these models more formally, fitting the static model to the data using the same maximum likelihood criteria described above produced a poorer fit compared to its adaptive model counterpart (shown in Fig. S3 Supplemental information) even when the extra parameter was taken into account by using the Bayesian Information Criterion (BIC) (see Experimental Procedures). In addition, directly comparing the residual mean squared error (MSE) between behavioural parameter estimates for each subject and model predictions, we found that the adaptive model was a significantly better fit to the data than its static counterpart (t(22) = 4.12, p < 0.0005). This comparison shows that the adjustment of the slope of the transfer function alone is not sufficient to account for the biases observed in human choice behaviour. The failure of the static model to account for human performance also demonstrates the suboptimality of integration in our task, because the static model describes the performance of a Bayesian ideal observer with or without input noise (Fig. S2 and text).

Experiment 1: pupillometry

Traditional theories (Aston-Jones and Cohen, 2005) and recent evidence (Cohen and Aston-Jones, 2005; Eldar et al., 2013; Gilzenrat et al., 2002; Nassar et al., 2012) have pointed to pupil diameter as a correlate of the gain of information processing during decision-making. Pupil diameter is partly driven by arousal-linked changes in the neural response of the locus coeruleus, which in turn modulates cortical activity via diffuse noradrenergic projections (Usher et al., 1999). We thus turned to pupillometric data recorded during experiment 1 to investigate how moment-by-moment changes in pupil diameter following the presentation of sample k were predicted by its decision information DUk, and how this correlation was modulated by consistency between current and previous sample δ(DU)k.

Pupil diameter changes slowly in response to cognitive and attentional factors, typically peaking 1-2 seconds post-stimulus, but its sub-second fluctuations can be disambiguated using a linear regression approach similar to that often employed to model the sluggish BOLD response (Nassar et al., 2012; Wierda et al., 2012; Zylberberg et al., 2012). Having successfully recorded pupil diameter from 20 of the 23 participants we created a regression model in which the pupil response at each time point t following sample k was modelled as a linear combination of ∣DUk∣ (reflecting the strength of the current evidence sample, associated with coefficients wk,dt) and ∣DUk∣ · δ(DU)k (reflecting how the neural encoding of evidence strength depends on its consistency with the previous evidence sample, associated with coefficients wk,dt δ):

pupil)(tk+dt)(=(wk,dt+wk,dtδδ(DU)k)DUk+wdtsumj=18DUj+dt [eq.6]

where pupil(tk + dt) is the normalized and base-lined pupil size at time dt following the onset of sample k (the final term encodes the mean decision information on that trial, ensuring that fluctuations in pupil diameter reflect sample-specific changes in DUk and not the global difficulty of the trial). The coefficients of this regression across time provide the time course of encoding of decision information in pupillometric signals (wk,dt) and the modulation of this encoding by the consistency of decision information contained in consecutive samples (wδk,dt). The evidence contained in the first sample (DU1) was excluded from the analysis to avoid interference from possible transient responses at the beginning of the sequence, unrelated to the decision update. These encoding functions, averaged across samples, are plotted in blue and green respectively in Fig. 4b. Pupil diameter tended to grow with increasing decision information, with a peak in encoding at about 1000 ms post-stimulus (blue trace; maximum significance at 1140 ms after sample onset, t(19) = 3.5, p < 0.001). Critically however, and consistent with behavioural findings, this encoding was negatively modulated by the difference to the previous sample δ(DU)k – samples whose predecessor provided very different decision information were encoded more weakly in pupillary signals. This is shown by plotting the coefficients (wδk,dt) associated with DUk · δ(DU)k, which diverged negatively from zero (Fig. 4b, green trace; maximum significance at 2000ms after sample onset, t(19) = 3.0, p < 0.004). In other words, pupillary responses track fluctuations in the strength of decision evidence, and the reduced gain of information processing when evidence is inconsistent with its predecessor, that mirror those observed in behaviour, and are predicted by the adaptive gain model. Demonstrating this effect another way, in Fig. 4c we plotted the encoding of DUk in pupil responses separately for samples with small and large values of δ(DU)k, i.e. that were similar/dissimilar to the previous sample based on a median split. Encoding was stronger when samples were more similar (full blue line) than when they were different (faded blue line).

Figure 4.

Figure 4

Pupil dilation in response to decision information. A. Pupil dilation over the course of a trial expressed as the proportion change relative to a pre-sequence baseline level. Vertical lines represent the onset of the 8 decision-relevant samples. B. Regression coefficents showing the encoding of decision information in pupil diameter (eq. 6) for the time window −500 to +2500ms relative to the onset of each sample. Blue curves - coefficents for decision update wk. Green coefficients for disparity interaction wδk. Each curve reflects coefficients averaged across samples 2 – 8. Coloured horizontal bars indicate regions of significance against baseline at the level p<0.05 uncorrected (light) and corrected (dark) for multiple comparisons. C. Pupil dilation encoding timecourse of decision update DUk divided into conditions of either small or large δ(DU)k – i.e. large or small shifts in decision space between the preceding (k-1) and current (k) sample – divided on the basis of a median split on δ(DU)k. Coloured horizontal bars indicate regions of significant difference between low and high δ(DU)k at p<0.05 uncorrected (light) and corrected (dark) for multiple comparisons. D. Pupil encoding timecourse of decision update DUk, for early (position 2-4) vs late (position 5-8) occurring samples. See also Supplemental Information, Fig. S4.

Previous models have accounted for recency effects in perceptual categorisation by assuming that integration is leaky, with information dissipating gradually across the choice period (Usher and McClelland, 2001; Wang, 2002). Our model of behaviour provides an alternative explanation for the recency bias, suggesting that it is (at least in part) due to dampened information gain for earlier samples, rather than exclusively by a subsequent leak of the information. If this is indeed the case, then momentary encoding of DUk in pupilk,t should be heightened for later samples. We thus averaged encoding curves for early (samples 2-4) and late (samples 5-8) occurring samples separately (see Experimental Procedures), and plotted them separately (Fig. 4d). Statistical comparison indicated that they diverged reliably at about 1 s post-stimulus, with stronger encoding for later samples (t(19) = 1.78, one-tailed p < 0.05), as predicted by the adaptive gain model.

Experiment 2: human behavioural data

Changes in pupil diameter offer a proxy for variations in the gain of cortical processing, but a more direct measure of brain activity can be obtained using functional neuroimaging. We thus asked a new cohort of 18 participants to perform the same task whilst undergoing functional magnetic resonance imaging (fMRI). Once again, observers viewed streams of 8 visual gratings presented at 4 Hz, and judged whether the angles of orientation were (on average) more ‘cardinal’ or more ‘diagonal’. Regression coefficients for wk and wδk are shown in figure 3b. As can be seen, behavioural data once again revealed a recency bias (t(18) = 3.9, p < 0.001) and a consistency bias (green circles; t(18) = 4.6, p < 0.001), similar to those observed in experiment 1. The best fits to human choices (solid lines) and regression parameters themselves (dashed lines) reveal that the adaptive gain model can approximate these very closely (parameters of the best-fitting model were β = 0.5, σ = 0.92, and α = 0.65). Model predictions for 13 out of 15 parameter estimates fell within the 95% confidence intervals of the behavioural data. Once again the adaptive model outperformed its static counterpart, in which the learning rate was fixed at zero. Replicating the results of experiment 1, the static model demonstrated significantly larger residuals (t(18) = 4.67 p < 0.0002) and poorer fits to the data even after taking into account the extra parameter using the BIC.

Experiment 2: fMRI data

The main effect of stimulus onset in experiment 2 evoked positive-going BOLD responses in a number of cortical regions, including visual cortical zones, the lateral parietal and prefrontal cortices, the pre-supplementary motor area and anterior insular cortex (Fig. S5 and Tables S1-S3, Supplemental Information). The main purpose of our experiment was to determine how BOLD signals encoded information provided by each stimulus (i.e. DUk) and how this encoding was modulated by the consistency between current and previous sample δ(DU)k. To this end, we focussed on three cortical clusters that have been implicated in perceptual category judgments in previous neuroimaging studies (Filimon et al., 2013; Grinband et al., 2006; Heekeren et al., 2008; Ho et al., 2009; Liu and Pleskac, 2011): the lateral parietal cortex (falling principally on the inferior parietal lobule; IPL); a region of the dorsal medial prefrontal cortex comprising mainly the pre-supplementary motor area stretching inferiorly to the anterior cingulate cortex (dMFC); and the anterior insular cortex (figure 5a, left, middle and right panels respectively).

Figure 5. fMRI data.

Figure 5

A. Clusters selected as regions of interest: IPL, dMFC and AINS, rendered onto axial/sagittal slices of the template brain of the Montreal Neurological institute at a threshold of p < 0.00001 uncorrected. B. Encoding of DUk (i.e. evidence favouring cardinal) in BOLD signals as a function of time in seconds (x-axis) for trials on which cardinal (solid blue line) or diagonal (dashed blue line) were chosen. Shading around each line shows standard error of the mean. Upper blue bars denote timepoints where encoding curves diverge significantly, at p<0.05. C. As for B, but plotting consistency coefficients wδk for BOLD signals. Green bars denote significant timepoints. D. Plot of the difference between encoding for each choice in A and B. Blue lines: divergence between encoding of DUk for the two choices. Green lines: divergence between encoding of DUk · δ(DU)k for the two choices. See also Supplemental Information, Fig. S5 and Tables S1-S3.

We analysed imaging data for each region using a comparable approach to that adopted for the eyetracking data: extracting fMRI timeseries aligned to stimulus onset for each trial allowed us to regress the decision information for each sample DUk onto the post-stimulus BOLD signals, yielding one parameter estimate per sample for each timepoint following the stimulus. We then averaged these timecourses across samples, and plotted separately for trials on which each of the two options (cardinal vs. diagonal) was chosen (blue lines in figure 5b; see Experimental Procedures). The resulting curves plot the extent to which decision information (e.g. relative evidence favouring cardinal over diagonal) was encoded in BOLD signals, both when participants chose cardinal and when they chose diagonal.

BOLD)(tk+dt)(=(wk,dt+wk,dtδδ(DU)k)DUk+wdtsumj=18DUj+dt [eq.7]

This approach parallels that frequently employed to identify regions encoding action values in reward-guided decision tasks (Boorman et al., 2009). In equation 7, BOLD(tk + dt) is the normalized and base-lined BOLD response in each region at time dt following the onset of sample k. The resulting coefficients are shown in figure 5b. Strong encoding of DUk as a function of choice was observed in each of the three regions of interest, as demonstrated by the divergence between wk,dt in BOLD signals for trials with opposing choices (peak statistics, at 8-10 s post-stimulus: IPL, t(18) = 2.78, p < 0.007; dMFC t(18) = 4.06, p < 0.001; AINS t(18) = 7.22, p < 0.000001; blue traces and bars in fig. 5b). In other words, BOLD signals in these regions encoded decision-relevant activity in a fashion that predicted choices.

Next, building upon the analyses described above, we assessed how the encoding of DUk in BOLD signals was modulated by the recent history of stimulation. To this end, we estimated regression coefficients for the interaction between DUk and δ(DU)k, i.e. the extent to which this encoding of DUk was heightened or dampened according to consistency with the previous sample. Coefficients for this regression were reversed with respect to those for wk,dt (green traces in Fig. 5c), consistent with the negative coefficients observed for wδk,dt in the behavioural and eyetracking data. This finding thus reveals a reliable mitigation of DUk encoding in all three regions when information diverges between samples (peak statistics, at 2-5 s post-stimulus: IPL, t(18) = 1.87, p < 0.05; dMFC t(18) = 2.02, p < 0.03; AINS t(18) = 2.38, p < 0.02; green bars in Fig. 5c). Indeed, plotting the relative encoding (for the two responses) of DUk and DUk ·δ(DU) in the two choices revealed reliable positive- and negative-going coefficients that matched those obtained for behavioural data (blue and green bars in Fig. 5d). In other words, encoding of decision information in the BOLD signal is modulated by the consistency between current and previous decision information.

Experiment 3. Reanalysis of EEG recordings

In an earlier published study (Wyart et al., 2012a), we recorded EEG data whilst participants performed the same cardinal/diagonal task. In these data, DUk was encoded in EEG signals peaking at 500ms post-stimulus over the parietal cortex. These data afforded the opportunity to test how the encoding of DUk was modulated by δ(DU)k. We found strong modulation between 400 and 500ms post-stimulus (t(14) = 4.7, p < 0.001), consistent with the encoding timecourse for DUk. These results are described in figure 6 and Supplemental information figure S6.

Figure 6.

Figure 6

EEG response to decision information. A. EEG encoding timecourse of decision information. Blue - decision update coefficients wk. Green – disparity interaction coefficients wδk. Curves display parameter coefficients from the multivariate regression of EEG signal amplitude against a linear combination of decision factors (eq. S4) for the time window −200 to +800ms relative to the onset of each sample. Each curve reflects coefficients averaged across samples 2 – 8. Coloured horizontal bars indicate regions of significance at the level p<.05 uncorrected for both wk (blue) and wδk (green). B. Timecourse of decision update DUk divided into conditions of either small (dark blue) or large (light blue) inter-item disparity across decision space (δ(DU)k), divided on the basis of a median split. Coloured horizontal bars indicate significance against baseline and black bars indicate regions of significant difference between low and high δ(DU)k, both at p<0.05 uncorrected. C. EEG encoding topographies of decision information at 500 ms following the corresponding element. Blue - decision update DUk. Green - disparity interaction DUk · δ(DU)k. Large dots indicate parietal electrodes of interest (CP3, CPz, CP4, P3, Pz, P4, POz). D. EEG encoding topographies for DUk divided into conditions of either small (left) or large (right) inter-item disparity across decision space, divided on the basis of a median split. Same conventions as in C. See also Supplemental Information, Fig. S6.

Experiment 4. Manipulation of presentation order

Statistical comparison strongly supported the view that the gain function adapts rapidly over time. Nevertheless, we decided to pit the static and adaptive versions of the model against each other in a further experiment for which the two models made different predictions. We created a version of the cardinal/diagonal discrimination task on which each trial consisted of just 4 samples, with 2 favouring cardinal and 2 favouring diagonal. This allowed us to systematically manipulate the order of presentation of the samples drawn from either category (cardinal [C] or diagonal [D]) in three discrete conditions: alternating (C-D-C-D or D-C-D-C), pairs (C-C-D-D or D-D-C-C) and sandwich (C-D-D-C or D-C-C-D). The adaptive and static gain models make very different predictions about how the weights associated with decision information on samples 1-4 should vary as a function of the three conditions, with static gain predicting no difference as a function of position in any of the three conditions (Fig. 7b, blue dots), and the adaptive gain model predicting different patterns in each (Fig. 7a, red dots). These predictions are shown in Fig. 7a alongside the values obtained from a new cohort of human participants (n = 16) performing the task (filled grey bars). Comparing the mean squared error (MSE) between behavioural parameter estimates and those for the two models produced significantly better fits for the adaptive compared to the static model (t(14) = 3.0, p < 0.001). Strikingly, the adaptive model is able to capture the large shifts in weighting associated with each condition (Fig. 7a). Subjecting these 4-item task data to the same analyses as experiments 1 and 2, we also observed the same recency (t(14) = 2.9, p < 0.02) and consistency (t(14) = 3.9, p < 0.003). See also Supplemental Information Fig. S7, for further behavioural results and model predictions from the 4-item task.

Figure 7.

Figure 7

Adaptive (A) and static (B) model predictions for different sequence classes. Decision weighting profiles based on four item sequences (experiment 4), estimated using a multivariate logistic regression of choice against a linear combination of the four decision updates. Behavioural weighting coefficients (filled grey bars) are displayed for three cardinal (C) - diagonal (D) sequence types, alternating (C-D-C-D or D-C-D-C), pairs (C-C-D-D or D-D-C-C) and sandwich (C-D-D-C or D-C-C-D).. Adaptive gain model parameter estimates from twenty separate runs are plotted for both adaptive (A, red circles) and static (B, blue circles), based on exhaustively searching over parameter space, minimising the MSE between behavioural and model parameter estimates. See also Supplemental Information, Fig. S7.

DISCUSSION

We used impact analyses in conjunction with an expanded judgment task and functional neuroimaging to assess how humans weighted decision-relevant information arriving in a rapid stream. We observed evidence for two suboptimal biases. Firstly, humans are unduly swayed by the samples that occurred shortly prior to choice (recency bias). Secondly, their decisions are less influenced by samples that carry dissimilar decision information to their predecessor (consistency bias). These two biases, and their neural expression in pupillometric and imaging data, are accounted for by a model in which the gain of information processing is adapted rapidly – within the timespan of a single trial – to coincide with the expectation of the distribution from which information is sampled. This ‘adaptive gain’ model thus describes a gain control mechanism by which categorical inferences become tuned to local statistics of decision-relevant information, much as low-level sensory systems maximise gain by adapting to background levels of stimulation (Bartlett, 1965; Carandini and Heeger, 2012; Fairhall et al., 2001).

The adaptive gain model makes two critical assumptions. The first is that categorical responses involve a nonlinear transformation of decision information. This assumption is common to standard psychophysical models of detection (Naka and Rushton, 1966) and cognitive models of categorical perception (Shepard, 1987; Tenenbaum and Griffiths, 2001), but has not hitherto been a prominent feature of the serial sampling approach. In categorisation, the well-known heightened sensitivity of observers close to a category boundary, and the consequent emergence of prototypical representations (perceptual ‘magnet’ effects) is naturally modelled with a sigmoidal transfer function of the sort adopted here (Bonnasse-Gahot and Nadal, 2008; Feldman et al., 2009). We have previously shown that perceptual averaging of multiple, simultaneously-occurring visual items is best described with a model in which decision information is transformed sigmoidally, leading to ‘robust averaging’ – reduced impact for samples of evidence that diverge from the central tendency of the information (de Gardelle and Summerfield, 2011).

The second assumption is that the point of maximal gain of evidence processing (i.e the steepest portion of the nonlinear transfer function, f) adapts to the local statistics of stimulation. As this happens, divergent or otherwise surprising evidence will tend to fall in a portion of decision space with lower gain (i.e. the shallower part of the transfer function), and thus contribute more weakly to choice. It should be noted that expected evidence will lead to a weaker transduced signal in absolute terms, consistent with the finding that expected stimuli elicit globally reduced neural signals (Summerfield and Egner, 2009; Summerfield et al., 2008; Todorovic et al., 2011). The biases that we report are not merely due to perceptual priming, because our cardinal/diagonal task carefully orthogonalises the magnitude of the perceptual update (perceptual similarity between adjacent samples) and decision update (difference in decision value between two adjacent samples).

The adaptive gain mechanism ensures that information that confirms (rather than disconfirms) the current belief will fall at the point of maximal gain and come to have more impact on the eventual choice. This furnishes the prediction – recently confirmed – that observers should exhibit heightened sensitivity to information that is more likely to occur, even when it is cued as irrelevant to the task at hand (Wyart et al., 2012b). Moreover, the stronger gain of processing of belief-congruent evidence provides a mechanism for understanding the confirmation bias, by which belief-consistent evidence is given more credence, with inconsistent evidence often downplayed or ignored (Nickerson, 1998). One study in which participants integrate discrete binary samples of evidence reported that belief-consistent evidence tends to be under-rather than over-weighted, but as noted by the authors of that study, this may reflect the fact that once a criterial level of evidence was achieved (i.e. 3 out of 5 successes), the decision could already be made with full certainty (de Lange et al., 2011). During speeded decision-making, the presence of a confirmatory bias may help to bring decisions about weak or ambiguous evidence to a close, avoiding prolonged deliberation and contributing to a maximisation of overall reward rate (Bogacz et al., 2006; Deneve, 2012; Drugowitsch et al., 2012).

In the current experiment, we provide evidence that gain adaptation occurs within the timeframe of a single trial. This means that information occurring at the end of the trial is (on average) processed with higher gain than information presented early on. Our model thus provides one explanation for the recency bias, whereby later-occurring evidence often holds more sway over decisions, at least over the timescale of integration required here (Ossmy et al., 2013). However, whereas past accounts of recent have invoked a forgetting process by which information is lost over time (Usher and McClelland, 2001), the adaptive gain model predicts that early samples should be processed with weaker gain at the point of occurrence. Accordingly, we found that pupillary signals, proposedly a generic index of the gain of cortical processing, encoded decision information more steeply for later relative to earlier samples. Nevertheless, our model does not rule out leak as an additional contributor to the recency bias, in particular where integration occurs over several seconds (Tsetsos et al., 2012a).

Psychophysical judgments made by well-trained humans and other primates have frequently been found to approach the standard of an ideal observer, i.e. one whose performance is limited only by the variability in the stimulus (Bogacz et al., 2006; Gold and Shadlen, 2001; Ma et al., 2006). Our model predicts that when input signals have low variance, for example when the evidence is a fixed quantity corrupted by noise, then performance will approach optimality. This occurs because all of the evidence falls within the narrow range for which gain is maximal, where the transfer function is approximately linear. In other words, our model makes the strong prediction that optimality in perceptual choice will depend directly on the variability of the input signal. This may help explain an long-standing discrepancy in the decision sciences, as to why humans appear to perform optimally in perceptual classification tasks (Bogacz et al., 2006) but are often inconsistent in their subjective preferences or economic choices (Kahneman et al., 1982; Vlaev et al., 2011). Unlike most psychophysical judgments, real-world choices often involve comparisons between dissimilar goods or incommensurable options, which the adaptive gain model predicts will be deviate from optimality. Indeed, recent theories have appealed to another form of adaptive control – divisive normalisation – as a source of contextual bias driving decoy effects and other preference instabilities (Louie et al., 2013). More generally, adaptive gain mechanisms may have evolved to ensure that, in world in which the decision-relevant information can change rapidly and unpredictably, the most probable information is processed with the highest gain, sacrificing optimality for computational efficiency (Carandini and Heeger, 2012). We have previously shown that rapid adaptation to the information on the previous trial guides decision-making when classification judgments are on nonstationary information (Summerfield et al., 2011).

One plausible mechanism by which adaptive gain control could be implemented is by adjustments to the tuning of neurons coding for expected information (Eldar et al., 2013). In the Supplemental Information (S2) we describe how our psychophysical model could be implemented in a biologically plausible fashion as population coding model in which the tuning profile across a population of neurons is adjusted in so that the tuning of neurons sensitive to expected information is sharpest. In this population model, the consistency and recency biases fall out of a very simple assumption: that sensory tuning, hence the gain with which the next sample is processed, is determined by the current state of activation in a subsequent layer that integrates sensory information in an additive fashion. This model not only predicts both the consistency and recency biases, but also correctly predicts that the encoding of decision information (e.g. the correlation between DUk and neurophysiological measures described here) will depend positively on the gain of information processing, and that fulfilled predictions per se would elicit weaker neural responses, because a more select group of sharply tuned neurons is activated by the input (Wiggs and Martin, 1998). This theory is consistent with the repeated finding that BOLD signals to expected information tend to be suppressed, rather than enhanced (Summerfield and Egner, 2009; Summerfield et al., 2008; Todorovic et al., 2011) as well as with a recent report that despite this ‘expectation suppression’, more probable signals can be decoded with heightened fidelity from appropriate visual regions (Kok et al., 2012). As such, our model bears close resemblance to ‘predictive coding’ accounts of perception, in which expectations constrain the space over which perceptual inferences can be made (Friston, 2005).

In conclusion, we describe evidence for adaptive gain control during human decision-making. Adaptive gain control provides a mechanism for neural systems to adjust their range of sensitivity to suit the information that is most likely to occur in the current environment. During serial integration of decision information, this mechanism prompts us to give most credence to expected information, and to downplay that which is outlying or unanticipated. This leads to maximal sensitivity to likely events, but gives rise to previously-described suboptimalities, such as the recency bias and consistency bias. This work unites two literatures on priming and decision-making and provides a framework for understanding decision-making in a changing world.

EXPERIMENTAL PROCEDURES

Participants

Healthy human volunteers (n=23, n=18, n=16, and n=16 in experiments 1-4 respectively) gave informed consent to participate in the study. All reported normal or corrected-to-normal vision and no history of neurological or psychiatric impairment.

Task

In all four experiments, participants viewed a stream of successive, centrally presented Gabor patches (‘samples’) with variable angle of orientation, and judged whether, on average, the orientation of the Gabors fell closer to the cardinal axes or diagonal axes. Each sample k was assigned a decision value DUk which mapped orientation onto decision value according to a “sawtooth” function, whose output ranged between +1 (−90°, 0°, +90°) and −1 (−45°, +45°) with decision values of zero at the category boundaries (22.5°, 67.5°, 112.5°, and 157.5°). The sawtooth function is shown in Supplemental information figure S1. Participants received fully informative feedback according to whether they correctly classified the stream as more cardinal (i.e. where ∑DUk > 0) or diagonal (where ∑DUk < 0).

Stimuli

Each stream consisted of 8 samples (exp 1-3) or 4 samples (exp 4). Each sample was a high contrast Gabor patch (spatial frequency 2 cycles per degree; standard deviation of Gaussian envelope 1 degree) presented against a uniform grey background. Samples were presented with an onset asynchrony of 250 ms (i.e. at 4 Hz). Sequences were preceded and succeeded by a mask created from the linear superposition of the four cardinal and diagonal Gabor patterns.

We collected response data differently in experiments 1/4 and 2/3. In experiments 2 and 3, responses followed the onset of a centrally-occuring green dot that succeeded the backward mask, and were made with a button (exp 2) or key (exp 3) press with the left or right hand (category-response mappings were counterbalanced across participants). Auditory feedback consisted of an ascending (400 Hz/800 Hz, 100 ms/100ms) or descending (800 Hz/400 Hz, 100 ms/100ms) tone signalling correct and incorrect responses respectively that onset 250 ms after response. In experiments 1 and 4, we used a different approach in which participants reported the integrated decision value on a continuous scale, allowing precise feedback to be administered. Responses were cued by a screen divided vertically at the centre, and the response was made by clicking the mouse on the left or right of the screen at a position corresponding to the integrated decision value. Immediately following response, visual feedback in the form of a vertical red line was presented along the scale indicating the position corresponding to the objective average of all elements. Trials were separated by a blank inter-stimulus interval of 1500 ms (exp 1 and 3), 1500-4500 ms (exp 2), or 600 ms (exp 4). Experiments consisted of 400 trials in blocks of 50 (exp 1 and 4), 450 trials in blocks of 90 (exp 2), or 672 trials in blocks of 96 (exp 3).

Design

In experiments 1 and 4, samples were drawn from one of two bimodal distributions, thereby ensuring an equal number of samples with more cardinal and more diagonal orientation. Bimodal distributions were shifted positively or negatively by a fixed value to control whether the sum of decision values ∑DUk favoured a cardinal or diagonal response. In experiment 2, sample orientations were drawn from one of two sigmoidal distributions over decision values (range −1 to +1). In experiment 4, the order of presentation of the 4 Gabors was manipulated to create three discrete conditions: alternation (i.e. C-D-C-D or D-C-D-C, 50% of trials), pairs (C-C-D-D or D-D-C-C, 25% of trials) and sandwich (C-D-D-C and D-C-C-D, 25% of trials), where C and D represent samples favouring the cardinal or diagonal categories. Because decision values were sampled from normal or bi-model distributions, orientations (perceptual values) were also drawn from distributions containing a subset of all possible angles.

Apparatus and recordings

Visual stimuli were generated, and behavioural responses recorded using Psychophysics-3 Toolbox (Brainard, 1997; Pelli, 1997) in addition to custom scripts written for MATLAB (The MathWorks, Natick, MA, USA). Data from experiment 2 were obtained in the fMRI scanner. Images were rear-projected into the scanner bore via a custom shielded Samsung 40 inch LCD screen (LTA400HF) with a 60 Hz refresh rate. For all other experiments observers viewed a standard cathode ray tube (CRT) monitor set to a display refresh rate of either 85Hz (exp 1) or 60Hz (exp 3 and 4) with a resolution of 1024 × 768, from a distance of 60 - 80cm in a darkened room.

Eye-tracking

In experiment 1 pupil diameter was measured continuously throughout the experiment using an Eyelink 1000 eye-tracking system, recording monocularly at a sampling rate of 250Hz. Calibration was performed twice, both before the start of the experiment and after 200 trials. Subjects were instructed to minimize head movement with the use of a chinrest. Data from three subjects was discarded due to poor calibration.

Preprocessing and encoding analysis of pupillometry data

Blink artifacts were removed from the data using a custom interpolation method in which spline fitting was performed based on pupil diameter 200ms prior to blink onset, and 200ms after offset. Subsequently, high frequency components were removed (data smoothing) using a 50ms sliding window. Pupil measures were z-scored within trials, expressed relative to a pre-trial baseline period (−280 to 0ms relative to sequence onset), and trimmed using cutoffs of ±3. Finally, the data was z-scored across trials before being entered into a general linear model (GLM). Three regressors were included in the GLM (equation 7, main text): the decision update DUk for each sample, the mean DU for the 8 samples comprising each sequence, and the interaction between DUk and the absolute difference in decision update between each sample and its predecessor DUk ·δ(DUk). Regressions were performed on a sample by sample basis for pupil measures from −500 to +2000ms relative to sample onset. Following this, parameter estimates were then averaged across samples. Significance testing of the pupil encoding curves was carried out using a mass univariate approach, corrected for multiple comparisons using a nonparametric random permutation test: after point estimate t-values had been obtained at all timepoints, values of DUk and dilation traces were shuffled 1000 times to produce randomly paired datasets. We then performed t-tests across the entire sequence time-course (−500 to 2000ms peristimulus time, in steps of 20ms) for each shuffled dataset, recording the maximum test statistic from the entire time-course. Point estimate statistics that fell within the 95th percentile (p < 0.025 or p > 0.975) of this null distribution based on shuffled values, were deemed significant. For further detail on the procedure see (Summerfield and Mangels, 2005).

Preprocessing and analysis of fMRI data

Images were acquired in a 3 Tesla (Siemens TRIO) with a 32 –channel head coil using a standard echo-planar imaging (EPI) sequence. Images were 64×64×36 volumes with voxel size 3×3×3 mm; acquired with a 2s repetition time (TR) and 30ms echo time. Five runs of 300 volumes were obtained, each of which lasted approximately 10 minutes and corresponded to one experimental block of 90 trials.

Preprocessing of the imaging data was carried out in SPM8 and included correction for head motion and slice acquisition timing, followed by spatial normalization to the standard template brain of the Montreal Neurological Institute (MNI brain). Images were resampled to 4mm cubic voxels and spatially smoothed with a 10mm full width at half-maximum isotropic Gaussian kernel. A 128s temporal high-pass filter was applied in order to exclude low-frequency artifacts. Temporal correlations were estimated using restricted maximum likelihood estimates of variance components using a first-order autoregressive model. The resulting nonsphericity was used to form maximum likelihood estimates of the activations.

All statistical analyses were first conducted at the level of individual subjects and the resulting estimates were carried forward for a second stage involving group-level inferences. We identified regions of interest (ROIs) using a simple design matrix in which a finite impulse response (FIR) was used to model each timepoint after stimulus onset for all trials. We then plotted voxels that responded significantly (p < 0.00001, uncorrected) in the third bin (6 s) post-stimulus. We chose to focus on IPL, dMFC and AINS because these regions have all been implicated in perceptual decision-making in recent fMRI studies. The purpose of our study was not to make strong claims about the localisation of regions involved in decision-making, but to use previously-described regions to test hypotheses pertaining to adaptive gain control.

BOLD timeseries were extracted from each region and averaged across voxels. We then extracted overlapping epochs from −4 s to 24 s around the onset of each stimulus, and used a GLM approach similar to that for the pupil and EEG data to assess how decision information was encoded in BOLD signals. Segregating trials into those where the participants’ choice was cardinal vs. diagonal, we regressed decision information [8 regressors encoding Duk associated with each sample, and 7 regressors encoding DUk · δ(DU)k] on the resulting BOLD signals at each timepoint separately. This resulted in an intercept and 15 parameter estimates for each timepoint per choice condition. We averaged across the 8 estimates for DUk and the 7 estimates for DUk · δ(DU)k and plotted the resulting average estimates for each timepoint and choice (cardinal vs. diagonal; Fig. 5b-c). These plots thus demonstrate the extent to which BOLD signals in each region covaried with decision information (e.g. the extent to which the information favoured cardinal), and how this covariation was dampened by difference to the previous sample, separately for trials on which the participants chose cardinal and where they chose diagonal.

Supplementary Material

1

REFERENCES

  1. Aston-Jones G, Cohen JD. Adaptive gain and the role of the locus coeruleus-norepinephrine system in optimal performance. J Comp Neurol. 2005;493:99–110. doi: 10.1002/cne.20723. [DOI] [PubMed] [Google Scholar]
  2. Barlow H. Sensory Communication. MIT Press; 1961. Possible principles underlying the transformation of sensory messages. [Google Scholar]
  3. Bartlett NR. Dark and light adaptation. In: Graham CH, editor. Vision and visual perception. John Wiley and Sons, Inc; New York: 1965. 1965. (New York: John Wiley and Sons, Inc.) [Google Scholar]
  4. Beck JM, Ma WJ, Kiani R, Hanks T, Churchland AK, Roitman J, Shadlen MN, Latham PE, Pouget A. Probabilistic population codes for Bayesian decision making. Neuron. 2008;60:1142–1152. doi: 10.1016/j.neuron.2008.09.021. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Bogacz R, Brown E, Moehlis J, Holmes P, Cohen JD. The physics of optimal decision making: a formal analysis of models of performance in two-alternative forced-choice tasks. Psychol Rev. 2006;113:700–765. doi: 10.1037/0033-295X.113.4.700. [DOI] [PubMed] [Google Scholar]
  6. Bonnasse-Gahot L, Nadal JP. Neural coding of categories: information efficiency and optimal population codes. J Comput Neurosci. 2008;25:169–187. doi: 10.1007/s10827-007-0071-5. [DOI] [PubMed] [Google Scholar]
  7. Boorman ED, Behrens TE, Woolrich MW, Rushworth MF. How green is the grass on the other side? Frontopolar cortex and the evidence in favor of alternative courses of action. Neuron. 2009;62:733–743. doi: 10.1016/j.neuron.2009.05.014. [DOI] [PubMed] [Google Scholar]
  8. Carandini M, Ferster D. A tonic hyperpolarization underlying contrast adaptation in cat visual cortex. Science. 1997;276:949–952. doi: 10.1126/science.276.5314.949. [DOI] [PubMed] [Google Scholar]
  9. Carandini M, Heeger DJ. Normalization as a canonical neural computation. Nat Rev Neurosci. 2012;13:51–62. doi: 10.1038/nrn3136. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Cohen JD, Aston-Jones G. Cognitive neuroscience: decision amid uncertainty. Nature. 2005;436:471–472. doi: 10.1038/436471a. [DOI] [PubMed] [Google Scholar]
  11. de Gardelle V, Summerfield C. Robust averaging during perceptual judgment. Proc Natl Acad Sci U S A. 2011;108:13341–13346. doi: 10.1073/pnas.1104517108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. de Lange FP, van Gaal S, Lamme VA, Dehaene S. How awareness changes the relative weights of evidence during human decision-making. PLoS Biol. 2011;9:e1001203. doi: 10.1371/journal.pbio.1001203. [DOI] [PMC free article] [PubMed] [Google Scholar]
  13. Deneve S. Making decisions with unknown sensory reliability. Front Neurosci. 2012;6:75. doi: 10.3389/fnins.2012.00075. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. Drugowitsch J, Moreno-Bote R, Churchland AK, Shadlen MN, Pouget A. The cost of accumulating evidence in perceptual decision making. J Neurosci. 2012;32:3612–3628. doi: 10.1523/JNEUROSCI.4010-11.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Eldar E, Cohen JD, Niv Y. The effects of neural gain on attention and learning. Nat Neurosci. 2013 doi: 10.1038/nn.3428. [DOI] [PMC free article] [PubMed] [Google Scholar]
  16. Ernst MO, Banks MS. Humans integrate visual and haptic information in a statistically optimal fashion. Nature. 2002;415:429–433. doi: 10.1038/415429a. [DOI] [PubMed] [Google Scholar]
  17. Fairhall AL, Lewen GD, Bialek W, de Ruyter Van Steveninck RR. Efficiency and ambiguity in an adaptive neural code. Nature. 2001;412:787–792. doi: 10.1038/35090500. [DOI] [PubMed] [Google Scholar]
  18. Feldman NH, Griffiths TL, Morgan JL. The influence of categories on perception: explaining the perceptual magnet effect as optimal statistical inference. Psychol Rev. 2009;116:752–782. doi: 10.1037/a0017196. [DOI] [PMC free article] [PubMed] [Google Scholar]
  19. Filimon F, Philiastides MG, Nelson JD, Kloosterman NA, Heekeren HR. How embodied is perceptual decision making? Evidence for separate processing of perceptual and motor decisions. J Neurosci. 2013;33:2121–2136. doi: 10.1523/JNEUROSCI.2334-12.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Friston K. A theory of cortical responses. Philos Trans R Soc Lond B Biol Sci. 2005;360:815–836. doi: 10.1098/rstb.2005.1622. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Gilzenrat MS, Holmes BD, Rajkowski J, Aston-Jones G, Cohen JD. Simplified dynamics in a model of noradrenergic modulation of cognitive performance. Neural Netw. 2002;15:647–663. doi: 10.1016/s0893-6080(02)00055-2. [DOI] [PubMed] [Google Scholar]
  22. Gold JI, Shadlen MN. Neural computations that underlie decisions about sensory stimuli. Trends Cogn Sci. 2001;5:10–16. doi: 10.1016/s1364-6613(00)01567-9. [DOI] [PubMed] [Google Scholar]
  23. Gold JI, Shadlen MN. The neural basis of decision making. Annu Rev Neurosci. 2007;30:535–574. doi: 10.1146/annurev.neuro.29.051605.113038. [DOI] [PubMed] [Google Scholar]
  24. Greenlee MW, Heitger F. The functional role of contrast adaptation. Vision Res. 1988;28:791–797. doi: 10.1016/0042-6989(88)90026-0. [DOI] [PubMed] [Google Scholar]
  25. Grinband J, Hirsch J, Ferrera VP. A neural representation of categorization uncertainty in the human brain. Neuron. 2006;49:757–763. doi: 10.1016/j.neuron.2006.01.032. [DOI] [PubMed] [Google Scholar]
  26. Heeger DJ. Normalization of cell responses in cat striate cortex. Vis Neurosci. 1992;9:181–197. doi: 10.1017/s0952523800009640. [DOI] [PubMed] [Google Scholar]
  27. Heekeren HR, Marrett S, Ungerleider LG. The neural systems that mediate human perceptual decision making. Nat Rev Neurosci. 2008;9:467–479. doi: 10.1038/nrn2374. [DOI] [PubMed] [Google Scholar]
  28. Ho TC, Brown S, Serences JT. Domain general mechanisms of perceptual decision making in human cortex. J Neurosci. 2009;29:8675–8687. doi: 10.1523/JNEUROSCI.5984-08.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Kahneman D, Slovic P, Tversky A. Judgment Under Uncertainty: Heuristics and Biases. Cambridge University Press; New York: 1982. [DOI] [PubMed] [Google Scholar]
  30. Kok P, Jehee JF, de Lange FP. Less is more: expectation sharpens representations in the primary visual cortex. Neuron. 2012;75:265–270. doi: 10.1016/j.neuron.2012.04.034. [DOI] [PubMed] [Google Scholar]
  31. Liu T, Pleskac TJ. Neural correlates of evidence accumulation in a perceptual decision task. J Neurophysiol. 2011;106:2383–2398. doi: 10.1152/jn.00413.2011. [DOI] [PubMed] [Google Scholar]
  32. Louie K, Glimcher PW. Efficient coding and the neural representation of value. Ann N Y Acad Sci. 2012;1251:13–32. doi: 10.1111/j.1749-6632.2012.06496.x. [DOI] [PubMed] [Google Scholar]
  33. Louie K, Khaw MW, Glimcher PW. Normalization is a general neural mechanism for context-dependent decision making. Proc Natl Acad Sci U S A. 2013 doi: 10.1073/pnas.1217854110. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Ma WJ, Beck JM, Latham PE, Pouget A. Bayesian inference with probabilistic population codes. Nat Neurosci. 2006;9:1432–1438. doi: 10.1038/nn1790. [DOI] [PubMed] [Google Scholar]
  35. Naka KI, Rushton WA. S-potentials from colour units in the retina of fish (Cyprinidae) J Physiol. 1966;185:536–555. doi: 10.1113/jphysiol.1966.sp008001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Nassar MR, Rumsey KM, Wilson RC, Parikh K, Heasly B, Gold JI. Rational regulation of learning dynamics by pupil-linked arousal systems. Nat Neurosci. 2012;15:1040–1046. doi: 10.1038/nn.3130. [DOI] [PMC free article] [PubMed] [Google Scholar]
  37. Nickerson RS. Confirmation bias: a ubiquitous phenomenon in many guises. Review of General Psychology. 1998;2:175–220. [Google Scholar]
  38. Ossmy O, Moran R, Pfeffer T, Tsetsos K, Usher M, Donner TH. The timescale of perceptual evidence integration can be adapted to the environment. Curr Biol. 2013;23:981–986. doi: 10.1016/j.cub.2013.04.039. [DOI] [PubMed] [Google Scholar]
  39. Padoa-Schioppa C. Range-adapting representation of economic value in the orbitofrontal cortex. J Neurosci. 2009;29:14004–14014. doi: 10.1523/JNEUROSCI.3751-09.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  40. Peirce CS. The probability of induction. Popular Science Monthly. 1878;12:705–718. [Google Scholar]
  41. Poldrack RA, Clark J, Pare-Blagoev EJ, Shohamy D, Creso Moyano J, Myers C, Gluck MA. Interactive memory systems in the human brain. Nature. 2001;414:546–550. doi: 10.1038/35107080. [DOI] [PubMed] [Google Scholar]
  42. Ratcliff R, McKoon G. The diffusion decision model: theory and data for two-choice decision tasks. Neural Comput. 2008;20:873–922. doi: 10.1162/neco.2008.12-06-420. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Shepard RN. Toward a universal law of generalization for psychological science. Science. 1987;237:1317–1323. doi: 10.1126/science.3629243. [DOI] [PubMed] [Google Scholar]
  44. Soltani A, De Martino B, Camerer C. A range-normalization model of context-dependent choice: a new model and evidence. PLoS Comput Biol. 2012;8:e1002607. doi: 10.1371/journal.pcbi.1002607. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Summerfield C, Behrens TE, Koechlin E. Perceptual classification in a rapidly changing environment. Neuron. 2011;71:725–736. doi: 10.1016/j.neuron.2011.06.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  46. Summerfield C, Egner T. Expectation (and attention) in visual cognition. Trends Cogn Sci. 2009;13:403–409. doi: 10.1016/j.tics.2009.06.003. [DOI] [PubMed] [Google Scholar]
  47. Summerfield C, Mangels JA. Coherent theta-band EEG activity predicts item-context binding during encoding. Neuroimage. 2005;24:692–703. doi: 10.1016/j.neuroimage.2004.09.012. [DOI] [PubMed] [Google Scholar]
  48. Summerfield C, Trittschuh EH, Monti JM, Mesulam MM, Egner T. Neural repetition suppression reflects fulfilled perceptual expectations. Nat Neurosci. 2008;11:1004–1006. doi: 10.1038/nn.2163. [DOI] [PMC free article] [PubMed] [Google Scholar]
  49. Tenenbaum JB, Griffiths TL. Generalization, similarity, and Bayesian inference. Behav Brain Sci. 2001;24:629–640. doi: 10.1017/s0140525x01000061. discussion 652-791. [DOI] [PubMed] [Google Scholar]
  50. Todorovic A, van Ede F, Maris E, de Lange FP. Prior expectation mediates neural adaptation to repeated sounds in the auditory cortex: an MEG study. J Neurosci. 2011;31:9118–9123. doi: 10.1523/JNEUROSCI.1425-11.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
  51. Tremblay L, Schultz W. Relative reward preference in primate orbitofrontal cortex. Nature. 1999;398:704–708. doi: 10.1038/19525. [DOI] [PubMed] [Google Scholar]
  52. Tsetsos K, Chater N, Usher M. Salience driven value integration explains decision biases and preference reversal. Proceedings of the National Academy of Sciences of the United States of America. 2012a;109:9659–9664. doi: 10.1073/pnas.1119569109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  53. Tsetsos K, Gao J, McClelland JL, Usher M. Using Time-Varying Evidence to Test Models of Decision Dynamics: Bounded Diffusion vs. the Leaky Competing Accumulator Model. Front Neurosci. 2012b;6:79. doi: 10.3389/fnins.2012.00079. [DOI] [PMC free article] [PubMed] [Google Scholar]
  54. Usher M, Cohen JD, Servan-Schreiber D, Rajkowski J, Aston-Jones G. The role of locus coeruleus in the regulation of cognitive performance. Science. 1999;283:549–554. doi: 10.1126/science.283.5401.549. [DOI] [PubMed] [Google Scholar]
  55. Usher M, McClelland JL. The time course of perceptual choice: the leaky, competing accumulator model. Psychol Rev. 2001;108:550–592. doi: 10.1037/0033-295x.108.3.550. [DOI] [PubMed] [Google Scholar]
  56. Vlaev I, Chater N, Stewart N, Brown GD. Does the brain calculate value? Trends Cogn Sci. 2011;15:546–554. doi: 10.1016/j.tics.2011.09.008. [DOI] [PubMed] [Google Scholar]
  57. Wald A, Wolfowitz J. Bayes Solutions of Sequential Decision Problems. Proc Natl Acad Sci U S A. 1949;35:99–102. doi: 10.1073/pnas.35.2.99. [DOI] [PMC free article] [PubMed] [Google Scholar]
  58. Wang XJ. Probabilistic decision making by slow reverberation in cortical circuits. Neuron. 2002;36:955–968. doi: 10.1016/s0896-6273(02)01092-9. [DOI] [PubMed] [Google Scholar]
  59. Webb BS, Tinsley CJ, Barraclough NE, Parker A, Derrington AM. Gain control from beyond the classical receptive field in primate primary visual cortex. Vis Neurosci. 2003;20:221–230. doi: 10.1017/s0952523803203011. [DOI] [PubMed] [Google Scholar]
  60. Wierda SM, van Rijn H, Taatgen NA, Martens S. Pupil dilation deconvolution reveals the dynamics of attention at high temporal resolution. Proc Natl Acad Sci U S A. 2012;109:8456–8460. doi: 10.1073/pnas.1201858109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  61. Wiggs CL, Martin A. Properties and mechanisms of perceptual priming. Curr Opin Neurobiol. 1998;8:227–233. doi: 10.1016/s0959-4388(98)80144-x. [DOI] [PubMed] [Google Scholar]
  62. Wyart V, de Gardelle V, Scholl J, Summerfield C. Rhythmic fluctuations in evidence accumulation during decision making in the human brain. Neuron. 2012a;76:847–858. doi: 10.1016/j.neuron.2012.09.015. [DOI] [PMC free article] [PubMed] [Google Scholar]
  63. Wyart V, Nobre AC, Summerfield C. Dissociable prior influences of signal probability and relevance on visual contrast sensitivity. Proc Natl Acad Sci U S A. 2012b;109:3593–3598. doi: 10.1073/pnas.1120118109. [DOI] [PMC free article] [PubMed] [Google Scholar]
  64. Yang T, Shadlen MN. Probabilistic reasoning by neurons. Nature. 2007;447:1075–1080. doi: 10.1038/nature05852. [DOI] [PubMed] [Google Scholar]
  65. Zylberberg A, Oliva M, Sigman M. Pupil dilation: a fingerprint of temporal selection during the “attentional blink”. Front Psychol. 2012;3:316. doi: 10.3389/fpsyg.2012.00316. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1

RESOURCES