The neural computation of inconsistent choice behavior

Vered Kurtz-David; Dotan Persitz; Ryan Webb; Dino J Levy

doi:10.1038/s41467-019-09343-2

. 2019 Apr 5;10:1583. doi: 10.1038/s41467-019-09343-2

The neural computation of inconsistent choice behavior

Vered Kurtz-David ¹, Dotan Persitz ¹, Ryan Webb ^2,³, Dino J Levy ^1,^4,^✉

PMCID: PMC6450930 PMID: 30952855

Abstract

Humans are often inconsistent (irrational) when choosing among simple bundles of goods, even without any particular changes to framing or context. However, the neural computations that give rise to such inconsistencies are still unknown. Similar to sensory perception and motor output, we propose that a substantial component of inconsistent behavior is due to variability in the neural computation of value. Here, we develop a novel index that measures the severity of inconsistency of each choice, enabling us to directly trace its neural correlates. We find that the BOLD signal in the vmPFC, ACC, and PCC is correlated with the severity of inconsistency on each trial and with the subjective value of the chosen alternative. This suggests that deviations from rational choice arise in the regions responsible for value computation. We offer a computational model of how variability in value computation is a source of inconsistent choices.

Humans are often inconsistent when choosing between alternatives, but the neural basis of deviations from economic rationality is unclear. Here, the authors show that irrational choices arise in the same brain regions responsible for value computation, implying that brain ‘noise’ may underlie inconsistency.

Introduction

A fundamental axiom in neoclassical theories of choice behavior is that the decision-maker is consistent in her choices. For example, if a decision-maker chooses some combination of milk and cookies (bundle A) over another combination of milk and cookies (bundle B) and also chooses bundle B when a third bundle C was available, then—if she is consistent in her choices—she should not strictly prefer bundle C over bundle A in any subsequent choice. Consistency is the fundamental axiom underlying rational behavior and the neoclassical construct of utility maximization¹.

The study of consistency was formalized with the Generalized Axiom of Revealed Preference (GARP)². Despite the centrality of rational behavior in neoclassical economics, since at least the 1950s studies have demonstrated that the choice behavior of humans violate consistency when choice sets are manipulated or framed^3–6. Such results have given rise to a behavioral approach to decision-making, in which agents are not strictly treated as consistent or rational⁴. Explanations for such anomalies establish that the human decision process is limited by, or maladapted to, the particular choice context under the study. For example, agents might simplify the choice process by using various heuristics^6–8, misunderstand the problem⁹, or in some cases, inconsistency might arise due to a limited cognitive capacity¹⁰.

However, more recently, lab experiments and real consumption data suggest that a degree of choice inconsistency might be present in human decision-making, even absent any particular framing or context induced by the experimenter. Subjects are often inconsistent and violate GARP^11–15 even when choosing over simple bundles; for example, simply switching their choices when presented with the same lotteries^16,17. Therefore, a degree of inconsistent behavior may be fundamental to the human decision-making process. Indeed, economic theorists have proposed that the valuations which underlie choice may themselves be stochastic^18,19. By design, these theories place weak constraints on the pattern of choice behavior because utilities are typically assumed to be unobservable. Neuroscientific methods, on the other hand, suggest a stronger test of this hypothesis²⁰. Many previous studies have identified several brain regions—primarily the ventral striatum (vStr), the ventromedial prefrontal cortex (vmPFC) and the posterior cingulate cortex (PCC)—that correlate with a utility function fit to choice behavior irrespective of the reward type^21–23. Whether the value representations in these areas obey consistency, and how this network might give rise to inconsistent choice behavior, is unknown.

Similar to sensory perception and motor output, we propose that a substantial component of inconsistent behavior is due to variability in the neural computation of value. There is ample evidence demonstrating that both the behavioral and neural responses to the same sensory input are variable^24–26. The predominant explanation for this phenomenon is due to a fundamental property of the nervous system—the inherent variability in neural activity^27,28. We propose that the same variability is responsible for inconsistency in choice behavior when the framing or the choice context is stable. Our modeling approach is an application of the Random Utility framework^18,20, a parsimonious account of the aggregation of value signals and neural variability in the course of a decision^29–32. In our model, valuations of choice options are inherently stochastic, and the skewed nature of neuronal activity³³ implies that the severity of an inconsistent choice results from larger fluctuations in the computation of value. This prediction might seem surprising. After all, one might not expect that a brain region which computes the valuations of choice alternatives is more active when those valuations are contradicted in an inconsistent choice. To asses this hypothesis, we measure the severity of an inconsistent choice on a trial-by-trial basis, and identify the neural correlates of inconsistent choices.

Measuring the severity of inconsistent choice presents a challenge for the analysis of the neural data. Existing methods for measuring inconsistency either count the number of GARP violations or estimate the extent of (hypothetical) changes to the dataset required to make choices consistent^{14,15,34–36}; therefore, they all assign one inconsistency score per subject. This aggregation is severely limited because it yields only a simple between-subject analysis based on the average neural activity over all choices. By construction, it ignores the trial-by-trial variation in both behavior and neural activity. Such variation provides information not just about which subjects are more inconsistent, but also the level of activity in different brain areas during an inconsistent choice. To overcome this limitation, we develop a trial-specific inconsistency index, which measures the severity of inconsistency contributed by each choice. We apply our novel index to a well-established choice task, and use it to explore the neural computation underlying inconsistent choices on a trial-by-trial basis.

A few neuroscientific studies have previously examined consistency, albeit using only aggregate-level inconsistency indices. Patients with lesions in the medial prefrontal cortex (mPFC) violate GARP and transitivity more often than non-lesioned controls, suggesting this region is necessary for consistent behavior^37,38. GARP violations increase with aging and are negatively correlated with gray-matter volumes in the ventrolateral prefrontal cortex³⁹. Neural correlates of intransitive lottery choices have been observed in BOLD signal from the vStr, anterior cingulate cortex (ACC) and the dorsolateral prefrontal cortex (dlPFC)⁴⁰. Although these studies have identified which brain regions are involved in inconsistent choices across subjects, no study has yet examined how such behavior might arise in healthy human brains on a trial-by-trial basis.

Therefore, we examine, using fMRI, the neural basis of choice inconsistency on each trial of a choice task. Importantly, in our task, subjects make choices over lotteries holding the framing fixed. Using our novel trial-specific inconsistency index, this design allows us to assess the degree of a violation of inconsistency on each trial. We then search for correlates to this index in the BOLD signal, with both a whole-brain analysis and a region of interest (ROI) analysis of brain regions known to participate in value-based choice: the vmPFC^21–23, the vStr^21,22, the PCC^22,41, and the ACC⁴², which is also related to choice difficulty, foraging, control, and monitoring^43,44. We find that the BOLD signal from these regions correlates with both inconsistency levels and utility. These findings are consistent with our computational model explaining how inconsistent choice behavior might arise in value-related regions.

Results

A novel trial-specific inconsistency index

For a systematic search for the neural computations that give rise to inconsistent choices, we propose an index to measure the severity of inconsistency on a given trial. Our novel index is based on a Leave-One-Out procedure (Fig. 1b) applied to the Money Metric Index (MMI), first introduced by Halevy et al.⁴⁵. The MMI is a parametric measure of the extent of GARP violations in a dataset of choices from linear budget sets. It measures the minimal adjustments (in percentages) of the budget lines required to reconcile the decision-maker’s choices with the best-fitting parametric utility function (see the Methods section and Fig. 1a).

Fig. 1 — A trial-specific inconsistency index. a Computation of the MMI. Let u(∙) be some utility function and let xⁱ be the bundle chosen by the subject in the trial i. A utility function induces a complete ranking on the bundles, i.e., if $u (x) > u (y)$ , then the bundle x is ranked above bundle y. Also, by the revealed preference principle⁷², when the subject chooses bundle xⁱ, she reveals that she ranks this bundle over all other feasible bundles. These two rankings may be incompatible as shown in the figure: u(∙) ranks all the bundles in the purple bold interval as better than bundle xⁱ, while the subject ranked xⁱ as her most desired bundle in the given budget set. The extent of this incompatibility is measured by computing the maximal expenditure for which the two rankings agree (the minimal parallel inward adjustment of the budget line). Given the utility function u(∙), we use average sum of squares to aggregate these adjustments over all observations. For a set of utility functions $U$ , we choose the utility function u(∙) for which the aggregate adjustment is minimal. We refer to the aggregate adjustment of this u(∙) as the MMI for the entire dataset D given the set of utility functions $U$ . b Leave-One-Out procedure. Denote the dataset by D. Denote the dataset that is generated by removing observation i from dataset D by $D_{- i}$ . For each observation i, the index is the difference between the aggregate index ${MMI}_{D}$ calculated for the entire dataset D and the aggregate index ${MMI}_{D_{- i}}$ calculated for the partial dataset $D_{- i}$ . Formally, the index for observation i is ${MMI}_{D} - {MMI}_{D_{- i}}$

For each trial, our index (trial-specific MMI) calculates the difference between the Aggregate MMI index (calculated over all observations) and the MMI index calculated over all observations less the given trial. Hence, our trial-specific MMI index measures the severity of inconsistency per trial within a subject (see the Methods section). We then use it as a regressor to track the neural correlates of choice inconsistency.

A key benefit of the Aggregate MMI index is that it also yields parameter estimates of the subject’s utility function. These subject-specific parameters may be used to estimate the subjective value (SV) assigned to the chosen bundle in each trial. We then use these SVs as another parametric regressor to identify the neural correlates of value modulation. Since SV does not depend on the specifics of the budget line (relative prices and endowment), while the trial-specific MMI depends on both, there is orthogonal information in the two regressors. Simultaneous identification of the severity of inconsistency on each trial, and the SV of that same trial, enables us to probe for the neural correlates of inconsistent choice behavior.

Behavior

Subjects made choices from linear budgets in the context of risk (following Choi et al.)¹³ inside the fMRI scanner. On each trial, subjects were presented with a set of 50/50 lotteries between two accounts, X and Y, and were asked to choose their preferred lottery (bundle). All possible lotteries in a given trial were represented along a budget line. The price ratios (slopes of the budget lines) and endowments were randomized across trials and subjects (Fig. 2, see the Methods section).

Experimental task (following Choi et al.¹³). a A Trial: Subjects were presented a visualization of a budget line with 50/50 lotteries between two accounts, labeled X and Y. Each point on the budget line represents a different lottery between the X and Y accounts. Subjects used a trackball to choose their preferred lottery (a bundle of the X and Y accounts) out of all possible lotteries along the line. For example, as depicted in Fig. 2a, the bundle (11,72) corresponds to a lottery with a 50% chance to win 11 tokens (account X), and a 50% chance to win 72 tokens (account Y), where 1 token equals 5 NIS (~$1.5). We varied and randomized the budget lines (slopes and endowments) across trials and subjects. At the end of the experiment, one trial was randomly selected as well as one of the accounts. The subject received the tokens she had allocated to the selected account in the selected trial. b Behavioral example: Given this budget line, the subject could choose A (10,35) with a 50% chance of winning 10 tokens (account X) and 50% chance of winning 35 tokens (account Y); or similarly, B (45,10). An extremely risk-seeking subject would choose C, the lottery with the maximal expected payoff, yielding a 50% chance of winning 60 tokens and a 50% chance of winning nothing. By contrast, an extremely risk-averse subject would choose D, the intersection with the 45-degree line. This bundle is a degenerate lottery, which allocates the same number of tokens for both X and Y accounts. c Timeline: Inside the scanner, subjects had a maximum of 12 s time window to make their choice, followed by a 9 s variable ITI. If subjects made a choice before the end of the 12 s time window, the remaining time was added to the ITI. There were 27 trials in each block, 4 blocks, for a total of 108 trials. Subjects completed a pre-scan questionnaire (see Supplementary Note 7 for an English version) and a practice block with a trackball outside the scanner to make sure the instructions and procedures were clear

In line with previous literature, none of the subjects’ choices satisfied GARP. However, there was considerable evidence that the subjects understood the task and did not behave randomly (Fig. 3a, c, Fig. 4a and Supplementary Note 2). Some predominant patterns in behavior are depicted in Fig. 4a (see Supplementary Figure 7 for scatterplots of all subjects). Figure 3b presents the recovered utility parameters in the sample (see Supplementary Table 6 for the individual recovered parameters). A comparison with the Choi et al.¹³ study reveals that the distributions of the Afriat inconsistency index (Methods and Supplementary Note 1)^2,34 are quite similar (Fig. 3c).

Fig. 4 — Representative subjects. a Scatterplots of prominent behaviors: The y-axis represents the share of tokens allocated to the Y account as a function of the log price ratio, log(p_x / _y) (x-axis). As the log price ratio increases, account Y becomes relatively cheaper. Subject 410 equalized expenditures between the two accounts, as she divided tokens proportionally to the price ratio (Cobb–Douglas preferences). Subject 104 exhibited similar behavior, but chose to allocate the entire endowment to the cheaper account in extreme slopes. Subject 403 chose the safe bundle, when the prices of X and Y were relatively similar and allocated most or all her tokens to the cheaper account when the price ratio between the accounts was relatively high (steeper slopes). Even a highly inconsistent subject, like subject 203, was sensitive to changes in prices, with the share of tokens to the Y-account declining as its price rises. b Variability of trial-specific MMI: Distributions of the trial-specific MMI for the four representative subjects from panel a showcase the heterogeneity of trial-specific MMI scores across and within subjects. For example, subject 203 was highly inconsistent throughout most of her trials, while subject 403 was mostly consistent

The parametric Aggregate MMI index is highly correlated with existing aggregate nonparametric indices (Fig. 3d, Afriat index: Spearman’s ρ = 0.538, p < 0.001; Supplementary Figure 3a, the number of GARP violations: ρ = 0.703, p < 0.001), suggesting that, although parametric, the Aggregate MMI is a good measure of inconsistency⁴⁵. Compared with nonparametric indices, there was considerable variability in the trial-specific MMI, therefore it can be used as a trial-by-trial regressor for neural activity (Fig. 4b).

Neuroimaging

We identify brain areas that correlate with the severity of inconsistency. A random-effect generalized linear model on the BOLD signal revealed that trial-specific MMI was positively correlated with activations in the mPFC and ACC (p < 0.0005, cluster-size corrected, Fig. 5a). This suggests that higher activation in these brain areas is correlated with more severe inconsistent choices on a given trial.

Fig. 5 — Neuroimaging results. a–d Whole brain: Results of RFX GLM, n = 33, p < 0.0005, cluster-size correction, x = 0 (MNI coordinates). Model regression: $\begin{matrix} BOLD = β_{0} + β_{1} RT + β_{2} {MMI}_{trial_specific} + β_{3} SV + β_{4} priceratio + β_{5} endowment \end{matrix}$ . Six additional motion-correction regressors were included as regressors of no interest. a Neural correlates of the trial-specific MMI. b Neural correlates of the SV: We present results for the frontal lobe. Other activations are detailed in Supplementary Table 4. c Conjunction analysis. d Overlay. e ROI: The ROI analysis revealed that choice inconsistency was correlated with activation in the dACC, vmPFC (p(Bonferroni) < 0.05) and PCC (p < 0.0005, cluster-size correction), but neither with vStr nor with V1. RFX GLM, n = 33, regression model as in a–d. For illustration purposes, we set the threshold to p < 0.001 (more stringent than an FDR correction). f Subject level: Subject-level analysis representing overlap of the SV and trial-specific MMI in the vmPFC, dACC, and PCC ROIs. For each subject, we conducted a conjunction analysis on the brain areas that significantly tracked trial-specific MMI and SV. Most subjects had an overlap region in the vmPFC (21 of 33), dACC (18 of 33), and PCC (21 of 33). FFX GLM, regression model as in a–d. We set a liberal threshold of p < 0.15 due to lack of statistical power. For three subjects, the threshold was set to p < 0.2, MNI coordinates (see Supplementary Table 5 as well). g Comparison of subject-level mean effects (% signal change, β values) of trial-specific MMI and SV in the vmPFC, dACC and PCC, using two-sided Wilcoxon sign-rank test (n.s. not significant, **p < 0.01). In all panels a–f, results are shown on the Colin 152-MNI brain

To verify that these areas also track value in our task (as previously reported^21–23), we calculated the SV of the chosen bundle on each trial for each subject. The SV regressor was also positively correlated with activations in the mPFC and ACC (p < 0.0005, cluster-size corrected, Fig. 5b). A conjunction analysis revealed that both inconsistency and value modulations were correlated with activation in the mPFC and ACC (Fig. 5c, d, 1456 overlapping voxels, 28.7% of the SV cluster). This substantial overlap suggests that the neural computations that give rise to inconsistent choices, hence deviations from rational choice, are related to the neural computations of value.

To increase the power of our analysis, we repeated our analysis on specific ROIs. Based on existing literature, we examined the vmPFC^21–23, vStr^21,22, dACC⁴², and PCC^22,41, to test if choice inconsistency is related to value-based circuits. We also examined V1 as a control area. The ACC, vmPFC, and PCC were positively correlated with trial-specific MMI (p(Bonferroni) < 0.05, in the vmPFC and ACC, p < 0.0005, cluster-size corrected in PCC) and SV (p(Bonferroni) < 0.05, in the vmPFC and ACC, q(FDR) < 0.05 in the PCC), though we did not find any significant activation in the vStr. As expected, V1 was not correlated with trial-specific MMI (Fig. 5e), suggesting that only value-related regions are involved with choice inconsistency. These results corroborate the whole-brain analysis.

To verify that this group-level overlap reflects an overlap at the single-subject level, we also searched for conjunct areas on a subject-by-subject basis. In 24 of 33 subjects, there was a conjunct region between trial-specific MMI and SV in one or more of the ROIs: vmPFC, dACC, and PCC (Fig. 5f, Supplementary Table 5). The hypothesis that the subject-specific mean effects (β values) for SV and trial-specific MMI were the same in the vmPFC and dACC could not be rejected (Wilcoxon sign-rank test, p = 0.0504 and p = 0.2877, respectively, multiple comparison corrected). However, for the PCC ROI, the difference was significant (p < 0.005). This suggests that the SV and trial-specific MMI predictors both have an important effect on the BOLD signal in the vmPFC and dACC (see Fig. 5g).

Motivation for using trial-specific MMI

To demonstrate the power and necessity of our trial-specific analysis, we also assessed whether a standard between-subject analysis using aggregate indices could identify the same brain areas found in our trial-by-trial analysis. We did not find significant correlations between any of the aggregate indices (aggregate MMI, Afriat index, and number of GARP violations) and the average change in the BOLD signal in any of the predefined ROIs (Supplementary Table 3).

In addition, we also examined whether similar activations could be found using a nonparametric trial-specific inconsistency index. The same Leave-One-Out procedure was implemented on the number of GARP violations, with no functional form assumptions (henceforth trial-specific violations). At the behavioral level, we found a significant correlation with our parametric trial-specific MMI index, (β = 0.0009, p < 0.0001, cluster regression). However, the RFX-GLM analysis using the nonparametric trial-specific violations index as a regressor did not yield any significant voxels (even with a more liberal threshold, Supplementary Figure 5b). This is likely due to the low variability of the trial-specific violations regressor (73.4% of the data points equal to 0, Supplementary Figure 5a). The Afriat index also yielded a null result (Supplementary Figure 5c). These null results provide additional motivation for using a parametric index with high variability across trials and subjects.

A model of valuation and inconsistent choices

We now propose a model in which neural variability can generate choice inconsistencies compatible with our empirical findings, including the observation that the BOLD activity correlates positively with both SV and inconsistent choice behavior.

Consider a decision-maker choosing between two alternatives {1, 2} with a valuation for each alternative given by v₁ > v₂. Define the first alternative—with the larger valuation—as the consistent choice (i.e., it obeys an ordered utility function u(∙), Supplementary Note 1). During a choice, the neural computations which encode and compare valuations are subject to variability. This is represented by a random utility comprised of the valuation v_i plus a random term e_i.

ṽ_{i} = v_{i} + e_{i}

The alternative with the largest random utility is chosen, therefore the decision-maker might choose inconsistently due to the random component (i.e., $ṽ_{2} > ṽ_{1}$ , Fig. 6a). The probability of inconsistent choice is determined by two factors:

The distribution of $ṽ_{i} .$

A skewed distribution of neural activity is observed in various contexts.³³ Therefore, we assume a skewed distribution for $ṽ_{i}$ , and use the log normal and generalized extreme value distributions as examples.
The difference between v₁ and v₂.

As the gap between valuations increases, the realization of the random component for the inconsistent alternative must be relatively larger for it to be chosen.

Fig. 6 — The neural random utility model with two alternatives. a Inconsistent binary choice: A draw (red) of utilities from a log normal distribution with mean v₁ = 4 and v₂ = 2, (log s.d. = 0.7). In this draw, $ṽ_{2} > ṽ_{1}$ , so the inconsistent option (with lower mean valuation) is chosen. b Average random utility: A sample of 2,000,000 utilities from the distribution in a. The average utility is higher when the inconsistent option is chosen. c Larger difference: A sample of utilities from distributions with a larger difference in mean valuations (log normal, means v₁ = 7 and v₂ = 2, (log s.d. = 0.7). The average utility of an inconsistent choice is larger when the choice is *more* inconsistent (the difference in mean valuations is larger compared to b)

The implications of this model for the neural data are the following: if the BOLD signal correlates with random utility, this signal will be higher when choices are more inconsistent, because the random component must make up the gap between the higher and lower valuation when the inconsistent option is chosen (Fig. 6a). Because the error distribution is skewed, it is larger on average when the gap in valuations is overcome (Fig. 6b), and particularly so when the gap between the consistent and inconsistent options is larger (Fig. 6c). Therefore, the model predicts that valuation regions of the brain will be more active on inconsistent choices.

The proposed relation between the BOLD signal and choice inconsistency also holds as the number of alternatives increases beyond our binary choice example. Define the choice of alternatives with lower valuations as more inconsistent. When a low value alternative is added to a choice set, its corresponding random component must be larger to overcome the utilities of all higher-valued alternatives. Moreover, the largest realization of an error is required for the lowest value option to be chosen, so on average the largest BOLD signal will arise on a trial in which the lowest valuation alternative is chosen. As the number of low value alternatives increases, this will yield a corresponding increase in the BOLD signal on more inconsistent trials (Fig. 7). This is true whether the BOLD signal correlates with the random utility of the chosen option or the aggregate random utility of all options (Supplementary Figure 8b). Moreover, it is robust over a range of error distributions which have the skewed property consistent with neural activity (Supplementary Figure 8a).

Fig. 7 — The neural random utility model with six alternatives. The alternatives are ranked in value (v₁ = 7,…,v₆ = 2) with the highest valued alternative termed the consistent choice. For inconsistent choices, the average utility of the chosen option increases as worse alternatives are chosen. This is because the random term, e_i, had to be much larger (e.g., compare Chose 6 vs. Chose 2). Error bars indicate standard errors

To demonstrate that the random fluctuations in value implied by NRUM are closely related to the observed neural activity, we assessed whether the same correlation pattern between the random utilities and inconsistency index could be observed. Based on the behavior of our subjects, we therefore simulated the valuation process implied by the NRUM and examined the correlation between the random utility valuations (equivalent to the BOLD signal) and the inconsistency of their simulated choices. For all subjects, we found significant positive correlation (Table 1 and Supplementary Figure 12), implying that NRUM is consistent with our main empirical finding.

Table 1.

Simulation results

SID	Gumbel dist.	Log normal dist.	SID	Gumbel dist.	Log normal dist.
103	0.1245*	0.1484*	412	0.1084*	0.1105*
104	0.1345*	0.1568*	413	0.1201*	0.1089*
202	0.1048*	0.1341*	414	0.1273*	0.1662*
203	0.1233*	0.1261*	415	0.1227*	0.1175*
204	0.1290*	0.1581*	416	0.0543*	0.0455*
205	0.1266*	0.1288*	417	0.1084*	0.1391*
206	0.1345*	0.1369*	418	0.1312*	0.1500*
401	0.1244*	0.1343*	419	0.0934*	0.0916*
402	0.1002*	0.0914*	420	0.1120*	0.1339*
403	0.0963*	0.1139*	421	0.1303*	0.1301*
404	0.0607*	0.0706*	422	0.0779*	0.0701*
405	0.1148*	0.1143*	424	0.1053*	0.0889*
406	0.1196*	0.1187*	426	0.1240*	0.1259*
407	0.1289*	0.1620*	427	0.1200*	0.1133*
408	0.1208*	0.1298*	428	0.1281*	0.1746*
409	0.1227*	0.1484*	430	0.1365*	0.1563*
410	0.1206*	0.1207*

Open in a new tab

The correlation coefficients r for the pooled simulated series per subject using two skewed distributions for the random fluctuations in value, the Gumbel and log normal (*p < 10⁻¹⁰).

Dissociation of the SV and trial-specific MMI regressors

As previously noted, there is orthogonal information in the SV and the trial-specific MMI, since the SV does not depend on the budget set. Indeed, these regressors are weakly negatively correlated (β = −0.00133, p < 0.001, clustered regression by subjects), even if we control for the expenditure and slope of the budget line. Also, as expected, less than 5% of the variance of the trial-specific MMI is explained by the SV regressor (R² = 0.0496, see subject-level correlations in Supplementary Table 7). By contrast, there is significant positive correlation between the SV and the BOLD signal, and between the trial-specific MMI and BOLD, when these regressors are included both separately and jointly in the GLM. Therefore, SV and trial-specific MMI appear to be dissociated in our analysis. An orthogonality analysis confirms these results (Supplementary Note 5, Supplementary Figure 11a and b).

Finally, a psychophysical interaction (PPI) analysis between our predefined ROIs (vmPFC, dACC, and PCC) and other brain regions revealed that the activity in the seed regions and other brain areas interacted with the SV and trial-specific MMI regressors (Methods and Supplementary Figure 11c). As might be expected, all three seed regions were interacting with motor and visual regions, for reasons likely related to task execution. In addition, we found interaction with other value-related regions, such as the Insula and dlPFC^23,42. Importantly, though, only under the SV context did we find interactions between the seed regions—specifically, the PCC and vmPFC interact with the dACC. This may indicate that choice inconsistency, represented by trial-specific MMI, results from a spontaneous or random process, not coordinated across the different nodes of the “value network”.

Controlling for other sources of choice inconsistency

It is possible that inconsistency arises due to other sources of noise—like imprecision in motor execution or the numerical representation of the choice options. To control for these alternative explanations, we conducted additional analyses using two functional localizers that were collected at the end of the main experiment.

First, in a motor imprecision localizer task, subjects were asked to reach a predefined location marked as a circle on the line (Supplementary Figure 1a). Motor imprecision was measured by the average Euclidean distance between the predefined target and the actual location the subject chose. Across subjects, the average motor noise and the Aggregate MMI were not significantly correlated (r = 0.144, p = 0.511, Supplementary Figure 1c). In addition, as might be expected, frontal lobe activity in premotor and motor areas (peak voxel at [17, 25, 42], MNI coordinates), positively correlated with the imprecision regressor (Supplementary Figure 1d). However, no voxels conjointly represented both motor imprecision and inconsistency level (i.e., trial-specific MMI).

In a second, numerical imprecision localizer task (Supplementary Figure 1b), we estimated the numerical execution of each subject in reaching a predefined {X,Y} coordinate on the line. Numerical imprecision was measured by the average Euclidean distance between the coordinates and the actual location subjects chose. The average numerical execution imprecision per subject was not correlated with the Aggregate MMI (r = 0.105, p = 0.61, Supplementary Figure 1f). We also did not find any neural correlates with the imprecision of numerical execution (Supplementary Figure 1g). These analyses suggest that the neural activations of inconsistency are not due to imprecisions in motor or numerical execution, and are mainly observed in value-related brain areas (Supplementary Figure 1e).

Controlling for choice difficulty

In binary choice tasks, choice difficulty is usually considered to be the difference in the SVs between the two options (ΔSV)—the smaller ΔSV, the more similar the options are, and the higher the difficulty level^44,46–48. Similar intuition holds for continuous choice sets, though a measure of difficulty must account for both the subject’s preferences and the larger number of choices. To address the role of choice difficulty in our dataset, and its possible relationship to choice inconsistency, we propose an index of choice difficulty in continuous choice sets and include it as a control in our main GLM (see the Methods section for details).

In our Choice Simplicity index, the closer its value is to 0, the more difficult is the choice. As expected, difficult choices lead to longer RTs (β = −2.782, p < 0.0001, clustered regression by subjects). Trial-specific MMI is negatively correlated with our Choice Simplicity index (β = −0.0302, p < 0.0001), meaning, the more difficult is the choice problem, the higher is the corresponding inconsistency level. However, the index also explains little of the variance in trial-specific MMI scores (R² = 0.0137), therefore, the relationship between choice difficulty and inconsistency is weak.

It is important to note that the random utility model that we propose predicts these findings. As noted in our modeling, choice difficulty is a key determinant of the probability of inconsistent choice; we would expect that choices are more inconsistent on more difficult problems. However, if there was no variability in valuation, then even the most difficult choice would not lead to inconsistencies (i.e., choice difficulty alone cannot lead to choice inconsistency). It is precisely the variability that “connects” choice difficulty to inconsistency, with the obvious implication that more difficult problems are more likely to produce inconsistency. Therefore, a weak negative correlation between the Choice Simplicity index and the level of inconsistency is expected.

In the RFX GLM analysis, our main findings for trial-specific MMI and SV clusters hold when controlling for difficulty (Supplementary Figure 9a–f). Moreover, the Choice Simplicity index was correlated with ACC activation (among other brain regions, but not the vmPFC and PCC), as extensively suggested by the literature^44,49 (p < 0.0005, cluster-size corrected, Supplementary Figure 9g).

Controlling for the role of confidence in decision-making

Another possible source for choice inconsistency is low levels of confidence in one’s own choice. Following Lebreton et al.⁵⁰, we modeled levels of confidence as the second-order polynomial of SV. Similarly to Lebreton et al.⁵⁰, we find a quadric relationship between RT and confidence levels, when controlling for the first-order polynomial of SV (β = −0.00155, p < 0.005), indicating subjects had the longest RTs in intermediate confidence-level choices.

As expected from the dissociation analysis above, we found that trial-specific MMI was correlated with our measure for confidence (β = −0.000027, p < 0.0001 in a clustered regression by subjects). Such result indicates that low levels of confidence correlate with high inconsistency scores; however, the R² of the model is very low (0.0395). When we added the confidence predictor to the RFX GLM, our main results hold, suggesting that the BOLD activity in mPFC and ACC was larger on inconsistent trials, even after controlling for confidence (Supplementary Figure 10).

Robustness of the trial-specific MMI

We ensured our results remained robust even after controlling for changes in heuristics over the blocks of the experiment (Supplementary Note 3, Supplementary Figure 4 and Supplementary Tables 1 and 2). The results remain unchanged also when we control for misspecification of the utility function, by using a different functional form (Methods, Supplementary Note 4 and Supplementary Figure 6).

Discussion

In this study, we explored the neural computations that give rise to inconsistent choice behavior when the framing and context of the choice problems are stable. We introduced a novel trial-specific inconsistency index and found that it was positively related to activations in the vmPFC, dACC, and PCC which, strikingly, lie in the same regions of cortex as value representations. Moreover, the functional connectivity networks of the SV are more interconnected than for inconsistency, suggesting that inconsistent choices might be driven by idiosyncratic fluctuations within these regions. The main results were corroborated with an ROI analysis on anatomically defined brain regions, and were robust to several alternative explanations including the influence of motor or numerical noise. We also proposed a novel index for measuring choice difficulty on a continuous budget line, and demonstrated our main result is robust to difficulty. Finally, including a proxy for confidence⁵⁰ does not alter our main findings.

Our main empirical finding is a positive correlation between the severity of choice inconsistency and the BOLD signal located primarily in value-related areas. Based on our behavioral and neural data, we proposed a computational model in which inconsistent choice behavior originates from the variability in neuronal value computation. In cases where neural variability is large enough to overcome the value difference between the alternatives, choices of low valuation alternatives may occur. If the error distribution is skewed, we should expect to see higher neural activation in value regions for more inconsistent choices. Therefore, this study provides evidence consistent with the view that choice inconsistency arises from variability in regions of the human brain that are known to be responsible for value computation.

The hypothesis that choice inconsistency is tied to variability in valuation is not novel¹⁸. However, the standard explanation in economics is that this variability arises from limitations in the data available to the researcher, not that choice itself is stochastic⁵¹. More recently, decision theorists have proposed that the source of choice inconsistency might be more fundamental; that is, choice is stochastic because utilities are stochastic^18,19,52,53. While such theories place important empirical constraints on the pattern of inconsistent behavior, these constraints are weak because utilities are assumed to be unobservable. By contrast, the ability to observe the neural valuations of choice alternatives on a given trial enables a much stricter test of the hypothesis that choice inconsistency is due to stochastic valuations²⁰. Therefore, we should expect to see the empirical results reported here if inconsistent choice behavior arises from stochastic valuations computed in value regions of the brain.

The main methodological contribution of this study is the trial-level index of inconsistency. Existing inconsistency indices are aggregate measures, which use an entire subject’s dataset to provide one inconsistency score. For this reason, aggregate measures which correlate BOLD signal across subjects lose statistical power; they ignore trial-by-trial variations in behavior—and its neural foundations—thus cannot take advantage of the rich trial-level measurements provided by the MRI scanner. In particular, the most informative trials that induced inconsistent choices are lost when averaging over all trials. By contrast, our proposed index tracks trial-by-trial variations in behavior and neural activations, therefore provides insight into the valuation and choice process when a subject chooses inconsistently. We should note its use need not be limited to neuroeconomic studies; standard behavioral laboratory experiments can use the trial-specific MMI index to test theories which imply varying behavior across trials (e.g., choice dynamics).

At first glance, our empirical results might seem surprising given both lesion and stimulation studies which demonstrate that activity in value-related regions is necessary for consistent choice^37,54. Moreover, Polania et al.⁵⁴ find that choice behavior becomes less accurate when frontal–parietal coupling is disrupted by tACS. However, the results from these studies are compatible with our proposed computational model for choice behavior. When value-related regions are absent or disrupted, choices are highly inconsistent because of the limited ability of the brain to compute the valuations necessary for consistent choice behavior. When these regions are intact, value signals can be computed, but with a degree of variability inherent to neural computation. Thus, choices are largely consistent, but exhibit a pattern in which inconsistent choices correlate with an increase in aggregate activity. As further evidence, lesioned subjects in Camille et al.³⁷ had an Afriat Index of 0.1 on average, compared with an average of 0.0623 in our sample, even though our subjects were facing a much more difficult task (11 trials vs. 108 trials).

The neural random utility model represents a parsimonious account of the variability in value signals during a decision^20,29. As such, the trial-by-trial variability we propose can arise from multiple sources, including higher order cognitive processes such as fluctuations in attention or heuristics or lower-level process like neuronal noise. At a computational level, a number of previous studies have explored the role of noisy computations in choice behavior, typically in the form of a bounded accumulation model, in which a noisy decision signal accumulates to some threshold (the drift-diffusion model, DDM)³⁰. These models have found support in both single-neuron recordings³⁰ and human imaging studies⁵⁵, with the fMRI studies in particular observing value signals in the vmPFC. Since the NRUM is a general formulation of bounded accumulation models²⁹, this large set of accumulation models can provide a computational account of the results we observe. Of primary importance, the distributions used in our examples (either Log normal or Gumbel distributions¹⁸) are skewed with a long right tail, a property consistent with neural data at both the level of single-neuron firing rates and aggregate network-level measures³³. Therefore, our results are compatible with the evidence for a skewed distribution of neural activity in value-related regions. To demonstrate that the valuations implied by the NRUM are closely related to the observed neural activity, we simulated the valuation process of our subjects based on their observed behavior and demonstrated a correlation between their random utilities (equivalent to the BOLD signal) and the inconsistency of their simulated choices (equivalent to the trial-specific MMI index).

What might be the source of this variability in value computation, and why should it persist in decision-making? Variability is inherent to neural computation⁵⁶, arising from thermodynamic noise at the cellular and synaptic level, and is present at all stages from primary sensory systems to motor execution^56,57. Network computations can filter, or integrate, this noise, but the maximal signal–noise ratio is bounded due to physiological constraints^58,59. Thus, some constrained optimal degree of noise persists at the network level across domains from perceptual³⁰ to value-based choice^28,31,32. In a seminal article, Fox et al.⁶⁰ find that “inconsistency in perception or performance should not be automatically attributed to fluctuations in task-related cognitive processes such as attention, but could also be due to ongoing fluctuations in intrinsic neuronal activity”. Indeed, a recent study conducted to explicitly separate the sources of behavioral variability finds that 89% of deviations from optimal choice can be attributed to errors in value inference, rather than sensory processing or action selection⁶¹. This is consistent with our finding of a limited role for noise in motor output or numerical representation in generating inconsistent choices. Instead, the deviations from consistency were observed in valuation regions, suggesting that the value of choice options might be fluctuating on a trial-by-trial basis. This result is in line with non-human studies: In primates, variability in the firing rate of orbitofrontal cortex neurons predicted choices of near indifferent alternatives⁶², while stability in neural populations in the medial frontal cortex accounted for variability in choice in rats.⁶³ Furthermore, neural variability can be influenced by varying levels of attention^64–66. These results point to a speed-accuracy tradeoff in decision-making governed by metabolic costs⁶⁷.

Taken together, these results question whether the form of inconsistency we observe should be considered sub-optimal. Our results are consistent with a degree of constrained-optimal variability around the normatively defined benchmark of a utility representation. In fact, our results may be interpreted as implying that inconsistent choice behavior is an integral feature of human decision-making.

Methods

Participants

Thirty-eight subjects participated in the study (17 females, mean age 25.3, 18–36). Subjects gave informed written consent before participating in the study, which was approved by the local ethics committee at Tel Aviv University and by the Helsinki Committee of Sheba Medical Center. Three subjects were dropped due to sharp head movements (> 3 mm). Another subject opted out from the experiment before completing the scan, and another subject was dropped due to anatomical abnormalities. We therefore report the data for the remaining 33 subjects.

Experimental task

We used a modification of the task presented by Choi et al.¹³. On each trial, subjects faced a visualization of a budget line (Fig. 2a). Each discrete (x, y) point on the budget line corresponds to a lottery with a 50% chance of winning the tokens allocated to account X (the x-axis coordinate) and a 50% chance of winning the tokens allocated to account Y (the y-axis coordinate). Thus, the budget line describes the possible allocations to accounts X and Y on a two-dimensional graph. On each trial, the subject was asked to choose the desired bundle (of X and Y tokens) from the budget line, knowing that only one of the accounts will be realized. At the end of the experiment, one of the trials was randomly selected for monetary payment (to satisfy incentive compatibility). The subject won the monetary value of the tokens allocated to the winning account on the trial drawn for the payment. Each token was worth 5 NIS ($1 ≅3.5 NIS).

In this task, the slope of the budget line determines the price of one unit of account X relative to one unit of account Y. We varied and randomized the budget lines (slopes and endowments) across trials and subjects. The x-axis and y-axis were scaled from 0 to 100 tokens, and the resolution of the budget line was 0.1 tokens. Subjects could not choose inside the budget line. In trials where subjects did not make any choice in the allotted time, a text reading “No choice was made” appeared on the screen. These trials were excluded from the analysis (35 trials out of 3,564 total trials). The average prize was 191.6 NIS + 100 NIS show up fee.

fMRI session

Subjects performed the experimental task using an fMRI compatible trackball to choose their preferred bundle. On each trial, subjects had a maximum of 12 s to make their choices, followed by a 9 s variable inter-trial-interval (jittered between trials). If subjects made their choice before the end of the maximal 12 s, the remaining time was added to the ITI. There were 27 trials in each block, and each subject completed four blocks, for a total of 108 trials.

After completing the main task, we obtained an anatomical scan and two functional localizers, one numerical and one motor (counter-balanced), aimed to control for alternative sources of choice inconsistency (see Functional Localizers below).

Instructions and pre-scan practice

Before the scan, subjects read an instruction sheet and completed a pre-scan questionnaire to verify the task is clear. The instructions included many examples and were written in simple terms to avoid confusion. In the pre-scan questionnaire, subjects were given several decision problems (i.e., graphs representing different budget sets) and were asked to identify intersections with the axes, identify the cheaper account, and calculate the possible winning prize (in terms of both tokens and NIS) for a specific (x, y) coordinate. After the pre-scan questionnaire, the experimenter went over their answers. In case the subject made a mistake, the experimenter explained the instructions orally, and then repeated the question in the questionnaire until the subject answered correctly. See an English translation of the instructions and pre-scan questionnaire in Supplementary Notes 6 and 7. Thereafter, subjects completed a practice block in front of a computer, using a similar trackball to the one used inside the fMRI, in order to imitate the motor movements required during the scan. The budget sets in the practice block were predefined to ensure all subjects encountered the same (substantial) variation of slopes and endowments.

Image acquisition

Scanning was performed at the Strauss Neuroimaging Center at Tel Aviv University, using a 3 T Siemens Prisma scanner with a 64-channel Siemens head coil. To measure blood oxygen level-dependent (BOLD) changes in brain activity during the experimental task, a T2*-weighted functional multi-band EPI pulse sequence was used (TR = 1.5 s; TE = 30 ms; flip angle = 70° matrix = 86 × 86; field of view (FOV) = 215 mm; slice thickness = 2.5 mm; band factor = 2). Fifty-two slices with no inter-slice gap were acquired in ascending interleaved order, and aligned 30° to the AC–PC plane to reduce signal dropout in the orbitofrontal area. Anatomical images were acquired using 1-mm isotropic MPRAGE scan, which was comprised from 208 axial slices without gaps at an orientation of −30° to the AC–PC plane.

fMRI data preprocessing

BrainVoyager QX (Brain Innovation) was used for image analysis, with additional analyses performed in MATLAB (MathWorks). Functional images were sinc-interpolated in time to adjust for staggered slice acquisition, corrected for any head movement by realigning all volumes to the first volume of the scanning session using six-parameter rigid body transformations. Spatial smoothing with a 6-mm FWHM Gaussian kernel was applied to the fMRI images. Images were then co-registered with each subject’s high-resolution anatomical scan and normalized using the Montreal Neurological Institute (MNI) template. All spatial transformations of the functional data used trilinear interpolation.

The General Axiom of Revealed Preference (GARP)

Consider a finite dataset $D = {\{(p^{i}, x^{i})\}}_{i = 1}^{n}$ , where $x^{i} \in R_{+}^{k}$ is the subject’s chosen bundle at prices $p^{i} \in R_{+ +}^{k}$ (k is the number of goods in the bundle). Bundle xⁱ is

Directly revealed preferred to another bundle x, denoted xⁱR⁰x, if pⁱxⁱ ≥ pⁱx.
Strictly directly revealed preferred to bundle x, denoted xⁱP⁰x, if pⁱxⁱ > pⁱx.
Revealed preferred to bundle x, denoted xⁱRx, if there exists a sequence of observed bundles (x^j, x^k,…, x^m), that are directly revealed preferred to one another, xⁱR⁰x^j, x^jR⁰x^k,…, x^mR⁰x. Relation R is therefore the transitive closure of the directly revealed preferred relation.

D satisfies the General Axiom of Revealed Preference (GARP), if every pair of observed bundles, xⁱRx^j, implies ¬(x^jP⁰xⁱ). We say a subject is consistent iff she satisfies GARP. We say that a utility function u(x) rationalizes D if xⁱR⁰x implies u(xⁱ) ≥ u(x). According to the Afriat theorem^1,2, there exists a well-behaved utility function (continuous, monotone, and concave) that rationalizes the data iff the subject satisfies GARP (Supplementary Figure 2b). Otherwise, a strict cycle of choices exists and we say that D violates GARP. By the Afriat’s theorem, if the dataset D does not satisfy GARP, then the subject cannot be described as a non-satiated utility maximizer and is therefore said to be inconsistent^1,2.

Aggregate inconsistency indices

As subjects often violate GARP, and therefore are inconsistent^11,14,45, one would like to measure their level of inconsistency. The simplest way would be to count the number of GARP violations. Other well-known nonparametric inconsistency indices are Afriat index^2,34, Varian index³⁵ and Houtman–Maks index³⁶ (see Supplementary Note 1 for detailed descriptions of these indices). For each subject, we calculated the number of GARP violations and Afriat index for the entire experiment (108 trials). We were unable to compute Varian index and Houtman–Maks index at the aggregate level as they are hard computationally³⁵ (see Appendix B in Halevy et al.⁴⁵). In the current study, we also compute a parametric index for inconsistency—the Aggregate MMI.

Aggregate MMI

Following Halevy et al.⁴⁵ consider the continuous and non-satiated utility function u(∙) as representing the preferences of the subject. u(∙) induces a complete ranking on the bundles such that if u(x) > u(y), then bundle x is preferred to bundle y. In addition, each actual choice induces a partial order on the bundles since when a subject chooses bundle xⁱ, she, by the principle of revealed preference, ranks this bundle over all other feasible bundles. If these two rankings are compatible for every choice made by the subject, u(∙) rationalizes D. Otherwise, if these two rankings are incompatible for some choice according to u(∙), some feasible bundles are ranked higher by u(∙) than the chosen Bundle xⁱ. For every observation, the incompatibility between the two rankings can be measured by the minimal expenditure (parallel inward movement of the budget line), such that the adjusted budget set does not include any bundle that is strictly preferred over xⁱ according to u(∙). Halevy et al.⁴⁵ show that this measure is exactly the well-known money metric⁶⁸. Formally, given the prices pⁱ, the money metric m(xⁱ,pⁱ,u) for observation i is the minimal expenditure required for the dataset to include a bundle y such that u(y) ≥ u(xⁱ):

m (x^{i}, p^{i}, u) = \min_{u (y) \geq u (x^{i})} p^{i} y

We normalize the money metric measure by the original expenditure, and therefore the adjustment for trial i is $v_{i}^{*} (D, u) = 1 - \frac{m (x^{i}, p^{i}, u)}{p^{i} x^{i}}$ (Fig. 1a). Hence, if no adjustment is needed, $v_{i}^{*} (D, u) = 0$ . Next, we aggregate the adjustments for all observations using some aggregator function $f (v^{*} (D, u))$ (specifically, the average sum of squares), and get a measure of the incompatibility between the utility function u(∙) and the dataset D, given the aggregator f. Finally, we iterate over all utility functions in the set of utility functions $U$ under investigation and look for the one with the smallest incompatibility with the dataset D. The MMI, denoted $I_{M} (D, f, U)$ , interprets the incompatibility between this utility function and D, as the incompatibility between the set of utility functions $U$ and D given the aggregator f.

I_{M} (D, f, U) = inf_{u \in U} f (v^{*} (D, u))

Halevy et al.⁴⁵ prove that had we examined the set of all continuous non-satiated utility functions (denoted $U^{C}$ ), the MMI would be equal to the nonparametric Varian inconsistency index, denoted I_v(D, f)³⁵. As it is not feasible to examine all utility functions in this set, they propose restricting to a specific functional form. The MMI thus includes a misspecification element. Fortunately, the MMI is separable additive in Varian inconsistency index and the misspecification:

I_{M} (D, f, U) = I_{v} (D, f) + Misspecification

The computation of MMI yields two measures: (a) the computation of aggregate MMI; (b) elicitation of subject-specific utility function parameters. The subject-specific utility function is the function for which aggregate MMI is minimal, and hence constitutes the best fit for the subject’s choices among the investigated family of utility functions $U$ .

Trial-specific index

A Leave-One-Out procedure. Let ε_D be an aggregate inconsistency index of dataset D. Let D_−i be a subset of D that includes all n−1 trials but the i^th observation. Let ε_D−i be the aggregate inconsistency index of D_−i, and let $(ε_{D} - ε_{D_{- i}})$ be the trial-specific inconsistency index of trial i.

The best practice would be to use Varian index as ε_D, as it is the nonparametric index with the highest number of degrees of freedom. However, it is not possible to compute Varian index for datasets with 108 observations in feasible time. Therefore, we had to choose between two other alternatives. The first was to use computational-convenient nonparametric indices (Afriat index and GARP violations). Nevertheless, when using those nonparametric indices, the trial-specific index $(ε_{D} - ε_{D_{- i}})$ usually equals 0 and therefore lacks the required variability across trials. The distribution of a nonparametric trial-specific index is depicted in Supplementary Figure 5a. One should notice that the variability in the trial-specific index is important when using the General Linear Model (GLM) to correlate the BOLD signal, as otherwise it lacks statistical power. Therefore, we picked aggregate MMI as ε_D, and refer to $(ε_{D} - ε_{D_{- i}})$ as trial-specific MMI. The code-package used to compute aggregate MMI and trial-specific MMI is available as open source in https://github.com/persitzd/RP-Toolkit.

Parametric utility

For parametric family of utility functions, we use the Disappointment Aversion model with CRRA functional form⁶⁹, as it includes many well-known types of preferences in the context of risk^13,45 (see also Appendix D of Halevy et al.⁴⁵). Formally,

SV (x_{1}^{i}, x_{2}^{i}) = γ ω (max \{x_{1}^{i}, x_{2}^{i}\}) + (1 - γ) ω (min \{x_{1}^{i}, x_{2}^{i}\}), (DA)

γ = \frac{1}{2 + β}, - 1 \leq β < \infty

ω (z) = \{\begin{matrix} \frac{z^{1 - ρ}}{1 - ρ}, ρ \geq 0 \\ ln (z), ρ = 1 \end{matrix} (CRRA)

where γ is the weight of the better outcome, and ω is a CRRA utility index with a relative risk aversion parameter ρ. When β = 0, this is the common Expected Utility function with parameter ρ (if, in addition ρ = 0, it is Expected Value and when ρ = 1 it is the Cobb–Douglas with equal exponents). When β > 0 the individual over-weights the probability that the lottery will yield the lower prize (“disappointment aversion”). When β → ∞, the subject cares only about the element with the lower quantity and therefore her optimal behavior would be to always choose the safe bundle, so that the lottery is meaningless (i.e., Leontief preferences). When β < 0, the individual overweights the probability that the lottery will yield the higher prize (“elation seeking”). When β = −1, the subject cares only about the element with the larger quantity (see Supplementary Figure 3b).

To use SV as a parametric regressor in our analysis, we calculated the value of the Disappointment Aversion model with CRRA functional form at the chosen bundle (x₁, x₂) in each trial i, using the subject’s recovered parameters (using the MMI), β and ρ.

Assessing the NRUM and inconsistency

On each trial, we reconstructed the set of bundles each subject encountered and calculated the SV using the parameters elicited by MMI method (Fig. 3c). These correspond to the v_is in the proposed model.

We calibrated two skewed distributions (the zero-mode Gumbel distribution and the zero-mean log normal distribution) for the neural noise e_i using the observed inconsistency level of each subject. The standard deviation of the distributions was chosen so that the average level of the Afriat inconsistency index matched the observed index.

Based on the v_is and the calibrated distributions for e_i, we calculated the random utility values ( $\tilde{v_{i}}$ ) for each alternative in every trial. For each trial, following value maximization, the chosen bundle was the alternative with the highest random utility value. We repeated this procedure for each subject 1,000 times for each distribution. Hence, we obtained 1,000 simulated datasets for each subject (for each of the two noise distributions).

Next, we tested whether the simulated datasets are compatible with our interpretation of the neural results. Note that we cannot simply use the trial-specific MMI index here, because we have already based the simulation on the parameters elicited by MMI to calculate the v_is. This would amount to double-dipping the data. Instead, we used the trial-specific Afriat index as a proxy for the trial-specific MMI index (note they are highly correlated, Fig. 3e). For each simulated trial, we calculated the noise of the chosen bundle as a proxy for the valuation noise in the BOLD signal. We pooled these two series across simulations and trials, and then assessed their correlation.

Whole-brain analysis of choice inconsistency

To identify the neural correlates of choice inconsistency, we estimated a general linear model (GLM) with 11 predictors. The trial-by-trial inconsistency index trial-specific MMI and the trial-by-trial SV, entered for the total trial duration up until the subject made a choice, normalized and convolved with the canonical hemodynamic response function (HRF). We modeled RT using a boxcar epoch function, whose duration was equal to the RT of the trial⁷⁰. The other predictors included the price ratio of the budget set (the slope), and the endowment measured by the safe portfolio on the 45 degrees line from the origin.

All these predictors were entered for the trial duration, normalized and convolved with the HRF. In addition, six motion-correction parameters and the constant were included as regressors of no interest to account for motion-related artifacts.

ROI analysis of choice inconsistency

We also conducted a region of interest (ROI) analysis, in order to increase the power of the statistical test. We defined the vmPFC and vStr ROIs based on the masks provided by Bartra et al.²¹. For the dACC, we drew a 12 -mm sphere around the peak voxel that Kolling et al.⁴³ reported. For the PCC and V1 ROIs, we used neurosynth.org meta-analyses masks. We then conducted the same RFX GLM reported above and correlated trial-specific MMI and SV with BOLD activity extracted from each of the ROIs.

PPI analysis

The time series of the BOLD signal in each ROI was z-scored to generate the time series of the neuronal signal for each source region as the physiological variable in the PPI. We tested (separately) each parametric regressor, SV and trial-specific MMI, as the psychological variable, suggesting that a given region can be connected to distinct regions/networks depending on task context⁷¹. The psychological regressors were normalized and convolved with the canonical HRF and entered into the regression model. An additional regressor represented the interaction between the psychological and physiological factors, and indicated in which areas there were significant differential functional connectivity with each seed ROI. We modeled RT similarly to the whole-brain analysis, and used the price ratio of the budget set and the endowment as control predictors. These predictors were entered for the trial duration, normalized and convolved with the HRF as well. In addition, six motion-correction parameters and the constant were included as regressors of no interest to account for motion-related artifacts.

We used RFX for the group-level analysis, and set the threshold to p < 0.0005 with cluster-size correction, similarly to the main GLM analysis. We hence ran six models in total (3 ROIs × 2 psychological contexts).

Orthogonality analysis

To examine the neural footprints of the non-correlated part of the SV and trial-specific MMI predictors, we used an orthogonality analysis. We ran a clustered regression of trial-specific MMI on SV and obtained residuals $ẽ$ , and similarly obtained residuals $ũ$ from a clustered regression of SV on trial-specific MMI. We repeated the RFX-GLM as in the whole-brain analysis, but replaced trial-specific MMI with $ẽ$ , and similarly ran the RFX-GLM replacing SV with $ũ$ .

Measuring choice difficulty

Since each decision problem in our task is continuous, we cannot simply use ΔSV as a choice difficulty index. Moreover, choice difficulty in our task is determined by the subject’s own preferences and not only by the slopes of the budget sets. For example, a risk-averse subject might struggle with steep slopes, as the temptation for allocating all tokens to one account rises. A risk-seeking subject, on the other hand, will have difficulties in moderate slopes. Therefore, we derive a novel measure for choice difficulty, which considers both subject’s elicited parameters and the continuum of the budget set.

We discretized each budget line into 1000 possible bundles. We then calculated the subjective value v_i,b,s of each bundle i along each budget set b for each subject s using the parameters elicited by the MMI. Next, we subtracted v_i,b,s from the maximal subjective value of that budget line to obtain a ΔV_i,b,s measure for each bundle along the budget set (analogous to taking the difference between two options in a binary choice design). We then averaged all the ΔV_i,b,s across the 1000 bundles, and normalized the result by the endowment of the given budget set to be able to compare across trials. Formally, denote $V_{b, s} = m a x_{i \in [1, \dots, 1, 000]} v_{i, b, s}$ . Then:

Choice {Simplicity}_{b, s} = \frac{\frac{\sum_{i = 1}^{N} [V_{b, s} - v_{i, b, s}]}{N}}{{Endowment}_{b}}

where i∈ [1,…,1000] is the bundle (N = 1000); b∈ [1,…,108] is the decision problems and s∈ [1,…,33] is the subject.

Difficult choices occur when the bundles along the budget line have similar values to the maximum value (V_b,s), making the options along the line relatively similar, which results in an overall low index. Hence, the higher is our index, the easier it is to make a choice, suggesting that trials with index values closer to 0, are the more difficult choices. Hence, we refer to this index as a choice simplicity index. The normalization with Endowment_b,s of the chosen bundle is aimed to minimize the problem of over-scoring trials with higher endowments. In such cases, the budget line is longer (further away from the origin), and therefore ΔV_i,b,s is bigger for the mere fact that the distance between bundles is bigger.

Controlling for changes in behavior

We control for an over-estimation of trial-specific MMI values due to changes in behavior across blocks, which may increase index values, though in fact subjects simply changed their preferences between blocks and thus were actually consistent as long as their preferences were stable. For every subject, we computed four different aggregate MMIs, based on the 27 trials in each block, rather than the 108 trials of the entire experiment. We implemented the Leave-One-Out procedure on the aggregate MMI of each block to generate trial-specific MMI-blocks. Accordingly, we also recovered different utility functional parameters for every block and computed the trial-by-trial SV, with respect to the block-specific parameters (SV-blocks). We ran the same RFX-GLM, and used trial-specific MMI-blocks and SV-blocks as our inconsistency and value modulation regressors, respectively.

Moreover, we classified subjects’ choice behavior in each block, and identified eight subjects (out of 33) who changed behavior across blocks (Supplementary Table 1). We thereafter ran the same RFX-GLM as in our main analysis (see Whole-brain analysis of choice inconsistency section), but this time used trial-specific MMI-blocks and SV-blocks only for the eight subjects who switched strategies. For the rest of our sample, we used trial-specific MMI and SV.

Controlling for the misspecification

In order to rule out the possibility that our results reflect the MMI’s misspecification element, we repeated our main analysis, using a different functional form. Halevy et al.⁴⁵ show that changing the functional form varies the misspecification element of the MMI, but the inconsistency element remains unchanged. Thus, we elicited subjects’ preferences using a constant absolute risk aversion (CARA) utility index with an absolute risk aversion parameter A (rather than CRRA utility index):

ω (z) = 1 - e^{- A z}; A > 0

Between-subjects analysis

We conducted a standard between-subject analysis and examined if the brain areas that we found in our trial-by-trial analysis would show up also in a basic between-subject analysis. We ran an RFX-GLM with one predictor, a dummy for the trial identity. We ran the model in all our predefined ROIs, i.e., vmPFC, dACC, bilateral vStr and PCC (see Region of Interest Analysis section for details about the masking), as well as the mPFC/ACC cluster that was correlated with the trial-specific MMI (MMI ROI). For each subject, in each ROI, we extracted the average GLM-coefficient β over the course of the entire experiment, to account for the average change in BOLD signal. We then correlated the average change in BOLD signal in each ROI with aggregate-level inconsistency indices—aggregate MMI, Afriat index, and number of GARP violations (see Aggregate inconsistency indices section for details). We corrected our analysis for multiple comparisons, using Bonferroni correction, and set p_i ≤ 0.05/(18 comparisons) = 0.0028 as our statistical threshold (Supplementary Table 3).

Functional localizers

In the motor imprecision functional localizer, subjects were presented with linear graphs, and had to reach a black target using a trackball. It resembled the main task, but excluded any numerical or value representation (Supplementary Figure 1a). Subjects completed 27 trials, and had a maximum of 6 s on each trial to reach the black target, followed by a variable ITI of 6 s (jittered between trials).

Similarly, in the numerical execution imprecision localizer, subjects were presented with linear graphs, and had to reach a target {x,y} coordinates (Supplementary Figure 1b) on the graph. A title read their current {x,y} cursor position at the top of the screen. The numerical localizer resembled the main task, but excluded any value-based decision. Subjects completed 27 trials, and had a maximum of 12 s on each trial to reach the {x,y} coordinates, followed by a variable ITI of 9 s (jittered between trials). In both localizers, the graphs were varied and randomized across trials and subjects. We calculated motor and numerical imprecisions as the Euclidean distance between the cursor position at the moment the subject clicked the trackball, and the predefined target. In order to identify the neural correlates of the motor/numerical imprecision, we used the trial-by-trial motor/numerical imprecision as a predictor in an RFX GLM. Other predictors included a boxcar epoch function for the trial duration to model RT, as well as the graph’s slope. All these predictors were entered for the trial duration, normalized and convolved with the HRF. In addition, six motion-correction parameters and the constant were included as regressors of no interest to account for motion-related artifacts.

Supplementary information

Supplementary Information^{(2.8MB, pdf)}

Transparent Peer Review File^{(2.1MB, pdf)}

Acknowledgements

We thank Y. David, A. Shuster and O. Ossmy for their helpful assistance. We also thank Y. Halevy and I. Saporta for fruitful discussions. This work was funded with grants from the Israel Science Foundation (#1104/13 for D.J.L. #1390/14 for D.P.), a joint grant to D.J.L. and D.P. from the Coller Foundation and the financial support of the Henry Crown Institute of Business Research in Israel.

Author contributions

D.J.L. and D.P. conceived the study. D.J.L., D.P., and V.K. designed the study. V.K. collected and analyzed the data. D.P. and R.W. performed analysis. D.J.L, D.P., R.W., and V.K. wrote the paper.

Data and code availability

The computer code used for the computation of MMI and the other inconsistency indices is available as an open source code at https://github.com/persitzd/RP-Toolkit. The datasets generated and/or analyzed during the current study, statistical maps and the rest of the computer code used to analyze further behavioral results, the imaging data and NRUM is available on OSF at https://osf.io/8jdfh/.

Competing interests

The authors declare no competing interests.

Footnotes

Journal peer review information: Nature Communications thanks David Smith and the anonymous reviewers for their contribution to the peer review of this work. Peer reviewer reports are available.

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information accompanies this paper at 10.1038/s41467-019-09343-2.

References

1.Afriat SN. The construction of utility functions from expenditure. Data. Int. Econ. Rev. (Phila.). 1967;8:67–77. doi: 10.2307/2525382. [DOI] [Google Scholar]
2.Varian HR. The nonparametric approach to demand analysis. Econometrica. 1982;50:945–973. doi: 10.2307/1912771. [DOI] [Google Scholar]
3.Simon HA. Rational choice and the structure of the environment. Psychol. Rev. 1956;63:129–138. doi: 10.1037/h0042769. [DOI] [PubMed] [Google Scholar]
4.Kahneman, D. & Tversky, A. Choiches, Values & Frames (Cambridge University Press, Cambridge, UK, 2000).
5.Tversky A. Intransitivity of preferences. Psychol. Rev. 1969;76:31–48. doi: 10.1037/h0026750. [DOI] [Google Scholar]
6.Tversky A, Kahneman D. The framing of decisions and the psychology of choice. Sci. (80-.). 1981;211:453–458. doi: 10.1126/science.7455683. [DOI] [PubMed] [Google Scholar]
7.Simon HA. A behavioral model of rational choice. Quaterly J. Econ. 1955;69:99–118. doi: 10.2307/1884852. [DOI] [Google Scholar]
8.Manzini P, Mariotti M. Sequentially rationalizable choice. Am. Econ. Rev. 2007;97:1824–1839. doi: 10.1257/aer.97.5.1824. [DOI] [Google Scholar]
9.Kahneman D, Tversky A. Prospect theory: an analysis of decision under risk. Econom. J. Econom. Soc. 1979;47:263–291. [Google Scholar]
10.Simon, H. A. Models of Bounded Rationality, Behavioral Economics and Business Organization, Vol. 2 (MIT Press, Cambridge, Mass.,1982).
11.Fisman R, Kariv S, Markovits D. Individual preferences for giving. Am. Econ. Rev. 2007;97:1858–1876. doi: 10.1257/aer.97.5.1858. [DOI] [Google Scholar]
12.Andreoni J, Miller J. Giving according to GARP: an experimental test of the consistency of preferences for Altruism. Econometrica. 2002;70:737–753. doi: 10.1111/1468-0262.00302. [DOI] [Google Scholar]
13.Choi S, Fisman R, Gale D, Kariv S. Consistency and heterogeneity of individual behavior under uncertainty. Am. Econ. Rev. 2007;97:1921–1938. doi: 10.1257/aer.97.5.1921. [DOI] [Google Scholar]
14.Dean M, Martin D. Measuring rationality with the minimum cost of revealed preference violations. Rev. Econ. Stat. 2016;98:524–534. doi: 10.1162/REST_a_00542. [DOI] [Google Scholar]
15.Echenique F, Lee S, Shum M. The money pump as a measure of revealed preference violations. J. Polit. Econ. 2011;119:1201–1223. doi: 10.1086/665011. [DOI] [Google Scholar]
16.Mosteller F, Nogee P. An experimental measurement of utility. J. Polit. Econ. 1951;59:371–404. doi: 10.1086/257106. [DOI] [Google Scholar]
17.Agranov M, Ortoleva P. Stochastic choice and preferences for randomization. J. Polit. Econ. 2017;125:40–68. doi: 10.1086/689774. [DOI] [Google Scholar]
18.McFadden D. Economic choices. Am. Econ. Rev. 2001;91:351–378. doi: 10.1257/aer.91.3.351. [DOI] [Google Scholar]
19.Gul F, Pesendorfer W. Random expected utility. Econometrica. 2006;74:121–146. doi: 10.1111/j.1468-0262.2006.00651.x. [DOI] [Google Scholar]
20.Webb, R., Glimcher, P. W., Levy, I., Stephanie, C. & Rutledge, R. B. Neural random utility: relating cardinal neural observables to stochastic choice behaviour. J. Neurosci. Psychol. Econ. 12, 45–72 (2019).
21.Bartra O, McGuire JT, Kable JW. The valuation system: a coordinate-based meta-analysis of BOLD fMRI experiments examining neural correlates of subjective value. Neuroimage. 2013;76:412–427. doi: 10.1016/j.neuroimage.2013.02.063. [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Clithero JA, Rangel A. Informatic parcellation of the network involved in the computation of subjective value. Soc. Cogn. Affect. Neurosci. 2013;9:1289–1302. doi: 10.1093/scan/nst106. [DOI] [PMC free article] [PubMed] [Google Scholar]
23.Levy DJ, Glimcher PW. The root of all value: a neural common currency for choice. Curr. Opin. Neurobiol. 2012;22:1027–1038. doi: 10.1016/j.conb.2012.06.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
24.Arieli A, Sterkin A, Grinvald A, Aertsen A. Dynamics of ongoing activity: explanation of the large variability in evoked cortical responses. Sci. (80-.). 1996;273:1868–1871. doi: 10.1126/science.273.5283.1868. [DOI] [PubMed] [Google Scholar]
25.Werner G, Mountcastle VB. The variability of cantral neural activity in a sensory system, and its implications for the central reflection of sensory events. J. Neurophysiol. 1963;26:958–977. doi: 10.1152/jn.1963.26.6.958. [DOI] [PubMed] [Google Scholar]
26.Schumacher JF, Thompson SK, Olman CA. Contrast response functions for single Gabor patches: ROI ‑ based analysis over ‑ represents low ‑ contrast patches for GE BOLD. Front. Syst. Neurosci. 2011;5:1–10. doi: 10.3389/fnsys.2011.00019. [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Tolhurst DJ, Movshon JA, Dean AF. The statistical reliability of signals in single neurons in cat and monkey visual cortex. Vision. Res. 1983;23:775–785. doi: 10.1016/0042-6989(83)90200-6. [DOI] [PubMed] [Google Scholar]
28.Glimcher PW. Indeterminacy in brain and behavior. Annu. Rev. Psychol. 2005;56:25–56. doi: 10.1146/annurev.psych.55.090902.141429. [DOI] [PubMed] [Google Scholar]
29.Webb, R. The (neural) dynamics of stochastic choice. Manage. Sci. 65, 230–255 (2019).
30.Gold JI, Shadlen MN, Shadlen MN. Neural computations that underlie decisions about sensory stimuli. Trends Cogn. Sci. 2001;5:10–16. doi: 10.1016/S1364-6613(00)01567-9. [DOI] [PubMed] [Google Scholar]
31.Shadlen MN, Shohamy D. Perspective decision making and sequential sampling from memory. Neuron. 2016;90:927–939. doi: 10.1016/j.neuron.2016.04.036. [DOI] [PMC free article] [PubMed] [Google Scholar]
32.Woodford M. Stochastic choice: an optimizing neuroeconomic model. Am. Econ. Rev. 2014;104:495–500. doi: 10.1257/aer.104.5.495. [DOI] [Google Scholar]
33.Buzsaki G, Mizuseki K. The log-dynamic brain: how skewed distributions affect network operations. Nat. Rev. Neurosci. 2014;15:264–278. doi: 10.1038/nrn3687. [DOI] [PMC free article] [PubMed] [Google Scholar]
34.Afriat SN. On a system of inequalities in demand analysis: an extension of the classical method. Int. Econ. Rev. (Phila.). 1973;14:460–472. doi: 10.2307/2525934. [DOI] [Google Scholar]
35.Varian HR. Goodness-of-fit in optimizing models. J. Econom. 1990;46:125–140. doi: 10.1016/0304-4076(90)90051-T. [DOI] [Google Scholar]
36.Houtman M, Maks JAH. Determining all maximal data subsets consistent with revealed preference. Kwant. Methode. 1985;19:89–104. [Google Scholar]
37.Camille N, Griffiths CA, Vo K, Fellows LK, Kable JW. Ventromedial frontal lobe damage disrupts value maximization in Humans. J. Neurosci. 2011;31:7527–7532. doi: 10.1523/JNEUROSCI.6527-10.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Fellows LK, Farah MJ. The role of ventromedial prefrontal cortex in decision making: judgment under uncertainty or judgment per se? Cereb. Cortex. 2007;17:2669–2674. doi: 10.1093/cercor/bhl176. [DOI] [PubMed] [Google Scholar]
39.Chung H, Tymula A, Glimcher P. The reduction of ventrolateral prefrontal cortex grey matter volume correlates with loss of economic rationality in aging. J. Neurosci. 2017;37:12068–12077. doi: 10.1523/JNEUROSCI.1171-17.2017. [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Kalenscher T, Tobler PN, Huijbers W, Daselaar SM, Pennartz CM. A. Neural signatures of intransitive preferences. Front. Hum. Neurosci. 2010;4:1–5. doi: 10.3389/fnhum.2010.00049. [DOI] [PMC free article] [PubMed] [Google Scholar]
41.Grueschow M, Polania R, Hare TA, Ruff CC. Automatic versus choice-dependent value representations in the human brain. Neuron. 2015;85:874–885. doi: 10.1016/j.neuron.2014.12.054. [DOI] [PubMed] [Google Scholar]
42.Hunt LT, Hayden BY. A distributed, hierarchical and recurrent framework for reward-based choice. Nat. Rev. Neurosci. 2017;18:172–182. doi: 10.1038/nrn.2017.7. [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Kolling N, Behrens TEJ, Wittmann MK, Rushworth MFS. ScienceDirect multiple signals in anterior cingulate cortex. Curr. Opin. Neurobiol. 2016;37:36–43. doi: 10.1016/j.conb.2015.12.007. [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Shenhav A, Straccia MA, Cohen JD, Botvinick MM. Anterior cingulate engagement in a foraging context reflects choice difficulty, not foraging value. Nat. Neurosci. 2014;17:1249–1254. doi: 10.1038/nn.3771. [DOI] [PMC free article] [PubMed] [Google Scholar]
45.Halevy Y, Persitz D, Zrill L. Parametric recoverability of preferences. J. Polit. Econ. 2018;126:1558–1593. doi: 10.1086/697741. [DOI] [Google Scholar]
46.Krajbich I, Armel C, Rangel A. Visual fixations and the computation and comparison of value in simple choice. Nat. Neurosci. 2010;13:1292–1298. doi: 10.1038/nn.2635. [DOI] [PubMed] [Google Scholar]
47.Basten U, Biele G, Heekeren HR, Fiebach CJ. How the brain integrates costs and benefits during decision making. Proc. Natl. Acad. Sci. U. S. A. 2010;107:21767–21772. doi: 10.1073/pnas.0908104107. [DOI] [PMC free article] [PubMed] [Google Scholar]
48.Krajbich I, Hare T, Bartling B, Morishima Y, Fehr E. A common mechanism underlying food choice and social decisions. PLoS Comput. Biol. 2015;11:1–24. doi: 10.1371/journal.pcbi.1004371. [DOI] [PMC free article] [PubMed] [Google Scholar]
49.Botvinick MM, Cohen JD, Carter CS. Conflict monitoring and anterior cingulate cortex: An update. Trends Cogn. Sci. 2004;8:539–546. doi: 10.1016/j.tics.2004.10.003. [DOI] [PubMed] [Google Scholar]
50.Lebreton, M., Abitbol, R., Daunizeau, J. & Pessiglione, M. Automatic integration of confidence in the brain valuation signal. Nat. Neurosci. 18, 1159–1167 (2015). [DOI] [PubMed]
51.Manski CF. No title. Theory Decis. 1977;8:229–230. doi: 10.1007/BF00133443. [DOI] [Google Scholar]
52.Hey JD. Why we should not be silent about noise. Exp. Econ. 2005;8:325–345. doi: 10.1007/s10683-005-5373-8. [DOI] [Google Scholar]
53.Apesteguia J, Ballester MA. Monotone stochastic choice models: the case of risk and time preferences. J. Polit. Econ. 2018;126:74–106. doi: 10.1086/695504. [DOI] [Google Scholar]
54.Polanía, R., Moisa, M., Opitz, A., Grueschow, M. & Ruff, C. C. The precision of value-based choices depends causally on fronto-parietal phase coupling. Nat. Commun. 6, 8090 (2015). [DOI] [PMC free article] [PubMed]
55.Hare TA, Schultz W, Camerer CF, Doherty JPO, Rangel A. Transformation of stimulus value signals into motor commands during simple choice. PNAS. 2011;108:18120–18125. doi: 10.1073/pnas.1109322108. [DOI] [PMC free article] [PubMed] [Google Scholar]
56.Faisal AA, Selen LPJ, Wolpert DM. Noise in the nervous system. Nat. Rev. Neurosci. 2008;9:292–303. doi: 10.1038/nrn2258. [DOI] [PMC free article] [PubMed] [Google Scholar]
57.Harris ChristopherM, M.Wolpert D. Signal-dependent noise determinesmotorplanning Christopher. Nature. 1998;394:780–784. doi: 10.1038/29528. [DOI] [PubMed] [Google Scholar]
58.Shadlen MN, Newsome WT. The variable discharge of cortical neurons: implications for connectivity, computation, and information coding. J. Neurosci. 1998;18:3870–3896. doi: 10.1523/JNEUROSCI.18-10-03870.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]
59.Lennie P. The cost of cortical computation. Curr. Biol. 2003;13:493–497. doi: 10.1016/S0960-9822(03)00135-0. [DOI] [PubMed] [Google Scholar]
60.Fox MD, Snyder AZ, Vincent JL, Raichle ME. Intrinsic fluctuations within cortical systems account for intertrial variability in human behavior. Neuron. 2007;56:171–184. doi: 10.1016/j.neuron.2007.08.023. [DOI] [PubMed] [Google Scholar]
61.Drugowitsch J, et al. Computational precision of mental inference as critical source of human choice suboptimality. Neuron. 2016;92:1398–1411. doi: 10.1016/j.neuron.2016.11.005. [DOI] [PubMed] [Google Scholar]
62.Padoa-Schioppa C. Neuronal origins of choice variability in economic decisions. Neuron. 2013;80:1322–1336. doi: 10.1016/j.neuron.2013.09.013. [DOI] [PMC free article] [PubMed] [Google Scholar]
63.Kurikawa T, Haga T, Handa T, Harukuni R, Fukai T. Individual variability in decision-making. Nat. Neurosci. 2018;21:1764–1773. doi: 10.1038/s41593-018-0263-5. [DOI] [PubMed] [Google Scholar]
64.Denfield, G. H., Ecker, A. S., Shinn, T. J., Bethge, M. & Tolias, A. S. Attentional fluctuations induce shared variability in macaque primary visual cortex. Nat. Commun. 9, 2654 (2018). [DOI] [PMC free article] [PubMed]
65.Briggs F, Mangun GR, Usrey WM. Attention enhances synaptic efficacy and signal-to-noise in neural circuits. Nature. 2013;499:476–480. doi: 10.1038/nature12276. [DOI] [PMC free article] [PubMed] [Google Scholar]
66.Herrero JL, Gieselmann MA, Sanayei M, Thiele A. Attention-induced variance and noise correlation reduction in macaque v1 is mediated by NMDA receptors. Neuron. 2013;78:729–739. doi: 10.1016/j.neuron.2013.03.029. [DOI] [PMC free article] [PubMed] [Google Scholar]
67.Manohar SG, et al. Reward pays the cost of noise reduction in motor and cognitive control. Curr. Biol. 2015;25:1707–1716. doi: 10.1016/j.cub.2015.05.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
68.Samuelson PA. Complementarity: an essay on the 40th anniversary of the Hicks-Allen revolution in demand theory. J. Econ. Lit. 1974;12:1255–1289. [Google Scholar]
69.Gul F. A theory of disappointment aversion. Econometrica. 1991;59:667–686. doi: 10.2307/2938223. [DOI] [Google Scholar]
70.Grinband J, Wager TD, Lindquist M, Ferrera VP, Hirsch J. Detection of time-varying signals in event-related fMRI designs. Neuroimage. 2008;43:509–520. doi: 10.1016/j.neuroimage.2008.07.065. [DOI] [PMC free article] [PubMed] [Google Scholar]
71.Smith DV, Gseir M, Speer ME, Delgado MR. Toward a cumulative science of functional integration: a meta-analysis of psychophysiological interactions. Humab Brain Mapp. 2016;2917:2904–2917. doi: 10.1002/hbm.23216. [DOI] [PMC free article] [PubMed] [Google Scholar]
72.Samuelson PS. A note on the pure theory of consumer’s behaviour. Economica. 1938;12:189–201. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supplementary Information^{(2.8MB, pdf)}

Transparent Peer Review File^{(2.1MB, pdf)}

Data Availability Statement

[CR1] 1.Afriat SN. The construction of utility functions from expenditure. Data. Int. Econ. Rev. (Phila.). 1967;8:67–77. doi: 10.2307/2525382. [DOI] [Google Scholar]

[CR2] 2.Varian HR. The nonparametric approach to demand analysis. Econometrica. 1982;50:945–973. doi: 10.2307/1912771. [DOI] [Google Scholar]

[CR3] 3.Simon HA. Rational choice and the structure of the environment. Psychol. Rev. 1956;63:129–138. doi: 10.1037/h0042769. [DOI] [PubMed] [Google Scholar]

[CR4] 4.Kahneman, D. & Tversky, A. Choiches, Values & Frames (Cambridge University Press, Cambridge, UK, 2000).

[CR5] 5.Tversky A. Intransitivity of preferences. Psychol. Rev. 1969;76:31–48. doi: 10.1037/h0026750. [DOI] [Google Scholar]

[CR6] 6.Tversky A, Kahneman D. The framing of decisions and the psychology of choice. Sci. (80-.). 1981;211:453–458. doi: 10.1126/science.7455683. [DOI] [PubMed] [Google Scholar]

[CR7] 7.Simon HA. A behavioral model of rational choice. Quaterly J. Econ. 1955;69:99–118. doi: 10.2307/1884852. [DOI] [Google Scholar]

[CR8] 8.Manzini P, Mariotti M. Sequentially rationalizable choice. Am. Econ. Rev. 2007;97:1824–1839. doi: 10.1257/aer.97.5.1824. [DOI] [Google Scholar]

[CR9] 9.Kahneman D, Tversky A. Prospect theory: an analysis of decision under risk. Econom. J. Econom. Soc. 1979;47:263–291. [Google Scholar]

[CR10] 10.Simon, H. A. Models of Bounded Rationality, Behavioral Economics and Business Organization, Vol. 2 (MIT Press, Cambridge, Mass.,1982).

[CR11] 11.Fisman R, Kariv S, Markovits D. Individual preferences for giving. Am. Econ. Rev. 2007;97:1858–1876. doi: 10.1257/aer.97.5.1858. [DOI] [Google Scholar]

[CR12] 12.Andreoni J, Miller J. Giving according to GARP: an experimental test of the consistency of preferences for Altruism. Econometrica. 2002;70:737–753. doi: 10.1111/1468-0262.00302. [DOI] [Google Scholar]

[CR13] 13.Choi S, Fisman R, Gale D, Kariv S. Consistency and heterogeneity of individual behavior under uncertainty. Am. Econ. Rev. 2007;97:1921–1938. doi: 10.1257/aer.97.5.1921. [DOI] [Google Scholar]

[CR14] 14.Dean M, Martin D. Measuring rationality with the minimum cost of revealed preference violations. Rev. Econ. Stat. 2016;98:524–534. doi: 10.1162/REST_a_00542. [DOI] [Google Scholar]

[CR15] 15.Echenique F, Lee S, Shum M. The money pump as a measure of revealed preference violations. J. Polit. Econ. 2011;119:1201–1223. doi: 10.1086/665011. [DOI] [Google Scholar]

[CR16] 16.Mosteller F, Nogee P. An experimental measurement of utility. J. Polit. Econ. 1951;59:371–404. doi: 10.1086/257106. [DOI] [Google Scholar]

[CR17] 17.Agranov M, Ortoleva P. Stochastic choice and preferences for randomization. J. Polit. Econ. 2017;125:40–68. doi: 10.1086/689774. [DOI] [Google Scholar]

[CR18] 18.McFadden D. Economic choices. Am. Econ. Rev. 2001;91:351–378. doi: 10.1257/aer.91.3.351. [DOI] [Google Scholar]

[CR19] 19.Gul F, Pesendorfer W. Random expected utility. Econometrica. 2006;74:121–146. doi: 10.1111/j.1468-0262.2006.00651.x. [DOI] [Google Scholar]

[CR20] 20.Webb, R., Glimcher, P. W., Levy, I., Stephanie, C. & Rutledge, R. B. Neural random utility: relating cardinal neural observables to stochastic choice behaviour. J. Neurosci. Psychol. Econ. 12, 45–72 (2019).

[CR21] 21.Bartra O, McGuire JT, Kable JW. The valuation system: a coordinate-based meta-analysis of BOLD fMRI experiments examining neural correlates of subjective value. Neuroimage. 2013;76:412–427. doi: 10.1016/j.neuroimage.2013.02.063. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR22] 22.Clithero JA, Rangel A. Informatic parcellation of the network involved in the computation of subjective value. Soc. Cogn. Affect. Neurosci. 2013;9:1289–1302. doi: 10.1093/scan/nst106. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR23] 23.Levy DJ, Glimcher PW. The root of all value: a neural common currency for choice. Curr. Opin. Neurobiol. 2012;22:1027–1038. doi: 10.1016/j.conb.2012.06.001. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR24] 24.Arieli A, Sterkin A, Grinvald A, Aertsen A. Dynamics of ongoing activity: explanation of the large variability in evoked cortical responses. Sci. (80-.). 1996;273:1868–1871. doi: 10.1126/science.273.5283.1868. [DOI] [PubMed] [Google Scholar]

[CR25] 25.Werner G, Mountcastle VB. The variability of cantral neural activity in a sensory system, and its implications for the central reflection of sensory events. J. Neurophysiol. 1963;26:958–977. doi: 10.1152/jn.1963.26.6.958. [DOI] [PubMed] [Google Scholar]

[CR26] 26.Schumacher JF, Thompson SK, Olman CA. Contrast response functions for single Gabor patches: ROI ‑ based analysis over ‑ represents low ‑ contrast patches for GE BOLD. Front. Syst. Neurosci. 2011;5:1–10. doi: 10.3389/fnsys.2011.00019. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR27] 27.Tolhurst DJ, Movshon JA, Dean AF. The statistical reliability of signals in single neurons in cat and monkey visual cortex. Vision. Res. 1983;23:775–785. doi: 10.1016/0042-6989(83)90200-6. [DOI] [PubMed] [Google Scholar]

[CR28] 28.Glimcher PW. Indeterminacy in brain and behavior. Annu. Rev. Psychol. 2005;56:25–56. doi: 10.1146/annurev.psych.55.090902.141429. [DOI] [PubMed] [Google Scholar]

[CR29] 29.Webb, R. The (neural) dynamics of stochastic choice. Manage. Sci. 65, 230–255 (2019).

[CR30] 30.Gold JI, Shadlen MN, Shadlen MN. Neural computations that underlie decisions about sensory stimuli. Trends Cogn. Sci. 2001;5:10–16. doi: 10.1016/S1364-6613(00)01567-9. [DOI] [PubMed] [Google Scholar]

[CR31] 31.Shadlen MN, Shohamy D. Perspective decision making and sequential sampling from memory. Neuron. 2016;90:927–939. doi: 10.1016/j.neuron.2016.04.036. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR32] 32.Woodford M. Stochastic choice: an optimizing neuroeconomic model. Am. Econ. Rev. 2014;104:495–500. doi: 10.1257/aer.104.5.495. [DOI] [Google Scholar]

[CR33] 33.Buzsaki G, Mizuseki K. The log-dynamic brain: how skewed distributions affect network operations. Nat. Rev. Neurosci. 2014;15:264–278. doi: 10.1038/nrn3687. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR34] 34.Afriat SN. On a system of inequalities in demand analysis: an extension of the classical method. Int. Econ. Rev. (Phila.). 1973;14:460–472. doi: 10.2307/2525934. [DOI] [Google Scholar]

[CR35] 35.Varian HR. Goodness-of-fit in optimizing models. J. Econom. 1990;46:125–140. doi: 10.1016/0304-4076(90)90051-T. [DOI] [Google Scholar]

[CR36] 36.Houtman M, Maks JAH. Determining all maximal data subsets consistent with revealed preference. Kwant. Methode. 1985;19:89–104. [Google Scholar]

[CR37] 37.Camille N, Griffiths CA, Vo K, Fellows LK, Kable JW. Ventromedial frontal lobe damage disrupts value maximization in Humans. J. Neurosci. 2011;31:7527–7532. doi: 10.1523/JNEUROSCI.6527-10.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR38] 38.Fellows LK, Farah MJ. The role of ventromedial prefrontal cortex in decision making: judgment under uncertainty or judgment per se? Cereb. Cortex. 2007;17:2669–2674. doi: 10.1093/cercor/bhl176. [DOI] [PubMed] [Google Scholar]

[CR39] 39.Chung H, Tymula A, Glimcher P. The reduction of ventrolateral prefrontal cortex grey matter volume correlates with loss of economic rationality in aging. J. Neurosci. 2017;37:12068–12077. doi: 10.1523/JNEUROSCI.1171-17.2017. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR40] 40.Kalenscher T, Tobler PN, Huijbers W, Daselaar SM, Pennartz CM. A. Neural signatures of intransitive preferences. Front. Hum. Neurosci. 2010;4:1–5. doi: 10.3389/fnhum.2010.00049. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR41] 41.Grueschow M, Polania R, Hare TA, Ruff CC. Automatic versus choice-dependent value representations in the human brain. Neuron. 2015;85:874–885. doi: 10.1016/j.neuron.2014.12.054. [DOI] [PubMed] [Google Scholar]

[CR42] 42.Hunt LT, Hayden BY. A distributed, hierarchical and recurrent framework for reward-based choice. Nat. Rev. Neurosci. 2017;18:172–182. doi: 10.1038/nrn.2017.7. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR43] 43.Kolling N, Behrens TEJ, Wittmann MK, Rushworth MFS. ScienceDirect multiple signals in anterior cingulate cortex. Curr. Opin. Neurobiol. 2016;37:36–43. doi: 10.1016/j.conb.2015.12.007. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR44] 44.Shenhav A, Straccia MA, Cohen JD, Botvinick MM. Anterior cingulate engagement in a foraging context reflects choice difficulty, not foraging value. Nat. Neurosci. 2014;17:1249–1254. doi: 10.1038/nn.3771. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR45] 45.Halevy Y, Persitz D, Zrill L. Parametric recoverability of preferences. J. Polit. Econ. 2018;126:1558–1593. doi: 10.1086/697741. [DOI] [Google Scholar]

[CR46] 46.Krajbich I, Armel C, Rangel A. Visual fixations and the computation and comparison of value in simple choice. Nat. Neurosci. 2010;13:1292–1298. doi: 10.1038/nn.2635. [DOI] [PubMed] [Google Scholar]

[CR47] 47.Basten U, Biele G, Heekeren HR, Fiebach CJ. How the brain integrates costs and benefits during decision making. Proc. Natl. Acad. Sci. U. S. A. 2010;107:21767–21772. doi: 10.1073/pnas.0908104107. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR48] 48.Krajbich I, Hare T, Bartling B, Morishima Y, Fehr E. A common mechanism underlying food choice and social decisions. PLoS Comput. Biol. 2015;11:1–24. doi: 10.1371/journal.pcbi.1004371. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR49] 49.Botvinick MM, Cohen JD, Carter CS. Conflict monitoring and anterior cingulate cortex: An update. Trends Cogn. Sci. 2004;8:539–546. doi: 10.1016/j.tics.2004.10.003. [DOI] [PubMed] [Google Scholar]

[CR50] 50.Lebreton, M., Abitbol, R., Daunizeau, J. & Pessiglione, M. Automatic integration of confidence in the brain valuation signal. Nat. Neurosci. 18, 1159–1167 (2015). [DOI] [PubMed]

[CR51] 51.Manski CF. No title. Theory Decis. 1977;8:229–230. doi: 10.1007/BF00133443. [DOI] [Google Scholar]

[CR52] 52.Hey JD. Why we should not be silent about noise. Exp. Econ. 2005;8:325–345. doi: 10.1007/s10683-005-5373-8. [DOI] [Google Scholar]

[CR53] 53.Apesteguia J, Ballester MA. Monotone stochastic choice models: the case of risk and time preferences. J. Polit. Econ. 2018;126:74–106. doi: 10.1086/695504. [DOI] [Google Scholar]

[CR54] 54.Polanía, R., Moisa, M., Opitz, A., Grueschow, M. & Ruff, C. C. The precision of value-based choices depends causally on fronto-parietal phase coupling. Nat. Commun. 6, 8090 (2015). [DOI] [PMC free article] [PubMed]

[CR55] 55.Hare TA, Schultz W, Camerer CF, Doherty JPO, Rangel A. Transformation of stimulus value signals into motor commands during simple choice. PNAS. 2011;108:18120–18125. doi: 10.1073/pnas.1109322108. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR56] 56.Faisal AA, Selen LPJ, Wolpert DM. Noise in the nervous system. Nat. Rev. Neurosci. 2008;9:292–303. doi: 10.1038/nrn2258. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR57] 57.Harris ChristopherM, M.Wolpert D. Signal-dependent noise determinesmotorplanning Christopher. Nature. 1998;394:780–784. doi: 10.1038/29528. [DOI] [PubMed] [Google Scholar]

[CR58] 58.Shadlen MN, Newsome WT. The variable discharge of cortical neurons: implications for connectivity, computation, and information coding. J. Neurosci. 1998;18:3870–3896. doi: 10.1523/JNEUROSCI.18-10-03870.1998. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR59] 59.Lennie P. The cost of cortical computation. Curr. Biol. 2003;13:493–497. doi: 10.1016/S0960-9822(03)00135-0. [DOI] [PubMed] [Google Scholar]

[CR60] 60.Fox MD, Snyder AZ, Vincent JL, Raichle ME. Intrinsic fluctuations within cortical systems account for intertrial variability in human behavior. Neuron. 2007;56:171–184. doi: 10.1016/j.neuron.2007.08.023. [DOI] [PubMed] [Google Scholar]

[CR61] 61.Drugowitsch J, et al. Computational precision of mental inference as critical source of human choice suboptimality. Neuron. 2016;92:1398–1411. doi: 10.1016/j.neuron.2016.11.005. [DOI] [PubMed] [Google Scholar]

[CR62] 62.Padoa-Schioppa C. Neuronal origins of choice variability in economic decisions. Neuron. 2013;80:1322–1336. doi: 10.1016/j.neuron.2013.09.013. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR63] 63.Kurikawa T, Haga T, Handa T, Harukuni R, Fukai T. Individual variability in decision-making. Nat. Neurosci. 2018;21:1764–1773. doi: 10.1038/s41593-018-0263-5. [DOI] [PubMed] [Google Scholar]

[CR64] 64.Denfield, G. H., Ecker, A. S., Shinn, T. J., Bethge, M. & Tolias, A. S. Attentional fluctuations induce shared variability in macaque primary visual cortex. Nat. Commun. 9, 2654 (2018). [DOI] [PMC free article] [PubMed]

[CR65] 65.Briggs F, Mangun GR, Usrey WM. Attention enhances synaptic efficacy and signal-to-noise in neural circuits. Nature. 2013;499:476–480. doi: 10.1038/nature12276. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR66] 66.Herrero JL, Gieselmann MA, Sanayei M, Thiele A. Attention-induced variance and noise correlation reduction in macaque v1 is mediated by NMDA receptors. Neuron. 2013;78:729–739. doi: 10.1016/j.neuron.2013.03.029. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR67] 67.Manohar SG, et al. Reward pays the cost of noise reduction in motor and cognitive control. Curr. Biol. 2015;25:1707–1716. doi: 10.1016/j.cub.2015.05.038. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR68] 68.Samuelson PA. Complementarity: an essay on the 40th anniversary of the Hicks-Allen revolution in demand theory. J. Econ. Lit. 1974;12:1255–1289. [Google Scholar]

[CR69] 69.Gul F. A theory of disappointment aversion. Econometrica. 1991;59:667–686. doi: 10.2307/2938223. [DOI] [Google Scholar]

[CR70] 70.Grinband J, Wager TD, Lindquist M, Ferrera VP, Hirsch J. Detection of time-varying signals in event-related fMRI designs. Neuroimage. 2008;43:509–520. doi: 10.1016/j.neuroimage.2008.07.065. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR71] 71.Smith DV, Gseir M, Speer ME, Delgado MR. Toward a cumulative science of functional integration: a meta-analysis of psychophysiological interactions. Humab Brain Mapp. 2016;2917:2904–2917. doi: 10.1002/hbm.23216. [DOI] [PMC free article] [PubMed] [Google Scholar]

[CR72] 72.Samuelson PS. A note on the pure theory of consumer’s behaviour. Economica. 1938;12:189–201. [Google Scholar]

PERMALINK

The neural computation of inconsistent choice behavior

Vered Kurtz-David

Dotan Persitz

Ryan Webb

Dino J Levy

Abstract

Introduction

Results

A novel trial-specific inconsistency index

Fig. 1.

Behavior

Fig. 2.

Fig. 3.

Fig. 4.

Neuroimaging

Fig. 5.

Motivation for using trial-specific MMI

A model of valuation and inconsistent choices

Fig. 6.

Fig. 7.

Table 1.

Dissociation of the SV and trial-specific MMI regressors

Controlling for other sources of choice inconsistency

Controlling for choice difficulty

Controlling for the role of confidence in decision-making

Robustness of the trial-specific MMI

Discussion

Methods

Participants

Experimental task

fMRI session

Instructions and pre-scan practice

Image acquisition

fMRI data preprocessing

The General Axiom of Revealed Preference (GARP)

Aggregate inconsistency indices

Aggregate MMI

Trial-specific index

Parametric utility

Assessing the NRUM and inconsistency

Whole-brain analysis of choice inconsistency

ROI analysis of choice inconsistency

PPI analysis

Orthogonality analysis

Measuring choice difficulty

Controlling for changes in behavior

Controlling for the misspecification

Between-subjects analysis

Functional localizers

Supplementary information

Acknowledgements

Author contributions

Data and code availability

Competing interests

Footnotes

Supplementary information

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases