Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2023 May 18.
Published in final edited form as: Neuron. 2022 Mar 24;110(10):1615–1630. doi: 10.1016/j.neuron.2022.03.002

Logistic analysis of choice data: A primer

Camillo Padoa-Schioppa 1
PMCID: PMC9119943  NIHMSID: NIHMS1792657  PMID: 35334232

Abstract

Logistic regressions were developed in economics to model individual choice behavior. In recent years, they have become an important tool in decision neuroscience. Here I describe and discuss different logistic models, emphasizing the underlying assumptions and possible interpretations. Logistic models may be used to quantify a variety of behavioral traits, including the relative subjective value of different goods, the choice accuracy, risk attitudes, and choice biases. More complex logistic models can be used for choices between good bundles, in cases of non-linear value functions, and for choices between multiple options. Finally, logistic models can quantify the explanatory power of neuronal activity on choices, thus providing a valid alternative to receiver operating characteristic (ROC) analyses.

Introduction

A growing number of neuroscience studies focuses on the neural underpinnings of economic choices. In many experiments, human or animal subjects choose between different goods that vary on two or more dimensions. The subjective values of the goods are inferred from choices and used to interpret neural activity. The analytical tools used to infer subjective values vary across studies, but they often involve some form of logistic regression – an approach first developed in the economics literature (McFadden, 1974; Cramer, 2002). Logistic models can quantify a variety of effects such as choice accuracy (or its reciprocal, choice variability), risk attitudes, temporal discounting, and choice biases. Logistic models can also be used for more complex choices such as choices between good bundles and ternary choices. Furthermore, logistic models can quantify the (marginal) explanatory power of neuronal activity. The proliferation of logistic models and the rationale for adopting a particular model for a particular set of choices might seem unclear. Importantly, while using any logistic model, it is important to keep in mind the underlying assumptions and recognize the possible interpretations. The purpose of this primer is to provide a conceptual and practical tool for neuroscience researchers analyzing choice data. We present a unified formalism and an interpretation for logistic models that may be used in a variety of experimental conditions, and we indicate relevant examples in the literature. We illustrate the practical implications of common assumptions and discuss the interpretation of specific models. An exhaustive review of the literature on choice behavior is beyond our purposes. The examples discussed in the article are drawn from experiments in which nonhuman primates chose between different types of juices. However, all the models discussed here apply to a broad class of experimental designs and protocols.

Choices between two goods offered in variable quantities

Consider the experiment illustrated in Fig.1. In this session, a rhesus monkey chose between two juices, namely cherry juice and hawaiian punch. Offers were represented by circles on a computer monitor. Each juice was associated with a color, and the quantity offered on any given trial was represented by the radius of the corresponding circle. The animal indicated its choice with an eye movement, and the chosen juice was delivered at the end of each trial. Offered quantities varied randomly from trial to trial between 50 and 300 μl and between 100 and 600 μl, respectively. Fig.1B illustrates the choice the animal made in each trial. In this session, the animal preferred cherry juice, meaning that when the two juices were offered in equal quantities the animal consistently chose cherry juice. Henceforth, we will refer to the preferred juice as juice A and to the other juice as juice B. We will indicate with qA and qB the juice quantities offered on any given trial.

Figure 1.

Figure 1.

Choices between two juices offered in variable amounts. A. Experimental design. A monkey sat in front of a computer monitor with the head restrained. Offers appeared on the monitor and the gaze direction was monitored by a video camera. Two juice lines were placed in front of the animal’s mouth. At the beginning of each trial, the monkey fixated a dot in the center of the monitor. Two offers, represented by two color circles, appeared on the two sides of the fixation point. For each offer, the color indicated the juice type and the radius indicated the juice amount. The animal maintained center fixation for a randomly variable delay (1–2 s), at the end of which the center fixation point dimmed and the offers were replaced by a two saccade targets (go signal). The animal indicated its choice by gazing to one of the targets; it maintained target fixation for 0.75 s, at the end of which the chosen juice was delivered directly to the mouth. B. Example session, joint distribution of offers and choices. The y- and x-axis represent the offered quantities qA and qB. Each symbol represents one trial and the color indicates the chosen juice (red or blue for A or B, respectively). All juice quantities are expressed in units of a conventional quantum q = 100 μl. The session included 560 trials and joint distribution of offers was uniform within a domain. The logistic regression provided the measures ρ = 2.4 and η = 5.1. The black line indicates the indifference line ρ qA = qB. C. Choice frequency. The panel illustrates the same data as in B binning trials. For qA (y-axis) we used a bin size = 1/2; for qB (x-axis) we used a bin size = ρ/2. Thus in value units the bin size was the same for bot axes. Gray shades indicate the fraction of trials in which the animal chose juice B. D. Fitted sigmoid. We computed the log quantity ratio for each trials and we binned trials (bin size = 0.5; x-axis). For each bin, we computed the fraction of trials in which the animal chose juice B (y-axis). Black dots represent the fraction of B choices for each bin, and the gray sigmoid is that obtained from the logistic fit. The green dotted line highlights the flex point (log(qB/qA) = log(ρ)); the yellow dotted line highlights the derivative of the sigmoid through the flex point (η/4).

Given offers qA and qB, the animal chose juice A with probability PA and juice B with probability PB = 1 − PA. The goal of the logistic analysis is to model choices – that is, to provide a function that accurately relates PA and PB to qA and qB. We can indicate with PJ the probability of choosing juice J (J = A or B). The basic logistic model assumes that the probability of choosing juice J on any given trial is proportional to the value of juice J offered in that trial raised to some power. In formulas:

PJ=VJηIVIη (1)

where VJ is the value of juice J and I = A, B. The sum at the denominator is a normalization ensuring that probabilities add up to 1. The parameter η, which will be fitted, captures the choice accuracy (or choice consistency). To appreciate this point, consider how probabilities PA and PB change with η, given two values VA and VB. For example, consider the case in which VA is slightly larger than VB. When η becomes large, the term VAη in the denominator becomes increasingly dominant. Hence, PA becomes close to 1 and PB becomes close to 0. Conversely, when η is small, choices are more evenly split. Thus the larger η, the more the animal is likely to choose the higher value.

We can now re-write the probability of choosing juice B as follows:

PB=VBηVAη+VBη=eη log(VB)eη log(VA)+eη log(VB)=11+eη log(VB/VA) (2)

where log indicates the natural logarithm. The next step is to specify a value function. We assume the following function:

VJ=ρJqJ (3)

In essence, the value VJ equals the quantity qJ multiplied by a factor ρJ that expresses the subjective value of one unit volume of juice J. Notably, that the animal prefers juice A to juice B means ρA > ρB. Parameters ρA and ρB are to be fitted. However, Eqs.23 reveal that the logistic model only depends on the ratio ρ = ρA/ρB. Ultimately, this means that any value can only be measured relative to some other value (i.e., in some value units). Without losing generality, we may measure values in units of juice B and thus set ρB = 1. The parameter ρ is referred to as the “relative value”.

The logistic model has two free parameters, namely η and ρ. Substituting Eq.3 in Eq.2, we obtain the standard expression for a logistic model:

PB=11+eX (4)
X=a0+a1 log(qB/qA) (5)

where we defined:

a0=η log(ρ) (6)
a1=η (7)

Inverting Eq.4 reveals that

X=log(PB/(1PB))=log(PB/PA) (8)

The probability ratio PB/PA is termed “odds” and X is referred to as the log odds.

The goal of this analysis is to represent the data as accurately as possible, given the assumptions of the model. To do so, one performs a maximum likelihood estimation, defining the objective function

F=i=1NPchosen juicei (9)

where N is the number of trials in the session and Pchosen juicei is the probability that on trial i the animal would choose the juice it actually chose. This probability is computed using Eqs.45. To do a logistic regression means to identify values a0^ and a1^ that maximize F.

Of note, F in Eq.9 includes all trials except forced choices – i.e., trials for which either qA = 0 or qB = 0. Forced choices are excluded from the regression because the log in Eq.5 would be infinite.

Consider now Eqs.45. First, note that the function relating PB and log(qB/qA) is a sigmoid (Fig.1D). Second, resolving Eq.6, we obtain the estimate for the relative value ρ=ea0^/a1^. Eqs.45 reveal that ρ equals the quantity ratio qB/qA that makes the animal indifferent between the two offers. To appreciate this point, note that the indifference condition PB = 0.5 is equivalent to X = 0, which implies qB/qA=ea0^/a1^.

We may also compute the first derivative of the sigmoid function (Fig.1D) at the indifference point:

dPBd(log(qB/qA))|X=0=1(1+eX)2dXd(log(qB/qA))|X=0=a1^/4 (10)

Thus a1^/4 is the steepness of the sigmoid, confirming that η=a1^ is a measure of choice accuracy (the steeper the sigmoid, the more consistent are choices). Hence, η is inversely related to choice variability. In the language of statistical mechanics, η is referred to as the “inverse temperature”.

For the experimental session in Fig.1, the logistic regression provides the estimates ρ = 2.4 and η = 5.1.

Alternative logistic model: value difference versus log value ratio

A frequently used alternative to Eq.1 is a logistic model in which the probability of choosing juice J on any given trial is proportional to eηVJ. In other words,

PJ=eηVJIeηVI (11)

Again, η is a measure of choice accuracy. Numerous neuroscience studies have used this model (Kim et al., 2008; Amemori and Graybiel, 2012; Hunt et al., 2012; Ferrari-Toniolo et al., 2021). As in the previous section, we may assume the following value function:

VJ=ρJqJ (12)

The probability of choosing juice B may thus be written as follows:

PB=eηVBeηVA+eηVB=11+eη(VBVA) (13)

Again, we may measure values in units of juice B and thus set ρB = 1. We may also define ρ = ρA/ρB. We thus obtain the standard expression:

PB=11+eX
X=a0qA+a1qB (14)

where we defined a0 = −ηρ and a1 = η. After the logistic regression, we obtain the fitted values for ρ and η. As in the logistic model discussed in the previous section, ρ equals the quantity ratio qB/qA that makes the animal indifferent between the two offers and η is proportional to the sigmoid steepness at the indifference point.

We might refer to the model discussed in the previous section as the “log value ratio model” (because of Eq.2) and to the model described in this section as the “value difference model” (because of Eq.13). These two models make different predictions. In particular, referring to the experiment of Fig.1, consider the two sets of quantities [qA, qB] such that the animal chooses juice B 25% of the time (set 1) or 75% of the time (set 2). Consider now the plane defined by the two quantities. In the log value ratio model, PB = const implies qB/qA = const. Thus this model predicts that set 1 and set 2 are two straight lines that cross the axes origin and have different slopes (Fig.2A). Conversely, in the value difference model, the condition PB = const implies qBqA = const. Thus this model predicts that set 1 and set 2 are two straight lines that are parallel to each other and to the indifference line (PB=0.5) and that do not cross the axes origin (Fig.2B).

Figure 2.

Figure 2.

Comparing logistic models (cartoon). A. Log value ratio model. The two axes represent the offer values in units of juice B. The solid line represents the indifference line (PB=0.5); the two dotted line represent the conditions for which the animal chooses juice B 25% and 75% of the time (PB=0.25 and PB=0.75, respectively). In other words, the dotted lines capture the shape and the width of the fitted sigmoid. B. Value difference model. Same conventions as in panel A. In this model, the animal would split choices even when one of the two offers is vanishingly small (e.g., in this cartoon, for qA = 0 and qB = 1, PB = 0.75).

Intuitively, the former prediction seems more sensible – it captures the idea that the difficulty of choosing between $9 and $10 is similar to the difficulty of choosing between $0.9 and $1, while choosing between $0 and $1 is much easier. Another advantage of the log value ratio model is that parameters a0 and a1 are pure numbers and do not depend on the particular units used to express quantities or values. Finally, using value ratios makes it particularly convenient to write the logistic model when the value function is a product – e.g., when the two determinants of value are probability and quantity, or quantity and time. Conversely, the value difference model is more convenient when the value function is a sum – e.g., when the two determinants are quantity and cost, or when options are bundles. Thus for most of this article we will continue to use the log value ratio model. However, in the discussion of choices between bundles, we will use the value difference model.

Quantifying risk attitudes

In the following sections, we show how logistic analyses may be used to quantify a variety of behavioral phenomena and neural effects. We start by the analysis of choices under risk – i.e., choices between goods that might be obtained with probability <1. The value of a desirable good offered with probability p < 1 naturally depends on p. Moreover, choices often reflect some “risk attitude”. For example, many people would chose $490 for sure to a gamble that pays $1000 with probability p = 0.5 and $0 otherwise. Thus many choices under risk are not simply driven by the expected value of the offers. Risk attitudes can be quantified using logistic analyses and numerous neuroscience studies have taken this approach (Levy et al., 2010; Yamada et al., 2013; Raghuraman and Padoa-Schioppa, 2014; Strait et al., 2014; Ferrari-Toniolo et al., 2021).

Consider the experiment illustrated in Fig.3. In this session, a monkey chose between two juices, labeled A and B, offered in variable quantities and probabilities. Offers were represented by pie-shaped images. For each offer, the color represented the juice type, the radius of the pie represented the quantity, and the angle of the pie represented the probability (Fig.3A). For juice J (= A or B), we indicate the probability and quantity offered on any given trial respectively with pJ and qJ. We define the expected quantity as the product EQJ = pJ qJ. In the experiment, for each juice J, pJ and qJ varied randomly and independently from each other, subject to the constraint that EQJ be above a fixed minimum (Fig.3BC). Fig.3D illustrates the choices made by the animal in each trial as a function of the expected quantities EQA and EQB. The preferred juice was conventionally labeled as juice A.

Figure 3.

Figure 3.

Choices under uncertainty: quantifying the risk attitude. A. Experimental design. The trial structure was similar to that of Fig.1. However, in this experiment, offers were represented by incomplete pies. The color indicated the juice type, the radius indicates the juice amount, and the filled angle represented the probability. For example, in the trial shown here, juice A (red) was offered in larger quantity but smaller probability than juice B (blue). B. Example session, offer A distribution. The two axes represent probability (pA) and quantity (qA), each symbol represents one trial, and the color represents the chosen juice. Within the quantity range [0 4] and the probability range [0 1], the distribution of offers was uniform subject to the constraint that the expected quantity be not too small (pA qA ≥ max(qA)/10). C. Example session, offer B distribution. Same format as panel B. D. Joint distribution, expected quantities. In this plot, the two axes represent the two expected quantities, namely EQA = pA qA (y-axis) and EQB = pB qB (x-axis). E. Joint distribution, expected values. Here the two axes represent the two expected values, namely EVA=pAγ ρ qA and EVB=pBγ qB. On both axes, values are expressed in units of juice B. The black line is the indifference line (PB=0.5) and indicate the chosen juice. This plot was based on the results of the logistic analysis. Notably, the indifference line divides A choices (red) and B choices (blue) better than any straight line one could draw in panel D (because γ≠1). F. Same data as in panel E, binning trials. Gray shades indicate the fraction of trials in which the animal chose juice B. Bins with <5 trials were excluded. G. Fitted sigmoid surface. Axes represent the log quantity ratio, the log probability ratio and the frequency of B choices, respectively.

We indicate with PJ the probability that on any given trial the animal chose juice J. Again, the goal of the logistic analysis is to provide a function that accurately relates PA and PB to pA, pB, qA and qB. As for the case discussed above, where pA = pB = 1, the logistic model assumes that PJ is proportional to the value of juice J raised to some power (Eq.1). In this case, however, the value function also depends on the probability. We assume the following value function:

VJ=pJγρJqJ (15)

Again, without losing generality, we may set ρB = 1 and thus measure values in units of juice B. Thus the resulting logistic model has three free parameters: η is the inverse temperature, ρ = ρA/ρB is the relative value, and γ captures the risk attitude. Specifically, γ < 1 means that the monkey treats probabilities as higher than they are (pJγ>pJ; risk seeking). Conversely, γ > 1 means that the monkey treats probabilities as lower than they are (pJγ<pJ; risk aversion). If the monkey makes a very large number of choices, the condition γ = 1 (pJγ=pJ; risk indifference) yields the maximum payoff. In this sense, risk indifference is optimal.

An alternative way to model choices under risk is to use the following value function:

VJ=pJρJqJκ (16)

Eq.15 assumes that risk attitudes reflect a distortion in the computation of subjective probabilities. Conversely, Eq.16 attributes risk attitudes to a distortion of subjective quantities. The latter formulation is referred to as “expected utility theory” (Kreps, 1990). We generally prefer Eq.15 because it makes it transparent that the behavioral phenomenon – risk aversion or risk seeking – is related to the attitude toward probabilities. Furthermore, in this formulation, choices without risk, discussed in the first section, are a specific case. In other words, Eq.15 reduces to Eq.3 when all pJ = 1. Thus we will continue the presentation referring to this model.

Repeating the steps taken in Eq.2, we re-write the probability of choosing juice B as follows:

PB=11+eX
X=a0+a1 log(pB/pA)+a2 log(qB/qA) (17)

where we defined a0 = −η log(ρ), a1 = η γ and a2 = η. Once the logistic regression has been performed, we can compute the parameters that characterize choices, namely the relative value ρ=ea0^/a2^, the inverse temperature η=a2^, and the risk attitude γ=a1^/a2^.

For the experimental session in Fig.3, the logistic regression provided the estimates ρ = 2.1, γ = 0.53 and η = 6.2. Notably, γ < 1 indicates that the animal was risk seeking – a common finding in monkeys (Heilbronner and Hayden, 2013). Choices made by the animal in this session may also be plotted as a function of the expected values EVA=pAγ ρ qA and EVB=pBγ qB (Fig.3EF). In essence, the expected values are similar to the expected quantities (Fig.3D), except that they incorporate the relative value (ρ) and the probability distortion (γ). We observe that the two sets of data points corresponding to choices of juice A and juice B are more clearly separated in Fig.3E than in Fig.3D, confirming the fact that the parameter γ captures an important trait of the animal’s behavior.

Quantifying choice biases

Human and animal choices may be affected by a variety of biases (Kagel et al., 1995). In many cases, such biases have a binary form. For example, in the experiment of Fig.1, other things equal, the animal might be more likely to choose the offer presented on the right (side bias). Similarly, numerous studies reported a history-dependent bias termed choice hysteresis – other things equal, subjects were more likely to choose on any given trial the same good chosen in the previous trial (Padoa-Schioppa, 2013; Alos-Ferrer et al., 2016; Schoemann and Scherbaum, 2019). In other experiments, monkeys chose between two options offered with probability <1. For one of the two options, information about the trial outcome (good or poor luck) would be revealed to the animal before the end of the trial. Even though this information did not affect the reward likelihood, animals’ choices were biased in favor of the informative good (Blanchard et al., 2015). All of these biases can be effectively quantified with a logistic analysis. To illustrate this point, we examine an experiment of choices under sequential offers (Fig.4).

Figure 4.

Figure 4.

Choices under sequential offers: quantifying the order bias. A. Experimental design. The experimental design was similar to that in Fig.1. However, in this experiment, offer quantities were quantized and the two offers were presented sequentially. For each offer, the color indicated the juice type and the number of squares indicated the juice amount (in multiples of a fixed quantum). For each pair of offer quantities, the presentation order varied pseudorandomly. The two saccade targets had different colors associated with the two juices, and the left/right position of saccade targets was independent of the order. B. Joint distribution of offers. Dotted lines represent the indifference lines for AB trials (green), BA trials (purple) and all trials pooled (black). C. Choice pattern. An offer type was defined by two offered quantities in a particular order. In the plot, each symbol represents one offer type. Green and purple are associated with AB trials and BA trials, respectively. The sigmoid were obtained from the logistic regression (Eq.21). Other things equal, the animal was more likely to choose juice A in when juice A was offered second (BA trials). In other words, the animal had a bias favoring offer2 (order bias ϵ′ > 0).

In each session, a monkey chose between two juices, labeled A and B, offered in variable amounts. In this case, however, the two juices were presented sequentially. Offers were represented by sets of squares displayed on a computed monitor. For each offer, the color represented the juice type and the number of squares represented the quantity (offers were quantized; Fig5AB). For each pair of quantities [qA, qB], the presentation order varied randomly. We refer to trials in which juice A was offered first and trials in which juice B was offered first as “AB trials” and “BA trials”, respectively. We also refer to the first and second offers as “offer1” and “offer2”, respectively. This situation closely resembles that examined in the first section (Fig.1). However, as in other studies where offers were presented in sequence (Krajbich et al., 2010; Ballesta and Padoa-Schioppa, 2019; Rustichini et al., 2021), choices in this session where generally biased in favor of the second offer (Fig.4B). We may model this phenomenon as though the value of any particular good was reduced or enhanced when that good was offered first or second. We can thus write the logistic model as in Eq.1 and write the following value functions:

VA=ρA qA(1ϵ δorder,AB+ϵ δorder,BA) (18)
VB=ρB qB(1+ϵ δorder,ABϵ δorder,BA) (19)

where ϵ is a parameter that captures the order bias, δorder,AB = 1 in AB trials and 0 otherwise, and δorder, BA = 1 − δorder,AB. In practice, in each of Eqs.1819, one δ term always equals one and the other equals zero. Thus each value is multiplied by (1 − ϵ) or (1 + ϵ) depending on the presentation order. Note also that the sign of ϵ captures the direction of the effect. Specifically, if ϵ < 0, the bias favors offer1; if ϵ > 0, the bias favors offer2. As usual, we set ρB = 1 and define ρ = ρA/ρB. We can thus compute the log value ratio:

log(VB/VA)log(ρ)+log(qB/qA)+2ϵ(δorder,ABδorder,BA) (20)

where we have used Taylor’s expansion log(1 + x) ≈ x. Repeating the steps taken in Eq.2, we write the probability of choosing juice B as follows:

PB=11+eX
X=a0+a1 log(qB/qA)+a2(δorder,ABδorder,BA) (21)

where we defined a0 = −η log(ρ), a1 = η, and a2 = η ϵ. Intuitively, Eq.21 describes two sigmoids, one for AB trials and the other for BA trials. The two sigmoids are parallel to each other (same steepness a1) but they have different flex points (corresponding to (a0 ± a2)/a1) (Fig.4C).

Figure 5.

Figure 5.

Ternary choices. A. Experimental design. In this experiment, the animal chose between three options that varied for the juice type, the quantity and the probability. Each offer was represented by an incomplete pie. The color indicated the juice type, the radius indicated the juice quantity, and the filled angle represented the probability. In each trial, the centers of the three pies were placed on an invisible circle surrounding the fixation point, 120 degrees apart from each other. B. Offer distribution for the three juices. The three panels refer to juices A, B and C, respectively. In each panel, the two axes represent probability and quantity, each symbol represents one trial, and the color represents the chosen juice. The distribution of offers was uniform within a domain and subject to the constraint that the expected quantity be not below a set minimum. C. Joint distribution, expected values. This simplex plot (or ternary plot) illustrates data that varied on three dimensions, namely EVA, EVB and EVC. Each data point represents one trial and, for each trial, data are normalized by the sum of the three expected values. Each corner of the triangle corresponds to one juice type. Consider the top corner (juice C) and the bottom side opposite to it. For each point in the triangle, one may trace the line passing through that point and the top corner. On that line, the distance between the bottom side and the point equals the normalized EVC. Similarly, the equivalent distances between each side and the opposite corner are the normalized EVA and EVB. This session included binary choices (one of the goods = null) and forced choices (two of the goods = null). Data points on the three sides of the triangle represent binary choices, while data on the three corners represent forced choices. Trials in which the normalized EVC is large are close to the top corner; trials in which the normalized EVC is small are close to the bottom side (similarly for the other juices). Notably, the three dimensions are identified up to a factor (i.e., if the three EV are doubled, the position does not change). In this plot, the color of each data point also represents the expected values in RBG format, where the red, blue and green components of the color are set by EVA, EVB and EVC, respectively. Thus for any position on the triangle, the point’s brightness captures the value scale. D. Joint distribution, chosen juice. Same plot as in panel C, except that colors indicate the chosen juice. Notably, the three clouds of red, blue and green points are relatively well, but not perfectly, separated.

Once the logistic regression has been performed, we compute the parameters characterizing choices, namely the relative value ρ=ea0^/a1^, the inverse temperature η=a1^, and the order bias ϵ=a2^/a1^. An alternative and valid definition for the order bias is ϵ=2ρa2^/a1^. One advantage of this definition is that the order bias quantifies explicitly the distance between the two sigmoids. Indeed, the relative values specific to AB trials and BA trials may be defined, respectively, as ρAB=e(a0^+a2^)/a1^ and ρBA=e(a0^a2^)/a1^. Using Taylor’s expansion, we may then observe that ϵ′ ≈ ρBAρAB. For the experimental session in Fig.4A, the logistic regression provided the estimates ρ = 1.5, η = 3.8 and ϵ′ = 0.55.

The same approach has been used to model choice hysteresis (Padoa-Schioppa, 2013). In this case, the probability of choosing juice B is written as follows:

PB=11+eX
X=a0+a1 log(qB/qA)+a2(δprev choice,Bδprev choice,A) (22)

where δprev choice,J = 1 if in the previous trial the animal chose juice J and 0 otherwise. Again, Eq.22 describes two parallel sigmoids, one for trials following choices of juice A and the other for trials following choices of juice B. Following the logistic regression, one can compute the parameters characterizing choices, namely relative value ρ=ea0^/a1^, inverse temperature η=a1^, and choice hysteresis ξ=2 ρ a2^/a1^.

For any set of choices, one can also examine multiple choice biases, separately or at once (Cai and Padoa-Schioppa, 2019; Constantinople et al., 2019; Kuwabara et al., 2020). In essence, with a single analysis, one can model a family of parallel sigmoids (same inverse temperature). One caveat is that different variables included in the log odds X must be independent of each other. Also, when we fit a family of sigmoids, other things equal, the statistical power of each fitted parameter is reduced. Hence, one typically needs more trials.

Logistic models for ternary choices

All the cases discussed so far involved choices between two options. However, many daily situations involve choices between three or more options. Here we derive the logistic analysis for ternary choices. Consider the experiment illustrated in Fig.5. In this session, a monkey chose between three juices offered in variable quantities and probabilities. We label the three juices A, B and C based on the preference order. Offers were represented by pie-shaped images. For each offer, the color represented the juice type, the pie radius represented the quantity, and the pie angle represented the probability (Fig.5A). For juice J (= A, B or C), we indicate with pJ and qJ the probability and quantity offered on any given trial. We defined the expected quantity as the product EQJ = pJ qJ. For each juice J, pJ and qJ varied randomly and independently from each other, subject to the constraint that EQJ be above a fixed minimum (Fig.5B).

As for binary choices, the logistic model assumes that the probability PJ of choosing juice J is proportional to the value VJ raised to some power. Furthermore, we can assume the same value function used for binary choices under risk. Thus the logistic model is:

PJ=VJηIVIη
VJ=pJγρJqJ (23)

with I and J = A, B or C. Without losing generality, we express values in units of juice C and thus set ρC = 1. We thus indicate the relative values ρA/ρC and ρB/ρC simply as ρA and ρB, respectively. We can now write the probability of choosing each juice. For example, the probability of choosing juice A is

PA=eη log(VA)eη log(VA)+eη log(VB)+eη log(VC)
PA=eη log(pAγρAqA)eη log(pAγρAqA)+eη log(pBγρBqB)+eη log(pCγqC)
PA=11+eηlog(ρB/ρA)+ηγlog(pB/pA)+ηlog(qB/qA)+eηlog(1/ρA)+ηγlog(pC/pA)+ηlog(qC/qA)
PA=11+e(a0a1+a2log(pA/pB)+a3log(qA/qB))+e(a0+a2log(pA/pC)+a3log(qA/qC)) (24)

where we defined a0 = η log(ρA), a1 = η log(ρB), a2 = η γ, and a3 = η. Similarly, we can compute:

PB=11+e(a0+a1+a2log(pB/pA)+a3log(qB/qA))+e(a1+a2log(pB/pC)+a3log(qB/qC)) (25)
PC=11+e(a0+a2log(pC/pA)+a3log(qC/qA))+e(a1+a2log(pC/pB)+a3log(qC/qB)) (26)

The maximum likelihood estimation is computed as in the binary case (Eq.9) using the objective function

F=i=1NPchosen juicei (27)

where N is the number of trials and the probabilities Pchosen juicei are computed using Eqs.2426. As a supplement to this primer, we provide a Matlab function that performs logistic analysis for ternary choices.

For the experimental session in Fig.5, the logistic regression provides the estimates ρA = 3.6, ρB = 2.7, γ = 0.9 and η = 5.3. Based on these fitted parameters, we can compute the three expected values offered in each trial EVA=pAγ ρA qA, EVB=pBγ ρB qB and EVC=pCγ qC, and then construct a simplex plot in which the three expected values determine the position on the triangle (Fig.5CD). Using this format Fig.5D illustrates the choices made by the animal in each trial as a function of the expected values EVA, EVB and EVC.

Choices between bundles

In many situations, options available for choice are not individual goods but rather bundles of goods. For example, while contemplating a restaurant menu, one might choose between a dish of salmon and broccoli and one of tofu and bok choy. Several neuroscience studies have focused on choices between bundles (FitzGerald et al., 2009; Pastor-Bernier et al., 2019; Pastor-Bernier and Schultz, 2021). To discuss logistic analyses of such choices, we refer to the experiment illustrated in Fig.6. In this session, a monkey chose between two bundles, labeled T1 and T2, each constituted by two juices. Different juices constituted each bundle, and the 4 juices were labeled A, B, C and D. Bundle T1 included juices A and B; bundle T2 included juices C and D. Offers were represented by sets of symbols on a computer monitor. The color of the symbols represented the juice type and the number of symbols indicated the quantity. In each bundle, the preferred juice (A for T1; C for T2) was represented by circles and located on the upper half of the screen; the less preferred juice (B for T1; D for T2) was represented by squares and located on the lower half of the screen. Thus in the trial depicted in Fig.6A the animal chose between T1 = [2A+3B] and T2 = [3C+4D]. The monkey revealed its choice with a saccade. After a brief delay, the two juices in the chosen bundle were delivered sequentially, starting from the preferred juice and with a 1 s delay interval between the two juices. (Thus temporal discounting effects remained constant for each juice and can be considered incorporated in the juice’s relative value.)

Figure 6.

Figure 6.

Choices between juice bundles. A. Experimental design. This experiment involved 4 different juices labeled A, B, C and D. Each offer was a bundle of two juices (T1 or T2). Offered quantities were quantized. In the trial shown here, the animal chose between bundle T1, including 2A and 3B, and bundle T2, including 3C and 4D. After the animal indicated its choice with a saccade, the two juices in the chosen bundle were delivered sequentially, starting with the preferred one, and with a 1 s delay between them. Bundles were constructed such that juice A was preferred to juice B and juice C was preferred to juice D. B. B. Joint distribution of quantities in bundle T1. C. Joint distribution of quantities in bundle T2. D. Choice pattern. The x- and y-axis represent the values of bundles T1 and T2, respectively. Each point represents on offer type (defined by 4 juice quantities) and the color represents the fraction of trials in which the animal chose bundle T2 (see color bar). The dotted line is the indifference line.

To model choices between bundles, it is most convenient to use a value difference model. Thus the logistic can be written as follows:

PR=eηVRSeηVS (28)
VT1=ρAqA+ρBqB (29)
VT2=ρCqC+ρDqD (30)

where R and S = T1 or T2. As in other cases discussed above, the model in Eqs.2829 has one parameter too many (because η multiplies all the ρJ). We may thus choose to measure values in units of juice D by setting ρD = 1. We thus write the probability of choosing bundle T2 as follows:

PT2=11+eX
X=a1qA+a2qB+a3qC+a4qD (31)

where defined a1 = −ηρA, a2 = −ηρB, a3 = ηρC, and a4 = η. The relative values ρA, ρB and ρC and the inverse temperature η can be computed after the logistic regression from the fitted parameters. For the session illustrated in Fig.6, this analysis provided the estimates ρA = 2.3, ρB = 1.4, and ρC = 2.0.

Non-linear value functions

Throughout this primer, we always assumed that the value of a quantity qJ of juice J was proportional to qJ (Eq.3; linear value function). In Eqs.2930 we have also assumed that the value of a bundle equals the sum of the components’ values (additivity). In many situations, however, value functions might be non-linear and/or non-additive. For example, a common intuition is that the subjective value of an additional $100 is higher in times of poverty and lower in times of prosperity (sub-linear value function, or diminishing marginal returns). Along similar lines, the subjective value of bread and cheese together is intuitively different from the value sum of bread alone and cheese alone. These phenomena can be examined using the same logistic model as in Eq.26 while including quadratic and/or interaction terms in the value functions:

VT1=α0qA+α1qB+α2qA2+α3qB2+α4qAqB (32)
VT2=β0qC+β1qD+β2qC2+β3qD2+β4qCqD (33)

In essence, the quadratic terms α2, α3, β2 and β3 capture the non-linearities, while the interaction terms α4 and β4 capture the non-additivities. The logistic regression then proceeds as for linear value functions. For the session illustrated in Fig.6, this analysis indicated that all quadratic terms were <0 (sub-linearity). Furthermore, both interaction terms were negative (subadditivity). This was generally true across sessions (Xie and Padoa-Schioppa), and other studies yielded similar results (Pastor-Bernier et al., 2019).

The last observation might be viewed as a challenge to the linearity assumption made in the previous sections. To mitigate this concern, referring to the first section, we note that in the log value model non-linear value functions (Eq.3) are naturally expressed by elevating the product ρJ qJ to some exponent κJ. If exponents κA and κB do not differ from each other, both of them are effectively absorbed in the inverse temperature η (Eq.2). Thus, in reality, the value ratio models discussed in previous sections only assume that all value functions have the same nonlinearity (i.e., κA = κB). This assumption is substantially weaker than that of linear value functions. For the experiment illustrated in Fig.1, it essentially amounts to assuming that the set of quantities [qA, qB] that make the animal indifferent between the two juices form a straight line. For this experiment, the assumption seems reasonable (Fig.1). Supporting this point, for the experiment in Fig.6, the quadratic terms α2, α3, β2 and β3, while all <0, did not differ significantly from each other.

Including neuronal activity in logistic regressions

A number of studies have found that neurons in several brain regions encode the values of offered and/or chosen goods (Wallis and Kennerley, 2010; Schultz, 2015). This fact validates the classic idea that choices rely on the computation and comparison of subjective values. However, values can guide a variety of cognitive processes including perceptual attention, associative learning, emotion and motor control. Hence, that a neuronal population represents values does not necessarily imply that the population participates in a choice process. Furthermore, a long-term goal for decision neuroscience is to shed light on the circuit and neural mechanisms underlying economic decisions. An important step in that direction is to assess whether the activity of a particular neuron (or group of neurons) can contribute to explaining choices above and beyond what is expected based on behavioral variables alone. Consider for example a monkey choosing between juices A and B and a neuron whose activity we denote by r. As discussed above, a logistic analysis provides a function linking, for example, the probability of choosing juice B, PB, to the offered quantities qA and qB. The question is whether a better account of choices may be obtained using some function that also includes r. We write such function as

PB(qA,qB,r) (34)

In setting up this problem, one must consider two issues.

First, for any choice task, some population of neurons encodes the actual choice outcome. For example, if the task requires the monkey to indicate its choice with a leftward or rightward saccade, neurons encoding the direction of impending saccades will be excellent predictors of choice. However, calculating PB from their activity would be trivial and not provide any insight into the decision process. Thus one should ensure that the activity r included in Eq.34 precedes the decision, either because it takes place before a decision can be made or because it is from neurons that encode some pre-decision variable.

Second, for any choice task, there are typically neurons encoding the values of individual offers. In particular, when monkeys choose between juices A and B (Fig.7A), two groups of cells in the orbitofrontal cortex (OFC) encode the two offer values. Their activity is linearly related to quantities qA and qB. Thus on any given trial, the activity of a cell encoding the offer value B may be written as

rB=c0+c1qB+δrB (35)

where c0 and c1 are the parameters of the linear relation and δrB is the trial-by-trial stochastic variability in the cell activity, also referred to as neuronal noise. An equivalent expression may be written for cells encoding the offer value A. An interesting exercise consists in predicting choices based only on firing rates rA and rB. If Eq.35 is accurate and δrB is not too large, one can calculate PB with relatively good accuracy (Knutson et al., 2007; Webb et al., 2019). However, the goal pursued in this section is more ambitious, as we want to assess whether measures of neuronal activity can contribute to predicting the choice made on any given trial above and beyond what would be predicted based on the offers alone. Thus for the neuron in Eq.35, the question effectively is whether δrB can contribute to predicting choices.

Figure 7.

Figure 7.

Including neural activity in logistic analyses. A. Experimental design. B. Offer value cells, activity profiles. The analysis focused on offer types where choices were split between the two juices. For each offer type, trials were divided depending on the animal’s choice (E chosen; O chosen) and the activity was averaged separately for the two groups of trials. The resulting traces were averaged across offer types for each cell and then across cells. Neurons with positive and negative encoding were pooled, inverting the E and O for cells with negative encoding. The time window of interest (500 ms following the offer) is highlighted. C. ROC analysis. For each cell, the difference between the light blue and light red traces was quantified with an ROC analysis, which provided a choice probability (CP). The histogram illustrates the distribution of CPs obtained across the population. D. Logistic analysis of residuals. For each cell, we conducted a logistic analysis of the residuals (Eq.36). For each neuron, the neuronal activity explained choices with a marginal power (or neuronal bias) equal to μ=a2^/a1^. The histogram illustrates the distribution of μ measured across the population. E. Chosen juice cells, activity profiles. The analysis included all trials. Offer types were divided depending on the animal’s choice (E chosen; O chosen) and on whether the animal consistently chose the same juice (easy) or split choices between juices (split). For each group of trials, the activity profiles were averaged across trials for each cell and then across cells. The time window of interest (500 ms preceding the offer) is highlighted. F. ROC analysis. For each cell, the difference between the light blue (E chosen, split) and light red (O chosen, split) traces was quantified with an ROC analysis. The histogram illustrates the distribution of CPs obtained across the population. G. logistic analysis of residuals. For each cell, we conducted a logistic analysis of the residuals (Eq.37). The neuronal bias was equal to v=a2^/a1^. The histogram illustrates the distribution of ν measured across the population. Panels B-E were adapted from Padoa-Schioppa (2013).

When values are close to indifference, choices are split between the two offers (choice variability). The standard way to quantify the relation between neuronal variability and choice variability is based on an ROC analysis, which provides a choice probability (Britten et al., 1992). However, the question of interest here may be addressed more precisely using logistic regressions. This approach was developed to analyze neuronal activity in the lateral intraparietal area during a probabilistic inference task (Yang and Shadlen, 2007). More recently, it was used to examine neuronal activity in OFC during economic choices (Padoa-Schioppa, 2013; Masset et al., 2020). We will discuss two example analyses.

Consider the experiment in Fig.7. Neuronal recordings in monkeys performing this task have identified in the OFC different groups of neurons encoding individual offer values (offer value A or offer value B), the binary choice outcome (chosen juice) and the chosen value (Padoa-Schioppa and Assad, 2006). The first question focuses on offer value cells. Their activity recorded in a 0.5 s time window starting with the offer onset is described by Eq.35. Each neuron encodes the value of one juice; we refer to the encoded juice as juice E and to the other juice as juice O. To assess whether the activity of these cells can significantly contribute to predicting choices, we construct the following logistic model:

PE=11+eX
X=a0+a1 log(qE/qO)+a2 δrE (36)

where δrE is the residual activity remaining after the firing rate is regressed against qE. After the logistic regression, one can compute μ=a2^/a1^, which we might term neuronal bias. This analysis can be repeated for each offer value cell in the data set, and one can compute the distribution for the neuronal bias μ across the population. For the experiment in Fig.7, an analysis of 229 offer value cells indicated mean(μ) = 0.001 (p=0.12, t-test; Fig.7C). Notably, this measure is in the expected direction, but rather small. In this respect, it is worth noting that, similar to choice probabilities obtained from ROC analyses, neuronal biases are closely related to noise correlations. That is, if the relevant population is sufficient large, one should expect neuronal biases to be close to zero unless trial-to-trial stochastic fluctuations of different cells are correlated (Haefner et al., 2013). Interestingly, noise correlations in OFC are substantially smaller than in many other brain regions, which explains small neuronal biases (Conen and Padoa-Schioppa, 2015).

The second analysis focuses on chosen juice cells. Several sources of evidence suggest that different groups of neurons identified in OFC constitute the building blocks of a neural circuit in which decisions are formed. Moreover, this circuit may be modeled as a neural network and described by a dynamic system (Rustichini and Padoa-Schioppa, 2015). In this view, chosen juice cells represent the output layer of the network. The activity of these neurons at the end of the decision process is binary – high when the animal chooses one juice and low when it chooses the other juice. We refer to the encoded juice (eliciting high activity) as juice E and to the other juice as juice O. The analysis focuses on the activity of these neurons recorded in a 0.5 s time window before the offers appear on the monitor. This pre-offer activity captures the initial conditions of the dynamic system, which can substantially influence the network dynamics. Preliminary observations indicated that this pre-offer activity correlates with the eventual choice outcome. To quantify this effect we construct the following logistic model:

PE=11+eX
X=a0+a1log(qE/q0)+a2rE (37)

where rE is the pre-offer activity of chosen juice cells. After the logistic regression, one can compute v=a2^/a1^, which we might term initial bias. This analysis may be repeated for each chosen juice cell in the data set, and one can compute distribution for the initial bias ν. For the experiment in Fig.7, an analysis of 257 chosen juice cells indicated that mean(ν) was significantly above zero (Fig.7C) (Padoa-Schioppa, 2013).

Probit model and the interpretation of choice variability

We now return to the case of simple choices between goods that vary in quantity (Fig.1). The analysis discussed in the first section and summarized in Eqs.45 is referred to as logistic or “logit”. An alternative approach is to model the sigmoid with a different function, specifically with the cumulative function of the standard normal distribution. To do so, we replace Eqs.45 with:

PB=12πXet2/2dt (38)
X=a0+a1 log(qB/qA) (39)

Eqs.3839 are referred to as a “probit” model. As for the logit model, parameters a0 and a1 are fitted using maximum likelihood (Eq.9). Several neuroscience studies have used probit models (Padoa-Schioppa and Assad, 2006; Ballesta et al., 2020). In practice, the numerical values obtained for a0^ and a1^ with the logit and with the probit models are very similar. For example, for the session in Fig.1, the two analyses yielded ρlogit = 2.42 and ρprobit = 2.40. Thus the two models can often be used interchangeably.

The main advantage of the probit model is that it provides an alternative interpretation for choice variability. Referring to Fig.1, the phrase “choice variability” essentially captures the fact that the sigmoid is not infinitely steep – when two offers have unequal but similar value, monkeys split their choices. In the probit model, the sigmoid is a cumulative function, and the underlying normal distribution may be interpreted as a probability distribution for the log relative value. Thus the probit model affords two interpretations for choice variability:

  1. The relative value of the two juices is fixed. Choice variability reflects noise in the process of value comparison

  2. The relative value of the two juices is stochastic. Choice variability reflects trial-to-trial fluctuations in the relative value

Interpretation (a) is that implicitly discussed so far, and it is the most natural for the logit model. Using the probit, the definition of relative value remains unchanged (ρ=ea0^/a1^) and the fact that ρ equals the quantity ratio qB/qA that makes the animal indifferent between the two juices remains true. Similarly, the definition of sigmoid steepness η=a1^ is unchanged, and the fact that η is proportional to the slope of the sigmoid at the indifference point remains true.

Interpretation (b) is conceptually quite different. In this view, the process of value comparison (i.e., the decision per se) is noiseless and may be described by a step function. In other words, whenever the quantity ratio qB/qA is higher than the relative value ρ, the animal chooses juice B faultlessly. However, the relative value ρ is not fixed – it varies stochastically from trial to trial (Fig.8A). More precisely, according to (b), the log relative value is a stochastic variable with a normal distribution that has mean(log ρ)=a0^/a1^ and std(log ρ)=1/a1^. In formulas:

D(log ρ)=N(a0^/a1^,1/a1^) (40)

A mathematical proof of Eq.40 is provided in the Supplementary Material.

Figure 8.

Figure 8.

Hypothetical choice patterns. A. Probit analysis and stochastic values. The black sigmoid is the choice pattern fitted with a probit function – that is, the probability that the animal choose juice B as a function of the log quantity ratio. The underlying Gaussian, shown in light red, can be interpreted as a probability distribution for the log relative value. B. Choice pattern with a double step. The choice pattern (black) has a double step. The underlying probability distribution function (light red) is bimodal. Such choice pattern suggests the presence of a systematic choice bias, similar to that revealed in Fig.4. B. Choice pattern with lapses. This choice pattern has a sigmoidal shape but it does not saturate. That is, choices are split between the two options even when one of the offers is strongly dominant. In the cartoon, we set λ = 0.3. The dotted line indicates PB = 1. D. Non-monotonic choice pattern. This pattern cannot be interpreted as the cumulative function of a probability distribution.

Intuitively, both interpretations seem to capture something real. On the one hand, the subjective value of any good may well vary on a short time scale due to noise in value computation, recent history, or some other reason (b). On the other hand, like any mental function, the comparison of values is presumably a noisy process (a). In other words, multiple factors likely contribute to choice variability. That said, the economics literature provides an important lesson: teasing apart different contributions to choice variability is very difficult based on choices alone. In other words, both (a) and (b) are valid interpretations and one cannot adjudicate between them based on choices. However, one might be able to disambiguate between (a) and (b) – and, more generally, to identify different sources of choice variability – based on neuronal measures. In fact, doing so is arguably one of the ultimate goals of research in decision neuroscience.

Limits and failures of logistic analysis

As described in previous sections, logistic analysis can effectively model choices under a broad range of experimental protocols, and quantify a variety of behavioral effects. However, not all choices can be described using a logistic analysis. This section discusses a few examples.

For choices between two goods, logistic analysis is valid only when the choice pattern is reasonably well captured by some sigmoidal function. Fig.8B illustrates with a cartoon a situation where this is not the case. In this hypothetical session, a monkey chooses between two juices offered in variable amounts (Fig.1A). The choice pattern has a double step – for a sizeable range of quantity ratios, the animal splits its choices evenly between the two juices. If the distance between the two steps is large compared to the width of each step, any sigmoidal fit will miss an important aspect of this choice pattern. In practice, Fig.8B suggests the presence of some bias that could potentially be modeled with a logit. Indeed, the double step function may be thought of as the average of two independent sigmoid functions. If one can identify some factor that determines whether a particular trial contributes to one sigmoid function or the other, one may add that factor to the logistic regression and thus account for the whole data set. We have already encountered such a situation in Fig.4, where choices were described by two sigmoids. In that case, the factor separating trials was the order in which the two juices were offered, and the distance between the two sigmoids was thus interpreted as an order bias. In other cases, the relevant factor may be the spatial configuration (spatial bias), the outcome of the previous trial (choice hysteresis), etc.

Another interesting case is depicted in Fig.8C. Here the choice pattern is well described by a sigmoid, but choices do not saturate. In other words, even when the two values are very different, the animal chooses the lesser option in some fraction of trials. In perceptual decisions, such choices are termed “lapses” (Busse et al., 2011; Carandini and Churchland, 2013; Pisupati et al., 2021). In general, lapses can be considered errors, but they may also reflect a tendency towards exploration (Pisupati et al., 2021; Wilson et al., 2021). If lapses are present and symmetric (i.e., if they occur equally on the two ends of the sigmoid), choices may be modeled using the following sigmoid function in lieu of the logit:

PB=12+1a2π atan(a0+a1 log(qB/qA)) (41)

where atan indicates the inverse tangent and π = 3.141… Notably, while log(qB/qA) varies in the range (−∞, ∞), PB varies in the range (a2/2, 1 − a2/2). Parameters a0, a1 and a2 can be fitted using maximum likelihood (Eq.9). One can thus compute the parameters characterizing choices, namely the relative value ρ=ea0^/a1^, the inverse temperature η=a1^, and the lapse rate λ=a2^. In this formulation, the lapse rate can be interpreted as the fraction of trials in which the animal chooses randomly between the two offers. As usual, ρ is the quantity ratio qB/qA that makes the animal indifferent between the two offers. Importantly, here both the inverse temperature and the (negative) lapse rate contribute to choice accuracy. This point is intuitive (if there are fewer lapses, choices are more accurate); it may also be confirmed by computing the sigmoid slope at the indifference point:

dPBd(log(qB/qA))|X=0=1π(1λ)η (42)

Fig.8C illustrates a more dramatic situation, where the choice pattern is non monotonic. To picture how such a choice pattern might occur, imagine a judicious child who is given a $1 coin by her parents to go and buy gummy bears. Once at the candy store, the child talks with the vendor. In scenario (1), the vendor says that for $1 he can offer only 1 gummy bear; recognizing the bad deal, the child keeps her money and walks away. In scenario (2), the vendor offers 5 gummy bears; after some hesitation, the child decides to flip her coin and to choose accordingly – in other words, she will take the gummy bears with probability p=0.5. In scenario (3), the vendor offers 10 gummy bears; in this case, the child takes the bears without hesitation. Finally, in scenario (4), the vendor offers 100 gummy bears; that offer seems overwhelming and not quite right, and the child chooses to keep the money and walks back to her parents. Those of us with kids may find the story somewhat unlikely, but not completely implausible. Most relevant here, it would be impossible to model the child’s choices with a logistic analysis.

Lastly, logistic analysis can sometimes fail when subjects choose between multiple options. For example, consider the case of trinary choices. The logistic model of Eq.23 assumes that choices are “independent of irrelevant alternatives” (IIA) (Luce, 1959; McFadden, 1974). In other words, given two offers A and B, the probability ratio PA/PB should not depend on the presence or value of a third option C. If the value of C is low, the sum of PA + PB will be close to 1. If the value of C is larger, both PA and PB will be smaller, but the ratio PA/PB should not change. To confirm that IIA holds in Eq.23, note that the ratio PA/PB depends on VA and VB, but not on VC. Conversely, if IIA is violated, it would be impossible to write a logistic model such as that in Eq.23, where each VJ only depends on the properties of offer J. Intuitively, assuming that choices are IIA seems to make sense. However, a number of studies have shown that human choices violate IIA in various ways (Tversky and Simonson, 1993; Soltani et al., 2012; Spektor et al., 2021). These phenomena, often referred to as “decoy effects”, cannot be easily modeled with logistic analysis.

Conclusions

Goods available for choice can vary on a number of dimensions, including quality (e.g., the juice type), quantity, probability, delivery time, cost, etc. Furthermore, choices may involve three or more options, offers may be presented simultaneously or in sequence, and options may be formed by bundles of different goods. Each of these dimensions affects choices in ways that can be quantified through logistic analysis, provided that certain conditions are met. Logistic regressions can also quantify correlations between neuronal activity and choices, providing a valid alternative to ROC analyses. Importantly, different logistic models make different assumptions and provide alternative interpretations for the origins of choice variability. For all these reasons, logistic analysis has become and will continue to be a fundamental tool in decision neuroscience.

Supplementary Material

1

Logistic regressions are the primary method to analyze choice behavior in decision neuroscience studies. In this primer, Padoa-Schioppa describes how logistic analysis can quantify a variety of behavioral traits. He also discusses the assumptions and interpretations of different logistic models.

Acknowledgments:

The author thanks P. Natenzon for helpful discussions and W. Shi and J. Park for comments on the manuscript. Experimental data included in this article were collected by H. Schoknecht, W. Shi, J. Tu and J. Xie. Research in the author’s laboratory is supported by the National Institutes of Health (grants number R01-DA032758 and R01-MH104494 to CPS.

Footnotes

Publisher's Disclaimer: This is a PDF file of an unedited manuscript that has been accepted for publication. As a service to our customers we are providing this early version of the manuscript. The manuscript will undergo copyediting, typesetting, and review of the resulting proof before it is published in its final form. Please note that during the production process errors may be discovered which could affect the content, and all legal disclaimers that apply to the journal pertain.

Financial interests or conflicts of interest: None

References

  1. Alos-Ferrer C, Hugelschafer S, Li J (2016) Inertia and decision making. Front Psychol 7:169. [DOI] [PMC free article] [PubMed] [Google Scholar]
  2. Amemori K, Graybiel AM (2012) Localized microstimulation of primate pregenual cingulate cortex induces negative decision-making. Nat Neurosci 15:776–785. [DOI] [PMC free article] [PubMed] [Google Scholar]
  3. Ballesta S, Padoa-Schioppa C (2019) Economic decisions through circuit inhibition. Curr Biol 29:3814–3824 e3815. [DOI] [PMC free article] [PubMed] [Google Scholar]
  4. Ballesta S, Shi W, Conen KE, Padoa-Schioppa C (2020) Values encoded in orbitofrontal cortex are causally related to economic choices. Nature 588:450–453. [DOI] [PMC free article] [PubMed] [Google Scholar]
  5. Blanchard TC, Hayden BY, Bromberg-Martin ES (2015) Orbitofrontal cortex uses distinct codes for different choice attributes in decisions motivated by curiosity. Neuron 85:602–614. [DOI] [PMC free article] [PubMed] [Google Scholar]
  6. Britten KH, Shadlen MN, Newsome WT, Movshon JA (1992) The analysis of visual motion: a comparison of neuronal and psychophysical performance. J Neurosci 12:4745–4765. [DOI] [PMC free article] [PubMed] [Google Scholar]
  7. Busse L, Ayaz A, Dhruv NT, Katzner S, Saleem AB, Scholvinck ML, Zaharia AD, Carandini M (2011) The detection of visual contrast in the behaving mouse. J Neurosci 31:11351–11361. [DOI] [PMC free article] [PubMed] [Google Scholar]
  8. Cai X, Padoa-Schioppa C (2019) Neuronal evidence for good-based economic decisions under variable action costs. Nat Commun 10:393. [DOI] [PMC free article] [PubMed] [Google Scholar]
  9. Carandini M, Churchland AK (2013) Probing perceptual decisions in rodents. Nat Neurosci 16:824–831. [DOI] [PMC free article] [PubMed] [Google Scholar]
  10. Conen KE, Padoa-Schioppa C (2015) Neuronal variability in orbitofrontal cortex during economic decisions. J Neurophysiol 114:1367–1381. [DOI] [PMC free article] [PubMed] [Google Scholar]
  11. Constantinople CM, Piet AT, Bibawi P, Akrami A, Kopec C, Brody CD (2019) Lateral orbitofrontal cortex promotes trial-by-trial learning of risky, but not spatial, biases. Elife 8. [DOI] [PMC free article] [PubMed] [Google Scholar]
  12. Cramer JS (2002) The origins of logistic regression. In: Tinbergen Institute Working Paper No. 2002–119/4. [Google Scholar]
  13. Ferrari-Toniolo S, Bujold PM, Grabenhorst F, Baez-Mendoza R, Schultz W (2021) Nonhuman primates satisfy utility maximization in compliance with the continuity axiom of expected utility theory. J Neurosci 41:2964–2979. [DOI] [PMC free article] [PubMed] [Google Scholar]
  14. FitzGerald TH, Seymour B, Dolan RJ (2009) The role of human orbitofrontal cortex in value comparison for incommensurable objects. J Neurosci 29:8388–8395. [DOI] [PMC free article] [PubMed] [Google Scholar]
  15. Haefner RM, Gerwinn S, Macke JH, Bethge M (2013) Inferring decoding strategies from choice probabilities in the presence of correlated variability. Nat Neurosci 16:235–242. [DOI] [PubMed] [Google Scholar]
  16. Heilbronner SR, Hayden BY (2013) Contextual factors explain risk-seeking preferences in rhesus monkeys. Front Neurosci 7:7. [DOI] [PMC free article] [PubMed] [Google Scholar]
  17. Hunt LT, Kolling N, Soltani A, Woolrich MW, Rushworth MFS, Behrens TEJ (2012) Mechanisms underlying cortical activity during value-guided choice. Nat Neurosci 15:470–U169. [DOI] [PMC free article] [PubMed] [Google Scholar]
  18. Kagel JH, Battalio RC, Green L (1995) Economic choice theory: an experimental analysis of animal behavior. Cambridge, UK; New York, NY: Cambridge University Press. [Google Scholar]
  19. Kim S, Hwang J, Lee D (2008) Prefrontal coding of temporally discounted values during intertemporal choice. Neuron 59:161–172. [DOI] [PMC free article] [PubMed] [Google Scholar]
  20. Knutson B, Rick S, Wimmer GE, Prelec D, Loewenstein G (2007) Neural predictors of purchases. Neuron 53:147–156. [DOI] [PMC free article] [PubMed] [Google Scholar]
  21. Krajbich I, Armel C, Rangel A (2010) Visual fixations and the computation and comparison of value in simple choice. Nat Neurosci 13:1292–1298. [DOI] [PubMed] [Google Scholar]
  22. Kreps DM (1990) A course in microeconomic theory. Princeton, NJ: Princeton University Press. [Google Scholar]
  23. Kuwabara M, Kang N, Holy TE, Padoa-Schioppa C (2020) Neural mechanisms of economic choices in mice. Elife 9. [DOI] [PMC free article] [PubMed] [Google Scholar]
  24. Levy I, Snell J, Nelson AJ, Rustichini A, Glimcher PW (2010) Neural representation of subjective value under risk and ambiguity. J Neurophysiol 103:1036–1047. [DOI] [PubMed] [Google Scholar]
  25. Luce RD (1959) Individual choice behavior; a theoretical analysis. New York, NY: Wiley. [Google Scholar]
  26. Masset P, Ott T, Lak A, Hirokawa J, Kepecs A (2020) Behavior- and Modality-General Representation of Confidence in Orbitofrontal Cortex. Cell 182:112–126 e118. [DOI] [PMC free article] [PubMed] [Google Scholar]
  27. McFadden D (1974) Conditional logit analysis of qualitative choice behavior. In: Frontiers in Econometrics (Zarembka P, ed), pp 105–142. New York: Acedemic Press. [Google Scholar]
  28. Padoa-Schioppa C (2013) Neuronal origins of choice variability in economic decisions. Neuron 80:1322–1336. [DOI] [PMC free article] [PubMed] [Google Scholar]
  29. Padoa-Schioppa C, Assad JA (2006) Neurons in orbitofrontal cortex encode economic value. Nature 441:223–226. [DOI] [PMC free article] [PubMed] [Google Scholar]
  30. Pastor-Bernier A, Schultz W (2021) Mechanisms underlying economic revealed preference of multi-component bundles In primate orbitofrontal cortex.2021.2003.2002.433629.
  31. Pastor-Bernier A, Stasiak A, Schultz W (2019) Orbitofrontal signals for two-component choice options comply with indifference curves of revealed preference theory. Nat Commun 10:4885. [DOI] [PMC free article] [PubMed] [Google Scholar]
  32. Pisupati S, Chartarifsky-Lynn L, Khanal A, Churchland AK (2021) Lapses in perceptual decisions reflect exploration. Elife 10. [DOI] [PMC free article] [PubMed] [Google Scholar]
  33. Raghuraman AP, Padoa-Schioppa C (2014) Integration of multiple determinants in the neuronal computation of economic values. J Neurosci 34:11583–11603. [DOI] [PMC free article] [PubMed] [Google Scholar]
  34. Rustichini A, Padoa-Schioppa C (2015) A neuro-computational model of economic decisions. J Neurophysiol 114:1382–1398. [DOI] [PMC free article] [PubMed] [Google Scholar]
  35. Rustichini A, Domenech P, Civai C, deYoung C (2021) Working memory and attention in choice. SSRN:https://ssrn.com/abstract=3964802. [DOI] [PMC free article] [PubMed] [Google Scholar]
  36. Schoemann M, Scherbaum S (2019) Choice history bias in intertemporal choice. In. 10.31234/osf.io/7h9zj. [DOI]
  37. Schultz W (2015) Neuronal reward and decision signals: from theories to data. Physiol Rev 95:853–951. [DOI] [PMC free article] [PubMed] [Google Scholar]
  38. Soltani A, De Martino B, Camerer C (2012) A range-normalization model of context-dependent choice: a new model and evidence. PLoS Comput Biol 8:e1002607. [DOI] [PMC free article] [PubMed] [Google Scholar]
  39. Spektor MS, Bhatia S, Gluth S (2021) The elusiveness of context effects in decision making. Trends Cogn Sci 25:843–854. [DOI] [PubMed] [Google Scholar]
  40. Strait CE, Blanchard TC, Hayden BY (2014) Reward value comparison via mutual inhibition in ventromedial prefrontal cortex. Neuron 82:1357–1366. [DOI] [PMC free article] [PubMed] [Google Scholar]
  41. Tversky A, Simonson I (1993) Context-dependent preferences. Management Sciences 39:117–185. [Google Scholar]
  42. Wallis JD, Kennerley SW (2010) Heterogeneous reward signals in prefrontal cortex. Curr Opin Neurobiol 20:191–198. [DOI] [PMC free article] [PubMed] [Google Scholar]
  43. Webb R, Levy I, Lazzaro SC, Rutledge RB, Glimcher PW (2019) Neural random utility: relating cardinal neural observables to stochastic choice behavior. Journal of Neuroscience, Economics, and Psychology 12:45–72. [Google Scholar]
  44. Wilson RC, Bonawitz E, Costa VD, Ebitz RB (2021) Balancing exploration and exploitation with information and randomization. Current opinion in behavioral sciences 38:49–56. [DOI] [PMC free article] [PubMed] [Google Scholar]
  45. Xie J, Padoa-Schioppa C Neuronal activity in orbitofrontal cortex during choice between juice bundles. In preparation.
  46. Yamada H, Tymula A, Louie K, Glimcher PW (2013) Thirst-dependent risk preferences in monkeys identify a primitive form of wealth. Proc Natl Acad Sci U S A 110:15788–15793. [DOI] [PMC free article] [PubMed] [Google Scholar]
  47. Yang T, Shadlen MN (2007) Probabilistic reasoning by neurons. Nature 447:1075–1080. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1

RESOURCES