Skip to main content
NIHPA Author Manuscripts logoLink to NIHPA Author Manuscripts
. Author manuscript; available in PMC: 2015 Jul 7.
Published in final edited form as: Curr Biol. 2014 Jun 19;24(13):1542–1547. doi: 10.1016/j.cub.2014.05.049

Dynamics of neural population responses in prefrontal cortex indicate changes of mind on single trials

Roozbeh Kiani 1,2,*, Christopher J Cueva 2,*, John B Reppas 2, William T Newsome 2,3
PMCID: PMC4191655  NIHMSID: NIHMS612741  PMID: 24954050

Summary

Decision-making is a complex process in which different sources of information are combined into a decision variable (DV) that guides action [1, 2]. Neurophysiological studies have typically sought insight into the dynamics of the decision-making process and its neural mechanisms through statistical analysis of large numbers of trials from sequentially recorded single neurons or small groups of neurons [3-6]. However, detecting and analyzing the DV on individual trials has been challenging [7]. Here we show that by recording simultaneously from hundreds of units in pre-arcuate gyrus of macaque monkeys performing a direction discrimination task, we can predict the monkey’s choices with high accuracy and decode DV dynamically as the decision unfolds on individual trials. This advance enabled us to study changes-of-mind (CoM’s) that occasionally happen before the final commitment to a decision [8-10]. On individual trials, the decoded DV varied significantly over time and occasionally changed its sign, identifying a potential CoM. Interrogating the system by random stopping of the decision-making process during the delay period after stimulus presentation confirmed the validity of identified CoM’s. Importantly, the properties of the candidate CoM’s also conformed to expectations based on prior theoretical and behavioral studies [8]: they were more likely to go from an incorrect to a correct choice; they were more likely for weak and intermediate stimuli than for strong stimuli; and they were more likely earlier in the trial. We suggest that simultaneous recording of large neural populations provides a good estimate of DV and explains idiosyncratic aspects of the decision-making process that were inaccessible before.

Results

Psychophysical studies of the decision-making process in various contexts suggest an underlying neural mechanism based on integration of evidence towards a decision criterion [11-17]. Supporting evidence for this mechanism has emerged from electrophysiological studies of the parietal cortex, frontal cortex, basal ganglia, and superior colliculus of monkeys performing simple perceptual decisions [3, 5, 18-22]. More recently magnetoencephalography, electroencephalography, and functional magnetic resonance imaging studies have revealed homologue mechanisms in the human brain [23-26].

Although, these studies have significantly advanced our understanding of the decision-making process they have mainly relied on statistical analyses across trials because of the stochastic nature of spiking activity at the single neuron level. Yet tracking the evolution of the DV on single trials, and relating fluctuations in the DV to internal cognitive states and overt behavior, are critical for incisive tests of current models of decision-making. Recent advances in multi-electrode recording promise to break this barrier through measurement and analysis of the underlying neural population responses on single trials. So far, this ability has been mainly used in the field of neural prosthetics where accurate, real-time decoding of neural population responses is necessary for guidance of motor prosthetic devices [e.g. 27, 28]. However, similar techniques can also be used to advance our understanding of cognitive processes, especially decision-making [7, 29].

We used 96-channel multi-electrode arrays to record from neural populations in area 8Ar of the prearcuate gyrus of two macaque monkeys while they performed a direction discrimination task [30, 31] (Fig. 1A). On each trial the monkey viewed a patch of randomly moving dots for 800 ms. After a delay period of variable length the monkey received the Go cue and reported the perceived motion direction by making a saccadic eye movement to one of the two available targets (T1 and T2). The multi-electrode array covered 4 mm × 4 mm of the cortical surface (Fig. 1B) and enabled us to record simultaneously from hundreds of single- and multi-neuron units in a significant portion of the prearcuate gyrus. Compatible with previous studies, many units showed differential activity for the two choices during the motion viewing and delay periods [20, 32] in addition to the peri-saccadic period [33] (Fig. 1C).

Figure 1.

Figure 1

A) Behavioral task. The monkey views 800 ms of random dots motion while maintaining gaze on a central fixation point. The strength and direction of motion varied randomly from trial to trial. After a variable delay period the monkey received the Go signal (fixation point disappeared) and reported the perceived motion direction with a saccadic eye movement to one of two visual targets. Correct responses were rewarded with juice after a short hold period. B) Two macaque monkeys were implanted with multi-channel electrode arrays in the prearcuate gyrus, which is marked with a blue box on the lateral surface of a typical macaque brain (top; University of Wisconsin Brain Collection). The exact location of each array with respect to the arcuate (as) and principal (ps) sulci is shown for each monkey (bottom). The white squares on the array show the locations of the ground pins. The portions of the principal and arcuate sulci that were visible in the craniotomy are indicated with black lines. Dashed segments at the end of a sulcus indicate that the sulcus extended in that direction beyond our window of visibility. C) Average responses of two example prearcuate units for correct ipsi- and contralateral choices. The units were recorded from the same electrode in the same session but had different motion and saccade selectivities. Shading indicates mean±s.e.m.

To explore the efficacy of simultaneous, high-density recording for analyzing dynamics of the decision-making process, we trained a logistic classifier to predict the monkey’s upcoming choice based on neural population responses at successive times during individual trials (100 ms sliding window; see Experimental Procedures). The classifier finds a set of linear weights (w) on the population neural responses (r) that maximizes the probability of correctly predicting the choice. Although it is possible to improve the prediction accuracy by adopting more sophisticated nonlinear models, in this report we adhere to the linear model for its simplicity and biological plausibility. Other models yielded qualitatively similar results. Figure 2A shows the cross-validated accuracy (see Methods) of our model, averaged across sessions. Model prediction accuracy is near chance at the beginning of the trial but rises quickly ~200 ms after motion onset, reaching perfection just before the saccade.

Figure 2.

Figure 2

The recorded neural populations afford high accuracy prediction of the monkeys’ choices. A) Prediction accuracy of the recorded population for the monkey’s choice. A logistic model was fit to 90% of the trials in each session and used to predict the choice in the remaining 10%, in sliding 100 ms bins. The dark trace and shading indicate mean±s.e.m across the sessions. The horizontal black bar indicates the time window used in the analyses in B and C. B) The neural population classifier is a better predictor of the monkey’s choice than the average of the individual units recorded in a session. The probability densities of population and individual unit choice prediction accuracies are highly distinct. Prediction accuracies were calculated for a 100 ms window immediately before the Go cue. C) Comparison of population choice prediction accuracy with the mean (left) and best (right) individual units. Each point represents one data collection session. Even the best individual units are inferior to the population.

Across 15 data sets, the prediction accuracy of the population responses was much higher than that of the average single unit (Fig. 2B). In the 100 ms window immediately before the Go cue, the cross-validated accuracy of the population prediction was close to 0.9 on average (mean±s.e.m, 0.86±0.01), while the average accuracy of individual units was barely above chance (mean±s.e.m, 0.557±0.001) and was significantly smaller than the population (t-test, p<10−8). More importantly, not only was population performance better than the average single unit; it was consistently superior to the best unit recorded in each session (Fig. 2C; t-test, p=3×10−5). The population was superior to individual units in other time epochs as well (data not shown).

The increased prediction accuracy afforded by multi-electrode recording enabled more accurate tracking of the decision variable over time. In essence, our logistic regression finds the best hyperplane that separates the population response patterns associated with the two choices. The population response pattern at each moment can be envisioned as a point in a high-dimensional space whose axes are the firing rates of individual units. The distance of this point from the discriminant hyperplane (wTr) represents the strength of model’s prediction: a small distance corresponds to low certainty about the monkey’s upcoming choice and a large distance corresponds to high certainty. We call this distance ‘model decision variable’. The model DV provides an estimate of the monkey’s internal DV, especially when the model prediction accuracy is high. Averaged across trials the model DV gradually increased from zero to large values (positive and negative values corresponded to T1 and T2 predictions respectively) (Fig. 3A). The rate of this rise depended on the strength of motion, especially during the motion-viewing period (Fig. 3A inset): DV increased more rapidly for stronger stimuli, compatible with previous observations in parietal and prefrontal neurons where the rate of change of neural responses depends on the strength of sensory evidence [3, 6, 20, 34, 35].

Figure 3.

Figure 3

The model decision variable indicates accumulation of evidence over time; intra-trial changes in the sign of the DV identify candidate changes-of-mind. A logistic regression (Eq. 1) was used to find the best hyperplane that separates the population response patterns corresponding to the two choices. The distance of the population response pattern from this discriminant hyperplane represents the model belief about the upcoming choice. We call this distance the model decision variable, or DV. A) Average decision variable across the sessions. Insert shows that the buildup of the decision variable during the motion-viewing period depends on stimulus strength (H: motion strength ≥ 20%; L: motion strength ≤6 %). The dark traces and shading represent mean±s.e.m. B) Two sample trials in which the model DV built up to a positive (solid) or negative (dashed) value and maintained its sign throughout the trial. The trials ended with T1 and T2 choices, respectively. C) Two sample trials in which the sign of the model DV flipped during the delay period, indicating a change of predicted choice based on the model. Arrows indicate the time of flip.

Similarly, on individual trials, the DV fluctuated around the discriminant hyperplane at the beginning of the motion-viewing period but gradually moved farther from the hyperplane over time. After gaining an initial distance, the population-based DV typically stayed on one side of the hyperplane during the late motion-viewing period and the ensuing delay (Fig. 3B, two example trials). On a minority of trials, however, the population response crossed from one side of the hyperplane to the other side during the trial, signaling a shift in the predicted choice from one target to the other (Fig. 3C, two example trials).

An intriguing possibility is that these changes in DV sign, calculated from the neural population response, identify changes of mind (CoM) that occur in human and animal observers as they make choices on the basis of variable evidence [7-10]. Alternatively, however, changes in sign of the DV might simply reflect noise from any number of sources that are irrelevant to performance on the task.

We conducted four analyses to test whether observed variations in the DV reflect, at least in part, genuine changes of mind. The first analysis concerns the reliability of DV fluctuations for monitoring the momentary ‘decision state’ of the system as time passes during long-duration trials. The remaining three analyses assess whether intra-trial DV sign changes conform to change-of-mind properties that are predicted by decision-making models and empirically observed in humans and monkeys performing a similar decision-making task.

Prediction of choice is reliable throughout long delay periods

The initial important question is whether intra-trial sign changes in the DV, like those illustrated in Figure 3C, are simply noise, or whether they accurately reflect moment-to-moment variation in the ‘decision state’ of the system—the decision that would be made if the trial were to end now? We addressed this question by separately analyzing predictive accuracy of the population activity for trials of different delay period duration. During each experiment, the duration of the delay period was varied randomly among several preset values (see Experimental Procedures). We grouped the trials into quintiles based on the length of the delay period. For each quintile we then calculated the hyperplane that best separated the T1 and T2 choice trials from the pattern of neural population response in the 100 msec immediately before the Go cue. For all quintiles the model achieved high cross-validated accuracies for predicting the monkey’s choice on individual trials (Fig. 4A; 0.76±0.02 for the shortest delays to 0.87±0.02 for the longest delay). The results were not critically dependent on using a separate model for each quintile; similar results were obtained when a single model was used for all quintiles (Fig. S1). Although predictive accuracy improved modestly for longer delay periods, it was far above chance even for the shortest delays. Thus the sign of the DV during the delay period is a good predictor of choice, even for the shortest duration trials.

Figure 4.

Figure 4

The sign of the model DV reliably reflects the incipient decision throughout the delay period. Intra-trial changes in the sign of the DV conform to CoM properties expected from theoretical and behavioral studies. A) Cross-validated prediction accuracy of the monkey’s choice for different delay durations. Trials were grouped into quintiles based on delay period duration. Accurate choice prediction on short duration trials confirms that variation in DV reflects the monkey’s decision state at early points in long duration trials. Thus, intra-trial changes in DV sign are consistent with CoM as opposed to extraneous noise (see Results). B) Probability of switching from one predicted choice to another in consecutive delay quintiles declines for stronger motion (C1, motion strength ≤6%; C2, 6% <motion strength ≤20%; C3, motion strength >20%). C) Switches in the predicted choice were more likely to rectify an erroneous choice, especially for stronger motion stimuli. D) Switches were more frequent early during delay. Error bars indicate s.e.m.

Importantly, there is no distinction between long and short duration trials prior to the first possible ‘go’ signal—the temporal structure of the trials are identical until that point. Thus predictive accuracy of the DV sign measured prior to the ‘go’ signal on shorter duration trials provides an objective estimate of predictive accuracy on longer duration trials had those trials ended at an earlier point in time. We therefore conclude that DV sign changes on longer duration trials (e.g. Fig. 3C) provide insight into the momentary decision state of the system, reflecting in part choices that would have been made had we terminated the trial earlier.

DV sign changes reflect expected properties of behavioral changes-of-mind

After identifying candidate CoM’s using DV sign changes, we performed three analyses to determine whether candidate CoM’s exhibit properties that are associated with actual CoM’s in behavioral studies and are expected from current models of the decision process [7, 8].

First, CoM’s should happen more frequently for weak- and intermediate- strength motion stimuli than for strong motion stimuli because counter-evidence that elicits a CoM will occur less frequently for stronger stimuli [8]. Figure 4B shows that the predicted trend is indeed present in our data (Eq. 3, β1=−0.23±0.04, p=1.6×10−8).

Second, CoM’s are more likely to steer the decision from an incorrect to a correct option than vice versa because they are based on evidence that is not yet processed during the initial stages of choice formation [8, 9]. On average, incorporation of additional evidence should improve the decision-maker’s accuracy. Consistent with this prediction, we observed that candidate CoM’s derived from our neural population data were more likely to shift the predicted choices from the incorrect target to the correct target (Fig. 4C). The difference in CoM towards correct versus wrong choices was significantly larger than zero (sign test, p<10−8) and grew as a function of motion strength (Eq. 4, β1=0.19±0.07, p=0.009). Thus, CoM’s increased the monkey’s overall reward intake. We also tested whether the monkey’s final choices after CoM’s were more likely to be correct compared to the trials in which CoM’s were not detected. Controlling for motion strength and delay duration, we found no significant difference between the two trial types (Eq. 5, β2=−0.004±0.043, p=0.92). Since CoM’s improved the probability of a correct response, they must have occurred selectively on trials for which the initial decision state was likely to be incorrect, perhaps due to lapses of attention earlier in the trial.

Third, the probability of a CoM should decrease as the monkey waits during the delay period. Due to the finite latency of visual signals, processing the last sensory evidence necessarily occurs during the delay period, and memory processes related to the visual stimulus may influence the final decision as well. As the delay period proceeds, however, a final commitment to a choice is increasingly likely. Consistent with this hypothesis, we observed a monotonic decline in the probability of a CoM with delay period duration (Fig 4D; Eq. 6, β2=−0.21±0.01, p<10−8).

The success of the model for the delay period encouraged us to investigate the properties of candidate CoM’s during the motion-viewing interval as well, even though success was less certain for two reasons. First, because we used a constant stimulus duration in all experiments, we were unable to validate candidate CoM’s during motion viewing using ‘short duration’ trials as described above for the delay period. Second, candidate CoM’s were less reliable during motion viewing as reflected in the lower predictive power of the model during this interval (Fig. 2A). Nevertheless, the properties of candidate CoM’s during the motion-viewing interval were similar to those during the delay period. CoM’s were less frequent for stronger stimuli (Fig. S2A; Eq. 3, β1=−0.064±0.021, p=0.003) and were more likely to change the DV in the direction of a correct judgment (Fig. S2B; Eq. 4, β1=0.71±0.18, p=0.001). The frequency of candidate CoM’s at the beginning of the motion-viewing period was low because the DV is initially near chance and must first build up toward one of the choices before a CoM can be detected reliably (see Experimental Procedures). Thus CoM frequency increased initially but declined after 300 ms of motion viewing (Fig. S2C) due to increased likelihood of commitment to a choice, consistent with the delay period results (Fig. 4D). These results from the motion-viewing period are encouraging but should be interpreted cautiously because of sensitivity of some results (Fig. S2C) to time window sizes (see Experimental Procedures).

Overall, candidate CoM’s identified from high-density recordings of prefrontal cortex conform to all three predictions based on behavioral analysis of changes of mind.

Discussion

Neurophysiological studies have typically sought insight into the dynamics of decision-making and its neural mechanisms through statistical analysis of large numbers of trials from sequentially recorded single neurons or small groups of neurons. Measuring and analyzing the DV on individual trials has been challenging due to technical and conceptual limitations. We sought to overcome these limitations by (i) recording simultaneously from hundreds of units in cortical areas hypothesized to contribute to the decision-making process, (ii) developing simple, efficient algorithms for estimation of the covert DV from neural population responses, and (iii) implementing a new analytic approach to verify the accuracy of the estimated DV fluctuations during the delay period.

The ability to track moment-to-moment variation of the DV enables the study of important aspects of the decision-making process that have been largely inaccessible thus far. As a proof of concept, we focused on changes of mind during peri-threshold judgments of motion direction. Our behavioral task created fertile conditions for CoM’s due to the noisy, temporally extended nature of the visual motion stimulus. Our first new finding is that the covert DV can be accurately tracked during single behavioral trials using the methods introduced in this paper. Crucially, trials with short delay periods verify the accuracy of estimates of the covert DV on longer trials; in essence, short-delay trials act as ‘probes’ of ongoing DV estimates on longer trials [34, 36, 37]. Our second new finding is that changes in the sign of the DV identify candidate CoM’s, and that candidate CoM’s conform to three different predictions based on prior behavioral and modeling studies. Together these results demonstrate the power of high-density neural recordings for single-trial estimates of the fluctuating DV and for detection of covert changes in decision state commonly referred to as changes-of-mind.

CoM’s can arise from various sources: changing sensory evidence [38], correction of an initial confusion about stimulus-response association [9], incorporation of a new decision policy, or retrieval of new information from memory [38]. Our monkeys’ CoM’s may stem from any these sources or others such as simple lapses in attention or effort. Accumulated over time, CoM’s tend to be self-corrections, often benefitting the decision-maker by improving accuracy. By characterizing the neural DV and identifying CoM’s on individual trials we show that monkeys, like humans, exhibit self-correcting behavior.

Past studies have documented cortical and subcortical fluctuations in neural activity that might correspond to changes of mind [7, 29, 39]. Those studies focused on the detection of discrete hidden states in the responses of single neurons [39] or small numbers of simultaneously recorded neurons [7, 29]. The discrete states are defined as specific patterns of spiking across the recorded population. Transitions from one state to another—often formalized by a Hidden Markov Model—can occasionally signal changes of mind. Here we extend those studies by extracting a continuous measure of the DV, characterizing its dynamics, and confirming the inferred CoM’s by interrogating the monkey’s choice at variable times.

The probabilistic characterization of the choice provides an analog estimation of the DV, compatible with the quantitative models of the decision-making process [12, 17, 35, 40]. In those models sources of information are integrated into an analog variable [31] that explains both the choice and reaction time. However, it remains to be seen whether various factors, such as priors and value information, that bear on the decision in those models also modify our estimated DV.

Our model incorporates responses of all neurons recorded on the array in a given experiment. Unlike many classic single neuron studies we did not focus on merely the units that were highly selective for the motion directions or target locations used in the task [20, 41, 42]. We do not know whether all the recorded neurons contributed to the decision-making process. Neither do we know the exact role that they may play in the process. However, the recorded neurons are informative about the monkey’s upcoming action as indicated by the model’s high prediction accuracy. We simply exploit this information to provide a probabilistic glimpse into the DV that supports choice and CoM. Our model is not a mechanistic account of how the DV is constructed or how commitment to a choice is made. Addressing those questions requires further experiments and selective recording and manipulation of neurons based on their response selectivity [e.g., 18, 34, 43, 44, 45].

The advent of techniques for routine, simultaneous recording of tens to hundreds of neurons offers unique opportunities for cognitive neuroscience, particularly for the detection and tracking of cognitive states and processes that occur unpredictably in time and have no overt behavioral correlate. Covert cognitive processes are difficult to monitor in classical neurophysiological studies that require time-locking and across-trial signal averaging, but they are potentially detectable in real time from neural population activity. Even when covert neural processes are detectable, however, their interpretation will depend on creative strategies for behavioral verification. Our study of CoM offers a first step in that direction.

Experimental Procedures

We recorded from populations of neurons in the prearcuate gyrus of two macaque monkeys while they performed a direction discrimination task. All training, surgery, and recording procedures conformed to the National Institutes of Health Guides for the Care and Use of Laboratory Animals and were approved by the Stanford University Animal Care and Use Committee. Due to the similarity of results across the monkeys we pooled the data for all analyses to increase statistical power.

Behavioral task

Figure 1A illustrates the sequence of events on each trial. Each trial began with the appearance of a central fixation point (FP; 0.3° diameter) at the center of the display monitor. The monkey was required to maintain gaze within ±1.5° of FP as long as it was visible on the screen. Eye position was measured with a scleral search coil (CNC Engineering, Seattle, WA), and inappropriate fixation breaks resulted in termination of the trial.

After the monkey fixated on the central fixation point, two targets appeared on the screen (T1 and T2). After a 250 ms delay, a random dot motion stimulus was presented for 800 ms and was followed by a delay period of variable duration. At the end of the delay period the FP disappeared (Go cue), instructing the monkey to report the perceived motion direction by making a saccadic eye movement to one of the targets. For a valid response the monkey’s saccade was required to land within 2-4° of the target (depending on target eccentricity). In 10 of 15 sessions the two targets were positioned on opposite sides of the screen (T1 contralateral, T2 ipsilateral). On the remaining 5 sessions both targets were positioned contralateral to the recorded hemisphere (T1 and T2 were randomly assigned). On each trial, motion direction was either toward T1 or T2. We controlled the difficulty of the trials by varying the percentage of dots moving coherently in the same direction (motion strength or coherence) as described previously [30, 34]. The set of motion strengths for each monkey were chosen to afford a wide range of accuracies, from chance to perfection. For monkey 1, the set included 0%, 2.5%, 5%, 10%, 20%, and 40% coherence. For monkey 2, the set included 0%, 1.2%, 2.4%, 4.8%, 9.6%, 19.2%, and 38.4% coherence in most sessions. For two sessions we tested slightly different sets: 0%, 6%, 12%, 24%, and 48% coherence in one session, and 0%, 1%, 2%, 4%, 8%, 16%, and 32% coherence in the other. The motion direction and strength varied randomly from trial to trial, but were fixed within each trial. The monkey received a liquid reward for choosing the target that corresponded to the motion direction. For 0% motion strength the monkey was rewarded randomly on half of the completed trials irrespective of the direction of the choice.

The variability of the delay period duration plays a crucial role in our analyses because it provides an objective means to test the accuracy of our neuronal model predictions about the subject’s impending choice. Different delay durations were used in various sessions, permitting useful tests of the robustness of our results. In seven sessions the delay duration of each trial was drawn randomly from these five values: 323 ms, 510 ms, 698 ms, 885 ms, and 1073 ms. In six sessions the delay durations were 673 ms, 760 ms, 848 ms, 935 ms, and 1023 ms. In one session they were 473 ms, 560 ms, 648 ms, 735 ms, and 823 ms; and in the last session they were drawn randomly from several discrete values in the range 300-1500 ms. None of our results strongly depend on the exact delay durations used in each session. We therefore present aggregated results across sessions.

The 15 sessions used in this study were chosen based only on the reliability of recordings, number of trials (>1000), and number of recorded units (>150). The dataset consists of 20938 trials and 3257 units.

Neural recording

The monkeys were implanted with 96-channel microelectrode arrays (electrode length=1.5 mm; spacing=0.4 mm; Blackrock Microsystems, Salt Lake City, UT) in the prearcuate gyrus (Fig. 1B). Neural spike waveforms were saved online (sampling rate, 30 kHz) and sorted offline (Plexon Inc., Dallas, TX). To improve the quality of sorting we used customized algorithms to remove recording artifacts that were registered by a large number of electrodes. Also, we merged redundant spike waveform clusters based on waveform shapes, firing rates and inter-spike intervals. We identified 100-250 single- and multi-units in each session (median=219). Throughout the paper we use the term ‘units’ to refer to both isolated single neurons and multi-units. All units were retained in our analyses irrespective of their selectivity.

Data analysis

We quantified the time-varying firing rate of each unit on each trial by counting the number of spikes in a sliding 100 ms window (step size=20 ms). We used an L1-regularized logistic regression [46] to predict the monkey’s choice based on neural responses on each trial:

Logit[Pt(T1)]=β0(t)+i=1nβi(t)ri(t) Eq. 1

where ri (t) is the firing rate of unit i at time t , n is the number of recorded units and the β coefficients are model parameters. L1-regularization imposes a constraint on the L1-norm of the coefficients to avoid overfitting. The model was cross-validated by using 90% of trials in each session as ‘training set’ for fitting the parameters and the remaining 10% as ‘test set’ to measure the model’s prediction accuracy. The best regularization parameter of the model was found by a 10-fold cross-validation within the training set. At any moment in time the model predicts a T1 choice if Logit [P(T1)] > 0 and a T2 choice if Logit [P(T1)] < 0. The model prediction is correct if it matches the monkey’s actual choice at the end of the trial. Fig. 2A shows the model prediction accuracy averaged across sessions. We used a similar logistic regression to gauge the prediction accuracy of individual units:

Logit[Pt,i(T1)]=β0(t)+β1(t)ri(t) Eq. 2

To compare individual units with the population (Fig. 2B-C). we analyzed data from the 100 ms window immediately before the Go cue.

The logistic regression of Eq. 1 essentially finds the hyperplane that best separates the population response patterns corresponding to the two choices. The population response pattern at each moment can be envisioned as a point in a high-dimensional space whose axes are the firing rates of individual units. The distance of this point from the discriminant hyperplane (the right-hand side of Eq. 1) represents the model belief about the upcoming choice (Logit [Pt(T1)], the left-hand side of Eq. 1). A small distance corresponds to a weak belief and a large distance corresponds to a strong belief. We call this distance the ‘model decision variable’ (DV) and use the changes in the sign of the DV to identify candidate ‘changes of mind’ (CoM) for the monkey (Fig. 3C). This approximation is valid especially where the model predicts the monkey’s choice accurately.

Our criterion for detecting possible CoM—a simply sign change in the DV—is lenient. It ensures that actual CoM’s are not missed, but it can include spurious CoM’s due to neural noise or the imperfection of the logistic classifier. We have explored more stringent and complex criteria. For example, a change of mind can be defined as a swing from high certainty for one choice (e.g. large positive DV) to another (e.g. large negative DV). These more stringent criteria have additional degrees of freedom (e.g., criterion on the magnitude of DV) and can be fine tuned to reduce the number of detected CoM’s. We explored this space, but the results reported in this paper are not critically dependent on the choice of criteria. For all analyses in this paper, therefore, we employed the simple sign change criterion to avoid additional degrees of freedom and fine-tuning of results.

To determine whether putative neural CoM’s conformed to properties expected from prior studies of behavioral changes of mind [8, 9], we analyzed neural CoM statistics during the delay period (after the dots and before the Go cue) when the model prediction accuracy was above 75% (Figs. 2A and 4A). We first divided the trials of each session into quintiles based on the length of the delay period. As explained above, in 14 of the 15 sessions the delay period was chosen randomly from five distinct values. Therefore the trials in each quintile of a session had identical delays and our grouping did not create spurious boundaries or mix trials with variable delay durations. In contrast to the “sliding window” analysis described above (e.g. Fig. 3), here we calculated the predicted decision (sign of the DV) independently for each delay period quintile. We retrained the logistic model (Eq. 1) separately for the trials in each quintile using the last 100 ms of their delay period. For the sessions where the time difference between consecutive quintiles was less than 100 ms, we slightly reduced the size of the window for the calculation of firing rates (87 ms) to ensure no overlap between the quintile models. We then calculated the number of neural CoM’s observed on each trial. For trials with the shortest delay periods (first quintile) it was impossible (by definition) to detect a CoM since only a single DV was calculated. For trials of the second quintile, a single opportunity for a CoM occurred, two opportunities for trials of the third quintile, and so forth. Importantly, the high overall prediction accuracy of the model (>75% correct) for trials of the shortest duration (first quintile) creates confidence that the DV calculated within each quintile of longer trials accurately estimates the choice the animal would have made had we stopped those trials earlier. It is thus reasonable to consider neural CoM’s detected on long trials to be real changes in the decision state of the animal even though we have no direct behavioral readout of the hidden decision state on these trials. Quantitative analyses in Figure 4 (see equations below) measured CoM statistics for those longer-duration trials (second and higher quintiles).

The results presented in this paper were not critically dependent on training separate models for different delay periods. The discriminant hyperplanes calculated for different delay quintiles were closely related to each other. Consequently, training and cross-validating a single model based on the last 87-100 ms of the delay period of all trials replicated all key results (Fig. S1).

To identify candidate CoM’s during the motion-viewing period (Fig. S2) we used the changes in the sign of the decision variable, calculated from the models trained on all trials of a session (Fig. 3). In contrast to the delay period, we could not independently verify the choice-predictive accuracy of CoM’s during the motion-viewing period because the motion duration was the same for all trials. A priori, candidate CoM’s based on only a DV sign change will be less reliable during the motion-viewing interval due to the lower overall accuracy of the model predictions early in the trial (Fig. 2A). To compensate for this reduced accuracy we used a more stringent criterion. To quantify the frequency and direction of CoM for different motion strengths (Fig. S2A-B), we required the sign of DV to persist for 150 ms before and after a sign change (‘persistence window’) to qualify as a CoM. To quantify the time course of CoM (Fig. S2C), we reduced this requirement to 75 ms to increase the number of independent estimates of the decision state during motion viewing. The trends in Fig. S2A-B are robust to variations of the persistence window. The trend in Fig. S2C, however, is sensitive to the duration of the persistence window and should be interpreted cautiously; larger windows reduce and even abolish the decline of CoM frequency in the late motion-viewing period. This occurs because long persistence windows impose a sterner criterion for measuring decision states and detecting CoM’s, which inevitably delays CoM’s to later stages of the motion viewing interval and makes the decision state less likely to change once established. We have observed similar sensitivity to window size in simulations (data not shown).

To test whether the frequency of CoM’s during the delay period varied with the stimulus strength we used a linear regression model:

η=β0+β1C+β2q Eq. 3

where η is the number of CoM detected by the model in each trial, C is the motion strength, q is the quintile that the trial delay duration belonged to (2 to 5), and βi are the regression coefficients. Only trials of the second and higher quintiles contributed to the analysis; as described above, the first quintile could not have a CoM in the delay period. q was included in the regression to control for the increased probability of CoM detection due to increased number of quintile transitions for longer delay periods.

We used a related linear regression to test whether CoM’s were more likely to change the monkey’s response from an incorrect to a correct choice for stronger stimuli:

Δs=β0+β1C+β2q Eq. 4

where Δs is the difference between the number of CoM’s that move the choice prediction from incorrect to correct and vice versa (Δs > 0 implies more incorrect to correct). This analysis focused on trials with C > 0 in which at least one CoM was detected. Trials with C = 0 were excluded because of ambiguity in the definition of a correct response. The null hypothesis is that Δs does not change with motion strength (H0 : β1 = 0). After rejecting the null hypothesis via the linear regression analysis, we used a sign rank test to show that the median of Δs was larger than zero across all trials used in the regression analysis (significant prevalence of incorrect to correct changes).

To test whether the overall probability of being correct was larger on trials in which CoM’s were detected we used the following logistic regression:

Logit[P(cor)]=β0+β1C+β2I+β3q Eq. 5

where I is an indicator variable (0 for trials with no CoM and 1 for trials with CoM). C and q are the motion strength and delay duration quintile, respectively (same as in Eq. 3). β2 > 0 indicates that the subject was correct more frequently on trials in which CoM’s were detected.

We tested the dependence of the probability of changes of mind on time using a logistic regression:

Logit[P(switch)]=β0+β1C+β2q Eq. 6

where q′ is the delay quintile in which a CoM could be detected minus one. Only trials in the second or higher quintiles contributed to this analysis because at least two delay bins were required to detect a CoM, as described above. Each trial contributed as many data points as permitted by the length of its delay period (maximum=4). The null hypothesis is that the probability of observing a switch does not depend on time (H0 : β2 = 0).

All completed trials (correct and incorrect) were included in the calculation of discriminant hyperplane and subsequent analyses. The grouping of motion strengths to C1 (C≤6%), C2 (6%<C≤20%), and C3 (C>20%) in Fig. 4, S1, and S2 is for illustrative purposes only. All statistical analyses that are mentioned above were performed using the actual motion strengths of the trials.

Supplementary Material

01

In Brief Summary.

Kiani et al show that covert decision variables (DV) that underlie choices can be tracked on single behavioral trials by simultaneous recording of large neural populations in prefrontal cortex. They use vacillations of the neural DV to identify changes of mind (CoM) in monkeys, and show that they closely match the properties of human CoM.

Highlights.

  1. Prefrontal neural population activity affords accurate decoding of decision variables

  2. The decoded DV makes covert aspects of the decision-making process accessible

  3. Moment to moment changes of the decoded DV identify changes-of-mind (CoM’s)

  4. Statistical properties of monkey CoM’s match predictions based on human CoM’s

Acknowledgements

We thank K. Shenoy, M. Kaufman, D. Kimmel, V. Mante, D. Peixoto, V. McGinty, and M. Churchland for helpful discussions, and we thank Jamie Sanders, Jessica Powell, Sania Fong, and Julian Brown for technical assistance throughout the project. We are particularly grateful to Dr. Stephen Ryu, who was the primary surgeon for the array implants. This research was supported by the Howard Hughes Medical Institute, the Air Force Research Laboratory (agreement number FA9550-07-1-0537), a Berry Postdoctoral Fellowship to RK, and a Sloan Research Fellowship to RK.

References

  • 1.Platt ML, Glimcher PW. Neural correlates of decision variables in parietal cortex. Nature. 1999;400:233–238. doi: 10.1038/22268. [DOI] [PubMed] [Google Scholar]
  • 2.Shadlen MN, Newsome WT. Motion perception: seeing and deciding. Proceedings of the National Academy of Sciences of the United States of America. 1996;93:628–633. doi: 10.1073/pnas.93.2.628. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Shadlen MN, Newsome WT. Neural basis of a perceptual decision in the parietal cortex (area LIP) of the rhesus monkey. Journal of neurophysiology. 2001;86:1916–1936. doi: 10.1152/jn.2001.86.4.1916. [DOI] [PubMed] [Google Scholar]
  • 4.Hussar CR, Pasternak T. Common rules guide comparisons of speed and direction of motion in the dorsolateral prefrontal cortex. J Neurosci. 2013;33:972–986. doi: 10.1523/JNEUROSCI.4075-12.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Heitz RP, Schall JD. Neural mechanisms of speed-accuracy tradeoff. Neuron. 2012;76:616–628. doi: 10.1016/j.neuron.2012.08.030. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.de Lafuente V, Romo R. Neural correlate of subjective sensory experience gradually builds up across cortical areas. Proc Natl Acad Sci U S A. 2006;103:14266–14271. doi: 10.1073/pnas.0605826103. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Bollimunta A, Totten D, Ditterich J. Neural dynamics of choice: single-trial analysis of decision-related activity in parietal cortex. J Neurosci. 2012;32:12684–12701. doi: 10.1523/JNEUROSCI.5752-11.2012. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Resulaj A, Kiani R, Wolpert DM, Shadlen MN. Changes of mind in decision making. Nature. 2009;461:263–266. doi: 10.1038/nature08275. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 9.Rabbitt PM. Errors and error correction in choice-response tasks. J Exp Psychol. 1966;71:264–272. doi: 10.1037/h0022853. [DOI] [PubMed] [Google Scholar]
  • 10.Albantakis L, Branzi FM, Costa A, Deco G. A multiple-choice task with changes of mind. PloS one. 2012;7:e43131. doi: 10.1371/journal.pone.0043131. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Link SW. The Wave Theory of Difference and Similarity. Erlbaum; Hillsdale, NJ: 1992. [Google Scholar]
  • 12.Smith PL, Ratcliff R. Psychology and neurobiology of simple decisions. Trends Neurosci. 2004;27:161–168. doi: 10.1016/j.tins.2004.01.006. [DOI] [PubMed] [Google Scholar]
  • 13.Laming DRJ. Information Theory of Choice Reaction Time. Wiley; New York: 1968. [Google Scholar]
  • 14.Luce RD. Response Times: Their Role in Inferring Elementary Mental Organization. Oxford University Press; Belfast: 1986. [Google Scholar]
  • 15.Carpenter RH, Reddi BA, Anderson AJ. A simple two-stage model predicts response time distributions. The Journal of physiology. 2009;587:4051–4062. doi: 10.1113/jphysiol.2009.173955. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Krajbich I, Rangel A. Multialternative drift-diffusion model predicts the relationship between visual fixations and choice in value-based decisions. Proc Natl Acad Sci U S A. 2011;108:13852–13857. doi: 10.1073/pnas.1101328108. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Shadlen MN, Kiani R. Decision making as a window on cognition. Neuron. 2013;80:791–806. doi: 10.1016/j.neuron.2013.10.047. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Bollimunta A, Ditterich J. Local computation of decision-relevant net sensory evidence in parietal cortex. Cerebral cortex. 2012;22:903–917. doi: 10.1093/cercor/bhr165. [DOI] [PubMed] [Google Scholar]
  • 19.Horwitz GD, Newsome WT. Separate signals for target selection and movement specification in the superior colliculus. Science. 1999;284:1158–1161. doi: 10.1126/science.284.5417.1158. [DOI] [PubMed] [Google Scholar]
  • 20.Kim JN, Shadlen MN. Neural correlates of a decision in the dorsolateral prefrontal cortex of the macaque. Nature neuroscience. 1999;2:176–185. doi: 10.1038/5739. [DOI] [PubMed] [Google Scholar]
  • 21.Ratcliff R, Hasegawa YT, Hasegawa RP, Smith PL, Segraves MA. Dual diffusion model for single-cell recording data from the superior colliculus in a brightness-discrimination task. Journal of neurophysiology. 2007;97:1756–1774. doi: 10.1152/jn.00393.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Ding L, Gold JI. The basal ganglia's contributions to perceptual decision making. Neuron. 2013;79:640–649. doi: 10.1016/j.neuron.2013.07.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Donner TH, Siegel M, Fries P, Engel AK. Buildup of choice-predictive activity in human motor cortex during perceptual decision making. Current biology : CB. 2009;19:1581–1585. doi: 10.1016/j.cub.2009.07.066. [DOI] [PubMed] [Google Scholar]
  • 24.O'Connell RG, Dockree PM, Kelly SP. A supramodal accumulation-to-bound signal that determines perceptual decisions in humans. Nat Neurosci. 2012;15:1729–1735. doi: 10.1038/nn.3248. [DOI] [PubMed] [Google Scholar]
  • 25.Kayser AS, Buchsbaum BR, Erickson DT, D'Esposito M. The functional anatomy of a perceptual decision in the human brain. J Neurophysiol. 2010;103:1179–1194. doi: 10.1152/jn.00364.2009. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Heekeren HR, Marrett S, Bandettini PA, Ungerleider LG. A general mechanism for perceptual decision-making in the human brain. Nature. 2004;431:859–862. doi: 10.1038/nature02966. [DOI] [PubMed] [Google Scholar]
  • 27.Gilja V, Nuyujukian P, Chestek CA, Cunningham JP, Yu BM, Fan JM, Churchland MM, Kaufman MT, Kao JC, Ryu SI, et al. A high-performance neural prosthesis enabled by control algorithm design. Nat Neurosci. 2012;15:1752–1757. doi: 10.1038/nn.3265. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 28.Arce FI, Lee JC, Ross CF, Sessle BJ, Hatsopoulos NG. Directional information from neuronal ensembles in the primate orofacial sensorimotor cortex. J Neurophysiol. 2013;110:1357–1369. doi: 10.1152/jn.00144.2013. [DOI] [PubMed] [Google Scholar]
  • 29.Lemus L, Hernandez A, Luna R, Zainos A, Nacher V, Romo R. Neural correlates of a postponed decision report. Proc Natl Acad Sci U S A. 2007;104:17174–17179. doi: 10.1073/pnas.0707961104. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 30.Britten KH, Shadlen MN, Newsome WT, Movshon JA. The analysis of visual motion: a comparison of neuronal and psychophysical performance. J. Neurosci. 1992;12:4745–4765. doi: 10.1523/JNEUROSCI.12-12-04745.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 31.Kiani R, Churchland AK, Shadlen MN. Integration of Direction Cues Is Invariant to the Temporal Gap between Them. J Neurosci. 2013;33:16483–16489. doi: 10.1523/JNEUROSCI.2094-13.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Mante V, Sussillo D, Shenoy KV, Newsome WT. Context-dependent computation by recurrent dynamics in prefrontal cortex. Nature. 2013;503:78–84. doi: 10.1038/nature12742. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Chafee MV, Goldman-Rakic PS. Matching patterns of activity in primate prefrontal area 8a and parietal area 7ip neurons during a spatial working memory task. J Neurophysiol. 1998;79:2919–2940. doi: 10.1152/jn.1998.79.6.2919. [DOI] [PubMed] [Google Scholar]
  • 34.Kiani R, Hanks TD, Shadlen MN. Bounded integration in parietal cortex underlies decisions even when viewing duration is dictated by the environment. J. Neurosci. 2008;28:3017–3029. doi: 10.1523/JNEUROSCI.4761-07.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Purcell BA, Heitz RP, Cohen JY, Schall JD, Logan GD, Palmeri TJ. Neurally constrained modeling of perceptual decision making. Psychol Rev. 2010;117:1113–1143. doi: 10.1037/a0020311. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Thura D, Cisek P. Deliberation and commitment in the premotor and primary motor cortex during dynamic decision making. Neuron. 2014;81:1401–1416. doi: 10.1016/j.neuron.2014.01.031. [DOI] [PubMed] [Google Scholar]
  • 37.Stanford TR, Shankar S, Massoglia DP, Costello MG, Salinas E. Perceptual decision making in less than 30 milliseconds. Nat Neurosci. 2010;13:379–385. doi: 10.1038/nn.2485. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Resulaj A, Kiani R, Wolpert DM, Shadlen MN. Changes of mind in decision-making. Nature. 2009;461:263–266. doi: 10.1038/nature08275. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Horwitz GD, Newsome WT. Target selection for saccadic eye movements: prelude activity in the superior colliculus during a direction-discrimination task. Journal of neurophysiology. 2001;86:2543–2558. doi: 10.1152/jn.2001.86.5.2543. [DOI] [PubMed] [Google Scholar]
  • 40.Green DM, Swets JA. Signal Detection Theory and Psychophysics. Wiley; New York: 1966. [Google Scholar]
  • 41.Schall JD, Hanes DP. Neural basis of saccade target selection in frontal eye field during visual search. Nature. 1993;366:467–469. doi: 10.1038/366467a0. [DOI] [PubMed] [Google Scholar]
  • 42.Zaksas D, Pasternak T. Directional signals in the prefrontal cortex and in area MT during a working memory for visual motion task. J Neurosci. 2006;26:11726–11742. doi: 10.1523/JNEUROSCI.3420-06.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Hanks TD, Ditterich J, Shadlen MN. Microstimulation of macaque area LIP affects decision-making in a motion discrimination task. Nature neuroscience. 2006;9:682–689. doi: 10.1038/nn1683. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Salzman CD, Murasugi CM, Britten KH, Newsome WT. Microstimulation in visual area MT: effects on direction discrimination performance. J. Neurosci. 1992;12:2331–2355. doi: 10.1523/JNEUROSCI.12-06-02331.1992. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Law CT, Gold JI. Neural correlates of perceptual learning in a sensory-motor, but not a sensory, cortical area. Nature neuroscience. 2008;11:505–513. doi: 10.1038/nn2070. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Hastie T, Tibshirani R, Friedman J. The elements of statistical learning. 2nd Springer; New York: 2008. [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

01

RESOURCES