Skip to main content
PLOS Biology logoLink to PLOS Biology
. 2022 Dec 15;20(12):e3001861. doi: 10.1371/journal.pbio.3001861

Integrated neural dynamics of sensorimotor decisions and actions

David Thura 1,¤a, Jean-François Cabana 1,¤b, Albert Feghaly 1,¤c, Paul Cisek 1,*
Editor: Alexander Gail2
PMCID: PMC9754259  PMID: 36520685

Abstract

Recent theoretical models suggest that deciding about actions and executing them are not implemented by completely distinct neural mechanisms but are instead two modes of an integrated dynamical system. Here, we investigate this proposal by examining how neural activity unfolds during a dynamic decision-making task within the high-dimensional space defined by the activity of cells in monkey dorsal premotor (PMd), primary motor (M1), and dorsolateral prefrontal cortex (dlPFC) as well as the external and internal segments of the globus pallidus (GPe, GPi). Dimensionality reduction shows that the four strongest components of neural activity are functionally interpretable, reflecting a state transition between deliberation and commitment, the transformation of sensory evidence into a choice, and the baseline and slope of the rising urgency to decide. Analysis of the contribution of each population to these components shows meaningful differences between regions but no distinct clusters within each region, consistent with an integrated dynamical system. During deliberation, cortical activity unfolds on a two-dimensional “decision manifold” defined by sensory evidence and urgency and falls off this manifold at the moment of commitment into a choice-dependent trajectory leading to movement initiation. The structure of the manifold varies between regions: In PMd, it is curved; in M1, it is nearly perfectly flat; and in dlPFC, it is almost entirely confined to the sensory evidence dimension. In contrast, pallidal activity during deliberation is primarily defined by urgency. We suggest that these findings reveal the distinct functional contributions of different brain regions to an integrated dynamical system governing action selection and execution.


Recent theoretical models suggest that deciding about actions and executing them are not mediated by completely distinct neural mechanisms but are instead two modes of a single system. Analysis of neural activity in several cortical and subcortical regions reveals how they implement an integrated dynamical system that governs deliberation, selection, and execution of actions.

Introduction

During natural behavior, we are continuously interacting with a complex and dynamic world [1,2]. That world often does not wait for us to make up our minds about perceptual judgments or optimal choices, and inaction can lead to lost opportunities, or worse. Furthermore, we must often make decisions while we are already engaged in an action, such as while navigating through our environment or playing a sport [3,4]. These considerations suggest that the neural mechanisms involved in selecting and executing actions should be closely integrated within a unified sensorimotor control system [5]. Indeed, many neural studies have shown considerable overlap between the brain regions involved in action selection and sensorimotor control [610].

However, while the anatomical overlap between the distributed circuits of decision-making and sensorimotor control is well established, theoretical models of these processes remain largely separate. Decision-making is often modeled as the accumulation of evidence until a threshold is reached [1119], at which time a target is chosen. Models of movement control usually begin with that chosen target, toward which the system is guided through feedback and feedforward mechanisms [2022]. But if the neural circuits involved in action selection and sensorimotor control are truly as unified as neural data suggest, then theories of these processes should be similarly unified. One promising avenue toward an integrated account of selection and control is to consider both as aspects of a single distributed dynamical system, which transitions from a biased competition between actions [2328] into an “attractor” that specifies the initial conditions for implementing the chosen action through feedback control [2931].

Here, we test whether neural activity in key cortical and subcortical regions exhibits properties that would be expected from a unified dynamical system for action selection and sensorimotor control. We focus on cells recorded in monkey dorsal premotor (PMd) and primary motor cortex (M1), which are implicated in both selection and control [8,10,3236], as well as the dorsolateral prefrontal cortex (dlPFC), which is implicated in representing chosen actions [37]. In addition, we examine activity in the output nuclei of the basal ganglia, the globus pallidus externus (GPe) and internus (GPi), whose role in selection and/or motor control is under vigorous debate [3841]. Importantly, we examine the activity of all of these regions recorded in the same animals performing the same reach selection task, making it possible to quantitatively compare activities in different brain areas using the same metrics.

To disentangle neural activity related to deliberation, commitment, and movement, we trained monkeys to perform the “tokens task” (Fig 1) (see Methods). In the task, the subject must guess which of two targets will receive the majority of tokens jumping randomly from a central circle every 200 ms (Fig 1A). The subject does not have to wait until all tokens have jumped but can take an early guess, and after a target is reached, the remaining tokens jump more quickly (every 150 ms or every 50 ms in separate “Slow” and “Fast” blocks of trials). Thus, subjects are faced with a speed–accuracy trade-off (SAT)—to either wait to be confident about making the correct choice or to take an early guess and save some time, potentially increasing their overall reward rate. If we assume that commitment occurs shortly before movement onset, then we can delimit within each trial a period of deliberation (Fig 1B) during which neural activity should correlate with the sensory evidence related to token jumps as well as to subjective policies related to the SAT. Furthermore, because we can precisely quantify the success probability (SP) associated with each choice after every token jump, we can compute for each trial a temporal profile of the sensory evidence and categorize trials into similarity classes (Fig 1C), including “easy trials,” “ambiguous trials,” and “misleading trials” (see Methods for details).

Fig 1.

Fig 1

(A) During each trial of the “tokens task,” 15 tokens jump, one every 200 ms, from the central circle to one of two outer target circles. The subject’s task is to move the cursor (black cross) to the target that will ultimately receive the majority of the tokens. (B) Temporal profile of the “success probability” that a given target is correct. Once a target is reached, the remaining token jumps accelerate to one every 150 ms (“Slow” block) or 50 ms (“Fast” block). We subtract from movement onset the mean reaction time (RT), measured in a separate delayed-response task, to estimate commitment time (purple bar) and the success probability at commitment time (dotted red horizontal line). (C) Success probability for choosing the target on the right, in trial types defined on the basis of the success probability profile (see Methods), here computed after aligning to movement onset (vertical black dashed line). The vertical purple bar indicates estimated commitment time. Solid curves: correct target on the right; dashed curves: correct target on the left. (D) Recording locations. Medial is up. Dashed white lines indicate the estimated location of the globus pallidus. asl, lower limb of the arcuate sulcus; asu, upper limb of the arcuate sulcus; cs, central sulcus; ps, principal sulcus; spcd, superior precentral dimple.

Previous studies have shown that both human and monkey behavior in the tokens task is well explained by the “urgency-gating model” (UGM) [14,16,42], which suggests that during deliberation, the sensory evidence about each choice (provided by the token distribution) is continuously updated and combined with a nonspecific urgency signal, which grows over time in a block-dependent manner, and commitment to a given choice is made when the product of these reaches a threshold.

Recent neural recordings [36,43,44] have largely supported these proposals, but many questions remain. Are “decision-related” neurons part of a module for choosing a target, which sends its output to a separate module of “movement-related” neurons? Is commitment determined by the crossing of a neural “threshold”? Do the basal ganglia contribute to the deliberation process [39,4549], or do they simply reflect a choice taken in cortical regions [50,51] and contribute only to movement execution [52]? Answering these questions is difficult given the heterogeneity of cell properties [8,32,5356] and their apparently continuous distribution along rostro-caudal gradients [54,57] or cortical layers [58]. This leads one to consider whether, instead of serial modules, action selection and execution are two modes of a unified recurrent system distributed across the frontoparietal cerebral cortex and associated basal ganglia/thalamic loops.

Here, we test this proposal by examining activity across all of the regions we recorded in the tokens task (PMd, M1, GPe, GPi, and dlPFC), but without a priori classifying cells into putative functional categories. Instead, we use the neural space approach pioneered in recent years [5974], in which the entire system is described as a point in a very high-dimensional space defined by the activity of all recorded cells, and then reduced into a lower-dimensional representation that reveals the main factors governing cell activity across the system. We then perform specific analyses to characterize how activity unfolds during deliberation and commitment, comparing the dynamics of different regions, and quantify to what extent cell properties cluster into distinct functionally interpretable roles. Previous studies have used similar techniques to examine whether specific neural populations exhibit distinct [75] versus mixed selectivity [7678] to various task-related variables. Here, we ask about the relationships between decision-making and movement. In particular, models with serial decision and action stages predict that neural activity will be separable into components related to deliberation and other components related to movement initiation/execution. In contrast, recurrent attractor models predict a single population in which all of these factors are mixed in a continuum. Some of these results have previously appeared in abstract form [7982].

Results

Neural data

We recorded spiking activity from a total of 736 well-isolated individual neurons in the cerebral cortex and basal ganglia of two monkeys (S and Z), recorded at the locations shown in Fig 1D. Of these, 356 were recorded in PMd (237 from monkey S), 211 in M1 (79 from monkey S), 62 in dlPFC (60 from monkey S), 51 in GPe (19 from monkey S), and 56 in GPi (22 from monkey S). The properties of some of these neurons have been reported in previous publications, focusing on tuned activity in PMd and M1 and the basal ganglia [36,43,44,83]. Here, we additionally include neurons recorded in dlPFC, which, for technical reasons, were almost exclusively obtained in monkey S and to date only described in abstract form [79,80].

Dimensionality reduction of neural activity into principal components

To examine how the activity of this entire population evolves over time during the task, we performed principal components analysis (PCA) on cells recorded in all five brain regions, including any well-isolated neuron that was recorded in both Slow and Fast blocks (see Fig A in S1 Text for a schematic describing our approach). This included a total of 637 neurons, including 291 in PMd, 195 in M1, 53 in dlPFC, 45 in GPe, and 53 in GPi. In principle, there are many ways one can perform PCA. For example, one approach is to perform PCA separately on neurons from each region, analyzing the dynamics of each independently. Alternatively, one can perform PCA on all cells together, producing a “loading matrix” of coefficients (from each neuron to each PC) that allows one to “project” each population into the space of the same PCs. There are also many ways to align the data (on the start of the trial versus movement onset), different time periods one can include (only before movement, only after, or both), and different ways of grouping trials (into large averaged groups versus specific subtypes). Importantly, as shown in Fig B-E in S1 Text), our results were remarkably robust regardless of these choices and held up when applied to data from each monkey separately, consistently leading to the same conclusions. Using factor analysis and gaussian process factor analysis (GPFA) [67] produced very similar results, so for simplicity, here we focus on PCA because it is the simplest and most readily interpretable.

Here, we show analyses performed using PCA on data aligned on movement onset (because we are interested in the dynamics of commitment) performed on all cells together (because we seek quantitative comparisons of the same PCs across regions). Note that because we recorded many more neurons in PMd and M1 than in the other regions, the variance explained was dominated by the PMd/M1 dynamics. However, reducing the number of cells to an equal number in each region did not change the results apart from making them noisier and changing the percentage of variance explained by individual PCs (Fig Ca in S1 Text). Furthermore, we provided the PCA algorithm with trials grouped into only four large groups—right and left choices in the fast and slow blocks—and let further analyses reveal whether more specific subtypes (easy, ambiguous, etc.) can nevertheless be extracted from the results. Finally, we imposed symmetry on our population by following the “anti-neuron” assumption, which assumes that for every neuron we recorded, there exists a similar neuron with the opposite relationship to target direction (even for cells that are not tuned), effectively doubling the number of neurons. See the Methods section for the justification and motivation for this approach, and Fig E in S1 Text for analyses showing that this procedure does not alter our conclusions.

The first 20 PCs together explained 97.9% of the variance in activity over time across the four groups of trials (Slow-Left, Slow-Right, Fast-Left, and Fast-Right) used for the PCA. Fig 2 shows the temporal profile for the first 7 PCs, separately for easy, ambiguous, and misleading trials in both slow and fast blocks, for both left and right choices, each constructed as a weighted average of the 402 cells that contained trials in all of these conditions (Group 3; see Methods).

Fig 2. Components produced by PCA.

Fig 2

The cumulative variance explained by the first 20 components is shown at the bottom right, and the temporal profiles of the top 7 PCs are shown in the rest of the figure. Each of those 7 panels shows the average activity of all cells, weighted by their loading coefficient onto the given PC, for 12 trial groups, including combinations of 3 trial types: easy (blue), ambiguous (green), and misleading (red); two blocks: Slow (solid) and Fast (dashed); for choices made to the left or right (indicated for the components where they differ). Note that the sign is arbitrary because the loading matrix can have positive or negative values. Shaded regions indicate 95% confidence intervals computed using bootstrapping. In each panel, data are shown aligned to the start of token jumps as well as to movement onset. Commitment time is estimated to be 280 ms before movement onset, based on our prior studies [36, 44]. Each panel is scaled to have the same range in the y-axis. PCs are built from the 402 cells that have trials in all conditions. Data and code available at https://doi.org/10.6084/m9.figshare.20805586.

The first 4 components, which together explain 80.3% of total variance, are clearly interpretable in terms of the key elements of the UGM. The first PC (31.6% of variance) is nearly identical across all conditions and reflects the transition between deliberation (prior to commitment) and action (after movement onset). It is similar to the main condition-independent component reported in other neural space studies in both primates [8488] and rodents [76,89,90]. The second PC (25.5%) exhibits two phases. Prior to commitment, it reflects the time course of the sensory evidence on which the monkey made his choice, distinguishing easy, ambiguous, and misleading trials. After commitment, it simply reflects the choice made, without distinguishing trial types. The third PC (13.4%) reflects the block-dependent aspect of urgency, distinguishing between the slow and fast blocks (even before the start of the trial). The fourth PC (9.7%) reflects the time-dependent aspect of urgency until just before movement onset. The remaining components are similar to PCs 2 and 4 during deliberation but capture some of the heterogeneity across cell activity patterns after commitment and movement onset. We discuss these higher PCs in the S1 Text.

PC2 warrants special attention. Note that until the moment of commitment, it correlates very well with the evidence provided by the token movements. Indeed, as shown in Fig 3A and 3B, the correlation between the time-delayed evidence and the value of PC2 is highly significant (p < 10−100) with a correlation coefficient of R = 0.92. This is notable because the PCA algorithm was not given any information about these different trial types (easy, ambiguous, misleading, etc.) but was simply given data averaged across four large trial groups that only distinguished left versus right movements and slow versus fast blocks. Nevertheless, when the resulting temporal profiles of PC2 are calculated for specific trial types they clearly reflect how the evidence dynamically changes over the course of deliberation in those trials.

Fig 3. Comparison between PC2 and sensory evidence.

Fig 3

(A) Dashed lines show the sensory evidence in easy (blue), ambiguous (green), and misleading trials (red) calculated as the difference between success probability for the right target and 0.5. Shaded ribbons show the mean and 95% confidence interval of PC2 in the same trials (Slow block, Group 1 cells). The first vertical dotted line indicates commitment and the second indicates movement onset, on which all data are aligned. The evidence trace is delayed by 300 ms, which provides the best fit. Note that until the moment of commitment, the pattern of PC2 closely resembles the evidence, except for diverging toward one of the choices even in the absence of evidence during ambiguous trials. (B) The same data, prior to commitment, plotted as evidence versus PC2 in these six trial conditions. The correlation coefficient is R = 0.9234 and p-value is well below 0.001. (C) The distribution (N = 100) of correlation coefficients obtained by performing the same analysis on surrogate data sets generated using the Tensor Maximum Entropy approach (see Methods). Here, each surrogate data set is represented by the highest correlation coefficient of any of the top 10 PCs against the profile of evidence. The mean R is 0.5294 (s.t.d. = 0.1615). For comparison, the red line shows the R value from the real data (panel B), and it is higher than all of the R values from surrogate data (p < 0.01). Data and code available at https://doi.org/10.6084/m9.figshare.20805586.

Could this finding be a trivial consequence of overall cell tuning? To test this possibility, we used the Tensor Maximum Entropy (TME) method of Elsayed and Cunningham [91] to generate synthetic data sets that retain primary features such as tuning, but are otherwise random (see Methods), and processed them in the same way as the real data, including cell duplication. As shown in Fig 3C, when PCA is applied to such synthetic data sets, it does not produce components that are as well correlated with evidence as PC2 from our true data (p < 0.01). The implication is that the emergence of PC2 in the real neural data requires a consistent relationship between how cells reflect the final choice (left versus right) and how they reflect the evidence that leads to that choice during deliberation (easy versus ambiguous versus misleading, etc.).

Fig 4A and 4B shows the trajectories of the different trial types, separately for the Slow and Fast blocks, plotted in the space of PCs 1, 2, and 4 (PC3 was not used because it is approximately constant in time, as shown in Fig 2). See S1 Movie for an animated version of this figure. As indicated in Fig 4A (dotted black arrows), in both block types, the neural activity evolves in a clockwise manner in the space of PC1 and PC4, passing over a region of deliberation until reaching a commitment state (purple ellipses), whereupon it rapidly moves to a movement-specific initiation subspace (green ellipses) and then turns back toward the starting point during movement execution. Some of these phenomena have previously been reported using neural space analyses of preparatory and movement-related activity in cortical regions during instructed reaching tasks with a single target [8487,9294]. Here, our task allows us to examine in more detail what happens during the process of prolonged deliberation, while subjects are selecting among multiple targets.

Fig 4. Neural trajectories in the space of PCs 1, 2, and 4, constructed separately for easy, ambiguous, and misleading trials as well as the other (unclassified) trials.

Fig 4

Each panel shows data aligned on movement and computed from 1,400 ms before to 400 ms after, as well as data aligned on the first token jump and computed from 200 ms before to 500 ms after (using the same loading matrix). (A) Data from slow blocks (Group 1 cells). Shaded colored regions around each trajectory indicate the 95% confidence interval computed using bootstrapping. Dotted arrows indicate how the state of activity evolves over time. The gray wireframe encloses all states visited during the deliberation epoch (the “decision manifold”), across all trial types (Ψ = 0.235). Purple ellipses indicate the region in which commitment occurs (indicated for individual trial types by small colored circles), and green ellipses indicate the point at which movement is initiated. (B) Data from fast blocks (Group 2 cells), same format. Ψ = 0.403. (C) Comparison of the neural trajectories between the blocks, using cells that have data in all conditions (Group 3 cells). The blue wireframe shows the decision manifold for Slow blocks, and the red shows it for Fast blocks. To avoid clutter, we do not show the confidence intervals. (D) Same data as in (C) but projected into the space of PCs 2, 3, and 4. This facilitates comparison of the manifold and trajectories in the PC2-PC4 plane, as well as the contextual shift (blue double arrow) between the two blocks captured by PC3. Data and code available at https://doi.org/10.6084/m9.figshare.20805586. PC, principal component.

In Fig 4A and 4B, we have drawn a gray wireframe around all of the points from the beginning of the trial until commitment time (280 ms before movement onset), across all trial types, thus defining the subspace within which deliberation occurs. This subspace resembles a triangular surface that is extended mostly in PC2 and PC4 and curved slightly into PC1. It is quite thin—for example, in the slow block, the value of Ψ (see Methods) is 0.235, which is roughly equivalent to a triangular sheet whose thickness is 1/45th of the length of each side. We call this the “decision manifold.”

The flow of neural states upon the decision manifold is quite orderly, proceeding from bottom to top as time elapses and shifting left and right with the sensory evidence. For example, consider the misleading trials, in red, which clearly reveal the switch in sensory evidence. In effect, the flow of the neural state during deliberation resembles the temporal profile of evidence (Fig 1C) mapped onto that curved wireframe surface. The neural state continues to flow along the decision manifold until it reaches one of two edges (purple ellipses) at the time of commitment and then turns into PC1 and accelerates to rapidly flow along one of two paths, each corresponding to the choice taken, until movement initiation (green ellipses).

Fig 4C compares the decision manifolds between the two blocks, this time using only the cells that possess trials in all conditions (Group 3; see Methods). Note that in the space of PCs 1, 2, and 4 the shape of the manifold is quite similar, with just a slight upward shift of the Fast relative to the Slow block. Fig 4D plots the same data in the space of PCs 2, 3, and 4. This effectively flattens the trajectories shown in panel C into the PC2-PC4 plane, which captures how the neural state changes as a function of evidence and elapsing time within each block. The difference between the blocks is captured by the orthogonal shift along PC3. Similar shifts in neural space have been reported to capture learning in motor cortex [95].

Region-specific dynamics

While the structure of the neural space computed across all neurons is interesting, it is still more informative to compare that structure across the different brain regions in which we recorded. Because the loading matrix produced by PCA provides coefficients that map each individual neuron’s contribution to each PC, we can calculate the weighted average of each region separately, as shown in Fig 5.

Fig 5. The top four principal components shown in Fig 2, but here the weighted averages were computed separately from each of the five brain regions, using cells recorded in all conditions (Group 3).

Fig 5

For PC2, we report the R value of the correlation between evidence and neural activity during deliberation, as in Fig 3B. Data and code available at https://doi.org/10.6084/m9.figshare.20805586. dlPFC, dorsolateral prefrontal cortex; GPe, globus pallidus externus; GPi, globus pallidus internus; M1, primary motor cortex; PC, principal component; PMd, dorsal premotor cortex.

Importantly, note that while PMd and M1 both strongly reflect the first four components described above, the other regions do not. In particular, dlPFC shows a clear relationship to evidence in PC2 and a slight block dependence in PC3, but it shows virtually no effect of elapsing time or commitment (both PC1 and PC4 are relatively flat). By contrast, in both GPe and GPi, PC1 shows a clear response at commitment, but PC2 shows no relationship to evidence until the decision is made. These findings suggest distinct functional roles for these regions.

We can use these same region-specific PCs to construct region-specific neural space trajectories. Fig 6A shows this for the dorsal premotor cortex, where we see a structure that is quite similar to what was shown in Fig 4A for all neurons (not surprisingly since the PMd population is the largest). As before, we see a triangular decision manifold that is relatively thin (Ψ = 0.343) and extends along PC2 and PC4. However, note that it initially strongly leans in the negative PC1 direction and then curves around just before commitment (see the side view shown in Fig 6A, right). Interestingly, when plotted in the space of PCs 1, 2, and 4, the PMd decision manifold is curved as if it lies on the surface of a sphere (Fig 6C, spherical fit R2 = 0.65). In contrast, the decision manifold of M1 cortex (Fig 6B) is remarkably flat and thin (Ψ = 0.252) and leans into the positive PC1 direction. Nevertheless, the evolution of the neural state along the surface of the decision manifold in both regions obeys the same pattern shown in Fig 4, proceeding from bottom to top as time elapses and shifting left and right with the sensory evidence, always lying within the same subspace, curved for PMd and planar for M1.

Fig 6.

Fig 6

Neural trajectories in the space of PCs 1, 2, and 4, plotted using activity during Slow blocks (Group 1 cells), separately for PMd (A; see S2 Movie), and M1 (B; see S3 Movie). Separate trajectories are plotted for easy, ambiguous, and misleading trials, as well as all trials in which decisions were shorter than 1,400 ms (early) and in which decisions were longer than 1,400 ms (late). For clarity, confidence intervals have been omitted. As in Fig 4, trajectories are computed from data aligned on movement onset and extend from 1,400 ms prior. Blue (PMd) and red (M1) wireframes enclose all states before commitment (280 ms before movement). We also superimpose trajectories computed from data aligned on the start of the trial and until 500 ms later (projected through the same loading matrix). Dashed black arrows in panel B, left, indicate where these separate trajectories can be seen in the M1 space. In all panels, the trajectory of misleading trials in which the monkey correctly chose the right target is highlighted with a thicker line. Purple ellipses emphasize the time of commitment and green ellipses emphasize movement onset. (C) Spherical and planar fits to the decision manifolds computed from PMd (left) and M1 (right). (D) Comparison of spherical fits to bootstrapped data, assessed using the normalized radius and R2 value of the best fit sphere. Solid lines indicate the 99% confidence ellipse for each resampled population, and large dots show the fits to the original data. Data and code available at https://doi.org/10.6084/m9.figshare.20805586. M1, primary motor cortex; PC, principal component; PMd, dorsal premotor cortex.

As shown in Fig 6C, the curved PMd manifold fits well (R2 = 0.65) to a small sphere whose radius is approximately half of the manifold range (that is, the manifold wraps around nearly half of the sphere). By contrast, the planar M1 manifold produces a weaker fit (R2 = 0.44) to a sphere with a much larger radius. To evaluate the robustness of this difference in manifold shapes, we performed a bootstrap analysis with 1,000 repetitions of spherical fits to the decision manifold (see Methods) using cells randomly resampled from the original PMd and M1 populations. As shown in Fig 6D, bootstrapped PMd manifolds consistently fit well to a small sphere. While bootstrapped M1 manifolds were sometimes best fitted with a small sphere, these fits were always weak (R2 between 0.3 and 0.4). Importantly, no points from either distribution fell within the 99% confidence ellipse of the other distribution (p < 0.001), indicating that the difference in manifold shapes was highly robust. In the Discussion, we propose that these shapes reveal functionally meaningful differences between the neural dynamics of these two cortical regions.

It is also noteworthy that in both PMd and M1, the state reached at the moment of commitment (purple ellipses in Fig 6A and 6B) shows an orderly relationships with the reaction time in each trial type (shortest in “early” trials, and longest in “late” trials). This is in agreement with the observation that even at a single trial level, a consistent relationship between neural state and reaction time can be observed during a simple instructed reach task [84]. Furthermore, in M1, the trajectories after that point converge to arrive in a relatively compact subspace (green ellipses) at movement onset—what Churchland and colleagues [96] called an “optimal subspace.”

In contrast to PMd and M1, the deliberation manifold in dlPFC (Fig 7A) is almost exclusively extended along PC2 (Ψ = 0.564, like a cylinder whose length is 22 times its radius). Like PMd and M1, the dlPFC neural state shifts left and right along PC2 with sensory evidence but shows almost no component of elapsing time (Fig 5), and after commitment, it exhibits only a small excursion into PC1. This suggests that neural activity in dlPFC primarily reflects the sensory evidence used to make decisions in the task, consistent with many previous studies [9,97103].

Fig 7. Neural trajectories in the space of PCs 1, 2, and 4, plotted using activity during Slow blocks (Group 1 cells), separately for dlPFC (A), GPe (B), and GPi (C).

Fig 7

Same format as Fig 6. Data and code available at https://doi.org/10.6084/m9.figshare.20805586. dlPFC, dorsolateral prefrontal cortex; GPe, globus pallidus externus; GPi, globus pallidus internus; PC, principal component.

A strong contrast to the cortical data is seen when we examine the neural state of cells recorded in the globus pallidus (GPe and GPi). In both regions, the neural state during deliberation is confined within a subspace that is not a thin manifold but instead resembles a ball compressed along PC2 and extended along PC4 (Fig 7B and 7C), which reflects the temporal component of urgency. During deliberation, activity in these regions tends to rise along PC4 but does not evolve in an orderly fashion along PC2, as seen in cortex. This does not appear to be caused by the lower number of cells recorded in GPe and GPi as compared to PMd and M1, because similar results hold when we restrict all regions to the same small number of cells (Fig Ca in S1 Text). Nevertheless, by the time commitment occurs, the state of both GPe and GPi lies in a choice-specific subspace (purple ellipses) and then evolves quickly to a corresponding initiation subspace. These findings are consistent with our previous report of GPe/GPi activity in the tokens task, in which we suggested that these regions do not determine the choice but rather contribute to the process of commitment [44].

Analyses of the loading matrix

Another approach for inferring the putative functional contributions of different brain regions is to examine the distribution of the loading coefficients for cells in each region. For example, if a given population of cells is strongly related to the sensory evidence provided by token jumps, then cells in that population should tend to have higher magnitude loading onto PC2 than cells from another population that is less sensitive to evidence. As described in Methods, we characterized the distribution of loading coefficients for the first 11 PCs (which capture 95% of variance) by fitting the 11-dimensional space of points for each brain region with gaussian mixture models (GMMs) (for a related approach, see [77]). The results are shown in Fig 8.

Fig 8. Analysis of loading coefficients (all cells).

Fig 8

(A) Each point indicates the weight of the contribution of a given PMd cell to the principal component indicated on the axes. Colored ellipses indicate the centroid and 3 times standard deviation of each of the two gaussians (labeled 1 and 2) that provided the best fit to the distribution of these populations. (B) The same PMd gaussians in the 3D space of PC1, PC2, and PC4. (C) Same as A for M1. (D) Same as B for M1. (E) Same as A for cell populations in dlPFC (green), GPe (cyan), and GPi (purple), each of which was best fit with a single gaussian. Note that unlike A and C, here the right panel shows PC4 vs. PC1. (F) Same as B for dlPFC, GPe, and GPi. Data and code available at https://doi.org/10.6084/m9.figshare.20805586. dlPFC, dorsolateral prefrontal cortex; GPe, globus pallidus externus; GPi, globus pallidus internus; M1, primary motor cortex; PC, principal component; PMd, dorsal premotor cortex.

The distribution of loading coefficients for cells in PMd (Fig 8A) was highly distributed but not without structure, and was best fit with two gaussians. The first (1: 68% contribution to the fit) was dominated by cells only weakly contributing to PCs 1, 2, and 4 but strongly to PC3. The second (2: 32%) included cells that strongly contributed to PCs 1 and 2 but more weakly to PCs 3 and 4. The distribution for M1 was also best fit with two gaussians, one (1: 56%) contributing mostly to PC3 and another (2: 44%) contributing to 1, 2, and 4 but not 3. Thus, in both PMd and M1, there was a trend for cells that most strongly reflect the animal’s SAT (PC3) to be less strongly tuned to direction (PC2) and vice versa (Fig 8A and 8C, middle panels).

In contrast, populations in dlPFC, GPe, and GPi were well fit with a single gaussian (Fig 8E and 8F), though perhaps more structure would have been seen with larger numbers of cells. Importantly, there were significant and potentially functionally relevant differences between these regions. In particular, the distribution of loadings from dlPFC and GPi were nearly orthogonal to each other in the space of PCs 1, 2, and 4 (Fig 8F). The dlPFC was extended along PC2 and not along the other components, consistent with the proposal that it primarily carries information on the sensory evidence provided by token movements. In contrast, GPi was relatively narrow in PC2 and instead extended along PCs 3 and 4, related to the block and time-dependent aspects of urgency. Furthermore, there was a significant negative correlation (R = −0.32, p < 0.001) between GPi loadings on PC1 versus PC4 (Fig 8E, right panel, purple). In other words, cells that build up over time (positive on PC4) tend to reduce their activity after commitment (negative on PC1), while cells that decrease over time (negative on PC4) tend to increase after commitment (positive on PC1).

In summary, our analyses of loading matrices suggest that while different regions do appear to contribute to different subsets of PCs (e.g., the orthogonal relationship of dlPFC and GPi in Fig 8F), the population within each region does not contain distinct clusters. In other words, while there is some structure in the loading matrix (e.g., Fig 8A and 8B, middle panel), the distributions of properties within each region are continuous.

Could that continuity of properties be an artifact resulting from dimensionality reduction? That is, if distinct categories of cells with different functional roles really did exist, would our analyses be able to identify them? To address this question, we created a variety of synthetic populations of neuron-like units and applied to them the same analyses we used to examine real data, including PCA and GMM analyses of the resulting loading matrix. As described in the S1 Text (see Fig F in S1 Text), this yielded three conclusions: It confirmed that PCA does correctly identify all of the components from which our synthetic neural populations were constructed. However, it also showed that some of the higher-order components (like PC5 in Fig 2) can result from PCA “cancelling out” some of the firing patterns already captured by lower-order components (e.g., PC2), in order to explain things like neurons tuned only during movement. Nevertheless, in all cases, the GMM analysis of the loading matrix correctly identified all of the actual categories of units in the synthetic populations. This suggests that the lack of distinct clusters we found in our neural data indeed reflects the absence of separate categories in the real neural populations. In S1 Text, we also discuss other methods that test for uniformity and clustering [75,76].

Discussion

Many neurophysiological studies have suggested that decisions between actions unfold within the same sensorimotor regions responsible for the control of those actions [1,104106]. This includes FEF and LIP for gaze choices [7,107110] and PMd and MIP for reaching choices [8,36,110114]. Several computational models of the decision mechanism suggest that it behaves like a “recurrent attractor system” [2328], where reciprocally competing groups of neurons tuned for the available choices compete against each other until one group wins and the system falls into an attractor corresponding to a specific choice.

The results reported here provide strong support for this class of models. In particular, the cells we recorded in PMd and M1 do not appear to belong to separate categories related to decision-making versus movement preparation or execution but instead behave like part of an integrated dynamical system that implements a biased competition and transitions to commit to a choice through a winner-take-all process. During deliberation, the pattern of cell activity in these regions is confined to a highly constrained subspace in the shape of a thin manifold and is shifted around within that manifold by the decision variables pertinent to the task (here, the sensory evidence and the rising urgency). When commitment occurs, the same group of cells now transitions from the decision manifold to a roughly orthogonal tube-shaped subspace corresponding to a specific choice [76] and then quickly flows to a subspace related to movement initiation [85,96]. This is precisely the kind of “winner-take-all” phase transition that occurs in recurrent attractor models. Importantly, the transition to commitment is not equivalent to the crossing of a threshold that is definable as the critical firing rate of a given neuron type, but instead resembles falling off the edge of the decision manifold into an orthogonal subspace that loses sensitivity to evidence. As can be seen in Fig 6A and 6B, that edge is oriented approximately diagonally with respect to PC2 and PC4, implying that straightforward summation of the activity of cells tuned to the chosen target may not be constant across trial types, as indeed often observed [36,115]. In other words, commitment is more nuanced than a threshold crossing; it is a state transition in a dynamical system [23].

Additional insights can be obtained by examining the low-dimensional components produced by dimensionality reduction. In particular, it is noteworthy that during deliberation, the four strongest components of neural activity (which together account for just over 80% of the variance) capture the key elements of the urgency-gating model [14]: the momentary evidence (in PC2), the urgency signal (a context-dependent baseline in PC3 and time-dependent ramping in PC4), and the transition to commitment (PC1). Perhaps even more important is how these components are differently expressed in the different cell populations. In particular, the neural state in dlPFC varies almost entirely along PC2 while, conversely, the state in GPe/GPi during deliberation is primarily determined by PC3 and PC4 and not at all by PC2. This suggests that information about sensory evidence is provided by prefrontal cortex [9,97103] while the urgency signal is coming from the basal ganglia [44,116,117], and the two are combined in PMd and M1 to bias a competition between action choices [8,34,36,113,114,118].

The presence of the evidence-related component PC2 in the cortical data is particularly remarkable because the dimensionality reduction algorithm was not provided with any information about the variety of trial types (easy, ambiguous, etc.) that distinguish properties of deliberation but was merely given data averaged across four very large groups of trials: left or right choices during slow or fast blocks. Nevertheless, the difference of activity related to left versus right choices led the algorithm to assign a choice-related component that also happens to capture the evolving evidence for that choice. A neural population control method [91] verified that this is not a simple consequence of tuning. Additional analyses show that a unified evidence/choice component is obtained even if the PCA algorithm is only given data restricted to activity after movement onset (Fig Cb in S1 Text). This suggests consistency between cell properties before versus after movement onset [54], once again arguing against a categorical distinction between movement selection versus execution circuits and in favor of an integrated dynamical system, at least among the cells we recorded. Admittedly, the movements performed by our monkeys were quite simple and stereotyped, lacking the variation that would allow us to distinguish within our own data the neural components related to hand movement direction, joint angular velocities, muscle torques, or other execution-related variables. However, decades of previous neurophysiological recording studies have already established that neurons in PMd and M1 strongly reflect such variables [53,119131] and are directly involved in the online guidance of reaching actions [132135]. On the basis of such work, we therefore interpret the PMd/M1 activity we observed after movement onset as being related to the control of execution and not simply reflecting the target or goal.

The emergence of the other components is also highly robust. If the PCA algorithm is only given data from the slow block, PC3 is lost (unsurprisingly), but the other components remain (Fig Cd in S1 Text). Also not surprisingly, if we provide PCA with data from all 28 of our trial classes, the components are even more clearly distinguished, even though the number of cells that possess all of the required trials is reduced by 37% (Fig Ce in S1 Text). In fact, any reasonable subset of data we have tried (e.g., from each animal separately) leads the PCA algorithm to identify the same functionally relevant components, albeit sometimes in a slightly different order depending on how much variance is captured by each. Finally, similar features are obtained if we leave out the cell duplication step (see Methods), although the resulting decision manifolds become asymmetrically distorted and some of the structure of loading matrices is more difficult to see (Fig E in S1 Text).

Finally, it is noteworthy that our findings reliably reproduce many previous observations from very different tasks, including ones in which monkeys did not have to make any decision at all and were simply instructed to reach out to a single target. This includes the general flow of the neural state from target presentation to movement onset and offset [85] (Fig 4), the orderly relationship between reaction times and the neural state in PMd/M1 [84] (Fig 6A and 6B, purple ellipses), the compact subspace in M1 at movement onset [96] (Fig 6A and 6B, green ellipses), and the presence of condition-independent components related to state transitions and elapsing time [87] (Fig 2, PC1, PC4). Although most of our data come from a prolonged period of deliberation that is not present in those other studies, the phenomena related to preparation and execution, which are shared between paradigms, are nevertheless robustly reproduced. This further strengthens the proposal that action selection and sensorimotor control are two modes of a single unified dynamical system.

Indeed, analyses of the loading matrix suggest that among the cells we recorded in PMd and M1, there is no categorical distinction between those involved in selection and those responsible for movement. Further analyses of synthetic populations (e.g., Fig F in S1 Text) demonstrate that some of the higher order PCs we observed (PC5 and PC6; see Fig 2) may result from the heterogeneity of properties across our cell population, which includes purely decision-related and purely movement-related cells. However, analysis of the loading matrix concluded that these properties are not clustered into distinct categories as in synthetic data (Fig Fd in S1 Text) but are instead distributed along a continuum (Fig 8).

While some of these findings could have been anticipated from analyses of individual cells, one important observation that would not have been possible concerns the shape of the decision subspace in the cortical populations. In particular, it is highly consistent across all of the different trial types and always resembles a thin manifold. This suggests strong normalization dynamics that are an inherent feature of recurrent attractor models. That is, the state of neural activity is pushed to lie on a surface that conserves some quantity (e.g., total neural activity with respect to some baseline) but is then free to move upon that surface under the influence of evidence, urgency, or simply noise. Furthermore, the particular shape of the manifold reveals consistent differences in the dynamics of different neural populations. In particular, regardless of what data we provide to PCA, the M1 manifold is always almost perfectly flat while the PMd manifold always exhibits a characteristic curvature. Interestingly, a very similar curvature was observed in preliminary analyses of PMd data in a very different decision task [136]. In contrast, the decision subspaces in the globus pallidus are much more compact and nothing like the thin manifolds in cortex.

Recent work suggests that the shape of the manifold on which neural states lie can reveal important features of the underlying dynamics of the system [7072,137142]. We believe the same holds for our data, particularly regarding the difference between the shapes of the decision manifolds in PMd versus M1. Notably, like many of our findings, this difference in manifold shapes is remarkably robust. It is observed whether or not we impose symmetry through cell duplication (Fig E in S1 Text, and S4 Movie), whether or not we use a square-root transform (Fig Cc in S1 Text), and whether we use data from each monkey separately (Fig D in S1 Text). It is significant at p < 0.001 according to a bootstrap test that randomly samples cells from each population (Fig 6D).

While the consistency of this difference in manifold shapes suggests that it reveals something important about the distinct neural dynamics of PMd versus M1, identifying the functional meaning of this difference is challenging. Many plausible possibilities exist, and here we take inspiration from previous work on computational models of recurrent neural circuits [23,26,28,143] to discuss interpretations that address two aspects of these shapes: curvature in the PC1-PC2 plane and curvature in the PC1-PC4 plane (Fig 6A and 6B).

We suggest that curvature in the PC1-PC2 plane reveals nonlinear interactions between competing cells in a given cortical region. As described in S1 Text, in a simple 2-neuron attractor model, the flow field of activity during deliberation is strongly constrained to lie within a narrow manifold whose shape is influenced by the function that governs mutual inhibition between competing cells (Fig G in S1 Text). If that function is flat for low inputs and then steeply rises for high inputs (approximating winner-take-all dynamics), then the manifold is flat as we observed in M1. In contrast, if the function resembles a shallow sigmoid, then the manifold is curved as we observed in PMd, and as others have reported in parietal cortex [137]. This could be related to the proposal, made in earlier modeling work [25], that PMd implements a gradual competition between groups of cells related to different movements, while M1 behaves more like a winner-take-all system.

In addition, we observed that the PMd manifold bends in the PC1-PC4 plane, shifting the direction of flow along the component associated with the transition from deciding to acting. Importantly, the timing of the shift foreshadows the moment of commitment, proceeding it by approximately 200 ms. We propose that these features could be the signature of a gradually emerging positive feedback in the cortico-striatal-thalamo-cortical circuit, which gradually overcomes inhibitory signals preventing premature selection [144]. This hypothesis would predict a correlation between how directional selectivity begins to emerge in GPi and how the PMd state begins to flow toward commitment. Indeed, as shown in Fig H in S1 Text, such a correlation is very strong in our data. In particular, the speed at which the PMd population moves toward the commitment subspace (derivative of PC1 computed from PMd) is significantly and positively correlated with the emerging directional selectivity in GPi (absolute value of PC2 computed from GPi), consistent with positive feedback between these regions [143,145,146]. Preliminary modeling work [145] suggests that such positive feedback emerges when a critical contrast is reached between opposing groups of cells in PMd. This predicts that microstimulation in cortical regions will disrupt that critical contrast and delay movement onset if delivered just before (but not after) the moment of commitment, as recently confirmed [147]. However, establishing the causal relationships in the full recurrent circuit will require many additional future studies, including simultaneous microstimulation in one region and recording in the other.

In conclusion, our analyses support the hypothesis that decisions between actions emerge as a competition within the sensorimotor system [810,25,107,109,110], which is governed by recurrent attractor dynamics [2328], as supported by optogenetic stimulation studies in mice [148,149]. According to this model [145], the competition for reach selection occurs between cortical cell groups associated with different candidate reaching actions within arm-related regions of PMd and M1 and is biased by decision-related information coming at least in part from the prefrontal cortex [9]. As time passes, that competition is invigorated by a context-dependent urgency signal from the basal ganglia [44,116,117], which gradually amplifies the competitive dynamics in PMd/M1. As the contrast develops between the activity of cells voting for the different actions, a congruent bias is gradually induced in the striatum and pallidum, leading to a positive feedback that results in a winner-take-all process [44,143,146], which constitutes the commitment to an action choice. That process then brings the cortical system to a state suitable for initiating the selected action [96,150], setting into motion the “first cog” of a dynamical machine that controls our actions in the world [2931].

Methods

Subjects and apparatus

Neural recordings were performed in two male rhesus monkeys (Macaca mulatta; monkey S: 4 to 9 years old, 5 to 9 kg; monkey Z: 4 to 6 years old, 4 to 7kg). Animals were implanted, under anesthesia and aseptic conditions, with a titanium head fixation post and recording chambers. The University of Montreal animal ethics committee approved surgery, testing procedure, and animal care.

Monkeys sat head-fixed in a custom primate chair and performed two planar reaching tasks using a vertically oriented cordless stylus whose position was recorded by a digitizing tablet (CalComp, 125 Hz). Their nonacting hand was restrained on an arm rest with Velcro bands. In most sessions, unconstrained eye movements were recorded using an infrared camera (ASL, 120 Hz). Stimuli and continuous cursor feedback were projected onto a mirror suspended between the monkey’s gaze and the tablet, creating the illusion that they are in the plane of the tablet. Neural activity was recorded from the hemisphere contralateral to the acting hand with tungsten microelectrodes (1.0 to 1.5 MΩ, Frederic Haer and Alpha-Omega Eng.) moved with a computer-controlled microdrive (NAN Instruments) and electrophysiological signals were acquired with the AlphaLab system (Alpha-Omega Eng.). Spikes were sorted offline using Plexon Offline Sorter (Plexon). Electrodes were targeted based on 3D reconstructions (Brainsight, Rogue Research) using structural MRI images (Siemens 3.0 T). Behavioral data were collected with the software used to display the task (LabView, National Instruments).

Behavioral task and classification of trial types

Monkeys were trained to perform the “tokens” task (Fig 1A) in which they are presented with one central starting circle (1.75 cm radius) and two peripheral target circles (1.75 cm radius, arranged at 180° around a 5-cm radius circle). The monkey begins each trial by placing a handle in the central circle, in which 15 small tokens are randomly arranged. The tokens then begin to jump, one-by-one every 200 ms (“pre-decision interval”), from the center to one of the two peripheral targets always oriented at 180° to each other with respect to the center. The monkey’s task is to move the handle to the target that he believes will ultimately receive the majority of tokens. The monkey is allowed to make the decision as soon as he feels sufficiently confident and has 500 ms to bring the cursor into a target after leaving the center. When the monkey reaches a target, the remaining tokens move more quickly to their final targets (“post-decision interval,” which was either 150 ms in “Slow” blocks or 50 ms in “Fast” blocks (in a few sessions, the post-decision interval was reduced to 20 ms in fast blocks). Once all tokens have jumped, visual feedback is provided to the monkey (the chosen target turns green for correct choices or red for error trials) and a drop of water or fruit juice is delivered for choosing the correct target. A 1,500-ms intertrial interval precedes the following trial. We alternated between Slow and Fast blocks after about 75 to 125 trials, typically several times in each recording session.

The monkeys were also trained to perform a delayed reach (DR) task (usually 30 to 48 trials per recording session). In this task, the monkey again begins by placing the cursor in the central circle containing the 15 tokens. Next, one of six peripheral targets is presented (1.75 cm radius, spaced at 60° intervals around a 5-cm radius circle) and after a variable delay (500 ± 100 ms), the 15 tokens simultaneously jump into that target. This “GO signal” instructs the monkey to move the handle to the target to receive a drop of juice. This task is used to determine cell’s task response and tuning as well as the animal’s mean reaction time (RT), used as an estimate of the total delays attributable to sensory processing and response initiation. This quantity was then used to estimate the time of commitment in the tokens task. That is, the “decision time” (DT) in the tokens task was quantified as movement onset minus the mean RT from the DR task.

The tokens task allows us to calculate, at each moment in time, the “success probability” (SP) associated with choosing each target. To characterize the success probability profile for each trial, we calculated this quantity (with respect to the target ultimately chosen by the monkey) for each token jump (Fig 1B). For example, with a total of 15 tokens, if at a particular moment in time the right target contains NR tokens, the left contains NL tokens, and NC tokens remain in the center, then the probability that the target on the right will ultimately be the correct one (that is, the success probability of guessing right) is:

pR|NR,NL,NC=NC!2NCk=0minNC,7NL1k!NCk! (1)

Although each token jump in every trial was completely random, we could classify a posteriori some specific classes of trials embedded in the fully random sequence (e.g., Fig 1C). In previous studies, we defined “easy,” “ambiguous,” and “misleading” trials on the basis of their success probability profile defined with respect to the first token jump. In contrast, because here we were primarily interested in examining activity with respect to commitment, we defined these trial types according to the success probability with respect to commitment time, estimated to be 280 ms before movement onset. A trial was classified as “easy” if the SP was above 0.5 five tokens before commitment, above 0.55 three tokens before, and above 0.65 at the time of commitment. A trial was classified as “ambiguous” if SP was between 0.35 and 0.65 at five and three tokens before commitment as well as at the time of commitment. A trial was classified as “misleading” if SP was below 0.5 five tokens before commitment and then above 0.5 at the time of commitment. A trial was classified as “other” if it did not meet any of these criteria. Note that these four classes are nonoverlapping. In the “Slow” block, we defined “early” trials as those where DT < 1,400 and “late” as those where DT ≥ 1,400 ms. In the “Fast” block, “early” trials were defined as those where DT < 950 and “late” as those where DT ≥ 950 ms.

In all tokens task trials, the targets were presented in opposite directions from the center, in two of six possible locations around the circle. Their placement was chosen according to the tuning of recorded cells, and in cases where a cell was not tuned, the leftmost and rightmost targets were used. Thus, in all cases, there was a target to the left of the center and one to the right (sometimes at an oblique angle). Here, we grouped all of these into two groups: We defined the three targets between 90° and 270° as “left” targets, and the other three as “right” targets, and did not examine directional tuning in any more detail.

For the analyses in the present report, we defined 28 task conditions as follows. First, we separated trials into those recorded during the “Slow” block and those recorded during the “Fast” block, and in each block, we split trials into those in which a “left” target was chosen by the monkey and those in which a “right” target was chosen. This yields 4 main groups: Slow-Left, Slow-Right, Fast-Left, and Fast-Right. Next, we split each of these four main groups into Easy, Ambiguous, Misleading, and Other, yielding another 16 conditions (Slow-Easy-Left, Slow-Easy-Right, Slow-Ambiguous-Left, etc.). Finally, we again split our four main groups into Early and Late, yielding another 8 conditions (Slow-Early-Left, Slow-Early-Right, Slow-Late-Left, etc.).

Neural recording

Detailed methods for neural recording in PMd and M1 are described in Thura and Cisek [36], and for recording in the globus pallidus in Thura and Cisek [44]. Recordings in dlPFC used the same methods as PMd/M1 and focused on the region just dorsal to the caudal end of the principal sulcus (Fig 1D). We used 2 to 4 independently moveable electrodes for cortical recordings, and one electrode at a time for recordings in the globus pallidus. Thus, most of the cells whose activity is analyzed and reported here were not recorded simultaneously.

In all sessions, we focused on cells showing any change of activity in the tokens task, and monkeys were usually performing the task while we were searching for cells. When one or more task related cells were isolated, we ran a block of 30 to 48 trials of the DR task to determine spatial tuning and select a preferred target (PT) for each cell (that is, the target associated with the highest firing rate during one or more task epochs). Next, we ran blocks of tokens task trials using the PT of an isolated cell and the 180° opposite target (OT). We sometimes simultaneously recorded several task-related cells showing different spatial preferences, and since we always selected a single pair of targets, the actual best direction for all of the recorded cells was not always among these two. We usually started recording cells in the slow block because monkeys were more conservative in this condition. It was thus easier to assess cell properties online and more convenient to search for cells because fewer rewards were spent. When possible, cells were tested with multiple repetitions of slow and fast blocks to control for potential confounds related to evolving signals, elapsing time, and the monkey’s fatigue or satiation (see Thura and Cisek [43] for control analyses on this question).

Data analysis

A neuron was included in the analysis if it was recorded in both Slow and Fast blocks, thus including trials in each of the four main conditions: Slow-Left, Slow-Right, Fast-Left, and Fast-Right. This constraint was satisfied by 637 neurons out of the total 736 recorded across all brain regions (291 in PMd, 195 in M1, 53 in dlPFC, 45 in GPe, and 53 in GPi). For each neuron and each trial, neural activity was aligned to movement onset and the firing frequency was computed using partial spike intervals in ninety 20-ms bins from 1,400 ms before to 400 ms after movement onset. The firing rate was then square root transformed and smoothed using a 25-ms gaussian kernel. Finally, we imposed symmetry on our neural population by duplicating each of our neurons with an identical “anti-neuron” that has the same activity in all trials, but with the trial labels switched between left and right choices. We describe our rationale for including this step below in the section “Rationale for duplicating neurons.”

To perform PCA of neural activity, we followed the procedure outlined in Fig A in S1 Text (steps 1 to 4). First, for each cell, we grouped trials into our four main classes (Slow-Left, Slow-Right, Fast-Left, and Fast-Right). In each group, neural activity was averaged together regardless of the type of trial (easy, ambiguous, misleading, or other) and regardless of whether the choice was correct or not, or the reaction time was early or late. For each of these four classes, we constructed a 1,274 × 60 matrix where rows are individual neurons and columns are time bins (from 1,000 ms before to 200 ms after movements onset). We then concatenated these four matrices into a 1,274 × 240 matrix on which we performed standard PCA (using the pca function in Matlab 2019b), yielding a matrix of weights (“loading coefficients”) between cells and PCs as well as the variance explained by each PC. For further analysis, we only kept the top 20 PCs, which explained 97.9% of the total variance, though most of the interpretation will focus on the first four (total 80.3% of variance).

In addition to standard PCA, we also tried other dimensionality reduction methods, including factor analysis and GPFA [67,69], but these yielded almost identical results. This is not surprising. For example, the major advantage of GPFA is that it jointly provides filtering with dimensionality reduction, which is critical when analyzing data from individual trials in which many neurons were recorded simultaneously. Our neurons were not recorded simultaneously, so we had to combine trials into similarity classes under the assumption that specific trial types were associated with similar neural activities on different occasions. This meant that we averaged activity across multiple similar trials, yielding much of the noise reduction that GPFA would have otherwise provided. Given the similarity of our results with different dimensionality reduction techniques, we chose to use PCA because it is the simplest, most widely known and well understood, and ultimately easiest to interpret.

Using the 1,274 × 20 loading matrix, we computed the average neural activity profile along each PC for a given trial condition and a given population of neurons by multiplying each neuron’s activity in that condition by its loading coefficient for that PC, summing all of these together, and dividing by the number of neurons in that population. We calculated confidence intervals around this average neural activity by randomly resampling trials, with replacement within a given trial condition, 1,000 times for each neuron and time bin. We performed this process for all neurons together as well as for each of our neural populations separately (PMd, M1, dlPFC, GPe, and GPi).

Because not every neuron possessed a trial in each condition, potentially making it inappropriate to compare “PC space” trajectories from different conditions, we defined the following three groups of neurons designed to facilitate the analyses of primary interest. Group 1 consisted of neurons that possessed all conditions in the Slow block (Easy, Ambiguous, Misleading, Other, Early, and Late), for both Left and Right choices, as well as Early and Late conditions in the Fast block. This group was the focus of most of our analyses (e.g., Figs 4A, 6 and 7) and included a total of 557 cells (258 in PMd, 173 in M1, 50 in dlPFC, 35 in GPe, and 41 in GPi). Group 2 consisted of neurons that possessed Easy, Ambiguous, Misleading, and Other conditions in the Fast block. This group included 452 cells (226 in PMd, 126 in M1, 46 in dlPFC, and 27 each in GPe and GPi). Finally, Group 3 consisted of neurons that possessed trials in all conditions, including 402 cells (207 in PMd, 108 in M1, 44 in dlPFC, 21 in GPe, and 22 in GPi). We used Group 1 neurons for analyses restricted to data in the Slow block (e.g., Fig 4A), Group 2 neurons for analyses restricted to the Fast block (Fig 4B), and Group 3 for analyses that directly compared the two blocks (e.g., Fig 4C and 4D). Although these definitions were used to make quantitative comparisons most accurate, they did not have a strong impact on the qualitative aspects of our data, which were similar even if we restricted all analyses to Group 3, which was the most restricted.

Once we reconstructed the temporal profile along each PC for given conditions, we plotted each of them across time (e.g., Fig 2) and with respect to one another (e.g., Fig 4). To analyze the subspace visited by a neural population during the deliberation process, we examined PCs 1, 2, and 4 (see Fig 2) and constructed a 3D concave hull enclosing all of the neural states until the moment of commitment, for all conditions within a given block (using the alphaShape function in Matlab 2019b). For some of our brain regions, this subspace resembles a thin 2D sheet, which we call the “decision manifold.” To quantify its thickness, we compared its surface area / volume ratio with that of a perfect sphere with equal volume, using the equation Ψ=π1/36V2/3A, where A is the surface area and V is the volume [151]. The quantity Ψ is called “sphericity” and is a dimensionless scalar that ranges from 0 for a 2D surface to 1 for a perfect sphere (the geometric shape with the smallest surface area / volume ratio). For the reader’s intuition, we note that an equal-sided cube of any size has Ψ = 0.806, while a square sheet whose thickness is 1/50th of the length of each side has Ψ = 0.171.

To characterize the loading matrix, we considered each cell as a point in a high-dimensional space defined by its loading coefficients for the top 11 PCs (which accounted for 95.2% of the total variance). We then fit the resulting distribution of points for cells in a given brain region with GMMs using the Expectation–Maximization algorithm (fitgmdist function in Matlab 2019b). For each fit, we tried GMMs consisting of anywhere from one to six 11-dimensional gaussians, performing 100 randomly initialized fits for each, and then used the Bayesian information criterion (BIC) to select the best fitting model. We found that with 100 randomly initialized fits, we reliably found the same best solution each time.

Rationale for duplicating neurons

The rationale for duplicating the neurons stems from the fact that our population is inevitably a highly sparse undersampling of the millions of neurons in the regions where we recorded. Importantly, we can assume that this undersampling is not symmetric with respect to the proportions of neurons that contribute to movements to the left versus right. For example, we might have in our sample many high-firing cells preferring leftward movements, but fewer high-firing cells preferring rightward movements. Applying PCA to such asymmetrically undersampled data will force the algorithm to try to capture this variance by aligning PCs to that asymmetry. We believe this is not informative. We can assume that our sampling is unlikely to be symmetric with respect to leftward or rightward movements, so it will not reflect any real asymmetry that may or may not exist in the brain. It will only distort our data in ways that will make interpretations more difficult.

For this reason, just before performing PCA, we imposed symmetry on our data by using the “anti-neuron” approach classically used to produce population histograms of neural activity. In short, we assumed that for every given neuron that we recorded, there exists in the brain another neuron that we did not record, which has the same properties (same firing rate profile, same sensitivity to relevant variables, etc.) but has the opposite relationship with respect to the direction of movement. To implement this assumption, for each of our neurons, we created a “sister” neuron with the same firing data except with the trial labels switched between left and right choices. We did this for all neurons regardless of whether they are tuned or not. All of the analyses shown in the main paper apply this anti-neuron duplication, but in S1 Text, we show analogous results obtained without that step. As can be seen (Fig E in S1 Text), the only difference is that the PCs produced without anti-neuron duplication are slightly rotated and skewed versions of the ones produced after anti-neuron duplication, and their properties are slightly mixed between adjacent PCs. However, the general conclusions remain unchanged.

Control analyses

To determine whether the PCs we found in the population data are a trivial consequence of properties of single neurons (e.g., directional tuning), we applied a control based on the TME method of Elsayed and Cunningham [91], using code they provide at htpps://github.com/gamaleldin/TME. Briefly, this method constructs surrogate data sets that preserve the primary first- and second-order covariance in the data along the temporal, neural, and condition-dependent dimensions but are otherwise maximally random. For example, tuning is preserved but not correlated across time. Any metric applied to analyzing the real data can then be compared to that same metric applied to surrogate data sets. If the metric computed from the real data lies within the distribution of that metric computed from surrogate data, then whatever is measured by the metric is a simple consequence of first- or second-order covariance.

Here, we were interested to know whether our finding of PCs that covary with evidence is a simple consequence of cell tuning. To that end, we computed the correlation between the evidence provided by token jumps (e.g., Fig 1C) and the temporal profiles of the components generated by PCA when applied to surrogate data. Our metric was equal to the maximum absolute value of the correlation coefficient for any of the top 10 components. We calculated this metric for the components produced using the real data (in which the best correlation was with PC2) and compared it to the distribution of the metric for 100 surrogate data sets generated using the TME method. That is, for each of the 100 surrogate data sets, we performed PCA on the four main trial classes and then used these to calculate the temporal profile of the top 10 PCs for all trial types, correlated each of these with evidence, and used the highest correlation value. We consider the emergence of an evidence-related PC as a nontrivial consequence of tuning if the correlation metric of fewer than 5 of these 100 surrogate data sets is equal to or higher than the metric applied to the real data (that is, p < 0.05).

To determine whether the shapes of PMd and M1 decision manifolds were significantly different from each other, we performed a comparison using spherical fitting. As shown for the original data in Fig 6C, a curved decision manifold (like in PMd) is best fit by a sphere with a relatively small radius, while a flat manifold (like in M1) is best fit by a sphere with a large radius. To yield a comparable scale for cell groups with different firing rates, we normalized the radius by the maximum range of the decision manifold. Next, to test the robustness of this difference in shapes, we performed a bootstrap analysis with 1,000 iterations. For each iteration, we randomly resampled the population of cells in each region, with replacement, and performed PCA using movement-aligned data from the slow block (that is, only Slow-Left and Slow-Right trials). Because we only used the slow block, no SAT-dependent PC (like PC3 in Fig 2) would be found, so the first three would consistently resemble our PCs 1, 2, and 4. Thus, we calculated the weighted average for the first three PCs, computed the decision manifold, and performed planar and spherical fits to the result. The results were then quantified as a distribution of normalized radius and coefficient of determination (R2) values from the best fitting sphere. A high R2 with a low radius was taken to indicate that the manifold was curved. To assess significance, 99% confidence ellipses were drawn around each distribution.

Supporting information

S1 Text. Supporting information.

Fig A. Schematic diagram of the analysis approach. Step 1: Neural activity is recorded from individual cells while the monkey performs individual trials of different types (e.g., “easy trial to the right,” “misleading trial to the left,” etc.), and the temporal profile is aligned on movement onset time. Step 2: For each cell, activity is averaged from all trials during the slow block in which the target on the right was chosen, to produce an average “Slow-R all” temporal profile. This is repeated for Slow-L, Fast-R, and Fast-L trial groups. Step 3: The four trial groups are concatenated into a single row for each cell, and the rows of all cells grouped into a large matrix. Step 4: Principal components analysis is performed on that matrix to produce the “loading matrix” L. Step 5: To generate the trajectory for a specific trial type and a specific population, we first average the neural activity of a given cell over all trials of the given type (e.g., misleading trials in the slow block in which the right target was chosen). Step 6: To compute the profile for that trial type and for a given principal component, the average activity of each cell is weighted by the corresponding row (cell) and column (PC) of the loading matrix L and summed. Step 7: The profile of each PC is divided by the number of cells used. Step 8: The resulting PC profiles are plotted against each other to produce neural trajectories. Here, the trajectory for Slow-Misleading-R trials is highlighted and superimposed over the trajectories of other slow block trials, as in Fig 4a. Note that steps 5–8 can be computed using arbitrary subsets of data (different trial groupings, different subsets of cells), and arbitrary alignment (on movement onset or on the first token jump), because all they depend upon is the loading matrix computed by steps 1–4. Fig B. The top four principal components produced when the loading matrix was computed from each brain region separately, same format as Fig 2. On the right are shown the neural trajectories computed in the space of the components that reflect commitment, evidence, and elapsing time. Note that in some cases, the axis is inverted to facilitate comparison of these neural spaces to those shown in Figs 6 and 7. Data and code available at https://doi.org/10.6084/m9.figshare.20805586. Fig C. The top four principal components produced when the loading matrix was computed using only 35 cells per region (A), only data from movement onset to 400 ms later (B), data without square root transformation (C), only data from the slow block (D), or all data from all 28 trial conditions separately (E). Same format as Fig B. Data and code available at https://doi.org/10.6084/m9.figshare.20805586. Fig D. Results computed separately for each of the two animals. Panels A and B show the principal components (same format as Fig 2). Panels C and E show the top four components calculated using each brain region (similar to Fig 5). Panels D and F show the decision manifolds for PMd and M1 (similar to Fig 6). Data and code available at https://doi.org/10.6084/m9.figshare.20805586. Fig E. Results computed without the cell duplication step. (A) The top four principal components (similar to Fig 2). (B) The neural trajectories and decision manifold for PMd (similar to Fig 6a). (C) Neural trajectories and decision manifold for M1 (similar to Fig 2B). (D-F) Neural trajectories for dlPFC, GPe, and GPi (similar to Fig 7). (G) Loading distributions for PMd and M1 with respect to PC1 and PC2 (similar to Fig 8A and 8C, left). (H) Loading distributions for dlPFC, GPe, and GPi with respect to PC1, PC2, and PC4 (similar to Fig 8F). Data and code available at https://doi.org/10.6084/m9.figshare.20805586. Fig F. Analyses of synthetic populations. (A) The top 3 synthetic PCs (SPCs) obtained from a population of 600 neurons constructed using combinations of the original PCs 1, 2, and 4 (see Fig 2). (B) The loading coefficients of all 600 neurons, fitted with GMMs (gray ellipsoids). Colors indicate the three groups of neurons built with different combinations of those original PCs (see legend). (C) The top 3 SPCs obtained from a population of 600 neurons constructed only using the original PC1 and PC2 but separated into three distinct categories: cells that are only active during deliberation, cells only active during movement, and cells active during both epochs. (D) The loading coefficients of all 600 neurons, fitted with GMMs (gray ellipsoids). Colors indicate the three categories of neurons. Data and code available at https://doi.org/10.6084/m9.figshare.20805586. Fig G. Simulation of a simple 2-neuron recurrent competitive attractor. (A) The structure of the model is shown at left, next to the governing equations and parameter settings. Two forms of the interaction function f(x) are shown in the inset at right. (B) The flow field for a system with high gain (G = 1) and steep slope (S = 15) when the input evidence is balanced such that E1 = E2 = 0.2. Blue arrows depict the flow and shading indicates the speed (yellow = fastest, dark blue = slowest). The purple outline indicates a region around the stable equilibrium, in which the state will tend to remain even with substantial noise. Note that the dynamics flow quickly toward that region and then move more slowly within it. (C) The stable regions of the same system for several input patterns, ranging from strong evidence in favor of choice 2 (top left, blue) to strong evidence in favor of choice 1 (bottom right, red). Note that these align in what is approximately a straight line across the state space (compare to dashed black line). (D) Same as B but for a system in which the interaction function has low gain (G = 0.1) and a shallow slope (S = 5). (E) Stable regions of the system in D. Note that now, the stable regions for the same range of input patterns do not form a straight line but rather fall on a curve roughly equivalent to a circle (dashed black curve). (F) Same as C and E but for models in which the input itself is constrained to lie on a curve. Fig H. Comparison of selectivity in GPi with the slope of the PMd decision manifold. (A) Comparison within six conditions: all Slow block trials, Slow block trials where DT < 1,400, Slow block trials where DT > 1,400, all Fast block trials, Fast block trials where DT < 950, Fast block trials where DT > 950. Each panel plots the absolute value of PC2 computed from GPi (red, left y-axis, shaded region indicates 95% CI) along with the derivative of PC1 computed from PMd (blue, right y-axis). The black vertical line indicates the estimated time of commitment. At the top left corner are Pearson’s R and the p-value of the correlation between these two signals from 1,000 ms to 280 ms before movement. (B) The correlation of these signals, from 1,000 to 280 ms before movement onset, across all 6 conditions. Data and code available at https://doi.org/10.6084/m9.figshare.20805586.

(PDF)

S1 Movie. Neural trajectories in the space of PCs 1, 2, and 4, constructed separately for easy, ambiguous, and misleading trials as well as the other (unclassified) trials, using all cells recorded in the Slow block (Group 1 cells).

(MP4)

S2 Movie. Neural trajectories in the space of PCs 1, 2, and 4, plotted using PMd activity during Slow blocks (Group 1 cells).

(MP4)

S3 Movie. Neural trajectories in the space of PCs 1, 2, and 4, plotted using M1 activity during Slow blocks (Group 1 cells).

(MP4)

S4 Movie. Neural trajectories in the space of PCs 1, 2, and 4, constructed separately for easy, ambiguous, and misleading trials as well as the other (unclassified) trials, using all cells recorded in the Slow block (Group 1 cells), without the cell duplication step.

(MP4)

Acknowledgments

The authors wish to thank Terrence Sanger for valuable observations about our data, and John Kalaska and Andrea Green for additional observations and helpful comments on the manuscript.

Abbreviations

BIC

Bayesian information criterion

dlPFC

dorsolateral prefrontal cortex

DR

delayed reach

DT

decision time

GPFA

gaussian process factor analysis

GMM

gaussian mixture model

GPe

globus pallidus externus

GPi

globus pallidus internus

M1

primary motor cortex

OT

opposite target

PCA

principal components analysis

PMd

dorsal premotor cortex

PT

preferred target

RT

reaction time

SAT

speed–accuracy trade-off

SP

success probability

TME

Tensor Maximum Entropy

UGM

urgency-gating model

Data Availability

The data and MATLAB code necessary to reproduce the analyses shown in this paper are available at https://doi.org/10.6084/m9.figshare.20805586.

Funding Statement

This work was supported by Canadian Institutes of Health Research (https://cihr-irsc.gc.ca/e/193.html) grants MOP-102662 and PJT-166014, the Canadian Foundation for Innovation (https://www.innovation.ca/), Fonds de Recherche en Santé du Québec (https://frq.gouv.qc.ca/en/), the EJLB Foundation (www.ejlb.qc.ca) to PC, and fellowships from the FYSSEN Foundation (http://www.fondationfyssen.fr/en/) and the Groupe de Recherche sur le Système Nerveux Central (https://www.grsnc.org/home) to DT. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

  • 1.Cisek P, Kalaska JF. Neural mechanisms for interacting with a world full of action choices. AnnuRev Neurosci. 2010;33:269–298. doi: 10.1146/annurev.neuro.051508.135409 [DOI] [PubMed] [Google Scholar]
  • 2.Gomez-Marin A, Ghazanfar AA. The life of behavior. Neuron. 2019;104(1):25–36. doi: 10.1016/j.neuron.2019.09.017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 3.Michalski J, Green AM, Cisek P. Reaching decisions during ongoing movements. J Neurophysiol. 2020;123(3):1090–1102. doi: 10.1152/jn.00613.2019 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Grießbach E, Incagli F, Herbort O, Cañal-Bruland R. Body dynamics of gait affect value-based decisions. Sci Rep. 2021;11(1):1–14. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 5.Cisek P. Cortical mechanisms of action selection: the affordance competition hypothesis. Phil Trans R Soc B. 2007;362(1485):1585–1599. doi: 10.1098/rstb.2007.2054 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Platt ML, Glimcher PW. Neural correlates of decision variables in parietal cortex. Nature. 1999;400(6741):233–238. doi: 10.1038/22268 [DOI] [PubMed] [Google Scholar]
  • 7.Roitman JD, Shadlen MN. Response of neurons in the lateral intraparietal area during a combined visual discrimination reaction time task. J Neurosci. 2002;22(21):9475–9489. doi: 10.1523/JNEUROSCI.22-21-09475.2002 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 8.Cisek P, Kalaska JF. Neural correlates of reaching decisions in dorsal premotor cortex: Specification of multiple direction choices and final selection of action. Neuron. 2005;45(5):801–814. doi: 10.1016/j.neuron.2005.01.027 [DOI] [PubMed] [Google Scholar]
  • 9.Kim J-N, Shadlen MN. Neural correlates of a decision in the dorsolateral prefrontal cortex of the macaque. Nat Neurosci. 1999;2(2):176–185. doi: 10.1038/5739 [DOI] [PubMed] [Google Scholar]
  • 10.Hernandez A, Nacher V, Luna R, Zainos A, Lemus L, Alvarez M, et al. Decoding a perceptual decision process across cortex. Neuron. 2010;66(2):300–314. doi: 10.1016/j.neuron.2010.03.031 [DOI] [PubMed] [Google Scholar]
  • 11.Ratcliff R. A theory of memory retrieval. Psychol Rev. 1978;83:59–108. [Google Scholar]
  • 12.Ratcliff R, Smith PL, Brown SD, McKoon G. Diffusion decision model: Current issues and history. Trends Cogn Sci. 2016;20(4):260–281. doi: 10.1016/j.tics.2016.01.007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Ditterich J. Evidence for time-variant decision making. EurJNeurosci. 2006;24(12):3628–3641. doi: 10.1111/j.1460-9568.2006.05221.x [DOI] [PubMed] [Google Scholar]
  • 14.Cisek P, Puskas GA, El-Murr S. Decisions in changing conditions: The urgency-gating model. J Neurosci. 2009;29(37):11560–11571. doi: 10.1523/JNEUROSCI.1844-09.2009 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 15.Gluth S, Rieskamp J, Buchel C. Deciding when to decide: time-variant sequential sampling models explain the emergence of value-based decisions in the human brain. J Neurosci. 2012;32(31):10686–10698. doi: 10.1523/JNEUROSCI.0727-12.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Thura D, Beauregard-Racine J, Fradet CW, Cisek P. Decision making by urgency gating: theory and experimental support. J Neurophysiol. 2012;108(11):2912–2930. doi: 10.1152/jn.01071.2011 [DOI] [PubMed] [Google Scholar]
  • 17.Murphy PR, Boonstra E, Nieuwenhuis S. Global gain modulation generates time-dependent urgency during perceptual choice in humans. Nat Commun. 2016;7:13526. doi: 10.1038/ncomms13526 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Farashahi S, Ting C-C, Kao C-H, Wu S-W, Soltani A. Dynamic combination of sensory and reward information under time pressure. PLoS Comput Biol. 2018;14(3):e1006070. doi: 10.1371/journal.pcbi.1006070 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Palestro JJ, Weichart E, Sederberg PB, Turner BM. Some task demands induce collapsing bounds: Evidence from a behavioral analysis. Psychon Bull Rev. 2018;25(4):1225–1248. doi: 10.3758/s13423-018-1479-9 [DOI] [PubMed] [Google Scholar]
  • 20.Kawato M. Internal models for motor control and trajectory planning. Curr Opin Neurobiol. 1999;9(6):718–727. doi: 10.1016/s0959-4388(99)00028-8 [DOI] [PubMed] [Google Scholar]
  • 21.Shadmehr R, Krakauer JW. A computational neuroanatomy for motor control. Exp Brain Res. 2008;185(3):359–381. doi: 10.1007/s00221-008-1280-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Franklin DW, Wolpert DM. Computational mechanisms of sensorimotor control. Neuron. 2011;72(3):425–442. doi: 10.1016/j.neuron.2011.10.006 [DOI] [PubMed] [Google Scholar]
  • 23.Grossberg S. Contour enhancement, short term memory, and constancies in reverberating neural networks. Stud Appl Math. 1973;52:213–257. [Google Scholar]
  • 24.Amari S-i. Dynamics of pattern formation in lateral-inhibition type neural fields. Biol Cybern. 1977;27(2):77–87. doi: 10.1007/BF00337259 [DOI] [PubMed] [Google Scholar]
  • 25.Cisek P. Integrated neural processes for defining potential actions and deciding between them: A computational model. J Neurosci. 2006;26(38):9761–9770. doi: 10.1523/JNEUROSCI.5605-05.2006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 26.Wang XJ. Probabilistic decision making by slow reverberation in cortical circuits. Neuron 2002;36(5):955–968. doi: 10.1016/s0896-6273(02)01092-9 [DOI] [PubMed] [Google Scholar]
  • 27.Wu S, Hamaguchi K, Amari S-i. Dynamics and computation of continuous attractors. Neural Comput. 2008;20(4):994–1025. doi: 10.1162/neco.2008.10-06-378 [DOI] [PubMed] [Google Scholar]
  • 28.Standage D, You H, Wang DH, Dorris MC. Gain modulation by an urgency signal controls the speed-accuracy trade-off in a network model of a cortical decision circuit. Front Comput Neurosci. 2011;5:7. doi: 10.3389/fncom.2011.00007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 29.Bullock D, Grossberg S. Neural dynamics of planned arm movements: Emergent invariants and speed-accuracy properties during trajectory formation. Psychol Rev. 1988;95(1):49–90. doi: 10.1037/0033-295x.95.1.49 [DOI] [PubMed] [Google Scholar]
  • 30.Bullock D, Cisek P, Grossberg S. Cortical networks for control of voluntary arm movements under variable force conditions. Cereb Cortex. 1998;8:48–62. doi: 10.1093/cercor/8.1.48 [DOI] [PubMed] [Google Scholar]
  • 31.Churchland MM, Cunningham JP, Kaufman MT, Ryu SI, Shenoy KV. Cortical preparatory activity: representation of movement or first cog in a dynamical machine? Neuron. 2010;68(3):387–400. doi: 10.1016/j.neuron.2010.09.015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Wallis JD, Miller EK. From rule to response: neuronal processes in the premotor and prefrontal cortex. J Neurophysiol. 2003;90(3):1790–1806. doi: 10.1152/jn.00086.2003 [DOI] [PubMed] [Google Scholar]
  • 33.Michelet T, Duncan GH, Cisek P. Response competition in the primary motor cortex: Corticospinal excitability reflects response replacement during simple decisions. J Neurophysiol. 2010;104:119–127. doi: 10.1152/jn.00819.2009 [DOI] [PubMed] [Google Scholar]
  • 34.Pastor-Bernier A, Cisek P. Neural correlates of biased competition in premotor cortex. J Neurosci. 2011;31(19):7083–7088. doi: 10.1523/JNEUROSCI.5681-10.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 35.Klein-Flügge MC, Bestmann S. Time-dependent changes in human corticospinal excitability reveal value-based competition for action during decision processing. J Neurosci. 2012;32(24):8373–8382. doi: 10.1523/JNEUROSCI.0270-12.2012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Thura D, Cisek P. Deliberation and commitment in the premotor and primary motor cortex during dynamic decision making. Neuron. 2014;81(6):1401–1416. doi: 10.1016/j.neuron.2014.01.031 [DOI] [PubMed] [Google Scholar]
  • 37.Cai X, Padoa-Schioppa C. Contributions of Orbitofrontal and Lateral Prefrontal Cortices to Economic Choice and the Good-to-Action Transformation. Neuron. 2014;81(5):1140–1151. doi: 10.1016/j.neuron.2014.01.008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Mink JW. The basal ganglia: focused selection and inhibition of competing motor programs. Prog Neurobiol. 1996;50(4):381–425. doi: 10.1016/s0301-0082(96)00042-1 [DOI] [PubMed] [Google Scholar]
  • 39.Redgrave P, Prescott TJ, Gurney K. The basal ganglia: a vertebrate solution to the selection problem? Neuroscience. 1999;89(4):1009–1023. doi: 10.1016/s0306-4522(98)00319-4 [DOI] [PubMed] [Google Scholar]
  • 40.Turner RS, Desmurget M. Basal ganglia contributions to motor control: a vigorous tutor. Curr Opin Neurobiol. 2010;20(6):704–16. doi: 10.1016/j.conb.2010.08.022 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Dudman JT, Krakauer JW. The basal ganglia: from motor commands to the control of vigor. Curr Opin Neurobiol. 2016;37:158–166. doi: 10.1016/j.conb.2016.02.005 [DOI] [PubMed] [Google Scholar]
  • 42.Thura D, Cos I, Trung J, Cisek P. Context-dependent urgency influences speed-accuracy trade-offs in decision-making and movement execution. J Neurosci. 2014;34(49):16442–16454. doi: 10.1523/JNEUROSCI.0162-14.2014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Thura D, Cisek P. Modulation of Premotor and Primary Motor Cortical Activity during Volitional Adjustments of Speed-Accuracy Trade-Offs. J Neurosci. 2016;36(3):938–956. doi: 10.1523/JNEUROSCI.2230-15.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Thura D, Cisek P. The Basal Ganglia Do Not Select Reach Targets but Control the Urgency of Commitment. Neuron. 2017;95(5):1160–70 e5. doi: 10.1016/j.neuron.2017.07.039 [DOI] [PubMed] [Google Scholar]
  • 45.Ding L, Gold JI. Separate, causal roles of the caudate in saccadic choice and execution in a perceptual decision task. Neuron. 2012;75(5):865–874. doi: 10.1016/j.neuron.2012.07.021 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Tai LH, Lee AM, Benavidez N, Bonci A, Wilbrecht L. Transient stimulation of distinct subpopulations of striatal neurons mimics changes in action value. Nat Neurosci. 2012;15(9):1281–1289. doi: 10.1038/nn.3188 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Santacruz SR, Rich EL, Wallis JD, Carmena JM. Caudate microstimulation increases value of specific choices. Curr Biol. 2017;27(21):3375–3383.e3. doi: 10.1016/j.cub.2017.09.051 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Amemori K-i, Amemori S, Gibson DJ, Graybiel AM. Striatal microstimulation induces persistent and repetitive negative decision-making predicted by striatal beta-band oscillation. Neuron. 2018;99(4):829–841.e6. doi: 10.1016/j.neuron.2018.07.022 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 49.Wang L, Rangarajan KV, Gerfen CR, Krauzlis RJ. Activation of striatal neurons causes a perceptual decision bias during visual change detection in mice. Neuron. 2018;97(6):1369–1381.e5. doi: 10.1016/j.neuron.2018.01.049 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 50.Arimura N, Nakayama Y, Yamagata T, Tanji J, Hoshi E. Involvement of the globus pallidus in behavioral goal determination and action specification. J Neurosci. 2013;33(34):13639–13653. doi: 10.1523/JNEUROSCI.1620-13.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 51.Hoshi E. Cortico-basal ganglia networks subserving goal-directed behavior mediated by conditional visuo-goal association. Front Neural Circuits. 2013;7:158. doi: 10.3389/fncir.2013.00158 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 52.Desmurget M, Turner RS. Testing basal ganglia motor functions through reversible inactivations in the posterior internal globus pallidus. J Neurophysiol. 2008;99(3):1057–1076. doi: 10.1152/jn.01010.2007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 53.Kalaska JF, Cohen DAD, Hyde ML, Prud’homme MJ. A comparison of movement direction-related versus load direction-related activity in primate motor cortex, using a two-dimensional reaching task. J Neurosci. 1989;9(6):2080–2102. doi: 10.1523/JNEUROSCI.09-06-02080.1989 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 54.Crammond DJ, Kalaska JF. Prior information in motor and premotor cortex: activity during the delay period and effect on pre-movement activity. J Neurophysiol. 2000;84(2):986–1005. doi: 10.1152/jn.2000.84.2.986 [DOI] [PubMed] [Google Scholar]
  • 55.Churchland MM, Shenoy KV. Temporal complexity and heterogeneity of single-neuron activity in premotor and motor cortex. J Neurophysiol. 2007;97(6):4235–4257. doi: 10.1152/jn.00095.2007 [DOI] [PubMed] [Google Scholar]
  • 56.Hoshi E, Tanji J. Distinctions between dorsal and ventral premotor areas: anatomical connectivity and functional properties. Curr Opin Neurobiol. 2007;17(2):234–242. doi: 10.1016/j.conb.2007.02.003 [DOI] [PubMed] [Google Scholar]
  • 57.Johnson PB, Ferraina S, Bianchi L, Caminiti R. Cortical networks for visual reaching: Physiological and anatomical organization of frontal and parietal arm regions. Cereb Cortex. 1996;6(2):102–119. [DOI] [PubMed] [Google Scholar]
  • 58.Chandrasekaran C, Peixoto D, Newsome WT, Shenoy KV. Laminar differences in decision-related neural activity in dorsal premotor cortex. Nat Commun. 2017;8(1):1–16. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 59.Smith AC, Brown EN. Estimating a state-space model from point process observations. Neural Comput. 2003;15(5):965–991. doi: 10.1162/089976603765202622 [DOI] [PubMed] [Google Scholar]
  • 60.Stopfer M, Jayaraman V, Laurent G. Intensity versus identity coding in an olfactory system. Neuron. 2003;39(6):991–1004. doi: 10.1016/j.neuron.2003.08.011 [DOI] [PubMed] [Google Scholar]
  • 61.Brown SL, Joseph J, Stopfer M. Encoding a temporally structured stimulus with a temporally structured neural representation. Nat Neurosci. 2005;8(11):1568–1576. doi: 10.1038/nn1559 [DOI] [PubMed] [Google Scholar]
  • 62.Levi R, Varona P, Arshavsky YI, Rabinovich MI, Selverston AI. The role of sensory network dynamics in generating a motor program. J Neurosci. 2005;25(42):9807–9815. doi: 10.1523/JNEUROSCI.2249-05.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 63.Briggman KL, Abarbanel HD, Kristan WB Jr. From crawling to cognition: analyzing the dynamical interactions among populations of neurons. Curr Opin Neurobiol. 2006;16(2):135–144. doi: 10.1016/j.conb.2006.03.014 [DOI] [PubMed] [Google Scholar]
  • 64.Broome BM, Jayaraman V, Laurent G. Encoding and decoding of overlapping odor sequences. Neuron. 2006;51(4):467–482. doi: 10.1016/j.neuron.2006.07.018 [DOI] [PubMed] [Google Scholar]
  • 65.Churchland MM, Yu BM, Sahani M, Shenoy KV. Techniques for extracting single-trial activity patterns from large-scale neural recordings. Curr Opin Neurobiol. 2007;17(5):609–618. doi: 10.1016/j.conb.2007.11.001 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 66.Sasaki T, Matsuki N, Ikegaya Y. Metastability of active CA3 networks. J Neurosci. 2007;27(3):517–528. doi: 10.1523/JNEUROSCI.4514-06.2007 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 67.Yu BM, Cunningham JP, Santhanam G, Ryu SI, Shenoy KV, Sahani M. Gaussian-process factor analysis for low-dimensional single-trial analysis of neural population activity. J Neurophysiol. 2009;102(1):614–635. doi: 10.1152/jn.90941.2008 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 68.Shenoy KV, Sahani M, Churchland MM. Cortical control of arm movements: a dynamical systems perspective. Annu Rev Neurosci. 2013;36:337–359. doi: 10.1146/annurev-neuro-062111-150509 [DOI] [PubMed] [Google Scholar]
  • 69.Cunningham JP, Yu BM. Dimensionality reduction for large-scale neural recordings. Nat Neurosci. 2014;17(11):1500–1509. doi: 10.1038/nn.3776 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 70.Vyas S, Golub MD, Sussillo D, Shenoy KV. Computation Through Neural Population Dynamics. Annu Rev Neurosci. 2020;43:249–275. doi: 10.1146/annurev-neuro-092619-094115 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 71.Saxena S, Cunningham JP. Towards the neural population doctrine. Curr Opin Neurobiol. 2019;55:103–111. doi: 10.1016/j.conb.2019.02.002 [DOI] [PubMed] [Google Scholar]
  • 72.Ebitz RB, Hayden BY. The population doctrine in cognitive neuroscience. Neuron. 2021;109(19):3055–3068. doi: 10.1016/j.neuron.2021.07.011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 73.Barack DL, Krakauer JW. Two views on the cognitive brain. Nat Rev Neurosci. 2021;22(6):359–371. doi: 10.1038/s41583-021-00448-6 [DOI] [PubMed] [Google Scholar]
  • 74.Urai AE, Doiron B, Leifer AM, Churchland AK. Large-scale neural recordings call for new insights to link brain and behavior. Nat Neurosci. 2022;25(1):11–19. doi: 10.1038/s41593-021-00980-9 [DOI] [PubMed] [Google Scholar]
  • 75.Hirokawa J, Vaughan A, Masset P, Ott T, Kepecs A. Frontal cortex neuron types categorically encode single decision variables. Nature. 2019;576(7787):446–451. doi: 10.1038/s41586-019-1816-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 76.Raposo D, Kaufman MT, Churchland AK. A category-free neural population supports evolving demands during decision-making. Nat Neurosci. 2014;17(12):1784–1792. doi: 10.1038/nn.3865 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 77.Blanchard TC, Piantadosi ST, Hayden BY. Robust mixture modeling reveals category-free selectivity in reward region neuronal ensembles. J Neurophysiol. 2018;119(4):1305–1318. doi: 10.1152/jn.00808.2017 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 78.Aoi MC, Mante V, Pillow JW. Prefrontal cortex exhibits multidimensional dynamic encoding during decision-making. Nat Neurosci. 2020;23(11):1410–1420. doi: 10.1038/s41593-020-0696-5 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 79.Thura D, Cisek P. Monkey frontal cortex reflects the time course of changing evidence for reach decisions. Soc Neurosci Abstr. 2010;Online. [Google Scholar]
  • 80.Thura D, Cisek P. A cortico-basal ganglia network for dynamic decision-making. Soc Neurosci Abstr. 2018;Online. [Google Scholar]
  • 81.Cisek P, Cabana J-F, Thura D, Feghaly A. The dynamics of neural population activity during decision-making. Soc Neurosci Abstr. 2015;Online. [Google Scholar]
  • 82.Cisek P, Thura D, Cabana J-F, Feghaly A. Neural dynamics of cortical and basal ganglia circuits during dynamic decision-making. Soc Neurosci Abstr. 2016;Online. [Google Scholar]
  • 83.Thura D, Guberman G, Cisek P. Trial-to-trial adjustments of speed-accuracy trade-offs in premotor and primary motor cortex. J Neurophysiol. 2017;117(2):665–683. doi: 10.1152/jn.00726.2016 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 84.Afshar A, Santhanam G, Byron MY, Ryu SI, Sahani M, Shenoy KV. Single-trial neural correlates of arm movement preparation. Neuron. 2011;71(3):555–564. doi: 10.1016/j.neuron.2011.05.047 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 85.Churchland MM, Cunningham JP, Kaufman MT, Foster JD, Nuyujukian P, Ryu SI, et al. Neural population dynamics during reaching. Nature. 2012;487(7405):51–56. doi: 10.1038/nature11129 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 86.Michaels JA, Dann B, Intveld RW, Scherberger H. Predicting reaction time from the neural state space of the premotor and parietal grasping network. J Neurosci. 2015;35(32):11415–11432. doi: 10.1523/JNEUROSCI.1714-15.2015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 87.Kaufman MT, Seely JS, Sussillo D, Ryu SI, Shenoy KV, Churchland MM. The largest response component in the motor cortex reflects movement timing but not movement type. Eneuro. 2016;3(4). [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 88.Michaels JA, Scherberger H. Population coding of grasp and laterality-related information in the macaque fronto-parietal network. Sci Rep. 2018;8(1):1–15. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 89.Miri A, Warriner CL, Seely JS, Elsayed GF, Cunningham JP, Churchland MM, et al. Behaviorally selective engagement of short-latency effector pathways by motor cortex. Neuron. 2017;95(3):683–696.e11. doi: 10.1016/j.neuron.2017.06.042 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 90.Sauerbrei BA, Guo J-Z, Cohen JD, Mischiati M, Guo W, Kabra M, et al. Cortical pattern generation during dexterous movement is input-driven. Nature. 2020;577(7790):386–391. doi: 10.1038/s41586-019-1869-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 91.Elsayed GF, Cunningham JP. Structure in neural population recordings: an expected byproduct of simpler phenomena? Nat Neurosci. 2017;20(9):1310–1318. doi: 10.1038/nn.4617 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 92.Churchland MM, Byron MY, Cunningham JP, Sugrue LP, Cohen MR, Corrado GS, et al. Stimulus onset quenches neural variability: a widespread cortical phenomenon. Nat Neurosci. 2010;13(3):369–378. doi: 10.1038/nn.2501 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 93.Ames KC, Ryu SI, Shenoy KV. Neural dynamics of reaching following incorrect or absent motor preparation. Neuron. 2014;81(2):438–451. doi: 10.1016/j.neuron.2013.11.003 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 94.Gallego JA, Perich MG, Naufel SN, Ethier C, Solla SA, Miller LE. Cortical population activity within a preserved neural manifold underlies multiple motor behaviors. Nat Commun. 2018;9(1):4233. doi: 10.1038/s41467-018-06560-z [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 95.Sun X, O’Shea DJ, Golub MD, Trautmann EM, Vyas S, Ryu SI, et al. Cortical preparatory activity indexes learned motor memories. Nature. 2022;1–6. doi: 10.1038/s41586-021-04329-x [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 96.Churchland MM, Yu BM, Ryu SI, Santhanam G, Shenoy KV. Neural variability in premotor cortex provides a signature of motor preparation. J Neurosci. 2006;26(14):3697–3712. doi: 10.1523/JNEUROSCI.3762-05.2006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 97.Hoshi E, Shima K, Tanji J. Task-dependent selectivity of movement-related neuronal activity in the primate prefrontal cortex. J Neurophysiol. 1998;80(6):3392–3397. doi: 10.1152/jn.1998.80.6.3392 [DOI] [PubMed] [Google Scholar]
  • 98.Rainer G, Asaad WF, Miller EK. Selective representation of relevant information by neurons in the primate prefrontal cortex. Nature. 1998;363(6885):577–579. doi: 10.1038/31235 [DOI] [PubMed] [Google Scholar]
  • 99.Quintana J, Fuster JM. From perceptions to actions: Temporal integrative functions of prefrontal and parietal neurons. Cereb Cortex. 1999;9(3):213–221. [DOI] [PubMed] [Google Scholar]
  • 100.Fuster JM, Bodner M, Kroger JK. Cross-modal and cross-temporal association in neurons of frontal cortex. Nature. 2000;405(6784):347–351. doi: 10.1038/35012613 [DOI] [PubMed] [Google Scholar]
  • 101.Miller EK. The prefrontal cortex and cognitive control. Nat Rev Neurosci. 2000;1(1):59–65. doi: 10.1038/35036228 [DOI] [PubMed] [Google Scholar]
  • 102.Rowe JB, Toni I, Josephs O, Frackowiak RS, Passingham RE. The prefrontal cortex: response selection or maintenance within working memory? Science. 2000;288(5471):1656–1660. doi: 10.1126/science.288.5471.1656 [DOI] [PubMed] [Google Scholar]
  • 103.Barraclough DJ, Conroy ML, Lee D. Prefrontal cortex and decision making in a mixed-strategy game. Nat Neurosci. 2004;7(4):404–410. doi: 10.1038/nn1209 [DOI] [PubMed] [Google Scholar]
  • 104.Schall JD. On building a bridge between brain and behavior. Annu Rev Psychol. 2004;55:23–50. doi: 10.1146/annurev.psych.55.090902.141907 [DOI] [PubMed] [Google Scholar]
  • 105.Gold JI, Shadlen MN. The neural basis of decision making. Annu Rev Neurosci. 2007;30:535–574. doi: 10.1146/annurev.neuro.29.051605.113038 [DOI] [PubMed] [Google Scholar]
  • 106.Shadlen MN, Kiani R, Hanks TD, Churchland AK. Neurobiology of decision making: An intentional framework. In: Engel C, Singer W, editors. Better than Conscious? Decision Making, the Human Mind, and Implications for Institutions. Cambridge, MA: MIT Press; 2008. p. 71–101. [Google Scholar]
  • 107.Thompson KG, Hanes DP, Bichot NP, Schall JD. Perceptual and motor processing stages identified in the activity of macaque frontal eye field neurons during visual search. J Neurophysiol. 1996;76(6):4040–4055. doi: 10.1152/jn.1996.76.6.4040 [DOI] [PubMed] [Google Scholar]
  • 108.Sato TR, Schall JD. Effects of stimulus-response compatibility on neural selection in frontal eye field. Neuron. 2003;38(4):637–648. doi: 10.1016/s0896-6273(03)00237-x [DOI] [PubMed] [Google Scholar]
  • 109.Stanford TR, Shankar S, Massoglia DP, Costello MG, Salinas E. Perceptual decision making in less than 30 milliseconds. Nat Neurosci. 2010;13(3):379–385. doi: 10.1038/nn.2485 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 110.de Lafuente V, Jazayeri M, Shadlen MN. Representation of accumulating evidence for a decision in two parietal areas. J Neurosci. 2015;35(10):4306–4318. doi: 10.1523/JNEUROSCI.2451-14.2015 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 111.Hernandez A, Zainos A, Romo R. Temporal evolution of a decision-making process in medial premotor cortex. Neuron. 2002;33(6):959–972. doi: 10.1016/s0896-6273(02)00613-x [DOI] [PubMed] [Google Scholar]
  • 112.Coallier É, Michelet T, Kalaska JF. Dorsal premotor cortex: neural correlates of reach target decisions based on a color-location matching rule and conflicting sensory evidence. J Neurophysiol. 2015;113(10):3543–3573. doi: 10.1152/jn.00166.2014 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 113.Andersen RA, Cui H. Intention, action planning, and decision making in parietal-frontal circuits. Neuron. 2009;63(5):568–583. doi: 10.1016/j.neuron.2009.08.028 [DOI] [PubMed] [Google Scholar]
  • 114.Pesaran B, Nelson MJ, Andersen RA. Free choice activates a decision circuit between frontal and parietal cortex. Nature. 2008;453(7193):406–409. doi: 10.1038/nature06849 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 115.Heitz RP, Schall JD. Neural mechanisms of speed-accuracy tradeoff. Neuron. 2012;76(3):616–628. doi: 10.1016/j.neuron.2012.08.030 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 116.Bogacz R, Wagenmakers EJ, Forstmann BU, Nieuwenhuis S. The neural basis of the speed-accuracy tradeoff. Trends Neurosci. 2010;33(1):10–16. doi: 10.1016/j.tins.2009.09.002 [DOI] [PubMed] [Google Scholar]
  • 117.Forstmann BU, Anwander A, Schafer A, Neumann J, Brown S, Wagenmakers EJ, et al. Cortico-striatal connections predict control over speed and accuracy in perceptual decision making. Proc Natl Acad Sci U S A. 2010;107(36):15916–15920. doi: 10.1073/pnas.1004932107 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 118.Klaes C, Westendorff S, Chakrabarti S, Gail A. Choosing goals, not rules: deciding among rule-based action plans. Neuron. 2011;70(3):536–548. doi: 10.1016/j.neuron.2011.02.053 [DOI] [PubMed] [Google Scholar]
  • 119.Scott SH, Kalaska JF. Reaching movements with similar hand paths but different arm orientations. I. Activity of individual cells in motor cortex. J Neurophysiol. 1997;77:826–853. doi: 10.1152/jn.1997.77.2.826 [DOI] [PubMed] [Google Scholar]
  • 120.Scott SH, Sergio LE, Kalaska JF. Reaching movements with similar hand paths but different arm orientations. II. Activity of individual cells in dorsal premotor cortex and parietal area 5. J Neurophysiol. 1997;78:2413–2426. doi: 10.1152/jn.1997.78.5.2413 [DOI] [PubMed] [Google Scholar]
  • 121.Sergio LE, Kalaska JF. Systematic changes in directional tuning of motor cortex cell activity with hand location in the workspace during generation of static isometric forces in constant spatial directions. J Neurophysiol. 1997;78:1170–1174. doi: 10.1152/jn.1997.78.2.1170 [DOI] [PubMed] [Google Scholar]
  • 122.Sergio LE, Kalaska JF. Changes in the temporal pattern of primary motor cortex activity in a directional isometric force versus limb movement task. J Neurophysiol. 1998;80:1577–1583. doi: 10.1152/jn.1998.80.3.1577 [DOI] [PubMed] [Google Scholar]
  • 123.Scott SH. The computational and neural basis of voluntary motor control and planning. Trends Cogn Sci. 2012;16(11):541–549. doi: 10.1016/j.tics.2012.09.008 [DOI] [PubMed] [Google Scholar]
  • 124.Pruszynski JA, Kurtzer I, Nashed JY, Omrani M, Brouwer B, Scott SH. Primary motor cortex underlies multi-joint integration for fast feedback control. Nature. 2011;478(7369):387–390. doi: 10.1038/nature10436 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 125.Herter TM, Korbel T, Scott SH. Comparison of neural responses in primary motor cortex to transient and continuous loads during posture. J Neurophysiol. 2009;101(1):150–163. doi: 10.1152/jn.90230.2008 [DOI] [PubMed] [Google Scholar]
  • 126.Gribble PL, Scott SH. Overlap of internal models in motor cortex for mechanical loads during reaching. Nature. 2002;417(6892):938–941. doi: 10.1038/nature00834 [DOI] [PubMed] [Google Scholar]
  • 127.Cabel DW, Cisek P, Scott SH. Neural activity in primary motor cortex related to mechanical loads applied to the shoulder and elbow during a postural task. J Neurophysiol. 2001;86 (4):2102–2108. doi: 10.1152/jn.2001.86.4.2102 [DOI] [PubMed] [Google Scholar]
  • 128.Holdefer R, Miller L. Primary motor cortical neurons encode functional muscle synergies. Exp Brain Res. 2002;146(2):233–243. doi: 10.1007/s00221-002-1166-x [DOI] [PubMed] [Google Scholar]
  • 129.Cheney PD, Fetz EE. Functional classes of primate corticomotoneuronal cells and their relation to active force. J Neurophysiol. 1980;44(4):773–791. doi: 10.1152/jn.1980.44.4.773 [DOI] [PubMed] [Google Scholar]
  • 130.Evarts EV. Relation of pyramidal tract activity to force exerted during voluntary movement. J Neurophysiol. 1968;31:14–27. doi: 10.1152/jn.1968.31.1.14 [DOI] [PubMed] [Google Scholar]
  • 131.Ajemian R, Green A, Bullock D, Sergio L, Kalaska J, Grossberg S. Assessing the function of motor cortex: single-neuron models of how neural response is modulated by limb biomechanics. Neuron. 2008;58(3):414–428. doi: 10.1016/j.neuron.2008.02.033 [DOI] [PubMed] [Google Scholar]
  • 132.Pastor-Bernier A, Tremblay E, Cisek P. Dorsal premotor cortex is involved in switching motor plans. Front Neuroeng. 2012;5(5). doi: 10.3389/fneng.2012.00005 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 133.Archambault PS, Ferrari-Toniolo S, Battaglia-Mayer A. Online control of hand trajectory and evolution of motor intention in the parietofrontal system. J Neurosci. 2011;31(2):742–752. doi: 10.1523/JNEUROSCI.2623-10.2011 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 134.Takei T, Lomber SG, Cook DJ, Scott SH. Transient deactivation of dorsal premotor cortex or parietal area 5 impairs feedback control of the limb in macaques. Curr Biol. 2021;31(7):1476–1487.e5. doi: 10.1016/j.cub.2021.01.049 [DOI] [PubMed] [Google Scholar]
  • 135.Georgopoulos AP, Kalaska JF, Caminiti R, Massey JT. Interruption of motor cortical discharge subserving aimed arm movements. Exp Brain Res. 1983;49(3):327–340. doi: 10.1007/BF00238775 [DOI] [PubMed] [Google Scholar]
  • 136.Nakahashi A, Cisek P. Dynamics of the neural state in premotor and parietal cortex during multi-attribute decision-making. Soc Neurosci Abstr. 2019;Online. [Google Scholar]
  • 137.Okazawa G, Hatch CE, Mancoo A, Machens CK, Kiani R. Representational geometry of perceptual decisions in the monkey parietal cortex. Cell. 2021;184(14):3748–61 e18. doi: 10.1016/j.cell.2021.05.022 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 138.Sohn H, Narain D, Meirhaeghe N, Jazayeri M. Bayesian computation through cortical latent dynamics. Neuron. 2019;103(5):934–947.e5. doi: 10.1016/j.neuron.2019.06.012 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 139.Chung S, Abbott LF. Neural population geometry: An approach for understanding biological and artificial neural networks. Curr Opin Neurobiol. 2021;70:137–144. doi: 10.1016/j.conb.2021.10.010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 140.Saxena S, Russo AA, Cunningham J, Churchland MM. Motor cortex activity across movement speeds is predicted by network-level strategies for generating muscle activity. Elife. 2022;11:e67620. doi: 10.7554/eLife.67620 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 141.Russo AA, Khajeh R, Bittner SR, Perkins SM, Cunningham JP, Abbott LF, et al. Neural trajectories in the supplementary motor area and motor cortex exhibit distinct geometries, compatible with different classes of computation. Neuron. 2020;107(4):745–758.e6. doi: 10.1016/j.neuron.2020.05.020 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 142.Jiang X, Saggar H, Ryu SI, Shenoy KV, Kao JC. Structure in neural activity during observed and executed movements is shared at the neural population level, not in single neurons. Cell Rep. 2020;32(6):108006. doi: 10.1016/j.celrep.2020.108006 [DOI] [PubMed] [Google Scholar]
  • 143.Leblois A, Boraud T, Meissner W, Bergman H, Hansel D. Competition between feedback loops underlies normal and pathological dynamics in the basal ganglia. J Neurosci. 2006;26(13):3567–3583. doi: 10.1523/JNEUROSCI.5050-05.2006 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 144.Duque J, Lew D, Mazzocchio R, Olivier E, Ivry RB. Evidence for two concurrent inhibitory mechanisms during response preparation. J Neurosci. 2010;30(10):3793–3802. doi: 10.1523/JNEUROSCI.5722-09.2010 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 145.Thura D, Cisek P. A computational model of cortico-basal ganglia circuits for deciding between reaching actions. Soc Neurosci Abstr. 2019;Online. [Google Scholar]
  • 146.Brown JW, Bullock D, Grossberg S. How laminar frontal cortex and basal ganglia circuits interact to control planned and reactive saccades. Neural Netw. 2004;17(4):471–510. doi: 10.1016/j.neunet.2003.08.006 [DOI] [PubMed] [Google Scholar]
  • 147.Thura D, Cisek P. Microstimulation of dorsal premotor and primary motor cortex delays the volitional commitment to an action choice. J Neurophysiol. 2020;123(3):927–935. doi: 10.1152/jn.00682.2019 [DOI] [PubMed] [Google Scholar]
  • 148.Inagaki HK, Fontolan L, Romani S, Svoboda K. Discrete attractor dynamics underlies persistent activity in the frontal cortex. Nature. 2019;566(7743):212–217. doi: 10.1038/s41586-019-0919-7 [DOI] [PubMed] [Google Scholar]
  • 149.Finkelstein A, Fontolan L, Economo MN, Li N, Romani S, Svoboda K. Attractor dynamics gate cortical information flow during decision-making. Nat Neurosci. 2021;24(6):843–850. doi: 10.1038/s41593-021-00840-6 [DOI] [PubMed] [Google Scholar]
  • 150.Churchland MM, Santhanam G, Shenoy KV. Preparatory activity in premotor and motor cortex reflects the speed of the upcoming reach. J Neurophysiol. 2006;96(6):3130–46. doi: 10.1152/jn.00307.2006 [DOI] [PubMed] [Google Scholar]
  • 151.Wadell H. Volume, shape, and roundness of quartz particles. J Geol. 1935;43(3):250–280. [Google Scholar]

Decision Letter 0

Kris Dickson, PhD

18 May 2022

Dear Drs Thura and Cisek,

Thank you for submitting your manuscript entitled "Unified neural dynamics of decisions and actions in the cerebral cortex and basal ganglia" for consideration as a Research Article by PLOS Biology.

Your manuscript has now been evaluated by myself and an academic editor with relevant expertise. I am writing to let you know that we would like to send your submission out for external peer review. However, before we can send your manuscript to reviewers, we need you to complete your submission by providing the metadata that is required for full assessment. (See instructions below my signature.)

I also wanted to take a moment to (re)introduce myself. While it's been a while, you may recall me from our prior interactions when I was Deputy Editor at Neuron under Katja Brose. I’ve been freelance writing and editing for the past several years, and I recently started as the PLOS Biology Neuroscience Senior Editor/Section Manager. I am glad that you thought of PLOS Biology for this submission and I look forward to continued interactions with you during the peer review process.

With best wishes,

Kris

Kris Dickson, Ph.D. (she/her)

Neurosciences Senior Editor/Section Manager

PLOS Biology

kdickson@plos.org

***Full Submission instructions:

Please login to Editorial Manager where you will find the paper in the 'Submissions Needing Revisions' folder on your homepage. Please click 'Revise Submission' from the Action Links and complete all additional questions in the submission questionnaire.

Once your full submission is complete, your paper will undergo a series of checks in preparation for peer review. Once your manuscript has passed the checks it will be sent out for review. To provide the metadata for your submission, please Login to Editorial Manager (https://www.editorialmanager.com/pbiology) within two working days, i.e. by May 20 2022 11:59PM.

If your manuscript has been previously reviewed at another journal, PLOS Biology is willing to work with those reviews in order to avoid re-starting the process. Submission of the previous reviews is entirely optional and our ability to use them effectively will depend on the willingness of the previous journal to confirm the content of the reports and share the reviewer identities. Please note that we reserve the right to invite additional reviewers if we consider that additional/independent reviewers are needed, although we aim to avoid this as far as possible. In our experience, working with previous reviews does save time.

If you would like to send previous reviewer reports to us, please email me at kdickson@plos.org to let me know, including the name of the previous journal and the manuscript ID the study was given, as well as attaching a point-by-point response to reviewers that details how you have or plan to address the reviewers' concerns.

During the process of completing your manuscript submission, you will be invited to opt-in to posting your pre-review manuscript as a bioRxiv preprint. Visit http://journals.plos.org/plosbiology/s/preprints for full details. If you consent to posting your current manuscript as a preprint, please upload a single Preprint PDF.

Feel free to email us at plosbiology@plos.org if you have any queries relating to uploading your full submission.

Decision Letter 1

Kris Dickson, PhD

1 Jul 2022

Dear David and Paul,

Thank you for your patience while your manuscript "Unified neural dynamics of decisions and actions in the cerebral cortex and basal ganglia" was peer-reviewed at PLOS Biology. It has now been evaluated by the PLOS Biology editors, an Academic Editor with relevant expertise, and by 2 independent reviewers.

I am delighted to tell you that, based on the reviewer feedback, we are likely to accept this manuscript for publication. At this stage we simply ask that you address a few textual editorial requests, as well as the data and other policy-related requests listed at the bottom of this email.

Editorial requests:

TITLE:

We'd like to suggest a title change that we feel will make this work somewhat more accessible to our broad audience. Something along the lines of:

1) Sensorimotor decisions and actions reflect unified neural dynamics

2) Movement decisions and actions are mediated by similar neural dynamics

3) Movement decisions and actions are mediated by similar neural dynamics in the cerebral cortex and basal ganglia

ABSTRACT:

Our non-native English speaking editors felt that replacing "implemented" with "coded" or "regulated" or something similar would be preferable in the abstract.

BLURB for ETOC USE:

Please also provide a blurb which will be included in our weekly and monthly Electronic Table of Contents (eTOCs), sent out to readers of PLOS Biology. This blurb may also be used to promote your article on social media. The blurb should be about 30-40 words long and is subject to editorial changes. It should, without exaggeration, entice people to read your manuscript, should not be redundant with the title and should not contain acronyms or abbreviations. For examples, view our author guidelines: https://journals.plos.org/plosbiology/s/revising-your-manuscript#loc-blurb

As you address these items, please take this last chance to review your reference list to ensure that it is complete and correct. If you have cited papers that have been retracted, please include the rationale for doing so in the manuscript text, or remove these references and replace them with relevant current references. Any changes to the reference list should be mentioned in the cover letter that accompanies your revised manuscript.

We expect to receive your revised manuscript within two weeks.

To submit your revision, please go to https://www.editorialmanager.com/pbiology/ and log in as an Author. Click the link labelled 'Submissions Needing Revision' to find your submission record. Your revised submission must include the following:

- a cover letter that should detail your responses to any editorial requests, if applicable, and whether changes have been made to the reference list

- a Response to Reviewers file that provides a detailed response to the reviewers' comments (if applicable)

- a track-changes file indicating any changes that you have made to the manuscript.

NOTE: If Supporting Information files are included with your article, note that these are not copyedited and will be published as they are submitted. Please ensure that these files are legible and of high quality (at least 300 dpi) in an easily accessible file format. For this reason, please be aware that any references listed in an SI file will not be indexed. For more information, see our Supporting Information guidelines:

https://journals.plos.org/plosbiology/s/supporting-information

*Published Peer Review History*

Please note that you may have the opportunity to make the peer review history publicly available. The record will include editor decision letters (with reviews) and your responses to reviewer comments. If eligible, we will contact you to opt in or out. Please see here for more details:

https://blogs.plos.org/plos/2019/05/plos-journals-now-open-for-published-peer-review/

*Press*

Should you, your institution's press office or the journal office choose to press release your paper, please ensure you have opted out of Early Article Posting on the submission form. We ask that you notify us as soon as possible if you or your institution is planning to press release the article.

*Protocols deposition*

To enhance the reproducibility of your results, we recommend that if applicable you deposit your laboratory protocols in protocols.io, where a protocol can be assigned its own identifier (DOI) such that it can be cited independently in the future. Additionally, PLOS ONE offers an option for publishing peer-reviewed Lab Protocol articles, which describe protocols hosted on protocols.io. Read more information on sharing protocols at https://plos.org/protocols?utm_medium=editorial-email&utm_source=authorletters&utm_campaign=protocols

Please do not hesitate to contact me should you have any questions.

Sincerely,

Kris

Kris Dickson, Ph.D. (she/her)

Neurosciences Senior Editor/Section Manager,

kdickson@plos.org,

PLOS Biology

-----------------------------------------------------------------------

------------------------------------------------------------------------

DATA POLICY:

You may be aware of the PLOS Data Policy, which requires that all data be made available without restriction: http://journals.plos.org/plosbiology/s/data-availability. For more information, please also see this editorial: http://dx.doi.org/10.1371/journal.pbio.1001797

Note that we do not require all raw data. Rather, we ask that all individual quantitative observations that underlie the data summarized in the figures and results of your paper be made available. While I note that you'd indicated that the data are included as supplemental files, I only saw supplemental movie files.

Please add files, in one of the following forms, for the underlying data in:

Fig 2, Fig 3A-C, Fig 4A-D, Fig 5, Fig 6A,B,D, Fig 7A-C, Fig 8A-F

Supplemental Fig 2, Fig 3A-E, Fig 4A-F, Fig 5A-H, Fig 6A-D, Fig 7B-F, Fig 8A-B

1) As Supplementary files (e.g., excel). Please ensure that all data files are uploaded as 'Supporting Information' and are invariably referred to (in the manuscript, figure legends, and the Description field when uploading your files) using the following format verbatim: S1 Data, S2 Data, etc. Multiple panels of a single or even several figures can be included as multiple sheets in one excel file that is saved using exactly the following convention: S1_Data.xlsx (using an underscore).

2) As a deposition in a publicly available repository. Please also provide the accession code or a reviewer link so that we may view your data before publication.

Regardless of the method selected, please ensure that you provide the individual numerical values that underlie the summary data displayed in these figure panels as they are essential for readers to assess your analysis and to reproduce your findings.

Please also ENSURE THAT THE FIGURE LEGENDS in your manuscript include information on where the underlying data can be found, and ensure your supplemental data file/s has a legend. (NB - this step is often forgotten!!!)

NOTE: the numerical data provided should include all replicates AND the way in which the plotted mean and errors were derived (it should not present only the mean/average values).

Please ensure that your Data Statement in the submission system also accurately describes where your data can be found.

------------------------------------------------------------------------

CODE REQUESTS:

If applicable, we ask that you share publicly any code that you created and that directly relates to the results described in your article, following best practices for code sharing, such as:

- Deposition of your code in public repositories

- Citation of publicly-available code that is assigned a persistent identifier in your reference list

- Use of open source definition licenses for your code

------------------------------------------------------------------------

DATA NOT SHOWN?

As applicable, please note that per journal policy, we do not allow the mention of "data not shown", "personal communication", "manuscript in preparation" or other references to data that is not publicly available or contained within this manuscript. Please either remove mention of any such data or provide figures presenting the results and the data underlying these mentions.

------------------------------------------------------------------------

Reviewer remarks:

Reviewer's Responses to Questions

PLOS authors have the option to publish the peer review history of their article (what does this mean?). If published, this will include your full peer review and any attached files.

If you choose “no”, your identity will remain anonymous but your review may still be made public.

Do you want your identity to be public for this peer review? For information about this choice, including consent withdrawal, please see our Privacy Policy.

Reviewer #1: No

Reviewer #2: No

Reviewer #1: This is a very important paper and is likely to be transformative on the field. I believe that it will become regarded as a classic, and will have a dramatic effect on the way data are analyzed. I will provide a review, below, but I hope that the editors and authors will see that my comments are meant to improve the paper, not to criticize it. Because this journal does not seem to have a blind comments to editor section in the peer review interface, I will state here that I am strongly supportive of publication and, because I think that these results could influence a broader audience, the journal should consider a News and View-like piece to accompany the article.

In short, this paper looks at the neuronal basis of decisions - an obviously important problem. The authors record in macaques in 5 regions (PMd, M1, DLPFC, GPi, and GPe) during a choice task. Rather than the normal practice of laserlike focus on a single neuron, and what can be inferred from them, the authors take a population-centered approach. In other words, they use the population doctrine not the single neuron doctrine. Specifically, they use PCA (although they use other methods with similar results in the Supplement). They find that:

* The first PC represents the transition from deliberation to commitment

* The second PC represents the accumulated evidence

* The third PC reflects the urgency, as determined by block

* The fourth PC reflects urgency as well

This tool, then lets them do some interesting things. For example, they are able to detect a new difference between PMd/M1 and DLPFC, and between these cortical regions and the GP structures. These differences are one that would not be visible with conventional single unit methods.

The primary importance of the paper comes from its entirely new perspective, one that helps us move past old ways of looking at how decision-making is organized, and towards new ones. Thus, the results, while important in and of themselves, are also important because they illustrate how to use population tools - even relatively simple ones - to think new thoughts about the neural basis of choice. Indeed, the paper has existed in poster form for several years, and the posters themselves have been passed around, and have become well-known among many scholars as major important works. It is gratifying to finally see them turned into a manuscript, which is sure to have an even larger impact.

Besides the perspective, the core empirical results of the paper are also important. First, the provide further support for - and elaboration of - the idea that action selection and decision rely not just on the same circuits, but instead are really two ways of thinking about the same process. But the most important and exciting result is that commitment to a decision is not a thresholding process, but rather an attractor process. This is a major change, and should engender a major rethink in the literature, so much of which is about thresholds. Beyond this, there are several other important results - the Discussion does a good job of summarizing and contextualizing them. This is, clearly, a major paper, that reflects a good deal of thought over many years.

MINOR COMMENTS

Given the current writing, it is hard to link the author's underlying hypotheses and the results one would expect. For example, in the first section 'Dimensionality reduction of neural activity into principal components', the authors spend a lot of time discussing how they did PCA and comparative analyses (GPFA). This is fine. But these methodological details seem to be present in place of (1) what the hypothesis being tested is, and (2) what the reader should expect to see in the PCA results given those and alternative hypotheses.

In the same 'Dimesionality reduction…' section, the authors bring special focus to PC2. I agree that it is interesting that it shows a split between "easy", "hard" and "ambiguous". But, should we have expected this given the hypothesized overlap between 'choice' and 'execution'? The connections between hypotheses and results are remain somewhat opaque.

It seems useful to use the TME method here. But, it sems like this is being used in place of a (mixed) selectivity analysis. Are we to interpret this control as saying the selectivity is random and mixed across the population? If so, it seems best to show this with something like a pairs or epairs test; note, I realize the authors used a clustering of loading coefficients later.

I think part of the claim with TME being made here is that 'no neuron subtype is solely responsible' for these dynamics seen in the PCA. But, how much of these neurons exhibit a pure, linear mixed or nonlinear mixed selectivity? How much of the PCA results are in fact due to dynamic changes in the nonlinear mixtures of units for task-relevant variables (see 5 next, also).

If the underlying components of the decision model are a priori known, why is PCA used rather than a focused method such as regression and their underlying subspaces? This would be more theory/hypothesis-driven. If using a regression approach, then the authors could introduce movement execution and choice regressors (and their interaction). Doing this would allow testing the claim they setout to do, but right now, it's unclear what constitutes 'execution' in the PCs; I don't see movement variables, except for left or right. If the authors meant for choice to == integration of evidence, and left/right to == execution, please state this more clearly and why left/right aren't just action rules or space codes.

Analysis of the loading matrix: I think this analysis has potential to answer some of the authors main questions. However, the section is unclear because the expected results given the hypotheses aren't stated up front. It would help the reader to know what they should be looking for to follow along

The authors state (lines 352-353): "this suggests that the lack of distinct clusters….reflect absence of categories in the neural populations". Is this statement meant to convey that the distribution of neuron encoding approaches what many called random mixed selectivity (Raposo et al., 2014; Hirokawa et al., 2019)? If so, I think it would be best to state that clearly and cite those works too.

abstract: what does "clusters within each region" mean?

I realize that many of the results come from as much as 7 years ago. As such it is not surprising that many of the references are limited to an earlier time. I do think the paper would be improved by adding many of the important papers that come from the period of more recency.

Saxena and Cunningham's paper on the population doctrine should be cited somewhere. Other relevant reviews include Urai and Churchland and Ebitz and Hayden.

Reviewer #2: While what Thura et al. present in this manuscript is not earthshattering new, the paper provides a very nice holistic view of the neural dynamics underlying a complex task, whose components were largely studied independently before. The authors trained monkeys to perform a visual perceptual decision-making task, which required the animals to report a choice through performing a hand movement. Neurons were recorded separately from a number of different brain areas, including prefrontal cortex, premotor and primary motor cortex, and the basal ganglia. Since the neurons were not recorded simultaneously, rather than taking a single-trial decoding approach, the authors relied on averaged neural activity from different classes of trials, separated by the animal's choice (left vs. right), time pressure on the current trial (slow vs. fast), and history of the sensory evidence (easy vs. ambiguous vs. misleading). The analysis presents the trials as a continuous dynamic process, including deliberation, choice commitment, movement preparation, and movement execution, in which the different brain areas engage in to varying degrees. The paper is very well written, and I don't have any major comments to add. I only have a number of smaller comments to consider:

1) Fig. 3 legend, "A. Dotted lines show the sensory evidence […]": There don't appear to be any dotted lines in this panel.

2) Fig. 4A, B: Is there a way to convince oneself that the final parts of the shown trajectories are related to active movement execution (rather than potentially just passively returning to a "neutral" state for the next trial)?

3) Lines 377-379, "Importantly, the transition to commitment is not equivalent to the crossing of a neural "threshold", but instead resembles falling off the edge of the decision manifold into an orthogonal subspace that loses sensitivity to evidence.": Why is it not possible to think of reaching this edge as reaching a decision threshold from the point of view of terminating the deliberation phase (ignoring, for the moment, what happens dynamically once the edge is reached)?

4) Lines 379-382, "As can be seen […] implying that straightforward summation of the activity of cells tuned to the chosen target may not be constant across trial types, as indeed often observed […].": What about the possibility of a time-variant decision criterion (as proposed in many more recent decision models)? Are the results inconsistent with such an idea?

Decision Letter 2

Kris Dickson, PhD

29 Sep 2022

Dear Dr Thura,

Thank you for the submission of your revised Research Article "Integrated neural dynamics of sensorimotor decisions and actions" for publication in PLOS Biology. On behalf of my colleagues and the Academic Editor,

Alex Gail, I am pleased to say that we can in principle accept your manuscript for publication, provided you address any remaining formatting and reporting issues. These will be detailed in an email you should receive within 2-3 business days from our colleagues in the journal operations team; no action is required from you until then. Please note that we will not be able to formally accept your manuscript and schedule it for publication until you have completed any requested changes.

Additionally, when submitting the final version of your submission, Alex has suggested that you might want to consider rephrasing part of your newly added text to improve the accessibility of the paper - specifically regarding the text block attempting to link your hypothesis with specific predictions that should be seen with the PCA-based data analytic approach. Alex noted that the newly added text seems to be mostly phrased from a single neuron perspective ("neurons [...] will be distinct [...]" and "neurons that mix all [...] factors"), which he felt didn't quite achieve this goal. He argued that the main idea of the dynamical systems approach is to emphasize system dynamics in a low-dimensional space. Alex therefore thought that translating the two alternative hypotheses that you want to compare against each other into two clearly distinguishable outcomes regarding low-dimensional state trajectories (or associated PC spectra) might be a more compelling approach and would help to make the results easier to interpret. We therefore ask that you consider this additional minor revision to the final version of the study.

Please take a minute to log into Editorial Manager at http://www.editorialmanager.com/pbiology/, click the "Update My Information" link at the top of the page, and update your user information to ensure an efficient production process.

PRESS

We frequently collaborate with press offices. If your institution or institutions have a press office, please notify them about your upcoming paper at this point, to enable them to help maximize its impact. If the press office is planning to promote your findings, we would be grateful if they could coordinate with biologypress@plos.org. If you have previously opted in to the early version process, we ask that you notify us immediately of any press plans so that we may opt out on your behalf.

We also ask that you take this opportunity to read our Embargo Policy regarding the discussion, promotion and media coverage of work that is yet to be published by PLOS. As your manuscript is not yet published, it is bound by the conditions of our Embargo Policy. Please be aware that this policy is in place both to ensure that any press coverage of your article is fully substantiated and to provide a direct link between such coverage and the published work. For full details of our Embargo Policy, please visit http://www.plos.org/about/media-inquiries/embargo-policy/.

Thank you again for choosing PLOS Biology for publication and supporting Open Access publishing. We look forward to publishing your study. 

Sincerely, 

Kris

Kris Dickson, Ph.D. (she/her)

Neurosciences Senior Editor/Section Manager

PLOS Biology

kdickson@plos.org

Associated Data

    This section collects any data citations, data availability statements, or supplementary materials included in this article.

    Supplementary Materials

    S1 Text. Supporting information.

    Fig A. Schematic diagram of the analysis approach. Step 1: Neural activity is recorded from individual cells while the monkey performs individual trials of different types (e.g., “easy trial to the right,” “misleading trial to the left,” etc.), and the temporal profile is aligned on movement onset time. Step 2: For each cell, activity is averaged from all trials during the slow block in which the target on the right was chosen, to produce an average “Slow-R all” temporal profile. This is repeated for Slow-L, Fast-R, and Fast-L trial groups. Step 3: The four trial groups are concatenated into a single row for each cell, and the rows of all cells grouped into a large matrix. Step 4: Principal components analysis is performed on that matrix to produce the “loading matrix” L. Step 5: To generate the trajectory for a specific trial type and a specific population, we first average the neural activity of a given cell over all trials of the given type (e.g., misleading trials in the slow block in which the right target was chosen). Step 6: To compute the profile for that trial type and for a given principal component, the average activity of each cell is weighted by the corresponding row (cell) and column (PC) of the loading matrix L and summed. Step 7: The profile of each PC is divided by the number of cells used. Step 8: The resulting PC profiles are plotted against each other to produce neural trajectories. Here, the trajectory for Slow-Misleading-R trials is highlighted and superimposed over the trajectories of other slow block trials, as in Fig 4a. Note that steps 5–8 can be computed using arbitrary subsets of data (different trial groupings, different subsets of cells), and arbitrary alignment (on movement onset or on the first token jump), because all they depend upon is the loading matrix computed by steps 1–4. Fig B. The top four principal components produced when the loading matrix was computed from each brain region separately, same format as Fig 2. On the right are shown the neural trajectories computed in the space of the components that reflect commitment, evidence, and elapsing time. Note that in some cases, the axis is inverted to facilitate comparison of these neural spaces to those shown in Figs 6 and 7. Data and code available at https://doi.org/10.6084/m9.figshare.20805586. Fig C. The top four principal components produced when the loading matrix was computed using only 35 cells per region (A), only data from movement onset to 400 ms later (B), data without square root transformation (C), only data from the slow block (D), or all data from all 28 trial conditions separately (E). Same format as Fig B. Data and code available at https://doi.org/10.6084/m9.figshare.20805586. Fig D. Results computed separately for each of the two animals. Panels A and B show the principal components (same format as Fig 2). Panels C and E show the top four components calculated using each brain region (similar to Fig 5). Panels D and F show the decision manifolds for PMd and M1 (similar to Fig 6). Data and code available at https://doi.org/10.6084/m9.figshare.20805586. Fig E. Results computed without the cell duplication step. (A) The top four principal components (similar to Fig 2). (B) The neural trajectories and decision manifold for PMd (similar to Fig 6a). (C) Neural trajectories and decision manifold for M1 (similar to Fig 2B). (D-F) Neural trajectories for dlPFC, GPe, and GPi (similar to Fig 7). (G) Loading distributions for PMd and M1 with respect to PC1 and PC2 (similar to Fig 8A and 8C, left). (H) Loading distributions for dlPFC, GPe, and GPi with respect to PC1, PC2, and PC4 (similar to Fig 8F). Data and code available at https://doi.org/10.6084/m9.figshare.20805586. Fig F. Analyses of synthetic populations. (A) The top 3 synthetic PCs (SPCs) obtained from a population of 600 neurons constructed using combinations of the original PCs 1, 2, and 4 (see Fig 2). (B) The loading coefficients of all 600 neurons, fitted with GMMs (gray ellipsoids). Colors indicate the three groups of neurons built with different combinations of those original PCs (see legend). (C) The top 3 SPCs obtained from a population of 600 neurons constructed only using the original PC1 and PC2 but separated into three distinct categories: cells that are only active during deliberation, cells only active during movement, and cells active during both epochs. (D) The loading coefficients of all 600 neurons, fitted with GMMs (gray ellipsoids). Colors indicate the three categories of neurons. Data and code available at https://doi.org/10.6084/m9.figshare.20805586. Fig G. Simulation of a simple 2-neuron recurrent competitive attractor. (A) The structure of the model is shown at left, next to the governing equations and parameter settings. Two forms of the interaction function f(x) are shown in the inset at right. (B) The flow field for a system with high gain (G = 1) and steep slope (S = 15) when the input evidence is balanced such that E1 = E2 = 0.2. Blue arrows depict the flow and shading indicates the speed (yellow = fastest, dark blue = slowest). The purple outline indicates a region around the stable equilibrium, in which the state will tend to remain even with substantial noise. Note that the dynamics flow quickly toward that region and then move more slowly within it. (C) The stable regions of the same system for several input patterns, ranging from strong evidence in favor of choice 2 (top left, blue) to strong evidence in favor of choice 1 (bottom right, red). Note that these align in what is approximately a straight line across the state space (compare to dashed black line). (D) Same as B but for a system in which the interaction function has low gain (G = 0.1) and a shallow slope (S = 5). (E) Stable regions of the system in D. Note that now, the stable regions for the same range of input patterns do not form a straight line but rather fall on a curve roughly equivalent to a circle (dashed black curve). (F) Same as C and E but for models in which the input itself is constrained to lie on a curve. Fig H. Comparison of selectivity in GPi with the slope of the PMd decision manifold. (A) Comparison within six conditions: all Slow block trials, Slow block trials where DT < 1,400, Slow block trials where DT > 1,400, all Fast block trials, Fast block trials where DT < 950, Fast block trials where DT > 950. Each panel plots the absolute value of PC2 computed from GPi (red, left y-axis, shaded region indicates 95% CI) along with the derivative of PC1 computed from PMd (blue, right y-axis). The black vertical line indicates the estimated time of commitment. At the top left corner are Pearson’s R and the p-value of the correlation between these two signals from 1,000 ms to 280 ms before movement. (B) The correlation of these signals, from 1,000 to 280 ms before movement onset, across all 6 conditions. Data and code available at https://doi.org/10.6084/m9.figshare.20805586.

    (PDF)

    S1 Movie. Neural trajectories in the space of PCs 1, 2, and 4, constructed separately for easy, ambiguous, and misleading trials as well as the other (unclassified) trials, using all cells recorded in the Slow block (Group 1 cells).

    (MP4)

    S2 Movie. Neural trajectories in the space of PCs 1, 2, and 4, plotted using PMd activity during Slow blocks (Group 1 cells).

    (MP4)

    S3 Movie. Neural trajectories in the space of PCs 1, 2, and 4, plotted using M1 activity during Slow blocks (Group 1 cells).

    (MP4)

    S4 Movie. Neural trajectories in the space of PCs 1, 2, and 4, constructed separately for easy, ambiguous, and misleading trials as well as the other (unclassified) trials, using all cells recorded in the Slow block (Group 1 cells), without the cell duplication step.

    (MP4)

    Attachment

    Submitted filename: ThuraCabanaFeghalyCisek PLOS_Responses2reviewers_AsSub.docx

    Data Availability Statement

    The data and MATLAB code necessary to reproduce the analyses shown in this paper are available at https://doi.org/10.6084/m9.figshare.20805586.


    Articles from PLOS Biology are provided here courtesy of PLOS

    RESOURCES