Skip to main content
UKPMC Funders Author Manuscripts logoLink to UKPMC Funders Author Manuscripts
. Author manuscript; available in PMC: 2015 Sep 1.
Published in final edited form as: Nat Neurosci. 2015 Jan 26;18(3):453–460. doi: 10.1038/nn.3924

The striatum multiplexes contextual and kinematic information to constrain motor habits execution

Pavel E Rueda-Orozco 1,2,3,4, David Robbe 1,2,3,4
PMCID: PMC4342106  EMSID: EMS61562  PMID: 25622144

Abstract

The striatum is required for the acquisition of procedural memories but its contribution to motor control once learning has occurred is unclear. Here we created a task in which rats learned a difficult motor sequence characterized by fine-tuned changes in running speed adjusted to spatial and temporal constraints. After training and extensive practice, we found that the behavior was habitual yet tetrode recordings in the dorsolateral striatum (DLS) revealed continuous integrative representations of running speed, position and time. These representations were weak in naive rats hand-guided to perform the same sequence and developed slowly after learning. Finally, DLS inactivation in well-trained animals preserved the structure of the sequence while increasing its trial-by-trial variability. We conclude that after learning the DLS continuously integrates task-relevant information to constrain the execution of motor habits. Our work provides a straightforward mechanism by which the basal ganglia may contribute to habit formation and motor control.

Introduction

In many of our everyday actions we perform specific sequences of movements with kinematic parameters (e.g., movement time, speed, trajectory) precisely adjusted to task-specific constraints1. Examples can be seen in the actions involved in driving a car, playing sports, using tools and performing arts. Several weeks to several months of training are necessary for the acquisition of these behaviors which are generally referred to as procedural memories or motor skills2. With practice, these skills can be performed automatically and there is converging evidence from lesion and physiological studies using motor sequence tasks in rodents3-10, monkeys11-14 and humans15,16 that the sensorimotor region of the striatum (the DLS in rodents) contributes to the execution of well-learned procedural memories, especially motor habits17. Nevertheless, the exact nature of this contribution remains unclear18 and debated19, possibly because action execution is the readout of higher order processes such as motor planning or action selection18. The clarification of this ambiguity is difficult for two reasons. First, behavioral performance during motor sequence tasks is generally quantified through global metrics (e.g., percentage of correct trials, average reaction time, number of lever presses), which overlook changes in kinematic parameters of movements occurring during learning (e.g., movements speed and trajectory, increased stereotypy of behavior and sensory stimuli). Second, due to the massive sensorimotor input of the striatum20 and its indirect connection toward motor circuits21, striatal spiking activity can be influenced by or influence kinematic parameters of action execution. Thus, during and after motor learning, it has not been established whether striatal spiking activity reflects primarily the acquisition of high-level motor control function (e.g., action planning, storage of motor programs), changes in low-level parameters (sensory and proprioceptive stimuli, dynamics of movements) that occurred during learning12 or specifically contributes to precise kinematic parameters associated with accurate execution12,18,19,22. Addressing this problem requires a paradigm that dissociates low-level from high-level processes associated with motor learning.

Results

We designed a task for rats that favors the generation of a motor sequence with fine-tuned kinematic parameters easily quantifiable. Specifically, we customized a motorized treadmill and trained rats to obtain rewards according to a spatiotemporal rule. Once the treadmill was turned on, animals could stop it and receive a drop of sucrose solution by entering a “stop area” located at the front of the treadmill (Fig. 1a). In addition to this spatial rule, a temporal constraint was added: stopping of the treadmill was only effective if animals waited at least 7 s (goal time) before entering the stop area (correct trials; Fig. 1b). If animals entered the stop area before the goal time, an error sound was played and they were forced to run for 20 s (incorrect non-rewarded trials; Fig. 1b). Initially, rats accelerated forward as soon as the treadmill was turned on and entered the stop area before the goal time, resulting in a majority of incorrect trials (Fig. 1c and Supplementary Video 1). After extensive training, rats executed a stereotyped sequence that could be divided in 3 overlapping phases: 1) passive displacement from the front to the rear portion of the treadmill, 2) stable running and 3) acceleration across the treadmill to enter the stop area (Fig. 1c, Supplementary Fig. 1 and Supplementary Video 2). The high level of stereotypy revealed by the trajectories of the animal was also demonstrated when tracking the position of the left forelimb (Supplementary Fig. 2). Learning occurred in a two-step process. Within a few sessions, animals learned to perform the “front-rear-front” motor sequence but in the majority of the trials they entered the stop area just before the goal time (sessions 15–45; Fig. 1d), which resulted in a low percentage of correct trials (Fig. 1e). Then, during the longest part of the training, animals progressively adjusted the kinematic parameters of execution to delay their entering in the stop area and increase their proficiency (sessions 30–100; Fig. 1d,e).

Figure 1. A fixed interval estimation task adapted to treadmill locomotion favored the learning of a stereotyped running sequence.

Figure 1

(a) Behavioral apparatus. (b) Task description. (c) Running trajectories (left) and entrance times in the stop area (right) for two sessions, before (top) and after (bottom) learning. Entrance times in the stop area (median ± first and third quartiles; d) and percentage of correct trials (e) for all the training sessions of a single animal.

We next examined if this learning process led to goal-directed or habitual performance of the running sequence. Motor habits are difficult to break and a classical test to probe the strength of habits is contingency degradation23,24. We designed two tests of contingency degradation adapted to our task. First, we shortened the treadmill (Fig. 2a) after extensive training (> 80 daily sessions), when the animals had reached stable performance (≥ 3 consecutive sessions with ≥ 72.5 % of correct trials). We observed that during several sessions following treadmill shortening, the animals persisted in performing the task using the previously learned kinematic parameters, yielding a majority of premature entrances into the stop area (Fig. 2a). Importantly, while the motor sequence was qualitatively identical before and after shortening (Fig. 2b), a great number of sessions were necessary for the animals to learn the new kinematic parameters adapted to the shorter treadmill (Fig. 2c). Second, we trained a separate group of animals to enter the stop area after a short goal time (4 or 5 s) and once the animals reached stable performance, the goal time was increased to 7 s. After this change, animals persisted in entering in the stop area just after the original goal time for several sessions (sessions 116–118; Fig. 2d) and it took the animals several sessions to return to the learning criterion (37, 16 and 10 sessions for Rat 7, 9 and 10, respectively). In both tests, behavioral persistence occurred despite the fact that rats occasionally performed correct trials under the new conditions (Fig. 2a,d). In some cases, several sessions after the change, the animals “relapsed” and performed the motor sequence using the previously learned kinematic parameters (Fig. 2c,d). This is in support of a motor habit that is difficult to break.

Figure 2. Evidence for habitual performance of the running sequence.

Figure 2

(a) Entrance times (median ± first and third quartiles) and percentage of correct trials, before and after shortening the treadmill. (b) Normalized averaged trajectories during the last session before treadmill shortening (orange trace) and the next 40 sessions (gray, all sessions; blue and red, sessions 5 and 40 after shortening). (c), Trajectory changes following treadmill shortening. Same color code than b. Rectangle in b shows the time at which the trajectory changes were computed. (d) Stop area entrance times (left: trial-by-trial; right median ± first and third quartiles) before and after changing the goal time. (e) Stop area entrance times (left: trial-by-trial; right median ± first and third quartiles) before and during reward-devaluation sessions (blue) performed at different times during training. Red dashed lines indicate goal times (d,e). All the data are taken from single animals and were reproduced in at least 3 animals.

Finally, we examined the impact of reward devaluation on task performance23,24. This was achieved by giving the animals free access to the sucrose solution before a probe session. We performed this procedure during training (Fig. 2e) and found that when animals had not yet reached behavioral proficiency, they disengaged from the task during the devaluation test (Fig. 2e). However, when we repeated the same procedure after further training, animals were less and less sensitive to reward devaluation (Fig. 2e). Altogether the results obtained in this set of behavioral manipulations indicate an habitual performance of the motor sequence after learning and extensive practice.

Next we performed tetrode recordings in the DLS of rats with stable proficient performance in the task and examined whether neuronal correlates relevant for execution of the running sequence could be extracted from the spiking activity of well-isolated units (n = 391 from 3 rats; Supplementary Fig. 3). In agreement with previous works, a minority of task-modulated units responded to the warning cue preceding the start of the treadmill (Supplementary Fig. 4)25, to reward delivery (Supplementary Fig. 5)26 or fired in bursts synchronized with limb movements (Supplementary Fig. 6)27,28. Still, as a population, striatal neurons displayed continuous sequential modulation of their spiking activity during task performance (Fig. 3a,b). To characterize the relationship between spiking activity and execution of the motor sequence we correlated instantaneous firing rates with the main behavioral variables (position on the treadmill, running speed and acceleration) and time. We found that a large fraction of these neurons displayed striking linear correlations between firing rate and either running speed (Fig. 4a-c) or position of the animal on the treadmill (Fig. 4d). The specificity of these correlations was quantified by computing Pearson’s partial correlation coefficients (Fig. 4a-d) and was confirmed when plotting error trials (Supplementary Fig. 7). Correlations between firing rate and running speed could not be accounted by transitions between trotting and galloping (Supplementary Fig. 8). Additionally, some units had their firing rate co-modulated by time, position and speed (Fig. 4e,f). Finally, striatal spiking activity was weakly correlated with acceleration (Fig. 4g). Next, we used a multiple regression analysis to quantify the extent to which the firing rates of striatal units could be explained by a linear combination of the main behavioral variables (time, position, speed and acceleration; see Online Methods for the validation of these variables as orthogonal predictors). First, the results obtained using partial correlation analysis were confirmed by examining the F values of each coefficient (medians of the F values for speed, position, time and acceleration were 877, 342, 151 and 70 respectively). Second, at the population level, the multiple correlation coefficients were consistently larger than the partial correlation coefficients (Fig. 4g), suggesting that firing rates were generally influenced by more than one variable (Fig. 4b,e). This possibility was confirmed by comparing regression models that took into account a single predictor with models that took into account several (2, 3 or 4) predictors (Supplementary Fig. 9). Finally, the proportion of units modulated by position, speed and acceleration and the predominance of linear correlations were confirmed with a distinct statistical method which did not rely on linear or monotonic relationships between spiking activity and behavioral variables (Supplementary Fig. 10). Altogether these results indicate that during habitual execution of the running sequence, spiking activity in the DLS is principally correlated with running speed and the position of the animal on the treadmill. Additionally, our data also revealed that neurons of the DLS multiplex information from several task-related variables.

Figure 3. Continuous sequential modulation of striatal spiking activity during sequence execution.

Figure 3

(a) Animal’s trajectory (blue) during a single trial and task events: L, light on; T, treadmill on; B, animal reaches the back of the treadmill; A, animal accelerates to cross the treadmill; E, animal reaches the stop area or terminates the sequence. (b) Averaged modulations of the firing rate of all recorded units, expressed as Z-score and sorted in time, during execution of the sequence, after learning and extensive practice.

Figure 4. Linear and integrative representation of running speed, position and time after learning.

Figure 4

(ae) Five illustrative units with firing rate correlated with running speed (ac), position on the treadmill (d), or a combination of time, position and speed (e). Left panels show spike rasters (top) and average perievent histograms (bottom), aligned to the end of the motor sequence (ac) or treadmill start (d,e). Middle scatter plots show firing rate vs. best correlated task variable (mean ± s.d.). Right panels show partial and multiple correlation coefficients between firing rate and task variables (*P < 0.001, Pearson’s partial correlation coefficient). (f) Two illustrative trials taken from the unit shown in e, with trajectories (top) and instantaneous firing rates (bottom). (g) Partial and multiple correlation coefficients between firing rate and task variables, sorted for all the recorded units (3 rats, same color code than ae). Correlation coefficients with P < 0.001 are on the right side of the dashed line.

The robust linear relationship between striatal activity and running speed could be a correlate of running speed control. If this was the case, this relationship should 1) improve when correlating running speed at a given time with preceding spiking activity and 2) decrease when correlating running speed at a given time with subsequent spiking activity. Such temporal relationship between a “leading” neuronal activity and a “following” behavior has been observed in the parietal cortex29 and was revealed by systematically shifting the spike times of the recorded units either forward or backward in time and recomputing correlation coefficients for different time shifts. In contrast with what was observed in the parietal cortex29, we found that, at the population level, the correlation coefficients between firing rate and running speed were equally affected when spike times were shifted forward or backward relative to behavior (Fig. 5a-c). The analysis revealed that high correlation values were maintained for time shifts ranging from − 0.25 to + 0.25 s (Fig. 5b,c). This result was confirmed by randomly jittering the spikes times of each unit on a trial-by-trial basis over different timescales and recomputing the correlation coefficients (Fig. 5d). We compared the decay of the correlation coefficients for different jitters with the auto-correlation function of the running speed profiles. We found that changes in running speed occurred slightly faster than changes in neural activity (Fig. 5d). This result is more compatible with a continuous modulatory function of the speed-related spiking activity rather than a sharp instruction signal generating the running sequence on a moment-to-moment basis.

Figure 5. Speed-related spiking activity changes more slowly than running speed.

Figure 5

(a) Spike times were systematically shifted forward (yellow) or backward (blue) relative to behavior. (b) Firing rate versus speed (mean ± s.d.) for forward and backward shifts, for the 3 illustrative speed-correlated units shown in Figure 4. (c) Partial correlation coefficients between firing rate and running speed, recalculated for each time shift (median, all recorded units). (d) Partial correlation coefficients between firing rate and running speed computed after randomly jittering the spike times (black) over increasing timescales. Normalized auto-correlation function of instantaneous running speeds (gray; median, all sessions). * and horizontal lines, P < 0.05 vs. central values (vertical dash lines), Kruskall-Wallis and Tukey’s HSD test.

Neuronal activity in the region of the DLS where we performed electrophysiological recordings is known to be sensitive to sensory stimulation of the limbs, trunk, neck, head and whiskers30. To rule out the possibility that position and speed-related modulations of firing rate passively reflect low-level sensorimotor activity that became stereotyped during learning12, we performed recordings in the DLS of “naive” rats (n = 3) hand-guided by the experimenter to perform the running sequence. For this purpose, when the treadmill started, the experimenter used a rectangular plate to gently push the animal towards the rear of the treadmill and to maintain its position for a few seconds. Then the plate was removed and the rats accelerated to cross the treadmill (Fig. 6a). To keep learning minimal, hand-guided animals did not receive rewards during these sessions. Running trajectories were highly similar in the well-trained and hand-guided animals (Fig. 6b). To our surprise the sequential modulation of striatal units observed in well-trained animals (Fig. 3b) was preserved in the hand-guided naive animals (Supplementary Fig. 11). Next, we examined possible differences in neuronal representation of the behavioral variables between well-trained and hand-guided animals. Partial and multiple correlation analyses showed that the correlation coefficients between firing rate and both running speed and position were strongly reduced in the hand-guided animals compared with trained animals (well-trained vs. hand-guided, P < 0.001; Fig. 6c,d). The correlation coefficients between firing rates and time and acceleration were less affected (well-trained vs. hand-guided, acceleration, P = 0.006; time, P = 0.044; Fig. 6c,d), certainly because these coefficients were already low in well-trained animals (Fig. 4g). The decrease in linear representation of speed and position was strong and could not be explained by differences in running speed between hand-guided and well-trained animals (Supplementary Fig. 12). One possibility is that position- and speed-related activities are dampened due to impaired vision by the guiding plate. At the population level, the similar timing and amplitude of striatal firing rate modulations in hand-guided and well-trained animals argue against such possibility (Fig. 3b and Supplementary Fig. 11). To address directly this issue we took advantage of the fact that hand-guided animals sampled the same range of positions on the treadmill with the plate (when guided from front to rear) and without it (when running freely from rear to front). Partial correlation coefficients between firing rate and position, were identical in both conditions (median ± std; r = 0.097 ± 0.1, no plate; r = 0.089 ± 0.07, plate; P = 0.31). Altogether the results obtained demonstrate that position and speed representations are not byproducts of the stereotyped sensorimotor activity.

Figure 6. Position and running speed representations in the DLS are specific of motor habits.

Figure 6

(a) Hand-guided control task. (b) Trajectories (mean ± s.d., single sessions) of a hand-guided (gray) and a well-trained (blue) animals. (c) Partial and multiple correlation coefficients between firing rate and task variables sorted for all the units, in the control task (Rat 5, 15 and 17, solid dots). For comparison, dashed lines indicate scaled distributions of correlation coefficients computed from well-trained animals. (d) Difference in the cumulative distributions of the correlation coefficients in well-trained and hand-guided animals. (e) Recording schedule for Rat 17 (top) and running trajectories during a hand-guided session (a), the first task training session (b), once the sequence is learned (c) and during overtraining (d). (f) Motor accuracy (blue) and task representations in the DLS (black, median of the multiple correlation coefficients) of Rat 17 during all the hand-guided (red) and training sessions. Horizontal lines, P < 0.05 vs. first data point during training, Kruskall-Wallis and Tukey’s HSD test.

The above conclusion was reinforced by the study of one rat for which, following one week of hand-guided sessions, we continued to record neuronal and behavioral activity during 60 training sessions (i.e., no more guiding plate; sucrose delivery for correct trials and long penalty runs for incorrect trials; Fig. 6e). The animal’s behavioral performance was poor during the initial training sessions but improved progressively and the front-rear-front motor sequence was learned (Fig. 6e,f). The strength of task representation in the DLS at different times during training was quantified using the median of the multiple correlation coefficients distributions between firing rates and time, position, speed and acceleration. The strength of task representation was similarly low during the stereotyped hand-guided and early training sessions (Fig. 6f), confirming that the guide plate had minor impact on position and speed representations. Noticeably, task representation increased ~ 20 sessions after the behavioral performance became stereotyped and dominated by rewarded trials (Fig. 6f). Altogether, these results demonstrated that representations in the DLS are not mere byproducts of the increased behavioral stereotypy associated with motor learning but emerge after extensive repetition of the correct motor sequence.

Our electrophysiological results revealed that after learning and extensive practice of a motor sequence, the DLS displayed position- and speed-related signals. The fact that the speed-related signal evolves more slowly than the running speed itself suggests that the DLS may modulate or guide the execution of the motor sequence, but does not generate the whole sequence on a moment-by-moment basis. To test this hypothesis, we transiently perturbed neuronal activity in the DLS of highly trained animals (n = 7 from 4 rats) through bilateral injections of muscimol at a dosage that elicited minimal effects on locomotion (50 ng μl−1, 1 μl per site31; Supplementary Figs. 13 and 15). The variability of the entrance times in the stop area was strikingly and reversibly increased following muscimol injections (Fig. 7a,b), yet there was no systematic change in the mean trajectory across experiments: the overall structure of the running sequence was always preserved (Fig. 7c and Supplementary Fig. 14). This result and the fact that rats in the muscimol condition reached fast running speeds (Supplementary Fig. 15c,d), discount the possibility that decrease in running speed is the cause of the increased variability in entrance times. The similar mean trajectories and number of rewards received (Supplementary Fig. 14a), before and after muscimol injections, suggest that animals were similarly engaged in the task in both conditions. Another possibility is that following muscimol injections animals were confused and did not start properly the task. This was not supported by the fact that average behavior was preserved following muscimol injections (Supplementary Fig. 14d-f). Moreover when we focused on trials during which animals started correctly the sequence the increased variability was still apparent after muscimol injection (Supplementary Fig. 14b,c).

Figure 7. Inactivation of the DLS increases trial-by-trial variability of running sequence execution.

Figure 7

(a) Stop area entrance times (top) and their variance (bottom) for the last 40 trials of single behavioral sessions one day before (black), 10 min after bilateral injections of muscimol (blue, 50 ng in 1 μl) or saline (green, 1 μl) and one day after injection (gray). (b) Group effect for muscimol and saline injections. (c) Average trajectories in a single experiment (same color code than in a). (d) Running speeds aligned relative to stop area entrance times, for one session before (left) and 10 min after (right) muscimol injections. Dashed red lines indicate treadmill speed. (e) Group effect for muscimol injections. Average (f) and variance (g) of the time spent by the animals in the rear portion of the treadmill (holding time). Adjustment index (median ± first and third quartiles) in control and muscimol conditions, after error trials (h, entrance time < goal time) and after trials in which animals entered the stop area at least 3 s after goal time (i). * P < 0.05, Kruskall-Wallis and Tukey’s HSD test (b, c, e, f and g) and Wilcoxon rank-sum test (h, i).

To determine the source of increased variability in entrance times after muscimol injections, speed time-courses were aligned relative to entrance times and plotted over the last 2.5 s preceding the termination of the sequence (Fig. 7d). In control condition the speed time-courses were highly stereotyped and consisted in an increase in running speed followed by a sharp deceleration (Fig. 7d). After muscimol injection the speed time-courses became highly variable and this effect was consistently observed across experiments (Fig. 7d,e). Examination of the speed time-courses during intra-DLS muscimol injection (Fig. 7d) suggested that an alteration in the timing at which rats initiated the final acceleration to cross the treadmill can not, by itself, explain the increased variance in arrival times. To further confirm this, we quantified the time the animals spent in the back of the treadmill and found no systematic change across experiments between control and muscimol conditions (Fig. 7f,g). These behavioral analyses show that both the timing and magnitude of the speed changes are altered after DLS perturbation.

An expected correlate of habitual performance is that, after an occasional error, proficient animals immediately adjust their performance on the next trial. We defined an adjustment index as the difference in entrance time between the error trial and the next trial divided by the size of the error. As expected, in control condition, the indexes were mainly above 1 but after muscimol injection, the indexes dropped (Fig. 7h). Trials in which animals entered in the stop area several seconds after the goal time were not considered as errors but are associated with an increased effort. Well-trained animals rarely performed twice in a row such “long” trials and in control condition the adjustment indexes after “long” trials (entrance time ≥ goal time + 3 s) were in average slightly inferior to 1 s. This immediate behavioral adjustment was impaired after muscimol injections (Fig. 7i). Altogether, after perturbation of DLS neuronal activity with muscimol, the animals’ capacity to perform the running sequence was spared but execution became highly variable. This increased variability was associated with a difficulty of the animals to run at the right speed at the right time and impaired adjustment of performance after incorrect trials.

Discussion

There are two confounding factors when studying the function of the DLS during motor learning. First, striatal spiking activity can both modulate actions32,33 and be modulated by actions27,28, through indirect projections toward motor circuits21 and a wide range of excitatory sensorimotor input20, respectively. Second, during learning, the dynamics of movements and associated sensory stimuli will change and eventually become stereotyped2. Thus, the inference of striatal mechanisms on the basis of changes in spiking dynamics during motor learning is non trivial and has been subject to debate12,18,19. Here we report that during habitual execution of a motor sequence, DLS neurons continuously represent in a combinatorial and linear manner contextual (time and position of the animal on the treadmill) and motor (running speed) information relevant for accurate performance. Importantly the representation of these variables was weak in naive rats performing the same motor sequence under the guidance of the experimenter and developed after learning. Thus, these representations do not merely reflect the stereotyped dynamics of “low-level” task parameters but are a signature of motor habits. Importantly, we found that perturbing these signals spared the average running trajectories but increased trial-by-trial variability and impaired the animals’ capacity to adjust their performance after incorrect trials. Altogether the integrative neuronal representations of speed, position and time that we discovered in the DLS, combined with the type of behavioral impairment observed after its neuronal inactivation, suggest that an important function of this region is to continuously constrain the execution of motor habits.

To control the trajectories of the naive animals during hand-guided sessions, we used a guiding plate which could alter the animals vision and artificially weaken position and speed representations. First, this possibility is unlikely as the visual cortex targets the dorsomedial striatum, not the DLS34. Second, in the hand-guided recording sessions, we found no difference in position representation when we analyzed separately the portion of the trials in which the plate was used (animals guided from the front to the rear portion of the treadmill) and the portion of the trials during which the plate was removed (when animals accelerated to cross the treadmill). Third, at the neuronal population level, the magnitude and temporal profiles of the firing rate modulations were similar during hand-guided and well-trained conditions. This suggests that the striatal units recorded received similar sensorimotor stimulation in both conditions. Forth, in the training sessions that immediately followed the hand-guided sessions, correlations between firing rate and speed and position remained low even if the guiding plate was not used. Finally, it is not obvious how, in well-trained animals, low-level sensory input can account for linear changes in striatal firing rate versus position and speed. For instance, in the three illustrative speed-correlated units shown in Figure 4, spiking activity was not rhythmical and therefore can not be explained by the locomotion-related dynamics of limbs, head or whiskers35,36.

It could also be argued that the position and/or speed correlates are related to reward expectation and that the lack of rewards in the hand-guided sessions is responsible for the weak position and speed representations. Such possibility is not supported by the fact that, at the population level, firing rates were similarly modulated in well-trained and hand-guided animals throughout the entire execution of the task. Additionally, in the recordings performed during learning, many trials were rewarded but position and speed correlates remained weak for several sessions before appearing. Conversely, the fact that in well-trained animals speed and position representations did not drop during the penalty part of error trials (Supplementary Fig. 7) is incompatible with a major contribution of reward expectation. Finally, in our task, the linear correlations between firing rate and speed or position can not be mistaken for reward expectation signals. If this was the case the speed-sensitive cells would not decrease their firing rate at the very end of the task when animals slowed down (Fig. 4a-c and Supplementary Fig. 7). And because rats occupied the front of the treadmill at the beginning and end of the trials, two times at which reward expectation is opposite, position representation can be isolated from reward expectation. In conclusion, our data are not compatible with the possibility that speed- and position-related activities in the DLS of well-trained animals simply reflected the stereotypical structure of movements or reward expectations. They are integrative signals acquired following extensive practice of the running sequence.

What could be the function of these signals? The linear correlations between firing rate and position are likely to provide contextual signals that inform the rat on its position relative to the front and the rear portions of the treadmill which are important spatial landmarks for successful performance37-39. The integration of such signals with a movement-related spiking activity could constrain or modulate the performance of the animal: if running speed is too fast or too slow in a certain portion of the treadmill, the integration of the speed- and position-related striatal activity in downstream brain regions would modulate motor commands either at cortical or subcortical levels. This hypothesis is supported by the fact that the speed-related activity in the DLS surrounds changes in running speed. Importantly, DLS perturbation with muscimol increased trial-by-trial variability of running trajectories, altered the animals’ capacity to modulate their performance after error trials but spared the average “front-rear-front” structure of the sequence. The constraining/modulatory function of the DLS we propose, along with its underlying neuronal mechanism, extends to motor habits a recent model of the role of the basal ganglia in motor learning40. In this model, mainly based on experiments in the songbird, the striatum combines efferent motor copies and context signals to reinforce and modulate motor plans outside the basal ganglia40. We hypothesize that during extensive practice of a learned motor sequence, coincident efferent motor copies and contextual signals in the DLS cause high-level integrative representations of the most important aspects of the learned motor sequence, in the case of our task, the animal’s running speed and position on the treadmill and to a lesser extent time. A constraining/modulatory function of these representations, distinct from action generation or action selection, is in line with the observation that Parkinson’s disease patients perform reaching movements with preserved spatial accuracy but present a specific deficit in speed selection mechanisms41 and by studies in rodents and non-human primates suggesting a specific contribution of the basal ganglia to the vigor of movements12,42. On the other hand, our results contrast with prominent works in rodents that suggested that the dorsal striatum only encodes the beginning and ending of action sequences through movement-independent signals that control actions like traffic lights control road traffic4,6,7. One possible explanation for such discrepancy is that the tasks used (T-maze and sequence of lever-presses) required limited motor control in the middle portion of the sequence. In support of this explanation, when mice were trained to perform faster lever-press sequences, modulation of striatal activity appeared in the middle of the sequence43. Our observation that spiking activity in the DLS is continuously modulated by kinematic and contextual parameters during sequence execution might also seems at odds with studies reporting dominant striatal activity that preceded action initiation or cancellation44,45. Here we provide evidence that continuous task-related signals modulate motor execution and therefore our work is compatible with a striatal race or accumulative model of action control46 which, in the case of tasks requiring brief movements44,45, would occur before action initiation. We further develop this concept by showing that after extended practice of a motor sequence, the DLS can guide movement execution through continuous integration of multiple task-relevant information. While the role of the DLS in habit formation has been known for a long time, our work revealed a straightforward neuronal mechanism to underlie such function and provides a new framework to understand the role of the basal ganglia in motor control.

Online Methods

All experimental procedures were conducted in accordance with standard ethical guidelines (European Communities Directive 86/60-EEC) and were approved by ethical committees (Ministère de l’enseignement supérieur et de la recherche, France, Ref 00172.01; Comité d’Experimentació Animal, Universitat de Barcelona, Spain, Ref 520/08).

Behavioral apparatus

The treadmill (Dattica) measured 80 cm long by 14 cm wide and was surrounded by 50 cm high plexiglas walls. There was no resting platform and the only walkable area was the treadmill belt, which was driven by a brushless digital motor (BG 44 SI, Dunkermotoren) controlled by a custom made program (LabVIEW, National Instruments) and a multifunction computer board (PXI-6254, National Instruments). A 60 W light bulb placed behind the camera illuminated the whole apparatus. One wall of the treadmill was equipped with a liquid well to deliver drops of sucrose solution (black triangle, Fig. 1a). A photodetector positioned at 10 cm from the front wall signaled the first entrance of the animal in the so-called stop area in each trial (black circle, Fig. 1a). A warning sound (1.5 kHz, 65 dB) signaled incorrect early entrances in the stop area.

Behavioral training

Animals were handled (2 h d−1 during 5 days), familiarized to run on the treadmill at increasing speeds and trained to perform the task. During training the treadmill speed was fixed for each animal (35–40 cm s−1, 40–160 trials per session, 1–2 sessions per day). Trials started independently of the animal’s position but after a few training sessions rats spent most of the inter-trial periods in the stop area. We established a criterion of performance accuracy (≥ 72.5 % of correct trials over the last 40 trials, for (≥ 3 consecutive sessions). During training the experimenter was not physically present in the behavioral room.

Animals

Long-Evans rats (n = 15, male, 250–400 g) were housed in pairs (individually after surgery) in stable conditions of temperature (22 °C) and humidity (60 %) with a constant light-dark cycle (12 h:12 h, all experimental procedures were performed during the light phase) and free access to food and water. 5 rats were used for electrophysiological recordings: Rat 1 and 4 completed at least 127 sessions before the start of electrophysiological recordings; Rat 5, 15 and 17 were used in the hand-guided control task and were implanted immediately after the habituation period; after 8 hand-guided sessions, Rat 17 was also trained and recorded in the regular version of the task and reached the learning criterion after 41 sessions. Rat 7, 8, 9 and 10, were over-trained for at least 40 sessions after reaching the learning criterion and then chronically implanted with cannulae in the DLS. Rat 13 and 14 were naive at the time of cannulae implantation and were used to determine the effects of intra-DLS injection of muscimol on locomotion. Rat 2, 6, 7, 9,10, 22 and 24 were used to probe the habitual nature of behavior.

Probing the habitual nature of the learned behavior

Rat 2, 22, 24 were trained to learning criterion and performed at least 25 sessions of overtraining. Then the treadmill was shortened by moving the front wall 30 cm toward the rear portion of the treadmill. Rat 7, 9 and 10 were trained with goal times at 4 s (180 sessions), 5 s (115 sessions) and 4 s (130 sessions), respectively. After training, goal time was changed to 7 s. At different time during training, Rat 6, 7, 9, 10, 22 and 24 were given unlimited access to the sucrose solution during ~ 24 h prior the reward devaluation session. During these sessions sucrose was not delivered at the end of the correct trials.

Implantation of tetrode arrays for unit recordings in behaving animals

Under deep isoflurane anesthesia, a 2-by-4 tetrode array (nichrome wires, 12.5 μm diameter, California Fine Wire) loaded on a NLX-9 micro-drive (Neuralynx) was implanted above the dorsal striatum (Supplementary Fig. 3a), through a craniotomy centered at ML= ± 3.6 mm and AP= + 0.6 mm from bregma. Tetrode tips were gold plated to reduce their impedance to 100–200 kΩ at 1 kHz47. Two miniature screws implanted above the cerebellum served as ground and reference. After recovery from the surgery (1–2 weeks), the tetrodes were lowered toward the DLS (25 to 100 μm d−1).

Data acquisition and processing

Wide band (0.1–8000 Hz) neurophysiological signals from the tetrode arrays were amplified 1000 times via a Plexon VLSI headstage and a PBX2 amplifier and continuously acquired at 20 kHz on two synchronized National Instruments A/D cards (PCI 6254, 16 bit resolution). Spike sorting was performed semi-automatically48 using the clustering software KlustaKwik (http://klustakwik.sourceforge.net) and the graphical spike sorting application Klusters (http://klusters.sourceforge.net)49.

Animal’s position, running speed and acceleration

To determine the position of the animals we used a CCD camera (scA640-70fc, Basler, 60 frame s−1, 9 pixels cm−1) positioned laterally to the treadmill and a fluorescent marker attached to the left forelimb. The marker’s positions were extracted with a custom made program (Vision, National Instruments) and averaged in 400 ms long sliding windows (the average duration of a step cycle). Animals’ speed and acceleration were derived from the distance traveled every 250 ms.

Firing rate modulation during task execution

Trials were aligned relative to treadmill onset and were divided into non-overlapping windows of 250 ms. For each trial, the firing rate in each window (spike count / 0.25) was smoothed with a Gaussian kernel filter of standard deviation 750 ms and instantaneous firing rates were averaged across trials. In a second approach, we computed perievent histograms with multiple points of references31. We restricted our analysis to trials in which the animals performed the archetypical front-rear-front sequence (typically > 70 % of the trials). We automatically selected archetypical trials by using a template matching method based on the averaged front-rear-front sequence performed during the entire session. For each selected trial, we identified 5 consecutive landmarks: 1) light on, 2) treadmill on, 3) reach of the back of the treadmill, 4) acceleration to cross the treadmill, 5) reaching the stop area. Then, for each animal we calculated the average time interval between landmarks 2–3, 3–4, 4–5 across all selected trials. The average time intervals between consecutive landmarks were divided in 250 ms long windows, determining for each animal, a fixed number of windows between landmarks. Next, all the selected trials were divided according to their respective landmarks times and the number of windows between landmarks. Finally, for each unit the instantaneous firing rate was computed, smoothed, transformed into Z-scores and averaged across trials.

Moment-to-moment correlation between firing rate and main task variables

Correlation plots shown in Figure 4 (middle panels) were computed from all the trials, binned in 250 ms long windows, from treadmill onset to treadmill offset. In some cases (Fig. 3, Fig. 6 and Supplementary Fig. 12), we restricted our analysis to trials or portions of trials displaying the archetypical front-rear-front sequence. To avoid biasing our correlation analysis with a minority of data points at the extreme range of the behavioral variables (e.g. lowest and highest speeds), we calculated the distributions of speeds and positions during each session and restricted the analysis to windows for which the value of the variable of interest (speed and position) fell inside the 90th percentile of its distribution. Using the entire speed and position data did not affect the results. We used partial correlation analysis to quantify the specific correlation between firing rate and a single task-relevant variable (position, speed, acceleration or time), once the effects of the others variables have been removed. We found no significant differences when computing Pearson’s and Spearman’s correlation coefficients and we only reported Pearson’s correlation coefficients. To quantify the extent to which the firing rate of the units was linearly correlated with a combination of task variables we used multiple regression analysis. For each session, multicollinearity in the predictor variables (position, speed, acceleration or time) was discarded by calculating the “variance inflation factor” (median 1.1, range 1.0–1.63; accepted values < 5), the “detection-tolerance index” (median 0.89, range 0.61–0.99; accepted values > 0.2) and the “conditional index” of the Belsley collinearity test (median 2.03, range 1–5.67; accepted values < 10; Matlab codes from Brian Lau http://www.subcortex.net/research/code/collinearity-diagnostics-matlab-code). To test if striatal firing rates were generally influenced by more than one variable we compared regression models that took into account a single predictor with models that took into account several (2, 3 or 4) predictors. This was done by computing the Akaike information criterion values for each model.

To test if changes in firing rate preceded running speed changes, for each trial, spike trains were systematically shifted by 100, 250, 500, 1000 or 2000 ms, backward or forward relative to the behavioral data. Then partial correlation coefficients between running speed and firing rate were recalculated for each shift value. To quantify the timescale of the relationship between firing rate and running speed, spikes times were jittered on a trial-by-trial basis and the jitter values were randomly chosen between ± 100, 250, 500, 1000 or 2000 ms. The partial correlation coefficients between speed and firing rate were recomputed for the different jitter ranges. To compare partial and multiple correlation coefficients between hand-guided and well-trained animals we restricted our correlation analysis to trials in which the animals performed the archetypical front-rear-front sequences.

All the results presented were reproduced when using different bin sizes (50, 100, 250, 500, 750 and 1000 ms, with a fixed smoothing window of 750 ms) or different smoothing windows (100, 250, 500, 750 and 1000 ms, with a fixed bin size of 100 ms)

Tuning curves of striatal units for position, speed and acceleration

For each recording session the ranges of position, speed and acceleration values were divided in 25 bins. The size of the bins was 3.6 cm for position, 5.8 cm s−1 for speed and ~ 10 cm s−2 for acceleration. The first and last bins of acceleration were adjusted to exclude the 5 % smallest and highest acceleration values. The total number of spikes fired in each bin was divided by the total time spent by the animal in that bin. The firing rate tuning curves obtained were smoothed with a Gaussian kernel filter whose standard deviation was 4 bins. To test for the significance of these tuning curves we used a shuffling/bootstrap procedure. For each trial, the spike train of the analyzed unit was circularized and the beginning of the trial was randomly chosen. Then, 3 surrogate tuning curves were recomputed (one for speed, one for position and one for acceleration). The operation was repeated 500 times to generate a global band of confidence (the 5 % highest and lowest values of the 500 × 3 surrogates tuning curves). These procedures reduce multiple comparison issue50 and the possibility that the firing rate modulation by a given variable (e.g., speed) is secondary to the modulation of firing rate by another variable (e.g., position and acceleration). Next, we designed a method to directly test if the significant relationship between firing rate and a given behavioral variable of interest (e.g. acceleration), can be accounted by the relationships of another behavioral variable (e.g. speed or position) with both firing rate and the variable of interest. For instance, for the unit shown in Supplementary Figure 10a, the increase in firing rate for maximal deceleration values (right panel), could result from the combined increased firing rate when the animal is in the front portion of the treadmill (Supplementary Fig. 10a) and the fact that maximum decelerations tend to occur at the front of the treadmill when the animal finished the running sequence. Fist, we computed the probability distribution of acceleration conditioned on position. Then acceleration values were pseudo-randomly reassigned on the basis of the probability distribution of acceleration conditioned on the position values. By doing so, the relationship between acceleration and position was preserved but the fine relationship between firing rate and acceleration was altered. Once all the acceleration values were generated, a surrogate tuning curve of firing rate relative to acceleration was recomputed and plotted (thin blue dashed line in left panel of Supplementary Fig. 10g). This operation was repeated 100 times. We considered that if one of the 100 surrogate tuning curves displayed a stronger modulation than the original tuning curve the modulation was not significant. For the tuning curves for acceleration, the entire procedure was performed twice: once to address speed confound and once to address position confound. The same procedure was also performed for tuning curves for speed and position. In the case of the unit in Supplementary Figure 10h, we noticed that the non-linear tuning curve for position (thick blue line, left panel) was largely explained by running speed (thin gray dashed lines, left panel). For all the tuning curves that remained significantly modulated, we systematically subtracted the mean surrogate tuning curve and examined if the correlation coefficient between tuning curve and the variable of interest improved.

Changes in striatal representations during learning

In Rat 17, to increase statistical power, units from 5 consecutive sessions were grouped together (25 ± 5.8 units per group), and we calculated the median value of their multiple correlation coefficients. For each session behavioral accuracy was estimated by computing the Pearson’s correlation coefficients between all the possible pairs of trajectories performed in a single session and averaging all the correlation coefficients.

Reversible inactivation of the DLS in behaving animals

We bilaterally implanted guide-cannulae in the DLS (coordinates relative to Bregma: AP = 0.4 mm, ML= ± 4.0 mm, DV = −3.5 mm) of naive (n = 2) and well-trained animals (> 4 months of training, n = 4) under deep isoflurane anesthesia. The injectors protruded the guide-cannulae by 1 mm. Local injections (1 μl per site at 0.2 μl min−1) were performed 10 min before the behavioral sessions. In naive animals, the GABAA agonist muscimol (Tocris) was diluted in saline and injected at different concentrations (50 ng μl−1, 500 ng μl−1, 1 μg μl−1). The highest dose induced potent motor impairments (rigidity, catalepsy). The two lowest doses had no apparent effect on basic locomotor activity in the home cage. To further characterize the effects of the lowest doses animals were video-recorded while running on the treadmill. During these sessions the treadmill was started every minute during 20 s at a fixed speed of 35 cm sec−1. Two blocks of 30 trials were separated by 30 min of rest in the home cage. Before the first block of trials, animals received bilateral injections of saline and before the second block animals were injected with muscimol. To compare the motor behavior between saline and muscimol injection (Supplementary Fig. 13) we subtracted the distribution of speeds (or positions) obtained during the saline and muscimol blocks. We generated surrogate muscimol and saline distributions by randomly selecting 15 trials from each condition and subtracted the distributions obtained. We repeated this procedure 300 times and determined a global band of confidence (the 5 % highest and lowest values of the surrogate differences).

To quantify the stereotypy of the speed time-courses during a single session, Pearson’s correlation coefficients were computed for all the possible pairs of trials (we restricted our correlation to the last 2.5 s before entrance time). The variance of these correlation coefficients was used as a measurement of running speed stereotypy. We quantified the animals’ ability to adjust their behavior after error trials (arrival time < goal time, “early” trials) and after trials with abnormally late entrance times (arrival time > goal time + 3 s, “late” trials). We defined an “adjustment index” as the difference in entrance times between the early (or late) trials and the next trial, divided by the size of the mistake (goal time − early (or late) arrival time). Our analysis was based on a 93 “early” and 87 “late” trials from 7 muscimol sessions and 224 “early” and 97 “late” trials from 30 control experiments. Data from control conditions (before muscimol, before and after saline injections) were pooled together due to the low number of “early” and “late” trials during these sessions.

Histology

Animals were deeply anesthetized and transcardially perfused with PBS followed by 4 % paraformaldehyde. Cresyl violet staining of coronal sections (60 μm) was performed. To quantify muscimol’s diffusion, we injected 1 μl of a solution containing 1,1′-Dioctadecyl-3,3,3′,3′-tetramethylindocarbocyanine perchlorate (DiI, 50 mM) in two animals.

General statements on data collection and statistics

No statistical methods were used to pre-determine sample sizes but our sample sizes (number of animals and total number of recorded units) are similar to those reported in previous publications4,7,31,38,39. Data collection could not be performed blindly however in most of the cases, the experimenter was not present in the experimental room during data collection. Moreover, analysis of the spiking activity and behavior was performed offline and Matlab programs were run in batch mode on all the data, independently of experimental conditions. The statistical tests used do not assumed normality of data distributions (which were always shown). The behavioral and neuronal data along with the Matlab codes used to generate the figures are available upon request to the corresponding author. A supplementary methods checklist is available.

Supplementary Material

1
sv1
Download video file (1.3MB, mp4)
sv2
Download video file (2.6MB, mp4)

Acknowledgments

We thank Drs John Krakauer, Kamran Diba, Jerome Epsztein, Olivier Manzoni and Henry Martin for excellent discussions and critical reading of this and/or earlier version of the manuscript; Ines Cordon Morillas, Carola Sales-Carbonell and Laetita Lalla for help with training of the animals; Jesus Perez-Ortega for help with LabView Vision; Dr Andrea Brovelli for advice with multiple regression analysis; Robert Martinez for technical help. This work was supported by the Spanish Ministerio de Ciencia e Innovación (Ramon-Y-Cajal program, D.R.) EU-fp7 (International Reintegration Grant, IRG230976, D.R.; Marie Curie International Incoming Fellowship, IIF253873, P.R.-O), INSERM (Avenir Program, D.R.), European Research Council (ERC-2013-CoG – 615699_NeuroKinematics, D.R.) and the Mexican Consejo Nacional de Ciencia y Tecnología (P.R.-O).

References

  • 1.Woodworth RS. The Accuracy of Voluntary Movement. Psychol. Rev. 1899;III:1–120. [Google Scholar]
  • 2.Lee TD. Motor Control in Everyday Actions. Human Kinetics; 2011. [Google Scholar]
  • 3.Bailey KR, Mair RG. The role of striatum in initiation and execution of learned action sequences in rats. J. Neurosci. 2006;26:1016–25. doi: 10.1523/JNEUROSCI.3883-05.2006. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 4.Barnes TD, Kubota Y, Hu D, Jin DZ, Graybiel AM. Activity of striatal neurons reflects dynamic encoding and recoding of procedural memories. Nature. 2005;437:1158–61. doi: 10.1038/nature04053. [DOI] [PubMed] [Google Scholar]
  • 5.Costa RM, Cohen D, Nicolelis MAL. Differential corticostriatal plasticity during fast and slow motor skill learning in mice. Curr. Biol. 2004;14:1124–34. doi: 10.1016/j.cub.2004.06.053. [DOI] [PubMed] [Google Scholar]
  • 6.Jin X, Costa RM. Start/stop signals emerge in nigrostriatal circuits during sequence learning. Nature. 2010;466:457–62. doi: 10.1038/nature09263. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 7.Jog MS, Kubota Y, Connolly CI, Hillegaart V, Graybiel AM. Building neural representations of habits. Science. 1999;286:1745–9. doi: 10.1126/science.286.5445.1745. [DOI] [PubMed] [Google Scholar]
  • 8.Packard MG, McGaugh JL. Inactivation of hippocampus or caudate nucleus with lidocaine differentially affects expression of place and response learning. Neurobiol. Learn. Mem. 1996;65:65–72. doi: 10.1006/nlme.1996.0007. [DOI] [PubMed] [Google Scholar]
  • 9.Yin HH, et al. Dynamic reorganization of striatal circuits during the acquisition and consolidation of a skill. Nat. Neurosci. 2009;12:333–41. doi: 10.1038/nn.2261. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 10.Tang C, Pawlak AP, Prokopenko V, West MO. Changes in activity of the striatum during formation of a motor habit. Eur. J. Neurosci. 2007;25:1212–27. doi: 10.1111/j.1460-9568.2007.05353.x. [DOI] [PubMed] [Google Scholar]
  • 11.Deffains M, Legallet E, Apicella P. Modulation of neuronal activity in the monkey putamen associated with changes in the habitual order of sequential movements. J. Neurophysiol. 2010;104:1355–69. doi: 10.1152/jn.00355.2010. [DOI] [PubMed] [Google Scholar]
  • 12.Desmurget M, Turner RS. Motor sequences and the basal ganglia: kinematics, not habits. J. Neurosci. 2010;30:7685–90. doi: 10.1523/JNEUROSCI.0163-10.2010. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 13.Miyachi S, Hikosaka O, Miyashita K, Kárádi Z, Rand MK. Differential roles of monkey striatum in learning of sequential hand movement. Exp. Brain Res. 1997;115:1–5. doi: 10.1007/pl00005669. [DOI] [PubMed] [Google Scholar]
  • 14.Miyachi S, Hikosaka O, Lu X. Differential activation of monkey striatal neurons in the early and late stages of procedural learning. Exp. brain Res. 2002;146:122–6. doi: 10.1007/s00221-002-1213-7. [DOI] [PubMed] [Google Scholar]
  • 15.Lehéricy S, et al. Distinct basal ganglia territories are engaged in early and advanced motor sequence learning. Proc. Natl. Acad. Sci. U. S. A. 2005;102:12566–71. doi: 10.1073/pnas.0502762102. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 16.Wymbs NF, Bassett DS, Mucha PJ, Porter M. a, Grafton ST. Differential recruitment of the sensorimotor putamen and frontoparietal cortex during motor chunking in humans. Neuron. 2012;74:936–46. doi: 10.1016/j.neuron.2012.03.038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Graybiel AM. Habits, rituals, and the evaluative brain. Annu. Rev. Neurosci. 2008;31:359–87. doi: 10.1146/annurev.neuro.29.051605.112851. [DOI] [PubMed] [Google Scholar]
  • 18.Shmuelof L, Krakauer JW. Are we ready for a natural history of motor learning? Neuron. 2011;72:469–76. doi: 10.1016/j.neuron.2011.10.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Turner RS, Desmurget M. Basal ganglia contributions to motor control: a vigorous tutor. Curr. Opin. Neurobiol. 2010:1–13. doi: 10.1016/j.conb.2010.08.022. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Gerfen CR, Bolam JP. Handb. Basal Ganglia Struct. Funct. Academic Press; 2010. pp. 3–28. [Google Scholar]
  • 21.Winn P, Wilson DIG, Redgrave P. Handb. Basal Ganglia Struct. Funct. Academic Press; 2010. pp. 397–408. [Google Scholar]
  • 22.Shadmehr R, Krakauer JW. A computational neuroanatomy for motor control. Exp. brain Res. 2008;185:359–81. doi: 10.1007/s00221-008-1280-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 23.Yin HH, Knowlton BJ. The role of the basal ganglia in habit formation. Nat. Rev. Neurosci. 2006;7:464–76. doi: 10.1038/nrn1919. [DOI] [PubMed] [Google Scholar]
  • 24.Balleine BW, Dickinson a. Goal-directed instrumental action: contingency and incentive learning and their cortical substrates. Neuropharmacology. 1998;37:407–19. doi: 10.1016/s0028-3908(98)00033-1. [DOI] [PubMed] [Google Scholar]
  • 25.Apicella P, Scarnati E, Ljungberg T, Schultz W. Neuronal activity in monkey striatum related to the expectation of predictable environmental events. J. Neurophysiol. 1992;68:945–60. doi: 10.1152/jn.1992.68.3.945. [DOI] [PubMed] [Google Scholar]
  • 26.Apicella P, Ljungberg T, Scarnati E, Schultz W. Responses to reward in monkey dorsal and ventral striatum. Exp. Brain Res. 1991;85:491–500. doi: 10.1007/BF00231732. [DOI] [PubMed] [Google Scholar]
  • 27.West MO, et al. A region in the dorsolateral striatum of the rat exhibiting single-unit correlations with specific locomotor limb movements. J. Neurophysiol. 1990;64:1233–46. doi: 10.1152/jn.1990.64.4.1233. [DOI] [PubMed] [Google Scholar]
  • 28.Shi LH, Luo F, Woodward DJ, Chang JY. Neural responses in multiple basal ganglia regions during spontaneous and treadmill locomotion tasks in rats. Exp. Brain Res. 2004;157:303–14. doi: 10.1007/s00221-004-1844-y. [DOI] [PubMed] [Google Scholar]
  • 29.Whitlock JR, Pfuhl G, Dagslott N, Moser M-B, Moser EI. Functional split between parietal and entorhinal cortices in the rat. Neuron. 2012;73:789–802. doi: 10.1016/j.neuron.2011.12.028. [DOI] [PubMed] [Google Scholar]
  • 30.Cho J, West MO. Distributions of single neurons related to body parts in the lateral striatum of the rat. Brain Res. 1997;756:241–246. doi: 10.1016/s0006-8993(97)00143-1. [DOI] [PubMed] [Google Scholar]
  • 31.Gage GJ, Stoetzner CR, Wiltschko AB, Berke JD. Selective activation of striatal fast-spiking interneurons during choice execution. Neuron. 2010;67:466–79. doi: 10.1016/j.neuron.2010.06.034. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 32.Freeze BS, Kravitz a. V., Hammack N, Berke JD, Kreitzer a. C. Control of Basal Ganglia Output by Direct and Indirect Pathway Projection Neurons. J. Neurosci. 2013;33:18531–18539. doi: 10.1523/JNEUROSCI.1278-13.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 33.Kravitz AV, et al. Regulation of parkinsonian motor behaviours by optogenetic control of basal ganglia circuitry. Nature. 2010;466:622–6. doi: 10.1038/nature09159. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 34.McGeorge AJ, Faull RLM. The organization of the projection from the cerebral cortex to the striatum in the rat. Neuroscience. 1989;29:503–537. doi: 10.1016/0306-4522(89)90128-0. [DOI] [PubMed] [Google Scholar]
  • 35.Ranade S, Hangya B, Kepecs A. Multiple modes of phase locking between sniffing and whisking during active exploration. J. Neurosci. 2013;33:8250–6. doi: 10.1523/JNEUROSCI.3874-12.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 36.Ledberg A, Robbe D. Locomotion-related oscillatory body movements at 6-12 Hz modulate the hippocampal theta rhythm. PLoS One. 2011;6:e27575. doi: 10.1371/journal.pone.0027575. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 37.Berke JD, Breck JT, Eichenbaum H. Striatal versus hippocampal representations during win-stay maze performance. J. Neurophysiol. 2009;101:1575–87. doi: 10.1152/jn.91106.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 38.Meer M. A. A. Van Der, Johnson A, Schmitzer-torbert NC, Redish AD. Report Triple Dissociation of Information Processing in Dorsal Striatum, Ventral Striatum, and Hippocampus on a Learned Spatial Decision Task. Neuron. 2010;67:25–32. doi: 10.1016/j.neuron.2010.06.023. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 39.Yamin HG, Stern E. a, Cohen D. Parallel processing of environmental recognition and locomotion in the mouse striatum. J. Neurosci. 2013;33:473–84. doi: 10.1523/JNEUROSCI.4474-12.2013. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 40.Fee MS. Oculomotor learning revisited: a model of reinforcement learning in the basal ganglia incorporating an efference copy of motor actions. Front. Neural Circuits. 2012;6:38. doi: 10.3389/fncir.2012.00038. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 41.Mazzoni P, Hristova A, Krakauer JW. Why don’t we move faster? Parkinson’s disease, movement vigor, and implicit motivation. J. Neurosci. 2007;27:7105–16. doi: 10.1523/JNEUROSCI.0264-07.2007. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 42.Wang AY, Miura K, Uchida N. The dorsomedial striatum encodes net expected return, critical for energizing performance vigor. Nat. Neurosci. 2013;16:639–47. doi: 10.1038/nn.3377. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 43.Jin X, Tecuapetla F, Costa RM. Basal ganglia subcircuits distinctively encode the parsing and concatenation of action sequences. 2014 doi: 10.1038/nn.3632. doi:10.1038/nn.3632. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 44.Schmidt R, Leventhal DK, Mallet N, Chen F, Berke JD. Canceling actions involves a race between basal ganglia pathways. Nat. Neurosci. 2013;16:1118–24. doi: 10.1038/nn.3456. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 45.Cui G, et al. Concurrent activation of striatal direct and indirect pathways during action initiation. Nature. 2013;494:238–42. doi: 10.1038/nature11846. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 46.Ding L, Gold JI. The basal ganglia’s contributions to perceptual decision making. Neuron. 2013;79:640–9. doi: 10.1016/j.neuron.2013.07.042. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 47.Ferguson JE, Boldt C, Redish a D. Creating low-impedance tetrodes by electroplating with additives. Sens. Actuators. A. Phys. 2009;156:388–393. doi: 10.1016/j.sna.2009.10.001. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 48.Harris KD, Henze D. a, Csicsvari J, Hirase H, Buzsáki G. Accuracy of tetrode spike separation as determined by simultaneous intracellular and extracellular measurements. J. Neurophysiol. 2000;84:401–14. doi: 10.1152/jn.2000.84.1.401. [DOI] [PubMed] [Google Scholar]
  • 49.Hazan L, Zugaro M, Buzsáki G. Klusters, NeuroScope, NDManager: a free software suite for neurophysiological data processing and visualization. J. Neurosci. Methods. 2006;155:207–16. doi: 10.1016/j.jneumeth.2006.01.017. [DOI] [PubMed] [Google Scholar]
  • 50.Amarasingham A, Harrison MT, Hatsopoulos NG, Geman S. Conditional modeling and the jitter method of spike resampling. J. Neurophysiol. 2012;107:517–31. doi: 10.1152/jn.00633.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

1
sv1
Download video file (1.3MB, mp4)
sv2
Download video file (2.6MB, mp4)

RESOURCES