Abstract
The ability to perform a sequence of movements is a key component of motor skills, such as typing or playing a musical instrument. How the brain binds elementary movements together into meaningful actions has been a topic of much interest. Here, we describe two sequential reaching tasks that we use to investigate the neural substrate of skilled sequential movements in monkeys after long-term practice. The movement elements performed in these tasks are essentially identical, but are generated in two different contexts. In one task, monkeys perform reaching movements that are instructed by visual cues. In the other, the monkeys perform reaching movements that are generated from memory after extended practice. With this behavioral paradigm, we can dissociate the neural processes related to the acquisition and retention of motor skills from those related to movement execution.
Keywords: Motor skills, Sequential movements, Monkey, Visually-guided reaching, Memory-guided reaching, Long-term practice, Behavioral task
Background
In complex motor skills, such as typing or playing a musical instrument, individual movement elements are bound together into a fluid sequence, usually after extensive practice. How the brain acquires the associations between elements and orchestrates the production of the resulting sequence is a central question in neuroscience. The neural basis of sequential movements has been studied using various behavioral tasks in human and non-human primate subjects. For example, non-human primates were trained to perform a sequence of arm movements by manipulating a handle in a predetermined order (e.g., push-pull-turn; Tanji and Shima, 1994) or a sequence of button presses on a board ( Nakamura et al., 1998 ). In humans, the Serial Reaction Time (SRT) task has been widely used to examine procedural learning (Nissen and Bullemer, 1987; Robertson, 2007). In many sequential tasks, however, the subject is required to pause between sequential elements, which may break up the natural fluidity of skilled sequential movements (Nissen and Bullemer, 1987; Nakamura et al., 1998 ; Tanji and Shima, 1994). In addition, sequential tasks are often designed to be mastered within a relatively short period of training compared to the normally prolonged and staged development of motor skills (Tanji and Shima, 1994; Karni et al., 1995 ; Nakamura et al., 1998 ; Dayan and Cohen, 2011; Wymbs and Grafton, 2013). We have devised a variant of the SRT task to overcome these impediments. In our behavioral paradigm, subjects perform the task continuously at their own pace. The design of our behavioral paradigm enables us to detect slow improvement of the motor skills as a change of performance parameters. The learning curve of monkeys for even short sequences extends for substantial amounts of time with continued practice (up to 2 years, Matsuzaka et al., 2007 ). Furthermore, in our task, subjects execute essentially identical movements in two different contexts: memory-guided and visually-guided. While both contexts engage processes related to the production of motor output, only the former engages processes related to motor learning and memory. Thus, our behavioral paradigm allows us to disentangle the substrates for the learning of sequential movements from those for motor execution. Our paradigm has been instrumental in revealing neural properties associated with the acquisition and maintenance of skilled sequential movements in motor areas of monkeys (Picard and Strick, 2003; Matsuzaka et al., 2007 ; Picard et al., 2013 ; Ohbayashi et al., 2016 ; Ohbayashi, 2020).
Materials and Reagents
Monkeys (e.g., Cebus apella or Macaca mulatta)
Liquid for reward (water, fruit juice or diluted juice)
Equipment
-
Primate chair with a neck plate, arm restraints, and perch/waste tray (Figure 1)
These parts easily lock into place after adjustment to accommodate various subject sizes. Chairs with arm restraints are commercially available (e.g., Primate Products, Inc., https://www.primateproducts.com/ or Christ Instruments https://www.cristinstrument.com/).
-
Touch sensitive monitor (elo, https://www.elotouch.com/)
We have used touch sensitive monitors based on infrared and surface acoustic wave technology. While any technology will be adequate for monkey and human performance, it may be a consideration for certain applications.
-
Reward delivery system (bottle, tubing, solenoid and straw)
Our system is gravity-based and controlled by a solenoid (Parker Hannifin Co., NJ) positioned between the liquid bottle and the tube that the monkey drinks from Figure 1. The solenoid is normally closed and opened by a DC pulse sent from the task controller. In our experience, the opening of the valve makes an audible click.
Speaker to generate a tone for feedback of the task performance (i.e., correct or error)
Software
-
Program to control the behavioral task and record task performance
We use the TEMPO Experiment Control System from Reflective Computing Inc. (http://www.reflectivecomputing.com/). Any programmable software to generate video displays, current pulses, and acquire equipment-generated analog or digital signals would be appropriate.
Matlab (Mathworks, https://www.mathworks.com) or other software (e.g., SPSS, Microsoft Excel) for analysis of behavioral data
Procedure
-
General Notes
Monkeys are water-regulated as approved by the IACUC.
The monkeys are weighed and monitored daily. Their body weight should be carefully kept in the range approved by the IACUC throughout the experiment.
The monkey’s non-working arm is restrained to the arm holder which is a part of primate chairs using Velcro straps or an adjustable cuff (e.g., Christ Chair system, available here).
We train the monkeys to perform the tasks using standard reinforcement-based conditioning procedures.
-
General Task Description
The monkeys are trained to perform the Random and the Repeating tasks in alternating blocks. In both tasks, the monkeys are required to make reaching movements to targets displayed on a touch sensitive monitor with their right arm. However, the spatial visual cues are presented according to a different order and timing between the tasks (for details see Procedure C). There is no visual feature to distinguish the Random and Repeating tasks. Here, we describe general features that are common to both tasks.
The outlines of five square targets are displayed on the monitor when the task starts and remain visible as long as the task is running. Targets are arranged in a horizontal row and identified as numbers 1 to 5 from left to right (Figures 1 and 2A). The middle target is positioned at the horizontal center of the monitor. Because of the considerable size difference between Macaques and Cebus monkeys, target size varies depending on the species trained. For Cebus monkeys, the size of the targets is 37 mm. The targets are separated by 10 mm. For Macaque monkeys, the size of the targets is 54 mm. They are separated by 11 mm. With these parameters, the monkeys use most of their range of reaching in the frontal plane for task performance.
When the task starts, a trial is initiated and one of the targets is filled with yellow color (Figure 2).
To make a correct response, the monkey is required to contact the yellow target within 800 ms after its coloring.
A trial ends at the time of a new contact on the monitor or 800 ms elapse.
Immediately after the animal's contact on the monitor, the yellow fill disappears, and a new trial starts.
The task controller generates a feedback sound for each response (correct: 1 kHz tone, 100 ms duration; error: 50 Hz tone, 100-200 ms duration). In the case of errors or no response, the trial is immediately repeated.
The task controller generates a reward after a number of correct responses. Initially, the monkey receives a reward for every correct response. When the monkey understands the rules of the task, the reward frequency is gradually decreased to every 4-5 correct responses. The rate of reward is adjusted for each animal and depending on sequence length in the Repeating task so that the rewards are not associated with particular sequence elements (e.g., every 4th for a 3-element sequence). A current pulse generated by the task controller opens the solenoid in the reward delivery system for a period of time. This duration varies depending on the setup of the reward delivery system (i.e., height of the water bottle, tubing size).
Once the training session is initiated, the monkeys typically perform the task, touching one target after another without interruption, until satiety.
-
The Random and the Repeating Tasks
In the Random task, new targets are presented according to a pseudo-random sequence 100 ms after a correct response (Figures 2A and 3A). With this short delay, the monkeys have no time to try to guess the location of the next target. Therefore, in the Random task, the monkey performs visually-guided reaching.
In the Repeating task, new targets are presented according to a predetermined repeating sequence (e.g., 5-3-1-5-3-1 ... and 1-2-4-1-2-4 ... or 2-3-4-2-3-4 ...) (Figures 2C-2D). The number of movement elements in a sequence can be modified according to the experiment objective. We trained monkeys successfully on sequences of 3-12 elements in length. In the Repeating task, new targets are presented 400 ms after contact of the correct target (Figures 2B-3B). The monkeys are allowed to touch the new target during the 400 ms delay, before the presentation of the visual cue (yellow fill). When this happens, the current trial ends without coloring the target, the trial is recorded as correct and the task is incremented to the next trial/target. The delay promotes the performance of predictive responses in which the animal anticipates the next target in a sequence (Figures 2B right panel, 3C), but does not inhibit the flow of movements in a learned sequence. With practice, the monkey performs internally-guided reaching.
Each task is performed continuously in alternating blocks of 200-500 trials until the monkey stops working for a total usually in the order of 5,000 (Cebus) or 12,000 (Macaque) trials (Figure 3D). Monkeys are supplemented with water if they do not obtain enough during training.
-
Training Schedule
We first train the monkey on the Random task. Monkeys become proficient in the performance of the Random task after about 50 days of practice. The monkeys make correct responses in > 80% of trials within 10 training sessions. In the Random task, performance parameters (e.g., RT) stop improving after approximately 50 training sessions.
We then introduce the 1st sequence of the Repeating task. From then on, the monkey practices the sequence of the Repeating task and the Random task on every training session in alternating blocks of trials (Figure 3D). There is no interruption or visual distinction between blocks of Repeating and Random tasks.
New sequences of the Repeating task can be introduced at various time points in training. On the 1st day of the introduction of the sequence, after a block of the Random task, the task is switched to the Repeating task of a new sequence. There is no cue suggesting the change of the tasks or a new sequence. The monkey learns the sequence on his own by producing it repetitively.
Experimental manipulations can be performed at various time points during training depending on the objectives of the experiment.
In our experience learning curves are similar across individuals and primate species ( Matsuzaka et al., 2007 ; Obayashi and Picard, unpublished observations). For a sequence of three elements, monkeys (Cebus, rhesus) make predictive responses in more than 80% trials after approximately 50 training sessions. For a sequence of twelve elements, monkeys (rhesus) make predictive responses in more than 80% trials in 120-190 training sessions.
Data analysis
For every trial, we record various task parameters and measures of performance. Recorded performance measures are: correct response, wrong hit error, no hit error or corrective response (correct responses that followed an error). From the times of touch screen hits, we derive the Movement Time (MT), Target Hold Time (THT), and Response Time (RT) associated with each response. We define MT as the interval between the release of contact from one target to the time of touch of the next target. We define THT as the interval between the contact of a target and its release (in the next trial). We define RT during the Random task as the time between the presentation of a new target and contact of that target. RT during the Repeating task is defined as the time between two targets touches minus the delay time, 400 msec. This results in a negative RT when the monkey contacts the next target in the sequence before its coloring (Figure 3C). RT less than 150 ms is considered to be predictive ( Ohbayashi et al., 2016 , Ohbayashi, 2020). This value represents a conservative cut-off for predictive responses as it is too fast for a reaction time to the visual cue. We excluded the following trials from analysis: 1) corrective responses, because in this case the target is predictable as the error trial is repeated; 2) no-hit error responses, because it’s impossible to determine what caused the monkey not to respond in the allowed time frame (e.g., low motivation, distraction, or genuine hesitation). We perform a movement-based analysis of performance and experimental data. Data obtained for movements of the Repeating task are best compared with data for the corresponding movements performed during the Random task (e.g., data for movement 5 to 3 during the Repeating task vs data for movement 5 to 3 during the Random task). Because movements of the Repeating sequence are a subset of the movements of the Random task, more Random trials may be necessary to obtain a sufficient sample for analysis.
Acknowledgments
This protocol was adapted from Ohbayashi (2020). The work has been supported in part by National Institutes of Health Grants R21NS101499 to M.O. and the Brain Sciences Project of the CNSI & NINS BS291006 to MO.
Competing interests
The authors declare no competing financial interests.
Ethics
The care of the monkeys and the experimental protocols adhered to the National Institutes of Health Guide for the Care and Use of Laboratory Animals and the U.S. Public Health Service Policy on Humane Care and Use of Laboratory Animals. All procedures used followed institutional guidelines and were approved by the Institutional Animal Care and Use Committee.
Citation
Readers should cite both the Bio-protocol article and the original research article where this protocol was used.
References
- 1. Dayan E. and Cohen L. G.(2011). Neuroplasticity subserving motor skill learning. Neuron 72(3): 443-454. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2. Karni A., Meyer G., Jezzard P., Adams M. M., Turner R. and Ungerleider L. G.(1995). Functional MRI evidence for adult motor cortex plasticity during motor skill learning. Nature 377(6545): 155-158. [DOI] [PubMed] [Google Scholar]
- 3. Matsuzaka Y., Picard N. and Strick P. L.(2007). Skill representation in the primary motor cortex after long-term practice. J Neurophysiol 97(2): 1819-1832. [DOI] [PubMed] [Google Scholar]
- 4. Nakamura K., Sakai K. and Hikosaka O.(1998). Neuronal activity in medial frontal cortex during learning of sequential procedures. J Neurophysiol 80(5): 2671-2687. [DOI] [PubMed] [Google Scholar]
- 5. Nissen M. J. and Bullemer P.(1987). Attentional requirements of learning: Evidence from performance measures. Cogn Psychol 19(1): 1-32. [Google Scholar]
- 6. Ohbayashi M.(2020). Inhibition of protein synthesis in M1 of monkeys disrupts performance of sequential movements guided by memory. Elife 9: 53038. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7. Ohbayashi M., Picard N. and Strick P. L.(2016). Inactivation of the dorsal premotor area disrupts internally generated, but not visually guided, sequential movements. J Neurosci 36(6): 1971-1976. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8. Picard N., Matsuzaka Y. and Strick P. L.(2013). Extended practice of a motor skill is associated with reduced metabolic activity in M1. Nat Neurosci 16(9): 1340-1347. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9. Picard N. and Strick P. L.(2003). Activation of the supplementary motor area(SMA) during performance of visually guided movements. Cereb Cortex 13(9): 977-986. [DOI] [PubMed] [Google Scholar]
- 10. Robertson E. M.(2007). The serial reaction time task: implicit motor skill learning? J Neurosci 27(38): 10073-10075. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11. Tanji J. and Shima K.(1994). Role for supplementary motor area cells in planning several movements ahead. Nature 371(6496): 413-416. [DOI] [PubMed] [Google Scholar]
- 12. Wymbs N. F. and Grafton S. T.(2013). Contributions from the left PMd and the SMA during sequence retrieval as determined by depth of training. Exp Brain Res 224(1): 49-58. [DOI] [PMC free article] [PubMed] [Google Scholar]