Skip to main content
Proceedings of the National Academy of Sciences of the United States of America logoLink to Proceedings of the National Academy of Sciences of the United States of America
. 2008 Dec 1;105(49):19486–19491. doi: 10.1073/pnas.0808113105

Functional network reorganization during learning in a brain-computer interface paradigm

Beata Jarosiewicz a,b,1,2, Steven M Chase a,b,c,1, George W Fraser a,b, Meel Velliste a,b, Robert E Kass b,c, Andrew B Schwartz a,b,3
PMCID: PMC2614787  PMID: 19047633

Abstract

Efforts to study the neural correlates of learning are hampered by the size of the network in which learning occurs. To understand the importance of learning-related changes in a network of neurons, it is necessary to understand how the network acts as a whole to generate behavior. Here we introduce a paradigm in which the output of a cortical network can be perturbed directly and the neural basis of the compensatory changes studied in detail. Using a brain-computer interface, dozens of simultaneously recorded neurons in the motor cortex of awake, behaving monkeys are used to control the movement of a cursor in a three-dimensional virtual-reality environment. This device creates a precise, well-defined mapping between the firing of the recorded neurons and an expressed behavior (cursor movement). In a series of experiments, we force the animal to relearn the association between neural firing and cursor movement in a subset of neurons and assess how the network changes to compensate. We find that changes in neural activity reflect not only an alteration of behavioral strategy but also the relative contributions of individual neurons to the population error signal.

Keywords: brain-machine interface, neural prosthetics, perturbation, population vector algorithm


A wealth of evidence associates learning with changes in neural activity (for reviews, see refs. 14). For example, the tuning functions of neurons in the motor cortex can change when monkeys adapt to perturbations that interfere with the execution (57) or visual feedback (810) of their movements. Although the observed changes in neural activity in these studies are closely associated with the behavioral manifestation of learning, it is difficult to interpret the behavioral significance of the neural changes because the precise relationship between neural activity and behavioral output is unknown. However, recent developments in brain-computer interface technology now make it possible to control the activity of a cursor in a three-dimensional (3D) virtual environment using the spiking activity of ensembles of simultaneously recorded motor cortical units (1116). This “brain-control” paradigm is unique in that the behavior, cursor movement, is solely the result of neural activity in the population under study. Thus, any mismatches between desired cursor motion and decoded cursor motion can only be corrected by altering the activity of these recorded neurons.

The brain-computer interface also allows for a unique kind of perturbation that targets selective subsets of neurons. By altering the way that the firing activity of a subset of neurons is decoded and mapped into cursor movement, it is possible to test whether the tuning functions of the perturbed subset selectively change to compensate for the global error signal, or whether the entire population changes together.

To study the response to selective perturbations of this neural network, monkeys were trained to perform center-out movements to eight equally spaced targets in a 3D virtual environment under brain control, using the population vector (PV) algorithm (1618). Once control had stabilized, the decoding algorithm was perturbed by rotating the tuning functions of a randomly chosen subset of units by 90° about a common axis, creating a global visuomotor rotation in decoded cursor movement. In addition to the expected global compensation for the resulting error in cursor movement, we found that the perturbed units underwent larger compensatory changes in tuning than the unperturbed units.

Results

Monkeys implanted with arrays of chronic recording electrodes were trained to perform a 3D center-out task in virtual reality under brain control (Fig. 1 A and B). To establish a mapping from neural activity to cursor motion, each recorded unit was fitted with a cosine-tuning funtion (1921) centered on its preferred direction (PD) of movement. Intended velocity was then estimated using the PV algorithm as the sum of each unit's PD weighted by its normalized firing rate. Each experiment consisted of the following sequence of four sessions (Fig. 1C): calibration, control, perturbation, and washout. The calibration session was used to estimate the tuning function of each recorded unit; on average, 40 units were used per session (range: 23–60). PDs obtained from the calibration session were fixed and subsequently used for decoding. (Henceforth, the PDs used for decoding will be called dPDs, to distinguish them from the PDs measured during other sessions.) In the control session, the center-out task was performed under brain control using the dPDs obtained from the calibration session. In the perturbation session, the dPDs of a randomly selected subset of units were reassigned, causing this subset of rotated units to contribute different directional components to the population vector than they did in the control session. For this subset, new dPDs were created by rotating the original dPDs by 90° about a common axis (Fig. 1D). The nonrotated units kept their original dPDs. In the washout session, the perturbation was removed and the original dPDs were reinstated. Two sets of experiments were performed, differing in the number of units randomly selected for perturbation. In the first set of experiments, 25% of the units were rotated during the perturbation session; in the second, 50% were rotated.*

Fig. 1.

Fig. 1.

Experimental design. (A) Schematic of the center-out brain-control paradigm. Monkeys were required to move a spherical cursor from the center of an imaginary cube in 3D virtual reality (VR) to a spherical target appearing at one of its eight corners. (B) Average trajectories to each target during one control session. Axes are inset for reference. (C) Timeline of recording session in one data set (left to right). (D) Schematic dPD rotation for a z-axis perturbation. The original dPDs (black dots) of a subset of units were rotated 90° to create the reassigned dPDs (red ends of comets). (E) Mean (± SE) of the cursor deviation in the direction of the perturbation, evaluated when the cursor moved half the distance to the target. Separate bars are shown for control (C), early perturbation (EP), late perturbation (LP), early washout (EW), and late washout (LW). The top panel shows the results from experiments in which 25% of the units were rotated; the bottom panel shows the results from experiments in which 50% were rotated. (F) Average trajectories from all experiments in which 50% of the units were rotated. To average trajectories from different targets, each trajectory was rotated into a common space, where the x-axis represents movement toward the target and the y-axis represents movement orthogonal to the target in the direction of the applied perturbation (20). Trajectories are collapsed along the z-axis, which represents movement orthogonal to both the target and the perturbation (noise). For information on movement times, see Table S1.

Average trajectories (Fig. 1 B, E and F) indicate that cursor movements were accurate during the control session, but deviated toward the applied perturbation immediately after the onset of the perturbation session. To analyze performance changes in the perturbation session, trials were divided into early and late sessions, consisting of the first and last five repetitions to each target, respectively. Early in the perturbation session, the average deviation in the perturbation direction was 6.5 ± 0.65 mm for experiments where 25% of the units were rotated (15.8 ± 0.60 mm for the 50% experiments). In both cases, the deviation was significantly reduced in the late perturbation session (average deviation, 25% experiments: 3.3 ± 0.64 mm, P = 0.0006, paired t test; 50% experiments: 12.7 ± 0.64 mm, P = 0.0005). These trends reversed in the washout sessions: when the perturbation was removed, there was a clear after-effect in the cursor trajectory that decreased over time.

Three possible strategies that could be used to compensate for the errors caused by the perturbation are re-aiming, re-weighting, and re-mapping (Fig. 2). Re-aiming compensates for the applied perturbation by aiming for a virtual target located in the direction that offsets the visuomotor rotation (810). This is a global strategy, in that all units in the recorded population (both rotated and nonrotated) would have discharge rates that indicate the monkey's intended movement to a different, virtual target. This solution is suboptimal, however; the units active for a given target will, because of the perturbation, have dPDs that are highly dispersed (see Fig. 2C). This will tend to shorten the population vector, leading to a slower movement. Re-weighting minimizes error by suppressing the use of the rotated units: that is, by reducing their modulation depths so that they contribute less to the PV (see Fig. 2D). This is a local strategy, in that the brain must first identify these “noisy” units and then quiet them to remove their contribution to the PV. This may also be inefficient, as it leaves fewer units available to drive the cursor. The ideal solution is re-mapping: that is, selectively changing the directional tuning of the rotated units to match the way they are being used by the decoder (see Fig. 2E). We find elements of all three strategies in the neural responses.

Fig. 2.

Fig. 2.

Schematic of the influence of the perturbation on the population vector and of three possible compensation mechanisms. (A) Population vector before perturbation. When moving toward the target shown in blue, units with preferred directions pointing toward the target will fire above their baseline rates. The gray lines represent vectors pointing toward the dPD of each unit, with the length of the vector scaled by the neuron's normalized firing rate. The sum of these vectors is the population vector, shown in black. On average, this population vector will point straight toward the target. (B) After the perturbation, the same units shown in (A) will be recruited, but some will contribute differently to the population vector because of their rotated dPDs. Rotated units are shown in pink. The result is a population vector that does not point at the target. (C) Re-aiming. By aiming at the virtual target (dotted blue circle), a different set of neurons is recruited with PDs that point toward the virtual target. The net contribution of the rotated and nonrotated units will cause the population vector to move straight toward the actual target. Note that the length of the population vector is shorter than in (A) because the units that contribute to it have more dispersed dPDs. (D) Re-weighting. By selectively reducing the contribution of the rotated units (down-modulating their firing rates), the population vector straightens toward the target. (E) Re-mapping. By recruiting only unperturbed cells that point toward the target and perturbed cells that point 90° from the target, the population vector can be made to point directly at the target with its original length.

To differentiate among these possibilities, we fitted cosine tuning curves to the spike rate data collected during the control session and compared them to cosine tuning fits from spike rate data in the perturbation session. Firing rates were measured over the 200-ms interval ending when the cursor had moved half the distance to the target, in an attempt to give the monkey enough time to perceive the target direction and modulate its motor cortical unit activity according to its intended movement direction, but not enough time to re-aim the cursor to compensate for errors in its trajectory. To try to capture postlearning responses in the perturbation session, the first five movements to each target were removed from analysis.

To test for the use of a re-weighting strategy, we compared the modulation depths of the rotated and nonrotated sets of units across sessions.§ We found that, on average, the modulation depths (m) of the rotated units showed a significant decrease from the control to the perturbation session (Δm = −0.84 ± 0.17 Hz, P < 10−6, weighted t test) (see the SI Appendix), whereas the nonrotated units did not (Δm = −0.19 ± 0.11 Hz; P = 0.10). The difference between the rotated and nonrotated Δm's was significant (P = 0.001, weighted paired t test). Thus, on average, the relative contribution of the rotated units to the cursor movement was smaller during the perturbation session than it was during the control session. The difference between the rotated and nonrotated units diminished to a nonsignificant level in the washout period (comparing control to washout, ΔmNon-Rotated − ΔmRotated = 0.34 ± 0.20 sp/s, P = 0.09).

An analysis of the PD shifts revealed two results (Fig. 3). First, ignoring differences between rotated and nonrotated units, most points were shifted toward the applied perturbation, indicating that both rotated and nonrotated units shifted to accommodate the cursor deflection (25% experiments: ΔPD = 6.4 ± 0.48°; 50% experiments: ΔPD = 18.9 ± 0.89°; both were highly significant with P < 10−10, weighted t test). This is consistent with a global re-aiming strategy: if the monkeys had been aiming for a virtual target rotated in the opposite direction of the perturbation, regressing the spike rates against the direction of the actual targets would give PDs that point in the direction of the applied perturbation. The second result of this analysis is that the rotated units showed a greater shift toward the applied perturbation than the nonrotated units (see Fig. 3 A–D), which is consistent with a local re-mapping strategy (25% experiments: ΔPD = 4.61 ± 1.10°, P = 0.00003, weighted paired t test; 50% experiments: ΔPD = 5.59 ± 1.77°, P = 0.002). During the washout session (see Fig. 3 E and F), the difference between the rotated and nonrotated units disappeared (25% experiments: ΔPD = 0.09 ± 1.08°, P = 0.93, weighted paired t test; 50% experiments: ΔPD = −0.43 ± 1.36°, P = 0.75).

Fig. 3.

Fig. 3.

PDs shift during the perturbation session. (A and B) Shift in the PD measured during the perturbation session relative to the control session for all units in experiments where 25% (A) or 50% (B) of the units were rotated. Small dots represent the individual data points and large dots represent the means of the rotated (red) and nonrotated (blue) groups. The intensity of each point is proportional to the certainty of the estimate for that point (see SI Appendix). (C and D) Empirical cumulative distribution functions (CDFs) of the PD shift along the direction of applied perturbation. In each case, the CDF of the rotated group is shifted significantly to the right of the CDF of the nonrotated group. (E and F) Same as (A) and (B) for the washout session. The nonrotated group's mean is obscured by the rotated group's mean.

Although we found that the rotated units showed significant tuning changes relative to the nonrotated units, the differences were subtle. How much impact did these changes have on actual cursor movement? Because the population vector algorithm is linear, the movement of the cursor under brain control can be decomposed into a weighted sum of two trajectories: one from the rotated population and one from the nonrotated population (see the SI Appendix). This decomposition allows us to track the errors due to each subpopulation of units over time. Fig. 4 shows the perturbation-induced error of the trajectory components averaged over all experiments in which 50% of the units were rotated (see also Fig. 5). In the early trials (see Fig. 4A), error began to increase 125 ms after the target presentation, corresponding to the average reaction time. Between 125 ms and 400 ms, the rotated population caused the cursor to deviate away from the target while the nonrotated population did not contribute to error, indicating that the monkey was aiming directly for the target. The error attributable to the nonrotated population appeared 400 ms into the movement and was in the direction opposite that of the rotated units. We interpret this as evidence of a visual correction to the cursor deviation. After learning (see Fig. 4B), the nonrotated population responded immediately at the beginning of each movement, indicating that re-aiming had been incorporated as part of the motor plan.

Fig. 4.

Fig. 4.

Trajectory error contributions from the rotated and nonrotated populations of units as a function of time. The y-axis in each plot is the perturbation-induced error: that is, the deviation of the cursor from the ideal straight-line trajectory, in the direction of the applied perturbation. (A) Perturbation-induced error during the early part of the perturbation session. The black line denotes the actual cursor trajectory, the red line denotes the component of that trajectory arising from the rotated units, and the blue line denotes the component arising from the nonrotated units. Colored arrows indicate the time that the corresponding population first deviates significantly from baseline (according to a two-sided t test). (B) Same as (A), for trajectories recorded during the late part of the perturbation session. The early trajectories are shown as dotted lines for reference. (C) The difference in deviation from the early perturbation trials to the late perturbation trials for each population.

Fig. 5.

Fig. 5.

Comparison of trajectories between the early (dashed) and late (solid) perturbation sessions. To look at learning-related changes, we compared the trajectories from the nonrotated population (blue) to the trajectories that would have resulted from the rotated population if the perturbation had not been applied. (To compute this, we used the firing rates from the rotated population to construct a population vector with the decoding parameters from the control session.) In the early perturbation session, the nonrotated trajectory and the trajectory that the rotated population would have had, if not perturbed, overlap both in time (A) and in space (B). Late in the perturbation session, they overlap for the first 400 ms or so, and then they begin to diverge.

There was a difference in the learning profiles of the rotated and nonrotated units (see Fig. 4C). The nonrotated population showed the greatest learning effect in the middle of the movement; at the end of the movement, the magnitude of the deviation attributable to the nonrotated units was the same in the early perturbation session as in the late perturbation session. In contrast, the rotated population showed the greatest learning effect at the end of the movement (the average rotated-population trajectory deviation at the end of the movement in the early perturbation session was 59 ± 1.4 mm; in the late perturbation session it was 52 ± 1.6 mm; paired t test, P = 0.0004).

While it is difficult to precisely ascribe the amount of error correction to each of the compensation mechanisms outlined in Fig. 2, these temporal differences make it clear that the subtle re-weighting and re-mapping effects combined to have a real, functional impact on cursor trajectories. Further work will be required to elucidate exactly how each mechanism contributes to error compensation.

Discussion

Everyday behavior requires constant updating between intended and executed movements. For example, reaching for and lifting an object makes use of knowledge about its weight, center of mass, deformability, and surface characteristics. Interaction with the object then leads to an update of this knowledge (i.e., learning), which results in a change in the way the movement is performed. The very large number of cells participating in the network responsible for movement generation, and the huge number of interconnections between those cells, make it difficult to discover neural principles of learning. The brain-computer interface paradigm provides a fully observed neural network that, by experimental construction, drives the behavioral output in the context of a task with a straightforward learning objective.

In these experiments, we applied a unique type of perturbation by changing the way a subset of the recorded neurons mapped to cursor movement. The net result of this perturbation was a visuomotor rotation between the desired cursor movement and the decoded movement, and the dominant response was a global change in the activity level of all of the neurons, indicating that the monkey was re-aiming to counter the applied rotation.

On top of this global response was a local change specific to the perturbed subpopulation of neurons: the rotated subpopulation decreased their modulation depths and shifted their PDs more than the nonrotated subpopulation. Furthermore, the tuning of the two populations evolved in different ways as a function of time through the trial (see Figs. 4 and 5). How could the contributions of particular units to the population vector be recognized using only one global feedback signal, the movement of the cursor? One possibility is that learning occurs in a desired-movement space that has more dimensions than the three that identify direction (2223). For example, it has been suggested that motor cortical neurons change their PDs in a posture-dependent manner (24). If this were true in our neurons, it is likely that the rotated and nonrotated populations would include different postural sensitivities, and learning to associate particular (imagined) arm postures with various directions of movement could result in differential changes between the two populations of neurons. Another possibility is that these neurons solve the “credit-assignment problem” described in the artificial intelligence literature (2526). By using a form of Hebbian learning (27), each neuron could reduce its contribution to error independently of other neurons via noise-driven synaptic updating rules (2830). Regardless of mechanism, this study suggests that the brain is capable of at least partial selectivity in its modification of neural activity when only a global feedback signal is available.

Methods

Behavioral Paradigm and Recording.

Three male Rhesus monkeys (Macaca mulatta), ≈3 to 4 years of age, were used in these experiments. For behavioral training, monkeys sat in a primate chair facing a mirror that reflects a 3D image from a stereoscopic computer monitor in a periscope-like design, allowing the space in front of the monkey to remain available for free arm movements. Before electrode implantation, the monkey was trained to move its hand, fitted with an optical marker, to proportionally move a spherical cursor from the center of an imaginary cube to targets appearing at one of its eight corners for a liquid reward (31). All procedures were performed in accordance with the guidelines of the Institutional Care and Use Committee of the University of Pittsburgh.

Once the monkeys became proficient at this 3D center-out reaching task, they were implanted with chronic recording arrays. Two of the monkeys (Monkeys O and P) were implanted with four or more 16-channel intracortical electrode arrays (50-μm Teflon-coated tungsten wires, arranged in 2 × 8 grids with 300-μm spacing), while the other (Monkey A) was implanted with one 96-channel array (Cyberkinetics Neurotechnology Systems, Inc.). All implantations were visually placed in the proximal arm area of primary motor or premotor cortex. Recordings were amplified, filtered, and sorted on-line with a 96-channel Plexon MAP system (Plexon Inc.).

Some of the units recorded were well-isolated single cells and some contained two or more cells that couldn't easily be isolated from one another, but which were nevertheless tuned to intended movement direction as a group. Once an adequate number of units were obtained, the monkey was trained to perform the center-out task, using “brain control” while both of its arms were restrained, moving the cursor by modulating the spiking activity of the recorded units as detailed in the next section, Decoding Algorithm for Brain Control. Once the monkeys became proficient (> 70% success rate) at the brain-control, center-out task with the final settings (diameter of cursor and target ≤2.5 cm, width of imaginary cube, 11 cm), they began participating in the experiments. On each recording day, a calibration brain-control session (4–10 cycle sets) was run to obtain each unit's original PD, baseline rate, and modulation depth. These original parameters were then fixed and subsequently used for decoding, as described in the next section. The control session (10–30 cycle sets) was run using the original dPDs (“decoding” PDs), the perturbation session (24–50 cycle sets) was run using the reassigned dPDs for the rotated subset of units, and the washout session (20 cycle sets, or until the monkey stopped working) was run with the original dPDs reinstated. A typical recording session lasted ≈3 hours.

Decoding Algorithm for Brain Control.

The PV algorithm used for brain control (12, 18) assumes that each unit is cosine-tuned with intended movement direction (17):

graphic file with name zpq04908-5833-m01.jpg

where f is the firing rate of a model unit, b0 is its baseline firing rate, m is its modulation depth, and θ is the angle between the direction of intended movement and the unit's PD.

We adapted the PV algorithm described in refs. 12 and 18 as follows: The cosine tuning fit of each unit was calculated by regressing the firing rate of the unit to the target direction. The regression model was:

graphic file with name zpq04908-5833-m02.jpg

where Yki is the firing rate of unit i in the kth trial, εki is noise, d = (dkx,dky,dkz) is the unit-vector pointing in the direction of the target in the kth trial, and the β's are the regression coefficients. The baseline firing rate b0i was estimated as β0i, the modulation depth mi as the magnitude of the vector β→ = (βxiyizi), and the preferred direction pi as β→/mi.

Once the baseline firing rate, modulation depth, and preferred direction for decoding had been estimated for each unit, they were used to construct a movement estimate in real time as follows: Spike counts were measured at a regular rate (30 Hz for Monkeys O and P, 60 Hz for Monkey A) and converted to rates fi(t) by dividing by the sampling interval. Normalized rates ri(t) were computed through the equation

graphic file with name zpq04908-5833-m03.jpg

These normalized rates were then smoothed with a five-point boxcar filter. The filtered, normalized rates were converted to cursor velocity, v(t), as:

graphic file with name zpq04908-5833-m04.jpg

where nD is the number of movement dimensions (in our case, 3) and N is the number of units being used for decoding. The speed factor, ks, converts the magnitude of the PV from a normalized range to a physical speed in mm/s (typical values ranged from 75 to 200). Only units whose modulation depth exceeded a given cutoff (usually, 4–5 Hz) were used to construct cursor movements; this cutoff typically eliminates half to two-thirds of the units we record. Finally, cursor position, Cp, was updated every sampling interval as

graphic file with name zpq04908-5833-m05.jpg

Trajectories always started at the origin.

Trajectory Analysis.

To increase statistical power and help remove “drift” (bias caused by nonuniform distributions of dPDs, for example), trajectories to each target were first rotated into a common frame of reference where targets and errors were in similar directions. In this reference frame, r was defined to be the unit-vector pointing from the center of the cube toward the target, ŷr was defined as the unit-vector orthogonal to r pointing toward the direction of the applied perturbation, and the r was defined as the cross product of r and ŷr, where the r subscript indicates values in the reference coordinate frame. To perform this rotation, every point pr = (i,j,k) in the trajectory was multiplied by a rotation matrix R to form the new trajectory pr = pR. R was created as the matrix whose columns were the new desired axes. Thus, for a target located at t = (tx,ty,tz) and dPDs rotated clockwise by 90° about the z-axis:

graphic file with name zpq04908-5833-m06.jpg

where the prime symbol denotes the transpose operation and |a| denotes the norm of a.

In this new reference frame, movement along the xr-axis represents movement toward the target, movement along the yr-axis represents error in the direction of the applied perturbation, and movement along the zr-axis represents error orthogonal to the applied perturbation.

Average cursor trajectories were obtained by smoothing and combining individual trajectories as follows: To combine trajectories of different durations, the time axis of each trajectory was uniformly scaled to the mean movement duration, τAvg. That is, the time samples for an individual movement t of duration τ were scaled by a gain factor α = τAvg/τ to create a new set of time samples ts = tα. Each x, y, and z component of the trajectory was then independently resampled using spline interpolation to a common time axis consisting of 300 evenly sampled points (interp1 command in Matlab, The Mathworks). Finally, the mean and SE were calculated separately for each time point of the x, y, and z components.

Supplementary Material

Supporting Information

Acknowledgments.

We thank Eilon Vaadia for suggestions on behavioral data analysis, Ingrid Albrecht for assistance with animal training, and Nathaniel Daw, M. Chance Spalding, and Andrew Whitford for comments on an earlier draft of this document. Support contributed by National Institute of Health Grant NS-2–2346 (to A.B.S.) and Collaborative Research in Computational Neuroscience Grant EB005847 (to R.E.K.).

Footnotes

The authors declare no conflict of interest.

This article is a PNAS Direct Submission.

This article contains supporting information online at www.pnas.org/cgi/content/full/0808113105/DCSupplemental.

*

Monkey O contributed three data sets with a total of 16 rotated and 52 nonrotated units. Monkey P contributed 14 data sets with a total of 131 rotated and 407 nonrotated units. Monkey A contributed seven data sets in which 25% of the units were rotated in each session (the same perturbation percentage as Monkeys O and P), adding 95 rotated and 275 nonrotated units. An additional 12 data sets were collected from Monkey A, wherein 50% of the units in each session were perturbed. These data contained 228 rotated and 221 nonrotated units. The term “unit” here refers both to well-isolated single neurons and to clusters containing a few nonisolatable neurons.

Deviation was calculated as the distance orthogonal to the target, in the direction of the applied perturbation, calculated when the cursor had moved half the distance to the target.

Unless otherwise stated, all values are given as mean ± SE.

§

To analyze the change in modulation depth, data were combined from both the 25% and 50% experiments, yielding a total of 955 nonrotated and 470 rotated units.

We also tested for (and discarded) the possibility that re-aiming strategies other than a pure rotation could lead to the differential changes in PD that we observe. See the SI Text for details.

The cycle set was considered complete when each of the eight targets was successfully hit or unsuccessfully attempted at least three times, where the minimum number of unsuccessful attempts was set by the experimenter. Thus, if the animal performed perfectly, a cycle set would consist of eight trials.

References

  • 1.Buonomano DV, Merzenich MM. Cortical plasticity: from synapses to maps. Annu Rev Neurosci. 1998;21:149–186. doi: 10.1146/annurev.neuro.21.1.149. [DOI] [PubMed] [Google Scholar]
  • 2.Sanes JN, Donoghue JP. Plasticity and primary motor cortex. Annu Rev Neurosci. 2000;23:393–415. doi: 10.1146/annurev.neuro.23.1.393. [DOI] [PubMed] [Google Scholar]
  • 3.Paz R, Wise SP, Vaadia E. Viewing and doing: similar cortical mechanisms for perceptual and motor learning. Trends Neurosci. 2004;27:496–503. doi: 10.1016/j.tins.2004.04.013. [DOI] [PubMed] [Google Scholar]
  • 4.Siegelbaum SA, Kandel ER. Learning-related synaptic plasticity: LTP and LTD. Curr Opin Neurobiol. 1991;1:113–120. doi: 10.1016/0959-4388(91)90018-3. [DOI] [PubMed] [Google Scholar]
  • 5.Gandolfo F, Li CSR, Benda BJ, Padoa-Schioppa C, Bizzi E. Cortical correlates of learning in monkeys adapting to a new dynamical environment. Proc Natl Acad Sci USA. 2000;97:2259–2263. doi: 10.1073/pnas.040567097. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 6.Li CSR, Padoa-Schioppa C, Bizzi E. Neuronal correlates of motor performance and motor learning in the primary motor cortex of monkeys adapting to an external force field. Neuron. 2001;30:593–607. doi: 10.1016/s0896-6273(01)00301-4. [DOI] [PubMed] [Google Scholar]
  • 7.Padoa-Schioppa C, Li CSR, Bizzi E. Neuronal activity in the supplementary motor area of monkeys adapting to a new dynamic environment. J Neurophysiol. 2004;91:449–473. doi: 10.1152/jn.00876.2002. [DOI] [PubMed] [Google Scholar]
  • 8.Wise SP, Moody SL, Blomstrom KJ, Mitz AR. Changes in motor cortical activity during visuomotor adaptation. Exp Brain Res. 1998;121:285–299. doi: 10.1007/s002210050462. [DOI] [PubMed] [Google Scholar]
  • 9.Paz R, Boraud T, Natan C, Bergman H, Vaadia E. Preparatory activity in motor cortex reflects learning of local visuomotor skills. Nat Neurosci. 2003;6:882–890. doi: 10.1038/nn1097. [DOI] [PubMed] [Google Scholar]
  • 10.Paz R, Vaadia E. Learning-induced improvement in encoding and decoding of specific movement directions by neurons in the primary motor cortex. PLoS Biol. 2004;2:264–274. doi: 10.1371/journal.pbio.0020045. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 11.Wessberg J, et al. Real-time prediction of hand trajectory by ensembles of cortical neurons in primates. Nature. 2000;408:361–365. doi: 10.1038/35042582. [DOI] [PubMed] [Google Scholar]
  • 12.Taylor DM, Helms-Tillery SI, Schwartz AB. Direct cortical control of 3D neuroprosthetic devices. Science. 2002;296:1829–1832. doi: 10.1126/science.1070291. [DOI] [PubMed] [Google Scholar]
  • 13.Musallam S, Corneil BD, Greger B, Scherberger H, Andersen RA. Cognitive control signals for neural prosthetics. Science. 2004;305:258–262. doi: 10.1126/science.1097938. [DOI] [PubMed] [Google Scholar]
  • 14.Santhanam G, Ryu SI, Yu BM, Afshar A, Shenoy KV. A high-performance brain-computer interface. Nature. 2006;442:195–198. doi: 10.1038/nature04968. [DOI] [PubMed] [Google Scholar]
  • 15.Hochberg LR, et al. Neuronal ensemble control of prosthetic devices by a human with tetraplegia. Nature. 2006;442:164–171. doi: 10.1038/nature04970. [DOI] [PubMed] [Google Scholar]
  • 16.Velliste M, Perel S, Spalding MC, Whitford AS, Schwartz AB. Cortical control of a prosthetic arm for self-feeding. Nature. 2008;453:1098–1101. doi: 10.1038/nature06996. [DOI] [PubMed] [Google Scholar]
  • 17.Georgopoulos AP, Kettner RE, Schwartz AB. Primate motor cortex and free arm movements to visual targets in three-dimensional space. II. Coding of the direction of movement by a neuronal population. J Neurosci. 1988;8:2928–1937. doi: 10.1523/JNEUROSCI.08-08-02928.1988. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Schwartz AB, Taylor DM, Helms-Tillery SI. Extraction algorithms for cortical control of arm prosthetics. Curr Opin Neurobiol. 2001;11:701–707. doi: 10.1016/s0959-4388(01)00272-0. [DOI] [PubMed] [Google Scholar]
  • 19.Georgopoulos AP, Kalaska JF, Caminiti R, Massey JT. On the relations between the direction of two-dimensional arm movements and cell discharge in primate motor cortex. J Neurosci. 1982;2:1527–1537. doi: 10.1523/JNEUROSCI.02-11-01527.1982. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 20.Schwartz AB, Kettner RE, Georgopoulos AP. Primate motor cortex and free arm movements to visual targets in three-dimensional space. I. Relations between single cell discharge and direction of movement. J Neurosci. 1988;8:2913–2927. doi: 10.1523/JNEUROSCI.08-08-02913.1988. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 21.Truccolo W, Friehs GM, Donoghue JP, Hochberg LR. Primary motor cortex tuning to intended movement kinematics in humans with tetraplegia. J Neurosci. 2008;28:1163–1178. doi: 10.1523/JNEUROSCI.4415-07.2008. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 22.Mosier KM, Scheidt RA, Acosta S, Mussa-Ivaldi FA. Remapping hand movements in a novel geometrical environment. J Neurophysiol. 1994;94:4362–4372. doi: 10.1152/jn.00380.2005. [DOI] [PubMed] [Google Scholar]
  • 23.Liu X, Scheidt RA. Contributions of online visual feedback to the learning and generalization of novel finger coordination patterns. J Neurophysiol. 2008;99:2546–2557. doi: 10.1152/jn.01044.2007. [DOI] [PubMed] [Google Scholar]
  • 24.Scott SH, Kalaska JF. Reaching movements with similar hand paths but different arm orientations. I. Activity of individual cells in motor cortex. J Neurophysiol. 1997;77:826–852. doi: 10.1152/jn.1997.77.2.826. [DOI] [PubMed] [Google Scholar]
  • 25.Minsky ML. Steps toward artificial intelligence. Proc IRE. 1961;49:8–30. [Google Scholar]
  • 26.Cohen PR, Feigenbaum EA. The Handbook of Artificial Intelligence. Los Altos, CA: Kauffman; 1982. [Google Scholar]
  • 27.Hebb DO. The Organization of Behavior: A Neuropsychological Theory. New York: Wiley & Sons; 1949. [Google Scholar]
  • 28.Unnikrishnan KP, Venugopal KP. Alopex: a correlation based learning algorithm for feed-forward and recurrent neural networks. Neural Comput. 1994;6:469–490. [Google Scholar]
  • 29.Mazzoni P, Andersen RA, Jordan MI. A more biologically plausible learning rule than back propagation applied to a network model of cortical area 7a. Cereb Cortex. 1991;1:293–307. doi: 10.1093/cercor/1.4.293. [DOI] [PubMed] [Google Scholar]
  • 30.Fiete IR, Fee MS, Seung HS. Model of birdsong learning based on gradient estimation by dynamic perturbation of neural conductances. J Neurophysiol. 2007;98:2038–2057. doi: 10.1152/jn.01311.2006. [DOI] [PubMed] [Google Scholar]
  • 31.Reina GA, Moran DW, Schwartz AB. On the relationship between joint angular velocity and motor cortical discharge during reaching. J Neurophysiol. 2001;85:2576–2589. doi: 10.1152/jn.2001.85.6.2576. [DOI] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

Supporting Information

Articles from Proceedings of the National Academy of Sciences of the United States of America are provided here courtesy of National Academy of Sciences

RESOURCES