Rhesus monkeys learn to control a directional-key inspired brain machine interface via bio-feedback

Chenguang Zhang; Hao Wang; Shaohua Tang; Zheng Li

doi:10.1371/journal.pone.0286742

. 2024 Jan 17;19(1):e0286742. doi: 10.1371/journal.pone.0286742

Rhesus monkeys learn to control a directional-key inspired brain machine interface via bio-feedback

Chenguang Zhang ^1,^2,^¤, Hao Wang ³, Shaohua Tang ^1,^4,⁵, Zheng Li ^1,^2,^*

Editor: Shenbing Kuang⁶

PMCID: PMC10793883 PMID: 38232123

Abstract

Brain machine interfaces (BMI) connect brains directly to the outside world, bypassing natural neural systems and actuators. Neuronal-activity-to-motion transformation algorithms allow applications such as control of prosthetics or computer cursors. These algorithms lie within a spectrum between bio-mimetic control and bio-feedback control. The bio-mimetic approach relies on increasingly complex algorithms to decode neural activity by mimicking the natural neural system and actuator relationship while focusing on machine learning: the supervised fitting of decoder parameters. On the other hand, the bio-feedback approach uses simple algorithms and relies primarily on user learning, which may take some time, but can facilitate control of novel, non-biological appendages. An increasing amount of work has focused on the arguably more successful bio-mimetic approach. However, as chronic recordings have become more accessible and utilization of novel appendages such as computer cursors have become more universal, users can more easily spend time learning in a bio-feedback control paradigm. We believe a simple approach which leverages user learning and few assumptions will provide users with good control ability. To test the feasibility of this idea, we implemented a simple firing-rate-to-motion correspondence rule, assigned groups of neurons to virtual “directional keys” for control of a 2D cursor. Though not strictly required, to facilitate initial control, we selected neurons with similar preferred directions for each group. The groups of neurons were kept the same across multiple recording sessions to allow learning. Two Rhesus monkeys used this BMI to perform a center-out cursor movement task. After about a week of training, monkeys performed the task better and neuronal signal patterns changed on a group basis, indicating learning. While our experiments did not compare this bio-feedback BMI to bio-mimetic BMIs, the results demonstrate the feasibility of our control paradigm and paves the way for further research in multi-dimensional bio-feedback BMIs.

Introduction

Brain machine interfaces (BMI), or brain computer interfaces (BCI), connect brains to machines or computers. Here we focus on a subset of such systems which use intracortical neuronal recordings from invasive electrodes implanted into cortex. Such systems can control robotic arms [1–4], cursors on computers [5–8] or tablets [9], exoskeletons [10], and paralyzed limbs via functional electrical stimulation [11–14]. Some early work related to invasive BMI recorded from single or a few neurons [1, 15, 16] and depended on learning by the user to control the activity of recorded neurons. Thanks to the development of electrode array hardware, chronic neuronal ensemble recordings have become viable [17], motivating more complex decoding methods.

These methods hold various assumptions to leverage multi-channel neural activity to offer intuitive and higher dimensional control: the population vector algorithm [18] assumes neuronal tuning can be described by preferred directions; linear filters (including the discrete Wiener filter) allow asymmetrical preferred direction distributions but assume linear tuning [19–21]; Kalman filters [22, 23] assume a state-space Markovian model with linear state transitions and linear tuning. More recent methods acknowledge the stochasticity of neuronal firing and the complexity of the cortical-musculature pathway and of the dynamics-to-kinematics relationship: particle filters [24], recurrent neural networks [25–27], support vector regression [25, 28, 29], autoregressive moving average [30], kernel autoregressive moving average [31], unscented Kalman filters [6], specialized Bayesian filters [22, 32, 33], and point process filters [34–36]. They utilize complex algorithms to mimic, in a supervised-learning approach, the natural relationship between motor cortical activity and end-effector movement, i.e. neural tuning. The approach is based on the belief that movement intentions can be accurately decoded after the decoder parameters are fitted using data recorded from actual limb movements or some reasonable substitute (when these are not available). The primary advantage of this bio-mimetic approach is intuitive and immediate control: after a brief period of model fitting and user orientation, the BMI system “plugs in”, bypassing the existing motor system.

While this kind of BMI decoder may have good initial performance, research [37–40] has found that, after practicing with neural control, users can perform better when feed-back learning is available (the work of [37] suggests focus on proprioceptive feedback).

However, the increasingly complex and often non-linear transformations used in bio-mimetic decoders may hamper, in the long-run, the ability of the user to learn these systems [41]: users’ trial-and-error strategy might work well for a simple neuronal activity to end-effector movement relationship, but not as well for complex or non-linear decoders which may include probabilistic tuning models, deep neural networks, or models that mimic the physical properties of limbs.

On the other hand, technical improvements in the stability of chronic recordings make the bio-feedback BMI paradigm increasingly feasible. A novel appendage may not need complex bio-mimetic neural tuning; humans may learn to use it from scratch, similar to how infants learn to control their limbs. To test the feasibility of this approach, we tasked monkeys to learn to control a cursor via a relatively simple control system by trial and error. Such a system may have lower initial accuracy when compared to state-of-the-artbio-mimetic BMIs, but after long-term user learning, it may offer competitive control accuracy. Compared to the bio-mimetic approach, there is much less work in this area [11, 42–45].

We are interested in simple control rules with simple assumptions, like the early work of Fetz [1] and Kennedy et al. [16]. Inspired by the four directional keys on a computer keyboard, used daily by many people to control a novel end-effector (the computer cursor), we designed and implemented an algorithm, called group weight, which sums spike counts within groups of neurons and converts the sums to cursor speed by multiplying with coefficients (weights). We use the summed firing rates of groups of neurons, since firing rates of individual neurons are variable [41]. Our algorithm does not require the sophisticated parameter estimation efforts of bio-mimetic BMI decoder training; however, analysis of neuronal tuning (preferred directions) can help us choose neuron groups so that neurons between groups are less likely to fire together, which helps facilitate separate control of two dimensions initially. In this first step in testing the feasibility of this approach, we do not compare against state-of-the-art bio-mimetic decoders here, nor attempt to solve the plethora of issues related to long-term BMI control, but rather focus on demonstrating the ability of BMI users to learn to use our bio-feedback control system. We also aim to demonstrate control of multiple dimensions simultaneously; here as an initial step, we aim for two-dimensional control.

We conducted experiments with two Rhesus monkeys to demonstrate the ability to simultaneously control two dimensions and ability to improve performance with learning. They monkeys were able to control the 2D cursor. Our analyses show task performance significantly improved over approximately one week of practice. The trajectories become straighter, consistent with learning. We analyzed group-wise neural ensemble activities across the training period and found that variability in the output-potent direction increased, supporting the presence of learning to use this algorithm on a group level. We then analyzed individual neuron’s tuning properties and found their preferred direction (PD) changed in ways that were suitable for the algorithm, indicating the neurons’ contribution to learning the new algorithm.

Even though our experiments here were not of sufficient length to allow monkeys to gain high-accuracy control, our study demonstrates the feasibility of the group weight method and suggests it warrants further investigation, both as an approach in designing BMIs and as a tool to study neural changes during learning.

Materials and methods

Algorithm design

Group weight algorithm design

The group weight algorithm converts neurons’ firing rates to the velocity of the cursor using a simple transformation. For 2D control, the conversion relies on four groups corresponding to up, down, left, and right movement, with a pair of opposing groups per dimension similar to natural flexor and extensor muscles. Firing rates of neurons in a group are summed and normalized to obtain an action value. For each dimension, we use the net action value, the positive direction action value minus the negative direction action value, to drive the cursor:

a_{k} = m a x (\frac{(\sum_{i = 1}^{n_{k}} f r_{i}) - μ_{k}}{δ_{k}} + c, 0)

(1)

v_{x} = w (a_{1} - a_{2})

(2)

v_{y} = w (a_{3} - a_{4})

(3)

Here, a_k denotes the action value, the normalized group firing rate, of group k; n_k is the neuron count in this group; fr_i is the firing rate of the i-th neuron (Hz); μ_k, δ_k are mean and standard deviation of the group’s summed firing rate, respectively; and c and w are constant values set by the experimenter (we here use 1 and 0.375, respectively). See section Normalization method for details on how the normalization parameters μ_k and δ_k were set. In Eq 2, v_x is the x-axis velocity of cursor (units of screen cm/s, limited to within ±15cm/s), which is computed as the difference between the 1st and the 2nd action value (similarly for v_y, the y-axis velocity). The four groups control two dimensions of cursor movement on the computer screen (Fig 1a). The normalization aims to avoid imbalance in action value between opposing groups and handles changes in neuronal firing rate due to recording instability. Firing rates were computed using a 5-bin moving average of the binned spike count, with each bin 100ms in duration and non-overlapping.

Fig 1 — a. Schematic illustration of groups. The summed and normalized firing rate of each group of 4 neurons provides an action value. Four action values correspond to four opposing directions in two dimensions of cursor speed. b. Grouping neurons by preferred direction. We divide the 2D space of linear velocity encoding model coefficients (b₁, b₂ in Eq 4) into four quadrants, corresponding to each direction. We select neurons based on their preferred directions, encoding strength, and signal stability. Inset shows the same data plotted on a larger range so that all recorded neurons are visible. c. We implanted Utah microelectrode arrays into primary motor cortex hand representational area. Photo shows surgery for Monkey T. A: anterior, L: lateral. d. Sample spike waveforms 44 days after implantation for Monkey T. Waveforms of different colors indicate different units (for visualization only, we did not sort units for group weight control), and waveform thickness represents plus and minus one-half standard deviation. Panels are placed according to positions on the Utah array (wire bundle at bottom). Color shading per channel indicates group assignment. e. Experimental task. The monkey sat before a screen displaying the brain-controlled cursor (green dot) and task target (white ring) and uses brain activity to control the cursor. After moving the cursor into the target, the monkey receives a water reward.

Neuron grouping

Our neuron inclusion criteria were: neurons should have long-term stable recordings and their directional encoding should be relatively strong. The stability of a neuron’s signal was judged by the number of days the neuron was recorded. If a neuron was recorded for more than 5 days, we regard it as stable. We assigned neurons to groups by their preferred direction (PD). PDs were calculated from data where in the monkey used its contralateral hand to control the cursor via a joystick. The data was collected in a center-out cursor movement task before brain control sessions. The joystick position was mapped in a one-to-one manner to the cursor location on the screen. We used a linear regression (Eq 4, the b variables are fitted coefficients) to calculate each neuron’s prefer direction vector (b₁, b₂).

f r = b_{0} + b_{1} v_{x} + b_{2} v_{y}

(4)

We plot all neurons’ PDs in Fig 1b. We separate neurons according to their preferred directions into four groups by dividing the 2D space of coefficients into four equal-angled sectors, with divisions at 1/4π, 3/4π, 5/4π, and 7/4π radians (angle from origin). For each group, we selected the most strongly tuned neurons (largest magnitude of (b₁, b₂) vector) that had PDs within or nearby its sector. We wanted to use a large number of neurons, so as to mitigate individual neuronal variability; however, here we had to limit the number of neurons in each group because recordings were unstable, and including all recorded neurons would mean some neurons were likely to drop out during the course of the experiments, affecting the group’s firing rate sum’s distribution. We also wanted to avoid weakly tuned neurons, which are likely to have wrong estimates for PD, and when two or more such neurons with common variation are placed in different groups, their activity only contributes to the output-null space. Thus, in the experiments here, we used 4 neurons per group, a number that is not too high so that losing a neuron was highly probable, and not too low so that losing a neuron would cause a large control ability drop. Note that, while we used PD information to set the groups, this information was not used in subsequent brain control. One may ask why bother with a bio-feedback approach that requires PD estimation, which is an important concept in bio-mimetic control. The reason we used PD-based grouping is that monkeys would not participate in the experiment if initial performance was too poor. This is a quirk of animal experiments which is not expected to occur with highly-motivated human users in a clinical environment. Using groups chosen by PDs provided sufficient initial control for the animals to continue trying, though still too poor to complete the task well. In clinical practice, groups can be chosen by other approaches which do not need training data, such as singular value decomposition of the firing rate covariance matrix [46].

Normalization method

The normalization constants were calculated using data from 5 to 10 minutes of pre-experiment, performed before each session. We sum each group’s firing rates during the pre-experiment and calculate each group’s summed firing rate average (μ) and standard deviation (σ). Then we use these values for group firing rate normalization (Eq 1). During the pre-experiment, monkeys performed brain control of cursor via group weight, using normalization constants from the previous session. For the first session, normalization constants were calculated from data recorded while the monkey was idle.

Animals, surgery, and data recording

Surgical and recording methods were similar to our previous study [47]. All surgical and experimental procedures were in compliance with the United States National Institutes of Health Guide for the Care and Use of Laboratory Animals and were approved by the Institutional Animal Care and Use Committee of Beijing Normal University. Two adult male Rhesus monkeys (Macaca mulatta), weighing 7.9kg (Monkey T) and 8.1kg (Monkey K), were used in this research. They were implanted with Utah electrode arrays (96 channels, electrode length: 1.5mm. Blackrock Microsystems, USA) in their primary motor cortex (Fig 1c) approximately 15mm lateral of midline. Surgeries were performed under sterile conditions with isoflurane (2%) anesthesia according to standard Utah array implantation procedures. We began to collect 44 days after implantation (Monkey T) (sample unit waveforms shown in Fig 1d) and data 283 days after implantation (Monkey K). Monkeys previously had some practice (3 weeks for Monkey T, approximately 2 weeks for Monkey K) with the bio-feedback BMI before the collection of data presented here. This was necessary for the monkeys to become acquainted with BMI control and for development and debugging of our control software. We recorded extracellular signals using a 128-channel Omniplex A recording system (Plexon Inc, USA). For spike detection, we used threshold-crossings with threshold set to 5 standard deviations of the voltage. We used such a high threshold to obtain recordings with high signal quality, as only the largest 1 or 2 units would be detected. We did not use spike sorting in these experiments, so as to simplify correspondence of signals across days [48]. In this text, we refer to these unsorted units as neurons. Since such multi-units have firing rates which are sums of firing rates of individual neurons, and our algorithm sums firing rates in groups, this means the groups likely contain more single-units than 4.

Animals were housed and fed in accordance with the United States National Institutes of Health (NIH) Guide for the Care and Use of Laboratory Animals. Cage sizes exceeded NIH Guide standards, and cages were equipped with water dispensers and food receptacles. Animals were solo caged, and cages were kept in a temperature, humidity, air quality, air freshness, air pressure, light, and sound-controlled room, with less than 20 animals total. Animals were fed dry primate feed three times daily and fresh fruit (or dried fruit if water restricted) once daily. Animals were given unrestricted water during days not partaking in experiments and a measured quantity of water sufficient to meet survival needs during days partaking in experiments. Animal health was observed during feeding by animal care staff and prior to experiments by experimenters. Animal weight was measured regularly or before every experiment. Health was assessed by animal appearance (especially at implant locations), presence of abnormal behavior, body weight, and appearance of waste. Cages were cleaned once daily. Enrichment was provided via television. Isoflurane (2%) anesthesia was provided during surgery and ibuprofen analgesia was provided after surgery. After the study, Utah array implants were removed from both animals under anesthesia, and they were transferred to another primate research group at the same institution.

Task design

The two monkeys were trained to perform center-out reaching tasks (Fig 1e), both via a hand-controlled joystick held in the hand contralateral to the implant. In the center-out task, the monkey had to move the dot cursor into the ring target to obtain a water reward. Targets appeared at screen center and then at peripheral locations, so that sequential movements toward the target resulted in center-out movements. After monkeys were aquatinted with the task, the joystick was removed. During experiments, monkeys sat in a primate chair and faced a computer screen showing the task stimuli (approximately 0.75m away). Their heads were fixed but hands were free to move. The task differed slightly between the two monkeys. For Monkey K, the center and peripheral targets appeared alternately without delay, with peripheral targets appearing at random angles (uniform distribution in 0–360°), so that movements formed a center-out-and-back series. For Monkey T, only peripheral targets were shown, and cursor position was reset to the center after each trial, so movements were only outwards. These peripheral targets appeared at only four possible places, right (0°), up (90°), left (180°), and down (270°). These changes were made to facilitate analysis of learning. Additionally, we added a short, random duration (5.5–10 seconds in the first 14 sessions, and 2.5–5 seconds thereafter) delay period or freeze time to the task sequence before the movement toward each peripheral target. Monkey T could see the peripheral target and cursor, but was unable to move the cursor during the delay period. This allowed us to analyze Monkey T’s neuronal activity both during movement and during preparation. Other minor differences are given in S1 Table of the supplementary information. Trials were limited to 10–15 seconds in duration (first three sessions used 10s and remaining sessions used 15s, both excluding freeze time). If monkeys did not reach the target during that time, the trial failed. The hold time for targets was 200 milliseconds.

Experiment design

Prior to the experiments reported here, Monkey T had previous experience with group weight control: this consisted of 6 days, during which different groups were used compared to the data reported here, in which Monkey T did not learn to control. This was followed by 5 days with similar groups as the data reported here, during which Monkey T did not learn to control because of distraction by the presence of the joystick. Then we removed the joystick and gave Monkey T 6 days of consecutive practice, in which it did not have obvious improvement. We report here data from 8 consecutive days of practice that happened 20 days after the previous block, during which Monkey T showed improvement in task performance. Prior to the experiments reported here, Monkey K had previous experience with the group weight control paradigm on a different 2D behavioral task (16 days), but could not learn to control during that time. This monkey also had approximately two weeks of practice with brain-controlled center-out task using group weight, but did not learn to control during that time: in the first week (4 days), the monkey used different neuron groups than in the data reported here. In the second week (6 days), groups were similar but not exactly the same as the groups reported here. After these two weeks of training, there was a 3-week-pause before the collection of data reported here, which consists of 6 consecutive days of practice in which Monkey K showed improvements in task performance. Both monkeys had two practice sessions each day, with each session about 30 minutes in duration. Monkey T’s sessions were consecutive, whereas Monkey K’s training sessions were conducted separately in the morning and afternoon. Monkey T’s total trials per session ranged from 100 to 200; Monkey K’s total trials per session ranged from 150 to 400; differences were due to different task settings (see Task design). Since failed trials generally lasted longer than successful trials, each session’s total trial count would increase as the monkey learned to control. Before each bio-feedback control session, there was a 5 to 10-minute long pre-experiment portion during which we collected data to calculate the normalization constants (see Normalization method).

Data analysis

Calculation of random baseline for success rate

To verify that the success rates we observed were not due to chance, we asked what the chance rate will be if neural activity is independent from visual cue. To achieve the independency, we calculated the baseline success rates by random shuffling neural activities of 10-ms bins during cursor movement portions of sessions. Then, we fed the shuffled neural data into an offline simulation of our algorithm and center-out task (with the same set of task parameters). The task design is the same as the online version, except that we fix all trial durations to 10s. Failure to reach the target and hold within 10s was recorded as a failed trial. For Monkey K, whose cursor was not moved to the center automatically after each trial, we move the simulated cursor to the center of the last target, as if it had reached the last target. We average the success rates obtained in this manner across sessions for each monkey. Due to differences in task design between monkeys, the baseline values were different between monkeys.

Results

Task performance

Overall performance

Over approximately one week of training, monkeys’ task performance improved steadily (Fig 2a). The number of successfully finished trials increased (logistic regression R² values MT:0.604, MK: 0.622; slope t-test p-values MT: 0.0004, MK: 0.0023) and the trial success rate increased (logistic regression R2 values MT:0.578, MK: 0.705; slope t-test p-values MT, MK<0.001). Monkey T’s success rate improved from 22% to 98%, while Monkey K improved from 12% to 80%. We obtained random baseline success rates by shuffling the neural data offline and reconstructing the control paradigm and task in MATLAB (see methods). Both monkeys’ success rates were far better than the random baseline values, which were 12% for Monkey T and 2% for Monkey K, indicating that the monkeys participated in the task and performed better than chance. Task success rate for targets in each direction significantly increased (Fig 2b). Together, these trends indicated that both monkeys successfully learned to use our group weight control paradigm.

Trajectories occupancy

The improvement in task performance can be seen in the cursor trajectory occupancy maps (Fig 3). Here we only show Monkey T’s data, since Monkey K’s task has targets at many angles instead of just four. We segment the training into 4 stages (rows), with 4 sessions each, and separate by target direction (columns). We plot cursor trajectories (in screen space) in each panel, including both successful and failed trials. Early in the training, trajectories span almost the whole screen (Fig 3, 1st row) and occupy similar regions for different targets. The trajectories for the upward target are relatively more compact in space, indicating the monkey can move in this direction well in the early stage. As the monkey practices (row 2 and 3), the cursor trajectories each column become more compact in space. The monkey learns to move the cursor to the right half of the screen. In the late stage (row 4), movements towards each target can be seen. The trajectories show that the monkey is able to repeat existing cursor movement patterns, and create new ones: similar cursor trajectories repeat across trials (e.g. up target) and new trajectories are created (e.g. right and down). Patterns that existed at the beginning are refined earlier (e.g. up). New patterns need long term trial and error to learn and become refined later (e.g. right and down).

Trajectory straightness

We analyzed the straightness of cursor trajectories, a quantitative indicator of control ability. To measure straightness, we define the “trajectory score” as the length of the cursor trajectory during a successful trial divided by the length from the start point to the target center. A smaller value indicates a straighter trajectory. We separated trials according to the target direction (four quarters of the angular space, similar to those used in PD grouping: right, up, left and down). We found that both monkeys’ trajectories became straighter after learning. Monkey T’s trajectory score (Fig 4 upper) decreased in three directions (right: regression trend-line R² = 0.08, slope t-test p < 10⁻¹⁰; up: R² = 0.03, p < 10⁻⁴; down: R² = 0.13, p < 10⁻¹⁰) indicating that control quality improved for these three directions. Although, the trajectories for left direction trials became slightly more curved (R² = 0.01, p < 0.05, Fig 4 upper row 3rd panel), this was affected by high trajectory score trials in the late practice period and do not reflect performance decreases in this direction. For Monkey K, trajectories become straighter in three directions (right: regression trend-line R² = 0.01, slope t-test p < 0.05; up: R² = 0.02, p < 10⁻³; down: R² = 0.31, p < 10⁻⁷). The task for Monkey K had targets at random angles, thus it is difficult to find clear patterns or stereotyped movements.

Fig 4 — We separate trials according to target direction for Monkey T and according to the vector direction from cursor initial position to target center for Monkey K. The trajectory score is calculated by dividing trajectory length by the distance from cursor initial position to target center (smaller is straighter). Black lines indicate linear fits. We observed significant decreasing trends (**p <* 0.05, * * *p <* 0.001), except for the left target for Monkey T and the downwards target for Monkey K.

Group firing rate analysis

Our control paradigm is similar to the muscle-skeletal system in that an agonist group of neurons and an antagonist group of neurons co-activate to generate movement in a direction. When we flex our native arm, we need the agonist muscle group to activate while the antagonist group should deactivate (or have lower activation); the difference in activation of these two groups results in a net force, and thus influences the movement (output-potent), while the sum of activations does not (output-null). Thus, for our paradigm, we are interested in whether there were changes in values in the output-potent versus output-null space (Fig 5a) after learning. We use the collected neural data and offline-reconstruct activation values; We calculate the output-potent values as the group activation subtraction between the toward-target direction vs away-target direction. The output-null values are the summations of the two groups’ activation values. We pooled four directions and two monkeys’ data together and compare the output-null and output-potent values from the first sessions versus late sessions, to show the learning effect. Specifically for monkey K, we analyzed portions of successful trials in which cursor speed was medium, so as to exclude times when the monkey was not engaged in the task (idling) or when there were fast movements due to artifacts or non-task-related physical behavior. The criterion was: cursor speed between 25 to 75 percent of the maximum speed. For the output-null values, the later sessions showed larger average values than the early sessions (Fig 5b). However, this effect only existed for monkey T. For monkey K, the averages of output-null activity in the first and last sessions are the same. We believe this could be due to the slight differences in experimental paradigm.

Fig 5 — a. Output-potent and output-null space illustration. In our control paradigm, the cursor speed in a dimension is proportional to the difference of two opposing action values (potent), whereas the sum of two opposing action values does not influence cursor speed (null). b. Null-space analysis of neural learning. We averaged the output-null values a₁ + a₂ across time in each trial, and compare the output-null values from the early sessions versus the late sessions, pooling data from two monkeys and four directions. The null component average of the late sessions was slightly larger than that in the early sessions, (ANOVA, ***p <* 10⁻¹⁰), this effect mostly comes from monkey T, as there was no difference in monkey K.

In our control paradigm, only when the output-potent values are larger than 0 could the cursor move towards the target. As seen in Fig 6, the output-potent values increased in late sessions compared with early sessions. In early sessions the output-potent value was near 0, indicating that the monkeys’ effort did not result in effective movements. Near the end of the sessions, the output-potent values are larger than 0, indicating that the group activity is more effective.

Tuning change through learning

The group-level neural ensemble activity shows significant output-potent increase, so next we analyze where it came from. In this section, we looked through the directional tuning properties of each neuron and found that neuron tuning changes occur in a way that is suitable for using the algorithm.

In the calculation of tuning, the neuron firing rate was regressed to the target direction (for monkey K, the calculation used target direction relative to the cursor direction). We selected epochs when cursor was moving, since for monkey T there was a period when the cursor was frozen. Specifically for monkey K, we selected epochs only when the cursor speed was neither too high nor too low, similar to the above analysis.

We examined three key values, tuning depth, fit R², and preferred direction (PD). The preferred direction is the direction which has the highest firing rate, and the tuning depth and R² correspond to the range of tuning function as well as the goodness of tuning fitting curve. The following analysis compares between the early learning stage and the late learning stage (first and last ¼ of sessions).

The tuning depth did not show a significant change (Fig 7a). We separated neurons into direct neurons, whose firing rate was feed into the task, and indirect neurons, whose firing rate was not. Neither of the two groups’ tuning depths experienced a significant change. Note that the difference of tuning depth between the direct and indirect neurons is due to our neuron inclusion criteria, as we chose neurons with substantial tuning to avoid neural activity correlation in weakly-tuned neurons.

The quality of tuning, R², changed during the learning period (Fig 7b), in general increasing during learning. When we compared the change within direct neurons and indirect neurons; we found that the R² increased significantly (p = 7.93 * 10⁻⁸, paired-t-test) for the direct neurons, and significant (p = 3.43 * 10⁻¹³) but small changes for the indirect neurons.

In our group weight method, we group neurons into four groups, assigning each group to one direction as their contribution direction. We wanted to know whether learning changed the neurons’ PD towards their assigned direction (AD). In Fig 8a, an example neuron is shown. For this specific neuron, the PD changed towards the AD.

Next we calculated the PD for all direct neurons and compared to their AD. The analysis showed that neurons’ tuning PD changed towards the AD (Fig 8b). Here we compared the early and late sessions for both monkeys, and plot normalized direction difference between PD and AD, where 1 means they are opposite, and 0 means they are the same. For neurons from both monkeys, the normalized direction difference decreased significantly (z-test, n = 33, p = 0.012), which is consistent with our expectation (for more examples of PD change, see S1–S3 Figs).

Discussion

In this study, monkeys used a bio-feedback BMI based on a relatively simple control law that converts neuronal activity to 2D cursor movement. Over about a week’s practice, two monkeys improved their BMI control ability, increasing trial success rate by around 70%. We presented evidence in terms of both behavioral and neuronal activity which supports the occurrence of learning.

Related to our work are studies such as those by Kennedy et al. [16]. Their work relied on a single neuron to control each dimension, with firing rate change mapped to single-directional velocity, combined with position reset (to edge of screen) on click. Arduin et al. [44] proposed a bi-directional control method for one dimension and showed that the controlling neuron changed in firing rate differently from nearby, non-controlling neurons. Law et al. [45] showed that control accuracy of a bio-feedback decoder increases with the number of neurons. Like them, we use the combined firing rates of neurons in a group. Another related study was done by Moritz et al. [11], who used a relatively simple control method to build a functional electrical stimulation based neural bypass. Our work uses a somewhat similar neural-movement mapping as the above studies, but we control 2 dimensions using opposing groups of neurons. The opposing group mechanism lets us analyzed changes in the output-potent and output-null directions (Figs 5 and 6). The 2-dimensional control allows us to examine trajectories in 2D space (Fig 3) and analyze path straightness (Fig 4).

Readers might be interested in the difference between our control method and the population-vector algorithm [18]. Our method assigns neurons to groups that have orthogonal contribution directions, thus each neuron only influence one direction. We also do not normalize each neuron’s firing rate individually, but rather collectively in a group-wised manner. The reason for this is to decrease neurons’ firing rate variability by leveraging a group of neurons. We also use a threshold to remove the effect of small firing rates. Our method will have a same effect as the population vector algorithm under the case when the neurons’ preferred directions are evenly distributed among the four cardinal directions and have the same tuning depth. Our main goal when designing this group weight method is to minimize the assumptions on neural activity, to build a simple algorithm and let the bio-feedback learning take charge. The settings of groups and thresholds were done to avoid unintentional movement caused by neural firing variability.

BMI learning

The initial performance after the monkeys started using brain control is low, although higher than random control (Fig 2), indicating the monkeys already had some level of control. However, the performance did not approach saturation (in terms of trial success rate) until late in training after several sessions. Some studies in the field focus on the progression of neural learning of BMI mappings [49–51]. They perturb the well-learned BMI mapping by re-assigning the decoding parameters and found that monkeys’ performance increased within one session [49, 50] or across sessions [51, 52].

In comparison, our work instead focuses on developing one simpler mapping between neural activity and movements, thus we did not train monkeys to familiarize with other BMI mappings beforehand, but only with the one group weight mapping. Thus, unlike their study, we did not compare neural activity by projecting on the original mapping (there was none); however, the cursor trajectories show that the monkeys achieve control of the new mapping (Fig 3).

We calculated neurons’ PD by assuming cosine tuning. After fitting neural activity and the intended movement direction using a cosine, we obtained the peak direction (PD) the range of firing (tuning depth), and the quality of the fitting (R2). In each of the monkeys’ training sessions, PDs are ever-changing during learning, which is in line with previous work [38]. We also found that the tuning R2 increased, which is consistent with their results.

Besides measuring tuning properties, we also analyzed group level neural activity in terms of output potent-and-null-space (Figs 5 and 6) and found changes over time. The sum of activity of agonist and antagonist groups (output-null value) increased during learning in one monkey but not the other. This could be an effect of the task design. The difference of activity of the agonist and antagonist groups (output-potent value) consistently increased in both monkeys. This is consistent with the fact that both monkey’s control performance improved; the performance increase is explained by changes in the group-level activity in terms of output-potent space.

Bio-mimetic and bio-feedback BMI

While bio-mimetic BMIs focus on the decoder mimicking the neural activity and movement relationship, bio-feedback BMIs focus on the user’s feedback learning. Methods actually fall into a spectrum between bio-mimetic and bio-feedback, with some work relying on aspects of both, for example, using a linear decoder as in a bio-mimetic way, but also relying on bio-feedback to let the user increase performance. Ganguly and Carmena [39] showed that monkeys could learn to control a cursor using different sets of randomly-chosen linear filter parameters and switch readily between them, showing the strong adaptive capacity of the motor cortex. Balasubramanian et al. [4] showed that over long-term training, functional neuronal connections changed, suggesting long-term training may alter the motor cortex. Here, our method relies on a minimum of parameters, by assigning neurons to groups (though for monkey experiments, this assignment was made intelligently to promote engagement) instead of finding the parameters for each neuron, relying instead on bio-feedback for performance increase. Our study has limitations. One was instability in neural recording, which limited our ability to extend the training period. Our experiments were perhaps too short to see substantial changes in neuronal activity patterns. Since we did not spike sort, recording instability may have meant some neurons were included in a group in only a subset of sessions, which meant the group firing rate statistics changed between sessions. We tried to compensate for this using the normalization procedure, but the effect of recording instability is still visible (e.g. session 12 in Fig 2a left panel). Another area for improvement is the selection and grouping of neurons. We grouped based on preferred direction, which may be less optimal than using a method that considers existing connectivity structure, which can determine what is easily learnable [53, 54], at least in the short-term. While the ultimate goal of building bio-feedback BMIs with minimum parameters is to allow high control accuracy, we did not train monkeys long enough for learning to increase performance to a level comparable with bio-mimetic methods. This study was meant as a feasibility demonstration, a first step showing that simple, group-based method, for multi-dimensional control is learnable. Tweaking of the control law for better performance and longer duration training are needed in future studies.

Conclusion

We implemented the group weight method as a bio-feedback BMI control paradigm with simple assumptions. Through one week of training, monkeys showed increases in success rate and trajectories become straighter, indicating learning occurred. Group-based and single neuron metrics also indicated learning occurred. This simple bio-feedback control paradigm has potential to control multi-dimensional cursors or robotic limbs.

Supporting information

S1 Fig. Neuronal PD change across learning, monkey T.

Each column represents a group of neurons and each pie plot corresponds to a session. In each pie plot, the shaded area shows neurons’ assigned direction (AD) and the colored bars show neuron’s tuning (PD). In early learning (session 1), the neurons’ PDs are not close to their AD (except for group 3). However, in the late sessions (15 and 16), neurons’ PDs are closer to the AD.

(DOCX)

Click here for additional data file.^{(150KB, docx)}

S2 Fig. Neuronal PD change across learning, monkey K. Same notation as S1 Fig.

(DOCX)

Click here for additional data file.^{(132.7KB, docx)}

S3 Fig. Distance between neuronal PD and assigned directions AD across learning, both monkeys.

X-axis is time (session number) and Y-axis is the normalized |PD—AD|, which is 1 for opposite direction and 0 for same direction. Thus, the directional difference is mapped from [-π, π] to [0,1]. The sector of the assigned direction [-45°, 45°] (shaded sector in S1 Fig) is mapped to [0,0.25], and values lower than 0.5 indicate the channel contributes to the direction of movement. Most channels’ |PD–AD| is smaller than 0.5, showing that they contribute to the movement direction.

(DOCX)

Click here for additional data file.^{(321.4KB, docx)}

S1 Table. Task difference between the two monkeys.

(DOCX)

Click here for additional data file.^{(23.5KB, docx)}

Acknowledgments

We thank Timothy L. Hanson for the idea of the group-based control approach and helpful comments on the manuscript. We thank Xibin Xu, Feng Wang, Yin Yan, Yang Li, Ye Xin, Jie Li, Xiuli Chen, and Ying Li for assistance with animal surgeries. We thank Miguel A. L. Nicolelis for permission to use the BMI3 software suite. We thank Anna Wang Roe and the anonymous reviewers for their helpful comments on the manuscript.

Data Availability

All formatted files are available from the dandiset database: https://dandiarchive.org/dandiset/000338/draft Zhang, Chenguang (2022) groupweight BMI (Version draft) [Data set]. DANDI archive. (accession number(s) 000338).

Funding Statement

Z Li was supported by the STI 2030-Major Projects of the Ministry of Science and Technology of China, 2021ZD0200407, the National Key Research and Development Program of China, 2020YFC0832402, and the Innovation Team Project of Guangdong Provincial Department of Education, 2021KCXTD014. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

References

1.Fetz EE. Operant Conditioning of Cortical Unit Activity. Science. 1969;163(3870):955–958. doi: 10.1126/science.163.3870.955 [DOI] [PubMed] [Google Scholar]
2.Velliste M, Perel S, Spalding MC, Whitford AS, Schwartz AB. Cortical control of a prosthetic arm for self-feeding. Nature. 2008;453(7198):1098–1101. doi: 10.1038/nature06996 [DOI] [PubMed] [Google Scholar]
3.Hochberg LR, Bacher D, Jarosiewicz B, Masse NY, Simeral JD, Vogel J, et al. Reach and grasp by people with tetraplegia using a neurally controlled robotic arm. Nature. 2012;485(7398):372–375. doi: 10.1038/nature11076 [DOI] [PMC free article] [PubMed] [Google Scholar]
4.Balasubramanian K, Southerland J, Vaidya M, Qian K, Eleryan A, Fagg AH, et al. Operant conditioning of a multiple degree-of-freedom brain-machine interface in a primate model of amputation. In: 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC); 2013. p. 303–306. [DOI] [PubMed]
5.Serruya MD, Hatsopoulos NG, Paninski L, Fellows MR, Donoghue JP. Instant neural control of a movement signal. Nature. 2002;416(6877):141–142. doi: 10.1038/416141a [DOI] [PubMed] [Google Scholar]
6.Li Z, O’Doherty JE, Hanson TL, Lebedev MA, Henriquez CS, Nicolelis MAL. Unscented Kalman Filter for Brain-Machine Interfaces. PLOS ONE. 2009;4(7):e6243. doi: 10.1371/journal.pone.0006243 [DOI] [PMC free article] [PubMed] [Google Scholar]
7.Gilja V, Pandarinath C, Blabe CH, Nuyujukian P, Simeral JD, Sarma AA, et al. Clinical translation of a high-performance neural prosthesis. Nature Medicine. 2015;21(10):1142–1145. doi: 10.1038/nm.3953 [DOI] [PMC free article] [PubMed] [Google Scholar]
8.Pandarinath C, Nuyujukian P, Blabe CH, Sorice BL, Saab J, Willett FR, et al. High performance communication by people with paralysis using an intracortical brain-computer interface. eLife. 2017;6:e18554. doi: 10.7554/eLife.18554 [DOI] [PMC free article] [PubMed] [Google Scholar]
9.Nuyujukian P, Sanabria JA, Saab J, Pandarinath C, Jarosiewicz B, Blabe CH, et al. Cortical control of a tablet computer by people with paralysis. PLOS ONE. 2018;13(11):e0204566. doi: 10.1371/journal.pone.0204566 [DOI] [PMC free article] [PubMed] [Google Scholar]
10.Vouga T, Zhuang KZ, Olivier J, Lebedev MA, Nicolelis MAL, Bouri M, et al. EXiO—A Brain-Controlled Lower Limb Exoskeleton for Rhesus Macaques. IEEE Transactions on Neural Systems and Rehabilitation Engineering. 2017;25(2):131–141. doi: 10.1109/TNSRE.2017.2659654 [DOI] [PubMed] [Google Scholar]
11.Moritz CT, Perlmutter SI, Fetz EE. Direct control of paralysed muscles by cortical neurons. Nature. 2008;456(7222):639–642. doi: 10.1038/nature07418 [DOI] [PMC free article] [PubMed] [Google Scholar]
12.Bouton CE, Shaikhouni A, Annetta NV, Bockbrader MA, Friedenberg DA, Nielson DM, et al. Restoring cortical control of functional movement in a human with quadriplegia. Nature. 2016;533(7602):247–250. doi: 10.1038/nature17435 [DOI] [PubMed] [Google Scholar]
13.Guggenmos DJ, Azin M, Barbay S, Mahnken JD, Dunham C, Mohseni P, et al. Restoration of function after brain damage using a neural prosthesis. Proceedings of the National Academy of Sciences. 2013;110(52):21177–21182. doi: 10.1073/pnas.1316885110 [DOI] [PMC free article] [PubMed] [Google Scholar]
14.Friedenberg DA, Schwemmer MA, Landgraf AJ, Annetta NV, Bockbrader MA, Bouton CE, et al. Neuroprosthetic-enabled control of graded arm muscle contraction in a paralyzed human. Scientific Reports. 2017;7(1):8386. doi: 10.1038/s41598-017-08120-9 [DOI] [PMC free article] [PubMed] [Google Scholar]
15.Fetz EE, Finocchio DV. Correlations between activity of motor cortex cells and arm muscles during operantly conditioned response patterns. Experimental Brain Research. 1975;23(3):217–240. doi: 10.1007/BF00239736 [DOI] [PubMed] [Google Scholar]
16.Kennedy PR, Bakay RAE, Moore MM, Adams K, Goldwaithe J. Direct control of a computer from the human central nervous system. IEEE Transactions on Rehabilitation Engineering. 2000;8(2):198–202. doi: 10.1109/86.847815 [DOI] [PubMed] [Google Scholar]
17.Nicolelis MAL, Dimitrov D, Carmena JM, Crist R, Lehew G, Kralik JD, et al. Chronic, multisite, multielectrode recordings in macaque monkeys. Proceedings of the National Academy of Sciences. 2003;100(19):11041–11046. doi: 10.1073/pnas.1934665100 [DOI] [PMC free article] [PubMed] [Google Scholar]
18.Georgopoulos AP, Schwartz AB, Kettner RE. Neuronal Population Coding of Movement Direction. Science, New Series. 1986;233(4771):1416–1419. doi: 10.1126/science.3749885 [DOI] [PubMed] [Google Scholar]
19.Carmena JM, Lebedev MA, Crist RE, O’Doherty JE, Santucci DM, Dimitrov DF, et al. Learning to Control a Brain–Machine Interface for Reaching and Grasping by Primates. PLOS Biology. 2003;1(2):e42. doi: 10.1371/journal.pbio.0000042 [DOI] [PMC free article] [PubMed] [Google Scholar]
20.Hochberg LR, Serruya MD, Friehs GM, Mukand JA, Saleh M, Caplan AH, et al. Neuronal ensemble control of prosthetic devices by a human with tetraplegia. Nature. 2006;442(7099):164–171. doi: 10.1038/nature04970 [DOI] [PubMed] [Google Scholar]
21.Lebedev MA, Carmena JM, O’Doherty JE, Zacksenhouse M, Henriquez CS, Principe JC, et al. Cortical Ensemble Adaptation to Represent Velocity of an Artificial Actuator Controlled by a Brain-Machine Interface. Journal of Neuroscience. 2005;25(19):4681–4693. doi: 10.1523/JNEUROSCI.4088-04.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]
22.Wu H, Feng J, Zeng Y. Neural Decoding for Macaque’s Finger Position: Convolutional Space Model. IEEE Transactions on Neural Systems and Rehabilitation Engineering. 2019;27(3):543–551. doi: 10.1109/TNSRE.2019.2893406 [DOI] [PubMed] [Google Scholar]
23.Orsborn AL, Dangi S, Moorman HG, Carmena JM. Closed-Loop Decoder Adaptation on Intermediate Time-Scales Facilitates Rapid BMI Performance Improvements Independent of Decoder Initialization Conditions. IEEE Transactions on Neural Systems and Rehabilitation Engineering. 2012;20(4):468–477. doi: 10.1109/TNSRE.2012.2185066 [DOI] [PubMed] [Google Scholar]
24.Gao Y, Black MJ, Bienenstock E, Shoham S, Donoghue JP. Probabilistic inference of hand motion from neural activity in motor cortex. In: Proceedings of the 14th International Conference on Neural Information Processing Systems: Natural and Synthetic. NIPS’01. Cambridge, MA, USA: MIT Press; 2001. p. 213–220.
25.Sanchez JC, Sung-Phil Kim, Erdogmus D, Rao YN, Principe JC, Wessberg J, et al. Input-output mapping performance of linear and nonlinear models for estimating hand trajectories from cortical neuronal firing patterns. In: Proceedings of the 12th IEEE Workshop on Neural Networks for Signal Processing. Martigny, Switzerland: IEEE; 2002. p. 139–148. http://ieeexplore.ieee.org/document/1030025/.
26.Sussillo D, Nuyujukian P, Fan JM, Kao JC, Stavisky SD, Ryu S, et al. A recurrent neural network for closed-loop intracortical brain–machine interface decoders. Journal of Neural Engineering. 2012;9(2):026027. doi: 10.1088/1741-2560/9/2/026027 [DOI] [PMC free article] [PubMed] [Google Scholar]
27.Tseng PH, Urpi NA, Lebedev M, Nicolelis M. Decoding Movements from Cortical Ensemble Activity Using a Long Short-Term Memory Recurrent Network. Neural Computation. 2019;31(6):1085–1113. doi: 10.1162/neco_a_01189 [DOI] [PubMed] [Google Scholar]
28.Shpigelman L, Crammer K, Paz R, Vaadia E, Singer Y. A temporal kernel-based model for tracking hand-movements from neural activities. In: Proceedings of the 17th International Conference on Neural Information Processing Systems. NIPS’04. Cambridge, MA, USA: MIT Press; 2004. p. 1273–1280.
29.Hao Y, Zhang Q, Controzzi M, Cipriani C, Li Y, Li J, et al. Distinct neural patterns enable grasp types decoding in monkey dorsal premotor cortex. Journal of Neural Engineering. 2014;11(6):066011. doi: 10.1088/1741-2560/11/6/066011 [DOI] [PubMed] [Google Scholar]
30.Fisher J, Black MJ. Motor Cortical Decoding Using an Autoregressive Moving Average Model. In: 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference. Shanghai, China: IEEE; 2005. p. 2130–2133. http://ieeexplore.ieee.org/document/1616881/. [DOI] [PubMed]
31.Shpigelman L, Lalazar H, Vaadia E. Kernel-ARMA for Hand Tracking and Brain-Machine interfacing During 3D Motor Control. Advances in Neural Information Processing Systems. 2008;21:1489–1496. [Google Scholar]
32.Brandman DM, Burkhart MC, Kelemen J, Franco B, Harrison MT, Hochberg LR. Robust Closed-Loop Control of a Cursor in a Person with Tetraplegia using Gaussian Process Regression. Neural Computation. 2018;30(11):2986–3008. doi: 10.1162/neco_a_01129 [DOI] [PMC free article] [PubMed] [Google Scholar]
33.Makin JG, O’Doherty JE, Cardoso MMB, Sabes PN. Superior arm-movement decoding from cortex with a new, unsupervised-learning algorithm. Journal of Neural Engineering. 2018;15(2):026010. doi: 10.1088/1741-2552/aa9e95 [DOI] [PubMed] [Google Scholar]
34.Eden UT, Frank LM, Barbieri R, Solo V, Brown EN. Dynamic Analysis of Neural Encoding by Point Process Adaptive Filtering. Neural Computation. 2004;16(5):971–998. doi: 10.1162/089976604773135069 [DOI] [PubMed] [Google Scholar]
35.Wang Y, Paiva ARC, Principe JC. A Monte Carlo Sequential Estimation for Point Process Optimum Filtering. In: The 2006 IEEE International Joint Conference on Neural Network Proceedings; 2006. p. 1846–1850.
36.Koyama S, Chase SM, Whitford AS, Velliste M, Schwartz AB, Kass RE. Comparison of brain–computer interface decoding algorithms in open-loop and closed-loop control. J Comput Neurosci. 2010; p. 15. doi: 10.1007/s10827-009-0196-9 [DOI] [PubMed] [Google Scholar]
37.Suminski AJ, Tkach DC, Fagg AH, Hatsopoulos NG. Incorporating Feedback from Multiple Sensory Modalities Enhances Brain–Machine Interface Control. Journal of Neuroscience. 2010;30(50):16777–16787. doi: 10.1523/JNEUROSCI.3967-10.2010 [DOI] [PMC free article] [PubMed] [Google Scholar]
38.Taylor DM, Tillery SIH, Schwartz AB. Direct Cortical Control of 3D Neuroprosthetic Devices. Science. 2002;296(5574):1829–1832. doi: 10.1126/science.1070291 [DOI] [PubMed] [Google Scholar]
39.Ganguly K, Carmena JM. Emergence of a Stable Cortical Map for Neuroprosthetic Control. PLoS Biology. 2009;7(7):e1000153. doi: 10.1371/journal.pbio.1000153 [DOI] [PMC free article] [PubMed] [Google Scholar]
40.Orsborn A, Moorman H, Overduin S, Shanechi M, Dimitrov D, Carmena J. Closed-Loop Decoder Adaptation Shapes Neural Plasticity for Skillful Neuroprosthetic Control. Neuron. 2014;82(6):1380–1393. doi: 10.1016/j.neuron.2014.04.048 [DOI] [PubMed] [Google Scholar]
41.Fetz EE. Volitional control of neural activity: implications for brain–computer interfaces. The Journal of Physiology. 2007;579(3):571–579. doi: 10.1113/jphysiol.2006.127142 [DOI] [PMC free article] [PubMed] [Google Scholar]
42.Moritz CT, Fetz EE. Volitional control of single cortical neurons in a brain–machine interface. Journal of Neural Engineering. 2011;8(2):025017. doi: 10.1088/1741-2560/8/2/025017 [DOI] [PMC free article] [PubMed] [Google Scholar]
43.Lansdell B, Milovanovic I, Mellema C, Fetz EE, Fairhall AL, Moritz CT. Reconfiguring Motor Circuits for a Joint Manual and BCI Task. IEEE Transactions on Neural Systems and Rehabilitation Engineering. 2020;28(1):248–257. doi: 10.1109/TNSRE.2019.2944347 [DOI] [PMC free article] [PubMed] [Google Scholar]
44.Arduin PJ, Frégnac Y, Shulz DE, Ego-Stengel V. “Master” Neurons Induced by Operant Conditioning in Rat Motor Cortex during a Brain-Machine Interface Task. Journal of Neuroscience. 2013;33(19):8308–8320. doi: 10.1523/JNEUROSCI.2744-12.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
45.Law AJ, Rivlis G, Schieber MH. Rapid acquisition of novel interface control by small ensembles of arbitrarily selected primary motor cortex neurons. Journal of Neurophysiology. 2014;112(6):1528–1548. doi: 10.1152/jn.00373.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]
46.Badreldin I, Southerland J, Vaidya M, Eleryan A, Balasubramanian K, Fagg A, et al. Unsupervised decoder initialization for brain-machine interfaces using neural state space dynamics. In: 2013 6th International IEEE/EMBS Conference on Neural Engineering (NER); 2013. p. 997–1000.
47.Li S, Li J, Li Z. An Improved Unscented Kalman Filter Based Decoder for Cortical Brain-Machine Interfaces. Frontiers in Neuroscience. 2016;10. doi: 10.3389/fnins.2016.00587 [DOI] [PMC free article] [PubMed] [Google Scholar]
48.Dai J, Zhang P, Sun H, Qiao X, Zhao Y, Ma J, et al. Reliability of motor and sensory neural decoding by threshold crossings for intracortical brain–machine interface. Journal of Neural Engineering. 2019;16(3):036011. doi: 10.1088/1741-2552/ab0bfb [DOI] [PubMed] [Google Scholar]
49.Jarosiewicz B, Chase SM, Fraser GW, Velliste M, Kass RE, Schwartz AB. Functional network reorganization during learning in a brain-computer interface paradigm. Proceedings of the National Academy of Sciences of the United States of America. 2008;105(49):19486–19491. doi: 10.1073/pnas.0808113105 [DOI] [PMC free article] [PubMed] [Google Scholar]
50.Sadtler PT, Quick KM, Golub MD, Chase SM, Ryu SI, Tyler-Kabara EC, et al. Neural constraints on learning. Nature. 2014;512(7515):423–426. doi: 10.1038/nature13665 [DOI] [PMC free article] [PubMed] [Google Scholar]
51.Zhou X, Tien RN, Ravikumar S, Chase SM. Distinct types of neural reorganization during long-term learning. Journal of Neurophysiology. 2019;121(4):1329–1341. doi: 10.1152/jn.00466.2018 [DOI] [PMC free article] [PubMed] [Google Scholar]
52.Oby ER, Golub MD, Hennig JA, Degenhart AD, Tyler-Kabara EC, Yu BM, et al. New neural activity patterns emerge with long-term learning. Proceedings of the National Academy of Sciences. 2019;116(30):15210–15215. doi: 10.1073/pnas.1820296116 [DOI] [PMC free article] [PubMed] [Google Scholar]
53.Golub MD, Sadtler PT, Oby ER, Quick KM, Ryu SI, Tyler-Kabara EC, et al. Learning by neural reassociation. Nature Neuroscience. 2018;21(4):607–616. doi: 10.1038/s41593-018-0095-3 [DOI] [PMC free article] [PubMed] [Google Scholar]
54.Hwang E, Bailey P, Andersen R. Volitional Control of Neural Activity Relies on the Natural Motor Repertoire. Current Biology. 2013;23(5):353–361. doi: 10.1016/j.cub.2013.01.027 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Supplementary Materials

S1 Fig. Neuronal PD change across learning, monkey T.

(DOCX)

Click here for additional data file.^{(150KB, docx)}

S2 Fig. Neuronal PD change across learning, monkey K. Same notation as S1 Fig.

(DOCX)

Click here for additional data file.^{(132.7KB, docx)}

S3 Fig. Distance between neuronal PD and assigned directions AD across learning, both monkeys.

(DOCX)

Click here for additional data file.^{(321.4KB, docx)}

S1 Table. Task difference between the two monkeys.

(DOCX)

Click here for additional data file.^{(23.5KB, docx)}

Data Availability Statement

[pone.0286742.ref001] 1.Fetz EE. Operant Conditioning of Cortical Unit Activity. Science. 1969;163(3870):955–958. doi: 10.1126/science.163.3870.955 [DOI] [PubMed] [Google Scholar]

[pone.0286742.ref002] 2.Velliste M, Perel S, Spalding MC, Whitford AS, Schwartz AB. Cortical control of a prosthetic arm for self-feeding. Nature. 2008;453(7198):1098–1101. doi: 10.1038/nature06996 [DOI] [PubMed] [Google Scholar]

[pone.0286742.ref003] 3.Hochberg LR, Bacher D, Jarosiewicz B, Masse NY, Simeral JD, Vogel J, et al. Reach and grasp by people with tetraplegia using a neurally controlled robotic arm. Nature. 2012;485(7398):372–375. doi: 10.1038/nature11076 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref004] 4.Balasubramanian K, Southerland J, Vaidya M, Qian K, Eleryan A, Fagg AH, et al. Operant conditioning of a multiple degree-of-freedom brain-machine interface in a primate model of amputation. In: 2013 35th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC); 2013. p. 303–306. [DOI] [PubMed]

[pone.0286742.ref005] 5.Serruya MD, Hatsopoulos NG, Paninski L, Fellows MR, Donoghue JP. Instant neural control of a movement signal. Nature. 2002;416(6877):141–142. doi: 10.1038/416141a [DOI] [PubMed] [Google Scholar]

[pone.0286742.ref006] 6.Li Z, O’Doherty JE, Hanson TL, Lebedev MA, Henriquez CS, Nicolelis MAL. Unscented Kalman Filter for Brain-Machine Interfaces. PLOS ONE. 2009;4(7):e6243. doi: 10.1371/journal.pone.0006243 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref007] 7.Gilja V, Pandarinath C, Blabe CH, Nuyujukian P, Simeral JD, Sarma AA, et al. Clinical translation of a high-performance neural prosthesis. Nature Medicine. 2015;21(10):1142–1145. doi: 10.1038/nm.3953 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref008] 8.Pandarinath C, Nuyujukian P, Blabe CH, Sorice BL, Saab J, Willett FR, et al. High performance communication by people with paralysis using an intracortical brain-computer interface. eLife. 2017;6:e18554. doi: 10.7554/eLife.18554 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref009] 9.Nuyujukian P, Sanabria JA, Saab J, Pandarinath C, Jarosiewicz B, Blabe CH, et al. Cortical control of a tablet computer by people with paralysis. PLOS ONE. 2018;13(11):e0204566. doi: 10.1371/journal.pone.0204566 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref010] 10.Vouga T, Zhuang KZ, Olivier J, Lebedev MA, Nicolelis MAL, Bouri M, et al. EXiO—A Brain-Controlled Lower Limb Exoskeleton for Rhesus Macaques. IEEE Transactions on Neural Systems and Rehabilitation Engineering. 2017;25(2):131–141. doi: 10.1109/TNSRE.2017.2659654 [DOI] [PubMed] [Google Scholar]

[pone.0286742.ref011] 11.Moritz CT, Perlmutter SI, Fetz EE. Direct control of paralysed muscles by cortical neurons. Nature. 2008;456(7222):639–642. doi: 10.1038/nature07418 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref012] 12.Bouton CE, Shaikhouni A, Annetta NV, Bockbrader MA, Friedenberg DA, Nielson DM, et al. Restoring cortical control of functional movement in a human with quadriplegia. Nature. 2016;533(7602):247–250. doi: 10.1038/nature17435 [DOI] [PubMed] [Google Scholar]

[pone.0286742.ref013] 13.Guggenmos DJ, Azin M, Barbay S, Mahnken JD, Dunham C, Mohseni P, et al. Restoration of function after brain damage using a neural prosthesis. Proceedings of the National Academy of Sciences. 2013;110(52):21177–21182. doi: 10.1073/pnas.1316885110 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref014] 14.Friedenberg DA, Schwemmer MA, Landgraf AJ, Annetta NV, Bockbrader MA, Bouton CE, et al. Neuroprosthetic-enabled control of graded arm muscle contraction in a paralyzed human. Scientific Reports. 2017;7(1):8386. doi: 10.1038/s41598-017-08120-9 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref015] 15.Fetz EE, Finocchio DV. Correlations between activity of motor cortex cells and arm muscles during operantly conditioned response patterns. Experimental Brain Research. 1975;23(3):217–240. doi: 10.1007/BF00239736 [DOI] [PubMed] [Google Scholar]

[pone.0286742.ref016] 16.Kennedy PR, Bakay RAE, Moore MM, Adams K, Goldwaithe J. Direct control of a computer from the human central nervous system. IEEE Transactions on Rehabilitation Engineering. 2000;8(2):198–202. doi: 10.1109/86.847815 [DOI] [PubMed] [Google Scholar]

[pone.0286742.ref017] 17.Nicolelis MAL, Dimitrov D, Carmena JM, Crist R, Lehew G, Kralik JD, et al. Chronic, multisite, multielectrode recordings in macaque monkeys. Proceedings of the National Academy of Sciences. 2003;100(19):11041–11046. doi: 10.1073/pnas.1934665100 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref018] 18.Georgopoulos AP, Schwartz AB, Kettner RE. Neuronal Population Coding of Movement Direction. Science, New Series. 1986;233(4771):1416–1419. doi: 10.1126/science.3749885 [DOI] [PubMed] [Google Scholar]

[pone.0286742.ref019] 19.Carmena JM, Lebedev MA, Crist RE, O’Doherty JE, Santucci DM, Dimitrov DF, et al. Learning to Control a Brain–Machine Interface for Reaching and Grasping by Primates. PLOS Biology. 2003;1(2):e42. doi: 10.1371/journal.pbio.0000042 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref020] 20.Hochberg LR, Serruya MD, Friehs GM, Mukand JA, Saleh M, Caplan AH, et al. Neuronal ensemble control of prosthetic devices by a human with tetraplegia. Nature. 2006;442(7099):164–171. doi: 10.1038/nature04970 [DOI] [PubMed] [Google Scholar]

[pone.0286742.ref021] 21.Lebedev MA, Carmena JM, O’Doherty JE, Zacksenhouse M, Henriquez CS, Principe JC, et al. Cortical Ensemble Adaptation to Represent Velocity of an Artificial Actuator Controlled by a Brain-Machine Interface. Journal of Neuroscience. 2005;25(19):4681–4693. doi: 10.1523/JNEUROSCI.4088-04.2005 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref022] 22.Wu H, Feng J, Zeng Y. Neural Decoding for Macaque’s Finger Position: Convolutional Space Model. IEEE Transactions on Neural Systems and Rehabilitation Engineering. 2019;27(3):543–551. doi: 10.1109/TNSRE.2019.2893406 [DOI] [PubMed] [Google Scholar]

[pone.0286742.ref023] 23.Orsborn AL, Dangi S, Moorman HG, Carmena JM. Closed-Loop Decoder Adaptation on Intermediate Time-Scales Facilitates Rapid BMI Performance Improvements Independent of Decoder Initialization Conditions. IEEE Transactions on Neural Systems and Rehabilitation Engineering. 2012;20(4):468–477. doi: 10.1109/TNSRE.2012.2185066 [DOI] [PubMed] [Google Scholar]

[pone.0286742.ref024] 24.Gao Y, Black MJ, Bienenstock E, Shoham S, Donoghue JP. Probabilistic inference of hand motion from neural activity in motor cortex. In: Proceedings of the 14th International Conference on Neural Information Processing Systems: Natural and Synthetic. NIPS’01. Cambridge, MA, USA: MIT Press; 2001. p. 213–220.

[pone.0286742.ref025] 25.Sanchez JC, Sung-Phil Kim, Erdogmus D, Rao YN, Principe JC, Wessberg J, et al. Input-output mapping performance of linear and nonlinear models for estimating hand trajectories from cortical neuronal firing patterns. In: Proceedings of the 12th IEEE Workshop on Neural Networks for Signal Processing. Martigny, Switzerland: IEEE; 2002. p. 139–148. http://ieeexplore.ieee.org/document/1030025/.

[pone.0286742.ref026] 26.Sussillo D, Nuyujukian P, Fan JM, Kao JC, Stavisky SD, Ryu S, et al. A recurrent neural network for closed-loop intracortical brain–machine interface decoders. Journal of Neural Engineering. 2012;9(2):026027. doi: 10.1088/1741-2560/9/2/026027 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref027] 27.Tseng PH, Urpi NA, Lebedev M, Nicolelis M. Decoding Movements from Cortical Ensemble Activity Using a Long Short-Term Memory Recurrent Network. Neural Computation. 2019;31(6):1085–1113. doi: 10.1162/neco_a_01189 [DOI] [PubMed] [Google Scholar]

[pone.0286742.ref028] 28.Shpigelman L, Crammer K, Paz R, Vaadia E, Singer Y. A temporal kernel-based model for tracking hand-movements from neural activities. In: Proceedings of the 17th International Conference on Neural Information Processing Systems. NIPS’04. Cambridge, MA, USA: MIT Press; 2004. p. 1273–1280.

[pone.0286742.ref029] 29.Hao Y, Zhang Q, Controzzi M, Cipriani C, Li Y, Li J, et al. Distinct neural patterns enable grasp types decoding in monkey dorsal premotor cortex. Journal of Neural Engineering. 2014;11(6):066011. doi: 10.1088/1741-2560/11/6/066011 [DOI] [PubMed] [Google Scholar]

[pone.0286742.ref030] 30.Fisher J, Black MJ. Motor Cortical Decoding Using an Autoregressive Moving Average Model. In: 2005 IEEE Engineering in Medicine and Biology 27th Annual Conference. Shanghai, China: IEEE; 2005. p. 2130–2133. http://ieeexplore.ieee.org/document/1616881/. [DOI] [PubMed]

[pone.0286742.ref031] 31.Shpigelman L, Lalazar H, Vaadia E. Kernel-ARMA for Hand Tracking and Brain-Machine interfacing During 3D Motor Control. Advances in Neural Information Processing Systems. 2008;21:1489–1496. [Google Scholar]

[pone.0286742.ref032] 32.Brandman DM, Burkhart MC, Kelemen J, Franco B, Harrison MT, Hochberg LR. Robust Closed-Loop Control of a Cursor in a Person with Tetraplegia using Gaussian Process Regression. Neural Computation. 2018;30(11):2986–3008. doi: 10.1162/neco_a_01129 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref033] 33.Makin JG, O’Doherty JE, Cardoso MMB, Sabes PN. Superior arm-movement decoding from cortex with a new, unsupervised-learning algorithm. Journal of Neural Engineering. 2018;15(2):026010. doi: 10.1088/1741-2552/aa9e95 [DOI] [PubMed] [Google Scholar]

[pone.0286742.ref034] 34.Eden UT, Frank LM, Barbieri R, Solo V, Brown EN. Dynamic Analysis of Neural Encoding by Point Process Adaptive Filtering. Neural Computation. 2004;16(5):971–998. doi: 10.1162/089976604773135069 [DOI] [PubMed] [Google Scholar]

[pone.0286742.ref035] 35.Wang Y, Paiva ARC, Principe JC. A Monte Carlo Sequential Estimation for Point Process Optimum Filtering. In: The 2006 IEEE International Joint Conference on Neural Network Proceedings; 2006. p. 1846–1850.

[pone.0286742.ref036] 36.Koyama S, Chase SM, Whitford AS, Velliste M, Schwartz AB, Kass RE. Comparison of brain–computer interface decoding algorithms in open-loop and closed-loop control. J Comput Neurosci. 2010; p. 15. doi: 10.1007/s10827-009-0196-9 [DOI] [PubMed] [Google Scholar]

[pone.0286742.ref037] 37.Suminski AJ, Tkach DC, Fagg AH, Hatsopoulos NG. Incorporating Feedback from Multiple Sensory Modalities Enhances Brain–Machine Interface Control. Journal of Neuroscience. 2010;30(50):16777–16787. doi: 10.1523/JNEUROSCI.3967-10.2010 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref038] 38.Taylor DM, Tillery SIH, Schwartz AB. Direct Cortical Control of 3D Neuroprosthetic Devices. Science. 2002;296(5574):1829–1832. doi: 10.1126/science.1070291 [DOI] [PubMed] [Google Scholar]

[pone.0286742.ref039] 39.Ganguly K, Carmena JM. Emergence of a Stable Cortical Map for Neuroprosthetic Control. PLoS Biology. 2009;7(7):e1000153. doi: 10.1371/journal.pbio.1000153 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref040] 40.Orsborn A, Moorman H, Overduin S, Shanechi M, Dimitrov D, Carmena J. Closed-Loop Decoder Adaptation Shapes Neural Plasticity for Skillful Neuroprosthetic Control. Neuron. 2014;82(6):1380–1393. doi: 10.1016/j.neuron.2014.04.048 [DOI] [PubMed] [Google Scholar]

[pone.0286742.ref041] 41.Fetz EE. Volitional control of neural activity: implications for brain–computer interfaces. The Journal of Physiology. 2007;579(3):571–579. doi: 10.1113/jphysiol.2006.127142 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref042] 42.Moritz CT, Fetz EE. Volitional control of single cortical neurons in a brain–machine interface. Journal of Neural Engineering. 2011;8(2):025017. doi: 10.1088/1741-2560/8/2/025017 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref043] 43.Lansdell B, Milovanovic I, Mellema C, Fetz EE, Fairhall AL, Moritz CT. Reconfiguring Motor Circuits for a Joint Manual and BCI Task. IEEE Transactions on Neural Systems and Rehabilitation Engineering. 2020;28(1):248–257. doi: 10.1109/TNSRE.2019.2944347 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref044] 44.Arduin PJ, Frégnac Y, Shulz DE, Ego-Stengel V. “Master” Neurons Induced by Operant Conditioning in Rat Motor Cortex during a Brain-Machine Interface Task. Journal of Neuroscience. 2013;33(19):8308–8320. doi: 10.1523/JNEUROSCI.2744-12.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref045] 45.Law AJ, Rivlis G, Schieber MH. Rapid acquisition of novel interface control by small ensembles of arbitrarily selected primary motor cortex neurons. Journal of Neurophysiology. 2014;112(6):1528–1548. doi: 10.1152/jn.00373.2013 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref046] 46.Badreldin I, Southerland J, Vaidya M, Eleryan A, Balasubramanian K, Fagg A, et al. Unsupervised decoder initialization for brain-machine interfaces using neural state space dynamics. In: 2013 6th International IEEE/EMBS Conference on Neural Engineering (NER); 2013. p. 997–1000.

[pone.0286742.ref047] 47.Li S, Li J, Li Z. An Improved Unscented Kalman Filter Based Decoder for Cortical Brain-Machine Interfaces. Frontiers in Neuroscience. 2016;10. doi: 10.3389/fnins.2016.00587 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref048] 48.Dai J, Zhang P, Sun H, Qiao X, Zhao Y, Ma J, et al. Reliability of motor and sensory neural decoding by threshold crossings for intracortical brain–machine interface. Journal of Neural Engineering. 2019;16(3):036011. doi: 10.1088/1741-2552/ab0bfb [DOI] [PubMed] [Google Scholar]

[pone.0286742.ref049] 49.Jarosiewicz B, Chase SM, Fraser GW, Velliste M, Kass RE, Schwartz AB. Functional network reorganization during learning in a brain-computer interface paradigm. Proceedings of the National Academy of Sciences of the United States of America. 2008;105(49):19486–19491. doi: 10.1073/pnas.0808113105 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref050] 50.Sadtler PT, Quick KM, Golub MD, Chase SM, Ryu SI, Tyler-Kabara EC, et al. Neural constraints on learning. Nature. 2014;512(7515):423–426. doi: 10.1038/nature13665 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref051] 51.Zhou X, Tien RN, Ravikumar S, Chase SM. Distinct types of neural reorganization during long-term learning. Journal of Neurophysiology. 2019;121(4):1329–1341. doi: 10.1152/jn.00466.2018 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref052] 52.Oby ER, Golub MD, Hennig JA, Degenhart AD, Tyler-Kabara EC, Yu BM, et al. New neural activity patterns emerge with long-term learning. Proceedings of the National Academy of Sciences. 2019;116(30):15210–15215. doi: 10.1073/pnas.1820296116 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref053] 53.Golub MD, Sadtler PT, Oby ER, Quick KM, Ryu SI, Tyler-Kabara EC, et al. Learning by neural reassociation. Nature Neuroscience. 2018;21(4):607–616. doi: 10.1038/s41593-018-0095-3 [DOI] [PMC free article] [PubMed] [Google Scholar]

[pone.0286742.ref054] 54.Hwang E, Bailey P, Andersen R. Volitional Control of Neural Activity Relies on the Natural Motor Repertoire. Current Biology. 2013;23(5):353–361. doi: 10.1016/j.cub.2013.01.027 [DOI] [PMC free article] [PubMed] [Google Scholar]

PERMALINK

Rhesus monkeys learn to control a directional-key inspired brain machine interface via bio-feedback

Chenguang Zhang

Hao Wang

Shaohua Tang

Zheng Li

Roles

Abstract

Introduction

Materials and methods

Algorithm design

Group weight algorithm design

Fig 1. Bio-feedback BMI paradigm, array implantation, and neuronal signals.

Neuron grouping

Normalization method

Animals, surgery, and data recording

Task design

Experiment design

Data analysis

Calculation of random baseline for success rate

Results

Task performance

Overall performance

Fig 2. Monkey task performance.

Trajectories occupancy

Fig 3. Cursor trajectories clustered through time.

Trajectory straightness

Fig 4. Trajectory scores decrease during learning.

Group firing rate analysis

Fig 5. Output-null value distribution change little.

Fig 6. Output-potent activity increase significantly.

Tuning change through learning

Fig 7. Tuning depth and R2 of direct and indirect neurons indicate learning occurred.

Fig 8. The tuning change of individual neurons.

Discussion

BMI learning

Bio-mimetic and bio-feedback BMI

Conclusion

Supporting information

Acknowledgments

Data Availability

Funding Statement

References

Associated Data

Supplementary Materials

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases

Fig 7. Tuning depth and R² of direct and indirect neurons indicate learning occurred.