Abstract
For people suffering from severe paralysis, communication can be difficult or nearly impossible. Systems called brain-computer interfaces (BCIs) are being developed to assist these people with communication by using their brain activity to control a computer without any muscle activity. To benefit the development of BCIs that employ neural activity related to speech, we investigated whether neural activity patterns related to different articulator movements can be distinguished from each other. Using electrocorticography (ECoG), we recorded the neural activity related to different articulator movements in 4 epilepsy patients and classified which articulator each participant moved based on the activity patterns in the sensorimotor cortex. The same was done for different movement directions of a single articulator, the tongue. In both experiments highly accurate classification was obtained: on average 92% for different articulators and 85% for different tongue directions. Furthermore, the data show that only a small part of the sensorimotor cortex is needed for classification (ca. 1 cm²). We show that recordings from small parts of the sensorimotor cortex contain information about different articulator movements that might be used for BCI control. Our results are of interest for BCI systems that aim to decode neural activity related to (actual or attempted) movements from a contained cortical area.
Subject terms: Brain-machine interface, Neuromuscular disease
Introduction
Even though speech has been suggested to involve more than one hundred muscles1–3 and the control of articulator movements is very precise and complex, most people can, after several years of training during childhood, speak without much effort. However, for some people with very severe forms of paralysis, control of the articulators is completely absent and communication by speech is severely impaired or even impossible4–6. This situation, in which people have intact cognition but no or very limited ways to communicate due to paralysis, is called Locked-In Syndrome (LIS)4. People with LIS may benefit from current developments in assistive communication technology. A brain-computer interface (BCI), for instance, enables people with LIS to control a computer without muscle involvement, using only brain activity. Brain activity from the sensorimotor cortex (SMC) has been the focus of a large part of BCI research, as it can be used as a source of control signals: neural activity related to hand movements can be reliably recorded with implanted electrodes7–9 and used for BCI control in a home-use setting10.
Recent studies have, however, also investigated the neural basis of speech movements for BCI control. Conceptually, the direct identification of internally spoken words from the SMC activity generated by the associated attempted articulator movements would provide people with LIS with a highly intuitive and muscle-independent manner of communication. Several studies have now demonstrated that it is indeed possible to decode phonemes, sounds and words using SMC activity measured with electrocorticographic (ECoG) electrodes (e.g.11–16). In addition, whole sentences could be reconstructed from neural activity during natural speech (word accuracy around 75% in a set of 10 words14). Yet, solving the problem of reliable, real-time speech decoding is highly complex due to the high dimensionality of speech, and may therefore benefit from a thorough understanding of how the involved articulators are represented in the SMC and from an assessment of which SMC subareas provide the best classification results. Recent work showed a high level of speech decoding from perisylvian regions combined, including auditory cortex responding to the generated speech17. Given the lack of auditory feedback in LIS, an earlier functional magnetic resonance imaging (fMRI) study by Bleichner et al.18 investigated the sensorimotor cortex activity patterns selectively for isolated movements of different articulators (larynx, tongue, lips and jaw) and showed that these can be distinguished (i.e. classified) from each other with high accuracy (ca. 90%). However, in contrast to fMRI, which measures throughout the whole depth of the SMC (where electrodes for BCI implants cannot reach), ECoG measures from the surface of the cortex, and it therefore remains to be determined whether similar accuracy levels can be achieved for articulator movement classification using ECoG surface electrodes. In addition, it has been suggested that during speech different positions of the same articulator result in different SMC activity patterns19,20, although more research is needed to investigate the extent of this.
Therefore, we investigated whether movements of different articulators (lips, tongue, jaw, larynx), as well as different movements of the same articulator, can be distinguished from one another based on sensorimotor cortex activity recorded with ECoG. For the latter we chose the tongue, since it is an important articulator in differentiating sounds21,22, the sound a person produced can be predicted based solely on information about tongue position23, and the tongue can easily be moved in different directions. In addition, we investigated which SMC areas contained the most information about different articulator and tongue movements. Finally, as it is important to minimize electrode-grid sizes to keep surgical risks low, we investigated the minimal grid size at which accurate classification was possible.
We analysed high-density ECoG recordings from the sensorimotor cortex of 4 participants with epilepsy. A template-matching classification method with 10-fold cross-validation was used to classify different articulator movements and different tongue movement directions based on spatial or spatio-temporal SMC activity patterns. We focused on high-frequency band (HFB; 60–130 Hz) power changes in the SMC, since HFB power has been shown to correlate well with articulator movements24,25 and has been linked to neural firing26–28.
Methods
Participants
Four patients (A–D, age 19–41 y; median 34.5 y, 2 females) participated in this study while they were being treated for epilepsy at the University Medical Center Utrecht. Subdural clinical ECoG electrodes (inter-electrode distance 1 cm) were implanted sub-chronically (subjects A, B and D) or were used during awake surgery (subject C) to record neural signals for clinical purposes. For research purposes, a non-clinical high-density (HD) electrode grid was placed (sub-chronically or temporarily during awake surgery) over a clinically non-relevant area (the sensorimotor cortex) with the consent of the participants. HD electrodes were 1 or 1.17 mm in diameter with a 3- or 4-mm inter-electrode distance (Table 1). For the current study, only signals recorded with the HD electrodes were used in the analyses.
Table 1.
Subject | A | B | C | D |
---|---|---|---|---|
Tasks performed | | | | |
Articulator Task run 1 | X | X | X | X |
Articulator Task run 2 | X | X | | |
Tongue Task (inside mouth) run 1 | X | X | | X |
Tongue Task (inside mouth) run 2 | | X | | |
Tongue Task (outside mouth) | X | X | | |
Electrode diameter (mm) | 1 | 1.17 | 1 | 1 |
Inter-electrode distance (mm) | 4 | 4 | 3 | 3 |
Number of recorded electrodes | 64 | 128 | 128 | 128 |
In the top section, we show which subject did which task. In the bottom section, we indicate per subject the details of the electrode grids.
This study was conducted in accordance with the Declaration of Helsinki (2013) and was approved by the ethics committee of the University Medical Center Utrecht. Written informed consent was given by all subjects.
Tasks
The participants performed two tasks. First, we wanted to see if we could replicate earlier fMRI findings by Bleichner et al.18 showing that movements of different articulators can be classified from SMC neural activity. Therefore, in the first task (the ‘Articulator Task’), subjects were asked to make four different articulator movements: (1) pursing the lips, (2) clenching the teeth, (3) moving the tongue from left to right behind the teeth, and (4) making a ‘mmm’ sound. These movements involve the lips, jaw, tongue and larynx, respectively, and were chosen to be similar to those used by Bleichner and colleagues18. For the patients with subchronic electrode implants, the task was presented on a computer screen at a comfortable distance from the participant. During the intraoperative recordings, the task was presented on a tablet attached to a pole such that it was clearly visible to the participant. A trial started with a 1500-ms visual cue: the Dutch word ‘lippen’ (lips), ‘tanden’ (teeth), ‘tong’ (tongue), or ‘mmm’. This visual cue instructed the subjects to start the movement and hold it for as long as the cue was visible. Movement trials were interleaved with rest trials, indicated by a ‘-’ symbol, in which the subject was instructed not to move and to keep the articulators in a comfortable, neutral position. Trials were followed by an inter-trial interval of 1500 ms (subjects A, B & D) or 2000 ms (subject C) during which a fixation cross was presented. Each movement was repeated 20 times in random order, as were the rest trials (20 in total).
In the second task (the ‘Tongue Task’), the subjects were instructed to make four different tongue movements inside the mouth: (1) up, (2) down, (3) left and (4) right from the rest position. This task was similar to the first, except that the visual cues were now arrows pointing in the corresponding direction. During the rest trials, subjects were instructed not to move the tongue in any specific direction but to keep it in a comfortable position at the bottom of the mouth cavity.
Some subjects performed additional runs of the experiments (Table 1). Subject C performed only one task (the Articulator Task) due to limited time in the intraoperative setting. Subjects A & B also performed the Tongue Task once with the tongue outside the mouth, during which they continuously stuck their tongue out and moved it in the indicated direction.
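To make the trial structure concrete, the following minimal Python sketch generates a randomized cue sequence of the kind described above. It is illustrative only: the function name, seed handling and the use of a single shuffle to interleave movement and rest trials are our assumptions, not the authors' stimulus code.

```python
import random

# Dutch cue words used in the Articulator Task; 'rest' stands in for the '-' cue.
CLASSES = ("lippen", "tanden", "tong", "mmm")

def make_trial_sequence(n_per_class=20, n_rest=20, seed=0):
    """Return a randomized list of 100 trials: 20 per movement plus 20 rest.

    Each cue was shown for 1.5 s and followed by a 1.5 s inter-trial
    interval (2.0 s for subject C).
    """
    trials = [c for c in CLASSES for _ in range(n_per_class)] + ["rest"] * n_rest
    random.Random(seed).shuffle(trials)  # interleave movement and rest trials
    return trials

print(make_trial_sequence()[:6])  # first few cues of a randomized run
```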
Data acquisition & preprocessing
The sampling frequency of the neural signal recordings was 512 Hz (subject A; Micromed, Treviso, Italy) or 2000 Hz (subjects B, C & D; Blackrock Microsystems LLC, Salt Lake City, USA). In total, we recorded from 64 electrodes for subject A and from 128 electrodes for subjects B–D. For all subjects except subject C, the electrode positions were identified from a post-operative computed tomography (CT) scan and subsequently plotted on a 3D surface rendering of the subject's pre-operative magnetic resonance imaging (MRI) scan29,30. For subject C, no CT scan was available because the data were recorded intraoperatively; the electrode positions were therefore estimated from the correlation between the HFB power pattern over the grid (during rest) and the underlying anatomical structure (e.g. the location of sulci or blood vessels). This method has been validated for localization of HD electrodes31.
We used Matlab software (The Mathworks, Inc., Natick, MA, USA) for all data analysis. First, flat or noisy electrodes were removed from further analysis, as in Salari et al.32. For the remaining electrodes, we applied a notch filter to remove line noise and its harmonics, followed by common-average re-referencing to remove any remaining shared artefacts. Subsequently, we used a Gabor wavelet33 to calculate the high-frequency band (60–130 Hz) power per sample point for each included electrode. The Gabor wavelet was computed with a full width at half maximum (FWHM) of 4 wavelets per frequency, in bins of 1 Hz, for all frequencies between 60 and 130 Hz. The results were log-transformed (10*log10) and averaged over frequencies to create the HFB power signal per electrode. Finally, we determined for each electrode whether it was responsive to the task. To that purpose, we computed, per included electrode and per movement, the r²-value by correlating the mean activity levels during active periods (from cue until the end of a trial) and rest periods (rest trials) with the task design. The significance of each electrode's r²-value was determined using a Monte Carlo distribution (shuffling active and rest labels over 10,000 permutations, alpha = 0.05, false discovery rate corrected). This was done once with all trials, to determine the spatial activity pattern for each movement, and again during classification for electrode selection, in which case, because of the 10-fold cross-validation procedure (see below), significant electrodes were selected based on the training set only (excluding the test trials). Electrodes that did not show a significant HFB power response to any of the movements of a task were removed from the analysis of that task; selecting them per fold on the training set ensured that feature selection and model training were not affected by the test set (the trials that had to be classified).
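To make this pipeline concrete, here is a minimal Python sketch (the authors used Matlab) of the notch filtering, common-average re-referencing, Gabor-wavelet HFB extraction and the r² permutation test. Function names, the 50-Hz line frequency and the notch quality factor Q = 35 are our illustrative assumptions; the wavelet is a complex Morlet with a Gaussian envelope of 4 cycles FWHM per 1-Hz bin, our reading of the settings described above.

```python
import numpy as np
from scipy.signal import iirnotch, filtfilt, fftconvolve

def preprocess(data, fs, line_freq=50):
    """Notch out line noise and harmonics, then common-average re-reference.

    data: (n_electrodes, n_samples) array of the included channels.
    """
    for f0 in (line_freq, 2 * line_freq, 3 * line_freq):
        b, a = iirnotch(f0, Q=35, fs=fs)
        data = filtfilt(b, a, data, axis=-1)
    return data - data.mean(axis=0, keepdims=True)  # common average reference

def hfb_power(data, fs, f_lo=60, f_hi=130, fwhm_cycles=4):
    """HFB power per sample via complex Morlet (Gabor) wavelets.

    One wavelet per 1-Hz bin between f_lo and f_hi; log power (10*log10)
    is averaged over bins into a single HFB trace per electrode.
    """
    logpow = np.zeros(data.shape)
    for f in range(f_lo, f_hi + 1):
        sigma = (fwhm_cycles / f) / (2 * np.sqrt(2 * np.log(2)))  # FWHM -> SD
        t = np.arange(-4 * sigma, 4 * sigma, 1 / fs)
        wav = np.exp(2j * np.pi * f * t - t**2 / (2 * sigma**2))
        wav /= np.abs(wav).sum()
        analytic = np.stack([fftconvolve(ch, wav, mode="same") for ch in data])
        logpow += 10 * np.log10(np.abs(analytic) ** 2 + 1e-12)
    return logpow / (f_hi - f_lo + 1)  # (electrodes, samples)

def r2_permutation(active_means, rest_means, n_perm=10_000, seed=0):
    """r^2 between trial labels and per-trial HFB means, with permutation p."""
    rng = np.random.default_rng(seed)
    vals = np.concatenate([active_means, rest_means])
    labels = np.concatenate([np.ones(len(active_means)), np.zeros(len(rest_means))])
    r2_obs = np.corrcoef(labels, vals)[0, 1] ** 2
    null = np.array([np.corrcoef(rng.permutation(labels), vals)[0, 1] ** 2
                     for _ in range(n_perm)])
    return r2_obs, (null >= r2_obs).mean()
```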
Classification procedures
For classification, we used a template-matching procedure (see below for details) similar to one used successfully before for the classification of speech sounds and hand gestures7,15. First, we smoothed the signal of each electrode with a 0.5-second moving-average window around each sample point, which has previously been shown to be an optimal setting for classification of movements from ECoG recordings34. Note that analyses with shorter smoothing kernels (down to 0.1 seconds) did not improve classification results and are therefore not presented here. Next, we z-scored the smoothed signal and epoched the result into trials of 2 seconds, each starting at cue onset. We then performed two types of template-matching classification with 10-fold cross-validation, as explained below.
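Continuing the sketch above (same assumptions), the smoothing, z-scoring and epoching steps could look as follows; `cue_onsets` and the function name are illustrative.

```python
import numpy as np

def epoch_trials(hfb, fs, cue_onsets, smooth_s=0.5, trial_s=2.0):
    """Smooth with a 0.5-s moving average, z-score per electrode, cut 2-s epochs.

    hfb: (n_electrodes, n_samples); cue_onsets: trial-start sample indices.
    Returns (n_trials, n_electrodes, trial_samples).
    """
    k = int(smooth_s * fs)
    kernel = np.ones(k) / k
    smoothed = np.stack([np.convolve(ch, kernel, mode="same") for ch in hfb])
    z = ((smoothed - smoothed.mean(axis=1, keepdims=True))
         / smoothed.std(axis=1, keepdims=True))
    n = int(trial_s * fs)
    return np.stack([z[:, s:s + n] for s in cue_onsets])
```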
First, we discarded time and created a spatial power template for each of the movements by averaging the power signal of each electrode over the whole trial period (i.e. spatial classification). For the 10-fold cross-validation, trials were split into 10 equal sets (each containing 8 trials, 2 of each class). Each set was held out once and used for classification, while the remaining trials were used to create the spatial templates of the four conditions by averaging (per electrode) the values of all remaining trials within one class. These templates were then correlated with the held-out trials, and for each trial the template with the highest correlation was chosen in a winner-takes-all fashion. This was repeated until all sets were classified. The classification accuracy for each set was calculated as the number of correctly classified trials divided by the total number of trials in the set. We then calculated the mean accuracy and standard deviation over cross-validation iterations.
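A minimal Python sketch of this spatial template-matching procedure with stratified 10-fold cross-validation is given below. It follows the description above, but the names and implementation details are ours, not the authors' code.

```python
import numpy as np

def template_matching_cv(features, labels, n_folds=10, seed=0):
    """Correlation-based template matching with stratified n-fold CV.

    features: (n_trials, n_features), e.g. the time-averaged HFB power per
    electrode for spatial classification. Returns mean and SD of fold accuracy.
    """
    rng = np.random.default_rng(seed)
    labels = np.asarray(labels)
    classes = np.unique(labels)
    # Stratified folds: with 4 classes of 20 trials, 8 trials (2 per class) per fold.
    folds = [[] for _ in range(n_folds)]
    for c in classes:
        idx = rng.permutation(np.flatnonzero(labels == c))
        for f, chunk in enumerate(np.array_split(idx, n_folds)):
            folds[f].extend(chunk.tolist())
    accs = []
    for test_idx in folds:
        train = np.setdiff1d(np.arange(len(labels)), test_idx)
        # Class templates: per-electrode mean over the training trials of each class.
        templates = {c: features[train[labels[train] == c]].mean(axis=0)
                     for c in classes}
        correct = sum(
            labels[i] == max(templates, key=lambda c:
                             np.corrcoef(features[i], templates[c])[0, 1])
            for i in test_idx)  # winner-takes-all on correlation
        accs.append(correct / len(test_idx))
    return float(np.mean(accs)), float(np.std(accs))
```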
Second, to investigate whether including temporal information would lead to better classification scores, we repeated the classification procedure without averaging the power over the trial period, thereby including temporal information in the classification (i.e. spatio-temporal classification).
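Under the same assumptions, the only difference between the two procedures is whether the time axis is averaged out before matching; reusing `epoch_trials` and `template_matching_cv` from the sketches above:

```python
# epochs: (n_trials, n_electrodes, n_samples) from epoch_trials() above.
spatial_feats = epochs.mean(axis=2)                     # time discarded
spatiotemporal_feats = epochs.reshape(len(epochs), -1)  # electrode-by-time, flattened
acc_sp, sd_sp = template_matching_cv(spatial_feats, labels)
acc_st, sd_st = template_matching_cv(spatiotemporal_feats, labels)
```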
Anatomical localization of informative electrodes
To investigate where exactly the most informative electrodes for articulator and tongue movement classification were located, we used a random search procedure in which we randomly chose a subset of electrodes to classify from and repeated this 5000 times. Subsequently, for each electrode we calculated the average accuracy over all subsets that included that electrode. These scores were then z-scored to find the hotspots of the most informative electrodes. We investigated how the number of iterations influenced the final pattern of most informative electrodes by determining the pattern at each iteration and calculating its mean correlation with the patterns of all subsequent iterations, which shows how much the pattern changes with each extra iteration. With only a few iterations the result is highly variable, since only a few of the possible electrode combinations have been tested; with more iterations, the result differs less from that of the previous iteration, as more electrode combinations have been tested. The less the pattern changes, the more reliable the result. We found that 5000 iterations gave a reliable result (Supplementary Fig. S1).
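A sketch of this random-search procedure is shown below, with the simplifying assumption that the 10-fold routine above stands in for the leave-one-trial-out variant the authors used for this analysis (see the note at the end of the next subsection).

```python
import numpy as np

def electrode_informativeness(features, labels, n_iter=5000, seed=0):
    """Z-scored mean accuracy per electrode over random electrode subsets."""
    rng = np.random.default_rng(seed)
    n_el = features.shape[1]
    acc_sum, n_used = np.zeros(n_el), np.zeros(n_el)
    for _ in range(n_iter):
        subset = rng.choice(n_el, size=rng.integers(2, n_el + 1), replace=False)
        acc, _ = template_matching_cv(features[:, subset], labels)
        acc_sum[subset] += acc   # credit the accuracy to every electrode used
        n_used[subset] += 1
    mean_acc = acc_sum / np.maximum(n_used, 1)
    return (mean_acc - mean_acc.mean()) / mean_acc.std()  # z-scored hotspots
```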
Anatomical scale of informative areas
For an eventual BCI implantation, it is desirable to keep the size of the implanted grid as small as possible, which reduces surgical risk. We therefore wanted to assess not only where the most informative electrodes were located given the current grid sizes, but also what the classification accuracy would be for smaller grids. To this end, we used a searchlight approach in which we started with a small grid (a single electrode) and increased the grid size step by step, each time adding one extra row and column of electrodes (successively 1 × 1, 2 × 2, 3 × 3, 4 × 4, etc.), until the real grid size was reached. This was done to find the minimal area needed for accurate classification, given that electrodes were restricted to lie close together, as would be the case with smaller ECoG grids. Per grid size and location, we calculated the accuracy score (spatial classification only) and determined the maximum accuracy per grid size (best over all sampled locations) to assess the minimally required grid size for accurate classification. Note that for both the searchlight and random search analyses, we used algorithms based on leave-one-trial-out cross-validation, as these were designed to work with little data.
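The searchlight could be sketched as follows, assuming the feature columns are ordered row by row to match the physical grid; note that correlation-based matching needs at least two features, so this sketch starts at 2 × 2 rather than at a single electrode.

```python
def searchlight_accuracy(features, labels, grid_shape):
    """Maximum accuracy per square sub-grid size, over all grid positions.

    features: (n_trials, rows*cols); reuses template_matching_cv() above.
    Returns {size: best accuracy over all positions of a size x size window}.
    """
    rows, cols = grid_shape
    best = {}
    for size in range(2, min(rows, cols) + 1):
        scores = []
        for r in range(rows - size + 1):
            for c in range(cols - size + 1):
                # Electrode indices inside the size x size window at (r, c).
                idx = [(r + i) * cols + (c + j)
                       for i in range(size) for j in range(size)]
                acc, _ = template_matching_cv(features[:, idx], labels)
                scores.append(acc)
        best[size] = max(scores)
    return best
```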
Topographical organization of the articulators
While previous research has indicated overlap between the areas that are active during movements of different articulators35, the articulators are also known to show a topographical organization in the SMC25,35. To further investigate the topographical organization of the representation of the different articulator movements, we used a procedure described by Bruurmijn and colleagues36,37 to normalize the SMC of each subject into an isotropic coordinate system. We created a group-average activity pattern of the recorded SMC for each articulator movement and each tongue movement by plotting, for each electrode, the r²-value for each movement within the normalized coordinate system. In addition, we made a winner-takes-all plot indicating, for each electrode, which movement had the highest r²-value. Only electrodes with an r²-value of more than 0.2 in at least one of the conditions were used for this analysis.
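Per electrode, the winner-takes-all map reduces to an argmax over the movement r²-values with the 0.2 threshold; a minimal sketch:

```python
import numpy as np

def winner_takes_all(r2, threshold=0.2):
    """Index of the movement with the highest r^2 per electrode.

    r2: (n_movements, n_electrodes). Electrodes whose best r^2 does not
    exceed the threshold get NaN and are left out of the map.
    """
    winners = r2.argmax(axis=0).astype(float)
    winners[r2.max(axis=0) <= threshold] = np.nan
    return winners
```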
Ethical approval
All procedures performed in studies involving human participants were in accordance with the ethical standards of the institutional and/or national research committee and with the 1964 Helsinki declaration and its later amendments or comparable ethical standards.
Informed consent
Informed consent was obtained from all individual participants included in the study.
Results
In general, subjects did not report any difficulty performing the tasks, except for subject A, who reported that the Articulator Task was a little fast. We did not exclude any trials based on performance, however, since most movements had to be made with the mouth closed and performance could therefore not be accurately assessed.
During the first run, 60.94% (39/64), 80.47% (103/128), 87.50% (112/128), and 84.38% (108/128) of all electrodes of subjects A–D, respectively, showed a significant correlation with at least one of the conditions of the Articulator Task. Subjects A & B performed this task twice; for the second run, 71.88% (46/64) and 92.97% (119/128) of the electrodes were significant, respectively. For the Tongue Task, 53.12% (34/64), 68.75% (88/128), and 64.84% (83/128) of the electrodes of subjects A, B and D showed a significant correlation with the task (subject C did not perform this task). Subjects A & B also performed a run with the tongue outside the mouth, resulting in 51.56% (33/64) and 68.75% (88/128) of the electrodes showing a significant correlation with the task. Subject B performed a second run with the tongue inside the mouth, for which 64.06% (82/128) of the electrodes were significant.
Spatial classification accuracy
When classification was based on only spatial features, average classification accuracy (mean over the four subjects) for the Articulator Task was 92.19% (SD = 6.64, n = 4) for run 1 and 86.25% (SD = 1.77, n = 2) for run 2 (Fig. 1). For the Tongue Task the classification accuracy was 84.58% (SD = 6.41, n = 3) for run 1. Only one subject did a second run of this task and had a classification accuracy of 76.25%. For the Tongue Task with the tongue outside the mouth, the mean accuracy was 80.00% (SD = 19.45, n = 2). Classification was significant (p < 0.001) for all subjects, runs and tasks. See Supplementary Fig. S2 for the confusion matrices.
Spatio-temporal classification accuracy
When classification was based on spatio-temporal features, mean classification accuracy was 89.38% (SD = 6.50, n = 4) for run 1 and 87.50% (SD = 3.54, n = 2) for run 2 (Fig. 1) for the Articulator Task. For the Tongue Task the classification accuracy was 80.42% (SD = 9.21, n = 3) for run 1. The one subject who did a second run had a classification accuracy of 75.00%. For the Tongue Task with the tongue outside the mouth the mean accuracy was 81.25% (SD = 15.91, n = 2). Classification was significant (p < 0.001) for all subjects, runs and tasks. See Supplementary Fig. S3 for the confusion matrices.
Anatomical localization
Since most participants performed only one run of each task, the next results are based on the first run of each participant. For the participants who performed multiple runs, however, the results of the second run are similar to that of the first, as displayed in the Supplementary Material (Figs S4 & S5).
The most informative electrodes for classification of articulator movements were more spread out over the sensorimotor cortex than the electrodes that were most informative for classification of tongue movements, which were clustered in the central aspects of the vSMC (Fig. 2).
Results of the searchlight procedure show, for each cortical area sampling size, the maximum classification accuracy that can be obtained. For the Articulator Task, the cortical area necessary for a subject to reach more than 70% accuracy was on average 0.62 cm² (SD = 0.40, n = 4), and to reach more than 80%, on average 1.18 cm² (SD = 0.83, n = 4). For the Tongue Task, 0.87 cm² (SD = 0.31, n = 3) was needed to reach 70% accuracy and 1.05 cm² (SD = 0.21, n = 2) to reach 80% accuracy (Fig. 3; note that subject A did not reach 80% for tongue movement directions). These results suggest that the cortical foci (i.e. electrodes) that contain information about movements of different articulators or different tongue directions are relatively close together.
Visual inspection of the location of the most informative electrodes, as obtained in the searchlight procedure, shows that the cortical area that can be used for classification of different articulator movements is larger than the area for different tongue movement directions (Fig. 4). Some areas contribute to both articulator and tongue movement classification, which suggests that within the same small ECoG grid, classification of both different articulator movements and different tongue movement directions can be accurate.
Topographical organization of the articulators
For each articulator movement and tongue movement direction, we investigated where on the sensorimotor cortex the related activity was located (Fig. 5; for individual subject maps see Supplementary Fig. S6). For both the different articulator movements and the different tongue movement directions, we found much overlap. Yet, there seemed to be a topographical organization of articulator representation, with a ventral-to-dorsal representation of the jaw, larynx, tongue, lips and again larynx. For the different tongue movements, the topographical ordering was less clear (especially in S1), although within M1 the leftward movements seemed to be located somewhat posterior-ventral, whereas rightward movements were located more anterior-dorsal. The upward movements seemed to be somewhat more ventral and the downward movements somewhat more dorsal. Note, however, that these results are based only on the first run, since only a limited number of participants performed additional runs, and that not all sampled areas were covered by all subjects.
Discussion
In this study, we showed that with a very straightforward and robust classification procedure, movements of different articulators can be distinguished from each other using HD-ECoG signals. In addition, we found that different movement directions of the tongue can also be distinguished. Both articulator and tongue movements can be classified from a small patch of cortex (ca. 1 cm²). Accurate classification was accomplished using only spatial information (spatial classification). Adding time as a feature (spatio-temporal classification) did not lead to much higher classification accuracies. This may be explained by jitter in the onset times of movements within the same class and in the associated neural responses; correcting for variations in neural activity onset may potentially lead to even better classification than we currently obtained (see for instance7). Finally, visual inspection of the activity patterns for each articulator movement and tongue movement direction suggests a topographical ordering, most clearly for the movements of different articulators, with a ventral-to-dorsal ordering of the jaw, larynx, tongue, lips and a second larynx area.
Overall, the repeated runs yielded similar classification results, although the classification scores of the second run were generally somewhat lower, possibly due to fatigue and associated suboptimal task compliance. Since we were not able to correct for performance (as most movements were invisible from the outside), we cannot rule out that task performance may have caused differences in classification accuracy between runs or between subjects. For the articulator movements, subject A indicated sometimes having difficulty making the movements in time, which may partly explain why the scores were somewhat lower for that subject than for the others. Alternatively, differences in electrode coverage may be associated with variability in classification accuracy between subjects. Indeed, for the two subjects (A & B) who performed one run of tongue movements outside the mouth, there was no noticeable difference in performance and both participants were very accurate, yet classification accuracy was substantially lower in subject A. Further investigation into where these inter-subject differences originate is needed. Since there is a correspondence between ECoG broadband activity and fMRI BOLD responses38,39, and since the current results indeed correspond well with earlier fMRI results18, pre-surgical fMRI investigations may elucidate what factors influence inter-subject differences in classification results.
Electrodes that were most informative for distinguishing different articulators covered a larger area of the sensorimotor cortex and were less clustered than those most informative for the classification of tongue movements, which were somewhat clustered in the central aspects of the SMC (similar to40). Yet, a large cortical area is not required for high classification scores: accurate classification was obtained from areas of approximately 1 cm². This indicates that small patches of the sensorimotor cortex contain information about movements of different articulators or different movement directions of the tongue. The area that can be used for grid placement leading to accurate classification was more confined for tongue movements than for articulator movements. Importantly, however, our results suggest that classification of both different articulator movements and different tongue movement directions can be accurate from small cortical areas in similar locations.
The current results are in agreement with a previous fMRI study that showed that lip, tongue, jaw and larynx movements can be distinguished from each other based on sensorimotor cortex activity18, albeit with a different recording modality and less cortical coverage (ECoG being limited to the cortical surface). In addition, our finding that different tongue movement directions have distinguishable representations agrees with single-cell recordings in primates that have shown different neurons to be related to different movements of the tongue: stimulation of some neurons resulted in tongue protrusion, whereas stimulation of other neurons resulted in retraction41, and some neurons showed significantly different firing rates for tongue protrusion in different directions42. Here we extend those results by showing that different movement directions can be measured from the human SMC using surface electrode recordings of relatively large neuronal ensembles. These results also agree with previous investigations suggesting that, during speech, different positions of the same articulator (i.e. the tongue) lead to different SMC activity patterns19,20. For speech-BCI applications it is important to target the cortical areas that contain information about speech movements both between and within articulators. The current results may benefit the development of speech-BCIs by providing insight into where and how neural activity of different articulators and of different movements of the same articulator is represented in the SMC. Furthermore, in agreement with earlier fMRI studies43, we found that distinctive information about articulator movements and tongue movement directions can be extracted from both the left and the right hemisphere, suggesting that the representation of articulator movements is bilateral.
We found significant overlap in the location of neural activation associated with the movements of different articulators. Visual inspection of the group activity patterns, however, revealed a ventral-dorsal topographical ordering in the neural activity hotspots for the different articulator movements, with the jaw and larynx mostly ventral, the tongue superior to that, and the lips and a second larynx area mostly dorsal. Note, however, that these results are based on a group map in which not all areas are equally covered by all subjects and that they are open to interpretation. They are nevertheless similar to those of previous studies with respect to the notion of two larynx areas25,44 and the localization of the tongue in the middle of the vSMC25,35,44. Jaw-related activity, however, was found along the central sulcus with the highest activity mostly ventral, while some other studies found this to be more dorsal to the tongue area25,35. This discrepancy may be caused by methodological differences: in our study, people had to clench the jaw, while in the other studies the jaw was opened instead of closed. Stimulation results44 indeed support the notion of jaw activity along the central sulcus, also ventrally, similar to what we found. For the lips, we found a similar location with respect to the ventral-dorsal ordering as a previous study25, but in our study the activity hotspot was more in the precentral gyrus, whereas Bouchard et al.25 found it to be more in the postcentral gyrus. Again, task differences may have caused this discrepancy: we used lip protrusion, whereas Bouchard et al.25 used speech movements. In conclusion, the location of neural activity associated with non-speech articulator movements demonstrated in the current study corresponds well to that of speech movements shown in earlier work. Therefore, the current results may also be beneficial for speech-BCI applications that aim to target cortical areas containing information about different articulators.
For the different tongue movements, the topographical representation was less clear than for the articulators (especially in S1), and we found a large overlap in the neural patterns for different tongue movements. However, although the results may have been driven more by some subjects than by others due to differences in the sampled area (see also Supplementary Fig. S6), there seemed to be some spatial ordering of the representation of different tongue movement directions within M1. Future research with more participants may indicate whether a topographical ordering for different tongue directions indeed exists in the SMC.
The current results are of interest for the development of implantable BCIs in general and speech-BCIs in particular. First, a fundamental issue in the development of implantable BCIs is to define the areas that need to be targeted for the system to perform accurately and without confounds from responses to external stimuli (e.g. auditory). Our results provide insight into where information about different articulators and different tongue movements is most strongly represented and indicate that small electrode grids over the SMC (ca. 1 cm²) can yield high classification accuracy. These results are important since a smaller implant poses significantly less surgical risk and burden45. Obviously, with a smaller grid, precise placement becomes increasingly important. Preoperative fMRI may be used to define the optimal target location; this method has been used successfully for hand movements10 and visual responses46, although whether fMRI is a suitable technique to localize the target locations for a speech-BCI remains to be determined. In addition, since the hotspots of articulator-movement neural activity can be relatively small (ca. 1 cm), we would like to stress the importance of HD electrode grids for classification of articulator movements. Clinical electrode grids typically have an inter-electrode distance of 1 cm, which leads to significant spatial under-sampling that most likely hampers the ability to distinguish different articulator movements from one another. Indeed, the importance of HD grids has also been pointed out in previous speech-BCI studies14,15. Second, our results suggest that multiple degrees of freedom can be extracted from a small cortical area, which might benefit current BCI systems. A previously described home-based BCI system by Vansteensel and colleagues10, for instance, contrasts SMC activity during attempted hand movement against rest and has been shown to be very reliable. However, this system uses a simple control signal (1 degree of freedom), which limits the speed of communication (2–3 letters per minute). Here we showed that such a system may be extended with more degrees of freedom by using vSMC activity related to articulator and tongue movements, which may potentially increase the control speed of such BCIs. Finally, a recent study reported encouraging decoding of speech from perisylvian areas including premotor, SMC and temporal regions17. The current study focused on the SMC (to exclude contributions of auditory processing of one's own and others' speech) and on discrimination of discrete movements. The complementary findings encourage further investigation into region-specific, high-density ECoG grids combined with advanced decoding approaches17,47 for restoring communication in LIS.
It is important to note that replication of these experiments with attempted movements is needed, since the current results might be influenced by, for instance, sensory feedback, which is not present for attempted movements. Earlier studies on this topic with hand movements, however, showed that neural signal changes associated with attempted movements indeed correspond well to those of actual movements, despite the absence of sensory feedback36,48. This suggests that the current results bear relevance for BCI applications where no sensory feedback is available. Finally, it is important to test how the current results translate to real-time applications in which subjects receive feedback on the classification performance, as it has been suggested that neural feedback may improve classification accuracies49,50.
Conclusion
We demonstrate here that with high-density subdural ECoG recordings from the sensorimotor cortex surface, movements of different articulators can be distinguished from each other based on neural activity. In addition, we showed that different movements of one articulator (the tongue) can be classified with high accuracy. Both between- and within-articulator movements can be classified from a very small cortical area (ca. 1 cm²). These findings are of relevance for the development of speech-BCIs that aim to provide people with severe paralysis with a muscle-independent tool for communication.
Acknowledgements
The authors thank the participants for their contribution. Also, they would like to thank the staff of the clinical neurophysiology department and the neurosurgeons. This study was funded by the European Union (ERC-Advanced ‘iConnect’ grant 320708).
Author Contributions
E.S. collected and analyzed the data and produced the main manuscript text and figures. Z.F., M.V. and N.R. supervised the analyses and co-wrote the manuscript. M.B. prepared the analyses for Fig. 5 and E.A. co-collected the data. All authors reviewed the manuscript.
Data Availability
Data can be made available upon request through a data sharing contract.
Competing Interests
The authors declare no competing interests.
Footnotes
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Supplementary information accompanies this paper at 10.1038/s41598-019-50834-5.
References
- 1. Levelt, W. J. M. Speaking: From Intention to Articulation (MIT Press, 1993).
- 2. Meister IG, Wilson SM, Deblieck C, Wu AD, Iacoboni M. The Essential Role of Premotor Cortex in Speech Perception. Curr. Biol. 2007;17:1692–1696. doi: 10.1016/j.cub.2007.08.064.
- 3. Guenther, F. H. & Hickok, G. Chapter 9 - Role of the auditory system in speech production. In Handbook of Clinical Neurology (eds Aminoff, M. J., Boller, F. & Swaab, D. F.) 129, 161–175 (Elsevier, 2015).
- 4. American Congress of Rehabilitation Medicine. Recommendations for use of uniform nomenclature pertinent to patients with severe alterations in consciousness. Arch. Phys. Med. Rehabil. 1995;76:205–209. doi: 10.1016/S0003-9993(95)80031-X.
- 5. Posner, J. B., Plum, F., Saper, C. B. & Schiff, N. Plum and Posner's Diagnosis of Stupor and Coma (Oxford University Press, USA, 2007).
- 6. Smith E, Delargy M. Locked-in syndrome. BMJ. 2005;330:406–409. doi: 10.1136/bmj.330.7488.406.
- 7. Branco MP, et al. Decoding hand gestures from primary somatosensory cortex using high-density ECoG. NeuroImage. 2017;147:130–142. doi: 10.1016/j.neuroimage.2016.12.004.
- 8. Hochberg LR, et al. Reach and grasp by people with tetraplegia using a neurally controlled robotic arm. Nature. 2012;485:372–375. doi: 10.1038/nature11076.
- 9. Collinger JL, et al. High-performance neuroprosthetic control by an individual with tetraplegia. The Lancet. 2013;381:557–564. doi: 10.1016/S0140-6736(12)61816-9.
- 10. Vansteensel MJ, et al. Fully Implanted Brain–Computer Interface in a Locked-In Patient with ALS. N. Engl. J. Med. 2016;375:2060–2066. doi: 10.1056/NEJMoa1608085.
- 11. Kellis S, et al. Decoding spoken words using local field potentials recorded from the cortical surface. J. Neural Eng. 2010;7:056007. doi: 10.1088/1741-2560/7/5/056007.
- 12. Brumberg JS, Wright EJ, Andreasen DS, Guenther FH, Kennedy PR. Classification of Intended Phoneme Production from Chronic Intracortical Microelectrode Recordings in Speech-Motor Cortex. Front. Neurosci. 2011;5:65. doi: 10.3389/fnins.2011.00065.
- 13. Mugler EM, et al. Direct classification of all American English phonemes using signals from functional speech motor cortex. J. Neural Eng. 2014;11:035015. doi: 10.1088/1741-2560/11/3/035015.
- 14. Herff C, et al. Brain-to-text: decoding spoken phrases from phone representations in the brain. Front. Neurosci. 2015;9:217. doi: 10.3389/fnins.2015.00217.
- 15. Ramsey NF, et al. Decoding spoken phonemes from sensorimotor cortex with high-density ECoG grids. NeuroImage. 2018;180:301–311. doi: 10.1016/j.neuroimage.2017.10.011.
- 16. Pei X, Barbour DL, Leuthardt EC, Schalk G. Decoding vowels and consonants in spoken and imagined words using electrocorticographic signals in humans. J. Neural Eng. 2011;8:046028. doi: 10.1088/1741-2560/8/4/046028.
- 17. Anumanchipalli GK, Chartier J, Chang EF. Speech synthesis from neural decoding of spoken sentences. Nature. 2019;568:493–498. doi: 10.1038/s41586-019-1119-1.
- 18. Bleichner MG, et al. Classification of mouth movements using 7 T fMRI. J. Neural Eng. 2015;12:066026. doi: 10.1088/1741-2560/12/6/066026.
- 19. Salari E, Freudenburg ZV, Vansteensel MJ, Ramsey NF. The influence of prior pronunciations on sensorimotor cortex activity patterns during vowel production. J. Neural Eng. 2018;15:066025. doi: 10.1088/1741-2552/aae329.
- 20. Chartier J, Anumanchipalli GK, Johnson K, Chang EF. Encoding of Articulatory Kinematic Trajectories in Human Speech Sensorimotor Cortex. Neuron. 2018;98:1042–1054.e4. doi: 10.1016/j.neuron.2018.04.031.
- 21. Booij, G. The Phonology of Dutch (Clarendon Press, 1999).
- 22. Rietveld, A. & van Heuven, V. Algemene Fonetiek (Coutinho, 2001).
- 23. Wang, J., Green, J. R. & Samal, A. Individual articulator's contribution to phoneme production. In 2013 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 7785–7789 (2013). doi: 10.1109/ICASSP.2013.6639179.
- 24. Crone NE, Miglioretti DL, Gordon B, Lesser RP. Functional mapping of human sensorimotor cortex with electrocorticographic spectral analysis. II. Event-related synchronization in the gamma band. Brain. 1998;121:2301–2315. doi: 10.1093/brain/121.12.2301.
- 25. Bouchard KE, Mesgarani N, Johnson K, Chang EF. Functional organization of human sensorimotor cortex for speech articulation. Nature. 2013;495:327–332. doi: 10.1038/nature11911.
- 26. Manning JR, Jacobs J, Fried I, Kahana MJ. Broadband shifts in LFP power spectra are correlated with single-neuron spiking in humans. J. Neurosci. 2009;29:13613. doi: 10.1523/JNEUROSCI.2041-09.2009.
- 27. Miller KJ, Sorensen LB, Ojemann JG, den Nijs M. Power-Law Scaling in the Brain Surface Electric Potential. PLOS Comput. Biol. 2009;5:e1000609. doi: 10.1371/journal.pcbi.1000609.
- 28. Ray S, Maunsell JHR. Different Origins of Gamma Rhythm and High-Gamma Activity in Macaque Visual Cortex. PLOS Biol. 2011;9:e1000610. doi: 10.1371/journal.pbio.1000610.
- 29. Hermes D, Miller KJ, Noordmans HJ, Vansteensel MJ, Ramsey NF. Automated electrocorticographic electrode localization on individually rendered brain surfaces. J. Neurosci. Methods. 2010;185:293–298. doi: 10.1016/j.jneumeth.2009.10.005.
- 30. Branco MP, et al. ALICE: A tool for automatic localization of intra-cranial electrodes for clinical and high-density grids. J. Neurosci. Methods. 2018;301:43–51. doi: 10.1016/j.jneumeth.2017.10.022.
- 31. Branco MP, Leibbrand M, Vansteensel MJ, Freudenburg ZV, Ramsey NF. GridLoc: An automatic and unsupervised localization method for high-density ECoG grids. NeuroImage. 2018;179:225–234. doi: 10.1016/j.neuroimage.2018.06.050.
- 32. Salari E, Freudenburg ZV, Vansteensel MJ, Ramsey NF. Spatial-Temporal Dynamics of the Sensorimotor Cortex: Sustained and Transient Activity. IEEE Trans. Neural Syst. Rehabil. Eng. 2018;26:1084–1092. doi: 10.1109/TNSRE.2018.2821058.
- 33. Bruns A. Fourier-, Hilbert- and wavelet-based signal analysis: are they really different approaches? J. Neurosci. Methods. 2004;137:321–332. doi: 10.1016/j.jneumeth.2004.03.002.
- 34. Branco MP, Freudenburg ZV, Aarnoutse EJ, Vansteensel MJ, Ramsey NF. Optimization of sampling rate and smoothing improves classification of high frequency power in electrocorticographic brain signals. Biomed. Phys. Eng. Express. 2018;4:045012. doi: 10.1088/2057-1976/aac3ac.
- 35. Grabski K, et al. Functional MRI assessment of orofacial articulators: Neural correlates of lip, jaw, larynx, and tongue movements. Hum. Brain Mapp. 2012;33:2306–2321. doi: 10.1002/hbm.21363.
- 36. Bruurmijn MLCM, Pereboom IPL, Vansteensel MJ, Raemaekers MAH, Ramsey NF. Preservation of hand movement representation in the sensorimotor areas of amputees. Brain. 2017;140:3166–3178. doi: 10.1093/brain/awx274.
- 37. Bruurmijn, M. L. C. M., Schellekens, W., Raemaekers, M. A. H. & Ramsey, N. F. A Novel 2d Standard Cartesian Representation for the Human Sensorimotor Cortex. Neuroinformatics (in press).
- 38. Hermes D, et al. Neurophysiologic correlates of fMRI in human motor cortex. Hum. Brain Mapp. 2012;33:1689–1699. doi: 10.1002/hbm.21314.
- 39. Siero JC, et al. BOLD matches neuronal activity at the mm scale: A combined 7 T fMRI and ECoG study in human sensorimotor cortex. NeuroImage. 2014;101:177–184. doi: 10.1016/j.neuroimage.2014.07.002.
- 40. Schalk G, et al. Real-time detection of event-related brain activity. NeuroImage. 2008;43:245–249. doi: 10.1016/j.neuroimage.2008.07.037.
- 41. Yao D, et al. Neuronal Activity Patterns in Primate Primary Motor Cortex Related to Trained or Semiautomatic Jaw and Tongue Movements. J. Neurophysiol. 2002;87:2531–2541. doi: 10.1152/jn.2002.87.5.2531.
- 42. Murray GM, Sessle BJ. Functional properties of single neurons in the face primary motor cortex of the primate. III. Relations with different directions of trained tongue protrusion. J. Neurophysiol. 1992;67:775–785. doi: 10.1152/jn.1992.67.3.775.
- 43. Bunton K. Speech versus Nonspeech: Different Tasks, Different Neural Organization. Semin. Speech Lang. 2008;29:267–275. doi: 10.1055/s-0028-1103390.
- 44. Penfield W, Boldrey E. Somatic motor and sensory representation in the cerebral cortex of man as studied by electrical stimulation. Brain. 1937;60:389–443. doi: 10.1093/brain/60.4.389.
- 45. Wong CH, et al. Risk factors for complications during intracranial electrode recording in presurgical evaluation of drug resistant partial epilepsy. Acta Neurochir. (Wien). 2009;151:37. doi: 10.1007/s00701-008-0171-7.
- 46. Zhang D, et al. Toward a minimally invasive brain–computer interface using a single subdural channel: A visual speller study. NeuroImage. 2013;71:30–41. doi: 10.1016/j.neuroimage.2012.12.069.
- 47. Angrick M, et al. Speech synthesis from ECoG using densely connected 3D convolutional neural networks. J. Neural Eng. 2019;16:036019. doi: 10.1088/1741-2552/ab0c59.
- 48. Blokland Y, et al. Detection of event-related desynchronization during attempted and imagined movements in tetraplegics for brain switch control. In 2012 Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) 3967–3969 (2012). doi: 10.1109/EMBC.2012.6346835.
- 49. Wolpaw JR, Birbaumer N, McFarland DJ, Pfurtscheller G, Vaughan TM. Brain–computer interfaces for communication and control. Clin. Neurophysiol. 2002;113:767–791. doi: 10.1016/S1388-2457(02)00057-3.
- 50. Taylor DM, Tillery SIH, Schwartz AB. Direct Cortical Control of 3D Neuroprosthetic Devices. Science. 2002;296:1829–1832. doi: 10.1126/science.1070291.