Abstract
Brain-computer interfaces (BCIs) based on motor imagery (MI) utilizing multi-channel electroencephalogram (EEG) data are commonly used to improve motor function of people with motor disabilities. EEG channel selection can enhance MI classification accuracy by selecting informative channels, accordingly reducing redundant information. The sequential backward floating search (SBFS) approach has been considered as one of the best feature selection methods. In this paper, SBFS is first implemented to select the optimal EEG channels in MI-BCI. Further, to reduce the time complexity of SBFS, the modified SBFS is proposed and applied to left and right hand MI tasks. In the modified SBFS, based on the map of EEG channels at the scalp, the symmetrical channels are selected as channel pairs and acceleration is thus realized by removing or adding multiple channels in each iteration. Extensive experiments were conducted on four public BCI datasets. Experimental results show that the SBFS achieves significantly higher classification accuracy (p < 0.001) than using all channels and conventional MI channels (i.e., C3, C4, and Cz). Moreover, the proposed method outperforms the state-of-the-art selection methods.
Keywords: electroencephalogram (EEG), channel selection, sequential backward floating search (SBFS), motor imagery (MI), brain-computer interface (BCI)
1. Introduction
Brain-computer interface (BCI) refers to a complete system that processes signals from human brain to control different communication devices (Gao et al., 2021). With the advantages of non-invasiveness, portability, low cost, and high temporal resolution, electroencephalogram (EEG) is widely used in BCI systems (Padfield et al., 2019). Potential signals that are commonly used in EEG-based BCI system mainly include P300 evoked potentials (Picton, 1992; Li et al., 2010), steady state visually evoked potentials (SSVEP) (Wang et al., 2008; Zhang et al., 2018), and event-related desynchronization/synchronization (ERD/ERS) (Pfurtscheller and Da Silva, 1999; Pfurtscheller and Neuper, 2006).
Compared with stimuli-based BCI, the potential signals for motor imagery (MI) (Ang and Guan, 2016; Yang et al., 2020) can be easier acquired without external stimulus. MI tasks can bring out cortical rhythm amplitude suppression (ERD) and enhancement (ERS) over primary sensorimotor areas (Taniguchi et al., 2000; Neuper et al., 2005). According to ERD/ERS phenomenon, the corresponding imagery category can be determined. Therefore, it is of great significance to select brain area with active neural activities as signal sources to improve the quality of EEG signals. Excessive channels not only deteriorate the portability of BCI system, but also increase the difficulty of signal analysis (Handiru and Prasad, 2016). Selecting appropriate EEG channels for different subjects can improve the performance of MI-based BCI system.
According to the prior knowledge of neurology, C3, C4, and Cz electrodes and their surrounding channels contain the most information related to MI, for which these specific channels are commonly selected. Although the experience-dependent artificial channel selection is easy for preparation and implementation, it could be not sufficient for each subject. The popular channel selection schemes (Alotaiby et al., 2015) for MI can be mainly divided as embedded techniques, filtering techniques (Baig et al., 2020), wrapper techniques, etc. Embedded techniques integrate the channel selection processes with the model training process, such as recursive channel estimation with the training of support vector machine (SVM) (Lal et al., 2004; Schröder et al., 2005). Filtering techniques are usually based on EEG signal statistics such as common spatial pattern (CSP) filter coefficients (Tam et al., 2011) and specific criteria such as mutual information (Ang et al., 2012). Wrapper techniques typically adopt wrapper approachs with complete (Kamrunnahar et al., 2009), random (Wei and Wang, 2011) or sequential (Qiu et al., 2016) search strategies for subset channel selection (Arvaneh et al., 2010). In addition, neural network genetic method (Yang et al., 2012) and bispectrum-based method (Jin et al., 2020) were investigated for EEG channel selection. Recently, neurophysiological approaches based on correlation (Jin et al., 2019) and Granger causality (Varsehi and Firoozabadi, 2021) have also been used in MI channel selection. However, the EEG channel selection methods of existing studies have either shown unsatisfactory performance or can only be used for specific types of data (Varsehi and Firoozabadi, 2021).
Sequential backward floating search (SBFS) is a well-known feature selection method which has been used to process various physiological signals (Tork et al., 2013; Karnaukh et al., 2018; Ahirwal, 2021) and to perform body state assessments (Dreißig et al., 2020). In this paper, SBFS is utilized in EEG channel selection for MI-based BCI. The main contributions of this paper are as follows:
1) To the best of our knowledge, this is the first time SBFS has been utilized for EEG channel selection.
2) The modified SBFS was proposed and applied to left and right hand MI tasks to reduce the time complexity of SBFS.
3) Extensive experiments were conducted on four datasets to confirm the effectiveness of the proposed method.
The remainder of this paper is detailed as follows. Section 2 describes the data used in this paper and the proposed methods. Section 3 presents the results. The discussion is provided in Section 4, and finally we conclude the paper in Section 5.
2. Materials and methods
2.1. Datasets
In this work, four common public datasets were used to evaluate the proposed methods. All EEG data were collected from the subjects' brain through acquisition equipments rather than artificially generated.
1) BCI Competition IV-dataset 1: This dataset recorded 59 channels of EEG signals from 7 healthy subjects (Tangermann et al., 2012). We only used the data collected from subject a, b, f, and g, since the other data were artificially generated. Each subject participated in two classes (from the three classes left hand, right hand, and foot) of MI tasks. Each data included two runs, where each run contained 100 trials. In these two runs, arrows pointing left, right or down were displayed on the screen for visual cues. Cues were shown for a period of 4 s, during which the subjects were asked to perform the MI task. After and before the task, there were 2 s of blank and 2 s of display with a fixation cross presented in the center of the screen. Namely, the fixation cross was superimposed on the cues for 6 s. Each trial for the EEG data acquisition is illustrated in Figure 1A. The EEG signals were downsampled to 100 Hz.
Figure 1.

Time interval of one trial. (A–D) is for dataset 1), 2), 3) and 4), respectively.
2) BCI Competition III-dataset IIIa: The dataset was recorded from 3 subjects (k3, k6, and l1) in 60 channels with a sampling rate of 250 Hz (Blankertz et al., 2006). The subjects performed imagery left hand, right hand, foot or tongue movements according to a cue of random order. When a trial began, the first 2 s were quiet black-screen. Then an acoustic stimulus and a cross “+” were presented at t = 2 s. From t = 3 s an arrow pointing to left, right, up or down was shown for 1 s. In the meantime, the subjects imagined the movement corresponding to the arrow until t = 7 s. Each trial for the EEG data acquisition is shown in Figure 1B. The number of trials per class was 90 or 60 for different subjects. We only use the left and right hand MI trials in this study.
3) BCI Competition III-dataset IVa: The dataset was recorded from 5 healthy subjects (aa, al, av, aw, and ay) (Blankertz et al., 2006). The subjects performed one of the left hand, right hand and right foot MI within 3.5 s of the occurrence of the visual cues. Target cues were presented at random intervals (1.75–2.25 s), during which subjects could relax. Each trial for the EEG data acquisition is presented in Figure 1C. Each subject participated in 280 trials. The EEG signals were recorded with 118 channels and were downsampled at 100 Hz.
4) BCI competition IV-dataset 2a: The dataset recorded the EEG data of 9 subjects (A01–A09) who participated in the 4-class (left hand, right hand, both feet, and tongue) MI experiments (Tangermann et al., 2012). Raw data were collected at 22 channels and 250 Hz sampling rate. Each subject's data were recorded in 2 sessions, each session consisted of 6 runs, and each run contained 48 trials. i.e., each session was composed of 288 trials in 4 classes, and each class contained 72 trials. We only classified the trials of left hand and right hand in Session 1. The timeline for a trial is about 7.5 s, as detailed in Figure 1D. At the beginning (t = 0 s), a cross “+” appeared on the black screen. After 2 s (t = 2 s), an arrow pointing either to the left, right, down or up appeared and stayed on the screen for 1.25 s. The subjects performed the desired MI tasks until t = 6 s. After a short break, the screen went black again.
2.2. Data preprocessing
The acquired EEG data were refined in the preliminary analysis prior to channel selection, feature extraction, and classification. A portion of Figure 2 shows the preprocessing procedure.
Figure 2.

Block diagram of the proposed framework.
1) Filtering: A third-order butterworth filter was applied to raw EEG data in the filtering part. The EEG data from each trial were filtered between 8 and 30 Hz.
2) Segmentation: The filtered EEG data were segmented by extracting data segments related to event types. MI events mainly consist of two intervals: MI and other states (rest or black screen). For dataset 1), 2), 3) and 4), we used fixed time windows of 2–6 s, 3–6 s, 0–3 s, and 3–6 s, respectively. More details exhibit in Figure 1.
2.3. Channel selection
2.3.1. SBFS-based channel selection
The purpose of channel selection is to identify important channels and remove redundant and irrelevant channels. The SBFS starts with a complete set, which is based on a top-down approach (Pudil et al., 1994). We investigated the SBFS method for EEG channel selection in MI classification. In this study, Y denotes the entire channel set. Xk denotes the subset of channels containing k channels. J(Xk) denotes the classification performance of a subset Xk. The SBFS algorithm for channel selection is given in Algorithm 1.
Algorithm 1.
EEG channel selection using SBFS.
| Input: the set of all channels, Y = {y1, y2, ..., yd} |
| • The SBFS algorithm takes the entire channel as input. Output: Xk = {xj|j = 1, 2, ..., k; xj ∈ Y}, where k = (0, 1, 2, ..., d) |
| • SBFS returns a subset of channels; the number of selected channels k, where k < d. Initialization: Xk = Y, k = d |
| • We initialize the algorithm with the given channel set such that k = d. Step 1(Exclusion): |
| k = k − 1 |
| Go to Step 2 |
| • In step 1, we remove a channel x− from the channel subset Xk. |
| • x− is the channel that maximizes our criterion function upon removal, that is, the channel which is associated with the optimal classification performance if it is removed from Xk. Step 2(Conditional Inclusion): |
| if J(Xk + x) > J(Xk+1): |
| k = k + 1 |
| Go to Step 1 |
| • In step 2, we search for channels that would improve the classification performance when added back to the channel subset. If such channels exist, we add the channel x+ that maximizes the performance improvement. If k = n or an improvement cannot be made (i.e., such channel x+ cannot be found), go back to step 1; else, repeat the current step. Termination: k = 2 |
| • The channel subset of size k contains the desired number of channels 2. |
The advantage of applying the update strategy to SBFS in EEG channel selection is the possibility to increase the value of optimal accuracy or decrease the number of channels of optimal accuracy. It is described as follows: the SBFS algorithm pursues the maximum accuracy under the current number of channels, and the intermediate (Inclusion) process of the later channel selection may result in the increase of the accuracy of the previous number of channels. Our update strategy is to replace with the maximum accuracy each time.
2.3.2. Reducing time complexity: Modified SBFS
Since SBFS is a search method, it makes sense to speed up the search process without compromising accuracy. Considering that mu (8–13 Hz bands) and beta (14–30 Hz bands) ERD/ERS phenomenon are elicited during imagined hand movements (Ramoser et al., 2000), depending on the location of the channels in the cerebral cortex, left-right symmetrical channels can be treated as a channel pair. As is shown in Figure 3, channels of the same color are considered as a channel pair. For example, red C3 and C4, blue CP1 and CP2, and green FC1 and FC2 are channel pairs, respectively. Both of them are left-right symmetrical with respect to the straight lines of CZ and CPZ. Thus, the whole set contains fewer channel pairs and the modified SBFS can remove or add multiple channels at a time. The main differences between SBFS and modified SBFS methods are shown in Figure 4. One can observe that the modified SBFS contains far fewer channel pairs than before. The time spent on searching process can be greatly reduced.
Figure 3.

Location of EEG electrodes used for data acquisition, taking BCI competition IV-dataset 1 as an example. Channels of the same color are treated as a channel pair for selection.
Figure 4.
Search chart of the SBFS and the modified SBFS.
2.4. Feature extraction
CSP (Ramoser et al., 2000) is an efficient feature extraction algorithm for binary classification tasks, which has been extensively used in MI-based BCI (Dong et al., 2017; Chen B. et al., 2018). It finds a spatial filter to maximize the differences in variance between two classes of multi-channel EEG data. Let Ca and Cb be the normalized covariance matrices of the two classes of EEG signals which are averaged over trials. The composite spatial covariance matrix is Cc = Ca + Cb. Decomposing Cc, we can obtain , where λ is the eigenvalue and U is the eigenvector. And the whitening transformation is
| (1) |
Then the covariance matrices Ca and Cb can be transformed into:
| (2) |
| (3) |
It can be seen that Sa and Sb have the same eigenvector. Then we have Sa + Sb = I. Any orthonormal matrix V satisfies the following
| (4) |
Using the orthonormal matrix V, Sa and Sb can be decomposed as follows:
| (5) |
| (6) |
and Λa + Λb = I. The projection matrix is
| (7) |
W is called the CSP weight matrix. The optimal features can be obtained in the least square case. Finally, the vector of the features is expressed as:
| (8) |
where, x(t) is EEG data.
2.5. Classification
SVM theory was proposed by Vapnik (1999). The core idea of SVM is to separate the data from the two classes by finding a hyperplane with the largest possible margin. As one of the most commonly used BCI-based MI classifiers, SVM (Subasi and Gursoy, 2010; Qin et al., 2019) can effectively solve the classification problem of two classes of EEG data. In this study, we used an SVM with a radial basis function kernel to classify MI tasks after feature extraction. The separation of training and test data is realized by using 10-fold cross validation in the classification part.
2.6. Framework overview
Firstly, the raw MI EEG data of each subject were preprocessed. The SBFS and the modified SBFS method were applied to the training data to obtain the selected channels. The CSP spatial domain filter was applied for training data to acquire weight matrices. Finally, the SVM classifier was trained and the classification performance with 10-fold cross validation was obtained. The block diagram of our proposed framework is shown in Figure 2.
3. Results
3.1. Classification accuracy and significance
The classification accuracies of all subjects from datasets 1), 2), 3) using different methods are shown in Table 1. The classification accuracies of 14 subjects which participated left and right hand MI using different methods are shown in Table 2. The optimal classification accuracy for each subject and mean are in bold. The last row of Tables 1, 2 gives the paired t-test results of the SBFS or the modified SBFS with the current column method. The C3C4Cz method indicates that only EEG data from these 3 channels are used in the classification of MI. From Table 1, for each single subject, the highest classification accuracy was obtained with SBFS. In particular, subjects aw and ay achieved 100% classification accuracy. Compared with all channels, the average performance improvement of the SBFS method in datasets 1), 2), 3) was 18.8, 31.1, and 27.9%, respectively. Meanwhile, the SBFS method improved by 18.3, 31.3, and 29.7%, respectively, compared with the C3C4Cz method. From Table 2, the average accuracy of the SBFS and the modified SBFS is improved by 21.4 and 20.4%, respectively. It is shown that the accuracy of the SBFS is significantly better than all channels and conventional MI channels (p < 0.0001). There is no significant difference (p = 0.1522) between the SBFS and the modified SBFS.
Table 1.
Comparison of classification accuracy (%) with different methods on datasets 1), 2), 3).
| Subject | Methods | ||
|---|---|---|---|
| All channels | C3C4Cz | SBFS | |
| a | 52.0 | 43.5 | 74.5 |
| b | 44.0 | 51.0 | 64.5 |
| f | 49.0 | 46.5 | 63.5 |
| g | 50.0 | 63.0 | 68.0 |
| Mean ± std | 48.8 ± 3.4 | 49.3 ± 5.5 | 67.6 ± 5.0 |
| k3 | 45.6 | 53.3 | 78.9 |
| k6 | 46.7 | 41.7 | 76.7 |
| l1 | 50.0 | 46.7 | 80.0 |
| Mean ± std | 47.4 ± 2.3 | 47.2 ± 5.8 | 78.5 ± 1.7 |
| aa | 62.1 | 58.8 | 84.4 |
| al | 62.3 | 74.1 | 95.0 |
| av | 47.5 | 60.0 | 75.0 |
| aw | 75.6 | 68.0 | 100 |
| ay | 67.5 | 45.0 | 100 |
| Mean ± std | 63.0 ± 10.3 | 61.2 ± 11.0 | 90.9 ± 10.9 |
| P-value | <0.0001 | <0.0001 | − |
“−” Denotes the missing values. The optimal classification accuracies in each row are in bold.
Table 2.
Comparison of classification accuracy (%) with different methods on 14 subjects which participate in left and right hand MI tasks.
| Subject | Methods | ||
|---|---|---|---|
| All channels | SBFS | Modified SBFS | |
| b | 44.0 | 64.5 | 62.5 |
| g | 50.0 | 68.0 | 69.0 |
| k3 | 45.6 | 78.9 | 74.4 |
| k6 | 46.7 | 76.7 | 71.7 |
| l1 | 50.0 | 80.0 | 81.7 |
| A01 | 45.7 | 70.7 | 67.9 |
| A02 | 51.4 | 66.4 | 65.7 |
| A03 | 55.0 | 77.9 | 79.3 |
| A04 | 46.4 | 62.9 | 63.6 |
| A05 | 51.4 | 65.0 | 67.9 |
| A06 | 49.3 | 63.6 | 61.4 |
| A07 | 57.9 | 69.3 | 65.6 |
| A08 | 60.7 | 87.1 | 87.1 |
| A09 | 48.6 | 71.4 | 70.7 |
| Mean ± std | 50.2 ± 4.8 | 71.6 ± 7.4 | 70.6 ± 7.6 |
| P-value | <0.0001 | 0.1522 | − |
“−” Denotes the missing values. The optimal classification accuracies in each row are in bold.
3.2. Number of selected channels
The number of selected channels with optimal classification accuracy is shown in Table 3. From Table 3, the number of channels selected by the SBFS shows a substantial decrease compared to all channels. In terms of averages, this is specifically shown as 23 vs. 59, 12 vs. 60, 25 vs. 118, and 10 vs. 22. Overall, the number of selected channels ranges from one-fifth to one-half of the total number of usable channels. The number of channels selected by the modified SBFS method is similar to that of the SBFS.
Table 3.
Comparison of the number of channels selected at the highest accuracy.
| Subject | Methods | ||
|---|---|---|---|
| All channels | SBFS | Modified SBFS | |
| a | 59 | 10 | − |
| b | 59 | 45 | 14 |
| f | 59 | 4 | − |
| g | 59 | 33 | 17 |
| Average | 59 | 23 | − |
| k3 | 60 | 10 | 14 |
| k6 | 60 | 19 | 20 |
| l1 | 60 | 6 | 20 |
| Average | 60 | 12* | 18 |
| aa | 118 | 40 | − |
| al | 118 | 31 | − |
| av | 118 | 37 | − |
| aw | 118 | 6 | − |
| ay | 118 | 13 | − |
| Average | 118 | 25* | − |
| A01 | 22 | 7 | 9 |
| A02 | 22 | 4 | 8 |
| A03 | 22 | 13 | 17 |
| A04 | 22 | 3 | 14 |
| A05 | 22 | 7 | 13 |
| A06 | 22 | 14 | 6 |
| A07 | 22 | 19 | 19 |
| A08 | 22 | 8 | 10 |
| A09 | 22 | 15 | 9 |
| Average | 22 | 10 | 12* |
Denotes that the original number is not an integer.
“−” Denotes the missing values.
3.3. Computation time
In order to compare the computation time between the SBFS method and the modified SBFS method, the results of 14 subjects who participated in a left and right hand MI task from datasets 1), 2), 4) were used. The two algorithms were implemented and tested using MATLAB 20191 configured on Windows 10 professional operating system and the experiments were performed on an Intel (R) Core (TM) i5-8265U CPU @ 1.60GHz, 8.00 GB RAM computer. As can be seen from Figure 5, the modified SBFS method for channel selection is faster than the SBFS method. It is precisely because more than one channel were added or deleted each time that the number of iterations was reduced, which greatly saved the time.
Figure 5.

Computation time comparisons between SBFS and modified SBFS.
The computation time of the method is affected by several factors, such as the number of all channels, the number of trials, the length of trials, software and hardware configuration etc. For example, using a parallel for loop (parfor) in Matlab to speed up the algorithm, the iterations of the parfor loop can run in parallel on multiple cores of the target hardware (our computer has 4 cores), and speed of the algorithm obtained by testing is shown to be about 4 times faster. During the initial sessions (also called the training sessions) in BCI experiments, the desired parameters are adjusted offline according to the signals collected from different subjects. Thus, the running speed of the channel selection part is not a concern, even for the SBFS method. In short, users have the flexibility to choose the SBFS or the modified SBFS methods according to specific practical situations.
3.4. Comparison with other selection methods
We compared SBFS-based EEG channel selection method with other algorithms in this field. For fairness of comparison, the data preprocessing, feature extraction and classifier were used identically.
CSP-rank (Tam et al., 2011) is a channel selection method in MI-based BCI using CSP. The method is based on the sorting of CSP filters. To be specific, we first rank the absolute values of the filter coefficients in each filter respectively, and then take the electrodes with the next largest coefficients from the two spatial filters in turn.
Improved sequential floating forward selection (ISFFS) (Qiu et al., 2016) combines the practical distribution of channels and an intelligent selection algorithm to select EEG channels.
Correlation-based channel selection (CCS) (Jin et al., 2019) assumes that there is a high correlation between task-related channels, then the relevant channels are selected.
Tables 4, 5 presents the classification accuracy and the number of channels at optimal accuracy for different methods, respectively. The optimal classification accuracy for each subject and mean are in bold. The SBFS method achieved the best classification accuracy for both single subjects and mean values. The SBFS and the ISFFS methods are similar in the number of selected channels, less than the CSP-rank and the CCS method.
Table 4.
Comparison of classification accuracy (%) with different channel selection methods on BCI competition IV-dataset 1.
| Subject | Methods | |||
|---|---|---|---|---|
| CSP-rank | ISFFS | CCS | SBFS | |
| a | 62.0 | 64.5 | 57.5 | 74.5 |
| b | 57.5 | 61.5 | 58.0 | 64.5 |
| f | 53.5 | 63.5 | 55.0 | 63.5 |
| g | 61.5 | 63.5 | 63.0 | 68.0 |
| Mean | 58.6 | 63.3 | 58.4 | 67.6 |
The optimal classification accuracies in each row are in bold.
Table 5.
Comparison of the number of selected channels with different channel selection methods on BCI competition IV-dataset 1.
| Subject | Methods | |||
|---|---|---|---|---|
| CSP-rank | ISFFS | CCS | SBFS | |
| a | 53 | 41 | 42 | 10 |
| b | 27 | 11 | 21 | 45 |
| f | 57 | 8 | 38 | 4 |
| g | 7 | 24 | 25 | 33 |
| Average | 36 | 21 | 32* | 23 |
Denotes that the original number is not an integer.
4. Discussion
4.1. Maps of the selected channels
We used MATLAB 2019 (see text footnote 1) with the EEGLAB toolbox (Delorme and Makeig, 2004) to plot topographic maps of subjects g, k3, and av from each of the datasets 1), 2), 3), as shown in Figure 6. The map of the channels selected by SBFS is shown in Figure 7.
Figure 6.

Topographic maps under two channel settings of subjects g, k3 and av from datasets 1), 2), 3). For each topographic map, the mean value of all trials in the training data is taken. The value of the unselected channels in the second column maps is set to 0.
Figure 7.

Maps of selected channels by SBFS for datasets 1), 2), 3). The blue circles represent selected channels. Darker colors indicate more selections. For dataset 1), channels selected more than twice from four subjects are colored in the top map. For dataset 2), channels selected more than once from three subjects are colored in the middle map. For dataset 3), channels selected more than twice from five subjects are colored in the bottom map.
The location of channels was compared with topographic maps. On the whole, the channels selected by SBFS were consistent with the corresponding ERD phenomena for all channels. Channel C3, C4, and Cz or their surrounding channels located in the motor area of the brain were selected multiple times. For dataset 1), Channel C4 and Cz were selected twice. Also, some channels (C2, C5, CFC3, and CCP2) around C3 and C4 were selected multiple times. Figure 6 shows that subject g in the right hand MI task, the ERD phenomena occurred mainly in the left cerebral cortex. For dataset 2), Channel C3 was selected. Channel 32 and 40, located around channel Cz, were also selected twice. In Figure 6, the ERD phenomena of subject k3 mainly occurred in the right cerebral cortex during left hand MI task. For dataset 3), all 5 subjects performed foot and right hand MI tasks. Channel C3 and Cz were selected twice. They were surrounded by channel CCP3 and CCP4 which were also selected multiple times. As shown in Figure 6, the ERD phenomena appears in the left cerebral cortex when subject av was performing a right hand MI task.
The SBFS method selected channels successively from bottom (i.e., serial number is larger) to top (i.e., serial number is smaller). This may lead to channels which are not related to MI being selected as well. The irregular channel positions resulted from an evaluation criterion by using cross-validation accuracy method. Many selected channels were located in the posterior part of the cerebral cortex. In the case of equal classification accuracy, the channels with larger ordinal numbers, i.e., the backward position, were retained, and the channels with smaller ordinal numbers, i.e., the forward position, were removed.
The topographic maps plotted using the EEGLAB toolbox under all channels and channels selected by the modified SBFS for subject k6 are shown in Figure 8, respectively. From Figure 8, it can also be seen that the topographic map of the channels selected by the modified SBFS was basically consistent with that of all channels. Specifically, clear ERD phenomenon can be observed in the left hand and right hand topographic maps under channels selected by modified SBFS.
Figure 8.

Topographic maps under two channel settings of subjects k6. For each topographic map, the mean value of all trials in the training data is taken. The value of the unselected channels in the maps is set to 0.
4.2. Parameter sensitivity analysis
Figure 9 plots the variation of classification accuracy with the number of selected channels by SBFS. With the increase of the number of selected channels, the overall trend of classification accuracy increases, then decreases. This may be due to the fact that there are few channels containing available information at the beginning and the initial accuracy is low. As the number of channels increases, the useful information keeps increasing and the accuracy is improved. With the sustainable increase of the number of channels, the number of redundant information channels increases as well, leading to the decrease of accuracy. Specifically, the change in accuracy with the number of channels is different for each subject. For example, the classification accuracies of subjects a, b, k6 and av have a slight decrease initially.
Figure 9.

Classification accuracy of subjects from datasets 1), 2), 3) under different number of selected channels by SBFS.
4.3. Future works
The SBFS method obviously outperforms other competing channel selection algorithms. In the following study, the proposed framework will be tested on datasets containing more subjects, such as the Physionet dataset (Goldberger et al., 2000), to strengthen sufficient persuasiveness. In addition, the SBFS can be combined with other features to further improve the classification performance in MI-based BCI. For example, for small sample data, it might be combined with regularized CSP feature (Lu et al., 2010).
In this paper, we studied the SBFS method for channel selection of MI tasks. The idea of a feature selection algorithm combined with task-related effective features can be extended to other tasks, e.g., for the SSVEP task, a combination of the SBFS and canonical correlation analysis (Lin et al., 2006) can be used, and for the rapid serial visual presentation task (Xu et al., 2021), a combination of the SBFS and hierarchical discriminant component analysis (Parra et al., 2007) can be used. Moreover, the proposed symmetrical strategy which is with respect to the optimization time can also be extended to other tasks, such as EEG-based emotion recognition (Chen T. et al., 2018; Gao et al., 2020; Tang et al., 2022).
Note that, ERD and ERS phenomenon are found not only in EEG but also in magnetoencephalography (MEG) recordings. As another non-invasive physiological signal, MEG-based BCI often involves more sensors. The existing MEG instrument based on superconducting quantum interference device technology is typically composed of 275 (gradiometer) or 306 (204 gradiometer and 120 magnetometer) sensors. Although a large number of sensors can provide higher spatial-temporal resolution for evaluating brain activity patterns, not all sensors can significantly improve classification accuracy. In addition, a larger number of channels implies a greater computation complexity. Recently, Roy et al. (2019, 2020) assessd the effect of channel selection using intelligent algorithms on MEG decoding of MI for the first time. Therefore, the application of the proposed method to MEG data can be explored in the future.
5. Conclusion
In this paper, the SBFS method is first applied to EEG channel selection, combining CSP features and an SVM classifier to form a new decoding framework. In order to reduce the time complexity of SBFS, the modified SBFS method is proposed, in which symmetrical channel pairs are removed or added in each iteration depending on the location of EEG channels at the scalp. Experimental results show that the proposed method can significantly improve the classification accuracy while reducing the number of EEG channels. The study provides a new approach to improve the reliability of future BCI systems.
Data availability statement
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.
Author contributions
CT: data collection, methodology, and writing—original draft. TG: data collation and analysis. YL: writing—reviewing. BC: reviewing and supervision. All authors contributed to the article and approved the submitted version.
Funding
This work was supported by the National Natural Science Foundation of China with Grant Nos. (U21A20485 and 61976175).
Conflict of interest
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Publisher's note
All claims expressed in this article are solely those of the authors and do not necessarily represent those of their affiliated organizations, or those of the publisher, the editors and the reviewers. Any product that may be evaluated in this article, or claim that may be made by its manufacturer, is not guaranteed or endorsed by the publisher.
Acknowledgments
We thank Ms. Mei Feng for her linguistic assistance during the preparation of this manuscript.
Footnotes
1MATLAB is a commercial mathematical software from MathWorks, Inc. in the U.S. 2019a is the version number.
References
- Ahirwal M. K. (2021). Analysis and identification of eeg features for mental stress, in Evolution in Computational Intelligence, eds Bhateja V. (Singapore: Springer; ), 201–209. [Google Scholar]
- Alotaiby T., Abd El-Samie F. E., Alshebeili S. A., Ahmad I. (2015). A review of channel selection algorithms for eeg signal processing. EURASIP J. Adv. Signal Process 2015, 1–21. 10.1186/s13634-015-0251-9 [DOI] [Google Scholar]
- Ang K. K., Chin Z. Y., Zhang H., Guan C. (2012). Mutual information-based selection of optimal spatial-temporal patterns for single-trial eeg-based bcis. Pattern Recognit. 45, 2137–2144. 10.1016/j.patcog.2011.04.018 [DOI] [Google Scholar]
- Ang K. K., Guan C. (2016). Eeg-based strategies to detect motor imagery for control and rehabilitation. IEEE Trans. Neural Syst. Rehabil. Eng. 25, 392–401. 10.1109/TNSRE.2016.2646763 [DOI] [PubMed] [Google Scholar]
- Arvaneh M., Guan C., Ang K. K., Quek H. C. (2010). EEG channel selection using decision tree in brain-computer interface, in Proceedings of the Second APSIPA Annual Summit and Conference (Singapore: ), 225–230. [Google Scholar]
- Baig M. Z., Aslam N., Shum H. P. (2020). Filtering techniques for channel selection in motor imagery eeg applications: a survey. Artif. Intell. Rev. 53, 1207–1232. 10.1007/s10462-019-09694-8 [DOI] [Google Scholar]
- Blankertz B., Muller K.-R., Krusienski D. J., Schalk G., Wolpaw J. R., Schlogl A., et al. (2006). The bci competition III: validating alternative approaches to actual bci problems. IEEE Trans. Neural Syst. Rehabil. Eng. 14, 153–159. 10.1109/TNSRE.2006.875642 [DOI] [PubMed] [Google Scholar]
- Chen B., Li Y., Dong J., Lu N., Qin J. (2018). Common spatial patterns based on the quantized minimum error entropy criterion. IEEE Trans. Syst. Man Cybern. Syst. 50, 4557–4568. 10.1109/TSMC.2018.2855106 [DOI] [Google Scholar]
- Chen T., Ju S., Yuan X., Elhoseny M., Ren F., Fan M., et al. (2018). Emotion recognition using empirical mode decomposition and approximation entropy. Comput. Electr. Eng. 72, 383–392. 10.1016/j.compeleceng.2018.09.022 [DOI] [Google Scholar]
- Delorme A., Makeig S. (2004). Eeglab: an open source toolbox for analysis of single-trial eeg dynamics including independent component analysis. J. Neurosci. Methods 134, 9–21. 10.1016/j.jneumeth.2003.10.009 [DOI] [PubMed] [Google Scholar]
- Dong J., Chen B., Lu N., Wang H., Zheng N. (2017). Correntropy induced metric based common spatial patterns, in 2017 IEEE 27th International Workshop on Machine Learning for Signal Processing (MLSP) (Tokyo: IEEE; ), 1–6. [Google Scholar]
- Dreißig M., Baccour M. H., Schäck T., Kasneci E. (2020). Driver drowsiness classification based on eye blink and head movement features using the k-NN algorithm, in 2020 IEEE Symposium Series on Computational Intelligence (SSCI) (Canberra, ACT: IEEE; ), 889–896. [Google Scholar]
- Gao X., Wang Y., Chen X., Gao S. (2021). Interface, interaction, and intelligence in generalized brain-computer interfaces. Trends Cogn. Sci. 25, 671–684. 10.1016/j.tics.2021.04.003 [DOI] [PubMed] [Google Scholar]
- Gao Y., Wang X., Potter T., Zhang J., Zhang Y. (2020). Single-trial eeg emotion recognition using granger causality/transfer entropy analysis. J. Neurosci. Methods 346, 108904. 10.1016/j.jneumeth.2020.108904 [DOI] [PubMed] [Google Scholar]
- Goldberger A. L., Amaral L. A., Glass L., Hausdorff J. M., Ivanov P. C., Mark R. G., et al. (2000). Physiobank, physiotoolkit, and physionet: components of a new research resource for complex physiologic signals. Circulation 101, e215-e220. 10.1161/01.CIR.101.23.e215 [DOI] [PubMed] [Google Scholar]
- Handiru V. S., Prasad V. A. (2016). Optimized bi-objective eeg channel selection and cross-subject generalization with brain-computer interfaces. IEEE Trans. Hum. Mach. Syst. 46, 777–786. 10.1109/THMS.2016.2573827 [DOI] [Google Scholar]
- Jin J., Liu C., Daly I., Miao Y., Li S., Wang X., et al. (2020). Bispectrum-based channel selection for motor imagery based brain-computer interfacing. IEEE Trans. Neural Syst. Rehabil. Eng. 28, 2153–2163. 10.1109/TNSRE.2020.3020975 [DOI] [PubMed] [Google Scholar]
- Jin J., Miao Y., Daly I., Zuo C., Hu D., Cichocki A. (2019). Correlation-based channel selection and regularized feature optimization for mi-based bci. Neural Netw. 118, 262–270. 10.1016/j.neunet.2019.07.008 [DOI] [PubMed] [Google Scholar]
- Kamrunnahar M., Dias N., Schiff S. (2009). Optimization of electrode channels in brain computer interfaces. Annu. Int. Conf. IEEE Eng. Med. Biol. Soc. 2009, 6477–64780. 10.1109/IEMBS.2009.5333585 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Karnaukh O., Karplyuk Y., Nikitiuk N. (2018). Evaluation of machine learning techniques for ECG t-wave alternans, in 2018 IEEE 38th International Conference on Electronics and Nanotechnology (ELNANO) (Kyiv: IEEE; ), 346–350. [Google Scholar]
- Lal T. N., Schroder M., Hinterberger T., Weston J., Bogdan M., Birbaumer N., et al. (2004). Support vector channel selection in bci. IEEE Trans. Biomed. Eng. 51, 1003–1010. 10.1109/TBME.2004.827827 [DOI] [PubMed] [Google Scholar]
- Li Y., Long J., Yu T., Yu Z., Wang C., Zhang H., et al. (2010). An eeg-based bci system for 2-d cursor control by combining mu/beta rhythm and p300 potential. IEEE Trans. Biomed. Eng. 57, 2495–2505. 10.1109/TBME.2010.2055564 [DOI] [PubMed] [Google Scholar]
- Lin Z., Zhang C., Wu W., Gao X. (2006). Frequency recognition based on canonical correlation analysis for ssvep-based bcis. IEEE Trans. Biomed. Eng. 53, 2610–2614. 10.1109/TBME.2006.886577 [DOI] [PubMed] [Google Scholar]
- Lu H., Eng H.-L., Guan C., Plataniotis K. N., Venetsanopoulos A. N. (2010). Regularized common spatial pattern with aggregation for eeg classification in small-sample setting. IEEE Trans. Biomed. Eng. 57, 2936–2946. 10.1109/TBME.2010.2082540 [DOI] [PubMed] [Google Scholar]
- Neuper C., Scherer R., Reiner M., Pfurtscheller G. (2005). Imagery of motor actions: differential effects of kinesthetic and visual-motor mode of imagery in single-trial eeg. Cognit. Brain Res. 25, 668–677. 10.1016/j.cogbrainres.2005.08.014 [DOI] [PubMed] [Google Scholar]
- Padfield N., Zabalza J., Zhao H., Masero V., Ren J. (2019). Eeg-based brain-computer interfaces using motor-imagery: techniques and challenges. Sensors 19, 1423. 10.3390/s19061423 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Parra L. C., Christoforou C., Gerson A. C., Dyrholm M., Luo A., Wagner M., et al. (2007). Spatiotemporal linear decoding of brain state. IEEE Signal Process. Mag. 25, 107–115. 10.1109/MSP.2008.4408447 [DOI] [Google Scholar]
- Pfurtscheller G., Da Silva F. L. (1999). Event-related eeg/meg synchronization and desynchronization: basic principles. Clin. Neurophysiol. 110, 1842–1857. 10.1016/S1388-2457(99)00141-8 [DOI] [PubMed] [Google Scholar]
- Pfurtscheller G., Neuper C. (2006). Future prospects of erd/ers in the context of brain-computer interface (bci) developments. Prog. Brain Res. 159, 433–437. 10.1016/S0079-6123(06)59028-4 [DOI] [PubMed] [Google Scholar]
- Picton T. W. (1992). The p300 wave of the human event-related potential. J. Clin. Neurophysiol. 9, 456–479. 10.1097/00004691-199210000-00002 [DOI] [PubMed] [Google Scholar]
- Pudil P., Novovičová J., Kittler J. (1994). Floating search methods in feature selection. Pattern Recognit. Lett. 15, 1119–1125. 10.1016/0167-8655(94)90127-917281766 [DOI] [Google Scholar]
- Qin X., Zheng Y., Chen B. (2019). Extract eeg features by combining power spectral density and correntropy spectral density, in 2019 Chinese Automation Congress (CAC) (Hangzhou: IEEE; ), 2455–2459. [Google Scholar]
- Qiu Z., Jin J., Lam H.-K., Zhang Y., Wang X., Cichocki A. (2016). Improved sffs method for channel selection in motor imagery based BCI. Neurocomputing 207, 519–527. 10.1016/j.neucom.2016.05.035 [DOI] [Google Scholar]
- Ramoser H., Muller-Gerking J., Pfurtscheller G. (2000). Optimal spatial filtering of single trial eeg during imagined hand movement. IEEE Trans. Rehabil. Eng. 8, 441–446. 10.1109/86.895946 [DOI] [PubMed] [Google Scholar]
- Roy S., Rathee D., Chowdhury A., McCreadie K., Prasad G. (2020). Assessing impact of channel selectionon decoding of motor and cognitive imagery from meg data. J. Neural Eng. 17, 056037. 10.1088/1741-2552/abbd21 [DOI] [PubMed] [Google Scholar]
- Roy S., Rathee D., McCreadie K., Prasad G. (2019). Channel selection improves meg-based brain-computer interface, in 2019 9th International IEEE/EMBS Conference on Neural Engineering (NER) (San Francisco, CA: IEEE; ), 295–298. [Google Scholar]
- Schröder M., Lal T. N., Hinterberger T., Bogdan M., Hill N. J., Birbaumer N., et al. (2005). Robust eeg channel selection across subjects for brain-computer interfaces. EURASIP J. Adv. Signal Process. 2005, 1–10. 10.1155/ASP.2005.3103 [DOI] [Google Scholar]
- Subasi A., Gursoy M. I. (2010). Eeg signal classification using pca, ica, lda and support vector machines. Expert. Syst. Appl. 37, 8659–8666. 10.1016/j.eswa.2010.06.065 [DOI] [Google Scholar]
- Tam W.-K., Ke Z., Tong K.-Y. (2011). Performance of common spatial pattern under a smaller set of eeg electrodes in brain-computer interface on chronic stroke patients: a multi-session dataset study, in 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society (Boston, MA: IEEE; ), 6344–6347. [DOI] [PubMed] [Google Scholar]
- Tang C., Li Y., Chen B. (2022). Comparison of cross-subject eeg emotion recognition algorithms in the bci controlled robot contest in world robot contest 2021. Brain Sci. Adv. 8, 142–152. 10.26599/BSA.2022.9050013 [DOI] [Google Scholar]
- Tangermann M., Müller K.-R., Aertsen A., Birbaumer N., Braun C., Brunner C., et al. (2012). Review of the BCI competition IV. Front. Neurosci. 6, 55. 10.3389/fnins.2012.00055 [DOI] [PMC free article] [PubMed] [Google Scholar]
- Taniguchi M., Kato A., Fujita N., Hirata M., Tanaka H., Kihara T., et al. (2000). Movement-related desynchronization of the cerebral cortex studied with spatially filtered magnetoencephalography. Neuroimage 12, 298–306. 10.1006/nimg.2000.0611 [DOI] [PubMed] [Google Scholar]
- Tork S., Yazdchi M. R., Karimizadeh A. (2013). How health related to quality of sleep and breathing: how to associate sleep disorder breathing and extracted respiratory signal, in 2013 21st Iranian Conference on Electrical Engineering (ICEE) (Mashhad: IEEE; ), 1–6. [Google Scholar]
- Vapnik V. (1999). The Nature of Statistical Learning Theory. New York, NY: Springer Science & Business Media. [Google Scholar]
- Varsehi H., Firoozabadi S. M. P. (2021). An eeg channel selection method for motor imagery based brain-computer interface and neurofeedback using granger causality. Neural Netw. 133, 193–206. 10.1016/j.neunet.2020.11.002 [DOI] [PubMed] [Google Scholar]
- Wang Y., Gao X., Hong B., Jia C., Gao S. (2008). Brain-computer interfaces based on visual evoked potentials. IEEE Eng. Med. Biol. Mag. 27, 64–71. 10.1109/MEMB.2008.923958 [DOI] [PubMed] [Google Scholar]
- Wei Q., Wang Y. (2011). Binary multi-objective particle swarm optimization for channel selection in motor imagery based brain-computer interfaces, in 2011 4th International Conference on Biomedical Engineering and Informatics (BMEI), Vol. 2 (Shanghai: IEEE; ), 667–670. [Google Scholar]
- Xu M., Chen Y., Wang D., Wang Y., Zhang L., Wei X. (2021). Multi-objective optimization approach for channel selection and cross-subject generalization in rsvp-based bcis. J. Neural Eng. 18, 046076. 10.1088/1741-2552/ac0489 [DOI] [PubMed] [Google Scholar]
- Yang J., Singh H., Hines E. L., Schlaghecken F., Iliescu D. D., Leeson M. S., et al. (2012). Channel selection and classification of electroencephalogram signals: an artificial neural network and genetic algorithm-based approach. Artif. Intell. Med. 55, 117–126. 10.1016/j.artmed.2012.02.001 [DOI] [PubMed] [Google Scholar]
- Yang Q., Zhang X., Chen B. (2020). Mi3dnet: a compact cnn for motor imagery eeg classification with visualizable dense layer parameters, in 2020 42nd Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC) (Montreal, QC: IEEE; ), 510–513. [DOI] [PubMed] [Google Scholar]
- Zhang Y., Guo D., Li F., Yin E., Zhang Y., Li P., et al. (2018). Correlated component analysis for enhancing the performance of ssvep-based brain-computer interface. IEEE Trans. Neural Syst. Rehabil. Eng. 26, 948–956. 10.1109/TNSRE.2018.2826541 [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
The original contributions presented in the study are included in the article/supplementary material, further inquiries can be directed to the corresponding author/s.

