Skip to main content
Computational Intelligence and Neuroscience logoLink to Computational Intelligence and Neuroscience
. 2019 Mar 28;2019:7529572. doi: 10.1155/2019/7529572

An Improved Refined Composite Multivariate Multiscale Fuzzy Entropy Method for MI-EEG Feature Extraction

Mingai Li 1,2,, Ruotu Wang 1, Jinfu Yang 1,2, Lijuan Duan 1
PMCID: PMC6458931  PMID: 31049051

Abstract

Feature extraction of motor imagery electroencephalogram (MI-EEG) has shown good application prospects in the field of medical health. Also, multivariate entropy-based feature extraction methods have been gradually applied to analyze complex multichannel biomedical signals, such as EEG and electromyography. Compared with traditional multivariate entropies, refined composite multivariate multiscale fuzzy entropy (RCmvMFE) overcomes the defect of unstable entropy values caused by the scale factor increase and is beneficial towards obtaining richer feature information. However, the coarse-grained process of RCmvMFE is mean filtered, which weakens Gaussian noise and is powerless against random impulse noise interference. This yields poor quality feature information and low accuracy classification. In this paper, RCmvMFE is improved (IRCmvMFE) by using composite filters in the coarse-grained procedure to enhance filter performance. Median filters are employed to remove the impulse noise interference from multichannel MI-EEG signals, and these filtered MI-EEGs are further smoothed by the mean filters. The multiscale IRCmvMFEs are calculated for all channels of composite filtered MI-EEGs, forming a feature vector, and a support vector machine is used for pattern classification. Based on two public datasets with different motor imagery tasks, the recognition results of 10 × 10-fold cross-validation achieved 99.43% and 99.86%, respectively, and the statistical analysis of experimental results was completed, showing the effectiveness of IRCmvMFE, as well. The proposed IRCmvMFE-based feature extraction method is superior compared to entropy-based and traditional methods.

1. Introduction

Brain-computer interface (BCI) is a new type of human-computer interaction technology that enables the brain to control external devices [1, 2]. Motor imagery electroencephalogram- (MI-EEG-) based BCI has great prospects in the field of rehabilitation medical engineering. One of the key technologies of BCI is the ability to effectively extract features from complex multichannel MI-EEG signals.

Previous studies focus on time-frequency methods in MI-EEG feature extraction, including wavelet transform (WT) [3], discrete WT (DWT) [4], Hilbert–Huang transform (HHT) [3], dual-tree complex WT (DTCWT) [5], empirical mode decomposition (EMD) [6], and common spatial pattern- (CSP-) based methods, including CSP, filter bank CSP (FBCSP), discriminant FBCSP (DFBCSP), sparse FBCSP (SFBCSP), and spectrally weighted CSP (SWCSP) [79]. With the development of nonlinear dynamics, entropy-based methods have been widely utilized in the analysis of biomedical signals. Specifically, the most prevalent methods are approximate entropy (ApEn) and sample entropy (SampEn) because of the power these methods have to quantify the complexity of a time series [1012]. Nevertheless, sensitivity to selected parameters will lead to entropy mutation. To solve this weakness, fuzzy entropy (FE) was proposed for EEG analysis, where fuzzy membership functions replace Heaviside functions [1315]. Research shows that FE alleviates the problem of entropy mutation; however, these methods analyze at a single scale, which loses useful information. Therefore, multiscale SampEn (MSE) [1619], multiscale FE (MFE) [20, 21], and improved MFE (IMFE) [22] were put forward to explore deeper information. Unfortunately, using larger scale factors for short time series may cause inaccurate MSEs and MFEs. To achieve more reliable results, composite MFE (CMFE) [23, 24] was developed as a rolling bearing fault detection method.

Although the above univariate methods have shown good performance, they are only suitable for single-channel recording analyses. They fail to measure multichannel data synchronously and ignore the dynamic characteristics across channels [25]. So, SampEn was extended to produce multivariate SampEn (mvSE) [26] and multivariate MSE (mvMSE) [2629] to analyze multichannel signals more effectively. Considering the disadvantages of SampEn in mvSE and mvMSE, multivariate FE (mvFE) and multivariate MFE (mvMFE) [30, 31] were yielded by replacing SampEn with FE. Recently, as an improvement of mvMFE, a refined composite mvMFE (RCmvMFE) was proposed to analyze fault signals and biomedical signals [32, 33]. In RCmvMFE, the entropy stability is improved and the signals' length sensitivity is reduced. However, the coarse-grained process of RCmvMFE is a mean filter that smoothens signals but does not eliminate random impulse noise interference. It is inevitable to produce high-amplitude electrooculogram and electromyography interference during the acquisition of MI-EEG. This is not conducive to extracting valid feature information from multichannel MI-EEG signals. In this paper, improved RCmvMFE (IRCmvMFE) is developed by combining median [34] and mean filters in the coarse-grained process to further improve filter effect, i.e., first the median filter is applied to each channel to remove pulse interference, and then the mean filter is used for further smoothing. Subsequently, IRCmvMFE is proposed to extract features from multichannel MI-EEG signals. The experimental research shows the effectiveness of IRCmvMFE.

The rest of the paper is described as follows: Section 2 introduces the process of extracting MI-EEG features using IRCmvMFE, Section 3 describes the experiments performed, Section 4 discusses the results, and Section 5 provides the conclusions.

2. Feature Extraction with IRCmvMFE

By combining median filters and mean filters in coarse-grained processes, RCmvMFE is improved to produce IRCmvMFE, which is applied to extract features of MI-EEG. The main steps are as follows: preprocessing, optimal channel selecting, performing multivariate coarse-grained analysis of preprocessed MI-EEG data, calculating IRCmvMFE, and constructing a feature vector. The support vector machine (SVM) was used to classify the feature vector. The block diagram of the proposed method is displayed in Figure 1.

Figure 1.

Figure 1

Block diagram of the proposed method.

2.1. Preprocessing MI-EEG Signals

For two-class motor imagery tasks, assume that X T,Ci 0=[X T,Ci 0(1), X T,Ci 0(2),…,X T,Ci 0(e)]T represents the ith channel MI-EEG sequence of the Tth task, where T ∈ {1,2}, i=1,2,…, p; e and p represent the sample points and the number of total channels, respectively. X T,Ci 0 is bandpass filtered to the frequency band associated with the tasks and is expressed as X T,Ci 1=[X T,Ci 1(1), X T,Ci 1(2),…,X T,Ci 1(e)]T. The motor imagery time period [a, b] is taken as the optimal sampling interval, and MI-EEG signals in the segment are summarized as X T,Ci 2=[X T,Ci 2(1),…,X T,Ci 2(N)]T, where N=ba+1 represents the sampled MI-EEG points within the optimal sampling interval.

2.2. Channel Selection

When the brain is engaged in motor imagery, only parts of channels are activated in the form of the power spectrum. Extracting the features of all channels not only increases the computational complexity but also increases the feature information redundancy and reduces the classification accuracy [35]. Therefore, the choice of optimal channels is important. In this paper, the Fisher score of the average power spectrum of X T,Ci 2 is calculated to select channels according to the following equation:

Fi=P1iP2i2varP1i+varP2i, (1)

where P 1(i) and P 2(i) represent the average power spectrum on the ith channel of class 1 and class 2 motor imagery tasks, respectively. var() is the variance and F(i) represents the ith channel Fisher score. The larger the F(i), the greater contribution of the ith channel. The signals of p′ channels with the top F(i) are selected for subsequent research. X T,Ci 2 is rewritten as X T,Ci=[X T,Ci(1),…,X T,Ci(N)]T, where i=1,2,…, p′, in which p′ stands for the number of selected channels.

2.3. Coarse-Graining of IRCmvMFE

Step 1. —

In the coarse-grained process of IRCmvMFE, the median filter is first performed on X T,Ci. Supposing the filter size is j=2k or j=2k+1, the data in the window would be sorted in ascending order with the filter output being

yT,Cis=medXT,Cis=XT,Cik+1,j=2k+1,12XT,Cik+XT,Cik+1,j=2k, (2)

where s ∈ {1,…, N} and X T,Ci(k) means the kth maximum value in the window.

Step 2. —

For the scale factor τ, the kth coarse-grained sequence on C i channel of Tth class task is

yT,Cik,τj=1τs=j1τ+kjτ+k1yT,Cis,1jN,1kτ, (3)

where N′=int[N/τ] represents the sample points of the coarse-grained sequence. Therefore, τ multivariate coarse-grained sequences are obtained and described as Y T,Ci 1,τ=[y T,Ci 1,τ(1), y T,Ci 1,τ(2),…, y T,Ci 1,τ(N′)], Y T,Ci 2,τ=[y T,Ci 2,τ(1), y T,Ci 2,τ(2),…, y T,Ci 2,τ(N′)],…, Y T,Ci τ,τ=[y T,Ci τ,τ(1), y T,Ci τ,τ(2),…, y T,Ci τ,τ(N′)].

2.4. IRCmvMFE Calculation

Step 1. —

The multivariate coarse-grained sequence Y T,Ci k,τ is executed for multivariate embedded reconstruction, with the multivariate composite delay vectors Z T,m k,τ(i) calculated as

ZT,mk,τi=yT,C1k,τi,,yT,C1k,τi+m11λ1,yT,C2k,τi,,yT,C2k,τi+m21λ2,,yT,Cpk,τi,,yT,Cpk,τi+mp1λp, (4)

where i ∈ [1, N′ − n] and M=[m 1, m 2,…, m p] and λ=[λ 1, λ 2,…, λ p] are the embedding dimension vector and time delay vector, respectively. Additionally, m=∑i=1 p m i and n=max{M} × max{λ}.

Step 2. —

The distance of any two multivariate composite delay vectors Z T,m k,τ(i) and Z T,m k,τ(j) is computed in the following equation:

dZT,mk,τi,ZT,mk,τj=dT,mij,k,τ=maxyT,Cgk,τi+l1,yT,Chk,τj+l1,l=1,2,,m, (5)

where i, j ∈ [1, N′ − n], ij, and g, h ∈ [1, p′].

Step 3. —

Given a threshold r, suppose the fuzzy membership function is μ(x, r)=e d2/r, the similarity D T,m ij,k,τ between Z T,m k,τ(i) and Z T,m k,τ(j) is

DT,mij,k,τ=μdT,mij,k,τ,r=expdT,mij,k,τ2r. (6)

Step 4. —

The average membership grade ϕ T,m k,τ(r) can be obtained using the following equation:

ϕT,mk,τr=1N1i=1Nnj=1,jiN1DT,mij,k,τNn1. (7)

Step 5. —

Repeat the above steps, extend the dimension of the multivariate composite delay vector from m to m+1 and derive ϕ T,m+1 k,τ. For each Z T,m k,τ(i), we get τϕ T,m k,τ(r) and τϕ T,m+1 k,τ(r). The average ϕ¯T,mk,τr and ϕ¯T,m+1k,τr are calculated. The definition of IRCmvMFE is as follows:

IRCmvMFEXT,Ci,M,τ,n,r=lnϕ¯T,m+1τrϕ¯T,mτr. (8)

The procedure for calculating IRCmvMFE is summarized in Algorithm 1.

Algorithm 1.

Algorithm 1

The procedure for calculating IRCmvMFE.

2.5. Determination of a Maximum Scale Factor

As the number of scale factors increases, multivariate coarse-grained sequences become smoother. Scale factors that are too large omit useful information and reduce classification accuracy. Therefore, the impact on sequence smoothness and classification accuracy should be considered comprehensively to determine its maximum scale factor τ max.

2.6. Construction of a Feature Vector

For τ ∈ [1, τ max], IRCmvMFE at τ scale in the Tth class task, i.e., IRCmvT τ, is estimated and combined to form the feature vector F T:

FT=IRCmvT1,IRCmvT2,,IRCmvTτmaxRτmax. (9)

The feature vectors of the two tasks are fused in parallel to obtain the feature vector of MI-EEG:

F=F1F2R2×τmax. (10)

3. Experimental Research

3.1. Data Description and Preprocessing

MI-EEG data were obtained from dataset III in the BCI Competition II [36] and dataset IVa in the BCI Competition III [37]. MI-EEG signals on channels C3, Cz, and C4 were recorded in dataset III of BCI Competition II, where the data were from a healthy subject who imagined left-right hand movement. Left- and right-motor imagery tasks were each performed 140 times for a total of 280 experimental trials. The signals were sampled at 128 Hz and filtered to 0.5–30 Hz. The MI-EEG collection timing scheme is shown in Figure 2(a). The subject was at rest for the first 2 s, and the corresponding motor imagery task was completed according to the screen prompts from 3 s to 9 s. To better distinguish the two-class tasks, this paper used the sampling interval [451, 900].

Figure 2.

Figure 2

Timing scheme of MI-EEG collection. (a) Left-right-hand motor imagery task. (b) Right-hand/foot motor imagery task.

The dataset IVa of BCI Competition III recorded the MI-EEG signals of five healthy subjects using 118 channels during right-hand (RH) and right-foot (RF) motor imagery tasks. The original sampling rate was 1000 Hz, but we downsampled these data to 100 Hz. The subjects performed the corresponding imaginary movement according to the prompts in the first 3.5 s and then rested for a random epoch between 1.75 s and 2.25 s. The timing scheme of MI-EEG collection during the right-hand-foot motor imagery task is shown in Figure 2(b). Each subject performed 280 trials, with 140 each of the RH and RF motor imagery tasks. In this paper, MI-EEG related to mu rhythm (8–13 Hz) and beta rhythm (14–32 Hz) related to motor imagery tasks were selected, i.e., the original MI-EEG signals were preprocessed by a bandpass filter of 8–32 Hz. The data between 0.5 s and 3.5 s were used for subsequent experimental research.

3.2. Channel Selection

Channel selection directly affects the quality of feature information and classification accuracy. It is essential to select the optimal channels before extracting MI-EEG features. There was a close relationship between the signals on channels C3, Cz, and C4 in the left-right-hand motor imagery task, so the data of these three channels were used for feature extraction. When RH and RF motor imagery tasks were conducted in dataset IVa from BCI Competition III, the Fisher Score of each channel was calculated by equation (1). The scores of different subjects are shown in Figure 3.

Figure 3.

Figure 3

The Fisher scores of all channels with different subjects. (a) Subject “aa.” (b) Subject “al.” (c) Subject “av.” (d) Subject “aw.” (e) Subject “ay.”

For each subject, the score of each channel is different and for different subjects, scores from the same channel are different. Thus, the optimal channels for each subject are different due to individual differences. The channels with the top three Fisher scores can be used as the optimal channels. The detailed information is shown in Table 1.

Table 1.

The optimal channel combination.

Subjects Optimal channel combination
“aa” 60, 70, 69
“al” 52, 23, 61
“av” 31, 95, 102
“aw” 56, 65, 75
“ay” 60, 52, 43

3.3. Comparison of Coarse-Grained Sequences between IRCmvMFE and RCmvMFE Methods

To confirm the effectiveness of IRCmvMFE in extracting MI-EEG features, the coarse-grained processes of RCmvMFE and IRCmvMFE were compared. The relevant parameters were selected as follows: m k=2, λ k=1, r=0.2SD,  and τ=10, where SD represents the standard deviation of X T,Ci. According to Table 1, the channel k with the highest Fisher score of each subject was selected. The experimental process was as follows: when a motor imagery task was performed, at τ scale, the first j(0 ≤ jNτ) points of X T,k were removed in turn. The RCmvMFEs of the remaining points were calculated separately, and they were composed of time series recorded as RCT,k,i=[RCT,k,i(1), RCT,k,i(2),…, RCT,k,i(Nτ+1)], where T ∈ {1,2}, i ∈ [1, n e], and n e represents the number of experiments. The RCT,k,i of n e experiments were superimposed and averaged to obtain the average time series RCT,k. The average time series of IRCmvMFE was obtained the same way as IRCT,k. When imaging left-right-hand motor imagery, training set data were used for analysis, i.e., n e=70. Similarly, n e was selected as 140 when the RH and RF motor imagery were performed. The amplitude of the original MI-EEG signals and coarse-grained sequences of RCmvMFE and IRCmvMFE during left-right-hand motor imagery are displayed in Figure 4. Similarly, the experimental results from imaging right-hand-foot movement are shown in Figure 5.

Figure 4.

Figure 4

Comparison of MI-EEG and the coarse-grained sequences using RCmvMFE and IRCmvMFE during (a) left-hand motor imagery task and (b) right-hand motor imagery task.

Figure 5.

Figure 5

Comparison of MI-EEG signals and the coarse-grained sequences by RCmvMFE and IRCmvMFE during RH and RF motor imagery tasks. (a) Subject “aa” with RH motor imagery. (b) Subject “aa” with RF motor imagery. (c) Subject “al” with RH motor imagery. (d) Subject “al” with RF motor imagery. (e) Subject “av” with RH motor imagery. (f) Subject “av” with RF motor imagery. (g) Subject “aw” with RH motor imagery. (h) Subject “aw” with RF motor imagery. (i) Subject “ay” with RH motor imagery. (j) Subject “ay” with RF motor imagery.

It can be seen from Figure 4 that the original MI-EEG signals had larger fluctuations, which was obviously improved after the coarse-grained process of both RCmvMFE and IRCmvMFE; and the smoothness of IRCmvMFE was better than RCmvMFE. In Figure 5, there are different intensity impulse noise interferences for different subjects. The coarse-grained sequences of both RH and RF motor imagery tasks using the RCmvMFE and IRCmvMFE of each subject changed with the fluctuations of the original MI-EEG but oscillated more smoothly. For subject “aw,” the impulse noise is not obvious, and the coarse-grained sequence of IRCmvMFE had larger fluctuations than that of RCmvMFE. But the intensity of impulse noise interference is higher for other subjects. Both RCT,k and IRCT,k showed better smoothness, and IRCT,k was superior to RCT,k for rapid MI-EEG changes. The reason is that the coarse-grained process of RCmvMFE is equivalent to a mean filter, which has the effect of low-pass filtering and smoothing and can remove some random interference. However, it is helpless against impulse noise caused by sudden factors such as eye-movements, blinks, and motion. In the coarse-grained IRCmvMFE, the median filter is assigned to remove the impulse noise interference, and then the filtered signals are smoothed by a mean filter.

3.4. Selection of Parameters in IRCmvMFE

The parameter selection will affect the estimate of IRCmvMFE. According to equation (8), the estimation of IRCmvMFE is not only related to the preprocessed MI-EEG but also involves selecting an embedding dimension vector M=[m 1, m 2,…, m p], time delay vector λ=[λ 1, λ 2,…, λ p], threshold r, and scale factor τ. The selection of parameter M was similar to reference [32], i.e., m k=2. Parameter λ does not have any proven standards, so for simplicity, λ k was selected as 1. The threshold r was determined as r=0.2SD.

In addition, the selection of τ influenced the filter effect in the coarse-grained process of MI-EEG and affected the extracted features and the classification results in turn. The larger the τ, the larger the calculation and the better the recognition. In contrast, a smaller τ resulted in poor filter performance [34]. When τ ∈ [1,75], the IRCmvMFEs with imaging left-right-hand movements were estimated and then classified by SVM. Gaussian kernel function was employed in this paper, and SVM optimized by grid search. When τ ∈ [1,45], the same experiment was performed with right-hand-foot motor imagery tasks. The 10 × 10-fold cross-validation (CV) was used to eliminate the contingency in the feature extraction process of MI-EEG. The average classification accuracy of the 10 × 10-fold CV is shown in Figure 6.

Figure 6.

Figure 6

Effects of changing τ on the classification results during (a) left-right-hand motor imagery and (b) right-hand-foot motor imagery.

In Figure 6(a), the classification results gradually increased as the scale factor τ increased. When τ was from 55 to 75, the classification accuracy tended to be stable and close to 100%, and the highest recognition was obtained at 65 scale. Therefore, the maximum τ about left-right-hand motor imagery was selected as 65. In Figure 6(b), with the increased τ, the average recognition rate of each subject first increased and then later decreased. In this paper, the τ max values of subjects “aa”, “al”, “av”, “aw,” and “ay” during right hand-foot-motor imagery were chosen as 41, 37, 33, 38, and 39, respectively. And, τ max is related to the mathematical model of the coarse-grained process of IRCmvMFE. There is a significant difference in τ max during different types of two-class motor imagery tasks, while the difference between multiple subjects during the same type of tasks is not obvious.

3.5. Comparison of Multiple Entropy-Based Feature Extraction Methods

In this section, the comparative study of IRCmvMFE and various entropy-based feature extraction methods was conducted. To make the comparison process more objective, the same dataset was selected as reference [13, 22], i.e., dataset III from BCI Competition II, and SVM was used for classification. The classification result of IMFE was derived from [22], and the related parameters of other entropy-based methods were selected as references [13]. The average recognition results of 10 × 10-fold CV and standard deviations are displayed in Figure 7.

Figure 7.

Figure 7

Comparison of multiple entropy-based feature extraction methods.

In Figure 7, the classification result of MFE was higher than SampEn, FE, and MSE. Because the fuzzy membership function was used to enhance the stability of MFE, richer feature information from the multiscale was collected. At the same scale, the information of multiple coarse-grained sequences was integrated by CMFE, yielding a slightly better result. Based on the parameters' independent optimization strategy, the preferred parameters were used by IMFE to extract features from the MI-EEG, and the recognition accuracy was further improved. Despite the results of mvSE, mvFE, and mvMSE being poor, mvMFE, RCmvMFE, and IRCmvMFE showed the advantages of multivariate entropy methods over traditional univariate entropies, both in terms of classification accuracy and standard deviation. This was mainly because these feature extraction methods evaluated the multivariate complexity of multichannel data and expressed the dynamic relationships and synchronizations across channels.

IRCmvMFE, RCmvMFE, and mvMFE methods displayed superiority on dataset III from BCI Competition II. To further illustrate the improvement of IRCmvMFE, a comparative study of these three methods was performed based on dataset IVa and using SVM for classification. The classification results with 10×10-fold CV are shown in Table 2. For each subject, the recognition rates obtained by using RCmvMFE to extract features of MI-EEG were higher than those of mvMFE because the multivariate feature of RCmvMFE was considered at the same scale, and the defect of unstable entropy values, i.e., coarse-grained time series shortening with scale factor increases, was overcome. Moreover, a composite filter technique was applied in the coarse-grained process of IRCmvMFE to eliminate burst-like impulse noise and the Gaussian noise of the MI-EEG, which produced better quality information. For different subjects, IRCmvMFE achieved better recognition accuracy and a smaller standard deviation than RCmvMFE, illustrating the stability and superiority of IRCmvMFE. Further, according to Figure 5, the impulse noise interference was not obvious for subject “aw,” and the recognition result by IRCmvMFE was slightly better than RCmvMFE. However, there was greater impulse noise interference for most subjects (“aa,” “al,” “av,” “ay”), after using IRCmvMFE to enhance the filter effect, the recognition results were obviously improved.

Table 2.

Comparison of average recognition rates with 10 × 10-fold CV (%) of multivariate entropy-based feature extraction methods.

Methods Subjects Average results (%)
“aa” “al” “av” “aw” “ay”
mvMFE 75.54 ± 1.70 86.00 ± 0.98 74.46 ± 1.54 84.07 ± 1.52 79.32 ± 1.24 79.88 ± 1.40
RCmvMFE 93.82 ± 1.48 98.46 ± 0.61 95.86 ± 1.01 98.79 ± 0.72 97.04 ± 0.79 96.97 ± 0.92
IRCmvMFE 99.39 ± 0.38 99.71 ± 0.15 98.61 ± 0.62 99.61 ± 0.26 99.82 ± 0.35 99.43 ± 0.35

3.6. Statistical Analysis

In this section, statistical analysis was performed to further describe the development of IRCmvMFE. The kappa coefficient, which was designed to measure the classification precision and the comparison of performance in multiclass tasks, was made fairer. This method is a common indicator for evaluating the performance of BCI systems [38, 39]. The calculation of κ coefficient was expressed as

κ=p0pe1pe, (11)

where p 0 represents the classification accuracy and p e means the probability of opportunity consistency. For a two-class task, if the number of samples across classes was equal, then the value of p e was 0.5. Using equation (11), the mean kappa coefficients of IRCmvMFE, RCmvMFE, and mvMFE with 10 × 10-fold CV were calculated. The results are shown in Table 3.

Table 3.

Kappa coefficients of multivariate entropy-based methods.

Methods Subjects Mean
“aa” “al” “av” “aw” “ay”
mvMFE 0.5108 0.7200 0.4928 0.6814 0.5864 0.5983
RCmvMFE 0.8764 0.9692 0.9172 0.9794 0.9408 0.9366
IRCmvMFE 0.9928 0.9942 0.9722 0.9942 0.9986 0.9904

Comparing the mean kappa values, the results of MI-EEG feature extraction from each subject was highest when using IRCmvMFE; this result revealed that IRCmvMFE had better consistency than those of RCmvMFE and mvMFE.

3.7. Comparison of Multiple Traditional Feature Extraction Methods

A variety of traditional feature extraction methods [39] were compared with the method presented in this paper, using SVM as a classifier. In Table 4, the top classification results and average classification of 10 × 10-fold CV of referenced feature extraction methods [37] on BCI competition II are displayed. IRCmvMFE achieved the highest classification accuracy over the referenced methods, and its 10 × 10-fold CV results were also better; it also showed the ability of IRCmvMFE to quantify the complexity of multichannel signals and implied its superiority in extracting features from MI-EEG signals.

Table 4.

Comparison of multiple traditional feature extraction methods on BCI competition II.

Reference number Methods Top classification rates (%) Average classification rates (%)
[3] WT 83.57
[3] HHT 87.86
[4] DWT 96.06
[5] DTCWT 91.07
[6] EMD 99.48
[7] CSP 82.86
This paper IRCmvMFE 100 99.86

Note: “—” represents that average recognition rate of 10 × 10-fold CV is not given in the reference.

The CSP-based feature extraction methods have been extensively studied on BCI competition III. The experimental results of 10 × 10-fold CV with CSP, filter bank CSP (FBCSP), discriminant FBCSP (DFBCSP), sparse FBCSP (SFBCSP), and spectrally weighted CSP (SWCSP) methods were from references [8, 9]. The method presented in this paper was compared with these methods, and the recognition rates are shown in Table 5. The results of CSP-based feature extraction were lower than those of IRCmvMFE. CSP-based methods only considered the spatial characteristics of MI-EEG signals, ignoring the features in other domains. IRCmvMFE effectively extracted nonlinear dynamic features of MI-EEG, correctly analyzed multichannel signals, and had good applicability in multiple subjects.

Table 5.

Comparison with multiple CSP-based feature extraction methods in BCI competition III.

Reference number Methods Subjects Average results (%)
“aa” “al” “av” “aw” “ay”
[8] CSP 79.89 97.89 70.39 92.14 92.14 86.67
[8] FBCSP 90.39 97.82 72.54 97.21 94.54 90.50
[8] DFBCSP 90.32 98.46 75.14 97.82 95.29 91.40
[8] SFBCSP 91.54 98.57 77.43 97.03 94.69 92.05
[9] SWCSP 94.2 99.2 78 97.7 95.6 93.0
This paper IRCmvMFE 99.39 99.71 98.61 99.61 99.82 99.43

4. Discussion

In this paper, IRCmvMFE was proposed as a feature extraction method for MI-EEG signals. In IRCmvMFE, a composite filter technique was applied to improve the coarse-grained process of RCmvMFE, which eliminated impulse noise interference due to random factors, produced smoother MI-EEG time series, and enhanced the filter results. The optimal channels and the optimal parameters were selected to calculate IRCmvMFE for each subject when imaging left-right-hand or right-hand-foot movement. Multiscale IRCmvMFEs were constructed as a feature vector. Entropy-based and traditionally referenced feature extraction methods were compared on two public datasets. The kappa coefficients of IRCmvMFE, RCmvMFE, and mvMFE were calculated for statistical analysis. The results implied the superiority and applicability of IRCmvMFE for the analysis of two-class motor imagery tasks. In the future, we will continue to focus on the research of multiclass motor imagery tasks.

5. Conclusions

A novel nonlinear dynamics method based on RCmvMFE, called IRCmvMFE, was introduced in this study. This method provides a potential tool for the nonlinear dynamic analysis of multichannel MI-EEG signals. RCmvMFE was developed using a composite filter technique in the coarse-grained process, which effectively removes impulse noise interference, better reflects the dynamic correlations both within and across channels, and is more closely matched the nonlinear and time-varying characteristics of MI-EEG and produced better features and classification accuracy. IRCmvMFE was applied to the analysis of multichannel MI-EEG signals and was compared to other commonly used feature extraction methods. IRCmvMFE yielded the highest classification results and improved stability; it also displayed the applicability of IRCmvMFE for MI-EEG feature extraction and provided a useful tool for the analysis of other complex, two-class biological signals.

Acknowledgments

This research was financially supported by the National Natural Science Foundation of China (Nos. 81471770, 11882003, and 61672070) and the Natural Science Foundation of Beijing (No. 7132021).

Data Availability

Two previously reported datasets were used to support this study and are available at http://bbci.de/competition/ii/ and http://www.bbci.de/competition/iii. These datasets are cited at relevant places within the text as references [36, 37].

Conflicts of Interest

The authors declare that there are no conflicts of interest regarding the publication of this article.

References

  • 1.Shi J. H., Shen J. Z., Wang P. Feature extraction and classification of EEG signals in four kinds of motion imagination. Journal of Zhejiang University (Engineering Science) 2012;46(2):338–344. [Google Scholar]
  • 2.Bashar S. K., Hassan A. R., Bhuiyan M. I. H. Motor imagery movements classification using multivariate EMD and short time fourier transform. Proceedings of the Annual IEEE India Conference 2015; December 2015; New Delhi, India. [Google Scholar]
  • 3.Sun H. W., Fu Y. F., Xiong X., et al. Identification of EEG induced by motor imagery based on hilbert-huang transform. Acta Automatica Sinica. 2015;41(9):1686–1692. [Google Scholar]
  • 4.He K. R., Wang X. G., Zou L., et al. Brain-computer interface design using signal powers extracted during motor imagery tasks. Chinese Journal of Biomedical Engineering. 2011;20(4):139–149. [Google Scholar]
  • 5.Bashar S. K., Hassan A. R., Bhuiyan M. I. H. Identification of motor imagery movements from EEG signals using dual tree complex wavelet transform. Proceedings of the 2015 International Conference on Advances in Computing, Communications and Informatics; August 2015; Kochi, India. IEEE; pp. 290–296. [Google Scholar]
  • 6.Medina S. B., Duque M. L. Fuzzy entropy relevance analysis in DWT and EMD for BCI motor imagery applications. Ingeniería. 2015;20(1):9–19. [Google Scholar]
  • 7.Liu C., Zhao H. B., Li C. S., Wang H. CSP/SVM-based EEG classification of imagined hand movements. Journal of Northeastern University (Natural Science) 2010;31(8):1098–1011. [Google Scholar]
  • 8.Zhang Y., Zhou G., Jin J., Wang X., Cichocki A. Optimizing spatial patterns with sparse filter bands for motor-imagery based brain-computer interface. Journal of Neuroscience Methods. 2015;255(1):85–91. doi: 10.1016/j.jneumeth.2015.08.004. [DOI] [PubMed] [Google Scholar]
  • 9.Meng J., Huang G., Zhang D., Zhu X. Optimizing spatial spectral patterns jointly with channel configuration for brain-computer interface. Neurocomputing. 2013;104:115–126. doi: 10.1016/j.neucom.2012.11.004. [DOI] [Google Scholar]
  • 10.Kale K. K., Gawande J. P. Automated feature extraction of epileptic EEG using approximate entropy. Proceedings of the 12th International Conference on Hybrid Intelligent Systems (HIS); December 2012; Adelaide, Australia. IEEE; pp. 474–477. [Google Scholar]
  • 11.Wang L., Xu G. Z., Yang S., Wang J., Guo M., Yan W. Motor imagery BCI research based on sample entropy and SVM. Proceedings of the International Conference on Electromagnetic Field Problems & Applications; June 2012; Dalian, Liaoning, China. IEEE; pp. 1–4. [Google Scholar]
  • 12.Li L., Cao R., Xiang J. Comparative study of approximate entropy and sample entropy based on characterization of EEG. Computer Engineering and Design. 2014;35(3):1021–1026. [Google Scholar]
  • 13.Tian J., Luo Z. Z. Motor imagery EEG feature extraction based on fuzzy entropy. Journal of HuaZhong University of Science and Technology. 2013;41(1):92–95. [Google Scholar]
  • 14.Cao Y., Cai L., Wang J., et al. Characterization of complexity in the electroencephalograph activity of Alzheimer’s disease based on fuzzy entropy. Chaos: An Interdisciplinary Journal of Nonlinear Science. 2015;25(8) doi: 10.1063/1.4929148.083116 [DOI] [PubMed] [Google Scholar]
  • 15.Xiang J., Li C., Li H., et al. The detection of epileptic seizure signals based on fuzzy entropy. Journal of Neuroscience Methods. 2015;243:18–25. doi: 10.1016/j.jneumeth.2015.01.015. [DOI] [PubMed] [Google Scholar]
  • 16.Liu Q., Chen Y. F., Fan S. Z., Abbod M. F., Shieh J. S. EEG signals analysis using multiscale entropy for depth of anesthesia monitoring during surgery through artificial neural networks. Computational and Mathematical Methods in Medicine. 2015;2015:16. doi: 10.1155/2015/232381.232381 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 17.Mizuno T., Takahashi T., Cho R. Y., et al. Assessment of EEG dynamical complexity in Alzheimer’s disease using multiscale entropy. Clinical Neurophysiology. 2010;121(9):1438–1446. doi: 10.1016/j.clinph.2010.03.025. [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 18.Tsai P. H., Chang S. C., Liu F. C., Jenho Tsao, Wang Y. H., Lo M. T. A novel application of multiscale entropy in electroencephalography to predict the efficacy of acetylcholinesterase inhibitor in alzheimer’s disease. Computational and Mathematical Methods in Medicine. 2015;2015:8. doi: 10.1155/2015/953868.953868 [DOI] [PMC free article] [PubMed] [Google Scholar]
  • 19.Humeau-Heurtier A. The multiscale entropy algorithm and its variants: a review. Entropy. 2015;17(5):3110–3123. doi: 10.3390/e17053110. [DOI] [Google Scholar]
  • 20.Zou X., Lei M. Pattern recognition of surface electromyography signal based on multi-scale fuzzy entropy. Journal of Biomedical Engineering. 2012;29(6):1184–1188. [PubMed] [Google Scholar]
  • 21.Zheng J., Cheng J., Yang Y., Luo S. A rolling bearing fault diagnosis method based on multi-scale fuzzy entropy and variable predictive model-based class discrimination. Mechanism and Machine Theory. 2014;78:187–200. doi: 10.1016/j.mechmachtheory.2014.03.014. [DOI] [Google Scholar]
  • 22.Li M., Liu H., Zhu W., Yang J. Applying improved multiscale fuzzy entropy for feature extraction of MI-EEG. Applied Sciences. 2017;7(1):p. 92. doi: 10.3390/app7010092. [DOI] [Google Scholar]
  • 23.Zheng J., Pan H., Cheng J. Rolling bearing fault detection and diagnosis based on composite multiscale fuzzy entropy and ensemble support vector machines. Mechanical Systems and Signal Processing. 2017;85:746–759. doi: 10.1016/j.ymssp.2016.09.010. [DOI] [Google Scholar]
  • 24.Zheng J. D., Pan H. Y., Cheng S. J., et al. Composite multi-scale fuzzy entropy based rolling bearing fault diagnosis method. Journal of Vibration and Shock. 2016;35(8):116–123. [Google Scholar]
  • 25.Li P., Y Liu C., Li L. P., et al. Multiscale multivariate fuzzy entropy analysis. Acta Physica Sincia. 2013;62(12)120512 [Google Scholar]
  • 26.Ahmed M. U., Mandic D. P. Multivariate multiscale entropy: a tool for complexity analysis of multichannel data. Physica A. 2017;465:261–276. doi: 10.1103/PhysRevE.84.061918. [DOI] [PubMed] [Google Scholar]
  • 27.Ni L., Cao J., Wang R. Time-dependent multivariate multiscale entropy based analysis on brain consciousness diagnosis. Proceedings of the BICS: International Conference on Brain Inspired Cognitive Systems; June 2013; Beijing, China. [Google Scholar]
  • 28.Ahmed M. U., Mandic D. P. Multivariate multiscale entropy analysis. IEEE Signal Processing Letters. 2012;19(2):91–94. doi: 10.1109/lsp.2011.2180713. [DOI] [Google Scholar]
  • 29.Ni L., Cao J., Wang R. Dynamic multivariate multiscale entropy based analysis on brain death diagnosis. Science China Technological Sciences. 2015;58(3):425–433. doi: 10.1007/s11431-014-5757-0. [DOI] [Google Scholar]
  • 30.Wang X. Y., Xu Y. H. Signal classification method for Parkinson gait based on multivariate multiscale fuzzy entropy. Beijing Biomedical Engineering. 2016;35(5):489–496. [Google Scholar]
  • 31.Ahmed M. U., Chanwimalueang T., Thayyil S., Mandic D. A multivariate multiscale fuzzy entropy algorithm with application to uterine EMG complexity analysis. Entropy. 2016;19(1):p. 2. doi: 10.3390/e19010002. [DOI] [Google Scholar]
  • 32.Zheng J., Tu D., Pan H., Hu X., Liu T., Liu Q. A refined composite multivariate multiscale fuzzy entropy and laplacian score-based fault diagnosis method for rolling bearings. Entropy. 2017;19(11):p. 585. doi: 10.3390/e19110585. [DOI] [Google Scholar]
  • 33.Azami H., Escudero J. Refined composite multivariate generalized multiscale fuzzy entropy: a tool for complexity analysis of multichannel signals. Physica A: Statistical Mechanics and its Applications. 2017;465:261–276. doi: 10.1016/j.physa.2016.07.077. [DOI] [Google Scholar]
  • 34.Zhang L., Pan Z. M. Application of multi-filter in processing of data received from three-dimension digital compass. Modern Electronics Technique. 2008;22:148–150. [Google Scholar]
  • 35.Li M. A., Guo S. D., Yang J. F., Sun Y. J. A novel EEG feature extraction method based on OEMD and CSP algorithm. Journal of Intelligent & Fuzzy Systems. 2016;30(5):2971–2983. doi: 10.3233/ifs-151896. [DOI] [Google Scholar]
  • 36.Blankertz B., Müller K. R., Curio G., et al. The BCI Competition 2003: progress and perspectives in detection and discrimination of EEG single trials. IEEE Transactions on Bio-Medical Engineering. 2004;51(6):100–106. doi: 10.1109/tbme.2004.826692. [DOI] [PubMed] [Google Scholar]
  • 37. BCI competition III, http://www.bbci.de/competition/iii.
  • 38.Meng J. J., Sheng X. J., Yao L., et al. Common spatial spectral pattern for motor imagery tasks in small channel configuration. Chinese Journal of Biomedical Engineering. 2013;32(5):553–561. [Google Scholar]
  • 39.She Q. S., Ma Y. L., Meng M., Luo Z. Multiclass posterior probability twin SVM for motor imagery EEG classification. Computational Intelligence and Neuroscience. 2015;2015:9. doi: 10.1155/2015/251945.251945 [DOI] [PMC free article] [PubMed] [Google Scholar]

Associated Data

This section collects any data citations, data availability statements, or supplementary materials included in this article.

Data Availability Statement

Two previously reported datasets were used to support this study and are available at http://bbci.de/competition/ii/ and http://www.bbci.de/competition/iii. These datasets are cited at relevant places within the text as references [36, 37].


Articles from Computational Intelligence and Neuroscience are provided here courtesy of Wiley

RESOURCES