Abstract
Using single-trial cortical signals calculated by weighted minimum norm solution estimation (WMNE), the present study explored a feature extraction method based on rhythm entropy to classify the scalp electroencephalography (EEG) signals of error response from that of correct response during performing auditory-track tasks in cocktail party environment. The classification rate achieved 89.7% with single-trial (≈700 ms) when using support vector machine(SVM) with the leave-one-out-cross-validation (LOOCV). And high discriminative regions mainly distributed at the medial frontal cortex (MFC), the left supplementary motor area (lSMA) and the right supplementary motor area (rSMA). The mean entropy value for error trials was significantly lower than that for correct trials in the discriminative cortices. By time-varying network analysis, different information flows changed among these discriminative regions with time, i.e. error processing showed a left-bias information flow, and correct processing presented a right-bias information flow. These findings revealed that the rhythm information based on single cortical signals could be well used to describe characteristics of error-related EEG signals and further provided a novel application about auditory attention for brain computer interfaces (BCIs).
Introduction
In everyday life, the flood of sensory information were regulated by attention system into a manageable stream, and attention orienting played a primary role in complex visual environment by finding relevant information and filtering out irrelevant information to bias the target selection and processing1. Typically, two mechanisms were thought to be included in the process: endogenous orienting (goal-driven, top-down), directed the attention to the information related locations in space, and exogenous orienting (stimulus-driven, bottom-up), reflexively triggered by prominent and behaviorally relevant stimuli2. Classic research on attention orienting was involved by the analysis of the cocktail party phenomenon coined by Cherry in 19533.
The cocktail party effect was the phenomenon that people can focus their auditory attention on a stimulus while filtering out other stimuli, similar with a partygoer being able to concentrate on a single conversation in a noisy room, namely, the process reflected the influence of top-down attention. It might also describe a similar phenomenon that occurs when one can immediately detect words of importance originating from unattended stimuli, for instance hearing one’s name in another conversation, which referred to the bottom-up controlled attention4–6.
A lot of researches was conducted to investigate dynamic changes in cortical activity during tracking the dynamic speech stimulus4,5, and the findings suggested that attentional orienting modulated the neural responses to one of speakers’ voices. If a listener successfully tracked one speaker in a multi-speakers’ environment, the neural responses showed highly correlated with the attended speaker4,7–9. And the neural generator of this effect was localized in the left hemisphere10. However, when people were absent-minded, they often failed to keep track of the goals that needed to be noticed; that is, the wrong execution was mostly due to the lack of attention to the target stimulus.
Over the last decade, a lot of researches focused on the theories of attention orienting11. In contrast, little is known about the connection of attention with BCIs. Several recent studies have investigated the impact of attention on BCIs. A typical example was to utilize attentional modulation of steady-state visual evoked potential (SSVEP) to implement an online BCI system, and the results showed the SSVEP amplitude could be enhanced by attention, thus improving the speed and accuracy of the BCI system12. Another work researched SSVEP under strong attention and poor attention of flash conditions, the results indicated that the SSVEP was modulated by attention and the effect of modulations was related to the frequency of flash stimulation13. Further, the simultaneously presented tactile and visual stimuli were used to investigate the influence of attention shift on SSVEP. The significant attention switching was observed within both types of stimuli and between different stimuli. Similarly, the unattended experiments were also investigated in the BCI studies. For instance, the error-related negative (ERN) potential reflected an error-monitoring process of the brain and could be detected in scalp EEG recordings14,15. The ERN arose after an erroneous response and the maximum peak was localized at the medial frontal regions14,16. Recently, researchers found that the ERN potential was used in BCI for adjusting command outputs of BCI systems when subjects observed incorrect outputs from BCI systems, thus facilitating the development of BCI systems with improved accuracy17.
Although BCI studies regarding attention have been conducted, some shortcomings still existed. Firstly, methods, which were used to select feature from the EEG signals, were not related to the cognitive functions. For example, information entropy described the generating rate of new information of nonlinear dynamical systems and it has been shown as an effective measure to select EEG signals features18. However, information entropy ignored the association between EEG activities and subjects’ cognitive states. Consequently, a similar order state of EEG signal sequence could be found among different cognitive states, which hampered the applications in clinical areas19. Secondly, the BCI performance was limited by the measurement manners of EEG. Compared with the intracranial EEG, which was directly recorded from the cortex surface, the scalp EEG could be easily affected by the effect of volume conduction and reference electrodes20–22, causing an imprecise measurement of physiological significance. Although the intracranial EEG described more precise temporal and spatial information than that of the scalp EEG, it was invasive and only feasible for a very limited number of subjects21,23. Thirdly, the regional EEG parameters could not adequately reflect the cognitive process. Multiple brain regions were involved in the cognitive processing and reflected by the EEG activities24. Recently, network analysis methods have attracted wide-spread attention in neuroscience and it proved to be an efficient way to measure the connections between regions in the cognitive functions25,26.
In the present study, the cocktail party experiment paradigm, which was closer to the real environment, was used to research the error-related attention. In order to overcome the drawbacks of previous studies on the scalp EEG, the weight minimum norm (WMN) method was used to estimate cortical activities with single-trial. Then, we adopted a feature extraction method, rhythm entropy, to classify the error-related auditory processing from correct auditory processing based on single cortical signals. Rhythm entropy (RhEn) was developed by combining the information entropy with the power of EEG rhythm, which was an important feature for cognitive research based on spontaneous EEG27. Finally, adaptive directed transfer function (ADTF), one of the most frequently used methods for assessing the dynamic causality relationship among various brain regions28,29, was applied to calculate the time-varying connectivity patterns in the different conditions.
Result
Reaction Time (RT)
As shown in Fig. 1, mean RTs for correct response were shorter than those for error response (paired t-test: t = −7.2, p < 0.05, d = −0.66).
Brain regions with high discriminative power
For visual representation, the cortical spatial distribution was reconstructed according to the R2 value on each dipole. Three brain regions, i.e. medial frontal cortex (MFC), left supplementary motor area (lSMA) and right supplementary motor area (rSMA), exhibited greater correlation than others (Fig. 2A and Table 1), got high discriminative power for identifying error response trials from correct response trials during tracking the cued speaker.
Table 1.
Brain region | L/R | BA | Number of dipoles | Mean R2 | MNI Coordinates | ||
---|---|---|---|---|---|---|---|
x | y | z | |||||
Medial frontal cortex (MFC) | |||||||
Medial superior frontal gyrus | L | BA8 | 6 | 0.360 | −8 | 40 | 54 |
R | BA8 | 7 | 0.350 | 8 | 50 | 48 | |
Middle frontal gyrus | L | N/A | 7 | 0.363 | −25 | 9 | 56 |
R | BA6 | 6 | 0.346 | 44 | 0 | 56 | |
Superior frontal gyrus | L | BA8 | 12 | 0.384 | −14 | 38 | 56 |
R | BA9 | 13 | 0.347 | 22 | 56 | 36 | |
Anterior Cingulum Cortex | L | BA32 | 2 | 0.328 | −4 | 30 | 22 |
Left supplementary motor area (lSMA) | |||||||
Postcentral Gyrus | L | BA4 | 5 | 0.350 | −44 | −16 | 54 |
Precentral Gyrus | L | BA4 | 6 | 0.326 | −38 | −22 | 46 |
Supplementary motor area | L | BA6 | 6 | 0.324 | −10 | 24 | 60 |
Insula | L | N/A | 3 | 0.323 | −34 | −12 | 16 |
Inferior parietal lobule | L | BA2 | 3 | 0.355 | −58 | −26 | 50 |
Right supplementary motor area (rSMA) | |||||||
Precentral Gyrus | R | N/A | 1 | 0.341 | 48 | −8 | 52 |
Postcentral Gyrus | R | BA3 | 5 | 0.341 | 56 | −14 | 54 |
Supplementary motor area | R | BA6 | 5 | 0.325 | 4 | 6 | 56 |
Superior temporal gyrus | R | BA40 | 1 | 0.345 | 66 | −20 | 14 |
Classification accuracy
The SVM with LOOCV achieved an average accuracy of 89.7%(SD ± 3.6%). A good generalization performance of SVM classifier were observed (Table 2 and Fig. 2B), i.e. SP: 89.6% ± 4.9%, SE: 89.8% ± 5.2%, AUC: 94.0% ± 5.2%.
Table 2.
Subject | CA | SP | SE | AUC |
---|---|---|---|---|
1 | 0.893 | 0.857 | 0.929 | 0.952 |
2 | 0.893 | 0.929 | 0.857 | 0.967 |
3 | 0.833 | 0.800 | 0.867 | 0.804 |
4 | 0.889 | 0.833 | 0.944 | 0.965 |
5 | 0.933 | 0.933 | 0.933 | 0.931 |
6 | 0.882 | 0.941 | 0.824 | 0.891 |
7 | 0.921 | 0.895 | 0.947 | 0.922 |
8 | 0.906 | 0.938 | 0.875 | 0.972 |
9 | 0.969 | 0.941 | 0.987 | 0.981 |
10 | 0.933 | 0.892 | 0.975 | 0.998 |
11 | 0.900 | 0.933 | 0.867 | 0.929 |
12 | 0.893 | 0.857 | 0.929 | 0.965 |
13 | 0.886 | 0.909 | 0.864 | 0.934 |
14 | 0.857 | 0.809 | 0.905 | 0.981 |
15 | 0.923 | 0.896 | 0.950 | 0.884 |
16 | 0.821 | 0.857 | 0.786 | 0.996 |
17 | 0.933 | 0.980 | 0.887 | 0.998 |
18 | 0.885 | 0.923 | 0.846 | 0.846 |
19 | 0.923 | 0.932 | 0.914 | 0.953 |
20 | 0.864 | 0.864 | 0.864 | 0.935 |
Mean | 0.897 | 0.896 | 0.898 | 0.940 |
Note: CA represents classification accuracy, SP represents specificity, SE represents sensitivity and AUC represents area under ROC curve.
Relationship between RT and rhythm entropy (RhEn)
RhEn of the MFC induced by error processing (i.e. error trials) was significant positively correlated with RT. RhEn of the lSMA and rSMA induced by error processing were non-significantly correlated with RT, respectively (Fig. 3A). No correlations were observed during correct response between RT and RhEn of each discriminative brain region (Fig. 3B). More detailed information was also shown in Table 3.
Table 3.
Correlation | Err. (RT vs. En.) | Cor. (RT vs. En.) | Err. | Cor. | Paired t-test | |||
---|---|---|---|---|---|---|---|---|
Brain region | r | p | r | p | ME. | ME. | t | p |
MFC | 0.550 | 0.018* | −0.305 | 0.219 | 0.200 | 0.800 | −3.141 | 0.005* |
lSMA | 0.260 | 0.298 | −0.242 | 0.334 | 0.373 | 0.443 | −0.404 | 0.691 |
rSMA | 0.252 | 0.313 | −0.338 | 0.170 | 0.566 | 0.554 | 0.085 | 0.933 |
MFC + lSMA | 0.490 | 0.039* | −0.408 | 0.093 | 0.275 | 0.645 | −2.481 | 0.023* |
MFC + rSMA | 0.435 | 0.071 | −0.411 | 0.090 | 0.322 | 0.718 | −2.647 | 0.016* |
lSMA + rSMA | 0.270 | 0.278 | −0.302 | 0.224 | 0.449 | 0.487 | −0.261 | 0.797 |
MFC + lSMA + rSMA | 0.411 | 0.091 | −0.392 | 0.108 | 0.339 | 0.625 | −2.115 | 0.048* |
Note: Err. represents error response, Cor. represents correct response, En. is the entropy value and ME. represents normalized mean entropy of all subjects.
Information flow between the discriminative regions
To investigate time characteristics of two response types, i.e. correct and error, time-varying networks were conducted. Here, significant discriminative regions, i.e. the medial frontal cortex (MFC), left supplementary motor area (lSMA) and right supplementary motor area (rSMA), were utilized to serve as network nodes. For time series of networks was estimated by cortical signals within three discriminative regions, which calculated by the averaged scalp ERPs (Fig. 4). Results of the time-varying network analysis were illustrated in Fig. 5A and B.
The error time-varying networks showed the left-bias information flows between the lSMA and the MFC (i.e. lSMA → MFC and MFC → lSMA, Fig. 5A). Then, right information flow between rSMA and MFC appeared, while the correct time-varying networks showed the right-bias information flow between the rSMA and the MFC (Fig. 5B).
Relationship between RT and information flow
Information flows between the network nodes changed with time varying. During error processing, the weaker DTF values of information flow between the medial frontal cortex and the left supplementary motor area, i.e. MFC → lSMA and lSMA → MFC, of time-varying networks at 430 ms were related to longer error RT (Fig. 5C), respectively. The pure flow between the MFC and the lSMA showed an information flow from MFC to lSMA, positively being correlated with error RTs. During correct processing, the stronger DTF values from the MFC to the right SMA, i.e. MFC → rSMA and rSMA → MFC, of time-varying networks at 200 ms were related to longer correct RT (Fig. 5D), respectively. The pure flow between the MFC and rSMA showed an information flow from MFC to rSMA, negatively being correlated with correct RTs.
Disscussion
Based on the scalp EEG recording, the present study utilized the RhEn of cortical signal with single-trial, calculated by weighted minimum norm solution estimation (WMNE), to identify two response types (correct vs. error) differences at the cortical level during performing the auditory-tracked tasks in multi-speakers’ environment. We found that: 1) An averaged accuracy achieved 89.7% and discriminative cortex differences mainly distributed in the medial frontal cortex (MFC), the left SMA, and the right SMA; 2) The mean RhEn for error trials was significantly lower than that for correct trials in the discriminative cortices. In addition, the larger RhEn for error trials was related to longer RT. 3) Time-varying networks analysis based on discriminative regions and averaged cortical source waveforms further revealed that error-related networks represented the left-bias information flow and the correct-related networks represented the right bias information flow.
RhEn of discriminative cortices and RT
Our classified method successfully extracted reliable differences between correct and error response with the mean classification rate of 89.7%(with SD: ±3.6%), which was superior to previous related works5,7,30,31 that this method was non-invasive cortical dynamical signal with single trials of short-time duration (~700 ms EEG data).
Previous study found that the medial frontal cortex played a crucial role in producing error-related potential4,32–34. Activation of the MFC reflected error-related processing35. As shown in Fig. 2A, we also found that the discriminative cortical regions between correct processing and error processing being obtained via R2 values, i.e. the relationship between rhythm entropies and class tags, and SVM mainly focused on the medial frontal cortex, revealing that the MFC was an important brain area in monitoring function between the correct and error processing in multi-speaker environment.
Entropy was a powerful tool to quantify complexity in nonlinear dynamics of neural activities. The irregularity and unpredictability of brain activity induced by attentional selection were regarded as neural complexity related to brain functions and information processing between neurons36. Previous studies have found that the lower entropy meant worse behavior performance19,37. We also found that RT for error trials significantly longer than that for correct trials (Fig. 1), indicating that increased uncertainty induced by unsuccessfully focusing attention to the cued speaker. And the mean RhEn significantly lower for error trials in MFC than that for correct trials (Fig. 3C and Table 3), suggesting that if a listener successful focused attention on a cued speaker with correct trials, the integration of segregated neuronal groups and incoming stimuli with ongoing performance induced high complexity of cortical dynamics. While if a listener did not attend a cued speaker (i.e. error trials), the decoupling and isolation of the underlying system from external factors may lead to the lower RhEn values, consistent with the previous theory confirmed by representational mathematical models36,38.
In addition, the larger RhEn values following with longer RT in error trials (Fig. 3A and Table 3) represented the increased irregularity and unpredictability, while there existed a trend that smaller RhEn values were related to the longer RT in correct trials (Fig. 3B and Table 3) indicated that the decreased coupling between internal system and external factors may result in the low complexity. Therefore, the lower RhEn values at the MFC in error trials during auditory processing suggested that the decoupling and segregation between the MFC and the dorsolateral prefrontal cortex, i.e. the left and the right SMA were involved in abnormal attentional control and consequently in wrong cognitive performance as well.
Previous studied found that the posterior central gyrus might be involved in the generation of processes that activated the error related potential39 and left dorsal lateral frontal cortex was selectively active during error trials40. Our findings suggested that the contribution of the left SMA and MFC to high classification rate may be related to wrong responses. And the right SMA may mainly be relevant to correct response types due to previous findings that activation was observed in the right SMA in correct trials35,41.The right-bias information flow may provide an evidence to support the idea during correct response (Fig. 5B).
Information flow via time-varying networks
As described above, converging evidence revealed that left SMA was closely correlated with error related potential39,40. Our results of time-varying networks displayed information flow between the MFC and the left SMA in error trials (Fig. 5A). Previous findings suggested that the error-processing system consisted of a monitoring system for detecting errors and an optimized behavior compensation system42. When perceiving an error existing because of failure to attend the cued speaker, the left SMA firstly sent information to the MFC, and at the same time, a feedback was received from the MFC. In the process of error handing, the MFC acted as a filter to match stimuli (i.e. error or correct feedback) and reactions43, and then information was sent to the right SMA. In correct trials, a right-bias information flow between the MFC and the right SMA was existed (Fig. 5B). During the period from 320 ms to 540 ms, similar time-varying network connectivity patterns were observed in both correct and error trials, implying the underlying coordination of the MFC and bilateral SMA to control cognitive performance.
Moreover, during the error time-varying network at 430 ms (Fig. 5C), the smaller ADTF values of both MFC → lSMA and lSMA → MFC were related to the longer RT (Fig. 5C, left and middle panels). The pure flow showed MFC → lSMA was positively correlated with RT (Fig. 5C, right panel). During the correct time-varying network at 200 ms (Fig. 5D), the bigger ADTF values of both MFC → rSMA and rSMA → MFC were related to the longer RT (Fig. 5D, left and middle panels). The pure flow showed MFC → rSMA was negatively correlated with RT (Fig. 5D, right panel). These findings suggested that the coupling with MFC and lSMA was weaken leading to increased RT in error trials. And the coupling with MFC and rSMA was strengthened resulting in shorter RT in correct trials.
BCI Application
In the present study, cortical activities could provide more precise spatial physiology information compared to scalp EEG21; Compared to traditional entropy methods such as approximate entropy, RhEn has an ability to incorporate individual’s cognitive state27. As shown in Table 2 and Fig. 2B, a short time duration (~700 ms) was enough to distinguish individual’s error states using the present method, which allowed the possibility of near real-time EEG processing. Using time-varying network analysis, time-varying network at 430 ms was related to error processing in wrong response and time-varying network at 200 ms was related to correct processing in right response (Fig. 5), which both were earlier than individual’s RT. These findings suggested a possible role for improving BCIs performance in the future. It was noted that for test samples, the time cost of one trial calculation was about 3 s under the MATLAB platform. Here, the most time-consuming part was the minimum norm solution estimation which taken about 0.9 s for each calculation. In the future BCI application, the efficiency of operation could be improved by using faster programming language and optimized algorithms.
Conclusion
The present study was the first to utilize a feature extraction method, rhythm entropy coming from single cortical EEG signals with short-time duration(~700 ms) which were converted via WMNE from scalp EEG recordings during auditory processing in multi-speakers’ environment, to investigate error processing. Three brain areas, i.e. MFC, left SMA and right SMA, got high classification rate reflecting cortical discriminative source distribution between error processing and correct processing. Time-varying networks further revealed that information flows changed between these brain areas with time, i.e. time-varying network at 430 ms was related to error processing in wrong trials and at 200 ms was related to correct processing in right trials. Taken together, these findings suggested that the reduced cognitive performance on auditory error response was associated with impaired cortical information processing, as indicated by the lower complexity of the EEG.
Material and Methods
Participants
Twenty subjects (mean ± standard deviation (SD) age, 22 ± 3.5 years; all males; right-handed) took part in the experiment. None of them reported any history of hearing impairment or neurological problems. Informed consent was signed prior to the study, and subjects also received a monetary compensation after experiments. All experiments were approved by the ethical committee of Chongqing university of Posts and Telecommunications. All experimental methods were conducted in accordance with the ethical guidelines determined by the National Ministry of Health, Labour and Welfare and the Declaration of Helsinki (BMJ 1991; 302:1194).
Stimuli and Design
The experiment design was similar with the previous study4. A sentence contained in the form [ready “call sign” go to “color” “number” point now]. For example, ready “skylark” go to “blue” “four” point now. Here, 60 unique sentences were combined by two call signs (sparrow or skylark), three colors (red, blue or green) and three numbers (two, five or seven). All sentences were read by using Chinese.
Before the experiment, subjects firstly listened to each of speakers alone and were able to report the color and number with at least 100% accuracy. In the experiment, a fixation cross (0.5° × 0.5°) at the center of the monitor were displayed throughout the entire block of trials. Each trial began with the fixation cross flashing for 50 ms. After a 700 ms delay, a cue was presented for 50 ms. The cue was defined as a call sign that tracked by listeners. After a short (100–300 ms) SOA, two different sentences spoken by a male and a female speaker were simultaneous presented about 2 s: one to the left ear, and the other to the right ear. Subjects were required to attend to one sentence, which the call sign was cued, and responded to the point where the call sign bird would go. The point (i.e. color-number combination) was fixed and shown visually on a monitor during each trial block.
Scalp EEG recording and preprocessing
EEG was recorded using a 64-channel NeuroScan system (Quik-Cap, band pass: 0.05–100 Hz, sampling rate: 1000 Hz, impedances <5kΩ) at the scalp. The EEG analysis procedure was shown in Fig. 6. To monitor ocular movements and eye blinks, EOG signals were simultaneously recorded from four surface electrodes, one pair placed over the higher and lower eyelid and the other pair placed 1 cm lateral to the outer corner of the left and right orbit. Cz was used as the reference during recording online. Then, the EEG recordings were divided into epochs (200 ms pre- to 4500 ms post-stimulus onset). Trials with blinks and eye movement were rejected offline and an artifact criterion of ±75 μV was used at all the other scalp sites to reject trials with excessive electromyography (EMGs) or other noise transients. EEG recordings were filtered with a band-pass of 0.1–30 Hz. The data were re-referenced by reference electrode standardization technique44 (REST, www.neuro.uestc.edu.cn/rest) (Fig. 6A). EEG epoching was then extracted for the time period beginning auditory stimulus offset and lasting until 700 ms after auditory stimulus offset, and performed the next analysis (Fig. 6B). Then, single-trial EEG epochs were sorted according to response types, i.e. correct-related and error-related, and were averaged from each subject to compute the ERPs (Fig. 6C).
Network nodes definition (discriminative pattern)
Network nodes definition involved the following main steps (Fig. 6B): 1) cortical activities estimation; 2) Information Entropy; 3) R-square analysis; 4) SVM and 5) Back projection.
Cortical activities estimation
The conventional source localization procedure, weighted minimum norm estimation (WMNE), was used to estimate the cortical activities. For single-trial EEG epochs, three frequency-bands, i.e. theta (4–8 HZ), alpha (8–13 HZ), beta (13–30 HZ), were separately extracted by the wavelet transformation. Then cortical activities were calculated by applying linear inverse operator W to the three frequency-band signals:
1 |
where x(t) represented the n-channels EEG data at time t and S(t) denoted corresponding cortical activities. W was obtained by:
2 |
Here, C and R referred to covariance matrices of the noise and sources, respectively. A was the gain matrix, calculated via the Brainstorm toolbox (http://neuroimage.usc.edu/brainstorm/), and the regularization parameter, , was calculated by:
3 |
A fixed value of 5 was used for the signal-to-noise ratio (SNR), which reflected the value in the evoked response experiments45.
Here, a 3-shell realistic head model was adopted for EEG source activities estimation, where the conductivities for the cortex, skull, and scalp were 1.0 Ω− 1 m− 1, 1/80 Ω− 1 m− 1, and 1.0 Ω− 1 m− 1respectively. The solution space was restricted to the cortical grey matter, the hippocampus, and other possible source activity areas, consisting of 15002 cubic mesh voxels with 10 mm inter-distance. The lead field matrix was calculated by the boundary element method (BEM)46.
Rhythm Entropy
After acquiring the cortical activities S(t), the power of cortical activities in each trial was then calculated by the following equation:
4 |
where m was the number of sample points and the information entropy was measured as follow:
5 |
where (i = 1, 2, 3) represented the normalized power of poweri and was calculated via divided by the sum of three frequency-band powers, i.e. theta, alpha and beta:
6 |
R-square analysis
R2 analysis was a common criterion of separability in BCIs research47, and often used to indicate correlation between features and class tags:
7 |
where presented feature vector of target and represented feature vector of non-target. was the standard deviation. The R2 value reflected the difference in the power of the two classes, with the larger R2 value denoting the greater difference between two classes37. For determining the threshold for further SVM classification, ten thresholds, i.e. from 0.1 times the maximum R2 value to the maximum R2 value and step length was set to 0.1, were selected to evaluate the performance of SVM classification. The final threshold was set to 0.6 times of the maximum R2 value among all dipoles because most of the subjects got the best classification rate under this threshold in the present study48,49. The dipoles with R2 value exceeding the threshold were chosen for further SVM classification.
Support vector machine (SVM)
In the experiment, the number of correct trials was greater than that of error trials (ACC > 80%, i.e. correct trials > 96, error trials < 24). For each subject, the number of error trials was at least 20 to ensure the training sample size. Correct trials were randomly selected to make it consistent with the amount of error trials.
SVM was developed by Vapnik based on statistics learning theory (SLT). As its excellent generalization performance, SVM has been applied in a wide variety of issues. SVM had the feature of empirical risk minimization (ERM) and global optimum solution50. We trained a SVM classifier with radial basis kernel function to extract highly discriminative brain regions. The goal of a SVM classifier with RBF kernel was to find a decision function by solving the following optimization problem51:
8 |
where w was the normal of the hyperplane; the function mapped the vector in a higher dimensional space52; was a measure of the misclassification errors for non-separable cases; and C traded off the empirical risk and model complexity, and was set by grid search algorithm53. Here, C ranged from 10−8 to 108 and the step length was set to 100.8.
If the SVM classifier could reflect the relationship between features and the class labels very well, the classifier was considered that it could predict the classes of new samples with good performance. Therefore, classification accuracy (CA), sensitivity (SE), specificity (SP) and area under ROC curve (AUC) were utilized to evaluate the classification performance of SVM classifier54. At the same time, leave-one-out cross-validation (LOOCV) was applied to evaluate the generalization performance of SVM for a small sample size.
The percentage of the number of samples predicted correctly in the test set over the total samples, CA, was calculated as follows:
9 |
where true positive (TP) was the number of positive samples correctly predicted and true negative (TN) was the number of negative samples correctly predicted. False positive (FP) denoted the number of positive samples incorrectly predicted and false negative (FN) denoted the number of negative samples incorrectly predicted.
SE and SP were calculated by the following formula, respectively:
10 |
11 |
SE referred to the ratio of correctly classified positive samples to the total population of positive samples, whereas SP was the ratio of correctly classified negative samples to the total population of negative samples.
Back projection
Both R2 analysis and SVM classifier were used to differentiate between correct-response related and error-response related brain regions (i.e. the positions of dipoles), each feature influenced the classification via its R2 value. The larger the R2 value was, the greater it affected the final discrimination. However, the correlational vector R2 was in a dimension-reduced space. To determine the discriminative brain areas, R2 values were mapped back to the high-dimensional space (i.e. dipole space). The correlational vector in the dimension-reduced subspace can be projected back to the original feature space according to the following formula:
12 |
For a given dipole i, the correlational representation was and the identity matrix was Ui in current study. Finally, the correlations were reconverted into the MNI space to obtain the discriminative regions. A threshold was required to determine brain areas that had significantly distinct characteristics between the correct- and error-response trials. For each dipole, a statistically meaningful threshold was derived by using 0.6 times of the maximum R2 value among all dipoles, because most of the subjects got the best classification rate under this threshold in the present study.
Time series estimation
For averaged ERPs at scalp, cortical activities (i.e. cortical ERPs) were estimated via WMNE (details described in the above section “Cortical activities estimation”). The time series in the discriminative brain regions localized in the above session of “network nodes definition” (Fig. 6B), were computed via averaging cortical activities of the dipoles within sources respectively (Fig. 6C).
Time-varying network analysis
After obtaining discriminative sources (Fig. 6B) and time series in discriminative sources (Fig. 6C), time-varying network analysis was performed (Fig. 6D).
ADTF calculation
The multivariate adaptive autoregressive (MVAAR) model of source waves was computed by the following equation:
13 |
where X(t) represented the cortical source wave over the entire time window, w(k, t) was the coefficients matrix of the time-varying model, which calculated by the Kalman filter algorithm, and represented the multivariate independent white noise. The symbol p denoted the MVAAR model order selected by Schwarz Bayesian Criterion28,55.
As mentioned above, the discriminative brain areas as the cortical sources (Fig. 6B) were applied in the time-varying network analysis. After obtaining the MVAAR model coefficient (w(k, t)), H(f, t) was obtained from w(f, t), which was then transformed by Eq. (13) into the frequency domain. The Hij element of H(f, t) described the directional information flow between the jth and the ith element at each time point t as:
14 |
15 |
where , wk was the matrix of the time-varying model coefficients. and were transformed into the frequency domain as X(f,t) and , respectively.
Defining the directed causal interrelation from the jth to the ith element, the normalized ADTF was described between (0, 1) as follows,
16 |
To obtain the total information flow from a single node, the integrated ADTF was calculated as the ratio of summation of ADTF values divided by the interested frequency bands [f1, f2]:
17 |
We chose average ADTF values over 4–30 Hz to acquire the final directional information flow according to the range of three frequency bands.
Surrogate data testing
The distribution of ADTF estimator under the null hypothesis of no causal interactions was not well determined, since the ADTF function had a highly nonlinear correlation with the time series where it derived. In view of this, the phases of the Fourier coefficients were independently and randomly iterated to produce a new surrogate data, which was a nonparametric statistical test28. The spectral structure of the time series was retained in the process of iterating the phases of the Fourier coefficients. The shuffling procedure was repeated 200 times for each model-derived time series of each subject to establish an empirical distribution of the ADTF value under the null hypothesis of no connectivity.
Correlation analysis
We performed Pearson correlation analysis to investigate the following relationships: 1) RT and entropy values of discriminative regions; and 2) RT and ADTF values of information flow between discriminative regions. All thresholds were set at p < 0.05. Here, ADTF value of information flow was adjusted by dividing the weighted degree of network.
Acknowledgements
This research is supported by the National Natural Science Foundation of China (#61671097); the Chongqing Research Program of Basic Science and Frontier Technology (No. cstc2017jcyjBX0007; No. cstc2015jcyjA10024); the Chongqing Key Laboratory Improvement Plan (cstc2014pt-sy40001) and the University Innovation Team Construction Plan Funding Project of Chongqing (CXTDG201602009).
Author Contributions
Conceived, designed the experiments and wrote the manuscript: YT. Performed the experiments, analyzed the data and wrote the first draft: W.X. Contributed reagents/materials/analysis tools: W.X., Y.T. Discussed the experiment design, analyzed the data and discussed the experiment results: Y.T., L.Y., W.X.
Competing Interests
The authors declare no competing interests.
Footnotes
Yin Tian and Wei Xu contributed equally to this work.
Publisher's note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Tian Y, Liang S, Yao D. Attentional orienting and response inhibition: insights from spatial-temporal neuroimaging. Neuroscience Bulletin. 2014;30:141–152. doi: 10.1007/s12264-013-1372-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Tian Y, Chica AB, Xu P, Yao D. Differential consequences of orienting attention in parallel and serial search: An ERP study. Brain Research. 2011;1391:81. doi: 10.1016/j.brainres.2011.03.062. [DOI] [PubMed] [Google Scholar]
- 3.Cherry EC. Some Experiments on the Recognition of Speech, with One and with Two Ears. Journal of the Acoustical Society of America. 1953;25:975–979. doi: 10.1121/1.1907229. [DOI] [Google Scholar]
- 4.Mesgarani N, Chang EF. Selective cortical representation of attended speaker in multi-talker speech perception. Nature. 2012;485:233. doi: 10.1038/nature11020. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.O’Sullivan JA, et al. Attentional Selection in a Cocktail Party Environment Can Be Decoded from Single-Trial EEG. Cerebral Cortex. 2015;25:1697. doi: 10.1093/cercor/bht355. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Marchegiani L, Karadogan SG, Andersen T, Larsen J, Hansen LK. The Role of Top-Down Attention in the Cocktail Party: Revisiting Cherry’s Experiment after Sixty Years. International Conference on Machine Learning and Applications and Workshops. 2011;1:183–188. [Google Scholar]
- 7.Ding N, Simon JZ. Emergence of neural encoding of auditory objects while listening to competing speakers. Proceedings of the National Academy of Sciences of the United States of America. 2012;109:11854. doi: 10.1073/pnas.1205381109. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Ding N, Simon JZ. Neural coding of continuous speech in auditory cortex during monaural and dichotic listening. Journal of Neurophysiology. 2012;107:78. doi: 10.1152/jn.00297.2011. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Zion Golumbic EM, et al. Mechanisms underlying selective neuronal tracking of attended speech at a “cocktail party”. Neuron. 2013;77:980–991. doi: 10.1016/j.neuron.2012.12.037. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Power AJ, Foxe JJ, Forde EJ, Reilly RB, Lalor EC. At what time is the cocktail party? A late locus of selective attention to natural speech. European Journal of Neuroscience. 2012;35:1497–1503. doi: 10.1111/j.1460-9568.2012.08060.x. [DOI] [PubMed] [Google Scholar]
- 11.Corbetta M, Patel G, Shulman GL. The Reorienting System of the Human Brain: From Environment to Theory of Mind. Neuron. 2008;58:306–324. doi: 10.1016/j.neuron.2008.04.017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Zhang D, et al. An independent brain-computer interface using covert non-spatial visual selective attention. Journal of Neural Engineering. 2010;7:16010. doi: 10.1088/1741-2560/7/1/016010. [DOI] [PubMed] [Google Scholar]
- 13.Wu Z, Yao D. The influence of cognitive tasks on different frequencies steady-state visual evoked potentials. Brain Topography. 2007;20:97–104. doi: 10.1007/s10548-007-0035-0. [DOI] [PubMed] [Google Scholar]
- 14.Martin S, Christian N. Error-related potentials during continuous feedback: using EEG to detect errors of different type and severity. Frontiers in Human Neuroscience. 2015;9:155. doi: 10.3389/fnhum.2015.00155. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Gehring WJ, Coles MG, Meyer DE, Donchin E. A brain potential manifestation of error-related processing. Electroencephalography & Clinical Neurophysiology Supplement. 1995;44:261. [PubMed] [Google Scholar]
- 16.Trujillo LT, Allen JJB. Theta EEG dynamics of the error-related negativity - Clinical Neurophysiology. Clinical Neurophysiology. 2007;118:645–668. doi: 10.1016/j.clinph.2006.11.009. [DOI] [PubMed] [Google Scholar]
- 17.Tong J, Lin Q, Xiao R, Ding L. Combining multiple features for error detection and its application in brain-computer interface. Biomedical Engineering Online. 2016;15:1–15. doi: 10.1186/s12938-016-0134-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Wang, F., Lin, J., Wang, W. & Wang, H. EEG-based mental fatigue assessment during driving by using sample entropy and rhythm energy. IEEE International Conference on Cyber Technology in Automation, Control, and Intelligent Systems. 1906–1911(2015).
- 19.Tian, Y. et al. Spectral Entropy Can Predict Changes of Working Memory Performance Reduced by Short-Time Training in the Delayed-Match-to-Sample Task. Frontiers in Human Neuroscience11, 10.3389/fnhum.2017.00437 (2017). [DOI] [PMC free article] [PubMed]
- 20.Tian Y, Yao D. Why do we need to use a zero reference? Reference influences on the ERPs of audiovisual effects. Psychophysiology. 2013;50:1282–1290. doi: 10.1111/psyp.12130. [DOI] [PubMed] [Google Scholar]
- 21.Liu T, et al. Cortical Dynamic Causality Network for Auditory-Motor Tasks. IEEE Transactions on Neural Systems and Rehabilitation Engineering. 2016;99:1–1. doi: 10.1109/TNSRE.2016.2608359. [DOI] [PubMed] [Google Scholar]
- 22.Yao D. A method to standardize a reference of scalp EEG recordings to a point at infinity. Physiological Measurement. 2001;22:693–711. doi: 10.1088/0967-3334/22/4/305. [DOI] [PubMed] [Google Scholar]
- 23.Xu P, et al. Cortical network properties revealed by SSVEP in anesthetized rats. Scientific Reports. 2013;3:2496. doi: 10.1038/srep02496. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Tian, Y. et al. Predictors for drug effects with brain disease: Shed new light from EEG parameters to brain connectomics. European Journal of Pharmaceutical Sciences 110,26–36 (2017). [DOI] [PubMed]
- 25.Tian Y, Ma W, Tian C, Xu P, Yao D. Brain oscillations and electroencephalography scalp networks during tempo perception. Neuroscience Bulletin. 2013;29:731–736. doi: 10.1007/s12264-013-1352-9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Ding JR, et al. Altered functional and structural connectivity networks in psychogenic non-epileptic seizures. Plos One. 2013;8:e63850. doi: 10.1371/journal.pone.0063850. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Singh P, Joshi SD, Patney RK, Saha K. Fourier-Based Feature Extraction for Classification of EEG Signals Using EEG Rhythms. Circuits Systems & Signal Processing. 2016;35:1–16. doi: 10.1007/s00034-015-0047-z. [DOI] [Google Scholar]
- 28.Wilke C, Ding L, He B. Estimation of time-varying connectivity patterns through the use of an adaptive directed transfer function. IEEE transactions on bio-medical engineering. 2008;55:2557–2564. doi: 10.1109/TBME.2008.919885. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Li F, et al. The time-varying networks in P300: a task-evoked EEG study. IEEE Transactions on Neural Systems & Rehabilitation Engineering A Publication of the IEEE Engineering in Medicine & Biology Society. 2016;24:725–733. doi: 10.1109/TNSRE.2016.2523678. [DOI] [PubMed] [Google Scholar]
- 30.Choi I, Rajaram S, Varghese LA, Shinn-Cunningham BG. Quantifying attentional modulation of auditory-evoked cortical responses from single-trial electroencephalography. Frontiers in Human Neuroscience. 2013;7:115. doi: 10.3389/fnhum.2013.00115. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Looney D, Park C, Xia Y, Kidmose P. Towards estimating selective auditory attention from EEG using a novel time-frequency-synchronisation framework. International Joint Conference on Neural Networks. 2010;9:1–5. [Google Scholar]
- 32.Holroyd CB, Coles MG. The neural basis of human error processing: reinforcement learning, dopamine, and the error-related negativity. Psychological Review. 2002;109:679–709. doi: 10.1037/0033-295X.109.4.679. [DOI] [PubMed] [Google Scholar]
- 33.Iannaccone R, et al. Conflict monitoring and error processing: new insights from simultaneous EEG-fMRI. Neuroimage. 2015;105:395–407. doi: 10.1016/j.neuroimage.2014.10.028. [DOI] [PubMed] [Google Scholar]
- 34.Van Veen V, Carter CS. The anterior cingulate as a conflict monitor: fMRI and ERP studies. Physiology & Behavior. 2002;77:477–482. doi: 10.1016/S0031-9384(02)00930-7. [DOI] [PubMed] [Google Scholar]
- 35.KIEHL, et al. Error processing and the rostral anterior cingulate: An event-related fMRI study. Psychophysiology. 2000;37:216–223. doi: 10.1111/1469-8986.3720216. [DOI] [PubMed] [Google Scholar]
- 36.Sohn H, et al. Linear and non-linear EEG analysis of adolescents with attention-deficit/hyperactivity disorder during a cognitive task. Clinical Neurophysiology. 2010;121:1863–1870. doi: 10.1016/j.clinph.2010.04.007. [DOI] [PubMed] [Google Scholar]
- 37.Zhang R, et al. Predicting Inter-session Performance of SMR-Based Brain–Computer Interface Using the Spectral Entropy of Resting-State EEG. Brain Topography. 2015;28:680. doi: 10.1007/s10548-015-0429-3. [DOI] [PubMed] [Google Scholar]
- 38.Pincus SM, Goldberger L, Steven M, Physiologi Physiological time-series analysis: what does regularity quantify? American Journal of Physiology. 1994;266:1643–1656. doi: 10.1152/ajpheart.1994.266.4.H1643. [DOI] [PubMed] [Google Scholar]
- 39.Davidson DJ, Indefrey P. An event-related potential study on changes of violation and error responses during morphosyntactic learning. Journal of Cognitive Neuroscience. 2009;21:433–446. doi: 10.1162/jocn.2008.21031. [DOI] [PubMed] [Google Scholar]
- 40.Carter CS, et al. Anterior cingulate cortex, error detection, and the online monitoring of performance. Science. 1998;280:747. doi: 10.1126/science.280.5364.747. [DOI] [PubMed] [Google Scholar]
- 41.Garavan H, Ross TJ, Kaufman J, Stein EA. A midline dissociation between error-processing and response-conflict monitoring. Neuroimage. 2003;20:1132. doi: 10.1016/S1053-8119(03)00334-3. [DOI] [PubMed] [Google Scholar]
- 42.Coles MG, Scheffers MK, Holroyd CB. Why is there an ERN/Ne on correct trials? Response representations, stimulus-related components, and the theory of error-processing. Biological Psychology. 2001;56:173–189. doi: 10.1016/S0301-0511(01)00076-X. [DOI] [PubMed] [Google Scholar]
- 43.Holroyd CB, Yeung N, Coles MGH, Cohen JD. A Mechanism for Error Detection in Speeded Response Time Tasks. Journal of Experimental Psychology General. 2005;134:163. doi: 10.1037/0096-3445.134.2.163. [DOI] [PubMed] [Google Scholar]
- 44.Yao D, et al. A comparative study of different references for EEG spectral mapping: the issue of the neutral reference and the use of the infinity reference. Physiological Measurement. 2005;26:173. doi: 10.1088/0967-3334/26/3/003. [DOI] [PubMed] [Google Scholar]
- 45.Lin FH, Witzel T, Dale AM, Belliveau JW, Stufflebeam SM. Spectral spatiotemporal imaging of cortical oscillations and interactions in the human brain. Neuroimage. 2004;23:582–595. doi: 10.1016/j.neuroimage.2004.04.027. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Fuchs M, Drenckhahn R, Wischmann HA, Wagner M. An improved boundary element method for realistic volume-conductor modeling. IEEE transactions on bio-medical engineering. 1998;45:980–997. doi: 10.1109/10.704867. [DOI] [PubMed] [Google Scholar]
- 47.Pfurtscheller G, Vaughan ATM, Wolpaw JR, Mcfarland D, Birbaumer N. Brain-computer interfaces for communication and control. Psychophysiology. 2002;43:517–532. doi: 10.1016/s1388-2457(02)00057-3. [DOI] [PubMed] [Google Scholar]
- 48.Hsu HH, Hsieh CW. Feature Selection via Correlation Coefficient Clustering. Journal of Software. 2010;5:1371–1377. [Google Scholar]
- 49.Zhang Y, Xu P, Cheng K, Yao D. Multivariate synchronization index for frequency recognition of SSVEP-based brain-computer interface. Journal of Neuroscience Methods. 2013;221C:32. doi: 10.1016/j.jneumeth.2013.07.018. [DOI] [PubMed] [Google Scholar]
- 50.Netherlands, S. Support Vector Machine (SVM). (Springer Netherlands, 2008).
- 51.Wang L, Shen H, Tang F, Zang Y, Hu D. Combined structural and resting-state functional MRI analysis of sexual dimorphism in the young adult human brain: an MVPA approach. Neuroimage. 2012;61:931–940. doi: 10.1016/j.neuroimage.2012.03.080. [DOI] [PubMed] [Google Scholar]
- 52.Renukadevi, N. T. & Thangaraj, P. Performance Evaluation of SVM – RBF Kernel for Medical Image Classification. Global Journal of Computer Science & Technology (2013).
- 53.Liu C, Yin SQ, Zhang M, Zeng Y, Liu JY. An Improved Grid Search Algorithm for Parameters Optimization on SVM. Applied Mechanics & Materials. 2014;644–650:2216–2219. doi: 10.4028/www.scientific.net/AMM.644-650.2216. [DOI] [Google Scholar]
- 54.Galar M, Fernandez A, Barrenechea E, Bustince H, Herrera F. A Review on Ensembles for the Class Imbalance Problem: Bagging-, Boosting-, and Hybrid-Based Approaches. IEEE Transactions on Systems Man & Cybernetics Part C. 2012;42:463–484. doi: 10.1109/TSMCC.2011.2161285. [DOI] [Google Scholar]
- 55.Schwarz G. Estimating the dimension of a model. The annals of statistics. 1978;6:461–464. doi: 10.1214/aos/1176344136. [DOI] [Google Scholar]