Abstract
The functional near-infrared spectroscopy-based brain-computer interface (fNIRS-BCI) systems recognize patterns in brain signals and generate control commands, thereby enabling individuals with motor disabilities to regain autonomy. In this study hand gripping data is acquired using fNIRS neuroimaging system, preprocessing is performed using nirsLAB and features extraction is performed using deep learning (DL) Algorithms. For feature extraction and classification stack and fft methods are proposed. Convolutional neural networks (CNN), long short-term memory (LSTM), and bidirectional long-short-term memory (Bi-LSTM) are employed to extract features. The stack method classifies these features using a stack model and the fft method enhances features by applying fast Fourier transformation which is followed by classification using a stack model. The proposed methods are applied to fNIRS signals from twenty participants engaged in a two-class hand-gripping motor activity. The classification performance of the proposed methods is compared with conventional CNN, LSTM, and Bi-LSTM algorithms and one another. The proposed fft and stack methods yield 90.11% and 87.00% classification accuracies respectively, which are significantly higher than those achieved by CNN (85.16%), LSTM (79.46%), and Bi-LSTM (81.88%) conventional algorithms. The results show that the proposed stack and fft methods can be effectively used for the classification of the two and three-class problems in fNIRS-BCI applications.
Introduction
In recent years, to meet the growing need for communication, control methods, and advancements in human-computer interaction have given rise to the Brain-computer interface (BCI), which allows its users to control the human body independent of the output pathways from the brain, thereby enabling the control of external devices through the modulation of brain signals [1,2]. BCI systems recognize specific patterns in brain signals to generate control commands, enabling users to interact with prosthetics, computers, and other BCI systems [3]. Recently real-time, flexible, and adaptable BCI systems such as real-time motor imagery-based lower-limb exoskeleton controllers [4], virtual rehabilitation systems [5], remote-controlled assistive devices, and activated robotic devices pose high demands for signal processing and classification techniques. Real-time BCI applications need to deal with the non-stationary and dynamic nature of the brain signals, as well as interference from various sources of noise and artifacts. A typical BCI system is composed of five distinct phases, as depicted in Fig 1. Details of each phase following the literature review are given in this section. The first phase involves collecting signals from the brain using different methods, including invasive, semi-invasive, and non-invasive neuroimaging [6–9]. Available data acquisition modalities for BCI systems are electroencephalography (EEG), functional magnetic resonance imaging (fMRI), positron emission tomography (PET) [10], single photon emission computed tomography (SPECT), and near-infrared spectroscopy (NIRS) [11,12]. These technologies use different physical principles to measure brain activity. EEG is a neuroimaging technique that has limitations when used in BCI applications due to its low spatial resolution, sensitivity to artifacts and noise, and sensitivity to head movements [3,13,14]. fMRI has a high spatial resolution and can produce detailed images of the brain, but it is costly and requires subjects to lie still in a confined space [15]. On the other hand, fNIRS is relatively inexpensive, non-invasive, portable, and can measure both neural activity and blood flow, with less sensitivity to artifacts and noise [13,16–18]. In the second phase, signals are pre-processed to remove physiological artifacts that cause errors in the results [19]. In recent years, numerous techniques have been proposed such as filtering [20], normalization [21,22], and independent component analysis (ICA) [23]. In the third phase, independent and internal features are extracted [17] to overcome the redundancy of signals and for dimensionality reduction. There are several techniques for feature selection, including filter methods based on statistical properties such as variance or correlation, wrapper methods based on predictive performance, and embedded methods which are part of the learning process of a machine learning algorithm. Similarly, specific channels are identified for addressing a particular research question. Common methods for channel identification and selection include principal component analysis (PCA), z-scoring [24], the highest signal-to-noise ratio, t-scoring [25], and clustering [26]. These methods involve identifying groups of channels with similar patterns of activity or with high sensitivity to the neural activity of interest. In the fourth phase, the classification of the signals is performed based on extracted features. Machine learning (ML) algorithms, such as support vector machines (SVM), k-nearest neighbors (k-NN), linear discriminant analysis (LDA), Logistic regression (LR), Naive Bayes (NB), and random forests (RF), have been used [26–29]. Besides ML algorithms, deep learning (DL), such as convolutional neural networks (CNNs), recurrent neural networks (RNNs), long short-term memory (LSTM), and Bi-directions long short-term memory (Bi-LSTM) models have also been applied to neuroimaging signals classification [30,31]. These models are designed to learn hierarchical representations of the data, and they have been shown to achieve superior performance compared to traditional machine learning algorithms in many studies. These models have been employed in categorizing various brain activity patterns that occur during a range of cognitive and motor tasks, as well as in identifying neurological afflictions such as Alzheimer’s disease and Parkinson’s disease. In the last phase of BCI systems, classified signals are used to generate control commands, and to operate external devices [16]. In the neuroimaging field for BCI applications, fNIRS has become a popular technique due to its non-invasive nature, portability, and ability to measure brain activity in real-world settings [32]. In fNIRS, use a modified Beer-Lambert law (MBLL) to measure changes in cortical blood oxy- and deoxy-hemoglobin by employing two or more near-infrared wavelengths within the range of 700–1000 nm [33]. fNIRS’ ability to track changes in concentration of oxy and deoxy hemoglobin signal, has a comparatively improved temporal resolution as compared to fMRI [34] and better spatial resolution as compared to EEG [35], making it more suitable for BCI applications. fNIRS-BCI signals processing and classification techniques have made significant strides yet are facing several challenges. Isolation of relevant information from the noisy biological signal and lack of standardized protocol for data acquisition and preprocessing hinder the comparability of results across similar studies. Besides that, diverse neural patterns across individuals and the adaptability of BCI systems to individual differences demand the development of universally effective algorithms. Available ML classifiers require manual feature extraction, which can be time-consuming and may miss important information. In contrast, DL algorithms can learn complex features from raw data without the need for manual feature engineering [36]. Furthermore, DL algorithms can be trained on large datasets, which can improve the accuracy and generalizability of the analysis [37]. This can be especially beneficial in clinical applications, where accurate diagnosis and treatment depend on reliable and reproducible data analysis. This study aims to enhance the classification accuracy of the fNIRS-BCI application. For this purpose, conventional DL algorithms (CNN, LSTM, and Bi-LSTM) are employed for feature extraction, which are concatenated and then used in two proposed methods to classify the binary classification hand-gripping motor activity. In the first method extracted features from the conventional DL are used to train the proposed stack model and in the second method fast Fourier transformation (fft) technique is applied to the extracted features, then classification is performed using the proposed stack model. To evaluate the effectiveness of the proposed methods, the classification accuracies from the stack and fft methods are compared with conventional CNN, LSTM, and Bi-LSTM algorithms.
Fig 1. Illustration of a typical BCI system.
The five phases of the BCI system are data acquisition, data pre-processing, feature extraction, classification, and application interface.
Materials and methods
Participants
Twenty right-handed (10 male and 10 female) participants with an age range of 25 ± 5 years are selected. All participants are provided with written informed consent before participating in the study. None of the participants are left-handed, have consumed caffeine less than 4 hours before the experiment, and have neurological disorders. The Institutional Review Board of Air University, Islamabad, Pakistan approved this study (approval number: AU/EA/2022/02/011), and data collection is carried out in compliance with the most recent version of the Declaration of Helsinki.
Signal acquisition setup
Following the literature continuous-wave fNIRS imaging system (NIRSport2 acquisition system by NIRx medical technologies) [38] is utilized to acquire hand-gripping motor activity in the brain. The Optode placement shown in Fig 2 is designed to allow for imaging of the motor cortex area of the brain, following the 10–20 standard system, utilizing 8 sources and 8 detectors (16 optodes in total) [37,38]. The optodes are positioned on the motor cortex with a distance of 3 cm between each source-detector pair [39]. This arrangement resulted in the creation of a total of twenty channels (source-detector location), providing a comprehensive representation of the hemodynamic changes in response to the motor task. The sampling frequency was configured to 10.1725 Hz.
Fig 2. Optode placement following 10–20 international system.
Sources are marked with pink circles while detectors are marked with blue circles. The setup included a total of eight sources and eight detectors, spaced 3 centimeters apart from each other, resulting in 20 channels for recording brain signals.
Experimental paradigm
Initiating each experiment, the participants were instructed to take 30 seconds of rest which was then followed by 10 seconds of hand gripping and a subsequent 20 seconds of rest while remaining at rest. At the end of each experiment, a final 30 seconds of rest was provided for baseline correction of the signals. Each participant underwent 10 trials, as depicted in Fig 3. Excluding the initial (30 seconds) and final (30 seconds) resting periods, the total duration of each experiment for each subject was 300 seconds. The experimental paradigm followed is represented in Fig 3.
Fig 3. Experimental paradigm.
Following 30 seconds of rest, each trial involved a 10-second of hand gripping activity, followed by 20 seconds of rest, again followed by 30 seconds of final rest. A total of 10 trials were conducted, resulting in a complete experimental duration of 360 seconds.
Signal preprocessing
According to the literature, preprocessing of motor task (hand-gripping) is conducted using nirsLAB [16,37,38]. The nirsLAB (version: v201904_64bit) is specifically designed to process fNIRS data and offers several functionalities including the elimination of artifacts, calculation of hemodynamic response functions, and creation of brain activity maps. The data acquired from each channel of the fNIRS imaging system is in the form of light intensity from the light detectors placed on the skull surface of the motor cortex area. This light intensity is then converted into changes in the concentration of oxyhemoglobin (Δ ) and deoxyhemoglobin (Δ )[18,40,41]. The Modified Beer-Lambert Law (MBLL)is used to calculate the changes in hemoglobin concentrations through the application of Eq 1:
(1) |
Where,
are the extinction coefficients of HbO and HbR in .
represent the concentration changes of HbO and HbR, respectively, in [ μM ] .
denote the absorbance measured at time t using two different wavelengths and .
d = Differential path length factor.
l = Distance between source and detector (3 cm).
Eq 1 characterizes the hemodynamic response and allows for the determination of the relative changes in oxygenated and deoxygenated hemoglobin levels in response to the motor task. To ensure the validity of the data, a bandpass filter is applied to remove physiological and instrumental noise. The filter parameters are set following the established literature [17], with a low-pass cutoff frequency of 0.2 Hz and a high-pass cutoff frequency of 0.01 Hz. Filtered data is in the form of changes in the concentration of oxyhemoglobin (HbO), deoxyhemoglobin (HbR), and total hemoglobin (HbT). In Fig 4 filtered data from channel 1 of subject 1 is plotted.
Fig 4. Filtered data from Channel 1 of Subject 1, showing HbO, HbR, and HBT concentration changes.
Features extraction and classifications
The classification of filtered fNIRS signal is performed following the proposed methods, graphically represented in Fig 5. Conventional CNN, LSTM, and Bi-LSTM models are used to extract the features that are utilized in the two proposed methods. ∙ In the stack method, the extracted features are utilized as inputs to train a proposed stacked algorithm for classification. ∙ In the fft method, extracted features from conventional DL models are transformed into the frequency domain using fft. The transformed features are then used as inputs to train a proposed stacked algorithm for the classification.
Fig 5. Schematic representation: In the proposed stack method extracted features are used to train the proposed stacked model.
In the proposed fft method extracted features are transformed using fft transformation, and then used to train the proposed stacked model for classification. Classified signals are used to generate commands to control prosthetic hand-gripping control.
Feature extraction
The following is the description of the conventional DL models used for spatial and temporal feature extraction (summarizing critical information about brain activity and enabling effective analysis). Adam optimizer with a 0.0001 learning rate and categorical cross-entropy loss function is used to train these conventional DL algorithms. CNN model incorporates two 1D convolutional layers (64 neurons, 32 batch size, and kernel size 2 × 2), a max-pooling layer (1D pooling with pool size = 2), batch normalization, a flattened layer, dropout (0.2), dense layer (32 neurons, ReLU activation), and an output dense layer (2 neurons, ReLU activation). This composition enables the network to extract intricate patterns from data, as local temporal and dependencies and spatial features from the dropout layer. 32 features are extracted from 1D-CNN. LSTM and Bi-LSTM models are comprised of two layers of LSTM and Bi-LSTM respectively (64 neurons), augmented with auxiliary layers including max-pooling (1D pooling with pool size = 2), batch normalization, flattening, dropout (0.2), dense layer (32 neurons, ReLU activation), and an output dense layer (2 neurons, ReLU activation). LSTM layers effectively capture long-range dependencies as temporal feature, enabling the network to grasp complex relationships in the data. Similarly, the utilization of two Bi-LSTM layers introduces bidirectional context, allowing the network to consider both past and future information for a more comprehensive understanding of the sequence. By this configuration of the layers in LSTM and Bi-LSTM algorithm, 32 features are extracted from each algorithm. The inclusion of auxiliary layers, such as max pooling for dimensionality reduction, batch normalization for stability, and dropout for regularization, collectively enhances the model’s robustness. A total of 96 features extracted from these DL algorithms (32 from each) are used for the classification of hand-gripping activity. Once the features are extracted, these are split into test (20%) and train set (80%) to train and test the classifiers for the classification.
Classification
In the proposed stack method, the stack model with an input layer (input is extracted features from the dropout layers of conventional DL algorithms), followed by two dense layers (64 neurons, ReLU activation in each layer) and an output dense layer (2 neurons, ReLU activation function) are used. In the proposed fft method, extracted features from the conventional DL models are transformed using fft transformation technique and then are classified using the proposed stack model. The stacking approach allows for a comprehensive evaluation of the effectiveness of the extracted features in the classification of hand-gripping motor activity.
Command generation
Acquired fNIRS signals from the brain are classified using the proposed methods and hand-open and hand-close commands generated to control the movement of the prosthetic hand gripping.
Results
In this section, the effectiveness of the proposed methods described earlier is evaluated. The assessment of the proposed methods is in terms of model losses, accuracies, and confusion matrix. Where the model losses, accuracies, and confusion matrix of proposed stack and fft methods are compared with conventional DL algorithms.
Performance in terms of the confusion matrix
The confusion matrix in Fig 6a shows that the CNN model correctly predicts 84 samples of activity out of 88 samples and 85 samples of rest out of 95. Fig 6b shows that the LSTM model correctly predicts 75 samples of activity out of 88 samples and 91 samples of rest out of 95. Fig 6c shows that the Bi-LSTM model correctly predicts 79 samples of activity out of 88 samples and 78 samples of rest out of 95. Fig 6d shows that the stack method correctly predicts 85 samples of activity out of 88 samples and 92 samples of rest out of 95. Fig 6e shows that the fft method correctly predicts 86 samples of activity out of 88 samples and 94 samples of rest out of 95.
Fig 6. Performance evaluation in terms of the confusion matrix.
Proposed stack and proposed fft methods are compared with conventional DL (CNN, LSTM, Bi-LSTM) models’ performance. (a) CNN model confusion matrix. (b) LSTM model confusion matrix. (c) Bi-LSTM model confusion matrix. (d) Stack method confusion matrix. (e) fft method confusion matrix.
In comparison amongst the proposed methods, fft method provides the highest percentage of correct predictions for both activity and rest samples. This demonstrates the effectiveness of transforming the features into the frequency domain and using them to train the proposed stacked model for prediction. A high rate of true prediction depicted in the confusion matrix implies improvement in the precision, recall, and accuracy of the proposed methods for real-time BCI applications.
Performance in terms of model losses
The performance of conventional DL and the proposed stack models is assessed by observing the changes in both the testing and training loss over the course of several training epochs. The testing loss represents the model’s ability to generalize and make accurate predictions on unseen data, while the training loss signifies the model’s ability to fit the training data. These losses are commonly expressed as mean squared error or cross-entropy loss and are calculated after each epoch. In the proposed stack and fft methods, as depicted in Fig 7, the testing and training losses are low compared to the conventional DL models under evaluation. A decrease in value losses during model training has improved the accuracy of the stack model used for fft and stack method.
Fig 7. Training and testing loss history of the proposed stack and proposed fft methods is compared with the conventional DL models.
(a) CNN model value losses during test and train (b) LSTM model value losses during test and train. (c) Bi-LSTM model value losses during test and train. (d) Stack method value losses during test and train (e) fft method value losses during test and train.
Performance in terms of model training and testing accuracy
The performance in terms of model training and testing accuracy over different training epochs represents the percentage of correctly classified samples. In this study, the model testing and training accuracy of the proposed fft and stack methods is compared to that of other conventional DL models under consideration. As shown in Fig 8, the accuracy of the proposed methods improved with an increasing number of epochs, reaching its maximum value after a certain number of training iterations, indicating that in the proposed methods stacking model can effectively learn from the training data.
Fig 8. Training and testing accuracy of the proposed stack and proposed fft methods are compared with conventional DL models.
(a) CNN model value accuracy during test and train (b) LSTM model value accuracy during test and train. (c) Bi-LSTM model value accuracy during test and train. (d) Stack method value accuracy during test and train. (e) fft method value accuracy during test and train.
Performance in terms of accuracy
Classification accuracies of the proposed methods and conventional DL model are compared, in Table 1 subject-wise classification accuracies are given. These table values provide a comprehensive evaluation of the models’ performance and allow for a direct comparison between the different methods. Fig 9 depicts the average accuracy comparison of proposed methods and conventional DL models. Classification accuracy of 90.11 ± 3.65% and 87.00 ± 3.53% for the HbO dataset is achieved using the proposed fft and stack method respectively. Proposed methods perform better in comparison to the classification accuracies of 85.16 ± 7.12%, 79.46 ± 3.48%, and 81.88 ± 5.89% using the conventional CNN, LSTM, and Bi-LSTM algorithms respectively.
Table 1. Subject-wise classification accuracies by using CNN, LSTM, Bi-LSTM, proposed stack, and fft methods for the classification of a 2-class hand-gripping HBO-fNIRS data.
Subject-wise classification accuracies | |||||
---|---|---|---|---|---|
Subjects | fft Method (%) | Stack Method (%) | LSTM Algo. (%) | Bi-LSTM Algo. (%) | CNN Algo. (%) |
Sub 1 | 93.85 | 90.46 | 83.08 | 79.23 | 92.31 |
Sub 2 | 93.85 | 88.92 | 79.23 | 84.62 | 90.38 |
Sub 3 | 86.15 | 84.06 | 77.69 | 76.92 | 85.38 |
Sub 4 | 89.23 | 85.38 | 80.00 | 80.77 | 89.23 |
Sub 5 | 90.77 | 90.77 | 78.46 | 87.69 | 92.46 |
Sub 6 | 93.85 | 88.62 | 73.85 | 86.92 | 93.85 |
Sub 7 | 85.38 | 86.15 | 77.69 | 78.46 | 83.08 |
Sub 8 | 85.38 | 83.54 | 80.77 | 76.92 | 77.69 |
Sub 9 | 96.92 | 93.08 | 83.08 | 91.54 | 87.69 |
Sub 10 | 89.23 | 85.15 | 75.38 | 82.31 | 86.92 |
Sub 11 | 89.23 | 87.69 | 83.08 | 85.38 | 88.77 |
Sub 12 | 91.54 | 88.38 | 80.00 | 90.00 | 72.31 |
Sub 13 | 89.23 | 89.23 | 82.31 | 80.77 | 94.62 |
Sub 14 | 85.38 | 83.08 | 71.54 | 73.85 | 77.69 |
Sub 15 | 85.38 | 84.00 | 80.00 | 80.00 | 84.62 |
Sub 16 | 93.08 | 85.31 | 82.31 | 84.62 | 78.46 |
Sub 17 | 93.85 | 90.00 | 80.00 | 86.15 | 89.23 |
Sub 18 | 87.69 | 82.31 | 81.54 | 72.31 | 86.92 |
Sub 19 | 94.62 | 93.08 | 84.62 | 70.77 | 68.46 |
Sub 20 | 87.69 | 80.85 | 74.62 | 88.46 | 83.08 |
Average | 90.11 ± 3.65 | 87.00 ± 3.53 | 79.46 ± 3.48 | 81.88 ± 5.89 | 85.16 ± 7.12 |
Fig 9. Average classification accuracies for fft and stack methods, LSTM, Bi-LSTM, and CNN algorithms.
F1 score values provide a comprehensive evaluation performance of various models. The proposed fft method demonstrates strong performance, achieving a balance between recall and precision. Overall results in Table 2 underscore the importance of the proposed fft and stack methods.
Table 2. Subject-wise F-1 scores by using CNN, LSTM, Bi-LSTM, proposed stack, and fft methods for the classification of a 2-class hand-gripping HBO-fNIRS data.
Subject-wise F1-Scores | |||||
---|---|---|---|---|---|
Subjects | CNN (%) | LSTM (%) | Bi-LSTM (%) | Stack Method (%) | fft Method (%) |
Sub 1 | 0.9551 | 0.8910 | 0.8782 | 0.9679 | 0.8718 |
Sub 2 | 0.8979 | 0.8315 | 0.8550 | 0.8824 | 0.9050 |
Sub 3 | 0.7458 | 0.8987 | 0.9063 | 0.9193 | 0.9193 |
Sub 4 | 0.8546 | 0.6263 | 0.7451 | 0.7344 | 0.9040 |
Sub 5 | 0.8697 | 0.8916 | 0.9282 | 0.8975 | 0.9345 |
Sub 6 | 0.8712 | 0.7575 | 0.7773 | 0.8195 | 0.9380 |
Sub 7 | 0.6960 | 0.6798 | 0.8460 | 0.7307 | 0.8895 |
Sub 8 | 0.7357 | 0.6200 | 0.8851 | 0.8473 | 0.9284 |
Sub 9 | 0.8016 | 0.8805 | 0.8675 | 0.8454 | 0.9129 |
Sub 10 | 0.8210 | 0.8743 | 0.9222 | 0.9403 | 0.9863 |
Sub 11 | 0.6667 | 0.8354 | 0.9208 | 0.9104 | 0.8708 |
Sub 12 | 0.7492 | 0.7522 | 0.8777 | 0.8471 | 0.9349 |
Sub 13 | 0.6737 | 0.7347 | 0.8704 | 0.8473 | 0.8147 |
Sub 14 | 0.8758 | 0.7742 | 0.9149 | 0.8665 | 0.9795 |
Sub 15 | 0.6083 | 0.8271 | 0.8125 | 0.8979 | 0.8750 |
Sub 16 | 0.7227 | 0.8503 | 0.7771 | 0.6938 | 0.8201 |
Sub 17 | 0.7460 | 0.8789 | 0.8855 | 0.8037 | 0.9694 |
Sub 18 | 0.6273 | 0.7112 | 0.7565 | 0.8284 | 0.8781 |
Sub 19 | 0.7063 | 0.8721 | 0.9527 | 0.8797 | 0.9532 |
Sub 20 | 0.5000 | 0.7892 | 0.9126 | 0.9297 | 0.9698 |
Average | 0.7562 | 0.7988 | 0.8646 | 0.8545 | 0.9128 |
The proposed methods demonstrate better performance for HbO datasets, achieving significantly higher average classification accuracies as compared to conventional DL models. This superiority is reflected in the results of a statistical comparison, which indicates a p-value of less than 0.0125 (Bonferroni correction applied). This is further demonstrated by the low p-value obtained in the comparison between the proposed fft, and proposed stack, DL algorithms, which is less than 0.0125, demonstrated in Table 3.
Table 3. Statistical significance of the proposed method.
Bonferroni correction applied (p < 0.0125) | ||||
---|---|---|---|---|
Proposed stack model | CNN | LSTM | Bi-LSTM | |
Proposed fft vs. | 0 . 008426795 |
Validation of the proposed method
For validation purposes an open-access fNIRS dataset [42] is used, thirty participants engaged in three distinct motor movements (left and right-hand unilateral complex finger-tapping, and foot-tapping) repeated 25 times each. The initial classification accuracy achieved on this dataset was 70.4 ± 18.4% using a traditional support vector machine (SVM) classifier and leave-one-out cross-validation. However, the proposed fft method has demonstrated a remarkable enhancement, achieving an impressive average accuracy of 89% across the three-class problem. This improvement signifies a notable advancement beyond the earlier 80% accuracy mentioned in the original paper. This improvement in the accuracy using the fft method for the classification of the three-class problem as compared to the mentioned accuracy in original research on this data set demonstrates that fft and stack methods can provide a significant benefit to boosting discriminative power for complex fNIRS data through better classification performance.
Discussion
The research community has been consistently making efforts toward enhancing the classification accuracy of state-of-the-art fNIRS-BCI systems. To achieve this goal, various methodologies have been explored including, high special and temporal resolution neuroimaging modalities for data acquisition [33], the use of advanced signal processing techniques, such as ICA and wavelet transforms [32,43,44], general linear models [45], optimal feature selection techniques [17], [46], optimize feature selection using hybrid genetic algorithm-support vector machine (GA-SVM) [47], and optimal classification techniques [48]. To improve the performance of BCI applications use of DL algorithms [49], hybrid models that combine traditional ML algorithms with DL [16], and the integration of task-related physiological signals [50] are also highlighted. These efforts reflect a concentrated attempt to improve the performance of fNIRS-BCI systems, with the ultimate aim of providing reliable and accurate information about brain activities in real-world applications. Following the literature, a new approach is introduced to enhance the classification accuracy in fNIRS-BCI problems. The proposed methods involve feature extraction from conventional DL models and classification of these features using proposed stack and fft methods. The stack method is to train the proposed stacking model on these features from DL algorithms and in the fft method, these features are modified using fft transformation and then are used to train the stacking model. The proposed methods result in improved classification accuracy of the motor activity. The effectiveness of the proposed methods is evaluated by comparison with the traditional DL algorithm’s performance. A similar gait analysis study performed using statistical features selection, and ML algorithms for classification using the different combinations of features has gained a maximum accuracy of up to 86.7% [51]. An artificial neural network (ANN) achieved 78% accuracy in classifying six-arm actions using features computed from IIR-filtered data samples [52]. The study’s findings [16], show that DL models achieved a maximum accuracy of 88.5% for classifying the two-class problem, whereas the proposed methods in this study have achieved an accuracy of 90.11 ± 3.65% and 87.00 ± 3.53% using fft and stack method respectively. This study offers a new perspective in the field of fNIRS-BCI by merging the strengths of features from DL, fft transformation of these features, and the stack model. The better performance of the proposed fft method is because of the fft transformation of extracted features from the DL models and proposed stack model. This can be particularly useful in fNIRS studies where changes in brain activity are associated with specific frequencies, such as in the case of cortical oscillations [53]. By applying the fft transformation to the fNIRS extracted features from conventional DL, the time-domain signals can be transformed into the frequency domain, allowing for the identification of specific frequency components that may be relevant to the analysis. These frequency components are then used as features in the subsequent stacking process, potentially improving the classification or prediction performance of the model. In addition, using the fft reduces the dimensionality of the feature space by extracting only the relevant frequency components. This makes the subsequent stacking process more computationally efficient and reduces the risk of overfitting the model. As seen in Fig 9, the proposed methods exhibit higher accuracy of classification when compared with conventional DL models for the oxyhemoglobin dataset. This highlights the effectiveness of the proposed methods and concatenation of features from conventional DL models. The proposed fft and stacking methods have limitations such as loss of temporal information due to fft transformation, reduced interpretability due to stacking, and risk of overfitting. It may also be redundant if DL features already contain frequency information. The method was evaluated on two and three-class fNIRS datasets and used to control the hand-open and hand-close of the prosthetic hand-gripping control.
Conclusion
This study aims to enhance the classification accuracy of an fNIRS-BCI system through the implementation of fft transformation of features from DL algorithms and the use of a stack model. The proposed fft method applies fft transformation of the features from CNN, LSTM, and Bi-LSTM achieving an accuracy of 90.11 ± 3.65%, which is higher than conventional DL algorithms (CNN, LSTM, and Bi-LSTM). The average classification accuracies achieved by using the proposed fft method is significantly (p < 0.0125) higher than the LSTM, Bi-LSTM, and CNN algorithms. The results confirm the enhanced performance of the fft and stack methods over conventional LSTM, Bi-LSTM, and CNN algorithms in terms of accuracy, computational power, and precision. These methods potentially could advance the fNIRS-BCI systems application in cognitive monitoring and assistive technologies and could precise BCI-driven therapeutic and control devices. Despite promising results from fft and stack methods, there are limitations of fft method, as transformation during fft method loses some of the temporal information in brain signals and for large data sets stack model could face scalability challenges which increase computational cost. In the future integration of additional frequency-domain features and model optimizations to minimize computational load, could aid to use of these methods for real-time applications. Besides that these methods could be tested in the future for a wider range of motor and cognitive tasks.
Acknowledgments
The authors express their gratitude to the National Centre of Robotics and Automation (NCRA), Rawalpindi, Pakistan, for furnishing the essential aid required to carry out this research. The authors would also like to express their appreciation to the volunteers who dedicated their time and cooperation to collecting data.
Data Availability
Ms. Maria Malik (maria.malik@au.edu.pk) is point of contact for data correspondence from Air University. The dataset is available to her all the time in a dedicated repository/dedicated computer. All the dataset are recorded and stored in her system for backup.
Funding Statement
This study was supported by Korea Institute for Advancement of Technology(KIAT) grant funded by the Korea Government(MOTIE) in the form of a grant awarded to S. Y. Ko (RS-2024-00406796, HRD Program for Industrial Innovation). The specific roles of this author are articulated in the ‘author contributions’ section. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
References
- 1.Bhuvaneshwari M, Kanaga EGM, Anitha J, Raimond K, George ST. A comprehensive review on deep learning techniques for a BCI-based communication system. Demystifying big data, machine learning, and deep learning for healthcare analytics. 2021. p. 131–157. doi: 10.1016/b978-0-12-821633-0.00013-1 [DOI]
- 2.Kundu S, Ari S. Brain-computer interface speller system for alternative communication: a review. IRBM 2022;43(4):317–24. doi: 10.1016/j.irbm.2021.07.001 [DOI] [Google Scholar]
- 3.Ferreira A, Celeste WC, Cheein FA, Bastos-Filho TF, Sarcinelli-Filho M, Carelli R. Human-machine interfaces based on EMG and EEG applied to robotic systems. J Neuroeng Rehabil. 2008;5:10. doi: 10.1186/1743-0003-5-10 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 4.Choi J, Kim KT, Jeong JH, Kim L, Lee SJ, Kim H. Developing a motor imagery-based real-time asynchronous hybrid bci controller for a lower-limb exoskeleton. Sensors (Basel) 2020;20(24):7309. doi: 10.3390/s20247309 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Lazcano-Herrera AG, Fuentes-Aguilar RQ, Chairez I, Alonso-Valerdi LM, Gonzalez-Mendoza M, Alfaro-Ponce M. Review on BCI virtual rehabilitation and remote technology based on EEG for assistive devices. Appl Sci 2022;12(23):12253. doi: 10.3390/app122312253 [DOI] [Google Scholar]
- 6.Min B-K, Marzelli MJ, Yoo S-S. Neuroimaging-based approaches in the brain-computer interface. Trends Biotechnol 2010;28(11):552–60. doi: 10.1016/j.tibtech.2010.08.002 [DOI] [PubMed] [Google Scholar]
- 7.Alharbi H. Identifying thematics in a brain-computer interface research. Comput Intell Neurosci. 2023;2023:2793211. doi: 10.1155/2023/2793211 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 8.Hassani K, Lee W-S. An experimental study on semi-invasive acupuncture-based EEG signal acquisition. In: The 3rd international winter conference on brain-computer interface. 2015. p. 1–4. doi: 10.1109/iww-bci.2015.7073048 [DOI]
- 9.Sun Y, Shen A, Sun J, Du C, Chen X, Wang Y, et al. Minimally invasive local-skull electrophysiological modification with piezoelectric drill. IEEE Trans Neural Syst Rehabil Eng. 2022;30:2042–51. doi: 10.1109/TNSRE.2022.3192543 [DOI] [PubMed] [Google Scholar]
- 10.Nasrallah I, Dubroff J. An overview of PET neuroimaging. Semin Nucl Med 2013;43(6):449–61. doi: 10.1053/j.semnuclmed.2013.06.003 [DOI] [PubMed] [Google Scholar]
- 11.Kaimal G, Ayaz H, Herres J, Dieterich-Hartwell R, Makwana B, Kaiser DH, et al. Functional near-infrared spectroscopy assessment of reward perception based on visual self-expression: Coloring, doodling, and free drawing. Arts Psychother. 2017;55:85–92. doi: 10.1016/j.aip.2017.05.004 [DOI] [Google Scholar]
- 12.Naseer N, Hong K-S. Classification of functional near-infrared spectroscopy signals corresponding to the right- and left-wrist motor imagery for development of a brain-computer interface. Neurosci Lett. 2013;553:84–9. doi: 10.1016/j.neulet.2013.08.021 [DOI] [PubMed] [Google Scholar]
- 13.Khan H, Naseer N, Yazidi A, Eide PK, Hassan HW, Mirtaheri P. Analysis of human gait using hybrid EEG-fNIRS-based BCI system: a review. Front Hum Neurosci. 2021;14:613254. doi: 10.3389/fnhum.2020.613254 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 14.Ma W, Wang C, Sun X, Lin X, Niu L, Wang Y. MBGA-Net: A multi-branch graph adaptive network for individualized motor imagery EEG classification. Comput Methods Programs Biomed. 2023;240:107641. doi: 10.1016/j.cmpb.2023.107641 [DOI] [PubMed] [Google Scholar]
- 15.Jack CR Jr, Bernstein MA, Fox NC, Thompson P, Alexander G, Harvey D, et al. The Alzheimer’s disease neuroimaging initiative (ADNI): MRI methods. J Magn Reson Imaging 2008;27(4):685–91. doi: 10.1002/jmri.21049 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Hamid H, Naseer N, Nazeer H, Khan MJ, Khan RA, Shahbaz Khan U. Analyzing classification performance of fNIRS-BCI for gait rehabilitation using deep neural networks. Sensors (Basel) 2022;22(5):1932. doi: 10.3390/s22051932 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 17.Nazeer H, Naseer N, Khan RA, Noori FM, Qureshi NK, Khan US, et al. Enhancing classification accuracy of fNIRS-BCI using features acquired from vector-based phase analysis. J Neural Eng 2020;17(5):056025. doi: 10.1088/1741-2552/abb417 [DOI] [PubMed] [Google Scholar]
- 18.Jawad Khan M, Hong K-S, Naseer N, Bhutta MR. Drowsiness detection in dorsolateral-prefrontal cortex using fNIRS for a passive-BCI. In: 2015 15th international conference on control, automation and systems (ICCAS). 2015. p. 1811–1816. doi: 10.1109/iccas.2015.7364653 [DOI]
- 19.Zhang F, Cheong D, Khan AF, Chen Y, Ding L, Yuan H. Correcting physiological noise in whole-head functional near-infrared spectroscopy. J Neurosci Methods. 2021;360:109262. doi: 10.1016/j.jneumeth.2021.109262 [DOI] [PubMed] [Google Scholar]
- 20.Patashov D, Menahem Y, Gurevitch G, Kameda Y, Goldstein D, Balberg M. fNIRS: Non-stationary preprocessing methods. Biomed Signal Process Control. 2023;79:104110. doi: 10.1016/j.bspc.2022.104110 [DOI] [Google Scholar]
- 21.Sun J, Qin W, Jin L, Dong M, Yang X, Zhu Y, et al. Impact of global normalization in FMRI acupuncture studies. Evid Based Complement Alternat Med. 2012;2012:467061. doi: 10.1155/2012/467061 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Wu CW, Chen C-L, Liu P-Y, Chao Y-P, Biswal BB, Lin C-P. Empirical evaluations of slice-timing, smoothing, and normalization effects in seed-based, resting-state functional magnetic resonance imaging analyses. Brain Connect 2011;1(5):401–10. doi: 10.1089/brain.2011.0018 [DOI] [PubMed] [Google Scholar]
- 23.Eichele T, Calhoun VD, Debener S. Mining EEG-fMRI using independent component analysis. Int J Psychophysiol 2009;73(1):53–61. doi: 10.1016/j.ijpsycho.2008.12.018 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Nazeer H, Naseer N, Mehboob A, Khan MJ, Khan RA, Khan US, et al. Enhancing classification performance of fNIRS-BCI by identifying cortically active channels using the z-score method. Sensors (Basel) 2020;20(23):6995. doi: 10.3390/s20236995 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Zotev V, Phillips R, Yuan H, Misaki M, Bodurka J. Self-regulation of human brain activity using simultaneous real-time fMRI and EEG neurofeedback. Neuroimage. 2014;85:985–95. doi: 10.1016/j.neuroimage.2013.04.126 [DOI] [PubMed] [Google Scholar]
- 26.Himberg J, Hyvärinen A, Esposito F. Validating the independent components of neuroimaging time series via clustering and visualization. Neuroimage 2004;22(3):1214–22. doi: 10.1016/j.neuroimage.2004.03.027 [DOI] [PubMed] [Google Scholar]
- 27.Khalil MA, Ramirez M, Can J, George K. Implementation of machine learning in BCI based lie detection. In: 2022 IEEE World AI IoT congress (AIIoT). 2022. p. 213–217. doi: 10.1109/aiiot54504.2022.9817162 [DOI]
- 28.Abu Farha N, Al-Shargie F, Tariq U, Al-Nashash H. Improved cognitive vigilance assessment after artifact reduction with wavelet independent component analysis. Sensors (Basel) 2022;22(8):3051. doi: 10.3390/s22083051 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Moaveninejad S, D’Onofrio V, Tecchio F, Ferracuti F, Iarlori S, Monteriù A, et al. Fractal Dimension as a discriminative feature for high accuracy classification in motor imagery EEG-based brain-computer interface. Comput Methods Programs Biomed. 2024;244:107944. doi: 10.1016/j.cmpb.2023.107944 [DOI] [PubMed] [Google Scholar]
- 30.Ahmed S, Saif AFMS, Hanif MI, Shakil MMN, Jaman MM, Haque MMU, et al. Att-BiL-SL: attention-based Bi-LSTM and sequential LSTM for describing video in the textual formation. Appl Sci 2021;12(1):317. doi: 10.3390/app12010317 [DOI] [Google Scholar]
- 31.Wang Z, Zhang J, Zhang X, Chen P, Wang B. Transformer model for functional near-infrared spectroscopy classification. IEEE J Biomed Health Inform 2022;26(6):2559–69. doi: 10.1109/JBHI.2022.3140531 [DOI] [PubMed] [Google Scholar]
- 32.Jahani S, Setarehdan SK. Improved estimation of hemodynamic response in fNIRS using protocol constraint and wavelet transform decomposition based adaptive algorithm. 2020 [Google Scholar]
- 33.Chen X, Song X, Chen L, An X, Ming D. Performance improvement for detecting brain function using fNIRS: a multi-distance probe configuration with PPL method. Front Hum Neurosci. 2020;14:569508. doi: 10.3389/fnhum.2020.569508 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 34.Cui X, Bray S, Bryant DM, Glover GH, Reiss AL. A quantitative comparison of NIRS and fMRI across multiple cognitive tasks. Neuroimage 2011;54(4):2808–21. doi: 10.1016/j.neuroimage.2010.10.069 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 35.Anwar AR, Muthalib M, Perrey S, Galka A, Granert O, Wolff S, et al. Effective connectivity of cortical sensorimotor networks during finger movement tasks: a simultaneous fNIRS, fMRI, EEG study. Brain Topogr 2016;29(5):645–60. doi: 10.1007/s10548-016-0507-1 [DOI] [PubMed] [Google Scholar]
- 36.He Q, Feng L, Jiang G, Xie P. Multimodal multitask neural network for motor imagery classification with EEG and fNIRS signals. IEEE Sensors J 2022;22(21):20695–706. doi: 10.1109/jsen.2022.3205956 [DOI] [Google Scholar]
- 37.Akhter J, Naseer N, Nazeer H, Khan H, Mirtaheri P. Enhancing classification accuracy with integrated contextual gate network: deep learning approach for functional near-infrared spectroscopy brain-computer interface application. Sensors (Basel) 2024;24(10):3040. doi: 10.3390/s24103040 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Shabbir Minhas H, Nazeer H, Naseer N, Shahbaz Khan U, Ansari AR, Nawaz R. Enhancing classification accuracy of fNIRS-BCI for gait rehabilitation. IEEE Access. 2024;12:117944–54. doi: 10.1109/access.2024.3443066 [DOI] [Google Scholar]
- 39.Okada E, Firbank M, Schweiger M, Arridge SR, Cope M, Delpy DT. Theoretical and experimental investigation of near-infrared light propagation in a model of the adult head. Appl Opt 1997;36(1):21–31. doi: 10.1364/ao.36.000021 [DOI] [PubMed] [Google Scholar]
- 40.Naseer N, Hong MJ, Hong K-S. Online binary decision decoding using functional near-infrared spectroscopy for the development of brain-computer interface. Exp Brain Res 2014;232(2):555–64. doi: 10.1007/s00221-013-3764-1 [DOI] [PubMed] [Google Scholar]
- 41.Naseer N, Hong K-S. Classification of functional near-infrared spectroscopy signals corresponding to the right- and left-wrist motor imagery for development of a brain-computer interface. Neurosci Lett. 2013;553:84–9. doi: 10.1016/j.neulet.2013.08.021 [DOI] [PubMed] [Google Scholar]
- 42.Bak S, Park J, Shin J, Jeong J. Open-access fNIRS dataset for classification of unilateral finger- and foot-tapping. Electronics 2019;8(12):1486. doi: 10.3390/electronics8121486 [DOI] [Google Scholar]
- 43.Zhao Y, Sun P-P, Tan F-L, Hou X, Zhu C-Z. NIRS-ICA: A MATLAB toolbox for independent component analysis applied in fNIRS studies. Front Neuroinform. 2021;15:683735. doi: 10.3389/fninf.2021.683735 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 44.Duan L, Zhao Z, Lin Y, Wu X, Luo Y, Xu P. Wavelet-based method for removing global physiological noise in functional near-infrared spectroscopy. Biomed Opt Express 2018;9(8):3805–20. doi: 10.1364/BOE.9.003805 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 45.Li R, Yang D, Fang F, Hong K-S, Reiss AL, Zhang Y. Concurrent fNIRS and EEG for brain function investigation: a systematic, methodology-focused review. Sensors (Basel) 2022;22(15):5865. doi: 10.3390/s22155865 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 46.Goshvarpour A, Goshvarpour A. Matching pursuit-based analysis of fNIRS in combination with cascade PCA and reliefF for mental task recognition. Expert Syst Appl. 2023;213:119283. doi: 10.1016/j.eswa.2022.119283 [DOI] [Google Scholar]
- 47.Noori FM, Naseer N, Qureshi NK, Nazeer H, Khan RA. Optimal feature selection from fNIRS signals using genetic algorithms for BCI. Neurosci Lett. 2017;647:61–6. doi: 10.1016/j.neulet.2017.03.013 [DOI] [PubMed] [Google Scholar]
- 48.Qureshi NK, Naseer N, Noori FM, Nazeer H, Khan RA, Saleem S. Enhancing classification performance of functional near-infrared spectroscopy- brain-computer interface using adaptive estimation of general linear model coefficients. Front Neurorobot. 2017;11:33. doi: 10.3389/fnbot.2017.00033 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 49.Eastmond C, Subedi A, De S, Intes X. Deep learning in fNIRS: A review. Neurophotonics 2022;9(4):041411. doi: 10.1117/1.NPh.9.4.041411 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 50.Seidel-Marzi O, Hähner S, Ragert P, Carius D. Task-related hemodynamic response alterations during slacklining: an fNIRS study in advanced slackliners. Front Neuroergon. 2021;2:644490. doi: 10.3389/fnrgo.2021.644490 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 51.Khan RA, Naseer N, Qureshi NK, Noori FM, Nazeer H, Khan MU. fNIRS-based Neurorobotic Interface for gait rehabilitation. J Neuroeng Rehabil 2018;15(1):7. doi: 10.1186/s12984-018-0346-2 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 52.Sattar NY, Kausar Z, Usama SA, Farooq U, Shah MF, Muhammad S, et al. fNIRS-based upper limb motion intention recognition using an artificial neural network for transhumeral amputees. Sensors (Basel) 2022;22(3):726. doi: 10.3390/s22030726 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Re R, Contini L, Contini D, Orihuela-Espina F, Torricelli AM, Spinelli L. Cerebral resting state oscillations study with TD fNIRS. Diffuse optical spectroscopy and imaging IX. 2023. p. 17. doi: 10.1117/12.2668895 [DOI]