Abstract
The surface Electromyography (sEMG) signal contains information about movement intention generated by the human brain, and it is the most intuitive and common solution to control robots, orthotics, prosthetics and rehabilitation equipment. In recent years, gesture decoding based on sEMG signals has received a lot of research attention. In this paper, the effects of muscle fatigue, forearm angle and acquisition time on the accuracy of gesture decoding were researched. Taking 11 static gestures as samples, four specific muscles (i.e., superficial flexor digitorum (SFD), flexor carpi ulnaris (FCU), extensor carpi radialis longus (ECRL) and finger extensor (FE)) were selected to sample sEMG signals. Root Mean Square (RMS), Waveform Length (WL), Zero Crossing (ZC) and Slope Sign Change (SSC) were chosen as signal eigenvalues; Linear Discriminant Analysis (LDA) and Probabilistic Neural Network (PNN) were used to construct classification models, and finally, the decoding accuracies of the classification models were obtained under different influencing elements. The experimental results showed that the decoding accuracy of the classification model decreased by an average of 7%, 10%, and 13% considering muscle fatigue, forearm angle and acquisition time, respectively. Furthermore, the acquisition time had the biggest impact on decoding accuracy, with a maximum reduction of nearly 20%.
Keywords: surface electromyography, machine learning, gesture decoding, muscle fatigue, forearm angle, acquisition time
1. Introduction
Over the past few decades, the field of Human Machine Interfaces (HMI) has attracted increasing interest due to its intuitive applications in the medical field. Researchers have explored signals on humans, including electroencephalography (EEG), electrocorticography (ECoG), mechanomyography (MMG) and surface electromyography (sEMG). The EEG signal has considerable practical value due to its non-invasiveness, but its signal-to-noise ratio (SNR) is low and susceptible to external interference [1]. The ECoG signal is an invasive signal, for which electrodes need to be implanted into the cerebral cortex. It has limited access to nerve information and can even cause consistent harm to the human body [2]. Compared to the previous two types of signals, MMG has the benefit of being unaffected by skin surface impedance or electrode displacement. However, its limitations include poor SNR and sensitivity to external noise [3]. Therefore, sEMG was chosen as the acquisition signal in this study.
The sEMG signal reflects the nerve activity state and is related to limb movement. During limb movement, the corresponding neural information can be obtained by back pushing sEMG signal, which has the advantages of non-invasive acquisition and bionics [4,5]. Therefore, in the past few decades, the sEMG signal has become the most popular choice for developing intuitive human-machine interfaces [6] and has been widely used in medical applications, virtual reality interfaces, nerve rehabilitation, prosthetic control, etc. [7,8,9].
The form of human muscle contraction can be divided into two categories: static and dynamic contraction. During static contraction, the length of muscle fibers does not change, and the joints do not move, but muscle fibers remain at the state of contraction [5]. Conversely, during dynamic contraction, the length of muscle fibers changes, and the joints continue to move. Therefore, the research field of motion decoding based on the sEMG signal can also be roughly divided into two categories. The first is to research the discrete motion which corresponds to static contraction of muscles through sEMG signal, such as keeping hands still or making the peace sign [10,11]. The second is to use the sEMG signal to predict the continuous motion changes of the joint which corresponds to the dynamic contraction of muscles, such as changes of joint torque and joint angle [12,13,14].
With the exploration of continuous movement still in its infancy, there is a lot of potential of advance in the future [15]. Therefore, classification accuracy improvement and prediction time reduction in gesture decoding remain the most researched issues in the sEMG signal field [16,17]. This paper explored only the decoding of discrete movements of limbs through sEMG signal. Discrete motion classification is currently the most mature and fruitful method in the field of human action decoding, based on sEMG.
The representative papers in recent years are as follows: Min et al. [18] proposed a cross individual gesture decoding method based on Long Short-Term Memory network (LSTM)—Cross individual dual network structure (CI-LSTM) in 2021. Compared with other algorithm models, the decoding accuracy of the model was improved by 9.15% on average. Wang et al. [19] used a genetic algorithm to optimize the number of signal channels and concluded that using 11 of the 16 channels can achieve 97% of the best performance of gesture decoding. Additionally, placing the electrodes in the middle of the forearm, rather than in the proximal forearm, can result in better performance. Ulysse et al. [20] applied deep learning approach to the field of gesture decoding and proposed a new migration learning scheme using convolutional neural networks, which achieved 98.31% offline decoding accuracy for 7 gestures of more than 17 participants and 68.98% offline decoding accuracy for 18 gestures of more than 10 participants. Anany et al. [12] researched the continuous decoding of forearm motion in 2019 and discovered that subject specific, hand specific and object specific decoding models offer better decoding accuracy than generic models. Additionally, Adewuyi et al. [21] analyzed the contribution of internal and external hand muscles to finger motion classification. The research showed that combining internal muscles’ sEMG data and wrist motions can significantly improve the robustness of gesture decoding.
Although the above research works have achieved promising results, most of those training and testing data were mixed in one or several days. Moreover, most of them focused on the design of appropriate channel number, feature set and classification model. However, human limb movement is a joint movement of muscles and bones controlled by the nervous system. Studies [22] have shown that different individuals have different habitual exercise patterns, and even the same person has different models of motions under different external, physical and psychological conditions. In addition, the time and frequency features of the sEMG change with the thickness and temperature of the skin, thickness of the fat between the muscle and the skin, velocity of the blood flow and location of the sEMG sensors [23]. However, the majority of the literature presented in this paper did not investigate the impact of the change of these elements on gesture decoding.
Based on this, three influencing elements were designed in this paper, namely, muscle fatigue, forearm angle and acquisition time. These elements are the most common negative factors influencing gesture decoding, based on sEMG. When the arms are held at the same position maintaining one gesture, these active tightened muscles fatigue quickly [24], and the position of the forearm is accidentally modified to induce forearm angle changes [25]. Moreover, exploring the impact of varied sEMG signal acquisition times on gesture decoding accuracy is critical for sEMG signal robustness [26].
In order to train the classification model considering these three elements, individuals were instructed to make the same gesture in different periods, different forearm angles and different muscle fatigue levels. The control variable method was then used to validate the classification accuracy of the model to compare the negative impact of these elements on gesture decoding.
The rest of this paper is organized as follows: the experimental apparatus, muscle selection, forearm angle, gesture selection and experimental settings are described in Section 2. The feature extraction and classification methods used in this experiment are reported in Section 3. The results of impact of three influencing elements for classification accuracy are discussed in Section 4. Finally, Section 5 concludes the paper and presents further research directions.
2. Apparatus and Experiments
The experiments were performed by five able-bodied subjects, namely, three males and two females, age = 23 ± 2, with their dominant hand. All five subjects were right hand dominant. Before the experiment, all subjects were informed about the experiment and provided the informed consent. The testing procedure was in accordance with the declaration of Helsinki.
2.1. Apparatus
The sEMG signal of the forearm was collected by a Myon Aktos-mini EMG amplifier (Cometa company, Milan, Italy) (Figure 1). The apparatus used disposable gel electrodes (H124SG) to attach to the target muscles. The gel electrodes can provide lower skin contact impedance, reduce the influence of external interference source and improve the signal-to-noise ratio, compared to dry electrodes [4]. The sampling rate of the sEMG amplifier was 2000 Hz, and the Butterworth filter (20–500 Hz) was used for bandpass filtering.
2.2. Muscle Selection
Since the experimental apparatus has four double electrode channels, based on previous experience and reference [27], four specific muscles were chosen for placing gel electrodes in all subjects: superficial flexor digitorum (SFD), which plays a role in finger bending except for the thumb and works in internal flexion of the wrist joint; flexor carpi ulnaris (FCU), which contracts in internal rotation of the wrist, downward wrist deviation, and elbow joint flexion; extensor carpi radialis longus (ECRL), which acts in wrist external rotation, wrist upward deviation and elbow extension; and finger extensor (FE), which works in finger extension and wrist extension except for the thumb (Figure 2).
2.3. Forearm Angle
The elbows of all subjects were placed on the table when they performed gesture movements, so the forearm angle referred to the angle between the forearm and the tabletop. In order to comprehensively analyze the negative impact of the angle on the sEMG signal from small angle difference and large angle difference, the forearm angle range and the quality of sEMG signal typically utilized in actual gesture decoding were also considered. In this paper, three forearm angles were selected, namely, 30°, 45° and 75° (Figure 3). During the experiment, the upper and lower angle deviation did not exceed ± 5°.
2.4. Gesture Selection
In practice, hand movements can be roughly divided into three categories according to the strength and type of muscle contraction [27,28]: (1) basic hand movements: hand closing (HC) and hand opening (HO); (2) wrist movements: wrist flexion (WF), wrist extension (WE), ulnar deviation (UD) and radial deviation (RD); (3) finger movements: thumb touches index finger (TI), middle finger (TM), ring finger (TR), little finger (TL), and the five fingertips touch (FL) (Figure 4). These hand movements basically cover the common gestures in daily life. In addition, there was a relax gesture (RE) as a reference, which was not analyzed.
2.5. Experimental Setting
Each subject was provided with a motion instruction regarding how the experimental task was to be performed. For all experiments, subjects sat up straight, put their elbow on the table, exerted slight force on their arms, and tried to retain the force of each hand movement as consistent as possible.
The sEMG signal was sampled for five consecutive days, with an interval of one day for each sampling. Therefore, the signal data were measured three times totally, and were divided into categories , and . Forearm angles were also divided into 30°, 45°, and 75° respectively. Starting from the relaxation gesture, each gesture lasted for 5 s, and the interval between each gesture was 10 s. A group of 11 gestures was made up the normal muscle group and recorded as Class ; then, restarting in a short time from the relaxation gesture, each gesture lasted for 15 s, and there was no interval between each gesture. A group of 11 gestures was made up the fatigue muscle group, regarded as Class .
Regarding muscle fatigue, it has been demonstrated that as muscle fatigue continues, the frequency domain power spectrum shifts to the low frequency direction [24]. At the same time, the time domain feature RMS will increase, and Mean Power Frequency (MPF) feature of the frequency domain will decrease [29,30]. As a result, these two eigenvalues have been frequently used as sensitive muscle tiredness indicators.
In this paper, all subjects were performed the muscle fatigue verification experiment at a 45° forearm angle according to the experimental description above; then, the power spectrum of the sEMG signal was analyzed, as shown in Figure 5. Due to paper space limitations, one of each type of gestures of one subject was chosen as the representation, namely, HO, UD and TM. Additionally, the RMS and MPF features of each gesture of same subject were analyzed (Table 1). The results show that the spectrum of fatigued muscle group (Class ) goes to the left when compared to the spectrum of normal muscle group (Class ), indicating that the power spectrum shifts to the low frequency during muscular exhaustion. Moreover, the RMS and FPM values in Table 1 increased and decreased, respectively, indicating the presence of muscular exhaustion.
Table 1.
Gesture | RMS | FPM | ||
---|---|---|---|---|
Normal | Fatigue | Normal | Fatigue | |
HC | 182.23 | 252.05 | 90.35 | 65.45 |
HO | 168.33 | 227.97 | 95.36 | 82.63 |
WF | 71.61 | 118.07 | 132.28 | 65.70 |
WE | 87.10 | 118.75 | 137.46 | 105.80 |
UD | 116.77 | 141.20 | 94.69 | 76.93 |
RD | 200.10 | 229.87 | 119.02 | 103.70 |
TI | 80.76 | 109.14 | 124.95 | 75.34 |
TM | 138.16 | 172.96 | 84.46 | 59.37 |
TR | 111.89 | 104.92 | 76.07 | 51.34 |
TL | 154.70 | 132.99 | 99.48 | 94.76 |
FL | 138.62 | 108.16 | 86.60 | 98.89 |
It must also be noted that due to factors such as gesture delay and error, when extracting valid gesture data, the first second and last second of each sampling were discarded, and only the middle three seconds were retained as valid data in Class , and only 12–14 s were retained as valid data in Class .
In order to avoid muscle accumulated fatigue, the relaxation time after each Class and Class was ten minutes. Taking into account three different issues, 18 types of datasets could be obtained, and each type of dataset had five groups. The detailed dataset classification is shown in Table 2.
Table 2.
FA (30°) | FA (45°) | FA (75°) | ||
---|---|---|---|---|
Day1 | Normal | A_a_30 | A_a_45 | A_a_75 |
Fatigue | A_b_30 | A_b_45 | A_b_75 | |
Day2 | Normal | B_a_30 | B_a_45 | B_a_75 |
Fatigue | B_b_30 | B_b_45 | B_b_75 | |
Day3 | Normal | C_a_30 | C_a_45 | C_a_75 |
Fatigue | C_b_30 | C_b_45 | C_b_75 |
3. Methodology
The raw signal was collected to obtain the dataset, and then, segmentation was needed. A shorter window size increases the time resolution and captures more complex gestures. However, a window size that is too small might lead to classification error. On the contrary, longer window length can be used to avoid high deviation and variance but will cause user perception delay [17,31]. In this paper, a 400 ms window size with 50 ms increments was used to split the raw sEMG signal.
3.1. Feature Extraction
Generally, the most important factor for reliable classification results for any signal is the extraction of discriminating features [32]. The extracted features for sEMG signal analysis can be divided into three categories, namely, Time Domain (TD), Frequency Domain (FD) and Time-Frequency Domain (TFD) features [33]. TD features contain the information about the sEMG signal that is extracted from signal amplitude, while FD features contain information regarding the power spectral density of the signal. TFD features can characterize varying frequency information at different time locations, providing non-stationary information of the analyzed signal. Compared with the other two types of features, because of their low computational difficulty, small amount of computation and good classification performance, time-domain features are widely used [34,35].
In this paper, four TD features were used to extract the sEMG signal, namely Root Mean Square (RMS), Waveform Length (WL), Zero Crossing (ZC) and Slope Sign Change (SSC) [36,37]. Each channel had four features, and each apparatus had four channels. Therefore, the input dimensions of classification were sixteen, and because there were fewer input dimensions, no dimensionality reduction but normalization was required.
-
(1)Root Mean Square: RMS is the square root of the average power of the signal at a given time. This feature quantifies the effort of the muscle. RMS is defined as follows:
where is the length of the window size and is the th sample point.(1) -
(2)Waveform Length: WL is the measurement of the waveform amplitude, frequency and duration of the signal, which is an index to measure the complexity of the signal. WL is defined as follows:
where is the length of the window size and is the th sample point.(2) -
(3)Zero Crossing: ZC is the number of zero crossing at a given time period. ZC provides important information regarding the FD characteristics of the signal and is an important indicator of muscle fatigue. ZC is defined as follows:
where is the length of window size, is the th sample point and is the voltage threshold, which was selected referring to the signal noise of the skin surface. In this paper, the threshold was set at 20 mV.(3) -
(4)Slope Sign Change: SSC is the number of times the slope of the measured waveform changes signs. It provides important information about the FD characteristics and is defined as follows:
where is the length of the window size, is the th sample point, and is the voltage threshold, which was selected referring to the signal noise of the skin surface. In this paper, the threshold was set at 20 mV.(4)
3.2. Classification Model
As the last step of gesture decoding, selecting an appropriate classification model will help improve the recognition accuracy and generalization ability. At present, commonly used classification models include Linear Discriminant Analysis (LDA), K-Nearest Neighbor (KNN), Bayes, Support Vector Machine (SVM), Neural Network, etc. In this paper, LDA and Probabilistic Neural Network (PNN) were selected as the classification models.
3.2.1. LDA Classification Model
LDA classification learning is a classic linear learning algorithm. Its principle is simple: Given a training dataset, try to project the dataset into a low-dimensional space, making the projection points of the same data as close as possible while the projection points of different data as far as possible (Figure 6). The data from the testing datasets are projected into the same low dimensional space, and then classified according to the position of the projection points.
On one hand, to make the projection points of same data lie close to one another, their covariance should be minimized. On the other hand, the distances between centers of different kinds of classes need to be far away causing the projection spacing of them to be larger. Here, two concepts are introduced. The within-class scatter matrix is defined as
(5) |
where is the number of classes; and are the set and mean vector of the th class, respectively. The inter-class scatter matrix is defined as
(6) |
where is the number of th class, and is the mean vector for all classes. Considering both intra-class and inter-class scatter matrices, we can achieve classification through Formula (7):
(7) |
where the closed-form solution of matrix is composed of largest non-zero generalized eigenvalues corresponding to the eigenvectors of ; and are the dimensions of features before and after projection; is the trace of the matrix.
3.2.2. PNN Classification Model
Probabilistic Neural Network (PNN) was first proposed by Dr. D.F. Specht in 1989. Currently, it is commonly used for pattern classification. Its schematic diagram is shown in Figure 7:
The weight of radial layer is directly taken from input dataset , and dimension Q of corresponds to the number of neurons in the radial layer. When is input into the network, the distance between and is obtained using the Euclidean distance and stored in dist. The value in dist is then multiplied by the polarization factor point by point. The results are applied to the radial basis function provided as output .
It is worth noting that when polarization factor is too small, the input of activation function will become smaller; therefore, output will increase following radial basis function conversion, which will improve the contribution of the training dataset to the current dataset, leading data overfitting. On the contrary, if polarization factor is too large, the contribution of the training dataset to the current dataset will decrease, leading to data underfitting.
The weight of the competition layer was set as the expected value vector matrix , in which each row vector had only one element of 1, representing the corresponding category; the remaining elements were 0, and then, the product of the matrix and the radial layer output was calculated. Finally, was obtained through the competition calculation of the transfer function of the competition layer, the larger element took the value 1, and the rest was 0, so that the classification of the input vector could be completed.
As a variant of the RBF network, PNN network has a simple structure and few parameters to be adjusted. Furthermore, it can achieve arbitrary nonlinear approximation. Compared with the traditional BP network, the training time of the PNN network is only slightly longer than the time of reading data and has global convergence, which is suitable for real time gesture decoding.
3.3. Learning Framework
In this paper, the influences of the subjects at different muscle fatigue levels, forearm angles and acquisition times on gesture decoding were studied. To this end, the TD features from sEMG data were extracted. A detailed flow is described in Figure 8. The features datasets were then divided into two sets, one for training and the other for decoding. To control variables, the four different categories were divided for analysis, namely:
-
(1)
Influence of Muscle Fatigue (MF): Compare the effects of different muscle fatigue levels (normal or fatigue), same acquisition time and forearm angle on gesture decoding accuracy.
-
(2)
Influence of Forearm Angle (FA): Compare the effects of different forearm angles (30°, 45°, 75°), same muscle fatigue level and acquisition times on gesture decoding accuracy.
-
(3)
Influence of Acquisition Time (AT): Compare the effects of different acquisition times (), same muscle fatigue level and forearm angles on gesture decoding accuracy.
-
(4)
Influence of MF, FA and AT: Compare the effects of different muscle fatigue, forearm angle and acquisition time on gesture decoding accuracy.
4. Results and Discussions
In the practical application of gesture decoding based on the sEMG signal, trained classification models have never used a validation dataset. In order to fit the accuracy of the actual classification model for gesture decoding, the commonly used cross validation method (dividing the dataset into k mutually exclusive subsets of similar size, then successively selecting the union of k-1 subsets as the training set and the remaining subsets as the test set) was not employed; instead, the method in which the training and testing datasets are completely separated from each other was used (data from the testing dataset were not used for training). Furthermore, the results were the mean values obtained by disrupting the data cycle training ten times. Additionally, decoding accuracy was selected as validation metric.
4.1. Influence of Muscle Fatigue
In this section, Class datasets trained models were used to test Class and Class datasets, respectively, to obtain the effect of muscle fatigue on the accuracy of gesture decoding. At the same time, Class datasets trained models were used to test Class and Class datasets, respectively. Results are shown in Figure 9.
The results obtained by the two models using Class and Class as both training and testing datasets, respectively, were close, and average decoding accuracies were approximately 95%. When using Class to test Class and Class to test Class , the results obtained by the two models were also close. However, compared to the first two types of models, a considerable reduction was observed in the average decoding accuracies, which were close to 88%. In addition, decoding accuracy rates fluctuated greatly, with a maximum gap of nearly 15%. This pointed to the effect muscle fatigue has on the accuracy of gesture decoding, making it unstable. Additionally, a Pearson correlation analysis was performed on each subject to verify the universality of the influence of muscle fatigue on the model (Table 3). When compared to using Class datasets for both training and testing, using Class datasets for both training and testing datasets instead resulted in lower correlation coefficient R values. This could be related to the fact that muscle fatigue in each participant was unpredictable. Nevertheless, almost all R values were higher than 0.4. Most of the R values of Class to Class and Class to Class in each subject were over 0.5.
Table 3.
Model | Subj.1 | Subj.2 | Subj.3 | Subj.4 | Subj.5 | |
---|---|---|---|---|---|---|
Class to Class | LDA | 0.6409 | 0.8109 | 0.7111 | 0.4123 | 0.5616 |
PNN | 0.4987 | 0.6057 | 0.6943 | 0.5313 | 0.5413 | |
Class to Class | LDA | 0.4884 | 0.8545 | 0.3950 | 0.6125 | 0.4910 |
PNN | 0.4402 | 0.6866 | 0.4828 | 0.4836 | 0.5880 | |
Class to Class | LDA | 0.5360 | 0.9183 | 0.6695 | 0.6302 | 0.6870 |
PNN | 0.6141 | 0.7128 | 0.7331 | 0.7391 | 0.6751 | |
Class to Class | LDA | 0.9255 | 0.5556 | 0.5789 | 0.5843 | 0.7746 |
PNN | 0.3734 | 0.5210 | 0.6684 | 0.6731 | 0.6448 |
4.2. Influence of Forearm Angle
The results from the previous section showed that muscle fatigue had an impact on the accuracy of gesture decoding. Furthermore, under same conditions, when using Class and Class as training and testing datasets, respectively, while using Class to test Class and Class to test Class , the results were close. Therefore, it is reasonable to assume that when comparing the effect of forearm angle and acquisition time on the classification models, the accuracies of using single Class or Class datasets will be similar. Here, only Class datasets were used in this section, for paper limitation reasons.
To avoid the influence of signal acquisition time, the training datasets and corresponding testing datasets were collected on the same day. The datasets with forearm angles of 30°, 45° and 75° collected in three days were used to test the datasets with forearm angles of 30°, 45° and 75° collected on the same day. The results are shown in Figure 10.
The results of the above two classification models showed that when using same angle data for both training and testing, the accuracies of the two models were the highest. However, the greater the gap of forearm angle was the lower the decoding accuracy. For instance, when 30° of forearm angle was used as training dataset and 45° and 75° as testing datasets, the average decoding accuracies decreased by about 3% and 7%, respectively, compared to using 30° as testing dataset. At the same time, when the forearm angle of 45° was used as training dataset to test 30° and 75°, the decoding accuracies returned by the classification models were similar. It is confirmed that forearm angle had an impact on the accuracy of gesture decoding, and the greater the angle difference, the bigger the impact was. Moreover, a Pearson correlation analysis was performed on each subject to verify the universality of the influence of forearm angle on the model (Table 4). It can be seen that almost all R values were greater than 0.7 except for subject 1 in 45 as validation dataset, confirming the universality of the impact of forearm angle.
Table 4.
Trend | Model | Subj.1 | Subj.2 | Subj.3 | Subj.4 | Subj.5 |
---|---|---|---|---|---|---|
30° as Validation dataset | LDA | 0.8437 | 0.8741 | 0.7607 | 0.7948 | 0.8622 |
PNN | 0.8273 | 0.8542 | 0.8206 | 0.7484 | 0.9582 | |
45° as Validation dataset | LDA | 0.5925 | 0.9974 | 0.9486 | 0.9368 | 0.7738 |
PNN | 0.6883 | 0.8715 | 0.9388 | 0.9458 | 0.8211 | |
75° as Validation dataset | LDA | 0.9536 | 0.9428 | 0.9797 | 0.7718 | 0.9981 |
PNN | 0.9147 | 0.9358 | 0.9486 | 0.7408 | 0.9429 |
4.3. Influence of Acquisition Time
Similar to Section 4.2, the impact of acquisition time on gesture decoding was discussed in this section. In order to control the impact of muscle fatigue and forearm angle, only Class datasets were used. Additionally, datasets with acquisition time of were used to test the datasets of acquisition time of respectively, with the same forearm angle (Figure 11).
From the results in Figure 11, it became clear that when using the same acquisition time data as both training and testing datasets, the accuracies of the two models were the highest, and the average decoding accuracies were close to 95%, which were similar to the results of forearm angle. Moreover, the bigger the gap of acquisition time was, the lower the decoding accuracy. For example, using as training dataset, as testing dataset, while using dataset to test , the average decoding accuracies both decreased by more than 20%. Nevertheless, models trained by were used to test datasets and respectively, and average decoding accuracies decreased lower than the above models, by approximately 5–10%. Similarly, to the previous section, Pearson correlation coefficient was used to validate the universality of the conclusion (Table 5). The majority of R values were rarely lower than 0.95. This is due to the fact that acquisition time had a considerable impact on gesture decoding (maximum above 20%), and the longer the collection time span, the more severe the impact. The Pearson correlation coefficient became insensitive in this instance, and tiny variations in the decoding accuracy of various participants will not induce changes in the correlation coefficient. Additionally, all subjects showed the same trend.
Table 5.
Trend | Model | Subj.1 | Subj.2 | Subj.3 | Subj.4 | Subj.5 |
---|---|---|---|---|---|---|
day1 as Validation dataset | LDA | 0.9881 | 0.9995 | 0.9929 | 0.9815 | 0.9865 |
PNN | 0.9916 | 0.9996 | 0.9736 | 0.9697 | 0.9938 | |
day2 as Validation dataset | LDA | 0.9884 | 0.9837 | 0.9581 | 0.9772 | 0.9620 |
PNN | 0.8750 | 0.9627 | 0.9940 | 0.9812 | 0.9913 | |
day3 as Validation dataset | LDA | 0.9960 | 0.9987 | 0.9289 | 0.9996 | 0.9984 |
PNN | 0.9941 | 0.9960 | 0.9515 | 0.9907 | 0.9869 |
In summary, among factors such as muscle fatigue, forearm angle and acquisition time, the acquisition time had the greatest impact on the accuracy of gesture decoding, which may also be related to the position offset of the disposable electrode patch at different acquisition times.
4.4. Influence of MF, FA and AT
The three influencing factors (i.e., different muscle fatigue level, different forearm angle and different acquisition time) were included in each classification model. Therefore, 18 classification models were designed within three categories (, , ) for each subject (Table 6).
Table 6.
Model | AB_C (6 Models) | ||
1 | A_b_30 | B_a_45 | C_a_75 |
2 | A_a_30 | B_b_45 | C_b_75 |
3 | A_a_45 | B_b_75 | C_a_30 |
4 | A_b_45 | B_a_75 | C_b_30 |
5 | A_a_75 | B_a_30 | C_b_45 |
6 | A_b_75 | B_b_30 | C_a_45 |
Model | BC_A (6 Models) | ||
1 | B_b_30 | C_a_45 | A_a_75 |
2 | B_a_30 | C_b_45 | A_b_75 |
3 | B_a_45 | C_b_75 | A_a_30 |
4 | B_b_45 | C_a_75 | A_b_30 |
5 | B_a_75 | C_a_30 | A_b_45 |
6 | B_b_75 | C_b_30 | A_a_45 |
Model | CA_B (6 Models) | ||
1 | C_b_30 | A_a_45 | B_a_75 |
2 | C_a_30 | A_b_45 | B_b_75 |
3 | C_a_45 | A_b_75 | B_a_30 |
4 | C_b_45 | A_a_75 | B_b_30 |
5 | C_a_75 | A_b_30 | B_b_45 |
6 | C_b_75 | A_b_30 | B_a_45 |
For example, in category, the data collected by and were used for training, and data acquired by were used for testing. For the first model in category, two of the five groups in A_b_30 and B_a_45 were randomly selected, respectively, to form training datasets and then choose one of five groups successively in C_a_75 as testing dataset, and the same procedures were followed for the other 17 models. To lower bias in the experimental results, the participants with the best and worst decoding performance were eliminated, and then, ten decoding result points were deducted from each of the three subjects in the center, for a total of 30 points in each model. The obtained results are shown in Figure 12.
According to the results, the decoding accuracy rates of all major categories were not stable, with a maximum rate gap of more than 25%. Conversely, it was easy to find that the trend of decoding accuracies between and was similar, but the average accuracies of were higher than that of , nearly 5–10%. On the other hand, the trend of decoding accuracies of was stable, which was different from the other two categories. As a result, it is postulated that a major reason for this situation was the different acquisition time. Moreover, when choosing bigger gap of forearm angle data as training datasets (randomly choose two out of five groups in angle of 30° and 75°, respectively), the average decoding accuracies generally deteriorated.
5. Conclusions
In this paper, the effects of muscle fatigue, forearm angle and acquisition time on the validation accuracy of gesture decoding were investigated. For this purpose, four specific muscles (i.e., SFD, FCU, ECRL and FE) and 11 hand movements, commonly used in daily life, were selected. Meanwhile, four TD features (RMS, WL, ZC, and SSC) and two classification models (LDA and PNN) were chosen to analyze the sEMG signal.
The analysis of the signal was performed in four parts: The first part was an analysis of the influence of muscle fatigue on the accuracy of gesture decoding. The second and third parts were analyses of the influence of forearm angle and acquisition time on the accuracy of gesture decoding. The final part was a comprehensive analysis of the effect of the three elements mentioned above on gesture decoding accuracy.
From Section 4.1, it was concluded that muscle fatigue had an impact on the classification model for gesture decoding, by decreasing the average accuracy by approximately 7%. Nevertheless, the validation accuracy exceeded 88%, which showed that the negative impact of muscle fatigue was relatively small (Figure 9). Based on the results of Section 4.2 and Section 4.3, it can be concluded that when using same forearm angle or acquisition time data as both training and testing datasets, the decoding accuracies of the two models were the highest, both close to 95%. Furthermore, it was observed that the average test accuracy was deteriorating with the increasing gap of forearm angle and acquisition time. However, the negative impact on the accuracy of gesture decoding of acquisition time was substantially higher than that of forearm angle. The maximum accuracy decrease of acquisition time was more than 20%, whereas that of forearm angle was less than 10%. Finally, the impact of three influencing elements on the classification model was comprehensively considered. The results showed that the decoding accuracy of each category was not stable. Among them, and had a similar accuracy trend, and the average accuracies of were higher than that of , close to 5–10%. However, the trend of average decoding accuracies of was relatively stable.
On the one hand, human hand movement is one of the most complex limb movements, with many factors that hinder gesture decoding. On the other hand, as a non-invasive signal connected to limb movement, sEMG has several advantages in an intuitive human–computer interface, including convenience, freedom of space and light constraints. Therefore, the study of sEMG-based gesture decoding opens new opportunities for future practical applications, such as prosthetic control and virtual reality.
Although this article has provided theoretical and experimental results, many issues that need further discussion still remain. First, the conditions for classifying muscle fatigue levels are only outlined, so a more precise and effective definition of the conditions for muscle fatigue classification is needed. Second, this paper only explored the EMG signals of the three angles of the forearm when the human body sits upright, which only covers a small part of the actual human movement. Therefore, more scenarios need to be explored. Third, this article only used the EMG data collected in three days. In the future, we will research the changes of EMG signal over time and the impact on the classification model of gesture decoding in a longer time span. Finally, the experiment described in this paper was carried out in a university laboratory, and the volunteers there were graduate students pursuing a master’s degree, so they were young and had little age difference. Furthermore, the problem of the left-hand or right-hand dominant was not considered in this research. In the following expansion experiment, volunteers of various ages and dominant hands will be recruited to carry out the experiment in order to ensure that the experimental outcomes are universal.
Author Contributions
Conceptualization, Z.Q., Z.L. and J.W.; methodology, Z.Q., Z.L. and J.W.; software, Z.Q. and J.W.; validation, Z.Q. and Z.L.; formal analysis, Z.L. and Y.C.; investigation, Z.Q., Z.L. and J.W.; re-sources, Z.L. and Y.C.; data curation, Z.Q.; writing—original draft preparation, Z.Q.; writ-ing—review and editing, Z.L.; visualization, Z.Q.; supervision, Z.L. and Y.C.; project administra-tion, Z.L.; funding acquisition, Z.L. and Y.C. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by the National Natural Science Foundation of China [Grant No. 61801122].
Institutional Review Board Statement
The study was conducted according to the guidelines of the Declaration of Helsinki, and approved by the Institutional Review Board of School of Mechanical Engineering and Automation of Fuzhou University (202107).
Informed Consent Statement
Not applicable.
Data Availability Statement
Data sharing is not applicable to this article. Please contact the authors for further requests.
Conflicts of Interest
The authors declare no potential conflict of interest with respect to the research, authorship and publication of the article.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1.Mohamed A.K., Marwala T., John L.R. Single-trial EEG Discrimination between Wrist and Finger Movement Imagery and Execution in a Sensorimotor BCI; Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society; Boston, MA, USA. 30 August–3 September 2011; [DOI] [PubMed] [Google Scholar]
- 2.Yanagisawa T., Hirata M., Saitoh Y., Kishima H., Matsushita K., Goto T., Fukuma R., Yokoi H., Kamitani Y., Yoshimine T. Electrocorticographic control of a prosthetic arm in paralyzed patients. Ann. Neurol. 2012;71:353–361. doi: 10.1002/ana.22613. [DOI] [PubMed] [Google Scholar]
- 3.Xie H.B., Zheng Y.P., Guo J.Y. Classification of the mechanomyogram signal using a wavelet packet transform and singular value decomposition for multifunction prosthesis control. Physiol. Meas. 2009;30:441–457. doi: 10.1088/0967-3334/30/5/002. [DOI] [PubMed] [Google Scholar]
- 4.Simao M., Mendes N., Gibaru O., Neto P. A Review on Electromyography Decoding and Pattern Recognition for Human-Machine Interaction. IEEE Access. 2019;7:39564–39582. doi: 10.1109/ACCESS.2019.2906584. [DOI] [Google Scholar]
- 5.Jaramillo-Yánez A., Benalcázar M.E., Mena-Maldonado E. Real-Time Hand Gesture decoding Using Surface Electromyography and Machine Learning: A Systematic Literature Review. Sensors. 2020;20:2467. doi: 10.3390/s20092467. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 6.Saponas T.S., Tan D.S., Dan M., Turner J., Landay J.A. Making Muscle-Computer Interfaces More Practical; Proceedings of the 28th International Conference on Human Factors in Computing Systems, CHI 2010; Atlanta, GA, USA. 10–15 April 2010. [Google Scholar]
- 7.Wang N., Lao K., Zhang X. Design and Myoelectric Control of an Anthropomorphic Prosthetic Hand. J. Bionic Eng. 2017;14:47–59. doi: 10.1016/S1672-6529(16)60377-3. [DOI] [Google Scholar]
- 8.Kwon Y., Dwivedi A., Mcdaid A.J., Liarokapis M. Electromyography Based Decoding of Dexterous, In-Hand Manipulation of Objects: Comparing Task Execution in Real World and Virtual Reality. IEEE Access. 2021;9:37297–37310. doi: 10.1109/ACCESS.2021.3062364. [DOI] [Google Scholar]
- 9.Kiguchi K., Hayashi Y. An EMG-Based Control for an Upper-Limb Power-Assist Exoskeleton Robot. IEEE Trans. Syst. Man Cybern. Part B (Cybern.) 2012;42:1064–1071. doi: 10.1109/TSMCB.2012.2185843. [DOI] [PubMed] [Google Scholar]
- 10.Li C., Li G., Jiang G., Chen D., Liu H. Surface EMG data aggregation processing for intelligent prosthetic action recognition. Neural Comput. Appl. 2018;32:1–12. doi: 10.1007/s00521-018-3699-3. [DOI] [Google Scholar]
- 11.Qi J., Jiang G., Li G., Sun Y., Tao B. Surface EMG hand gesture recognition system based on PCA and GRNN. Neural Comput. Appl. 2020;32:6343–6351. doi: 10.1007/s00521-019-04142-8. [DOI] [Google Scholar]
- 12.Dwivedi A., Kwon Y., Mcdaid A.J., Liarokapis M. A Learning Scheme for EMG Based Decoding of Dexterous, In-Hand Manipulation Motions. IEEE Trans. Neural Syst. Rehabil. Eng. 2019;27:2205–2215. doi: 10.1109/TNSRE.2019.2936622. [DOI] [PubMed] [Google Scholar]
- 13.Kwon Y., Dwivedi A., Mcdaid A.J., Liarokapis M. On Muscle Selection for EMG Based Decoding of Dexterous, In-Hand Manipulation Motions; Proceedings of the 2018 40th Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC); Honolulu, HI, USA. 18–21 July 2018; [DOI] [PubMed] [Google Scholar]
- 14.Liu X., Sacks J., Zhang M., Richardson A.G., Lucas T.H., der Spiegel J.V. The Virtual Trackpad: An Electromyography-Based, Wireless, Real-Time, Low-Power, Embedded Hand-Gesture-Recognition System Using an Event-Driven Artificial Neural Network. IEEE Trans. Circuits Syst. II Express Briefs. 2017;64:1257–1261. doi: 10.1109/TCSII.2016.2635674. [DOI] [Google Scholar]
- 15.Yang J., Pan J., Li J. sEMG-based continuous hand gesture decoding using GMM-HMM and threshold model; Proceedings of the 2017 IEEE International Conference on Robotics and Biomimetics (ROBIO); Macau, China. 5–8 December 2017. [Google Scholar]
- 16.Xiong C.H., Chen W.R., Sun B.Y., Liu M.-J., Yue S.-G., Chen W.-B. Design and Implementation of an Anthropomorphic Hand for Replicating Human Grasping Functions. IEEE Trans. Robot. 2016;32:652–671. doi: 10.1109/TRO.2016.2558193. [DOI] [Google Scholar]
- 17.Na Y., Choi C., Lee H.D., Kim J. A Study on Estimation of Joint Force Through Isometric Index Finger Abduction With the Help of SEMG Peaks for Biomedical Applications. IEEE Trans. Cybern. 2017;46:2–8. doi: 10.1109/TCYB.2014.2386856. [DOI] [PubMed] [Google Scholar]
- 18.Min H., Chen Z., Fang B., Xia Z., Song Y., Wang Z., Zhou Q., Sun F., Liu C. Cross-Individual Gesture decoding Based on Long Short-Term Memory Networks. Sci. Program. 2021;2021:6680417. [Google Scholar]
- 19.Wang Z., Fang Y., Li G., Liu H. Facilitate sEMG-Based Human–Machine Interaction Through Channel Optimization. Int. J. Humanoid Robot. 2019;16:797–809. doi: 10.1142/S0219843619410019. [DOI] [Google Scholar]
- 20.UCté-Allard Fall C.L., Drouin A., Campeau-Lecours A., Gosselin C., Glette K., Laviolette F., Gosselin B. Deep Learning for Electromyographic Hand Gesture Signal Classification Using Transfer Learning. IEEE Trans. Neural Syst. Rehabil. Eng. 2019;27:760–771. doi: 10.1109/TNSRE.2019.2896269. [DOI] [PubMed] [Google Scholar]
- 21.Adewuyi A.A., Hargrove L.J., Kuiken T.A. An Analysis of Intrinsic and Extrinsic Hand Muscle EMG for Improved Pattern Recognition Control. IEEE Trans. Neural Syst. Rehabil. Eng. 2016;24:485–494. doi: 10.1109/TNSRE.2015.2424371. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 22.Sensinger J.W., Lock B.A., Kuiken T.A. Adaptive Pattern Recognition of Myoelectric Signals: Exploration of Conceptual Framework and Practical Algorithms. IEEE Trans. Neural Syst. Rehabil. Eng. 2009;17:270–278. doi: 10.1109/TNSRE.2009.2023282. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Major N., Malinzak M.D. Netter’s Correlative Imaging: Musculoskeletal Anatomy. Elsevier Health Science; Amsterdam, The Netherlands: 2010. [Google Scholar]
- 24.Cifrek M., Medved V., Tonkovi S., Ostojić S. Surface EMG based muscle fatigue evaluation in biomechanics. Clin. Biomech. 2009;24:327–340. doi: 10.1016/j.clinbiomech.2009.01.010. [DOI] [PubMed] [Google Scholar]
- 25.Upper Limb Elbow Joint Angle Estimation Based on Electromyography Using Artificial Neural Network; Proceedings of the South East Asian Technical University Consortium; Yogyakarta, Indonesia. 12–13 March 2018. [Google Scholar]
- 26.Allard U.C., Nougarou F., Fall C.L., Giguère P., Gosselin C., Laviolette F., Gosselin B. A convolutional neural network for robotic arm guidance using sEMG based frequency-features; Proceedings of the IEEE/RSJ International Conference on Intelligent Robots & Systems; Daejeon, Korea. 9–14 October 2016. [Google Scholar]
- 27.Xun C., Wang Z.J. Pattern recognition of number gestures based on a wireless surface EMG system. Biomed. Signal Process. Control. 2013;8:184–192. [Google Scholar]
- 28.Villarejo J.J., Costa R.M., Bastos T., Frizera A. Identification of low level sEMG signals for individual finger prosthesis; Proceedings of the Biosignals and Biorobotics Conference (2014): Biosignals and Robotics for Better and Safer Living (BRC), 5th ISSNIP-IEEE; Salvador, Brazil. 26–28 May 2014. [Google Scholar]
- 29.Chang J., Chablat D., Bennis F., Ma L. Advances in Physical Ergonomics and Human Factors. Springer; Cham, Switzerland: 2016. Estimating the EMG Response Exclusively to Fatigue during Sustained Static Maximum Voluntary Contraction; pp. 29–39. [Google Scholar]
- 30.Hill E., Housh T., Smith C., Schmidt R., Johnson G. Muscle- and Mode-Specific Responses of the Forearm Flexors to Fatiguing, Concentric Muscle Actions. Sports. 2016;4:47. doi: 10.3390/sports4040047. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 31.Smith L.H., Hargrove L.J., Lock B.A., Kuiken T.A. Determining the optimal window length for pattern recognition-based myoelectric control: Balancing the competing effects of classification error and controller delay. IEEE Trans. Neural Syst. Rehabil. Eng. 2011;19:186–192. doi: 10.1109/TNSRE.2010.2100828. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Boyali A., Hashimoto N. Spectral Collaborative Representation based Classification for Hand Gestures recognition on Electromyography Signals. Biomed. Signal Process. Control. 2016;24:11–18. doi: 10.1016/j.bspc.2015.09.001. [DOI] [Google Scholar]
- 33.Oskoei M.A., Hu H. Myoelectric control systems: A survey. Biomed. Signal Process. Control. 2007;2:275–294. doi: 10.1016/j.bspc.2007.07.009. [DOI] [Google Scholar]
- 34.Phinyomark A., Quaine F., Charbonnier S., Serviere C., Bernard F.T., Laurillau Y. EMG feature evaluation for improving myoelectric pattern recognition robustness. Expert Syst. Appl. 2013;40:4832–4840. doi: 10.1016/j.eswa.2013.02.023. [DOI] [Google Scholar]
- 35.Lorrain T., Ning J., Farina D. Influence of the training set on the accuracy of surface EMG classification in dynamic contractions for the control of multifunction prostheses. J. Neuro Eng. Rehabil. 2011;8:25. doi: 10.1186/1743-0003-8-25. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Tkach D., He H., Kuiken T.A. Study of stability of time-domain features for electromyographic pattern recognition. J. Neuro Eng. Rehabil. 2010;7:21. doi: 10.1186/1743-0003-7-21. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 37.Dwivedi A., Kwon Y., Mcdaid A.J., Liarokapis M. EMG Based Decoding of Object Motion in Dexterous; Proceedings of the 2018 7th IEEE International Conference on Biomedical Robotics and Biomechatronics (Biorob); Enschede, The Netherlands. 26–29 August 2018. [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Data sharing is not applicable to this article. Please contact the authors for further requests.