Abstract
Goal: This article presents the design and validation of an accurate automatic diagnostic system to classify intramuscular EMG (iEMG) signals into healthy, myopathy, or neuropathy categories to aid the diagnosis of neuromuscular diseases. Methods: First, an iEMG signal is decimated to produce a set of “disjoint” downsampled signals, which are decomposed by the lifting wavelet transform (LWT). The Higuchi's fractal dimensions (FDs) of LWT coefficients in the subbands are computed. The FDs of LWT subband coefficients are fused with one-dimensional local binary pattern derived from each downsampled signal. Next, a multilayer perceptron neural network (MLPNN) determines the class labels of downsampled signals. Finally, the sequence of class labels is fed to the Boyer-Moore majority vote (BMMV) algorithm, which assigns a class to every iEMG signal. Results: The MLPNN-BMMV classifier was experimented with 250 iEMG signals belonging to three categories. The performance of the classifier was validated in comparison with state-of-the-art approaches. The MLPNN-BMMV has resulted in impressive performance measures (%) using a 10-fold cross-validation—accuracy = , sensitivity (normal) =
, sensitivity (myopathy) =
, sensitivity (neuropathy) =
, specificity (normal) =
, specificity (myopathy) =
, and specificity (neuropathy) =
—surpassing the existing approaches. Conclusions: A future research direction is to validate the classifier performance with diverse iEMG datasets, which would lead to the design of an affordable real-time expert system for neuromuscular disorder diagnosis.
Keywords: Fractal dimension, intramuscular electromyography, lifting wavelet transform, local binary pattern, majority vote, multilayer perceptron neural network, neuromuscular disorders
I. Introduction
Electromyography (EMG) is the recording of electrical activities produced by skeletal muscles with noninvasive or invasive methods. Surface EMG (sEMG) is recorded using surface electrodes placed on the skin above the muscle for prosthetic control and rehabilitation purposes [1]–[3]. Whereas, intramuscular EMG (iEMG) is collected using a monopolar or concentric needle electrode inserted through the skin into the muscle tissue. Neuromuscular disorders are known to alter the morphology and physiology of the basic component of the peripheral nervous system called the motor unit (MU). Since stimulation of motor neurons induces contraction or shortening of the muscle fibers, any damage to motor neurons affects muscle function [1], [4]. The extracellular needle EMG records the motor unit action potential (MUAP), the analysis of which provides crucial information for the diagnosis of neuromuscular disorders [1]. Healthy MUAPs have only two to four phases and are smaller in amplitude. By contrast, neurogenic disorders are characterized by long-duration, high-amplitude, and polyphasic MUAPs; in myopathy, MUAPs are smaller in amplitude, polyphasic, and of shorter duration [1], [5]. Furthermore, a frequency content shift toward low and high frequencies is observed in the iEMG signal collected from subjects suffering from neuropathy and myopathy, respectively [6], [7]. Traditionally, MUAPs are examined by the visual and audio characteristics of iEMG signals. However, this practice has been proven effective only to detect some disorders and remains inadequate to explain less apparent deviations or assorted patterns of abnormalities [8]. Therefore, several computer-aided EMG classification algorithms have been developed over the last two decades to accurately classify the iEMG data for reliably diagnosing neuromuscular abnormalities [8].
Typically, a computer-based EMG classifier first extracts features from the raw EMG signals and then employs a classification algorithm to discriminate these features. The selection of an optimal feature set is pivotal for improving the classification accuracy [9], [10]. Therefore, diverse feature sets were explored by various researchers: time domain features [5], [11], [12]; frequency domain features [13]–[16]; and time-frequency (TF) domain features [17]–[25]. An alternate approach is to combine features from different domains [8], [26]–[29].
In like manner, several classifiers were attempted to enhance the iEMG classification performance. Typical examples include artificial neural networks (ANN) [12], [14], [20], [27]–[30]; deep learning algorithm [17]; neuro-fuzzy system [8], [13]; support vector machine (SVM) [7], [18], [26], [31], [32]; -nearest neighbor (KNN) [19], [22], [24], [33]; machine learning [11]; quadratic classifier [21]; and random forest decision tree [23], [34]. Moreover, iEMG signals were often preprocessed prior to classification, e.g., [5], [15], [16].
We propose a combination of multilayer perceptron neural network (MLPNN) and Boyer-Moore majority vote (BMMV), jointly known as MLPNN-BMMV, which is supplied with fused iEMG feature sets. Each feature vector is generated by concatenating Higuchi's fractal dimensions (FDs) of lifting wavelet transform (LWT) coefficients in the subbands and one-dimensional (1-D) local binary pattern (LBP) of a preprocessed iEMG signal.
The myopathy and neuropathy cause a decrease and increase in the number of functional muscle fibers per MU, respectively (p. 226 of [1]). Consequently, neuromuscular disorders alter the duration, amplitude, and phase of MUAPs, and hence induce changes in the iEMG signal characteristics (see Supp. Material I-A). The FD is a useful metric to quantify various MUAP properties buried in the iEMG signal (e.g., firing rate, MUAP amplitude, waveform phases, etc.) and to track variations in signal structure [35]. Furthermore, FD can distinguish specific states of physiological functions from electrophysiological signals [36] and characterize the complexity of nonstationary and nonlinear iEMG signals [37]. On the other hand, 1-D LBP is sensitive to local changes in a signal. Hence it is proven effective in the diagnosis of Parkinson's disease from gait signals [38], Alzheimer's disease [39] and epilepsy [40] from electroencephalography (EEG) signals, and arrhythmia/atrial fibrillation from electrocardiography (ECG) signals [41], [42]. Besides, it has been applied to distinguish muscle activities and rest from sEMG signals [43]. Recall that neuromuscular diseases alter both MUAP firing rates and configurations [44]. Based on the capability of 1-D LBP to capture the signal morphology in terms of relative changes in amplitude within a neighborhood of an electrophysiological signal, we hypothesize that it can help characterize abnormal iEMG patterns caused by a pathological condition. We therefore fused the FD and 1-D LBP features because they carry complementary information: the FD serves as a global measure of complexity quantifying variations in the signal structure, whereas, the 1-D LBP represents the local signal activity around each data point.
The empirical results from the classification of preprocessed (without downsampling) iEMG signals in Table I reveal that the ANN and a few variants of SVM could lead to a better classification compared to discriminant analysis, decision tree, and ensemble learning. Nevertheless, the classification performance does not heavily rely on the choice of the classifier. Instead, the downsampling plays a pivotal role in enhancing the classifier performance by “artificially” increasing the number of samples. This is because an increase in the sample size improves the classification accuracy by reducing the variance of the estimator, even though the samples are not independent. Furthermore, shorter samples due to downsampling do not interfere with the classifier performance to our advantage.
TABLE I. The iEMG Signals Were Classified 25 Times (10-Fold CV) Using a Set of Classifiers Listed in the Left Column. The Average
(%) From 25 Runs Is Reported for Each Classifier. The Best Classifier Under Each Category and the Respective Average
Are Boldfaced.
iEMG Classifier | Average ![]() |
---|---|
LDA |
![]() |
FT |
![]() |
Coarse Tree | 72.00 |
Medium Tree | 72.40 |
Ensemble Boosted Tree | 75.20 |
Ensemble Bagged Tree | 77.39 |
ESD |
![]() |
Ensemble Subspace KNN | 66.56 |
Ensemble Random Undersampling Boosted Tree | 78.56 |
FKNN |
![]() |
Medium KNN | 72.00 |
Coarse KNN | 60.00 |
Cosine KNN | 74.00 |
Cubic KNN | 72.00 |
Weighted KNN | 73.60 |
Linear SVM | 80.59 |
QSVM |
![]() |
CSVM | 81.97 |
Fine Gaussian SVM | 60.00 |
Medium Gaussian SVM | 77.60 |
Coarse Gaussian SVM | 60.00 |
MLPNN | 86.29 |
MLPNN-BMMV |
![]() |
A comparison of performance measures—classification accuracy, sensitivity, and specificity—of MLPNN-BMMV with those of competing approaches is provided in Section III-A. Importantly, the MLPNN-BMMV has contributed to outstanding performance measures—accuracy = , sensitivity (normal) =
, sensitivity (myopathy) =
, sensitivity (neuropathy) =
, specificity (normal) =
, specificity (myopathy) =
, and specificity (neuropathy) =
—surpassing those of state-of-the-art approaches. The novelty of the proposed classifier is four-fold:
-
1)
The LWT has not yet been applied to analyze or classify either sEMG or iEMG signals.
-
2)
FDs of wavelet subband coefficients are introduced to classify iEMG signals.
-
3)
1-D LBP has been used so far to analyze gait [38], EEG [39], [40], ECG [41], [42], and sEMG signals [43], but not in the context of iEMG classification.
-
4)
MLPNN classifier outputs are refined with BMMV in a unique manner. The majority voting algorithm is conventionally fed with the labels of signal segments, e.g., [5]; whereas, we adopted BMMV to refine the labels of “disjoint” downsampled signals.
II. Materials and Methods
A. iEMG Dataset
This study was performed using a publicly available clinical iEMG database [45]. The data were collected from 10 healthy volunteers (six males) aged 21 to 37 years, seven myopathy subjects (five males) aged 19 to 63 years, and eight neuropathy patients (four males) aged 35 to 67 years. A standard concentric needle electrode with a leading-off area of was inserted into brachial biceps muscles and a surface ground electrode was placed on the limb to record the iEMG signals. During data acquisition, participants applied a slight and constant contraction without needle movements. More information on the dataset is provided in Supp. Material II-A.
We constructed two datasets comprising iEMG signals recorded from the biceps brachii muscle to investigate the proposed approach:
-
•
Experimental dataset: A total of 250 iEMG signals—150 signals from 10 healthy, 50 from six myopathy, and 50 from five neuropathy subjects.
-
•
Validation dataset: An exclusive set (no overlap with the experimental dataset) of 100 randomly selected iEMG signals—60 signals from 10 healthy, 20 from four myopathy, and 20 from five neuropathy subjects—distributed with the same proportion across the three classes as that of the experimental dataset.
B. iEMG Classifier Framework
The MLPNN-BMMV classifier for neuromuscular disorder diagnosis is illustrated with a schematic diagram in Fig. 1. A raw iEMG signal is preprocessed and decimated by a factor of to produce a set of
“disjoint” downsampled signals.1 Each downsampled iEMG signal is decomposed by LWT to generate the frequency subbands,
and
, which contain the scaling/wavelet coefficients. The FDs of LWT subband coefficients are in turn computed with Higuchi's algorithm. The 1-D LBP is obtained for the downsampled iEMG signal. The fused feature set (FDs of LWT subband coefficients and 1-D LBP) from every downsampled iEMG signal is fed to the MLPNN classifier. The class labels—normal, myopathy, and neuropathy—assigned by the MLPNN are supplied to the BMMV that ultimately determines the class to which an iEMG signal belongs. The MLPNN-BMMV classifier outcome is subject to a 10-fold cross-validation (CV).
Fig. 1.
Overall schematic of MLPNN-BMMV-iEMG classifier to diagnose neuromuscular disorders. raw iEMG signals from a dataset are preprocessed and downsampled by a factor
to produce
signals. LWT subband coefficients of each downsampled signal are fed to Higuchi's algorithm that computes the coefficients’ FDs. The 1-D LBPs of downsampled signals are deduced. The features—FDs of LWT subband coefficients and 1-D LBP—extracted from a downsampled signal are concatenated to form a feature vector. A total of
feature vectors are supplied to MLPNN to be classified into healthy, myopathy, or neuropathy categories. Finally, the BMMV algorithm assigns a class to an iEMG signal by applying the majority vote rule on a sequence of
labels.
C. Lifting Wavelet Transform
The LWT proposed by Sweldens et. al [46] captures the features from a nonstationary iEMG signal and localizes them in both the time and frequency domains. Compared to the discrete wavelet transform (DWT), LWT has less computational and memory requirements [47].
The LWT decomposes the input signal into the frequency subbands,
and
, containing the scaling/wavelet coefficients. The typical lifting wavelet decomposition consists of four steps, namely, split, predict, update, and normalization [48], as depicted in Fig. 2. A detailed description of LWT is provided in Supp. Material II-B. A major advantage of the lifting scheme is that the entire information content of the input signal is preserved during the analysis process. This means that perfect reconstruction of the signal is possible by using the same predictor
and updater
in both the analysis and synthesis stages of the lifting scheme. Precisely, if we have
and
, then the odd
and even
polyphase coefficients can be calculated, enabling the lossless reconstruction of
.
Fig. 2.
Single-level LWT scheme consisting of split, predict, update, and normalization steps. denotes the input signal.
and
represent even and odd polyphase components, respectively. Predict is a high-pass filter and update is a low-pass filter.
and
are the normalization factors corresponding to
and
, respectively.
The features extracted from the LWT coefficients in each subband represent the signal properties. The level of decomposition is selected as five as set in a DWT-based classifier proposed in [8], [31], [32]; as a result, we obtain six frequency subbands, and
, containing the scaling/wavelet coefficients. Hence, six features are derived from the LWT subband coefficients. The rbio3.7 wavelet function is chosen to compute the LWT coefficients for the reasons mentioned in Supp. Material III-A.
D. Fractal Dimension via Higuchi's Method
Various approaches have been proposed to compute the FD, e.g., Higuchi's, Katz's, and Petrosian's method [49]–[51]. FD features extracted by Katz's and Higuchi's methods were shown to be discriminative in EMG analyses [37]. Studies concerning the fractal analysis of neurological signals have demonstrated that Higuchi's method is superior to Katz's and Petrosian's methods [52], [53].
The FD computed via Higuchi's method is effectively used as a complexity measure of physiological time series, which helps identify the hidden information buried in the signal [51]. The FD estimate by Higuchi's method is claimed to be the most accurate one [36], [54]. Furthermore, for fast evaluation of signal nonlinearity, the numerical approach by Higuchi was proven to be very efficient [51]. Therefore, we rely on Higuchi's method to calculate the FDs of LWT coefficients in each subband. Interested readers are directed to Supp. Material II-C for the description of Higuchi's algorithm to compute FD [49], [55].
E. One-Dimensional Local Binary Pattern
The 1-D LBP code represents the local activity of a signal around a data point in relation to its value [43]. It implies that LBP is not contingent on the absolute value of a signal amplitude or any DC offset. The LBP code can thus be regarded as a robust feature for iEMG classification. For details of 1-D LBP algorithm [40], one may refer to Supp. Material II-D.
F. Feature Fusion
A feature vector of length 262 is constructed from a given iEMG or downsampled signal by concatenating the following: (i) FDs of LWT coefficients in frequency subbands, and
; (ii) 1-D LBP vector of length 256.
G. Multilayer Perceptron Neural Network
The ANN is known for its wider applicability, capability to learn complex as well as nonlinear relations, and the ability to be trained by examples. Its performance is unaffected by factors such as human fatigue, emotional states, and habituation, when used for diagnostic purposes. Moreover, ANN is well-suited for rapid identification, analysis of conditions, and real-time diagnosis [7], [8].
We implement the ANN with three layers—input, hidden, and output—which is termed as MLPNN. The features extracted from downsampled signals serve as inputs for MLPNN. The output layer consists of three nodes representing normal, myopathy, and neuropathy classes associated with the target vectors [100], [010], and [001], respectively. The three-layer MLPNN configuration with one hidden layer having 25 nodes is found suitable for our application. During the training, the initial set of random weights are adjusted with the conjugate gradient backpropagation (Polak-Ribiere update) to reduce the difference between the network outputs and the target outputs. The classifier outcomes—
class labels—are supplied to BMMV, which determines the actual class of iEMG signals. We resort to a 10-fold CV to rigorously evaluate the classification performance. The
-fold CV diminishes the bias in classification, thereby producing a robust estimate of the classifier's error rate [7]. Moreover, the data are randomly partitioned into folds such that the classes are represented in almost the same proportion in each fold as in the entire dataset, namely, stratified
-fold CV, to improve the classification accuracy [7].
H. Boyer-Moore Majority Vote Algorithm
The BMMV finds the majority of a sequence of elements using linear time and constant space [56]. For a given iEMG recording, class labels assigned by MLPNN are regarded as the elements of a sequence. The algorithm finds a majority element if there is one: that is, a class label that occurs repeatedly for more than
times. However, if there is no majority, the algorithm will return an output label, namely, indeterminate class (see the illustration in Supp. Material II-E).
III. Results
A. Performance Evaluation of iEMG Classifier
The automated iEMG classifier scheme was implemented in MATLAB R2017b that assigns an iEMG signal to healthy, myopathy, or neuropathy category. 250 iEMG signals from the experimental dataset were decimated by a factor of (refer to Supp. Material III-B) resulting in 2250
downsampled signals, which were divided into a nonoverlapping training set and test set. Initially, the fused feature sets derived from the training set and the respective class labels were employed for learning. Once the MLPNN was trained, feature vectors from the test set were supplied to the classifier to determine the labels. We performed a 10-fold CV, where the dataset comprising 2250 downsampled iEMG signals was split into 10 subsets called folds, each one containing 225 signals. At every instance, 225 iEMG signals in a fold were used for testing and the remaining 2025 signals from the other nine folds for training. This procedure was repeated 10 times by changing the training and the corresponding test set. After the 10-fold CV, BMMV assigned a class label to each iEMG signal by applying the majority rule on the sequence of the respective downsampled signal labels.
The hyperparameters of MLPNN—number of hidden layer neurons and total training epochs
—were estimated to be 25 and 300, respectively, with a 70–30 hold-out validation performed on a grid of possible values as illustrated in Fig. 3 using the validation set.
Fig. 3.
Graphical representation of the 3-D landscape of values returned by a grid search with a 70–30 hold-out validation for tuning the MLPNN hyperparameters,
and
. To estimate the optimal set of hyperparameters,
and
were varied within the interval
in steps of five and within
in steps of 50, respectively. The hyperparameter values,
and
, corresponding to
denoted by the tallest peak marking an accuracy of 83.33% in the surface plot are adopted in our implementation.
The performance evaluation measures, namely, total classification accuracy , sensitivity
, and specificity
, are indicative of the effectiveness of the classifier. We compare the performance of our approach with state-of-the-art approaches using these measures defined as follows:
![]() |
where TP, TN, FP, and FN refer to the number of true positive, true negative, false positive, and false negative instances with respect to the class under consideration, respectively [26]. In (1)–(3), true or false denotes the assigned classification being correct or incorrect, while positive or negative denotes the assignment to the positive or the negative category.
The proposed MLPNN-BMMV and subcategories of the state-of-the-art classifiers—discriminant analysis, decision tree, ensemble learning, KNN, and SVM—were rigorously validated by executing them 25 times to perform iEMG classification with a 10-fold CV. The investigated classifiers were supplied with the fused feature sets deduced from the experimental dataset. The average computed from 25 runs is recorded in Table I for every approach to highlight the improvement in classification (
) achieved by MLPNN-BMMV. We listed other performance measures of the best-performing classifier in each category, i.e., linear discriminant analysis (LDA), fine tree (FT), ensemble subspace discriminant (ESD), fine KNN (FKNN), and quadratic SVM (QSVM), besides those of MLPNN-BMMV in Table II. Notice that the performance of MLPNN, QSVM, and cubic SVM (CSVM) is comparable and better than the rest of the classifiers. Recall that the improvement in classification is not contingent only on the choice of the classifier, rather heavily relies on the downsampling since “artificially” increasing the sample size reduces the variance of the estimator (refer to the last two rows of Table I).
TABLE II. Average Performance Measures (%) in (1)–(3) for the Best iEMG Classifier Under Each Category. Each Classifier Performed a 10-Fold CV 25 Times.
Performance | LDA | FT | ESD | FKNN | QSVM | MLPNN-BMMV |
---|---|---|---|---|---|---|
Measures (%) | ||||||
![]() |
50.40 | 73.20 | 78.67 | 74.80 | 85.97 | 99.87 |
![]() |
50.00 | 78.00 | 85.12 | 82.00 | 90.48 | 99.97 |
![]() |
56.00 | 64.00 | 72.64 | 56.00 | 80.00 | 99.68 |
![]() |
46.00 | 68.00 | 65.36 | 72.00 | 78.40 | 99.76 |
![]() |
68.00 | 75.00 | 73.80 | 68.00 | 84.96 | 99.72 |
![]() |
73.50 | 89.00 | 93.94 | 90.00 | 94.98 | 99.98 |
![]() |
80.50 | 90.00 | 92.50 | 94.50 | 95.00 | 99.96 |
The average confusion matrix is shown in Table III to provide empirical evidence for the outstanding performance of the approach. We remark that the classification performance did not alter significantly, when we repeated the experiment with a 4-fold CV, suggesting that the improvement in the results is not due to overfitting. Each column and row of the confusion matrix denote the percentage of cases in the desired and the actual class, respectively, or vice versa as given below:
![]() |
where the subscripts ,
, and
represent the healthy, myopathy, and neuropathy classes, respectively. For instance,
is calculated as follows:
![]() |
Thus, the percentages of correctly classified signals are located along the main diagonal of and the nondiagonal values denote the misclassification percentages.
TABLE III. Average Confusion Matrix Constructed With the MLPNN-BMMV (10-Fold CV) Classifier Outcome (%). The Classifier Was Executed 25 Times With the Experimental Dataset.
Class Label | Healthy | Myopathy | Neuropathy |
---|---|---|---|
Healthy | 99.97 | 0 | 0.03 |
Myopathy* | 0.24 | 99.68 | 0 |
Neuropathy** | 0.16 | 0 | 99.76 |
*indeterminate class – 0.08% **indeterminate class – 0.08%.
Most importantly, Table IV summarizes the comparison of performance measures between our method and those reported in the literature using the same iEMG dataset. The last row of Table IV corresponding to MLPNN-BMMV underscores the surpassing performance of the proposed classifier in terms of ,
, and
. To conclude, the classification results from MLPNN-BMMV are quite reliable to aid the diagnosis of neurogenic and myogenic disorders, which is empirically verified by the performance measures (%):
,
,
,
,
,
, and
. In addition, we have listed the classification performance reported by a few binary classifiers (Nor and ALS) with the same iEMG dataset in Supp. Material III-C for comparison.
TABLE IV. Comparison of Performance Measures (%) of the Proposed (Boldfaced) and State-of-the-Art iEMG Classifiers. Nor, Myo, and Neuro (ALS) Represent Normal, Myopathy, and Neuropathy (Amyotrophic Lateral Sclerosis), Respectively. All Approaches Classified the iEMG Data From the Same Dataset.
Method (Year) | Classes/Study Group |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
---|---|---|---|---|---|---|---|---|
[26] (2013) | 3/Nor, Myo, Neuro | 97.00 | 98.70 | 95.30 | 96.00 | 95.60 | 97.60 | 97.50 |
[15] (2014) | 3/Nor, Myo, ALS | 92.55 | 90.30 | 91.00 | 96.33 | 94.83 | 97.16 | 96.83 |
[57] (2014) | 3/Nor, Myo, ALS | 93.08 | 99.00 | 93.25 | 87.00 | 96.62 | 93.00 | 100.00 |
[34] (2015) | 3/Nor, Myo, ALS | 96.67 | 94.75 | 95.66 | 99.58 | 97.83 | 98.00 | 99.17 |
[5] (2015) | 3/Nor, Myo, ALS | 98.00 | 100.00 | 94.00 | 96.00 | 97.00 | 99.50 | 99.50 |
[19] (2015) | 3/Nor, Myo, ALS | 84.00 | 89.33 | 64.00 | 88.00 | – | – | – |
[18] (2016) | 3/Nor, Myo, ALS | 88.60 | 88.00 | 91.00 | 93.00 | – | – | – |
[21] (2019) | 3/Nor, Myo, ALS | 99.03 | 99.58 | 98.50 | 97.59 | – | – | – |
MLPNN-BMMV | 3/Nor, Myo, Neuro |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
![]() |
In proportion to the signals available under each category in the iEMG dataset, we selected an imbalanced dataset (150 normal, 50 myopathy, and 50 neuropathy). Subsequently, we also performed the classification of a balanced dataset, i.e., 50 iEMG signals under each category, using MLPNN-BMMV with a 10-fold CV 25 times. To this end, 50 normal iEMGs were uniform-randomly selected from the available set of 150 signals acquired from healthy subjects. The resulting performance measures (%) are almost similar to those obtained earlier, which are listed below: ,
,
,
,
,
, and
.
This application in MATLAB runs on a Microsoft Windows 10 Home Single Language HP notebook (Intel(R) Core(TM) i5-6200U 2.30 GHz CPU, 4.0 GB 2401 MHz RAM, 64-bit Operating System). The average execution time taken by MLPNN-BMMV to complete 25 trial runs with a 4-fold and 10-fold CV is s and
s, respectively.
B. Limitations of the Study
Admittedly, the study has two inherent limitations owing to the iEMG dataset, even though several researchers have employed this dataset for validating their classification algorithms. (1) Lack of Age-Matched Groups. A one-way-between-subjects analysis of variance (ANOVA) was conducted to compare the age across the three groups. The analysis shows that there is a significant difference in the age across groups [,
]. The post hoc comparisons using the Tukey's honest significance test reveal that the age in the healthy group (
) is significantly lower (
) than the neuropathy group (
), but not than the myopathy group (
,
). Moreover, the age in the neuropathy group is significantly different from the myopathy group (
). (2) Small Sample Size. Even though we considered 250 samples in the dataset, the iEMG signals were recorded from 25 subjects—10 healthy, seven myopathy, and eight neuropathy subjects—and hence classifiers would encounter the problem of small sample size. Strictly speaking, this challenge is difficult to overcome in most instances: small sample size is prevalent in medical/biological datasets, as it is hard to acquire a huge number of samples; besides, the bias is evident even with a sample size of 1000 in the performance estimates obtained from
-fold CV [58]. We attempted to circumvent this shortcoming empirically by changing the
value in the CV (
) and by performing repeated CV (25 times). The fact that the estimates from these approaches (changing
and repeated CV) have a small variance is a good indicator to ensure the quality of the results. Another expedient to obtain robust and unbiased estimates with a small sample size is by resorting to a nested CV recommended by Varma et al in [59], which we will investigate in the future.
IV. Conclusion
Over the past two decades, the classification of iEMG signals into healthy, myopathy, and neuropathy categories has been an effervescent research topic, as it could potentially assist the diagnosis of neuromuscular disorders [5], [7], [8], [11], [13], [15], [16], [21], [26]–[34], [60]. In order to enhance the performance of an iEMG classifier, the extraction of an efficient feature set is deemed crucial [8], [9]. Therefore, research efforts have been dedicated to achieving a higher classification performance by experimenting with diverse sets of features that could maximize the discriminant power with reduced redundancy.
The proposed classifier makes use of fused feature sets, which comprise FDs of LWT subband coefficients from level-five decomposition and 1-D LBP of decimated iEMG signals. The MLPNN implemented with the selected hyperparameters, and
, returns a sequence of class labels for each signal. Subsequently, BMMV assigns a class label to every iEMG signal by applying the majority vote rule to the sequence of labels. Notably, as reported in Section III-A, the classification performance measures of MLPNN-BMMV outclassed the state-of-the-art approaches tested with the same dataset.
We envisage the following future research directions: (i) The performance of MLPNN-BMMV will be tested using the nested CV as mentioned in Section III-B. By rigorously validating our approach with iEMG datasets having age-matched groups and larger sample sizes, an affordable expert system will be designed for real-time applications in clinical practice. (ii) This classification framework will be applied to sEMG/high-density sEMG data to investigate whether it requires to be modified to classify noninvasively recorded EMG data into the underlying categories. (iii) The proposed approach will be tailored to analyze other electrophysiological signals: diagnosis of sleep and emotion disorders from EEG and ECG; predicting the risk of preterm labor from electrohysterogram.
Supplementary Materials
Additional information related to the following sections can be found in the downloadable Supp. Material file: Section I, II-A, II-C, II-D, II-E, II-H, and III-A. Furthermore, the rationale for the choice of rbio3.7 wavelet for performing LWT and the empirical procedure to determine the decimation factor for decomposing the iEMG signals are described. A multimedia file pertaining to this article can be downloaded and viewed from the following link: https://tinyurl.com/iEMG-NMD-Detection.
Acknowledgment
The authors would like to thank Shyamala Magdalene for her help with the illustrations, proofreading, and narration/compilation of the multimedia file. Furthermore, they appreciate the anonymous reviewers and the Associate Editor for their constructive comments that enabled them improve the article.
Funding Statement
This work was supported by the European Union's Horizon 2020 Research and Innovation Program under Grant 826589.
Footnotes
By “disjoint” we mean that the downsampled signals are not overlapping. We remark that disjoint signals do not imply that they are statistically independent.
Contributor Information
Shobha Jose, Email: shobhajose14@karunya.edu.in.
S. Thomas George, Email: thomasgeorge@karunya.edu.
M. S. P. Subathra, Email: subathra@karunya.edu.
Vikram Shenoy Handiru, Email: vshenoy@kesslerfoundation.org.
Poornaselvan Kittu Jeevanandam, Email: p.kittu.jeevanandam@accenture.com.
Umberto Amato, Email: umberto.amato@na.imm.cnr.it.
Easter Selvan Suviseshamuthu, Email: ESelvan@kesslerfoundation.org.
References
- [1].Preston D. C. and Shapiro B. E., Electromyography and Neuromuscular Disorders E-Book: Clinical-Electrophysiologic Correlations (Expert Consult-Online and Print). Amsterdam, The Netherlands: Elsevier Health Sciences, 2012. [Google Scholar]
- [2].Micera S., Carpaneto J., and Raspopovic S., “Control of hand prostheses using peripheral information,” IEEE Rev. Biomed. Eng., vol. 3, pp. 48–68, Oct. 2010. [DOI] [PubMed] [Google Scholar]
- [3].Criswell E., Cram's Introduction to Surface Electromyography. Boston, MA, USA: Jones & Bartlett Publishers, 2010. [Google Scholar]
- [4].Arikidis N. S., Abel E. W., and Forster A., “Interscale wavelet maximum - a fine to coarse algorithm for wavelet analysis of the EMG interference pattern,” IEEE Trans. Biomed. Eng., vol. 49, no. 4, pp. 337–344, Apr. 2002. [DOI] [PubMed] [Google Scholar]
- [5].Naik G. R., Selvan S. E., and Nguyen H. T., “Single-channel EMG classification with ensemble-empirical-mode-decomposition-based ICA for diagnosing neuromuscular disorders,” IEEE Trans. Neural Syst. Rehabil. Eng., vol. 24, no. 7, pp. 734–743, Jul. 2015. [DOI] [PubMed] [Google Scholar]
- [6].Merletti R., Parker P. A., and Parker P. J., Electromyography: Physiology, Engineering, and Non-invasive Applications, vol. 11. Hoboken, NJ, USA: Wiley, 2004. [Google Scholar]
- [7].Subasi A., “Medical decision support system for diagnosis of neuromuscular disorders using DWT and fuzzy support vector machines,” Comput. Biol. Med., vol. 42, no. 8, pp. 806–815, 2012. [DOI] [PubMed] [Google Scholar]
- [8].Subasi A., “Classification of EMG signals using combined features and soft computing techniques,” Appl. Soft Comput., vol. 12, no. 8, pp. 2188–2198, 2012. [Google Scholar]
- [9].Phinyomark A., Phukpattaranont P., and Limsakul C., “Feature reduction and selection for EMG signal classification,” Expert Syst. With Appl., vol. 39, no. 8, pp. 7420–7431, 2012. [Google Scholar]
- [10].Yousefi J. and Hamilton-Wright A., “Characterizing EMG data using machine-learning tools,” Comput. Biol. Med., vol. 51, pp. 1–13, 2014. [DOI] [PubMed] [Google Scholar]
- [11].Pattichis C. S. and Schizas C. N., “Genetics-based machine learning for the assessment of certain neuromuscular disorders,” IEEE Trans. Neural Netw., vol. 7, no. 2, pp. 427–439, Mar. 1996. [DOI] [PubMed] [Google Scholar]
- [12].Elamvazuthi I. et al. , “Electromyography (EMG) based classification of neuromuscular disorders using multi-layer perceptron,” Procedia Comput. Sci., vol. 76, pp. 223–228, 2015. [Google Scholar]
- [13].Koçer S., “Classification of EMG signals using neuro-fuzzy system and diagnosis of neuromuscular diseases,” J. Med. Syst., vol. 34, no. 3, pp. 321–329, 2010. [DOI] [PubMed] [Google Scholar]
- [14].Mishra V. K., Bajaj V., and Kumar A., “Classification of normal, ALS, and myopathy EMG signals using ELM classifier,” in Proc. 2nd Int. Conf. Advances Elect., Electron., Inf., Commun. Bio-informatics, 2016, pp. 455–459. [Google Scholar]
- [15].Gokgoz E. and Subasi A., “Effect of multiscale PCA de-noising on EMG signal classification for diagnosis of neuromuscular disorders,” J. Med. Syst., vol. 38, no. 4, 2014, Art. no. 31. [DOI] [PubMed] [Google Scholar]
- [16].Mishra V. K., Bajaj V., Kumar A., Sharma D., and Singh G. K., “An efficient method for analysis of EMG signals using improved empirical mode decomposition,” AEU-Int. J. Electron. Commun., vol. 72, pp. 200–209, 2017. [Google Scholar]
- [17].Sengur A., Akbulut Y., Guo Y., and Bajaj V., “Classification of amyotrophic lateral sclerosis disease based on convolutional neural network and reinforcement sample learning algorithm,” Health Inf. Sci. Syst., vol. 5, no. 1, 2017, Art. no. 9. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [18].Defino J. and Vasanthi S. M., “Classification of neuromuscular diseases using dominant MUAP based on wavelet domain features and improving its accuracy using SVM,” Int. J. Res. Scientific Innovations, vol. 3, no. 5, pp. 112–118, 2016. [Google Scholar]
- [19].Krishna V. A. and Thomas P., “Classification of EMG signals using spectral features extracted from dominant motor unit action potential,” Int. J. Eng. Adv. Technol., vol. 4, no. 5, pp. 196–200, 2015. [Google Scholar]
- [20].Vallejo M., Gallego C. J., Duque-Muñoz L., and Delgado-Trejos E., “Neuromuscular disease detection by neural networks and fuzzy entropy on time-frequency analysis of electromyography signals,” Expert Syst., vol. 35, no. 4, 2018, Paper e12274. [Google Scholar]
- [21].Hazarika A., Dutta L., Barthakur M., and Bhuyan M., “A multiview discriminant feature fusion-based nonlinear process assessment and diagnosis: Application to medical diagnosis,” IEEE Trans. Instrum. Meas., vol. 68, no. 7, pp. 2498–2506, Jul. 2019. [Google Scholar]
- [22].Kiran P. U., Abhiram N., Taran S., and Bajaj V., “TQWT based features for classification of ALS and healthy EMG signals,” Amer. J. Comput. Sci. and Inf. Technol., vol. 6, 2018, Art. no. 19. [Google Scholar]
- [23].Joshi D., Tripathi A., Sharma R., and Pachori R. B., “Computer aided detection of abnormal EMG signals based on tunable-Q wavelet transform,” in Proc. 4th Int. Conf. Signal Process. Integr. Netw., 2017, pp. 544–549. [Google Scholar]
- [24].Doulah A. B. M. S. U. and Fattah S. A., “Neuromuscular disease classification based on mel frequency cepstrum of motor unit action potential,” in Proc. Int. Conf. Elect. Eng. Inf. Commun. Technol., 2014, pp. 1–4. [Google Scholar]
- [25].Jose S., George S. T., and Roopchand P., “DWT-based electromyogram signal classification using maximum likelihood-estimated features for neurodiagnostic applications,” Signal, Image Video Process., vol. 14, no. 3, pp. 601–608, 2019. [Google Scholar]
- [26].Kamali T., Boostani R., and Parsaei H., “A multi-classifier approach to MUAP classification for diagnosis of neuromuscular disorders,” IEEE Trans. Neural Syst. Rehabil. Eng., vol. 22, no. 1, pp. 191–200, Jan. 2013. [DOI] [PubMed] [Google Scholar]
- [27].Abel E. W., Zacharia P. C., Forster A., and Farrow T. L., “Neural network analysis of the EMG interference pattern,” Med. Eng. Phys., vol. 18, no. 1, pp. 12–17, 1996. [DOI] [PubMed] [Google Scholar]
- [28].Christodoulou C. I. and Pattichis C. S., “Combining neural classifiers in EMG diagnosis,” in Proc. 6th Eur. Congr. Intell. Techn. Soft Comput., 1998, vol. 3, pp. 1837–41. [Google Scholar]
- [29].Pattichis C. S. and Elia A. G., “Autoregressive and cepstral analyses of motor unit action potentials,” Med. Eng. Phys., vol. 21, no. 6-7, pp. 405–419, 1999. [DOI] [PubMed] [Google Scholar]
- [30].Subasi A., Yilmaz M., and Ozcalik H. R., “Classification of EMG signals using wavelet neural network,” J. Neuroscience Methods, vol. 156, no. 1-2, pp. 360–367, 2006. [DOI] [PubMed] [Google Scholar]
- [31].Subasi A., “Classification of EMG signals using PSO optimized SVM for diagnosis of neuromuscular disorders,” Comput. Biol. and Med., vol. 43, no. 5, pp. 576–586, 2013. [DOI] [PubMed] [Google Scholar]
- [32].Subasi A., “A decision support system for diagnosis of neuromuscular disorders using DWT and evolutionary support vector machines,” Signal, Image Video Process., vol. 9, no. 2, pp. 399–408, 2015. [Google Scholar]
- [33].Hazarika A., Dutta L., Boro M., Barthakur M., and Bhuyan M., “An automatic feature extraction and fusion model: Application to electromyogram (EMG) signal classification,” Int. J. Multimedia Inf. Retrieval, vol. 7, no. 3, pp. 173–186, 2018. [Google Scholar]
- [34].Gokgoz E. and Subasi A., “Comparison of decision tree algorithms for EMG signal classification using DWT,” Biomed. Signal Process. Control, vol. 18, pp. 138–144, 2015. [Google Scholar]
- [35].Raghavendra B. and Dutt D. N., “Signal characterization using fractal dimension,” Fractals, vol. 18, no. 03, pp. 287–292, 2010. [Google Scholar]
- [36].Esteller R., Vachtsevanos G., Echauz J., and Litt B., “A comparison of waveform fractal dimension algorithms,” IEEE Trans. Circuits Syst. I: Fundam. Theory Appl., vol. 48, no. 2, pp. 177–183, Feb. 2001. [Google Scholar]
- [37].Coelho A. L. V. and Lima C. A. M., “Assessing fractal dimension methods as feature extractors for EMG signal classification,” Eng. Appl. Artif. Intell., vol. 36, pp. 81–98, 2014. [Google Scholar]
- [38].Ertuğrul Ö. F., Kaya Y., Tekin R., and Almalı M. N., “Detection of Parkinson's disease by shifted one dimensional local binary patterns from gait,” Expert Syst. With Appl., vol. 56, pp. 156–163, 2016. [Google Scholar]
- [39].Tirunagari S., Kouchaki S., Abasolo D., and Poh N., “One dimensional local binary patterns of electroencephalogram signals for detecting Alzheimer's disease,” in Proc. 22nd Int. Conf. Digit. Signal Process., 2017, pp. 1–5. [Google Scholar]
- [40].Kaya Y., Uyar M., Tekin R., and Yıldırım S., “1D-local binary pattern based feature extraction for classification of epileptic EEG signals,” Appl. Math. Comput., vol. 243, pp. 209–219, 2014. [Google Scholar]
- [41].Mondéjar-Guerra V., Novo J., Rouco J., Penedo M. G., and Ortega M., “Heartbeat classification fusing temporal and morphological information of ECGs via ensemble of classifiers,” Biomed. Signal Process. Control, vol. 47, pp. 41–48, 2019. [Google Scholar]
- [42].Yazid M. and Rahman M. A., “Variable step dynamic threshold local binary pattern for classification of atrial fibrillation,” Artif. Intell. Med., vol. 108, 2020, Art. no. 101932. [DOI] [PubMed] [Google Scholar]
- [43].McCool P., Chatlani N., Petropoulakis L., Soraghan J. J., Menon R., and Lakany H., “Lower arm electromyography (EMG) activity detection using local binary patterns,” IEEE Trans. Neural Syst. and Rehabil. Eng., vol. 22, no. 5, pp. 1003–1012, Sep. 2014. [DOI] [PubMed] [Google Scholar]
- [44].Daube J. R. and Rubin D. I., “Needle electromyography,” Muscle Nerve: Official J. Amer. Assoc. Electrodiagnostic Med., vol. 39, no. 2, pp. 244–270, 2009. [DOI] [PubMed] [Google Scholar]
- [45].Nikolic M., “Detailed analysis of clinical electromyography signals: EMG decomposition, findings and firing pattern analysis in controls and patients with myopathy and amytrophic lateral sclerosis,” Ph.D. thesis, Faculty of Health Science, Univ. of Copenhagen, København, Denmark, Aug. 2001. [Google Scholar]
- [46].Sweldens W., “The lifting scheme: A construction of second generation wavelets,” SIAM J. Math. Anal., vol. 29, no. 2, pp. 511–546, 1998. [Google Scholar]
- [47].Sweldens W., “Lifting scheme: A new philosophy in biorthogonal wavelet constructions,” in Proc. Wavelet Appl. Signal Image Process. III, 1995, vol. 2569, pp. 68–79. [Google Scholar]
- [48].Claypoole R. L., Davis G. M., Sweldens W., and Baraniuk R. G., “Nonlinear wavelet transforms for image coding via lifting,” IEEE Trans. Image Process., vol. 12, no. 12, pp. 1449–1459, Dec. 2003. [DOI] [PubMed] [Google Scholar]
- [49].Higuchi T., “Approach to an irregular time series on the basis of the fractal theory,” Physica D: Nonlinear Phenom., vol. 31, no. 2, pp. 277–283, 1988. [Google Scholar]
- [50].Katz M. J., “Fractals and the analysis of waveforms,” Comput. Biol. and Med., vol. 18, no. 3, pp. 145–156, 1988. [DOI] [PubMed] [Google Scholar]
- [51].Kesić S. and Spasić S. Z., “Application of Higuchi's fractal dimension from basic to clinical neurophysiology: A review,” Comput. Methods Programs Biomed., vol. 133, pp. 55–70, 2016. [DOI] [PubMed] [Google Scholar]
- [52].Solhjoo S., Nasrabadi A. M., and Golpayegani M. R. H., “EEG-based mental task classification in hypnotized and normal subjects,” in Proc. 27th Annu. Int. Conf. IEEE Eng. Med. Biol. Soc., 2006, pp. 2041–2043. [DOI] [PubMed] [Google Scholar]
- [53].Ahmadlou M., Adeli H., and Adeli A., “Fractality analysis of frontal brain in major depressive disorder,” Int. J. Psychophysiology, vol. 85, no. 2, pp. 206–211, 2012. [DOI] [PubMed] [Google Scholar]
- [54].Raghavendra B. S., Dutt D. N., Halahalli H. N., and John J. P., “Complexity analysis of EEG in patients with schizophrenia using fractal dimension,” Physiological Meas., vol. 30, no. 8, pp. 795–808, 2009. [DOI] [PubMed] [Google Scholar]
- [55].Tricot C., Curves and Fractal Dimension. Berlin, Germany: Springer Science & Business Media, 1994. [Google Scholar]
- [56].Boyer R. S. and Moore J. S., “MJRTY—A fast majority vote algorithm,” in Automated Reasoning. Berlin, Germany: Springer, 1991, pp. 105–117. [Google Scholar]
- [57].Duque C. J. G., Muñoz L. D., Mejía J. G., and Trejos E. D., “Discrete wavelet transform and k-nn classification in EMG signals for diagnosis of neuromuscular disorders,” in Proc. 2014 XIX Symp. Image, Signal Process. Artif. Vis., 2014, pp. 1–5. [Google Scholar]
- [58].Vabalas A., Gowen E., Poliakoff E., and Casson A. J., “Machine learning algorithm validation with a limited sample size,” PloS One, vol. 14, no. 11, 2019, Paper e0224365. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [59].Varma S. and Simon R., “Bias in error estimation when using cross-validation for model selection,” BMC Bioinf., vol. 7, no. 1, 2006, Art. no. 91. [DOI] [PMC free article] [PubMed] [Google Scholar]
- [60].Katsis C. D., Goletsis Y., Likas A., Fotiadis D. I., and Sarmas I., “A novel method for automated EMG decomposition and MUAP classification,” Artif. Intell. Med., vol. 37, no. 1, pp. 55–64, 2006. [DOI] [PubMed] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.