Abstract
In the last decade, the proactive diagnosis of diseases with artificial intelligence and its aligned technologies has been an exciting and fruitful area. One of the areas in medical care where constant monitoring is required is cardiovascular diseases. Arrhythmia, one of the cardiovascular diseases, is generally diagnosed by doctors using Electrocardiography (ECG), which records the heart’s rhythm and electrical activity. The use of neural networks has been extensively adopted to identify abnormalities in the last few years. It is found that the probability of detecting arrhythmia increases if the denoised signal is used rather than the raw input signal. This paper compares six filters implemented on ECG signals to improve classification accuracy. Custom convolutional neural networks (CCNNs) are designed to filter ECG data. Extensive experiments are drawn by considering the six ECG filters and the proposed custom CCNN models. Comparative analysis reveals that the proposed models outperform the competitive models in various performance metrics.
Keywords: filters, denoising, customized CCNNs, median filters, Gaussian filter, wavelet filters, moving average filters, Savitzky–Golay filters, low-pass Butterworth filters
1. Introduction
Medical assistance with technology has been increasing rapidly over the last few years. It is an efficient way to treat and monitor patients who require constant medical support. Cardiovascular diseases are primal diseases that humans have been facing for many years. The heart is a vital organ in the body, and identification and diagnosis of heart diseases is critical. These diseases are caused by interruption of oxygen supply to heart muscles due to blockages, resulting in muscle death. Reports cite that 17.9 million people die of arrhythmia each year, estimated at 32% of all deaths [1]. With changing lifestyles and food habits worldwide, all the age groups are now being affected by these diseases, and the pattern of identification became random. Generally, arrhythmia is identified by ECG, which records electricity flowing through the heart [2]. ECG must be studied deeply to identify trends in the graph to predict arrhythmia. A person that is affected by the disease and is prone to it needs continuous observation by ECG.
The scarcity of trained medical professionals in lower-middle- and low-income countries is prevalent, especially in rural areas [3]. Even continuous monitoring of heart disease by an individual is so difficult. With ECG data, arrhythmia identification can be made using computer-aided diagnosis systems [2]. Recent advancements in the computer field, such as enhanced GPUs and computer vision and the availability of datasets, have made it possible to identify trends in ECG which indicate these diseases [4]. The use of neural networks has made it fast, easy, and cheap to give accurate results. Many countries worldwide are now focusing on the healthcare sector and giving importance to its annual budget, but it can still not fulfill society’s aspirations. Furthermore, the lack of health workers puts pressure on existing members, which can be solved by automating some tasks and making machines think using deep learning [5].
Solutions with deep learning and machine learning have been suggested for many medical applications, especially those requiring immediate attention, constant monitoring, and patients in critical condition. Deep learning helps in image classification and audio analysis, now widely used in the research community. The significant contributions of this work can be stated as follows:
Denoising the raw ECG data to extract accurate data.
The use of custom convolution neural networks produced 94% and 93% accuracy to analyze the signals and observe the patterns.
The novelty of this paper lies in the comprehensive comparison of six (6) widely used filters for denoising waveforms, finding out the filter which performs best on ECG data. Then, applying the filter onto the data and using the denoised information to train and test the three (3) custom convolutional neural networks (CCNNs) built using a combination of different types of layers and corresponding parameters. The advantages and disadvantages of implementing custom convolutional neural networks (CCNNs) on ECG data are widely discussed.
The paper is organized as follows. Section 2 states the literature review. Section 3 describes the data set and the filters used for denoising. Section 4 represents the architecture of the custom convolution neural networks used. Section 5 presents the results and discussion. Finally, Section 6 concludes the findings of the implementation and future scope of the work.
2. Related Work
Detection of arrhythmia is an important and challenging task. ECG signals are analyzed to identify the patterns and detect the insights. Almahamdy et al. [6] used different denoising techniques to filter out and get a pure ECG signal. Hilbert transform is one of the efficient methods for denoising the signal. Sabut et al. [7] used this method and adaptive filters for filtering out the input ECG. Altan et al. [8] applied Hilbert transform on the initial data to identify the complexes, and the results obtained were used to diagnose heart diseases. Zhang et al. [9] used wavelet energy and sub-smoothing filters to eliminate unwanted disturbances on ECG signals generated while recording the data. Chandra et al. [10] performed denoising techniques and feature extraction to detect arrhythmia. Median filter, Gaussian filter, and other filters were used by Subbiah et al. [11] to denoise the signal and identify which filter gives the best result. Kumar et al. [12] used multidimensional noise cancellation of ECG signal to extract pure ECG signal. Rajeshwari et al. [13] used efficient preprocessing techniques on MRI images. Pace et al. [14] developed interactive algorithm to segment heart chambers, epicardial surfaces and great vessel walls from pedic cardiac MRI of congenital heart diseases. Sraitih et al. [15] implemented different machine learning algorithms such as support vector machine (SVM) K-nearest neighbor(KNN) on the preprocessed ECG data for classifying different types of beats. Berntson et al. [16] used a high-pass filter to measure root mean square successive heart period variability. Ali et al. [17] used different deep learning architectures, such as convolutional neural networks (CNNs), long short-term memory (LSTM), autoencoders, etc., to classify ECG signals to detect arrhythmia. Naz et al. [18] took ECG data as an input, convert the data into a binary image, classify using deep learning, and discussed the limitations of using ECG data rather than image data. Wu et al. [19] used convolutional neural networks on a denoised ECG signal to classify different peaks in the signal. Aziz et al. [20] used two-event corresponding moving average (TERMA) and fractional Fourier transform (FrFT) to extract different peaks and then used machine learning to classify the heart as normal or abnormal.
Pattern detection and feature extraction from the raw data produce multi-layered long-duration crucial in the whole process. Patro et al. [21] used ANN on ECG data to identify patterns based on psychological and geometrical conditions of the heart used for biometric identification. Lastre et al. [22] identified heart abnormalities after FIR smoothing on data. Acharya et al. [23] implemented grayscale feature extraction from electrocardiographic images for identifying CAD using the GMM classifier. Extraction of heart rate signals from ECG to identify CAD by using linear and nonlinear analysis was performed by Acharya et al. [24]. Bhyri et al. [25] used CNN for feature extraction from ECG to identify QRS complexes in the data. Lin et al. [26] efficiently implemented the use of convolutional neural networks for feature extraction using a person’s facial features, which indeed helps identify coronary heart diseases. ML algorithms were used by Akella et al. [27] for feature extraction and classification of heart diseases. Valluraiah et al. [28] identified and located R peaks and QRS detection using the Hilbert transform.
The last phase would identify and classify arrhythmia and heart diseases using the preprocessed data4 and extracted features. Yıldırım et al. [29] used 1-dimensional CNN to identify cardiac arrhythmia with the help of long-duration ECG signals. This method achieved 91% accuracy and took much less time when compared with traditional methods.
Luz et al. [30] performed both preprocessing of ECG data and classification of two significant types of arrhythmias using the preprocessed data. A relevance vector machine (RVM) is used to classify five types of arrhythmias which produced more significant results than Gayathri et al. [31]. Rajpurkar et al. [32] used 34-layered convolutional neural networks to identify arrhythmia using ECG signals. ECG signals are converted into 2-D vectors by Li et al. [33] to detect irregular heartbeats to identify abnormalities that have produced more significant results. Avanzato et al. [34] used multi-layered CNN onto the ECG dataset to classify CAD, and it produced an accuracy of ~98%. Alizadehsani et al. [35] used different ML algorithms to analyze factors that cause CAD and thus detect CAD. Acharya et al. [36] and Acharya et al. [37] used multi-layered CNN for analyzing long-duration ECG signals for detecting CHD. The detailed overview of models, datasets, and their accuracy in diagnosis of heart related diseases are presented in Table 1.
Table 1.
Author | Models | Disease | Datasets | Accuracy |
---|---|---|---|---|
Altan et al. [8] | Deep belief networks | Coronary artery disease | Made a dataset from collecting data | 98.88% |
Ali et al. [17] | CNN, LSTM, RNN | Arrythmia classification | Combination of different publicly available datasets | - |
Naz et al. [18] | Pretrained CNNs | ECG classification | MIT-BIH database | 91.2 |
Wu et al. [19] | Convolutional neural networks | Arrhythmia | MIT-BIH database | 97.41 |
Patro et al. [21] | Artificial neural network | Feature extraction from ECG signals. | MIT-BIH ECG ID database signal | - |
Acharya et al. [23] | Gaussian Mixture Model (GMM) | Coronary artery disease | The CAD datasets from the University California Irvine a database | 95% |
Acharya et al. [24] | Convolution neural network | Coronary artery disease | Physio net databases | 95.11% |
Bhyri et al. [25] | heart diseases | CSE ECG database | around 99% | |
Lin et al. [26] | Deep convolutional neural networks | coronary artery disease | Combination of datasets | 95% |
Akella et al. [27] | SVM, K-NN, artificial neural network | coronary artery disease | UCI dataset | 93.03% |
Yıldırım et al. [29] | 16-layer standard CNN | Arrhythmia | MIT-BIH Arrhythmia database | 86.67% |
Luz et al. [30] | Arrhythmia | MIT-BIH, EDB, AHA, CU, NST databases | - | |
Gayathri et al. [31] | Relevance vector machine | Arrhythmia | MIT/BIH database | RVM boosts generalization capability |
Rajpurkar et al. [32] | 34-layer convolutional neural network | Arrhythmia | Own dataset with a combination of datasets | |
Li et al. [33] | CNN-based classification on ECG signals. | ECG classification | MIT-BIH arrhythmia database, | 99.1% |
Avanzato et al. [34] | Convolutional neural networks | coronary artery disease | MIT-BIH arrhythmia database | 98.33% |
Alizadehsani et al. [35] | ML algorithms | Coronary artery disease | Combination of different datasets | - |
Acharya et al. [36] | 11-layer deep convolutional neural network | congestive heart failure | BIDMC: Congestive Heart Failure Database, Fantasia Database, MIT-BIH database | 99.01% |
Acharya et al. [37] | Time level and frequency domain analysis | Coronary artery disease | CAD dataset | 96.8 |
3. Methodology
This section is as follows. Section 3.1 briefly describing the dataset and its contents. Section 3.2 contains preprocessing information about the data. Section 3.3 explains about different filters applied to ECG signals. Section 3.4 describes the model architecture, which is further explained in Section 3.4.1, Section 3.4.2 and Section 3.4.3 about model 1, model 2, and model 3, respectively. Finally, Section 3.5 explains performance metrics used to evaluate and measure the model’s performance.
3.1. Data Description
Forty-eight half-hour labeled two-channel ambulatory ECG recordings were presented in the MIT-BIH Arrhythmia Database. In the dataset above, 47 people were investigated between 1975 and 1979 in the BIH arrhythmia laboratory [38] and published in 2005. A random selection of twenty-three recordings was made from a collection of 4000 24-h ambulatory ECG recordings collected from a diverse population of outpatients (40%) and inpatients (60%) at Boston’s Beth Israel Hospital. The remaining 25 recordings were selected from the same set to include rare but clinically significant arrhythmias. This was done to ensure the dataset is generalized and contains a variety of arrhythmias. The ECG recording of the database was converted to digital format, and two cardiologists resolved any difference present in data. The annotation of heartbeats is presented in Figure 1.
3.2. Preprocessing
The ECG signal from MIT-BIH Arrhythmia Database was sampled at 360 Hz. The data from the dataset are well-curated, but in the real-world scenario, ECG signals contain noise, so to test and train the proposed model to perform under those circumstances, the random noise was added to the dataset. The denoising of the dataset was performed using wavelet transformation, median filter, 1-D Gaussian filter, Moving Average filter, Savitzky–Golay filter, and low-pass Butterworth filter. The performance of the filters is presented in the result section, out of which the median filter performed well compared to other filters; therefore, we used it to denoise the ECG signal. ECG signals are segmented into heartbeats as per the annotation provided in the dataset. A Z-score normalization procedure normalizes each ECG segment to increase the data and reduce noise.
This procedure overcomes the problem of amplitude scaling and removes the offset effect. After preprocessing, the data in the dataset are first randomized, then divided into training sets and testing sets with 80% and 20% of total data, respectively. Eighty percent of training data is fed to the model to train on the data, and then the 20% test data is used to test and evaluate the model.
3.3. Filters
Noises are unwanted signals in data acquisition that must be denoised for processing signals for critical situations. The authors used six filters to denoise and filter out the noise from the ECG data. The filters are as follows:
-
1.
Median Filter:
The median denoising filter filters out salt and pepper type noise [39]. It is a nonlinear filter. In this filter, a window slides over the input, and in each case, the median of the window is calculated, and the median value replaces other pixels.
-
2.
Gaussian Filter:
A Gaussian denoising filter is also known as a Gaussian blur. It denoises the signal and creates kernels with normal distribution [40]. The window size is restricted to a value in this filter as Gaussian filters generally use an infinite range for each input. A Gaussian impulse has been created that denoises the entire input data or signal.
-
3.
Moving Average Filter:
Smoothing the signal is one of the main tasks performed by the Moving Average filter. This filter finds the average over the data points on the whole input signal to smoothen the signal [41]. This filter smoothens short-term fluctuations and disturbances to observe long-term trends in the signal.
-
4.
Savitzky–Golay filter:
In this filter, some points to fit a polynomial and replace the input with the output, which smoothens the signal on performing this process over the signal input [42,43].
-
5.
Low-Pass Butter Filter:
It is designed to make frequency response as flat as possible on the passband. It allows the input signal to appear at the output until the frequency is lower than the cutoff frequency. It is always assumed to make the signal smooth and keep the frequency low [44].
-
6.
Wavelet Denoising Filter:
Wavelet consists of oscillations where the oscillation begins at 0, increases, and decreases. This filter filters the signal in the wavelet space using the threshold value then inverts the filtered signal to produce the original [43].
3.4. Architecture
This paper presents three (1D) custom CCNN out of which two 1-dimensional (1D) custom CCNN structures (Model 1, Model 2) consisting of five (5) convolution layers, five (5) max-pooling layers, and one (1) fully connected layer. The third custom CCNN (Model-3) consists of four (4) convolution layers, four (4) and three (3) fully connected layers, max-pooling layers convolution layer, dropout, and max-pooling alternate each other in the models.
3.4.1. Model 1
In model 1, as shown in Figure 2, the input shape is (2160, 1), and each layer has RELU as its activation function. Each alternating convolution layer has filters as 400, 256, 178, 88, and 44, respectively, and kernel sizes of 20, 15, 7, 5, and 3. The dropout rate between the first three sets of convolution and max pool layers is 0.5, but it is set to 0.25 for the last two sets. Max pooling layers are set to pool size as 2, consisting of strides which are also set to 2, and padding is present. The final layer, the output layer, has an activation function as a sigmoid.
3.4.2. Model 2
In Model 2 presented in Figure 3, the structure is as follows: a dropout separates the convolution and max-pooling layers. The input layer has an input size of (2160, 1). The convolution layers, including the input layer, have an activation function as RELU. The filters of alternating convolution layers are set to 600, 400, 266, 178, and 88, respectively, and the kernel size of each alternating layer is 20, 15, 10, 7, and 20, respectively. Each convolution layer and max-pooling layer set has a dropout place between them, with a rate of 0.25 throughout the model. Each max pool layer has padding activated, has a max pool size of two (2), and strides set to two (2). The model is flattened before the dense layer, and the dense layer has activation specified as sigmoid.
3.4.3. Model 3
The structure of Model-3 shown in Figure 4 is as follows: a dropout separates the convolution layer and max-pooling layer. The input layer has an input size of (2160, 1). The convolution layers, including the input layer, have an activation function as relu. The filters of alternating convolution layers are set to 256, 128, 72, and 36, respectively, and the kernel size of each layer is 5. Each convolution layer and max-pooling layer set has a dropout place between them, with a rate of 0.50 throughout the model. Each max pool layer has padding activated, has a max pool size of two (2), and strides set to two (2). Before the dense layers, the model is flattened. Three dense layers are placed with neurons (50, 32, 1), activation function used in the first two dense layers is relu, and the activation function specified in the third layer is sigmoid.
3.5. Performance Matrix
The filters are compared using peak to signal noise ratio (PSNR), which is defined as the ratio between the maximum value of the signal to the distorting noise in the signal.
(1) |
where MSE is mean square error, RMSE is root mean squared error, and L represents number of maximum possible intensity levels.
Evaluation of a model is one of the essential steps in building a neural network. Evaluation of a model focuses on testing the model’s performance on the test dataset after training the model. ECG signal classification performance measure is done using loss, accuracy, sensitivity, specificity, precision, and recall.
The confusion matrix, the N × N matrix, where N represents numbers of classes in classification, is one of the critical elements in the performance metric. It is plotted as follows.
True positive (TP) | False Positive (FP) |
True Negative (TN) | False Negative (FN) |
Here, true positive represents that the classified data point is positive and classified as positive by the model. True negative represents that the model’s classified data point is negative and is classified as negative by the model. False positive represents that the classified data point is negative and is classified as positive by the model. False negative represents that the classified data point is positive and is classified as negative by the model.
Accuracy = (TP + TN)/(TP + FP + FN + TN) | (2) |
Sensitivity = TP/(TP + FN) | (3) |
Specificity = TN/(TN + FP) | (4) |
precision = TP/(TP + FP) | (5) |
Recall = TP/(TP + FN) | (6) |
Figure 5 represents the process flow starting from the data and classifying it.
ECG signal is taken from the dataset on which one-hot encoding is performed. These data are normalized using Z-score, and then heartbeat segmentation is done. The data are denoised using different filters such as wavelet transform, low-pass Butterworth filter, Savitzky–Golay filter, moving average filter, median filter, and gaussian filter. The preprocessed and denoised data is now split into training and testing sets and then fed to CCNNs to perform the heartbeat classification.
4. Results and Discussion
This part of the paper as follows: Section 4.1 provides details about the filters used in the study, and their performance is analyzed on the dataset. Comparative analyses of the filters are done in Section 4.1.7 using peak to signal noise ratio (PSNR); Section 4.2 explains and presents the results of three different convolutional neural networks used in this experimental study.
4.1. Denoising
Denoising is a process in which signals are reconstructed and extracted from noisy and mixed signals. Its main goal is to eliminate noise and preserve helpful information. Some denoising techniques are discussed, and their results are presented in this section.
4.1.1. Wavelet Denoising Filter
Wavelet denoising depends on the wavelet representation of the signals. Small values in the wavelet domain are Gaussian noise which can be removed by setting coefficients below threshold or zero, or all coefficients are shrunk toward zero by the given amount. This study used soft thresholding, Bayes Shrink algorithm, and sym8 wavelet in denoising raw signals. It achieved a peak signal-to-noise ratio of 56.9, and the results of wavelet denoising can be seen in Figure 6.
4.1.2. Median Filter
It is a nonlinear filtering technique often used to remove impulse noise from signals. Removal of noise from the raw signal is a preprocessing step. The central concept behind median filtering is to run signal entry by entry, replacing each entry with the median of the neighboring entries. The pattern of neighbors used to find the median is called a window, which slides entry by entry over the entire signal. The peak to signal noise achieved by the median filter in this study is 87.3, and the results of median filter denoising can be seen in Figure 7.
4.1.3. Gaussian Filter
The Gaussian function is the impulse response of the Gaussian filter. The accurate gaussian response would have an infinite impulse response. Convolution of the input signal and Gaussian filter modify and denoise signals. The window size is restricted to a value in this filter as Gaussian filters generally use an infinite range for each input. A gaussian impulse is created, which denoises the entire input data or signal. In this study, the Gaussian filter achieved a peak to signal noise value is 86.5, and the output of the Gaussian filter can be seen in Figure 8.
4.1.4. Moving Average Filter
A moving average filter is simply a low-pass Finite Impulse Response (FIR). This filter is used to regulate an array of sample data or signals. Samples of the input are taken at a time. Then, input samples and an average of those values are taken to signal output. It is found that as the length of the filter increases, the smoothness of the output increases. The peak to signal noise ratio achieved by moving the average filter is 81.05 and the denoised signal produced, and its comparison with a raw signal is presented in Figure 9.
4.1.5. Savitzky–Golay Filter
Savitzky–Golay filters are commonly used to remove signals whose frequency span is significant. Savitzky–Golay filters are also known as digital smoothing polynomial filters or least-squares smoothing filters. These filters perform better than standard averaging FIR filters in some applications, which filters high-frequency signals with noise. Savitzky–Golay filters are more successful at preserving high-frequency signals. The peak to signal noise ratio achieved by the Savitzky–Golay filter is 80.5 in this study, and the filter is implemented using Scipy library with window size 25 and polynomial of order 7 is used to fit the sample. The performance of the Savitzky–Golay filter can be seen in Figure 10.
4.1.6. Low-Pass Butterworth Filter
Butterworth filters are those filters whose frequency is flat after the passband region. The output provided by the low pass filter from DC up to a cut-off frequency f(H) and the signal above the frequency gets rejected by the low-pass Butterworth filter. The peak to signal noise ratio achieved by the low-pass Butterworth filter is 78.6 in this study, and the performance of the low-pass Butterworth filter is shown in Figure 11.
4.1.7. Comparison between Filters
The details of the ECG signals of the MIT-BIH dataset are present in the three files. They are (.hea) denoted as a header file, (.dat) denoted as a binary file, and (.atr) denoted as a binary annotation file.
The header file holds comprehensive information about the ECG signal, such as lead used for the patient and the number of leads used to diagnose diseases, the sampling frequency of the signals, and patient details. Format of the signal is present in the binary file, and information related to beats is stored in the binary file. Multiple filters are implemented on ECG signals before segmenting them into the single-single heartbeats to remove different kinds of noises like muscle artifact noise, electrode motion artifact noise, and baseline wander. A comparison of the outputs of the filters was made using PSNR.
It can be observed from Table 2 that median and Gaussian filters perform better than other filters in removing noise from raw ECG signals.
Table 2.
Filters | Wavelet Transform | Low-Pass Butterworth Filter | Savitzky–Golay Filter | Moving Average | Gaussian Filter | Median Filter |
---|---|---|---|---|---|---|
PSNR | 56.9 | 78.6 | 80.5 | 81.05 | 86.5 | 87.3 |
4.2. Results of CCNNs
This experimental study was conducted on Google colab with Tesla K80 GPU, CPU Intel(R) Xenon(R), RAM 13 Gb. We use the proposed two convolutional neural networks and one Residual neural network. These are trained using heartbeats segmented from the ECG signal database. The results achieved by the models are presented in Table 3 and Table 4. It can be seen from the tables that Model-1 achieved better results than the other two models.
Table 3.
Model | Training Loss | Training Accuracy | Training Sensitivity | Training Specificity | Training Recall | Training Precision | Training F1-Score |
---|---|---|---|---|---|---|---|
Model-3 | 0.0533 | 0.9829 | 0.9598 | 0.9933 | 0.9598 | 0.9853 | 0.9708 |
Model-1 | 0.0373 | 0.9888 | 0.9771 | 0.9942 | 0.9771 | 0.9872 | 0.9762 |
Model-2 | 0.0357 | 0.9907 | 0.9824 | 0.9946 | 0.9824 | 0.9890 | 0.9848 |
Table 4.
Model | Validation Loss | Validation Accuracy | Validation Sensitivity | Validation Specificity | Validation Recall | Validation Precision | Validation F1-Score |
---|---|---|---|---|---|---|---|
Model-3 | 0.3831 | 0.8671 | 0.4081 | 0.8250 | 0.3888 | 0.4351 | 0.3833 |
Model-1 | 0.3171 | 0.8737 | 0.4525 | 0.8502 | 0.4030 | 0.4438 | 0.3859 |
Model-2 | 0.2754 | 0.9325 | 0.4214 | 0.8625 | 0.4214 | 0.5207 | 0.4338 |
4.3. AUC–ROC CURVE
Receiver operating characteristic curve (ROC) is a plot between two parameters—True Positive Rate (TPR) and False Positive Rate (FPR)—which are plotted on Y and X axis, respectively. ROC curve is plotted by computing TPR and FPR at different thresholds and plotting it onto the graph to find the best threshold for the model.
TPR = TP/(TP + FN) | (7) |
FPR = FP/(FP + TN) | (8) |
ROC plot shows the following:
Relationship between sensitivity and specificity. As sensitivity increases specificity increases.
The classification power of the model at different thresholds. As the threshold decreases more data items are classified as positive.
Test accuracy which can be identified as the closer the curve to the top leftmost corner of the graph accurate the model is. An ideal curve would go straight from zero up to the top-left corner and then parallel to the X-axis. The curve which will be nearer to the diagonal would be less accurate.
Area Under Curve (AUC) is used to summarize the performance of the ROC curve. AUC curve is the measure of the ability of the model to distinguish between the classes.
From Figure 12, it is evident that Model-2 curve is much nearer to the top left corner and Model-3 is much nearer to the diagonal. Therefore, Model-2 has a higher AUC value when compared to Model-1 and Model-3. Furthermore, Figure 13, Figure 14 and Figure 15 represent AUC values of the model at different number of training points. Model-2 at any instant has the higher AUC value when compared with Model-1 and Model-3. Therefore, Model-2 has higher performance than Model-1 and Model-3 when the models are compared using ROC curve that is shown in Figure 12 and AUC values.
4.4. Confusion Matrix
Figure 16, Figure 17 and Figure 18 represent confusion matrix of model-1, model-2, and model-3, respectively. Confusion matrix is used to obtain parameters of evaluation such as accuracy, sensitivity, specificity, precession, and recall.
From Table 4, it can be noted that validation accuracies are 86%, 87%, and 93% for Model-3, Model-1, and Model-2, respectively. These values implies that Model-2 had performed well compared to other two models on the fed ECG data.
5. Discussion
The deep learning architectures Model-1, Model-2, and Model-3 proposed for ECG classification are motivated by classification and analysis [45,46]. Various studies are performed using conventional neural networks to characterize abnormal ECG signals. Our ECG monitoring and classification system, which is patient-specific, was developed using a three-layer convolutional neural network structure. R-wave was used to detect ventricular and supraventricular electrical activity in this system, giving 99.60% and 97.60% accuracy, respectively. In Zubain et al. [47], 3-layer convolutional neural network model was trained using R peak ECG beat patterns and achieved an accuracy of 92.7% in detecting five different ECG classes. Four ECG classes were characterized by an 11-layer convolutional neural network using two and five seconds of ECG signal. They yielded an accuracy of 92.50%, a sensitivity of 98.09%, and a specificity of 93.13 for two-second ECG. Furthermore, this system achieved 94.90% accuracy, 99.13% sensitivity, and 81.44% specificity for five seconds of ECG signals (Acharya et al. [48]). Robust features were extracted from ECG signals using alternative convolutional, pooling, and dropout layers. Then, features were linked to fully connected layers for ECG signal characterization. The results presented in Table 3 and Table 4 reveal that the proposed model achieves remarkable results. Moreover, the implementation of the proposed architecture is economical and needs light hardware because architecture needs only 1-dimensional convolutions.
We have trained the proposed models with six epochs with 32 batch size. We have considered the epoch value as 6 as if we try with higher epoch size models will start memorizing the ECG patterns, and, thus, may lead to overfitting problem. Model-1 consists of 1,956,651 trainable parameters and has an average epoch time of 401 s. Model-2 consists of 5,274,443 trainable parameters and has an average epoch time of 883 s. Model-3 consists of 462,167 trainable parameters and has an epoch time as 163 s. Thus, Model-2 achieves better average epoch computation time than other models. The overall time complexity is . Here, represents length of the filter. is the length of the input. is the depth of the filter. defines the number of filters.
The advantages of proposed models are as follows:
The proposed CCNN model is robust.
There is no requirement for QRS detection.
CCNN structure consists of feature extraction, selection, and classification.
The proposed model is light on the computation side; it is cost-effective.
The limitations are as follows:
The training phase of CCNN is much higher.
A huge database is required to fulfill the training criteria.
CCNN required a fixed ECG signal; thus, ECG signal length must be fixed for both the training and testing phase.
6. Conclusions and Future Scope
The most common cause of heart attack is coronary artery disease (CAD). Despite significant technical developments, an automated diagnosis method that is both reliable and efficient is required for the early detection of CAD. Three custom CCNN structures (Model 1, Model 2, and Model 3) were built out of which first two models consisted of five convolutional layers, five max-pooling layers, and one fully connected layer, and the third Model-3 consists of four convolutional layers, four max-pooling layers, and three fully connected layers to detect two classes (regular and CAD). Model-1 achieved 93.03% accuracy, 52.18% sensitivity, and 84.45% specificity. While Model-2 achieved 89.03% accuracy, 47.92% sensitivity, and 95.88% specificity. Model-3 achieved 89.56% accuracy, 47.48% sensitivity, and 87.20% specificity. The new technique can aid clinicians in accurately diagnosing coronary artery disease.
The approach is easy to use, inexpensive, and suitable for cardiac screening in developing countries. The scientists can use an extensive database to improve the CCNN structure in future research. This research can also be applied to the early detection of coronary artery disease (CAD), different phases of myocardial infarction (MI), and congestive heart failure (CHF) utilizing ECG signals. This will aid medics in providing appropriate medication and saving lives.
Acknowledgments
We gratefully acknowledge the support of the research from the Symbiosis Centre for Applied Artificial Intelligence (SCAAI) and Symbiosis International University (SIU) for providing us with resources and a platform to develop the paper. This work was also supported by Taif University Researchers Supporting Project Number (TURSP-2020/114), Taif University, Taif, Saudi Arabia.
Author Contributions
Data curation: A.M., G.D. and S.G.; Writing original draft: A.M., G.D. and S.G.; Supervision: S.G. and K.K.; Project administration: S.G., K.K. and A.Z.; Conceptualization: A.M. and G.D.; Methodology: A.M., G.D., S.G. and M.K.; Validation: D.K., H.-N.L. and K.K.; Visualization: A.M. and G.D.; Resources: A.M. and M.K.; Review and Editing: S.G., K.K., A.Z. and D.K.; Funding acquisition: M.K. and H.-N.L. All authors have read and agreed to the published version of the manuscript.
Funding
This work was supported in part by the National Research Foundation of Korea (NRF) Grant funded by the Korean government (MSIP) (NRF-2021R1A2B5B03002118). This research was supported by the Ministry of Science and ICT (MSIT), Korea, under the ITRC (Information Technology Research Center) support program (IITP-2021-0-01835) supervised by the IITP (Institute of Information & Communications Technology Planning & Evaluation).
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
MIT-BIH Arrhythmia dataset used in this research to train three custom convolutional neural networks and validate results. The authors confirmed that this dataset is available publicly in the physionet database.
Conflicts of Interest
The authors declare no conflict of interest.
Footnotes
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
References
- 1. [(accessed on 15 October 2021)]. Available online: https://www.who.int/health-topics/cardiovascular-diseases#tab=tab_1.
- 2.Thakor N.V., Zhu Y.S. Applications of adaptive filtering to ECG analysis: Noise cancellation and arrhythmia detection. IEEE Trans. Biomed. Eng. 1991;38:785–794. doi: 10.1109/10.83591. [DOI] [PubMed] [Google Scholar]
- 3.Zühlke L., Lawrenson J., Comitis G., De Decker R., Brooks A., Fourie B., Hugo-Hamman C. Congenital heart disease in low-and lower-middle-income countries: Current status and new opportunities. Curr. Cardiol. Rep. 2019;21:163. doi: 10.1007/s11886-019-1248-z. [DOI] [PubMed] [Google Scholar]
- 4.Kamaruddin N.H., Murugappan M., Omar M.I. Early prediction of Cardiovascular Diseases using ECG signals; Proceedings of the Congenital Heart Disease in Low-and Lower-Middle-Income Countries: Current Status and New Opportunities); Pulau Pinang, Malaysia. 5–6 December 2012; Piscataway, NJ, USA: IEEE; 2012. pp. 48–53. [Google Scholar]
- 5.Rajkumar A., Ganesan M., Lavanya R. Arrhythmia classification on ECG using Deep Learning; Proceedings of the 2019 5th International Conference on Advanced Computing & Communication Systems (ICACCS); Coimbatore, India. 15–16 March 2019; Piscataway, NJ, USA: IEEE; 2019. pp. 365–369. [Google Scholar]
- 6.AlMahamdy M., Riley H.B. Performance study of different denoising methods for ECG signals. Procedia Comput. Sci. 2014;37:325–332. doi: 10.1016/j.procs.2014.08.048. [DOI] [Google Scholar]
- 7.Sahoo S., Biswal P., Das T., Sabut S. De-noising of ECG Signal and QRS Detection Using Hilbert Transform and Adaptive Thresholding. Procedia Technol. 2016;25:68–75. doi: 10.1016/j.protcy.2016.08.082. [DOI] [Google Scholar]
- 8.Gokhan A., Novruz A., Yakup K. Diagnosis of Coronary Artery Disease Using Deep Belief Networks. Eur. J. Eng. Nat. Sci. 2017;2:29–36. [Google Scholar]
- 9.Zhang D., Wang S., Li F., Wang J., Kumar A., Sheng V., Ding X. An ECG Signal De-Noising Approach Based on Wavelet Energy and Sub-Band Smoothing Filter. Appl. Sci. 2019;9:4968. doi: 10.3390/app9224968. [DOI] [Google Scholar]
- 10.Chandra S., Sharma A., Singh G.K. Feature extraction of ECG signal. J. Med. Eng. Technol. 2018;42:306–316. doi: 10.1080/03091902.2018.1492039. [DOI] [PubMed] [Google Scholar]
- 11.Stalin S., Rajkumar P., Rajendran K. Reduction of Noises in ECG Signal by Various Filters. Int. J. Eng. Res. Technol. 2014;3:656–660. doi: 10.13140/RG.2.2.35951.89763. [DOI] [Google Scholar]
- 12.Bhoi A.K., Sherpa K.S., Phurailatpam D., Tamang J.S., Giri P.K. Multidimensional approaches for noise cancellation of ECG signal; Proceedings of the 2015 International Conference on Communications and Signal Processing (ICCSP); Melmaruvathur, India. 2–4 April 2015; pp. 66–70. [DOI] [Google Scholar]
- 13.Rajeshwari S., Sharmila T. Efficient quality analysis of MRI images using preprocessing techniques; Proceedings of the 2013 IEEE Conference on Information and Communication Technologies; Thuckalay, India. 11–12 April 2013; pp. 391–396. [DOI] [Google Scholar]
- 14.Pace D., Dalca A., Geva T., Powell A., Moghari M.H., Golland P. Interactive Whole-Heart Segmentation in Congenital Heart Disease. Med. Image Comput. Comput. Assist. Interv. 2015;9351:80–88. doi: 10.1007/978-3-319-24574-4_10. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Sraitih M., Jabrane Y., Hassani A.H.E. An Automated System for ECG Arrhythmia Detection Using Machine Learning Techniques. J. Clin. Med. 2021;10:5450. doi: 10.3390/jcm10225450. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Gary B., David L., Chen Y.-J. Filter properties of root mean square successive difference (RMSSD) for heart rate. Psychophysiology. 2005;42:246–252. doi: 10.1111/j.1469-8986.2005.00277.x. [DOI] [PubMed] [Google Scholar]
- 17.Khan A.H., Hussain M., Malik M.K. Arrhythmia Classification Techniques Using Deep Neural Network. Complexity. 2021;2021:9919588. doi: 10.1155/2021/9919588. [DOI] [Google Scholar]
- 18.Naz M., Shah J.H., Khan M.A., Sharif M., Raza M., Damaševičius R. From ECG signals to images: A transformation based approach for deep learning. PeerJ Comput. Sci. 2021;7:e386. doi: 10.7717/peerj-cs.386. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 19.Wu M., Lu Y., Yang W., Wong S.Y. A Study on Arrhythmia via ECG Signal Classification Using the Convolutional Neural Network. Front. Comput. Neurosci. 2021;14:564015. doi: 10.3389/fncom.2020.564015. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 20.Saira A., Sajid A., Alouini M.-S. ECG-based machine-learning algorithms for heartbeat classification. Sci. Rep. 2021;11:18738. doi: 10.1038/s41598-021-97118-5. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Kiran P., Kumar P. Effective Feature Extraction of ECG for Biometric Application. Procedia Comput. Sci. 2017;115:296–306. doi: 10.1016/j.procs.2017.09.138. [DOI] [Google Scholar]
- 22.Lastre-Domínguez C., Shmaliy Y.S., Ibarra-Manzano O., Munoz-Minjares J., Morales-Mendoza L.J. ECG Signal Denoising and Features Extraction Using Unbiased FIR Smoothing. BioMed Res. Int. 2019;2019:2608547. doi: 10.1155/2019/2608547. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Acharya U.R., Sree S.V., Muthu Rama Krishnan M., Krishnananda N., Ranjan S., Umesh P. Automated classification of patients with coronary artery disease using grayscale features from left ventricle echocardiographic images. Comput. Methods Programs Biomed. 2013;112:624–632. doi: 10.1016/j.cmpb.2013.07.012. [DOI] [PubMed] [Google Scholar]
- 24.Acharya U.R., Fujita H., Oh S.L., Adam M., Tan J.H., Chua K. Automated Detection of Coronary Artery Disease Using Different Durations of ECG Segments with Convolutional Neural Network. Knowl. Based Syst. 2017;132:62–71. doi: 10.1016/j.knosys.2017.06.003. [DOI] [Google Scholar]
- 25.Bhyri C., Hamde S., Waghmare L. ECG feature extraction and disease diagnosis. J. Med. Eng. Technol. 2011;35:354–361. doi: 10.3109/03091902.2011.595530. [DOI] [PubMed] [Google Scholar]
- 26.Lin S., Li Z., Fu B., Chen S., Li X., Wang Y., Wang X., Lv B., Xu B., Song X., et al. Feasibility of using deep learning to detect coronary artery disease based on facial photo. Eur. Heart J. 2020;41:4400–4411. doi: 10.1093/eurheartj/ehaa640. [DOI] [PubMed] [Google Scholar]
- 27.Akella A., Kaushik V. Machine Learning Algorithms for Predicting Coronary Artery Disease: Efforts toward an Open-Source Solution. Future Sci. OA. 2020;7:FSO698. doi: 10.2144/fsoa-2020-0206. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 28.Valluraiah P., Biswal B. ECG signal analysis using Hilbert transform; Proceedings of the 2015 IEEE Power, Communication and Information Technology Conference (PCITC); Bhubaneswar, India. 15–17 October 2015; pp. 465–469. [DOI] [Google Scholar]
- 29.Yıldırım Ö., Pławiak P., Tan R.S., Acharya U.R. Arrhythmia Detection Using Deep Convolutional Neural Network with Long Duration ECG Signals. Comput. Biol. Med. 2018;102:411–420. doi: 10.1016/j.compbiomed.2018.09.009. [DOI] [PubMed] [Google Scholar]
- 30.Luz E., Schwartz W., Chávez G., Menotti D. ECG-based Heartbeat Classification for Arrhythmia Detection: A Survey. Comput. Methods Programs Biomed. 2015;127:144–164. doi: 10.1016/j.cmpb.2015.12.008. [DOI] [PubMed] [Google Scholar]
- 31.Gayathri S., Manikandan S., Latha V. ECG Arrhythmia Detection and Classification Using Relevance Vector Machine. Procedia Eng. 2012;38:1333–1339. doi: 10.1016/j.proeng.2012.06.164. [DOI] [Google Scholar]
- 32.Rajpurkar P., Hannun A., Haghpanahi M., Bourn C., Ng A. Cardiologist-Level Arrhythmia Detection with Convolutional Neural Networks. Nat. Med. 2017;25:65–69. doi: 10.1038/s41591-018-0268-3. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Li J., Si Y., Xu T., Jiang S. Deep Convolutional Neural Network Based ECG Classification System Using Information Fusion and One-Hot Encoding Techniques. Math. Probl. Eng. 2018;2018:7354081. doi: 10.1155/2018/7354081. [DOI] [Google Scholar]
- 34.Roberta A., Beritelli F. Automatic ECG Diagnosis Using Convolutional Neural Network. Electronics. 2020;9:951. doi: 10.3390/electronics9060951. [DOI] [Google Scholar]
- 35.Alizadehsani R., Abdar M., Roshanzamir M., Khosravi A., Kebria P., Khozeimeh F., Nahavandi S., Sarrafzadegan N., Acharya U.R. Machine learning-based coronary artery disease diagnosis: A comprehensive review. Comput. Biol. Med. 2019;111:103346. doi: 10.1016/j.compbiomed.2019.103346. [DOI] [PubMed] [Google Scholar]
- 36.Acharya U.R., Fujita H., Oh S.L., Hagiwara Y., Tan J.H., Adam M., Tan R.S. Deep Convolutional Neural Network for the Automated Diagnosis of Congestive Heart Failure Using ECG Signals. Appl. Intell. 2019;49:16–27. doi: 10.1007/s10489-018-1179-1. [DOI] [Google Scholar]
- 37.Acharya U.R., Faust O., Subbhuraam V.S., Goutham S., Martis R., Kadri N.A., Suri J. Linear and nonlinear analysis of normal and CAD-affected heart rate signals. Comput. Methods Programs Biomed. 2014;113:55–68. doi: 10.1016/j.cmpb.2013.08.017. [DOI] [PubMed] [Google Scholar]
- 38.Moody G.B., Mark R.G. The MIT-BIH Arrhythmia Database on CD-ROM and software for use with it. Comput. Cardiol. 1990;17:185–188. [Google Scholar]
- 39.Hao W., Chen Y., Xin Y. ECG baseline wander correction by mean-median filter and discrete wavelet transform; Proceedings of the 2011 Annual International Conference of the IEEE Engineering in Medicine and Biology Society; Boston, MA, USA. 30 August–3 September 2011; Piscataway, NJ, USA: IEEE; 2011. pp. 2712–2715. [DOI] [PubMed] [Google Scholar]
- 40.Deng G., Cahill L.W. An adaptive Gaussian filter for noise reduction and edge detection; Proceedings of the 1993 IEEE Conference Record Nuclear Science Symposium and Medical Imaging Conference; San Francisco, CA, USA. 31 October–6 November 1993; Piscataway, NJ, USA: IEEE; 1993. pp. 1615–1619. [Google Scholar]
- 41. [(accessed on 21 February 2022)]. Available online: https://en.wikipedia.org/wiki/Moving_average#:~:text=In%20statistics%2C%20a%20moving%20average,of%20finite%20impulse%20response%20filter.
- 42.Luo J., Ying K., Bai J. Savitzky–Golay smoothing and differentiation filter for even number data. Signal Process. 2005;85:1429–1434. doi: 10.1016/j.sigpro.2005.02.002. [DOI] [Google Scholar]
- 43.Singh B.N., Tiwari A.K. Optimal selection of wavelet basis function applied to ECG signal denoising. Digit. Signal Process. 2006;16:275–287. doi: 10.1016/j.dsp.2005.12.003. [DOI] [Google Scholar]
- 44.Basu S., Mamud S. Comparative Study on the Effect of Order and Cut off Frequency of Butterworth Low Pass Filter for Removal of Noise in ECG Signal; Proceedings of the 2020 IEEE 1st International Conference for Convergence in Engineering (ICCE); Kolkata, India. 5–6 September 2020; pp. 156–160. [DOI] [Google Scholar]
- 45.Howard A.G. Some Improvements on Deep Convolutional Neural Network Based Image Classification. arXiv. 20141312.5402 [Google Scholar]
- 46.Krizhevsky A., Sutskever I., Hinton G. ImageNet Classification with Deep Convolutional Neural Networks. Neural Inf. Process. Syst. 2012;60:84–90. doi: 10.1145/3065386. [DOI] [Google Scholar]
- 47.Zubair M., Kim J., Yoon C. An automated ECG beat classification system using convolutional neural networks; Proceedings of the 6th International Conference on IT Convergence and Security (ICITCS); Prague, Czech Republic. 26 September 2016; pp. 1–5. [Google Scholar]
- 48.Acharya U.R., Fujita H., Lih O.S., Hagiwara Y., Tan J.H., Adam M. Automated detection of arrhythmias using different intervals of tachycardia ECG segments with a convolutional neural network. Inf. Sci. 2017;405:81–90. doi: 10.1016/j.ins.2017.04.012. [DOI] [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
MIT-BIH Arrhythmia dataset used in this research to train three custom convolutional neural networks and validate results. The authors confirmed that this dataset is available publicly in the physionet database.