Abstract
Background and Objective. The new coronavirus disease (known as COVID-19) was first identified in Wuhan and quickly spread worldwide, wreaking havoc on the economy and people's everyday lives. As the number of COVID-19 cases is rapidly increasing, a reliable detection technique is needed to identify affected individuals and care for them in the early stages of COVID-19 and reduce the virus's transmission. The most accessible method for COVID-19 identification is Reverse Transcriptase-Polymerase Chain Reaction (RT-PCR); however, it is time-consuming and has false-negative results. These limitations encouraged us to propose a novel framework based on deep learning that can aid radiologists in diagnosing COVID-19 cases from chest X-ray images. Methods. In this paper, a pretrained network, DenseNet169, was employed to extract features from X-ray images. Features were chosen by a feature selection method, i.e., analysis of variance (ANOVA), to reduce computations and time complexity while overcoming the curse of dimensionality to improve accuracy. Finally, selected features were classified by the eXtreme Gradient Boosting (XGBoost). The ChestX-ray8 dataset was employed to train and evaluate the proposed method. Results and Conclusion. The proposed method reached 98.72% accuracy for two-class classification (COVID-19, No-findings) and 92% accuracy for multiclass classification (COVID-19, No-findings, and Pneumonia). The proposed method's precision, recall, and specificity rates on two-class classification were 99.21%, 93.33%, and 100%, respectively. Also, the proposed method achieved 94.07% precision, 88.46% recall, and 100% specificity for multiclass classification. The experimental results show that the proposed framework outperforms other methods and can be helpful for radiologists in the diagnosis of COVID-19 cases.
1. Introduction
The new Coronavirus disease, also known as COVID-19, was initially discovered in Wuhan, China, in December 2019 [1]. COVID-19 is the name of the disease, and SARS-CoV-2 is the name of the virus. This novel infection spread from Wuhan to a large portion of China in less than 30 days [2]. On 11 March 2021, the World Health Organization (WHO) declared the outbreak a pandemic [3, 4]. Since 19 November 2020, the COVID-19 pandemic has had a detrimental effect on the world, with approximately 219,456,675 confirmed cases and 4,547,782 deaths reported till 27 September 2021; in addition, nearly 7.7 million workers have lost their jobs in America [5]. The global recession and closure of schools and institutions worldwide have significantly impacted mental and physical health [3]. The majority of Coronaviruses infect animals; however, due to their zoonotic nature, they can infect humans [6]. As a result, it can potentially infect human airway cells, leading to pneumonia, severe respiratory infections, renal failure, and even death. Fever, cough, sore throat, headache, weariness, muscle soreness, and shortness of breath are common COVID-19 symptoms [7].
Vivid screening of infected individuals allows them to be isolated and treated and is a crucial and essential step in combating COVID-19 [1]. Reverse Transcriptase-Polymerase Chain Reaction (RT-PCR) testing, which can identify SARS-CoV-2 RNA from respiratory material, is the most common technique for detecting COVID-19 patients [8]. It requires specialized materials and equipment that are not readily available; because of the large number of false-negative results, it takes at least 12 hours, which is inconvenient considering that positive COVID-19 patients should be identified and followed up on as soon as possible [4, 9, 10]. Chest CT scan is another option for detecting the disease, which is more accurate than RT-PCR; For instance, 75% of negative RT-PCR samples had positive results on chest CT scans [11]. CT scans have several drawbacks, including image collection time, related cost, and CT equipment availability [12]. When compared to CT scans, X-ray images are less expensive and more easily available [13]. As a result, the focus of the research is only on the use of X-ray imaging as a screening tool for COVID-19 patients.
Researchers discovered that COVID-19 patients' lungs contain visual markings such as ground-glass opacities—hazy darker areas that may distinguish COVID-19-infected individuals from noninfected patients [14, 15]. However, due to the limitations of experts, time constraints, and the irreversible consequences of misdiagnosis [6], it is crucial to discover a different approach to get faster and more reliable outcomes. The technological advancements facilitate the process of diagnosing the diseases; in other words, the widespread use of Artificial Intelligence (AI) [16], mainly its areas such as machine learning and deep learning, is extremely constructive, and researchers have made significant use of AI and deep learning in various medical areas [4, 17]. Convolutional neural network (CNN) architecture is one of the most prominent deep learning techniques in the medical imaging field, with outstanding results [18].
Pretrained neural networks are used in this paper, which is one of the most recent techniques. Using easily accessible pretrained models, the proposed method extracts features from X-ray images. We utilize one of the feature selection methods in the second phase to acquire an appropriate number of features for classification. Finally, we use the eXtreme Gradient Boosting (XGBoost) classifier to classify the specified features. A preprint of this paper has previously been published [19]. The major contributions of this paper are summarized as follows:
We used an easily accessible pretrained model, DenseNet169 for feature extraction, and XGBoost for classification as a brand-new approach.
Using a feature selection approach, i.e., analysis of variance (ANOVA), improves prediction performance and obtains an adequate number of features for classification and reduces complexity.
Effectiveness of the proposed method is evaluated using the ChestX-ray8 dataset. The experimental results show that the proposed method effectively classifies COVID-19 cases, and the classification accuracy is considerably increased.
The rest of the paper is organized as follows. Section 2 describes related works. In Section 3, used materials and methods will be presented. In Section 4, the experimental results are reported and analyzed. Finally, Section 5 will present a summary of the findings and conclusions.
2. Related Works
Researchers worldwide are now fighting against COVID-19; using radiological imaging and deep learning has made significant progress in this approach. Wang et al. [8] developed COVID-Net, a deep model for COVID-19 detection that categorized normal, non-COVID-19 pneumonia, and COVID-19 classes with 92.4 percent accuracy. Apostolopoulos and Mpesiena [20] applied transfer learning and employed COVID-19, healthy, and pneumonia X-ray images to develop their model. Ozturk et al. [6] proposed using the DarkNet model to build a deep network. This model contains 17 convolution layers and utilizes the Leaky ReLU activation function. The mentioned model was 98.08% accurate for binary classes, and for multiclass cases, it was 87.02% accurate. Nasiri and Hasani [21] employed DenseNet169 to extract features from X-ray images and used XGBoost for classification; they gained 98.24% and 89.70% in binary and multiclass classification, respectively.
Qaid et al. [22] applied deep and transfer learning approaches to build reliable, general, and robust models for identifying COVID-19. Abdulkareem and Mpesiena [23] proposed a model to detect COVID-19 cases in smart hospitals utilizing machine learning and the Internet of Things. Chen and Rezaei [24] proposed a method for extracting 18 different features from X-ray images. The minimal features are chosen using a metaheuristic algorithm called the Archimedes optimization to reduce the approach's complexity. Khorami et al. [25] proposed a method for extracting a combination of gray-level co-occurrence matrix (GLCM) and Discrete Wavelet Transform (DWT) features from X-ray images, followed by classification of the images using an improved CNN model, based on the Red Fox Optimization algorithm. Waheed et al. [26] developed an Auxiliary Classifier Generative Adversarial Network (ACGAN)-based model called CovidGAN to create synthetic X-ray images and improve the accuracy of COVID-19 classification.
Sethy et al. [27] devised an in-depth feature combined support vector machine (SVM) based method for detecting coronavirus-infected individuals using X-ray images. SVM is examined for COVID-19 identification utilizing the deep features of 13 different CNN models. Fareed Ahmad et al. [28] utilized X-ray images for training deep CNN models like MobileNet, ResNet50, and InceptionV3 with a variety of options, including starting from scratch, fine-tuning with learned weights of all layers, and fine-tuning with learned weights and augmentation. Abbas et al. [29] verified a deep CNN termed Decompose, Transfer, and Compose (DeTraC) for COVID-19 chest X-ray image classification. Wang et al. [30] presented the Parallel Channel Attention Feature Fusion Module (PCAF) and a new convolutional neural network MCFF-Net based on PCAF. The network uses three classifiers to boost recognition efficiency: 1-FC, GAP-FC, and Conv1-GAP. Ucar and Korkmaz [31] developed the SqueezeNet that goes toward its light network design, is optimized for the COVID-19 detection with the Bayesian optimization additive.
Additionally, Kang et al. [32] presented a transfer learning model that handles a dataset of COVID-19-infected patients' CT images. They achieved a test accuracy of 79.3%. Khan et al. [1] represented CoroNet, a deep CNN model for automatic diagnosing of COVID-19 from chest X-ray images. The proposed model is built on the Xception architecture. Narin et al. [33] proposed five models for diagnosing people with pneumonia and coronavirus using X-ray images.
Similarly, He et al. [34] created a deep learning method to categorize COVID-19. They scanned 746 CT images, 349 of which were of infected patients and 397 of healthy people. The Self-Trans technique is proposed in this approach, which combines contrastive self-supervised learning with transfer learning to gain unbiased and robust feature representations while avoiding overfitting, resulting in a 94% accuracy rate. Xu et al. [35] applied deep learning techniques to create an early screening model to discriminate COVID-19 from influenza-A viral pneumonia and healthy cases using chest CT scans. Hemdan et al. [36] used 50 validated chest X-ray images and 25 confirmed positive COVID-19 cases and developed the COVIDX-Net, which incorporates seven distinct architectures of deep convolutional neural network models, such as VGG19 as well as the second version of Google MobileNet. Minaee et al. [37] used publicly available datasets to build a dataset of 5000 chest X-rays. A board-certified radiologist discovered images that showed the existence of the COVID-19 virus. Four prominent convolutional neural networks were trained to detect COVID-19 disease using transfer learning.
3. Materials and Methods
The proposed method employs the DenseNet169 deep neural network, the ANOVA feature selection method, and the XGBoost algorithm, which will be discussed in the following section.
3.1. DenseNet169
A CNN's overall architecture is composed of two core parts: a feature extractor and a classifier. Convolution and pooling layers are the two essential layers of CNN architecture. Each node in the convolution layer extracts features from the input images by performing a convolution operation on the input nodes. The max-pooling layer abstracts the features by averaging or calculating the maximum value of input nodes [38, 39]. DenseNet is a highly supervised network containing a 5-layer dense block with a k = 4 rate of growth and the standard ResNet structure. Each layer's output in a DenseNet dense block includes the output of all previous layers, incorporating both low-level and high-level features of the input image, making it suitable for object detection [40]. The ILSVRC 2012 classification dataset used for training DenseNet contains 1,000 classes and 1.2 million images. The dataset images were cropped with the size of 224 × 224 before using as input for DenseNet. DenseNet presented a new connectivity pattern that introduced direct connections from any layer to all the following layers to further improve information flow across layers [41]. In DenseNet, the l th layer takes all feature maps x0, x1, x2,…, xl−1 from the preceding layers as input, which is described by
(1) |
where Hl(·) is a singular tensor and [x0, x1, x2,…, xl−1] is the concatenated features from l − 1 layers. To preserve the feature-map size constant, each side of the inputs is zero-padded by one pixel for convolutional layers with kernel size 3 × 3. DenseNet employed 1 × 1 convolution and 2 × 2 average pooling as transition layers between adjoining dense blocks. A global average pooling is conducted at the end of the last dense block, and then a Softmax classifier is connected. In the three dense blocks, the feature-map sizes are 32 × 32, 16 × 16, and 8 × 8, respectively. On five distinct competitive benchmarks, this innovative architecture reached state-of-the-art accuracy for recognizing the objects [38, 41].
3.2. Analysis of Variance Feature Selection
New issues develop as a result of the creation of large datasets. Consequently, reliable and unique feature selection approaches are required [42]. Feature selection can assist with data visualization and understanding and minimize measurement and storage needs, training and utilization times, and overcoming the curse of dimensionality to enhance prediction performance [42, 43]. ANOVA is a well-known statistical approach for comparing several independent means [44]. The ANOVA approach ranks features by calculating the ratio of variances between and within groups [45].
The ratio indicates how strongly the λ th feature is linked to the group variables. The following equation is used to calculate the ratio F value of λ th g-gap dipeptide in two benchmark datasets:
(2) |
where sB2(λ) and sW2(λ) are the sample variance between groups (also known as Mean Square Between, MSB) and within groups (also known as Mean Square Within, MSW), respectively, and can be calculated as
(3) |
(4) |
The degrees of freedom for MSB and MSW are dfB=K − 1 and dfW=N − K, respectively. The number of groups and the total number of samples are represented by K and N, respectively. The frequency of the λ th feature in the j th sample in the i th group is denoted by fij(λ). The number of samples in the i th group is denoted by ni [46].
3.3. Extreme Gradient Boosting (XGBoost)
Chen and Guestrin proposed an efficient and scalable variation of the Gradient Boosting algorithm called XGBoost. XGBoost has been widely employed by data scientists recently, and it had desirable results in a wide range of machine learning competitions [47, 48]. In certain ways, XGBoost differs from Gradient Boosted Decision Trees (GBDT). First of all, the GBDT algorithm only employs a first-order Taylor expansion, whereas XGBoost augments the loss function with a second-order Taylor expansion. Secondly, the objective function uses normalization to prevent overfitting and reduce the method's complexity [49–51]. Third, XGBoost is extremely adaptable, allowing users to create their own optimization objectives and evaluation criteria. Nevertheless, by establishing class weight and using Area Under the Curve (AUC) as an assessment criterion, the XGBoost classifier can handle unbalanced training data efficiently. In summary, XGBoost is a scalable and flexible tree structure improvement model that can manage sparse data, enhance algorithm speed, and minimize computing time and memory for large-scale data [52].
Formally, the XGBoost algorithm can be described as follows.
Given a training dataset of n samples T={(x1, y1), (x2, y2),…, (xn, yn)}xi ∈ ℝmyi ∈ ℝ, the objective function can be defined by
(5) |
where measures the difference between the target yi and the prediction and ft denotes the prediction score of t th tree [53]. Ω(ft) represents the regularization term, which control the model's complexity to avoid overfitting [50]. The estimated loss function can be computed based on Taylor expansion of the objective function:
(6) |
where denotes each sample's first derivative and denotes each sample's second derivative. The first and second derivatives of each sample are all that the loss function requires [54].
3.4. Proposed Method
In this study, preprocessing methods were employed on the dataset, including label encoder for classes and normalization on images. As a result, less redundant data are given as the input to the network. Deeply influenced by the brain's structure, deep learning as a subfield of machine learning has emerged. In medical image processing, as in many other areas, deep learning approaches have demonstrated excellent results in past years [33]. ImageNet is a dataset of millions of images organized into 1000 categories when it comes to image processing. The next step was to apply several pretrained models that were trained based on this dataset. DenseNet169 had the best performance among those models, so it was selected as the feature extractor in the proposed method. The X-ray dataset images were scaled at a fixed size of 224 × 224 pixels, the DenseNet169 input size.
The final layer of the DenseNet169 network, which was used to predict ImageNet dataset labels, was eliminated. Global average pooling, a pooling method designed to substitute fully connected layers in classical CNNs, was added in the final layer of the network. One of the benefits of global average pooling is that there are no parameters to adjust in this layer; therefore, no training is needed. Additionally, because global average pooling sums up the dimensional information, it is more robust to spatial translations of the input [55]. The X-ray images were given to the network to extract features from DenseNet169, and 1664 features were extracted as a result.
When a learning model is given many features and few samples, it is likely to overfit, causing its performance to degrade. Among researchers, feature selection is a widely used strategy for reducing dimensionality [56]. In order to reduce the classification time and increase the classifier performance, the ANOVA feature selection method was employed to reduce the number of features. Thus, the range of 50 to 500 features was applied to select the best number of features for classification (using validation set). Finally, the selected features were given to the XGBoost to detect COVID-19. Figure 1 shows the general framework of the proposed method.
4. Results and Discussion
Several performance metrics such as precision, recall, specificity, and F1-Score, as well as accuracy, were utilized to evaluate several deep learning models with the proposed methodology because accuracy alone cannot evaluate a model's usefulness [57]. Accuracy is the ratio of the number of correctly predicted samples to the total number of samples. The following equation can be used to calculate accuracy:
(7) |
where TP and TN denote the number of true positives and true negatives, respectively.
Precision is the proportion of predicted true-positive values to the total number of predicted true-positive and false-positive values. A model with a low precision is prone to a high false-positive rate. Precision can be calculated using the following equation:
(8) |
where FP denotes the number of false positives.
The number of true positives divided by the sum of true positives and false negatives is known as recall or sensitivity. When there is a large cost associated with false negatives, the model statistic used to pick the optimal model is recall. Recall can be computed using the following equation:
(9) |
where FN denotes the number of false negatives.
Specificity is the proportion of predicted true negatives to the summation of predicted true negatives and false positives. Specificity can be determined using the following equation:
(10) |
F 1-score combines precision and recall. As a result, both false positives and false negatives are included while calculating this score. It is not as simple as accuracy for comparison. However, F1-score is generally more valuable than accuracy, particularly if the problem is an imbalanced classification problem. The following equation can be used to calculate the F1-score:
(11) |
In this study, the dataset that Ozturk et al. [6] collected has been employed, gathered from two distinct sources, and includes COVID-19, No-findings, and Pneumonia, as shown in Figure 2. The first class of dataset contained 43 women, and 82 men confirmed they were infected with COVID-19. The average age of 26 COVID-19 confirmed individuals is about 55 years old, according to the age information supplied. The remaining two classes were chosen randomly from the Wang et al. [58] ChestX-ray8 dataset, including 500 No-findings and 500 Pneumonia images.
Two distinct perspectives were conducted to identify and classify COVID-19. First, the proposed technique was validated in order to classify two classes labeled COVID-19 and No-findings. Second, the proposed approach was used to classify three different groups: COVID-19, No-findings, and Pneumonia. In the first aspect, the two-class problem, the proposed method effectiveness is measured using the 5-fold cross-validation. A total of 80% of the dataset was used for training and 20% for testing. Following the extraction of features by DenseNet169, ANOVA selected 67 features from 1664 as an optimal number for classification, resulting in about 96% of features being reduced, and the XGBoost classification process was significantly sped up.
The 5-fold cross-validation had an average accuracy of 98.72%, and the confusion matrix was computed for each fold and overlapped, as shown in Figure 3. The confusion matrix entries acquired in all folds are used to generate the overlapping confusion matrix. It shows that the proposed architecture correctly identified COVID-19 and No-findings with 100% and 98.43% accuracy, respectively. In other words, the proposed method performs better at detecting true-positive samples.
The achieved precision, recall, specificity, and F1-score values were 99.21%, 93.33%, 100%, and 97.87%, respectively. Table 1 represents the comparison of the proposed method with Ozturk et al. [6] and Nasiri and Hasani [21] in terms of accuracy, precision, recall, specificity, and F1-score values for each fold and the average of all folds, which Nasiri and Hasani [21] had better results than Ozturk et al. [6] and the proposed method outperforms them all except recall. The good performance of the proposed method can be attributed to the superiority of the XGBoost, which has a good generalization and less overfitting. As a result, the proposed method outperforms other methods in terms of test accuracy. Furthermore, using ANOVA helps the proposed method select the most distinctive features from the feature space and better separate COVID-19, No-findings, and Pneumonia classes, leading to higher classification accuracy.
Table 1.
Performance metrics (%) | Methods | 1-fold | 2-fold | 3-fold | 4-fold | 5-fold | Average |
---|---|---|---|---|---|---|---|
Recall | Proposed method | 94.73 | 90.47 | 92.00 | 96.96 | 92.59 | 93.33 |
Ozturk et al. | 100 | 96.42 | 90.47 | 93.75 | 93.18 | 95.13 | |
Nasiri and Hasani | 95.20 | 95.40 | 96.70 | 81.40 | 91.40 | 92.08 | |
| |||||||
Specificity | Proposed method | 100 | 100 | 100 | 100 | 100 | 100 |
Ozturk et al. | 100 | 96.42 | 90.47 | 93.75 | 93.18 | 95.30 | |
Nasiri and Hasani | 100 | 100 | 100 | 89.90 | 100 | 99.78 | |
| |||||||
Precision | Proposed method | 99.53 | 99.05 | 99.01 | 99.46 | 99.00 | 99.21 |
Ozturk et al. | 100 | 94.52 | 98.14 | 98.57 | 98.58 | 98.03 | |
Nasiri and Hasani | 99.50 | 99.50 | 99.40 | 95.30 | 99.02 | 98.54 | |
| |||||||
F 1-score | Proposed method | 98.41 | 97.02 | 97.42 | 98.96 | 97.57 | 97.87 |
Ozturk et al. | 100 | 95.52 | 93.79 | 95.93 | 95.62 | 96.51 | |
Nasiri and Hasani | 98.50 | 98.50 | 98.20 | 92.50 | 97.30 | 97.00 | |
| |||||||
Accuracy | Proposed method | 99.20 | 98.40 | 98.40 | 99.20 | 98.40 | 98.72 |
Ozturk et al. | 100 | 97.60 | 96.80 | 97.60 | 97.60 | 98.08 | |
Nasiri and Hasani | 99.20 | 99.20 | 99.20 | 95.20 | 98.40 | 98.24 |
In the multiclass problem, 80% of the X-ray images dataset was used for training, and the remaining 20% was employed as the test set. ANOVA was used to select 275 features out of 1664 as the ideal number for classification. Consequently, almost 84% of features were decreased, and XGBoost classification process was substantially ramped up, and performance improved. The accuracy on the test set was 92%. The confusion matrix is illustrated in Figure 4. Like two-class problem, this confusion matrix indicates that the proposed method had a better result in finding COVID-19 than No-findings and Pneumonia. Precision, recall, specificity, and F1-score values of 94.07%, 88.46%, 100%, and 92.42% were reached, respectively. Regarding accuracy, precision, recall, specificity, and F1-score values of the test set, Table 2 compares the proposed approach to Ozturk et al. [6] and Nasiri and Hasani [21].
Table 2.
Methods | Performance metrics (%) | ||||
---|---|---|---|---|---|
Recall | Specificity | Precision | F 1 -score | Accuracy | |
Proposed method | 88.46 | 100 | 94.07 | 92.42 | 92.00 |
Ozturk et al. | 88.17 | 93.66 | 90.97 | 89.44 | 89.33 |
Nasiri and Hasani | 95.20 | 100 | 92.50 | 91.20 | 89.70 |
The proposed method was applied to nine pretrained networks for both binary and multiclass problems. As shown in Table 3, the average of 5-fold cross-validation accuracy was employed to compare approaches in binary class problem, whereas the best fold accuracy was used to compare approaches on the multiclass problem. DenseNet169 outperforms other pretrained networks in both binary and multiclass problems. Additionally, the gradient-based class activation mapping (Grad-CAM) [59] was used to represent the decision area on a heatmap. Figure 5 illustrates the heatmaps for three COVID-19 cases, confirming that the proposed method extracted correct features for detection of COVID-19, and the model is mostly concentrated on the lung area. Radiologists might use these heatmaps to evaluate the chest area more accurately.
Table 3.
DNN | Binary class accuracy (%) | Multiclass accuracy (%) |
---|---|---|
DenseNet169 | 98.72 | 92.00 |
InceptionV3 | 92.96 | 82.22 |
NASNetLarge | 94.88 | 84.00 |
ResNet152 | 94.71 | 77.33 |
VGG16 | 97.43 | 88.88 |
VGG19 | 97.28 | 88.88 |
Xception | 95.68 | 80.88 |
EfficientNetB0 | 97.92 | 88.88 |
InceptionResNetV2 | 94.88 | 83.11 |
The proposed method was compared to relevant works in Table 4. Gunraj et al. [8] applied 16,756 X-ray chest images from diverse sources to develop COVID-Net and achieved a 92.40% accuracy rate on the multiclass classification problem. Sethy et al. [27] reached 95.38% accuracy using ResNet50 and SVM, which was evaluated by 50 X-ray images. Wang et al. [30] proposed M-Inception for 195 COVID-19 infected patients and 258 healthy cases, and as a result, they achieved an 82.90% accuracy rate. Hemdan et al. [36] trained and evaluated COVIDX-Net using 25 confirmed COVID-19 and 25 noninfected cases X-ray images, achieving a 90.00% accuracy rate.
Table 4.
Study | Type of images | Number of cases | Method used | Accuracy (%) | Drawbacks |
---|---|---|---|---|---|
Wang et al. [8] | Chest X-ray | 358 COVID-19 (+) | COVID-Net | 92.40 | Use of an unbalanced dataset |
8066 COVID-19 (−) | High computational complexity due to training of deep neural network | ||||
5538 Pneumonia | |||||
| |||||
Sethy et al. [27] | Chest X-ray | 25 COVID-19 (+) | ResNet50 + SVM | 95.38 | Use of a dataset with a limited number of samples |
25 COVID-19 (−) | |||||
| |||||
Hemdan et al. [36] | Chest X-ray | 25 COVID-19 (+) | COVIDX-Net | 90.00 | Use of a dataset with a limited number of samples |
25 COVID-19 (−) | |||||
| |||||
Narin et al. [33] | Chest X-ray | 50 COVID-19 (+) | Deep CNN | 98.00 | Use of a dataset with a limited number of samples |
50 COVID-19 (−) | ResNet50 | ||||
| |||||
Ying et al. [60] | Chest CT | 777 COVID-19 (+) | DRE-Net | 86.00 | Low accuracy |
708 healthy | |||||
| |||||
Wang et al. [30] | Chest CT | 195 COVID-19 (+) | M-Inception | 82.90 | Low accuracy |
258 COVID-19 (−) | |||||
| |||||
Zheng et al. [61] | Chest CT | 313 COVID-19 (+) | UNet + 3D deep network | 90.80 | High computational complexity due to training of deep neural network |
229 COVID-19 (−) | |||||
| |||||
Xu et al. [35] | Chest CT | 219 COVID-19 (+) | ResNet + location attention | 86.70 | Low accuracy |
224 viral pneumonia | High computational complexity due to training of deep neural network | ||||
175 healthy | |||||
| |||||
Ozturk et al. [6] | Chest X-ray | 125 COVID-19 (+) | DarkCovidNet | 98.08 | Use of a limited number of COVID-19 samples |
500 No-Findings | |||||
125 COVID-19 (+) | 87.02 | High computational complexity due to training of deep neural network | |||
500 No-Findings | |||||
500 Pneumonia | |||||
| |||||
Proposed method | Chest X-ray | 125 COVID-19 (+) | DenseNet169+ ANOVA + XGBoost | 98.72 | Sensitivity to the number of features selected by the ANOVA |
500 No-Findings | |||||
125 COVID-19 (+) | 92.00 | Use of a limited number of COVID-19 samples | |||
500 No-Findings | |||||
500 Pneumonia |
Narin et al. [33] used 50 public source COVID-19 chest X-ray images and 50 normal images from another source to test three alternative CNN models, obtaining a 98.00% accuracy rate. Song et al. [60] employed 777 confirmed COVID-19 patients and 708 normal cases CT images to develop their deep model based on the pretrained network model, ResNet50, which reached an 86.0% accuracy rate.
Zheng et al. [61] gained 90.80% accuracy employing CT images of 313 positive COVID-19 and 229 normal cases to develop their model. Xu et al. [35] applied ResNet on the dataset of 219 confirmed COVID-19 and 224 Pneumonia and 175 normal CT images, scoring 86.70% performance. Ozturk et al. [6] used 125 positive COVID-19, 500 No-findings, and 500 Pneumonia X-ray images to develop their model, resulting in 98.08% for two-class and 87.02% multiclass accuracy rate. The dataset that Ozturk et al. [6] gathered from various sources was used in this paper. For the two-class and multiclass classification problems, 98.72% and 92.00% accuracy rates were obtained, respectively, in this paper. Table 4 shows that the proposed approach outperforms most of the existing deep learning-based models in terms of accuracy. However, it should be emphasized that the findings in Table 4 were derived from different datasets and different experimental setups. This study's limitations and drawbacks include using a limited number of COVID-19 X-ray images (i.e., 125 samples) and sensitivity of the proposed method's performance to the number of features selected by ANOVA so that the number of features should be selected by trial and error.
5. Conclusion
Early diagnosis of COVID-19 is a crucial step to prevent mortality and transmission of the virus. RT-PCR is the most accessible tool to identify COVID-19, but finding other alternatives for this problem is essential due to false-negative outcomes and time limitations. Chest CT and X-ray images are suitable substitutes for RT-PCR, but because of the lack of CT hardware, X-ray images are a superior tool for diagnosing COVID-19. AI and machine learning-based methods play a crucial role in the quicker detection of COVID-19. In this study, a pretrained model, DenseNet169, was utilized to extract features from X-ray images, and ANOVA was employed to select features to decrease classification time and improve performance. Finally, selected features were classified using XGBoost. The ChestX-ray8 dataset was used to evaluate the proposed method. The proposed method reached 98.72% accuracy for the two-class problem and 92% accuracy for the multiclass problem. The proposed method's precision, recall, and specificity rates on the two-class problem were 99.21%, 93.33%, and 100%, respectively. Also, for the multiclass problem, the proposed method achieved 94.07% precision, 88.46% recall, and 100% specificity. The experimental results show that the proposed method outperforms other state-of-the-art methods, and radiologists might use it to detect COVID-19 cases more accurately.
Data Availability
Publicly available ChestX-ray8 dataset was used in this study. The source code of the proposed method required to reproduce the predictions and results is available at https://github.com/seyyedalialavi2000/COVID-19-detection.
Disclosure
A preprint of this paper has been submitted to arXiv.org, which is available at https://arxiv.org/abs/2110.06340.
Conflicts of Interest
The authors declare that they have no conflicts of interest.
References
- 1.Khan A. I., Shah J. L., Bhat M. M. CoroNet: a deep neural network for detection and diagnosis of COVID-19 from chest x-ray images. Computer Methods and Programs in Biomedicine . 2020;196 doi: 10.1016/j.cmpb.2020.105581.105581 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 2.Wu Z., McGoogan J. M. Characteristics of and important lessons from the coronavirus disease 2019 (COVID-19) outbreak in China. JAMA . 2020;323(13):1239–1242. doi: 10.1001/jama.2020.2648. [DOI] [PubMed] [Google Scholar]
- 3.Manoj M., Srivastava G., Somayaji S. R. K., Gadekallu T. R., Maddikunta P. K. R., Bhattacharya S. An incentive based approach for COVID-19 planning using blockchain technology. Proceedings of the 2020 IEEE Globecom Workshops (GC Wkshps); December 2020; Taipei, Taiwan. pp. 1–6. [DOI] [Google Scholar]
- 4.Bhattacharya S., Reddy Maddikunta P. K., Pham Q.-V., et al. Deep learning and medical image processing for coronavirus (COVID-19) pandemic: a survey. Sustainable Cities and Society . 2021;65 doi: 10.1016/j.scs.2020.102589.102589 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 5.Fronstin P., Woodbury S. A. How Many Americans Have Lost Jobs with Employer Health Coverage During the Pandemic . Washington, DC, USA: The Commonwealth Fund; 2020. https://research.upjohn.org/externalpapers/90 . [Google Scholar]
- 6.Ozturk T., Talo M., Yildirim E. A., Baloglu U. B., Yildirim O., Rajendra Acharya U. Automated detection of COVID-19 cases using deep neural networks with X-ray images. Computers in Biology and Medicine . 2020;121 doi: 10.1016/j.compbiomed.2020.103792.103792 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 7.Bhatt T., Kumar V., Pande S., Malik R., Khamparia A., Gupta D. A review on COVID-19. Artificial Intelligence and Machine Learning for COVID-19 . 2021;924(April):25–42. doi: 10.1007/978-3-030-60188-1_2. [DOI] [Google Scholar]
- 8.Gunraj H., Wang L., Wong A. COVIDNet-CT: a tailored deep convolutional neural network design for detection of COVID-19 cases from chest CT images. Frontiers of Medicine . 2020;7:1–12. doi: 10.3389/fmed.2020.608525. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 9.Tabik S., Gomez-Rios A., Martin-Rodriguez J. L., et al. COVIDGR dataset and COVID-SDNet methodology for predicting COVID-19 based on chest X-ray images. IEEE Journal of Biomedical and Health Informatics . 2020;24(12):3595–3605. doi: 10.1109/JBHI.2020.3037127. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 10.Huang P., Liu T., Huang L., et al. Use of chest CT in combination with negative RT-PCR assay for the 2019 novel coronavirus but high clinical suspicion. Radiology . 2020;295(1):22–23. doi: 10.1148/radiol.2020200330. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 11.Ai T., Yang Z., Hou H., et al. Correlation of chest CT and RT-PCR testing for coronavirus disease 2019 (COVID-19) in China: a report of 1014 cases. Radiology . 2020;296(2):E32–E40. doi: 10.1148/radiol.2020200642. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 12.Ahishali M., Degerli A., Yamac M., et al. Advance warning methodologies for COVID-19 using chest x-ray images. IEEE Access . 2021;9:41052–41065. doi: 10.1109/access.2021.3064927. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 13.Ilyas M., Rehman H., Nait-Ali A. Detection of Covid-19 from chest X-ray images using artificial intelligence: an early review. 2020. https://arxiv.org/abs/2004.05436 .
- 14.Fang Y., Pang P. Senivity of chest CT for COVID-19: comparasion to RT-PCR. Radiology . 2020;296:15–17. doi: 10.1148/radiol.2020200432. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 15.Xie X., Zhong Z., Zhao W., Zheng C., Wang F., Liu J. Chest CT for typical coronavirus disease 2019 (COVID-19) pneumonia: relationship to negative RT-PCR testing. Radiology . 2020;296(2):E41–E45. doi: 10.1148/radiol.2020200343. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 16.Shi F., Wang J., Shi J., et al. Review of artificial intelligence techniques in imaging data acquisition, segmentation, and diagnosis for COVID-19. IEEE Reviews in Biomedical Engineering . 2020;14:4–15. doi: 10.1109/RBME.2020.2987975. [DOI] [PubMed] [Google Scholar]
- 17.Zhu N., Zhang D., Wang W., et al. A novel coronavirus from patients with pneumonia in China, 2019. New England Journal of Medicine . 2020;382(8):727–733. doi: 10.1056/nejmoa2001017. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 18.Lecun Y., Bengio Y., Hinton G. Deep learning. Nature . 2015;521(7553):436–444. doi: 10.1038/nature14539. [DOI] [PubMed] [Google Scholar]
- 19.Nasiri H., Alavi S. A. A novel framework based on deep learning and anova feature selection method for diagnosis of COVID-19 cases from chest X-ray images. 2021. https://arxiv.org/abs/2110.06340 . [DOI] [PMC free article] [PubMed]
- 20.Apostolopoulos I. D., Mpesiana T. A. COVID-19: automatic detection from x-ray images utilizing transfer learning with convolutional neural networks. Physical and Engineering Sciences in Medicine . 2020;43(2):635–640. doi: 10.1007/s13246-020-00865-4. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 21.Nasiri H., Hasani S. Automated detection of COVID-19 cases from chest X-ray images using deep neural network and XGBoost. 2021. https://arxiv.org/abs/2109.02428 . [DOI] [PMC free article] [PubMed]
- 22.Qaid T. S., Mazaar H., Al-Shamri M. Y. H., Alqahtani M. S., Raweh A. A., Alakwaa W. Hybrid deep-learning and machine-learning models for predicting COVID-19. Computational Intelligence and Neuroscience . 2021;2021:11. doi: 10.1155/2021/9996737.9996737 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 23.Abdulkareem K. H., Mohammed M. A., Salim A., et al. Realizing an effective COVID-19 diagnosis system based on machine learning and IoT in smart hospital environment. IEEE Internet of Things Journal . 2021;8(21):15919–15928. doi: 10.1109/jiot.2021.3050775. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 24.Chen L., Rezaei T. A new optimal diagnosis system for coronavirus (COVID-19) diagnosis based on Archimedes optimization algorithm on chest X-ray images. Computational Intelligence and Neuroscience . 2021;2021:9. doi: 10.1155/2021/7788491.7788491 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 25.Khorami E., Mahdi Babaei F., Azadeh A. Optimal diagnosis of COVID-19 based on convolutional neural network and red Fox optimization algorithm. Computational Intelligence and Neuroscience . 2021;2021:11. doi: 10.1155/2021/4454507.4454507 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 26.Waheed A., Goyal M., Gupta D., Khanna A., Al-Turjman F., Pinheiro P. R. CovidGAN: data augmentation using auxiliary classifier GAN for improved covid-19 detection. IEEE Access . 2020;8:91916–91923. doi: 10.1109/access.2020.2994762. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 27.Sethy P. K., Behera S. K., Ratha P. K., Biswas P. Detection of coronavirus disease (COVID-19) based on deep features and support vector machine. International Journal of Mathematical, Engineering and Management Sciences . 2020;5(4):643–651. doi: 10.33889/ijmems.2020.5.4.052. [DOI] [Google Scholar]
- 28.Ahmad F., Farooq A., Ghani M. U. Deep ensemble model for classification of novel coronavirus in chest X-ray images. Computational Intelligence and Neuroscience . 2021;2021:17. doi: 10.1155/2021/8890226.8890226 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 29.Abbas A., Abdelsamea M. M., Gaber M. M. Classification of COVID-19 in chest X-ray images using DeTraC deep convolutional neural network. Applied Intelligence . 2021;51(2):854–864. doi: 10.1007/s10489-020-01829-7. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 30.Wang W., Li Y., Wang X., Li J., Zhang P. COVID-19 patients detection in chest X-ray images via MCFF-net. Proceedings of the 2021 13th International Conference on Advanced Computational Intelligence (ICACI); May 2021; Wanzhou, China. pp. 318–322. [DOI] [Google Scholar]
- 31.Ucar F., Korkmaz D. COVID diagnosis-Net: deep Bayes-SqueezeNet based diagnosis of the coronavirus disease 2019 (COVID-19) from X-ray images. Medical Hypotheses . 2020;140 doi: 10.1016/j.mehy.2020.109761.109761 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 32.Wang S., Kang B., Ma J., et al. A deep learning algorithm using CT images to screen for Corona Virus Disease (COVID-19) European Radiology . 2021;31:1–9. doi: 10.1007/s00330-021-07715-1. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 33.Narin A., Kaya C., Pamuk Z. Zonguldak, Turkey: 2020. Department of biomedical engineering, zonguldak bulent ecevit university, 67100. https://arxiv.org/abs/2003.10849 . [Google Scholar]
- 34.He X., Yang X., Zhang S., et al. Sample-efficient deep learning for COVID-19 diagnosis based on CT scans. IEEE Transactions on Medical Imaging . 2020;XX(Xx) doi: 10.1101/2020.04.13.20063941. [DOI] [Google Scholar]
- 35.Xu X., Jiang X., Ma C., et al. A deep learning system to screen novel coronavirus disease 2019 pneumonia. Engineering . 2020;6(10):1122–1129. doi: 10.1016/j.eng.2020.04.010. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 36.Hemdan E. E.-D., Shouman M. A., Karar M. E. Covidx-net: a framework of deep learning classifiers to diagnose COVID-19 in x-ray images. 2020. https://arxiv.org/abs/2003.11055 .
- 37.Minaee S., Kafieh R., Sonka M., Yazdani S., Jamalipour Soufi G. Deep-COVID: predicting COVID-19 from chest X-ray images using deep transfer learning. Medical Image Analysis . 2020;65 doi: 10.1016/j.media.2020.101794.101794 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 38.Adnan M., Rahman F., Imrul M., AL N., Shabnam S. Handwritten bangla character recognition using inception convolutional neural network. International Journal of Computer Application . 2018;181(17):48–59. doi: 10.5120/ijca2018917850. [DOI] [Google Scholar]
- 39.Lecun Y., Bottou L., Bengio Y., Haffner P. Gradient-based learning applied to document recognition. Proceedings of the IEEE . 1998;86(11):2278–2324. doi: 10.1109/5.726791. [DOI] [Google Scholar]
- 40.Zhou P., Ni B., Geng C., Hu J., Xu Y. Scale-transferrable object detection. CVF Open Access . 1:528–537. [Google Scholar]
- 41.Huang G., Liu Z., Van Der Maaten L., Weinberger K. Q. Densely connected convolutional networks. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; July 2017; Honolulu, HI, USA. pp. 4700–4708. [DOI] [Google Scholar]
- 42.Chandrashekar G., Sahin F. A survey on feature selection methods. Computers & Electrical Engineering . 2014;40(1):16–28. doi: 10.1016/j.compeleceng.2013.11.024. [DOI] [Google Scholar]
- 43.Guyon I., Elisseeff A. Journal of Machine Learning Research . Vol. 3. Mar: 2003. An introduction to variable and feature selection; pp. 1157–1182. [Google Scholar]
- 44.Johnson K. J., Synovec R. E. Pattern recognition of jet fuels: comprehensive GC × GC with ANOVA-based feature selection and principal component analysis. Chemometrics and Intelligent Laboratory Systems . 2002;60(1–2):225–237. doi: 10.1016/S0169-7439(01)00198-8. [DOI] [Google Scholar]
- 45.Lin H., Ding H. Predicting ion channels and their types by the dipeptide mode of pseudo amino acid composition. Journal of Theoretical Biology . 2011;269(1):64–69. doi: 10.1016/j.jtbi.2010.10.019. [DOI] [PubMed] [Google Scholar]
- 46.Ding H., Guo S.-H., Deng E.-Z., et al. Prediction of Golgi-resident protein types by using feature selection technique. Chemometrics and Intelligent Laboratory Systems . 2013;124:9–13. doi: 10.1016/j.chemolab.2013.03.005. [DOI] [Google Scholar]
- 47.Chen T., Guestrin C. Xgboost: a scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; August 2016; San Francisco, CA, USA. pp. 785–794. [Google Scholar]
- 48.Chelgani S. C. Estimation of gross calorific value based on coal analysis using an explainable artificial intelligence. Machine Learning with Applications . 2021;6100116 [Google Scholar]
- 49.Song K., Yan F., Ding T., Gao L., Lu S. A steel property optimization model based on the XGBoost algorithm and improved PSO. Computational Materials Science . 2020;174 doi: 10.1016/j.commatsci.2019.109472.109472 [DOI] [Google Scholar]
- 50.Chehreh Chelgani S., Nasiri H., Tohry A. Modeling of particle sizes for industrial HPGR products by a unique explainable AI tool- A “Conscious Lab” development. Advanced Powder Technology . 2021;32(11):4141–4148. doi: 10.1016/j.apt.2021.09.020. [DOI] [Google Scholar]
- 51.Chelgani S. C., Nasiri H., Alidokht M. Interpretable modeling of metallurgical responses for an industrial coal column flotation circuit by XGBoost and SHAP-A ‘conscious-lab’ development. International Journal of Mining Science and Technology . 2021;31(6):1135–1144. [Google Scholar]
- 52.Zhang D., Chen H.-D., Zulfiqar H., et al. IBLP: an XGBoost-based predictor for identifying bioluminescent proteins. Computational and Mathematical Methods in Medicine . 2021;2021:15. doi: 10.1155/2021/6664362.6664362 [DOI] [PMC free article] [PubMed] [Google Scholar]
- 53.Nasiri H., Homafar A., Chelgani S. C. Prediction of uniaxial compressive strength and modulus of elasticity for Travertine samples using an explainable artificial intelligence. Results in Geophysical Sciences . 2021;8 doi: 10.1016/j.ringps.2021.100034.100034 [DOI] [Google Scholar]
- 54.Yu B., Qiu W., Chen C., et al. SubMito-XGBoost: predicting protein submitochondrial localization by fusing multiple feature information and eXtreme gradient boosting. Bioinformatics . 2020;36(4):1074–1081. doi: 10.1093/bioinformatics/btz734. [DOI] [PubMed] [Google Scholar]
- 55.Lin M., Chen Q., Yan S. Network in network. Proceedings of the 2nd International Conference of Learning Representations ICLR 2014-Conference Track Proceedings; April 2014; Banff, AB, Canada. pp. 1–10. [Google Scholar]
- 56.Tang J., Alelyani S., Liu H. Feature selection for classification: a review. Data Classification Algorithms and Applications . 2014;37 [Google Scholar]
- 57.Maxwell A., Li R, Yang B, et al. Deep learning architectures for multi-label classification of intelligent health risk prediction. BMC Bioinformatics . 2017;18(14):523–131. doi: 10.1186/s12859-017-1898-z. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 58.Wang X., Peng Y., Lu L., Lu Z., Bagheri M., Summers R. M. Chest x-ray 8: hospital-scale chest x-ray database and benchmarks on weakly-supervised classification and localization of common thorax diseases. Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition; July 2017; Honolulu, HI, USA. pp. 2097–2106. [DOI] [Google Scholar]
- 59.Selvaraju R. R., Cogswell M., Das A., Vedantam R., Parikh D., Batra D. Grad-cam: visual explanations from deep networks via gradient-based localization. Proceedings of the IEEE International Conference on Computer Vision; October 2017; Venice, Italy. pp. 618–626. [DOI] [Google Scholar]
- 60.Song Y., Zheng H., Li L., et al. Deep learning enables accurate diagnosis of novel coronavirus (COVID-19) with CT images. IEEE/ACM Transactions on Computational Biology and Bioinformatics . 2021;18(6):2775–2780. doi: 10.1109/tcbb.2021.3065361. [DOI] [PMC free article] [PubMed] [Google Scholar]
- 61.Zheng C., Deng X., Fu Q., Zhou Q. Deep learning-based detection for COVID-19 from chest CT using weak label. MedRxiv . 2020 [Google Scholar]
Associated Data
This section collects any data citations, data availability statements, or supplementary materials included in this article.
Data Availability Statement
Publicly available ChestX-ray8 dataset was used in this study. The source code of the proposed method required to reproduce the predictions and results is available at https://github.com/seyyedalialavi2000/COVID-19-detection.