Parallel CNN-ELM: A multiclass classification of chest X-ray images to identify seventeen lung diseases including COVID-19

Md Nahiduzzaman; Md Omaer Faruq Goni; Rakibul Hassan; Md Robiul Islam; Md Khalid Syfullah; Saleh Mohammed Shahriar; Md Shamim Anower; Mominul Ahsan; Julfikar Haider; Marcin Kowalski

doi:10.1016/j.eswa.2023.120528

. 2023 May 27;229:120528. doi: 10.1016/j.eswa.2023.120528

Parallel CNN-ELM: A multiclass classification of chest X-ray images to identify seventeen lung diseases including COVID-19

Md Nahiduzzaman ^a,¹, Md Omaer Faruq Goni ^a,², Rakibul Hassan ^a,³, Md Robiul Islam ^a,⁴, Md Khalid Syfullah ^a,⁵, Saleh Mohammed Shahriar ^a,⁶, Md Shamim Anower ^b,⁷, Mominul Ahsan ^c,⁸, Julfikar Haider ^d,⁹, Marcin Kowalski ^e,^⁎,¹⁰

PMCID: PMC10223636 PMID: 37274610

Abstract

Numerous epidemic lung diseases such as COVID-19, tuberculosis (TB), and pneumonia have spread over the world, killing millions of people. Medical specialists have experienced challenges in correctly identifying these diseases due to their subtle differences in Chest X-ray images (CXR). To assist the medical experts, this study proposed a computer-aided lung illness identification method based on the CXR images. For the first time, 17 different forms of lung disorders were considered and the study was divided into six trials with each containing two, two, three, four, fourteen, and seventeen different forms of lung disorders. The proposed framework combined robust feature extraction capabilities of a lightweight parallel convolutional neural network (CNN) with the classification abilities of the extreme learning machine algorithm named CNN-ELM. An optimistic accuracy of 90.92% and an area under the curve (AUC) of 96.93% was achieved when 17 classes were classified side by side. It also accurately identified COVID-19 and TB with 99.37% and 99.98% accuracy, respectively, in 0.996 microseconds for a single image. Additionally, the current results also demonstrated that the framework could outperform the existing state-of-the-art (SOTA) models. On top of that, a secondary conclusion drawn from this study was that the prospective framework retained its effectiveness over a range of real-world environments, including balanced-unbalanced or large-small datasets, large multiclass or simple binary class, and high- or low-resolution images. A prototype Android App was also developed to establish the potential of the framework in real-life implementation.

Keywords: COVID19, Convolutional neural network, Extreme learning machine, Mobile apps, Pneumonia, Tuberculosis

1. Introduction

With the advancement in medical science and technology, a large number of previously incurable diseases now can be completely treatable (Vandenberg et al., 2021). Despite significant advances, emerging new diseases such as COVID-19 continue to present new challenges of quick and accurate identification and finding relevant treatment solutions. COVID-19 has spread in every corner of the world and affected millions of people, and many have lost their lives. It has been estimated that at the beginning of March 2022 nearly six million people have died and over 450 million have been afflicted by COVID-19 in the past three years (World Health Organization, 2022). Flu-like symptoms such as fever, dry cough, fatigue, and difficulty in breathing are generally experienced by the patients. In the severe cases, the COVID-19 frequently results in life-threatening pneumonia (Huff & Singh, 2020), which is a respiratory infection that incapacitates the lungs. When a healthy individual breathes, the lungs' tiny sacs called alveoli get filled with air. However, if a person contracts pneumonia, the alveoli get loaded with pus and fluid, obstructing breathing, and limiting oxygen absorption (Ruuskanen et al., 2011). Around 7% of the global population (450 million people) is impacted by pneumonia alone, and around 2 million people die from pneumonia each year (Ruuskanen et al., 2011).

The popular reverse transcription-polymerase chain reaction (RT-PCR) takes up to forty-eight hours to confirm the presence of coronavirus (Zhu et al., 2020). Since this technique is highly time-consuming and owing to the lack of other resources, a COVID-19 infected person can continue to spread the virus to their close contacts (Vandenberg et al., 2021). However, COVID-19 can be detected using chest X-ray analysis, which is comparatively a faster technique to speed up the diagnosis. However, the X-ray image has to be analyzed by a radiologist manually. Every year in the United States alone, more than 35 million CXR images are collected as part of medical treatment (Kamel et al., 2017). Increasing workload and exhaustion, are already a commonplace among the radiologists who must routinely review in excess of 100 CXR images each day (Kamel et al., 2017). Furthermore, radiologists' diagnosis can differ because of human judgment increasing the prospect of incorrect diagnosis. Numerous other lung diseases such as tuberculosis, cardiomegaly, opacity, and pleural that can also be detected from the X-ray image analysis, make it even more challenging for the radiologists to correctly identify the diseases. Therefore, it requires an automated and intelligent system such as artificial intelligence (AI) that can quickly and accurately identify any lung diseases with high classification accuracy and without consuming a lot of time or resource so that the system can be applied to real-world disease identification while significantly reducing the workload of the radiologists.

Machine learning (ML) and Deep learning (DL) have been applied effectively and efficiently in a wide variety of medical applications. However, for image classification, traditional machine learning models often rely on hand-crafted features, which necessitate multiple steps for extraction. In contrast, DL models have the advantage of automatically learning and extracting relevant features from the data, bypassing the need for manual feature extraction step. The key advantage of the DL in this context is the elimination of the complex feature engineering process typically associated with traditional machine learning techniques (Molina et al., 2021). Due to the recent availability of large-scale data sets, various attempts have been made to automatically identify lung-related anomalies using CXR images.

In 2017, Wang et al. developed a larger database of CXR images that consisted of eight thoracic diseases with over 100,000 images (Wang et al., 2017). Later, this database was expanded by adding images of additional six thoracic diseases to create a ChestX-ray14 (CXR14) dataset containing fourteen thoracic diseases. In the CXR14 dataset, radiologists provided a small number of CXR images with hand-labeled bounding boxes (B-Boxes) to reveal the affected area of the disease in the CXR images. They used several pre-trained transfer learning (TL) models (AlexNet, GoogleNet, VGGNet-16, and ResNet-50) for detecting the diseases from the CXR images and achieved the highest average area under the curve (AUC) of 74.51% using ResNet-50. Yao et al. handled multilabel classification of thoracic diseases from CXR14 images by using a DenseNet-121 as an encoder (Yao et al. 2017). Subsequently, a decoder was utilized to optimize interdependencies among the target labels in predicting 14 pathogenic abnormalities using a long short-term memory (LSTM) network. The proposed framework surpassed the results of Wang et al. with an average AUC of 0.798. Likewise, Rajpurkar et al. proposed a CNN architecture containing 121 layers named CheXNet to detect pneumonia from the CXR14 dataset (Rajpurkar et al., 2017). They employed class activation mappings to display the characteristics region associated with the disease from CXR images (CAMs). In comparison to the prior research (Wang et al., 2017, Yao et al., 2017), the CheXNet produced higher AUC for each condition. Their model’s F1 metric was 0.435, compared to the average F1-score of 0.387 given by four radiologists. On the other hand, Kumar et al. developed a boosted cascaded CNN (BCCNN) for multilabel classification from the CXR14 dataset (Kumar et al., 2018). For calculating the loss for the multilabel classification, they used binary relevance and a pairwise error loss function. The BCCNN achieved a higher AUC for only the cardiomegaly disease with 91.33% than the previous state-of-the-art (SOTA) models (Rajpurkar et al., 2017). The CXR14 database consisted of multiple diseases and was imbalanced. In 2018, Ge et al. addressed these two problems by utilizing a novel error function named multilabel softmax loss (Ge et al., 2018). Various TL models (DenseNet-121, ResNet18, and VGG) were used for predicting the 14 diseases and achieved the highest AUC of 85.37% while using the ensembling model (DenseNet121-VGG) that surpassed the previous SOTA models (Kumar et al., 2018, Rajpurkar et al., 2017, Wang et al., 2017, Yao et al., 2017). In contrast, Gundel et al. developed a location-aware dense network (DNetLoc) based on DenseNet-121 to identify abnormalities in the CXR images from two well-known datasets: CXR14 and PLCO (Guendel et al., 2018). The authors optimally leveraged high-resolution CXR data by comprising spatial knowledge from CXR abnormalities. The proposed DNetLoc (AUC 80.7%) surpassed the AUC performance of Wang et al. (AUC 74.51%). In 2019, Baltruscha et al. employed two TL models named ResNet-38 and ResNet-101 to detect the thoracic diseases from the CXR14 dataset (Baltruschat et al., 2019). The classification was carried out by blending non-image data, for instance, patient age, gender, etc., with CXR images. ResNet-38 achieved the highest Receiver Operating Characteristic (ROC) for discriminating against 14 lung diseases when compared to the SOTA models. In order to minimize the negative effects of imbalance present in the CXR14 dataset, Wang et al. introduced an adaptive sampling strategy that automatically increases the weight of the comparatively poorly performed classes (Wang et al., 2020). For disease detection, they employed a TL model called DenseNet-121, which was trained adaptively and yielded a promising outcome. Ouyang et al. localized and diagnosed the abnormalities from the CXR14 and CheXpert databases while using a new attention-driven weakly supervised algorithm (Ouyang et al., 2020). They used gradient-base visual attention in a holistic way and achieved a more favorable mean AUC of 81.9% which was higher than the AUCs obtained from the SOTA models. Guan and Huang developed a category-wise residual attention learning model for multi-label CXR14 images classification (Guan & Huang, 2020). They considered two methods: feature embedding and attention learning. ResNet-50 and DenseNet-121 models were used for embedding the features and achieved a mean AUC of 81.6% that outperformed the SOTA models.

Oh et al. proposed a patch-based CNN that was trained with a small dataset consisting of three diseases: bacterial pneumonia, tuberculosis (TB), and viral/COVID-19 (Oh et al., 2020). The authors segmented the lung contour using the fully conventional DenseNet103 and then performed classification by ResNet-18. To detect COVID-19 and pneumonia, Khan et al. constructed a CoroNet based on the pre-trained Xception model (Khan et al. 2020). For three-class (Normal vs. Bacterial Pneumonia vs. COVID-19) and four-class (Normal vs. COVID- 19 vs. Bacterial vs. Viral Pneumonia) classifications, the proposed model achieved a precision and recall score of 93.2% and 98.2%, respectively. Pandit et al. employed VGG19 TL model to detect the COVID-19 (Pandit et al., 2021). The authors trained their model with 1,428 CXR images and obtained an accuracy of 92.53 % for binary class (COVID19 vs Normal) and 96 % for three-class (Normal vs Bacterial Pneumonia vs COVID19) classifications. To detect COVID19, viral pneumonia, and bacterial pneumonia from CXR images, Yamac et al. built a convolutional sparse support estimator network based on a neural network (Yamac et al., 2021). After training their model with a total of 6,200 CXR images, an accuracy of 0.8707 for four-class classification and 0.959 for binary classification (COVID-19 and normal). Gour and Jain employed two TL models, VGG19 and Xception, and utilized a softmax classifier to detect COVID-19 from both CXR and CT images (Gour & Jain, 2022). A sensitivity of 97.62% for three-class classification (COVID-19 vs. Pneumonia vs. Normal) was achieved. To detect viral and COVID19 pneumonia from CXR images, Chowdhury et al. used several pre-trained TL models (Chowdhury et al., 2020). For training their models, numerous datasets were merged and accuracies of 99.7% and 97.9% were attained for two- and three-class (normal vs viral vs COVID-19) classifications respectively. Rahman et al. achieved a promising accuracy of 93.3% while using pre-trained DenseNet-201 model for a three-class classification (normal vs bacterial vs viral pneumonia) (Rahman et al., 2020a).

Akter et al. used a variety of TL models, including VGG19 and GoogLeNet to identify COVID-19 (Akter et al., 2021). To balance the datasets, they employed data augmentation and trained the TL models using a total of 52,000 CXR images. Using MobileNetV2, they achieved a high accuracy of 98% for binary classification in 2 hours, 50 minutes, and 21 seconds of compilation time. To diagnose COVID-19, Rasheed et al. developed two classifiers: logistic regression (LR) and CNN (Rasheed et al., 2021). For data augmentation, a generative adversarial network was deployed, and principal component analysis (PCA) was used to identify the most significant features. A model was developed by training 308 CXR images and PCA facilitated to attain an accuracy of 97.6%. Chandra et al. created a computer-aided (CAD) system to detect TB from CXR images (Chandra et al, 2020). The authors employed a guided image filter to de-noise the image before performing lung segmentation and classification. After extracting features, a support vector machine was used to attain accuracies of 95.60% and 99.40% based on the Montgomery and Shenzhen (SZ) datasets, respectively. Sahlol et al. retrieved 50,000 important features from CXR images using a pre-trained MobileNet model. The authors selected relevant features by employing an artificial ecosystem-based optimization algorithm. TB was detected from SZ and Dataset 2 with accuracies of 90.2% and 94.1%, respectively. Rahman et al. used nine TL and two U-net models to diagnose TB, whereas they achieved the highest accuracy of 98.6% using DenseNet201 (Rahman et al., 2020b).

Fig. 1 illustrates the classification performance of selected state-of-the-art (SOTA) deep learning models in relation to the number of classes and the number of parameters involved when classifying lung diseases. It is evident that numerous models are available for multiclass lung disease classification; however, a discernible trend emerges as the number of classes increases. With the growth in the number of classes, the number of parameters escalates, subsequently leading to a decline in classification performance. This demonstrates that model complexity is directly proportional to the number of classes, rendering simple models insufficient for effectively distinguishing between a large number of lung diseases.

Although there exist many models, it has been aimed here to showcase the best five SOTA models where the selected models represent the highest performance levels achieved in their respective categories. For instance, the best AUC for 14-class classification was achieved by the model presented by Wang et al. (2017), while the highest accuracy of 99.57% for TB classification was obtained by Chowdhury et al. (2020). By choosing the top-performing models in each category, it is intended to provide a concise overview of the current advancements in this research area, illustrating the benchmark performances for various classification tasks ranging from 14-class to binary class.

In recent years, various SOTA models have demonstrated promising results in detecting multiple lung diseases, particularly in two-, three-, and four-class classifications. However, the accuracy of these models in classifying 14 distinct lung-related disorders using the CXR14 dataset was notably low. Consequently, researchers have predominantly relied on the AUC value for performance comparison. The majority of SOTA models that focused on the CXR14 dataset solely compared the AUC, neglecting other essential performance metrics such as precision, recall, specificity, and accuracy, as their values were deemed inconsequential. Remarkably, Rajpurkar et al. achieved an f1-metric of 0.435, the highest value in the last decade using the CXR14 dataset (Rajpurkar et al., 2017). However, earlier SOTA models struggled to deliver satisfactory classification performance, especially when taking into account the heightened complexity of the models, characterized by a larger number of parameters and layers. These observations underscore the limitations of existing research in handling the classification of a more extensive range of lung diseases, highlighting the need for further advancements in deep learning models to address these challenges. The development of novel methodologies capable of maintaining high classification performance, even with increased complexity, remains a crucial research objective in the field of lung disease classification.

Upon reviewing the relevant SOTA studies, the following challenges for detecting the lung diseases using the CXR images are identified.

•
Challenge 1: Binary classification between normal and one specific lung disease refers to a highly idealized situation. Real-world CXR images would contain features associated with a variety of lung diseases creating a large multiclass classification problem. Furthermore, this raises the level of complexity in classification owing to larger number of parameters and layers.
•
Challenge 2: The CXR image datasets are highly imbalanced in nature as some diseases are more frequently identified than the others. For instance, in the CXR14 dataset, the number of hernia images appeared 227 times, whereas infiltration was found in 19,871 images. One of the primary issues is that each class must contribute substantially equal to the final categorization.
•
Challenge 3: Many of the lung diseases such as COVID-19 are life threatening, therefore detecting each lung-related disease with high classification accuracy and faster processing time is of utmost importance.

The main goal of this study was to develop a computer-aided framework that could detect multi-class (17 classes: atelectasis, cardiomegaly, effusion, infiltration, mass, nodule, pneumothorax, consolidation, edema, emphysema, bacterial pneumonia, viral pneumonia, COVID-19, pleural thickening, fibrosis, hernia, tuberculosis) lung-related diseases fast and accurately with a relatively small number of parameters and layers in any practical environment (larger or small number of classes, larger or smaller dataset, balanced or unbalanced dataset, higher or lower resolution CXR images), in order to demonstrate the proposed framework's suitability in real-world applications without requiring huge computational resources. The classification of 17 types of lung diseases has been attempted for the first time in this study with simpler processing to achieve a more optimistic result than the SOTA models while cutting down on processing time and the number parameters, layers, and size. Aside from that, a hybrid parallel CNN-ELM model, combining both DL and ML models, was developed. It is challenging to integrate the ELM with parallel CNN to perform Grad-CAM visualization. Up until now, according to the authors’ best knowledge, limited study has shown Grad-CAM or any other visualization by combining the DL and ML models. Therefore, in this study, a hybrid framework was designed that combined ELM with parallel CNN to calculate the gradient from the last layer of the ELM to the first layer of the CNN, which makes the model used for explaining the decision-making of the black box CNN-ELM model. The novel hybrid model’s interpretability was demonstrated by Grad-CAM visualization to explain which part of the model focused more on images during classification than the other. Furthermore, a prototype mobile app was developed to simulate the real-life application of the proposed framework.

2. Prospective framework

Fig. 2 depicts the prospective framework for detecting multi-class lung disorders. It was crucial to merge different publicly available datasets that contained lung-related disorders in order to create a competitive dataset that represented a scenario much closer to the real world. In this study, CXR images only related to seventeen most well-known lung-related diseases including COVID-19 were combined. CXR images are easy to obtain from the patients and comparatively cheaper than other imaging methods (CT scan or MRI). No consideration was given to other less-known diseases or other sources of images in this approach. The CXR images were reshaped and normalized, then inputted into a lightweight CNN for extracting the most discriminating features. The preference was given to the lightweight CNN over other TL models due to its lower number of layers and parameters. After extracting the features, standardization was applied to the features. Finally, ELM was proposed as a classifier to detect 17 classes lung diseases from 250 extracted features. Apart from these, a heatmap from Grad-CAM was used to explore the black-box approach of the proposed parallel CNN-ELM model. From the author's best knowledge, this is the first time that a DL model was combined with a ML model to explain the proposed hybrid parallel CNN-ELM with Grad-CAM visualization. The step required for calculating the gradient from the last layer (output) of the ELM to the first layer is shown in Algorithm 1. This is carried out by replacing the final layer of CNN with the layers (input, hidden, and output layers) of ELM and integrating the trained weights and biases of the ELM hidden layers with the CNN model.

Algorithm 1

CNN-ELM algorithm for lung disease classification and for Grad-CAM visualization

1:	ModelCNN: parameter set-up CNN model
2:	Train the ModelCNN (70 epochs)
3:	FE: feature extraction from last dense layer before classification layer
4:	ModelELM: parameter set-up
5:	Train the ModelELM using extracted features (1 epoch)
6:		InputWeight (W): generates randomly
7:		OutputWeight (β): ReLU (X.W)^-1.Y, where X and Y be the input and output
8:	Merging CNN-ELM
9:		Drop last 2 dense layers from the ModelCNN
10:		Add 2 new dense layers in the ModelCNN with same parameter of the ModelELM
11:		Set weight for newly added 2 layers using layers weight (W, β) of the ModelELM
12:	Classification using CNN-ELM

Diseases Name	No of Training Images	No of Testing Images
Atelectasis	3,724	843
Cardiomegaly	874	219
Effusion	3,164	791
Infiltration	7,637	1,910
Mass	1,711	428
Nodule	2,164	541
Pneumothorax	1,755	439
Consolidation	1,048	262
Edema	502	126
Emphysema	714	178
Bacterial Pneumonia	2,222	555
Viral Pneumonia	1,194	299
COVID-19	3,354	838
Pleural Thickening	901	225
Fibrosis	582	145
Hernia	88	22
Tuberculosis	829	207
Total	32,463	8,028

Scheme	Total Nodes in Input Layer	Total Nodes in Hidden Layer	Total Nodes in Output Layer
Trial 1–17 class	250	1500	17
Trial 2–14 class	250	1500	14
Trial 3–4 class	250	500	4
Trial 4–3 class	250	500	3
Trial 5–2 class	250	200	1
Trial 6–2 class	250	200	1
Activation Function	ReLU

Layer (Type)	Output Shape	Parameters
model (Functional)	(None, 124, 124, 192)	43,776
Conv7 (Conv2D)	(None, 122, 122, 16)	27,664
bn1 (BatchNormalization)	(None, 122, 122, 16)	64
Av7 (Activation)	(None, 122, 122, 16)	0
mp1 (MaxPooling2D)	(None, 61, 61, 16)	0
Conv8 (Conv2D)	(None, 59, 59, 8)	1,160
bn2 (BatchNormalization)	(None, 59, 59, 8)	32
av2 (Activation)	(None, 59, 59, 8)	0
mp2 (MaxPooling2D)	(None, 29, 29, 8)	0
dp1 (Dropout)	(None, 29, 29, 8)	0
ft (Flatten)	(None, 6,728)	0
dense (Dense)	(None, 1,024)	6,890,496
bn4 (BatchNormalization)	(None, 1,024)	4,096
dp2 (Dropout)	(None, 1,024)	0
Feature Extraction (Dense)	(None, 250)	256,250
Hidden Layer (Dense)	(None, 1,500)	376,500
av3 (Activation)	(None, 1,500)	0
Output (Dense)	(None, 17)	25,500
Total Parameters	7,625,538
Trainable Parameters	7,623,442
Non-trainable Parameters	2,096

Trial Number	Number of Classes	Disease Names	Training Images	Testing Images
Trial 2	14	Atelectasis	3,372	843
		Cardiomegaly	874	219
		Effusion	3,164	791
		Infiltration	7,637	1,910
		Mass	1,711	428
		Nodule	2,164	541
		Pneumonia	258	64
		Pneumothorax	1,755	439
		Consolidation	1,048	262
		Edema	502	126
		Emphysema	714	178
		Fibrosis	582	145
		Pleural Thickening	901	225
		Hernia	88	22
		Total	24,770	6,139

Trial 3	4	Normal	8,153	2,039
		COVID-19	3,354	838
		Bacterial Pneumonia	2,222	555
		Viral Pneumonia	1,194	299
		Total	14,923	3,731

Trial 4	3	Normal	8,153	2,039
		Pneumonia	1,194	299
		COVID-19	3,354	838
		Total	12,701	3,176

Trial 5	2	Normal	8,153	2,039
		COVID-19	3,354	838
		Total	11,507	2,877

Trial 6	2	Normal	8,153	2,039
		Tuberculosis	829	207
		Total	8,982	2,246

Name	Parameters
Programming Language	Python
Environment	PyCharm Community Edition (2021.2.3)
Backend	Keras with TensorFlow
Processor	11th generation Intel(R) Core (TM) i9-11900 CPU @2.50 GHz
Installed RAM	32 GB
GPU	NVIDIA GeForce, RTX 3090 24 GB
Operating system	Windows 10 Pro
Input	Chest X-Ray Images
Input Size	124 × 124
App Development Platform	Android Studio 2021.1.1 (Bumblebee)
Tensorflow Lite	tensorflow-lite-support:0.3.0
Metadata Extractor	tensorflow-lite-metadata:0.3.0
Tensorflow Lite GPU Acceleration	tensorflow-lite-gpu:0.3.0

Diseases Name	Precision	Recall	F1-Score	Accuracy (%)
Atelectasis (0)	0.90	0.89	0.89	–
Cardiomegaly (1)	0.95	0.87	0.90	–
Effusion (2)	0.88	0.90	0.89	–
Infiltrate (3)	0.86	0.94	0.90	–
Mass (4)	0.92	0.85	0.88	–
Nodule (5)	0.85	0.84	0.84	–
Pneumothorax (6)	0.91	0.87	0.89	–
Consolidation (7)	0.95	0.87	0.89	–
Edema (8)	0.98	0.86	0.92	–
Emphysema (9)	0.94	0.80	0.87	–
Bacterial Pneumonia (10)	0.97	0.98	0.97	–
Viral Pneumonia (11)	0.96	0.94	0.95	–
COVID-19 (12)	0.99	1.00	0.99	–
Pleural Thickening (13)	0.94	0.83	0.88	–
Fibrosis (14)	0.94	0.79	0.86	–
Hernia (15)	1.00	0.86	0.93	–
Tuberculosis (16)	0.99	0.98	0.99	–
Average	0.94	0.89	0.91	90.92

Disease Name	Precision	Recall	F1-Score	Accuracy (%)
Normal (0)	0.98	0.98	0.98	–
COVID-19 (1)	0.96	0.96	0.96	–
Bacterial Pneumonia (2)	0.90	0.92	0.91	–
Viral Pneumonia (3)	0.84	0.78	0.81	–
Average	0.92	0.91	0.91	95.33

Diseases	Area Under Curve (AUC)
Diseases	Wang et al., 2017	Yao et al., 2017	Guendel et al., 2018	Kumar et al., 2018	Wang et al., 2020	Rajpurkar et al., 2017	CNN-ELM
Atelectasis	0.7158	0.772	0.826	0.7618	0.814	0.8094	0.9711
Cardiomegaly	0.8065	0.904	0.911	0.9133	0.899	0.9248	0.9789
Effusion	0.7843	0.859	0.885	0.8635	0.873	0.8638	0.9742
Infiltrate	0.6089	0.695	0.716	0.6923	0.701	0.7345	0.9722
Mass	0.7057	0.792	0.854	0.7502	0.840	0.8676	0.9425
Nodule	0.6706	0.717	0.774	0.6662	0.775	0.7802	0.9510
Pneumonia	0.6326	0.713	0.765	0.7145	0.662	0.7680	0.9123
Pneumothorax	0.8055	0.841	0.872	0.8594	0.865	0.8887	0.9686
Consolidation	0.7078	0.788	0.806	0.7838	0.789	0.7901	0.9387
Edema	0.8345	0.882	0.892	0.8880	0.874	0.8878	0.9685
Emphysema	0.8149	0.829	0.925	0.8982	0.924	0.9371	0.9689
Fibrosis	0.7688	0.767	0.820	0.7559	0.809	0.8047	0.9363
Pleural Thickening	0.7082	0.765	0.785	0.7739	0.772	0.8062	0.9373
Hernia	0.7667	0.914	0.941	0.8024	0.923	0.9164	0.9155
Mean	0.7451	0.761	0.807	0.7945	0.82	0.8414	0.9526

Scheme No	Reference	No. of Class	Processing Times (seconds)	Precision	Recall	Accuracy	AUC
Trial 3	Khan et al., 2020	4	–	89.84%	89.94%	89.6%	–
	Yamac et al., 2021		–	–	79.79%	87.07%	–
	Chandra et al., 2020		–	80.92%	85.66%	76.46%
	CNN-ELM		0.000997	92%	91%	95.33%	99.17%
Trial 4	Chandra et al., 2021	3	–	–	–	93.41%	–
	Pandit et al., 2021		–	–	86.7%	92.53%
	Sekeroglu and Ozsahin, 2020		–	92.70%	92.70%	95.99%	–
	Wang et al., 2020		–	93.33%	93.33%	93.3%	–
	Jain et al., 2021		–	96.33%	93%	97.97%	–
	Asnaou & Chawki, 2021		0.159	92.38%	92.11%	92.18%
	Ozturk et al., 2020		<1	89.96%	85.35%	87.02%
	Chandra et al., 2020		–	94.27%	96.40%	95.66%	–
	CNN-ELM		0.000996	99%	99%	99.30%	99.88%
Trial 5	Pandit et al.,2021	2	–	–	92.64%	96%	–
	Panwar et al., 2020		–	–	–	88.10%	88.10%
	Akter et al., 2021		–	–	–	89.6%	–
	Khan et al., 2020		–	93%	98.2%	89.6%	–
	Sekeroglu and Ozsahin, 2020		–	–	93.92%	98.39%	96.48%
	Chowdhury et al., 2020		–	97%	98%	98%	–
	Ozturk et al., 2020		<1	98.03%	95.13%	98.08%
	Brunese et al., 2020		2.498	–	94%	98%	–
	CNN-ELM		0.000996	99%	99%	99.37%	99.91%
Trial 6	Chandra et al., 2020	2	–	99.42%	99.40%	99.40%	99%
	Rahman et al., 2020a, Rahman et al., 2020b		–	98.57%	98.56%	98.60%	–
	Sahlol et al., 2020		–	–	91.94%	90.23%	–
	Ayaz et al., 2021		–	–	–	97.59%	99%
	Lopes and Valiati, 2017		–	–	–	84.7%	92.60%
	Duong et al., 2021		–	–	97.3%	98.7%	99%
	CNN-ELM		0.000996	100%	99%	99.82%	99.98%

Model Name (Ref.)	Number of Layers	Number of Parameters (million)
VGG16 (Brunese et al., 2020, Lopes and Valiati, 2017, Pandit et al., 2021, Wang et al., 2017)	16	138.3
DenseNet-121 (Guendel et al., 2018, Rajpurkar et al., 2017, Wang et al., 2020, Yao et al., 2017)	121	8.1
CoroNet (Khan et al., 2020)	71	33.97
DarkNet (Panwar et al., 2020)	106	40.5
Inception Resnet V2 (Asnaou & Chawki, 2021)	164	55.8
DenseNet 201 (Chowdhury et al., 2020, Rahman et al., 2020b)	201	20.2
ResNet 50 (Lopes and Valiati, 2017, Wang et al., 2017)	50	25.6
Inception-V3 (Jain et al., 2021)	48	23.8
Xception (Jain et al., 2021)	71	22.9
AlexNet (Duong et al., 2021, Wang et al., 2017)	8	62.37
Proposed Framework	7	7.6

PERMALINK

Parallel CNN-ELM: A multiclass classification of chest X-ray images to identify seventeen lung diseases including COVID-19

Md Nahiduzzaman

Md Omaer Faruq Goni

Rakibul Hassan

Md Robiul Islam

Md Khalid Syfullah

Saleh Mohammed Shahriar

Md Shamim Anower

Mominul Ahsan

Julfikar Haider

Marcin Kowalski

Abstract

1. Introduction

Fig. 1.

2. Prospective framework

Algorithm 1

Fig. 2.

2.1. Coupling of data sets

Table 1.

Fig. 3.

2.2. Features extraction

Fig. 4.

2.3. Extreme Learning Machine (ELM)

Table 2.

Table 3.

3. Experimental procedure

Table 4.

Table 5.

4. Results and discussions

4.1. Trial 1: Multiclass-17 lung diseases

Fig. 5.

Table 6.

Fig. 6.

4.2. Trial 2: Multiclass-14 lung diseases

Fig. 7.

Table 7.

Fig. 8.

4.3. Trial 3: Multiclass- COVID-19, bacterial and viral pneumonia

Fig. 9.

Table 8.

4.4. Trial 4: Multiclass-COVID-19 and pneumonia

Fig. 10.

Table 9.

4.5. Trial 5: Binary-Normal and COVID-19

Fig. 11.

Table 10.

4.6. Trial 6: Binary-Normal and TB

Fig. 12.

Table 11.

4.7. Performance comparison with SOTA models

Table 12.

Fig. 13.

Table 13.

Table 14.

4.8. Model’s interpretability capability

Fig. 14.

4.9. Key technical contributions and limitations

5. Application of CNN-ELM model in Android App

5.1. App design and development

Fig. 15.

5.2. App testing

Fig. 16.

6. Conclusion

CRediT authorship contribution statement

Declaration of Competing Interest

Data availability

References

Associated Data

Data Availability Statement

ACTIONS

PERMALINK

RESOURCES

Similar articles

Cited by other articles

Links to NCBI Databases