Abstract
Deep learning and artificial intelligence offer promising tools for improving the accuracy and efficiency of diagnosing various lung conditions using portable chest x-rays (CXRs). This study explores this potential by leveraging a large dataset containing over 6,000 CXR images from publicly available sources. These images encompass COVID-19 cases, normal cases, and patients with viral or bacterial pneumonia. The research proposes a novel approach called "Enhancing COVID Prediction with ESN-MDFS" that utilizes a combination of an Extreme Smart Network (ESN) and a Mean Dropout Feature Selection Technique (MDFS). This study aimed to enhance multi-class lung condition detection in portable chest X-rays by combining static texture features with dynamic deep learning features extracted from a pre-trained VGG-16 model. To optimize performance, preprocessing, data imbalance, and hyperparameter tuning were meticulously addressed. The proposed ESN-MDFS model achieved a peak accuracy of 96.18% with an AUC of 1.00 in a six-fold cross-validation. Our findings demonstrate the model’s superior ability to differentiate between COVID-19, bacterial pneumonia, viral pneumonia, and normal conditions, promising significant advancements in diagnostic accuracy and efficiency.
1. Introduction
The COVID-19 pandemic has affected the lives of millions of people, making accurate and timely diagnosis a central concern. Researchers [1] stress that the disease can be diagnosed with the Real-Time reverse transcription-Polymerase Chain Reaction (RT-PCR) method, while others [2] indicate that biomarkers are of great value during the early stages of diagnosis. The pandemic has also shifted mortality patterns, with older people accounting for most deaths, and various comorbidities and organ injuries reshape the treatment plan of COVID-19 in the clinical setting [3,4]. Together, these studies underline the dire consequences associated with delayed or improper diagnosis. Recent studies have shown that deep learning models can be used for coronavirus image classification. The researchers in [5] achieved accurate detection of COVID-19 from X-ray images with ResNet101, and the authors of [6] also reported high accuracy using a CNN. In [7], an Xception model was combined with a channel attention mechanism for the automated classification of Computed Tomography (CT) scans of COVID-19 cases, yielding high precision. Taken together, these studies imply that deep learning models can identify COVID-19 cases from images, which is useful for medical diagnosis and disease detection. Large, sophisticated deep learning models are typically much bigger than their lightweight counterparts, and this increased size brings both advantages and disadvantages. On one hand, the large capacity of deep learning models allows them to capture more complicated and subtle patterns in the data, thus improving their performance and accuracy [8].
While deep learning models have shown promise in classifying COVID-19 images, there are still limitations that need to be addressed. One such limitation is the lack of research on applying domain adaptation techniques to overcome the cross-dataset problem. Existing solutions, such as COVID19-DANet, have shown some promise but still require further improvement to achieve better results across different datasets [9]. Deep learning models, including Convolutional Neural Networks (CNNs), perform well when trained and tested on the same dataset but show significantly lower performance when applied to different datasets, indicating a lack of generalization across data sources. A further problem for COVID-19 classification is that the quality and quantity of available COVID-19 image datasets vary significantly, which affects the training and performance of deep learning models; inconsistent data quality can lead to unreliable model predictions [10]. Data enhancement methods are crucial for improving model performance, but more sophisticated techniques are needed to handle the variability in COVID-19 image datasets effectively. Supervised learning methods, while effective, require large amounts of labelled data, which is often scarce in the context of COVID-19. The researchers in [11] highlight the challenges of extensive computational resources, limited annotated datasets, and a large amount of unlabeled data, and [12] further emphasizes the difficulty of assessing severity due to small datasets and the high dimensionality of images. The authors of [13,14] both note the limitations of single-task learning and the need for more efficient models. These studies collectively underscore the need for more robust and efficient deep learning models in COVID-19 image classification, as these limitations hamper the development of robust models. Semi-supervised and unsupervised learning methods have been explored, but they still face challenges in achieving high accuracy and reliability in COVID-19 image classification.
The authors of [15] proposed a novel automated framework for the classification of tuberculosis, COVID-19, and pneumonia from chest X-ray images using deep learning and an improved optimization technique. The proposed deep learning-based framework achieved high classification accuracy (98.2%, 99.0%, and 98.7%) on three different datasets for tuberculosis, COVID-19, and pneumonia detection from chest X-ray images. The authors employed the Wilcoxon signed-rank test to statistically validate the superior performance of their proposed method, and the integration of feature fusion was instrumental in enhancing its accuracy. The researchers in [16] proposed a wrapper-based technique to improve the classification performance of chest infection (including COVID-19) detection using X-rays by extracting deep features with pretrained deep learning models, optimizing them with various optimization techniques, and using a network selection technique to choose the deep learning models. The proposed deep learning framework achieved a high classification accuracy of 97.7% in detecting chest infections, including COVID-19. Rigorous validation confirmed the framework’s reliability for classifying both COVID-19 and other chest infections, suggesting its potential as a valuable tool for clinicians.
Looking further into the problems of COVID-19 classification, one major constraint is the reliance on binary classifiers or on classifiers built for only a few classes, hindering comprehensive classification [17]. Additionally, most studies focus on flat, single-feature imaging modalities without incorporating clinical information or utilizing the hierarchical structure of pneumonia, leading to clinical challenges [18]. The limited availability of COVID-19 imaging data poses a challenge for developing effective automated image segmentation methods, impacting quantitative assessment and disease monitoring [19]. Moreover, the biggest challenge lies in the availability of training data, with data augmentation methods like Generative Adversarial Network (GAN)-based augmentation found to be subpar compared to classical methods for COVID-19 image classification [19,20].
This paper also considers how the size of a deep learning model influences different tasks and identifies certain implications of working with such models. Larger models are associated with higher computational costs and more parameters, especially for complex architectures, which means they cannot easily be deployed on devices with limited resources [21]. However, prior research has demonstrated that bigger is not always better: in the application of CNNs to brain tumour detection, smaller input sizes yielded higher accuracy and shorter training times, with 64-pixel input models outperforming larger-input models [22]. Similarly, when applying autoencoders to datasets whose features are globally similar but locally dissimilar, the authors noted that smaller batch sizes improve model performance and yield more biologically relevant information, making batch size an important consideration when designing the model [23]. Measures for addressing the challenges of large deep learning (DL) models include approaches such as increased parallelism, lighter data representations, and system-level enhancements [24].
Lightweight deep learning models reduce model size and memory requirements and optimize the model so that it can be implemented efficiently on edge devices. The primary goal of these models is to minimize computational complexity while still delivering high performance [25,26]. For example, a lightweight deep learning model created for identifying human posture exhibited a significantly smaller size of 46.2 MB, in contrast to the baseline model’s 227.8 MB [27]. Similarly, a model developed for detecting ophthalmic diseases was reported to be ten times lighter than the popular biomedical segmentation model UNet, with a memory size of around 35 MB. These smaller models play a critical role in facilitating real-time processing on battery-operated devices and enable efficient deployment on edge devices with limited resources.
The advancement of lightweight CNN models proves beneficial for diverse applications, particularly in situations with constrained computational resources. These models offer advantages such as reduced inference delay, minimal memory requirements for deployment on embedded devices, and the ability to be updated swiftly over-the-air [28]. For deployment on mobile devices with limited resources, lightweight Convolutional Neural Network (CNN) models are crucial; they achieve high accuracy while keeping computational costs low. The MobileNetV2 model utilizes depth-wise separable convolutions and inverted residual connections to reduce computation without sacrificing accuracy, achieving state-of-the-art performance on various tasks and making it ideal for mobile vision applications [29]. ShuffleNet is another lightweight architecture specifically designed for mobile devices with limited computational power; it employs pointwise group convolutions and channel shuffle operations to lower computational demands while maintaining accuracy [30]. EfficientNet belongs to a family of models that surpasses previous CNNs in both accuracy and efficiency. Obtained by scaling up MobileNets and ResNets using neural architecture search, EfficientNets achieve state-of-the-art accuracy on diverse datasets while being smaller and faster during inference than other models [31]. Furthermore, techniques such as pruning, quantization, and knowledge distillation can further reduce the size of CNN models, making them even more suitable for deployment on resource-constrained devices [32].
Lightweight CNN models are a game-changer for real-time video surveillance. They offer several key advantages: minimal inference delay, meaning they process video frames quickly for real-time analysis, low memory requirements allowing them to run on resource-constrained devices, and the capability to be trained, fine-tuned, and deployed in a distributed manner [28]. This makes them ideal for embedded systems and facilitates efficient processing of large video datasets for wider deployment. The emergence of lightweight deep learning methodologies has gained prominence due to their ability to facilitate efficient and real-time processing on edge devices. These methodologies can be broadly categorized into two approaches: developing lightweight deep learning algorithms from scratch and transforming existing models into more compact versions. Researchers have explored various lightweight models, such as SqueezeNet, ShuffleNet, and MobileNet, comparing their performance parameters with conventional models like AlexNet and GoogleNet [33]. These lightweight models have shown promising results across numerous daily life applications. Moreover, lightweight deep learning algorithms have been applied to studying slip performance in composite materials used in construction, showcasing their versatility [34]. Overall, lightweight deep learning techniques offer a promising avenue for efficient processing in resource-constrained environments, facilitating real-time processing and reducing computational complexity.
The size of trained models is a key constraint for deploying deep learning models for COVID-19 classification on edge devices, making it important to design lightweight models. Various studies have pointed out that it is beneficial to reduce the number of model parameters while retaining high accuracy in order to optimize deployment in edge environments [35–37]. Techniques such as attention modules and mixed loss functions have been suggested to reduce model size while incurring little loss of performance, so that models can be deployed effectively on edge devices with restricted resources [38]. Lightweight models such as MobileNetV2 have shown strong performance under constrained memory budgets, increasing the efficacy of deployment within edge devices [39]. Efficient deep learning neural networks combined with wearable medical sensors can be embedded in smartphone applications and similar devices, preserving patient privacy and ensuring efficient resource use [40]. Based on the above evidence, we argue that the size of trained models weighs heavily on machine learning and AI implementations deployed for COVID-19 classification on edge devices, and that model compression, selective ensemble methods, and developments such as mixed-precision training will remain pertinent. The goal is to allow precise, real-time execution of deep learning models on resource-constrained devices, which may help accelerate and improve COVID-19 detection.
Distinguishing COVID-19 from other lung infections remains a challenging task. While researchers are actively developing tools to improve prediction performance, limitations persist in the preprocessing and processing stages. This study addresses these challenges by focusing on the preprocessing stage. We propose a novel approach that utilizes median filtering and interpolation methods to remove noise from the imaging data. Additionally, we address data imbalance using data augmentation techniques and a stratified 5-fold cross-validation strategy to prevent overfitting and ensure a balanced distribution for training and validation purposes. Furthermore, we optimize the hyperparameters of the VGG-16 deep learning algorithm through a grid search method. Finally, we introduce the ESN-MDFS system, a novel approach that combines an Extreme Smart Network (ESN) with a Mean Dropout Feature Selection Technique (MDFS). This system aims to improve multi-class detection (COVID-19, normal, viral pneumonia, and bacterial pneumonia) by extracting static features using Grey Level Co-occurrence Matrix (GLCM) analysis and dynamic features through the pre-trained VGG-16 model.
2. Materials and methods
2.1. Proposed model
This study enhances multiclass COVID-19 prediction through a novel approach encompassing the following key elements as reflected in Fig 1A and 1B:
Optimized pre-processing: Chest X-ray image quality was improved using techniques such as interpolation, data cleaning, augmentation, feature engineering, image enhancement, morphological operations, segmentation, and transformation.
Feature extraction: Dynamic VGG-16 and static GLCM features were computed from multiclass data to capture diverse image characteristics.
Feature selection: A hybrid feature space (HFS) was refined using feature selection methods to eliminate redundant features, thereby improving prediction performance and reducing model size for efficient deployment on edge devices.
Classification: The optimal HFS was then fed into the robust, optimized XGBoost algorithm for improved prediction.
Hyperparameter tuning: The hyperparameters of the XGBoost machine learning algorithm were meticulously optimized.
Deep features extracted from VGG-16 provide a powerful representation of image content. They capture high-level semantic information about the image, such as the presence of specific objects or patterns. In the context of COVID-19 classification, these features can effectively discriminate between different lung pathologies, including pneumonia, viral pneumonia, and COVID-19. By leveraging the hierarchical structure of VGG-16, these features can capture subtle visual patterns that are often challenging for traditional image processing techniques.
Static GLCM features, on the other hand, provide complementary information about the texture and spatial relationships between pixels in an image. These features are sensitive to image patterns and structures, which can be crucial for differentiating between different types of lung abnormalities. By combining deep features and GLCM features, it is possible to create a more robust and discriminative feature space for multi-class COVID-19 classification.
The hybrid feature space (HFS) offers the following advantages:
Deep features and GLCM features capture different aspects of image information, leading to improved classification performance.
The combination of these features can better differentiate between subtle visual patterns associated with different lung diseases.
The use of multiple feature types can help to reduce the impact of noise and variations in image quality.
By effectively fusing these features and employing appropriate machine learning techniques, we developed highly accurate and reliable COVID-19 classification models.
2.2. Proposed model algorithm: Enhancing COVID Prediction with ESN-MDFS
Preprocessing:
    foreach image in imageDataset
        apply interpolation-based resizing to 224 × 224
        apply medianBlur for denoising
        apply intensity normalization
    end foreach

Data augmentation:
    foreach class in classes
        find the difference in class counts
        apply data augmentation using the ImageDataGenerator library to balance the classes
    end foreach

Data split:
    split the dataset into training and test sets using the train_test_split method
        train = 0.8, test = 0.2

Feature extraction:
    static features using GLCM (25 features)
    dynamic features using VGG-16 (1024 features)

Hybrid feature space (HFS):
    combine the static and dynamic features into the HFS
    apply the mean dropout technique to select the important features from the HFS

Model training and evaluation:
    train the XGBoost model on the HFS using the training and test splits
    generate the ROC curve, confusion matrix, and classification report

Deployment:
    deploy the smart (lightweight) model on edge devices
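To make the data-split step above concrete, the following minimal scikit-learn sketch performs the stratified 80/20 split; the feature matrix and labels are random placeholders rather than the study's actual data.

```python
import numpy as np
from sklearn.model_selection import train_test_split

# Placeholder data: 100 feature vectors of length 10 and four balanced classes.
features = np.random.rand(100, 10)
labels = np.repeat(["bacterial", "covid", "normal", "viral"], 25)

X_train, X_test, y_train, y_test = train_test_split(
    features, labels,
    test_size=0.2,      # 20% held-out test split (train = 0.8)
    stratify=labels,    # preserve the class distribution in both splits
    random_state=42,    # fixed seed for reproducibility
)
print(X_train.shape, X_test.shape)
```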
2.3. Dataset
To train our deep CNN for distinguishing COVID-19 from other pneumonia types, we leveraged a diverse dataset compiled from several publicly available sources, similar to the approach used in previous studies [41–43]. The dataset comprises chest X-ray images of COVID-19 cases (N = 1525), sourced from Cohen et al. via GitHub [44], Radiopaedia, SIRM, and TCIA; pneumonia cases (N = 3863), retrieved from the Kaggle repository; and normal individuals (N = 1525), sourced from the Kaggle repository and the NIH dataset. This multifaceted dataset, encompassing images from various public sources, strengthens the generalizability and robustness of our model.
2.4. Preprocessing
2.4.1. Image preprocessing
To unlock valuable insights from images, we employ image preprocessing. This vital step refines image quality and readies it for further analysis. Fig 2 showcases three key aspects of this transformation: noise reduction for a clearer view, feature enhancement for sharper details, and normalization for seamless integration into subsequent steps.
1. Interpolation: Interpolation deals with the problem of missing data in an image. It fills in gaps in the image data with estimated values based on the surrounding pixels, which is important for tasks such as image resizing, scaling, and registration.

2. Noise removal: Noise is unwanted information that can corrupt an image and interfere with subsequent analysis. It can originate from various sources, such as sensor imperfections, environmental factors, and data transmission errors. Noise removal aims to remove or suppress this noise while preserving the true image content. Various noise removal filters are available, each targeting specific types of noise; here we used the median blur filter.

3. Intensity normalization: Intensity normalization adjusts the intensity values of an image to a desired range, which can be necessary for tasks such as image registration, segmentation, and feature extraction. We utilized histogram equalization.
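As an illustration of these three steps, the OpenCV sketch below resizes with interpolation, applies a median filter, and performs histogram-equalization-based intensity normalization; the kernel size and file name are illustrative assumptions, not values reported by the authors.

```python
import cv2

def preprocess(path):
    # Load the chest X-ray as a single-channel (grayscale) image.
    img = cv2.imread(path, cv2.IMREAD_GRAYSCALE)
    # Interpolation-based resizing to the 224x224 input size (INTER_AREA, see Section 2.4.3).
    img = cv2.resize(img, (224, 224), interpolation=cv2.INTER_AREA)
    # Median blur to suppress impulse-like noise; kernel size 3 is an assumption.
    img = cv2.medianBlur(img, 3)
    # Histogram equalization as intensity normalization, then scale to [0, 1].
    img = cv2.equalizeHist(img)
    return img.astype("float32") / 255.0

# Example usage with a hypothetical file name.
# x = preprocess("covid_case_001.png")
```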
2.4.2. Data augmentation
To compensate for potential limitations in the dataset, we employed two crucial strategies: data augmentation and stratified splitting. Data augmentation artificially expands the dataset by generating variations of existing samples, making the model more robust to real-world variations and preventing overfitting [45]. Techniques like adding noise, applying transformations, and generating synthetic data were utilized to achieve equal representation of all classes, further enhanced by stratified splitting during data division. This ensures each training and test set accurately reflects the distribution of classes in the original data. Fig 3 depicts the data augmentation methods.
Rotation: Randomly rotates the image by an angle within a predefined range, helping the model learn to recognize objects from different angles. Range: -50 to 20 degrees; probability: 0.2.

Horizontal flipping: Flips the image horizontally (left-right), helping the model learn to recognize objects that are not symmetrically aligned. Probability: 1.0.

Vertical flipping: Flips the image vertically (top-bottom), helping the model learn to recognize objects that are not symmetrically aligned. Probability: 1.0.

Image shearing: Applies a shearing transformation to the image, distorting it in a parallelogram-like shape. This data augmentation technique helps the model develop viewpoint invariance, allowing it to recognize objects even when viewed from different angles. Angle: -40 to 40; probability: 0.2.

Gamma contrast: Adjusts the gamma value of the image, changing the overall brightness and contrast. This technique can enhance the model’s illumination invariance, allowing it to recognize objects even under varying lighting conditions. Range: 0.5 to 2; probability: 0.2.

Sigmoid contrast: Applies a sigmoid function to the image’s pixel intensities, enhancing the contrast between dark and bright areas. This can help the model learn to recognize objects with different brightness levels. Coefficient: 5 to 10; probability: 0.2.

Linear contrast: Adds or subtracts a constant value to all pixel intensities of the image, adjusting the overall brightness. This can help the model learn to recognize objects under different lighting conditions. Delta: -0.2 to 0.2; probability: 0.2.

Elastic transform: Applies an elastic transformation to the image, distorting it in a rubber-like manner. This can help the model learn to recognize objects under different deformations. Alpha: 60; sigma: 4; probability: 0.2.

Polar transform: Converts an image from Cartesian coordinates to polar coordinates, applies random rotations and shifts, and then converts back to Cartesian coordinates. This can help the model learn to recognize objects under different rotational perspectives. Max magnitude: -0.2 to 0.7; probability: 0.2.

Jigsaw transform: Divides an image into multiple smaller patches and randomly rearranges them, creating a new image. This can help the model learn to recognize objects from fragmented views. Grid size: 4x4 to 8x8; pixel interpolation: 3 to 7; probability: 0.2.

Invert image: Negates all pixel intensities of the image, creating a negative image. This can help the model learn to recognize objects based on their shape and texture, not just their color. Probability: 1.0.

Polarize image: Randomly increases or decreases the saturation of the image, creating a more intense or muted color palette. This can help the model learn to recognize objects under different color conditions. Factor: 0.5 to 2; probability: 0.2.
After augmentation, the number of images per class is equalized to N = 2521.
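The pipeline sketch below illustrates a subset of the geometric and contrast augmentations listed above using the imgaug library, whose operator names happen to match these transforms; the paper does not state which augmentation library was used, so this is an assumed implementation using the stated ranges and probabilities.

```python
import numpy as np
import imgaug.augmenters as iaa

sometimes = lambda aug: iaa.Sometimes(0.2, aug)  # most transforms use probability 0.2

augmenter = iaa.Sequential([
    sometimes(iaa.Affine(rotate=(-50, 20))),                   # rotation range from the text
    sometimes(iaa.Affine(shear=(-40, 40))),                    # image shearing
    sometimes(iaa.GammaContrast((0.5, 2.0))),                  # gamma contrast
    sometimes(iaa.SigmoidContrast(gain=(5, 10))),              # sigmoid contrast
    sometimes(iaa.ElasticTransformation(alpha=60, sigma=4)),   # elastic transform
    iaa.Fliplr(1.0),                                           # horizontal flip, probability 1.0
    iaa.Flipud(1.0),                                           # vertical flip, probability 1.0
], random_order=True)

# Example: augment a batch of 8 dummy 224x224 grayscale images.
batch = np.random.randint(0, 255, (8, 224, 224, 1), dtype=np.uint8)
augmented = augmenter(images=batch)
print(augmented.shape)
```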
2.4.3. Image resize
For image resizing, we opted for the "inter area" interpolation method, a technique often used in computer vision to estimate the value of a pixel based on its surroundings [46]. This specific method, utilizing the average values of nearby pixels, excels at producing smooth and accurate results, especially when downscaling images. Unlike some other interpolation methods, "inter area" considers the contributions of multiple surrounding pixels, leading to a visually pleasing and artifact-free outcome.
2.4.4. Hyperparameters optimization
Optimizing hyperparameters plays a crucial role in fine-tuning the performance of deep learning models like VGG-16. By adjusting these settings, we can achieve optimal accuracy, minimize overfitting, and improve the model’s generalizability to unseen data.
Here’s a breakdown of key VGG-16 hyperparameters and their potential grid search values:
Learning Rate: The learning rate governs the step size taken during gradient descent, affecting the speed of weight updates in the model. Grid Values: [0.0001, 0.001, 0.01, 0.1]
Momentum: Helps the model overcome shallow local minima by incorporating the direction of past gradients. Grid Values: [0.0, 0.5, 0.9, 0.99]
Weight Decay: Regularizes the model by penalizing large weights, preventing overfitting. Grid Values: [1e-4, 1e-5, 1e-6, 0]
Batch Size: Number of samples processed together during training. Grid Values: [8, 16, 32, 64]
Optimizer: Algorithm used to update the model’s weights based on the loss function. Grid Values: [Adam, SGD, RMSprop]
Number of Training Epochs: The number of complete passes over the training dataset. Grid Values: [10, 20, 50, 100]
Activation Function: Embeds non-linear decision boundaries, empowering the network to capture intricate interactions between features. Grid Values: [ReLU, Leaky ReLU, tanh]
Dropout Rate: Randomly drops out neurons during training, preventing co-adaptation and improving generalizability. Grid Values: [0.2, 0.3, 0.4, 0.5]
Early Stopping: Monitors a validation metric and stops training when it stagnates, preventing overfitting. Grid Values: [Patience: 5, 10, 15]
2.4.5. VGG-16 hyperparameter optimization
To optimize the performance of our model, we employed a grid search method to identify the most effective hyperparameter settings. The following values were selected as optimal: Learning Rate: 0.001, Momentum: 0.9, Weight Decay: 1e-5, Batch Size: 32, Optimizer: Adam, Epochs: 50, Activation: ReLU, Dropout: 0.3, Early Stopping Patience: 10.
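A condensed sketch of such a grid search is shown below, assuming a Keras implementation of the fine-tuned VGG-16 head; only two hyperparameters are swept to keep the example short, the dummy data stands in for the real chest X-rays, and the model-building details are illustrative assumptions rather than the authors' exact configuration.

```python
import itertools
import tensorflow as tf

def build_model(learning_rate, dropout):
    base = tf.keras.applications.VGG16(weights="imagenet", include_top=False,
                                       input_shape=(224, 224, 3))
    base.trainable = False                                  # freeze the convolutional base
    model = tf.keras.Sequential([
        base,
        tf.keras.layers.GlobalAveragePooling2D(),
        tf.keras.layers.Dense(1024, activation="relu"),
        tf.keras.layers.Dropout(dropout),
        tf.keras.layers.Dense(4, activation="softmax"),     # four target classes
    ])
    model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate),
                  loss="categorical_crossentropy", metrics=["accuracy"])
    return model

# Dummy data standing in for the preprocessed chest X-rays (assumption).
x = tf.random.uniform((16, 224, 224, 3))
y = tf.one_hot(tf.random.uniform((16,), maxval=4, dtype=tf.int32), 4)
train_ds = tf.data.Dataset.from_tensor_slices((x, y)).batch(8)
val_ds = train_ds

grid = {"learning_rate": [1e-4, 1e-3, 1e-2, 1e-1], "dropout": [0.2, 0.3, 0.4, 0.5]}
best = None
for lr, dr in itertools.product(grid["learning_rate"], grid["dropout"]):
    model = build_model(lr, dr)
    history = model.fit(train_ds, validation_data=val_ds,
                        epochs=5,   # kept small for the sketch; the paper selected 50
                        callbacks=[tf.keras.callbacks.EarlyStopping(patience=10)],
                        verbose=0)
    score = max(history.history["val_accuracy"])
    if best is None or score > best[0]:
        best = (score, lr, dr)
print("best validation accuracy %.4f at lr=%g, dropout=%g" % best)
```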
2.5. Feature extraction
2.5.1. Static feature extraction based on Grey-level Co-occurrence Matrix (GLCM)
This approach utilizes GLCM, a texture feature extraction method introduced by Haralick in 1973, to analyze the input image [47].
The Gray Level Co-occurrence Matrix (GLCM) is a statistical technique used to extract texture features from images by analyzing the spatial relationships between pixel intensities. Its applications span various domains, including SAR imagery for land cover classification (water, vegetation, urban areas) [48] and medical imaging for detecting retinal abnormalities [49], where color features have shown superior accuracy. Gray Level Co-occurrence Matrix (GLCM) analysis computes the frequency of pixel pairs with specific intensity values and spatial relationships, forming a matrix from which statistical features can be extracted. In this study, 25 GLCM features were initially calculated and subsequently reduced to 17 through MeanDropout feature selection.
GLCM characterizes texture by analyzing the spatial relationships between neighboring pixels. This is achieved in two steps:
Step 1: Building the GLCM. Pixel pairs separated by a specific distance (d) and direction (θ) are counted and tabulated. This establishes a spatial relationship between a reference pixel and its neighbors.
Step 2: Feature extraction. From the GLCM, a set of scalar quantities is computed, each capturing different aspects of the original texture. These quantities, collectively forming the GLCM features, represent the frequency of various gray-level combinations occurring within the image [47].
Introduced in 1973 by Haralick et al. [47], the GLCM characterizes texture through statistical measures derived from the second-order statistics of the image: for each pixel, the frequency with which its gray level co-occurs with the gray levels of its neighbours at a specific distance (d) and direction (θ) is tabulated, and a set of scalar features capturing aspects of the texture such as contrast, homogeneity, and directionality is then computed from the resulting co-occurrence matrix. The resulting GLCM encodes the frequency of different gray-level combinations within the image, providing valuable information about the underlying texture patterns [47]. Texture features computed from the GLCM include the Inverse Difference Moment [50], Contrast, Energy [50], Entropy [50], Cluster Shade, Sum Average, Homogeneity [50–52], Sum of Squares Variance [53], and Correlation [52], among others. The GLCM features are detailed and utilized in studies [53–56].
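A short scikit-image sketch of the two GLCM steps described above is given below; it computes only a handful of the Haralick-style descriptors (the study used 25 features), and the distance and angle settings are assumptions rather than values reported by the authors.

```python
import numpy as np
from skimage.feature import graycomatrix, graycoprops

def glcm_features(gray_img, distances=(1,), angles=(0, np.pi/4, np.pi/2, 3*np.pi/4)):
    # Step 1: tabulate co-occurrences of gray levels at distance d and direction theta.
    glcm = graycomatrix(gray_img, distances=distances, angles=angles,
                        levels=256, symmetric=True, normed=True)
    # Step 2: derive scalar texture descriptors from the co-occurrence matrix.
    feats = {}
    for prop in ("contrast", "homogeneity", "energy", "correlation", "dissimilarity", "ASM"):
        feats[prop] = float(graycoprops(glcm, prop).mean())   # average over distances/angles
    # Entropy is not provided by graycoprops, so compute it directly from the matrix.
    p = glcm + 1e-12
    feats["entropy"] = float(-(p * np.log2(p)).sum() / (len(distances) * len(angles)))
    return feats

# Example on a dummy 8-bit image standing in for a preprocessed chest X-ray.
img = np.random.randint(0, 256, (224, 224), dtype=np.uint8)
print(glcm_features(img))
```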
2.5.2. Dynamic feature extraction from VGG-16 CNN model
VGG-16 is a convolutional neural network (CNN) architecture proposed by researchers at the Visual Geometry Group (VGG) at the University of Oxford in 2014. It is named after the group that developed it and after its 16 weight layers (13 convolutional and 3 fully connected layers, excluding the pooling layers). The architecture was designed for the ImageNet Large Scale Visual Recognition Challenge (ILSVRC) 2014, which involved classifying images into one of 1,000 categories, and it achieved prominence through its success in that challenge. Its design, characterized by stacks of small 3x3 convolutional filters, yields a deep model whose depth enhances feature extraction but also increases susceptibility to overfitting on smaller datasets. Nevertheless, VGG-16 remains influential due to its strong performance on large-scale image classification tasks.
The VGG-16 model was employed for feature extraction, generating 1024-dimensional feature vectors for each image in the dataset. To adapt the model to the specific characteristics of our target problem, the final four fully connected layers of VGG-16 were re-trained on the selected datasets, giving 1,051,648 (first Dense) + 2,099,328 (second Dense) + 4,100 (final Dense) = 3,155,076 trainable parameters. Optimal performance hinges on careful selection of hyperparameters; the learning rate, optimizer, batch size, number of epochs, and regularization settings were fine-tuned using Bayesian optimization.
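The sketch below illustrates one way to build such a dynamic feature extractor in Keras: the VGG-16 convolutional base is kept frozen, a new dense head is trained on the four classes, and the penultimate 1024-unit layer is read out as the feature vector. The layer widths and pooling choice are assumptions chosen only to reproduce the 1024-dimensional output, not the authors' exact layer sizes or parameter counts.

```python
import tensorflow as tf

base = tf.keras.applications.VGG16(weights="imagenet", include_top=False,
                                   input_shape=(224, 224, 3))
base.trainable = False                       # re-train only the new top layers

inputs = tf.keras.Input(shape=(224, 224, 3))
x = base(inputs, training=False)
x = tf.keras.layers.GlobalAveragePooling2D()(x)
x = tf.keras.layers.Dense(2048, activation="relu")(x)
features = tf.keras.layers.Dense(1024, activation="relu", name="deep_features")(x)
outputs = tf.keras.layers.Dense(4, activation="softmax")(features)  # four classes

classifier = tf.keras.Model(inputs, outputs)
classifier.compile(optimizer=tf.keras.optimizers.Adam(1e-3),
                   loss="categorical_crossentropy", metrics=["accuracy"])
# ... classifier.fit(train_ds, validation_data=val_ds, epochs=50) on the CXR data ...

# After fine-tuning, the 1024-dimensional dynamic features are read from the
# penultimate layer and later fused with the static GLCM features.
feature_extractor = tf.keras.Model(inputs, classifier.get_layer("deep_features").output)
dynamic_features = feature_extractor(tf.random.uniform((2, 224, 224, 3))).numpy()
print(dynamic_features.shape)   # (2, 1024)
```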
2.5.3. Hybrid feature model
By fusing static GLCM features with dynamic features learned from the VGG-16 model, a robust hybrid feature space is created. This approach effectively leverages the complementary strengths of both modalities: GLCM’s emphasis on textural information and VGG-16’s extraction of high-level visual representations. The resulting feature space significantly enhances image analysis tasks, including classification, object recognition, and texture analysis.
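For completeness, a minimal sketch of the fusion step follows: the 1024 dynamic features and the GLCM features are concatenated per image into the hybrid feature space (an assumed but straightforward implementation; the arrays below are placeholders).

```python
import numpy as np

# dynamic_features: (n_images, 1024) from the fine-tuned VGG-16 extractor.
# static_features:  (n_images, 25) from the GLCM descriptors.
dynamic_features = np.random.rand(8, 1024)   # placeholders for the real vectors
static_features = np.random.rand(8, 25)

hybrid_feature_space = np.hstack([dynamic_features, static_features])
print(hybrid_feature_space.shape)   # (8, 1049)
```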
2.5.4. Mean dropout feature selection method for Hybrid Feature space (HFS)
We propose a mean dropout technique for feature selection, defined as follows.
Let X be the dataset represented by All_Features, let C be the index of the last column of All_Features, which contains the class label, let F be the set of features (all columns other than the class column), and let Y be the set of unique class labels taken from the last column.
The mean of feature f within class y is

$\mathrm{AllFeaturesMean}(y,f) = \dfrac{1}{N_y}\sum_{x \in X} x[f]\,\big[\,y = x[C]\,\big]$

Where:
AllFeaturesMean(y, f) represents the mean value of feature f within class y.
N_y is the number of instances in class y.
x[f] represents the value of feature f for data point x.
[y = x[C]] is an indicator function that equals 1 if y equals x[C], and 0 otherwise.
The sum is taken over all data points x in the dataset.
More concisely, the per-class means can be collected as

$\mathrm{AllFeaturesMean}(y) = \{\mathrm{AllFeaturesMean}(y,f) : f \in F\}$

Where:
AllFeaturesMean(y) is the set of mean values of all features within class y.
f ∈ F iterates over all features in the dataset.
Dropping all features whose mean value is the same across different classes, the selection rule for class y becomes

$\mathrm{SelectedFeatures}(y) = \{\, f \in F : \mathrm{AllFeaturesMean}(y,f) \neq \mathrm{AllFeaturesMean}(y',f)\ \ \forall\, y' \in Y,\ y' \neq y \,\}$

Where:
SelectedFeatures(y) is the set of features within class y after filtering out those with equal mean values.
AllFeaturesMean(y, f) represents the mean value of feature f within class y.
The condition y′ ≠ y ensures that we are comparing distinct classes.
The inequality checks that the mean values of feature f are not equal across different classes.
The selected features for all classes are then

$\mathrm{SelectedFeaturesAllClasses} = \bigcap_{y \in Y} \mathrm{SelectedFeatures}(y)$

Where:
SelectedFeaturesAllClasses is the set of features that have distinct mean values across all classes.
∩ represents the intersection operation over all classes.
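Under these definitions, the pandas sketch below implements the mean dropout selection: class-wise feature means are compared, and a feature is kept only if its means differ across all pairs of classes (an exact-equality check mirroring the formulation above; in practice a numerical tolerance could be used).

```python
import pandas as pd
from itertools import combinations

def mean_dropout_select(df, class_col="Class"):
    """Return the features whose class-wise means are distinct across all classes."""
    class_means = df.groupby(class_col).mean()          # AllFeaturesMean(y, f)
    selected = []
    for f in class_means.columns:                        # iterate over the features F
        means = class_means[f]
        # Keep f only if no pair of distinct classes shares the same mean value.
        if all(means[a] != means[b] for a, b in combinations(means.index, 2)):
            selected.append(f)
    return selected

# Example with a toy hybrid feature space (placeholder values, not real data).
df = pd.DataFrame({
    "f1": [1.0, 1.2, 3.0, 3.1],        # distinct class means -> kept
    "f2": [2.0, 2.0, 2.0, 2.0],        # identical class means -> dropped
    "Class": ["covid", "covid", "normal", "normal"],
})
print(mean_dropout_select(df))          # ['f1']
```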
2.5.5. Extreme Boosting (XGBoost) model for classification
The combined static and dynamic feature set was input into an XGBoost model for multiclass classification. XGBoost [57], an ensemble learning algorithm, constructs multiple models sequentially, with each subsequent model addressing the shortcomings of its predecessors. This approach, rooted in gradient boosting, incorporates regularization to prevent overfitting and accommodate various loss functions [58].
While XGBoost traditionally uses convex loss functions, recent research has explored custom and non-convex loss functions to enhance performance in specific applications [59]. For instance, [60] investigated the use of squared logistic loss (SqLL) to improve accuracy, [59] developed weighted softmax loss functions for industrial applications, and [61] proposed a generalized XGBoost method accommodating both convex and some non-convex loss functions. These advancements demonstrate XGBoost’s versatility and potential for tailored solutions in various domains, including big data analysis and multi-objective parameter regularization. The purpose of the XGBoost classifier is multifaceted and versatile, as evidenced by various research studies. XGBoost has been used to enhance prediction accuracy in diverse fields, such as meteorology for hailstorm forecasting [62], detecting patterns in financial datasets to differentiate between solvent and bankrupt situations [63], improving learner performance prediction in Intelligent Tutoring Systems by enhancing models like Performance Factor Analysis and DAS3H [64], and detecting malware in Internet of Medical Things (IoMT) data for better medical assistance through dimensionality reduction and efficient classification [65]. The XGBoost algorithm’s scalability, robustness, and proficiency with complex datasets make it a valuable tool for increasing prediction accuracy, addressing class imbalances, enhancing performance prediction models, and improving data analysis in various domains.
By building upon these principles, XGBoost has demonstrated superior performance in tasks such as lung cancer detection. While traditional gradient boosting involves a single optimization step per iteration, XGBoost employs a two-stage approach that separates the optimization of the step direction from the step size. At iteration m, the algorithm seeks the tree f_m that solves

$f_m = \arg\min_{f} \sum_{i=1}^{N} L\big(y_i,\ F_{m-1}(x_i) + f(x_i)\big)$  (1)

For every x in the data, the step is fixed directly by expanding the loss around the current prediction F_{m−1}(x). We have

$g_m(x_i) = \dfrac{\partial L\big(y_i, F_{m-1}(x_i)\big)}{\partial F_{m-1}(x_i)}$  (2)

$h_m(x_i) = \dfrac{\partial^2 L\big(y_i, F_{m-1}(x_i)\big)}{\partial F_{m-1}(x_i)^2}$  (3)

$\sum_{i=1}^{N} L\big(y_i, F_{m-1}(x_i) + f(x_i)\big) \approx \sum_{i=1}^{N}\Big[L\big(y_i, F_{m-1}(x_i)\big) + g_m(x_i)\,f(x_i) + \tfrac{1}{2}\,h_m(x_i)\,f(x_i)^2\Big]$  (4)

leveraging the second-order Taylor expansion to approximate the loss function, where g_m(x) is the gradient and h_m(x) is the Hessian.

Then, dropping the constant term, the loss function can be rewritten as

$\tilde{L}_m = \sum_{i=1}^{N}\Big[g_m(x_i)\,f(x_i) + \tfrac{1}{2}\,h_m(x_i)\,f(x_i)^2\Big]$  (5)

and, for a tree with leaf regions R_{jm} and leaf weights w_{jm},

$\tilde{L}_m = \sum_{j=1}^{T}\ \sum_{x_i \in R_{jm}}\Big[g_m(x_i)\,w_{jm} + \tfrac{1}{2}\,h_m(x_i)\,w_{jm}^2\Big]$  (6)

In region j, let G_{jm} denote the sum of gradients and H_{jm} the sum of Hessians; then the equation becomes

$\tilde{L}_m = \sum_{j=1}^{T}\Big[G_{jm}\,w_{jm} + \tfrac{1}{2}\,H_{jm}\,w_{jm}^2\Big]$  (7)

The optimal leaf value can be computed using the function below:

$w_{jm}^{*} = -\dfrac{G_{jm}}{H_{jm}}$  (8)

We get the following loss function when we plug it back:

$\tilde{L}_m = -\dfrac{1}{2}\sum_{j=1}^{T}\dfrac{G_{jm}^{2}}{H_{jm}}$  (9)

The tree structure is scored using this function; the lower the score, the better the structure [57]. The maximum gain for every split is

$\mathrm{Gain} = \tilde{L}_{\mathrm{before}} - \tilde{L}_{\mathrm{after}}$  (10)

which is

$\mathrm{Gain} = \dfrac{1}{2}\left[\dfrac{G_{L}^{2}}{H_{L}} + \dfrac{G_{R}^{2}}{H_{R}} - \dfrac{(G_{L}+G_{R})^{2}}{H_{L}+H_{R}}\right]$  (11)

For improved performance, we can rewrite the loss function as follows, incorporating the regularization criteria:

$L_m = \sum_{i=1}^{N}\Big[g_m(x_i)\,f(x_i) + \tfrac{1}{2}\,h_m(x_i)\,f(x_i)^2\Big] + \gamma T + \alpha\sum_{j=1}^{T}\lvert w_{jm}\rvert + \tfrac{1}{2}\,\lambda\sum_{j=1}^{T} w_{jm}^{2}$  (12)

$L_m = \sum_{j=1}^{T}\Big[G_{jm}\,w_{jm} + \tfrac{1}{2}\,\big(H_{jm}+\lambda\big)\,w_{jm}^{2} + \alpha\,\lvert w_{jm}\rvert\Big] + \gamma T$  (13)

where γ penalizes the number of leaves, α denotes the L1 regularization weight, and λ denotes the L2 regularization weight. The optimal weight for each region j (taking α = 0) is

$w_{jm}^{*} = -\dfrac{G_{jm}}{H_{jm}+\lambda}$  (14)

And the gain is

$\mathrm{Gain} = \dfrac{1}{2}\left[\dfrac{G_{L}^{2}}{H_{L}+\lambda} + \dfrac{G_{R}^{2}}{H_{R}+\lambda} - \dfrac{(G_{L}+G_{R})^{2}}{H_{L}+H_{R}+\lambda}\right] - \gamma$  (15)

where

$G_{L} = \sum_{x_i \in R_{L}} g_m(x_i),\quad H_{L} = \sum_{x_i \in R_{L}} h_m(x_i),\quad G_{R} = \sum_{x_i \in R_{R}} g_m(x_i),\quad H_{R} = \sum_{x_i \in R_{R}} h_m(x_i)$  (16)

with R_L and R_R denoting the left and right child regions of the candidate split.
The XGBoost classifier stands out for several reasons. It offers a rich set of randomization and regularization options during the learning process, which helps to prevent overfitting and improve model generalizability. Additionally, XGBoost boasts faster training times and user-friendliness. To leverage these advantages in our study, we employed the following hyperparameters as reflected in Table 1.
Table 1. Hyperparameters optimization of XGBoost algorithm.
| Model | Hyperparameter | Search Values | Tuned Value |
|---|---|---|---|
| XGBoost | booster | gbtree, gblinear | gbtree |
| | colsample_bytree | 0.4, 0.6, 0.8, 1 | 1 |
| | learning_rate | 0.01, 0.1, 0.2, 0.4 | 0.1 |
| | max_depth | 2, 3, 4, 6 | 6 |
| | n_estimators | 200, 300, 400, 500 | 200 |
| | subsample | 0.4, 0.6, 0.8, 1 | 0.8 |
The core challenge in optimizing the loss function is finding the minimum value, which can be local or global depending on the function’s shape (e.g., quadratic functions). To address overfitting, XGBoost introduces new regularization features, enhancing its ability to resist this common problem. The detailed structure of XGBoost is illustrated in Fig 4.
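The sketch below trains the final classifier on the hybrid feature space with the tuned hyperparameters from Table 1; the data arrays are placeholders, and the multi-class objective and label encoding are assumed details not spelled out in the text.

```python
import numpy as np
from xgboost import XGBClassifier
from sklearn.model_selection import train_test_split

# Placeholder hybrid feature space after mean dropout selection.
X = np.random.rand(400, 1041)                       # e.g. selected HFS features
y = np.repeat([0, 1, 2, 3], 100)                    # bacterial, COVID-19, normal, viral

X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2,
                                                    stratify=y, random_state=42)

model = XGBClassifier(
    booster="gbtree",            # tuned values from Table 1
    colsample_bytree=1,
    learning_rate=0.1,
    max_depth=6,
    n_estimators=200,
    subsample=0.8,
    objective="multi:softprob",  # four-class probabilities (assumed setting)
    eval_metric="mlogloss",
)
model.fit(X_train, y_train)
print("test accuracy:", model.score(X_test, y_test))
```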
2.6. Performance evaluations measures
We employ standard performance evaluation metrics as outlined in [66]:
2.6.1. Precision
Precision (P) represents the proportion of documents retrieved by the model that are actually relevant:

$P = \dfrac{TP}{TP + FP}$

2.6.2. Recall
Recall (R) represents the proportion of relevant documents that the model successfully retrieves out of all the relevant documents in the dataset:

$R = \dfrac{TP}{TP + FN}$

2.6.3. F-measure
The F1-measure calculation treats each record as a query-class pair. In this context, each class represents the desired documents for the query (record), and we compute both recall and precision for each class within that record. The F1-measure of record j and class i is defined as follows:

$F_1(i,j) = \dfrac{2\,P(i,j)\,R(i,j)}{P(i,j) + R(i,j)}$
2.7. Receiver operating characteristic (ROC) curve
To assess our classifier’s ability to distinguish between COVID-19 and non-COVID-19 cases, we employed sensitivity (True Positive Rate) and specificity (1 − False Positive Rate). We assigned binary labels to cases and generated a Receiver Operating Characteristic (ROC) curve, which plots sensitivity (True Positive Rate) against 1 − specificity (False Positive Rate). The ROC curve’s shape and the Area Under the Curve (AUC) quantify the classifier’s performance; a higher AUC indicates better separation between the two classes. Sensitivity reflects the proportion of correctly identified COVID-19 cases, while specificity reflects the proportion of correctly identified non-COVID-19 cases [67]. The ROC curve and AUC provide a visual and numerical assessment of the classifier’s ability to differentiate between the disease and healthy cases [68].
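Continuing the XGBoost sketch from Section 2.5.5, the standard metrics and one-vs-rest ROC/AUC values described in Sections 2.6 and 2.7 can be computed with scikit-learn; the variables and class names follow the earlier placeholder example.

```python
from sklearn.metrics import classification_report, confusion_matrix, roc_curve, auc
from sklearn.preprocessing import label_binarize

# Predicted labels and class probabilities from the fitted XGBoost model (see 2.5.5).
y_pred = model.predict(X_test)
y_prob = model.predict_proba(X_test)

print(confusion_matrix(y_test, y_pred))
print(classification_report(y_test, y_pred,
                            target_names=["bacterial", "COVID-19", "normal", "viral"]))

# One-vs-rest ROC curves: each class is treated as positive against the rest.
y_bin = label_binarize(y_test, classes=[0, 1, 2, 3])
for i, name in enumerate(["bacterial", "COVID-19", "normal", "viral"]):
    fpr, tpr, _ = roc_curve(y_bin[:, i], y_prob[:, i])
    print(f"AUC ({name}): {auc(fpr, tpr):.3f}")
```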
3. Results
This section presents a detailed analysis of the proposed ESN-MDFS model’s performance through confusion matrices, tabular data, and AUC values. Additionally, a comparative evaluation with existing studies is provided.
Fig 5 presents the multi-class COVID-19 detection results exclusively based on VGG-16 deep features, as visualized through confusion matrices (5a), classification reports (5b), AUC curves (5c), and accuracy loss curves (5d). Relying solely on deep features, the model achieved an overall accuracy of 93% in classifying the four target classes. Notably, the AUC for multi-class differentiation (bacterial, COVID-19, viral) was 0.99, while perfect discrimination (AUC of 1.00) was observed for the normal class.
Fig 6 illustrates the multi-class COVID-19 detection performance solely based on XGBoost-processed static features, as depicted in the confusion matrix (6a), classification report (6b), and AUC curve (6c). Relying exclusively on static features, the model achieved an overall accuracy of 86% in classifying the four target classes.
Fig 7 presents the confusion matrix distinguishing the four classes (COVID-19, normal, bacterial pneumonia, and viral pneumonia) obtained with the Extreme Smart Network (ESN) on the hybrid feature space without mean dropout.
Table 2 reports the multiclass classification performance of the Extreme Smart Network on the hybrid feature space without mean dropout. The model achieved an accuracy of 95.54% in classifying images into four classes (Bacterial, COVID-19, Normal, and Viral), meaning that it correctly classified 95.54% of the images in the dataset. For the bacterial class, the model demonstrated excellent performance, with a precision, recall, and F1-score of 99.20%, 98.21%, and 98.70%, respectively, suggesting that it is highly effective in correctly identifying bacterial infections. For COVID-19, performance is still good but slightly lower than for the other classes, with a precision, recall, and F1-score of 93.66%, 90.69%, and 92.15%, respectively, indicating room for improvement in accurately identifying COVID-19 cases. For normal subjects, the model achieved a precision of 92.10%, recall of 94.84%, and F1-score of 93.45%, suggesting reasonable performance in identifying normal cases. For the viral class, as for the bacterial class, the model showed excellent performance, with a precision, recall, and F1-score of 97.25%, 98.41%, and 97.83%, respectively.
Table 2. Multiclass COVID-19 detection by utilizing ESN: Extreme Smart Network without MeanDropout feature space.
| Class | Precision | Recall | F1-Score | Support |
|---|---|---|---|---|
| Bacterial | 99.20% | 98.21% | 98.70% | 504 |
| COVID-19 | 93.66% | 90.69% | 92.15% | 505 |
| Normal | 92.10% | 94.84% | 93.45% | 504 |
| Viral | 97.25% | 98.41% | 97.83% | 504 |
| Accuracy | | | 95.54% | 2017 |
| Macro Avg | 95.55% | 95.54% | 95.53% | 2017 |
| Weighted Avg | 95.55% | 95.54% | 95.53% | 2017 |
Fig 8 presents the confusion matrix distinguishing the four classes obtained with the proposed ESN-MDFS COVID model.
Table 3 reports the multiclass classification performance of the ESN-MDFS COVID model. The model achieved an accuracy of 96.18% in classifying images into four classes (Bacterial, COVID-19, Normal, and Viral), meaning that it correctly classified 96.18% of the images in the dataset. For the bacterial and viral classes, the model demonstrated exceptional performance, with precision, recall, and F1-score values close to 98% for both, suggesting that it is highly effective in correctly identifying bacterial and viral infections. For the COVID-19 and normal classes, performance is also good but slightly lower, with precision, recall, and F1-score values around 94% for both, indicating room for improvement in accurately differentiating between these two classes. Overall, the results suggest that the ESN-MDFS model is a promising approach for classifying lung conditions from medical images: it achieves high accuracy and performs well across all four classes, outperforming the model presented in Table 2.
Table 3. Multiclass COVID-19 detection by utilizing ESN-MDFS: Extreme Smart Network using mean dropout feature selection technique.
| Class | Precision | Recall | F1-Score | Support |
|---|---|---|---|---|
| Bacterial | 98.03% | 98.81% | 98.42% | 504 |
| COVID-19 | 94.34% | 92.48% | 93.40% | 505 |
| Normal | 94.27% | 94.64% | 94.46% | 504 |
| Viral | 98.03% | 98.81% | 98.42% | 504 |
| Accuracy | | | 96.18% | 2017 |
| Macro Avg | 96.17% | 96.18% | 96.17% | 2017 |
| Weighted Avg | 96.17% | 96.17% | 96.17% | 2017 |
Fig 9 shows the accuracy-loss graph for multi-class COVID-19 detection over 150 epochs using the proposed ESN-MDFS COVID model. The highest AUC of 0.99 was obtained for detecting bacterial and viral pneumonia, followed by an AUC of 0.96 for the normal class and 0.95 for COVID-19 in the multiclass setting.
Fig 10 illustrates the accuracy of the ESN-MDFS model across seven cross-validation folds. The model achieved a mean accuracy of 95.57% with a standard deviation of 0.54, demonstrating consistent performance across different data subsets.
Table 4 presents a comparison of the proposed ESN-MDFS model with several existing lightweight models for multiclass COVID-19 detection, focusing on model size and accuracy. There is a significant difference in model sizes, with the proposed ESN-MDFS being far smaller (889 KB) than the other models, and it also outperforms all the compared lightweight models in accuracy (96.18%). Regarding the trade-off between size and accuracy, the full (non-lightweight) trained deep learning model achieves comparable accuracy but demands significantly more storage and computational resources, whereas lightweight baselines such as EfficientNetV2L, ConvNeXtTiny, and MobileNetV2 sacrifice considerable accuracy. The ESN-MDFS model therefore demonstrates a compelling balance between model size and accuracy, making it a strong candidate for deployment on resource-constrained devices.
Table 4. Comparison of Multiclass COVID-19 detection by utilizing ESN-MDFS with existing Lightweight models.
| S# | Model | Size | Accuracy |
|---|---|---|---|
| 1 | DL trained model (non-lightweight) | 616 MB | 95.54% |
| 2 | EfficientNetV2L | 455 MB | 73.00% |
| 3 | ConvNeXtTiny | 108 MB | 72.00% |
| 4 | MobileNetV2 | 12 MB | 67.00% |
| 5 | Proposed ESN-MDFS | 889 KB | 96.18% |
This study identified redundant information within the static features, necessitating parameter reduction. The high dimensionality of GLCM-extracted texture features, especially when considering multiple distances and angles, posed a significant challenge. To address this, we introduced ESN-MDFS, which employs Mean Dropout to refine the hybrid feature space (HFS) by eliminating less informative features. This approach yielded a substantial reduction in model size, resulting in a remarkable accuracy of 96.18%. The resulting lightweight model prioritizes performance and efficiency, making it ideal for resource-constrained edge devices. Reduced storage requirements, faster computation, and lower power consumption are key advantages of this compact architecture.
4. Discussions
The COVID-19 pandemic has led to a global health crisis, marked by millions of confirmed cases and substantial mortality rates. While numerous studies have explored the application of Convolutional Neural Networks (CNNs) for COVID-19 classification using chest X-rays and CT scans, most research has been limited to binary comparisons, differentiating COVID-19 from pneumonia or normal conditions. This binary approach falls short of the diagnostic complexities often inherent in infectious diseases.
Table 5 presents a comparative analysis of several studies focused on COVID-19 lung infection detection using AI-based methods. The comparison encompasses key factors such as modality (X-ray or CT), dataset size, methodology, and performance metrics. Regarding modality, most studies utilized X-ray imaging, except for Ying et al. and Pratiwi et al., which employed CT scans. Dataset sizes vary significantly across studies, ranging from relatively small datasets (Sethy et al.) to larger ones (Ying et al. and this study). Regarding methodology, a diverse range of methods was employed, including CNNs, ResNet50 with SVM, DRE-Net, texture features with machine learning, and the proposed ESN-MDFS. In terms of performance, the proposed ESN-MDFS method achieved the highest accuracy (96.18%) among the compared studies, surpassing the other methods in overall classification performance. Ghoshal et al. and Sethy et al. used smaller datasets and simpler models, resulting in lower accuracy than the proposed method. Ying et al. used CT scans, which provide more detailed information than X-rays, but still achieved lower accuracy than the proposed method. Hussain et al. focused mainly on two-class classification, with a much lower four-class accuracy, while this study addressed a more complex four-class classification problem and achieved higher accuracy.
Table 5. Comparison of AI-assisted recent studies for COVID-19 lung infection.
| Authors | Modality | Subjects | Method | Performance |
|---|---|---|---|---|
| Ghoshal et al. [69] | X-ray | 90 COVID-19 images and other conditions | CNN | 92.9% (Acc.) |
| Sethy et al. [70] | X-ray | COVID-19 and normal, 25 images | ResNet50 and SVM | 95.33% (Acc.) |
| Ying et al. [71] | CT | 777 COVID-19 images and 708 normal images | DRE-Net | 86% (Acc.) |
| Hussain et al. [72] | X-ray | 145 COVID-19, bacterial and viral images and 138 normal | Texture features using machine learning: (i) COVID-19 vs normal, (ii) COVID-19 vs viral pneumonia, (iii) COVID-19 vs bacterial pneumonia, (iv) four-class (COVID-19, bacterial, viral, normal) | 100%, 97.56%, 97.44%, 79.52% (Acc.) |
| Pratiwi et al. [73] | CT | COVID (1251), non-COVID (1229) | Two-class deep learning (VGG-16) | 88.54% (Acc.) |
| This study | X-ray | COVID-19 (N = 1525), non-COVID-19 normal (N = 1525), viral pneumonia (N = 1342), bacterial pneumonia (N = 2521); N = 2521 per class after augmentation | Four-class (normal, bacterial pneumonia, viral pneumonia, COVID-19) using the ESN-MDFS approach | 96.18% (Acc.), AUC of 0.99 |
The primary outcome measured in this study [73] is the accuracy of COVID-19 detection using CT-scan images and various preprocessing methods. The main findings of this study are that different preprocessing methods, including resizing, enhancement, and normalization, had an impact on the accuracy of COVID-19 classification using a deep learning model (VGG-16), and the highest accuracy of 88.54% was achieved using a combination of deformed resizing, CLAHE enhancement, and normalization to the range of [0 1] and [-1 1].
The proposed ESN-MDFS model surpasses existing methods in accurately classifying multiclass COVID-19 infections. By integrating Mean Dropout Feature Selection, the model effectively balances performance and computational efficiency. Leveraging X-ray imaging, the model effectively differentiates between COVID-19, bacterial pneumonia, viral pneumonia, and normal conditions. Demonstrating exceptional accuracy across various lung infection types, the model’s compact size makes it suitable for resource-constrained environments. Although not explicitly evaluated, the model’s strong performance suggests potential adaptability to diverse datasets and clinical contexts.
5. Conclusion
Our proposed ESN-MDFS model significantly advances COVID-19 detection by accurately differentiating chest X-rays into four categories: COVID-19, bacterial pneumonia, viral pneumonia, and normal. This multi-class classification system has the potential to revolutionize patient care by streamlining clinical workflows, enabling early diagnosis, optimizing patient triage, and facilitating disease progression monitoring. These capabilities position ESN-MDFS as a critical tool in combating COVID-19. This innovative approach maintains high model accuracy while drastically reducing memory footprint, making it suitable for resource-constrained edge devices. Deploying this optimized model enables real-time, point-of-care lung condition detection, eliminating the need for centralized servers. Clinicians can benefit from immediate diagnostic insights, facilitating faster treatment decisions and improved patient outcomes. Beyond accuracy, this approach streamlines workflows by automating chest X-ray analysis, reducing diagnostic turnaround times, and enhancing overall efficiency. Early disease detection, particularly for conditions like COVID-19, is facilitated by the model’s improved sensitivity and accuracy. Moreover, the ability to differentiate between various pneumonia types enables effective patient triage and resource allocation. By tracking changes in chest X-ray features over time, clinicians can gain valuable insights into disease progression and tailor treatment strategies accordingly.
5.1. Limitations and future directions
While the ESN-MDFS model demonstrates promising results, several limitations and opportunities for improvement exist. The model’s performance is influenced by dataset quality, diversity, and image acquisition protocols. Although GLCM features enhance performance, manual feature engineering is time-consuming. Additionally, the model’s black-box nature hinders interpretability and clinical adoption. To address these challenges, future research should focus on expanding the dataset, automating feature extraction, improving model interpretability, incorporating additional data modalities, optimizing for real-time performance, and conducting rigorous benchmarking. By pursuing these directions, the model’s potential can be fully realized.
Acknowledgments
Ashit Kumar Dutta would like to express sincere gratitude to AlMaarefa University, Riyadh, Saudi Arabia, for providing funding to conduct this research.
Abbreviations
- CXRs
chest x-rays
- ESN-MDFS
Extreme Smart Network using Mean Dropout Feature Selection Technique
- RT-PCR
Real-Time reverse transcription-Polymerase Chain Reaction
- CT
Computed Tomography
- CNNs
Convolutional Neural Networks
- GAN
Generative Adversarial Networks
- DL
Deep learning
- GLCM
Grey Level Co-occurrence Matrix
- CLs
convolutional layers
- FLs
fully connected layers
- ILSVRC
ImageNet Large Scale Visual Recognition Challenge
- VGG
Visual Geometry Group
- ReLU
Rectified Linear Unit
- ROC
Receiver Operating characteristic
- AUC
Area Under the Curve
Data Availability
"The datasets used in this study are publicly available. The COVID-19 chest X-ray (CXR) images were obtained from Cohen et al. via GitHub (https://github.com/ieee8023/covid-chestxray-dataset). Additional CXR images were sourced from Radiopaedia (https://radiopaedia.org/), The Cancer Imaging Archive (TCIA) (https://www.cancerimagingarchive.net/), and SIRM (https://www.sirm.org/category/senza-categoria/covid-19/ & https://sirm.org/?s=COVID-19). The pneumonia CXR images (N = 3863) and normal (healthy) CXR images were acquired from the Kaggle repository (https://www.kaggle.com/paultimothymooney/chestxray)."
Funding Statement
Ashit Kumar Dutta would like to express sincere gratitude to AlMaarefa University, Riyadh, Saudi Arabia, for providing funding to conduct this research.
References
- 1. Kewedar S. M. R. and Abulamoun K. A. A., “The impact of COVID-19 on global health and other aspects of human life,” J Exp Clin Med, vol. 39, no. 2, pp. 536–547, Mar. 2022, doi: 10.52142/omujecm.39.2.45
- 2. Dadkhah M., Talei S., Doostkamel D., Molaei S., and Rezaei N., “The impact of COVID-19 on diagnostic biomarkers in neuropsychiatric and neuroimmunological diseases: a review,” Rev Neurosci, vol. 33, no. 1, pp. 79–92, Jan. 2022, doi: 10.1515/revneuro-2020-0154
- 3. Zaim S., Chong J. H., Sankaranarayanan V., and Harky A., “COVID-19 and Multiorgan Response,” Curr Probl Cardiol, vol. 45, no. 8, p. 100618, Aug. 2020, doi: 10.1016/j.cpcardiol.2020.100618
- 4. Pelicioni P. H. S. and Lord S. R., “COVID-19 will severely impact older people’s lives, and in many more ways than you think!,” Braz J Phys Ther, vol. 24, no. 4, pp. 293–294, Jul. 2020, doi: 10.1016/j.bjpt.2020.04.005
- 5. Constantinou M., Exarchos T., Vrahatis A. G., and Vlamos P., “COVID-19 Classification on Chest X-ray Images Using Deep Learning Methods,” Int J Environ Res Public Health, vol. 20, no. 3, p. 2035, Jan. 2023, doi: 10.3390/ijerph20032035
- 6. Kanavos A., Papadimitriou O., and Maragoudakis M., “Enhancing COVID-19 Diagnosis from Chest X-Ray Images Using Deep Convolutional Neural Networks,” in 2023 18th International Workshop on Semantic and Social Media Adaptation & Personalization (SMAP 2023), IEEE, Sep. 2023, pp. 1–6, doi: 10.1109/SMAP59435.2023.10255200
- 7. Ghosh S. and Chatterjee A., “Automated COVID-19 CT Image Classification using Multi-head Channel Attention in Deep CNN,” Jul. 2023.
- 8. Pachot A. and Patissier C., “Towards Sustainable Artificial Intelligence: An Overview of Environmental Protection Uses and Issues,” Green and Low-Carbon Economy, Feb. 2023, doi: 10.47852/bonviewGLCE3202608
- 9. Ouni R. and Alhichri H., “Cross-dataset domain adaptation for the classification of COVID-19 using chest computed tomography images,” Nov. 2023, arXiv:2311.08524
- 10. Zhou T., Liu F., Lu H., Peng C., and Ye X., “A Review of Deep Learning Imaging Diagnostic Methods for COVID-19,” Electronics (Basel), vol. 12, no. 5, p. 1167, Feb. 2023, doi: 10.3390/electronics12051167
- 11. Lin L. et al., “Robust COVID-19 Detection in CT Images with CLIP,” Mar. 2024.
- 12. Garg A., Alag S., and Duncan D., “CoSev: Data-Driven Optimizations for COVID-19 Severity Assessment in Low-Sample Regimes,” Diagnostics, vol. 14, no. 3, p. 337, Feb. 2024, doi: 10.3390/diagnostics14030337
- 13. Irsyad A., Tjandrasa H., and Hidayati S. C., “Deep Learning Approach for Segmentation and Classification of COVID-19 in Lung CT Scan Images,” in 2023 IEEE 7th International Conference on Information Technology, Information Systems and Electrical Engineering (ICITISEE), IEEE, Nov. 2023, pp. 202–207, doi: 10.1109/ICITISEE58992.2023.10404485
- 14. Yamathi L., Rani K. S., and Krishna P. V., “Deep Feature Wise Attention Based Convolutional Neural Network for Covid-19 Detection Using Lung CT Scan Images,” Journal of Applied Engineering and Technological Science (JAETS), vol. 4, no. 2, pp. 1057–1075, Jun. 2023, doi: 10.37385/jaets.v4i2.2163
- 15. Ali Z. et al., “A deep learning-based x-ray imaging diagnosis system for classification of tuberculosis, COVID-19, and pneumonia traits using evolutionary algorithm,” Int J Imaging Syst Technol, vol. 34, no. 1, Jan. 2024, doi: 10.1002/ima.23014
- 16. Ali M. U. et al., “Deep learning network selection and optimized information fusion for enhanced COVID-19 detection,” Int J Imaging Syst Technol, vol. 34, no. 2, Mar. 2024, doi: 10.1002/ima.23001
- 17. Albatoul A. et al., “COVID-19 detection and classification: key AI challenges and recommendations for the way forward,” Journal of Pulmonology and Respiratory Research, vol. 7, no. 1, pp. 010–014, May 2023, doi: 10.29328/journal.jprr.1001044
- 18. Pandey S. D., Sharma G., Chauhan A., and Varshney S., “COVID-19 Detection using Deep Learning,” International Journal of Advanced Research in Science, Communication and Technology, pp. 154–164, Apr. 2023, doi: 10.48175/IJARSCT-9489
- 19. Fedoruk O., Klimaszewski K., Ogonowski A., and Możdżonek R., “Performance of GAN-based augmentation for deep learning COVID-19 image classification,” 2024, p. 030001, doi: 10.1063/5.0203379
- 20. Morani K. and Unay D., “Deep learning-based automated COVID-19 classification from computed tomography images,” Comput Methods Biomech Biomed Eng Imaging Vis, vol. 11, no. 6, pp. 2145–2160, Nov. 2023, doi: 10.1080/21681163.2023.2219765
- 21. Giacomini D., Hashem M. B., Suarez J., and Trivedi A. R., “Towards Model-Size Agnostic, Compute-Free, Memorization-based Inference of Deep Learning,” in 2024 37th International Conference on VLSI Design and 2024 23rd International Conference on Embedded Systems (VLSID), IEEE, Jan. 2024, pp. 180–185, doi: 10.1109/VLSID60093.2024.00036
- 22. Zhao Z., “The effect of input size on the accuracy of a convolutional neural network performing brain tumor detection,” in International Conference on Mechatronics Engineering and Artificial Intelligence (MEAI 2022), Zhao C., Ed., SPIE, Mar. 2023, p. 85, doi: 10.1117/12.2672694
- 23. Kerley C. I. et al., “Batch size go big or go home: counterintuitive improvement in medical autoencoders with smaller batch size,” in Medical Imaging 2023: Image Processing, I. Išgum and O. Colliot, Eds., SPIE, Apr. 2023, p. 17, doi: 10.1117/12.2653643
- 24. Nagrecha K., “Systems for Parallel and Distributed Large-Model Deep Learning Training,” Jan. 2023.
- 25. Hsu F.-S. et al., “Lightweight Deep Neural Network Embedded with Stochastic Variational Inference Loss Function for Fast Detection of Human Postures,” Entropy, vol. 25, no. 2, p. 336, Feb. 2023, doi: 10.3390/e25020336
- 26. Sharma P. et al., “A lightweight deep learning model for automatic segmentation and analysis of ophthalmic images,” Sci Rep, vol. 12, no. 1, p. 8508, May 2022, doi: 10.1038/s41598-022-12486-w
- 27. Kim K., Jang S.-J., Park J., Lee E., and Lee S.-S., “Lightweight and Energy-Efficient Deep Learning Accelerator for Real-Time Object Detection on Edge Devices,” Sensors, vol. 23, no. 3, p. 1185, Jan. 2023, doi: 10.3390/s23031185
- 28. Khan M. A., Menouar H., and Hamila R., “LCDnet: a lightweight crowd density estimation model for real-time video surveillance,” J Real Time Image Process, vol. 20, no. 2, p. 29, Apr. 2023, doi: 10.1007/s11554-023-01286-8
- 29. Sandler M., Howard A., Zhu M., Zhmoginov A., and Chen L.-C., “MobileNetV2: Inverted Residuals and Linear Bottlenecks,” in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, IEEE, Jun. 2018, pp. 4510–4520, doi: 10.1109/CVPR.2018.00474
- 30. Zhang X., Zhou X., Lin M., and Sun J., “ShuffleNet: An Extremely Efficient Convolutional Neural Network for Mobile Devices,” in 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, IEEE, Jun. 2018, pp. 6848–6856, doi: 10.1109/CVPR.2018.00716
- 31. Roberts M. et al., “Common pitfalls and recommendations for using machine learning to detect and prognosticate for COVID-19 using chest radiographs and CT scans,” Nat Mach Intell, vol. 3, no. 3, pp. 199–217, Mar. 2021, doi: 10.1038/s42256-021-00307-0
- 32. Guo X., Pimentel A. D., and Stefanov T., “Automated Exploration and Implementation of Distributed CNN Inference at the Edge,” IEEE Internet Things J, vol. 10, no. 7, pp. 5843–5858, Apr. 2023, doi: 10.1109/JIOT.2023.3237572
- 33. Wang C.-H., Huang K.-Y., Yao Y., Chen J.-C., Shuai H.-H., and Cheng W.-H., “Lightweight Deep Learning: An Overview,” IEEE Consumer Electronics Magazine, pp. 1–12, 2022, doi: 10.1109/MCE.2022.3181759
- 34. Yuan X. et al., “Cycle Performance of Aerated Lightweight Concrete Windowed and Windowless Wall Panel from the Perspective of Lightweight Deep Learning,” Comput Intell Neurosci, vol. 2022, pp. 1–14, Jun. 2022, doi: 10.1155/2022/3968607
- 35. Lou L., Liang H., and Wang Z., “Deep-Learning-Based COVID-19 Diagnosis and Implementation in Embedded Edge-Computing Device,” Diagnostics, vol. 13, no. 7, p. 1329, Apr. 2023, doi: 10.3390/diagnostics13071329
- 36. Hassan M. M., AlRakhami M. S., Alabrah A. A., and AlQahtani S. A., “An Intelligent Edge-as-a-Service Framework to Combat COVID-19 Using Deep Learning Techniques,” Mathematics, vol. 11, no. 5, p. 1216, Mar. 2023, doi: 10.3390/math11051216
- 37. Ukwandu O., Hindy H., and Ukwandu E., “An Evaluation of Lightweight Deep Learning Techniques in Medical Imaging for High Precision COVID-19 Diagnostics,” May 2023.
- 38. Zhang Z., Ma Y., and Li K., “An improved X-ray image diagnosis method for COVID-19 pneumonia on a lightweight neural network embedded device,” in Proceedings of the 2023 3rd International Conference on Bioinformatics and Intelligent Computing, New York, NY, USA: ACM, Feb. 2023, pp. 351–357, doi: 10.1145/3592686.3592749
- 39. Morani K. and Unay D., “Deep learning-based automated COVID-19 classification from computed tomography images,” Comput Methods Biomech Biomed Eng Imaging Vis, vol. 11, no. 6, pp. 2145–2160, Nov. 2023, doi: 10.1080/21681163.2023.2219765
- 40. Hassantabar S. et al., “CovidDeep: SARS-CoV-2/COVID-19 Test Based on Wearable Medical Sensors and Efficient Neural Networks,” IEEE Transactions on Consumer Electronics, vol. 67, no. 4, pp. 244–256, Nov. 2021, doi: 10.1109/TCE.2021.3130228
- 41. Goel T., Murugan R., Mirjalili S., and Chakrabartty D. K., “OptCoNet: an optimized convolutional neural network for an automatic diagnosis of COVID-19,” Applied Intelligence, vol. 51, no. 3, pp. 1351–1366, Mar. 2021, doi: 10.1007/s10489-020-01904-z
- 42. Narin A., Kaya C., and Pamuk Z., “Automatic detection of coronavirus disease (COVID-19) using X-ray images and deep convolutional neural networks,” Pattern Analysis and Applications, May 2021, doi: 10.1007/s10044-021-00984-y
- 43. Ibrahim D. M., Elshennawy N. M., and Sarhan A. M., “Deep-chest: Multi-classification deep learning model for diagnosing COVID-19, pneumonia, and lung cancer chest diseases,” Comput Biol Med, vol. 132, p. 104348, May 2021, doi: 10.1016/j.compbiomed.2021.104348
- 44. Cohen J. P., Morrison P., Dao L., Roth K., Duong T. Q., and Ghassemi M., “COVID-19 Image Data Collection: Prospective Predictions Are the Future,” Jun. 2020.
- 45. Shorten C. and Khoshgoftaar T. M., “A survey on Image Data Augmentation for Deep Learning,” J Big Data, vol. 6, no. 1, p. 60, Dec. 2019, doi: 10.1186/s40537-019-0197-0
- 46. Hashemzadeh M., Asheghi B., and Farajzadeh N., “Content-aware image resizing: An improved and shadow-preserving seam carving method,” Signal Processing, vol. 155, pp. 233–246, Feb. 2019, doi: 10.1016/j.sigpro.2018.09.037
- 47. Haralick R. M., Shanmugam K., and Dinstein I., “Textural Features for Image Classification,” IEEE Transactions on Systems, Man, and Cybernetics, pp. 610–621, Nov. 1973, doi: 10.1109/TSMC.1973.4309314
- 48. James J., Heddallikar A., Choudhari P., and Chopde S., “Analysis of Features in SAR Imagery Using GLCM Segmentation Algorithm,” 2021, pp. 253–266, doi: 10.1007/978-981-16-1681-5_16
- 49. Giraddi S., Pujari J., and Seeri S., “Role of GLCM Features in Identifying Abnormalities in the Retinal Images,” International Journal of Image, Graphics and Signal Processing, vol. 7, no. 6, pp. 45–51, May 2015, doi: 10.5815/ijigsp.2015.06.06
- 50. Soh L. and Tsatsoulis C., “Texture Analysis of SAR Sea Ice Imagery,” IEEE Transactions on Geoscience and Remote Sensing, vol. 37, no. 2, pp. 780–795, 1999, doi: 10.1109/36.752194
- 51. Berbar M. A., “Hybrid methods for feature extraction for breast masses classification,” Egyptian Informatics Journal, pp. 1–11, 2017, doi: 10.1016/j.eij.2017.08.001
- 52. Guru D. S. and Manjunath S., “Texture Features and KNN in Classification of Flower Images,” 2010.
- 53. Nithya R., “Comparative study on feature extraction method for breast cancer classification,” Journal of Theoretical and Applied Information Technology, vol. 33, no. 2, pp. 220–226, 2011.
- 54. Parvez A., “Feature Computation using CUDA Platform,” International Conference on Trends in Electronics and Informatics, vol. 9, no. 17, pp. 296–300, 2017.
- 55. Rathore S., “Automatic Colon Cancer Detection and Classification,” 2018, doi: 10.13140/RG.2.2.26988.05765
- 56. Amrit G. and Singh P., “Performance analysis of various machine learning-based approaches for detection and classification of lung cancer in humans,” 2018.
- 57. Chen T. and Guestrin C., “XGBoost: A Scalable Tree Boosting System,” in Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, NY, USA: ACM, Aug. 2016, pp. 785–794, doi: 10.1145/2939672.2939785
- 58. Friedman J. H., “Greedy function approximation: A gradient boosting machine,” Ann Stat, vol. 29, no. 5, pp. 1189–1232, 2001, doi: 10.2307/2699986
- 59. Bukowski M., Kurek J., Świderski B., and Jegorowa A., “Custom Loss Functions in XGBoost Algorithm for Enhanced Critical Error Mitigation in Drill-Wear Analysis of Melamine-Faced Chipboard,” Sensors, vol. 24, no. 4, p. 1092, Feb. 2024, doi: 10.3390/s24041092
- 60. Sharma N., Anju, and Juneja A., “Extreme Gradient Boosting with Squared Logistic Loss Function,” 2019, pp. 313–322, doi: 10.1007/978-981-13-0923-6_27
- 61. Guang Y., “Generalized XGBoost Method,” Sep. 2021.
- 62. Abhinaya P., Reddy C. K. K., Ranjan A., and Ozer O., “Explicit Monitoring and Prediction of Hailstorms With XGBoost Classifier for Sustainability,” 2024, pp. 107–132, doi: 10.4018/979-8-3693-3896-4.ch006
- 63. Garg K., Gill K. S., Malhotra S., Devliyal S., and Sunil G., “Implementing the XGBOOST Classifier for Bankruptcy Detection and Smote Analysis for Balancing Its Data,” in 2024 2nd International Conference on Computer, Communication and Control (IC4), IEEE, Feb. 2024, pp. 1–5, doi: 10.1109/IC457434.2024.10486274
- 64. Hakkal S. and Lahcen A. A., “XGBoost To Enhance Learner Performance Prediction,” Computers and Education: Artificial Intelligence, vol. 7, p. 100254, Dec. 2024, doi: 10.1016/j.caeai.2024.100254
- 65. Dhanya L. and Chitra R., “A novel autoencoder based feature independent GA optimised XGBoost classifier for IoMT malware detection,” Expert Syst Appl, vol. 237, p. 121618, Mar. 2024, doi: 10.1016/j.eswa.2023.121618
- 66. Jalil Z. et al., “COVID-19 Related Sentiment Analysis Using State-of-the-Art Machine Learning and Deep Learning Techniques,” Front Public Health, vol. 9, Jan. 2022, doi: 10.3389/fpubh.2021.812735
- 67. Hussain L. et al., “Prostate cancer detection using machine learning techniques by employing combination of features extracting strategies,” Cancer Biomarkers, vol. 21, no. 2, pp. 393–413, Feb. 2018, doi: 10.3233/CBM-170643
- 68. Rathore S., Hussain M., and Khan A., “Automated colon cancer detection using hybrid of novel geometric features and some traditional features,” Comput Biol Med, vol. 65, pp. 279–296, Oct. 2015, doi: 10.1016/j.compbiomed.2015.03.004
- 69. Ghoshal B. and Tucker A., “Estimating Uncertainty and Interpretability in Deep Learning for Coronavirus (COVID-19) Detection,” Mar. 2020.
- 70. Sethy P. K., “Detection of coronavirus Disease (COVID-19) based on Deep Features and Support Vector Machine,” Apr. 2020.
- 71. “Deep learning Enables Accurate Diagnosis of Novel Coronavirus,” 2020.
- 72. Hussain L. et al., “Machine-learning classification of texture features of portable chest X-ray accurately classifies COVID-19 lung infection,” Biomed Eng Online, vol. 19, no. 1, p. 88, Dec. 2020, doi: 10.1186/s12938-020-00831-x
- 73. Pratiwi N. G., Nabila Y., Fiqraini R., and Setiawan A. W., “Effect of CT-Scan Image Resizing, Enhancement and Normalization on Accuracy of Covid-19 Detection,” in 2021 International Seminar on Intelligent Technology and Its Applications (ISITIA), IEEE, Jul. 2021, pp. 17–22, doi: 10.1109/ISITIA52817.2021.9502217